AI RESEARCH
vLLM Semantic Router: Signal Driven Decision Routing for Mixture-of-Modality Models
arXiv CS.AI
•
ArXi:2603.04444v2 Announce Type: replace-cross As large language models (LLMs) diversify across modalities, capabilities, and cost profiles, the problem of intelligent request routing -- selecting the right model for each query at inference time -- has become a critical systems challenge. We present vLLM Semantic Router, a signal-driven decision routing framework for Mixture-of-Modality (MoM) model deployments.