AI RESEARCH
Scalable Prompt Routing via Fine-Grained Latent Task Discovery
arXiv CS.AI
•
ArXi:2603.19415v1 Announce Type: cross Prompt routing dynamically selects the most appropriate large language model from a pool of candidates for each query, optimizing performance while managing costs. As model pools scale to include dozens of frontier models with narrow performance gaps, existing approaches face significant challenges: manually defined task taxonomies cannot capture fine-grained capability distinctions, while monolithic routers struggle to differentiate subtle differences across diverse tasks.