AI RESEARCH
Task-Conditioned Routing Signatures in Sparse Mixture-of-Experts Transformers
arXiv CS.AI
•
ArXi:2603.11114v1 Announce Type: cross Sparse Mixture-of-Experts (MoE) architectures enable efficient scaling of large language models through conditional computation, yet the routing mechanisms responsible for expert selection remain poorly understood. In this work, we