AI RESEARCH

Task-Conditioned Routing Signatures in Sparse Mixture-of-Experts Transformers

arXiv CS.AI

ArXi:2603.11114v1 Announce Type: cross Sparse Mixture-of-Experts (MoE) architectures enable efficient scaling of large language models through conditional computation, yet the routing mechanisms responsible for expert selection remain poorly understood. In this work, we