AI RESEARCH

MoE Routing Testbed: Studying Expert Specialization and Routing Behavior at Small Scale

arXiv CS.LG

ArXi:2604.07030v1 Announce Type: new Sparse Mixture-of-Experts (MoE) architectures are increasingly popular for frontier large language models (LLM) but they