AI RESEARCH
MoE Routing Testbed: Studying Expert Specialization and Routing Behavior at Small Scale
arXiv CS.LG
•
ArXi:2604.07030v1 Announce Type: new Sparse Mixture-of-Experts (MoE) architectures are increasingly popular for frontier large language models (LLM) but they