AI RESEARCH
Routers Learn the Geometry of Their Experts: Geometric Coupling in Sparse Mixture-of-Experts
arXiv CS.LG
arXiv:2605.12476v1 Announce Type: new Sparse Mixture-of-Experts (SMoE) models enable scaling language models efficiently, but