AI RESEARCH

Routers Learn the Geometry of Their Experts: Geometric Coupling in Sparse Mixture-of-Experts

arXiv cs.LG

arXiv:2605.12476v1 Announce Type: new

Sparse Mixture-of-Experts (SMoE) models enable scaling language models efficiently, but