AI RESEARCH
Geometric Asymmetry in MoE Specialization: Functional Decorrelation and Representational Overlap
arXiv cs.LG
arXiv:2605.16349v1 Announce Type: new Mixture-of-Experts (MoE) architectures achieve scalable capacity through sparse routing, yet the geometric structure of expert specialization remains poorly understood. We