AI RESEARCH

Geometric Asymmetry in MoE Specialization: Functional Decorrelation and Representational Overlap

arXiv CS.LG

arXiv:2605.16349v1 Announce Type: new Mixture-of-Experts (MoE) architectures achieve scalable capacity through sparse routing, yet the geometric structure of expert specialization remains poorly understood. We