AI RESEARCH
Grouter: Decoupling Routing from Representation for Accelerated MoE Training
arXiv cs.LG
arXiv:2603.06626v1 Announce Type: new Traditional Mixture-of-Experts (MoE)