AI RESEARCH

Grouter: Decoupling Routing from Representation for Accelerated MoE Training

arXiv CS.LG

arXiv:2603.06626v1 Announce Type: new Traditional Mixture-of-Experts (MoE)