AI RESEARCH
ReLibra: Routing-Replay-Guided Load Balancing for MoE Training in Reinforcement Learning
arXiv CS.LG
•
ArXi:2605.08639v1 Announce Type: new Load imbalance is a long-standing challenge in Mixture-of-Experts (MoE)