AI RESEARCH

ReLibra: Routing-Replay-Guided Load Balancing for MoE Training in Reinforcement Learning

arXiv CS.LG

ArXi:2605.08639v1 Announce Type: new Load imbalance is a long-standing challenge in Mixture-of-Experts (MoE)