AI RESEARCH

MUON+: Towards More Effective Muon via One Additional Normalization Step for LLM Pre-training

arXiv cs.LG

arXiv:2602.21545v3 Announce Type: replace

Muon has recently emerged as a strong optimizer for large language model pre-