AI RESEARCH
MUON+: Towards More Effective Muon via One Additional Normalization Step for LLM Pre-training
arXiv cs.LG
arXiv:2602.21545v3 Announce Type: replace Muon has recently emerged as a strong optimizer for large language model pre-