AI RESEARCH

OrScale: Orthogonalised Optimization with Layer-Wise Trust-Ratio Scaling

arXiv CS.LG

ArXi:2605.07815v1 Announce Type: new Muon improves neural-network