AI RESEARCH
OrScale: Orthogonalised Optimization with Layer-Wise Trust-Ratio Scaling
arXiv CS.LG
•
ArXi:2605.07815v1 Announce Type: new Muon improves neural-network