AI RESEARCH

Muon Dynamics as a Spectral Wasserstein Flow

arXiv CS.AI

ArXi:2604.04891v1 Announce Type: cross Gradient normalization is central in deep-learning optimization because it stabilizes