AI RESEARCH

Optimizer-Induced Low-Dimensional Drift and Transverse Dynamics in Transformer Training

arXiv CS.AI

ArXi:2602.23696v3 Announce Type: replace-cross We analyze cumulative parameter trajectories of transformer