AI RESEARCH

Stochastic Gradient Descent in the Saddle-to-Saddle Regime of Deep Linear Networks

arXiv CS.LG

ArXi:2604.06366v1 Announce Type: new Deep linear networks (DLNs) are used as an analytically tractable model of the