AI RESEARCH
Large Spikes in Stochastic Gradient Descent: A Large-Deviations View
arXiv CS.LG
•
ArXi:2603.10079v1 Announce Type: new