AI RESEARCH
Almost Bayesian: The Fractal Dynamics of Stochastic Gradient Descent
arXiv CS.AI
•
ArXi:2503.22478v2 Announce Type: replace-cross We show that the behavior of stochastic gradient descent is related to Bayesian statistics by showing that SGD is effectively diffusion on a fractal landscape, where the fractal dimension can be accounted for in a purely Bayesian way. By doing this we show that SGD can be regarded as a modified Bayesian sampler which accounts for accessibility constraints induced by the fractal structure of the loss landscape. We verify our results experimentally by examining the diffusion of weights during.