AI RESEARCH
Does Sparse Connectivity Improve Generalization? Convolutional Networks Below the Edge of Stability
arXiv CS.LG
•
ArXi:2603.04807v2 Announce Type: replace-cross Gradient descent on overparameterized neural networks typically operates at the Edge of Stability (EoS), where the largest Hessian eigenvalue hovers around a step-size-dependent threshold. We study how sparse connectivity changes generalization below this threshold in two-layer ReLU networks. Prior results have shown that for fully-connected networks (FCNs), generalization guarantees in this regime degrade and become vacuous on high-dimensional spherical inputs. Our analysis reveals that sparse connectivity fundamentally alters this picture.