AI RESEARCH

Dynamical structure of vanishing gradient and overfitting in multi-layer perceptrons

arXiv cs.LG

arXiv:2604.02393v1 Announce Type: new Vanishing gradient and overfitting are two of the most extensively studied problems in the machine learning literature. However, they are frequently considered in some asymptotic setting, which obscures the underlying dynamical mechanisms responsible for their emergence. In this paper, we aim to provide a clear dynamical description of learning in multi-layer perceptrons. To this end, we