AI RESEARCH
Variational Deep Learning via Implicit Regularization
arXiv CS.AI
•
ArXi:2505.20235v3 Announce Type: replace-cross Modern deep learning models generalize remarkably well in-distribution, despite being overparametrized and trained with little to no explicit regularization. Instead, current theory credits implicit regularization imposed by the choice of architecture, hyperparameters, and optimization procedure. However, deep neural networks can be surprisingly non-robust, resulting in overconfident predictions and poor out-of-distribution generalization.