AI RESEARCH
Latent Algorithmic Structure Precedes Grokking: A Mechanistic Study of ReLU MLPs on Modular Arithmetic
arXiv cs.LG
arXiv:2603.23784v1 Announce Type: new
Grokking, the phenomenon where validation accuracy of neural networks on modular addition of two integers rises long after