AI RESEARCH

Why Grokking Takes So Long: A First-Principles Theory of Representational Phase Transitions

arXiv cs.AI

arXiv:2603.13331v1 (announce type: new). Grokking is the sudden generalization that appears long after a model has perfectly memorized its