AI RESEARCH
Why Grokking Takes So Long: A First-Principles Theory of Representational Phase Transitions
arXiv CS.AI
•
ArXi:2603.13331v1 Announce Type: new Grokking is the sudden generalization that appears long after a model has perfectly memorized its