AI RESEARCH
Grokking as Structural Inference: Transformers Need Bayesian Lottery Tickets
arXiv CS.AI
•
ArXi:2605.15787v1 Announce Type: cross Why does a Transformer that has memorized its