AI RESEARCH
Repeated-Token Counting Reveals a Dissociation Between Representations and Outputs
arXiv CS.LG
•
ArXi:2605.09239v1 Announce Type: cross Large language models fail at counting repeated tokens despite strong performance on broader reasoning benchmarks. These failures are commonly attributed to limitations in internal count tracking. We show this attribution is wrong. Linear probes on the residual stream decode the correct count with near-perfect accuracy at every post-embedding layer, across all model depths. This holds even at the exact layers where the wrong answer crystallizes while the model simultaneously outputs an incorrect count.