AI RESEARCH

When Symbol Names Should Not Matter: A Logistic Theory of Fresh-Symbol Classification

arXiv CS.LG

ArXi:2605.07120v1 Announce Type: new Template tasks have emerged as a clean testbed for asking whether transformers reason with abstract symbols rather than concrete token names. We study the fixed-label classification version of this problem, where train and test examples share latent templates but may use disjoint vocabularies. Unlike next-token prediction, the model need not emit unseen symbols; it must learn a decision rule invariant to symbol renaming. We analyze regularized kernel logistic classification in the transformer-kernel regime.