AI RESEARCH
Projected Autoregression: Autoregressive Language Generation in Continuous State Space
arXiv CS.AI
•
ArXi:2601.04854v3 Announce Type: replace-cross Standard autoregressive language models generate text by repeatedly selecting a discrete next token, coupling prediction with irreversible commitment at every step. We show that token selection is not the only viable autoregressive interface. \textbf{Projected Autoregression} replaces token selection with continuous prediction in embedding space followed by discrete projection at commitment time. The model predicts next-token vectors via regression and contrastive objectives, while discrete tokens arise only by nearest-neighbor projection.