AI RESEARCH

A Triadic Suffix Tokenization Scheme for Numerical Reasoning

arXiv CS.AI

ArXi:2604.11582v1 Announce Type: cross Standard subword tokenization methods fragment numbers inconsistently, causing large language models (LLMs) to lose positional and decimal structure - a primary driver of errors in arithmetic and scientific reasoning. We