AI RESEARCH
Data Language Models: A New Foundation Model Class for Tabular Data
arXiv cs.AI
arXiv:2605.06290v1
Every major data modality now has a foundation model that understands it natively: text has language models, images have vision models, audio has audio models. Tabular data, the modality on which many consequential real-world AI decisions are made, does not. Every approach to tabular AI today, from gradient-boosted trees to the latest tabular foundation models, requires a preprocessing pipeline before the model can consume the data. None of them understands tabular data as a modality. We.