AI RESEARCH

What do you think about Tabular Foundation Models [D]

r/MachineLearning

I've seen TabPFN-3's recent results, and there is a lot of buzz about foundation models for tabular data (TabICL, TabPFN). The performance that those models achieve is really amazing. What makes me a little suspicious about them? They can analyze small datasets only, so a few MB of data, and you need to have a large GPU machine and download a few GB of model to predict on a few MB of data. That doesn't sound rational. I really miss the old school approach of running a single decision tree or a linear model on the data.