AI RESEARCH
Can Small Training Runs Reliably Guide Data Curation? Rethinking Proxy-Model Practice
arXiv CS.AI
•
ArXi:2512.24503v2 Announce Type: replace-cross Data teams at frontier AI companies routinely train small proxy models to make critical decisions about pre