AI RESEARCH

Rethinking Data Curation in LLM Training: Online Reweighting Offers Better Generalization than Offline Methods

arXiv CS.LG

ArXi:2605.05227v1 Announce Type: new Data curation is a critical yet under-explored area in large language model