AI RESEARCH
Rethinking Data Curation in LLM Training: Online Reweighting Offers Better Generalization than Offline Methods
arXiv cs.LG
arXiv:2605.05227v1 Announce Type: new Data curation is a critical yet under-explored area in large language model