AI RESEARCH
DataChef: Cooking Up Optimal Data Recipes for LLM Adaptation via Reinforcement Learning
arXiv CS.AI
•
ArXi:2602.11089v2 Announce Type: replace-cross In the current landscape of Large Language Models (LLMs), the curation of large-scale, high-quality