AI RESEARCH

Can RL Improve Generalization of LLM Agents? An Empirical Study

arXiv CS.AI

ArXi:2603.12011v1 Announce Type: new Reinforcement fine-tuning (RFT) has shown promise for