AI RESEARCH
Can RL Improve Generalization of LLM Agents? An Empirical Study
arXiv cs.AI
arXiv:2603.12011v1 Announce Type: new Reinforcement fine-tuning (RFT) has shown promise for