AI RESEARCH
Generate, Filter, Control, Replay: A Comprehensive Survey of Rollout Strategies for LLM Reinforcement Learning
arXiv CS.LG
•
ArXi:2605.02913v1 Announce Type: new Reinforcement learning (RL) has become a central post-