AI RESEARCH

Generate, Filter, Control, Replay: A Comprehensive Survey of Rollout Strategies for LLM Reinforcement Learning

arXiv CS.LG

ArXi:2605.02913v1 Announce Type: new Reinforcement learning (RL) has become a central post-