AI RESEARCH
Selective Rollout: Mid-Trajectory Termination for Multi-Sample Agent RL
arXiv CS.LG
•
ArXi:2605.05802v1 Announce Type: new Group-relative RL