AI RESEARCH

Selective Rollout: Mid-Trajectory Termination for Multi-Sample Agent RL

arXiv CS.LG

ArXi:2605.05802v1 Announce Type: new Group-relative RL