ProRL Agent: Rollout-as-a-Service for RL Training of Multi-Turn LLM Agents

ArXi:2603.18815v1 Announce Type: new Multi-turn LLM agents are increasingly important for solving complex, interactive tasks, and reinforcement learning (RL) is a key ingredient for improving their long-horizon behavior. However, RL