AI RESEARCH

Customizing multiturn AI agents with reinforcement learning

Amazon Science • January 13, 2026

Leveraging existing environment simulators and reward functions based on verifiable ground truth boosts task success rate, even with small models and small

Read Full Article