How Cursor Trains Agentic Models with RL

Towards AI

Composer 2 is a frontier-level model specialized entirely for agentic software engineering. Instead of simply answering isolated chat queries, the model navigates entire repositories, runs shell commands, edits files, and interacts with development environments to solve complex user intents.

[Figure: CursorBench Performance]

Building a model that succeeds in this open-ended domain requires moving beyond standard public benchmarks. Cursor’s approach relies on a core principle: Minimize train-test mismatch by.