ARC-AGI-3 offers $2M to any AI that matches untrained humans, yet every frontier model scores below 1%
The Decoder
•
AI Research
The new ARC-AGI-3 benchmark drops AI systems into interactive game environments that humans solve with ease. No frontier model breaks the 1% mark because the benchmark strips away their biggest advantages.