The AI Benchmark Where Simple Beats Smart

The results from ARC-AGI-3 landed this week, and everyone reported the same headline: frontier AI scored below 1%. GPT-5.4 got 0.26%. Claude Opus 4.6 got 0.25%. Gemini 3.1 Pro led the pack at a breathtaking 0.37%. Humans, meanwhile, solved 100% of the environments. That is a damning result. But it is not the most interesting one. The most interesting result is what scored higher than all of them. Simple CNN and graph-search algorithms - the kind of techniques that computer science students learn in their second year - reached 12.58.