ARC-AGI 3 Paper alleges that Gemini 3 (and other frontier models) intentionally or not “cheated” their ARC-AGI 1 and 2 scores through memorisation of similar benchmark tasks during training

r/singularity
Generative AI AI Research

Submitted by /u/Westbrooke117 [link] [comments]