I thought Gemini was supposed to be the long context king?

Just saw this MRCR v2 benchmark and Gemini 3.1 Pro drops from 71.9% at 128K all the way to 25.9% at 1M tokens. Meanwhile Claude Opus holds at 78.3%. Turns out having a big context window and actually being able to USE it are two very different things.

submitted by /u/Additional-Alps-8209