Anthropic's Claude Opus 4.6 saw through an AI test, cracked the encryption, and grabbed the answers itself

The Decoder
Generative AI LLMs AI Research

Anthropic's Claude Opus 4.6 independently figured out it was being tested during a benchmark, identified the specific test, and cracked its encrypted answer key. According to Anthropic, this is the first documented case of its kind. The article Anthropic's Claude Opus 4.