Anthropic's Claude Opus 4.6 saw through an AI test, cracked the encryption, and grabbed the answers itself
The Decoder
•
Generative AI
LLMs
AI Research
Anthropic's Claude Opus 4.6 independently figured out it was being tested during a benchmark, identified the specific test, and cracked its encrypted answer key. According to Anthropic, this is the first documented case of its kind. The article Anthropic's Claude Opus 4.