We ran GPT-5.4 (xhigh) an additional ten times on Tier 4 to get a pass@10 score (1 minute read)

TLDR AI
Generative AI LLMs

In one of these runs, it solved another problem no model had solved before.