We ran GPT-5.4 (xhigh) an additional ten times on Tier 4 to get a pass@10 score (1 minute read)
TLDR AI
•
Generative AI
LLMs
In one of these runs, it solved another problem no model had solved before.