Mythos just obliterated SWE-bench with a 93.9% score. The era of the solo mega-corp is actually here.

r/artificial
Generative AI

The new SWE-bench numbers for Mythos just dropped, and the gap between it and the current best is terrifying. ​SWE-bench Verified: ​Mythos: 93.9% ​Opus 4.6: 80.8% ​SWE-bench Pro: ​Mythos: 77.8% ​Opus 4.6: 53.4% ​That Pro score is a nearly 25% jump in autonomous coding. Factor in the rumors around Project Glasswing giving it deep architectural understanding, and the barrier between a prompt and a fully deployed product is basically gone. ​Imagine what you will be able to build when Mythos drops. ​All you need is a laptop and an idea.