Mythos just obliterated SWE-bench with a 93.9% score. The era of the solo mega-corp is actually here.
r/artificial
•
Generative AI
The new SWE-bench numbers for Mythos just dropped, and the gap between it and the current best is terrifying. SWE-bench Verified: Mythos: 93.9% Opus 4.6: 80.8% SWE-bench Pro: Mythos: 77.8% Opus 4.6: 53.4% That Pro score is a nearly 25% jump in autonomous coding. Factor in the rumors around Project Glasswing giving it deep architectural understanding, and the barrier between a prompt and a fully deployed product is basically gone. Imagine what you will be able to build when Mythos drops. All you need is a laptop and an idea.