I tested GLM-5.1 — it beat GPT-5.4 & Claude Opus 4.6 and is 7.8× cheaper.
Towards AI • Generative AI
An MIT-licensed model just beat both GPT-5.4 and Claude Opus 4.6 on SWE-Bench Pro, a test of real-world software engineering. I spent…