I tested GLM-5.1 — it beat GPT-5.4 & Claude Opus 4.6 and is 7.8× cheaper.

Towards AI
Generative AI

An MIT-licensed model just hit on SWE-Bench Pro, beating both GPT-5.4 and Claude Opus 4.6 at real-world software engineering. I spent…