I Tested Mistral Medium 3.5 on 18 Tasks — The 128B Just Killed Magistral and Devstral

Towards AI

One set of weights now does the job of two specialized models. I ran 18 coding tasks to see whether the merged-model gamble actually works.