Field report: coding with Qwen 3.6 35B-A3B on an M2 MacBook Pro with 32GB RAM

r/LocalLLaMA

TL;DR: I finally have this working and doing real work within the tight specs of my 32GB RAM Mac. For those who would like to fly like Julien Chaumond, here's an updated HOW-TO, an explanation of why I did everything I did, and my personal take on how well it actually works. This is a snapshot in time; I'll keep posting revised versions as my setup improves.

HOW-TO

* We're going to use llama.cpp to run the model locally. These models are really new and bugs are constantly being fixed, so we need to build llama.cpp from source. This is easier than it sounds.
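
For anyone who wants the build-from-source step as something copy-pasteable, here is a rough sketch, not necessarily the exact commands I ran. Assumptions: git, cmake, and the Xcode command line tools are already installed, and upstream lives at github.com/ggml-org/llama.cpp. The function name `build_llama_cpp` and the default checkout directory `~/llama.cpp` are made up for illustration. As far as I know, Metal support is on by default for macOS builds, so no extra flag is passed here.

```shell
# Hypothetical helper: clone llama.cpp on the first run, pull and rebuild on later runs.
# Optional first argument: checkout directory (defaults to ~/llama.cpp, an assumption).
build_llama_cpp() {
  src_dir="${1:-$HOME/llama.cpp}"

  # Clone only if there is no checkout yet.
  if [ ! -d "$src_dir/.git" ]; then
    git clone https://github.com/ggml-org/llama.cpp "$src_dir" || return 1
  fi

  # Pick up the latest upstream fixes.
  git -C "$src_dir" pull --ff-only || return 1

  # Configure and build an optimized release into <checkout>/build.
  cmake -S "$src_dir" -B "$src_dir/build" -DCMAKE_BUILD_TYPE=Release || return 1
  cmake --build "$src_dir/build" --config Release -j "$(sysctl -n hw.ncpu)"
}
```

Paste the function into your shell (or your `~/.zshrc`), then run `build_llama_cpp`. The binaries, `llama-server` among them, end up under `build/bin` inside the checkout. Because fixes land upstream so often, re-running the same function whenever something misbehaves is a cheap way to stay current.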