As of today, what's the *most stable* model to run on a 32GB RAM Mac w/ 256k context?
r/LocalLLaMA
Hey everyone, I've been playing around with Gemma4 and Qwen3.6 on my 32GB MacBook Pro M2 Max since their release, but I'm struggling to find: The best software to run them (oMLX, llama.cpp