If you haven't yet given Gemma 4 a go...do it today

r/LocalLLaMA

I have a modest rig that lets me run Qwen 3.5 27B or even 35B via Ollama. Qwen has been amazing to work with, and I've been fine with the slow-drip trade-off. Then Google released Gemma 4. It's fast, like 4B or 9B fast. Accuracy- and confidence-wise, it reminds me of that first release of Gemini Pro that could actually produce code that would run. As a "local guy", this shift in usability and confidence for a small self-hosted LLM reminded me of what DeepSeek brought to the table years ago with the thinking capability.