Gemma 4 31B vs Qwen 3.5 27B: Which is best for long context worklows? My THOUGHTS...
r/LocalLLaMA
•
Open Source AI
My setup: i7 12700K | RTX 3090 TI | 96GB RAM Models: Qwen 3.5 27B UD Q5/Q6_K_XL | Gemma 4 31B UD Q4_K_XL To the point: Right now, Gemma 4 31B and Qwen 3.5 27B are the best local models for a 24GB card. Period. I've tested everything. These are the first two models that actually feel state-of-the-art for their size. Most models up to this point have just been moderately-performing novelties. But not extremely useful for real use-cases outside of rewriting, summarization, minor code, and RPG-ing. But all local models have performed poorly over long context reasoning and analysis.