Small Gemma 4, Qwen 3.6 and Qwen 3 Coder Next comparison for a debugging use-case

r/LocalLLaMA
Open Source AI

Nothing extensive to see here, just a quick qualitative and performance comparison for a single programming use-case: Making an ancient website that uses Flash for everything work with modern browsers. I let all 3 models tackle exactly the same issue and provided exactly the same multi-turn feedback. Gemma 4 and Qwen 3.6 both nailed the first issue in a functionally equivalent way and provided useful additional feedback. Q3CN went for a convoluted fix. All three missed a remaining breaking issue after the proposed fix. Gemma 4 then made a simple, spot-on fix.