Qwen3.5-9B is actually quite good for agentic coding

r/LocalLLaMA
Generative AI AI Hardware Open Source AI

I have to admit I am quite impressed. My hardware is an Nvidia Geforce RTX 3060 with 12 GB VRAM so it's quite limited. I have been "model-hopping" to see what works best for me. I mainly did my tests with Kilo Code but sometimes I tried Roo Code as well Originally I used a customized Qwen 2.5 Coder for tools calls, It was relatively fast but usually would fail doing tool calls. Then I tested multiple Unsloth quantizations on Qwen 3 Coder. 1-bit quants would work also relatively fast but usually failed doing tool calls as well.