Qwen3.5-9B is actually quite good for agentic coding
r/LocalLLaMA
I have to admit I am quite impressed. My hardware is an Nvidia GeForce RTX 3060 with 12 GB VRAM, so it's quite limited. I have been "model-hopping" to see what works best for me. I mainly did my tests with Kilo Code, but sometimes I tried Roo Code as well. Originally I used a customized Qwen 2.5 Coder for tool calls. It was relatively fast but would usually fail at tool calls. Then I tested multiple Unsloth quantizations of Qwen 3 Coder. The 1-bit quants were also relatively fast but usually failed at tool calls as well.