Should I switch from Qwen 3.5 27B (dense) to Qwen 3.6 35B-A3B for tool calls & vision? Need Docker config review + VRAM advice
r/LocalLLaMA
•
Generative AI
Open Source AI
Hi r/LocalLLaMA, I'm currently running Qwen3.5-27B-UD-Q4_K_XL locally via llama.cpp with OpenWebUI and considering upgrading to Qwen3.6-35B-A3B (GGUF). Before making the switch, I'd appreciate some community feedback on performance, intelligence, and my current setup.