Should I switch from Qwen 3.5 27B (dense) to Qwen 3.6 35B-A3B for tool calls & vision? Need Docker config review + VRAM advice

r/LocalLLaMA
Generative AI Open Source AI

Hi r/LocalLLaMA, I'm currently running Qwen3.5-27B-UD-Q4_K_XL locally via llama.cpp with OpenWebUI and considering upgrading to Qwen3.6-35B-A3B (GGUF). Before making the switch, I'd appreciate some community feedback on performance, intelligence, and my current setup.