QWEN3.6 + ik_llama is fast af
r/LocalLLaMA
•
Generative AI
Running qwen3.6 UD_Q_4_K_M on 16GB vram + 32GB ram with 200k cw + tok/s submitted by /u/_BigBackClock [link] [comments]