Qwen3.6 GGUF is so good for debugging.

r/LocalLLaMA

Using an Unsloth dynamic quant on 16 GB VRAM + 32 GB DRAM, with a 200k context window and a q8_0 K cache.

Submitted by /u/_BigBackClock
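For a rough sense of why a q8_0 K cache helps at a 200k context, here is a minimal sketch of the arithmetic. The q8_0 block layout (32 int8 values plus one 2-byte fp16 scale, so 34 bytes per 32 values) is the standard GGML format. The layer count, KV head count, and head dimension below are hypothetical placeholders, not the real Qwen3.6 numbers; read the actual values from the GGUF metadata, and count only the layers that keep a standard attention K cache.

```python
# Rough K-cache size estimate: q8_0 vs f16.
# q8_0 stores values in blocks of 32: 32 int8 values + one fp16 scale = 34 bytes.
Q8_0_BLOCK_VALUES = 32
Q8_0_BLOCK_BYTES = 34
Q8_0_BYTES_PER_VALUE = Q8_0_BLOCK_BYTES / Q8_0_BLOCK_VALUES  # 1.0625
F16_BYTES_PER_VALUE = 2.0


def k_cache_bytes(n_ctx, n_layers, n_kv_heads, head_dim, bytes_per_value):
    """Bytes needed to store K for every token, layer, and KV head."""
    return n_ctx * n_layers * n_kv_heads * head_dim * bytes_per_value


# HYPOTHETICAL model dimensions, for illustration only.
n_ctx = 200_000      # context window from the post
n_layers = 16        # assumption: layers that hold a K cache
n_kv_heads = 4       # assumption
head_dim = 128       # assumption

q8 = k_cache_bytes(n_ctx, n_layers, n_kv_heads, head_dim, Q8_0_BYTES_PER_VALUE)
f16 = k_cache_bytes(n_ctx, n_layers, n_kv_heads, head_dim, F16_BYTES_PER_VALUE)

print(f"K cache at q8_0: {q8 / 2**30:.2f} GiB")
print(f"K cache at f16:  {f16 / 2**30:.2f} GiB")
print(f"q8_0 / f16 ratio: {q8 / f16:.5f}")
```

Whatever the real dimensions are, the ratio is fixed by the formats: q8_0 takes 34/64 of the f16 size, so the K cache shrinks by a bit under half. The V cache is sized the same way and is not included here.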