Qwen3.6 GGUF is so good for debugging.
r/LocalLLaMA
Using an Unsloth dynamic quant on 16 GB VRAM + 32 GB DRAM, with a 200k context window and a q8_0 K cache.
Submitted by /u/_BigBackClock
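
For a rough sense of why quantizing the K cache matters at 200k context, here is a back-of-the-envelope sketch in Python. The layer count, KV head count, and head dimension are hypothetical placeholders, not Qwen3.6's actual architecture. It also assumes every layer keeps a standard attention KV cache and that the V cache stays at f16, since the post only mentions the K cache. The q8_0 size of 34 bytes per 32 elements comes from the GGML block layout (32 int8 values plus one fp16 scale).

```python
# Rough KV cache size estimate. All model dimensions below are
# hypothetical placeholders, NOT the real Qwen3.6 architecture.

Q8_0_BYTES = 34 / 32   # q8_0 block: 32 int8 values + one fp16 scale
F16_BYTES = 2.0        # unquantized half-precision element


def kv_cache_bytes(n_ctx, n_layers, n_kv_heads, head_dim,
                   k_bytes_per_elem, v_bytes_per_elem):
    """Estimate total K + V cache size in bytes for n_ctx tokens."""
    # Elements stored per token for K (and the same count again for V).
    elems_per_token = n_layers * n_kv_heads * head_dim
    return n_ctx * elems_per_token * (k_bytes_per_elem + v_bytes_per_elem)


if __name__ == "__main__":
    # Hypothetical dimensions, chosen only to illustrate the arithmetic.
    dims = dict(n_ctx=200_000, n_layers=32, n_kv_heads=8, head_dim=128)

    f16_k = kv_cache_bytes(**dims, k_bytes_per_elem=F16_BYTES,
                           v_bytes_per_elem=F16_BYTES)
    q8_k = kv_cache_bytes(**dims, k_bytes_per_elem=Q8_0_BYTES,
                          v_bytes_per_elem=F16_BYTES)

    gib = 1024 ** 3
    print(f"f16 K + f16 V:  {f16_k / gib:.1f} GiB")
    print(f"q8_0 K + f16 V: {q8_k / gib:.1f} GiB")
```

Swap in the real dimensions from the model's GGUF metadata to get a number that means something for your setup. If the model mixes attention with other layer types, only the attention layers should be counted.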