Unsloth/Qwen3.6-35b-a3b -> Q5_K_S vs Q4_K_XL

r/LocalLLaMA

I run both quants from Unsloth with the recommended settings, and what I found is that Q4_K_XL does a LOT better job in my use cases: web research, document research, transcripts, Python and HTML coding, and code debugging. The difference shows up especially in web search. It looks to me like reasoning is a lot stronger in the Q4 model. Has anybody else noticed that?

submitted by /u/KringleKrispi