Qwen3.5 27B - Steampunque's hybrid Q6_K_H quant beats unsloth Q4-Q5 K_XL?
r/LocalLLaMA
•
Generative AI
I want to share my initial findings on hybrid quant from steampunque to ignite further testing and discussion on the topic: In the end i think there is some overthinking in unsloth quants that may come from calibration maybe or compared to steampunques approach with high quality start/end quants that may produce this difference? Not sure hope it will help with improvement of this great model. submitted by /u/Ok-Importance-3529 [link] [comments]