If you're using Nvidia's NVFP4 of Qwen3.5-397, try a different quant
r/LocalLLaMA
•
AI Hardware
If the quant is working well for you, awesome. It's KLD is quite divergent, and that translates to real intelligence lost. The larger the model, the less this is visible, so if you don't see it, rocksauce. if you do, try Sehyo's NVFP4 or Quantrio's AWQ, which is very accurate. submitted by /u/Phaelon74 [link] [comments]