How bad is 1-bit quantization but on a big model?

r/LocalLLaMA
Generative AI

I'm planning on running Qwen3.5-397B-A17B then saw that the IQ1_S and IQ1_M have quite small size, how bad are they compared to the original and are they comparable to like Gwen3.5 122B or 35B? submitted by /u/FusionBetween [link] [comments]