Qwen 3.5 "Weight Drift" Fix? Automated Tool + Inconclusive NIAH Results

r/LocalLLaMA
Open Source AI

The Context

I’ve been following the Qwen 3.5 thread by u/EvilEnginer, which claims a 90% error reduction from scaling specific ssm_conv1d.weight tensors.

My Testing

I’d like to see whether we can confirm those results and turn the fix into a standard, transparent utility for the community. Based on u/EvilEnginer’s findings about tensor scales in the final blocks, I’ve written an independent tool that automates detecting and repairing this drift.
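
For anyone who wants a feel for what "detect and repair" could mean in practice, here is a rough sketch. It is not the tool's code and not u/EvilEnginer's method. It assumes that "drift" shows up as a tensor whose standard deviation is far from the median across blocks, and that the repair is a plain multiplicative rescale back to that median. The function names, the 1.5 ratio threshold, and the example tensor names and numbers are all made up for illustration.

```python
import statistics


def tensor_scale(values):
    """Population standard deviation, used here as a stand-in for a tensor's 'scale'."""
    return statistics.pstdev(values)


def find_drifted(scales, ratio_threshold=1.5):
    """Return {tensor_name: correction_factor} for tensors that look out of line.

    scales: mapping of tensor name -> scale (e.g. from tensor_scale).
    ratio_threshold: hypothetical cutoff; a tensor is flagged when its scale is
    more than this factor above or below the median scale.
    """
    median = statistics.median(scales.values())
    fixes = {}
    if median <= 0:
        return fixes  # nothing sensible to compare against
    for name, scale in scales.items():
        if scale <= 0:
            continue  # an all-constant tensor cannot be rescaled this way
        ratio = scale / median
        if ratio > ratio_threshold or ratio < 1 / ratio_threshold:
            # Multiplying the tensor by this factor brings its scale to the median.
            fixes[name] = median / scale
    return fixes


def rescale(values, factor):
    """Apply the correction factor to every weight."""
    return [v * factor for v in values]


# Illustrative numbers only, not measurements from any real model.
example_scales = {
    "blk.0.ssm_conv1d.weight": 0.10,
    "blk.1.ssm_conv1d.weight": 0.11,
    "blk.2.ssm_conv1d.weight": 0.09,
    "blk.3.ssm_conv1d.weight": 0.30,
}
example_fixes = find_drifted(example_scales)
```

With these made-up numbers, only the last block is flagged. A real tool would read the tensors from the model file and write the rescaled weights back, which this sketch leaves out on purpose.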