vLLM Just Merged TurboQuant Fix for Qwen 3.5+

r/LocalLLaMA
Open Source AI AI Tools

Previously it was throwing a 'Not Implemented' error due to Mamba layers. Going to test it now! submitted by /u/havenoammo [link] [comments]