vLLM Just Merged TurboQuant Fix for Qwen 3.5+
r/LocalLLaMA
•
Open Source AI
AI Tools
Previously it was throwing a 'Not Implemented' error due to Mamba layers. Going to test it now! submitted by /u/havenoammo [link] [comments]