Qwen 3.5 122B completely falls apart at ~100K context
r/LocalLLaMA
Is anyone else having issues with Qwen 122B falling apart completely at ~100K context? I am using vLLM with the olka-fi MXFP4 quant. When the model hits this threshold, it abruptly stops working. Agents work great up to this point, and then the model stops following instructions for more than maybe one step. I saw someone mention this about 27B yesterday, but now I can't find the post. It's definitely happening with 122B as well.

Submitted by /u/TokenRingAI