I haven't experienced Qwen3.5 (35B and 27B) over thinking. Posting my settings/prompt
r/LocalLLaMA
•
Open Source AI
I felt the need to make a post about these models, because I see a lot of talk about how they think for extended periods/get caught in thinking loops/use an excessive amount of reasoning tokens. I have never experienced this. In fact, I've noticed the opposite - I have been singularly impressed by how few tokens my Qwen instances use to produce high quality responses.