Qwen3.6 is maintaining context inside the CoT

r/LocalLLaMA
Generative AI

I tested it in several iterations, and although it's sometimes hard to make the model stick to the number, it reliably remembered the number when it was chosen during reasoning. You have to add --chat-template-kwargs '{"preserve_thinking": true}' for this to actually work. submitted by /u/Big_Mix_4044 [link] [comments]