Should we use a non-thinking model for code after using a thinking one for plan? (Agentic coding)
r/LocalLLaMA
•
Generative AI
I usually use Qwen3.6 27B (slow as heck on my RX 6800 but it works) for plan and Qwen3.6 35B A3B for the coding. But I was thinking the other day if I should remove the thinking from the code model. Is there a way to disable the thinking from the code model just for the initial hand-off from plan to code but keep it afterwards? My reasoning is that this might help in following instructions from the plan directly but dealing with any new tools/information the plan model did not on its turn. Any insight will be appreciated. submitted by /u/ismaelgokufox [link] [comments.