I trained TIME: short context-triggered thinking on Qwen model instead of overthinking
r/LocalLLaMA
•
Open Source AI
AI Research
Started this as a personal project for my Open-WebUI setup to use. Somehow it ended up as an ACL 2026 paper. Not some lab paper, it is personal solo independent paper that happened. TIME is basically my attempt to train Qwen3 models to think in short bursts wherever the response actually needs it, instead of dumping one giant reasoning block at the start. Not just “make thinking shorter" or “turn thinking on/off per task” or "split thinking to interleaving reasoning for the task" like: let the model re-think mid-response when context gives it a reason to.