Llama.cpp now with a true reasoning budget!
r/LocalLLaMA
•
Generative AI
Open Source AI
I'm happy to report that llama.cpp has another nice and exciting feature that I know a lot of you have been waiting for - real for reasoning budgets! Until now, `--reasoning-budget` was basically a stub, with its only function being setting it to 0 to disable thinking via passing `enable_thinking=false` to templates. But now, we