Experiences with DS4 on long-lived agents

r/LocalLLaMA

Holy cow, if you guys are running background agents or heavy tool-calling pipelines, you need to test the new DeepSeek V4 Flash model immediately. For context, I maintain an open-source agent platform: basically a persistent daemon that handles background Python execution and SQLite state management. Because our agents run 24/7, sometimes making hundreds of tool calls an hour, API costs are usually our biggest bottleneck. Up until yesterday, DeepSeek 3.2 was our primary low-cost model: insane price, and perf comparable to SOTA models.