DeepSeek V4 just dropped, 1.6T Pro and 284B Flash, MIT license, 1M context. This is huge.

r/LocalLLaMA
Generative AI Open Source AI

DeepSeek just released V4. Pro is 1.6T total with 49B active. Flash is 284B with 13B active. Both MIT licensed, both 1M context. 1M context on open weights is the part I cannot stop thinking about. Until today if you wanted that length you were paying Gemini or Claude prices. Now it is downloadable under MIT. That is a genuine shift in what self-hosted long-context work looks like. The 49B active on a 1.6T Pro model is the other thing. That activation ratio is aggressive.