Would a fully open SmolLM4-750M with 16K context make sense?
r/LocalLLaMA
•
AI Tools
I’ve been thinking about a possible gap in the current small local model space: a modern fully open ~750M model. Hugging Face already has SmolLM2 at 135M, 360M, and 1.7B, and SmolLM3 pushes the family to 3B with long context, multilingual, and reasoning. The Smol Models repo also describes the goal pretty clearly: fully open, compact models that can run effectively on-device while still having strong performance. So my idea is: SmolLM4-750M High-level target: ~750M parameters 16K context Causal LM Fully open weights Fully open data recipe.