We just shipped Gemma 4 support in Off Grid 🔥- open-source mobile app, on-device inference, zero cloud. Android live, iOS coming soon.

r/LocalLLaMA •
Generative AI AI Hardware Open Source AI AI Tools

We shipped Gemma 4 (E2B and E4B edge variants) in Off Grid today - our open-source, offline-first AI app for Android and iOS. What makes this different from other local LLM setups: → No server, no Python, no laptop. Runs entirely on your phone's NPU/CPU. → Gemma 4's 128K context window, fully on-device - finally useful for long docs and code on mobile. → Native vision: point your camera at anything and ask Gemma 4 about it. → Whisper speech-to-text, Stable Diffusion image gen, tool calling - all in one app. → ~15-30 tok/s on Snapdragon 8 Gen 3 / Apple A17 Pro.