Pocket LLM v1.3.0: Offline local LLM chat on Android with LiteRT + ONNX builds
r/LocalLLaMA
Hi everyone,

I've been working on Pocket LLM, an Android app that runs local LLMs fully offline for private, real-time chat.

The latest v1.3.0 update adds:
• LiteRT support for Gemma 4 E2B, Gemma 4 E4B, and Qwen3-0.6B
• Persistent local chat history
• Previous Chats
• Thinking Mode for supported models
• Better markdown rendering
• Themes, font size settings, and a polished chat UI

The goal is to make local LLMs on Android usable as an actual app, not just a basic demo.

Repo:
Releases / prebuilt APKs:

Would love feedback, especially on model support, performance across devices, and UI/UX.