I Crammed RAG, a Vector Database, and a Gemma LLM into a Mobile App. Here’s What Happened.
Towards AI
•
Generative AI
Open Source AI
No cloud. No API keys. No excuses. The full on-device pipeline - from writing a note to getting an answer. Nothing in this flow touches a network after the initial model download. It started with a paranoid thought. I was taking meeting notes on my - project decisions, research fragments, half-formed ideas - and I wanted to ask a question across all of them. What did I decide about the API design last month? What did that article say about PostgreSQL indexing? The obvious answer was to pipe everything into one of the big AI APIs.