Google released "Always On Memory Agent" on GitHub - any utility for local models?

I saw a press release pitching this as a way for small orgs to avoid the labor of manually building a vector DB. What I'm wondering is: (1) is it possible to modify it to use a local model instead of the Gemini 3.1 Flash-Lite API, and (2) if so, would it still be useful, given that Gemini 3.1 Flash-Lite has a 1M-token input context and a 64K-token output limit, and a local model would probably have much less of both?

submitted by /u/makingnoise