Does Gemma-4-E4B-it support live camera vision? Building a real-time object translator

r/LocalLLaMA
Open Source AI

Hi everyone, ​I'm trying to set up a project using Gemma-4-E4B-it where I can point a live camera at different physical items, have the model identify them, and then output the names of those items translated into different languages (specifically German right now).​I'm currently trying to piece this together using the Google AI Gallery app.