How I Built a Spatial Intelligence Agent That Sees, Thinks, and Speaks — Using Gemini Live API

Dev.to AI
Generative AI

Created for the Gemini Live Agent Challenge What if your could be a skilled human guide - one that sees the world through your camera, understands what matters, and tells you only what you need to hear? That's Drishti (दृष्टि - Sanskrit for "Vision"). It's a spatial intelligence agent built on Google's Gemini Live API that transforms any smartinto a real-time navigation companion for visually impaired users. No special hardware. No wearable devices. Just a on a chest lanyard and a voice that understands your world.