AI RESEARCH
Look, Zoom, Understand: The Robotic Eyeball for Embodied Perception
arXiv CS.CV
•
ArXi:2511.15279v2 Announce Type: replace-cross In embodied AI, visual perception should be active rather than passive: the system must decide where to look and at what scale to sense to acquire maximally informative data under pixel and spatial budget constraints. Existing vision models coupled with fixed RGB-D cameras fundamentally fail to reconcile wide-area coverage with fine-grained detail acquisition, severely limiting their efficacy in open-world robotic applications.