AI RESEARCH

Eyes on Target: Gaze-Aware Object Detection in Egocentric Video

arXiv CS.AI

ArXi:2511.01237v2 Announce Type: replace-cross Human gaze offers rich supervisory signals for understanding visual attention in complex visual environments. In this paper, we propose Eyes on Target, a novel depth-aware and gaze-guided object detection framework designed for egocentric videos. Our approach injects gaze-derived features into the attention mechanism of a Vision Transformer (ViT), effectively biasing spatial feature selection toward human-attended regions.