AI RESEARCH

Instance-level Visual Active Tracking with Occlusion-Aware Planning

arXiv CS.CV

ArXi:2604.21453v1 Announce Type: new Visual Active Tracking (VAT) aims to control cameras to follow a target in 3D space, which is critical for applications like drone navigation and security surveillance. However, it faces two key bottlenecks in real-world deployment: confusion from visually similar distractors caused by insufficient instance-level discrimination and severe failure under occlusions due to the absence of active planning. To address these, we propose OA-VAT, a unified pipeline with three complementary modules. First, a