AI RESEARCH
Instance-level Visual Active Tracking with Occlusion-Aware Planning
arXiv CS.CV
•
ArXi:2604.21453v1 Announce Type: new Visual Active Tracking (VAT) aims to control cameras to follow a target in 3D space, which is critical for applications like drone navigation and security surveillance. However, it faces two key bottlenecks in real-world deployment: confusion from visually similar distractors caused by insufficient instance-level discrimination and severe failure under occlusions due to the absence of active planning. To address these, we propose OA-VAT, a unified pipeline with three complementary modules. First, a