AI RESEARCH
QueST: Persistent Queries as Semantic Monitors for Drift Suppression in Long-Horizon Tracking
arXiv CS.CV
•
ArXi:2605.09513v1 Announce Type: new Tracking points in videos is typically formulated as frame-to-frame correspondence, where each point is matched locally to the next frame. While this works over short horizons, errors accumulate under articulation, occlusion, and viewpoint change, leading to silent semantic drift that existing trackers cannot detect or correct. In this work, we revisit long-horizon tracking from a monitoring perspective and