AI RESEARCH
Towards Visual Query Localization in the 3D World
arXiv CS.CV
arXiv:2605.01498v1 Announce Type: new Given a query, visual query localization (VQL) aims to predict the spatio-temporal response of its most recent occurrence in a sequence. Most current research addresses visual query localization in 2D videos, while its counterpart in 3D space has received little attention. In this paper, we make the first attempt to address visual query localization in the 3D world by