Beyond Detection: A Structure-Aware Framework for Scene Text Tracking
arXiv cs.CV
arXiv:2605.17270v1
Modern visual object trackers show impressive results on general targets, yet their performance drops substantially on scene text. Although it remains underexplored, tracking text in videos is essential for dynamic text manipulation tasks such as segmentation, removal, and editing. To fill this gap, this paper formalizes the task as Scene Text Tracking and presents the first systematic study of it.