AI RESEARCH
VL-UniTrack: A Unified Framework with Visual-Language Prompts for UAV-Ground Visual Tracking
arXiv CS.CV
•
ArXi:2605.04574v1 Announce Type: new UAV-ground visual tracking (UGVT) aims to simultaneously track the same object from both the UAV and the ground view. However, existing two-stream methods suffer from isolated feature extraction and rely heavily on implicit appearance matching, which struggles to establish reliable correspondence under drastic view differences, leading to tracking unreliability. To address these limitations, we propose VL-UniTrack, a fully unified framework enhanced by visual-language prompts.