AI RESEARCH

LLMTrack: Semantic Multi-Object Tracking with Multi-modal Large Language Models

arXiv CS.AI

ArXi:2601.06550v2 Announce Type: replace-cross Multi-Object Tracking (MOT) is evolving from geometric localization to Semantic MOT (SMOT) to answer complex relational queries, yet progress is hindered by semantic data scarcity and a structural disconnect between tracking architectures and Multi-modal Large Language Models (MLLMs). To address this, we