AI RESEARCH
ReaMOT: A Benchmark and Framework for Reasoning-based Multi-Object Tracking
arXiv CS.CV
•
ArXi:2505.20381v4 Announce Type: replace Referring Multi-Object Tracking (RMOT) aims to track targets specified by language instructions. However, existing RMOT paradigms heavily rely on explicit visual-textual matching and consequently fail to generalize to complex instructions that require logical reasoning. To overcome this, we propose Reasoning-based Multi-Object Tracking (ReaMOT), a novel task that elevates tracking to a cognitive level, requiring models to infer and track specific targets satisfying implicit constraints via logical reasoning.