AI RESEARCH

Progressive Online Video Understanding with Evidence-Aligned Timing and Transparent Decisions

arXiv CS.CV

ArXi:2604.18459v1 Announce Type: new Visual agents operating in the wild must respond to queries precisely when sufficient evidence first appears in a video stream, a critical capability that is overlooked by conventional video LLMs evaluated in offline settings. The shift to an online, streaming paradigm