AI RESEARCH

CRAFT: Critic-Refined Adaptive Key-Frame Targeting for Multimodal Video Question Answering

arXiv CS.AI

ArXi:2605.19075v1 Announce Type: cross Grounded multi-video question answering over real-world news events requires systems to surface query-relevant evidence across heterogeneous video archives while attributing every claim to its ing source. We