AI RESEARCH
Are Video Reasoning Models Ready to Go Outside?
arXiv CS.AI
arXiv:2603.10652v1 Announce Type: cross In real-world deployment, vision-language models often encounter disturbances such as weather, occlusion, and camera motion. Under such conditions, their understanding and reasoning degrade substantially, revealing a gap between clean, controlled (i.e., unperturbed) evaluation settings and real-world robustness. To address this limitation, we propose ROVA, a novel