AI RESEARCH
Geometry-Guided Camera Motion Understanding in VideoLLMs
arXiv CS.AI
arXiv:2603.13119v1 Announce Type: cross Camera motion is a fundamental geometric signal that shapes visual perception and cinematic style, yet current video-capable vision-language models (VideoLLMs) rarely represent it explicitly and often fail on fine-grained motion primitives. We address this gap with a framework of $\textbf{benchmarking}$, $\textbf{diagnosis}$, and $\textbf{injection