AI RESEARCH

EchoPrune: Interpreting Redundancy as Temporal Echoes for Efficient VideoLLMs

arXiv CS.CV

ArXi:2605.10050v1 Announce Type: new Long-form video understanding remains challenging for Video Large Language Models (VideoLLMs), as the dense frame sampling