EchoPrune: Interpreting Redundancy as Temporal Echoes for Efficient VideoLLMs
arXiv CS.CV
arXiv:2605.10050v1 Announce Type: new Long-form video understanding remains challenging for Video Large Language Models (VideoLLMs), as the dense frame sampling