AI RESEARCH

KiToke: Kernel-based Interval-aware Token Compression for Video Large Language Models

arXiv CS.CV

ArXi:2604.03414v1 Announce Type: new Video Large Language Models (Video LLMs) achieve strong performance on video understanding tasks but suffer from high inference costs due to the large number of visual tokens. We propose KiToke, a