OmniSelect: Dynamic Modality-Aware Token Compression for Efficient Omni-modal Large Language Models

arXiv:2605.18041v1 Announce Type: new

Omnimodal large language models (OmniLLMs) have recently gained increasing attention for unified audio-video understanding. However, processing long multimodal token sequences