AI RESEARCH

Omni-DuplexEval: Evaluating Real-time Duplex Omni-modal Interaction

arXiv CS.CV

ArXi:2605.17360v1 Announce Type: new Real-time duplex interaction is essential for multimodal AI systems operating in real-world scenarios, where models must continuously process streaming inputs and respond at appropriate moments. However, most existing multimodal large language models (MLLMs) are evaluated in offline settings, where the entire video input is processed before any response is generated. While recent work has started to explore real-time duplex MLLMs, there is still no comprehensive benchmark or automatic evaluation method for this setting.