AI RESEARCH

MIBench: Evaluating LMMs on Multimodal Interaction

arXiv CS.AI

ArXi:2603.13427v1 Announce Type: cross In different multimodal scenarios, it needs to integrate and utilize information across modalities in a specific way based on the demands of the task. Different integration ways between modalities are referred to as "multimodal interaction". How well a model handles various multimodal interactions largely characterizes its multimodal ability. In this paper, we