AI RESEARCH

Mixture-of-Retrieval Experts for Reasoning-Guided Multimodal Knowledge Exploitation

arXiv CS.CL

ArXi:2505.22095v2 Announce Type: replace Multimodal Retrieval-Augmented Generation (MRAG) has shown promise in mitigating hallucinations in Multimodal Large Language Models (MLLMs) by incorporating external knowledge. However, existing methods typically adhere to rigid retrieval paradigms by mimicking fixed retrieval trajectories and thus fail to fully exploit the knowledge of different retrieval experts through dynamic interaction based on the model's knowledge needs or evolving reasoning states. To overcome this limitation, we