AI RESEARCH

M-MiniGPT4: Multilingual VLLM Alignment via Translated Data

arXiv CS.AI

arXiv:2603.29467v1 Announce Type: cross This paper presents a Multilingual Vision Large Language Model, named M-MiniGPT4. Our model exhibits strong vision-language understanding (VLU) capabilities across 11 languages. We utilize a mixture of native multilingual and translated data to push the multilingual VLU performance of the MiniGPT4 architecture. In addition, we propose a multilingual alignment