KG-CMI: Knowledge graph enhanced cross-Mamba interaction for medical visual question answering

arXiv:2604.00601v1 Announce Type: new Medical visual question answering (Med-VQA) is a crucial multimodal task in clinical decision-making and telemedicine. Recent methods fail to fully leverage domain-specific medical knowledge, making it difficult to accurately associate lesion features in medical images with key diagnostic criteria. Additionally, classification-based approaches typically rely on predefined answer sets.