AI RESEARCH
Learning from Medical Entity Trees: An Entity-Centric Medical Data Engineering Framework for MLLMs
arXiv CS.CL
•
ArXi:2604.25296v1 Announce Type: new Multimodal Large Language Models (MLLMs) have shown transformative potential in medical applications, yet their performance is hindered by conventional data curation strategies that rely on coarse-grained partitioning by modality or department. Such fragmented approaches fail to capture the hierarchical and interconnected nature of clinical medical knowledge, limiting the models' ability to perform fine-grained recognition and complex reasoning. In this paper, we propose a novel Entity-Centric Medical Data Engineering framework.