Beyond Medical Diagnostics: How Medical Multimodal Large Language Models Think in Space

ArXi:2603.13800v1 Announce Type: new Visual spatial intelligence is critical for medical image interpretation, yet remains largely unexplored in Multimodal Large Language Models (MLLMs) for 3D imaging. This gap persists due to a systemic lack of datasets featuring structured 3D spatial annotations beyond basic labels. In this study, we