AI RESEARCH
Decoding the Delta: Unifying Remote Sensing Change Detection and Understanding with Multimodal Large Language Models
arXiv CS.CV
•
ArXi:2604.14044v1 Announce Type: new While Multimodal Large Language Models (MLLMs) excel in general vision-language tasks, their application to remote sensing change understanding is hindered by a fundamental "temporal blindness". Existing architectures lack intrinsic mechanisms for multi-temporal contrastive reasoning and struggle with precise spatial grounding. To address this, we first