AI RESEARCH

HiSem: Hierarchical Semantic Disentangling for Remote Sensing Image Change Captioning

arXiv CS.CV

ArXi:2605.15024v1 Announce Type: new Remote sensing image change captioning (RSICC) aims to achieve high-level semantic understanding of genuine changes occurring between bi-temporal images. Despite notable progress, existing methods are fundamentally limited by a shared modeling assumption: changed and unchanged image pairs, which have intrinsically different semantic granularities, are processed under a unified modeling strategy. This modeling inconsistency leads to semantic entanglement between coarse-grained change existence judgment and fine-grained semantic understanding.