AI RESEARCH
UAV as Urban Construction Change Monitor: A New Benchmark and Change Captioning Model
arXiv cs.CV
arXiv:2605.04409v1 Announce Type: new
Remote Sensing Image Change Captioning (RSICC) aims to generate spatially grounded natural language descriptions of scene evolution from bi-temporal imagery, moving beyond binary change masks toward semantic-level understanding. However, existing methods rely on implicit feature differencing without explicitly modeling structured change semantics, and they struggle to reconcile the conflicting representational demands of change detection and caption generation.