AI RESEARCH

DiffuSAM: Diffusion Guided Zero-Shot Object Grounding for Remote Sensing Imagery

arXiv CS.LG

ArXi:2604.18201v1 Announce Type: cross Diffusion models have emerged as powerful tools for a wide range of vision tasks, including text-guided image generation and editing. In this work, we explore their potential for object grounding in remote sensing imagery. We propose a hybrid pipeline that integrates diffusion-based localization cues with state-of-the-art segmentation models such as RemoteSAM and SAM3 to obtain accurate bounding boxes.