AI RESEARCH

DiffCap-Bench: A Comprehensive, Challenging, Robust Benchmark for Image Difference Captioning

arXiv CS.AI

ArXi:2605.04503v1 Announce Type: cross Image Difference Captioning (IDC) generates natural language descriptions that precisely identify differences between two images, serving as a key benchmark for fine-grained change perception, cross-modal reasoning, and image editing data construction.