AI RESEARCH
HINT: Composed Image Retrieval with Dual-path Compositional Contextualized Network
arXiv CS.CV
arXiv:2603.26341v1 Announce Type: new
Composed Image Retrieval (CIR) is a challenging image retrieval paradigm. Given a multimodal query composed of a reference image and a modification text, it aims to retrieve from a large-scale image database the target images that are consistent with the modification semantics. Although existing methods have made significant progress in cross-modal alignment and feature fusion, a key flaw remains: they neglect contextual information when discriminating matching samples.