AI RESEARCH
Generating a Paracosm for Training-Free Zero-Shot Composed Image Retrieval
arXiv CS.CV
•
ArXi:2602.00813v3 Announce Type: replace Composed Image Retrieval (CIR) is the task of retrieving a target image from a database using a multimodal query, which consists of a reference image and a modification text. The text specifies how to alter the reference image to form a ''mental image'', based on which CIR should find the target image in the database. The fundamental challenge of CIR is that this ''mental image'' is not physically available and is only implicitly defined by the query.