AI RESEARCH
MELT: Improve Composed Image Retrieval via the Modification Frequentation-Rarity Balance Network
arXiv CS.AI
arXiv:2603.29291v1 Announce Type: cross Composed Image Retrieval (CIR) uses a reference image and a modification text as a query to retrieve a target image that satisfies the requirement of "modifying the reference image according to the text instructions". However, existing CIR methods face two limitations: (1) frequency bias, which leads to "Rare Sample Neglect", and (2) similarity scores that are susceptible to interference from hard negative samples and noise.