AI RESEARCH
CODER: Coupled Diversity-Sensitive Momentum Contrastive Learning for Image-Text Retrieval
arXiv CS.CV
arXiv:2208.09843v2 Announce Type: replace Image-Text Retrieval (ITR) is challenging because it must bridge the visual and linguistic modalities. Most prior work adopts contrastive learning. Besides the limited number of negative image-text pairs, the capability of contrastive learning is restricted by the manual weighting of negative pairs and by its unawareness of external knowledge. In this paper, we propose Coupled Diversity-Sensitive Momentum Contrastive Learning (CODER) to improve cross-modal representation.