AI RESEARCH

WikiCLIP: An Efficient Contrastive Baseline for Open-domain Visual Entity Recognition

arXiv CS.CV

arXiv:2603.09921v1 Announce Type: new Open-domain visual entity recognition (VER) seeks to associate images with entities in encyclopedic knowledge bases such as Wikipedia. Recent generative methods tailored for VER demonstrate strong performance but incur high computational costs, limiting their scalability and practical deployment. In this work, we revisit the contrastive paradigm for VER and