AI RESEARCH

Interpretable Zero-shot Referring Expression Comprehension with Query-driven Scene Graphs

arXiv cs.CV

arXiv:2603.25004v1 Announce Type: new Zero-shot referring expression comprehension (REC) aims to locate target objects in images given natural language queries without relying on task-specific