AI RESEARCH

Representation Learning with Semantic-aware Instance and Sparse Token Alignments

arXiv CS.CV

ArXi:2601.08165v2 Announce Type: replace Medical contrastive vision-language pre-