AI RESEARCH

SEM: Sparse Embedding Modulation for Post-Hoc Debiasing of Vision-Language Models

arXiv CS.AI

ArXi:2603.19028v1 Announce Type: cross Models that bridge vision and language, such as CLIP, are key components of multimodal AI, yet their large-scale, uncurated