AI RESEARCH

Gated Relational Alignment via Confidence-based Distillation for Efficient VLMs

arXiv CS.CV

ArXi:2601.22709v3 Announce Type: replace Vision-Language Models (VLMs) achieve strong multimodal performance but are costly to deploy, and post-