Benchmarking and Mitigating Sycophancy in Medical Vision Language Models

arXiv CS.CV

arXiv:2509.21979v4

Visual language models (VLMs) have the potential to transform medical workflows. However, their deployment is limited by sycophancy. Despite this serious threat to patient safety, a systematic benchmark remains lacking. This paper addresses this gap by