AI RESEARCH

HealthBench Professional: Evaluating Large Language Models on Real Clinician Chats

arXiv CS.CL

ArXi:2604.27470v1 Announce Type: new Millions of clinicians use ChatGPT to clinical care, but evaluations of the most common use cases in model-clinician conversations are limited. We