Blending Human and LLM Expertise to Detect Hallucinations and Omissions in Mental Health Chatbot Responses

ArXi:2604.06216v1 Announce Type: new As LLM-powered chatbots are increasingly deployed in mental health services, detecting hallucinations and omissions has become critical for user safety. However, state-of-the-art LLM-as-a-judge methods often fail in high-risk healthcare contexts, where subtle errors can have serious consequences. We show that leading LLM judges achieve only 52% accuracy on mental health counseling data, with some hallucination detection approaches exhibiting near-zero recall.