AI RESEARCH

Expert Evaluation and the Limits of Human Feedback in Mental Health AI Safety Testing

arXiv CS.AI

arXiv:2601.18061v3 Announce Type: replace

Learning from human feedback (LHF) assumes that expert judgments, appropriately aggregated, yield valid ground truth for