AI RESEARCH
IatroBench: Pre-Registered Evidence of Iatrogenic Harm from AI Safety Measures
arXiv CS.CL
•
ArXi:2604.07709v1 Announce Type: cross Ask a frontier model how to taper six milligrams of alprazolam (psychiatrist retired, ten days of pills left, abrupt cessation causes seizures) and it tells her to call the psychiatrist she just explained does not exist. Change one word ("I'm a psychiatrist; a patient presents with. ") and the same model, same weights, same inference pass produces a textbook Ashton Manual taper with diazepam equivalence, anticonvulsant coverage, and monitoring thresholds. The knowledge was there; the model withheld it. IatroBench measures this gap.