AI RESEARCH

DisaBench: A Participatory Evaluation Framework for Disability Harms in Language Models

arXiv cs.AI

arXiv:2605.12702v1 Announce Type: new General-purpose safety benchmarks for large language models do not adequately evaluate disability-related harms. We