AI RESEARCH

Fair and Calibrated Toxicity Detection with Robust Training and Abstention

arXiv cs.LG • May 15, 2026

arXiv:2605.14074v1. Fairness in toxicity classification involves three integrated axes: ranking, calibration, and abstention.
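To make the three axes concrete, here is a minimal sketch of one common way to measure each of them on a set of toxicity scores. It is not the paper's method: the function names (`auc`, `expected_calibration_error`, `abstain_mask`), the ten-bin calibration estimate, and the 0.4 to 0.6 abstention band are all assumptions chosen for illustration.

```python
from typing import List, Sequence


def auc(scores: Sequence[float], labels: Sequence[int]) -> float:
    """Ranking: chance that a random toxic example scores above a random
    non-toxic one (ties count as half)."""
    pos = [s for s, y in zip(scores, labels) if y == 1]
    neg = [s for s, y in zip(scores, labels) if y == 0]
    if not pos or not neg:
        raise ValueError("AUC needs at least one example of each class")
    wins = sum(1.0 if p > n else 0.5 if p == n else 0.0
               for p in pos for n in neg)
    return wins / (len(pos) * len(neg))


def expected_calibration_error(scores: Sequence[float],
                               labels: Sequence[int],
                               n_bins: int = 10) -> float:
    """Calibration: size-weighted gap between mean predicted score and
    observed toxic rate within equal-width score bins (assumed estimator)."""
    bins: List[list] = [[] for _ in range(n_bins)]
    for s, y in zip(scores, labels):
        idx = min(int(s * n_bins), n_bins - 1)  # a score of 1.0 goes in the last bin
        bins[idx].append((s, y))
    total = len(scores)
    ece = 0.0
    for b in bins:
        if b:
            mean_score = sum(s for s, _ in b) / len(b)
            toxic_rate = sum(y for _, y in b) / len(b)
            ece += len(b) / total * abs(mean_score - toxic_rate)
    return ece


def abstain_mask(scores: Sequence[float],
                 low: float = 0.4, high: float = 0.6) -> List[bool]:
    """Abstention: flag examples whose score falls inside an uncertainty
    band (hypothetical thresholds) so they can be deferred to a human."""
    return [low < s < high for s in scores]
```

A fairness analysis along these lines would typically compute each quantity separately per identity subgroup and compare the results, for example checking whether one group has a lower AUC, a higher calibration error, or a higher abstention rate than another.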