RedBench: A Universal Dataset for Comprehensive Red Teaming of Large Language Models

ArXi:2601.03699v2 Announce Type: replace As large language models (LLMs) become integral to safety-critical applications, ensuring their robustness against adversarial prompts is paramount. However, existing red teaming datasets suffer from inconsistent risk categorizations, limited domain coverage, and outdated evaluations, hindering systematic vulnerability assessments. To address these challenges, we