AI RESEARCH

HatePrototypes: Interpretable and Transferable Representations for Implicit and Explicit Hate Speech Detection

arXiv CS.CL

arXiv:2511.06391v2 Announce Type: replace

Optimization of offensive content moderation models for different types of hateful messages is typically achieved through continued pre-