AI RESEARCH

HatePrototypes: Interpretable and Transferable Representations for Implicit and Explicit Hate Speech Detection

arXiv CS.CL • March 10, 2026

ArXi:2511.06391v2 Announce Type: replace Optimization of offensive content moderation models for different types of hateful messages is typically achieved through continued pre-

Read Full Article