AI RESEARCH
HatePrototypes: Interpretable and Transferable Representations for Implicit and Explicit Hate Speech Detection
arXiv CS.CL
•
ArXi:2511.06391v2 Announce Type: replace Optimization of offensive content moderation models for different types of hateful messages is typically achieved through continued pre-