- New world-first standards set new rules for how tech giants must tackle worst-of-the-worst online content (eSafety Commissioner, Australia)
- A study analyzing the spread of misogyny and other hate speech on social media; A systematic evidence review of case studies to understand online gender-based violence in the Indian Context (ANVESHAK)
- From Research to Action: Evidence-Based Strategies to Combat Online Hate Speech Against Women in Armenia (European Partnership for Democracy)
- Select Recent Advances in Online Hate Speech Moderation: Multimodality and the Role of Large Models (ACL Anthology)
- The Content Moderator’s Dilemma: Removal of Toxic Content and Distortions to Online Discourse (arXiv)
- Towards Efficient and Explainable Hate Speech Detection via Model Distillation (arXiv)
- ReZG: Retrieval-augmented zero-shot counter narrative generation for hate speech (Neurocomputing)
- Intent-conditioned and Non-toxic Counterspeech Generation using Multi-Task Instruction Tuning with RLAIF (ACL Anthology)
preventhate.org |
Averting and Countering Online Hatred