Process / pipeline
Hate Speech Detection — Automated Classification of Harmful Text
Hate speech detection is a natural-language-processing task that automatically identifies hateful, offensive, or harmful text on social media and online platforms. The task was sharpened by Davidson and colleagues (2017), who showed why separating genuine hate speech from merely offensive language is a hard, distinct classification problem rather than a single toxicity score.
Open in MethodMindSoonVideoSoon
Read the full method
Members only
Sign inSign in with a free account to read this section.
Sources
- Davidson, T., Warmsley, D., Macy, M. & Weber, I. (2017). Automated Hate Speech Detection and the Problem of Offensive Language. ICWSM, 11(1), 512-515. DOI: 10.1609/icwsm.v11i1.14955 ↗
- Fortuna, P. & Nunes, S. (2018). A Survey on Automatic Detection of Hate Speech in Text. ACM Computing Surveys, 51(4), 1-30. DOI: 10.1145/3232676 ↗