Hate Speech Detection — Automated Classification of Harmful Text

Hate speech detection is a natural language processing task: automatically identifying hateful, offensive, or otherwise harmful text on social media and other online platforms. Davidson and colleagues (2017) sharpened the task by showing that separating genuine hate speech from merely offensive language is a hard classification problem in its own right, one that a single toxicity score cannot capture.
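
The distinction can be made concrete with a minimal sketch. This is not Davidson et al.'s implementation. It assumes a hypothetical classifier that outputs one probability per class for three labels (hate speech, offensive language, neither). The probabilities, the `collapse_to_toxicity` helper, and the 0.5 threshold are illustrative assumptions, not values from the paper.

```python
from enum import Enum


class Label(Enum):
    HATE = "hate_speech"
    OFFENSIVE = "offensive_language"
    NEITHER = "neither"


def predict_label(probs):
    """Three-way decision: return the class with the highest probability."""
    return max(probs, key=probs.get)


def collapse_to_toxicity(probs):
    """Hypothetical single score: total probability that the text is not benign."""
    return probs[Label.HATE] + probs[Label.OFFENSIVE]


def is_flagged(probs, threshold=0.5):
    """Binary decision made from the single toxicity score alone."""
    return collapse_to_toxicity(probs) >= threshold


# Made-up class probabilities for two hypothetical posts (not real model output).
post_a = {Label.HATE: 0.70, Label.OFFENSIVE: 0.20, Label.NEITHER: 0.10}
post_b = {Label.HATE: 0.05, Label.OFFENSIVE: 0.85, Label.NEITHER: 0.10}

for name, probs in (("post_a", post_a), ("post_b", post_b)):
    print(
        name,
        predict_label(probs).value,
        round(collapse_to_toxicity(probs), 2),
        is_flagged(probs),
    )
```

Under these assumed numbers, both posts get essentially the same toxicity score and both are flagged, yet the three-way decision labels one as hate speech and the other as offensive language. That lost distinction is why the task is framed as multi-class classification rather than as thresholding one score.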

Sources

  1. Davidson, T., Warmsley, D., Macy, M. & Weber, I. (2017). Automated Hate Speech Detection and the Problem of Offensive Language. ICWSM, 11(1), 512-515. DOI: 10.1609/icwsm.v11i1.14955
  2. Fortuna, P. & Nunes, S. (2018). A Survey on Automatic Detection of Hate Speech in Text. ACM Computing Surveys, 51(4), 1-30. DOI: 10.1145/3232676

ScholarGate. Hate Speech Detection (Automated Hate Speech Detection). Retrieved 2026-06-04 from https://scholargate.app/en/text-mining/hate-speech-detection