Linganisha mbinu
Pitia mbinu ulizochagua bega kwa bega; safu zinazotofautiana zinaangaziwa.
| Kugundua Matamshi ya Chuki× | BERT Embeddings× | |
|---|---|---|
| Nyanja | Uchimbaji wa Matini | Uchimbaji wa Matini |
| Familia | Process / pipeline | Process / pipeline |
| Mwaka wa asili≠ | — | 2019 |
| Mwanzilishi≠ | — | Devlin, Chang, Lee & Toutanova (Google AI) |
| Aina≠ | NLP text-classification task | Contextual transformer text-representation method |
| Chanzo asilia≠ | Davidson, T., Warmsley, D., Macy, M. & Weber, I. (2017). Automated Hate Speech Detection and the Problem of Offensive Language. ICWSM, 11(1), 512-515. DOI ↗ | Devlin, J., Chang, M.-W., Lee, K. & Toutanova, K. (2019). BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding. NAACL-HLT, 4171-4186. DOI ↗ |
| Majina mbadala | offensive language detection, toxic content detection, Nefret Söylemi Tespiti | contextual embeddings, transformer embeddings, BERT Tabanlı Metin Gömülmeleri |
| Zinazohusiana | 4 | 4 |
| Muhtasari≠ | Hate speech detection is a natural-language-processing task that automatically identifies hateful, offensive, or harmful text on social media and online platforms. The task was sharpened by Davidson and colleagues (2017), who showed why separating genuine hate speech from merely offensive language is a hard, distinct classification problem rather than a single toxicity score. | BERT-based text embeddings, introduced by Devlin and colleagues at Google AI in 2019, turn text into context-sensitive dense vectors using a bidirectional Transformer encoder. Because the meaning of a word shifts with its context, BERT produces richer representations than static methods such as Word2Vec or topic models like LDA. |
| ScholarGateSeti ya data ↗ |
|
|