Comparar métodos
Examine os métodos selecionados lado a lado; as linhas que diferem ficam destacadas.
| Supervised Text Classification× | Classificação de Texto× | |
|---|---|---|
| Área≠ | Political Science | Mineração de texto |
| Família | Process / pipeline | Process / pipeline |
| Ano de origem≠ | 2013 | — |
| Autor original≠ | Machine-learning classification tradition (formalized for political text by Grimmer & Stewart; category-proportion variant by Hopkins & King) | — |
| Tipo≠ | Supervised machine-learning classification of documents | Supervised NLP classification task |
| Fonte seminal≠ | Grimmer, J., & Stewart, B. M. (2013). Text as Data: The Promise and Pitfalls of Automatic Content Analysis Methods for Political Texts. Political Analysis, 21(3), 267–297. DOI ↗ | Joachims, T. (1998). Text Categorization with Support Vector Machines: Learning with Many Relevant Features. ECML 1998. Lecture Notes in Computer Science, vol 1398. Springer. DOI ↗ |
| Outros nomes | Supervised document classification, Text categorization, Automated text coding, Supervised content analysis | text categorization, document classification, topic classification, metin sınıflandırma |
| Relacionados≠ | 5 | 4 |
| Resumo≠ | Supervised text classification trains a statistical model on documents that humans have hand-labeled, then uses it to assign categories — topic, tone, position, relevance — to the much larger set of unlabeled documents. Unlike dictionary methods, which apply a fixed word list, a supervised classifier learns from examples which textual features predict each category, so it can capture context-dependent and non-obvious cues. Grimmer and Stewart present it as a core text-as-data workflow, and a key insight is that for many political-science questions the goal is not perfect document-by-document labels but accurate estimates of category proportions across a corpus. | Text classification, also called text categorization, is a supervised natural-language-processing task that automatically assigns documents to predefined categories. Building on the support-vector-machine approach to text categorization established by Joachims (1998) and consolidated in the text-mining literature by Aggarwal and Zhai (2012), it powers tasks such as spam detection and topic classification by learning from labelled examples. |
| ScholarGateConjunto de dados ↗ |
|
|