Confronta i metodi
Esamina i metodi selezionati fianco a fianco; le righe che differiscono sono evidenziate.
| Supervised Text Classification× | Structural Topic Model× | |
|---|---|---|
| Campo | Political Science | Political Science |
| Famiglia | Process / pipeline | Process / pipeline |
| Anno di origine≠ | 2013 | 2014 |
| Ideatore≠ | Machine-learning classification tradition (formalized for political text by Grimmer & Stewart; category-proportion variant by Hopkins & King) | Margaret Roberts, Brandon Stewart & Dustin Tingley |
| Tipo≠ | Supervised machine-learning classification of documents | Mixed-membership topic model with document-level covariates |
| Fonte seminale≠ | Grimmer, J., & Stewart, B. M. (2013). Text as Data: The Promise and Pitfalls of Automatic Content Analysis Methods for Political Texts. Political Analysis, 21(3), 267–297. DOI ↗ | Roberts, M. E., Stewart, B. M., Tingley, D., Lucas, C., Leder-Luis, J., Gadarian, S. K., Albertson, B., & Rand, D. G. (2014). Structural Topic Models for Open-Ended Survey Responses. American Journal of Political Science, 58(4), 1064–1082. DOI ↗ |
| Alias | Supervised document classification, Text categorization, Automated text coding, Supervised content analysis | STM, Structural topic modeling, Covariate-aware topic model, Topic model with metadata |
| Correlati | 5 | 5 |
| Sintesi≠ | Supervised text classification trains a statistical model on documents that humans have hand-labeled, then uses it to assign categories — topic, tone, position, relevance — to the much larger set of unlabeled documents. Unlike dictionary methods, which apply a fixed word list, a supervised classifier learns from examples which textual features predict each category, so it can capture context-dependent and non-obvious cues. Grimmer and Stewart present it as a core text-as-data workflow, and a key insight is that for many political-science questions the goal is not perfect document-by-document labels but accurate estimates of category proportions across a corpus. | The Structural Topic Model (STM) is a text-as-data method that discovers latent themes in a corpus while letting document metadata — party, time, gender, treatment condition — shape those themes. Introduced by Roberts, Stewart, Tingley and colleagues in 2014, it generalizes correlated topic modeling so that topic prevalence (how much a document is about a topic) and topic content (the words used to express a topic) can both depend on covariates. The result is a single model that simultaneously estimates topics and how their use varies across known groups, with uncertainty. |
| ScholarGateInsieme di dati ↗ |
|
|