Сравнение методов

Просматривайте выбранные методы рядом; строки с различиями подсвечены.

	Structural Topic Model ×	Supervised Text Classification ×
Область	Political Science	Political Science
Семейство	Process / pipeline	Process / pipeline
Год появления≠	2014	2013
Автор метода≠	Margaret Roberts, Brandon Stewart & Dustin Tingley	Machine-learning classification tradition (formalized for political text by Grimmer & Stewart; category-proportion variant by Hopkins & King)
Тип≠	Mixed-membership topic model with document-level covariates	Supervised machine-learning classification of documents
Основополагающий источник≠	Roberts, M. E., Stewart, B. M., Tingley, D., Lucas, C., Leder-Luis, J., Gadarian, S. K., Albertson, B., & Rand, D. G. (2014). Structural Topic Models for Open-Ended Survey Responses. American Journal of Political Science, 58(4), 1064–1082. DOI ↗	Grimmer, J., & Stewart, B. M. (2013). Text as Data: The Promise and Pitfalls of Automatic Content Analysis Methods for Political Texts. Political Analysis, 21(3), 267–297. DOI ↗
Другие названия	STM, Structural topic modeling, Covariate-aware topic model, Topic model with metadata	Supervised document classification, Text categorization, Automated text coding, Supervised content analysis
Связанные	5	5
Сводка≠	The Structural Topic Model (STM) is a text-as-data method that discovers latent themes in a corpus while letting document metadata — party, time, gender, treatment condition — shape those themes. Introduced by Roberts, Stewart, Tingley and colleagues in 2014, it generalizes correlated topic modeling so that topic prevalence (how much a document is about a topic) and topic content (the words used to express a topic) can both depend on covariates. The result is a single model that simultaneously estimates topics and how their use varies across known groups, with uncertainty.	Supervised text classification trains a statistical model on documents that humans have hand-labeled, then uses it to assign categories — topic, tone, position, relevance — to the much larger set of unlabeled documents. Unlike dictionary methods, which apply a fixed word list, a supervised classifier learns from examples which textual features predict each category, so it can capture context-dependent and non-obvious cues. Grimmer and Stewart present it as a core text-as-data workflow, and a key insight is that for many political-science questions the goal is not perfect document-by-document labels but accurate estimates of category proportions across a corpus.
ScholarGateНабор данных ↗	v1 3 Источники PUBLISHED	v1 2 Источники PUBLISHED

Перейти к поиску → Скачать слайды