방법 비교
선택한 방법을 나란히 검토하세요. 서로 다른 행은 강조 표시됩니다.
| Topic Modeling for Communication Research× | Dictionary-Based Text Analysis× | |
|---|---|---|
| 분야 | Communication | Communication |
| 계열≠ | Machine learning | Process / pipeline |
| 기원 연도 | 2003 | 2003 |
| 창시자≠ | David Blei et al. (LDA); Roberts, Stewart & Tingley (STM) | Lexicon tradition (Pennebaker LIWC; General Inquirer) |
| 유형≠ | Unsupervised probabilistic model of latent themes in document collections | Word-count text measurement against predefined category dictionaries |
| 원전≠ | Blei, D. M., Ng, A. Y., & Jordan, M. I. (2003). Latent Dirichlet allocation. Journal of Machine Learning Research, 3, 993–1022. link ↗ | Pennebaker, J. W., Mehl, M. R., & Niederhoffer, K. G. (2003). Psychological aspects of natural language use: Our words, our selves. Annual Review of Psychology, 54, 547–577. DOI ↗ |
| 별칭 | LDA for communication, Structural topic modeling in communication, Topic models for media texts, İletişim Araştırmaları için Konu Modelleme | Lexicon-based text analysis, Word-count text analysis, Dictionary method for content analysis, Sözlük Tabanlı Metin Analizi |
| 관련≠ | 3 | 4 |
| 요약≠ | Topic modeling is an unsupervised technique for discovering the latent themes that run through a large collection of documents, representing each document as a mixture of topics and each topic as a distribution over words. In communication research it surfaces the issues, frames, and themes in news archives, social media, and political text at a scale no manual reading can match, with Latent Dirichlet Allocation (LDA) and the Structural Topic Model (STM) as the dominant variants. | Dictionary-based text analysis measures concepts in text by counting how often words belonging to predefined category lists — dictionaries — appear in each document. It is the workhorse lexicon method behind tools like LIWC and the General Inquirer, prized for its transparency and scalability: a category score is simply the share of a document's words that match the category's word list. |
| ScholarGate데이터셋 ↗ |
|
|