방법 비교

선택한 방법을 나란히 검토하세요. 서로 다른 행은 강조 표시됩니다.

	Topic Modeling for Communication Research ×	Dictionary-Based Text Analysis ×
분야	Communication	Communication
계열≠	Machine learning	Process / pipeline
기원 연도	2003	2003
창시자≠	David Blei et al. (LDA); Roberts, Stewart & Tingley (STM)	Lexicon tradition (Pennebaker LIWC; General Inquirer)
유형≠	Unsupervised probabilistic model of latent themes in document collections	Word-count text measurement against predefined category dictionaries
원전≠	Blei, D. M., Ng, A. Y., & Jordan, M. I. (2003). Latent Dirichlet allocation. Journal of Machine Learning Research, 3, 993–1022. link ↗	Pennebaker, J. W., Mehl, M. R., & Niederhoffer, K. G. (2003). Psychological aspects of natural language use: Our words, our selves. Annual Review of Psychology, 54, 547–577. DOI ↗
별칭	LDA for communication, Structural topic modeling in communication, Topic models for media texts, İletişim Araştırmaları için Konu Modelleme	Lexicon-based text analysis, Word-count text analysis, Dictionary method for content analysis, Sözlük Tabanlı Metin Analizi
관련≠	3	4
요약≠	Topic modeling is an unsupervised technique for discovering the latent themes that run through a large collection of documents, representing each document as a mixture of topics and each topic as a distribution over words. In communication research it surfaces the issues, frames, and themes in news archives, social media, and political text at a scale no manual reading can match, with Latent Dirichlet Allocation (LDA) and the Structural Topic Model (STM) as the dominant variants.	Dictionary-based text analysis measures concepts in text by counting how often words belonging to predefined category lists — dictionaries — appear in each document. It is the workhorse lexicon method behind tools like LIWC and the General Inquirer, prized for its transparency and scalability: a category score is simply the share of a document's words that match the category's word list.
ScholarGate데이터셋 ↗	v1 3 출처 PUBLISHED	v1 2 출처 PUBLISHED

검색으로 이동 → 슬라이드 다운로드