방법 비교

선택한 방법을 나란히 검토하세요. 서로 다른 행은 강조 표시됩니다.

	잠재 디리클레 할당 (Latent Dirichlet Allocation, LDA)×	Word2Vec ×
분야≠	머신러닝	텍스트 마이닝
계열≠	Latent structure	Process / pipeline
기원 연도≠	2003	2013
창시자≠	Blei, D. M.; Ng, A. Y.; Jordan, M. I.	Tomas Mikolov et al.
유형≠	Generative probabilistic topic model (three-level hierarchical Bayesian)	Neural word-embedding model
원전≠	Blei, D. M., Ng, A. Y., & Jordan, M. I. (2003). Latent Dirichlet allocation. Journal of Machine Learning Research, 3, 993–1022. DOI ↗	Mikolov, T., Chen, K., Corrado, G. & Dean, J. (2013). Efficient Estimation of Word Representations in Vector Space. link ↗
별칭≠	LDA, topic model, Blei-Ng-Jordan model, probabilistic topic modeling	word embeddings, skip-gram, continuous bag-of-words, Word2Vec Kelime Gömülmeleri
관련≠	3	4
요약≠	Latent Dirichlet Allocation (LDA) is a generative probabilistic model for collections of discrete data, introduced by Blei, Ng, and Jordan in 2003. It treats each document as a mixture of latent topics and each topic as a probability distribution over words, enabling unsupervised discovery of thematic structure across large text corpora. It is one of the most cited papers in machine learning and natural language processing.	Word2Vec is a neural word-embedding technique introduced by Mikolov and colleagues in 2013 that maps each word in a text corpus to a dense numeric vector. Words that appear in similar contexts end up close together in the vector space, so the embeddings capture semantic similarity that can be measured arithmetically.
ScholarGate데이터셋 ↗	v1 3 출처 PUBLISHED	v1 1 출처 PUBLISHED

검색으로 이동 → 슬라이드 다운로드