Võrdle meetodeid

Vaata valitud meetodeid kõrvuti; erinevad read on esile tõstetud.

	Latent Dirichlet Allocation (LDA)×	Word2Vec ×
Valdkond≠	Masinõpe	Tekstikaeve
Perekond≠	Latent structure	Process / pipeline
Tekkeaasta≠	2003	2013
Looja≠	Blei, D. M.; Ng, A. Y.; Jordan, M. I.	Tomas Mikolov et al.
Tüüp≠	Generative probabilistic topic model (three-level hierarchical Bayesian)	Neural word-embedding model
Algallikas≠	Blei, D. M., Ng, A. Y., & Jordan, M. I. (2003). Latent Dirichlet allocation. Journal of Machine Learning Research, 3, 993–1022. DOI ↗	Mikolov, T., Chen, K., Corrado, G. & Dean, J. (2013). Efficient Estimation of Word Representations in Vector Space. link ↗
Rööpnimetused≠	LDA, topic model, Blei-Ng-Jordan model, probabilistic topic modeling	word embeddings, skip-gram, continuous bag-of-words, Word2Vec Kelime Gömülmeleri
Seotud≠	3	4
Kokkuvõte≠	Latent Dirichlet Allocation (LDA) is a generative probabilistic model for collections of discrete data, introduced by Blei, Ng, and Jordan in 2003. It treats each document as a mixture of latent topics and each topic as a probability distribution over words, enabling unsupervised discovery of thematic structure across large text corpora. It is one of the most cited papers in machine learning and natural language processing.	Word2Vec is a neural word-embedding technique introduced by Mikolov and colleagues in 2013 that maps each word in a text corpus to a dense numeric vector. Words that appear in similar contexts end up close together in the vector space, so the embeddings capture semantic similarity that can be measured arithmetically.
ScholarGateAndmestik ↗	v1 3 Allikad PUBLISHED	v1 1 Allikad PUBLISHED

Mine otsingusse → Laadi slaidid alla