Method comparison

Compare the selected methods side by side.

	Latent Dirichlet Allocation (LDA)	Word2Vec
Field	Machine learning	Text mining
Family	Latent structure	Process / pipeline
Year introduced	2003	2013
Method authors	Blei, D. M.; Ng, A. Y.; Jordan, M. I.	Tomas Mikolov et al.
Type	Generative probabilistic topic model (three-level hierarchical Bayesian)	Neural word-embedding model
Foundational source	Blei, D. M., Ng, A. Y., & Jordan, M. I. (2003). Latent Dirichlet allocation. Journal of Machine Learning Research, 3, 993–1022.	Mikolov, T., Chen, K., Corrado, G., & Dean, J. (2013). Efficient Estimation of Word Representations in Vector Space.
Other names	LDA, topic model, Blei-Ng-Jordan model, probabilistic topic modeling	word embeddings, skip-gram, continuous bag-of-words, Word2Vec Kelime Gömülmeleri
Related	3	4
Summary	Latent Dirichlet Allocation (LDA) is a generative probabilistic model for collections of discrete data, introduced by Blei, Ng, and Jordan in 2003. It treats each document as a mixture of latent topics and each topic as a probability distribution over words, enabling unsupervised discovery of thematic structure across large text corpora. The original paper is one of the most cited in machine learning and natural language processing.	Word2Vec is a neural word-embedding technique introduced by Mikolov and colleagues in 2013 that maps each word in a text corpus to a dense numeric vector. Words that appear in similar contexts end up close together in the vector space, so semantic similarity can be measured with vector arithmetic, for example cosine similarity.
ScholarGate dataset	v1, 3 sources, PUBLISHED	v1, 1 source, PUBLISHED
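
The two summaries describe different mechanics, which a small sketch may make more concrete. The Python below is illustrative only and is not taken from either paper's code: the vocabulary, the two topic distributions, the `alpha` values, and the 3-dimensional "embeddings" are all hypothetical toy values, not trained parameters. The first part follows the LDA generative story (draw a topic mixture for a document, then draw each word from a topic chosen by that mixture); the second part measures closeness of word vectors with cosine similarity, as one would with Word2Vec vectors.

```python
import math
import random

# Hypothetical toy vocabulary and topic-word distributions (each row sums to 1).
VOCAB = ["gene", "cell", "dna", "ball", "team", "goal"]
TOPICS = [
    [0.40, 0.30, 0.28, 0.01, 0.005, 0.005],  # a "biology"-like topic
    [0.005, 0.005, 0.01, 0.30, 0.38, 0.30],  # a "sports"-like topic
]


def sample_dirichlet(alpha, rng):
    """Draw from a Dirichlet by normalizing independent Gamma(alpha_i, 1) draws."""
    draws = [rng.gammavariate(a, 1.0) for a in alpha]
    total = sum(draws)
    return [d / total for d in draws]


def generate_document(n_words, alpha, rng):
    """LDA generative story for one document.

    1. Draw the document's topic mixture theta from Dirichlet(alpha).
    2. For each word position, draw a topic from theta,
       then draw a word from that topic's word distribution.
    """
    theta = sample_dirichlet(alpha, rng)
    words = []
    for _ in range(n_words):
        topic = rng.choices(range(len(TOPICS)), weights=theta)[0]
        word = rng.choices(VOCAB, weights=TOPICS[topic])[0]
        words.append(word)
    return theta, words


def cosine_similarity(u, v):
    """Cosine of the angle between two equal-length vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return dot / (norm_u * norm_v)


# Hypothetical 3-dimensional vectors standing in for trained embeddings.
EMBEDDINGS = {
    "king": [0.9, 0.8, 0.1],
    "queen": [0.85, 0.75, 0.2],
    "apple": [0.1, 0.2, 0.9],
}

if __name__ == "__main__":
    rng = random.Random(0)  # fixed seed so the run is repeatable
    theta, words = generate_document(10, alpha=[0.5, 0.5], rng=rng)
    print("topic mixture:", [round(t, 3) for t in theta])
    print("document:", " ".join(words))
    print("king~queen:", round(cosine_similarity(EMBEDDINGS["king"], EMBEDDINGS["queen"]), 3))
    print("king~apple:", round(cosine_similarity(EMBEDDINGS["king"], EMBEDDINGS["apple"]), 3))
```

In practice LDA is used in the opposite direction (inferring the topic mixtures and topic-word distributions from observed documents), and Word2Vec vectors are learned from a corpus rather than written by hand; the sketch only shows the structure each summary refers to.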
