ScholarGate
アシスタント

手法を比較

選択した手法を並べて確認できます。異なる行はハイライト表示されます。

潜在的ディリクレ配分法 (LDA)×K平均法クラスタリング×非負値行列因子分解 (NMF)×Word2Vec×
分野機械学習機械学習機械学習テキストマイニング
系統Latent structureMachine learningLatent structureProcess / pipeline
提唱年2003196719992013
提唱者Blei, D. M.; Ng, A. Y.; Jordan, M. I.MacQueen, J.Lee, D. D. & Seung, H. S.Tomas Mikolov et al.
種類Generative probabilistic topic model (three-level hierarchical Bayesian)Partitional clustering (centroid-based)Matrix decomposition with non-negativity constraintsNeural word-embedding model
原典Blei, D. M., Ng, A. Y., & Jordan, M. I. (2003). Latent Dirichlet allocation. Journal of Machine Learning Research, 3, 993–1022. DOI ↗MacQueen, J. (1967). Some Methods for Classification and Analysis of Multivariate Observations. Proceedings of the 5th Berkeley Symposium on Mathematical Statistics and Probability, 1, 281–297. link ↗Lee, D. D., & Seung, H. S. (1999). Learning the parts of objects by non-negative matrix factorization. Nature, 401(6755), 788–791. DOI ↗Mikolov, T., Chen, K., Corrado, G. & Dean, J. (2013). Efficient Estimation of Word Representations in Vector Space. link ↗
別名LDA, topic model, Blei-Ng-Jordan model, probabilistic topic modelingK-Ortalamalar Kümeleme, k-ortalamalar kümeleme, k-means, centroid clusteringNMF, NNMF, nonnegative matrix factorization, non-negative matrix approximationword embeddings, skip-gram, continuous bag-of-words, Word2Vec Kelime Gömülmeleri
関連3344
概要Latent Dirichlet Allocation (LDA) is a generative probabilistic model for collections of discrete data, introduced by Blei, Ng, and Jordan in 2003. It treats each document as a mixture of latent topics and each topic as a probability distribution over words, enabling unsupervised discovery of thematic structure across large text corpora. It is one of the most cited papers in machine learning and natural language processing.K-Means Clustering is a centroid-based partitional clustering algorithm, traced to J. MacQueen in 1967, that splits data into k clusters by assigning each observation to its nearest cluster centre. It is widely used for marketing segmentation, customer grouping, and exploratory analysis.Non-negative Matrix Factorization (NMF) is a family of algorithms, introduced by Lee and Seung in their landmark 1999 Nature paper, that decomposes a non-negative data matrix V into the product of two lower-rank non-negative matrices W (basis components) and H (encoding coefficients). Unlike PCA or SVD, the non-negativity constraint forces the algorithm to learn strictly additive, parts-based representations, making the factors directly interpretable as building blocks of the original data.Word2Vec is a neural word-embedding technique introduced by Mikolov and colleagues in 2013 that maps each word in a text corpus to a dense numeric vector. Words that appear in similar contexts end up close together in the vector space, so the embeddings capture semantic similarity that can be measured arithmetically.
ScholarGateデータセット
  1. v1
  2. 3 出典
  3. PUBLISHED
  1. v1
  2. 1 出典
  3. PUBLISHED
  1. v1
  2. 3 出典
  3. PUBLISHED
  1. v1
  2. 1 出典
  3. PUBLISHED

検索へ スライドをダウンロード

ScholarGate手法を比較: Latent Dirichlet Allocation · K-Means Clustering · Non-negative Matrix Factorization · Word2Vec. 2026-06-18に以下より取得 https://scholargate.app/ja/compare