방법 비교
선택한 방법을 나란히 검토하세요. 서로 다른 행은 강조 표시됩니다.
| 디리클레 과정 혼합 모형× | 베이즈 회귀× | 잠재 디리클레 할당 (Latent Dirichlet Allocation, LDA)× | |
|---|---|---|---|
| 분야≠ | 베이지안 | 베이지안 | 머신러닝 |
| 계열≠ | Bayesian methods | Bayesian methods | Latent structure |
| 기원 연도≠ | 1973 | — | 2003 |
| 창시자≠ | Ferguson (1973); mixture model formulation by Lo (1984) | — | Blei, D. M.; Ng, A. Y.; Jordan, M. I. |
| 유형≠ | Nonparametric Bayesian mixture model | Bayesian linear model | Generative probabilistic topic model (three-level hierarchical Bayesian) |
| 원전≠ | Ferguson, T. S. (1973). A Bayesian analysis of some nonparametric problems. The Annals of Statistics, 1(2), 209–230. DOI ↗ | Gelman, A., Carlin, J. B., Stern, H. S., Dunson, D. B., Vehtari, A. & Rubin, D. B. (2013). Bayesian Data Analysis (3rd ed.). CRC Press. ISBN: 978-1439840955 | Blei, D. M., Ng, A. Y., & Jordan, M. I. (2003). Latent Dirichlet allocation. Journal of Machine Learning Research, 3, 993–1022. DOI ↗ |
| 별칭≠ | DPMM, DP mixture model, infinite mixture model, Dirichlet process mixture | bayesian linear regression, probabilistic regression, bayesian regresyon | LDA, topic model, Blei-Ng-Jordan model, probabilistic topic modeling |
| 관련≠ | 3 | 2 | 3 |
| 요약≠ | The Dirichlet Process Mixture Model (DPMM) is a nonparametric Bayesian clustering method introduced through Ferguson's (1973) Dirichlet process prior that places a probability distribution over distributions. Unlike finite mixture models, the DPMM does not require the analyst to specify the number of clusters in advance; instead it infers the number of components from the data, allowing an effectively unbounded mixture that grows as more observations arrive. | Bayesian regression is a probabilistic version of linear regression that treats the model parameters as uncertain quantities. Instead of returning a single best-fit estimate, it combines prior knowledge with the observed data to produce a full posterior probability distribution for each parameter, from which credible intervals and predictions are read off. | Latent Dirichlet Allocation (LDA) is a generative probabilistic model for collections of discrete data, introduced by Blei, Ng, and Jordan in 2003. It treats each document as a mixture of latent topics and each topic as a probability distribution over words, enabling unsupervised discovery of thematic structure across large text corpora. It is one of the most cited papers in machine learning and natural language processing. |
| ScholarGate데이터셋 ↗ |
|
|
|