Módszerek összehasonlítása
Tekintse át a kiválasztott módszereket egymás mellett; az eltérő sorok kiemelve jelennek meg.
| Hierarchikus klaszterezés× | Főkomponens-analízis× | UMAP× | |
|---|---|---|---|
| Tudományterület | Gépi tanulás | Gépi tanulás | Gépi tanulás |
| Módszercsalád | Machine learning | Machine learning | Machine learning |
| Keletkezés éve≠ | 1963 | 2002 | 2018 |
| Megalkotó≠ | Ward, J. H. | Jolliffe, I.T. (textbook); Pearson & Hotelling (origins) | McInnes, L.; Healy, J.; Melville, J. |
| Típus≠ | Unsupervised clustering (agglomerative) | Unsupervised dimensionality reduction | Nonlinear manifold-learning dimension reduction |
| Alapmű≠ | Ward, J. H. (1963). Hierarchical Grouping to Optimize an Objective Function. Journal of the American Statistical Association, 58(301), 236–244. DOI ↗ | Jolliffe, I.T. (2002). Principal Component Analysis (2nd ed.). Springer. DOI ↗ | McInnes, L., Healy, J. & Melville, J. (2018). UMAP: Uniform Manifold Approximation and Projection for Dimension Reduction. arXiv:1802.03426. link ↗ |
| Alternatív nevek≠ | Hiyerarşik Kümeleme, hiyerarşik kümeleme, agglomerative clustering, hierarchical agglomerative clustering | Temel Bileşenler Analizi (PCA), PCA, principal components analysis, Karhunen-Loève transform | UMAP (Uniform Manifold Approximation and Projection), uniform manifold approximation and projection, manifold dimension reduction |
| Kapcsolódó≠ | 4 | 3 | 5 |
| Összefoglaló≠ | Hierarchical clustering is an unsupervised method that groups observations into nested clusters and draws the result as a dendrogram, so the number of clusters need not be fixed in advance. Its agglomerative form rests on the objective-function grouping criterion introduced by Joe Ward in 1963. | Principal Component Analysis (PCA) is an unsupervised dimensionality-reduction method — given its modern textbook treatment by Ian Jolliffe (2002) — that compresses high-dimensional data into fewer dimensions while preserving the maximum possible variance. It re-expresses correlated variables as a small set of uncorrelated principal components ordered by how much of the data's variation each one captures. | UMAP (Uniform Manifold Approximation and Projection) is a fast, scalable nonlinear dimension-reduction method grounded in manifold-learning theory, introduced by McInnes, Healy and Melville in 2018. It compresses high-dimensional data into a low-dimensional embedding for visualisation and downstream analysis. |
| ScholarGateAdatkészlet ↗ |
|
|
|