방법 비교
선택한 방법을 나란히 검토하세요. 서로 다른 행은 강조 표시됩니다.
| DBSCAN× | 결정 트리× | 계층적 군집화× | K-평균 군집화× | |
|---|---|---|---|---|
| 분야 | 머신러닝 | 머신러닝 | 머신러닝 | 머신러닝 |
| 계열 | Machine learning | Machine learning | Machine learning | Machine learning |
| 기원 연도≠ | 1996 | 1984 | 1963 | 1967 |
| 창시자≠ | Ester, M., Kriegel, H.-P., Sander, J. & Xu, X. | Breiman, Friedman, Olshen & Stone | Ward, J. H. | MacQueen, J. |
| 유형≠ | Density-based clustering algorithm | Recursive partitioning (if-then rules) | Unsupervised clustering (agglomerative) | Partitional clustering (centroid-based) |
| 원전≠ | Ester, M., Kriegel, H.-P., Sander, J. & Xu, X. (1996). A Density-Based Algorithm for Discovering Clusters in Large Spatial Databases with Noise. Proceedings of the 2nd KDD, 226–231. link ↗ | Breiman, L., Friedman, J.H., Olshen, R.A. & Stone, C.J. (1984). Classification and Regression Trees. Wadsworth. DOI ↗ | Ward, J. H. (1963). Hierarchical Grouping to Optimize an Objective Function. Journal of the American Statistical Association, 58(301), 236–244. DOI ↗ | MacQueen, J. (1967). Some Methods for Classification and Analysis of Multivariate Observations. Proceedings of the 5th Berkeley Symposium on Mathematical Statistics and Probability, 1, 281–297. link ↗ |
| 별칭≠ | DBSCAN Kümeleme, density-based clustering, density-based spatial clustering | Karar Ağacı (Decision Tree), karar ağacı, classification tree, regression tree | Hiyerarşik Kümeleme, hiyerarşik kümeleme, agglomerative clustering, hierarchical agglomerative clustering | K-Ortalamalar Kümeleme, k-ortalamalar kümeleme, k-means, centroid clustering |
| 관련≠ | 3 | 5 | 4 | 3 |
| 요약≠ | DBSCAN is a density-based clustering algorithm, introduced by Ester, Kriegel, Sander and Xu in 1996, that groups together points lying in dense regions and flags points in sparse regions as noise. It is effective on noisy data and on clusters of irregular, non-spherical shapes. | A Decision Tree is an interpretable classification and regression method, formalised by Breiman, Friedman, Olshen and Stone in their 1984 CART framework, that partitions the data with hierarchical if-then rules. Each split sends observations down one branch or another until a prediction is read off the leaf. | Hierarchical clustering is an unsupervised method that groups observations into nested clusters and draws the result as a dendrogram, so the number of clusters need not be fixed in advance. Its agglomerative form rests on the objective-function grouping criterion introduced by Joe Ward in 1963. | K-Means Clustering is a centroid-based partitional clustering algorithm, traced to J. MacQueen in 1967, that splits data into k clusters by assigning each observation to its nearest cluster centre. It is widely used for marketing segmentation, customer grouping, and exploratory analysis. |
| ScholarGate데이터셋 ↗ |
|
|
|
|