השוואת שיטות
סקרו את השיטות שבחרתם זו לצד זו; שורות שבהן יש הבדל מודגשות.
| K-Means ניתן להסבר× | DBSCAN× | עץ החלטה× | אשכול היררכי× | אשכול K-Means× | |
|---|---|---|---|---|---|
| תחום | למידת מכונה | למידת מכונה | למידת מכונה | למידת מכונה | למידת מכונה |
| משפחה | Machine learning | Machine learning | Machine learning | Machine learning | Machine learning |
| שנת המקור≠ | 2020 | 1996 | 1984 | 1963 | 1967 |
| הוגה השיטה≠ | Dasgupta, S.; Moshkovitz, M.; Frost, N.; Rashtchian, C. | Ester, M., Kriegel, H.-P., Sander, J. & Xu, X. | Breiman, Friedman, Olshen & Stone | Ward, J. H. | MacQueen, J. |
| סוג≠ | Explainable unsupervised clustering algorithm | Density-based clustering algorithm | Recursive partitioning (if-then rules) | Unsupervised clustering (agglomerative) | Partitional clustering (centroid-based) |
| מקור מכונן≠ | Dasgupta, S., Frost, N., Moshkovitz, M., & Rashtchian, C. (2020). Explainability of k-Means Clustering. Proceedings of the 37th International Conference on Machine Learning (ICML), PMLR 119. link ↗ | Ester, M., Kriegel, H.-P., Sander, J. & Xu, X. (1996). A Density-Based Algorithm for Discovering Clusters in Large Spatial Databases with Noise. Proceedings of the 2nd KDD, 226–231. link ↗ | Breiman, L., Friedman, J.H., Olshen, R.A. & Stone, C.J. (1984). Classification and Regression Trees. Wadsworth. DOI ↗ | Ward, J. H. (1963). Hierarchical Grouping to Optimize an Objective Function. Journal of the American Statistical Association, 58(301), 236–244. DOI ↗ | MacQueen, J. (1967). Some Methods for Classification and Analysis of Multivariate Observations. Proceedings of the 5th Berkeley Symposium on Mathematical Statistics and Probability, 1, 281–297. link ↗ |
| כינויים≠ | ExKMC, interpretable k-means, decision-tree k-means, explainable clustering | DBSCAN Kümeleme, density-based clustering, density-based spatial clustering | Karar Ağacı (Decision Tree), karar ağacı, classification tree, regression tree | Hiyerarşik Kümeleme, hiyerarşik kümeleme, agglomerative clustering, hierarchical agglomerative clustering | K-Ortalamalar Kümeleme, k-ortalamalar kümeleme, k-means, centroid clustering |
| קשורות≠ | 5 | 3 | 5 | 4 | 3 |
| תקציר≠ | Explainable K-Means is a post-hoc and in-model interpretability approach to standard K-Means clustering that replaces or approximates cluster assignments with a small axis-aligned decision tree. Each leaf of the tree corresponds to one cluster, and every data point is assigned to a cluster by following a simple sequence of threshold rules on individual features — making cluster membership fully transparent and human-readable. | DBSCAN is a density-based clustering algorithm, introduced by Ester, Kriegel, Sander and Xu in 1996, that groups together points lying in dense regions and flags points in sparse regions as noise. It is effective on noisy data and on clusters of irregular, non-spherical shapes. | A Decision Tree is an interpretable classification and regression method, formalised by Breiman, Friedman, Olshen and Stone in their 1984 CART framework, that partitions the data with hierarchical if-then rules. Each split sends observations down one branch or another until a prediction is read off the leaf. | Hierarchical clustering is an unsupervised method that groups observations into nested clusters and draws the result as a dendrogram, so the number of clusters need not be fixed in advance. Its agglomerative form rests on the objective-function grouping criterion introduced by Joe Ward in 1963. | K-Means Clustering is a centroid-based partitional clustering algorithm, traced to J. MacQueen in 1967, that splits data into k clusters by assigning each observation to its nearest cluster centre. It is widely used for marketing segmentation, customer grouping, and exploratory analysis. |
| ScholarGateמערך נתונים ↗ |
|
|
|
|
|