So sánh phương pháp
Xem các phương pháp đã chọn cạnh nhau; những hàng khác biệt được làm nổi bật.
| Gap Statistic× | Chỉ số Calinski-Harabasz× | |
|---|---|---|
| Lĩnh vực | Đánh giá mô hình | Đánh giá mô hình |
| Họ | MCDM | MCDM |
| Năm ra đời≠ | 2001 | 1974 |
| Người khởi xướng≠ | Robert Tibshirani, Guenther Walther, Trevor Hastie | Tadeusz Calinski, Jerzy Harabasz |
| Loại≠ | Statistical criterion | Cluster quality metric |
| Công trình gốc≠ | Tibshirani, R., Walther, G., & Hastie, T. (2001). Estimating the number of clusters in a data set via the gap statistic. Journal of the Royal Statistical Society: Series B (Statistical Methodology), 63(2), 411-423. DOI ↗ | Calinski, T., & Harabasz, J. (1974). A dendrite method for cluster analysis. Communications in Statistics, 3(1), 1-27. DOI ↗ |
| Tên gọi khác≠ | gap index, Tibshirani gap statistic | variance ratio criterion, pseudo F-statistic, CH index |
| Liên quan | 5 | 5 |
| Tóm tắt≠ | The Gap Statistic, developed by Tibshirani, Walther, and Hastie in 2001, is a principled statistical method for determining the optimal number of clusters in a dataset. It compares the observed within-cluster sum of squares to the expected value under a null hypothesis of no clustering structure, providing a theoretically grounded approach to cluster number selection. | The Calinski-Harabasz Index, also called the Variance Ratio Criterion, was introduced by Calinski and Harabasz in 1974. It is a metric that measures the ratio of between-cluster variance to within-cluster variance, adjusted for the number of clusters and data points. Higher values indicate better-separated, more compact clusters. |
| ScholarGateBộ dữ liệu ↗ |
|
|