ScholarGate
Асистент

Сравнение на методи

Прегледайте избраните методи един до друг; редовете с разлики са откроени.

Gap Statistic×Индекс на Калински-Харабаш×Инерция×
ОбластОценка на моделиОценка на моделиОценка на модели
СемействоMCDMMCDMMCDM
Година на възникване200119741967
СъздателRobert Tibshirani, Guenther Walther, Trevor HastieTadeusz Calinski, Jerzy HarabaszStuart Lloyd, James MacQueen
ТипStatistical criterionCluster quality metricClustering quality metric
Основополагащ източникTibshirani, R., Walther, G., & Hastie, T. (2001). Estimating the number of clusters in a data set via the gap statistic. Journal of the Royal Statistical Society: Series B (Statistical Methodology), 63(2), 411-423. DOI ↗Calinski, T., & Harabasz, J. (1974). A dendrite method for cluster analysis. Communications in Statistics, 3(1), 1-27. DOI ↗Lloyd, S. P. (1982). Least squares quantization in PCM. IEEE Transactions on Information Theory, 28(2), 129-137. DOI ↗
Други названияgap index, Tibshirani gap statisticvariance ratio criterion, pseudo F-statistic, CH indexWCSS, within-cluster sum of squares, cluster cohesion
Свързани555
РезюмеThe Gap Statistic, developed by Tibshirani, Walther, and Hastie in 2001, is a principled statistical method for determining the optimal number of clusters in a dataset. It compares the observed within-cluster sum of squares to the expected value under a null hypothesis of no clustering structure, providing a theoretically grounded approach to cluster number selection.The Calinski-Harabasz Index, also called the Variance Ratio Criterion, was introduced by Calinski and Harabasz in 1974. It is a metric that measures the ratio of between-cluster variance to within-cluster variance, adjusted for the number of clusters and data points. Higher values indicate better-separated, more compact clusters.Inertia, also called Within-Cluster Sum of Squares (WCSS), is a measure of cluster cohesion that quantifies how tightly points are grouped around their cluster centroids. Lower values indicate more compact, cohesive clusters. Inertia is the primary objective function for k-means clustering and has been a fundamental metric since the method's introduction.
ScholarGateНабор от данни
  1. v1
  2. 1 Източници
  3. PUBLISHED
  1. v1
  2. 1 Източници
  3. PUBLISHED
  1. v1
  2. 2 Източници
  3. PUBLISHED

Към търсенето Изтегляне на слайдове

ScholarGateСравнение на методи: Gap Statistic · Calinski-Harabasz Index · Inertia (Within-Cluster Sum of Squares). Извлечено на 2026-06-20 от https://scholargate.app/bg/compare