ScholarGate
Asistent

Porovnat metody

Prohlédněte si vybrané metody vedle sebe; řádky, které se liší, jsou zvýrazněny.

Statistika mezery (Gap Statistic)×Calinski-Harabaszův index×Setrvačnost×
OborHodnocení modelůHodnocení modelůHodnocení modelů
RodinaMCDMMCDMMCDM
Rok vzniku200119741967
TvůrceRobert Tibshirani, Guenther Walther, Trevor HastieTadeusz Calinski, Jerzy HarabaszStuart Lloyd, James MacQueen
TypStatistical criterionCluster quality metricClustering quality metric
Původní zdrojTibshirani, R., Walther, G., & Hastie, T. (2001). Estimating the number of clusters in a data set via the gap statistic. Journal of the Royal Statistical Society: Series B (Statistical Methodology), 63(2), 411-423. DOI ↗Calinski, T., & Harabasz, J. (1974). A dendrite method for cluster analysis. Communications in Statistics, 3(1), 1-27. DOI ↗Lloyd, S. P. (1982). Least squares quantization in PCM. IEEE Transactions on Information Theory, 28(2), 129-137. DOI ↗
Další názvygap index, Tibshirani gap statisticvariance ratio criterion, pseudo F-statistic, CH indexWCSS, within-cluster sum of squares, cluster cohesion
Příbuzné555
ShrnutíThe Gap Statistic, developed by Tibshirani, Walther, and Hastie in 2001, is a principled statistical method for determining the optimal number of clusters in a dataset. It compares the observed within-cluster sum of squares to the expected value under a null hypothesis of no clustering structure, providing a theoretically grounded approach to cluster number selection.The Calinski-Harabasz Index, also called the Variance Ratio Criterion, was introduced by Calinski and Harabasz in 1974. It is a metric that measures the ratio of between-cluster variance to within-cluster variance, adjusted for the number of clusters and data points. Higher values indicate better-separated, more compact clusters.Inertia, also called Within-Cluster Sum of Squares (WCSS), is a measure of cluster cohesion that quantifies how tightly points are grouped around their cluster centroids. Lower values indicate more compact, cohesive clusters. Inertia is the primary objective function for k-means clustering and has been a fundamental metric since the method's introduction.
ScholarGateDatová sada
  1. v1
  2. 1 Zdroje
  3. PUBLISHED
  1. v1
  2. 1 Zdroje
  3. PUBLISHED
  1. v1
  2. 2 Zdroje
  3. PUBLISHED

Přejít na hledání Stáhnout prezentaci

ScholarGatePorovnat metody: Gap Statistic · Calinski-Harabasz Index · Inertia (Within-Cluster Sum of Squares). Získáno 2026-06-20 z https://scholargate.app/cs/compare