手法を比較

選択した手法を並べて確認できます。異なる行はハイライト表示されます。

	Calinski-Harabasz Index（キャリンスキー・ハラバス指数）×	デイビス・ボールディン指数 ×	Dunn Index ×	Gap Statistic ×	慣性 ×
分野	モデル評価	モデル評価	モデル評価	モデル評価	モデル評価
系統	MCDM	MCDM	MCDM	MCDM	MCDM
提唱年≠	1974	1979	1974	2001	1967
提唱者≠	Tadeusz Calinski, Jerzy Harabasz	David L. Davies, Donald W. Bouldin	Joseph C. Dunn	Robert Tibshirani, Guenther Walther, Trevor Hastie	Stuart Lloyd, James MacQueen
種類≠	Cluster quality metric	Cluster quality metric	Cluster quality metric	Statistical criterion	Clustering quality metric
原典≠	Calinski, T., & Harabasz, J. (1974). A dendrite method for cluster analysis. Communications in Statistics, 3(1), 1-27. DOI ↗	Davies, D. L., & Bouldin, D. W. (1979). A cluster separation measure. IEEE Transactions on Pattern Analysis and Machine Intelligence, 1(2), 224-227. DOI ↗	Dunn, J. C. (1974). Well-separated clusters and optimal fuzzy partitions. Journal of Cybernetics, 4(1), 95-104. DOI ↗	Tibshirani, R., Walther, G., & Hastie, T. (2001). Estimating the number of clusters in a data set via the gap statistic. Journal of the Royal Statistical Society: Series B (Statistical Methodology), 63(2), 411-423. DOI ↗	Lloyd, S. P. (1982). Least squares quantization in PCM. IEEE Transactions on Information Theory, 28(2), 129-137. DOI ↗
別名≠	variance ratio criterion, pseudo F-statistic, CH index	DBI, Davies Bouldin index	Dunn's index, separation coefficient	gap index, Tibshirani gap statistic	WCSS, within-cluster sum of squares, cluster cohesion
関連	5	5	5	5	5
概要≠	The Calinski-Harabasz Index, also called the Variance Ratio Criterion, was introduced by Calinski and Harabasz in 1974. It is a metric that measures the ratio of between-cluster variance to within-cluster variance, adjusted for the number of clusters and data points. Higher values indicate better-separated, more compact clusters.	The Davies-Bouldin Index, introduced by Davies and Bouldin in 1979, is a metric for evaluating clustering quality based on the average similarity between each cluster and its most similar neighboring cluster. Lower values indicate better clustering, with a minimum of 0 representing perfectly separated, non-overlapping clusters.	The Dunn Index, introduced by Joseph C. Dunn in 1974, is a metric that captures cluster quality by measuring the ratio of the minimum between-cluster distance to the maximum within-cluster diameter. Higher values indicate well-separated and compact clusters, with better clustering quality.	The Gap Statistic, developed by Tibshirani, Walther, and Hastie in 2001, is a principled statistical method for determining the optimal number of clusters in a dataset. It compares the observed within-cluster sum of squares to the expected value under a null hypothesis of no clustering structure, providing a theoretically grounded approach to cluster number selection.	Inertia, also called Within-Cluster Sum of Squares (WCSS), is a measure of cluster cohesion that quantifies how tightly points are grouped around their cluster centroids. Lower values indicate more compact, cohesive clusters. Inertia is the primary objective function for k-means clustering and has been a fundamental metric since the method's introduction.
ScholarGateデータセット ↗	v1 1 出典 PUBLISHED	v1 1 出典 PUBLISHED	v1 1 出典 PUBLISHED	v1 1 出典 PUBLISHED	v1 2 出典 PUBLISHED

検索へ → スライドをダウンロード