Compare methods
Review the selected methods side by side; rows that differ are marked with ≠.
| | Gap Statistic | Inertia |
|---|---|---|
| Field | Model evaluation | Model evaluation |
| Family | MCDM | MCDM |
| Year of origin ≠ | 2001 | 1967 |
| Originators ≠ | Robert Tibshirani, Guenther Walther, Trevor Hastie | Stuart Lloyd, James MacQueen |
| Type ≠ | Statistical criterion | Clustering quality metric |
| Seminal source ≠ | Tibshirani, R., Walther, G., & Hastie, T. (2001). Estimating the number of clusters in a data set via the gap statistic. Journal of the Royal Statistical Society: Series B (Statistical Methodology), 63(2), 411-423. | Lloyd, S. P. (1982). Least squares quantization in PCM. IEEE Transactions on Information Theory, 28(2), 129-137. |
| Aliases ≠ | gap index, Tibshirani gap statistic | WCSS, within-cluster sum of squares, cluster cohesion |
| Related | 5 | 5 |
| Summary ≠ | The Gap Statistic, developed by Tibshirani, Walther, and Hastie in 2001, is a statistical method for choosing the number of clusters in a dataset. It compares the observed within-cluster sum of squares with its expected value under a null hypothesis of no clustering structure, which gives cluster-number selection a theoretical grounding. | Inertia, also called Within-Cluster Sum of Squares (WCSS), is a measure of cluster cohesion that quantifies how tightly points are grouped around their cluster centroids. Lower values indicate more compact, cohesive clusters. Inertia is the objective function that k-means clustering minimizes and has been a fundamental metric since the method's introduction. |
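The two summaries can be made concrete with a small sketch. The code below is a minimal, standard-library-only illustration, not a reference implementation: `inertia` runs a basic Lloyd-style k-means and returns the within-cluster sum of squares, and `gap_statistic` compares the logarithm of that value with its average over uniform reference datasets drawn from the data's bounding box. The function names, the farthest-point initialisation, the number of reference sets, and the toy data in `demo_points` are all assumptions made for illustration. The standard-error rule that Tibshirani et al. use to pick the final number of clusters is omitted.

```python
import math
import random


def init_centroids(points, k, rng):
    """Farthest-point initialisation (an assumption; chosen to keep the sketch stable)."""
    centroids = [rng.choice(points)]
    while len(centroids) < k:
        # Next centroid: the point whose nearest chosen centroid is farthest away.
        centroids.append(
            max(points, key=lambda p: min(math.dist(p, c) for c in centroids))
        )
    return centroids


def inertia(points, k, rng, n_iter=50):
    """Run a basic Lloyd-style k-means and return the inertia (WCSS)."""
    centroids = init_centroids(points, k, rng)
    for _ in range(n_iter):
        # Assignment step: attach each point to its nearest centroid.
        clusters = [[] for _ in range(k)]
        for p in points:
            nearest = min(range(k), key=lambda j: math.dist(p, centroids[j]))
            clusters[nearest].append(p)
        # Update step: move each centroid to the mean of its members.
        updated = [
            tuple(sum(axis) / len(members) for axis in zip(*members))
            if members else centroids[j]
            for j, members in enumerate(clusters)
        ]
        if updated == centroids:
            break
        centroids = updated
    # Sum of squared distances from each point to its nearest centroid.
    return sum(min(math.dist(p, c) ** 2 for c in centroids) for p in points)


def gap_statistic(points, k, rng, n_refs=10):
    """Gap(k): mean log-inertia of uniform reference data minus log-inertia of the data.

    Assumes the inertia is strictly positive (k smaller than the number of distinct points).
    """
    lows = [min(axis) for axis in zip(*points)]
    highs = [max(axis) for axis in zip(*points)]
    log_w = math.log(inertia(points, k, rng))
    ref_logs = []
    for _ in range(n_refs):
        # Null reference: same size as the data, uniform over its bounding box.
        ref = [
            tuple(rng.uniform(lo, hi) for lo, hi in zip(lows, highs))
            for _ in points
        ]
        ref_logs.append(math.log(inertia(ref, k, rng)))
    return sum(ref_logs) / n_refs - log_w


def demo_points(rng, per_blob=30):
    """Hypothetical toy data: three well-separated 2-D Gaussian blobs."""
    centres = [(0.0, 0.0), (10.0, 0.0), (5.0, 9.0)]
    return [
        (rng.gauss(cx, 0.5), rng.gauss(cy, 0.5))
        for cx, cy in centres
        for _ in range(per_blob)
    ]
```

With `rng = random.Random(0)` and `pts = demo_points(rng)`, comparing `gap_statistic(pts, k, rng)` across several values of `k` should favour the three-cluster solution over a single cluster on this toy data. Inertia, by contrast, typically keeps shrinking as `k` grows, so it describes cohesion for a given `k` but cannot select `k` by itself.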