MCDMCluster Number Selection

Gap Statistic

Gap Statistic for Cluster Evaluation · Also known as: gap index, Tibshirani gap statistic

The Gap Statistic, developed by Tibshirani, Walther, and Hastie in 2001, is a principled statistical method for determining the optimal number of clusters in a dataset. It compares the observed within-cluster sum of squares to the expected value under a null hypothesis of no clustering structure, providing a theoretically grounded approach to cluster number selection.

Tools & resources

Download slides

Learn & explore

Read the full method

Members only

Method map

The neighbourhood of related methods — select a node to explore.

Gap Statistic

Calinski-Harabasz Index Davies-Bouldin Index Elbow Method Inertia (Within-Cluster…Silhouette Score Dunn Index

When to use it

Use the Gap Statistic when you want a statistically principled method for selecting cluster count that accounts for the null hypothesis of no structure. It works better than the Elbow Method when the true elbow is ambiguous. However, it is computationally more expensive due to reference dataset generation and assumes uniform null distribution, which may not hold for all data types.

Strengths & limitations

Strengths

Theoretically grounded in statistical inference
More objective than the Elbow Method; produces a numerical criterion
Works well when clustering structure is moderate to strong
Applicable to any clustering algorithm and distance metric

Limitations

Computationally expensive; requires generating and clustering many reference datasets
Assumes uniform null distribution, which may not suit all data
Can be sensitive to the number of reference datasets generated
May perform poorly when true clusters are non-convex or have very different sizes

Frequently asked

How many reference datasets should I generate?

Typically 100 to 500 reference datasets are sufficient. More datasets provide more stable variance estimates but increase computation. Start with 100 and increase if results seem unstable.

What if the Gap Statistic suggests k=1, meaning no clustering?

This indicates weak or absent clustering structure in your data. You may need to re-examine your data, try different clustering algorithms, or reconsider whether clustering is appropriate for your problem.

Can I use the Gap Statistic with non-Euclidean distances?

Yes, but generating appropriate reference datasets becomes more complex. For non-Euclidean distances, you need to generate reference data that respects your distance metric or use alternative null models.

How does the Gap Statistic compare to silhouette score?

The Gap Statistic is a statistical test for selecting k, while silhouette score evaluates the quality of a given clustering. They serve complementary purposes: use Gap Statistic to choose k, then validate with silhouette score.

Sources

Tibshirani, R., Walther, G., & Hastie, T. (2001). Estimating the number of clusters in a data set via the gap statistic. Journal of the Royal Statistical Society: Series B (Statistical Methodology), 63(2), 411-423. DOI: 10.1111/1467-9868.00293 ↗

How to cite this page

ScholarGate. (2026, June 3). Gap Statistic for Cluster Evaluation. ScholarGate. https://scholargate.app/en/model-evaluation/gap-statistic

Which method?

Set this method beside its closest kin and read them side by side — the library lays the books on the table; the choice is yours.

Calinski-Harabasz IndexModel Evaluation↔ compare
Davies-Bouldin IndexModel Evaluation↔ compare
Elbow MethodModel Evaluation↔ compare
Inertia (Within-Cluster Sum of Squares)Model Evaluation↔ compare
Silhouette ScoreModel Evaluation↔ compare

Compare side by side →

Referenced by

Calinski-Harabasz Index Davies-Bouldin Index Dunn Index Elbow Method Silhouette Score

Related reference concepts

K-Means Clustering Cluster Analysis Clustering Algorithms Model-Based Clustering Hierarchical Cluster Analysis Text Clustering

Spotted an issue on this page? Report or suggest a fix →

MCDMCluster Number Selection

Gap Statistic

Gap Statistic for Cluster Evaluation · Also known as: gap index, Tibshirani gap statistic

Tools & resources

Download slides

Learn & explore

Read the full method

Members only

Method map

The neighbourhood of related methods — select a node to explore.

Gap Statistic

Calinski-Harabasz Index Davies-Bouldin Index Elbow Method Inertia (Within-Cluster…Silhouette Score Dunn Index

When to use it

Strengths & limitations

Strengths

Theoretically grounded in statistical inference
More objective than the Elbow Method; produces a numerical criterion
Works well when clustering structure is moderate to strong
Applicable to any clustering algorithm and distance metric

Limitations

Computationally expensive; requires generating and clustering many reference datasets
Assumes uniform null distribution, which may not suit all data
Can be sensitive to the number of reference datasets generated
May perform poorly when true clusters are non-convex or have very different sizes

Frequently asked

How many reference datasets should I generate?

Typically 100 to 500 reference datasets are sufficient. More datasets provide more stable variance estimates but increase computation. Start with 100 and increase if results seem unstable.

What if the Gap Statistic suggests k=1, meaning no clustering?

Can I use the Gap Statistic with non-Euclidean distances?

How does the Gap Statistic compare to silhouette score?

Sources

Tibshirani, R., Walther, G., & Hastie, T. (2001). Estimating the number of clusters in a data set via the gap statistic. Journal of the Royal Statistical Society: Series B (Statistical Methodology), 63(2), 411-423. DOI: 10.1111/1467-9868.00293 ↗

How to cite this page

ScholarGate. (2026, June 3). Gap Statistic for Cluster Evaluation. ScholarGate. https://scholargate.app/en/model-evaluation/gap-statistic

Which method?

Set this method beside its closest kin and read them side by side — the library lays the books on the table; the choice is yours.

Calinski-Harabasz IndexModel Evaluation↔ compare
Davies-Bouldin IndexModel Evaluation↔ compare
Elbow MethodModel Evaluation↔ compare
Inertia (Within-Cluster Sum of Squares)Model Evaluation↔ compare
Silhouette ScoreModel Evaluation↔ compare

Compare side by side →

Referenced by

Calinski-Harabasz Index Davies-Bouldin Index Dunn Index Elbow Method Silhouette Score

Related reference concepts

K-Means Clustering Cluster Analysis Clustering Algorithms Model-Based Clustering Hierarchical Cluster Analysis Text Clustering

Spotted an issue on this page? Report or suggest a fix →

Gap Statistic

Read the full method

Method map

When to use it

Strengths & limitations

Frequently asked

Sources

How to cite this page

Which method?

Referenced by

Similar methods

Related reference concepts

Gap Statistic

Read the full method

Method map

When to use it

Strengths & limitations

Frequently asked

Sources

How to cite this page

Which method?

Referenced by

Similar methods

Related reference concepts

Gap Statistic

Read the full method

Method map

When to use it

Strengths & limitations

Frequently asked

Sources

How to cite this page

Related methods

Which method?

Referenced by

Similar methods

Related reference concepts

Gap Statistic

Read the full method

Method map

When to use it

Strengths & limitations

Frequently asked

Sources

How to cite this page

Related methods

Which method?

Referenced by

Similar methods

Related reference concepts