Benchmarking of Clustering Validity Measures Revisited

Connor Simpson,Ricardo J. G. B. Campello,Elizabeth Stojanovski

Published 2025 in Statistical analysis and data mining

ABSTRACT

Validation plays a crucial role in the clustering process. Many different internal validity indices exist for the purpose of determining the best clustering solution(s) from a given collection of candidates, for example, as produced by different algorithms or different algorithm hyper‐parameters. In this study, we present a comprehensive benchmark study of 26 internal validity indices, which includes highly popular classic indices as well as more recently developed ones. We adopted an enhanced revision of the methodology presented in Vendramin et al. (2010), developed here to address several shortcomings of this previous work. This overall new approach consists of three complementary custom‐tailored evaluation sub‐methodologies, each of which has been designed to assess specific aspects of an index's behavior while preventing potential biases of the other sub‐methodologies. Each sub‐methodology features two complementary measures of performance, alongside mechanisms that allow for an in‐depth investigation of more complex behaviors of the internal validity indices under study. Additionally, a new collection of 16,177 datasets has been produced, paired with eight widely used clustering algorithms, for a wider applicability scope and representation of more diverse clustering scenarios.

PUBLICATION RECORD

Publication year
2025
Venue
Statistical analysis and data mining
Publication date
2025-11-08
Fields of study
Mathematics, Computer Science
Identifiers
DOI 10.1002/sam.70061 arXiv 2511.05983
External record
Open on Semantic Scholar
Source metadata
Semantic Scholar

CITATION MAP

EXTRACTION MAP

CLAIMS

No claims are published for this paper.

CONCEPTS

No concepts are published for this paper.

REFERENCES

Model-Based Clustering, Classification, and Density Estimation Using mclust in R
2023cited by this paper
CVIK: A Matlab-based cluster validity index toolbox for automatic data clustering
2023cited by this paper
Characterizing and Comparing External Measures for the Assessment of Cluster Analysis and Community Detection
2021influential reference
Are cluster validity measures (in) valid?
2021cited by this paper
A comparative study of validity indices on estimating the optimal number of clusters
2021influential reference
A survey of cluster validity indices for automatic data clustering using differential evolution
2021influential reference
Estimating the Optimal Number of Clusters Via Internal Validity Index
2021cited by this paper
The area under the ROC curve as a measure of clustering quality
2020influential reference
Statistical Comparative Analysis and Evaluation of Validation Indices for Clustering Optimization
2020cited by this paper
Cluster validity index for irregular clustering results
2020influential reference
MDCGen: Multidimensional Dataset Generator for Clustering
2019cited by this paper
Evolving controllably difficult datasets for clustering
2019cited by this paper
A Novel Cluster Validity Index Based on Local Cores
2019cited by this paper
Model-Based Clustering and Classification for Data Science
2019cited by this paper
Comparison of Internal Validity Indices for Fuzzy Clustering
2019cited by this paper
An Internal Validity Index Based on Density-Involved Distance
2019cited by this paper
A white paper on good research practices in benchmarking: The case of cluster analysis
2018cited by this paper
An approach to validity indices for clustering techniques in Big Data
2018cited by this paper
hdbscan: Hierarchical density based clustering
2017cited by this paper
A Study of Cluster Validity Indices for Real-Life Data
2017cited by this paper
Comparison of Internal Clustering Validation Indices for Prototype-Based Clustering
2017cited by this paper
Comparative Study between Validity Indices to Obtain the Optimal Cluster
2017cited by this paper
Performance Evaluation of Cluster Validity Indices (CVIs) on Multi/Hyperspectral Remote Sensing Datasets
2016influential reference
Ground truth bias in external cluster validity indices
2016influential reference
mclust 5: Clustering, Classification and Density Estimation Using Gaussian Finite Mixture Models
2016cited by this paper
On comparing partitions
2015cited by this paper
Hierarchical Density Estimates for Data Clustering, Visualization, and Outlier Detection
2015cited by this paper
A comparison study of clustering validity indices
2015cited by this paper
What are the true clusters?
2015cited by this paper
Comparing hard and overlapping clusterings
2015cited by this paper
Density-Based Clustering Validation
2014influential reference
An extensive comparative study of cluster validity indices
2013influential reference
Density-Based Clustering Based on Hierarchical Density Estimates
2013cited by this paper
A comparison of clustering quality indices using outliers and noise
2012influential reference
Relative Validity Criteria for Community Mining Algorithms
2012cited by this paper
Investigation of Internal Validity Measures for K-Means Clustering
2012cited by this paper
Validity index for clusters of different sizes and densities
2011cited by this paper
Towards a standard methodology to evaluate internal cluster validity indices
2011cited by this paper
Scikit-learn: Machine Learning in Python
2011cited by this paper
Relative clustering validity criteria: A comparative overview
2010influential reference
Comparing Clusterings in Space
2010cited by this paper
Information Theoretic Measures for Clusterings Comparison: Variants, Properties, Normalization and Correction for Chance
2010cited by this paper
On Using Class-Labels in Evaluation of Clusterings
2010cited by this paper
Characterization and evaluation of similarity measures for pairs of clusterings
2009influential reference
Sum-of-Squares Based Cluster Validity Index and Significance Analysis
2009cited by this paper
Clustering: Science or Art?
2009cited by this paper
A density-based cluster validity approach using multi-representatives
2008cited by this paper
Multi-criteria decision making methods
2005cited by this paper
A new cluster validity measure and its application to image compression
2004cited by this paper
Validity index for crisp and fuzzy clusters
2004cited by this paper
Finding the Number of Clusters in a Dataset
2003cited by this paper
Cluster ensembles --- a knowledge reuse framework for combining multiple partitions
2003cited by this paper
Model-Based Clustering, Discriminant Analysis, and Density Estimation
2002cited by this paper
Cluster Ensembles --- A Knowledge Reuse Framework for Combining Multiple Partitions
2002cited by this paper
Estimating the number of clusters in a data set via the gap statistic
2001cited by this paper
The similarity metric
2001cited by this paper
Clustering validity assessment: finding the optimal partitioning of a data set
2001cited by this paper
On Clustering Validation Techniques
2001cited by this paper
A Collaborative Approach to Combine Multiple Learning Methods
2000cited by this paper
Quality Scheme Assessment in the Clustering Process
2000cited by this paper
How Many Clusters? Which Clustering Method? Answers Via Model-Based Cluster Analysis
1998cited by this paper
Trimmed $k$-means: an attempt to robustify quantizers
1997cited by this paper
An examination of procedures for determining the number of clusters in a data set
1994influential reference
A Validity Measure for Fuzzy Clustering
1991cited by this paper
Finding Groups in Data: An Introduction to Cluster Analysis
1990cited by this paper
A Criterion for Determining the Number of Groups in a Data Set Using Sum-of-Squares Clustering
1988cited by this paper
Algorithms for Clustering Data
1988influential reference
Silhouettes: a graphical aid to the interpretation and validation of cluster analysis
1987cited by this paper
FCM: The fuzzy c-means clustering algorithm
1984cited by this paper
A monte carlo study of thirty internal criterion measures for cluster analysis
1981cited by this paper
A Stopping Rule for Partitioning Dendrograms
1980cited by this paper
A Cluster Separation Measure
1979cited by this paper
Estimating the Dimension of a Model
1978cited by this paper
A general statistical framework for assessing categorical clustering in free recall.
1976cited by this paper
Well-Separated Clusters and Optimal Fuzzy Partitions
1974cited by this paper
Methods of Comparing Classifications
1974cited by this paper
A dendrite method for cluster analysis
1974cited by this paper
Some methods for classi cation and analysis of multivariate observations
1967cited by this paper
THE PRINCIPLES AND PRACTICE OF NUMERICAL TAXONOMY
1963cited by this paper

CITED BY

No citing papers are available for this paper.