We investigate the role of the initialization for the stability of the қ-means clustering algorithm. As opposed to other papers, we consider the actual қ-means algorithm (also known as Lloyd algorithm). In particular we leverage on the property that this algorithm can get stuck in local optima of the қ-means objective function. We are interested in the actual clustering, not only in the costs of the solution. We analyze when different initializations lead to the same local optimum, and when they lead to different local optima. This enables us to prove that it is reasonable to select the number of clusters based on stability scores.
How the initialization affects the stability of the $k$-means algorithm
Sébastien Bubeck,M. Meilă,U. V. Luxburg
Published 2009 in Esaim: Probability and Statistics
ABSTRACT
PUBLICATION RECORD
- Publication year
2009
- Venue
Esaim: Probability and Statistics
- Publication date
2009-07-31
- Fields of study
Mathematics, Computer Science
- Identifiers
- External record
- Source metadata
Semantic Scholar
CITATION MAP
EXTRACTION MAP
CLAIMS
- No claims are published for this paper.
CONCEPTS
- No concepts are published for this paper.
REFERENCES
Showing 1-16 of 16 references · Page 1 of 1
CITED BY
Showing 1-57 of 57 citing papers · Page 1 of 1