Loading...
Thumbnail Image
Item

Stability Selection of the Number of Clusters

Reizer, Gabriella v
Citations
Altmetric:
Abstract

Selecting the number of clusters is one of the greatest challenges in clustering analysis. In this thesis, we propose a variety of stability selection criteria based on cross validation for determining the number of clusters. Clustering stability measures the agreement of clusterings obtained by applying the same clustering algorithm on multiple independent and identically distributed samples. We propose to measure the clustering stability by the correlation between two clustering functions. These criteria are motivated by the concept of clustering instability proposed by Wang (2010), which is based on a form of clustering distance. In addition, the effectiveness and robustness of the proposed methods are numerically demonstrated on a variety of simulated and real world samples.

Comments
Description
Date
2011-04-18
Journal Title
Journal ISSN
Volume Title
Publisher
Research Projects
Organizational Units
Journal Issue
Keywords
Consistency, Cross validation, Hierarchical clustering, Instability, k-means clustering, Spectral clustering, Stability
Citation
Reizer, Gabriella v. "Stability Selection of the Number of Clusters." 2011. Thesis, Georgia State University. https://doi.org/10.57709/1958395
Embargo Lift Date
2011-04-27
Embedded videos