A comparison of indices for identifying the number of clusters in hierarchical clustering: A study on cognition in schizophrenia patients

Md Atiqul Islam (Corresponding author), Behrooz Z. Alizadeh, Edwin R. van den Heuvel, Richard Bruggeman, Wiepke Cahn, Lieuwe de Haan, René S. Kahn, Carin Meijer, Inez Myin-Germeys, Jim van Os, Durk Wiersma

Research output: Contribution to journalArticleAcademicpeer-review

9 Citations (Scopus)

Abstract

Finding clusters in a complex dataset is not straightforward. Different indices were developed to quantify the number of clusters. Their performances were studied using unrealistic simulations, since they were considered at low dimensions. We investigated 14 indices for eight-dimensional data using simulations based on cognition measures. We focused on hierarchical clustering with Ward’s agglomerative technique. Results indicated that Duda and Hart, Hartigan and Gap/pc were best performing. They estimated the number of clusters within ±1 with high probabilities. Duda and Hart index was most consistent, while Gap/pc and WGap/pc together made a good distinction between single and multiple clusters.

Original languageEnglish
Pages (from-to)98-113
Number of pages16
JournalCommunications in Statistics
Volume1
Issue number2
DOIs
Publication statusPublished - 3 Apr 2015

Keywords

  • Cluster analysis
  • cluster indices
  • hierarchical clustering
  • homogeneous subgroups
  • number of clusters

Fingerprint

Dive into the research topics of 'A comparison of indices for identifying the number of clusters in hierarchical clustering: A study on cognition in schizophrenia patients'. Together they form a unique fingerprint.

Cite this