Variable selection in cluster analysis: an approach based on a new index
Contributo in Atti di convegno
Data di Pubblicazione:
2013
Citazione:
Variable selection in cluster analysis: an approach based on a new index / Morlini, Isabella; Zani, S.. - STAMPA. - (2013), pp. 71-79. ( Joint Meetings on Classification and Data Analysis Group of the Italian Statistical Society, CLADAG 2010 Firenze, ita ) [10.1007/978-3-642-28894-4_9].
Abstract:
In cluster analysis, the inclusion of unnecessaryvariables may mask the true group structure. For the selection ofthe best subset of variables, we suggest the use of two overallindices. The first index is a distance between two hierarchicalclusterings and the second one is a similarity index obtained asthe complement to one of the previous distance. Both criteria canbe used for measuring the similarity between clusterings obtainedwith different subsets of variables. An application with a realdata set regarding the economic welfare of the Italian Regionsshows the benefits gained with the suggested procedure.
Tipologia CRIS:
Relazione in Atti di Convegno
Keywords:
Correction for chance; Comparison of partitions; Hierarchical clusterings; Similarity
index.
Elenco autori:
Morlini, Isabella; Zani, S.
Link alla scheda completa:
Link al Full Text:
Titolo del libro:
Classification and data mining
Pubblicato in: