Cluster Analysis Notes
Divisive Methods
Start with one all-inclusive cluster
Repeatedly divide into smaller clusters
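As a rough illustration of the two steps above, the sketch below starts with every record in one cluster and repeatedly divides the cluster with the largest diameter. The splitting rule (seed with the two farthest-apart records, then assign each record to the nearer seed), the Euclidean distance, and the function names are all assumptions made for this example; the notes do not specify a particular divisive algorithm. It also assumes the records are distinct.

```python
import math
from itertools import combinations


def farthest_pair(cluster):
    """Return the two records in the cluster that are farthest apart."""
    return max(combinations(cluster, 2), key=lambda pair: math.dist(*pair))


def split(cluster):
    """Split one cluster in two around its two farthest-apart records."""
    seed_a, seed_b = farthest_pair(cluster)
    left, right = [], []
    for record in cluster:
        if math.dist(record, seed_a) <= math.dist(record, seed_b):
            left.append(record)
        else:
            right.append(record)
    return left, right


def divisive(records, k):
    """Start with one all-inclusive cluster and divide until there are k clusters."""
    clusters = [list(records)]
    while len(clusters) < k:
        # Only clusters with at least two records can be divided.
        candidates = [c for c in clusters if len(c) > 1]
        if not candidates:
            break
        # Divide the cluster with the largest diameter (assumed rule).
        widest = max(candidates, key=lambda c: math.dist(*farthest_pair(c)))
        clusters.remove(widest)
        clusters.extend(split(widest))
    return clusters


# Toy data for illustration only (not the utilities data).
points = [(0.0, 0.0), (0.0, 1.0), (5.0, 5.0), (5.0, 6.0), (10.0, 0.0)]
result = divisive(points, 3)
```

Recording the order of the splits, rather than only the final clusters, would give the hierarchy that a dendrogram displays.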
A dendrogram shows the cluster hierarchy
Measuring Distance
Between records
Between clusters
Measuring Distance Between Records
Distance Between Two Records
For 22 utilities:
Correlation-based similarity
Statistical distance (Mahalanobis)
Manhattan distance (absolute differences)
Maximum coordinate distance
Gower’s similarity (for mixed variable types: continuous & categorical)
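Three of the measures listed above are simple enough to sketch in plain Python. The function names and the two sample records are made up for illustration. Statistical (Mahalanobis) distance and Gower's similarity are left out because they need a covariance matrix and per-variable type handling, respectively.

```python
import math


def manhattan(x, y):
    """Sum of absolute differences across coordinates."""
    return sum(abs(a - b) for a, b in zip(x, y))


def max_coordinate(x, y):
    """Largest absolute difference on any single coordinate."""
    return max(abs(a - b) for a, b in zip(x, y))


def correlation_similarity(x, y):
    """Pearson correlation between two records, treated as a similarity."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    var_x = sum((a - mean_x) ** 2 for a in x)
    var_y = sum((b - mean_y) ** 2 for b in y)
    return cov / math.sqrt(var_x * var_y)


# Two made-up records with three measurements each.
r1 = [1.0, 2.0, 3.0]
r2 = [2.0, 4.0, 7.0]

d_manhattan = manhattan(r1, r2)      # 1 + 2 + 4
d_max = max_coordinate(r1, r2)       # largest of 1, 2, 4
s_corr = correlation_similarity(r1, r2)
```

Note that the correlation-based measure is a similarity (larger means closer), whereas the other two are distances (smaller means closer).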
Measuring Distance Between Clusters
Minimum Distance
(Cluster A to Cluster B)
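A minimal sketch of the minimum-distance rule: the distance from Cluster A to Cluster B is the smallest distance between any record in A and any record in B. Euclidean distance between records and the toy coordinates are assumptions for this example.

```python
import math


def minimum_distance(cluster_a, cluster_b):
    """Smallest record-to-record distance between two clusters."""
    return min(math.dist(a, b) for a in cluster_a for b in cluster_b)


# Toy clusters for illustration only.
cluster_a = [(0.0, 0.0), (1.0, 0.0)]
cluster_b = [(4.0, 0.0), (9.0, 0.0)]

gap = minimum_distance(cluster_a, cluster_b)  # closest pair is (1, 0) and (4, 0)
```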
We chose k = 3
Cluster     #Obs   Average distance in cluster
Cluster-1   12     1748.348058
Cluster-2   3      907.6919822
Cluster-3   7      3625.242085
Overall     22     2230.906692
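The Overall row appears to be the observation-weighted mean of the three cluster averages. This reading is inferred from the numbers rather than stated in the notes, and it can be checked directly:

```python
# (cluster label, number of observations, average distance in cluster)
rows = [
    ("Cluster-1", 12, 1748.348058),
    ("Cluster-2", 3, 907.6919822),
    ("Cluster-3", 7, 3625.242085),
]

total_obs = sum(n for _, n, _ in rows)

# Weight each cluster's average by its number of observations.
overall = sum(n * avg for _, n, avg in rows) / total_obs
```

The result agrees with the reported Overall value of 2230.906692 to within rounding, and `total_obs` matches the 22 records.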