SlideShare a Scribd company logo
7
Most read
9
Most read
10
Most read
Hierarchical clustering
NAME - A.K.M.ASHEK FARABIPresented By:
Content
•Clustering
•Types of Clustering
•Hierarchical Clustering
•Key Concepts in Hierarchal Clustering
•Types of Hierarchal Clustering
•Difference between Partitional and Hierarchical clustering
•The Application of Hierarchical Clustering
•Hierarchical Clustering pros and cons
•Summary
4/10/2018 HIERARCHICAL CLUSTERING 2
Clustering
•Clustering is the classification of objects into different groups,
or more precisely, the partitioning of a data set into subsets
(clusters), so that the data in each subset (ideally) share some
common trait - often according to some defined distance
measure.
4/10/2018 HIERARCHICAL CLUSTERING 3
Clustering (cont..)
•Also called “unsupervised learning” or “data mining”
•Organizing data into classes such that there is
• high intra-class similarity
• low intra-class similarity
•More informally, finding natural groupings among objects.
4/10/2018 HIERARCHICAL CLUSTERING 4
Clustering (cont..)
What is a natural grouping among these objects?
Clustering is subjective
Rahim’s Family School Employs Females Males
4/10/2018 HIERARCHICAL CLUSTERING 5
Types of Clustering
1. Hierarchical algorithms
2. Partitional clustering
4/10/2018 HIERARCHICAL CLUSTERING 6
Hierarchical Clustering
•Produces a set of nested clusters organized as a hierarchical
tree.
•Can be visualized as a dendrogram.
»A tree-like diagram that records the sequences of merge or splits
4/10/2018 HIERARCHICAL CLUSTERING 7
Key Concepts in Hierarchal Clustering
Dendrogram tree representation
2
3
4
5
6
object
lifetime
1. In the beginning we have 6
clusters: A, B, C, D, E and F
2. We merge clusters D and F into
cluster (D, F) at distance 0.50
3. We merge cluster A and cluster B
into (A, B) at distance 0.71
4. We merge clusters E and (D, F)
into ((D, F), E) at distance 1.00
5. We merge clusters ((D, F), E) and C
into (((D, F), E), C) at distance 1.41
6. We merge clusters (((D, F), E), C)
and (A, B) into ((((D, F), E), C), (A, B))
at distance 2.50
7. The last cluster contain all the objects,
thus conclude the computation
4/10/2018 HIERARCHICAL CLUSTERING 8
Types of Hierarchal Clustering
Two main types of hierarchical clustering
• Agglomerative:
» Start with the points as individual clusters
» At each step, merge the closest pair of clusters until only one cluster (or
K clusters) left
• Divisive:
» Start with one, all-inclusive cluster
» At each step, split a cluster until each cluster contains a point (or there
are K clusters)
4/10/2018 HIERARCHICAL CLUSTERING 9
Difference between Partitional and
Hierarchical clustering
Partitional clustering
» Partitional clustering is faster
than hierarchical clustering.
» Partitional clustering requires
stronger assumptions such as
number of clusters and the
initial centers.
» partitional clustering
algorithms require the number
of clusters to start running.
Hierarchical clustering
» Hierarchical clustering is slower
than Partitional clustering.
» Hierarchical clustering requires
only a similarity measure.
» Hierarchical clustering does not
require any input parameters
4/10/2018 HIERARCHICAL CLUSTERING 10
The applications of Hierarchical
Applications
» Wireless Sensor Network
» Audio Event Detection
» Web cluster engines
» Bioinformatics
» And many more.
4/10/2018 HIERARCHICAL CLUSTERING 11
Hierarchal Clustering pros and cons
Pros..
» Doesn't required number of
clusters to be specified.
» Easy to implement.
» Produces a dendrogram,
which helps with
understanding the data.
Cons..
» Can never undo any previous
steps throughout the
algorithm.
» Generally has long runtimes.
» Sometimes difficult to identify
the number of clusters by the
dendrogram.
4/10/2018 HIERARCHICAL CLUSTERING 12
Summary
•Hierarchical algorithm is a sequential clustering algorithm
»Use distance matrix to construct a tree of clusters (dendrogram)
»Hierarchical representation without the need of knowing of clusters (can
set termination condition with known of clusters)
•Major weakness of agglomerative clustering methods
»Can never undo what was done previously
»Sensitive to cluster distance measures and noise/outliers
4/10/2018 HIERARCHICAL CLUSTERING 13
4/10/2018 HIERARCHICAL CLUSTERING 14

More Related Content

What's hot (20)

PPT
3.5 model based clustering
Krish_ver2
 
PPTX
Clustering
Dr. C.V. Suresh Babu
 
PPTX
Ensemble learning
Haris Jamil
 
PDF
Hierarchical Clustering
Carlos Castillo (ChaTo)
 
PPTX
Introduction to Clustering algorithm
hadifar
 
PPTX
Grid based method & model based clustering method
rajshreemuthiah
 
PPT
3.3 hierarchical methods
Krish_ver2
 
PPTX
Cluster Analysis Introduction
PrasiddhaSarma
 
PPT
Randomized Algorithms
Ketan Kamra
 
PPTX
Random forest
Musa Hawamdah
 
PPTX
Hierarchical clustering
ishmecse13
 
PPTX
Machine learning with scikitlearn
Pratap Dangeti
 
PPTX
K-Nearest Neighbor Classifier
Neha Kulkarni
 
PPTX
K means clustering
keshav goyal
 
PDF
Machine Learning Clustering
Rupak Roy
 
PPT
Clustering
M Rizwan Aqeel
 
PDF
Dimensionality Reduction
mrizwan969
 
PPTX
CART – Classification & Regression Trees
Hemant Chetwani
 
3.5 model based clustering
Krish_ver2
 
Ensemble learning
Haris Jamil
 
Hierarchical Clustering
Carlos Castillo (ChaTo)
 
Introduction to Clustering algorithm
hadifar
 
Grid based method & model based clustering method
rajshreemuthiah
 
3.3 hierarchical methods
Krish_ver2
 
Cluster Analysis Introduction
PrasiddhaSarma
 
Randomized Algorithms
Ketan Kamra
 
Random forest
Musa Hawamdah
 
Hierarchical clustering
ishmecse13
 
Machine learning with scikitlearn
Pratap Dangeti
 
K-Nearest Neighbor Classifier
Neha Kulkarni
 
K means clustering
keshav goyal
 
Machine Learning Clustering
Rupak Roy
 
Clustering
M Rizwan Aqeel
 
Dimensionality Reduction
mrizwan969
 
CART – Classification & Regression Trees
Hemant Chetwani
 

Similar to Hierarchical clustering (20)

PPTX
Hierarchical methods navdeep kaur newww.pptx
dhaliwalharsh055
 
PPTX
Clustering on DSS
Enaam Alotaibi
 
PPTX
Unsupervised learning (clustering)
Pravinkumar Landge
 
PPTX
Machine Learning : Clustering - Cluster analysis.pptx
tecaviw979
 
PDF
Clustering[306] [Read-Only].pdf
igeabroad
 
PDF
CLUSTERING IN DATA MINING.pdf
SowmyaJyothi3
 
PDF
Data mining
EmaSushan
 
PPTX
Unsupervised%20Learninffffg (2).pptx. application
ShabirAhmad625218
 
PPTX
UNIT - 4: Data Warehousing and Data Mining
Nandakumar P
 
PPTX
Data Mining: clustering and analysis
Datamining Tools
 
PDF
algoritma klastering.pdf
bintis1
 
PPTX
Clusters techniques
rajshreemuthiah
 
PDF
Non hierarchical clustering
Learnbay Datascience
 
PPT
automatic classification in information retrieval
Basma Gamal
 
PPTX
Hierarchical clustering machine learning by arpit_sharma
Er. Arpit Sharma
 
PPTX
machine learning - Clustering in R
Sudhakar Chavan
 
PDF
Similarity distance measures
thilagasna
 
PPTX
DS9 - Clustering.pptx
JK970901
 
PPTX
06-Clustering.pptx
Shree Shree
 
PPTX
unitvclusteranalysis-221214135407-1956d6ef.pptx
kalyanasundaram68
 
Hierarchical methods navdeep kaur newww.pptx
dhaliwalharsh055
 
Clustering on DSS
Enaam Alotaibi
 
Unsupervised learning (clustering)
Pravinkumar Landge
 
Machine Learning : Clustering - Cluster analysis.pptx
tecaviw979
 
Clustering[306] [Read-Only].pdf
igeabroad
 
CLUSTERING IN DATA MINING.pdf
SowmyaJyothi3
 
Data mining
EmaSushan
 
Unsupervised%20Learninffffg (2).pptx. application
ShabirAhmad625218
 
UNIT - 4: Data Warehousing and Data Mining
Nandakumar P
 
Data Mining: clustering and analysis
Datamining Tools
 
algoritma klastering.pdf
bintis1
 
Clusters techniques
rajshreemuthiah
 
Non hierarchical clustering
Learnbay Datascience
 
automatic classification in information retrieval
Basma Gamal
 
Hierarchical clustering machine learning by arpit_sharma
Er. Arpit Sharma
 
machine learning - Clustering in R
Sudhakar Chavan
 
Similarity distance measures
thilagasna
 
DS9 - Clustering.pptx
JK970901
 
06-Clustering.pptx
Shree Shree
 
unitvclusteranalysis-221214135407-1956d6ef.pptx
kalyanasundaram68
 
Ad

Recently uploaded (20)

PPTX
Accounting Skills Paper-I, Preparation of Vouchers
Dr. Sushil Bansode
 
PPTX
quizbeenutirtion-230726075512-0387d08e.pptx
domingoriahlyne
 
PPTX
PYLORIC STENOSIS: NURSING MANAGEMENT.pptx
PRADEEP ABOTHU
 
PPTX
Mrs Mhondiwa Introduction to Algebra class
sabinaschimanga
 
PPTX
Company - Meaning - Definition- Types of Company - Incorporation of Company
DevaRam6
 
PDF
water conservation .pdf by Nandni Kumari XI C
Directorate of Education Delhi
 
PPTX
ANORECTAL MALFORMATIONS: NURSING MANAGEMENT.pptx
PRADEEP ABOTHU
 
PPTX
MALABSORPTION SYNDROME: NURSING MANAGEMENT.pptx
PRADEEP ABOTHU
 
PDF
A guide to responding to Section C essay tasks for the VCE English Language E...
jpinnuck
 
PPTX
Various Psychological tests: challenges and contemporary trends in psychologi...
santoshmohalik1
 
PPTX
How to Manage Resupply Subcontracting in Odoo 18
Celine George
 
PPTX
HIRSCHSPRUNG'S DISEASE(MEGACOLON): NURSING MANAGMENT.pptx
PRADEEP ABOTHU
 
PDF
Living Systems Unveiled: Simplified Life Processes for Exam Success
omaiyairshad
 
PPTX
Folding Off Hours in Gantt View in Odoo 18.2
Celine George
 
PPTX
Maternal and Child Tracking system & RCH portal
Ms Usha Vadhel
 
PDF
Module 1: Determinants of Health [Tutorial Slides]
JonathanHallett4
 
PPTX
DIARRHOEA & DEHYDRATION: NURSING MANAGEMENT.pptx
PRADEEP ABOTHU
 
PPTX
Top 10 AI Tools, Like ChatGPT. You Must Learn In 2025
Digilearnings
 
PPTX
Presentation: Climate Citizenship Digital Education
Karl Donert
 
PPTX
IDEAS AND EARLY STATES Social science pptx
NIRANJANASSURESH
 
Accounting Skills Paper-I, Preparation of Vouchers
Dr. Sushil Bansode
 
quizbeenutirtion-230726075512-0387d08e.pptx
domingoriahlyne
 
PYLORIC STENOSIS: NURSING MANAGEMENT.pptx
PRADEEP ABOTHU
 
Mrs Mhondiwa Introduction to Algebra class
sabinaschimanga
 
Company - Meaning - Definition- Types of Company - Incorporation of Company
DevaRam6
 
water conservation .pdf by Nandni Kumari XI C
Directorate of Education Delhi
 
ANORECTAL MALFORMATIONS: NURSING MANAGEMENT.pptx
PRADEEP ABOTHU
 
MALABSORPTION SYNDROME: NURSING MANAGEMENT.pptx
PRADEEP ABOTHU
 
A guide to responding to Section C essay tasks for the VCE English Language E...
jpinnuck
 
Various Psychological tests: challenges and contemporary trends in psychologi...
santoshmohalik1
 
How to Manage Resupply Subcontracting in Odoo 18
Celine George
 
HIRSCHSPRUNG'S DISEASE(MEGACOLON): NURSING MANAGMENT.pptx
PRADEEP ABOTHU
 
Living Systems Unveiled: Simplified Life Processes for Exam Success
omaiyairshad
 
Folding Off Hours in Gantt View in Odoo 18.2
Celine George
 
Maternal and Child Tracking system & RCH portal
Ms Usha Vadhel
 
Module 1: Determinants of Health [Tutorial Slides]
JonathanHallett4
 
DIARRHOEA & DEHYDRATION: NURSING MANAGEMENT.pptx
PRADEEP ABOTHU
 
Top 10 AI Tools, Like ChatGPT. You Must Learn In 2025
Digilearnings
 
Presentation: Climate Citizenship Digital Education
Karl Donert
 
IDEAS AND EARLY STATES Social science pptx
NIRANJANASSURESH
 
Ad

Hierarchical clustering

  • 1. Hierarchical clustering NAME - A.K.M.ASHEK FARABIPresented By:
  • 2. Content •Clustering •Types of Clustering •Hierarchical Clustering •Key Concepts in Hierarchal Clustering •Types of Hierarchal Clustering •Difference between Partitional and Hierarchical clustering •The Application of Hierarchical Clustering •Hierarchical Clustering pros and cons •Summary 4/10/2018 HIERARCHICAL CLUSTERING 2
  • 3. Clustering •Clustering is the classification of objects into different groups, or more precisely, the partitioning of a data set into subsets (clusters), so that the data in each subset (ideally) share some common trait - often according to some defined distance measure. 4/10/2018 HIERARCHICAL CLUSTERING 3
  • 4. Clustering (cont..) •Also called “unsupervised learning” or “data mining” •Organizing data into classes such that there is • high intra-class similarity • low intra-class similarity •More informally, finding natural groupings among objects. 4/10/2018 HIERARCHICAL CLUSTERING 4
  • 5. Clustering (cont..) What is a natural grouping among these objects? Clustering is subjective Rahim’s Family School Employs Females Males 4/10/2018 HIERARCHICAL CLUSTERING 5
  • 6. Types of Clustering 1. Hierarchical algorithms 2. Partitional clustering 4/10/2018 HIERARCHICAL CLUSTERING 6
  • 7. Hierarchical Clustering •Produces a set of nested clusters organized as a hierarchical tree. •Can be visualized as a dendrogram. »A tree-like diagram that records the sequences of merge or splits 4/10/2018 HIERARCHICAL CLUSTERING 7
  • 8. Key Concepts in Hierarchal Clustering Dendrogram tree representation 2 3 4 5 6 object lifetime 1. In the beginning we have 6 clusters: A, B, C, D, E and F 2. We merge clusters D and F into cluster (D, F) at distance 0.50 3. We merge cluster A and cluster B into (A, B) at distance 0.71 4. We merge clusters E and (D, F) into ((D, F), E) at distance 1.00 5. We merge clusters ((D, F), E) and C into (((D, F), E), C) at distance 1.41 6. We merge clusters (((D, F), E), C) and (A, B) into ((((D, F), E), C), (A, B)) at distance 2.50 7. The last cluster contain all the objects, thus conclude the computation 4/10/2018 HIERARCHICAL CLUSTERING 8
  • 9. Types of Hierarchal Clustering Two main types of hierarchical clustering • Agglomerative: » Start with the points as individual clusters » At each step, merge the closest pair of clusters until only one cluster (or K clusters) left • Divisive: » Start with one, all-inclusive cluster » At each step, split a cluster until each cluster contains a point (or there are K clusters) 4/10/2018 HIERARCHICAL CLUSTERING 9
  • 10. Difference between Partitional and Hierarchical clustering Partitional clustering » Partitional clustering is faster than hierarchical clustering. » Partitional clustering requires stronger assumptions such as number of clusters and the initial centers. » partitional clustering algorithms require the number of clusters to start running. Hierarchical clustering » Hierarchical clustering is slower than Partitional clustering. » Hierarchical clustering requires only a similarity measure. » Hierarchical clustering does not require any input parameters 4/10/2018 HIERARCHICAL CLUSTERING 10
  • 11. The applications of Hierarchical Applications » Wireless Sensor Network » Audio Event Detection » Web cluster engines » Bioinformatics » And many more. 4/10/2018 HIERARCHICAL CLUSTERING 11
  • 12. Hierarchal Clustering pros and cons Pros.. » Doesn't required number of clusters to be specified. » Easy to implement. » Produces a dendrogram, which helps with understanding the data. Cons.. » Can never undo any previous steps throughout the algorithm. » Generally has long runtimes. » Sometimes difficult to identify the number of clusters by the dendrogram. 4/10/2018 HIERARCHICAL CLUSTERING 12
  • 13. Summary •Hierarchical algorithm is a sequential clustering algorithm »Use distance matrix to construct a tree of clusters (dendrogram) »Hierarchical representation without the need of knowing of clusters (can set termination condition with known of clusters) •Major weakness of agglomerative clustering methods »Can never undo what was done previously »Sensitive to cluster distance measures and noise/outliers 4/10/2018 HIERARCHICAL CLUSTERING 13