0% found this document useful (0 votes)

75 views13 pages

Cluster Analysis GP Seminar

Cluster analysis is used to classify objects into homogeneous groups called clusters without prior knowledge of group membership. It involves selecting variables and a distance measure, choosing a clustering procedure like hierarchical or non-hierarchical, deciding the number of clusters, interpreting results. Hierarchical methods include agglomerative approaches like single, complete, average linkage and divisive methods that join or split clusters.

Uploaded by

Arnab Mukherjee

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PPTX, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

75 views13 pages

Cluster Analysis GP Seminar

Uploaded by

Arnab Mukherjee

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PPTX, PDF, TXT or read online on Scribd

You are on page 1/ 13

Presented By:-

SABYASACHI GHATWAL
RA 1832001010027
CLUSTER ANALYSIS
Cluster analysis is a class of techniques that are used to classify objects or cases into relative groups called
clusters. Cluster analysis is also called classification analysis or numerical taxonomy. In cluster analysis, there is no
prior information about the group or cluster membership for any of the objects.

Cluster Analysis has been used in marketing for various purposes. Segmentation of consumers in cluster analysis is
used on the basis of benefits sought from the purchase of the product. It can be used to identify homogeneous
groups of buyers.

Cluster analysis involves formulating a problem, selecting a distance measure, selecting a clustering procedure,
deciding the number of clusters, interpreting the profile clusters and finally, assessing the validity of clustering.
The variables on which the cluster analysis is to be done should be selected by keeping past research in mind. It
should also be selected by theory, the hypotheses being tested, and the judgment of the researcher. An
appropriate measure of distance or similarity should be selected; the most commonly used measure is the
Euclidean distance or its square.

Clustering procedures in cluster analysis may be hierarchical, non-hierarchical, or a two-step procedure. A

hierarchical procedure in cluster analysis is characterized by the development of a tree like structure. A hierarchical
procedure can be agglomerative or divisive. Agglomerative methods in cluster analysis consist of linkage methods,
variance methods, and centroid methods. Linkage methods in cluster analysis are comprised of single linkage,
complete linkage, and average linkage.

The non-hierarchical methods in cluster analysis are frequently referred to as K means clustering. The two-step
procedure can automatically determine the optimal number of clusters by comparing the values of model choice
criteria across different clustering solutions. The choice of clustering procedure and the choice of distance measure
are interrelated. The relative sizes of clusters in cluster analysis should be meaningful. The clusters should be
interpreted in terms of cluster centroids.
• USES AND OBJECTIVES
• Used to classify objects (cases) into homogeneous
groups called clusters.
• Objects in each cluster tend to be similar and
dissimilar to objects in the other clusters.
• Both cluster analysis and discriminant analysis are
concerned with classification.
• Discriminant analysis requires prior knowledge of
group membership.
• In cluster analysis groups are suggested by the data.
An Ideal Clustering Situation

Variable 1

Variable 2
Statistics Associated with Cluster Analysis

• Agglomeration schedule. Gives information on the objects or

cases being combined at each stage of a hierarchical clustering
process.

• Cluster centroid. Mean values of the variables for all the cases
in a particular cluster.

• Cluster centers. Initial starting points in nonhierarchical

clustering. Clusters are built around these centers, or seeds.

• Cluster membership. Indicates the cluster to which each

object or case belongs.
Statistics Associated with Cluster Analysis
• Dendrogram (A tree graph). A graphical device for displaying
clustering results.

-Vertical lines represent clusters that are joined together.

-The position of the line on the scale indicates distances at

which clusters were joined.

• Distances between cluster centers. These distances indicate how

separated the individual pairs of clusters are. Clusters that are widely
separated are distinct, and therefore desirable.

• Icicle diagram. Another type of graphical display of clustering results.

Conducting Cluster Analysis
Formulate the Problem

Select a Distance Measure

Select a Clustering Procedure

Decide on the Number of Clusters

Interpret and Profile Clusters

Assess the Validity of Clustering

Classification of Clustering Procedures
Clustering Procedures

Hierarchical Nonhierarchical

Agglomerative Divisive

Linkage Variance Centroid Sequential Parallel Optimizing

Methods Methods Methods Threshold Threshold Partitioning

Ward’s
Method

Single Complete Average

Linkage Linkage Linkage
Hierarchical Clustering Methods
• Hierarchical clustering is characterized by the development of
a hierarchy or tree-like structure.
-Agglomerative clustering starts with each object in a
separate cluster. Clusters are formed by grouping objects into
bigger and bigger clusters.
-Divisive clustering starts with all the objects grouped in a
single cluster. Clusters are divided or split until each object is in
a separate cluster.
• Agglomerative methods are commonly used in marketing
research. They consist of linkage methods, variance methods,
and centroid methods.
Hierarchical Agglomerative Clustering-Linkage Method

• The single linkage method is based on minimum

distance, or the nearest neighbor rule.

• The complete linkage method is based on the

maximum distance or the furthest neighbor approach.

• The average linkage method the distance between two

clusters is defined as the average of the distances
between all pairs of objects
Linkage Methods of Clustering
Single Linkage
Minimum Distance

Cluster 1 Cluster 2
Complete Linkage
Maximum
Distance

Cluster 1 Cluster 2
Average Linkage

Average Distance
Cluster 1 Cluster 2
Hierarchical Agglomerative Clustering-
Variance and Centroid Method
• Variance methods generate clusters to minimize the within-cluster
variance.

• Ward's procedure is commonly used. For each cluster, the sum of

squares is calculated. The two clusters with the smallest increase in the
overall sum of squares within cluster distances are combined.

• In the centroid methods, the distance between two clusters is the

distance between their centroids (means for all the variables),

• Of the hierarchical methods, average linkage and Ward's methods have

been shown to perform better than the other procedures.
Other Agglomerative Clustering Methods

Ward’s Procedure

Centroid Method

Stat 151 Fall 2019 Syllabus
No ratings yet
Stat 151 Fall 2019 Syllabus
9 pages
Agglomerative Hierarchical Clustering Algorithm-A Review: K.Sasirekha, P.Baby
No ratings yet
Agglomerative Hierarchical Clustering Algorithm-A Review: K.Sasirekha, P.Baby
3 pages
Knowledge Acquisition and Sharing - Data Mining: INF 791 Lecture 4: Cluster Analysis
No ratings yet
Knowledge Acquisition and Sharing - Data Mining: INF 791 Lecture 4: Cluster Analysis
43 pages
Cluster Analysis: Prof. (DR.) H. J. Jani Mba Programme, Sardar Patel University Vallabh Vidyanagar - 388 120
No ratings yet
Cluster Analysis: Prof. (DR.) H. J. Jani Mba Programme, Sardar Patel University Vallabh Vidyanagar - 388 120
41 pages
Cluster Analysis
No ratings yet
Cluster Analysis
9 pages
Cluster Analysis
No ratings yet
Cluster Analysis
9 pages
Cluster Analysis BRM Session 14
No ratings yet
Cluster Analysis BRM Session 14
25 pages
Cluster Analysis
No ratings yet
Cluster Analysis
15 pages
Cluster Analysis
No ratings yet
Cluster Analysis
25 pages
Cluster Analysis: Consumer Segmentation
No ratings yet
Cluster Analysis: Consumer Segmentation
17 pages
Cluster Analysis
No ratings yet
Cluster Analysis
33 pages
Cluster Analysis
No ratings yet
Cluster Analysis
33 pages
Cluster Analysis: Clusters Classification Analysis Numerical Taxonomy
No ratings yet
Cluster Analysis: Clusters Classification Analysis Numerical Taxonomy
50 pages
Chapter 20: Cluster Analysis: Advance Marketing Research
No ratings yet
Chapter 20: Cluster Analysis: Advance Marketing Research
40 pages
Chapter Twenty: Cluster Analysis
No ratings yet
Chapter Twenty: Cluster Analysis
35 pages
BA2 7 Cluster
No ratings yet
BA2 7 Cluster
33 pages
Cluster Analysis
No ratings yet
Cluster Analysis
20 pages
Cluster analysis
No ratings yet
Cluster analysis
23 pages
Lecture 02 - Cluster Analysis 1
No ratings yet
Lecture 02 - Cluster Analysis 1
59 pages
Cluster Analysis: Classification Analysis, or Numerical Taxonomy
No ratings yet
Cluster Analysis: Classification Analysis, or Numerical Taxonomy
13 pages
Chapter Twenty: Cluster Analysis
No ratings yet
Chapter Twenty: Cluster Analysis
46 pages
Cluster Analysis CH 20
No ratings yet
Cluster Analysis CH 20
2 pages
Session-13b BRM PDF
No ratings yet
Session-13b BRM PDF
18 pages
Cluster Analysis
No ratings yet
Cluster Analysis
6 pages
Market Segmentation - Cluster Analysis
No ratings yet
Market Segmentation - Cluster Analysis
18 pages
Presentation Malo
No ratings yet
Presentation Malo
65 pages
Cluster Analysis
No ratings yet
Cluster Analysis
34 pages
Advanced Marketing Research: Session 17: Cluster Analysis
No ratings yet
Advanced Marketing Research: Session 17: Cluster Analysis
8 pages
Chapter Twenty: Cluster Analysis
No ratings yet
Chapter Twenty: Cluster Analysis
41 pages
Lec35
No ratings yet
Lec35
18 pages
In Marketing, Cluster Analysis Is Used For: Statistical
No ratings yet
In Marketing, Cluster Analysis Is Used For: Statistical
3 pages
Cluster Analysis
No ratings yet
Cluster Analysis
61 pages
Malhotra MR6e 20
No ratings yet
Malhotra MR6e 20
46 pages
8.Cluster Analysis HCA
No ratings yet
8.Cluster Analysis HCA
31 pages
Cluster Analysis
100% (1)
Cluster Analysis
4 pages
Cluster Analysis
No ratings yet
Cluster Analysis
24 pages
Block 18 ST3188
No ratings yet
Block 18 ST3188
29 pages
Aula - Análise de Clusters
No ratings yet
Aula - Análise de Clusters
93 pages
Bacher 2002 Cluster Analysis
No ratings yet
Bacher 2002 Cluster Analysis
199 pages
L18_19_Clustering
No ratings yet
L18_19_Clustering
48 pages
DA Seminar
No ratings yet
DA Seminar
29 pages
MA Unit 5
No ratings yet
MA Unit 5
7 pages
Cluster Analysis
No ratings yet
Cluster Analysis
11 pages
SPSS Week7
No ratings yet
SPSS Week7
42 pages
SPSS Week7
No ratings yet
SPSS Week7
42 pages
SPSS Tutorial Cluster Analysis PDF
No ratings yet
SPSS Tutorial Cluster Analysis PDF
42 pages
SPSS Tutorial Cluster Analysis
No ratings yet
SPSS Tutorial Cluster Analysis
42 pages
11 Chapter 3
No ratings yet
11 Chapter 3
17 pages
Cluster Analysis
No ratings yet
Cluster Analysis
2 pages
10.cluster Analysis
No ratings yet
10.cluster Analysis
68 pages
Cluster Analysis
No ratings yet
Cluster Analysis
12 pages
Cluster Analysis
No ratings yet
Cluster Analysis
30 pages
Chapter-5-Cluster Analysis PDF
No ratings yet
Chapter-5-Cluster Analysis PDF
5 pages
Chapter 23 - Cluster Analysis
100% (1)
Chapter 23 - Cluster Analysis
16 pages
Decision Tree Pruning: Fundamentals and Applications
From Everand
Decision Tree Pruning: Fundamentals and Applications
Fouad Sabry
No ratings yet
Research Methodology Approaches
From Everand
Research Methodology Approaches
Jerry H. Swift
No ratings yet
Statistical Foundations for Psychology
From Everand
Statistical Foundations for Psychology
James C. Ware
No ratings yet
Extending the Boundaries: An Expansive Journey into Nonparametric Curve Estimation
From Everand
Extending the Boundaries: An Expansive Journey into Nonparametric Curve Estimation
Pasquale De Marco
No ratings yet
Julia for Data Science
From Everand
Julia for Data Science
Anshul Joshi
No ratings yet
Image Segmentation: Unlocking Insights through Pixel Precision
From Everand
Image Segmentation: Unlocking Insights through Pixel Precision
Fouad Sabry
No ratings yet
Glossary of Research Methodology
From Everand
Glossary of Research Methodology
Dr. Awadhesh Kishore
No ratings yet
Benefits of Appraisal
No ratings yet
Benefits of Appraisal
1 page
Aneshensel 2009 Toward Explaining Mental Health Disparities
No ratings yet
Aneshensel 2009 Toward Explaining Mental Health Disparities
18 pages
IND TOC Contents ICH M4E (R2) - The-CTD - Efficacy
No ratings yet
IND TOC Contents ICH M4E (R2) - The-CTD - Efficacy
69 pages
Knowledge (2) Comprehension (3) Application (4) Analysis (5) Synthesis (6) Evaluation
No ratings yet
Knowledge (2) Comprehension (3) Application (4) Analysis (5) Synthesis (6) Evaluation
5 pages
MBA Project Submission 2017-2018 Circular
No ratings yet
MBA Project Submission 2017-2018 Circular
9 pages
Bar Graphs Lesson Plan
No ratings yet
Bar Graphs Lesson Plan
8 pages
Technical Writing Presentation Skills
100% (1)
Technical Writing Presentation Skills
25 pages
Tugas UAS Manpro - Muhammad Yusuf Syamsul Assegaf (1162003020)
No ratings yet
Tugas UAS Manpro - Muhammad Yusuf Syamsul Assegaf (1162003020)
14 pages
Defense Group 1 Ch1 2
No ratings yet
Defense Group 1 Ch1 2
27 pages
An Examination of The Relationship Between Internal and External Audit in The Saudi Arabian Corporate Sector
No ratings yet
An Examination of The Relationship Between Internal and External Audit in The Saudi Arabian Corporate Sector
16 pages
Autokorelasi Spasial
No ratings yet
Autokorelasi Spasial
35 pages
Unit IV-commitment and Economic Dispatch
No ratings yet
Unit IV-commitment and Economic Dispatch
47 pages
02 ABE Review - Sampling Techniques
No ratings yet
02 ABE Review - Sampling Techniques
41 pages
Report On Survey
100% (1)
Report On Survey
12 pages
Anticancer NMR
No ratings yet
Anticancer NMR
4 pages
Literature Review 5000 Words
100% (1)
Literature Review 5000 Words
6 pages
Pretest Posttest Design
No ratings yet
Pretest Posttest Design
7 pages
MBB7003M Data Analytics and The Blockchain Assignment Brief
No ratings yet
MBB7003M Data Analytics and The Blockchain Assignment Brief
9 pages
Kul 6 - Bias Dan Confounding Dalam Epidemiologi
No ratings yet
Kul 6 - Bias Dan Confounding Dalam Epidemiologi
52 pages
Inferential Statics
No ratings yet
Inferential Statics
33 pages
Sport Physiology Dissertation Topics
100% (1)
Sport Physiology Dissertation Topics
7 pages
Academic Performance
No ratings yet
Academic Performance
3 pages
Unpacking Dark Patterns Understanding Dark
No ratings yet
Unpacking Dark Patterns Understanding Dark
23 pages
Arañas - Prelim Exam-Analytical Chem (AMCC)
No ratings yet
Arañas - Prelim Exam-Analytical Chem (AMCC)
3 pages
Non-Parametric Survival Models
100% (1)
Non-Parametric Survival Models
4 pages
Slsus500 - Audit Evidence
No ratings yet
Slsus500 - Audit Evidence
12 pages
Chapter 9 - Audit Sampling - Substantive Tests of Account Balances - Answers
No ratings yet
Chapter 9 - Audit Sampling - Substantive Tests of Account Balances - Answers
49 pages
IJAST-V2I2P103
No ratings yet
IJAST-V2I2P103
5 pages
An Innovation Resistance Theory Perspective On Mobile Payment Solutions
No ratings yet
An Innovation Resistance Theory Perspective On Mobile Payment Solutions
11 pages

Cluster Analysis GP Seminar

Uploaded by

Cluster Analysis GP Seminar

Uploaded by

Presented By:-

Clustering procedures in cluster analysis may be hierarchical, non-hierarchical, or a two-step procedure. A

• Agglomeration schedule. Gives information on the objects or

• Cluster centers. Initial starting points in nonhierarchical

• Cluster membership. Indicates the cluster to which each

-Vertical lines represent clusters that are joined together.

-The position of the line on the scale indicates distances at

• Distances between cluster centers. These distances indicate how

• Icicle diagram. Another type of graphical display of clustering results.

Select a Distance Measure

Select a Clustering Procedure

Decide on the Number of Clusters

Interpret and Profile Clusters

Assess the Validity of Clustering

Linkage Variance Centroid Sequential Parallel Optimizing

Single Complete Average

• The single linkage method is based on minimum

• The complete linkage method is based on the

• The average linkage method the distance between two

• Ward's procedure is commonly used. For each cluster, the sum of

• In the centroid methods, the distance between two clusters is the

• Of the hierarchical methods, average linkage and Ward's methods have

You might also like