SPSS Annotated Output K Means Cluster Anal

The document describes using k-means cluster analysis on customer usage data from a telecommunications provider to segment their customer base. Initially, a 3-cluster solution was obtained but did not capture all important groups. A 4-cluster solution identified a potentially profitable "Internet" customer cluster missed previously. Examining the final cluster centers and distances between clusters provided insight into the natural groupings of customers and how they compare.

Uploaded by

Aditya Mehra

SPSS ANNOTATED OUTPUT K-MEANS CLUSTER ANALYSIS

K-means cluster analysis is a tool designed to assign cases to
a fixed number of groups (clusters) whose characteristics are
not yet known but are based on a set of specified variables. It
is most useful when you want to classify a large number
(thousands) of cases.
A good cluster analysis is:
• Efficient. Uses as few clusters as possible.
• Effective. Captures all statistically and commercially
important clusters. For example, a cluster with five customers
may be statistically different but not very profitable.

The K-Means Cluster Analysis procedure begins with the
construction of initial cluster centers. You can assign these
yourself or have the procedure select k well-spaced observations
for the cluster centers.
After obtaining initial cluster centers, the procedure:
• Assigns cases to clusters based on distance from the
cluster centers.
• Updates the locations of cluster centers based on the mean
values of cases in each cluster.
These steps are repeated until any reassignment of cases would
make the clusters more internally variable or externally
similar.
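
The assign-and-update cycle described above can be sketched in a few lines of Python. This is a minimal illustration, not the SPSS implementation: the function names are hypothetical, and the stopping rule (stop when the centers no longer move) is an assumed stand-in for the criterion SPSS applies.

```python
import numpy as np

def nearest_center(X, centers):
    """Assign each case to the cluster whose center is closest."""
    # Squared Euclidean distance from every case to every center: shape (n, k)
    d2 = ((X[:, None, :] - centers[None, :, :]) ** 2).sum(axis=2)
    return d2.argmin(axis=1)

def update_centers(X, labels, centers):
    """Move each center to the mean of the cases assigned to it."""
    new_centers = centers.copy()
    for j in range(len(centers)):
        members = X[labels == j]
        if len(members) > 0:  # an empty cluster keeps its old center
            new_centers[j] = members.mean(axis=0)
    return new_centers

def kmeans(X, initial_centers, max_iter=20):
    """Repeat assign/update until the centers stop moving (assumed rule)."""
    X = np.asarray(X, dtype=float)
    centers = np.asarray(initial_centers, dtype=float).copy()
    for _ in range(max_iter):
        labels = nearest_center(X, centers)
        new_centers = update_centers(X, labels, centers)
        if np.allclose(new_centers, centers):
            break
        centers = new_centers
    return nearest_center(X, centers), centers
```

Calling kmeans(X, initial_centers) returns the cluster label of each case and the final centers; the max_iter argument plays the same role as the maximum iterations setting in the procedure.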

A telecommunications provider wants to segment its customer base
by service usage patterns. If customers can be classified by
usage, the company can offer more attractive packages to its
customers.

1. To run the cluster analysis, from the menus choose:
Analyze > Classify > K-Means Cluster...
Figure 1. K-Means Cluster Analysis dialog box
2. If the variable list does not display variable labels in
file order, right-click anywhere in the variable list and
from the context menu choose Display Variable
Labels and Sort by File Order.
3. Select Standardized log-long distance through Standardized
log-wireless and Standardized multiple
lines through Standardized electronic billing as analysis
variables.
4. Type 3 as the number of clusters.
5. Click Iterate.
Figure 2. Iterate dialog box
6. Type 20 as the maximum iterations.
7. Click Continue.
8. Click Options in the K-Means Cluster Analysis dialog
box.
Figure 3. Options dialog box
9. Select ANOVA table and Cluster information for each
group in the Statistics group.
10. Select Exclude cases pairwise in the Missing Values
group. There are many missing values because most customers
do not subscribe to all services, so excluding cases pairwise
maximizes the information you can obtain from the data, at
the cost of possibly biasing the results.
11. Click Continue, then click OK in the K-Means Cluster
Analysis dialog box.
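
To illustrate the idea behind pairwise exclusion in step 10, the sketch below assigns a case using only the variables it actually has values for. It is an assumption-laden illustration: the function names are hypothetical, missing values are represented as NaN, and the exact computation SPSS performs may differ.

```python
import numpy as np

def pairwise_sq_distance(case, center):
    """Squared Euclidean distance over the variables observed in `case`."""
    observed = ~np.isnan(case)  # True where the case has a value
    diff = case[observed] - center[observed]
    return float((diff ** 2).sum())

def assign_pairwise(X, centers):
    """Assign every case to its nearest center, skipping missing variables."""
    X = np.asarray(X, dtype=float)
    centers = np.asarray(centers, dtype=float)
    return np.array([
        int(np.argmin([pairwise_sq_distance(row, c) for c in centers]))
        for row in X
    ])
```

A case with values on only a few variables is still assigned, but on the basis of less information than a complete case, which is one way the results can become biased.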

Figure 1. Initial cluster centers for three-cluster solution

The initial cluster centers are the variable values of
the k well-spaced observations.
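
One common way to pick well-spaced observations is a farthest-first heuristic, sketched below. This is a generic technique chosen for illustration, not the selection rule SPSS uses; the function name and the choice of the first case as the starting point are assumptions.

```python
import numpy as np

def farthest_first_centers(X, k):
    """Pick k cases that are far apart from one another (generic heuristic)."""
    X = np.asarray(X, dtype=float)
    chosen = [0]  # start from the first case (an arbitrary, assumed choice)
    # Distance from every case to its nearest already-chosen center
    nearest = np.linalg.norm(X - X[0], axis=1)
    while len(chosen) < k:
        nxt = int(nearest.argmax())  # the case farthest from all chosen centers
        chosen.append(nxt)
        nearest = np.minimum(nearest, np.linalg.norm(X - X[nxt], axis=1))
    return X[chosen]
```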

Figure 1. Iteration history for three-cluster solution

The iteration history shows the progress of the clustering
process at each step. In early iterations, the cluster centers
shift quite a lot. By the 14th iteration, they have settled down
to the general area of their final location, and the last four
iterations are minor adjustments.
If the algorithm stops because the maximum number of iterations
is reached, you may want to increase the maximum because the
solution may otherwise be unstable. For example, if you had left
the maximum number of iterations at 10, the reported solution
would still be in a state of flux.
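
An iteration history of this kind can be reproduced by recording how far each center moves in every pass. The sketch below is a simplified illustration under stated assumptions: the function name, the tol parameter, and the convergence rule (largest center shift at or below tol) are hypothetical and not taken from SPSS.

```python
import numpy as np

def kmeans_with_history(X, centers, max_iter=20, tol=0.0):
    """Run k-means and record each center's shift at every iteration."""
    X = np.asarray(X, dtype=float)
    centers = np.asarray(centers, dtype=float).copy()
    history = []       # one row of center shifts per iteration
    converged = False  # stays False if max_iter is reached first
    for _ in range(max_iter):
        d2 = ((X[:, None, :] - centers[None, :, :]) ** 2).sum(axis=2)
        labels = d2.argmin(axis=1)
        new_centers = centers.copy()
        for j in range(len(centers)):
            if np.any(labels == j):
                new_centers[j] = X[labels == j].mean(axis=0)
        shift = np.linalg.norm(new_centers - centers, axis=1)
        history.append(shift)
        centers = new_centers
        if shift.max() <= tol:
            converged = True
            break
    return centers, np.array(history), converged
```

If converged comes back False, the run stopped because it hit max_iter, which is the situation in which raising the maximum is worth trying.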

Figure 1. ANOVA table for three-cluster solution

The ANOVA table indicates which variables contribute the most
to your cluster solution. Variables with large F values provide
the greatest separation between clusters.
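
The F value for each variable is the ratio of the between-cluster mean square to the within-cluster mean square. The following sketch computes it for a given set of cluster labels; the function name is hypothetical, and it assumes complete data with at least two clusters and more cases than clusters. Because the clusters are chosen to maximize differences between groups, such F values are best read descriptively rather than as significance tests.

```python
import numpy as np

def cluster_f_values(X, labels):
    """One-way ANOVA F value for every variable, using clusters as groups."""
    X = np.asarray(X, dtype=float)
    labels = np.asarray(labels)
    groups = np.unique(labels)
    n, k = X.shape[0], len(groups)
    grand_mean = X.mean(axis=0)
    ss_between = np.zeros(X.shape[1])
    ss_within = np.zeros(X.shape[1])
    for g in groups:
        members = X[labels == g]
        center = members.mean(axis=0)
        ss_between += len(members) * (center - grand_mean) ** 2
        ss_within += ((members - center) ** 2).sum(axis=0)
    ms_between = ss_between / (k - 1)  # between-cluster mean square
    ms_within = ss_within / (n - k)    # within-cluster (error) mean square
    return ms_between / ms_within
```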

Figure 1. Final cluster centers for three-cluster solution

The final cluster centers are computed as the mean for each
variable within each final cluster. The final cluster centers
reflect the characteristics of the typical case for each
cluster.
• Customers in cluster 1 tend to be big spenders who purchase
a lot of services.
• Customers in cluster 2 tend to be moderate spenders who
purchase the "calling" services.
• Customers in cluster 3 tend to spend very little and do not
purchase many services.

Figure 1. Distances between final cluster centers for
three-cluster solution

This table shows the Euclidean distances between the final
cluster centers. Greater distances between clusters correspond
to greater dissimilarities.
• Clusters 1 and 3 are most different.
• Cluster 2 is approximately equally similar to clusters 1
and 3.
These relationships between the clusters can also be intuited
from the final cluster centers, but this becomes more difficult
as the number of clusters and variables increases.
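
A table of this kind is simply the matrix of pairwise Euclidean distances between the center vectors. A minimal sketch, with a hypothetical function name and made-up center values in the usage note:

```python
import numpy as np

def center_distances(centers):
    """Euclidean distance between every pair of cluster centers."""
    centers = np.asarray(centers, dtype=float)
    diff = centers[:, None, :] - centers[None, :, :]
    return np.sqrt((diff ** 2).sum(axis=2))

def most_different_pair(distances):
    """Row and column index of the largest distance in the matrix."""
    i, j = np.unravel_index(distances.argmax(), distances.shape)
    return int(i), int(j)
```

For the illustrative centers [[0, 0], [3, 4], [6, 8]], the first and third centers are farthest apart (distance 10) and the second sits at distance 5 from each, the same pattern as the three-cluster solution described above.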

Figure 1. Number of cases in each cluster for three-cluster
solution
A large number of cases were assigned to the third cluster,
which unfortunately is the least profitable group. Perhaps a
fourth, more profitable, cluster could be extracted from this
"basic service" group.

Figure 1. K-Means Cluster Analysis dialog box

1. To run a cluster analysis with four clusters, reopen the
K-Means Cluster Analysis dialog box.
2. Type 4 as the number of clusters.
3. Click Save.
Figure 2. Save dialog box
4. Select Cluster membership and Distance from cluster
center.
5. Click Continue.
6. Click OK in the K-Means Cluster Analysis dialog box.
7. The saved variables can be used to create a useful boxplot.
From the menus, choose:
Graphs > Chart Builder...
8. Click the Gallery tab, select Boxplot from the list of
chart types, and drag and drop the Simple Boxplot icon onto
the canvas.
9. Drag and drop Distance of Case from its Classification
Cluster Center onto the y axis.
10. Drag and drop Cluster Number of Case onto the x axis.
11. Click OK to create the boxplot.
Figure 3. Chart Builder

Figure 1. Plot of distances from cluster center by cluster
membership for four-cluster solution
This is a diagnostic plot that helps you to find outliers within
clusters. There is a lot of variability in cluster 2, but all
the distances are within reason.
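
The numbers behind such a boxplot can be sketched as follows: compute each case's distance from its own cluster center, then summarize the distances per cluster. The function names are hypothetical, and the conventional 1.5 x IQR upper fence is assumed as the outlier rule.

```python
import numpy as np

def distance_to_own_center(X, labels, centers):
    """Distance of each case from the center of the cluster it belongs to."""
    X = np.asarray(X, dtype=float)
    centers = np.asarray(centers, dtype=float)
    return np.linalg.norm(X - centers[np.asarray(labels)], axis=1)

def boxplot_summary(distances, labels):
    """Quartiles and count of high outliers per cluster (1.5 x IQR rule)."""
    distances = np.asarray(distances, dtype=float)
    labels = np.asarray(labels)
    summary = {}
    for g in np.unique(labels):
        d = distances[labels == g]
        q1, median, q3 = np.percentile(d, [25, 50, 75])
        upper_fence = q3 + 1.5 * (q3 - q1)
        summary[int(g)] = {
            "q1": float(q1),
            "median": float(median),
            "q3": float(q3),
            "n_outliers": int((d > upper_fence).sum()),
        }
    return summary
```

A cluster with a wide interquartile range but no cases above the fence matches the situation described here: a lot of variability, yet no distances that look unreasonable.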

Figure 1. Final cluster centers for four-cluster solution

This table shows that an important grouping is missed in the
three-cluster solution. Members of clusters 1 and 2 are largely
drawn from cluster 3 in the three-cluster solution, and they are
unlikely to be big spenders. However, members of cluster 1 are
highly likely to purchase Internet-related services, which
establishes them as a distinct and possibly profitable group.
Clusters 3 and 4 seem to correspond to clusters 1 and 2 from the
three-cluster solution.
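
One way to check how the clusters of two solutions correspond is to cross-tabulate the saved membership variables. The sketch below assumes both membership vectors are available as arrays of cluster numbers; the function name and the sample labels in the usage note are hypothetical.

```python
import numpy as np

def crosstab(labels_a, labels_b):
    """Count cases for every combination of cluster numbers in two solutions."""
    a = np.asarray(labels_a)
    b = np.asarray(labels_b)
    rows = np.unique(a)  # cluster numbers in the first solution
    cols = np.unique(b)  # cluster numbers in the second solution
    table = np.zeros((len(rows), len(cols)), dtype=int)
    for i, r in enumerate(rows):
        for j, c in enumerate(cols):
            table[i, j] = int(((a == r) & (b == c)).sum())
    return rows, cols, table
```

With made-up memberships such as three = [3, 3, 3, 1, 2] and four = [1, 2, 2, 3, 4], the row for old cluster 3 has all of its cases in new clusters 1 and 2, which is the kind of pattern described above.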

Figure 1. Distances between final cluster centers for
four-cluster solution

The distances between the clusters have not changed greatly.
• Clusters 1 and 2 are the most similar, which makes sense
because they were combined into one cluster in the
three-cluster solution.
• Clusters 2 and 3 are the most dissimilar, since they
represent opposite spending behaviors.
• Cluster 4 is still equally similar to the other clusters.

Figure 1. Number of cases in each cluster for four-cluster
solution

Nearly 25% of cases belong to the newly created group of
"E-service" customers, which could be very significant to your
profits.

Using k-means cluster analysis, you initially grouped the
customers into three clusters. However, this solution was not
very satisfactory, so you reran the analysis with four clusters.
These results were better, and from the final cluster centers,
you saw that a potentially profitable "Internet" grouping was
missed in the three-cluster solution.
This example underscores the exploratory nature of cluster
analysis, since it is impossible to determine the "best" number
of clusters until you have run the analyses and examined the
solutions.
The next step for the company is to try to construct a model
that classifies the customers according to their demographic
information. With such a model, the company can customize offers
for individual prospective customers. For information on how the
company builds such a model, see Using Discriminant Analysis to
Classify Telecommunications Customers.

The K-Means Cluster Analysis procedure is a tool for finding
natural groupings of cases, given their values on a set of
variables. It is most useful when you want to classify a large
number (thousands) of cases.
• The TwoStep Cluster Analysis procedure allows you to use
both categorical and continuous variables, and can
automatically select the "best" number of clusters.
• If you want to cluster variables instead of cases, or have
a small number of cases, try the Hierarchical Cluster
Analysis procedure.
• If your k-means analysis is part of a segmentation
solution, these newly created clusters can be analyzed in
the Discriminant Analysis procedure.

See the following texts for more information on k-means
cluster analysis:
Aldenderfer, M. S., and R. K. Blashfield. 1984. Cluster
Analysis. Newbury Park: Sage Publications.
