
11

Unsupervised Data Mining

Business Analytics, 1e
By Sanjiv Jaggia, Alison Kelly, Kevin Lertwachara, and Leida Chen

9/25/21
Chapter 11 Learning Objectives (LOs)

LO 11.1 Conduct hierarchical cluster analysis.

LO 11.2 Conduct k-means cluster analysis.

LO 11.3 Conduct association rule analysis.
Introductory Case: Nutritional Facts of Candy Bars
• Aliyah is an honors student at a prestigious business school in
Southern California. She is also a fledgling entrepreneur and owns a
vending machine business. Aliyah is aware that California consumers
are becoming increasingly health conscious when it comes to food
purchases. Aliyah wants to come up with a better selection of candy bars
and strategically group and display them in her vending machines.

• Aliyah wants to use the information to accomplish the following tasks.

1. Analyze the nutritional facts data and group candy products according to their
nutritional content.
2. Select a variety of candy bars from each group to better meet the tastes of today’s
consumers.
3. Display the candy bars in her vending machines according to the grouping.
11.1: Hierarchical Cluster Analysis (1/14)
• Unsupervised data mining requires no knowledge of the
target variable.
• The algorithms allow the computer to identify complex
processes and patterns without any specific guidance from
the analyst.
• It is an important part of exploratory data analysis because
it makes no distinction between the target variable $y$ and
the predictor variables $x_1, x_2, \ldots, x_k$.
• Uses similarity measures: Euclidean, Manhattan, Jaccard’s
• We explore two core unsupervised data mining techniques:
cluster analysis and association rule analysis.
11.1: Hierarchical Cluster Analysis (2/14)
• Cluster analysis is an unsupervised data mining technique
that groups data into categories that share some similar
characteristic or trait.
– Similar within a cluster, dissimilar across clusters
– Uses similarity measures
• Allows useful exploratory analysis by summarizing a large
number of observations in a data set into a small number of
clusters.
• The cluster characteristics or profiles help us understand
and describe the different groups.
• A popular application of cluster analysis is called customer
or market segmentation.
• Two common clustering techniques: hierarchical clustering
and k-means clustering.
11.1: Hierarchical Cluster Analysis (3/14)
• Hierarchical clustering is a technique that uses an
iterative process to group data into a hierarchy of
clusters.
– Agglomerative clustering (AGNES): bottom-up; starts
with each observation being its own cluster and iteratively
merges the most similar clusters, moving up the hierarchy
– Divisive clustering (DIANA): top-down; starts with a
single cluster containing all observations and iteratively
splits off the most dissimilar observations, moving down the hierarchy
• We focus on agglomerative clustering, which is the
most commonly used approach.
• The methods can be adapted to implement divisive
clustering.
11.1: Hierarchical Cluster Analysis (4/14)
• With AGNES, each observation in the data initially forms its own cluster.
• The algorithm then successively merges these clusters into larger clusters
based on their similarity until all observations are merged into one final
cluster, referred to as a root.
• Uses (dis)similarity measures.
– Numeric: Euclidean distance or Manhattan distance
– Categorical: matching, Jaccard’s coefficient
• Uses z-score standardization to put variables on a common scale.
• Linkage methods to evaluate (dis)similarity between clusters.
– Single: nearest distance between a pair of observations not in the same cluster
– Complete: farthest distance between a pair of observations not in the same cluster
– Centroid: distance between the center/centroid or mean values of the clusters
– Average: average distance between all pairs of observations not in the same cluster
– Ward’s: uses the error sum of squares (ESS), the sum of squared differences between
individual observations and their cluster mean; measures the loss of information that
occurs when observations are clustered
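
• These distance and linkage choices map directly onto base R’s dist() and hclust() functions. A minimal sketch, assuming df is a hypothetical numeric data frame:

df_z <- scale(df)                          # z-score standardization
d <- dist(df_z, method = "euclidean")      # or method = "manhattan"

# Each linkage method is chosen via hclust()'s method argument:
hc_single   <- hclust(d, method = "single")      # nearest neighbor
hc_complete <- hclust(d, method = "complete")    # farthest neighbor
hc_average  <- hclust(d, method = "average")     # average linkage
hc_centroid <- hclust(d^2, method = "centroid")  # centroid (expects squared distances)
hc_ward     <- hclust(d, method = "ward.D2")     # Ward's ESS criterion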
11.1: Hierarchical Cluster Analysis (5/14)
11.1: Hierarchical Cluster Analysis (6/14)
• Once AGNES completes the clustering process, data are
usually represented in a treelike structure.
– Called a dendrogram
– Branches are clusters
– An observation is a “leaf”
– Visually inspect the clustering result and determine the appropriate number
of clusters
• The height of each branch (cluster) or sub-branch (sub-cluster)
indicates how dissimilar it is from the other
branches or sub-branches with which it is merged.
• The greater the height, the more distinctive the cluster is
from the other clusters.
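
• Continuing the hypothetical sketch above, the dendrogram can be drawn and then cut at a chosen number of clusters (k = 3 here is illustrative):

plot(hc_ward, hang = -1, main = "Dendrogram (Ward's linkage)")
rect.hclust(hc_ward, k = 3)         # outline a 3-cluster solution
clusters <- cutree(hc_ward, k = 3)  # cluster label for each observation
table(clusters)                     # cluster sizes, a first step in profiling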
11.1: Hierarchical Cluster Analysis (7/14)
11.1: Hierarchical Cluster Analysis (8/14)
• Relying solely on the height of a dendrogram tree branch may
lead to statistically distinctive clusters that have little or no
practical meaning.
• We often take into account both quantitative measures (such as
a dendrogram) and practical considerations to determine the
number of clusters.
• We should also review the profile of each cluster using
descriptive statistics.
• Another common approach to profiling clusters is to incorporate
variables that were not used in clustering but are of interest to the
decision maker.
• The ability of a clustering method to discover useful hidden
patterns in the data depends on how it is implemented: data
transformations, distance measures, algorithm, and linkage.
• Try several techniques, and use the one that makes the most sense.
11.1: Hierarchical Cluster Analysis (9/14)

• Example: Consider the crime rate, median income, and poverty rate for 41 cities.
11.1: Hierarchical Cluster Analysis (10/14)
• With Excel
11.1: Hierarchical Cluster Analysis (11/14)
• With Excel
11.1: Hierarchical Cluster Analysis (12/14)
• With R
11.1: Hierarchical Cluster Analysis (13/14)
• With R
11.1: Hierarchical Cluster Analysis (14/14)
• With R
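
• For reference, a hedged end-to-end sketch of the R workflow for this example; the file and variable names (Cities.csv, CrimeRate, MedianIncome, PovertyRate, City) are assumptions, not the actual data set’s names:

cities <- read.csv("Cities.csv")                   # hypothetical file name
z <- scale(cities[, c("CrimeRate", "MedianIncome", "PovertyRate")])
hc <- hclust(dist(z, method = "euclidean"), method = "ward.D2")
plot(hc, labels = cities$City, hang = -1)          # inspect the dendrogram
cities$Cluster <- cutree(hc, k = 4)                # illustrative 4-cluster cut

# Profile each cluster with descriptive statistics (cluster means)
aggregate(cities[, c("CrimeRate", "MedianIncome", "PovertyRate")],
          by = list(Cluster = cities$Cluster), FUN = mean)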
11.3: Association Rule Analysis (1/9)
• Association rule analysis is essentially a “what goes with what” study.
– Designed to identify events that tend to occur together
– Also known as affinity analysis or market basket analysis
• Classic application of market basket analysis: retail companies seek to
identify products that consumers tend to purchase together.
– Display products next to each other on a shelf
– Develop promotional campaigns to cross-sell or up-sell
• Other examples
– Improve sales and customer service
– Help diagnose illnesses based on different symptoms that occur together
• Association rules are If-Then logical statements that represent
relationships among different items or item sets.
– Designed to identify hidden patterns and co-occurring events in data
– The If part is the antecedent; the Then part is the consequent
– Antecedents and consequents can comprise a single product or a combination of
products
– A product or a combination of products is called an item or an item set
11.3: Association Rule Analysis (2/9)
• One inherent problem with searching for hidden relationships between
items or item sets is dealing with the extremely large number of
possible combinations.
• Let $n$ be the number of items. The number of possible combinations
increases exponentially: $3^n - 2^{n+1} + 1$.
– Example: 100 items gives about $5.15 \times 10^{47}$ possible combinations
– The search problem becomes extremely computationally intensive and time-
consuming.
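
• The growth is easy to verify numerically; a quick check in R (computed in floating point):

n <- 100
3^n - 2^(n + 1) + 1   # approximately 5.15e+47 possible combinations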
• There are several algorithms that can be used to perform association
rule analysis in a more efficient manner. They all focus on the
frequency of item sets.
• One of the most widely used algorithms is called the Apriori method.
– Designed to recursively generate item sets that exceed a predetermined frequency
threshold: the support of the item or item set.
– Set a minimum support value, below which an item or item set is excluded, thus
making the analysis more computationally feasible.
– Eliminates infrequent items that are below the support value, makes it easier to
analyze relevant information in a large data set.
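
• In R, the Apriori method is implemented by the arules package’s apriori() function. A minimal sketch, where trans is a hypothetical transactions object and the thresholds are illustrative:

library(arules)                        # provides apriori() and transactions

rules <- apriori(trans,
                 parameter = list(support = 0.05,    # minimum support
                                  confidence = 0.50, # minimum confidence
                                  target = "rules"))
inspect(head(sort(rules, by = "lift"), 5))  # top 5 rules by lift ratio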
11.3: Association Rule Analysis (3/9)
• With enough data, we can propose many of these If-Then association rules.
– We need a way to evaluate the effectiveness of these rules
– Only the strong associations that occur frequently have the potential to reappear consistently in
the future
• Support: the probability of the If-Then statement

$$\text{Support} = \frac{\text{Number of transactions including both antecedent and consequent}}{\text{Total number of transactions}}$$

• Confidence of the association rule: the probability that the antecedent and the
consequent occur, given that the antecedent occurs

$$\text{Confidence} = \frac{\text{Number of transactions including both antecedent and consequent}}{\text{Number of transactions including antecedent}}$$

• Both of these can be misleading if the antecedent and consequent are
common yet unrelated.
• The lift ratio evaluates the strength of the association:

$$\text{Lift ratio} = \frac{\text{Confidence}}{\text{Expected confidence}}, \qquad \text{Expected confidence} = \frac{\text{Number of transactions including consequent}}{\text{Total number of transactions}}$$

– Compares the confidence of the association rule with the overall unconditional probability of the consequent
– Lift = 1: the level of association is the same as no rule at all (random guessing)
– Lift > 1: strong (positive) association
– Lift between 0 and 1: negative association
11.3: Association Rule Analysis (4/9)
• Example: Consider the table of transactions below.
• For the association rule {mascara} => {eye liner}, compute the support, confidence, and lift ratio.
11.3: Association Rule Analysis (5/9)

• $\text{Support} = \frac{5}{10} = 0.50$
• $\text{Confidence} = \frac{5}{7} \approx 0.71$
• $\text{Expected confidence} = \frac{6}{10} = 0.60$
• $\text{Lift ratio} = \frac{0.71}{0.60} \approx 1.19$
• The lift ratio is greater than 1, indicating a strong
association between the purchase of mascara and eye liner.
• The association is 19% stronger than guessing at random.
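
• These figures can be reproduced directly from indicator vectors; the sketch below encodes a hypothetical set of 10 transactions consistent with the counts above (mascara in 7 transactions, eye liner in 6, both in 5):

# TRUE marks the transactions containing each item (hypothetical data)
mascara  <- c(TRUE, TRUE, TRUE, TRUE, TRUE, TRUE, TRUE, FALSE, FALSE, FALSE)
eyeliner <- c(TRUE, TRUE, TRUE, TRUE, TRUE, FALSE, FALSE, TRUE, FALSE, FALSE)
n <- length(mascara)

support    <- sum(mascara & eyeliner) / n             # 5/10 = 0.50
confidence <- sum(mascara & eyeliner) / sum(mascara)  # 5/7  ~ 0.71
expected   <- sum(eyeliner) / n                       # 6/10 = 0.60
lift       <- confidence / expected                   # ~1.19
round(c(support = support, confidence = confidence, lift = lift), 2)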
11.3: Association Rule Analysis (6/9)
• Example: The store manager at an electronics store
collects data on the last 100 transactions. Five possible
products were purchased: a keyboard, an SD card, a
mouse, a USB drive, and/or headphones.
11.3: Association Rule Analysis (7/9)
• With Excel
11.3: Association Rule Analysis (8/9)
• With R
11.3: Association Rule Analysis (9/9)
• With R
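
• A hedged sketch for this example, assuming the 100 transactions are stored as 0/1 indicator columns (Keyboard, SDCard, Mouse, USBDrive, Headphone) in a hypothetical file Transactions.csv; the thresholds are again illustrative:

library(arules)

# Read the 0/1 purchase matrix and coerce it to a transactions object
mat <- as.matrix(read.csv("Transactions.csv")) == 1
trans <- as(mat, "transactions")

# Mine rules and display the strongest by lift ratio
rules <- apriori(trans, parameter = list(support = 0.10, confidence = 0.50))
inspect(head(sort(rules, by = "lift"), 5))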
