Chapter
Chapter
Unsupervised Data
Mining
Business Analytics, 1e
By Sanjiv Jaggia, Alison Kelly, Kevin Lertwachara, and Leida Chen
9/25/21
Chapter 11 Learning Objectives (LOs)
• Confidence of the association rule: probability that the antecedent and the
consequent occur given the antecedent occurs
𝑁𝑢𝑚𝑏𝑒𝑟 𝑜𝑓 𝑡𝑟𝑎𝑛𝑠𝑎𝑐𝑡𝑖𝑜𝑛𝑠 𝑖𝑛𝑐𝑙𝑢𝑑𝑖𝑛𝑔 𝑏𝑜𝑡ℎ 𝑎𝑛𝑡𝑒𝑐𝑒𝑑𝑒𝑛𝑡 𝑎𝑛𝑑 𝑐𝑜𝑛𝑠𝑒𝑞𝑢𝑒𝑛𝑡
𝑁𝑢𝑚𝑏𝑒𝑟 𝑜𝑓 𝑡𝑟𝑎𝑛𝑠𝑎𝑐𝑡𝑖𝑜𝑛𝑠 𝑖𝑛𝑐𝑙𝑢𝑑𝑖𝑛𝑔 𝑎𝑛𝑡𝑒𝑐𝑒𝑑𝑒𝑛𝑡
• Both of these can be misleading, if the antecedent and consequent are
common yet unrelated.
!"#$%&'#('
• The lift ratio evaluates the strength of the association: )*+'(,'& ("#$%&'#('
!"#$%& '( )&*+,*-).'+, .+-/"0.+1 -'+,%2"%+)
– Expected confidence = 3')*/ +"#$%& '( )&*+,*-).'+,
– Compares the confidence of the association rule with the overall unconditional probability
– Lift = 1: level of association is the same as no rule at all (random guessing)
– Lift > 1: strong (positive) association
– Lift between 0 and 1: negative association
11.3: Association Rule Analysis (4/9)
• Example: Consider the below table of transactions.
$
• 𝑆𝑢𝑝𝑝𝑜𝑟𝑡 = !% = 0.50
$
• 𝐶𝑜𝑛𝑓𝑖𝑑𝑒𝑛𝑐𝑒 = = 0.71
&
'
• 𝐸𝑥𝑝𝑒𝑐𝑡𝑒𝑑 𝑐𝑜𝑛𝑓𝑖𝑑𝑒𝑛𝑐𝑒 = !% = 0.60
%.&!
• 𝐿𝑖𝑓𝑡 𝑟𝑎𝑡𝑖𝑜 = = 1.19
%.'%
• The lift ratio is greater than 1, indicating a strong
association between the purchase of mascara and eyeliner.
• The association is 19% stronger than guessing at random.
11.3: Association Rule Analysis (6/9)
• Example: The store manager at an electronics store
collects data on the last 100 transactions. Five possible
products were purchased: a keyboard, an SD card, a
mouse, a USB drive, and/or a headphone.
11.3: Association Rule Analysis (7/9)
• With Excel
11.3: Association Rule Analysis (8/9)
• With R
11.3: Association Rule Analysis (9/9)
• With R