ML Classification Tree

This document provides a summary of a presentation on machine learning classification trees. It begins with learning outcomes related to information theory, entropy, information gain, and the classification tree algorithm ID3. It then discusses decision tree learning and how decision trees can represent target functions and be converted to rule sets. Examples of classification trees are provided for predicting whether to play tennis based on weather conditions. The document outlines the top-down induction approach for generating decision trees, and how entropy and information gain are used to determine the best attribute to split the data on at each node in the tree.

MACHINE LEARNING

Classification Tree

Presented by: Dr. S. N. Ahsan


(Slides adapted from Tom Mitchell, Machine Learning, and from
I. H. Witten and E. Frank, Data Mining: Practical Machine Learning Tools and Techniques with Java Implementations)
Welcome!!

Lecture 06
Learning Outcomes
• Information Theory, Entropy & Information Gain.
• Classification Tree Algorithm ID3.
• Tree Pruning

3
Decision Tree Learning
• Decision tree learning is a method for approximating discrete-valued
  target functions, in which the learned function is represented by a
  decision tree.
• Learned trees can also be re-represented as sets of IF-THEN rules to
  improve human readability.
When to consider decision trees:
• Instances are describable by attribute-value pairs
• The target function is discrete valued
Examples:
• Equipment or medical diagnosis
• Credit risk analysis

4
Decision Tree for PlayTennis (Example)
Decision tree representation:
• Each internal node tests an attribute
• Each branch corresponds to an attribute value
• Each leaf node assigns a classification

Converting a Tree to Rules


IF (Outlook = Sunny) ∧ (Humidity = High)
THEN PlayTennis = No
IF (Outlook = Sunny) ∧ (Humidity = Normal)
THEN PlayTennis = Yes
….
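
As a small illustration (mine, not from the slides), the rules above map directly onto nested conditionals. The sketch below assumes the standard PlayTennis tree, in which Overcast days are always Yes and Rain days are decided by Wind; the function name is an assumption:

```python
def classify_play_tennis(outlook, humidity, wind):
    """Classify a day with the PlayTennis decision tree; each root-to-leaf
    path corresponds to one IF-THEN rule."""
    if outlook == "Sunny":
        # Sunny branch is decided by Humidity (the two rules shown above).
        return "No" if humidity == "High" else "Yes"
    if outlook == "Overcast":
        return "Yes"
    # Remaining branch: outlook == "Rain", decided by Wind.
    return "No" if wind == "Strong" else "Yes"

print(classify_play_tennis("Sunny", "High", "Weak"))   # -> No
print(classify_play_tennis("Rain", "Normal", "Weak"))  # -> Yes
```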

5
Example of a Decision Tree

Training Data:

Tid  Refund  Marital Status  Taxable Income  Cheat
 1   Yes     Single          125K            No
 2   No      Married         100K            No
 3   No      Single          70K             No
 4   Yes     Married         120K            No
 5   No      Divorced        95K             Yes
 6   No      Married         60K             No
 7   Yes     Divorced        220K            No
 8   No      Single          85K             Yes
 9   No      Married         75K             No
10   No      Single          90K             Yes

Decision tree (splitting attributes):
  Refund = Yes -> NO
  Refund = No  -> test MarSt
      MarSt = Single, Divorced -> test TaxInc
          TaxInc < 80K -> NO
          TaxInc > 80K -> YES
      MarSt = Married -> NO

6
Another Example of Decision Tree

Same training data as on the previous slide.

Decision tree:
  MarSt = Married -> NO
  MarSt = Single, Divorced -> test Refund
      Refund = Yes -> NO
      Refund = No  -> test TaxInc
          TaxInc < 80K -> NO
          TaxInc > 80K -> YES

There could be more than one tree that fits the same data!
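
To make that last point concrete, here is a small sketch (my own illustration, not from the slides): both trees above, written as functions, classify all ten training records correctly, so either one fits the data. The record encoding and function names are assumptions.

```python
# Training records: (refund, marital_status, taxable_income_in_K, cheat)
records = [
    ("Yes", "Single",   125, "No"),  ("No", "Married", 100, "No"),
    ("No",  "Single",    70, "No"),  ("Yes", "Married", 120, "No"),
    ("No",  "Divorced",  95, "Yes"), ("No", "Married",  60, "No"),
    ("Yes", "Divorced", 220, "No"),  ("No", "Single",    85, "Yes"),
    ("No",  "Married",   75, "No"),  ("No", "Single",    90, "Yes"),
]

def tree_a(refund, marst, income):
    """First tree: split on Refund, then MarSt, then TaxInc."""
    if refund == "Yes":
        return "No"
    if marst == "Married":
        return "No"
    return "Yes" if income > 80 else "No"

def tree_b(refund, marst, income):
    """Second tree: split on MarSt, then Refund, then TaxInc."""
    if marst == "Married":
        return "No"
    if refund == "Yes":
        return "No"
    return "Yes" if income > 80 else "No"

# Both trees fit the training data equally well.
for refund, marst, income, cheat in records:
    assert tree_a(refund, marst, income) == tree_b(refund, marst, income) == cheat
print("Both trees classify all 10 training records correctly.")
```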
7
Top-Down Induction of Decision Trees (Approach)
• Main loop:
  1. A ← the "best" decision attribute for the next node
  2. Assign A as the decision attribute for node
  3. For each value of A, create a new descendant of node
  4. Sort the training examples to the leaf nodes
  5. If the training examples are perfectly classified, then STOP,
     else iterate over the new leaf nodes

• Many algorithms follow this approach:
  – Hunt's Algorithm
  – CART
  – ID3, C4.5
  – SLIQ, SPRINT

• Greedy strategy: split the records based on an attribute test that optimizes a certain
  criterion. The main issues are the following (an ID3-style sketch follows this slide):
  1) Determine how to split the records
     – How to specify the attribute test condition? How to determine the best split?
  2) Determine when to stop splitting

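The following is a minimal, illustrative sketch of this main loop in Python (my own paraphrase, not code from the slides); `best_attribute` is assumed to be a helper that returns the attribute with the highest information gain, as defined on later slides.

```python
from collections import Counter

def id3(examples, attributes, target, best_attribute):
    """Grow a decision tree top-down, following the main loop above.

    examples       -- list of dicts mapping attribute name -> value
    attributes     -- attribute names still available for splitting
    target         -- name of the class attribute
    best_attribute -- assumed helper: picks the attribute with the
                      highest information gain (defined on later slides)
    """
    labels = [ex[target] for ex in examples]
    # STOP if the examples are perfectly classified or no attributes remain;
    # the leaf predicts the majority class.
    if len(set(labels)) == 1 or not attributes:
        return Counter(labels).most_common(1)[0][0]

    # 1-2. Choose the "best" decision attribute for this node.
    best = best_attribute(examples, attributes, target)

    # 3-4. Create one descendant per value of the chosen attribute and
    #      sort the training examples down to it, then recurse (step 5).
    tree = {best: {}}
    for value in set(ex[best] for ex in examples):
        subset = [ex for ex in examples if ex[best] == value]
        remaining = [a for a in attributes if a != best]
        tree[best][value] = id3(subset, remaining, target, best_attribute)

    return tree
```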
8
How to Determine the Best Split: Measuring Node Impurity

– Greedy approach:
  Nodes with a homogeneous class distribution are preferred
– Need a measure of node impurity, e.g. for two nodes with classes C0 and C1:

  C0: 5, C1: 5  -> non-homogeneous, high degree of impurity
  C0: 9, C1: 1  -> homogeneous, low degree of impurity

– The following two are the most commonly used measures of node impurity
  (a Gini sketch follows this slide):
  1. Gini Index
  2. Entropy

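As a small illustration (mine, not from the slides), the Gini index of a node with class proportions p_i is Gini = 1 - sum of p_i^2; the snippet below compares the two nodes shown above. The function name is an assumption.

```python
def gini(counts):
    """Gini index of a node: 1 minus the sum of squared class proportions."""
    total = sum(counts)
    return 1.0 - sum((c / total) ** 2 for c in counts)

# The two nodes from the slide above:
print(gini([5, 5]))   # 0.5  -> maximum impurity for two classes
print(gini([9, 1]))   # 0.18 -> much closer to a pure node
```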
9
Entropy

• S is a sample of training examples
• p⊕ is the proportion of positive examples in S
• p⊖ is the proportion of negative examples in S
• Entropy measures the impurity of S:

  Entropy(S) ≡ - p⊕ log2 p⊕ - p⊖ log2 p⊖

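A minimal sketch of this two-class entropy measure (my own illustration; the function name `entropy2` is an assumption):

```python
import math

def entropy2(p_pos):
    """Two-class entropy in bits, given the proportion of positive examples."""
    p_neg = 1.0 - p_pos
    # By convention, 0 * log2(0) is taken to be 0.
    terms = [p * math.log2(p) for p in (p_pos, p_neg) if p > 0]
    return -sum(terms)

print(entropy2(9 / 14))  # ~0.940 bits: the PlayTennis data used later
print(entropy2(0.5))     # 1.0 bit: maximum impurity
print(entropy2(1.0))     # 0.0 bits: a pure sample
```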
10
Information Gain
Gain(S, A) = the expected reduction in entropy due to sorting S on attribute A:

  Gain(S, A) = Entropy(S) - Σ_{v ∈ Values(A)} (|Sv| / |S|) · Entropy(Sv)

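A sketch of Gain(S, A) for nominal attributes (my own illustration, not code from the slides); examples are assumed to be dicts mapping attribute names to values:

```python
import math
from collections import Counter, defaultdict

def entropy_of_labels(labels):
    """Entropy in bits of a list of class labels (any number of classes)."""
    total = len(labels)
    return -sum((c / total) * math.log2(c / total)
                for c in Counter(labels).values())

def information_gain(examples, attribute, target):
    """Gain(S, A) = Entropy(S) - sum over values v of |S_v|/|S| * Entropy(S_v)."""
    labels = [ex[target] for ex in examples]
    before = entropy_of_labels(labels)

    # Partition S by the value of attribute A.
    partitions = defaultdict(list)
    for ex in examples:
        partitions[ex[attribute]].append(ex[target])

    after = sum(len(part) / len(examples) * entropy_of_labels(part)
                for part in partitions.values())
    return before - after
```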
11
12
13
14
Entropy, a common way to measure impurity

15
2-Class Cases

16
Information Gain

17
Calculating Information Gain

18
Classification Tree Example
How would you distinguish Class I from Class II?

19
Training Examples

20
Selecting the Next Attribute (1/2)
Which attribute is the best classifier?

21
Selecting the Next Attribute (2/2)

Ssunny = {D1, D2, D8, D9, D11}
Gain(Ssunny, Humidity)    = 0.970 - (3/5)·0.0 - (2/5)·0.0 = 0.970
Gain(Ssunny, Temperature) = 0.970 - (2/5)·0.0 - (2/5)·1.0 - (1/5)·0.0 = 0.570
Gain(Ssunny, Wind)        = 0.970 - (2/5)·1.0 - (3/5)·0.918 = 0.019

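A quick self-contained check of these three numbers (my own sketch; `H` is an assumed helper, and the per-value class counts below are read off the weights in the slide's arithmetic together with Mitchell's PlayTennis table):

```python
import math

def H(yes, no):
    """Two-class entropy in bits from class counts."""
    bits = 0.0
    for c in (yes, no):
        p = c / (yes + no)
        if p > 0:
            bits -= p * math.log2(p)
    return bits

base = H(2, 3)  # Ssunny contains 2 Yes and 3 No examples -> about 0.971 bits

# Humidity on Ssunny:    High -> [0 Yes, 3 No],  Normal -> [2 Yes, 0 No]
gain_humidity    = base - (3/5) * H(0, 3) - (2/5) * H(2, 0)
# Temperature on Ssunny: Hot -> [0, 2],  Mild -> [1, 1],  Cool -> [1, 0]
gain_temperature = base - (2/5) * H(0, 2) - (2/5) * H(1, 1) - (1/5) * H(1, 0)
# Wind on Ssunny:        Weak -> [1, 2],  Strong -> [1, 1]
gain_wind        = base - (3/5) * H(1, 2) - (2/5) * H(1, 1)

print(round(gain_humidity, 3), round(gain_temperature, 3), round(gain_wind, 3))
# -> 0.971 0.571 0.02  (the slide rounds these to 0.970, 0.570, 0.019)
```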
22
Decision Tree Based Classification
Advantages:
– Inexpensive to construct
– Extremely fast at classifying unknown records
– Easy to interpret for small-sized trees
– Accuracy is comparable to other classification techniques for many
simple data sets

Practical Issues of Classification:


- Underfitting and Overfitting
- Missing Values
- Costs of Classification

23
Divide and Conquer
Constructing Decision Trees
outlook temperature humidity windy play
sunny 85 85 FALSE no
sunny 80 90 TRUE no
overcast 83 86 FALSE yes
rainy 70 96 FALSE yes
rainy 68 80 FALSE yes
rainy 65 70 TRUE no
overcast 64 65 TRUE yes
sunny 72 95 FALSE no
sunny 69 70 FALSE yes
rainy 75 80 FALSE yes
sunny 75 70 TRUE yes
overcast 72 90 TRUE yes
overcast 81 75 FALSE yes
rainy 71 91 TRUE no
Which attribute to select?
24
Divide and Conquer
Constructing Decision Trees
• Which is the best attribute?
– The one which will result in the smallest tree
• Popular impurity criterion: information gain
– Information gain increases with the average purity of the subsets that an attribute produces
• Strategy: choose attribute that results in greatest information gain

25
Divide and Conquer
Constructing Decision Trees
outlook play
sunny no
sunny no
overcast yes
rainy yes
rainy yes
rainy no
overcast yes
sunny no
sunny yes
rainy yes
sunny yes
overcast yes
overcast yes
rainy no

Class counts [yes, no] per outlook value: sunny [2,3], overcast [4,0], rainy [3,2]

26
Leaf class counts [yes, no] for the four candidate splits of the weather data:
  outlook:     sunny [2,3]   overcast [4,0]   rainy [3,2]
  temperature: hot [2,2]     mild [4,2]       cool [3,1]
  humidity:    high [3,4]    normal [6,1]
  windy:       false [6,2]   true [3,3]

• When the number of either yeses or nos is zero, the information is zero
• When the numbers of yeses and nos are equal, the information reaches a maximum

27
Divide and Conquer:
Constructing Decision Trees

Outlook stump, class counts [yes, no]: sunny [2,3], overcast [4,0], rainy [3,2]

• Info([2,3]) = -2/5 * log2(2/5) - 3/5 * log2(3/5) = 0.971
• Info([4,0]) = -4/4 * log2(4/4) - 0/4 * log2(0/4) = 0   (taking 0 * log2(0) = 0)
• Info([3,2]) = -3/5 * log2(3/5) - 2/5 * log2(2/5) = 0.971
• Info([2,3],[4,0],[3,2])
  = 5/14 * 0.971 + 4/14 * 0 + 5/14 * 0.971 = 0.693 bits

28
Divide and Conquer:
Constructing Decision Trees
play: [9 yes, 5 no]

• Play: Info([9,5]) = -9/14 * log2(9/14) - 5/14 * log2(5/14) = 0.940 bits
• Gain(outlook) = Info([9,5]) - Info([2,3],[4,0],[3,2])
  = 0.940 - 0.693 = 0.247 bits
• Gain(temperature) = 0.029 bits
• Gain(humidity) = 0.152 bits
• Gain (windy) = 0.048 bits
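
A short self-contained check of these gains (my own sketch; the [yes, no] counts per branch are the ones listed a few slides back, and the `info` helper is an assumption):

```python
import math

def info(*counts_per_branch):
    """Weighted average entropy (bits) of a split, given [yes, no] counts per branch."""
    total = sum(sum(branch) for branch in counts_per_branch)
    bits = 0.0
    for branch in counts_per_branch:
        n = sum(branch)
        h = -sum((c / n) * math.log2(c / n) for c in branch if c > 0)
        bits += (n / total) * h
    return bits

base = info([9, 5])  # entropy of the full weather data: ~0.940 bits

splits = {
    "outlook":     ([2, 3], [4, 0], [3, 2]),
    "temperature": ([2, 2], [4, 2], [3, 1]),
    "humidity":    ([3, 4], [6, 1]),
    "windy":       ([6, 2], [3, 3]),
}
for name, branches in splits.items():
    print(name, round(base - info(*branches), 3))
# outlook 0.247, temperature 0.029, humidity 0.152, windy 0.048
```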
29
[Figure: tree after the first split on outlook; which attribute should be tested next on the sunny branch?]
30
Splitting the outlook = sunny branch: Info([2,3]) = 0.971 bits

Temperature stump on the sunny subset, class counts [yes, no]: hot [0,2], mild [1,1], cool [1,0]

• Info([0,2]) = 0
• Info([1,1]) = -1/2 * log2(1/2) - 1/2 * log2(1/2) = 1 bit
• Info([1,0]) = 0
• Info([0,2],[1,1],[1,0]) = 2/5 * 0 + 2/5 * 1 + 1/5 * 0 = 0.4 bits
• Gain(temperature) = 0.971 - 0.4 = 0.571 bits

31
Divide and Conquer:
Constructing Decision Trees
• Gain (temperature) = 0.571 bits
• Gain (humidity) = 0.971 bits
• Gain (windy) = 0.020 bits

32
Example (1/2)

33
Example (2/2)

34
Review Questions
1. What is entropy?
2. What will be the value of entropy if the distribution is homogeneous?
3. What is Information Gain?
4. How do we select the attribute for the root node of the tree?
5. What is Tree Pruning?

35
Thank you
