
Machine Learning & Applications

Decision Trees
& ID3 Algorithm
INSTRUCTOR: JAWAD RASHEED
Agenda
▪ How does the decision-making process evolve?
▪ Decision Trees
▪ ID3 Algorithm
▪ Statistical Test
▪ Entropy and Information Gain
▪ Finding the attribute that is the best classifier



Decision-Making
Process



Decision-Making process
● What to do this weekend?
● If my friends are visiting, we will go to see the downtown area of the city.
● If not:
  ● If it's sunny, I'll go to the park and play.
  ● If it's windy and I'm rich, I'll go shopping.
  ● If it's rainy, I'll stay in.



Decision Tree



Decision Tree
● Decision tree learning is one of the most widely used and practical methods for inductive inference

● Decision tree learning is a method for approximating discrete-valued target functions, in which the
learned function is represented by a decision tree.



Decision Tree
● Learned trees can also be re-represented as sets of if-then rules to improve human readability.

● It is robust to noisy data and capable of learning disjunctive expressions.

● It has been successfully applied to a broad range of tasks from learning to diagnose medical cases to
learning to assess the credit risk of loan applicants.



Decision Tree Representation
● Decision trees classify instances by sorting them down the tree from the root to some leaf node,
which provides the classification of the instance.

● Each node in the tree specifies a test of some attribute of the instance.

● An instance is classified by starting at the root node of the tree, testing the attribute specified by this
node, then moving down the tree branch corresponding to the value of the attribute in the given
example. This process is repeated for the subtree rooted at the new node, until a leaf is reached.



Decision Tree Representation

● Test the following instance for the given decision tree:


(Outlook = Sunny, Temperature = Hot, Humidity = High, Wind = Strong)
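The slide's tree figure is not reproduced here; as a minimal sketch, the Python below assumes the classic PlayTennis tree (Outlook at the root, Humidity tested under Sunny, Wind tested under Rain) and walks the instance above down to a leaf:

```python
# A decision tree as nested dicts: an internal node maps an attribute
# name to {value: subtree}; a leaf is just a class label.
# Assumption: this is the standard PlayTennis tree, standing in for the
# figure on the slide.
tree = {
    "Outlook": {
        "Sunny":    {"Humidity": {"High": "No", "Normal": "Yes"}},
        "Overcast": "Yes",
        "Rain":     {"Wind": {"Strong": "No", "Weak": "Yes"}},
    }
}

def classify(node, instance):
    # Walk from the root: at each internal node, test its attribute and
    # follow the branch matching the instance's value for that attribute.
    while isinstance(node, dict):
        attribute = next(iter(node))          # attribute tested at this node
        node = node[attribute][instance[attribute]]
    return node

instance = {"Outlook": "Sunny", "Temperature": "Hot",
            "Humidity": "High", "Wind": "Strong"}
print(classify(tree, instance))               # -> No
```

With that assumed tree, the instance is sorted down the Outlook = Sunny branch, then the Humidity = High branch, and is classified No.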
Appropriate problems for Decision Tree Learning
● Decision tree learning is generally best suited to problems with the following characteristics:
● Instances are represented by attribute-value pairs
● The target function has discrete output values
● Disjunctive descriptions may be required
● The training data may contain errors
● The training data may contain missing attribute values



ID3 Algorithm



ID3 algorithm
● In decision tree learning, ID3 (Iterative Dichotomiser 3) is an algorithm invented by Ross Quinlan.
● ID3 learns decision trees by constructing them top-down.
● It begins with the question "which attribute should be tested at the root of the tree?"
● To answer this question, each instance attribute is evaluated using a statistical test.
● Statistical Test: Which attribute is the best?

[Photo caption: Ross Quinlan, School of Computer Science & Engineering, University of New South Wales, Sydney, Australia. Published the ID3 paper in Machine Learning, 1986.]
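As a minimal sketch of this top-down construction (not Quinlan's exact formulation): examples are assumed to be dicts mapping attribute names to values, and the attribute-selection test is passed in as a parameter `choose`, since the statistical test ID3 actually uses (information gain) is defined on the following slides.

```python
from collections import Counter

def id3(examples, attributes, target, choose):
    """Grow a decision tree top-down. `choose` picks the attribute to
    test at each node; ID3 plugs in the information-gain test defined
    on the following slides."""
    labels = [ex[target] for ex in examples]
    if len(set(labels)) == 1:
        return labels[0]                              # pure node -> leaf label
    if not attributes:
        return Counter(labels).most_common(1)[0][0]   # majority-vote leaf
    a = choose(examples, attributes, target)          # attribute for this node
    subtree = {}
    for v in sorted({ex[a] for ex in examples}):      # one branch per value
        subset = [ex for ex in examples if ex[a] == v]
        rest = [b for b in attributes if b != a]
        subtree[v] = id3(subset, rest, target, choose)
    return {a: subtree}
```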



Statistical Test



Statistical test – which attribute is the best?
● We would like to select the attribute that is most useful for classifying examples.

● What is a good quantitative measure of the worth of an attribute?

● We will define a statistical property, called information gain, that measures how well a given
attribute separates the training examples according to their target classification.

● ID3 uses this information gain measure to select among the candidate attributes at each step while
growing the tree.



Entropy measures
● In order to define information gain precisely, we begin by defining a measure commonly used in
information theory, called entropy.

● It characterizes the (im)purity of an arbitrary collection of examples.


Two-class case: $\mathrm{Entropy}(S) \equiv -p_{\oplus} \log_2 p_{\oplus} - p_{\ominus} \log_2 p_{\ominus}$

General case ($c$ classes): $\mathrm{Entropy}(S) \equiv \sum_{i=1}^{c} -p_i \log_2 p_i$



Entropy measures
Two-class case: $\mathrm{Entropy}(S) \equiv -p_{\oplus} \log_2 p_{\oplus} - p_{\ominus} \log_2 p_{\ominus}$

General case ($c$ classes): $\mathrm{Entropy}(S) \equiv \sum_{i=1}^{c} -p_i \log_2 p_i$

● For Example: Suppose S is a collection of 14 examples of some Boolean concept, including 9 positive
and 5 negative examples

$\mathrm{Entropy}([9\oplus, 5\ominus]) = -\tfrac{9}{14} \log_2 \tfrac{9}{14} - \tfrac{5}{14} \log_2 \tfrac{5}{14}$
$= -(0.6429)(-0.6374) - (0.3571)(-1.4854)$
$= 0.4098 + 0.5305$
$= 0.940$
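As a quick check of this arithmetic, a minimal Python sketch (the function name and the counts-as-input convention are choices made here, not from the slides):

```python
import math

def entropy(counts):
    """Entropy of a collection from its per-class counts, e.g. [9, 5]."""
    total = sum(counts)
    return -sum((c / total) * math.log2(c / total)
                for c in counts if c > 0)   # skip empty classes: 0*log2(0) := 0

print(round(entropy([9, 5]), 3))            # -> 0.94, matching the slide
```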



Entropy measures
● Notice that the entropy is 0 if all members of S belong to the same class.

● If all members are positive, then $p_{\ominus}$ is 0 and (adopting the convention that $0 \cdot \log_2 0 = 0$):

$\mathrm{Entropy}(S) = -1 \cdot \log_2 1 - 0 \cdot \log_2 0 = -1 \cdot 0 - 0 = 0$

● Note: Entropy is 1 when the collection contains an equal number of positive and negative examples.



Information gain
● Given entropy as a measure of the impurity of a collection of training examples, we can now define a measure of the effectiveness of an attribute in classifying the training data.
● The measure we will use is called information gain.
● The information gain, $\mathrm{Gain}(S, A)$, of an attribute $A$ relative to a collection of examples $S$ is defined as

$\mathrm{Gain}(S, A) \equiv \mathrm{Entropy}(S) - \sum_{v \in \mathrm{Values}(A)} \frac{|S_v|}{|S|}\, \mathrm{Entropy}(S_v)$

where $\mathrm{Values}(A)$ is the set of all possible values of attribute $A$, and $S_v$ is the subset of $S$ for which attribute $A$ has value $v$.
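A minimal Python sketch of this formula (an illustration, not library code; examples are again assumed to be dicts, and the helper names are choices made here):

```python
import math

def entropy_of(examples, target):
    """Entropy(S) over the target attribute's class distribution."""
    counts = {}
    for ex in examples:
        counts[ex[target]] = counts.get(ex[target], 0) + 1
    total = len(examples)
    return -sum((c / total) * math.log2(c / total) for c in counts.values())

def information_gain(examples, attribute, target):
    """Gain(S, A) = Entropy(S) - sum over v of |S_v|/|S| * Entropy(S_v)."""
    total = len(examples)
    remainder = 0.0
    for v in {ex[attribute] for ex in examples}:      # Values(A) seen in S
        subset = [ex for ex in examples if ex[attribute] == v]   # S_v
        remainder += len(subset) / total * entropy_of(subset, target)
    return entropy_of(examples, target) - remainder
```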



Information gain

Day   Outlook    Temperature  Humidity  Wind    PlayTennis
D1    Sunny      Hot          High      Weak    No
D2    Sunny      Hot          High      Strong  No
D3    Overcast   Hot          High      Weak    Yes
D4    Rain       Mild         High      Weak    Yes
D5    Rain       Cool         Normal    Weak    Yes
D6    Rain       Cool         Normal    Strong  No
D7    Overcast   Cool         Normal    Strong  Yes
D8    Sunny      Mild         High      Weak    No
D9    Sunny      Cool         Normal    Weak    Yes
D10   Rain       Mild         Normal    Weak    Yes
D11   Sunny      Mild         Normal    Strong  Yes
D12   Overcast   Mild         High      Strong  Yes
D13   Overcast   Hot          Normal    Weak    Yes
D14   Rain       Mild         High      Strong  No



Example: Information gain calculation

$\mathrm{Gain}(S, A) \equiv \mathrm{Entropy}(S) - \sum_{v \in \mathrm{Values}(A)} \frac{|S_v|}{|S|}\, \mathrm{Entropy}(S_v)$

Let's do it on the board ☺

Day   Outlook    Temperature  Humidity  Wind    PlayTennis
D1    Sunny      Hot          High      Weak    No
D2    Sunny      Hot          High      Strong  No
D3    Overcast   Hot          High      Weak    Yes
D4    Rain       Mild         High      Weak    Yes
D5    Rain       Cool         Normal    Weak    Yes
D6    Rain       Cool         Normal    Strong  No
D7    Overcast   Cool         Normal    Strong  Yes
D8    Sunny      Mild         High      Weak    No
D9    Sunny      Cool         Normal    Weak    Yes
D10   Rain       Mild         Normal    Weak    Yes
D11   Sunny      Mild         Normal    Strong  Yes
D12   Overcast   Mild         High      Strong  Yes
D13   Overcast   Hot          Normal    Weak    Yes
D14   Rain       Mild         High      Strong  No



Information gain: Training examples
$\mathrm{Entropy}([9\oplus, 5\ominus]) = -\tfrac{9}{14} \log_2 \tfrac{9}{14} - \tfrac{5}{14} \log_2 \tfrac{5}{14} = 0.940$
Day Outlook Temperature Humidity Wind PlayTennis
D1 Sunny Hot High Weak No
D2 Sunny Hot High Strong No
D3 Overcast Hot High Weak Yes
D4 Rain Mild High Weak Yes
D5 Rain Cool Normal Weak Yes
D6 Rain Cool Normal Strong No
D7 Overcast Cool Normal Strong Yes
D8 Sunny Mild High Weak No
D9 Sunny Cool Normal Weak Yes
D10 Rain Mild Normal Weak Yes
D11 Sunny Mild Normal Strong Yes
D12 Overcast Mild High Strong Yes
D13 Overcast Hot Normal Weak Yes
D14 Rain Mild High Strong No
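As a worked sketch, the table can be fed to the `information_gain` function from the earlier slide (that definition is assumed to be in scope here) to score each candidate attribute:

```python
# The 14 PlayTennis examples from the table above.
columns = ["Outlook", "Temperature", "Humidity", "Wind", "PlayTennis"]
rows = [
    ("Sunny", "Hot", "High", "Weak", "No"),
    ("Sunny", "Hot", "High", "Strong", "No"),
    ("Overcast", "Hot", "High", "Weak", "Yes"),
    ("Rain", "Mild", "High", "Weak", "Yes"),
    ("Rain", "Cool", "Normal", "Weak", "Yes"),
    ("Rain", "Cool", "Normal", "Strong", "No"),
    ("Overcast", "Cool", "Normal", "Strong", "Yes"),
    ("Sunny", "Mild", "High", "Weak", "No"),
    ("Sunny", "Cool", "Normal", "Weak", "Yes"),
    ("Rain", "Mild", "Normal", "Weak", "Yes"),
    ("Sunny", "Mild", "Normal", "Strong", "Yes"),
    ("Overcast", "Mild", "High", "Strong", "Yes"),
    ("Overcast", "Hot", "Normal", "Weak", "Yes"),
    ("Rain", "Mild", "High", "Strong", "No"),
]
examples = [dict(zip(columns, r)) for r in rows]

# Reuses information_gain() from the earlier sketch.
for a in ["Outlook", "Humidity", "Wind", "Temperature"]:
    print(a, round(information_gain(examples, a, "PlayTennis"), 3))
# Expected output (to 3 d.p.): Outlook 0.247, Humidity 0.152,
# Wind 0.048, Temperature 0.029 -- so Outlook is chosen as the root.
```

Outlook has the highest gain and becomes the root test, matching the classic worked example in Mitchell's textbook; repeating the same computation on the Sunny subset then selects Humidity for the next node.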



Which attribute is the best classifier?



Information gain



Which attribute should be tested next?



Decision Tree Representation

● Test the following instance for the given decision tree:


(Outlook = Sunny, Temperature = Hot, Humidity = High, Wind = Strong)
For queries: [email protected]

