Lec-3-Decision Trees

DECISION TREES

Introduction

Decision tree learning is a method that induces concepts from examples (inductive learning).

It is among the most widely used and practical learning methods.

The learning is supervised: the classes or categories of the data instances are known.

It represents concepts as decision trees, which can be rewritten as if-then rules.

The target function can be Boolean or discrete valued.
Training a Decision Tree – The ID3 and CART Algorithms

• Ross Quinlan, computer science (ID3: 1986, C4.5: 1993)
  • Uses entropy as the impurity function
  • Meant primarily for categorical attributes and for classification
• Breiman et al., statistics (CART: 1984)
  • Uses Gini impurity
  • Meant for classification and regression

Both impurity measures are sketched below.
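For concreteness, here is a minimal sketch of the two impurity functions named above, assuming class labels arrive as a plain Python list; the function names and representation are illustrative choices of this write-up, not code from the slides. Later snippets in this section reuse the entropy helper.

```python
import math
from collections import Counter

def entropy(labels):
    """Entropy (in bits) of a list of class labels: the impurity used by ID3/C4.5."""
    total = len(labels)
    return -sum((count / total) * math.log2(count / total)
                for count in Counter(labels).values())

def gini(labels):
    """Gini impurity of a list of class labels: the impurity used by CART."""
    total = len(labels)
    return 1.0 - sum((count / total) ** 2 for count in Counter(labels).values())

# A perfectly mixed two-class sample is maximally impure under both measures:
print(entropy(["Yes", "No"]))  # 1.0
print(gini(["Yes", "No"]))     # 0.5
```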
Decision Tree Representation

1. Each node corresponds to an attribute
2. Each branch corresponds to an attribute value
3. Each leaf node assigns a classification

A minimal data structure capturing this representation is sketched below.
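As an illustration of this representation (not code from the slides), a decision tree can be stored with one small node type: an internal node records the attribute it tests and one child per attribute value, while a leaf records a classification.

```python
from dataclasses import dataclass, field
from typing import Dict, Optional

@dataclass
class Node:
    attribute: Optional[str] = None                              # attribute tested at this node (None for a leaf)
    branches: Dict[str, "Node"] = field(default_factory=dict)    # attribute value -> child subtree
    label: Optional[str] = None                                  # classification assigned at a leaf
```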
Example

[Figure 1: the PlayTennis training data (not reproduced)]
Example

[Figure: a decision tree for the concept PlayTennis. The root node tests Outlook (Sunny, Overcast, Rain); the Sunny branch tests Humidity (High, Normal) and the Rain branch tests Wind (Strong, Weak).]

An unknown observation is classified by testing its attributes and reaching a leaf node.
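Using the Node sketch from the previous section, the PlayTennis tree above can be written out and an unknown observation classified by walking from the root to a leaf. The leaf labels here follow the standard PlayTennis example, since the figure itself is not reproduced.

```python
def classify(node: Node, observation: dict) -> str:
    """Repeatedly follow the branch matching the observation's value for the tested attribute."""
    while node.label is None:                        # stop once a leaf is reached
        node = node.branches[observation[node.attribute]]
    return node.label

play_tennis_tree = Node("Outlook", {
    "Sunny":    Node("Humidity", {"High": Node(label="No"), "Normal": Node(label="Yes")}),
    "Overcast": Node(label="Yes"),
    "Rain":     Node("Wind", {"Strong": Node(label="No"), "Weak": Node(label="Yes")}),
})

print(classify(play_tennis_tree, {"Outlook": "Sunny", "Humidity": "High", "Wind": "Weak"}))  # No
```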
Decision Tree Representation

Decision trees represent a disjunction (OR) of conjunctions (AND) of constraints on the attribute values of instances.

Each path from the tree root to a leaf corresponds to a conjunction of attribute tests (one rule for classification).

The tree itself corresponds to a disjunction of these conjunctions (a set of rules for classification).
Basic Decision Tree Learning Algorithm

Most algorithms for growing decision trees are variants of a single basic algorithm.

An example of this core algorithm is the ID3 algorithm developed by Quinlan (1986).

It employs a top-down, greedy search through the space of possible decision trees: at each step it chooses the split that looks best locally for the current node, without reconsidering earlier choices and without any guarantee of reaching the globally optimal tree. A sketch of this top-down loop is given below.
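A minimal sketch of that greedy loop, assuming each training example is a dict of attribute values plus a "label" key and reusing the Node type from the earlier sketch. The attribute-selection criterion is passed in as a function, since the information gain measure it relies on is only introduced later in these slides.

```python
from collections import Counter

def id3(examples, attributes, choose_best_attribute):
    """Grow a decision tree top-down by greedily splitting on the 'best' attribute."""
    labels = [e["label"] for e in examples]
    if len(set(labels)) == 1:                        # all examples agree: perfect leaf
        return Node(label=labels[0])
    if not attributes:                               # nothing left to test: majority-vote leaf
        return Node(label=Counter(labels).most_common(1)[0][0])
    best = choose_best_attribute(examples, attributes)   # locally optimal choice, never revisited
    node = Node(attribute=best)
    for value in {e[best] for e in examples}:
        subset = [e for e in examples if e[best] == value]
        node.branches[value] = id3(subset, [a for a in attributes if a != best],
                                   choose_best_attribute)
    return node
```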
First of all, we select the best attribute to be tested at the root of the tree.

To make this selection, each attribute is evaluated using a statistical test that determines how well it alone classifies the training examples.
We have:

- 14 observations (D1 to D14)
- 4 attributes
  • Outlook
  • Temperature
  • Humidity
  • Wind
- 2 classes (Yes, No)
[Figure: the 14 training examples partitioned by the root test on Outlook into the Sunny, Overcast and Rain branches]
The selection process is then repeated, using the training examples associated with each descendant node to select the best attribute to test at that point in the tree.
[Figure: the Outlook partition from the previous slide, with the descendant nodes still to be expanded]

What is the "best" attribute to test at this point? The possible choices are Temperature, Wind and Humidity.
Which Attribute is the Best Classifier?

The central choice in the ID3 algorithm is selecting which attribute to test at each node in the tree.

We would like to select the attribute that is most useful for classifying examples.

For this we need a good quantitative measure.

For this purpose a statistical property called information gain is used.
Entropy

To define information gain precisely, we begin by defining a measure commonly used in information theory, called entropy.

Entropy tells us how impure a collection of data is; "impure" here means non-homogeneous.

Given a collection of examples S, containing positive and negative examples of some target concept, the entropy of S relative to this Boolean classification is:

Entropy(S) = - p+ log2(p+) - p- log2(p-)

where p+ and p- are the proportions of positive and negative examples in S (taking 0 · log2 0 = 0).
To illustrate this equation, let us calculate the entropy of the data set of Figure 1. The data set has 9 positive instances and 5 negative instances, therefore:

Entropy(S) = -(9/14) log2(9/14) - (5/14) log2(5/14) = 0.940
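Using the entropy helper sketched earlier, the 0.940 figure can be checked directly:

```python
S = ["Yes"] * 9 + ["No"] * 5        # 9 positive and 5 negative examples
print(f"{entropy(S):.3f}")          # 0.940
```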
Observing equations 1.2, 1.3 and 1.4 closely, we can conclude that if the data set is completely homogeneous then the impurity is 0 and the entropy is 0 (equation 1.4), whereas if the data set can be divided equally into two classes then it is completely non-homogeneous, the impurity is 100%, and the entropy is 1 (equation 1.3).
Information Gain

Given that entropy is a measure of the impurity in a collection of data, we can now measure the effectiveness of an attribute in classifying the training set.

Information gain is simply the expected reduction in entropy caused by partitioning the data set according to an attribute.

The information gain Gain(S, A) of an attribute A, relative to a collection of examples S, is defined as:

Gain(S, A) = Entropy(S) - Σ_{v ∈ Values(A)} (|S_v| / |S|) Entropy(S_v)

where Values(A) is the set of possible values of attribute A, and S_v is the subset of S for which A has value v.
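A direct transcription of this formula into Python, again with examples as dicts carrying a "label" key and reusing the entropy helper from the earlier sketch (names are illustrative):

```python
def information_gain(examples, attribute):
    """Expected reduction in entropy from partitioning the examples on one attribute."""
    total = len(examples)
    labels = [e["label"] for e in examples]
    remainder = 0.0
    for value in {e[attribute] for e in examples}:
        subset_labels = [e["label"] for e in examples if e[attribute] == value]
        remainder += (len(subset_labels) / total) * entropy(subset_labels)
    return entropy(labels) - remainder
```

Plugged into the id3 sketch above as `lambda ex, attrs: max(attrs, key=lambda a: information_gain(ex, a))`, this is the greedy selection criterion these slides describe.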
To make this clearer, let us use this equation to measure the information gain of the attribute Wind on the dataset of Figure 1.

The dataset has 14 instances, of which 9 are positive and 5 are negative.
The attribute Wind can take the values Weak or Strong.

Therefore,
Values(Wind) = {Weak, Strong}

In the standard PlayTennis data, 8 of the 14 instances have Wind = Weak (6 positive, 2 negative) and 6 have Wind = Strong (3 positive, 3 negative), so:

Gain(S, Wind) = Entropy(S) - (8/14) Entropy(S_Weak) - (6/14) Entropy(S_Strong)
              = 0.940 - (8/14)(0.811) - (6/14)(1.000)
              = 0.048
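The arithmetic can be checked with the entropy helper from the earlier sketch; the 6/2 and 3/3 class counts for the two Wind partitions are taken from the standard PlayTennis table, since Figure 1 is not reproduced here.

```python
S      = ["Yes"] * 9 + ["No"] * 5     # the full sample: 9 positive, 5 negative
weak   = ["Yes"] * 6 + ["No"] * 2     # Wind = Weak: 8 examples
strong = ["Yes"] * 3 + ["No"] * 3     # Wind = Strong: 6 examples
gain_wind = entropy(S) - (8/14) * entropy(weak) - (6/14) * entropy(strong)
print(f"{gain_wind:.3f}")             # 0.048
```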
So the information gain of the Wind attribute is 0.048. Calculating the information gain of the Outlook attribute in the same way gives 0.246.
These two examples should make it clear how information gain is calculated.
The information gains of the four attributes of the Figure 1 dataset are:

Gain(S, Outlook)     = 0.246
Gain(S, Humidity)    = 0.151
Gain(S, Wind)        = 0.048
Gain(S, Temperature) = 0.029
Remember, the main goal of measuring information gain is to find the attribute that is most useful for classifying the training set. The ID3 algorithm uses that attribute as the root of the decision tree, and then computes information gain again to choose each subsequent node.

As calculated above, the most useful attribute is Outlook, since it gives us more information than the others. So Outlook will be the root of our tree.

[Figures: the partially built tree with Outlook at the root, and the calculation of Gain(S_Sunny, Humidity) for the Sunny descendant (not reproduced)]
We can now measure the information gain of Temperature and Wind in the same way we measured Gain(S_Sunny, Humidity). Finally, we get:

Gain(S_Sunny, Humidity)    ≈ 0.970
Gain(S_Sunny, Temperature) ≈ 0.570
Gain(S_Sunny, Wind)        ≈ 0.019
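As a spot check, the Humidity figure follows from the class counts on the Sunny branch (5 examples: 2 Yes, 3 No; Humidity = High holds 3 No, Humidity = Normal holds 2 Yes, counts taken from the standard PlayTennis table), reusing the entropy helper:

```python
s_sunny = ["Yes"] * 2 + ["No"] * 3
gain = entropy(s_sunny) - (3/5) * entropy(["No"] * 3) - (2/5) * entropy(["Yes"] * 2)
print(f"{gain:.3f}")   # 0.971 (quoted as 0.970 above, where entropy is rounded first)
```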
So Humidity gives us the most information at this stage: the node below Outlook on the Sunny branch will be Humidity.

The High descendant has only negative examples and the Normal descendant has only positive examples, so both become leaf nodes and cannot be expanded further.

If we expand the Rain descendant by the same procedure, we will see that the Wind attribute provides the most information.

That calculation is left for the reader. The final decision tree is shown in Figure 4:

[Figure 4: the final decision tree for PlayTennis (not reproduced)]
Decision Boundary for Decision Trees
[Figures: decision boundaries produced by decision trees (not reproduced)]
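Since a decision tree tests one attribute at a time, its decision boundary over numeric features is a union of axis-parallel regions. As a hedged illustration (using scikit-learn, which these slides do not mention and which is assumed to be available), fitting a small tree on two numeric features and querying it on a grid makes the axis-aligned regions visible:

```python
import numpy as np
from sklearn.tree import DecisionTreeClassifier

# Toy 2-D data set, purely for illustration
rng = np.random.default_rng(0)
X = rng.normal(size=(200, 2))
y = ((X[:, 0] > 0.0) & (X[:, 1] > 0.5)).astype(int)   # a target a depth-2 tree can represent

tree = DecisionTreeClassifier(criterion="entropy", max_depth=2).fit(X, y)

# Predictions on a grid change only at axis-parallel thresholds (rectangular regions)
xs = np.linspace(-2, 2, 9)
grid = np.array([[a, b] for a in xs for b in xs])
print(tree.predict(grid).reshape(9, 9))
```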
From Decision Trees to Rules

Next step: make rules from the decision tree.

After building the decision tree, we trace each path from the root node to a leaf node, recording the test outcomes as antecedents and the leaf node's classification as the consequent.

For our example we have:

If the Outlook is Sunny and the Humidity is High then No
If the Outlook is Sunny and the Humidity is Normal then Yes
...
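That tracing can be written directly against the Node structure and the play_tennis_tree from the earlier sketches: every root-to-leaf path yields one rule, with the path's tests as antecedents and the leaf label as consequent.

```python
def tree_to_rules(node, antecedents=()):
    """Yield one (antecedents, consequent) pair per root-to-leaf path."""
    if node.label is not None:
        yield list(antecedents), node.label
        return
    for value, child in node.branches.items():
        yield from tree_to_rules(child, antecedents + ((node.attribute, value),))

for tests, outcome in tree_to_rules(play_tennis_tree):
    clauses = " and ".join(f"the {attr} is {val}" for attr, val in tests)
    print(f"If {clauses} then {outcome}")
```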
Hypothesis Space Search

ID3 can be characterized as searching a space of hypotheses for one that fits the training examples.

The space searched is the set of possible decision trees.

ID3 performs a simple-to-complex, hill-climbing search through this hypothesis space.
It begins with the empty tree, then considers progressively more elaborate hypotheses in search of a decision tree that correctly classifies the training data.

The evaluation function that guides this hill-climbing search is the information gain measure.
• ID3 searches the space of possible decision trees, doing hill-climbing on information gain.

• It maintains only one hypothesis (unlike Candidate-Elimination), so it cannot tell us how many other viable trees there are.

• It does no backtracking, so it can get stuck in local optima.

• It uses all training examples at each step, which makes the result less sensitive to errors in individual examples.
Reference

Sections 3.4.2 – 3.5 of T. Mitchell, Machine Learning.
