Entropy and Information Gain
We would like to select the attribute that is most useful for classifying examples. A useful measure of this is entropy, which characterizes the (im)purity of a collection of examples. Given a collection S containing positive and negative examples of some target concept, the entropy of S relative to this boolean classification is

Entropy(S) = -p_+ log_2(p_+) - p_- log_2(p_-)

where p_+ is the proportion of positive examples in S and p_- is the proportion of negative examples in S.
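As a minimal sketch (the function name entropy and the counts-based interface are illustrative, not from the text), the two-class entropy can be computed directly from the numbers of positive and negative examples:

    import math

    def entropy(pos, neg):
        """Entropy of a boolean-labeled collection with `pos` positive
        and `neg` negative examples: -p+ log2(p+) - p- log2(p-)."""
        total = pos + neg
        if total == 0:
            return 0.0
        result = 0.0
        for count in (pos, neg):
            p = count / total
            if p > 0:                  # treat 0 * log2(0) as 0
                result -= p * math.log2(p)
        return result

    print(entropy(7, 7))   # 1.0 -- a perfectly mixed collection
    print(entropy(14, 0))  # 0.0 -- a pure collection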
Now, information gain is simply the expected reduction in entropy caused by partitioning the examples according to a given attribute. More precisely, the information gain Gain(S, A) of an attribute A, relative to a collection of examples S, is defined as

Gain(S, A) = Entropy(S) - Σ_{v ∈ Values(A)} (|S_v| / |S|) Entropy(S_v)

where Values(A) is the set of all possible values for attribute A, and S_v is the subset of S for which attribute A has value v (i.e., S_v = {s ∈ S | A(s) = v}).
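Building on the entropy sketch above, this definition translates directly into code; representing S as a list of (attribute-dictionary, boolean label) pairs is an assumption made for illustration:

    def information_gain(examples, attribute):
        """Gain(S, A): entropy of S minus the size-weighted entropies of
        the subsets S_v obtained by partitioning S on the attribute."""
        def pos_neg(subset):
            pos = sum(1 for _, label in subset if label)
            return pos, len(subset) - pos

        values = {attrs[attribute] for attrs, _ in examples}
        remainder = 0.0
        for v in values:
            subset = [(a, l) for a, l in examples if a[attribute] == v]
            remainder += len(subset) / len(examples) * entropy(*pos_neg(subset))
        return entropy(*pos_neg(examples)) - remainder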
For example, suppose S is a collection of training-example days described by attributes
including Wind, which can have the values Weak or Strong.
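To make the example concrete (the counts below are assumed for illustration, echoing the classic PlayTennis data rather than given in this text), suppose there are 14 such days, splitting [6+, 2-] when Wind is Weak and [3+, 3-] when Wind is Strong:

    # Hypothetical collection of 14 training-example days.
    days = (
        [({"Wind": "Weak"}, True)] * 6 + [({"Wind": "Weak"}, False)] * 2 +
        [({"Wind": "Strong"}, True)] * 3 + [({"Wind": "Strong"}, False)] * 3
    )
    # Entropy(S) ≈ 0.940; the Strong subset is perfectly mixed (entropy 1.0)
    # and the Weak subset is purer (entropy ≈ 0.811), so the gain is small.
    print(round(information_gain(days, "Wind"), 3))  # ≈ 0.048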
Information gain is precisely the measure used by ID3 to select the best attribute at each
step in growing the tree.
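Under the same assumed representation, ID3's selection step reduces to taking the attribute of maximal gain; a one-line sketch:

    def best_attribute(examples, attributes):
        # ID3's greedy choice: the attribute with the highest information gain.
        return max(attributes, key=lambda a: information_gain(examples, a))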