
Module 10- Part II

Boosting Models
AdaBoost, GBM, XGBoost

Prof. Pedram Jahangiry

Class Modules
• Module 1- Introduction to Machine Learning
• Module 2- Setting up Machine Learning Environment
• Module 3- Linear Regression (Econometrics approach)
• Module 4- Machine Learning Fundamentals
• Module 5- Linear Regression (Machine Learning approach)
• Module 6- Penalized Regression (Ridge, LASSO, Elastic Net)
• Module 7- Logistic Regression
• Module 8- K-Nearest Neighbors (KNN)
• Module 9- Classification and Regression Trees (CART)
• Module 10- Bagging and Boosting
• Module 11- Dimensionality Reduction (PCA)
• Module 12- Clustering (KMeans – Hierarchical)

Road map: ML Algorithms

Supervised
• Regression: Linear / Polynomial regression, Penalized regression, KNN, SVR, Tree-based regression models
• Classification: Logistic regression, KNN, SVM (SVC), Tree-based classification models

Unsupervised
• Dimensionality Reduction: Principal Component Analysis (PCA)
• Clustering: K-Means, Hierarchical

Tree-based models:
1. Decision Trees (DTs)
2. Bagging, Random Forests
3. Boosting

Topics
Part I
1. Bagging vs Boosting
2. AdaBoost
3. Gradient Boosting Machine (GBM)
4. XGBoost

Part II
Pros and Cons

Part I
1. Bagging vs Boosting
2. AdaBoost
3. Gradient Boosting Machine (GBM)
4. XGBoost

Bagging vs Boosting

• Bagging consists of creating many “copies” of the training data (each copy slightly different from the others), applying the weak learner to each copy to obtain multiple weak models, and then combining them.
• In bagging, the bootstrapped trees are independent of each other.

• Boosting consists of using the “original” training data and iteratively creating multiple models with a weak learner. Each new model differs from the previous ones in that the weak learner, in building each new model, tries to “fix” the errors the previous models made.
• In boosting, each tree is grown using information from the previous trees (see the sketch below).
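
To make the contrast concrete, here is a minimal sketch using scikit-learn (assuming version 1.2 or later, where the base learner is passed as `estimator`); the synthetic dataset and hyperparameter values are illustrative choices, not taken from the slides.

```python
# Bagging vs boosting with the same weak learner (a decision stump).
from sklearn.datasets import make_classification
from sklearn.ensemble import AdaBoostClassifier, BaggingClassifier
from sklearn.model_selection import train_test_split
from sklearn.tree import DecisionTreeClassifier

X, y = make_classification(n_samples=1000, random_state=0)
X_train, X_test, y_train, y_test = train_test_split(X, y, random_state=0)

# Bagging: each stump is fit on an independent bootstrap "copy" of the data.
bagging = BaggingClassifier(
    estimator=DecisionTreeClassifier(max_depth=1),
    n_estimators=100,
    random_state=0,
).fit(X_train, y_train)

# Boosting: stumps are fit sequentially, each one focusing on the
# observations the previous ones got wrong.
boosting = AdaBoostClassifier(
    estimator=DecisionTreeClassifier(max_depth=1),
    n_estimators=100,
    random_state=0,
).fit(X_train, y_train)

print("bagging :", bagging.score(X_test, y_test))
print("boosting:", boosting.score(X_test, y_test))
```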

AdaBoost (Adaptive Boosting)
• A forest of weak learners: trees with only a single split on one feature (stumps).
• Each tree (stump) depends on the previous tree’s errors rather than being independent.

1) Start with the usual splitting criteria.
2) Each tree (stump) gets a different weight based on its prediction accuracy.
3) Each observation gets a weight inversely related to how well it was predicted (e.g., misclassified observations get more weight).
4) Aggregation is done based on each weak learner’s weight, as in the sketch below.

Source: Towards Data Science
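
A from-scratch sketch of these four steps for binary labels in {-1, +1}, assuming scikit-learn stumps as the weak learner; it is written for illustration, not as a reference implementation.

```python
import numpy as np
from sklearn.tree import DecisionTreeClassifier

def adaboost_fit(X, y, n_rounds=50):
    """Discrete AdaBoost for labels y in {-1, +1}."""
    n = len(y)
    w = np.full(n, 1.0 / n)                     # step 1: equal observation weights
    stumps, alphas = [], []
    for _ in range(n_rounds):
        stump = DecisionTreeClassifier(max_depth=1)   # one-split weak learner
        stump.fit(X, y, sample_weight=w)
        miss = stump.predict(X) != y
        err = np.clip(w[miss].sum(), 1e-10, 1 - 1e-10)
        alpha = 0.5 * np.log((1 - err) / err)   # step 2: stump weight from its accuracy
        w *= np.exp(np.where(miss, alpha, -alpha))    # step 3: upweight misclassified obs
        w /= w.sum()
        stumps.append(stump)
        alphas.append(alpha)
    return stumps, alphas

def adaboost_predict(X, stumps, alphas):
    # step 4: aggregate as a weighted vote of the weak learners
    votes = sum(a * s.predict(X) for s, a in zip(stumps, alphas))
    return np.sign(votes)
```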

AdaBoost
Key features:
• Adaptive: updates the weights of misclassified instances at each step.
• Tends to be sensitive to noise and outliers.
• Can be used with various base classifiers, but is most commonly used with decision stumps.

• AdaBoost has a long history: it is a popular boosting technique introduced by Yoav Freund and Robert Schapire in 1996.

Gradient Boosting Machine (GBM)

• In gradient boosting, each weak learner corrects its predecessor’s errors.
• Unlike AdaBoost, the weights of the training instances are not tweaked; instead, each predictor is trained using the residual errors of its predecessor as labels.
• Unlike AdaBoost, each tree can be larger than a stump, but the trees are still small. By fitting a small tree to the residuals, the GBM slowly improves the estimate f̂ in areas where it does not perform well.

• The learning rate shrinks the contribution of each tree, so there is a trade-off between the learning rate and the number of trees. A smaller learning rate slows the process down further, allowing more (and differently shaped) trees to attack the residuals.
• Aggregation is done by adding the first tree’s predictions and a scaled (shrunk) version of the following trees’ predictions, as in the sketch below.

Source: GeeksforGeeks
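
A minimal regression sketch of this loop under squared-error loss, assuming scikit-learn trees; the tree size, learning rate, and number of trees are illustrative values.

```python
import numpy as np
from sklearn.tree import DecisionTreeRegressor

def gbm_fit(X, y, n_trees=100, learning_rate=0.1, max_depth=2):
    f0 = y.mean()                    # initial prediction: the sample mean
    f = np.full(len(y), f0)
    trees = []
    for _ in range(n_trees):
        residuals = y - f            # errors of the current ensemble
        tree = DecisionTreeRegressor(max_depth=max_depth)
        tree.fit(X, residuals)       # small tree fit to the residuals
        f += learning_rate * tree.predict(X)   # shrunken update to f-hat
        trees.append(tree)
    return f0, trees

def gbm_predict(X, f0, trees, learning_rate=0.1):
    # first prediction plus the scaled (shrunk) contributions of the trees
    return f0 + learning_rate * sum(t.predict(X) for t in trees)
```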

Extreme Gradient Boosting (XGBoost)
• XGBoost is a refined and customized version of a gradient boosting decision tree system, created with performance and speed in mind.
• “Extreme” refers to the fact that the algorithms and methods have been customized to push the limit of what is possible for gradient boosting algorithms.
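
A minimal usage sketch, assuming the xgboost Python package is installed (`pip install xgboost`) and a synthetic dataset; the parameter values are illustrative, not recommendations.

```python
from sklearn.datasets import make_classification
from sklearn.model_selection import train_test_split
from xgboost import XGBClassifier

X, y = make_classification(n_samples=1000, random_state=0)
X_train, X_test, y_train, y_test = train_test_split(X, y, random_state=0)

model = XGBClassifier(
    n_estimators=200,    # number of boosting rounds
    max_depth=3,         # depth of each tree
    learning_rate=0.1,   # shrinkage applied to each tree's contribution
    reg_lambda=1.0,      # L2 regularization on leaf weights
)
model.fit(X_train, y_train)
print("test accuracy:", model.score(X_test, y_test))
```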

Put it all together!
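
As a hedged way to put the three models side by side, the sketch below cross-validates AdaBoost, GBM, and XGBoost on the same synthetic dataset; the default settings and resulting scores are illustrative, not a benchmark.

```python
from sklearn.datasets import make_classification
from sklearn.ensemble import AdaBoostClassifier, GradientBoostingClassifier
from sklearn.model_selection import cross_val_score
from xgboost import XGBClassifier

X, y = make_classification(n_samples=1000, random_state=0)
models = {
    "AdaBoost": AdaBoostClassifier(random_state=0),
    "GBM": GradientBoostingClassifier(random_state=0),
    "XGBoost": XGBClassifier(random_state=0),
}
for name, model in models.items():
    scores = cross_val_score(model, X, y, cv=5)
    print(f"{name:8s} mean CV accuracy: {scores.mean():.3f}")
```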

Part II
Pros and Cons

XGBoost’s Pros and Cons

Pros:
• Designed with performance and speed in mind (as noted above).

Cons:
• XGBoost is more difficult to understand, visualize, and tune than AdaBoost and random forests: there is a multitude of hyperparameters that can be tuned to increase performance (see the tuning sketch below).
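
To illustrate the tuning burden, here is a small grid-search sketch over a few of XGBoost's hyperparameters, assuming scikit-learn's GridSearchCV; the grid is a tiny illustrative subset of what can be tuned.

```python
from sklearn.datasets import make_classification
from sklearn.model_selection import GridSearchCV
from xgboost import XGBClassifier

X, y = make_classification(n_samples=500, random_state=0)
grid = GridSearchCV(
    XGBClassifier(n_estimators=100),
    param_grid={
        "max_depth": [2, 3, 4],
        "learning_rate": [0.05, 0.1, 0.3],
        "subsample": [0.8, 1.0],
    },
    cv=3,
)
grid.fit(X, y)
print("best params:", grid.best_params_)
```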

Class Modules
✓ Module 1- Introduction to Machine Learning
✓ Module 2- Setting up Machine Learning Environment
✓ Module 3- Linear Regression (Econometrics approach)
✓ Module 4- Machine Learning Fundamentals
✓ Module 5- Linear Regression (Machine Learning approach)
✓ Module 6- Penalized Regression (Ridge, LASSO, Elastic Net)
✓ Module 7- Logistic Regression
✓ Module 8- K-Nearest Neighbors (KNN)
✓ Module 9- Classification and Regression Trees (CART)
✓ Module 10- Bagging and Boosting
• Module 11- Dimensionality Reduction (PCA)
• Module 12- Clustering (KMeans – Hierarchical)
