Learning Rate (learning_rate or eta):
o Definition: Controls the step size shrinkage used in updating the weights of the model
during each boosting iteration.
o Where to use: Central to controlling the step size during boosting and preventing
overfitting.
o When to use: Lower values make the boosting process more conservative and require
more boosting rounds to converge, while higher values may lead to overfitting.
o XGBoost Hyperparameter: learning_rate
o Recommended values: Typically in the range [0.01, 0.3].
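A minimal sketch of setting it through the scikit-learn wrapper (the
make_classification dataset here is purely illustrative):

    from sklearn.datasets import make_classification
    from xgboost import XGBClassifier

    X, y = make_classification(n_samples=1000, random_state=42)

    # Smaller steps per round: pair a low eta with more boosting rounds.
    model = XGBClassifier(learning_rate=0.05, n_estimators=500)
    model.fit(X, y)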
Number of Estimators (n_estimators):
o Definition: Number of boosting rounds or trees to build.
o Where to use: Dictates the number of boosting rounds and the overall complexity of
the model.
o When to use: Higher values can improve performance up to a point, but more
estimators increase computation time and, without early stopping or a lower
learning rate, the risk of overfitting.
o XGBoost Hyperparameter: n_estimators
o Recommended values: Depends on the size of the dataset and computational
resources, but typically in the range [100, 1000].
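Rather than hand-picking n_estimators, a common pattern is to set it high and
let early stopping choose the effective round count. A sketch assuming
xgboost >= 1.6, where early_stopping_rounds is a constructor argument:

    from sklearn.datasets import make_classification
    from sklearn.model_selection import train_test_split
    from xgboost import XGBClassifier

    X, y = make_classification(n_samples=2000, random_state=0)
    X_tr, X_val, y_tr, y_val = train_test_split(X, y, random_state=0)

    # Set n_estimators high; training stops once validation loss stalls.
    model = XGBClassifier(n_estimators=1000, learning_rate=0.05,
                          early_stopping_rounds=20, eval_metric="logloss")
    model.fit(X_tr, y_tr, eval_set=[(X_val, y_val)], verbose=False)
    print(model.best_iteration)  # effective number of boosting rounds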
Maximum Depth (max_depth):
o Definition: Maximum depth of a tree in the ensemble.
o Where to use: Controls the depth of individual trees and the complexity of the model.
o When to use: Higher values allow for more complex trees, but too high may lead to
overfitting.
o XGBoost Hyperparameter: max_depth
o Recommended values: Typically in the range [3, 10].
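One way to choose max_depth within that range is a small cross-validated grid;
a sketch using scikit-learn's GridSearchCV:

    from sklearn.datasets import make_classification
    from sklearn.model_selection import GridSearchCV
    from xgboost import XGBClassifier

    X, y = make_classification(n_samples=1000, random_state=0)

    # Search shallow-to-deep trees; deeper trees capture more interactions.
    grid = GridSearchCV(XGBClassifier(n_estimators=200),
                        param_grid={"max_depth": [3, 5, 7, 10]}, cv=3)
    grid.fit(X, y)
    print(grid.best_params_)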
Minimum Child Weight (min_child_weight):
o Definition: Minimum sum of instance weight (the hessian) required in a child
node. It helps prevent overfitting by keeping splits from producing very small
child nodes.
o Where to use: Ensures that each leaf node has a minimum number of instances, thus
reducing the complexity of the model.
o When to use: Higher values make the algorithm more conservative and reduce the
risk of overfitting.
o XGBoost Hyperparameter: min_child_weight
o Recommended values: Typically in the range [1, 10].
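A minimal sketch contrasting the default with a more conservative setting:

    from xgboost import XGBClassifier

    # 1 is the XGBoost default; larger values veto splits that would
    # create small, noise-driven leaves.
    default_model = XGBClassifier(min_child_weight=1)
    conservative_model = XGBClassifier(min_child_weight=5)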
Subsample:
o Definition: Fraction of observations to be randomly sampled for each tree. It
introduces randomness and reduces overfitting.
o Where to use: Controls the randomness of the data sampling process for each tree.
o When to use: Lower values make the model more robust to noise but may lead to
underfitting.
o XGBoost Hyperparameter: subsample
o Recommended values: Typically in the range [0.5, 1.0].
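A minimal sketch; 0.8 is a common starting point:

    from xgboost import XGBClassifier

    # Each tree is grown on a random 80% of the rows (without replacement),
    # which decorrelates the trees and acts as a regularizer.
    model = XGBClassifier(subsample=0.8)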
Column Sample by Tree (colsample_bytree):
o Definition: Fraction of features to be randomly sampled for each tree. It introduces
randomness and reduces overfitting.
o Where to use: Controls the randomness of feature selection for each tree.
o When to use: Lower values reduce overfitting by introducing more randomness in
feature selection.
o XGBoost Hyperparameter: colsample_bytree
o Recommended values: Typically in the range [0.5, 1.0].
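A minimal sketch combining row and column subsampling:

    from xgboost import XGBClassifier

    # Each tree sees a random 80% of the columns; together with subsample
    # this randomizes both axes of the training data.
    model = XGBClassifier(subsample=0.8, colsample_bytree=0.8)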
Gamma:
o Definition: Minimum loss reduction required to make a further partition on a leaf
node. It acts as regularization by controlling the complexity of trees.
o Where to use: Helps prevent overfitting by penalizing overly complex trees.
o When to use: Higher values make the algorithm more conservative.
o XGBoost Hyperparameter: gamma
o Recommended values: Typically in the range [0, 0.2].
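A minimal sketch of a mildly conservative setting:

    from xgboost import XGBClassifier

    # A candidate split is kept only if it reduces the loss by at least
    # gamma; the default of 0 accepts any improving split.
    model = XGBClassifier(gamma=0.1)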
Regularization Parameters (reg_alpha and reg_lambda):
o Definition: L1 and L2 regularization terms applied to the weights. They help prevent
overfitting by penalizing large parameter values.
o Where to use: Controls the amount of regularization applied to the model.
o When to use: Increase values to increase regularization and reduce overfitting.
o XGBoost Hyperparameters: reg_alpha, reg_lambda
o Recommended values: Typically in the range [0, 0.5].
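A minimal sketch setting both terms through the scikit-learn wrapper:

    from xgboost import XGBClassifier

    # reg_alpha (L1) can push leaf weights to exactly zero, while
    # reg_lambda (L2, default 1) shrinks them smoothly toward zero.
    model = XGBClassifier(reg_alpha=0.1, reg_lambda=1.0)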
Lambda:
o Definition: L2 regularization term on weights. It penalizes large coefficients and
helps prevent overfitting.
o Where to use: The native-API name for the same L2 term that the scikit-learn
wrapper exposes as reg_lambda; the two are aliases, so set one or the other,
not both.
o When to use: Increase it to shrink leaf weights more strongly and further
control overfitting.
o XGBoost Hyperparameter: lambda
o Recommended values: Typically in the range [0, 0.5].
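In the native (non-scikit-learn) API the term is passed as "lambda" in the
params dict; a minimal sketch (dataset purely illustrative):

    import xgboost as xgb
    from sklearn.datasets import make_classification

    X, y = make_classification(n_samples=500, random_state=0)
    dtrain = xgb.DMatrix(X, label=y)

    # "lambda" is a reserved word in Python, which is why the native API
    # takes it as a dict key rather than a keyword argument.
    params = {"objective": "binary:logistic", "lambda": 1.0}
    booster = xgb.train(params, dtrain, num_boost_round=100)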
Alpha:
o Definition: L1 regularization term on weights. It encourages sparsity in the weight
vectors.
o Where to use: The native-API name for the same L1 term that the scikit-learn
wrapper exposes as reg_alpha; the two are aliases.
o When to use: Useful when dealing with high-dimensional data or when you suspect
that many features are irrelevant.
o XGBoost Hyperparameter: alpha
o Recommended values: Typically in the range [0, 0.5].
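A sketch in the native API, with a deliberately wide feature set (the dataset
and sizes are illustrative only):

    import xgboost as xgb
    from sklearn.datasets import make_classification

    # A wide, mostly-noisy feature set: 200 columns, 10 informative.
    X, y = make_classification(n_samples=500, n_features=200,
                               n_informative=10, random_state=0)
    dtrain = xgb.DMatrix(X, label=y)

    # L1 on leaf weights encourages sparse trees when many features are noise.
    params = {"objective": "binary:logistic", "alpha": 0.1}
    booster = xgb.train(params, dtrain, num_boost_round=100)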
Scale Pos Weight:
o Definition: Controls the balance of positive and negative weights. It is useful for
imbalanced classification tasks.
o Where to use: Can be used to address class imbalance by assigning different weights
to positive and negative examples.
o When to use: Relevant for binary classification tasks with imbalanced class
distributions; for multi-class problems, per-instance sample weights serve the
same purpose.
o XGBoost Hyperparameter: scale_pos_weight
o Recommended values: Typically set to the ratio of negative examples to positive
examples.
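A minimal sketch of computing that ratio from the training labels (the
imbalanced dataset is generated purely for illustration):

    import numpy as np
    from sklearn.datasets import make_classification
    from xgboost import XGBClassifier

    # Roughly 95/5 class imbalance.
    X, y = make_classification(n_samples=2000, weights=[0.95], random_state=0)

    # The usual heuristic: number of negatives divided by number of positives.
    ratio = float(np.sum(y == 0)) / np.sum(y == 1)
    model = XGBClassifier(scale_pos_weight=ratio)
    model.fit(X, y)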