INT247
Machine Learning Foundation
Model Evaluation and Hyperparameter Tuning

Streamlining Workflows with Pipelines
• A pipeline allows us to fit a model that includes an arbitrary number of transformation steps and then apply it to make predictions about new data (a pipeline code sketch follows these slides).

Model Evaluation
• One of the key steps in building an ML model is to estimate its performance.
• A model can suffer from underfitting if it is too simple (high bias).
• A model can suffer from overfitting if it is too complex (high variance).
• To find an acceptable bias-variance tradeoff, the model should be evaluated carefully.
• Holdout cross validation and k-fold cross validation help us obtain reliable estimates of the model's generalization error.

Holdout Method
• Here we split the initial dataset into separate training and test datasets: the former is used to train the model, and the latter is used to estimate its performance.
• We are also interested in tuning and comparing different parameter settings to further improve the performance; this process is called model selection.
• Model selection refers to a given ML problem for which we want to select the optimal values of the tuning parameters, also called hyperparameters.
• However, if we reuse the same test dataset over and over again during model selection, it effectively becomes part of the training data, and the model is more likely to overfit.
• A better way of using the holdout method for model selection is to separate the data into three parts: a training set, a validation set and a test set (a split sketch also follows these slides).

Holdout Method
• The training set is used to train the model, and the performance on the validation set is used for model selection.
• The advantage of having a test set that the model has not seen during the training and model selection steps is that we obtain a less biased estimate of its ability to generalize to new data.

Holdout Method
• A disadvantage of the holdout method is that the performance estimate is sensitive to how we partition the data into training and validation subsets; the estimate will vary for different samples of the data.

K-Fold Cross Validation
• In k-fold cross validation, we randomly split the training dataset into k folds without replacement, where k-1 folds are used for model training and one fold is used for testing.
• This procedure is repeated k times so that we obtain k models and k performance estimates.
• Since k-fold cross validation is a resampling technique without replacement, each sample point is part of a training and a test dataset exactly once.
• Thus, it yields a lower-variance estimate of the model's performance than the holdout method.

K-Fold Cross Validation
• A special case of k-fold cross validation is the leave-one-out (LOO) cross validation method.
• In LOO, we set the number of folds equal to the number of training samples, so that only one sample is used for testing during each iteration.
• This is the recommended approach for working with small datasets.
• A slight improvement over the standard k-fold cross validation approach is stratified k-fold cross validation, which can yield better bias and variance estimates, especially in cases of unequal class proportions (a cross-validation sketch follows these slides).
• In stratified cross validation, the class proportions are preserved in each fold to ensure that each fold is representative of the class proportions in the training dataset.

Debugging Algorithms with Learning and Validation Curves
• There are two basic diagnostic tools that help improve the performance of a learning algorithm: learning curves and validation curves (also sketched in code after these slides).
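The pipeline idea can be made concrete with scikit-learn's Pipeline. The sketch below is a minimal, hedged example; the breast-cancer dataset, the StandardScaler/PCA/LogisticRegression steps and the split parameters are illustrative assumptions, not part of the slides.

```python
# Minimal pipeline sketch (dataset and steps are assumed for illustration).
from sklearn.datasets import load_breast_cancer
from sklearn.decomposition import PCA
from sklearn.linear_model import LogisticRegression
from sklearn.model_selection import train_test_split
from sklearn.pipeline import make_pipeline
from sklearn.preprocessing import StandardScaler

X, y = load_breast_cancer(return_X_y=True)
X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.2, stratify=y, random_state=1)

# Chain an arbitrary number of transformation steps with a final estimator;
# fit() runs every step on the training data, score()/predict() on new data.
pipe = make_pipeline(StandardScaler(), PCA(n_components=2), LogisticRegression())
pipe.fit(X_train, y_train)
print('Test accuracy: %.3f' % pipe.score(X_test, y_test))
```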
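For the three-way holdout split used in model selection, here is a minimal sketch with scikit-learn's train_test_split; the 60/20/20 proportions and the dataset are assumptions chosen only for illustration.

```python
# Three-way holdout split sketch: train / validation / test (assumed 60/20/20).
from sklearn.datasets import load_breast_cancer
from sklearn.model_selection import train_test_split

X, y = load_breast_cancer(return_X_y=True)

# First hold out the test set, which is touched only once at the very end.
X_temp, X_test, y_temp, y_test = train_test_split(
    X, y, test_size=0.2, stratify=y, random_state=1)
# Then carve a validation set out of the remainder (0.25 * 0.8 = 0.2 of the data).
X_train, X_val, y_train, y_val = train_test_split(
    X_temp, y_temp, test_size=0.25, stratify=y_temp, random_state=1)

# Train candidate models on (X_train, y_train), pick the best on (X_val, y_val),
# and report the final, less biased generalization estimate on (X_test, y_test).
```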
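A compact sketch of (stratified) k-fold cross validation with scikit-learn follows; k = 10, the logistic-regression estimator and the dataset are assumed choices.

```python
# Stratified k-fold cross-validation sketch (k = 10 is an assumed choice).
from sklearn.datasets import load_breast_cancer
from sklearn.linear_model import LogisticRegression
from sklearn.model_selection import StratifiedKFold, cross_val_score

X, y = load_breast_cancer(return_X_y=True)
clf = LogisticRegression(max_iter=5000)

# StratifiedKFold preserves the class proportions in every fold;
# for plain k-fold use KFold, and for LOO use LeaveOneOut instead.
skf = StratifiedKFold(n_splits=10, shuffle=True, random_state=1)
scores = cross_val_score(clf, X, y, cv=skf)   # one accuracy per held-out fold
print('CV accuracy: %.3f +/- %.3f' % (scores.mean(), scores.std()))
```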
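The two diagnostic tools have direct scikit-learn counterparts, learning_curve and validation_curve. The sketch below only computes the curve values (no plotting); the pipeline, the C grid and the dataset are assumptions.

```python
# Learning-curve and validation-curve sketch (estimator and grids are assumed).
import numpy as np
from sklearn.datasets import load_breast_cancer
from sklearn.linear_model import LogisticRegression
from sklearn.model_selection import learning_curve, validation_curve
from sklearn.pipeline import make_pipeline
from sklearn.preprocessing import StandardScaler

X, y = load_breast_cancer(return_X_y=True)
clf = make_pipeline(StandardScaler(), LogisticRegression())

# Learning curve: training/validation accuracy vs. number of training samples
# (helps diagnose high bias vs. high variance).
sizes, train_scores, val_scores = learning_curve(
    clf, X, y, train_sizes=np.linspace(0.1, 1.0, 5), cv=10)
print(sizes, val_scores.mean(axis=1))

# Validation curve: accuracy vs. one hyperparameter value
# (here the inverse regularization strength C of logistic regression).
param_range = [0.001, 0.01, 0.1, 1.0, 10.0, 100.0]
train_scores, val_scores = validation_curve(
    clf, X, y, param_name='logisticregression__C',
    param_range=param_range, cv=10)
print(param_range, val_scores.mean(axis=1))
```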
Tuning Hyperparameters via Grid Search
• Grid search is a brute-force, exhaustive search paradigm: we specify a list of values for different hyperparameters, and the computer evaluates the model performance for each combination of those values to obtain the optimal set (a code sketch follows at the end).

Confusion Matrix
• A matrix that lays out the performance of a learning algorithm.
• It reports the counts of true positives, true negatives, false positives and false negatives (also sketched at the end).

Thanks
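Returning to the grid-search slide, a minimal GridSearchCV sketch follows; the SVM pipeline, the parameter grid and the dataset are illustrative assumptions.

```python
# Brute-force grid search sketch with GridSearchCV (model and grid are assumed).
from sklearn.datasets import load_breast_cancer
from sklearn.model_selection import GridSearchCV, train_test_split
from sklearn.pipeline import make_pipeline
from sklearn.preprocessing import StandardScaler
from sklearn.svm import SVC

X, y = load_breast_cancer(return_X_y=True)
X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.2, stratify=y, random_state=1)

pipe = make_pipeline(StandardScaler(), SVC())
param_grid = {'svc__C': [0.1, 1.0, 10.0, 100.0],
              'svc__kernel': ['linear', 'rbf']}   # every combination is tried

gs = GridSearchCV(pipe, param_grid, cv=10)        # k-fold CV per combination
gs.fit(X_train, y_train)
print(gs.best_params_, gs.best_score_)
print('Test accuracy: %.3f' % gs.score(X_test, y_test))
```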
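And for the confusion matrix, a short self-contained sketch using scikit-learn's confusion_matrix; the logistic-regression classifier and the dataset are assumptions chosen only to produce predictions.

```python
# Confusion-matrix sketch (classifier and dataset are assumed for illustration).
from sklearn.datasets import load_breast_cancer
from sklearn.linear_model import LogisticRegression
from sklearn.metrics import confusion_matrix
from sklearn.model_selection import train_test_split
from sklearn.pipeline import make_pipeline
from sklearn.preprocessing import StandardScaler

X, y = load_breast_cancer(return_X_y=True)
X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.2, stratify=y, random_state=1)

clf = make_pipeline(StandardScaler(), LogisticRegression()).fit(X_train, y_train)
y_pred = clf.predict(X_test)

# scikit-learn's convention: rows = true classes, columns = predicted classes,
# so for binary labels (0 = negative, 1 = positive) the layout is
# [[TN, FP],
#  [FN, TP]].
print(confusion_matrix(y_true=y_test, y_pred=y_pred))
```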