Lecture-4 Model Evaluation
Beware of your deeds in this life, for they are adding up in the next.
Agenda:
• A Quick Review: Machine Learning Models
• Model Validation
• Objective Function and Cost Functions
• Predictive vs. Descriptive Models
• Model Evaluation Methods
• Hold-Out Method
• Cross Validation
• Types of Cross Validation
• Bootstrap Sampling
• Lazy vs. Eager Learners
• Activity
Machine Learning Models
• Some concepts, e.g. classification tasks, will be discussed in detail later
• Fitting a model:
• Learning is done to fit a model to the data
• The goal is the best representation of the data, one that can answer unknown cases
• Generic Model -> Trained Model (with learned model parameters)
• The trained model is prepared to answer facts that are already hidden inside the recorded observations
Model Validation:
• The process by which we ensure that our models can perform acceptably in “the real world”
• In more technical terms: model validation allows you to predict how your model will perform on datasets not used during training
Training vs. Testing Error:
• The model learns from the mistakes it makes during the training process
• During training, questions and their answers are shown to the model, and it improves its learning on the basis of the errors it makes
• Just like a student receiving feedback in class
• Training errors occur during the learning (training) process of the model
• We want to reduce these errors
• Test data is provided to check the model's performance before deployment
• E.g., the R-squared metric we used in Regression
Model Validation (Recap):
Training Error: obtained by calculating the classification error of a model on the same data the model was trained on.
Test Error: obtained by using two completely disjoint datasets: one to train the model and the other to calculate the classification error. Both datasets need to have values for y. The first dataset is called the training data and the second the test data.
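A minimal sketch of training vs. test error (an assumed example, not from the slides), using scikit-learn's built-in Iris dataset and a decision tree classifier; the dataset and model choice are illustrative only.

import numpy as np
from sklearn.datasets import load_iris
from sklearn.model_selection import train_test_split
from sklearn.tree import DecisionTreeClassifier
from sklearn.metrics import accuracy_score

X, y = load_iris(return_X_y=True)

# Two disjoint sets: training data and test data (both have labels y)
X_train, X_test, y_train, y_test = train_test_split(X, y, test_size=0.3, random_state=42)

model = DecisionTreeClassifier(random_state=42).fit(X_train, y_train)

# Training error: error on the same data the model was trained on
train_error = 1 - accuracy_score(y_train, model.predict(X_train))
# Test error: error on data the model never saw during training
test_error = 1 - accuracy_score(y_test, model.predict(X_test))

print(f"Training error: {train_error:.3f}, Test error: {test_error:.3f}")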
Objective Function and Cost Function:
• Objective Function
• Objective: what do we primarily want to achieve?
• Our objective is to minimize the prediction errors
• Takes the data and the model and attempts to find the parameter values that maximize the reward (or, equivalently, minimize the loss)
• Linear Regression: we tried to find the optimized output (slope and intercept)
• The values of the model parameters are found using some optimization technique (OLS/Least Squares, Gradient Descent, etc.)
• Cost Function / Error Function
• Error Function: the error on a single example during training
• Cost Function: the accumulated error calculated over all examples
*We want to minimize the cost function, and this is done by optimizing the objective function using some optimization strategy
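As a small illustration (not from the slides), the sketch below computes the mean-squared-error cost of a simple linear model y ≈ w*x + b over all examples and performs gradient-descent updates of the parameters; the data values and learning rate are assumed for the example.

import numpy as np

x = np.array([1.0, 2.0, 3.0, 4.0])   # inputs
y = np.array([2.1, 3.9, 6.2, 8.1])   # observed targets
w, b = 0.0, 0.0                      # model parameters (slope, intercept)
lr = 0.01                            # learning rate

for _ in range(1000):
    pred = w * x + b                 # model predictions
    errors = pred - y                # error on each single example
    cost = np.mean(errors ** 2)      # cost: accumulated (mean squared) error over all examples
    # Gradient-descent step on the objective (minimize the cost)
    w -= lr * 2 * np.mean(errors * x)
    b -= lr * 2 * np.mean(errors)

print(f"slope w = {w:.2f}, intercept b = {b:.2f}, final cost = {cost:.4f}")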
Machine Learning Models
• Need training data to learn
• Need test data to evaluate performance
• Predictive vs. Descriptive Models
• Predictive Models (supervised learning – map input features to a target)
• Descriptive Models (unsupervised learning – learn hidden facts from the data)
Predictive vs. Descriptive Models:
• Predictive Models
• Hold predictive strength
• Build a relationship from the observations, in the form of a function, that is used to produce outputs for unknown instances
• Supervised Learning
• Descriptive Models
• Do not give an output or answer, but describe the provided instances
• They show the learned relationships and hidden patterns inside the provided data
• Retrieve hidden facts from the data that were previously unknown to us
• No fixed answer
• Unsupervised Learning
Training a Model:
Hold-Out Method
• Sometimes a third partition, i.e. validation data, is also factored out
• Training–validation–test data
• The validation data is used in place of the test data after each iteration of training to tune the model parameters
• The model is tested on the validation data, and then training continues to tune the model parameters for better performance
• The test data is used only once, at the end, to evaluate the performance of the final model
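A minimal sketch of the training–validation–test split, assuming scikit-learn and an illustrative feature matrix X with labels y (the data and split ratios are assumptions, not from the slides); train_test_split is simply applied twice to carve out the three partitions.

import numpy as np
from sklearn.model_selection import train_test_split

rng = np.random.default_rng(0)
X = rng.normal(size=(1000, 5))        # illustrative feature matrix
y = rng.integers(0, 2, size=1000)     # illustrative binary labels

# First split off the final test set (used only once, at the end)
X_temp, X_test, y_temp, y_test = train_test_split(X, y, test_size=0.2, random_state=42)
# Then split the remainder into training and validation data
X_train, X_val, y_train, y_val = train_test_split(X_temp, y_temp, test_size=0.25, random_state=42)

print(len(X_train), len(X_val), len(X_test))   # 600 / 200 / 200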
Hold on! A point to ponder
• We validated our model only once
• What if the split we made just happened to be very conducive to this model?
• Didn't we significantly reduce the size of our training dataset by splitting it like that?
Hold-Out Method – Limitations:
• Very restrictive when the volume of data is small
• Partitioning can result in unbalanced class representation in the training data
• Dividing the data into 3 partitions can result in uneven representation of certain types of instances (classification: the unbalanced-classes problem)
• The test data may also be unbalanced
• Stratified random sampling may be used to avoid the unbalanced-class problem to some extent
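A minimal sketch of a stratified hold-out split with scikit-learn (the data is assumed for illustration, not from the slides): passing the labels to the stratify parameter keeps the class proportions roughly equal in both partitions.

import numpy as np
from sklearn.model_selection import train_test_split

rng = np.random.default_rng(1)
y = np.array([0] * 900 + [1] * 100)   # heavily unbalanced classes
X = rng.normal(size=(1000, 3))

# Stratified random sampling: class ratios are preserved in train and test
X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.2, stratify=y, random_state=42)

print(y_train.mean(), y_test.mean())  # both close to 0.1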
Cross Validation:
Cross validation is the process by which machine learning models are evaluated on a separate set, known as the validation set or hold-out set, with which the best hyper-parameters are found. This gives us the optimal model, which can be used on future data and is capable of yielding the best possible predictions.
K-Fold Cross Validation Method
• A repeated application of hold-out
• K iterations of hold-out validation are undertaken
• The dataset is partitioned into k disjoint random partitions called folds
• One fold is used as the test set while the remaining k-1 folds are used for training
• The process is repeated k times, each time with the next fold as the test fold
(Figure: k-fold cross validation with K = 10)
K-Fold Cross Validation Method
• 5-fold and 10-fold cross validation are very popular
• In 10-fold CV, 10 folds of data are created by random sampling
• Each fold is disjoint from every other fold
• One after the other, one fold is taken as the test set while the remaining 9 folds are used for training the model
• After k iterations, the average model performance across all folds is reported
• A special variant of k-fold validation is LOOCV (Leave-One-Out Cross Validation)
• In LOOCV, only one record is left out for validation and the remaining n-1 records are used for training in each iteration (n is the total number of records); see the sketch below
LOOCV
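A minimal sketch of k-fold cross validation and LOOCV with scikit-learn (the toy dataset and model choice are assumptions, not from the slides): each fold in turn is held out as the test set and the scores are averaged.

from sklearn.datasets import load_iris
from sklearn.linear_model import LogisticRegression
from sklearn.model_selection import KFold, LeaveOneOut, cross_val_score

X, y = load_iris(return_X_y=True)
model = LogisticRegression(max_iter=1000)

# 10-fold cross validation: 10 disjoint random folds, average score reported
kfold_scores = cross_val_score(model, X, y, cv=KFold(n_splits=10, shuffle=True, random_state=42))
print("10-fold mean accuracy:", kfold_scores.mean())

# LOOCV: n iterations, each leaving exactly one record out for validation
loo_scores = cross_val_score(model, X, y, cv=LeaveOneOut())
print("LOOCV mean accuracy:", loo_scores.mean())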
Bootstrap Sampling
• Used to pick training and test data out of a dataset
• Usually employed when the dataset is very small
• It randomly picks n instances from the dataset with replacement
• An instance may therefore be duplicated in the training set
• It can create a virtually unlimited number of training datasets of n instances from a dataset of size n
• Records may be repeated multiple times
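A minimal sketch of one bootstrap sample, assuming NumPy (using the left-out, "out-of-bag" records as test data is a common convention, not stated on the slide):

import numpy as np

rng = np.random.default_rng(7)
n = 10
data = np.arange(n)                  # a tiny dataset of n records

# Draw n indices with replacement: some records repeat, some are left out
boot_idx = rng.integers(0, n, size=n)
train = data[boot_idx]               # bootstrap training sample (duplicates allowed)
test = np.setdiff1d(data, boot_idx)  # out-of-bag records, usable as test data

print("train:", train)
print("test (out-of-bag):", test)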
Lazy vs. Eager Learners
• Eager Learners
• Build a generalized representation of the learning into a model
• Take more time in training
• Follow the typical ML steps: Abstraction -> Generalization -> Model
• Fast at prediction time
• Lazy Learners
• Skip the learning phase
• No abstraction or generalization
• Depend on the training data itself for predictions
• Also called instance-based learning or non-parametric learning
• Take more time at prediction time
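As a small illustration (an assumed example, not from the slides): k-nearest neighbours is a typical lazy, instance-based learner whose fit step essentially stores the data, while a decision tree is an eager learner that builds a generalized model up front.

from sklearn.datasets import load_iris
from sklearn.neighbors import KNeighborsClassifier   # lazy: stores instances, works at prediction time
from sklearn.tree import DecisionTreeClassifier      # eager: builds a model during training

X, y = load_iris(return_X_y=True)

lazy = KNeighborsClassifier(n_neighbors=3).fit(X, y)      # "training" ~ memorizing the data
eager = DecisionTreeClassifier(random_state=0).fit(X, y)  # training builds the tree

print(lazy.predict(X[:3]))    # prediction searches the stored instances
print(eager.predict(X[:3]))   # prediction just walks the pre-built tree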
Supervised Learning Models – Capabilities: Overfitting vs. Underfitting of Models
• Underfitting – the model is too simple to capture the essential details
• Overfitting – the model is overly complex and lacks generalization
Supervised Learning Models - Errors
• Model Error
• An incorrect prediction
• Errors due to Bias
• Generally result from underfitting
• Errors due to Variance
• Result from overfitting
• A low-bias and low-variance model is desirable
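A minimal sketch (assumed example, not from the slides) that contrasts underfitting and overfitting on noisy data by comparing training and test error for polynomial models of different degrees; low training error with much higher test error signals variance (overfitting), while high error on both signals bias (underfitting).

import numpy as np
from sklearn.pipeline import make_pipeline
from sklearn.preprocessing import PolynomialFeatures
from sklearn.linear_model import LinearRegression
from sklearn.model_selection import train_test_split
from sklearn.metrics import mean_squared_error

rng = np.random.default_rng(0)
x = np.sort(rng.uniform(0, 3, 60))
y = np.sin(2 * x) + rng.normal(scale=0.2, size=x.size)   # noisy non-linear data
X = x.reshape(-1, 1)
X_tr, X_te, y_tr, y_te = train_test_split(X, y, test_size=0.3, random_state=0)

for degree in (1, 4, 15):   # too simple / reasonable / overly complex
    model = make_pipeline(PolynomialFeatures(degree), LinearRegression()).fit(X_tr, y_tr)
    tr_err = mean_squared_error(y_tr, model.predict(X_tr))
    te_err = mean_squared_error(y_te, model.predict(X_te))
    print(f"degree {degree:>2}: train MSE {tr_err:.3f}, test MSE {te_err:.3f}")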
Thank you
Any Questions?