Confusion Matrix and Cross-Validation
Cross Validation
Confusion Matrix
• Machine learning models are increasingly used in
various applications to classify data into different
categories.
• However, evaluating the performance of these models
is crucial to ensure their accuracy and reliability. One
essential tool in this evaluation process is the confusion
matrix.
• The confusion matrix is a table used to evaluate the performance of a classification model on a given set of test data.
• It can only be computed when the true values for the test data are known. The matrix itself is easy to understand, but the related terminology can be confusing.
• Because it shows the model's errors in matrix form, it is also known as an error matrix.
Some features of Confusion matrix are given below:
• Classification models have multiple categorical outputs. Most error measures calculate only the total error of the model, so we cannot identify individual instances of errors.
• The model might misclassify some categories more than
others, but we cannot see this using a standard accuracy
measure.
• Furthermore, suppose there is a significant class imbalance in the given data, i.e., one class has many more instances than the others. A model might then predict the majority class for all cases and still achieve a high accuracy score, even though it never predicts the minority classes correctly. This is where confusion matrices are useful.
A confusion matrix presents a table layout of the different
outcomes of the prediction and results of a classification
problem and helps visualize its outcomes.
• Accuracy: Accuracy is the ratio of correctly classified cases (true positives and true negatives) to the total number of cases.
In this case (for an example confusion matrix with TP = 86, TN = 79, FP = 12, FN = 10),
Accuracy = (86 + 79) / (86 + 79 + 12 + 10) = 0.8824 = 88.24%
• Precision: Precision measures the model's ability to classify positive values correctly. It is the true positives divided by the total number of predicted positive values.
In this case,
Precision = 86 / (86 + 12) = 0.8776 = 87.76%
• Recall: Recall measures the model's ability to predict positive values: "How often does the model predict the correct positive values?" It is the true positives divided by the total number of actual positive values.
In this case,
Recall = 86 / (86 + 10) = 0.8958 = 89.58%
• F1-Score: The F1-score is the harmonic mean of recall and precision. It is useful when you need to take both precision and recall into account.
In this case,
F1-Score = (2 × 0.8776 × 0.8958) / (0.8776 + 0.8958) = 0.8866 = 88.66%
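These calculations can be reproduced directly in code. A minimal Python sketch, assuming the same example counts as above (TP = 86, TN = 79, FP = 12, FN = 10):

# Example counts assumed from the worked example above (not a general dataset)
tp, tn, fp, fn = 86, 79, 12, 10

accuracy  = (tp + tn) / (tp + tn + fp + fn)                 # 0.8824
precision = tp / (tp + fp)                                  # 0.8776
recall    = tp / (tp + fn)                                  # 0.8958
f1        = 2 * precision * recall / (precision + recall)   # 0.8866

print(f"Accuracy:  {accuracy:.2%}")
print(f"Precision: {precision:.2%}")
print(f"Recall:    {recall:.2%}")
print(f"F1-Score:  {f1:.2%}")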
Scaling a Confusion Matrix
• To scale a confusion matrix to more than two classes, increase the number of rows and columns. All the true positives lie along the diagonal; the other values are false positives or false negatives.
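As an illustration of how the matrix scales, the sketch below builds a three-class confusion matrix with scikit-learn; the labels are made up for demonstration and are not from the example above.

from sklearn.metrics import confusion_matrix

# Hypothetical three-class example: true vs. predicted labels
y_true = ["cat", "dog", "bird", "cat", "dog", "bird", "cat", "dog"]
y_pred = ["cat", "dog", "cat",  "cat", "bird", "bird", "cat", "dog"]

labels = ["bird", "cat", "dog"]
cm = confusion_matrix(y_true, y_pred, labels=labels)

# Rows are actual classes, columns are predicted classes;
# correct predictions for each class lie on the diagonal.
print(labels)
print(cm)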
• https://www.simplilearn.com/tutorials/machine-learning-tutorial/confusion-matrix-machine-learning
Cross Validation
• Cross-validation is a technique for validating the model's efficiency by training it on a subset of the input data and testing it on a previously unseen subset of the input data.
• We can also say that it is a technique to check
how a statistical model generalizes to an
independent dataset.
• In machine learning, there is always a need to test the stability of the model: we cannot judge a model based only on the training dataset it was fitted on.
• For this purpose, we reserve a particular sample of the
dataset, which was not part of the training dataset.
• After that, we test our model on that sample before
deployment, and this complete process comes under
cross-validation.
• This is something different from the general train-test
split.
Steps of Cross-Validation
• Reserve a subset of the dataset as a validation set.
• Provide the training to the model using the training
dataset.
• Now, evaluate model performance using the validation set. If the model performs well on the validation set, proceed to the next steps; otherwise, check for issues.
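These steps can be sketched in Python; the dataset (Iris), the model, and the 80/20 split below are illustrative assumptions rather than part of the original example.

from sklearn.datasets import load_iris
from sklearn.linear_model import LogisticRegression
from sklearn.model_selection import train_test_split

X, y = load_iris(return_X_y=True)   # illustrative dataset

# Step 1: reserve a subset of the dataset as a validation set
X_train, X_val, y_train, y_val = train_test_split(X, y, test_size=0.2, random_state=0)

# Step 2: train the model using the training dataset
model = LogisticRegression(max_iter=1000).fit(X_train, y_train)

# Step 3: evaluate model performance using the validation set
print("Validation accuracy:", model.score(X_val, y_val))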
Methods used for Cross-Validation
1.Validation Set Approach
2.Leave-P-out cross-validation
3.Leave one out cross-validation
4.K-fold cross-validation
5.Stratified k-fold cross-validation
Validation Set Approach
• In the validation set approach, we divide the input dataset into a training set and a test (validation) set, with each subset receiving 50% of the data.
• Its big disadvantage is that only 50% of the dataset is used to train the model, so the model may miss important information in the data. It also tends to produce an underfitted model.
Leave-P-out cross-validation
• In this approach, p data points are left out of the training data. If there are n data points in the original input dataset, then n − p data points are used as the training set and the remaining p data points as the validation set. This process is repeated for all possible combinations of p points, and the average error is calculated to measure the effectiveness of the model.
• The disadvantage of this technique is that it can be computationally expensive for large p.
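A small sketch using scikit-learn's LeavePOut; the tiny array and the choice p = 2 are illustrative. It also shows why the method gets expensive: the number of splits grows combinatorially with p.

import numpy as np
from sklearn.model_selection import LeavePOut

X = np.arange(8).reshape(4, 2)   # 4 illustrative data points, 2 features each
lpo = LeavePOut(p=2)             # leave 2 points out per split

# For n = 4 and p = 2 there are C(4, 2) = 6 splits, which is why large p is costly
for train_idx, test_idx in lpo.split(X):
    print("train:", train_idx, "test:", test_idx)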
Leave one out cross-validation
• This method is similar to leave-p-out cross-validation, but instead of p, only one data point is taken out of the training data. For each learning set, a single data point is reserved for testing and the remaining data is used to train the model. This process repeats for each data point, so for n samples we get n different training sets and n test sets.
• It has the following features:
• In this approach, the bias is minimum as all the data points are used.
• The process is executed n times; hence the execution time is high.
• This approach leads to high variation in testing the effectiveness of the model, as we iteratively check against a single data point.
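A minimal leave-one-out sketch with scikit-learn; the toy dataset and the linear model are illustrative assumptions.

import numpy as np
from sklearn.linear_model import LinearRegression
from sklearn.metrics import mean_squared_error
from sklearn.model_selection import LeaveOneOut

# Illustrative toy data: 5 points, so LeaveOneOut produces 5 splits
X = np.array([[1], [2], [3], [4], [5]])
y = np.array([1.1, 1.9, 3.2, 3.9, 5.1])

errors = []
for train_idx, test_idx in LeaveOneOut().split(X):
    # Each iteration holds out exactly one data point for testing
    model = LinearRegression().fit(X[train_idx], y[train_idx])
    pred = model.predict(X[test_idx])
    errors.append(mean_squared_error(y[test_idx], pred))

print("Average MSE over", len(errors), "leave-one-out splits:", np.mean(errors))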
K-Fold Cross-Validation
• The k-fold cross-validation approach divides the input dataset into k groups of samples of equal size, called folds. For each learning set, the prediction function uses k − 1 folds for training, and the remaining fold is used as the test set. This is a very popular cross-validation approach because it is easy to understand, and the output is less biased than with other methods.
Steps for K-Fold Cross-Validation
• Split the input dataset into K groups
• For each group:
• Take one group as the reserve or test data set.
• Use remaining groups as the training dataset
• Fit the model on the training set and evaluate the performance of
the model using the test set.
• Let's take an example of 5-fold cross-validation: the dataset is grouped into 5 folds. In the 1st iteration, the first fold is reserved for testing the model, and the rest are used for training. In the 2nd iteration, the second fold is used to test the model, and the rest are used for training. This process continues until each fold has been used once as the test fold.
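The k-fold procedure can be sketched as follows; the Iris dataset, the logistic-regression model, and the shuffling choice are illustrative assumptions.

import numpy as np
from sklearn.datasets import load_iris
from sklearn.linear_model import LogisticRegression
from sklearn.model_selection import KFold

X, y = load_iris(return_X_y=True)                  # illustrative dataset
kf = KFold(n_splits=5, shuffle=True, random_state=0)

scores = []
for fold, (train_idx, test_idx) in enumerate(kf.split(X), start=1):
    # One fold is reserved for testing; the remaining k-1 folds train the model
    model = LogisticRegression(max_iter=1000).fit(X[train_idx], y[train_idx])
    scores.append(model.score(X[test_idx], y[test_idx]))
    print(f"Fold {fold}: accuracy = {scores[-1]:.3f}")

print("Mean accuracy:", np.mean(scores))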
Stratified k-fold cross-validation
• This technique is similar to k-fold cross-validation, with a small change: it works on the concept of stratification, a process of rearranging the data to ensure that each fold or group is a good representative of the complete dataset. It is one of the best approaches for dealing with bias and variance.
• It can be understood with an example of housing prices, where the prices of some houses can be much higher than those of others. To handle such situations, a stratified k-fold cross-validation technique is useful.
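A brief sketch with scikit-learn's StratifiedKFold on an imbalanced, made-up label vector, showing that each fold keeps roughly the same class ratio as the full dataset.

import numpy as np
from sklearn.model_selection import StratifiedKFold

X = np.arange(20).reshape(10, 2)                 # 10 illustrative samples
y = np.array([0, 0, 0, 0, 0, 0, 0, 0, 1, 1])     # imbalanced: 80% class 0, 20% class 1

skf = StratifiedKFold(n_splits=2)
for fold, (train_idx, test_idx) in enumerate(skf.split(X, y), start=1):
    # Each fold preserves the 80/20 class ratio of the full dataset
    print(f"Fold {fold}: test labels = {y[test_idx]}")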
Holdout Method
• This is the simplest cross-validation technique of all. We hold out a subset of the data, train the model on the remaining part of the dataset, and use the held-out subset to obtain prediction results.
• The error observed in this process indicates how well the model will perform on unknown data. Although this approach is simple to perform, it still suffers from high variance and can sometimes produce misleading results.
Comparison of Cross-Validation to Train/Test Split
• Train/test split: The input data is divided into two parts, a training set and a test set, in a ratio such as 70:30 or 80:20. It provides high variance, which is one of its biggest disadvantages.
• Training data: The training data is used to train the model, and the dependent variable is known.
• Test data: The test data is used to make predictions from the model that has already been trained on the training data. It has the same features as the training data but is not part of it.
• Cross-validation: It overcomes the disadvantage of the train/test split by splitting the dataset into several groups of train/test splits and averaging the results. It can be used when we want to optimize a model trained on the training dataset for the best performance. It is more efficient than a single train/test split because every observation is used for both training and testing.
Limitations of Cross-Validation
• Under ideal conditions, cross-validation provides the optimum output, but with inconsistent data it may produce drastically different results. This is one of its big disadvantages, as there is no certainty about the type of data encountered in machine learning.
• In predictive modeling, the data evolves over time, which can create differences between the training and validation sets. For example, if we create a model to predict stock market values and train it on the previous 5 years of stock prices, the actual values for the next 5 years may be drastically different, so it is difficult to expect correct output in such situations.
Applications of Cross-Validation
• This technique can be used to compare the
performance of different predictive modeling methods.
• It has great scope in the medical research field.
• It can also be used for meta-analysis, as it is already being used by data scientists in the field of medical statistics.
• https://www.youtube.com/watch?v=PF2wLKv2lsI&t=151s
• Given the dataset:

x:  1    2    3    4    5    6    7    8    9    10
y:  1.2  2.3  2.9  4.1  5.5  6.7  7.8  8.9  9.6  10.8

• We will perform 5-fold cross-validation to evaluate the performance of a linear regression model on this dataset.
• Cross-validation provides a more reliable estimate of model performance by partitioning the data into different subsets (folds) and evaluating the model's performance on unseen data from each fold.
Steps:
1. Split the data into 5 folds.
2. For each fold:
   • Use 4 folds for training and 1 fold for testing.
   • Fit a linear regression model to the training data.
   • Calculate the mean squared error (MSE) on the test fold.
3. Compute the average MSE across all folds as the overall performance metric.
• Let's perform this cross-validation.
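A minimal sketch of this procedure with scikit-learn; the shuffling and random seed below are assumptions, so the per-fold MSEs may differ slightly from the numbers reported on the next slide.

import numpy as np
from sklearn.linear_model import LinearRegression
from sklearn.metrics import mean_squared_error
from sklearn.model_selection import KFold

# Dataset from the slide above
X = np.array([[1], [2], [3], [4], [5], [6], [7], [8], [9], [10]])
y = np.array([1.2, 2.3, 2.9, 4.1, 5.5, 6.7, 7.8, 8.9, 9.6, 10.8])

# shuffle and random_state are illustrative choices, not given in the original
kf = KFold(n_splits=5, shuffle=True, random_state=42)

mse_per_fold = []
for fold, (train_idx, test_idx) in enumerate(kf.split(X), start=1):
    # Use 4 folds for training and 1 fold for testing
    model = LinearRegression().fit(X[train_idx], y[train_idx])
    mse = mean_squared_error(y[test_idx], model.predict(X[test_idx]))
    mse_per_fold.append(mse)
    print(f"Fold {fold}: MSE = {mse:.4f}")

print("Average MSE:", np.mean(mse_per_fold))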
Results of 5-Fold Cross-Validation
• Mean Squared Errors (MSE) for each fold:
• Fold 1: 0.0558
• Fold 2: 0.0452
• Fold 3: 0.1156
• Fold 4: 0.0093
• Fold 5: 0.0533
• Average MSE across all folds:
• Average MSE = 0.0558