
Confusion Matrix:

A confusion matrix provides a summary of the predictive results in a classification problem. Correct and incorrect predictions are summarized in a table with their values and broken down by each class.

Confusion Matrix for Binary Classification

Calculating a confusion matrix:

Let's take an example: we have a total of 10 cats and dogs, and our model predicts whether each animal is a cat or not.

Actual values = ['dog', 'cat', 'dog', 'cat', 'dog', 'dog', 'cat', 'dog', 'cat', 'dog']
Predicted values = ['dog', 'dog', 'dog', 'cat', 'dog', 'dog', 'cat', 'cat', 'cat', 'cat']

Remember: we describe the predicted values as Positive/Negative, and whether a prediction matches the actual value as True/False.

Definition of the Terms:

True Positive: You predicted positive and it's true. You predicted that the animal is a cat, and it actually is a cat.

True Negative: You predicted negative and it's true. You predicted that the animal is not a cat, and it actually is not (it's a dog).

False Positive (Type 1 Error): You predicted positive and it's false. You predicted that the animal is a cat, but it actually is not (it's a dog).

False Negative (Type 2 Error): You predicted negative and it's false. You predicted that the animal is not a cat, but it actually is a cat.
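
Counting these four outcomes for the lists above gives the confusion matrix entries. A minimal Python sketch of that count (an illustrative addition, treating 'cat' as the positive class):

actual    = ['dog', 'cat', 'dog', 'cat', 'dog', 'dog', 'cat', 'dog', 'cat', 'dog']
predicted = ['dog', 'dog', 'dog', 'cat', 'dog', 'dog', 'cat', 'cat', 'cat', 'cat']

# Count each of the four outcomes by comparing actual and predicted labels pairwise.
tp = sum(a == 'cat' and p == 'cat' for a, p in zip(actual, predicted))  # 3
tn = sum(a == 'dog' and p == 'dog' for a, p in zip(actual, predicted))  # 4
fp = sum(a == 'dog' and p == 'cat' for a, p in zip(actual, predicted))  # 2
fn = sum(a == 'cat' and p == 'dog' for a, p in zip(actual, predicted))  # 1

print(tp, tn, fp, fn)  # 3 4 2 1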

Classification Accuracy:
Classification Accuracy is given by the relation:
Accuracy = (TP + TN) / (TP + TN + FP + FN)

Recall (aka Sensitivity):

Recall is defined as the ratio of the total number of correctly classified positive classes divided by the total number of positive classes. Or, out of all the positive classes, how many did we predict correctly? Recall should be high.

Precision:
Precision is defined as the ratio of the total number of correctly classified positive classes divided by the total number of predicted positive classes. Or, out of all the predicted positive classes, how many did we predict correctly? Precision should be high.

Trick to remember: Precision has the Predicted results in the denominator.

F-score or F1-score:
It is difficult to compare two models with different Precision
and Recall. So to make them comparable, we use F-Score. It is
the Harmonic Mean of Precision and Recall. As compared to
Arithmetic Mean, Harmonic Mean punishes the extreme values
more. F-score should be high.
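
For example, with Precision = 1.0 and Recall = 0.1 (illustrative values, not taken from the cat and dog data), the arithmetic mean is 0.55, while the F-score is (2*1.0*0.1)/(1.0+0.1) ≈ 0.18, so one very poor metric cannot be hidden behind one very good metric.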

Specificity:
Specificity determines the proportion of actual negatives that
are correctly identified.
Example to interpret a confusion matrix:
Let's calculate the metrics using the cat and dog example above. From the two lists, TP = 3, TN = 4, FP = 2 and FN = 1.
Classification Accuracy:
Accuracy = (TP + TN) / (TP + TN + FP + FN) =
(3+4)/(3+4+2+1) = 0.70

Recall: Recall tells us, when the actual value is yes, how often the model predicts yes.
Recall = TP / (TP + FN) = 3/(3+1) = 0.75

Precision: Precision tells us, when the model predicts yes, how often it is correct.
Precision = TP / (TP + FP) = 3/(3+2) = 0.60

F-score:
F-score = (2*Recall*Precision)/(Recall+Precision) = (2*0.75*0.60)/(0.75+0.60) = 0.67

Specificity:
Specificity = TN / (TN + FP) = 4/(4+2) = 0.67
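
These numbers can also be reproduced with scikit-learn; the following is only an illustrative cross-check, assuming the library is installed:

from sklearn.metrics import (confusion_matrix, accuracy_score,
                             precision_score, recall_score, f1_score)

actual    = ['dog', 'cat', 'dog', 'cat', 'dog', 'dog', 'cat', 'dog', 'cat', 'dog']
predicted = ['dog', 'dog', 'dog', 'cat', 'dog', 'dog', 'cat', 'cat', 'cat', 'cat']

# With labels ordered [negative, positive], ravel() returns TN, FP, FN, TP.
tn, fp, fn, tp = confusion_matrix(actual, predicted, labels=['dog', 'cat']).ravel()

print(accuracy_score(actual, predicted))                    # 0.70
print(recall_score(actual, predicted, pos_label='cat'))     # 0.75
print(precision_score(actual, predicted, pos_label='cat'))  # 0.60
print(f1_score(actual, predicted, pos_label='cat'))         # ~0.67
print(tn / (tn + fp))                                       # specificity, ~0.67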

The AUC-ROC curve, or Area Under the Receiver Operating Characteristic curve, is a graphical representation of the performance of a binary classification model at various classification thresholds. It is commonly used in machine learning to assess the ability of a model to distinguish between two classes, typically the positive class (e.g., presence of a disease) and the negative class (e.g., absence of a disease).
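
A minimal sketch of how such a curve and its area might be computed with scikit-learn; the labels and scores below are made-up illustrations, not data from this example:

from sklearn.metrics import roc_curve, roc_auc_score

# Hypothetical ground-truth labels (1 = positive class) and model scores, for illustration only.
y_true  = [0, 0, 1, 1, 0, 1, 0, 1]
y_score = [0.1, 0.4, 0.35, 0.8, 0.2, 0.9, 0.6, 0.7]

fpr, tpr, thresholds = roc_curve(y_true, y_score)  # ROC points at each threshold
print(roc_auc_score(y_true, y_score))              # area under that curve
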
Environmental scientists want to solve a two-class classification problem for predicting whether a population contains a specific genetic variant. They can use a confusion matrix to determine in how many ways the machine learning classification model they're analyzing confuses the two classes. Assuming the scientists use 500 samples for their data analysis, a table is constructed for their predicted and actual values before calculating the confusion matrix.

                                    Predicted without the variant   Predicted with the variant
Actual number without the variant
Actual number with the variant
Total predicted value

After creating the matrix, the scientists analyze their sample data. Assume the scientists predict that 350 test samples contain the genetic variant and 150 samples don't. If they determine that the actual number of samples containing the variant is 305, then the actual number of samples without the variant is 195 (500 − 305). These values become the "true" values in the matrix, and the scientists enter the data in the table:
                                          Predicted without the variant   Predicted with the variant
Actual number without the variant = 195   True negative = 45              False positive = 150
Actual number with the variant = 305      False negative = 105            True positive = 200
Total predicted value                     150                             350
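
The original worked example stops at the populated matrix. As an illustrative extension (not part of the source text), the formulas from the cat and dog example can be applied to these counts:

# Applying the earlier formulas to the scientists' 500-sample confusion matrix.
tp, tn, fp, fn = 200, 45, 150, 105

accuracy    = (tp + tn) / (tp + tn + fp + fn)                 # (200 + 45) / 500 = 0.49
recall      = tp / (tp + fn)                                  # 200 / 305 ≈ 0.66
precision   = tp / (tp + fp)                                  # 200 / 350 ≈ 0.57
f_score     = 2 * recall * precision / (recall + precision)   # ≈ 0.61
specificity = tn / (tn + fp)                                  # 45 / 195 ≈ 0.23

print(accuracy, recall, precision, f_score, specificity)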
