
CHAPTER-11 EVALUATION

AI PROJECT CYCLE
Problem Scoping ----> Data Acquisition ----> Data Exploration ----> Modelling ----> Evaluation

Evaluation
 Evaluation is the final stage in the AI Project Cycle. Once a model
has been built and trained, it needs to go through proper testing so
that its efficiency and performance can be measured. Hence, the model
is tested with the help of Testing Data.
 Evaluation is the process of understanding the reliability and final
performance of an AI model by feeding the test dataset into the model
and comparing its output with the actual answers.
Q. We must keep in mind that it is not advisable to use the data
that we used to create the model to evaluate it. Why?
Or
What do you understand by the term 'Overfitting'? Explain.
Ans- Training data must not be used for evaluation because a model
simply memorizes the whole of the training data and therefore always
predicts the correct output whenever the training data is fed to it
again. But it gives very wrong answers when a new dataset is
introduced to it. This situation is known as overfitting.
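
To see this concretely, here is a minimal Python sketch (assuming
scikit-learn is installed; the toy dataset and classifier choice are
illustrative, not from these notes) of holding out Testing Data
before training:

# A minimal sketch, assuming scikit-learn is available.
# The toy dataset and classifier choice are illustrative only.
from sklearn.datasets import make_classification
from sklearn.model_selection import train_test_split
from sklearn.tree import DecisionTreeClassifier

# Toy data: 100 samples, 2 classes (say, football vs not football).
X, y = make_classification(n_samples=100, n_features=4, random_state=0)

# Hold out 30% of the data; the model never sees it during training.
X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.3, random_state=0)

model = DecisionTreeClassifier(random_state=0)
model.fit(X_train, y_train)

# A deep tree can memorize its training data (overfitting)...
print("Accuracy on training data:", model.score(X_train, y_train))
# ...so only the score on unseen testing data is an honest estimate.
print("Accuracy on testing data:", model.score(X_test, y_test))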
-----------------
Evaluation is based on the comparison of two things:
1. Prediction: The output given by the machine after training and
testing on the data is known as the Prediction. (Output of the
machine)
2. Reality: Reality is the real situation or actual scenario about
which the machine has made its prediction. (Reality or truth)

EXAMPLES OR SCENARIOS
1. Case 1
Is this a Football?

1. Prediction = YES
2. Reality = YES
3. True Positive
Here, we can see in the picture that it is a football. The model's
prediction is Yes, which means it is a football. The Prediction
matches the Reality. Hence, this condition is termed True Positive.

2. Case 2
Is this a Football?
1. Prediction = NO
2. Reality = NO
3. True Negative
Here, this is not an image of a football, hence the Reality is No.
In this case, the machine has also correctly predicted No.
Therefore, this condition is termed True Negative.

3. Case 3
Is this a Football?

1. Prediction = YES
2. Reality = NO
3. False Positive (Type 1 Error)
Here, the reality is that it is not a football, but the machine has
incorrectly predicted that it is a football. This case is termed
False Positive.

4. Case 4
Is this a Football?

1. Prediction = NO
2. Reality = YES
3. False Negative (Type 2 Error)

Here, the football appears in a different look, because of which the
Reality is Yes, but the machine has incorrectly predicted No, meaning
the machine predicts that it is not a football.
Therefore, this case is termed False Negative.
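
The four cases can be summed up in a small Python sketch (a minimal
illustration; the example pairs are made up, not from these notes):

# Classify each (prediction, reality) pair into one of the four cases.
def outcome(prediction: str, reality: str) -> str:
    if prediction == "YES" and reality == "YES":
        return "True Positive"
    if prediction == "NO" and reality == "NO":
        return "True Negative"
    if prediction == "YES" and reality == "NO":
        return "False Positive (Type 1 Error)"
    return "False Negative (Type 2 Error)"

# One illustrative pair for each of the four cases above.
cases = [("YES", "YES"), ("NO", "NO"), ("YES", "NO"), ("NO", "YES")]
for pred, real in cases:
    print(f"Prediction={pred}, Reality={real} -> {outcome(pred, real)}")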

Confusion Matrix-
 The comparison between the results of Prediction and Reality is
called the Confusion Matrix.
 It is a record that helps in evaluation.
 It is not a calculation in itself; it is a performance measurement
for machine learning classification problems where the output can be
two or more classes.

Now again consider the example of football. The result of the
comparison between prediction and reality can be recorded in a
confusion matrix:

                    Reality = Yes           Reality = No
Prediction = Yes    True Positive (TP)      False Positive (FP)
Prediction = No     False Negative (FN)     True Negative (TN)
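
As a hedged sketch (assuming scikit-learn is installed; the
prediction and reality lists are invented for illustration), the
matrix can be computed directly from the two lists:

# Minimal sketch, assuming scikit-learn is installed.
from sklearn.metrics import confusion_matrix

reality    = [1, 0, 0, 1, 1, 0, 1, 0]  # 1 = football, 0 = not football
prediction = [1, 0, 1, 1, 0, 0, 1, 0]

# Rows = reality, columns = prediction; labels=[1, 0] puts Yes first.
matrix = confusion_matrix(reality, prediction, labels=[1, 0])
tp, fn = matrix[0]
fp, tn = matrix[1]
print(f"TP={tp}, FN={fn}, FP={fp}, TN={tn}")  # -> TP=3, FN=1, FP=1, TN=3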

Parameters to evaluate the Model-

There are four parameters to evaluate a model: Accuracy, Precision,
Recall, and F1 Score.


Accuracy-
 It is the percentage of correct predictions out of all the
observations.
 A prediction is correct if it matches the reality.
 All True Positive and True Negative cases are those in which
the Prediction matches the Reality.

Accuracy formula:

Accuracy = (Correct Predictions / Total Cases) × 100%

OR

Accuracy = ((TP + TN) / (TP + TN + FP + FN)) × 100%

Note: Here Total cases/observations = TP + TN + FP + FN

EXAMPLE
Let us again take the football example.
Assume that the model always predicts that the object is not a
football, but in reality there is a 5% chance of the object being a
football. In this case, out of 100 observations, the model will be
right for 95 cases, but wrong for the 5 cases in which the object was
a football and the model predicted it to be not a football.
Here,
1. True Positives = 0
2. True Negatives = 95
3. Total cases = 100
4. Therefore, accuracy becomes:
Accuracy = ((95 + 0) / 100) × 100% = 95%
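
A minimal Python sketch of this calculation (the counts come from
the example above; note FP = 0 and FN = 5):

# Accuracy for a model that always predicts "not football".
tp, tn, fp, fn = 0, 95, 0, 5          # counts out of 100 cases
total = tp + tn + fp + fn

accuracy = (tp + tn) / total * 100    # percentage of correct predictions
print(f"Accuracy = {accuracy}%")      # -> Accuracy = 95.0%

Note that the 95% figure is misleading here: the model never detects
a single football, which is exactly the kind of weakness the
Precision and Recall parameters below expose.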

Precision Parameter-
It is defined as the percentage of True Positive cases out of all
the cases where the prediction is positive (Yes). It takes into
account True Positives and False Positives:

Precision = (TP / (TP + FP)) × 100%

Note: If Precision is high, the True Positive cases are more, i.e.,
the model gives fewer False Positive predictions.

In the above example, all the positive predictions would be taken
into account, that is:
 True Positive (Prediction = Yes and Reality = Yes)
 False Positive (Prediction = Yes and Reality = No)
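
A minimal sketch, reusing the illustrative counts from the
confusion-matrix example above (TP = 3, FP = 1):

# Precision = TP / (TP + FP); counts are from the illustrative example.
tp, fp = 3, 1

precision = tp / (tp + fp) * 100
print(f"Precision = {precision}%")   # -> Precision = 75.0%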

Recall Parameter
 Recall is an important metric because it measures the ability of a
model to identify all of the positive instances in a dataset.
 It is calculated as the number of True Positives divided by the
total number of actual positives, including both True Positives and
False Negatives:

Recall = (TP / (TP + FN)) × 100%

Note: Precision focuses on the correctness of positive predictions,
while Recall emphasizes capturing all relevant (actual positive)
instances, i.e., how well the model recognizes everything it should
have flagged.
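
A minimal sketch, again reusing the illustrative counts from the
confusion-matrix example (TP = 3, FN = 1):

# Recall = TP / (TP + FN); counts are from the illustrative example.
tp, fn = 3, 1

recall = tp / (tp + fn) * 100
print(f"Recall = {recall}%")   # -> Recall = 75.0%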

F1 Score
 It can be defined as the measure of balance between Precision and
Recall. The standard formula is the harmonic mean of the two:

F1 Score = 2 × (Precision × Recall) / (Precision + Recall)

 The ideal situation is when both Precision and Recall have a value
of 1. Then the F1 Score is also 1 (100%), which is known as the
perfect value for the F1 Score. A model has good performance if its
F1 Score is high.
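
A minimal sketch, using the Precision and Recall computed in the
sketches above (both 0.75):

# F1 Score = 2 * (precision * recall) / (precision + recall).
precision, recall = 0.75, 0.75

f1 = 2 * (precision * recall) / (precision + recall)
print(f"F1 Score = {f1}")   # -> F1 Score = 0.75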
