Ch 7 - notes evaluation

Chapter 7 discusses the importance of evaluating AI models to ensure their reliability and effectiveness in predicting outcomes. It outlines various evaluation metrics such as accuracy, precision, recall, and the F1 score, emphasizing the need to balance precision and recall for optimal model performance. The chapter also highlights common reasons for AI model inefficiencies, including lack of training data and inefficient coding.

Uploaded by

dhanashriam06

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

11 views

Ch 7 - notes evaluation

Uploaded by

dhanashriam06

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 3

Ch 7- Evaluation

Introduction
In the Evaluation stage, we will explore different methods of evaluating an AI model. Model
Evaluation is an integral part of the model development process. It helps to find the best model
that represents our data and how well the chosen model will work in the future. Evaluation is
the process of understanding the reliability of any AI model, based on outputs by feeding test
dataset into the model and comparing with actual answers.

Importance of Evaluation –
 Following are the some of the advantages of evaluating a model:
 Evaluation ensures that the model is operating correctly and optimally.
 Evaluation is an initiative to understand how well it achieves its goals.
 Evaluation helps to determine what works well and what could be improved in a
program

Reasons for inefficiency of AI Model –

Lack of training data – This could be due to less data available for developing an AI model
Unauthenticated / wrong data – If the data is not unauthenticated and correct due to
negligence or data collected from unauthorized resources then the model will not give good
results.
Inefficient coding – If the written algorithms are not correct and relevant, AI model will not
give desired output.
Not Tested- If the model is not tested properly, then it will not be efficient.

Terminologies of Model Evaluation – Evaluation of AI model can be done using

various terminologies. Let us understand this with an example
Imagine that you have come up with an AI based prediction model which has been deployed
in a forest which is prone to forest fires. Now, the objective of the model is to predict whether
a forest fire has broken out in the forest or not. Now, to understand the efficiency of this model,
we need to check if the predictions which it makes are correct or not. Thus, there exist two
conditions which we need to ponder upon: Prediction and Reality. The prediction is the output
which is given by the machine and the reality is the real scenario in the forest when the
prediction has been made. Now let us look at various combinations that we can have with these
two conditions.
True Positive - a forest fire has broken out in the forest. The model predicts a Yes which means
there is a forest fire. The Prediction matches with the Reality. Hence, this condition is termed
as True Positive.
True Negative- Here there is no fire in the forest hence the reality is No. In this case, the
machine too has predicted it correctly as a No. Therefore, this condition is termed as True
Negative.
False Positive - Here the reality is that there is no forest fire. But the machine has incorrectly
predicted that there is a forest fire. This case is termed as False Positive.
False Negative. - Here, a forest fire has broken out in the forest because of which the Reality
is Yes but the machine has incorrectly predicted it as a No which means the machine predicts
that there is no Forest Fire. Therefore, this case becomes False Negative.
Confusion matrix
The result of comparison between the prediction and reality can be recorded in what we call
the confusion matrix. The confusion matrix allows us to understand the prediction results. Note
that it is not an evaluation metric but a record which can help in evaluation.

Prediction and Reality can be easily mapped together with the help of this confusion matrix.
Evaluation Methods
Accuracy-
Accuracy is defined as the percentage of correct predictions out of all the observations.
A prediction can be said to be correct if it matches the reality. Here, we have two conditions in
which the Prediction matches with the Reality: True Positive and True Negative.

Precision
Precision is defined as the percentage of true positive cases versus all the cases where the
prediction is true. That is, it takes into account the True Positives and False Positives.
Recall
It can be defined as the fraction of positive cases that are correctly identified. it considers True
Positives (There was a forest fire in reality and the model predicted a forest fire) and False
Negatives (There was a forest fire and the model didn’t predict it)

Which Metric is Important

we must say that if we want to know if our model’s performance is good, we need these two
measures: Recall and Precision. For some cases, you might have a High Precision but Low
Recall or Low Precision but High Recall. But since both the measures are important, there is a
need of a parameter which takes both Precision and Recall into account.
F1 Score- F1 score can be defined as the measure of balance between precision and recall.

An ideal situation would be when we have a value of 1 (that is 100%) for both Precision and
Recall. In that case, the F1 score would also be an ideal 1 (100%). It is known as the perfect
value for F1 Score.

The AI Wealth Creation Blueprint PDF
67% (3)
The AI Wealth Creation Blueprint PDF
50 pages
The Age of AI and Our Human Future (Henry Kissinger, Eric Schmidt Etc.) (Z-Library)
100% (8)
The Age of AI and Our Human Future (Henry Kissinger, Eric Schmidt Etc.) (Z-Library)
148 pages
How To Hack Atm
87% (15)
How To Hack Atm
1 page
Christopher Langan - CTMU, The Cognitive-Theoretic Model of The Universe, A New Kind of Reality Theory
88% (8)
Christopher Langan - CTMU, The Cognitive-Theoretic Model of The Universe, A New Kind of Reality Theory
56 pages
Data Structure and Algorithmic Thinking With Python Data Structure and Algorithmic Puzzles PDF
95% (20)
Data Structure and Algorithmic Thinking With Python Data Structure and Algorithmic Puzzles PDF
471 pages
Gayle Laakmann McDowell - Cracking The Coding Interview - 189 Programming Questions and Solutions (2015, CareerCup)
81% (48)
Gayle Laakmann McDowell - Cracking The Coding Interview - 189 Programming Questions and Solutions (2015, CareerCup)
708 pages
Gödel, Escher, Bach - An Eternal Golden Braid (20th Anniversary Edition) by Douglas R. Hofstadter (Charm-Quark) PDF
100% (10)
Gödel, Escher, Bach - An Eternal Golden Braid (20th Anniversary Edition) by Douglas R. Hofstadter (Charm-Quark) PDF
821 pages
Cracking The Coding Interview - 189 Programming Questions and Solutions (6th Edition) (EnglishOnlineClub - Com)
100% (10)
Cracking The Coding Interview - 189 Programming Questions and Solutions (6th Edition) (EnglishOnlineClub - Com)
708 pages
Machine Learning Interview Questions
From Everand
Machine Learning Interview Questions
Tech Interviews
4.5/5 (2)
Chris Bailey - Hyperfocus - The New Science of Attention, Productivity, and Creativity-Viking (2018)
100% (25)
Chris Bailey - Hyperfocus - The New Science of Attention, Productivity, and Creativity-Viking (2018)
306 pages
In Rack Sprinklers Design
100% (2)
In Rack Sprinklers Design
51 pages
The Art of Asking ChatGPT For High-Quality Answers A Complete Guide To Prompt Engineering Techniques (Ibrahim John) (Z-Library)
100% (24)
The Art of Asking ChatGPT For High-Quality Answers A Complete Guide To Prompt Engineering Techniques (Ibrahim John) (Z-Library)
52 pages
The Fabric of Reality
100% (1)
The Fabric of Reality
6 pages
Banana Pancakes - Ukulele Chord Chart
100% (1)
Banana Pancakes - Ukulele Chord Chart
2 pages
75 Productivity Hacks - System Sunday
100% (7)
75 Productivity Hacks - System Sunday
75 pages
Military Remote Viewing Manual
100% (5)
Military Remote Viewing Manual
72 pages
Machine Learning For Humans
100% (4)
Machine Learning For Humans
97 pages
Cs 229, Autumn 2016 Problem Set #2: Naive Bayes, SVMS, and Theory
No ratings yet
Cs 229, Autumn 2016 Problem Set #2: Naive Bayes, SVMS, and Theory
20 pages
Manual Lab CMT450 - Unit Operation
No ratings yet
Manual Lab CMT450 - Unit Operation
17 pages
EVALUATION
No ratings yet
EVALUATION
12 pages
c10 Ai Evaluation -2024-25
No ratings yet
c10 Ai Evaluation -2024-25
29 pages
Evaluation AI X
No ratings yet
Evaluation AI X
6 pages
EvaluationNotes
No ratings yet
EvaluationNotes
12 pages
Evaluation 1
No ratings yet
Evaluation 1
23 pages
Evaluation - Grade 10 AI
No ratings yet
Evaluation - Grade 10 AI
12 pages
04 Evaluation Revision Notes
No ratings yet
04 Evaluation Revision Notes
5 pages
Evaluation Grade10 Ai
No ratings yet
Evaluation Grade10 Ai
32 pages
Evaluation
No ratings yet
Evaluation
32 pages
Grade 10 Unit 7 - Evaluation
No ratings yet
Grade 10 Unit 7 - Evaluation
50 pages
Part B Chapter 7 (Evaluation)
No ratings yet
Part B Chapter 7 (Evaluation)
5 pages
5.10AI -2B
No ratings yet
5.10AI -2B
15 pages
EVALUATION PPT
No ratings yet
EVALUATION PPT
25 pages
AI-Evaluation
No ratings yet
AI-Evaluation
30 pages
Ch-EVALUATION
No ratings yet
Ch-EVALUATION
7 pages
Evaluation__1646538719041
No ratings yet
Evaluation__1646538719041
65 pages
AI SS CH 7 LM
No ratings yet
AI SS CH 7 LM
39 pages
EVALUATION
No ratings yet
EVALUATION
10 pages
Evaluation in AI
No ratings yet
Evaluation in AI
20 pages
EVALUATION - notes
No ratings yet
EVALUATION - notes
15 pages
Unit 7 - Evaluation
No ratings yet
Unit 7 - Evaluation
7 pages
EVALUATION
No ratings yet
EVALUATION
4 pages
A Field of Computer Science That Focuses On Enabling Computers To Identify and Understand Objects and People in Images and Videos
No ratings yet
A Field of Computer Science That Focuses On Enabling Computers To Identify and Understand Objects and People in Images and Videos
136 pages
Evaluation Question Answers
No ratings yet
Evaluation Question Answers
7 pages
Unit-7 Evaluation: 7. What Is Meant by Overfitting of Data?
No ratings yet
Unit-7 Evaluation: 7. What Is Meant by Overfitting of Data?
7 pages
517-c-30072-Assignment Chapter Evaluation
No ratings yet
517-c-30072-Assignment Chapter Evaluation
10 pages
EvaluationQuestions Class 10 Ai
No ratings yet
EvaluationQuestions Class 10 Ai
6 pages
Evaluation Class X
50% (2)
Evaluation Class X
19 pages
AI Evaluation
No ratings yet
AI Evaluation
3 pages
Unit 7 - AI (Evaluation)
No ratings yet
Unit 7 - AI (Evaluation)
28 pages
UNIT 7 EVALUATION.docx
No ratings yet
UNIT 7 EVALUATION.docx
13 pages
AI Evaluation
No ratings yet
AI Evaluation
24 pages
Evaluation
No ratings yet
Evaluation
12 pages
Q ClassX AI Evaluation
No ratings yet
Q ClassX AI Evaluation
12 pages
Evaluation-Important Questions
No ratings yet
Evaluation-Important Questions
12 pages
Chapter 7 (Evaluation)
No ratings yet
Chapter 7 (Evaluation)
2 pages
Evaluation 1 7
No ratings yet
Evaluation 1 7
7 pages
AI Project Evaluation 1
No ratings yet
AI Project Evaluation 1
5 pages
Evaluation 2
No ratings yet
Evaluation 2
15 pages
Evaluation Worksheet
No ratings yet
Evaluation Worksheet
2 pages
Evaluation notes
No ratings yet
Evaluation notes
12 pages
Part B Unit 7 Evaluation
No ratings yet
Part B Unit 7 Evaluation
11 pages
1051637-Worksheet Part b Unit7 Evaluation
No ratings yet
1051637-Worksheet Part b Unit7 Evaluation
5 pages
Cbse - Department of Skill Education Artificial Intelligence
No ratings yet
Cbse - Department of Skill Education Artificial Intelligence
12 pages
Evaluation Notes
No ratings yet
Evaluation Notes
12 pages
MS EVALUATION WORKSHEET
No ratings yet
MS EVALUATION WORKSHEET
3 pages
Evaluation Compressed
No ratings yet
Evaluation Compressed
14 pages
Aiunit 7 10
No ratings yet
Aiunit 7 10
4 pages
10 Ai Evaluation tp01
No ratings yet
10 Ai Evaluation tp01
5 pages
1006_ai_evaluation
No ratings yet
1006_ai_evaluation
4 pages
chapter 8
No ratings yet
chapter 8
25 pages
2.Confusion matrix and Performmance Metrics
No ratings yet
2.Confusion matrix and Performmance Metrics
15 pages
UNIT 3-Practice Sheet 3 (1)
No ratings yet
UNIT 3-Practice Sheet 3 (1)
2 pages
Class 10 Chapter 7 -EVALUATION
No ratings yet
Class 10 Chapter 7 -EVALUATION
17 pages
3008_revision_cv_evaluation
No ratings yet
3008_revision_cv_evaluation
20 pages
Evaluation
No ratings yet
Evaluation
10 pages
Class X - Artificial Intelligence - Evaluation - Question Bank
83% (6)
Class X - Artificial Intelligence - Evaluation - Question Bank
8 pages
Evaluation
No ratings yet
Evaluation
2 pages
Screenshot 2024-12-17 at 8.54.03 PM
No ratings yet
Screenshot 2024-12-17 at 8.54.03 PM
4 pages
The Secrets of A Slot Machine
No ratings yet
The Secrets of A Slot Machine
4 pages
My Ai Cheat List
100% (11)
My Ai Cheat List
3 pages
Roadmap How To Learn AI in 2024 (Uncovered AI)
No ratings yet
Roadmap How To Learn AI in 2024 (Uncovered AI)
6 pages
Teas Topics To Study
100% (12)
Teas Topics To Study
6 pages
2045: The Year Man Becomes Immortal
No ratings yet
2045: The Year Man Becomes Immortal
9 pages
Wisc V Interpretation
100% (1)
Wisc V Interpretation
8 pages
Rationality From AI To Zombies
86% (7)
Rationality From AI To Zombies
1,813 pages
Tech Trend 2024 Report-2
No ratings yet
Tech Trend 2024 Report-2
11 pages
From Music To Mathematic
100% (1)
From Music To Mathematic
4 pages
Attention Is All You Need
67% (3)
Attention Is All You Need
11 pages
Mind Control Patents
100% (1)
Mind Control Patents
41 pages
Python Programming and Maching Learning 2 in 1 B08Y5DPX32
100% (7)
Python Programming and Maching Learning 2 in 1 B08Y5DPX32
145 pages
Psych Unit 7a Practice Quiz
No ratings yet
Psych Unit 7a Practice Quiz
4 pages
Current and Future Trends on AI Applications - Mohammed A Al-Sharafi
No ratings yet
Current and Future Trends on AI Applications - Mohammed A Al-Sharafi
456 pages
Radiodetection PCM
No ratings yet
Radiodetection PCM
8 pages
Manual Che301
No ratings yet
Manual Che301
53 pages
A Popular Theory Explaining The Evolution of The Universe
No ratings yet
A Popular Theory Explaining The Evolution of The Universe
2 pages
Unit-4 MM 2022
No ratings yet
Unit-4 MM 2022
17 pages
Gunasekarage et al (2007)
No ratings yet
Gunasekarage et al (2007)
34 pages
Scada Datasheet EN
No ratings yet
Scada Datasheet EN
8 pages
VS2003 Jscript Es-Es
No ratings yet
VS2003 Jscript Es-Es
802 pages
Trigno Identities
No ratings yet
Trigno Identities
31 pages
Objects & Classes: Softuni Team
No ratings yet
Objects & Classes: Softuni Team
42 pages
Remote Controlled Fan Regulator Project Report
100% (2)
Remote Controlled Fan Regulator Project Report
11 pages
Short Titles Long Titles
No ratings yet
Short Titles Long Titles
2 pages
Activity 4B Impedance of RLC Circuits: Parallel RLC Circuit: Electrical Engineering Department
No ratings yet
Activity 4B Impedance of RLC Circuits: Parallel RLC Circuit: Electrical Engineering Department
10 pages
25 Moments&Eqm
No ratings yet
25 Moments&Eqm
4 pages
Lecture 2a. Introduction To Fault Analysis
100% (2)
Lecture 2a. Introduction To Fault Analysis
54 pages
Summative Test in Math 7
100% (1)
Summative Test in Math 7
4 pages
Laboratory Experiment 3
No ratings yet
Laboratory Experiment 3
14 pages
Reflexive Verbs and Pronouns
No ratings yet
Reflexive Verbs and Pronouns
3 pages
Optimization Technique
100% (6)
Optimization Technique
30 pages
Mioe Charger Rs 121-37581g
No ratings yet
Mioe Charger Rs 121-37581g
161 pages
Threads- Threading Issues
No ratings yet
Threads- Threading Issues
19 pages
Electrochemistry
No ratings yet
Electrochemistry
51 pages
Name: Date:: Unit 1 Worksheet
100% (1)
Name: Date:: Unit 1 Worksheet
70 pages
An Efficient Method For Antenna Design Optimization Based On Evolutionary Computation and Machine Learning Techniques
No ratings yet
An Efficient Method For Antenna Design Optimization Based On Evolutionary Computation and Machine Learning Techniques
12 pages
Time Study Equipments
No ratings yet
Time Study Equipments
6 pages
Object-Oriented Software Engineering: Practical Software Development Using UML and Java
No ratings yet
Object-Oriented Software Engineering: Practical Software Development Using UML and Java
65 pages
Jai Paras Construction & Engg. Co. Bill of Quantity For Storm Water Line
No ratings yet
Jai Paras Construction & Engg. Co. Bill of Quantity For Storm Water Line
3 pages
VBA Objects - The Ultimate Guide
No ratings yet
VBA Objects - The Ultimate Guide
29 pages
Lesson 6:: Understanding The Z-Scores
100% (1)
Lesson 6:: Understanding The Z-Scores
52 pages

Ch 7 - notes evaluation

Uploaded by

Ch 7 - notes evaluation

Uploaded by

Ch 7- Evaluation

Reasons for inefficiency of AI Model –

Terminologies of Model Evaluation – Evaluation of AI model can be done using

Which Metric is Important

You might also like