
AI Evaluation

Evaluation
Evaluation refers to systematically checking and analysing the merit,
correctness and reliability of an AI model based on the outputs
produced by it.
Evaluation Metrics
Evaluation Metrics refer to the measures used to test the quality of
an AI model.
The causes behind the performance of an AI model are:
1. Overfitting
Overfitting refers to a situation when an AI model fits so exactly
against its training data that it performs very well on the data it
has already seen but fails to produce correct results for new, unseen
data.
2. Underfitting
Underfitting refers to a situation when an AI model is not complex
enough to capture the structure and relationships of its training data
and therefore cannot predict effective outcomes.
3. Generalization
Generalization refers to how well the concepts learned by a machine
learning model apply to specific examples not seen by the model
when it was learning. The goal of a good machine learning model is to
generalize well from the training data to any data from the problem
domain. This allows us to make predictions in the future on data the
model has never seen.
Ideally, an AI model should be balanced between underfitting and
overfitting to be a good fit.
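The difference between these situations shows up when a model's accuracy on its training data is compared with its accuracy on data held back for testing. Below is a minimal sketch of that check using scikit-learn; the synthetic dataset, the decision-tree model and all parameters are illustrative assumptions, not part of the original text.

```python
# A minimal sketch: comparing training and test accuracy to spot
# overfitting/underfitting. Dataset and model choices are assumptions.
from sklearn.datasets import make_classification
from sklearn.model_selection import train_test_split
from sklearn.tree import DecisionTreeClassifier
from sklearn.metrics import accuracy_score

# Synthetic classification data standing in for a real problem domain
X, y = make_classification(n_samples=500, n_features=10, random_state=42)
X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.3, random_state=42)

# An unconstrained tree can memorize its training data (overfitting risk)
model = DecisionTreeClassifier(random_state=42)
model.fit(X_train, y_train)

train_acc = accuracy_score(y_train, model.predict(X_train))
test_acc = accuracy_score(y_test, model.predict(X_test))
print(f"Training accuracy: {train_acc:.2f}")  # close to 1.0 -> memorized
print(f"Test accuracy:     {test_acc:.2f}")   # noticeably lower -> overfit

# High training accuracy with much lower test accuracy suggests
# overfitting; low accuracy on both suggests underfitting.
```

A model that generalizes well shows similar accuracy on both sets.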
Confusion Matrix
A Confusion Matrix is a technique using a chart or table for
summarizing the performance of a classification-based AI model by
listing the predicted values of the AI model and the actual/correct
outcome values.
A confusion matrix includes both predicted and actual values in the
context of an AI model, which are:
◆ the Actual Value represents the actual result (observed or
measured); it can be True or False.
◆ the Predicted Value is the value of the outcome/result of the AI
model, produced on the basis of its algorithm and learning; it can be
Positive or Negative.
Before we proceed to how to create and use confusion matrices, some
terms associated with the confusion matrix are:
(i) True Positive (TP). True Positive refers to an instance for which
both the predicted value of the AI model and the actual value are
positive. For example, while testing a patient for Covid, if the test
produced the result (predicted value) as positive and the actual
result (actual value) is also positive, it is a True Positive.
(ii) True Negative (TN). True Negative refers to an instance for which
both the predicted value of the AI model and the actual value are
negative. For example, while testing a patient for Covid, if the test
produced the result (predicted value) as negative and the actual
result (actual value) is also negative, it is a True Negative.
(iii) False Positive (FP) (also called Type I Error). False Positive
refers to an instance for which the predicted value of the AI model is
positive but the actual value is negative. For example, while testing
a patient for Covid, if the test produced the result (predicted value)
as positive but the actual result (actual value) is negative, it is a
False Positive.
(iv) False Negative (FN) (also called Type II Error). False Negative
refers to an instance for which the predicted value of the AI model is
negative but the actual value is positive. For example, while testing
a patient for Covid, if the test produced the result (predicted value)
as negative but the actual result (actual value) is positive, it is a
False Negative.
Note
A Confusion Matrix is a technique using a chart or table (N x N
matrix) for summarizing the performance of a classification-based AI
model by listing the predicted values of the AI model and the
actual/correct outcome values.
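As a small illustration (the sample labels below are assumed, not from the original text), the four counts can be tallied directly by comparing lists of actual and predicted values:

```python
# A minimal sketch: tallying TP, TN, FP and FN from actual vs. predicted
# labels. The sample labels below are assumptions for illustration.
actual    = ["positive", "positive", "negative", "negative", "positive", "negative"]
predicted = ["positive", "negative", "negative", "positive", "positive", "negative"]

tp = tn = fp = fn = 0
for a, p in zip(actual, predicted):
    if p == "positive" and a == "positive":
        tp += 1   # predicted positive, actually positive
    elif p == "negative" and a == "negative":
        tn += 1   # predicted negative, actually negative
    elif p == "positive" and a == "negative":
        fp += 1   # predicted positive, actually negative (Type I error)
    else:
        fn += 1   # predicted negative, actually positive (Type II error)

print(f"TP={tp}, TN={tn}, FP={fp}, FN={fn}")  # TP=2, TN=2, FP=1, FN=1
```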
Using the confusion matrices, you need to compute the following
values to evaluate an AI model:
◆ Accuracy rate. This is the percentage of predictions, out of all
the observations, that are correct.
◆ Precision rate. This is the rate at which positive predictions
turn out to be correct (True Positives out of all predicted positives).
◆ Recall. It is the rate of correct positive predictions to the overall
number of positive instances in the dataset.
◆ F1 score. It is a measure of balance between precision and recall.

F1 Score refers to a metric that balances Precision and Recall and


hence balances the impact of False Positives and False negatives.
The metrics Precision, Recall and F1 score range from 0 to 1.
Of all the AI models developed, the model with the highest F1
score is chosen.
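For reference, the standard formulas behind these metrics are: Accuracy = (TP + TN) / (TP + TN + FP + FN); Precision = TP / (TP + FP); Recall = TP / (TP + FN); F1 score = 2 × (Precision × Recall) / (Precision + Recall). A minimal sketch computing them, reusing the assumed counts from the tally example above:

```python
# A minimal sketch: computing Accuracy, Precision, Recall and F1 score
# from the four confusion-matrix counts. The counts are the assumed
# values from the tally example above.
tp, tn, fp, fn = 2, 2, 1, 1

accuracy  = (tp + tn) / (tp + tn + fp + fn)   # correct predictions / all
precision = tp / (tp + fp)                    # TP / all predicted positives
recall    = tp / (tp + fn)                    # TP / all actual positives
f1 = 2 * precision * recall / (precision + recall)

print(f"Accuracy:  {accuracy:.2f}")   # 0.67
print(f"Precision: {precision:.2f}")  # 0.67
print(f"Recall:    {recall:.2f}")     # 0.67
print(f"F1 score:  {f1:.2f}")         # 0.67
```

Because Precision and Recall look only at the positive class, the F1 score is often more informative than accuracy when the two classes are imbalanced.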
