Unit 4
Model Evaluation
Model evaluation is the process of using different evaluation metrics
to understand a machine learning model's performance, as well as its
strengths and weaknesses.
It is a crucial step in the development and deployment of machine
learning systems.
The primary goal of model evaluation is to determine how well the
model generalizes to unseen data and whether it meets the desired
objectives.
Model Evaluation techniques
Training Data:
Training data are collections of examples or samples that are
used to 'teach' or 'train' the machine learning model.
The model uses a training data set to understand the patterns
and relationships within the data, thereby learning to make
predictions or decisions without being explicitly programmed to
perform a specific task.
It is the set of data that is used to train and make the model
learn the hidden features/patterns present in the data.
Validation Data:
The validation data is a set of data that is used to check the
model's performance during training.
This data is held out of the training process itself and is used
to evaluate the model, and to tune it, while it is still being
developed.
After training a machine learning model using the training data,
the model's performance is evaluated using the validation data.
This evaluation typically involves measuring metrics such as
accuracy, precision, recall, F1 score, or other relevant
performance indicators, depending on the nature of the
problem being solved.
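The metrics named above can be computed directly from the counts of true/false positives and negatives. The following is a minimal sketch for binary labels (1 = positive); the function name is illustrative, not from a library:

```python
def classification_metrics(y_true, y_pred):
    """Compute accuracy, precision, recall and F1 for binary labels (1 = positive)."""
    tp = sum(1 for t, p in zip(y_true, y_pred) if t == 1 and p == 1)
    fp = sum(1 for t, p in zip(y_true, y_pred) if t == 0 and p == 1)
    fn = sum(1 for t, p in zip(y_true, y_pred) if t == 1 and p == 0)
    tn = sum(1 for t, p in zip(y_true, y_pred) if t == 0 and p == 0)
    accuracy = (tp + tn) / len(y_true)
    precision = tp / (tp + fp) if tp + fp else 0.0
    recall = tp / (tp + fn) if tp + fn else 0.0
    f1 = 2 * precision * recall / (precision + recall) if precision + recall else 0.0
    return {"accuracy": accuracy, "precision": precision, "recall": recall, "f1": f1}
```

Precision and recall use different denominators (predicted positives vs. actual positives), which is why both are reported: a model can score high on one while doing poorly on the other.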
Testing Data:
The testing data is used to evaluate the accuracy of the trained
algorithm.
This data is held aside during the modelling process and used
only to evaluate the model after the modelling is complete.
Test data has the same variables as the training data: the same
set of independent variables and the same dependent variable.
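The three data sets above are usually produced by shuffling the data once and splitting it. A minimal sketch of such a split (the 70/15/15 proportions are an illustrative choice, not a fixed rule):

```python
import random

def split_dataset(data, train_frac=0.7, val_frac=0.15, seed=42):
    """Shuffle a dataset and split it into train/validation/test subsets."""
    rng = random.Random(seed)               # fixed seed for a reproducible split
    indices = list(range(len(data)))
    rng.shuffle(indices)
    n_train = int(len(data) * train_frac)
    n_val = int(len(data) * val_frac)
    train = [data[i] for i in indices[:n_train]]
    val = [data[i] for i in indices[n_train:n_train + n_val]]
    test = [data[i] for i in indices[n_train + n_val:]]   # remainder becomes test
    return train, val, test
```

Shuffling before splitting matters: if the data is ordered (e.g. by class or by date), a plain slice would give the model an unrepresentative training set.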
Overfitting:
Definition:
Overfitting occurs when a model learns the training data too well,
capturing noise or random fluctuations in the data as if they were
genuine patterns. Consequently, the model performs well on the
training data but fails to generalize to new, unseen data.
Characteristics:
Low bias: The model has low bias because it fits the training data
very closely. Bias is a model's inability to capture the true
relationship between the variables.
High variance: However, it has high variance because it fails to
generalize well to unseen data. In ML, the difference in fits between
data sets is called variance.
It will have excellent performance on training data but poor
performance on test data.
Causes:
Using an overly complex model or algorithm.
Having too many features relative to the amount of training data.
Insufficient regularization. Regularization refers to techniques that
constrain a machine learning model in order to minimize an
adjusted loss function and prevent overfitting or underfitting.
Using regularization, we can fit our machine learning model
appropriately to the training data and hence reduce its errors on
unseen data.
A loss function is a measurement of how good your model is in
terms of predicting the expected outcome.
In short, regularization in machine learning is a technique used to
prevent overfitting and improve the generalization ability of a model.
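As a concrete sketch of an "adjusted loss function": L2 (ridge) regularization adds a penalty proportional to the squared weight, so large weights, which often indicate a model contorting itself to fit noise, raise the loss. The simple one-weight model below is illustrative:

```python
def mse_loss(w, xs, ys):
    """Mean squared error of the simple linear model y_hat = w * x."""
    return sum((w * x - y) ** 2 for x, y in zip(xs, ys)) / len(xs)

def ridge_loss(w, xs, ys, lam=0.1):
    """Adjusted loss: MSE plus an L2 penalty that discourages large weights."""
    return mse_loss(w, xs, ys) + lam * w ** 2
```

The hyperparameter `lam` controls the strength of the penalty: `lam = 0` recovers the unregularized loss, while larger values push the optimizer toward smaller weights.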
Remedies:
Simplify the model by reducing the number of features or
decreasing its complexity.
Use cross-validation to tune hyperparameters and prevent
overfitting.
Early stopping during training to prevent the model from
learning noise in the data.
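The early-stopping remedy can be sketched as a simple rule on the validation-loss curve: stop once the loss has not improved for a set number of epochs (the "patience"), and keep the parameters from the best epoch. The function below is an illustrative helper, not a library API:

```python
def early_stopping_epoch(val_losses, patience=3):
    """Return the epoch whose model should be kept: training stops once the
    validation loss has failed to improve for `patience` consecutive epochs."""
    best, best_epoch, waited = float("inf"), 0, 0
    for epoch, loss in enumerate(val_losses):
        if loss < best:
            best, best_epoch, waited = loss, epoch, 0   # new best: reset patience
        else:
            waited += 1
            if waited >= patience:                      # patience exhausted: stop
                return best_epoch
    return best_epoch
```

Monitoring validation loss (rather than training loss) is the point: training loss keeps falling even while the model starts memorizing noise.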
Underfitting
Definition: Underfitting occurs when a model is too simple to capture the underlying
structure of the data. In other words, the model fails to learn the patterns in the
training data, resulting in poor performance not only on the training data but also on
unseen data (test data).
Characteristics:
High bias: The model is biased toward a certain set of assumptions and fails to
capture the complexity of the data.
Poor performance: Both on training and test data, the model's performance is
poor.
Causes:
Using an overly simple model or algorithm.
Insufficient training data.
Insufficient training time.
Remedies:
Increase model complexity by adding more features or
increasing the model's capacity.
Use more advanced algorithms that can capture complex
patterns.
Gather more training data.
Train the model for longer periods.
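One common way to "add more features" is to expand each input into polynomial features, so a linear model can fit curved relationships it would otherwise underfit. A minimal sketch:

```python
def add_polynomial_features(x, degree=3):
    """Expand a scalar input x into [x, x**2, ..., x**degree].
    A linear model trained on these expanded features can represent
    curves that a plain linear model on x alone would underfit."""
    return [x ** d for d in range(1, degree + 1)]
```

Note the tradeoff with the previous section: raising the degree increases capacity and fights underfitting, but pushing it too high invites overfitting.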
How to overcome overfitting and underfitting
in a model?
Variance-bias tradeoff:
If the algorithm is too simple, it may have high bias and low variance
and thus be error-prone. If the algorithm is too complex, it may have
high variance and low bias.
The ideal model lies between these two extremes.
If you make the model more complex (to reduce bias), you risk increasing
variance.
If you simplify the model (to reduce variance), you risk increasing bias.
The challenge is to find a balance where the model is complex enough to
capture important patterns but simple enough to generalize well.
Cross-validation
Cross-validation is a technique used to evaluate the
performance of a machine learning model by splitting the data
into multiple parts. Instead of using just one training and one
test set, the data is divided into "folds," and the model is
trained and tested on different combinations of these folds.
How It Works:
1. Split the Data: The dataset is divided into k equal parts (folds).
2. Train and Test:
The model is trained on k-1 folds.
It is tested on the remaining 1 fold.
3. Repeat: This process is repeated k times, with each fold used as the
test set once.
4. Average Results: The final model performance is the average of the
results from all folds.
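Steps 1–4 above can be sketched directly. The helper below produces the k train/test index splits; averaging a model's score across them gives the cross-validated estimate (the function name is illustrative):

```python
def kfold_splits(n, k):
    """Yield (train_indices, test_indices) pairs for k-fold cross-validation:
    each of the k folds serves as the test set exactly once."""
    indices = list(range(n))
    fold_size = n // k
    for i in range(k):
        start = i * fold_size
        end = start + fold_size if i < k - 1 else n   # last fold takes any remainder
        test_idx = indices[start:end]
        train_idx = indices[:start] + indices[end:]   # everything outside the fold
        yield train_idx, test_idx
```

In practice the data should be shuffled before indexing into folds, so that no fold ends up with an unrepresentative slice of the data.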
Advantages:
Prevents overfitting: because every observation is used for both
training and testing, the performance estimate depends less on any
single train/test split.
Disadvantages :
1. Computationally Expensive: Involves multiple training and testing cycles,
increasing resource usage.
2. Time-Consuming: Can be slow for large datasets or complex models.
3. Not Always Necessary: May be excessive for small datasets or already well-
performing models.
4. Risk of Data Leakage: Improper splitting can introduce information from
test data into training.
Hyperparameter tuning:
Hyperparameters are external configurations that are not learned from
the data but set before training.
Examples:
• Learning Rate: Controls how much the model adjusts during training.
• Number of Trees: In decision trees or random forests.
• Batch Size: Number of samples processed before updating the model.
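A simple tuning strategy is grid search: try every combination of candidate hyperparameter values and keep the best-scoring one. The sketch below assumes a caller-supplied `evaluate` function (e.g. cross-validated accuracy); it is an illustration, not a library API:

```python
def grid_search(learning_rates, batch_sizes, evaluate):
    """Exhaustively try each (learning_rate, batch_size) pair and return
    the combination with the highest validation score."""
    best_params, best_score = None, float("-inf")
    for lr in learning_rates:
        for bs in batch_sizes:
            score = evaluate(lr, bs)        # e.g. mean cross-validation score
            if score > best_score:
                best_params, best_score = (lr, bs), score
    return best_params, best_score
```

Grid search is easy to reason about but its cost grows multiplicatively with each hyperparameter added, which is why random search is often preferred for large grids.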
Stochastic Gradient Descent (SGD):
Step 1: Initialize the model parameters randomly or with some starting values.
Step 2: Randomly select one data point (or a mini-batch of data points).
Step 3: Calculate the gradient of the loss function with respect to the model
parameters using that single data point.
Step 4: Update the model parameters by moving in the opposite direction of the
gradient (to minimize the loss). The update rule is typically:
θ=θ−η⋅∇L(θ)
where:
θ are the model parameters,
η is the learning rate (step size),
∇L(θ) is the gradient of the loss function with respect to the parameters.
Step 5: Repeat this process for a specified number of iterations (epochs), going
through the dataset multiple times.
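Steps 1–5 can be sketched for a one-feature linear model with squared-error loss; the update line is exactly θ = θ − η·∇L(θ) applied to the weight and bias:

```python
import random

def sgd_fit(xs, ys, lr=0.05, epochs=300, seed=0):
    """Fit y ≈ w*x + b by stochastic gradient descent, one point per update."""
    rng = random.Random(seed)
    w, b = 0.0, 0.0                      # Step 1: starting parameter values
    data = list(zip(xs, ys))
    for _ in range(epochs):              # Step 5: repeat over the dataset
        rng.shuffle(data)
        for x, y in data:                # Step 2: one data point at a time
            err = (w * x + b) - y        # Step 3: gradient of 0.5*(pred - y)**2
            w -= lr * err * x            # Step 4: theta = theta - lr * grad
            b -= lr * err
    return w, b
```

Because each update uses a single point, the loss jumps around from step to step, but on average the parameters move downhill, which is the "stochastic" in SGD.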
Advantages of SGD:
1. Faster Convergence:
Quicker updates: SGD updates parameters using a single data point (or
mini-batch) at a time, leading to faster parameter adjustments compared to
traditional gradient descent, which requires computing gradients over the
entire dataset.
Frequent updates: With each data point processed, the model receives
immediate feedback, speeding up convergence in the early stages.
Resource efficiency: SGD doesn't require loading the entire dataset into
memory, making it faster and more computationally efficient.