The document discusses the concept of Empirical Risk Minimization (ERM) in neural networks and deep learning, emphasizing its importance in selecting models that minimize empirical risk based on training data. It explains the distinction between empirical risk and true risk, as well as the impact of bias and variance errors on model performance. Additionally, it highlights regularization techniques to prevent overfitting and improve model generalization on unseen data.
NEURAL NETWORKS & DEEP LEARNING
(21MCA24DB3)
Prepared & Presented By:
Dr. Balkishan, Assistant Professor
Department of Computer Science & Applications
Maharshi Dayanand University, Rohtak

Empirical Risk Minimization
The empirical risk minimization principle states that the learning algorithm should choose a function/model/hypothesis which minimizes the empirical risk.

Understanding the concept of risk
• What is a loss function?
• Given a set of inputs and outputs, a loss function measures the difference between the predicted output and the true output.
• But this applies only to the given set of inputs and outputs.
• We want to know what the loss is over all the possibilities.
• This is where "true risk" comes into the picture.
• True risk is the average loss over all the possibilities, i.e. over the entire data distribution.
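To make this concrete, here is a minimal Python sketch (not from the slides; all numbers are invented) that computes the average squared-error loss of a candidate model over a small dataset. This sample average is exactly the "empirical risk" defined next; the true risk would be the same average taken over the entire distribution.

```python
# Minimal sketch: average loss over a finite dataset (hypothetical numbers).
# Empirical risk: R_emp(h) = (1/n) * sum_i L(h(x_i), y_i)

def squared_loss(y_pred, y_true):
    """Loss for a single example: squared difference."""
    return (y_pred - y_true) ** 2

def empirical_risk(model, xs, ys):
    """Average loss of `model` over the given (x, y) pairs."""
    return sum(squared_loss(model(x), y) for x, y in zip(xs, ys)) / len(xs)

# A candidate model/hypothesis: h(x) = 2x
h = lambda x: 2 * x

xs = [1.0, 2.0, 3.0]   # inputs (made up for illustration)
ys = [2.1, 3.9, 6.2]   # observed true outputs

print(empirical_risk(h, xs, ys))  # average loss on this dataset
```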
What exactly is Empirical Risk Minimization?
• If we compute the loss using the data points in our dataset, it is called empirical risk.
• It is "empirical" (experimental) rather than "true" because we are using a dataset that is only a subset of the whole population.
• When we build our learning model, we need to pick the function that minimizes the empirical risk, i.e. the difference between the predicted output and the actual output over the data points in our dataset.
• This process of finding the function that minimizes the empirical risk is called empirical risk minimization.
Importance of Empirical Risk Minimization
• ERM is essential to understanding the limits of machine learning algorithms and forms a good basis for practical problem-solving skills.

Empirical risk minimization (ERM)
• It is a principle in statistical learning theory which defines a family of learning algorithms and is used to give theoretical bounds on their performance.
• The idea is that we don't know exactly how well an algorithm will work in practice (the true "risk") because we don't know the true distribution of the data that the algorithm will work on; as an alternative, we can measure its performance on a known set of training data.
• We assume that our samples come from this distribution and use our dataset as an approximation of it.

Example of Empirical Risk Minimization
• Example: we want to build a model that can differentiate between a male and a female based on specific features.
• If we select 150 random people where the women happen to be very short and the men very tall, the model might incorrectly assume that height is the differentiating feature.
• For a truly accurate model, we would have to gather all the women and men in the world to extract the differentiating features. Unfortunately, that is not possible! So we select a small number of people and hope that this sample is representative of the whole population.
• Computing the loss on this sample gives the empirical risk, but what we really want to minimize is the true risk; the sketch below illustrates how the one approximates the other.

Training and Testing of Model
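A minimal sketch of the sampling idea from the example above, using a synthetic "population" (all numbers invented): the risk computed over the whole population stands in for the true risk, and the risk on a random sample of 150 points is the empirical risk.

```python
# Sketch: empirical risk on a small sample vs. risk on the whole "population".
# The population here is synthetic (y = 3x + noise), purely for illustration.
import random

random.seed(0)
population = [(x, 3 * x + random.gauss(0, 1)) for x in
              [random.uniform(0, 10) for _ in range(100_000)]]

h = lambda x: 3 * x   # a fixed hypothesis to evaluate

def risk(pairs):
    return sum((h(x) - y) ** 2 for x, y in pairs) / len(pairs)

sample = random.sample(population, 150)   # like selecting 150 random people

print("empirical risk (150 samples):", risk(sample))
print("'true' risk (whole population):", risk(population))
# The two numbers are close but not identical: the sample only approximates
# the population, which is why empirical risk only approximates true risk.
```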
Model (Function) Fitting
• How well a model performs on the training and evaluation datasets defines its characteristics:

                     Underfit     Overfit      Good Fit
Training Dataset     Poor         Very Good    Good
Evaluation Dataset   Very Poor    Poor         Good

Model Fitting – Visualization
Variations of model fitting
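The sketch below (not from the slides; data and polynomial degrees are invented) reproduces the pattern in the table above by fitting polynomials of different degrees with NumPy and comparing training versus evaluation error.

```python
# Sketch: reproducing the table's pattern with polynomial fits of different
# degrees. Data are synthetic (cubic + noise); degrees are illustrative only.
import numpy as np

rng = np.random.default_rng(0)
x = rng.uniform(-3, 3, 40)
y = x**3 - 2 * x + rng.normal(0, 2, x.shape)   # hidden cubic relationship

x_train, y_train = x[:25], y[:25]              # training dataset
x_eval,  y_eval  = x[25:], y[25:]              # evaluation dataset

def mse(coeffs, xs, ys):
    return float(np.mean((np.polyval(coeffs, xs) - ys) ** 2))

for degree, label in [(1, "underfit"), (15, "overfit"), (3, "good fit")]:
    coeffs = np.polyfit(x_train, y_train, degree)
    print(f"{label:8s} train={mse(coeffs, x_train, y_train):8.2f} "
          f"eval={mse(coeffs, x_eval, y_eval):8.2f}")
# Expected pattern: underfit -> poor on both; overfit -> very good on train,
# poor on eval; good fit -> good on both (matching the table above).
```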
Errors in Machine Learning
• In machine learning, an error is a measure of how accurately an algorithm can make predictions on a previously unseen dataset.
• On the basis of these errors, we select the machine learning model that can perform best on the particular dataset.

Machine Learning Errors
• Reducible errors: these errors can be reduced to improve the model's accuracy. They can be further classified into bias and variance.
• Irreducible errors: these errors will always be present in the model regardless of which algorithm is used. Their cause is unknown variables whose influence cannot be reduced.
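These error types are related by the standard bias–variance decomposition for squared loss, a well-known result (not stated on the slides) in which f is the true function, f̂ the learned model, and σ² the irreducible noise variance:

```latex
\mathbb{E}\!\left[(y - \hat{f}(x))^2\right]
  = \underbrace{\left(\mathbb{E}[\hat{f}(x)] - f(x)\right)^2}_{\text{bias}^2}
  + \underbrace{\mathbb{E}\!\left[\left(\hat{f}(x) - \mathbb{E}[\hat{f}(x)]\right)^2\right]}_{\text{variance}}
  + \underbrace{\sigma^2}_{\text{irreducible error}}
```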
What is Bias?
• In general, a machine learning model analyses the data, finds patterns in it, and makes predictions.
• While training, the model learns these patterns in the dataset and applies them to the test data for prediction.
• While making predictions, a difference occurs between the values predicted by the model and the actual/expected values; this difference is known as bias error, or error due to bias.
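A minimal sketch of bias error (all numbers invented): an intentionally too-simple constant model is evaluated on data that really follows y = 2x, so a systematic gap remains between predictions and actual values.

```python
# Sketch: bias error as the gap between predictions and actual values for a
# model that is too simple to capture the pattern (high bias).

xs = [1.0, 2.0, 3.0, 4.0]
ys = [2.0, 4.0, 6.0, 8.0]          # actual/expected values (y = 2x)

constant_model = lambda x: 5.0     # underfit hypothesis: ignores x entirely

errors = [constant_model(x) - y for x, y in zip(xs, ys)]
print("per-point prediction - actual:", errors)
print("mean absolute bias error:", sum(abs(e) for e in errors) / len(errors))
# The systematic gap persists no matter how much data we collect: the model
# family is too simple to capture the pattern, i.e. it has high bias.
```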
What is a Variance Error?
• Variance specifies the amount by which the prediction would change if different training data were used.
• In simple words, variance tells how much a random variable differs from its expected value.
• Ideally, a model should not vary too much from one training dataset to another, which means the algorithm should be good at capturing the hidden mapping between the input and output variables.
• Variance errors are classified as either low variance or high variance.
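The sketch below (synthetic data, invented parameters) estimates variance directly from its definition: fit the same one-parameter model on many resampled training sets and measure the spread of its prediction at a fixed test point.

```python
# Sketch: variance as how much predictions change across different training
# sets. We refit the same model many times and measure the spread.
import random
import statistics

random.seed(1)

def make_training_set():
    xs = [random.uniform(0, 10) for _ in range(20)]
    return [(x, 2 * x + random.gauss(0, 3)) for x in xs]   # y = 2x + noise

def fit_slope(data):
    """Least-squares slope for the model h(x) = w * x."""
    num = sum(x * y for x, y in data)
    den = sum(x * x for x, _ in data)
    return num / den

x_test = 5.0
predictions = [fit_slope(make_training_set()) * x_test for _ in range(200)]

print("mean prediction at x=5:", statistics.mean(predictions))
print("variance of predictions:", statistics.variance(predictions))
# Low variance: predictions barely move across training sets. A large spread
# would indicate high variance, a symptom of overfitting.
```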
Figure: an over-fitted model, showing model performance on (a) the training data and (b) new data.

Regularizing a Deep Network (Technique to prevent overfitting)
• Regularization is a technique which makes slight modifications to the learning algorithm such that the model generalizes better.
• This in turn improves the model's performance on unseen data.
• It reduces the complexity of the model.
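One common such modification (chosen here for illustration; the slides do not name a specific method) is L2 regularization, or weight decay: a penalty on large weights is added to the training loss, which effectively reduces the model's complexity. A minimal sketch, with an assumed penalty strength lam:

```python
# Sketch: L2 regularization (weight decay) as a "slight modification" to the
# training objective. The penalty strength lam is a hypothetical choice.

def empirical_risk(w, data):
    return sum((w * x - y) ** 2 for x, y in data) / len(data)

def regularized_risk(w, data, lam=0.5):
    # Original loss plus a penalty that discourages large (complex) weights.
    return empirical_risk(w, data) + lam * w**2

data = [(1.0, 2.0), (2.0, 4.1), (3.0, 5.9)]   # made-up training pairs

candidates = [c / 10 for c in range(0, 41)]   # w in 0.0 .. 4.0
w_plain = min(candidates, key=lambda w: empirical_risk(w, data))
w_reg   = min(candidates, key=lambda w: regularized_risk(w, data))

print("without regularization:", w_plain)     # fits the data as closely as possible
print("with L2 regularization:", w_reg)       # pulled toward 0, a simpler model
```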