
Lec8 (1)

The document discusses the concepts of bias and variance in machine learning, highlighting their impact on model accuracy and prediction errors. It explains the types of errors, their causes, and strategies to mitigate high bias and high variance, emphasizing the importance of the Bias-Variance trade-off in model development. The goal is to achieve a balance that allows the model to generalize well to unseen data while capturing the essential patterns in the training data.


Dr. Supriyo Mandal
Ph.D. (IIT Patna)
Postdoc (ZBW, University of Kiel, Germany)

Course code: CS31002 (L-T-P-Cr: 3-1-0-4)
Course Name: Machine Learning
Credits: 4
source: https://www.youtube.com/watch?v=EuBBz3bI-aA&ab_channel=StatQuestwithJoshStarmer
• In machine learning, an error is a measure of how accurately an algorithm can make predictions on a previously unseen dataset.
• On the basis of these errors, we select the machine learning model that performs best on the particular dataset.
There are mainly two types of errors in machine learning:
• Reducible errors: These errors can be reduced to improve the model's accuracy. They can be further decomposed into bias and variance.
• Irreducible errors: These errors will always be present in the model, regardless of the algorithm used.
• While making predictions, a difference occurs between the values predicted by the model and the actual values; this difference is known as bias error, or error due to bias.
• Bias can be defined as the inability of a machine learning algorithm, such as Linear Regression, to capture the true relationship between the data points.
• Every algorithm begins with some amount of bias, because bias arises from the assumptions the model makes to keep the target function simple to learn. A model has either:
• Low bias: A low-bias model makes fewer assumptions about the form of the target function.
• High bias: A high-bias model makes more assumptions, and as a result it fails to capture the important features of the dataset. A high-bias model also cannot perform well on new data.
• Generally, a linear algorithm has high bias, which makes it learn fast. The simpler the algorithm, the more bias it is likely to introduce, whereas a nonlinear algorithm often has low bias.
• Some examples of machine learning algorithms with low bias are Decision Trees, k-Nearest Neighbours, and Support Vector Machines.
• Algorithms with high bias include Linear Regression, Linear Discriminant Analysis, and Logistic Regression.
• High bias mainly occurs when the model is too simple.

• Below are some ways to reduce high bias:
  • Increase the number of input features, since the model is underfitting.
  • Decrease the regularization term.
  • Use a more complex model, for example by including polynomial features.
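The last remedy is easy to see with a small NumPy sketch (hypothetical data, not from the slides): a straight line fitted to quadratic data keeps a large training error no matter how it is fitted (high bias), while adding a degree-2 polynomial feature removes that error.

```python
import numpy as np

rng = np.random.default_rng(0)
x = np.linspace(-3, 3, 100)
y = x ** 2 + rng.normal(0, 0.5, size=x.size)  # quadratic signal plus noise

# High-bias model: a straight line cannot capture the curvature.
lin_coef = np.polyfit(x, y, deg=1)
lin_mse = np.mean((np.polyval(lin_coef, x) - y) ** 2)

# Adding a polynomial (degree-2) feature makes the model expressive enough.
quad_coef = np.polyfit(x, y, deg=2)
quad_mse = np.mean((np.polyval(quad_coef, x) - y) ** 2)

print(f"linear MSE:    {lin_mse:.3f}")
print(f"quadratic MSE: {quad_mse:.3f}")  # far smaller: the bias is gone
```

The remaining quadratic error is roughly the noise variance, i.e. the irreducible error mentioned earlier.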
• Variance specifies how much the prediction would change if a different training dataset were used.
• Ideally, a model should not vary too much from one training dataset to another, which means the algorithm should be good at understanding the hidden mapping between input and output variables.
• Variance errors are either low variance or high variance.
• Low variance means there is only a small variation in the prediction of the target function with changes in the training dataset.
• High variance, by contrast, shows a large variation in the prediction of the target function with changes in the training dataset.
• A model with high variance learns a lot and performs well on the training dataset, but does not generalize well to unseen data.
• As a result, such a model gives good results on the training dataset but shows high error rates on the test dataset.
• With high variance, the model learns too much from the dataset, which leads to overfitting.
• A model with high variance has the following problems:
  • A high-variance model leads to overfitting.
  • It increases model complexity.
  • Nonlinear algorithms, which have a lot of flexibility in fitting the data, usually have high variance.

• Some examples of machine learning algorithms with low variance are Linear Regression, Logistic Regression, and Linear Discriminant Analysis.
• Algorithms with high variance include Decision Trees, Support Vector Machines, and k-Nearest Neighbours.
• Below are some ways to reduce high variance:
  • Reduce the number of input features or parameters, since the model is overfitting.
  • Do not use an overly complex model.
  • Increase the amount of training data.
  • Increase the regularization term.
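The effect of a larger regularization term can be sketched with a minimal closed-form ridge regression in NumPy (an illustrative setup, not from the slides): the penalty shrinks the weight vector, which limits how wildly the fitted curve can wiggle between training sets.

```python
import numpy as np

rng = np.random.default_rng(1)

def ridge_fit(X, y, lam):
    """Closed-form ridge regression: w = (X'X + lam*I)^-1 X'y."""
    n_features = X.shape[1]
    return np.linalg.solve(X.T @ X + lam * np.eye(n_features), X.T @ y)

# A flexible model: degree-11 polynomial features on only 30 noisy points.
x = rng.uniform(-1, 1, 30)
y = np.sin(2 * x) + rng.normal(0, 0.3, x.size)
X = np.vander(x, 12)

w_free = ridge_fit(X, y, lam=0.0)   # unregularized: large, unstable weights
w_reg = ridge_fit(X, y, lam=1.0)    # regularized: shrunken weights

print(np.linalg.norm(w_free), np.linalg.norm(w_reg))
# Smaller weights mean a smoother fit that changes less from one
# training sample to another -- i.e. lower variance.
```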
• There are four possible combinations of bias and variance:
  • Low bias, low variance: This combination gives an ideal machine learning model. However, it is not achievable in practice.
  • Low bias, high variance: With low bias and high variance, model predictions are inconsistent but accurate on average. This case occurs when the model learns with a large number of parameters, which leads to overfitting.
  • High bias, low variance: With high bias and low variance, predictions are consistent but inaccurate on average. This case occurs when the model does not learn well from the training dataset or uses a small number of parameters. It leads to underfitting.
  • High bias, high variance: With high bias and high variance, predictions are inconsistent and also inaccurate on average.
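These regimes can be probed empirically. The sketch below (a hypothetical setup, not from the slides) refits two polynomial models on many independent training sets drawn from the same distribution, then estimates bias squared and variance of their predictions on a fixed grid: degree 1 lands in the high-bias/low-variance corner, degree 8 in the low-bias/high-variance corner.

```python
import numpy as np

rng = np.random.default_rng(2)
x_grid = np.linspace(-1, 1, 50)   # fixed evaluation points
f = lambda x: np.sin(3 * x)       # assumed true target function

def bias_variance(deg, n_datasets=200, n_points=30, noise=0.3):
    # Refit the model on many independent training sets and collect
    # its predictions on the fixed grid.
    preds = np.empty((n_datasets, x_grid.size))
    for i in range(n_datasets):
        x = rng.uniform(-1, 1, n_points)
        y = f(x) + rng.normal(0, noise, n_points)
        preds[i] = np.polyval(np.polyfit(x, y, deg), x_grid)
    bias_sq = np.mean((preds.mean(axis=0) - f(x_grid)) ** 2)  # systematic error
    variance = np.mean(preds.var(axis=0))                     # spread across refits
    return bias_sq, variance

b1, v1 = bias_variance(deg=1)   # high bias, low variance: underfits
b8, v8 = bias_variance(deg=8)   # low bias, high variance: overfits
print(f"deg 1: bias^2={b1:.3f}  variance={v1:.3f}")
print(f"deg 8: bias^2={b8:.3f}  variance={v8:.3f}")
```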
• High variance can be identified when the model has:
  • Low training error and high test error.
• High bias can be identified when the model has:
  • High training error, with the test error almost similar to the training error.
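The high-variance symptom is easy to reproduce in NumPy (hypothetical data, not from the slides): a degree-12 polynomial fitted to only 15 noisy points drives the training error toward zero while the test error stays high.

```python
import numpy as np

rng = np.random.default_rng(3)
f = lambda x: np.sin(3 * x)
x_train = rng.uniform(-1, 1, 15)
y_train = f(x_train) + rng.normal(0, 0.2, x_train.size)
x_test = rng.uniform(-1, 1, 200)
y_test = f(x_test) + rng.normal(0, 0.2, x_test.size)

# High-variance model: far too many parameters for 15 points.
coef = np.polyfit(x_train, y_train, deg=12)

train_err = np.mean((np.polyval(coef, x_train) - y_train) ** 2)
test_err = np.mean((np.polyval(coef, x_test) - y_test) ** 2)
print(f"train MSE: {train_err:.4f}")  # near zero: the noise is memorized
print(f"test  MSE: {test_err:.4f}")   # much larger: poor generalization
```

The diagnostic is the gap itself: with high bias instead, both errors would be high and close together.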
• While building a machine learning model, it is really important to take care of bias and variance in order to avoid overfitting and underfitting.
• If the model is very simple, with few parameters, it may have low variance and high bias.
• Whereas, if the model has a large number of parameters, it will have high variance and low bias.
• So, it is necessary to strike a balance between bias and variance errors, and this balance between the bias error and variance error is known as the Bias-Variance trade-off.
• For accurate predictions, an algorithm needs both low variance and low bias. But this is not possible, because bias and variance are related to each other:
  • If we decrease the variance, the bias will increase.
  • If we decrease the bias, the variance will increase.

• The Bias-Variance trade-off is a central issue in supervised learning.
• Ideally, we want a model that accurately captures the regularities in the training data and simultaneously generalizes well to unseen data.
• Unfortunately, achieving both at once is not possible: a high-variance algorithm may perform well on the training data, but it may overfit to noisy data.
• Whereas a high-bias algorithm produces a much simpler model that may not even capture important regularities in the data. So, we need to find a sweet spot between bias and variance to build an optimal model.
• Hence, the Bias-Variance trade-off is about finding the sweet spot that balances bias and variance errors.
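The sweet spot can be located numerically by sweeping model complexity and watching the test error, sketched here with polynomial degree as the complexity knob (a hypothetical setup, not from the slides): test error first falls as bias shrinks, then rises again as variance takes over.

```python
import numpy as np

rng = np.random.default_rng(4)
f = lambda x: np.sin(3 * x)
x_train = rng.uniform(-1, 1, 25)
y_train = f(x_train) + rng.normal(0, 0.2, x_train.size)
x_test = np.linspace(-1, 1, 200)
y_test = f(x_test)   # noise-free targets expose pure generalization error

# Sweep complexity: low degrees underfit, high degrees overfit.
test_errors = {}
for deg in range(1, 13):
    coef = np.polyfit(x_train, y_train, deg)
    test_errors[deg] = np.mean((np.polyval(coef, x_test) - y_test) ** 2)

best = min(test_errors, key=test_errors.get)
print("best degree:", best)
# The minimum test error sits at a moderate degree: that is the
# sweet spot between bias and variance.
```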
