
TE (CS)

Spring Semester 2024

Machine Learning (CS-324)

Lecture #33-34
Bias & Variance Tradeoff

Dr Syed Zaffar Qasim
Assistant Professor (CIS)

Bias & Variance


▪ Algorithm selection is an important step in forming an
accurate prediction model, but
o deploying an algorithm with high accuracy can be
a difficult balancing act.
▪ Each algorithm can produce vastly different models
o based on the hyperparameters provided,
o which can lead to dramatically different results.
▪ Hyperparameters are the algorithm’s settings,
o similar to the controls on the dashboard of an airplane,
o except hyperparameters are lines of code
(a small example is sketched below).
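▪ A minimal sketch of this idea (assuming scikit-learn is available; the dataset
and the max_depth values are invented for illustration): hyperparameters are
simply arguments set in code.

```python
# A minimal sketch (assuming scikit-learn): the same algorithm, configured with
# different hyperparameters, can yield very different models.
from sklearn.datasets import make_classification
from sklearn.model_selection import train_test_split
from sklearn.tree import DecisionTreeClassifier

X, y = make_classification(n_samples=500, n_features=10, random_state=0)
X_train, X_test, y_train, y_test = train_test_split(X, y, random_state=0)

# Two settings of the same "dashboard control" (max_depth):
shallow_tree = DecisionTreeClassifier(max_depth=2, random_state=0)    # simple, rigid
deep_tree = DecisionTreeClassifier(max_depth=None, random_state=0)    # complex, flexible

for name, model in [("max_depth=2", shallow_tree), ("max_depth=None", deep_tree)]:
    model.fit(X_train, y_train)
    print(name,
          "train acc:", round(model.score(X_train, y_train), 3),
          "test acc:", round(model.score(X_test, y_test), 3))
```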



Bias & Variance
▪ A constant challenge in machine learning is
o navigating underfitting and overfitting,
o which describe how closely the model follows the actual
patterns of the dataset.
▪ To understand underfitting and overfitting,
o one must first understand a model’s bias and variance,
o the two fundamental causes of prediction error.

Bias & Variance


▪ Bias refers to the gap between the predicted value and
the actual value.

Fig 1

▪ In the case of high bias,
o predictions are likely to be skewed in a certain direction,
o away from the actual values.
▪ Variance describes how scattered the different
predicted values are with respect to each other.



Bias & Variance
▪ Assume there are many training sets, all unique, but
equally representative of the population.
▪ A model with a high bias will produce similar errors
for an input regardless of the training set it was
trained with;
o the model is biased towards its own assumptions about
the real relationship
o over the relationship demonstrated in the training
data.
▪ A model with high variance, conversely, will produce
different errors for an input depending on the
training set it was trained with (see the sketch below).
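▪ A minimal sketch of this thought experiment (assuming NumPy and scikit-learn;
the cubic data-generating process and the two models are invented for
illustration): a high-bias model makes similar, systematically off predictions
regardless of the training set, while a high-variance model’s predictions swing
with each set.

```python
# A minimal sketch (assuming NumPy/scikit-learn): compare a high-bias and a
# high-variance model trained on many unique, equally representative training sets.
import numpy as np
from sklearn.linear_model import LinearRegression
from sklearn.tree import DecisionTreeRegressor

rng = np.random.default_rng(0)

def sample_training_set(n=50):
    X = rng.uniform(-3, 3, size=(n, 1))
    y = X[:, 0] ** 3 + rng.normal(scale=3.0, size=n)   # cubic relationship + noise
    return X, y

x_query = np.array([[2.0]])            # one fixed test input (true value is 8)
linear_preds, tree_preds = [], []
for _ in range(200):                   # many unique training sets
    X, y = sample_training_set()
    linear_preds.append(LinearRegression().fit(X, y).predict(x_query)[0])
    tree_preds.append(DecisionTreeRegressor().fit(X, y).predict(x_query)[0])

# The high-bias linear model gives similar (but systematically off) predictions;
# the high-variance tree gives predictions that vary strongly with the training set.
print("linear: mean", round(np.mean(linear_preds), 2), "std", round(np.std(linear_preds), 2))
print("tree:   mean", round(np.mean(tree_preds), 2), "std", round(np.std(tree_preds), 2))
```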

Bias & Variance


▪ A model with high bias is inflexible.
▪ A model with high variance may be so flexible that it
models the noise in the training set.
▪ A model with high variance over-fits the training data,
o while a model with high bias under-fits the training data.



Visualizing Bias and Variance with a dartboard
▪ Imagine that the center of the target, the bull’s-eye,
represents the correct value your model is trying to predict.

Fig 2

▪ The more the darts (predictions) deviate from the
bull’s-eye, the higher the bias.
▪ A model with high bias but low variance will throw darts
that are far from the bull's eye, but tightly clustered.
▪ A model with high bias and high variance will throw darts
all over the board; the darts are far from the bull's eye and
each other.

Bias and Variance of an Estimator

▪ Let X be a sample from a population specified up to a
parameter θ, and let d = d(X) be an estimator of θ.
▪ To evaluate the quality of this estimator, we can
measure how much it differs from θ, that is,
(d(X) − θ)².
▪ But since this is a random variable,
o we need to average it over possible X and
o consider r(d, θ), the mean square error of the estimator
d, defined as
r(d, θ) = E[(d(X) − θ)²]



Bias and Variance of an Estimator
▪ The mean square error can be rewritten as follows (d
is short for d(X)):
r(d, θ) = E[(d − θ)²]
        = E[(d − E[d] + E[d] − θ)²]
        = E[(d − E[d])²] + (E[d] − θ)²
o the cross term vanishes because E[d − E[d]] = 0.
▪ The first term is the variance of the estimator, Var(d);
the second is the square of its bias, bθ(d) = E[d] − θ.
▪ We then write the error as the sum of these two terms, the
variance and the square of the bias:
r(d, θ) = Var(d) + (bθ(d))²
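▪ As a quick check on this decomposition, a minimal simulation sketch (assuming
NumPy; the biased “shrinkage” estimator is chosen only for illustration):

```python
# A minimal sketch (assuming NumPy): verify r(d, θ) = Var(d) + bias² by simulation,
# using a deliberately biased shrinkage estimator of a normal mean θ.
import numpy as np

rng = np.random.default_rng(0)
theta, sigma, n = 5.0, 2.0, 20
estimates = []
for _ in range(100_000):                     # many samples X of size n
    X = rng.normal(theta, sigma, size=n)
    estimates.append(X.sum() / (n + 10))     # d(X): shrinks the usual mean towards 0

d = np.array(estimates)
mse = np.mean((d - theta) ** 2)              # r(d, θ) = E[(d − θ)²]
var = np.var(d)                              # Var(d)
bias = np.mean(d) - theta                    # bθ(d) = E[d] − θ

print("MSE          =", round(mse, 4))
print("Var + bias^2 =", round(var + bias ** 2, 4))   # agrees up to simulation noise
```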

Underfitting and Overfitting


▪ Mismanaging the bias-variance trade-off can result in
the model becoming
o overly simple and inflexible (underfitting) or
o overly complex and flexible (overfitting).

▪ Underfitting results from low variance and high bias.
▪ Overfitting means high variance and low bias.




Underfitting
▪ Underfitting is when the model is too simple to capture
the real complexity of patterns in the data,
o e.g. when trying to fit a line to data sampled from a
third-order polynomial (sketched after this list).
▪ It can lead to inaccurate predictions for both the
training data and the test data.
▪ Common causes of underfitting:
o insufficient training data to cover all possible patterns
o the training and test data not being properly randomized.
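▪ A minimal sketch of the line-to-cubic example (assuming NumPy and scikit-learn;
the data-generating polynomial is invented for illustration): the straight line
is inaccurate on both the training data and the test data.

```python
# A minimal sketch (assuming NumPy/scikit-learn): a degree-1 fit to data sampled
# from a third-order polynomial underfits; a degree-3 fit matches the data.
import numpy as np
from sklearn.linear_model import LinearRegression
from sklearn.metrics import mean_squared_error
from sklearn.pipeline import make_pipeline
from sklearn.preprocessing import PolynomialFeatures

rng = np.random.default_rng(1)
X = rng.uniform(-3, 3, size=(200, 1))
y = 0.5 * X[:, 0] ** 3 - X[:, 0] + rng.normal(scale=1.0, size=200)
X_train, y_train, X_test, y_test = X[:100], y[:100], X[100:], y[100:]

for degree in (1, 3):                       # degree 1 underfits; degree 3 fits
    model = make_pipeline(PolynomialFeatures(degree), LinearRegression())
    model.fit(X_train, y_train)
    print("degree", degree,
          "train MSE:", round(mean_squared_error(y_train, model.predict(X_train)), 2),
          "test MSE:", round(mean_squared_error(y_test, model.predict(X_test)), 2))
```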

Overfitting
▪ A natural temptation is
o to add complexity to the model
o in order to improve accuracy,
o which can, in turn, lead to overfitting.

▪ Overfitting typically occurs when


o a model, besides learning the underlying function,
o also perfectly learns to classify noisy training
examples.



Overfitting
▪ A model that memorizes noise or coincidence in the
data fails to achieve good generalization ability.
▪ This happens, for example, when fitting a sixth-order
polynomial to noisy data sampled from a third-order
polynomial (sketched after this list).
▪ An overfitted model will yield accurate predictions
on the training data but prove less accurate at
formulating predictions on the test data.
▪ Overfitting can also occur if
o the training and test data aren’t randomized before
they are split and
o patterns in the data aren’t distributed across the two
segments of data.
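▪ A minimal sketch of the sixth-order example (assuming NumPy and scikit-learn;
the small noisy dataset is invented for illustration): training error is low
while test error is high.

```python
# A minimal sketch (assuming NumPy/scikit-learn): a sixth-order polynomial fitted to
# a small, noisy sample from a third-order polynomial memorizes the noise.
import numpy as np
from sklearn.linear_model import LinearRegression
from sklearn.metrics import mean_squared_error
from sklearn.pipeline import make_pipeline
from sklearn.preprocessing import PolynomialFeatures

rng = np.random.default_rng(2)
X_train = rng.uniform(-3, 3, size=(12, 1))            # deliberately small training set
y_train = 0.5 * X_train[:, 0] ** 3 - X_train[:, 0] + rng.normal(scale=2.0, size=12)
X_test = rng.uniform(-3, 3, size=(200, 1))
y_test = 0.5 * X_test[:, 0] ** 3 - X_test[:, 0] + rng.normal(scale=2.0, size=200)

for degree in (3, 6):                                  # degree 6 chases the noise
    model = make_pipeline(PolynomialFeatures(degree), LinearRegression())
    model.fit(X_train, y_train)
    print("degree", degree,
          "train MSE:", round(mean_squared_error(y_train, model.predict(X_train)), 2),
          "test MSE:", round(mean_squared_error(y_test, model.predict(X_test)), 2))
```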


Overfitting in polynomial classifiers


▪ The eight training examples in Fig. fall into two groups.

▪ The two classes are linearly separable, but noise has caused
one negative example to be mislabeled as positive.
▪ The high-order polynomial on the right overfits the data,
o ignoring the possibility of noise,
o in an attempt to avoid any error on the training set.
▪ The ideal solution often lies somewhere between the extremes
of linear classifiers and high-order polynomials.
▪ The best choice can be determined experimentally
(a cross-validation sketch follows).
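▪ A minimal sketch of that experiment (assuming scikit-learn; the dataset and the
single mislabeled point are invented for illustration): cross-validation compares
polynomial degrees of a classifier so the compromise can be chosen experimentally.

```python
# A minimal sketch (assuming scikit-learn): choose the polynomial degree of a
# classifier experimentally with cross-validation, on linearly separable data
# that contains one mislabeled (noisy) example.
import numpy as np
from sklearn.model_selection import cross_val_score
from sklearn.svm import SVC

rng = np.random.default_rng(3)
X = rng.uniform(-1, 1, size=(80, 2))
y = (X[:, 0] + X[:, 1] > 0).astype(int)   # a linearly separable concept
y[0] = 1 - y[0]                           # noise: one example is mislabeled

for degree in (1, 2, 6):                  # linear, mild, and high-order polynomials
    clf = SVC(kernel="poly", degree=degree, C=100.0)
    scores = cross_val_score(clf, X, y, cv=5)
    print("degree", degree, "mean CV accuracy:", round(scores.mean(), 3))
```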



Overcoming underfitting and overfitting
▪ To eradicate both underfitting and overfitting,
o modify the model’s hyperparameters to ensure that
the model fits patterns
o in both the training and test data and
o not just one half of the data.
▪ This may also mean re-randomizing the training and
test data or adding new data points so as to better
detect underlying patterns.
▪ However, in most instances, we probably need to
consider switching algorithms or
o modifying hyperparameters based on trial and error
o to minimize and manage the bias-variance trade-off
(a grid-search sketch follows).
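▪ One way to do that systematically, as a minimal sketch (assuming scikit-learn;
the dataset and parameter grid are invented for illustration): re-randomize the
split and grid-search the hyperparameters.

```python
# A minimal sketch (assuming scikit-learn): shuffle/re-split the data and tune
# hyperparameters by trial and error with a cross-validated grid search.
from sklearn.datasets import make_regression
from sklearn.model_selection import GridSearchCV, train_test_split
from sklearn.tree import DecisionTreeRegressor

X, y = make_regression(n_samples=400, n_features=5, noise=10.0, random_state=0)
# shuffle=True re-randomizes the data before splitting it into train and test
X_train, X_test, y_train, y_test = train_test_split(X, y, shuffle=True, random_state=42)

param_grid = {"max_depth": [2, 4, 8, None], "min_samples_leaf": [1, 5, 20]}
search = GridSearchCV(DecisionTreeRegressor(random_state=0), param_grid, cv=5)
search.fit(X_train, y_train)

print("best hyperparameters:", search.best_params_)
print("test R^2:", round(search.score(X_test, y_test), 3))
```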

Underfitting and Overfitting

▪ Specifically, this might entail switching from linear
regression to non-linear regression to reduce bias by
increasing variance.
▪ Or it could mean increasing “k” in k-NN to reduce
variance (by averaging together more neighbors).
▪ A third example could be reducing variance by
switching from a single decision tree (which is prone
to overfitting) to a random forest with many decision
trees, as in the sketch below.
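▪ A minimal sketch of the third example (assuming scikit-learn; the dataset and
settings are invented for illustration): averaging many trees trades a little
bias for a large drop in variance.

```python
# A minimal sketch (assuming scikit-learn): reduce variance by switching from a
# single (overfitting-prone) decision tree to a random forest of many trees.
from sklearn.datasets import make_classification
from sklearn.ensemble import RandomForestClassifier
from sklearn.model_selection import train_test_split
from sklearn.tree import DecisionTreeClassifier

X, y = make_classification(n_samples=600, n_features=20, n_informative=5,
                           flip_y=0.1, random_state=0)
X_train, X_test, y_train, y_test = train_test_split(X, y, random_state=0)

tree = DecisionTreeClassifier(random_state=0).fit(X_train, y_train)
forest = RandomForestClassifier(n_estimators=200, random_state=0).fit(X_train, y_train)

# The single tree typically fits the training set almost perfectly but generalizes
# worse; the forest averages many trees and usually holds up better on the test set.
for name, model in [("single tree", tree), ("random forest", forest)]:
    print(name,
          "train acc:", round(model.score(X_train, y_train), 3),
          "test acc:", round(model.score(X_test, y_test), 3))
```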




Regularization
▪ Regularization is another effective strategy to combat
overfitting and underfitting (Breiman 1998).
▪ In this approach, we write an augmented error function
E = error_on_data + λ × model_complexity
▪ The second term penalizes complex models with large
variance, where λ gives the weight of this penalty.
▪ When we minimize the augmented error function, E,
o instead of the error on data only,
o we penalize complex models and
o thus decrease variance,
o at the cost of amplifying the bias error
(a ridge-regression sketch follows).
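▪ Ridge regression is one concrete instance of this idea; a minimal sketch
(assuming scikit-learn), where model_complexity is the squared size of the
weights and λ is passed as the alpha argument:

```python
# A minimal sketch (assuming scikit-learn): ridge regression minimizes an augmented
# error of the form  error_on_data + λ × ||w||²,  with λ given by alpha.
import numpy as np
from sklearn.linear_model import Ridge
from sklearn.metrics import mean_squared_error
from sklearn.model_selection import train_test_split
from sklearn.pipeline import make_pipeline
from sklearn.preprocessing import PolynomialFeatures

rng = np.random.default_rng(4)
X = rng.uniform(-3, 3, size=(60, 1))
y = 0.5 * X[:, 0] ** 3 - X[:, 0] + rng.normal(scale=2.0, size=60)
X_train, X_test, y_train, y_test = train_test_split(X, y, random_state=0)

for lam in (0.001, 1.0, 100.0):            # larger λ penalizes complexity more heavily
    model = make_pipeline(PolynomialFeatures(degree=9), Ridge(alpha=lam))
    model.fit(X_train, y_train)
    print("λ =", lam,
          "train MSE:", round(mean_squared_error(y_train, model.predict(X_train)), 2),
          "test MSE:", round(mean_squared_error(y_test, model.predict(X_test)), 2))
```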


Regularization
▪ If λ is taken too large,
o only very simple models are allowed and
o we risk introducing bias.
▪ λ is optimized using cross-validation (see the sketch below).
▪ In effect, this add-on parameter acts as a guard
o to keep high variance in check
o while the original parameters are being optimized.
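▪ A minimal sketch of that cross-validation (assuming scikit-learn), continuing
the ridge example above: λ is swept over a grid and the best value is chosen by
5-fold cross-validation.

```python
# A minimal sketch (assuming scikit-learn): optimize λ (alpha) by cross-validation.
import numpy as np
from sklearn.linear_model import Ridge
from sklearn.model_selection import GridSearchCV
from sklearn.pipeline import Pipeline
from sklearn.preprocessing import PolynomialFeatures

rng = np.random.default_rng(5)
X = rng.uniform(-3, 3, size=(60, 1))
y = 0.5 * X[:, 0] ** 3 - X[:, 0] + rng.normal(scale=2.0, size=60)

pipe = Pipeline([("poly", PolynomialFeatures(degree=9)), ("ridge", Ridge())])
lambdas = {"ridge__alpha": np.logspace(-3, 3, 13)}     # candidate values of λ
search = GridSearchCV(pipe, lambdas, cv=5)             # 5-fold cross-validation
search.fit(X, y)

print("best λ:", search.best_params_["ridge__alpha"])
print("cross-validated R^2 at best λ:", round(search.best_score_, 3))
```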


