
Lecture 06

Practical Machine Learning

Slides taken from Andrew Ng's course


Agenda
• Problems encountered when applying ML to
  real-world problems, and advice on how to
  tackle them
  – The problem of overfitting
    • Regularization
  – Model Selection
    • How to choose a hypothesis given multiple hypotheses
  – Bias-Variance Tradeoff
  – Precision-Recall

Recap
• Supervised Learning
– Decision Trees
– Linear Regression
– Logistic Regression
– Practical ML
Practical ML
The problem of overfitting
Machine Learning

Example: Linear regression (housing prices)
[Figure: three Price vs. Size fits, ranging from an underfit straight line, to a good quadratic fit, to an overfit high-order polynomial]

Overfitting: if we have too many features, the learned hypothesis
may fit the training set very well ($J(\theta) \approx 0$), but fail
to generalize to new examples (e.g., predicting prices for new houses).
Example: Logistic regression
[Figure: three decision boundaries in the (x1, x2) plane: underfit, just right, and overfit]
($h_\theta(x) = g(\theta^T x)$, where $g$ is the sigmoid function)
Practical ML
Regularization

Machine Learning
Intuition
[Figure: two Price vs. Size-of-house fits: a quadratic fit and a wiggly higher-order fit]

Suppose we penalize $\theta_3$ and $\theta_4$ and make them really small:

$\min_\theta \; \frac{1}{2m}\sum_{i=1}^{m}\left(h_\theta(x^{(i)}) - y^{(i)}\right)^2 + 1000\,\theta_3^2 + 1000\,\theta_4^2$

With $\theta_3 \approx 0$ and $\theta_4 \approx 0$, the quartic hypothesis behaves almost like a quadratic.
Regularization

Small values for the parameters $\theta_1, \ldots, \theta_n$:
― "Simpler" hypothesis
― Less prone to overfitting

Housing:
― Features: $x_1, x_2, \ldots, x_n$
― Parameters: $\theta_0, \theta_1, \ldots, \theta_n$

Regularized cost function:

$J(\theta) = \frac{1}{2m}\left[\sum_{i=1}^{m}\left(h_\theta(x^{(i)}) - y^{(i)}\right)^2 + \lambda\sum_{j=1}^{n}\theta_j^2\right]$
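To make the penalty concrete, here is a minimal NumPy sketch of this regularized cost. The function and variable names (regularized_cost, lam) are our own, and we follow the usual convention of leaving the intercept $\theta_0$ out of the penalty:

```python
import numpy as np

def regularized_cost(theta, X, y, lam):
    """Regularized linear-regression cost J(theta) -- a sketch.

    X is an (m, n+1) design matrix whose first column is all ones,
    y is an (m,) target vector, and lam is the regularization
    strength lambda. theta[0] (the intercept) is not penalized.
    """
    m = len(y)
    residuals = X @ theta - y                          # h_theta(x) - y per example
    fit_term = (residuals @ residuals) / (2 * m)       # squared-error term
    reg_term = lam * np.sum(theta[1:] ** 2) / (2 * m)  # penalty on theta_1..theta_n
    return fit_term + reg_term
```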
Regularization

[Figure: Price vs. Size of house; the regularized fit is smoother than the unregularized high-order fit]

In regularized linear regression, we choose $\theta$ to minimize $J(\theta)$.

What if $\lambda$ is set to an extremely large value (perhaps too large
for our problem, say $\lambda = 10^{10}$)? All of $\theta_1, \ldots, \theta_n$ are penalized
toward zero, and the hypothesis underfits:

[Figure: Price vs. Size of house; with $\lambda$ too large the fit flattens to $h_\theta(x) \approx \theta_0$]
Practical ML
Regularized linear regression
Machine Learning

Regularized linear regression
Gradient descent
Repeat {
  $\theta_0 := \theta_0 - \alpha\,\frac{1}{m}\sum_{i=1}^{m}\left(h_\theta(x^{(i)}) - y^{(i)}\right)x_0^{(i)}$
  $\theta_j := \theta_j - \alpha\left[\frac{1}{m}\sum_{i=1}^{m}\left(h_\theta(x^{(i)}) - y^{(i)}\right)x_j^{(i)} + \frac{\lambda}{m}\,\theta_j\right]$   $(j = 1, \ldots, n)$
}
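A matching NumPy sketch of one update of this loop (names are ours; X follows the same design-matrix convention as the cost sketch above, so theta[0] is updated without the shrinkage term):

```python
import numpy as np

def gradient_step(theta, X, y, alpha, lam):
    """One regularized gradient-descent update -- a sketch."""
    m = len(y)
    residuals = X @ theta - y
    grad = (X.T @ residuals) / m        # unregularized gradient, all j
    grad[1:] += (lam / m) * theta[1:]   # add (lambda/m) * theta_j for j >= 1
    return theta - alpha * grad
```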
Practical ML
Regularized logistic regression
Machine Learning

Regularized logistic regression
[Figure: a complex decision boundary in the (x1, x2) plane that regularization smooths out]

Cost function:

$J(\theta) = -\frac{1}{m}\sum_{i=1}^{m}\left[y^{(i)}\log h_\theta(x^{(i)}) + (1 - y^{(i)})\log\left(1 - h_\theta(x^{(i)})\right)\right] + \frac{\lambda}{2m}\sum_{j=1}^{n}\theta_j^2$

Gradient descent
Repeat {
  $\theta_0 := \theta_0 - \alpha\,\frac{1}{m}\sum_{i=1}^{m}\left(h_\theta(x^{(i)}) - y^{(i)}\right)x_0^{(i)}$
  $\theta_j := \theta_j - \alpha\left[\frac{1}{m}\sum_{i=1}^{m}\left(h_\theta(x^{(i)}) - y^{(i)}\right)x_j^{(i)} + \frac{\lambda}{m}\,\theta_j\right]$   $(j = 1, \ldots, n)$
}
The update looks identical to regularized linear regression's, but here $h_\theta(x) = \frac{1}{1 + e^{-\theta^T x}}$.
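A hedged NumPy sketch of this cost (names are ours; eps is a small constant we add to avoid log(0), not something from the slides):

```python
import numpy as np

def sigmoid(z):
    """Logistic function g(z) = 1 / (1 + e^-z)."""
    return 1.0 / (1.0 + np.exp(-z))

def logistic_cost(theta, X, y, lam):
    """Regularized logistic-regression cost -- a sketch."""
    m = len(y)
    h = sigmoid(X @ theta)              # h_theta(x) for every example
    eps = 1e-12                         # numerical guard against log(0)
    nll = -(y @ np.log(h + eps) + (1 - y) @ np.log(1 - h + eps)) / m
    reg = lam * np.sum(theta[1:] ** 2) / (2 * m)
    return nll + reg
```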
Practical ML
Model Selection
Machine Learning

Evaluating the hypothesis
Dataset (split it, e.g., ~70% of rows for training and the remaining ~30% for testing):

Size (feet²) | Price ($1000s)
2104         | 400
1600         | 330
2400         | 369
1416         | 232
3000         | 540
1985         | 300
1534         | 315
1427         | 199
1380         | 212
1494         | 243
Training/testing procedure for linear regression

- Learn parameter $\theta$ from the training data (by minimizing the
  training error $J_{train}(\theta)$)
- Compute the test set error:

$J_{test}(\theta) = \frac{1}{2m_{test}}\sum_{i=1}^{m_{test}}\left(h_\theta(x_{test}^{(i)}) - y_{test}^{(i)}\right)^2$
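In code, the test error is the same squared-error formula evaluated on the held-out rows. A minimal sketch (the function name is ours):

```python
import numpy as np

def squared_error(theta, X, y):
    """J(theta) = (1/2m) * sum of squared residuals -- a sketch.

    Pass (X_train, y_train) for the training error or
    (X_test, y_test) for the test error J_test(theta).
    """
    m = len(y)
    residuals = X @ theta - y
    return (residuals @ residuals) / (2 * m)
```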
Model selection

Models trained using the training data
(e.g., polynomials of degree d = 1, 2, …, 10)

How do we choose the best model (the degree d of the polynomial
features, in this case)?
• Report the test set error $J_{test}(\theta)$?
• But if d itself is chosen to minimize the test set error, $J_{test}(\theta)$ is
  likely to be an optimistic estimate of the generalization error.

The same problem is encountered when trying to find the
optimal regularization parameter $\lambda$.
Evaluating your hypothesis
Dataset (now split three ways, e.g., ~60% training, ~20% cross-validation, ~20% test):

Size (feet²) | Price ($1000s)
2104         | 400
1600         | 330
2400         | 369
1416         | 232
3000         | 540
1985         | 300
1534         | 315
1427         | 199
1380         | 212
1494         | 243
Train/validation/test error
Training error:
$J_{train}(\theta) = \frac{1}{2m}\sum_{i=1}^{m}\left(h_\theta(x^{(i)}) - y^{(i)}\right)^2$

Cross validation error:
$J_{cv}(\theta) = \frac{1}{2m_{cv}}\sum_{i=1}^{m_{cv}}\left(h_\theta(x_{cv}^{(i)}) - y_{cv}^{(i)}\right)^2$

Test error:
$J_{test}(\theta) = \frac{1}{2m_{test}}\sum_{i=1}^{m_{test}}\left(h_\theta(x_{test}^{(i)}) - y_{test}^{(i)}\right)^2$
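With the three-way split, model selection becomes: fit each candidate degree on the training set, pick the degree with the lowest cross-validation error, and only then report $J_{test}(\theta)$ once as the generalization estimate. A hedged sketch (select_degree is our own name; np.polyfit/np.polyval perform the least-squares fit):

```python
import numpy as np

def select_degree(x_train, y_train, x_cv, y_cv, max_degree=10):
    """Pick the polynomial degree d with the lowest CV error -- a sketch."""
    best_d, best_err = 1, float("inf")
    for d in range(1, max_degree + 1):
        coeffs = np.polyfit(x_train, y_train, d)  # fit on training data only
        preds = np.polyval(coeffs, x_cv)          # evaluate on the CV set
        err = np.mean((preds - y_cv) ** 2) / 2    # J_cv for this degree
        if err < best_err:
            best_d, best_err = d, err
    return best_d
```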
Practical ML
Bias-Variance Tradeoff
Machine Learning

Bias: error from erroneous assumptions in the learning
algorithm (underfitting).

Variance: error from sensitivity to small fluctuations in the
training set. High variance can cause an algorithm to model
the random noise in the training data rather than the
intended output (overfitting).
Bias/variance
[Figure: three Price vs. Size fits: high bias (underfit), "just right", and high variance (overfit)]
Bias/variance
Training error:
$J_{train}(\theta) = \frac{1}{2m}\sum_{i=1}^{m}\left(h_\theta(x^{(i)}) - y^{(i)}\right)^2$

Cross validation error:
$J_{cv}(\theta) = \frac{1}{2m_{cv}}\sum_{i=1}^{m_{cv}}\left(h_\theta(x_{cv}^{(i)}) - y_{cv}^{(i)}\right)^2$
Diagnosing bias vs. variance
Suppose your learning algorithm is performing less well than
you were hoping ($J_{cv}(\theta)$ or $J_{test}(\theta)$ is high). Is it a bias
problem or a variance problem?

[Figure: training error and cross-validation error plotted against the
degree of polynomial d: $J_{train}$ keeps decreasing as d grows, while
$J_{cv}$ first falls and then rises again]

Bias (underfit):
  $J_{train}(\theta)$ will be high
  $J_{train}(\theta) \approx J_{cv}(\theta)$

Variance (overfit):
  $J_{train}(\theta)$ will be low
  $J_{train}(\theta) \ll J_{cv}(\theta)$
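These two rules can be phrased as a crude programmatic check. In this sketch the function name, the target_error baseline, and the 2x gap ratio are all our own illustrative choices, not thresholds from the slides:

```python
def diagnose(j_train, j_cv, target_error=0.1):
    """Rough bias-vs-variance heuristic -- a sketch, not a recipe."""
    if j_train > target_error and j_cv <= 2 * j_train:
        return "high bias (underfit): J_train high and close to J_cv"
    if j_train <= target_error and j_cv > 2 * j_train:
        return "high variance (overfit): J_train low, J_cv much higher"
    return "unclear: compare both errors against your desired error level"
```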
Practical ML
Regularization and bias/variance
Machine Learning

Linear regression with regularization
Model: $h_\theta(x) = \theta_0 + \theta_1 x + \theta_2 x^2 + \theta_3 x^3 + \theta_4 x^4$

[Figure: three Price vs. Size fits at different regularization strengths]
Large $\lambda$: high bias (underfit)  |  Intermediate $\lambda$: "just right"  |  Small $\lambda$: high variance (overfit)
Choosing the regularization parameter $\lambda$
Model:
$h_\theta(x) = \theta_0 + \theta_1 x + \theta_2 x^2 + \theta_3 x^3 + \theta_4 x^4$
$J(\theta) = \frac{1}{2m}\sum_{i=1}^{m}\left(h_\theta(x^{(i)}) - y^{(i)}\right)^2 + \frac{\lambda}{2m}\sum_{j=1}^{n}\theta_j^2$

1. Try $\lambda = 0$
2. Try $\lambda = 0.01$
3. Try $\lambda = 0.02$
4. Try $\lambda = 0.04$
5. Try $\lambda = 0.08$
   ⋮
12. Try $\lambda \approx 10$ (roughly doubling at each step)

For each $\lambda$, minimize $J(\theta)$ on the training set, then keep the $\lambda$
whose $\theta$ gives the lowest cross-validation error $J_{cv}(\theta)$.
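A hedged sketch of this search, reusing the gradient_step and squared_error sketches from earlier; the learning rate alpha and the n_steps budget are arbitrary illustrative values:

```python
import numpy as np

# Doubling grid: 0, 0.01, 0.02, 0.04, ..., ~10.24 (12 values, as above).
lambdas = [0.0] + [0.01 * 2 ** k for k in range(11)]

def select_lambda(X_train, y_train, X_cv, y_cv, alpha=0.01, n_steps=5000):
    """Keep the lambda whose fitted theta has the lowest CV error -- a sketch."""
    best_lam, best_err = None, float("inf")
    for lam in lambdas:
        theta = np.zeros(X_train.shape[1])
        for _ in range(n_steps):                # minimize J(theta) for this lambda
            theta = gradient_step(theta, X_train, y_train, alpha, lam)
        err = squared_error(theta, X_cv, y_cv)  # CV error uses NO penalty term
        if err < best_err:
            best_lam, best_err = lam, err
    return best_lam
```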
Bias/variance as a function of the regularization parameter
[Figure: $J_{train}$ and $J_{cv}$ vs. $\lambda$: $J_{train}$ grows as $\lambda$ increases, while $J_{cv}$ is high for very small $\lambda$ (variance) and very large $\lambda$ (bias), with a sweet spot in between]
Debugging a learning algorithm:
Suppose you have implemented regularized linear regression to predict
housing prices. However, when you test your hypothesis on a new set of
houses, you find that it makes unacceptably large errors in its
predictions. What should you try next?

- Get more training examples      → fixes high variance
- Try smaller sets of features    → fixes high variance
- Try getting additional features → fixes high bias
- Try adding polynomial features  → fixes high bias
- Try decreasing $\lambda$        → fixes high bias
- Try increasing $\lambda$        → fixes high variance
Practical ML
Error metric for skewed classes: Precision/Recall
Machine Learning
Cancer classification example
Train a logistic regression model $h_\theta(x)$ ($y = 1$ if cancer,
$y = 0$ otherwise).
Find that you got 1% error on the test set
(99% correct diagnoses).

But only 0.50% of patients actually have cancer, so a classifier that
always predicts $y = 0$ would achieve just 0.5% error. With classes
this skewed, accuracy alone is a misleading metric.
Precision/Recall
($y = 1$ in the presence of the rare class that we want to detect)

                         Actual class
                         1                  0
Predicted class   1      True positive      False positive
                  0      False negative     True negative

Precision: of all patients where we predicted $y = 1$, what fraction
actually has cancer?
$\text{Precision} = \frac{\text{True pos.}}{\text{True pos.} + \text{False pos.}}$

Recall: of all patients that actually have cancer, what fraction did we
correctly detect as having cancer?
$\text{Recall} = \frac{\text{True pos.}}{\text{True pos.} + \text{False neg.}}$
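A small NumPy sketch of both metrics (the function name is ours; returning 0 when a denominator is zero is a common but arbitrary convention):

```python
import numpy as np

def precision_recall(y_true, y_pred):
    """Precision and recall for the rare positive class -- a sketch.

    y_true and y_pred are NumPy arrays of 0/1 labels.
    """
    tp = np.sum((y_pred == 1) & (y_true == 1))  # true positives
    fp = np.sum((y_pred == 1) & (y_true == 0))  # false positives
    fn = np.sum((y_pred == 0) & (y_true == 1))  # false negatives
    precision = tp / (tp + fp) if tp + fp > 0 else 0.0
    recall = tp / (tp + fn) if tp + fn > 0 else 0.0
    return precision, recall
```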
Trading off precision and recall

Logistic regression: $0 \le h_\theta(x) \le 1$
Predict 1 if $h_\theta(x) \ge 0.5$
Predict 0 if $h_\theta(x) < 0.5$

Suppose we want to predict $y = 1$ (cancer) only if very confident
(set a high threshold, e.g., 0.9):
→ Higher precision, lower recall.

Suppose we want to avoid missing too many cases of cancer, i.e.,
avoid false negatives (set a low threshold, e.g., 0.3):
→ Higher recall, lower precision.

More generally: predict 1 if $h_\theta(x) \ge$ threshold.
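The trade-off can be seen directly by sweeping the threshold. A sketch reusing precision_recall from above (the threshold grid is our own choice; probs stands for the model outputs $h_\theta(x)$):

```python
import numpy as np

def pr_curve(y_true, probs, thresholds=(0.3, 0.5, 0.7, 0.9)):
    """(threshold, precision, recall) triples across thresholds -- a sketch."""
    curve = []
    for t in thresholds:
        y_pred = (probs >= t).astype(int)  # predict 1 only when confident enough
        curve.append((t, *precision_recall(y_true, y_pred)))
    return curve  # raising t typically trades recall away for precision
```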
F1 Score (F score):
$F_1 = 2\,\frac{PR}{P + R}$

How should we compare precision/recall numbers?

              Precision (P)   Recall (R)   Average   F1 Score
Algorithm 1   0.5             0.4          0.45      0.444
Algorithm 2   0.7             0.1          0.4       0.175
Algorithm 3   0.02            1.0          0.51      0.0392

The plain average $(P + R)/2$ would rank Algorithm 3 highest even though
it is nearly useless (e.g., predicting $y = 1$ for everyone); the F1 score,
a harmonic mean, correctly penalizes such extreme values.
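A one-liner reproduces the table's last column, which is one way to sanity-check the formula:

```python
def f1_score(p, r):
    """F1 = 2PR / (P + R): a harmonic mean, so one tiny value drags it down."""
    return 2 * p * r / (p + r) if p + r > 0 else 0.0

# Matches the table: f1_score(0.5, 0.4) -> 0.444...,
# f1_score(0.7, 0.1) -> 0.175, f1_score(0.02, 1.0) -> ~0.0392
```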


Summary
• Problems encountered when applying ML to
  real-world problems, and advice on how to
  tackle them
  – The problem of overfitting
    • Regularization
  – Model Selection
    • How to choose a hypothesis given multiple hypotheses
  – Bias-Variance Tradeoff
  – Precision-Recall
