Week 4 Lecture Slides BUS265 2023

This document discusses evaluating machine learning models to avoid overfitting. It introduces concepts like training and test sets, cross-validation, and various performance metrics. A case study on predicting used car prices is presented to illustrate model building and selection. Different regression models are fit to the car price data and their performance on training and test sets is compared to select the best model.

BUS265 Machine Learning and Digital Technology
Lecture 4: Building a Machine Learning Model for Prediction

Dr Valentin Danchev
School of Business and Management
Queen Mary University of London
Model performance

• When building a supervised model, how do I know that my model is any good?
• With powerful, flexible algorithms searching for patterns or models, there is a serious danger of overfitting.
  - Overfitting is sometimes a difficult concept.
  - The general idea is that "if you look hard enough, you'll find something", even if it does not generalize beyond the particular training data.
  - "If you torture the data long enough, it will confess" (Ronald Coase).

2
Machine learning process

3
Model performance
• Generalisation: we want models to apply not just to the exact training set, but to the general population from which the training data came.

4
Model performance

• There is no single choice or procedure that will eliminate over-fitting.
  - Recognize over-fitting and manage complexity in a principled way.

5
Model complexity via geometric interpretation
3 models
(which do you prefer?)

6
How can we judge whether our modeling has overfit?

(Figure: under-fitting, good fit, and over-fitting)

7
Tool for model performance evaluation:
The fitting curve
(Figure: fitting curve, ranging from under-fitting through a good fit to over-fitting)
• Over-fitting: the model "memorizes" the properties of the particular training set rather than learning the underlying phenomenon.
• In-sample evaluation is in favour of "memorizing".
• On the training data such a memorizing model would look best, but on new data it would perform badly.

8
Finding the best-fitting model

Holdout data = Test data

9
Holdout validation

We are interested in generalization: the performance on data not used for training.
Given only one data set, we hold out some data for evaluation.
• The holdout set used for final evaluation is called the test set.
• Accuracy on the training data is sometimes called "in-sample" accuracy, vs. "out-of-sample" accuracy on the test data.
  - a.k.a. "holdout accuracy"
  - an estimate of "generalization accuracy"

10
Holdout validation — simple hold-out set
Partition the data into a training set and a test set (e.g. 2/3 to 1/3, or 80% to 20%); a minimal code sketch follows below.
• In some domains it makes sense to partition temporally (training set before time t, test set after time t).

Challenges:
1. What if, by accident, you selected a particularly easy or hard test set?
2. Do you have an idea of the variation in model accuracy due to training? What would the model accuracy be if you selected a different training set?
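A minimal sketch of this hold-out split using scikit-learn, with a small toy stand-in for the used-cars data (the column names here are illustrative assumptions, not necessarily those used in the Colab notebook):

    import pandas as pd
    from sklearn.model_selection import train_test_split

    # Toy stand-in for the used-cars data (hypothetical columns)
    cars = pd.DataFrame({
        "age": [1, 3, 5, 7, 9, 11, 2, 4, 6, 8],                    # years
        "odometer": [10, 40, 65, 90, 120, 150, 25, 55, 80, 105],   # thousand miles
        "price": [18, 14, 11, 9, 7, 5, 16, 12, 10, 8],             # thousand dollars
    })

    X = cars[["age", "odometer"]]
    y = cars["price"]

    # 80% / 20% train/test partition; random_state makes the split reproducible
    X_train, X_test, y_train, y_test = train_test_split(X, y, test_size=0.2, random_state=42)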

11
Holdout validation — Cross-validation (CV)
(Figure: 4-fold CV)
• Partition the data into k "folds" (randomly).
• Run the training/test evaluation k times.

12
Holdout validation — Cross-validation (CV)

• Each fold is the test set once (the remaining folds are combined to form the training set).
• Eventually the model is tested on all the data (each data point exactly once).
• We can compute the average and variance of the accuracy measure(s) across folds.
• Better use of a limited dataset: CV computes its estimates over all the data (see the sketch below).
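A minimal sketch of k-fold cross-validation with scikit-learn, using toy data in place of the used-cars dataset (the model and data are illustrative assumptions, not the notebook's exact setup):

    import numpy as np
    from sklearn.linear_model import LinearRegression
    from sklearn.model_selection import cross_val_score

    # Toy data standing in for used-car features (age, odometer) and prices
    X = np.array([[1, 10], [3, 40], [5, 65], [7, 90], [9, 120], [11, 150], [2, 25], [4, 55]])
    y = np.array([18, 14, 11, 9, 7, 5, 16, 12])

    # 4-fold CV: each fold serves as the test set once.
    # scikit-learn returns negative MSE for the "neg_mean_squared_error" scoring.
    scores = cross_val_score(LinearRegression(), X, y, cv=4, scoring="neg_mean_squared_error")
    mse_per_fold = -scores

    print("MSE per fold:", mse_per_fold)
    print("Mean MSE:", mse_per_fold.mean(), "Variance:", mse_per_fold.var())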

13
How to choose the model
Measuring predictive ability

Metrics to evaluate classification models
• Accuracy: the fraction of predictions our model got right.
• But in business applications, different errors (different decisions) have different costs and benefits associated with them.

14
Metrics to evaluate classification models—Accuracy

• Accuracy is the fraction of predictions our model got right.
• But in business applications, different errors (different decisions) have different costs and benefits associated with them.
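As a standard formula (using the confusion-matrix counts defined on the next slide), accuracy over all predictions is:

\[
\text{Accuracy} = \frac{\text{number of correct predictions}}{\text{total number of predictions}} = \frac{TP + TN}{TP + TN + FP + FN}
\]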

15
Metrics to evaluate classification models: Confusion matrix
A confusion matrix (also called a "contingency table") represents the different sorts of errors made by a classification model. Entries are counts of correct classifications and counts of errors.

                     Actual +      Actual -
    Predicted Y      True +        False +
    Predicted N      False -       True -

Not all errors are equal: think about a False Negative (False-) result in medicine, which indicates that a person does not have a specific disease/condition when the person actually does have that disease/condition. More on classification next week…
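A minimal sketch of computing a confusion matrix with scikit-learn on hypothetical labels (1 = has the condition, 0 = does not); note that scikit-learn puts actual classes on the rows and predicted classes on the columns, the transpose of the table above:

    from sklearn.metrics import confusion_matrix

    # Hypothetical true and predicted labels for ten cases
    y_true = [1, 0, 1, 1, 0, 0, 1, 0, 1, 0]
    y_pred = [1, 0, 0, 1, 0, 1, 1, 0, 0, 0]

    # With labels 0 and 1, the result is laid out as:
    # [[TN, FP],
    #  [FN, TP]]
    print(confusion_matrix(y_true, y_pred))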

16
Metrics to evaluate regression models
• R-squared (R²)
  - Does not take the overfitting problem into account (Adjusted R² accounts for the number of predictors).
  - Used for classical in-sample evaluation.
• Mean Squared Error (MSE)
• Root Mean Squared Error (RMSE): the square root of the MSE.
• Bayesian information criterion (BIC): includes a penalty term for the number of parameters in the model, to address overfitting.
• The closer the model predictions are to the observations, the smaller the MSE/RMSE/BIC.
• Models with lower MSE/RMSE/BIC are generally preferred (see the sketch below).
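A minimal sketch of computing MSE, RMSE and R² with scikit-learn and numpy on hypothetical observed and predicted prices (BIC is normally read off a fitted model, e.g. in statsmodels, so it is not computed here):

    import numpy as np
    from sklearn.metrics import mean_squared_error, r2_score

    # Hypothetical observed and predicted used-car prices (thousand dollars)
    y_test = np.array([18, 14, 11, 9, 7])
    y_pred = np.array([17, 15, 10, 9, 8])

    mse = mean_squared_error(y_test, y_pred)   # mean of squared errors
    rmse = np.sqrt(mse)                        # same units as the target
    r2 = r2_score(y_test, y_pred)              # proportion of variance explained

    print(f"MSE = {mse:.2f}, RMSE = {rmse:.2f}, R^2 = {r2:.3f}")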

17
CASE STUDY: Predicting Used Car Value

Link to the Colab notebook:
https://colab.research.google.com/drive/1AgaUxnarPm5Vdk3fQ1jZlyX_noD3Dixr?usp=sharing

18
Price cars

19
Prediction setup

20
Prediction setup

21
Prediction setup

22
Loss function

23
Square loss

24
Mean Squared Error (MSE)
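In symbols (standard definitions, consistent with the slides' use of squared loss), for a prediction ŷ of an observed value y:

\[
L(\hat{y}, y) = (\hat{y} - y)^2,
\qquad
\mathrm{MSE} = \frac{1}{n}\sum_{i=1}^{n} (\hat{y}_i - y_i)^2,
\qquad
\mathrm{RMSE} = \sqrt{\mathrm{MSE}}
\]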

25
Case study: used cars data

26
Case study - used cars: features

27
Case study: models by hand

28
Case study: Car price model results

29
External validity, avoiding overfitting and model
selection

30
Underfit, overfit

31
Underfitting and overfitting the original data

32
Overfitting

33
Reason for overfitting

34
Model fit evaluation

35
Model fit evaluation

36
Finding the best model by best fit and penalty

37
Finding the best model by training and test samples

38
5-fold cross-validation

39
Finding the best model by cross-validation
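A minimal sketch (an illustrative assumption, not the notebook's exact code) of choosing among candidate regression models of increasing complexity by cross-validated RMSE, preferring the model with the lowest value:

    import numpy as np
    from sklearn.linear_model import LinearRegression
    from sklearn.model_selection import cross_val_score
    from sklearn.pipeline import make_pipeline
    from sklearn.preprocessing import PolynomialFeatures

    # Toy stand-in for the used-car data: age (years) vs. price (thousand dollars)
    X = np.array([[1], [2], [3], [4], [5], [6], [7], [8], [9], [10]])
    y = np.array([18, 16, 14, 13, 11, 10, 9, 8, 7, 6])

    # Candidate models: polynomial regressions of increasing degree
    for degree in (1, 2, 3, 4):
        model = make_pipeline(PolynomialFeatures(degree), LinearRegression())
        neg_mse = cross_val_score(model, X, y, cv=5, scoring="neg_mean_squared_error")
        rmse = np.sqrt(-neg_mse).mean()
        print(f"degree {degree}: cross-validated RMSE = {rmse:.2f}")
    # Select the candidate with the lowest cross-validated RMSE.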

40
Case study: Model selection

41
Case study: Model selection

42
Case study: Model selection

Model 4: the lowest RMSE on the test sample

43
Acknowledgements

• Courses/slides by Foster Provost, Panos Adamopoulos, Karolis Urbonas, Leonid Zhukov, Mladen Kolar, John Kelleher, Chirag Shah, Gabor Bekes and Gabor Kezdi

44
Thank you
