Supervised Learning Overview, Formulation, Train-Test Split
EE514 – CS535
Zubair Khalid
https://www.zubairkhalid.org/ee514_2021.html
Machine Learning: Overview
What is Machine Learning?
Given examples (training data), make a machine learn the system's behavior or discover patterns in the data.
[Diagram] Data (given to us) → Algorithm (we need to design it) → Model f(x) (the final output, which enables us to make predictions)
Algorithms vs Model
[Diagram] All labeled data → Model → Prediction
Supervised Learning
Regression
Regression: quantitative prediction on a continuous scale.
Examples: prediction of
- the age of a person from his/her photo
- the price of a 10-marla, 5-bedroom house in 2050
- the USD/PKR exchange rate after one week
- the efficacy of the Pfizer Covid vaccine
- the average temperature/rainfall during the monsoon
- the cumulative score in the ML course
- the probability of a decrease in electricity prices in Pakistan
- the number of steps per day

Q: What do all these problems have in common?
A: Continuous outputs.
To formulate such problems, we need:
- Features (input): x ∈ R^d, a vector of d measured attributes for each sample.
- Labels (output): y, the quantity we want to predict.
- Training data: D = {(x_1, y_1), …, (x_n, y_n)}, a set of n labeled examples.
Supervised Learning Setup
Formulation
Given training data D = {(x_i, y_i)}_{i=1}^n, where x_i ∈ R^d are the features and y_i ∈ Y are the labels, learn a function h: R^d → Y such that h(x_i) ≈ y_i and, more importantly, h predicts well on unseen inputs.
Regression: the output space is continuous, Y = R.
Classification: the output space is a discrete set of labels, e.g., Y = {1, 2, …, k}.
Example
MNIST data:
- Each sample is a 28×28-pixel grayscale image of a handwritten digit, so x ∈ R^784 and y ∈ {0, 1, …, 9}.
- 60,000 training samples
- 10,000 test samples
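A sample can be turned into a feature vector by flattening the image. A minimal sketch using a random stand-in array (the real pixels would be loaded from the MNIST files; the label value is illustrative):

```python
import numpy as np

# A stand-in for one MNIST sample: the real pixels would come from the
# MNIST files; here a random array of the same shape and dtype is assumed.
image = np.random.randint(0, 256, size=(28, 28), dtype=np.uint8)

# Flatten the 28x28 image into a feature vector x of dimension 784,
# scaling pixel intensities to [0, 1].
x = image.reshape(-1).astype(np.float64) / 255.0
y = 7  # a label is a digit in {0, 1, ..., 9}; 7 is just an illustrative value

print(x.shape)  # (784,)
```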
Learning
Learning means finding a function h that fits the training data D well.
Hypothesis Class
Q: How do we search over all possible functions h?
A: We cannot search over all functions. Instead, we restrict the search to a hypothesis class H, a set of candidate functions (e.g., linear functions, decision trees, neural networks), and pick h ∈ H.
Q: How do we evaluate the performance of a hypothesis h?
A: With a loss function, which measures how far the predictions h(x_i) are from the true labels y_i.
Zero-one loss: L(h, D) = (1/n) Σ_{i=1}^n 1[h(x_i) ≠ y_i]
Interpretation:
- Note the normalization by the number of samples; this makes it the loss per sample, i.e., the fraction of mistakes.
- The loss function counts the mistakes made by the hypothesis h on the dataset D.
- Not used frequently for learning due to its non-differentiability and discontinuity.
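The zero-one loss above can be sketched in a few lines, assuming the predictions and labels are array-like:

```python
import numpy as np

def zero_one_loss(y_pred, y_true):
    """Fraction of samples on which the predicted label differs from the true one."""
    y_pred = np.asarray(y_pred)
    y_true = np.asarray(y_true)
    return float(np.mean(y_pred != y_true))

# One mistake out of four samples gives a per-sample loss of 0.25.
loss = zero_one_loss([1, 0, 2, 1], [1, 0, 2, 2])
print(loss)  # 0.25
```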
Squared loss: L(h, D) = (1/n) Σ_{i=1}^n (h(x_i) − y_i)^2
Interpretation:
- Again, note the normalization by the number of samples.
- The loss is always nonnegative.
- The loss grows quadratically with the absolute error in each sample, so large errors are penalized heavily.
Root Mean Squared Error (RMSE):
RMSE is just the square root of the squared loss: RMSE = sqrt(L(h, D)).
Absolute loss: L(h, D) = (1/n) Σ_{i=1}^n |h(x_i) − y_i|
Interpretation:
- The loss grows linearly with the absolute value of the error in each prediction.
- Used in regression; well suited to noisy data, as it is less sensitive to outliers than the squared loss.
Learning can then be posed as an optimization problem: find h* = argmin_{h ∈ H} L(h, D).
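The regression losses above (squared, RMSE, absolute) can be sketched directly from their definitions:

```python
import numpy as np

def squared_loss(y_pred, y_true):
    """Mean of (h(x_i) - y_i)^2: penalizes large errors quadratically."""
    e = np.asarray(y_pred, dtype=float) - np.asarray(y_true, dtype=float)
    return float(np.mean(e ** 2))

def rmse(y_pred, y_true):
    """Root mean squared error: the square root of the squared loss."""
    return float(np.sqrt(squared_loss(y_pred, y_true)))

def absolute_loss(y_pred, y_true):
    """Mean of |h(x_i) - y_i|: grows linearly with each error."""
    e = np.asarray(y_pred, dtype=float) - np.asarray(y_true, dtype=float)
    return float(np.mean(np.abs(e)))

y_pred = [2.0, 3.0, 5.0]
y_true = [1.0, 3.0, 3.0]
# Per-sample errors are 1, 0, 2: absolute loss is (1 + 0 + 2) / 3 = 1.0,
# squared loss is (1 + 0 + 4) / 3.
print(absolute_loss(y_pred, y_true))  # 1.0
```

Note how the sample with error 2 contributes 4 to the squared loss but only 2 to the absolute loss; this is why the absolute loss is less sensitive to outliers.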
Recall
Q: How can we ensure that the hypothesis h will give low loss on inputs not in D?
Interpretation:
- 0% loss on the training data (the model fits every data point in D exactly).
- Large error for some inputs not in D.
- This is our first glimpse of overfitting.
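This effect can be sketched numerically; the data, polynomial degrees, and seed below are illustrative assumptions, not from the slides:

```python
import numpy as np

rng = np.random.default_rng(0)

# Six noisy training points from an underlying line y = 2x (assumed toy data).
x_train = np.linspace(0.0, 1.0, 6)
y_train = 2.0 * x_train + 0.1 * rng.standard_normal(6)

# A degree-5 polynomial through 6 points interpolates them: zero training loss.
overfit_coeffs = np.polyfit(x_train, y_train, deg=5)
train_loss = np.mean((np.polyval(overfit_coeffs, x_train) - y_train) ** 2)

# A degree-1 fit has nonzero training loss but behaves far better
# on inputs not in the training set.
line_coeffs = np.polyfit(x_train, y_train, deg=1)

print(train_loss)  # numerically zero: the model fits every training point
```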
Revisit:
Q: How can we ensure that the hypothesis h will give low loss on inputs not in D?
A: Train/Test Split
Generalization: The Train-Test Split
Split the labeled data into a training set and a held-out test set. You may use the test dataset only once, after deciding on the model using the training dataset.
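A minimal sketch of such a split, assuming the data is held in numpy arrays (the helper name and fraction are illustrative, not from the slides):

```python
import numpy as np

def train_test_split(X, y, test_fraction=0.2, seed=0):
    """Shuffle the samples, then hold out a fraction as the test set.
    Hypothetical helper for illustration, not from the slides."""
    rng = np.random.default_rng(seed)
    idx = rng.permutation(len(y))
    n_test = int(round(test_fraction * len(y)))
    test_idx, train_idx = idx[:n_test], idx[n_test:]
    return X[train_idx], y[train_idx], X[test_idx], y[test_idx]

X = np.arange(20).reshape(10, 2)  # 10 samples with 2 features each
y = np.arange(10)
X_train, y_train, X_test, y_test = train_test_split(X, y)
print(len(y_train), len(y_test))  # 8 2
```

Shuffling before splitting matters: if the data is ordered (e.g., by label), an unshuffled split gives a test set that does not represent the training distribution.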
Learning (revisited after the train-test split): find h ∈ H that minimizes the loss on the training set only.
Evaluation: report the loss of the learned h on the held-out test set.
Generalization loss: the expected loss of h on new samples drawn from the same distribution as the data; the test loss serves as an estimate of it.
Generalization: The Train-Test Split
Q: What if we want to compare several models before touching the test set?
Idea: hold out a validation set from the training data. The validation data is used to evaluate the loss for a function h that is determined by learning on the training dataset. If the loss on the validation data is high for a given h, the hypothesis or model needs to be changed.
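The idea above can be sketched as model selection by validation loss; the data, candidate degrees, and seed are illustrative assumptions:

```python
import numpy as np

# Assumed toy data: a noisy line y = 2x + noise.
rng = np.random.default_rng(1)
x = np.linspace(0.0, 1.0, 30)
y = 2.0 * x + 0.1 * rng.standard_normal(30)

# Shuffle, then split: 20 training samples, 10 validation samples.
idx = rng.permutation(30)
x_train, y_train = x[idx[:20]], y[idx[:20]]
x_val, y_val = x[idx[20:]], y[idx[20:]]

def val_loss(degree):
    """Squared loss on the validation set for a degree-`degree` polynomial fit."""
    coeffs = np.polyfit(x_train, y_train, deg=degree)
    return float(np.mean((np.polyval(coeffs, x_val) - y_val) ** 2))

# If the validation loss of a hypothesis is high, change the hypothesis:
# here we simply keep the candidate degree with the lowest validation loss.
best_degree = min([1, 3, 9], key=val_loss)
```

The test set plays no role in this loop; it is reserved for a single final evaluation of the chosen model.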
Generalization: The Train-Test Split
More explanation* to better understand the difference between validation and test data:
- Training set: a set of examples used for learning, that is, to fit the parameters of the hypothesis (model).
- Validation set: a set of examples used to tune the hyperparameters of the model, i.e., to choose between candidate hypotheses.
- Test set: a set of examples used only to assess the performance of the final, fully specified model.
Adapted from *Brian Ripley, Pattern Recognition and Neural Networks, 1996
Generalization: The Train-Test Split (Example)
• https://www.cs.cornell.edu/courses/cs4780/2018fa/
Email: [email protected]