Week 4 - Intro to ML

Lecture 1: Introduction to Predictive Analytics & Machine Learning


Introduction: What is Machine Learning?
Machine learning is a set of methods for automatically detecting patterns in data and using them to predict future data and guide decision making. In other words, learning from data.

Why use Machine Learning?
• Machine learning outperforms traditional solutions that require a lot of fine-tuning or hand-written rules, e.g. ML spam filters vs. traditional rule-based logic.
• Machine learning adapts to changing environments and new data, which reduces the time required compared with traditional approaches.
• Machine learning can solve problems that are highly complex and/or involve large amounts of data.
Applications of Machine Learning

• Analyzing images to detect anomalies or faults on a production line
• Detecting brain tumors in brain scans
• Classifying topics using NLP
• Forecasting sales
• Chatbots for customized and quick interactions with customers
• Recommending products based on buyer behavior
Types of machine learning
• Supervised learning: the objective is to learn a function to predict an output variable Y based on observed input variables (also called features) x1, . . . , xp. We develop methods that learn this function based on labelled data, which we call the training data.
• Unsupervised learning: we are given only inputs, and the goal is to find "interesting" patterns in this data. It is used for clustering.
Supervised learning
In supervised learning, the output or response variable can be of any type. However, most methods address two main classes of supervised learning problems:
• In regression, the response is a quantitative scalar (such as the income of a worker).
• In classification, the response is a nominal or categorical variable Y ∈ {1, . . . , C}, where C is the number of classes. When C = 2, this is called binary classification; when C > 2, it is called multiclass classification.
Supervised Learning
• Some important algorithms:
  • Linear regression
  • Logistic regression
  • Support Vector Machines (SVMs)
  • Decision Trees & Random Forests
  • K-Nearest Neighbors (KNN)
Examples

• Regression:
  • Predicting the income of an individual
  • Number of Covid-19 patients in the next 2 months
  • House prices
  • Sales forecasting (units / value)

• Classification:
  • Cancer vs. no cancer
  • Fraud vs. secure
  • Churn vs. no churn
  • Good customer vs. bad customer
Supervised Learning
Predict house prices:
• Y = house price
• Y = f(X1, X2, X3, …, Xn)
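A minimal sketch of this setup with scikit-learn; the feature names (size, bedrooms, age) and the numbers are illustrative assumptions, not data from the lecture:

```python
# Minimal sketch: learn f so that price ≈ f(X1, ..., Xn).
# Features and prices below are made up for illustration.
import numpy as np
from sklearn.linear_model import LinearRegression

# Columns: size (sqm), bedrooms, age (years) -- hypothetical features
X = np.array([[80, 2, 30],
              [120, 3, 10],
              [95, 2, 5],
              [150, 4, 20]])
y = np.array([250_000, 410_000, 330_000, 520_000])  # labels: observed prices

model = LinearRegression().fit(X, y)   # estimate f from labelled training data
print(model.predict([[100, 3, 15]]))   # predicted price for an unseen house
```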
Unsupervised Learning
• The training data is unlabelled.
• Some important algorithms:
  • Principal Component Analysis (PCA)
  • K-Means
  • t-distributed Stochastic Neighbor Embedding (t-SNE)
  • Isolation Forests
  • Apriori
Unsupervised learning
• Example: finding similar passengers (clustering), as sketched below.
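A minimal sketch of this clustering task with K-Means; the passenger features (age, fare paid) are illustrative assumptions:

```python
# Minimal sketch: group unlabelled passengers into "similar" clusters.
# Features (age, fare) are made up; note that no labels are used.
import numpy as np
from sklearn.cluster import KMeans

passengers = np.array([[22, 7.3],
                       [38, 71.3],
                       [26, 7.9],
                       [35, 53.1],
                       [54, 51.9],
                       [28, 8.1]])

kmeans = KMeans(n_clusters=2, n_init=10, random_state=0).fit(passengers)
print(kmeans.labels_)           # cluster assignment for each passenger
print(kmeans.cluster_centers_)  # the "typical" passenger of each cluster
```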
Data mining
• Data mining is the process of extracting interesting and previously unknown patterns and relations from large databases, drawing on the fields of machine learning, statistics, and database technology.
• In data mining, the analysis should be both useful and understandable to the data owner.
Data science
• Data science is a multidisciplinary field that combines knowledge and skills from statistics, machine learning, software engineering, data visualization, and domain expertise (in our case, business expertise) to uncover value from large and diverse data sets.
• Data scientists often work directly with stakeholders (say, product managers) to translate data analysis results into action.
Data analysis process

1. Problem formulation.

2. Data collection and preparation.

3. Exploratory data analysis (EDA).

4. Model building, estimation, and selection.

5. Model evaluation.

6. Communicate results.
Evaluating model performance
• Training set: for exploratory data analysis, model building, model
estimation, model selection, etc.

• Test set: for model evaluation


Training and test data
• Because we are interested in estimating how well a model will predict future data, the test set should be kept in a "vault" and brought out strictly at the end of the analysis. The test set does not lead to model revisions.
• We generally allocate 70–80% of the data to the training sample.
• A higher proportion of training data leads to more accurate model estimation, but higher variance in estimating the expected loss.
• The split of the data into the training and test sets is often random, but sometimes there are reasons to consider alternative schemes.
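A minimal sketch of such a split, using scikit-learn's train_test_split on placeholder data:

```python
# Minimal sketch: hold out a test set and touch it only at the very end.
import numpy as np
from sklearn.model_selection import train_test_split

X = np.random.rand(1000, 5)   # placeholder features
y = np.random.rand(1000)      # placeholder response

# 80% for training (EDA, model building, selection); 20% stays in the "vault"
X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.2, random_state=42)
```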
Data mismatch
• The validation or test set should be as representative as possible of the data that the model will "see" in production.
• Don't test it on apples and productionize it to predict oranges.
Key concepts

• The bias-variance trade-off and model selection.

• Overfitting.

• Parametric vs non-parametric models.

• No-free-lunch theorem.

• Accuracy vs interpretability.
Simple vs complex model
[Figure: two fits compared. Left: high bias / low variance (underfitting). Right: high variance / low bias (overfitting).]
Another example
[Figure: a second underfitting vs overfitting comparison.]
Bias-variance tradeoff
• An important decision for a data scientist is choosing the model complexity.
• A very complex model can give accurate predictions on the training set but fail miserably on the test set.
• A very simple model can give bad predictions on both the training set and the test set.
• How do we balance the two?
The bias-variance trade-off
• Mathematically, we can show that:
  Error = Bias² + Variance + Irreducible error
• We would like our model to be flexible enough to approximate (possibly) complex relationships between Y and X.
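Written out for squared-error loss at a point x, with f̂ the fitted model and σ² the noise variance, the standard form of this decomposition is:

```latex
\mathbb{E}\big[(Y - \hat{f}(x))^2\big]
  = \underbrace{\bigl(\mathbb{E}[\hat{f}(x)] - f(x)\bigr)^2}_{\text{bias}^2}
  + \underbrace{\operatorname{Var}\bigl[\hat{f}(x)\bigr]}_{\text{variance}}
  + \underbrace{\sigma^2}_{\text{irreducible error}}
```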
Bias-variance tradeoff
• Typically, the more complex we make the model, the better its approximation capabilities, which translates into lower bias.
• On the other hand, increasing model complexity leads to higher variance. This is due to the larger (effective) number of parameters to estimate.
• Hence, we would like to find the optimal (problem-specific) model complexity, the one that minimises our expected loss at the bottom of the validation curve.
Bias-variance
• Increasing model complexity will always reduce the training error, but there is an optimal level of complexity that minimises the test error, as the sketch below illustrates.
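A minimal sketch of that pattern on simulated data: training error keeps falling as the polynomial degree grows, while test error falls and then rises again.

```python
# Minimal sketch: training error falls monotonically with complexity,
# while test error is U-shaped. All data here is simulated.
import numpy as np
from sklearn.pipeline import make_pipeline
from sklearn.preprocessing import PolynomialFeatures
from sklearn.linear_model import LinearRegression
from sklearn.metrics import mean_squared_error
from sklearn.model_selection import train_test_split

rng = np.random.default_rng(0)
X = rng.uniform(-3, 3, size=(200, 1))
y = np.sin(X).ravel() + rng.normal(0, 0.3, size=200)  # nonlinear truth + noise
X_tr, X_te, y_tr, y_te = train_test_split(X, y, test_size=0.3, random_state=0)

for degree in [1, 3, 5, 10, 15]:
    model = make_pipeline(PolynomialFeatures(degree), LinearRegression())
    model.fit(X_tr, y_tr)
    print(degree,
          mean_squared_error(y_tr, model.predict(X_tr)),  # keeps decreasing
          mean_squared_error(y_te, model.predict(X_te)))  # U-shaped
```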
Model selection
• Model selection is a set of methods (such as cross-validation) that allow us to choose the right model among options of different complexity. It will be a fundamental part of our methodology.
• We conduct model selection on the training data.
• Model selection also includes hyperparameter tuning, which we will cover in detail later.
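A minimal sketch of model selection by cross-validation, comparing candidate complexities (here, the hyperparameter k of a KNN regressor) on the training data only; the data is a placeholder:

```python
# Minimal sketch: choose among models of different complexity by 5-fold CV,
# using only the training data. Data below is a placeholder.
import numpy as np
from sklearn.model_selection import cross_val_score
from sklearn.neighbors import KNeighborsRegressor

X_train = np.random.rand(300, 4)   # placeholder training features
y_train = np.random.rand(300)      # placeholder training response

for k in [1, 5, 15, 50]:           # candidate complexities to compare
    scores = cross_val_score(KNeighborsRegressor(n_neighbors=k),
                             X_train, y_train, cv=5)
    print(k, scores.mean())        # pick the k with the best CV score
```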
Overfitting
• We say that there is overfitting when an estimated model is excessively flexible, incorporating minor variations in the training data that are likely to be noise rather than predictive patterns.
• An overfit model has small training errors but may predict poorly. In essence, it has memorized the training set.
• Not being misled by overfitting is an important reason why we use a test set.
Illustration
• This example uses data extracted from the fueleconomy.gov website run by the US government, which lists different estimates of fuel economy for passenger cars and trucks.
• For each vehicle in the dataset, we have information on various characteristics such as engine displacement and number of cylinders, along with laboratory measurements for the city and highway miles per gallon (MPG) of the car.
• We here consider the unadjusted highway MPG for 2010 cars as the response variable, and a single predictor, engine displacement.
Example
• A scatter plot reveals a nonlinear association between the two variables. We therefore need a model that is sufficiently flexible to capture this nonlinearity.
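A hedged sketch of fitting such a flexible model. The file name vehicles.csv and the column names year, displ, and hwy are assumptions about how a fueleconomy.gov extract might be stored, not the lecture's actual code:

```python
# Hedged sketch: flexible (cubic polynomial) fit of highway MPG on engine
# displacement. File and column names are assumptions, not from the lecture.
import pandas as pd
from sklearn.pipeline import make_pipeline
from sklearn.preprocessing import PolynomialFeatures
from sklearn.linear_model import LinearRegression

cars = pd.read_csv("vehicles.csv")       # hypothetical fueleconomy.gov extract
cars_2010 = cars[cars["year"] == 2010]

X = cars_2010[["displ"]]                 # single predictor: displacement
y = cars_2010["hwy"]                     # response: unadjusted highway MPG

# Degree-3 polynomial: flexible enough for the curvature in the scatter plot
model = make_pipeline(PolynomialFeatures(degree=3), LinearRegression())
model.fit(X, y)
```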
Parametric vs non-parametric approach
• Parametric models assume a specific underlying form for the relationship we want to estimate; for example, we decide in advance whether to fit a linear regression or a polynomial regression. You know what the regression line will look like.
• Non-parametric models do not assume any underlying statistical distribution for the dataset. Hence, you let the data decide what the function will be.
• Decision makers mostly prefer parametric models because a parametric model is easier to estimate, predictions are easier to make, a story can be told around it, and its estimates have better statistical properties than those of non-parametric regression.
Parametric vs non-parametric
• Here is a picture showing both parametric and non-parametric regression results. OLS (the linear regression line) predicts a negative relationship between X and Y, while the non-parametric estimate fits a "highly wiggly" function to the data (most of the time you can choose the smoothness of the function). A code sketch contrasting the two follows below.
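A minimal sketch contrasting the two approaches on simulated data: OLS commits to a straight line, while a non-parametric KNN regressor lets the data shape the function.

```python
# Minimal sketch: parametric (OLS line) vs non-parametric (KNN) regression
# on the same simulated data.
import numpy as np
from sklearn.linear_model import LinearRegression
from sklearn.neighbors import KNeighborsRegressor

rng = np.random.default_rng(1)
X = rng.uniform(0, 10, size=(150, 1))
y = np.sin(X).ravel() + rng.normal(0, 0.2, size=150)  # wiggly truth + noise

ols = LinearRegression().fit(X, y)                   # assumes a straight line
knn = KNeighborsRegressor(n_neighbors=10).fit(X, y)  # no functional form assumed

print(ols.predict([[2.5]]), knn.predict([[2.5]]))
# n_neighbors acts as the smoothness knob mentioned in the slide above
```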
Challenges in Machine Learning
"Bad Data" and "Bad Algorithm"
• Insufficient training data
• Nonrepresentative training data
• Poor quality of data: missing values, errors, and noise!
• Overfitting/underfitting the data