Supervised Machine Learning
By Dr. Raivrajsinh S. Vaghela
Outline
• Basics of Supervised Learning
• Prediction
• Classification
• Understanding Datasets
• Feature Selection
• Feature Normalization
• Data Cleaning
• Training, Testing & Validation Sets
Basics of Supervised Learning

• Supervised learning uses a training set to teach models to yield the
desired output.
• This training dataset includes inputs and correct outputs, which allow
the model to learn over time.
• The algorithm measures its accuracy through the loss function,
adjusting until the error has been sufficiently minimized.
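As a rough illustration of what a loss function measures, the sketch below computes mean squared error for two candidate models. The data, the model form y = w * x, and the function name are assumptions chosen for the example; the slides do not name a specific loss.

```python
def mean_squared_error(predictions, targets):
    """Average of the squared differences between predictions and targets."""
    if len(predictions) != len(targets):
        raise ValueError("predictions and targets must have the same length")
    return sum((p - t) ** 2 for p, t in zip(predictions, targets)) / len(targets)


# Hypothetical training set: inputs with their correct outputs (here y = 2x).
inputs = [1.0, 2.0, 3.0, 4.0]
targets = [2.0, 4.0, 6.0, 8.0]

# Two candidate models of the assumed form y = w * x.
loss_poor = mean_squared_error([1.0 * x for x in inputs], targets)
loss_good = mean_squared_error([2.0 * x for x in inputs], targets)

print(loss_poor)  # 7.5
print(loss_good)  # 0.0
```

A lower loss means the predictions sit closer to the correct outputs, which is the signal training uses to decide how to adjust the model.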
Supervised Learning
• In data mining, supervised learning problems fall into two types:
classification and regression.
Classification
• Classification uses an algorithm to accurately assign test data into specific
categories.
• It recognizes specific entities within the dataset and attempts to draw some
conclusions on how those entities should be labeled or defined.
• Common classification algorithms include:
• Linear classifiers,
• Support vector machines (SVM),
• Decision trees,
• k-nearest neighbors,
• Random forests.
Classification
• Classification is a type of supervised learning that categorizes input
data into predefined labels. It involves training a model on labeled
examples to learn patterns between input features and output
classes. In classification, the target variable is a categorical value. For
example, a model might classify emails as spam or not spam.
• The model’s goal is to generalize this learning to make accurate
predictions on new, unseen data. Algorithms like Decision Trees,
Support Vector Machines, and Neural Networks are commonly used
for classification tasks.
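One of the algorithms listed earlier, k-nearest neighbors, is small enough to sketch in plain Python. This is a minimal illustration under assumptions: the two email features, the training data, and the function name `knn_predict` are all hypothetical.

```python
from collections import Counter
import math


def knn_predict(train_features, train_labels, query, k=3):
    """Label a query point by majority vote among its k nearest training examples."""
    distances = sorted(
        (math.dist(features, query), label)
        for features, label in zip(train_features, train_labels)
    )
    nearest_labels = [label for _, label in distances[:k]]
    return Counter(nearest_labels).most_common(1)[0][0]


# Hypothetical features per email: (number of links, mentions of "free").
train_features = [(8, 5), (7, 6), (9, 4), (0, 0), (1, 0), (0, 1)]
train_labels = ["spam", "spam", "spam", "not spam", "not spam", "not spam"]

print(knn_predict(train_features, train_labels, (6, 5)))  # spam
print(knn_predict(train_features, train_labels, (1, 1)))  # not spam
```

The two query emails are new, unseen inputs; the model labels them by comparing them with the labeled examples it was given.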
Regression
• Regression is used to understand the relationship between dependent
and independent variables.
• It is commonly used to make projections, such as for sales revenue
for a given business.
• Linear regression and polynomial regression are popular regression
algorithms.
• Logistic regression, despite its name, is typically used for
classification rather than for predicting continuous values.
Regression

• Regression is a supervised learning technique used to predict
continuous numerical values based on input features. It aims to
establish a functional relationship between independent variables
and a dependent variable, such as predicting house prices based on
features like size, bedrooms, and location.
• The goal is to minimize the difference between predicted and actual
values using algorithms like Linear Regression, Decision Trees, or
Neural Networks, ensuring the model captures underlying patterns in
the data.
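A minimal sketch of linear regression with a single feature, using the closed-form least-squares solution. The house sizes and prices are made-up values, and `fit_line` is a hypothetical helper name.

```python
def fit_line(xs, ys):
    """Ordinary least squares for one feature: returns (slope, intercept)."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    covariance = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    variance = sum((x - mean_x) ** 2 for x in xs)
    slope = covariance / variance
    intercept = mean_y - slope * mean_x
    return slope, intercept


# Hypothetical data: house size in square metres, price in thousands.
sizes = [50.0, 80.0, 100.0, 120.0]
prices = [150.0, 240.0, 300.0, 360.0]

slope, intercept = fit_line(sizes, prices)
predicted_price = slope * 90.0 + intercept  # prediction for a 90 m^2 house

print(round(slope, 2), round(intercept, 2))  # 3.0 0.0
print(round(predicted_price, 2))             # 270.0
```

The fitted line minimizes the sum of squared differences between predicted and actual prices, which is the goal described above.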
Data
• Data is the driving force of ML.
• Data comes in the form of words and numbers stored in tables, or as
the values of pixels and waveforms captured in images and audio files.
We store related data in datasets.
• For example, we might have a dataset of the following:
• Images of cats
• Housing prices
• Weather information
• Datasets are made up of individual examples that contain features and a
label.
• You could think of an example as analogous to a single row in a
spreadsheet.
• Features are the values that a supervised model uses to predict the label.
The label is the "answer," or the value we want the model to predict.
• In a weather model that predicts rainfall, the features could be latitude,
longitude, temperature, humidity, cloud coverage, wind direction, and
atmospheric pressure.
• The label would be rainfall amount.
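The terms above can be made concrete with a small sketch: each dictionary is one example, and one key is singled out as the label. The field names and values are assumptions for illustration only.

```python
# Each dictionary is one example (one "row"); all values are hypothetical.
weather_dataset = [
    {"latitude": 22.3, "longitude": 70.8, "temperature_c": 31.0,
     "humidity_pct": 78.0, "rainfall_mm": 12.5},
    {"latitude": 22.3, "longitude": 70.8, "temperature_c": 35.5,
     "humidity_pct": 40.0, "rainfall_mm": 0.0},
]

LABEL_NAME = "rainfall_mm"


def split_features_and_label(example, label_name=LABEL_NAME):
    """Separate one example into its feature dictionary and its label value."""
    features = {name: value for name, value in example.items() if name != label_name}
    return features, example[label_name]


features, label = split_features_and_label(weather_dataset[0])
print(sorted(features))  # ['humidity_pct', 'latitude', 'longitude', 'temperature_c']
print(label)             # 12.5
```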
Dataset Size and Diversity
• A dataset is characterized by its size and diversity. Size indicates the number of examples.
Diversity indicates the range those examples cover. Good datasets are both large and highly
diverse.

• Some datasets are both large and diverse. However, some datasets are large but have low
diversity, and some are small but highly diverse. In other words, a large dataset doesn’t
guarantee sufficient diversity, and a dataset that is highly diverse doesn't guarantee
sufficient examples.

• For instance, a dataset might contain 100 years' worth of data, but only for the month of July.
Using this dataset to predict rainfall in January would produce poor predictions. Conversely,
a dataset might cover only a few years but contain every month. This dataset might produce
poor predictions because it doesn't contain enough years to account for variability.
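A quick way to check both properties is to count the examples and see how they spread over the range the model must cover. The sketch below does this for the July-only case; the record layout and helper names are hypothetical.

```python
from collections import Counter


def summarize_coverage(examples):
    """Return the dataset size and how many examples fall in each calendar month."""
    per_month = Counter(example["month"] for example in examples)
    return len(examples), per_month


def missing_months(per_month):
    """List the calendar months (1-12) that have no examples at all."""
    return [month for month in range(1, 13) if per_month[month] == 0]


# Hypothetical dataset: 100 years of records, all taken in July (month 7).
july_only = [{"year": 1920 + i, "month": 7, "rainfall_mm": 0.0} for i in range(100)]

size, per_month = summarize_coverage(july_only)
print(size)                       # 100
print(missing_months(per_month))  # [1, 2, 3, 4, 5, 6, 8, 9, 10, 11, 12]
```

The size looks healthy, but eleven months have no examples, so the dataset is large with low diversity.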
Number of Features
• A dataset can also be characterized by the number of its features. For
example, some weather datasets might contain hundreds of features,
ranging from satellite imagery to cloud coverage values.
• Other datasets might contain only three or four features, like
humidity, atmospheric pressure, and temperature.
• Datasets with more features can help a model discover additional
patterns and make better predictions.
• However, datasets with more features don't always produce models
that make better predictions because some features might have no
causal relationship to the label.
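One crude way to screen features is to measure how strongly each one varies with the label. The sketch below uses the Pearson correlation coefficient on made-up values; `station_code` is a hypothetical feature with no bearing on rainfall. Correlation captures only linear association and does not establish a causal relationship, so treat this as a first-pass check rather than a selection method.

```python
import math


def pearson(xs, ys):
    """Pearson correlation coefficient between two equal-length sequences."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    return cov / math.sqrt(var_x * var_y)


# Hypothetical values for five examples.
rainfall_mm = [0.0, 2.0, 5.0, 9.0, 14.0]      # label
humidity_pct = [30.0, 45.0, 60.0, 75.0, 90.0]  # plausibly related feature
station_code = [2.0, 5.0, 1.0, 5.0, 2.0]       # arbitrary identifier

print(round(pearson(humidity_pct, rainfall_mm), 2))  # 0.99
print(round(pearson(station_code, rainfall_mm), 2))  # -0.05
```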
Understanding Datasets in the Context of Supervised Learning
Model Generation from Labeled Examples
• In supervised learning, a model is a collection of numbers that
defines the mathematical relationship from specific input feature
patterns to specific output label values. The model discovers these
patterns through training.

The model takes in a single labeled example and provides a prediction.

The model compares its predicted value with the actual value and updates its solution.

An ML model updating its predicted value.

• The model repeats this process for each labeled example in the dataset.
• In this way, the model gradually learns the correct relationship
between the features and the label.
• This gradual understanding is also why large and diverse datasets
produce a better model.
• The model has seen more data with a wider range of values and has
refined its understanding of the relationship between the features
and the label.
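The predict, compare, update cycle described above can be sketched with a one-weight model trained one example at a time. This is an illustration under assumptions: the model form y = w * x, the learning rate, the number of passes, and the data are all chosen for the example.

```python
def train_one_weight(examples, learning_rate=0.01, epochs=50):
    """Fit y ~ w * x by updating w after every single labeled example."""
    w = 0.0
    for _ in range(epochs):
        for x, y in examples:
            prediction = w * x      # the model predicts from the feature
            error = prediction - y  # compare the prediction with the label
            # Step against the gradient of half the squared error.
            w -= learning_rate * error * x
    return w


# Hypothetical labeled examples generated from y = 2x.
examples = [(1.0, 2.0), (2.0, 4.0), (3.0, 6.0), (4.0, 8.0)]

w = train_one_weight(examples)
print(round(w, 3))  # 2.0
```

Each pass over the examples moves `w` closer to 2, which is how the gradual learning of the feature-label relationship shows up in practice.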
Evaluating

• We evaluate a trained model to determine how well it learned. When we
evaluate a model, we use a labeled dataset, but we only give the model
the dataset's features.
• We then compare the model's predictions to the label's true values.
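A minimal sketch of this evaluation step: the model receives only the features, and its predictions are then compared with the true labels. The threshold rule standing in for a trained model and the held-out examples are hypothetical.

```python
def accuracy(model, labeled_examples):
    """Fraction of examples whose predicted label matches the true label."""
    correct = 0
    for features, true_label in labeled_examples:
        predicted = model(features)  # the model sees only the features
        if predicted == true_label:
            correct += 1
    return correct / len(labeled_examples)


def threshold_model(features):
    """Hypothetical trained model: predicts rain when humidity exceeds 70%."""
    return "rain" if features["humidity_pct"] > 70 else "dry"


# Hypothetical held-out examples that were not used for training.
test_set = [
    ({"humidity_pct": 85}, "rain"),
    ({"humidity_pct": 40}, "dry"),
    ({"humidity_pct": 75}, "dry"),
    ({"humidity_pct": 90}, "rain"),
]

print(accuracy(threshold_model, test_set))  # 0.75
```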
Advantages of Supervised Learning

• The power of supervised learning lies in its ability to accurately
predict patterns and make data-driven decisions across a variety of
applications.
• Labeled training data benefits supervised learning by enabling models to
accurately learn patterns and relationships between inputs and outputs.
• Supervised learning models can accurately predict and classify new data.
• Supervised learning has a wide range of applications, including classification,
regression, and even more complex problems like image recognition and
natural language processing.
• Well-established evaluation metrics, including accuracy, precision, recall, and
F1-score, facilitate the assessment of supervised learning model
performance.
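The metrics named above can be computed directly from counts of true positives, false positives, and false negatives. The sketch below assumes a binary spam task with made-up labels; the function name is hypothetical.

```python
def precision_recall_f1(true_labels, predicted_labels, positive="spam"):
    """Compute precision, recall, and F1-score for one positive class."""
    pairs = list(zip(true_labels, predicted_labels))
    tp = sum(1 for t, p in pairs if t == positive and p == positive)
    fp = sum(1 for t, p in pairs if t != positive and p == positive)
    fn = sum(1 for t, p in pairs if t == positive and p != positive)
    precision = tp / (tp + fp) if tp + fp else 0.0
    recall = tp / (tp + fn) if tp + fn else 0.0
    if precision + recall:
        f1 = 2 * precision * recall / (precision + recall)
    else:
        f1 = 0.0
    return precision, recall, f1


# Hypothetical true labels and model predictions for eight emails.
true_labels = ["spam", "spam", "spam", "spam",
               "not spam", "not spam", "not spam", "not spam"]
predicted = ["spam", "spam", "not spam", "not spam",
             "spam", "not spam", "not spam", "not spam"]

precision, recall, f1 = precision_recall_f1(true_labels, predicted)
print(round(precision, 3), round(recall, 3), round(f1, 3))  # 0.667 0.5 0.571
```

Precision asks how many predicted spam emails really were spam; recall asks how many real spam emails were caught; F1 is the harmonic mean of the two.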
Disadvantages of Supervised Learning

• Although supervised learning methods have benefits, their limitations
require careful consideration during problem formulation, data collection,
model selection, and evaluation.
• Overfitting: Models can overfit training data, which leads to poor
performance on new, unseen data due to the capture of noise.
• Feature Engineering: Extracting relevant features from raw data is crucial
for model performance, but this process can be time-consuming and may
require domain expertise.
• Bias in Models: Training data biases can lead to unfair predictions.
• Labeling Cost: Supervised learning depends heavily on labeled training data,
which can be costly and time-consuming to produce and may require domain
expertise.
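Overfitting can be seen in a small sketch that compares a model that memorizes its training data with a simpler rule. Everything here is an assumption made for illustration: the humidity values, the one deliberately noisy training label, and the hand-picked threshold rule, which is not learned from the data.

```python
def nearest_neighbor_model(train):
    """Return a model that copies the label of the closest training example."""
    def predict(x):
        return min(train, key=lambda example: abs(example[0] - x))[1]
    return predict


def threshold_model(x):
    """Simple rule assumed for illustration: rain when humidity is above 50%."""
    return "rain" if x > 50 else "dry"


def accuracy(model, examples):
    """Fraction of (feature, label) pairs the model labels correctly."""
    return sum(model(x) == label for x, label in examples) / len(examples)


# Hypothetical (humidity, label) pairs; (60, "dry") is a noisy training label.
train = [(10, "dry"), (20, "dry"), (30, "dry"), (40, "dry"),
         (60, "dry"), (70, "rain"), (80, "rain"), (90, "rain")]
test = [(15, "dry"), (35, "dry"), (58, "rain"),
        (62, "rain"), (75, "rain"), (85, "rain")]

memorizer = nearest_neighbor_model(train)
print(accuracy(memorizer, train), round(accuracy(memorizer, test), 3))  # 1.0 0.667
print(accuracy(threshold_model, train), accuracy(threshold_model, test))  # 0.875 1.0
```

The memorizing model is perfect on the training set because it reproduces the noisy label, and that same noise costs it accuracy on unseen data.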
