SUPERVISED LEARNING
WHY IS IT IMPORTANT?
Supervised learning gives the algorithm experience that can be used to output predictions for new, unseen data.
This experience also helps in optimizing the performance of the algorithm.
USES OF REGRESSION
Determining the strength of predictors (the strength of the effect that the independent variables have on the dependent variable)
Forecasting an effect
Trend forecasting
LINEAR REGRESSION
R-SQUARED VALUE
The R-squared value is a statistical measure of how close the data are to the fitted regression line.
GOODNESS OF FIT
When the R-squared value is equal to 1, all of the actual values lie exactly on the regression line.
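As a minimal sketch, R-squared can be computed by hand for a least-squares line; the data below are made up purely for illustration:

    import numpy as np

    # Made-up data, purely for illustration
    x = np.array([1, 2, 3, 4, 5], dtype=float)
    y = np.array([2, 4, 5, 4, 5], dtype=float)

    # Fit a straight line y = m*x + c by least squares
    m, c = np.polyfit(x, y, 1)
    y_pred = m * x + c

    # R-squared = 1 - (residual sum of squares / total sum of squares)
    ss_res = np.sum((y - y_pred) ** 2)
    ss_tot = np.sum((y - y.mean()) ** 2)
    r_squared = 1 - ss_res / ss_tot
    print(r_squared)  # a value of 1 would mean every point lies on the line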
GRADIENT DESCENT
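A minimal sketch of gradient descent applied to simple linear regression, minimizing the mean squared error; the learning rate, iteration count, and data are illustrative choices:

    import numpy as np

    # Made-up data, purely for illustration
    x = np.array([1, 2, 3, 4, 5], dtype=float)
    y = np.array([2, 4, 5, 4, 5], dtype=float)

    m, c = 0.0, 0.0   # initial slope and intercept
    lr = 0.01         # learning rate (illustrative choice)
    for _ in range(5000):
        y_pred = m * x + c
        error = y_pred - y
        # Gradients of the mean squared error with respect to m and c
        dm = 2 * np.mean(error * x)
        dc = 2 * np.mean(error)
        # Step in the direction opposite to the gradient
        m -= lr * dm
        c -= lr * dc

    print(m, c)  # approaches the least-squares slope and intercept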
EXAMPLE
For the slope of the least-squares line: m = Σ(x - x̄)(y - ȳ) / Σ(x - x̄)²
CODE
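A minimal sketch of how the slope formula above can be turned into code, together with the intercept c = ȳ - m·x̄; the data and the new input value are illustrative assumptions:

    import numpy as np

    # Made-up data, purely for illustration
    x = np.array([1, 2, 3, 4, 5], dtype=float)
    y = np.array([2, 4, 5, 4, 5], dtype=float)

    # Slope from the least-squares formula, intercept from c = mean(y) - m * mean(x)
    m = np.sum((x - x.mean()) * (y - y.mean())) / np.sum((x - x.mean()) ** 2)
    c = y.mean() - m * x.mean()

    # Predict for a new, unseen value of x
    x_new = 6.0
    print(m * x_new + c)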
LOGISTIC REGRESSION
Logistic regression produces results in a binary format and is used to predict the outcome of a categorical dependent variable. The outcome should therefore be discrete/categorical, such as 0 or 1, yes or no, or true or false.
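Concretely, the model passes a linear combination of the inputs through the sigmoid (logistic) function and thresholds the resulting probability to obtain a discrete class; a minimal sketch with an illustrative value:

    import numpy as np

    def sigmoid(z):
        # Maps any real number into the range (0, 1)
        return 1.0 / (1.0 + np.exp(-z))

    z = 0.8  # illustrative linear combination of inputs and weights
    probability = sigmoid(z)
    label = 1 if probability >= 0.5 else 0  # threshold the probability to get a binary class
    print(probability, label)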
CLASSIFICATION
Classification is the process of categorizing a given set of data into classes. It can be performed on both structured and unstructured data.
The process starts with predicting the class of given data points. The classes are often referred to as targets, labels, or categories.
CLASSIFICATION TERMINOLOGIES
CLASSIFICATION ALGORITHMS
In machine learning, classification is a supervised learning task that categorizes a set of data into classes.
LOGISTIC REGRESSION
It is a classification algorithm in machine learning that uses one or more independent variables to determine an outcome, and that outcome has only two possible values.
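A minimal scikit-learn sketch, assuming a tiny made-up binary data set:

    from sklearn.linear_model import LogisticRegression

    # Made-up data: one feature, binary labels
    X = [[1.0], [2.0], [3.0], [4.0], [5.0], [6.0]]
    y = [0, 0, 0, 1, 1, 1]

    clf = LogisticRegression(max_iter=1000)
    clf.fit(X, y)
    print(clf.predict([[1.5], [4.5]]))   # predicted classes
    print(clf.predict_proba([[4.5]]))    # class probabilities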
K-NEAREST NEIGHBOR
It is a lazy learning algorithm that stores all instances corresponding to the training data in n-dimensional space.
Rather than constructing a general internal model, it simply stores the training instances and classifies a new point from the classes of its nearest stored neighbors.
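A minimal scikit-learn sketch of k-nearest neighbor classification, again with made-up data (the value k = 3 is an illustrative choice):

    from sklearn.neighbors import KNeighborsClassifier

    # Made-up training instances and labels
    X = [[1.0], [2.0], [3.0], [4.0], [5.0], [6.0]]
    y = [0, 0, 0, 1, 1, 1]

    knn = KNeighborsClassifier(n_neighbors=3)
    knn.fit(X, y)  # "training" essentially just stores the instances
    print(knn.predict([[2.2], [5.1]]))  # majority class among the 3 nearest neighbors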
DECISION TREE
The decision tree algorithm builds the classification model in the form of a tree
structure.
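A minimal scikit-learn sketch of a decision tree classifier on made-up data (max_depth = 2 is an illustrative choice):

    from sklearn.tree import DecisionTreeClassifier

    # Made-up training data and labels
    X = [[1.0], [2.0], [3.0], [4.0], [5.0], [6.0]]
    y = [0, 0, 0, 1, 1, 1]

    tree = DecisionTreeClassifier(max_depth=2)
    tree.fit(X, y)  # learns a tree of if/else splits on the feature
    print(tree.predict([[2.5], [5.5]]))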
RANDOM FOREST
Random decision trees, or random forests, are an ensemble learning method for classification, regression, and other tasks.
The method operates by constructing a multitude of decision trees at training time and outputs the class that is the mode of the individual trees' classes (classification) or their mean prediction (regression).
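A minimal scikit-learn sketch of a random forest classifier; the number of trees and the data are illustrative assumptions:

    from sklearn.ensemble import RandomForestClassifier

    # Made-up training data and labels
    X = [[1.0], [2.0], [3.0], [4.0], [5.0], [6.0]]
    y = [0, 0, 0, 1, 1, 1]

    # Each tree is trained on a bootstrap sample; class predictions are combined by majority vote
    forest = RandomForestClassifier(n_estimators=100, random_state=0)
    forest.fit(X, y)
    print(forest.predict([[2.5], [5.5]]))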
CLASSIFIER EVALUATION
HOLDOUT METHOD
This is the most common method to evaluate a classifier. In this method, the given data set is divided into two parts, a training set and a test set, typically 80% and 20% respectively.
The training set is used to train the model, and the unseen test set is used to test its predictive power.
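A minimal scikit-learn sketch of the holdout method, assuming the built-in Iris data set as an illustrative example:

    from sklearn.datasets import load_iris
    from sklearn.linear_model import LogisticRegression
    from sklearn.model_selection import train_test_split

    X, y = load_iris(return_X_y=True)

    # Hold out 20% of the data for testing; train on the remaining 80%
    X_train, X_test, y_train, y_test = train_test_split(X, y, test_size=0.2, random_state=0)

    clf = LogisticRegression(max_iter=1000)
    clf.fit(X_train, y_train)
    print(clf.score(X_test, y_test))  # accuracy on the unseen test set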
CROSS-VALIDATION
Overfitting is one of the most common problems in machine learning models. K-fold cross-validation can be conducted to check whether the model is overfitted.
In this method, the data set is randomly partitioned into k mutually exclusive subsets of equal size. One subset is kept for testing while the others are used to train the model, and the same process is repeated for all k folds.
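A minimal scikit-learn sketch of k-fold cross-validation (k = 5 and the Iris data set are illustrative choices):

    from sklearn.datasets import load_iris
    from sklearn.linear_model import LogisticRegression
    from sklearn.model_selection import cross_val_score

    X, y = load_iris(return_X_y=True)
    clf = LogisticRegression(max_iter=1000)

    # 5-fold cross-validation: each fold is held out once while the other folds train the model
    scores = cross_val_score(clf, X, y, cv=5)
    print(scores, scores.mean())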
CLASSIFICATION REPORT
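In scikit-learn, a classification report summarizes precision, recall, F1-score, and support for each class. A minimal sketch with made-up true and predicted labels:

    from sklearn.metrics import classification_report

    y_true = [0, 0, 1, 1, 1, 0, 1, 0]  # made-up ground-truth labels
    y_pred = [0, 1, 1, 1, 0, 0, 1, 0]  # made-up model predictions
    print(classification_report(y_true, y_pred))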
ROC CURVE
The receiver operating characteristic (ROC) curve is used for visual comparison of classification models; it shows the relationship between the true positive rate and the false positive rate. The area under the ROC curve is a measure of the model's accuracy.
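A minimal scikit-learn sketch that computes the points of the ROC curve and the area under it, using made-up labels and scores:

    from sklearn.metrics import roc_curve, roc_auc_score

    y_true = [0, 0, 1, 1, 1, 0, 1, 0]                    # made-up ground-truth labels
    y_score = [0.1, 0.4, 0.8, 0.7, 0.3, 0.2, 0.9, 0.6]   # made-up predicted probabilities

    fpr, tpr, thresholds = roc_curve(y_true, y_score)    # points on the ROC curve
    print(roc_auc_score(y_true, y_score))                # area under the curve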
ALGORITHM SELECTION