Regression in M.L

Regression analysis is a statistical method used to model relationships between variables. It allows prediction of continuous outcomes like sales, temperature, or salary based on independent variables. Linear regression finds a linear relationship between dependent and independent variables, while logistic regression handles classification problems with binary outcomes. Polynomial regression extends linear regression to model nonlinear relationships using polynomial transformations of independent variables.

Uploaded by

Saif Jutt

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

29 views13 pages

Regression in M.L

Uploaded by

Saif Jutt

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 13

Regression in Machine Learning

What is Regression?
• Regression analysis is a statistical method to model the relationship
between a dependent (target) and independent (predictor) variables
with one or more independent variables. More specifically,
Regression analysis helps us to understand how the value of the
dependent variable is changing corresponding to an independent
variable when other independent variables are held fixed. It predicts
continuous/real values such as temperature, age, salary, price, etc.
What is Regression? (cont.)
• Suppose there is a marketing company A, who does
various advertisement every year and get sales on
that. The below list shows the advertisement made
by the company in the last 5 years and the
corresponding sales.
• Now, the company wants to do the advertisement of
$200 in the year 2019 and wants to know the
prediction about the sales for this year. So to solve
such type of prediction problems in machine
learning, we need regression analysis.
What is Regression? (cont.)
• Regression is a supervised learning technique a which helps in finding
the correlation between variables and enables us to predict the
continuous output variable based on the one or more predictor
variables. It is mainly used for prediction, forecasting, time series
modeling, and determining the causal-effect relationship between
variables.
• In Regression, we plot a graph between the variables which best fits
the given datapoints, using this plot, the machine learning model can
make predictions about the data. In simple words, "Regression shows
a line or curve that passes through all the datapoints on target-
predictor graph in such a way that the vertical distance between the
datapoints and the regression line is minimum." The distance
between datapoints and line tells whether a model has captured a
strong relationship or not.
Terminologies Related to the Regression Analysis:
• Dependent Variable: The main factor in Regression analysis which we want to
predict or understand is called the dependent variable. It is also called target
variable.
• Independent Variable: The factors which affect the dependent variables or which
are used to predict the values of the dependent variables are called independent
variable, also called as a predictor.
• Outliers: Outlier is an observation which contains either very low value or very
high value in comparison to other observed values. An outlier may hamper the
result, so it should be avoided.
• Multicollinearity: If the independent variables are highly correlated with each
other than other variables, then such condition is called Multicollinearity. It
should not be present in the dataset, because it creates problem while ranking
the most affecting variable.
• Underfitting and Overfitting: If our algorithm works well with the training
dataset but not well with test dataset, then such problem is called Overfitting.
And if our algorithm does not perform well even with training dataset, then such
problem is called underfitting.
Types of Regression
Linear Regression:
• Linear regression is a statistical regression method which is used for
predictive analysis.
• It is one of the very simple and easy algorithms which works on regression
and shows the relationship between the continuous variables.
• It is used for solving the regression problem in machine learning.
• Linear regression shows the linear relationship between the independent
variable (X-axis) and the dependent variable (Y-axis), hence called linear
regression.
• If there is only one input variable (x), then such linear regression is
called simple linear regression. And if there is more than one input
variable, then such linear regression is called multiple linear regression.
• The relationship between variables in the linear regression model can be
explained using the below image. Here we are predicting the salary of an
employee on the basis of the year of experience.
Linear Regression: (cont.)
• Y= aX+b
• Here, Y = dependent variables (target variables)
• X= Independent variables (predictor variables)
• a and b are the linear coefficients
• In linear regression, coefficients are the values that multiply the predictor values.
Suppose you have the following regression equation: y = 3X + 5. In this equation, +3 is the
coefficient, X is the predictor, and +5 is the constant.
• Some popular applications of linear regression are:
• Analyzing trends and sales estimates
• Salary forecasting
• Real estate prediction
• Arriving at ETAs in traffic
Logistic Regression:
• Logistic regression is another supervised learning algorithm which is
used to solve the classification problems. In classification problems,
we have dependent variables in a binary or discrete format such as 0
or 1.
• Logistic regression algorithm works with the categorical variable such
as 0 or 1, Yes or No, True or False, Spam or not spam, etc.
• It is a predictive analysis algorithm which works on the concept of
probability.
• Logistic regression is a type of regression, but it is different from the
linear regression algorithm in the term how they are used.
Logistic Regression: (cont.)
• Logistic regression uses sigmoid function or logistic function which is
a complex cost function. This sigmoid function is used to model the
data in logistic regression. The function can be represented as:

• f(x)= Output between the 0 and 1 value.

• x= input to the function
• e= base of natural logarithm.
Logistic Regression: (cont.)
• When we provide the input values (data) to the function, it gives the
S-curve as follows:
• It uses the concept of threshold levels, values above the threshold
level are rounded up to 1, and values below the threshold level are
rounded up to 0.
• There are three types of logistic regression:
• Binary(0/1, pass/fail)
• Multi(cats, dogs, lions)
• Ordinal(low, medium, high)
Polynomial Regression:
• Polynomial Regression is a type of regression which models the non-
linear dataset using a linear model.
• It is similar to multiple linear regression, but it fits a non-linear curve
between the value of x and corresponding conditional values of y.
• Suppose there is a dataset which consists of datapoints which are
present in a non-linear fashion, so for such case, linear regression will
not best fit to those datapoints. To cover such datapoints, we need
Polynomial regression.
• In Polynomial regression, the original features are transformed into
polynomial features of given degree and then modeled using a
linear model. Which means the datapoints are best fitted using a
polynomial line
Polynomial Regression: (cont.)
• The equation for polynomial regression also
derived from linear regression equation that
means Linear regression equation Y= b0+
b1x, is transformed into Polynomial
regression equation Y= b0+b1x+ b2x2+
b3x3+.....+ bnxn.
• Here Y is the predicted/target output, b0,
b1,... bn are the regression coefficients. x is
our independent/input variable.
• The model is still linear as the coefficients
are still linear with quadratic.

FGGHHDH GDD HH
100% (3)
FGGHHDH GDD HH
433 pages
Unit - Iii Data Analysis
No ratings yet
Unit - Iii Data Analysis
39 pages
Unit 2
No ratings yet
Unit 2
67 pages
Regression Analysis in Machine Learning
No ratings yet
Regression Analysis in Machine Learning
26 pages
228w1f0065 ML
No ratings yet
228w1f0065 ML
15 pages
4 ML
No ratings yet
4 ML
41 pages
Machine Learning: Bilal Khan
100% (2)
Machine Learning: Bilal Khan
20 pages
TYPES OF SUPERVISED LEARNING2
No ratings yet
TYPES OF SUPERVISED LEARNING2
66 pages
Unit 2 Notes - Final
No ratings yet
Unit 2 Notes - Final
32 pages
Unit - 2 MLA
No ratings yet
Unit - 2 MLA
57 pages
6 Regression Analysis
No ratings yet
6 Regression Analysis
12 pages
Regression: UNIT - V Regression Model
100% (1)
Regression: UNIT - V Regression Model
21 pages
Unit-Iii-1 1
No ratings yet
Unit-Iii-1 1
31 pages
DA Unit-3
No ratings yet
DA Unit-3
13 pages
L4a - Supervised Learning
No ratings yet
L4a - Supervised Learning
25 pages
Regression Analysis in Machine Learning
No ratings yet
Regression Analysis in Machine Learning
9 pages
DMML Unit4
No ratings yet
DMML Unit4
77 pages
Unit 2
No ratings yet
Unit 2
19 pages
DOC-20240831-WA0023.
No ratings yet
DOC-20240831-WA0023.
22 pages
Regression Modelling
No ratings yet
Regression Modelling
25 pages
5.REGRESSION-1
No ratings yet
5.REGRESSION-1
46 pages
MLT Unit 2 Linear Regression
No ratings yet
MLT Unit 2 Linear Regression
26 pages
Lecture 2
No ratings yet
Lecture 2
17 pages
Regression Analysis in Machine Learning
No ratings yet
Regression Analysis in Machine Learning
12 pages
Regression: Unit Iii
No ratings yet
Regression: Unit Iii
54 pages
Notes 2
No ratings yet
Notes 2
22 pages
ML unit-2 half
No ratings yet
ML unit-2 half
16 pages
Unit 2 Topic 1 REGRESSION
No ratings yet
Unit 2 Topic 1 REGRESSION
19 pages
Supervised Learning
No ratings yet
Supervised Learning
24 pages
Regression
No ratings yet
Regression
11 pages
Regression Unit-2
No ratings yet
Regression Unit-2
5 pages
Unit - II_DA
No ratings yet
Unit - II_DA
22 pages
2.1 Regression Analysis
No ratings yet
2.1 Regression Analysis
28 pages
ML_7th_Sem_AIML_ITE_Notes_Complete_LONG[1]-34-62
No ratings yet
ML_7th_Sem_AIML_ITE_Notes_Complete_LONG[1]-34-62
29 pages
TSEGIII AI
No ratings yet
TSEGIII AI
6 pages
Unit 2linear Regression Bayesian Learning
No ratings yet
Unit 2linear Regression Bayesian Learning
12 pages
UNIT 2-3 - Notes - unit-2-3-notes
No ratings yet
UNIT 2-3 - Notes - unit-2-3-notes
16 pages
Data Science
No ratings yet
Data Science
5 pages
AAI Lecture 10 Sp 25
No ratings yet
AAI Lecture 10 Sp 25
37 pages
Linear and Logistic Regression
No ratings yet
Linear and Logistic Regression
21 pages
unit-2-3-notes
No ratings yet
unit-2-3-notes
16 pages
Unit - 3 Machine Learning
No ratings yet
Unit - 3 Machine Learning
30 pages
Fai Module 3
No ratings yet
Fai Module 3
67 pages
unit-3 part 2 DA
No ratings yet
unit-3 part 2 DA
20 pages
Linear Regression
No ratings yet
Linear Regression
7 pages
Lecture Material 11
No ratings yet
Lecture Material 11
14 pages
UNIT-2 ML
No ratings yet
UNIT-2 ML
39 pages
ARTIFICIAL INTELLIGENCE LEC 4
No ratings yet
ARTIFICIAL INTELLIGENCE LEC 4
13 pages
Regression
No ratings yet
Regression
4 pages
ML Unit-2 Final
No ratings yet
ML Unit-2 Final
32 pages
Unit 2 - NOTES1 - ML
No ratings yet
Unit 2 - NOTES1 - ML
35 pages
MLT Unit 2
No ratings yet
MLT Unit 2
53 pages
Machine Learning Class Slide
No ratings yet
Machine Learning Class Slide
44 pages
Unit-4 DS Student
No ratings yet
Unit-4 DS Student
43 pages
LECTURE Regression
No ratings yet
LECTURE Regression
12 pages
What Are Linear Models in Machine Learning[1].Docx (Unit3 Ml)
No ratings yet
What Are Linear Models in Machine Learning[1].Docx (Unit3 Ml)
60 pages
5_AML Lecture 5_Linear regression
No ratings yet
5_AML Lecture 5_Linear regression
56 pages
B.Tech_5thSem_KCS055_Unit 2_1
No ratings yet
B.Tech_5thSem_KCS055_Unit 2_1
4 pages
Ch-2 Supervised Machine Learning
No ratings yet
Ch-2 Supervised Machine Learning
48 pages
DA2
No ratings yet
DA2
12 pages
Correlation and Regression: Six Sigma Thinking, #8
From Everand
Correlation and Regression: Six Sigma Thinking, #8
Sumeet Savant
5/5 (1)
Grounding in Running Shoes On Indices of Performance in Elite Competitive Athletes
No ratings yet
Grounding in Running Shoes On Indices of Performance in Elite Competitive Athletes
10 pages
PR 2 Exam 1st - 050459
No ratings yet
PR 2 Exam 1st - 050459
8 pages
FCH_IME672A_JAN_2018
No ratings yet
FCH_IME672A_JAN_2018
2 pages
Resident Perception and Level of Acceptance On Curfew Hours in GSC
No ratings yet
Resident Perception and Level of Acceptance On Curfew Hours in GSC
23 pages
Mining Equipment Maintenance PDF
100% (4)
Mining Equipment Maintenance PDF
93 pages
Tracy Report
No ratings yet
Tracy Report
8 pages
Practical Research 2
No ratings yet
Practical Research 2
6 pages
7 Pillars of Stat Wisdom PDF
No ratings yet
7 Pillars of Stat Wisdom PDF
38 pages
Business Statistics
50% (4)
Business Statistics
500 pages
Assessmentof Overlay Roughnessin Long Term Pavement Performance Test Sites
No ratings yet
Assessmentof Overlay Roughnessin Long Term Pavement Performance Test Sites
11 pages
Pengaruh Motivasi Terhadap Kinerja Karyawan Dengan Budaya Organisasi Sebagai Variabel Moderating (Studi Pada Pt. Randugarut Plastic Indonesia)
No ratings yet
Pengaruh Motivasi Terhadap Kinerja Karyawan Dengan Budaya Organisasi Sebagai Variabel Moderating (Studi Pada Pt. Randugarut Plastic Indonesia)
11 pages
MATHE13 M Lesson 3 PDF
No ratings yet
MATHE13 M Lesson 3 PDF
22 pages
MVP 2
No ratings yet
MVP 2
2 pages
A Confirmatory Factor Analysis of The End-User Computing Satisfaction Instrument
No ratings yet
A Confirmatory Factor Analysis of The End-User Computing Satisfaction Instrument
10 pages
Walter R. Gilks, Sylvia Richardson (Auth.), Walter R. Gilks, Sylvia Richardson, David J. Spiegelhalter (Eds.) - Markov Chain Monte Carlo in Practice-Springer US (1996)
No ratings yet
Walter R. Gilks, Sylvia Richardson (Auth.), Walter R. Gilks, Sylvia Richardson, David J. Spiegelhalter (Eds.) - Markov Chain Monte Carlo in Practice-Springer US (1996)
487 pages
Journal of Criminal Justice: John L. Worrall, Robert G. Morris
No ratings yet
Journal of Criminal Justice: John L. Worrall, Robert G. Morris
8 pages
Case Study - What is a Data Strategy Group 2 analysis
No ratings yet
Case Study - What is a Data Strategy Group 2 analysis
7 pages
Reducing Aht - Business Credit Services: By: Subhash Mandal
No ratings yet
Reducing Aht - Business Credit Services: By: Subhash Mandal
46 pages
Statistical Methods in Experimental Chemistry
100% (1)
Statistical Methods in Experimental Chemistry
103 pages
Cost_Benefit_Analysis_of_Residential_Sprinklers_-_Final_Report_March_2012_5
No ratings yet
Cost_Benefit_Analysis_of_Residential_Sprinklers_-_Final_Report_March_2012_5
127 pages
Template
No ratings yet
Template
21 pages
WGU C784 – APPLIED HEALTHCARE STATISTICS PRE ASSESSMENT TEST EXAM QUESTIONS AND VERIFIED ANSWERS GRADED A+ 2024 UPDATE
No ratings yet
WGU C784 – APPLIED HEALTHCARE STATISTICS PRE ASSESSMENT TEST EXAM QUESTIONS AND VERIFIED ANSWERS GRADED A+ 2024 UPDATE
17 pages
Famous Statisticians Statistics and Probability: Submitted By: Nabil A. Uy Submitted To: Jamil A. Silongan
No ratings yet
Famous Statisticians Statistics and Probability: Submitted By: Nabil A. Uy Submitted To: Jamil A. Silongan
9 pages
Chemistry 531 The One-Dimensional Random Walk
No ratings yet
Chemistry 531 The One-Dimensional Random Walk
11 pages
scom program - jmsb
No ratings yet
scom program - jmsb
5 pages
Unity University Department of Computer Science: Simulation of Inventory Management System
No ratings yet
Unity University Department of Computer Science: Simulation of Inventory Management System
7 pages
Labuyo
No ratings yet
Labuyo
19 pages
Central Limit Theorem
No ratings yet
Central Limit Theorem
3 pages
Econometrics in STAN
No ratings yet
Econometrics in STAN
39 pages

Regression in M.L

Uploaded by

Regression in M.L

Uploaded by

Regression in Machine Learning

• f(x)= Output between the 0 and 1 value.

You might also like