
The way we do this is by taking the derivative of our cost function. The slope of the tangent line at a given point
is the derivative of the cost function at that point, and it gives us a direction to move in. We take steps down the
cost function in the direction of steepest descent, and the size of each step is determined by the parameter
α, which is called the learning rate.

The gradient descent algorithm is:

repeat until convergence:

θj := θj − α · (∂/∂θj) J(θ0, θ1)

where

j=0,1 represents the feature index number.

Intuitively, this could be thought of as:

repeat until convergence:

θj := θj − α · [slope of the tangent, i.e. the derivative of J in the θj direction]
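
To make the generic update rule concrete, here is a minimal Python sketch of a few gradient-descent steps; the cost function J, the starting point, the learning rate, and the numerical approximation of the partial derivatives are all illustrative assumptions, not part of the course material:

```python
# Minimal sketch of generic gradient descent on J(theta0, theta1).
# J, the starting point, alpha, and eps are illustrative assumptions;
# the partial derivatives are approximated numerically for simplicity.

def J(theta0, theta1):
    # Example cost surface (a simple bowl with its minimum at (3, -1)).
    return (theta0 - 3.0) ** 2 + (theta1 + 1.0) ** 2

def step(theta0, theta1, alpha=0.1, eps=1e-6):
    # Slopes of the tangent in each parameter direction (central differences).
    d0 = (J(theta0 + eps, theta1) - J(theta0 - eps, theta1)) / (2 * eps)
    d1 = (J(theta0, theta1 + eps) - J(theta0, theta1 - eps)) / (2 * eps)
    # Simultaneous update: both new values are computed from the old ones.
    return theta0 - alpha * d0, theta1 - alpha * d1

theta0, theta1 = 0.0, 0.0
for _ in range(200):           # stand-in for "repeat until convergence"
    theta0, theta1 = step(theta0, theta1)
print(theta0, theta1)          # approaches the minimum at (3, -1)
```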

Gradient Descent for Linear Regression

When specifically applied to the case of linear regression, a new form of the gradient descent equation can be
derived. We can substitute our actual cost function and our actual hypothesis function and modify the equation to
(the derivation of the formulas is out of the scope of this course, but a really great one can be found here):

repeat until convergence: {

    θ0 := θ0 − α · (1/m) · Σ_{i=1}^{m} (h_θ(xᵢ) − yᵢ)

    θ1 := θ1 − α · (1/m) · Σ_{i=1}^{m} ((h_θ(xᵢ) − yᵢ) · xᵢ)

}

where m is the size of the training set, θ0 is a constant that is updated simultaneously with θ1, and xᵢ, yᵢ are
values of the given training set (data).

Note that we have separated out the two cases for θj into separate equations for θ0 and θ1, and that for θ1 we
are multiplying by xᵢ at the end due to the derivative.

The point of all this is that if we start with a guess for our hypothesis and then repeatedly apply these gradient
descent equations, our hypothesis will become more and more accurate.
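
As a rough illustration of that loop, here is a short Python sketch of the two update equations above applied to a tiny made-up data set; the data, the learning rate, and the iteration count are assumptions chosen for the example rather than values from the course:

```python
# Batch gradient descent for linear regression with h_theta(x) = theta0 + theta1 * x.
# The data set, alpha, and num_iters are illustrative assumptions.

x = [1.0, 2.0, 3.0, 4.0]       # training inputs x_i
y = [3.0, 5.0, 7.0, 9.0]       # training targets y_i (generated from y = 1 + 2x)
m = len(x)                     # size of the training set

theta0, theta1 = 0.0, 0.0      # initial guess for the hypothesis
alpha = 0.05                   # learning rate
num_iters = 2000               # stand-in for "repeat until convergence"

for _ in range(num_iters):
    # Errors of the current hypothesis on every training example.
    errors = [(theta0 + theta1 * x[i]) - y[i] for i in range(m)]
    # Average error, and average error weighted by x_i (the two derivatives).
    grad0 = sum(errors) / m
    grad1 = sum(errors[i] * x[i] for i in range(m)) / m
    # Simultaneous update of theta0 and theta1.
    theta0, theta1 = theta0 - alpha * grad0, theta1 - alpha * grad1

print(theta0, theta1)          # converges toward theta0 ≈ 1.0, theta1 ≈ 2.0
```

Because both parameters are recomputed from the same set of errors before either is overwritten, the updates are simultaneous, exactly as the equations require.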

