Gradient Descent
• Imagine a game in which the players start at the top of a mountain
and are asked to reach the lowest point of the mountain.
Additionally, they are blindfolded.
• The best strategy is to feel the ground and find the direction in which
the land descends. From that position, take a step in the descending
direction and repeat this process until the lowest point is reached.
• Gradient Descent is an optimization algorithm used to minimize a
cost function. It does so by iteratively updating the parameters of the
learning model.
To find a local minimum of a function using gradient descent, we take steps proportional
to the negative of the gradient (moving against the gradient) of the function at the current point.
If we instead take steps proportional to the positive of the gradient (moving along the gradient), we
approach a local maximum of the function; that procedure is called Gradient Ascent.
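As a concrete illustration, consider f(x) = x^2, whose gradient is f'(x) = 2x. The sketch below (the function, starting point, and step size are assumptions chosen only for illustration) shows one descent step and one ascent step from the same point:

# One step of gradient descent vs. gradient ascent on f(x) = x**2
def grad(x):            # derivative of f(x) = x**2
    return 2 * x

x = 3.0                 # current point (illustrative assumption)
alpha = 0.1             # step size (illustrative assumption)

x_descent = x - alpha * grad(x)   # step against the gradient -> 2.4, closer to the minimum at 0
x_ascent  = x + alpha * grad(x)   # step along the gradient   -> 3.6, farther from the minimum

print(x_descent, x_ascent)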
• The goal of the gradient descent algorithm is to minimize a
given function (say, a cost function). To achieve this goal, it
performs two steps iteratively:
1. Compute the gradient (slope), the first-order derivative of
the function at the current point
2. Take a step (move) in the direction opposite to the gradient,
i.e. move from the current point by alpha times the negative
of the gradient at that point
Alpha is called the learning rate, a tuning parameter in the
optimization process. It decides the length of the steps (see the sketch below).
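A minimal sketch of these two steps in Python, assuming the simple cost function J(theta) = theta^2 (this function, the starting point, and the value of alpha are illustrative assumptions, not fixed by the algorithm):

# Gradient descent loop: repeat (1) compute gradient, (2) step opposite to it
def gradient(theta):            # first-order derivative of J(theta) = theta**2
    return 2 * theta

theta = 5.0                     # initial parameter value (assumption)
alpha = 0.1                     # learning rate (assumption)

for _ in range(100):
    g = gradient(theta)         # step 1: compute the gradient at the current point
    theta = theta - alpha * g   # step 2: move opposite to the gradient by alpha * gradient

print(theta)                    # ends up very close to 0, the minimizer of J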
Alpha – The Learning Rate
• We have the direction we want to move in; now we must decide the
size of the step we take.
• If the learning rate is too small, training may take too long to
converge; if it is too large, the updates may overshoot the minimum.
Nature of Learning rate
a) Learning rate is optimal: the model converges to the minimum
b) Learning rate is too small: it takes more time but converges to the
minimum
c) Learning rate is higher than the optimal value: it overshoots but
still converges (1/C < η < 2/C)
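These behaviours can be seen by running the same update with different learning rates. The sketch below again assumes J(theta) = theta^2, for which the curvature constant is C = 2, so the overshooting-but-converging range is 1/2 < η < 1; the specific η values and iteration count are illustrative assumptions:

# Effect of the learning rate eta on J(theta) = theta**2 (curvature C = 2)
def gradient(theta):
    return 2 * theta

for eta in (0.05, 0.4, 0.75):      # too small, near-optimal, overshooting but still below 2/C = 1
    theta = 5.0                    # same starting point for each run
    for _ in range(20):
        theta = theta - eta * gradient(theta)
    print(eta, theta)              # the smallest eta is still noticeably far from 0 after 20 steps;
                                   # the larger values are already very close to the minimum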