
Lecture 10: descent methods

- Generic descent algorithm
- Generalization to multiple dimensions
- Problems of descent methods, possible improvements
- Fixes
- Local minima

Gradient descent (reminder)


The minimum of a function is found by following the slope of the function.

Gradient descent (illustration)

[Figure: starting from a guess on the curve f(x), each iteration takes a step downhill along the new gradient; the steps stop near the minimum m, where the function value is f(m).]

Gradient descent: algorithm

Start with a point (the guess: x)
Repeat
    Determine a descent direction (downhill: direction = -f'(x))
    Choose a step (step size: h > 0)
    Update (now you are here: x := x - h f'(x))
Until stopping criterion is satisfied (stop when close to the minimum: f'(x) ≈ 0)
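
A minimal MATLAB sketch of this loop (the quadratic objective, the fixed step size h, and the tolerance are illustrative assumptions, not taken from the lecture):

    f  = @(x) (x - 2).^2;        % example objective (assumed)
    df = @(x) 2*(x - 2);         % its derivative f'(x)
    x  = 0;                      % initial guess
    h  = 0.1;                    % fixed step size (assumed)
    while abs(df(x)) > 1e-6      % stopping criterion: f'(x) ~ 0
        x = x - h*df(x);         % update: move downhill
    end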

Example of 2D gradient: MATLAB demo

[Figure: illustration of the gradient in 2D.]

Definition of the gradient in 2D:

    ∇f(x, y) = ( ∂f/∂x , ∂f/∂y )

This is just a generalization of the derivative to two dimensions, and it can be generalized to any dimension.

[Figure: gradient descent works in 2D.]

Generalization to multiple dimensions

Start with a point (the guess: x)
Repeat
    Determine a descent direction (downhill: direction = -∇f(x))
    Choose a step (step size: h > 0)
    Update (now you are here: x := x - h ∇f(x))
Until stopping criterion is satisfied (stop when close to the minimum: ∇f(x) ≈ 0)

Multiple dimensions

Everything that you have seen with derivatives can be generalized with the gradient.

For the descent method, f'(x) can be replaced by

    ∇f(x) = ( ∂f/∂x1 , ∂f/∂x2 )

in two dimensions, and by

    ∇f(x) = ( ∂f/∂x1 , ... , ∂f/∂xN )

in N dimensions.
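
The same loop in MATLAB for multiple dimensions (the two-dimensional quadratic objective and its gradient are assumed for illustration):

    f     = @(x) x(1)^2 + 10*x(2)^2;     % example objective (assumed)
    gradf = @(x) [2*x(1); 20*x(2)];      % its gradient
    x = [1; 1];                          % initial guess
    h = 0.05;                            % fixed step size (assumed)
    while norm(gradf(x)) > 1e-6          % stopping criterion: gradient ~ 0
        x = x - h*gradf(x);              % update: step along the negative gradient
    end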

Example of 2D gradient: MATLAB demo

The cost to buy a portfolio is a function of how much you hold of each stock (Stock 1, Stock 2, ..., Stock i, ..., Stock N). If you want to minimize the price to buy your portfolio, you need to compute the gradient of its price:

    ∇f(x) = ( ∂f/∂x1 , ... , ∂f/∂xN )
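
A hedged sketch of this example (the lecture's actual cost model is not reproduced here; a simple quadratic cost over two stock holdings is assumed):

    price = @(x) 3*x(1)^2 + 2*x(2)^2 + x(1)*x(2);   % assumed portfolio cost model
    gradp = @(x) [6*x(1) + x(2); 4*x(2) + x(1)];    % gradient of the price
    % descend exactly as before: x := x - h*gradp(x)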

Problem 1: choice of the step

When updating the current computation:
- small steps: inefficient; too many steps, so it takes too long to converge
- large steps: potentially bad results; the next point can overshoot and land far past the minimum

[Figure: with small steps the iterates crawl slowly from the guess toward m; with a large step the next point goes too far beyond the minimum f(m).]
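
A quick numerical illustration of the trade-off (the objective f(x) = x^2 and both step sizes are assumptions): with this objective any step h > 1 makes the iterates diverge, while a tiny h converges very slowly.

    df = @(x) 2*x;                              % derivative of f(x) = x^2 (assumed)
    x_small = 1; x_large = 1;
    for k = 1:20
        x_small = x_small - 0.01*df(x_small);   % small step: after 20 steps still ~0.67
        x_large = x_large - 1.1*df(x_large);    % large step: |x| grows by 1.2x each step
    end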

Problem 2: ping pong effect

[Figure: on an elongated, ill-conditioned objective, the iterates bounce back and forth across the valley instead of moving along it toward the minimum.]

[S. Boyd, L. Vandenberghe, Convex Optimization lecture notes, Stanford Univ., 2004]

Problem 2: other norm-dependent issues

[Figure: the steepest-descent direction depends on the norm used to measure step length; a poorly chosen norm slows convergence.]

[S. Boyd, L. Vandenberghe, Convex Optimization lecture notes, Stanford Univ., 2004]
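
To see the ping pong effect numerically (the ill-conditioned quadratic and the exact line search formula for quadratics are illustrative assumptions):

    Q = [1 0; 0 25];               % ill-conditioned quadratic: f(x) = 0.5*x'*Q*x
    x = [5; 1];                    % initial guess
    for k = 1:10
        g = Q*x;                   % gradient of f
        t = (g'*g) / (g'*Q*g);     % exact line search step size for a quadratic
        x = x - t*g;               % iterates zigzag across the narrow valley
    end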

Problem 3: stopping criterion

Intuitive criterion:

    |f'(x)| <= epsilon

In multiple dimensions:

    ||∇f(x)|| <= epsilon

Or equivalently:

    ||∇f(x)||^2 <= epsilon^2

Rarely used in practice. More about this in EE227A (convex optimization, Prof. L. El Ghaoui).

Fixes

Several methods exist to address these problems, as sketched below:
- Line search methods, in particular:
  - Backtracking line search
  - Exact line search
- Normalized steepest descent
- Newton steps
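
A minimal backtracking line search sketch (the parameters alpha and beta are typical textbook choices, assumed here rather than taken from the lecture):

    % given the current point x, objective f, and gradient g = gradf(x)
    alpha = 0.3; beta = 0.8;                    % assumed backtracking parameters
    t = 1;                                      % start from a full step
    while f(x - t*g) > f(x) - alpha*t*(g'*g)    % insufficient decrease so far
        t = beta*t;                             % shrink the step and retry
    end
    x = x - t*g;                                % take the accepted step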

Fundamental problem of the method: local minima

Local minima: MATLAB demo

[Figure: the iterations of the algorithm converge to a local minimum rather than the global one.]

The view of the algorithm is myopic: it only sees the local slope of the function, so it cannot distinguish a local minimum from the global minimum.
