Newton's method uses both first and second derivatives to find the minimum of a function and generally performs better than steepest descent, which uses only first derivatives. It works by constructing a quadratic approximation of the function around the current point and finding the minimizer of that approximation; this minimizer becomes the next iterate. The method converges rapidly if started close to the solution, but it may not be a descent method and can fail to converge if the Hessian is not positive definite. The Levenberg-Marquardt modification adds a damping parameter so that the search direction always points in a descent direction. Newton's method can also be applied to nonlinear least-squares problems.


Chapter 9 Newton’s Method

An Introduction to Optimization
Spring, 2014

Wei-Ta Chu

1
Introduction
The steepest descent method uses only first derivatives in
selecting a suitable search direction.
Newton’s method (sometimes called Newton-Raphson method)
uses first and second derivatives and indeed performs better.
Given a starting point, construct a quadratic approximation to
the objective function that matches the first and second
derivative values at that point. We then minimize the
approximate (quadratic function) instead of the original
objective function. The minimizer of the approximate function
is used as the starting point in the next step and repeat the
procedure iteratively.

2
Introduction
We can obtain a quadratic approximation to the twice continuously
differentiable objective function $f:\mathbb{R}^n \to \mathbb{R}$ using the
Taylor series expansion of $f$ about the current point $x^{(k)}$,
neglecting terms of order three and higher:

$$q(x) = f(x^{(k)}) + (x - x^{(k)})^T g^{(k)} + \frac{1}{2}(x - x^{(k)})^T F(x^{(k)})(x - x^{(k)}),$$

where, for simplicity, we use the notation $g^{(k)} = \nabla f(x^{(k)})$, and $F(x^{(k)})$ denotes the Hessian of $f$ at $x^{(k)}$.

Applying the FONC to $q$ yields

$$\nabla q(x) = g^{(k)} + F(x^{(k)})(x - x^{(k)}) = 0.$$

If $F(x^{(k)}) > 0$, then $q$ achieves a minimum at

$$x^{(k+1)} = x^{(k)} - F(x^{(k)})^{-1} g^{(k)}.$$

3
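To make the update rule above concrete, below is a minimal Python sketch of the pure Newton iteration (illustrative, not from the text); it assumes user-supplied callables `grad` and `hess` returning $g^{(k)}$ and $F(x^{(k)})$, and solves a linear system rather than forming the matrix inverse.

```python
import numpy as np

def newton_method(grad, hess, x0, tol=1e-8, max_iter=20):
    """Pure Newton iteration: x_{k+1} = x_k - F(x_k)^{-1} g(x_k).

    grad(x) returns the gradient g(x); hess(x) returns the Hessian F(x).
    Assumes F(x_k) is invertible at every iterate visited.
    """
    x = np.asarray(x0, dtype=float)
    for _ in range(max_iter):
        g = grad(x)
        if np.linalg.norm(g) < tol:        # stop when the gradient is small
            break
        d = np.linalg.solve(hess(x), -g)   # Newton direction: solve F d = -g
        x = x + d
    return x
```

Solving $F(x^{(k)}) d^{(k)} = -g^{(k)}$ instead of computing $F(x^{(k)})^{-1}$ explicitly anticipates the two-step formulation given later in this chapter.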
Example
Use Newton's method to minimize the Powell function:

$$f(x_1, x_2, x_3, x_4) = (x_1 + 10x_2)^2 + 5(x_3 - x_4)^2 + (x_2 - 2x_3)^4 + 10(x_1 - x_4)^4.$$

Use $x^{(0)} = [3, -1, 0, 1]^T$ as the starting point. Perform three iterations.

Note that $f(x^{(0)}) = 215$. We have

$$\nabla f(x) = \begin{bmatrix} 2(x_1 + 10x_2) + 40(x_1 - x_4)^3 \\ 20(x_1 + 10x_2) + 4(x_2 - 2x_3)^3 \\ 10(x_3 - x_4) - 8(x_2 - 2x_3)^3 \\ -10(x_3 - x_4) - 40(x_1 - x_4)^3 \end{bmatrix}$$

and the Hessian

$$F(x) = \begin{bmatrix}
2 + 120(x_1 - x_4)^2 & 20 & 0 & -120(x_1 - x_4)^2 \\
20 & 200 + 12(x_2 - 2x_3)^2 & -24(x_2 - 2x_3)^2 & 0 \\
0 & -24(x_2 - 2x_3)^2 & 10 + 48(x_2 - 2x_3)^2 & -10 \\
-120(x_1 - x_4)^2 & 0 & -10 & 10 + 120(x_1 - x_4)^2
\end{bmatrix}.$$

4
Example
Iteration 1.

5
Example
Iteration 2.

6
Example
Iteration 3.

7
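The numerical values of the three iterates are not reproduced in the slides above. The following sketch (illustrative; the function names are our own) computes them from the gradient and Hessian of the Powell function stated earlier.

```python
import numpy as np

def powell_grad(x):
    x1, x2, x3, x4 = x
    return np.array([
        2*(x1 + 10*x2) + 40*(x1 - x4)**3,
        20*(x1 + 10*x2) + 4*(x2 - 2*x3)**3,
        10*(x3 - x4) - 8*(x2 - 2*x3)**3,
        -10*(x3 - x4) - 40*(x1 - x4)**3,
    ])

def powell_hess(x):
    x1, x2, x3, x4 = x
    a = 120*(x1 - x4)**2
    b = 12*(x2 - 2*x3)**2
    return np.array([
        [2 + a,  20,       0,        -a],
        [20,     200 + b,  -2*b,      0],
        [0,      -2*b,     10 + 4*b, -10],
        [-a,     0,        -10,      10 + a],
    ])

x = np.array([3.0, -1.0, 0.0, 1.0])   # starting point x^(0)
for k in range(3):                    # three Newton iterations
    x = x + np.linalg.solve(powell_hess(x), -powell_grad(x))
    print(f"x^({k+1}) =", x)
```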
Introduction
Observe that the $k$th iteration of Newton's method can be written in two steps as
1. Solve $F(x^{(k)}) d^{(k)} = -g^{(k)}$ for $d^{(k)}$.
2. Set $x^{(k+1)} = x^{(k)} + d^{(k)}$.
Step 1 requires the solution of an $n \times n$ system of linear equations. Thus, an efficient method for solving systems of linear equations is essential when using Newton's method.
As in the one-variable case, Newton's method can be viewed as a technique for iteratively solving the equation
$$g(x) = 0,$$
where $g:\mathbb{R}^n \to \mathbb{R}^n$ and $g(x) = \nabla f(x)$. In this case $F(x)$ is the Jacobian matrix of $g$ at $x$; that is, $F(x)$ is the $n \times n$ matrix whose $(i, j)$th entry is $(\partial g_i / \partial x_j)(x)$, $i, j = 1, \dots, n$.
8
Analysis of Newton’s Method
As in the one-variable case, there is no guarantee that Newton's algorithm heads in the direction of decreasing values of the objective function if $F(x^{(k)})$ is not positive definite (recall Figure 7.7).
Even if $F(x^{(k)}) > 0$, Newton's method may not be a descent method; that is, it is possible that $f(x^{(k+1)}) \geq f(x^{(k)})$. This may occur if our starting point $x^{(0)}$ is far away from the solution.
Despite these drawbacks, Newton's method has superior convergence properties when the starting point is near the solution.
Newton's method works well if $F(x) > 0$ everywhere. However, if $F(x)$ is not positive definite for some $x$, Newton's method may fail to converge to the minimizer.

9
Analysis of Newton’s Method
The convergence analysis of Newton's method when $f$ is a quadratic function is straightforward. Newton's method reaches the point $x^*$ such that $\nabla f(x^*) = 0$ in just one step starting from any initial point $x^{(0)}$.
Suppose that $Q = Q^T$ is invertible and
$$f(x) = \frac{1}{2} x^T Q x - x^T b.$$
Then, $g(x) = \nabla f(x) = Qx - b$ and $F(x) = Q$.
Hence, given any initial point $x^{(0)}$, by Newton's algorithm
$$x^{(1)} = x^{(0)} - F(x^{(0)})^{-1} g^{(0)} = x^{(0)} - Q^{-1}(Qx^{(0)} - b) = Q^{-1}b = x^*.$$
Therefore, for the quadratic case the order of convergence of Newton's algorithm is $\infty$ for any initial point $x^{(0)}$.
10
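A quick numerical check of this one-step property (illustrative sketch; the particular $Q$ and $b$ below are arbitrary choices):

```python
import numpy as np

# Quadratic objective f(x) = 0.5 x^T Q x - x^T b with Q symmetric positive definite
Q = np.array([[4.0, 1.0],
              [1.0, 3.0]])
b = np.array([1.0, 2.0])

x0 = np.array([10.0, -7.0])            # arbitrary starting point
g0 = Q @ x0 - b                        # gradient at x0; the Hessian is Q
x1 = x0 - np.linalg.solve(Q, g0)       # one Newton step

x_star = np.linalg.solve(Q, b)         # minimizer Q^{-1} b
print(np.allclose(x1, x_star))         # True: one step suffices
```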
Analysis of Newton’s Method
Theorem 9.1: Suppose that $f \in \mathcal{C}^3$ and $x^* \in \mathbb{R}^n$ is a point such that $\nabla f(x^*) = 0$ and $F(x^*)$ is invertible. Then, for all $x^{(0)}$ sufficiently close to $x^*$, Newton's method is well defined for all $k$ and converges to $x^*$ with an order of convergence at least 2.
Proof: The Taylor series expansion of $\nabla f$ about $x^{(0)}$ yields
$$\nabla f(x) - \nabla f(x^{(0)}) - F(x^{(0)})(x - x^{(0)}) = O(\|x - x^{(0)}\|^2).$$
Because by assumption $f \in \mathcal{C}^3$ and $F(x^*)$ is invertible, there exist constants $\varepsilon > 0$, $c_1 > 0$, and $c_2 > 0$ such that if $x^{(0)}, x \in \{x : \|x - x^*\| \leq \varepsilon\}$, we have
$$\|\nabla f(x) - \nabla f(x^{(0)}) - F(x^{(0)})(x - x^{(0)})\| \leq c_1 \|x - x^{(0)}\|^2,$$
and by Lemma 5.3, $F(x)^{-1}$ exists and satisfies
$$\|F(x)^{-1}\| \leq c_2.$$

11
Analysis of Newton’s Method

The first inequality holds because the remainder term in the Taylor series expansion contains third derivatives of $f$ that are continuous and hence bounded on $\{x : \|x - x^*\| \leq \varepsilon\}$.
Suppose that $x^{(k)} \in \{x : \|x - x^*\| \leq \varepsilon\}$. Then, substituting $x = x^*$ and $x^{(0)} = x^{(k)}$ in the inequality above and using the assumption that $\nabla f(x^*) = 0$, we get
$$\|\nabla f(x^{(k)}) + F(x^{(k)})(x^* - x^{(k)})\| \leq c_1 \|x^{(k)} - x^*\|^2.$$

12
Analysis of Newton’s Method
Subtracting $x^*$ from both sides of Newton's algorithm and taking norms yields
$$\|x^{(k+1)} - x^*\| = \|x^{(k)} - x^* - F(x^{(k)})^{-1}\nabla f(x^{(k)})\| = \big\|F(x^{(k)})^{-1}\big(\nabla f(x^{(k)}) + F(x^{(k)})(x^* - x^{(k)})\big)\big\|.$$
Applying the inequalities above involving the constants $c_1$ and $c_2$,
$$\|x^{(k+1)} - x^*\| \leq c_1 c_2 \|x^{(k)} - x^*\|^2.$$
Suppose that $x^{(0)}$ is such that
$$\|x^{(0)} - x^*\| \leq \min\left\{\varepsilon, \frac{1}{2 c_1 c_2}\right\}.$$
Then
$$\|x^{(1)} - x^*\| \leq c_1 c_2 \|x^{(0)} - x^*\|^2 \leq \frac{1}{2}\|x^{(0)} - x^*\| \leq \min\left\{\varepsilon, \frac{1}{2 c_1 c_2}\right\},$$
so the same bound can be applied at the next iteration.

13
Analysis of Newton’s Method
By induction, we obtain
$$\|x^{(k)} - x^*\| \leq \left(\frac{1}{2}\right)^k \|x^{(0)} - x^*\|.$$
Hence, $\lim_{k \to \infty} \|x^{(k)} - x^*\| = 0$, and therefore the sequence $\{x^{(k)}\}$ converges to $x^*$. The order of convergence is at least 2 because $\|x^{(k+1)} - x^*\| \leq c_1 c_2 \|x^{(k)} - x^*\|^2$. That is,
$$\frac{\|x^{(k+1)} - x^*\|}{\|x^{(k)} - x^*\|^2} \leq c_1 c_2.$$

14
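As an illustration of the order-2 convergence (not from the text), the following one-dimensional sketch minimizes $f(x) = e^x - 2x$, whose minimizer is $x^* = \ln 2$, and prints the error at each Newton step; the error is roughly squared from one iteration to the next.

```python
import math

# f(x) = exp(x) - 2x, with f'(x) = exp(x) - 2, f''(x) = exp(x), minimizer x* = ln 2
x = 2.0                                 # starting point reasonably near x*
x_star = math.log(2.0)
for k in range(6):
    x = x - (math.exp(x) - 2.0) / math.exp(x)   # Newton step
    print(f"k={k+1}  error = {abs(x - x_star):.3e}")
# The printed error is roughly squared at each step, i.e. order of convergence >= 2.
```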
Analysis of Newton’s Method
Theorem 9.2: Let $\{x^{(k)}\}$ be the sequence generated by Newton's method for minimizing a given objective function $f(x)$. If the Hessian $F(x^{(k)}) > 0$ and $g^{(k)} = \nabla f(x^{(k)}) \neq 0$, then the search direction
$$d^{(k)} = -F(x^{(k)})^{-1} g^{(k)}$$
from $x^{(k)}$ to $x^{(k+1)} = x^{(k)} + d^{(k)}$ is a descent direction for $f$ in the sense that there exists an $\bar{\alpha} > 0$ such that for all $\alpha \in (0, \bar{\alpha})$, $f(x^{(k)} + \alpha d^{(k)}) < f(x^{(k)})$.

15
Analysis of Newton’s Method
Proof: Let $\phi(\alpha) = f(x^{(k)} + \alpha d^{(k)})$. Then, using the chain rule, we obtain
$$\phi'(\alpha) = \nabla f(x^{(k)} + \alpha d^{(k)})^T d^{(k)}.$$
Hence,
$$\phi'(0) = \nabla f(x^{(k)})^T d^{(k)} = -g^{(k)T} F(x^{(k)})^{-1} g^{(k)} < 0,$$
because $F(x^{(k)})^{-1} > 0$ and $g^{(k)} \neq 0$.
Thus, there exists an $\bar{\alpha} > 0$ so that for all $\alpha \in (0, \bar{\alpha})$, $\phi(\alpha) < \phi(0)$. This implies that for all $\alpha \in (0, \bar{\alpha})$,
$$f(x^{(k)} + \alpha d^{(k)}) < f(x^{(k)}).$$

16
Analysis of Newton’s Method
Theorem 9.2 motivates the following modification of Newton's method:
$$x^{(k+1)} = x^{(k)} - \alpha_k F(x^{(k)})^{-1} g^{(k)},$$
where
$$\alpha_k = \arg\min_{\alpha \geq 0} f\big(x^{(k)} - \alpha F(x^{(k)})^{-1} g^{(k)}\big);$$
that is, at each iteration, we perform a line search in the direction $d^{(k)} = -F(x^{(k)})^{-1} g^{(k)}$.
A drawback of Newton's method is that evaluation of $F(x^{(k)})$ for large $n$ can be computationally expensive. Furthermore, we have to solve the set of linear equations $F(x^{(k)}) d^{(k)} = -g^{(k)}$. In Chapters 10 and 11 we discuss methods that address this issue.
The Hessian matrix may not be positive definite. In the next section we describe a simple modification to overcome this problem.

17
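Below is a minimal sketch of this modified (damped) Newton method, using a simple backtracking rule in place of an exact line search. The backtracking constants are illustrative choices, not from the text, and the sketch assumes $F(x^{(k)}) > 0$ so that the Newton direction is a descent direction (Theorem 9.2).

```python
import numpy as np

def damped_newton(f, grad, hess, x0, tol=1e-8, max_iter=50):
    """Newton's method with a backtracking line search along d = -F(x)^{-1} g(x).

    Assumes hess(x) is positive definite at the iterates, so d is a descent direction.
    """
    x = np.asarray(x0, dtype=float)
    for _ in range(max_iter):
        g = grad(x)
        if np.linalg.norm(g) < tol:
            break
        d = np.linalg.solve(hess(x), -g)        # Newton direction
        alpha = 1.0
        # Backtrack until a sufficient decrease (Armijo-type test) is obtained.
        while f(x + alpha * d) > f(x) + 1e-4 * alpha * (g @ d) and alpha > 1e-12:
            alpha *= 0.5
        x = x + alpha * d
    return x
```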
Levenberg-Marquardt Modification
If the Hessian matrix $F(x^{(k)})$ is not positive definite, then the search direction $d^{(k)} = -F(x^{(k)})^{-1} g^{(k)}$ may not point in a descent direction.
Levenberg-Marquardt modification:
$$x^{(k+1)} = x^{(k)} - \big(F(x^{(k)}) + \mu_k I\big)^{-1} g^{(k)}, \qquad \mu_k \geq 0.$$
Consider a symmetric matrix $F$, which may not be positive definite. Let $\lambda_1, \dots, \lambda_n$ be the eigenvalues of $F$ with corresponding eigenvectors $v_1, \dots, v_n$. The eigenvalues are real, but may not all be positive.
Consider the matrix $G = F + \mu I$, where $\mu \geq 0$. Note that the eigenvalues of $G$ are $\lambda_1 + \mu, \dots, \lambda_n + \mu$.

18
Levenberg-Marquardt Modification
Indeed,
$$G v_i = (F + \mu I) v_i = F v_i + \mu v_i = (\lambda_i + \mu) v_i,$$
which shows that for all $i = 1, \dots, n$, $v_i$ is also an eigenvector of $G$ with eigenvalue $\lambda_i + \mu$.
If $\mu$ is sufficiently large, then all the eigenvalues of $G$ are positive and $G$ is positive definite.
Accordingly, if the parameter $\mu_k$ in the Levenberg-Marquardt modification of Newton's algorithm is sufficiently large, then the search direction $d^{(k)} = -\big(F(x^{(k)}) + \mu_k I\big)^{-1} g^{(k)}$ always points in a descent direction.

19
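A short numerical illustration of this eigenvalue-shift argument (the symmetric indefinite matrix below is an arbitrary example, not from the text):

```python
import numpy as np

F = np.array([[1.0,  3.0],
              [3.0, -2.0]])             # symmetric but indefinite
mu = 5.0
G = F + mu * np.eye(2)

print(np.linalg.eigvalsh(F))            # one negative eigenvalue
print(np.linalg.eigvalsh(G))            # every eigenvalue shifted up by mu
print(np.allclose(np.linalg.eigvalsh(G), np.linalg.eigvalsh(F) + mu))   # True
```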
Levenberg-Marquardt Modification
If we further introduce a step size $\alpha_k$,
$$x^{(k+1)} = x^{(k)} - \alpha_k \big(F(x^{(k)}) + \mu_k I\big)^{-1} g^{(k)},$$
then we are guaranteed that the descent property holds.
By letting $\mu_k \to 0$, the Levenberg-Marquardt modification approaches the behavior of the pure Newton's method.
By letting $\mu_k \to \infty$, this algorithm approaches a pure gradient method with small step size.
In practice, we may start with a small value of $\mu_k$ and increase it slowly until we find that the iteration is a descent iteration: $f(x^{(k+1)}) < f(x^{(k)})$.
20
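A minimal sketch of this practical scheme: start with a small $\mu_k$ and increase it until the resulting step decreases $f$. The starting value and the factor of 10 are illustrative choices, not from the text.

```python
import numpy as np

def levenberg_marquardt_step(f, grad, hess, x, mu0=1e-3, max_tries=20):
    """One Levenberg-Marquardt-modified Newton step, increasing mu until descent."""
    g, F = grad(x), hess(x)
    n = x.size
    mu = mu0
    for _ in range(max_tries):
        try:
            d = np.linalg.solve(F + mu * np.eye(n), -g)
        except np.linalg.LinAlgError:      # F + mu*I (near-)singular: increase mu
            mu *= 10.0
            continue
        x_new = x + d
        if f(x_new) < f(x):                # accept as soon as the step is descent
            return x_new, mu
        mu *= 10.0                         # otherwise damp more and retry
    return x, mu                           # fall back to the current point
```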
Newton’s Method for Nonlinear Least Squares
Consider the objective function
$$f(x) = \sum_{i=1}^{m} \big(r_i(x)\big)^2,$$
where $r_i:\mathbb{R}^n \to \mathbb{R}$, $i = 1, \dots, m$, are given functions. This particular problem is called a nonlinear least-squares problem.
Example: Suppose that we are given $m$ measurements of a process at $m$ points in time. Let $t_1, \dots, t_m$ denote the measurement times and $y_1, \dots, y_m$ the measurement values. We wish to fit a sinusoid to the measurement data.

21
Newton’s Method for Nonlinear Least Squares
The equation of the sinusoid is
$$y = A \sin(\omega t + \phi),$$
with appropriate choices of the parameters $A$, $\omega$, and $\phi$.
To formulate the data-fitting problem, we construct the objective function
$$\sum_{i=1}^{m} \big(y_i - A \sin(\omega t_i + \phi)\big)^2,$$
representing the sum of the squared errors between the measurement values and the function values at the corresponding points in time.
Let $x = [A, \omega, \phi]^T$ represent the vector of decision variables. We obtain the least-squares problem with
$$r_i(x) = y_i - A \sin(\omega t_i + \phi), \qquad i = 1, \dots, m.$$

22
Newton’s Method for Nonlinear Least Squares
Defining $r = [r_1, \dots, r_m]^T$, we write the objective function as $f(x) = r(x)^T r(x)$. To apply Newton's method, we need to compute the gradient and the Hessian of $f$.
The $j$th component of $\nabla f(x)$ is
$$(\nabla f(x))_j = \frac{\partial f}{\partial x_j}(x) = 2 \sum_{i=1}^{m} r_i(x) \frac{\partial r_i}{\partial x_j}(x).$$
Denote the Jacobian matrix of $r$ by
$$J(x) = \begin{bmatrix} \frac{\partial r_1}{\partial x_1}(x) & \cdots & \frac{\partial r_1}{\partial x_n}(x) \\ \vdots & & \vdots \\ \frac{\partial r_m}{\partial x_1}(x) & \cdots & \frac{\partial r_m}{\partial x_n}(x) \end{bmatrix}.$$
Thus, the gradient of $f$ can be represented as
$$\nabla f(x) = 2 J(x)^T r(x).$$

23
Newton’s Method for Nonlinear Least Squares
We compute the Hessian matrix of $f$. The $(k, j)$th component of the Hessian is given by
$$\frac{\partial^2 f}{\partial x_k \partial x_j}(x) = 2 \sum_{i=1}^{m} \left( \frac{\partial r_i}{\partial x_k}(x) \frac{\partial r_i}{\partial x_j}(x) + r_i(x) \frac{\partial^2 r_i}{\partial x_k \partial x_j}(x) \right).$$
Letting $S(x)$ be the matrix whose $(k, j)$th component is
$$\sum_{i=1}^{m} r_i(x) \frac{\partial^2 r_i}{\partial x_k \partial x_j}(x),$$
we write the Hessian matrix as
$$F(x) = 2 \big(J(x)^T J(x) + S(x)\big).$$

24
Newton’s Method for Nonlinear Least Squares
Therefore, Newton’s method applied to the nonlinear least-
squares problem is given by

In some applications, the matrix involving the second


derivatives of the function can be ignored because its
components are negligibly small.
In this case Newton’s algorithm reduces to what is commonly
called the Gauss-Newton method:

Note that the Gauss-Newton method does not require


calculation of the second derivatives of

25
Example

For the sinusoid-fitting example, the Jacobian matrix $J(x)$ is an $m \times 3$ matrix with elements given by
$$\frac{\partial r_i}{\partial A} = -\sin(\omega t_i + \phi), \qquad \frac{\partial r_i}{\partial \omega} = -A t_i \cos(\omega t_i + \phi), \qquad \frac{\partial r_i}{\partial \phi} = -A \cos(\omega t_i + \phi), \qquad i = 1, \dots, m.$$
We apply the Gauss-Newton algorithm to find the sinusoid of best fit, that is, the parameters $A$, $\omega$, and $\phi$ that minimize the sum of squared errors.

26
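Below is a sketch of the Gauss-Newton iteration for the sinusoid-fitting example, using the residuals and Jacobian given above. The measurement data from the text are not reproduced here, so the sketch generates synthetic data; the data, initial guess, and iteration count are illustrative, and a reasonably good initial guess is assumed.

```python
import numpy as np

def residuals(x, t, y):
    A, w, phi = x
    return y - A * np.sin(w * t + phi)        # r_i(x) = y_i - A sin(w t_i + phi)

def jacobian(x, t):
    A, w, phi = x
    s, c = np.sin(w * t + phi), np.cos(w * t + phi)
    # Columns: dr_i/dA, dr_i/dw, dr_i/dphi
    return np.column_stack([-s, -A * t * c, -A * c])

# Synthetic measurements standing in for the data used in the text
rng = np.random.default_rng(0)
t = np.linspace(0.0, 10.0, 21)
y = 2.0 * np.sin(1.5 * t + 0.5) + 0.05 * rng.standard_normal(t.size)

x = np.array([1.5, 1.45, 0.3])                # initial guess for [A, w, phi]
for _ in range(10):                           # Gauss-Newton iterations
    r, J = residuals(x, t, y), jacobian(x, t)
    x = x - np.linalg.solve(J.T @ J, J.T @ r)
print("fitted [A, w, phi] =", x)
```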
Newton’s Method for Nonlinear Least Squares
A potential problem with the Gauss-Newton method is that the matrix $J(x^{(k)})^T J(x^{(k)})$ may not be positive definite.
This problem can be overcome using a Levenberg-Marquardt modification:
$$x^{(k+1)} = x^{(k)} - \big(J(x^{(k)})^T J(x^{(k)}) + \mu_k I\big)^{-1} J(x^{(k)})^T r(x^{(k)}).$$
This is referred to in the literature as the Levenberg-Marquardt algorithm, because the original Levenberg-Marquardt modification was developed specifically for the nonlinear least-squares problem.
An alternative interpretation of the Levenberg-Marquardt algorithm is to view the term $\mu_k I$ as an approximation to $S(x^{(k)})$ in Newton's algorithm.

27
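The Levenberg-Marquardt modification changes only the linear system solved at each step. A minimal sketch of the modified update, written as a self-contained function taking the current residual vector $r(x^{(k)})$ and Jacobian $J(x^{(k)})$ (names are illustrative):

```python
import numpy as np

def lm_least_squares_step(x, r, J, mu):
    """One Levenberg-Marquardt step for nonlinear least squares:

    x_next = x - (J^T J + mu I)^{-1} J^T r,

    where r = r(x) is the residual vector and J = J(x) its Jacobian at x.
    """
    n = x.size
    return x - np.linalg.solve(J.T @ J + mu * np.eye(n), J.T @ r)
```

With $\mu_k = 0$ this reduces to the Gauss-Newton step above; larger $\mu_k$ gives a shorter step closer to the negative gradient direction $-J^T r$.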
