Lifted Newton Optimization
Lifted Newton Optimization
Use in Optimization
Moritz Diehl
Optimization in Engineering Center (OPTEC), K.U. Leuven, Belgium
joint work with Jan Albersmeyer (U. Heidelberg)
Overview
with
Why to lift and increase the system size?
Write
with
To solve
can eliminate
Define
PROPOSITION: if then
and
Original Problem:
Lifted Problem:
( obtained by lifting )
Lifted Gauss-Newton: Quadratic Subproblems
Linearized lifted problem = structured QP subproblem:
function value
gradient
Gradient evaluation via adjoint differentiation
function value
gradient
Lifted Newton = Full Space SQP
then full space SQP and lifted Newton iterations are identical, with
(need full derivatives only w.r.t. u, not µ, due to symmetry of KKT systems)
Overview
Derivative is given by
Lifted residual is
(if all have the same sign, last variable is "leader" and contracts fastest)
Convergence for motivating toy example
Practical Conclusions from Theorem
Two cases:
CPU time per iteration: 8.9 s for unlifted, 11.8 s for lifted (~1 min total)
(Note: formulation e.g. in AMPL would involve 27 million variables)
Summary
Lifting offers advantages for Newton type optimization:
• faster local convergence rate (proven in simplified setting)
• more freedom for initialization