
















Study with the several resources on Docsity
Earn points by helping other students or get them with a premium plan
Prepare for your exams
Study with the several resources on Docsity
Earn points to download
Earn points by helping other students or get them with a premium plan
An introduction to optimization methods, focusing on second order methods. It covers the concept of taylor expansion, the newton method, gauss-newton method, quasi-newton methods, and their applications in unconstrained and constrained optimization. The document also discusses the advantages of second order methods over gradient-based methods and the computational issues involved.
Typology: Exercises
1 / 24
This page cannot be seen from the preview
Don't miss anything!

















Second Order Optimization Methods
Marc Toussaint U Stuttgart
Conjugate Gradient
Plain Gradient
2nd Order
x ← x − f (x) f ′(x)
x ← x − f ′(x) f ′′(x) For x ∈ Rn: x ← x − ∇^2 f (x)-1∇f (x)
Input: initial x ∈ Rn, functions f (x), ∇f (x), ∇^2 f (x), tolerance θ Output: x 1: initialize stepsize α = 1 and damping λ = 10−^10 2: repeat 3: compute ∆ to solve (∇^2 f (x) + λI) ∆ = −∇f (x) 4: repeat // “line search” 5: y ← x + α∆ 6: if f (y) ≤ f (x) then // step is accepted 7: x ← y 8: α ← α^0.^5 // increase stepsize towards α = 1 9: else // step is rejected 10: α ← 0. 1 α // decrease stepsize 11: end if 12: until step accepted or (in bad case) α||∆||∞ < θ/ 1000 13: until ||∆||∞ < θ
(If ∇^2 f (x) is a “tree”: Dynamic Programming on the “Junction Tree”)
f (x) = φ(x)>φ(x) ∇f (x) = 2∇φ(x)>φ(x) ∇^2 f (x) = 2∇φ(x)>∇φ(x) + 2φ(x)>∇^2 φ(x)
The Gauss-Newton method is the Newton method for f (x) = φ(x)>φ(x) with approximating ∇^2 φ(x) ≈ 0
The approximate Hessian 2 ∇φ(x)>∇φ(x) is always semi-pos-def!
3: compute ∆ to solve (∇φ(x)>∇φ(x) + λI) ∆ = −∇φ(x)>φ(x)
∇^2 f (x) ≈ ∇f^ (x^2 )^ − ∇f^ (x^1 ) x 2 − x 1
∇^2 f (x) ∆x =! y ∆x =! ∇^2 f (x)−^1 y
∇^2 f (x) = y y> y>∆x ∇
(^2) f (x)− (^1) = ∆x∆x> ∆x>y
Convince yourself that the last line solves the desired relations [Left: how to update ∇^2 f (x). Right: how to update directly ∇^2 f (x)-1.]
Input: initial x ∈ Rn, functions f (x), ∇f (x), tolerance θ Output: x 1: initialize H-1^ = In 2: repeat 3: compute ∆ = −H-1∇f (x) 4: perform a line search minα f (x + α∆) 5: ∆ ← α∆ 6: y ← ∇f (x + ∆) − ∇f (x) 7: x ← x + ∆ 8: update H-1^ ←
( I − y ∆∆>>y
)> H-
( I − y ∆∆>>y
)
Directly use (Gauss-)Newton/BFGS → will boost performance of these constrained optimization methods!