Dogleg and Steihaug Method, Lecture Notes - Mathematics - | Study notes Mathematical Methods

C12.1B: CONTINUOUS OPTIMISATION

LECTURE 7: THE DOGLEG AND STEIHAUG METHODS

RAPHAEL HAUSER

MATHEMATICAL INSTITUTE, UNIVERSITY OF OXFORD

1. Variants of Trust-Region Methods. The generic trust region method we

introduced in Lecture 6 is a fairly general algorithmic framework:

(i) Although we made a specific choice for defining and updating the trust region

$R_k$, other choices are possible, for example by considering balls in the norms

$\|\cdot\|_1$ or $\|\cdot\|_\infty$. We will not pursue this matter further.

(ii) There is freedom in the choice of the model function $m_k$. We chose to investigate

only quadratic model functions whose linear part coincides with the

first order Taylor approximation of $f$, but this leaves many possibilities for

choosing the matrix $B_k$. We discuss this issue in Section 2 below.

(iii) The point $y_{k+1}$ should be obtained via an approximate solution of the trust

region subproblem

$$\min_{y \in R_k} m_k(y). \tag{1.1}$$

Theorem 1.2 of Lecture 6 shows that it is desirable to choose an approximate

computation that uses the Cauchy point as a benchmark, but other than that

there is complete freedom in choosing a method for this computation. Two

of the most widely used methods in this context are the dogleg method of

Section 3.1 and Steihaug’s method of Section 3.2.
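The Cauchy point used as a benchmark in (iii) can be sketched in a few lines. The sketch below assumes that the trust region is the Euclidean ball $\|y - x_k\| \le \Delta_k$ (the definition of $R_k$ from Lecture 6 is not reproduced here) and uses the standard closed-form minimiser of the model along the steepest descent direction. The names `cauchy_point`, `g`, `B` and `delta` are illustrative choices, and plain Python lists are used to keep the sketch free of dependencies.

```python
import math

def cauchy_point(g, B, delta):
    """Step s minimising g^T s + 0.5 s^T B s along -g subject to ||s|| <= delta.

    Assumption: the trust region is a Euclidean ball of radius delta.
    """
    g_norm = math.sqrt(sum(gi * gi for gi in g))
    if g_norm == 0.0:
        # Zero gradient: steepest descent offers no direction, so stay put.
        return [0.0] * len(g)
    Bg = [sum(bij * gj for bij, gj in zip(row, g)) for row in B]
    gBg = sum(gi * bgi for gi, bgi in zip(g, Bg))
    if gBg <= 0.0:
        # Non-positive curvature along -g: go all the way to the boundary.
        tau = 1.0
    else:
        # Positive curvature: interior minimiser along -g, capped at the boundary.
        tau = min(1.0, g_norm ** 3 / (delta * gBg))
    scale = -tau * delta / g_norm
    return [scale * gi for gi in g]

# Hypothetical data: gradient (1, 0) and B = diag(2, 1).
interior_step = cauchy_point([1.0, 0.0], [[2.0, 0.0], [0.0, 1.0]], 1.0)
boundary_step = cauchy_point([1.0, 0.0], [[2.0, 0.0], [0.0, 1.0]], 0.25)
```

With this data the first call stops inside the ball at the one-dimensional minimiser, while the second is cut off by the smaller radius and lands on the boundary.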

2. Choice of the model function. Let us discuss a few methods for choosing

the matrix $B_k$ that determines the model function

$$m_k(x) = f(x_k) + \nabla f(x_k)^T (x - x_k) + \frac{1}{2} (x - x_k)^T B_k (x - x_k).$$
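As a minimal sketch, the model function can be evaluated directly from its three terms. The function name `quadratic_model` and the numerical data are hypothetical and serve only to illustrate the formula.

```python
def quadratic_model(f_xk, grad_xk, B_k, x_k, x):
    """Evaluate m_k(x) = f(x_k) + grad^T s + 0.5 * s^T B_k s, where s = x - x_k."""
    s = [xi - xki for xi, xki in zip(x, x_k)]
    linear = sum(gi * si for gi, si in zip(grad_xk, s))
    B_s = [sum(bij * sj for bij, sj in zip(row, s)) for row in B_k]
    quadratic = 0.5 * sum(si * bsi for si, bsi in zip(s, B_s))
    return f_xk + linear + quadratic

# Hypothetical data: f(x_k) = 1, gradient (1, 2), B_k = 2 * identity, x_k = 0.
value = quadratic_model(1.0, [1.0, 2.0], [[2.0, 0.0], [0.0, 2.0]],
                        [0.0, 0.0], [1.0, 1.0])
print(value)  # 1 + 3 + 2 = 6.0
```

At $x = x_k$ the step $s$ vanishes and the model returns $f(x_k)$, whatever the choice of $B_k$.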

2.1. Trust-Region Newton Methods. If the problem dimension is not too

large, the choice

$$B_k = D^2 f(x_k)$$

is reasonable and leads to a model function $m_k$ that is simply the second order Taylor

approximation of the objective function $f$ around the current iterate $x_k$. Methods

based on this choice of model function are called trust-region Newton methods.
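One way to see what the choice $B_k = D^2 f(x_k)$ buys is to check numerically that the model error shrinks like the cube of the step length. The sketch below uses a hypothetical test function $f(x) = e^{x_1} + x_2^2$ with $x_k = (0, 0)$, where the gradient and Hessian are written out by hand. It is an illustration, not part of the lecture's argument.

```python
import math

def f(x):
    return math.exp(x[0]) + x[1] ** 2

def taylor_model(s):
    """Second order Taylor model of f at x_k = (0, 0), i.e. B_k = D^2 f(x_k)."""
    f_xk = 1.0                          # f(0, 0)
    grad = [1.0, 0.0]                   # gradient of f at (0, 0)
    hess = [[1.0, 0.0], [0.0, 2.0]]     # Hessian of f at (0, 0)
    linear = grad[0] * s[0] + grad[1] * s[1]
    quad = 0.5 * sum(s[i] * hess[i][j] * s[j]
                     for i in range(2) for j in range(2))
    return f_xk + linear + quad

def model_error(t):
    """Absolute model error |f(x_k + s) - m_k(x_k + s)| for the step s = (t, t)."""
    s = [t, t]
    return abs(f(s) - taylor_model(s))

# Halving the step should reduce the error by roughly 2**3 = 8.
ratio = model_error(0.1) / model_error(0.05)
```

A ratio close to 8 is consistent with a third order error term, which is what a second order Taylor approximation leaves behind.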

It is important to understand that trust-region Newton methods are not simply

the Newton-Raphson method with an additional step-size restriction. In fact, trust-region

Newton methods overcome most of the unwanted aspects of the dynamical

behaviour of the Newton-Raphson method while retaining all its advantages with regard

to convergence speed:

(i) In the neighbourhood of a saddle point or a local maximiser $x^*$ of $f$, the

Newton-Raphson method is attracted to $x^*$. This is unwanted, because $x^*$

is a spurious solution of the minimisation problem $\min f(x)$. Trust-region

Newton methods are not attracted to such solutions because the trust-region

framework ensures that the sequence $(f(x_k))_{k \in \mathbb{N}}$ is decreasing.
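The behaviour described in (i) can be observed on a one-dimensional example. The sketch below uses the hypothetical test function $f(x) = x^4/4 - x^2/2$, which has a local maximiser at $0$ and minimisers at $\pm 1$. The trust-region loop is a simplified stand-in for the generic method of Lecture 6: the acceptance threshold 0.1 and the radius update rules (double above 0.75, halve below 0.25) are assumptions, not values taken from the notes. In one dimension the subproblem can be solved exactly, so no dogleg or Steihaug step is needed.

```python
def f(x):
    return 0.25 * x ** 4 - 0.5 * x ** 2

def df(x):
    return x ** 3 - x

def d2f(x):
    return 3.0 * x ** 2 - 1.0

def newton_raphson(x, iters=6):
    """Plain Newton-Raphson applied to f'(x) = 0."""
    for _ in range(iters):
        x = x - df(x) / d2f(x)
    return x

def trust_region_newton(x, delta=0.5, iters=30):
    """1D trust-region Newton sketch with B_k = f''(x_k) (assumed parameters)."""
    for _ in range(iters):
        g, h = df(x), d2f(x)
        # Exact minimiser of g*s + 0.5*h*s^2 over |s| <= delta.
        if h > 0.0 and abs(g / h) <= delta:
            s = -g / h
        else:
            s = -delta if g > 0.0 else delta
        predicted = -(g * s + 0.5 * h * s * s)   # decrease promised by the model
        if predicted <= 0.0:
            break                                # model offers no further decrease
        rho = (f(x) - f(x + s)) / predicted      # actual vs. predicted decrease
        if rho > 0.1:
            x = x + s                            # accept only if f really decreases
        if rho > 0.75:
            delta *= 2.0
        elif rho < 0.25:
            delta *= 0.5
    return x

x_nr = newton_raphson(0.1)        # drifts to the local maximiser 0
x_tr = trust_region_newton(0.1)   # moves away from 0 towards the minimiser 1
```

Starting from $x_0 = 0.1$, the Newton-Raphson iterates converge to the maximiser and increase $f$, whereas the trust-region variant only accepts steps that decrease $f$ and ends up at the minimiser $x = 1$.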
