

Study with the several resources on Docsity
Earn points by helping other students or get them with a premium plan
Prepare for your exams
Study with the several resources on Docsity
Earn points to download
Earn points by helping other students or get them with a premium plan
Information on the gradient projection method for solving simple constrained problems. It includes two exercises: the first theoretical, involving the proof that the gradient projection method with a specific stepsize minimizes a quadratic over-estimate of a function with lipshitz continuous gradients, and the second practical, requiring the use of matlab to solve a specific optimization problem with given constraints and different parameter values, comparing the results of the gradient method and nesterov method.
Typology: Study notes
1 / 2
This page cannot be seen from the preview
Don't miss anything!


Homework 5: Methods for Simple Constrained Problems
Due November 14, 2012 in class
NOTE: All norms are Euclidean unless clearly indicated otherwise.
Exercise 1: (20 points) Consider the gradient projection method with the stepsize α = 1/L for a function f with Lipshitz continuous gradients, where L is the Lipschitz constant (see the notes on First-Order Methods). Show that
xk+1 = argmin y∈X
Q(y; xk),
where Q(y; x) is the quadratic over-estimate of f , i.e.,
Q(y; x) = f (x) + ∇f (x)′(y − x) +
‖y − x‖^2 for all x, y ∈ Rn.
Exercise 2: (40 =20 I + 20 II, each (a)(b) is 10 points) This exercise will require the use of Matlab to develop a code for solving the following problem: minimize f (x 1 , x 2 ) = 12 (x^21 + ax^22 )
subject to (x 1 , x 2 ) ∈ [0, 10] × [0, 10], where a is a parameter to be specified shortly.
(I) Solve the problem by using the first-order gradient method with initial point x 0 = (10, 10) and a stepsize αk = (^1) L , where L is the Lipshcitz gradient constant for the given function. Using Matlab, run simulations to solve the following instances of the problem: (a) a = 100. (b) a = 10000. (I) Repeat part (I) by using the Nesterov first method instead, with the same initial point as in part (I).
For both parts (I) and (I) to report your simulation results, do the following:
(1) Report the value of the constant L that you have computed. (2) Run the algorithm for 1000 iterations in each part (a) and (b). (3) Provide a plot showing the function values obtained by the algorithm in cases (a) and (b) as the iteration index is varying k = 1,... , 1000.
(4) Provide your comments regarding the results, such as in which of the instances the algorithm converges faster? How would you justify the faster convergence? What causes the slow-down in one of the instances? Is the slow-down significant?
For a full credit, please provide your “m” file that you created to solve this problem, as well as the Matlab plot you obtained in item (3). Have fun!