Prepare for your exams
Get points
Guidelines and tips
Sell on Docsity
Docsity AI

Prepare for your exams

Study with the several resources on Docsity

Earn points to download

Earn points by helping other students or get them with a premium plan

Guidelines and tips

Sell on Docsity

Docsity AI

Prepare for your exams

Study with the several resources on Docsity

Find documents

Prepare for your exams with the study notes shared by other students like you on Docsity

Search for your university

Find the specific documents for your university's exams

Docsity AINEW

Summarize your documents, ask them questions, convert them into quizzes and concept maps

Explore questions

Clear up your doubts by reading the answers to questions asked by your fellow students

Earn points to download

Earn points by helping other students or get them with a premium plan

Share documents

20 Points

For each uploaded document

Answer questions

5 Points

For each given answer (max 1 per day)

All the ways to get free points

Get points immediately

Choose a premium plan with all the points you need

Study Opportunities

Choose your next study program

Get in touch with the best universities in the world. Search through thousands of universities and official partners

Community

Ask the community

Ask the community for help and clear up your study doubts

Free resources

Our save-the-student-ebooks!

Download our free guides on studying techniques, anxiety management strategies, and thesis advice from Docsity tutors

Understanding Duality & Subgradient Method in Nonlinear Programming, Slides of Computer Science

All India Institute of Medical Sciences Computer Science

An in-depth exploration of dual computational methods in nonlinear programming. It covers the concept of duality, the role of lagrangian relaxation, and the structure of dual problems. The document also delves into dual derivatives, subgradients, and the key subgradient method property. Additionally, it discusses the non-differentiable dual and the subgradient method for solving non-differentiable optimization problems.

Typology: Slides

2012/2013

Uploaded on 03/27/2013

ekana 🇮🇳

(44)

370 documents

1 / 8

This page cannot be seen from the preview

Don't miss anything!



NONLINEAR PROGRAMMING

LECTURE 21: DUAL COMPUTATIONAL METHODS

LECTURE OUTLINE

• Dual Methods

• Nondifferentiable Optimization

********************************

• Consider the primal problem

minimize f (x)

subject to x ∈ X, gj

(x) ≤ 0, j =1,...,r,

assuming −∞ <f

∗ < ∞.

• Dual problem: Maximize

q(µ)= inf L(x, µ)= inf

x∈X{f (x)+ µ g(x)}

x∈X

subject to µ ≥ 0.

Docsity.com

Discover Slides of Computer Science All India Institute of Medical Sciences

Partial preview of the text

Download Understanding Duality & Subgradient Method in Nonlinear Programming and more Slides Computer Science in PDF only on Docsity!

′

NONLINEAR PROGRAMMING

LECTURE 21: DUAL COMPUTATIONAL METHODS

LECTURE OUTLINE

• Dual Methods

• Nondifferentiable Optimization

• Consider the primal problem

minimize f (x)

subject to x ∈ X, gj†(x) ≤ 0 , j = 1,... , r,

assuming −∞ < f

∗ < ∞.

• Dual problem: Maximize

q(μ) = inf L(x, μ) = inf x∈X {f (x) + μ g(x)} x∈X†

subject to μ ≥ 0.

PROS AND CONS FOR SOLVING THE DUAL

• The dual is concave.

• The dual may have smaller dimension and/or

simpler constraints.

• If there is no duality gap and the dual is solved

exactly for a Lagrange multiplier μ

∗

, all optimal pri-

mal solutions can be obtained by minimizing the

Lagrangian L(x, μ

∗

) over x ∈ X.

• Even if there is a duality gap, q(μ) is a lower

bound to the optimal primal value for every μ ≥ 0.

• Evaluating q(μ) requires minimization of L(x, μ)

over x ∈ X.

• The dual function is often nondifferentiable.

• Even if we find an optimal dual solution μ

∗

, it may

be difficult to obtain a primal optimal solution.

′

′ ′

DUAL DERIVATIVES

• Let

xμ† = arg min L(x, μ) = arg min f (x) + μ g(x). x∈X x∈X†

Then for all μ ∈

q(˜ μ ) = inf f (x) + ˜μ g(x) x∈X† ≤ f (xμ) + ˜μ g(xμ) = f (xμ) + μ g(xμ) + (˜ ′ μ − μ) ′ g(xμ) = q(μ) + (˜μ − μ) ′ g(xμ).

• Thus g(xμ) is a subgradient of q at μ.

• Proposition: Let X be compact, and let f and g

be continuous over X. Assume also that for every

μ, L(x, μ) is minimized over x ∈ X at a unique point

xμ. Then, q is everywhere continuously differen-

tiable and

∇q(μ) = g(xμ), ∀ μ ∈ r† .

′

NONDIFFERENTIABLE DUAL

• If there exists a duality gap, the dual function is

nondifferentiable at every dual optimal solution.

• Important nondifferentiable case: When q is

polyhedral, that is,

q(μ) = min aiμ + bi† , i∈I†

where I is a finite index set, and ai† ∈

r†

and bi†

are given (arises when X is a discrete set, as in

integer programming).

• Proposition: Let q be polyhedral as above, and

let Iμ† be the set of indices attaining the minimum

Iμ† = i ∈ I | a i μ + bi† = q(μ).

The set of all subgradients of q at μ is

∂q(μ) = g � g = ξiai, ξi† ≥ 0 , ξi† = 1.

i∈Iμ i∈Iμ

KEY SUBGRADIENT METHOD PROPERTY

• For a small stepsize it reduces the Euclidean

distance to the optimum.

M g k μk μk^ + sk^ g k μk+1^ = [ μk^ + sk^ g k^ ]+ μ* < 90 o Contours of q

• Proposition: For any dual optimal solution μ

∗

we have

∗ ‖μ k+ − μ ∗ ‖ < ‖μ k† − μ ‖,

for all stepsizes s

k†

such that

2 q(μ ∗ ) − q(μ k ) 0 < s k† <. ‖gk^ ‖^2

STEPSIZE RULES

• Diminishing stepsize is one possibility.

• More common method:

α k† q k† − q(μ k ) k† s = , ‖gk^ ‖^2

where q

k† ≈ q ∗

and

0 < α k† < 2.

• Some possibilities:

− q k†

is the best known upper bound to q

∗

0 = 1

and α

k†

decreased by a certain factor every

few iterations.

− α k†

= 1 for all k and

q k† = 1 + β(k) ˆ k† q ,

where ˆq

k† = max 0 ≤i≤k†q(μ i

Understanding Duality & Subgradient Method in Nonlinear Programming, Slides of Computer Science

Related documents

Partial preview of the text

Download Understanding Duality & Subgradient Method in Nonlinear Programming and more Slides Computer Science in PDF only on Docsity!

NONLINEAR PROGRAMMING

LECTURE 21: DUAL COMPUTATIONAL METHODS

LECTURE OUTLINE

• Dual Methods

• Nondifferentiable Optimization

• Consider the primal problem

minimize f (x)

subject to x ∈ X, gj†(x) ≤ 0 , j = 1,... , r,

assuming −∞ < f

• Dual problem: Maximize

subject to μ ≥ 0.

PROS AND CONS FOR SOLVING THE DUAL

• The dual is concave.

• The dual may have smaller dimension and/or

simpler constraints.

• If there is no duality gap and the dual is solved

exactly for a Lagrange multiplier μ

, all optimal pri-

mal solutions can be obtained by minimizing the

Lagrangian L(x, μ

) over x ∈ X.

• Even if there is a duality gap, q(μ) is a lower

bound to the optimal primal value for every μ ≥ 0.

• Evaluating q(μ) requires minimization of L(x, μ)

over x ∈ X.

• The dual function is often nondifferentiable.

• Even if we find an optimal dual solution μ

, it may

be difficult to obtain a primal optimal solution.

DUAL DERIVATIVES

• Let

Then for all μ ∈ 

• Thus g(xμ) is a subgradient of q at μ.

• Proposition: Let X be compact, and let f and g

be continuous over X. Assume also that for every

μ, L(x, μ) is minimized over x ∈ X at a unique point

xμ. Then, q is everywhere continuously differen-

tiable and

NONDIFFERENTIABLE DUAL

• If there exists a duality gap, the dual function is

nondifferentiable at every dual optimal solution.

• Important nondifferentiable case: When q is

polyhedral, that is,

where I is a finite index set, and ai† ∈ 

and bi†

are given (arises when X is a discrete set, as in

integer programming).

• Proposition: Let q be polyhedral as above, and

let Iμ† be the set of indices attaining the minimum

The set of all subgradients of q at μ is

∂q(μ) = g � g = ξiai, ξi† ≥ 0 , ξi† = 1.

KEY SUBGRADIENT METHOD PROPERTY

• For a small stepsize it reduces the Euclidean

distance to the optimum.

• Proposition: For any dual optimal solution μ

we have

for all stepsizes s

such that

STEPSIZE RULES

• Diminishing stepsize is one possibility.

• More common method:

where q

and

• Some possibilities:

is the best known upper bound to q

and α

decreased by a certain factor every

few iterations.

= 1 for all k and

where ˆq

), and β(k) > 0 is

adjusted depending on algorithmic progress

of the algorithm.

Then for all μ ∈

where I is a finite index set, and ai† ∈