1 The Lagrange dual function
1.1 The Lagrangian
1.2 The Lagrange dual function
- is always concave.
- take care the case where the is not differentiable.
1.3 Lower bounds on optimal value
1.4 Linear approximation interpretation
The Lagrangian and lower bound property can be interpreted as a linear approximation of the indicator functions of the sets and .
Where is the indicator function for the nonpositive reals,
and similarly, is the indicator function of .
Note: 通过 indicator function 去除 constraints,如果不满足则设置很大的惩罚。 Note: 整个思想其实就是, replacing the hard constraints with soft versions.
1.5 Examples
- Least-squares solution of linear equations
- Standard form LP
- Equality constrained norm minimization (!!!)
- Two-way partitioning problem (nonconvex)
[!note] the infimum of a quadratic form
- if , the infimum is 0.
- if , then we can find a vector such that , let . then , when . (This is a frequently used trick)
[!note] Dual norm
Obtain the lower bound:
- Lagrangian is a good way to obtain a lower bound for some problem. This can be useful when we need to analyze some algorithms. (I guess)
- A another way to get o lower bound is to augment its feasible set. (relaxation with constraints)
1.6 The Lagrange dual function and conjugate functions
Consider an optimization problem with linear inequality and equality constraints,
Then we can write the dual function as
The domain of follows from the domain of :
Examples:
- Equality constrained norm minimization #confusion
- Entropy maximization: the conjugate of the function , with scalar variable , is .
- Minimum volume covering ellipsoid
Note: an equivalent form of : .
2 The Lagrange dual problem
- The dual feasible is to describe a pair with and .
- The Lagrange dual problem is a convex optimization problem. This is the case whether or not the primal problem is convex.
2.1 Making dual constraints explicit
The domain of the dual function
- Lagrange dual of standard form LP
- Lagrange dual of inequality form LP
2.2 Weak duality
We denote the optimal value of the Lagrange dual problem as . Then, we have a simple inequality:
- The weak duality inequality holds when and are infinite.
- The optimal duality gap of the original problem is , which is always nonnegative.
2.3 Strong duality and Slater’s constraint qualification
If the equality
holds, then we say that strong duality holds.
- does not hold in general
- (usually) holds for convex problems
- sufficient conditions that guarantee strong duality in convex problems are called constraint qualifications
Slater’s condition: There exists an such that
Such a point is sometimes called strictly feasible. Slater’s theorem states that strong duality holds, if Slater’s condition holds and the problem is convex.
- also guarantees that the dual optimum is attained when .
- The affine inequalities do not need to hold with strict inequality.
2.4 Examples
- Least-squares solution of linear equations
- Lagrange dual of LP: One of the two is feasible, then the strong duality holds.
- Lagrange dual of QCQP:
- Entropy maximization
- Minimum volume covering ellipsoid
- A nonconvex quadratic problem with strong duality: trust region problem
Note: for , when , there is a with . (A trick) Note: strong duality holds for any optimization problem with quadratic objective and one quadratic inequality constraint.
2.5 Mixed strategies for matrix games
3 Geometric interpretation
3.1 Weak and strong duality via set of values
which is the set of values taken on by the constraint and objective functions. The optimal value of primal problem is easily expressed as
Then, we have
In particular, the inequality
defines a supporting hyperplane to . (non vertical)
- Epigraph variation
3.2 Proof of strong duality under constraint qualification
3.3 Multicriterion interpretation
4 Saddle-point interpretation
5 Optimality conditions
If strong duality holds, then is primal optimal and is dual optimal if:
- is feasible
- is feasible
5.1 Certificate of suboptimality and stopping criteria
5.2 Complementary slackness
assume satisfies the primal constraints and
equality holds if and only if the two inequalities hold with equality:
- first inequality: minimizes over
- 2nd inequality: for this is known as complementary slackness.
5.3 KKT optimality conditions
if strong duality holds, then is primal optimal and is dual optimal if
- for and for
- for
- is a minimizer of
conversely, these four conditions imply optimality of and strong duality
if problem is convex and the functions are differentiable #4 can written as 4’. the gradient of the Lagrangian with respect to vanishes:
conditions 1,2,3,4’ are known as Karush-Kuhn- Tucker (KKT) conditions.
6 Perturbation and Sensitivity analysis
6.1 The perturbed problem
- is positive: relax the th inequality constraint
- is negative: tight the th inequality constraint
we define as the optimal value of the perturbed problem:
- we can have
- when the original problem is convex, the function is a convex function of and .
6.2 A global inequality
Assume that strong duality holds, and that the dual optimum is attained. Let be optimal for the dual of the unperturbed problem. Then for all and we have
Sensitivity interpretations: ![[b21a7823aef5cf9d0fb8973dff1540e 1.jpg|450]]
- If is large and we tighten the th constraint (i.e., choose ), then the optimal value is guaranteed to increase greatly.
- If is large and positive and we take , or if is large and negative and we take , then the optimal value is guaranteed to increase greatly.
- If is small, and we loosen the th constraint (), then the optimal value will not decrease too much.
- If is small and positive, and , or if is small and negative and , then the optimal value will not decrease too much.
The inequality just gives a lower bound on the perturbed optimal value.
6.3 Local sensitivity analysis
Suppose that is differentiable at . Then, provided strong duality holds, the optimal dual variables are related to the gradient of at :
If , then the constraint is inactive, and it follows that the constraint can be tightened or loosened a small amout without affecting the optimal value. We suppose that (active). The th optimal Lagrange multiplier tells us how active the constraint is:
- If is small, the constraint can be loosened or tightened a bit without much effect on the optimal value
- If is large, it means that if the constraint is loosened or tightened a bit, the effect on the optimal value will be great
7 Examples
Simple equivalent reformulations of a problem can lead to very different dual problems.
- Introducing new variables and equality constraints: can add information for the Lagrangian dual function
- Transforming the objective
- Implicit constraints: make the explicit constraints implicit
8 Semidefinite optimization
matrices are symmetric matrices
Lagrangian and dual function · we associate with the constraint a Lagrange multiplier · define Lagrangian as
· dual function
8.1 Dual semidefinite program
- Weak duality: always
- Strong duality: if primal SDP or dual SDP is strictly feasible
we can readily check whether strong duality holds before we start to solve the problem
if , then
[!note] Matrix decomposition For a matrix , we can decompose as
For example, let
8.2 complementary slackness
the primial and dual object values at feasible are equal if
9 Theorems of alternatives
two related fesibility problems
- weak alternatives: if at most one is feasible
- strong alternatives: if exactly one is feasible
wear alternative is easy to check, for strong alternatives, we need to check both infeasible is impossible
9.1 Nonlinear inequalities
9.2 Theorem of alternatives for linear matrix inequality
we can see that the proof of above questions is achieved by strong duality.