10.1. Convex Optimization#

Main references for this section are [6, 9, 17].

10.1.1. Convex Optimization Problems#

Definition 10.1 (Convex optimization problem general form)

Let V be an n-dimensional real vector space. Let f:VR be a convex function with S=domf. Let CSV be a convex set.

A mathematical optimization problem of the form

minimize f(x)subject to xC

is known as a convex optimization problem.

In other words, a convex optimization problem minimizes a convex function over a convex set. We can write the optimal value of the minimization problem as

p=inf{f(x)|xC}.

The set of optimal points for the minimization problem is

Xopt={xC|f(x)=p}.

Note

  1. f is the objective function (or cost function) being minimized.

  2. x is the optimization variable.

  3. C is the feasible set. Any xC is a feasible point.

  4. The problem is feasible if C.

  5. If C=, then the problem is infeasible.

  6. If the problem is infeasible then p=.

  7. If the problem is feasible then p<.

  8. If p=, then the problem is unbounded below.

  9. We say that x is an optimal point for the minimization problem if f(x)=p.

  10. If Xopt is nonempty, then we say that the optimal value p is attained at some point.

  11. It is possible that p is finite and yet the optimal set is empty.

Remark 10.1 (Closedness of the feasible set)

Some authors prefer to define convex optimization as the minimization of a convex function over a closed and convex set.

  1. We have not insisted that C be a closed set in the general definition of convex optimization problem.

  2. The closedness of C is an additional property which can provide additional guarantees on the existence of an optimal point. We shall introduce it whenever needed in the discussion below.

  3. Recall from Theorem 9.170 that a closed and convex set C is an intersection of all the halfspaces that contain it. Let {Ai}iI be the set of halfspaces that contains C.

  4. Then, each half space can be written as

    Ai={xV|x,aibi}.
  5. Thus, xC is equivalent to x,aibi for every iI.

  6. This gives us the motivation to consider the feasible set in the form of intersection of sublevel sets of convex functions.

Majority of convex optimization problems can be transformed into a functional form where the constraints are expressed in the form of sublevel sets of convex functions and the level sets of affine functions.

10.1.1.1. Convex Optimization Standard Form#

Definition 10.2 (Convex optimization problem standard form)

Let V be an n-dimensional real vector space. A mathematical optimization problem of the form

(10.1)#minimize f0(x)subject to fi(x)0,i=1,,mhj(x)=0,j=1,,p

with optimization variable xV is called a convex optimization problem in standard form if

  1. The objective function f0:VR is a convex function.

  2. The inequality constraint functions fi:VR are convex functions for i=1,,m.

  3. The equality constraint functions hj:VR are affine functions for j=1,,p.

Observation 10.1 (Discussion on the convex optimization standard form)

  1. The domain of the problem is given by

    D=domf0i=1mdomfij=1pdomhj.
  2. domhj=V for every j=1,,p since hj are affine functions.

  3. Thus,

    D=domf0i=1mdomfi.
  4. By definition domfi are convex for i=0,,m. Hence D is convex.

  5. The feasible set C is given by

    C=domf0i=1mfi1(,0]j=1phj1(0).
  6. Then, fi1(,0] are sublevel sets of convex functions hence they are also convex sets.

  7. By Theorem 4.209, the level sets of affine functions are affine sets.

  8. Thus, hj1(0) is an affine set. Hence, it is a convex set.

  9. Thus, C is an intersection of convex sets.

  10. Thus, C is a convex set.

  11. We note that we can rewrite the standard form as

    minimize f0(x)subject to xC

    to match with Definition 10.1.

Remark 10.2 (Adding closedness of objective and constraint functions)

Suppose we add an additional requirement that the function fi for i=0,,m are closed.

  1. Recall from Definition 3.64 that a function is closed if all its sublevel sets are closed.

  2. In particular, this means that the domain of a closed function is also closed.

  3. Thus, D is a closed set.

  4. Then, fi1(,0] are closed sets since fi are closed functions.

  5. Since V is finite dimensional, hence affine sets are closed. Hence hj1(0) is closed for every j=1,,p.

  6. Thus, C is an intersection of closed and convex sets.

  7. Thus, C is a closed and convex set.

  8. Recall from Theorem 3.92 that a function is closed if and only if it has a closed epigraph.

  9. Also, recall from Theorem 3.119 that a function is closed if and only if it is l.s.c. (lower semicontinuous).

  10. Further, recall from Theorem 9.172 that every convex function has a closure which is l.s.c. (hence closed) given by its lower semicontinuous hull and the closure of a convex function is also convex.

  11. Thus, if any of the fi for i=0,,m is convex but not a closed function, we can replace it by its lower semicontinuous hull to convert it to a closed convex function. This way, be can bring such problems to the standard form for convex optimization.

Remark 10.3 (Comparison of two forms)

We have seen two forms of describing convex optimization problems.

  1. Definition 10.1 describes it as minimizing a convex function over a convex set.

  2. Definition 10.2 describes it in the form of minimizing a convex objective function with convex inequality constraints and affine equality constraints.

  3. As seen in comments above, the second form can be converted into the first form.

  4. The first form is more general. It is very useful in establishing the theoretical properties of convex optimization problems.

  5. The second form is more useful from practical perspective. Almost all real life convex optimization problems can be reduced to this form. It is easier to describe algorithms to solve convex optimization problems in terms of this form.

In the sequel, we will liberally use either form for proving theoretical results and developing algorithms.

Definition 10.3 (Concave function maximization problem)

Let V be an n-dimensional real vector space. A mathematical optimization problem of the form

(10.2)#maximize f0(x)subject to fi(x)0,i=1,,mhj(x)=0,j=1,,p

with optimization variable xV is called a concave maximization problem if

  1. The objective function f0:VR is a concave function.

  2. The inequality constraint functions fi:VR are convex functions for i=1,,m.

  3. The equality constraint functions hj:VR are affine functions for j=1,,p.

Remark 10.4 (Concave maximization as a convex optimization problem)

We note that maximizing f0 is same as minimizing f0. Further, if f0 is concave then f0 is convex. Thus, (10.2) is equivalent to the problem

minimize f0(x)subject to fi(x)0,i=1,,mhj(x)=0,j=1,,p

which is a convex optimization problem in standard form. With an abuse of notation, we shall call a concave function maximization program also a convex optimization program.

10.1.1.2. Proper Convex Functions#

It is useful to extend these definitions to proper convex functions as objective functions and/or inequality constraint functions.

Definition 10.4 (Optimization problem general form for proper convex functions)

Let V be an n-dimensional real vector space. Let f:V(,] be a proper convex function with S=domf. Let XV be a convex set.

A mathematical optimization problem of the form

minimize f(x)subject to xX

is known as a convex optimization problem for proper convex functions.

In other words, this problem minimizes a proper convex function over a convex set. We can write the optimal value of the minimization problem as

p=inf{f(x)|xX}.

The set of feasible points is given by C=Xdomf which is a convex set.

The set of optimal points for the minimization problem is

Xopt={xC|f(x)=p}.

Definition 10.5 (Convex optimization problem standard form for proper convex functions)

Let V be an n-dimensional real vector space. A mathematical optimization problem of the form

(10.3)#minimize f0(x)subject to fi(x)0,i=1,,mhj(x)=0,j=1,,p

with optimization variable xV is called a convex optimization problem in standard form for proper convex functions if

  1. The objective function f0:V(,] is a proper convex function.

  2. The inequality constraint functions fi:V(,] are proper convex functions for i=1,,m.

  3. The equality constraint functions hj:VR are affine functions for j=1,,p.

The reader can check that the definitions of problem domain, feasible set, optimal value, optimal solutions/points, etc. can be easily extended by using the effective domains of f0,,fm.

10.1.1.3. Local and Global Optima#

Theorem 10.1 (Local minimum is a global minimum in convex optimization)

Let f:VR be a convex function. Let Cdomf be a convex set. Consider the problem of minimizing f over C. Let x be locally optimal for f over C. Then, x is globally optimal for f over C.

In other words,

f(y)f(x)yC.

Proof. Since x is a local minimum of f over C, hence there exists r>0 such that

f(x)=inf{f(z)|zB[x,r]C}.

In other words, f(x)f(x) for every xB[x,r]C.

  1. Let yC be any point such that yx.

  2. We need to show that f(y)f(x).

  3. Let t(0,1] be such that z=x+t(yx)B[x,r].

  4. Since C is convex, hence zC as x,yC and z is their convex combination.

  5. Thus, zB[x,r]C.

  6. By the local optimality condition

    f(x)f(z)=f(x+t(yx))=f((1t)x+ty)(1t)f(x)+tf(y).

    We used the fact that f is convex.

  7. Cancelling and rearranging the terms, we get tf(x)tf(y).

  8. Since t>0, hence it reduces to f(x)f(y).

  9. Thus, x is indeed globally optimal.

Corollary 10.1 (Local minimum is a global minimum in for proper convex functions)

Let f:V(,] be a proper convex function. Let X be a convex subset of V. Consider the problem of minimizing f over X. Let x be locally optimal for f over X. Then, x is globally optimal for f over X.

In other words,

f(y)f(x)yX.

Proof. We note that the feasible set is C=Xdomf. Since both X and domf are convex, hence C is convex.

  1. If C=, then there are no feasible points. Hence, no local/global optima.

  2. We now consider the case where C.

  3. Following the argument in Theorem 10.1, if x is locally optimal for f over C, then it is globally optimal for f over C.

  4. Consequently, it is globally optimal for f over X as f(x)=>f(x) for every xXdomf.

The argument can be modified to show that if f is strictly convex, then a locally optimal point for f is strictly globally optimal point.

Theorem 10.2 (Local minimum is strict global minimum for strictly convex functions)

Let f:VR be a strictly convex function. Let Cdomf be a convex set. Let x be locally optimal for f over C. Then, x is strictly globally optimal for f over C.

In other words,

f(y)>f(x)yC.

Proof. There exists r>0 such that f(x)f(x) for every xB[x,r]C.

  1. Let yC be any point such that yx.

  2. Let t(0,1) be such that z=x+t(yx)B[x,r].

  3. Since C is convex, hence zC as x,yC and z is their convex combination.

  4. Thus, zB[x,r]C.

  5. Note that z=(1t)x+ty. Thus, z is distinct from x and y and lies in the line segment between them.

  6. By the local optimality condition

    f(x)f(z)=f((1t)x+ty)<(1t)f(x)+tf(y).

    We used the fact that f is strictly convex.

  7. Cancelling and rearranging the terms, we get tf(x)<tf(y).

  8. Since t>0, hence it reduces to f(x)<f(y).

  9. Thus, x is indeed strictly globally optimal.

Corollary 10.2 (Local minimum is strict global minimum for proper strictly convex functions)

Let f:V(,] be a proper strictly convex function. Let X be a convex subset of V. Let x be locally optimal for f over X. Then, x is strictly globally optimal for f over X.

In other words,

f(y)>f(x)yX.

10.1.1.4. Optimal Sets#

The optimal sets of a convex optimization problem are also convex.

Theorem 10.3 (Optimal set is convex for a convex optimization problem)

Let f:VR be a convex function. Let Cdomf be a convex set. Let the optimal value for the minimization of f over C be given by

p=inf{f(x)|xC}.

Let the optimal set (the set of optimal points) be given by

Xopt={xC|f(x)=p}.

Then, Xopt is a convex set.

Proof. If Xopt is empty, then it is convex trivially. Now consider the case where Xopt is nonempty.

  1. Let x,yXopt.

  2. Then, x,yC.

  3. Let t[0,1].

  4. Let z=tx+(1t)y.

  5. Since C is convex, hence zC. Thus, zdomf.

  6. Since f is convex, hence

    f(z)tf(x)+(1t)f(y)=tp+(1t)p=p.
  7. But p=inf{f(x)|xC}.

  8. Thus, f(z)p.

  9. Combining, the two inequalities, we get p=f(z).

  10. Thus, zXopt.

  11. Thus, for every x,yXopt and every t[0,1], z=tx+(1t)yXopt.

  12. Thus, Xopt is indeed convex.

Theorem 10.4 (Optimal points for minimization of strictly convex functions)

Let f:VR be a strictly convex function. Let Cdomf be a convex set. Then, the minimization problem

minimize f(x)subject to xC

has at most one optimal point.

Proof. Let the optimal value for the minimization of f over C be given by

p=inf{f(x)|xC}.

Let the optimal set (the set of optimal points) be given by

Xopt={xC|f(x)=p}.
  1. By Theorem 10.1, Xopt is a convex set.

  2. If Xopt is empty or a singleton, there is nothing more to prove.

  3. For contradiction, assume that there are two distinct points x,yXopt.

  4. We have p=f(x)=f(y).

  5. Let z=12x+12y.

  6. Thus, it is a convex combination of x and y.

  7. By convexity of Xopt, zXopt. Thus, f(z)=p.

  8. By strict convexity of f

    f(z)<12f(x)+12f(y)=p.
  9. This contradicts the fact that p is the optimal value for the minimization problem.

  10. Thus, Xopt must be either empty or a singleton.

  11. Thus, the minimization problem has at most one optimal point.

10.1.2. Concave Objective Functions#

Theorem 10.5 (Minimization of concave function and relative interior)

Let V be an n-dimensional real normed linear space. Let f:VR be a concave function with S=domf. Let CS be a convex set. Consider the problem of minimizing f over C. Let the optimal value be given by

p=inf{f(x)|xC}.

Let the set of optimal points for the minimization problem be given by

Xopt={xC|f(x)=p}.

If Xopt contains a relative interior point of C, then f must be constant over C; i.e.,

Xopt=C.

Proof. Assume that xriCXopt.

  1. Let yC be any vector distinct from x.

  2. Since xriC, hence due to Theorem 9.144, there exists s>1 such that z=x+(s1)(xy)C. z is a point behind x on the line from y to x which belongs to C.

  3. Letting t=1s we can rewrite it as

    x=tz+(1t)y.
  4. Thus, x is a convex combination of z and y.

  5. By concavity of f, we have

    f(x)tf(z)+(1t)f(y).
  6. Since x is an optimal point over C and z,yC, hence f(x)f(z) and f(x)f(y).

  7. Thus,

    f(x)tf(z)+(1t)f(y)tf(x)+(1t)f(x)=f(x).
  8. This can be true only if f(x)=tf(z)+(1t)f(y) which in turn implies that f(x)=f(z)=f(y).

  9. Thus, f must be constant over C.

Corollary 10.3 (Linear functions attain minimum only at boundary points)

Let V be an n-dimensional real normed linear space. Let f:VR be a nonzero linear functional. Let CV be a convex set. Then, f cannot attain a minimum at any relative interior point of C.

In other words, f can attain a minimum only at the relative boundary of C (if it does attain a minimum over C).

Proof. We note that every linear functional is also concave.

  1. Let xriC.

  2. Then, there exists r>0 such that B(x,r)affCC.

  3. Since f is linear, hence f cannot be constant over B(x,r)affC.

  4. Thus, by Theorem 10.5, it cannot attain a minimum at x.