10.12. Quadratic Programming#

10.12.1. Quadratic Functions#

Definition 10.41 (Quadratic function)

A function f:RnR of the form

f(x)=12xTAx+bTx+c

where ASn, bRn and cR, is known as a quadratic function.

The matrix A is known as the matrix associated with the quadratic function.

Remark 10.8 (Gradient and Hessian of a quadratic function)

Let

f(x)=12xTAx+bTx+c

be a quadratic function. Then, the gradient is given by:

f(x)=Ax+b.

And, the Hessian is given by:

2f(x)=A.

See Example 5.8 and Example 5.13 for reference.

10.12.1.1. Stationary Points#

Theorem 10.86 (Stationary points of quadratic functions)

Let a quadratic function f:RnR be given by

f(x)=12xTAx+bTx+c

where ASn, bRn and cR.

  1. xRn is a stationary point if and only if Ax=b.

  2. If AO, then x is a global minimum point of f if and only if Ax=b.

  3. If AsuccO, then x=A1b is a strict global minimum point of f.

  4. If AsuccO, then the minimum value of f is c12bTA1b.

Proof. (1) is a direct implication of the fact that f(x)=0 if and only if Ax+b=0.

(2) We are given that 2f(x)=AO.

  1. Thus, 2f(x)O for every xRn.

  2. By Theorem 8.4, if x is a stationary point of f, then it is a global minimum point.

  3. By first part, x is a stationary point if and only if Ax=b.

(3) We are given that AsuccO.

  1. Then, A is invertible.

  2. Hence, x=A1b is the unique solution to the equation Ax=b.

  3. By parts (1) and (2), it is the unique (hence strict) global minimizer of f.

(4) We know that strict global minimum point of f is given by a=A1b with Aa=b. Therefore,

f(a)=12aTAa+bTa+c=12aTb+bTa+c=c+12bTa=c12bTA1b.

10.12.1.2. Coerciveness#

Theorem 10.87 (Coerciveness of quadratic functions)

Let a quadratic function f:RnR be given by

f(x)=12xTAx+bTx+c

where ASn, bRn and cR.

f is coercive if and only if AsuccO; i.e., A is positive definite.

Proof. Assume that A is positive definite.

  1. Then, all eigenvalues of A are positive.

  2. Let λ be the smallest eigenvalue of A.

  3. Then, xTAxλx2 for every xRn.

  4. Thus,

    f(x)λ2x2+bTx+cλ2x2bx+c Cauchy Schwartz inequality =λ2x(x2λb)+c.
  5. We can see that f(x) as x.

  6. Thus, f is coercive.

Now, assume that f is coercive.

  1. We need to show that A must be positive definite.

  2. Thus, all eigenvalues of A must be positive.

  3. For contradiction, assume that an eigenvalue of A is negative.

  4. Let λ<0 be such an eigenvalue with the corresponding normalized eigenvector v such that Av=λv.

  5. Then, for any tR,

    f(tv)=t22vTAv+tbTv+c=λt22+tbTv+c.
  6. Clearly, f(tv) as t since λ is negative.

  7. Thus, it contradicts the hypothesis that f is coercive.

  8. We now consider the possibility where there is a 0 eigenvalue.

  9. Then, there exists a normalized eigenvector v such that Av=0.

  10. Then, for any tR,

    f(tv)=tbTv+c.
  11. If bTv=0, then f(tv)=c for every tR.

  12. If bTv>0, then f(tv) as t.

  13. If bTv<0, then f(tv) as t.

  14. In all the three cases, f(tv) does not go to as tv.

  15. Thus, f is not coercive. A contradiction to the hypothesis.

  16. Hence, the eigenvalues of A must be positive.

  17. Hence, A must be positive definite.

10.12.1.3. Nonnegative Quadratics#

It is useful to work with quadratic functions which are nonnegative on the entire Rn.

The basic quadratic form f(x)=12xTAx is nonnegative on entire Rn if A is positive semidefinite.

For the general quadratic function, we need to incorporate the contribution from b and c terms also.

Theorem 10.88 (Nonnegativity of quadratic function)

Let a quadratic function f:RnR be given by

f(x)=12xTAx+bTx+c

where ASn, bRn and cR.

The following statements are equivalent.

  1. f(x)0 for every xRn.

  2. [AbbT2c]O; i.e., this n+1×n+1 symmetric matrix is positive semidefinite.

Proof. Assume that (2) is true.

  1. Then, for every xRn

    [x1]T[AbbT2c][x1]0

    due to positive semidefiniteness.

  2. But

    [x1]T[AbbT2c][x1]=[xT1][Ax+bbTx+2c]=xTAx+2xTb+2c=2(12xTAx+bTx+c)=2f(x).
  3. Thus, f(x)0 for every xRn.

For the converse, assume (1) is true.

  1. We need to show that [AbbT2c] is positive semidefinite.

  2. We shall first show that A is positive semidefinite.

  3. For contradiction, assume that A is not positive semidefinite.

  4. Then, there exists a negative eigenvalue λ<0 and corresponding normalized eigenvector v for A such that Av=λv.

  5. Then, for any tR

    f(tv)=λt22+tbTv+c.
  6. Then, f(tv) as t.

  7. This contradicts the hypothesis that f is nonnegative everywhere.

  8. Thus, A must be positive semidefinite.

  9. We now need to show that for any yRn and any tR,

    [yt]T[AbbT2c][yt]0.
  10. This condition is equivalent to

    12yTAy+tbTy+ct20

    for every yRn and tR.

  11. If t=0, then this condition reduces to

    yTAy0yRn.
  12. This is valid for every yRn since A is p.s.d.. as established earlier.

  13. For t0, we have

    t2f(yt)=t2(12t2yTAy+1tbTy+c)=12yTAy+tbTy+ct2.
  14. By hypothesis, t2f(yt)0 for every yRn and t0.

  15. Thus, 12yTAy+tbTy+ct20 for every yRn and tR.

  16. Thus, [AbbT2c] is indeed p.s.d..

10.12.2. Quadratic Optimization Problems#

We consider the following possibilities:

  1. The objective function is a quadratic function.

  2. Both the objective function as well as the inequality constraints function are quadratic.

10.12.2.1. Quadratic Program#

Definition 10.42 (Quadratic program)

A convex optimization problem is known as a quadratic program (QP) if the objective function is a (convex) quadratic and the constraint functions are affine.

A general quadratic program has the following form:

(10.30)#minimize 12xTPx+qTx+rsubject to GxhAx=b

where

  1. xRn is the optimization variable.

  2. PS+n is a symmetric positive semidefinite matrix.

  3. qRn and rR.

  4. f(x)=12xTPx+qTx+r is a (convex) quadratic objective function.

  5. GRm×n and hRm describe the m affine inequality constraints.

  6. ARp×n and bRp describe the p affine equality constraints.

10.12.2.2. Quadratically Constrained Quadratic Program#

Definition 10.43 (Quadratically constrained quadratic program)

A convex optimization problem is known as a quadratically constrained quadratic program (QCQP) if the objective function and the inequality constraint functions are (convex) quadratic while the equality constraint functions are affine.

A general quadratic program has the following form:

(10.31)#minimize 12xTP0x+q0Tx+r0subject to 12xTPix+qiTx+ri0i=1,,mAx=b

where

  1. xRn is the optimization variable.

  2. PiS+n are symmetric positive semidefinite matrices for i=0,,m.

  3. qiRn and riR for i=0,,m.

  4. f0(x)=12xTP0x+q0Tx+r0 is a (convex) quadratic objective function.

  5. fi(x)=12xTPix+qiTx+ri are (convex) quadratic inequality constraint functions for i=1,,m.

  6. ARp×n and bRp describe the p affine equality constraints.