9.4. Cones#

Main references for this section are [6, 17, 67].

Throughout this section, V is a real vector space. Some material is specific to Rn. Rest of the material is applicable for any real vector space. Wherever necessary, V is endowed with a norm :VR or a real inner product ,:V×VR. It is also equipped with a metric d(x,y)=xy. The norm in general is not necessarily induced by the inner product (if the vector space is indeed endowed with an inner product).

We start with the basic definition of a cone.

9.4.1. Cones#

Definition 9.22 (Cone)

A set C is called a cone or nonnegative homogeneous, if for every xC and t0, we have txC.

In other words, a set is a cone if it is closed under nonnegative scalar multiplication.

  • By definition we have 0C.

  • Some authors ([67]) prefer to restrict the definition to t>0 thus the origin is not included in the cone by default.

  • In our definition, a cone always includes the origin.

  • Thus, a cone is always nonempty.

  • When we think of cones, ice-cream cones naturally come to mind. Hence, we kind of think that cones happen to be pointed (at origin) and emanate in one general direction from there. However, our definition above is more general. It allows for lines (passing through origin) and linear subspaces to be cones too.

  • Pointed cones are a special class of cones which are discussed below. An ice-cream cone is a pointed cone.

9.4.2. Properties of Cones#

9.4.2.1. Intersection#

Property 9.3 (Intersection of cones)

Let {Ci|iI} be a collection of cones. Then their intersection iICi is a cone.

Proof. Let xiICi and let t0.

  1. xCi for every iI.

  2. Hence txCi for every iI since every Ci is a cone.

  3. Hence txiICi.

  4. Hence iICi is a cone.

9.4.2.2. Cartesian Product#

Property 9.4 (Cartesian product of cones)

Let V and W be real vector spaces. Let C1V and C2W be cones. Then their Cartesian product C1×C2 is a cone.

Proof. Let xC1×C2 and t0.

  1. Then there exist yC1 and zC2 such that x=(y,z).

  2. Then tyC1 and tzC2 since both are cones.

  3. Hence (ty,tz)C1×C2.

  4. But (ty,tz)=t(y,z)=tx.

  5. Hence txC1×C2.

  6. Hence C1×C2 is a cone.

9.4.2.3. Set Addition#

Property 9.5 (Set addition of cones)

Let C1,C2V be cones. Then their vector sum C1+C2 is a cone.

Proof. Let xC1+C2 and t0.

  1. Then there exist x1C1 and x2C2 such that x=x1+x2.

  2. Then tx1C1 and tx2C2 since they are cones.

  3. Hence tx=t(x1+x2)=tx1+tx2C1+C2.

  4. Hence C1+C2 is a cone.

9.4.2.4. Closure#

Property 9.6 (Closure)

Let CV be a cone. Then its closure clC is a cone.

Proof. Let xclC and t0.

  1. Then there exists a sequence {xk} of C such that xkx.

  2. Then txkC for every k since C is a cone.

  3. Consider the sequence {txk} of C.

  4. We have limktxk=tx.

  5. Hence txclC.

  6. Hence clC is a cone.

9.4.2.5. Linear Transformation#

Property 9.7 (Linear transformation )

The image and inverse image of a cone under a linear transformation is a cone.

Proof. Let V and W be real vector spaces and T:VW be a linear transformation.

Image of a cone is a cone.

  1. Let CV.

  2. Let D=T(C).

  3. Let vD and t0.

  4. Then there exists uC such that v=T(u).

  5. Since C is a cone, hence tuC.

  6. Then T(tu)=tT(u)=tvD.

  7. Hence D is a cone.

Inverse image of a cone is a cone.

  1. Let DW.

  2. Let C=T1(D).

  3. Let xC and t0.

  4. Then y=T(x)D.

  5. Also T(tx)=tT(x)=tyD since D is a cone.

  6. Hence txC.

  7. Hence C is a cone.

9.4.3. Convex Cones#

Definition 9.23 (Convex cone)

A set C is called a convex cone if it is convex and a cone.

Example 9.11 (Convex cones)

  • A ray with its base at origin is a convex cone.

  • A line passing through origin is a convex cone.

  • A plane passing through origin is a convex cone.

  • Any subspace is a convex cone.

Theorem 9.45 (Subspace as convex cone)

A linear subspace is a convex cone.

Proof. Let VV be a subspace. We know that V is convex since V contains all its linear combinations and every convex combination is a linear combination. Now, let vV. Then, tvV for every t0 since V is closed under scalar multiplication. Thus, V is a cone too. Thus, V is a convex cone.

Theorem 9.46 (Half spaces as convex cone)

Let V be a real vector space. Let aV. Consider the set

H={xV|x,a0}.

Then H is a convex cone.

Proof. Half space as a cone

  1. Let xH and t0.

  2. Then tx,a=tx,a0.

  3. Hence txH.

Half space as convex

  1. Let x,yH and t[0,1].

  2. Then

tx+(1t)y,a=tx,a+(1t)y,a0.
  1. Hence H is convex.

9.4.3.1. Characterization#

Theorem 9.47 (Convex cone characterization)

A set is a convex cone if and only if it is closed under addition and nonnegative scalar multiplication.

In other words, C is a convex cone if and only if for every x1,x2C and t1,t20, the following holds true:

t1x1+t2x2C.

Proof. Let CV. Let x1,x2C.

If C is a convex cone, then:

  1. x=12x1+12x2C.

  2. But then, 2x=x1+x2C since C is a cone.

  3. Thus, C is closed under addition.

  4. C being a cone, it is closed under nonnegative scalar multiplication.

  5. Combining, we see that t1x1+t2x2C.

Now, assume that C is closed under addition and nonnegative scalar multiplication.

  1. C is a cone since it is closed under nonnegative scalar multiplication.

  2. In particular tx1C for all t[0,1] and (1t)x2C for all t[0,1].

  3. Since C is closed under addition, hence tx1+(1t)x2C for all t[0,1].

  4. Thus, C is convex too.

Theorem 9.48 (Set addition characterization)

A cone C is convex if and only if C+CC.

Proof. Assume that a cone C satisfies C+CC.

  1. Let x,yC and t[0,1].

  2. Since C is a cone, hence tx and (1t)y belong to C.

  3. Hence tx+(1t)yC+CC.

  4. Hence C is convex.

For converse, assume that C is a convex cone.

  1. Let x,yC.

  2. Since C is a cone, hence 2x,2yC.

  3. Since C is convex, hence

    122x+122y=x+yC.
  4. Hence C+CC.

9.4.3.2. Intersection of Convex Cones#

Theorem 9.49 (Intersection of arbitrary collection of convex cones)

Let {Ai}iI be a family of sets such that Ai is a convex cone for all iI. Then iIAi is a convex cone.

Proof. Let x1,x2 be any two arbitrary elements in A=iIAi.

x1,x2Ax1,x2AiiIt1x1+t2x2Ait1,t20iI since Ai is a convex conet1x1+t2x2A.

Hence A is a convex cone.

As a consequence, an arbitrary intersection of half-spaces (at origin) and hyperplanes (at origin) is a convex cone. Thus, the solution set of a system of linear equations and inequalities is a convex cone if the equations and inequalities are homogeneous.

9.4.3.3. Containing and Contained Subspaces#

Theorem 9.50 (Containing and contained subspaces)

Let C be a convex cone. Then, there is a smallest linear subspace containing C given by:

UCC={xy|x,yC}=affC.

And, there is a largest linear subspace contained in C given by

L(C)C.

Proof. We show that U=CC is a subspace containing C.

Since 0C, hence 0=00U. Also, this means that C0=CU.

Let u,vU.

  1. u=xy for some x,yC.

  2. v=zw for some z,wC.

  3. u+v=(x+z)(y+w).

  4. Since C is a convex cone, it is closed under addition.

  5. Hence, x+z,y+wC.

  6. Hence, u+vU (as it is a difference of two vectors in C).

  7. Thus, U is closed under vector addition.

Let tR, uU.

  1. u=xy for some x,yC.

  2. tu=txty.

  3. If t0, then tx,tyC, thus tuU.

  4. If t<0, then tx,tyC. We can write tu=(ty)(tx). Thus, tuU.

  5. Thus, U is closed under scalar multiplication.

Next, we show that it is the smallest subspace containing C.

  1. Let V be a subspace that contains C

  2. Then V contains C also.

  3. Then, V contains C+(C)=CC=U also.

  4. Thus, U is the smallest subspace containing C.

In summary, C is closed under addition and nonnegative scalar multiplication. C contains 0. To be a subspace, a set must be closed under multiplication by 1 too. CC is the smallest such subspace.

Since C contains 0, hence its affine hull contains 0 too. Thus, its affine hull must be a linear subspace. The affine hull then is the smallest subspace containing C. Thus,

affC=CC.

We show that L=(C)C is a subspace contained in C.

[zero vector]

  1. Since 0C and 0C, hence 0L.

[Scalar multiplication]

  1. Let a nonzero xL. Then, xC and xC.

  2. xCxC.

  3. xCxC.

  4. This means that xC and xC too.

  5. Thus, xL.

  6. In summary xL implies x,xC and x,xC.

  7. Then, for any t0

    1. xCtxC.

    2. xCtxC.

    3. txCtxC.

    4. Thus, txL.

  8. And, for any t<0

    1. xCtxC.

    2. xCtxC.

    3. txCtxC.

    4. Thus, txL.

  9. Thus, L is closed under scalar multiplication.

[Vector addition]

  1. Let u,vL.

  2. Then, u,vC and u,vC.

  3. Thus, u,vC.

  4. Since C is closed under addition, hence u+vC and uvC.

  5. But then, uvC and u+vC.

  6. Thus, u+vL. uvL too.

  7. Thus, L is closed under addition.

Thus, L is a vector space contained in C. L is the largest such subspace since any subspace contained in C should be contained in C too.

9.4.4. Conic Combinations#

Definition 9.24 (Conic combination)

A point of the form t1x1++tkxk with t1,,tk0 is called a conic combination (or a non-negative linear combination) of x1,,xk.

  • A convex cone is closed under non-negative linear/conic combinations.

  • One way to prove that a set is a convex cone is to show that it contains all its conic combinations.

Theorem 9.51 (Convex cone characterization with conic combinations)

Let C be a convex cone. Then for every x1,,xkC, every conic combination t1x1++tkxk with ti0 belongs to C.

Conversely, if a set C contains all conic combinations of its points, then it is a convex cone.

In other words, C is a convex cone if and only if it is closed under conic combinations.

Proof. Assume that C is a convex cone. Then it is closed under addition and nonnegative scalar multiplication.

  1. Let x1,,xkC.

  2. Then, t1x1,,tkxkC for all t1,,tk0 since C is closed under nonnegative scalar multiplication.

  3. Then, t1x1++tkxkC since C is closed under addition.

  4. Thus, C contains all its conic combinations.

For the converse, assume that C contains all its conic combinations.

  1. Let xC.

  2. Then, txC for all t0 since tx is a conic combination.

  3. Thus, C is a cone.

  4. Now, let x,yC and t[0,1]. Then, 1t[0,1] too.

  5. Thus, tx+(1t)y is a conic combination of x,y.

  6. Hence, tx+(1t)yC.

  7. Thus, C is convex.

  8. Combining, C is a convex cone.

Here is another proof that a linear subspace is a convex cone using the idea of conic combinations.

  1. Every subspace contains the 0 vector.

  2. Every conic combination is also a linear combination.

  3. A subspace is closed under linear combinations.

  4. Hence, it is also closed under conic combinations.

  5. Hence, it is a convex cone.

Remark 9.5 (Conic combinations and nonnegative orthant)

We recall that the nonnegative orthant of Rk is given by

R+k={tRk|t0}={tRk|t1,,tk0}.

Thus, the coefficients for conic combinations of k points are drawn from R+k.

Theorem 9.52

A conic combination of conic combinations is a conic combination.

Proof. Let SV. Note that S is arbitrary (no convexity or conic structure assumed).

  1. Consider n points yi, i=1,,n described as below.

  2. Let yi=j=1mjti,jxi,j be conic combinations of mj points:

    • xi,1,,xi,mjS.

    • ti,j0.

  3. Consider the conic combination y=i=1nriyi. with ri0.

  4. We need to show that y is a conic combination of points of S.

Towards this:

y=i=1nriyi=i=1nrij=1mjti,jxi,j=i=1nj=1mjriti,jxi,j.

Consider the terms:

si,j=riti,j.

Since ri0 and ti,j0, hence si,j0. Hence,

y=i,jsi,jxi,j

is a conic combination of points of S.

The idea of conic combinations can be generalized to infinite sums and integrals.

9.4.5. Conic Hulls#

Definition 9.25 (Conic hull)

The conic hull of a set S is the set of all conic combinations of points in S. i.e.

cone(S){t1x1+tkxk|xiS,tR+k,i=1,,k,kN}.

Property 9.8

A conic hull is a convex cone.

Proof. Let C be the conic hull of a set S.

  1. Let x,yC.

  2. Then, x,y are conic combinations of S.

  3. Let z=t1x+t2y with t1,t20.

  4. Then, z is a conic combination of conic combinations of S.

  5. By Theorem 9.52, z is a conic combination of S.

  6. Since C contains all conic combinations of S, hence C contains z.

  7. Thus, for any x,yC, z=t1x+t2y with t1,t20 is in C.

  8. Thus, C is a convex cone.

Property 9.9

Conic hull of a set is the smallest convex cone that contains the set.

Proof. Let S be an arbitrary set and C be its conic hull.

  1. We have already shown that C is a convex cone.

  2. Assume D to be a convex cone such that SD.

  3. Then, D contains every conic combination of S since a convex cone is closed under conic combinations.

  4. Thus, CD since C is the set of all conic combinations of S.

Theorem 9.53 (Conic hull of a convex set)

Let C be a convex set. Let K be defined as:

K{tx|t0,xC}.

Then, K is the conic hull of C.

Proof. Let H be the conic hull of C; i.e., H is the set of all conic combinations of C. We show that KH and HK.

KH

  1. For any xC and t0, tx is a conic combination of C.

  2. Hence txH.

  3. Thus, KH.

HK

  1. Let x=t1x1++tkxk be a conic combination of C.

  2. Thus, ti0 and xiC.

  3. By definition of K, 0K.

  4. If ti=0 for 0ik, then x=0. So xK.

  5. Now consider the case where at least one ti>0.

  6. Let t=ti. Clearly, t>0.

  7. Consider z=t1tx1++tktxk.

  8. Note that z is a convex combination of x1,,bxkC.

  9. Since C is convex, hence zC.

  10. Then, bx=tzS since t>0 and zC.

  11. Thus, K contains all conic combinations of C.

  12. Thus, HK.

Property 9.10 (Conic hull of a convex hull)

Let SV be a nonempty set. Let C=convS. Then

coneS=coneC.

In other words, the conic hull of the convex hull of a set is same as the conic hull of the set.

Proof. Since SC, hence coneSconeC. We now prove the converse.

  1. Let xconeC.

  2. Then x is a nonnegative combination of some vectors in C. There exist a positive integer p, p vectors x1,,xpC and p scalars t1,,tp such that

    x=i=1ptixi.
  3. Each xi is a convex combination of some vectors in S.

  4. Hence x is a nonnegative combination of some vectors in S.

  5. Hence xconeS.

  6. Hence coneCconeS.

9.4.5.1. Unique Conic Representations#

Recall from Carathéodory theorem that in an n dimensional vector space, every point in the convex hull of a set can be represented as a convex combination of n+1 points belonging to the set.

Similar representation is possible in conic hulls too.

Theorem 9.54 (Conic representation theorem)

Let V be an n-dimensional real vector space. Let SV. Let xcone(S). Then, there exist k linearly independent vectors x1,,xkS such that xcone({x1,,xk}); i.e., there exists tR+k such that

x=i=1ktixi.

Since the k vectors are linearly independent, hence kn.

Proof. The proof is similar to Carathéodory theorem.

  1. Let xcone(S).

  2. Then, there exist m points x1,,xmS and tR+m such that

    x=i=1mtixi.
  3. We can assume that ti>0 for every i1,,m. Otherwise, we can simply drop the corresponding points from the conic combination.

  4. If x1,,xm are linearly independent, then k=m, mn and we are done.

  5. Let us consider the case when they are linearly dependent.

  6. Then, there exists a nontrivial linear combination equally zero vector:

    r1x1++rmxm=0.
  7. Then, for any αR

    x=i=1mtixi+αi=1mrixi=i=1m(ti+αri)xi.
  8. This representation is a conic combination if ti+αri0 for every i=1,,m.

  9. Since ti>0 for every i, hence, this set of inequalities is satisfied for all αA where A is a closed interval with a nonempty interior.

    1. Let Ai be the solution set for i-th inequality.

    2. We have A=iAi.

    3. α=0 satisfies every inequality. Thus, 0Ai. Thus, A.

    4. If ri=0, then Ai=R.

    5. If ri>0, then Ai=[tiri,).

    6. If ri<0, then Ai=(,tiri].

    7. Since not all ri=0, hence there are several Ai with finite endpoints (either left or right).

    8. Thus, there are three possibilities for A: [a,b], or [a,) or (,b].

    9. Both a and b correspond to an endpoint of one of the Ai.

  10. If we pick α as one of the endpoints of A, then, ti+αri0 for every i and tj+αrj=0 for some j1,,m.

  11. Thus, we obtain a conic representation of x of at most m1 vectors.

  12. This process can be carried out repeatedly until we obtain a conic representation of x of k linearly independent vectors.

  13. Since the k vectors x1,,xk so obtained are linearly independent, hence kn.

9.4.6. Pointed Cones#

Definition 9.26 (Pointed cone)

A cone CV is called pointed if xC and xC implies x=0.

In other words, a pointed cone, doesn’t contain a line.

Example 9.12 (The nonnegative orthant is a pointed convex cone)

Recall from Definition 9.14 that the nonnegative orthant is defined as:

R+n={xRn|xi0,1in}.

In other words, for xR+n, every component is non-negative.

Let x,yR+n. Let α,β0 and consider their conic combination

z=αx+βy.

It is obvious that all components of z are also nonnegative. Hence zR+n. Thus, R+n is closed under conic combinations. Hence, R+n is a convex cone.

Finally, R+n is pointed as xR+n and xR+n both hold true only if x=0.

9.4.7. Proper Cones#

Definition 9.27 (Proper cone)

A cone KV is called a proper cone if it satisfies the following:

  • K is convex.

  • K is closed.

  • K is solid; i.e., it has a nonempty interior.

  • K is pointed.

Example 9.13 (Non-empty interior)

Consider the following sets in R2:

C1={(x1,x2)|x10,x2=0}
C2={(x1,x2)|x1,x20}

Both are closed convex cones. C1 doesn’t have an interior. All points in C1 are on the boundary of C1.

C2 has a non-empty interior; e.g., the point (1,1)C2 is not on the boundary.

9.4.8. Norm Cones#

Definition 9.28 (Norm cone)

Let :VR be any norm on V. The norm cone associated with the norm is given by the set

C{(x,t)|xt}

C lies in the product space V×R.

If V=Rn, then a norm cone belongs to Rn+1.

Theorem 9.55

A norm cone is convex. Moreover, it is a convex cone.

Example 9.14 (Second order cone)

The second order cone is the norm cone for the Euclidean norm in the Euclidean space Rn, i.e.

C={(x,t)|x2t}.

From definition, CRn+1.

This can be rewritten as

C={[xt]|[xt]T[I001][xt]0,t0}

9.4.9. Barrier Cones#

Definition 9.29 (Barrier vector)

Let C be a convex set of V. A vector vV is called a barrier vector to C if for some βR,

x,vβxC.

In other words, the set of inner products of points in C with v is bounded from above.

Definition 9.30 (Barrier cone)

The set of all barrier vectors to a convex set C is called its barrier cone.

Theorem 9.56

The barrier cone of a convex set is convex.

Proof. Let C be a convex set and let B be its barrier cone. Let u,vB. Let t0.

x,0=00xC.

Thus, 0B.

x,uαx,tutαxC.

Thus, the set of inner products with tu is bounded from above by tα. Thus, tuB.

x,uα and x,vβx,u+vα+βxC.

Thus, the set of inner products with u+v is bounded from above by α+β. Thus, u+vB.

Thus, B is closed under nonnegative scalar multiplication and vector addition. B is a convex cone.

9.4.10. Properties of Convex Cones#

We consider operations on convex cones which generate convex cones.

Property 9.11 (Closure under set intersection)

If K1 and K2 are convex cones, then K=K1K2 is convex cone.

Proof. We show that K is closed under nonnegative scalar multiplication.

  1. Let xK and t0.

  2. Then, xK1 and xK2.

  3. Hence, txK1 and txK2 since both are closed under nonnegative scalar multiplication.

  4. Thus, txK.

  5. Hence, K is closed under nonnegative scalar multiplication.

We show that K is closed under vector addition too.

  1. Let x,yK.

  2. Then, x,yK1 and x,yK2.

  3. But then, x+yK1 and x+yK2 since both are closed under vector addition.

  4. Thus, x+yK.

  5. Hence, K is closed under vector addition.

Thus, K is closed under nonnegative scalar multiplication and vector addition. K is a convex cone.

Property 9.12 (Closure under set addition)

If K1 and K2 are convex cones, then K=K1+K2 is convex cone.

Proof. We show that K is closed under nonnegative scalar multiplication.

  1. Let xK and t0.

  2. Then, x=x1+x2 where x1K1, and x2K2.

  3. Then, tx1K1 and tx2K2 since K1 and K2 are cone.

  4. Then, tx=t(x1+x2)=tx1+tx2K.

  5. Thus, K is closed under nonnegative scalar multiplication.

We show that K is closed under vector addition too.

  1. Let x,yK.

  2. Then, x=x1+x2 with some x1K1 and x2K2.

  3. And, y=y1+y2 with some y1K1 and y2K2.

  4. Then, x+y=(x1+y1)+(x2+y2).

  5. Now, x1+y1K1 and x2+y2K2 since K1 and K2 are closed under addition (they are convex cones).

  6. Thus, x+yK.

  7. Thus, K is closed under vector addition.

Thus, K is closed under nonnegative scalar multiplication and vector addition. K is a convex cone.

We mention that by Theorem 9.34, K is convex. Hence, we just needed to show that K is a cone too.

Property 9.13 (Positive scalar multiplication)

Let K be a convex cone. Then

tK=Kt>0.

Proof. We proceed as follows:

  1. Let t>0.

  2. Let xtK.

  3. Then, 1txK.

  4. But then, t1tx=xK too since K is closed under nonnegative scalar multiplication.

  5. Thus, tKK.

  6. Similarly, xK implies xtK.

  7. Thus, KtK.

  8. Hence, K=tK for all t>0.

Note that 0K={0}K.

Property 9.14 (Convex hull of the union)

If K1 and K2 are convex cones, then

K1+K2=conv(K1K2).

Proof. By Corollary 9.3,

conv(K1K2)=t[0,1][(1t)K1+tK2].

Now for t(0,1), by Property 9.13, (1t)K1=K1 and tK2=K2.

Thus, for t(0,1):

(1t)K1+tK2=K1+K2.

For, t=0 we are left with K1 and for t=1, we are left with K2.

Since 0K1 and 0K2, hence K1K1+K2 and K2K1+K2. Thus,

conv(K1K2)=t[0,1][(1t)K1+tK2]=K1+K2.

Property 9.15 (Intersection as union)

If C1 and C2 are convex cones, then

C1C2=t[0,1](tC1(1t)C2).

Proof. We first show that for every t(0,1)

tC1(1t)C2=C1C2.
  1. Let xC1C2.

  2. Then xC1 and xC2.

  3. Since 0<t<1, hence due to Property 9.13, C1=tC1 and C2=(1t)C2.

  4. Hence xtC1 and x(1t)C2.

  5. Hence xtC1(1t)C2.

  6. Hence C1C2tC1(1t)C2.

  7. Conversely, let xtC1(1t)C2.

  8. Then xtC1 and x(1t)C2.

  9. Hence xC1 and xC2.

  10. Hence xC1C2.

  11. Hence tC1(1t)C2C1C2.

If t=0 or t=1, then

tC1(1t)C2={0}C1C2

since every convex cone contains the origin. Together,

C1C2=t[0,1](tC1(1t)C2).

9.4.11. Cone Generated by a Convex Set#

The direct sum vector space VR has been described in Definition 9.4.

Observation 9.3 (Convex sets as cross sections of cones)

A convex set CV can be regarded as a cross section of some convex cone KVR.

Let K be conic hull of points (x,1)VR such that xC. Then,

K={(tx,t)VR|xC,t0}.

Now consider the hyperplane in VR given by:

H={(y,t)VR|t=1}.

The intersection of H with K can be regarded as C.

HK={(tx,t)VR|xC,t=1}={(x,1)VR|xC}.

The projection of HK on V is given by C (by dropping the last coordinate).

Remark 9.6

For every convex set CV, there is precisely one convex cone KVR generated by the set {(x,1)VR|xC} (its conic hull).

These convex cones have only (0,0) in common with the half space {(x,t)VR|t0}.

We shall call this class of convex cones in VR generated by the convex sets in V as C.

An operation that is closed under the class C corresponds to an operation on the convex sets in V; e.g., if C1 and C2 are convex sets with corresponding cones K1 and K2, then C1C2 is another convex set corresponding to a different convex cone K3. It is natural to ask if there is a way to construct K3 from K1 and K2 directly in VR.

Each vector (x,t)VR can be split as a direct sum with xV and tR. Thus, it is possible to define different kinds of partial sums on VR. Recall that partial sums on convex sets preserve the convexity. It turns out that they can do more. We can define partial sums which are closed under the family C of convex cones in VR generated by the convex sets in V.

We can define four types of partial sums:

  1. Addition in V, intersection in R.

  2. Addition in V, addition in R.

  3. Intersection in V, intersection in R.

  4. Intersection in V, addition in R.

Suppose that K1 and K2 are convex cones generated by the convex sets C1 and C2 respectively. Let K be their partial sum. Let us find out what is the corresponding convex set C in V based on the type of partial sum in VR.

[Addition in V, intersection in R.]

  1. In this case, (x,1)K if and only if x=x1+x2 for some (x1,1)K1 and (x2,1)K2.

  2. Thus, the convex set corresponding to K is C=C1+C2.

[Addition in V, addition in R.]

  1. (x,1)K if and only if x=x1+x2 and 1=t1+t2 with t10 and t20 for some (x1,t1)K1 and (x2,t2)K2.

  2. Thus, C is the union of the sets t1C1+t2C2 over t10, t20 and t1+t2=1.

  3. But, this is same as C=conv(C1C2) as per Theorem 9.38.

[TODO] Clarify this further. It is not obvious.

[Intersection in V, intersection in R]

  1. (x,1)K if and only if (x,1)K1 as well as (x,1)K2.

  2. Thus, C=C1C2.

[Intersection in V, addition in R]

  1. (x,1)K if and only if (x,t1)K1 and (x,t2)K2 for some t1,t20 with t1+t2=1.

  2. In this case, we can write C as:

    C={t1C1t2C2|t1,t20,t1+t2=1}={(1t)C1tC2|t[0,1]}.

[TODO] Clarify this further. It is not obvious.

9.4.12. Positive Semidefinite Cone#

Theorem 9.57 (The convex cone of positive semidefinite matrices)

The set of positive semidefinite matrices S+n is a convex cone.

Proof. Let A,BS+n and θ1,θ20. We have to show that θ1A+θ2BS+n.

AS+nvTAv0vRn.
BS+nvTBv0vRn.

Now

vT(θ1A+θ2B)v=θ1vTAv+θ2vTBv0vRn.

Hence θ1A+θ2BS+n.

9.4.13. Linear System with Nonnegativity Constraints#

Consider the system

P={xRn|Ax=b,x0}

where ARm×n and bRm. Without loss of generality, we shall assume that the rows of A are linearly independent. This is a linear system Ax=b with the nonnegativity constraint x0. If we write A in the form of column vectors as

A=[a1an]

Then, the set Q={Ax|xR+n} can be written as

Q=cone{a1,an}.

In other words, Q is the conic hull of the column vectors of A. We can think of A as a linear mapping of the nonnegative orthant (a convex cone) R+n from Rn to another convex cone in Rn given as a conic hull of the columns of A.

We can now see that P is nonempty if bQ.

Definition 9.31 (Basic feasible solution)

Let P={xRn|Ax=b,x0} where ARm×n and bRm. Assume that the rows of A are linearly independent. Then, vRn is a basic feasible solution (in short “bfs”) of P if the columns of A corresponding to the positive entries of v are linearly independent.

Consequently, v has at most m has positive entries. All other entries of v are 0.

Theorem 9.58 (Existence of basic feasible solution)

Let P={xRn|Ax=b,x0} where ARm×n and bRm. Assume that the rows of A are linearly independent.

If P is nonempty; i.e. P, then it contains at least one basic feasible solution.

Proof. Recall that

Q={Ax|xR+n}=cone{a1,an}

where a1,,an are columns of the matrix A.

  1. If P, then bQ.

  2. In other words, b is a conic combination of columns of A.

  3. By the conic representation theorem, there exists a subset of k linearly independent vectors among {a1,,an} such that b is their conic combination.

  4. In other words, there exist k indices 1i1<<ikn and k numbers vi1,,vik>0 such that

    b=j=1kvijaij

    and {ai1,,aik} are linearly independent.

  5. Consequently km since columns of A belong to Rm.

  6. Let

    v=j=1kvijeij

    where eij are unit vectors of Rn.

  7. Clearly, v0 and Av=b.

  8. Therefore, vP and v is a basic feasible solution.

The basic feasible solutions of P are the extreme points of P. Recall that a point is an extreme point if it cannot be expressed as a nontrivial convex combination of two distinct points of a set.

Theorem 9.59 (Equivalence between basic feasible solutions and extreme points)

Let P={xRn|Ax=b,x0} where ARm×n and bRm. Assume that the rows of A are linearly independent.

Then v is a basic feasible solution of P if and only if v is an extreme point of P.

Proof. Let v be a basic feasible solution of P.

  1. Then b=Av and v has k positive entries with km.

  2. Without loss of generality, assume that first k entries of v are positive. This can be easily achieved by shuffling the columns of A in the linear system Ax=b.

  3. Therefore, v1,,vk>0 and vk+1,,vn=0.

  4. Also, the first k columns a1,,ak of the matrix A are linearly independent since v is a basic feasible solution.

  5. For contradiction, assume that v is not an extreme point of P; i.e., vextP.

  6. Then, there exist y,zP with yz and t(0,1) such that v=ty+(1t)z.

  7. Since y,zP, hence y0 and z0.

  8. Since the last nk entries of v are zero, hence the last nk entries of y and z also must be zero as they have to be nonnegative.

  9. Since y,zP, hence Ay=b and Az=b.

  10. Therefore,

    b=i=1kyiai=i=1kziai.
  11. This implies that

    i=1k(yizi)ai=0.
  12. But, a1,,ak are linearly independent by hypothesis.

  13. Thus, yi=zi for i=1,,k must hold.

  14. Then, y=z.

  15. We arrive at a contradiction.

  16. Thus, v must be an extreme point of P.

For the converse, assume that v is an extreme point of P.

  1. Again, by contradiction, assume that v is not a basic feasible solution.

  2. Thus, the columns of A corresponding to the positive entries of v are linearly dependent.

  3. Assume that there are k positive entries in v and WLOG, assume that they correspond to first k columns of A.

  4. Then, since the corresponding columns are linearly dependent, hence there exists a nonzero vector tRk such that

    i=1ktiai=0.
  5. We can extend t to Rn by appending nk zeros such that At=0.

  6. Since the first k entries of v are positive, we can choose a sufficiently small r>0 such that y=vrt0 and z=v+rt0.

  7. Note that Ay=Az=b.

  8. Therefore, y,zP.

  9. At the same time, it is easy to see that

    v=12y+12z.
  10. Thus, v is a convex combination of two distinct points of P.

  11. This contradicts our hypothesis that v is an extreme point of P.

  12. Thus, v must be a basic feasible solution.