9.19. Infimal Convolution#

Note

Some results in this section appear without complete proof at the moment. They have been collected from various sources. The proofs will be added later.

9.19.1. Definition#

Definition 9.82 (Infimal convolution)

Let V be a real vector space. Let f,g:VR be extended real-valued functions. Their infimal convolution, denoted by fg:VR, is defined as

(9.11)#(fg)(x)inf{f(y)+g(z)|y,zV,y+z=x}.

The infimal convolution of f with g is called exact at x if the infimum in (9.11) is attained at some y,zV such that y+z=x.

The infimal convolution is called exact if it is attained at every xX.

Remark 9.17 (Alternative definitions)

It is easy to see that

(fg)(x)=inf{f(y)+g(z)|y,zV,y+z=x,f(y)<,g(z)<}.

Another formulation is

(fg)(x)=inf{f(y)+g(xy)|yV}.

This definition is reminiscent of the convolution operation in signal processing. The word infimal reminds of the infimum appearing in the definition. This is the motivation for the name infimal convolution.

Infimal convolution is also known as epigraphical addition because performing an infimal convolution of f and g amounts to the adding of the strict epigraphs of f and g. Recall from Definition 2.52 that the strict epigraph of a function f:VR is given by

episf{(x,t)VR|f(x)<t}.

We show in Property 9.42 that episfg=episf+episg.

From the definition, it is clear that fg is the largest of all the functions h:VR satisfying

h(y+z)f(y)+g(z)y,zX.

Example 9.87 (Cost minimization)

Consider a situation where a person sources a certain product from two different manufactures.

  1. The person needs to acquire x total quantity.

  2. The total supply is given by the equation x=y+z where y is the quantity sourced from manufacturer A and z is the quantity sourced from manufacture B.

  3. Suppose the total cost of acquisition from manufacture A is given by a function f(y). It may not be linear as the manufacturer may have different levels of discounts as y increases.

  4. Similarly, the cost of acquisition from manufacture B is given by the function g(z).

  5. Then the total cost is given by f(y)+g(z) subject to the condition that y+z=x.

  6. Then the cost minimization problem is given by

    h(x)=inf{f(y)+g(z)|x=y+z}=(fg)(x).
  7. We will also be interested in the set of solutions to the infimal problem; i.e.,

    {(y,z)|f(y)+g(z)=(fg)(x)}.

9.19.2. Basic Properties#

9.19.2.1. Domain#

Property 9.40 (Domain of infimal convolution)

Let f,g:VR be given extended valued functions. Then

domfg=domf+domg.

Proof. Let zdomf+domg.

  1. Then there exist xdomf and ydomg such that z=x+y.

  2. Then f(x)< and g(y)<.

  3. By definition of infimal convolution

    (fg)(z)f(x)+g(y)<.
  4. Hence zdomfg.

  5. Hence domf+domgdomfg.

For the converse, let zdomfg.

  1. Then (fg)(z)<.

  2. First consider the case where (fg)(z)=tR.

  3. Then for any ϵ>0, there exist x,y with x+y=z such that

    tf(x)+g(y)t+ϵ.
  4. Since f(x)+g(y) is finite, hence f(x)< and g(y)<.

  5. Hence x<domf and ydomg.

  6. Thus z=x+ydomf+domg.

  7. Now consider the case where (fg)(z)=.

  8. Then for every MR, there exist x,y with x+y=z such that

    f(x)+g(y)M.
  9. Thus f(x)< and g(y)< for such pairs of x,y.

  10. Consequently z=x+ydomf+domg.

  11. Combining the two cases domfgdomf+domg.

Finally

  1. We have already shown that domf+domgdomfg.

  2. Thus domf+domg=domfg.

9.19.2.2. Epigraph#

Property 9.41 (Connection with epigraph)

(fg)(x)=inf{rR|(x,r)epif+epig}.

Proof. Let t=(fg)(x). Define S={rR|(x,r)epif+epig}. We need to show that t is the greatest lower bound of the set S.

  1. Let (x,r)epif+epig.

  2. Then there exists (y,r1)epif and (z,r2)epig so that x=y+z and r=r1+r2.

  3. Thus f(y)+g(z)r1+r2=r (by epigraphs).

  4. By infimal convolution definition tf(y)+g(z)r.

  5. Thus we have shown that tr if (x,r)epif+epig.

  6. Thus t is a lower bound of the set S.

  7. Now choose any ϵ>0.

  8. Then there exists ydomf and zdomg with x=y+z such that

    f(y)+g(z)t+ϵ.
  9. Note that (y,f(y))epif and (z,g(z))epig.

  10. Hence (y+z,f(y)+f(z))epif+epig.

  11. Consequently (y+z,t+ϵ)epif+epig.

  12. Replacing x=y+z, we have (x,t+ϵ)epif+epig.

  13. Thus, for every ϵ>0, t+ϵS.

  14. Hence t must be the greatest lower bound of the set S.

  15. In other words, t=infS as desired.

Property 9.42 (Strict epigraph of infimal convolution)

Let f,g:VR be given extended valued functions. Then

episfg=episf+episg.

Proof. We show the set inclusion from both sides.

  1. Let (x,r)episfg.

  2. Then (fg)(x)<r.

  3. Then there exist ydomf and zdomg with x=y+z such that

    f(y)+g(z)<r.

    This is directly from the definition of infimal convolution.

  4. Consequently, we can split r as r=r1+r2 such that f(y)<r1 and g(z)<r2.

    1. Let s=f(y), t=g(z).

    2. Let d=r(s+t)>0.

    3. Let e=d2>0.

    4. Let r1=s+e and r2=t+e.

    5. Then r1+r2=s+t+2e=s+t+d=r.

    6. Also f(y)=s<r1 and g(z)=t<r2.

  5. Then (y,r1)episf and (z,r2)episg by definition of strict epigraphs.

  6. Hence (y+z,r1+r2)=(x,r)episf+episg.

  7. Thus episfgepisf+episg

For the converse

  1. Let (x,r)episf+episg.

  2. Then there is (y,r1)episf and (z,r2)episg so that

    x=y+z and r=r1+r2.
  3. Thus f(y)<r1 and g(z)<r2.

  4. Thus f(y)+g(z)<r1+r2=r.

  5. Hence (fg)(x)f(y)+g(z)<r.

  6. Hence (x,r)episfg.

  7. Hence episf+episgepisfg.

Property 9.43 (Epigraph of infimal convolution)

Let f,g:VR be given functions. Then

epif+epigepifg.

The equality holds if and only if the infimal convolution is exact at each xV where (fg)(x)R.

Proof. We first show the inclusion.

  1. Let (x,r)epif+epig.

  2. Then there is (y,r1)epif and (z,r2)epig so that

    x=y+z and r=r1+r2.
  3. Thus f(y)r1 and g(z)r2.

  4. Thus f(y)+g(z)r1+r2=r.

  5. Hence (fg)(x)f(y)+g(z)r.

  6. Hence (x,r)epifg.

  7. Hence epif+epigepifg.

Next we show that the reverse inclusion holds if infimal convolution is exact at each xV where (fg)(x)R.

  1. Let (x,r)epifg.

  2. Then (fg)(x)r.

  3. Consider first the case where (fg)(x)=.

  4. Then for any MR, there exist ydomf and zdomg with x=y+z such that f(y)+f(z)<M.

  5. In particular, there exist y,z so that f(y)+f(z)<r.

  6. Hence (x,r)epif+epig.

  7. Now if (fg)(x)R then the infimal convolution is exact.

  8. Since the infimal convolution is exact, hence there exist ydomf and zdomg with x=y+z such that

    f(y)+g(z)=(fg)(x)r.

    This is directly from the definition of infimal convolution.

  9. Consequently, we can split r as r=r1+r2 such that f(y)r1 and g(z)r2.

  10. Then (y,r1)epif and (z,r2)epig by definition of epigraphs.

  11. Hence (y+z,r1+r2)=(x,r)epif+epig.

  12. Thus episfgepif+epig.

Finally, to show that this condition is necessary, assume that there exists a point xV where (fg)(x)R and the infimal convolution is not exact.

  1. Let t=(fg)(x).

  2. Then (x,t)epifg.

  3. Since the infimal convolution is not exact, hence for any ydomf and zdomg with x=y+z

    t<f(y)+g(z).
  4. For contradiction, assume that (x,t)epif+epig.

  5. Then there exist (y,t1)epif and (z,t2)epig so that x=y+z and t=t1+t2.

  6. Thus f(y)t1 and g(z)t2.

  7. Thus f(y)+g(z)t1+t2=t.

  8. We have arrived at a contradiction.

  9. Hence (x,t)epif+epig.

9.19.2.3. Convexity#

Theorem 9.274 (Infimal convolution of convex functions)

Let f:V(,] be a proper convex function and g:VR be a real valued convex function. Then fg is convex.

Proof. Define h(x,y)=f(y)+g(xy).

  1. Since f and g are convex, hence h is convex.

  2. We can easily show that for any xV, there exists yV such that h(x,y)<.

    1. Pick any xV.

    2. Choose any ydomf.

    3. Then f(y)<.

    4. Since g is real valued, hence g(xy)<.

    5. Thus h(x,y)=f(y)+g(xy)<.

  3. Then due to Theorem 9.118

    (fg)(x)=infyVh(x,y)=infyV[f(y)+g(xy)]

    is convex as a partial minimization of the function h(x,by) w.r.t. y.

It is still possible that fg is not a proper function and may be equal to at some x.

Example 9.88 (Set distance function as infimal convolution)

Let CV be a nonempty convex set.

  1. The indicator function IC is a proper convex function.

  2. The norm function is a real valued convex function.

  3. By Theorem 9.274, IC is convex.

  4. But then

    (IC)(x)=infyV(IC(y)+xy)=infyCxy=dC(x).
  5. Thus dC (the distance function to a convex set) is convex.

9.19.3. Conjugates#

Theorem 9.275 (Conjugate of infimal convolution of proper functions)

For two proper functions h1,h2:V(,], it holds that:

(h1h2)=h1+h2.

Proof. Pick yV. Then

(h1h2)(y)=supxV{x,y(h1h2)(x)}=supxV{x,yinfuV(h1(u)+h2(xu))}=supxVsupuV{x,yh1(u)h2(xu)}=supuVsupxV{xu,y+u,yh1(u)h2(xu)}=supuV{supxV{xu,yh2(xu)}+u,yh1(u)}=supuV{h2(y)+u,yh1(u)}=h2(y)+supuV{u,yh1(u)}=h2(y)+h1(y)=(h1+h2)(y).

Theorem 9.276 (Conjugate of sum of convex functions)

Let h1:V(,] be a proper convex function and h2:VR be a real valued convex function. Then

(h1+h2)=h1h2.

Proof. The proof is more complicated and requires the application of Fenchel’s duality theorem (Theorem 10.85).

  1. Pick any yV.

  2. Elaborating the conjugate function

    (h1+h2)(y)=supxV{x,y(h1+h2)(x)}=supxV{x,yh1(x)h2(x)}=infxV{h1(x)+h2(x)x,y}=infxV{h1(x)+g(x)}

    where g(x)=h2(x)x,y.

  3. We note that since h2 is a real valued convex function, hence g is also a real valued convex function.

  4. We note that

    (ridomh1)(ridomg)=(ridomh1)V=ridomh1.
  5. Applying the Fenchel’s duality theorem (Theorem 10.85),

    infxV{h1(x)+g(x)}=supzV{h1(z)g(z)}.
  6. Further

    g(z)=supxV{x,zg(x)}=supxV{x,zh2(x)+x,y}=supxV{x,yzh2(x)}=h2(yz).
  7. Consequently

    (h1+h2)(y)=infxV{h1(x)+g(x)}=supzV{h1(z)g(z)}=infzV{h1(z)+g(z)}=infzV{h1(z)+h2(yz)}=(h1h2)(y).

This completes the proof.

Corollary 9.28 (Function sum as conjugate of infimal convolution of conjugates)

Let h1:V(,] be a proper closed convex function and h2:VR be a real valued convex function. Then

h1+h2=(h1h2).

Proof. We proceed as follows.

  1. Since h1 is proper and h2 is real valued, hence h1+h2 is proper.

  2. Since both h1 and h2 are closed functions, hence h1+h2 is closed.

  3. Since both h1 and h2 are convex functions, hence h1+h2 is convex.

  4. Thus h1+h2 is a proper, closed convex function.

  5. By Theorem 9.242,

    (h1+h2)=h1+h2.
  6. By Theorem 9.276,

    (h1+h2)=h1h2.
  7. Hence

    h1+h2=(h1+h2)=[(h1+h2)]=[h1h2].

Theorem 9.277 (Representation of the infimal convolution by conjugates)

Let h1:V(,] be a proper convex function and h2:VR be a real valued convex function. Suppose h1h2 is a real valued function. Then

h1h2=(h1+h2).

Proof. We proceed as follows.

  1. By Theorem 9.275

    (h1h2)=h1+h2.
  2. Since h1 is proper and convex and h2 is real valued and convex, hence by Theorem 9.274, h1h2 is convex.

  3. By hypothesis h1h2 is also real valued.

  4. Thus h1h2 is proper and closed.

  5. Since h1h2 is proper, closed and convex; hence by Theorem 9.242

    (h1h2)=h1h2.
  6. Hence

    h1h2=[(h1h2)]=[h1+h2].

9.19.4. Convexity#

An interesting application of constructing a convex function from a convex set in VR is the fact that addition of epigraphs leads to another convex function. This is also known as infimal convolution.

Theorem 9.278 (Infimal convolution)

Let f1,,fm:V(,] be proper convex functions. Let f:V(,] be defined as

f(x)=inf{f1(x1)++fm(xm)|xiV,x1++xm=x}.

Then, f is a proper convex function.

Proof. Let Fi=epifi and F=F1++Fm.

  1. Since fi is convex, hence Fi=epifi are convex sets for i=1,,m.

  2. Sum of convex sets is convex. Hence, F is a convex set.

  3. By definition of F, (x,t)F if and only if there exists xiV and tiR with (xi,ti)Fi such that x=x1++xm and t=t1++tm.

  4. It follows that fi(xi)ti for i=1,,m.

  5. Consequently, f1(x1)++fm(xm)t1++tm.

  6. By Theorem 9.111, a function g:V(,] defined as

    g(x)inf{tR|(x,t)F}

    is a proper convex function.

  7. Let T={tR|(x,t)F}. Then g(x)=infT.

  8. We note that

    T={t|(xi,ti)Fi,t=t1++tm,x=x1++xm}.
  9. We claim that g=f.

To show that g=f, we need to do the following.

  1. Show that domf=domg.

  2. Then, show that for every xdomf, f(x)=g(x).

Assume that xdomg.

  1. Then, there is no tR such that (x,t)F.

  2. Thus, there is no (xi,ti)Fi for i=1,,m such that x=x1++xm.

  3. Thus, for every xiV such that x=x1++xm there is at least one i such that there is no tiR such that (xi,ti)Fi.

  4. Thus, for every xiV such that x=x1++xm there is at least one i such that fi(xi)=.

  5. Thus, for every xiV such that x=x1++xm, f1(x1)++fm(xm)=.

  6. Thus, f(x)=.

  7. Thus, xdomf.

Assume that xdomf.

  1. Then, f(x)=.

  2. Thus, for every xiV such that x=x1++xm, f1(x1)++fm(xm)=.

  3. Thus, for every xiV such that x=x1++xm, there exists at least one i such that fi(xi)=.

  4. Thus, for every xiV such that x=x1++xm, there exists at least one i such that xidomfi.

  5. Thus, for every xiV such that x=x1++xm, there exists at least one i such that there is no tiR with (xi,ti)Fi.

  6. Thus, there is no (xi,ti)Fi for i=1,,m such that x=x1++xm.

  7. Thus, there is no tR such that (x,t)F.

  8. Thus, xdomg.

Thus, xdomfxdomg. Thus, domf=domg.

Let us now consider some xdomf=domg. Then, both f(x) and g(x) are finite. Our goal is to show that f(x)=g(x).

We first show that f(x)g(x).

  1. For every xiV with x=x1++xm, and fi(xi) finite, we have:

    f(x)f1(x1)++fm(xm).
  2. Thus, for every (xi,ti)Fi with x=x1++xm, we have

    f(x)f1(x1)++fm(xm)t1++tm.
  3. Thus, for every (x,t)F with (xi,ti)Fi, x=x1++xm, t=t1++tm, we have

    f(x)t.
  4. Taking the infimum on the R.H.S. over the set T, we get

    f(x)=f(x)g(x)=z.

Now, for contradiction, assume that f(x)<g(x).

  1. For every tR such that (x,t)F, we have g(x)t.

  2. Thus, for every (xi,ti)Fi with x=x1++xm, we have

    g(x)t1++tm.
  3. Then, for every (xi,ti)Fi with x=x1++xm, we have

    f(x)<t1++tm.
  4. Thus, there exists r>0 such that for every (xi,ti)Fi with x=x1++xm, we have

    f(x)+rt1++tm.

TO BE COMPLETED.