Directional Derivatives
Contents
9.15. Directional Derivatives#
This section deals with the subject of directional derivatives for convex functions.
9.15.1. Convex Real Functions#
For real functions
Remark 9.13 (Domain of a convex real function)
The domain of a convex real function is an interval.
Let
Now let
be an interval.We say
as the left end point of .We say
as the right end point of . and may or may not belong to .If both
and belong to , then is a closed interval.If neither
nor belongs to , then is an open interval.
9.15.1.1. Characterization#
Theorem 9.199 (Characterization of real convex functions)
Let
The following are equivalent.
is convex over .For every
with ,For every
with ,For every
with ,
Proof. (1)
Let
Then,
and .Also, verify that
Thus,
(2)
Let
and .WLOG, assume that
.Let
and .Let
.Then,
.From the hypothesis, we have
Using the previous argument backwards, this implies
Thus,
is convex.
(2)
Pick any
with .By hypothesis (2)
(2)
Pick any
with .By hypothesis (2)
9.15.1.2. One Sided Derivatives#
Recall from Definition 2.83 that
if
if the limit exists.
Similarly, if
if the limit exists.
We introduce two helper functions
and
where
Then
and
An interesting property of convex functions is that the one sided derivatives always exist. On the real line, there are only two directions to move; left and right. The one sided derivatives play the role of directional derivatives on the real line.
Lemma 9.4 (Monotonicity of
Let
is a nondecreasing function of . is a nonincreasing function of .
Proof. Monotonicity of
Let
such that .Let
such that and .Consider the three points
.Then
Hence
.Hence
is a nondecreasing function of .
The argument for monotonicity of
Observation 9.6 (One sided derivatives as infimum/supremum)
Due to the monotonicity of
Similarly, due to the monotonicity of
Theorem 9.200 (Real convex functions and one sided derivatives)
Let
If
If
Proof. We are given that
Let
.Then there exists
such that .For
, defineLet
such that .Let
, , .Since
is convex, hence by Theorem 9.199But that means
Thus, whenever
(up to ), .Thus,
is a nondecreasing (monotone) function of in some interval where .Then,
exists.
A similar argument shows that
9.15.1.3. Continuity#
Theorem 9.201 (Convex real function is continuous)
Let
is continuous at every .If
, then is continuous from the right at .If
, then is continuous from the left at .
In other words,
Proof. We proceed as follows.
Let
.By Theorem 9.200, the one sided derivatives
and exist.Then, by limit arithmetic
Similarly,
Thus,
Thus,
is continuous at .Since
was arbitrary, hence is continuous on .
Now consider the case where
By Theorem 9.200, the one sided derivative
exists.Then, by limit arithmetic
Hence
is continuous from the right at .
A similar argument holds for continuity from the left at
9.15.1.4. Properties of One Sides Derivatives#
Theorem 9.202 (Properties of one-sided derivatives)
Let
We have
for every .If
then both and are finite.If
and , then .The functions
are nondecreasing over .The function
is right-continuous at every interior point of . If then is right-continuous at .The function
is left-continuous at every interior point of . If then is left-continuous at .The function
is upper-semicontinuous at every .The function
is lower-semicontinuous at every .
Proof. (1)
If
, then by convention . Hence .If
, then by convention . Hence .Now let
.Then there is
such that .Pick any
such that .Then, using the three points
, we havedue to Theorem 9.199.
Taking the limit
, we see thatholds true for every
.
(2)
Let
.Let
such that .Then we have
Similarly, we have
By (1), we have
Hence both are finite at interior points of
.
(3)
Let
.Due to Theorem 9.199, we have
We also have
Combining, we get
.
(4)
Let
such that .From (3), we have
.From (1), we have
.Combining, we have
.Hence
is nondecreasing.Similarly,
.Hence
is nondecreasing.
(5)
Pick any
such that (if ).Then
.We can pick
and such that .Then
.We established in Theorem 9.201 that
is continuous.Taking the limit
, we obtainThis is valid since
is continuous.Now taking the limit
on the R.H.S., we obtainSince
is nondecreasing by claim (4), henceTogether, we must have
Hence
is right continuous at .
(6)
An argument similar to (5) shows that
is left continuous at every except for (if ).
(7) Upper semicontinuity of
We need to show that for every
there exists such thatPick some
.Consider any
.By (6)
is right continuous at .Hence there exists
such that for every , we haveBy (4),
is nondecreasing. Hence for everyHence for every
, we haveNow, let
.By monotonicity of
, for everyHence for every
Combining the two, for every
, we haveHence
is u.s.c. at .Now, if
then let .By right continuity of
at , there exists such that for every , we haveAlso
.Hence
is u.s.c. at .If
, then by convention .Hence
is u.s.c. at .
(8) Lower semicontinuity of
The argument is similar to (7).
9.15.2. Directional Derivatives of Proper Functions#
Definition 9.70 (Directional derivative)
Let
provided the limit exists.
We say that
The directional derivative is a scalar quantity (
we mean that
Since
With
Remark 9.14 (Directional derivative for zero vector)
If
We can see this from the fact that
A useful result is for computing the directional derivative of a function which is the pointwise maximum of a finite number of proper functions.
We recall from Theorem 3.22 that the interior of a finite intersection of sets is the intersection of their interiors. This is useful in identifying the interior of the domain for a pointwise maximum of a finite set of functions.
9.15.3. Differentiability#
9.15.3.1. Differentiability of Proper Functions#
Definition 9.71 (Differentiability of proper functions)
Let
The unique vector
If
9.15.3.2. Gradient and Directional Derivatives#
Theorem 9.203 (Gradient and directional derivatives)
Let
In other words, the directional derivative is the projection of the gradient in the specified direction.
Proof. For
Since
is differentiable at , hence
In particular, if we take the limit of
along the direction of as where and , thenSplitting the terms, we get
Multiplying with
and simplifying, we get:Thus,
9.15.3.3. Gradient in #
Remark 9.15 (Gradient in
It is imperative to compare the definition of gradients in this section
with Definition 5.1
(differentiability of functions from
To better develop our understanding of gradients, let us
examine the gradient in the Euclidean space
A vector
The individual coordinates are obtained via
Let
Following the notation in Observation 5.1,
the derivative of
We don’t have to check for
Compare this with (9.5).
For
Then, the definition of
Now consider the components of
By Theorem 9.203, the directional
derivative in the direction
Thus,
The partial derivatives of
Then, for an arbitrary direction
Recall from Definition 9.70,
that the directional derivative is independent
on the choice of the inner product. This
is also clear from the expression
However, this means that the gradient itself must
depend on the choice of inner product.
If
In the following, we shall assume that
Consider the inner product given by
where
Then,
Thus,
Thus, the gradient w.r.t. the inner product
9.15.3.4. Gradient in #
Remark 9.16 (Gradient in
We next look at the vector space of real matrices.
The standard basis is a family of unit matrices
The standard inner product is given by
Let
The gradient is given by
The directional derivative for some direction
Consider the inner product given by
where
Then,
Thus,
9.15.4. Proper Convex Functions#
9.15.4.1. Existence of Directional Derivatives#
An important property of directional derivatives
is that if
Theorem 9.204 (Existence of directional derivatives for convex functions.)
Let
Proof. This is a consequence of the directional differentiability of the scalar convex functions.
Define the convex function
asLet
.Then
is an interval of values for which .Since
, hence .We now note that
It is the right hand derivative of
at .By Theorem 9.200,
exists.Hence
exists for every and every .
Observation 9.7 (Relation between the directional derivatives in opposite directions)
We can see that
Hence
Hence
Observation 9.8 (Directional derivative as infimum)
Let
This follows from the fact that
9.15.4.2. Upper Semicontinuity#
The next result generalizes the upper semicontinuity property of the right hand derivatives of real convex functions.
Theorem 9.205
Let
holds true for every
Furthermore if
Proof. Limit superior
Choose any
.By definition of the directional derivative, there exists a
such thatDue to Observation 9.8, for every
and every , we haveNow
Hence for sufficiently large
, we haveBy taking the limit superior on the L.H.S. as
, we haveSince this is valid for every
, hence we must haveas desired.
Continuous differentiability
We are given that
is differentiable over .Then
is also continuous over .Let
.Let
be a sequence of converging to .Let
be any nonzero direction.Due to Theorem 9.203, for every
,Hence
By replacing
with in the previous argument, we haveHence
Thus we have
But then this must be an equality. Hence
Since this is valid for every nonzero direction
, hence we must haveHence
is continuous at every .Hence
is continuously differentiable at every .
9.15.4.3. Directional Derivatives Map#
The existence of directional derivatives in all directions
allows us to consider a mapping from a direction
We shall refer to such maps by
Theorem 9.206 (Convexity and homogeneity of
Let
Nonnegative homogeneity: For any
Proof. Convexity
Let
and .Let
.Then,
We used the convexity property of
in this derivation.Thus,
is convex.
Nonnegative homogeneity
For
,Thus, the homogeneity property is trivial for
.Now consider
.Then,
Thus,
is nonnegative homogeneous.
9.15.4.4. As Linear Underestimator#
Directional derivatives are a linear underestimator for convex functions.
Theorem 9.207 (Directional derivative as linear underestimator)
Let
Proof. Note that
Thus,
9.15.5. Pointwise Maximum of Finite Set of Functions#
9.15.5.1. Directional Derivative#
Theorem 9.208 (Directional derivative of a maximum of functions)
Let
with
Let
Let
In other words, the directional derivative of a pointwise maximum of functions equals the maximum of directional directives of functions which attain the pointwise maximum at a specific point.
Proof. The key idea here is that for computing the
directional derivative
Since
, there exists such that and are all defined over this open ball.Let
.For every
, let be defined aswith
. . Thus, . Hence, are well defined.Then,
We used the fact that
exists for every .Thus,
is continuous from the right at for every .Let
and .Then,
. Alternatively .Since
are continuous from the right, hence there exists such that for every .Minimizing
over all pairs of and , there exists such that for any and ,
We can now compute the directional derivative.
For every
,Consequently, for any
We used the fact that
for every .Taking the limit
,
9.15.5.2. Finite Set of Convex Functions Case#
Theorem 9.209 (Directional derivative of pointwise maximum of convex functions)
Let
with
Let
where
Proof. Since
By Theorem 9.204,
the directional derivatives
By Theorem 9.208,
where