This content was uploaded by our users and we assume good faith they have the permission to share this book. If you own the copyright to this book and it is wrongfully on our website, we offer a simple DMCA procedure to remove your content from our site. Start by pressing the button below!
0}. These classes of functions turn out to be useful in studying well-conditioned convex functions, uniformly convex and uniformly smooth convex functions, as well as in the study of the geometry of normed spaces. As an illustration of the use of these classes of functions we study the differentiability of convex functions with
XV111
Introduction
respect to arbitrary bornologies. Using one of the characterizations and the Br0ndsted-Rockafellar theorem one obtains the following interesting result of Asplund and Rockafellar: Let X be a Banach space and / : I - > l a proper lower semicontinuous convex function; if / * is Frechet differentiable at x* e int(dom/*) then V/*(x*) € X. In Section 3.4 we introduce the well-conditioned convex functions and give several characterizations of this notion using the conjugate and the subdifferential of the function. An important special case is that of wellconditioning with linear rate. This situation is studied in Section 3.10. In Section 3.5 we study uniformly convex and uniformly smooth convex functions, respectively. To any convex function / one associates the gages pj and 07 of uniform convexity and uniform smoothness, respectively. The gage pf has an important property: the mapping 0 < t i-» t~2pf(t) is nondecreasing. Because a/* = ( p / ) * and a/ = (p/*) # f° r a n v proper lower semicontinuous convex function / , the mapping 0 < i 4 t~2crf(t) is nonincreasing for such a function; moreover, one obtains that for such an / , / is uniformly convex if and only if / * is uniformly smooth and / is uniformly smooth if and only if / * is uniformly convex. Then one establishes many characterizations of uniformly convex functions and of uniformly smooth convex functions. In these characterizations appear functions (gages or moduli) belonging to different subclasses of A introduced in Section 3.3. These gages and moduli are sharp enough in order to obtain that / is c-strongly convex if and only if /* is Frechet differentiable on X* and V / * is c _1 -Lipschitz. Even if the results are established in general Banach spaces the natural framework for uniformly convex and uniformly smooth convex function is that of reflexive Banach spaces. This is due to the fact that when there exists a proper lower semicontinuous and uniformly convex function on a Banach space whose domain has nonempty interior, the space is necessarily reflexive. Section 3.6 is dedicated to the study of those convex functions which are uniformly convex on bounded sets and uniformly smooth on bounded sets, respectively. Under strong coercivity of the function one shows that these notions are dual. In Section 3.7 we study the function / v : X ->• E, f^x) = / „ " tp(t) dt, where
E,
x such that {0} x Vo C {{x,Ax)
\xeB}-[F<
Ao];
(ii) for every U £ Nx there exist A > 0 and V £ Ny0 such that {0} x V C {(x, Ax)\xe
At/} ~[F<
A];
(iii) there exists xo £ X such that (:ro,j4a;o) £ d o m F and F(xo,-) continuous at AXQ;
is
(iv) X and Y are metrizable, 0 E tbD and either F is cs-closed and gr A is cs-complete or F is cs-complete; (v) X is a Frechet space, Y is metrizable, F is li-convex and 0 £ lhD; (vi) X is a Frechet space, F is Isc and 0 £ lbD; (vii) X and Y are Frechet spaces, F is Isc and 0 £ %CD; (viii) dimy 0 < oo and 0 £ 'D,
Convex analysis in locally convex spaces
122
(ix) there exists XQ £ X such that F(xo, •) is quasi-continuous and {0} and D are unitedThen for x* £ X*, x £ domip and e > 0 we have: tp*(x*) = mm{F*(x* - A*y\y*) dM*) Proof.
= {A*y*+x*
\ y* £ Y*},
(2.56)
| (x*,y*) € dEF(x, Ax)}.
(2.57)
It is easy to verify that Vx* £ X* :
0 and V € Ky0 such that V C [g < A] - e(XU); (ii) there exist XQ € R, B € 3x X] - G(B);
and V0 € Ny0 such that V0 C [g <
(iii) there exists j/o € dorngfllmC such that g is continuous at yo; (iv) X, Y are metrizable, g has proper conjugate, either g is cs-closed and C is cs-complete, or g is cs-complete and C verifies (Hz), and 0 £ lbD; (v) X, Y are Frechet spaces, g, C are li-convex and 0 6 lbD, (vi) dim Y0 < oo and 0 6 ' D . Then Vi'el*
: ip*{x*)=mm{g*{y*)
\ x* G C*(*)}.
Moreover, ifxE domtp = C - 1 (domg) is such that
0 then de(p(x) C C* {deg(y)) (with equality i/grC is a linear sub space). Proof. Then
Let x* £ X*. Consider F : X x Y -)• I , F(x,y)
0. Let x* € dE(p(x). Since
• (ii) =» (i) and (iv) V (v) => (i). Important situations when gr C is a linear subspace are: C = A"1 with A £ &(X, Y) and 6 = A with A a densely defined closed operator {i.e. A : D(A) -> Y, D(A) being a dense linear subspace of X, A being a linear operator and gr A a closed subset of X x Y; see also Exercise 1.11). T h e o r e m 2.8.7 Let / , € A(X). Assume that d o m / D domg ^ 0 and let Xo := lin (dom/ — doing). Assume that one of the following conditions holds: (i) there exist Ao £ K, B £ S ^ and VQ £ 3sfx0 SMc/t ^ f l £ VoC[/ 0, £ = £x + e 2 . Then f{x) + (y* o H){x) + (f + y*o H)* {x*) < (x, x*) + ex, g(H(x))+g*(y*)<(H(x),y*)+e2. Adding them side by side we get ip(x) + g*(y*) + (/ + y* ° H)* (x*) < {x,x*) + e. Using Eq. (2.68) we obtain that x* € deip(x). Therefore Eq. (2.69) holds. Let F(x, y) := f(x) + g(y) and G : X =t Y with gr C := epi H; then F £ A(X x Y) and 6 is convex. Since g is increasing on H(dom H) + Q, it follows that (p(x) = int{F(x,y) + iep\H{x-,y) \ y 6 Y] for every x € X. Hence ip is the marginal function associated to the convex function F + iepi H ', hence ip is convex. If one of the conditions (i) — (vi) is verified, then F and 6 satisfy the corresponding condition of Theorem 2.7.4. (When / and g are cs-complete and have proper conjugates, similarly to the proof of Proposition 2.2.17, we get that F is cs-complete.) As noticed after the proof of that theorem, also the perturbed function F, F(x,y) = F(x,y) — {x,x*), satisfies the same condition. Therefore there exists z* £ Y* such that a : = p*{x*) = min{(Ai/i + • • • + A n / n )*(x*) | ( A i , . . . , An) e A„}, E, >p := m a x { / i , / 2 } . Suppose that one of following conditions is verified: (i) there exists XQ G dorafa n d o m / 2 such that fa is continuous at XQ; (ii) X is a Frechet space, fa, fa are li-convex and 0 G l 6 (dom/i — dom/2); (iii) dim X < 00 and ^dom fa) n *(dom fa) ^ 0. £ T(E) and > is continuous on domy D [0, ||a; — j/||]. It is obvious that >'(t) = g'(y + tu,u) for every £ £ domyj. By Theorem 2.1.5(v) we have that lim g'(y + tu,u) = lim 1 + | 0 then 0 for s > 0, and so we get a function i(i) := a\t\ with a > 0, t~p(f(t) is nondecreasing (nonincreasing) on P then the mapping s i-» s~g R, f(x) := ip(\\x\\). Then f*{x*) = >#(||:r*||) for every x* £ X* and f**{x) = <^*#(||a;||) for every x £ X. Proof, (i) Let ip £ A0nN0 and assume that 0 such that 0; then there exists e > 0 such that e. Then ip < ip, and so (p* > ip*- In 0. For 0 < s < e~lip(e) we have that 0] — max {sup{is - e}} < max{se,sup{i(s — e~1ip(e)) | t > e} — se, and so limS4.0 s~lip#(s) = 0. This means that cpy(t) for all* > 0 and c > 1. Let d > 1 and s > 0. Then dq 0} > sup{(ds)(dq/pt) 0} = 0, 0. Moreover, 0. (vi) => (vii) Let 8 := (ph. By Lemma 3.3.l(i) we have that 6 £ N0 n fi0Furthermore, from the definition of 9, we have that #(||a;*||) > ds{x) for every (x,x*) £ Of. (vii) =$• (vi) Let (iii) Let 7 £]0,1[ and V : K+ -^ W+, <^(7 • ds(x)) > c • v?(7 • ds(x)). Hence (iii) holds for this tj). Because t • ^(jt) > JQ (t) := (1 - 7) JQ tp(^s) ds. (vii) => (viii) is obvious. (viii) =>• (vii) Consider the function 6 : 1 + -> ! + , • R, tp(t) = f(x + tu); '-(0) < <^+(0) < ¥>'_(*), and so f'(x,u) 0. (vi) =>• (vii) and (vii) =*> (vi) are immediate with 8 := iph and ip := 8e, respectively. (vi) =>• (iii) as in the proof of Theorem 3.4.3, with tp2(t) '•— (1 7) J0 (fijs) ds, where 7 G ]0,1[ is fixed. (vii) •&• (viii) is obvious; for <= take 0 the gage of uniform continuity of 0, respectively. (vii) O (viii) is obvious with 6 := (ph and tp := 6e, respectively. (viii) =>• (i) Let (x,x*), {y,y*) £ g r d / ; from the hypothesis we have that (y -x,y*- 08(t) = stp~1(s) - tpiip-1 (s)), t — y _ 1 ( s ) being the unique point where the supremum is attained; it is obvious that this assertion is also valid for s = 0. Therefore (||x||) -<9|| • ||(x) for x ^ 0, we have that (b) •£> (c). The expression of the set of supporting hyperplanes of Ux at a point of Sx given by Eq. (3.56) shows that (c) =$(a). Conversely, if X is smooth, using the expression of the set of hyperplanes mentioned above, 9|| • ||(x) has only one element for every x £ Sx, and so d\\ • ||(x) is a singleton for every x £ X \ {0}. Using (iii) we obtain that df{x) is a singleton for every x £ X, whence (a) => (b). The last part follows from Corollary 3.3.3. (v) (a) =£• (b) Suppose that X is strictly convex and let x,y £ X, x ^ y, and A £ ]0,1[. We have seen in (i) that /(Ax + (1 - X)y) < A/(x) + (1 - (\\x\\) + y),r 0 and 7 €]0,r[. In order to show that a°tr £ fii, let e > 0. Because lim(j.0 0. / / one of the above conditions holds and X is a Banach space then X is reflexive. Proof. The equivalence of (i) and (ii) is given by Proposition 3.7.4, the equivalence of (ii), (iii) and (v) follows from Proposition 3.6.3 because fv and (ftp)* are strongly coercive, and the equivalence of (iv) and (v) follows from Proposition 3.7.7. The last statement follows by Proposition 3.6.3. • A dual result holds, too. Theorem 3.7.9 are equivalent: inf/ = —/*(0) = 0 (such a sequence exists!). Suppose that (z„) 7^ 0. Then there exists a > 0 such that P := {n G N | ||a;n|| > a} is infinite. Therefore Vn G P : /(a; n ) > / " ( i n ) > ¥»#(||a:„||) > 0 => (xn) —> 0 for every sequence (xn) C X. Let x* € X* be fixed. Suppose that /*+(0;:r*) > 0, i.e. there exists fi > 0 such that f*(tx*) > tfj, for every t > 0. It follows that f*(^x*) > \[i for every n € N. Therefore there exists (xn) C X such that • 1, p(\) := /(Ax + (1 - X)y) is a lsc convex function and [0,1] C dom<^. By Proposition 2.1.6 (p is continuous on [0,1]. As (p(0) < t < ip(l), there exists A G]0,1[ such that y>(A) = t. Taking z := Xx + (1 - X)y the conclusion follows. • 0 such that ip(x) > 0, 2) $(x) > 0 for every x > 0, 3) liminf^-xx, y>(x)/x > 0. 1), (ii) was established in [Vladimirov et al. (1978)] and [Aze and Penot (1995)]. Section 3.8: The results on best approximation are classical. Most of them can be found in the books [Holmes (1972); Precupanu (1992)]; the other results are folklore. However we mention that the formula (3.66) for the directional derivative of the distance function was obtained by Lewis and Pang (1998)) in Euclidean spaces. Section 3.9: The implication (a) =>• (c) of Theorem 3.9.1 is the main result of [Soloviov (1993)] (see also [Soloviov (1997)]); the convex case is established (even in a more general framework) in [Asplund and Rockafellar (1969)] (see also a\\ - x\\ = a\\x\\, sup{\ip(en)\ | n G N} = sup{n | n G N} = oo, which shows that 0 for every i £ R + \ {1}. It follows that ip'p is increasing on R|_, and so y p is strictly convex and f'p(0) = 0 for t > 0. Hence y>p is increasing on M+. Let now p e]2,oo[; then p p '(f) < 0 for every i € R+ \ {1}. It follows that ip'p is decreasing on Rj., and so y p is strictly concave and 0. Hence <^p is decreasing on R+. Exercise 2.8 Let D :=]0, oo[2; D is an open convex set and / is twice differentiable on D. Consider ip(x,y) := arctan | for (x,y) € D. Then, for (x,y) e D, we have R, ^>(i) = VI +* 2 - Then 0 for every t G R. By this remark the convexity of / is immediate. Using Taylor's formula for functions of a real variable, we have that for all t, T G R, there exists d G ]0,1[ such that fit + T) - 0, then consider the functions gn,9 defined (a.e.) by the formulas y/l and SdomF = (F*)oo- So g<x>(u*) = mmv*eY*(F*)oo(u*,v*) for every u* G X*. Let D := Pry (dom F) and To = linD. It is clear that condition (ii) of Theorem 2.7.1 implies that 0 G lD, and so YQ~ = D1- = D~. The conclusion follows from the last part of Exercise 2.23. Exercise 2.27 We consider that a — b := a + (—6), the sum being taken in the sense of convex analysis: (+oo)+(—oo) — +oo. We have that i n f l € x (f(x) — g{x)) equals inff/^)x£Ji. • (ii) Since if is nondecreasing we obtain that 0. (ii) =>• (iii) This implication is obvious because 0 < f'(x, —x) + f'(x,x). (iii) =>• (i) For x € d o m / and t G 7 we have that tx G d o m / , and so, by hypothesis, pf(tx) < }'{tx,tx) = tf'(tx,x); hence 0 for every t 6 / . But 0 for all s,t G i n t / with s < t. Hence • (iv) Let (x,x*) G g r d / . Then (—x,x") < f'(x,—x) < —pf(x), whence the conclusion. Assume now that / is continuous at 0; of course, 0 G int(dom/) and / is continuous on int(dom/) in this case. Moreover df(x) ^ 0 for every x G int ( d o m / ) , and so the implication (iv) =>• (v) is obvious. (v) =>• (i) Consider x G int(dom/). Taking x* G df(x) furnished by our hypothesis, we have that f'(x,x) > (x,xm) > pf(x). Hence pf(x) < f'(x,x) for every x G int(dom/). Let now x G d o m / . Then tx G i n t ( d o m / ) for every t G int I. As in the proof of the implication (iii) =>• (i) we obtain that the function ip is nondecreasing. Exercise 2.29 Take first x* G ffi=1df{xi). Then {x - X i , x * ) < f(x) - f(xi) for x G d o m / . Multiplying with Ai, then adding side by side the corresponding inequalities, we get {x - x,x*) < / ( x ) - c o / ( x ) for x G d o m / . Take now (x,t) = H^i^jfajttj) € co(epi/) with (xj,tj)jeT^ C e p i / and A G A p . From the • 1) is clear. 3) =*> 2) Let 2a := liminfx-xx, ip(x)/x > 0; then there exists p > 0 such that ax for every x > p. Let x > 0 be such that ip(x) = 0; hence (x,0) € epitp — co(epi / ) C R 2 . Using Caratheodory's theorem (Exercise 1.1), for every i g {1,2,3} there exist the sequences (AJJ, (x\), (tln) C R+ such that: A> + A2 + A3 = 1, ¥>(*») < ti for all n e N and 1 < i < 3, ( £ 3 = 1 A n < ) -* * and ( 2 f = i ^n*n) —> i>ix) = 0- Without loss of generality, we assume that (x\)->x\ ax'n, and so \ntn > a\nxln. So we get the contradiction 0 = T{ > ay{ > 0. Therefore y1 = y2 = y3 = 0, and so x = 0. 1) => 3) Suppose that liminfz-Kx, ip(x)/x = 0; then there exists (xn) C R+ such that (x„) —¥ oo and ip(x„) = anxn with (a„) —¥ 0. Let x € P; there exists n* € N such that xn > x for n > nx. Set A„ := x/x„ €]0,1[ for n > nx\ then K(xn,f{xn)) (t) = 0 <S> t = 0} the affine hull of the set A (p. 2) the set {xeX\ f[x) = inf / } for / : X -+ I the set {y € (X, d) | d(y, x) < e} the set J3(0,1) in a normed space (X, || • ||) the class of all bounded subsets of the tvs X the set cl A \ int A the adjoint of the convex process C (p. 26) the cone cl (cone(A — a)) the nvs of real continuous functions on the interval [a, b] the set of convex and Lipschitz functions on (X, || • ||) the closure of the set A C (X, r ) the convex hull of the set A (p. 2) and co A, respectively the lsc convex hull of the function / (p. 63) the conic hull of the set A (p. 2) and cone A, respectively the upper Dini directional derivative (p. 58) the set {y G (X, d) \ d(y, x) < e} metric or distance 363 • Y see p. 14 the set {x G X | (x,x*) = a } the sets { x G X | <x,x*) < a } , {x G X | (x,x*) > a } the sets { x G X | (x,x*> > a } , {x G X | (X,X*) < a } the function from X into X defined by Idx (x) = x the set {T[x) | x G X } for the operator T : X -+Y the set \Jx€X ft(x) for the multifunction ft : X =4 Y the number inf^gx g(x), where g : X —> R the interior of the set A C (X, r ) the interior of A C Y w.r.t. the topology on Y C (X, T) the set {x G X | T(x) = 0} for T G L(X, Y) separated locally convex space the space of (classes of) Lebesgue integrable functions on ]0,1[ the space of linear operators T : X —> Y the space of continuous linear operators from X into Y — s u p J g / infi>-j A, G R for the net (Xi)iei C R — s u p V € X ( a ) inf x e y / ( x ) G R for / : (X, r ) ->• R — infje/ s u p i t • Aj G R for the net (\i)iei C R 0}_ the class of lsc proper convex functions / : X —> R the class of tu*-lsc proper convex functions h : X* —> R t h e s e t {(Ai,...,A„) € R + | Ai + --- + A„ = 1} for n e N the element of C.E for which Su(x) := 1 if x — u, 0 otherwise the duality mapping of the normed space (X, || • ||) (p. 230) — the function defined by V>A{X) := inf{< | (x,t) G A} the conjugate of ip G A :
A];
S«C/J
that
Formulas for conjugates and
s-subdifferentials
127
(iii) there exists XQ G dom / n dom g such that g is continuous at XQ ; (iv) X is metrizable, f,g have proper conjugates, f is cs-closed, g is cs-complete and 0 G l 6 ( d o m / - domg); (v) X is a Frechet space, f,g are li-convex and 0 G l i > (dom/ — domg); (vi) X is a Frechet space, f,g are Isc and 0 € l 6 ( d o m / — domg); (vii) X is a Frechet space, f,g are Isc and 0 G I C (dom/ — domg); (viii) dimX 0 < oo and 0 G *(dom/ — domg); (ix) g is quasi-continuous and d o m / and domg are united; (x) X is a Frechet space, f,g are li-convex and (0,0) G lb[{(x,x) X} — dom / x dom g).
\x G
Then for x* € X*, x G dom / Pi domg and e > 0 we have: ( / + )*(**) = mi n {/*(** - y*) + g*(y*) | v* G X*} = (/*Og*)(x*), (2.63) de(f + 9)(x) = {J{deif(x)
+ dE2g(x) \ei,e2
> 0, ex + e2 =e),
d(f + g)(x) = df(x) + dg(x). Proof. Taking A = Idx, the conclusion follows from Theorem 2.8.3 under conditions (i) - (iii), (v) - (ix). If (iv) holds, taking Y = X and $(x,y) := f(x) + g(x — y), condition (iv) of Theorem 2.7.1 is verified. If (x) holds consider F : X x X -)• I , F{x,y) := f{x) + g{y) and A : X -> X x X, A(x) :— (a;, a;). Then condition (v) of Theorem 2.8.6 holds; applying it we obtain the conclusion, taking into account that A*{x*,y*) = x* + y*. • The same implications as in Theorem 2.7.1 (mentioned in Proposition 2.7.2) hold. Corollary 2.8.8 Let X be a Frechet space and f, g G A(X). / / / and g are li-convex then for every x G * 6 (dom/ + domg) we have (/Dg)(x) = (/*+$*)•(*) = m&x{(x,x*)
- f*(x*)
- g*(x*) \ x* € X*}. (2.64)
Proof. Let a;o G s 6 (dom/ + domg) and consider h G A(X), h(x) := g(a;o — x). Then dom/i = XQ — domg, and so d o m / - dom/i = d o m / + domg — XQ- Therefore 0 G l 6 ( d o m / — dom/i), and so condition (v) of
Convex analysis in locally convex spaces
128
Theorem 2.8.7 is satisfied. From formula (2.63) applied for x* = 0 we get (fn9)(x0)
= inf. (/Or) + h{x)) = - ( / + h)*(0) = - min ( / • ( * * ) + >»*(-**))• X*£X *
But h*{—x*) = s u p { - (x,x*) — g(xo — x) \ x € X} = g*(x*) — (x0,x*), and so (2.64) holds for x = x0. O Of course, one can obtain the conclusion of the preceding corollary also for other situations corresponding to conditions of Theorem 2.8.7. For example, if there exist A0 £ M, B € "S>x, VQ € Nx0 (x) s u c n that Vo C [/ < A0] n B + [g < A0], where X0 - aff(dom/ + dom^), then Eq. (2.64) holds. Proposition 2.8.9 Let f,g € A(X) and take D = d o m / - domg. If dim(lin£>) < oo then {D = rintZ?, while if X is metrizahle, f,g have proper conjugates, f is cs-closed, g is cs-complete and tbD ^ 0 then lbD = rint D; similarly for the situations corresponding to conditions (v)-(vii) of Theorem 2.8.7. Suppose that X is a Banach space and f, g are li-convex. Then for every x € lbD there exist n, A > 0 such that (x + nUx) n aff D C [f < A] n \UX - [g < A] D \UXProof.
$(x,y)
(2.65)
The first part follows from Proposition 2.7.2 taking Y = X and
=f(x)+g(x-y).
Suppose that X is a Banach space and / , g are li-convex; consider x £ lb D. Replacing / by / , f(u) — f(u + x), we may suppose that x = 0. It follows that condition (v) of Theorem 2.8.7 holds, and, as noted after its proof, condition (i) is verified. Therefore there exist 77 > 0, Ao G B. and B £"Bx such that rjUx n aff D C [/ < Ao] n B - [g < A0]. Taking A' > max{Ao, 0} such that B C X'Ux and A = A' + n we obtain that rjUx n aff D C [/ < A0] 0 B - [g < A0] D (B + nUx)
c [/ < A]n xux - [f < A]n \ux. The proof is complete. Another important result is the following.
•
Formulas for conjugates and
E-subdifferentials
129
Theorem 2.8.10 Let Y be ordered by the convex cone Q, f £ A(X), H : X —> Y' be convex and g £ A(y) be Q-increasing on H(domH) + Q. Then tp :— f + goH is convex. Assume that 0 £ D, where D :— H(domHC\ d o m / ) — domg + Q, and consider Yo := linD. Assume that one of the following conditions holds: (i) for every U £ J^x there exist A > 0 and V € Ky0 such that V C H (XU n [/ < A] n d o m F ) - [g < A] + Q; (ii) there exist XQ 6 R, B £ 1$x and VQ £ Ny0 such that V0 C H (B n [/ < A0] n domH) -[g<
A0] + Q;
(iii) there exists XQ £ d o m / fl i f - 1 (domg) such that g is continuous at H(x0); (iv) X, Y are metrizable, f, g have proper conjugates, either f, g are cs-closed and epiif is cs-complete, or f,g are cs-complete and epiH is cs-closed, and 0 G lbD; (v) X, Y are Frechet spaces, f,g, epiH are li-convex and 0 £ (vi) dim Y0 < oo and
tb
D;
i
0£ D;
(vii) g is quasi-continuous, and domg and H(domH united.
f~l d o m / ) + Q are
Then for x* £ X*, x £ dormp = d o m / n H~1(domg) have:
and e >Q, we
deV{x) = | J {0Er (f + y*° H){x) \y*£Q n
dE2g(H(x)),
(2.66) El
+ e2 = e} . (2.67)
Proof. have
First observe that for all x* £ X*, y* £ Q+ and x £ dom<£ we
V ' O O < (/ + V* o H)*(x*) + g*(y*), de
+
+ y*o H)(x) \y*£Q n
de2g(H(x)),
(2-68) El
+ e2 = e], (2.69)
Convex analysis in locally convex spaces
130
without any supplementary condition on / , g and H. Indeed, let x € dom<^ = d o m / n H-^domg), x* € X* and y* £ Q+. Then f{x) + (y* o H)(x) + (f + y*o H)* (x*) > (x, x*), g(H(x))+g*(y*)>(H(x),y*), whence, adding them side by side, we get g*(y*) + (/ + y* ° H)* (x*) > (a;, a;*) -
jnf
,
=
.1TUix)+g{y)-{x,x''))
inf
(f(x)+g(y)-(x,x*)
+
(z,z*))=:(3.
{x,y+z)EepiH
Using again the fact that g is increasing on H(domH) a =
Jnf
w
xedom H
=
inf iGdom H
j?^
n
+ Q, we have
VW + 9{y) ~ (x,x*))
yeH(x)+Q
(f(x)+g(H(x))
- {x,x*)) = ~
Formulas for conjugates and
s-subdifferentials
131
and P=
inf
inf
(f(x) + g(y) - (x, x*) + (H(x) +q-y,
z")).
iGdomif q€Q, y€Y
It follows that P = - o o if z* $ Q+. If z* £ Q+ then -0=
sup x€dom
sup((x,x*)-f(x)-(H(x),z*)
+
(y,z*)-g(y))
Hy£Y
= (/ + Z*o #)*(*•) + •(**)• Taking into account Eq. (2.68), it is clear that Eq. (2.66) holds. In the case (vii) consider F : I x Y ->• K, F(x,y) := f(x)+g(H(x)+y)(x,x*). Because 0 € D and g is Q-increasing on H(domH) + Q, there exists xo € d o m / n H~1(domg). It follows that F(x0, •) is quasi-continuous and {0} and D — Pry (dom F) are united. Applying Theorem 2.7.1(ix) we have that inf x e x F(x,0) = max y . e y.(—F*(0,y*)). With a similar calculation as above we obtain that Eq. (2.66) holds. Let x 6 donup and x* 6 d£ip(x). Using Eq. (2.66), there exists y* € Q+ such that ip*(x*) = (f + y* o H)*(x*) + g*{y*). It follows that {f+y*oH){x)+{f+y*oHY{x*)-(x,x*)+9{H{x))+g*{y*)-{H{x),y*)
< e.
Taking £ l := (/ + y* o tf)(x) + (/ + y* o #)*(x*) - (x,x*) (> 0 by the Young-Fenchel inequality) and e 2 = e - ei, we have that y* € d£2g(H(x)) and Z* € d£l(f + y* oH)(x). Therefore the inclusion C holds in Eq. (2.67); taking into account Eq. (2.69), we have the desired equality. D We use the preceding theorem to obtain formulas for conjugates and subdifferentials of functions of "max" type. Corollary 2.8.11
Let / i , . . . , / „ E \(X)
y.X^W,
and
¥>(*):=/i(z)V---V/„(z).
Suppose that dom ip = f}™=1 dom /i ^ 0 and e € IR+. For x € dom ip denote by I(x) the set {i G l , n | fi(x) =
Convex analysis in locally convex spaces
132
while for every x £ dom
(Ai, . . . , A„) G A„,
77 e [0,e], Yli=1Xi^x^
- ¥>(x)+ri-£o}
,
dV{x) = U { 5 (E" = 1 A J i ) {x) I (Al!' • •'An) € A"' Vifl{x) Proof.
: Ai = o}.
Let us consider the functions
H:X->(W\Rl),
H(x):=( "•" 3: «"-•«,
(A(*)>••"'/«(*)) [00
if x € fX=i dom/,, otherwise,
g(y):=yiV.---Vyn.
It is clear that condition (iii) of the preceding theorem is verified. Taking into account the expressions of g* and deg(y) given in Corollary 2.4.17 we obtain immediately the relations from our statement. D An application of the previous result is given in the following example. Example 2.8.1
Let / € A(X) and consider /+ := / V 0. Then
f df(x) df+(x) = l U { W ) ( x ) | A e [ 0 , l ] } { d(0f)(x) = didomf(x)
if f(x) > 0, if f(x) = 0, if f(x)<0.
A useful particular case of the result in Corollary 2.8.11, in which we can give explicit formulas for ip* and detp without supplementary conditions, is presented in the following corollary. Corollary 2.8.12
--1J=I
Let f{ 6 A(X,) for i € T~n and Xi-+W,
tp(x1,...,xn):=f1(xi)V---Vfn(xn).
For x = (xi,... ,xn) S domtp = Yl7=i dom/j consider I(x) := {i G l , n | fi(Xi) =
(Xu...,\n)eAn}.
(2.70)
Formulas for conjugates and e-subdifferentials
For x — (xi,..., dMx)
133
xn) E dom
= {j{~[[n.=1dei(^fiKxi)\
Eni=o
( A i , . . . , A „ ) G A n , et > 0, V~^n £i = £
1 Xi Xi
' Z^i=1 ^ ^
- ^
£
- o) >
S ^ ^ - l J l n ^ ^ A i / O C ^ ) ! (A1,...,An)€A„, VigJ(a:) : A; = o} . Proof. We apply the preceding corollary to the functions fa : X ->• R, /i(a;) := fi(xi), then we use Theorem 2.3.1 (viii) for arbitrary n and Corollary 2.4.5. • Other situations when one has explicit formulas for tp* and d
3
+ d(»g)(x) I (A,M) G A 2 , A / ( X ) + ng(x) =
Proof. When / and g verify one of the conditions (i)-(iii), (v), (viii)— (x) of Theorem 2.8.7 and A,/x > 0, then the functions A/ and fig also verify the same condition. If condition (vi) or (vii) holds then, by the relations among the classes of convex functions on page 68, condition (v) holds. So, (Xf + ng)*{x*)=Toia{(Xf)*{u*) + (jjig)*{v*) \u*+v* = x * } a n d d(\f + ng)(x) = d(Xf)(x) + d(fig)(x). Using Corollary 2.8.11 one obtains the conclusion. • Taking into account that for x G dom / , one has df(x) + N(dom f, x) = df(x) (see also Exercise 2.23), d(Xf)(x) = \df(x) for A > 0 and d{0f)(x) = N(domf,x), we get for f,g G A(X) and x G X with f(x) = g(x) G R,
U{W)W+8WWI(A,|f)6A 2 } -co(df(x)Udg{x))
+ N(domf,x)
+N(domg,x).
This formula can be extended easily to a finite number of functions. Using Corollary 2.8.12 we get the following result concerning the maxconvolution.
134
Convex analysis in locally convex spaces
Corollary 2.8.14 have that
Let / i , / 2 € A{X) and e £ 1+ . For every x* £ X* we
(hQf2y(x*)=rnin{(\1f1y(x*)
+ (\2f2y(x*)
| (A1;A2) £ A 2 } .
(2.71)
Suppose that (/i0/ 2 )(a:) = max{/i(2;i),/ 2 (a; 2 )}, where xi £ d o m / i , x~2 £ dom /2 and x = x~i + x2. Then
de{hOh){x) = \J{dEl(^h)(xi)ndE2(\2f2)(x2) e i , e 2 > 0 , £i+e2<e
+ X1f1(x1)
I (Ai,A2) e A2,
+ X2f2(x2)-(f10f2)(x)}.
(2.72)
Furthermore, if fi(x\) = f2(x2) *^en Eq. (2.73) is verified, while if fi(xi) 72(^2) *^en i?g. (2.74) is verified, where d(fi0f2)(x)=[j{d(X1f1)(x1)nd(X2f2)(x2) d(fi0f2)(x) Proof.
I (A1;A2) G A 2 } ,
= dMxx) nN(domf2-:x2).
>
(2.73) (2.74)
Let x* £ X*; then
{fi0f2)*(x*)
= s u p i e X ( ( x , x * ) - i n f f / ^ ) V / 2 ( z 2 ) I *! + x2 = x}) = sup{(ii,a;*) + (a;2,a;*) - fi(xi)
V f2(x2)
\x1} x2 £ X)
= min{(A 1 / 1 )*(x*) + (A 2 / 2 )*(s*) | (Ai,A2) £ A 2 } , the last equality being obtained by using Eq. (2.70). The inclusion "D" of Eq. (2.72) can be verified easily. Consider x* £ de(fi0f2)(x). By Eq. (2.71), there exist (Ai, A2) £ A 2 such that (/i0/2)*(x*) = (Ai/i)*0O + (A 2 / 2 )*(x').
(2.75)
Using the preceding relation we obtain that 0 < [ ( A 1 / 1 ) ( ^ 1 ) + (A 1 / 1 )*( 2; *)-(5; 1 ,a;*)] + [(A 2 / 2 )(i 2 ) + (A 2 / 2 )*(:c')-<S 2 ) a :*)] < (fiOh)(x)
+ ( / i O / 2 ) * 0 0 - (x,x*) < e.
Taking e, := (Aj/i)(zj) + {Xifi)*(x*) - (x~i,x*) > 0, i £ {1,2}, using the preceding relation and Eq. (2.75), we get ( / i 0 / 2 ) ( z ) + £ 1 - Ai/i(3fi) + e 2 - A 2 / 2 (x 2 ) < e, and so x* belongs to the set on the right-hand side of relation (2.72).
Formulas for conjugates and
s-subdifferentials
135
Let us prove now the equalities (2.73) and (2.74). The inclusions "D" follow directly from Eq. (2.72). Let us prove the converse inclusions. Let x* G d{faOfa)(x) = d0(faOfa)(x). By Eq. (2.72), there exist (Ai, A2) G A 2 and £i, £2 > 0 such that x* £1 + e 2 < Xifi(xi)
£dei(Xifi)(x1)ndea(Xif2)(x2), + A 2 / 2 (x 2 ) - (faOfa){x) < 0.
Therefore £1 = £2 = 0 and Ai/i(xi) + A 2 /2(x 2 ) = {fa(>fa){x); hence x* belongs to the set on the right-hand side of Eq. (2.73) when / i ( x i ) = fa{x2)If fi{x~i) > / 2 ( ^ 2 ) , the preceding relation shows that A2 = 0 , Ai = 1, and so x* G 9/i(11),
x* G 0(0 • / 2 )(z 2 ) = <9idom/2(^2) = -/V(dom/ 2 ;x 2 ).
This shows that x* belongs to the set on the right-hand side of relation (2.74). • Note that in Eq. (2.72) we can take E\ + £2 = £ + •••. Moreover, if /1 is continuous at x\ and fa is continuous at a;2, then in Eq. (2.74) we have d(fiOfa)(x) = {0}. Indeed, in this situation, N(domfa;x~2) = {0} (since x~2 G int(dom/ 2 )); since /1O/2 is continuous at x, d(fiQfa)(x) ^ 0. In particular, it follows that X\ is a minimum point of fa. Furthermore, in this situation, /1O/2 is even Gateaux differentiate. Introducing the convention that 0 • df(x) := N(domf;x) for / G A(X) and x G d o m / , formula (2.75) may be written in the form d(fa0fa)(x)
=
dfa(x1)odfa(x2),
where A o B represents the harmonic sum of the sets A and B, which generalizes the inverse sum of A and B. In order to have more explicit formulas in Corollary 2.8.11 it is necessary to impose some supplementary conditions. Let us give such conditions and the corresponding formulas in three situations for n = 2. Corollary 2.8.15 Let fa, fa G A(X), e G 1+ and
Convex analysis in locally convex spaces
136
Then, for all x* € X* and x £ dom/i D dom/2 we have: ¥>*(*•) = min{(Ai/i)*(a;I) + (A 2 / 2 )*(^) | (Ai, A2) e A 2 , x\ + x*2 = x*}, dMx)
= \J{dEl(Xifi)(x)+dE2(X2f2){x) e0+ei
I (Ai,A2) G A 2 , e 0 > ei,e 2 > 0,
+ e2 = e, Xifi(x)
dtp(x) = \J{d(X1h)(x)+d(X2f2)(x)
+ X2f2{x) >
I (Ai,A2) G A2>
Xih(x) + x2f2{x) = tp(x)}. Let defined by
-e0},
a
* £ X* and consider the functions / i , / 2 : X —>• K
/i(a;) : = m a x { ( i , a ; ; ) , . . . , ( j ; , < ) } ,
/ 2 (x) := |{a;,a;*)|.
Applying the preceding corollary, we get the following formulas for every xeX: d
h(x)
= { ]C" = 1 A ^i I (Ai, • • •, A„) € A„, ^ = 0 if (x, x*) < /!(a;)} , r {x*} 0/2 (*) = < {Az* I A € [ - 1 , +1]} 1 {-a;*}
2.9
if if if
(x,x*) > 0 , (x, x*) = 0, (x,x*) < 0 .
Convex Optimization with Constraints
Let us come back to the general problem of convex programming as considered in Section 2.5, (P)
min f(x),
x £ C,
where / € A(X), C C X is a convex set and C (~l d o m / ^ 0; the spaces considered in the present section are separated locally convex spaces if not stated explicitly otherwise. Since x is solution of (P) exactly when it minimizes f + ic, x is solution of (P) if and only if 0 € d(f + i
Convex optimization
Proof.
with
137
constraints
In the conditions of our statement we have that
VxeCndom/
: d(f + ic){x) = df(x) + duc{x) = df(x) +
N{C;x),
whence the conclusion is obvious.
•
Very often the set C from problem (P) is introduced as the set of solutions of a system of equalities and/or inequalities. Let Y be ordered by a closed convex cone Q CY and H : X -» Y' be a Q-convex operator; the set C := {x € X \ H(x)
min f{x), H{x)
An element x € X for which H(x)
min f(x),
H(x)
y.
Let us consider the function $ : I
X
F 4 l ,
/(l) *(*,y):=«{ (x, y) := | v wy ' ' oo
i
f
^ ^ ^ otherwise.
(2.76) v '
The problem (Py) becomes now (Py)
min $(x,y),
i £ l
We obtain, without difficulty, that $ is convex. Moreover $*(z*,-y*) = sup{(x,a;*) + (y, -y*) - *(x,y)
\x€X,yeY}
= sup{(x,x*) - (y,y*> - / ( z ) | x € X,y £ Y,#(aO < Q y} = sup{(a;,a;*) -(H(x)+q,y*) = sup{(z,:r*) - (H(x),y*)
- f(x) \ x € dom//, g € Q} - f{x) \ x £
domH}
+ sup{-(g,y*) | 9 € Q}. Hence $*(x*,-y*)
= sup{(x,x*)
if y* e Q+ and $*{x*,-y*)
- f(x) - (H(x),y*)
= oo if y* g Q+.
\x£domH}
138
Convex analysis in locally convex spaces
The function L:XxQ+->R,
L(x,y*):=
{ ^x ^ oo
) + (H(x),y*)
ifxGdomtf, if x $. dom H,
is called the Lagrange function (or Lagrangian) associated to problem (Po)- The above definition of L shows that L(x,y*) — f(x) + (y* o H){x) with the convention that y*(oo) = oo for y* G Q+, convention which is used in the sequel. By what was shown above, we have that = 8up(-L(x,yt)) = -hdL(x,y*). x x xex ^ Therefore, the dual problem of problem (Po) (see Section 2.6) is V y * e Q + : &(0,-y*)
(£>o)
max ( - $*(0, *)),
y*£Y*,
or equivalently, (Do)
max (mfxeX L(x,y*)),
y* £ Q+•
We have the following result. Theorem 2.9.2 Let f G A(X), x G domf and H : X -> (Y',Q) be a Q-convex operator, where Q
3x0edomf
: -H{x0)
€ intQ.
Then the problem (D0) has optimal solutions andv(Po) = v(Do), i.e. there exists y* G Q+ such that inf{/(x) | H(x)
\ x G X}.
Furthermore, the following statements are equivalent: (i) x is a solution of (Po); (ii) H(x)
+ y*o H)(x)
and
(H(x),y*)
= 0;
(hi) there exists y* G Q+ such that (x, y*) is a saddle point for L, i.e. Vx E X, Vy* G Q+ : L(x,y*) < L{x,y*) < L{x,y*)-
(2.77)
Proof. The Slater condition (S) ensures that (XQ, 0) G dom $ and $(x0, •) is continuous at 0, where $ is the function denned by Eq. (2.76). Applying Theorem 2.7.1, there exists y* G Y* such that v(P0) = - $ * ( 0 , - y * ) . If v(Po) = —oo, then $*(0,2/*) = oo for every y* G Y*; in this case we may
Convex optimization with constraints
139
take y* = 0. If f(Po) > —oo, using the expression of $*, we have that y* e Q+. Therefore v(P0) = inf{/(x) | H(x) < Q 0} = - $ * ( 0 , - | T ) = i n f { i ( x , r ) I a; € X } = v(D0). (i) =S- (ii) Of course, x being a solution for (P 0 ), we have that H(x)
< f(x) < f(x) + (H(x),y*)-
(2.78)
Relation (2.78), without the term from its middle, says that x is a minimum point for f + y* o H, and so 0 € d(f + y* o H)(x); taking x = x in relation (2.78) we obtain that (H{x),y*) - 0. (ii) =>• (hi) Since 0 € d(f + y* o H)(x) we have Vx G X : L ( x , r ) = /(x) + (^(x),y*) < f{x) + (H(x),y*)
= LfoiT).
Furthermore, for y* £ Q+ we have that L(x,y*) = f{x) + (H(x),y*)
< f(x) = f{x) + (H{x),y*)
=
L(x,?).
Therefore Eq. (2.77) holds, i.e. (x,y*) is a saddle point of L. (iii) => (i) Taking successively y* = 0 and y* = 2y* on the left-hand side of Eq. (2.77) we obtain that (Hix),y*) = 0. Using again the left-hand side of Eq. (2.77), we obtain that (i?(x), y*) < 0 for every y* e Q+; thus, using the bipolar theorem (Theorem 1.1.9), we have that —i?(x) € Q++ = Q, i.e. x is an admissible solution of (P 0 ). From the right-hand side of relation (2.77) we get Vx e X, H{x)
< f(x) + (H{x),F)
Therefore x is solution of problem (Po)-
<
fix). D
The element y* € Q+ obtained in Theorem 2.9.2 is called a Lagrange multiplier of problem (P). Note that when H is finite-valued (i.e. d o m # = X) and continuous, V x e d o m / : dif +
y*oH)ix)=8fix)+diy*oH)ix);
Convex analysis in locally convex spaces
140
the fact that Q is closed was used only for the implication (iii) =4- (i) of the above theorem, to prove that x is an admissible solution. An important particular case is when there are a finite number of constraints. Let / , g i , . . . ,gn 6 A(X) and consider the problem (Pi)
min f(x),
gi{x) < 0,1 < i < n.
The dual problem of (Pi) is (Di)
max mix€X(f(x)
+ \igi(x)-]
hA n (/ n (i)), Ai > 0 , . . . , A„ > 0.
Then the following result holds. Theorem 2.9.3 dition holds, i.e.
Let f,gi,...,gn
€ A(X). Suppose that the Slater con-
3xo € d o m / , \/i£l,n
: gi(x0) < 0.
Then: (i) the dual problem (Di) has optimal solutions and u(Pi) = i.e. there exist (Lagrange multipliers) A i , . . . , A„ £ M+ such that
v(D{),
mf{f(x)\gi(x)<0,...,gn(x)<0} -ini
{f(x)+J1g1(x)-\
+ Xn9n(x) \ x 6 X) .
(ii) Let x 6 dom / ; x is a solution of problem (Pi) if and only if gi(x) < 0 for every i 6 l , n and there exist A i , . . . , An € K+ such that Xigi(x) = 0 for i £ l , n and 0 € d (/ + Aiffi + . . . +
\XJ-
*ngn)
If the functions g\,..., alent to
gn are continuous at x, the last condition is equiv-
0 € df(x) + A-!0ffi(2c) + . . . +
\ndgn(x).
Proof. Let us consider H : X -> (Mn*,E!J:), H(x) := (gi(x),.. .,gn(x)) for x G n r = i d o m 5 j , i?(x) = oo otherwise; H is R™-convex. The result stated in the theorem is an immediate consequence of the preceding theorem. When gi are continuous at x one has, by Theorem 2.8.7 (iii), that d(f + Aiffi + . . . + \n9n)(x) and d(\igi)(x)
= Jidg^x).
= df(x) + d(Xigi)(x)
+ ... +
d(\ngn)(x), D
Convex optimization with constraints
141
Corollary 2.9.4 Let f,gi,--.,gn '• X —> R be Gateaux differentiable continuous convex functions. Suppose that there exists x$ £ X such that 9i{xo) < 0 for every i 6 l,n. Then x £ X is solution of problem (Pi) if and only if gi(x) < 0 for every i £ l , n and there exist A i , . . . , An £ R+ such that —\7f(x)=\iVgi(x) Corollary 2.9.5 N([g < l\,x)
+ ... + \nVgn(x)
and \igi(x) = 0 for i £ l,n.
•
Let g 6 A(X), 7 £ ] inf g, 00 [ and S £ [5 < 7]. Tften = \J {d(Xg)(x) | A > 0, X(g(x) - 7) = 0}.
(2.79)
Proof. The inclusion "D" in Eq. (2.79) holds without any condition on g and 7. Indeed, let x* £ d(Xg)(x) for some A > 0 with X(g(x)— 7) = 0. Then for x £ [g < 7] we have that (a: — x, x*) < Xg(x) — Xgix) = X(g(x) — 7) < 0, and so x* £ N([g < ^],x). Let x* £ N([g < j];x). Then x is a solution of the problem (Pi)
min {x, —x*), h(x) := g(x) — 7 < 0,
whence, by Theorem 2.9.3, there exists A > 0 such that Xh(x) = X(g(x) — 7) = 0 and 0 G d(-x* + Xh){x), i.e. x* £ d{Xh){x) = d(Xg)(x). Therefore the inclusion " c " of Eq. (2.79) holds, too. • Note that d(Xg)(x) = Xdg(x) if A > 0 and d(0g)(x) = didomg(x) = N(domg;x). In the case of normed vector spaces, taking A = X in Proposition 3.10.16 from Section 3.10, we have another sufficient condition for the validity of formula (2.79). Note that we can obtain the characterization of optimal solutions in Theorem 2.9.3, when the functions gi are continuous, using Corollary 2.9.5 and the formula for the subdifferential of a sum. Indeed, x £ dom / is a solution of (Pi) if and only if x is minimum point of the function / + icx + htc„, where Cj := {x | gi(x) < 0}. By hypothesis d o m / f l f)" = 1 int d ^ 0, and so, for every x £ d o m / (~l H i l i ^ i w e have d(f + tCl+...
+ icn) (x) = df{x) + N(Cf,x)
+ ... +
N(Cn;x).
Using formula (2.79) we obtain the desired characterization. Theorem 2.9.2 can be extended further to the case when there are also linear constraints. Let us consider the problem (P 2 )
min f(x),
H{x)
Convex analysis in locally convex spaces
142
where T G £ ( X , Z). Consider the Lagrange function associated to problem
L2 : X x (Q+ x Z*) - • 1 ,
L 2 ( a M , V ) := /(a:) + (H(x),y*)
+
(Tx,z*).
The dual problem associated to (P 2 ) is (£>2)
max (mixeXL2(x,y*,z*)),
y* G Q+, z* G Z*.
Then the following result holds. Theorem 2.9.6 Let f G A{X), H : X -> (Y,Q) be a Q-convex operator, where Q C Y is a closed convex cone, T G L(X,Z) be a relatively open operator and x G d o m / . Suppose that there exists XQ € d o m / such that f,H are continuous at xo, —H{XQ) G int<5 and TXQ = 0. Then the problem (D2) has optimal solutions and V{PQ,) = v{D2), i.e. there exists y* G Q+, z* G Z* such that inf{/(x) I H(x)
\ x G X).
Furthermore, the following statements are equivalent: (i) x is solution of problem (P2); (ii) H(x)
and (H(x),y*)
= 0;
(iii) there exists (y*,z*) G Q+ x Z* such that (x,(y*,z*)) point of L2, i.e. L2(x,y*,z*)
< L2{x,y*X)
<
is a saddle
L2{x,y*,T)
for all x G X and all y* G Q+, z* G Z*. Proof.
Let us consider the perturbation function
*:Ix(7xZ)->I, v
;
'
f(x) $(1,y,z) := , y y ^ ' ' ' 00
if H(x)
We intend to prove that condition (i) of Theorem 2.7.1 is verified. Note first that P r y x Z ( d o m $ ) = {{H{x) + q,Tx) | x G d o m / , q £ Q} C Y x ImT.
A minimax
theorem
143
Taking into account that / is continuous at xo G d o m / , there exists Uo G N x ( i o ) such that Vxe%
: f{x)<M:=f(x0)
+ l.
Since —H{XQ) G intQ, there exists Vo G Ny such that —H(XQ) +VQCQ. There exists V € Ny such that V + V CV0. Since if is continuous at x0, there exists {/ € Nx(zo)) [/ C t/o, such that H(x) € H(x0) — V for every x € U. Since T is relatively open, there exists W G NimT such that W C T(LT). Of course, V x W G N y x i m r ( 0 , 0 ) ; let (j/,z) G V x W. There exists x £U such that Tx = z. Since x G f/, we have that i ? ( i ) = H(xo) — y' for some j / ' G V. Then y - H(x) =y + y'-
H(x0) G - # ( x 0 ) + V + V C - # ( x 0 ) + V0 C Q;
hence -ff(x) < Q y. Since x £ U C Uo, we have that $(x,2/,z) < M, i.e. condition (i) of Theorem 2.7.1 is verified. Therefore there exists (y*,z*) G Y* x Z* such that v(P2) = -4>*(0, -y*, -z*). But $*(0, -y*, -z") = s u p ( w ) 6 X x y x Z ( ( i / , -y*) + (z, -z*) - $ ( x , y , z)) = sup{-(y,y*)
- (z,z*) - f(x) | H ( i )
= s u p { - ( # (x) + q, y*) - (Tx, z*) - f(x) = - inf{f(x) + (H(x),y*)
\xeX,qeQ}
+ (Tx, z") \ x G X } + sup{-(q,j/*) | q G Q}.
Thus **cn _„* —?*\ - J - i n f i e x ^ C i . y * , ^ * ) * l U ' y ' z ) ~ \ oo
if if
2/* e Q + , j / * £ Q+.
Therefore y* G Q + if u(P 2 ) G K and we can take j7* = 0 if v(P2) = - o o . The proof of the second part is completely similar to that of Theorem 2.9.2. We note only that, / and H being continuous at xo, we have d(f + y* o H)(x) = df(x) + d(y* o H)(x) for every x G d o m / . •
2.10
A Minimax Theorem
In the preceding section we have obtained some results which are stated using saddle points. In the sequel we establish a quite general result on the
144
Convex analysis in locally convex spaces
existence of saddle points as well as a minimax theorem. Let A and B be two nonempty sets and / : A x B -> JR. It is obvious that sup inf f(x,y) xeAvtB
< inf sup f(x,y). y£BxeA
(2.80)
The results which ensure equality in the preceding inequality are called "minimax theorems," the common value being called saddle value. Note that if / has a saddle point, i.e. there exists (x,y) € A x B such that VxeA,Vy&B
: f(x,y)
< f(x,y)
< f(x,y),
(2.81)
then max inf f(x,y) x£A yeB
= minsup f{x,y), yeB
(2.82)
xeA
where max (min) means, as usual, an attained supremum (infimum). Indeed, let (x,y) € Ax B satisfy Eq. (2.81). It follows that ini: sup f(x,y) v£Bx€A
< sup f(x,y) xeA
< f(x,y)
< infB :(x,y) < sup infB y£ xeAv&
f{x,y).
Using Eq. (2.80) we obtain that all the terms are equal in the preceding relation, and so (2.82) holds (the maximum being attained at x and the minimum at y). Conversely, if Eq. (2.82) holds then / has saddle points. Indeed, let x G A and y e B be such that inf (x,y) = sup ini' f(x,y) y£B
xeAVZB
= irtf swp f(x,y) y€Bx€A
= sup
f(x,y).
x£A
Since inf y 6 s(x,y) < f(x,y) < supx€Af(x,y), from the above relation it follows that all the terms are equal in these inequalities; this shows that (x, y) is a saddle point of / . So we have obtained the following result. Theorem 2.10.1 Let A and B be two nonempty sets and f : A x B ->• R. Then f has saddle points if and only if condition (2.82) holds. • The following result is an enough general minimax theorem (frequently utilized) which gives also a sufficient condition for the existence of saddle points. Theorem 2.10.2 Let X be a locally convex space, Y be a linear space, Ac X be a nonempty convex compact set and B CY be a nonempty convex set. Let also f : A x B -)• M. be a function with the property that f(-,y)
A minimax
theorem
145
is concave and upper semicontinuous for every y G B and f(x, •) is convex for every x E A. Then max inf f(x,y)
— inf ma,xf(x,y).
X&A y£B
y£B
(2.83)
xEA
If moreover Y is a locally convex space, B is compact and f(x, •) is lower semicontinuous for every x € A, then maxmmf(x,y)
= min max f(x,y);
xeA y£B
yeB
(2.84)
x£A
in particular f has saddle points. Proof. Let a e K be such that a > max x£j 4 inf ye B f(x,y). For every x S A there exists yx 6 B such that f(x,yx) < a. Since f(-,yx) is upper semicontinuous at x, there exists an open neighborhood Vx of x such that f(u,yx) < a for every u G Vx. Since A is compact and A C UxeA ^ > there exist x\,..., xp € A such that A C U?=i Vxt • Let 2/j := yXi • Consider the sets d
:=co{{f(x,yi),...,f(x,yp))\xeA}cW,
C2 := {(ui,...,up)
\v,i>a\/i€
l,p} .
It is obvious that C\ and Ci are nonempty convex sets, and C2 has nonempty interior. Moreover C\ fl Ci = 0. In the contrary case there exist q 6 N, ( A i , . . . , A?) 6 Aq and Xi,..., xq € A such that V i e l . p : a < Zf := 2^,
>^jf{xj,yi).
Since f(-,yi) is concave, taking xo = 13?=i ^ j 2 ^ ' Vi G l , p : a < Zj <
we nave
that i o £ i and
f(x0,yi).
There exists io £ l , p such that £0 £ VXi . Therefore f{xo,yi0) < a, a contradiction. Applying Theorem 1.1.3 there exists (/zi,..., fj,p) € W \ {0} such that . ,mf(x,yi) t=l
< V
UiUi.
*—'i=l
Letting u; -> 00 we have that /ij > 0 for every i, and so we can suppose that (/xi,..., Up) € A p . Taking u = ( a , . . . , a) we obtain that ^xeA
: > .
Uif{x,yi)
< a.
Convex analysis in locally convex spaces
146
Let 2/0 := Y%=i ViVi € B. Since f(x, •) is convex, we obtain t h a t f(x, y0) < a for every x € A. Therefore a > i n f j / e B m a x x g ^ / ( a ; , 2 / ) , and so (2.83) holds. In Eq. (2.83) the suprema with respect t o x are attained because the functions f{-,y), y £ B, and i n f y e s f(-,y) are upper semicontinuous and A is compact. W h e n B is compact and the functions f(x,-) are lsc for all x € A we obtain t h a t the infima with respect to y € B are attained in Eq. (2.83), i.e. Eq. (2.84) holds. • Note t h a t the convexity assumptions in the preceding theorem can be weakened. More precisely, we can suppose t h a t A is a nonempty compact subset of a topological space, B is nonempty, and / satisfies t h e following two conditions (similar to concavity and convexity, respectively):
Vxi,...,xp
e A, V ( A i , . . . , A p ) e A p , 3x0
&A,\fyeB
:
f(xo,y) > y \
,Kf(zi,y),
• ' — ' 1 = 1
\/yi,...,yq
GB,
V ( / / ! , . . . , / * , ) € A , , 3y0 eB,Vx£A
: v—yQ
f(x,yo) 2.11
<
2^,.=iHjf{x,yj).
Exercises
Exercise 2.1 Let X be a linear space, / € A.(X) and x,u £ X. Prove that the mapping %p :]0, oo[—• R defined by ip(t) :— t • f(x + t~lu) is convex. E x e r c i s e 2.2 (a) Let I C R be an interval and / : / — > • R. Suppose that / is locally nondecreasing, i.e. for every a € I there exists e > 0 such that the restriction of / to IC\[a — s, a + e] is nondecreasing. Prove that / is nondecreasing. (b) Let g : [a, b] —• R (a, 6 € R, a < b) be a continuous function. 1) Assume that g'+{x) g R exists for every x € [a,b[; show that there exists x' £ [a,b[ such that g(b) — g(a) < g'+(x')(b — a). 2) Assume that g'-(x) € R exists for every x S ]a, 6]; show that there exists x" € ]a, b] such that g(b) — g(a) > gL (x")(b — a). (c) Let X be a locally convex space and A C X be an open convex set. If / : A —> R is locally convex, i.e. for every a G A there exists a neighborhood V of a such that f\vnA is convex, prove that / is convex. Exercise 2.3
Let / : R n -*• R be a quasi-convex function and i ^ f .
For
Exercises x eRn
147
consider the function fx : R n ->• R, fx(t) = f(x + tx). Prove that / is lsc at i t t V i e l "
: fx
is lsc at 0,
/ is use at J o V i e R "
: fx, is use at 0.
If R n is replaced by an infinite dimensional normed space then the above properties do not hold, generally. Exercise 2.4 Let X be a linear space, / : X —> R be a convex function and A > i n f i e x fix)- Prove that {x G X | f(x) < \y
= {x G X | f{x) < A}.
Moreover, if X is a topological vector space and / is continuous, then int{x € X | f(x) < A} = {x G X | / ( z ) < A}. Exercise 2.5 Let X be a topological vector space and / : X —> R a convex function. Prove that: ( a ) [ / < * ] = c l [ / < *] f° r every t G ] inf / , oo[ if and only if / is lsc; (b) [ / < * ] = i n t [ / < *] for every t G ] inf / , cx)[ if and only if / is continuous on d o m / ; ( c ) [/ = *] = b d [ / < t] for every t G ] inf / , oo[ if and only if / is continuous (on X ) . Exercise 2.6 Let / : R" —)• R be a proper convex function for which all the partial derivatives df/dxi exists at a G int(dom/). Prove that / is Gateaux differentiable at a (even Frechet differentiate because / is Lipschitz on a neighborhood of a). Exercise |1 + t\p + is strictly and (f2 is
2.7 Let p G [1, oo[ and
E x e r c i s e 2.8
Take /3 G [1, <x[ and consider the function
/:R2^I,
2
f(x,y):={ (^tan|)^V^T^ [ oo
for x, y > 0, otherwise,
where, by convention, arctan | := ^ for a; > 0. Prove that / is a lsc convex function, but not strictly convex. Exercise 2.9
Consider the function
/: c[o, i] -> R,
fix) := /o1 y r r w o F ^ -
Convex analysis in locally convex spaces
148
Prove that / is convex and Frechet differentiable of order 2 on C[0,1] with
V/(x)(«) = f1 -£L= dt, Jo V l + x
V2f(X)(u, V) = ^
2
,7/^3-2 dt 2
Jo ( l + 2 : ) v l + z 2
for all u,v £ C[0,1]. Moreover, V / is continuous on C[0,1]. E x e r c i s e 2.10
Consider the function /:L1(0,1)->R,
/ ( i ) : = / 0 V l + (i(t))!*,
where L 1 (0,1) is the Banach space of (classes of) Lebesgue integrable functions on the interval [0,1]. Prove that / is a Gateaux differentiable convex function with xu Vx.uSL^O.l) : V/(x)(u) = / :dt, Jo
%/rn
but / is nowhere Frechet differentiable. Exercise 2.11
Using properties of convex functions, prove that
V a , 6 e R + , V p , g e ] l , o o [ , £ + ^ = 1 : ab < ±a p + \b\ the equality being valid if and only if ap = bq, and V n € N, V x i , . . . , x „ 6 R + , V a i , . . . , a „ 6]0,1[, a i + . . . + a „ = 1 : a m + . . . + a n x „ > X™1 • • -xZn, the equality being valid if and only if x\ = • • • = xn. In particular, VneN, Vsi,...,x„ €R+
: £(xi + . . . + xn) >
yX\---xn.
E x e r c i s e 2.12 Let / : X —>• R be a proper function. Prove that / is convex if and only if / + x* is quasi-convex for every x* G X*. Exercise 2.13 Let <x\,... ,an > 0 (n > 1) be such that ai-\ \-an < 1. Prove that the function / : P " —> R, / ( x ) := x"1 • x%2 • • • x"n, is concave; moreover, if a\ + • • • + a„ < 1, then / is strictly concave. E x e r c i s e 2.14 Consider / : R n ->• R, f(xi,...,x„) Prove that / is convex. Exercise 2.15
:= In ( X £ = 1
expxk).
Consider p € R \ {0} and the function xP"->R,
f{t,x):=
t"
nr=i*i'
Prove that / is strictly convex for p € ] — o o , 0 [ U ] n + l , oo[ and convex if p = n + 1.
149
Exercises Let c G R n and A : = { i £ P " \(x\c)> g : A -> R,
0}. Prove that the function
p(x)
(x\c)"
niu^
is strictly convex for p > n + 1 (it is possible to prove that g is strictly convex for p > n). The function g has been used by Karmarkar for establishing his interior point algorithm. Finally prove that the function
h : P n ->• R,
h(x) := (T[n xA
is strictly convex. Exercise 2.16 Let tp : R+ —> R+ be a continuous convex function such that ^i(t) = 0 •«• t = 0. Show that
Jo
i>'- (V>- x «))
=
7o
V-H^-H*))
for every a > 0. Exercise 2.17 function
Let X, Y be normed spaces and T G L(X,Y). f:Y^R,
f(y):=M{\\x\\\Tx
Prove that the
= y},
is a sublinear functional. Moreover, if T is an open operator, then dom f = X and / is continuous. Exercise 2.18 Let / : Rk —> R be a strongly coercive proper lsc function. Prove that co(epi/) is a closed set. E x e r c i s e 2.19 Let X be a locally convex space, / G T(X), g : X —> R be a proper function and a, /3 > 0. Prove that co (co(/ + ag) + jig) = co ( / + ( a 4- /3)g) and ( ( / + ag)** + fig)" = (f + (a + fig)* . Exercise 2.20 Let X be a separated locally convex space and A,B,CcXbe nonempty sets. If C is bounded and A + C C B + C prove that A C coB. Exercise 2.21 Let (X, \\-\\) be a normed space, / G T(X) and L > 0. Prove the equivalence of the following statements: (i) d o m / = X a n d V x , t / G X (ii) 3 a GR, V i € X (iii)
VMGX
: \f(x) - f(y)\ < L \\x - y||;
: / ( x ) < L ||x|| + a;
: /oo(u)
Exercise 2.22 Let X be a locally convex space and / € T(X). foo(u) = sup{/(z + u) — f(x) I a; G d o m / } for every u 6 l .
Prove that
150
Convex analysis in locally convex spaces
E x e r c i s e 2.23 Let X, Y be separated locally convex spaces, / € r ( X ) , F € F(X x Y), x € dom / and A £ R, e £ R+ be such that [/ < A] and d£f(x) are nonempty. Prove that: [/ < A]oo = [foo < 0], ( a e / ( x ) ) 0 0 = N{domf;x), (/*)«, = Sdom/, foe = Sdom/* and {v* £ y* | (F*)oo(0,t;*) < 0} = ( P r y ( d o m F ) ) . In particular, if {v* £ Y* | (F*) o o (0,u*) < 0} is a linear subspace then {0} and Pry (dom F) are united. E x e r c i s e 2.24 Let X be a separated locally convex space and / £ I \ X ) . Consider K := {u £ X | /«,(w) < 0} and X 0 := K D (-.K"). Prove that: (i) if is a closed convex cone, Xo is a closed linear subspace and f(x + u) = f(x) for all a: 6 X and u 6 Xo. (ii)
lin(dom/*)=X6L.
(iii) The function / : X / X 0 -»• I , f(x) := f(x), is well defined, fe r ( X / X 0 ) , /OO(M) = /oo(«) for every u € X, f* = f*\x± and 5 £ / ( £ ) = 9 E /(a;)| X j. for all x € X and £ 6 R+, where x represents the class of x £ X. E x e r c i s e 2.25 Let X be a separated locally convex space and / £ T(X). Assume that there exists u £ X such that /<X>(M) < 0 < fco(-u). Prove that for every e > 0 there exists fe £ T(X) such that f(x) < fc(x) < f(x) + s for every x e X and argmin/ e = 0. E x e r c i s e 2.26 Let X, Y be separated locally convex spaces and F 6 A(X x Y). Assume that F satisfies one of the conditions (ii)-(viii) of Theorem 2.7.1. Prove that the marginal function g : X* —¥ R, g(x") := inf y * e y* F*(x*,y*) is convex, iu*-lsc, the infimum is attained for every x* £ X* and goo(u") = mmv*ey(F*)oo(u* ,v*). Moreover, {v* £ Y* \ (F*)oo(0, «*) < 0} is a linear subspace. E x e r c i s e 2.27 (Toland-Singer duality formula) Let X be a separated locally convex space, / : X —>• R be a proper function and g £ T(X). Then inf (/(*) - ff(i)) = .inf. (g\x')
- f*{x'))
•
E x e r c i s e 2.28 Let X be a separated locally convex space, / £ T(X) be such that 0 € d o m / a n d p £ P. Consider the following assertions: (i) the mapping P 3 t i - > t~pf(tx) (ii) f'(x, —x) +pf(x) (iii) pf(x) (iv)
is nondecreasing for every x £ d o m / ;
< 0 for every x £ d o m / ;
< f'(x,x)
for every x € d o m / ;
{x,x*) >pf(x)
for all (x,x*) € g r d / ;
(v) for every x £ int(dom / ) there exists x* 6 df(x) such that (x, s*) > pf(x). Prove that (i) <=> (ii) <=> (iii) =$- (iv). Moreover, if / is continuous at 0 then all five assertions above are equivalent.
Exercises
151
E x e r c i s e 2.29 Let / : X —> IR be a function such that c o / is proper. Assume that for some x € d o m c o / we have that x = ^2i=i ^»^« a n ( ^ co/(ic) = ^ * = 1 ~\if{x~i) with Xi G d o m / , Ai > 0 for i € ITfc and £)* = 1 ^> = 1- Prove that 8c5/(3f) = n t i df(xi). Exercise 2.30 Let X be a separated locally convex space and / € A(X). Assume that 0 G d o m / and / | x 0 is continuous at 0, where Xo •— aff(dom/). Prove that f = sup{(x,x*) |x* G<9/(0)W < { =
f'(0,x) f'(Q,x) = oo / ' ( 0 , x ) = oo
if if if
xGXo, xGXo\_Xo, xeX\X0,
the supremum being attained for x G Xo. Moreover, Xo is closed if and only if V x G X : / ' ( 0 , x ) = s u p { ( x , x * ) | x* 6 3/(0)} . E x e r c i s e 2.31 Let X be a separated locally convex space and / G A(X) be continuous on i n t ( d o m / ) , supposed to be nonempty. Prove that for all x,y € int(dom/) there exist z £]x,y[and z* G df{z) such that f(y)—f{x) = (y — x, z*). E x e r c i s e 2.32
Let X be a separated locally convex space and / G T(X).
(i) Consider the conditions: a) df is single valued on dom.3/, b) (df)~1(xl)!~) (df)~1(x2) = 0 for all distinct elements Xi,X2 G X*, c) /* is strictly convex on every segment [xi,X2] C Imdf, d) dom df = int(dom/). Prove that a) <=> b) •<=>• c); moreover, if int(dom/) ^ 0 and / is continuous on int(dom/) then a) => d). (ii) If / is continuous on int(dom/) and ( 9 / ) _ 1 is single-valued on show that / is strictly convex on int(dom/).
Imdf,
E x e r c i s e 2.33 Let X, Y be separated locally convex spaces and / G A(X x Y). Prove that if / is continuous at (xo,j/o) £ d o m / , then Prx* (df{x0, j/o)) = df(-,yo)(x0) Exercise 2.34 Consider
and
P r y ( d / ( x 0 , y 0 )) = d / ( x 0 , -){yo)-
Let X be a separated locally convex space and / o , / i 6 A(X).
v := inf{/ 0 (x) | / i ( x ) < 0},
v* := sup inf (/o(x) + A/i(x)), x>o xex
with the convention 0 • oo = oo. Suppose that » £ R . Prove that v* =v&[rfe>Q
: inf{/i(x) | / 0 (x) < y - e} > 0].
E x e r c i s e 2.35 Let {X, \\-\\) be a normed linear space and / : X —> R be a proper convex function. Prove that there exists x* G X* with x* < / if and only if there exists M > 0 such that f(x) + M \\x\\ > 0 for all x G X.
Convex analysis in locally convex spaces
152
Exercise 2.36 Let (X, ||-||) be a normed linear space and / i , / 2 : X -> R be proper convex functions. Prove that there exist x* 6 X* and a € R such that —/i < x" + a < J2 if and only if there exists M > 0 such that fi(xi) + f2(%2) + M\\xi - x21| > 0 for all on, x2 € X. E x e r c i s e 2.37 Let X be a linear space, (Y, ||-||) be a normed linear space, T : X -* Y be a linear operator, yo 6 Y and / : I -> R be a proper convex function. Prove that f(x) + \\Tx + yo\\2 > 0 for all x € X if and only if there exists j / * € y * such that fix) -2(Tx + y0,y*) - ||y*|| 2 > 0 for all x e X. E x e r c i s e 2.38 Let (X, ||||) be a normed vector space, C,D C X be closed convex cones and x G X, x* € X*. Prove that d(x,C)
: = i n f { | | x - x | | | x € C} = m a x { - ( x , x * ) | x* 6 Ux* n C + } ,
d ( x * , C + ) = m i n { | | x * - x * | | | a;* 6 C + } = s u p { - ( x , x * ) | x G C/x n C} , a,ndsupx€Uxncd(x,D)
= supx,eUx,nD+
d(x* ,C+).
E x e r c i s e 2.39 Let / € A(X) and x € d o m / be such that f(x) Consider the sets d
>
inf/.
:= {z* € X* | ( i - x,x*) < f(x) - /(a?) Vx e [/ < /(*)]},
d ^ / ( x ) := {x* a ' l f i - f , x*) < f{x) - / ( J ) Vx € [/ < /(*)]}. Prove that dKf{x) = d-f(x) differential of / at x.
= [1, oo[-d/(x). The set a < / ( x ) is Plastria's sub-
E x e r c i s e 2.40 Let X be a normed space, (an)n>i C X and (A n )„>i C [0,oo[ be such that ^ ^ L j A n = 1. Consider the function
/:X->R,
f{x) = Y°°
A„||x-a„|| 2 .
*•—'n=l
1) Prove that the following statements are equivalent: (a) S^Li-^i|l a "ll 2 < oo (i.e. 0 € d o m / ) , (b) d o m / ^ 0, (c) d o m / = X. 2) Prove that / is finite, convex and continuous when 5Z^Li^ill a "l| 2 < °°3) Suppose that ]C!!!Li^n|lan||2 < °°- Prove that for every x £ X one has 8f{x) = { 2 V 0 0
A n ||x - On|| xmn I x*n e a|| • IKi - a„) Vn € N ) .
E x e r c i s e 2.41 Let X be a normed space and / 6 T(X). following statements are equivalent:
Prove that the
(a) lim||J.||_too f{x) = 00; (b)
[/ < A] is a bounded set for every A > inf^ex / ( x ) ;
(c) there exists Ao > inf^gx / ( x ) such that [/ < Ao] is bounded;
Exercises
153
(d) there exists a, 0 £ R, a > 0, such that f(x) > a||x|| + /? for every x G X; (e) liminf| )x ||_ foo / ( x ) / | | x | | > 0; (f)
0£int(dom/*).
Moreover, if dim X < oo then the conditions above are equivalent to ( c ')
[/ < iaf /] is nonempty and bounded.
Furthermore, if p, q £ ]l, oo[ are such that 1/p + 1/q = 1, the following statements are equivalent: (g)
liminf|NHoo/(x)/||x||p>0;
(h) l i m s u p | | : c , | H o o r(x")/\\x*\\q
< oo;
(i) there exists a, /3 £ R, a > 0, such that f(x) > a||x|| p + P for every i 6 X ; (j) there exists a,/3 £ R, a > 0, such that /*(x*) < cx||a;*||9 + /? for every a;* e X * . E x e r c i s e 2.42 Let f,fn : R m ->• R (n £ N) be convex functions such that (fn{x)) -¥ f{x) for every x £ R m . Assume that / is coercive. Prove that there exist a,P £ R with a > 0, no £ N such that / n ( x ) > a \\x\\ + /3 for all x G R m and n > no. Exercise 2.43 Let (X, ||||) be a n.v.s. and C C X be a nonempty closed convex set. Consider the following assertions: (i) there exists XQ G X* such that (x,x*,) > \\x\\ for every x G C; (ii) there exist xo £ -X", X*, £ X* such that (x — xo, x*>) > ||x — xo|| for every x eC; (iii) int(dom sc) 7^ 0; (iv) a) there exists x*, G X* such that {u, x*>) > 0 for every u £ Coo \ {0} and b) for every sequence (x n ) C C with (||x n ||) —• oo and ( | | x „ | | _ 1 xn) -^ u one has Prove that (i) => (ii) =S> (iii) => (iv). If 0 £ C then (ii) =>• (i). Moreover, if X is a reflexive Banach space then (iv) =>• (ii). Exercise 2.44 Let X be a normed space and / £ A(X) be lower bounded. Consider A > i n f l € x / ( x ) =: inf / and p > 0. We envisage the conditions: (a)
[f<X}CpU;
(b) V x £ X (c) V x ' G
A
: f(x) > inf / + "2'°f/t/x»
A
~lnf 2p
f
• max{0, l|a?l| - p};
: / * ( * ' ) < / * ( 0 ) + p||x'||.
Prove that (a) =>• (b) & (c). Exercise 2.45 equivalences:
Let X be a normed space and / G T(X). Prove the following
Convex analysis in locally convex spaces
154
(a) liminf||.1.||_+00 / ( x ) / | | x | | > 0 - » 0 £ int(dom/*); in this case f(x) liminf 4f-if = sup{/i > 0 | /* is upper bounded on (J.U*}. ||x||-KX>
||X||
(b) liminf || B | H o o f(x)/\\x\\
= 0 « 0 6 Bd(dom/*).
(c) liminfHJII-KX, f(x)/\\x\\
< 0 O 0 ^ cl(dom/*); in this case
f(x)
liminf ~-r{- = - d d o m / . (0). llxH-KX)
E x e r c i s e 2.46
||Z||
Let p 6 ]1, oo[ and f:£p^R,
/ ( ( * „ ) „ > i ) := Y°°
n\xn\n.
Prove that / is finite, convex, continuous and lim||„||_+0O f*(y)/\\y\\
— 1.
E x e r c i s e 2.47 Let X, Y be Hilbert spaces, C C X be a nonempty closed convex set and A G L(X,Y) be a surjective operator. Consider the problem min i||Ac|| 2 ,
(P)
xeC.
A solution x of problem (P) is called a spline function in C associated to A. (a) Prove that (P) has optimal solutions if C + ker ^4 is closed. (b) Prove that x G C is an optimal solution of (P) if and only if y := A* (Ax) satisfies the relation (x\y) = min{(x \y) \ x € C}. E x e r c i s e 2.48 Let X,Y be Hilbert spaces, C := {x 6 X | Vi, 1 < i < k : (x\a,i) < fii}, where ai,...,ajb G -X', Pi,...,Pk G R and A G £ ( X , F ) be a surjective operator. Consider the problem (P)
min
5 | | A E | | 2 , x G C.
Prove that (P) has optimal solutions. Suppose, moreover, that there exists x € X such that (x \ a{) < /3i for every i, 1 < i < k. Prove that l i s a solution of (P) if and only if there exists (\i)i
V Xkdk a n d V i , 1 < i < A: : A;((x| a») — /?;) = 0.
Exercise 2.49 Let X be a Hilbert space and 01,02,03 be three non colinear elements of X (i.e. they are not situated on the same straight-line). Prove that there exists a unique element x G X such that V i £ l
: | | x - O i | | + | | x - a 2 | | + | | i - a 3 | | < ||x - ai|| + ||x - o 2 || + ||x - a 3 ||.
Moreover, prove that x G co{oi,a2,03}. The point x is called the Toricelli point of the triangle of vertices a\, 02,03.
Bibliographical notes
155
E x e r c i s e 2.50 Determine the optimal solutions (when they exist) and the value of the problem (Pf)
min £ (tx{t) + fiy/1 + (u(t)) 2 ) dt, x e Xi,
for every fj, € R + and every i g {1, 2, 3,4}, where Xi:=C[0,l], X 3 := | x e C [ 0 , l ] | / o 1 x ( t ) d t = 0 J , E x e r c i s e 2.51 (P)
X a : = 2/(0,1), X» := { x G 1/(0,1) | /^ x(t)dt = o } .
Consider the (convex) optimization problem
max J,,1 x(t) dt, xeX,
x(0) = x(l) = 0, J,,1 y/l +{x'(t))2dt
< L,
where L > 0 and X = C^O, 1] := {x : [0,1] —• R | x derivable, x continuous on [0,1]} or X = AC[0,1] := {x : [0,1] -¥ R | x absolutely continuous}. Determine the optimal solutions of problem (P), when they exist, and its value (using, eventually, the dual problem).
2.12
Bibliographical N o t e s
Many results of this chapter are well-known and can be found in several books treating convex analysis: [Rockafellar (1970); Hiriart-Urruty and Lemarechal (1993)] (for finite dimensional spaces), [Ekeland and Temam (1974); Ioffe and Tikhomirov (1974); Barbu and Precupanu (1986); Castaing and Valadier (1977); Phelps (1989); Aze (1997)]. In the sequel we point only those results that are not contained in these books but the last one. Sections 2.1-2.5: The assertion (vii) of Theorem 2.1.5 was established in [Zalinescu (1983b)]. The characterization of convex functions using upper Dini directional derivatives in Theorem 2.1.16 as well as Lemma 2.1.15 are established by Luc and Swaminathan (1993). Theorem 2.2.2 was established by Crouzeix (1980) in finite dimensional spaces and Theorem 2.2.11 by Zalinescu (1978). The notion of quasi-continuous function was introduced by Joly and Laurent (1971); Proposition 2.2.15 and Corollary 2.2.16 are established by Moussaoui and Voile (1997). The notions of cs-convex, cs-closed and cs-complete functions are introduced by Simons (1990), while that of lcs-closed function by Amara and CiligotTravain (1999); all the results concerning lcs-closed functions are due to Amara and Ciligot-Travain (1999). Theorem 2.2.22 (for I = N) and Proposition 2.2.24 can be found in [Kosmol and Muller-Wichards (2001)]; when d i m X < oo Theorem 2.2.22, Corollary 2.2.23 and Proposition 2.2.24 are stated in [Marti (1977)] under the weaker hypothesis that the pointwise boundedness or pointwise convergence holds on a dense subset of C. Proposition 2.2.17 is stated in [Zalinescu (1992b)]. Proposition 2.4.3 is established by Rockafellar (1966), Theorem 2.4.11 is proved in the book [Rockafellar (1970)] in R n and stated by Hiriart-Urruty
156
Convex analysis in locally convex spaces
(1982) in the general case; one can find another proof in [Aze (1997)]. The local boundedness of df in Theorem 2.4.13 and Corollary 2.4.10 can be found in many books on convex analysis. The assertions (iv) and (vii) of Theorem 2.4.14 are established in [Zalinescu (1980)]; for (i) see also [Anger and Lembcke (1974)]. Theorem 2.4.18 is from the book [Ioffe and Tikhomirov (1974)]. Theorem 2.5.2 was proved by Polyak (1966) under the more stringent conditions that / is a quasi-convex function which is bounded and Lipschitz on bounded sets and attains its infimum on every closed convex subset of X. Theorem 2.5.5 and Lemma 2.5.3 are established by Borwein and Kortezov (2001), while Lemma 2.5.4 and other results on non-attaining convex functionals are established by Adly et al. (2001a). Sections 2.6-2.9: The systematic use of perturbation functions for calculating conjugates and subdifferentials was done for the first time by Rockafellar (1974). The author of this book continued this approach in [Zalinescu (1983a); Zalinescu (1987); Zalinescu (1989); Zalinescu (1992a); Zalinescu (1992b); Zalinescu (1999)]; this permitted, for example, to give simpler proofs to several results stated in [Kutateladze (1977); Kutateladze (1979a); Kutateladze (1979b)]. Theorem 2.6.2(i) was established by Moussaoui and Seeger (1994), but Theorem 2.6.2(h) and Theorem 2.6.3 are new. Corollaries 2.6.4-2.6.7 can be found in [Hiriart-Urruty and Phelps (1993)] and [Moussaoui and Seeger (1994)]; for other results in this direction see the survey paper [Hiriart-Urruty et al. (1995)]. The sufficient conditions for the fundamental duality formula and for the validity of the formulas for conjugates and e-subdifferentials are, mainly, those from author's survey paper [Zalinescu (1999)], where one can find detailed historical notes; here we mention only the first use (to our knowledge) of them; actually, all these sufficient conditions are mentioned in that paper, excepting those which use li-convex or lcs-closed functions. So, conditions (iii) and (viii) of Theorem 2.7.1 and the corresponding ones in Theorems 2.8.1, 2.8.3, 2.8.7 and 2.8.10 are the classical ones and can be found in all the mentioned books which treat them. Condition (i) of Theorem 2.7.1 was used by Rockafellar (1974) when Yo = Y and by Zalinescu (1998) in the present form (see also [Combari et al. (1999)]), (ii) and (vi) were introduced in [Zalinescu (1983a)] for Yo = Y (and R replaced by a separated lcs ordered by a normal cone) and in [Zalinescu (1999)] in the present form, (iv) was introduced in [Zalinescu (1992b)] with (Hwi) replaced by (Hx), (v) is stated by Amara and Ciligot-Travain (1999) for X, Y Frechet spaces, $ a lcs-closed function and ,b replaced by lc, (vii) was introduced by Rockafellar (1974) for X, Y Banach spaces (X even reflexive) and Yo = Y, and by Zalinescu (1987) in the present form, while (ix) was used by Joly and Laurent (1971) for $ lsc and by Moussaoui and Voile (1997) for arbitrary 3>; note that condition (b) of [Cominetti (1994), Th. 2.2] implies condition (2.54). Theorem 2.7.4(iv) was established in [Zalinescu (1992b)]; the other conditions were introduced in [Zalinescu (1999)] (but in (v) it was assumed that F is lsc and C is closed). Condition (iii) of Theorem 2.8.1 was introduced in [Ekeland and Temam (1974)], (vii) was introduced in [Zalinescu (1984)] (for Y0 = Y), (iv) was intro-
Bibliographical notes
157
duced in [Zalinescu (1992b)], (i) and (viii) in [Zalinescu (1998)] for X,Y normed spaces, while conditions (ii) and (vi) were introduced in [Zalinescu (1999)]. Condition (vii) of Theorem 2.8.3 was introduced by Borwein (1983) for Yo = Y (see also [Zalinescu (1986)]), (x) was introduced by Borwein and Lewis (1992), (ix) was introduced by Moussaoui and Voile (1997), (i) by Zalinescu (1998) for X, Y normed spaces, (ii) was introduced by Combari et al. (1999), (iv) was introduced by Zalinescu (1999) (a slightly stronger form was used in [Simons (1990)]) as well as conditions (v) and (vi). For C G L(X, Y), generally, Theorem 2.8.6 is obtained under the corresponding conditions of Theorem 2.8.3 (taking / = 0). For C a densely defined closed linear operator, Rockafellar (1974) and Hiriart-Urruty Hiriart-Urruty (1982) obtained the results for g lsc, X, Y Banach spaces and Yo = Y (in [Rockafellar (1974)] X is reflexive and e = 0); Aze (1994) obtained the formula for the conjugate under condition (ii) in normed spaces. For general C condition (iv) was used by Zalinescu (1992b); the other conditions (excepting (v)) were used in [Zalinescu (1999)]. Condition (ix) of Theorem 2.8.7 was used by Joly and Laurent (1971), (vii) for X a Banach space was introduced by Attouch and Brezis (1986), (vi) was introduced by Zalinescu (1992b), (i) was introduced by Aze (1994) in an equivalent form in normed spaces, while (ii) was introduced by Combari et al. (1999). Corollary 2.8.8 was obtained by Voile (1994) for X a Banach space, / , g lsc and '* replaced by !C. Formula (2.65) from Proposition 2.8.9 was obtained by Aze (1994) and Simons (1998b) for x = 0 € ( d o m / — domg) 1 when / , g are lsc. The case / = 0 of Theorem 2.8.10 was considered by several authors; condition (iii) was used in [Hiriart-Urruty (1982); Lemaire (1985); Zalinescu (1983a); Zalinescu (1984)]; the algebraic case was considered in [Kutateladze (1979a); Kutateladze (1979b); Zalinescu (1983a)]; Zalinescu (1984) considered a stronger version of (v) (for lsc functions and Yo = Y); note that in these papers g is assumed to be Q-increasing on the entire space. The general case was considered in [Combari et al. (1994)] under (iii) and a condition stronger than (v) (/, g, H lsc and lc instead of lb) and in [Combari et al. (1999)] under condition (i) and a variant of (iv); it was also considered by Moussaoui and Voile (1997) under condition (vii). The formula for d(p(x) in Corollary 2.8.11 can be found in [Combari et al. (1994)]. The formula for d
158
Convex analysis in locally convex spaces
Section 2.10: The minimax theorem is also classical and can be found in the books [Barbu and Precupanu (1986); Simons (1998a)]. Exercises: Exercise 2.2 is from [Penot and Bougeard (1988)], Exercise 2.3 is from [Crouzeix (1981)], Exercise 2.6 is from [Marti (1977)], Exercise 2.15 is from [Crouzeix et al. (1992)] (there in a more complete form), Exercises 2.18 and 2.29 are from [Hiriart-Urruty and Lemarechal (1993)], Exercise 2.19 is from [Lions and Rochet (1986)], Exercise 2.20 is the celebrated cancellation lemma from [Hormander (1955)], Exercise 2.21 is from [Hiriart-Urruty (1998)], Exercise 2.22 is from [Jourani (2000)], the assertions in Exercise 2.23 are well-known (see also [Aze (1997)]), Exercise 2.25 can be found in [Borwein and Kortezov (2001)] (in normed spaces), the formula for goo under condition (vii) of Theorem 2.8.1 in Exercise 2.26 is obtained in [Amara and Ciligot-Travain (1999)] and [Amara (1998)], Exercise 2.27 is the well-known Toland-Singer duality formula (see [Toland (1978)] and [Singer (1979)]), Exercise 2.30 is from [Zalinescu (1999)], the equivalence of a) and c) (for Banach spaces) in Exercise 2.32 can be found in [Bauschke et al. (2001)] (see also [Barbu and Da Prato (1985)]), Exercises 2.33 and 2.34 are from [Eremin and Astafiev (1976)], Exercises 2.35, 2.36 and 2.37 are from [Simons (1998a)], the last formula in Exercise 2.38 is from [Walkup and Wets (1967)], Exercise 2.39 is from [Penot (1998a)], the formula for the subdifferential of the function considered in Exercise 2.40 and its proof are from [Aussel et al. (1995)], the equivalences (e) •£> (f) and (g) & (h) of Exercise 2.41 are from [Zalinescu (1983b)], the equivalence of conditions (ii)-(iv) in Exercise 2.43 are established in [Adly et al. (2001b)] in reflexive Banach spaces, a weaker variant of the implication (a) => (c) of Exercise 2.44 is stated in [Aze and Rahmouni (1996)], Exercise 2.45(c) is from [Borwein and Vanderwerff (1995)], Exercises 2.47, 2.48 axe from [Laurent (1972)].
Chapter 3
Some Results and Applications of Convex Analysis in Normed Spaces
3.1
Further Fundamental Results in Convex Analysis
Throughout this chapter X, Y are normed spaces and X*,Y* are their duals endowed with the dual norms. As an application of Ekeland's variational principle and of some results relative to convex functions, we state the following multi-propose generalization of the Br0ndsted-Rockafellar theorem. Theorem 3.1.1 (Borwein) Let X be a Banach space and f £ r ( X ) . Consider e £ P, XQ £ d o m / , XQ £ def{xa) and /3 € K+. Then there exist xe £ X, y* € Ux- and Ae £ [—1, +1] such that \\xe - Xo\\ + P • \(X£ - X0,X*0)\ < y/E, x% := x*0 + VE(y* + /3\ex*) £ df{xe).
(3.1) (3.2)
Moreover ||* e - soil < Ve,
\te-xl\\
\(xe-x0,x*)\<e x*e£d2ef(x0),
+ P\\xl\\),
+ JilP,
\f(xe)-f(x0)\<e
(3.3) (3.4)
+ ^/fi,
(3.5)
with the convention 1/0 = oo. Proof.
The function
|| • Ho •. x - • K, \\x\\0~\\x\\
+
p-\(x,x*)\,
is, obviously, a norm on X, equivalent to the initial norm. Therefore {X, ||-||0) is a Banach space. Consider the function g := / — XQ. It is obvious 159
160
Some results and applications of convex analysis in normed spaces
that Xo £ dom = d o m / = dom<9£/ and g is lsc and lower bounded: WxGX
: g{x)>g(x0)-e
[& x*0 £
dsf{x0)].
We apply Ekeland's theorem (Theorem 1.4.1) to g, y/e and the metric d given by d(x,y) := \\x — y\\o. So there exists xe £ domg such that g{xE) + y/e- \\x£ -x0\\o \/x£X,
x^xe
< g(x0),
(3.6)
: g(x£) < g(x) + y/e • \\x - xe\\0.
(3.7)
From Eq. (3.6) we obtain that g(xe) + Ve(\\xs - ar0|| +0-\{xe-
x0, XQ)\) < g(x0) < g(xe) + e,
whence Eq. (3.1) follows immediately. Let us consider the function h : X -> E, h(x) := g(x)+y/e-\\x-xE\\0
-
f{x)-{x,xl)+y/E-\\x-xE\\+Py/e-\(x-xe,xl)\;
by Eq. (3.7) xe is a minimum point of h. Therefore 0 £ dh(xe). Since h is the sum of four convex functions, three of them being continuous, and taking into account the expression of the subdifferential of a norm (Corollary 2.4.16) and of the absolute value of a linear functional (at the end of Section 2.8), we have that 0 £ dh(xe) = df(x£)
- x* + y/i • Ux- + PVe • [ - 1 , +1] • x%.
Therefore there exists x* £ df(xs), y* £ Ux* and Xe £ [—1,1] such that Eq. (3.2) is verified. The estimations from Eq. (3.3) follow immediately from Eqs. (3.1) and (3.2). Using again Eqs. (3.1) and (3.2) we obtain that |(a;e -x0,x*
-XQ)\ = y/e-\{xE
- xQ,y* + p\Ex%}\
< v/i(||z e - x0\\ • W\
+ 0\Xe\ -\(xe-
< y/e(\\xs - x0\\ + fi • \{x£
x0,x*0)\)
-x0,x*0)\)
= e.
(3.8)
From Eqs. (3.1) and (3.8) we get |(a;e -x0,x*)\
< \{xe -x0,x*
-x^)\
+ \(x£
-X0,XQ)\
^IP,
Further fundamental
results in convex
analysis
i.e. the estimation in Eq. (3.4) holds, too. Since x$ G dEf(xo) df(xE), we get (X0 ~Xe,X*£
-XQ)
+ (X0 -X£,X*Q)
= (X0 ~Xe,X*) < (X0
< f(x0)
-XE,XQ)
161
and x* G
~f(Xe)
+£.
Using relations (3.1) and (3.8), we obtain that \f(x0) - f(xe)\
< \(x0-xe,x*0)\+s<e
+ VE//3,
i.e. the second relation in (3.5) holds. Using Eq. (3.8) and the fact that XQ G dEf(x0), x* G df(xE), we obtain that (x-X0,X*e)
= (X-X£,X*)
+ (Xe -X0,X*
-XQ)
+ (XE
-X0,XQ)
< f{x) - f(xe) +s + f(x£) - f{x0) + e - f(x) - f(x0) + 2e for every x G X, i.e. x£ G d2ef(x0); holds, too.
hence the first relation in Eq. (3.5) •
The next result is well-known. The first part is an immediate consequence of Borwein's theorem, while the density part, which follows easily from the first one, will be reinforced in Theorem 3.1.4 below. Theorem 3.1.2 (Br0ndsted-Rockafellar) Let X be a Banach space and f € r ( ^ ) - Consider e > 0 and (XO,XQ) G grd6f. Then there exists (x£,x*) G gr<9/ such that \\x£— XQ\\ < ^Jl and \\X*—XQ\\ < y/e. Inparticular domf C cl(dom<9/) and dom/* C cl(Im<9/). D Another consequence of Borwein's theorem is the following result which will be completed in Proposition 3.1.10 below. Proposition 3.1.3
Let X be a Banach space and f G T(X). Then
f(x) = s u p ^ z - z,z*) + f(z) I (z,z*) G grdf} = sup{(x, z*) - /•(**) | z" G lm(df)}
(3.9)
for every x G d o m / . Proof. The second equality in Eq. (3.9) is obvious because f(z)+f*(z*) = {z, z*) for any (z, z*) G grdf. Let x G dom / be fixed. When (z, z*) G grdf we have (a; — z, z*) + f(z) < f(x) for every i £ l ; hence f(x) > sup{(a; z,z*) + f(z) | [z,z*) egrdf}. Because / is lsc, by Theorem 2.4.4 (hi), there
162
Some results and applications of convex analysis in normed spaces
exists x* € de/2f(x). Applying Borwein's theorem we get {xe,x*) £ grdf such that x* 6 def(x). Therefore (xe — x,x*) < f{xe) - f(x) + e, whence f(x) — e < {x — xe,x*) + f(xe). Hence relation (3.9) holds. • As announced before, the density results in Theorem 3.1.2 can be stated in a stronger form. In the next result (xn) —>/ x means that (a;n) —> x and {f(xn)) -* f(x), and similarly for (xn) ->•/• x*. Theorem 3.1.4
Let X be a Banach space and f € r ( X ) . Then
(i) Vx e d o m / , 3((xn,xn))
Cgrdf
: (a;„) -+/ x;
(ii) Vx* E dom/*, 3 ( ( x „ , < ) ) C gvdf
: « ) - • , . x*.
Proof, (i) Consider x € d o m / . Because / is lsc, by Theorem 2.4.4 (hi), for every n S N there exists x* € dn-2f(x). Applying Borwein's 2 theorem for (x,x*), e = n~ and /? = 1, we get (xn,xn) € grdf such that \\xn — x\\ < n _ 1 , \f(xn) — f(x)\ < n~2 + n~x. Hence (xn) -^/ x. (ii) Because / * is lsc, it is sufficient to show that for every e > 0 there exists (y, y*) e df such that \\y* - x*\\ < e a n d / * ( y * ) < f*(x*)+e. Fixx £ d o m / . Let £> 0 and take r := \\x\\ + 2e-1 (e + f(x) + f*(x*) - (x,x*)) > 0. Consider also the function g := fO(x* + irux) € A(X). Then g(x) > (fnx*)(x) = (x,x*) - f*(x*) for every x € X. It follows that g € T(X) and f(x) + (—x,x*) > g~(0) > —f*(x*). By Proposition 3.1.3 there exists (z,z*) € dg such that -g*(z*) = g(z) - (z,z*) > -f*(x*) - e/2. But + (x* + trUx)*(z*) = f*(z*) + r l|ar* - z*\\. Therefore g*(z*) = f*(z*) f*(z*) +
r\\x*-z*\\
Because f*(z*) > (x,z*) - f{x) > - \\x\\ • \\z* - x*\\ + (x,x*) - f(x), from the preceding inequality we obtain that (r — \\x\\) \\z* — x*\\ < e + f(x) + f*(x*) — (x,x*), and so \\z* - x*\\ < e/2. The inequality above shows also that f*(z*) < f*(x*)+e/2. On the other hand, because (z,g~(z)) € cl(epi^), there exists (zn) C dom / and (un) C rUx such that (zn + un) —> z and limsup(/(z n ) + (u„,a;*»
(3.10)
we have that
0 < £„ = f{Zn) + (U„, X*) - (Zn + Un, Z*) + (Un, Z* - X*) +
f*(z*).
Taking into account that (un,z* — x*) < r \\x* — z*\\, from Eq. (3.10) we obtain that limsup£ n < 0. Hence lime„ = 0. Because (un) is bounded, so
Further fundamental results in convex analysis
163
is (zn). Therefore there exists r' > 0 such that (zn) c r'Ux- Take n E N so that S := en < 1 and VS(r' + 2)(1 + \\z*\\) < e/2; set z := zn. Of course, we have that z* G dsf(z). By Borwein's theorem (for /? = 1) there exists (y,y*) G df such that \\z - y\\ < y/S, \\z* - y*\\ < VS{1 + ||z*||) and |/(«) ~ f(v)\ <S + VS. Hence ||j/* - x*\\ < y/S{l + \\z*\\) + e/2 < e and /*(»*) =
+ f*(z*) + f{z) -
f(y)
< VS(V5 + r') (1 + ||z*||) + \f5 \\z*\\ -5 + f(x*) < f*(x*) + e/2 + VS(r' + 2)(1 + ||«*||) < f*(x*)
+s/2 + 6 + V6 + e.
The proof is complete.
•
Using Br0ndsted-Rockafellar's theorem we can add other sufficient conditions for the validity of the conclusions of Theorems 2.8.1 and 2.8.7. Proposition 3.1.5 Let X,Y be Banach spaces, F £ T(X x Y), A G L(X,Y), D = {Ax - y \ (x,y) € d o m F } and E = {Ax - y \ (x,y) € dom&F}. Then ic
E = iiE = ic(coE) = ri(coE) = icD = riD.
Moreover, if one of the above sets is nonempty then icE = E — D; in particular the sets ICE and E are convex. Proof. Consider the linear operator T : X xY ->• Y defined by T(x, y) := Ax — y. Since domdF C dom.F and d o m F is convex, we have that E = T(domdF)
CcoE
= T(co{domdF))
By Theorem 3.1.2 we have that d o m F C domdF. nuity of T, we have that
CD = T ( d o m F ) . Hence, using the conti-
E C D C T ( d o m d F ) C T(domdF) = E. Using the properties of the affine hull (see p. 2), it follows that aff E C aff D c aff E. Therefore, if aff E is closed then aff D — aff E; the above relation shows that %CE c %CD. Let us show the converse inclusion. Let 2/o G %CD and consider the function F 0 6 T(X x Y), F0(x,y) :— F(x,y — yo)Of course, domF 0 = d o m F + (0,j/o) and dom9F 0 = d o m 9 F + {0,y0). Therefore 0 € i c ( P r y ( d o m F 0 ) ) . Taking
164
Some results and applications of convex analysis in normed spaces
such that dF0(x,Ax) = dF(x,Ax — y0) ^ 0. Therefore yo € E. So we obtained that *D = ICD C E C D, which shows that aff E = aff-D and *D C iE. It follows that icE = icD = ic(coE). As observed after the proof of Theorem 2.8.1, in our situation, if %CD is nonempty we have that ic D = rintD. Suppose that %CD 9^ 0 (for example). From what was proved above, we have that rinti? = lD = icD = rintE = lE = icE C E C D C E. Hence E~ = T°E = D:. The conclusion follows.
•
Remark 3.1.1 Taking into account the preceding proposition, for X, Y Banach spaces and F G T(X x 7 ) , we may add to the sufficient conditions in Theorem 2.8.1 the following conditions: 0 G ri{Ax - y \ (x,y) £ ic
0 £ {Ax -y\(x,y)€ ic
0 € {Ax -y\(x,y)e
domdF}, domdF}, co(domdF)},
y 0 = cone{ Ar - y | (x, y) 6 dom OF} is a closed linear space. Indeed, the first three conditions are, evidently (using the preceding proposition), equivalent to 0 £ lc{Ax — y \ (x,y) £ d o m F } . In the fourth case y 0 = aff {Ax — y\(x,y)e co(dom OF)}, and so 0 e lc{Ax — y\(x,y)e domF}. An important particular case of the preceding proposition is when X = Y, A = Idx and F(x,y) = f(x) + g(y) with f,g 6 T(X); in this case D — d o m / — domg and E = domdf — domdg. Using the Borwein theorem one obtains a formula for the subdifferential of a composition g o A of a lsc convex function and a continuous linear operator without qualification conditions. Theorem 3.1.6 Let X,Y be Banach spaces, A e C{X,Y), g £ T(Y), f = g o A and x 6 d o m / . Then x* 6 df(x) if and only if there exists a net ((yi,yt))ieI C grds such that (yt) ->• y := Ax, (g(yi)) ->• g{y), ((yi-y,y*))^Oand(A*y*)^x*. If X is reflexive one can take sequences instead of nets and impose norm convergence instead of w* -convergence.
Further fundamental
Proof.
Let (fa,y*))i€l
results in convex
analysis
165
C gidg be such that fa) -> y, (fa - y,y*)) ->•
0, {gfa)) -> g(y) and (i4'i/r) ^ x*. Then (y - yhy*) i G I and t / S F . It follows that (x' - x,A*y*) + (y- yhy*)
= (Ax' - yuy*)
< gfa-gfa)
for all
< f(x') - gfa)
for alH G / and x' G X. Taking the limit we obtain that (x' — x,x*) < /(x') - f(x) for all x' G X, and so x* G d/(x). Let now x* G df(x) and consider (£„)„6N 4- 0- Using Corollary 2.6.5 we have that x* G w*-clA*(d£ng(y)) for every n G N. Let A/" be a base of w*neighborhoods of x* and consider / = Nx M with (n, V) y (n1, V) iff n > n' and V CV. Then for all i = (n, V) G / there exists z* G dEng(y) such that A*£* G V. Taking /? = 1 and e = e^ in Theorem 3.1.1, there exists (yi,y*) G 3g such that \\yi - T/|| < e„, ||y* - ^ | | < en, \g(yt) - g(y)\ < en(en + l) and \(Vi ~ V,V*)\ < en(en + 1). Because \\A*z* - A*y*\\ < \\A*\\ • \\y* - z*\\ < en \\A*\\ and (A*z*)ieI ^> x*, we obtain that (A*y*)i€l ^> x*. If X is reflexive then w*-cli*(9 fB g(j/)) = cl A* (dSng(y)), and so, for every n G N, we can take z*n G dEng(y) such that ||A*z* — x*|| < e„. The conclusion follows. • Using the preceding result one obtains a similar formula for the subdifferential of the sum of two lsc proper convex functions. Theorem 3.1.7 Let X be a Banach space, / i , / 2 G T(X) and x G dom / i fl dom / 2 . Then x* e S f / i + M (x) if and only if there exist two nets {(xk,i>xt,i))ieI C gr<9/fc, k = l,2, such that ( x M ) i e / ->• x, (fk(xk,i))ieI -> /fc(x), ( ( x M - x , x ^ i ) ) . e / -^0 for k = 1,2 and (x*^ + x*2x*. 7/X is reflexive one can take sequences instead of nets and impose norm convergence instead of w* -convergence. Proof. We apply the preceding result iorY ~ XxX, A: X -> Y defined by Ax := (x,x) and# : Y -> E defined by #(xi,x 2 ) := /i(xi)+/2(x2). Then / := /1+/2 = g°A. The sufficiency is immediate by taking y = (x, x) = Ax, Vi = (xi,i,X2,i) and y* ~ (xl^xlj) for i G i". Let x* G df(x). By Theorem 3.1.6 there exists a net ((yi,y*))ieI C gvdg verifying the conditions of the theorem with y := (x,x). Taking yt = (xi ) j,x 2 ,i) and y* = ( x ' ^ x ^ j ) and taking into account that dg(x\,X2) — dfi(x{) x 0f2(x2), we have that ((xk,i,x*ki))ieI C gvdfk and (xk,i)iei -»• x for k = 1,2. Assume that (fi(xi,i)) A fi(x)- Because /1 is lsc at x, limsup i 6 / /i(xi,i) > /i(x). Thus
166
Some results and applications of convex analysis in normed spaces
there exists S > 0 such that J := {i £ I \ fi(xiti) — fi(x) > 5} is cofinal. It follows that g(yi) - g(y) > 6 + (f2{x2,i) — f2{x)) for all i £ J. Taking the limit inferior, we obtain the contradiction 0 > S. Therefore {fk(%k,i))ieI -^ fk(x) for k — 1,2. The inequalities fi(xi,i)
- fi(x) < (xij -x,x{A)
for i £ I imply that ({xij — x,x{ti})
< (yi -y,y*)
- {h(x2,i)
-
f2(x))
-> 0.
D
Using Br0ndsted-Rockafellar theorem we obtain another famous result. Theorem 3.1.8 (Bishop-Phelps) Let X be a Banach space and C C X, C 7^ X, be a nonempty closed convex set. Then (i)
The set of support points of C is dense in B d C .
(ii) The set of support functionals of C is dense in the set of continuous linear functionals which are bounded above on C. Moreover, ifC is bounded, then the set of support functionals of C is dense in X*. Proof, (i) Let x0 6 Bd C and e £ ]0,1[. Taking into account that C ^ X, there exists xi £ X \C such that \\xi — x 0 || < £2- Applying a separation theorem, there exists XQ £ Sx> such that SVLPX£C{X,XQ) < (XI,XQ). SO, for every x £ C, {x - X0,XQ)
= (x - XI,XQ)
whence XQ £ de2tc(xo)have that
+ (xi - X0,XQ)
< (x - xi,xl)
+ \\x\ - x0\\
<
e2,
Applying the Br0ndsted-Rockafellar theorem we
3xe £ C, 3a;* £ dbc{xe)
: \\x£ - x0\\ < e, \\x* - XQ\\ < e < 1.
Since \\XQ\\ = 1, we have that x*e ^ 0, and so xE is a support point of C with \\x£ — xo\\ < e. The conclusion holds. (ii) Using the second part of Theorem 3.1.2 we have that dom(tc)* C cKJmdic) = cl(Imdt c \ {0}), which shows that the conclusion of the theorem is true.
•
The following theorem will be used for a simple proof of the Rockafellar theorem.
Further fundamental results in convex analysis
167
Theorem 3.1.9 (Simons) Let X be a Banach space, f € T(X) and x € X, n G R be such that inf / < 77 < f(x). Consider L :=
sup -fizz rr, x€X\{x} ||ar — ar||
and dx • X -V R, dj(x) := | | i — x||. Then: (i) 0 < -L < oo and inf (/ + Ldx) > n; (ii) V e e ] 0 , l [ , 3 i / 6 X
: ( / + L«fc)(i/) < i n f ( / + i d * ) + e L | | i - y\\;
(iii) V e € ] 0 , l [ , 3 («,*•) G g r d / and\\z*\\ < (l + s)L;
: {x - z,z*) > (I - e)L\\x - z\\ > 0
(iv) Ve €]0,1[, 3(z,z*) € g r S / : (z - z,z*> + /(*) > 17 and L < \\z*\\<(l + e)L. Proof, (i) Because 77 > inf / , it is clear that L > 0. Since 77 < /(a;) and / is lsc at x, there exists p > 0 such that / ( x ) > 77 for every a; € B(x,p). Furthermore, since / € r ( X ) , there exist x* € X* and a 6 R such that f(x) > (a;!a;*) ~ Q f° r every a; £ X. Therefore 77 — f(x)
V* £ X, \\x - x\\ > p : 5_jlZg) < ||x*|| + _ ] _ < li^H + 2. IF ~ x\\
\\x ~ x\\
P
This relation and the choice of p show that L < 00. From the expression of L we obtain immediately that inf (/ + Ldx) > 77. (ii) Let e e]0,1[. Since (1 - e)L < L, from the definition of L there exists y £ X, y ^ x, such that (77 — f{y))/\\x — y\\ > (1 — e)L, and so (/ + Ldx)(y) < 77 + eLdi(y) < inf(/ + Ldx) + eL\\y - x\\. (iii) Let e €]0,1[ be fixed. The function / + Ldx 1S proper, lsc and bounded from below, while the element y from (ii) is in dom(/ + Ldx) = d o m / . Taking the metric d o n X defined by d(xi,X2) := L\\xi—X2\\, (X,d) is a complete metric space. Using Ekeland's variational principle we get the existence of z € X such that (/ + Ldx)(z) + EL\\Z - 7,11 < (/ + Ldx)(y)
168
Some results and applications of convex analysis in normed spaces
and (f + Ldw)(z) < {f + Ld¥)(x) + eL\\x - z\\
VxeX.
The first relation and (ii) give ||^ — yj| < \\x — y\\, and so z ^ I . The second relation says that z is a minimum point of the function / + Ld^ + sLdz. Taking into account that cfe- and dz are continuous convex functions, it follows that 0 G d(f + Ldw + eLdz)(z) = df(z) + Ldd^{z) +
eLddz(z).
But ddz{z) = d\\ • ||(0) = Ux- and dd^(z) = {x* € Ux* | (z - x,x*) = \\z — x\\}. Hence there exist z* G df(z) and x*, y* £ Ux* such that z* = Lx* + sLy* and (x - z, x*) — \\x - z\\. Therefore ||z*|| < (1 + e)L and (x — z, z*) = {x — z, Lx* + eLy*) = L\\x — z\\ + eL(x — z, y*) > L\\x - z\\ - eL\\x - z\\ = L(l - e)\\x - z\\ > 0. (iv) Let e £]0,1[ be fixed and consider e' := e/3. Let us take M := (1 + 2e')L. Since / + Md% > f + Ld% > r], we can apply Ekeland's theorem for / + Md-x, an element xo of dom / , e' > 0 and the metric defined at (iii). We get so the existence of z e X such that V i e l :
(f + Mtk)(z)<(f
+
Mds)(x)+e'L\\x-z\\.
As in the proof of (iii), there exist z* € df(z), x*, y* € Ux* s u c h that z* = Mx* +e'Ly* and (x-z,x*) = \\x - z\\. Thus ||z*|| < (1 + s)L and (x - z, z*) > (M - e'L)\\x - z\\ = (1 + e'L)\\x - z\\. So, for x = z we have that (x — z, z*) + f(z) = f(x) > rj, while for ~x yi z we have (x-z,z*)
+ f(z) > (l + e')L\\x-z\\
+ f(z) = (f + Ldw)(z)+e'L\\x-z\\
> v.
Therefore (x — z, z*) + f(z) > T). Moreover ||i - z|| • \\z*\\ >(x-x,z*) >V~
= [(x - z, z*) + f(z)] - [(x - z, z*) + f(z)]
f(x)
for every x € X; the last inequality holds because z* G df(z). ||af — x\\ for x 7^ x, we obtain that ||z*|| > L.
Dividing by D
Convexity and monotonicity
of
subdifferentials
169
Using the preceding theorem we reinforce slightly Proposition 3.1.3. Proposition 3.1.10 Let X be a Banach space and f 6 T(X). relation (3.9) holds for every x 6 X.
Then
Proof. Taking into account Proposition 3.1.3 we have to show that oo = sup{(a; — z,z*) + f(z) \ (z,z*) € gr<9/} when x £ d o m / . So, let x £ X \ d o m / and take inf / < 77 < 00. Using Theorem 3.1.9 (iv), there exists {z,z*) E gr<9/ such that (a; - z,z*) + f(z) > rj. The conclusion follows. • The maximal monotone operators are of a great importance in the theory of evolution equations. A significant example of such operators is the subdifferential of a proper lsc convex function on a Banach space. Theorem 3.1.11 (Rockafellar) Let X be a Banach space and f 6 T(X). Then df is a maximal monotone operator. Proof. Let (x,x*) EXxX*\gvdf. Then x* £ df{x), and so 0 i df(x), where / := / — x*. It follows that inf/ < fix). Applying assertion (iii) of Simons' theorem, there exists (z, z*) € g r 9 / such that (x — z,z*} > 0. Consider z* := z* + x*\ we have that (z, z*) £ df and (z -x,z*— x*) < 0, which shows that the set df U {(x,x*)} is not monotone. Therefore df is a maximal monotone operator. • 3.2
Convexity and Monotonicity of Subdifferentials
The aim of this section is to show that the monotonicity of an abstract subdifferential of a lower semicontinuous function ensures its convexity; among these abstract subdifferentials one can mention the Clarke subdifferential we introduce below. Throughout this section (X, ||.||) is a normed vector space. Consider | / M c I and x e clM. In the sequel we shall denote by (x n ) - > M X a sequence (xn) C M with (xn) -» x; more generally, x - > M X will mean x £ M and x —> x. The Clarke's tangent cone of M at x is defined by Tc{M,x):={ueX\V{tn)
10, V(xn)-*Mx,
3(un)^u, V n e N : xn + tnun 6 M).
Recall another cone introduced in Section 2.3: C(M,x) := e l f | J
t-(M-x))
=cone{M-x).
170
Some results and applications of convex analysis in normed spaces
Because clM -x C C(M,x) Note (Exercise!) that
C C(clM,i), we have C{M,x) = C(clM,:r).
0ETc(M,x)cC(M,x).
(3.11)
Several properties of the tangent cone in the sense of Clarke are collected in the following proposition. Proposition 3.2.1 (i) Tc{M,x)
Let 0 ^ M C X and xEc\M.
Then:
is a nonempty closed convex cone;
(ii) TC(M, x) = Tc(cl M, x) = Tc(M D V, x) for all V E M{x); (iii)
if M is a convex set then Tc(M,x)
=
C(M,x).
Proof, (i) It is obvious that Xu E Tc(M,x) for A > 0 and u E Tc(M,x); hence Tc(M,x) is a cone. Let u,v € Tc(M,x) and consider (tn) 4- 0 and (xn) ~>M x. Then there exists (un) -> u such that x'n := xn + tnun G M for every n E N. Of course (xJJ —>• 3;. Hence there exists (vn) —> u such that £n + in(w„ + u„) = x^ + i n w n G M for every n. Therefore u + v E Tc(M,x). Consider now (uk)ke^ C Tc(M,x) with (ufc) -+ u E X. Let us show that u E Tc{M,x). For this take (tn) I 0 and (xn) - > M £• Because ufc E Tc{M,x), there exists (u*) ->• ufc such that xn + tnukn E M for every n G N. Hence xn+tnukn E M for all k, n E N. For every n EN there exists k'n E N such that ||u* — uk\\ < n~l for every k > k'n. Let (fc„) C N be an increasing sequence such that kn > k'n; then ||u* - uk\\ < n _ 1 for all k,n E N with k > A;„. Let u„ := u* n . Of course, £„ + £ n u„ = xn + tnuknn G M for every n. As ||M„ - u|| = \\unn - u|| < ||u*n - ukn || + \\ukn - u\\ < n _ 1 + ||u*n - u||, we have that (un) ->• u. Hence u G Tc(M,x). Therefore Tc(M,x) is a closed convex cone. (ii) Let u E Tc(M,x) and take (tn) I 0 and (xn) - > C I M £• For every n G N there exists xn E M such that \\xn — xn\\ < tn/n, i.e. xn = xn + tnn~1u'n with u'n E Ux- Hence (xn) ->M x. Therefore there exists («„) -> u with xn + tnun = xn + tn(n~lu'n + un) G M C clM for every n E N. As (n _ 1 u^ + u„) —> u, we have that u G I b ( c l M , x ) . Conversely, let u G Tc(c\M,x) and take (tn) I 0 and (z n ) -tM x. There exists (un) -> u with xn + tnun G cl M for every n. Like above, for every n E N there exists u^ G Ux such that a;„ + tnun + tnn~lu'n G M . Because ( u „ + n _ 1 u ^ ) —> ii, we have that w E TQ^M^ X). Hence TQ^M, X) — Tc(dM,x).
Convexity and monotonicity
of
subdifferentials
171
The equality Tc{M,x) = 7 b ( M n V,x) is obvious when V is a neighborhood of x. (hi) Let M be convex. Taking into consideration (ii) we may assume that M is closed. Consider x E M and take (£„) 4- 0 and (xn) —»M X. There exists no € N such that tn < 1/2 for n > no. Take un := x + x - 2xn for n > n0 and un = 0 otherwise. Of course, (un) -¥ x — x. As xn + tnun = (1 — 2tn)xn + tnx + tnx S M for n > no, we have that x-x € Tc{M,x). It follows that C(M,x) C T c ( M , x ) . Using Eq. (3.11) we obtain the conclusion. D From (ii) and Eq. (3.11) we obtain that Tc(M,x)
Cn '
C(MHV,x)
=:
TB{M,x);
'V'gJV(x)
is the well known tangent cone in the sense of Bouligand of M atxed M. Let now / : X —> M. be a proper function and x 6 dom / . It is natural to consider the tangent cone Tc (epi/, (x, f(x))). This cone is related to the Clarke—Rockafellar directional derivative of / at x introduced as follows: TB{M,X)
,*,_ . f'{x,u):=sup
, limsup e>0 tiQ,(x,a)->epi,(x,f(x))
. f := sup ml
. „ mi
fix + tv) — a
ll«-«ll<e
*
sup
e>0 *>0 0
f(x +
ml
f(x)
II"-'"ll<
£
tv)-a
.
*
It follows that , ,_ , . , / ' t( a ; , w ) = s u p inf
sup
f{x + tv) -
mi
lk-"ll<E
£ > 0 <5>0 o
f(x)
,
*
and even /t(x,u)=sup
limsup
inf
e>0 (4.0, x->/5 ll«-«ll<e
/ (
*+
tV
} ~~f{x)
(3.12)
'
if / is lower semicontinuous at x, where x —>/ x~ means x —>• x and f(x) —>• f(x). It is obvious that / t ( x , 0) < 0 and /''"(xjAu) = A/ 1 '(x, u) for A > 0 and u G X. We may have f^(x, 0) = — oo even if / is continuous at x. Take for example / : R -»• E, f(x) := — \/\x\, and x = 0 (Exercise!). The next result shows the connection between the preceding two notions.
172
Some results and applications of convex analysis in normed spaces
Proposition 3.2.2 Let f : X —> R be a proper function and x G d o m / . Then epi ft(x, •) = Tc (epi / , (x, f(x))). In particular ft(x, •) is a Isc convex function; ft(x, •) is a Isc sublinear function if and only if ft(x, 0) = 0. Moreover, if f is convex then epi ft(x, •) = cl (epi f'(x, •)). Proof. Let (u,A) G Tc (epi f, (x, f(x))); assume that ft(x,u) > A, and take ft(x, u) > A' > A. Then there exist EQ > 0, ((xn,an)) -»epi/ (x,f(x)) and (tn) 4- 0 such that inf„ 6 c( U)£o ) t~* (f(xn + tnv) — an) > A'. Because (u, A) 6 Tc (epi f, (x, f(x))), there exists the sequence ((un,Xn)) converging to (u, A) such that (xn,an) + tn(un,\n) G e p i / , i.e. f(xn + tnun) < &n + tnXn for all n G N. Since (un) -» u, there exists no such that un G D(u,eo) for n > no- Hence
v^
-r
f(xn + tnv)-an
f(xn +
tnun)-an
A <
inf — < — < A„ Vn > n 0 . ||w—u||<e0 tn tn Taking the limit we get the contradiction A' < A. Let now f^(x~,u) < A and take ((xn,an)) —>epif (x,f(x)) and (tn) I 0. Let us fix (ek) I 0. Because • infr
5>0
• inft
sup 0
\\x-x\\<S, \a-f(x)\<S
f(x + tv) - a —
\\v-u\\<ek
t
for every k G N, there exists 8k > 0 such that . inf
sup 0«<<5fc,||x-x||<(5 fc ,/(x)
u
f(x + tv) -a — £
ll»- ll< *
. < A.
*
There exists n'k G N such that 0 < t„ < 8k, \\xn — x\\ < 8k, \an — f(x)\ < 8k for all n > n'k. Therefore inf„€£>(U)efc) t" 1 (f(xn + tnv) - an) < A, which shows that for every k and every n>n'k there exists u* 6 D(u, e^) such that f(xn + tnUn) < a „ + Xtn. Consider an increasing sequence (n^) C N such that nk > n'k for every k G N. Taking un := u* for n^
C epi f'(x, •) C cone(epi/ - (x,
f(x))),
Convexity and monotonicity
we have that epip(x,
of
subdifferentials
•) = C (epi/, (x,f(x)))
173
= cl(epi/'(:r, •))•
D
+
Note that when / is not lsc at x, f^(x, •) and / (x, •) may be different, where, as usual, / is the lower semicontinuous envelope of / . Take for example / : R ->• R, f(x) = 0 for x ^ 0, /(0) = 1; / t ( 0 , •) = - c o , but / (0, •) = 0. When / is lsc at x then f*(x, •) and / (x, •) coincide. When / has additional properties, Z1" has a simpler expression. Proposition 3.2.3 Let f : X —> R be a proper function and x 6 d o m / . Then for every u € X we have: (i) limsupj.^. xP(x,u) ,-lv- N / f'(x,u)<sup
< f^(x,u);
• r inf
moreover, if f is lsc atx • inff
sup
f(x + tv)-
£>0 S>0 |j^—^||<<S,0
< inf
then
f(x)
*
sup
.
*>° \\z-x\\<S,0
(3.13)
*
(ii) / / / is continuous at x then
,-iv- x f'{x,u)—
•r
. t f{y + tv) - f(x)
sup inf
sup
mi
e>0 <5>0 \\x-x\\<S,0
.
ll"-ull<e
*
(hi) If f is finite and L-Lipschitz on B(x,r) for some r > 0 and L > 0, then ttr\ • f(x + tu) - / ( : r ) /'(a;,u) = inf( sup s>0
||x-x||<(5,0
t
f(x + tu) — f(x) , ,. .. = limsup^—-—'-— J -±-L < L -\\u\\. (iv) If f is finite and Gateaux differentiate tinuous at x then f^(x, •) = V/(3;).
on B(x,r),
(3.14)
and V / is con-
Proof. Throughout the proof u € X is a fixed element. (i) Let f*(x, u) < X and consider e > 0; there exists 6 > 0 such that sup \\x-x\\<8,0
. inf lk-«ll<
f(x + tv)-f(x) e
^ < A.
*
Take 6' := 6/2. Let x' e £(x,<5') with \f{x') - f(x)\ < 6' and x, t be such that \\x - x'|| <6',0
174
Some results and applications of convex analysis in normed spaces
0 < t < 8 and f(x) < f(x) + 8. It follows that sup {mt^D^r1
(f{x + tv) - f(x))
| t G ]0, 8'}, x e D{x', 8'), f(x)
l
< sup {miv€D(u>e)r
(f(x + tv) - f{x)) \te]0,8],
x £
D(x,8),
f(x) < f(x) + 5} < A for all x' e B(x,8') with \f(x') - f(x)\ < 8'. Therefore f^(x',u) < A for such x', and so lizn supx^. w f^(x,u) < f^(x,u). The first inequality in Eq. (3.13) follows immediately from Eq. (3.12), while the second is obvious. (ii) Taking into account Eq. (3.13), one must show the inequality >. For this take e > 0 and 8 > 0. There exists 8' € ]0,8] such that f(x) < f(x) + 8 for every x € D(x,8'). Then sup 0<£<<5 \\x~x\\
inf
^
H
llu-ull^e
M
>
sup
t
inf u
0
^
e
+
^ ~ ^ , t
whence the conclusion follows. (iii) The inequality < is proved in Eq. (3.13). Assume that / is Lipschitz on B(x,r) with Lipschitz constant L. Take A > lim sup x _,. s ^ 0 Hx+tu)~Hx) and fix e > 0. Consider 80 > 0 such that sup|| ;c _j||< 5o)0
<
f(x + tv) - f(x + tu)
f{x +
tu)-f{x)
+ tU
] ~ fix) < / ( * + tU] ~ ™ + Le.
Hence sup \\x-x\\<5 0
. , inf ll"-«ll<£
fix + tv) - fix) ^ '-—i±-± < t
sup \]x-x\\<6 0
fix + tu) - i fix) — — - ^ - L + Le, t
Convexity and monotonicity
of
subdifferentials
175
and so . mi
sup
inf
*>° ||i-i||<<5,0
<
£
sup
/(a; + fa) - / ( a ; ) — t J fix + tu)J - L f(x) -^——-— -^+Le<\
||a:-x||<<So,0«<<5o
+ Le.
*
Taking the limit for e -> 0, we obtain \ ^> rhm inf -t A
sup
e4-0 d>0 \\x-x\\<6,0
f(x + tv)-f(x) ^—L = ttf—f ] { x , u\) .
-t inf ll"-«ll<e
t
Taking into account (ii), we get Eq. (3.14). Taking 5 = r / ( l + ||w||), for x € B(x,S) and t £]0,S] we have that x + tu,x € B(x,r), and so f{x + tu) — f(x) < tL \\u\\, whence f^(x,u) < L \\u\\. (iv) Let r > 0 be such that / is Gateaux differentiable on B(x, r). Since V / is continuous at x, V / is bounded on a neighborhood of x. So we may assume that ||V/(x)|| < L for x € B(x,r). Using the mean-value theorem we obtain that / is i-Lipschitz on B(x,r), and so Eq. (3.14) holds. Let (tn) i 0 and (xn) -»• x be such that f^(x,u) = \imt~1 (f(xn + tnu) — f(xn)). But xn+tnu, xn £ 5(3;, r) for n > no (for some no £ N); applying the mean value theorem, there exists 6n £]0,i„[ such that f(xn + tnu) — f(xn) — V / ( x n + 6nu)(tnu) for every n > no- Because V / is continuous at x, we obtain that ft(x,u) = Vf(x)(u). O Let us introduce now the Clarke subdifFerential of / : X —> K at xeX with f(x) £ E. This is dcf(x)
= {x* € X* | {u,x*) < f\x,u)
Vu € X } ;
if f(x) £ E we consider that dcf(x) = 0. Taking into account Theorem 2.4.14, dcf(x) ^ 0 if and only if /^(af, 0) = 0. Moreover, dcf(x) is a nonempty w*-compact subset of X* if / is Lipschitz on a neighborhood of x. Other properties of dc are collected in the following result. Theorem 3.2.4 domg.
Let f,g : X —> R be proper functions and x £ d o m / n
(i) If / and g coincide on a neighborhood ofx then dcf(x) (ii) If f is convex then dcf(x) (iii)
=
df(x).
If x is a local minimum of f then 0 €
dcf(x).
— dcg(x).
176
Some results and applications of convex analysis in normed spaces
(iv) \imsupx_>x dcf(x)
C dcf(x),
where kmsupx^
dcf(x)
is the set
{x* G X* | 3 (xn) -+f x, 3 « ) % x*, Vn G N : < G 3 c / ( a ; n ) } • (3.15) (v) If g is finite and Lipschitz on a neighborhood of x then
(f + g)Hx, •) < fHx, •) + g\x, •), dcU + g)(?) c dcf(x) + dcg(x). (vi) / / g is Gateaux differentiate tinuous at x then
on a neighborhood of x and Vg is con-
(f + g)H*, •) = fHx, •) + V5(af),
dc(f
+ g)(x) = dcf(x)
+ Vg(x).
Proof, (i) and (iii) are obvious because p{x, •) = (flv^ix, •) for every V G N(x), while (ii) follows from the second part of Proposition 3.2.2. (iv) Let x* G limsupx^ fX-dcf(y); t n e n there exist (xn) ->/ x and (a£) ^ x* such that x*n G dcf(xn) for all n G N. Let M £ I be fixed. Then (u,a;*) < p{xn,u) for every n G N, whence, by Proposition 3.2.3 (i), (u,x*) < ft(x,u), and so x* G dcf(x). (v) Let r > 0 be such that g is L-Lipschitz on B(x, r); we may assume that L > 1. Let u £ X, e > 0 and 5 > 0 be fixed and take 6' = mm{S/{2L),r/(l + e + \\u\\)} > 0. Let v, x and t be such that \\v-u\\ < e , \\x-x\\ <5',0
+ g)(x) ~
f{x + tv) - f{x) t
g{x + tu) - g{x) t
whence, taking first the supremum with respect to v G D(u,e), we obtain that a < Ai(5) + A2(5) + Le, where a :=
sup ||a!-s||
.
mi H"-«ll<£
(f + g)(x + tv)-(f
+ g)(x)
t
U+9)(x)<(f+gm+s' Ai(S)
:=
sup \\x-x\\<S,0
inf — Il«-«II<£
A2(d) := sup {t'1 (g(x + tu) - g{x)) \x
eD(x,5)}.
—-^-L, t
,
Convexity and monotonicity
of
subdifferentials
177
Therefore /3 < Ai(S) + A2{5) + Le for every 5 > 0, where /3:=inf 5 >0 '
sup ||a!-»||<',0
inf if + 9)(x+ tv) - jf + g)(X) ^ £ Il -"II< * u
(f+9)(x)<(f+g)(x)+S'
Taking the limit of Ai (S) + A2 (5) for S —> 0 (taking into account that A\ and A2 are nondecreasing for 5 > 0), we get inf
sup
inf
(/ + g)(* + * 0 - ( / + g)(*)
5
>°\\x-x\\<5,0
•
f
< inf s>0
t
sup \\x-x\\<6,0
X
f(
f
inf ll"-"ll<
—£
+
tv
)
~
/(*)
-
—L
*
. 9(x + tu) — g(x) + inf sup — '-—^^- + Le. <5 >°||x-x||<(5 t
Taking now the limit for e —> 0, we get the desired conclusion. The relation for the subdifferentials follows from the preceding inequality and the fact that g^(x, •) is continuous. (vi) As mentioned in the proof of Proposition 3.2.3 (iv), g is Lipschitz on a neighborhood of x and so the conclusion of (iv) holds with g^(x, •) = Vg(x). Applying again (iv) for f + g and — g (we may assume that g is finite on X; otherwise take g = 0 outside a neighborhood of x), we obtain that f*(x, •) < (f + g)^(x, •) — Vg(x). The conclusion is now obvious. • In the rest of this section we use an abstract subdifferential. Before introducing this notion let us consider the following class of finite-valued convex functions: C(X) := {g : X -> E | g is convex and Lipschitz}; of course, €(X) is a convex cone in the vector space Rx of all functions from X into E. Let d ( X ) C E x be the cone generated by X*U{d [a , 6] | a, b € X) (C €{X)) and €2{X) C E x be the cone generated by
X*u{dfaM\a,beX}WD0, where
Do := { ^2k>QVkdlk
(Hk) C K+, ^2k>Ql*k
These cones will be used below.
=
*' (u*)*>°
conver
Sent} •
178
Some results and applications of convex analysis in normed spaces
We call an abstract subdifferential on the nonempty class T C M. a multifunction d : X x T =4 X*, which associates to (x,f) a set denoted by df(x), satisfying condition (PI) below: (PI) 0 G limsup y _> x d/(y) + dg(x) if / € T, g € <£(X), f(x) £ 1 and a; is a local minimum of / + g, where limsup _yX df{y) is defined as in Eq. (3.15) and dg(x) is the Fenchel subdifferential of g at x. A stronger form of (PI) is (P2) 0 € ~8f(x) + dg(x) if / 6 T, g e €(X), f(x) 6 1 and a; is a local minimum of / + g. Sometimes one asks (P3) df(x) = df(x) if / e
Tn€(X).
Of course, (P2) holds if (P4) and (P5) below are satisfied: (P4) 0 6 df (x) if / 6 J-, f(x) € E and x is a local minimum of / , (P5) T + €(X) C T and 8{f + g)(x) C df(x) + dg(x) when / G T and 9 e €(X). Note that WX + €(X) = RX,
A(X) + €(X) = A(X),
T(X) + €(X) = T(X).
Remark 3.2.1 From the preceding theorem we observe that Clarke's subdifferential dn and the Fenchel subdifferential d are abstract subdifferentials x on R and A(X), respectively; in fact they satisfy conditions (P1)-(P5). There are many other subdifferentials which satisfy condition (PI). Remark 3.2.2 If the abstract subdifferential d on T satisfies the stronger condition (P2) then for any proper function / 6 T one has df(x) C df(x) for all x € d o m / . Indeed, if x* 6 df(x) then a; is a (local) minimum point of / + (—x*), and so, by (P2), 0 £ df(x) + d(-x*)(x) = df(x) - x*. Hence x* € df(x). The following approximate mean value theorem proves to be useful in nonconvex analysis.
Convexity and monotonicity
of
subdifferentials
179
Theorem 3.2.5
(Zagrodny) Let (X, ||-||) be a Banach space and d be x an abstract subdifferential on T C E . Let f 6 T be Isc, a,b G X with a G d o m / and a ^ b, and r 6 l with r < f(b). Then there exist (xn) —>•/ c € [a, b[ and x*n G df(xn) for every n G N such that (i) r — f(a) < lim inf (6 - a, x*n), (ii) 0 < lim inf (c (iii)
xn,x*n),
f^(r-/(a))
(iv) ||& - a|| (/(c) - /(a)) < ||c - a|| (r - / ( a ) ) . Proof. There exists x* G -X"* such that (6 - a, x*) = r — f(a). Consider h := / — x*; then /i(a) < h(b). Because h is Isc, there exists c G [a,b[ such that h(c) < h(x) for all a; G [a,b]. Therefore c = (1 — /x)a + fib for some /i G [0,1[. It follows that c — a = fi(b — a) and ||c - a|| = fi \\b — a\\. Therefore /(c) - f(a) = h(c) - h(a) + (c — a,x*) < fi(r - / ( a ) ) , whence (iv) follows. Let 7 < h(c); then there exists r > 0 such that 7 < h(x) for every x G [a, b]+rUx- Otherwise, for every n G N there exists xn G [a, b ] + n - 1 { / x such that h(xn) < 7. Hence arn = dn + yn with d„ G [a, 6] and y n G n~1UxAs the segment [a,b] is a compact set, there exists a subsequence (dnk) converging to d G [a,b]. It follows that (xnh) —> d, and so, by the lower semicontinuity of h, we get the contradiction h(d) < 7. Let r > 0 correspond to 7 := h(c) — 1, and take C/ := [a, b] + rUx', of course, U is closed. Like above, for any n G N there exists r„ G ]0, r[ such that /i(z) > h(c) — n~2 for 3; G [a,b] + rnUx', choose tn > n such that 7 + tnrn > h(c) — n~2. Then one has Vz G U : /i(c) < h(x) + tnd[atb](x)
+ n~2.
(3.16)
Indeed, the inequality is obvious for x G [a, b] + rnUx- If x £ U \ ([a, b] + r„[/x) then d[0i6](x) > r„, and so /i(x) + t„d[a)ft](a;) > 7 + i n r n > ft.(c)-n-2. Consider i?„ — h + iu + *nd[0)(,]. Prom Eq. (3.16) we have that Hn(c) < infx Hn + n - 2 ; moreover, Jf„ is Isc and bounded from below. Applying Corollary 1.4.2 for Hn, c, e ~ n~2 and A := n _ 1 , we get un G X such that Hn(un)
< Hn(c),
||c-u„|| < n_1,
(3-17)
1
(3.18)
Hn{un) < Hn(x) + n - ||z - u n ||
Va; G X.
Since (un) —> c G [a, 6[, we may assume that un G intf/ for every n.
180
Some results and applications of convex analysis in normed spaces
Relation (3.18) can be written as ( / + tnd[atb] + n~ldUn - x*) (u„) < ( / + tnd[aib] + n - 1 d „ „ - x*) (x) for every x 6 U, where da denotes d[a,a] = ^{o}- Therefore un is a local minimum of/ + (tnd[a,b] +n~1dUn -x*); the function between the parentheses being convex and Lipschitz, by (PI) we have that 0 6 limsup <9/(2/) + d(tnd[a
n-Hl).
Let yn £ [a, b] be such that d{a^{un) = \\un — yn\\ (such an element exists because [a, b] is compact). Because 0, and so (yn) -> c. Since c ^ b, we may suppose that yn ^ b for every n 6 N. Hence yn = (1-A n )a+A„6 with An € [0,1[. From Proposition 3.8.3(H) on page 239, dd[a,t](u„) = d||-|| (u n -y n )r\N([a,b],y n ). Therefore ((1 - A)o + \b- yn,v*n) = (A - A„) (6 - a , < ) < 0 for every A € [0,1[. Hence (b — a, u*) < 0 for every n S N. Moreover, (c - u n , u*) = (c - y n ,v*) - (u„ - yn,v*) It follows that (b-a,u*n) n_11|& — a\\, whence
> (b-a,x*)
< - ||u n - y„|| < 0.
- ^(fr-a,^)
> r - f(a)
-
liminf (b — a,u*) > r — / ( a ) . Similarly, {c — un,u*n) > (c — u„,x*) — n _ 1 | | c - u n | | , whence liminf (c — un,u*n) > 0. From Eq. (3.17) we get f(un) - (un,x*) < /(c) - {c,x*), and so /(c) < liminf f(un) < l i m s u p / ( u n ) < /(c); hence l i m / ( u n ) = /(c) which shows that (un) —>f c. Because u*k e l i m s u p . ^ ^ df(u), for every k £ N there exist (u„,*) ->•/ Ufc and (u*^) ^> u£ (for n -» oo) such that u*nk G df(un>k)
for all n, A; e N.
Convexity and monotonicity
of
subdifferentials
181
It follows that ((c - uniu,u*nk)) -> ( c - Ufc,u£) for n -> oo. Therefore for every A: € N there exists n'fc € N such that IK,*-Wfc|| < fc_1, \f(un,k)(c-unik,u*nik)-
f(uk)\ < k'1,
\(b-a,u*nk
(c-uk,u*k)\
- u*k)\ < AT1,
< k~l
for n > n'k. Consider (nk) C N an increasing sequence such that nk > n'k for all k e N. Let xn := u„,fc and a;* := u^ fc for nk < n < nk+i- Since for nk
~ Uk\\ + \\uk - C\\ < k~l
+ \\uk - C\\ ,
l / W - Z t o l ^ + l/toWtol, (b - a, < ) = {b-a, u*k) + {b-a, < < f c - u*k) > (b - a, u*k) (c-xn,x*n)
= (c-u n , f c ,u* ) A .) > (c-uk,u*k)
k'1
- fc-1,
we obtain that (xn) —•/ c, lim inf (6 — a, xn) > lim inf (b — a, u£) >r — f(a) n—>oo
k—•oo
and lim inf (c — x n ,a;*) > lim inf (c — Ufc,u^) > 0. n—s-oo
fc-+oo
Taking into account that b — c = (1 — /j,)(b — a) and \\b — c|| = (1 — /i) ||b — o||, from the preceding two relations we get lim inf (b- xn,x*n) = lim inf ((1 - fi) (b- a, a:*) + {c-
xn,x*n))
> (1 — /x) lim inf (b — a,xn) + lim inf (c — a;„,a;*) > K The proof is complete.
(r - / ( a ) ) . •
Remark 3.2.3 The conclusion of Theorem 3.2.5 remains valid if (PI) holds only for functions g in the cone £i(X) or in the cone C2PO defined on page 177. Indeed, for the first part just note that in the proof of Theorem 3.2.5 we used (PI) only for g = i„d[aj6j + n~1dUn — x* € £i(X). For the second part one must notice that one can find tn > 0 such that 7 + tnrn > h(c), and so Eq. (3.16) holds with d[Q>(,] replaced by d?a bi. Applying the Borwein-Preiss variational principle for p = 2, Hn,
182
Some results and applications of convex analysis in normed spaces
c, e := 2n~2 and A := \/2/n, we find un € B(c, y/2/n) and a function 0 2 ,„ generated by the sequences (fJ.n,k)k>o C K+ with X^>0Mn,ib = 1 and (wn,fc)fc>o C B(c, \/2) (see Eq. (1.11) on page 31) such that un is a local minimum of / + (tndfa M + n~1@2,n — x*). Then, in the expression of un we have tn \\un — yn || instead of tn and 6* 6 23/2Ux* instead of &£ 6 C/x* (see also Exercise 2.40). The rest of the proof is the same. Corollary 3.2.6
Let X be a Banach space and d be an abstract subx differential on T C M. . If f G T is a Isc proper function then for every x e d o m / there exists ((xn,xn)) C gr<9/ such that (xn) —»/ x and liminf (x — xn,x^) > 0. In particular d o m / C cl(domd/). Proof. Let x S d o m / . If z is a local minimum of / then 0 belongs to limsupy^ xdf(y) by (P2) (just take g = 0). Therefore there exists (xn) —>f x and (a;*) —> 0 such that a:* 6 df(xn) for every n € N. Of course, lim (x — xn, x^) = 0 in this case, and so the conclusion holds. Assume that x is not a local minimum of / . Then there exists (xn) —> x such that f(xn) < f{x) for every n £ N. Since / is lsc, we get (xn) -»•/ x. Applying the preceding theorem for xn, x and r = f(x), there exists ((xk,n,x%tn))keN C domdf such that (xk,n) ->/ yn G [x„,a;[ for k -» oo and 0
< |jx-lj (/W
_
/(^«)) < Hminf *,_>«, (a; - a;fc>„,a;^„),
/(l/n) < /(*„) + | ^ f (/(*) - /(*»)) < /(*) for every n € N. Since \\yn - x\\ < \\xn - x\\ we have that (yn) -> x, which, together with the preceding inequality and the lower semicontinuity of / , yields (yn) ->•/ x. Then, for every n 6 N there exists k'n € N such that lk*,n - Vn\\ < n _ 1 , |/(a:fc,„) - f(yn)\ < n _ 1 and 0 < (x - Xk,n,x*kn) f o r every k > k'n. Let (fc„) C N be an increasing sequence such that kn > k'n for n € N. Then lkfc„,n - z|| < ||a;*„,n - y n || + \\yn - x\\ < n _ 1 + \\yn - x\\, l/(**»,») - f(x)\
< \f{xkn,n)
- f(yn)\
+ \f(Vn) -
f(x)\
< n - 1 + |/(y„) - /(a:)| , 0 < (z-Z*„,„,4„,n) for all n £ N. Taking xn := Xkn,n and a;* := a;^ n , we have the desired conclusion. •
Convexity and monotonicity
of
subdifferentials
183
When T = A(X) and d is the Fenchel subdifferential, the statement of the preceding corollary is very close to assertion (i) of Theorem 3.1.2. Theorem 3.2.7 X
Let X be a Banach space, d be an abstract subdifferential
on T C K and f £ T be a Isc proper function. If df is a monotone multifunction then f is convex. Proof. We prove the theorem in three steps: 1) a £ d o m / , b £ domdf => [a,b] C d o m / , 2) g r d / C gr<9/, 3) / is convex. 1) Assume that there exists b' £ [a, b] \ dom / ; so b' — Xa + (1 — X)b with A £ ]0,1[. Fix y* £ df(b) and take r £ ffi, r > / ( a ) + A-1(l-A)||6-a||.||2/*||. By Theorem 3.2.5, there exist (xn) - » / c £ [a, b'[ and x* € df(xn) n € N such that liminf (&' — a;n,a;*) > 0,
for every
liminf (b' — a, a;*) > r — f(a).
Since df is monotone we get the contradiction l|6-a||-||y*ll>ll&-c||-||y*||><6-c)i/*> = liminf (b — xn,y*) > liminf (b — xn,x*n) - liminf (A(l - A)" 1 (b' - a,x*n) + (b' - xn, x*n)) > A(l - A) - 1 liminf ( 6 - a , x * ) + liminf (b' >
A ( 1
_
A )
-i
( r
_
/ ( a ) )
-xn,x*n)
.
2) Let x* 6 df{b) with b 6 X. Consider x £ d o m / . Applying again Theorem 3.2.5 for r = f(b), there exist (xn) ->f c € [x, b[ and x* € df(xn) for every n 6 N such that liminf (b-xn,x*n)
> g f f } (/(6) - / ( * ) ) .
Since 9 / is monotone and c = Ax + (1 — A)6 with A € ]0,1], we have that A (6 — x,x*) = (b — c,x*) = liminf (& — x n ,x*) > liminf (b — x n ,a;*) >A(/(6)-/(x)), and so (x — b, x*) < f(x) — f(b) for every x £ d o m / . Therefore x* £ df(x). 3) Let x, y £ dom / and z := (1 — X)x + Xy with A £ ]0,1[. By Corollary 3.2.6, there exists (yn) c dom df such that (yn) ->•/ y. Let a;n := (1-A)x + Ay„ for n £ N; by 1) we have that zn £ d o m / for every n. Using again
184
Some results and applications of convex analysis in normed spaces
Corollary 3.2.6, there exists ((zn,k,zn k)) c STOf s u c r i t n a t (zn,k) —>•/ zn for k —> oo and liminffc_>oo(^n — zn,kiznk) — 0. By 2) we have that z*nk G df(zn>k), and so (x-zntk,z^k)+f(zn,k)
< f(x),
(yn-
zn,k,znk)
+ f{zn>k) < f(yn)
for all n,k G N. Multiplying the first inequality by (1 — A) > 0 and the second one by A > 0 we get (zn - zn,k, z*n>k) + f(zn,k)
< (1 - A)/(x) + Xf(yn)
Vn, k G N.
Taking the limit inferior for k —> oo we get f(zn) < (1 — A)/(x) + Xf(yn) for every n. Passing to the limit inferior for n —> oo and taking into account the lower semicontinuity of / and the fact that (yn) ->/ y we get f(z) < (1 — A)/(a;) + Xf(y), and so / is convex. • We give now another short proof for the Rockafellar theorem on the maximal monotonicity of the subdifferential of a lsc convex function. Theorem 3.2.8 Let X be a Banach space and f € T(X). maximal monotone.
Then df is
Proof. Let x G X and x* ^ df(x). Then x is not a minimum point for g := f — x*, and so there exists z E X such that g{z) < g(x). Let r G E be between these numbers. By Theorem 3.2.5 applied for d on r(X), there exists ((yn,yn)) c Sr9g such that (yn) —>fl y € [z,x[ and lim inf (x - yn,y*n) > | f 5 | [ (r - g{z)) > 0. Therefore (x - yn,y*n) > 0 for n > no, for some n 0 G N. It follows that y* := yno + x* € df(y), with V '•= 2/n0) and (x — y,x* — y*) < 0. Therefore df is maximal monotone. • Theorem 3.2.5 can also be used for proving that two convex functions with the same subdifferential differ by an additive constant. In fact one obtains even more. Theorem 3.2.9
Let C be an open convex subset of the Banach space X jf
and d an abstract subdifferential on T C IK . Let f G T be proper and lower semicontinuous and g G T(X). Assume there exists e G R+ such that df(x) C dg{x) + eUx*
V i G C.
(3.19)
Then C D dom / = C f) dom g and 9{x) - g{y) ~e\\x-
y\\ < f(x) - f(y) < g(x) - g(y) +e\\x~
y\\
(3.20)
Convexity and monotonicity
of
subdifferentials
185
for all x £ C and y £ C fl domg. Proof. Consider 7 > e. By Corollary 3.2.6 we have that d o m / C cl(dom9/), and so C fl domdf ^ 0. Let y £ C fl d o m 9 / and x £ C. Prom our assumption we have that y £ doming, and so y € domg. We are going to prove that /(*) - /(!/) < $(*) - 5(2/) + e II* - 2/11 •
(3-21)
The inequality is obvious if x £ domg or 1 = y, and so assume that x £ domg and x ^ y. Because dg(y) ^ 0, we have that g'(y,u) > - 0 0 for all u £ X. Consider ip : E -»E defined by
g'(y,u),
and this quantity is finite. Therefore there exists to £ ]0, ||:r — y||[ such that g'(y + t0u,u)
9 g'(x0,u) < 9{V + *>">to - 9{V) + | ( 7 - e ) = g
f(x0)-f(y)
(3.22)
Assume that Eq. (3.22) does not hold, i.e.
f(xo) - f(y) > 9{x0) - g(y) + 7 Iko - y\\ • Choose r € E such that r < f(xo) and r - f(y) > g(x0) - g(y) + 7 ||*o - y\\ • By Theorem 3.2.5, there exist (xn) ->f z £ every n £ N such that
\I),XQ\
and x*n £ df(xn)
liminf (a?o -x„,a;*) > — - — — (r - /(j/)).
(3.23) for
186
Some results and applications of convex analysis in normed spaces
Because (xn) —> z we have that (||a;o — x n ||) —> ||xo — z||, and so
]xo-xn\\
I
\\XQ-y\\
Taking into account Eq. (3.23), there exists no G N such that *o-*» - * \ Fo-a;n|| /
>
g(»°)-gy M^cro — 2/11
vn>n0.
+ 7
Since ( x „ ) —>• z G [y, xo[ C C, we m a y a s s u m e t h a t xn G C for n > no- B y E q . (3.19) we have t h a t xn = zn + eu*n with z* G dg(xn) a n d u* G t / x * for n > no- Using also t h e convexity of g, it follows t h a t g(x0)-g(xn) ^ / so^gn ||ar0 — a;„||
\ ||z 0 - z„|l
A
>
9(xo)-9(y)
/
+ 7
_
£
V n
^no_
||ar0 — 3/1
Taking into account the lower semicontinuity of g at z and Eq. (3.19) we get g{xo)-
g(z)
g(x0) - g(y)
,
,
IN-2/H
IFO-^II
Since 2 G [y, io[, z = y + su for some s G [0, io[- Using the convexity of
—ii-
rr- = — :
,
,
— <
(x0,u).
\\x0 - z\\ t0 - s Therefore Eq. (3.22) holds. Consider T~{te
]0, ||rc - y||] I f(y + tu) - f(y) < g(y + tu) - g(y) + jt} .
From Eq. (3.22) we have that t0 G T, and so t := supT G]0, ||a; - y||]. Because
_ , , > * & • « ) = V (<) > V (0) = ff+(»,0),
whence g'(y,u) G 1R. Moreover, y G d o m / D domg. Proceeding as above with y replaced by y (noting that we used only that y G dom / n dom g and g'(y,u) G R), we obtain some s 0 G]0, ||a; - y\\] such that f(y + s0u) - f(y) < g(y + s0u) - g(y) + -ys0.
Convexity and monotonicity
of
subdifferentials
187
As f{y) — f(y) < g(y) — g(y) + 7*, we obtain by addition that f(y +(t + s0)u) - f{y) < g(y + (t + s0)u) - g{y) + 7 ( I + s 0 ), contradicting our choice of t. Hence t = \\x — y\\, and so f(x) — f(y) < g(x) — g(y) + 7 \\x — y\\. Because 7 > e is arbitrary, we obtain that Eq. (3.21) holds. Taking i G C f l dom g and a fixed y G C n dom df, from Eq. (3.21) we obtain that C n dom g c C f l d o m / . Let now j / e C f l d o m / . From Corollary 3.2.6 we get a sequence (yn) C dom df with (yn) —>/ y. Because y G C, we may assume that yn G C for every n € N. Let i G C n dom g be fixed. From Eq. (3.21) we have that
f{x) - f(yn) < g(x) - g(yn) + e\\x-yn\\
Vn € N.
Since g is lsc at y, we get fix) - f(y) < g(x) - g(y) + e\\x-
y\\,
(3.24)
and so y £ dom g. Hence C (~1 dom / = C fl dom g. Let x, y € C fl dom / = C fl dom g; then Eq. (3.24) holds. Interchanging x and 1/ in Eq. (3.24), we obtain also that
f{x) - f{y) > g(x) - g{y) - e \\x - y\\, which proves that Eq. (3.20) holds for all x,y G C f] d o m / . Equation (3.20) being obvious for x & C \ dom / = C\ dom g and y £ Cr\ dom g, the conclusion follows. D A more precise result is the following. Corollary 3.2.10 Let X be a Banach space and d an abstract subdifferential on T C K. . Consider a proper and lower semicontinuous function / € f and g € T(X). If df(x) C dg(x) for every x G X then there exists S;£R such that f = g + k. Proof. Take C — X and e ~ 0 in Theorem 3.2.9. Fixing x0 G d o m / = dom g, we obtain that f(x) = g(x) + f(xo) — g(xo) for every x G X. • Corollary 3.2.11 Let X be a Banach space and f,g€ dg(x) for every x G X then f = g + k for some k €.R.
T(X). Ifdf(x)
C
Proof. Apply the preceding corollary on F(X) and the Fenchel subdifferential. •
188
Some results and applications of convex analysis in normed
spaces
The next result completes Proposition 2.4.3. Corollary 3.2.12 Let X be a Banach space and T : X =4 X* be a maximal cyclically monotone multifunction. Then there exists f G T(X), unique up to an additive constant, such that T = df. Proof. Because T is maximal monotone, g r T ^ 0. Fix (XO,XQ) G g r T and consider fr defined by (2.37) on page 85. Then, by Proposition 2.4.3 we have that fr G T(X) and g r T C grdfr- Since T is maximal monotone we even have g r T = grdfr, and so T = df. The uniqueness follows from Corollary 3.2.11. • 3.3
Some Classes of Functions of a Real Variable and Differentiability of Convex Functions
Consider the following useful classes of functions: A = {
A0
Nk = {ip G A | P 3 11-> t~kip(t) € 1 + is nondecreasing}, Jfc G {0,1,2},
= {<^ G yi | iimmrkip(t) = o}, ke {o, i},
r
= {ip G .A |
To Si
= r n A), r^ := y G r 0 1 iiminft_+0Ot-V(*) > o}, = r n n i , s? ~ {^eEi liimsup^^rVC*) <°o}.
Of course, N2 C Ni C N0, E? C Si C fti C ft0 and Tg C T 0 C T C A^. Moreover, if
(s) := inf{t > 0 | y>(t) > s } ,
(s) := supjr- > 0 |
(3.25)
it is obvious that p e < / , i/ie £ iVo,
Differentiability
of convex
functions
189
Furthermore, one can extend
(i) Ifip£A0r\N0
then tpe,tph £ Cl0; if(p£Cl0r\N0
then
(ii) Let
i> £ r 0 <s> v # e Si and v e rg o v # e sf • (iv) Let p,q £ ]1, oo[ be such that 1/p + 1/q = 1 and
190
Some results and applications of convex analysis in normed spaces
particular v*(s) > sup{st - st/2 \t£ [0,e]} - es/2 > 0. Hence
-
Therefore s i-> s~qip#(s) is nonincreasing on P. The proof for the other statement is similar. (v) For x* £ X* we have that r(x*)=sup{(x,x*)-p(\\x\\)\xeX} = sup{(x,a;*) -
\\x\\ = t}
= sup sup ((x,x*) — (p(x)) =sup(t\\x*\\—(p(x)) t>0 \\x\\=t
= tp* (\\x*\\).
t>0
The equality f**(x) = <^ ## (||a;)|| is obtained similarly.
D
As an illustration of the use of the above classes of functions let us give the following interesting result about the differentiability of a convex function. First recall that a bornology on X is a family B of nonempty bounded subsets of X having the properties: 1) every set B of B is symmetric, 2) B is directed upwards, i.e. for all B±, B2 £ B there exists B3 £ B such that S i U B2 C B3 and 3) 1){B | B £ B} = X. The topology induced by the bornology B on X*, denoted by Tg, is the topology of uniform convergence on every set B of B. The most common homologies on X are the Gateaux, the Hadamard and the Frechet homologies which are obtained taking as B the classes of finite symmetric subsets, the weakly
Differentiability
of convex
functions
191
compact symmetric subsets and the bounded symmetric subsets of X, respectively. To the Gateaux bornology corresponds the weak* topology on X*, while to the Prechet bornology corresponds the norm topology on X*. It is clear that the topology m which corresponds to the bornology B is finer than the weak* topology and coarser than the norm topology. The function g : X —>• E is said to be S-diflFerentiable at x if / is finite on a neighborhood of x and there exists x* £ X* such that lim
/(x
t->0
+
^)-/(x) t
\ '
/
'
v
v
the limit being uniform w.r.t. u 6 B, for every B £ B; x* is denoted by Vf(x) and is called the ^-differential or ^-derivative of / at x. Of course, for the Gateaux, Hadamard and Prechet bornologies one obtains the Gateaux, Hadamard and Prechet derivatives, respectively. Condition 3) implies that / is Gateaux differentiable at x whenever / is B-differentiable at x for the bornology B; in particular the ^-derivative is unique when it exists. T h e o r e m 3.3.2 Let f € A(X) be continuous atx£ bornology on X. Consider the following statements:
doxnf and B be a
(i) / is B-differentiable at x, (ii) / is B-smooth
at x, i.e. l i m ^ o i - 1 ^ ^ ) = 0, where, for t > 0,
o-B(t) := sup{/(x + ty) + f{x - ty) - 2f{x) \ y € B}, (iii) every selection of df is norm to T& continuous at x, (iv) for every x* £ df(x) there exists a selection 7 of df which is norm to TB continuous at x and j(x) = x*, (v)
there exists a selection of df which is norm to TB continuous at x,
(vi) (a;*) -4 x* whenever x* £ df(x), (xn) -)• x,
x* £ df(xn)
for n £ N, and
(vii) (x*n) ^4 x* whenever x* £ df(x), and (en) -» 0, (xn) ->• x,
0 < en, x*n £ denf(xn)
for n £ N,
(viii) (x*) ^4 x* whenever x* € df(x), and (e n ) -> 0.
0 < e„, x* £ denf(x)
for n £ N,
TTien (i) <£> (ii) <=> (iii) «• (iv) <s> (v) o (vi) <£= (vii) o (viii). Moreover, if X is a Banach space and f is Isc then (vi) => (vii).
192
Some results and applications of convex analysis in normed spaces
Proof, Then
(i) => (ii) Assume that / is S-differentiable at x. Let u € X.
lim ^ tio
+ tU) +
^
~ ^
~
2f
^
t
= lim f{* + *4.o = (u,Vf(x))
tu)
t +
~
/(5f)
I lim / ( * ~ t\o (-u,Vf(x))=0.
tu)
t
~
/ (
^
As the limits in the right-hand side of the first equality above is uniform w.r.t. u £ B, for every B £ B, we obtain that / is ^-smooth at x. (ii) => (i) Take x* £ df(x); such an element exists by Theorem 2.4.9. Consider B £ B and take u £ B; one has OBH) > f(x + tu)+f(x-tu)
-2f(x)
>f(x
+ tu) - f(x) -t(u,x*)
> 0,
and so 0
tu)-m
_{UiX.}<*BM
vues.
Hence Eq. (3.26) holds, the limit being uniform w.r.t. u £ B. (i) => (iii) Let 7 : D —• X* be a selection of df, where D := dom df; of course, int(dom/) C D and 7(2;) = V/(af). Let B £ B. Consider aB : K+ - > ! + ,
a B ( t ) := sup{f(x + tu) - f(x) -t(u,j(x))
\u£
B}. (3.27) Because / is S-differentiable at x, lim^o £ _ 1 <7B(£) = 0, i.e. a £ fii. Because / is convex, <7j9 is convex, too. For x £ D, u £ B and £ > 0 we have that (z + tu - x, 7(0:)) < /(& + tu) - /(:r) < / ( £ ) + t (u, j(x)) + aB(t) -
f(x),
whence t (u, j(x) - i{x)) - aB(t) < f(x) - f(x) + (x-x,
7(2)).
Taking first the supremum with respect to u £ B, then with respect to t > 0 on the left-hand side of the above inequality we obtain that (CT B ) # (sup u€B
\{U,J(X)
- j(x))\)
< f(x) - S{x) +
< f(x) - f(x) + \\x - x\\ • | | 7 ( i ) | |
(x-x,j(x)) Vx £ D.
Differentiability
of convex
functions
193
Since / is continuous at x, by Theorem 2.4.13, df is bounded on a neighborhood of x, and so 7 is bounded on a neighborhood of x. Prom the above inequality we obtain that l i m ^ ^ ^ ^ s ) * (sup u £ B \(u,7(x) — 7(x))|) = 0. Because, by Lemma 3.3.1, er# G To, we get lim^-yj s u p u e B \(u,f(x) — 7(x))| = 0. Therefore 7 is norm to Tg continuous at x. (iii) =>• (iv) and (iv) => (v) are obvious. (v) =£- (i) Let 7 be a selection of df which is norm to T& continuous at x. Consider B 6 B and e > 0. Then there exists 5 > 0 such that D(x, 6) C D and s u p u € B \(u,j(x) - 7(x))| < e for all x G D(x,6). Because B is bounded, B c /3Ux for some ft > 0. Fix u G 5 and take <£ : K -> 1 ,
f(x+tu)-f(x)-t(u,j(x))
+ tu,-u)
- (u,7(x)),
< ip'+(t) for every t G int(domy>).
=
+ su)-j(x))
ds
for all t G [0,S']. Hence limt4.oi-1o\B(£) = 0, where WB is denned by Eq. (3.27). It follows that / is S-differentiable at x and V/(x) = 7(1). (iii) => (vi) Let a;* G df(x) and ((x n ,x*)) C gr<9/ be such that (xn) ->• x. Assume that x* is not the limit of the sequence (x*) for the topology TB- It follows that there exist e 0 > 0, B G B and an infinite subset P of N such that s u p u 6 B \{u,x*n — x*)\ > e 0 for every n G P. From (iii) => (i) we have that df(x) = {x*}, and so xn ^ x for n E P. Since (x n ) —> x, we may assume that xn ^ xm for distinct n,m E P- Let 7 be a selection of 9 / such that 7(x n ) := x* for n E P, j(x) := x* and 7(x) an (arbitrary) element of df(x) otherwise. From (iii) we obtain that 7 is norm to TQ continuous at x, and so (x*) n e p TA- x*, contradicting our choice of P. (vi) =>• (iii) Let 7 be a selection of df, and assume that 7 is not norm to rg continuous at x. Then (since x has a countable base of neighborhoods) there exists a sequence (x n ) C D converging to x such that (7(x n )) does not converge to 7(2:) w.r.t. TB; this contradicts (vi). (vii) =>• (vi) is obvious; just take en = 0 for n G N. (vii) => (viii) is obvious; just take for n G N. (viii) =>• (vii) Let x* E df(x), 0 < en and x* G dSnf(xn) be such that
194
Some results and applications of convex analysis in normed spaces
(en) ->• 0 and (xn) -> x. Consider e := sup{e„ | n 6 N}. Since / is continuous at x, by Corollary 2.2.12, / is Lipschitz on a neighborhood of x, while from Theorem 2.4.13 we have that b\f is bounded on a neighborhood of x. Hence there exist r,m > 0 such that \f{x) — f(y)\ < m\\x — y\\ for x,y e D(x,r) and def(D(x,r)) C mUx*- There exists n0 € N such that xn £ D(x,r) for n > n 0 . Let x* € dSnf{x). Then {y - x, x*) = (y - xn, x*) + (xn - x, x*) < f(y) - f(xn)
+en + \\xn - x\\ • \\x"\\
< f(y) - f(x) + m \\x - xn\\ +en + m- \\xn - x\\ for all y € X, and so d
enf(xn)
C den+2m\\xn-x\\f(x)
Vn > n0.
Because (e„ + 2m \\xn — x\\) —> 0, from our hypothesis we obtain that
(OneP^X*Assume now that X is a Banach space. (vi) =^ (vii) Let x* e df(x), 0 < e„, a;* e dSnf(xn) be such that (e n ) -^ 0 and (x„) —> x. Because xn 6 int(dom/) for large n and / and / coincide on int(dom/), we may suppose that / is lsc. By the Borwein theorem, for every n € N there exists (x'n,x'*) e gr<9/ such that \\xn - x'n\\ < ^/E^ and 11a;* - x'*\\ < y/e^. Therefore (a;^) ->• x. By (vi) we obtain that (x'*)n£N -A x*. As (\\xn — x'*\\) -¥ 0 and TB is coarser than the norm topology, we obtain that (a;*)neN TA x*. • Because any selection of df coincides with the gradient of / at the points x where df{x) is a singleton, from the preceding theorem we get the following. Corollary 3.3.3 If f & A(X) is continuous and Gateaux differentiable on the open set D c d o m / then f is B-differentiable on D if and only if V / is norm to TB continuous on D. • As a consequence of Theorems 3.3.2 and 3.1.2 we get the following interesting result. Corollary 3.3.4 Let X be a Banach space and f £ T(X). differentiable atx* 6 int(dom/*) then V/*(aT*) € X.
If f* is Frechet
Proof. By the Br0ndsted-Rockafellar theorem there exists ((a;„,a;*)) C grdf such that « ) ->• x*. Then xn 6 df*(x*n) for the duality (X*,X**).
Well conditioned functions
195
Taking the Prechet homology, from the implication (i) => (vi) of the preceding theorem we get (xn) -» V/*(z*). Because X is closed in X** we obtain that Vf*(x*) G X. • A similar result holds for Gateaux differentiability of /* under supplementary conditions on X. Corollary 3.3.5 Let X be a weakly sequentially complete Banach space and f G T(X). / / / * is Gateaux differentiable at x* G int(dom/*) then V / * 0 T ) G X. Proof. As in the proof of the preceding corollary, there exists a sequence ((xn,x*n)) in grdf such that (a;*) -+ x*. We have again that xn G 9/*(x*) for the duality (X*,X**). Taking now the Gateaux homology, from the implication (i) =>• (vi) of the preceding theorem we get (xn) -» Vf*(x*) for the weak* topology of X**, i.e. ((x*,xn)) -» (x*, V/*(z*)> for all x* G X*. In particular (xn) is a weakly Cauchy sequence, and so, by hypothesis, (xn) converges weakly to some x G X. It follows that V/*(i*) =x € X. • Note that every reflexive Banach space is weakly sequentially complete. The next result shows that for convex functions the usual sufficient condition for Prechet differentiability is also necessary. Corollary 3.3.6 Let f G T(X) be continuous and Gateaux differentiable on a neighborhood of x G d o m / . Then f is Prechet differentiable atx if and only if V / is continuous at x. Proof. We may assume that / is continuous and Gateaux differentiable on the ball B(x,r) C d o m / with r > 0. Because V / can be prolonged to a selection of df, the conclusion follows from Theorem 3.3.2. • Note that for convex functions defined on finite dimensional normed vector spaces Gateaux and Frechet differentiability coincide (use for example Theorem 3.3.2 and take into account the fact that the norm and weak topologies coincide). This is not true for infinite dimensional normed spaces as the functions in Exercises 2.10 and 3.1 show. 3.4
Well Conditioned Functions
Let / G A{X) be such that argmin/ := {x G X \ f{x) = i n f / } ^ 0, •0 G A and 5 C X be a nonempty closed convex set. We say that / is
196
Some results and applications of convex analysis in normed
spaces
^-conditioned with respect to S if MxeX
: f{x)>mff
+ i(>(ds{x))
(see Eq. (3.62) on page 237 for the definition of ds). If there exists rj > 0 such that the above inequality holds for every x € X with ds(x) < T), we say that / is locally ip-conditioned with respect to S. Let us note that if / is V'-conditioned with respect to the subset S and tp £ A0, then argmin/ C S. Taking into account this fact, we say that / is ^-conditioned if / is ^-conditioned with respect to argmin/. Assume that S := argmin/ ^ 0 and consider the conditioning gage of / defined by
V>/:t+ -*•!+,
ipf(t)~mi{f(x)-Mf\xeX,ds(x)=t}.
(3.28)
It is clear that tpf 6 A and / is ^/-conditioned. The following result holds. Proposition 3.4.1 ipf £ Nx. Proof. Let t > exists a sequence ipf(ct). Let (en) I ct+en. Let An :=
Let / £ A(X) be such that S := argmin/ ^ 0. Then
0 and c > 1; we must show that ipj{ct) > cipf(t). There (xn) C X such that ds{xn) = ct and (f(xn) — inf/) —> 0. For every n there exists an £ S such that \\xn — an\\ < (||a;„ — an\\ — ct + t) / \\xn — an\\ 6 [C1,1[. We have that
ds ((1 - Xn)an + A„a;n) > ds(xn)
- (1 - A„) \\xn - an\\ = ct-ct
+ t = t.
Because A i-> ds ((1 — A)an + Xxn) is a continuous function, there exists fin £ ]0, An] such that ds ((1 — £«„)an + nnxn) = t- We obtain so that V'/W < / ( ( I - fJ-n)an + HnXn) - inf / < (1 -fin)f(an)
+ Hnf(xn)
-inf/
= Mn (/(^n) - inf / ) < An (f(x„) - inf / ) . Because (An) -* c _ 1 , we obtain that ipf(t) < c~l • ip/(ct).
O
Let us remark we used only that / ( ( l - X)a + Xx) < (1 — X)f(a) + Xf(x) for every a £ S, x £ X and A £ [0,1], in other words the fact that / is star-shaped at every a 6 5; of course, this condition ensures the convexity of 5. Corollary 3.4.2 Let f and S be as in Proposition 3.4-1- If ip '• [0,a] -> M.+ , where a > 0, is such that Vx£X,
ds(x)
: f(x) > inf / + V
(ds(x)),
Well conditioned functions
197
then f is ipa-conditioned, where ,
™
™
, , ,
[ V>(*) I x^t
for for
te \0,a\, t €ja,oo[.
Proof. From the definition of ipf we have that ipf > ip on [0,a]. For ds(x) = 7 > a we have that / ( i ) > inf / + V/(7) > inf / + -V>/(a) >
inf
/ + ~^(a)
= inf / + ^ d s ( : r ) = inf / + >« (d 5 (a:)), a whence the conclusion.
•
Proposition 3.4.1 shows that ipf(t) > 0 for every t > t0 if ipf (to) > 0. It is important the case in which ipf(t) > 0 for every t > 0, i.e. ipf £ Ao', in such a case we say that / is well-conditioned. The following result holds. Theorem 3.4.3 Let X be a Banach space and f S T(X) be such that S := argmin/ ^ 0. T/je following statements are equivalent: (i) £/iere exist ip £ A and to > 0 s«c/t i/rni VW > 0 for every t G ]0, to[ and f is ip-conditioned; (ii) for every sequence (xn) C X with (f(xn)) —> inf/ we have that (ds(xn)) -> 0; (iii) tftere ezisis ip £To such that f is ip-conditioned; (iv) B t r e E i , V x * 6 l * : /*(x*) < /*(0) + tj(i*) + <7(||x*||); (v) 3 V € r 0 , V i e S , V ( z , a : * ) e g r 0 / : ( i - i , a ; * ) > (vi)
B ^ e y i o n A f o , V(x,x*)egrdf
:
ip(ds(x));
(vii) 3flGn 0 n7Vo, V(a;,a:*)egra/ : ds(x) < 9(\\x*\\); (viii) V(z n ) C X : (d 8 / ( T n ) (0)) -> 0 => (d s (x„)) -> 0. Furthermore, in the equivalence (iii) <£> (iv) we can take a = ip&, ip =
198
Some results and applications of convex analysis in normed spaces
tpf £ A0 fl iVi), we have that ip := cotpf £ T 0 (see Lemma 3.3.1(iii)). Of course, / is ^-conditioned. (iii) => (ii) is obvious, since for ip £ T 0 we have that (tp(tn)) -> 0 =>• (*n) ->• 0. (ii) =>• (i) Suppose that there exists £o > 0 such that V/(*o) = 0- Then there exists (x„) C X such that ds(xn) = t0 and (f(xn) - inf / ) -> 0, which evidently contradicts the hypothesis. As / is ^/-conditioned, the conclusion holds. (iii) =>• (iv) Using Theorems 2.8.10, 2.3.1(ix) and Corollaries 2.4.7, 2.4.16 (see also Exercise 3.7) we have that (V o d s ) * = t £ + ^ # ° | | - | | ,
{t*s + ^o\\.\\)*=^ods.
(3.29)
Using Eq. (3.29), for x* € X* we have that f{x*)
= sup((x,x*)-f(x)) x€X
< sup {(x,x*) - i n f / -
ip(ds(x)))
xeX
= no) +tyo dsy (x*) = no) + L*S(X*)+^*(\\x*\\). The conclusion follows with a := ip#; a € Si by Lemma 3.3.1 (iii). (iv) =>• (iii) is obtained similarly, using Eq. (3.29), with ip := a#. (iii) => (v) Let x £ S and (x,x*) € grdf. Then f(x)>f{x)+ip(ds{x))
and f{x) > f(x) + (x -
x,x*).
Adding these two relations we obtain the conclusion with the same ip. (v) =>• (vi) From (v) we obtain that VxeS,
V(x,x*)
Egrdf
: \\x - x\\ • \\x*\\ > ip(ds(x)),
whence ||x*|| • ds(x) > tp(ds(x)) for every (x,x*) £ grdf. The conclusion follows taking ip € A0 defined by
V(*) = ( l - 7 ) * - v ( 7 * ) -
Well conditioned
functions
199
Suppose that (iii) does not hold for this ip. Then there exists x E X such that f(x) < ini f+ip(ds(x)). It follows that x £ S and there exists c E ]0,1[ such that f(x)<mif We take e = c • ip(^j(ds{x))) there exists u E X such that f(u) + e • \\u — x\\ < f(x)
+
c-iP(ds(x)).
> 0. Using Ekeland's variational principle,
and f(u) < f(x) + e • \\u — x\\ Vx E X.
The last relation being equivalent to df(u)C\eUx* i1 0, there exists u* E X* such that (u,u*) E gr<9/ and ||u*|| < e. FVom the first relation, by the choice of x, the definitions of x[> and e, and the hypothesis, it follows that \\u — x\\ < (1 — -y)ds{x)
and
(p(j(ds(x))).
But ds(u) > ds(x) - \\u - x\\ > ds(x) - (1 - j)ds(x)
= 7 • ds(x),
whence the contradiction
6(t) = sup{d s (x) I {x,x*) 6 grdf, ||a;*|| < t}.
It is obvious that 9 is nondecreasing, 6(0) = 0 (i.e. 8 E NQ) and ds(x) < 6(\\x*\\) for every (a:,a;*) E grdf. If limt|o^(*) > 0, there exists £0 > 0 such that 0(t) > £0 for every t > 0. Prom the definition of 6, for every n E N there exists (xn,x^) E df such that ||a;*|| < 1/n and ds(xn) > EQ. Of course this contradicts the hypothesis. D We remark that we used the convexity of %[> only for the implication (iv) => (iii); for the other implications it is sufficient that ij) E N\. The function ipf verifies this condition. Also, we used that X is a Banach space only in the implication (vi) =>• (iii). We also note that in the implications (iii) <$ (iv), (v) =>• (vi) <S> (vii) •£> (viii), (vi) =*> (iii) =>• (ii) =*> (i) we can
200
Some results and applications of convex analysis in normed spaces
take S C X a nonempty closed convex set, and in the implication (iii) => (v) we used only the fact that S C argmin/. Corollary 3.4.4 Let X be a Banach space, f € T(X) and (x, x*) € grdf. The following statements are equivalent: (i) 3iPeT0,
Vx€X
: f(x)>f(x)
(ii) 3 f f € S i , V i * 6 P
: f*(x*)
(iii) 3 ^ € r 0 , \/(x,x*)Ggrdf (iv) 3ip£A0nN0,
(x-x,x*)+rP(\\x-x\\);
< f(x*)+{x,x*
- x*)+a(\\x* - x*\\);
: (x - aT,x* - z*) > V(||a; - z||);
\/(x,x*)egrdf
(v) 3den0nN0,v(x,x*)egTdf (vi) V(xn)cX
+
: y>(||x - z | | ) < ||x* - x*\\;
•.
\\x-x\\ <eq\x* -x*\\y,
: (da/(:Cn)(x*))->0=^(:En)->z;
(vii) x* 6 int(dom/*) and / * is Frechet differentiable at x*. Proof. Taking the auxiliary function g : X —> R denned by g(x) = f(x) — (x, x*), we may assume that x* = 0. The equivalence of conditions (i)-(vi) follows from Theorem 3.4.3 by taking S := {x} (taking into account, eventually, the remarks done after the proof of that theorem). (ii) =>• (vii) Taking into account that (x, x*) £ gr<9/, we have that Vi'eT
: 0 < f*(x*)
- f*(x*)
- (x,x* -x*)
whence x* £ int(dom/*) and lim sup
r(x*)-r(x*)-(x,x*-x*)
< l i
m s u p
t|0
t
^ = l i m ^ = 0 . *4-° t
This shows that / * is Frechet differentiable at a;* and V/*(a;*) = x. (vii) => (ii) Because (x*,x) € g r ( 9 / ) _ 1 C gr<9/* (the last one for the duality (X*,X**)), we obtain that x e df*{x*). Since / * is Frechet differentiable at x*, we have that V/*(aT") = x. For t € K+ consider a(t) := suv{r{x*)
- f*{x*) - (x,x* -x*)
= sup{r(x*+tu*)-r(x*)-t(x,u*)
\ \\x* -x*\\ = t} | | K | | = 1}.
Because the function 11-> f*(x* + tu*) - f*(x*) — t {x,u*) (6 M.+) is convex and lsc, it follows that a G T. Furthermore, from the Frechet differentiability of the function / * it follows that lim^o £-1
Well conditioned
functions
201
A sufficient condition for (i) from the preceding corollary is that / ( ( l - X)x + Xx) + A(l - X)rp(\\x - i||) < (1 - X)f(x) + Xf(x)
(3.30)
for all x £ X and A £ ]0,1[. Indeed, if Eq. (3.30) holds and x € dom / then {f{x + X(x - x)) - /0c))/A < f{x) - f{x) - (1 - X)^{\\x - x\\) for all x £ X and all A 6 ]0,1[, whence Vxel
: f{x)>f{x)
+ f'{x,x-x)+ip{\\x-x\\),
(3.31)
and so VieX,Vi*ea/(i)
: f{x)>f{x)
+ (x-x,x*)+iP{\\x-x\\).
(3.32)
A function / e A(X) for which there exists ip &Ao satisfying condition (3.30) is said to be uniformly convex at x (6 d o m / ) . Let / 6 A(X) and x € d o m / . The gage of uniform convexity of / at x is related to relation (3.30), and is defined by Pf,x{t) '•= i n f \ —
A)/(z) + Xfjx) - / ( ( l - X)x + Ax) A£]0,1[, A(l - A) \\x — x\\ = t
= inf { <* ' X»W
+
Y(f_+^ ~ ^
+ A
«»> | u e SX, A 6 ]0,1[} (3.33)
(with the convention oo - oo = oo). Using Theorem 2.1.5(vii) one obtains easily that pf^ € No (Exercise!). We may introduce also a gage related to relation (3.31). More exactly consider tif,x(t) ••= inf {f(x) - f(x) - f'(x, x - x) | x E dom / , ||a; - x\\ = t} = inf {f(x + tu) — f(x) — tf'(x, u) | u e Sx H cone(dom / - x)} . Because the mapping M+ 3 11-> f(x+tu) — f(x)—tf'(x, u) 6 M+ belongs to T C N\, we obtain that j ? ^ € Nx. As in the proof of (3.30) =* (3.31) we have that i9/,j > pf^. Thus, when / is uniformly convex at x € d o m / (i.e. P/,x 6 ^o) then $/,j € .Ao- One may ask if the converse implication holds. The answer is affirmative when / is Frechet differentiable at x € dom / .
202
Some results and applications of convex analysis in normed spaces
Proposition 3.4.5 Let f 6 F(X) be Frechet differentiable at x 6 d o m / . If flf,x € AQ then f is uniformly convex at T. Proof.
Let t > 0 be such that D(x, t) C int(dom/). By hypothesis
f{x)>f{x)
+ f'{x,x-x)+6
VxeX,
\\x-x\\ = t/2,
where 6 = i9/ iS (i/2) > 0. On the other hand, because / is Prechet differentiable at x, a selection 7 : dom df -> X* of df exists which is continuous at x (by Theorem 3.3.2). Hence there exists n > 0 such that Il7(*) - V/(3c)|| < S/t for ||z - x\\ < j]. Let a e]0,1[ be such that 1 - a < 2t}/t. Consider x 6 X arbitrary satisfying ||a; — x\\ — t and take z := ax + (l-a)^^= x-\-±=^-{x-x) € int(dom/). Hence \\z -x\\ = ^^-t < r], and so \\j(z) - Vf(x)\\ < 5/t. Of course, f(x) > f(z) + f'(z,xAs |(37 + x) = -^hx + -^tiz,
z) = f(z) + / ' (z, we
l
-^(x-
x)) .
(3.34)
have that
STT/(z) + S T l / W > / ( § ( * + *)) •
(3-35)
Multiplying Eq. (3.35) with a+1 and adding the result to (3.34) we obtain f(x) + af(x) > (1 + a ) / (I(3f + x)) + f (z, ±=2(3f - x)) , whence f{x) + f{x) - 2 / {\{x + x))>^
(/ (1(3- + a:)) - f(x) + \f< (z,x - x)) 6
>^(
+ y^x-x)
+ I / ' (z,3f- a;))
>^(<5+i
Uniformly convex and uniformly
3.5
smooth convex
203
functions
Uniformly Convex and Uniformly Smooth Convex Functions
We "globalize" the notion of uniformly convex function at a point as follows. We say that the proper function / : X —> K is p-convex, where p £ A, if / ( ( l - X)x + Xy) + A(l - X)p(\\x - i/H) < (1 - X)f(x) + Xf(y). for all x, y £ X and all A £ ]0,1 [. Of course, if / is p-convex then / is convex; we say that / is uniformly convex if there exists p £ AQ such that / is p-convex. We say that / is strongly convex (with rate c or c- strongly convex) if / is pc-convex for some c > 0, where pc £ AQ, pc{t) •= \ct2. It is clear that when / is uniformly convex then / is uniformly convex at every point of its domain and when / is uniformly convex at every point of its domain then / is strictly convex. When / £ A(X) and A C X is a convex set we say that / is uniformly convex on A if / + LA is uniformly convex. It is natural to introduce the gage of uniform convexity of the convex function / . Namely, it is the function p/ : K+ -»• R+ defined by p/(i)=inf|
(l-X)f(x)+Xf(y)~f((l-X)x
+ Xy)
A(l - A)
AG]0,1[,
x,y £ d o m / , \\x — y\\ = t > . Of course, Pf > p when / is p-convex. Hence / is uniformly convex if, and only if, p/ £ Ao. The gage of uniform convexity of the convex function / has a remarkable property. Proposition 3.5.1
Let f £ A(X).
Vt>0, Vc>l i.e. the function 11-» t~2pf(t)
Then :
Pf(ct)>c2pf(t),
is nondecreasing on P.
Proof. Let t > 0 and c £ ]1,2[ be such that p/(ct) < oo. Let us fix e > 0. There exist x,y £ d o m / and A G]0, | ] such that ||y — x\\ = ct and n , ^ Pf(ct)
. . (l-X)f(x) + pe >
+ Xf(y)-f((l-X)x ^ — -
+ Xy)
.
204
Some results and applications of convex analysis in normed spaces
Consider x\ := (1 - X)x + Xy and xc := (1 - c l)x + c ly. We have that \\xc — x\\ — t and x\ = (1 — c\)x + cXxc (of course cX 6 ]0,1[). Hence f(xc)
< (1 - c-1)^)
f(xx)
< (1 - cX)f(x) + cXf{xc) - cA(l -
f(xx)
> (1 - X)f(x) + Xf(y) - A(l - X)pf(ct) - eX(l - A).
+ c-'fiy)
- c-\l
-
c-^pfict), cX)pf(t),
From these relations (multiplying the first with cX > 0) we obtain that c2Pf(t) < Pf(ct) + ec• ±Eh < PM)
+£-£~C-
Letting e -> 0, we obtain that c2/9/(£) < pf(ct) for all c € ]1, 2[ and t > 0. By mathematical induction one obtains immediately that c2npf(t) < Pf(cnt) for c e ] l , 2[, n £ N and t > 0, and so c2pf(t) < pf(ct) for every c > 1 and t > 0. The conclusion holds. D A dual notion (as we shall see) for uniform convexity is that of uniform smoothness. Let / € A(X), a £ A and O ^ i c d o m / . We say that / is er-smooth on A if for all x,y £ X and all A 6]0,1[ such that (1 — A)a; + Xy £ A we have / ( ( l - X)x + Xy) + A(l - X)a(\\x - y\\) > (1 - A)/(i) + A/(y),
(3.36)
or equivalently
f(x) + A(l - AMIMI) > (1 - X)f(x - Xy) + Xf(x + (1 - X)y) for all z £ A, y £ X and A £]0,1[; we say that / is cr-smooth if / is cr-smooth on dom / . Having in view this definition, it is natural to consider the following gage of uniform s m o o t h n e s s o n A of the function / : af,A{t)
••= s u p ^
(1 - A)/(:r) + Xf(y) - / ( ( l - X)x + Xy) A€]0,1[, A(l - A) x,y£X,
(l-X)x
+ Xy£A,
f (1 - X)f(x - Xty) + Xf(x + (1 - X)ty) - f(x) sup
\\x-y\\=t\ Ae]Q>1[j
A(l - A)
x £ A, y £ Sx
}•
(3.37)
Uniformly convex and uniformly
smooth convex functions
205
c/.dom / is denoted by CTJ and is called the gage of uniform smoothness of / . It is obvious that 07,4 < a if / is cr-smooth on A. Sometimes it is useful to consider the following midpoint gage of smoothness: a°fA(t) := sup {f(x) + f{y) - 2f{\x + \y) \ \x + \y £ A, \\x - y\\ = 2t} = sup {f{x - ty) + f{x + ty) - 2f{x) \ x £ A, y £ SX}-
(3.38)
It is easy to obtain that 2cr°fA{t/2) < af,A(t) < o-°fA{t) for t > 0. As above, c^dom/ w m ^ e denoted simply
Let f £ A(X), h £ A{X*) have proper conjugates
(i) J / / and h are p-convex then f* and h* are p# -smooth, respectively. (ii) / / / and h are cr-smooth then f* and h* are o# -convex, respectively. Proof, (i) Suppose that / is p-convex. Let t > 0 be fixed and consider x* eX*,y* £ Sx* a n d A £ ] 0 , l [ ; t a k e ^ := x*-\ty*, x\ := x* + (l-X)ty*. Consider also 70 < f*{xo) and 7J < f*{x{). From the definition of / * there exist Xo, xi £ X such that 70 < {X0,XQ) - f(x0),
71 < (xi,xl)
-
f{x{).
Multiplying the first relation by 1 — A and the second one by A, then adding them to both sides of the (Young-Fenchel) inequality 0 < f(xx) +f*{x*)
~{xx,x*),
206
Some results and applications of convex analysis in normed spaces
where x\ :— (1 — X)xo + Xx\, we obtain that (1 - A)7o + A71 < f(xx) + /*(**) - (1 - A)/(ar0) - A / f o ) + A(l - A)* (x, < f*(x*)
+ A(l - A) (t \\x0 - i i | | - p(\\x0 - nil))
+
x0,y*)
\(l-\)P*(t).
Letting 7; —>• f*(x*) for i = 1,2, we obtain that /* is p # -smooth. The same calculation shows that h* : X —> E is /9*-smooth if /i is p -convex. (ii) Suppose now that / is cr-smooth. Consider x^,x\ 6 X* and A 6 ]0,1[. For x, y € X, denoting x*x := (1 - X)XQ + Az*, we have that
>(l-X)((x-Xy,x*0)-f(x-Xy)) +
X({x + (l-X)y,x*1)-f(x
> (1 - A) (x - Xy,X*0) + X(x+(1-
+
(l-X)y))
X)y,x\) - f(x) - A(l - A)a(||y||)
= {x, x*x) - f{x) + A(l - A) ((y, x\ - x*0) -
a(\\y\\)).
Since x, y € X are arbitrary, we have that
(i-x)r(x*0) + xr(x*1) > snpxeX
((x, x*x) - f{x)) + A(l - A)supJ/€js: ((y, x{ - XQ) -
= rix*x) +
Xil-X)o-#i\\x*-x*1\\),
which means that / * is cr#-convex. The proof for the fact that h* is cr#-convex when h is cr-smooth is similar. • Corollary 3.5.4 Let f 6 T(X). Then o{* = ipf)* and 07 = ( p / . ) # Furthermore t !-• t~2afit) is nonincreasing on P. In particular, if there exists to > 0 such that 07 (£0) < 00 then a fit) < 00 for every t > 0. Proof. From the preceding proposition we have that /* is (p/) # -smooth # and (<7/) -convex, while / = /** is (p/.) # -smooth and (07.)* -convex. Therefore we have that 07 < (/?/•)* a n d Pf > (07)*. Passing to the conjugates in the second inequality we obtain that 07 < (p/*) # < (07)**. Therefore erf = (p/») # = (07)** (in particular we have obtained again
Uniformly convex and uniformly
smooth convex functions
207
that aj is convex and lower semicontinuous). In a similar way we obtain that 07. = (/9/)*. Taking into account Proposition 3.5.1, Lemma 3.3.1(iv) and the relation Of = (p/*)*, we have that t >-> t~2a;{t) is nonincreasing on P. The remainder of the conclusion is immediate. • We say that / G A(X) is uniformly smooth (on the nonempty subset A of dom / ) if there exists a G fii such that / is cr-smooth (on A). When A is a singleton {x} C dom / , / is uniformly smooth on {x} if and only if / is smooth at x in the sense of Theorem 3.3.2. Taking into account Proposition 3.5.2, if / is uniformly smooth o n ^ C l then there exists £o > 0 such that A + SQBX C d o m / , while if / is uniformly smooth then d o m / = X. Summarizing Propositions 3.5.1-3.5.3 (and using Lemma 3.3.1(iii)) we obtain the following important result. Theorem 3.5.5
Let f £T(X).
Then
(i) / is uniformly convex 44> / * is uniformly smooth; (ii) / is uniformly smooth •$=>• f* is uniformly convex. It is quite natural to say that the proper function / : X —> E is uniformly Frechet differentiable on 0 ^ A C X if there exists £o > 0 such that A + SQBX C d o m / , / is Frechet differentiable at any point of A and nm /Or+
t-H)
*»)-/(*) t
V M
uniformly with respect to x € A and u € Sx- Of course, when A = {a} this means that / is Frechet differentiable at a. Similarly to the proof of Theorem 3.3.2, one obtains easily that / G A(X) is uniformly Frechet differentiable on A C dom / if and only if / is continuous at some point of its domain and is uniformly smooth on A. The following result gives other characterizations of uniformly smooth convex functions. Theorem 3.5.6
Let f G A(X). Consider the following
statements:
(i) / is uniformly smooth; (ii)
( d o m / ) ! / 0 and there exists a<x G fii such that
V x G ( d o m / r , V y G X : f(y) < f{x) + f{x,y-x)
+ a2{\\y - x\\); (3.39)
(iii) d o m / = X, f'(x, •) is linear for every x G X and there exists &$ €
•
208
Some results and applications of convex analysis in normed spaces
Oi such that \/x,y£X
+ f,(y,y-x)
: f'(x,x-y)
(iv) grdf ^ 0 and there existsCT4G fti
SMCA
(3.40)
i/ioi
y(x,x*)egrdf,
V j / e X : /(i/)(o;) + (y-x,x*>+a4(||y-a:||); (3.41) (v) dom 9 / = X and there exists as G fii such that V(x,x'),(v,v')£&df
• (y-x,y*-x*)
(3.42)
(vi) dom df = X and there exists ipi G To such that
v{x,x*),(v,v')e&df
: m>f(x)
+ (y-x,x*) + M\\y*-x'\\); (3.43)
(vii) dom df = X and there exists V>2 € T 0 suc/i i/iai V (*,**), (y,y*)egidf
: (i/-i,y*-a;*) > ^(||y*-i'||);
(3.44)
(viii) dom df — X and there exists y> £ Ao(l No such that \/(x,x*),(y,y*)
£grdf
: \\y - x\\ > V{\\y* - x*\\);
(3.45)
(ix) dom df = X and there exists 6 £ flo f) No such that V(z,z*), (,*) G g r d / : 0(\\y - x\\) > \\y* - x*\\; (x) d o m / = X, f is Frechet differentiate $7I such that Vx,yeX
: (x-y,
Vf(x)
on X and there exists &§ G
- V/(j/)) < a6(\\y - x\\);
(xi) d o m / = X, f is Frechet differentiate uous.
(3.46)
(3.47)
and V / is uniformly contin-
Then conditions (i) - (iii) and (iv) - (xi) are equivalent, respectively, and (iv) =£• (ii). Furthermore, if f is continuous at some point of dom / (or X is Banach space and f is lower semicontinuous) then the eleven statements are equivalent; in such a case if one of conditions (i) - (xi) holds then dom / = X and f is continuous. Proof, (i) =>• (ii) Let a G fii satisfy Eq. (3.36); we already observed above that d o m / = X. From Eq. (3.36) we have that A" 1 (f{x + \{y - x)) - f(x)) + (1 - AMHi, - x\\) > f(y) -
f(x)
Uniformly convex and uniformly
smooth convex functions
209
for all x,y G X and A G]0,1[; taking the limit for A -¥ 0 we obtain Eq. (3.39) with cr2 : = a(ii) => (iii) Let x0 G ( d o m / ) \ From Theorem 2.1.13 we have that f'(xo,u) G R for every u G X. Let t0 > 0 be such that <72(£0) < oo. From Eq. (3.39), it follows that {y G X \ \\y - x0\\ = t0} C d o m / , and so B(x0,t0) C d o m / (because d o m / is a convex set). Therefore (dom/) 1 + toBx C dom / , whence, as in the proof of Proposition 3.5.2 we obtain that d o m / = X. Changing x and y in Eq. (3.39), then adding side by side the corresponding relations, we obtain
V i j e l
: 0
+ f'(y,x-y)+2a2(\\y-x\\).
(3.48)
Let x G X be fixed and u £ X \ {0}. Consider the function
>p'_{t) =-f'{x+
tu,-u).
Taking into account Theorem 2.1.5, for — t < 0 < t we obtain that
+ f'{x, -u) = v ; ( 0 ) - ¥>'_(0) < ¥»'_(*) = -f'(x
+ tu,-u)
ip'+(-t)
- f{x + tu,u)
1
< (2i)- 2 CT2 (2i|H|) = (2i||t i ||)- 1 2 CT2 (2i|| U ||) • ||u|| for every t > 0. Letting t —>• 0, we obtain that f'(x,u) + f'(x,—u) < 0. Since the converse inequality holds (f'(x, •) being sublinear), we have that f'(x, -u) = -f'(x,u), and so f'(x, •) is linear. In this situation Eq. (3.40) coincides with Eq. (3.48) (and so 0-3 = 2o-2). (iii) => (ii) Let ^3 : 1+ -> R + be defined by CT3(t) = sup{cr3(s) | s G [0, t]}. It is obvious that W3 e fli C\ No and ^3 > a. Consider the function ip defined in the proof of the preceding implication. In our situation
210
Some results and applications of convex analysis in normed spaces
It follows t h a t
f(x + u) - f{x) = p(l) - tp(0) = £ if/it) dt
0V(o) + '-W*NI))^ = y>'(0) + /J1"11 t~la3{t) dt = f'{x, u) + a2(\\u\\), where ^ ( i ) = / 0 s~1a:3(s) ds; it is obvious that o (i) From (ii) =>• (iii) we have that d o m / = X and f'(x, •) is linear for every x £ X. Let 07 : K+ —> R+ be defined by af(t) - sup{f(y)
- f(x) - f'(x, y - x) \ x,y G X, \\y - x\\ = t}
= sup{/(a; + tu) - f(x) - tf'(x,u)\x,ueX,
\\U\\ = 1}.
(3.49)
Since t H-> f[x + tu) — f(x) — tf'(x,u) is convex and continuous for all x,u € X, it follows that / is convex and lower semicontinuous. From the obvious inequality / <^2, we have that 5/ G fii. Let x,y G X and A G]0,1[; consider O;A := (1 - A)a; + Ay. From the definition of Of we have that
f(x) < f(xx)+f'(xx,x-xx) + af(\\x-xx\\), f(y) < f(xx) + f'(xx,y - xx) + 37(||y - xx\\). Multiplying the first relation with 1 — A and the second one with A, then adding them side by side, we obtain (1 - \)f(x)
+ Xf(y)
< f(xx) + A(l - X)[f'(xx,x-
y) + f'(xx,y-
+ (1 - X)af(\ \\y - x\\) + \af((l
x)]
- A) \\y - x\\)
(3.50)
(a:A) + 2A(l-A)a? / (||i/- a J ||) (a:A) + 2A(l-A)(r2(||i/-a;||); we have used the convexity of 5 / . Therefore / is uniformly smooth. (iv) =>• (i) Let (X0,XQ) 6 grdf and take t0 > 0 such that c 4 (i) < 1 for t G [0,i 0 ]- From Eq. (3.41) we obtain that f(x) < f(x0) + t0 \\XQ\\ + 1 for x G B(xo,to), and so B(xo,to) C d o m / and / is continuous at every point of B(xo,to). Hence domdf + toBx C domdf C d o m / , which implies that dom df = dom / = X and / is continuous on X.
Uniformly convex and uniformly
smooth convex functions
211
Let x,y G X and A G]0,1[. With the notation from the proof of the implication (ii) =*- (i), there exists x*x € df(x\). Replacing f'(x,-) by x* G df(x) and f'{x\, •) by x*x in the proof of that implication, we obtain that / is 2
+ (y-x,y*)
and
0 < (y - x,x* - y*) + a5{\\y - x\\),
Eq. (3.41) follows by adding side by side these relations, with 04 := 05. (iv) =» (vi) From (iv) =£• (i) we have that dom / = X and / is continuous. Let (a;, a;*), (y, y*) G gr<9/ be fixed and take an arbitrary 2 G X. From Eq. (3.41) we have that f(y + z) < f(y) + (z,y*) + cr4(H^||), and so 0
f(x) -(y +
< f(y) - f{x) -(y-x,
z-x,x*)
x*) - (z, x* - y*) + a 4 (||z||).
Therefore VzeX
:
(z,x*-y*)-ai(\\z\\)
Taking the supremum with respect to z £ X and using Lemma 3.3.1(v) we obtain that Eq. (3.43) holds with ipi := af. By Lemma 3.3.1(iii) we have that V"i G r 0 . (vi) => (vii) Changing x and y in Eq. (3.43) and summing the obtained relations side by side, Eq. (3.44) holds with ^2 := 2V>i(vii) =$• (viii) From the obvious inequality (y — x,y* — x*) < \\y — x\\ • \\y* — x*\\ we obtain immediately that Eq. (3.45) holds with ip G A0nN0, ip(t) := t~li})2{t) for t> 0. (viii) => (ix) and (ix) =^ (viii) The first implication is obvious by taking 6 := iph and the second one by taking ip := 6e using Lemma 3.3.l(i). (ix) => (v) The implication is obvious by taking cr5 G fii, (T5(t) :— t-6(t) for t > 0. From what was shown above we have that conditions (i)-(iii) and (iv)(ix) are equivalent, respectively, and (iv) =£- (i). Moreover d o m / = X and / is continuous if (iv) holds.
212
Some results and applications of convex analysis in normed spaces
(ix) => (xi) From what was said above, / is continuous; from Eq. (3.46) we have that df is single valued; by Corollary 2.4.10 we have that / is Gateaux differentiable. Using again Eq. (3.46), V / is uniformly continuous. Using Corollary 3.3.3 we obtain that / is Frechet differentiable. (xi) =$> (ix) This implication is obvious; just take 6 the gage of uniform continuity of V / . (v) =>• (x) In our hypothesis, as shown above, / is Frechet differentiable. Hence Eq. (3.47) holds with a& := 05. (x) =£• (v) is obvious with 05 := erg. (ii) => (iv) when / is continuous at some point of its domain. From (ii) => (hi) we have that d o m / = X, and so / is continuous on X, and f'(x, •) = V/(x) for every x € X. Therefore Eq. (3.41) holds with <74 := a2. (ii) => (iv) when / is lsc and X is a Banach space. From Theorem 2.2.20 we have that / is continuous on (dom/) 1 ^ 0. The conclusion follows from the previous case. • The proof of the preceding theorem shows that 07 < 07 < 2 5 / . Moreover, when / is continuous, one can take 02 = 0-4 and 03 = 0 5 = CFQ. Remark 3.5.1 In Theorem 3.5.6 the implication (ix) =>• (iv) holds with ai(t):=J*e(s)d8. Indeed, if (ix) holds, by the preceding theorem d o m / = X and / is Frechet differentiable. Let x,y E X. Taking V
:E^K,
+
ty)-f(x)-t-(y-x,Vf(x)),
we have that
tp is derivable
V / ((1 - t)x + ty) - V/(a:)) < \\y - x\\ • 8 (t \\y - x\\)
for every ( 6 1. It follows that /(!/) - f(x) ~{y-x,
Vf(x))
= J,,1
= fSy~xh(s)ds
(t \\y - x\\) dt
= ai(\\y-x\\).
Corollary 3.5.7 Let f : X —> E be a continuous convex function and p, q £ E be such that 1 < p < 2 < q and p _ 1 + q"1 = 1. The following statements are equivalent: (i) there exists L\ > 0 such that f is a-uniformly smooth, where o~(t) := ^t" for t > 0;
Uniformly convex and uniformly
(ii) 3 L 2 > 0 , Vx,yeX, f(y)
smooth convex
Vx* e 8f(x) + (y-x,x*)
functions
213
: +
^\\x-y\\p;
(iii) 3 L 3 > 0 > V ( a : , a : * ) , ( i / , y * ) € g r a / : f(y) > fix) + (y- x,x*) + ^ (iv) 3 i 4 > 0 , V(x,x'),
(y,y*)£grdf
||ar* - y*\\q ;
:
(y-X,y*-X*)<^\\X-yf(v) 3 L 5 > 0 , V(x,x'),
(y,y*)£grdf
:
Vx,yeX.
Proof. The conditions (i)-(vi) of the corollary correspond to conditions (i), (iv) (or (ii)), (vi), (v) (or (iii)), (vii) and (ix) of the preceding theorem, respectively. So, we have that (i) => (ii) with L 2 ~ L\ and (ii) => (i) with Li := 22~PL2 (using directly Eq. (3.50)). Also, (ii) =* (iii) with L3 := L\~l, (iii) =» (v) with L5:= L3, (v) => (vi) with L 6 := (L5q/2)P~\ (vi) => (ii) with Z/2 := ^6 (by the preceding remark), (ii) =*• (iv) with L\ := L2 and (iv) =>• (ii) with L 2 := 2L±/p (for this one uses the proof of the implication (iii) => (ii) of the theorem). • Remark 3.5.2 For p = 2we may take the same Lfc = L > 0 for all k £ 1,6 in the preceding corollary, as the proof shows. The next result shows that every uniformly convex function on a Banach space is coercive and its subdifferential is onto. Proposition 3.5.8 convex. Then
Let X be a Banach space and f £ T(X) be uniformly
fix) limiiif i - H > 0 IMI-"» HxH2
and
badf
= X*.
Moreover, if (xn) C X is a minimizing sequence for f then (xn) converges to the unique minimum point of f.
214
Some results and applications of convex analysis in normed spaces
Proof. By the Br0ndsted-Rockafellar theorem there exists grdf, and so Vz G X : f{x) > f(x0) + (x-
x0,z5).
(XO,XQ)
G
(3.51)
It follows that f(x0) + A (x - x0,
XQ)
< f ((1 - X)x0 + Xx) < (1 - X)f(x0) + Xf(x) - A(l - X)Pf(\\x - aroll)
for all x G X and A G]0,1[, and so f(x) > f(x0) + (x - XQ,XQ) + (1 A)/?/(||z - £o||) for x G X and A G ]0,1[. Letting A -> 0 we obtain that f(x0) + {x- x0,x*0) + Pf(\\x - a:0||) < f{x)
(3.52)
for all x e X . Dividing the last inequality by ||a; — zo|| , taking into account that (x — XQ, XQ) > — \\x — x0\\ • \\XQ\\ and Proposition 3.5.1, we obtain that liminf|| :c ||_ >00 /(a;)/||a;|| > lim'mtt-^co Pf(t)/t2 > 0. From Eq. (3.51) we obtain that / is bounded below on bounded sets, and so, / being coercive, / is bounded from below. Let (xn) C d o m / be a minimizing sequence of / , i.e. (f{xn)) ->• inf / 6 E. Because / is /^/-convex we have
\pf{\\xn - xm\\) < \f{xn) + \f{xm)
<\f{Xn) +
- f{\xn + \xm)
\f{xm)-Mf
for all n,m e N. Because (tn) -> 0 when (tn) C [0,oo[ and (Pf(tn)) -> 0, we obtain that (x„) is a Cauchy sequence. As X is a Banach space, there exists x e X such that (xn) -»• x. The function / being lsc, inf/ < f(x) < lim/(a; n ) = inf/. Therefore f(x) = inf/, and so 0 € df(x). Replacing (X0,XQ) by (x, 0) in Eq. (3.52) we obtain that VxeX
: P/(||z-:r||)(aO-/Cr).
From this relation it follows immediately that every minimizing sequence of / converges to ~x. Let now x* G X*. Since f — x* is uniformly convex, by what precedes, there exists x € X such that f(x) — (x, x*} < f(x) — {x, x*) for every x E X, i.e. x* G 0/(x). Therefore I m 9 / = X*. D The subdifferential of / G T(X) is onto even in slightly weaker conditions. We call / : X —> E strongly coercive if rim||a.||_>00 /(a;)/ ||a;|| = oo.
Uniformly convex and uniformly
smooth convex
functions
215
Corollary 3.5.9 Let X be a Banach space and f G T(X). Assume that f is strongly coercive and uniformly convex on every convex and bounded subset of X. Then Im df = X*. Proof. Let us show first that / attains its infimum on X. Indeed, let A > inf/. Since / is coercive, there exists r > 0 such that [/ < A] C rUxSince / is uniformly convex on C := rUx, we have that g := f + LC is uniformly convex and lower semicontinuous. From the preceding result there exists x G X such that g(x) = inf g. Since inf = inf/, it follows that f(x) = inf / . Let now x* G X* be arbitrary. Then f — x* satisfies the hypotheses of our statement, and so / - x* attains its infimum on X. Thus 0 G Im<9(/— x*), whence x* G Imdf. D In the next result we give several (characteristic) properties of uniformly convex functions. T h e o r e m 3.5.10 Let X be a Banach space and f G T(X). The following statements are equivalent: (i) / is uniformly convex; (ii) 3 V i £ r 0 , \/x,yedomf (hi) 3 ^ G To, V(x,x*)
: f{y) > f{x)+f'{x,y-x)+M\\y
~ x\\);
G gidf, Vy € X :
f(y)>Hx) + (y-x,x*) + M\\y-x\\); (iv) 3^3 G To, V(x,x*),(y,y*)£gvdf Hy)>f{x)
+
(y-x,x*)+M\\v-x\\);
(v) 3 V > 4 e r 0 , V(x,x*),(y,y*)£grdf {y-x,y*
:
:
-x*) > ip4(\\y - x\\);
(vi) 3
V (*,*•), (y,y*) Cgrdf
: \\y* - x*\\ >
(vii) 30eN0nn0,
V(x,x*), (y,y*)egrdf
•. \\y-x\\ < e(\\y* -x*\\);
(viii) (df)-1 is single-valued and uniformly continuous on Imdf; (ix) / * is uniformly smooth; (x) 3 ( n G S i , V ( x , x * ) G g r 9 / , \/y* £ X* : /*(!/*) < r ( x * ) + ( x , y * - x * ) + C T l ( ! | y * - x * | | ) ;
(3.53)
216
Some results and applications
of convex analysis in normed
(xi) d o m / * = X * and 3
£ grdf
spaces
:
(y-x,y*-x*)
(df)-1(i) O (ix) follows from Theorem 3.5.5. (iii) •£> (x) as in the proof Theorem 3.4.3, with a\ := ipf and il>2 •"= of(x) => (ix) First of all let us note that dom/* = X*. Indeed, in our conditions there exists to > 0 such that o-f(to) < 00. From Eq. (3.53) we have that Im df+toBx C dom /*. On the other hand, from the Br0ndstedRockafellar theorem we have that dom/* C Imdf (the closure being taken in the norm topology of X*). Therefore dom/* = X*. Since /* is w*-lsc, /* is ||-||-lsc, and so /* is ||-||-continuous. Let x*, y* e X* and A £ ]0,1[ consider a;^ := (1 - A)a;* + Ay*. Suppose thatCTjdly*- a;*||) < 00. Since Im<9/ = X*, there exists ((un,un)) C grdf such that (u*) —> x\. From the hypothesis we have that /*(**) < f*«)
+ <«n,x* - < ) + ^ ( I K -
/•(1/*) < / * « ) + («n,y* - < ) W I K
x'\\), ~y*\\)-
Multiplying the first relation with 1 - A and the second one with A, then adding them, we obtain that ( l - A ) / * ( 0 + A/*(y*)
< / * « ) + (un,x*x - < ) + (1 -
AMIK
- x*\\) + AdidK - y*\\)-
Uniformly convex and uniformly smooth convex functions
217
Since / * is continuous, df* : X* =4 X** is locally bounded (see Theorem 2.4.13); as g r ^ / ) " 1 C gr<9/*, (df)-1 is locally bounded, too. It follows that the sequence (un) is bounded. Furthermore, we have that ||u* - x* \\ -¥ A | | j / * - i l l < \\y*-x*\\and\\u'n-y*\\^(l-\)\\y*-x*\\ < \\y* - x*\\. Passing to the limit in the above relation and taking into account the convexity of u\, we obtain that (1 - A)/*(a;*) + A/*(y*) < / * K ) + 2A(1 - A ^ d l i , * -
x*\\).
Therefore /* is 2
f*(yn) < f*(x*)+(yn,yn
- **>,
o < (yn -x,x*-
y*n)+o-2(\\yn - x*\\).
Adding these two inequalities side by side we obtain that f*(y:)
+ (x,y*n-x*)+cr2(\\yn-y*\\)
Vn G N.
Because o2 is continuous on [0, oo[\{to}, where to := sup(dom
there exists cx > 0 such that f is p-convex with p(t) := ^tq
(ii) 3c 2 >0,Vx,2/Gdom/ : f(y)>f{x)
+ f'(x;y-x)
+
for
218
Some results and applications of convex analysis in normed spaces
(hi)
3c3 > 0 , V(x,x*)egrdf, f(y)>f(x)
VyGX
+ {y-x,x')
(iv) 3c4>0,V(x,x*),(y,y*)egrdf f(y)>f(x)
(vi) 3c6>0^(x,x*),(y,y*)egvdf
+
f\\x-y\\9;
+
f\\x-y\\q;
:
+ (y-x,x*)
(v) 3c5>0,V(x,x*),(y,y*)€grdf
(vii)
:
: (y - x,y* - x") > *f \\x - y\\q ; : \\x* - y*\\>
2
-f\\x -
yW^1; c
i-j>
there exists cj > 0 suc/i t/tai / * is a-smooth, where a(t) := -1—tp;
(viii) 3c 8 > 0, W(x,x*) e g r 3 / , V i / * e r
:
/*(y*) < /*(**) + (x,y* - **) + ^
\\x* - y*\\p ;
(ix) d o m / * = X * and 3 c 9 > 0 , V(x,x*), (y,y*) e grdf
:
(y-X,y*-X*)<2-^\\X*-y*\f; (x) dom/* = X*, f* is Frechet differentiable on X* and 3c10>0,Vx*,y*eX*
: ||V/*(**) - V/*(i/*)|| < c\oP \)x* - 2 / T " 1 .
Proof. The conditions (i)-(vi), (vii)-(x) of the corollary correspond to conditions (i)-(vi) and (ix)-(xii) of Theorem 3.5.10. One must only put in evidence the relations among the different constants. These follow from the relations among the gauges in (the proof of) Theorem 3.5.10 and among the constants in Corollary 3.5.7. So, the implications (i) =>• (ii) => (iii) =>• (iv) =>• (v) => (vi) =>• (x) and (viii) =3> (ix) hold, with c2 := c\, c3 := c 2 , d := C3, C5 := a, c% :— C5, cio := 2ce/<7 and eg := eg, respectively. For the equivalence (vii) <S> (i) apply Corollary 3.5.4; one obtains the implications with c-j := Ci and c\ := c-j. Applying again Theorem 3.5.10, everyone of the conditions (vii)-(x) implies that /* is finite, Frechet differentiable on X* and grdf* = (grdf)"1 (the former set being considered for the duality (X*,X**)). So, the equivalence of conditions (vii)-(x) with the corresponding relations among the constants C7-C10, is obtained applying Corollary 3.5.7. D Remark 3.5.3 For p = 2 we may take the same c^ = c > 0 for all A; 6 1,10 in the preceding corollary, as the proof shows.
Uniformly convex and uniformly
smooth convex
functions
219
In the next result we add some characterizations of uniformly smooth convex functions on Banach spaces which differ slightly from those of Theorem 3.5.6. Theorem 3.5.12 Let X be a Banach space and f £ T(X). statements are equivalent:
The following
(i) / is uniformly smooth; (ii) /* is uniformly convex; (iii) 3Vi € To, Vy* € X*, V(x,x*)
£ grd/ :
/*(y*) > A * * ) + <*>if - **> + iMIIv* - **ll); (iv) 3ax£
Ei, Vy 6 X, V f o z ' ) £ g r d / : f(y) < fix) + (y-x,x*)
(v) 3^2 £ To, V(x,x*),
+
(y,y*)egrdf
:
(y-x,y*-x*)>ip2(\\y*-x*\\Y, (vi) d o m / = X and 3
-x*)
< cr2{\\y - x\\);
(vii) 3 ^ £ . A o n i V 0 , V(a:,a:*), (y,y*)egrdf
: 111/-i|| > ¥»(||y*-i*||);
(viii) 3 ^ € i V 0 n « o , V(x,x*), ( 2 / , y * ) € g r a / : Hy* - s*|| < fl(||j/ - x\\). In (iii) and (v) one can take V'I and tp2 in TQ instead of T0, while in (iv) and (vi) one can take a\ and a2 in S^ instead ofEi, respectively. Proof, (i) O (ii) is obtained in Theorem 3.5.5. Moreover /* is a*-convex. Because X is a Banach space and / is lsc conditions (iv) and (vi) coincide with conditions (iv) and (v) of Theorem 3.5.6; so the equivalence of conditions (i), (iv) and (vi) follows using this theorem. (ii) =>• (iii) =$• (v) => (vii) are obvious, with tpi := af, fa := 2fa and
x*) < \\y -
Z ||
• Hi,* -x*\\<
\\y - x\\ 6(\\y - x\\) = a6(\\y -
x\\),
220
Some results and applications of convex analysis in normed spaces
where a6(t) := t6(t), and so Eq. (3.42) holds. It is obvious that
Uniformly convex and uniformly
smooth convex functions
on bounded sets
221
uniformly convex. Applying the preceding theorem we obtain that X* is reflexive, and so X is reflexive, too. • 3.6
Uniformly Convex and Uniformly Smooth Convex Functions on Bounded Sets
In the sequel we study the uniform convexity or uniform smoothness on bounded subsets. Having / £ A(X) and r > 0, we denote by pf>r, af>r and a°fr the gages of uniform convexity, uniform smoothness and midpoint uniform smoothness of / on rl/x, respectively. We say that / is uniformly convex on bounded sets if / is uniformly convex on rUx for all r > 0. Similarly, we say that / is uniformly smooth on bounded sets if dom f = X and / is uniformly smooth on rUx for all r > 0. We begin with the following auxiliary results. Lemma 3.6.1 lent:
Let f € T(X).
Then the following statements are equiva-
(i) / is strongly coercive, (ii) V r > 0 , 3 a 6 i , V i £ X : f{x)>r\\x\\
+ a,
(iii) dom / * = X* and f* is bounded on bounded sets. The statements obtained from (i)-(iii) interchanging f, f* and X, X*, respectively are also equivalent. Proof. Since / and /* are proper, convex and lower semicontinuous, they are bounded from below by amne continuous functions, and so they are bounded below on bounded sets. (i) => (ii) Take r > 0; since / is strongly coercive, there exists p > 0 such that f(x) > r \\x\\ for ||x|| > p. Since / is bounded from below by some a < 0 on pUx, we have that f(x) > r \\x\\ + a for all x £ X. (ii) =£. (i) Assume that f(x) > r ||x||+a for every x € X; then, obviously, liminfu^n^oo f{x)/ \\x\\ > r. The conclusion is obvious. (ii) <s> (iii) Let g := r ||-||+a. We have that f> g & f* < g* — iTux. -a <«• f*(x*) < —a for every x* £ rUx*- The conclusion follows. The proof of the other statements are similar. • Proposition 3.6.2
Let f
eT{X).
(i) Assume that f is strongly coercive. If f is uniformly convex on bounded sets then f* is uniformly smooth on bounded sets, while if f is
222
Some results and applications of convex analysis in normed spaces
uniformly smooth on bounded sets then f* is uniformly convex on bounded sets. (ii) Assume that f* is strongly coercive. remain valid interchanging f and /*.
Then the assertions in (i)
Proof. We prove only (i), the proof of (ii) being similar. Let / be strongly coercive. Let r > 0 be fixed. Of course, there exists a e l such that f*(x*) > a + 1 for every x* € (r + l)Ux*. Because / is strongly coercive, there exists r' > 0 such that f(x) > (r + 1) ||a;|| — a for every x 6 X, \\x\\ > r'. Since for x* 6 (r + l)Ux> and ||a;|| > r' we have that (x, x*) - f(x) < (r + 1) ||x|| - (r + 1) ||x|| + a < a + 1, it follows that Vz* € (r + l)t/ x « : f*{x*)=sup{{x,x*)-f{x)\xer'Ux}.
(3.54)
Assume first that / is uniformly convex on bounded sets and take r > 0. The above argument shows that there exists r' > 0 satisfying (3.54). Take t, x*, y*, X, XQ, x*, 70, 71, x0 and xi like in the proof of Proposition 3.5.3(i), but with t e]0,1] and x* € rllx*- Then XQ,X* £ (r + l)Ux*, and so, by Eq. (3.54), we may take xo, x\ 6 r'Ux- Continuing in the same way like in the proof of Proposition 3.5.3(i), but with pjy instead of p, we obtain that &f*,r(t) < (Pf,r')*(t) for every t e]0,1]. It follows that / * is uniformly smooth on rUx* • Assume now that / is uniformly smooth on bounded sets and take r > 0. As above, there exists r' > 0 satisfying Eq. (3.54). Consider XQ,XI € rllx*, x e r'Ux, y £ X, A e]0,1[ and denote x*x := (1 - X)XQ + Xx*. Proceeding like in the proof of Proposition 3.5.3(ii), but with ajy instead of a, and taking into account Eq. (3.54), one obtains that / * is (afy)*-convex on rUx*. The conclusion follows from Lemma 3.3.1. • In the next two results we point out the relationships among uniform convexity on bounded sets, uniform smoothness on bounded sets and uniform continuity on bounded sets of the gradient. Proposition 3.6.3 statements:
Let f G T(X) be convex.
Consider the following
(i) / is bounded and uniformly smooth on bounded sets, (ii) / is Frechet differentiable on X = d o m / and V / is uniformly continuous on bounded sets, (hi) / * is strongly coercive and uniformly convex on bounded sets.
Uniformly convex and uniformly
smooth convex functions
on bounded sets
223
Then (i) O (ii) <= (iii). Moreover, if f is strongly coercive then (i) =$• (hi); in this case X* is reflexive (also X is reflexive if X is a Banach space). Proof, (i) =£• (ii) Since / is bounded on bounded sets, by Theorem 2.4.13, / is Lipschitz on bounded sets; since / is uniformly smooth on bounded sets, / is smooth at any x £ X, and so / is Frechet differentiable on X (see Theorem 3.3.2). In order to show that V / is uniformly continuous on bounded sets, let r > 0 and L > 0 be a constant Lipschitz for / on (r + l)Ux- Consider x,x' 6 rUx, t G]0,1] and y € Sx- Taking into account the inequalities f{x + ty) + f{x - ty) - 2 fix) > fix + ty) - fix) - t (y, V/(x)> > 0, and the similar one for x', we have that |V/(i)(ty)-V/(a;')(ty)| < |/0r) - fix')\ + \fix + ty) - fix' + ty)\ + | / ( * + ty) - fix) - Vfix)ity)\ <2a°firit)
+
+ \fix' + ty) - fix') - V/(x # )(«y)|
2L\\x-x'\\.
Hence ||V/(aO - V/(x')|| = s u P y 6 S x t - 1 \Vfix)ity)
- V/(z')('2/)l
<2t-1o-°f
+ 1)UX, \\x - x'\\ < t}.
224
Some results and applications of convex analysis in normed
spaces
Then for t € ]0,1], x G rUx and y € Sx we have that
f{x + ty) + f(x - ty) - 2f(x) = (f(x + ty) - f{x)) + (f{x - ty) -
f(x))
= V / ( x + 6y)ity) - Vf(x - ffy){ty) = * (V/(x + Oy) - Vf(x < t \\Vfix + Oy) - Vfix
- O'y)) (y)
- 9'y)\\ < tu{2t),
where 6,6' € [0, t] are obtained by using the mean value theorem. It follows that a0,rit) < iw(2£) for t 6 [0,1], and so, because V / is uniformly continuous on rUx, f is uniformly smooth on rUx(iii) =J» (i) Because /* is strongly coercive, by Lemma 3.6.1 / is bounded on bounded sets; using Proposition 3.6.2(ii) we obtain also that / is uniformly smooth on bounded sets. (i) =• (iii) (When / is strongly coercive.) Again, by Proposition 3.6.2(i) / * is uniformly smooth on bounded sets, while from Lemma 3.6.1 we obtain that / * is strongly coercive. In this case / * + t r c/ x . is uniformly convex and continuous at 0, and so, by Theorem 3.5.13, X* is reflexive. Of course, if X is a Banach space, X is reflexive, too. • P r o p o s i t i o n 3.6.4
Let f £ T(X). Consider the following
statements:
(i) / is strongly coercive and uniformly convex on bounded sets, (ii) / * is bounded and uniformly smooth on bounded sets, (iii) / * is Frechet differentiable on X* = dom /* and Vf* is uniformly continuous on bounded sets. Then (i) => (ii) o (iii). Moreover, if f is bounded on bounded sets then (ii) =*• (i); in this case X* is reflexive (also X is reflexive if X is a Banach space). Proof, (ii) •& (iii) follows from the preceding proposition applied to /*. (i) => (ii) follows immediately applying Lemma 3.6.1 and Proposition 3.6.2. (ii) => (i) (When / is bounded on bounded sets.) By Lemma 3.6.1 / * is strongly coercive; being also uniformly smooth on bounded sets, using Proposition 3.6.2, we have that / is uniformly convex on bounded sets. In this case, since X* is a Banach space, we obtain that X* is reflexive applying the preceding proposition; so, if X is a Banach space then X is reflexive. •
Uniformly convex and uniformly
smooth convex functions
on bounded sets
225
In the next result we characterize uniform convexity on bounded sets without asking coercivity of the function. P r o p o s i t i o n 3.6.5 Let X be a Banach space and f € T(X). Then f is uniformly convex on bounded sets if and only « / $ / , B ( £ ) > 0 for every t > 0 and every bounded set B C X with B n dom / ^ %, where $},B{t) := inf{/(y) - f(x) - f'{x,y-
x)\xeBndomf, 2 / € d o m / , \\y-x\\
= *}.
Proof. Assume that / is uniformly convex on bounded sets and consider B C X a bounded set with B D d o m / ^ 0. Let r > 1 be such that B C (r — l)Ux- By hypothesis g := f + LTUX is uniformly convex. Hence, from Theorem 3.5.10, there exists ip € To such that g(y) > g(x) + g'(x,y — x ) + i>(\\y ~ x\\) f° r a u %>y £ domp, whence /(y) > /(a;) + g'(x,y — x) + ip(\\y — x\\) for all x 6 -BH dom / and y £ rf/x H dom / . But for such x and y we have that g'(x,y — x) = f'{x,y - x). It follows that /(y) > f(x) + f'(x,y — x) + if>(t) for all x £ B 0 d o m / , y € d o m / with ||y — x\\ = t and £ € (0,1]. Since t9/,s = infx£.Bndom/ $/,x (see page 201 for the definition of tf/.z) and i9/]X € TVi, # / , B S N\, too; in particular $/,# is nondecreasing. But •df>B(t) > i/j(t) for t € [0,1], and so i?/,s(*) > 0 for every t > 0. Conversely, assume that $ftB if) > 0 for every t > 0 and every bounded set B C X with B n d o m / ^ 0. Consider r > 0 such that rt/x n d o m / ^ 0. T h e n / 0 / ) > /(a;) + /'(a;, 2/ - x) +tif,rUx (\\y - x\\) for all x <= r a n d o m / and y € d o m / . Taking # = f + huXi w e obtain that g(y) > g(x)+g'(x,y — x) + flf,rUx{\\y ~ x\\) f° r a u ^iJ/ € domg. As seen above, i?/ r s € N\. By our hypothesis we have that flf^B € .Ao; using Exercise 3.12 we obtain that •tp = COI?/,B S TQ. It follows that condition (ii) of Theorem 3.5.10 holds, and so g is uniformly convex. Therefore / is uniformly convex on bounded
sets.
•
We end this section with characterizations of uniform convexity and uniform smoothness on bounded sets in the case X = W1. P r o p o s i t i o n 3.6.6
Let f € r ( R n ) .
(i) Assume that either n — 1 or dom / is closed and /|dom/ is continuous. Then f is uniformly convex on bounded sets if and only if f is strictly convex. (ii) Assume that dom / = E". Then f is uniformly smooth on bounded sets if and only if f is differ•entiable (Gateaux or Frechet).
226
Some results and applications of convex analysis in normed
spaces
Proof. (i) Suppose first that / is uniformly convex on bounded sets. Taking into account that pftX is nondecreasing for x G d o m / , it follows that / is uniformly convex at any x G d o m / . Then, as observed at the beginning of Section 3.5, / is strictly convex. Assume now that d o m / is closed and /|dom/ is continuous but / is not uniformly convex on bounded sets. Then there exists r > 0 such that rUx H dom / ^ 0 and / + trux is not uniformly convex. Taking into account Exercise 3.3 we have that for some e > 0 and any n € N, there exist xn,yn G rUx H dom / such that \xn — yn\ = 2e and §/(*») + §/(l/») < / {\(Xn + Vnj) + £•
(3.55)
Since (xn) and (yn) are bounded we may suppose that (xn) -> x G cl(dom / ) and (yn) —> y G cl(dom/); of course, ||a; — y\\ = 2e. Because d o m / is closed and /|ci(dom/) is continuous, x,y,\{x + y) G d o m / and (f(xn)) ->• f(x), (/(l/n)) -»• f(y), {f{\{xn + Vn))) -»• / ( | ( x + y)). Taking the limit in Eq. (3.55) we get \f(x) + \f{y) < f(^(x + y)), and so / is not strictly convex. In the case n = 1, without any supplementary condition, by Proposition 2.1.6 we have that /| c i(dom/) is continuous. As above, we get (xn), (yn) C [—r, rjndom / and e > 0 for which Eq. (3.55) holds. Taking x, y € cl(dom / ) the limits of (xn) and (yn), we have that \f{x) + \f{y) < f(\(x+y)). Since | ( x + y) £ int(dom/), the preceding inequality implies that x,y € d o m / , and so / is not strictly convex in this case, too. (ii) Since d o m / = E n , by Corollary 2.2.21, / is continuous, and so / is bounded on bounded sets. On the other hand, because the Gateaux and Frechet bornologies coincide on Rn, Theorem 3.3.2 shows that Gateaux and Frechet differentiability of / coincide in our case. Moreover, by Corollary 3.3.3, V / is continuous when / is differentiable, and so V / is uniformly continuous on bounded sets in such a case. The conclusion follows applying Proposition 3.6.3. D
3.7
Applications to the Geometry of Normed Spaces
The subdifferential of the norm has an interesting geometric interpretation. We have already seen that a hyperplane is a set Hx*ta := {x € X \ (x, x*) = a}, where x* G X* \ {0} and a G R; if dimX > 1 then Hx. 0. One can establish easily (Exercise!) that for
Applications
to the geometry of normed spaces
227
x*,y* G Sx*, a,/3 > 0 we have Hx* 0; then Hx*>a is a supporting hyperplane of Ux atx£ Sx if and only if a = 1 and x* G d\\ • \\(x). Therefore the set of supporting hyperplanes of Ux at x G Sx is {ff x .,i | x* e a|| • ||(x)}
(3.56)
(Exercise!). If Ux has a unique supporting hyperplane at every point of its boundary Sx we say that X is smooth; note that this is a geometric property (of the norm), but not a topological property. The dual property (as we shall see later on) is that of strictly convex space. We say that X is strictly convex if
Vx,y€X,x*y,
1^1 = 115,11 = 1
:
||I( I + y )|| < 1.
Of course, the above condition is equivalent to \{x + y) G Bx for all x,y G Ux, x^y. Throughout this section ip : R+ —>• K+ is a weight function, i.e.
fv(x) := jf 1 1 y>(a) da.
(3.57)
In the sequel we shall consider also ip : E+ ->• 1 + ,
tp{t) := /„*
(3.58)
and so fv(x) = V'CII^II) f° r every x G X. An important example of weight function is given by tp(t) = t p _ 1 (and so tp(t) = Hp) for p G]l,oo[. In the next lemma we collect some properties of the function if) defined above. Lemma 3.7.1 Let ip be a weight function and ip be defined by Eq. (3.58). Then ip is strictly convex, derivable with ip' — if onR+, lim^oo t~li4>{t) — oo, rp*(t) := Jg ¥'~ 1 (s) ds and tp(t) + ip#(s) = ts if and only if s = tp(t). Proof. It is obvious that i/> is derivable and ip'(t) — ip(t) for t > 0. Moreover, limj-xx, i(>(t)/ \t\ = limt-yoo tp'(t) = oo. Let s > 0; the function 6 : E+ -)• R defined by 9{t) := st - ip{t) is derivable and 6'(t) = s -
228
Some results and applications of convex analysis in normed spaces
It follows that 8 is increasing on [0,
i,#(s) = s
= ff^uM")
= ti
where the second equality is obtained by integrating by parts in a RiemannStieltjes integral, while the last equality is obtained by changing the variable: u = y -1 (i>). • Having a weight function
Let ip be a weight function.
Then:
(i) fv is convex and continuous. (ii) (Uy = i>*o\\.\\,i.e. allx* EX*. (hi)
(Unx')
= 4,*{\\x'\\) =
lSX'n'P-1(t)dtfor
For every x £ X one has dU(x)
= {x* € X* | (x,x*) = \\x\\ • 11*1, ||x*|| = v'(INI)} =
dfv(-x)
=
(iv) The following statements are equivalent: (a) X is smooth, (b) fv is Gateaux differentiable on X, (c) || • || is Gateaux differentiable on X \ {0}. Moreover, if X is smooth then Vfv is norm-weak* continuous. (v) The following statements are equivalent: (a) X is strictly convex, (b) fv is strictly convex, (c) dfv is strictly monotone, (d) Vx,y 6 X,
x^y
: dfv(x)ndfv(y)
= 9.
(vi) X is reflexive <$ X is a Banach space and lm(dfv)
= X*.
Proof. Let us denote fv by / . (i) Let x, y 6 X and A e]0,1[. Then f(Xx + (1 - X)y) = fl>(\\Xx + (1 - A)y||) < xP(X\\x\\ + (1 - A)||y||) < \j,{\\x\\) + (1 - AMHi/H) - Xf(x) + (1 -
X)f(y),
the last but one inequality being strict if ||a;|| ^ \\y\\ (because xp is strictly convex). Therefore / is convex. It is obvious that / is continuous.
Applications to the geometry of normed spaces
229
(ii) The conclusion is immediate from Lemma 3.3.1 and Lemma 3.7.1. (iii) We have that x* 6 df(x) & f(x) + /*(x*) = (x,x*) «• V(||z||) + ^ # (||z*||) = (x,x*). Because (x,x*) < ||x|| • ||x*||, using Lemma 3.7.1, we obtain that x* e df(x) & (x,x*) = ||x|| • ||x*|| and ||x*|| =
X)f(y)
if ||x|| =4 \\y\\. Let now ||x|| = ||y|| — p; of course p > 0 (otherwise x = y = 0!). We want to prove that ||#(A)|| < p, where 6{p) := p.x + (1 - p ) y . Let us consider e := min{A, 1 - A} > 0. It is obvious that ||#(A — e)|| < p, \\6(X + e)|| < p and 0(A) = \6{X - e) + \d{X + e); moreover 6(X - e) ^ 6(X + e). Thus we have ||6>(A)|| < p. Therefore /(Ax + (1 - X)y) = V(||Az + (1 - X)y\\) < i>(p) = Atf(IMI) + (1 - A)V(NI) = A/(x) + (1 -
X)f(y).
The implication (b) =>• (c) follows from Theorem 2.4.4(H), while the implication (c) => (d) is obvious. (d) =*• (a) Let x,y £ Sx, x ^ y. Suppose that \(x + y) £ Sx and let x* £ 0/(2±X). Then, by (iii), y>(l) = ¥>(IMI) = ¥>(lll/ll) = v C H ^ I D = 11**11 and ¥>(!) = ( | ( x + y),x*) < \\\x\\ • ||x*|| + \\\y\\ • \\x*\\ = y>(l).
230
Some results and applications of convex analysis in normed spaces
Therefore ||x|| • ||a;*|| — {x,x*) and \\y\\ • \\x*\\ — {y,x*). Using again (iii) we obtain the contradiction x* G df(x)ndf(y). Therefore |||(a; + 2/)|| < 1, which shows that X is strictly convex. (vi) Suppose that X is reflexive, and let a;* EX*. Let us consider the function g : X —> R, g{x) := f(x) — (x,x*). It is obvious that g is convex, continuous and g(x) > ip(\\x\\) — \\x\\ • \\x*\\; therefore lim^x^00g(x) — oo. By Theorem 2.5.1(h) there exists x G X such that g(x) < g{x) for every x G X, i.e. (x — x,x*) < f{x) — f(x) for every x. This relation proves that x* G df(x). Thus we have Im(df) = X*. Conversely, suppose that X is a Banach space and Im(df) = X*. Let x* G X*; taking into account James' theorem (see [Diestel (1975), Th. 1.6]), it is sufficient to show that x* attains its supremum on Ux- Of course, we may suppose that ||x*|| = y ( l ) . By hypothesis, there exists x € X such that x* G df(x). Therefore (x,x*) = \\x\\ -\\x*\\ and ||x*|| = y(||x"||) = (/'(l). Hence ||x|| = 1 and VxeUx
• (x,x*)-\\x\\'\\x*\\
=
\\x*\\>(x,x*).
Thus x* attains its supremum on Ux at x.
•
The multifunction $ x •'= d{\ ||-||2) : X =| X* is called the duality mapping of X. Therefore $jc(:c) = {x* G X* | (x,x*) = \\x\\2 = ||a;*||2 } .
(3.59)
From the preceding theorem we obtain that $ x is single-valued -£> X is smooth •O || • || is Gateaux differentiable o n I \ {0}. Moreover, if X is smooth then $ x is norm-weak* continuous, identifying $x{%) with its unique element for every x E X. Related to the relationships between strict convexity and smoothness of a normed space and its dual one has the following result. T h e o r e m 3.7.3 If X* is smooth (resp. strictly convex), then X is strictly convex (resp. smooth). If X is a reflexive Banach space then X* is smooth (resp. strictly convex) if and only if X is strictly convex (resp. smooth). Proof. To begin with, let X* be smooth and x,y G Sx, x ^ y; suppose that |(a; + y) G Sx- Taking x* G ^ ( ^ j f 2 ) , as in the proof of assertion (v)
Applications to the geometry of normed spaces
231
of the preceding theorem (taking ip(i) = t for t > 0), we obtain that l = \\x\\2 = \\y\\2 = \\r\\2 = (x,x*) = (y,x*). From this relation we obtain that {a;* £ X* \ (x,x*) = 1} and {x* £ X* | (y, x*) = 1} are distinct supporting hyperplanes of Ux* at x* £ Sx*, which contradicts the fact that X* is smooth. Therefore X is strictly convex. Suppose now that X* is strictly convex but X is not smooth at x £ SxThen there exist a;*, y* £ Sx* such that {x £ X \ (x,x*) = 1} and {x 6 X | (x,y*) = 1} are distinct supporting hyperplanes of Ux at x. Therefore x* ^ y* and <5f,x*) = <3F,y*> = (x,l(x*+y*))
< ||3f|H|±(:i:'+i,*)|| = | | | ( x * + y ' ) | | < 1.
Because (af,a;*) = 1, we obtain that |||(a;* + y*)|| = 1, which contradicts the fact that X* is strictly convex. Therefore X is smooth. If X is reflexive and if X is strictly convex (resp. smooth) then X** is strictly convex (resp. smooth), whence, using the first part, we have that X* is smooth (resp. strictly convex). • Let us introduce other possible geometric properties of a normed space. We say that X is uniformly smooth at x if V £ > 0 , 35 >0, WyeSUx
: \\x + y\\ + \\x - y\\ - 2 ||z|| < e \\y\\,
locally uniformly smooth if X is uniformly smooth at any x £ Sx and uniformly smooth if V e > 0 , 3 5 > 0 , V z £ S x , Vy£5Ux
: \\x+ y\\ + \\x - y\\-2\\x\\
< e\\y\\;
it is obvious that X is not uniformly smooth at 0. Consider the gage of uniform smoothness of the norm at x defined by ax : 1+ -> K+, ffx(t) := sup{||x + ty\\ + \\x - ty\\ - 2 ||z|| | y £
Sx};
it is clear that ax(t) < 2t \\x\\ for t > 0 and ax £ T. It is obvious that X is uniformly smooth at x if and only if <jx € S i . Considering also the gage of uniform smoothness of the norm of X defined by ax •• K+ ->• M+,
0\x- := sup{
we have that ax(t) < 2t for t > 0 and crx € I\ too. So, X is uniformly smooth if and only if ax £ S i . In the next result we give some characterizations of the previous notions.
232
Some results and applications of convex analysis in normed spaces
Theorem 3.7.4
Let
(i) Let x € X \ {0}; X is uniformly smooth at x O ||-|| is Frechet differentiable at x O / ^ is Frechet differentiable at x; (ii) X is locally uniformly smooth & ||-|| is Frechet differentiable on X \ {0} «• / , is Frechet differentiable ^ / v G C ^ X ) ; (iii) X is uniformly smooth & fv is uniformly smooth on rUx for some (every) r > 0. Proof. Denote fv by / . (i) The first equivalence is immediate from Theorem 3.3.2, while the second one is obvious because / = ipo ||-||, where ip is denned by Eq. (3.58); ip and V' - 1 are derivable on F. (ii) This is an immediate consequence of (i); notice that / is always Frechet differentiable at 0 and V/(0) = 0. The last equivalence follows from the final part of Theorem 3.3.2. (iii) Assume first that / is uniformly smooth on rUx (even on rSx) for some r > 0. Then there exists a 6 £l\ such that f(x + ty) + f(x — ty) — 2/(z) < o-(t) for all x £ rSx, y S Sx and t > 0, or equivalently rl>(\\x + ty\\) + r/,(\\x - ty\\) - 2tK\\x\\) < a(t) for all x 6 rSx, y 6 Sx and t > 0. Because ip(t) > i/j(t0) + ¥>(to) • (t - to) for all t, to > 0, we obtain that Mr)
(||u + tr~ly\\ + ||u - tr-ly\\
- 2 ||u||) < a(t)
for all u,y e Sx and t > 0. It follows that rip(r) • ax(t) < cr(ir) for t > 0, and so ax S f^i. Hence X is uniformly smooth. Assume now that X is uniformly smooth; then ax G T fl Hi. Let r > 0. Because the function xp : E -> E, %/>(t) := V'd*!)) i s uniformly smooth on [0,r] (being derivable on E), there exists a £ fti such that V>(s) < VW + (s - t) • V'(i) +
\\x\\\), \\x\\\),
and so, because |||a; ± ty\\ — \\x\\\ < t and a is nondecreasing, f(x + ty)+f(x-ty)-2f(x)
< ip(\\x\\) (\\x + ty\\ + \\x - ty\\ - 2 ||ar||) + 2<7(t)
Applications
to the geometry of normed spaces
233
for x £ rUx, y £ Sx and t > 0. Taking 7 £ ]0, r[, we obtain that y?(||z||) (||i + ty\\ + \\x - ty\\ - 2 ||x||) < 2 ^ ( 7 ) for a; 6 7C/X and >(||z||) (||z + ti/|| + ||x - ty\\ - 2 ||x||) < r
r
for x £ rC/x \ ~fUx • Therefore a°Lr(t) < max (2t
W 4 i , (\\xn\\) -> ||x|| =* ( x „ ) ^ . Similarly, X* (with its dual norm) has the weak* Kadec-Klee property if the above property holds in X* but with w replaced by w*. Proposition 3.7.5
The following implications hold:
X is uniformly smooth =>• X is locally uniformly smooth => X is smooth. Moreover, if X* is smooth and has the weak* Kadec-Klee property, then X is locally uniformly smooth. Proof. The above implications obviously follow from Theorems 3.7.2(iv) and 3.7.4. Assume now that X* is smooth and has the weak* Kadec-Klee property. Let
234
Some results and applications of convex analysis in normed spaces
so (Vfv(xn)) -4 Vfv(x). Using now Theorem 3.3.2 we obtain that fv is Frechet differentiable. The conclusion follows from Theorem 3.7.4. • Consider now two other notions. We say that X is locally uniformly convex if VxeSx,
V £ > 0 , 36 > 0 , Vy£Sx
: \\y - x\\ > e =» ||±(:r + y)\\ < 1 -<J,
and X is uniformly convex if V e > 0 , 3 J > 0 , Vx,yeSx
: ||z/ - x|| > £ => HK^ + ^ l < 1 - <*•
Of course, every uniformly convex space is locally uniformly convex, and any locally uniformly convex space is strictly convex. Moreover, note that X is locally uniformly convex if and only if for every x € Sx and every sequence (xn) C Sx, {\\^(x + xn)\\) -> 1 => (\\x„ - x\\) - • 0, while X is uniformly convex if and only if for all sequences (xn), (yn) C Sx, {\\^(xn + Z/n)||) —• 1 => (||a;„ - y n ||)-> 0. The next result shows that any locally uniformly convex space has the Kadec-Klee property (even in a stronger form). Proposition 3.7.6 Assume that X is a locally uniformly convex space. If the net {xi)i^i C X converges weakly to x 6 X and (||x»||)tei -> ||a;||, then (\\xi - x\\)i
(||XJ||
- x j ) i e / 4- ||x|| • x', i.e. •
In the next result we give dual characterizations for uniformly convex and locally uniformly convex spaces.
Applications
Theorem 3.7.7
to the geometry of normed spaces
Let ip be a weight function.
(i) X is locally uniformly convex 4* fv x e X,
235
Then is uniformly convex at any
(ii) X is uniformly convex <$• fv is uniformly convex on rUx for any (some) r > 0. Proof. Denote again fv by / . (ii) Assume first that X is uniformly convex, but fv is not uniformly convex on rUx for some r > 0. Taking into account Exercise 3.3, there exists eo > 0 such that for every 6 > 0, there exist x,y £ rUx with
||z - 2/|| = eo and f{\x + \y) > \f{x) + \f{y) - 8. Taking (Sn) I 0, there exists (xn), (yn) C rUx such that \\xn — yn\\ = eo and ip{\\\xn + ±yn\\) > ty(\\xn\\) + %ip(\\yn\\) - $n for every n 6 N. Let xn = r)nun and yn = finvn with rj n ,/i n G [0,r] and un,vn € Sx for every n e N. Then for every n € N we have that ^{kVn
+ \Hn)
> Ip iW^VnUn
+ ^HnVn\\)
> | ^ ( » ? n ) + | V > ( A O ~ <*n- ( 3 . 6 0 )
Passing to a subsequence if necessary, we may suppose that (r/n) -* t] £ [0,r], (/i n ) -> n € [0,r] and ( | | | u n + §v n ||) -> 7 € [0,1]. From Eq. (3.60) we obtain that ip (^77 + |/j) > \ip{rj) + \ip{p). Because ip is strictly convex we obtain that 77 = /J. Since £0 = ||%Wn ~ MnVnll < Vn \\un ~ Vn\\ + \r)n - fln\
< Znn + fln
(3.61)
for every n € N, it follows that n > 0. Since |||7? n u„ + | ^ „ ? j n | | < r ? n | | i u n + |?j„|| + ! | j u „ - J 7 „ |
V n
^ N,
from Eq. (3.60) we obtain that ip(rfy) > V'(7?)) a n d so 7 > 1. It follows that ( | | | u n + ^vn\\) -> 1, and so (||u n — u n ||) -» 0. Passing to the limit in Eq. (3.61), we get the contradiction eo < 0. Therefore / is uniformly convex. Assume now that / is uniformly convex on rUx for some r > 0. Then for e > 0 and x,y € Sx with \\x — y\\ > e we have that / (\rx + \ry) < \f{rx) + \f{ry) - \pJ>T (r \\x - y\\), and so $ (r | | | i + |j/||) < >(r) | p / i r ( r e ) < ^ ( r ) . Taking r' E]0,r[ such that ip{r') = ip(r) — \pf,r(re) and S :— 1 — r ' / r > 0, we have that |||a; + | j / | | < 1 — S. Therefore X is uniformly convex.
236
Some results and applications of convex analysis in normed spaces
For the proof of (i) fix x £ Sx for the sufficiency (and take pftX instead of /3/,A) and fix X 6 X for the necessity. In this case the sequence (nn) is constant while (fj,n) C [0, oo[ is bounded. Then we can proceed as in the proof of (ii). • We end this section with two results which prove the complete duality between uniform smoothness and uniform convexity. Theorem 3.7.8 are equivalent:
Let
The following
statements
(i) X is uniformly smooth, (ii) ftp is uniformly smooth on rlJx for some (any) r > 0, (iii) ftp is Frechet differ•entiable and Vfv bounded sets,
is uniformly continuous on
(iv) X* is uniformly convex, ( v ) (f
Let ip be a weight function.
The following
statements
(i) X is uniformly convex, (ii) f^ is uniformly convex on rUx for some (any) r > 0, (iii) X* is uniformly smooth, (iv)
{ftp)* is uniformly smooth on rUx* for some (any) r > 0,
(v) (ftp)* is Frechet differentiable and V/£ is uniformly continuous on bounded sets. If one of the above conditions holds and X is a Banach space then X is reflexive. Proof. The proof is similar to that of the preceding theorem (and follows from the preceding theorem when X is reflexive). •
Applications
to the best approximation
problem
237
We end this section with another sufficient condition for refiexivity of a Banach space. Proposition 3.7.10 Let X be a Banach space. If the dual norm is Frechet differentiable on X* \ {0} then X is reflexive. Proof. Let ip be a weight function. Then, by Theorem 3.7.2(ii), {ftp)* = / v - i (on X*). Because the dual norm is Frechet differentiable on X* \ {0}, using Theorem 3.7.4(h) we have that {fv)* is Frechet differentiable on X*. By Corollary 3.3.4 we have that V(/¥>)*(a;*) G X for every x* G X*, and so Im(9/ V ) = X*. By Theorem 3.7.2(vi) we obtain that X is reflexive. •
3.8
Applications to the Best Approximation Problem
Let C C {X, || • ||) be a nonempty set and consider the distance function dG : X -»• K,
dc{x) := d{x, C) := ini{\\x - c|| | c G C).
(3.62)
It is well known that dc is Lipschitz with Lipschitz constant 1 (Exercise!). Having i 6 X , a n important problem consists in determining the set Pc{x) = {x e C | ||x - x\\ = dc{x)}; x € Pc{x) is called a best approximation of x by elements of C. It follows that the multifunction Pc introduced above is bounded on bounded sets. Indeed, if x G rUx, x G Pc{x) and c is a fixed element of C, then \\x\\ < \\x — 3;|| + ||a;|| < ||a; — c|| + ||a;|| <2r+ \\c\\. Remark 3.8.1 [0,1].
If x G Pc{x) then x e Pc (Ax + (1 - \)x) for every A G
Indeed, taking x\ := Xx + (1 — X)x, this follows immediately from the inclusion B {x\, \\x\ — x\\) C B {x, \\x — af||). The following notion is important in the best approximation theory. We say that the nonempty set C C X is a Chebyshev set if Pc{x) is a singleton for every x G X. Note that for xo G X, Pc{xo) is the set of optimal solutions for everyone of the following minimization problems: {Pi)
min \\x — Xo\\, x € C,
(P 2 )
min | | | a ; - x o | | 2 , x G C.
Moreover v{Pi) = dG{x0),
v{P2) -
\d2c{x0).
238
Some results and applications of convex analysis in normed spaces
In the (best) approximation theory, the fundamental problems are: the existence, the uniqueness and the characterization of best approximations. Because Pc{x) = {x} if x £ C, Pc(x) = 0 if x £ (clC) \ C, and Pc(x) = 0 for every x £ X \ C if C is open (Exercise!), it is quite natural to consider, generally, elements XQ £ X \ C and to assume that C is a closed set (taking into account also that dc(xo) = ^cic(^o))- Because we propose ourselves to apply the results on convex functions obtained till now (but this is not the sole reason), we shall assume in this section that C is convex, too. The first result gives sufficient conditions for the existence and the uniqueness of best approximations, respectively. Theorem 3.8.1 Let C C X be a nonempty closed convex set and xo € X. (i) If X is a reflexive Banach space then PC{XQ) ^ 0. (ii) If Xo :— linC is a space of finite dimension then
PC{XQ)
^ 0.
(iii) If X is a strictly convex normed space then Pc(xo) has at most one element. Proof, (i) Let us consider the function / := || • —xo\\ + ic- By hypothesis / is convex and lsc; moreover lim||a,||_>0O/(a;) = oo. By Theorem 2.5.1(h), there exists x £ X such that f(x) < f(x) for every x £ X, i.e. x £ Pc(x0). (ii) There exists (xn)n>i C C C X0 such that (||a;n - £ o | | ) ~» dc{xa). It follows that (xn) is bounded. The subspace Xo being of finite dimension, Xo is isomorphic to W (p = dim Xo) whence (xn) contains a subsequence (xnk) convergent to x £ Xo C X; hence (\\xnh -xo\\) -> ll^-^oll = dc{xo). Since the set C is closed, x £ C. Therefore x £ PC{XQ). (iii) We have already seen that Pc(xo) — {%o} for xo £ C. Let XQ $. C. Suppose that there are two distinct elements x\,X2 in Pc(xo). Then \(x\ +x2) £ C, and \\xi - x0\\ = \\x2 - x0\\ = dc(x0) > 0Since X is strictly convex, we obtain | | | ( z i +x2) -x0\\
= |||(a;i - x0) + \{x2 - x0)\\ < \\\xx - a;0|| + \\\x2 - a;o|| =
dc{x0).
This contradiction proves that Pc(xo) has at most one element.
•
Prom the preceding theorem we obtain that any closed convex subset of a strictly convex and reflexive Banach space is Chebyshev. In the next section we shall show that any weakly closed Chebyshev subset of a Hilbert space is convex.
Applications
to the best approximation
problem
239
The following duality result reveals itself to be useful sometimes. Theorem 3.8.2 X\C. Then
Let C C X be a nonempty closed convex set and XQ €
dc(%o) —
max inf (XQ — x,x*), %*&x* zee
Tidrixn) = max inf ((x 0 — x,x*) - k\\x*\\2) • If C is a cone, then dc(xo) = Proof.
max x*eux.n-c+
(x0,x*),
l\d?r(xo)
=
max ((x0,x*) - Jh\\x*\\2) . x*e-c+ v '
Let
/ ! , / 2 : X - > R , h{x):=\\x-x0\\,
h(x) := \\\x - xa\\2.
(3.63)
Using Fenchel-Rockafellar duality formula (Corollary 2.8.5) and the formulas for the conjugates of the norm and its square, we get dc(x0)
= inf (A(a;) + ic(x)) = max ( - / * ( - i * ) - i*c(x*)) =
\dl{xo)
max x*eux,
[ (XQ,X*) — sup(x,x*)\ = max inf (XQ — x,x*), \ J x*eux. xec x€C
= M(Mx)
+ icix)) = max ( - / ; ( - * • ) - £ ( * * ) )
= max [ (x0,x*) — ^\\x*\\2 -
sup(x,x*}
= max inf ((in — x, ,x*) — k\\x*\\2) • When C is a cone, LC(X*) = 0 if x* G -C+, follows immediately.
— oo otherwise; the conclusion •
In the next result we summarize several properties of dc and \$QProposition 3.8.3
Let G C X be a nonempty convex set. Then
(i) (dc)* = uUx, +tc
= iUx. + sc and ( | ^ , ) * = \ ||-||2 + sc.
(ii) If x £ X and x € PQ (X) then ddc{x) = d\\-\\(x~x)nN(C,x),
d(\dl){x)
= $x(x-x)nN{C,x);
(3.64)
in particular, for every x G C we have that ddc{x) = Ux.nN(C,x),
di±d?c)ix) = {0}.
(3.65)
240
Some results and applications of convex analysis in normed spaces
(iii) For any x £ C and any u £ X we have that (dc)'(x,u)
= d(u,C{C,x)),
{±d2c)'(x,u)
= \ [d(u,C{C,x))]2
. (3.66)
Proof, (i) Since dc = \\-\\Oic and \d% = {\ ||-|| 2 )Dtc, the given formulas follow immediately from Theorem 2.3.1(ix), Corollary 2.4.16 and Theorem 3.7.2(h) for tp(t) = t. (ii) If x £ X and x £ Pc(x) then dc(x) = \\x — x\\ + tc(x) and {\d%) (x) = \ \\x - x | | 2 + LC{X). Using Corollary 2.4.7, Corollary 2.4.16 and Theorem 3.7.2(iii) for
= max{(u, a;*) | x* €
ddc(x)}
= max{(u, x*) | x* E Ux* n N(C, x)} = d(u, C(C, x)), the last equality being obtained applying the last part of the preceding theorem for C replaced by C(C, x). • In the following theorem we give several characterizations of best approximations in general normed spaces. Theorem 3.8.4 Let C C X be a nonempty convex set, XQ £ X \ C and x € C. The following statements are equivalent: (i)
x£Pc{x0);
(ii) 3a;* G 5x», Vx £ C : {x-xo,x*) (iii)
3I*6 5J.,
VieC
—
\\x-x0\\and(x-x,x*)>0;
: (x - x0,x*) > \\x - x0\\;
(iv) 3x* 6 P , V i £ C : (x - x0,x*) = \\x*\\2 = \\x - x0\\2 and {x — x,x*) > 0, or, equivalently, $x(xo —x)f\ N(C;x) ^ 0; (v) 3x* E X*, Vx € C : \\x*\\ = \\x-x0\\ and (x-x0,x*) >\\x-x0\\2; (vi) (ifX is smooth) Vx € C : (x-x, ^x(x-x0)) > 0, or, equivalently, $x(xo - x) € N(C;x) (identifying $ x ( ^ ) with its unique element); (vii)
(if C is a cone) 3x* £ Sx* n —C+ : (xo,x*) = \\x - XQ\\.
Proof, (i) =*• (ii) If x £ Pc(x0) then 0 £ dfi(x) + diC{x), where fx is defined in Eq. (3.63). The conclusion is immediate if we take into account the expression of the subdifferential of the norm.
Applications
to the best approximation
problem
241
(ii) => (iii) With x* from (ii) we have that (x - x0, -x*) = (x-x,
-x*) - (x-
x0,x*) = (x-x,
-x*) - \\x — ar0|| > 0
for every x £ C. The element —x* satisfies the desired condition. (iii) =>• (i) With x* from (iii) we have that VieC i.e. x £
: ||a; — xo|| > (x —
XQ,X*)
> \\x — Xo\\,
PC{XQ).
The implications (i) => (iv) =>• (v)=*>(i) are proved exactly as above, but using this time the function f2 defined in Eq. (3.63). Condition (vi) is identical to condition (iv) when X is smooth (d/ 2 (x) = $x(x); if X is smooth $x(x) is a singleton, whose element is denoted by §x(x), too). Suppose now that C is a cone. (iii) =>• (vii) With x* from (iii) we have that \/t>0,
Vx£C
: {tx,x*)>(x0,x*)
+
\\x-x0\\.
Therefore (x,x*) > 0 for every x e C, i.e. x* € C+, and (x0, -x*) > \\x -
XQ\\
> (x0 - x, -x*) > (x0,
-x*),
whence —x* is the desired element. (vii) =>• (iii) With a;* from (vii) we have that Vx£C
: (x — x0, —x*) > (xo,x*) = \\x — x0\\,
which shows that — x* satisfies the condition of (iii). The following result is sometimes useful.
•
Corollary 3.8.5 Let C C X be a nonempty convex set and x £ C. Then x e Pc(x + tu) for all u e $^ x (N(C; x)) and t£R+. Proof. The conclusion is obvious for u = 0 or t = 0. Assume that u ^ 0 and t > 0. Then x + tu £ C; otherwise, taking u* £ $x{u) H N(C,x), we obtain the contradiction t \\u\\ = (x + tu-x, u*) < 0. Since $ x ((x + tu) ~x)n N(C,x) - *($x(w) n N(C,x)) ^ 0, the conclusion follows from assertion (iv) of the preceding theorem. • We end this section with the following useful result.
242
Some results and applications of convex analysis in normed spaces
Proposition 3.8.6 Assume that X is reflexive and strictly convex and let C C X be a nonempty closed convex set. Then Pc(x) is a singleton for every x £ X and we identify it with its element. Moreover: (i) Pc is norm-weak continuous. (ii) / / X has the Kadec-Klee property then Pc is norm-norm ous.
continu-
(iii) If X is a Hilbert space then Pc has Lipschitz constant 1. Proof. We already observed that Pc is single-valued in our case. (i) Let (a;„)neN C X converge to x £ X. Because Pc is bounded on bounded sets, the sequence (Pc(xn)) is bounded. The space X being reflexive, there exists a subsequence (Pc(xnk))ken converging weakly to x £ C (C being closed). So, \\xnk — Pc{xnk)\\ < \\xnk — c\\ for every c £ C. Taking the liminf inferior for k —> oo, we obtain that ||a; — 5:|| < ||a; — c\\ for every c £ C, and s o i = Pc(x). Pc{x) being the unique weak limit point of (Pc(xn)), it follows that (Pc(xn)) —> Pc{%) weakly. Therefore Pc is norm-weak continuous. (ii) The conclusion is obvious from (i) and the definition of the KadecKlee property. (iii) Assume that X is a Hilbert space with scalar product (• | •); we identify X* with X by the Riesz theorem, and so $x(x) = x for every x £ X. From assertion (vi) of the preceding theorem we have that (x - Pc(x) | Pc(x) - c) > 0 for all c € C and x £ X. Taking x,y £ X we have (a; - Pc{x) \ Pc(x) - Pc(y)) > 0 and {y - Pc(y) \ Pc(y) - Pc(x)) > 0. Adding these two relations side by side we get [x-y-
Pc(x) + Pc(y) | Pc(x) - Pc(v)) > 0,
whence
\\Pc(x) - Pc(y)\\2 <(x-y\ Hence ||P c (a;) - Pc(y)\\
Pc(x) - Pc(y)) < \\x - y\\-\\Pc(x) - Pc{y)\\ •
< \\x - y\\ for all x,y £ X.
D
Remark 3.8.2 The proof of assertion (i) of Proposition 3.8.6 shows that Pc is norm-weak continuous for every weakly closed Chebyshev subset of a reflexive Banach space.
Characterizations
3.9
of convexity in terms of
243
smoothness
Characterizations of Convexity in Terms of Smoothness
We begin this section with the following interesting result. Theorem 3.9.1 Let X be a normed space, / : X ->• R be a proper function, x G X and x* G X*. Consider the function g : X —> R, g(x) := f(x) — (x,x*). Consider the following conditions: (i) / is Isc (resp. weakly Isc) at x, x* G dom/*, /* is Frechet (resp. Gateaux) differentiable atx* and V/*(af*) —x. (ii) g(x) = infg(X) and (xn) -> x (resp. (xn) —> x) whenever g(xn) —> g(x), i.e. (g,X) is Tikhonov (resp. weakly Tikhonov) well posed. (iii) x £ d o m / and f**(x) = f(x). Then (i) => (ii) => (iii). Furthermore, if f is convex then (i) & (ii). Proof, (ii) =£- (iii) Since g is proper and g(x) — inf g(X), we have that x 6 domp = d o m / . Moreover [Vx € X : g(x) > g{x)] &x*e
df{x)
O x* e dom/* and f(x) = (x,x*) -
f*(x*)
=» f@) < /**(*)• Since the other inequality is always true we obtain that f**(x) = f(x). Note that in cases (i) and (ii) x* € dom/* and x £ df*(x*). Replacing eventually the function / by /i:X-»l,
h(x) := f(x + x) + f*(x*)
- (x + x,x*)
(hence h*(x*) = f*(x* + x*) — f*(x*) — (x,x*)), we may assume that x = 0, x* = 0, /*(0) = 0 and 0 G 9/*(0) (what we do in the sequel). In this situation g = f. We get immediately that Vz G X : f{x) > f**(x)
> /**(0) = 0.
(i) => (ii) To begin with, consider that / is Isc at x — 0 and /* is Frechet differentiable at x* = 0. Since V/*(0) = 0, Ve > 0, 36(e) > 0,Vx* G X*, \\x*\\ < 6(e) : f*(x*) < e\\x*\\. Consider the functions 7, V : E + - > I + ,
f{t) := inf{e | e > 0, 6(e) > t},
ip(t) := t-y(t).
244
Some results and applications of convex analysis in normed spaces
It is obvious that 7 G iV0 Pi fio, and so
: /*(**) <¥>(||s'||).
Prom this relation and Lemma 3.3.l(v) we obtain that VxeX
: /"(a:)>(¥>o||.||na:)=¥,#(||a;||).
By Lemma 3.3.1(iii) we obtain that
>t-
f(xn).
Taking the limit for n —> 00 (n G P ) , we obtain that f*(tx*) > t for every t > 0, and so 0 = V/*(0)(a;*) > 1. This contradiction proves that (a;n) -^> 0. Since / is u;-lsc at 0 we obtain that 0 < /(0) < liminf f(x„) = 0. Therefore f(x) > /(0) for every x G X and (/(*„)) -> /(0) =>• (z„) - A 0. (ii) =>• (i) if / is convex. The lower semi-continuity (weak and strong) of / at x = 0 is obvious since /(0) = inf/ (= - / * ( 0 ) = 0). To begin with, suppose that for every sequence (x„) C X, (f(xn)) —> 0 =*> (x n ) -> 0. Consider the function <^ : R+ -> E + ,
y>(t) := inf{/(z) | a; G X, ||a;|| = *}.
It is obvious that y(0) = 0; moreover ip{i) > 0 for t > 0 (indeed, if y>(i) = 0, then there exists (xn) C X such that (/(z„)) -> 0 and ||a;„|| = t for every n; by hypothesis we have that (xn) -4 0, and so t = 0). Furthermore ip £ Ni and so
Characterizations of convexity in terms of smoothness
245
Then ip(r) < f(jx) < jf(x), which proves that P(T)/T <
V n e N : H^ll - Hx*il > 1 n
n
{XniX*)
> 1 (a:,,,^)- ^ > f(Xn) > 0. (3.67) n
n
It follows that (x n ,x*) > fi for every n 6 N, which shows that (xn) has no subsequences weakly converging to 0. Suppose that (xn) contains a bounded subsequence {xnk). From relation (3.67) we have that 0 < f{xnk) < \\xnh\\ • IN*||/"* -> 0, whence, from our hypothesis we get the contradiction xnk —> 0. Therefore an := ||a;n|| —> oo. Suppose that limsup n _ +00 an/n < oo. Let tn := y/a^, (—> oo) and yn := ^-xn; of course, (llVnll) -»• oo. Then 0 < f(yn) < £ / ( x „ ) < %h\\x*\\ ->" 0. Therefore (fiVn)) -> 0, and so (yn) -^> 0; this contradicts the fact that (||2/n||) -> oo. Therefore lim s u p j j ^ ^ an/n — oo. Let (nk) C N be an increasing sequence such that (ank/nk) -> oo. Let us take tk := ankl^/nk (-> oo) and t/fc := ^-x nfc ; of course (\\yk\\) -> oo. Then
o < /(»*) < ^/(* n j < ^ - i m = -U|z*|| -" °tk
tkUk
JUk
Therefore (f(yk)) -»• 0, whence (yk) —>• 0, contradicting the fact that (yk) is not bounded. What we showed above proves that f*'+(0;x*) < 0 for every x* £ X*. Therefore, 0 G (dom/*) 8 . Since /*+(0, •) is sublinear (and finite in this case), we have f*'+(0;x*) = 0 for every x* £ X*. It follows that /* is Gateaux differentiable at 0 and V/*(0) = 0 . D Theorem 3.9.2 Assume that X is a Banach space and f : X —>• R is a proper function with dom df* nonempty and open, the sub differential
246
Some results and applications of convex analysis in normed spaces
being taken for the duality (X*, X**). Then f is convex provided one of the following conditions holds: (i) / is Isc and f* is Frechet differ•entiable at any x* € dom df*, (ii) X is weakly sequentially complete, f is weakly Isc and /* is Gateaux differentiable at any x* £ dom df*. Proof. Let g := /** (for the duality (X, X*)). Because g* = f* is proper we have that g € T(X) and g < f. Using Corollaries 3.3.4 and 3.3.5 for g, in both cases we have that Vf*(x*) 6 X for any x* € dom df*. Let D := {V/*(a;*) | x* G dom df*} C X. Using the preceding theorem, in both cases we have that D C d o m / and f(x) = g(x) for every x £ D. Let x 6 domg. Using the Br0ndsted-Rockafellar theorem, there exists (xn) C dom<9<7 such that (xn) —> x and (g(xn)) -> g(x). Since Imdg C dome?/*, it follows that (xn) C D. Thus f{x) < liminf f(xn) = liminfg(x n ) = g(x), which, together with g < f, implies that f(x) — g(x). As for x ^ domg we have that f(x) > g(x) = oo, we obtain that f(x) = g(x) for all x 6 X, and so / is convex. • Remark 3.9.1 (a) The condition "dom df* is nonempty and open" in the preceding theorem can be replaced by "dom df* = (domdf*)1 ^ 0"; (b) under the conditions of the preceding theorem / is strictly convex on int(dom/). Indeed, when dom df* = (domdf*)1 ^ 0, we have 0 / (dom 3/*)* C (dom/*) 1 = int(dom/*) and /* is continuous on int(dom/*) (see Theorem 2.2.20). Hence, by Theorem 2.4.9, int(dom/*) C dom df* (for the duality (X*,X**)). It follows that dom df* is open. The second statement follows from Exercise 2.32. In the next result the convexity of a closed subset of a Hilbert space is characterized by the differentiability of the square of the distance function. Theorem 3.9.3 Let (X, (• | •)) be a Hilbert space and 0 ^ C C X be a closed set. Then C is convex if, and only if, d^~, is Frechet differentiable. Proof. We identify X* with X by the Riesz theorem. Consider / = i||-|| 2 + ic- Then f*(y) = sup {(x | y) - \ \\xf \xZC}
= W\y\?-\dUy)
= \ \\y\\2 - \ inf { \\y - x\\2
\x£C)
Characterizations
of convexity in terms of
smoothness
247
for every y € X. Therefore
4 = ll-H 2 -2/',
(3-68)
and so dom f*=X and /* is continuous on X. In particular dome*/* = X is open. From Eq. (3.68) we obtain that /* is Frechet differentiable if d2c is so. Assume that d^ is Frechet differentiable; then / * is also Frechet differentiable. Because / is lower semicontinuous (the set C being closed), assertion (i) of the preceding theorem shows that / is convex, and so C = dom / is a convex set. Assume now that C is a convex set. By Theorems 2.8.7 (iii) and 3.7.2 (i) we have that f* = \ \\-\\ OIQ and the convolution is exact. Since \ ||-|| is Frechet differentiable and dom df* — X, by Corollary 2.4.8 we have that / * is Frechet differentiable. D Remark 3.9.2 Formula (3.68) shows that d?c is the difference of two (finite and continuous) convex functions for every nonempty subset C of a Hilbert space; in particular dc (and also dc) has directional derivatives at any point. The next result refers to Chebyshev subsets of Hilbert spaces. Theorem 3.9.4 Let (X, (• | •)) be a Hilbert space and 0 ^ C C X be a weakly closed set. Then the following four conditions are equivalent: (i) C is convex; (ii) C is a Chebyshev set; (iii) dc is Gateaux differentiable; (iv) dc is Frechet differentiable. Proof. The implication (i) => (ii) follows from Theorem 3.8.1, the equivalence of (i) and (iv) follows from the preceding theorem, while (iv) =s> (iii) is immediate. As in the proof of the preceding theorem, consider / = 11|-|| + ic and identify X* with X. Because C is weakly lower semicontinuous, / is weakly lsc, too. Assuming that (iii) holds, from Eq. (3.68) we have that / * is Gateaux differentiable on X* = X. Taking into account that X is weakly sequentially complete, by Theorem 3.9.2 (ii) we obtain that C is convex. Hence (iii) =* (i).
248
Some results and applications of convex analysis in normed spaces
(ii) =• (iii) Assume that C is a Chebyshev set and denote by Pc(x) the best approximation of x G X by elements of C. We shall show that VxeX
: df*(x) = {Pc(x)},
(3.69)
and so, by the continuity and convexity of /*, we obtain that / * is Gateaux differentiable, or, equivalently, d2-, is Gateaux differentiable. Indeed, let x € X. Then, by Eq. (3.68), / * (*) = \ \\x\\2 -\\\x=
Pc(x)\\2
= (x | Pc(x))
- \
\\Pc{x)f
(x\Pc(x))-f(Pc(x)),
and so, by Theorem 2.4.2 (iii), Pc{x) € df*(x) for every x € X. Let now x e Xbe fixed and y 6 df*(x). Take xn := x+n"1 (y — Pc(x)); of course, (xn) —> x. As observed in Remark 3.8.2, Pc is norm-weak continuous, and so (Pc(xn)) —> Pc(x). Because df* is monotone, we have that 0 < (xn - x | Pc{xn)
- y) = n'1 (y - Pc(x) | Pc(xn)
- y),
whence (y - Pc(x) | Pc(xn) — y) > 0 for every n 6 N. Taking the limit for n -» oo, we get - ||y - P c (a;) || > 0, i.e. y = Pc(x), and so Eq. (3.69) holds for every x E X. O
3.10
Weak Sharp Minima, Well-behaved Functions and Global Error Bounds for Convex Inequalities
As in the other sections of this chapter, (X, || • ||) is a normed vector space. Let / 6 A(X) and 5 C X be a nonempty set. One says that 5 is a set of weak sharp minima if there exists a > 0 such that f(x)>f(x)+a-ds(x),
VxeX,
Vie5;
(3.70)
this notion generalizes that of sharp minimum point, obtained when S — {x}. It is clear that S C argmin / C cl S if S is a set of weak sharp minima for / . Hence S = argmin/ when S is closed. This is a reason for taking in the sequel S = argmin/. In this case Eq. (3.70) becomes f(x)>udf
+ a-ds(x)
VxeX,
Weak sharp minima, well-behaved functions and global error bounds
249
i.e. f is ^-conditioned, with ij)a{t) = at for t > 0. Applying Theorem 3.4.3 we obtain a part of the next result, where, as usual, d(x, 0) := oo. Theorem 3.10.1 Let f G T(X) be such that S := argmin/ ^ 0 and a > 0. Consider the following statements:
a:
V x G X : f(x)>mif
+
(ii
Vx*eaUx.
= r(0)
(iii (iv
V x G S , V(x,z*) G g r d f :
: f*(x*)
d(0,df{X\S))
(v Vx* 6 aBx* (X,X*)]; (vi) Vx £X, (vii) VxeS,
a-ds(x); + L*s(x*); (x-x,x*)>a-ds(x);
>a; '• df*(x*) C 5 [df*(x*) being taken for the duality
Vx ePs(x)
••
f'(x,x-x)>a-ds(x);
1
VM€$^ (A^(5,X))
: / ' ( z , u ) > a ||u||;
(viii) V I £ 5, Vu 6 $ y ( ^ ( S , ! ) ) n c o n e ( d o m / - x ) : f'(x,u) (ix) V x G S , V u e l
> a \\u\\;
: /'(z,u)>a-d(u,C(S,z));
(x) Vx e 5, aUx* n iV(S,x) C 0/(x). TTien (i) <£> (ii) =» (iii) =» (iv) O (v), (i) =4> (vi) =• (vii) <3> (viii) and (vi) => (ix) =>• (x). / / X is a Banach space then (v) =£• (i). Moreover, if X is a reflexive Banach space then (vii) =>• (i) and (x) =>• (i); hence all ten conditions are equivalent in this case. Proof. The implications (i) =£- (iii) => (iv) follow immediately from the implications (i) =>• (v) =>• (vi) of Theorem 3.4.3 (taking V1 := V"a defined above), while (iv) £> (v) is obvious. The equivalence (vii) •£> (viii) is also obvious because dom/'(5;, •) = cone(dom/ — x). (i) => (ii) Taking if) = ipa, condition (iii) of Theorem 3.4.3 is satisfied; hence /* (0) e K and A * ' ) < r(0)+i.s(*')+1>*{\\*'\\)
= r(0)+ts{xl+iaUx.
(**) Vz* G X*.
Because for a G S we have that (a,a;*) < f(a) + f*{x*) = - / * ( 0 ) + f(x*), we obtain that /*(0) + t,*s(x*) < f(x*) for every x* G X*. Therefore the conclusion holds. (ii) =$• (i) From the hypothesis, taking again ip = ipa, we have that Wx*eX*
: f*(x*)
+ L*s(x*)+rP#(\\x*\\).
Using the implication (iv) => (iii) of Theorem 3.4.3, the conclusion holds.
250
Some results and applications of convex analysis in normed spaces
(i) => (vi) Take x G X and x G Ps(x). Then for every t G ]0,1[ we have that x G f s ( ( l - t)x + tx) (see Corollary 3.8.5), and so f (x + t(x — x)) - f{x) > ads ((1 - t)x + tx) = at \\x - x\\ =
atds{x).
Dividing by t and letting M O w e get the conclusion. (vi) =* (vii) Let x G S and u G $ ^ ( ^ ( 5 , 1 ) ) . By Corollary 3.8.5 we have that x G Ps(aT + u). From (vi) we get the conclusion. (vi) => (ix) Let x e S and u € X. From (vi) and Proposition 3.8.3(iii) we have that f'(x,u)
> a • ds(x + u) >a{ds(x)
+ (ds)'(x,u))
>a-d(u,C(S,x)).
(ix) =^ (x) Let x £ S. From (ix) and Proposition 3.8.3 (iii) we have that f'(x,u) > a(ds)'(x,u) for every u e X, and so atlx* n N(S,x)
= a • dds(x) = a • {ds)'(x, -)(0) C f'(x, -)(0) =
df(x).
(v) => (i) when X is a Banach space. Suppose that (i) does not hold. Then there exists x 6 X such that f(x) < inf / + a • ds(x). It follows that x $. S. Therefore there exists a' € ]0, a[ such that f(x) < inf / + a' • ds(x). We fix e s ] a ' , a [ . Using Ekeland's variational principle, we obtain the existence of u € X such that f(u) + e-\\u-
x\\ < f(x) and /(it) < f(x) +e-\\u-
x\\ Vx e X. (3.71)
The last relation is equivalent to df{u) D eUx* ^ 0- Hence there exists u* £ df(u) such that ||u*|| < e < a. From our hypothesis it follows that df*(u*) C 5, and so u 6 5. By the choice of a' and the first relation of (3.71) we have that s • ds(x) < e • \\u — x\\ < a' • ds(x). As ds(x) > 0, we obtain the contradiction e < a'. Assume now that X is a reflexive Banach space. (vii) =$> (i) Let x G dom / and take x € Ps(x) (such an element exists by Theorem 3.8.1). By Theorem 3.8.4(iv) $x(x -x) D N(S,x) ^ 0. It follows that x — x £ &X1 (N(S,x)), and so f'(x,x — x) > a \\x — x\\ = ads{x). The conclusion follows from the inequality f'(x,x — x)< f(x) — f(x). (x) =» (i) Let x G X \ S and take z G Ps(x). By Theorem 3.8.4 (iv), there exists x* G $x(a; — x)n N(S,x). Then u* := a\\x — aT||_1(a; — x) G
Weak sharp minima, well-behaved functions and global error bounds
aUx' n N(S,x),
whence u* G df(x).
251
It follows that
f(x) = f(x) — f(x) > (x — x,u*) = a\\x — x\\ = a • ds{x). The proof is complete.
•
Using the preceding theorem we get the following estimate for the best coefficient in Eq. (3.70). Corollary 3.10.2 Let X be a Banach space and f 6 T(X) such that S := argmin/ ^ 0. Then
Moreover, if X is reflexive, then
inf xex\s
m_ZMi=M(fr d(x,S)
I V
x—x
= inf {f'(x,u)\xeS, — inf
x E dom f\S,
x 6 Ps(x)
x — x\\ '
f'(x,u) d(u,C(S,x))
ueSxn
Qx1 (N(S,
x))}
xes, uex\C{S,x)
Proof. It is obvious that anyone of the numbers appearing in the statement of the corollary is nonnegative. By the equivalences (i) •£> (vi) <4> (vii) o (ix) 4> (iv) of the preceding theorem, if one of them is positive then all of them are equal. The conclusion follows. • Remark 3.10.1
For every nonempty closed convex set A C X we have
d(u,C(A,x))>\\u\\,
yxtA^ue^iNiA^)).
(3.72)
Indeed, we have that A — argmindA and dA(x) > inf dA + 1 • dA(x) for every x € A. From the implication (i) => (vii) of the preceding theorem and Proposition 3.8.3(iii) one gets Eq. (3.72). As we shall see later the notion of weak sharp minima is closely related to that of global error bound for (convex) inequality systems. We introduce now another notion which is related to well-conditioning and weak sharp minima. As seen in Theorem 3.4.3, when argmin / is nonempty, / e T(X) is well-conditioned if and only if (d(0,df(xn))) —> 0 =>•
252
Some results and applications of convex analysis in normed spaces
(d(xn,S)) -> 0. So, it is natural to consider a class of functions with a similar property: T := {/ G T(X) | (d (0, df(xn)))
-+ 0 => ( / ( i n ) ) -»• inf / } .
(3.73)
In the next result we give a characterization of the elements of T. Proposition 3.10.3
Let / G r ( X ) . Then / G T «• F/(t) > 0, Vi > inf/,
where, for t £ i , f 7 (t) := inf {d(O,0/(a;)) | * G X, / ( x ) > t} = d{0,df(X\
Proof.
[/ < t))). (3.74)
It is obvious that
/ ^ » 3 ( i
n
) c l
: ( d ( 0 , 5 / ( x „ ) ) ) - > 0 , (/(!„)) ^
«*• 31 > inf/, 3(x n ) cX\[f
inf/
: (d(0,a/(x„))) -> 0
^ 3 f > i n f / : f/(t) = 0 . Hence the conclusion holds.
•
It is obvious that ff is nondecreasing. Moreover, for t < inf / we have that r~f(t) = d(Q,Imdf); in particular, by Br0ndsted-Rockafellar's theorem, F/(inf/) = d(0,dom/*) when X is a Banach space. For a more detailed discussion of the position of 0 and dom/* see Exercise 2,45. In order to give other expressions for r/ (t) we introduce several related quantities. First, we mention a simple property which will be used repetitively in this section. Lemma 3.10.4 Let f G T(X), t G R, and x, y G dom / such that f(y) < t < f(x). Then there exists z e]x,y[ such that f(z) = t. In particular \\y - z\\ < \\y - x\\ and \\x - z\\ < \\y - x\\. Proof. Of course, the mapping
Weak sharp minima,
and global error bounds
253
Proposition 3.10.5 Let f 6 T(X) and x £ d o m / with f(x) Consider the following numbers:
> inf/.
/(*) - fiv) 2 / € [ / < / ( * ) ] } , \\x -2/11
7i(x) : = s u p
<^
well-behaved functions
f / ( » ) -t-
>,},
— a;
73(a;) : = s u p | - / ' (x, yp— 74(2;)
= sup{-/'(a;,u) \ue
75(2:)
= d(o,a/(x)),
76 (2;)
= sup {-f'{x,
2/ € [/ < / ( * Sx},
-u)\ueSxn
N{[f < f{x)],x)}
.
Then 71(2;) = 72(2;) = 73(2;) = 74(2;) = 75(2;) s]0,00]. Moreover, if f is continuous at x and X is a Hilbert space, identified with X* by Riesz theorem, then 71 (2;) = 7 6 (a;). Proof. Let us denote, during the proof, the set [/ < f(x)] by S. It is obvious that 71(2;) > 0, 72(2;) > 0 and 75(2;) > 0 (because 5 ^ 0 ) . Since f'(x,y — x) < f(y) - f(x) < 0 for y G S, we also have that 73(2;) > 0, and so 74(2;) > 0 as, obviously, 74(2;) > 73(2:). Let first t < f(x) be such that [/ < t] ^ 0, and show that sup
fix) - fiv)
Si?) -1
\\x-y\\
d(x, [ / < * ] ) "
(3.75)
Consider (yn) C [/ < t] such that (||a; - yn\\) -> d(x, [f < t\). Then s u p fi*) ~ fiv) > /(*) - /(yn) ;. fix) -1 i/e[/<*] \\x~y\\ ~ \\x-yn\\ ~l|a;-2/nll
)
f{x) -1 d(x, [/<*])'
which proves the inequality > in Eq. (3.75). Conversely, let y € [/ < i\. Using Lemma 3.10.4 we get A € [0,1[ such that f(y') — t, where y' :— A2; + (1 - X)y. Of course, \\x - y'\\ = (1 - A) \\x - y || and t < Xf(x) + (1 - A)/(y), whence (1 - A) (f(x) - f{y)) < f(x) - t. It follows that / ( * ) _ - fiv) \\x-y\\
_ (1 - A) ifix) - f(y)) (l-X)\\x-y\\
fjx) - t -\\x-y'\\-
<
<
f{x) - t dix,[f
which proves the inequality < in Eq. (3.75). Hence Eq. (3.75) holds.
254
Some results and applications of convex analysis in normed spaces
Now, by Eq. (3.75), we have 71 (a;) = sup
sup —
t
\\x-y\\
r-^ = sup t
\
= 72 0*0
(Note that the values of t with [/ < t] = 0 influence neither 71 (z) nor 72 (z)-) Let y & S and take u := (y — x)/ \\y — x\\. Since /+(a:, y — x) < f(y) — f(x), we have that (f(x) — f(y)) / ||x - y\\ < —f'(x,u). It follows that 7i(x) < 73(0;). On the other hand, we have that x + tu E S for t E ]0, | | 2 / - aril -1 ], and so x f f(x) ~ f(x + tu) 1 u) = sup < ,, ; , rrr- i e l O . l l f f - a r i r ] } <-Yi(ar), I x - ( a : + tu) which implies that 73 (x) < 71 (x). Hence 71 (x) = 73(2:). It was observed above that 0 < 73(2;) < 74(1). Let u E Sx- Suppose first that there exists t > 0 such that y := x + tu E S; then —f'(x,u) = ~f'ixi |iyI^|[) < 73(2;). In the contrary case f(x + tu) > f(x) for every t > 0, and so f'(x,u) > 0, whence —f'(x,u) < 0 < 73(2;). Hence 73(2;) = 74(a)Assume that 7 £ 1 is such that 74(2;) < 7 (thus 7 > 0). It follows that —f'(x,u) < 7||u|| for all u E X, and so 0 < 7 ||u|| + f'(x,u) for u E X. Hence 0 is a minimum point for 7||-|| + f'(x,-), and so 0 £ 5 ( 7 INI + /'(*,-)) (0). But ,/, -fix,
d (7 INI + fix,
•)) (0) = yUx.
+ df'(x, -)(0) = 7 tf x - + 8f(x).
(3.76)
Therefore *yUx- D df(x) ^ 0, which implies that d(0,df(x)) < 7. Conversely, if d(0,df(x)) < 7 (hence 7 > 0 because 0 ^ df(x)), then 7C/X* n df(x) 7^ 0 (since the norm of X* is u;*-lsc and df(x) is iy*-closed), and so 0 E "fUx* + df(x). By Eq. (3.76) we obtain that 0 is a minimum point of 7 ll'll + f'(x> •)> which shows that 74(2;) < 7. Hence 74(2;) = 75(2;). Assume now that X is a Hilbert space (identified with its topological dual) and / is continuous at x; then, by Theorem 2.4.9 f'(x,u)=max{(u\x*)\x*
Edf{x)}
Vu£l,
(3.77)
and, by Corollary 2.9.5, K := N([f < f(x)],x) = M+df(x). It is clear that 7e(2;) < 75(2:). Let y E S and take u := (x — y)/ \\x - y\\; it was observed above that 7 :— —f'(x,-u) > 0. Then for x* E df(x) we have that
Weak sharp minima,
well-behaved functions
and global error bounds
255
(u\x*) = -(-u\x*) > -f'(x,-u) > 7. Take u = PK(u). By Theorem 3.8.4(vi) we have that (u - u\v - u) < 0 for every v G K. Replacing v by tv with t > 0 we get (u — u\v) < 0 = (u — u\u) for all v G K. In particular 7 < (u\x*) < (u\x*) for all x* G df(x), and so 5 ^ 0. Since P K is Lipschitz with Lipschitz constant 1 [see Proposition 3.8.6(iii)] and 0 = PK{0), we have that ||u|| < 1. Taking Mo := u/ \\u\\, we have that u0 G Sx r\N{[f < f(x)],x) and, by (3.77), f'(x,-u) < - 7 . Hence 7e(») > 7> whence je(x) > 73(a;). • With the aid of the numbers 71 (x)-76 (x) we introduce the functions r) :] inf/, oo[-> [0,oo],
r)(t) = inf {ji(x) \ x G [/ = t}} ,
i G 176.
Of course, if [/ = t] = 0 then r\ (t) = +00. From the preceding result we obtain immediately Corollary 3.10.6 Let f € T(X). Then r)(t) = r){t) = r^(i) = r)(t) = rhf(i) for every t > inf/. Moreover, if X is a Hilbert space identified with its dual and / is continuous at every point of [f = t] ^ 0 then r\ (t) = r5 (t). In the sequel we shall denote by 77 any of the functions r^-r^ defined above. From the preceding proposition we obtain that r/(t) = inf { r / ( « ) | s > t } ,
Vt>inf/,
(3.78)
where ff is defined by Eq. (3.74). Example 3.10.1 1) Any coercive function / G T(X) with X a reflexive Banach space is in T; see the solution of Exercise 3.15 for details. 2) Let / : E2 -> E be defined by f{x, y) := y/x2 + y2 - x. It is obvious that / is a sublinear functional with inf / = 0 and argmin/ = K+ x {0}. One has ff(t) = r/(t) = 0 for t > 0 (see the solution of Exercise 3.18 for details). Consider now the function // : [inf / , oo[->- [0,oo] defined by
As usual, we take inf 0 = +00. Hence //(inf/) = 0 if inf/ is not attained; if inf / is attained then argmin / is a set of weak sharp minima if and only if //(inf/) > 0. Even for t > inf / the number // is related to weak sharp minima. Indeed, for t > inf/, taking h : X ->• E with h(x) := m a x { / ( i ) -
256
Some results and applications of convex analysis in normed spaces
t, 0} we have that inf h = 0 and argmin/i = [ / < * ] • Then [/ < t] is a set of weak sharp minima for h if and only if lf(t) > 0. This remark suggests having formulas for lf{t) similar to those in Corollary 3.10.2. Proposition 3.10.7 Let X be a Banach space and f G T(X). Consider t € [inf/, oo[ with [/ < t] ^ 0. Then lf(t)=d{0,df(X\[f
(3.80)
Assume now that X is reflexive. Then lf(t) = inf | / ' (y, y ^ y )
xedomf\[f
P[f
(3.81)
= inf {/'(i/.u) | y G [/ = t], u G Sx D S* 1 (JV([/ < f],j/))}
(3.82)
= inf
.
d{u,C([f
ye [/ = *], « a \ c ( [ / < y
(3.83) Moreover, if either (i) i > inf / or (ii) iV([/ < £],&) = B+df(x) x £ [f = t], then It (t) = kf(t)
:= inf | / ' (y, ^ j
for every
y G [/ = t], u G * ^ (3/(2/)) J . (3.84)
Proof. Ii t — inf/ the conclusion is given by Corollary 3.10.2. Assume that t > inf / and denote by h(t), l2(t), h(t), and h(t) the quantities on the right-hand side of Eqs. (3.80)-(3.83), respectively. Consider h : X -> R with h(x) := max{/(i) - t, 0}; we have that inf h = 0 and argmin/i = [/ < £]. The equality of lf{t) and li(t) follows directly from Corollary 3.10.2 because U := X \ [f < t] is open and f\u = h\u +1, and so df(x) = dh(x) for every x e U. Since, obviously, //(£) = /^(O), we apply again Corollary 3.10.2 for h. Hence, in order to obtain Eqs. (3.81)-(3.83), it is sufficient to observe the following: a) Let x G d o m / \ [/ < t] and y G P[f
Weak sharp minima,
well-behaved functions
and global error bounds
257
c) Let y £ [/ = t] and u G X \C([f < t],y). Because u $. C([f < t],y), we have that y+su £ [f < t] for every s > 0. Hence f(y+su) = h(y+su)+t for all s > 0. If y + su £ d o m / for s > 0 then f'(y,u) — h'(y,u) = oo. Assume that y + sou € dom / for some so > 0; as above we obtain that f(y) = t and so, once again, f'{y,xy) = h'(y,xy). Let us prove now relation (3.84). The case (ii) is an immediate consequence of Eq. (3.82). So, let t > inf/. Because df{y) C N([f < t],y) for every y G [/ = t], we have that kf(t) > l3(t) — lf(t). Let now x G d o m / \ [/ < t] and take y € P[f
i||z — x\\2 subject to f(z) — t < 0,
and the Slater condition holds, we have that 0 € 3 ( | | | • — x\\2 + \(f — t)) for some A > 0, i.e. y is a minimum point of |||-— x|| 2 + A(/ — t). Hence $x(x — y) f)d(Xf)(y) ^ 0. Assuming that A = 0, we obtain that \\y — x\\ < \\z — x\\ for every z G dom / . Taking z := x we obtain the contradiction \\y — x|| < 0, i.e. y = x. Therefore A > 0. Then u :— \~1(x — y) e $ ^ {dfiv))- ^ follows that I fit) = l2(t) > kf(t). The proof is complete. • Note that the equality N([f < t],x) - R+df(x) for x G [/ = t] is not guaranteed for arbitrary / G T(X) when t > inf / (compare with Corollary 2.9.5). However it holds if / is continuous at x £ [f = t]. Relation (3.80) shows that // is nondecreasing on [inf/, oof when X is a Banach space. In fact If has this property in any normed vector space. Proposition 3.10.8
Let f € T(X) and inf/ < tx < t2 < oo. Then
J/(*0 < J/(*2)
and
lf{tx)
In particular If is nondecreasing on [inf / , oo[. Moreover, if X is a reflexive Banach space then rf{t) < lf(t) for every t €]inf/, oo[, and so rf is nondecreasing on ] inf/,oo[; hence rf = ff in this case. Proof. then
Let first x G d o m / and take t\,t2
€ K such that ti < t2 < f(x);
d(x, [f < h}) - d(x, [/ < t2])'
(3.85)
258
Some results and applications of convex analysis in normed
spaces
It is obvious that Eq. (3.85) holds if [/ < *i] = 0. Assume that [/ < t{\ f 0 and take x G [/ < *i]. Let A := (/(x) - t2)/{f(x) - h) G ]0,1[. Then / (Ax + (1 - A)x) < A/(x) + (1 - A)/(x) < A^ + (1 - A)/(x) = t2. Hence Ax + (1 - A)x G [/ < t2], and so d(x, [f < t2]) < A ||x - x||. Taking the infimum for x G [/ < ii] we get Eq. (3.85). The inequality Eq. (3.85) is obvious if x ^ dom / and [/ < t\] =^ 0. Using Eq. (3.85) we obtain that inf
. / y - \
x6dom/\[/<*i] d(x, [f < ii])
< <
inf
*&-*
x6dom/\[/
. . ml
fjx) ~ t2 —.—rr-,
x€dom/\[/
and inf , / ( ? - * < inf ^ - 4 l iedom/\[/
Weak sharp minima, well-behaved functions and global error bounds Since z G [/ < t] is arbitrary, we get rf(t) = rf(t) < lf(t).
259 •
Corollary 3.10.9 Let X be a reflexive Banach space, f € r ( X ) and t > inf/. / / / is Gateaux differentiable on [f = t] C int(dom/) then If{t) = rf(t) = inf{||V/(x)|| \xe[f
= t}}.
(3.86)
Proof. If [/ = t] — 0 then, obviously, r/(t) = //(£) = oo, and so Eq. (3.86) holds. Let [/ = t] be nonempty and take x0 G [/ < t] and a; G [/ = £]. Since x G int(dom / ) we obtain that }xo, x] C int(dom / ) by using Theorem 1.1.2(ii). As / is continuous on int(dom/), it follows that ]a;o,a;[c int[/ < t]. Because x e[f = t], x $ int[/ < t]\ hence, by Theorem 1.1.3 applied for x and [/ < t], we get u* G Sx* n N([f < t],x). Let u € ^ ^ ( u * ) . Taking into account relation (3.82) we get ||V/(a;)|| > (u,Vf(x)) = f'(x,u). It follows that r / ( t ) = r 5 ( t ) = i n f { | | V / ( a : ) | | | are [/ = *]}
> inf {f(x,u) \xe[f = t],ue Sxn^x1 (N([f < tlx))} = '/(*)• Using Proposition 3.10.8 we obtain that Eq. (3.86) holds.
•
One says that / £ T(X) is well behaved (asymptotically) or has good asymptotic behavior if lf(t) > 0 for every t > inf/. The next result shows that the class T of functions defined in Eq. (3.73) coincides with the class of functions with good asymptotic behavior when the space is reflexive. Theorem 3.10.10 Then f e?
Let X be a reflexive Banach space and f G T(X).
& rf{t) > 0 V i G ] i n f / , o o [ O lf(t) > 0 Vi G]inf / , oo[.
Proof. The conclusion is an immediate consequence of Proposition 3.10.8 and relation (3.78). • As mentioned before, the set argmin/ is a set of weak sharp minima of / (i.e. / is well-conditioned with linear rate) exactly when inf/ is attained and //(inf/) > 0. One can ask if there are other relations between wellconditioning and well-behaving of convex functions. Proposition 3.10.11 Let f G F(X). If f is well-conditioned then f has good asymptotic behavior. Moreover, limtj.inf f(t — inf / ) - 1 i / ( £ ) = oo.
260
Some results and applications of convex analysis in normed spaces
Proof. Let S := argmin/ = [/ < inf/] ^ 0. Of course, if d o m / = S it is nothing to prove (lf{t) = oo for all t > i n f / ) . We may suppose that inf/ = 0 and d o m / ^ S. Since / is well conditioned, by Proposition 3.4.1, the function ip := ipf € AQ f~l Ni; moreover, because d o m / ^ S, tp is finite at some to > 0, and so ip € CIQ. Therefore, by Lemma 3.3.1, ipe € flo n Ao- The choice of V> shows that f(x) > rp (d(x,S)) for every x e X. Let t > 0 = inf / and a; 6 d o m / \ [/ < *]. Taking a > 0, we have either d(x,5) > a, and so ^ g g ) > ^ g ^ ' g ^ > ^ (because i> e NJ, or d(x,S) < a, and so ,v x L > K Hence d(x,S)
f
d(x, S)
— a'
. ft 1>(a
> sup < mm —, I \a a
inf {a > 0 | ip(a) > t}
a > 0} > sup
a > 0, V(a) > t
t ipe(t)
Taking si = 0 and s 2 — t in Eq. (3.85), we have that f(x)-t ^ f{x) > d{x,[f
. * > ~ VeW
It follows that lf(t) > t/rpe(t) > 0 for every t > 0, and so / is wellbehaved. Furthermore, t~xlf{t) > l/ipe(t), and so limt4.o^-12/(£) = oo because ipe £ QQ. D It is obvious that any function / € T(R) has good asymptotic behavior, but, for example, / := exp is not well conditioned because argmin/ = 0. The next result furnishes a sufficient condition on the function If in order that / be well conditioned. Theorem 3.10.12 Let X be a Banach space and f 6 T{X) be such that inf/ > - o o . Let If be defined by Eq. (3.79). If'//(<) > 0 for every t > inf/, i.e. f has good asymptotic behavior, and there exists a > 0 such that 6(a) < oo, where 8{s):
_ rs+mt' '" /inf/
dt lf(t)
Vs>0,
then f is well-conditioned and ipf > 6e. Proof. We may assume that inf / = 0. If / is constant on dom / there is nothing to prove. Because // is nondecreasing and positive on ]0,oo[ (eventually taking the value oo) and 6(a) < oo, we have that 6(s) > 0
Weak sharp minima,
well-behaved functions
and global error bounds
261
for every s > 0; we set 0(0) := 0. It follows that 6 is continuous and nondecreasing on K+; 6 is even increasing on / ( d o m / ) . It follows that 6 G A) f~l iVo fl fio; and so 8e G Ao (by Lemma 3.3.1). Let xo G d o m / with f(xo) > 0 and p € ]1, oo[. Consider the sequence (s„) C P, sn :— p~nf(x0) for every n G NU{0}. Then the series J2n>i *1~(g~)n ls convergent. Indeed, If being nondecreasing (see Proposition 3.10.8), //(s n ) < lf{t) < Z/(s n _i), and so l/Z/(s„_i) < l/lf{t) < l///(s„) for s n < £ < s„_i and n G N. Hence Sn—1 ~ $n ^ -T—; r- <
I /
at Sn—1 T ^ T < ~T-?
Sn $n $n+l N— = P - T - ? \—
w RT VnGN.
//(Sn-l) 7 Sn //(*) lf(sn) lf(sn) 1-5 Therefore the series ^ n > 1 ^"r ^ is convergent and its sum is smaller than r«o Jo
dt i,(t)-
The relations above show that ^-\
2^n>l
S n _ i — Sn _
lf(sn)
/v-^
S n - 1 — S n _ Sp —
~P\^n>l
lf(sn-i)
Si\
If (S0) J
dt _ so — si ) < oo. _^1_ ^L_1L
lj{t)~
(3.87)
lf(sQ)
Let s > 0 be fixed. Since so = /(^o) > si > inf / , from the definition of //, there exists x[ G [f < si] such that ||x 0 — a^H < (s0 — si)/Z/(si) + 2 _ 1 e. By Lemma 3.10.4 there exists xi G [zo^i] with /(a;i) = Si; of course, ll^o — xi\\ < (so — s i ) / ' / ( s i ) + 2 _ 1 e. Continuing in this way we find a sequence (a;n)n>o C d o m / such that f(xn)
= Sn,
\\xn - Xn+1\\
< -y^ - r - + —-r l s f\ n+l) % +
V 71 > 0.
(3.88)
Prom Eq. (3.87) we obtain that the series ^2n>0 \\xn — xn+i\\ is convergent, and so (xn) is a Cauchy sequence. As X is a Banach space, the sequence (xn) converges to some x G X. It follows that 0 = inf / < f(x) < liminf f(xn) < lims„ = 0. Hence x G 5 := argmin/, and so S ^ 0. Using Eq. (3.88) we obtain that Si—\
S{
11*0 - xn\\ < ^i=i Wx^ - Xi\\ < £ . = 1 [-ijr^1 ^ H S i - i - S i l l S ^ ^
whence, by Eq. (3.87),
.
( r° dt
so-Sl\
c
+ 2i
r° dt dt + e.
262
Some results and applications of convex analysis in normed spaces
As e > 0 and p > 1 are arbitrary, we obtain that d(xo,S) < \\XQ — x\\ < 6(s0) = 0 (f(x0)). It follows that f(x0) > 6e {d{x0,S)). As x0 G d o m / \ S was arbitrary and 6e G AQ, f is well conditioned. • Consider now the system of inequalities 9i(x)<0, where gi,...,gm
fori = l,...m,
(3.89)
£ A(X). We assume that the solution set C := {x G X | gt(x) < 0, Vi € T7m}
of the system (3.89) is nonempty. We are interested in the following global error bound for (the solutions set) C: dc{x) < a • max [gi(x)]+
Vx G X,
(3.90)
l
for some a > 0 (which, of course, depends on the functions gi), where 7+ := 7 V 0 for 7 € E. Of course, g := maxj^j | i € l , m } € A(X), the system (3.89) is equivalent to g{x) < 0,
x e X,
(3.91)
and C = [g < 0]; Eq. (3.90) becomes dc(z) < a • [g(x)]+ V x G X .
(3.92)
Taking /i := g+ := g V 0, we have that inf h — 0 (under our assumption that C / 0) and C = argmin/i. Hence the global error bound for C holds if and only if C is a set of weak sharp minima for h. On the other hand Eq. (3.92) is equivalent to lg(0) > a - 1 , and so the global error bound for C holds if and only if [g < 0] ^ 0 and / 5 (0) > 0. In the next result we collect several sufficient conditions for the existence of a > 0 verifying Eq. (3.92). T h e o r e m 3.10.13 Let g G T(X) be such that C := [g < 0] ^ 0. Consider the following conditions: (i)
there exists u G X such that goo(u) < 0;
(ii)
there exists n > 0 such that sup x€ r
(hi)
there exists n > 0 such that sup xe r fl=0 i d(x, [g < —rj\) < 00;
<0 i
d(x, [g < —n]) < 00;
Weak sharp minima,
well-behaved functions
and global error bounds
263
(iv) inf
sup
inf
">°x&[g=o]y£[9<-n]
J t M = inf -g(y)
(v) 0 > infff andrg(0) (vi)
sup ^ l i ^ ^ l
I>O I € [J=O]
< oo;
(3.93)
V
> 0;
Z 9 (0)>0.
Then (i) =4> (ii) =» (iii) <$• (iv) =^ (v) and (ii) => (vi). If g is not constant on domg and [g < 0] C int(domg) i/ien (iii) => (ii). If X is a reflexive Banach space, then (v) =4> (vi); moreover, if g is not constant, [g = 0] C int(domg) and g is Gateaux differentiate on [g = 0] then (vi) =>• (v) andlg(0) = rg(0). Proof. First note that the equality in Eq. (3.93) follows from Eq. (3.75) when [g = 0] ^ 0. When [g = 0] = 0, both sides of the equality are —oo. The implications (ii) =s> (iii) & (iv) are obvious. (i) => (ii) Let us take u € Sx such that 0. Consider a: € [p < 0]. Taking 7 := —r)/goo(u) > 0, we have that g{x + 7U) < g(x) + 75oo(«) < 75oo(u) = ~V, and so d(x, [5 < —n}) < \\x + 7U — a;|| = 7 < 00. (iii) =£• (ii) Assume that g is not constant on domg and [g < 0] C int(dom<7). We shall show that sup{d(x, [g < -rj\) I x € [g = 0]} = sup{d(x, [3 < - I J ] ) | a; € [3 < 0]} > 0 (3.94) for any n > 0. If [p < 0] = 0 then [p < 0] = [g = 0] and Eq. (3.94) is obvious. Assume that g(xo) < 0 and take x £ domg with g(x) ^ g(xo); we may assume that g{x0) < g{x). If g(x) > 0, by Lemma 3.10.4, there exists y e]x0,x] such that g(y) = 0. Assume that g(x) < 0. Since 0 < g(x) — g(xo) < t_1 (g{xo + t(x—xo))—g(xo)) for t > 1, there exists t > 1 such that p(xo+t(a;—xo)) > 0. The set A := {* > 1 | g(xo + t(x — XQ)) < 0} is a closed bounded interval; take t := max A. Since XQ +t(x — xo) G [g < 0] C int(domg), there exists t > t such that xo + t(x — x0) € dom g. It follows that g(x0 + t(x — x0)) > 0, and so necessarily xQ + t(x — xo) € [3 = 0]. Hence there exists y E [g = 0] such that x Z.\x§,y\. Let us prove Eq. (3.94) for n > 0. This relation is obvious if [g < —rj\ = 0. Assume that [g < —rj\ •£ 0 and take x £ [g < 0]\[g < —rj\. Consider
264
Some results and applications of convex analysis in normed
spaces
x
o £ [9 < — rj]- As above, there exist y £ [g — 0] and A e]0,1[ such that x — Xy + (1 - A)xo. Take now z 6 [g < -rj]; then Az + (1 - A)xo £ [g < -rj], and so d(x, [g < -rj]) < \\x - Az - (1 - X)x0\\ = A \\y - z\\ < \\y - z\\, whence d(x, [g < -rj]) < d(y, [g < -r)]). It follows that sup x€ [ 9=0 ] d(x, [g < —rj]) > sup x£ [ 5<0 ] d(x, [g < —rj]). As the converse inequality is obvious, Eq. (3.94) holds. Hence (hi) =$> (ii) in our case. (iv) ^ (v) If 0 = inf g then C = [g = 0] ^ 0 and [g < -77] = 0 for any 77 > 0. Hence d(x, [g < —rj]) = 00 for x £ [g = 0] and 77 > 0, whence the contradiction inf,,>o sup a . € [ 9=0 i rj~1d(x, [g < —77]) = 00. Hence 0 > inf g. The conclusion then follows from the relation r s (0) = r|(0) in Corollary 3.10.6 and the obvious inequality • , d(x, [g < -77]) . d(x, [g < -rj]) 1 uy u^y 1111 inf usup — ^_> u sup inf — 2n / ~.\ * i>°xe{g=o] V xe[g=o]i>° V r g{0) " "
t
(ii) => (vi) Because sup a . 6 [ a<0 ] d(x, [g < —rj]) < 00, we have that [g < —77] 7^ 0. Consider 0 < /x < 77. Since [g < —/j] C [g < —rj], we obtain that su Pxg[0<-ji] d(x, [g < —rj]) < 00. From the implication (ii) =>• (v) with 0 replaced by — fi we obtain that rg(—fj,) > 0. Using Proposition 3.10.8 we obtain that / 9 (0) > rg(-fj.) > 0. Assume now that X is a reflexive Banach space. Then the implication (v) => (vi) is true because, by Proposition 3.10.8, lg(0) > rg(0). (vi) => (v) Assume that g is not constant, / s (0) > 0 and g is Gateaux differentiable on [g — 0] C int(dom #). It is sufficient to show that 0 > inf g; then the conclusion follows from Corollary 3.10.9. If [g = 0] = 0 then 0 > inf g (because C ^ 0). Assume that [g = 0] ± 0; if [g < 0] = 0 then every x € C is a minimum point of g, and so 0 S dg(x) — {Vg(x)}. Assuming that g is 0 on dom<7 then [g = 0] = doing = X (domg being closed and open), a contradiction. It follows that domg \ [g < 0] ^ 0, and so, by relation (3.81) in Proposition 3.10.7, / 5 (0) = 0. Hence [g < 0] ^ 0.D Coming back to the system (3.89), several conditions in the preceding theorem can be formulated in terms of the functions gi, i G l,m, when g := max i € j-^g { . Let us denote the set {j e l,m | gj(x) = g(x)} by I(x) for x € domg. Assume that gi G A.(X) is continuous on int(dom<7;) and 0 ¥• [9 < 0] C int(domgi) for every i e l,m. Then, by Corollary 2.8.13, we
Weak sharp minima,
well-behaved functions
and global error bounds
265
have that
dg(x) = co (U i 6 / ( B ) d9i^)
Va: e
b < 0],
and so, by Theorem 2.4.9, g'{x,u) = max max{(u,x*) \ x* 6 dgi(x)}
\/x G [g < 0], Vu G X.
iei(x)
Moreover, since epi^ = f)iei^epigi ¥" 0, the relation 500 = max.ieY-^(9i)oo holds. For example, condition (i) of the preceding theorem reduces to the existence of u € X such that (gi)oo(u) < 0 for every i £ 1, m. The condition [g < 0] 7^ 0 in (iv) says that the Slater condition holds for the system of inequalities (3.74), while the condition r s (0) > 0 may be described in several ways, taking into account that rg = r3g for 1 < j < 5. One of them (corresponding to rjj) is
0 d
^ (u, e[ , = o ] «>(U 6/{ , ) ^(*))).
another one (corresponding to rg) being inf
sup min
xe[g=0] u€Sx
«£/(x)
min
(u,x*) > 0.
x'edgi(x)
We can consider (3.91) as being an unconstrained inequality, in contrast with g(x) < 0,
xeA,
(3.95)
where g € T(X) and A C X is & nonempty closed convex set. This inequality is equivalent with the system g(x) < 0,
dA(x) < 0,
x£X.
(3.96)
Of course, the Slater condition does not hold for the system (3.96). We assume that the solution set C := {a; € A | g(x) < 0} of the system (3.95) is nonempty. We say that the global error bound holds for (3.95) if there exists a > 0 such that dc{x)
Vx£l.
(3.97)
Theorem 3.10.14 Let X be a reflexive Banach space, g € T(X) and A C X be a closed convex set such that C := {x £ A \ g(x) < 0} is
266
Some results and applications of convex analysis in normed spaces
nonempty. Assume that one of the conditions below holds: PA{x) n domg ^ 0 \/x£[g<
0],
(3.98)
[g < 0] n A C int(doms).
(3.99)
Then the following statements are equivalent: (i) there exists a > 0 such that Eq. (3.97) holds, (ii) 3 7 > 0 , V i e AD[g = 0], Vu 6 $^(iV(C,2;)) : max{<;'(a;,u),d(u,C(j4,a;))} > 7||u|| • Proof.
Consider /i := max{<7,oU}. By Proposition 3.10.7 we have that
lh(0)=mt{ti(x,u)
u£Sxn^1(N(C,x))}.
\x£C,
Let x £ C; we wish to evaluate h'(x, u) for u € X. Because domh'(x, •) = doing'(x, •) = cone(dom<7 — x), we take u £ cone(dom p — x). Let first x £ [g = 0] n A. Taking into account Proposition 3.8.3(iii), we have that 9ix h\x,u)=m^jlim v ' [no
+ tu) 9{x
- \lirndA{x t t\a
= m&x{g'(x,u),d(u,C(A,x))}
+
tu) dA{x)
t
-
\ J
.
Let now x £ [g < 0] D A. Since u £ cone(domg — x), there exists t' > 0 such that x' := x + t'u £ doing. As ![£,z'] is continuous, there exists t > 0 such that x + tu £ [g < 0] for t £ [0,1]. It follows that ,,. max{g(x+ h (x, u) = lim = d(u,C(i4,a;)).
tu),dA(x+
tu)\
dA(x = hm —
+tu) (3.100)
Let x £ [g < 0}nA and u £ S^-fl*^ 1 (_/V(C,a;))ncone(dom0-a;). Let us show in our conditions that h'(x,u) > 1. As seen above, there exists t > 0 such that x + tu £ [g < 0] for t £ [0,1]. By Corollary 3.8.5 we have that x + tu £ C and x £ Pc(x + tu) for t > 0. It follows that z + iu ^ A. Assume that x £ PA(x + tu). Suppose first that Eq. (3.98) holds. Then there exists x £ PA(x + tu) n domp. Of course, \\x + iu — x\\ < \\x + tu — x\\ — t. Since g\[xjX] ls continuous and g(x) < 0, there exists A G]0,1[ such that
Weak sharp minima, well-behaved functions
and global error bounds
267
g(Xx + (1 - X)x) < 0. Of course, x' := Xx + (1 — X)x 6 A, and so x' G C. The contradiction follows: t - d(x + tu,C) < \\x + tu-x'\\
< X\\x + tu-x\\
+ (l-X)\\x
+ tu-x\\
< t.
Suppose now that Eq. (3.98) holds. Because x fi PA(X + tu), there exists x E A such that ||a; + tu - x\\ < \\x + tu- x\\ = t. From Eq. (3.98) we have that x G int(dom #), and so there exists A G ]0,1[ such that Xx + (1 — X)x G dom g. Decreasing A if necessary, we may assume that g{Xx + (1 - X)x) < 0. Proceeding as above we get the same contradiction. Hence x G PA(X + tu); using Eqs. (3.100) and (3.72) we obtain that h'+(x,u) > 1. Concluding the above discussion we obtain that /x > /^(O) > min{l,/u}, where H := inf { maac(g'(x,u),d(u,C(a,x)))
\ x G [g = 0] fl A, u£Sxn$x1(N(C>x))}-
The conclusion follows.
•
In the next result we present two sufficient conditions in order that the global error bound holds for the constrained system (3.95). Proposition 3.10.15 Let X be a reflexive Banach space, g G T(X) and A be a closed convex subset of X such that C := [g < 0] fl A ^ 0. Assume that either C C int(domg), or (3.98) and A fl [g = 0] C int(dom<7) holds. Consider the following statements: (i) (ii)
there exists a > 0 such that Eq. (3.97) holds, 3 K > 0 , V Z G An[g
= Q], Vi* G N(A,x),
Vy* G dg(x)
11**11 + 1 < K | | Z * + ^ U .
:
(3.101)
(iii) the set dg {A PI [g = 0]) is bounded and
0 1
^ (U^ n[fl=0] (^(-)+^(^-)))-
Then (iii) =^ (ii) =^ (i). Proof. It is obvious that Eq. (3.98) or Eq. (3.99) holds, and so conditions (i) and (ii) of the preceding theorem are equivalent. It follows that everyone of the statements (i), (ii) and (iii) above is true if A n [g = 0] — 0, and so the conclusion holds in this case. Assume that AD[g = 0] ^ 0; in our conditions AC\ [g = 0] C int(domg).
268
Some results and applications of convex analysis in normed spaces
(hi) => (ii) By hypothesis, there exist 7,77 > 0 such that ||x* + y*\\ > 7 and \\y*\\
= N(A,x)+N([g
+ M+dg(x)
(3.102)
for every x £ A fl [g = 0]. Let x £ A n [g - 0] and u £ Sx n * ^ ( J V ( C , i ) ) . As observed in the proof of the preceding theorem, h'(x,u) — max.{g'(x, u),d(u,C(A,~x))}, where h := m a x f p , ^ } . Since 5 and d^ are continuous at x, h is so. It follows that h'(x,u) = max{(u,u*) | u* £ dh(x)}. From Corollary 2.8.13 and Proposition 3.8.3(h) we have that dh(x) = co {dg(x) U dA(x)) = co (dg(x) U (Ux* n N ( A , z ) ) ) . Take u* £ $ x ( u ) n N(C,x). From Eq. (3.102), there exist x* £ N(A,x), y* £ dg(x) and A > 0 such that u* = x* + Xy*. Since u* ^ 0, fi := ||a;*|| + A > 0. It follows that / i ~ V £ dh(x). On the other hand, by Eq. (3.101) we have that 1 = ||u*|| = \\x* + Xy*\\ > K _ V Hence max.{g'(x,u),d(u,C(A,x))} The proof is complete.
= h'(x,u) > ( u , ^ _ 1 u * ) = / J _ 1 > K _ 1 . •
During the proof of the preceding result we obtained formula (3.102) under the hypothesis that A D int[ < 0] 7^ 0 and g is continuous at x £ An [g = 0]. Another sufficient condition for it, which is a kind of local error bound for A n [g < 0], can be given.
Monotone multifunctions
269
Proposition 3.10.16 Let g £ A(X) and C := {x £ A \ g(x) < 0}. Assume that the system (3.95) is metrically regular at x £ An [g = 0], i.e. there exist 5,7 > 0 such that jdc{y)<max{g(y),dA(y)}
Vy£B(x,8).
(3.103)
Ifx £ (domgY, then Eq. (3.102) holds. Proof. The inclusions D are easy (and true always). Conversely, let x* £ N(C,x) C\ Sx>- By Proposition 3.8.3(H) we have that x* £ ddc{x), and so 72;* £ d(h + (•B(:C,5))(X) = dh(x), where h = max.{g,dA} (see Remark 2.4.1). By Corollary 2.8.11, dh
(*) = U Ae[0ll] K1 - A )^( a ; ) + A W ) ;
the equality d(A#)(a;) = \dg(x) holds also for A = 0 because a; £ (domg)\ Using again Proposition 3.8.3(h) we obtain that x* £ N(A,x) + M+dg(x). Hence (3.102) holds. • A particular case of Proposition 3.10.16 is when g — ds with B c X a nonempty closed convex set. Then Eq. (3.103) implies that N{A DB,x)
= N(A,x)
+
N(B,x).
(Compare with Corollary 2.8.4 for X = Y and A =
3.11
ldx.)
Monotone Multifunctions
Consider the subset SE '•= \ fJ- '• E —> [0, oo[ supp/x is finite, V^
fi(x) = 1 >
of E B , where £? is a nonempty set and supp/j := {x £ E \ fi(x) ^ 0}; if A C E is a nonempty set, then
270
Some results and applications
of convex analysis in normed
spaces
In the sequel we take E = X x X*. Consider also the functions P : tox*. -»• X,
p(/i) := £ ( ^ .
. ^ , * * ) " x,
) 6 M
q : ttxX- ->• X*, qM := Y,(x,x.)eXxXAx>x*) r : ftxx- - • E,
r(/x) := £ ( B i B . ) 6 J f
XJC .M*,**)
' x* > " <*'**>'
Note that p, q and r are affine functions, i.e. p (Xp-i + (1 — A ) ^ ) = <\p(/Ui) + (1 - A)p(/x2) for every /ii,/i 2 € ?xxJf, A e [0,1], and similarly for q,r. The following characterization of monotone subsets of X x X* holds. Lemma 3.11.1 and only if
Let M C X x X* be a nonempty set; M is monotone if
V>e
r(n)>{p(n),q(n)).
Proof. Suppose that M is monotone and let ji € <;M', then supp/x = {(xi,xl),..., (x n ,a;*)} for some n G N. Take a* := n(xi,x*) for 1 < i < n. Then:
Hi*) ~ (P(M).?(/*)) = ^2i=1ai(xi,xi)
Conversely, taking (x±,x\), CM, we have that
~ \J2i=1aiXi'J2
{x2,xl)
a x =1
J j)
E M and /J = \&(Xi,x\) + ^ ( x 2 , ^ ) €
KA») - (P(M),«0*)) = I ( ^ i ^ i ) + I (Z2,Z2> - ( l ^ i + \xi, \x\ + \x*2) = \{xi -x2,x\ -x2) > 0, and so M is monotone.
•
To M C X x X* we associate the function ™ XM:X^R,
, s XM(X)
(x,q(u)) — r(u) :=
sup
P6CM
v
'
y v w /
v w
l + lb(M)ll
'
Monotone
multifunctions
271
It is obvious that XM is a lsc convex function as supremum of a family of continuous affine functions. In the sequel domM denotes P r x ( M ) . Lemma 3.11.2 Let I / M C X x X* ie o monotone set and M = M - (w,w*), where (w,w*) G X x X* is fixed. Then: (i) d o m M C co(domM) C domxM(ii) domM = domM — w and domx^f = domxM — w. (hi) Let Xo C X be a closed linear subspace such that dom M C Xo and M+{Q}xX£ = M. Consider MQ := {{x,x*\Xo) | (x,x*) G M } C XQxX^. Then XM0 =
XM\X0,
domxMo = domxM C X0
and M is maximal monotone O Mo is maximal monotone (in XQ X XQ ). (iv) Suppose that M is maximal monotone and domM C x0+Xo, where Xo C X is a closed linear subspace and x0 G X. Then M + {0} x XQ = M and domxM C aff(domM). Proof, (i) Let (x,x*) € M and fj, G TM be fixed. For (y,y*) € M, from the monotonicity of M, we have that (y - x, y* - x*) = (j/, j/*) - (j/, a;*) - (a;, y*) + (x,
x*)>0.
Multiplying with (J,(y,y*) for (y,y*) G supp/x, then adding the corresponding relations, we obtain that r
(v) ~
(P(M),Z*>
- (x,q(lJ)) + (x,x*) > 0,
and so \(x,x*)\ + \\p(ji)\\ • \\x*\\ > (x,x*) - (p(fi),x*) > (x,q{n)) ~ r(fi). It follows that XM(X) < max{|(a;,a;*)|,||x*||} < oo. Therefore domM C domxM! since domxM is a convex set, we have co(domM) C domxM, too.
272
Some results and applications of convex analysis in normed spaces
(ii) It is obvious that dom M = dom M-w. We note that if ju € cjp taking n(x,x*) := Jl({x,x*)-(w,w*)) [<£> Jl(x,x*) = fJ,{(x,x*) + (w,w*))], then /J, £ <;M- SO, for ]2 e ^ and the corresponding /x € ?M, we have that p(u) = Y^
u(x,x*) -x = 5 ^
^
^{x,x*) • (x - w)
= P(M) - w, and similarly ?(£) = gr(/i)-w*,
r(Ji) = r ( » - (w,q(n)) - {p(iJ,),w*) + (w,w*).
Let a; € X and Jl G Cjg • From the above relations we have that (x - w, q(ji)) - r(p,) = (x-w,
q{n) - w*) - r(n) + (w, q(n)) +
= {x,q{^)) -r(n) <XM(x)-(l where /3 :— max.{\(x,w*)\,
+ {p(fi),w*}
+ \\p(jj)\\)+P(l
+ \\p(ji)\\),
\\w*\\}, and so
(x-w,q{Ji))-r{Jl)
^ r ^
~{x,w*)
(p(n),w*)-{w,w*)
1 + lbOOH .
n
,
n
/
/ x , flx
< (XM(X) + «- 1 + | b 0 i ) || < (i + IMIMx*(«) + fl •
Therefore domxM — w C domxjgr • Since M = M + (w, w*), it follows also the converse inclusion. (iii) It is obvious that M0 is a monotone subset of XQ X XQ. (It is well-known that XQ is identified with {z*|x 0 I x* 6 X*} endowed with the norm ||u*|| := inf{||a:'|| | x*\Xo = u*}.) Let a; 6 X \ XQ and fix (2:0,^0) £ -^- Since a; — xo ^ -^0, there exists u* 6 X^1 such that (x — x0,u*) > 0. Therefore, taking fj, = <5(Xo,x*+tu*); (a:,^ + £u*) - (X0,XQ + tu*) XM\X)
>
——r.
fj
(x - x0,x%) + t (x - x0, u*) =
——r
n
I + IFOII I + IWI for every t > 0, and so XM(X) = 00. It follows that dom%M C XQ. Let (i € CM; consider Ho : XQ x XX -4 E,
/z0(a:,u*) := V "
fi(x,x*).
It is clear that /io £ CM0 • Furthermore, we have that POo) = p(/x).
90o) = g(/i)Uo and r(/i 0 ) = r(fi).
Monotone multifunctions
273
It follows that for x G X0, (x,q(fj)) -r(fi)
1 + I|P(/0II
_ (x,q(fj,0)) - r(/j,0)
1 + IIP(W))||
'
Conversely, let /io G TM0 with supp^o = {(a;i,uj),...,(a; n ,u*)}, n G N. For every i G l , n we take (only) one a;* G -X"* such that x*\x0 = u*, and define /i G CM by fi{x,x*) = fi0(xi,u*) if (a;,a;*) = (a:*, a:?), /u(a;,a;*) = 0 otherwise. We note that to this fi corresponds, by the procedure described above, /i 0 - Therefore, XM0(X) = XM{X) for every x G X0, i.e. XM0 = XM\X0-
Suppose that M is maximal monotone and let (x,u*) G XQ X XQ be such that (y - x,v* —u*) > 0 for every (y,v*) G M0. Then there exists x* G X* such that x*\x0 = u*. It follows that (y - x,y* — x*) = {y — x,y*\x0 — u*) > 0 for every (y,y*) G M, and so (x,x*) G M; hence (x,u*) G M 0 . Suppose now that MQ is maximal monotone and let (x,x*) G X x X* be such that (y — x,y* — x*) > 0 for every (y,y*) G M. Fixing (2/o,2/o) € M > t h e n (yo,Vo + z*) G M for every z* G X ^ . Therefore (7/0 — x,yl + z* — x*) > 0 for every z* G XQ 1 , which implies that x G (XQ 1 ) = X0. We obtain so that (y - x,y*\x0 - x*\x0) > 0 for every (2/,2/*) € M, and so (a;,a;*|xo) G M). From the definition of Mo there exists z* G X* such that (a;,**) G M and 2*|x0 = x*\x0 (& z* - z* € XQ 1 ). From our hypothesis it follows that (a;, a;*) G M, and so M is maximal monotone. (iv) Suppose now that M is maximal monotone and d o m M C i o + ^ O i where XQ C X is a closed linear subspace and xo G X. Consider (a;, a;*) G M and u* G X ^ . Then V(J/,J/*)GM
: (2/ - a:,y* - (x* + u')> = (y - x,y*
- a:*) > 0,
because y — x G X0, and so (a;,a;*-|-w*) G M. Therefore M + {0} XXQ- = M. Fix xi G domM; consider M\\= M- (a;i,0) and Xi := lin(domMi) = aff(domM) — x\. Since M\ is maximal monotone, from what precedes it follows that Mi + {0} x X^ = Mi, while from (iii) it follows that dom XMX C X\. Using (ii) we obtain that domxM C x\ + Xi — aff(domM). D Consider the function h : ttxx- x X* -> K,
/i(/«,^):='-(/*)-(p(Ai),a:*>-
(3-104)
274
Some results and applications of convex analysis in normed spaces
The function h is affine with respect to fi and x*, respectively, and w*continuous with respect to x*. Lemma 3.11.3 Let X be a Banach space and Mi, M2 be nonempty monotone subsets of X x X*. Suppose that 0 G (domxMi - domxM 2 ) J •
(3.105)
Then there exists 7 > 0 such that V(/ii,/i 2 )
GCMJ X < T M 2 , 3 A G
[0,27], (xl,x*2) e l ' x r
: ||a:I||, | | ^ | | < 7,
2
+ h(n2,x*2) + A \\q(fn) + q(ji2)\\ > |(A + ||a:J + x*2\\2 ). (3.106)
h(to,xi) Proof. that
By Proposition 2.8.9, there exists n G (0,1] and v G [l,oo) such
r]U C {x G X I XMi(x) < v, Ikll < ^} - {a; £ I I XM2(x) < v, \\x\\ < v). (3.107) Let 7 := 5v2/r] and fix (/ii,/j 2 ) G ?Mi X ?M 2 - Consider first the case in which max{||g(//i)|| , | | Q ( ^ 2 ) | | } < 7- In this situation we take {x\,x2) — (9(^1)11(^2)) and A = ||<7(A*I) + g(/i2)||- From Lemma 3.11.1 we have that h(fii,x*) = r(fii) - (p(fJ-i),q{Hi)) >0iori — 1,2, and so Eq. (3.106) holds. Consider now the case max{||g(/ii)||, ||g(/i 2 )||} > 7- We may assume that 7 < ||g(/ii)||. There exists v £ X such that IMI=7?
and
(v,q(ni))
> 77? = 5v2.
(3.108)
From Eq. (3.107) there exist x,y G X such that XMl{x) < v, There exist also
XM2(y) < v,
||2/|| < v
and
x-y
= v.
0 G X* such that
lk*ll = */
and
(p{m),x*) = -v \\p(fii)\\, i = 1,2.
Since 7 > 1/ we have that ||a;*|| < 7 for i = 1,2; since XMi(x) < v and a; = w + y, we have that Kfii,xl)
=r(m)
- {p(fii),x{) = ri/ix) + v Wpifi^W > (x,q(ni)) - 1 /
= (v + y,q(m))
-v
= {v,q(fJ-i)) + (y,q(fii))
-v.
From Eq. (3.108) we obtain that h(m,xl)
> 5v2 + (y,q(m))
-v>Av2
+ (y,q(fn)).
(3.109)
Monotone multifunctions
Since
XM2{V)
275
< v,
h(H2,x*2) =r(n2)-{p{H2),X2)
=r{ti2)+v\\p(ii2)\\
> (y,q(fJ^))-v-
(3.110)
Prom Eqs. (3.109) and (3.110) we obtain h{m,xl)+h(n2,x;)
> 4v2 + (y,q{m) + q{n2))-v
> 3i/ 2 -i/||g(/ii) + g(/i 2 )||,
and so 2h(fi1,x*1) + 2h(n2,x*2) + 2v • ||<jf(/n) + q(jn)\\ - v2 - \\x*x + x*2\\2 > &v2 - v2 - \\x\ + x*2f > 5u2 - 4z/2 > 0. Hence Eq. (3.106) holds with A = v (< 27) in this case.
•
Theorem 3.11.4 Let X be a reflexive Banach space and M\, M2 be nonempty monotone subsets of X x X*. Suppose that condition (3.105) is satisfied. (i) Then there exist (a:*, £2) £ X* x X* and z £ X such that for every (2/1,2/*) € Mi, (2/2,2/2) € M 2 we have 2 (Vl - z,y{ - x\)+2 (2/2 - 2,2/2* - x*2) > ||*||2 + ||a:I + x*2\f+2 {z,x\ + x*2). (3.111) (ii) If moreover Mi and M2 are maximal monotone then there exist z £ X and X-t 1 X 2 € X* such that (z,x*) £ Mi, (z,x2) £ M 2 and INI2 + I K + Z2II2 + 2 (z, x\ + x*2) = 0.
(3.112)
In particular dom Mi n dom M2 / 0. Proof, (i) Consider 7 > 0 given by the preceding lemma, A := <;MX X ?M 2 , B := {(x\,xl,z) £ X* x X* x X | ||a;*||, ||i*|| < 7, ||,z|| < 27} and the function k : A x 5 -» E defined by M0*i>M2), (^,0:2,3)) := 2(/i(/x1,a;J) + / i ^ . i a ) ~ 11 I|2
11 *
,
(Z,Q(»I)
+ 9(^2)))
* M2
-11*11 -Ita + ^ll • It is obvious that the sets A, B are convex, and k is convex (even affine) with respect to (^i,/z 2 ) € A and concave with respect to (x{,x2,z) £ B. Furthermore, endowing X* xX* x X with the product topology w* xw* xw,
276
Some results and applications of convex analysis in normed spaces
B is compact and k is upper semicontinuous with respect to Using the minimax theorem (Theorem 2.10.2) we have that
(x\,x\,z).
inf max k = max inf k. A
B
B
A
Let (ni,fi2) 6 A. Using the preceding lemma, there exist A € [0,27] and x\,x2 € X* with ||:r*||, \\x2\\ < 7 such that 2h(n1,x*1) + 2h(n2,x*2) + 2\ • \\q(m) + q(fi2)\\ - A2 - \\xl + x*2\\2 > 0. Since X is reflexive, there exists z 6 X such that
INI = A, (z,q{ni) + q(fi2)) = -A • \\q(ni) + q(^)\\ • It follows that k((fii,fi2),(xl,x2,z)) > 0. Therefore inf^maxsA; > 0. Hence maxsinf^fc > 0, i.e. there exists (x\,x2,z) 6 B such that 2(h(jn,xl) + h(fi2,x*2) - (z,q(l*i) + 9(/*2)» > N l 2 + ll*i + for all (MI,A»2) obtain that
e
^f
-4- Taking ^ = £(„<,„•) with (yi,y*) € Mj, i = 1,2, we
2 ((2/1,2/!*) - <2/i,**) + (2/2,2/2*) - - (^,2/r +2/2)) >
\\z\\2+\\xl+x*2\\2
i.e. Eq. (3.111) holds. (ii) For (xl,x2,z) found at (i), from Eq. (3.111) we obtain that (i/i,»i)eMi
(v2,yJ)eM 2
> \ (ikll2 + ll*i + *2l|2 + 2 (*,** + z5)) . Since for a; £ X and a;* € X* the inequality ||x|| 2 +||x*|| 2 +2(x,x*) > ||x|| 2 + ||x*|| 2 -2||:r|H|z*|| = ( N | - | | z * l l ) 2 (3.113) holds, we obtain that /
^f „ ^
~ Z ' "l
_ X
l) + ,
ln f
,
(2/2 - 2, 2/2 - *3> > °-
Because Mi is monotone, it is obvious that ,
^
„ (2/i-^-2/i-*i) = 0
Monotone
277
multifunctions
for {z,x\) G Mi, and, since Mi is maximal monotone, (yi,yt)e.M1 for (z,x{) $ Mi. The same argument being true also for M 2 , we obtain that (z,xl) G Mi and (z,x%) £ M 2 . Furthermore Eq. (3.112) holds. • It is obvious that X x {0} C X x X* is a maximal monotone set whose domain is X. Taking in the preceding theorem one of the monotone sets to be X x {0} we obtain the following result. Theorem 3.11.5 Let X be a reflexive Banach space and M C X x X* be a nonempty monotone set. Then (i)
there exists (x,x*) £ X x X* such that V ( y , y ' ) G M : 2 (y - x,y* - x*) > \\xf + \\x*\\2 + 2 (x,x*);
(ii) M is maximal monotone if and only if for every (w,w*) 6 X x X* t/iere exists (a;, a;*) € M suc/i £/m£ ||X-U)||2 + | | X * - U ; * | | 2 + 2 ( X - K ; , X * - W * )
=0.
(3.114)
Proof, (i) Taking Mi := M in the preceding theorem and M 2 := X x {0}, condition (3.105) is satisfied, and so there exists (a;*,x 2 ,z) £ X* x X* x X verifying Eq. (3.111). Fixing (yi,y*) £ Mi = M we obtain that WyGX
: (yi-z,yl-xl)
+
(z,X2)>(y,xl);
therefore x% = 0. Taking (x,x*) = (z,x\), the conclusion is immediate. (ii) Suppose that M is maximal monotone and consider (w,w*) G X x X*. It is obvious that Mi := M — (iu,w*) and M 2 : = I x {0} are maximal monotone. Using Theorem 3.11.4(ii) we obtain the existence of (u, u*) G Mi such that ||u|| + ||u*|| + 2(u,u*) = 0, and so there exists (x,x*) G M satisfying Eq. (3.114). We prove now the converse implication. Let (w,w*) G X x X* be such that (y -w,y* —w*) > 0 for every (y,y*) € M. For this (w,w*) G XxX*, using our hypothesis, there exists (x,x*) € M satisfying Eq. (3.114), and so | | x - i u | | 2 + ||a;* -u>*|| 2 < 0, i.e. (w,w*) = (x,x*) e M. • Taking into account the expression of the duality mapping 4>x given in Eq. (3.59), from Eq. (3.113) we obtain that IWI2 + ||z*H2 + 2 (x, x*) = 0 O x* G -*x(x)
<* x* €
$x(-x).
278
Some results and applications of convex analysis in normed spaces
This remark gives us the possibility to obtain other characterizations of maximal monotone subsets (multifunctions). Theorem 3.11.6 Let X be a reflexive Banach space and M be a nonempty monotone subset of X x X*. Then M is maximal monotone if and only if M + gr{-$x) = X x X*. Proof. Assertion (ii) of the preceding theorem shows that the maximal monotonicity of M is equivalent to each of the following relations: V{w,w*) E X x X*, 3(x,x*) \/(w,w*) eX
xX*
£ M : w* - x* € : (w,w*)6M
+
-$x(w-x), gr(-$x).
Therefore the conclusion of the theorem holds.
•
Theorem 3.11.7 Let X be a reflexive Banach space and T : X =t X* be a monotone multifunction. IfT is maximal monotone then I m ( $ x +T) — X*. Conversely, if X is smooth and strictly convex, and!m($x+T) = X*, then T is maximal monotone. Proof. To begin with, suppose that T is maximal monotone. Let x* £ X*; because (0,a;*) £ X x X*, from the preceding theorem we get the existence of the elements {y,y*) £ g r T and (u,u*) £ g r $ x such that (y>U*) + (~u,u*) = (0,a;*). Therefore u = y and x* = y* + u*, i.e. x* £T(y) + $x{y). Suppose now that X is strictly convex and smooth. Prom Theorem 3.7.2 we have that $ x is single-valued and strictly monotone. Let (x, x*) e X x X* be such that (y — x,y* — x*) > 0 for every (y,y*) € g r T . Since x*+$x{x) € X* = I m ( $ x + T ) , there exists u £ X such that x* + $x(x) G $ x ( u ) + T(u). It follows that 0 < (u - x,x* + $x{x) - $ x ( " ) - x*) = - {x - u,$x{x)
- $x{u))
< 0,
and so (x — u, $x(x) — $x(w)) = 0. Since $ x is strictly monotone, we have that x = u, and therefore (x,x*) € g r T . Hence T is maximal monotone.• Another version of the preceding result is the following. Theorem 3.11.8 Let X be a reflexive Banach space and T : X z3 X* be a monotone multifunction. If T is maximal monotone then Vy<EX,3x£X
: 0 £ T(x) + $x(x
- y).
(3.115)
Monotone
279
multifunctions
Conversely, if X is smooth and strictly convex, and Eq. (3.115) holds, then T is maximal monotone. Proof. It is sufficient to note (identifying X** with X) that $ x * = ^x1' T is maximal monotone if and only if T _ 1 : X* =4 X is maximal monotone and Im{$x,
+T'1)
= X &Vy £ X, 3x* £ X* : j , £ $x.(x*) + T ^ x * ) &Vy£X,
3x£X
: *x(j/-i) nT(i) ^ 0
o V y € X , 3 x € X : 0 £ T(x) + $x(x The proof is complete.
- y). D
Remark 3.11.1 Since T C X x X* is maximal monotone if and only if AT is maximal monotone for some/any A > 0, it follows that, when X is strictly convex and smooth, we have that T is maximal monotone <=> I m ( A $ x + T) = X* for some/any A > 0 <£• Vy € X, 3x £ X : 0 6 XT(x) + $x{x ~ y) for some/any A > 0. Another very important result is the following one. T h e o r e m 3.11.9 Let X be a reflexive Banach space and2\, T2 :Xz=s, X* be maximal monotone multifunctions. Suppose that 0 £ (domxTx - domxT 2 ) 1 • Then T\ + T2 is maximal monotone. In particular domTi D domT 2 ^ 0. Proof. Let Mx = gr 7\, M 2 = gr T2 and M = gr(Ti +T2). The hypothesis shows that Mi and M 2 are maximal monotone and condition (3.105) is satisfied. From Theorem 3.11.4 it follows that dom MiOdom M2 = domTin domT 2 T^ 0, and so M is a nonempty monotone set. To prove the maximal monotonicity of M we use Theorem 3.11.5. Let (w,w*) £ X x X* and T\,f 2 : X =} X*, fi{x) := Tj(x + w) - ±w* (i = 1,2). It is obvious that Ti (i = 1,2) is maximal monotone, while from Lemma 3.11.2 we get that grTi, grT 2 satisfy condition (3.105). We can apply Theorem 3.11.4; so there exists (z^z^z) £ X* x X* x X such that (z,z?) £ grf; (i = 1,2) and
280
Some results and applications of convex analysis in normed
spaces
Denoting z + w by x and z* + \w* by x*, we obtain that (x,x*) E grTi (i = 1,2) and ii
M2
, II
*
,
*
*I|2
, o /
*
i
*
*\
r\
\\x — w\\ + \\x1 + x2 — w || + 2{x — w,x1 + x2 — w } = 0. Taking x* = x\ -\- x2, we obtain that (x,x*) € M and II
Il2 ,
II
*
*||2
. o /
„*
*\
rv
||a;-u;|| + ||a; — «; || + 2 (a; — iy,a; — w ) = 0. Using Theorem 3.11.5 we have that M is maximal monotone, and so T \ + I 2 is maximal monotone. • The sufficient condition from the preceding theorem is formulated using the auxiliary function x, which is not easy to calculate. In the following theorem we establish other sufficient conditions (in fact equivalent with the mentioned one) for the maximal monotonicity of the sum. Theorem 3.11.10 Let X be a reflexive Banach space and T\,T2 : X =3 X* be maximal monotone multifunction. Then int (domTi — domT^) — int (domxxi
—
domxr 2 ) = (domxrj — domXT2Y • (3.116) Therefore int (domTi — domT2) is a convex set; moreover, the following statements are equivalent: 0 6 {domxT, -domxrj,
(3.117) i
0 € (co(domT1)-co(domT2)) , i
(3.118)
0 6 (domTi - domT 2 ) ,
(3.119)
0 G int (domTj - domT 2 ),
(3.120)
each of these conditions ensuring that T\ + T2 is maximal monotone. Furthermore, if int (domTi - d o m ? ^ ) 7^ 0 (equivalently, i/(domxTi — domxr 2 ) 1 is nonempty), then int (domTi - domT 2 ) = domTi - domT 2 = domxTj - domxT2> (3.121) and so dom Ti — dom T2 is a convex set.
Monotone
Proof.
multifunctions
281
Taking into account Lemma 3.11.2, we have that int (domTi - domT 2 ) C (domTi -
domT2y
C (co(domTi) - co(domT 2 )) 1 C (domxTx - d o m x T 2 ) \ which shows that the inclusions C from Eq. (3.116) hold and (3.120) =>• (3.119) => (3.118) => (3.117). Furthermore, by Proposition 2.8.9 we have that int (domxTi — domxT 2 ) = (domxTi - domxT 2 )* •
(3.122)
Let us show that (domxTi - domxT 2 ) 1 C domTi - domT2, which together with Eq. (3.122) will prove Eq. (3.116). Let w be an element of (domxTi — domxT 2 ) 1 - Consider the multifunction Ti defined by T\{x) = T(x + w). We have that domTi = domTi —w and domxf = domxTi —w. Therefore Ti and T2 verify condition (3.105). From Theorem 3.11.4 it follows that domTi n domTx 7^ 0, »-e. w € domTi - domT 2 . Therefore Eq. (3.116) holds. From this relation we obtain that (3.117) =J> (3.120). From what was proved above and Theorem 3.11.9 we have that each of the conditions (3.117)-(3.120) ensures that T\ + T 2 is a maximal monotone multifunction. Furthermore, if int (domTi — domT 2 ) ^ 0, from Eq. (3.116) and int (domTi
-
domT 2 ) C domTi - domT 2 C domxTi — domxr 2
we get Eq. (3.121). The convexity of the set int (domTi - d o m T 2 ) and, when this is nonempty, of the set domTi — domT 2 follows from Eqs. (3.116) and (3.121). • Conditions (3.117)-(3.120) from preceding theorem can be "relativized." Theorem 3.11.11 Let X be a reflexive Banach space and Ti,T 2 : X =4 X* be maximal monotone multifunctions. Then ic
(domTi - domT 2 ) = "(domxTj - domxr 2 ) •
(3.123)
282
Some results and applications of convex analysis in normed spaces
Therefore l c (domTi — domT2) is a convex set and the following are equivalent: 0 £ ^domxTx - dom X T 2 ), 0 € i c (co(domTi)-co(domT 2 )), 0 £
ic
statements
(3.124) (3.125)
(domTi - d o m r 2 ) ,
(3.126)
domTi — domT2 is neighborhood of the origin in lin (domTi — domT2), (3.127) M
A (domTi — domT2) is a closed linear subspace, (3.128)
each of these conditions ensuring that 7\ + T2 is maximal monotone. Furthermore, if sc (domTi - d o m T 2 ) ^ 0 (equivalently, «/8C(domxTi — domxT 2 ) is nonempty), then ic
(domTi
- domT 2 ) = domTi - domT 2 = domxTx - domxr 2 ,
(3.129)
and so dom T\ — dom T2 is a convex set. Proof. To begin with, suppose that condition (3.124) holds; we wish to prove that Ti + T2 is maximal monotone; if we succeed we shall have that domTi n domT 2 ^ 0, i.e. 0 £ domTi - domT 2 . In our hypothesis XQ := aff(domxT! — domxr 2 ) is a closed linear subspace. Replacing eventually Tj by T; where Ti{x) = Ti(x+w) (i = 1,2) with w £ dom XTi H dom \T2 > w e m a Y assume that 0 £ dom \TX l~) dom xr 2 , and so domTi UdomT2 C XQ, Let i £ {1,2}. Since Tj is maximal monotone, it follows that grTj + {0} x X0X = grTj. Consider T° : X0 =t X0X,
T?(x) = {x*\Xo | x* £ Tt(x)}.
From Lemma 3.11.2 (iii) it follows that Tf is maximal monotone and dom XT; = dom \T° • X0 being a reflexive Banach space and 0 £ (dom XT° — domxT°)% from the preceding theorem we have that T® + T° is maximal monotone and (domxro — domxT 0 ) 4
=
int(domT° — domT°)
(with respect to XQ),
i.e. tc
(domxTi - domxT 2 ) =
%c
{domTi - domT 2 ) = intx 0 (domTi - domT 2 ). (3.130)
Monotone
multifunctions
283
It is obvious that (2\ + T 2 )° = T? + T2°. From Lemma 3.11.2 (iii) we have that T\ + T2 is maximal monotone. Let us prove now the other statements. Taking into account Lemma 3.11.2(iii), (iv), we have that domTi — domT 2 C domxTi — domxr 2 C aff(domTi) — aff(domT 2 ) Caff (dom Ti - d o m T 2 ) , which imply that aff(domTi - domT 2 ) C aff(domxTi - d o m x r j C aff (dom T\ - domT 2 ). Therefore aff (dom T\ —dom T2) is closed if and only if aff (dom \Ti —dom XT 2 ) is closed; in this situation these sets are equal. Taking into account this remark it follows that the inclusion C holds in Eq. (3.123). Let us prove the converse inclusion. Let w G tc (domxTi — domxT 2 ); this means that X0 := aff(domxTi - domxr 2 ) is a closed set and w G l (domxTi — domxr 2 )- Taking 7\ as in the proof of the preceding theorem, we have that 0 G i e (domxf — domxr 2 )- From the first part of the proof it follows that domTi H domT 2 ^ 0, whence we have that w G domTi domT 2 . We have obtained so that l
(domxTi
—
domxr 2 ) C domTi — domT 2 (C domxTi — domxr 2 C X0),
and so the inclusion D holds in Eq. (3.123), too. Note that relation Eq. (3.130), proved above when Eq. (3.124) holds, shows that (3.124) => (3.127). Condition (3.127) ensures that 0 G ^ d o m T i domT 2 ) and aff (domTi — domT 2 ) = lin(domTi — domT 2 ) is closed. Hence (3.127) =*• (3.126). The implications (3.126) =• (3.125) =$• (3.124) are obvious. The implication (3.127) => (3.128) is obvious, too. Suppose that (3.128) holds. Denoting by X0 the closed linear space (J A > 0 (domTi — domT 2 ), we obtain that Xo = UA>O (co(domTi) — co(domT 2 )), i.e. condition (3.125) holds. Therefore conditions (3.124)-(3.128) are equivalent. If zc(domTi - d o m T 2 ) ^ 0 (or equivalently lc (domxTi — domxr 2 ) 7^ 0), then relation (3.129) follows similarly as in the proof of the preceding theorem (or one reduces to this theorem). • When X is reflexive, if T : X =$ X* is maximal monotone, taking also
284
Some results and applications of convex analysis in normed
spaces
the maximal monotone set {0} x X*, we obtain that (domxTT
= int(domx:r) = int(co(dom T)) = int(domT) = (domT)*, (3.131) and so int(domT) is a convex set; if this set is nonempty, then (domT)* = domT = co(domT) = dom^T,
(3.132)
and so domT is a convex set, too. In the above relations we may replace * with ic. The convexity of the set domT also holds without interiority conditions. Theorem 3.11.12 Let X be a reflexive Banach space and T : X r3 X* be a maximal monotone multifunction. Then dom T and Im T are convex sets. Proof. It is obvious that for every I E I and A > 0 the multifunction TXi\ defined by TXi\(u) = XT(u + x) is maximal monotone. Using Theorem 3.11.7 we have that 0 6 Im($x + TC,A), »-e. there exists y € X such that 0 6 $x(y - x) + XT(y). We have obtained so that \/x G X, A > 0, 3(xx,x*x)
G XxX*
: A - 1 : ^ G T(xx),
-x*x G
*x(xx-x). (3.133) Let x G X be fixed; for every A > 0 consider (x\,x*x) given by Eq. (3.133). Consider also u G domT and u* G T(u) (fixed for the moment). From (x\ — u, A_1a;A - u*) > 0 we obtain that
||:EA — x\\ = (x\ — x, —x*x) < (x — u,x*x) + X(x — xx,u*) + X(u - x,u*), (3.134) and so, for AG (0,1] we have that \\xx - x\\2 < \\x - u\\ • \\xx\\ + X\\xx - x\\ • \\u*\\ +X\\x
u\\ • ||U*|| (3.135)
where a := ||a; — u|| + A||u*|| and /? := A ||a; - u|| • ||u*|| (we have taken into account that \\x^\\ — \\xx — x ||). Prom Eq. (3.135) it follows that there exists M > 0 such that ||x A || = \\xx - x\\ < M for every A G (0,1]. Let 7 := limsupA_j.0+ ||a;,\ - x\\ < oo and (An) C (0,1] be such that (A„) -¥ 0 and (\\xXn — x\\) ->• 7. Since (x\n) is bounded and X is reflexive, we may suppose that x*Xn -^-> x* G X*. Considering A = Xn in Eq. (3.134) and then taking the limit, we obtain that j 2 < (x — u,x*) for
Monotone
multifunctions
285
every u € domT; from this we get immediately that j 2 < (x-u,x*) for every u £ co(domT). Therefore, l i m ^ o ^ A = x (with respect to the norm) for every x € co(domT). Since (x\) C domT, we obtain that co(domT) c domT. Hence domT is a convex set. Since I m T = d o m T - 1 and T _ 1 is maximal monotone, we have that I m T is a convex set, too. • The hypothesis that X be reflexive for having Eqs. (3.131) and (3.132) is not necessary (under interiority conditions). T h e o r e m 3.11.13 Let X be a Banach space and T : X =} X* be a maximal monotone multifunction. Then condition (3.131) holds. In particular int(domT) is a convex set. Furthermore, i/int(domT) ^ 0 then condition (3.132) holds, and so domT is a convex set. Proof. To prove condition (3.131) it is sufficient to show that domT D (domxr) 1 - Taking into account Lemma 3.11.2(h) it is even sufficient to prove that 0 G domT if 0 e ( d o m x r ) ^ Let 0 e (dom^r)'- From Theorem 2.2.20 we have that XT is continuous at 0, and so there exists n, v > 0, 7] < 1 < v, such that XT(X) < v for ||x|| < n. Therefore V / i G s r , Vz, Hxll
(3.136)
Let K := u/n > 1. Let us prove that V/LiSCT : r(p) + KMn)\\>0.
(3.137)
Let fi €
max
inf h(n,x*) = inf
max
(r(/z) - (p(/x),z*))
= inf (r(^) + « | | p ( / i ) | | ) > 0 .
286
Some results and applications of convex analysis in normed spaces
Hence there exists u* € KUX* such that h(fi,u*) > 0 for every fi £ STTaking /j = S^XtX^ with (x,x*) £ g r T , we obtain that (x,x*) — (x,u*) = (x — 0,x*— u*) > 0 for every (x,x*) £ grT. Since T is maximal monotone, it follows that (0,u*) 6 grT, andsoO € domT. Therefore Eq. (3.131) holds. Suppose now that int(domT) ^ 0 (equivalently, (domxr) 1 7^ $)• Since int(domT) c domT C d o m ^ r , from Eq. (3.131) and Theorem 1.1.2 we obtain that Eq. (3.132) holds. • The next result refers to the local boundedness of monotone multifunctions. The multifunction T : X ^Z X* is called locally bounded at x £ X if there exists 77 > 0 such that T (B(x, r))) is a bounded set; of course, T is locally bounded if T is locally bounded at any x € X. Theorem 3.11.14 Let X be a Banach space and T : X =t X* be a monotone multifunction. Suppose that XQ G (co(domT))\ Then T is locally bounded at x$. Proof. From Lemma 3.11.2 we have that co(domT) C domx^. Therefore x0 G (domxr)*) and so XT is continuous at XQ. Then there exists 77,7 > 0 such that XT(X) < 7 for every x G D(xo,2n). In particular, V(x,x')e&T,\\x-x0\\
: (* + ".**>-<*.**> < 7> I + IHI
and so ||x*|| < 7 ( 1 + 77 + ||x||)/r? =: 1/.
•
The following partial converse of the preceding theorem holds. Theorem 3.11.15 Let X be a Banach space and T : X =$ X* be a maximal monotone multifunction for which dom T is a convex set. If T is locally bounded atxG domT, then x £ int(domT). Proof. Let C := domT; therefore C is a nonempty closed convex set. From our hypothesis there exists 77 > 0 such that T (B(x,r])) is a bounded set. Since x £ domT, there exists (xn) C domT such that (xn) —> x; we can suppose that (xn) C B(x,r)). Let (x*) C X* be such that xn € T(xn) for every n € N. Of course (x*) is bounded, and so it has a subnet (x*)iei which u;*-converges to x* £ X*. Let (XJ)J £ / be the corresponding subnet of (xn). We have that (XJ — y,x* — y*) > 0 for every (y,y*) £ g r T and i £ / . Passing to the limit we obtain that (x~ — y, x* — y*) > 0 for every (y>y*) 6 &T. Since T is maximal monotone we have that (x,x*) £ g r T ,
Monotone
287
multifunctions
and so x G domT. Because T (B(x,r]/2)) C T(B(x,T})) for every x £ domTfl B(x,r]/2), with similar arguments as above we obtain that C n B{x,n/2)
= domT n B{x,n/2)
CdomT.
(3.138)
Suppose that x $ int C, and so x G Bd C. From Bishop-Phelps theorem, it follows that there exists x € Bd C D B(x, rj/2) and x* € X* \ {0} such that (x-x,x*)>0
VxGCDdomT.
From Eq. (3.138) we have that x G domT; let XQ G T ( X ) . Taking into account the preceding relation, for t > 0 we have that (x -y,x*Q + tx* - y*} = (x - y, x*Q - y*) + t (x - x, x*) > 0 for all {y,y*) G grT, and so {x^ +tx* \ t > 0} C T(x); this contradicts the boundedness of T(B(x,r))). Therefore x € intC; from Eq. (3.138) it follows immediately that x G int (domT). D The following result shows that the converse of Theorem 2.4.13 holds in Banach spaces for Isc convex functions. Corollary 3.11.16
Let X be a Banach space and f G T(X).
(i) Let x G d o m / . x G int(dom/).
Then df is locally bounded at x if and only if
(ii) The following assertions are equivalent: (a) df is locally bounded at any x G d o m / , (b) d o m / is open, (c) / is continuous. (iii) df is locally bounded at any x G X if and only if dom f = X and f is continuous. Proof, (i) The sufficiency is proved in Theorem 2.4.13. Assume that df is locally bounded at x. By the Br0ndsted-Rockafellar theorem we have that x G dom df, while from the Rockafellar theorem we have that df is maximal monotone. Using the preceding theorem we obtain that x € int(dom<9/) c int(dom/). (ii) (a) <£> (b) is obvious from (i); (c) => (b) because d o m / = {x G X \
f{x) < oo}. (b) =r> (c) Note first that / is continuous at any x G X \ dom / because / is Isc. The space X being complete and / Isc, by Theorem 2.2.20 we have that / is continuous at any x G (dom/) 1 = int (dom/). Therefore / is continuous. D
288
Some results and applications of convex analysis in normed spaces We close this section with the following important theorem.
T h e o r e m 3.11.17 (Rockafellar) Let X be a Banach space and T : X =3 X* be a maximal monotone multifunction. Suppose that either X is reflexive and T is locally bounded at some point of dom T or int (co(dom T)) ^ 0. Then i n t ( d o m T ) ^ 0 and its closure is equal to d o m T . Furthermore T is locally bounded at every point o / i n t ( d o m T ) and is not locally bounded at any point of B d ( d o m T ) . Proof. If X is Banach space and int ( c o ( d o m T ) ) ^ 0, from Theorem 3.11.13 we obtain t h a t i n t ( d o m T ) ^ 0 and d o m T is a convex set. T h e rest of the conclusion follows from Theorems 3.11.14 and 3.11.15. If X is a reflexive Banach space, using Theorem 3.11.12, we have t h a t d o m T is convex, and so, from Theorem 3.11.15, we have t h a t i n t ( d o m T ) ^ 0. T h e rest of the conclusion follows from the first part. •
3.12
Exercises
Exercise 3.1
Consider the function
f-.e'^R,
/(*)-£L,M1+1A, *•—'k>l
where x = {xk)k>\- Prove that / is well-defined, continuous and strictly convex. Moreover show that / is Gateaux differentiate on I1 with V / ( x ) = ( ( 1 + AT1) \xk\1/k ^
sgn(x fc ))
V i = (*»)*>! 6 I1,
' k>i
where sgn(7) := 1 for 7 > 0 and sgn(7) := - 1 for 7 < 0, but / is nowhere Prechet differentiate. Exercise 3.2 Let / : (°° ->• 1 be defined by f{x) := limsup|a;„|. Prove that d o m / = £°°, / i s convex and continuous but nowhere Gateaux differentiable. E x e r c i s e 3.3 Let (X, \\ • ||) be a normed space and / € A(X). Prove that the statements (i), (ii) and (iii) below are equivalent: (i) / is uniformly convex; (ii) Ve > 0, 3d > 0, Vx,y £ d o m / , \\x - y\\ > (>,=)6, f(Xx + (1 - X)y) < A/(x) + (1 - X)f(y) - A(l - A)<5; (iii)
VA e [0,1]
:
Ve > 0, 3d > 0, Vz,j/ g d o m / , \\x - y\\ > ( > , = ) * : / ( | x + \y) <
yw + km-o-. (iv) Prove that the statements obtained from (ii) and (iii) by fixing x G dom / are equivalent to the fact that / is uniformly convex at x.
Exercises
289
Exercise 3.4 Let / : t1 -¥ R be defined by f(x) := E n > i kA=|1+1/A:- By Exercise 3.1 / is strictly convex, continuous and Gateaux differentiable but nowhere Prechet differentiable. Prove that p/,x(t) = 0 for every t > 0, flf,x(t) > 0 for all t > 0 and x G A := {x G £l | h m s u p ^ ^ \xk\1/k < 1}, and df,x(t) = 0 for all t > 0 and x € ix \ A. Exercise 3.5
Let (X, ||-||) be a normed space.
(i) Let A := {x"n | n 6 N} C Sx* and consider the function
f-.X^R,
f(x):=J2
-, 2- n (x,x;) 2 .
*—'n>l
Prove that / is convex and uniformly Frechet differentiable, and -
+i*c,
(1>°dc)'=i.c+il>*°\\-\\,
( i c + V> # °IHl)"=V>°«fc.
Moreover, if x 0 6 X \ C and s € Pc(zo) then dg(x0) = {x* G Sx* DN(C,x) \ (xo -x,x*) — \\x0 -x\\}Exercise 3.8 Assume that X is a reflexive Banach space and / G T(X) is such that S := a r g m i n / is nonempty and bounded. Prove that df(S) is a neighborhood of 0 (for the norm topology) if S is a set of weak sharp minima of F. Exercise 3.9 Let X be a normed vector space, g G A(X) be such that S := {x G X | g(x) < 0} is nonempty and h := max(,0). We say that g is metrically regular at the nonempty set S C S if there exist 6, a > 0 such that ds{x) < ah(x) for all x G X with d§(x) < S; when S — {z}, one says that g is metrically regular at z. Consider the following conditions: (i) S is a set of weak sharp minima for ft; (ii) g is metrically regular at any nonempty set S C S; (iii) g is metrically regular at S; (iv) g is metrically regular at any z G S.
290
Some results and applications of convex analysis in normed spaces
Prove the following implications: (i) -<=> (ii) •!=> (iii) => (iv). Moreover, if S is compact prove that (iv) => (i). Exercise 3.10 Let X be a normed space, / 6 I \ X ) and fi > 0. Let us consider the Baire regularization of / defined by / B (-,/!) : X -> R,
fB(x,n)
:= inf {/(y) + M ||x - «/||
\y€X).
1) Prove that for every fi > 0 the function /s(-,fi) is either identical —oo, or finite, convex and Lipschitz with Lipschitz constant /J,. Moreover, / B ( - , ^ I ) < / B ( - , M 2 ) < / for 0 < (ii < H2 and limM_>oo / B ( X , ^ ) = f(x)
for every x G X .
2) Let so G X be fixed. Prove the equivalence of the following statements: (a) df(xo)Clfj,UX' / 0, (b) f(x0) = fB(x0,(J,), (c) / B (xo,/i) G R and a / s ( x o , / i ) =
a/(x 0 ) n nUx> • Exercise 3.11 Let X be a normed space, / e r ( X ) and fi > 0. Let us consider the Moreau regularization of / defined by / M ( - , / I ) : X -> R,
/ M ( X , P ) := inf { f(y)
+ %\\x - y\\2 \ y e X}
.
1) Prove that for every fi > 0 the function / M ( - , P ) is finite, convex and continuous, / M ( - , M 0 < /M(-,A*2) < / if 0 < /zi < /Z2 and limM_>oo fM(x,fi) = / ( x ) for every x G X . Moreover, if x* 6 df(x) then / ( x ) — ^-||x*|| 2 < / M ( X , / ^ ) . In the sequel we assume that X is a reflexive Banach space. 2) Prove that there exists J^ : X —• X (unique if X is also strictly convex) such that V x € X : /M(x,^) = /(JM(x))+f||x-JM(x)||2; furthermore, limM_,oo J^(x) = x and lim^-x^ / (J^(x)) = / ( x ) for every x € dom/. 3) ^
Assume that df(x) ^ 0. Prove that ||x*|| < dafM(0)
£ 5/M(X,^).
for all fj, > 0 and
Moreover, if x^ e 5 / M ( X , ^ ) for all ^J > 0 and x* ;
—>
x*
for some net {(ii)iei —> oo then x* S -Pd/(x)(0); conclude that lim^-too ||xjl|| = <*d/(*)(0). 4) Assume that X is smooth. Prove that /M(-,/*) is Gateaux differentiable on X and V / M ( X , P ) = /z$x(x — J^(x)) for every x 6 X , and w* — lim^-,00 V / M ( X , ^ ) = Xo if 5 / ( x ) ^ 0, where xS is the unique element of minimal norm of df(x). Moreover, if X* has the Kadec-Klee property (see p. 233) then XQ = lim^-Kx, V/M(x,/i). 5) Prove that /M(-JA') is Frechet differentiate on X and V / M ( - , M ) tinuous when X is locally uniformly smooth.
IS c o n
~
Exercise 3.12 Let
Exercises
291
Exercise 3.13 Let (X, (• | •)) be a Hilbert space and K C X be a closed convex cone. For every i 6 l w e denote by PK(X) the projection of a; on K; PK(X) exists and is unique. Let us consider K~ := —K+ = {y € X | Vx € K : (x \ y) < 0}. Prove that Vx6X
: x = PK(x) + PK-(x)
and (PK(x)\PK-(x))
Conversely, prove that x\ = PK(X) and X2 = PK-{x) x\ € K, X2 € K~ and [x\ \ X2) = 0.
= 0.
if a; = x\ + X2 with
Exercise 3.14 Let X, Y be Hilbert spaces, A € £ ( X , Y) be a surjective operator, C C Y be a nonempty closed convex set and yo € Y be fixed. (a) Prove that the operator T := A o 4 * is bijective (X* and Y* being identified with X and Y, respectively). (b) Prove that the problem (P) min ||a;||2, Ax + yo G C, has a unique solution. Moreover, x £ X is the solution of (P) if and only if x = (A* a T _ 1 ) ( y — yo), where y is the solution of the equation y = Pc(y — pT_1y + pT~lyo) for some (any) p > 0. Exercise 3.15 Let X be a reflexive Banach space and g G T(X) be a coercive function, i.e. lim,||!I.||_K50 g(x) = 00. Prove that g € T (see Eq. (3.73) on page 252). Exercise 3.16 Let X be a reflexive Banach space and g e T(X). Prove that g 6 T whenever (a) 0 € (doing*) 1 , or more generally, (b) 0 € '(domg*) a n c i lin(dom*) is w*-closed. Exercise 3.17 Let (X, ||||) be a normed space, g S T(X) and A C X be a closed convex set such that C := ACi[g < 0] is nonempty. Assume that goo{u) < 0 for some u e J4OO f"l Sx- Prove that for a := (—9oo(u))_1 one has dc{x) < a • g+(x) Exercise 3.18
Vx € A.
Let 31,92,33 : K2 -¥ R be defined by:
gi(x,y) := yjx2 + y2 - x - 1,
g2{x,y) := y2 - 2x - 1,
g 3 := max{gi,g 2 }-
The functions 31,32,33 are convex, 31 and 32 being differentiable on 1R2 \ {(0,0)} and R 2 , respectively. Prove that »»i(t) = fe9i(*) = 0 Vt > - 1 = inf Si, r M ( t ) = * OT (t) = 2 Vt > - 0 0 = inf 32, r 9 3 (0) = r 9 1 (0) = 0, fc93(0) = k92(0) = 2, where rg and fc9 are defined on page 255.
292
3.13
Some results and applications of convex analysis in normed spaces
Bibliographical Notes
Section 3.1: We used the book [Phelps (1989)] in the presentation of the theorems of Borwein, Br0ndsted-Rocka£ellar and Bishop-Phelps, as well as Proposition 3.1.3; in the statement of Borwein's theorem we added an estimation given in [Penot (1996b), Prop. l]. The statement (i) of Theorem 3.1.4 can be found in [Phelps (1993)], while (ii) is established by Attouch and Beer (1993). For Simons' theorem and the proof of the Rockafellar theorem we used [Simons (1994b); Simons (1994a)]. Note that in many books one prefers to give the proof of Rockafellar's theorem in reflexive Banach spaces using Theorem 3.11.7. Proposition 3.1.5 is stated by Zalinescu (1999); for X = Y, A - ldx and F(x, y) f(x)+g(y), Attouch and Thera (1996) (for X reflexive) and Simons (1998b) (in an equivalent formulation) obtained the weaker formula i n t ( d o m / — domp) = int(domc)/ — domdg). Theorems 3.1.6 and 3.1.7 are obtained by Penot (1996b). A result similar to Theorems 3.1.7 is obtained by Thibault (1995a). Section 3.2: The results on the Clarke tangent cone, the Clarke-Rockafellar directional derivative and the Clarke subdifferential can be found in the books [Clarke (1983); Rockafellar (1981)]. Theorem 3.2.5 (with the conclusions (i), (iii) and (iv)) was originally stated by Zagrodny (1988) for the Clarke-Rockafellar subdifferential and r = f(b). The present statement subsumes those in [Thibault (1995b)] and [Aussel et al. (1995), Th. 4.2]; the proof follows the lines of the corresponding result in [Thibault (1995b)]. Theorem 3.2.7 was stated for the first time by Poliquin (1990) in R" for do; the present statement subsumes the corresponding results in [Luc (1993), Th. 3.1], [Correa et al. (1994)] and [Aussel et al. (1995), Th. 5.5] (see also Remark 3.2.3). The proof of Theorem 3.2.8 belongs to Luc (1993). The statements and proofs of Theorem 3.2.9 and Corollaries 3.2.10, 3.2.11 are from [Thibault and Zagrodny (1995)]; there it is assumed that the subdifferential d satisfies several conditions (among them conditions (P4) and (P5)) which imply our condition (PI). Note that Corollary 3.2.11 is a famous result of Rockafellar (1966). Section 3.3: The lowest and greatest quasi-inverses were introduced by Penot and Voile (1990). The equivalence of (i) and (ii) of Theorem 3.3.2 for the Frechet homology can be found in the book [Phelps (1989)], while that for the Gateaux homology in [Giles (1982)]; the equivalence of conditions (i), (iii), and (v) for the Gateaux and Frechet homologies can be found in [Giles (1982)]; the equivalence of (i), (vi), (vii) and (viii) are obtained by Borwein and Vanderwerff (2000). Corollaries 3.3.4 and 3.3.5 are stated by Asplund and Rockafellar (1969). Section 3.4: In writing this section we followed [Cornejo et al. (1997)]. Note that Proposition 3.4.1 was obtained by Penot (1996a) under more general conditions. Corollary 3.4.4 is established, mainly, by Zalinescu (1983b). Theorem 3.4.3 generalizes Corollary 3.4.4 to the case when argmin/ is not a singleton; the equivalence of conditions (i), (ii), (v) and (viii) is established by Lemaire (1992). For a detailed study of the problems considered in this section for the nonconvex
Bibliographical notes
293
case see [Penot (1998b)]. Proposition 3.4.5 is established by Butnariu and Iusem (2000) for Frechet differentiable functions. Section 3.5: The results of this section are mainly from [Zalinescu (1983b)] and [Aze and Penot (1995)]. The strongly convex functions were introduced by Polyak (1966), the uniformly convex functions by Levitin and Polyak (1966) and the uniformly smooth convex functions by Aze and Penot (1995); the notion of uniformly smooth function introduced by Shioji (1995) coincides, for convex functions, with that of Aze and Penot (1995). Proposition 3.5.1 was established by Vladimirov et al. (1978), while Propositions 3.5.2, 3.5.3, Corollary 3.5.4 and Theorem 3.5.5 were established by Aze and Penot (1995). Theorem 3.5.6 is new for X a general normed space. The equivalences of the conditions of Theorem 3.5.12 to which one adds the conditions (iv) and (v) of Theorem 3.5.6 are established by Aze and Penot (1995), too. Theorem 3.5.10 and the implication (i) => (iii) of Proposition 3.6.4 were established, in reflexive Banach spaces, by Zalinescu (1983b). Theorem 3.5.13 and its corollary shows that the existence of uniformly convex or uniformly smooth convex functions on a Banach space is very close to the refiexivity of the space; Theorem 3.5.13 was established by Vladimirov et al. (1978) under the supplementary hypothesis that the function / is finite valued and bounded on bounded sets. The equivalence of conditions (i) and (vi) in Corollary 3.5.7 is mentioned in [Borwein and Preiss (1987)], while the equivalence of conditions (iii), (v) and (vi) (see also Remark 3.5.2) can be found in [HiriartUrruty (1998)] for X - R", p = 2 and / Frechet differentiable. The equivalence of conditions (i), (iii) and (v) in Corollary 3.5.11 was established by Rockafellar (1976) in Hilbert spaces for p = 2 with the same constant c (see also Remark 3.5.3). Applications to the study of uniformly convex and locally uniformly convex Banach spaces can be found in [Vladimirov et al. (1978); Zalinescu (1983b); Aze and Penot (1995)]. Sections 3.6, 3.7: The results of Section 3.6 are new. The characterizations of the geometrical properties of Banach spaces presented in Section 3.7 are classical for
294
Some results and applications of convex analysis in normed spaces
[Dontchev and Zolezzi (1993)]). The other results in this section can be found in the survey paper [Soloviov (1997)]. Apparently Theorem 3.9.3 was stated by Vlasov (1972), while Theorem 3.9.4 was stated by Efimov and Stechkin (1959). Section 3.10: The presentation of this section follows, mainly, [Zalinescu (2001)]. The notion of set of weak sharp minima, in the present form, was introduced by Burke and Ferris (1993), generalizing the notion of sharp minimum point introduced by Polyak (1980); in the same paper Polyak introduces the notion of set of weak sharp minima in a slightly different form. The equivalence of conditions (i), (vi), (ix) and (x) is stated by Burke and Ferris (1993) for d i m X < oo; (i) <^ (x) is mentioned in [Jourani (2000)] for X a Hilbert space for / € A(X) with S closed (an inspection of the proof shows that this is true also in our case). The equivalence of (i) and (ii) of Theorem 3.10.1 is established by Lemaire (1989). The proof for the equivalence of (i), (ii) and (v) is taken from [Cornejo et al. (1997)]. The class T of functions defined by Eq. (3.73) was introduced and studied by Auslender and Crouzeix (1989) in finite dimensional spaces. The numbers //(£), kj(t) and rf(t) were introduced in [Auslender and Crouzeix (1989)] (as in Eqs. (3.79) and (3.82)); in this paper Propositions 3.10.3, 3.10.8, Eq. (3.84) of Proposition 3.10.7 and Theorem 3.10.10 were stated, too. Formula (3.84) Auslender et al. (1993) simplify several proofs from [Auslender and Crouzeix (1989)] and mention that those results hold in reflexive Banach spaces; one can find the reflexive case of Proposition 3.10.8 and Theorem 3.10.10 in [Cominetti (1994)]. Proposition 3.10.11 and Theorem 3.10.12 are from [Dolecki and Angleraud (1996)]. The problem of global error bounds for convex inequalities is studied in many articles. The results stated in this section are related to those in the articles [Auslender and Crouzeix (1988); Deng (1997); Deng (1998); Lewis and Pang (1998); Klatte and Li (1999)]. Excepting [Deng (1997); Deng (1998)], the results in the above mentioned papers are stated in finite dimensional spaces. Klatte and Li (1999) introduced the numbers r^(0) (1 < i < 6) for / finite valued with 0 > inf / and established Corollary 3.10.6. The implications (i) => (vi) and (ii) => (vi) of Theorem 3.10.13 can be found in [Deng (1997)] and [Deng (1998)], respectively, while the other ones can be found in [Klatte and Li (1999)] (in the mentioned framework of finite valued functions on R"). Theorem 3.10.14 and Propositions 3.10.15, 3.10.16 were established by Lewis and Pang (1998). Section 3.11: Coodey and Simons (1996) associated an adequate convex function to a multifunction. Simons (1998b) gave an interesting approach of monotone multifunctions theory by using these associated convex functions. Excepting Theorem 3.11.15, which is an amelioration of a result in [Aze and Penot (1995)], its corollary, and Theorem 3.11.17, which is established in [Rockafellar (1969)] (with a different proof), all the results can be found in the book [Simons (1998a)] and partially in [Coodey and Simons (1996); Simons (1998b)]. Exercises: Exercises 3.1 and 3.4 are from [Butnariu and Iusem (2000)], Exercise 3.2 is from [Phelps (1989)], Exercises 3.3 and 3.12 are from [Zalinescu (1983b)], for more general renorming theorems than those in Exercise 3.5 see
Bibliographical notes
295
[Deville et al. (1993)], the second formula from Exercise 3.7 is proved in [Cornejo et al. (1997)], Exercise 3.8 is from [Burke and Ferris (1993)] (there X = R"), Exercise 3.9 is from [Deng (1998)] (with g finite and continuous), Exercise 3.10 is from [Borwein and Vanderwerff (1995)], Exercise 3.11 (without (v)) is from [Barbu and Precupanu (1986)], Exercise 3.13 is a classical result from [Moreau (1965)], Exercise 3.14 is from [Kassara (2000)], Exercise 3.15 (for X = Rn) is from [Auslender and Crouzeix (1989)], Exercise 3.17 for X a Banach space can be found in [Deng (1997)] for g finite and continuous and in [Jourani (2000)] for g satisfying the condition d(g+iA)(x) — dg(x) + N(A,x) for all x € ( A n d o m g ) \ C , the functions considered in Exercise 3.18 are from [Klatte and Li (1999)].
This page is intentionally left blank
Exercises — Solutions
E x e r c i s e 1.1 Let x £ coA and consider P := {m £ N | 3 A i , . . . , A m 6 P, xi, •.. ,xm € A : 5Zr=i ^k = 1> S r = i ^fc2^ = x}- Of course, taking into account the formula for coA on page 2, P ^ 0. Let p := m i n P € N. Assume that p > n + 1. Since p E P, there exist A i , . . . , Ap 6 P and xi, • • ., xp 6 A such that Y7k=i ^fc = 1 a n < i £/b=i ^A^A! = x. Since xi — xp,..., x p _i — x p are linearly dependent, there exists (/ii,.. • ,(ip-i) € R p _ 1 \{0} such that Sfc=i P * ( ^ — X P ) = 0, or equivalently, there exists ( ^ i , . . . , /ip) £ R p \ {0} such that 53*=i Hk = 0 and S^-iMfc^fc = 0- It follows that x = J2fc=i(^* + ty**)35* for every i £ R. Taking £ = t := min{—Afc//jfc | jtj, < 0} = — Aj-//^- > 0, we obtain that x = ^jt^jKAfc +tfik)xk, and so m i n P < p — 1, a contradiction. E x e r c i s e 1.2 Without loss of generality we suppose that 0 £ A. Let Xo •= lin A. If XQ = {0}, then A = {0} and the properties of the statement are obvious. Suppose that dimXo > 1. Taking into account that dimXo < d i m X < oo, Xo is a closed linear subspace, and so we may replace X by Xo, which is the same as saying that \inA = X; in this situation *A = A1. Since X is finite dimensional and A is convex, to prove the relations of the statement it is sufficient, taking into account Theorem 1.1.2, to verify that A1 ^ 0. Since lin A = X, there exists a base { e i , . . . , ej,}, k > 1, of X with elements of A. The element x := ^ ( e H he*) £ A\ Indeed, let x = AieH VXke-k S X and 5 := min{ 2k\ , , . . . , 2k\ . }; for every t € ] — 5,8[ we have that x + tx € A. Therefore x £ A*. E x e r c i s e 1.3 Doing a translation, we may suppose that 0 € H , and so H = kerx* for some x* £ X* \ {0}. Let x 0 € intn(A n H) and xi £ A \ H; without loss of generality we suppose that (xi,x*) = 1. Prom the hypothesis, there exists U eWx such that (x0 + U)r\H - x0 + UHH C A. Let U0:={x&U\ \{x,x*)) < 1/4} 6 Kx- Consider fi > 0 such that /x(xo — xi) € i/o and take 0 < 7 < 2u+\) • 297
Exercises - Solutions
298 Let u G Uo be fixed. Then
\x0 + | x i + 7 U = {\ - 7 («,£*)) (xo + u1) + ( | + 7 ( u , x * ) ) x i ,
(*)
where « := 1/2 _ 7 7
1/2 - 7 V ) + Ml72-T<^>) ' 1 ^ ^ + '<"'''>' + <"»''» * f' the last inequality being valid for our choice of 7. Since 0, w and /x(xo — £1) are in the convex set Uo, we have that u' G [/, and so xo + u' G A. Prom (*) we obtain that | x o + \x\ +~fu G A by the convexity of A Therefore \xo + 5X1 +7C/0 C A, which proves that int A ^ 0. Let now M C X be as in the statement of the exercise. If A C M it is nothing to prove. Otherwise let xi £ A\ M and consider Mi := aff(M U {xi}). Doing a translation, we assume that x\ — 0. Therefore Mi = lin M — (M — xo) + Rxo for some xo € M. By the Dieudonne theorem (Theorem 1.1.8) Mi is a closed linear subspace of X. Since M is a closed hyperplane of Mi and (AnMi)tlM = AHM, by what precedes we have that ixitMi{A C\ Mi) ^ 0. If Mi = aff A the proof is complete. Otherwise, since Mi is a closed afiine subset of X in a finite number of steps (< codimM) we obtain that rint A = mtagA A 7^ 0. Exercise 1.4 Since A + i n t C = U a eA( a + hitC), the set A + i n t C is open. Hence A + int C C int (A + C). Let x ^ A + int C. Since the set A + int C is a convex set with nonempty interior, there exists x* G X* \ {0} such that Vo € A, Vc 6 int C : (x, x*) < (a, x") + (c, x*). Since C C cl(int C) we have that (x, x*) < (x, x*) for every x £ A + C. Therefore x £ int(A + C), which proves that int(A + C) = A + int C. Suppose now that A n int C 7^ 0. The inclusion cl(A n C) C cl A l~l clC is obvious (and true without any hypothesis on A and C); so it is sufficient to prove the converse inclusion. Without loss of generality, we suppose that 0 € A 0 int C (doing, if needed, a translation). Let pc be the Minkowski gauge associated to C. By Proposition 1.1.1 pc is continuous and i n t C = { x € X | p c ( x ) < 1},
clC = {xG X \pc(x)
< 1}.
Let x € c\A PI clC; there exists ( i „ ) C A such that (xn) —• x. Since p c is continuous, we have p c ( x n ) —• pc(x) < 1. If the set P := {n G N | pc(x„) < 1} is infinite, then xn G A n C for everyraG -P, whence x = lim n € p xn G cl(A n C). If P is finite, there exists no G N such that pc(xn) > 1 for every n > no. Since x G clC, we have that pc(xn) -+ 1. For every n > no, consider xn := „„ ," U l a?n- Since A is convex and 0,x n G A, we have that x n G A. Since
Exercises - Solutions
299
pc{xn) = npc{xn)/(npc{xn) + 1) < 1, we have that xn e A n C for every n > no- But (x n ) -» x, and so x G cl(A fl C) in this case, too. Therefore cl(AnC) = c l A n d C . Suppose now that C is a convex cone with nonempty interior; we have int C = int(clC). Using what we proved above, taking A = c l C we have that cl C + int C = cl C + int(cl C) = int(cl C + cl C) = int(cl C) = int C. E x e r c i s e 1.5 (a) It is obvious that Xo + C is a convex cone. Let us show by mathematical induction on p that Xo + C is closed. Let p = 1. If ai G Xo we have that Xo + C = Xo, and so Xo + C is closed. Suppose that a\ £ Xo- Let us consider (xn) C Xo + C, (x n ) —> x G X. Then there exist (y„) C Xo, (tn) C R+ such that xn = yn + tnai for every n G N. If (tn) —> oo then ( —^-j/n ) —> a i , whence we get the contradiction oi G clXo = Xo. Therefore (tn) contains a bounded subsequence, and so it contains a subsequence (t„k)keN convergent to t £ R+. Therefore (ynk) —> x — ta\ =: y G Xo, which proves that x = y + ta\ G Xo + C. Therefore the statement is proved for p = 1. Suppose now that the statement is true for p > 1; we are going to prove it for p + 1. As for p = 1, consider two situations. To begin with, suppose that — o i , . . . , — a p +i G Xo + C; then, as one can easily verify, X 0 + C — Xo + l i n { o i , . . . , a p + i } . Since the dimension of l i n { a i , . . . , a p + i } is less than or equal to p + 1 < oo, using Theorem 1.1.8, we obtain that Xo + C is a closed linear subspace. Suppose now that there exists % such that —a* ^ Xo + C; without loss of generality, we assume that i = p + 1. Let
C •= {Y^^i^l
Vi, 1 < i < p : Ai >0J.
By the hypothesis of mathematical induction we have that Xo + C closed. Let (xn) C Xo + C, (xn) ->• x e X. There exist (j/„) C X 0 + C, (tn) C JR+ such that xn = yn + t„ap+i for every n G N. If (t n ) —• oo then Xo + C 9 T"Un —> —fflp+i, whence — a p +i G Xo + C C Xo + C, which is a contradiction. Therefore (tn) contains a subsequence (tnk)keN convergent to t G R+. Therefore (ynk) —> x — ta,p+i =: y G Xo + C which proves that x = y + £a p +i G Xo + C. Therefore the statement is true for p + 1. Thus we obtained that Xo + C is a closed set for every p > 1. (c) Let K := {x G X | V i G l,fc : (x,ifii) < ai}. If the set K is empty, the conclusion is obvious. Suppose that K ^ 0. Consider the functions T,A:
X->R*,
r(s)~((x,
A ( i ) := a - T ( i ) ,
where a := ( a i , . . . , ajt) e l * . It is obvious that T is a continuous linear operator. Therefore A is continuous and T(Xo) is a linear subspace of Rfc. It follows that T(X 0 ) is closed. Let us consider P :- {y G Rfc | Vi G TTfc : 2/i > 0}; then
300
Exercises - Solutions
K = A~l{P) and A(K) C P. Let (xn) C X0 + K, (xn) -*• x G X. There exist (2/n) C Xo and (z„) C K such that x„ = yn + 2n for every n G N. Therefore T(X 0 ) + P 9 A(z„) - T(yn) = A(xn) -> A(x). By (b) we have that T(X0) + P is closed, whence A(x) G T'(-X'o) + P. Therefore there exists x0 G X0 such that A(x + xo) € P, i.e. x £ Xo + K. Hence Xo + K is a closed set. E x e r c i s e 1.6 Without loss of generality, we assume that ||xo|| = 1. It is clear that \x G P(a) if A G R and x G P(a). Let x, y G P(a); we have that {x I xo) > ||z|| • cos a,
(y | x0) > \\y\\ • cos a,
and so (x + y I xo) > (||x|| + II2/H) • cos a > \\x + y\\ • cos a. Therefore x + y € P ( Q ) . It is evident that P ( a ) is closed. Note that P(a) = {x = Xxo + y G X | A = (x \ xo), y = x — Axo, (x | xo) > ||x|| • cos a } = JAxo + 2/ | A > 0 , {y\xo) =^0, A > yj\2 + \\y\\2 • cos a } = {Ax0 +2/ I A > 0, (y|xo) = 0, A sin a > ||j/|| • cos a } . Therefore P(0)
= {Ax0
I A > 0},
P (TT/2) = {Axo + y/X
> 0, (y | x 0 ) = 0}.
Since P(a) is a cone, we have that P(a)° = P(a)+. To begin with, let us show that P ( 0 ) + = P(TT/2). Let x = Axo and y = fixo + v, with A,p > 0, (u | xo) = 0. Then (x \ y) = Xy. > 0, an so P(TT/2) C P ( 0 ) + . Let now y e P ( 0 ) + . There exists fi € R and » 6 l such that (v \xo) = 0, y = /IXQ+V. Then (xo | ^ x o + u ) = ft > 0, which proves that y G P(TT/2). Therefore P(0)+ = P(TT/2). Let us consider now a €]0, f [ . Let x G P ( a ) and y G P ( f / 2 — Q). By what was proved above, there exist X,/i > 0, u , u e X such that x — Xxo + u,
(u I xo) = 0, A sin a > \\u\\ • cos a,
y = fj,xo + v,
(v\xo) = 0, fj, cos a > \\v\\ • sin a.
Therefore X/J. > \\u\\ • \\v\\. We obtain that (x\y)
= Xfi+(u\v)
> XfM-\\u\\ • ]\v\\ > 0.
It follows that y G P(a)+, whence P(?r/2 - a) C P(a)+. Let now y G P ( a ) + ; there exists /i G R and u G X such that (v \ xo) — 0, y = fj,xo +v. If v — 0, since xo G P ( a ) , we have that (xo | y) = (xo | fixo) = ^ > 0,
Exercises - Solutions
301
sin oi whence y G P(n/2 — a). Suppose that v ^ 0 and consider x := XQ — TTT, \\v\\ • cos a Then x G P{a), and so sinQ
i \ \
(X \y) = Lb — rr—r. K
'yj
p
||t>|| -cosa
^ i x
sin a
' '
H
-
.. .,
• (V \v) = Lb y
v
• \\v\\ > 0 .
cosa
" " -
We get so that Lisin(n/2 — a) > \\v\\-cos(n/2 — a), i.e. y 6 P(TT/2 — a). Therefore P(a)+ = P{ir/2 — a) in this case, too. Furthermore, we remark that P(a) is pointed if and only if a G [0, 7r/2[, and i n t P ( a ) = {x G X \ Z(x,x 0 ) < a}. Exercise 1.7 Let (u\,... ,un,un+i) € Kp< and ( x i , . . . j Xn)#n+l ) G Kp. by the generalized means inequality (see Exercise 2.11), xwi + ...+ xnun
= on (a]~ xiwi) + ... + a„ (a~
Then,
xnu„)
-^ / - l \ai i -l \a» > [ax XlUl) • • • [an XnUn)
> (a? 1 • • • a n " ) _ 1 p - V _ 1 I z n + i l K + i l = |x„+lU n +i| > —Xn+\Un+\. Therefore x\u\ + . . . + xnu„ + xn+\un+i > 0, and so Kp< C (Kp)+. Let now ( i n , . . . , u n , u n + i ) € {Kp)+. Then xiin + . . . +xnun +x„+iun+i for every ( x i , . . . , xn, xn+i) G Kp, and so XlUl + . .. +X„U„ > px"1
• " X " " |«n+l|
>0
V x i , . . . , X n > 0.
Taking xt = 1 and Xj = 0 for j ^ j , we obtain that Ui > 0 for every i G l,ra. Moreover, if m = 0 then, taking X2 = . . . = xn — 1, we obtain ui + ... + un > p~lxai | u n + i | for every x > 0. Leting x —>• oo we get u„+i = 0. The same conclusion is true if U{ = 0 for some i G 2, n. Assume that u, > 0 for every i G l,n and take XJ := ai/ui. The above inequality implies that 1 > pa™1 • • - a " " (u" 1 • • - u " " ) - 1 |« n +i|, whence | u n + i | < p'u™1 • • -u%n. Hence (ui,... ,u„,un+i) G Kpi. It follows that Kpi — (Kp)+. Hence Kp is a closed convex cone for every p > 0. Exercise 1.8 Taking xo :— (1,0,1) in Exercise 1.6, we get easily P = Also, taking n — 2 and a\ = a? = 1/2, it is obvious that P = K y j -
P(ir/4).
Exercise 1.9 Let / : X —* R, / ( x ) = Q||X|| — f{x). It is obvious that / is a sublinear continuous functional. Therefore C is a closed convex cone. If x, —x G C then ¥>(x) > a||x||, -(f(x) -
Exercises - Solutions
302
and so, adding these relations, we have that 0 > 2a||z||, i.e. x = 0. Therefore C is pointed. Since a < \\(p\\, there exists x G X such that a||a;|| < f(x), and so 0 > inf/. Therefore, using Exercise 2.4, we have that int C = {x G X | f(x) < 0 = /(0)} # 0. Furthermore, using Corollary 2.9.5, we have that N(C; 0) = R+ • 5/(0) = R+ • (aUx> -
= R+ • (tp + aUx*)-
Exercise 1.10 Let y £ intCR(aT) and V G Jix be such that y+V C 3?(x). Because 2/o G (Im^R)*, there exists A > 0 such that y\ := (1 + A)yo — Ay £ ImK. It follows that »/o =_(1-A)y + Ayi, where A := (1 + A ) - 1 £]0,1[. Let xi G 5l~1(yi) and take u := (1 — X)x+Xxi. Consider /j, G]0,1] and (a;M,2/M) := (1—fj,)(xo,yo)+fJ-(xi,yi) 6 g r X Then y0 = l^y + (1 - 7/i)s/M. for every v £ V we have (ZM>S/O + 7M«) = 7 M ( 2 T . 2 / +
W
where
_ zM := J^x
_
,,
+ (1 -
M +x_*x
e
^0' ^
) + (1-7M)(:C/'.2/M) ^ S r K
where ;
7^ : =
N 7M)XM
that
VpG]0,l],
_
A — LtX
= —-=
Jt follows
U
=a:o + - — =
fl + A — fl\
=u.
(J, + A — (J,A
Hence yo + J^V C R(a:'M), and so yo £ int9l(a;^). Since every element of the •£_ ^ with fi £]0,1], we obtain that yo € interval ]0,1] can be written writtej as int3?(a;) for every x G]:EO,M]. Exercise 1.11 Suppose that domC is dense and x\,X2 G C*(y*); this means (see page 26) that {x,x{) = (y,y*) — (x,x%) for every (x,y) £ grC (because grC is a linear subspace), and so (x,x\ — x^) = 0 for every x G domC. As domC is dense, it follows that x\ = x%. Suppose now that domC is not dense and take x G X \ cl(domC). By Theorem 1.1.5, there exists x* € X* such that (x, x*} < 0 = {x,x*} for every x € domC (because domC is a linear subspace). So, (x*,0) G (grC) + , whence 0,x* G C*(0). Because x* ^ 0, the statement is proved. Applying the preceding statement for C and G* we get the last assertion. E x e r c i s e 1.12 Let (xn) C X be a Cauchy sequence. Since \d(x„, x) — d(xm, x)\ < d(xn,xm) for all n, m G N and x € X, we have that the sequence (d(xn,x)) C R is Cauchy, and so it is convergent. Let / : X —} R+ be defined by f(x) := 2\imd(xn,x). From the inequality d(xn,x) < d(xn,x') +d(x',x) for n G N,
303
Exercises - Solutions
x, x' 6 X, we obtain that / is Lipschitz with Lipschitz constant 2. Moreover, we have that (/(x„)) —> 0. By hypothesis, there exists x 6 X such that VxeX
: /(x) < / ( x ) + d ( x , x ) .
Taking x = x n , we get / ( x ) < f(x„)+d(x„,x) for every n € N. Taking the limit, we get / ( x ) < | / ( x ) , and so / ( x ) = 0. This shows that (x„) —> x. E x e r c i s e 1.13 Applying Ekeland's variational principle for xo € d o m / and e = 1, there exists i i g I such that / ( x i ) + d(xo,xi) < /(xo) and / ( x i ) < / ( x ) + d(x,xi) for each x € X \ {xi}. Suppose that xi ^ argmin/. Then there exists x e X \ {xi} such that / ( x ) + d(xi,x) < / ( x i ) < f(x) + d(x, x i ) , whence the contradiction 0 < 0. Therefore xi £ argmin/, and so d(xo, a r g m i n / ) < d(xo,xi) < /(xo) — / ( x i ) = /(xo) — inf/. As this inequality is obvious for xo ^ dom / , the conclusion holds. E x e r c i s e 1.14 As in the proof of Exercise 1.13, there exists xi € X such that / ( x i ) < / ( x ) + d(x,xi) for each x € X \ {xi}. From the hypothesis there exists X2 £ 3l(xi) such that d(x2,xi) + f{xi) < / ( x i ) . If xi / X2 we get the contradiction d(x2,xi) + /(X2) < /(X2) + d(xi,X2). Therefore X2 = xi € 3l(xi). Exercise 1.15 lim
(i) Indeed,
/ ( x ) = c x 3 # V A e R , 3 p > 0, Vx € X, \\x\\ > p : f(x) > A
||x||->oo
»VAeR, 3p>0
: [ / < A] C D(0,p).
(ii) Let A > inf i e R" f{x). By hypothesis [/ < A] is a closed. By (i) [/ < A] is also bounded, and so it is compact. Applying the well-known Weierstrass' theorem for / | [ / < A ] we get the desired conclusion. E x e r c i s e 2.1 Let ti,t2 €]0,oo[ and A €]0,1[. Consider t := Ati + (1 - \)t2 > 0. Because ^ - ( x + t^lu) + (1 ~^)* 2 ( z + t^u) = x + \u, the convexity of / gives
f{x+
i«) < ^f(x+t^u)
+ il=^a/(x +
t^u),
whence ip (Aii + (1 — A)*2) < A'0(ti) + (1 — A)V>(*2). Hence ?/> is convex. E x e r c i s e 2.2 (a) Assume that the conclusion does not hold. Then there exist xi,X2 £ / such that xi < x 2 and / ( x i ) > fixi)- Let A := {x 6 [xi.xa] | / ( x i ) < / ( x ) } and take x := sup A Of course, x < x 2 and, from the hypothesis, x > x\. lix $ A then / ( x ) < / ( x i ) . But there exists e > 0 such that f\[x-i,x] is nondecreasing, and so f(x) < / ( x ) < / ( x i ) for all x G [x —e,x], contradicting the choice of x. Therefore x € A, and so x < X2. Using again the hypothesis, there exists 0 < £ < X2 — x such that f\[x,x+e] is nondecreasing. Because / ( x i ) < / ( x )
304
Exercises - Solutions
we obtain that f(xi) < f(x + e), and so x + s G A, contradicting again the choice of x. Hence / is nondecreasing on / . (b) 1) Consider fj, := (g(b) — g(a))/(b — a) and h(x) := g(x) — fix. Then h is continuous and there exists h'+ = g'+ — fi. There exists c G [a, b] such that h(c) < h(x) for every x G [a,b]. Because h(a) = h(b) we can take c G [a,b[. It follows that /+(c) = h'+(c) — fi > 0, whence the conclusion. For 2) take c € ]a, 6]. (c) It is sufficient to consider only the case X = K. So A C R is an open interval and for every a G A there exists e > 0 such that the restriction of / to ]a — e,a + s[C A is convex. It follows that there exists /+(6) = limZ4.(, (f(x) — f(b))/(x - b) G R and fL(b) = l i m , t t (/(*) - /(&))/(* - 6) < /+(&) for every 6 G]a — e, a + e[. So the functions f'_, f'+ : A —• R are locally nondecreasing. From (a) we obtain that f'_ and f'+ are nondecreasing on A. Because / is locally convex, it is continuous. Let now a, b G A, a < b, and A g]0,1[. Consider c :— (1 — \)a + Xb. Using (b) we obtain x' G [a,c[ and x" € ]c, 6] such that / ( a ) - /(c) > f'+(x')(a - c) > fL (c)(o - c) and /(ft) - /(c) > fL (x")(b — c)>fL (c)(b — c). Multiplying the first relation with A and the second one with 1 — A, then adding them we obtain that /(c) < (1 — A)/(a) + A/(6). Therefore / is convex. Exercise 2.3 It is obvious that the implications "=*•" are true. Suppose that fx is use at 0 for every x € Rn and let R 9 A > f(x) = /x(0). Since fx is use at 0, it follows immediately that x £ [f < X]\ The function / being quasi-convex, [/ < -M is convex; since dimR" < oo, we obtain that x € int([/ < A]). Therefore / is use at x. Suppose now that fx is lsc at 0 for every x 6 R n . Suppose, by way of contradiction, that / is not lsc at x. Then there exists R 3 A < f(x) such that x G cl[/ < A]. Therefore [/ < A] ^ 0. Let a G ' [ / < A] ( / 0 by Exercise 1.2). Using Exercise 1.2 we obtain that ] a , x [ c l [ / < A] C [/ < A], contradicting the fact that the function fa-m is lsc at 0. Therefore / is lsc at x. If X is an infinite dimensional normed space, there exists a linear functional ip : X —>• R which is not continuous. It is obvious that ip is quasi-convex (in fact is convex!), but
kXk.
It is clear that ip is well defined and linear. Moreover sup{\
Exercises - Solutions
305
E x e r c i s e 2.4 Let x G X be such that / ( x ) < A and take x G X. If / ( x + x) < A we have that x + tx G [/ < A] for t G [0,1[, while if / ( x + x) > A we have that x + tx G [ / < A] for *G [0,i[, where * : = /( _ A | _~^^ ( . Therefore x G [/ < A]'. Let us prove now the converse inclusion. Since A > inf / , there exists xo € X such that /(xo) < A. Let x G [/ < A]1. There exists t > 0 such that x := x + i"(x — xo) 6 [/ < A]. So we obtain that x = JT|W + yrj^o a n ( ^ m
< m m
+ i ^ / ( * o ) < T ^ A + ^ A = A,
which proves the converse inclusion. If / is continuous then int[/ < A] D {x G X | / ( x ) < A} # 0, and so, since [/ < A] is a convex set, int[/ < A] = [/ < A]'. E x e r c i s e 2.5 (a) Assume that / is lsc and t G ] inf / , oo[. Since [/ < t] is closed, the inclusion [/ < t] D cl[/ < t] is obvious. Let x G [/ < i\. Take xo 6 [/ < t]. Then x* := (1-A)x 0 +Ax G [/ < t] for A G [0,1[, whence x = limx-fi x\ G cl[/ < t). Assume now that [/ < t] = cl[/ < t] for t G ] inf / , oo[. If t0 := inf / G K, we have that [/ < to] ~ f]t>t [f < i\. It follows that [/ < t] is closed for every ( e R . Hence / is lower semicontinuous. (b) Assume that / is continuous on dom / and t G ] inf / , oo[. Let x G [/ < £]; since / is continuous at x, [/ < t] is a neighborhood of x, which shows that [/ < <] is a neighborhood of x, too. Hence x G int[/ < t]. It follows that [/ < t] C int[/ < t\. Let now x G int[/ < t] and take x 0 € [/ < t]. As the mapping A H H J := (1 - A)x0 + Ax is continuous at 1 G R, there exists A > 1 such that xx € [/ < t]. It follows that x = jX\ + ^j^xo G [/ < t]. Hence [ / < * ] = int[/ < t]. Assume now that [f < t] = int[/ < t] for every t G]inf/, oo[. Consider x G d o m / and take t > f{x). Hence x G int[/ < t], and so / is bounded above on a neighborhood of x. Therefore, by Theorem 2.2.9, / is continuous at x. (c) Assume that / is continuous on X and t e ] i n f / , oo[. Since [/ < t] is closed in our case, from (b) we obtain that b d [ / < t] — (cl[/ < £]) \ (int[/ < £]) = [f
Exercises - Solutions
306
tf'(a,ei) for any t € R Let i £ R n ; then x = xiei + .. . + x„e„ with x\,... ,xn R By the sublinearity of / ' ( a , - ) it follows that f'(a,x) < f'(a,xiei) + ••• f'(a,x„e„) = xif'(a,ei) + --- + xnf'(a,en), and f'(a,—x) < f'(a,—xiei) • •• + f'(a,-xne„) = -xif'(a,ei) - ••• - xnf'(a,e„). Therefore 0 < f'(a,x) f'(a, —x) < 0, and so f'(a,tx) = tf'(a,x) for every t € R It follows that / ' ( a , is linear, and so / is Gateaux differentiate.
S + + + •)
E x e r c i s e 2.7 It is obvious that
' m _ / p ( 2 - ( i - * ) p - 1 - ( i + *) p - 1 )
V w
?
-
1
\ p (2 + (t - I ) ' " - (t + l)
p_1
)
if
*e[o,i[,
if t € ]1, oo[,
,„« m _ / P(P - 1) ((1 - <)P"2 - (1 + *) P_2 )
if * 6 [0,1[,
^ -\
if te]i,oo[.
(t)
2
2
pip-i)((t-ir -(t+ir j
Because limt-fi <£>p(£) = p(2 — 2P~1) = lim^i y p (£), <£ is derivable on [0, oo[ and
df
°Mx,y) =P(iP(x,y)f-1 -j4== OX
y/xi 1
y-(x,y)=P(
and so
~
4- y2 X
y/xz + y2
+ {
+ („(*,„))*
^
y/xz + y2
307
Exercises - Solutions Hence
d2f(x, y)(h, k) = (V(x, y)?-> (Pi? - 1) + {
{yh X
- pL-2
2
(x + y2)yjx2
+ y2
>0
for all (x,y) G D and (/i, &) G R 2 . Using Theorem 2.1.11 we obtain that f\o is convex, and so the function g : R2 —> R defined by g(x,y) := f(x,y) for (x,y) 6 £*, g(x,y) := oo otherwise, is convex. It is obvious that for any a > 0, l i m i n f ^ j , ) . ^ , , ) ^ , ^ ) = ( f ) ' ? a = / ( a , 0 ) and liminf(X>y)_>(o,o)S(a;,y) = 0 = /(
v t G R :
It follows that
Let i , « £ X be fixed elements. Replacing t by x(t) and r by u(t), t G [0,1] in the above inequality, then integrating on [0,1], we obtain that
f{x +
u)-f{x)-.fo1;^dt\
This inequality proves that / is Prechet differentiable at x and that V / ( x ) has the announced expression. Let us prove that V / is Prechet differentiable. For all t, r G R there exists 0€]O,1[ such that ¥>'(« + r ) -
9r)r2.
From the expression of tp'" given above we have that |y?'"(£)| < 3 for every t e R. Therefore V t , r G R : |^'(* + r ) - V ( t ) - y " ( * ) r | < § r 2 .
Exercises - Solutions
308
Denoting by B(x) the bilinear functional defined by B(x) : X x X -> R, W
B(x)(u,v):= v A
UV
f
.
2
Jo (l+x )^/^TP
dt, ^
and using the preceding relation we obtain, as above, that Vx,u,v£X
< I f> u2\v\dt < l\\u\\2\\v\\,
: \(Vf(x + u)-Vf(x)-B(x)(u,-))(v)\
and so Vx,u€X : \\Vf(x + u)-Vf(x)-B(x)(u,-)\\ < |||w|| 2 . Therefore V / is Prechet differentiable on X and V 2 / ( x ) = B(x) for every x e X. PYom the expression of V 2 / ( x ) and the expression of tp'" we obtain easily that V 2 / is Lipschitz with Lipschitz constant 3. Therefore V 2 / is continuous on X. E x e r c i s e 2.10 l
Consider X := L 1 (0,1); X is a Banach space for the norm:
Nl •= !0 \u{t)\dtioxuex. Using the function tp defined in the solution of the preceding exercise, taking also into account that tp is convex, / is convex. Moreover V £ , r e R : -\T\ < tp'(t)r < tp(t + T) -
+ (X + Tnu)2
9n •=
Tn
- V I + X2
XU ,
9 :
sJ\+X2
It is obvious that lim^-nx) gn = g a.e.; from the above inequality we obtain that |<7n| < |w| a.e. for every n € N. Using Lebesgue's theorem (see [Precupanu (1976)]) we obtain that linin-Hx, f0 gn dt = J0 g dt. This proves that / is Gateaux differentiable at x, and the expression of V / ( x ) is that of the statement. Let x € X be arbitrary; we show that / it is not Prechet differentiable at x. We may assume that x is finite a.e. Of course, [0,1] = \JmeNAm, Since limm^xm.eas(Am)
where Am ~ {t € [0,1] | \x{t)\ < m } . = 1, there exists mo G N such that meas(Amo)
> |.
Exercises - Solutions
309
But yj\ + (t + r ) 2 - yjl + t2 -
tr
Vi + t* 1-
2
y/i + (t + r) + N/TTF ^
2i 2 + tr N /TTF( 1 /i
+ (t + r)2 + VT+t*y
for T = n and |t| < mo we have that
y/l + (t + r ) 2 + v T T t 2 ~ 2-y/l + (n + mo) 2 2
2i + r t
mo
v / T T t 2 ( N / l + (t + r ) 2 + v / T + 1 2 ) 2
2
'
n + 2m 0
Vl+m2
V 1 + (n ~ ™o)2
mo
%
V 1 + ™o '
the limits being taken for n —• oo. Therefore there exists no > mo and £o > 0 such that VteAm0,
Vn>n0
tn
: y / l + (t + n ) 2 - y / l 4-1 2 -
> e 0 n.
vl +* For every n > no there exists a measurable set Bn such that Bn C A m „ and meas(Bn) —ri~2.Let us consider the functions rnl]
,„
Un:[0,l]->R,
f n if t € B„, « „ : = | Q .f f £ ^ ^
^
Then ||u n || = — —> 0. From the above inequality we have that / ( Vl + (a + M n ) 2 - \ / l + a 2 - ^L-]dt>e0[ Jo \ Vl+a; /
-/o
undt = £o\\un\\,
which shows that / it is not Frechet differentiate at x. E x e r c i s e 2.11 The function / : R - > H , / ( t ) := expt is strictly convex. Consequently, for a,b € P, o p ^ 6" and p,q£]1,oo[, ^ + J = 1 we have that ab = / ( ± l n a * + > & * ) < ± / ( l n a ' ) + i/(ln6«) = \c? + \b«. Therefore the statement is true. It is obvious for a = 0 or 6 = 0. The function g : P ->• R, g(t) := - l n t is strictly convex. Let n G N \ {1}, a i , . . . ,an €]0,1[ with a i + . . . + an = 1 and x i , . . . ,x„ € P, not all equal. Then - ln(aixi H
1- anx„) = g (cnxi H < aip(xi) H
1- a „ x n ) l - a „ j ( i j = - I n (x" 1 •• - x " n ) ,
310
Exercises - Solutions
and so aixi H
1- a„xn > x"1 • • • x°n.
It is obvious that the above inequality is still valid when xi,. •., xn € [0, oo[ are not all equal. The final statement is now obvious. Exercise 2.12 The implication => is obvious. Assume that f + x* is quasiconvex for every x* € X* and consider x, y € dom / with x / y and A € ]0,1[; set z :— (1 — \)x + \y. There exists x* € X* such that (x — y,x*) = f(y) — f(x). Since f + x* is quasi-convex, we have that f(z) + (z,x*) < max {/(a;) + (x,x*),f(y)
+ (y,x*)}
= (1 - A) (/(*) + (*, x')) + A (/(») + (y, **» , whence / ( ( l - \)x + Xy) < (1 - A)/(x) +
\f(y).
E x e r c i s e 2.13 Note that d2f . . ^-'""^ Z&(*) oxi
df i \ * , — (a;) = —f(x), OXi to-^/M. Xi
a i ( a i - 1)L —- ^r^f(x), —
=
while for i ^ j , d2f
, . _ otiOtj
(*) = r^/to-
The function / is concave (i.e. —/ is convex) if and only if V2f(x) defined for every x G P", i.e. ai{ai 1] 2 Vu = ( t t l l . . . ,u„) 6 R n : r 2- u ^-'i=i a;?
is nonnegatively
+ 2V ^ ^ -^-^i
< o, _
or, equivalently,
V«eR" : (Y^_ a; Mi) < 5^"_ QiU«2Taking an+i = 1 — ^ " = 1 O i and un+i = 0, the preceding inequality follows from the ( strict) convexity of the function t —> t2 on R; the inequality is strict if ^27=ia' < 1 anc ^ u / 0) a n d so / is strictly concave in this case. Exercise 2.14 We have that df/dxi
— = -(expxi)(expxj) (72/ j «C # 7j
= (expa:;) ( S * = i
(V" \*—'« = !
expa;*) /
ex a;
P fc)
>
an<
^
so
Vi, j 6 l , n , i ^ i-
Exercises - Solutions So, for all x,ueRn
311
we have that V 2 /(x)(w, u) equals
(EL*"*)"2 ((ELi»0 (EI=1w(~*)a) - (EI =1 ^^) 2 ) where ?/& = expx* for k € IT"- Applying the well-known Cauchy-Schwarz inequality ( E L i a * 6 * ) 2 ^ ( E L i a i ) ( E L i b f c ) f o r
a
* = -Jy*
a n d 6fc
=
Vvku^
we obtain that V 2 / ( x ) ( u , it) > 0. Hence / is convex. E x e r c i s e 2.15 One obtains easily that
§«,*) = f/(.,*), ^(,,I)„-±/(M), fM = -tt ' g « , * ) = ^ / M , 0«,*>=iy/e..>, ^ f t . ) while for i ^ j , 32f
1
CfX% OXj
X%Xj
Since / ( t , x) > 0 for every (t, x) 6 P x P", the fact that / is strictly convex (resp. convex) is equivalent to the fact that the quadratic form
is positively (resp. nonnegatively) defined, or, equivalently, that the quadratic form ?-^s2 p
+ Y*n
2u2i - 2 V "
*-~ii=\
is positively (resp. nonnegatively) defined. quadratic form is / 2=1
/
A =
P
-1
- 1 2 -1 1
V -1
UU
sm + 2 V
*-~ii=\
' J
*-^l
The matrix associated to the last
-1
_1
1 2
1 1 2
\
/
To study the positiveness of the matrix associated to this quadratic form, we have to determine its eigenvalues. Let 6„ := det(At„+i —A). After some computations we obtain that
Exercises - Solutions
312
p
x _
P
P
P
1 1
A-l 0
0 A-l
••• •••
0 0
1
0
0
•••
A-l
Developing with respect to the last row we obtain that 5n = (A - l)6n-i
- (A - l ) " " 1 (A + 1/p),
and so 5n = (A - I ) " " 1 * ! - (n - 1)(A - l ) " - 1 (A + 1/p) = (A - I ) " " 1 [A2 - (n + 2 - 1/p) A + 1 - (n + l)/p] . Let Ai, A2 be the roots of the polynomial between the square brackets. Because the other roots of the characteristic polynomial are all 1, the quadratic form under consideration is positively (resp. nonnegatively) defined if and only if Ai and A2 are positive (resp. nonnegative). For p < 0 we have that Ai + A2 = n + 2 - 1/p > 0,
A1A2 = 1 - (n + l ) / p > 0,
and so Ai, A2 > 0. When p > 0 the roots Ai, A2 are positive (resp. nonnegative) if and only ifp > n + 1 (resp. p > n + 1 ) . Therefore / is strictly convex if and only if p € ] —oo,0[U]n+l,oo[ and is convex if and only ifp 6 ] —oo,0[U[n+l, oo[. Since g{x) = f((x\c),x) and the function x *-¥ (x\c) is linear, it follows immediately that g is (strictly) convex for (p > n + 1) p > n + 1. Since h(x) = / ( l , x) (for p = n + 2), we obtain immediately that h is strictly convex. E x e r c i s e 2.16 We give only the proof for ip'+7 the proof for ip'_ being similax. The function i/> is increasing, while ip'+ is nondecreasing and 0 < ip'(t) < 00 for t > 0 (if V'(*o) = 0 for some t0 > 0 then ip'(t) = 0 for t S [0, to], and so ip{t) = ip{G) = 0 for t € [0, to])- It follows that the mapping P 3 t H 1/ (ip'+ o i/>-1) is positive and nonincreasing on P. Let us fix a > 0 and take j3 := ip(a) > 0. From what was said above, the integral jff 1 J*JX •. exists in the sense of Riemann for all 7 6 ]0, f)[. Considering it as a Riemann-Stieltjes integral, we have that r0
fit
ri>~1(0)
1
[a
l
Exercises - Solutions
313
where S := ip 1(-y). Let S = so < «i < . . . < s„ = a be a division A of the interval [5,a] and take at := s;_i for i G l , n . Then ip'(o-i) G dip(si-i), and so
S(A, (ffi)) = ^" =i ^ - y (V>(«i) - tf(*-l)) > E L ( S i - S -!) = Q - 6> because ip(si) — ip(si-i) > ip'(si-i) • (si — Sj_i) for all i 6 l , n . Taking now
F(x,y):=\ v 'y;
'
M
\ +cx>
* if
1 * = »• Tx ^ y.
It is obvious that F is convex and / is the marginal function associated to F. Therefore, / is convex. Furthermore, for A > 0 and y 6 Y we have that /(Ay) = inf{||:r|| | Tx = Xy} = inf { A • H A " 1 ^ | T (A" 1 *) = y} = A i n f { | | u | | | T u = y} = A/(y). If T is open, there exists p > 0 such that pf/y C T(Ux)VyGpf/F, 3 x G f / x
Therefore
: Tx = y,
and so /(y) < 1 for every y 6 pUy; this implies that d o m / = Y and / is continuous. Exercise 2.18 By Exercise 1.15 there exists x G R* such that fix) < f(x) for every x €Rk. We assume, without loss of generality, that x = 0 and f(x) = 0. Let (x,£) G co(epi/). Then there exists a sequence ((xn,tn)) C co(epi/) converging to (x,t). By the Caratheodory theorem (Exercise 1.1), for every n € N there exist Aj, 6 [0,1], ( x ^ f j 6 e p i / for t G l,k + 2 such that E * ^ 2 AB„ = 1 a n d (x„,t„) = S,fc=i2 K(xi,ti). We may assume that for every i G l,k + 2: (A*n) -> Ai € [0,1], ( O -»• t« 6 [0, oo], ( A X ) _-* V G [0, co], (Aj, ||x l n ||) -> if G [0, oo] and (|| x n||) —• /*' G [0,oo]; moreover, if p' < oo, we may assume that {xln) —>• xl G Rk. Of course, £ * + 2 *' = 1- F i r s * of all, because (E?=i 2 AJ,*™) -> t = £ * = 2 7% we have that 7 s 6 [0, oo[ for every i G 1, A; + 2. Assume that / / = 00. By our hypothesis we have that (||xn|| 1
1
t'n)
—
* °°>
which implies that t — 00, A = 0 and 77* = 0. Consider the sets / := (i 6 l,fc + 2 I JJ* > 0} and J := l,fc + 2 \ I. Hence (xj,) -> x i € Rk \ {0}, A; > 0, and so tl < 00, for i € I; moreover, (x*,i l ) £ e p i / for i G / , because /
314
Exercises - Solutions
is lsc. Let 7 := E ; e j 7* when J is nonempty and A := E ; e j ^' when I is nonempty. It follows that (x,t) = (0,7) if 7 = 0, (x,t) = J2iei *•*(*',**) if J = 0 and (x,t) = J2i€I \'(x',t') + (0,7) if I, J are nonempty. It is obvious that (x,t) £ co(epi/) if 7 or J is empty. If both of them are nonempty then ( M ) = E i g / * ' ( * ' . *') + (1 - A) (0, (1 - A) _ 1 7) 6 co(epi/) if A ± 1 and (x,t) = E i e / V ( a ; % * i + 7 ) 6 c o ( e p i / ) i f A = l. E x e r c i s e 2.19 If d o m / (~1 dom3 = 0 then ( / + 7 3 ) (x) = 00 for all 7 > 0 and x € X, and the equalities are obvious. So assume that d o m / PI doing ^ 0. Let us begin with the first equality. Since co(/ + ag) 4- fig < f + ag + fig = f + (a + fi)g we have that co (co(/ + ag) + fig) < co ( / + (a + fi)g). In order to show the converse inequality it is sufficient to show that co ( / + (a + fi)g) < co(/ + ctg) + fig, or equivalently, epi (co(/ + ag) + fig) C co (epi ( / + (a + fi)g)). So let (x,£) be in the first set; there exist ti,<2 £ R such that t = t\ + fit2 and {x,ti) £ co(epi(/ + ag)) and (£,£2) 6 epig. Therefore (x,ti) = lim; 6 /(x', x and / £ T(X), we have that / ( * ) < limmf / ( E J I ^ ^ O ^ ^ E j l x ^ / ^ )
^
" ^ E j l x ^ '
and so l i m i e / (/(x) + 7* - E J i i A j s j ) = 0 and ( x , / ( x ) + 7*) 6 e p i / for i € 7, where 7° := max ( 0 , E ? i i ^)s'j:~ f(x))- Because [x), s) + (a + fi)r)) belongs to epi ( / + (a + fi)g) for all i € 7, 1 < j < m, and (x, / ( x ) + -y{ + (a + fi)t2) £ epi ( / + (a + fi)g), we have that
£ j l i ^ 8 A> (*' > *' + ( Q + ^ ) + ^
t +
£ /(a:)+7i
=(^' 0 ^( '
(*< '<*> + V + (« + flfc)
-^^^
+(a+/3)i2
)
£ c o ( e p i ( / + (a + /3)g)). Taking the limit we obtain that (x, t\ + fit2) — (x, t) £ co (epi ( / + (« + fi)g))Assume that h := co ( / + (a + /?)g) is proper; then, by Theorem 2.3.4, we have that ( / + (a + /?))** = h. Because h is proper, from the first part we have co (co(/ + ag) + fig) proper, too, and so co(/ + ag) is proper; in the contrary case, by Proposition 2.2.5, co(/ + ag)(x) = —00 for all x € d o m c o ( / + ag) D d o m / fl domg, and so co(/ + ag)(x) + fig(x) = —00 for x £ d o m / PI domg. Therefore (f+ag)** = c o ( / + a g ) and so ( ( / + ag)** + fig)** = (/ + (a + fi)g)**. Hence ( ( / + ag)** + fig)* = ( / + (a + fi)g)*'• This relation is obvious when h is not proper. E x e r c i s e 2.20 Assume that C is bounded and A + C C B + C. Then SA+C <
Exercises - Solutions
315
SB+C, and so SA + sc < SB + sc- Because C is bounded, as noted on page 79, sc(x*) € R for every x* G X*. Prom the preceding inequality we get SA < SBUsing Theorem 2.4.14(vi) we obtain that A C coB. E x e r c i s e 2.21 (i) => (ii) is obvious: / ( x ) < /(0) + L \\x\\. (ii) =*• (iii) Let u € X. Then foo(u) = lim^^o t " 1 (/(t«) - /(0)) < lim^oo r
1
(L \\tu\\ + a - /(0)) = L ||u||.
(iii) =>• (i) Let first x G dom / and take y € X. Then / ( y ) - fix) < s u P t > 0 r
l
(/(x + t(y - x)) - f{x)) =fooiy-x)
\\x - y\\.
This shows that d o m / = X, and so the above inequality holds for all x,y G X. Changing a; and y we obtain that / is L-Lipschitz. Exercise 2.22 Let u G X be fixed and consider 7 := sup{/(x + u) — f(x) \ x e d o m / } . The inequality /<x,(u) > 7 follows immediately from Eq. (2.28) (see page 74). Hence the conclusion holds if 7 = 00. Assume that 7 < 00. Fix xo 6 d o m / ; then xo + nu 6 d o m / for every n £ M By Eq. (2.28) we have that f{xo + ku) — f(xo + (fc — l)u) < 7 for every k G N. Summing for k from 1 t o n we get f(xo + nu) — /(xo) < nj. Dividing by n and letting n —¥ 00, we obtain that /oo(u) < 7Exercise 2.23 Let x0 € [/ < A] be fixed. Take first u € [f < A]oo- Then fixo+tu) < A for every t > 0, and so /<x>(w) = limj.+oo t~1if(xo+tu)—/(xo)) < 0. Hence [/ < A] 00 C [/» < 0]. Let now u £ [/oo < 0]. Then /(xo + 1 « ) — /(xo) < tfooiu) < 0 for every t > 0, and so xo+tu € [/ < A] for t > 0. Hence w € [/ < A]cx). Let x* € d€f(x) be fixed and consider u* G idefix))^. Then x* + tu* £ defix) for every t > 0. It follows that (x' — x,x* + tu*) < f(x') — f(x) + e for all x' G dom / and t > 0. Passing to the limit for t —• 00 we get (x' — x, w*) < 0 for every x' G d o m / , and so u* G iV(dom/;x). Conversely, if u* G N ( d o m / ; x ) then (x' — x,u*) < 0 for every x' G d o m / ; this together with (x' — x,x*) < f(x>) ~ fix) + £ f ° r x> e d o m / gives (x' — x,x* + tu*) < f(x') — / ( x ) + e for all x' G d o m / and t > 0. Therefore x* + <M* G 9 £ / ( X ) for £ > 0, and so
Fix (x*,A) G e p i / * and let (u*,7) G ( e p i / * ) ^ . Then (x* +tu*,\ + t^) G e p i / * , whence (x,x* + tu*) — / ( x ) < A + tj for all x G d o m / and t > 0. Taking the limit for t —• 00 we obtain that (x,u*) < 7 for every x G d o m / ; hence «dom/(w*) < 7, i-e. (w*,7) G episaom/- Conversely, if Sdom/(w*) < 7 then (x,u*) < 7 for every x G d o m / ; this together with (x,x*) — f(x) < A for x G d o m / gives (x,x* + tu*) — / ( x ) < A + t 7 for all x G d o m / and t > 0. Therefore /*(x* + t u * ) < A + t-y for t > 0, and so (u*,7) G (epi/*)oo- Therefore (epi/*)oo = episdom/, which shows that (/*) The formula /oo = Sdom/* follows applying the preceding one for / * .
Exercises - Solutions
316
Let D := P r y ( d o m F ) and take v* £ Y* such by what precedes, SdomF(0,w*) < 0, and so (y,v*) Hence (j/,w*) < 0 for every y € D, i.e. v* € D~. (y>v*) < 0 f° r every y 6 D, and so SdOmF(0,v*) <
that (F*) o o (0,i'*) < 0; then, < 0 for every (x,y) € d o m F . Conversely, if v* € D~ then 0.
Exercise 2.24 (i) Since f<x, is a lsc function, it is clear that if is a closed convex cone and so Xo is a closed linear subspace of X. Let u £ l o and x £ d o m / . Taking into account Eq. (2.27), we have that f(x + u)-
f(x) < /«,(«) < 0,
f(x -u)-
f(x) < foci-u)
< 0.
< /«,(«) + foc(-u)
< 0,
Adding these two relations side by side, we get 0 < 2 ( ± / ( x + u) + \f{x
-u)-
f(x))
and so f(x + u) — f(x) = /oo(w) = 0. Let now x £ d o m / and u G XQ. If x + u S d o m / , by what precedes we obtain that x — (x + u) + {—u) S d o m / ; hence x + u £ dom / and so f(x + u) = f(x). (ii) Note first that d o m / * C XQ", and so lin(dom/*) C XQ". Indeed, let x* G d o m / * ; taking xo £ d o m / and u £ Xo, we have (xo + tu,x*) < /(xo + tu) + f(x")
= /(xo) + f*(x*)
Vt € R,
whence (u, x*) = 0. Hence x* 6 XQ". To complete the proof it is sufficient to show that (dom/*) C Xo (<=> X5 1 C (dom/*)"1"1" = lin(dom/*)). So, let u € (dom/*)"1"; then Sdom/*(w) = 0, and so, by the preceding exercise, foo(u) = 0. Therefore (dom/*)" 1 C K, whence (dom/*)J"CXo. (iii) As Xo is a closed linear subspace of X , the quotient space X/Xo is a separated locally convex space, too (X/Xo being endowed with the quotient topology). Moreover, its topological dual is XQ", and its weak* topology is the trace of w* on XQ". When 1 6 I , its class x is x + Xo. From (i) we obtain that / is well defined. Moreover, e p i / = {(x,t) \ (x,t) € e p i / } = 7r(epi/), where TT is the canonical projection of X x R on (X x R)/(Xo x {0}). Hence e p i / is a convex set. Furthermore, because e p i / + Xo x {0} = e p i / , e p i / is also closed. Hence / is a proper lsc convex function. The relation /00(G) = /oo(w) follows immediately from (i) and Eq. (2.27). Using (i) one obtains easily the other two relations. Exercise 2.25 Replacing eventually / by e _ 1 /> w e m a y t a k e e = 1. Because /oo(w) < 0, the mapping R 9 t i - » f(x + tu) € R is nonincreasing for every x € X and d o m / + R+u = d o m / , as seen in Remark 2.2.2. On the other hand, since limt_>oo * _ 1 (f(x — tu) — f(x)) = foo(-u) > 0 for x £ d o m / , for every such x
Exercises - Solutions
317
/ ( x - tu) > f{x) + £7 Vt>tx,
(°)
there exists tx > 0 such that
where 7 := §/<x>(—«) > 0. Consider g : X —• K defined by 3(3;) := exp(— t) for a; = tu with £ > 0 and g{x) := 00 elsewhere. Consider f\ := /•• Hence d o m / i = d o m / + domg = d o m / + R+u = d o m / . Let x € d o m / ; then there exists £1 > 0 such that / i ( x ) = / ( x - ?,«) + exp(-t„) < f(x) + fl(0) = / ( x ) + 1;
(°°)
this fact is obvious because, by (°), limt-><x> (f(x — tu) + exp(—t)) = 00. Moreover, because f(x) < f(x — tu) for every t > 0, we have that / i ( x ) = inf{/(x - tu) + e x p ( - t ) | t > 0} > inf{/(x -tu)\t>0}
= f(x).
Therefore f < fi < f + 1. Let us show that f\ is lsc. Consider (xi)nsi —> x € X with {fi(xi))ieI —> fi< 00. We may assume that Xi € d o m / i = d o m / for every i 6 7. We must show that / ( x ) < fi. By (°°) we have that fi(xi) = f(xi — Uu) + exp(—U), where U :— tXi > 0. Suppose that (£») —> cx>. Then for every t > 0 there exists it € I such that £i > £ for i X it. Hence /(x< — £;u) + exp(—U) > f(xi — tu), for i y it, and so fi > f(x — tu) for every t > 0. This contradicts (°). Therefore (£i)ig/ has a subnet (t'j)j^j converging to t € R+; hence fi > f (x — tu) + e exp(—£) > / i ( x ) because / is lsc at x — tu. It follows that /1 is lsc. Since inf f\ = inf / + inf g — inf / and / i ( x ) = / ( x — txu) +exp(—t x ) > f(x — txu) > inf / = inf/1 for every x G d o m / i , we have that argmin/i = 0. Exercise 2.26 As proved in Proposition 2.7.2, condition (ii) holds if one of conditions (iii)-(viii) is verified. As noted before this proposition, condition (ii) also holds for F, where F(x,y) := F(x,y) - (x,x*) with x* e X*. Taking ip :— F(-,0), using Theorem 2.7.1, we have *>*(x*) = sup«x,x*> -F(x,0)) xex = min y'&Y*
= - xinf F(x,0) = min €x y*ev
F'(0,y')
F*(x*,y*).
Therefore g is u;*-lsc and the values of g are attained. It is obvious that tdomF satisfies condition (ii) of Theorem 2.7.1, too. So, using once again this theorem, we have, as above, that SdomV(u*)
— SUp ( ( X , X * ) - tdom v(x)) = SUp ( ( x , X * ) - tdom F {x, 0 ) ) x£X x€X =
min (tdomF)*(U*,V*) =
min
SdomF(u*,v*)
Exercises - Solutions
318
for every u* G X*. From Exercise 2.23 we have that Sdom^ = (
=
SU
P {{x^x*)-9*{x*)))=mi[f(x)+
x*£-X*
, i n t , [5*(a:*)-sup ((x,x*)
mf
x^Ji.
- f(x))]
=
(g*(x*) -
(x,x*))]
x tA
inf
(g*(x*) -
f*(x*)).
Exercise 2.28 Let x G dom / be fixed and consider y>: P ->• R,
Exercises - Solutions
319
preceding inequality we get, as above, (x — x,x*) < t — co/(57). It follows that (x — x,x*)
XiXi + A io x -x,x")
whence (a; — x~i0, x*) < f{x)—f(xi0)
< y*
~\if(x~i) +
\i0f(x)-cof(x),
because Xi0 > 0. Therefore x* G Pli=i df(x~i).
Exercise 2.30 Let / 0 := f\x0Then dfo{0) = {x* \x0 \ x* 6 9/(0)} and / ' ( 0 , x ) = fo(0,x) for x G Xo, / ' ( 0 , x ) = 00 x e X \ Xo- Since /o is continuous at 0 (G XO), we have that /o(0,x) = max{(x,x*) | x* G 9/o(0)} for all x G -Xo. Hence, for x G Xo, / ' ( 0 , i ) = m a x { ( x , i * ) | x * G 9/o(0)} = sup{(x,x*) | x* G 9/o(0)} = sup{(x,x*|x 0 > I a;* € 9/(0)} = sup { f o x ' ) | x* G 9 / ( 0 ) } . If x $ Xo = XQ"1, there exists x* G X* such that x"*|x0 = 0 and (x,37*) 7^ 0. Taking x*0 G 9/(0) ( / 0 by Theorem 2.4.12), we obtain that sup{(x,x*) I a;* G 9/(0)} > sup {(x,x"0 + tx*) \ t G R} = 00 = /'(0,57). Let now i 6 I o \ Xo- Since /o is continuous at 0, by Theorem 2.2.11, /o is Lipschitz at 0, i.e. there exists Vo G N x 0 such that Vo C dom/o and |/O(:E) — fo(y)\ < pv0(x — y) for all x,y G Vo, where py 0 is the Minkowski gauge associated to Vo. It follows that /o(0, x) < pv0(x) for all x G X0. Replacing eventually Vo by a subset, we may suppose that Vb = V fl Xo, where V G !Nx • So we have that /o(0, x) < pv(x) for all x € Xo- It follows that V i e X o , Vi* G d / ( 0 ) : {x,x*) =
(x,x*\Xo)
As x G Xo, there exists a net (XJ) C Xo converging to x. From the relation above, taking into account the continuity of x* G 9/(0) a n d p v , we obtain that (51, x*) < pv(x) for all x* G 9/(0), and so sup{(x, x*) | x* G 9/(0)} < pv(x) < 00. Exercise 2.31 We take x / y (the other case being obvious). Consider the function ip : R —> R, (^(t) = / ( ( l — t)x + tj/) — t(f(y) — / ( x ) ) . Because ? is convex, y(') < ^(0) = v ( l ) f° r every t G [0,1]. Since / is continuous on i n t ( d o m / ) , ip is continuous on [0,1]. Therefore there exists to G [0,1] such that
+
(t-t0)(y-x))~f(z).
320
Exercises - Solutions
Dividing by t — to for t G]£o, 1] and taking the limit for t —> to we get f(y) — f(x) < f'(z,y — x), while by dividing by t — to for t G [0,
= max{(y - x, z*) \ z* G = mm{{y-x,z*)
\ z*
df(z)}, €df(z)},
we obtain the conclusion. Exercise 2.32 (i) The equivalence of a) and b) is true for any multifunction T : X =3 X*, and follows immediately. a) =>• c) Suppose that df is single valued on domdf. Assume that there exist distinct Xo,x\ G X* and A G]0,1[ such that [xo,x|] C Imdf and f{x\) — (1 - A)/*(x3) + Xf'(xl), where x*x := (1 - A)xS + \x\ e Imdf. Let x 6 X such that x^ G df{x). Then, by Young-Fenchel inequality, o = /(*) + / * ( * ! ) - < * , * A > = (i - A) (/(x) + /•(*;) -
(x,x*0)) + A(/(X) + /•(*;> - <*,*:» > o.
It follows that / ( # ) + /*(x*) — (x, x*) = 0 for i G {0,1}, whence the contradiction
{x'0,xl}cdf{x). c) => a) Assume that df is not single valued on domdf. Then there exist x G X and distinct xS,xJ G df(x). Of course, [x3,xl] C df(x) C Imdf. Hence f(x) + f(x'x)-{x,x*x) = 0 for AG [0,1], where x j := (1 - A)xS + \x{. It follows that f*(x*\) = (1 —A)/*(xS) + A/*(xi), which shows that / * is not strictly convex on [zo.aJi]Assume now that / is continuous on D := i n t ( d o m / ) and D / 0. Suppose there exists x G dome*/ \ D. By Theorem 1.1.3 there exists x* ^ 0 such that (x — x,x*) < 0 for every x G d o m / . Taking x* G df(x), we obtain that x*+tx* G df(x) for every t > 0, contradicting a). (ii) Consider the topology w* on X*. Then (/*)* = / and df* = (df)'1. By (i) we obtain that / is strictly convex on every segment [xo,xi] C dom df. But, by Theorem 2.4.9, i n t ( d o m / ) C dom df. The conclusion follows. Exercise 2.33 Let (x*,y*) G df(x0,yo); it is obvious that x* G df(-,yo)(xo) and y* G df(xo,-)(yo), even without assuming that / is continuous at (xo,yo). Let x* G df(-,yo)(xo). Since / is continuous at (xo,yo) G d o m / , the function f'{(xo,yo),) is finite, sublineax and continuous. Furthermore, since x* G df(-, yo)(xo), we have that VxGX
: (x,x*) < f{xo + x,y0) -
f(x0,yo),
and so V x G X : (x,x*)'((xo,2/o),(x,0)).
Exercises - Solutions
321
Consider the linear subspace X x {0} of X x Y and the linear functional tpo '• X x {0} —> R, <po(x,0) := (x,x*); taking into account the above inequality and the Hahn-Banach theorem, there exists a linear functional y i l x Y ^ R such that
: tp(x,y) < f'((x0,yo),
(x,y)).
Since f'({xo, yo), •) is continuous, from this inequality we get that
F{x,t):={ fo{x) ' \ CO
* if
f {
} *\^\ fl(X) > t,
and its associated marginal function h : R —$• R; h is a decreasing convex function such that /i(0) = v G R and /i**(0) = v*. By Theorem 2.6.1(v) we have that v* = v if and only if h is lsc at 0. Since h is decreasing, we have: v*
= w « . V e > 0 , 35>0,Vte]-6,5[\ -&Ve>0,
3<5>0
h(t)
>y-e
/i(<5) > v - e
^ • V e > 0 , 35 > 0
[fi(x) < 6 => f0(x) > y - e]
^Ve>0,
[fo(,x) < v - e =*• fi(x) > 5]
35>0
& Ve > 0 : inf{/i(z) | f0(x) < v - e} > 0, i.e. the conclusion holds. Exercise 2.35 "=>" If x* < / then /(a;) > (x,x*) > - ||x|| • ||x*|| for every x € X, and so the conclusion holds with M = \\x*\\. "•$=" Let g : X ->• R be defined by g(x) := M||x||. The function g being continuous, from the Fenchel-Rockafellar duality formula we have 0 < inf (/(x) + M | | x | | ) = mj« ( - / • ( * ' ) - V ( - * * ) ) =
max
(-/*(**))•
Since —/*(x*) = inf^ex (/(x) — (x,x*)), the conclusion follows. Exercise 2.36 "=>•" If - / i < x* + a < f2 then / i ( x i ) > - (xi,x*> - a and / 2 ( x 2 ) > <x 2 ,x*) + a, and so / i ( x i ) + / 2 ( x 2 ) > (x 2 - x i , x * ) > - ||x 2 - x i | | - | | x * | | for all x i , x 2 6 X, and so the conclusion holds with M = \\x*\\. "<=" Let f,g : X x X -> 1 be defined by / ( x i , x 2 ) := / i ( x i ) + / 2 ( x 2 ) and g{xi,X2) := M | | x 2 — Xi||. The function g being continuous, from the Fenchel-
322
Exercises - Solutions
Rockafellar duality formula we have 0<
inf
(f(xi,x2)
+ g(xi,x2))
=
max {-f*{x{, x*2) - g*(-x\,
x\tX2^X
-x\)).
x^,x^€X*
But g*(—x{, —x2) — 0 if x\ = — x2 € Ux*, *(—£*, —x2) — oo otherwise. Therefore there exists x* € X* such that 0 < -f*(-x*,x*)
=
inf
(fi(xi)
+ f2(x2)
+ (xi,x*)
-
(x2,x*))
X\,X2&X
= inf (/i(xi) + ( x i , x * ) ) + inf xi£X
(f2(x2)-(x2,x*)).
!2fcA
Note that both infima are finite. Taking a := inf^gx (fi{x) + (x,x*)) get the conclusion.
S R, we
E x e r c i s e 2.37 The implication "^=" follows immediately from the obvious inequalities - 2 (y,y*) <2\\y\\ • ||y*|| < || y || 2 + ||y*|| 2 . "=>•" Consider the topology cr(X, X') on X; so X is a separated locally convex space and T : X —• Y is a continuous linear operator. Take g : Y —>• R defined by g(y) '•= \\y + 2/o||2- The function g being continuous, from the Fenchel-Rockafellar duality formula we have 0 < inf• (f(x) + \\Tx + y0\\)= xzx
max ( - / * ( T V ) ~ *(-y*)) • y*ey
But g'i-V*)
= (2/o,2/*> + i | | » * | | 2 ,
- / * ( T V ) = inf ( / ( * ) - < T * , * * ) )
(see eventually Theorems 2.3.1 and 3.7.2); the conclusion follows replacing y* by 2y*. E x e r c i s e 2.38 Taking / : X -t K defined by f(x) := \\x — x\\ and g :— ic, we have that d(x,C) = inf x e x (f(x) + g(x)). We notice that f*(x*) = (x,x*) + tux, (x") and g* = t_c+. Because / is continuous on X, applying the FenchelRockafellar duality formula we get the first formula. In order to prove the second formula take / := x* + iux and the same g as above. Notice that f*(x") = \\x* — z*|| for x* € X*. Because / is continuous at 0 6 d o m / n domp, we can again use the Fenchel-Rockafellar duality formula. Hence i n f x e u x n c (x,x*) — max a ,. g c + (— ||x* — x*||). The conclusion follows. Using the previous formulas we get sup xeuxnc
d(x,D)=
sup xeuxtic
sup x*eux,nD+
(— (x,x*)) —
sup x*eux.r\D+
d(x*,C+).
323
Exercises - Solutions
Exercise 2.39 The inclusions [l,oo[-df(x) C d-f(x) C d< f{x) are obvious < (and true without convexity). Let x* G d f(x) and x € [/ < /(x)]; there exists x0 e X such that f(x0) < f(x). It follows that (1 - X)x + Ax0 € [/ < /(x)] for every A €]0,1[, and so ((1 - X)x + \x0-x,x*)
((1 - X)x + Xx0)-f(x)
<
Taking the limit for A —> 0 we obtain that (x — x,x")
(l-X)f(x)+Xf{x0)-f{x). < f(x) — / ( x ) , and so
x* e d±f(x). Let now x* € d-fix); this means that x is an optimal solution of the convex problem (P) min f{x) - (x, x*), g(x) := f(x) - / ( x ) < 0. By Theorem 2.9.3, there exists A > 0 such that 0 e d(f - x* + Xg)(x) d{(l + A)/ - x*), i.e. x* e (1 + A)3/(x). Therefore x* e
-
d
Exercise 2.40 1) It is obvious that (c) => (a) and that (c) =£• (b). (b) => (c) Let XQ € d o m / ; therefore ^Cn^iAnlko — o„|| 2 < oo. Let x € X be fixed. Then II* - an|| 2 < (\\x - xo|| + ||xo - anil) 2 < 2(||x - x 0 || 2 + ||x 0 - a„|| 2 ). So we obtain that f{x) = Y°°
A „ | | x - a „ | | 2 < 2||x - xo|| 2 + 2Y" 0 0 A n ||x 0 - a„|| 2
<2||x-xo||2+2/(xo),
(')
which shows that x e dom / . Taking xo = 0 in the above proof, we have that (a) =£• (c). 2) Suppose that ^ ^ j A n U a n H 2 < 00. By 1) we have that d o m / = X. Consider the function fn • X —¥ R,
f„ := X\dai + • • • + X„dan,
where da := d{a} : X - > R , da(x) := ||x — a\\. It is obvious that limn-»oo / n ( x ) = / ( x ) for every x E X. Since / n is a convex function for every n € N, using Theorem 2.1.3(iii), we obtain that / is convex. Applying relation (*) for xo = 0, we obtain that f{x) < 2 + 2/(0) for all x e Ux- Since 00 > / ( x ) > 0 for every x 6 X, it follows that / is finite and continuous on X. 3) Assume that X^nt=iAn||an||2 < 00. First note that the series ^nt=i ^n ll a "ll is convergent; this follows immediately from the inequality 2An ||a n || < An + An ||an|| 2 .
Secondly, recall that ddl(x) = 2da(x)dda(x) x,a € X.
= 2 ||x - a\\ • d\\-\\ (x - o) for all
324
Exercises - Solutions
Let x G X be fixed and consider x*n G 9||-|| (x — a„) for every n > 1. Since xJi G f/x* and An ||x — an\\ < \ n \\x\\ + X„ \\an\\, we obtain that the series S n > i 2^™ 11^ — "n|| x„ is convergent; let a;* be its sum. Because f{u) = lim/„(M) for every u G X and Y^l=x 2 ^* \\x ~ a *ll xl e dfn{x), we obtain immediately that x* £ a / ( x ) . Take now x* G df(x). Consider the function h„ := / — / „ ; the function hn is also convex and continuous. It is clear that hn = A„+id 2 + hn+i, and so dh„{x) = \n+iddln+1(x) + dhn+\{x) for all n > 1. As x* G df(x) and / = Aid„j + hi (and so d / ( x ) = A i S d ^ x ) + dh\(x)), there exists x* G 9||-|| (a; — ai) such that x* — 2 ||x — ai|| x\ G dhi(x). As hi = A2d„2 + /12, there exists X2 G 51|• || (x — 02) such that x* — 2 ||x — a\\\x\ — 2 ||x — a2||x2 € dh2(x). Continuing in this way we obtain a sequence (x£) n >i C X* such that x* G 9||-|| (x -an),
x* - ^2
2 \\x - ak\\ x*k € dhn(x)
Vn G N.
But for n G N and u G J7x we have that (h„)'(x,u)
< hn(x + u) -hn(u)
Afc (||x + u-ajfc|| 2 - ||x - a j t | | 2 )
= Y^ *—•'k=n+l
'
< rn \\u\\, where
E
oo
A*(2||x|| + 1 + 2 | K | | ) . Of course, (r„) -* 0. It follows that (hn)'(x,u) < rn \\u\\ for every u £ X, and so dhn(x) C r„Ux«- Therefore ||x* - Y2=i 2 Hx _ a*H x *il ^ r " f o r e v e l T n e N> whence we get x* = 2 X3 n >i Wx ~ a "ll x n- Hence the formula for df(x) holds. Exercise 2.41 It is obvious that (d) =>• (e) =>• (a) =>• (b) => (c); moreover these implications hold for an arbitrary function / . (c) => (b) Let xo G X be such that /(xo) < A and A > A. Consider x G [/ < A] and t := (A - A)/(A - / ( x 0 ) ) G ]0,1[. Then f(tx0 Therefore
+ (1 - <)x) < tf(x0) _ A—A A - /(xo)
+ (1 - t ) / ( x ) < tf(x0)
+ (1 - «)A = A.
_ A — /(xo) A - /(xo)
., , T ,
whence we obtain that x G
A — f(xo)
T
t)
r.
^ -ri
A—A
' [ / - A ] " r ~ 7 T ^ •Xo-
A - f(xo) A - /(xo) Therefore [/ < A] is bounded. The conclusion evidently holds for A < A. (Note that we didn't use the convexity of / . )
Exercises - Solutions
325
(b) =>• (a) The proof is immediate by way of contradiction (without any condition on / ) . (e) => (d) Since (e) is true, there exist a,p > 0 such that f(x) > a\\x\\ for every x G X, \\x\\ > p. Since / G T(X), by Theorem 2.2.6, there exist x* G X* and 7 G R such that f(x) > {x, x*) + 7 for every i f l . Therefore V x G X , \\x\\
: /(x)>-||x||.||x*||+7>-p||x*||+7.
Taking /? := min{0,7 - p||x*||}, condition (d) is satisfied. (a) =>• (e) We show it by way of contradiction. Therefore there exists a sequence (x„) C X such that (||x„||) —• 00 and lim„->oo/(x n )/||x n || = p> < 0. Therefore, for every fc G N there exists n'fc G N such that V n e N , n > n ' f c : ||x„|| > fc2 and / ( x n ) / | | x n | | < 1/fc; hence there exists an increasing sequence (n^kzn C N such that f(x„k)/\\x„k || < 1/fc and | | x „ J | > fc2 for every k. WefixxG dom / and consider tk := l|xnfc||/fc > k and yfc := (1 - ±-)x + ^ i n t . Then (\\yk\\) -»• 00 and /(»*) < (1 - **') / ( ^ ) + tk'fixn,)
< 1 + max{0,
f(x)},
a contradiction. Therefore the implication (a) => (e) is true. (d) => (f) Let g : X -> R, g(x) := a • ||x|| + (3. The function 3 is convex; using Theorem 2.3.1 and Corollary 2.4.16 we have that g*(x*) = Laux, (x*) — j3. Since / > g, we have that /* < g", whence aUxm C dom/*; the conclusion follows. (f) =>• (d) Since /* is u>*-lsc, /* is || • ||-lsc, too; since 0 G int(dom/*), using Theorem 2.2.20 it follows that /* is continuous at 0. Therefore there exist a,/3 € R, a > 0, such that /*(x*) < fi for every x* G aUx*- Therefore /* < Laux. +P, whence, taking the conjugates, we obtain that / ( x ) > /**(x) > a • \\x\\ — /3 for every x G X. Let dimJf < 00. The implication (b) => (c') is obvious. Assume that (c') holds but (b) does not. Then there exist A € R and (x„) C [/ < A] such that (||x„||) -»• 00. We may assume that ( | | x n | | _ 1 x „ ) -> u / 0. It follows that u G [/ ^ M°°, and so, by Exercise 2.23, u G [/ < inf /],*,, contradicting the boundedness of [/ < inf/]. Suppose now that p,q G ]1, co[, 1/p + 1/q = 1. The implications (i) =*• (g) and (j) =^ (h) are obvious, while the proof of the implication (g) => (i) is similar to that of (e) => (d) above. The equivalence (g) •£> (h) follows immediately from the anti-monotonicity of the conjugation and using the relation ( i | | • || p )* = i | | • \\q. Suppose that (h) holds. Then there exist a,p > 0 such that /*(x*) < a||x*|| ? for every x* G X*, ||x*|| > p. Since d o m / * is a convex set, the preceding inequality shows that d o m / * = X*. Let
326
Exercises - Solutions
x* e X* \ {0}, ||a;*|| < p; consider x* := px7||x*||. Then
n o
=,. (M,.
+
(i _ M) 0 ) < H r ( f ! , + (i _ M)/-(o)
< a p « ^ + f l - J £ ! h / ' ( ( ) ) < ap" + m a x { 0 , / * ( 0 ) } . p V p i Taking 0 := ap« + max{0, /*(0)}, (j) is verified. E x e r c i s e 2.42 Let 7 > 0 be such that | / n ( 0 ) | < 7 for every n € N. By the preceding exercise there exists r > 0 such that f(x) > 37 for x € rSx- Since rUx is a compact set, by Corollary 2.2.23 we have that (/„) converges uniformly to / on rUx, and so there exists no £ N such that \fn(x) — f(x)\ < 7 for x 6 rUx and n>no; hence fn(x) > 27 for x € rSx and n > no- Let x 6 R m be such that ||x|| > r and n > no. Then
2lS
' • (iSl' + i1 ~Hi)°) s H A W + ( ' - H i ) M 0 ) s H / - ( I , + 7 '
whence fn{x) > r~lry \\x\\. Taking m := inf / ( R m ) , which is in R by the preceding exercise, we obtain that fn(%) > m — 7 for x € r£/x and n > n 0 . It follows that fn(x) > a\\x\\ + [3 for x € R m and n > n 0 , where a := r_1f and /3 := min{m — 7, 0}. E x e r c i s e 2.43 The implication (i) => (ii) is obvious. Assume that (ii) holds. Hence there exists xo 6 X and XQ 6 X* such that <x,x*.>>(x 0 ,x*) + | | x - x o | | > | | x | | - / 3
Vx 6 C,
(°)
where /? := ||xo|| — (xo,xo). If y5 < 0, (i) obviously holds. Assume that f3 > 0. Since 0 $ C, there exist x* £ X* and S > 0 such that (x,x*)>5
VxeC.
(°°)
Multiplying (°°) with 7 := <S-1/? > 0 and adding the result to (°) we get {x, XQ + 7X*) > ||x|| for every x £ C. Note that conditions (ii)—(iv) are invariant under translations; so we may assume, and we will, that 0 £ C. (i) =>• (iii) Prom (i) we have that / := ic+xo satisfies condition (d) in Exercise 2.41, and so, by the same exercise, 0 € int(dom/*). Since /*(x*) = L*C{X* —XQ) = sc(x* — XQ), we have that d o m / * = XQ + domsc- Hence (iii) holds. (iii) =>• (i) Assume that (iii) holds, and take — X*, £ int(domsc). Then 0 6 int(dom/*) with / defined above. By the implication (f) => (d) of Exercise 2.41 we have that (X,XQ) > a ||x|| — (5 for every x 6 C, for some a, {3 € R, a > 0.
327
Exercises - Solutions
Multiplying eventually with a - 1 , we may suppose that a = 1. As in the proof of the implication (ii) => (i) above, we obtain that (i) holds. (i) => (iv) Let c £ C be fixed and take u e Coo, and t > 0; of course, c+tu € C. From (°) we obtain that (c + tu, x*0) > \\c + tu\\ > t ||u|| - ||c||
Vt > 0;
letting t —> oo, we get {u, Xo) > \\u\\. Hence (a) holds. Moreover, if (xn) C C with (H^nll) —• oo and ( | | x „ | | _ 1 x n ) -^ u, we have that ( | | x n | | _ 1 xn,Xo) > 1, whence (u,Xo) > 1. T h u s u / 0 . Assume now that X is a reflexive Banach space. (iv) =$• (i) Let Xo G X* be such that (u, XQ) > 0 for every u E C-oo \ {0} and x{ 6 X*, 5 > 0 satisfy (°°). It follows that ( u . ^ ) > 0 for every u G OooThere exists i) > 0 such that M ) > - T ? Vxec. (°°°) Otherwise there exists (xn) C C such that (xn,Xo) < —n for every n G N. Since ll^nll • H^oll > «, we have that ||x„|| -¥ oo. Passing to a subsequence if necessary (X being a reflexive space), we may suppose that ||a;n|| _1 xn ^y u. It follows that u € Coo (C being closed). By (b) we have that u / 0. Since ( ||a; n || 1 xn, xS) < 0, we obtain that {u, XQ) < 0, contradicting the choice of XQ. Hence (°°°) holds. Replacing XQ by J _ 1 (?j + S)xl + XQ, we have that (°°) still holds and (u, XQ) > 0 for \ {0} (i.e., we may take x\ — XQ). Assume that C does not satisfy (i). Then for every n 6 N* there exists xn € C such that (nS <) (x„,nxo) < \\xn\\. Hence ||a; n || —> oo. As above, we may assume that | | x n | | _ 1 xn A w; hence u € Coo, and so, by (b), u ^ 0. But, since ( ||a;n|| _ xn,xo) < n-1 ,we have that (u, XQ) < 0, a contradiction.
E x e r c i s e 2.44 (a) => (b) Suppose that (a) holds. Consider xo € X such that f(xo) < A. Let x £ X; it is clear that (b) is true for ||a;|| < p. Consider that ||a;|| > p; then f(x) > A. Since for f(x) — oo the inequality (b) is obviously true, let f(x) € R If f{tx + (1 - t)x 0 ) < A for every t e]0,1[ we have that \\tx + (1 — £)xo|| < p for every t G]0,1[, and so the contradiction ||x|| < p. Therefore there exists t\ €]0,1[ such that f(t\x + (1 — ti)xo) > A. By Theorem 2.1.12(vii), there exists t2 €]0,1[ such that f(t2x + (1 - t2)xo) < A. Since the function ]0,1[9 t t-t f(tx + (1 — t)xo) is convex, it is continuous. So, there exists t e]0,1[ such that A = f(tx + (1 - F)x 0 ) < t / ( x ) + (1 -
t)f(xo).
But t | | i | | - (1 - t )\\x0\\ < \\tx + (1 - t )x 0 || < p,
Exercises - Solutions
328
and so t < {p+\\xo\\)/(\\x\\
+ \\x0\\).
Therefore
A < t(f(x) - /(*„)) + /(*„) <
II^^+^HII
• (/(*) - /(x°)) + /(*»)•
Thus
/(-) > /(*o) + J ^ ± M . (A - /(,„)) = M ^ A . (A _ / ( l o ) ) + A P+IMI
P+ll*o||
>JI4^-(A-/(«o)) + A. Ip Since f(xo) can be taken as close to inf / as wanted, we have that /(x)>A+M^-(A-inf/)>inf/+M^.(A-inf/), which shows that the conclusion holds. (b) => (c) Set p := (A - inf /)/(2p) and consider g : X ->• E, g(x) := p,max{0, ||x|| — p}; g is a continuous convex function. For computing g* we could apply the conjugation formulas established in Section 2.8, but we proceed directly. g*(x*) = s u p x g x ((x,x*) - pmax{0, \\x\\ - p}) = max |sup| W |< p (a;,x*),sup| N | > p ((x,x*) - p(\\x\\ - p ) ) } = max|p||x*||,sup|| I ||> p ((x,x*) - p\\x\\) + ppj . But sup|| x || >p ((a;,a;*) —/i||x||) = oo if ||x*|| > p, while if ||x*|| < p then <*,**> - p\\x\\ < ||*|| • (||z*|| - p) < p(\\x*\\ - p), and so sup ((x,x')-p\\x\\)+pp IWI>p
Therefore g*(x*) = max{p • ||a;*||, i^ux.
(x*)}.
Since / > inf/ + g and inf/ = — /*(0), by conjugation we obtain that /* < /*(0) + g*, and so (c) holds. (c) =>• (b) Using (c) we obtain that /* < /*(0) + g*. Since g** = g, by conjugation we obtain that (b) holds. E x e r c i s e 2.45 (a) The mentioned equivalence was established in Exercise 2.41. Let p > 0 be such that liminfuxn-too / ( x ) / | | x | | > p. Then there exists p > 0 such that f(x) > p • \\x\\ for ||x|| > p. Since / S T(X), / is lower bounded on
329
Exercises - Solutions
liUx, and so there exists 7 € R such that f(x) > n]\x\\ + 7 for every i £ l , As in the proof of Exercise 2.41 we obtain that /* < iMc/x» — 7. Therefore /* is upper bounded on [iUx* • Since \i £]0,liminf||:E||_+(xl /(aO/||z||[ is arbitrary, we obtain that the inequality "<" of the relation to be proved holds. Suppose now that n > 0 and /* is upper bounded on pUx* • Then there exists 7 £ R such that /* < t M c/ x . — 7- Taking the conjugates, we obtain that f(x) > fJ.\\x\\ + 7 for every x £ X, whence liminf|| x ||_j 0 0 /(x)/||a;|| > fx. So we have obtained the inequality ">" of the desired relation. Before beginning effectively the proof of the other assertions let us do some remarks. If /j > 0 and g:X-*R,
g(x) := f(x) + ii\\x\\,
then, by using Theorem 2.8.7, we have that g*(0) = min{/*(x*) + ipvx. (—X*) \ x* £ X*} = min{/*(a;*) | x* £ fiUx*}Let fj, > 0 be such that liminf||a.||_KX) /(a;)/||a;|| > — fj,. By what precedes, we get the existence of 7 6 K such that f(x) > — fj,\\x\\ + 7 , i.e. g(x) := f(x) + (J,\\X\\ > 7, for every x € X. Therefore 0 € domg*. By the preceding displayed relation we have that fiUx' D d o m / * ^ 0, and so d<jom/*(0) < /J.; hence -ddom/*(0) > min{0, liminf|| j ; ||_ > 0 0 /(x)/||x||} . Let now (x„) C d o m / * be such that (||xnll) —^ ddom/*(0). Using the YoungFenchel inequality, for every n 6 N we have that V x e l :
/ ( * ) > (x,x'n) - f(x*n)
> -Hsll • Hi'll -
f K ) ,
and so, dividing by ||i;|| and then passing to the limit inferior for ||x|| —> 00, we get V n e N : liminf|| B ||_ M)0 /(*)/||a!|| > - | K | | . Taking now the limit, we have that liminf|| a .||_ HX> /(a:)/||x|| > -d d om/*(0). So we obtained that min{0,liminf|| a ! ||_ > o o /(x)/||i||} = -d d om/*(0).
(*)
Let us prove now the other two assertions. (b) If liminf|| a: ||_, 00 /(a;)/||a;|| = 0, by (*) we have that 0 € cl(dom/*); using (a) we have that 0 ^ int(dom/*). Therefore 0 £ Bd(dom/*). Conversely, if 0 6 Bd(dom/*) we have 0 ^ int(dom/*), and so, by (a), liminf||x||_KX, /(x)/||a;|| < 0; using the fact that ddom/*(0) = 0 and (*) we obtain liminf||;E||_+0O / ( x ) / | | x | | = 0.
330
Exercises -
Solutions
(c) If liminfii^n-too / ( x ) / | | x | | < 0, using (*) we obtain the equality from (c) and 0 < ddom/*(0), i.e. 0 ^ cl(dom/*). Conversely, if 0 $ cl(dom/*) then ddom/*(0) > 0; using again relation (*), it follows that liminf||rE||_+00 / ( x ) / | | x | | < 0. Exercise 2.46 Let x = (x„)„>i € £p; then (x„) —• 0, whence there exists ls no > 1 such that \xn\ < \ for n > no- Since the series 52n>in/2n convergent and n | x n | n < n / 2 " for n > no, it follows that the series ^2n>1n\xn\n converges; therefore x € d o m / . The function (P B \n £ M is obviously convex. It follows that the function
fn:t?^m,
/»(*):= V "
*—Jk=l
k\xk\\
is convex, too. Since f(x) = lim fn(x) for every x 6 lv', using Theorem 2.1.3(H), we obtain that / convex. Let p 6]0,1[ and x = (xn)n>i € pUtv. Then \xn\ < p for every n > 1. It follows that
MxtpUtv
: }{x)
npn € R.
The function / being convex, using Theorem 2.2.13, we obtain the continuity of / on dom / = V. Since f(e„) = n, where e„ = ( 0 , . . . , 0 , 1 , 0 , . . . ) , 1 being on position n, f is not bounded on pUiv for every p > 1. Applying Exercise 2.45 (a) for f*:£q = (£p)* —t R, where q = p/(p-l), since (/*)* - f, we have that liminfiij,^,*, f*(y)/\\y\\ = 1. Exercise 2.47 (a) Let K := A(C); it is obvious that if is a convex set. Using Corollary 1.3.15 we have that K is also a closed set. Since the function Y 3 V '-* |IMI 2 +1K(V) is convex, lsc and coercive, and the space Y is reflexive, using Theorem 2.5.1(h), there exists y € K such that |||y|| 2 < f |M| 2 for every y S K. Taking x e C such that y = Ax we obtain that x is a solution of problem (P). (b) Let x e C and / : X ->• R, f(x) := i||Ax|| 2 . By the PshenichnyiRockafellar theorem, x is a solution of (P) if and only if df(x) D (—N(C;x)) ^ 0. Since d(±\\ • \\2)(y) = {y} for y £ Y, from Theorem 2.8.6 we have that df{x) = {A*(Ax)} for x 6 X. Therefore x is a solution of (P) if and only if A*(Ax) e —N(C;x), i.e. (x\v) < (x\v) for every x € C, where i; = A*(Ax). Exercise 2.48 Using Exercise 1.5 (c) we have that C + ker A is a closed set, while using Exercise 2.47 (a) we have that (P) has optimal solutions. Suppose that there exists x with the mentioned property; then Slater's condition is satisfied for the problem min |||v4x|| 2 ,
(x, ipi) - on < 0, 1 < i < k.
The conclusion follows applying Corollary 2.9.4.
Exercises - Solutions
331
Exercise 2.49 Consider the function / : X -> R, f(x) := \\x — oi|| + \\x a.211 + \\x — d3||. It is obvious that the function / is convex, continuous and lim||I.||_>00 f(x) = oo, i.e. f is coercive. The function / is even strictly convex. Indeed, for x,y G X, we have that \\x + y\\ = \\x\\ + \\y\\ if and only if there exists A,n > 0, A + fj, > 0, such that Xx = py (i.e. x, y are on a half-line passing through the origin). Using this fact, if there exist x, y G X, x / y and A G]0,1[ such that f(Xx + (1 — X)y) = Xf(x) + (1 — X)f(y), then 01,02,03 are on the straight-line determined by x,y, which is a contradiction. Therefore there exists a unique element x G X such that f(x) < f(x) for every x € X. Moreover x is characterized by 0 € df(x). But f)f/v\-f
T,l
if
^X-di)
u g {ai,a2,a3},
Denote ej := (x — ai)/\\x — aj|| when x ^ a;. If x ^ (ai,02,03}, using the relation ei + e2 + e3 = 0 we obtain that
V3
"\ 1 v - 3
1
^i^Wx-aiWj
a<
^i=x\\x-ai\\'
and so x G 00(01,02,03}. If x £ {01,02,03}, of course x G 00(01,02,03}. Therefore the conclusion holds. Prom 0 G df(x) we obtain that x = 03 if and only if e\ 4- ei G Ux, which is equivalent to (ei|e2) < — \, i.e. ^(01,02,03) > 120°. Suppose that x g {01,02,03}. Then ei + e2 + ez = 0, whence (e; \ej) — —| for i / j . Therefore Z(o;, x, aj) = 120° for i / j . Thus we obtained the known properties of Toricelli's point. Exercise 2.50 For every n > 0 and i G {1, 2, 3,4} we consider the function
U-.Xi^
R,
/„(*) := J
(tx(t) + py/\ + (x(i))2)
Prom Exercises 2.9 and 2.10 we have that / M is convex; moreover / is a C2 function on Xi and X3 and only Gateaux differentiable on X2 and X j . In every case we have that Vx.ueXi
: V/M(x)(U)= T L w + M - ^ M g ^ )
Jo V
V 1 + (*W)2/
dt.
Moreover v(Pt)
> v(P£),
v(P?) > «(P 4 ").
Let us take i = 2 and ^ > 0. The function to be minimized being convex and Gateaux differentiable, x^ € X2 is a solution of (.Rf) ^ anc ^ o n l y ^ ^fp{ x v) = 0-
332
Exercises -
Solutions
It is obvious that V/o(z) / 0 for every x € X2. So, let fj, > 0. The condition V / M ( J ; M ) = 0 is equivalent to
VBEXJ
:
f
(t + n
Jo \
*jj(f)
)u(t)dt = 0,
2
V l + W*)) /
which is, obviously, equivalent to each of the following relations: XjAf)
V l + (*M(*))a
+
i = 0 a.e. t € [0,1],
x^t)
=
"
~*
a.e. t e [0,1].
v / ^ ^
The function xM above is in X2 if and only if /u > 1. Therefore for \i > 1 the problem (P^) has a unique optimal solution, xM, while for /x < 1 the problem (P^) has no optimal solutions. Let us determine ^(P^) when fj, £ [0,1[. For every a e]0,1[ and /3 € ] 1 , oo[ consider the function
^:[O,I]^R,
« a ,(t):={_ l(t _° 1+a) ;[
^L1;;};
An elementary computation shows that = M (l - a) + a/3 ( | - I + f ^ l + ^ + ^ I n (/? + V ^ + 0 5 ) ) •
U{xa,p)
It follows that fii{xn-\n2) —> —00 for // € [0,1[, and so ^(P^) = —00. Let now » = 1. Since x n - i n2 € X\ we obtain that v{P£) = —00 for /z € [0,1[. Doing again the calculations of the case i — 2 we obtain that (P?) has optimal solutions if and only if fi > 1. The solution is unique and is the function xM determined above. Therefore v(Pjl) = v(P£) for /x > 1. For /1 = 1 we have already seen that v(Pi) > ^(P^). Considering for every n 6 N the function Il
-[0,1HR'
^ W - l ^(i-i)
if
te]i-i,i],
we observe that x" € Xi and limn-xx, / i ( x " ) = f\(x\) (one can use Lebesgue's theorem). Therefore u ( P / ) = ^(Pf). Let i = 3. Similarly to the case i = 2, j / M is an optimal solution for (P^) if and only if V/M(j/M) = 0. For /j. = 0 this is not possible, and so (P°) has no optimal solutions. Let fj, > 0 and i/M an optimal solution of (P3"). The condition V/M(j/M) = 0 is equivalent to
333
Exercises - Solutions
Recall the following result: if x e C[a,b] (a,b eR, a < b) and fa x(t)u(t)dt = 0 for every u 6 C[a, b] with J u(t)dt = 0, then £ is a constant function. Indeed, let c := (6 — a ) - 1 / u(t) dt and u := x — c; by hypothesis we have that Ja (u(t) — c)2dt = f (u(t) — c)u(t) dt = 0 and so x(t) = c for every t e [a, b]. Therefore there exists a e R such that V*G [0,1] : n
,
V
^
=+t = a;
it follows that V i e [0,1] : y „ ( t ) =
^ - l L = = . ^H2 - (a - t)2
Since j / M e C[0,1], we necessarily have that /u > |a — t\ for every t e [0,1], i.e. 1 — fi < a < ft. It follows that p > \. In this case, yM e X3 if and only if a = | . So, for (j, < i the problem (P^) has no optimal solutions, while for fi > | the problem (P3") has a unique optimal solution, namely
yM : [0,1] -+ R,
2
y„(t) =
In a similar manner, but using this time the result: if x e L°°(o,6) (0,6 e R, o < 6) and f x(t)u(t)dt = 0 for every function u e Ll(a,b) with the property / u(t) dt = 0, then x is constant a.e. (the proof being similar to that presented above), we obtain that (P£) has optimal solutions if and only if \i > | ; in this case the unique solution is yM denned above. Let now fj, < | ; for a e]0, | [ , P € P consider the function
r ya,0 : [0,1] -J- R,
y t t ,^(t):=<
0
if
te[o,a],
0 { -p
if if
t€]a,l-a[, t e [1 - a, 1].
ya„3 e X4 and /(y t t l | 8 ) = aP
faV^+P-2
- 1 + a ) + /x(l - 2a).
Since / M (y n -i,„2) -> —00, we have that v(P£) = - c o for n < | . Taking into account that for every (a,P) € ]0, | [ x ]0, oo[ the function yatp can be approximated pointwisely by a sequence {ya,0,k)keN C X3 such that Ij/a.^.jfcl < \ya,p\ and using eventually Lebesgue's theorem, we obtain that v(Pg) = —00 for fi < | . Approximating similarly yi/2, we obtain that v(P3 ) = v(P4' ). Elementary calculations
Exercises - Solutions
334 show that
/* e [0,1[, //G [l,co[, and f —oo
if
M G [0, | [ , ^ G [i,oo[.
The solution is complete. E x e r c i s e 2.51 Recall that x G AC[0,1] if and only if x is derivable at almost every t G [0,1] and x' G L^O, 1). Note that for x G X, J* x(t) dt = - J* tx'(t) dt. Taking into account this fact, to problem (P) we associate the problems (Pi)
min Jo tx(t)dt,
xGXi,
J* y/l + (x(i)) 2 dt < L,
i = 3,4, the spaces X3 and X\ are those introduced in Exercise 2.50. Note that v(P) = —v(P3) if X = C^O, 1], and x is a solution of (P) if and only if x' is a solution of (P 3 ); if X = AC[0,1] then v(P) = -v(P4) and x is a solution of (P) if and only if x1 is a solution of [PA). Considering for i G {3,4} the functions f,g:Xi^R,
f(x):=-titu(t)dt,
g(x) := J* y/l + (u(t))* dt,
the problem (Pi) becomes (Pi)
mmf(x),
xeXi,
g(x)
The function / is linear and g is convex and Gateaux differentiable; moreover the differential of g is given by Vg(x)(u) = J0 xu/^/1 + x2 dt for i , t i £ l j (see Exercises 2.9 and 2.10). Therefore the problems (Pi), i G {3,4}, are convex problems. If L < 1, then the problem (Pi) has no admissible solutions for every i G {3,4}, while if L = 1, (Pi) has only an admissible solution, x = 0, which is the optimal solution, too. Let L > 1 and i = 4. Since g(0) = 1 < L, Slater's condition is verified for the convex problem (P4). Therefore an admissible solution 1 6 X 4 (g(x) < L) is optimal for (P4) if and only if there exists A > 0 such that \(g(x) - L) = 0,
V / ( x ) + \Vg(x)
= 0.
Let x G X4 be an optimal solution of (P4) and A > 0 satisfying the above relations.
Exercises - Solutions
335
Since V / ( s ) ^ 0 we obtain that A > 0, and so g(x) = L. Therefore
Vnei 4 : f1 ( . Xx^=
+ t]u(t)dt = o.
As seen in the solution of Exercise 2.50, the above relation is possible only when A > 1 and the sole function which satisfies it is i/A : [ 0 , 1 ] - • R,
yx(t)
•t
y/*-(h-t)*
Since g{yx) = L we have that 2Aarcsin ^ = L. But the function ]0,1] 3 t >-> j arcsin t € R is increasing with range ]1, TT/2]. Since in convex programming the necessary condition is also sufficient, we obtain that for L S]l,7r/2] the problem (Pi) has the unique solution y\, where 2Aarcsin ^y = L. The (unique) solution of problem (P) is, in this case, the function *:[0,1]^R,
, ( t ) : = ^ - ( f - i p - ^ - | ,
(•)
with A defined above. For L > 7r/2 the problem (P4) (and (P), too) has no optimal solutions. To compute the value of problem (P4), we associate its dual problem. By Theorem 2.9.3 we have that «(Pi) = v(D4) (and (D4) has optimal solutions), where (£>4)
max i n f x e x 4 ( / ( s ) + A ( p ( x ) - L ) ) ,
A > 0.
Taking into account the result obtained in solving Exercise 2.50, we have that
v(Di) = max { A2 arcsin i + \ J\2
- \ - \ L \
A> \ \ .
If i = 3, the same argument as for the case i — 4 proves that (P3) has a unique solution for L 6]l,7r/2[, having the same expression, while for L > TT/2 the problem has no optimal solutions. Moreover v(Ps) = v(P4) for every L > 1. In conclusion, problem (P) has a unique solution for L g]l,7r/2[ and has no optimal solutions for L > n/2 when X = C 1 [0,1], and has a unique solution for L 6]l,7r/2] and has no optimal solutions for L > 7r/2 when X = AC[0,1]; in both cases the solution is given by formula (*). Moreover V(P)=S
\SJ^L-\-\^L
I =^
if if
L€]l,7r/2], ^6W2,oo[,
where \L is the unique positive solution of the equation 2A arcsin ^
Le]l,n/2].
= L when
336
Exercises - Solutions
E x e r c i s e 3.1 First recall that i1 := {(xk)k>i C K | ^2k>1 \xk\ < 00} is a Banach space with respect to the norm \\x\\l := ^2k>1 \xk\- Its dual is £°° := {(xk)k>i C R I (xk) is bounded}, the dual norm being ||CiCfc)||00 := s u p ^ \xk\Let x = (xk)k>i € i1. Since (xk) -> 0 we have that \\xk\\ < 1 for k > ko, and so |a?fcj1+1'* < \xk\ for k > ko. It follows that the series defining f(x) is convergent, and so / ( x ) € R. Moreover, because \xk\ < 1 (k > 1) for ||x|| < 1, the above inequality shows that f(x) < 1 for x £ Uti; hence / is continuous if we succeed to show that / is (strictly) convex. Let us show that / is strictly convex. So, let x,y 6 I1 with x ^ y and A e]0,1[. Of course, there exists ko > 1 such that Xk0 / yk0- Since the mapping ipa : K -> R, <^a(<) := | i | a , is strictly convex for every a > 1 (see the examples after Theorem 2.1.10), we obtain that \\xk + (1 - A)j/jfc|1+1/* < A|x*| 1 + 1 / f c + (1 - A) \yk\1+1/k, with strict inequality for k = fco. Summing up these inequalities side by side we get /(Ax + (1 - X)y) < A/(x) + (1 - X)f(y). Let x* = (x'k)k>i C (^)* = ^°° be in df(x) with x = (xk)k>i- Let us fix ko 6 N. Taking J/J, := Xk for k ^ ko and y*;0 = xfco + M in the definition of df(x) we get \xko + « | 1 + 1 / * ° - | X A , 0 | 1 + 1 A O > ux'kQ for every u € R, i.e. x'^0 6 9yi + i/j : o (xj : o ). Using Theorem 3.7.2(iii) we obtain that x'k0 — (1 + •£-) Ix^ol1^*0 sgn(xjt 0 ). Therefore 9 / ( x ) (which is nonempty because / is continuous on i1) has a unique element x* whose components are x'k := (1+ \) \xk\X sgn(xfc). Taking into account Corollary 2.4.10 we obtain that / is Gateaux differentiable. Consider now x = (xk) 6 Z1 and show that / is not Frechet differentiable at x. We distinguish two cases: 1) l i m s u p j . ^ ^ Ix^l 1 '* > 0 and 2) l i m s u p j . ^ ^ Ixfcl1'* = 0. 1) In this case there exist r\ > 0 and an infinite set P C N such that \xk |1/,fc > 77 for every k € P. For n 6 N consider u" 6 i1 defined by u^ := 0 if k < n or k € N \ P and u^ •— —2xk otherwise. It is obvious that (KM"!!!) —• 0 for n —>• 00. But ||V/(x + u n ) - V / ( x ) | | = 2
sup k£P,k>n
((l + k-1)-\xk\1/k)>2r)
V
Vn € N. '
Taking into account Corollary 3.3.6 we obtain that / is not Frechet differentiable at x. 2) In this case (\xk\1/k) ->• 0. For n € N consider un S ^ defined by uk := fc_2sgn(xj,) this case
for fc > n and u]J := 0 otherwise; it is obvious that (Uu™!!!) -+ 0. In
| | V / ( x + un) - V / ( x ) | | = sup ( ( 1 + AT1) • \(\xk\ + k~2)1/k k>n
^
I
> Um ((1 + AT1) • \(\xk\ + k~2)1,k
-
\xk\1/k\) 1'
- |x*| 1 / f c |) = 1
for every n € N. As above we obtain again that / is not Frechet differentiable at
337
Exercises - Solutions
Exercise 3.2 Because \xn\ < \\x\\ for every n, we have that 0 < f(x) < \\x\\ for every x 6 £°°. Because l°° 3 x >-> \x„\ £ R is convex, by Theorem 2.1.3(iii) / is convex (even sublinear). It follows that / is continuous, being bounded above on Ue°°. Let x e Z°° be fixed. If f(x) = 0 consider u := ( 1 , . . . , 1,...) G (°°. Then for any t > 0 we have that fix + tu) =t — f(x — tu), and so limtj.o t~l (/(5" + tu) + f(x - tu) - 2/(x)) = 2 > 0. Assume that fix) > 0 and consider an infinite subset P of N such that lim„gp|a;n| = f(x); without loss of generality we assume that xn > | / ( x ) for n £ P (replacing eventually x by — x). Let P i , P2 be infinite disjoint subsets of P such that P = Pi U Pi- Let u € l°° be such that un :— 1 for n € Pi and u„ := 0 for n e N \ P i . Let 0 < t < \f{x). Then \xn + tu\ = xn + t for n € Pi and \x„ + tu\ = \x„\ for n € N \ P i . It follows that f(x+tu) = l i m s u p n £ N \x„ +tu\ > lim n 6 p 1 (x„+<) = f(x)+t. Since \x„— tu\ = \xn\ for n 6 N \ P i D P2, we have that limsup n e N V v P l |x„—1«| = f(x). Because \xn— tu\ = xn—t for n 6 P i , it follows that limsup n g N \xn — tu\ = }{x). Therefore limtiot-1(f(x + tu) + f(x-tu)-2f(x)) > l i m u o t~^ (f(x) + t + f(x) - 2f{x)) = 1 > 0. Hence, in both cases there exists u € £°° such that l i m ^ o < _ 1 ( / ( 5 + tu) + fix — tu) — 2fix)) > 0. By Theorem 3.3.2 / is not Gateaux differentiate at x. Exercise 3.3 Because pf is nondecreasing on [0, oo[, it is obvious that the three situations in (ii) are equivalent. Also for the equivalence of the three situations in (iii) it is sufficient to show that p : [0, oo[—> [0,oo] defined by pis) := i n f { i / ( x ) + \fiy)
- fi\x
+ \y) \ x,y e d o m / , \\x - y\\ = s},
is nondecreasing (of course, inf 0 = +00). Indeed, taking 0 < s' < s and x, y € d o m / with ||x — y\\ = s, then applying Theorem 2.1.5(vii) for A — 1/2, ipit) := fix + tiy — x)) and 0 < j < 1 we obtain that i / ( i ) + \fix
+ °iiy - x)) - fix + £iy - x)) < | / ( x ) + \f{y)
- f(\x
+ \y),
which shows that p(s') < p(s) because (| ^-(3/ — aOII = s'. Let now x, y e d o m / with \\x — y\\ = t > 0 and A 6]0,1[; we may assume that A e]0,1/2]. Set z := (1 - \)x + Xy; then f(z) = / ((1 - 2X)x + 2A(|(x + y))) < (1 - 2A)/(x) + 2 A / ( i ( x + y)) < (1 - 2\)f(x)
+ 2A ( i / ( x ) + \fiy)
- p(||x -
= ( l - A ) / ( x ) + A/(2/)-2Ap(t) <(l-A)/(x) + A/(y)-2A(l-A)^).
W||)
338
Exercises - Solutions
Therefore 2p(t) < Pf(t) < 4p(t) for t > 0. These inequalities show that (iii) <=>• (i). Since the implications (i) =S- (ii) => (iii) are obvious, the equivalence of these three conditions is proved. (iv) Fixing x 6 dom / in the preceding proof we get the conclusion. E x e r c i s e 3.4 Let x G I1 and t > 0. Consider e n e E1 with e£ := 1 if k = n and e£ := 0 otherwise. Then pftX{t) < 4 {\f{x + ten) + \f(x) - }{x + \ten)) for every n g N , and so 0 < p/,x(t) < 41im„_oo ( | |x» + t\1+1/n + | | * n | 1 + 1 / " - \xn + | t | 1 + 1 / n ) = 0. Because / is Gateaux differentiable, we have (see page 201) that */,.(*) = inf{r>/(»,i) | | | » - x | | = t }
VxG£\
t>0,
where Df{y, x) := f(y) - f(x) - (y - x, Vf{x)). Note first that \\y — x\\ — ||y — x|| and D/(y,x) > Df(y,x) for any y € i1, where y € I1 is denned by yk := yk if sgn(j/* - x*,) = sgn(xfc) and yk := 2xfc - yk otherwise. It is obvious that \yk — xk\ = |j/fc — a;fc| for every k > 1, and so | | v - x | | = | | » - x | | . Then
£>/(Vlx) - Df(y,x) = /(») - /(y) -(y-y,
V/(*)> = J ^ fl*.
where 6k := \yk\1+1/k-\yk\1+1/k-(l + i-)sgn(xk)-\xk\1/k (yk-yk)It is sufficient to show that 6k > 0 for every k > 1. Of course, #& = 0 if sgn(yfc — xk) — sgn(x/t) or xk = 0. So, assume that this is not the case. Then uk := 1 — yk/xk > 0, p* := 1 + £ £ ]1, 2], yk = 2xk - yk and 6k = \xk\Pk (|1 - uk\Pk - |1 +
P( W&| =
+ 2p*Ufc) = |x,| Pfc ^ ( u * ) ,
because yk - yk = 2(yk - xk) and sgn(yfc - xk) = -sgn(xfc), where <^p(£) := |1 — t\p — |1 + t\p + 2pt for t > 0. Using Exercise 2.7 we have that tpp is nondecreasing for p S]l,2], and so ipPk(uk) > 0. Hence 6k > 0 for every A; > 1, and so Df(y,x) > Df(y,x). Assume now that x € I1 \ A and t > 0. Of course, l i m s u p j , ^ ^ |£jfc| = 1. Consider un := sgn(x„) • en. Then #/,*(*) < / ( x + tun) - f(x) - t {un, V / ( x ) ) = (|x„| + i ) 1 + 1 / " - | x n | 1 + 1 / r l - t ( l + i ) | x n | 1 / n . It follows that 0 < tf/,*(t) < liminfn-^oo ( ( | x n | + t)l+1/n
- \xn\1+1/n
Fix x € A and take 7 := 1 — l i m s u p j . ^ ^ |xji|
*:[0,7[-R, 0(s):=Zk>_Xvh
- t ( l + ±) M 1 / n ) = 0. > 0. Consider the function
+
lXkll/k k
)-
339
Exercises - Solutions
The function 0 is well-defined, continuous, increasing and (f>(0) = \\x\\. Indeed, as l i m s u p ( - ^ - + M 1 A > ) = s + limsup|x )i; | 1/ * : < 1
Vs6[0,7[,
by the Cauchy-Hadamard test of convergence we have that the series defining (f>(s) is convergent. Moreover, for any k > 1 the function 4>k defined by <j>k(s) :=
( •j^pi + \xk\
\ k
J is continuous and increasing on K+, and so
[0,7[. Also, for so €]0,7[, we have that 0 < 4>k{s) < (j>k(so) for all s e [0,so] and k > 1, and so the series defining cj> is uniformly convergent on [0, so]. It follows that 0 is continuous on [0,7[. It is obvious that >(0) — \\x\\. Let to := 4>(~y/2) — \\x\\ > 0. Take now t € ]0, to]; we have that (f>(0) < t + \\x\\ < 4>{-f/2), and so there exists (a unique) s := St G]0,7/2] such that <^>(s) = t + ||a;||. We intend to show that dj,x{t) = Df{w,x), where w := wt 6 C1 is defined by + \xk\1/k]
wk:=sgn(xk)-(~^j
V* > 1. /
\ k
First note that sgn(wfc -Xk) = sgn(xfc) and \wk - Xk\ = f ^ + |x fc | 1/A: J - \xk\, whence \\w — x\\ = t. This also implies that w = (w — x) + x € E1. Take y € i1 with \\y — x\\ = t. Replacing, if necessary, y by y as above, we may assume that sgn(yk — Xk) — sgn(xk) for k > 1. Then, by the convexity of / , Df (y, x) - Df (w, x) = f(y) - f{w) -(y-w,
V / ( x ) ) >{y-w,
Vf(w)
-
Vf{x)).
But ( V / H - Vf(x))k
= (l + l)
^(xk)
( J ^
+ \xk\1/k
- M
1 A
) = s-sgn(xfc).
Hence Df{y,
x) - Df(w, x)>s = s
J2k>1
s
&n(xk) {Vk ~ wk)
( E f c > 1 sgn(Xk^ (yk ~ Xk"> ~ E f c > i s S n ( x *) (™«= =s ( E ^ iw - Xfci - E t > x i""= - *fc0 = °-
_ x
*))
It follows that i?/,i(t) = Df(wt,x) > 0, the last inequality because / is strictly convex and wt =£ x. As i?flX is nondecreasing we obtain that i?/, x (t) > 0 for every t >0. Exercise 3.5 (i) Taking / „ : X ->• R, / n ( x ) := X)fc=i 2 _ f c (*> xife)2> w e h a v e t h a t fn is convex, 0 < fn{x) < \\x\\2 and {fn(x)) -> / ( x ) for every x G X. Therefore /
340
Exercises - Solutions
is convex, nonnegative and continuous (being bounded above on Ux)- Let x € X, M 6 S x a n d t £ R \ {0}. Then
f{x + tu) - f(x) - V
, - n + l (x,xn)
• {u,xn)
= l*l£ > 2-"(«,^>2<|*|. *•—'rc>l
Hence / is uniformly Frechet difFerentiable and V / ( x ) = S n > i 2~™+1 (x, xn) • x"n for every x € X. It is obvious that g := y/J is quasiconvex and g(Xx) = |A| g(x) for all x 6 X and A £ R. Using Corollary 2.2.3 we obtain that g is sublineax, and so g is a semi-norm. For x 6 X we have that f(x) = 0 4=S> s(x) = 0<S>[Vn£N : (x, x*) = 0] ^ x € A X . Therefore is a norm if and only if A x — {0} if and only if X* — A±A- — w*cl(linA). For x,y E X and A €]0,1[ we have (by a simple calculation) that A/(x) + (1 - X)f(y) - /(Ax + (1 - A)y) = A(l - A) • f(x - y). So, / is strictly convex if and only if f(z) = 0 ^ 2 = 0, which is equivalent to y/J be a norm. Of course, if A is ||-||-dense in Sx* then lin A is ||-||-dense in X*, which implies that lin A is w* -dense in X*. (ii) Assume that X* is separable; then Sx* is separable, too; hence there exists A :— {x* | n € N} C Sx* dense in Sx*- Consider / defined in (i). From (i) we have that / is strictly convex, and so is |||| 2 + / . As in (i) we obtain that IHIj := \J\\-\\2 + f is a norm on X.
Because ||-|| < Mj^ < \/2||-||, H-^ is
equivalent to ||-||. Since ||-||j is strictly convex, from Theorem 3.7.2(v) we have that (X, H'llj) is strictly convex. Recall that X is separable when X* is separable. Indeed, if X* is separable there exists A := {x*n \ n S N} C Sx* dense in Sx* • For every n S N consider x„ € Sx such that {xn,x*n) > 1/2 and take B := {xn \ n 6 P^}. Let x* 6 Sx*; there exists n € N such that ||x* — x*|| < 1/2. Then (x n ,x*) — ( x n , x ^ ) + (xn,x* — x^) > 1/2 — ||x* — x^|| > 0, which shows that x* £ Bx. Therefore B1' = {0}, and so X is separable. Assume now that X is reflexive and separable. Because X* is separable, there exists an equivalent norm ||-1| x on X such that {X, HHj) is strictly convex. Using Theorems 3.7.3 and 3.7.2(iv) we obtain that 5GHI1)2 is Gateaux differentiable. Taking h{x*) := Y,n>i 2~n (xn,x*)2 , where {xn \ n £ N} C Sx is ||-||-dense in Sx, by (i) we have that k := h + jflMIJ) 2 is strictly convex and Gateaux differentiable. Using again Theorem 3.7.2, it follows that Vk is a norm on X* equivalent to ||||J such that (X* ,Vk) is smooth and strictly convex. Then the
Exercises -
Solutions
341
dual norm of Vk on X** = X, denoted by |||| 2 , is equivalent to the initial norm and (X, ||• || 2 ) is smooth and strictly convex. Exercise 3.6 The fact that P 9 t H* t~pip(t) is nondecreasing means that ip(ct) < cp(p(t) for all c e]0,1[ and t > 0. Let 0 < si < s 2 and v? e (si) < ^J. Then there exists t < ft such that ip{t) > si. Let c := (si/s2) 1 / ' p €]0,1[. Then si < y(*) = V (c (c - 1 *)) < c*V (c - 1 *) = —V (c - 1 *) , $2
and so S2 <
= sup((x,x*) -ipo(dc(x)))
= min(^o(A) +
{\dc)*(x*))
# = min (> (A) + A c ^ ( A - V ) )
= min (V*(A) + A ( ^ ( A " V ) + iUx. (A~ V ) ) ) = min fV"*(A) + L*C{X*) + i\ux, =
(z*)) = <-c(x*) +
min
, V"*(A)
ih(x*)+ip#(\\x*\\),
because V* is nondecreasing; for x* = 0 the equality is obvious. Since ip o dc is continuous and convex, there other formula follows from the bi-conjugate theorem. Exercise 3.8 By Theorem 3.10.1, there exists c > 0 such that f*{x*) = /*(0) + i*s{x*) for all x* € cUx>- Because 5 is bounded, t j is continuous, and so subdifferentiable at any x* S X*. From the preceding relation we obtain that df{x*) = diS{x*) C S for every x* e X* with ||a;*|| < c. Hence, for x* € cBx* there exists x G S such that x € df(x*), or, equivalently, x* 6 df(x). Therefore cBx- C 0 / ( 5 ) .
342
Exercises - Solutions
E x e r c i s e 3.9 The implications (i) =$• (ii) •» (iii) => (iv) are obvious, while the implication (iii) => (i) follows from Corollary 3.4.2 applied to h and xj)(t) := a~1t for t 6 [0,(5]. Assume that S is compact and (iv) holds. Then for every z S S there exist 8z,az > 0 such that ds{x) < azh(x) for every x € D(z,6z). Since S is compact and S C \JZ£sB(z,5z/2), there exists a finite set So C S such that S C Uzes0B(z,6z/2). Let a := max{a z | z € So} > 0 and 5 := min{5 z /2 | z £ So} > 0. Consider i 6 l with ds(x) < 5. Since 5 is compact, there exists z e S such that ||x — z\\ < 5. For this z, there exists zo 6 5o such that z 6 £(20, SZQ/2). Hence \\x — zo\\ < \\x — z\\ + \\z — zo\\ < 5 + SZQ/2 < 5Z0. By the choice of 5Z0 and azo we have that ds(x) < azoh(x) < ah(x). The proof is complete.
E x e r c i s e 3.10 1) Note that / B ( - , M ) — /El/i|| • ||. Therefore JB{-,P) is a convex function and dom/s(-,/x) = X. So, if / s ( - , / i ) takes the value —oo then it is identical —oo. Suppose that JB{-,P) does not take the value —oo; hence the function is finite. Let us prove that }B{-,P) is Lipschitz with Lipschitz constant p,. Let x i , X2 £ X. For e > 0 there exists yi € X such that fB{xi,p)
> f{y\) + [J.\\xi — a/i11
-e,
and so fB(x2,p)-fB(xi,p)
< f(yi)+p\\x2-yi\\-f{yi)-p\\x1-y1\\+e
<
/J,\\x2-xi\\+e.
Since e > 0 is arbitrary, we obtain that IB(X2,H)
- fB(xi,fi)
< p\\x2 -
xi\\.
Interchanging x\ and x2 we obtain that /s(-,/*) is Lipschitz, with Lipschitz constant p. It is obvious that for 0 < pi < p2 one has /B(-,A*I) < /s(-,P2) < /• Let us prove now the last statement of 1). Note that, since / € T(X), there exists x* € X* and 7 6 R such that VxG X : f(x) >
(x,x*)+-y.
Let x € X be fixed, and R 3 A < f(x). Since / is lsc at x, there exists p > 0 such that f(x) > A for x € D(x,p). Let ^ := max{||x*||, (A + ||x*|| — 7 — (x,x*))/p}.
Exercises - Solutions
343
For p > p we have fB(x,p)
= mm\
inf
(f{y) + p\\x-y\\),
> mkJ
inf
(A + p\\x - y\\),
\_yeD(x,p)
inf
(/(y) + p\\x - y\\) Uy,x*) + 7 + p\\x - y\\) \
y$D(x,p)
> min{A,inf{(x,x*) +t{u,x*) > min {A, inf {(x, x*)+j
inf
)
+ 7 + pt | u G 5, t > p}}
+ t(p-
\\x* ||) | t > p}}
> A. Therefore lim^,-^ / B ( X , p ) = f{%)- Since x is arbitrary, we have the desired conclusion. 2) Let XQ G X be fixed and g : X -¥ R, ff(x) := / ( x ) + /J||X — a?o||- It is clear that dg{x0) = df{x0) + pUx* when XQ G d o m / . (a) =>• (b) From the relation df(xo)C\pUx* 0 G dg(xo). Therefore
Vx G X •. f(x0) =
# 0 we have that xo G d o m / and
g(xo)
and so / ( x 0 ) = / s ( x 0 , / f ) . (b) =>• (c) By what was proved above and using the hypothesis we have that —00 < /(xo) = /s(xo,/i) < 00. Therefore fB{xo,p) G R. From this, taking into account 1), it follows that /B(-,A*) is convex and continuous. Therefore df(xo,p) ^ 0- Since JB{-,P) is an exact convolution at xo = xo + 0, using Corollary 2.4.7, we have that dJB{xo,p) = df{xo) f\pUx-(c) => (a) Since JB{XO,P) € R, by 1) we have that /s(-,/i) is convex and continuous. Therefore df(xo,p) ^ 0, and so (a) is true. E x e r c i s e 3.11 1) Since / € T(X), there exist x* G X* and 7 G R such that V x G X : /(x) > ( x , x * ) + 7 .
(°)
Therefore for every x € X, V2/GX : / ^ ( 2 / ) : = / ( y ) + f | | x - 2 / | | 2 > f | | x - 2 / | | 2 - | | x * | | - | | x - y | | + <x,x*>+7; (*) this implies that /M(X,/U) > —00 for every x € X. Since fsi{-,p) = / d f II ' II 1 we have that dom/M(-,A*) = d o m / + X = X; hence / M ( - , M ) is finite. Since / is convex and ^|| • ||2 is convex and continuous, it follows that fivi{-,p) is convex and continuous. The inequalities /M(-,A*I) < fiv[{-,p2) < / are obvious for 0 < pi < p2- The proof of lim^-nx, JM{X,P) = f(x) for every x G X is completely analogue to that of Exercise 3.10.
Exercises - Solutions
344
Let x* e df(x). VySX
Then : (y-XjX*)
+ z\\y-x\\a<m
+
$\\y-x\\2-f(x).
Taking the infimum with respect to y in both members of the inequality we obtain — jH|x*|| 2 < / M ( X , P ) — fix), i.e. our conclusion. (In this case relation lim^-Kx) / M ( X , / Z ) = fix) is immediate.) 2) Prom (*), the function f^^ is coercive; being also lsc, since X is a reflexive Banach space, there exists an element xM 6 X such that /M(x,/i)=/(^)+f||x/i-x||2.
(")
(If X is strictly convex, using Theorem 3.7.2(v), the function f || • || is strictly convex, whence f^.lX is strictly convex, too; in this case xM is unique.) Furthermore, using Corollary 2.4.7, we have dfM(x,[i)
= df{x^)r\fj.^x{x
-x,j.).
(***)
Let us consider the operator J^ : X —> X, J M (x) := xM. Relation (**) proves the formula of the statement. Let x 6 d o m / be fixed, and (i > 0. By (°) and (**) we have that <JM(x),x*> + 7 + IWMx) - x\\2 < fM(x,fi) < f{x), and so, for r\ := fix) — 7 — (x, x*), we have that V^>0 :
f||J^)-zll2-||s*ll-ll-W-zll-^O.
Thus we have
V M > 0 : ||J M (x)- a ;||< M - 1 (||x*|| + Vlk1| 2 + 2 W ) . Taking the limit for n —»• 00 we obtain that limM->co J^ix) = x. By (**) we have that / ( J M ( x ) ) < / M ( X , / / ) < f(x) for every /J, > 0. Since / is lsc and limM_,oo Jji(x) = x, we obtain that fix) < liminf / (J M (x)) < limsup / (J M (x)) < and so l i m ^ o o / (J M (x)) = f(x). 3) Let x* 6 cJ/(x) and x^ e dfu{x,n) is continuous). Using (***) we have that
fix),
for /i > 0 (x^ exists because / M ( , / ^ )
fi{x - J M (x),x*) = [i2\\x - J^(x)|| 2 = ||x* ||2 and, taking into account 1) and (**), (x - J M (x),x;) < fix) - fiMx))
< ^ | | x * | | 2 - 1 \\x -
J,ix)\\2.
345
Exercises - Solutions
Using the above relations we obtain that ||a;*|| < ||a;*||. Since x* € df(x) is arbitrary, we have that ||x^|j < dgf(x)(0). Let now (fj.i)iei —*• oo and x* = w* — limx*^ (such a convergent net exists because Ux- is iu*-compact). Because x*p € 9 / ( J M ( x ) ) and (JM(a:)) —> x, by Theorem 2.4.2(ix) we have that x* € df(x). Moreover, because the norm of X" is w*-\sc, we have that dgf^(0) < \\x*\\ < liminfjg/ ||xjlj|. Prom the first part we get lim||x* ; || = da/(z)(0) = ||x*||, and so x* € Pgf(x)(0). Therefore lim^^oo ||x* || = d o / ( l ) ( 0 ) . 4) Assume that X is smooth. By Theorem 3.7.2(iv), the duality mapping $ x is single-valued. From (***) we have that 5 / M ( X , / J ) = p.$x{x — J^(x)). Using Corollary 2.4.10 we obtain that fu(-,p) is Gateaux differentiable and V / M ( X , / I ) = p.$x{x - J^(x)) for every x e X. Assume now that df(x) ^ 0. The space X being reflexive, by Theorem 3.7.3, X* is strictly convex. It follows that -Pe/(x)(0) is a singleton {XQ}. From 3) we obtain that XQ is the sole limit point of the net ( V / M ( : K , M ) ) f ° r A* *~• °°> a n d so XQ = w* — limM_>oo V / M ( X , M ) - Because ( | | V / M ( X , M ) I I ) —> ||xo||, we obtain that ( V / M ( X , / J ) ) —> XQ when X* has the Kadec-Klee property. 5) Let now X be locally uniformly smooth and consider x G X; from 4) we have that / M ( - , M ) is Gateaux differentiable and V/Af(x,/i) = pNfi(x — J^(x), where / 2 := | | | • || 2 . Then for y e X we have 0 < / M ( 2 / , M ) - / M ( X , P ) - {y-x, 2
VfM(x,fj,))
< HMx)) + fHy - J„(z)|| - f(J,(x)) - $||* - JM(*)||2 -(y- x,fj.vf2{x - JM(x))) x J x J X = Kf*(y - M( )) - M - »( )) -(y-«, v/ 2 (x - J M (X)))).
From Theorem 3.7.4 we obtain that /M(-,AO is Frechet differentiable at x. Using Theorem 3.3.2 we obtain that V / M ( - , / I ) is continuous. Exercise 3.12 Let ip 6 Ao 0 No and if) = coip. It is obvious that 0 < tp <
(**„)-•*',
(Ai)-»V,
(AUn)-^!/',
(AX)->r;,
w i t h x ' . f , ^ , ^ e l + and A* € [0,1] for i € {1,2,3}. Of course, A1 + A2 + A3 = 1. y1 +y2 +y3 - x and T1 + T2 + T3 = 0. Therefore T1 = T2 - T3 = 0. If A' > 0, since ( A ^ ) -¥ 0, we have that tl — 0, and so (p(xln)) —>• 0. Since ip
Exercises - Solutions
346
is nondecreasing, from our hypothesis we obtain that (xzn) —> x% = 0, which, at its turn, implies that yl — 0. If A' = 0 and y% > 0 then (x'n) -¥ oo. Therefore there exists rij 6 N such that xn > p for every n > n,. Thus, for n > n; we have that £J, >
= (x,anx)
E co(epiy?) C e p i ^ .
Hence (x,0) € epii/'. Therefore ^(s;) = 0 for every x > 0. The solution is complete. Exercise 3.13 Let 1 6 X and x\ e A". By Theorem 3.8.4(vi) we have that xi=
PK(x)&\/y
eK
: (x - Xi\xi
- y) >0
•«-Vt/€-ftT : (x — xi | Xi) > (x — Xi \y) <&VyeK
:
(x-xi\xi)>0>(x-xi\y)
<$ x — x\ G K~, (x — xi | xi) = 0. Let xi = P K ( S ) . By the above characterization we have that X2 := x — xi S -fiT-, x — X2 = x\ 6 (K~)~ = K, (x — xi \ X2) = (x\\x
— X2) = 0.
By the same characterization we have that X2 = PK- (X)- Therefore the conclusion is true. Suppose now that x = xi +X2 with xi S K, x% € K~ and (xi | X2) = 0. Then xi E K, x — xi € K~ and (x — xi | x i ) = 0. Therefore, using again the above characterization, we have that xi = PK(X), and so X2 = PK- (X). Exercise 3.14 (a) Using the equivalence (i) &• (ii) of Theorem 1.3.16 with T = Idy and C = A, there exists m > 0 such that \\A*y\\ > m\\y\\ for every y e Y* = Y. It follows that \\Ty\\ • Hvll > (Ty,y) = (A'y,Amy)
= \\A'y\\2 > m2 \\y\\2.
So, \\Ty\\ > m2 \\y\\ for every y G Y. Of course, this relation implies that T is injective. Since T* = T, the preceding inequality can be written as ||T*j/|| > m2 \\y\\ for every y € Y. Using again the equivalence (i) •«• (ii) of Theorem 1.3.16 with T replaced by Idy and C replaced by our T, we obtain that T is surjective, and so T is bijective.
Exercises - Solutions
347
(b) Consider the function / : X ->• R, / ( s ) = \ \\x\\2 + tc{Ax + y0). It is obvious that / is coercive and strictly convex. Using Theorem 2.5.1 and Proposition 2.5.6 we obtain that / has a unique minimum point x G X. By Corollary 2.8.5 there exists v 6 N{C\Ax + yo) such that x = — A*v. Of course, y := Ax + j/o G C, and so Ax = —{A o A*)v = — Tv, whence y — yo = —Tv. It follows that T _ 1 (j7 - y0) — -v, and so {A* o T)(y - y0) = —A*v = x. Since (y — y,v) < 0 for every y € C, we have that
Vj/GC : (y-y^-iy-pT-'y
+
pT-'yo^^O,
where p > 0. By Theorem 3.8.4(vi) we have that y = Pc{y - pT~ly + pT~1y0). Conversely, if y = Pc(y - p T " l y + pT _ 1 yo) for p > 0 and x = (A* o T _ 1 ) ( j / yo), taking v := —T_1(j7 — yo) we have that x = — A*v, Ax + j/o = y € C and w £ iV(C; Ax + j/o)- Using again Corollary 2.8.5 we obtain that x is the solution of (P). E x e r c i s e 3.15
Applying Exercise 2.41 we find a, /3 G M, a > 0, such that
g(x) > a\\x\\ + 0 for every x G X. Consider ( ( a ^ ^ n ) ) C gcdg with (x*) -4 0. Let s G dom g be fixed. Then " I W I + 0 - \\xn -x\\
• | K | |
There exists no G N such that ||x*|| < a / 2 for n > no, and so \\xn — x\\ < 2a~1(g(x) + a \\x\\ — fi) for n > no- Hence the sequence (xn) is bounded. Assume that the conclusion is false. Then there exist an increasing sequence (rik) C N and t > infg such that g{xnk) > t for every k € N. Because X is reflexive and (xn) is bounded, we may assume that ( x „ J 4 i 6 l As (xj;) M! 0, by Theorem 2.4.2(ix), we have that 0 € dg(x), and so p(x) = inf g. The inequality £ + (x — x„k,x*nk) < g(x„k) + (x — Xnk,x*nk) < g{x) yields the contradiction t < inf p. E x e r c i s e 3.16 (a) Because g* is iu*-lsc, g* is w-lsc, too. Since X* is a Banach space it follows that g* is continuous at 0, and so 0 G int(dom*). By the preceding exercise we get g G T. (b) Let K and Xo be as in Exercise 2.24 (with / replaced by g). Taking into account Exercise 2.24, we obtain that has the properties in (a), and so <7 has good asymptotic behavior. Let ((x„,x^)) N C dg be such that (||xJi||) —• 0. Then, by the same Exercise 2.24, we have that ((x^,a;* | X ± ) ) N C dg. Since (Iknlx-1- II) ""* 0, we have that (g(x^)) = (g(xn)) —• inf g = inf g. Hence g G T. Exercise 3.17 Consider / := g + iA- Then foo{u) = gx{u) + tA00(w) = — a~l < 0. Using Theorem 3.10.13 we get the existence of an a > 0 satisfying the conclusion (maybe not our a ) . In fact, for x G d o m / \ C = ( A n doing) \ C and
348
Exercises - Solutions
t := a- fix) we have that / (x + tu) < fix) + t/ 0 0 (u) = 0. So x + tuE C, whence dcix) < \\x + tu — x\\ = t = a • fix). E x e r c i s e 3.18 It is obvious that inf
= ( - ^ i _ - 1 , - ^ J L _ ) for (x,y)
/
(0,0) and Vg2ix,y)
=
2
(-2,2y) for (x,y) £ R . Using Corollary 3.10.9 we obtain that »•«(*) = kgi(t) •
f
= inf {\\Vgxix,y)\\
1 , 2t + 2 x + l+t
•• i n f •
\ y/x2 + y2 - x - 1 = t)
x> t-±l }=Q
W >-l,
and *•«(«) = *«(*) = inf {||V 5 2(x, j/)|| I y 2 - 2x - 1 = t} = {2y/l
+ y» | y € R } = 2
for any t € R. Note that giix,y) = giix,y) • c(x,y), where e(x, «/) := ^ a ; 2 + y 2 + x + 1 > 1. It follows that 93(3;, 1/) = 32(2;, y) > 0 if y 2 — 2a; > 1 and 33(2;, y) = giix,y) < 0 if 2/2 - 2x < 1; moreover, [31 = 0] = [32 = 0] = [33 = 0] = {(x,y) € R 2 I y 2 = 2x + 1}. Prom the very definition of r / = rj and A;/ we obtain that »-M(0) = r 9 1 (0) and fcM(0) = fcOT(0).
Bibliography
Adly, S., Ernst, E. and Thera, M. (2001a) "A characterization of convex and semicoercive functionals", J. Convex Anal. 8(1), 127-148. Adly, S., Ernst, E. and Thera, M. (2001b) "Stabilite de l'ensemble des solutions d'une inequation variationnelle non coercive", C. R. Acad. Sci. Paris Ser. I Math. 333, 409-414. Amara, C. (1998) Direction de majoration, ensembles sous-seriellement convexes. Applications a l'optimisation, PhD thesis, University of Avignon, Avignon, France. Amara, C. and Ciligot-Travain, M. (1999) "Lower CS-closed sets and functions", J. Math. Anal. Appl. 239(2), 371-389. Anger, B. and Lembcke, J. (1974) "Hahn-Banach type theorems for hypolinear functionals", Math. Ann. 209, 127-151. Asplund, E. and Rockafellar, R. T. (1969) "Gradients of convex functions", Trans. Amer. Math. Soc. 139, 443-467. Attouch, H. and Beer, G. (1993) "On the convergence of subdifferentials of convex functions", Arch. Math. (Basel) 60(4), 389-400. Attouch, H. and Brezis, H. (1986) Duality for the sum of convex functions in general Banach spaces, in "Aspects of mathematics and its applications", North-Holland, Amsterdam, pp. 125-133. Attouch, H. and Thera, M. (1996) "A general duality principle for the sum of two operators", J. Convex Anal. 3(1), 1-24. Aubin, J.-P. and Frankowska, H. (1990) Set-valued analysis, Birkhauser Boston Inc., Boston, MA. Auslender, A., Cominetti, R. and Crouzeix, J.-P. (1993) "Convex functions with unbounded level sets and applications to duality theory", SIAM J. Optim. 3(4), 669-687. Auslender, A. and Crouzeix, J.-P. (1988) "Global regularity theorems", Math. Oper. Res. 13(2), 243-253. Auslender, A. and Crouzeix, J.-P. (1989) "Well-behaved asymptotical convex 349
350
Bibliography
functions", Ann. Inst. H. Poincare Anal. Non Lineaire 6(suppl.), 101-121. Analyse non lineaire (Perpignan, 1987). Aussel, D., Corvellec, J.-N. and Lassonde, M. (1995) "Mean value property and subdifferential criteria for lower semicontinuous functions", Trans. Amer. Math. Soc. 347(10), 4147-4161. Aze, D. (1994) "Duality for the sum of convex functions in general normed spaces", Arch. Math. (Basel) 62(6), 554-561. Aze, D. (1997) Elements d'analyse convexe et variationnelle, Ellipses, Paris. Aze, D. and Penot, J.-P. (1995) "Uniformly convex and uniformly smooth convex functions", Ann. Fac. Sci. Toulouse Math. (6) 4(4), 705-730. Aze, D. and Rahmouni, A. (1996) "On primal-dual stability in convex optimization", J. Convex Anal. 3(2), 309-327. Barbu, V. and Da Prato, G. (1985) "Hamilton-Jacobi equations in Hilbert spaces: variational and semigroup approach", Ann. Mat. Pura Appl. (4) 142, 303349 (1986). Barbu, V. and Precupanu, T. (1986) Convexity and optimization in Banach spaces, second edn, D. Reidel Publishing Co., Dordrecht. Bauschke, H. H., Borwein, J. M. and Combettes, P. L. (2001) "Essential smoothness, essential strict convexity, and Legendre functions in Banach spaces", Commun. Contemp. Math. 3(4), 615-647. Bauschke, H. H., Borwein, J. M. and Tseng, P. (2000) "Bounded linear regularity, strong CHIP, and CHIP are distinct properties", J. Convex Anal. 7(2), 395-412. Borwein, J. and Kortezov, I. (2001) "Some generic results on nonattaining functionals", Set-Valued Anal. 9(1-2), 35-47. Wellposedness in optimization and related topics (Gargnano, 1999). Borwein, J. M. (1983) "Adjoint process duality", Math. Oper. Res. 8(3), 403-434. Borwein, J. M. and Lewis, A. S. (1992) "Partially finite convex programming. I. Quasi relative interiors and duality theory", Math. Programming 57(1, Ser. B), 15-48. Borwein, J. M. and Lewis, A. S. (2000) Convex Analysis and Nonlinear Optimization. Theory and Examples, Springer-Verlag, New York. Borwein, J. M. and Preiss, D. (1987) "A smooth variational principle with applications to sub differentiability and to differentiability of convex functions", Trans. Amer. Math. Soc. 303(2), 517-527. Borwein, J. M. and Vanderwerff, J. D. (1995) "Convergence of Lipschitz regularizations of convex functions", J. Funct. Anal. 128(1), 139-162. Borwein, J. M. and Vanderwerff, J. D. (2000) Convex functions of Legendre type in general Banach spaces, Technical report, CECM, Simon Fraser University, Barnaby. Bourbaki, N. (1964) Elements de mathematique. Topologie generale, Hermann, Paris. Burke, J. V. and Ferris, M. C. (1993) "Weak sharp minima in mathematical programming", SIAM J. Control Optim. 31(5), 1340-1359.
Bibliography
351
Butnariu, D. and Iusem, A. N. (2000) Totally convex functions for fixed points computation and infinite dimensional optimization, Kluwer, Dordrecht. Caristi, J. (1976) "Fixed point theorems for mappings satisfying inwardness conditions", Tran. Am. Math. Soc. 215, 241-251. Carja, O. (1989) "Range inclusion for convex processes on Banach spaces; applications in controllability", Proc. Amer. Math. Soc. 105(1), 185-191. Carja, O. (1998) Elements of nonlinear functional analysis, Editura Universita^ii "Al.I.Cuza", Ia§i. Romanian. Castaing, C. and Valadier, M. (1977) Convex analysis and measurable multifunctions, Springer-Verlag, Berlin. Lecture Notes in Mathematics, Vol. 580. Cioranescu, I. (1990) Geometry of Banach spaces, duality mappings and nonlinear problems, Kluwer Academic Publishers Group, Dordrecht. Clarke, F. H. (1983) Optimization and nonsmooth analysis, John Wiley & Sons Inc., New York. A Wiley-Interscience Publication. Combari, C , Laghdir, M. and Thibault, L. (1994) "Sous-differentiels de fonctions convexes composees", Ann. Sci. Math. Quebec 18, 554-561. Combari, C , Laghdir, M. and Thibault, L. (1999) "On subdifferential calculus for convex functions defined on locally convex spaces", Ann. Sci. Math. Quebec 23(1), 23-36. Cominetti, R. (1994) "Some remarks on convex duality in normed spaces with and without compactness", Control Cybernet. 23(1-2), 123-138. Parametric optimization. Coodey, M. and Simons, S. (1996) "The convex function determined by a multifunction", Bull. Austral. Math. Soc. 54(1), 87-97. Cornejo, O., Jourani, A. and Zalinescu, C. (1997) "Conditioning and upperLipschitz inverse subdifferentials in nonsmooth optimization problems", J. Optim. Theory Appl. 95(1), 127-148. Correa, R., Jofre, A. and Thibault, L. (1994) "Subdifferential monotonicity as characterization of convex functions", Numer. Funct. Anal. Optim. 15(5& 6), 531-535. Crouzeix, J.-P. (1980) "Conditions for convexity of quasiconvex functions", Math. Oper. Res. 5(1), 120-125. Crouzeix, J.-P. (1981) Continuity and differentiability properties of quasi-convex functions on R n , in S. Schaible and W. Ziemba, eds, "Generalized Concavity in Optimization and Economics", Academic Press, pp. 109-130. Crouzeix, J.-P., Ferland, J. A. and Schaible, S. (1992) "Generalized convexity on affine subspaces with an application to potential functions", Math. Programming 56(2, Ser. A), 223-232. Deng, S. (1997) "Computable error bounds for convex inequality systems in reflexive Banach spaces", SIAM J. Optim. 7, 274-279. Deng, S. (1998) "Global error bounds for convex inequality systems in Banach spaces", SIAM J. Control Optim. 36(4), 1240-1249. Deville, R., Godefroy, G. and Zizler, V. (1993) Smoothness and renormings in Banach spaces, Longman Sci. Tech., Harlow. Lecture Notes in Mathematics,
352
Bibliography
Vol. 580. Diestel, J. (1975) Geometry of Banach spaces, Vol. 485 of Lecture Notes in Mathematics, Springer-Verlag, Berlin. Dieudonne, J. (1966) "Sur la separation des ensembles convexes", Math. Ann. 163, 1-3. Dolecki, S. and Angleraud, P. (1996) "When a well behaving convex function is well-conditioned", Southeast Asian Bull. Math. 20(2), 59-63. Dontehev, A. L. and Zolezzi, T. (1993) Well-posed optimization problems, Springer-Verlag, Berlin. Efimov, N. V. and Stechkin, S. B. (1959) "Support properties of sets in Banach spaces and Chebyshev sets", Dokl. Akad. Nauk SSSR 127(2), 254-257. Ekeland, I. (1974) "On the variational principle", J. Math. Anal. Appl. 47, 324353. Ekeland, I. and Temam, R. (1974) Analyse convexe et problemes variationnels, Dunod. Collection Etudes Mathematiques. Eremin, I. I. and Astafiev, N. N. (1976) Introduction to the theory of linear and convex programming, Nauka, Moskow. (Russian). Giles, J. R. (1982) Convex analysis with application in the differentiation of convex functions, Pitman (Advanced Publishing Program), Boston, Mass. Hamel, A. (1994) "Remarks to equivalent formulation of Ekeland's variational principle", Optimization 3 1 , 233-238. Hiriaxt-Urruty, J.-B. (1982) e-sub differential calculus, in "Convex analysis and optimization (London, 1980)", Pitman, Boston, Mass., pp. 43-92. Hiriaxt-Urruty, J.-B. (1998) Optimisation et analyse convexe, Presses Universitaires de France, Paris. Hiriart-Urruty, J.-B. and Lemarechal, C. (1993) Convex analysis and minimization algorithms. I, Springer-Verlag, Berlin. Fundamentals. Hiriart-Urruty, J.-B., Moussaoui, M., Seeger, A. and Voile, M. (1995) "Subdifferential calculus without qualification conditions, using approximate subdifferentials: a survey", Nonlinear Anal. 24(12), 1727-1754. Hiriart-Urruty, J.-B. and Phelps, R. R. (1993) "Subdifferential calculus using e-subdifferentials", J. Funct. Anal. 118(1), 154-166. Holmes, R. B. (1972) A course on optimization and best approximation, SpringerVerlag, Berlin. Lecture Notes in Mathematics, Vol. 257. Holmes, R. B. (1975) Geometric functional analysis and its applications, SpringerVerlag, New York. Graduate Texts in Mathematics, No. 24. Hormander, L. (1955) "Sur la fonction d'appui des ensembles convexes dans un espace localement convexe", Ark. Mat. 3, 181-186. Ioffe, A. and Tikhomirov, V. (1974) Theory of extremal problems, Nauka, Moskow. Jameson, J. (1972) "Convex series", Proc. Cambridge Philos. Soc. 72, 37-47. Jeyakumar, V. and Wolkowicz, H. (1992) "Generalizations of Slater's constraint qualification for infinite convex programs", Math. Program. 57, 85-101. Joly, J.-L. and Laurent, P.-J. (1971) "Stability and duality in convex minimization
Bibliography
353
problems", R.I.R.O. 2, 3-42. Jourani, A. (2000) "Hoffman's error bound, local controllability, and sensitivity analysis", SIAM J. Control Optim. 38(3), 947-970. Kassara, K. (2000) "Feedback spreading control laws for semilinear distributed parameter systems", Systems Control Lett. 40(4), 289-295. Kelley, J. (1955) General topology, Van Nostrand, Princeton. Klatte, D. and Li, W. (1999) "Asymptotic constraint qualifications and global error bounds for convex inequalities", Math. Programming 84(5), 137-160. Kosmol, P. and Miiller-Wichards, D. (2001) Homotopy methods for optimization in orlicz spaces, in V. Bulatov and V. Baturin, eds, "Proceedings of the 12th Baikal International Conference on Optimization Methods and their Applications", Institute of System Dynamics and Control Theory of SB RAS, Irkutsk, pp. 224-230. Kusraev, A. G. and Kutateladze, S. S. (1995) Subdifferentials: theory and applications, Kluwer Academic Publishers Group, Dordrecht. Translated from the Russian. Kutateladze, S. S. (1977) "Formulas for the computation of subdifferentials", Dokl. Akad. Nauk SSSR 232(4), 770-772. Kutateladze, S. S. (1979a) "Convex operators", Uspekhi Mat. Nauk 34(1(205)), 167-196. Russian. Kutateladze, S. S. (1979b) "Convex e-programming", Dokl. Akad. Nauk SSSR 245(5), 1048-1050. Laurent, P. (1972) Approximation et optimisation, Hermann, Paris. Lemaire, B. (1985) Subdifferential of a convex composite functional. Application to optimal control in variational inequalities, in "Nondifferentiable Optimization", Springer-Verlag, Sopron, 1984. Lemaire, B. (1989) "Quelques resultats recents sur 1'algorithme proximal", Seminaire d'Analyse Numerique, Toulouse. Lemaire, B. (1992) "Bonne position, conditionnement et bon comportement asymptotique", Semin. Anal. Convexe, Univ. Sci. Tech. Languedoc, Expo. 5. Levitin, E. S. and Polyak, B. T. (1966) "Convergence of minimizing of sequences in the conditional-extremum problem", Dokl. Akad. Nauk SSSR 168, 9971000. Russian. Lewis, A. S. and Pang, J.-S. (1998) Error bounds for convex inequality systems, in J.-P. Crouzeix, J.-E. Martinez-Legaz and M. Voile, eds, "Proceedings of the 5th International Symposium on Generalized Convexity held in Luminy, June 17-21, 1996", Kluwer Academic Publishers, Dordrecht, pp. 75-110. Li, W. and Singer, I. (1998) "Global error bounds for convex multifunctions and applications", Math. Oper. Res. 23(2), 443-462. Lifsic, E. A. (1970) "Ideally convex sets", Funkcional. Anal, i Prilozen. 4(4), 76-77. Lions, P.-L. and Rochet, J.-C. (1986) "Hopf formula and multitime HamiltonJacobi equations", Proc. Amer. Math. Soc. 96(1), 79-84.
354
Bibliography
Luc, D. T. (1993) "On the maximal monotonicity of subdifferentials", Acta Math. Vietnam. 18(1), 99-106. Luc, D. T. and Swaminathan, S. (1993) "A characterization of convex functions", Nonlinear Anal. 20(6), 697-701. Marti, J. T. (1977) Konvexe Analysis, Birkhauser Verlag, Basel. Moreau, J. (1965) "Proximite et dualite dans un espace de Hilbert", Bull. Soc. Math. France 93, 273-299. Moussaoui, M. and Seeger, A. (1994) "Sensitivity analysis of optimal value functions of convex parametric programs with possibly empty solution sets", SIAM J. Optim. 4(3), 659-675. Moussaoui, M. and Voile, M. (1997) "Quasicontinuity and united functions in convex duality theory", Comm. Appl. Nonlinear Anal. 4(4), 73-89. Penot, J.-P. (1996a) "Conditioning convex and nonconvex problems", J. Optim. Theory Appl. 90(3), 535-554. Penot, J.-P. (1996b) "Subdifferential calculus without qualification assumptions", J. Convex Anal. 3(2), 207-219. Penot, J.-P. (1998a) Are generalized derivatives useful for generalized convex functions?, in "Generalized convexity, generalized monotonicity: recent results (Luminy, 1996)", Kluwer Acad. Publ., Dordrecht, pp. 3-59. Penot, J.-P. (1998b) Well-behavior, well-posedness and nonsmooth analysis, in "Proceedings of the 4th International Conference on Mathematical Methods in Operations Research and 6th Workshop on Well-posedness and Stability of Optimization Problems (Sozopol, 1997)", Vol. 12, pp. 141-190. Penot, J.-P. and Bougeard, M. L. (1988) "Approximation and decomposition properties of some classes of locally D.C. functions", Math. Programming 41(2 (Ser. A)), 195-227. Penot, J.-P. and Voile, M. (1990) "Inversion of real-valued functions and applications", Z. Oper. Res. 34(2), 117-141. Penot, J.-P. and Zalinescu, C. (2000) "Harmonic sum and duality", J. Convex Anal. 7(1), 95-113. Phelps, R. R. (1989) Convex functions, monotone operators and differentiability, Vol. 1364 of Lecture Notes Math., Springer-Verlag, Berlin. Phelps, R. R. (1993) Convex functions, monotone operators and differentiability, Vol. 1364 of Lecture Notes Math., second edn, Springer-Verlag, Berlin. Poliquin, R. A. (1990) "Subgradient monotonicity and convex functions", Nonlinear Anal. 14, 305-317. Polyak, B. T. (1966) "Existence theorems and convergence of minimizing of sequences in extremum problems with constraints", Dokl. Akad. Nauk SSSR 166, 287-290. Russian. Polyak, B. T. (1980) Sharp minimum, Technical report, Institute of Control Sciences, Moskow, USSR. Precupanu, A. (1976) Mathematical Analysis. Real functions, Editura Didactica §i Pedagogical, Bucharest. Romanian. Precupanu, T. (1992) Linear topological spaces and elements of convex analysis,
Bibliography
355
Editura Academiei Romane, Bucharest. Romanian. Robinson, S. M. (1976) "Regularity and stability for convex multivalued functions", Math. Oper. Res. 1(2), 130-143. Rockafellar, R. T. (1966) "Characterization of the subdifferentials of convex functions", Pacific J. Math. 17, 497-510. Rockafellar, R. T. (1969) "Local boundedness of nonlinear, monotone operators", Michigan Math. J. 16, 397-407. Rockafellar, R. T. (1970) Convex analysis, Princeton University Press, Princeton, N.J. Princeton Mathematical Series, No. 28. Rockafellar, R. T. (1974) Conjugate duality and optimization, Society for Industrial and Applied Mathematics, Philadelphia, Pa. Lectures given at the Johns Hopkins University, Baltimore, Md., June, 1973, Conference Board of the Mathematical Sciences Regional Conference Series in Applied Mathematics, No. 16. Rockafellar, R. T. (1976) "Monotone operators and the proximal point algorithm", SIAM J. Control Optimization 14(5), 877-898. Rockafellar, R. T. (1981) The theory of subgradients and its applications to problems of optimization, Heldermann Verlag, Berlin. Convex and nonconvex functions. Seeger, A. and Voile, M. (1995) "On a convolution operation obtained by adding level sets: classical and new results", RAIRO Rech. Oper. 29(2), 131-154. Shioji, N. (1995) "On uniformly convex functions and uniformly smooth functions", Math. Japonica 4 1 , 641-655. e
Simons, S. (1990) "The occasional distributivity of o over + and the change of variable formula for conjugate functions", Nonlinear Anal. 14(12), 1111— 1120. Simons, S. (1994a) "Subtangents with controlled slope", Nonlinear Anal. 22(11), 1373-1389. Simons, S. (1994b) "Swimming below icebergs", Set-Valued Anal. 2(1-2), 327337. Set convergence in nonlinear analysis and optimization. Simons, S. (1998a) Minimax and monotonicity, Vol. 1693 of Lecture Notes Math., Springer-Verlag, Berlin. Simons, S. (1998b) "Sum theorems for monotone operators and convex functions", Trans. Amer. Math. Soc. 350(7), 2953-2972. Singer, I. (1979) "A Fenchel-Rockafellar type duality theorem for maximization", Bull. Austral. Math. Soc. 20(2), 193-198. Soloviov, V. (1993) "Duality for nonconvex optimization and its applications", Anal. Math. 19(4), 297-315. Soloviov, V. (1997) "Dual extremal problems and their applications to minimax estimation problems", Russian Math. Surveys 52(4), 685-720. Stoer, J. and Witzgall, C. (1970) Convexity and Optimization in Finite Dimensions I, Vol. 163 of Die Grundlehren der Mathematischen Wissenschaften, Springer Verlag, Berlin. Sullivan, F. (1981) "A characterization of complete metric spaces", Proc. Am.
356
Bibliography
Math. Soc. 83, 345-346. Thibault, L. (1995a) A generalized sequential formula for sub differentials of sums of convex functions defined on Banach spaces, in "Recent developments in optimization (Dijon, 1994)", Springer, Berlin, pp. 340-345. Thibault, L. (1995b) "A note on the Zagrodny mean value theorem", Optimization 35, 127-130. Thibault, L. and Zagrodny, D. (1995) "Integration of subdifferentials of lower semicontinuous functions on Banach spaces", J. Math. Anal. Appl. 189(1), 33-58. Toland, J. F. (1978) "Duality in nonconvex optimization", J. Math. Anal. Appl. 56, 399-415. Ursescu, C. (1975) "Multifunctions with convex closed graph", Czechoslovak Math. J. 25(100)(3), 438-441. Vainberg, M. M. (1964) Variational methods for the study of nonlinear operators, Holden-Day Inc., San Francisco, Calif. With a chapter on Newton's method by L. V. Kantorovich and G. P. Akilov. Translated and supplemented by Amiel Feinstein. Vladimirov, A. A., Nesterov, Y. E. and Chekanov, Y. N. (1978) "On uniformly convex functionals", Vestnik Moskov. Univ. Ser. XV Vycisl. Mat. Kibernet. 3, 12-23. Russian. Vlasov, L. P. (1972) "Several theorems on Chebyshev sets", Mat. Zametki 11, 135-144. Voile, M. (1992) "Some applications of the Attouch-Brezis condition to closedness criterions, optimization, and duality", Sem. Anal. Convexe 22, Exp. No. 16, 15. Voile, M. (1993) "Sous-differentiel d'une enveloppe superieure de fonctions convexes", C. R. Acad. Sci. Paris Ser. I Math. 317(9), 845-849. Voile, M. (1994) "Sur quelques formules de dualite convexe et non convexe", SetValued Anal. 2(1-2), 369-379. Set convergence in nonlinear analysis and optimization. Walkup, D. W. and Wets, R. J.-B. (1967) "Continuity of some convex-cone-valued mappings", Proc. Amer. Math. Soc. 18, 229-235. Willard, S. (1971) General topology, Addison-Wesley, Reading. Zagrodny, D. (1988) "Approximate mean value theorem for upper subderivatives", Nonlinear Anal. 12, 1413-1428. Zalinescu, C. (1978) "A generalization of the Farkas lemma and applications to convex programming", J. Math. Anal. Appl. 66(3), 651-678. Zalinescu, C. (1980) "On an abstract control problem", Numer. Funct. Anal. Optim. 2(6), 531-542 (1981). Zalinescu, C. (1983a) "Duality for vectorial nonconvex optimization by convexification and applications", An. §tiinl;. Univ. "Al. I. Cuza" Ia§i Sec^. I a Mat. (N.S.) 29(3, suppl.), 15-34. Zalinescu, C. (1983b) "On uniformly convex functions", J. Math. Anal. Appl. 95(2), 344-374.
Bibliography
357
Zalinescu, C. (1984) Duality for vectorial convex optimization, conjugate operators and sub differentials. The continuous case. Unpublished. Presented at the Conference "Mathematical Programming — Theory and Applications", Eisenach. Zalinescu, C. (1986) "Letter to the editor: on J. M. Borwein's paper: "Adjoint process duality"", Math. Oper. Res. 11(4), 692-698. Zalinescu, C. (1987) "Solvability results for sublinear functions and operators", Z. Oper. Res. Ser. A-B 31(3), A79-A101. Zalinescu, C. (1989) Stability for a class of nonlinear optimization problems and applications, in "Nonsmooth optimization and related topics (Erice, 1988)", Plenum, New York, pp. 437-458. Zalinescu, C. (1992a) "A note on d-stability of convex programs and limiting Lagrangians", Math. Programming 53(3, Ser. A), 267-277. Zalinescu, C. (1992b) "On some open problems in convex analysis", Arch. Math. (Basel) 59(6), 566-571. Zalinescu, C. (1998) Mathematical programming in infinite dimensional normed spaces, Editura Academiei, Bucharest, Romania. (Romanian). Zalinescu, C. (1999) "A comparison of constraint qualifications in infinitedimensional convex programming revisited", J. Austral. Math. Soc. Ser. B 40(3), 353-378. Zalinescu, C. (2001) Weak sharp minima, well-behaving functions and global error bounds for convex inequalities in Banach spaces, in V. Bulatov and V. Baturin, eds, "Proceedings of the 12th Baikal International Conference on Optimization Methods and their Applications", Institute of System Dynamics and Control Theory of SB RAS, Irkutsk, pp. 272-284.
This page is intentionally left blank
Index
function B-differentiable, 191 S-smooth, 191 bcs-complete, 68 coercive, 100 strongly, 214 concave (strictly), 40 conjugate (Fenchel), 75 convex, 39 strictly, 40 strongly, 203 cost, 99 cs-closed, 68 cs-complete, 68 cs-convex, 67 distance, 237 ideally convex, 68 indicator, 41 Lagrange, 138 lcs-closed, 68 li-convex, 68 Lipschitz, 66
best approximation, 237 bornology, 190 Frechet, 190 Gateaux, 190 Hadamard, 190 cone, 1 dual, 7 normal, 87 recession, 6 tangent (of Clarke), 169 convolution, 43 directional derivative, 56 Clarke-Rockafellar, 171 e-, 58
upper Dini, 58 domain of function, 39 multifunction, 12 operator, 42 duality strong, 107 weak, 107 duality mapping, 230
locally, 66 marginal, 43 objective, 99 performance, 107 perturbation, 106 proper, 39 ^-conditioned, 196 with respect to, 196
epigraph of function, 39 strict, 39 operator, 42 359
360
Q-decreasing, 42 Q-increasing, 42 quasi-continuous, 67 quasi-convex, 41 recession, 74 p-convex, 203 cr-smooth, 204 on a set, 204 subdifferentiable, 80 sublinear, 4 support, 79 uniformly convex, 203 at a point, 201 on a set, 203 on bounded sets, 221 uniformly Frechet differentiable, 207 uniformly smooth on a set, 207 on bounded sets, 221 value, 107 weight, 227 well-behaved (asymptotically), 259 well-conditioned, 197 gage of conditioning, 196 smoothness (midpoint), 205 uniform convexity, 203 at a point, 201 uniform smoothness, 205 of the norm, 231 on a set, 204 global error bound, 262, 265 graph of multifunction, 12 half-space closed, 5 open, 5 hull affine, 2 conic, 2 closed, 7 convex, 2
Index closed, 7 linear, 2 lsc convex, 63 hyperplane closed, 5 supporting, 5 image of multifunction, 12 interior algebraic, 3 relative, 3 with respect to, 2 quasi relative, 15 Kadec-Klee property, 233 weak*, 233 Lagrange multiplier, 139, 140 Lagrangian, 138 lsc envelope, 62 lsc regularization, 62 max-convolution, 43 metrically regular system, 269 Minkowski gauge, 4 multifunction, 12 bcs-complete, 13 closed, 13 cs-complete, 13 cs-convex, 13 ideally convex, 13 inverse, 12 lcs-closed, 13 li-convex, 13 locally bounded, 286 at a point, 286 monotone, 82 cyclically, 84 maximal, 82 strictly, 82 normed space smooth, 227 strictly convex, 227
Index uniformly convex, 234 locally, 234 uniformly smooth, 231 at a point, 231 locally, 231 operator Q-concave, 42 Q-convex, 42 relatively open, 119 point local maximum, 105 local minimum, 105 saddle, 138 sharp minimum, 248 support, 5 problem convex programming, 99 dual, 107 normal, 109 primal, 107 stable, 109 process, 26 adjoint, 26 closed, 26 convex, 26 quasi-inverse greatest, 188 lowest, 188 semi-norm, 4 separation of two sets, 5 proper, 5 strict, 5 series b-convex, 9 Cauchy, 9 convergent, 9 convex, 9 set absorbing, 4 admissible solutions, 99
affine, 1 balanced, 1 bcs-complete, 9 Chebyshev, 237 constraints, 99 convex, 1 cs-closed, 9 cs-complete, 9 ideally convex, 9 level, 39 strict, 39 lower cs-closed (lcs-closed), 11 lower ideally convex (li-convex monotone, 82 maximal, 82 strictly, 82 polar, 7 quasi-continuous, 67 star-shaped, 1 symmetric, 1 weak sharp minima, 248 sets united, 17 Slater's condition, 138 solution admissible, 137 e-, 106 optimal, 99 space algebraic dual, 3 barreled, 9 complete, 8 first countable, 8 Prechet, 8 linear, parallel to, 2 orthogonal, 7 quasi-complete, 8 topological dual, 4 subdifferential, 80 abstract, 178 Clarke, 175 e-,82 Fenchel, 80 subgradient, 80 e-,82
362
support functional, 5 point, 5 theorem Alaoglu-Bourbaki, 8 Baire I, 34 Baire II, 34 Banach-Steinhaus, 25 biconjugate, 77 bipolar, 7 Bishop-Phelps, 166 Borwein, 159 Borwein-Preiss, 31 Br0ndsted-Rockafellar, 161 closed graph, 25 Dieudonne, 7 Eidelheit, 5 Ekeland, 29 Ioffe-Tikhomirov, 97 open mapping, 25 Pshenichnyi-Rockafellax, 136 Robinson, 23 Rockafellar, 169, 288 Simons, 22, 167 Ursescu, 23 Zagrodny, 179 value of a problem, 99 saddle, 144 Young-Fenchel inequality, 75
Symbols and Notations
A denotes A + B, A-B — Af A1 — -Aoo
A+, A++ A0, A00 Ax, A x x { A 'bA ic A A A0 aff A argmin / B[x,e) Bx Kx BdA
— — — — — — — — — — —
e*
—
C(A;a) C[a, b] C(X) clA co A, co A co/ cone A, cone A Df{x,) D(x,s) d
— — — — — — — ~
the closure of the set A C (X, r) the sets {a + b | a € A, b€ B}, {a - b | a € A, b £ B} the function defined by (Af)(y) = inf{/(x) | Ax = y} the algebraic interior of A (p. 3) the cone (~]t>0 t(A — a) with a € A when A = co A the sets {x* G X* | Vx £ A : {x,x") > 0} and (A+)+ thesets {x* e X* j Vx e A (x,x*) > - 1 } and (A 0 ) 0 the sets {x* € X* \Vx e A <x,x*) = 0} and (A-1)-1 the relative algebraic interior of A (p. 3) 'A if the parallel subspace to aff A is barreled, 0 otherwise 'A if aff A is closed, 0 otherwise the set {(p : R+ -+ R+ \
364 diam^l — dim A — dom/ — domH — domK — dyi(x), d(x, ^4) — epi / — epi s / — epi# — f\A — / — /oo — /+ — f'(t), f"(t) — /+(£), /-(£) — / ' ( a , •) — / T ( a , •) — f'e(a, •) — /*,/** — /iD/2 — /1O/2 — [/ < A], [/ < A] f,p — gr ft — grT — (Hx), (Hwx) — HX',a — H x*,a>Hx*,a — H }*,a>Ht*,a — Idx — ImT — Imft — inf 3, infx 3 — int yl — inty A — kerT — lcs — L 1 (0,1) — L(X, Y) — £>(X, Y) — liminfig/ A, liminf^^a f(x) l i m s u p i € / A;
Symbols and Notations the diameter of A C (X, d), i.e. swp{d(x,y) \ x, y
Symbols and Notations
365
l i m s u p ^ , , f(x) lin A — lsc — £p(p>l) — £°° — N — N(A; a) — No, Ni, N2 — N(x), Jfx{x) — ySx — yfx — P — V — Pc(x) — Prx — PA{X) — p, q, r — qri A — R, R — R+, R+ — RE, R — D? : X =£ Y —
•R-1
— inf v e W ( o ) sup.,. ev f(x) 6 R for / : (X, T) -> R the linear hull of the set A (p. 2) lower semicontinuous thespace{(a;n)cR|||(x„)||p:=(£n>iMP)1/p
— or1 : y = t x , or^y) — ^ |yeft(z)}forft:X =4 Y
ft o S ft + S ft(A) ft_1(B) rec A ri A rint A Sx S(f, C) S(P) Se(f,C) Se(P) SA supp/i T* To(M, x) tvs (t n )4-0 Ux use v(f, C)
— — — — — — — — — — — — — — — — — — — — —
the composition of the multifunctions ft and S (p. 12) the sum of the multifunctions ft and S (p. 12) the set {y € Y | 3 x € A : (x, y) € ft} the set {x £ X | 3 y eB : (x,y)eX) the set {u £ X \ Va 6 A : a + u e A} the set rint A if aff A is closed, 0 otherwise the relative interior of A, i.e. intaff.4 A the set Ux \ Bx the set {x £ C | V i € C : f(x) < f(x)} the set 5 ( / , C) for the problem: (P) min f(x), xeC the set {xeC\\/xsC : f{x)
Symbols and Notations
366 v(P) — w — w* — w.r.t. — X', X* — X/Xo — (X, d) — (X, r ) — Xi —¥ x, (xi) —> x, (xi) —* x — (x^) -^->- x* — {xnk)ke®t {xnk) [x,y],[x,y[ — }x,y[ — (x,x*) — y* — 2/i
v(f, C) for the problem: (P) min f(x), xeC the topology <x(X, X*) on the lcs X the topology a(X*,X) on the topological dual X* of X with respect to the spaces L{X,R) and £ ( X , R ) , respectively the quotient space of X w.r.t. the linear subspace Xo C X a metric space a topological space x = \imxi — the net (xi) converges to x the net (xi) converges to x for the topology w of X
the net (x*) converge to x* for the topology w* of X* — a subsequence of (xn) {(1 - X)x + Xy | A G [0,1]} and {(1 - X)x + Xy | A G [0,1[} the set {(1 - A)x + Ay | A G ] 0 , l[} the image of x G X by x* G X* the set Y U {oo} — the fact that J/2 — 2/i £ Q,for Q CY convex cone the function tdom / the set { 1 , . . . , n} for n G N a norm, the norm of the element x the Fenchel subdifferential of the function / (p. 80) the e-subdifferential of the function / (p. 82) the Clarke subdifferential of the function / (p. 175) abstract subdifferential (p. 178) the partial derivative of the function / with respect to x, d(df/dxi)/dxj the class of lsc convex functions / G A the sets TnAo, {
Symbols and
Notations
367
the gage of uniform convexity of / on rUx (p- 221) the gage of uniform convexity of / at x (p. 201) the sets r l~l fii, {a € Si | \imsupt^root~2ifi(t) < oo} gages of uniform smoothness of / (p. 204) see p. 210 gages of uniform smoothness of / on A (p. 204) gages of uniform smoothness of / on rUx (p- 221) the gage of smoothness of / at x (p. 191) the gage of smoothness of the norm at x (p. 231) the gage of uniform smoothness of the norm of X (p. 231) the set {/i: E -¥ [0, oo[ | supp/i is finite, ^2x€EKx) — 1} the set { u : I ->• [ji€I Xi\ V i e / : u(i) € X{} the topology associated with a metric d the topology generated by the family of semi-norms V function of the form Qp(x) := X2 n>0 /in ||z — Mn||p (p- 31) see pages 201, 225, respectively the gradient (the differential) of the function / at a the second order differential of the function / at a for all, for every there exists (at least one) maximum of extended real numbers or logical "or" minimum of extended real numbers or logical "and" a := b, b =: a means a is equal to b by definition the end of a proof or of a theorem without proof