This content was uploaded by our users and we assume good faith they have the permission to share this book. If you own the copyright to this book and it is wrongfully on our website, we offer a simple DMCA procedure to remove your content from our site. Start by pressing the button below!
=
' It follows from properties (a) and (b) that the Borel functions X.JVV'+ N~ (It), X.JVV'+ NifJ
have the same (finite) integral with respect to A. Since they bound between them the function x ~ N~(f), the latter is A-measurable and we have (fl,f) =
L
dJ.(x)N,(1)·
(11.2)
The following facts are thus established: Let f be a bounded, positive, Borel (respectively, universally measurable) function with compact support. The function x ~ Nif) is then Borel-measurable (respectively, A-measurable for every Radon measure A with compact support). This result extends immediately to a Borel (respectively, universally measurable) function witb compact support, but not bounded, by means of passage to a monotone limit. Since the space E is a-compact, we can next remove the restriction on the supports of the measure A and the function f Let A be a universally measurable set. Put N(x,A) =
N~(A).
It follows immediately from the above that the mapping (x,A).JVV'+ N(x,A) is a kernel on (E,81u(E)), and the relation N(x,f) = N~(f)
is established, by means of passage to a monotone limit, for every positive, universally measurable function f Keeping the previous meaning of A and p" relation (11.2) says that
AN = ft. Since the measure ft is a Radon measure, and since A is an arbitrary Radon measure with compact support, the kernel N is a dispersion. It only remains to verify the uniqueness of the kernel N, implied by the following observation: Let N and M be two diffusions such that Nf = Mf for every function f E re.;r: The kernels N and M are then identical. The measures E~ and E~M are, indeed, for every x E E, Radon measures that coincide on re.;r; they are thus equal and hence, for every positive universally measurable function f, Mf~ = <E~M,f> = <E~,f> = Nr·
The proof of the following theorem (which we shall in large part omit) is very similar to that of Theorem 11. It is, in fact, a little simpler, since we suppose that the space Eis LCC. T12 THEOREM Let E be an LCC space, and let ~loc be the vector space of locally bounded universally measurable functions on E. (a) Let N be a dispersion on E. Then the function f .JVV'+ Nf maps re.;r into ~loc' (b) Conversely, let V be a positive linear mapping of re.;r into ~loc; there then exists a dispersion N on E such that Nf = Vffor every function f Ere.;r, and such a kernel is unique.
IX, 13
178
Kernels and Resolvents
Proof We shall give a proof by contradiction for assertion (a). Suppose that there exists a function f E
and shows that the mapping X.JVV'-)- Nif) is universally measurable whenfis first a bounded Borel function with compact support, then an arbitrary positive Borel function, and finally a universally measurable function, as in the proof of the preceding theorem. 13
Examples of kernels
(a) Let (E,C) be a measurable space. Set I(x,A) = I..ix)
(x
E
E, A
E
C).
We obtain a kernel I, which satisfies the relation If = f for every measurable function f It is called the identity kernel. (b) Let n(x,y) be a positive function on the set E x E, measurable with respect to the a-field C x C, and let A. be a fixed positive measure on E. A kernel on (E,C) is obtained by setting N(x,A) = n(x,y) dJ.(y) (x E E,A E tf).
L
It is evident that N(x,j)
= SE n(x,y)f(y) dJ.(y).
Such a function n(x,y) is called a kernel function. It should be well noted that it is necessary to choose a measure A. in order to associate a kernel with a kernel function. Suppose for example that E is the space R3 (given the Borel a-field, and the Euclidian metric d). The kernel function r(x,y) = Ijd(x,y)
is the Newtonian kernel function in R3. If A. is chosen to be a Lebesgue measure, the function N(x,j) =
f.
RS
n(x,y)f(y) dA.(y)
is the Newtonian potential of the mass distribution with density f [one also considers the potentials ofpositive measures of the form x
-"I/II'+f.
RS
n(x,y) dlt(y),
but we are not interested in these potentials for now]. More generally, the kernel of order (X in Rn (0 < (X < 2; the case (X = 2 is that of the Newtonian kernel) is the kernel function [d(x,y)]cx-n. (c) Convolution kernels. Let E be a locally compact group, written multiplicatively, and let It be a positive measure on E. Define, for every x E E and every positive universally measurable function f, NI"
= tf(X Y ) dp.(y).
179
The Potential Theory of a Single Kernel
IX, 14, D15
We obtain a continuous kernel on E. Let A be a positive measure on E. We have (AN,!)
= (A,Nf) =
fE dA(X) f!(XY) df'(Y) = (A' f',f),
where the asterisk denotes convolution. We thus have AN = A * fl, which explains the name "convolution kernel" given to kernels of this type (note that we do not have Nf = fl *f).
2.
The Potential Theory
of a
Single Kernel
The following problem is of sufficient generality to contain almost all the classical forms of potential theory: given a family (Ni)iE I of dispersions on an LCC space E, to study the measurable functions f on E that satisfy the inequalities Nif
for every
i El
(or the Radon measures fl that satisfy the inequalities flNi < fl). The very special case of this problem, where the family (Ni) is a one-parameter semigroup of sub-Markov kernels, will occupy us for most of this book. As an introduction to this already very complex case, we quickly treat the case where the family (Ni) is reduced to a single kernel N. We do not use probabilistic methods in this section, which is essentially a compilation of results of Deny [(50), (51)] and Doob. The reader can find the probabilistic interpretations in Doob's paper (61). Excessive functions 14 We begin by studying excessive functions, and this study is more interesting for kernels on an abstract space. We thus denote by (E,C) a measurable space, and by N a kernel on (E,C). We set N° = I (the identity kernel), and denote by Nn the nth power of N under composition, for n > 1. The functions we call excessive (respectively, invariant) are called superharmonic (harmonic) by Deny, and superregular (regular) by Doob.
D15 DEFINITION Let f be a positive measurable function; f is said to be excessive with respect to the kernel N if Nf
One says that f is invariant with respect to the kernel N
iff is everywhere finite and if
Nf=f
The words "with respect to the kernel N" will be omitted in the rest of this section. Example The kernel N is sub-Markov (respectively, Markov) if and only if the constant 1 is excessive (respectively, invariant). Remarks (a) It is evident that the inequalities f> Nf> ... > Nnf> Nn+1J> ...
hold iffis excessive. We denote by N°Ofthe limit of N'1asn is not in general a kernel).
~ 00
(the mappingf.AN+ N°Of
IX, T16, D17, T18
180
Kernels and Resolvents
(b) Definition 14 does not limit the size of exCessive functions. The constant
+ 00, for
example, is an often undesirable excessive function. We are thus led to consider certain classes of excessive functions, for example, the class of excessive functions [such that the function N OO f is everywhere finite, or the class of excessive functions f such that the function Nf is everywhere finite. These last functions are called (following Doob) excessive functions in the strict sense. Note that the set {f = oo} is then negligible for every measure Br£N. (c) Classical potential theory also deals with superharmonic functions, which are not necessarily positive. We could introduce here the "signed excessive functions," which are measurable functions f having the following properties: (ex) for every x
E
-
00
< + 00
> O.
((3)
E and every n
for every
x
E
E; Nn(x,f-)
< + 00
Nf
The first condition implies that the integrals Nn(x,f) are all defined. The notion of a (signed) excessive function in the strict sense is obtained by replacing condition (ex) by the condition N(x, If I) < + 00 for every n > o. Signed invariant functions are assumed to be finite. These definitions are borrowed from Doob. We do not use them in this book. T16 THEOREM Let f and g be two excessive functions, and ex, {3 be two positive constants. The functions exf + {3g and fAg are then excessive. Let (fn)neN be a pointwise convergentsequence ofexcessivefunctions. Thefunctionlimn-+oofn is then excessive.
The proof is immediate. Potentials D17
DEFINITION
The potential kernel associated with a kernel N is
Letf be a positive measurablefunction. The function Gf is called the potential of the function f
We shall often use the obvious identity NG
+ 1= GN + 1= G.
Kernels of the form AG, where G is the potential associated with a kernel N, and A is a positive constant, are called elementary kernels by Deny. T18 THEOREM Let f be a positive measurable function. The function g = Gf is then excessive, and at every point x E E NOOgr£
=0
or
NOOgr£
=
00.
Conversely, every excessive function g that has this property is the potential of a positive measurable function. Proof It is clear that
00
NkGjr£ =
!
Nnjr£. n=k
(18.1)
The Potential Theory of a Single Kernel
181
IX, TI9
It then follows at once (for k = I) that the function Gfis excessive. Let x be a point such that N OO Gfx < 00; there then exists an integer k such that the first member of (I8.!) is finite, and it follows that the series in the second member converges. We thus have 00
lim
2 NnplJ = lim N 7JGjX = 0,
7J-+ 00
7J
7J-+ 00
or NOOGfX = 0. Conversely, let g be an excessive function such that the function NOOg takes only the values and 00. Let f be the function defined by
°
f(x)
=(
g(X) - Ngx
at every point x where g(x) is finite.
00
if g(x) =
00.
We show that Gf = g. It is clear that equality holds on the set {g = oo}, since then Gf ~ f = 00. Let x be a point where g is finite. The function g is then integrable for every measure Nk(x,dy) and the same is true off, which is dominated by g. We can thus write from which we deduce that k
2 NnplJ =
g(x) - Nk+1ga:.
n=O
Finally, the relation NOOg X< g(x)
< 00 implies that NOOgx = 0, so that 00
2 Nnfa: =
g(x).
n=O
Thus we have Gf = g.
(Riesz decomposition) Let g be an excessive function such that the function NOOg is finite; g can then be written uniquely as the sum of an invariant function and a potential. T19
COROLLARY
Proof Let x be a point of E, and let k be an integer such that Nkgx < 00. The function Nk-lg is integrable for the measure ea;N and dominates the functions N7Jg (p > k). It thus follows from Lebesgue's theorem that (ea:N,N OO g)
= Hm (ea:N ,N7Jg) = Hm N 7J+1ga: = 7J-+ 00
N OO ga:.
7J-+ 00
The function NOOg is hence invariant. Let f be the excessive function g - NOOg. It is clear that NOOf = 0, and the preceding theorem implies that f is the potential of the function f-Nf. Suppose that we have a decomposition of the form
g=
f' + h,
where f' is excessive and h is invariant. Then N OOg = NOOf'
+ h,
and thus we obtain h ~ NOOg, with equality if and only if NOOf' = 0, i.e., iff' is a potential. This implies the uniqueness of the decomposition of g into an invariant function and a potential, and shows that NOOg is the largest invariant minorant of g.
IX, T2Q-T22
T20 N°Og
Kernels and Resolvents
182
Let g be a finite potential (or more generally a potential such that everywhere). Every excessive function dominated by g is then a potential.
COROLLARY
< 00
T21 THEOREM Suppose that the kernel G is proper (No. 2). Every excessive function f is then the limit of an increasing sequence offinite potentials. Proof Since E is the union of a sequence of sets with finite potentials, the function Gl is the limit of an increasing sequence of finite potentials gn. Let fn = ngn: the potentials fn converge everywhere to + 00. It then suffices to note that the functions fn A fare potentials, which increase to f
The reduite of an excessive function on a set We begin by defining notation. Let A be a measurable set and let A' be its complement. We denote byJA (respectively,JA ,) the kernel defined byJAf = fIA (respectively,JA,f= fIA,) for every measurable function f; by N A (respectively, NA') the kernel NJA (respectively, NJA,); by GA (respectively, GA,) the potential kernel associated with the kernel N A (respectively, N A'). T22 THEOREM Let f be an excessive function. The collection of excessive functions that dominate f on A has a smallest element, equal to
(22.1) This function is called the reduite off on A. Proof Denote by HA the kernel J A HA! = g. The inequality
+ JA,GA,NA .
It is evident that HA = HAJA . Set
k
JAf + 2,JA,N~.4:NAf<1 2l=O
is clear for k = 0; we show by induction that it is true for every k. Apply the kernel N to both sides, obtaining k+l
NAf + 2, N~:NAf 2l=1
< Nf·
Apply now JA', then add JA! to both sides. There comes k+l
JAf + 2JA,N~,NAf< JAf + JA,Nf
The induction is thus possible. This inequality gives the relation g = HAf < f for k -+ 00. It is clear that g = f at every point of A. Let us prove thatg is excessive. We compute Ng, using the identity 1+ NA,GA , = GA,. We obtain Ng = NHA != N A !+ NA,GA,NA != GA,NA! We thus have Ng = g at every point of A'. On the other hand, the relations Ng < N! JAI, and therefore HAh > HA! = g. Now h majorizes HAh; we thus have h > g, and g actually is the smallest excessive function that majorizes f on A.
The Potential Theory of a Single Kernel
183
IX, T24
23 Remarks (a) The potential of the function g - Ng is at most equal to g, and it is equal to g if N°Og = O. This happens at least in the following two cases: (1) if the functionfis a finite potential, since then N°Og < Nj = 0; (2) if the potential of the function JAf is finite, since GJAf > f on A, so that GJAf > g everywhere, and consequently N°Og < N°O(GJAi) = O. Suppose in particular that the kernel N is sub-Markov, and that the potential G(IA) is finite. The reduite of the function 1 on A is then a potential, which is called the equilibrium potential of A. (b) We are going to indicate a characterization of potentials by means of the reduite, analogous to the characterization most commonly used in classical potential theory. We suppose that the kernel G is proper, and we retain the notation of the preceding paragraphs. Denote by g an excessive function such that the function Ng is finite. We can now show that g is a potential if and only iflim n __ oo HAng = 0 for every decreasing sequence (An)neN of measurable sets which has empty intersection. Suppose in fact that g is a potential, and put HAtlg = h n • The functions h n decrease as n increases, and we have seen that Nh n = h n on (An. Let h = lim n __ oo h n. We have Nh = h from Lebesgue's theorem, and this implies the equality h = 0 from T19. Conversely, suppose that g is not a potential. There then exists a nonzero invariant function h, dominated by g. We are going to construct a decreasing sequence of sets An An = 0. such that HAnh = h for every n, and Consider first a set A' such that the function GA,h is finite, and put A = (A'. We have NAh + NA,h = h, and consequently
nn
HAh = JAh
+ JA,GA,NAh =
= JAh + JA,h
JAh
+ JA,GA,(h -
NA,h)
= h.
Since the kernel G is proper. we can choose an increasing sequence of measurable sets B n such that G(IB n ) < 00 for every nand Un B n = E. Then let A~ = B n
n
{h
< n},
and
An = (A~.
We have for every n GA~h < nG(/B) < 00. It follows from the preceding results that HAnh = h for every n, and consequently lim n __ oo HAng > h #: 0, while An = 0.
nn
~
Let h be a positive measurablefunction, equal to zero on the complement of A, and let f be its potential Gh. Every excessive function that dominates f on A dominates it everywhere. T24
THEOREM
Proof Let u be an excessive function that dominates f on A, and let v be the excessive function u A f We denote by j the positive function equal to v - Nv on the set {v < oo}, and to + 00 on the set {v = oo}. The potential of j is equal to + 00 on the set {v = oo}, and to v - N°Ov on {v < oo}; it is thus everywhere less than v (T18). Apply the kernel N to both sides of the inequality v < Gh, yielding Nv
< NGh,
from which, by adding h to both sides, we obtain h
+ Nv < h + NGh =
Gh.
We have Ghx = vex) on A and consequently, at every point x of A hex) ~ vex) - Nv X = j(x).
n {j <
oo},
IX, T25-T27, D28
184
Kernels and Resolvents
This inequality evidently holds also on {j = oo}. It thus holds on all of the set {h and hence everywhere. We therefore have also so that v = f, and consequently f
f= Gh < u.
> O} c
A,
< Gj < v,
T25 COROLLARY (Domination principle) Let g and h be two positive measurable functions, and let A be the set {h > O}. The relation
Ggz implies the inequality Gg
> Gh
Z
for every
x
E
A
> Gh.
The function 1 is excessive if the kernel N is sub-Markov, and the same holds for every function of the form a + Vg, where g is positive and measurable and where a is a positive constant. We thus have the following result. T26 COROLLARY (Complete maximum principle) Suppose that the kernel N is sub-Markov. Let g and h be two positive measurable functions, a a positive constant. The relation a + Ggz > Ghz for every x such that h(x) > 0
implies the inequality a
+ Gg > Gh.
Remark These two "principles" are also satisfied for kernels proportional to G, Le., for the elementary kernels of Deny (No. 17). It is interesting to note that the excessive functions can be characterized without explicit mention of the kernel N. T27 THEOREM Suppose that the kernel G is proper. A positive measurable function f is excessive if and only if the following property is satisfied: For every measurable function h (not necessarily positive) with a potential Gh that is welldefined andfinite, the relation
f(x)
> Gh
f(x)
> Gh
implies
Z
for et'ery x such that
Z
for every
x
E
h(x)
>0
E.
(27.1)
We postpone until later (Nos. 70 and 72) the proof of this theorem. Excessive measures The theory of excessive measures is, in general, easier than that of excessive functions, and we often put the emphasis on this latter theory, here and in all that follows. We suppose now that E is a locally compact, a-compact space and that N is a diffusion-kernel on E. All of the measures we consider will be defined on the a-field !!liE), and will be positive. The results below are borrowed from article [(52)] of Deny, where a more general notion of "kernel" is used, which is more satisfactory for the theory of excessive measures. D28 DEFINITION A Radon measure ft on E is said to be excessive (respectively, invariant) with respect to the kernel N if ftN < N (respectively, ftN = N). The measure ftN is then finite on compact sets, and is hence a Radon measure (No. 10). The following theorem corresponds to Theorem 16. Notice the disappearance of the countability restrictions.
185
The Potential Theory of a Single Kernel
IX, T29, 30, T31-T33
T29 THEOREM (a) Let A and fl be two excessive measures, cx and {J be two positive numbers. The measures CXA + {Jfl, A A fl are then excessive. (b) Let fl be a Radon measure, equal to the weak limit of a family (fli)iel of excessive measures, which is filtering either to the right or to the left. The measure fl then is excessive. Proof Statement (a) is obvious. To establish (b), we need only prove that (fl,Nf) < (fl,f) for every function fE ~}. Now we have (fli,Nf) < (fli ,f), and (fl,f) = limi (fli,f). The point to establish thus is the relation: (fl,Nf)
< lim (fli,Nf).
(29.1)
i
This is obvious if the family is filtering to the left. Assume it is filtering to the right, and denote by hi a density of fli with respect to fl. Let U be any relatively compact open set: The functions h/u increase with i, their integrals remain bounded, and they thus converge in the space Ll(fl). The relation limi fli(g) = fl(g) holds for any function g E~} that has its support in U; the Ll limit of the functions h/u thus is equal to I u , and we get fl(f) = sup fli(f) i
for every universally measurable function f that is positive, bounded, and equal to 0 on the complement of U. This relation now extends, by an increasing passage to the limit, to all universally measurable positive functions, and (29.1) follows. 30 The potential kernel G is not necessarily a diffusion. We shall say that a Radon measure fl belongs to the domain of G if the measure flG is finite on compact sets (it is then a Radon measure). If fl is excessive, we set flNa) = lim n flNn. T31 THEOREM (Riesz decomposition) Let fl be an excessive measure. Then fl can be written uniquely as the sum of an invariant measure and of a potential. To be precise, the measure fl - flN belongs to the domain of G, the measure flNa) = lim flN n is invariant, and n
The proof of this theorem is identical to that of Theorem 19. It can also be verified, as in No. 20, that every excessive measure dominated by a potential flG (where fl belongs to the domain of G) is a potential. The following theorem is obvious. We mention itonly for the sake of its name. T32 THEOREM ("Principle of the uniqueness of masses") Let A and fl be two n:easures belonging to the domain of G. The relation AG = flG then implies A = fl. Proof We have in fact A + AGN = AG = flG = fl + flGN, and the measures AGN and flGN are two equal Radon measures. The following theorem corresponds to Theorem 22 and uses the same notation. We will not give a proof for it.
T33 THEOREM Let fl be an excessive measure. The collection of excessive measures that dominate fl on A has a smallest element fl', equal to
IX, T34, 35
Kernels and Resolvents
186
This measure could be called the "reduite" of p, on A, but in general it isn't. It can be verified, as in No. 22, that the measures p,' and p,'N are equal on A'. Suppose, in particular"that p, is a potential AG. The measure p,', since it is dominated by p" is the potential of a well-determined measure A', which is called the balayee* of A on A. We have A' = p,' - p,' N, so that A' is carried by A. The statement analogous to Theorem 24 is true for excessive measures (the proof carries over without change). Let then A" be a second measure carried by A, whose potential coincides with AG on A. The potentials A'G and A"G have the same restriction to A. They are thus equal, and hence A' = A". We therefore have the following theorem. T34 THEOREM (Principle of balayage) Let A be a measure that belongs to the domain of G, and let A be a universally measurable set. There exists a unique measure A' with the following properties: (1) A' is carried by A. (2) A'G < AG, and these two potentials have the same restriction to A.
Appendix: Connections with Martingale Theory 35 The analogies between potential theory and martingale theory can perhaps be illumi-
nated by the following remarks. Let (.o,§',P) be a complete probability space, and let (§'n)nEN be an increasing family of sub-a-fields of §'. Denote by E the set N x .0, and by C the a-field on E consisting of the subsets of the form
U {n}
X
An'
nEN
where each set An is §'n-measurable. The C-measurable mappings from E into R are then of the form (n,w) ~ Xn(w),
where each partial mapping X n is §'n-measurable. The definition of stochastic processes adapted to the family (§'n) is thus recovered. Introduce on the set of these processes the equivalence relation defined by f"'Ooo./
(Xn)nEN
f"'Ooo./
(Yn)nEN if and only if X n = Y n a.s. for every n EN.
Let X = (Xn)nEN be a process with positive values adapted to the family (§'J. For each n, denote by Yn a version of the generalized conditional expectation of X n+1 with respect to §'n' and define The mapping N is not well defined, due to the indeterminacy in the choice of conditional expectations, but by passing to the quotient by the equivalence relation one can obtain a mapping that formally has all the properties of a sub-Markov kernel-in particular, the behavior under passage to a monotone limit. The "excessive (respectively, invariant) functions" with respect to the "kernel" N are then the equivalence classes of generalized supermartingales (respectively, martingales), and all of f"'Ooo./,
• "Swept out measure."
187
Semigroups and Resolvents
IX, D36, D37, 38
the elementary theory we have developed carries over without difficulty. Naturally, no truly important theorem on supermartingales is obtained by this method. In particular, the fundamental theorems on the behavior of paths have no parallel in potential theory.
3. Semi8roups and Resolvents The results of this section are not deep, but they are very useful technical tools. They come mostly from Hunt's papers (78). Semigroups of kernels D36 DEFINITION Let (E,tff) be a measurable space. A family (Nt)teR+ [respectively (Nt)t> 0] ofkernels on (E,tff) is said to be a semigroup ofkernels (respectively, a semigroup in the broad sense) if the relation
holds for every pair (s,t) of numbers >0 (respectively, >0). The semigroup is said to be sub~Markov (Markov) if all of the kernels N t are sub~Markov (Markov). A semigroup ofdispersions on a locally compact space constitutes a particular case of this definition. A semigroup in the broad sense can always be transformed into a true semigroup. It suffices to set No = I (the identity kernel). We prefer, however, to maintain the distinction between these two types of semigroups. Let (Nt ) be a semigroup of kernels. It is possible to produce from it new semigroups of kernels (N:) (where p denotes a number >0) by setting
Nf = e-ptNt · These semigroups are sometimes better behaved than (Nt ) itself. Supermedian and excessive functions D37 DEFINITION Let (Nt) be a semigroup in the broad sense on (E,tff). A positive function f defined on E is said to be p~supermedian (p > 0) with respect to the semigroup (Nt) if f is
tff~measurable and
e-PtNtf < f
for every
t
> O.
(37.1)
The function f is said to be p~excessive if, moreover, Hm e-PtNtf = f.
(37.2)
t--+O
The function f is said to be p-invariant
iff
e-PtNtf= f
is everywhere finite and if for every
t
> O.
38 Remarks (a) Functions that are O-supermedian (O-excessive) are called simply supermedian (excessive). (b) The functions that are p-supermedian (p-excessive) with respect to the semigroup (Nt ) are identical to the supermedian (excessive) functions with respect to the semigroup (N:).
IX, 39
188
Kernels and Resolvents
(c) A p-supermedian (p-excessive) function is also q-supermedian (q-excessive) for every
q > p. One can thus say that "the larger p is, the more p-supermedian (p-excessive) functions
there are." (d) Relation (37.1) implies that the function t ~ e-ptNt!X is decreasing for every x E E. Moreover, if condition (37.2) is satisfied, this function is right continuous from Lebesgue's theorem. (e) Let! be a p-supermedian (p-excessive) function. The function N t! is then p-supermedian (p-excessive) for every t > 0. (!) Let (!n)nEN be a sequence of p-supermedian functions. It follows immediately from Fatou's lemma that the function! = limninf!n is also supermedian. Suppose that the sequence is increasing, and that the functions!n are p-excessive. We then have I
=
sup In n
=
sup sup e-PtNtln = sup sup e-PtNt!n n
t -PtN! = sup e t,
t
n
t
so that! is p-excessive. (g) Supermedian or excessive measures are defined similarly. The theory of excessive measures cannot be developed satisfactorily under our current hypotheses. We shall see, in turn, that the theory of excessive measures becomes very simple-much simpler than that of excessive functions-when suitable hypotheses are made on the semigroup (Nt ). Resolvents
Let (Nt) be a semigroup in the broad sense. We say that (Nt ) is a measurable semigroup if the function
39
(t,X) ~ Nix,!)
is measurable (with respect to the natural product a-field on ]O,oo[ x E) for every positive C-measurable function! Then, for every number p > 0, define Vp(X,!) =
f.oo e-PtNt(x,A) dt
(A
E
C).
(39.1)
It is clear that the mapping (x,A) ~ Vix,A) is a kernel V p, and that the notation Vix,!) we have used is consistent. The family of kernels (Vp)p>o is called the resolvent of the semigroup (Nt), and the kernel V = Vo is called the potential of the semigroup (Nt). More generally, let p, be a bounded measure on the half-line R+. Define a kernel NI' by
the relation Nil) = f.ooNtfdP,(t).
(39.2)
The kernels N t are of this form (for p, = Ct), just as are the kernels Vp (p, is then the measure with density e- pt on R+, which we denote by e p ). It is easy to verify that N;.Np.
the symbol
=
= Np..;' =
N;.*p.
* denoting convolution. The formula e + (q - p)e * e p = q
q
where q and p are two numbers such that q Vq
Np.N;.,
(39.3)
e p,
> P > 0, then gives us the fundamental formula
+ (q -
p)Vq V2J = V p,
(39.4)
IX, D40, 41, D42, D43, 44
Semigroups and Resolvents
189
which is known as the resolvent equation. Now we are going to forget measurable semigroups for a moment and study the families of kernels which satisfy (39.4) for their own sake. D40 DEFINITION (E,C), such that
A resolvent on a measurable space (E,C) is a family ( Vp) p> 0 of kernels on
(40.1)
and
for every pair of numbers p, q such that q > P > O. A resolvent (Vp) is said to be proper (respectively, sub-Markov, Markov) if the kernels V p are all proper (respectively, if the kernels p V p are all sub-Markov, Markov).
We shall mainly be interested in sub-Markov resolvents in the rest of this book. The reader can simplify his task by supposing, for the rest of this section, that all the resolvents considered are sub-Markov. 41 Let f be a positive measurable function. According to formula (40.1) the function p ~ Vpfis decreasing. We can thus put Vof
=
Vf = sup Vpf = lim Vpf. p
p-+O
Let (fn) be a sequence of positive measurable functions that increases to f We have Vf = sup Vpf = sup sup VIn p p n
=
sup sup VIn n
=
P
sup Vfn n
so that V is a kernel. It is easily verified that VVp = VpV; V = V p + pVVp (p
> 0).
D42 DEFINITION Let f be a positive measurable function. The function Vrf (r the r-potential off The function Vf = Vof is called the potential off D43
DEFINITION
> 0) is called
We say that the resolvent (Vp) is closed if the kernel V is proper.
Suppose that E is a locally compact, a-compact space. We then say, in a slightly more precise sense, that the resolvent (Vp) is closed if all of the kernels Vip > 0) are dispersion kernels. 44 Let (Vp ) be a resolvent and let r be a positive number. The family of kernels (p
> 0)
is a new resolvent. Suppose that the resolvent (Vp ) is proper: The resolvent (V;) is then closed for every r > O. Indeed, V~f = lim p _ o V p+rf < Vrf for every positive measurable functionf(we shall see later that the inequality is, in fact, an equality). This property is the reason for the interest in the resolvents (V;). Suppose that the resolvent (Vp ) is associated with a measurable semigroup (Nt ) by formula (39.1). The resolvent (V;) is then associated with the semigroup (ND = (e-rtNt ).
Supermedian and excessive functions We now define supermedian and excessive functions with respect to a resolvent. The connection between this definition and Definition 37 is examined later (No. 65).
IX, D45, T46
Kernels and Resolvents
190
A positive measurable function f defined on E is said to be r-supermedian (r > 0) with respect to the resolvent (V1J if D45
DEFINITION
pVP+rf
for every
> O.
(45.1)
if in addition
The function f is said to be r-excessive
Hm p V1J+rf =
The function f is said to be r-invariant
p
iff
f.
(45.2)
is everywhere finite and if
PV 1J+rf = f
for every
p
> O.
The words "with respect to the resolvent (V1J )" will usually be omitted. Functions that are O-excessive (O-supermedian, O-invariant) will be called simply excessive (supermedian, invariant). The r-supermedian (r-excessive) functions with respect to the resolvent (V1J) are identical with the supermedian (excessive) functions with respect to the resolvent (V;). First properties
We suppose henceforth that the resolvent (V1J) is proper.
(a) Let f be a positive measurable function, and x a point in E. The function p ~ V1Jfx is then decreasing, right continuous, and continuous on every open interval where it is finite. (b) Let f be an r-supermedian function. The function p ~ pVr+1JfIX is then increasing and continuous for every x E E. T46
THEOREM
Proof The fact that the function p ~ V1JfIX is decreasing follows immediately from the resolvent equation, and has already been used. Let Po, p, e be three numbers such that o < Po < p, 0 < e < p - Po, and V1Jo f lX < 00. We then have V1JfIX = V1J+e f IX + eV1J V1J+ef IX and V1J _ef IX = V1JfIX + eV1J V1J_ef IX, where the quantities eV1JV'lJ-efIX, eV1JV1J+efIX are dominated by(p - po)V1J V 1Jo f'x = V 1Jo f lX V1JfIX < 00. It then follows that the function p ~ V1Jpx is continuous on the interval ]Po,oo[. Consider next an arbitrary number p > 0; since the kernel V1J is proper, f is the limit of an increasing sequence of positive functions fn such that the functions V1Jfn are finite. The function q ~ Vqr is thus equal, on ]p,oo[, to the upper envelope of the continuous functions q ~ Vqfn IX. It is therefore decreasing and lower semicontinuous (l.s.c.), and consequently right continuous. In summary, the function p ~ V1JfIX is decreasing, and has at most one point of discontinuity Po, to the right of which it is finite and continuous, and to the left of which it equals + 00. Suppose next that fis r-supermedian, and let p and q be two numbers such that 0 < P < q. We then have and consequently, applying the kernel Vr+q ,
< (q - p)Vr+qf, p)Vr+qVr+1J f < pVr+qf + (q -
p(q - p)Vr+qVr+1J! and
pVr+qf + p(q -
p)Vr+qf= qVr+qf
Semigroups and Resolvents
191
IX, T47-T49
Since the left side is equal to PVr+pf from the resolvent equation, we see that the function p JW+ PVr+ p! is increasing. Now this function can have, from (a), only a single point of discontinuity Po, to the left of which it equals + 00, and to the right of which it is finite; this cannot happen for an increasing function, and it follows that the function p .A./II'+ Vr + p is continuous. T47 THEOREM Let f be a positive measurable function. (a) The function f is r-supermedian if and only if f is s-supermedian for every s > r. (b) Suppose that f is r-supermedian and that there exists a number s > 0 such that f is s-excessive. The function f is then r-excessive.
Proof The relation p Vr+p! < f implies p Vs+p! < f for every s > r according to T46(a). Conversely, the relation p V s+p! < f for every s > r implies p Vr+pf < f from the right continuity of the function s JW+ Vs+pf. Let rand s be two positive numbers. The equalities lim p ~+pf = I p-+ 00 can be written, respectively,
and
lim PYs+pf = I p-+ 00
lim (p - s)Vpf = f, p-+ 00 and consequently are equivalent, since the ratio (p - r)/(p - s) tends to 1 as p --+ and
lim (p - r) Vpf = I
p -+ 00
00.
T48 THEOREM (a) Let f and g be two r-supermedian (respectively, r-excessive) functions, and let lX, fJ be two positive constants. The function lXf + fJg is then r-supermedian (respectively, r-excessive). The function fAg is r-supermedian. (b) Let (fn)neN be a sequence of r-supermedian functions. The function f = lill1 inffn is n-+oo then r-supermedian. (c) Let (fn)ne N be an increasing sequence of r-excessive functions. The function f = limnfn is then r-excessive.
Proof We have
P Vr+if A g)
< (p Vr+pf) A (p Vr+pg) < fAg.
This function is hence r-supermedian. In order to establish (b), we use Fatou's lemma, which implies that p
~+p (limninfIn) < limninf p ~H!n < limn infIn·
Under the hypothesis of (c), we have, from T36(b), lim p ~+pf = sup p ~+pf = sup sup p ~+pfn p-+ 00 p P n
= so that f is r-excessive.
sup sup p ~+pfn n
=
P
sup In n
= f,
T49 THEOREM Let q and r be two positive numbers, and f an r-supermedian function. The function Vqf is also r-supermedian.
Proof For every p
> 0, we have p Vr+pVq! = Vq(p Vr+p!)
< Vqf·
IX, T5Q-T53
Kernels and Resolvents
192
Resolvent identities
T50
LEMMA
Let f be a positive measurable function. The function Vrf is r-supermedian.
(We shall see later that this function, under very general conditions, is actually r-excessive.) Proof This is an immediate consequence of the resolvent equation, pVr+pVr!
+
Vr+p!= Vrf
(50.1)
T51 LEMMA Let f be a positive measurable function with a finite r-potential Vrf The functions of the form (51.1) are then finite. Proof Let e be a strictly positive number such that r function (51.1) is dominated by (-v,.+e)k-l-v,.!=
+ e < Pb ... , r + e < Pk.
~l (e-v,.+el-l-v,.f <
e
The
Ll -v,.f,
e
from T49 and T50. T52 THEOREM Let f be a positive measurable function such that all the functions Vrf (r > 0) are finite. The function r JVV'+ V r! is then infinitely differentiable on the interval ]0,00[, and we have the relations (52.1)
and
n
d r -v,.! = n! ( -1 )n+l(-v,.t(I - r -v,.)f. dr n
(52.2)
Proof From the resolvent equation and T46(a) we have
lim Vqf - -v,.! = - lim Vq-v,.f = q-+r q - r q-+r
-(-v,.)~.
The two formulas are then established by induction: We leave the details of the proof to the reader. The following lemmas will permit us to establish another important identity. T53 LEMMA Let r be a number >0, and h a finite, positive, measurable function such that all of the functions Vp+,n are finite (p > 0). Suppose that there exists a p > 0 such that p V p+,.h = h;
(53.1)
the function h is then r-invariant. Proof The equality
Vq+,.h = V p+,n
+ (p -
q) Vq+rVp+,.h
holds for every q < P (from the resolvent equation), and also for every q the functions that enter are finite. Replacing V p+,.h by hip we obtain 1 Vq+rh = - h
p
+
and consequently qVq+,.h = h for every q > O.
p-q Vq+rh, p
> p, since all of
193
Semigroups and Resolvents
IX, T54, T55, 56
T54 LEMMA Let f be a positive measurable function with finite r-potential (r function is then the only r-invariant function dominated by VrI
> 0). The zero
Proof We have Vrf = lim£--+o Vr+£f (T46). Since the function Vrf is finite, we can write
lim eVr+£Vrf = lim (~f £-+0
~+£f) =
o.
£-+0
Let then h be an r-invariant function dominated by Vrf; we have h = e~+£h = lim e~+£h £-+0
< lim e~+£~f= o. £-+0
The following theorem is stated only for the kernel V and for a closed resolvent. It extends to the kernels Vr [consider the resolvents (V;) of No. 44] and then, as r ~ 0, to the kernel V even when the resolvent (Vp ) is not closed. T55 THEOREM * Suppose that the resolvent (Vp) is dosed, and let p be a number have the identity:
>0.
We
00
pV = !(pVp)n.
(55.1)
n=l
Proof Both sides being kernels, and the left-hand side being a proper kernel, it suffices to verify the equality 00
pVf= !(pVp)nf n=l
for every positive measurable function f with finite potential VI It follows immediately from the resolvent equation that, for all n
> 0, pVf= pVp[
+ (PVp)'1 + ... + (pVp)n-y + (pVp)npVI
We thus need only show that lim (p Vp)nVf =
o.
n-+oo
But these functions decrease when n increases, and are dominated by Vf(see T49 and T50). The limit h = lim(pVp)nVf n-+oo
thus exists and clearly satisfies the relation p Vph = h (Lebesgue's theorem). It is hence invariant from T53, and zero from T54.
A supplementary hypothesis 56 We suppose from now on that the resolvent (Vp ) satisfies the following hypothesis, which will be studied in more detail in No. 68.
There exists a number s gn such that
> 0, and an increasing sequence offinite s-supermedian functions limg n = n-+oo
• This identity has been used by Deny (see No. 68).
+00.
(56.1)
IX, T57, 058, 059
Kernels and Resolvents
194
We continue to suppose also that the resolvent is proper. Hypothesis (56.1) is satisfied in two very important cases: when the resolvent is sub-Markov (take s = 0 and gn = n); when the kernels V2) are strictly positive (i.e., when all of the measures Ea;V2) are different from 0). Indeed, choose any s > 0; since the kernel Vs is proper, the function 1 is equal to the limit of an increasing sequence (h n ) of positive functions with finite s-potentials. Put gn = nV ~n; the functions gn are s-supermedian (T50) and since the function V sl is everywhere strictly positive we have Hm gn = lim n~l = + 00. n
n
Here is an important consequence of this hypothesis.
T57 THEOREM r-excessive.
Let f be a positive measurable function; the function Vrf (r
~
0) is then
Proof We have the relation pV2J+r Vrf + V2J+rf = Vrf. Suppose first that/is bounded by one of the functions gn of No. 56. We then have, for large enoughp,
V2J+rf < V2J+rgn
=
1
p+r-s
(p
+
r - s)V2J+rgn
1 < p+r-s gm
and thus limj)-+oo V2J+rf = 0, so that the function Vrfis r-excessive. To treat the case wheref is arbitrary it then suffices to note that the functions Vr(f A gn) are r-excessive and to apply T48.
Regularization of supermedian functions
Letfbe an r-supermedian function (r we can put
> 0). Since the function p.A.J"V'+ pV2J+rfis increasing,
J=
lim p V2J+rf· 2)-+ 00
This function is r-supermedian (T49 and T48) and dominated by f. Also for every s > 0, so that depends only on f, and not on r.
J
D58
DEFINITION
D59
DEFINITION
ViIA.)
=
J=
limj)-+oo p V2J+sf
The function 1 is called the regularization of the r-supermedian function f.
Let A be a measurable set; A is said to be a set of potential zero 0 for every p > o.
if
It suffices that V2)(IA.) = 0 for a single value of p. Indeed this implies that Vq(IA.) = 0 for every q > P from T46(a), and for q < P we have
Vq(IA.) = ViIA.)
+ (q -
p)VqViIA.) = O.
We employ in what follows the expression "almost everywhere," when it will not lead to ambiguity, as synonymous with the expression "except for the points of a set of potential zero." Let f and g be two positive measurable functions. equal almost everywhere. The potentials V2)fand V2)g are then equal for every p > O. In particular, iffandg are r-excessive we have f = limj)-+oo p V2J+rf = limj)-+oo p Vr+2)g = g.
IX, T60, 061, T62
Semigroups and Resolvents
195
T60 THEOREM Let f be an r-supermedian function; the function I is then r-excessive, equal to f almost everywhere, and is the largest r-excessive function dominated by f.
Proof The function pVr+pfis r-supermedian (T49) and (r + p)-excessive (T57), and hence r-excessive (T47). It follows from T48(c) that the functionlis r-excessive. Let g be an r-supermedian function dominated almost everywhere by f. We have p Vr+pg < P Vr+pf for every p, and consequently also g < Thus I is, in particular, the largest rexcessive function dominated by f. It remains to show that we actually havel = falmost everywhere. To see this, consider a number t greater than r and the number s of No. 56. The functions gn of No. 56 and the function f are also t-supermedian, and I = limp -+ oo p Vp+tf. The functions fn = fA gn are t-supermedian and finite, so thatf = sUPnfn and
J.
I
= sup pVp+tf = p
sup sup pVpHfn p
n
=
sup sup pVp+tfn = sup In. n
n
p
We thus need only know that In = fn almost everywhere or, since the functions fn are finite and dominate lm that Vtln = Vtfn. But we have, from the relation pVt+pfn
p-- 00
Excessive functions and potentials D61 DEFINITION Let f be an r-excessive function (r > 0). Then f is said to be purely r-excessive, or to be an r-potential, if there exists no r-invariant function dominated by f and distinct from o.
The expression ''! is an r-potential" by no means implies that there exists a positive function g such that f = Vrg; it is borrowed from classical potential theory in the unit disk, where the superharmonic functions that satisfy this condition are effectively Green potentials (of positive measures). We use rather the expression',! is purely r-excessive" in this chapter. T62
THEOREM
(Riesz decomposition)
Let f be an r-excessive function such that the function h = lim p Vp+rf p--O
is finite. The function h is then r-invariant and the function f - h is purely r-excessive. This decomposition off into an r-invariant function and a purely r-excessive function is unique. Proof Since the function p ~ p VP+r f is increasing, and p ~ VP+rf is decreasing, the function h is finite if and only if all of the functions Vp+rf are finite. Suppose then that h is finite. It follows from Lebesgue's theorem that qYa+r h
= limpqVq+rVp+rl= lim p--o
pq (Vp+rf- Ya+rf) p--o q - P
= h.
Thus h is r-invariant. If h' is an r-invariant function dominated by f we have h' = pVp+,h'
< pVp+rf
for every p
> 0,
and consequently h' < h. Thus h is the largest r-invariant minorant off. It then follows that the function f - h is purely r-excessive (If it admitted a nonzero r-invariant minorant k,
196
Kernels and Resolvents
IX, T63, T64
h + k would be a minorant off, and larger than h.) We leave to the reader the uniqueness of the decomposition, which is easily proved.
Let f be an r-excessive function, such that the functions Vp+rf are finite for every p > O. The function f is purely r-excessive if and only if T63
COROLLARY
lim p VfJ+rf = O.
fJ-+O
This applies in particular to a finite functionf of the form Vrg (T54). The following theorem is particularly useful.
-+
(a) Every r-excessive function (r > 0) is the limit of an increasing sequence offinite r-potentials ofpositive functions. (b) If the resolvent (VfJ) is closed, then property (a) holds for r = 0 also. If the resolvent is sub-Markov, the r-potentials considered can moreover be supposed bounded. T64
THEOREM
Proof Property (a) can be deduced immediately from property (b) by replacing the resolvent (VfJ) by the resolvent (V;) of No. 44. We shall thus establish only (b), supposing that the resolvent is closed. The proof will be divided into several parts. (1) Let f be a purely excessive finite function. Define
(p
> 0);
this function is positive. Since all of the functions VfJf are finite for p > 0 (from the inequality pVfJf
VqDfJf = p(Vqf - pVqVfJf) = p[(Vqf - (p - q)VqVfJf) - qVqVfJf] = p(VfJf - qVqVfJf)
< pVfJf
It then follows that the function VDfJfis finite and equal to pVfJf - Vilimq--+o qVqf). This
limit is zero sincefis purely excessive. We thus have VDfJf= pVfJf (2) Consider the function VI. If this function is zero at a point x, then all of the measures pVfJ(x,dy) are zero. Every excessive functionfis thus zero at x. In other words, if we put j = lim nVl,
n-+oo
we have f = f A j. Since the kernel V is proper the function I is the limit of an increasing sequence of functions in with finite potentials. Define
jn = nVin· The functions jn are purely excessive (T54), and their upper envelope is equal to j. (3) Letfbe an arbitrary excessive function. Putg n = f Ajn (f Ajn A n in the sub-Markov case), and denote by hn the function gm the regularization of the supermedian function gn' Since hn is dominated by jn it is finite and purely excessive. The sequence (h n) is increasing and converges to f, since its limit is an excessive function which is equal almost everywhere to f The function (p,n) -"t/It+ P VfJh n is increasing in p and n. We thus have
f
=
sup sup p VfJh n = lim nVnh n. n
fJ
n
Semigroups and Resolvents
197
IX, T65, T66, D67
Put n Vnh n = fn. The sequence (fn) increases to f and the functions fn are the potentials of positive functions (bounded in the sub-Markov case), from (1). We can now examine the connection between the excessive functions with respect to a measurable semigroup and the excessive functions with respect to the resolvent of this semigroup. We restrict ourselves to the sub-Markov case. T65 THEOREM Let (Nt ) be a measurable semigroup of sub-Markov kernels, and let (V p) be the resolvent associated with this semigroup (see No. 39). (1) Every r-supermedian function (r > 0) with respect to the semigroup (Nt ) is r-supermedian with respect to the resolvent (V p). (2) The r-excessive functions with respect to the semigroup are identical to those with respect to the resolvent.
Proof We begin by studying the case where r = 0, supposing the resolvent (Vp) to be closed. Letfbe a supermedian function with respect to the semigroup (Nt ). For every x E E we have
pVpf~ =
ioope-PtNtf~dt
0
so that f is supermedian with respect to the resolvent. Suppose next that f is supermedian with respect to (Nt). Since the function t .J\./\i'+ Ntf~ is decreasing, we have limpVpf~ = lim Ntf~. p-+ 00 t-+O In particular, iffis excessive with respect to the semigroup (Nt ) we have f(x) for every x, and f is excessive with respect to the resolvent. Let g be a positive measurable function. We have
(65.1) lim:p--oopVpf~ -
NtVg~ = 100 N8g~ ds, so that the function Vg is excessive with respect to the semigroup. Finally, let f be an excessive function with respect to the resolvent. Since the latter is closed there exist positive functions gn with potentials that increase to f (T46). The functions Vg n are then excessive with respect to (Nt ), and f has the same property from No. 38(f). Now stop supposing that the resolvent (Vp ) is closed, and apply the result obtained above to the closed resolvent (V;) (r > 0) and to the semigroup (e-rtNt). We obtain the statement of the theorem for the case where r is strictly positive. The case where r is zero then follows immediately, since a function is supermedian (excessive) if and only if it is r-supermedian (r-excessive) for every r > O. The importance of (65.1) makes it worth being stated explicitly. T66 COROLLARY Keep the notation of No. 65. Let f be an r-supermedian function with respect to the semigroup (Nt ). We have
J= t-+O lim Ntf· The domination theorem D67 DEFINITION A point x E E is said to be nonpermanent [for the resolvent (Vp)] if the measure e~Vp is zero for every p ~ O. Otherwise, the point x is said to be permanent.
Kernels and Resolvents
IX, T68
198
Suppose that there exists a p > 0 such that cxVp = O. We then have cxVa = 0 for every q > P and, for q < p, cxVa = cxV2J + (q - p)cxV2JVa = 0, so that the point x is nonpermanent. We denote by E2J the set of permanent points of E. We have E2J = {x: Vl x > O}, so that E2J is measurable. The following theorem makes clear the significance of the hypothesis of No. 56. Here is an example of a resolvent that does not satisfy the hyothesis of T68. Take a space E composed of two points a and b and put, for every p > 0,
caV2J = 0
and
CbV2J = Ca'
The compositions V2J Va are identically zero, so that the resolvent equation is satisfied. The point a is nonpermanent. and the set {a} is not of potential zero. The difficult part of Theorem 68 [the implication (c) => (d)] is adapted from Deny (52). ~
T6t" THEOREM Let (V2J) be a proper resolvent. The following statements are equivalent: (a) Property (56.1) is satisfiedfor an s > 0; (b) The set of nonpermanent points is ofpotential zero; (c) Property (56.1) is satisfied for every s > 0, and for s = 0 if the resolvent is closed; (d) Let f be a positive measurable function, and u a supermedian function such that
u(x)
> Vfx
f(x)
> o.
f(x)
> O.
for every x such that
Then we have u > Vf; (e) Let f and g be two positive measurable functions such that VgX We then have Vg
> Vfx
at every point x such that
> Vf ("domination principle").
Proof We begin by establishing the equivalence of (a), (b), and (c). The set Eo of nonpermanent points is the set of points where the supermedian function + 00 differs from its regularization, so that (a) => (b) from T60 [which is a consequence of (a)]. With the supposition that (b) is satisfied, equip the set E2J of permanent points with the a-field induced by tt. Since the set Eo is negligible for every measure cX V2J , we can define a resolvent (W2J) on E2J by putting Wix,f) = V 2J(x,f') for every p > 0, every x E E 2J , and every positive measurable function f defined on E 2J , f' denoting any measurable extension off to E. The kernels W 2J are then strictly positive on E 2J , and this implies the existence for every s > 0 of an increasing sequence of functions h n defined on E2J , which are finite, s-supermedian with respect to the resolvent (W2J ), and which tend to + 00 on E2J (this point was established in No. 56). It then suffices to put
gn(x)
=(
hn(X) for x
E
E2J
n
E
Eo
for x
to obtain s-supermedian functions with respect to (V2J ), which satisfy (56.1). Finally, the implication (c) => (a) is clear. The reasoning of No. 56 shows that the functions gn exist also for s = 0 if the resolvent is closed.
IX, T69, T70
Semigroups and Resolvents
199
The rest of the theorem will be established using the scheme (c) => (d) => (e) => (b). It will suffice to prove the implication (c) => (d) in the case where the resolvent is closed. Indeed, this implication will then be established for each of the resolvents (V;) (r > 0). Since the function u is r-supermedian for every r > 0, the relation
>
u(x)
VflX
> Vrr
for every x such that
f(x)
>0
will then imply u
>
Vrf
for every
> 0,
r
and consequently also u > Vof, which is the desired result. Suppose then that the resolvent is closed, and denote by (gn) an increasing sequence of finite supermedian functions that tend to + 00. Put fn = f A gn' We have u(x)
>
Vfn
for every x'such that
IX
fn(x)
> 0,
and it suffices to show that u > Vfn for every n. We shall use, to this end, the elementary domination principle of the preceding section (T24). Let p be a number >0, and let N be the kernelpVp • The potential kernel associated with N(in the sense of No. 17) is equal to / + PV from the identity in No. 55. Every supermedian fuqction with respect to the resolvent (Vp ) is excessive with respect to N (in the sense of No. 14). The relation gn(x)
+ pu(x) ~fn(x) + pVfn(x)
at every point x such that
fix)
>0
hence implies, from T24, the relation gn
+ pu > fn + pVfn'
Since p is arbitrary and the functions gn and fn are finite, this implies u > Vfm and assertion (d) is established. The implication (d) => (e) is clear. Finally, we establish the implication (e) => (b). The function VC/Eo) is zero at every point of Eo. We thus have
o=
Vox
> V(/Eo)1X
at every point x such that
IEo(x)
> O.
Thus V(IE o) = 0 everywhere from (e), and this implies that Eo is a set of potential zero. The following statement is very useful. T69 COROLLARY Properties (d) and (e) of the preceding statement are satisfied by every sub-Markov resolvent and by every proper resolvent (Vp) with strictly positive kernels.
Let (Vp ) be a closed resolvent that satisfies the equivalent conditions of No. 68. It is sometimes of interest to know how to determine if a function g is supermedian with respect to the resolvent (V p), without having to form the functions pVpg. T70 THEOREM A positive measurable function g is supermedian if and only if the following property holds: For every measurable function h (not necessarily positive) with a well-defined and finite potential Vh, the relation (70.1) g(x) > VhlX for every x such that hex) > 0 implies g(x)
> Vh lX
for every x
E
(70.2)
E.
Proof Suppose that g is supermedian. The relation (70.1) can also be written g(x)
+
V(h-y
>
V(h+)1X
for every x such that
h+(x)
> O.
IX, T71
Kernels and Resolvents
200
We then have, from T68(d), which is equivalent to (70.2). Conversely suppose that g satisfies the property in the statement. Denote by f a positive measurable function dominated by g and with finite potential VI The function h = p(f - p V"f) then admits a well-defined and finite potential, equal to p V"f Now we have on the set
{g - pV"f> O}
and a fortiori on the set {f - p V"f > O} = {h > O}. We thus have g > Vh = P V"f everywhere. Since the kernel V is proper, g is equal to the limit of an increasing sequence of functions fn of the preceding type. We thus have g > PV"g, which shows that g is supermedian. The pseudo-reduite of a function
The notion we define now does not coincide with the classical notion of the reduite of f (which would be the lower e:Q.velope of the excessive functions that dominate f on A). This is why we call it the pseudo-reduite of I It is in fact not certain that the following theorem has any usefulness, and we shall only outline its proof.
T7t
Let A be a measurable set and fa supermedian function with respect to the resolvent (V,,). The collection of supermedian functions that dominate f on A has a smallest element, which we shall call the pseudo-reduite off on A. THEOREM
Proof For every p > 0, put N" = pV", and denote by g" the reduite off on A relative to the kernel N" (No. 22). The reader can easily verify the following facts: (a) Every excessive function with respect to the kernel N" is excessive with respect to every kernel N q for q < p. (b) When p increases, there are thus fewer and fewer excessive functions with respect to N", so that the reduite g" increases. Put g = lim~oo g". (c) The function g is supermedian with respect to the resolvent (V,,). It is equal to f on A, and every supermedian function that dominates f on A dominates g everywhere. It then follows that g is the desired pseudo-reduite ofI Remark Suppose that A is a set of potential zero; the pseudo-reduite of f on A is then clearly equal to fIA • There would be a very different result with the classical reduite.
Connections between Sections 2 and 3
Let N be a kernel, and let
be the potential kernel associated with N. We are going to show that a resolvent (V,,) can be constructed so that G = Vo, and that this resolvent is, in turn, associated with a semigroup of kernels. Put, for every number a in ]0,1],
201
Semigroups and Resolvents
Let b be a number such that 0
< b < a. Then
GaGo = GoGa = I
and consequently
+ (a + b)N + (a 2 + ab + b2)N2 + ... ,
(a - b)GoGa + bGo
Put then, for every p
~
IX, T71
= (a
- b)GaGo + bGo
=
aGa.
0, V2J
=
1
p+1
G1 /(21+1)·
The kernels V2J constitute a resolvent such that Vo = G. Put, on the other hand, for every t ~ 0, 2 2 tN t N Pt = e-t(l + +~ + ...).
1!
It is easily verified that the kernels (Pt) constitute a measurable semigroup with resolvent
(V2J)-this last point follows from the possibility of integrating the exponential series term by term. Suppose now, for simplicity, that the kernel G is proper (it could be supposed only that the kernels Ga are proper for every a > 0). Since the kernel G is strictly positive, the resolvent (V2J) satisfies the hypothesis of No. 56. Let A be a set of potential zero. The relation GIA ~ lA shows that A is empty, and it follows (T60) that the supermedian functions with respect to the resolvent (V2J) are excessive. Let/be an excessive function with respect to N. We have p VJ =
(1 + p +1 1 p+ 1 p
Nf +
(p
1
+
N 2f
1)2
+ ...)
~ P ~ / ( 1 + p: 1 + (p ~ 1)' + .. -) = J, so that f is supermedian (and hence excessive) with respect to (V2J). Conversely, suppose that / is excessive with respect to (V2J). Then / is the limit of an increasing sequence of potentials (T63(b», and hence it is excessive with respect to N. The resolvent equation is such a useful analytic tool that one could occasionally think of using it in the elementary situation of Section 2. Theorem 27, for example, reduces immediately to Theorem 70-whose proof by resolvents is very natural. We shall see later on other examples of the use of the resolvents (V21) associated with a kernel G.
X
CHAPTER
Construction of Resolvents and Semigroups
We now study, following Hunt, this problem: Given a kernel V, which satisfies the complete maximum principle, does there exist a sub-Markov resolvent (Vp ) such that Vo = V? Is this resolvent associated with a semigroup? The answer to this question is only partially known, but what is known shows that all "nice" kernels of potential theory fit into Hunt's probabilistic theory. Only Nos. 14 and 16 are indispensable for understanding the following chapters. We consider only proper kernels in this chapter.
The Domination Principle
1.
Dl
Let V be a proper kernel on a measurable space (E,tC); V is said to satisfy the domination principle if for every pair (f,g) ofpositive measurable functions, the relation DEFINITION
VfX
> VgX
for every x
E
E such that
g(x)
>0
implies XEE.
IX.T25 and IX.T69 furnish examples of kernels that satisfy the domination principle. D2 DEFINITION A kernel V is said to satisfy the complete maximum principle if for every constant a > 0 and for every pair (f,g) ofpositive measurable functions the relation
a
+ Vfx > VgX
implies a
+
Vfx
for every x such that
> VgX
for every
g(x)
>0
XEE.
This principle clearly implies the domination principle. We have seen examples of kernels that satisfy the complete maximum principle in IX.26 and in IX.69 (the subMarkov case). 202
X, 3, T4
The Domination Principle
203
3 Here is another, very useful, form of the complete maximum principle. Let f be a measurable function (not necessarily positive) such that Vf makes sense, and suppose that the function Vftakes value> 0 at certain points. Let P = {x:f(x) > O}. If V satisfies the complete maximum principle, we have sup Vfe = sup Vfl:. reEE reEP
(3.1)
(This property is sometimes called the "weak principle of the positive maximum.") To establish (3.1), denote the right side by a; we have on the set
{x:f+(x)
> O} =
P,
and consequently a+ + V(f-) > V(f+) everywhere, so that a+ > Vf. Since the function Vfattains strictly positive values we have finally a+ > 0, and hence a+ = a. This establishes (3.1). Conversely, it is easy to see that property (3.1) implies (for a proper kernel) the complete maximum principle. We shall be particularly interested in the case where E is a locally compact, a-compact space given the a-field of universally measurable sets, and where V is a continuous diffusion-kernel on E. The following theorem then allows us to simplify the verification of the domination principle. The strict positivity hypothesis made on V will be commented on in No. 5.
T4
THEOREM
Suppose that V is a continuous and strictly positive diffuSion-kernel, and that
the relation Vfre
> vgre
for every x such that
g(x)
>0
(4.1)
implies, when f and g belong to ~}(E), Vr
> vgre
for every
xEE.
(4.2)
The kernel V then satisfies the domination principle. Proof Let f and g be two positive universally measurable functions such that
for every x such that
g(x)
> o.
We shall show that Vf > Vg. Let AI be the set of l.s.c. (lower semicontinuous) functions that dominate f, and let B g be the set of bounded, positive, u.s.c. (upper semicontinuous) functions dominated by g. We have the following relations, which are immediate consequences of classical results from the theory of Radon measures:
Vj= inf Vj~ f'EA,
Vg = sup Vg'. g'EB g
We have, on the other hand, V(X,g')
= f
J{g'>O}
V(x,dy)g'(y)
=
sup
f
Kcompact JK Kc{g'>O}
V(x,dy)g'(y).
X,5
204
Construction of Resolvents and Semigroups
It thus suffices to show that
Vf' > V(g'IK)
for every functionf' E AI' every functiong' E B g , and every compact K contained in {g' Let then ep be a positive continuous function with compact support, such that Vepa;
>0
for every
x
E
> O}.
K.
The existence of such a function is an immediate consequence of the Borel-Lebesgue theorem, since the kernel V is strictly positive. For every e > 0 we have V(f'
+
eepY
> V(g'IKY
for every
x
E
K.
Denote by C the set of functions h' E ce~(E) dominated by f' + eep. The family of functions Vh'(h' E C) is filtering to the right, and admits f' + eep as its upper envelope; these functions are, on the other hand, continuous, whereas the function V(g'IK ) is U.S.c. (cf. IX.IO). Theorem 6 then implies the existence of a function h' E C such that Vh'a;
> V(g'IK)a;
for every
x
E
K.
Since the function Vh' is continuous, and the function V(g'IK) is u.s.c., there exists a compact neighborhood L of K such that Vh'a;
> V(g'IK)a;
for every
x
E
L.
Denote by D the set of functions j' E ce}(E) with support in L, which dominate g'I K' Another application of Theorem 6, analogous to that above, shows that there exists a function j' E D such that Vh'a; > Vj'a; for every x E L. This inequality then holds for every x such that j'(x) x, from (4.2). We thus have, V(f'
+
eep)
> 0, and consequently also for every
> Vh' > Vj' > V(g'IK).
which concludes the proof, since e was arbitrary. S Remarks (a) Let E1) be the set of permanent points for the kernel V (see IX.D67). We have E1) = {x: VIa; > O}. Since the function VI is l.s.c., E1) is open. Instead of supposing that V is strictly positive, suppose that the set (E1) is of potential zero. For every positive Borel function f defined on E1) and every x E E1) set W(x,j) = V(x,f'), where f' denotes any Borel extension off to E. The kernel W defined in this way on E1) is continuous and strictly positive, and thus satisfies the domination principle if V has the property of the statement. It then follows that the kernel V itself satisfies the domination principle. (b) Suppose that the continuous kernel V satisfies the complete maximum principle for the elements of ce}(E); it can then be shown, exactly as above, that V satisfies the complete maximum principle. It is not necessary to suppose that V is strictly positive: the function Vf + eVep of the foregoing proof can be replaced by Vf + e. Here now is the topological lemma we have used in the course of this proof, and which we shall have occasion to use again. It is a very easy generalization of the classical Dini's lemma.
205
Construction of Resolvents
X, T6, T7, T8, 9
T6 THEOREM Let K be a compact space, fan l.s.c. function on K, and g a u.s.c. function on K such that f(x) > g(x) for every x E K. Let :Yt' be a set of continuous functions, filtering to the right, with upper envelope f; there then exists a function h E :Yt' such that on K.
f>h>g
Proof For each x E K, choose a function ha: E :Yt' such that ha:Cx) > g(x). Since g is a u.s.c. function, we have hiy) > g(y) for all points y in a neighborhood Va: of x. There then exist a finite number of neighborhoods Va:'1 ... , Va: n , which cover K, and it suffices to take for h an element of :Yt' that dominates the functions ha: 1 , ... , h~""n .
Some consequences of the domination principle The lemmas we give now are borrowed from Deny (53). T7 THEOREM Let V be a kernel that satisfies the domination principle, and let f and g be two finite, positive, measurable functions such that the functions Vf and Vg are finite. Then, if p denotes a constant > 0, the equality f+pVf=g+pVg implies the equality f = g. Proof Putf' = f - (f A g) and g' = g - (f A g); we havef' A g' = 0 andf' + pVf' = g' + p Vg'. Then Vg'a: > Vf'a: at every point x such that g'(x) = 0 and consequently at every point x such that f'(x) > O. The domination principle then implies the inequality Vg' > Vf', but we can show in the same way Vf' > Vg', so that Vf' = Vg', hencef' = g' and finally f = g. ~
T8 THEOREM Let V be a proper kernel which satisfies the domination principle; there then exists at most one resolvent (Vp) such that Vo = v.
Proof Let (Vp) and (Wp) be two resolvents such that Vo = Wo = V, and let f be a positive, finite, measurable function such that the function Vf is finite. From the resolvent equation we have (I + pV)Vpf= (I + pV)Wpf= Vf for every p > O. It then follows from the preceding theorem that Vpf = Wpf. Since the kernel V is proper, every positive measurable function g is the limit of an increasing sequence of functions of the same type as! We thus have also Vpg = Wpg, and the theorem is established. The following result will be pointed out without proof: if the kernel V satisfies the domination principle, so do all of the kernels I + PV (p > 0). This result is clear when there exists a resolvent (Vp ) such that Vo = V, from IX.T55 and IX.T25.
2. Construction
if Resolvents
9 Let V be a kernel that satisfies the complete maximum principle. We give in this section sufficient conditions for the existence of a resolvent (Vp) such that Vo = V. Such a resolvent,
X, TIO
206
Construction of Resolvents and Semigroups
if it exists, is necessarily sub-Markov since the constant I is supermedian from the complete maximum principle and IX.T70. Here first is an example which shows that such a resolvent does not always exist. Let (Up) be a closed Markov resolvent on a measurable space (E,C); the kernel U = Uo satisfies the complete maximum principle from IX.T69. Let F be the set obtained by adjoining a point (X to E and let ff be the (J- field generated by C and {(X}. For every positive ff-measurable functionf defined on F set Vfx
=
Uf'x + f«(X)
for every
x
and
E,
E
vpt = !«(X),
where f' denotes the restriction off to E. It can easily be verified that V is a kernel and satisfies the complete maximum principle. We show that there cannot exist a sub-Markov resolvent (V p) such that Vo = V. If there were one, in fact, we would have (1
+ pV)Vpf=
Vf
for every positive measurable function f Suppose that f is zero at (X and denote by g the function equal to U pt' on E, and to zero at (X; then (1
+ pV)g =
VI,
and consequently Vpf = g from T7. In particular, we would have pVilE) = lE·
This equality and the inequality p VpI
ViI{a}Y = 0
would imply for every
x
E
E,
which would in turn imply the same property when p ~ 0, contradicting the definition of V. We begin by establishing, following Hunt (78), the existence of the resolvent (Vp ) associated with a uniform kernel.
TIO THEOREM Let V be a kernel on the measurable space (E,C), such that the function VI is bounded, and which satisfies the complete maximum principle. There then exists a subMarkov resolvent (Vp) such that Vo = V. * Proof Let r'§ be the space of bounded measurable functions, a Banach space under the uniform convergence norm (written 11 • 11). The restriction to r§ of the mapping f Jo.N+ Vf is a bounded operator, which we again denote by V. Let P be a number> O. There exists at most one bounded operator Vp on r§ such that pVVp = pVpV = V - Vp,
since this relation implies (1 + P V) Vp = V, and the kernel of the operator (I + PV) is zero from T8. With the supposition that such an operator V p exists, we point out several of its properties. (a) V p is positive. Let g E r'§ be a function < O. We show that a contradiction is obtained by supposing that the function Vpg attains a strictly positive value. We have in fact
* This resolvent is unique from T8.
207
X, TI0
Construction of Resolvents
and consequently, if the maximum of the function V1Jg is > 0, sup V1Jgll: =
Il:EE
sup V1Jg ll: > 0 Il:E{(g-1JVPg) > O}
from (3.1). This is absurd, since V1Jgx < 0 at every point x such that g(x) > p V1Jgx. (b) IlpV1J 11 < 1. Since V1J is positive, it suffices to verify the relation p V1Jl < 1. Now we have
1 >pV1Jpl for every x such that 1 - pV1Jl x > O. In other words,
1 > V[p(1 - pV1Jl)]X for every x such that p(1 - pV1Jl)X > 0, and consequently, from the complete maximum principle, 1 > V[P(1 - P V1Jl)] = P V1J1.
(c) The existence of V1J for some p
1/11 VII for p
=
> 0 implies that of V1H-e for 0 < e < p (or 0 < e <
0).
Consider, indeed, the series of operators
V1J(I - eV1J + e 2V;+ ...
+ (_l)ne nV:+
which converges for lel < 1/11 V1J II, and consequently for sum of this series by A, we have
.. '),
IcI < P
from (b). Denoting the
eV1JA = eAV1J = V1J - A. The operator V commutes with V1J , and hence with A. It then follows that
pV(V1J - A) = epVV1JA = e(V - V1J)A, or
pVV1J
+ eV1JA =
(p
+ e)VA,
or finally
V-A=(p+e)VA. We can thus set V1J+ e = A. Now consider the existence of the operators V1J: set Vo = V, and use the series to define V1J for pE [0,1/11 VII[. A second extension gives us V1J for pE [0,2/11 VII], a third extension doubles this interval, etc. The operator V1J can thus be defined for p > O. It is clear, moreover, that V1J is a continuous function of p on [0,00[. Let p and q be two numbers> O. We verify the relation
V1J - Vq = (q - p)V1JVq (which implies in particular, by interchanging p and q, that the operators V1J and Vq commute). It will suffice for us to verify the relation
since the kernel of the operator (I + PV) is zero. The computation is then immediate. Let then (fn)nEN be a decreasing sequence of bounded positive functions, which converges to zero. For every p we have lim VJn n ..... <Xl
<
lim Vfn = 0, n ..... <Xl
X, Tll
Construction of Resolvents and Semigroups
208
from the fact that V is a uniform kernel. The operators V 1l are hence the restrictions to t§ of kernels on (E,C), for which we use the same symbols. The relations: V1l Vqf = VqV1l f, VJ= Vqf + (q - p)V1l~!
(0
< p < q),
Vi = Hm V1l! (= sup V1l!) (continuity at p = 0), 1l.. . O 1l are satisfied for bounded, positive, measurable functions f, and hence also when f is positive measurable. The kernels V1l thus constitute the desired sub-Markov resolvent. Remark Suppose that E is a locally compact space, and that V is a continuous kernel on E which tends to 0 at infinity. All of the arguments we have made on the Banach space t§ can then be made on the Banach space f'Co(E), and it then follows that the kernels V1l are continuous and tend to 0 at infinity. The following theorem is in essence due to Hunt [(78), pp. 351-358]. It has been refined by Lion [see (87) and (88)], from whom we borrow the following proof. It should be recalled that the complete maximum principle is a necessary condition for the existence of a resolvent of the type considered (IX.T69), and that such a resolvent is unique (T8). .....
Ttt THEOREM Let E be a locally compact, a-compact* space, and let V be a continuous kernel on E which tends to 0 at infinity and satisfies the complete maximum principle. There then exists a sub-M arkov resolvent (V1l) on E such that Vo = V, consisting of continuous kernels which tend to 0 at infinity.
Proof We limit ourselves to constructing the restrictions of kernels V 1l to the space f'Co(E), and leave to the reader the task of extending them to the universally measurable functions. Let (Kn)neN be a sequence of compact subsets of E such that E = Un K n. For each n, let fn be a continuous function with compact support, with values in [0,1], and equal to 1 on K n. The function Vfn belongs to f'Co: it is thus bounded, and coefficients An > 0 can be chosen so that the series !n An and !n An 11 Vfn 11 converge. We then set
an = (na) A 1.
The function a belongs to f'Co, is strictly positive everywhere, and the function Va belongs to f'co. The functions an are continuous and increase to 1 as n ---+ 00: the set {an = I} ends up containing every compact subset K of E, provided n is large enough. The functions Van, finally, are bounded. For every n and every positive universally measurable function/we now set
It can easily be verified that
vn is a continuous kernel,
which tends to 0 at infinity and satisfies the complete maximum principle. Since the function Vnl = V(a n) is bounded, there exists a sub-Markov resolvent (V;:) consisting of continuous kernels, which tend to 0 at infinity, and such that Vf: = vn. • It is supposed that E is given the a-field f!ju(E), as in all problems that concern dispersion-kernels.
209
Construction of Resolvents
Let m and n be two integers such that n
X, TU
< m, and let fbe an element of ~}. We show that
Denote these functions by kn,k m, respectively. From the resolvent equation we obtain
kn =
vn(:' -
Pk n)
= VU -
pankn),
and an analogous formula for km. Consequently we have
k n - km = V(pamkm - pankn). A contradiction is then obtained by supposing that the function km - k n takes on a strictly positive value; in fact it then takes on strictly positive values on the set {p(ank n - amkm) > O} which is absurd since an < am. But the function f has compact support; the function V;(f/a n) is thus equal to V;f whenever n is large enough. It follows that the limit
VJ=
lim
V;f
(11.1)
n-+oo
exists (and is u.s.c.) for every function f E ~}. This property extends by uniform convergence to the elements of ~t, from the relation lip 1 < 1, then to ~o by linearity. It is clear that IlpVpfll < Ilf 11· Let g be an element of ~}; the function Vg belongs to ~t, and we thus have
V;
VpVg = lim V;Vg.
(11.2)
n-+oo
But we have then
pV:Vg = Pv;vn(.K.) = v n(-!.) - V;(-!.) = Vg - V:(-!.).
(11.3)
an an an an The left side thus increases with n, and the function p V pVg is thus l.s.c. We saw above that it was u.s.c., and hence it is continuous. Finally, a passage to the limit shows immediately that this function is dominated by Vg. It thus belongs to ~t. Let then f be an element of ~}; choose the function g E~} so that f ~ Vg, and set h = Vg - f The functions Vpf and Vph are u.s.c., and their sum is the function VpVg, which belongs to
(q - p)V:V;f= V;f - V;f (p> then shows that the VIJ form a resolvent.
0, q > 0)
X,12
To establish the second point, note that for every
functionfE~}
Hm pV;Vnj = lim (Vnj - V;f) = 1>-+0
Since every function g also have
210
Construction of Resolvents and Semigroups
E~}
and every n we have
o.
1>-+0
is dominated by a function of the form vnf (fE ~}), we Hm pV;g =
o.
1>-+0
Now we have seen that V1>g < V;g for everyp, whenever n is large enough so that an is equal to 1 on the support of g. We thus have a fortiori Hm p V1>g =
,
o.
jl->-O
\
This extends to functions g E ~t by uniform convergence, in view of the inequality lip V 1> I1 < 1. Let then f be an element of~}; the function g = Vf belongs to ~t and we thus have Hm pV1>Vj = o. 1> ..... 0
But we have p V1> Vf = Vf - V1>! [an obvious passage to the limit starting with (11.3)]. We therefore have also lim V1>f = VI, 1>-+0
which concludes the proof.
3. Construction
cif Semigroups
12 The construction of the previous section allows us to associate a sub-Markov resolvent with every "nice" kernel, which satisfies the complete maximum principle. We are now going to give sufficient conditions for such a resolvent to be associated with a sub-Markov semigroup. The essential tool for the construction of the semigroup is the Hille-Yosida theorem, which we state here without mentioning infinitesimal generators, a subject which the reader can find treated in the following works: Dunford and Schwartz (67), Hille and Phillips (77), Yosida (121), and also Loeve (89), which has the advantage of giving an introduction to the Russian work on infinitesimal generators of Markov semigroups. We begin by recalling several results on semigroups and resolvents in Banach spaces. Let @J be a Banach space, ordered by a closed convex proper* cone @J+ (we take @J+ = {O} if 81 doesn't have a natural order structure). The only topology we shall consider on 81 will be the strong topology defined by the norm (written 1I • 11). An operator A on 81 is said to be sub-Markov if IIA 11 < 1 and if A is positive (Ax E 81+ for every x E 81+). A sub-Markov semigroup on 81 is a family (Tt)t>o of sub-Markov operators on @J, such that TsTt = T s+ t for every s > 0, t > O.
We always complete this definition by putting To = I, but this convention is not necessary. The semigroup is said to be strongly continuous if Hm 1;x = x
for every
X E
81.
t-+O
• We understand a proper cone (in French: cone sail/ant) to be a cone P such that P
(1
(-P) =
o.
Construction of Semigroups
211
X, T13
It can then be easily shown that the function t .A.N+ Ttx is continuous on the interval [0,00[. A sub-Markov resolvent on fJB is a family (V2»2»o of operators on fJB, such that the
operators pV2> are sub-Markov and the resolvent equation holds: for every
p
> 0, q > 0.
(12.1)
Although this expression is not classical, we say that the resolvent (V2» is strongly continuous if for every X E fJB. (12.2) This definition can be put in another, very useful, form: It follows immediately from (12.1> that the image VifJB) does not depend on p; denote it by!:». The resolvent is then strongly continuous if and only if!:» is dense in fJB. Condition (12.2), in fact, implies immediately that !:» is dense. Conversely, the relation x = Vay implies limpV2>x
=
2> .... 00
lim (VaY - V2>Y
+ qV2>Vqy) =
VqY
=
x.
2> .... 00
Relation (12.2) thus holds for every x E!:». Since the operators p V2> are sub-Markov, it holds for every x E !:», and hence for every x if !:» is dense. Let (Tt) be a strongly continuous sub-Markov semigroup on fJB; a sub-Markov resolvent on fJB can then be defined by setting for every
X E
fJB.
Let x' be a continuous linear functional on fJB, orthogonal to !:». It can easily be verified that (x,x' ) = Hm (p V2>x,x') = 0. 2> .... 00
We thus have x' = 0, and!:» is dense in fJB from the Hahn-Banach theorem. The resolvent (V2» is hence strongly continuous. We call it the resolvent of the semigroup (Tt). Here then is the Hille-Yosida theorem. The proof we give is borrowed in large part from Yosida (121) and Neveu. * We shall only indicate the steps, leaving the verification of details to the reader. ...
Tt3 THEOREM Let (V2» be a strongly continuous sub-Markov resolvent on fJB. There then exists a strongly continuous sub-Markov semigroup (Tt) with (V2» as its resolvent, and this semigroup is unique. Proof We begin by supposing that there exists an operator V such that, for every p
> 0, (13.1)
It can then be easily verified that V(fJB) = !:». We also set !:»2 = V2(fJB); we then have !:»2 = V;(fJB) for every p 0, and the relation
>
Hm (p V2»2x
=
X
for every
X E
fJB
2> .... 00
• See "Theory of Markov Semigroups," University of Calif. Publications in Statistics, 2 (1958), 319-394.
X, TB shows that
Construction of Resolvents and Semigroups !?)2
212
is dense in f16. Note also the relation
(a) For every p
(I
> 0 set
+ pV)(I -
(13.2)
pV1J) = I.
A1J = p(pV1J - I), T:1J)
= exp (tA1J) =
e-1Jt exp (tp . pV1J).
It can easily be verified that the operators T~1J) constitute a strongly continuous sub-Markov
semigroup (we even have lim 1 Tt(1J) - III = 0). t--+O
We are going to show that these semigroups converge to the desired semigroup (Tt) whenp-+ 00. (b) With this in mind, note that the formula
~ A 1J
dp
= - (p v:
1J
is a consequence of the formula: V1J It then follows that rp
=-
-
1.. A2 p2
1)2 = -
1J
Y; (which comes from the resolvent equation).
~ 7:(1J) = _ ..!..- T(1J)A 2 t 2 t 1J. dp p If x belongs to !?)2 we have, since x is of the form y 2y,
.E:..- T(1J) t X dP
.!...2 T(p)(p v:p)2y. t
2 2 .!....2 T(lI) t A 1J V y = -
= -
P
P
The norm of the latter is at most equal to t Ily 1 jp 2, and we thus obtain, by integrating,
1I T~1J) x
T~q) xii < t .! _.! . Ilyll.
-
p
q
We can hence set Ttx = liIllp--+oo T~p)x for every x E !?)2. Since the operators T~p) are sub-Markov, this limit also exists for every x E !!)2 = f1l, and defines a sub-Markov operator Tt on f1l. The function t JV'.I+ Ttx is the uniform limit of the functions t JV'.I+ T~p)x on every compact interval of R+, when x belongs to !?)2. It is therefore continuous on !?)2' and hence on !?)2 = f1l by passage to the limit. The relation T~p)T~p) = Ts~~ then passes to the limit, and it follows that (Tt) is a strongly continuous semigroup, for which we still must find the resolvent. (c) We have d - T(p)
dt
t
= -d
dt
exp (tA )
=
T(p) A
1J
t
1J
and consequently, all of the operators T~1J), A 1J , V1J commuting, d .. - T(1J) t
dt
vi = T(p) A t
1J
Vx
=-
T(1J)(pV: )x t 1J
=
-pV: T(1J) x. 1J t
213
X,014
Construction of Semigroups
This derivative thus converges to - Ttx when p ~ 00, the convergence being uniform on every compact interval of [0,00[, and we obtain the formula d
-~Vx= -~x.
dt
Denote the resolvent of (Tt) by (Wp ). Integrating the above formula by parts, it follows that Wpx =
J:
00
e-Pt~x
dt = -
o
f.oo e- pt -d (~Vx) dt dt
0
= Vx - p f.ooe-Pt~vx dt,
or Wp(I + P V) = V. Now V p satisfies an analogous formula, and the operator I + P V is invertible, from (13.2). We thus have Wp = Vp as desired. (d) Let (TD be a second strongly continuous semigroup which has (Vp ) as its resolvent, and let x' be an element of the dual f!J' of f!J. The two continuous functions t AN+ (Ttx,x' > and t AN+ (T;,x' > have the same Laplace transform and are hence equal. The equality of the two semigroups can now be deduced. It remains for us to free ourselves of the auxiliary hypothesis concerning the existence of V. The proof of uniqueness given above clearly is independent of this hypothesis. Take then an arbitrary strongly continuous sub-Markov resolvent (Vp)p>o, and consider for every s > 0 the resolvent (Vs,p)p>o defined by
These resolvents satisfy the auxiliary hypothesis. There thus exists for each of them a strongly continuous sub-Markov semigroup (Ts,t) such that
Now the semigroup (e-(S-r)t Tr,t) also satisfies this relation for every number r thus have, from the uniqueness established above, T s,t = e-(s-r) t T r,t
(0
E
[O,s]. We
< r < s).
It then follows that the semigroup (est Ts,t) does not depend on s, and is strongly continuous and sub-Markov: We denote it by (Tt). The resolvent of (Tt) clearly is (Vp).
The Hille-Yosida theorem will allow us to complete the construction of the semigroup associated with a kernel that satisfies the complete maximum principle. We begin with a definition.
f:
D14 DEFINITION Let E be a locally compact, a-compact space and let (Pt)teR+ be a semigroup of sub-Markov dispersion-kernels on E. We say that (Pt) is a Feller semigroup if: (1) Each kernel Pt is continuous and tends to 0 at infinity. ~ (2) Po = I, andfor every functionfE ~o(E)Jimt-+o Pt! = f niformly on E.
A
Such a semigroup is not necessarily measurdte (in the sens f IX.39) when E is given the a-field f!Ju(E). A resolvent can, however, be associated with it in the following manner.
X, TI5
Construction of Resolvents and Semigroups
214
(a) Let f be an element of CC%; the mapping t ~ Ptf of R+ into CCo is bounded and continuous, which allows us to set
(integrating in CCo).
Vflf= f.ooe-fltptfdt
We thus define a positive linear mapping of CC% into ~o (which extends moreover to an operator on CCo, of norm at most equal to lip). It then follows from IX.T11 that the mapping f ~ Vflfis the restriction to ~% of a dispersion-kernel Vfl on E (continuous, and tending to 0 at infinity). (b) Let £ be the collection of bounded Bore! functions f such that the function t -A.J\t+ (p"Ptf) is Borel for every bounded Radon measure p, on E, and such that the following relations are satisfied.
<,.,V.f) =
fe-"(P,Pd) dt
Vflf = Vqf + (q - P)VflVqf
(p
(p
> 0)
> 0, q > 0).
(14.1)
(14.2)
£ contains the bounded, positive, l.s.c. functions (an obvious passage to the limit starting with the continuous functions). It then follows from I.T20 that £ contains all bounded Borel functions. (c) Let f be a bounded universally measurable function, and let f' and f" be two bounded Borel functions such thatf' < f < f", and (p,v'p,f') = (p,Vfl,f"). The function t -A.J\t+ (p"Ptf) is then bounded between two Borel functions that are equal a.e.; it is thus Lebesgue measurable, and relation (14.1) is still true. It can be verified in the same manner that (14.2) is true. Here then is Hunt's theorem.
-+
T15 THEOREM Let V be a continuous dispersion-kernel on E, which tends to 0 at infinity and satisfies the complete maximum principle. * Suppose that the image of ~%(E) under V is dense in ~o(E). There then exists a Feller semigroup (Pt) on E such that Vf =
f.oo Ptf dt
(15.1)
for every positive universally measurable function f This semigroup is unique. Proof We saw in No. 11 that there exists a sub-Markov resolvent (Vfl) on the Banach space ~o(E) such that Vf = limfl~o V flf for every function f E CC%(E). The relation Vf= ViI
+ pV)f
(fE~%)
shows that the image of ~0 under V fl contains the image of ~% under V. The resolvent (Vfl) is therefore strongly continuous, and from T13 there exists a strongly continuous sub-Markov semigroup (Pt) on ~o such that for every p > 0 and every x E E,
v"r =
fe-.tpJ"dt
Relation (15.1) is then deduced, whenfbelongs to
(fE 'C}). ~},
by lettingp tend to O.
• In other words, from IX.Tll and X.T4, a positive linear mapping V of r;% into '6'0 such that the relation + VpJ > Vg3: on {g > O} (a a positive constant, f e ~}-, g e ~~) implies the same inequality for every x.
a
Construction of Semigroups
215
The mappingf JW+ Ptf is, from IX.Ttt, the restriction to Pt (continuous, tending to 0 at infinity). The relations
X,16 ~o
of a sub-Markov kernel
Vf= f.\f dt and can then be extended to universally measurable functions, as in No. t4. The existence of a Feller semigroup satisfying (15.1) is thus established. Let (P;) be a second Feller semigroup having the same property, and let (V;) be its resolvent. It follows from (15.1) that V~ = V, and from T8 that V; = V p for every p. Let then f be an element of ~J("; the continuous functions t J\III'+ Pt/x and t ~ P;/x have the same Laplace transforms and are hence identical; so that the kernels Pt and P; are themselves equal. Remark It can be shown that (Pt) is the only semigroup of sub-Markov kernels that satisfies (15.1) and such that, for every function f E ~J(", the function (t,x) J\III'+ Ptfx is measurable with respect to the a-field 88(R+) x 88iE). We shall not prove this result. Passage from the sub-Markov case to the Markov case t6 Suppose that the semigroup (Pt) we have constructed is Markov. We shall see later that, using probabilistic methods, the potential theory relative to the kernel V can be studied in a very detailed manner. These methods are not directly applicable in the subMarkov case, where we have to reduce to the Markov case by the following method. Let (Pt) be a Feller semigroup of sub-Markov kernels on a measurable space (E,C). Adjoin to E an additional element 0, put E U {o} = E', and denote by C' the a-field generated by C and the set {o}. Define then kernels P; on (E',C') by setting P't(x,A) = Pt(x,A) for x P~(x,{
oD =
P~(o,A) =
E
E, AcE, A
1 - Pt(x,E) for x
IA(o)
(A
E
E
E;
E
C; (16.1)
C').
It is trivial to verify that we thus obtain Markov kernels on (E',C'), which again constitute
a semlgroup. We adopt the following very important convention: We identify every function defined on E with its extension to E' which vanishes at the point o. It is clear with this convention that Pt/ = P;f for every function f defined on E and, iff is defined on E', Pd = !(o)
+ Pt(! -
j(o)).
(16.2)
A sub-Markov resolvent (VJI) on (E,tC) can be extended in the same way to a Markov resolvent (V;) on (E',C'), by putting
pV;! =
j(o)
+ PYv(! -
j(o)).
(16.3)
It will be noted that if (VJI) is the resolvent of (Pt), then (V;) is the resolvent of (P;). Let us next consider the case where E is a locally compact, a-compact space, and where the sub-Markov semigroup (Pt) is a Feller semigroup on E. E' can then be considered to be the Alexandrov (one-point) compactification of E, 0 being the point at infinity (an
X, 17, D18, TI9
Construction of Resolvents and Semigroups
216
isolated point if E is compact). In this case the semigroup (P;) is a (Markov) Feller semigroup on E'. Analogous considerations apply to resolvents that take elements of CCo(E) into CCo(E). Ray resolvents
17 Let E be a locally compact, a-compact space, and let (Vp ) be a sub-Markov resolvent on the Banach space CCo(E). The hypothesis of strong continuity on this resolvent plays an essential role in the construction of the semigroup (Pt) associated with (Vp ) through the Hille-Yosida theorem. We seek now to replace strong continuity by a less restrictive condition, following Ray (106). The results that follow will not be used in later chapters. From No. 16 above, we lose no generality in limiting ourselves to the study of Markov resolvents (V p) on a compact space E with kernels that leave the space CC(E) invariant. We note first that for each p > 0, if f/ p is the convex cone of continuous p-supermedian functions (IX.T45), the vector space f/ p - f/ p is independent of p. To see this, let p and q be such that 0 < P < q. Every p-supermedian function is then q-supermedian (IX.T47), and it suffices to show that every continuous q-supermedian function f is equal to the difference of two continuous p-supermedian functions. Now we have f = [f + (q - p) Vpf] - (q - p) Vpf; these two functions are continuous, and the second is psupermedian from IX.T50. It will thus suffice to show that the function h + (q - p)Vph is p-supermedian for every q-supermedian function h. This property holds when h is of the form Vqg (g > 0), since then h + (q - p) Vph = Vpg; it thus holds for every excessive function h from IX.T64. We conclude finally by noting that, for every p, a function is p-supermedian if and only if it is equal almost everywhere to a p-excessive function, from IX.T60. We can now pose the following definition.
DI8
Let (Vp) be a Markov resolvent consisting of continuous diffusion-kernels on a compact space E. We say that (Vp) is a Ray resolvent if the cone f/ q of continuous q-supermedian functions separates the points of E for some q > O. DEFINITION
The condition then holds for every q > O. The cone f/ q is closed under the operation A; the space f/ q - f/ q is thus closed under the operations V and A , contains the constants, and separates the points of E. It is then dense in CC(E) by the Stone-Weierstrass theorem. TI9 THEOREM (Ray) Let (Vp) be a Ray resolvent. There then exists a unique measurable semigroup (Pt) on the measurable space (E,gjo(E» with (Vp) as its resolvent, which has the following property: The function t ~ Ptf X is right continuous for every function f E CC(E) and every x E E. Proof Let q be a number> 0, and let J be the closure in CC(E) of the image space Vp(CC(E». J is a Banach space, invariant under the operators V p, on which the resolvent (Vp ) is strongly continuous. There thus exists a strongly continuous sub-Markov semigroup (Pt) on J such that
(p> O,!E J).
(19.1)
The function I belongs to J, and Vpl = lip. It follows from the uniqueness of Laplace transforms that Ptl = 1 for every t ~ O.
Construction 01 Semigroups
217
X, T19
LetIbe an element of !7q' The functions p V'P+ql increase to the q-excessive regularization lof/whenp-+ 00 (IX.T46 and T60). We set (19.2) Pt! = Hm Pt(pV'P+qf), 'P-+ 00 and in particular Pol = J. We next extend the mapping I .J\I\t+ P tlto!7q - !7q by linearity. To show that the mapping t.J\l\t+ Ptlx is right continuous (and free of oscillatory discontinuities), we begin with the case where I is of the form Vqg, g E ~+(E). We then have oo
e-qtPtf = e-qtpt(f. e-qSPsf dS)
=
f.oo e-qSPsg ds.
The function t.J\l\t+ e-qtPtlx is thus continuous and decreasing. Suppose next that/belongs to !7q; the functions V'P+ql are of the preceding type, taking for g the positive function (I - qV'P+ql)· It follows from (19.2) that the function t.J\l\t+ e-qtPtlx is decreasing and l.s.c.-i.e. right continuous. The stated result is then clear by linearity when I belongs to !7q - !7q. Suppose next that I belongs to !7q - !7q and is positive; I is then the difference of two continuous q-supermedian functions g and h such that g > h. This inequality implies that pV'P+qg ~ pV'P+~ for every p, and hence Ptg > Pth for every t from (19.2). The relation I ~ 0 thus implies Ptl > O. We note finally that the function Ptf, when I belongs to !7q' is the upper envelope of an increasing sequence of continuous functions. Ptl is thus a Baire function, and this result extends to !7q - !7q by linearity. We have seen in No. 18 that the space !7q - !7q is dense in ~(E). The positivity of Pt implies on the other hand the relation IIPtll1 < IIIII (in the uniform norm). The mapping 1.J\I\t+ Ptl thus extends by continuity to a linear mapping, of norm 1, from ~(E) into ~(E), the space of bounded, 86o(E)-measurable functions on E [where 86o(E) is the Baire a-field on El. It can then be shown, as in IX.II-12, that the mappings Pt so defined are the restrictions to ~(E) of Markov kernels on the measurable space (E,86o(E)), which we denote by the same symbols. We show that the following three properties hold for every bounded Baire function I: (s ~ 0, t
> 0);
(19.3)
(b) The function (t,x).J\I\t+ Ptlx is measurable with respect to the a-field 86(R+)
(c)
V.I =
X
86o(E);
f.oo e-"'P,f dt
(p
> 0).
These three properties are indeed true when I belongs to J: (a) and (c) from the HilleYosida theorem, and (b) from the strong continuity of the semigroup, which implies that the function (t,x) .J\I\t+ Pt/x is continuous. They extend then to the case where I belongs to !7q from (19.2) and Lebesgue's monotone convergence theorem, then to ~(E) by linearity and continuity. The space :Ye of bounded Baire functions, which satisfy (a), (b) and (c), contains ~(E), is closed under passage to monotone limits, and therefore includes all bounded Baire functions from I.T20. The existence of the desired semigroup is thus established. Let (P;) be a second semigroup which has the same properties, and let I be an element of CC(E); the functions t.J\l\t+ ptr and t.J\l\t+ P;!X are right continuous and
X, 020, T21
Construction of Resolvents and Semigroups
218
have the same Laplace transforms. They are hence equal, and we see then that the kernels Pt and P~ are equal. In Chapter XI we study concepts analogous to those which we introduce now, following Ray. We keep the notation of the preceding numbers. D20 DEFINITION We say that the point x E E is a branching point for the Ray resolvent (VI}) if there exists a number p > 0 and a positive measure # of mass 1, distinct from ex, such that (20.1) (#,f> < f(x) for every function fE //1.1. TIt THEOREM The following properties are equivalent: (a) x is not a branching point for (VlJ); (b) exPo = ex (c) lim Ptfx = f(x) for every function f E ~(E) t-+O
(d) lim qVq / x = f(x) for every function f E ~(E). q-+oo
Proof We have from formula (19.2) for every functionfE
Po/x = limqVlJ+q flll
//1.1'
< f(x).
q-+oo
The measure exPo thus satisfies inequality (20.1), so that (a) implies (b). The implication (b) => (c) follows from the right continuity of the function t ~ Ptf X for t = O. The implication (c) =>(d) is a well-known property of Laplace transforms, and it only remains to show that (d) implies (a). Let # be a positive measure of mass 1 such that #(f) < f(x) for every function f E // 1.1. Let g be a continuous function with values in [0,1]. Apply # to both sides of the equality 1 = pVlJg
+ pVi 1 -
g).
The two potentials belong to // 1.1' so that (#, VlJg> = V lJtJ:. We thus have (#,!> = lex) for every functionfEJ", from the relation! = limq -+ oo qVlJ+q! Now wehave!(x) =f(x) from (d); the relation! < f then gives us (#,f> > f(x) for every function f E // 1.1. Since the measure # satisfies (20.1), this inequality can be replaced by an equality. Since the space // 1.1 - / / 1.1 is dense in ~(E), we have # = ex and property (a) follows.
CHAPTER
XI
Convex Cones and Extremal Elements
The main object of this chapter is to prove Choquet's fundamental theorem on integral representations in compact convex sets. This theorem could be considered now as a particular case of a theory of "balayage" defined by a convex cone of continuous functions on a compact set. Since this general theory has as yet no other important applications, we preferred to present it after Choquet's theorem, in Section 3, in order not to impose it on the reader interested only in convex cones. In Section 1 we have grouped a few auxiliary results on compact sets, which are not all indispensable for our purpose, but which are sometimes hard to find in the literature. All vector spaces considered in this chapter will be supposed real.
1.
Compact Convex Sets
Sublinear functions
Dl DEFINITION Let E be a vector space. A real-valued function p defined on E is said to be a sublinear function if it is subadditive: p(x
+ y) < p(x) + p(y)
(x, Y E E)
and positive homogeneous:
p(AX) = Ap(X)
(x
E
E, A > 0).
A linear functional f on E is said to be dominated by p if f(x) < p(x) for every x We shall need the following form of the Hahn-Banach theorem.
E
E.
T2 THEOREM Let E be a vector space. p a sublinear function on E, Fa subspace of E, and f a linear functional on F such that
f(x)
< p(x)
for every x
E
F.
There then exists a linear functional g on F, which is dominated by p and extends f. 219
XI, T3, 4
220
Convex Cones and Extremal Elements
Proof* Let fJJ be the set of all ordered pairs (h,H), where H is a subspace of E containing F, and h a linear functional on H, dominated by p on H and extending f: fJJ is clearly nonempty, and inductive for the order relation < defined by:
< (h',H»~ (H c
«h,H)
H' and h' extends h).
Let then (g,G) be a maximal element of fJJ (from Zorn's lemma); the theorem will be established if we show that G = E, which follows immediately from the next lemma. Let (h,H) be an element of fJJ, a an element of E that does not belong to H, and H' the vector space H E8 Ra. Let h' be the linear functional on H' defined by
LEMMA
+
h'(x
=
ra)
Then h' will be dominated by p on H'
h(x)
+
(x E H, rE R).
rA
if and only if
sup [h(x) - p(x - a)] a;EH
< A < inf [p(y + a) -
h(y)].
'YEH
These two conditions are always consistent. Proof of Lemma Since the two functions h' and p are positive-homogeneous, h' will be dominated by p if and, only if h'(x
+
a) = h(x)
+
A
+
a)
h'(x - a) = h(x) - A
for every x
E
H. The consistency of the two conditions follows from the inequality:
h(x)
+
h(y) = h(x
which gives us, for every x
E
+ y)
+ y)
+ p(y +
a)
H and every y E H,
. h(x) - p(x - a)
< p(y + a) -
h(y).
We illustrate the preceding theorem by proving a result on the extension of positive linear functionals, which Choquet (40) has used to study the moments problem. This theorem can be omitted with no inconvenience. T3 THEOREM Let E be an ordered vector space, H a subspace of E, and H the vector space consisting of the elements of E that are dominated by and dominate elements of H. Every positive linear functional h defined on H can be extended to a positive linear functional on H.
Proof The function p defined on
H by
p(x) = inf h(u)
(x E 11)
uEH u2::a;
is sublinear and equal to h on H. There thus exists a linear functional h' on H, which is dominated by p and extends h. We have p(x) < 0 for every x < 0 belonging to H, and hence h'(x) < O. The linear functional h' is thus positive. 4 Here now are several remarks on the subject of Theorem 2 (we keep the notation of that theorem). • Borrowed from Dunford-Schwartz (67) p. 62. See also No. 4(b) below.
221
XI, 5, T6
Compact Convex Sets
(a) Let x be a point of E; denote by F the vector space Rx, and by f the unique linear functional on F such that f(x) = p(x). The sublinearity of p then implies the relation o
fact which implies the following result: Any sublinear function p is the upper envelope of the linear functionals dominated by p.
Here is a very important direct consequence: There exists a unique linear functional dominated by p if and only if p itself is a linear functional. (b) Suppose now that E is a locally convex topological vector space (LCS) and that the sublinear function p is continuous. Denote by E' the product LCS R x E, by A' the convex open set {(t,x) E E': t > p(x)} (the set of points of E' situated above the graph of p), and by M ' the linear manifold {(t,x): x E F, t = f(x)} (the graph of f). There then exists, from Theorem 1, p. 69 of Bourbaki (16), a closed hyperplane H', which contains M ' and does not meet A'. It can be verified directly that H' is the graph ofa continuous linear functional on E which is dominated by p and extends f
Theorem 2 can be proved by this reasoning, since any vector space can be given the finest locally convex topology (under which all finite convex functions are continuous). Compact convex sets 5 Let E be a LCS, and K a compact convex subset of E. We denote by d the vector space consisting of the continuous affine functions on K, i.e., of the continuous functions f from K into R such that tf(x)
+ (1
- t)f(y) = f(tx
+ (1
- t)y)
(x,y E K, t E [0,1]).
The subspace of d consisting of the restrictions to K of the continuous affine functionals on E separates the points of K; hence this is also true, even more so, of d. We denote by [/ the convex cone in CC(K) consisting of the concave continuous functions. This cone is closed under the operation A.; the vector space [/ - [/ is thus closed under the operations A. and v, contains the constants, and separates the points of K. [/ - [/ is therefore dense in the Banach space CC(K), from the Stone-Weierstrass theorem. * Let E' be the dual of E. The topology induced on K by the weak topology a(E,E ' ) on E is Hausdorff and coarser than the initial topology on K. These two topologies are thus the same. We now establish two results borrowed from Mokobodzki (103). Note that T6 can be deduced from T7 and Dini's lemma. T6
Every continuous affinefunction f on K is the uniform limit on K ofcontinuous affine functions on E. THEOREM
Denote by F the LCS R x E. Set a = inf:l:EKf(x), and consider the following two subsets of F: u= {(t,x)ER x K: a - e < t
Prooft
• See Bourbaki (16).
t This proof was communicated to us by R. Phelps.
XI, T7
Convex Cones and Extremal Elements
222
where e is a number >0. These two sets are disjoint, the first is compact convex, and the second is convex closed. There hence exists a closed affine hyperplane H* which separates U and V; H cannot be parallel to the line R X {O} since every hyperplane of this type that intersects U also intersects V. H is hence the graph of a continuous affine function g, which is within e off on K. Let B be a Banach space and let B' be the dual of B, with the weak topology. Take for K the unit ball of B', and consider a linear functional f on B', which is weakly continuous when restricted to K. f is the uniform limit on K of a sequence (fn)neN of weakly continuous linear functionals on B'. These functionals arise from elements of B, and ~ a Cauchy sequence in B; it then follows that f itself arises from an element of B. We have thus established an important theorem due to Banach [see, e.g., Bourbaki (17), p. 74; Dunford and Schwartz (67), p. 428]. Several analogous results follow in the same way from Theorem 6. The expression "g is strictly dominated by f" in the following statement means that g(x) < f(x) for every x E K. Example of an application
"
prt.n1.-
T7 THEOREM (a) Let f be a finite convex l.s.c. function defined on K. Denote by d, the set of restrictions to K of continuous affine functions on E, which are strictly dominated by f on K; we then have (7.1) f= sup g. ge.9l1f
(b) Suppose in addition that f is affine. The set d, is then filtering to the right. (c) Let f be a finite convex u.s.c. function defined on K; the set re, of continuous convex functions on K which strictly dominate f is then filtering to the left, and
f= inf g.
(7.2)
ge~,
Proof Using the notation of the proof of T6 we set, under hypothesis (a), W
= {(t,x) E R
X K:
t
> f(x)}.
W is then a closed convex subset of F = R X E; for every point z = (s,y) of R x K such that there exists a closed hyperplane H, which strictly separates z from W. This hyper-
plane cannot be parallel to R x {O}; it is hence the graph of a function g E d" and relation (7.1) follows. Suppose next thatfis affine; to establish (b), we recall that the convex hull of the union of two convex compact subsets Band B' of F is compact [being the image of [0, I] x B x B' under the mapping (t,x,y) ~ tx + (1 - t)y]. Let then hand h' be two elements of d, and a a constant dominated by hand h' on K. Denote by B the compact convex set {(t,x) ER x K: a < t < h(x)}, and by B' the analogous set with h replaced by h'. The reader can verify directly from the fact that f is affine that the convex hull C of B U B' is disjoint from W. Since C is compact there exists a closed hyperplane, which strictly separates C and W; this hyperplane is the graph of an affine function g that dominates hand h' and is strictly dominated by f on K. Statement (b) is thus established. Suppose finally thatfis convex u.s.c. Let g and g' be two convex bounded l.s.c. functions on K that strictly dominate f We shall show that there exists a function h E re, dominated
* Bourbaki (16), Chapter 1I, Section 3, Prop. 4, p. 73; Dunford and Schwartz (67), V. 2.7, Theorem 10, p.417.
Compact Convex Sets
223
XI, 8
by g and g'-this will imply in particular that re! is filtering to the left. Let a be a constant that dominates g and g'; denote by B the compact convex set {(t,x) E R X K: g(z)
< t < a},
and by B' the analogous set with g replaced by g'. Let C be the convex hull of the union B U B', which is compact. Set k(x) = inf {t: (t,x) E Cl. This function is convex l.s.c. and strictly dominates f It hence is equal, from (a), to the upper envelope of the set ..#k' Denote by :Ye the set of functions of the form ho V hI V ..• V hn
(n EN; ho, ... , hn E ..#k) ;
:Ye is a family of continuous functions, which is filtering to the right. There then exists from X.T6 a function h E :Ye, which strictly dominates f
It only remains to show that the lower envelope of re! is equal to f It suffices from the above to construct for each point x E K and each t > f(x) a convex l.s.c. function g t.aJ on K, which strictly dominates f and is such that g t.ix) = t. Let us thus choose a number b, which dominates f on K (such a number exists since fis u.s.c. and finite), and denote byGt,aJ the convex hull of the point (t,x) and of {b} x K: Gt.aJ is a "stalactite" hanging over the graph off It then suffices to put
gt.iy) = inf {s ER: (s,y) E G t.aJ }.
Extreme points of a compact convex set 8 We denote by Jt+ (respectively, Jtt) the collection of positive (respectively, positive with unit mass) Radon measures on K. The barycenter of a measure ft E.Lt will be written b(p,). We begin by recalling, or proving quickly, some elementary properties of barycenters. (a) Let ft be a measure in .Lt with barycenter x, and let f be a convex (respectively, afJine),jinite, u.s.c. or l.s.c.function on K. We then have
f(x)
f(y) dp,(y)
[respectivelY,f(x)
=
L
l
f(y) dp,(y)
Equality of the two sides in the case where f is continuous affine is, in fact, the very definition of b(ft); one passes from there to the inequality for a continuous or l.s.c. convex function by means of T7(a) and II.T36. The case of a convex u.s.c. function then follows from TI(c) and II.T36. Equality for a l.s.c. or u.s.c. affine function is obtained finally by considering the convex functions f and - f (b) Let L be a compact subset of K. The closed convex hull of L is equal to the set of barycenters of the measures p, E .Lt, which are carried by L. This is a well-known property [see, e.g., Bourbaki (18), Chap. Ill, Section 4, Prop. 7, p.87]. (c) Every measure A. E.Lt is the weak limit of discrete measures belonging to .Lt and having the same barycenter as A.. Here (following Choquet) is the principle of the construction of such measures. Since the topology of E is locally convex and K is compact, there exist arbitrarily fine open coverings of K consisting of a finite number of convex open sets COl' CO2, ••• , COn' Put
XI, 9
224
Convex Cones and Extremal Elements
Each point Xi belongs to wi , the measure () is discrete and has the same barycenter as A, and we have IA(/) - ()(/)/ < 'YJ for every continuous function I whose oscillation on each W i is less than 'YJ. It then follows that () is very near to A in the weak sense whenever the covering is fine enough. 9 Recall that a point x
E
R is said to be extreme if it admits no representation of the form
x = ty
+ (1
- t)z
(t
E
[0,1], yE K, z
E
K)
(9.1)
other than the trivial ones for which t = 0 or 1, or y = z = x. Here are several properties of extreme points. (a) A point x is extreme if and only if the set of measures p, E.Lt such that b(p,) = x consists of Ere alone.
Suppose that x admits a nontrivial representation of the form (9.1). The measure tE1I + (1 - t)E z is then distinct from Ere and has x as its barycenter. Conversely, suppose that there exists a measure p, E .Lt distinct from Ere' and such that b{p,) = x. Let y #: x be a point in the support of p, and let V be a closed convex neighborhood of y, which does not contain x; we then have 0 < p,(V) < 1. Put V' = K",-V, A = Iv . p,1p,(V), A' = Iv' . p,1p,(V'), y = b(A), z =b(x'), and t = p,(V). The point x then admits the nontrivial representation x = ty + (1 - t)z, and is hence not extreme. (b) Let Y be a compact subset of K, and Z the closed convex hull of Y. Every extreme point of Z then belongs to Y. Every point x of Z is, in fact, the barycenter of a measure p, E .Lt carried by Y [No. 8(b)]. If x belongs to Z"'- Y this measure is distinct from Ere' and x cannot be extreme in Z from No.9(a). (c) We call (following Choquet) every intersection of K with an open half-space of E a slice of K. Choquet has established the following property: A point x E K is extreme if and only if the slices of K containing x form a fundamental system of neighborhoods of x. Suppose first that x is not extreme, and consider a nontrivial representation of the form x = ty + (1 - t)z. Every slice of K containing x contains at least one of the points y,z. Let then V, W be neighborhoods of x, which do not contain y,z, respectively; the neighbor-
hood V n W contains no slice of K containing x. Conversely, let x be an extreme point of K. Since the topology of K is induced by the weak topology a(E,E'), every neighborhood of x contains a finite intersection TnT.n···nT I 2 n
of slices of K, which contain x. Let Cb C2 , ••• , Cn be the (compact and convex) complements of Tb ... , T n , respectively. Since the point x is extreme, and does not belong to the union Cl U C 2 U ... U Cn' it cannot belong to the convex hull C of this union. Now C is compact; there thus exists a closed hyperplane H, which strictly separates C from x, and H determines a slice T of K, containing x and contained in T I n T 2 n ... n Tn •
225
XI, DI0, Tll, T12
Compact Convex Sets
The Krein-Milman theorem
When the compact set K is metrizable, Choquet's theorem can easily be proved without relying on the Krein-Milman theorem (we shall do this in Section 2). However, we will need the Krein-Milman theorem to prove the Choquet theorem in the nonmetrizable case. We follow the development of Bauer (2). DI0 DEFINITION A face* of the compact convex set K is any nonempty convex subset F of K, which has the property Every measure p, E.At such that b(p,)
E
F is carried by F.
(10.1)
Let x be a point of K; {x} is then a face if and only if x is extreme. Tll THEOREM (a) Let (Fi)iEI be a family of faces of K, with nonempty intersection F. F is then a face of K. (b) Let F be a face of K and fa finite l.s.c. concave function on K. The set G ={x E F: f(x)
= inff(Y)} 1JEF
is then a face of K. (c) Every face of K contains at least one extreme point. Proof (a) The intersection of two faces of K is clearly either empty or a face of K. We can thus assume that the family (Fi)iEI is filtering to the left. Let then p, be a measure in .At such that b(p,) E F; we have fl(Fi ) = 1 for every i E I, and thus fl(F) = 1 from II.T35. F is hence a face of K. (b) The set G is nonempty and compact since f is l.s.c. Put a = inf1JEF f(Y); we have p,(f) > a for every measure p, E .At carried by F, with equality if and only if fl is carried by G. Consider then a measure A E.At such that b(A) E G; since F is a face, A is carried by F. Since the functionfis concave we have also from No. 8(a) that a = j(b(A»
> A(f).
The measure A is thus carried by G, and it follows that G is a face. The family of faces of K, ordered by inclusion, is inductive downwards by (a). Every face thus contains a minimal face (Zorn's lemma). Assertion (c) will thus be established if we show that a face F, which contains at least two distinct points y and z, is not minimal. Indeed, denote by f a continuous affine function that separates y and z; the set G of points of Fwherefattains its minimum on Fis then a face of K [from (b)], which is contained in F and distinct from F. Here are several important consequences of this theorem. The first is known as Bauer's "minimum principle." T12 THEOREM Let f be a finite l.s.c. concave function on K, which is positive at every extreme point,. f is then positive on K.
Proof The set of points of K where f attains its minimum is a face of K, which contains an extreme point from Tll(c). This minimum is thus positive. • One usually calls faces of a compact convex polyhedron KeRn the compact convex subsets F of K, which satisfy property (10.1).
XI, T13, TI4
226
Convex Cones and Extremal Elements
This property also holds when f is a finite, u.s.c., concave function: f is then, in fact, the lower envelope of the set of continuous affine functions, which dominate f, and the latter are positive from the preceding theorem. * T13
Let K and K' be two compact convex sets, u a continuous affine mapping of K onto K', and x'anextreme point of K'. There then exists at least one extreme point x of K such that x' = u(x). THEOREM
Proof It can be verified directly that the set u-1 [{x'}] is a face of K.
T14
THEOREM
(Krein-Milman)
The closed convex hull of the set of extreme points of K
is equal to K. Proof Let L be the closed convex hull of the set of extreme points. If there exists an x E ~L, there exists a continuous affine function h on E such that f(x) < and infllEL f(Y) ~ 0, which contradicts T12.
°
Convex cones with compact base It frequently happens that the compact convex set K appears as the base of a closed
convex cone C with apex 0, i.e., as the intersection of C and a closed hyperplane that misses 0. Moreover, one can always reduce to this situation by identifying K with the subset K' = {I} x K of the product vector space R x E, and then putting C = {tx: t E R+, X E K'}. It is then appropriate to modify the notation of this chapter in the following manner. (a) Every real-valued function f defined on K can be considered as the restriction to K of a unique positive-homogeneous function defined on C, which lve still denote by f We are thus led to denote by re (respectively, .91,.:7) the collection of continuous (respectively, continuous affine, continuous concave) positive homogeneous functions defined on C. (b) Every positive linear functional on the space re ("conical measure" on C) can be written f.A.N+ p(f) where p denotes a positive measure on K. We denote by Ba; (X E C) the conical measure f .A.N+ f(x); it is clear that Bta; = tBa; for every t > 0. Let p be a measure in Jt+; the point x E C characterized by the relation: f(x) = p(f) for every function fEd is called the resultant of p, written rCfl).
(15.1)
In other words we have reO) = 0, rep) = p(l) . b(plp(1» for every measure p ¥= 0, and in particular rep) = b(p) if p belongs to Jtt. The inequality (8.1) then gives us f(rCfl»
< p(f)
(15.2)
for every measure p E Jt+ and every positive-homogeneous convex function f on C, which is l.s.c. or U.S.C. (c) A point x E K is extreme if and only if the relation x = y + Z (Y,z E C) implies the existence of a number t E [0, I] such that y = tx and z = (1 - t)x, or again, if every element y of C which is dominated by x in the intrinsic ordert of C, is proportional to x.
* These results are improved in Bauer (7).
t The intrinsic order of C is defined by x ~ Y <=> for some Z E C, x
+Z = Y
(x, Y E
C).
The Choquet Theorem
227
2.
XI, D16, 17, DI8
The Choquet Theorem
We keep henceforth the notation of No. 15. The idea of proving the Choquet theorem by means of an order relation on 1+ is due to Bishop and DeLeeuw (8), but the order relation below was used for the first time by Choquet (33). D16
DEFINITION
We denote by (A.
-< the order relation on vI/+ defined by
-< ft) <=> (A.(f) > ft(f) for every function f
E
!7).
(16.1)
Remarks (a) This is indeed an order relation, since the vector space !7 - !7 is dense in CC (for uniform convergence on K). (b) The relation A. -< ft implies A.(f) = ft(f) for every functionf E d, and thus r(A.) = r(ft). (c) We have Cx -< ft for every x E C and every measure ft E..4+ such that r(ft) = x. The measures Cx (x E C) are thus the minimal elements of J(+ under the relation -< . (d) The relation A. -< ft says that ft is "nearer to the boundary of K" than A.. We seek to study the measures on K which are "as near the boundary of K as possible," i.e., the maximal measures under the order -<. These measures will be called simply maximal in what follows. (e) The positive measures ft such that A. -< ft are also called balayages of ,1.* This expression will be justified in No. 50. The following definition, which we borrow from Mokobodzki (102) (but which has been used before, in different forms, by several other authors), will play a fundamental role. A lower envelope! [J - (- j)] can be similarly defined. 17
D18 DEFINITION Let f be a function in CC. The function
inf g
(18.1)
ge!/ g?f
is called the upper envelope off, and denoted by
J.
This function is clearly positive homogeneous, concave, and u.s.c. We have f <J and 1(Y) < sUPxeKf(x) for every yE K, so thatJis bounded. For every measure A. E..4+ we can thus set p;.(f) = ,1(/) = inf A.(g), (18.2) ge!/ g?f
the second equality following from II.T36 since !7 is closed under the operation A. The following properties are immediate:
f=J if fE!7;
(if) =
if
for every function f ~
(f + g)
<J + g
E
CC and every
(f,g E CC).
It then follows that the functionf ~ p;.(f) is sublinear on CC.
* That is, "swept out" measures.
t
> 0;
XI, T19-T22
Convex Cones and Extremal Elements
228
T19 THEOREM Let A. be a positive measure. The linear functionals on~, dominated by the sublinear function P;.' are identical with the positive measures on K, which are balayages of A.. Proof Let fl be a balayage of A.; for every function f E ~ and every function g E f/ such that g > f we then have fl(f) < fl(g) < 1(g), which yields the inequality fl(f) < pif) < p;.(f) by passage to the limit inferior on g. Conversely, let fl be a linear functional on ~, which is dominated by the sublinear function P;.. The relation f < 0 implies pif) < 0, and thus fl(f) < 0; fl is thus a positive measure on. K. Let f be a function in f/; then f = J, hence p;.(f) = 1(f)andfl(f) < 1(f). It then follows that fl is a balayage of 1. T20
COROLLARY
For every measure 1
E
Jt+ and every function f
E ~
p;.(f) = sup fl(f).
we have (20.1)
p.evlt+ p.>-;'
Proof The sublinear function P;. is the upper envelope of the family of linear functions which it dominates (No. 4), and the latter are the balayages of 1 from the preceding result. The existence and characterization of maximal measures
-.
T21
THEOREM
Every measure 1
E
Jt+ admits a maximal balayage.
Proof Let Jt;. be the family of balayages of 1, ordered by the relation -< ; it will suffice to show that Jt;. is inductive (Zorn's lemma). Now let i J\,f\,f+ fli be an increasing mapping of a totally ordered set I into Jt; since the measures fli are positive and have the same total mass, they admit a weak cluster point fl. We have fl(f) = limi fllf) for every function f E f/, since the mapping i J\,f\,f+ fllf) is decreasing. Since the set f/ - f/ is dense in ~, fl is the weak limit of the fli' and also the least upper bound of the fli under the order -< . This establishes the theorem. Here now is the most important result concerning maximal measures; it is due to Mokobodzki (102). -.
T22 (a) (b) (c) (d)
Let 1 be a positive measure on K. The following statements are equivalent The measure 1 is maximal; 1(f) = 1(j) for every function f E ~; 1(f) = 1(1) for every function f E - f/; The measure 1 is carried by each of the sets B t = {x E K:f(x) = lex)} (fE -!/). THEOREM
Proof We know that 1 is maximal if and only if the set of balayages of 1 consists of 1 alone, or again (T19 and No. 4) if 1 = P;.' Statements (a) and (b) are thus equivalent, and (b) clearly implies (c). We show conversely that (c) implies (a): Let fl be a balayage of 1; we have fl{f) > 1{f) for every functionfE -f/, and also fl{f) < pif) = 1{f), from (c). Since the space f/ - f/ is dense in ~ under uniform convergence on K, we have fl = 1, and it follows that 1 is maximal. Finally, the relation f < implies immediately the equivalence of (c) and (d).
1
229
The Choquet Theorem
XI, T23-T25
Maximal measures and extreme points (The metrizable case) We denote by OK the set of extreme points of K (it will be noted that this set has not yet entered the discussion). The first result does not require that K be metrizable. T23 THEOREM (a) f(x) = J(x) for every point x (b) OK = Bf
n
E
OK and every functionfE~;
fE-!/'
(c) Let A be a positive measure on K, such that every compact subset disjoint from OK is A-negligible,. A is then maximal. Proof If x is extreme every balayage of Ca; is equal to Ca; [9(a)]; (a) then follows from formula (20.1). We thus have OK c nfE-!/' Bf • Conversely, let x be a point of this intersection; the measure Ca; is then maximal from T22. Every measure p, E 1+ such that r(p,) = x is thus equal to Ca; [17(c)], and x is extreme from 9(a). Suppose finally that A satisfies the hypothesis of (c), and let f be an element of~. The function f is u.s.c., so that K"""Bf is the union of the sequence of compact sets {J - f > Iln }(n EN). These sets are disjoint from OK' their union is thus A-negligible, and so A is carried by Bf • It then follows from T22 that A is maximal. The idea of using a strictly convex function in the proof of the following theorem is borrowed from Bonsall (12) (this article contains a very short and elegant proof of the Choquet theorem in the metrizable case, which is the origin of the proof we give here).
J-
T24 THEOREM Suppose that K is metrizable. The set OK is then the intersection of a sequence of open sets. A measure A E 1+ is maximal if and only if it is carried by OK'
Proof Since the set K is metrizable, the space ~ (given the norm of uniform convergence on K) admits a countable dense subset. Since the space f/ - f/ is dense in ~, there exists a sequence Cfn)nEN of elements of -f/, which separates the points of K. We can suppose that all of these functions lie between - I and 1 on K; put then
f= nEN !
21nf~.
This function is also convex continuous and is linear on no open segment contained in K. Since the function J is concave, we thus have J(x) > f(x) at every nonextreme point x E K. It follows then from T23(a) that B f = OK; OK is thus the intersection of the sequence of open sets f < Iln} (n EN), and every maximal measure A is carried by OK from T22(d). Conversely, if A is carried by OK' A is carried by every set B g (g E -f/) from T23(a), and thus is maximal from T22(d). Here then is Choquet's existence theorem for the metrizable case.
{l-
...
T25 THEOREM Suppose that K is metrizable. Every point x measure fl carried by the set of extreme points of K.
E
K is then the resultant of a
Proof Let p, be a maximal balayage of Ca; (T21); we have rep,) = x, and p, is carried by OK'
The uniqueness theorem The version of the uniqueness theorem that we give is borrowed from the article by Loomis (91) [see Cartier, Fell, Meyer (24)]. We begin by establishing, following Cartier, the identity between the order -< and the "strong" order introduced by Loomis.
Convex Cones and Extremal Elements
XI, T26, 27 T26
THEOREM
230
Let A and fl be two positive measures on K; the following three properties
are equivalent; (a) A -< fl; (b) For every finite family (A i)i=l n ofpositive measures on K such that A = ~~=1 Ai, there exists a finite family (fli)i=l n ofpositive measures on K, such that Ai -< fli for
and
i = 1, . . . , n;
(c) The same statement as (b), replacing Ai -< fli by r(Ai)
= r(fli)'
Proof* Suppose that (a) holds. Let E be the product vector space ~n, and let F be the subspace of E consisting of the elements of E of the form (1,1, ... ,f) (n times, f E ~. Consider the sublinear function p on E defined by ,fn) = A1(h) + A2(!2) + ... + An(!n)' The linear functional (I, I, ,f) -A/II'+ fl(f) on F is dominated by p on F. It can thus be extended to all of E, from T2, by a linear functional dominated by p. This functional can be written (f1,h, ... ,fn) -A/II'+ #1(f1) + fl2(f2) + ... + flnCfn)' P(f1,f2'
flh ... , fln denoting linear functionals on ~. We have fllf) < Ai(!) for every i and every function f E ~; consequently fli is a balayage of Ai from T19. Finally, we have 1i fli = fl,
and property (b) is established. Property (b) clearly implies (c). To show that (c) implies (a), consider a functionfE [/, and a number 8 > O. Cover K with a finite number of closed convex sets Wb W 2 , ••• , W m on each of which the oscillation of/is less than 8. Put ei = W i "'" Uj
A(f) =
1 Ai(f) > 1 [f(r{A i» i
- 8A i {I)]
i
from the condition on the oscillation of the (positive-homogeneous) function This last expression is equal to ~f{r(fli»
- 8A{I)
> ~ fli{f)
- 8A{I)
=
f
on
Wi•
fl(f) - 8A{I),
i
i
from (15.2). We thus have A -< fl, and the theorem is established. Let x be an element of C. We define a subdivision of x to be any finite family (X i)i=1.2. ... .n of elements of C such that x = ~:=1 Xi' The subdivisions of a measure fl E 1+ are defined similarly. The set of all subdivisions of x is partially orderedt by the relation "s is less fine than t," which we write s -l t, and which is stated, if s = (X i)i=l. ... .n and t = (yj)j=l . ... .k' as
27
(s -l t}<::>(there exists a partition of {I, 2, ... , Xi
k} into n sets J1 ,
•••
,In such that
= ~jEJiYj for all i).
* One may note that the first part of the proof can be generalized as follows: let E be a vector space, PI' P2' ... ,pn be sublinear functions on E, x' be a linear functional dominated by PI + P2 + ... + pn. One may then find linear functionals x~, x~, ... , x~ on E, dominated, respectively, by Ph P2' ... ,pn, such that x' = x~ + x~ + ... + x~. We shall not give details here, since No. 51 will yield an extension of this result to "continuous sums" of sublinear functions. t This translates the French "preordonne"; note that (s -I t and t -I s) doesn't imply s = t.
231
XI, T28, T29
The Choquet Theorem
We associate with every subdivision s = (X i )i=1 . ... .n of x the measure e s = ~:=1 erei • The relation s 1 t clearly implies e s -< et. The relation e s -< ft, where ft is an element of Jt+, is equivalent from T26 to the existence of a subdivision (fti)i=1 . ... .n of ft such that Xi = r(fti) for every i. Let S be a collection of subdivisions of x, which is filtering to the right for the relation 1 ; we say then simply that S is afiltering set. Such sets are natural objects of study in certain applications of Choquet's theory, in particular, in the theory of group representations, which was the origin of Loomis's work. Let ft be a positive measure on K, and let (fti)i=1.2 . ... .n and (ft;)i=1.2 . ... .k be two subdivisions of p; there then exist positive measures Ai; (i < i < n, 1 <j < k) such that Pi = ~i Aii for every i, and P; = ~i Aii for every j. * Denote then by Sp. the collection of subdivisions of X = rep) of the form (r(Pi»i=l . ... ,n' where (Pi)i=l . ... .n denotes a subdivision of p. It is clear from above that this set is filtering. The following theorem then summarizes the main results of Loomis's theory. T28 THEOREM (a) Let A and P be two positive measures. The relations A -< p and Sl c Sp. are equivalent. (b) Let R be a filtering set of subdivisions of x. The set of measures e s (s ER) then admits an upper bound for the order -<, which we denote by eR' We have R c S(ER); if R is of the form Sp. (p E Jt+) we have eR = p. (c) Every filtering set R is contained in a maximal filtering set (for the relation c). (d) A filtering set R is maximal if and only if there exists a maximal measure pE Jt+ such that R = S p.' This measure is then unique.
Proof Statement (a) is a simple translation of T26. Since the relation -< is a true order, we see in particular that the relation Sl = Sp. (p, A E Jt+) implies A = p. To establish (b), note that the relations s 1 t, f E!/ imply (e s ,!) > (et,f). Since the set !/ - !/ is dense in rc, it follows that the mapping s ~ es from R into JI+ admits a single cluster point in the weak topology, along the filter of sections of the partially ordered set R. This mapping thus admits a weak limit eR' and it can be verified easily that eR is the supremum of the measures e s (s ER). The relation e s -< eR for every subdivision SE R implies R c S(ER) from T26. Finally, if R is of the form Sp.' we have e s -< p for every SE R, hence eR -< p; on the other hand R = Sp. c S(ER)' so that p -< eR and lastly p = eR. The collection of filtering sets of subdivisions of x, ordered by inclusion, is clearly inductive; (c) is thus an immediate consequence of Zorn's lemma. Suppose finally that R is a maximal filtering set, and let p be a measure such that ReSp." We have R c S(ER)' R eSp" thus R = Sp. = S(ER) and finally p = eR. In other words, eR is the only measure p such that R esp.. It then follows in particular that eR is maximal. Conversely, let p be a maximal measure, and let R be a maximal filtering set containing Sp. [one exists, from (c)]. The measure eR is then maximal, we have p -< eR and thus p = eR. Consequently Sp. = R, and Sp. is indeed a maximal filtering set. Theorem 28 implies Choquet's uniqueness theorem:
-+
T29 THEOREM The following two statements are equivalent: (a) The cone C is a lattice under its intrinsic order. (b) For every point x E C there exists a unique maximal measure ft E JI+ such that rep) = x.
* Let /i (respectively, I;) be Borel functions on K into [0,1], such that Pi We then set Ail = fl;p.
= ~p
(respectively,
pi = f;p).
Convex Cones and Extremal Elements
XI, T30
232
Proof Suppose that the cone C is a lattice. Let x be an element of C, and let (X i )i=l, ... . n and (X;)i=l . ... ,k be two subdivisions of x. The "decomposition lemma" [see Bourbaki, (18) Chapter. 11, Section 1, No. 1, p. 19] implies the existence of elements Yii (1 < i < n, 1 <j < k) of C such that Xi = ~i Yii for every i, and = ~i Yii for every j. In other words the set of all subdivisions of x is filtering; it is therefore the unique maximal filtering set and (b) follows from T28. To establish the converse, we note first that the set 1~ of maximal measures is always a convex cone, which is a lattice for its intrinsic order. To see this let A and ft be two maximal measures; A and ft then charge no set B, (fE -f/; cr. No. 22). The same is then true of the measures A + ft, A " ft, and A V ft (these last symbols with respect to the natural order on 1+), measures which are thus maximal from T22. It follows that 1~ is a cone and that A " ft and A V ft are, respectively, the inf and the sup of A and ft in 1~. Now the function ft Jo..N+ r(ft) maps 1~ onto C (T21), and it is clearly additive and increasing when ..,/I~ and C are given their intrinsic orderings. Suppose that property (b) is satisfied: this mapping is then an isomorphism, and it follows that C is a lattice. We say (following Choquet) that K is a simplex if the cone C is a lattice. The reader will find other characterizations of the case of uniqueness in the article by Choquet-Meyer
x;
(44).
The following theorem will be generalized later (No. 36).
Suppose that K is a simplex. For every x E K, let ft:» be the unique maximal measure with barycenter x. The mapping x .J\I\/'+ ft:»(f) is then Borel for every function f E cc. Let A be a positive measure; the unique maximal measure ft such that A -< p is given by the formula T30
THEOREM
(f E rc).
(30.1)
Proof For every measure A E 1+ and every functionfE -f/ we have from (20.1) Pl(f)
= ACj)=
sup ()(f)
=
ft(/),
;'-<6
ft denoting the unique maximal measure such that r(ft) = reA). It follows, taking A = e:» (x E K), that
lex) = p:»(f) (fE -[/).
The function x Jo..N+ ftif) is thus u.s.c. when f belongs to - f/; it is hence Borel when f belongs to f/ - f/, and this property extends to every f E CC by passage to the uniform limit on K. Keeping the same meaning as above for A and p we have, for every function f E - f/,
ft(f)
=
A(])
=
fKl(X) dA(X)
=
fK fti/ ) dA(X).
The two sides of (30.1) therefore coincide for f E - f/, and consequently also (by linearity and continuity) for every f E CC. Maximal measures and extreme points (general case) We resume the discussion of Nos. 23-24 in order to show that the maximal measures are still, in a weak sense, "carried" by OK when K is not metrizable. The situation is, however, much less satisfactory than in the metrizable case. We follow a proof of Choquet's.
233
XI, T31, T32
The Choquet Theorem
T31 THEOREM Let fl be a maximal measure, and (!n)n?1 a decreasing sequence o/positive continuous functions on K, which converges to 0 at every point of OK; we then have limn-+oop(fn) = O. Proof We have fl( -c) = fl( -c) for every function c E
~
(T22); i.e.
fl(c) = sup fl(g). g6-!/ gS;C
Let us use this relation to construct inductively continuous functions gl' g2' ... , belonging to -/7 and satisfying the following properties (where e denotes a number> 0):
> fl(fJ - e/2; ft(gJ > ft(gl AfJ - e/4;
fl(gl)
gl
< gl Af2'
gn
< gn-l Afm
We then have the inequalities: fl(fn - gn) = fl(fn - fn A gn-l) < fl(fn-l - gn-l)
+ (fn A gn-l -
gn)
+ e/2n
It can be easily verified by induction that this quantity is less than e(1 - 1/2n ) < e. Denote by g the convex u.s.c. function infn gn; it is < 0 at the points of OK and hence everywhere (T12), and we thus have fl(infnfn) < fl(infnfn - g) < e. The theorem is established. The following theorem, borrowed from the article by Choquet-Meyer (44), is an improvement of a result of Bishop and DeLeeuw (8). We denote by r the paving consisting of the compact subsets of K.
T32
THEOREM
Every maximal measure fl is carried by every r-analytic set containing OK'
Proof We are going to begin by showing that fl is carried by every element of ra, which contains OK' Let A be such a set and B the complement of A; it will suffice for us to show that fl(C) = 0 for every compact Cc B. Now B is the intersection of a decreasing sequence (Gn)neN of open sets. Choose then a decreasing sequence (fn)neNof positive continuous functions such that fn equals 1 on C, and 0 off of Gn' The preceding theorem implies the relation fl(limnfn) = 0, and thus fl( C) = O. Set then, for every subset A of K, fl*(A) = inf fl(B). Be.Jra
B:JA
This function is a Choquet capacity with respect to the paving r , from III.T23. Every r -analytic set A containing OK is thus capacitable. Now the outer capacity of such a set equals fl(l) from above, the inner capacity is hence also equal to ft(1), and it follows that ft is carried by A. Let B be a compact Baire set disjoint from OK; the complement of B is r-analytic from II.T29: fl therefore does not charge B. The set K"".0K is thus inner negligible for the restriction of fl to the Baire a-jield of K.
XI, 33, 34, T35, T36
Convex Cones and Extremal Elements
234
Theorem 32 implies the identity of the maximal measures and the measures carried by OK when OK is a .?r-analytic set (and in particular when OK is compact). It has been possible, on the other hand, to construct examples: of nonmaximal measures carried by every .?r-analytic set containing OK [Mokobodzki, in Choquet-Meyer (44)]. of maximal measures, which charge compact subsets disjoint from OK [Bishop and DeLeeuw(8)].
33
The existence of dilations 34 The problem whose definitive solution is given below has been studied by several authors. We mention in particular Blackwell (9) (who established Theorem 36 for an interval on the line) and Sherman (108) (who proved it for discrete measures). An unpublished paper by Fell [who considered the "strong" order of Loomis (91) in place of the order -<] has been an essential stage in the solution of the problem, due to Cartier. Theorem 36 has, on the other hand, been established by Mokobodzki (unpublished). We denote by vi{ the space of measures on K with the weak topology. The mapping x ~ ea; permits the identification of K with a compact subset of vII+. We denote by D the closed convex cone of the product topological vector space vi{ x vii consisting of pairs (A,,u) such that A -< ,u. The set ~ of pairs (A,,u) E D such that 1.(1) = 1(= ,u(1)) is a compact base for D. We denote finally by Do the compact set of elements of D of the form (ea;,1J) (x E K). Suppose that K is metrizable. We say that a Markov kernel T on the Borel O'-field of K (see the beginning of Chapter IX) is a dilation on Kif r(ea;T) = x for every x E K. T35 THEOREM (Cartier) Let (A,,u) be an element of D; there exists a positive measure () on Do whose resultant (in the space vi{ X 1) is equal to (A,,u). Proof Since the theorem is clear for the pair (0,0), we reduce to the case where (A,,u) belongs to ~. All that then remains to be proved, from 8(b) is that ~ is the closed convex hull of Do, or (T14) that every extreme point of ~ belongs to Do. Now let (rx.,{J) be an element of ~, which does not belong to Do; the measure rx. is the sum of two nonproportional positive measures rx.' and rx.". There then exist from T26, two measures {J' and {J" such that rx.' -< {J', rx." -< {J", and {J = {J' + (Ju; we thus have (rx.,{J) = (rx.' ,(J') + (rx." ,(J").
The pair (rx.,{J), the sum of two nonproportional elements of Do, is hence not extreme in Do, and the theorem is established. It will be noticed that this statement did not suppose that K was metrizable. This is not true of the following theorem. T36
(Cartier) Suppose that K is metrizable; let A and,u be two positive measures on K. The relation A -< ,u is equivalent to the existence of a dilation Ton K such that AT = ,u. THEOREM
Proof Suppose first that there exists such a dilation. Let f be an element of f/; we have
p.(f)
=
t
T(x,f) dA(X)
< tf(x) dA(X) = AU),
from (15.2), and therefore A -< ,u. Conversely, suppose that A -< ,u; consider the measure () on Do associated with the pair (A,,u) by the preceding theorem, and denote by h the continuous mapping (sa;,fj) ~ x
235
The Choquet Theorem
XI, 37
of Do onto K. The compact space K is metrizable, and hence so are the image of () under h equals A, () admits a disintegration
o=
.Lt and
Do. Since
IK O. dA(X),
where «()re}lJEK is a measurable family of probability laws on Do (in the sense of I1.D13; the compact metric spaces K and Do are naturally given their Borel a-fields) such that ()re is carried by h-I({x}) for every x E K. * We denote by T re the resultant of ()1lJ (in 1); TIlJ is an element of Jlt, and the mapping X.A.f\l+ TIlJ(A) is Borel on K for every Borel subset A of K. There hence exists a Markov kernel T on K such that 8 re T = T re for every x. Since
J:
TIlJ dA(X) the resultant of () in .L X .L is (A,ft), we have K established. Theorem 36 will be generalized later (No. 52).
= AT = ft,
and the theorem is
Extension to certain cones without a compact base 37 Let E be a locally convex space, and let C be a closed convex cone contained in E, containing no line, and with apex o. We say that a point x E C, distinct from 0, belongs to an extremal ray of C if the relations x = y + Z, Y E C, Z E C imply the existence of a number t E [0,1] such that y = tx, Z = (I - t)x. One can ask under what conditions C has the following properties (which are satisfied for cones with compact base): (1) The closed convex hull of the union of the extremal rays of C is equal to C (a generalization of the Krein-Milman theorem); (2) Every point x E C is the resultant of a positive measure ft on a compact subset of C, carried in a more or less precise sense by the union of the extremal rays of C (a generalization of Choquet's theorem). The reader can consult Choquet's notes on this subject, cited in the bibliography. We limit ourselves here to indicating a procedure that sometimes allows a reduction to the theory of compact convex sets. Following Choquet we call every compact subset of C of the form {h < I} a cap of C, where h is a mapping from C into R+ U { + oo} which is linear in the following sense: h(O) = 0 h(x
+ y) = h(x) + h(y)
h(tx) = th(x)
C)
(x,y
E
(x
C, t
E
E
R+).
Since the set {h < I} is closed, the function h is l.s.c. on C. Let X o be a point of C different from 0; suppose that X o belongs to a cap {h We can suppose that h(xo) = 1. Set H
= {x: h(x) ~ I}
HI
= {x: h(x) = I}.
< I} of C.
and
• Bourbaki, (20): Chapter VI, Section 3, No. 1, Theorem 1, p. 58. The generalization of T36, which will be given in No. 52, will be proved without making use of this theorem.
XI, 37
Convex Cones and Extremal Elements
236
We shall see that the set H b although it is not, in general, compact, can play the role of a base of C for the integral representation of elements of H. We want to find the extreme elements of H. It is clear first of all that 0 is extreme, since C contains no line. We have hey) # 0 for every yE H different from 0, since if not H would contain the half-line R+y, and would not be compact. Let then y be a point such that 0 # hey) # I, and let YI be the point yjh(y) E HI; y admits the representation y = [I - hey)] . 0 + hey) . YI and thus cannot be extreme. In other words, we have hex) = 1 for every extreme point x # O. Let then x = y + z be a decomposition of x (y E C, Z E C) such that y # 0, Z # 0; set YI = yjh(y), ZI = zjh(z). We have
= h(y) . YI + h(z) . ZI; since x is extreme, we have YI = ZI = x: Y and Z are thus proportional to x, x
and it follows that x belongs to an extremal ray of C. Conversely, the reader can verify directly that every point of HI that belongs to an extremal ray is an extreme point of H. In other words, the extreme points of Hare: the point 0 and the points of HI located on the extremal rays of C. We denote by 8}] the set of these latter points. Now apply Choquet's theorem to the point X o (recall that h(xo) = I) and the compact convex set H: there exists a maximal probability measure p, whose barycenter is equal to X o' Since the function h is affine l.s.c. on H, we have (No. 8) h(xo)
= 1=
fHh(y) dp,(y).
The measure p, thus does not charge the set {h < I}. It then follows, in particular, that p, has no mass at O. It is known (T32) that the maximal measure p, is carried by the closure of 8H ; since it does not charge {O}, ft is carried by the closure of 81: X o hence belongs to the closed convex hull of 8}]. We have thus obtained a natural extension of the Krein-Milman theorem. Suppose that His metrizable. The maximal measure p, is then carried by 8H , and does not charge {O}; it is thus carried by 8}], and so we have a generalization of Choquet's theorem. Let s = (Xi )i=I.2 • . . . . n be a subdivision of Xo (No. 27); since the function h is increasing, all of the points Xi belong to H and Theorems 26 and 28 apply without modification to any two measures A and p, on H with barycenter xo. Suppose, in particular, that the cone C is a lattice; the set S of all the subdivisions of X o is then filtering, and X o is the barycenter of a unique maximal measure ft carried by H, defined by p,(f) = Hm (es,f> sES
for every positive-homogeneous and continuous function f defined on C, the limit being taken along the directed set S. It will be noted that the cap H does not enter into the formula, a fact which implies the following consequence: Let H = {h < I} and H' = {h' < I} be two caps for C such that h(xo) = h'(xo) = 1. Let p, and ft' be two measures with barycenter x o, carried, respectively, by Hand H', and maximal in these caps. We then have p,(f) = p,'(f) for every continuous and positive-homogeneous function f on C. In other words, p, and p,' define the same "conical measure" on C. Here now is the most important case to which the preceding theory applies.
The Choquet Theorem
237
XI, T38, 039, T40
T38 THEOREM Let X be a locally compact, a-compact space, and let C be a cone ofpositive measures on X, which is closed in the weak topology. Every point of C is then contained in a cap ofC.
Proof Let flo be an element of C, and let (Xn)neN be a sequence of open, relatively compact subsets of X whose union is X. We have fl(Xn) < 00 for every positive measure fl and every n EN; we can thus find a sequence (an)neN of strictly positive numbers such that
We then set, for every positive measure fl, h(fl) =
Z anfl(X n)· n
This function is clearly linear and l.s.c. on .L+(X), and the set of positive measures fl such that h(fl) < I is weakly compact. The set H = {fl E C: h(fl) < I} is therefore a cap of C which contains flo. An application of the Krein-Milman theorem Choquet has given a simple proof of the famous theorem of S. Bernstein on completely monotone functions. We reproduce this proof here, with a view to later applications. We denote by R: the open half-line R+"'{O}. Let f be a real-valued function on R:, and h be a positive number; we shall denote by tJ.hfthe function x ~ f(x + h) - f(x). The iterated difference operators tJ.h1 tJ. h2 ... tJ.hn can be defined in the same way. One can show easily that these operators commute. D39 DEFINITION Let f be a real-valuedfunction defined on R:. We say that f is completely monotone if f is infinitely differentiable and if we have for every integer p
> O.
(39.1)
The function f then is positive, decreasing, and convex.
T40
THEOREM
Let f be a real-valuedfunction defined on R:; the following statements then
are equivalent: (1) f is completely monotone. (2) The function f is positive; for every integer p (h b h2 ••• hp) ofpositive numbers, we have:
>
I, and every finite sequence (40.1)
(3) There exists a positive measure f-t on
f(x) for every x
> O.
R+ such that
= f.oo e-~t dfl(t)
(40.2)
Moreover, this measure is unique.
Proof The implication (3) => (1) and the uniqueness of fl are elementary and well-known results on Laplace transforms. To show that (1) implies (2), we start with the remark that, if g is completely monotone, then the same is true for -tJ.hK. Indeed, the operators
XI, T40 ~h,
238
Convex Cones and Extremal Elements
and DP obviously commute, and we have (-l)P DPLlh8'(x) = (-l)P(DPg(x
+ h) -
DPg(x)) = (-l)P DPHg(X
+ u),
where u belongs to the interval [O,h]; hence the first member is negative. One then easily deduces that the first member of (40.1) is a completely monotone function, and therefore a positive function. This establishes (2). We now prove that (2) implies (3), beginning with the case of bounded functions. Let us denote by C the set of all bounded functions that verify (40.1); C is a convex cone, the elements of which are decreasing and convex functions. The limit lim t -+o+ I(t) thus exists for every function lE C; we shall denote it by 1(0), and call Cl the set of all lE C such that 1(0) < 1. We provide C with the topology of pointwise convergence on R:; it is clear that Cl then is compact. On the other hand, all elements of Cl are convex, and therefore continuous, functions. The topology induced on Cl thus is equal to the topology of pointwise convergence on a countable dense set which is metrizable. One finally sees that Cl is a metrizable cap of C associated with the l.s.c. linear function I ~ 1(0). We now prove that every nonzero extremal point of Cl is an exponential t ~ e-~t (perhaps a constant). * Let indeed I be an extremal point; 1(0) is equal to 1, and I belongs to some extremal ray of C (No. 37). Let us write: I(x) = I(x
+ h) + (/(x)
- I(x
+ h)).
The functions x ~ I(x + h) and x ~ I(x) - I(x + h) belong to C; since lis extremal, they must be proportional to f, and a constant k exists, such that I(x + h) = kl(x). Taking x = 0, we find k = I(h). The relation I(x + h) = l(x)/(h) shows that I is a decreasing exponential function, or the constant 1. Let us denote by E the subset of Cl whose elements are the decreasing exponential functions, the constants 1 and 0; E is a closed set, which contains the extremal points of E. Every point of Cl thus is the barycenter of a positive measure of mass 1 carried by E (theorem of Krein-Milman); if we observe that C = U t~O tCl , we get the representation (3) for bounded completely monotone functions. Let then I be an unbounded function which satisfies (2), and let h be a strictly positive number; the function x ~ !(x + h) belongs to C, and therefore has a representation: f(x
Then we have for every k f(x
+
h
+ h) = iOOe-~t dftit)
(x
> 0).
(40.3)
> 0: +
k) =
iOOe-~te-ktdft",(t) = iOOe-~tdfth+k(t)
which implies dfth+k(t) = e-ktdft",(t) (uniqueness of Laplace transforms) It follows that the measure ft defined by dft(t) = ehtdft",(t) does not depend on h, and (40.3) then becomes f(x
We replace x
+ h) =
iooe-l,,*hltdp(/).
+ h by x and get the representation (40.2).
• In fact, one can easily prove that the decreasing exponential functions and the constant 1 are extremal elements of Cb but we don't need this here.
239
Balayage Defined by a Convex Cone of Functions
3. BalayaBe Defined by a Convex Cone
XI, D41
cif Functions
In all of the classical forbears of Newtonian potential theory certain convex cones of functions, which play a fundamental role, are seen to appear: superharmonic functions, plurisuperharmonic functions, concave functions, and excessive functions. These cones are always closed under the operation A, and the functions of which they consist are generally lower semicontinuous. One can thus imagine the "general potential theory" as the study of convex cones of functions that have these two properties. This ambitious "general theory" so far has certainly not reached its definitive form. Its main interest at this time comes from the better insight it gives into older results (Choquet's theorem, Shilov boundaries) and from having simplified their proofs. This certainly is enough to justify the study here. Given the incomplete state of the theory, it appeared sufficient for us to give the general ideas in their simplest form. We limit ourselves in particular to the study of convex cones of continuous functions. The results of this section are borrowed from Bauer (3) and Mokobodzki. D41 DEFINITION Let X be a compact space and [/ a subset ofCC(X). We denote by partial ordering on .L+(X) defined by for every
fE f/).
-<
the
(41.1)
Let A and ft be two positive measures such that A -< ft; we say then that ft is a balayage of A (relative to f/). Remarks (a) Let [/' be the closed convex cone generated by [/ in CC(X). The partial
orderings defined by [/ and [/' are the same. Two closed convex cones contained in CC(X) are the same if and only if they define the same partial ordering on .L+(X) (HahnBanach theorem). (b) Let [/ be a convex cone contained in CC(X), and let [/1 be the collection of functions of the formfl A f2 A •.• A fn (n EN, fb ... , fn E f/); the identity
shows that !/1 is a convex cone, which is closed under the operation A. Let -< and -< 1, respectively, be the partial orderings associated with !/ and [/1; the relation A -< 1 ft clearly implies A -< ft, and the relations ea; -< A and ea; -< 1 A are equivalent for every x E X. The inequalities ft(fl) < fl(X), ... , ft(fn) < fn(x) indeed imply ft(fl A f2 A ••• A fn)
< fl(X) A ••• A fn(X)
= ea;(fl A ••• A fn)·
(c) We assume henceforth that !/ contains the constant 1. The relation A -< ft then implies A(1) > ft(1); the collection of balayages of a measure A is thus weakly compact.
Suppose that X is a compact convex subset of a locally convex space, and that [/ is the collection of continuous concave functions on X. The relation -< then coincides with that which we have used in the preceding section.
Example
XI , D42--D44
240
Convex Cones and Extremal Elements
D42 DEFINITION A point x E f/ is said to belong to the boundary (of X relative to 9) there exists no balayage of ere distinct from ere'
if
The boundary will be denoted by a!/X. Remarks 41(a) and (b) imply that the boundary is not changed if f/ is replaced by the closed convex cone, closed under A, which is generated by f/.
Example Suppose that there exists afunction f E f/, which attains a strict negative minimum at a point X.~ f(x) < 0; f(y) > f(x) for every yE X""{x}. The point x then belongs to the boundary. Indeed let p be a balayage of p(f) < f(x) and p,(f) = p({x})f(x)
+f, x
ere;
we have
fey) dp,(y)
,{re}
> p,({ x })f(x) + p,(X",,{x })f(x) =
p,(1)f(x),
X""
where the inequality is strict if p charges {x}. The relation 1 E f/ implies that p,(l) < 1, hence p(1)f(x) > f(x), with strict inequality if p,(1) < 1. The comparison of these inequalities then gives p(X""{x}) = 0, p,(1) = 1, and hence p = ere'
D43 DEFINITION Let A be a subset of x; we say that A is a Shilov set (relative to 9) if the relations: f E f/; inff(x) ~ -1 (43.1) re eA
imply the inequality infreex f(x)
> -1.
Remarks (a) The set X is always a Shilov set; the empty set is a Shilov set if and only if every function f E f/ is positive. (b) Let A be a compact subset of X; definition 43 then takes the following form: A is a compact Shilov set if and only if every function f E f/ which takes on a value < 0 attains its minimum at a point of A. (c) The Shilov sets remain the same if f/ is replaced by the closed convex cone generated by f/ and closed under the operation A. D44 DEFINITION Suppose that f/ is a convex cone closed under the operation A and containing the positive constants. Let A be a Shilov set. For every function f E ~(X) we set (44.1) = inf g.
lA
1
ge!/ g?! onA
Ix·
We write in place of
lA
lA
Remarks (a) The relation f < 1 implies < 1; the relation f > -1 implies :;=: -1, from the fact that A is a Shilov set. The function is thus bounded for every function f E ~(X); since it is upper semicontinuous, it is integrable for every measure A E .L+(X). Since the cone f/ is closed under A we have (AlA)
=
lA
inf (A,g).
ge!/ g?! onA.
(b) We set
PA,A (f)
=
(A,lA)'
(44.2)
241
Balayage Defined by a Convex Cone of Functions
XI, T45-T48
This function is finite and sublinear on ~(X). We then have the following theorem, valid under the hypotheses of D44, which generalizes T19. T45 THEOREM Let A be a compact Shilov set. The linear functionals on ~(X) dominated by the sublinear function pA,A are identical with the balayages of A carried by A. Proof Let ft be a balayage of A carried by A; we have ft(f) < ft(g) < A(g) for every function g E f/ which dominates f on A, and hence ft(f) < PA.if). Conversely, let 4> be a linear functional on ~(x) dominated by PA.A; the relation f < implies PA.if) < 0, and hence 4>(f) < 0: 4> is thus a positive measure. The relations f > 0, f = on A, imply PA.if) < 0, and hence 4>(f) < 0. Thus 4> is carried by A. Finally we have PA,if) < A(f) if fbelongs to f/, hence 4>(f) < A(f), and 4> is a balayage of A.
°
T46
COROLLARY
°
We have*
sup
PA ;.(f) =
ft(f).
A-<,p JI.
carried by A
(An immediate consequence of Nos. 4 and 45). Here is another consequence of Theorem 45. It was proved in Choquet-Deny (41) (by another method). T47 THEOREM Let f/ be a closed convex cone contained in ~(X), and containing the positive constants. The following two conditions are equivalent: (a) f/ is closed under the operation A; (b) There exists a family of pairs (Ba:i,fti)iEb where Xi denotes a point of X, and fli a positive measure on X of mass at most equal to 1, such that f/
= {f E ~(X):
f(x i)
< flif)
for every i El}.
(47.1)
Proof The collection of functions defined by the right side of (47.1) is clearly a closed convex cone, closed under the operation A, and containing the positive constants. Conversely, let f/ be a cone having these properties; denote by -< the partial ordering associated with f/, and by f/' the collection of functions f, which satisfy the inequality ft(f) < f(x) for every x E X and every measure ft, which is a balayage of Ba:' Since the cone f/' satisfies condition (b), it suffices to show that f/ = f/'. Now let -<' be the partial ordering associated with f/'; the relations Ba: -< ft and Ba: -<' ft are clearly equivalent. The formula !(x) = sup ft(f) t:c-<JI.
and the analogous formula relative to f/' show that the functions J are the same for the two cones f/ and f/'. It then follows from Theorem 45 that every positive measure A admits the same balayages relative to f/ and f/'. The Hahn-Banach theorem shows finally that the two cones are identical. T48 THEOREM (Bauer) Suppose that the convex cone f/ is closed under the operation A, contains the positive constants, and separates the points of X. (a) Every function fE f/ which attains values < attains its minimum at a point of the boundary. (b) A compact set A is a Shilov set if and only if it contains the boundary.t
°
• Mokobodzki has proved that this extends to U.S.c. functions. t The closure of the boundary 0.9'X is called the Shi/ov boundary of X (with respect to 9').
XI, 49
Convex Cones and Extremal Elements
242
Proof We limit ourselves here to reasoning in the case where [/ contains all of the constants; the general case can then be deduced by the procedure described in No. 49. If [/ contains all of the constants the restriction on fbecomes unnecessary in statement (a). We call a face of X any nonempty compact subset F of X which has the following property: For every x E F, every balayage of Ex is carried by F. (For example, X is a face.) The following assertions can then be established by reasoning identical to that of Theorem 11. Every face contains a minimal face. Let F be a face, and f an element of [/; the collection of points of F where f attains its minimum on F is a face. Every minimal face is a point of the boundary. These three assertions clearly imply (a). To establish (b), we note first that the boundary is a Shilov set from (a); the same thus holds true for any compact set which contains the boundary. Conversely, let A be a compact Shilov set, and let x be a point not in A; there exists at least one linear functional on
One can then reduce to the case of a compact space and of a cone of continuous functions which contains all of the constants, by means of the following construction, which is very similar to the construction used in No. X.16. Denote by 2 = X U {o} the Alexandrov compactification of X (the point at infinity 0 is isolated if X is compact). Identify
+ A') = A + [f A (f' + (A.' -
A»],
and this last function is equal, from (49.1), to the upper envelope of a family, which is filtering to the right, of elements of [/; it thus belongs to [/ from Dini's lemma (applied on 2). Let A be a bounded positive measure on X, and let # be a balayage of A (relative to [/); we have #(1) < A(1) from (49.1). The following measure is then a balayage of A on 2, relative to !?: p, = # + (A(1) - #(1»Ea. Conversely, it is easy to show that every balayage of A relative to !? is of this form. The relation then follows easily.
243
XI, 50
Balayage Defined by a Convex Cone of Functions
Here again are several remarks whose verification is almost trivial. Let A be a Shilov set in X relative to f/; the set A u {a} is then a Shilov set relative to !/. Conversely, let B be a compact Shilov set relative to !/; the boundary point then belongs to B from T48, and B",,{a} is a Shilov set for f/. It then follows that Theorem 48 remains true under the hypotheses of 1:his number: the Shilov sets relative to f/ which are closed in X are the closed sets which contain the boundary f/'X.
a
a
Example Take for X the open unit ball of Rn (n > 2), and for f/ the convex cone consisting of the positive superharmonic functions continuous in the unit ball and zero at the boundary (in other words, vanishing at infinity in X). Every element of f/ is of the form Uv, where v is a positive measure on X and U denotes the Green's kernel. f/ contains all functions of the form Uf, where f is positive, continuous, and has compact support in X; it is well known that f/ satisfies the hypotheses of No. 49. Let A and fl be two bounded positive measures on X; the relation A -< fl implies 50
(AU,f)
=
(A,Uf)
> (fl,Uf) =
(50.1)
(flU,f)
for every functionfE ~}(X), and consequently AU
> flU.
(50.2)
Conversely, this last relation implies A -< fl, since for every function g = Uv E f/ [v E 1+(X)] we have (A,UV)
=
(AU,V)
> (flU,v) = (fl,Uv).
In particular, let fl be the balayage of A (in the classical sense) on an open set w of X; it is known that AU > flU on X, and AU = flU on w. We thus have A -< fl, so that fl is a balayage of Ain the sense of this chapter. This justifies, in part, the use of this terminology. Since the functions of f/ are positive, we have Ba; -< 0 for every x E X, and the boundary is empty. Every compact set A c X is thus a compact Shilov set. The function is hence defined for every function f E f/; this function is well known in classical potential theory, where it is called the reduite off on A.
lA
lA
Theorems on the existence of dispersions We are now going to extend Theorem 36 of the preceding section (on the existence of dilations) to the balayage defined by a convex cone f/. We begin by establishing a very general result, which implies a large number of existence theorems for dispersions; it is due to V. Strassen (117). Here first are several points of vocabulary. Denote by E a Banach space (with norm written 11 • ID and by S the collection of sublinear functions on E; consider a mapping q: w ~ qw from a measurable space (n,§) into S. We shall say that q is bounded if there exists a positive constant K such that Iqw(x) I < K IIx 11 for every x E E and every wE n*; we shall say that q is weakly measurable if the real-valued function w JV\I'+ qw(x) is measurable for every x E E. • The reader can easily verify the relation Iqw(x) - qw(y) I ~ K the functions qw on E.
Ilx - yll,
which implies the continuity of
XI, T51
244
Convex Cones and Extremal Elements
T51 THEOREM (Strassen) Let E be a separable Banach space, and (n,~ a measurable space with a complete bounded positive measure A. Let p: w .A./II'+ p w be a bounded, weakly measurable mapping from n into the set S. Denote by s the sublinear function
s(x)
=
L
PcO<x) dA(W)
(x
E
E).
(51.1)
Let x' be an element of the dual E' of E. The following properties are equivalent: (a) x' is dominated by s [(x',x) < s(x)for every x E E]; (b) There exists a bounded, weakly measurable mapping W.A./II'+ x~ from n into E' such that x' is dominated by p w for A-almost all w, and such that (x',x) =
fn (x~,x) dA.(w)
for every
x
E
E.
(51.2)
Proof It is clear that (b) implies (a). To establish the reverse implication, we first recall several facts from measure theory. A function f defined on n, with values in E, is said be to measurable if it is the uniform limit of measurable elementary functions. A measurable function f is said to be integrable if the quantity
IIfl11
=
fnl'f(w) 1 dA( w)
is finite. One denotes by 2}; the vector space of integrable functions with values in E, by L}; the quotient space of 2}; by the subspace of a.s. zero functions. It can be shown that L};, with the norm 11 • 111 defined above, is a Banach space. Since the measure A is bounded, the constant mappings from n into E are integrable. We can clearly assume, without loss of generality, that A. is a probability law. Denote by () the mapping which associates with each x E E the equivalence class of the constant function equal to x; () is an isomorphism of E onto a subspace of L};, which we identify with E in what follows. Let f be a real-valued integrable function, and x an element of E. We denote by fx the integrable function W.A./II'+ f(w)x with values -in E; it can be shown without difficulty that the vector space generated by the classes of these functions is dense in L1. Let G be a continuous linear functional, of norm K, on the normed space 21 (or, by passing to the quotient, on L1). It can be shown * that there exists a weakly measurable mapping W.A./ll'+ g'(w), with values in the ball of radius K in the Banach space E' such that for every integrable function fwith values in E we have G(f)
=
L
(g'(w),j(w» dJ.(w).
(51.3)
Now let us return to the proof of Theorem 51. Let f be an integrable function with values in E; since the real-valued function W.A./ll'+ Pw(f(w)) is obviously integrable, we can extend the sublinear function s from X to L1 by setting s(f) =
fn PCJlf(w)) dA.(w)
(fE L1:).
• Here is a quick proof of this result. For every x E E, consider the linear functional/ .A./II'+ G(/x) on the space £1, with norm at most equal to K Ilxll. The dual of V being LOO, there exists a unique element hz of LOO, with norm at most equal to K Ilxll, such that G(Jx) = E[fh z ] for every function/ E V; it is c1earthat hz +" = hz + h" and h t% = th z • Let p be a linear and isometric lifting of Loo into.flJ oo (see VII.TU); denote by H z the function p E.flJ oo , and by g' the linear functional x .A./II'+ Hz ( w) on E. The desired properties can be easily verified.
("f
Balayage Defined by a Convex Cone of Functions
245
XI, T52
The linear functional x' on E, dominated by s on E, can be extended from T2 to a linear functional ~' on L};, which is dominated by s. This linear functional can be written
~'(f) = fn (x~,f(w)
dA(W)
(f E L1J),
(51.4)
where W.J\.f\t+ x~ denotes a bounded, weakly measurable mapping from n into E'. Formula (51.2) says that ~' extends x'; the theorem will hence be established if we show that x~ is dominated by pw for almost every w. Let (Xn)nEN be a dense sequence in E; it clearly suffices to verify that a.s. (x~,xn) < Pw(xn) for every nE N. To see this, we write the relation ~'(IAXn) < s(IAxn) for every A E:F as
fA
(X~,Xn) dA(W)
Pw(Xn) dA(W),
and apply remark IL9(a). Here is a consequence of Theorem 51. Let X and Y be two compact metrizable spaces, and let A be a positive measure on X. Denote by Kthe set consisting of the positive measures on Y of mass at most equal to 1; K is compact and metrizable in the weak topology. Let X.A.f\l+ M z be a mapping from X into the set of nonempty compact convex subsets of K. Suppose that the set of pairs (x,O) E X x K such that 0 E M z is closed. For every function f E
It can easily be verified that the function
I
is bounded and upper semicontinuous on X;
we can thus set
p;.(f) = (A,l), and we obtain the following theorem.
Let fl be a positive measure on Y. The following two properties are equivalent: (a) fl(f) < pif) for every function f E
THEOREM
i
Proof It is clear that (b) implies (a). To establish the reverse implication apply the preceding theorem, taking for (Q,:F) the space X with the a-field of A-measurable sets, for E the space
=
Ix
(t.,j) dA(X)
(52.1)
for every functionfE
(trx,f)
< pi!)
for every function f E
(52.2)
Let (!n)nEN be a dense sequence in
XI, T53, 54
Convex Cones and Extremal Elements
246
Remark It frequently happens that Ell E M Il for every x E X. Then the family (T:Jxex defined by TIl if TIl belongs to M Il T'= { Il Ell otherwise
is also a Borel family, and T; E M Il for every x EX. It can, in fact, be shown that one can always find a Borel family equal a.e. to (TIl)IlEX and having this property. Here is the result which directly generalizes Theorem 36. Let X be a compact metrizable space, [/ a convex cone of continuous functions on X, containing the positive constants and closed under the operation A. We denote by -< the partial ordering associated with [/, and we call an [/-dilation any kernel T on X (with the Borel a-field) such that Ell -< EIlT for every x E X. We have the following statement. T53 THEOREM Let A and fl be two positive measures on X. The following properties are equivalent: (a) A -< fl,· (b) There exists an [/-dilation T on X such that AT = fl. Proof We apply the preceding theorem, taking Y = X and denoting by M Il the set of balayages of Ell; conditions 52(a) and 53(a) are equivalent from T45.
The theory of maximal measures 54 Let us return to the situation of No. 48: X is compact (not necessarily metrizable), [/ is a convex cone of continuous functions which is closed under A, contains the positive constants, and separates points. The partial ordering associated with [/ is then an ordering [since [/ - [/ is dense in ~(X) from the Stone-Weierstrass theorem]. It is easy to develop a theory of maximal measures that generalizes that of the preceding section. We point out a few results. Every positive measure admits a maximal balayage. A positive measure A is maximal if and only if A(f) = A(j) for every functionf E ~(X) (a consequence of Nos. 4 and T45). Suppose that X is metrizable. The boundary is then the intersection of a sequence of open sets. The measure A is maximal if and only if it is carried by the boundary. * Theorems 31 and 32, relative to the nonmetrizable case, also extend without difficulty. However, a criterion for the uniqueness of maximal balayages in the general case is not known. This extension of the theory of maximal measures seems as a matter of fact to present rather little of interest for analysis; only the theory of the boundary associated with a vector space of continuous functions, developed by Bishop and DeLeeuw (8) and Bauer (3), has received important applications. But one can show [see Section 6 of ChoquetMeyer (44)] that this theory reduces directly to that of convex cones with compact base. We refer the reader to the article cited for more details.
* The proof of this theorem is slightly different from that of the corresponding theorem T24. One notes first (from T46, for example) that the mapping f.1l/ll+! is continuous under the topology of uniform convergence. Then let (fn)nEN be a dense sequence in ~(X); A will be maximal if and only if A(fn) = M!n) for every n, hence if A is carried by the sets {x: fn(x) = !n(X)} whose intersection is equal to the boundary.
APPENDIX
Chapters I and IT The results of these two chapters should be considered classical. Those given without proof figure in works on measure theory, e.g., Dunford and Schwartz (67), Halmos (72), Loeve (89), and Bourbaki (18) for that which concerns Radon measures. Bourbaki is alone in establishing Theorems 11.35 and 11.36, which are indispensable for potential theory. The notion of the Baire a-field has been popularized by Halmos (72), from which come Theorems 29 and 30. The ecart that defines convergence in probability in No. 22 was communicated by P. Cartier. The theorems on uniform integrability are due to Vitali (1907) with the exception of T22 (La Vallee Poussin, 1915) and of the Dunford-Pettis criterion [established in Dunford (65), and generalized in Dunford-Pettis (66)]. The treatment of conditional independence follows Loeve (89). Chapter ITI The theorems on compact pavings have an obvious topological interpretation [Theorem 4, due to Alexander, is used by Kelley to prove Tykhonov's theorem; see Kelley (82) pp. 139-143]. The notion of a semicompact paving does not admit an analogous interpretation, but has no particular interest either. The presentation of the theory of analytic sets is new. The theorem on separating Suslin sets is adapted from Kuratowski (86). The theorems on uniformization of Borel or analytic sets would be out of our subject. They are found in Kuratowski (86), Bourbaki (14), and especially Sion (109). In a general manner, Sion's articles are recommended for readers who desire to go deeper into the theory of analytic sets. Article (10) of Blackwell shows the natural character of the theory of analytic sets in probability theory: they are seen to appear whenever one wishes to eliminate certain pathological characteristics of general probability spaces. The "topological" form of Choquet's theorem (as opposed to the "abstract" form) is given in Choquet (25)-a work the richness of which we are far from exhausting. It is also the object of two excellent treatments in Brelot (21) and (22), where one will find as well its main applications to potential theory. Articles (28)-(31) of Choquet treat related theorems: Definitions of capacities, applications, theorems of capacitability under weaker 247
Appendix
248
hypotheses. Strassen indicates in (116) certain connections between information theory and that of capacities. The theory of regular measures is essentially due to Alexandrov. It has also been the object of publications by Marczewski and by Ryll-Nardzewski in Fundamenta Mathematica (1953). This theory recently aroused a new mterest. It was first discovered that regular measures on quasicompact (i.e. non-Hausdorff compact) spaces are a natural tool for harmonic analysis on non-Abe1ian groups. On the other hand, we had communication of unpublished papers by L. Schwartz and N. Bourbaki showing that locally bounded regular measures on the Borel field of a Hausdorff space share practically all the "nice" properties of Radon measures. Probabilists are well acquainted with regular measures on Polish spaces, because of a famous paper by Prokhorov. * Chapter IV
The proof of Theorem 9, familiar to readers of Bourbaki, has been popularized in the United States by a paper by Nelson (Ann. Math., 69, 1959). Example 11 is taken from Doob, as is everything having to do with separability, with numbers of upcrossings and downcrossings, etc. [see the classical presentation of the subject in Doob (56)]. The proof of Theorem 22 is very close to the reasoning of Bourbaki concerning "regulated" functions (Functions of Real Variables, Chap. 2, 1, Theorem 3). The appendix to paragraph 2 comes from Meyer (94), and certain points there have been since developed by Feldman (69); there is unfortunately no known application for these theorems. It should be pointed out also that Chung has defined an interesting notion of right-separability for processes, stronger than ordinary separability. See (47). Stopping times have been used, without formal definition, since the beginning of the theory of processes. The idea appeared entirely clearly for the first time with Doob in 1936. Doob has employed them systematically in martingale theory, together with increasing families of a-fields. These ideas have passed from there to the theory of Markov processes, where they play an essential role. Most of the results about measurable processes and stopping times are of a very elementary nature. Many of them will be found in Dynkin (68) where they appear as lemmas in the work on Markov processes. The systematic study of these ideas, undertaken independently by Chung and Doob, and the author, has led to the discovery of a certain number of new and useful results. The work of Chung and Doob (47) goes a great deal farther than ours. On this subject of stopping times, see also the papers by Courrege and Priouret cited in "Additions to the Bibliography." Theorem 53 on hitting times is adapted from a work of Blackwell and Freedman in the probability seminar at Berkeley. It will be noted that the capacitability theorem is not used in its full strength: it is only necessary to know that an analytic set is measurable for every measure [Saks (107), p. 50]. Chapter V
With the exception of several results pointed out in the text, all the theorems of this chapter are due to Doob [(56) or (62)]; we owe some of them, unpublished, to a personal communication. • "Convergence of random processes and limit theorems in probability theory," Theor. Prob. Appl., 1 (1956).
249
Appendix
The general convergence theorems (arbitrary filtering index set) have been studied by Helms (73), Chow (45) and (46), Krickeberg (84), and Krickeberg and Pauc (85). A detailed bibliography will be found at the end of the last article. We have cited here only a result of Helms. Recent study is based on the introduction of "Vitali conditions" implying a kind of almost sure convergence. The theory is unfortunately far from having the simplicity of the convergence theorems which we present here. Chapter VI Again this is entirely due to Doob, with the exception of Theorems 15 and 16, published here for the first time [but 15 is part of the folklore of martingales, and 16 is very close to a result of Ray (106) on Markov processes]. The class (D) was invented by Doob in (56), for certain martingales. Theorems 19-20 are taken from Meyer (96). Applications of martingale theory to potential theory will be found in Doob (57)-(60). Chapter VII The theorem on the existence of the Doob decomposition was first worked out for martingales for the form (/0 Xt), where (Xt ) denotes a Markov process, and / an excessive function. The desired increasing process is then a Markov additive functional. In this form, after a first fundamental study of Volkonskii (120), the problem was solved indev pendently by Sur (118) and Meyer (95); in the particular case where (Xt ) is Brownian motion, other solutions are due to Ventzel, * and McKean and Tanaka.t The uniqueness theorem (in a slightly different form, better adapted to the theory of Markov processes) is established in Meyer (95). The extension to martingales is made in Meyer (96) and (97), to the theory of energy in Meyer (100). We have known of the work of S. Watanabe on Markov processes, in which several results on energy are found and used. Finally, we point out that a more general problem than that of the Doob decomposition (the decomposition of a process into a sum of a martingale and a process whose paths are of bounded variation) has been studied, and in part resolved, by Fisk (71). Fisk's statement is unfortunately not simple. K. Ito and S. Watanabe have introduced an extremely interesting decomposition of a positive supermartingale into a product of a martingale and a natural decreasing process. This may be found in their paper cited in "Additions to the Bibliography." Chapter VIII Some bibliographic comments on Section 1 are found in the text. The applications to the general theory of processes constitute a new draft of Meyer (99), freed of a certain number of mistakes and obscurities. Section 3 is a new draft of the end of Meyer (97). Courrege (48), (49) can be consulted for the applications to stochastic integrals. Chapters IX and X The results of Section 1 are classical. The origin of the results of Section 2 is explained in the text. Theorem 27 (or 70) is probably new. The Appendix to Section 2 makes clear • Dokl. Akad. Nauk. SSSR, 137 (1961), 17-20. t Mem. Coil. Sci. Univ. Kyoto 33 (1960), 479-506.
Appendix
250
the connections between potential theory and Snell's paper (115); the relations between the theory of elementary kernels and the theory of discrete martingales can in fact be pursued farther, so as to give the Doob decomposition in the discrete case (VII, Section 1). The results of Section 3 are due to Hunt in the case of a sub-Markov semigroup; we use systematically resolvents instead of semigroups. Theorem 68 is due to Hunt in the subMarkov case, and to Deny in the general case. The origin of the theorems in Chapter X is explained in the text. The idea of passing from the sub-Markov case to the Markov case in No. 16, by the adjoining of a point at infinity (an idea which will be found again later) is ~orrowed from Doob (1957). The results of these two chapters can be completed on several points. We do not study here the connections between the diverse "principles" of potential theory: in particular, we accord little attention to the aspect of measures and balayage, and we leave aside everything having to do with energy. The reader interested in these questions should consult the article (42) of Choquet and Deny (potential theory on a finite set), and the volumes of the Seminar Brelot-Choquet-Deny. The results of Chapter X, on the other hand, are extended to convolution kernels in Deny (54) and (50). Chapter XI We have profited, in writing this chapter, from a seminar on the Choquet theorem directed by R. Phelps at the University of Washington. The form of the Hahn-Banach theorem given in No. 2 is evidently the "good" one; it is due to Banach himself. We owe Nos. 8(e) and 9(e) to a personal communication from Choquet. The sources of Section 2 are indicated in the text. Herve (74) gives another very simple proof of Choquet's existence theorem in the metrizable case. The first simplified proof of the uniqueness theorem (in the form: uniqueness of the maximal measure representing a point, for a cone which is a lattice) was given by Meyer, but has never been published, since it was immediately improved very considerably by Choquet. In this form, it figures in Choquet and Meyer (44). The connections between the theory of Loomis and that of Choquet (including the existence of dilations) have been studied by Fell and Meyer, but their articles have not been published because of the discovery by Cartier of the identity of the orders of Loomis and Choquet [see Meyer (101)]. The theory of Section 3 can be extended to cones of l.s.c. functions. This extension has been done by Edwards in a paper to appear soon. Mokobodzki (104) knows a nice way of reducing the l.s.c. case to the continuous one, and has some very interesting results which generalize to these cones his theorem on convex functions (T7).
BIBLIOGRAPHY
1. Alexandrov, A. D. "Additive Set Functions in Abstract Spaces 11/' Mat. Sb., 9 (1941), 563-628. 2. Bauer, H. "Minimalstellen von Funktionen und Extremalpunkte," Arch. Mat., 9 (1958), 389-393. 3. . "Frontiere de Shilov et probleme de Dirichlet," presented at Seminaire de Theorie du Potentiel, directed by M. Brelot, G. Choquet, and J. Deny, Institut Henri Poincare, Paris, 3e annee, 1958-59, 23 pages. 4. . "Minimalstellen von Funktionen und Extremalpunkte 11," Arch. Mat., 11 (1960), 200-205. 5. . Shilovscher Rand und Dirichletsches Problem," Ann. Inst. Fourier, Grenoble, 11 (1961), 89-136. 6. . "Supermartingale und Choquet Rand," Arch. Mat., 12 (1961), 210-223. 7. . "Kennzeichnung kompakter Simp1exe mit abgeschlossener Extremalpunktmenge," Arch. Mat., 14(963), 415-421. 8. Bishop, E., and K. DeLeeuw. "The Representations of Linear Functionals by Measures on Sets of Extreme Points," Ann. Inst. Fourier, Grenoble, 9 (1959), 305-331. 9. Blackwell, D. "Comparison of Experiments," Proceedings of the Second Berkeley Symposium on Math. Stat. and Prob., 1950 (Berkeley and Los Angeles: University of California Press, 1951), 93-102. 10. . "On a Class of Probability Spaces," Proceedings of the Third Berkeley Symposium on Math. Stat. and Prob., 1954-55 (Berkeley and Los Angeles: University of California Press, 1956), Vol. 11, 1-6. 11. Blackwell, D., and L. E. Dubins. "A Converse to the Dominated Convergence Theorem," Illinois J. Math., 7 (1963), 508-514. 12. Bonsall, F. F. "On the Representation of the Points ofa Convex Set," J. London Math. Soc., 38 (1963), 332-334. 13. Bourbaki, N. Elements de Ma thematique, Livre Ill, Topologie Generale, (3rd ed., Paris: Hermann, 1961) (Actualites Scientifiques et Industrielles 1142), Chaps. 1, 2. 14. . Ibid. (2nd ed., Paris: Hermann, 1958) (Actualites Scientifiques et Industrielles 1045), Chap. 9. 15. . Ibid., Chap. 10. 16. . Elements de Mathematique, Livre V, Espaces Vectoriels Topologiques (Paris: Hermann, 1951) (Actualites Scientifiques et Industrielles 1189), Chaps. 1, 2. 251
Bibliography
17. 18. 19. 20. 21. 22. 23. 24. 25. 26. 27. 28.
29.
30.
31.
32.
33. 34. 35. 36. 37.
252
. Ibid. (Paris: Hermann, 1955) (Actualites Scientifiques et Industrielles 1229), Chaps. 3-5. . Elements de Mathematique, Livre VI, Integration (Paris: Hermann, 1952) (Actualites Scientifiques et Industrielles 1175), Chaps. 1-4. . Ibid. (Paris: Hermann, 1956) (Actualites Scientifiques et Industrielles 1244), Chap. 5. . Ibid. (Paris: Hermann, 1959) (Actualites Scientifiques et Industrielles 1281), Chap. 6. Brelot, M. Elements de la theorie classique du potentiel (Paris: Centre de Documentation Universitaire, 1959). . Lectures on Potential Theory (Bombay: Tata Institute of Fundamental Research, 1960). Biihlmann, H. "L2-Martingales and Orthogonal Decomposition," Z. Wahrscheinlichkeitstheorie, 1 (1963), 394-414. Cartier, P., J. M. G. Fell, and P. A. Meyer. "Comparaison des mesures portees par un ensemble convexe compact," Bull. Soc. M. France, 92 (1964), 435-445. Choquet, G. "Theory of Capacities," Ann. Inst. Fourier, Grenoble, 5 (1955), 131-295. . "Ensembles .;('-analytiques et .;('-sousliniens. Cas general et cas metrique," Ann. Inst. Fourier, Grenoble, 9 (1959), 75-81. "Forme abstraite du theoreme de capacitabilite," Ann. Inst. Fourier, Grenoble, 9 (1959), 83-89. . "Sur les fondements de la theorie fine du potentiel," presented at Seminaire de Theorie du Potentiel, directed by M. Brelot, G. Choquet, and J. Deny. Institut Henri Poincare, Paris, lere annee, 1957, 10 pages. . "Diametre transfini et ~omparaison de diverses capacites," presented at Seminaire de Theorie du Potentiel, directed by M. Brelot, G. Choquet, and J. Deny. Institut Henri Poincare, Paris, 3e annee, 1958-59, 7 pages. . "Theoremes de convergence," presented at Seminaire de Theorie du Potentiel, directed by M. Brelot, G. Choquet, and J. Deny. Institut Henri Poincare, Paris, 3e annee, 1958-59, 9 pages. . "Etudes des encombrements et capacites associes a un noyau," presented at Seminaire de Theorie du Potentiel, directed by M. Brelot, G. Choquet, and J. Deny, Institut Henri Poincare, Paris, 3e annee, 1958-59, 10 pages. . "Existence et unicite des representations integrales au moyen des points extremaux dans les cones convexes," presented at Seminaire Bourbaki, Institut Henri Poincare, Paris, Decembre 1956, 15 pages. . "Le theoreme de representation integrale dans les ensembles convexes compacts," Ann. Inst. Fourier, Grenoble, 10 (1960), 333-344. . "Limites projectives d'ensembles convexes et elements extremaux," Compt. Rend., 250 (1960), 2495-2497. . "Representation integrale dans les cones convexes sans base compacte," Compt. Rend., 253 (1961), 1901-1903. . "Ensembles et cones convexes faiblement complets/' Compt. Rend., 254 (1962), 1908-1910 and 2123-2125. . "Mesures coniques maximales sur les cones convexes faiblement complets," presented at Seminaire de Theorie du Potentiel, directed by M. Brelot, G. Choquet, and J. Deny. Institut Henri Poincare, Paris, 6e annee, 1961-62, V01. 2, 15 pages.
253 38. 39. 40. 41. 42. 43. 44.
45. 46. 47. 48.
49. 50. 51.
52.
53.
54. 55.
56. 57.
Bibliography
. "Axiomatique des mesures maximales. Application aux cones convexes faiblement complets," Compt. Rend., 255 (1962), 37-39. . "Etude des mesures coniques. Cones convexes saillants faiblement complets sans generatrices extremales," Compt. Rend., 255 (1962), 445-447. . "Le probleme des moments," presented at Seminaire d'Initiation a l'Analyse. Institut Henri Poincare, Paris, lere annee, 1962, 10 pages. Choquet, G., and J. Deny. "Ensembles semi-reticules et ensembles reticules de fonctions continues," J. Math. Pures Appl., 9 (1957), 179-189. . "Modeles finis en theorie du potentiel," J. d'Anal. Math., 5 (1956-57), 77-135. . "Sur l'equation de convolution fl = fl * a," Compt. Rend., 250 (1960), 799-801 (see also Deny [51]). Choquet, G., and A. P. Meyer. "Existence et unicite des representations integrales dans lesensembles convexes compacts quelconques," Ann. Inst. Fourier, Grenoble, 13 (1963), 139-154. Chow, Y. S. "Martingales in a a-Finite Measure Space, Indexed by Directed Sets," . Trans. Am. Math. Soc., 97 (1960), 254-285. Chow, Y. S. "Convergence Theorems of Martingales," 2. Wahrscheinlichkeitstheorie, 1 (1962-63), 340-346. Chung, K. L., and J. L. Doob, "Fields, Optionality and Measurability," Amer. J. Math., 87 (1965), 397-424. Courrege, P. "Integrales stochastiques et martingales de carre integrable," presented at Seminaire de Theorie du Potentiel, directed by M. Brelot, G. Choquet, and J. Deny. Institut Henri Poincare, Paris, 7e annee, 1962-63, 20 pages. . "Integrales stochastiques associees a une martingale de carre integrable," Compt. Rend., 256 (1963), 867, 870. Deny, J. "Familles fondamentales, noyaux associes," Ann. Inst. Fourier, Grenoble, 3 (1951), 73-101. . "Sur l'equation de convolution fl = fl * a," presented at Seminaire de Theorie du Potential, directed by M. Brelot, G. Choquet, and J. Deny. Institut Henri Poincare, Paris, 4e annee, 1959-60, 11 pages. . "Les noyaux elementaires," presented at Seminaire de Theorie du Potentiel, directed by M. Brelot, G. Choquet, and J. Deny. Institut Henri Poincare, Paris, 4e annee, 1959-60, 12 pages. . "Les principes fondamentaux de la theorie du potentiel; elements de la theorie du potentiel par rapport a un noyau de Hunt," presented at Seminaire de Theorie du Potentiel, directed by M. Brelot, G. Choquet, and J. Deny. Institut Henri Poincare, Paris, 5e annee, 1960-61, 16 pages. . "Noyaux de convolution de Hunt et noyaux associes a une famille fondamentale," Ann. Inst. Fourier, Grenoble, 12 (1962), 643-667. . "Les principes du maximum en theorie du potentiel," presented at Seminaire de Theorie du Potentiel, directed by M. Brelot, G. Choquet, and J. Deny. Institut Henri Poincare, Paris, 6e annee, 1961-62, Vol. 2, 8 pages. Doob, J. L. Stochastic Processes (New York: J. Wiley and Sons; London: Chapman and Hall, 1953). . "Semi-martingales and Subharmonic Functions," Trans. Am. Math. Soc., 77 (1954), 86-121.
Bibliography 58. 59.
60. 61. 62.
63.
64. 65. 66. 67. 68.
69. 70. 71. 72. 73.
254
. "A Probability Approach to the Heat Equation," Trans. Am. Math. Soc., 80 (1955), 216-280. . "Probability Methods Applied to the First Boundary Value Problem," Proceedings of the Third Berkeley Symposium on Math. State and Probability, 1954-55, 2, 49-80 (Los Angeles: University of California Press, Berkeley, 1956). . "Probability Theory and the First Boundary Value Problem," IllinoisJ. Math., 2 (1958), 19-36. . "Discrete Potential Theory and Boundaries," J. of Math. Mech., 8 (1959), 433-458. . "Notes on Martingale Theory," Proceedings ofthe Fourth Berkeley Symposium on Math. State andProbe (Los Angeles: University of California Press, Berkeley, 1961), Vol. 2, 95-103. Doob, J. L., J. L. Snell, and R. E. Williamson. Application of Boundary Theory to Sums ofIndependent Random Variables, "Contributions to Probability and Statistics," 182-197 (Stanford: Stanford Univ. Press, 1960). Dubins, L. E. "Rises and Upcrossings of Nonnegative Martingales," Illinois J. Math., 6 (1962), 226-241. Dunford, N. "A Mean Ergodic Theorem," Duke Math. J., 5 (1939), 635-646. Dunford, N., and B. J. Pettis. "Linear Operations on Summable Functions," Trans. Am. Math. Soc., 47 (1940), 323-392. Dunford, N., and J. T. Schwartz. Linear Operators. Part I: General Theory (New York: Interscience Publishers, 1963). Dynkin, E. B. Fundamentals ofthe Theory ofMarkov Processes (in Russian) (Moscow: Fizmatgiz, 1959). The references in the text are to the English translation: Theory of Markov Processes, translated by D. E. Brown (London: Pergamon Press; Englewood Cliffs, N.J.: Prentice-Hall, 1961). Feldman, J., "On Measurability of Stochastic Processes in Product Space," Pacific J. of M., 12 (1962), 113-120. Feller, W. "A Simple Proof for Renewal Theorems," Comm. Pure Appl. Math., 14 (1961), 285-293. Fisk, D. L. Quasimartingales and Stochastic Integrals (Department of Mathematics Research Monographs, Kent State University, Kent, Ohio, 1964). Halmos, P. R. Measure Theory (Princeton, N.J.: D. Van Nostrand Co., 1950). Helms, L. L. (see also Johnson and Helms) "Mean Convergence of Martingales," Trans. Am. Math. Soc., 87 (1958), 439-446.
74. Herve, M. "Sur les representations integrales a l'aide des points extremaux dans un ensemble compact convexe metrisable," Compt. Rend., 253 (1961), 366-368. 75. Hewitt, E. "Integration by Parts for Stieltjes Integrals," Am. Math. Monthly, 67 (1960), 419-423. 76. Hewitt, E., and L. J. Savage. "Symmetric Measures on Cartesian Products," Trans. Am. Math. Soc., 80 (1955), 470-501. 77. Hille, E., and R. S. Phillips. Functional Analysis and Semigroups (Providence, R.I.: American Mathematical Society colloquium publications, 1957). 78. Hunt, G. A. "Markoff Processes and Potentials I, 11," Illinois J. Math., 1 (1957), 44-93, 316-362. 79. Ionescu Tulcea, C. A. "On the Lifting Property 1," J. Math. Anal. Appl., 3 (1961) 537-546.
255
Bibliography
80. Johnson, G., and L. L. Helms. "Class (D) Supermartingales," Bull. Am. Math. Soc., 69 (1963), 59-62. 81. Kendall, D. G. "Extreme-Point Methods in Stochastic Analysis," Z. Wahrscheinlichkeitstheorie, 1 (1963), 295-300. 82. Kelley, J. L. General Topology (Princeton, N.J.: D. Van Nostrand Co., 1955). 83. Kelley, J. L., I. Namioka et al. Linear Topological Spaces (Princeton, N.J.: D. Van Nostrand Co., 1964). 84. Krickeberg, K. "Convergence of Martingales with a Directed Index Set," Trans. Am. Math. Soc., 83 (1956), 313-337. 85. Krickeberg, K., and C. Pauc. "Martingales et Derivation," Bull. Soc. Math. 91 0963), 455-544. 86. Kuratowski, C. Topologie 1, 3rd ed., Monografje Matematyczne Tom XX (Warzawa: Polskie Towarzystwo Matematyczne, 1952). 87. Lion, G. "Construction du semi-groupe associe a un noyau de Hunt," presented at Seminaire de Theorie du Potentiel, directed by M. Brelot, G. Choquet, and J. Deny. Institut Henri Poincare, Paris, 5e annee 1960-61, 9 pages. 88. . "Theoreme de representation d'un noyau par l'integrale d'un semi-groupe," presented at Seminaire de Theorie du Potentiel, directed by M. Brelot, G. Choquet, and J. Deny, Institut Henri Poincare, Paris, 6e annee, 1961-62, Vo!. 1, 9 pages. 89. Loeve, M. Probability Theory, 2nd ed. (Princeton, N.J.: D. Van Nostrand Co., 1960). 90. Loomis, L. H. An Introduction to Abstract Harmonic Analysis (Princeton, N.J.: D. Van Nostrand Co., 1953). 91. . "Unique Direct Integral Decompositions on Convex Sets," Am. J. Math., 84 (1962), 509-526. 92. Maharam, D. "On a Theorem of Von Neumann," Proc. Am. Math. Soc., 9 (1958), 987-994. 93. . "On Two Theorems of Jessen," Proc. Am. Math. Soc., 9 (1958), 995-999. 94. Meyer, P. A. "Separabilite des processus stochastiques," Compt. Rend., 248 (1959), 3106-3107. 95. . "Fonctionnelles multiplicatives et additives de Markov," Ann. Inst. Fourier, Grenoble, 12 (1962), 125-230. 96. . "A Decomposition Theorem for Supermartingales," Illinois J. Math., 6 (1962), 193-205. 97. . "Decomposition of Supermartingales : The Uniqueness Theorem," Illinois J. Math., 7 (1963),1-17. 98. . "Sur les demonstrations nouvelles du theoreme de Choquet," presented at Seminaire de Theorie du Potentiel, directed by M. Brelot, G. Choquet, and J. Deny. Institut Henri Poincare, Paris 6e annee 1961-62, Vo!. 2, 9 pages. 99. . "Une presentation de la theoriedesensembles sousliniens. Applicationaux processus stochastiques," presented at Seminaire de Theorie du Potentiel, directed by M. Brelot, G. Choquet, and J. Deny. Institut Henri Poincare, Paris, 7e annee, 17 pages. 100. . "Interpretation probabiliste de la notion d'energie," presented at Seminaire de Theorie du Potentiel, directed by M. Brelot, G. Choquet, and J. Deny. Institut Henri Poincare, Paris 7e annee, 1962-63. 101. - - - . "Progres recents dans le theorie des cones convexes a base compacte," presented at Seminaire de Theorie du Potentiel, directed by M. Brelot, G. Choquet, and J. Deny, Institut Henri Poincare, Paris, 8e annee, 1963-64, 10 pages.
Bibliography
256
102. Mokobodzki, G. "Balayzge defini par un cone convexe de fonctions numeriques sur un espace compact," Compt. Rend., 254 (1962), 803-805. 103. . "Quelques proprietes des fonctions numeriques (s.c.i. ou s.c.s.) sur un ensemble convexe compact," presented at Seminaire de Theorie du Potentiel, directed by M. Brelot, G. Choquet, and J. Deny. Institut Henri Poincare, Paris, 6e annee, 1961-62, 3 pages. 104. "Generic unpublished paper." 105. Neveu, J. Bases mathematiques du calcul des probabilites (Paris: Masson et Cie, 1964). 106. Ray, D. "Resolvents, Transition Functions, and Strongly Markovian Processes," Annals Math., 70 (1959), 43-72. 107. Saks, S. Theory of the Integral. 2nd ed. Translated by L. C. Young. (Warszawa: Monografie Matematyczne nO 7, 1937). 108. Sherman, S. "On a Theorem of Hardy, Littlewood, Polya, and Blackwell," Proc. Nat. Acad. Sciences, 37 (1951), 826-831. 109. Sion, M. "On Uniformization of Sets in Topological Spaces," Trans. Am. Math. Soc., 96 (1960), 237-246. 110. . "Topological and Measure Theoretic Properties of Analytic Sets," Proc. Am. Math. Soc., 11 (1960), 769-776. 111. . "On Analytic Sets in Topological Spaces," Trans. Am. Math. Soc., 96 (1960), 341-354. 112. . "Continuous Images of Borel Sets," Proc. Am. Math. Soc., 12 (1961), 385-391. 113. . "On Capacitability and Measurability," Ann. Inst. Fourier., Grenoble, 13 (1963), 83-98. 114. Sion, M., and D. W. Bressler. "The Current Theory of Analytic Sets," Can. J. Math., 16 (1964), 207-230. 115. Snell, J. L. "On a Theorem of Hardy, Littlewood, Polya, and Blackwell," Proc. Nat. A cad. Sciences, 37 (1951), 826-831. 116. Strassen, V. "Messfehler und Information," Z. Wahrscheinlichkeitstheorie, 2 (1964), 273-305. 117. . "The Existence of Probability Measures with Given Marginals," Ann. M. Stat., 36 (1965), 423-439. 118. Sur, M. G. "Continuous Additive Functionals of a Markov Process," Dokl. Akad. Nauk. SSSR, 137 (1961), 800-803 [Soviet Math. 2 (1961), 365-368]. 119. Suslin, M. "Sur une definition des ensembles measurables (B) sans nombres transfinis," Compt. Rend., 164 (1971), 88-91. 120. Volkonskii, V. A. "Additive Functionals of Markov Processes," Tr. Mosk. Mat. Oble. 9 (1960), 143-189. 121. Yosida, K. Lectures on Semi-Group Theory and Its Application to Cauchy's Problem in Partial Differential Equations (Bombay: Tata Institute of Fundamental Research, 1957). Additions to the bibliography The following papers appeared while the book was in press, and they could not be inserted in the bibliography because of the numbering system adopted in this series.
257
Bibliography
1. Volume 15 (1965) of the Ann. Inst. Fourier contains the Proceedings of the Colloquium on Potential Theory (Orsay, June 1964). Among the papers that might interest the reader, because of their relationship with this book, we may cite: Deny, J. "Principe complet du maximum et contractions" (259-274). Fuglede, B. "Le theoreme du minimax et la theorie fine du potentiel" (65-88). Ho, K., and S. Watanabe. "Transformation of Markov Processes by Multiplicative Functionals" (13-30). Neveu, J. "Relations entre la theorie des martingales et la theorie ergodique" (31-42). 2. The following papers contain results on stopping times, which are very useful complements to the theory given in Chapter IV: Courrege, Ph., and P. Priouret. "Temps d'arret d'une fonction aleatoire, theoremes de decomposition," Compt. Rend., 259(1965), 3933-3935. - - - . (Same title, to appear in 1965 in Publ. Inst. Stat. Univers. de Paris.)
INDEX OF NOTATIONS
(Numbers refer to Section in which notation is first used)
IA (indicator function of A), 1.4
T(A) (O'-field generated by A), 1.6 T(h, i El) (O'-field generated by the ft.), 1.7 B(E) (Borel O'-field), 1.10 B o(E) (Baire 0'- field), 1.10 a.s. (almost surely), 11.2 Ea; (unit mass), 11.4 E[f] (expectation), 11.5 LP, LP, 11/ll p , Loo, LOO, 11/1100,11.8 O'(Ll, LOO), 0'(L2, L2), 11.10 P ® Q (product law), 11.14
I:-
dQ(x) (integral oflaws), IU5
(p,,/), p,(f), 11.34 ... Bp (,u-measurable sets), 11.37 _ B u (universally measurable sets), 11.37 - E[X F], E[X ft., i E I] (conditional expectation), 11.46 -P[A F], P[A ft., i E I] (conditional probability), 11.46 -E[X FIG], 11.46 - A(F) (F-analytic sets), 111.7 1* (outer capacity), 111.23 X H , X t _ (right and left limits), IV.25
I I
IV.30 _Fp{Ta stopping time), IV.35 t(n), H(n), IV.43 X H (X a process, H a random variable), IV.48 DB (hitting time), IV.53 « (stronger order), VII.8 TA (T a stopping time, A prior to T), VII.38 ... V Fs" (Sn an increasing sequence of stopping times), VIII.38 ..(Ft)teR , F oo , F H , +
n
e[(Xt )] (energy), VII.57 259
Index of Notations [8, T[, [8, T] (stochastic intervals), VIII.l3 N(x, A) (kernel), IX.1 Nf(extension of N to functions), IX.1 N(x,f), Nfx, IX.5 ftN (ft a measure), IX.6 MN (composed kernels), IX.8 I (identity kernel), IX.13 Nn,IX.l4 (regularization of a supermedian function), IX.58 b(ft) (barycenter), XI.8 r(ft) (resultant), XI.15 -<, XI.l6 (also XI.41) f (upper envelope), XI.18 OK (extreme points), XI.23 osX (boundary), XI.42
1
260
INDEX
.~~,
A""-B, 2
A /::; B, 2 +Abelain group, 151 additive function, 41 additivity, countable, 12 additivity theorems, martingales and, 78-79 )< A(~)-analytic set, 34 affine function, 29, 222-223 affine mapping, 226 Alexander, J. W., 247 Alexandrov, A. D., 47 n., 248 Alexandrov compactification, 215, 242 analytic set, disjoint, 38; of R, 37 a.s. ("almost surely"), 12 £i(E),8
Baire function, 217 Baire set, 23, 233; compact, 46 Baire a-field, 8, 21, 23, 49, 51, 54, 233, 247 balayage, 1, 227, 229, 242-243; defined by convex cone of functions, 239; theory of, 219 balayee, 186 Banach, S., 250 Banach space, 154, 174,206,208,210,216,222, 243-244,250 barycenter, of measure, 223-224, 236, 238 base space, 52 Bauer, H., 225, 226 n., 241, 246 Bishop, E., 227, 233-234 Blackwell, D., 30, 37-38, 71, 88, 234, 247 Blackwell spaces, 37 Borel, Emile, 247 Borel family, 245-246 Borel function, 108, 147, 214, 231-232, 245
Borel-Lebesgue property, 46 Borel-Lebesgue theorem, 204 Borel mapping, 35, 38 Borel set, 36, 69 Borel a-fields, v, 7-10, 12 n., 13, 21, 23, 25, 35, 54, 176-177, 235, 246, 248; see also a-field Bourbaki, Nicholai (pseud.), 2, 13, 21, 24, 30, 32 n., 33 n., 35 n., 37, 39, 59 n., 175, 203 n., 221-223, 235 n., 247-248 Brelot, M., v, 39,247,250 Brownian motion, 103, 147, 149 canonical process, 53-55 capacities, 39-47; application to measure theory, 44-45; construction of, 40; right-continuous, 45 Caratheodory theorem, 22, 48-49 Cartier, P., 229, 234, 247, 250 Cauchy criterion, 86 Cauchysequence, 86,222
Choquet, G., v, 32, 34 n., 40, 175, 220, 223-224, 227,232-235,241,246-247,250 Choquet-Deny theorem, 151-153 Choquet ~-capacity, 39,42,44,46, 163 Choquettheorem, 32,35,40,71,219,225, 227238,250 Chow, Y. S., 84, 249 Chung, K. L., 68 n., 73, 248 compactification, 35, 215, 242 compact convex set, 219-225; extreme points of, 223-224 compact interval, 60 compactness criterion, 20 compact pavings, 32-33 261
262 compact set, 46-47, 55; metrizable, 225 )l£.rJ~Eber1in-Shmuliantheorem, 20 compact space, 25 energy, computation of, 141; monotone concomplete maximum principle, 1, 184 conditional expectation, 27, 120; fundamental vergence and, 142-146; results on, 140-148 properties of, 28-30; generalized, 28 equilibrium potential, 183 equivalences, 52-53, 58 conditional probability, 28 events, 7 conditioning, 27-31 excessive functions, 179-180, 182, 187-190, 195conical measure, 226 197; reduite of, 182-184 conjugate exponents, 88 excessive measures, 184-186 continuous increasing processes, 123-128 convergence, ~8e'8'- tMOt~<>,-Mti, 87-88; existence theorem, 119 Lebesgue's theorem of, 144 (see also Lebes- expectation, conditional, 27; mathematical, 12gue's theorem); monotone, 45; of random 31 extreme points, 229 variables, 14-15 convergence theorem, applications of, 149-156 convex cones, with compact base, 226; theory !F, 22: 65, 68 faces, family of, 225 of, 1, 219, 239 convexity theorem, martingales and, 78-79 family, countable, 33; of faces, 225; left- and right-continuous, 70-71, 139; of a-fields, 65, convex sets, compact, 219-225 72,85, 104, 110, 114, 129, 132, 152, 186,225; convolution kernel, 179 subdivision of, 230 countable set, 55-56, 62 !F-analytic sets, 34-36, 40 Courrege, P., 113 n., 248 Fatou's lemma, 13, 18, 84, 88, 96, 153, 169, 191 Fell, J. M. G., vi, 229, 234, 250 Daniell extension theorem, 21, 24, 44-45 decomposition, Reisz, see Reisz decomposition Feller, W., 151 Feller semigroup, 213-214, 216 DeLeeuw, K., 227, 233-234 filtering set, 231 denseness property, 46 Deny,J., v, 151, 171, 179, 184,198,205,241,250 finite intersection property, 32 first canonical process, 53 Denzel, J. M. G., vi diffusion-kernel, 216 Fisk, D. L., 249 dilations, existence of, 234-235 !F-measurable elements, 47 Dini's lemma, 204, 209, 221, 242 !F-measurable sets, 7 Dirichlet integral, 146 !F-Suslin set, 35 n. discontinuity, times of, 129 Fubini's theorem, 15, 109, 114 function, additive and subtractive, 44; affine, dispersion (dispersion-kernel), 175-176, 214 29, 222-223; bounded, 238; excessive, 179domination principle or theorem, 13, 184, 197180, 182, 187-190, 195-197; lower semi198,202-205 continuous (l.s.c.), 25, 51, 60 n., 176, 203, 205, Doob, J. L., 1, 10-11, 30, 55-56, 63, 68 n., 77, 209, 223; pseudo-reduite of, 200; reduite of, 84,87,90, 149, 151, 179-180,248-249 Doob decomposition, existence of, 122-123; 182-184; step (elementary random variable), 9,20; sublinear, 219, 230; supermedian, 187uniqueness of, 111-113 190; unbounded, 238; upper semicontinuous Doob's inequality, 82 Doob's theorem, 87-88 (u.s.c), ~04-~5, 222-223, ').27, 232-233 Co."...~~ h1.D·n,-o.t'OlA...<:..,. Dubins, L. E., 30, 79, 83-84, 88 Dunford, N., 14-15,20,23 n., 30,210,220,222, Gabriel, P., vi, 60 n. Getoor, R., vi 247 Dunford-Pettis compactness criterion, 20 l-t J(/, 11, 16-17 Dynkin, E. B., 11 Hahn-Banach theorem, 2 n., 14, 211, 239, 241, 250 Halmos, P. R., 247
n..
Index
263 Hausdorff topological space, 33, 45-46, 51, 59, 70,248 Helms, L. L., 86, 96 n., 101, 102 n., 103, 249 Herve, M., 250 Hewitt, E., 114, 149-150 Hille, E., 210 Hille-Yosida theorem, 210-211, 213, 217 Hirsch, W. M., vi hitting times, measurability of, 71 HOlder's inequality, 88 Hunt, G. A., 1, 71, 82, 202, 208, 250 Hunt's theorem, 214 hyperplane, 222-224
w
Ii,--
kernel function, 178 Kolmogorov's inequality, 82 Krein-Milman theorem, 225-226, 235, 237 Krickeberg, K., 249 Laplace transform, 213, 216, 218, 237 lattice, 3, 236 law, induced, 150 LCC (locally compact) spaces, 8-9, 37, 68, 70, 160, 177, 179 Lebesgue measure, 8, 13, 55 Lebesgue's theorem, 13, 18-19, 21, 45, 89, 100, 106, 108, 111, 113, 115, 123, 126, 144, 174, 181, 214, 217 Levy, Paul, 85 lifting theorem, 154-156 Lion, G., v locally compact space 8-9; see also LCC spaces, Loeve, M., 150 n., 210, 217 Loomis, L. H., 21, 229, 231, 234, 250 l.s.c. (lower semicontinuous) function, 25, 51, 60 n., 176, 203, 205, 209, 223 Lusin spaces, 37
image probability laws, 15, 53 increasing process, 105-111; continuous and discontinuous parts of, 107; integrable, 111113; integration with respect to, 109-111; "natural," 111-113; stochastic process and, 108-109 independence, 26 index set, 48 indicator function, 28 induction hypothesis, 43 inequalities, fundamental, in martingale theory, It 1, 22 1(E),3 . 80-84 ~nf (f, g)~ ~ mapping, 57, 59, 74, 222, 235, 245; affine, 226; mtegrabIlIty, 19-20 i 1_ Borel 35' coordinate 62' identity 7 23' . 'D "".. t,~\ ..u.... et ~l'.'U ..\ \,.,2 . '.' " '" Hlt~8U1t'9" vOW~r ~ r mclusIon 23' kernel as 173' measurable 35 . . h ' . . c . f 12 31 ,Uf' " " , , ~ntegratlOn t ~o~y, resume 0, 62; notation for, 3; positive linear, 176-177; mternally neglIgIble sets, 22-24 universally measurable 178 intersection, nonempty, 37 " Maraham D. 154 ' intersection property, finite, 32 I" Markov additive functionals 124 invariant function, 181 Markov process, 30, 61, 77, 108, 248-249; ~onescu-T~lcea theorem, 154 theory of, 127 Isometry, lInear, 155 Markov semigroup 1 210 215 Ito, K., 123,249 martingale, 1; ac~ssible ~topping times and, Jensen's inequality, 29, 79, 85-86, 163 133-134: applications of theory, 149-164; Johnson, G., 96, 101, 102 n., 103 continuous parameter, 93-103; countable case, 84c-90; defined, 77; examples of, 78; Kelley, J. L., 247 fundamental inequalities in, 80-84; generkernel(s), 1, 173-186; composition of, 175; alized, 77; monotone convergence and, 146continuous, 176, 179, 203; convolution, 178; 148; quasi-left-continuous, 164c-165, 169; defined, 173; diffusion, 203; dispersion, 175right-continuous, 94c-l00, 110, 115, 124, 135, 163-164; square-integrable, 163-165, 167; 176,214; domination principle and, 202-205; elementary, 180; on locally compact space, uniformly integrable, 84c-86, 90-92, 101-103; 175; Markov, 217, 234-235; Newtonian, 103, see also submartingale; supermartingale 140, 178; positive, 199; potential theory of, martingale theory~ v, 77-164, 186-187, 249; 179-187; semigroups of, 187; sub-Markov, applications of, i49-164 173, 179, 183, 186-187, 197, 215; uniform, mathematical eXPectations, and probability laws, 12-31 206,208
"1 "
Index
264
maximal measures, 228-229, 246 paving of, 158-163 measurable processes, 34-74 Pettis, B. J., 20, 241, 247 measurable space, 7, 65 p-excessive function, 187 measure, barycenter of, 223-224, 236, 238; Phelps, R., 250 construction of, 21-24; excessive, 184-186; p-invariant function, 187 maximal, 228-229, 246-247; as reduite of P-negligible set, 22 function, 186; regular, 248; application ,of polar set, 1 capacities to, 44-45; complements to, 32-51; Polish space, 35, 38 convergence theorems and, 153-154 ~polyhedron, faces of, 225 n. metrizable space, 23,50,52,55,61-62; compact, potential, bounded, 123, 128; class D, 123; 59 defined, 180; equilibrium, 183; generation of, 106; Newtonian, 147, 178, 195, 239; regular, Meyer, P. A., 124, 126 n., 127 n., 156 n., 163 n., 229,233-234,246,248-250 123; right-continuous, 117 minorant, 181 potential theory, v, 1; analogies with martingale theory, 186; analytic tools, of 173-246; modification, separable, 56 "general," 239; "minimum principle" of, 99; Mokobodzki, G., v, 33 n., 221, 227, 234, 241, 1\1'\.(" \ " 250 Co y)'\ bPI') L..O. . Newtonian, 239 ,'M:~·l'"fm'Mtotoneconv~rgen~, 146-148;energyand, probability, conditional, 28; defined, 12; as 142-146; see also Lebesgue's theorem random variable, 27 >t p" 24 probability law, 54; integration of, 15; and mathematical expectations, 12-31; measures p,-measurable sets, 25 and, 21; projective system of, 54 natural increasing process, 136, 141; accessible probability law P, 21 stopping times and, 135-139; difference of, probability space, 13, 22, 27, 30, 52 probability theory, 1, 7-74,202 126 processes, construction of 53-54; equivalent Neveu, J., 32, 211 52; general properties of 52-55; increasing, Newtonian kernel, 103, 140, 178 see increasing processes; martingale as, 104; Newtonian outer capacity, 39, 42 martingale theory and, 156-164; measurable, Newtonian potential, 147, 178, 195, 239 see measurable processes; progressively measnonempty intersection..,37 urable, 68 n.; separable, 55-62; a-fields and, nonmetrizable spaces, 8 65-66; state of, 52 non-negative random variable, notation, explanation of, 2-3, 7, 11, 128-129 product law, 26 product paving, 32-33 product a-field, 9 a,7 product vector space, 226 .,.:lprojective limit, 48 open interval, 58 projective system, 48 open sets, 46-47 pseudo-reduite, of function, 200 4 "sal sampling thjorem, 90-92 Pyke, R., vi orthogonal variabl114 oscillatory discontmuity, 59-62, 68, 100,217 Q,22 outer capacity, 39,42 q-excessive regularization, 217 q-supermedian function, 216 P, 12, 21 path, 52-53, 104, 107; regularity properties of, 93-98; right-continuous, 109; oscillatory R, analytic sets of, 37; real line, 8, 10 Radon law, 51 discontinuities of, 59-62 Radon measure, v, 3, 21, 24-25, 32, 175-177, Pauc, C., 249 179, 185, 203 n., 214, 233, 247-248; compaved set, 32, 34 pletion of, 25; excessive, 184 paving, 32; compact, 32-33; product, 32-33; Radon-Nikodym theorem, 14, 20, 153-154 semicompact, 32, 35, 47, 49, 62, 64
265
<
Index
random variable, 13, 52, 58, 65, 67, 69, 74, 77, 105, 109, 138, 168; convergence of, 14; definition of, 7; elementary, 9; independent, 150; integrable, 29-30; measurable, 30; nonnegative, 77; real-valued, 9-11, 26; symmetric~ 149-.151; uniformly integrable, 16-21, 86, 116-117 Ray, D., 216, 249 Ray resolvents, 216-218 real line, 8, 10 real-valued function, 9, 22 reduite, 182-184, 243 regularity, hypothesis of, 62 regularization, q-excessive, 217 regular measures, 47-51; compact paving and, 50; defined, 47; projective limits of, 48; theory of, 248 resolvent, 188-189; closed, 199; closed Markov, 206; proper, 198; Ray, 216-218; strongly continuous, 211; sub-Markov, 189, 199, 202, 206, 208, 211 resolvent identities, 192 r-excessive function, 195 Riesz decomposition, 89, 97, 181, 185, 195 right-continuous capacities, 45 right-continuous functions, 99-100 right-continuous martingale, 94-100, 110, 115, 124, 135, 163-164 r-potential, 195 r-supermedian function, 190
}(~ Saks, S., 248 Savage, L. J., 149-150 Schwartz, J. T., 14-15, 20, 23 n., 30, 210, 220, 222, 248 Schwartz's inequality, 126 semicompact paving, 32-34, 47, 49, 64 semicontinuous function, 25 semigroup, construction of, 210-218; Feller, 213-214; of kernels, 187; Markov, 210, 215; strongly continuous, 210; sub-Markov, 210, 212; see also l.s.c.; u.s.c. separability, 55, 59, 62-64 separable processes, 55-62; defined, 55-56; example of, 56; universal, 56-59 separating sets, 56-59 separation theorem, 36 n. set-function, 12, 40 sets, see under adjective as in compact set, index set, etc.
Sherman, S., 234 Shilov boundary, 241 n. Shilov set, 240-243 a-algebra, 7 a-compact space, 184, 215 a-fields, 7-9, 20, 24, 28, 30, 32, 35, 39, 48, 50, 53, 66,68,78,139,149,155; enlargement of, 21; examples of, 8; family of, 65, 85 (see also family); generated by collection of functions, 8; independent, 26; product, 9 singleton, 38 Snell, J. L., 151, 250 space, metrizable, 23, 50,52,55,61-62; compact, 59 &.tate space, 52 stochastic boundary function, 89 stochastic Dirichlet solution, 89-90, 97 stochastic intervals, 156 stochastic processes, 21, 34 n., 52-74, 158, 186; defined, 52 stopping times, 165; accessible, 130-140; classification of, 128-140; examples of, 6768; inaccessible, 139 ; Gi} I' 1 sampling theorem and, 99; properties of, 66-67 ; superposition of chains of, 74; systems and chains of, 72-74 Stone-Weierstrass theorem, 2, 23, 216, 221,246 Strassen, V., 243-244, 248 strict positivity hypothesis, 203 subdivision, defined, 230 sublinear function, 219-221, 230 sub-Markov kernel, 173, 179, 183, 186-187, 197, 215 subadditive function, 44 subadditivity, 43 submartingale, 77, 79, 81; nonnegative, 88 sup (j,g), 3 supermartingale, 77-78, 81, 89; continuous, 96, 103; discrete case, 104-105; existence theorem and, 114-128; generalized decomposition of, 123 ;' generation of, 104-108 ; nonnegative, 90; positive, 83; right-continuous, 93-95, 98, 105-106; strong order properties of, 106-107; uniform integrability properties of, 114-122; uniqueness theorem and, 113 supermedian function, 187-190; regularization of, 194-195 v Sur, M. G., 124-125, 249 Suslin sets, 247 symbols, see notation, explanation of symmetric difference, 2
X.
Index
266
thinness, 1 universally measurable sets, 25 time, change of, 108 upcrossings and downcrossings, 60, 82 time set, 52 lL u.s.c. (upper semicontinuous) function, 2, 204topological space, 23; Hausdorff, 33,45-46,51 205,222-223,227,232-233 59,70,248 it- "Y, 11 topology, 2 vector spaces, topological, 2 transitivity, of image laws, 15 t-supermedian function, 195 Watanabe, S., vi, 123,249 well-measurable sets, 156-158 ultrafilter, 119 Williamson, R. E., 151 uniform integrability properties, existence theorem, and, 114-128 Yosida, K., 210-211 uniqueness theorem, 113, 229-232 Zorn's lemma, 10-11, 51, 220, 228 universal completion of (0, :F), 23
ABCDEFGHIJ 7069876