This content was uploaded by our users and we assume good faith they have the permission to share this book. If you own the copyright to this book and it is wrongfully on our website, we offer a simple DMCA procedure to remove your content from our site. Start by pressing the button below!
(fi-I)2) < (/3 _ 1)-2.1-2E(B(T A S2 A T) - B(T A S1 A T))2. Now if R1 <- R2, are both bounded stopping times, we have 2). The function that shows that this inequality is sharp is simple. Let fe(z) = p(z)l = ((1 + z)l(l - z))fi. Since P E H" for p e (0, 1), > < A-22o) (l K)2) C
E(B(R2) - B(R1))2 = E(B(R2)2 - B(R1)2) = E(R2 - R1), so the above 1)-2A-2 E((T A S2 A T) -(T A S1 A T)) < ( _ 1)-22-2(S2)2P(S1 < oo) 1)-262P(B* > A),
proving the first inequality. To prove the other inequality Let S1 = inf{t : (T A t)112 > .1}. Let S2 = inf{t : (T A t) 112 >
Let T = inf{t : IB(T A t)I > 6.1}.
Again, it is easy to check that P(T 1/2 > #A, B* < 62) < P(T A S2 A
T - T A S1 A T> $2)2 - 22)
<($2_1)-11-2E(TAS2A T-TAS1 A T), and using the stopping time result mentioned in the first part of the proof, it follows that the last expression above _ (/22 _ 1)-11-2E(B(T A S2 A T)2 - B(T A S1 A T)2)
<
($2
-
1)-11-2(S2)2P(S1 < oo)
_ ($2 _.1)-162P(T1n > 2), proving the second inequality.
Remark: The inequalities above are called "good 2" inequalities, although the reason for the name is obscured by our formulation (which is from Burkholder (1973)). The name "good .1" comes from the fact that early versions of this and similar inequalities (see Theorems 3.1 and 4.1 in Burkholder and Gundy (1970), or page 148 in Burkholder, Gundy, and Silverstein (1971)) were formulated as P(f > A) < C$,KP(g > A) for all A that satisfy P(g > A) < KP(g > $2), $, K > 1. The next result shows why we are interested in good A inequalities. First, we need a definition. A function cp is said to be moderately increasing if /P is a nondecreasing function with 9(0) = 0 and if there is a constant K such that
p(21)
(i) T(x)=x",0
MA
6
Hardy Spaces and Related Spaces of Martingales
Proof for (ii)
Since log+ xy S log' x + Iog+ y,
2x + 2x log+ 2x
x+xlog+x
<2+
2 log 2
1+log+x,
The bound now follows by considering two cases, x < 1 and x >_ 1. (2)
If qp is a moderately increasing function, then there are constants c, C e (0, 00) (that depend only on the growth rate K) such that cEgo(T'12) < Eq(B*) < CEgp(T112).
Remark: We will only use the result for 9(x) = xp, but it is nice to know that the only property of xp we need for the proof is that (p(22) < Kq(2) for all
2>0. Proof To prove the result, it suffices to show: (3)
If X, Y > 0 satisfy
P(X>22, Y
Etp(X) < CE9(Y). Proof It is enough to prove the result for bounded gyp, for if the result holds for cp A n for all n > 1, it also holds for gyp. T is the distribution function, a measure on [0, oo) that has
T(h) = foodtp(2) = J
1(h>A)d(p(2). 0
If Z is a nonnegative random variable, taking expectations and using Fubini's theorem gives E4p(Z) = E f
l(Z>A) dgp(A) =
JP(Z> A) d4 (2).
0
From our assumption it follows that
P(X > 22) = P(X > 22, Y:5 52) + P(Y > b2) < 62P(X > A) + P(Y > 62), or integrating dq(2), E4 (2-1X) =
f
P(2-1X < 2) dq (2)
0
< 62E9 (X) + Egp(b-1Y).
Pick 62K < 1 and then pick N > 0 so that 2' > b-1. From the growth condition and the monotonicity of gyp, it follows that
6A Equivalence of Ho too Subspace of .No
115
E9(6-'Y)!5 KNEcO(Y)
Combining this with the last inequality and using the growth condition gives E(p(X) < KEcp(2-1X) < K62Etp(X) + KN+1Eco(f
Solving for E.p(X) now gives KN+1
Ecp(X) <
1-
K62Ecp(Y),
proving (3) and hence (2).
Applying (2) to the case cp(x) = xP and recalling the results of Section 2.11, we get the following inequality that will be useful in studying the spaces Al P.
(4)
There are constants c, C E (0, oo) (that depend only on p) so that for all
0
cE<X)Pl2 < E(X*)P < CE<X>?l2.
As the reader might expect, the constants in (4) are not very good. Ifp = 2, then EX.,, = E<X>, so it follows from (1) of Section 6.2 that E<X%, = EXX < EIX* 12 < 4EXW = 4E<X),,,,,
but in the proof above, K = 4, so if we take S = 1/3 and N = 2, then
C=
KN+1
1 -K62
= 115.2.
Remark: There are other ways of proving (4) directly; see Getoor and Sharpe (1972) for an interesting proof using stochastic integration.
6.4 Equivalence of H" to a Subspace of V'
l
Let B, be a complex Brownian motion starting at 0 and let T = inf{t : I B,I = 1}. In this section, we will show that the mapping f - Ref(B), t < T, maps Ho = { f E HP : f (O) = 0} one-to-one into Al,! = {X: X is a local martingale on [0, T) and X,* = sup,,, I X, I E LP}, and that furthermore, the HP norm off is equivalent to the Al,! norm of its image, that is, if we let u = Ref, U, = u(B,), t < T, and U* = sup«TIUI, then: (1)
There are constants c, C E (0, oo) (that depend only on p) such that for all fe HoP,
cElU*IP
15(t
6
(2)
There is a constant Cc (0, oo) (that depends only on p) such that for all feHP,
Hardy Spaces and Related Spaces of Martingale@
dd(f) < EIF*IP < Cdd(f) Remark: It is curious that the best constant we can obtain from the proof given below is C = e (independent of p).
Proof If r <'l and T, = inf{t : IBI = r}, then IF(T,)I < F*, so we have I
ER I f(re`B) I
P dir(O) = EI F(T,) I P < E(F*)P.
Taking the supremum over r < 1 gives dd(f) < E(F*)P. To prove the other inequality takes some work. We start with the trivial case, p > 1. Since f is analytic in D and bounded on D(0, r) = {z : Izl < r}, it follows from results in Chapter 2 that f(B(t A T,)) is a complex martingale and, hence, that I f(B(t A r,))I is a submartingale. Noting that I f(B(-r,))I e LP, p > 1, and applying Doob's inequality, we get EIF*(T,)IP <(i-f7i) Ej
f(B(T,))IP
Letting r T 1 and using the monotone convergence theorem gives P
EIF*IP <- (i-:P::--,) dP(f)-
proving (1) in the case p > 1 with a constant slightly larger than the one advertised in the remark, since x dx log (x
1)s =
dx (x log x - x log (x - 1))
= log x - log(x - 1) - (x - 1)-1
y-1 - (x - 1)-1 dy < 0.
f,;x 1
To prove the result when 0 < p < 1, we will have to be more devious. First consider 1/2 < p < 1 and let us now suppose f(0) = 1. Since f(B,j is a time change of a complex Brownian motion starting at 1, P(f(B1) = 0 for some 0 < t < T) = 0, and we can define a pathwise square root GG by requiring G
to be continuous and to have Go = 1, and GG = F for all 0 <- t < T. As we mentioned in Section 5.3, (3)
G, I < T, is a local martingale
and furthermore (4)
If r < 1, G(t A T,) is a martingale. (See the remarks after the proofs of (3) and (5) in Section 5.3.) With (4) proved, the rest is easy. Since
6.4
157
Equlvilsoce of He to o Subcpoce of .+Y"
EIG(r,)I2p =
zn
f
lf(re`B)11dm(0) < dp(f)
0
and 2p > 1, it follows from Doob's inequality that 12p2p
1
)2"EIGI2P
Taking a supremum over r and applying the monotone convergence theorem gives
EI F*I P = EIG*(r)I2p <
12p
2p2p- 1
I
dp(f)
This proves the result when f(O) = 1. If f(O) # 0, applying the result above to
g(x) = f(x)/f(0) proves the inequality for f. To extend the result to f with f(0) = 0, consider g,,(x) = f(x) + c and let a -. 0. The argument above can be extended top > 1/n by taking nth roots, with the result that p
EIFF*Ip < np pT1dp(f) Letting n - oo, the last inequality gives El F* I p < edp(f ),
proving (2) with C = e. The result in (2) shows that the Hp norm of f and the .,#,! norm of f(B,), t < r, are equivalent. As we mentioned above, we want to go one step further and consider Hp as a space of real local martingales by using the mapping f - Re f(B,), t < T. Since every harmonic function u has a conjugate harmonic function u defined by the requirements that 4(0) = 0 and f = u + iu is analytic in D, the mapping is one-to-one on He = { fe Hp : f(0) = 0}. The next result shows that if we restrict our attention to this subspace, then the Hp norm off and the .A'' norms of Ref(B,), t < r, are equivalent (i.e., (1) holds). (5)
There is a constant C e (0, oo) (that depends only on p) such that for all f e Ho, EIU*Ip < EIF*Ip < CEIU*IP.
Proof Since l a l < (a2 + b2)1/2 for all a, b, we have I u (z) l < I f (z) I for all z, proving the inequality on the left. To prove the other inequality, we observe that if we let U = u(B,), t < r, then it follows from results in Sections 2.11 and 6.4 that E(U*)p < CEpl2 pl2 < CE(U*)p (here, as before and ever shall be, the value of C is unimportant and will change
1 ui
6
Hardy Spaces and Related Spaces of Martingales
from line to line). It is trivial that E(F*)P 5 2P(E(U*)P + E(U*)P).
Combining this with the other three estimates proves (5).
Given the results above and in Chapter 4, it is now easy to prove the "maximal function characterization of Hr" due to Burkholder, Gundy, and Silverstein (1971). Let S,,(0) be the convex hull of the disk {lzl < a} and the point e`° and let (Nau)(0) = sup{lu(z)I : zeS,,(0)} be the nontangential maximal function. A simple generalization of the argument given at the end of Section 4.5 shows that
SSJ
Po (Osup, u(B.) > .11 < Cl {O: Nau(0) > All,
where C depends only on a, and integrating gives zn
Eo(UU*)P < C
l NQul P(0) d7r(0) 0
(a result that holds for any measurable u). To prove the other inequality, we use (2) from Section 4.3 (which generalizes easily from H to D). This result implies that there is an E > 0 such that if z e S8(0), then
Po(B,, 0 < t < r, makes a loop around z) > E, so it follows from the maximum principle that Pa(U* > Nau(0)/2) >_ E,
and hence that E0(U*)P > E f lNaull(0)dn(0). Combining this with the other inequality and (1) above gives the maximal function characterization of HP cdd(f) <- fo z l NaulP(0)dn(0) < Cdd(f)
6.5 Boundary Limits and Representation of Functions in H" Since HP c N, it follows from (2) in Section 5.3 that (1)
If fe HP, then the nontangential limit off exists at a.e. point of 8D.
In this section, we will investigate the relationship between f (defined in D) and its nontangential limit (defined on 8D), which we will also denote by f.
6.5
159
Boundary Limits and Representation of Function. la HP
There are two reasons why we want to do this: (a) The mean convergence of H" functions to their boundary values (3) and the consequent Poisson integral representation (5) are simple consequences of the equivalence established in Section 6.4, and (b) for developments below, it is nice to know that we can consider a function f e H" as a function in LP(8D). Our first topic in this section is LP convergence to boundary values. As
usual, we start with the probabilistic result and then deduce its analytical counterpart. (2)
Let f c- HP and let F = f(B), t < T. If T. is any sequence of times TT, then as n -- oo, f(BL) a.s. and in LP. Proof The a.s. convergence is a consequence of (2) in Section 5.2. (2) of Section
6.4 shows that F* E LP. Since IF(T) - f(Bz) I < (2FT*)P and F(T) -f(B) a.s., the LP convergence follows from the dominated convergence theorem. (3)
Iffe HP, then as rT 1, I
I .f (re `B) - .f (e `e) I P do (0)
0.
Proof It suffices to show that the result holds for any sequence r T 1. If we let T. = inf {t: I B, I > and apply (2), it follows that -f(B) in LP, or, BT), if we let u denote the distribution of
1-f(y)IPdvn(x,y)- 0. This conclusion is similar to the one desired, but it is different enough to make it painful to obtain one from the other. In the face of this hard difficulty, we will take a soft solution. We recall the following result from real analysis: (4)
If X E LP and X. --* X a.e., then X,, -+ X in LP if and only if EI
P -- EI XI P.
We leave the proof as an exercise for the reader (see Chung (1974), page 97), and then we are done, because
F(T) -f(B.)
in
LP
implies J J
implies
f
I.f(rne's) - f(eie)I Pdxt(0)
0.
The last result is valid for p > 0. When p > 1, we can use the LP convergence to express f in terms of its boundary values.
1M)
6
(5)
Let k9(z) be the probability density (w.r.t. n) of exiting D at e`9 starting from z.
Hardy Spacer and Related Spaces of Martingales
IffeHP f(z) =
p > 1, then d7r(0).
Proof From (2) of Section 6.4, it follows that EIF,*IP < oc, and since p > 1, F, t < t, is uniformly integrable. If v is a stopping time, the optional stopping theorem implies that .f(Ba) = E(.f(BJ13o),
and it follows from the strong Markov property that E(f(BT)1 3o) = Jko(B0)f(eio) dn(0).
The last two results imply that the equality in (5) holds for a.e. z e D. The left-hand side, f(z), is clearly a continuous function of z. In Section 3.3, we showed that the right-hand side is harmonic in D, and hence continuous, so the equality holds for all z e D.
The equality in (5) does not hold when p < 1. The half-plane mapping p(z) = (1 + z)/(1 - z) has boundary limit Example 1
p(e`B) = i sin 0/(1 - cos 0),
so there are two problems :
(i) Rep (z) > 0 in D, but Jko(z)Rep(eio)dir(O) = 0
(ii) Isin 01/(1 - cos 0) - 2/101 as 0 - 0, so Jke(z)IImP(eio)Idir(O) = oo.
The resolution of the first problem is obvious. Rep = ko, so Rep (z) = Jko(z)d50(O).
where So is a point mass at 0. This example suggests that we might generalize
the representation in (5) to allow measures on 8D that are not absolutely continuous w.r.t. it. The next result shows that this generalization does not enlarge by very much the class of functions that can be represented. (6)
The following three classes of functions are the same :
(i) the set of u that can be written as
6.5
Boundary LImlts and Representation of Function In H'
161
U(Z) = fko(z)d!L(o),
where p is a signed measure with finite variation (ii) the set of u that can be written as a difference of two positive harmonic
functions (iii) h' = the set of harmonic functions u with sup flu(reio)IdO < oo. r<1
Proof (i) (ii) (iii) is trivial. Since the proof of (iii) (i) is very similar to the proofs given in Sections 3.3 and 3.5, it is left as an exercise for the reader. Remark: For developments in Section 6.8, it is useful to know that (6) implies that any ueh1 can be written as u1 - u2, where u1, u2 > 0 and 1lull1 = llu1 11 + 1
ll u2ll 1. To prove this, we observe that if u = Jko(z)dP(O). then llull 1 = the variation p = sup,, µ(A) - µ(A`).
The last result completes our consideration of boundary limits of functions in HP. Since we will deal with p >- 1 for most of the rest of this chapter, the important results to remember are (1) and (5). They show that a function feHP, p > 0 has a nontangential limit at a.e. point of 8D and that if p > 1, the values off in D can be recovered from the boundary limits, so we can think off in HP as being a function in LP(8D, it) (and we will do this when we prove the duality theorem). A similar viewpoint is possible for local martingales Xc- ,`f?, p >_ 1. Since this will simplify things below, we will take a few minutes now and spell out the details. X,* e LP, so standard arguments imply that if we let
_
X,
t
t} X, lim
t
IT
then Y is a martingale, Ye #P (the space defined in Section 6.2), and Y can be reconstructed from its limiting value by Y, = E(Y.l JFt).
The last representation is the probabilistic analogue of the Poisson integral representation. When X, = f(B,), t < r, and feHP, p > 1, the relationship is closer than an analogy, since results above imply that Y, = f(BT) (the righthand side being the nontangential limit off evaluated at BT).
Notes: Result (3) about the mean convergence of f(re`B) to f(e`B) was first proved by F. Riesz (1923) (see, for example, Duren (1970), pages 20-22). The key to Riesz's proof was the following factorization theorem :
162
(7)
6
Hardy Specs and Related Specs of Mertlnoalse
Every function f in H° can be written asf(z) = b(z)g(z), where Ib(z)I < 1 and g e H" is a function that does not vanish in D. Once (7) was established, (1) followed from known results. The function b has boundary limits, since it is bounded, and since g is never 0, we can pick n > 1/p and consider g'1" to reduce the result to the easy case where p > 1. The reader should note that taking nth roots to reduce to the trivial case p > 1 was also the key to the proof of (3) in Section 6.2, but in that proof, we
used the fact that Brownian motion did not hit zero to construct a pathwise nth root, so we did not have to factor out the zeros.
6.6 Martingale Transforms Martingale transforms are a natural generalization of the following: Example 1 Let f = u + iv be an analytic function with f(0) = 0, and let B, be a complex Brownian motion starting at 0. Ito's formula implies that u(B,) = f Vu(BS) dB. 0
v(B,) = JVv(Bs).dBs, 0
and the Cauchy-Riemann equations say that Vv =
(_0 0)Vu,
so if we let
H,=Vu(B,) A= (-0
11 0
'
then we can write
u(B) =
JJJ.dB , 0
v(B) = JAI-I.dB3. 0
The last equation obviously makes sense if B is a d-dimensional Brownian motion, H is a locally bounded (R'-valued) predictable process, and A is any d x d matrix. To define the transform of a general local martingale X in this setting, we now recall that since we have assumed that our filtration is generated by a Brownian motion, it follows from results in Section 2.14 that
Xt=Xo+ f 0
Mmrtingde TnMformw
6.6
163
We can therefore define the transform of X by A as
(A*X)t= (whenever this makes sense, e.g., if H is locally bounded). In this section, we will study properties of martingale transforms as mappings between the V' spaces. The results we prove here are analogues of classical results about conjugate functions that we will prove by probabilistic methods in Section 6.7. The first and most basic result is: (1)
If p > 0, X -. A * X is a bounded linear transformation from #P to M". Proof If Xt = 0
then
(A*X)t= f 0
so
IAH s12ds fo'
< C J, I HHIZ ds 0
= C<X>t, where
C= sup{IAYI2:IyI = 1}, so the desired conclusion follows from the equivalence of norms demonstrated in Section 6.3. When p > 1, the norms on . #P and it" are equivalent, so we have : (2)
If p > 1, X A * X is a bounded linear transformation from IC" to IC". The next example shows that this is false when p = 1. Example 2
Let Br be a two-dimensional Brownian motion starting at 0, let
i=inf{t:B' = -1}, and let Xt=BTnL= If we let A = (1
lot
.dBs.
\0 0 I
(Or, if you want, \l
0
then
I I 1
164
6
Hardy Spacer and Related Spacer of Martlnpal"
dB., (A*Xj-JO^T
(, ). 1
Now X, = B;A T > -1 and it is trivial that if a >- -1, l a l < 2 + a, so
EIX,I <2+EX,=2 and Xe.(1. On the other hand, it follows from results in Section 1.9 that (A * X). = B, has a Cauchy distribution
P((A*X) X)= x1
1
n1+x2'
°°
so E(A * X)W = oo, and Fatou's lemma implies that lim inf E(A * X )t+ > oo. t- 00
In the last example,
IIA*XII,r =supEIA *X1, = oo, I
but it doesn't miss by much
2 fo dx2,.,ny-1
I >y)=n
1
as y -- oo. The next result, first proved by Burkholder (1966), shows that this is the worst possible behavior for XE.%r1. The statement and proof given below are from Burkholder (1979a). (3)
If
Proof Doob's inequality implies that AP(X* > A) < supEJX,l = IIXII,, f
so it suffices to estimate the probability of {Y* > A, X* < Al. To do this, we
observe that by stopping at T = inf {t: <X>, > n}, we can suppose that E<X>0 < cc, that is, Xe.#2. Let i = inf {t :1X1 > Al. It is trivial that if we let Y,* = sup fl l Y 1 : s :!g t}, then
AP(Y* > A, X* < A) < ,1P(Y* > A).
Applying Doob's inequality to the submartingale Y' T(Ye 1f2, since
22P((Y*)2 > A2) < EY, . Now X, Ye.,&2, so
EY2=E
6.6
MYrtlnade'rromformn
165
X
Y
Figure 6.1
To finish up, we observe that IX,1 < 2 and IX,I is uniformly integrable, so
EX, <2EIXzl Combining the inequalities above shows that AZP(Y* > A, X* < 2) < AIIXILI
and proves the desired result. A simple example shows that 2 is the best possible constant in (3). Example 3 Let B, be a one-dimensional Brownian motion, and make the following definitions (for a picture, see Figure 6.1):
T,=inf{t:IB,I=l} Tz =inf{t> T, :IB,-B(Ti)I = l}
T3=inf{t>TZ:B,=0 orIB,-B(T2)I=2}
B(T1)=for-1 B(T2) = 2, 0, or - 2 B(T3) = 4, 0, or -4
166
6
Hardy Spac.u and Related Spaces of Martingales
W (s, w)
O(+', (0)
1
1
1
-1
1
1
0
0
S
r
X, =
cps dB,
Y=
0
Os dB.,. 0
Since (p2
we have <X> _
X
B 1,2,4 1,2,0 1,0,2
1,0,-2
1,2,4 1,2,0 1,0,0 1,0,0
Y 1,0,2
1,0, -2 1,2,2 1,2,2
Y* 2 2 2 2
with a similar table for a = - 1. From the last computation it follows that supElX,l = El XXl = 1 t
and
P(Y*>2)=1, so forA=2, 2P(Y* > A) = 2supEpXc1,
showing that (3) is sharp.
(1), (2), and (3) are the main results on martingale transforms, but, of course, it is also possible to consider how X -p A * X behaves on other spaces. We will mention only three results and leave the proofs as exercises. Exercise I If Xe.,#°° = the bounded martingales, then (A * X) may be unbounded. In Chapter 7, we will see that it does not miss by much. (A * X) C- M_#6 and, furthermore, X-+ (A * X) is a bounded map from M.,#0 to M.,#0. Exercise 2 If X e 21 = the uniformly integrable martingales, then (A * X)
need not be in A-'. (Hint: Recall our discussion of A' log A'.) Exercise 3
If X e X log .7f', then (A * X) e .,lf 1. Is (A * X) e .*-log . ''?
6.7 Janson's Characterization of Jj 1 Having seen an example of an X e .K 1 and a matrix A such that A * X 0 .JY 1, it is natural (if somewhat precocious) to ask which X e AA" have A * X e A 1
6.7
167
Jumon'. Chnncterizetlon of .N'
for all matrices A. If we let J1 denote this collection, then (1) in Scclion 6.6 implies that J1 - .,#' and, as you might guess from the title of this rcclion, J 1 = .111 In fact, more is true : There is a finite set of matrices A 1 , ... , A. such that if A, * X E .7Y 1 for i = 1,
..., m, then X E .,6f 1 (and hence A
To discover which sets of matrices have this property, we start by making the trivial observation that A...... A. cannot have a common eigenvector in Rd, for if we have A;y = ) y for i = 1, ... , m, then we can let r = inf {t: y B, = -1 } and X = y B, A . X is Example 2 of Section 6.6, so X E .tY 1 - _#', but
fori=1, ...,m, (A;*X),=
eAL
0
Janson's (1977) theorem says that this trivial necessary condition is sufficient. (1)
Let A,, ... , Am be matrices that do not have a common eigenvector in Rd. Let A° be the identity matrix. If the transforms A; * X, i = 0, ... , m, are all in
*-', then XE.,#1.
The key to the proof is Janson's generalization of the subharmonicity lemma of Chao and Taibleson (1973), which is in turn a generalization of ideas of
Stein and Weiss (1971), who attribute the idea to Calderon.... Using the notation in (1), we can state this result as: (2)
There is a p0 < 1 (that depends only on the matrices A1, ... , Am) such that if F = (1 + "_0 (X02)"2, then Ff is a local submartingale for all p > p0. Once we prove (2), (1) follows immediately. To see this, observe that if we can pick p < 1 so that G, = FP is a local submartingale, then E(Gt 1P) = E RI +
(Xi) 2
1/2
< E 1 + : IXt ]
(since the L2 norm of (1,X°, ... , Xm)ERm+2 is less than its L1 norm). It follows from our assumption that sup, EG, "P < co, and, since 1/p > 1, it follows from Doob's inequality that
oo > E(sup G,)'IP = E(sup F) > E(sup IXI ), so X E .4f 1. If we keep track of the constants, we get
E(sup I X I) < E(sup G,)' '
<
- (1/Ifl- 1
sup EG, /P ,
<-(1 -p) 1(1 + E
IIAj*XII,).
i=o
To get rid of the 1 on the right-hand side and replace p by po, we apply the
168
6
Hardy Spaces and Related Spaces of Martingales
last result to X/e, multiply both sides of the inequality by e, and let a - 0, p -po to get E(supjXI) :!5; (1 -Po)-1 Y
(3)
IjAj*XILr.
j=o
t
Proof of (2) We start by observing that the result is trivial for p = 2 (F = ) and forp > 2 (since a convex function of a local submartingale 1 + o (Xi)2m is a local submartingale), so we assume p < 2. Let g(x) = (1 + 1x12)2. A little differentiation gives Dig= P(l + Ix12)cp-2'22x; 2
DO = 2.p 2 2(1 +
1x12)cp-a'24xtxj
i:Aj
D«g = .P 2 2(1 + 1 x12)cv-a'24x? + 2(1 +I x12)(p-2'22. 2 Applying Ito's formula, we conclude that
FP-FOP =
f tpXsFs-ZdXs i=o ,Jo
+1
2 i=o ;=o
22 f tPP-
1)4XsXgFs
a
d<X`,X')s
0
+ i JPFr2d<xi>. 1
m
2 i=o
o
The first term on the right-hand side is a local martingale, so to prove the result we need to show that we can pick p < 1 so that the second term plus the third term is 0. If X= f Hsd Bs, then
<X= JAHJ2ds o
and
<X`, X'>' = J(AiJ1AJ1)ds. o
To complete our proof, we need to show: (4)
22 - 1) Yi; 4xtx;(At4P, A;(P) + (1 + 1x12) > 1Aj4,12 > 0 1
(P
for all x e R" 1 and cp e Rd. To do this, we observe that if B,; is the angle between xjA;cp and x;A;cp, then Y(x;A;cp,x;A;(p) _ Y1x;Aicp1lx;A;cpl cosBtj <-Y(IxjAj(p I)2 <1x12YIA1w12.
6.7
169
JanMon's Characterization of.#'
Looking at the first inequality above, we see that there is equality Oiily if ©;j = 0 for all i, j and I x,l = cl A;'pl for all i. Now if x, 9 # 0, it follows from the last equality that Ixol : 0, and since Aocp = cp, the first equality in the last sentence implies that cp is an eigenvector of all the A;, contradicting our assumption, so we must have Y_x.x,(A;'p, Aj'p)
< IXIZ
Ai9IZ
for all nonzero x c Rm 1 and rp e Rd. The value of the left-hand side is not changed by multiplying x or 'p by a positive constant, and it is continuous on the compact set {(x,'p)ERm+1+d: Ixl = 1, Icpl = 1}, so the supremum of the expression over K is a number S < 1. Hence the expression in (4) is > (5)
(1+Ixl')YIAi'I2-2(2-1)4a(IXIzYIAiTl2). When p = 1, the coefficient of the second term is a < 1, so we can pick po < I such that (5) > 0 for all p > po, and this completes the proof.
Readers familiar with Janson's (1977) proof (which is for d-adic martingales) should notice that the outline is the same, but two details are different: (a) Ito's formula replaces the computation Janson does for "small Remark:
jumps" (our jumps have size zero), and then, since we do not have a small/large dichotomy, we need only his first compactness argument. (b) The restriction on the matrices sounds the same, but it is different. In Janson's theorem, the matrices do not have a common eigenvector in Ro = {X: Y; x. = 0}. O) has no eigenvector in R2, it follows from Janson's theorem
Since I 0
I
that we have : (6)
If d = 2 and X and its conjugate martingale k are in . ' 1, then X E If d = 3, then any matrix A has a real eigenvalue and, hence, also an eigenvector
in R" (take the real or imaginary part), so it takes at least two matrices to characterize 1. We leave it to the reader to discover what happens if we take 0
1
0
A, _ (-1
0
0 AZ =
0
0
0
0 0
-1
0
1
0
0
0 0.
If you are very clever, you will discover Riesz transforms. If you get stuck, you can find the connection spelled out in Gundy and Varopoulos (1979).
171)
6
Hardy Space. and Related Spaces of Martingales
6.8 Inequalities for Conjugate Harmonic Functions With each harmonic function u, there is associated a unique conjugate harmonic function it, which has (a) u(0) = 0 and (b) u + iu is analytic in D. In this section,
we will investigate conjugation as an operation on hP, the set of harmonic functions u in D with dp(u) = sup f
<
.
Let Ilullp = (dp(u))"P. If p > 1, this equation defines a norm on hP. Our first result, due to M. Riesz (1927), shows that if p > 1, u -+ u is a bounded linear map from hP to hP. (1)
If p > 1, then there is a constant C (that depends only on p) such that Ilullp < CIIuIIP.
Proof Let U = u(B,), t < T, and let U = u(B,), t < T. If llullP = 1, then Doob's inequality implies that E(U*) < (p/(p - 1))P, and it follows from results in Section 6.4 that Ilullp < El
U*IP < KEI U*IP,
so (1) holds with C = K11P(pl(p - 1)). To be fair, we should observe that (1) is easy to prove analytically-there is a simple argument using Green's theorem that is due to P. Stein (1933) and that gives a much better value for Cp, namely,
Cp=2(Pl(P- 1))lh' 1
2 < p < oo.
Since this proof has some interesting probabilistic aspects, we present it here. If the theorem is true for some 1 < p < oo, then it is also true for the conjugate index q = p/(p - 1) with C. = Cp (exercise for the reader: See Zygmund
(1959), page 255, for the answer), so it suffices to prove the result when 1 < p < 2. In view of the decomposition of functions in hl given in Section 6.5, we can also assume that u(z) > 0 in D. To prove the result, we compute 82
_
OZ7
1
u P(z) W
Pu
= az. (
(Z) azi
=P(P- 1)uP z(Z)(az/z+pup-1(z)8z? Summing and noticing that Au = 0, we get
(a) AuP(z) = p(p -
1)u"-z(z)IVu(z)I2.
Similarly, if we let v = u, then
6.8
Inequalities for Conjugate Harmonic Functions
171
a2 1 flP = a2 u2 + v2)p/2
04
Oz?
_
i
(p22 (u+ v2)(P-z)/z (2u Z + 2llZi
_ .p
2 2. (u2 + v2)(p-4)/2I 2u -Z + 2v
+ P (U2 + vz)(P-z)/z 2 2
(u az,)
z
+ 2u
Z
azZ + 2 az;
(av)z zJ
z \\
+ 2v
Zl. aZ; JJ
Summing and using the relationships Au = 0, Av = 0, Vu Vv = 0, and lVul = I Vv 1, we get
(b) Al./ I P =p(p - 2)(u2 +
v2)(P-4)/z(u2JVul2 + v210u12)
+ p(u2 + v2)(p-2)/2(lVu 12 + 0 + 1Vu12 + 0) =P2 (U 2 + v2)(p-2)/21ou12.
Since p - 2 < 0, it follows from (a) and (b) that we have flp < p
(c) Al
p IAlulp.
This is the key inequality for the proof. From here, completing the proof is easy, and how you do it is a matter of background. A probabilist would reason as follows: Ito's formula says that if h is C2
in D, then fort < r=inf{t:B,0D}, h (B,) - h (Bo) =
JAh(Bs).dBs+iJVh(Bo)ds 0
(here we have written B, as a real two-dimensional Brownian motion). The first term on the right is a martingale for t < r, (since
Eoh(B,) = h(O) + 2E
Ah(Bs)ds. 0
Letting h = up, and then h = If lp, and using (c), gives fo2n f2x
p
I
I
I
o
since 1u(0)l = l f(0)1.
An analyst would use the divergence theorem 0zx
a
JP_(reb0)rdo = f
Acp(z)dA(z),
J IzI
where A' denotes Lebesgue measure(.' This implies that znIf(re`B)lpdO
ar
J
p p -
1
ar J 0
172
6
Hardy Spicea and Related Spaces of Martingales
ivp(Bt)
Figure 6.2
so integrating from 0 to r and recalling that v(0) = 0, we have a second proof of the inequality
J:(re1I0< pp1
f lu(re`Pdo. 0
Since Riesz's theorem is an old and important one, there are many different proofs. Calderon (1950c) has given another proof that you can find in Zygmund (1959), pages 253-255. Pichorides (1972) has given a refinement of Calderon's
proof and has found the best possible constant: CP = tan(n/2p) if 1
f# E H° for p e (0, 1/#). Since p maps D one-to-one onto {z: Re z > 0} = {re'B : r > 0, 0 < n/2}, f,, maps D one-to-one onto the cone IF,, = {re`O : r > 0,
0 < fln/2}. From the last observation (and Levy's theorem), it follows that j,,(Bt), t < 7, is a time change of a Brownian motion C, that starts at 1 and runs until it leaves F''. Drawing a picture (see Figure 6.2) reveals that if up _ Re f,, and vp = Im f,, , then
limIvP(B) I = tanfl imlu,,(BJ)I. tT=
If 1
Itan7E l
IlupiLP,
6.8
173
Inequalities for Conjugate Harmonic Functions
so the optimal constant CP z tan(n/2p) for all 1 < p < oo. Repeating the argument above for ip(z)' shows that CP > cot(ir/2p) for all 1 < p < oU, so Pichorides's constants are the best possible. As the constants in the last remark might suggest, Riesz's theorem is false for p = 1. We have seen the counterexample many times: the half-plane map p(z) = (1 + z)/(1 - z). By computations in Section 6.1, u(z) = Re p(z) Chl, but u(z) = Imp(z) is not, since
n({9:Iu(e'B)I > A}) - 2/n2 as A -+ oc.
The next result, due to Kolmogorov (1925), shows that (up to a constant multiple) this is the worst behavior we can have for ueh1. (2)
There is a constant C such that if ueh1, then n({e: lu(e`a)I >_ 2}) < Ca.-11IuII1.
Remark: In the jargon, this is called a weak type (1, 1) inequality. We will give Davis's (1974) proof, because it has the advantage of identifying the best
constant C = 1.347... . Proof Let ZZ = Xt + iY be a complex Brownian motion, and let a = inf{t : I Y I = 11. Since t - I X, I is a submartingale and the stopping time a depends
only upon the Y component, it is easy to see that if x and - 1 < y < 1 are real numbers, (i) EoIXaI >EIYIXal (ii) Es+i,IXal > EirI X1I.
The next step is to show :
(iii) Let x be a real number and fi a stopping time for Z. Then EXIX API > EOIX.IPX(, ? a).
Proof By (ii) and the triangle inequality, EOIXXI < E.IXal < EXIX..pl + EXI Xa - X.A$I.
By the strong Markov property, Exl X8 -
Xa
-
and we have, using (i),
a> (the centering makes it look like starting at 0, but I Y,,, until a). Combining the results above, we have EOIXQI <ExIXZAPI +EoIX2IPP(«> fi),
proving (iii).
> 0, so there is less time
174
6
Hardy Spaces and Related Spaces of Martlnaalee
We are now ready to prove Kolmogorov's inequality. Let f = u + iu. By Levy's theorem, if we let a(t) = f O I f'(B5)IZ ds fort < T, then Z,(,) = f(B,), t < T,
defines a Brownian motion run for an amount of time y = Q(T). If we write Z, = X, + iY and let T, = inf{t : IB,I > r}, then IX",(tr)I = iu(BtAT,)I is a bounded submartingale, so EI X.Aa(t,)I <_ EI XC(tr)I = EIU(Bt,)I
Ilulll.
As r T 1, T, T T and Q(T,) T Q(T) = y, so using Fatou's lemma gives
(iv) EIXanvl < IIulIi.
We have finally assembled all the ingredients to complete the proof: n({O: Iu(e`a)I > 1}) = P(I YYI >- 1) P(YY 1) = P(a < Y),
so it follows from (iii) and (iv) that the above <- (EopX.l)-'EolX2Arl
< (EofXal)-'Ilulll. Substituting u/), for u, we have n({e: lu(e`B)I z A)):5
(EoPXaI)-'llulil/2.
It is clear from the argument above that the inequality is sharp. If we let g(z) _ (2/7r) log p(z), then g maps D one-to-one onto the strip {z : -1 < Im z < 11, so
1 =n({O:1u(eie)I > 1})=P(a<-y) = (EojXal)-'EoiXal = (EOIXaI)-'llulll.
To compute the constant, observe that by (4) in Section 5.1, the probability density of X, is ex'/2/(I + exn), so
_
zi
(-
8
n=0 °°
z
n=0
000
(-1)n(2n +
1)-2
If you recognize that nz/8 = En '=O (2n + 1)-2, then you can write the constant as (1-3-z+5-z-7-z+...)/(1+3-z+5-z+7-z+...).
Although the inequalities (i)-(iv) are important, the two main ideas in the proof of (2) are the observation that : (a) If we time change f(B), t < T, to obtain a Brownian motion Z, = X, + iY, t < y, then
6.8
175
Inequrlltlai for Conjugate H.rmonlc Functions
n({H: u(e")I >_ 1}) = P(I YYI >_ 1) 5 P(Y* > 1),
and (b) forueh',P(Y*>_ 1)/Ilulllis largest when y=a=inf{t:IY,I= I;. With this philosophy as a guide, the reader should have no problem proving +i onesided version of Kolmogorov's inequality. Exercise l
If u c h' and u > 0, then
({O: u(e`") > }) < 2IIu111
7r(l
+y2),
JA
and this constant is the best possible.
Hint: The function that shows that this inequality is sharp is
f(z) = it + (1 -
z)'
i+
which maps D one-to-one onto {x + iy : x > O, y < 2} and has f(0) = 1. In this case, the reflection principle shows that
n({O:u(e`")>2})=2 f Z
dy + y2). n(1
Applying (2) to u(rz) shows that if 1lull1 = 1, then 7C({O :
I u(re")I > A}) < CA-',
soifp<1 2n 0
u(re`")IPdn(O)= f"Op
< [P2P1dA+
>.l})dA
Jp1 C` d= 1
P,
and we have proved another inequality due to Kolmogorov: (3)
If u e h1 and p < 1, there is a constant C (that depends only on p) such that dd(u) < CIIull f.
The argument above proves (3), but since it is not very interesting and does not give a very good value of the constant, we will give two more proofs of (3). The first is a purely probabilistic one, and because of this, it gives a
crude value for the constant. The second is an analytic translation of the probabilistic argument and gives the best constant for positive u. Proof 2 Again without loss of generality, we can suppose that u > 0 in D and IIuhl1 = 1. In this case, u(0) = 1 and u(0) = 0, so (u(B,), u(B,)), t < 'r, is a time change of a two-dimensional Brownian motion starting from (1,0) and running for an amount of time y. Since u >- 0, y must be smaller than 71, the time it takes a Brownian motion starting at 1 to hit 0, and it follows that
176
n
1lardy Spacra and Related Spacca of M.rtlnialra
dp(u) < /;(U*)p < CE 200< -CETv1Z.
In Chapter 1, we found that the probability density of T1 is (tit) - 3/2 t- 3/2 e- 1/2t ,
l
so ifp < 1, ETf2 =
(2n)-3/2
f aD
t(p- 3)/2 e - 1/21 dt.
0
To evaluate the integral, let u = 1/2t to obtain (2ir)-3/22(1-p)12
r u-(1+p)l2e-"du J0
and observe that the value of the integral is I'((1 - p)/2) if p < 1. The reader should observe that if u(x) = Re p(x), then y = T1. This suggests that for nonnegative functions Ilullp/Ilullp should be largest for the Poisson mapping. The proof above cannot be used to prove this fact, since we have used the clumsy estimates dp(u) < EI U*Ip < CEc/2. This inaccuracy can be avoided if we abandon the correspondence and translate the proof into analytical terms. Proof 3 The idea for this proof is due to Littlewood (1926). Again without loss of generality, we can suppose that u > 0 in D and II U 111 = 1. A function f analytic in D is said to be subordinate to g if there is an analytic function co with Ico(z)I < Izl such thatf(z) = g(w(z)).
One reason we are interested in this concept is that
(i) If u > 0 and u(0) = 1, thenf = u + iu is subordinate to p(z) _ (1 + z)/(1 - z).
Proof Let w(z) = p-1(f(z)); co is analytic and maps D into D and 0 into p-1(1) = 0. Let a(z) = w(z)lz, z 0, and a(z) = w'(0) at z = 0; o is analytic in D and has I u(z) I < 1 on 8D, so it follows from the maximum principle that Io(z)I < 1 in D, that is, Iw(z)I 5 Izl. Another reason for our interest in subordination is (ii) If f is subordinate to g, then f02
I f(re`a)I 'd7r(0) < I
I g(re`a)I pd7r(0)
for all r < 1.
0
Proof Let B, be a complex Brownian motion and 'rr = inf{t : IBI > r}. By hypothesis, EI J (Br)I p = El g(w(B1r))I p
6.8
Inequalities for Conjugate Harmonic Functions
177
If we subject w(B,), t < z to Levy's time change, we get a Browniun mot ion B,' run for an amount of time y,. Since Iw(z)I < Izi, it follows that y, r; inf{t : IB;I > r}. Combining this with the fact that Ig(B',Tr)IPisa submurtingulc gives
EIg((o(B,,))IP=EIg(BY.)IP<EIg(Bt)IP and proves (ii). Combining (i) and (ii) proves (3) with CP = dP(q), where q(z) = Imp (z).
To compute the value of the constant, write p(z) = R(z)e"(z), where R(z) = I p (z) I and cb(z) E (- n/2, 7r/2). Since there is no confusion about which root to take, F(z) = R(z)Pe'P(z) is analytic in D. By the mean-value theorem,
1 = Re F(0) = f
Re F(reie) d7r(O)
,J n n
= f
R(re'B)Pcos(p(D(re`B))d7r(O). n
As r
1, cb(re'B) -+ sgn(9)n/2 and R(re'B) -> Iq(e's)I. Since gehP and Icos(p(D(reie))I < 1, it follows from the dominated convergence theorem that
1= f
cosP2 lim f rT1
,1
Iq(reie)IPdm(0), n
so dd(q) = sec(pn/2).
Remark: If at the beginning of the computation of the constant we take an arbitrary f with u = Ref > 0 and use the inequality I ((re') I < n/2, we get a purely analytical proof with the constant given above. This argument is due to Hardy (1928). Proof 3 shows that (3a)
If u > 0, dd(u) < sec (12) IIuIIl
(3b)
If uEh1, dP(u) < 2 sec 12 I IIuIIl
.
The Poisson kernel shows that (3a) is a sharp result. Since (3b) is obtained from (3a) by using the triangle inequality, we should expect that (3b) is not sharp, and indeed it is not. Burgess Davis (1976) solved the problem of finding
the optimal constant in inequality (3b). He showed that the smallest value for C, is IIull,,, where u = (k1 + k_1)/2. In this case, the corresponding analytic function is g(z)
1 (I +z 1-z =2 1-z+1+z
2z (1-zZ),
178
6
Hardy Spaces and Related Spaces of Mortinplea
which maps D one-to-one onto S = C - {x + iy : x = 0, IyI >- 11. To prove his theorem, Davis uses Levy's theorem to reduce the result to an optimal stopping problem for Brownian motion, which is solved by considering related discrete time problems. Since the argument is rather lengthy and the improvement on (3b) is rather slight, the reader is referred to Davis's paper for the details or
Davis (1979b) for a sketch of the proof. A. Baernstein (1978) has given a purely analytical proof of this result. Up to this point, we have only discussed u e h° for p > 1. For 1 < p < oo, the class hP was preserved under conjugation. For p = 1, this was false, but it
was almost true: uehP for all p < 1. When p < 1, things fall apart-there is an analytic function f = u + iv such that uehP for all p < 1, and yet foN (hence v 0 hP for any p > 0). The example is a randomly chosen function f(Z, (0) =
(4)
n=1
Sn(w)g(ZZn)
where g(z) = z/(1 - z2) and fin, n > 1, are independent random variables with
-1)= 1/2 (for analysts, let S = [0, 1] and n((o) = sgn(cos(2nw)), the nth Rademacher function). I claim that (4a)
For every w, Ref (. , co) a hP for all p < 1 and
(4b)
With probability 1, Yo = (0: lim,fl f(re`B, (o) exists} has Lebesgue measure 0.
The proof of (4a) is a straightforward, but somewhat lengthy, calculation and is therefore left to the reader (see Duren (1970), page 66). We will proceed, then, with the more interesting claim (4b), which is implied by the following result. (5)
Let gn(z), n > 1, be complex-valued and continuous in IzI < 1 except at a finite number of points z with IzI = 1.
If (i) Y' O, I g (z) I < oo and for all r < 1 the convergence is uniform on D, {z : IzI < r}, and (ii) for each N, as r T 1 we have oo
gn(re`B)I2. oo
uniformly in 0,
n=N
then with probability 1, f(z, (o) = Y',
(w)gn(z) has a radial limit almost
nowhere.
Proof Let E = {(0, (o) : lim,.1 f(re`B, co) exists}. It suffices to show that for almost every 0 the section Ee = {w : (0, co) e E} has probability 0, for then the desired conclusion follows from Fubini's theorem.
6.8
179
Inequalities for Conjugate Harmonic Functions
Suppose 0 is such that eie is not a discontinuity point of any of the an, i,ntl P(EB) = a > 0. We are going to construct a decreasing sequence of events An. n > 0, with AO = EB and do this in such a way that limn-,. P(A,) Z a/2. I'or
k,n> 1, we let B,(k) = {coeAn_1 : I f(re`B,w)I < k for all r < 1}.
B,(k) T An_1 as k T oo, so if we have already constructed An_1 with P(A,,_1) z
(1/2 + 1/(n + 1))a, then we can pick kn so that P(B,(kn)) > (1/2 + 1/(n + 2))a and let An = B,(k,J. Let B =n,, 1 An. By construction, P(B) > a/2 and, for all n, z
00
i 9m(re') m(w)
1B < k2P(B).
m=n
The functions bi4, 0 < j < k < oo, are an (incomplete) orthonormal set in L2(S2), so
Y
(E1B
m)2 _< II1B 2.
n
0<m
It follows that we can pick M large enough so that
Y
(E1Bnm)2 <
(PB)2MSm
Since B c AM, we have (since m = 1) kMP(B) > E I
Y gm(ret
Z
m=M
1B/
g g Y gm(re` )gn(re`
ao
= EI
\m=M
gm(re`B)21B + 2
MSm
Cauchy-Schwarz implies that the last expression above 2
>
M<m
<2 m=M
Igm(re` lI2 gn(reiO)I2) 1/2
I gm(re`
Y
Y (E1BSm
(M<m5n
II2)1/2(Ign(rei0)I2)
n)2) 1/2
/2 P(B)
1
n=M
Combining the results above, we see that
kMP(B) ?
-2), m=M
contradicti ng (ii) and proving (5). The next exercise recaptures the main aspects of the last proof in a simpler setting.
Exercise 2 If W 1 Ia.I2 = oo, and SN = N 1 with probability 0.
then limN-,, SN exists
I NO
6
Hardy Spaces and Related Spaces of Martingalee
Remark: Probabilists will recognize this exercise as a special case of the Kolmogorov three-series theorem (see Chung (1974), pages 118-119).
Proof Let C > 0 and B = {w : supN I SN((O) I < C }. If (o e B, I S,,(w) - Sm(w) 15 2C for all m, n, and K
4C2P(B) >- E ( k=m+1
2
ak 4
1B )
n
= P(B)
akI2 + 2
YY
m+15j
k=m+1
Again, SJSk, j < k, is an orthonormal system in L2(Q), so we can pick M large enough such that P(B) z,
YK M+Yj
and can apply Cauchy-Schwarz to show that 4C2P(B) > (k= Y_
I akI z)
P(B) (1
- 2),
Mn+1
which is a contradiction, since we have supposed that Y_ IakI2 = oo. There are many other results for conjugate functions. At this point, we have
not covered all the inequalities on the first three pages of Chapter VII of Zygmund (1959) ! Another inequality that can be proved, using results from Section 6.2, is the following. Exercise 3 Zygmund's Inequality. If gpeL1(8D) and we let p denote the conjugate function on 8D (i.e., the boundary limits of the conjugate of Y(p), then there is a constant C such that zn
f2,,
I:edo
(e`e)log(p(e1e)dB.
6.9 Conjugate Functions of Indicators and Singular Measures In this section, we will investigate the conjugate functions of u = 91A, A C 8D,
and u = /j where u is a measure with µ(8D) < oo, which is singular w.r.t. surface measure. In each case, it turns out that (*) qp(O) = limu(re`B) rt l
exists for a.e. 0
and the value of rz(O : qp(O) < y) denotes only on n(A) or µ(8D), respectively.
6.9
Conjugate Functions of Indicators and Singular Measures
181
The result for 1,, was first observed by Stein and Weiss (1959), who proved the result by computing the distribution when A = Ui (ai, bi) is a disjoint union of intervals (see pages 273-276 of their paper) and observing that the distribution depended only on Yi Iai - bit. The result becomes transparent if we look
at f = u + iu through the eyes of a Brownian motion B, starting at 0. Doing this, we observe: (a) If r = inf {t : B, 0 D}, then f(B,), t < r, is a time change of a Brownian motion C, starting at f(0) = n(A) and run for an amount of time
y= J
I .f'(B,,)IZds.
(b) If 0< ir(A) < 1, then 0< u < 1 in D and u(B,) a (0, 1) for all t < r. This implies that y < T = inf It : Re C, 0 (0, 1) }. (c) Since 1A E L1(8D, n), it follows from results in Section 6.5 that
lim Re C, = lim u(B,) = 1A(BJ a.s., t1= tty so we have y T. (d) Since T < oo, it follows that lim u(B,) = lim Im C, = Im CT, tfv
tfT
and using the equivalent of Brownian and nontangential convergence in d = 2 proved in Section 4.3, we conclude that (*) holds and that (1)
7r (0 : OP (0) > y) = PP(A) (Im CT > y).
To compute the value of the right-hand side, we observe that z --, exp(in(x - 1/2)) maps the strip {0 < Rez < 1} one-to-one onto the half space {Rez > 0} (see Figure 6.3) and sends 1
-.i
1 + i t -> ie-z' 2
a -- exp(in(a - 1/2)),
so if we let a = n(A), b = n(a - 1/2), and c = -2n/2, then it follows from results in Section 1.9 that _fec
CT > y) =
1
cos b
n (cos b)2 + (y - sin b)2
dy,
and changing variables x cos b = y - sin b shows the last integral
=1
((tan-, e`
sin cos b b)
- tan'
(-e-
- e cos b
in b)l
Remark: To be fair to the analysts, we should say that the proof above is essentially due to Calderon (1966) and was later rediscovered by Davis (1973b),
who wrote the proof in probabilistic language. We leave it to the reader to
1$2
6
Hardy Spaces and Related Spaces of Martlnf{ales
a 0
1
exp(iir(z - 1))
-le-ami:
exp(i17(a -
Figure 6.3
show that (1) is consistent with the result given by Stein and Weiss (1959) (see (4.3) on page 273) that sinh Y + i sin
/
expl tl{O:IcP(e)I >Y}I) = 2
2
A2
sinh 2 - i sin 12 I
At this point, the reader can probably guess how we are going to prove the result for u = 9aµ when u is singular. Let f = u + iu and observe that
(a) f(B,), t < t, is a time change of a Brownian motion Cr starting at f(O) = µ(8D) and running for an amount of time
y= Jf(B)I2ds. (b) If 0 < µ(8D) < oc, then 0 < u < oo in D and u(B) a (0, oo) for all t < T. oo)}. This implies that y <_ T' = inf{t : Re (c) Since 9au E h', it follows from results in Section 6.5 that
6.9 Co*gate Functions of Indicators and Singular Measure.
lim Re Ct = lim u(B1) = 0
ttr
183
a.s.,
ttt
and so we have y = T'. (d) Since T' < co, it follows that lim 4(B) = lim Im Ct = Im C(T'), tt=
tty
and using the equivalence of Brownian and nontangential convergence in d = 2 proved in Section 4.3, we conclude that (*) holds and that (2)
7r (0 : p(0) > y) = PP(aD) (Im C(T') > Y)
This time it is trivial to compute the right-hand side. If we let a = µ(8D), then
Pa(Im C(T') > y) = l JJ
a 1t
a2
x2
dx.
The proof above is due to B. Davis (1973).
7 H1 and BMO, Jf 1 and Rd(
7.1
The Duality Theorem for # 1 In Chapter 6, we saw that ifp > 1, X -. X.,, maps .,!!P one-to-one onto in such a way that the., #P norm of X is equivalent to the LP norm of X.. From this observation, it follows immediately that every continuous linear functional qP on MP can be written as cp(X) = E(X,, Y), where YEL9, q = p/(p - 1). When p < 1, the equivalence of MP and LP breaks down and the reasoning above fails to identify the dual space (,#P)*. In this section, we will consider the problem of describing (.,lfl)*. The first step in the solution is to introduce a decomposition due to Bernard and Maisoneuve (1977), which expresses a general XE.,!!1 as a .,#1. As will be the case many times below, (a) sum of very simple elements of the probabilistic definition was developed after and imitates the definition in-
vented by analysts (see Coifman (1974), Latter (1977), Coifman and Weiss (1977)) and, more embarrassingly, (b) we will often assume (e.g., in the proof of (4)) that our martingales start at 0 at time 0 and forget to mention this. (1)
A martingale A E.,kl is said to be an atom if there is a stopping time T such that
(i) A,=Oif t
If A is an atom, IIAII, = EA* < 1.
It follows from (2) and the triangle inequality that (3)
If A" is a sequence of atoms and c" is a sequence of numbers with Y-1c.1 < oo, then X= >c"A"E.,ff' and 11X111 < >1c"1-Clearly, di1. The (3) allows us to construct many examples of martingales in next result shows that every XE if 1 can be built up in this way and furthermore
thatY-lc"1
7.1 no Dudity 'Theorem for .,N'
(4)
185
For all X e #', there is a sequence of atoms A", n e Z, and a sequence of constants c", n e Z, with y_n Ic"I < 6IIXII1 such that as N, oo, N
I c"A" - X in fl'. n= -N
Remark: I think the proof of this result is beautiful. It is a trivial computation, but it is also an ingenious idea. To convince yourself of the latter, you should put the book down for a few minutes and try to construct your own decomposition.
Proof One answer to the problem is: for each n e Z, let
T"=inf{t:IXI>2"} and let An _ (X(t A
Tn+1)
- X(t A T"))/c".
The definition is arranged so that >c,,A, is a telescoping series, so we have N
X, - Y c"Ai = X(t) - X(t A TN+1) + X(t A T-N). n= -N
The last term on the right is <2-N and, hence, in as N - oo. To estimate ' norm of Y = X(t) - X(t A TN+1), we observe that Y = 0 on the {X* < 2N+1 } and Y* 2X*, so by the dominated convergence theorem, EY* < E(2X * ; X* > 2N+') -. 0 as n -> oo. Up to this point, the values of the cn's and the precise form of the stopping times have not entered into the proof. We must now choose the c"'s to make the A"'s atoms. To do this, we observe that
X(t A Tn+l) - X(t A T")I < I2n+1 _ (-2")I = 3 ' 2", so if we want IAt I < P(Tn < co)-', we must pick cn = 3 - 2"P(Tn < oo).
Having done this, we find that
EIcnl=Y3'2"P(X*>2") n 2"
2n
P(X* > 2")dy
= n
:!5; 6
J
f2"-
P(X*>Y)dy=611XII1.
0
Remark 1: It is important to observe that we use 2", n e Z, and not just n > 0. We do this (that is, use n < 0) and use a sequence that grows geometrically, so
that the picture remains the same, if we multiply by 2'. The last feature is
1N6
7
H' wd BMO..t' snd 44&J
crucial if we are going to prove an estimate like (4), which is unaffected if the quantities under consideration are multiplied by a constant. Remark 2: While it is important to let T" = inf{t : I X,I > a"} and to pick c" = (a + 1)a"P(T" < oo), the actual choice of a is not crucial. If we repeat the last computation above in this generality, we find that Y_ICI<
(a+l)
(1 - a-')
EX*.
The constant is optimized by taking a = 1 + -,,,[2-, and for this value of a we get a constant = 3 + 2.,/2- = 5.828... , which hardly seems worth the effort.
With the decomposition in (4) established, it is "easy" to find the dual of .A". A linear functional 'p will be continuous if and only if (5)
sup {IT (X)I : Xis an atom} < oc.
To be precise, a linear functional defined on the linear span of the atoms will have a continuous extension to df' if and only if (5) holds. As in the case of the decomposition, it is hard to guess the answer (the reader is again invited to try), but if somebody tells you the answer and shows you which atoms to use as test functions, it is not hard to fill in the details.
We say that Y has bounded mean oscillation (and write Ye .AV) if YE.,lf2 and there is a constant c such that for all stopping times T, (6)
El Y, - YTI < cP(T < oo).
The infimum of the set of constants for which (6) holds is called the M.,616 norm of Y and is denoted as 11 YII* This definition may not look very natural now, but it will by the end of the next proof. (7)
Let sd be the set of atoms. For all Ye _#', III YII* < sup(IE(X, YJ : Xed) < II YII*. Proof We will first prove the inequality on the right. If X e sad and T is a stopping time for which (1) holds, then
E(X, YT) = EE(XJ E(YTE(XXI FT)) = E(YTXT) = 0,
since XT = 0. From this it follows that EX. Y.1 = I EX.(Y. - YT) 1
< E(X*I Y. - YTI) < P(T < oo)-'El Y. - YTI <- 11
YII*.
7.1
The Duality Theorem for.,N'
187
To prove the other inequality, let T be an arbitrary stopping time, let Zoo = sgn(Y, - YT), and let Z, = E(Z,,,,I.) be the martingale generated by this random variable. Since IZ,I < 1, X, = (Z, - ZTA,)/2P(T < oo) is an atom. I'rom the definition of Z,,,,, it follows that
EIY. - YTI =E(Zc(YY- YT))=EZ,Y.-EZ1YT. The first computation in the proof shows that E(Zao YT) = E(ZT YT) = E(ZT Y.),
so we have
EIY. - YTI
=E((ZW-ZT)YT),
and it follows that
E Y. - YT = E ((Z. - ZT) 2P(T < oo)
1\2P(T < oo) Y-)
= E(X Yom).
Taking the supremum over all stopping times now gives the desired result. With (4) and (7) established, it is now routine to conclude that R.,#&. To prove (41)* c RMO, we observe :
(a) .#2 c ill, with IIXII2 =
(EIX*I2)I12
> EI X*I = IIXIII, so if co is a continuous linear functional on #', cp induces a continuous linear functional
on M2 with IIPII2 = sup{Iw(X)I :IIX112 < 1} < 119111(b) from the duality theorem for ..1f2, it follows that there is a YE .% 2 such that
cp(X) = EXm Y. for all X E JI'2. Since d c M', it follows from (7) that YE'4A' V.
(c) from the atomic decomposition, it follows that .,6l2 is dense in M, so the correspondence cp -+ Y defined in (b) is one-to-one.
To prove that (A'l)* = R.,!!U, we now have to prove that all the linear functionals given above are continuous. This follows from the next result (Fefferman's inequality). (8)
If X, YE.,#2, then I E(Xa YaD)I
611XII I
II Y.
Proof From (4), it follows that X can be written as 1" c"A", where All, n E Z, is a sequence of atoms and N
Y_ c"A< < X* + 1
for all N, t.
.=-N
Since X E A'2, we have X* E L2, and it follows from the Cauchy-Schwarz inequality and the dominated convergence theorem that
EX. Y. = Y c"E(A" Y.). n
INN
7
H' and BMU, .41 nd w..4(J
Using the triangle inequality now with the results of (7) and (4) gives the desired conclusion : IEX. Y. I
EIcnI IE(A"YY)I n
I IcnI II YII* < 611XII1II
YII*.
n
Remark: What we have shown above is that if (p e (.,#1)*, then there is a Ye-V.,#O such that cp(X) = E(X, Y.) for all Xe.A'2. In Section 7.2, when we give the "classical" proof of the duality result, we improve this conclusion slightly by showing that qp(X) = E<X, Y>W for all Xe.441. (9)
If there is a constant c such that for all stopping times T
(*) EI Y. - YTI < cP(T < oo), then it follows that we have (**) E(I Y. - YTII 30"T) < c
a.s.
for all stopping times.
Proof Applying (*) to the stopping time
T' =
(T
ifE(IY(-
oo
otherwise,
we see that if P(T' < oc) > 0, then cP(T' < oc) Z E(I YO -YT E(E(I Y. - YTII JET) 1(T' x,))
> cP(T' < oo), a contradiction, so P(T' < oc) = 0.
7.2 A Second Proof of (,&')* = R,& O In this section, we will give a second proof of the duality theorem for -#' following Meyer (1976). This approach starts with a somewhat different definition of -4,V9.
Let Xe.,#2 with X0 = 0. We say that Xe-"02 if there is a constant c such that, for all stopping times T, (1)
E(XQ - XT)2 < c2P(T < oo).
The infimum of the constants with this property is called the _q,#02 norm of X and is denoted by <<X >> *.
7.2
A Secood Proof of (. 4')* - 4Y4N
189
Remark 1: From Jensen's inequality for conditional expectations, it follows
that if Xe. .,lf02, then Xe-V..lf0 and <<X>> Z IIXII*. An inequality in the other direction, <<X>> 5 CIIXII*, is also true and is a consequence oI' the two proofs of the duality theorem. We will also give a direct proof of the second inequality in Section 7.6. XT Remark 2: Since E((X, the definition of -4.,k02 can be written as
(1)
IJIT)
= E(<X )m -
E(<X> - <X )r) : c2P(T < oo) or, in view of (9) in Section 7.1, as
E(<X> -
c2
a.s.
The first step in our second proof of the duality theorem is the same as in the first proof. We show that every continuous linear functional on .,K1 comes from a Ye 9.,0 02. (2)
If 9 is a continuous linear functional on A1, then there is a YE-V.,#02 such that for all Xe.df2 cp (X) = EX. Y..
Proof It suffices to prove the result when <<9>> 1 = sup { I (p(X) I : <<X>> 1 5 1 } = 1. Jensen's inequality implies that «X))2 = (E<X)W)112 z E(<X>10012) _ <<X>>,, so tp induces a continuous linear functional on .,#2, and it follows as in Section 7.1 that there is a Ye.112 such that 9(X) = EX, Y. for all Xe A'2.
To show that Ye"02, let T be a stopping time and let X, = Y, - YT,,,. X is the stochastic integral J Y where J = 1[T,.,), so using our formula for the covariance of two stochastic integrals ((3) of Section 2.6), we have <X, Y> = JJd
<X,X)r =
J,2d3. J0
Since J32 = J, it follows that
<X,X>.=<X,Y)m=
it follows from the inequalities of Doob, Cauchy-Schwarz, and KunitaWatanabe that
EZ-E<X,Y).=EX.Y, . On the other hand, we have EX. Y. = (p (X) <- << X ))1 = E(Z 1/2),
191)
7
H' mnd BMU, .N' rnd M,
and since Z = 0 on IT= oc }, it follows from the Cauchy-Schwarz inequality applied to Z 1(T< .) that E(Z112) < (EZ)112P(T < 00)1/2.
Combining the last three results proves that EZ < (EZ)112P(T < 00)112,
that is,
EZ = E(
If XE., (1 and Ye .,lf02, then
J'ld<X,Y> l
E
Proof By stopping, it suffices to prove the result when X, Y, <X>, and
EJ
Since <X>, has bounded variation, ordinary (Riemann-Stieljes) integration gives <X>-112
d <X )t = 2<X> a1o/2,
J0 so
E
f
<X>t 1/2d<X>, = 2<<X>>1
0
To estimate the second integral, we fix co and integrate by parts to obtain E J oo <X>112 d
=E
r
J0
2
-
fOD
d<X>i12
0
-
At this point, it is very easy to complete the proof if we leave out one detail. I claim that since <X> is adapted to F, the last expression
7.2
A Second Proof of (.t') - :M.A'
= E f " E(< Y>.
191
-
If you accept this, then there is nothing left to show, for definition (I") of 5U. WC) 2
implies that the above is < KY>)*E(<X>1/2) 00 = <<X»1<
Combining the inequalities above shows that I d<X, Y>sI < (2<<X>>1)1J2(<<X»1<
EJ 0
which is the desired inequality. To complete the proof of (3), it remains only to justify the equality claimed above. To do this, we will prove a general result: (4)
If Z is a bounded random variable and A is a bounded increasing process adapted to IF with AO = 0, then E fo'O E(Z
I S)
Some care is needed in defining the integrand, since A may be singular and, for each t, E(Z I.;) is defined only up to a null set. In our situation, there is no problem. The Brownian filtration admits only continuous martingales, so we take versions of E(Z I.) that are continuous in t for each w. In the language of the general theory, we are taking the optional Technical Remark:
projection of the process YY = Z (which is constant in time), but in our situation
we do not need this notion, since there is only one reasonable way to define E(Z I.) for all t simultaneously.
Proof Suppose without loss of generality that Z >_ 0.
EZA = E
A ko
\n / -A Ck
n 1/
=Ek>E(Z(A\n/-ACkn = E kj (A 1 =1
/ ki)
1/)I,
- A (k n
n
1) E(Z
I Ak/n)
-> E f 0 E(Z I.,) dA1 0
as n - oo, by the dominated convergence theorem.
192
7
H' aad BMO, .N' rnd 91.+YPV
Taking Z =
J
- J F
0
0
=J
E(
)d<X)''2
0
=f
-f
0
-
)d<X>t
which completes both the proof of (3) and the proof of the duality theorem. As we mentioned in Section 7.1, one advantage of the new proof is that it gives
a formula for the linear functional that is valid on the whole space and not just on a dense subset.
7.3 Equivalence of BMO to a Subspace of L# Our next goal is to prove the duality theorem (H1)* = BMO. One-third of the work for the proof of this result was done in Sections 6.4 and 6.5, when we showed:
(a) f - Re f(B,), t < r, maps Ho one-to-one into .,ktl and, furthermore, the //T' H' norm off is equivalent to the Xe..,R#', norm of its image (b) if we let X, = Ref(BfAT), then so if we let M: f-+ Ref(BEAT), then the results in (a) hold when the r is erased.
It follows from (b) that M(H0) is a closed subspace of .#1 and that all continuous linear functionals on Ho have the form A(Mf), where A e (.,#1)*. Since (.,ff 1)* = _4.,#0 (the second third of the work), it then remains to identify M(H, )* c -4.,#(9 and to show that we can map BMO one-to-one onto M(Ho )*
in such a way that the BMO norm of a function is equivalent to the .4,f(9 norm of its image. The answer to the first question can be found by very naive reasoning :
M(HH) = {XE.,ffl : X0 = 0, X, = h(B,), t <,r, and X is constant for t > r}, a space we call lf,; , so if L.,L(O,, is defined in the obvious way, we should have
When we prove that (H1)* = BMO in Section 7.4, we will show that this reasoning is correct. To prepare for that result, this section is devoted to determining which harmonic functions h give rise to martingales in (.,ff,; )* =
R.,1f0,, (the last third of the work). The first baby step in doing this is to observe : (1)
If we let ('
Yf= J ke(z).f(e`B) dn(0), then h = Yip where 9 e LZ(8D, n).
7.3
Equivalence of BMO to a Subspace of A WO
193
Proof c .,llti , so the result follows from (2) in Section 7.3. The last result is not much but, in view of the remarks at the end of Scct ion 6.5, it allows us to think of h as being a function (p e LZ (8D, n) and to conclude that t < T
XJh(Bt) t
- gq(Bz)
t > T
is a continuous martingale e.,ll2. A more substantive conclusion results if we use the definition of M.,KO: for all stopping times T, E((Xao - XT)ZI FT) S C2.
So it follows from the strong Markov property that
E((XX -
`YT)2I_f
T) = W(BTnz)
where w(z) = 0 on OD and w(z) = Jko(z)(co(ei) - YTV(z))'21r(O)
when z e D. So in terms of w, the condition for x to be in M.,lfO is (2)
For all z e D, w (z) < c2.
The aim of this section is to show that the last definition is equivalent to the following notion in analysis. (3)
T is in BMO if there is a constant c such that for all intervals I, 19
-WIIZI
f
I
where
=
dO (P
(P,
I
is the average value of p on I. The smallest positive constant with this property is denoted by Conditions (2) and (3) have a very similar form. To emphasize this we will introduce some notation. Let II
.le(re`l') =
jl/(l - r) if Iii - OI < (1 - r)n 0
otherwise
Ap (z) = JJo(z)(e)d7t(O) x'(z) = JJ9(z)(4(ei9) - y (z))2drt(O).
194
7
..e :ir.4u
H' .ed 3MO,
If we recall that dn(0) = d0/2n and observe that as z ranges over D, the supports of the maps 0 -' je(z) run through all possible intervals I, then we see that (3) says simply that w(z) 5 c2 for all z e D. The first step in understanding the relationship between (2) and (3) is to
understand the relationship between ke(z) and je(z). To begin, we will look at the asymptotic behavior of k9(r) as r - 1. (4)
ke(r) =
I lr-
erelz
1 - r2 (r - cos 0)Z + (sin 0)2
1 - r2 1 + r2 - 2r cos O
(1+r)(1-r) (1 -r)2+2r(1 -cos0) 1+r (1+2r(1-cos0)1-1 1-r (1-r)2 J ' so if we let 0 = y(1 - r), then
(1 - r)kl(l-,)(r) -- 2(1 +
y2)-1
2n times the density of a Cauchy distribution with parameter 1. The last result should be no surprise. If we approach the boundary of the disk and rescale the picture so that we are at a distance 1 from the boundary, then the boundary will approach a straight line, so the rescaled exit distribution will approach the Cauchy distribution and the factor of 2n arises since on OD we are looking at the density w.r.t. A = d0/2n.
The computations above show that the natural place for viewing ke(r) is at 0 = y(1 - r). Since this is the width of the support of je(r), the strict positivity of the Cauchy density leads easily to : (5)
There is a constant A such that, for all z and 0, je(z) 5 Ake(z).
Proof From (4) we see that je(z) < C,ke(z) for all Izl < r, so it suffices to consider what happens when r - 1. From (4), it follows that
ke r
O
_ l + r (1 + 2r(1 - cos 0)1 r)2
1-r
J
Now 1 - cos 0 = e(0), where e(0) _ 02/2 as 0-0, so Is(0)15 CO' for Oe [0, 2n] and it follows that if r < 1 and 101 < (1 - r) 7r, then
1+2r(1-r)Z0)51+(1 2r)2C(1-r)Zn2, proving (5).
7.3
Equivalence of BMO to a Subspace of :,Y.,NtV
195
With (5) established, it is easy to prove: (6)
For all zED, w(z) < Aw(z).
Proof By (5), Aw(z) ?
- 9a(p(z))2j9(z) dir(e)
J((e1°)
- Ap(z))2je(z)dn(O),
since a - $ ((p(e`) - a)2j9(z) drz(O) is minimized at a = 9atp. In more familiar terms, the mean p = EX minimizes E(X - a)2, since
E(X-a)2=E(X-p+p-a)2=E(X-p)2+(p-a2). Having proved (6), we turn our attention to the other comparison. From the proof of (6), it is clear that it would be enough to show that there is a constant B such that (*)
BO(z) >- J(q(eio) - #(p(z))2ke(z) dir(e),
for this would imply, by the argument above, that i(z) z w(z). The proof of (6) was easy, because all we had to do was show that some multiple of ke(z) was >- je(z). This statement is false if we interchange the roles of je and k9, so we will have to work harder to prove the result we want-we have to add up a large number of functions of the form cl[a,b] to make something z ke. Because of this difficulty, we will prove a slightly weaker result than (*) that is still sufficient to prove that the norms are equivalent. (7)
There is a constant B such that for all z e D, B (sup
)?
(z)
J(cp(ei0
- #(p(z))2ke(z)d7t(9).
Proof As we mentioned above, we prove this by adding up multiples of lla bi to make something > ke(z). The first step is to introduce the intervals. Let
Io = [-(1 - r)n,(1 - r)ir] and for n > 1, I,, = 2"I0, let N be the largest integer such that 2N(1 - r) < 1, and write
F. =
10 + n 1 + fl--,-]-IN For a picture of the decomposition, see Figure 7.1.
19b
7
H' ud sM0, .,#' and iM.,re)
FT
Figure 7.1
k8(.9) = 19/(1 + 180(1 - cosO)), Io =
N= 3
Estimating the integral over Io is easy. If 0 e Io, then (4) implies that ( ao)
1-r2 < 1-r2 ke((1-r)2+2r(1-cos0) r) = (1-r)2 = i + r < 2je(r),
so we have (co(eie)
(bo) 20(r) ? J
- Pl(o(r))2ke(r) dn(0).
to
(The reason for this strange numbering will become apparent as the proof goes on.) Estimating the rest of the integrals requires more work. The first step in estimating the integral over I" - In_, is to observe
(a") There is a constant C such that if r > 1/2 and n < N, (1 - r)ke(r) < C4-" for all 0eI,, - In_1.
Proof From (4), it follows that
(1 - r)ka(r) = (l + r) 1 + 2r(1 - cos 0)1 (1 - r) Now if n < N and 0 E I" -
then
1 - cos0 > I - cos(2"-ln(1 - r)),
and we have from calculus that
inf
XE(0,l)
1 - Zosx-e>0, x
7.3
Equivalence of BMO to a SubNpace of :N. w V
197
so
(1 - r)ke(r) < 2(1 + E(2"-17,)2)-1, proving (an). The estimate in (a") takes care of ke(r) on In - In_1. To estimate the rest of the integral fin (7), we let 1/2
IIfIII
=(
If 12 dO) JI"
and
a" =
f
J9 jel
In
F
.
Since I I I I n2 is a norm, II(p
- a0II2
IIp - anII2 + E Ilak - ale-1II2 k=1
The first term 1/2
((p
-a")2dO)
<-II(pII*IInV12'
r"
The second term n
= k=1i Iak - ale-1 I.I1/2 To estimate Iak -ale-l I, we observe thatI dO I) 1/2
Iak - ale-1I = ((ak - ak-1)2)1/2 = (
(ak
-ale-1)2
k -1
(f
k
((P 1
1)2 I
- ale
k-
Ik
+ (f
de1/2
(ale-w)2IIk)
<-II(PII*+(2
II
d61 I)1/2
(p)2 IId81
(ale -
-1 )1/2
<3II(pII*'
Summing the estimates above, we find that
II' - aoII < (3n +
1)II(pII*II"I112,
so
(co - a0)2 d6 < 16n2 II(p
*2nn(1
- r).
Combining this with the estimate (an) and recalling that ao = 0c0, shows that
forn
J n -I"-1
((p - 9(p)2ke(z)dO < C'II(pII2n2/2-n.
198
7
H' and IMO,. w' and L *C)
The last detail is to estimate the integral over [-n, n] - IN. To do this, we observe that IN D [ - n/2, n/2], so cos 9 < 0 on [ - n, n] - IN, and it follows that from the proof of (aN+l) if r > 1 /2 and 0 e [ - n, n] - IN, then
(1 - r)ke(r) < 2(1 +
(2Nn)2)-1
and repeating the arguments used to prove
proves :
(q - Ycp)'ko(z)dO < C'Il(pjI*NZ/2-N
(bN+t)
Adding up the estimates (bo) + (bl) +
+ (bN+l) and recalling that II0I* _
sup, w(z), proves (7).
With (7) proved, it follows immediately (for reasons given after the proof of (6)) that we have (8)
For all z e D, BII(pII*
? w(z).
Combining this result with (6) shows that if we let «(V>>* = sup w(z) Z
(= the M.,#C norm of the associated martingale, Ytp(BtAt)), then A IIq'II* < <
For some arguments in Section 7.4 and beyond, it is useful to write the definition of << >>* in a slightly different way. Let tp e L2(8D, n) and let u = ° p. Since u(z) = E,cp(B,), then w(z) = E,(u(Bj) - u(z))2 = E,u(B)2 - u(z)Z. Since U, = u(B,,,) e .,tl Z, it follows that
w(z) = EZ
so in view of the results of Section 1.11, the condition for q' e BMO can be written as (9)
sup =
f GD(z, W) I Vu(w)12 dw < oo, D
where GD(z, w) =
2
n
log
1 -wz z-w
is the Green's function for D.
7.4 The Duality Theorem for H', Feffermao-Stein DeoompoItion
199
The last result is very similar to a classical analytical characteriilltion of BMO. A positive measure .1 on D is said to be a Carleson measure if there is a constant c such that for every sector
S={re`B:1-h
(p e BMO if and only if the measure defined by 1Vu(z)Izlog111dxdY,
z
where u = 99, is a Carleson measure and, furthermore, the constant K in the definition can be chosen so that Clll(pll* < K((p) <- C211911*
For an analytic proof see Garnett (1980), Chapter 6, Section 3. Even though (9) and (10) are very similar, I do not know how to get the second result from the first.
7.4 The Duality Theorem for H', Fefferman-Stein Decomposition In this section, we will identify (H,)* as BMO. There is a standard way to recover a complex linear functional A from the real part of the corresponding functional on the associated real Banach space : A(f) = Re A(f) - i Re A(if ), so, to make the transition to martingales easier, we consider only real linear functionals. The identification of (HO)* is a four-step procedure. (1)
If A e (Ho)*, then there are functions g, and g2 in L°° (8D, n) such that if f = u + iv c-H', then
A(f)= J
u91+vg2dn.
By results in Sections 7.2 and 7.3, the mapping f --> (u, v)lao identifies Ho with a subspace K' of L' (8D) x L' (8D) in such a way that
Proof of (1)
1If IIHI and 11uIILI + 11v1IL, are equivalent norms. If A is a continuous linear
functional on HO, then it gives rise in an obvious way to a linear functional on K', which by the Hahn-Banach theorem extends to a continuous linear
functional on all of L' x L'. Since (L' x L1)* = L°° x L°°, it follows that there exist g, and 92 E L°° such that (1) holds. The next step is to show:
. #' rnd 940
200
7
(2)
If fl , f2 e Ho and u;, v; are the boundary limits of Refs, Imf , then
H' rnd DMO,
fuiu2dlr = n
an
Proof Let U,'= u1(B) for t < T. Ito's formula implies
u;(B,) = 5 Vu,(Bs) dB, 0
so the formula for the covariance of two stochastic integrals gives
A similar argument shows
and the Cauchy-Riemann equations imply Vv;
_1
=C0
Vu;.
0)
so we have Vu1 Due = Vv1 Vv2 and
EoUa1U.2=Eo
(2) allowsjui us to write
A(f) =
+ u2 d7r, n
where g2 has the obvious meaning: it is the boundary limit of the conjugate function of .g2. The rest of the proof is very easy. Recalling that conjugation is a martingale transform, we see that (3)
91 + 62 a BMO.
Proof Since g1, 92 cL°°, they are in BMO. Let g3. = g2, u; _ Yg;. By results in Section 7.3, g; is in BMO if and only if T
sup E.
I V u; (B) 12 ds < oo.
Z
fo The Cauchy-Riemann equations imply that I Vu2l = I Vu3 1, so it follows that
g3 =g2eBMO.
7.4 The Duality Theorem for II', Fefferman-Stelo Decompwltlon
201
Given the martingale duality theorem and the results of the last Iwo sections, the last step is trivial. (4)
If h c- BMO and we let
A(f) = faD uhdn for all f e H02, then A has an extension that is a continuous linear functional
on H. Ut = u(BtA t). Since Proof Let h E BMO, fE Ho , u = Ref, and let Ht = Uo = 0 and BMO c H2, the usual domination argument shows
uhdrz=EU,,,, H..
Using the martingale duality theorem and the equivalence of the two norms shows EUUHH < 6IIUII1IIHII* < ClI f 11111h1j.
and completes the proof of (4).
Remark: The theorem above makes no mention of Carleson measures, but it is otherwise the same as the one given by Fefferman and Stein (1972), on page 145. The only difference is that we have carried out the two main steps, (3) and (4), using the correspondences developed in the last two sections to reduce these steps to results about martingales. As a corollary of the duality theorem, we get the following result from BMO. (5)
If cp e BMO, then qp can be written as g 1 + 92 + c, where c is a constant and g1, 92cL°°, and furthermore, this representation can be done in such a way that 119111.+119211.
Proof The first conclusion is trivial. If q E BMO, then
f-
f (Ref)cpdn ,Jao
defines a continuous linear functional on Ho, so it follows from the proof of the duality theorem that these are g,, g2 E L°° with
f (Ref )cpdn = f (Ref )(91 +92)dn .Jan
.Jan
for all f E H02, and taking test functions of the form YO, where 0 E L2 and
= 0, shows that cp - (g1 + g2) is constant.
do
202
7
ll' and BMO, . N' and :9 #C
To prove the second conclusion, we observe that, given a cp E BMO, the first equation in this proof defines a continuous linear functional A on Ho with II A II 1 = sup { I AJ I II f II 1 < 11 < C II w II * Now Ho is equivalent to a subspace of L' x L', so it follows from the Hahn-Banach theorem that we can extend A to be a linear functional I, on L1 x L' in such a way that :
I1"II1x1 < C'IITII*1
where IIF 1.1 = sup{Ir(u1,u2)I : IIu1II1 + IIu2II1 < 1} (the constant changes only because we change from the L1 norm to the L' x L' norm). Now any I' E (L' x L1)* comes from a pair (q1, g2) E L°° with IIg111., IIg2Ii. < IIFIIL'xL'
(consider functions of the form (u1, 0), (0, u2)), so the proof is complete. It is easy to generalize the last proof to prove a result about martingales.
To do this, we observe that if A1, ... , Am are matrices without a common eigenvector in R°, then Janson's theorem ((1) in Section 6.7) implies that if we let A° be the identity matrix, then the mapping X - (Ao * X, ... , Am * X)
embeds #' in 21 x
x .1 in such a way that
IIXIII
(6)
(recall that if YE 2', then II YIIy = II YIIy') If tp(X) = E<X, Y> defines a continuous linear function on .,#', then (6) implies that
O(A°*X, ...,Am*X) _ T(X) defines a continuous linear functional on a subspace of 2'' x . . . x Y', which the Hahn-Banach theorem asserts can be extended to the whole space.
Since 21 is isomorphic to
(2' x .. x T')* = ii
x
... , Ym c-.' such that if X °,
we have (2'1)* = #' and, hence, .. x r, that is, there are martingales Y°, .
.
.
,
Xm E 21, then
m
(X°,
,Xm)
m
E(XXY.) _ i=o
r=o
E<X`, Y`),,o,
and hence we have m
E<X, Y> _ (p(X) = (Ao * X, ... ,Am * X) = Y E
If XX= O'H.,-dBs
and
Y, =
f 0
0
then
(A HS, Ks) ds
=
(H.,, AT KK) ds = <X, AT * Y`> J0
,
7.4
The Durllty,rheorem for 11', F'effermrn-Stein DecompoNltlon
20.1
so we have m
E<X, Y>c = Y E<X,AT * Y`). t=o
and, since this equality holds for all X e .,#' ,
Y= I AT * Y` i=O
Now in the arguments above we started with an arbitrary Ye. -V.#0 so it follows from the last equation that we have : (7)
If A1, ..., A. do not have a common eigenvector in R', then any YeR,#O can be written as
Y°+
m
AT*Y`,
with Y`efor i = 0, 1, ... , in, and furthermore this can be done in such a way that II Y`II.
CII Y11*-
Since the proof of (7) uses the Hahn-Banach theorem, it does not tell us how to find the martingales Y°, ... , Ym. In simple cases, this can be done "by hand." Let d = 2,
Example 1
A = (_ 0
\
1
11
0 '
and let Xt = B,'AL where
i=
inf{t : I B, 01 = 11. If we let Y, = BfAt 0 and pick a, b so that (1, 0) = a0 + bAO,
then
Xt=af
tAT
( A S
A0 dB,
,Jo
,10
= (a Y,) + A * (b YY).
Finding a decomposition in general, however, seems to be quite a difficult problem. A. Uchiyama succeeded in doing this recently for d-adic martingales (1982a) and has generalized the construction to BMO functions (1982b), but
I do not know how to prove the result for RMO, and I invite the reader to consider this problem.
The key to deducing the duality theorem for Hp from its probabilistic analogue was the fact that the mapping M (mnemonic for Martingale) defined by M: p
Y p(B,j
t > 0
has I M(p 11 * < C II (p I * . For some applications, we will need to know that the inverse N, defined by I
N:X-+cp(e'e)=E(XXIBT=e`°), is also continuous.
204
7
H' and BMU, .N' and s ,#O
(8)
There is a constant C such that, for all X c- -4,#0,
INXII* <
CIIXII*.
Proof From the duality theorem for Ho and the open mapping theorem (see Dunford and Schwarz (1957), Chapter II, Section 2), it follows that
f fgp dn
If II* < C sup
.
J
OCHO
Ihll,si
Since Bt has a uniform distribution, J(NX)co drz = EE(XXcp(B,)IBL) = E(X,gp(BL))
(all the expectations above exist, since X e . #2 and cP e H2). If we let U = ecp(B,A,:), then it follows from the martingale duality theorem that IE(X,cp(B,))I < CIIXII*IIUIII <-
C'IIXII*.
From (8) and duality, it follows that if N is extended in the obvious way it maps Al' continuously into H'. Combining this result with a trivial result for p > 1 gives the following : (9)
If X e ,# 1 and NX = cp, then there is a unique analytic function f with Ref = Ycp and Imf(O) = 0. If p > 1 and we denote this function by FX, then there is a constant C < oe that depends only on p such that IFXIIp< CIIXIIP.
Proof For p > 1, this result is trivial, since IIFXIIp < CIIReFXIILP(aD) and
Re FX IIp(aD) = E I E(X.I B,) JP
<EE(IX.rIB,) = EIX,IP < EIX*IP = IIXIIp
To prove the result for p = 1, we observe that if Xe.llo, then FXeHO, so the duality result and the Hahn-Banach theorem imply that IIf111 _< C sup IIw
J(Ref)P d ir.
7.5
Examples of Mardnades In tt 4 ')
205
If we let u = 9ap, then we have
J(Re FX )cpdn
E(E(X,IB)u(B)) = EE(XCO U. I B)
= E(X, U.) < CIIXII1IIUII*
..#2.
Since .jf 2 is dense in .,# 1,
the desired result follows.
Remark: The last result is false when p < 1. It is known that there are nontrivial continuous linear functionals on H° for p < 1 (see Duren, Romberg, and Shields (1969)). In Section 7.8, we will show that there are no nontrivial continuous linear functionals on .,lf°, p < 1, so X -> FX cannot be continuous. Note: We learned the results on M and N, (8) and (9) above, from Varopoulos (1979), who attributes the results to Maurey. Exercise 1 Sharpen (3) by showing that if qp a BMO and Cp is the conjugate function, then Cp e BMO and
«w»* _ «w»*, IIwII* <- CIIgII* As a corollary of the last result, we see that if cp e L°°, then CP e BMO, a result first proved independently by Spanne (1966) and Stein (1967). The function f(z) = i log z shows that we may have tp e L°° but Cp 0 L°°.
7.5 Examples of Martingales in R ,#C In Section 7.1, we defined the -4.,#C norm II X II * of a martingale to be the smallest number c such that for all stopping times T (1)
E(I X, - XTI) 5 cP(T < oo).
In this section, we will try to explain what kind of martingales are in -4.#6 by giving four examples and describing some of their properties. Example 1
If X c- . e l f °° = {X : X* e L°° }, then it is immediate that X e RJie2
and I I X II * _< 2II X II
.
With a little thought, we can replace the 2 by a 1:
(EIXc -XTI)2 <E (X. -XT)2 EX .2 -EXr2. <- IIXIIW.
206
7
H' and HMO, . N' and .4.4#0
There are also unbounded martingales in M..#O. Perhaps the simplest example is the following. Example 2 Let Xt = Bt 1. If T is a stopping time, then Xm - XT = B1 - BT 1
is independent of . and has a normal distribution with mean 0 and variance (1 - T)+. Since Bc c1/2B1 and EIB1I =
2JrO
(27r)- 1/2xe-x2/2dx = (2/7C)112,
it follows that E(I X. - XTII
(2/7r)112P(T < oo)
and, taking T - 0, that IIXII * =
(2/it)112
Once you get started, it is easy to use Brownian motion to construct
numerous examples of unbounded martingales in 1f0. An obvious modification of the last example is to let S be a random variable that is independent of B, t > 0, and has an exponential distribution P(S > t) = e-xt. If we let Xt = Bs, then the lack-of-memory property, P(S > t + ul S > t) = P(S > u), of the exponential implies that X is in .4.,610. Another possibility is to introduce
the stopping times R" = inf{t > Rn_1 : I B(t) - B(Rn_1)I > 1} and let Xt = B(t A RN), where N is an independent random variable with a geometric distribution P(N > n) = (1 - p)", n = 0, 1, 2.... The next example is a variation
of this-N depends on B, t < TN, and is chosen to try to produce a large maximum. Example 3 Let Ro = 0, and for n >- 1, let R" = inf{t > Rn_1 : I B(t) - B(Rn_1)I
> 11. Let N = inf{n : B(R") - B(Rn_1) = -1 } and let Xt = B(t A RN). See Figure 7.2 for a sample path. It is easy to get a bound on the 8.610 norm of X. If T is a stopping time, then on {Rk < T < Rk+1, N > k}, XT = k + a for some ae(-1, 1), and P(N > k + jI.FT) <- (1/2)j-1 (the worst case is a - 1). On the other hand, a look at Figure 7.2 shows that I X. - XTI < N - k (the worst case is a -> - 1), so combining the estimates gives that E(I X. - XTI I.FT) < E(N -
2,
and we have IIXII* < 2. A look at the parenthetical remarks above shows that this is not the right answer, but we are not far from it. It is easy to compute f(a) = E(I X. - XTI I XT = a, T < R1) and maximize to find IIXII* = 9/8 (details of this computation are sketched in Exercise 1 at the end of this section). We have taken the trouble to compute the -4A'(9 norm of the last example so that we can check the accuracy of some of the inequalities below. By abstracting the construction above, you can produce many examples of martingales in .4.610. The next construction is so general that we can think of it as giving the typical element of -4,NO (see the proofs of (1) and (5) in the next section).
7.5
207
Examples of Msrtlogsles 1n .W.NP"
N=5
R,
T
R,
R,
R,
R,
Figure 7.2
Example 4 Let T" be an increasing sequence of stopping times with To = 0
and I B(t) - B(Tn_1)I < A for t e [T"_1, T"], n >- 1. Let N be a stopping time
for .F(T"), that is, {N < n} E.F(T"), and suppose that for all n > 1, P(N > (Tn_1)) < 0 < 1 on {N > n - 1}. If we let X, = B(t A TN), then X E R.,W0 and as the reader can easily show
nI.
IIXII* < A(1 + (1- 0)-1). The four examples above should give you an idea of what type of martingales are in -4.Af(9 and, hopefully, make the results in the next two sections more obvious. To lead you in the direction of the John-Nirenberg inequality ((1) in the next section), we will now compute P(X* > A) for the three unbounded examples. Example 2.
P(sup X, > ..) = 2P(B1 > i±) =
P(X * > 2) < 2P(sup, Xt > 2) =
4
2
e k2 /2 r
e-v212 dy.
exze X212 dx
21T
4
e x2/22-1. As 2 -+ o o.
2n
Example 3.
If n is a positive integer, then
P(X*>n)_(l/2)".
2011
7 Wand IMO.. N' and :4..WE)
E x a m p l e 4.
I f n is a positive integer, then
P(X * > nA) < P(N > n) < (1 - 0)". Exercise 1
Compute the -4.,lfC norm of the martingale in Example 3.
(a) By the strong Markov property and independent increments of Brownian motion, it suffices to consider the case XT = a e (- 1, 1) a.s. on IT < co}. (b) When XT = a, the distribution of IXX - XTI is given by the following table :
X.
I XX -XTI
-1
a+1
Probability (1-a) 2
( a +1 ) 2 /I)n+i
2
n
n-a
(a 2 1)
I\
Summing over the possibilities, we get
E(IX.-XTI)- (a+1)(2-a)/2 ifa>-0 P(T
1(a+1)(1-a)
if a<0.
The maximum value occurs for a = 1/2, and here the value is 9/8.
7.6 The John-Nirenberg Inequality In this section, we will prove a classical result on BMO functions due to John and Nirenberg (1961), which was discovered almost a decade before it was known that (H1)* = BMO. Following our usual inclination, we will prove the probabilistic analogue first and then use this result to prove the analytical result. (1)
There is a constant Ce(0, oc) such that if IIXII* < 1, then
P(X* > A)
The first step is to prove :
If Ye.4,06 and S > T are stopping times, then EI YS - YTI < II YII*P(T < oc).
Proof If T is a stopping time, then Z, = YY - Y, AT is a martingale and Iz l is a submartingale which is dominated by an integrable r.v. (recall that Ye.,A'1),
209
The John-Nlrenberg Inequality
7.6
so the optional stopping theorem implies that
El Ys- YTI =EIZSI <EIZmi =EIY. - YTI, proving the result, since, by definition of II Y II *
EIY. - YTI <-IIYII*P(T
P(Ri < c0)
EIXR.+l - XR.I > aP(Ri+1 < oc).
Therefore, P(Ri+1 < co IRi < oo) < 1/a and, by induction, P(R,, < oo) < (1/a)"P(R0 < oo). Let n be an integer and let an < 2 < a(n + 1). Then
P(X* > )) < P(X* > an) < P(R" < oo) < (1/a)"P(Ro < oo)
< a(l/a)'I° = ae-'(""I'. Setting a = e, to maximize (log a)/a, gives (1).
Remark: To summarize the proof in loose language : The amount of wiggling a martingale in -4,1f0 can do has a geometric upper bound because of (2),
and, hence, even if all the movement occurs in the same direction, a large maximum will not be produced. To translate (1) into a result about BMO functions, we recall that in Section
7.3 we showed that if q E BMO and we let u = 9vp and U, = u(B,A,), then UE. J1(!9 and there are constants c, Ce(0, oo) such that cII9II* < II UII* < CII(P
(*)
Combining the last observation with (1) gives: (3)
The John-Nirenberg Inequality. There are constants C, y e (0, oo) such that if * < 1 and $ cp do = 0, then {0 : I4 (ete)l > Al I < Ce-".
II
II
Proof (*) implies that II U II * < C, so 110: Iq(e`e)l > All < 21tP(U* > A).-5 C'e-'Ice.
It is easy to improve (3) to the usual result. (4)
Let C, y be the constants in (3). If II'II* < 1, then for all intervals I, {O E I :
I q(e`e) - cprl > All <
Ce-yxlI1.
Proof If! = (a - hn, a + hit), then >V (eie) _ cp(exp(i(a + Oh))) - gyp is in BMO with I IV/ II * < 1 and 10 = 0, so (4) follows from applying (3) to 0. The conclusion of (1) can be improved in a similar way.
210
7 W .ndBMo,.M' snd91_
(5)
Let C be the constant in (1). If IIXII* S 1, then for all stopping times T, P (sup I tzT
- XTI > A) :9 Ce--"P(T < oc). /l
Proof This could be proved in the same manner as (4), but it seems easier to observe that repeating the proof of (1) with Ro = T proves (5). From (5), we see that the definition of -4,#C as sup E(I
XT I
I T < oo) < oo
T
implies the much stronger conclusion that (6)
For all a < (eIIXII*)-1, supE(ealxw -xTIIT < oo) < o0 T
(in both cases, the supremum is over all stopping times). From (6), it follows that ...,df0 can be defined as the set of all martingales with
supE(IXc,, -XTI"IT
There is a constant C < oo such that IIXII* <- <X»* <- CII X II*
Proof We proved the left-hand inequality in Section 7.2. To prove the righthand inequality, we observe that it follows from (5) that if IIXII* 5 1, then
(oo),
E(X. - XT)2 < fO'O 22P (sup I Xt - XTI > .1 t>_ T
<
J
proving (7).
As a corollary of the John-Nirenberg inequality, we get the following striking result due to Zygmund (1929): (8)
If (p is continuous on 8D, then its conjugate function Cp has
JexP()(o)I)do
< oo
for all A < oo .
Proof Since every continuous (p is the uniform limit of trigonometric polynomials, we can, for any e > 0, write qp = gyp, + P2, where gyp, is a trigonometric
7.7
211
The G.rnett-Jones Theorem
polynomial (and hence has a p , e L°°) and 92 has 1 1P 21 1 , < II cp2 IIm ` r, u i I he desired result follows from (6). On page 253 of Zygmund (1959), you can find a continuous cp whose conjugate ap is unbounded: cp(e")
Exercise 1
°°
Y
=
sinnO
nlogn
cp(e
_
°°
cos n9
= n=2 Nogn
Calderon's Proof of the John-Nirenberg Inequality. Let
a(x) = sup{P(X* > x) : IIXII* < 1}.
By stopping at the first time, XXI > x, we see that a(x) < 1/x and a(x)a(y) > a(x + y). Taking logs gives log a(x) + log a(y) > log a(x + y), and one easily concludes from this that lim sup 1 log a(n) = inf 1 log a(m) < 0, R- D
n
m>1 m
proving the result with a constant that is not explicit, but is trivially the best possible.
7.7 The Garnett-Jones Theorem In this section, we will determine which martingales in -4.lf(9 are almost bounded, or, to be precise, we will find the closure of .,#°° in
The solution
to this problem is again the probabilistic analogue of a previously proven analytical result, and the analytical result can be recovered from the probabilistic one. To state the result, we first need some notation. Let ao(X) be the supremum of the set of all a so that supE(ealXw_XTIIT < oc) < oc
T
where the sup is taken over all stopping times (by (6) in Section 7.6, ao(X) > (eII X II*) ' > 0). Intuitively, ao(X) is the exponential rate at which P(I XX - XTI > .) goes to zero for the worst choice of T. Given this interpretation, it should not be surprising that ao(X) can be used to measure how well an X in M MO can be approximated by a Ye.#°°. (1)
There are constants c, Ce(0, oo) such that c
ao(X)
< inf IIX - YII* < Y.M-
C ao(X)
and, consequently, the closure of .,lf°° in R.&(9 is {X : ao(X) = oc }.
212
7
H' and BMO.. #' and 4U.wtV
Remark: This result and the proof we will give below are due to Varopoulos (1979).
Proof The inequality on the left is an easy consequence of the John-Nirenberg
inequality. If YeA' and IIX - Y I* = a, it follows from (5) in Section 7.6 that if Z = X - Y and a > 1/e, then E(e°ho_ZTlla) < C,,P(T < oc), so
P(I Za, -ZTI > 2) < Let Ao = II YII, If K > 2 and A > K.,, Then
P(IX(-XTI>2)
e-Qa(K-2)/KaP(T
< Ca
< oc).
At this point, we have shown that ao(X) > a(K - 2)/Ka. Letting a 1/e and K-> oo gives that ao(X) > Ilea, so the left-hand inequality in (1) holds with c = 1/e (the constant is inherited from the John-Nirenberg inequality). To prove the other inequality is more difficult. Given a large a and a martin-
gale Xe.A.J1 with ao(X) > a, we need to construct Ze.#' with IIx - Z Ih < C/a. This construction is accomplished in two stages. First, we construct an approximating martingale of the form described in Example 4 of Section 7.5, but we are forced to take 2 large to make 0 < 1, and we end up with a M'#6 norm that is too large. Then we use an ingenious construction, due to Varopoulos, to introduce a sequence of stopping times that smooths the transitions
between the times constructed in our first attempt and reduces the M.# C norm to the right size. The first part of the construction is straightforward. We let 2 be fixed and define R. inductively by letting Ro = 0 and Rn = inf{t > Rn_1 : I X(t) - X(Rn_1)I
> Al for n >- 1. If we let n = X(R,,) - X(Rn-1) on {Rn < oo} and Z =
Y-, S. 1(Rn
).
Z, will differ from X, by a martingale that is bounded. To see this, observe that when Rn < t < Rn-1,
X,-Z,=X,-E(> m1(Rm
= X, - X(Rn) +
E(
Y_
\m=n+1
Sm1(Rm
From the definition of the R, it follows that I X, - X(R,,) I < A. To get a bound on the other term, observe that if supE(e2 a_XTII T < oo) < K, T
7.7
213
The Gernett-Jones Theorem
then P(Rm+1 < oo I Rm < oc) < K/eau Therefore, if Ke -a l < 1, then (*)
E( m=n+1 "0
Sm1(Rm<W)I/
< 1(1 - Ke
ax)-1
JF
and we have Y, = X, - Z, c-,#'. Y, = X, - Z, is not the martingale in .,lf°° that we want. From Example 4 in Section 7.5, we see that the 8..110 norm of Z satisfies IIZII* < 2(1 + (1 Ke-az)-1), but this is too big because we have no control over K and, hence, over the choice of 2. To circumvent this difficulty, we use the construction referred to above. Pick 0 < 1 and then 2 so large that Ke-az < e-ax° and (for convenience) 2a = M is an integer. Let y = e-e, let So = 0, and, for n > 0,
1<j<M,let
SfM+j = inf{t : P(Rn+1 < co l.wt) > .
Since P(Rn+, < oo I F(Rn)) <
M-j}.
Ke-a l < yM, we have
R. < SnM+1 < ... < S(n+1)M :!
Rn+1
Let U"
_
I
M
M
Y 1(SnM+j
U" is a staircase that allows us to climb from { U. = 0} = {S"M+1 = oo } to {U" = 1} = {S(n+1)M < oo}. Let 00
M=1
Z, = E(Z' I If Rn < t < Rn+, , then
14 - z;I = E( m=n
m(1(Rm
\m=n+1
<2+2(1
-
Keal)-1
by the arguments used to prove (*) (observe that I m(1(Rm
Z' _
°°
Y
Ym(k)
k=1
1
M
1(Sk<00)'
214
7
H' sod lAW..N' and M.r Or
and we see from Example 4 in Section 7.5 that for all 0 < 1,
IIZ'II* < M(1 +
Letting 0T 1 proves the desired result with C = (2e - 1)/(e - 1) = 2.42. Example 1
Let X, be the martingale in Example 3 of Section 7.5. It is easy to see that ao(X) = log2 = .693, so (1) gives 1
.5307 =
e(.693)
2.42 < inf IIX- Z II* <= 3.49 693 ZEN
The upper bound is hardly informative, since
II X II
s = 9/8 = 1.125, but the
computation suggests the following question I could not answer. In Example 3, do we have
Problem
inf IIX- ZII* = IIXII*?
ZE.R'0D
An even simpler question is: Can you construct an example with this property? With (1) established, we turn our attention to proving the analytical result.
Since this is a thankless job and requires more work than a direct analytical proof (see Garnett and Jones (1982)), we just sketch the details. (2)
The Garnett-Jones Theorem. Let ao(cp) be the supremum of the set of a that has
sup1
exp(alrp-q,iI)d9
where rpr =
f 9 dB. r
There are constants c, Cc(0, oo) such that
< C) ` inf IIq'-'II*a ao((P) C
'P
Proof As in the proof of (1), the left-hand inequality follows easily from the John-Nirenberg inequality, so we leave the details as an exercise for the reader and turn now to the proof of the more difficult right-hand inequality. From the developments above, the plan of attack should be clear. We pass from 9 to U, = Yrp(B,A) = M(p, decompose U, = XX' + X,' according to the construction in the proof of (1), and then let tij(e`B) = E(XX I Bz = e`B) = NXj. From results about the maps M and N defined in Section 7.4, we see that to prove (2), it suffices to show that (3)
ao(T) < Cao(Mtp).
7.8
215
A Disappointing Look at (..N') When p < I
for then it will follow that 11(p - 0III*
CII U - X`II * < ao(U)
` «o(p) The proof of (3), however, is tedious and very similar to the proof of (7) in Section 7.3, so the reader is referred to Varopoulos (1979) for the details. It is inevitable when we use the correspondence between BMO and R.#O (or H" and .,#P) that the resulting inequalities will not be very precise. In the case of (2), an analytical proof gives a much stronger result. As Garnett and Jones (1978) remark, if we norm BMO by is constant}
(which we know from Section 7.4 to be an equivalent norm), then it follows from results of Helson-Szego (1960) and Hunt, Muckenhoupt, and Wheeden (1973) that n
gcLf
III-9II**-2ao((p)
(see Garnett (1980), Section 6). Varopoulos (1980) has shown (see his Theorem 4.2) that there is an analogous result for R.,# !2,
«X-Y>>*<2 «o
1
r inf and that the constant is the best possible. The reader is referred to his paper for the details of this and many other interesting developments (including a probabilistic proof of the Corona theorem).
7.8 A Disappointing Look at (J )) * When p < 1 In this section, we will consider the martingale spaces .,#P, 0
sponding HP space, which in the case 0 < p < 1 was first found by Duren, Romberg, and Shields (1969) (see Duren (1970), pages 115-118). In this section, we will show that this approach is not feasible-there are no nontrivial continuous linear functionals on .,#P, 0 < p < 1. This result, attributed to P. A. Meyer in a footnote in Getoor and Sharpe (1972), is in sharp contrast with the results for dyadic martingales obtained by Herz (1974b). In the dyadic case, a martingale is in (.,'P)* if and only if there is a constant C such that, for all stopping times T, E(I Y,,,, - YTI I FT) < CP(T < oo)1/P. In this section, we use the techniques of Section 7.1 to show that the same result is true for the
Brownian filtration and that, unfortunately, no nonconstant martingale has this property.
216
7
H' ind BMO..N' mod W Vr)
The first step in our study of the martingales in . KP is to obtain an atomic decomposition. The arguments given in Section 7.1 generalize with very little work to the case p < 1. (1)
A martingale A e d#P is said to be an atom if there is a stopping time T such that
(i) A,=Oift
IA,IP
If we let dd(X) = E(X *)P, then it follows from the definition that (2)
If A is an atom, dd(A) = E(A*)P < 1.
(3)
If A" is a sequence of atoms and Yn I cnIP < oo, then as N - oo, XN = Y _N cnAn converges to a limit Xedi", and dd(X) < En IcnIP.
The proof of (4) in Section 7.1 leads to the following : (4)
For all X e #P, there is a sequence of atoms A", n e Z, and a sequence of constants cn, n e Z, such that
(i) asN -oo, _n and (ii) En IcnIP < 2(21/P + 1)Pdd(X).
The proof is the same as the proof in Section 7.1. Let
T"=inf{t:IXI>2"/P} A" _ (X(t A Tn+1) - X(t A T"))/c" cn = (21/P + 1)2' "P(T < oc)1/P.
It follows from computations in Section 7.1 that (i) holds. To check (ii), we observe that dP(>cnA"J < ICnIP
_ (21/P + I)PY2"P(T < oc)
"
_
(21/P
+ 1)P Y 2"P((X *)P > 2") n
< 2(21/P + 1)E(X*)P.
Remark: In the argument above, the fact that p < 1 is important. If p > 1, the triangle inequality gives that cnA"I m-
P
<_ F.lcnl = n
(2"P
+ 1)Y(2"P((X*)P >
2"))1/P,
and the right-hand side > (21/P + 1) (yn 2"P((X *)P > 2"))1/P. Since the spaces .diP, p > 1, are easy to handle directly, we leave it to the reader to figure out how to define atoms for p > 1.
7.8
217,
A Dlwppolnting Look of (A")* Whenp < I
With the atomic decomposition established, it is easy to find the droll of .#P. From results in Section 7.2, if 6p is a continuous linear functional on . NP,
then there is a Ye.,lf2 such that if Xe.,N2, (p(X) = EX. Y.. To see what properties Y must have, we follow (7) of Section 7.1. Let T be a stopping time,
let Z. = sgn(Y. - YT), and let Z, =
Since Z,i 5 1, X, = (Z, ZTAr)/2P(T < oo)l/P is an atom. By a computation in Section 7.1, E I Y. - Y1.I =
E(Z.(Y. - YT)) = E((Z,0 - ZT) Y.), SO
EIY - YTI
E(X,
2P(T < 00)11P'
Taking the supremum over all stopping times, we see that there must be a constant c such that for all T, (5)
EI Y - YTI < cP(T < oo)'IP.
If p = 1, this inequality reduces to (6) of Section 7.1, so on the surface everything is fine. We have a family of spaces, which we might call MP, 0 < p < 1, that generalize R..#D and have (.,NP)* c MP. Unfortunately condition (5) is too strong when p < 1: (6)
If p < 1, then the only martingales Ye 4'2 that satisfy (5) are constant.
Proof The idea of the proof is simple. Suppose that C = 1. If we can find a T such that 0
and if we can pick an > 0 and define another stopping time
T' =
(T 00
if E(I Y.,, - YTI I FT) > E
otherwise
with P(T' < oo) = S, a small number chosen to have PIP-' < E, then we are done, since if (5) holds with c = 1, we have
P(T' < oo)1/P >_ E(I Y. - YT.I) > eP(T' < 00), which implies that 6'IP-1 > e, a contradiction. The technical problems we now face are (a) to find a stopping time with E(I Y. - YTII JFT) * 0 (easy) and (b) to shrink P(T < oo) to an appropriate size. To solve these problems, we observe that since every continuous local martingale is a time change of Brownian motion, it suffices to prove the result when Y, = B,A, and z is a stopping time. Let T. = inf{t : IB,I > a). Since Ye.,N2 and I Y. - Y,I < 2Y*, it follows from Theorem 9.5.4 in Chung (1974) that as E(I Y. - Y(Ta)II _F(Ta)) -'E(IYYII n a>o
.
(Ta)).
The right-hand side is E I Y. I (exercise), but we do not need this. The expectation
218
7
H' ud DM0, .N' rnd A,#&
of the limit is I Y,,,1, which is > 0 if Y * 0 (recall that YY = E(YY IA,)), so it must be positive with positive probability, and hence, if a is small enough and T = Ta, then E(I Y. - YTII.`FT) # 0, and we have accomplished (a). Since T' has a continuous distribution, the solution of (b) is trivial. We let I
T, - IT 100
if E(I Y. - YTII FT) otherwise
and 0 < T < A
and vary A to make P(T' < oo) the right size.
Remark: The last part of the proof obviously breaks down for dyadic martingales. In that setting, if you want a fixed value for the stopping time, say T = 1, then the probability of taking on that value cannot be arbitrarily small. It is this curiosity that allows nontrivial examples on the dyadic filtration.
8 PDE's That Can Be Solved by Running a Brownian Motion
A Parabolic Equations In the first half of this chapter, we will show how Brownian motion can be used to construct (classical) solutions of the following equations: U, = 20u
ut=2Au+g u,=ZOu+cu in (0, oo) x R" subject to the boundary condition: u is continuous in [0, oo) x Rd and u(0, x) = f(x) for x e Rd. The solutions to these equations are (under suitable assumptions) given by Ex(f(B,))
J(t E(f(B)+-s,BS)ds) \
E. (f(B) exp (f o
c
(BS) ds)
-
b(BS) dB, fo I b(BS) 2 dso /I \J 2 In words, the solutions may be described as follows : E. .f(Br) eXp
(i) To solve the heat equation, run a Brownian motion and let u(t, x) _ Ex.f (B,)
(ii) To solve the inhomogeneous equation u, - ZAu = g, add the integral of g along the path. 219
221)
8
PDE's That Can Be Solved by Running a Brownian Motion
(iii) To introduce cu, multiply f(B,) by exp(f o c(B5) ds) before taking expected values. In more picturesque terms, we think of the Brownian particle as having mass 1 at time 0 and changing size according to m' = c(B,) m, and when we take expected values, we take the particle's weight into account. (iv) To introduce b Vu, we multiply f(B,) by what may now seem to be a very strange-looking factor. By the end of Section 8.4, this factor will look very natural, that is, it will be clear that this is the only factor that can do the job. In the first four sections of this chapter, we will say more about why the expressions we have written above solve the indicated equations. In order to bring out the similarities and differences in these equations, we have adopted a
rather robotic style. Formulas (2) through (6) and their proofs have been developed in parallel in the four sections, and at the end of each section we discuss what happens when something becomes unbounded.
8.1
The Heat Equation In this section, we will consider the following equation:
(1)
(a) u, = 'Au in (0, oo) x Rd (b) u is continuous in [0, oo) x Rd and u(0, x) = f(x). The equation derives its name from the fact that if the units of measurement are chosen suitably and if we let u(t, x) be the temperature at the point x c Rd at time t when the temperatures at time 0 were given by f(x), then u satisfies (1).
The first step in solving (1), as it will be many times below, is to prove : (2)
If u satisfies (a), then M, = u(t - s, B.) is a local martingale on [0, t).
Proof Applying Ito's formula gives s
- u,(t - r, B,) dr
u(t - s, BS) - u(t, Bo) = 0
+ JVu(t - r,B,)dB, 0
Du(t - r, B) dr,
+2 0
which proves (2), since - u, + i Au = 0 and the second term is a local martingale.
If we now assume that u is bounded, then M, 0 < s < t, is a bounded martingale. The martingale convergence theorem implies that as s T t, M, converges to a limit. If u satisfies (b), this limit must bef(B), and since MS is uniformly integrable, it follows that
Ms = E,,(f(B,)I3 ) Taking s = 0 in the last equation gives us a uniqueness theorem.
8.1
(3)
221
The Nest Equation
If there is a solution of (1) that is bounded, it must be v(t, x) = Exf(B,).
Now that (3) has told us what the solution must be, the next logical step is to find conditions under which v is a solution. It is (and always will be) casy to show that v is a "generalized solution," that is, we have (4)
Suppose f is bounded. If v is smooth (i.e., it has enough continuous derivatives so that we can apply Ito's formula in the form given at the end of Section 2.9), then it satisfies (a).
Proof /The Markov property implies that Ex(f(B,)IJ1s) = EB(S)(f(Bt-s)) = v(t - s, B.).
Since the left-hand side is a martingale, v(t - s, BS) is also. If v is smooth, then repeating the calculation in the proof of (2) shows that
v(t-s,B.)-v(t,B0)= fo (-vt+ZAv) (t- r, B,) dr 5
+ a local martingale, so it follows that the integral on the right-hand side is a local martingale. Since this process is continuous and locally of bounded variation, it must be = 0, and hence, - vt + zAv = 0 in (0, oo) x Rd (vt and Ov are continuous, so if - vt + Av 0 at some point (t, x), then it is 0 on an open neighborhood of that point, and, hence, with positive probability the integral is * 0, a contradiction). It is easy to give conditions that imply that v satisfies (b). In order to keep the exposition simple, we first consider the situation when f is bounded. In this situation, the following condition is necessary and sufficient: (5)
If f is bounded and continuous, then v satisfies (b).
Proof (Bt - Bo) d= t1"2N, where N has a normal distribution with mean 0 and variance 1, so if t 0 and x -+ x, the bounded convergence theorem implies that
v(t,
Ef(x + t, 2N) -f(x). The last step in showing that v is a solution is to find conditions that guarantee that it is smooth. In this case, the computations are not very difficult. (6)
If f is bounded, then v e C°° and hence satisfies (a).
Proof We will show only that v c C Z, since that is all we need to apply Ito's formula. By definition, v(t, x) = Exf(B) =
$(2xt)_42e_1x_Y1212tf(Y)dY.
222
8
PDE'. flit ('an Be Solved by Running a Brownian Motion
A little calculus gives Die-Ix-yl2/2t =
-(xi -
Yi)e-Ix-y12/2tt
- Yi)2 Dve-Ix-yl2/2r = (xi - Yi)(x, Dite-Ix-yl2/2r = ((xi
t)e-Ix-yl2/2tt2 Y,)e-Ix-yl2/2rt2
i 5 j.
If f is bounded, then it is easy to see that for a = i or ij,
f
IDe-Ix-yl2/2tl
I f(Y)I dy < o0
and is continuous in R°, so the result follows from our result on differentiating under the integral sign (an exercise at the end of Section 1.10). For some applications, the assumption that f is bounded is too restrictive. To see what type of unbounded f we can allow, we observe that, at the bare minimum, we need Ex I f(B) I < oo for all t. Since I E.If(Br)I = J (2nt)
wee-Ix-yl2/2tIf(y)Idy,
a condition that guarantees this is (*)
IxI 2log+If(x)I -.0
as x- oo.
By repeating the proofs of (5) and (6) above and doing the estimates more carefully, it is not hard to show: (7)
If f is continuous and satisfies (*), then v satisfies (1).
Note: All we have done in this section is rewrite well-known results in a different language. An analyst (see, for example, Folland (1976), Section 4A) would write the first equation as
Btu-Du=0 u(0, x) = f(x)
and take Fourier transforms to get
t) -
t) = 0 =f(e)e-4,M21CI2r
t)
Now KK(x) = (47rt) -d/2 e- 1-"1'/4' has e-4n21C12t,
so it follows that u(t, x) = JKt(x - Y)f(Y) dY,
and we have derived the result without reference to Brownian motion.
8.2
The Inhomogeneous Equstton
223
Given the simplicity of the derivation above, we would be foolish to chiim that Brownian motion is the best way to study the heat equation in (0, cx)) x R. The situation changes, however, if we turn our attention to (0, oo) x G, where G is an open set (e.g., G = {z : IzI < 1}), and try to solve (1')
(a) u, = 'Au in (0, oo) x G (b) u is continuous in [0, oc) x G and u(0, x) = f(x) x e G u(t, x) = 0 t > 0, x e 8G. In this context, the analyst (for example, Folland (1976), Sections 4B and 7E) must look for solutions of the form f (x) exp(A,jt), "separation of variables," and show that the initial condition can be written as
f(x) _
a; f (x).
Proving this even in the special case G = {z: Izi < 1} requires a lot more work than when G = Rd, but for Brownian motion the amount of work is almost the same in both cases. We let T = inf{t : B, t G } and let
v(t, x) = EX(f(B); t < T). Repeating the proofs above shows
(2)
If u satisfies (a), then MS = u(t - s, Bs) is a local martingale on [0, T A t).
(3')
If there is a solution of (1) that is bounded, it must be v(t, x).
(4)
If v is smooth, then it satisfies (a).
(5')
If f is bounded and continuous, then u is continuous in [0, oo) x G and u(0, x)
f(x). If the reader is patient, he or she can also show that (6')
If f is bounded, then v, Div, and Djv, 1 < i, j < d, all exist and are continuous, so v satisfies (a).
Note: v will not necessarily satisfy the other boundary condition u(t, y) = 0 for y e G. We will discuss this point when we consider the Dirichlet problem in Section 8.5.
8.2 The Inhomogeneous Equation In this section, we will consider what happens when we add a function g(t, x) to the equation we considered in the last section, that is, we will study (1)
(a) u, = 'Au + g in (0, oo) x Rd (b) u is continuous in [0, oo) x Rd and u(0, x) = f(x).
224
8 PDE'e Thai ('en Be Sowed by RunIa a Brownlen Motion
The first step is to observe that we know how to solve the equation when g = 0, so we can restrict our attention to the case f 0. Having made this simplification, we will now solve the equation above by blindly following the procedure used in the last section. The first step is to prove (2)
If u satisfies (a), then
M, = u(t - s, Bs) + f sg(t - r, B,) dr 0
is a local martingale on [0, t).
Proof Applying Ito's formula gives
u(t - s, Bs) - u(t, Bo) = fos (-ut +
ZAu)(t - r, B,) dr + fos Vu(t
- r, B,) dB,,
wh ich proves (2), since - ur + Au = -g and the second term is a local martingale.
If g is bounded and u is bounded on [0, t] x R" and satisfies (a), then M 0 S s < t, is a bounded martingale. By the argument in the last section, if u satisfies (b), then
lim M, = fo g(t - s, B,) ds ,Tr
and
g(t - s, Bs) dsl).
M, = Ex o
Taking s = 0 gives Suppose g is bounded. If there is a solution of (1) that is bounded on [0, t] x Rd, it must be //
v(t,x) = ExI
g(t - s,Bs)ds). r,B,)drl.°Fs/
\ fo Again, it is easy to show (4)
Suppose g is bounded. If v is smooth, then it satisfies (a) ax, in (0, oo) x Rd.
Proof The Markov property implies that rg(t -
Ex\J
s
t-s
J(t-r,B,)dr+EB(.,)(fo
l
225
&2 7% I.homoQsn.ou. Equatlon
Since the left-hand side is a martingale, it follows that J3
v(t - s, B) +
g(t - r, B,) dr 0
is also. If v is smooth, then repeating the calculation in the proof of (2) shows that s
g(t - r, B,) dr
v(t - s, Bs) - v(t, Bo) + J0
= f (-vt+ZAv+g)(t-r,B,)dr 0
+ a local martingale,
Again, we conclude that the integral on the right-hand side is a local martingale and, hence, must be = 0, so we have (- vt + 4v + g) = 0 a.e. The next step is to give a condition that guarantees that v satisfies (b). As in the last section, we will begin by considering what happens when everything is bounded. (5)
If g is bounded, then v satisfies (b).
Proof If I g I < M, then rt
g(t - s, Bs) ds
E.
<Mt- 0.
J0
The last step in showing that v is a solution is to check that it is smooth enough. In this case,
v(t,x) = J'dr J(2xs)_2e__2/2sg(t - s,Y)dy, 0
and the normal density (2irs)-d/2e-Iv-XI2/2s
PS(x,Y) _
- oo if x = y and s -+ 0, so things are not as simple as they were in the last section. The expression we have written for v above is what Friedman (1964) would call a volume potential and would write as V(x, t) =
f f
Z(x, t ; , T)
i) d di.
J To D
To translate between notations, set To = 0, D = Rd, Z(x, t ; , T) = Pt-,(x, ),
and change variables s = t - r, y = . Because of their importance for the
226
$
PDE'e That (:nn Be Solved by Ruuing . Brownlen Motion
parametrix method, the differentiability properties of volume potentials are well known. Since the calculations necessary to establish these properties are quite tedious, we will content ourselves to state what the results are and indicate why they are true. The results we will state are just Theorems 2 to 5 of Friedman (1964), so the reader who is interested in knowing the whole story can find the missing details there. (6a)
If g is a bounded measurable function, then v(t, x) is continuous on (0, oo) x R°.
Proof This result follows easily from the bounded convergence theorem. (6b)
If g is bounded and measurable, then the partial derivatives Div = avlax; are continuous, and Div = J' JDP(xY)(t - s,Y)dyds.
Proof The right-hand side is
-
f f ,J
0
(2ns)-d/z (Xi - Yi) e- Ix-vl2/2sg(t S
,J
- s,Y) dY ds
= - JEx[(xi oS
-
B
s/g(t - S, B)]
Although the last formula looks suspicious because we are integrating s-1 near 0, everything is really all right. If lgl < M, then Exl(x1 - B5)g(t - s, BS)l < MEEI x; - BsI = CMs1'2, so
- B8)g(t - S, BS)l < 00
f
0
and it follows from the exercise in Section 1.10 that the partial derivatives Div exist and are continuous. The computations above are not hard to make rigorous, but you should save your strength. Things get very nasty when we take second derivatives. (6c)
Suppose that g is a bounded continuous function and that for any N < oo there are constants C, a e (0, oc) such that l g(t, x) - g(t, y)l < CJx - yla whenever
lxl, lyl, and t < N. Then the partial derivatives Dijv = a2v1ax;axx are continuous, and
Dv =
JDiiPs(xY)(t - s, y) dy ds. Ef
Proof Suppose for simplicity that i = j. In this case, the right-hand side is
227
8.2 The Inhomo$sneoa. Equ.tlo.
ft
f
(2ns)-az
((xi -Z Y,Z - S
)
`
Jo,J
e- i=-yi2/zsg(t - s,Y) dy ds
=
fo'
-sg(t-.s,B.)]tIS,
ExL(xi-BZ)
S
but this time, however, ExI(xt - Bs )2 - SI = sEoI(Bi)2 - 1I so
(x;-B:)2-S
= 00.
S2
We can overcome this problem if f is Holder continuous at x, because we can write [((x;-s2t2- sg(t
E.
- sB)
S
s
=Exl(x`-B2) S
s(g(t-s,Bj -g(t-s,x)))f
The second expression < Cs-1+a, so its integral from s = 0 to t converges absolutely, and with a little work (6c) follows. (See Friedman (1964), pages 10-12, for more details.) The last detail now is : (6d)
Let g be as in (6c). Then 8v/8t exists, and
x) + at (t, x) = g(t,
f'drf
A-,(x,Y)g(r,Y) dY
o
Proof To take the derivative w.r.t. t, we rewrite v as
v(t,x) = JJ'Ptr(xY)frY)'1Y. 0
Differentiating the left-hand side w.r.t. t gives two terms. Differentiating the upper limit of the integral gives g(t, x). Differentiating the integrand gives
fai--
pt-.(X y)g(r, y) dy dr
0
tr lJ
2d(2n(t
-
2
r))-(d+2)12
exp (
c
+
fj
r
+ IX (2n(t - r))-dl2 12(t
2(t
-) I g(r,Y) dy dr
X-Y
- r)I2) exp ( 2I(t -
2d :tdrr -Exg(r,Bt_r)dr+
'
dr
) 2) g(r,Y) dy dr
2N
PDE's'Ib.t Can Ile Solved by Rwtnlni a Brownimn Motion
In the second integral, we can use the fact that Ex(Ix - Bt_,I2) = C(t - r) to cancel one of the t - is and make the second expression like the first, but even if we do this,
' dr
J0t-r This is the difficulty that we experienced in the proof of (6c), and the remedy is the same : We can save the day if g is locally uniformly Holder continuous in x. For further details, see pages 12-13 of Friedman (1964). As in the last section we can generalize our results to unbounded g's. To see what type of unbounded g's can be allowed, we will restrict our attention to the homogeneous case g(t, x) = f(x). At the bare minimum, we need f(B,) ds < oo,
Ex J0
and if we want (b) to hold, we need to know that if t 10 and x
x, then
f0,"
f(B5) ds -* 0.
Ex
If we strengthen the last result to uniform convergence for x c R", then we get a definition that is essentially due to Kato (1973). A function f is said to be in Kd if (*) lim sup Ex (fot I f(B5) I ds) = 0. t4 o
x
By Fubini's theorem, we can write the above as t m sup J kt(x, y) I .f(y) I dy = 0,
where J(27s)_d/2e_k_Y12/2sds.
kt(x,y) = 0
By considering the asymptotic behavior of kt(x, y) as t -+ 0 and we can cast this condition in a more analytical form as
(**) limsupJ a. o
x
w(Ix-yl).f(y)dy=0, Ix-Y1
where
-log Izl
d>3 d=2
1
d= 1.
IZI
(P(z) _
(d z)
This is Theorem 4.5 of Aizenman and Simon (1982). The details of the proof,
8.3
The Feynman-Koc Formula
229
though simple, are a little tedious, so they are left to the reader. We will have more to say about these spaces at the end of the next section. For the developments there, we will also need the space K,"', which is defined in the obvious way: fcKK°° if for every R < co, fl(lxl
8.3 The Feynman-Kac Formula In this section, we will consider what happens when we add cu to the right-hand side of the equation we considered in Section 8.1, that is, we will study (1)
(a) u,='Au+cu in (0,co) x Rd (b) u is continuous in [0, co) x R' and u(0, x) = f(x).
(2)
If c(x) < 0, then this equation describes heat flow with cooling. As in Section 8.1, the solution u(t, x) gives the temperature at the point x c Rd at time t, but here we do not assume that there is perfect conduction of heat. Instead, we assume that heat at x dissipates at the rate k(x) = -c(x). We will see below that this corresponds to Brownian motion with killing at rate k, that is, the probability that a particle survives until time t is exp(-$ok(B,)ds). The first step in solving (1) is to prove: If u satisfies (a), then S
Ms=u(t-s,Bs)exp( c(B,)dr) \\ o
ff
is a local martingale on [0, t).
Proof Let c, = f o c (Bs) ds. Applying Ito's formula gives that u(t - s, BS) exp(cs) - u(t, Bo)
- ur(t - r, B,) exp(cr) dr +
Jexp(c,)Vu(t - r, B,) dB,
_ f s0
+
Ju(t - r, Br) exp(c,) dCr + 2
JAu(t - r, Br) exp(cr) dr,
0
which proves (2), since -u, + cu +
ZAu = 0 and the second term is a local
martingale.
If c is bounded and u is bounded on [0, t] x Rd and satisfies (a), then M, 0 < s < t, is a bounded martingale, so by an argument we have used in the last two sections lim MS = f(B,) exp(c,) STt
and
00
8 PDE's That ('en Be Solved by Rwol" Brownien Motion
M. = so taking s = 0 gives (3)
Suppose that c is bounded. If there is a solution of (1) that is bounded on [0, t] x Rd, it must be v(t, x) = EX(f(B,) exp(ct)).
As before, it is easy to show (4)
Suppose that c is bounded. If v is smooth, then it satisfies (a) a.e. in (0, oo) x Rd.
Proof The Markov property implies that EE(.f(Bt) exp(c) l.9s) = exp(cs)EB(S)(f(Bf-s) exp(ct-s)),
so if we let v(t, x) = EE(f(B,) exp(c)), then the last equality shows that v(t - s, Bs) is a martingale. If v is smooth, then repeating the calculation in the proof of (2) shows that
v(t - s, B) exp(cs)- v(t, Bo)
=
s
(- v, + cv + Z Av) (t - r, B,) exp (c,) dr + a local martingale.
J0
so again, we conclude that the integral on the right-hand side is a local martingale and, hence, must be - 0, so we have that - vt + cv + 1 Av = 0 a.e. The next step is to give a condition that guarantees that v satisfies (b). As before, we begin by considering what happens when everything is bounded. (5)
If c is bounded and f is bounded and continuous, then v satisfies (b).
Proof If Icl < M, then e_Mt < exp(c) < e", so exp(ct) --*I as t - 0. Since f is bounded, this result implies that E, exp(ct)f(B) - EE.f(Bt) - 0, and so the desired result follows from (5) in Section 8.1. This brings us to the problem of determining when v is smooth enough to be a solution. To solve the problem in this case, we use a trick to reduce our result to the previous case. We observe that exp
(f c (Bs) ds) = 1 + Jc(Bs)exP(,fc(Br)dr)ds fJ
so taking expected values gives that
v(t, x) = I + JEx [c(Bs)exp(jtc(Br)dr)f(Bt)]ds. Conditioning on .mss and using the Markov property, we can write the equation above as
8.3
231
The Feyameo-Kec Formula
v(t, x) = 1 +
JExc(Bs)v(tsBs)ds. 0
The second term on the right-hand side is of the form considered in the list section. If we start with the trivial observation that if c and f are bounded, then v is bounded on [0, t] x Rd, and if we apply (6a) and (6b) from the last section,
we see that v is continuous and the derivatives 8v/8x; are continuous. This implies that, for each N, I v(t, x) - v(t, y)I < CI x - yI whenever IxI, IyI, and t < N. If we assume that c is bounded and locally Holder continuous, then it follows from (6c) and (6d) that we have (6)
Suppose that f is bounded. If c is bounded and locally Holder continuous, then v is smooth and, hence, satisfies (a). As in the last two sections, we can generalize the results above to unbounded
c's. Given the formula above, which expresses v as a volume potential, it is perhaps not too surprising that the appropriate assumption is c E Kd. The key to working in this generality is what Simon (1982) calls Khasmin'skii's lemma: (7)
Let f >_ 0 be a function on Rd with
a = sup Ex (f,f(BS) dsl < 1. J
x
T hen sup E. exp
(f f(BS) dsl < (1 - a)-1.
x
Proof The Markov property and nonnegativity off imply that sup Ex X
f ... j' dsl ...
f(Bsn) < an,
so the desired result follows by noticing that
J...J
Jo...
nl
Jo
and summing on n. From the last result, it should be clear why assuming c c Kd is natural in this context. This condition guarantees that sup E. ( Jt I c (B.) Ids) X
/
o
-0
and, hence, that sup Ex exp x
\ fo, I c (
BS) I
ds) J
- 1.
232
s
PDE'r Tbat Can B. 9olwd by Ruing Brownian Motion
With these two results in hand, we can proceed with developing the theory much as we did in the case of bounded coefficients. Since Simon (1982) has written a lengthy and very readable account of how to develop the theory in this generality, we will content ourselves just to briefly describe a few of the results given in his paper. Part of our motivation for doing this is to establish the connection
between the notation we use and the way in which mathematical physicists write things. To make it easy for the reader to find the results in Simon's paper, we have used his theorem numbers below.
Let Ho = - A and V = - c z
H=Ho+V=-A+V, and define a linear operator e- 1H by setting
(e-`Hf) (x) = E. (exp
(_
, V (BS) ds).f(B,) I .
J
THEOREM B.1.1 Let V- E Kd and V E KK'0c. Then for every t> 0 and p < q < co, e-`H is bounded from L° to L.
Proof Since the proof relies on some things that we have not explained above, we simply sketch the proof and refer the reader to Simon (1982) for details. Let Ile " 111,9 denote the norm of e-`H as a map from L" to L. Step 1: p = oo, q = eo. The Feynman-Kac formula shows that if t is small, then
e 1H A. < Cll.fll., so the semigroup property e-(s+t)H = e-sHe-H imples that lle-t"11.0 m < CeAt,
where A = T -'In C. Step 2: p = 2, q = co. Using the Cauchy-Schwarz inequality in the FeynmanKac formula gives e-tH fl < (e-t(Ho+2V)1)1/2(e-uHol fI2)1/2.
Applying Step 1 to Ho + 2 V gives e-t(Ho+2v)Ih.0 < C'e' `,
and an easy estimate shows that e-tH09lh = sup J(2xt)_2pt(x,y)g(y)dy z
< (2nt)-112II9ll1,
8.3
233
The Feynmra-Kac Formula
so
IIe-tH fhlm < C"IIf112
Step 3: p = 1, q = 2. Since
1,2 =
Ile-1H11
2.o0
Step 4: p = 1, q = oc. By the semigroup property, lie-
W111'. G
Ile-IH12II1,2Ile-1H/2II2..
Step 5: Steps 1 and 4 show that a-` is bounded from L°° to L°° and from L' to L. The result now follows by "duality and interpolation." The next result gives another reason why the spaces Kd are well suited for studying Shrodinger semigroups. PROPOSITION B.1.4
Ile-`"II,,.,, = 1, then VEKd.
If V :!g 0 and
Proof This is an easy consequence of Jensen's inequality and is left as an exercise for the reader. Theorem B.1.1. shows a-`" maps L°° into L. With a little work this can be improved considerably. THEOREM B.3.1
Let V- e Kd, V+ e Kd10'. If f E L°°, then a-tH f is a continuous
function. THEOREM B.7.1
e-nHf(x)
Let V- e Kd, V+ E Kd ' . Then
=
,
where a-"(x, y) is jointly continuous in x, y, and t in the region t > 0. Let V- eKd, V+ e Kd"'. If f e L°°, then, for any t > 0, e-tH f has a distributional gradient in L'10; THEOREM B.3.4
To get more smoothness, one has to assume more boundedness. Suppose for simplicity that d > 2 and, for 0 < a < 2, let Kd be the set of all functions that satisfy :
(i) sup JIX X
< oo if a # 1
1)
Ix -
(ii) limsup r4o
yI-(d-2+a)I f(y)I dy
x
D(x,r)
yI-(d-2+a)I f(y)I dy
= 0 if a = 1.
234
$
PDE'e 111.t ('Mn Be Solved by Running is Brownian Motion
Let a < 2. Let V E Kd, V+ E Kd'°°. Suppose that the restric-
THEOREM 18.3.5
tion of' V to some hounded open set G lies in Kd. If feL°°, then for each t > 0, e- "`feC°(G) = the set of functions whose derivatives of order [a] are Holder continuous of order a - [a].
Remarks: The fact that V is not supposed to be smooth restricts the last
result to a < 2. The reader should also observe that by writing e-` = e-`12 e-`12 and applying Theorem B.1.1, we can conclude that the results above hold if f E L°° is replaced by f E UP1 L°.
Inspired by the work of Feynman (1948), Kac (1949) proved the first version of what is now known as the Feynman-Kac formula. He proved his result in d = 1 for potentials V = - c, which are bounded below, by discretizing time, passing to the limit, and ignoring a few details along the way. Rosenblatt (1951) extended Kac's work to d>- 2 and filled in the missing details (e.g., Note :
Holder continuity is needed if one wants the solution to be C 2). Since that time,
there have been a number of papers extending the result to more general processes and potentials. The results we have mentioned above are only a small sample of what is known. If the reader would like to see more examples of how
probability can be used to study these problems, he should look at McKean (1977), Berthier and Gaveau (1978), and at recent work of Carmona and Simon. Perhaps the best place to begin is Simon's (1982) survey paper.
8.4 The Cameron-Martin Transformation In this section, we will consider what happens when we add b . Vu to the righthand side of the equation considered in Section 8.1, that is, we will study: (1)
(a) u
x Rd and u(0, x) = f(x).
in
Physically, the extra term corresponds to a force field. In this section, we will see that the probabilistic effect is to add an infinitesimal drift b to our Brownian motion. The first step in solving (1) is to prove: (2)
If u satisfies (a), then MS = u(t - s, BS) exp
\Jo (f
S
1
b(B,) dB, - 2 JI1'(8r)I2th)
is a local martingale on [0, t).
Proof/ Let ZS =f o b(B,) dBr - i f o I b(Br) 12 dr. Applying Ito's formula to J (x0 , x 1 , ... , xd+ 1) = U (t - x0 , X1, ... , xd) exp (xd+1)
X°=s,X. =Bsfor1
8.4
The Cameron-Martin Trandormitloo
239
gives:
u(t - s, B) exp(Z9) - u(t, BO) = f - u, (t - r, B,) exp(Z,) dr o
+ J exp(Z,)Vu(t - r, B,) dB, 0
+ f u(t - r, B,) exp(Z,) dZ, 0 s
+2
J0
Au(t - r, B,) exp(Z,) dr
Jsiut - r,B,)exp(Z,)d,
+ i=1('
+ 2 f s u(t - r, B,) exp(Z,) d
To check this result, observe that the first three lines are the terms involving
first derivatives of f. The last three lines are the terms with D;jf where (a)
l
d + 1, respectively. The terms with i or j = 0 vanish, because X° is locally b.v. Applying the associative law and the formula for the covariance of two stochastic integrals to the mess above gives - u, (I - r, B,) exp(Z,) dr + 2 local martingales 0 fs
+ Ju(1 - r,B,)exp(Z.)(-zI b(B,)I2)dr °
Liu(t - r, B,) exp(Z,) dr
+2 0
s
jDtu(t_rB,)exp(Zr)bi(B,)dr
+ i=1
0 s
+
u (t - r, B,) exp (Z,) I b (B,) 12 dr. 0
2
The third and sixth terms cancel, and if u satisfies (a), the sum of the first, fourth, and fifth is 0, proving (2). At this point, the reader can probably anticipate the next step. (3)
Suppose that b is bounded. If there is a solution of (1) that is bounded on [0, t] x Rd, it must be
v(t, x) = E.(f(B,) exp(Z)).
2M
N
PDE's That Con N. Solved by Rundni
Brownian Motion
This time, however, we cannot simply define our problems away, because Xs = f o b(B,) dB, may be unbounded. Let T. = inf{t : IXtI > n}. The exponenis a local martingale. So tial formula implies that exp(ZZ) - exp(Xt i<X>) observing that <X>, z 0 and stopping at s A T. gives
Eexp(ZSAT) = Eexp(Z0) = 1.
Letting n - oc now and using Fatou's lemma shows Eexp(Z5) < 1, that is, if we let Y = exp(Z5), then Y is an Ll bounded martingale. Applying the last result with b replaced by 2b gives 1
>Eexp(2 f Sb(B,)-dB,-4.1 f lb(B,)I2dr) \\
/
2 ,J o
,10
> exp(-sb*)EY2, where b* = suplb(x)I, and we can use the martingale convergence theorem to conclude that if u satisfies (1), then lim M, = f(B) exp(ZZ) stt
and
M. = EE(f(B)eXp(Z,)I.), so taking s = 0 proves (3). There are many other ways to prove (3). Let
Exercise
Xs= 0
and observe that <X>s = J S I b (B,)12 dr < Cs, 0
so it follows from Levy's theorem (see Section 2.11) that
E (sup exp (Xs)) < E (exp ( sup B.) o<s
l
0<s5b't
1
and the right-hand side is finite since we have (see Section 1.5) Po (B* > a) = 2Po (Bt > a)
for a > 0
Note: This proof was originally in the text. I would like to thank Tom Liggett for pointing out the simpler proof used above. As before, it is easy to show the following: (4)
If v is smooth, it satisfies (a) a.e. in (0, oc) x Rd.
Proof The Markov property implies that Ex(.f(B) exp(ZZ)I. ) = exp(ZS)EB(s)(.f(Bt-s)exp(ZZ-s)),
8.4
237
The Cameron-Martin Transformation
so if we let v(t, x) = Ex(f(B,) exp(ZZ)), then the last equality shows that v(t - s, Bs) is a martingale. If v is smooth, then repeating the calculation in the proof of (2) shows that v(t - s, Bs) exp(ZS) - v(t, Bo)
=
fS
0
(- v, + b - Vv + Av) (t - r, B,) exp(Z,) dr + a local martingale. ?
Again, we conclude that the integral on the right-hand side is a local martingale and, hence, must be -0, so we have -v, + b - Vv + zAv = 0 a.e. The next step is to give a condition that guarantees that v satisfies (ii). As before, we begin by considering what happens when everything is bounded. (5)
If b is bounded and f is bounded and continuous, then v satisfies (b).
Proof Let Y = exp(Z,). As t - 0, Y,-+ 1 almost surely, and we have Ex Y* < cc. Since f is bounded, it follows from the dominated convergence theorem that
EE.f(B) exp(Z,) - Exf(B,) - 0, and so the desired result follows from (5) in Section 8.1. This brings us last, but not least, to the problem of determining when v is smooth enough to be a solution. As you might guess by extrapolating from the last three sections, this is a very difficult problem. The reader is invited to think about how he might try to solve this problem. We do not know a very simple way of doing this, so we will put off consideration of this point until Chapter 9, when we will confront the problem in a more general situation. Having skipped smoothness, the last item on our outline is: What happens if b is unbounded? To answer this question and to prepare for developments in Chapter 9, we will look at our solution through the eyes of Cameron and Martin (1949). Let P. be the measure on (C, ') that makes the coordinate maps B,(co) = ws a Brownian motion starting at x, and define a new measure on (C, by setting
Q.(A) = f
dPx
for A e
,
JA A
where Ib(BS)12ds.
2 fo
f o b(B5) dB, is a local martingale with <X> = fo Ib(B,)I2 ds, it follows from the exponential formula that exp(Z,) is a local martingale/Ps. Let T = inf{t : IZ,I > n}. By stopping at T. A t and letting n -+ oo, we see that Since X,
Q. (C) = E,, exp Z, < 1.
In some situations (e.g., b bounded), we will have Q.,(C) = 1. The next result shows that when this occurs, Q,, makes the coordinate maps behave like a Brownian motion plus a drift.
2,3$
B
(6)
If QS(C) = I, then under Qx
PDE's That ('en Be Solved by Running a Brownian Motion
W, = B, - Jb(B5)th 0
is a Brownian motion starting at x.
Proof Let W' be the jth component of W, Our first goal is to prove that W' is a one-dimensional Brownian motion starting at x;. The first step in doing this is to observe that it suffices to show that if for all 0, U = exp(iO W' + 02t/2) is a local martingale under QX, because then EQX(exp(i0(W' + 02t/2))I.) = exp(iOW' + 02s/2), and
exp(-02(t - s)/2).
EQx(exp(i0(W' -
In other words, W' - WS' is independent of J and has a normal distribution with mean 0 and variance t - s. By (2) from Section 2.13, U is a local martingale/Qx if and only if U exp(Z,) is a local martingale/Px. Unscrambling the definitions gives
U exp(Z,) = exp (i0 (B?
- f" b'(BS) ds) + 02 t 1 o
,1
- 12
exp\fotb(B.) dBs
J
fo
where
C, = ioBi + f b(B.,) dB,, 0
and
D, =
- 02t
2
+ i0
b'(BS) ds + 0
1
2
`
I b(BS)I2 ds.
fo
Now, if we let X, = fo b(BS) dB, then
, = t, , = Jbi(Bs)ds ,
and <X >=
' t 0
,
so D, = i
8.4
239
The Cameron-Martin' 1'ranaformatlon
The computation above shows that each component WJ is u local martingale. To complete the proof, we observe that (5) in Section 2.8 implics that <W', W'5, is the same under Q as it was under P,,, so <W`, WJ>, = t1Ut. The desired result follows from the characterization of multidimensional 1rownian motion given in Section 2.12. (6) shows that under Qx, the coordinate maps, which we will now call X,((o) = co, satisfy
X,=W+ f
b(Xs)ds
,I0
where W is a Brownian motion, an equation that we can write formally as (*) dX, = dB, + b(X,) dt
(here, to facilitate comparison with the results above, we have replaced the W (for Wiener process) with the letter B, which we usually use to denote Brownian motion). There is an obvious connection between (*) and (1): (2')
If u satisfies (a) and X satisfies (*), then M, = u(t - s, XS) is a local martingale on [0, t).
Proof Applying Ito's formula gives s
s
-u,(t - r, X,) dr +
u(t - s, Xs) - u(0, Xo) =
Vu(t - r, X,) dX, J0
0 s
+2Y_
D;ju(t-r,X,)d<X`,X'>,.
ii fo
Since <X`, X'>, = 6,,r, it follows that the right-hand side
f
- u, (t - r, X,) dr + a local martingale
o S
d
+Y f i=1
D,u(t - r, X,)b`(X,) dr
0
+ 2 J Lu(t - r, X,) dr, 0
which proves (2') since - u, + b Vu + 'Au = 0. From the discussion above, it should be clear how to approach the problem of solving (*) and (1) when b is only locally bounded. Let b" = bl(jxjs"). Since b" is bounded, we can solve (*) and get a process X" that satisfies the original equation for t < T" = inf{t : IXI > n}. The measures µ" on (C, W) that give rise to X" have the property that if m < n, then it,, and It agree on .F (T.), so we can let n co and construct a process that solves the original equation for t < T. =
240
8
PDE'. flit ('en Be Solved by Running
Browden Motion
lim T". When 17:,, (x) } has positive probability, we say that the process explodes. The next result gives a simple condition that rules out explosion. (7)
If x b(x) < C(1 + lxl2), then the process does not explode starting from any
xeR°. Proof Let b", X", and T" be as above. Let g(x) = 1 + lx12 and, to ease the notation, let Y = X,". Applying Ito's formula ula gives
9(Y)-9(Yo)=Y J2YsdYs+1Y2d
0
= a l ocal martingale +
f 2 Y`b.'(Y) ds + td, 0
and our hypothesis implies that Y J2Ysib(Ys)ds < 2C f g(Y)ds, 0
so applying Ito's formula to exp(- (2C + d)t)g(Y) gives
exp(-(2C + d)t)g(Y) - g(Y0) = a local martingale
+Y f exp(-(2C +d)s)2Ysb;,(Y)ds o
i
+
f
-(2C + d)exp(-(2C + d)s)g(Y) ds
0
+ 2 i f exp(-(2C + d)s)2d
The inequalities above show that the sum of the last three integrals is :50, so S, =- exp(-(2C + d)t)g(Y) is a local supermartingale, and since S, > 0, it is a supermartingale. Applying the optional stopping theorem at time T. A t now shows that
Exexp(-(2C + d)t)9(YT.A) < 9(x), so
PP(T
t)
exp( l
+ n d)t) 9(x),
and the desired result follows immediately.
Remark: (7) implies that if b(x) = lxlav where v 0 is some fixed vector, then the process does not explode if S < 1. We will see in a minute that it does explode if S > 1, so the condition above is sharp. Perhaps the best way to get a feel for the properties of solutions of (*) dX, = dB, + b(X,) dt
8.4
The Cameron-Martin Trwformation
241
is to consider the one-dimensional case. In this case, the analysis is simple because we can find a function cp that makes cp(X,) a local martingale (thiN is the "natural scale"). To see what p to choose, use Ito's formula to conclude thin
w(X)-w(Xo)=fo 4'(Xs)dxs+2 Jco"(X5)ds = a local martingale + fo cp'(XS)b(X9) + (p"(X,) ds, 2 so
if we want qp(X,) to be a local martingale, we must have
gp'(x)b(x) + 2cp"(x) = 0,
that is,
9"(x) = -2b(x)tp'(x) 9'(y) = C exp
(f
y
- 2b (x) dx)
o
cp (z) = B +
exp (
f
Y
- 2b (x) dx dy.
\ o
JoOz
Taking B = 0 and C = 1 in the last expression, we get a function that is very useful in studying the behavior of X. We have used 9 to try to remind you of the
results in Sections 1.7 and 3.1. If it did, you should have no trouble with the following exercises.
Exercise 2 Let T, = inf{t : X, = c}. Then if a < x < b, PX(Tb < T.) = (P(x) - (P(a) (p(b) - cp (a)
Exercise 3 Since 9 is increasing, 9(oo) = limX1. p(x) and cp(- oo) = limsl-. rp(x) exist. Show that X is recurrent (i.e., PX(Ti, < oo) = 1 for all x and y) if and only if tp(oo) = oo and q,(- oo) oo. To see what this means in a concrete case, consider b(x) = Clxla for Ixj > 1, and b(x) = 0 otherwise. In this case, qp(z)=
JZexp(_ fi
so when S > -1, tp(oo) < oo, and when S < -1,
cp(z)-zexp(- f
\
as z -+ oo. In the critical case 6 = -1,
242
N
PDE's 71st ('rn Be Solved by Running s Brownian Motion
cp(z) = 1 +
JexP(_2cloY)dY , 1
sotp(oo)= oo if and only if2C< 1. The last exercise shows that X is recurrent if and only if (p(- oo, oo) _ (- oo, oc). If we think about the results of Section 2.11, this conclusion is obvious. 9(X,) is a time change of Brownian motion run for a random amount of time T. If qp(- oo, oo) # (- oo, oo), then this time must be finite, whereas if gyp(- co, oo) = (- oo, oo), it must be infinite. By looking at the scale function, we can also tell exactly when the process will explode.
Exercise 4 A Special Case of Feller's Test. Let T.,, be the explosion time defined above. Either PX(T. = oo) = 1 or PX(T.. < oo) = 1, depending on whether or not J
0
(1P(x) - (o(-cc))dco(x) = oo =
f
((p(oo) - 9 (x))dcp(x)
o
Solution:
The key to the proof of (7) was the fact that
S, = (1 + XX2)exp(-A,t) > 0 is a supermartingale if A is sufficiently large. To get the optimal result on explosions, we have to replace 1 + x2 by a function that is tailor-made for the process. A natural choice that "gives up nothing" is a function g >- 0 that makes e-rg(X,) a local martingale. To solve this problem, it is convenient to look at things on
the natural scale, that is, let Y, = 9(X,) and find f = g 9-1, for then when we apply Ito's formula, the second term on the right is a local martingale:
e-f(Y)ds +
e-`.f(Ii) -f(Y0)
s
fo
+ Je8f"(}c)d
To evaluate the third term on the right, we observe that
2
Y - Y. = J gp'(XS) dX + bounded variation, 0
so
Combining this result with other observations above, we see that e-`f(Y) _ e-`g(XX) is a local martingale if and only if
zf"(x)gp'(x)2 -f(x) = 0.
8.4
The Cameron-Martin Transformation
243
To solve this equation, we iterate. Let .fo
.f"(x) =
f s d(p(x)
f'd(p(y)f._jy)
0
0
and let 00
f= n=o I.f" It is easy to show that f" < (f1)"/n !, so the series converges and has 1 + f1 <_ f :r'
exp(f1). If the solution f that we construct -oo as we approach either end of cp(- oo, oc), then we are done, because then we can let [a", b"] T tp(- oo, oo), let
T. = inf{t : X, 0 [a", b"] }, and apply the optional stopping theorem at time T" A t to conclude that if x c [a", b"], then
P.(T"
etf(x) .f(a") n f(b")
-0
as n - co (generalizing the inequality in the proof of (7)), so there is no explosion. On the other hand, iff(x) stays bounded as, say, x T 9(oo), we are also done, for then we let t = inf{t : XX = 0}, let 0 < x < oo, and apply the optional stopping theorem at time t A T" to conclude that
1 < g(x) =
1+
=1+ Letting n
< t) < t).
g(oo)EX(e-TAT"; T" g(oo)EX(e-T"; T"
oo, now have
EX(e-T"; T.
< t)1
Ex(e-T. ; T.
< t),
which is a contradiction, unless PX(T.,, < t) > 0, so X explodes in this case.
The last detail now is to relate the behavior of g to the behavior of the integrals above. Going back to natural scale and using the inequality 1 + f1 < f < exp(fl), we see that X does not explode if and only if f1 - oo at both ends of q,(- oo, oo), which is the condition given in the theorem.*
Note: This exercise is from McKean (1969). The result is due to Feller. Now that the one-dimensional case has been discussed in general, the last step is to consider two concrete examples.
Example l The d-Dimensional Bessel Process. In Section 2.10 (see (8) and (9)), we showed that if R, = IB,I, where B, is a d-dimensional Brownian motion, then
Rj-2
r
r
d 0
2
IR51ds= fRs1B5.dB., o
244
8
PDH's 71n1 Can He Solved by Nwainit a Hrownien Motion
and
B;=
Since the drill coefficient here is of the form b(x) = Cx-1, by applying Exercise 3 we see that R, is recurrent if and only if
2C= d - I < 1, that is, d!9 2. The Ornstein-Uhlenbeck Process.
Example 2
b (x) = - ax. In this case there is an amusing way of solving
(*) dX = - aX dt + dB by purely formal calculations: dX _
_ ax +
dt
dB dt
= (_ aeat)X + eat dB dt dt d (eatX) = eat dB dt dt eat dX
eatXX - Xo =
f
eas dBs
0
Xt = e-atX0 + eat
eas dBs. 0
All the calculations above are formal, but it doesn't really matter. Once we know what to guess, it is easy to check that the formula defines an OrnsteinUhlenbeck process
E,,(Xt - X0) = (e-at - 1)x - - axt as t - 0 t
E.(Xt - X0) 2 = ((1 - e-at)x)2 + E.
(e-at
=0(t2)+t+o(t) ast->0.
2
eas dBs 0
so it follows that Xt + f aXs ds is a local martingale and 0
<X>t = t.
The representation given above for the Ornstein-Uhlenbeck process is nice because it makes certain facts about the process obvious.
8.4
The Cameron-Martin Transformation
245
(a) If Xo = x, then XX has a normal distribution with mean a-°'x and vnrinnce (1 - e-z"')/2a, and hence (b) As t -+ oo, X, converges in distribution to normal with mean 0 and variance 1/2a.
Note: The discussion above of Example 2 is based on Section 16.1 of Brciman (1968). Although the first steps in developing the theory in this section were taken by Cameron and Martin (1944b), the formulation given above is due to Girsanov (1960). In our development above, we have basically followed Meyer (1976), but we have also incorporated some material from Friedman (1975) and Stroock and Varadhan (1979).
B
Elliptic Equations In the next three sections of this chapter, we show how Brownian motion can be used to construct (classical) solutions of the following equations:
0 ='Au
0=2Au+g
0='Au+cu in an open set G subject to the following boundary condition: u is continuous
inGandu=fonBG. The solutions to these equations are (under suitable assumptions) given by EF.f(BT)
E.(f(BT) + J g (BS) ds \\ EX (.f(B,) exp
o
\ fo
c(B.)
dsll
where r = inf{t : B, 0 G }. Comparing the solutions above to the solutions given
for the equations in Part A of this chapter shows that (except for a minor modification of the second solution) all we have done is replace t by T. From this viewpoint, the changes made above may seem ad hoc, but if we reverse our perspective and rewrite the first solutions in terms of space-time Brownian motions h, = (t - s, BS) run until time r = inf{s : B9 0 (0, oc) x Ra}, we see that the recipes are exactly the same.
24
N
PDE's 71n1('a.11r Solved by Running s Brownian Motion
8.5 The Dirichlet Problem In this section, we will consider the most classical form of the Dirichlet problem, that is, we will study: (1)
(a) Au=0in(1
_ (b) u is continuous on G and u = f on G. If we let h(t, x) = u(x), then h satisfies the heat equation with h(0, x) = u(x), that is, u is an equilibrium distribution for the heat equation in which the 8G is held at a fixed temperature f (which may vary from point to point). As in the first half of this chapter, the first step in solving (1) is to show:
(2)
Let T = inf{t : Bt 0 G }. If u satisfies (a), then M, = u(B,) is a local martingale on [0,,r).
Proof Applying Ito's formula gives ('
u(B,) - u(B0) = f Vu(BS) . dB., + 1 J Au(BS) ds, 2
o
which proves (2), since Au =_ 0 and the first term is a local martingale on [0, T). If u is bounded and satisfies (a), then MS, 0 < s < T, is a bounded local mar-
tingale, so M., converges to a limit as s T T. If G is bounded, T < oo a.s. (see Exercise 1 in Section 1.7), so if u satisfies (ii), the limit must be f(B,) and since Ms, S < T, is bounded, it follows that MM = E.(f(B,) I A).
Taking s = 0 in the last equation gives (3)
Suppose that G is bounded. If there is a solution of (1) that is bounded, it must be
v(x) = Exf(Bz) As in the first half of the chapter, it is easy to show: (4)
Suppose that G and f are bounded. If v is smooth, then it satisfies (i).
Proof The Markov property implies that on {T > s} EE(f(BjI.9;) = v(Bs) Since the left-hand side is a local martingale on [0, T), it follows that v(B.,) is also.
If v is smooth, then repeating the calculation in the proof of (2) shows that, for S E [0, T),
v(B.) - v(B0) = 2
Av(B,) dr + a local martingale, fo
so it follows that the integral on the right-hand side is a local martingale and, hence, must be 0, so we have Av = 0 in D.
Up to this point, everything has been the same as that in Section 8.1.
8.5
247
The Dirichlet Problem
Differences appear when we consider the boundary condition (b), since it is no
longer sufficient for f to be bounded and continuous. The open set (i Must satisfy a regularity condition. A point y c- 8G is said to be a regular point if Py(T = 0) = 1. (5)
Let G be any open set. Suppose that f is bounded and continuous and y is it v(y). regular point of 8G. If x e G and x -+ y, then
Proof The first step is to show (5a)
If t > 0, then x - Px(T < t) is lower semicontinuous. Proof Px(XSEG` for some se(E,t]) = f pE(x, y)PP(T < t - s). Since y - (T < t - s) is bounded and measurable and
p.(x,y) = it follows from the dominated convergence theorem that (27rE)-12e-Is-yI2/2E
x -. PX (XS e G' for some s E (c, t] )
is continuous for each E > 0. Letting E 10 shows that x --> Px(T < t) is an increas-
ing limit of continuous functions and, hence, by a standard argument, that if
x - y, then lim inf Ps (T < t) > Py(T < t).
If y is regular for G and t > 0, then Py(T < t) = 1, so it follows from (5a) that if x -+ y, then lim inf P,t.(T <- t) >- 1. n-OD
With this established, it is easy to complete the proof. Since f is bounded and continuous, it suffices to show (5b)
If y is regular for G and x,, --> y, then for all S > 0 Ps,, (BL E D(y, S)) -+ 1.
Proof Let E > 0 and pick t so small that PO
(sup
\0 <s
IBSI > 2J
<
Since Px,,(T < t) - 1 as x - y, it follows from the choices above that lim inf Px.(Bt a D(y, S)) > lim inf Px. (T < t, sup I Bt - x
\
o<Ss(
2)
> liminfP, (T < t) - PO sup IBtI >
n-
(O<S!5,
>1-E. Since r was arbitrary, this result proves (5b) and, hence, (5).
Sl 2
24$
x
PDE's That Can He Solved by Ruaaini a Brownies Motion
(5) shows that if G is regular (i.e., every point of 8G is regular), then v(x) = Exf(BT) will satisfy the boundary condition (b) for any bounded continuous f. It is easy to see that there is a converse to this result. Exercise 1 Let G be an open set and let y e 8G have Py(r = 0) < 1 (and, hence, by Blumenthal's zero-one law, Py(r = 0) = 0). Let f be a continuous function on 8G withf(y) = 1 andf(z) < 1 for all other z e 8G. Show that there is a sequence of points x" - y such that lim info-.,) v(xn) < 1.
Hint: To find these points, start a Brownian motion at y and run it until it exits D(y, 1/n). From the discussion above, we see that for v to satisfy (b) it is sufficient (and almost necessary) that each point of 8G be a regular point. This situation raises two questions :
Do irregular points exist? What are sufficient conditions for a point to be regular? In order to answer the first question we will give two examples.
Let d> 2 and let G = D - {0}, where D = {x : Ixj < 1}. If we let To = inf{t > 0: B, = 01, then PO(To = oo) = 1, so 0 is not a regular point of OR Example 1 (trivial)
Example 2 Lebesgue's Thorn. Let d>3 3 and let G= (-1, 1)d - U-n=1 [2-", 2-n-1] x [-an, (See Figure 8.1 for a look at G (1 {x : x3 = ... = Xd = 0}. Younger readers will notice that G is a cubistic cousin of Pac-Man with infinitely many very small teeth.) I claim that if a" j 0 sufficiently fast, then 0 is an]d-1
not a regular point of 8G. To prove this result, we observe that since threedimensional Brownian motion is transient and P0((B, , Bi) = (0, 0) for some t > 0) = 1, then with probability 1, a Brownian motion B, starting at 0 will not hit In = {x : x1 e [2-", 2-n-1], X2 = X3 = = Xd = 01, and furthermore, for a.e. w the distance between {B.: 0 < s < oo } and In is positive. From the last observation, it follows immediately that if we let Tn = inf{t : B, c- [2-", 2-n-1] x
[a.,an]d-1} and pick an small enough, then P0(Tn < oo) < 3-". Now Y' l 3-"= 3-1(3/2) = 1/2, so if we let i =inf{t > 0 : BOG} and u =inf{t > 0 : B, (-1, 1)d}, then we have PO(T< a) :5
PO
n=1
(7'n
so
Po(i > 0) > PO(T = o-) >
2
and 0 is an irregular point. The last two examples show that if G` is too small near y, then y may be
8.5
The Dirlchlet Problem
249
Figure 8.1
irregular. The next result shows that if G` is not too small near y, then y is regular. (5c)
Poincare's Cone Condition. If there is a cone V having vertex y such that v fl D (y, r) c G `, then y is a regular point.
Proof The first thing we have to do is explain what we mean by a cone. In Section 4.1, we defined Va = {(x,y)c- H: I x - 01 < ay,y,
< 1},
an object that might be called a cone with opening a, vertex (0, 0), and direction ed = (0, ... , 0, 1). Generalizing this definition, we define a cone with opening a, vertex z 1, and direction z2 as follows :
Va(zl,z2)={z:z=z1+y(z2+w)where w 122 andllwll
P-,(B,EV(z1,22))=E>0, where a is a constant that depends only on a, so an easy argument shows that if VV(z, z') fl D(z, r) c G` for some r > 0, then
2011
8
PDE'a That Can Be Solved by Running a Brownian Motion
lim inf PZ(B, E G`) >_ e.
40
Combining the last conclusion with the trivial inequality P.(-r < t) > PZ(B, e G`) shows that
PZ(T=0)=limPZ(T
and it follows from Blumenthal's zero-one law that PZ(r = 0) = 1.
The last result, called Poincare's cone condition, is sufficient for most examples (e.g., if G is a region with a smooth boundary). The ultimate result on
regularity is a necessary and sufficient condition due to Wiener (1924). To describe Wiener's test, we would have to define and explain the notion of capacity, so we will content ourselves to state what Wiener's test says about Example 2 above and refer the reader to Ito and McKean (1964), page 259, or Port and Stone (1978), page 68, for details. (5d)
In d = 3, Po(T = 0) = 0 if and only if - oo < Y log (2"a") < oo. n=1
In d > 4, Po(T = 0) = 0 if and only if Y (2"an)d- 3 < oo. n=1
In contrast, Poincare's cone condition implies that Po(T = 0) = 1 if lim inf 2"a" > 0. n_ao
The last result completes our discussion of the boundary condition, so we now turn our attention to determining when v is smooth. As in Section 8.1, this is true under minimal assumptions on f. (6)
Let G be any open set. If f is bounded, then v is smooth and, hence, satisfies (a).
Proof Let x e G and pick 6 > 0 so that D(x, 6) c G. If we let a = inf {t : B, D(x, 6)}, then the strong Markov property implies that (*) v(x) = EE.f(B,) = EX[EB(Q)f(Bt)] v(Y) dn(Y), D(x,a)
where it is surface measure on D(x, 6) normalized to be a probability measure, so it follows from (7) in Section 2.10 that v c- C°°. As in the last four sections, our last topic is to discuss what happens when something becomes unbounded. This time we will focus on G and ignore f. By repeating the arguments above, we can easily show the following:
8.6
(7a)
Polison's Equation
251
Suppose that f is bounded and continuous and that each point of 8G is regular. If for all xeG, Px(T < oo) = 1, then v satisfies (1) and is the unique solution. Conversely, we have
(7b)
Suppose that f is bounded and continuous and that each point of 8G is regular. If for some x e G, PX(T < oc) < 1, then the solution of (1) is not unique. Proof of (7b) Since h(x) = PX(T = oo) has the averaging property given in (*), it is C°° and has Ah = 0. Since each point of 8G is regular, P,(T = oo) < Px(T > 1)
- 0 as x - y e 8G. The last two observations show that h is a solution of (1) with f - 0, proving (7b). By working a little harder, one can show that adding aPP(T = oo) is the only way to produce new bounded solutions. (7c)
Suppose that f is bounded and continuous. If u is bounded and satisfies (1), then there is a constant C such that u(x) = EXf(BT) + CPP(T = ao).
Proof See Port and Stone (1978), Theorem 4.2.12.
8.6 Poisson's Equation In this section, we will see what happens when we add a function of x to the equation considered in the last section, that is, we will study (1)
(a) -Au = -gin G
_
(b) u is continuous in G and u = 0 on 8G.
If G = R" and the boundary condition is ignored, then any solution of (1) is called a potential of the charge distribution, because (if the units are chosen correctly) the gradient of the solution gives the force field associated with electrical charges distributed according to g. As in the first five sections of this chapter, the first step in solving (1) is to show (2)
Let T = inf{t : B, 0 G }. If u satisfies (a), then
M= u(B) + J(Bs)ds 0
is a local martingale on [0, T).
Proof Applying Ito's formula as we did in the last section gives u(B,) - u(B0) = fo Vu(BS) dBs + J Du(BS) ds, 2
0
252
8
PDB's not ('rn N. Solved by Rannin a Brownlan Motion
N and the first term on the right-hand side is a local martinpalc on 0, r). If G is hounded, then /s.,r (x) for all xeG (see Exercise 1 in Section 1.7), so if g is hounded and u is hounded and satisfies (a), then for s < T which proven (2). sinec JtAu -
5 1fu11z. + tIIgII.,
so M,, N < T, is a uniformly integrable martingale, and if u satisfies (b), then lim Al, = fo g(B,) dt
.fr
M,
Ex\Jog(B,)dtIFs//
.
Taking s = 0 now gives (3)
Suppose that G and g are bounded. If there is a solution of (1) that is bounded, it must be
v(x) = E. (fo g(B,) dt.
/
Again, it is easy to show (4)
Suppose that G and g are bounded. If v is smooth, then it satisfies (a) a.e. in G.
Proof The Markov property implies that on {T > s}, Ex\Jog(B,)dtl#s/
=
J g(B)di + En(s)(J(Bf)dt).
Since the left-hand side is a local martingale on [0, T), it follows that g(B,)dt + v(B.,)
is also. If v is smooth, then repeating the calculation in the proof of (2) shows that for s e [0, T), s
s
v(B,) - v(B0) + f g(B,) dr = f (ZOu + g) (B,) dr + a local martingale, 0
0
so again, we conclude that the integral on the right-hand side is a local martingale and, hence, must be - 0, so we have 'Au + g = 0 a.e. in D. After the discussion in the last section, the conditions needed to guarantee that the boundary conditions hold should come as no surprise. (5)
Suppose that G and g are bounded. Let y be a regular point of G. If x,, e G and y, then 0.
x
8.6
253
PolMon'. Equation
Proof We begin by observing: (i) In the last section we showed that if r - 0. P,,.(r > E) -, 0, and (ii) if G is bounded, then we have (see Exercise I of Section 1.7) C = sup Exr < oo. X
Combining the last two observations with the Markov property shows that for any r; > 0, Iv(x0I < EIIgII. +
E),
which proves (5). Last, but not least, we come to the question of smoothness. We begin with the case d > 3, because in this case,
dt < oo,
w(x) = E. f Joo00
so the strong Markov property implies that
w(x) = Ex f
dt + Exw(BL),
o
and we have and
(*) v(x) = w(x) - Exw(BT).
The last equation, allows us to verify that v is smooth by proving that w is, a task that is made simple by the fact that w(x) = c J I x - ylz-dg(Y) dy.
The first derivative is easy: (6a)
If g is bounded and has compact support, then w is C'. Proof As before, we will content ourselves to show that the expression we get by differentiating under the integral sign converges and leave it to the reader to apply Exercise 1 of Section 1.10 to make the argument rigorous.
Ix so
(x. _
ylz-d
C
\(z y)zJ1
/Z
2 - d,-Yi)z D,Ix_yIz-d=()((x 2
and we have
Diw(x) = c J (2 - d)iX`_ yig(Y)dY,
2(x,-Yi),
LOA
8
PD6's Thai ('en He Solved by Ruudni a Brownian Motion
the integral on the right-hand side being convergent, since Yld g(.v) I dy <- 11911.
J
lx
dyld-1
< 00.
L>O)
As in Section 8.2, trouble starts when we consider second derivatives. If
i#j, then ylz-d = (2 - d)(-d)lx - YI-d-2(x1 - yi)(x, - Y) Dijl x In this case, the estimate used above leads to JDiilx
-y12_dl < Ix- yl-d,
which is (just barely) not locally integrable. As in Section 8.2, if g is Holder continuous of order a, we can get an extra Ix - yla to save the day. The details are tedious, so we will content ourselves to state the result : (6b)
If g is Holder continuous, then w is C2.
The reader can find a proof either in Port and Stone (1978), pages 116-117, or in Gilbarg and Trudinger (1977), pages 53-55. Combining (*) with (6a) and (6b) gives (6)
Suppose that G is bounded. If g is Holder continuous, then v is smooth and hence satisfies (a).
Proof (6b) implies that w is CZ. Since w is bounded, it follows from (6) in the last section that x -p Esw(BT) is C. The last result settles the question of smoothness in d > 3. To settle the question in d = 1 and d = 2, we need to find a substitute for (*). To do this, we let
w(x) = JG(xY)g(Y)dY where G is the potential kernel defined in Section 1.8, that is, I
n
G(x,y) _
log(lx - yl) d=2
- llx - yl
d = 1.
G was defined as
p, (x, y) - a, dt 0 foo
where the a, were chosen to make the integral converge, so if $ gdx = 0, we see that
T0
J G(x,y)9(Y)dy = li m Ex
9(Bt)dt.
N.7
255
The Schrlidinger Equation
Using this interpretation of w, we can easily show that (*) holds, so tlgnin our problem is reduced to proving that w is C 2, which is a problem in calculuN. (nice all the computations are done, we find that (6) holds in d:5 2 and that in d I ,
it is sufficient to assume that g is continuous. The reader can find dctuils in either of the sources given above. On the basis of what we have done in the last five sections, our next step should be to consider what happens when something becomes unbounded. For the sake of variety, however, we will not do this, but instead, we will show how (1) can be used to study Brownian motion. Example 1 Let d = 1, G = (-1, 1), and g = 1. In this case, formulas (3) through (6) imply that v(x) = EX2 is the unique solution of (1)
(a) ?u"(x) _ -1 in (- 1, 1) (b) u is continuous on [-1, 1] and u(- 1) = u(1) = 0, so u(x) = 1 - x2. Once you see the one-dimensional case, it is easy to do the general case. Exercise 2
Let d > 2, D = {x: I x < 11, T = inf {t : B, 0 D}. Then
Exr = d(1 - Ix12). Remark: This result can also be proved by observing that IB,I2 - dt is a martingale, so Ix12 = 1 - dExr.
8.7 The Schrodinger Equation In this section, we will consider what happens when we add cu to the left-hand side of the equation considered in Section 8.5, that is, we will study: (1)
(a) 2'Au+cu=0inG (b) u is continuous in G and u = f on aG. We will explain the physical significance of this equation in the next section. For the moment, you should consider it simply as the inevitable next step in the progression established in Sections 8.1, 8.2, 8.3, 8.5, and 8.6. As in the first six sections of this chapter, the first step in solving (1) is to show
(2)
Let i = inf{t : Bt G }. If u satisfies (a) then
Mt = u(B,) exp (I c(B) ds) ff
is a local martingale on [0, r).
26
8
PDE'e That ('em Be Solved by fling a Brownian Motion
Proof Let c, =
c(B,)dLs. Applying Ito's formula gives
u(B,) exp(c,) - u(B0) =
fr
exp(c,)Vu(B5) dB, +
u(BS) exp(cs) dos o
0 e
+
Au(B,) exp(c,) ds, 0
2
which proves (2), since dc, = c(B,) ds, IAu + cu = 0, and the first term on the right-hand side is a local martingale on [0, r). At this point, the reader might expect that the next step, as it has been six times before, is to assume that everything is bounded and conclude that if there is a solution of (1) that is bounded, it must be v(x) = EE(.f(Bt) exp(cz))
We will not do this, however, because the following simple example shows that this result is false. Example 1 Let d = 1, G = (-n/2, n/2), and c = 1/2. If u(x) = cos x, then u' (x) = - sin x and u"(x) _ - cos x, so z u" + cu = 0 and u = 0 on 0G. But
there is obviously another solution: u = 0. We will see below that the trouble with the last example is that c = 1/2 is too large or, to be precise, if we let t
w(x) = Exexp(
c(B,)ds ),
then w = oo. The rest of this section is devoted to showing that if w * oo, then "everything is fine." The development will require several stages. The first step is to show (3a)
If w # oc and G is connected, then w(x) < oo for all x e G.
Proof Let c* = suplc(x)I. By Exercise 1 in Section 1.7, we can pick ro so small that if T, = inf{t : JB, - Bol > r} and r:5 ro, then Exexp(c*T,) < 2 for all xeG. If D(x, r) c G, then the strong Markov property implies that w(x) = E.[exp(c(T,))w(B(T,))] < Ex[exp(c*T,)w(('B(T,))]
= EX[exp(c*T)] J
w(y)dir(y), 8D(z,r)
since the exit time T, and location are independent (here TC is surface measure on D(x, r) normalized to be a probability measure). If S < ro and D(x, S) C G, multiplying the last equality by rd-1 and integrating from 0 to b gives
w(x) < 2 S
f
w(z) dz
D(z,6)
where C is a constant (that depends only on d).
8.7
257
The Schrtldinger Equation
Repeating the argument above and using c(T,) > -c*T,, givca it lower bound of w(x) > 2-'-C d
f",(X,6) w(Y) dy. ,a)
Combining the last two bounds gives: (3b)
Let S < ro. If D(x, 2S) c G and y e D(x, S), then w(x) > 2-(d+2)w(Y)
Proof w(x) > 2-1(26)d fD(x,26) w(z) dz
2-1 (26) d
w(z) dz
J d
>2 1( S)d 2 1Cw(Y)_ 2-(d+2)w(y). From (3b), we see that if w(x) < co, 26 < ro, and D(x, 26) < G, then w < oo
on D(x, S). From this result, it follows that Go = {x: w(x) < oo} is an open subset of G. It is easy to see that Go is also closed (if x - y e G and D(y, 36) c G, then for n sufficiently large, D(x,,, 26) c G and y e D(x,,, S), so w(y) < oc). From the last results, it follows (if G is connected) that Go = G, so (3a) holds. With (3a) established, we are ready to prove our uniqueness result. (3)
Suppose that f and c are bounded and that w # oc. If there is a solution of (1) that is bounded, it must be
v(x) = E.(f(B,) exp(ct)). Proof If u satisfies (a), then (2) implies that X3 = u(BS,,,)exp(cSA,) is a local martingale on [0, T). If u, f, and c are bounded and u satisfies (b), then letting s T T gives
u(x) = Ex(f(BT) exp(cT) ; T < t) + Ex(u(B,) exp(c,); T > t).
Since f is bounded and w(x) = Ex exp(ct) < oo, the dominated convergence theorem implies that as t --> co, the first term converges to
Ex(f(Bj To show that the second term -0, we observe that EX[u(B,) exp(c,); T > t] = Ex[Ex(u(B,)
T > t]
= Ex[u(B,) eXp(c,)w(B,); T > t]
and use the trivial inequality
w(x) > exp(-c*)PP(T < 1)
2
8
PDE'u 7ba1 Van Be Solvod by Rrrta/ Brownian Motion
to conclude that inf w(x) = is > 0. XEo
Replacing w(B,) by e,
EE[u(B,) exp(c,); T > t] < e-'EE[u(B,) exp(c.); r > t] < E 1IIuIImEx[exp(ci); T > t] - 0 as t
oo, since w(x) = E., exp(cT) < oo.
This completes our consideration of uniqueness. The next stage in our program, fortunately, is as easy as it always has been. (4)
Suppose that f and c are bounded and that w * oo. If v is smooth, then it satisfies (a) a.e. in G.
Proof The Markov property implies that on {T > s}, Ex(exp(c)f(BT)IS') = exp(cs)Ea(s)(eXp(c,)f(BT)) Since the left-hand side is a local martingale on [0, T), it follows that exp(c,)v(B,)
is also. If v is smooth, then repeating the calculation in the proof of (2) shows that for s e [0, T),
v(B) exp(c,) - v(B0) =
f(f& + cv) (B,) exp(c,) dr + a local martingale,
so it follows that the integral on the right-hand side is a local martingale and, hence, must be = 0, so we have ZAv + cv = 0 a.e. in G. Having proved (4), the next step is to consider the boundary condition. As in the last two sections, we need the boundary to be regular. (5)
Suppose that f and c are bounded and that w # oo. If f is continuous, then v satisfies (b) at each regular point of 8G. Proof Let y be a regular point of 8G. We showed in (5a) and (5b) of Section 8.5 that if S > 0 and x - y, then Px,,(r < S) -> l and Px,,(BT a D(y, S)) - 1, so if cl < M, then Psn(e-"ra < exp(c3) < e"i") - 1 and, since f is bounded and continuous,
EXJexp(c)f(B.); T < S) -+f(x). To control the contribution from the rest of the space, we observe that
Exjexp(ct)f(Bj; T > S) < eMall f < eMaIIfIImjIwII.P..(r
r > 8) a)
0.
This brings us finally to the problem of determining when v is smooth enough to be a solution. To solve the problem in this case, we use the same
8.7
29
The Schr6dinger Equation
trick used in Section 8.3 to reduce our result to the previous case. We observe that exp
(f
T
T
c(BS) dsl
=1+
J
o
T
c(B) exp (
\
,)o
c(B,) dr) ds, s
/
so multiplying by f(BT) and taking expected values gives
v(x) = 1 +
Ex
(c(Bs)exP(f
c(B,) dr)1(S<)) ds.
Conditioning on .'F5 and using the Markov property, we can write the above as
v(x) = 1 + f Ex(c(BS)w(BS); r > s) A 0
The second term on the right-hand side is of the form considered in the last section, so if c and f are bounded and w * oo, then v is bounded, so it follows from results in the last section that v is C 1. If c is Holder continuous, then the right-hand side is Holder continuous, and we can use (6) from the last section to conclude (6)
Suppose that f and c are bounded and that w * oo. If c is Holder continuous, then v is smooth and, hence, satisfies (a). Combining (4) through (6), we see that if, in addition to the conditions in (6), we assume that f is continuous, then v satisfies (1). Just as in the last section, this fact can be used to study Brownian motion. Example 2 Let d = 1, G = (-1, 1), c = - fl < 0, and f = 1. In this case,
v(x) = Exexp(-flr) < 1, so v is the unique solution of (1), and we can find v by "guessing and verifying." Let u(x) = acosh(bx) and recall that cosh x =
ex +2 ex sinh x = ex _ ex
cosh'x = sinh x sinh'x = cosh x. so we have
2u" - flu =
z
a2 - #a cosh(bx).
From the last equation, we see that u(x) = a cosh(bx) satisfies (a) if and only if
b2/2 = fi, that is, b = (, so picking a to satisfy the boundary condition, we find that
Ex exp(-fir) =
cosh(x 2fl)
cosh()
260
$
PDE's That Cam He Solved by Rua.M* a Nrownian Motion
When x = 0, the expression reduces to
E0exp(-/1r) = cosh(.,/2/1) '.
The formula above is valid only for fi > 0, but if we let /1= -a, a > 0, then cosh(
2a) =
e` 2- + e-"1-2' 2
=cos( 2a).
The resulting expression, Eo exp(at) = cos( 2a)-1,
makes sense for a < n2/8 and Too as a T n2/8. We leave it to the reader to prove
that the formula above is correct. We will return to this example in the next section.
If you know something about Bessel functions, you can extend the last result to higher dimensions.
Example 3 Let d>2, 2, G= {z: Izl < 11, and c=- -fl < 0. Again, v(x) = Ex exp(- fir) < 1, so v is the unique solution of (1). This time, however, it is not so trivial to guess the answer, so we just state the result: v(x) = CIXII-112Id/2
1(
where I, is the modified Bessel function m
m2
(Zv+ 2m
/(m!I'(v + m + 1))
M=0
(see Ciesielski and Taylor (1962) or Knight (1981), pages 88-89, for details). It is one of the great mysteries of life that the distribution of z starting from 0 is the same as the total amount of time a (d + 2)-dimensional Brownian motion spends in {z : I z I > 1 } (which can also be computed using the methods of this section). For more on this phenomenon, see Getoor and Sharpe (1979).
tis
Example 4 G = Rd, c(x) = -a - flk(x), where a, fl e(0, oc), and k(x) > 0. Since G is unbounded, this example is, strictly speaking, not covered by the results above. However,
E. (exp (- f f 'k (Bs) ds)) < 1 o
JJJ
and we have supposed that a > 0, so v(x) = fo dt e-'Ex
(exp (_fljk(Bs)ds)) f
nicely convergent, and it is not hard to show (details are left to the reader) that v is the bounded solution of (1), the boundary condition (b) being regarded as vacuous.
8.7
'the SchrtMtnaer Iqustlon
261
The most famous instance of this solution occurs when d = 1 and
k(x) =
I
t0
x>0 x<0
(which again does not satisfy our hypothesis). In this case, Kac (1951) showed that a(a + A
v(0) = l/
so inverting the Laplace transform, e-a
a(a + )
o
1
`
7r
o
e -Ps
s(t - s)
ds dt,
and observing that under PO the distribution of t-' l {s e [0, t] : BS > 0} I is independent of t, we get Levy's arcsine law: Po (I Is c [0, t] : BS > 0} I < Bt) =
1
n
dr
r(1 - r)
= 2 aresin(O). it
The reader should note that t-' l {s E [0, t] : Bs > 0} I
1/2
as t oc. In fact, the distribution of this quantity is independent of t, and its density has a minimum at t = 1/2! Examples where (1) can be solved explicitly are rare. A second famous example, due to Cameron and Martin (1944b), is k(x) = x2. In this case, Eo (exp
(- f
f (B5)2 ds)) =
(sech((2p)1/2t))1i2.
ff
The reader is invited to try to derive this equation. The proof given on pages 10-11 of Kac (1949) is a beautiful example of Kac's computational ability. Up to now, we have focused our attention on the question of the existence of solutions to (1). The probabilistic formula for the solution can also be used to study properties of the solution. Perhaps the most basic result is (7)
Harnack's Inequality. Let u > 0 and satisfy 'Au + cu = 0 in D = {x : I xI < 1}. Then for any r < 1, there is a constant C (depending only on r, c) such that if x, y e D (0, r), then
u(x) < Cu(y).
Proof Pick ro so small that if Tr = inf { t : I B, - Bo I > r} and r < ro, then Exexp(c*Tr) < 2, where c* = supIc(x)I. Repeating the first computation in the proof of (3a), we see that if S < ro and D(x, S) c G, then
u(x) < 2 b° f D(x,S)
u(z) dz
762
8
PDE's 9181 Can Be Solved by Rpmly u Brownian Motion
Figure 8.2
and
u(x) > 2-'-C J
u(y) dy.
,(x,6) Therefore, since we have assumed that u >- 0, we can repeat the proof of (3b) to conclude that if S < ro, D(x, 26) c G, and y e D(x, S), then (*) u(x) 2-(d+2)u(y) The desired result now follows from a simple covering argument. Fix r < 1
and pick b < ro A (1 - r)/2. Given x, y e D(0, 1 - r), there is a sequence of points xo = x, x1, ... , xm = y (with m < 3/6), such that for all i, we have Ixi - xi_1I < 2(5/3 and D(xi, 26) c D (see Figure 8.2). It follows from (*) that the inequality holds with C = 2(a+2)316
Remarks: (a) The constant in the proof above grows like exp(C(1 - r)-1) ; by working harder, you can get (1 - r)-,' (exercise). (b) The result in (7) is true if D is replaced by any connected open set G and D(0, r) is replaced by K, a compact subset of G. The details of the covering argument are much more complicated in this case. See Chung (1982), Exercise 3 on page 205, for a proof, or spend an hour or two and find your own. (c) The result above is also true if we assume only that c e Kd°` (the space defined in Section 8.2). The original proof, due to Aizenman and Simon (1982), was given in this generality. The fact that c can be unbounded causes quite a bit of trouble, but by following the outline of the proof of Theorem B. 1.1 given in Section 8.3 and using a clever time reversal argument as a substitute for the
8.8
EIReev.l.m of A + c
263
self-adjointness used in Step 3, they succeed in proving the key eslimule: 11' S < 60, then u(x) < C
u(y)dn(y) Jan(z, a)
(their (1.13)), and once this result is established, things follow pretty much as before. The reader can find the details clearly explained in their paper. Exercise 1
Use the Poisson integral representation ((2) of Section 3.3) to show
that ifu>0andAu=0inD,then foranyr<1,wehave forallx,ycD(0,r) that u(x) <
1-r
C1 +r)d
u(Y)
Hint: The worst case is u(z) = (1 - zl2)/z - 1". Problem Let r = inf{t : B, D} and suppose that EZ exp(cL) < oo. If we let gy(z) = EZ(eXp(ct)IB= - Y),
then any solution of (1) satisfies
jY((Y)kY(z)tht(Y).
u(z) = n
If we had good estimates on gy(z), then we could give a proof of (7) that is similar to the proof of Harnack's inequality which is given in Exercise 1, but I do not know how to do this. Note: The applications to Brownian motion are based in part on Section 2.6 of Ito and McKean (1964), where the reader can find more details and other applications. The proofs of (3a), (3b), and Harnack's inequality follow Chung (1982), which is, in turn, based on his previous work with several coauthors. These results were also discovered independently by Aizenman and Simon (1982), who proved their results for c e K,"'. Harnack's inequality is just one of several properties of solutions of Schrodinger's equation that can be studied using probabilistic methods. A good place to start learning about these results is Carmona and Simon (1981). They use probabilistic methods to study the exponential decay of Schrodinger eigenfunctions, and they give numerous references to earlier work on related topics.
8.8 Eigenvalues of A + c In this section, we will break the pattern set down in the first seven sections of this chapter and study a new type of problem:
N4
8
(I)
(a) }Au4 ru - AuinG
PDR's flit Con Be Nolvod by Rtanlty w Brownlnn Motion
(b) u is continuous in G and it
0 on 0G.
A function is that satisfies (I) is said to be an eigenfunction of ZA + c (with Dirichict boundary conditions), and 1 is the corresponding eigenvalue. These functions are of interest because they correspond to the pure tones of a drum made in the shape of G. More generally, they are stationary states of the Schrodingcr equation
iu,=IAu+cu. A good way of getting a feel for (1) is to consider a simple example. Example 1
Let D = (0, 1), c = 0. In this case, if we let u(x) = sin(nnx), then z
z
2 u"(x) = - (n2) sin(nnx) _ - (n2) u(x),
2 , -2n , -92 z
so (a) has solutions for A = -
2
2
,
.... The next result, which
is well known (see Courant and Hilbert (1953), vol. I), shows that this example is typical.
(A) If c is bounded and smooth enough (e.g., Holder continuous), then there is an infinite sequence of (real) eigenvalues
00>10>Al>12> 13... such that A,, -> - oo as n -* oo. Some of the eigenvalues may be multiple, but the first is simple, and the corresponding eigenfunction can be chosen
to be >0. In this section, we will investigate the probabilistic meaning of to and the
corresponding eigenfunction uo, and derive some characterizations of lo, ending with a "variational formula" for A,,. Symbolically, we will show that
4
q'1 !! kP2=V1i :! (PI:l0.
We will define the symbols above as we need them. The first inequality involves the function w used in the last section (and is from Chung and Li (1983)). Let w f(x) = E. (exp Jf(B)
(2)
dtl
)
cpo= -sup{O:we+e *ooinG}. 1
8.8
EIaenvshm of e + c
?Au+cu+Ou=O
265
inG
u=O inOG has a unique solution u - 0, so - 0 is not an eigenvalue. To work with go, it is convenient to recast it in a more compututitinal form. Let at = supE.(exp(ct); i > t) x
where
Jc(B5) A
Ct =
0
The strong Markov property implies that as+t = E.x(eXp(cs+t); i > s + t) = E.(eXp(cs)EB( )(exp(ct) l(t>t)); T > s) < a,EE(exp(cs); T > s) = atas,
Taking logarithms gives log as+t < log as + log at,
that is, bt = log at is subadditive. From the last observation, it follows easily that
(*) lim b/t = inf bs/s. f-.O
S>0
Proof It is clear that lim inf b/t > inf bs/s. t-oo
s>o
To prove the other inequality, observe that if s > 0, ns < t < (n + 1)s, and r = t - ns, then subadditivity implies
bt
co gives
lim sup bt/t < bs/s, t- ao
which proves (*) since s is arbitrary.
Let p, = limt-., (1/t)logat. It is easy to see that (3)
go <
Proof Let 1 > Cpl. Then if t >- to, log at < It (i.e., at < ett), so if 0 < -1, then
E. (exp (fo (0 + c) (BS) ds) ;T > t) = eetat <
f
f
2"
$
PDE's hut ('in no Solved by Rum doll a Brownlnn Motion
from which it follows easily that E.,
(exp (j
(0 + c) (B.) ds)) < 00.
Since this holds whenever -0 > cpl, it follows that
cpo= in fl-0:wc+0 *oc in G} 5 cp1. Having proved that 2o < cpl, we are now in a situation where we can use results of Donsker and Varadhan (1974-1976). Our next step (following their (1976) paper) is to show that (4)
cpi
=inf sup c + Au u
x
where the infimum is taken over all u e C+ = {u a C°° : u > 0 on G }.
Proof If 1 > >/i2, there is a ueC°° that is > 0 on G and has
c+Au
Ex(exp(f c(Bs)ds) ;T > t1
To prove this inequality, we observe that Au/2 + cu :!5; lu, so
vr- Av-cv=lue1`-
u+cule">0 inG x (0,00)
v(t, x) > 0
///
v(0, x) >- 1
on 8G x (0, oo) on G.
and using Ito's formula gives s
v(t - s, Bs) exp(c.,) - v(t, x) = f - V, (t - r, B,) exp(c,) dr 0
exp(c,)Vv(t - r, B,) dB,
+ 0
+ f v(t - r, B,) exp(c,)c(B,) dr 0
Ov(t - r, B,) exp(c,) dr.
+2 0
Since (VI -
A
v-
cv) >- 0, we see that
v(t - s, Bs) exp(cs)
8.8 Figeavdua of n + e
267
is a nonnegative local supermartingale on [0, T), so
Ev((t - T)+,
v(t, x)
Ex(eXp(c,); T > t),
proving the desired inequality and completing the proof of (4). Let >/il = sup inf
where the Jc + Adµ, 2u/
supremum is taken over all
probability measures on G and the infimum over all u c- C' 0. In this notation,
biz=infsuprrc+Audy µ
U
`J
2u
.
J
The next step is to show that, in our situation, the infimum and supremum can be interchanged. (5)
02 = 0i
Proof We only need to know that 0z < t/il, but since inequality in the other direction is trivial, we begin by proving it. Let F(x, y) be a function defined on some product space A x B. If (x,,, c A x B, then sup F(x,,, y) > F(x,,, x
y
y such that sup F(x,,, y) j inf sup F(x, y) x
y
inf F(x,
y
T sup inf F(x, y),
x
y
x
it follows that inf sup F(x, y) > sup inf F(x, y), x
y
y
x
proving iP2 > 01 .
The last inequality is valid for any function F, but as anyone who has heard of game theory can tell you,
(i) simple examples, for example, F(x, y) = lx - yl, show that we may have (here x - u, y - µ) inf sup F(x, y) > 0 = sup inf F(x, y) x
y
y
x
(ii) but if x -+ F(x, y) is convex for fixed y, y - F(x, y) is concave for fixed x, and F is continuous in a suitable sense, then inf sup F(x, y) = sup inf F(x, y). x
y
y
x
Theorems of the type described in (ii) are called min-max theorems. The one we use is due to Sion (1958).
26N
N
PDE's 71N ('rn 14 Solved by Rowing r Brownimn Motion
To apply this result, we write, with u = c-", Di(I" = (." l) D(je"
(."l),I/1 -+ ,I(I),h)2.
This change of variables makes (Let
+
J G(`µ,
l d= J(c++iXi2)d.
h) denote the right-hand side of the last equation. If we use the usual weak topology on p, then for fixed h e bC °°, p - G(p, h) is continuous and linear. On the other hand, if we use the C2 topology on C°° (i.e., h -+ h if and only if D,,h - Dah uniformly on G for all a with IaI < 2), then for a fixed probability measure, h
G(µ, h) is continuous and
G(p,0hl+(1-0)hz)= f cdp+0 f Mldµ+(1-0)f AZzdu +
JIOVh1 + (1 - 0)Vh2I z dµ.
2 To conclude that h - G(p, h) is convex, it suffices to show that for all a, b, and 0 e [0,1 ] (Oa + (1 - 0)b)2 < Oaz + (1 - 0)b z.
Proof Since (Oa + (1 - 0)b)2 < (OIal + (1 - 0)1b 1)2 and the result is trivial when a = 0, letting c = IbI/IaI it suffices to show that for all c > 0, (0 + (1 0)c)2 < 0 + (1 - 0)cz. This is true when c = 0. Differentiating the difference reveals that
aac = 2(1 - e)c - 2(1- 0)(0 + (1- 0)c) >0 ifc> 1
<0 ifc<1. Checking the value at c = 1 reveals that 0 + (1 - 0)cz - (0 + (1 - 0)c)2 = 0, so the inequality holds for all c. With the inequality above verified, we have shown that the hypotheses of Sion's theorem are satisfied and, hence, that (5) holds. This brings us to the fifth stage in the cycle: (6)
01 < (P l
1 = lim log sup E,,(exp(ct) ; r > t). rm t x
Proof We begin by making the simple observation that y f c dp is continuous and J(µ) = inf f A u/2u dp is upper semicontinuous, so there is a measure µo with 01 = JcdPo + J(µo).
8.8
269
EIQemduen of A + e
This result frees us from having to deal with supremum over µ in the dellnition of Vi, and prepares us for the main part of the proof. In order to focus attention
on the main steps of the proof, we will not prove first equality in detail. A complete proof (which involves discretizing time and passing to the limit) can be found on pages 601-602 of Donsker and Varadhan (1976). Let v(t, x) = Ex(exp(c); i t). 8t
j'lo(v(tx))d/Lo(x) =
> J v(t x) at v(t1
(t, x)
dµo(x)
x) (A + cl v(t, x) dµo(x)
J
_ f cdµo + J Av(t,x)dµo(x)
? Jcdµo+APO) =0i, so
f log(v(t, x)) dµo > 4i, t. Jensen's inequality implies that log Jdµo(x)v(t, x) > 0, t,
and we have
J dµo(x)v(t, x) > exp(4i, t), which proves (6). The last link in the chain is: (7)
4P 1 5 4.
Proof Let 1= q,1. We want to show that 1 is in the spectrum, that is, if we let R1f(x) = f
m
e-uEx(.f(B,)
exp(c)) dt,
0
then R, is not a bounded operator on C(G). If R, 1 is bounded, then we have 00 >
j'dzo(x)J e`v(t,x)dt o
f dte " f dµo(x)v(t,x), ,J
but it follows from (4), (5), and the proof of (6), that the last expression is
270
N
PDE's That Can Be Solved by Rumina
Brownian Motion
dt r "cxp(ot t) = oo.
Z 0
This is a contradiction, which proves (7) and completes the chain
A0
1. The equality of A0 and p0 in the case c > 0 is due to Khas'minskii (1959), who showed that )0 < 0 was equivalent to the existence of a solution
of2Au+cu =0that is > OonG. 2. The function u that minimizes sup c + x
Au 2u
is the eigenfunction go associated with 2o. The equality of 02 and A0 was first proved by Protter and Weinberger (1966). 3. The function
J (µ) = inf
f
0u 2u dµ
defined in the proof of (6) is -1 times what Donsker and Varadhan call I(µ). If dµ = fdx where f is smooth, then
J(µ) = -
f JVgl2dx
where g =f112 (see Donsker and Varadhan (1975-1976), I, Section 4), so substituting this equality in the definition of 01 gives
Ao = - inf
cg2 + I Vg I2 dx,
9 E Lz J
119112=1
the classical Rayleigh-Ritz variational formula for the first eigenvalue. The approach we have taken is certainly not the most direct way of proving this result; see Courant and Hilbert (1953), vol. I, Chapter 6. 4. The results we have proved above for Brownian motion are only a small part (and a relatively trivial one) of the theory of large deviations for Markov processes developed by Donsker and Varadhan. Their theory is one of the most important developments in probability theory in the last ten years, but you will have to learn about this from somebody else.
9 Stochastic Differential Equations
9.1 PDE's That Can Be Solved by Running an SDE Let Lf(x) = i >;, A,j(x)D,j f(x) + Y, b,(x)D, f(x), where the A,j(x) and b,(x) are
(for the moment) arbitrary. In this chapter, we will consider the following equation : (1)
(a) u, = Lu in (0, oo) x Rd (b) u is continuous in [0, co) x Rd and u(0, x) = f(x).
In Section 8.4, we solved (1) in the special case A(x) - I by first solving the stochastic differential equations
dX"=dB,+b(XX)dt Xo =x and then running the resulting processes to solve (1): u(t, x) = Ef(X, ).
On the basis of the results for A - I, it seems reasonable to try to solve the general case by solving
Xo = x
(*) dXX = o (XX) dB, + b(Xx) dt
(where o is a d x d matrix) and letting u(t, x) = Ef(XX ). To see which a to pick, we apply Ito's formula to get s
u(t - s, X) - u(t, Xo) =
- u, (t - r, X,) dr 0
+I D,u(t-r,X,)dX, '
oS s
+2
ij
Jo
D;,u(t - r, X,) d<X', X'>r 271
272
9
Stochutlc I)Ifferentlol Equatloou
(here we have taken the liberty of dropping the superscript x to simplify the formulas). Now dXs = bi(XS) ds + Y aij(Xs) dBs ,
j
so it follows from the formula for the covariance of two stochastic integrals that <Xi1XjX= Y_
aik(X,)ajk(Xs)dS
k
and we have 5
u(t - s, X5) - u(t, X0) = f - u, (t - r, X,) dr 0
+Y
Diu(t - r, X,)bi(Xr) dr
i
+ a local martingale 5
+ 2Y
Diju(t - r,Xr)(aaT)ij(X,)dr,
so if CUT = A, then we have
u(t - s, XS) - u(t, Xo) =
(- ur + Lu) (t - r, X,) dr + a local martingale. 0
The condition A = auT obviously restricts the set of A's we can consider, for if A = auT, then for each xERd"1 (i.e., Rd viewed as d x 1 matrices), XTAx = XTQQTX = Ior TXIZ > 0,
that is, A is nonnegative definite. If we assume (as we can without loss of generality) that A is symmetric, then this condition is also sufficient, because results in linear algebra tell us that any nonnegative definite symmetric matrix can be written as UT DU, where U is an orthogonal matrix (i.e., UTU = I) and D is a diagonal matrix. This observation allows us to define a by setting or = UTCU, where C > 0 is the diagonal matrix that has CZ = D. With this choice of a, we have: (2)
If u satisfies (i) and X satisfies (*), then M., = u(t - s, Xs) is a local martingale on [0, t).
Proof By computations above,
u(t - s, Xs) - u(t, Xo) = f (- ut + Lu) (t - r, X,) dr + a local martingale. o
If If u satisfies (i), the first term is = 0.
Remark: Looking back at the proof of (2), we see that after we used Ito's formula to conclude that
9.1 M's That Can Be Solved by Running n SDE
273
u(t - s, Xs) - u(t, Xo) = f - U, (t - r, X,) dr 0
+1: f5 Diu(t-r,X,)dX, i
2
0
+ 1Y_
D;ju(t - r, X,)d<X`, XJ) o S
all we did was work out what dX, and d<X`, X'>, were and plug in their values, so we have the same conclusion if X satisfies : (**)
(i) For each i, XX - f o b;(X) ds is a local martingale (ii) For each i, j,
A;;(X)ds.
<X`,X'>, = J0
We will see in the next section that there is essentially no difference between (*) and (**).
As has been the case many times before in Chapter 8, the last result leads immediately to a uniqueness theorem. (3)
Let X be a solution of (*) (or (**)), with X0 = x. If there is a solution of (1) that is bounded, it must be v(t, x) = Ef(X).
Proof If u satisfies (1), then MS = u(t - s, X) is a bounded local martingale on [0, t) that converges to f(X) as s r t and satisfies M, = E(f(X,) j A), so taking s = 0 proves (3).
Remark: (3')
The reader should note that (3) is also a uniqueness result for (*):
Suppose there is a solution of (1) that is bounded. If X, and X7 are two solutions of (*) with X0 = Xo = x (constructed, perhaps, on different probability spaces using different Brownian motions), then
Ef(X:) = Ef(X') (2) and (3) may look the same as the corresponding steps in Sections 8.1 through 8.7, but when we start to consider the existence of solutions, things become very different. We first have to construct solutions of (*) and then run them to produce solutions of (1). The first task will be accomplished in Sections 9.2 through 9.5 (since I am a probabilist, a neophyte, and a pedagogue,
we will spend some time investigating the countryside on the way to our destination-constructing "weak" solutions of (*)). In Sections 9.6 and 9.7, we turn our attention to v(t, x) = Ef(XX) and prove the analogues of the results
274
9
Stochutk 1)Iffercntrl Equatlou
that we called (4), (5), and (6) in Sections 8.1 through 8.7. The first two results, which are easy consequences of the Markov and Feller properties, are dispensed with in Section 9.6. In Section 9.7, we confront (but do not conquer) the problem of proving (6).
9.2 Existence of Solutions to SDE's with Continuous Coefficients In this section, we will describe Skorohod's approach to constructing solutions of stochastic differential equations. In order to focus our attention on o and not on b (which we have already considered in Section 8.4), we assume that b - 0. The same method works when there is a (bounded) continuous b * 0, but there are more estimates to do and, as parenthetical qualification suggests, many of the complications involve issues (e.g., explosions if b is too big) that we have considered earlier. Skorohod's idea for solving stochastic differential equations was to discretize time to get an equation that can be solved by induction, and then pass
to the limit and extract subsequential limits to solve the original equation. Given this approach, it is natural (and almost necessary) to assume that each A,3 is a continuous function of x, and, for the moment, we will suppose that A is bounded, that is, IAij(x)I < M for all i, j, x. For each n, define X"(t) by setting X"(0) = x and, for m2-" < t < (m + 1)2-",
X"(t) = X"(m2-") + o,(X"(m2-"))(B, - B(m2-")), where the second term is the matrix Q(X"(m2-")) times the Brownian increment
(B, - B(m2-n)). Since X" is a stochastic integral with respect to Brownian motion, the formula for covariance of stochastic integrals implies
<X", xi>, = k k
J0
Aij(Xn([2"s]/2")) us,
and it follows that if s < t, then
<M(t-s), so we have (see Section 6.3)
E sup I X,i(u) - XX(s)I° < CEI<XX>, - <Xi>slp12 uc[s.r]
so by taking subsequences, we can guarantee that for each i and rational t, converges weakly to a limit as k -+ oo. Taking p = 4, we see that
9.2
Existence of Solutions to SDE's with Continuous Coefficients
275
CM(t - s)2,
E sup I XX(u) u E[s, t]
so the processes X satisfy Kolmogorov's continuity condition uniformly in n. Combining the last observation with a standard result (Theorem 14.3 in Billingsley (1968)), we can conclude that the measures induced on (C,'6) by the X"(k) converge weakly to a limit QX on (C, W). I claim that under Q., the coordinate maps XX(co) = cot satisfy (*). To prove this, we will first show that if f e C 2 and we let Lf = 2 Y A;,Dijf then we have (1)
f(X) -f(Xo) -
Lf(XS)ds J is a local martingale/QX.
Proof It sufficesto show that if f, D; f, and D,j f are bounded, then the process above is a martingale. Ito's formula implies that
f(X"(t)) -f(X"(s)) _ >J iD1f(X"(r))dX.(r) s
t
+ 2Y
Dijf(X"(r))d<X,:,XI>,, s
and it follows from the definition oY" that
f 0
so if we let
L"f(r) _ Y A,J(X" ([2"r] 2-"))Djjf(X, ),
then f(X"(t)) - f(X"(s)) - f s L"f(r) dr is a local martingale. The Skorohod (1956) representation theorem implies that we can construct processes Yk = X"(k) on some probability space in such a way that with probabil-
ity 1 as k -- oc, Y, converges to a limit Y, uniformly on [0, T] for any T < oo. If s < t and g : C - R is a bounded continuous function that is measurable with respect to F,, then
E(9(Y) IftY) -
-f(Y) - JLf(Yr)dr}) = limE(9(Yk).{f(Yk) -f(YY) - f" L,f(r) drjI = 0,
which proves (1).
276
9
Stochautlc l)Ifferendal Equation
xixj, we see that under Q. the
Applying (1) to fi(x) = xi and coordinates X; are local martingales with
Aij(X)ds,
<X`, X'X = 0
that is, X solves the problem we called (**) in Section 9.1. The final step is to construct a Brownian motion B such that
(*) Xt-X0= ra(X.)dBs. J0
If a is invertible for each x, then the proof is trivial. We let Bt = fo u-1(X5) dX5.
The associative law implies that this process satisfies (*). To see that it is a Brownian motion, observe that each component B; is a local martingale and that
B',Bj>t-
aik1(Xs)Qji1(Xs)A kI (XD ds ki
-
o t
(o-'Au-1)ij(/Xs) ds = bijt,
0
since a is symmetric and
a-'Au-1 = a-1a2a-1 = I. When a is not invertible, for example, when a - 0, one has to first enlarge the space by adding independent Brownian motions and then use some linear algebra to get around the fact that a-1 does not exist. Since the details get a little messy, we leave it to the reader either to figure out how to do this or to look up the answer in Ikeda and Watanabe (1981) on pages 89-91.
The discussion above shows how to solve (*) when A is bounded and continuous. We now deal with a general continuous A. Let 0 < g(x) < 1 be a continuous function such that A(x) = g(x)A(x) is bounded. Let Q(x) _ g(x)1"2a(x'') and let Y5 be a solution of dYs = &(YY) dB,. Let
at=Jg(Ys)ds (
T=
g(Ys)ds, J
and for s < T, let
ys=inf{t:a,>s} Xs = Y(ys)
Since ys is an increasing family of stopping times, each component Xs is a local martingale. To compute <Xi, Xj>s, we observe that if t < T, then t
XXX? - JA()ds = 0
YY(t, YYit) - JAiJ(Y(s))ds. 0
9.2
Existence of Solutions to SDE's with Contlnuoue Coefficients
Changing variables s = o, in the integral above and observing that )'(n,) converts the right-hand side to
277
r
Ylt)
yr(nYv) -
fo
which is a local martingale, so we have .
JAj(Xs)ds.
<X' 1X'>1 =
0
As before, we can construct a Brownian motion B such that for t < T,
X, - X0 = J a(Xs) dB, If P(T = oo) = 1, then we have solved (*) for all times and we are done. If P(T < oo) > 0, then, as you might guess from the discussion in Section 8.4, we are also done, for the process has exploded, in other words, lim,1T Y = 00 a.s. on IT < oo }. It is easy to explain why this is true-for any R, there is a 6 such that if Ixl < R and S2R = inf{t : IXI > 2R}, then Px(S2R > S) > 6; therefore, with probability 1, X, cannot leave I x I < 2R and return to I x I < R infinitely
many times in a finite time interval. A full proof, however, requires the strong Markov property, which we have not established, and a number of unpleasant
details, so we leave the rest to the reader. Again, a complete proof can be found in Ikeda and Watanabe (1981), this time on pages 160-162. To get an idea of when explosions occur, consider the following: A;i(x) = (1 + Ix16)5;j. _ In these processes, if we let g(x) = (1 + lxl8)-1, then A(x) = I, so only the Example 1
second part of the construction is necessary: W
T= Jo ( 1+IBIb)-1ds. In d = 1, 2, Brownian motion is recurrent, so T = oo (and the process never explodes). In d > 3, Fubini's theorem implies that
ExT = E.
(1 + iBsl6)-1 ds 0
=C
lx
The integrand -
f
-
1
1
y1d-2 I + Iylb lyl2-a-d
dy.
as y -. oo, so if
r2-b-drd-1 dr < o0
(i.e ., S > 2), we have EXT < oo.
Conversely, we have (2)
If trace(A) < C(1 + lxl2), then QX(T = cc) = 1.
27K
9
Stochutlc Differential Equ.tlor
Proof This proof is the same as the proof of (7) in Section 8.4. Let p(x) _ 1 + 1x12. By Ito's formula, w(Xt)-(P
(x°)=Y
it,
i=1 Jo
o t
d
= a local martingale +
A;;(X5) ds. 0i=1
The last integral < C f o cp(X5) ds, so another application of Ito's formula shows
that e-ctcp(X) - tp(Xo) = a local martingale + Y f t e-cSAjj(Xs) ds 0
+ J(_C)eco(X)ds = a local supermartingale.
If we let T. = inf{t: IXI > n}, it follows from the optional stopping theorem that Note: Our treatment of the existence of solutions follows Section 4.2 of Ikeda and Watanabe (1981), who in turn got their proof from Skorohod (1965). The treatment of explosions here and in Section 8.4 is from Section 10.2 in Stroock and Varadhan (1979). They also give a more refined result due to Hasminskii.
9.3
Uniqueness of Solutions to SDE's with Lipschitz Coefficients The first and simplest existence and uniqueness result was proved by K. Ito (1946).
(1)
If for all i, j, x, and y we have I a (x) - aij(y) I < KI x - y I and I bi(x) - bi(y) I < KI x - y1, then the stochastic differential equation X= x+ fo t a(XS) dB., + fo t b(X) ds
has a unique solution.
Proof We construct the solution by successive approximation. Let X° = x and define :
X' = x + fo a(X: -1) dBs
+ J b(X: -1) ds for n > 1. 0
9.3
279
Uolqueo.M of Solution to SDE'. with Llp.chltz Codflclsnte
Let A,,(t) = E ( sup Xs - Xs -' j
.
I
Ossst
J
We estimate A. by induction. The first step is easy: XSl
= x + a(x)Bs + b(x)s,
so
IX91- X°I < Ia(x)Bsi + Ib(x)Is.
Squaring and using the fact that sup Ia(x)BSI ossst
a 1112
sup lo(x)BI,
o<s<1
gives
A1(t) < C1t + C2t3/2 + Ib1212 < C(t + t2)
where
C=C1+C2+lb12. To bound Am(t) form > 2, we recall that Ia + b12 < 2a2 + 2b2 and observe that this implies that for n >- 1,
2E sup
ta( s) -
a(Xs-1)dBs I 2
o5t5T fo t
+ 2E sup
05tsT
2
fb(Xn) - b(Xs -1) ds 0
To bound the second term, we observe that the Cauchy-Schwarz inequality implies that e b'(Xs) - b`(Xs 1) dsl2 <
\
\J
f t (b`(X:) - b`(X: -1))2 ds/
(fo
/, 1 dsJ
so we have
(a) 2E sup Jb(x:) - b(x:')ds O
2
< 2TE f T Ib(X:)
- b(Xs-1)I2ds
0
0
T
< 2dTK2E
I Xs - Xs -1 12 ds. 0
To bound the other term, let ai be the ith row of a and observe that Doob's inequality implies that 2
E sup (\fo (a,(X.,) - a,(Xs 1)) dBs) < 4E 0
J
(f
T
\0
l2
(ai(XS) -
fT
= 4E
I a,(Xs) - a,(X:-1)I2ds, 0
J
280
9 Stochudc Differential Equations
so we have 2
(b) 2E sup
fo, (a(XS) - a(Xs-1)).dBs
O
2
< 2EY sup
i=10
(ai (X") - ai (X"-')) - dB,
I f,0
<8EY JI1(x:) - o(X:-1)I2ds t=1
0
T
< 8d2K2E
IXs - Xs-1I2ds. 0
Combining the last two inequalities shows that T
(c) 0"+1(T) < BE
I Xs - Xs -1 12 ds 0
A. (s) ds 0
(where B = 2dTK2 + 8d2K2 depends on T). Since Al(t) <- C(t + t2), iterating (c) gives that if t < T, then
C(s + s2) ds = BC (+) t
A2(t) < B 0
A3(t)
and it follows by induction that if t <- T, then (d) O"(t) < Bn-1C
2t"+1
t"
With this estimate established, the rest of the proof is routine. Chebyshev's inequality shows that
P (Osup I X, -
X°-'l
> 2-n) < 22nD"(T)
5t5T
Since the right-hand side is summable, the Borel-Cantelli lemma implies that
P ( sup I Xn -
X°-'l
> 2-" i.o
0
lJ = 0,
so with probability 1, Xe -+ a limit Xt uniformly on [0, T], and it follows from the estimates above that
(e) for all m < n < oo,
E(sup IXm-Xsl2) <( n Ak(T)1/2 //I2. 05s
f
=m+1
If we let XX° = Xt, then the result also holds for n = oc.
9.3
Uniquenew of Solutions to SDE's with Lipechits Coeflkbete
281
Proof If n < oo, then it follows from the triangle inequality that
sup IX9 -XSI2
O<s
E Ok(7
sup IXm - XT-1I
05s
k=m+1
2
2
k=m+1
Letting n -> oc and using Fatou's lemma proves the second claim. At this point, we have assembled all the ingredients. The rest of the proof
consists of applying what we have learned to prove (1). To see that Xt is a solution, we observe that if we let
Y:=x+ J a(Y)dB5 +
,
th en it follows from the proof of (c) above that (c')
Ersup IYt-ZtI2) -BE
f
`0
T
IY-ZsI2ds.
o
Letting Yt = X' and Zt = Xt shows that
E sup IX°+1 0
fT
Xti2) < BE
f
J0
IXs - XXI2 ds
<;BTE (sup IX '- X, OSs
f
0
byO,e solimXtn+1=X
t
To prove uniqueness, observe that (e) implies that
E( sup Ix-X,I21 <(-
/
0<s
z
Am(T)1/2
\`m=1
so if Y is another solution with E supo<ss T I Ys I2 < oo and we let
4P(t)=E(sup IXs0<s
Y11
2),
/
then p(0) = 0, (c') implies that lp(t) < B
f t (p (s) ds, 0
and it follows from an easy argument that cp(t) < eeBt for any s > 0, so cp = 0 and, hence, X = Y. To remove the integrability condition on Y, observe that for any R < oe, we can modify a and b outside IxI > R in such a way that a and b still satisfy IQ;;(x) - a;;(y)I < KI x - yI and Ib;(x) - bt(y)I < KI x - yI, but
a and b are = 0 off some compact set. When we do this, any solution has E(supo<s
282
9
Stochwflc Differential Equations
Before the reader forgets the proof given above, we would like to observe
that the argument above gives a continuity result for x - Xx (the solution starting at X0 = x). Let X and Y denote solutions of (*) with X0 = x and Yo = y, and let X" and Y" be the sequence of processes generated by the construction above when X° = x and YO - y. If we let
0;,(t)=E(sup
IXS - YsI2
J
o<s
observe that Do(t) = I x - yI2, and iterate (c'), we see that if t < T, then
A (t)
Blx-yl2sds=B2lx-y12?
O'(t)
2
fo
It follows by induction that if t < T, then (d') 0;,(t) < B"I x - y1271,' so summing as we did in the proof of (e) gives
(e') E ( sup IX1 -
YYI2)
<
2
(
Ak(T)1/2/
<
Ix-y12CT,
where
= 00 ((BT)" \1/z CT
- k-o
n.
fI
< 00
and
BT = 2dK2T2 + 8d2K2T. Remark: If X0 = X and Y° = Y are random, then the same result holds with Ix - y12 replaced by El X - YI2. At this point, we have completed Ito's construction of solutions to (*). Before we proceed, however, the reader should note that we are not in the usual Markov process setup (cf. Section 1.1). We started with a Brownian motion defined on some probability space (S2, S, P), and by successive approximation we defined for each x c R° a process XX that satisfies
X" = x +
ft
Jo
a(X') dB., + f t b(X') ds. Jo
In other words, we have one probability measure (P) and a family of stochastic processes (X', x E Rd), rather than one set of random variables (the coordinate maps on (C, ')) and a family of measures (Px, x e Rd) (which was what we got
9.4 Some Examples
283
from Skorohod's construction in Section 9.2 and from the Camermi-Martin transformation in Section 8.4). Note: The material in this section has appeared previously in a number of places. The proof of (1) above is a hybrid of the proofs in Friedman (IV75), pages 98-102, and Stroock and Varadhan (1979), pages 124-126, with cmc small improvement : We estimate A(t) = E sup { X; - X; -1 I2 : 0 S .c' s t , rather than ElXt - x1 I2, and this simplifies the argument somewhat.
9.4 Some Examples Having solved our stochastic differential equation using two methods under two different sets of conditions, it is time to compare the results by looking at some examples. We begin with a "trivial" one. Example 1 Let A (x) - 0. Then the stochastic differential equations become deterministic : dXs = b(X,) ds.
Skorohod's approach allows us to construct solutions by successive approxima-
tion whenever b is continuous: In contrast, Ito's approach requires that b be Lipschitz continuous but implies that for such b, there is, for each x e R°, a unique (deterministic) process X, that has X0 = x and that solves (*). A simple family of examples shows that uniqueness need not hold if the assumption of Lipschitz continuity is replaced by Holder continuity of any order < 1.
Example2 Letd= 1,a-0,b(x)=Ixl6,0
so to make dXs = b(XX) ds, we first pick p so that p6 = p - 1 (i.e.,p = (1 - 6)-1) and then pick C such that Cp = Ca (i.e., p = C1-b or C = 1/(1 - 8)1-a)
Example 3 Let d = 1, a(x) = lxla, b - 0. Since a is continuous, Skorohod's approach allows us to construct solutions by successive approximation for any
S > 0. In contrast, Ito's approach requires that Q(x) = I X1612 be Lipschitz continuous, so we are restricted to S > 2, but in this range we can conclude that the solution is unique.
Although the condition required by Ito's approach is not sharp in this case, there is a good reason why Ito's approach, or any other approach that proves uniqueness, does not work for small 6-the solution of (*) is not unique
284
9
Stochutk Differential Equations
when 6 < 1. To prove this statement observe that by using the approach in the second part of Section 9.2, we can think of a(x) = 1/g(x) where g(x) = IxI-a and solve the equation by time-changing a Brownian motion. If 6 < 1, then
(a) Eo f IB,I ads= f s-612EoIBi ads 0
0
=
fos-112 dsl EoIB1I-6 < o0
(since the first factor is < oo for 6 < 2 and the second for 6 < 1), so we can construct a solution of dX, = I XSlaiz dB, that starts at 0 and does not stay there. Once we have two solutions starting from 0, it is easy to see that there are
two solutions starting from x 0 0. If x > 0
and
i = inf{t : B, = 0}, then by
results in n Section 1.9,
(b) E. J
xlyl-a(x AY)dy
TIBSI-a1(a,<x)ds= I z
Y1-ady < 00
(whenever 6 < 2), and we have I B,I a1(Bs>z)ds <
1-6
T < co
fol
(for any 6 < c0). Combining the last two results gives
T=
BsI -ads < oo
a.s.,
0
that is, starting at x, we hit 0 at a time T < c0. Once we hit zero, we have two choices : We can stop, or we can continue by using the time substitution (or, if you are more sophisticated, you can stop the first time the local time of B, at zero exceeds a fixed or exponentially distributed level, or ... ). At this point, we have settled the uniqueness question when S z 2 (unique) and when 6 < 1(not unique). Looking at the proofs of (a) and (b) more carefully reveals that you cannot escape from 0 when 6 >_ 1 and you cannot reach 0 when 6 >- 2 (see Exercises 1 and 2 below for converses). On the basis of this conclusion,
you might guess that the solution is unique when 1 < 6 < 2 and that all solutions stop when they hit 0. This conjecture is indeed true. The first fact is a consequence of a theorem of Yamada and Watanabe (1971). (1)
Suppose that (i) there is a strictly increasing function p with la(x) - a(y)I < p(I x - yl) that has p(0) = 0 and
JP2(u)du = c0 for all a> 0
9.4
285
Some I xemplee
(ii) there is an increasing and concave function ) with I b(x) - b(y)I c A (I % that has A (O) = 0 and
i'I )
A-1(u)du= oo forallE>0. fo,
Then (*) has a unique solution.
Proof See pages 168-170 of Ikeda and Watanabe (1981). (1) implies that our equation has a unique solution when 6 > 1, so we now have a complete picture of Example 3. unique for 6 > 1 The solution of (*) is not unique for 6 < 1.
-
While (1) helps us with Example 3; Example 2, on the other hand, helps us to understand the reason for the difference between the assumptions about a and b in (1). When b(x) = x1 and a = 0, (1) implies that
unique for 6 > 1 I not unique for 6 < 1,
solution of (*) jl
which is the same as the conclusion for Example 3, except for the fact that it pertains to a(x) = Ixlaiz To justify my remark that "you cannot escape when 6 > 1," show that for all e > 0, Exercise 1
IBSI-1 ds = oc P. a.s. fo
Proof By scaling and monotonicity, it suffices to prove the result where a is replaced by T1 = inf{t : IB,I - 1}, and to do this, it suffices to show that T1
lim sup nom
fo
I B.I 11(IBSI <2 no ds > 0 PO a.s.
To prove the last result, let R0 = 0, Sn = inf{t > Rn : IB,I = 2-"}, Rn+1 = inf{tr> Sn : B, = 0} for n -> 0, and N = sup{n : S. < T1}. Observe that N f S, N >Y 2"(Sm - Rm) 0Tl > Y M=O Rm
J
m=0
where EN > 2" and the S. - R. are i.i.d. with E(Sm - Rm) = C2 2n. A simple argument (compute variances) shows that the lim inf >- C a.s. Exercise 2 To justify my remark that "you cannot reach 0 when 6 > 2," show
that if r=inf{t:B,=0}, then for allx
JIBI_2ds= oo P., a.s. 0
0
286
9 Stochudc Dlfferontlil Equtlow
Proof The game is the same as in the last example, but the stopping times are different and don't work as well. Let S_1 = 0, R. = inf{t > Sn_1 : IB,I = 2"j, S. = inf{t > R" : IB,I = 2-"} for n >- 0, and N = sup{n : S" < T1}. Again, we have T, 0
N`
Sm
N
> Y 2Z"(S, - Rm),
L
m=0 Rm
m=0
but this time EN = 1 and E22"(Sm - Rm) = C, so an even simpler argument shows that the lim inf = oo a.s.
9.5 Solutions Weak and Strong, Uniqueness Questions Having solved our stochastic differential equation twice and seen some examples, our next step is to introduce some terminology that allows us to describe in technical terms what we have done. The solution constructed in Section 9.3 using Ito's method is called a strong solution. Given a Brownian motion A and an x e R°, we constructed a process X, on the same probability space in such a way that
(*) X = x +
f
A(X,) dB, +
0
f
b(XS) ds.
0
In contrast, the solution constructed in Section 9.2 by discretizing and taking limits is called a weak solution. Its weakness is that we first defined X, on some probability space and then constructed a Brownian motion such that (*) holds. With each concept of solution there is associated a concept of uniqueness. We say that pathwise uniqueness holds, or that there is a unique strong solution, if whenever B, is a Brownian motion (defined on some probability space (f2,.F, P)) and X and X are two strong solutions of (*), it follows that, with probability 1, X = Xt for all t >- 0.
We say that distributional uniqueness holds, or that there is a unique weak solution, if all solutions of (*) give rise to the same probability law on (C, ') when we map co e f2 --> X(w) E C.
Ito's theorem implies that pathwise uniqueness holds when the coefficients
are Lipschitz continuous. It is easy to show that in this case there is also distributional uniqueness. (1)
If the coefficients o and b are Lipschitz continuous, then distributional uniqueness holds.
Proof Let B, and B, be two Brownian motions, and let Xe and X, be the sequence of processes defined in the proof of Ito's theorem when X° = X° = x and the Brownian motions are B, and Bt, respectively. An easy induction shows that for each n, X" and X" have the same distribution, so letting n - oo proves M.
9.5
Solutlonn Week and Strong, Unlqueneaa Questions
287
With two notions of solution, it is natural, and almost inevitable, to risk about the relationship between the two concepts. A simple example due to Tanaka (see Yamada and Watanabe (1971)) shows that, contrary to the naive idea that it is easier to be weak than to be strong, you may have it unique weak solution but several strong solutions.
Let a(x) = 1 for x > 0 and = -1 for x < 0. Let W be a Brownian motion starting at 0 and let Example 1
B: = I a(W)dW Since B = e W is a local martingale with = t, B is a Brownian motion. The associative law implies that
a(W)-B=u(W)Z-W= W, so we have
W=
Ja()dBs.
Since a(- x)
c (x) for all x
0, we also have
-W = f a(-W)dBs. 0
The last two equations show that there is more than one strong solution of dXs = o (xs) dBs. To prove that there is a unique weak solution, we observe that if dXs = a(XX)dBs, then X is a local martingale with <X> = t and, hence, a Brownian motion.
In the other direction, we have the following result of Yamada and Watanabe (1971) : (2)
Pathwise uniqueness implies distributional uniqueness.
We will not prove this because we are lazy and this is not important for the developments below. The reader can find a discussion of this result in Williams
(1981) and a proof in either Ikeda and Watanabe (1981), pages 149-152, or Stroock and Varadhan (1979), Section 8.1. The last result and Tanaka's example are the basic facts about the differences between pathwise and distributional uniqueness. The best results about distributional uniqueness are due to Stroock and Varadhan (1969). To avoid the consideration of explosion, we will state their result only for bounded coefficients. (3)
Suppose that
(i) A is bounded, continuous, and positive definite at each point (ii) b is bounded and Borel measurable. Then there is a unique weak solution.
Much of Chapters 6 and 7 in Stroock and Varadhan (1979) is devoted to preparing for and proving (3), so we will not go into the details here. The key
20
9 Stochudc Differential Equador
step is to prove the result when b = 0 and I Aij(x) - 5,i1 5 s for all x E R°. The reader can find a nice exposition of this part in Ikeda and Watanabe (1981), pages 171-176.
In the material that follows, we will develop the theory of stochastic differential equations only for coefficients that are Lipschitz continuous. We do this not only because we have omitted the proof of (3), but also because in the developments in Section 9.6, the proofs of the Markov and Feller properties, we will use the fact that our processes are constructed by Ito's iteration scheme. You can also prove these results in the generality of (3) by knowing that there is a unique solution to the martingale problem, but for this you have to read Stroock and Varadhan (1979).
9.6 Markov and Feller Properties Having constructed solutions of (*) and having considered their nature and number at some length, we finally turn our attention to finding conditions that guarantee that v(t, x) = Ef(X") is a solution of (1) of Section 9.1. In Section 8.1, when we dealt with XX = B, a Brownian motion, the first step was to observe that if f is bounded, then the Markov property implies that
Ex(f(B)IJs) = v(t - s, B,), so v(t - s, B,) is a martingale. To generalize this proof to our new setting, we need to show that the X, have the Markov property, that is, (1)
If f is bounded and continuous and v(t, x) = Ef(Xx), then
E(f(Xt)I.)=v(t-s,X ). Proof Let X,.x(t) (the process starting at x at time s) be defined as the solution of
X, = x +
a(X,) dB,. + fS b(X,) dr g
for t >- s, and X,,x(t) = x for t < s. It follows from uniqueness that if s < I < u, then
X,,x(u) = X,,x,,x(,)(u)
a.s.
(recall that all the random variables X, x(u) are defined on the same pyobability space, (C, ', P)). From the last result, it follows immediately that if < s < t < u, if A 1, ... , A. are Borel sets, and if f is bounded, then 0< s1 < E(.f(XXx); Xx(s1)EA1, ... ,
X x(s1) E A1, ... , X x(s,,) c (recall that Xx = X0.x(t)). To prove (1), it is enough to prove the following:
9.6
289
Markov end Pallor Propertler
v(u - t, Xx).
(2)
To this end, observe that for any y e R°, Xr.v(u) a u(Br+s - Br, s Z 0)
and is, therefore, independent of ., so
E(f(Xr,v(u))I) = Ef(X,,,(u)) = v(u - t,y). Now if Y : S - R° is .yr measurable and takes on only a finite number of values yl, ... , y,,, then n
X"Y(u) _ Y l(r=vt)X y1(u)
a.s.
It follows from the last result that
E(f(XX,r(u))I.) = v(u - t, Y). To prove this equality, let A e., with A c { Y = y,} and observe that it follows from the first result that E(f(Xt,y(u)); A) = E(f(X1,v,(u)); A)
= v(u - t, y;)P(A) = E(v(u - t, Y); A). To extend our results to a general Ye., (and, hence, to prove (1')), pick a sequence Yn of random variables that take on only finitely many values and have Y. - Ya.s. and EI Yn - YI2 - 0 as n - oo. From the continuity result (e') in Section 9.3 (or, to be more precise, the remark afterwards), we have that XX,rn(u) - XX,r(u)
in L2
and, hence, E(f(Xr,r(u))I.°1t)
in L.
By the result above for simple Y, the left-hand side is v(u - t, Yn). To complete the proof of (2) (and, hence, of (1)), it suffices to prove (3)
Suppose that f is bounded and continuous. Then for fixed t, x -> v(t, x) = Ef(X x) is continuous.
Proof The continuity result (e') in Section 9.3 implies that
E(sup IXS -XXIZSlx-yl2CT, `OSsST
lJ
where CT is a constant whose value depends only on T. From this result, it follows immediately that if xn - x, X, n - Xt in probability and, hence, v(t,
Remark:
Ef(Xx^) -. Ef(Xx) = v(t, x).
The proof of (1) given above follows Stroock and Varadhan (1979), pages 128-130. 1 think it is a good example of the power of the idea from measure
290
9
Stochardc Dlfferoadal Equadooe
theory that to prove an equality for the general variable Y, it is enough to prove the result for an indicator function (for then you can extend by linearity and take limits to prove the result). With the Markov property established, it is now easy to prove (4)
Suppose that f is bounded. If v is smooth, then it satisfies part (a) of (1) in Section 9.1.
Proof The Markov property implies that E(.f(X, )I#;) = v(t - s, X. ,x).
Since the left-hand side is a martingale, v(t - s, Xs) is also a martingale. If v is smooth, then repeating the computation in the proof of (2) (at the beginning of this chapter) shows that s
v(t - s, XS) - v(t, x) =
(- v, + Lv) (t - r, X,) dr + a local martingale, 0
where
Lf= 1 YA`VD;;f+ Yb`Dif, 2 ii
so it follows that the integral on the right-hand side is a local martingale. Since
this process is continuous and locally of bounded variation, it must be = 0 and, hence, (- v, + Lv) = 0 in (0, ao) x Rd. With (4) established, the next step is to show the following: (5)
If f is bounded and continuous, then v satisfies part (b) of (1) of Section 9.1.
Proof From the continuity result used in the last proof, it follows that if
x - x and t - 0, then
x in probability and, hence,
v(tn, xn) = E.f(X',,(t,,)) - f(x)
9.7 Conditions for Smoothness In this section, we finally confront the problem of finding conditions that guarantee that v(t, x) = Ef(X) is smooth and, hence, satisfies (1)
(a) u, = Lu in (0, oo) x Rd (b) u is continuous in [0, oo) x Rd and u(0, x) = f(x).
In this category, the probabilistic approach has not been very successful. By purely analytical methods (i.e., the parametrix method; see Friedman (1964), Chapter 1), one can show (2)
Suppose that Aj(x) and b;(x) are bounded for each i and j and that
9.7
Condltlm for Smoottm u
291
(a) there is an a > 0 such that for all x, y e Rd, >Y,Aij(x)yj > alYl l
(b) there is a J > 0 and C < oo such that for all i, j, x, and y, l Aij(x) - A.,(Y)l <- Clx - ylP l bi(x) - b.(Y)l < Clx - ylP.
Then there is a positive function p,(x, y) that is jointly continuous in all of its variables and such that if f is a bounded continuous function, then v(t, x) =
, Y)f(Y)
dy.
Remark: This result is a combination of several theorems in Friedman (1964); see Friedman (1975), pages 141-142. For analysts, p,(x, y) is the fundamental solution with pole at x (i.e., p,(x, ) = Sx at t -+ 0) ; for probabilists, p,(x, y) is the transition probability
Pt(x,Y) = P(Xt = Y)
On the other hand, the best result I know of, which can be proved by purely probabilistic means, is on page 122 of Friedman (1975). (3)
Suppose that b, a, and f are C2 and that these functions and their derivatives of order < 2 are bounded by C(1 + lxl") for some C, y < oo. Then v is smooth and, hence, satisfies (1).
Remark: The reader should observe that although (3) requires more smoothness for the coefficients, it does not require the "strict ellipticity," (a) in (2), and hence can be applied in situations where a degenerates. In this context, the results obtained from (3) are almost the same as those obtained by Olenik (1966) using purely analytical methods. Probabilists (and anyone who does not read Italian) can find this result in Stroock and Varadhan (1979), Theorem 3.2.6.
Proof Since the proof is rather lengthy, we content ourselves with simply giving an idea of what is involved by assuming that everything is bounded and indicating why D,v exists. (In our defense, we would like to observe that not even Friedman (1975) spells out the details for the second derivatives; see page 123.)
To deal with derivatives with respect to x,, we will show that x -' X; is, in the "L2 sense," a differentiable function of x. To explain this statement, we need a definition. A function g(x, co) on Rd x i2 is said to have 8g/8x, =fin the LZ sense if
E (g (x + he,, co) - g (x, co) h
ash - 0.
-.l (x, (v) 2 -
0
M 9 Stochudc DIfferendel Equetloas
With this definition introduced, we can state our first differentiability result as (4)
If Djr and Deb exist for all j and are bounded and continuous, then 8Xx18x; exists in the LZ sense; furthermore, if we let e; (t) = (8; (t), ... , e°(t)) = 8X, 18x;, then 8i satisfies
e;(t) = e; +
f 8/ (s)D;b(XS) ds + Jo
f i
8/ (s)DDQ(XS) dBs,
,J o
where e; is the ith unit vector (i.e., (e;)3 = bij).
Proof There is only one way to start the proof of a result like this. Let h > 0 and write
h (Xi +hei - Xj) = e1 + J(b(xi) - b(XS )) ds
+ J(o(x:'i) - U(XS )) dB,,. 0
To change the first integral on the right-hand side into something that looks like the first integral in the desired answer, we write f- b(XS) ds 0
_
f ids f ld9 db(Xs
+O(Xs+hei-Xs))
('o
0
ds f 1
de Y D;b(Xs + 6(Xx +hei - X:)) Xs
+hei.
=J
.
i
0
j
XS .;
h
where XS - is the jth coordinate of X. ,x. The same trick works for the second term, with the result that h
r` a(XX +he,) - a(Xss) dB.'
= =
1
dBs
h
0
f r dBs 0
1
dB-
d
u(XS + e(X: +hei - XS))
0
f dO i> DQ(X: + B(X: +hei - Xs ))
+he,._
XS .i
Xs
gives a stochastic integral equation for Xs+hei - Xs Ah(s) =
s
h
s
in which the coefficients are almost the ones given in (4). The last step in the proof of (4), then, is to prove a result that says that if the coefficients of the equation converge in a suitable way, then so do the solutions. There is a large
293
Notes on Chapter 9
body of literature on this subject, which goes under the heading "Ntuhllity of solutions" (e.g., Jacod and Memin (1981)). A result that is sufficient For our purposes is given on pages 118-119 of Friedman (1975); the desired concluNilm follows immediately from that result. (4) shows that if the coefficients a and b are C1, then so is the solution XX when viewed as a function of x. Once this result is shown, it is not hard to show that v(t, x) = Ef(XX) is C'. One does this by proving the "chain rule": (5)
D1Ef(Xx) = YEDf(Xx)81(t). Proof The proof iss based on the trick used t*.'i prove (3). We write E(/'J (Xs +hei)
-J (`Ys ))
=E
f1
d6 def(Xs + O(Xs +he, - Xs ))
= D f 1 deY Df(X5 + e(xs Jo
i
+hei
- X x))
XS +he
,j
- X51i
h
and let h -* 0. Further details are left to the reader. A complete discussion of the results in this section can be found in Section 5.5 of Friedman (1975).
Notes on Chapter 9 To steal a line from somebody, this book ends "not with a bang, but with a whimper." The results in this chapter are but a small sample of the results known about SDE's and their relationships with PDE's, and even worse, in many cases we have thrown up our hands and referred the reader to Friedman (1975), or Stroock and Varadhan (1979), or Ikeda and Watanabe (1981) for the details. In our defense, we can only say that the book had to end somewhere and that the three sources of which we have referred are all good places to learn more about the subject.
Appendix A Primer of Probability Theory
A.1 Some Differences in the Language For an analyst, reading the probability literature must be like being an American in England. The language that is spoken is basically the same, but some of the
words are different or have slightly different meanings. My first task, then, is to explain some of the colloquialisms that probabilists use. For convenience of exposition, we will begin at the very beginning.
A probability space is a triple (0, F , P) where fl is a set, F is a a-field of subsets of S2, and P is a probability measure, that is, a nonnegative, countably
additive function on .F that has P(Q) = 1. Let JP be the set of all Borel subsets of R. A function X : S2 - R is said to be measurable if for each Be R we have that {oo : X(co) e B} e F. For convenience, the phrase "X is measurable with respect to .-" is often abbreviated Xe JF, and measurable functions are commonly referred to as random variables. In measure theory, one often talks about a sequence of functions f converging to a limit f "in measure" or "almost everywhere." These concepts are also used in probability, but they go by different names. A sequence of random variables X is said to converge in probability to a asn -.oo. limit Xif for all X is said to converge to X almost surely if P(co : X,, (to) - X(CO) as n -+ oo) _
1. The last conclusion is usually abbreviated as X - X a.s. The words almost surely and their abbreviation a.s. are used throughout
probability as substitutes for almost everywhere and a.e. For instance, if P(co : X(co) = Y(w)) = 1, then we say that X = Y a.s.
As in measure theory, X. - X a.s. implies that X. - X in probability, and the converse is false, but X - X in probability implies that there is a subsequence X a.s. We will prove the last statement, because the proof gives us an excuse to state some more definitions. 294
2"
Al .Some Difference. In the Language
The indicator of a set A is the function
_ IA (w)
1
wEA
0 w0A.
The notation is meant to suggest that this function is 1 on A. We do not uMr because it looks too much like X, our favorite letter for random variables, and we do not call this a characteristic function, because that term is reserved for something else (see Chapter 6 of Chung (1974)). If A. is a sequence of sets, then lim sup An = {w : lim sup lAn = 1 } n-m
m
m
=nUAn.
N=1n=N
The set defined above is usually referred to as {w : w e A. i.o. }, where i.o. is short for infinitely often. As the next result indicates, we often make {w : co e A i.o.} even shorter by dropping the w's.
Borel-Cantelli Lemma. If In P(An) < oc, then P(An i.o.) = 0.
Proof For any N, P(lim sup An) < P (U An < = P(An). n=N
f
n=N
Letting N oo gives the desired result. With (1) established, it is trivial to prove that X. - X in probability implies that there is a subsequence Xnk -> X a.s. Let Ek 10, pick nk - oo such that P(I Xnk - XI > Ek) < 2-k, and then apply (1). This proof is, of course, nothing more than the standard proof from measure theory translated into the language of probability theory. Up to this point, all of the changes have been semantic. When we turn our attention to integration, we encounter our first serious differences in notation. What an analyst would write as
XdP (assuming that f I X I dP < oc) n J ` a probabilist writes as EX (assuming that EIXI < oo) and calls the expected Jn
value of X, or the mean of X. One clear advantage of the probabilistic notation is indicated by the typography of the last sentence-EX consumes less space and does not have to be displayed. There is also one clear disadvantage. To steal a quip from Dynkin, "If you use E to denote expectation with respect to
P, then what do you use for expectation with respect to Q?" The obvious answer, F, is obviously unacceptable. Dynkin's remedy is to write PX instead of EX. Although this suggestion has considerable merit and would be useful at several points in the text, we will stick with the traditional notation.
2%
Appendix A Primer of Probability Theory
Extending the notation above to integration over sets, we will let E(X; A) = fA X dP.
Again, the notation is for typographical convenience and is motivated by the fact that the set A often has a complicated description. The proof of the next result illustrates the use of this notation. (2)
Chebyshev's Inequality. Let Y > 0 and let cp > 0 be a function that is increasing on [0, oo). Then
cp(a)P(Y> a) < Ecp(Y).
Proof Since p is increasing and >_ 0,
cp(a)P(Y> a) < E((p(Y); Y> a) < Ecp(Y). This result is trivial but useful. The following is a typical application : (3)
P(I XI > e) <
Ez X
.
A.2 Independence and Laws of Large Numbers I have heard it said that "probability is just measure theory plus the notion of independence." Although I think that this statement is about as accurate
as saying that "complex analysis is just real analysis plus," there is no doubt that independence is one of the most important concepts in probability. We begin with what is hopefully a familiar definition and then work our way up to a definition that is appropriate for our current setting. Two sets A and B are said to be independent if P(A fl B) = P(A)P(B).
Two random variables X and Y are said to be independent if for all Borel sets A and B,
P(XeA, YEB) = P(XEA)P(YEB). Two or-fields
and 5 are said to be independent if for all A E _5F and Be
P(A fl B) = P(A)P(B).
The third definition is a generalization of the second: Let F = a(X) = the a-field generated by X(= the smallest a-field .IF such that X E.F), let W = a(Y), and observe that A e a(X) if and only if A = {w : X((o) E C } where C E M. The second definition is, in turn, a generalization of the first : Let X = 'A, let Y = 1B,
A.2
Independence and lawn of large Numbers
297
and observe that if A and B are independent, then so are A` and B, A' iinI ,,e, A and SZ, A and 0 and so on. In view of the last two remarks, when we define what it means for several things to be independent, we take things in the opposite order.
a-fields F1, ... , S are said to be independent if whenever A, E.#, I'ur i= 1..... n we have that P rn
_ fl P(A1).
`t-1 Af i
i=1
Random variables X1, ... , X,, are said to be independent if whenever A, E. 9 for i = 1, ... , n we have that
P(n {X,EAi}l = JJ P(X,EA,).
/
\i-1
i=1
Sets Al, ... , A. are said to be independent if whenever I c { 1, ... , n} we have that
P (n IEI Ai//
= 1 1 P(Ai). iEI
If you think about it for a minute, you will see that the third definition is what we get when we specialize the second to X = lAi. It is important to note that the last definition is not equivalent to requiring that P(Ai fl Aj) = P(A,)P(AJ) whenever i j (this is called pairwise independence). Example 1 Let X1, X2 , and X3 be independent random variables that have P(X, = 1) = P(Xi = -1) = 1/2, and let Al = {X2 = X3}, A2 = {X3 = X1}, and
A3 = {X1 = X2}. These events are pairwise independent, since P(A, fl A) = P(X1 = X2 = X3) = 1/4 = P(A,)P(AJ), but they are not independent, since P(A1 flA2 flA3) = P(X1 = X2 = X3) = 1/4 1/8 = P(A1)P(A2)P(A3) Of the three definitions in the second list above, the first is the most important, so if it is unfamiliar, it would be a good idea to spend a minute and prove the next three results to get acquainted with the ideas involved. (1)
If
J are independent and X, E , have El X; I < oo, then n
n
E H Xi = n i=1
HEX,.
i=1
Note that EIII7=1 Xii < oc is part of the conclusion. The proof of this result follows a plan of attack that is standard in measure theory: Prove the result first for Xi = 1Ai, use linearity to extend the result to simple random variables, Ximonotone convergence to extend it to Xi > 0, and write Xi = X+ to prove the result in general. (2)
If X1, .. . , X. are independent random variables, then the a-fields S i = a(Xi)
are independent, and, consequently, if fl, ... , f are Borel functions, then f1(Xi), . . . ,fn(XX) are independent.
298
Appendix
(3)
Generalize the proof of (2) to conclude that if 1 < n1 < nz < the f : S2 R"i-ni-1 are Borel measurable, then
A Primer of Probability Theory
A (X1 , ... ,
Jk(Xnk-1 +1 ,
.
< nk = n and
, Xnk)
and independent.
Hint: Start with f that are of the form H .1,1.(X.) and then use the approach described for (1) to work your way up to theJ general result. Sequences of independent random variables are very important in probability theory, because they are (hopefully) what is generated when an experiment is repeated or a survey is taken. Motivated by this example, one of the main problems of the subject is to give conditions under which the "sample mean" (X1 + + Xn)/n approaches the "true mean" as n - oo, and to estimate the rate at which this occurs. Much of the first quarter of a graduate probability course is devoted to developing these results, but since they are not essential for what we will do, we will just prove one sample result to illustrate some of the concepts in this section and then state two more results in order to give you a taste of the theory. (4)
Let X1, Xz, ... be a sequence of independent and identically distributed random
variables (i.e., P(X; < x) is independent of i) that have EXz < oc. As n -, oo, 1(X1 n
+
EX1 in probability.
+ Xn)
Proof Letµ=EX;, Y=X; -p. Now n(X1+...
+XX)_P=n(Y1+... +Y),
so it suffices to show that the right-hand side - 0 in probability. The key to doing this is to observe that
)2(n n =E
E
YY
Y
i=1j=1
i=1 n
_
n
n
EY Y = > t=1j=1
(since if i
nEY1
i=1
j, EY Yj = EYEY, = 0). If we let S. = Y1 +
then
ES = Cn,
i.e.,
S" z
E
n
= C n
and Chebyshev's inequality implies that
/ Sn
PI
//
1\z
>E <E-LEIS"I =CE
\n- -
n
-c
n
->O.
+ Yn and C = EYz,
A.2
299
todepeadence and Lows of Large Numbers
The result above is a weak form of the weak law of large numhcrs. The strongest form of the strong law is (5)
Let X1, X2, ... be a sequence of i.i.d. random variables (for a translation, see (4)) with EI Xj I < co. As n
n (X1 + ... + Xn) -, EX1
oc,
a.s.
Analysts may recognize this result as being a consequence of Birkhoff's ergodic theorem. It is easy to show (see Exercise 2 below) that EI X; I < oo is necessary + X,)/n to converge to a finite limit so the condition in (5) is for (X1 + "sharp." The weak law holds in a little greater generality: (6)
Let X1, X2, ... be a sequence of i.i.d. random variables. There is a sequence of constants an such that 1
n
+ Xn) - a - 0 in probability
(Xl +
if and only if nP(I XiI > n) - 0. In this case, we can take an = E(Xi ; I Xi 1 < n).
Proof We leave the proof as an exercise for the reader. Once someone WIN you to look at Xi,n = Xi I(IXil,,n) and use Chebyshev's inequality, the rest is not hard. See Feller (1971), page 235, for a solution. The next two exercises are much easier, but the first is much more important.
Exercise 1  The Second Borel-Cantelli Lemma. If A_1, A_2, ... are independent and Σ P(A_n) = ∞, then P(A_n i.o.) = 1.

Hint: If M ≤ N < ∞, then

P(∩_{n=M}^N A_n^c) = ∏_{n=M}^N (1 − P(A_n)),

and, for any M, the right-hand side → 0 as N → ∞.
Exercise 2  Let X_1, X_2, ... be a sequence of i.i.d. random variables with E|X_1| = ∞. Then P(|S_{n+1} − S_n| > n i.o.) = 1, so S_n/n cannot converge to a finite limit on a set of positive probability.

Hint:

Σ_{n=0}^∞ P(|X_1| > n) ≥ ∫_0^∞ P(|X_1| > x) dx = E|X_1|.
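The tail-sum bound in the hint (together with its companion E|X_1| ≥ Σ_{n≥1} P(|X_1| > n)) can be checked numerically; the Exponential(3) variable below is an arbitrary choice:

```python
import numpy as np

rng = np.random.default_rng(2)
x = rng.exponential(scale=3.0, size=10**6)          # E|X| = 3

# sum_{n >= 0} P(|X| > n), truncated where the tail is negligible
tail_sum = sum(np.mean(x > n) for n in range(200))
print(tail_sum)      # about 3.53 here
print(x.mean())      # about 3; note tail_sum >= E|X| >= tail_sum - 1
```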
A.3 Conditional Expectation

Given a probability space (Ω, ℱ_0, P), a σ-field ℱ ⊂ ℱ_0, and a random variable X ∈ ℱ_0 (i.e., ℱ_0 measurable) with E|X| < ∞, we define E(X|ℱ) to be any random variable Y that has

(i) Y ∈ ℱ
(ii) for all A ∈ ℱ, ∫_A X dP = ∫_A Y dP.

Y is said to be a version of E(X|ℱ). Any two versions of E(X|ℱ) are equal almost surely.
Interpretation: We think of ℱ as describing the information we have at our disposal: for each A ∈ ℱ, we know whether or not A has occurred. E(X|ℱ) is, then, our "best guess" of the value of X given the information at our disposal. Some examples should help to clarify this. In each case, you should check that the answer we have given satisfies (i) and (ii).
Example 1  If X ∈ ℱ, then E(X|ℱ) = X; that is, if we know X, then our "best guess" is X itself. In general, the only thing that can keep X from being E(X|ℱ) is condition (i).
Example 2  Suppose that Ω_1, ..., Ω_n is a partition of Ω into disjoint sets, each of which has positive probability, and that ℱ = σ(Ω_1, ..., Ω_n), the σ-field generated by these sets. Then on Ω_i,

E(X|ℱ) = E(X ; Ω_i) / P(Ω_i).

In words, the information in ℱ tells us the element of the partition which contains our outcome, and given this information, our best guess for X is the average value of X over Ω_i.
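A small computation shows what the formula in Example 2 looks like in practice. In the sketch below (my example, not the text's: Ω = (0,1), X(ω) = ω², and a four-cell partition), the conditional expectation is just a cell-by-cell average:

```python
import numpy as np

rng = np.random.default_rng(3)
u = rng.uniform(size=10**6)     # the outcome omega, uniform on (0, 1)
x = u ** 2                      # X(omega) = omega^2

# Partition (0,1) into four cells Omega_i = (i/4, (i+1)/4); F = sigma(partition)
cell = np.minimum((4 * u).astype(int), 3)

# On each cell, E(X|F) is the average of X over that cell
cond_exp = np.array([x[cell == i].mean() for i in range(4)])
print(cond_exp)   # close to the exact values (3i^2 + 3i + 1)/48, i = 0, 1, 2, 3
```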
Example 3  Suppose that X is independent of ℱ, that is, P({X ∈ A} ∩ B) = P(X ∈ A)P(B) for all Borel sets A and all B ∈ ℱ. In this case, E(X|ℱ) = EX; that is, the information in ℱ is of no help in guessing the value of X.

Example 4  Let X ≥ 0 and let Q be the measure that has density X with respect to P, that is, Q(A) = ∫_A X dP. Let P′ and Q′ be the restrictions of P and Q to ℱ. Then Q′ << P′, and E(X|ℱ) is the Radon-Nikodym derivative dQ′/dP′. This is, in fact, how we show that the conditional expectation exists.

Conditional expectation has many of the same properties as ordinary expectation:
(a) linearity  E(aX + Y|ℱ) = aE(X|ℱ) + E(Y|ℱ)
(b) order preserving  if X ≤ Y, then E(X|ℱ) ≤ E(Y|ℱ)
(c) monotone convergence  if X_n ↑ X, then E(X_n|ℱ) ↑ E(X|ℱ)
(d) Jensen's inequality  if φ is convex and E|X|, E|φ(X)| < ∞, then φ(E(X|ℱ)) ≤ E(φ(X)|ℱ)
(e) L^p convergence  if X_n → X in L^p, p ≥ 1, then E(X_n|ℱ) → E(X|ℱ) in L^p
(f) dominated convergence  if X_n → X a.s. and |X_n| ≤ Y with EY < ∞, then E(X_n|ℱ) → E(X|ℱ) a.s.
These properties are not very hard to prove using the definition of conditional expectation. To prove (a), we simply check that the right-hand side is in ℱ and has the same integral as aX + Y over all A ∈ ℱ. To prove (b), we observe that if A ∈ ℱ, then

∫_A E(X|ℱ) dP = ∫_A X dP ≤ ∫_A Y dP = ∫_A E(Y|ℱ) dP,

and applying this result to A = {E(X|ℱ) > E(Y|ℱ)}, we conclude that P(A) = 0, that is, E(X|ℱ) ≤ E(Y|ℱ).

With these two examples as a guide, you should be able to prove (c) through (e), but, for the moment, I want to discourage you from doing this. It is more important to understand the following properties of conditional expectation, which have no analogue for ordinary expectation, and I leave the proofs of these properties as recommended exercises.

(1) E(E(X|ℱ)) = E(X).

(2) If ℱ_1 ⊂ ℱ_2, then
(i) E(E(X|ℱ_1)|ℱ_2) = E(X|ℱ_1)
(ii) E(E(X|ℱ_2)|ℱ_1) = E(X|ℱ_1).

In words, the smaller σ-field always wins out.
(3) If A ∈ ℱ and E|Y| < ∞, then E(1_A Y|ℱ) = 1_A E(Y|ℱ).

By using linearity and taking limits, we can easily extend this to:
(4) If X ∈ ℱ and E|Y|, E|XY| < ∞, then

E(XY|ℱ) = X E(Y|ℱ).
From (4) (and (d)), we get a geometric interpretation of the conditional expectation.

(5) Suppose that EX² < ∞. Then L²(ℱ_0) = {Y ∈ ℱ_0 : EY² < ∞} is a Hilbert space, and L²(ℱ) is a closed subspace. In this case, E(X|ℱ) is the projection of X onto L²(ℱ), or, in statistical terms, E(X|ℱ) is the random variable Y ∈ ℱ that minimizes the mean square error E(X − Y)².
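The minimum mean square error characterization in (5) is easy to see in a simulation. In the sketch below (my example, not the text's), X = Z + W with Z, W independent standard normals, so E(X|σ(Z)) = Z, and Z beats any other σ(Z)-measurable guess:

```python
import numpy as np

rng = np.random.default_rng(4)
n = 10**6
z = rng.normal(size=n)
w = rng.normal(size=n)
x = z + w                          # X = Z + W, so E(X | sigma(Z)) = Z

# Compare mean square errors of several sigma(Z)-measurable predictors
for y in (z, 0.9 * z, z + 0.1):
    print(np.mean((x - y) ** 2))   # smallest (about EW^2 = 1) for y = z
```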
A.4 Martingales

Let ℱ_n be an increasing sequence of σ-fields. A sequence X_n is said to be adapted to ℱ_n if X_n ∈ ℱ_n for all n. If X_n is adapted, E|X_n| < ∞ for all n, and

E(X_{n+1}|ℱ_n) = X_n,

then X_n is said to be a martingale. If, in the last definition, = is replaced by ≤ or ≥, then X_n is said to be a supermartingale or submartingale, respectively.
To give an example of a martingale, consider successive tosses of a fair coin, and let ξ_n = 1 if the nth toss is heads and ξ_n = −1 if the nth toss is tails. Let S_n = ξ_1 + ... + ξ_n. S_n is called the (symmetric) simple random walk. It represents the amount of money a gambler has after the nth toss if each time the coin is tossed he bets one dollar on the coin coming up heads. Let ℱ_n = σ(ξ_1, ..., ξ_n). As we mentioned before, we think of ℱ_n as giving the information we have at time n, which in this case is the outcomes of the first n tosses. S_n is a martingale with respect to ℱ_n. To prove this, we observe that S_n ∈ ℱ_n, E|S_n| < ∞, and, since ξ_{n+1} is independent of ℱ_n by (3) in Section A.2, we have

E(S_{n+1}|ℱ_n) = E(S_n|ℱ_n) + E(ξ_{n+1}|ℱ_n) = S_n + Eξ_{n+1} = S_n.

If the successive tosses have P(ξ_n = 1) < 1/2, then the computation above shows that S_n is a supermartingale. Since S_n corresponds to betting on an unfavorable game, we see that there is nothing "super" about a supermartingale. The name comes, instead, from the fact that superharmonic functions, when composed with Brownian motion, give rise to supermartingales.
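A quick simulation (mine, not the text's) illustrates the martingale property of the simple random walk: averaging S_{n+1} over an event in ℱ_n reproduces the average of S_n over the same event:

```python
import numpy as np

rng = np.random.default_rng(5)
paths, n = 100_000, 20
xi = rng.choice([-1, 1], size=(paths, n))   # fair coin tosses
s = xi.cumsum(axis=1)                       # S_1, ..., S_n along each row

# {S_10 > 0} is in F_10, and E(S_11 ; S_10 > 0) = E(S_10 ; S_10 > 0)
up = s[:, 9] > 0
print(s[up, 9].mean(), s[up, 10].mean())    # nearly equal
```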
For what follows, we will need a few simple facts about martingales, the proofs of which we leave as exercises for the reader.

(1) If X_n is a martingale and m < n, then E(X_n|ℱ_m) = X_m.

(2) If X_n is a martingale and φ is a convex function with E|φ(X_n)| < ∞ for all n, then φ(X_n) is a submartingale.
(3) If X_n is a submartingale and φ is an increasing convex function with E|φ(X_n)| < ∞ for all n, then φ(X_n) is a submartingale.

(4) Orthogonality of Martingale Increments. If X_n is a martingale with EX_n² < ∞ for all n and ℓ ≤ m ≤ n, then

E((X_n − X_m)X_ℓ) = 0.

Since the proof of (4) is a classic example of manipulating conditional expectations, we will give the proof and let the reader justify the steps:

E((X_n − X_m)X_ℓ) = E[E((X_n − X_m)X_ℓ | ℱ_m)]
= E[X_ℓ E(X_n − X_m | ℱ_m)]
= E[X_ℓ (E(X_n|ℱ_m) − X_m)] = 0.

From the proof above, it is immediate that we have:

(5) Under the hypotheses of (4), E((X_n − X_m)²|ℱ_m) = E(X_n²|ℱ_m) − X_m².
A.5 Gambling Systems and the Martingale Convergence Theorem

Let ℱ_n be an increasing sequence of σ-fields. H_n is said to be predictable if H_n ∈ ℱ_{n−1} for all n ≥ 1. In words, the value of the process at time n may be predicted (with certainty) from the information available at time n − 1. You should think of H_n as the amount of money a gambler bets at time n. This amount can be based on the outcomes at times 1, ..., n − 1, but it cannot depend on the outcome at time n.

Once we start thinking of H_n as a gambling system, it is natural to ask how much money we would win if we used it. For concreteness, let us suppose that the game consists of flipping a coin and that for each dollar bet we win one dollar when the coin comes up heads and lose one dollar when the coin comes up tails (most games in casinos reduce to this situation when you ignore all the ritual). Let S_n be the net amount of money we would have won at time n if we had bet one dollar each time. If we bet according to a gambling system H, then our net winnings at time n would be
(H · S)_n = Σ_{m=1}^n H_m(S_m − S_{m−1}),

since S_m − S_{m−1} = +1 or −1 according as the mth toss results in a win or a loss.
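Computing (H · S)_n is a one-liner. The strategy below (an arbitrary illustration) bets one dollar whenever the walk is below 0; note that H_m looks only at S_0, ..., S_{m−1}, which is exactly what predictability requires:

```python
import numpy as np

rng = np.random.default_rng(6)
n = 1000
xi = rng.choice([-1, 1], size=n)
s = np.concatenate([[0], xi.cumsum()])       # S_0, S_1, ..., S_n

h = (s[:-1] < 0).astype(float)               # H_m depends only on S_{m-1}
winnings = np.cumsum(h * (s[1:] - s[:-1]))   # (H . S)_1, ..., (H . S)_n
print(winnings[-1])
```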
The next result is the most basic fact about gambling systems and is, apparently, little known. It says that there is no system for beating an unfavorable game.

(1) Let X_n be a supermartingale. If H_n ≥ 0 is predictable and each H_n is bounded, then (H · X)_n is a supermartingale.

Proof

E((H · X)_{n+1}|ℱ_n) = (H · X)_n + E(H_{n+1}(X_{n+1} − X_n)|ℱ_n)
= (H · X)_n + H_{n+1} E(X_{n+1} − X_n|ℱ_n) ≤ (H · X)_n,

since E(X_{n+1} − X_n|ℱ_n) ≤ 0 and H_{n+1} ≥ 0.
Remark: The same result is obviously also valid for submartingales and for martingales (and in the second case without the restriction H_n ≥ 0). To keep from being repetitious, we will state our results for only one type of process and leave it to the reader to translate the result to the other two.

In my remark preceding (1), I did not mean to dismiss gambling systems as worthless. There is one system that allows us to prove the martingale convergence theorem. Let h > 0, let N_0 = 0, and for k ≥ 1 let

N_{2k−1} = inf{m > N_{2k−2} : X_m ≤ a}
N_{2k} = inf{m > N_{2k−1} : X_m ≥ a + h}.

Let H_m = 1 if N_{2k−1} < m ≤ N_{2k} for some k, and H_m = 0 otherwise, and let

U_n = sup{k : N_{2k} ≤ n}.

Since X(N_{2k−1}) ≤ a and X(N_{2k}) ≥ a + h, between times N_{2k−1} and N_{2k}, X crosses from ≤ a to ≥ a + h. H_m is a gambling system that tries to take advantage of the "upcrossings." In stock-market terms, we buy when X_m ≤ a and sell when X_m ≥ a + h. In this way, every time an upcrossing is completed, we make a profit of at least h. Last but not least, U_n is the number of upcrossings completed by time n.
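The bookkeeping in the definitions of N_k, H_m, and U_n amounts to a simple buy/sell loop. A sketch (the random path and the interval [0, 3] are arbitrary choices):

```python
import numpy as np

rng = np.random.default_rng(7)
path = np.concatenate([[0], rng.choice([-1, 1], size=200).cumsum()])

def upcrossings(x, a, h):
    """Count completed upcrossings of [a, a + h] by the sequence x."""
    count, holding = 0, False
    for value in x:
        if not holding and value <= a:
            holding = True        # "buy": a time N_{2k-1}
        elif holding and value >= a + h:
            holding = False       # "sell": a time N_{2k}; upcrossing completed
            count += 1
    return count

print(upcrossings(path, a=0, h=3))
```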
(2) The Upcrossing Inequality. If X_n is a submartingale, then

h EU_n ≤ E(X_n − a)^+ − E(X_0 − a)^+.

Proof  Let Y_n = (X_n − a)^+. Since Y_n is a submartingale that upcrosses [0, h] the same number of times that X_n upcrosses [a, a + h], it suffices to prove the result when a = 0 and X_n ≥ 0. In this case, we have that
h U_n ≤ (H · X)_n,

since a final incomplete upcrossing (if there is one) makes a nonnegative contribution to the right-hand side. Let K_m = 1 − H_m. Then X_n − X_0 = (H · X)_n + (K · X)_n, and it follows from (1) that E(K · X)_n ≥ E(K · X)_0 = 0, so E(H · X)_n ≤ E(X_n − X_0), proving (2) in the special case and, hence, in general.
Remark: We have proved the result above in its classical form even though this approach is a little misleading. The key fact is that E(K · X)_n ≥ 0; that is, even by buying high and selling low, we cannot lose money on a submartingale, or, in other words, it is the reluctance of submartingales to go from above a + h to below a that limits the number of upcrossings.

From the upcrossing inequality, we easily obtain:

(3) The Martingale Convergence Theorem. If X_n is a submartingale with sup_n EX_n^+ < ∞, then as n → ∞, X_n converges almost surely to a limit X with E|X| < ∞.
Proof  Since (X_n − a)^+ ≤ X_n^+ + |a|, (2) implies that

EU_n ≤ (|a| + EX_n^+)/h,

so if we let U = lim U_n be the number of upcrossings of [a, a + h] by the whole sequence, then EU < ∞ and, hence, U < ∞ a.s. Since this result holds for all rational a and h,

∪_{a,h∈Q} {lim inf_{n→∞} X_n < a < a + h < lim sup_{n→∞} X_n}

has probability 0, and

lim sup_{n→∞} X_n = lim inf_{n→∞} X_n  a.s.,

which implies that X = lim X_n exists a.s. Fatou's lemma guarantees that EX^+ ≤ lim inf EX_n^+ < ∞, so X < ∞ a.s. To see that X > −∞, we observe that EX_n^− = EX_n^+ − EX_n ≤ EX_n^+ − EX_0 (since X_n is a submartingale), and another application of Fatou's lemma shows that EX^− ≤ lim inf EX_n^− < ∞.

From (3), it follows immediately that we have:
(4) If X_n ≤ 0 is a submartingale, then as n → ∞, X_n → X a.s.

The last two results are easy to rationalize. Submartingales are like increasing sequences of real numbers: if they are bounded above, they must converge almost surely. The next example shows that they need not converge in L¹.
Example 1 (Double or nothing)  Suppose that we are betting on a symmetric simple random walk and we use the following system:

H_n = 2^{n−1} on {S_{n−1} = n − 1}, and H_n = 0 otherwise.
In words, we start by betting one dollar on heads. If we win, we add our winnings to our original bet and bet everything again. When we lose, we lose everything and quit playing. Let X_n = 1 + (H · S)_n. From (1), it follows that X_n is a martingale. The definition implies that X_n ≥ 0, so E|X_n| = EX_n = EX_0 = 1, but it is easy to see that P(X_n > 0) = 2^{−n}, so X_n → 0 a.s. as n → ∞. This is a very important example to keep in mind as you read the next three sections.
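Example 1 is easy to simulate; the sketch below checks that EX_n = 1 for every n even though X_n → 0 a.s.:

```python
import numpy as np

rng = np.random.default_rng(8)
paths, n = 10**6, 10
xi = rng.choice([-1, 1], size=(paths, n))

# X_n = 2^n while every toss so far is heads, and 0 after the first tails
all_heads = np.cumprod(xi == 1, axis=1).astype(bool)
x_n = np.where(all_heads[:, -1], 2.0**n, 0.0)

print(x_n.mean())          # EX_n = 1 for every n
print((x_n > 0).mean())    # P(X_n > 0) = 2^{-n}, so X_n -> 0 a.s.
```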
A.6 Doob's Inequality, Convergence in L^p, p > 1

A random variable N is said to be a stopping time if {N = n} ∈ ℱ_n for all n < ∞.
If you think of N as the time a gambler stops gambling, then the condition above says that the decision to stop at time n must be measurable with respect to the information available at that time. The following is an important property of stopping times:

(1) If X_n is a submartingale and N is a stopping time, then X_{N∧n} is a submartingale.

Proof  Let H_n = 1_{(N ≥ n)}. Since {N ≥ n} = {N ≤ n − 1}^c ∈ ℱ_{n−1}, H_n is predictable, and it follows from (1) of the last section that (H · X)_n = X_{N∧n} − X_0 is a submartingale.
(2) If X_n is a submartingale and N is a stopping time with P(N ≤ k) = 1, then

EX_0 ≤ EX_N ≤ EX_k.

Proof  Since X_{N∧n} is a submartingale, it follows that EX_0 = EX_{N∧0} ≤ EX_{N∧k} = EX_N. To prove the right-hand inequality, let H_m = 1_{(N < m)}. Now {N < m} = {N ≤ m − 1} ∈ ℱ_{m−1}, so (H · X)_n = X_n − X_{N∧n} is a submartingale and

EX_k − EX_N = E(H · X)_k ≥ E(H · X)_0 = 0.

Remark: Let X_n be the martingale described in the last section and let N = inf{n : X_n = 0}. Then EX_0 = 1 > 0 = EX_N, so the first inequality need not hold for unbounded stopping times. In Section A.8, we will consider conditions that guarantee that EX_0 ≤ EX_N for unbounded N.
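For bounded stopping times, (2) can be checked by simulation. Below (an illustration with made-up parameters), a simple random walk is stopped when it first reaches +3, or at time k = 50 at the latest; since the walk is a martingale, ES_N should equal ES_1 = 0:

```python
import numpy as np

rng = np.random.default_rng(9)
paths, k = 10**5, 50
s = rng.choice([-1, 1], size=(paths, k)).cumsum(axis=1)

# N = first time the walk reaches +3, truncated at k, so P(N <= k) = 1
reached = s >= 3
first = reached.argmax(axis=1)            # first index with S >= 3 (0 if never)
first[~reached.any(axis=1)] = k - 1       # paths that never reach +3 stop at k
print(s[np.arange(paths), first].mean())  # near 0, as (2) predicts
```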
From (2), we immediately get:

(3) Doob's Inequality. If X_n is a submartingale and A = {max_{0≤m≤n} X_m ≥ λ}, then

λP(A) ≤ E(X_n ; A) ≤ EX_n^+.

Proof  Let N = inf{m : X_m ≥ λ or m ≥ n}. X_N ≥ λ on A and N = n on A^c, so it follows from (2) that

λP(A) ≤ E(X_N ; A) ≤ E(X_n ; A)

(observe that X_N = X_n on A^c). The second inequality in (3) is trivial.
Integrating the inequality in (3) gives:

(4) If X̄_n = max_{0≤m≤n} X_m^+, then for p > 1,

E(X̄_n^p) ≤ (p/(p−1))^p E((X_n^+)^p).
Proof  Fubini's theorem implies that

E(X̄_n^p) = ∫_0^∞ pλ^{p−1} P(X̄_n ≥ λ) dλ
≤ ∫_0^∞ pλ^{p−1} (λ^{−1} ∫_{{X̄_n ≥ λ}} X_n^+ dP) dλ
= ∫ X_n^+ (∫_0^{X̄_n} pλ^{p−2} dλ) dP
= (p/(p−1)) E(X_n^+ X̄_n^{p−1}).

If we let q = p/(p − 1) be the exponent conjugate to p and apply Hölder's inequality, we see that the above is

≤ q (E(X_n^+)^p)^{1/p} (E X̄_n^p)^{1/q}.

At this point, we would like to divide both sides of the inequality above by (E X̄_n^p)^{1/q} to prove (4). Unfortunately, the laws of arithmetic do not allow us to divide by something that may be ∞. To remedy this difficulty, we observe that P(X̄_n ∧ N ≥ λ) ≤ P(X̄_n ≥ λ), so repeating the proof above shows that

(E(X̄_n ∧ N)^p)^{1/p} ≤ q (E(X_n^+)^p)^{1/p},

and letting N → ∞ proves (4).
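For p = 2 the constant in (4) is (p/(p−1))^p = 4, and the inequality is easy to test on a simple random walk (a sketch; the walk and horizon are arbitrary choices):

```python
import numpy as np

rng = np.random.default_rng(10)
paths, n = 10**5, 100
s = rng.choice([-1, 1], size=(paths, n)).cumsum(axis=1)

max_plus = np.maximum(s, 0).max(axis=1)          # max of S_m^+ over m <= n
lhs = np.mean(max_plus ** 2)                     # E(max)^2
rhs = 4 * np.mean(np.maximum(s[:, -1], 0) ** 2)  # 4 E (S_n^+)^2
print(lhs, rhs)                                  # lhs <= rhs
```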
From (4), we get the following L^p convergence theorem:

(5) If X_n is a martingale, then for p > 1, sup_n E|X_n|^p < ∞ implies that X_n → X in L^p.

Proof  From the martingale convergence theorem, it follows that X_n → X a.s. Since |X_n| is a submartingale, (4) implies that sup_n |X_n| ∈ L^p, and it follows from the dominated convergence theorem that E|X_n − X|^p → 0.
Remark: Again, the martingale described at the end of the last section shows that this result is false for p = 1.
A.7 Uniform Integrability and Convergence in L¹

In this section, we will give conditions that guarantee that a martingale converges in L¹. The key to this is the following definition:
A collection of random variables {X_i, i ∈ I} is said to be uniformly integrable if

lim_{M→∞} sup_{i∈I} E(|X_i| ; |X_i| > M) = 0.

Uniformly integrable families can be very large.

(1) If X ∈ L¹, then {E(X|ℱ) : ℱ a σ-field contained in ℱ_0} is uniformly integrable.
Proof  If A_n is a sequence of sets with P(A_n) → 0, then the dominated convergence theorem implies that E(|X| ; A_n) → 0. From the last result, it follows that if ε > 0, we can pick δ > 0 such that if P(A) ≤ δ, then E(|X| ; A) ≤ ε. Pick M large enough so that E|X|/M ≤ δ. Jensen's inequality and the definition of conditional expectation imply that

E(|E(X|ℱ)| ; |E(X|ℱ)| > M) ≤ E(|X| ; E(|X| |ℱ) > M),

and we have that

P(E(|X| |ℱ) > M) ≤ E(E(|X| |ℱ))/M = E|X|/M ≤ δ,

so for this choice of M,

sup_ℱ E(|E(X|ℱ)| ; |E(X|ℱ)| > M) ≤ ε,

and since ε was arbitrary, the collection is uniformly integrable. Another common example is:
Exercise 1  Let φ ≥ 0 be any function with φ(x)/x → ∞ as x → ∞, for example, φ(x) = x^p for p > 1 or φ(x) = x log x. If Eφ(|X_i|) ≤ C for all i ∈ I, then {X_i, i ∈ I} is uniformly integrable.
(2) If X_n → X a.s., then the following statements are equivalent:

(a) {X_n, n ≥ 0} is uniformly integrable
(b) X_n → X in L¹
(c) E|X_n| → E|X| < ∞.

Proof  (a) ⇒ (b): Let

φ_M(x) = M for x > M,  φ_M(x) = x for −M ≤ x ≤ M,  φ_M(x) = −M for x < −M,

and observe that, by patiently checking the nine possible cases,

E|X_n − X| ≤ E|φ_M(X_n) − φ_M(X)| + E(|X_n| ; |X_n| > M) + E(|X| ; |X| > M).

As n → ∞, the first term → 0 by the bounded convergence theorem. If ε > 0 and M is large, then the second term ≤ ε, by uniform integrability. To bound the third term, we observe that uniform integrability implies that E|X| ≤ lim inf E|X_n| ≤ sup_n E|X_n| < ∞, so by choosing M larger if necessary we can make the third term ≤ ε as well. Combining the three estimates gives

lim sup_{n→∞} E|X_n − X| ≤ 2ε,

and since ε was arbitrary, this proves (b).
(b) ⇒ (c): |E|X_n| − E|X|| ≤ E||X_n| − |X|| ≤ E|X_n − X|.

(c) ⇒ (a): Let ψ_M(x) = x on [0, M − 1], ψ_M = 0 on [M, ∞), and let ψ_M be linear on [M − 1, M]. If M is large, E|X| − Eψ_M(|X|) ≤ ε/2. The bounded convergence theorem implies that Eψ_M(|X_n|) → Eψ_M(|X|), so using (c) we get that if n ≥ n_0,

E(|X_n| ; |X_n| > M) ≤ E|X_n| − Eψ_M(|X_n|) ≤ ε.

By choosing M larger, we can make E(|X_n| ; |X_n| > M) ≤ ε for 0 ≤ n ≤ n_0, so {X_n} is uniformly integrable.
We are now ready to state the main theorems of this section. Since we have already done all the work, the proofs are short. (3)
For a submartingale, the following statements are equivalent:
(a) it is uniformly integrable (b) it converges in L'.
Proof (a) => (b): Uniform integrability implies that supElXl < ou, which by (3) of Section A.5 implies that X -> Xa.s., which by (2) above implies that X -> X in L'. The converse, (b) (a), is a corollary of (2). (4)
(4) For a martingale, the following statements are equivalent:

(a) it is uniformly integrable
(b) it converges in L¹
(c) there is an integrable random variable X such that X_n = E(X|ℱ_n).

Proof  (a) ⇒ (b): This result follows from the proof given in (3). (b) ⇒ (c): Let X = lim X_n. If m > n, then E(X_m|ℱ_n) = X_n, so if A ∈ ℱ_n,

E(X_m ; A) = E(X_n ; A).

As m → ∞, X_m 1_A → X 1_A in L¹, so we have that E(X ; A) = E(X_n ; A) for all A ∈ ℱ_n, and it follows that X_n = E(X|ℱ_n). (c) ⇒ (a): This result follows from (1) above.
A.8 Optional Stopping Theorems

In this section, we will prove a number of results that allow us to conclude that if X_n is a submartingale and M ≤ N are stopping times, then EX_M ≤ EX_N. The first step is to show:

(1) If X_n is a uniformly integrable submartingale, then for any stopping time N, X_{N∧n} is uniformly integrable.
Proof  We begin by observing that X_{N∧n}^+ is a submartingale with EX_{N∧n}^+ ≤ EX_n^+ (since X_n^+ is a submartingale), so

sup_n EX_{N∧n}^+ ≤ sup_n EX_n^+ < ∞

(since X_n → X_∞ in L¹), and it follows from the martingale convergence theorem that X_{N∧n} → X_N a.s. and E|X_N| < ∞. With this result established, the rest is easy, since

E(|X_{N∧n}| ; |X_{N∧n}| > M) ≤ E(|X_N| ; |X_N| > M) + E(|X_n| ; |X_n| > M)

and X_n is uniformly integrable.

From (1) it follows immediately that we have:

(2) If X_n is a uniformly integrable submartingale, then for any stopping time N ≤ ∞,

EX_0 ≤ EX_N ≤ EX_∞.
Proof  (2) in Section A.6 implies that EX_0 ≤ EX_{N∧n} ≤ EX_n. Letting n → ∞ and observing that (1) above and the results in Section A.7 imply that X_{N∧n} → X_N in L¹ and X_n → X_∞ in L¹ gives the desired result.

From (2), we get the following useful corollary:

(3) The Optional Stopping Theorem. If X_{N∧n} is a uniformly integrable submartingale, then for any stopping time M ≤ N, EX_M ≤ EX_N.
We have given (3) a name since it is the basic result in this section, and it is usually the result we are referring to when we use the words "optional stopping theorem" in the text. In applying (3), the following fact is useful:

(4) If X_n is a submartingale and N ≤ ∞ is a stopping time with (a) E|X_N| < ∞ and (b) E(|X_n| ; N > n) → 0, then X_{N∧n} is uniformly integrable.
Proof

E(|X_{N∧n}| ; |X_{N∧n}| > M) ≤ E(|X_N| ; |X_N| > M) + E(|X_n| ; N > n, |X_n| > M).

Let ε > 0. If we pick n_0 large enough, then E(|X_n| ; N > n) ≤ ε/2 for all n ≥ n_0. Having done this, we can pick M large enough so that E(|X_N| ; |X_N| > M) ≤ ε/2 and, for 0 ≤ n ≤ n_0, E(|X_n| ; |X_n| > M) ≤ ε/2, so it follows from the first inequality that

E(|X_{N∧n}| ; |X_{N∧n}| > M) ≤ ε

for all n, and hence X_{N∧n} is uniformly integrable.
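A classical application of (3) is the gambler's ruin computation: if S_n is a simple random walk stopped on exiting (−a, b), the stopped martingale is bounded, hence uniformly integrable, and ES_N = 0 forces P(S hits b before −a) = a/(a + b). A simulation (a = 5 and b = 10 are arbitrary parameters):

```python
import numpy as np

rng = np.random.default_rng(11)
a, b, paths = 5, 10, 20_000
wins = 0
for _ in range(paths):
    s = 0
    while -a < s < b:
        s += 2 * int(rng.integers(0, 2)) - 1   # a +/-1 step
    wins += (s == b)

print(wins / paths)    # simulated P(hit b before -a)
print(a / (a + b))     # value forced by optional stopping
```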
Finally, there is one stopping theorem that does not require uniform integrability:

(5) Optional Stopping Theorem. If X_n is a nonnegative supermartingale and N ≤ ∞ is a stopping time, then EX_0 ≥ EX_N, where X_∞ = lim X_n (which exists by (4) of Section A.5).

Proof  EX_0 ≥ EX_{N∧n}. The monotone convergence theorem implies that

E(X_N ; N < ∞) = lim_{n→∞} E(X_N ; N ≤ n),

and Fatou's lemma implies that

E(X_∞ ; N = ∞) ≤ lim inf_{n→∞} E(X_n ; N > n),

so adding the last two lines gives the desired result.

References
References
In the list of references below, we have for convenience shortened the standard abbreviations for the sources referred to most often:

AMS: American Mathematical Society
BAMS: Bulletin of the American Mathematical Society
CMP: Communications in Mathematical Physics
CPAM: Communications on Pure and Applied Mathematics
JFA: Journal of Functional Analysis
LMS: London Mathematical Society
LNM: Lecture Notes in Mathematics, Springer-Verlag, New York
MAA: Mathematical Association of America
PAMS: Proceedings of the American Mathematical Society
PJM: Pacific Journal of Mathematics
Sem.: Séminaire de Probabilités (Strasbourg)
TAMS: Transactions of the American Mathematical Society
TPA: Theory of Probability and Its Applications
ZfW: Zeitschrift für Wahrscheinlichkeitstheorie und Verwandte Gebiete
Agmon, S. (1965). Lectures on elliptic boundary value problems. Van Nostrand, New York. (1982). Lectures on exponential decay of solutions of second-order elliptic equations. Mathematical Notes. Princeton Univ. Press, Princeton. Ahlfors, L. (1966). Complex analysis. 2d ed. McGraw-Hill, New York.
Aizenman, M., and B. Simon (1982). Brownian motion and a Harnack inequality for Schrödinger operators. CPAM 35:209-273. Ash, J. M., ed. (1976). Studies in harmonic analysis. MAA Studies in Mathematics, vol. 13. MAA, Washington, D.C.
Austin, D. G. (1966). A sample function property of martingales. Ann. Math. Stat. 37:1396-1397. Bachelier, L. (1900). Théorie de la spéculation. Ann. Sci. École Norm. Sup. 17:21-86. Baernstein, A., II (1978). Some sharp inequalities for conjugate functions. Indiana Math. J. 27:833-852.
Bary, N. K. (1964). A treatise on trigonometric series. Pergamon Press, New York. Bernard, A., and B. Maisonneuve (1977). Décomposition atomique de martingales de la classe H¹. In Sem. XII, pp. 303-323. Springer LNM 649. Bers, L., F. John, and M. Schechter (1964). Partial differential equations. AMS, Providence, RI. Berthier, A., and B. Gaveau (1978). Critère de convergence des fonctionnelles de Kac et applications en mécanique et en géométrie. JFA 29:416-424. Billingsley, P. (1968). Convergence of probability measures. John Wiley, New York. Blumenthal, R. M., and R. K. Getoor (1968). Markov processes and their potential theory. Academic Press, New York. Breiman, L. (1968). Probability. Addison-Wesley, New York. Brelot, M., and J. L. Doob (1963). Limites angulaires et limites fines. Ann. Inst. Fourier (Grenoble) 13:395-415. Brossard, J. (1975). Thèse de troisième cycle. (1976). Comportement nontangentiel et comportement Brownien des fonctions harmoniques dans un demi-espace: démonstration probabiliste d'un théorème de Calderon et Stein. In Sem. XII, pp. 378-397. Springer LNM 649. Burkholder, D. L. (1962). Successive conditional expectation of an integrable function. Ann. Math. Stat. 33:887-893. (1964). Maximal inequalities as necessary conditions for almost everywhere convergence. ZfW 3:75-88. (1966). Martingale transforms. Ann. Math. Stat. 37:1494-1505. (1970). Martingale inequalities. In Martingales, ed. H. Dinges, pp. 1-8. Springer LNM 190. (1973). Distribution function inequalities for martingales. Ann. Prob. 1:19-42. (1975). One-sided maximal functions and H^p. JFA 18:429-454. (1976). Harmonic analysis and probability. In Studies in harmonic analysis, ed. Ash (1976), pp. 136-149. (1977a). Brownian motion and classical analysis. Proc. Symp. Pure Math. 31:5-14. (1977b). Exit times of Brownian motion, harmonic majorization and Hardy spaces. Adv. in Math. 26:182-205. (1978). Boundary value estimation of the range of an analytic function. Michigan Math. J. 25:197-211. (1979a). Martingale theory and harmonic analysis in Euclidean spaces. In Harmonic analysis in Euclidean space, ed. Weiss and Wainger (1979). (1979b). A sharp inequality for martingale transforms. Ann. Prob. 7:858-863. (1981). A geometrical characterization of Banach spaces in which martingale difference sequences are unconditional. Ann. Prob. 9:997-1011. (1982). A nonlinear partial differential equation and the unconditional constant of the Haar system in L^p. BAMS (New Series) 7:591-595. (in press). Boundary value problems and sharp inequalities for martingale transforms. Ann. Prob.
Burkholder, D. L., B. Davis, and R. F. Gundy (1972). Integral inequalities for convex functions of operators and martingales. In Proceedings of the Sixth Berkeley Symposium, vol. II, pp. 223-240. Burkholder, D. L., and R. F. Gundy (1970). Extrapolation and interpolation of quasilinear operators on martingales. Acta Math. 124:249-304.
(1972). Distribution function inequalities for the area integral. Studia Math. 44:527-544.
(1973). Boundary behavior of harmonic functions in a half space and Brownian motion. Ann. Inst. Fourier (Grenoble) 23:195-212. Burkholder, D. L., R. F. Gundy, and M. L. Silverstein (1971). A maximal function characterization of the class H^p. TAMS 157:137-153. Calderon, A. P. (1950a). On the behavior of harmonic functions at the boundary. TAMS 68:47-54. (1950b). On a theorem of Marcinkiewicz and Zygmund. TAMS 68:55-61. (1950c). On the theorems of M. Riesz and Zygmund. PAMS 1:533-535. (1966). Singular integrals. BAMS 72:426-465. Calderon, A. P., and A. Zygmund (1964). On higher gradients of harmonic functions. Studia Math. 24:211-226. Cameron, R. H., and W. T. Martin (1944a). Transformation of Wiener integrals under translations. Ann. Math. 45:386-396. (1944b). The Wiener measure of Hilbert neighborhoods in the space of real continuous functions. J. Math. Phys. 23:195-209. (1945a). Evaluations of various Wiener integrals by use of Sturm-Liouville differential equations. BAMS 51:73-90. (1945b). Transformation of Wiener integrals under a general class of linear transformations. TAMS 58:184-219. (1949). The transformation of Wiener integrals by nonlinear transformations. TAMS 66:253-283. Carleson, L. (1958). An interpolation problem for bounded analytic functions. Amer. J. Math. 80:921-930. (1961). On the existence of boundary values for harmonic functions in several variables. Archiv für Math. 4:393-399. (1962). Interpolations by bounded analytic functions and the corona problem. Ann. Math. 76:547-559. Carmona, R. (1978). Pointwise bounds for Schrödinger eigenstates. CMP 62:97-106. (1979a). Processus de diffusion gouverné par la forme de Dirichlet de l'opérateur de Schrödinger. In Sem. XIII, pp. 557-569. Springer LNM 721.
(1979b). Regularity properties of Schrödinger and Dirichlet semigroups. JFA 33:259-296. Carmona, R., and B. Simon (1981). Pointwise bounds on eigenfunctions and wave packets in n-body quantum systems V. CMP 80:59-98. Chao, J. A., and M. H. Taibleson (1973). A subregularity inequality for conjugate systems on local fields. Studia Math. 46:249-257. Chung, K. L. (1974). A course in probability theory. 2d ed. Academic Press, New York. (1976). Excursions in Brownian motion. Archiv für Math. 14:155-177. (1979). On stopped Feynman-Kac functionals. In Sem. XIV, pp. 347-356. Springer LNM 784. (1981). Feynman-Kac functional and the Schrödinger equation. In Seminar on Stochastic Processes I, ed. E. Cinlar, K. L. Chung, and R. Getoor. Birkhäuser, Boston. (1982). Lectures from Markov processes to Brownian motion. Springer-Verlag, New York. Chung, K. L., R. Durrett, and Z. Zhongxin (1983). Extension of domains with finite gauge. Math. Annalen 264:78-79. Chung, K. L., and P. Li (1983). Comparison of probability and eigenvalue methods for the Schrödinger equation. Adv. in Math. Chung, K. L., and K. M. Rao (1980). Sur la théorie du potentiel avec la fonctionnelle de
Feynman-Kac. C. R. Acad. Sci. Paris 290A:629-631. (1981). Potential theory with the Feynman-Kac functional. Prob. Math. Stat. 1. Chung, K. L., and S. R. S. Varadhan (1979). Positive solutions of the Schrödinger equation in one dimension. Studia Math. 68:249-260. Ciesielski, Z., and S. J. Taylor (1962). First passage times and sojourn times for Brownian motion in space and exact Hausdorff measure. TAMS 103:434-450. Coifman, R. R. (1974). A real variable characterization of H^p. Studia Math. 51:269-274. Coifman, R. R., and C. Fefferman (1974). Weighted norm inequalities for maximal functions and singular integrals. Studia Math. 51:241-250.
Coifman, R. R., and R. Rochberg (1980). Another characterization of BMO. PAMS 79:249-254. Coifman, R. R., and G. Weiss (1977). Extensions of Hardy spaces and their use in analysis. BAMS 83:569-645. Conway, J. B. (1978). Functions of one complex variable. 2d ed. Springer-Verlag, New York. Courant, R., and D. Hilbert (1953, 1962). Methods of mathematical physics. Vols. I, II. Interscience, New York. Davis, B. (1968). Comparison tests for martingale convergence. Ann. Math. Stat. 39:21412144.
(1969). A comparison test for martingale inequalities. Ann. Math. Stat. 40:505-508.
(1970). On the integrability of the martingale square function. Israel J. Math. 8:187-190. (1973a). On the distributions of conjugate functions of nonnegative measures. Duke Math. J. 40:695-700. (1973b). An inequality for the distribution of the Brownian gradient function. PAMS 37:189-194. (1974). On the weak type (1, 1) inequality for conjugate functions. PAMS 44:307311.
(1975). Picard's theorem and Brownian motion. TAMS 213:353-362. (1976). On Kolmogorov's inequalities ‖f̃‖_p ≤ C_p‖f‖_1, 0 < p < 1. TAMS 222:179-192.
(1979a). Applications of the conformal invariance of Brownian motion. In Harmonic analysis in Euclidean space, ed. Weiss and Wainger (1979). (1979b). Brownian motion and analytic functions. Ann. Prob. 7:913-932. (1980). Hardy spaces and rearrangements. TAMS 261:211-233. Deift, P., W. Hunziker, B. Simon, and E. Vock (1978). Pointwise bounds on eigenfunctions
and wave packets in n-body quantum systems IV. CMP 64:1-34. Dellacherie, C. (1980). Un survol de la théorie de l'intégrale stochastique. Stoch. Proc. Appl. 10:115-144. Dellacherie, C., and P. A. Meyer (1978). Probabilities and potential. English translation of Probabilités et potentiel. North Holland, Amsterdam. Doleans-Dade, C. (1970). Quelques applications de la formule de changement de variables pour les semimartingales. ZfW 16:181-194.
Donsker, M. D., and M. Kac (1950). A sampling method for determining the lowest eigenvalue and the principal eigenfunction of Schrodinger's equation. J. Res. Nat. Bureau of Standards 44:551-557. Donsker, M. D., and S. R. S. Varadhan (1974). Asymptotic evaluation of certain Wiener integrals for large time. In Proceedings of the International Conference on Function Space Integration. Oxford Univ. Press. (1975a). Asymptotics for the Wiener sausage. CPAM 28:525-566; errata, p. 677.
(1975b). On a variational formula for the principal eigenvalue for operators with a maximum principle. Proc. Nat. Acad. Sci., USA 72:780-783. (1975-1976). Asymptotic evaluation of certain Markov process expectations for large time. CPAM (I) 28:1-47; (II) 28:279-301; (III) 29:389-461. (1976). On the principal eigenvalue of second-order elliptic differential operators. CPAM 29:595-622. Doob, J. L. (1953). Stochastic processes. John Wiley, New York. (1954). Semimartingales and subharmonic functions. TAMS 77:86-121. (1955a). Martingales and one-dimensional diffusion. TAMS 78:168-208. (1955b). A probability approach to the heat equation. TAMS 80:216-280. (1956). Probability methods applied to the first boundary value problem. In Proceedings of the Third Berkeley Symposium, vol. II, pp. 49-80. (1957). Conditional Brownian motion and the boundary limits of harmonic functions. Bull. Soc. Math. France 85:431-458. (1958a). Boundary limit theorems for a half-space. J. Math. Pures Appl. 37:385-392. (1958b). Probability theory and the first boundary value problem. Illinois J. Math. 2:19-36. (1959). A nonprobabilistic proof of the relative Fatou theorem. Ann. Inst. Fourier (Grenoble) 9:293-300. (1960). Relative limit theorems in analysis. J. Analyse Math. 8:289-306. (1961). Conformally invariant cluster value theory. Illinois J. Math. 5:521-549. (1962). Boundary properties of functions with finite Dirichlet integrals. Ann. Inst. Fourier (Grenoble) 12:573-622. (1963). One-sided cluster-value theorems. Proc. LMS 13:461-470. (1964). Some classical function theory theorems and their modern versions. Ann. Inst. Fourier (Grenoble) 15:113-136. (1966). Remarks on the boundary limits of harmonic functions. J. SIAM Numerical Analysis 3:229-235. Dubins, L. E., and D. Gilat (1978). On the distribution of maxima of martingales. PAMS 68:337-338. Dubins, L. E., and G. Schwarz (1965). On continuous martingales. Proc. Nat. Acad. Sci., USA 53:913-916.
Dunford, N., and J. T. Schwarz (1957). Linear operators, Part I: General theory. Interscience, New York. Duren, P. L. (1970). The theory of H" spaces. Academic Press, New York. Duren, P. L., B. W. Romberg, and A. L. Shields (1969). Linear functionals on HP spaces with 0 < p < 1. J. Reine Angew. Math. 238: 32-60.
Durrett, R. (1982). A new proof of Spitzer's result on the winding of two-dimensional Brownian motion. Ann. Prob. 10:244-246. Dvoretsky, A., P. Erdos, and S. Kakutani (1961). Nonincreasing everywhere of the Brownian motion process. In Proceedings of the Fourth Berkeley Symposium, vol. II, pp. 103-116. Dynkin, E. B. (1960). Markov processes and related problems of analysis. Russian Math. Surveys 15, no. 2, pp. 1-24. (1963). Markov processes and problems in analysis. In Proceedings of the International Congress, Stockholm, 1962. AMS Translations, series 2, vol. 31, pp. 1-24, (1981). Markov processes and related problems of analysis Selected papers. LMS Lecture Notes. Cambridge Univ. Press, Cambridge.
Einstein, A. (1905). On the movement of small particles suspended in a stationary liquid demanded by the molecular kinetic theory of heat. Ann. Phys. 17. (1926). Investigations on the theory of the Brownian movement. Reprinted by Dover Books, New York, 1956. Fatou, P. (1906). Séries trigonométriques et séries de Taylor. Acta Math. 30:335-400. Fefferman, C. (1971). Characterization of bounded mean oscillation. BAMS 77:587-588. (1976). Harmonic analysis and H^p spaces. In Studies in harmonic analysis, ed. Ash (1976), pp. 38-75. Fefferman, C., and E. M. Stein (1972). H^p spaces in several variables. Acta Math. 129:137-193.
Feller, W. (1971). An introduction to probability theory and its applications. Vol. II. John Wiley, New York. Feynman, R. J. (1948). Space-time approach to nonrelativistic quantum mechanics. Rev. Mod. Phys. 20:367-387. Folland, G. B. (1976). Introduction to partial differential equations. Princeton Univ. Press, Princeton. Freedman, D. (1970). Brownian motion and diffusion. Holden-Day, San Francisco. Friedman, A. (1964). Partial differential equations of parabolic type. Prentice-Hall, Englewood Cliffs, NJ. (1969). Partial differential equations. Holt, Rinehart & Winston, New York. (1975). Stochastic differential equations and applications. Academic Press, New York. Garnett, J. (1979). Two constructions in BMO. In Harmonic analysis in Euclidean space, ed. Weiss and Wainger (1979). (1980). Bounded analytic functions. Academic Press, New York. Garnett, J., and P. Jones (1978). The distance in BMO to L'. Ann. Math. 108:373-393. (1982). BMO from dyadic BMO. PJM99:351-371. Garsia, A. M. (1970). Topics in almost everywhere convergence. Markham, Chicago.
(1973a). The Burgess Davis inequality via Fefferman's inequality. Ark. Mat. 11:229-237. (1973b). On a convex function inequality for martingales. Ann. Prob. 1:171-174. (1973c). Martingale inequalities: Seminar notes on recent progress. Benjamin, Reading, MA. Getoor, R. K., and M. J. Sharpe (1972). Conformal martingales. Invent. Math. 16:271-308. (1979). Excursions of Brownian motion and Bessel processes. ZfW 47:83-106. Gikhman, I. I., and A. V. Skorohod (1973). Stochastic differential equations. Springer-Verlag, New York. Gilbarg, D., and N. S. Trudinger (1977). Elliptic partial differential equations of second order. Springer-Verlag, New York. Girsanov, I. V. (1960). On transforming a certain class of stochastic processes by absolutely continuous change of measure. TPA 5:285-301. (1961). On Ito's stochastic integral equation. Soviet Math. 2:506-509. (1962). An example of nonuniqueness of the solution of K. Ito's stochastic integral equation. TPA 7:325-331.
Gundy, R. F. (1968). A decomposition for L' bounded martingales. Ann. Math. Slat. 39:134-138.
(1969). On the class LlogL, martingales, and singular integrals. Studia Math. 33:109-118. (1980a). Inequalites pour martingales a un et deux indices: l'espace H'. L`cole d'ete
de Probabilités de Saint-Flour, VIII. Springer LNM 774. (1980b). Local convergence of a class of martingales in multidimensional time. Ann. Prob. 8:607-614. Gundy, R. F., and E. M. Stein (1979). H^p theory for the polydisc. Proc. Nat. Acad. Sci., USA 76:1026-1029. Gundy, R. F., and N. Varopoulos (1976). A martingale that occurs in harmonic analysis. Ark. Mat. 14:179-187. (1979). Les transformations de Riesz et les intégrales stochastiques. C. R. Acad. Sci. Paris 289A:13-16. Gustafson, K. E. (1980). Introduction to partial differential equations. John Wiley, New York. Hardy, G. H. (1915). The mean value of the modulus of an analytic function. Proc. LMS 14:269-277. (1928). Remarks on three recent notes in the Journal. J. LMS 3:166-169. Hardy, G. H., and J. E. Littlewood (1926). Some new properties of Fourier constants. Math. Ann. 97:159-209. (1930). A maximal theorem with function-theoretic applications. Acta Math. 54:81-116. (1931). Some properties of conjugate functions. J. Reine Angew. Math. 167:405-423. Helson, H., and G. Szegö (1960). A problem in prediction theory. Ann. Mat. Pura Appl. 51:107-138. Herz, C. (1974a). Bounded mean oscillation and regulated martingales. TAMS 193:199-216. (1974b). H_p spaces of martingales, 0 < p ≤ 1. ZfW 28:189-205. Hoffman-Ostenhof, M. and T., and B. Simon (1980a). Brownian motion and a consequence of Harnack's inequality: nodes of quantum wave functions. PAMS 80:301-305.
(1980b). On the nodal structure of atomic eigenfunctions. J. Phys. A. 13:1131-1133. Hunt, G. A. (1956). Some theorems concerning Brownian motion. TAMS 81:294-391. Hunt, R. A., B. Muckenhoupt, and R. L. Wheeden (1973). Weighted norm inequalities for conjugate functions and Hilbert transforms. TAMS 176:227-251. Ikeda, N., and S. Watanabe (1981). Stochastic differential equations and diffusion processes. North Holland, Amsterdam. Ito, K. (1944). Stochastic integrals. Proc. Imp. Acad. Tokyo 20:519-524. (1946). On a stochastic integral equation. Proc. Japan Acad. 22:2, 32-35. (1950a). Brownian motions in a Lie group. Proc. Japan Acad. 26:8, 4-10. (1950b). Stochastic differential equations in a differentiable manifold. (I) Nagoya Math. J. 1:35-47; (1I) Mem. Coll. Sci. U. Kyoto, Series A, 28, 1, 81-85. (1951a). On a formula concerning stochastic differentials. Nagoya Math. J. 3:5565.
(1951b). On stochastic differential equations. Memoirs AMS, no. 4. (1960). Wiener integral and Feynman integral. In Proceedings of the Fourth Berkeley Symposium, vol. II, pp. 227-238. Ito, K., and H. McKean, Jr. (1964). Diffusion processes and their sample paths. SpringerVerlag, New York. Jacod, J. (1979). Calcul stochastique et problemes de martingales. Springer LNM 714.
Jacod, J., and J. Memin (1981). Weak and strong solutions of stochastic differential equations: Existence and stability. In Stochastic integrals, ed. Williams (1981), pp. 169-212.
Janson, S. (1977). Characterization of H' by singular integral transformations on martingales and R". Math. Scand. 41:140-152. (1979). Singular integrals on local fields and generalizations to martingales. In Harmonic analysis in Euclidean space, ed. Weiss and Wainger (1979).
(1981). BMO and commutators of martingale transforms. Ann. Inst. Fourier (Grenoble) 31:265-270. John, F. (1961). Rotation and strain. CPAM 14:391-413. (1982). Partial differential equations. 4th ed. Springer-Verlag, New York. John, F., and L. Nirenberg (1961). On functions of bounded mean oscillation. CPAM 14:415-426. Jones, P. (1980). Carleson measures and the Fefferman-Stein decomposition of BMO (R). Ann. Math. 111: 197-208. Kac, M. (1946). On the average of a certain Wiener functional and a related limit theorem in the calculus of probability. TAMS 59:401-414. (1949). On distributions of certain Wiener functionals. TAMS 65:1-13. (1951). On some connections between probability theory and differential and integral equations. In Proceedings of the Second Berkeley Symposium, pp. 189-215. (1953). An application of probability theory to the study of Laplace's equation. Ann. de Societe Math. Polonaise 25:122-130. (1959). Probability and related topics in physical science. Interscience, New York. (1966a). Can one hear the shape of a drum? Amer. Math. Monthly 73:1-23. (1966b). Wiener and integration in function spaces. BAMS 72:52-68. (1970). On some probabilistic aspects of classical analysis. Amer. Math. Monthly 77:586-597. (1972). On applying mathematics: Reflections and examples. Quart. J. Appl. Math. 30:17-29. (1974). A stochastic model related to the telegrapher's equation. Rocky Mountain
J.4:497-509. Kac, M., and J. M. Luttinger (1974). Probabilistic methods in scattering theory. Rocky Mountain J. 4:511-537. (1975). Scattering length and capacity. Ann. Inst. Fourier (Grenoble) 25:317-321. Kahane, J. P. (1976). Brownian motion and classical analysis. Bull. LMS 7:145-155. Kakutani, S. (1944a). On Brownian motion in n-space. Proc. Imp. Acad. Tokyo 20:648652.
(1944b). Two-dimensional Brownian motion and harmonic functions. Proc. Imp. Acad. Tokyo 20:706-714.
(1945). Markov processes and the Dirichlet problem. Proc. Imp. Acad. Tokyo 21:227-233. Kato, T. (1973). Schrodinger operators with singular potentials. Israel J. Math. 13:135148.
Katznelson, Y. (1968). An introduction to harmonic analysis. John Wiley, New York.
Kellog, O. D. (1929). Foundations of potential theory. Springer-Verlag. Reprinted by Dover Books, New York, 1954.
Khas'minskii, R. Z. (1959). On positive solutions of the equation Au + Vu = 0. TPA 4:309-318. Khintchine, A. (1933). Asymptotische Geseteze der Wahrscheinlichkeitsrechnung. Ergebn. Math., Berlin.
Knight, F. B. (1971). A reduction of continuous square-integrable martingales to Brownian motion. In Martingales, ed. H. Dinges, pp. 19-31. Springer LNM 190. (1981). Essentials of Brownian motion and diffusion. AMS, Providence, RI. Kolmogorov, A. (1925). Sur les fonctions harmoniques conjuguées et les séries de Fourier.
Fund. Math. 7:24-29. Koosis, P. (1980). Lectures on HP spaces. LMS Lecture Notes. Cambridge Univ. Press, Cambridge. Kunita, H., and S. Watanabe (1967). On square integrable martingales. Nagoya Math. J. 30:209-245. Ladyzenskaya, O. A., V. A. Solonnikov, and N. N. Ural'ceva (1968). Linear and quasilinear equations of parabolic type. AMS Translations of Mathematical Monographs 23. Latter, R. (1977). A decomposition of HP(R") in terms of atoms. Studia Math. 62:92-101. (1979). The atomic decomposition of Hardy spaces. In Harmonic analysis in Euclidean space, ed. Weiss and Wainger (1979).
Lebesgue, H. (1924). Conditions de régularité, conditions d'irrégularité, conditions d'impossibilité dans le problème de Dirichlet. C. R. Acad. Sci. Paris, pp. 349-354. Lenglart, E. (1977). Sur la convergence presque sûre des martingales locales. C. R. Acad. Sci. Paris 284A:1085-1088. Lepingle, D. (1978). Sur le comportement asymptotique des martingales locales. In Sem. XII, pp. 148-161. Springer LNM 649. Levy, P. (1939). Sur certains processus stochastiques homogènes. Compositio Math. 7:283-339.
(1940). Le mouvement Brownien plan. Amer. J. Math. 62:487-550. (1948). Processus stochastiques et mouvement Brownien. Gauthier-Villars, Paris. (1951). Wiener's random function. In Proceedings of the Second Berkeley Symposium, pp. 171-186. Lieb, E. (1976). Bounds on the eigenvalues of the Laplace and Schrodinger operators.
BAMS 82:751-753. Lieb, E., and B. Simon (1982). Pointwise bounds on eigenfunctions and wave packets in n-body quantum systems VI: Asymptotics in the two-cluster region. Adv. Appl. Math. 1:324-343. Littlewood, J. E. (1926). On a theorem of Kolmogorov. J. LMS 1:229-231. (1929). On a theorem of Zygmund. J. LMS 4:305-307. Loomis, L. H. (1946). A note on the Hilbert transform. BAMS 52:1082-1086. Lyons, T. J., and H. P. McKean (in press). Winding of plane Brownian motion. Preprint. Oxford Univ. Press, New York. McKean, H. P. (1969). Stochastic integrals. Academic Press, New York. (1977). −Δ plus a bad potential. J. Math. Phys. 18:1277-1279. Marcinkiewicz, J., and A. Zygmund (1938). A theorem of Lusin. Duke Math. J. 4:473-485. Messulam, P., and M. Yor (1982). On D. Williams' pinching method and some applications. J. LMS. Meyer, P. A. (1966). Probability and potentials. Blaisdell, Waltham, MA. (1976). Un cours sur les intégrales stochastiques. In Sem. X, pp. 245-400. Springer LNM 511. (1977). Le dual de H¹(R^d): démonstrations probabilistes. In Sem. XI, pp. 132-195. Springer LNM 581. Millar, P. W. (1968). Martingale integrals. TAMS 133:145-166. Moser, J. (1961). On Harnack's theorem of elliptic differential equations. CPAM 14:577-591.
Nevanlinna, F. and R. (1922). Uber die Eigenschaften analytischer Funktionen in der Umgebung einer singularen Stelle oder Linie. Acta Soc. Sci. Fenn. 50, no. 5. Neveu, J. (1975). Discrete parameter martingales. North Holland, Amsterdam. Olenik, O. A. (1966). Alcuni risultati sulle equazioni lineari e quasilineari ellitico-paraboliche a derivate parziali del second ordine. Rend. Classe Sci. Nat. A cad. Naz. Lincei, Series 8, 40:775-784.
Paley, R., N. Wiener, and A. Zygmund (1933). Note on random functions. Math. Z. 37:647-668. Paley, R. E. A. C., and A. Zygmund (1932). A note on analytic functions in the unit circle. P. Camb. Phil. Soc. 28:266-272. Petersen, K. E. (1977). Brownian motion, Hardy spaces, and bounded mean oscillation. LMS Lecture Notes. Cambridge Univ. Press, Cambridge. Pichorides, S. K. (1972). On the best values of the constants in the theorems of M. Riesz, Zygmund, and Kolmogorov. Studia Math. 46:165-179. Pitman, J. (1979). A note on LZ maximal inequalities. In Sem. XV, pp. 251-258. Springer LNM 850.
Plessner, A. (1928). Über das Verhalten analytischer Funktionen auf dem Rande des Definitionsbereiches. J. Reine Angew. Math. 158:219-227. Port, S., and C. Stone (1978). Brownian motion and classical potential theory. Academic Press, New York. Privalov, I. I. (1916). Sur les fonctions conjuguées. Bull. Soc. Math. France (1916), pp. 100-103.
(1924). Generalization of a theorem of Fatou. Mat. Sbornik 31:232-235. Protter, M. H., and H. F. Weinberger (1966). On the spectrum of general second-order equations. BAMS 72:251-255. (1967). Maximum principles in differential equations. Prentice-Hall, New York. Rao, K. M. (1969). On decomposition theorems of Meyer. Math. Scand. 24:66-78. Ray, D. (1954). On the spectra of second-order differential operators. TAMS 77:299-321. Riesz, F. (1923). Uber die Randwerte einer analytischen Funktion. Math. Z. 18:87-95. Riesz, M. (1927). Sur les fonctions conjuguees. Math. Z. 27:218-244. Rosenblatt, M. (1951). On a class of Markov processes. TAMS 71:120-135 Royden, H. (1968). Real analysis. 2d ed. Macmillan, New York. Sharpe, M. J. (1980). Local times and singularities of continuous local martingales. In Sem. XIV, pp. 76-101. Springer LNM 784. Simon, B. (1974-1975). Pointwise bounds on eigenfunctions and wave packets in n-body
quantum systems. (I) PAMS 42:395-401; (II) PAMS 45:454-456; (III) TAMS 208:317-329. (1979). Functional integration and quantum physics. Academic Press, New York. (1980). Brownian motion, L^p properties of Schrödinger operators and the localization of binding. JFA 35:215-229. (1981). Large time behavior of the L^p norm of Schrödinger semigroups. JFA 40:66-83. (1982). Schrödinger semigroups. BAMS (new series) 7:447-526. (1983). Instantons, double wells, and large deviations. BAMS (new series) 8:323-326.
Sion, M. (1958). On general minmax theorems. PJM 8:171-176. Skorohod, A. V. (1956). Limit theorems for stochastic processes. TPA 1:261-290. (1961). Existence and uniqueness of solutions to stochastic differential equations. Sibirsk. Mat. Zur. 2:129-137.
(1965). Studies in the theory of random processes. Addison-Wesley, Reading, MA. Spanne, S. (1966). Sur l'interpolation entre les espaces. Ann. Scuola Norm. Sup. Pisa 20:625-648. Spencer, D. C. (1943). A function theoretic identity. Amer. J. Math. 65:147-160. Spitzer, F. (1958). Some theorems concerning two-dimensional Brownian motion. TAMS 87:187-197. Spivak, M. (1979). A comprehensive introduction to differential geometry. Publish or Perish, Inc., Berkeley, CA.
Stein, E. M. (1961). On the theory of harmonic functions of several variables II. Acta Math. 106:137-174. (1962). Conjugate harmonic functions in several variables. In Proceedings of the International Congress of Mathematics, Stockholm, pp. 414-420. (1967). Singular integrals, harmonic functions and differentiability properties of functions of several variables. Proc. Symp. Pure Math. 10:316-335. (1969). A note on L log L. Studia Math. 31:305-310. (1970a). Singular integrals and differentiability properties of functions. Princeton Univ. Press, Princeton. (1970b). Topics in harmonic analysis related to the Littlewood-Paley theory. Annals of Mathematical Studies, no. 63. Princeton Univ. Press, Princeton. Stein, E. M., and G. Weiss (1959). An extension of a theorem of Marcinkiewicz and its applications. J. Math. Mech. 8:263-284. (1960). On the theory of harmonic functions of several variables I. Acta Math. 103:25-62. (1968). Generalization of the Cauchy-Riemann equations and representations of the rotation group. Amer. J. Math. 90:163-196. (1971). Introduction to Fourier analysis on Euclidean spaces. Princeton Univ. Press, Princeton. Stein, P. (1933). On a theorem of M. Riesz. J. LMS 8:242-247. Stroock, D. W. (1973). Application of Fefferman-Stein type interpolation to probability theory and analysis. CPAM 26:477-496. (1981). The Malliavin calculus and its applications. In Stochastic integrals, ed. D. Williams (1981), pp. 394-432.
(1983). The Malliavin calculus and its application to second-order parabolic equations. Math. Systems Theory 14:25-65.
Stroock, D. W., and S. R. S. Varadhan (1969). Diffusion processes with continuous coefficients. CPAM 22: (I) 345-400, (II) 479-530.
(1970). On the support of diffusion processes with applications to the strong maximum principle. In Proceedings of the Sixth Berkeley Symposium, vol. III, pp. 333-360. (1972a). On degenerate elliptic parabolic operators of second order and their associated diffusions. CPAM 25:651-713. (1972b). Diffusion processes. In Proceedings of the Sixth Berkeley Symposium, vol. III, pp. 361-368. (1974). A probabilistic approach to H"(Re). TAMS 192:245-260. (1979). Multidimensional diffusion processes. Springer-Verlag, New York. Taibleson, M. H. (1975). Fourier analysis on local fields. Mathematical Notes. Princeton Univ. Press, Princeton. (1979). An introduction to Hardy spaces on local fields. In Harmonic analysis in Euclidean space, ed. Weiss and Wainger (1979).
Taibleson, M. H. and G. Weiss (1979). The molecular characterization of Hardy spaces. In Harmonic analysis in Euclidean space, ed. Weiss and Wainger (1979). Tanaka, H. (1964). Existence of diffusions with continuous coefficients. Mem. Fac. Sci. Kyushu, Series A 18:89-103. Tsirel'son, B. S. (1976). An example of a stochastic differential equation having no strong solution. TPA 20:416-418. Uchiyama, A. (1982). A constructive proof of the Fefferman-Stein decomposition of BMO(R"). Acta Math. 148:215-241. (1983). A constructive proof of the Fefferman-Stein decomposition of BMO on simple martingales. In Conference on harmonic analysis in honor of Antoni Zygmund,
vol. II, ed. Beckner, Calderon, Fefferman, and Jones, pp. 495-505. Wadsworth, Belmont, CA. Varopoulos, N. (1977a). BMO functions and the ∂̄ equation. PJM 71:221-273. (1977b). A remark on BMO and bounded harmonic functions. PJM 73:257-259. (1979). A probabilistic proof of the Garnett-Jones theorem on BMO. PJM 90:201-221.
(1980a). Aspects of probabilistic Littlewood-Paley theory. JFA 38:25-60. (1980b). The Helson-Szego theorem and Ap functions for Brownian motion and several variables. JFA 39:85-121.
Walsh, J. B. (1977). A property of conformal martingales. In Sem. XI, pp. 490-501. Springer LNM 581. Walsh, J. L. (1929). The approximation of harmonic functions by harmonic polynomials and harmonic rational functions. BAMS 35:499-544. Weiss, G. (1970). Complex method in harmonic analysis. Amer. Math. Monthly 77:465-474.
Weiss, G., and S. Wainger, eds. (1979). Harmonic analysis in Euclidean space. Proceedings of Symposia in Pure Mathematics, vol. 35. AMS, Providence, RI. Widder, D. V. (1975). The heat equation. Academic Press, New York. Wiener, N. (1923). Differential space. J. Math. Phys. 2:131-174. (1924a). Certain notions in potential theory. J. Math. Phys. 3:24-51. (1924b). The Dirichlet problem. J. Math. Phys. 3:127-146. (1924c). Un problème de probabilités dénombrables. Bull. Soc. Math. de France 52:569-578. (1925). Note on a paper of O. Perron. J. Math. Phys. 4:21-32. (1930a). Generalized harmonic analysis. Acta Math. 55:117-258. (1938). The homogeneous chaos. Amer. J. Math. 60:897-936.
Williams, D., ed. (1981). Stochastic integrals. Proceedings LMS Durham Symposium. Springer LNM 851. (ND). A simple geometric proof of Spitzer's winding number formula for 2-dimensional Brownian motion. Unpublished paper. University College of Swansea, Wales.
Yamada, T., and S. Watanabe (1971). On the uniqueness of solutions to stochastic differential equations. J. Math. Kyoto 11: (I) 155-167; (II) 553-563. Zygmund, A. (1929). Sur les fonctions conjuguées. Fund. Math. 13:284-303; correction, 18, p. 312. (1949). On the boundary values of several complex variables. Fund. Math. 36:207-235.
(1959). Trigonometric series. 2d ed. Cambridge Univ. Press, London.
Subject Index

adapted process, 44
almost surely, 294
associative law, 62
asymptotic σ-field, 17
atomic decomposition: for p = 1, 184; for p < 1, 216
Blumenthal's 0-1 law, 14
Borel-Cantelli lemmas, 295, 299
bounded mean oscillation, 186, 193, 199, 205
Brownian motion, 1
Brownian paths: nondifferentiability, 5; quadratic variation, 6; Hölder continuity, 7, 15
Burkholder-Gundy inequalities, 155
Burkholder's weak type inequality, 164
Cameron-Martin transformation, 234
Cauchy process, 2, 33
Chebyshev's inequality, 296
Ciesielski and Taylor's "paradox," 260
conditional expectation, 300
conditioned Brownian motions: in H, 94, 97; in R^d − {0}, 100; in D, 126
conjugate harmonic functions, 162, 170
covariance: of two local martingales, 54; of two stochastic integrals, 61; of two semimartingales, 68
Dirichlet problem, 43, 246
distributive law, 68
Donsker and Varadhan, 266
Doob-Meyer decomposition, 52
Doob's inequality, 147, 306; sharp constant in, 151
duality theorem: for ℋ¹, 184; for H¹, 199
Dynkin's π-λ theorem, 9
explosions: from too much drift, 240; from too much variance, 277
exponential martingale, 27, 70
Fefferman's inequality, 187, 190
Fefferman-Stein decomposition, 202
Feller's test, 242
Feynman-Kac formula, 229
filtration, 50
fine topology, 108
Garnett-Jones theorem, 211, 214
Girsanov's formula, 82
good λ-inequalities, 153
Green's function, 39
Gundy's L log L inequality, 148
Hardy space H^p, 144
Harnack's inequality (for solutions of Schrödinger's equation), 261
heat equation, 220
Helson-Szegö theorem, 215
h-transform, 92, 94
independence, 296
integration by parts, 68
Itô's formula: for a local martingale, 64; for several semimartingales, 68
Janson's theorem, 167
Jensen's formula, 132
Jensen's inequality (for conditional expectations), 301
John-Nirenberg inequality, 209
Kelvin's transformations
Khas'minskii's lemma, 231
Kolmogorov's continuity theorem, 6
Kolmogorov's extension theorem, 1
Kolmogorov's three series theorem
Kolmogorov's (other) weak type inequality, 111
Kunita-Watanabe inequality, 59
law of the iterated logarithm: for Brownian motion, 11; for continuous local martingales, 77
Lebesgue's thorn, 248
Lévy's arc sine law, 261
Lévy's theorem, 75
local martingale, 50
Markov property: of Brownian motion, 7; of diffusions, 288
martingale, 302
martingale convergence theorem, 305
martingale transforms, 162
maximum principle, 72, 115
monotone class theorem, 10
natural scale, 74, 93, 241
Nevanlinna class, 128
nontangential convergence, 105
optional projection, 191
optional σ-field, 44
optional stopping theorem, 310
orthogonality of martingale increments, 303
Pac-Man, 249
Picard's theorems, 139, 143
Poincaré's cone condition, 249
Poisson integral representation of harmonic functions: nonnegative functions, 98, 99; bounded functions, 119; functions in H^p, 158
Poisson's equation, 43, 251
potential kernels, 31
predictable σ-field, 49
predictable sequence, 48, 303
quadratic variation: of Brownian motion, 7; of continuous local martingales, 65
Rayleigh-Ritz formula, 270
regular point, 247
Riesz's inequality, 170; sharp constant in, 172
Riesz transforms, 169
Schrödinger equation, 255, 263, 264
Schrödinger semigroups, 232
semimartingale, 58
shift, 10
shift invariant events, 102
simple random walk, 48, 91, 303
Spitzer's theorem (on the winding of Brownian motion), 134
stable subspace, 87
stochastic differential equations: existence of solutions, 274, 278; uniqueness, 278, 286; examples, 283
stopping time, 18, 306
stretching function, 51
strong Markov property: for Brownian motion, 21; for conditioned Brownian motion, 102
submartingale, 302
subordination (of analytic functions), 176
supermartingale, 302
uniform integrability, 308
usual domination argument, 62
variance process, 52
Varopoulos' staircase, 213
volume potential, 225