Lecture Notes in Mathematics Edited by A. Dold and BEckmann Subsenes Adviser
Instltut de Mathematlque, Unlverslte de Strasbourg P A Meyer
1193
Geometrical and Statistical Aspects of Probability in Banach Spaces Actes des Journees SMF de Calcul des Probabllites dans les Espaces de Banach, organlsees a Strasbourg les 19 et 20 JUIn 1985
Edited by X Fernlque, B Heinkel, M. B. Marcus and P.A. Meyer
Springer-Verlag Berlin Heidelberg New York Tokyo
Editors
Xavier Fernique Bernard Heinkel Paul-Andre Meyer Institut de Recherche Mathematique Avancee 7 rue Rene Descartes 67084 Strasbourg Cedex, France Michael B. Marcus Department of Mathematics Texas A & M University College Station, Texas 77843, USA
Mathematics Subject Classification (1980): 46820, 60B05, 60B 10, 60B 12, 60F05, 60F 15, 60F 17, 60G 15,62005, 62E20 ISBN 3-540-16487-1 Springer-Verlag Berlin Heidelberg New York Tokyo ISBN 0-387-16487-1 Spnnger-Verlag New York Heidelberg Berlin Tokyo
This work IS subject to copyright. All rights are reserved, whether the whole or part of the material IS concerned, specifically those of translalion, reprinting, re-use of Illustrations, broadcasting, reproduction by photocopying machine or similar means, and storage In data banks. Under § 54 of the German Copyright Law where copies are made for other than private use, a fee IS payable to "Verwertungsgesellschaft Wort", Munich
© by Springer-Verlag Berlin Heidelberg 1986 Pnnted In Germany Printing and binding: Beltz Offsetdruck, Hemsbach/Bergstr. 2146/3140-543210
Preface le calcul des probabilites dans les espaces de Banach est a~tuellement un sujet
en plein essor auquel des rencontres internationales sont consacrees regulierement depuis une dizaine d'annees. Les 19 et 20 juin 1985, une trentaine de specialistes de ce sujet se sont reunis
a
Strasbourg sous le patronage de la Societe Mathematique
de France, pour faire le point des developpements les plus recents, notamment en matiere de fonctions aleatoires gaussiennes, de processus empiriques et de theoremes limites pour des variables aleatoires
a valeurs
dans un espace de Banach. Les
principaux exposes de ces deux journees ont ete rediges par leurs auteurs, ce qui a permis de composer ces Actes que la Societe Springer a eu l'amabilite d'accueillir dans sa collection Lecture Notes in Mathematics. Ces deux journees ont ete assombries par la disparition, le 7 juin 1985, d'Antoine Ehrhard qui etait l'un des plus brillants representants de la jeune generation de probabilistes. Nous avons ressenti cruellement son absence, celle du mathematicien bien sur, mais surtout celle de l'homme de coeur sensible et attachant qu'il etait.
Les editeurs
Table of Contents
BORELL, C.,
A brief survey of Antoine Ehrhard's scientific work.
1
DOUKHAN, P. and LEON, J.R., Invariance principles for the empirical measure of a mixing sequence and for the local time of Markov processes. GUERRE, S., Almost exchangeable sequences in
q L ,1:S:q<2.
4 22
HEINKEL, B., An application of a martingale inequality of Dubins and Freedman to the law of large numbers in Banach spaces. LEDOUX,
M.,
29
On the small balls condition in the central Limit theorem in uniformly convex spaces.
44
LEDOUX, M. and MARCUS, M.B., Some remarks on the uniform convergence of gaussian and Rademacher Fourier quadratic forms.
53
MASSART, P., Rates of convergence in the central limit theorem for empirical processes. SCHWARZ, MoB., Mean square convergence of weak martingales.
73
110
YUKICH, J.E., Metric entropy and the central limit theorem in Banach space s.
113
A BRIEF SURVEY OF ANTOINE EHRHARD'S SCIENTIFIC WORK Christer BORELL Dept. of Mathematics Chalmers University of Technology Goteborg, Sweden
For a couple of years Antoine Ehrhard gave us pleasure with a series of brillant ideas on Gaussian measures and convexity. The loss of him is the loss of a very seriously working young mathematician as well as the loss of a very good friend. For a complete list of Antoine Ehrhard's publications, see at the end of this survey. As a background to Ehrhard's scientific work it is appropriate to recall the Laplace-Beltrami operator and its relations to isoperimetry. Needless to say, this fascinating area is far from completed and, indeed, it seems very hard to unify since it is rooted in so many branches of pure and applied mathematics. In particular,
2
n
like the Laplace operator 6 = 'V in R , the so-called number operator n L = -6 + x.'V in R merits its own study. This central point underlines most of Ehrhard's papers. To master the number operator and isoperimetry Ehrhard first introduced the so-called
k-dimensional Gaussian
machinery of general interest
s)~etrizations
and he developed a streamlined
CMS]. For brevity, we only recall the definition of
n-dimensional Gaussian symmetrizations.
~
Suppose
~ (dx) = e and let
is the canonical Gaussian measure in
-lxI
hE R
n
2
R
n
i.e.
/2 dx/,f2FT n
be a fixed unit vector. Set
there exists a unique non-decreasing function
f
*
gOh
in the direction of
n-dimensional Gaussian symmetrization of
is called the h
Here,
if
f
is the indicator function of a set,
f
*
f
becomes
the indicator function of an affine half-space. The number operator
L
is related to Gaussian Dirichlet integrals as follows
Ehrhard's perhaps most central result states that the integral
decreases, in the weak sense, under Gaussian symmetrizations of decreasing convex function
F:
[0,+00 C
~ R
f
for every non-
[ASE]. Tha familiar isoperimetric
inequalities for torsional rigidity, principal frequency, and Newtonian capacity thereby get their Gaussian counterparts now with affine half-spaces as extremals CASE].
2
The same source of ideas also led Ehrhard to a very neat proof of the Gross logarithmic Sobolev inequality [LN] and to an inequality of the Poincare type rASE]. Under the leadership of Professor Xavier Fernique, Antoine Ehrhard very early became familiar with the Banach space aspect of stochastic processes, which has been of greatest significance to Ehrhard's maturity as a mathematician. For an important joint publication, see CCR]. Another result stemming from this background is the following remarkable inequality:
~-1(11(6A+ (1-6)B));;, 6~-1(I1(A))+ 0-6H- 1 (I1(B)) , 0<6< 1 , A,B convex,
(1) where
a
Ha)
r
-co
[MS]. In particular, i f the heat equation in
A
is a convex body in
(int A) X R+
A X [OJ , then the function
n
u = 0
and i f on
u
is the solution of
(CiA) X R+
~-1(u(.,t)) is concave for every
Finally, in his last paper [AlP] in
satisfying
R
t>O
and
u= 1
on
[MS].
Ehrhard investigated the case of equality
as well as in several other inequalities for Gaussian measures. Thus, for n R are non-empty convex domains, Ehrhard proved that equality example, i f A,B occurs in (1) i f and only i f ei ther A=B or A and B are parallel affine (l)
'¥
half-spaces. The arguments leading to this very definite result are extremely penetrating and mixed with youthful enthusiasm and conviction. The loss of Antoine Ehrhard is an irreplaceable loss to the area he so successfully invented. The scientific progress will now proceed much slower and with much less substance, too. But for ever, we will remember an artist ; an artist painting with convex bodies and the Gaussian law.
PUBLICATIONS [CR]
(en collaboration avec X. Fernique). Fonctions aleatoires stables irregulieres. C.R. Acad. Sc. Paris, t.292, Serie I (1981), 999-1001. Regularite des fonctions aleatoires stables. These de 3eme cycle (1982), 9-43. Lois stables et propriete de Slepian. Ann. SC. de l'Universite de Clermont 71 (1982), 81-94. Sur la densite du maximum d'une fonction aleatoire gaussienne. Seminaire de Probabilites XVI, 1980/81, Lecture Notes in Math. 920, 581-601. Une demonstration de l'inegalite de Borell. Ann. Sc. de l'Universite de Clermont 69 (1981), 165-184.
3
rMS]
Syrnetrisation dans l'espace de Gauss. Math. Scand. 53 (1983), 281-301. Un principe de syrnetrisation dans les espaces de Gauss. Probability in Banach spaces IV -Oberwolfach 1982- • Lecture Notes in Math. 990, 92-101.
[ ASE]
Inegalites isoperimetriques et integrales de Dirichlet gaussiennes. Ann. Scient. Ec. Nonn. Sup., 4e serie, t. 17 (1984), 317-332.
[ IN]
Sur l'inegalite de Sobolev logarithmique de Gross. Seminaire de Probabilites XVIII, 1982/83, Lecture Notes in Math. 1059, 194-196.
[AlP]
Elements extremaux pour les inegalites de Brunn-Minkowski gaussiennes. Annales de l'Institut Henri Poincare. Vol. 22 nO 1 (1986)
149-168.
Sur la densite du maximum d'un processus gaussien. These d'Etat (1985), 87-104.
These de 3eme cycle (soutenue
a
Strasbourg le 12.02.1982) :
Fonctions aleatoires stables. Densite du maximum d'une fonction aleatoire gaussienne. Publication de l'IRMA de Strasbourg, nO 156. These d'Etat (soutenue
a
Strasbourg le 24.05.1985)
Convexite des mesures gaussiennes.
Publication de l'IRMA de Strasbourg, nO 273.
INVARIANCE PRINCIPLES FOR THE EMPIRICAL MEASURE OF A MIXING SEQUENCE AND FOR THE LOCAL TIME OF MARKOV PROCESSES, P. DOUKHAN * ,J.R. LEON **
ABSTRACT.
We show an invariance principle for the empirical measure of a stationary strongly mixing sequence indexed by the unit ball of some Sobolev space
Hs We also obtain invariance principle and law of iterated logarithm for the local time of Markov processes indexed by
Hs .
We note that the regularity condition
s > d/2
in the first framework for
random variables with values in a compact riemannian manifold in the continuous case of the brownian motion on
* universite
Paris-Sud U.A. CNRS 743 "Statistique Appliquee" Mathematique, Bat. 425 91405
ORSAY (France)
E becomes
s > d/2-1
E
** Universidad
Central de Venezuela Facultad de Ciencias Departamentado de Matematicas Apartado Postal nO 21201 CARACAS (Venezuela).
5
1.
I NTRODUCT ION
This work is divided in two parts.
The first one is devoted to investigate a
rate of convergence in the weak invariance principle for the empirical process {~k;
of a strictly stationary strongly mixing sequence
Xn k=0,1 , ... } valued in a
L2(E,~) of uniformly bounded 1 n functions satisfying an entropy condition: X (f) = L [f(~k)-Ef(i;k)]' f E F n /ri k=1
metric space
E indexed by a compact class
F of
The typical case is obtained for a d-dimensional E with
riemannian compact manifold
F unit ball of the Sobolev space
result can be shown only if s > d/2.
Hs of the manifold (see Gine [14]) ; a In this discrete case we expose some of the
results of [10] made in collaboration with Frederic Portal.
Rates of convergence
essentially depend on the entropy condition for F . The second part of this paper studies the asymptotic behaviour of n
ZnU) =
~ Jf f(X u ) du, fEL2(~). In 0
Here
{X t : t> O}
is a continuous parameter
recurrent ergodic stationary Markov process with values in a compact riemannian manifold E or in lRd ; ~ denotes the invariant measure of the process. We first give an invariance principle in a general framework. brownian motion on
E;
We also study the case of the
we give an invariance principle and a L.I.L. uniform on the
classe F , unit ball of Hs for s > d/2-1. We also study the case of diffusions on Finally we discretize the Zn process by L f(X) f E F. O
In
Our result can be compared to those of [7], which works with the non-uniform case of one dimensional our result is uniform on
diffusions. Our speed of discretization is quite lower but
F.
The process
E by Baxter and Brosamler [4]; L.I.L.
Z (f) n
has been studied for the case of
they prove non uniform central limit theorem and
They use, as we do, mixing techniques.
Battacharya ([3J) extends their
results to a non compact case with martingale techniques.
A first uniform result,
based on a martingale approach, is given by Bolthausen [5] for the case of brownian motion on the
d-dimensional torus; his results are extended here for a general
6
riemannian compact manifold.
The non uniform problem is also considered by
D. Florens [13] for the case of one dimensional diffusions. We use, in this paper, mixing and Hilbert space techniques which are by-products of [10] for example.
The Markov case satisfies a mixing assumption as it is
shown in Rosenblatt
The mixing notion used here is strong mixing defined as
follows CX
{x
v ; v>- O}
v if CXv-> 0 CX
and
~1]
is said to be strongly mixing, with mixing coefficient
where,
v = Sup {llP (A n B) - lP (A) lP (B) I ; A E r~ , Bn;+v,e.2. O}
r;
is the
a-field generated by
{xv; s
We expose here invariance principles results for discrete and continuous parameter processes to compare them. seen that regularity conditions
In the compact riemannian manifold case we have s > d/2
for discrete parameter becomes
s > d/2-1
for continuous parameter.
2. INVARIANCE PRINCIPLE FOR THE EMPIRICAL MEASURE OF A MIXING SEQUENCE OF RANDOM VARIABLES. In this section we expose some results of [10] made in collaboration with Frederic Portal ; a complete version of this work will appear elsewhere with proofs. Let
(~k)k>O
a strictly stationary sequence of strongly mixing random varia-
bles with values in a polish measured space a-field of E and
~
a non negative a-finite measure.
compact subset of L2(E,~), X (f) n The class
(E,B(E) ,~)
vie
define n 1 L
/ri
k=1
[f(~k)-Ef(~k)]
where Let
B(E) r
is the Borel
a finite entropy
fEr
r is supposed uniformly bounded, the law of
~
has a bounded o density with respect to ~ ,and there is some aE]0,1/3] such that L cx~ < k=O (here {cx k} denote the mixing coeffirients of the sequence {~k})' The process Xn is C(F) valued, where (C(F), II.IIJ is the space of contiriuous functions on the 00
00
compact set
r e'lui p!Jed with uniform norm.
7
We give an estimate of Prohorov distance of Xn to the centered gaussian process Y with covariance defined for f ,9 E F by:
g"
and
Eg(t:o ) .
A reconstruction of the process
Y gives a weak invariance principle with rate
of convergence. The method is based on estimations of central limit theorem rates in Prohorov
Xn depending on the dimension of repartitions and, from another hand, on estimations of
metric convergence given in [11] for the finite repartitions of the process
the oscillations of the THEOREM 1.
Fa
L:t
I:
satisfying
k=O
a~ E
such that
{fEF-F;
=
<00, SUp
dlJ
<6 2} /
{t:k}
then there is a constant only depending on the
sequence
L2 (E,IJ) , the values of
N sufficiently b.ig finite subset of
rJ (
2 ,1/(1-0) 11- 0 U " L E\k~N fk(X)) lJ(dx) J V=
OE]0,1/3]
suppose that there is a
an orthonormal denumerable basis of
are defined for
6
J f2
{X~(f) ; fE Fo} ~ C {o2 U + B(\V}.
Let
U,V,B
Xn process based on,
(
resp. U
1 2 )1/(1-0) 11- 0 ( E k~N e fk(x) lJ(dx) J resp. V k [J (
I:
kEN
JK
by
IlfkI12}, 00
L: e- 1 1I f I1 2, k[N k k 00)
B6" Sup {L: 6 k(f, f k) 2 ; f E F6} kiN the sequence
is positive and satisfies
SUp
{I:
kEJK
:
~~~~r~
The same estimate is valid for the gaussian process
ek(f,f k )2 ; fEF} <00.
Y (see [10])
2.1. Uniformity test on a d -dimensional compact riemannian manifold Here
II~~II
00
IJ
~a<
is the uniform measure on 00.
E and the law v of Co
E. satisfies
We extend results of Gine [14] in a mixing framework.
The test of Gine rejects hypothesis "v = IJ"
for big values of T(s) n
8
T(s) (w) = n II ~ (a os/2 k=a k k n The sequence
11
k) (vn(w) - v) II :s .
{ok; k=a,1, ... } is the ordered sequence of eigenvalues of the
Laplace Beltrami operator 6 with eigenspace
Ek and 11 k is the orthogonal pros Sobolev space of the manifold E(s E lR )
jection
H_ s "" Ek , Hs being the index w whose norm is Ilfll s = ( L o~ L (f,f.)2)1/2 k=a flE k J random measure :
Su p
v
n
= n1
L k=1
0t; 00
°
If there exist
0 ( n -2/(3b)+8/15,J
for
S
> d/2
The sequence
for
k
a,1, ... } sa tis fi es
k
•
a < b < 1/4, a <° ~ n ....
sequence of gaussian processes
lP
the empirical
Xn process which can be written
For this we consider the
a n =
v n
n
{I a k 0V 2 I , k=a, 1, ... } <
THEOREM 2.
We write
00,
00
1/5
satisfying
° < wand
n 2 an
L
n=a
we can construct an identically distributed
{Y n ; n=1,2, ... }
with the law of
Y
such that:
(Sup {I Xn(f) - Yn(f) I ; f E Bs } -> pn ) -< p n
,here
Pn = 0 ((£n(£n n))s/d (£n n)(d-2s)/(3d))
denotes the closed unit ball of
n ....
for
00
and
B
s
Hs .
is V = 1J" then the 1imit law of T(S) We use this result to show that, if n a sum of dependent x2 random variables from another hand, for v f 1J" satisII
II
f a for some k such that a f a then we show that the limit law k k 2 1 of n- / (T(S) -nC) is gaussian for some constant C
fying
11 (V)
n
So that the test is consistent. In the case when
E = S1
continuous functions with index
is the unit circle equiped with a set of Holder s > 1/2 we obtain a more precise statement.
2.2. Invariance principle for the empirical measure on the real line. 1 Here E = R , IJ is Lebesgue measure on lR 1 and 2 1 F = {fEL (R ) k~O (k+1)s (f,h k)2 ~ 1} where {h k} the sequence of normalised Hermite functions (c. f. [23]) .
9
THEOREM 3.
[9].
Under the same mixing assumptions than in theorem 2, we obtain
= 0 (/9,n (9,n n) (9,n n)(5-6s)/18)
P
n
for
n
->-
00
if
S
> 5/6
Remark : The main interest of this result is to escape from a compact context. In [10] and [9] invariance principles for non parametric estimators are obtained with the same methods.
Kernel and projection estimators of density and regression
function are considered.
3.
F varies with the index n.
In this case the class
L.I.L.
INVARIANCE PRINCIPLE AND
FOR THE LOCAL TIME OF CONTINUOUS
PARAMETER MARKOV PROCESSES. {X t ; t > O}
We write
a continuous parameter Markov process with values in a
complete separable metric space stationary with marginal law
E
~
This process is supposed to be homogeneous and Moreover, its infinitesimal generator
unbounded non-negative linear operator, whose domain,
D(L)
is dense in
satisfies (i)
L is self-adjoint and onto.
(ii) the spectrum of
L is discrete.
(iii) 0 is a single eigenvalue of L associated to the constant eigenfunction 1 . ? L-(~)
Under those hypotheses, the Hilbert space 11.11
(resp. (.,.)) its norm (resp. scalar product)
an orthonormal basis of
L2(~)
is separable; we write
and
{em; m = 0,1,2, ... } is
such that
00
Lf = L A (f,e)e m=O m m m Thus the spectrum of
L is
for
fED(L), here
{Am; m ~ O}
Ao = 0
<
A1
~
The semi-group
A2 ~ ... and Pt
eo=1 .
associated to the
(Ptf,g) = Ef(X t ) g(X o )' satisfies -A t e m (f ,em) em f E L2(~) , t > 0
Markov process, defined by 00
P f = L t m=O
Condition (iii) implies ergodicity of the process proposition 2.2)
The operator
Pt
{X t ; t
~
O}
is a contraction verifying
(cf. Battacharya [3],
10
IIPtfll~e
-A t
at ~ c e
is strongly mixing with The Green operator G1 =
2 fEL (fJ)
1 Ilfllfor
(f,1) = 0 ; thus the process
-A t
for a
1
c
>0
2 L (fJ)
G is defined on
-1)
(f,e m em' fE1 L2 (fJ)
This operator is continuous on LG=I-S
where
ra'
Gf = J Ptf dt if o
by
jection on
fE1l- and
D(L) n 1l-
L2 (fJ)
and
S
and verifies
is the orthogonal pro-
eo (Sf = (f,e )e ) . o 0
We consider the Hilbert space
~
2 Hs ={fEL (fJ)
The operators Hs
m=1 Lr
m
8s
m
Gr
and
(H s ' 11.11)
s
>0
with norm (resp. scalar product)
1/2 ),s (f,e )2 1 m=1 m m J w
lifll s =
[L
are formally described by Seeley [20] for r E [ 2 Gs / 2 and the domain of Ls / . We also write H_ s it is an Hil bert space with the norm 11.II_ s
IITII_ s = Sup {IT(f)1 ; We write
for
< w}
AS (f,e )2
is the range of
the dual space of
H s
defined by :
11·ll s (resp. (.,.)s)
Note that
> O}
l-
with range
is the identity operator on
t
(Rosenblatt [21]) .
a thus : Gf = L: Am m=1
{X+c
fEH s '
Ilfll s=1} =
the unit closed ball of
C~1 A~s
(T(e m))2f/2
H . s
The aim of this work is the study of asymptotic behaviour of the functionnal
In
Z (f) = -1
n
In
L2 (fJ)
It is defined on an
H-s
f(X ) du
0
u
because
valued random variable for 00
(*)
L:
m=1
A-(1+s) m 00
Indeed,
EIIZnll: s
= E
L:
m=1
s
<
E!IZn(f)!!2~nllfI12. We consider Zn as
>a
verifying:
CIn r
n
00
-s Am
J0
2 e
(X ) m u dU)
11
r
00
A- s (1 - n~) (p u em' em) du m=1 m 0 -A u n ~) e m du 2 I: A-ms r (1 E II zn II 2-s n Jo m=1 -nA 2A-(2+s) (1-e m) I: E II zn II 2-s = 2 I: A-(1+s) n m=1 m m=1 m E II Zn II 2-s
2
I:
00
00
00
This calculus
is analogous to those of Baxter and Brosamler ([4] , theorem
(4.3)) .
We now define a gaussian random vector
Z on
H
-s
For this we assume the
following technical hypothesis: (H)
\ There is a uniform random variable on the interval
~ probability spRce (0,A, We consider an i.i.d.
~)
(~ ) '0
sequence
with variance 1 and we define, for Z(f) =
{X t ; t
that
m m/
~
O}
[0,1] defined on the same and independent of it.
of gaussian centered random variables
s satisfying (*)
-1/2 (f, e ) t: m ' f E Bs Am m m=1 2 I: A-(1+s) thus ZEH -s a. s. m=1 m
/2
00
I:
00
E II ZII :s
Note that
An abstract construction of Z can be made using the operator T = 2 G1+s 2 ~ A~(1+S) m=1 hypothesis (*), satisfying:
whose trace is
I
H_ s
<
We define a gaussian measure M on
00
H-s , under
(z,T 1)_s (z,T 2 )_s M(dz)
The law of Z is
M.
We also note that measurability condition of [3] is satisfied here because of the separability of E. The random variables valued processes. the compact subset THEOREM 4.
Zn and
C(B s ) the space of continuous functions defined on
Here we note Bs of
The sequence
under the hypotheses (i),
Z, H-s - valued, can be considered as
L2(~) and real valued equiped with uniform norm.
{Z n ; (ii),
n
> 1}
-
converges in distribution to
(iii) and (*) •
Z
in
C(B s ) ,
12
Proof.
Under the former hypotheses, Battacharya shows convergence of finite repar-
titions ([3], remark 2.1.1.).
{Z n ; n -> 1} result from flattly concentrated property using De Acosta's method [1] . Let
FmcH -s
The tightness of the sequence
be the m-dimensional space defined by ;
F = {T E H ; T e = 0 for m -s k FE the m
We note
k = 0 and
E-vicinity of Fm in 00
M+ m
Note that
will
= {T E H-s
; T e
0
=0
k > m}
H-s and -s
L A k=m+1 k
M~ cF~ and E Z~ (e k) < 2/A k ' so Bienayme- Tchebi~ev inequal ity
implies: lP (Z n
t.
FE) > lP (Z n m-
t.
lP (Z n
t.
lP (Z n
t.
EE) > 1 - lP ( L A-s Zn2 m\k=m+1 k A- s E FE) > 1 - E-2 L mk=m+1 k
ME) m 00
(e) k
~
E2)
00
Z2 (e ) n k
00
L
k=m+1 Thus the sequence
Zn is flattly concentrated
COROLLARY 5. The sequence of real random variables tion to
IIZII: s '
X2
infinite sum of weighted
theorem 4 follows. 2
II Zn II -s
converges in distribu-
random variables.
: Using the direct construction of Z, note that: liZ II 2-s
B~~~r~
(t:m)m>O
2
00
-(1+s) 2 t:m
L Am
m=1
being an i.i.d. sequence of normal random variables.
In view to investigate iterated logarithm behaviour of construction of the brownian process
Zt
Levy's construction ([16], 1.5, p. 19).
with base
M, law of
Zn' we now make a direct Z
We use the
Let
2 {Xn,k ; n = 0,1, ..• , k odd, k = 1, ... ,2 n-1} the Haar basis of L [0,1] defined by 'V 2- n/2 for k i 2nt i k+1, = 0 else, and Z k an i.i.d. array of reali'V n, N t 'V r X k(u) du . zations of Z defined on (~,A, lP), we write ZN(t) = L L Z n=O k n,k Jo n, This serie converges normally a.s. Note, for this, that:
13
From another hand Fernique ([12], theorem 1.3.2.) shows that there is an 2 / a 2) < such th a t : A = E exp (11z11 -s
00
Th us
•
a >0 lP (EN ~ a 2-(N+1)/2 V I,n(22N)) _< A 2- N. k_ 'V
Borel-Cantelli lemma implies then the continuity of the limit quence
of the se-
The limiting covariance is computed as
ZN(t) 'V
'V
UAV
00
E (Z(u),U)_s = (Z(V),V)_s = L n=O
L
k odd O
E(Z,U)_s (Z,V)_s Jr o
Xn k(w) dw '
'V
'V
E (Z(u),U) -s (Z (v),V) -s = Let
Z(t)
UAV
E(Z,U)
-s
( Z, V)
-s
{Z(n,t)
'V
Z(t), we set part of t
0 < t < 1} an i.i.d. sequence of continuous realizations of [tT Z(t) L Z(k,1) + Z ([tl, t-[tJ), where [t] denotes the integer k=1 The process {Z(t) t > O} is the brownian process with basis M;
it belongs to C (R+,H -s ) under condition (*). PROPOSITION 6.
The a.s.
compact subset
/2 B1
~rQQf.
cluster set of
H_ s (B 1
of
We see, like Bolthausen ([5])
Z(t) (2 t £n[£n t])-1/2 for {fEL 2(fJ); IIfl11 ~ 1}) under that this set is
t ~
00
is the
condition
(*).
/f (B s ) = /2 B1 using the
results of Kuelbs and Lepage ([18]) . 3.1. Brownian motion on a compact riemannian manifold. The space {X t ; t > O} LEMMA
7.
Remark
E is here a d-dimensional compact riemannian manifold and
is the brownian motion on
(i).
For
(ii).
Vo, S
S > d/2, > 0,
For s = d/2
------
For s > d/2-1 Proof.
where
E.
fJ
is the uniform measure on
E.
is a bounded r.v.
s > d/2 - 2/(2+0)=> E II Z1 11 :;0 <
00
•
the r.v.
Z1 E H-s admits moments of every order. there is a 0 > 0 verifying Iii).
is the Laplace operator of the Riemann The eigenvalues of L , (Am)m>O satisfy Am 'V c m2/d ([ 19]) for m.-+ 00,
Note first that
manifold
IIZ111:s
E and
L=-
c > 0 is some constant.
(1"
where
(I,
From another hand, Gine ([14]) shows:
14
VS
(;)
> d/2
II Z1 11
00
:S
II Z1 11 :s (i ;)
3C L
m=1
~
h, k
116xll:s~C
, VX E E
A- s m
[fo em
2 (X )
u dU]
1 1 2 2 1 + L A-ms r em (X u) du = r 116 x II -s du < C J J u m=1 o o 00
fr1 = E[ r ~ A-ms \J em Jm= 1 o
E Ilz111:;
Let
>0
(X)
du
)2 11+6/2 J
> 0 satisfying h+k = s and p = (2+6)/6, q E II z II 2+ 6< E A B 1 -s -
inequality implies
A=
1 [~ A~hp (\Jr em (X) u o m=1
B
1 A-kp (Jr e (X) dU)\2 m=1 m o m u
where
\2 6/2 du \ 1 ) J
because
<2
L
m=1 00
Setting
hp
kp + 1, we have
E liz 11 2+6 < 2 L
1 -s -
m=1
A- s - 2/(2+6) C6/ 2 m
Zn can be rewrite Zn = n- 1/ 2 (Zi1~ ... + z~n)) , where (X ) is a stationary sequence of random variables e EH J k= 1em u dU] m -s
Note that k 00
with
hp > d/2, and
~
EB
Z(k) 1
(2+6)/2, Holder
L
m=1
zi 1 )
[r
= Z1'
Moreover, Baxter and Brosamler ([4])
show that this sequence is
¢-mixing (with ¢n ~ c an ,0 < a < 1). Thus the following theorem 8 will result from lemma 7 and : THEOREM A.
(Dehling, Philipp [s]).
Let
{x
V
, v> 1} -
a strictly stationary sequence
of random variables with values in a separable Hilbert space tation and having a strongly mixing with
(2+6) CJ.
n
-order fini te moment
=
(0 < 6 < 1). for
together with a brownian motion
{Z\t) ; t > O}
n~
= 0 (It £n ( £n t)
a.s.
centered at expec-
If the sequence is
it can be reconstructed
with covariance
probability space such that :
II L x V - X( t) II v
H,
r
on another
15
f
Here
Vx,yEH,
is defined by 00
(fx,y) = E(x,x 1)(y,x 1) + L E {(x,x 1) (y,x) + (x,x) (y,x 1)} v=2 \J V > d/2-1 ,
Let
S
compact in
H
and its a.s. cluster s",t for
-s
0 E0
There is some
o
n
is
-'<0
12
is conditionally
B 1
where
with
0
, VfE H
s
,
1i m
n-+oo
lP (0 ) = 1 0
and:
[Z (f) ( w) (2 £n (£n n)) -1 / 2 ] = 12 II f II -1 n
Using lemma 7 we see that theorem A works for the process
Proof :
us to make a strong approximation. of theorem.
B1 is the
H • 1
closed unit ball of
Vw E 0
the sequence
Zn (2 £n(~n n))-1/2
THEOREM 8.
Z
n
it allows
With help of proposition we get the first part
The second part follows from conditional compactness of the sequence
and from a.s. convergence of the upper 1 imit for any quence of theorem applied with have
H
(2+0)-order moments because
construct a sequence
(U k )k>1
=
and
lR
f E Hs which is another conseXv = z;v) (f). Those random variable
E Iz;k) (f)1 2+0 ~ Ilfll ;+0 E Ilz;k) II
with same law that
(Z;k) (f))k>1
L Uk - W(t) a = 0 (It £n(£n t) ) a.s., k
W(t)
motion
such that
:;6
We
so that a brownian a" 12 II fll -1 .
Theorem 8 closes the conjecture (8.11) of [4] for the case of the brownian
Remarks.
-------
motion on a compact riemannian manifold. The case of torus rity condition
Td , studied by Bolthausen ([5]) , leads to the same regula-
s > d/2-1.
The method used is there martingale theory .
• Conservative diffusions. t ; t ~ O} is a diffusion on E with infinitesimal generator L = - (:., +V, for some vector field V. Let v(dx) = ¢(x) ~(dx) the invariant measure of this If
{X
process (c. f. Ikeda, Watanabe [15]), 2, . L* is the adjoint of L in L \~). and strictly pos it; ve. (c. f. [15 ]) L in L2(v) valent here.
Opera tor
L*¢ = 0, where , ¢ can be choosen COO
the function
¢ is solution of
If the field
V is
COO
L is self-adjointed ; n L2 (v)
if V " 2'1
~n(¢)
Gine [14] shows that the Sobolev norm with index s associated to is equivalent to Ii ·11 s because norms on L2 (v) and L2(~) are equiIf the eigenvalues of L have the same asymptotic behaviour than those
16
of
-6, then an analogous of theorem 8 is true for such diffusions
proof of lemma 7 is still valid
indeed the
here.
Rd
3.2. Diffusions on
d R solutions of a stochastic
Results of § 3 are applied to diffusions on differential equation (S.D.E.) : dX Here
{W t ; t
t
S(X ) dt t
-=
dW
+
t
~O} is a brownian motion on Rd and S is a function.
Such
diffusions have been studied, for example by Albev«rio, Hoegh-Krohn and Streit ([2]) and Carmona ([6]).
Ck
We write
real functions defined on
R
d
o (0 -< k < 00) the space of
k time differentiable
and with a compact support.
Let
lJ(dx) -= ,p2(x) dx
a absolutely continuous law, we define, like [2] , the bilinear form
0(7)
If the form D(V)
sucn that
C;cD(L),
E is closed, there is a self-adjointed linear operator r::(f,g) -= (Lf,g).
L~oc
(R d )
H -= -6+ V and
THEOREM B ([2]).
Let
for
then
V
It is the case for fE
c;
where S -= 2 ,p-1 7,p
C~cD(L)
,p-1
and
L(,p-1
lJ(dx)
= ,p2(x)
dX t -= S(X t ) dt
c 1 ,c 2
~
0 and
(R
and
L1 -= 0
If
= ,p-1 Hf for fE C~
dx
a law equivalent to Lebesgue measure satis-
+
dW
{X
t
; t
~
O}
to the S.D.E ...
t
d
is markovian with invariant law
The operators Hand same spectrum.
(R d ) ; then
2 ) satisfies div S (x) ~ -c, Ixl - c2 for some constants d d SE Lioc (R ) , ,p-1 M E L~oc (R ) , then the process {X t ; t ~ O}
div SE
L~oc
n
L on
V -= ,p-1 M .
fying S -= 2 ,p-1 V,p E L2(IJ) , there is a unique solution
If
o
E is an Hilbert space with the norm
Lf -= - M - S. Vf
,p-1 V,p, ,p-1 ME where
of
C1
f Vf • Vg dlJ
E (f,g) The domain
E on
IJ •
L with domains
It is discrete if
D(H)cL 2 (R d ) ,D(L)cL 2 (1J)
lim V(x) -= Ixl-wo
+
00 •
have the
17
Condition (*) is satisfied with the help of Tamura's result ([24]) asserts that lim Ad/2+d/m k- 1 = c > 0 under assumption Am i.
k->oo
k
3R
>0
o < Inf
{V(x)
> R} < Sup
Ixl
ii.
VX E lR
IDa V(x) I iii.
3C
COROLLARY 9. assumptions
s
> d/2
+
If
Am
dim -
>0 ~
Ixl
< Ca
+
m
; Ixl
> R} <
00
d
Ix 1 2 ) (m- a) /2
> R => x.V(x) ~ C Ix 1m V
is equivalent to Lebesgue measure,
m> 0
for some
and 13 =
2 ¢
-1
2
= ¢-1
6¢
4
17¢ E L (~) n L1oc (lR
Zn
implies that the sequence
Conditions
(1
-
{V(x) Ixl-
which
d
satisfies
),
then
converges in distribution to
Am are satisfied by homogeneous polynomials
Z
in
V with degree
m•
V(x) = c Ixl 2 for some constant
A multidimensional Ornstein-Uhlenbeck (i.e. c
> 0)
satisfies hypotheses of Corollary 9
if
s
> d-1
.
We do not get here an iterated logarithm law (L.I.L.) because the lemma of the former section is no longer valid.
To avoid this problem and obtain a uniform L.I.L.
we now reduce the class of functions used. From here we suppose ¢ bounded, thus L2 (lR d ) cL 2(~); the Sobolev space of L2 (lR d ) constructed with tensor products of Hermite functions is denoted by
* Hand
The process
s
Z1
is a bounded random variable with values in
Here and
h is the m-th normalized Hermite function. With the help of the estimate m II hmII = 0 (m-1 I 12) of Szego ([23]) we see, summing by parts on spheres of 00
that: liZ 11*2 < C d 1- s 1 -s-
~
d- s -7/6 •
p
p=1 The strongly mixing property of the
logous method than for theorem 8,
{X t
t
> O} process implies, by an ana-
18
> d - 1/6 ,
S
For
THEOREM 10.
there is some
In
lim [(Zn(f) (w) n-+oo
II· II
The Use of the norm
Proof : Note that
J IfI2+8d~ <
00
of
-1
oo c0
1P (0 ) = 1 o
with
f f dl.Jl
such that ;
(2 £n(£n n)f1/2] " 1211fll_1
*
is valid here because of inclusion
Hs cH - 1
* d HscL (lR) thus
fEH * satisfies integrability condition s ([3], theorem 2.7) and individual L.I.L. is satisfied. 00
From the other hand, theorem A still applies.
The process
Zn
satisfies a
The strong invariance principle with speed o ((£n (£n n))-1/2) in H*-s - ) , H process Z satisfies E(Z,H) *2 "8 -2s (G(H - H - R) where R v -s v v v v v ~ v s 2 for v E I'ld ; its cova ri ance is a trace cl ass opera tor because H "8 /
limit
"f H (x)
v
a orthonormal basis of ~ *2 E(Z,H)_s "
Z
VEl'ld
z
H* and: s 8- s (G(H
VEl'ld
v
Rv ) , Hv - R) .s. 2
v
11 J
A
v
II OX II
IJ (dx)
v
H
v
*2 -s lJ(dx)
is
<
00
* H associated to the gaussian 1aw of -s This compact set is the a.s. cluster set of (Zn - EZ n )/ 12 £n(£n n) . Thus we get a compact subset
•
Z.
K of
The theorem follows. Remark. -----the class
* CocH s 00
Note that C~.
' so that this theorem establishes a uniform L.LL.
on
This result is connected with the conjecture (8.11) of [4] .
3.3. Discretization.
Ox
is a bounded random variable of H_ ; for example if s t s > d/2 and if {X t ; t ~ O} is a diffusion on a compact riemannian d-dimensional manifold this condition is realized. Suppose here that
The discretization of the process
Z
n
is, for 6
>0
Z (f)".l n,6
In
For 6
> 0 fixed, we see that [8] implies Zn,6 converge in distribution to a
gaussian process
Z6
such that -v 1/2
if
k(v)
1 e (-+-) 1-e -v
19
With the help of an i.i.d. sequence of normal realizations explicit constructions of Z and Z6 00 Z= L E:m em m m=1 00 Thus EIIZ 6-ZII_2s = 6 L Ik(A m6) m=1
4
(~)m>1
we give
00 Z6 = IS. L kO'm 6 ) E: em m m=1 An6 2 A- s 2..1 A-s < 6 L {2 A (T)} Am6· m m m=1
;;r.
00
For a diffusion on a compact riemannian manifold: 2
0, if s>d/2 EIIZ-ZI1 6 -s Then the Prohorov and Levy's distances of those gaussian random variables on Hare 0(6 0/3) for 6 -> 0, 0 = s - d/2+1 -s From the other hand, a precise analysis of the results of [11] shows that Dudley distance
d3 can be estimate :
d3 (lP Z ' lP Z,) = 0 (6- 11 / 4 n- 1/ 4 ) for n->oo , 6-> 0 if s > d/2. n,6 Ll Thus the discretized process Zn,6(n) converge in distribution to Z for 6 (n) = 0 (n- 1/ 11 ) when n->00 If, moreover 6(n) = 0 (n -1/(40+11)) then ' lP ) - 0 (n- 0/(40+11)) d (lP 3 Zn,6(n) ZFor great values of
s this speed is approximately
n- 1/ 4
20
BIBLIOGRAPHY [1]
A. De Acosta.
Existence and convergence of probability measures on Banach Trans. Amer. Math. Soc. 152, pp. 273-298 (1970).
spaces.
[2]
S. Albeverio, R., Hoegh-Krohn, L. Streit.
Energy forms, Hamiltonian and
distorted Brownian paths. J. of Math. Phys. 18, nO 5, pp. 907-917 (1977).
[3]
R.M. Battacharya.
On the functionnal central limit theorem and the law of
the iterated logarithm for Markov processes. Zeit. fur Wahr.
und Verw.
Gebiete 60, pp. 185-201 (1982).
[4]
J.R. Baxter, G.A. Brosamler.
Energy and the law of the iterated logarithm.
Math. Scand. 38, pp. 115-136 (1976).
[5]
E. Bolthausen.
On the asymptotic behaviour of the empirical random field of
the Brownian motion. Stoch. Pro and their Appl. 16, pp. 199-204, (1983).
[6]
R. Carmona.
Processus de diffusion gouverne par la forme de Dirichlet de
l'operateur de Schrodinger. 1977-1978, L.N.M.
[7]
Seminaire de probabilite XIII, Strasbourg
721, pp. 557-569 (1979).
D. Dacunha-Castelle, D. Florens. Choix du parametre de discretisation pour estimer le parametre d'une diffusion. C.R.A.S. Serie I, Paris, t.299, pp. 65-69 (1984).
[8]
H. Dehling, W. Philipp. Almost sure invariance principles for weakly dependent vector-valued random variables. Ann. of Prob. 10, pp. 689-701 (1982).
[9]
P. Doukhan. Fonctions d'Hermite et statistiques des processus melangeants (Submitted to pUblication, 1985).
[10]
P. Doukhan, J.R. Leon, F. Portal. Principe d'invariance faible pour la mesure empirique d'une suite de variables aleatoires dependantes.
(Submitted
to publication, 1985).
[11]
P. Doukhan, J.R. Leon, F. Portal. Calcul de la vitesse de convergence dans le theoreme central limite vis
a
vis des distances de Prohorov, Dudley et
Levy dans le cas de variables aleatoires dependantes. Prob. and Math. Stat. VI.2 , 1985.
21
[12]
X. Pernique. Regularite des trajectoires des fonctions aleatoires gaussiennes. L.N.M. 480, Springer (1975).
[13]
D. Plorens. Theoreme de limite centrale des fonctionnelles de diffusions. C.R.A.S. Serie I, paris, t. 299, pp. 995-998 (1984).
[14]
E. Gine. Invariant test for uniformity on compact riemannian manifolds based
on Sobolev norms. Ann. of Stat. 3,
[15]
pp. 1243-1266 (1975).
N. Ikeda, S. Watanabe. Stochastic differential equations and diffusion processes. North-Holland, Tokyo (1981).
[16]
K. Ito, H.P. Mc Kean. Diffusion processes and their sample paths. Springer Verlag, Berlin (1974).
[17]
J. Kuelbs. Kolmogorov law of the iterated logarithm for Banach space valued random variables. Illinois J. of Math. 21, pp.
[18}
784-800 (1977).
J. Kuelbs, R. Lepage. The law of the iterated logarithm for Brownian motion in a Banach space. Trans. of the Amer. Math. Soc: 185, pp.253-264 (1973).
[19]
S. Minakshisundaram, A. Pleijel. Some properties of the eigenfunctions of the Laplace-operator on riemannian manifolds. Can. J. of Math. 1, pp. 242-
256 (1943).
[20]
L. Nirenberg. Pseudodifferential operators. Proc. of Symp. in pure Math. XVI. Global Analysis, A.M.S. Providence, pp. 149-167 (1970).
[21]
M. Rosenblatt. Markov processes. Springer Verlag, New York (1971).
[22]
R.T. Seeley. Complex powers of an elliptic operator. Proc. Symp. in pure Math. X, A.M.S., Providence, pp. 288-307 (1968).
[23]
G. Szego. Orthogonal polynomials. A.M.S. Providence (1939).
[24]
H. Tamura. Asymptotic formulas with sharp remainder estimates for eigenvalues
of elliptic operators of second order. Duke Math. J. 49, pp. 87-119 (1982).
ALMOST EXCHANGEABLE SEQUENCES q L ,1';q<2.
IN
Sylvie GUERRE Equipe d'Analyse U.A. N° 754 Universite Paris VI 4 place Jussieu 75230 PARIS CEDEX 19 Tour 46/0 - 4eme etage
Let
(O,B,P)
be a probability space and
q L
=
+
00
•
We
< +
00
•
Does
Lq(O,B,P) , 1,; q <
consider the two following problems : Problem 1 : Let
Lq ,
be a weakly null sequence in
(Xn)n€ E
(x
there exist a subsequence
) ~. k€ E
of
1 ,; q
which is almost symmetric?
is almost symmetric if
V
€
> 0 ,
a
(1-e)
liz
k € IN
x
01.
~
n k+i
II,;
III:
Problem 1 was stated for Problem 2
V
such that
(01.)
E
,V
R (E)
permutation of
n
) n k k€ IN such that
(X
II ,;
01. X ~
q> 2
(1 + e)
IiI:
and solved for
!!J
01. X
nk+n(i)
~
q € 2E
n k+ i in
[8J
and a exchangeable sequence
of
n
Ilzk
I: k=O
-
If in addition
k
< +
17 ~
in
00
?
q
[We will say that
(Xn)n E IN
(Zk)k€ IN
after the change of density This definition comes from
is almost exchangeable after the change of density
is i.i.d., we will say that ~J
(Xn)n€ E
is almost
•
[2J •
The following implications are well known :
(X (X
n n
)
is almost i.i.d. after the change of density k€ E
)
~
-Uis almost exchangeable after the change of density
k€ E
k )
k
a
~,
X
+00
~
,
Under the same hypothesis, does there exist a positive density
subsequence
(X
E
~
~
is almost symmetric. kE E
By the Finetti's theorem (cf. r2J), we also have that
~
~
i.i.d.
23
is almost exchangeable
~ (x
)
is almost i.i.d., conditionnally to its tail field.
~
kE E
On the contrary, a modification of Example 2 in [2J by Y. Raynaud shows that there q exists an almost symmetric sequence in L with no almost exchangeable subsequence after any change of density. q L , 1,; q < +
A lot of results are known about symmetric subspaces of particular about those that are isomorphic to
tP
for some
00
,
in
p • In almost all these
papers, symmetric sequences are in fact i.i.d. or at least exchangeable: this is the most natural way to find symmetric sequences in
Lq-spaces. I recall here the
results that are closely related to the problems 1 and 2 and that motivated these questions. Kadec-Pelczynski For
[9J.
q > 2 , every weakly null sequence in
q L :
- either is isomorphic to the unit vector basis of
of
t
2
- or has a subsequence which is almost equivalent to the unit vector basis q t •
Dacunha-Castelle For
D. Aldous
[4J. i-symmetric subspaces of
1,; q < 2 [lJ. (Case
[10J.
Every infinite dimensional subspace of is isomorphic to
for some
More precisely: If there exists (Yn)nE E
p E [l,qJ
on
are means of Orlicz spaces.
q = 1).
J.L. Krivine and B. Maurey
tP
q L
q L , 1,; q < +
00
,
has a subspace which
p
(Xn)nE fl
is a weakly null sequence in
such that for all
e> 0
q L , l,;q<+oo,
there exists a sequence of blocks
(Xn)nE E such that (Yn)nE E is (1+e)-equivalent to the unit t P and almost i.i.d. after a change of density.
vector basis of
This last result answers positively to problem 1 and 2 if we are allowed to take (Xn)nE ~
blocks on
and not only subsequences.
Recall now partial known positive answer to problem 1 : THEOREM 1. If
q> 2 , every weakly null sequence of
q L
has an almost symmetric
subsequence. If of
q 1,; q< 2 , every sequence in L , which is equivalent to the unit vector basis 2 t has an almost symmetric subsequence.
The case
q
1
The case
q E 2E
is due to H.P. Rosenthal. is proved in
The proof of the general case
[ 8J • [6J
uses the theory of stability
[10J
: in stable
spaces, there is a natural way to find almost symmetric sequences. First, recall a
24
few definitions : A Banach
x
space
x
in
and two ultrafilters
lim Ilx + y II = lim m,'!' n m m,'!' The~
defined by
0
V x E X
For
(O!
,rn
V x E
0(X)
E R
2
X
lim n,1/
(xn)nE E
lim n,'/)
*
0
ST(X) = lim
is defined by
+
n
y
1/
and
on
'!'
and
Ii •
m
is a function from
X to
R+
such that
Ilx + x 11 n
n,11 where
and
Ilx
, we define the ~
0!0
(x ) c.,,, n n,- .1." , we have :
is stable if given two bounded sequences
0!0*
ST
by :
lim m,'!'
(xn)nE 11
by
71 and
and
and
V •
Ibll = 0(0) = lim Il x II •
Let
n
n,V The spreading model R ON)
r 3J
def ined by
(x ) nEE n
and
'I'
is the comp let ion of
under the norm k
Ii I: i=l
.e .11
o!. ~
lim n.
~
~
«xn)nE E
is supposed to have no
convergent subsequences).
In a stable space, every spreading model is
i-symmetric. The proof of theorem 1
uses a sufficient condition (S.C.) in stable spaces for a sequence
(xn)nE E
to
have a subsequence which is almost equivalent to the fundamental sequence of its spreading model : Let
(S.C.)
0
(xn)nE ~
be the type defined by
on a stable space
X • If
*
O! k0 and IITII,; 1} K (0) = [T /3: k E:N , 3: O! l' .. • ,O! k E Rk suc h th a t T --O! 10 ~,. ... 1 is relatively compact for the uniform convergence on bounded sets of X , then
(xn)nE 11 by
has a subsequence which is almost equivalent to the spreading model defined
(xn)nE:N
and thus almost symmetric.
This condition was used by J.L. Krivine and B. Maurey in the case of written in
[6J. Its proof uses Ascoli's theorem.
Let now
(Xn)nE ~
q
L
be a weakly null sequence in
equivalent to the unit vector basis of
.e
2
that
(Xn)nE 11
type
0
verifies (S.C.)
defined by
can show that
0
2
L (0 X [0,+ oo[
(Xn)nE 11
00
q > 2
which is this is the
[9J). It is shown in
[6J
suppose for simplicity that 1~ q< 2 and that the q is symmetric ri.e.: V xE L , 0(X)=0(-X)J • One
n
is entirely determined by 0 11 and a function Tf belonging to dt dP @ ----=t1) which is the weak limit in that space of t
Xn)nE E
1 ~ q < +
(even in the case
only case to consider because of Kadec-Pelczynski's result
(1- cos t
.eP-types and is
q
' by the formula:
q V X E L , +00
where
K
q
I
'0
(1 -
25
ux = 1 - cos t
X
and
<,>
is the inner product in
Moreover, this representation has the following property q (] ~ (] unifonnly on bounded sets of L
n
(]
u
n-+ CO
n
in
II(] 1\ n
) Ilcrll
n
-+
+00
To prove theorem 1, it is thus sufficient to show that if 1"
belongs to Lq(dP
@
t
«(])
K
1 d11) • As q
then
1"
=O'n(] n 1
*
1,
n Ct
k
cr n
(u n)nE N
has a subsequence which converges in 2 is equivalent to Q , one can show that:
(Xn)nE N
such that :
whe re
lim
€ (w, t) = 0 a •e • 0 On the other hand, we know that k n t~
0' ,(]
n
1"
1- U
n
n
=
(1 -
u ~ )
[6J
i=l We deduce from t\ese two facts that n 2 - I: (O'~t) A(w) k 1" n i=l ~ U n 2 1 - e + I: (O'~) ~ i=l where
S
n
=
[10' ~ I,
Sup
~
n
n
I: (0',) ~ i=l S
2
(w, S t) n
l,;i,;k}.
Taking a subsequence of k
€
n
(Xn)nE N
'
we can suppose
_ 0 ' >0
TI-+CO
___
~>
0 •
n
2
Then
converges a.e. to
~
1- e -at A(w)
1"
gence theorem
(U n)
2
L (cP 0
d11) • This implies that q q is relatively compact for the unifonn convergence on bounded sets of L nE IN
converges also in
and by Lebesgue's dominated conver-
t
K «(]) 1 prove s theo rem 1 by (S. C.) in tha t ca se.
and
This theorem does not give any answer to problems 1 or 2 for weakly null sequenq ces of L 1,; q < 2 which are not equivalent to the unit vector basis of £2. The following result gives a negative answer to problem 2 in that case : THEOREM 2. Let (i)
1,; q < p< 2
(Xn)nE lli
There exists a bounded sequence in
is equivalent to the unit vector basis of
such that
26
has no almost exchangeable subsequence after any change of
(Xn)nE f>l
(ii)
density. If
(Xn)nE f>l K «(J) 1 q for the uniform convergence on bounded sets of L • (iii)
(J
is the type defined by
Remark: Property (iii)
gives a hope that
(Xn)nE ~
is not relatively compact
has no almost symmetric subse-
quence but this question is still open. In fact, the two natural ways to find almost symmetric subsequences in
Lp-spaces (namely the theory of probability with almost
exchangeable subsequences and the stability of those spaces) do not work for this (Xn)nE ~
sequence
of proof
Sketch
[7J.
-I Let where
U(w, t)
l-e
X(u,w)
1
+00 (l-costu) X(u,w) 0
if
Sk (w) fN
du u 1'+1
u E [O,l/NJ u E [N2k-l,N2k+1J
if
is a fixed positive constant and
(Sk)k E 11
a sequence of i,i,d. random varia-
' k E~) = 1) = P(<:k = 2) = 1/2 and (J(F.: k k I t is possible to verify that U defines a symmetric type (J
bles such that
P(F.:
flJ on
q L (cf. fllJ)
by the way defined in theorem 1 and such that U(J = U
By construction, the function
U
oscilates a lot
no a.e. convergent subsequenceJ and this prevents
f(F.:k)kE 11 K «(J) 1
is i.i.d. and so has
to be uniformly relatively
compact and proves property (iii). On the other hand, it is easy to see that -K t P -2K t P 1- e
l'
,; ~ (w, t) ,; 1 - e
U
satisfies the inequalities
l'
+00
(where
K
l'
r
(1-
o
Using techniques of stability, it is possible to show that these inequalities imply that
(J
is "equivalent" to an
£,1' -type
(Xn)nE ~ This proves property (i) •
be defined by a sequence of
£,1'
.
Lq (cpdP)
and a subsequence
(Xnk\E
(such that
UT = 1- e
-K t P l' )
and thus can
which is equivalent to the unit vector basis
Proof of (ii) : We follow an idea of Suppose that there exists a density
T
f 2J • cp ~
,
an exchangeable sequence of
(Xn)nE ~
such that :
(Zk\E 11
in
27 X
+00 I: k=l
liz
-
k
~ II
cp
<+00 Lq (cpdP)
Then, i t is shown in
that
[ 2J
x .
1
U(w,
cp(w)
nk
17q
~t
(1 _ e 11) = w - lim q k~+oo
CO
)
it Zk = w -
(1 - e
lim k~+
i t Zk
Q
= E Q
Where
is the
t E R+
[1 - e
J
Q =
B
!
W (w) t
[1
o
t
cos(
t~
r
liN
·0
11
co(w)
ro
•
by the functions
W
~ U(w,
t
cp(w)
1/) q
for
(Zk)kE:N)
is also generated by :
Q
liN
When
0
is smaller than the tailfield of
(Q
+00 ~
k E :N
for all
cr-field generated in
Let us show that
w
)
00
u)J X(u,w) q
u
+
u)J pH
co(w)
N 2k+l
+00
du
\1 q
[1- cos(
du p+l
I:
Sk(w)!
(1- cos N2k-1
k=O
u
111 co(w) q
+ 00, it is easy to see that:
t du c tP (1- cos - - , - , - u ) - - ~ co(w) l/q uP+l co(w)p/q
N 2k+l
N 2k+l <.lu d u ! ( --<+00 . 2k-l 1- cos - - - I ' u) p+l ~ 2k-l p+l N co(w)p q u N u t
I
for
k E :N
•
This proves that N 2k+ 3
cp
IN 2k+l ~1 < 2 J 2k+l uN As
(Sk)kE:N
Q
belongs to N 2k+3
generates
~1 u B,
Q
and Sk N 2k+l
I
< .
N
2k-l
also for all
u
k
du p+l
B.
is equal to
This situation is impossible because that would mean that t U(w,
I )=
it Zk 1- e
for all
k E :N
,
cp(w)p q and this is obviously false (for example, This proves property (ii)
t
U(w,
and theorem 2.
co(w)
r/) q
t
~0
because we can write
28 BIBLIOGRAPHIE d Su b spaces a f Ll . v~a ran om measures. Trans. Amer. Math. Soc., 267, (1981), 445-463.
[lJ
ALDOUS D.J.
[2J
BERKES I. and ROSENTHAL H.P. : Almost exchangeable sequences of random variables. To appear in Zeitschrift fUr Wahrscheinlichkeitstheorie verw. Gebiete.
[3J
BRUNEL A. and SUCHESTON L. : On B-convex Banach spaces. Math. Systems theory, t. 7 n 0 4, 1973.
[ 4J
DACUNHA-CASTELLE D. : Variables aleatoires echangeables et espaces d'Orlicz. Seminaire Maurey-Schwartz, Ecole Poly technique, 1974/75, exposes 10 et 11 •
[5J
DACUNHA-CASTELLE D. et KRIVINE J.L. : Sous-espaces de L • Israel Journal of Math., 26 (1977), 320-351.
[6J
GUERRE S.
Types et suites symetriques dans LP , 1,; p<+ A paraitre dans Israel Journal of Math.
[7J
GUERRE S.
Sur les suites presque echangeables dans Preprint.
[8J
JOHNSON W.B., MAUREY B., SCHECHTMAN G., TZAFRIRI L. : Syrrnnetric structures in Banach space s. Memoirs of the American Math. Soc., May 1979, vol. 19, nO 217,
[9J
KADEC H.I. and PEDCZYNSKI A. : Bases, lacunary sequences and complemented subspaces of L • Studia Math., TPXXI, 1962.
1
00
q L , 1,; q< 2 •
[lOJ
KRIVINE J. L. et MAUREY B. : Espaces de Banach stables. Israel Journal of Math., vol. 39, nO 4, (1981).
[llJ
LEVY P.
Theorie de l'addition des variables aleatoires. Gauthier-Villars.
AN APPLICATION OF A MARTINGALE INEQUALITY OF DUBINS AND FREEDMAN TO THE LAW OF LARGE NUMBERS IN BANACH SPACES
Bernard HF,INKEL Departement de Mathematique 7, rue Rene Descartes 67084 STRASBOURG Cedex (France)
ABSTRACT: In a real, separable, p_uniformly smooth Banach space the law of large numbers in the Prohorov setting is studied by a method depending on a result of Dubins and Freedman which compares the distribution of a real valued martingale with the one of the associated conditional variances. Some laws of large numbers of Kolmogorov- Brunk type are also given.
Several recent papers have improved very much the knowledge on the strong law of large numbers (SLLN) and on the law of the iterated logarithm (LIL) for random variables (r. v. ) taking their values in a Banach space€l:jUipped with a regular norm. The key idea in these papers is to use the regularity of the norm for reducing the infinite dimensional SLLN or LIL to a scalar SLLN or LIL. But then for solving this finite dimensional SLLN or LIL problem, sophisticated techniques - multiple truncations, iteration of martingale exponential inequalities - have to be used. Here we will introduce a new approach of the SLLN in a Banach space with a regular norm, approach which allows both to obtain new results in the non i. i. d. setting and also to show in a simpler way statements which were known previously. The cornerstone of this method is a result of Dubins and Freedman [3] which compares the distribution of a real valued martingale and the ones of the associated conditional variances. Before to state and to prove the results we recall some definitions. § 1. SOME DEFINITIONS.
In all the sequel we will denote by (B, II II) a real separable Banach space which is p-uniformly smooth (1 < P " 2); this means that its modulus of smoothness 'If t
> U', P (t) = sup( 1/2 ( Ilx+tYII + IIx-tyll) - 1 , Ilxll = IIYII = 1 ) ,
p:
30
satisfies: p (t)
"c
tP
C being a positive constant. It is well known that the norm II II is differentiable away from the origin
let's de-
note by D the derivative of II II.
If now one as sociates to D the following fonction F p l 'l xt-O F(x) :: Ilxll - D(xl Ilxll) and:
B
~
B'
F( 0):: :)
one can check that F has the following two properties [ 19J ( i)
IIF(x)II
B
,
::
Ilxlltl
~ C
> 0: 'l (x. y) E B 2 . Ilx+yllP - Ilxil P I" p F(x)(y) I + C liYllP P P These two properties are crucial for reducing the infinite dimensional SLLN to a (ii)
I
scalar one. Other geometrical properties of a p-uniformly smooth space that we will use are its reflexivity [5 J and the fact that it is of type p [15
J.
Usually the SLLN problem in (B. II II) is stated in the following way "Let (X ) be a sequence of independent. centered. B-valued r. v. and denote by k Sn ::: Xl +... +X the associated partial sums; under what hypotheses does n (S
In) converge a. s. to 0 ?
n
(1)"
This point of view is very restrictive because two other asymptotic behaviours of the sequence (S
p( sup II S n
n
n
In) are worth of interest:
Inll <+ ~ ) :: 1. which behaviour can be called a bounded law of large
numbers (BLLN)
(2) .
and:
p( w : S (tij In n
~
0 weakly) :: 1. which is a law of large numbers in the weak topo-
logy (WTLLN)
(3) •
The main goal of this paper is to study the three forms (1), (2) and (3) of the LLN. under Prohorov boundedness conditions of the r. v., in p-uniformly smooth spaces; the good geometrical properties of these spaces allow to see that even if the asymptotic behaviours (l), (2) and (3) seem to be close. they happen under hypotheses which are very different. In an appendix we will also state without proof some Kolmogorov-Brunk type SLLN which can also be obtained by applying the same Dubins-Freedman comparison result.
§ 2. PROHOROV'S BOUNDED LAW OF LARGE NUMBERS. A sufficient condition for the BLLN in the classical Prohorov setting is as follows:
31
THEOREM 1 : Let (X ) be a sequence of independent, centered, r. v. with values k in a real separable p-uniformly smooth ( 1 < P " 2 ) Banach space (B, II Ill, such that:
~~ K > 0 : 'l k E (N", wh~~:
Ilxkll" K (k / L k) 2
a. s.
L x = Log(Log sup (x, e)). 2
Let's define for every integer n : lI.(n) = 2-2n
2 sup ( E f (X ) k
I:
IlfIIB." 1)
k E I(n) n l n where: I(n) = (2 + 1, .•• , 2 + ) ,
and suppose that the following hold: a) The sequence ( Sn /n ) is stochastically bounded. b) The sequence ( 2-2np Log n
Ilxkl12P) is stochastically bounded.
I:
k E I(n) c)
:3:
E
> 0
exp (- E
I:
/
lI.(n))
< + al
n ;;, 1
Then:
P(
sup II S n
/n II < + IX))
n
=1
REMARK: One checks easily that condition b) above holds for instance if the sequence ( n -2p L n I: II x l1 2P ) is stochastically bounded. k 2 1 "k"n Now, let's give the proof of Theorem 1.
PROOF: An easy symmetrization argument, similar to the one used in the proof of Lemma 2. 1
of [17
J
shows that it suffices to prove Theorem 1 for symmetrical-
ly distributed r. v. X
• So we only consider that case. Another symmetrization k argument shows that b) implies that the sequence ( 2-2np Log n I: Ilxkl12P) 1 k E I(n) is L -bounded. For technical simplicity we make the following three assumptions - which aren't a loss of generality - : i
):3: K' >
0: 486 K' (2p
'In;;'2, 'lj EI(n),
where of course C
p
+ 2C p ) " 1/8 and :
n Ilx.II"K'2 /Logn a.s., J denotes the constant involved in the fundamental inequality
recalled in Section 1. ii )
I:
exp ( _4K ,2 / lI.(n))
<+
al •
n;;, 1
iii)
sup 2 -2np Log n I: n k E I(n)
E Ilxkf P
Now, let's start the proof itself. 1£ one denotes by T the r. v. 2- n n
"K ,2 .
I:
k E I(n)
X
k
' an easy application of the Borel-
32
Cantelli lemma and of the symmetry of the X
k
shows that the conclusion of Theo-
rem 1 holds if the following is true: 3: t > 0 :
I: p( II T II >t) n:2: 1 n
<+
(4)
dl •
By symmetry [1 J, hypothesis a) implies: sup
E II S
n
n
In
<+ ~ ,
liP
and therefore also:
c:=:
sup E II T If n
n
< + It
•
So (4) will hold if we find t' > 0 such that: I: p( II T liP - E II T liP n:2: 1 n n
> t')
<+
ill •
In order to find such a t', we begin by looking for good bounds for the quantities: u
n
:=: p( II T liP - E II T liP > Zx ) , n n
where x :2: sup (Z. c) will be specified later. Suppos e that the integer n :2: Z is fixed and denote by Z 1 , •.. , Z
(X
j
I
the r. v. Zn Zn)j E I(n) ; we will also denote Tn by T and I\(n) by 1\. The symbol I: will
denote a sum taken on the set of integers (l, ... , Zn). For finding a bound for u First case :
Log n
, we will consider two cases
n
s; K'Z
I 1\.
By adapting a well known trick of Yurinskii [Zl J , Ledoux [19J has noticed that a r. v. of the type
II T liP - E II T liP
II T liP - E II T liP:=:
I:
where:
\:=:
E( II TIIP
and:
3'k:=:
0"
11
k
can be written as a martingale:
'
I 3'k)
-
(Zl , .•• , Zk)
E( II TIIP
I
3'k_l)
,3' 0 :=: ( Q , 0) .
For our needs this decomposition is not precise enough, so we will refine it by setting: V k :=: 1, .•• ,Zn , _ c - A k k where y > 0 will be chosen later, and: B
Q'k :=: E ( II T liP I A 13 k :=: E ( II T liP I
k
Bk
I 3'k ) I 3'k
- E ( II T If I A
) - E ( II T liP I B k
The following inequality obviously holds: P( II T liP - E II T liP> Zx ) s; p(
I: Q'k > x )
In a first step we will bound p( I: Q'k > x ).
+ p(
k
I: 13
k
> x ) .
I 3'k_l)
,
I 3'k - 1
) •
33
In order to do this we notice for beginning that ( 2 : ~ , 3'. ) . n l";k,,j J I"J"Z is a martingale to which we plan to apply the following comparison result of Dubins and Freedman [3 J
:
LEMMA 1 : Let (Sk ' 3'k) be a real valued martingale and denote by (Y k ) its increments. 1£ one defines for every k := 1, .•• , n : Vk
=:
E ( Y
~
I 3'k-l)
V a>O. '1b>O,
- with 3' 0 := ( (/J , 0)
P(3:j:=I, ... ,n:(Y
I
- then:
+... +Y );<,a(V +... +V )+b)"I/(I+alL j I j
For applying this lemma to our situation we need first a bound for the r. v.
I a
k
I
this bound will be obtained by an easy change in the computations of the proof of Lemma 1 in [19 J E ( II TIP I A
:
I 3'k) " E ( II T-zklI P I k
I 3'k) + p E ( I F( T-Zk)(Zk) I A I I 3'k) k k + C E ( II zkllP I 3'k) , Ak p A
I
and:
From these two inequalities one deduces: a
k
"p E( I F(T-Zk)(Zk) I
Ak
+ P E ( I F(T-Zk)(Zk) I
I I J k ) + C E ( IlzkllP I I 3'k ) p Ak
Ak
II3'k_l) + C p E (1lzklP I l3'k_l) Ak
The same computation can obviously be done for - a
I ak
k
, so:
I "p E ( I F(T-Zk)(Zk) I A I l3'k ) + C E ( IlzklP I A l3'k) k p k P +p E ( I F(T-Zk)(Zk) I II3'k_l) + C p E ( Ilzkll I I J k _ 1 )· Ak Ak
1£ one puts now a := P ( U n A ) , one sees that: k 1 "k"Z Z Z E !: E(a~ l3'k 1)" 8pZ sup E( 1rr'-ZkIIZ(p-l)IA ) 11. + 8C KI a / Log n 1 "k "Zn k p 4 3 ,, 8p Z a / sup (E liT _zk I18 (p-l)) 1 /4 11. + 8C Z K'Z a/Log n 1 "k " Zn p By assumption a) and symmetry there exists L > 0, such that: sup sup n n l"k"Z
(E liT _zk I18 (p-l)) 1 /4
"
L.
34
Hence: E I: E
(a~
I d k _1 )
"
3 4
8p2 lI.a /
L
..,
ac"p
+
K'
2
a/Log n
and so by assumption cl , one has for n large enough: E I: E (a 2 d ) s: a 3 / 4 k k-l
I
Applying now Lemma 1 for a ::: a - I /2 and b ::: 1, one obtains: p( I: a
>x)" p( I: a
k
k
+ p(
I: a
1 2 " a /
+
> x, I: a
> x, I: a
k
1 (a /4 /(
1 2 a- / I: E (
:2:
k
k
a~
< a-I /2 I: E (
x-l/ ) "
I d k _1 ) + 1 a~ I d k _ 1 ) + 1
)
1 4
2 a /
From the inequalities: a
"p(
sup 1 "k" 2 n
II
z.
I:
1"j" k
II> y/3) "2 p( II
I: ZJ'
II >y/3 )
J
an application of Hoffmann-J~rgensen'sLemma ( [14 J Lemma 4.4 ) gives that for every y
p(
~
a
k
:2: 81 : > x)" 2 (2)3/4 p
1 2 / ( II I: Zk
I> y/9)
"4 (2)3/4 P( II I: Zk II > y/27)
"
32 p
2
( II I: Zk II > y/81 ).
Now. fix Y:2: sup ( 486. 243 c), such that: 'l n E f'-I ,
32 P( II I: Zk II > y /81 ) "
1/2
- such a choice is of course possible by hypothesis a) -
; y being fixed, we put:
x:::y/243. For such a couple (x, y) one has:
p( II T
liP - E
II T
liP
> 2x) "
(6)
2 p( I: 13 k > x )
In the next step of the proof we will bound the right-hand side of (6) by using another martingale result, also due to Dubins and Freedman [3J
:
LEMMA 2 : Let (Sk ' d k ) l"k"n be a real valued martingale and denote by (Y k ) increments. Suppose that: 'lk:::l, •.. , n Then:
lykl"l
i!2
a.s . .
'l u E R . 'l 'A > 0 , 'l v > 0
'A(u+ Y E ch (
+ ... +Y) n " ch ('Au/v) v + VI + .. ,+V n 1
where: 'l k::: 1, .... n, e(x) ::: exp x - I
V
k
::: E (
Y~
I d k _1)
exp (v e('A/v) ) ,
and:
- x
For applying this lemma to our situation, we first notice that by the same computations as for the r. v. a
k
one has:
35
I 13k I
I F(T-Zk)(Zk)I B I
,;:p E (
k
+ P E (
I F(T-Zk)(Zk)
If for every k = 1 •...• 2
Y
= (yI-p Log n
k
n
I
Bk
l3'k) + C
I I 3
p
E
(II
zklf I B 13
I
) + Cp E (
zkllP I
Bk
I dk-I
) .
one defines:
I K 1 (2p+2C ) ) 13 k p
onehas: Ifk=l. ••. , 2
n
,
IYkl,;:I
a.s.
Now we want to bound: an=P(LY
k
> xyI-PLogn IK'(2 P+2C ) ) • p
by using Lemma 2. Let's notice that: VI +",+V
n
2 2 2 2 ,;:(Logn) (8p 1I+8C K 1 ILogn) p
I K 1 2 (2p+2C )2 ,;: 4 Log n . p
Hence:
a
,;: p(
n
0 I
I K 1 (2p+2C p ) (v+4Log n) ) .
v+V 1+'" +V 2 ) L Y > xyI-p Log n n k
Remembering now that p E
J 1.2 J
and also the assumption i) made on K I. one ob-
tains : an ';:P( (I/v+V + .•. +V ) I 2n
LY
k
> 16Logn/(v+4Logn)).
Applying now Lemma 2 for A = v = 4 Log n. one gets: A an ,;: exp 4Log n I ch(8Log n) ,;: 2 n . So for the integers n -4 u ,;: 4 n
:l:
nO(x) and such that Log n ,;: K 1
2
I 11 • one has:
n
Now we go to the second case. Second case: Log n > K'
2
I 11 •
It is easy to see that the relation (6) remains true - for the same couples (x. y) -
and that the only thing to do is to apply Lemma 2 in a different manner.
If one defines: If k -1, n ••• ,2.
Yk --
one has: n Ifk=I .... ,2 .
a. s.
Furthermore: V + .•. +V I 2n So u
n
s;
K,2(8p2+8C2(K,2 ILog n)) I 11 2 (2p+2C)2 p
p
,;: 4 K ,2 I 11
can be bounded by :
01
v+v + ... +V ) L 13k > xK ' yI-p I (2p+2C ) (IIv+ 4K,2) I p 2n 2 Now we apply Lemma 2 for A = v = 4K 1 I 11 and we obtain: 2 p(
36 u
n
,,2 exp (4K,2
111.) I
ch (8K,2
111.)
,,4 exp _ (4K,2
I
11.) •
Putting together the results obtained in the two cases, one gets 'In :2: N(x), u " 4 ( n- 4 + exp _ (4K,2 I 11.) ). n
Hence: I:
u
n :2: 1
<+!Xl ,
n
and this finishes the proof of Theorem 1.
§ 3. PROHOROV'S LAW OF LARGE NUMBERS IN THE WEAK TOPOLOGY, It is natural to expect that this asymptotic behaviour can be obtained under some
"weak topological refinement" of the hypotheses of Theorem 1. The precise result is as follows:
THEOREM 2 : Let (X ) be a sequence of independent, ce'1tered, r. v. with values k in a real separable, p-uniformly smooth ( 1
I II),
that: 3: K > 0:
'IkE N,
a. s,
Let's define for every integer n and every £ E B' : A(n, f) = 2 -2n I: E f 2 (X ) k k E I(n) Suppose that assumptions a), b),
cl
of Theorem 1 are fulfilled and that the follo-
wing one holds also:
V
d)
E
> 0,
V fEB' ,
exp ( _ E
I:
I
A(n, f))
<
+
til ,
n :2: 1
Then:
In ....
p( w : S (w ) n
0 weakly)
= 1.
PROOF: By Theorem 1, we know that:
p(
sup
II
n
S
n
In
I
<
+
al )
=1
A standard argument then gives [13 J E
sup
I
Sn
In
I
<
+
(7)
al
n
Furthermore the one dimensional Prohorov SLLN [20 'If E B',
f( Sn
In)
....
0
J implies
a. s.
This property and (7) show that (Sn
In ,
0(X
l
, ..• , X ) ) is a weak sequential n
amart of class (B) (for the definition and the main properties of weak sequential amarts, see for instance [4J V.3 ),
37
The space B being reflexive, the conclusion of Theorem 2 follows immediately from a well known convergence theorem of weak sequential amarts due to BruneI and Sucheston [2 J.
§ 4. PROHOROV'S STRONG LAW OF LARGE NUMBERS. The sufficient condition for the SLLN in the Prohorov setting can now be guessed easily from Theorems 1 and 2 ; the statement is as follows : THEOREM 3 : Let (X ) be a sequence of independent, centered, r. v. with values k in a real, separable, p-uniformly smooth ( 1 < P " 2 ) Banach space (B, II II ), such that: 2 K > 0
E i'-I
11 k
a. s.
Suppose that the following hold:
In
....
a I)
Sn
b ')
2-2np Log n
c')
11
in probability.
0
II X
I:
k E I(n) E
>0,
I:
exp (-
k
I
E
11
2p
i\.(n) )
.... o in <+
til
probability .
•
n :2: 1
Then: S
n
In
->
0 a. s.
REMARK: It is easy to see that b ' ) holds for instance if : n - 2p L n I: II X 11 2 p .... 0 in probability. k 2 1 ~"n PROOF: By a classical argument [17
J
it is sufficient to consider the symmetrical
case. By symmetrization ( see [10 J proof of Lemma 1 ) b') implies: b ll )
lim 2-2np Log n n .... + ~
E II X
I:
k E I(n)
k
fP
:: O.
The ideas of the proof of Theorem 3 are the same as those used for proving Theorem 1 ; so we will keep the same notations as in the proof of Theorem 1 and we will only detail what needs to be detailed. First one notices [1 EIITnllp ....
J
that from a') it follows that: E II Sn
In
liP .... 0 , and also:
o.
So the the conclusion of Theorem 3 will be true if we show: Vx >
I:
0,
n :2: 1
p( II T
n
liP - E II T
n
liP> ?,x)
< + dl •
Without loss of generality it suffices to check (8) only for x E
(8 )
J 0,
243- P [ • Fix
such an x. As previously, we denote by u we will bound u
n
n
the general term of the series involved in (8) ;
by considering again two classes of indices n, slightly different
38
from those taken in the proof of Theorem 1. Let's suppose - and this of course isn't a loss of generality - that: n 'In:2:2, 'ljEI(n), Ilx.II,,2 /Logn a.s •• J For simplicity we put 8 :: exp ( -8(2p+2C ) Ix) p
We consider fir st the following situation: I\. Log n" 8.
First case:
It is easy to see that if one puts y :: 243 x
l/p
, relation (6) holds for n large enough.
Now we will again apply Lemma 2 ; for this we introduce the following sequence ( Yk ) :
'l k :: 1, • • •. 2
n
One has: 'l k :: 1, .. "
2
n
I "
Yk
1 a. s.
and:
VI
+... + V 2
where:
d
"
n
n
....
(2(Log n)
2
I\. + 2Log n d
n
),
0. by condition bll) .
Therefore:
V n :
Vl+•.• +V n
,,48Logn.
2
Applying now Lemma 2 with v:: 4 8 Log nand A :: 4 v (2p+2C ) p 3/2 u s: ( 2/ch2Log n ) exp 4 81 12 Log n ,,4 nn
It remains to consider the complementary situation:
A Log n > 8 .
Second case:
If one choos es now:
Y
:: ( 8
k
I
I\. (2p+2C) P
) Sk
one has: 'l k :: 1. ... , 2
n
,
I Yk I
,,1
a. s.
and: 'In:2:n'(x)
VI +... +V
n
"48
2
I
1\..
2
Applying finally Lemma 2 for v :: 4 8 2 u
n
,,(2
I
I
ch(x387/8/21\.)) exp (4x482/Al
I\. and
A:: v x
2
one gets 3 7 ,,4 exp (_ x 8 18 141\.)
Collecting the partial results we obtain: 'l x E JO, 243- P C, II N(x) EU'J, II 'Y (x) > 3/2 'l n:
°:
and (8) immediately follows from hypothesis c ' ) •
lx,
one gets:
39 REMARK: Hypothesis b I) in Theorem 3 seems at first glance somewhat surpnsmg and artificial. To shed light on its meaning we will now give some corollaries of Theorem 3 which will show that b l ) is a very weak hypothesis.
COROLLAR Y 1 : Let (X ) be a sequence of independent, centered, r. v. with k values in a real, separable, p-uniformly smooth ( 1 < P " 2 ) Banach space (B, (B,
I II),
I II)
such that:
I
:3: K > 0 : Vk E N,
Xk
II " K
a. s.
(k/L k) 2
Suppose that the following hold: 1) n- P I: 1/ X liP .... 0 in probability. k 1 "k" n 2)
'l
>0
E
I:
exp (-
E
I
<+~ •
1I.(n))
n ;;, 1
Then:
a. s.
The proof of this result is very easy. First: 2-2np
II
I:
,, (c/Log n) 2-2np
I/2p
X
k E I(n)
I:
I
X
k E I(n)
k
k
liP
and so hypothesis b') of Theorem 3 is fulfilled by applying 1). By symmetrization one has n -p
I:
1
~"n
E
I
.... o
liP
X k
the space B being of type p, it follows that the sequence ( Sn
In )
converges to 0
in LP(B). All the assumptions of Theorem 3 being fulfilled, this ends the proof of Corollary 1. In the case p :: 2, Corollary 1 reduces to Prohorov's SLLN proved in [11 J
•
Let's
stay for a moment in this situation p :: 2 for making precise the difference between the hypotheses of Theorem 3 and those of the result in [11
J•
One observes first
that the following easy corollary of Theorem 3 holds: COROLLAR Y 2 : Let (X ) be a sequence of independent, centered, r. v. k values in a real, separable, 2-uniformly smooth Banach space (B, 1/
II),
that:
II
:3: K > 0: 'l kEN.
X
k
"K (kl L k)
1/
2
a. s.
Suppose that the following hold: a') b ,)
Sin n
( 1
c ,) 'l
I
n
2
.... 0 in probability. L n ) 2
e: > 0 ,
I: 1 "k", n I:
I
X k 1/
exp ( -
E
2
I
.... o ~robability 1I.(n))
< +~
n ;;, 1
Then:
Sin n
.... 0 a. s.
•
with such
40 The gap between hypothesis b ' ) in the above statement and assumption 1) in Corollary I is clear. It is easy to give examples of sequences of r. v. which belong to the domain of application of Corollary 2, but not to the one of Corollary I ; the sequence considered in the last section of [18 In [II
J
J provides
such an example.
it has been noticed that in the special case of Hilbert spaces - which
are of course 2-uniformly smooth - condition 1) in Corollary I is necessary for the 5LLN in the Prohorov setting. More generally, is it possible to simplify the hypotheses of Theorem 3 by making additional cotype restrictions on B ? Godbole has characterized the spaces of cotype q in terms of 5LLN for the sequence ( II X
Ilq / kq-l) ( [6J Theorem 2. I ) ; unfcrtunately his result will of no k help in our situation. The ( partial) result we are able to prove is as follows COROLLARY 3 : Let 3/2 ,;;; p,;;; 2 and 2 ,;;; q,;;; 2p - I ; consider a sequence (X ) of k independent. centered, r. v. with values in a real, separable, p-uniformly smooth Banach space (B,
I Ill.
of cotype q.
Suppose that the following hold: i ) 3: K > 0 ii) V
€
V kEN
> 0
II X k " exp ( -
L
€ /
,;;; K (k/ L k) 2
<+
A(n))
a. s.
l!l
n ;;, I
Then:
5
o a.
/ n
n
s.
5
n
o in
/ n
probability.
PROOF: Of course the only thing to do is to check that the weak law of large numbers implies the strong one. As 5
n
/ n _
0 in probability, it is sufficient to consi-
der the symmetrical case. By cotype q and assumption lim n _ +
n -q
Hence: -2p L n 2n
II
E
L
I s:k:s:n
!II
L
ls:k,;;;n
E II X
k
n,
one has: q :: X Il k
11
O.
2p ,;;; K 2p - q n- q (L n)q+I-2 p 2
L
Ell X
l,;;;k,;;;n
k
Il q
,
and Corollary 3 immediately follows, by application of Theorem 3. REMARK: Let's suppose that conditions i) and ii) of Corollary 3 are fulfilled for a sequence of symmetrically distributed r. v. X • If the weak law of large numbers k q holds, then, by Godbole's result nL II X Ilq 0 a. s •• So the following k implication holds : l,;;;k,;;;n q 5 / n - 0 in probability => n- 0 a. s. n this shows the very special geometric nature of these spaces which are both p-uniformly smooth ( 3/2 s: P ,;;; 2 ) and of cotype q
(2,;;; q s: 2p - I ) •
41
§ 5. APPENDIX
SOME LAWS OF LARGE NUMBERS OF KOLMOGOROV-BRUNK
TYPE.
In [9J and [12J the SLLN of Kolmogorov and Brunk are studied in 2-uniformly smooth spaces under hypotheses requiring both assumptions on the strong and on the weak moments of the r. v •• Results stronger than the classical ones known for type 2 spaces are obtained. By applying - in a more elementary manner as above - the martingale technique developed in section 2, Kolmogorov-Brunk type SLLN can also be obtained in p-uniformly smooth ( 1 < P " 2 ) spaces; in the case p = 2 these results are stronger than the ones given previously in [9J and [12J • The proofs being elementary, we only state the results.
THEOREM 4 : Let (X ) be a sequence of independent, centered, r. v. with values k in a real, separable. p-uniformly smooth ( 1 < P " 2 ) Banach space (B,
I II).
Suppose that: i ) There exists a sequence of positive numbers (d ) , converging to 0, such that j "if j EN. Xj " j dj a. s.
II
ii)
I
Sn / n .... 0 in probability.
1£ one defines for every integer n : r(p,n) = 2-2np
Ell X. J
I:
j E I(n)
11
2p
then the following implication holds
( 3: k integer, k;" 1:
(l\.(n)
I:
+ r(p, n))k < + ~ )
n ;" 1
=>
S
n
/ n
.... 0 a. s.
As an obvious corollary of Theorem 4 one has the following result which can be compared with the well known law of large numbers in type p spaces of HoffmannJ~rgensen and Pisier [15J
COROLLAR Y 4 : Let (X ) be a sequence of independent, centered, r. v. with values k in a real, separable. p-uniformly smooth ( 1 < P " 2 ) Banach space (B,
I II).
Suppos e that: a)
Sn / n .... 0 in probability. I:
b)
k- 2p E
k;" 1 c)
3: j ;" 1 :
I:
I
Xk
2P 11
( !(n) ) j
< + <+
~
~
n ;" 1 Then:
S
n
/ n
.... 0
a. s.
42
Bounded laws of large numbers and laws of large numbers in the weak topology can of course also be considered in the Kolmogorov-Brunk setting. We only give an example of such a result : THEOREM 5 : Let (X ) be a sequence of independent, centered, r. v. with values k in a real separable. p-uniformly smooth ( 1 < P ~ 2 ) Banach space (B,
I II ).
Suppose that: a) The seguence ( Sn / n ) is stochastically bounded. 2p b) I: kE X fp < + ~
I
k :2: 1
c)
3: j
:2:
1:
k
I:
(A (n))j
< + (fJ
II
I
•
n :2: 1
Then:
p(
sup n
Sn / n
<
+al)
=
1
Let's conclude by mentioning an application of the above results. It is well known that all the classical situations in which one can conclude that a Banach space valued. symmetrically distributed r. v.
X - and it is always possible to
reduce to that case - satisfies the LIL. are the union of a Prohorov type BLLN and of a SLLN ( [16 J [8 J
•
.
[7 J
) or
of a law of large numbers in the weak topology
Therefore it is not surprising that Ledoux's recent necessary and sufficient
condition for the LIL in uniformly convex spaces [19J can also be obtained as a corollary of Theorem 1 and Theorem 5. The computations for proving this observation are left to the reader. REFERENCES.
[lJ
DE ACOSTA, A. : Inequalities for B_valued random vectors with applications to the strong law of large numbers. Ann. Prob. 9 (1981), 157-161
[2J
BRUNEL, A. et SUCHESTON. L. : Sur les amarts faibles a valeurs vectorielles. C. R. Acad. Sc. Paris 282, Ser. A (1976). 1011-1014
[3J
DUBINS. L. E. and FREEDMAN. D. A. : A sharper form of the Borel-Cantelli lemma and the strong law. Ann. Math. Stat. 36 (1965). 800 -807
[4J
EGGHE, L. : Stopping time techniques for analysts and probabilists. Cambridge University Press - Cambridge 1984
[5 J
ENFLO. P. : Banach spaces which can be given an equivalent uniformly convex norm. Israel J. of Math. 13 (1972). 281 - 288
[6 J
GODBOLE, A. : Strong laws of large numbers and laws of the iterated loga_ rithm in Banach spaces. PHD. Michigan State University 1984
[7 J
HEINKEL, B. : Relation entre theoreme central-limite et loi du logarithme itere da.'1s les espaces de Banach. Z. Wahrscheinlichkeitstheorie verw. Gebiete 49 (1979). 211-220
43
[8 J
HEINKEL. B. : Sur la loi du logarithme Here dans les espaces reflexiis. Seminaire de Probabilites 16 - 1980/81 - Lecture Notes in Math 920. 602-608
[9J
HEINKEL, B. : On the law of large numbers in 2-uniiormly smooth Banach spaces. Ann. Prob. 12 (1984), 851-857
[lOJ HEINKEL, B. : The non i. i. d. strong law of large numbers in 2-uniiormly smooth Banach spaces. Probability Theory on Vector Spaces III - Lublin 1983 Lecture Notes in Math 1080, 90-118 [11 J HEINKEL, B. : Une extension de la loi des grands nombres de Prohorov. Z. Wahrscheinlichkeitstheorie verw. Gebiete 67 (1984), 349-362 [12J HEINKEL, B. : On Brunk's law of large numbers in some type 2 spaces. to appear in "Probability in Banach spaces 5" - Medford 1984 - , Lecture Notes in Math [13J HOFFMANN-J~RGENSEN, J. : Sums of independent Banach space valued random variables. Studia Math 52 (1974). 159-186 [14J HOFFMANN-J~RGENSEN, J. : Probability in Banach space. Ecole d'ete de Probabilites de St Flour 6 _ 1976 - Lecture Notes in Math 598, 1-186 [15J HOFFMANN-J~RGENSEN, J. and PISIER, G. : The law of large numbers and the central limit theorem in Banach spaces. Ann. Prob. 4 (1976), 587-599 [l6J KUELBS, J. : Kolmogorov law of the iterated logarithm for Banach space valued random variables. Ill. J. of Math. 21 (1977), 784-800 [17J KUELBS, J. and ZINN, J. : Some stability results for vector valued random variables. Ann. Prob. 7 (1979), 75-84 [18J
LEDOUX, M. : Sur une inegalite de H. P. Rosenthal et Ie theoreme limite central dans les espaces de Banach. Israel J. of Math. 50 (1985),290-318
[19 J
LEDOUX, M. : La loi du logarithme itere dans les espaces de Banach uniiormement convexes. C. R. Acad. Sc. Paris 300, Ser. I, n° 17 (1985), 613-616
[20J STOUT, [21 J
w.
F. : Almost sure convergence. Academic Press, New York 1974
YURINSKII, V. V. : Exponential bounds for large deviations. Theor. Prob. Appl. 19 (1974). 154-155
ON
THE
SMALL
BALLS
CONDITION IN
IN
THE
UNIFORMLY
CENTRAL
LIMIT
CONVEX
SPACES
THEOREM
Michel Ledoux Departement de Mathematique, Universite Louis-Pasteur 7, rue Rene-Descartes, F-67084 Strasbourg, France
Introduction. Let
E
be a Banach space. By random variable with values in
we mean a measurable map
X
with its Borel a-field
~(E)
probability measure on
~(E)
copies of
and, for each
X
with values in
E
measure on
such that the image of JP Denote by n
:2:
1 ,
Sn
(X) nnE:JN ~
(O,d, JP) by
X
into
E
IN
E
equipped
defines a Radon
a sequence of independent
Xl + .•• + X • A random variable n
is said to satisfy the central limit theorem (CLT
Ill) n (sn / r"
if the sequence
from some probability space
E
X
in short)
converges weakly to a Gaussian Radon probability
E.
In his remarkable work on the Glivenko-Cantelli problem, observed the following characterization of the
CLT
M.
[T]
Talagrand
for random variables with a
strong second moment which relates the central limit property to conditions on small balls: if the
CLT
lim
X
takes its values in
iff, for each
IlsJ
1m JP[ -
<
E
and
IE[llxI1 2 }
<
QQ
,
then
X
satisfies
E > 0 ,
E }
Iii (Actually, as detail led in
> o •
[T] ,
this equivalence holds in the more general setting
of non-separable range spaces and in the framework of empirical processes.)
Although it seems rather difficult to verify these conditions on small balls, the preceding property is intriguing since it reduces a central limit property in Banach spaces to some kind of weak convergence on the line by taking norm. This property also lies at some intermediate stage since, as we will see below, a random
45 variable K in
X with values in
E
satisfies the
CLT
iff there exists a compact set
E such that S
lim inf IP [ ...E. E K} n- CO /ri
> 0
(Sn/jll)n E ~
and the sequence
Ilsnll lim inf IP [ - -
< M}
>
is stochastically bounded as soon as for some
M> 0
0 •
v£
n-oo
M. Talagrand (oral communication) raised the question whether the equivalence he proved holds without the strong second moment assumption which is not necessary in general for the
CLT • In this note, we answer this question in a positive way
in uniformlY convex spaces. Precisely, we will establish the following result
THEOREM 1 • Let with values in
E
E. Then 2
(1)
lim t t-ao
(ii)
for each
be
a uniformly convex Banach space and X satisfies the
JP[ Ilxll > t}
CLT
X a random variable
iff
0
and E
> 0 ,
Ils)1 lim inf I P [ n-'"
vn
<
E
}
>
o .
This result will follow easily from a new quadratic estimate of sums of independent random variables in uniformly convex spaces obtained in
[L2] •
Preliminary results. We begin this section by a characterization of the which follows easily from the concentration's inequality of M. Kanter
CLT
[K] • I am
grateful to Prof. X. Fernique for useful informations on this result.
PROPOS IT ION 2 Then
X
. Let
satisfies the
lim inf IP[ n- OO
S n
vn
X
CLT
E K }
be a random variable with values in a Banach space iff there is a compact set
>
0
K in
E
E such t fl-'lt
(1)
.
46
Further, the sequence IlsJ lim inf JP[-
(Sn / [ii)n EC:IN
<
M }
is stochastically bounded iff for some
M> 0
o •
>
hi (1)
Proof. The necessity of
(2)
and
(S / Ill) :IN. Assume to begin wit h that nnE
implies the stochastic boundedness of X
is symmetric.
There exist
6 > 0
(2)
is obvious. Let us first show why
and
k
such that for all integers
o
k
~
k
0
n
and
Ilsnkll
IP[-- <
~
M }
>
6 •
>
MJk })"""2
By Kanter's inequality Ilsnll
3
( 1 + kJP[ - -
2
1
~
6
hi
and thus
-
It follows that the sequence JE[llxII
CI }
copy of of that
<
for all
CD
CI
1
'n) n E :IN (s n / V"
is stochastically bounded and also tl'Bt
< 2 • In the non-symmetric case, let
X; the symmetric random variable
X - X'
satisfies
M) and hence the preceding conclusions apply to JE(llxll} <
implies that
00
;
X'
(2 )
be an independent (with
2M
instead
X- X' • In particular, we have
therefore the strong law of large numbers combined with
(2)
X must be centered. Hence the conclusion to the second part of Pro po-
sition 2 holds by classical considerations involving Jensen's inequality. The first part is established in the same way.
Since in cotype 2 spaces, random variables stochastically bounded satisfy the
CLT
X such that
(Sn/jn)n E l'J
is
[p-z] , the previous proposition yields
immediately the following corollary.
COROLLARY 3 with values in
Let
E
E. Then
be a Banach space of cotype 2 and X satisfies the
CLT
iff
(2)
X
a random variable
holds.
47
We now turn to the small balls condition. Since for a centered Gaussian Radon probability, each ball centered at the origin of positive radius has positive mass, it is clearly necessary for a random variable for each
£
IlsJ lim inf JP[-
> 0 ,
>
}
2
showed that when JE[llxI1 } <
[T]
to satisfy the
£
CLT
that
0 .
Jri
n .... oo
M. Talagrand
<
X to satisfy the
(3)
CD,
is also sufficient for
X
CLT • For the sake of completeness, we reproduce here Talagrand's
proof of this result; it will illustrate the idea we will use next in uniformly convex spaces.
THEOREM 4 • Let 2 JE[llxI1 } <
that
Proof. Let
X be a random variable with values in a Banach space Then
00
•
£
> 0
X satisfies the
be fixed
<
liminf
£
}
and
CLT
6(£) > 0
6
(3)
iff
E such
holds.
be such that
6 •
>
Choose a finite dimensional subspace
of
H
E such that if
T
denotes the quotient
E.... E/ and 11.11 the quotient norm given by Ilr(x)11 = d(x,H) , then H 2 JE[IIT(X)11 } :s; 6.£2 • For each n, IIT(S )11 - JE[IIT(S )II} can be written as a
map
n
n
martingale n
IIT(S n )11 -
JE[l/T(S n )II}
with increments
d. , i ~
=
2: d. i= 1 ~
= 1, ••• ,n , such that, for each
i
[yJ) and thus by Chebyschev's inequality
(cf
S
F [IIIT(
Since
~ )11 -
;n
S
E[IIT( ~ )II} I
>
£
Jll
IITII:s; 1 , S
lim 1nf n ....
OO
JP[ IIT( ~)/1 In
<
£
}
>
6 ,
}
:s;
6 •
48 and hence, intersecting, S
lim sup JE[IIT(
< 2
n )II}
£
n .... oo
X therefore satisfies the
CLT
by classical arguments (cf
[P2]).
A short analysis of this proof shows the central role of the martingale trick 2 leading to the quadraUc estimate and of the integrability condition JE[llxI1 } <
CD
providing tightness at some point. An improved version, in uniformly convex spaces, of the previous martingale argument of Yurinskii was recently obtained
in
[L2]
in some work on the law of the iterated logarithm. It will allow us to establish Theorem
which thus characterizes in those spaces the
condition
(3)
and a moment condition which is necessary for the
Recall that a Banach space a
= 6(£)
6
CLT through the small balls
> 0
E
such that for all
CLT .
is uniformly convex if for each x,y
in
E with
Ilxll
= Ilyll
= 1
£
> 0 and
there is Ilx-yll:2: £
1 - II~II > 6 • According to a well-known fundamental result of G. Fisier
one has
2
[P1] , every uniformly convex Banach space admits an equivalent norm (denoted again
E
is p-smooth for some
p > 1
i.e.
11.11) with corresponding modulus of
smoothness
pet) satisfying
sup [ Hllx + tyll + Ilx - tyll) - 1, p(t),s; Kt P
for all
t > 0
1 }
Ilxll
and some positive finite constant
K.
This p-smooth norm is uniformly Frechet-differentiable away from the origin with derivative F (0)
and C
>
P
0
(cf
D : E - [a}
=
0 ,
then
....
E*
IIF (x)11 p
such that i f
F (x) p
= Ilx11 P- 1 for all
x
p 1 Ilxll - D(x/llxll)
=
in
for
x
I
0
and, for some constant
E
[H-J] ),
IIF (x) - F (y)11 P P
,;;
p 1 C II x _ y II -
for all
x,y
in
E.
The following lemma was the key point in the proof of the main result of It will allow to aChieve our wish in the next section.
(4)
[L2].
49
LEMMA 5 • Let satisfying
E
(4)
p > 1
be a p-smooth Banach space for some
Let also
(Y)i"n
E-valued random variables and let
with norm
11.11
be a finite sequence of independent bounded
S
written as a martingale
with increments
where
C
d
i
i
= l, ... ,n
such that, for each
is the constant appearing in
i
(4) •
Before turning to the proof of Thearem 1 , let us point out that a quotient E
norm of
holds true for any quotient norm of
E, property
constant
(4)
is also p-smooth, and, if
11.11
of a p-smooth Banach space
11.11
with uniform
C.
Proof of Theorem 1 • We may and do assume that
11.11
denotes the p-smooth
for some
p
> 1 for which
(4)
E
is equipped with a p-smooth
and Lemma 5 hold. By the previous remark,
these will also hold for every quotient norm with uniform constant moreover
p
C. We assume
<2
Condition
(i)
is well-known to be necessary for
X
to satisfy the
CLT
( [A-A-G] , [p-Z] ). Let us show the suffiency part of the theorem and assume first that
X
is symmetric. Proposition 2 and
(ii)
imply that the sequence
is stochastically bounded and thus, from the integrability results of
(Sn / /ll)n E:N [P2], we
know that
<
sup
CD
•
n
Let
E ~
0
be fixed. For each
n, define
(5 )
50
u.
1.
= u.(n) 1.
1, •• . ,n
i
n
U = L
and set number
(i)
u
i=l
n
and
combine to imply the existence or a real
(ii)
i
such that
0=0(£»0
< £}
lim im IP[ Ilun II
(s n / v" Tn) n
Since the sequence
°.
>
(6)
is stochastically bounded under
E}II
does not contain an isomorphic copy or
c
,Theorem 5.1 or
o
[p-zJ
is pregaussian, that is, there exists a Gaussian random variable the same covariance structure as
and
E
ensures that G in
E with
X. The integrability or Gaussian random vectors
H or
allows then to choose a rinite dimensional subspace denotes the quotient map
(ii)
E such that ir
T
E ..... Ej H '
We now apply Lemma 5 to the sum
T(U )
F
n
P
the Frechet derivative or the quotient norm or
will thererore denote below
EjH' For each
n, we have by
orthogonality,
n S
2p2
2 JE[F (T(U - Ui))(T(U.))}
L i=l
s
P
n
2C
+
2
1
~
E[llu _ u.11 2 (p-1)}
i=l
n
2 IE[IIT(U.)11 p}
r i=l
1.
2p2 n- JEtlIT(G)11 2 }
n
+
1.
(8 )
1.
2C 2 n JE[llu (n)11 2P } 1
since by independence 2 JE[F (T (U - u. ) ) (T (u. ) )} p
n
1.
1.
s
(where the supremum runs over the unit ball
(E/H)~
= Now, by symmetry and
(5) , ror each
i
1, •• • ,n
or the dual or
E/H) and
X
51
K
Further,
lit
,;;
S
JP[
lP[
Ilxll
n l-p
Ilxll ;:.
t } d t
2p
o
1
,;;
S
n
o
>
t
hi. } d
t
2p
so that
o
lim
by
(i)
and dominated convergence. These observations and
(8)
therefore imply that
lim sup n~OO
(by
(7)) and since
JE[IIT(U )II P }
lim sup n-
(6)
holds, by intersection,
<
E
P
n
CtO
which easily implies that
X
In the general case, let random variable so does
2
X - X'
satisfies the
X'
satisfies
CLT
(using
(i)
be an independent copy of (i)
and
(ii)
one more time).
X
; the symmetric
and thus the
CLT
and therefore
X.
Conclusion. It is an open problem to know whether Theorem 1 holds true in any Banach space; since we used the fact that random variables satisfying pregaussian in spaces which do not contain an isomorphic copy of
(ii)
are
co' a general
statement should probably include the condition
(iii)
X
is pregaussian
as an additional (necessary) assumption. It would also be interesting to know in what spaces, random variables satisfying CLT
iff the sequence
(sn / yll In) n Em
(i)
(and possibly
(iii)) verify the
is stochastically bounded (which is weaker
52
than
(ii) ). At the present, only trivial situations in which this happens (such
that cotype 2 spaces or spaces satisfying A-Ros(2)
[Ll]
like
L
P
(1,s; p < DO)
spaces) have been described.
References.
[A-A-G]
de Acosta, A., Araujo, A., Gine, E. : On Poisson measures, Gaussian measures, and the central limit theorem in Banach spaces. Advances in Prob., vol. 4, 1-68, Dekker, New York (1978). Hoffmann-Jprgensen, J. : On the modulus of smoothness and the G -conditions O!
in B-spaces. Aarhus Preprint Series 1974-75, n02 (1975). [K]
Kanter, M. : Probability inequalities for convex sets. J. Multivariate Anal. 6, 222-236
[Ll]
(1978)
Ledoux, M. : Sur une incgalitc de H.P. Rosenthal et Ie theoreme limite central dans les espaces de Banach. Israel J. Math. 50, 290-318 (1985).
[L2]
Ledoux, M. : The law of the iterated logarithm in uniformly convex Banach spaces. Trans. Amer. Math. Soc. (May 1986).
[Pl]
Pisier, G. : Martingales with values in uniformly convex spaces. Israel J. Math. 20, 326-350 (1975).
[P2]
Pisier, G. : Le theoreme de la limite centrale et la loi du logarithme itere dans les espaces de Banach. Scminaire Maurey-Schwartz 1975-76, exposes 3 et 4, Ecole Polytechnique, Paris (1976). Pisier, G., Zinn, J. : On the limit theorems for random variables with values in the spaces
[T]
L
P
(2 ,,; p < (0) • Zeit. fur Wahr. 41, 289-304 (1978).
Talagrand, M. : The Glivenko-Cantelli problem. Annals of Math., to appear (1984).
[y]
Yurinskii, V.V. : Exponential bounds for large deviations. Theor. Probability AppL 19, 154-155 (1974).
OF
SOME
REMARKS
GAUSSIAN
AND
ON
THE
UNIFORM
RADEMACHER
CONVERGENCE
FOURIER
QUADRATIC
FORMS
.lE-
M. Ledoux
M.B. Marcus
Departement de Mathematique
Department of Mathematics
Universi te Louis-Pasteur
Texas A & M University
67084 Strasbourg,
College Station, Texas 77843, U.S.A.
Fr~lce
1. Introduction
Let
{g}CD n n=O
be an L i.d. sequence of normal random variables with mean
zero and variance 1 and let
t
n n=O
of i.i.d. random variables where b e a sequence
be a Rademacher sequence, i.e. a sequence
fE}CD
P(E
O
= 1)
complex numbers satisfying
0f
Let
"Iam,n 12 ~
<00
{a
}
CD
m,n m,n=O
We define Gaussian
m,n and Rademacher Fourier quadratic forms as follows
(1. 1)
X (s,t) g
I:
m
am,ngmgn e
i(ms+nt)
(s,t) E [0,2TI]2
i(ms+nt)
(s , t) E [0, 2TI
and (1 .2)
a
I:
XE(s,t)
m
E E e
m,n m n
i
We are concerned with the uniform convergence a.s. of the stochastic processes
X (s,t)
and
N
n-1
I:
I:
g
lim N
~CD
xE(s,t) . (By convergence we mean
n=l m=O
and similarly for
XE(s,t).) It is known that if these series converge in
probability then they converge uniformly a.s. and in
*
LP
of their sup-norm. This
Professor Marcus was a visiting professor at the University of Strasbourg
when this work was initiated. He is very grateful for the hospitality and friendship that was extended to him while he was there. His work on this paper was also supported by a grant from the National Science Foundation of the U.S.A •.
54
is given in
[2J
forms (cf. also
[g I}
Let
n
and
[3J
respectively for Gaussian and Rademacher quadratic
[9J). and
[EI} n
be independent copies of
from the decoupling inequalities that for all
p
Ellx g (s,t)II
P
E[J
~
I:
m
n
~
a
as in (1.1) and (1.2) with finitely many non zero
(1 .3)
[E } . It follows
m,n
' i(ms + nt )II P am,ngmgne
and (1.4)
EIIx (s,t)/I E
P
I:
Ell
a
m
E E ' e i(ms+nt)II P
m,n m n
where the constants of equivalence only depend on
p
and
11.11
indicates
sup 2 I· I . We will show how (1.3) and (1.4) follow from the usual (s,t)E [O,2nJ decoupling inequalities in the Appendix.
We associate with
[xg(s,t)}(s,t) E[o,2nJ2
and
[xE(s,t)}(s,t)E [O,2nJ2
the
pseudometric (1. 5)
where
I: Ia 12 1e i (ms + nt) _ e i (ms ' + nt' ) 12 ) 1/2 m< n m,n
(s,t), (s',t') E [o,2nJ 2 . As usual let
entropy of radius
(
d( (s , t ) , (s ' , t ,) )
d
[O,2nJ2
E> 0
with respect to
in the pseudometric
N([O,2nJ2,d;E)
denote the metric
d, i.e. the minimum number of open balls of d
that covers
[O,2nJ2. A number of people
have recognized that
SCD logN([O,2 n J 2 ,d;E)
(1 .6)
dE
<
00
o
is a sufficient condition for the uniform convergence a.s. of (1.1) or (1.2). It appears for example in
[4J
(stated in a somewhat stronger form involving
majorizing measures). In fact C. Borell pointed this out to us and raised the question, can this sufficient condition be improved? The suspicion that (1.6) can be improved is reasonable because there are some obvious cases in which
55
xg (s,t)
is sufficient for the uniform convergence a.s. of these
is when
a
or
x (s,t) • One of E
is a product; (details will be given in § 2). Nevertheless,
m,n
no smaller function of the metric entropy than the one appearing in (1.6) suffices as a sufficient Condition for the uniform convergence a.s. of To be more precise, for all functions increasing and
lim x
~
= 0
f(x)/x
f : R+ ~ R+
such that
Xg(s,t)
or
f(O) = 0
XE(s,t) f
is
(and also satisfies a weak smoothness condition
CD
which will be given in § 2), we give examples of Gaussian and Rademacher Fourier quadratic forms which are not uniformly convergent a.s. but for which 2
CD
S
f(logN([0,2 n J ,d;E)) dE
<
CD
•
o
These examples also show that contrary to our experience with Gaussian processes no condition on the metric
d
alone can give necessary and sufficient conditions
for the uniform convergence a.s. of (1.1).
Pertaining to our study, X. Fernique
[6J
has obtained an important
corollary of the deep recent result of M. Talagrand
[13J
on the existence of a
majorizing measure for bounded Gaussian processes. Fernique's corollary deals with stationary vector valued Gaussian processes ; using this corollary (in a form adapted to our needs, Theorem 3.1 of this paper) and Theorem 1.1 of
[S,Chapter IJ
we obtain the following theorem which extends the equivalence between Gaussian and Rademacher Fourier series to quadratic forms.
Theorem 1.1. Let
tx/s,t)}(s,t)E[0,2nJ2
and
(x/s,t)}(s,t)E[0,2 n J2
as given in (1.1) and (1.2). If only finitely many of the coefficients non zero, then for all
(1 . S)
~
m,n
are
1
Ellx g (s,t)II P
where, as above, only on
p
a
be
11.11 =
sup 2 (s,t) E [0,2nJ
p . In particular,
and (l.S) holds in this case.
xg(s,t)
1.1
and the constants of equivalence depend
converges uniformly a.s. iff
xE(s,t)
does
56 Developping further the arguments of the proof of Theorem 1.1 , we characterize
xy (s,t)
the a.s. uniform convergence of
in terms of one-dimensional
entropies following Fernique's characterization of boundedness and continuity of vector valued stationary Gaussian processes. Assume for notational convenience that a
o
m,n
m~ n
if
(1 .9)
and set for
s, s', t, t'
d((s,O),(s' ,0))
and for
j
E
and
[0,2nJ d((O,t),(O,t'))
d (t,t') 2
1, 2
S
00
(1.10)
o
Recall that the space
C
(log N([0,2n J,d.;E))
1/2
dE.
J
, of a.s. continuous random Fourier series, is defined
a.s.
as the space of all sequences of complex numbers
a
=
[an}:o
in
-t 2
such that
t E [0,2nJ converges uniformly a.s. equipped with the norm
[a] The dual space
c*a.s.
space of multipliers
of
C
a.s.
has been carefully described by G. Pisier as a
M(2'~2) ; we refer the reader to
[10J, [8J
for further
details. We will not be directly concerned with this here although various interpretations of what we obtain can be given in the language of Define for each
m and
the one-dimensional sequences
and
(1.11) Set further for each
n
[lOJ.
T
in
C*
a.s.
and
j
, 2
t, t' E [0,2nJ
whenever it is defined. As a corollary of the results of Talagrand and Fernique we obtain ;
57
Theorem 1.2. Let
f
1.
Xg (S , t)} ( s , t ) E [ (I, 2n J2
finitely many non zero coefficients
E
a
m,n
Ilxg (s,t)11
Then for
+
j
= 1 or 2
J 1/ ([0,2n J,d 1 ) 2
+
(1.12)
be as given in (1.1) with only
+ T
sup J1/r([0,2nJ,d.) T E C* ~ J a.s.
IITII
$1
is
where the constants of equivalence are numerical constants and defined as in
(1.10) with
d~
instead of
J
d. J
We note that by considering Cauchy sequences Theorem 1.2 gives necessary and sufficient conditions for uniform also of
converge~ce
a.s. of
xg (s,t)
and, by Theorem 1.1 ,
XE(s,t) . Actually it is easily seen that, by Theorem 1.1 of
[S,Chapterr J ,
(1.12) also implies (l.S) .
It is customary to leave out the "diagonal" (Le. the terms
a
m,m
) when
studying random quadratics forms. For one thing (1.3) and (1.4) are no longer true, in general, if
a
m,m
J
0 . Furthermore, tte diagonal terms can be handled separately.
We write (1.13)
" 2im(s+t) "" a g e m m,m m
" im(s+ t) "" am,m e
+
m
By standard symmetrization argument and Theorem 1.1
[S,ChapterrJ , the second
series on the right in (1.13) converges u~iformly a.s. if and only if " imu "" a g e m m,m m
u E [0,2nJ
converges uniformly a.s .. The first series on the right in (1.13) is a deterministic Fourier series and the dichotomy betweel. unboundedness and uniform convergence a. s. is no longer relevant. Obviously in the Rademacher case one only gets on the right in (1.13)
" im(s+t) "" a e m m,m
58
2. Random Fourier quadratic forms and entropy
The sufficient conditions that we know for the uniform convergence a.s. of (1.1) and (1.2) follow immediately from well known results and techniques. We will present them here for the convenience of the reader.
Theorem 2.1. If
<
J1([o,2nJ2,d)
the Fourier quadratic forms
00
(1.1) and
(1.2) converge uniformly a.s. and
(2. 1)
where
2 ) 1/2
+
is a numerical constant. (2.1) is also valid with
x
E
C
C ( (I:
:s;
sup 2 Ix (s,t) I (s,t)E[O,2TIJ g
I
m< n
a
1
m,n
Conversely, if either of the Fourier quadratic forms J1/2([O,2nJ2,d) <
a.s. then
CD
easily obtain from the results of C > 0
P (
or
X
E
X
E
converge uniformly
•
be complex numbers satisfying
Proof. Let
constant
X g
replaced by
g
[12J
or
[14J
I: Ib 1 m
2
<
00
•
One can
that there exists an absolute
such that
I: b g g I m
> A ( I: Ib m
An estimate similar to (2.2) holds when
1
m,n
2 ) 1/2 )
[g} n
C exp ( - A/ C) ,
is replaced by
[E n } ,
'if A > 0 •
(see e.g. [lJ ).
Using these estimates Theorem 2.1 follows from the usual extensions of Dudley's theorem as presented for example in [11J
E exp ~ I
or
[5J
since we have that
x (s,t) - x (s',t') g
g
<
00
d((s,t),(s',t')) for some
~
> 0
and similarly with
x
g
replaced by
X
E
For the converse we note that by (1.3) the uniform convergence a.s. of (1.1) implies that of I: ( I: a g') g e n m,n n m m
ims
s E [o,2nJ
59
in which, to simplify the notation, we take independent from
am,n
=
°
if
m
~
n • Since
[g } is m
and is symmetric, the uniform convergence a.s. of
(1.1) implies, by Theorem 1.1
[8 ,Chapter I] , that of
s E [O,2n] ,
which is equivalent to
where
d
is defined in (1.9). Sinilarly we sec that
1
By the triangle inequality the metric
d
d((s,s'),(t,t'))
dpfinpo in (1.5) satisfies
+
Therefore (2.5)
and thus we see by (2.3), (2.4) ano (2.5) that the uniform convergence a.s. of (1.1) implies
J1//[O,2n]2,d)
<
CD
Exactly the same argument applies i f (1.1) is
•
replaced by (1.2) •
In an analogous fashion one can show that OO
S
r
~
(log N ([O,2n] ,d; E))
r/~
~dE
<
00
°
implies the uniform convergence a.~;. of r-dimensional tetrahedral (i.e. attention is restricted to the subset of
a
]TIl
7· • •
,m
for which
and Rademacher Fourier forms. Here, as above, distance in
L
2
m < m') < ••• < m ) Gaussian r 1 <-
r
d
denotes the corresponding
•
We now give examples which show that if one wishes to give metric entropy conditions for the uniform convergence a.s. of (1.1) or (1.2) solely in terms of the metric
d, then the sufficient: condition of Theorem 2.1 can not be improved.
60 1'(0) lim
f(x)
=
=
m . Furthermore we assume that for some C > 0 and all n
(Such a condition will follow if for some positive {,inite numbers f(2x)
:s; K
N. > N J
f(x)
for all
[b .}
large enough
o
M and
K,
tN.} be a sequence of integers such that
Let
M .)
x:;'
=d let
+ ••• + N. 1 J-
1
lim x/f(x) x - co
0 , be an increasing function satisfYing
J
be a sequence of positive numbers. Their precise
J
00
I: b~ N < j=1 J J
values will be specified later but we assume already that
2
00
Define
00
xg (s,t)
(s,t) E [0,2n]2
I: b. I: j=1 J m,n E I(j)
m
=
tNJ.}
*
0 • Note that and
tb J.}
tn
I(j) -
j:;, 1 ,
I(j)
1
+ ••. + N _
j 1
:s;
n < N + .,. + N } j 1
We will show that for some appropriate choice of
Nj
=
: N
depending upon
l' , the process
X
g
defined by (2.7) does not
converge uniformly a.s. even though
J
00
2
f(logN([0,2 n ] ,d;E)) dE
<
00
o
where
d
is the metric associated with
as given in (1.5).
X
g
By independence and Jensen's inequality we see that for each g g e i(ms
I:
b.
J m,n E I(j) m
and, by Remark 1.3
E
II
+nt )11
sup
m n
j :s; J
[S,Chapter VI] , for each
b.
Ell
J
J:;' 1 gmgnei(ms+nt)11
I:
m,n E I(j) m
j
I:
m,n E I(j) m
~ (E
:;,
(Throughout this example
I
intl2 g e sup I: n t E [0,2n] n E I(j)
-
/2N.) J
C N. log F. J
J
C will denote a strictly positive constant, possibly
changing from line to line.) Thus we have that
61
sup
(2.8)
E
J I:
II
j=1
J ;;, 1
II
g g e i(ms +nt) b. I: m n J m,nE I(j)
;;,
C
sup j ;;, 1
b.N .log N. J J
J
m
d((s,s'),(t,t'))
2
00
I:
I
"
b. '" e j=1 J m,n E I(j)
(
i(ms +nt) _ ei(ms' +nt') 12 )1/2
m
where
6
6(s,s')
+
6(t,t')
denotes the one-dimensional metric given by
~ b~
(
6(t,t')
j=1
N.
le int _ e int ' 12 ) 1/2
I:
J n EI(j)
J
It follows that 00
~
2
S
f(2 log N ([O,2nJ,6;E)) dE
o s > 0
We now estimate this entropy. For
~(s )
sup
lu I ~s
and each
j
&. (s)
6(u,O)
sup ~
lui
J
[S, p.123-124 and Lemma 3.6 Chapter IIJ
Following
we define (I:
s nE I(j)
le inu _ 11 2 ) 1/2 •
we see that
00
S
o
f ( 2 log N ([ 0 , 2n J , 6 ; E)) dE 00
1
/
C ( ( I: b~ N~ ) 1/2 j=1 J J
~
+
S
o
00
C ( ( I: b~ N~ ) 1/2 j=1 J J
~
1
f ( 2 locr
00
+
b.,;'N":"
I:
j= 1
J
J
~ ) do (s) )
S1
1
f (2 log - ) do. (s) ) 0 s J
Furthermore,
So1 f(21og
1
- ) dO.(s) s J CD
~
Now for each
j
r; .(s) J
and
f(2)
&.(1) J
I:
+
k=1
s > 0 N .-"1 J
sup (I: lu I ~s n=O
Ie inu
+
&.(2- k ) (f(2k+2) - f(2k)) • J
62 Let us denote by
To(s)
TJ (s)
to see that
~ 2 ~ , however, when J
(
J
for
N.-l
~
'1". 2 -k) A
the first torm of the right hand side of (2.9). It is easy
J
2- k (
~
n2) 1/2
n=O
sufficiently large. Using these estimates for
j
'1".(2 -k) A
and (2.6) we see
J
that for
sufficiently large
j 00
L
k=l
[log2 Nj]
T.(2-
k
) (f(2k+2) - f(2k))
2 ~ (f(2k+2) - f(2k))
L
J
J
k=l 00
L
+
k=[ log N.J+ 1 2 J
c IN:
~
By a similar decomposition of the sum on 00
~ [(N 1+ ••• +N j _ 1 ) k=l J L
~
CD
L
f f ( N. 2-
k=l
J
2-
k
k
II
II
J
k
f ( 310g N . )
J
we get
2] (£(2k+2) - f(2k)) ~
2) (£'(2k+2) - f(2k))
c $
J
J
f (3log N . ) J
Putting this all together we see that
(2.10)
Since
+
f
satisfies
lim x
~
~ CD
J J
J
we can first choose
00
tNJ.}
such that
o
lim j
o ,
fex) / x
b.N.f (31ogN.) ) •
logN. J
and then take b. J
(j 2 N.f(3logN. ) )-1 J
J
j
For these Choices, we see from (2.10) that
~
1 •
Jf([O,2n]2,d)
<
CD
even though,
by (2.8) and the integrability properties of Gaussian quadratic forms, does not converge uniformly a.s ..
x (s,t) g
63 Thus (1.6) Can not be replaced by (1.7), with any increasing function satisfying
f(x) /x = 0
lim
f
and (2.6), as a sufficient condition for the
X~CD
uniform convergence a.s. of (1.1). Note that exactly the same argument applies for the corresponding Rademacher Fourier quadratic form, i.e., with
In some cases
<
J1/2([0,2nJ2,d)
tE E } m n
replacing
is necessary and sufficient for the
00
uniform convergence a.s. of (1.1) ilnd (1.2). One of these, which is trivial, is when the coefficients
fa m,n }
vanish outside some one-dimensional set of indices.
1.
We write (1.1) in the form
(2.11)
(s,t) E [0,2nJ2
2: k
m < n
where
are non-negative integers. Using Gaussian decoupling we see that this
series converges uniformlY a.s. if and only if (s,t) E [o,2nJ
(2.12) converges uniformly a.s. where tgmkgrik}
f
g
1. mk
2
gnk }
Theorem 2.2. Let
o
E
[0,2nJ
x (s,t ) g
and
CD
it follows from Theorem 1.1
Once again exactly the same argument
•
in (2.11) is replaced by 2
J1/2([0,2nJ ,d)
marginal processes formed from
t
<
J1/2([0,2nJ ,d)
The finiteness of
the processes
k
that the series in (2.12) and consequently (2.11) converges uniformly
a.s. if and only if holdS if
is an independent copy of
tg' } nk
is an independent symmetric sequence in
[8 ,Chapter I J
2
0
s
o
E
uniformly a.s. for some
Xg(s,t)
x g (s,t) and
tEmkEnk}
also implies the continuity a.s. of the and
xE(s,t) •
be as given in (1.1). Then if
X (s ,t) g
0
are uniformly convergent a.s. for each fixed
[0,2nJ • Conversely if t
o
E
This theorem is also valid if
[0,2nJ X
g
(o,t)
and
s
o
X (s,t ) g
E
0
[0,2nJ
is replaced by
and then
X (s ,t) g
0
converge
J 1/2 ([0 ,2n J2 ,d)
xE(s,t) •
<
00
•
64 Proof. Let
(2.13)
d (s,s') 1 ~
N([0,2TI],d.;2E) J
We will show this for (sk,t ) , k
j
and
=
d (t,t') 2
N([0,2 TI ],d;E)
1 . The proof when
k = 1 , ... , N([0,2TIJ2,d;E)
that cover
[0,2TI]2
Let
be as defined in (1.9) . Let us note that
in the metric
j
j
=
1 , 2 •
2
be the centers of the balls of radius
d
For a fixed
be some fixed element in
is completely similar. Let
k
E
consider
B . It follows from the triangle inequality E,k
that B E,k
C
U £(s,O) E [0,2TI]2
since
d1(s'~k)
Since that i f
=
d((S'O)'(~k'O))
J 1//[0 ,2 TI J2 , d)
<
we have verified (2.13) when
then
CD
g e m
J 1//[0, 2TI], d )
1
ims
<
j = 1 • It follows
and this implies that
CD
s E [0,2 TI J
converges uniformly a.s .• By Theorem 1.1
[8 ,Chapter I J , the uniform convergence
a.s. of (2.14) is equivalent to that of s E [0, 2TI J
which, by decoupling inequalities similar to (1.3), implies the uniform convergence a.s. of m~ n
X (s,t ) . (To avoid confusing notation assume that g
a
0
in all these series.) A similar argument shows that
m,n
X (s ,t) g
0
o
for all
converges
uniformly a.s •• The converse follows from the proof of Theorem 2.1 since in the relevant part of the proof of Theorem 2.1 the only property of the continuity of X (s,t) g
that is used is that its marginals are continuous. All the above
statements remain valid when
xg(s,t)
is replaced by
The follOWing corollary is immediate.
xE(s,t) .
65 Corollary 2.3. Assume that
12
• Then
<
J 1/2 ([0 ,2n J2 , d)
convergence a.s. of
Xg(s,t)
2 1.f a }
Proof. Since
m
00
and
a
m,n
=
a a
[an } is a real sequence in
where
m n
is necessary and sufficient for the uniform x£(s,t) .
E .{., ,,1 and because of I:
uniformly a.s. if and only if
amgme
ims
x (s,t) g
converges
converges uniformly a.s •. The result
m
now follows from Theorem 2.2. The same proof
is valid for
X£ (s, t) •
Now let us consider stochastic processes of the form •
I: a ~ g g e m,n m,n m n m< n
where
[am,n}
and
[gn}
i(ms+nt)
(s,t) E [0,2nJ
P (£0,0
[g} • It follows from Theorem 1.1 n
the series in (2.15) converges uniformly a.s. if and only if where Let
d
is a doubly
are as given in (1.1) but where
indexed i.i.d. sequence of random variables with which is independent of
2
P ( £
1
= -1 )
0,0
"2
,
that
[8 ,Chapter I J
J1/2([0,2nJ2,d) <
00
is as given in (1.5). This observation has an interesting interpretation.
(O,J,p)
be a probability space on which
[£m,n}
w E 0
is defined. For each
we consider the Gaussian Fourier quadratic form ) I: a £ ( wgge m,n m,n m n m
(2.16)
The metric
d
of
Furthermore, i f
wE 0
i(ms +nt)
(s,t) E [0,2nJ2 .
(as defined in (1.5)) is the same for all these processes irregardless J1//[0,2nJ2,d) <
00
then for almost all
wE 0
the
series in (2.16) converge uniformly a.s. (With respect to the probability space supporting
[gn})' However, the series in (2.16) do not necessarily converge
uniformlY a.s. for all
w EO. This can be seen from the examples we just gave of
Gaussian Fourier quadratic forms which do not converge uniformly a.s. but for which 2 J / ([0,2nJ ,d) 1 2
<
00
•
Thus, unlike the random Fourier series studied in
continuity of Gaussian Fourier quadratic forms is not determined by the d
given in (1.5). Once again exactly the same argument applies if
replaced by
[8J 2 L
[gn}
[£ n } in (2.16).
Nevertheless, entropy still can be used to characterize a.S. uniform
is
metric
66 convergence of Gaussian and Rademacher Fourier quadratic forms. However, as we will see in the next section, it is necessary to consider the supremum of classical onedimensional entropies over a family of metrics.
3. Gaussian random Fourier series with coefficients in a Banach space
Fernique's corollary
Theorem 3.1. Let sequence of elements of
E t
[6J
(B,II.II)
of Talagrand's theorem
[13J
be a Banach space with dual
is the following.
B*. Let
[x} n
be a
B with only finitely many non zero terms. Then
sup II I: g x e int II n n E [0,2nJ n
E
I I: gn xn II n
+
sup E sup lI:g<x*,x>eintl Ilx*II':;l t E [0,2nJ n n n
x* E B* •
where
Therefore uniform convergence a.s. of random Fourier series with coefficients in a Banach space is characterized through conditions involving a family of classical one-dimensional entropies. We will see in the proof of Theorem 1.2 that (3.1) can be used to obtain a similar result for Gaussian and Rademacher Fourier quadratic forms. Using a set of one-dimensional entropy conditions instead of a single two-dimensional entropy condition to characterize uniform convergence a.s. of random Fourier quadratic forms seems to be necessary, as was shown by the class of examples described in § 2. Theorem 3.1 sheds some light on these examples. Indeed, the quadratic form (2.7) can almost be realized as a random Fourier series with coefficients in a Hilbert space. Define a sequence
[x } n
setting 'if J :;, 1 ,
where
[~}
'if n E I(j) ,
x
n
denotes the canonical basis of
x(t)
t
-t 2
E [0, 2n J
• Consider
of elements in
t2
by
67 Clearly I:
+
I: b.N. J J J
+
min
< x ,x > g g e m
Ilx(t)11
tb.}
and
J
2
i(m -n)t
m n
I: g g cos((m -n)t) m n J m,n E I(j)
2 I: b.
J
m
Thus
n
is closely related to the quadratic form (2.7). For the choices of in § 2,
tN.} J
x.*
(
=
J
X(t)
is unbounded a.s. since for
1 ) 1/2 I: e kE I(j) k Nj
( Ilx*11 J
1 )
we have sup I I: <x*,x>geinti t E [O,2n] n J n n
E
E
sup tE[O,2n]
I b:/2
C (b.N .1ogN . J J J
J
f
int
I
)1/2
C. Likewise the examples of § 2 show that no condition
for some absolute constant of the form
J
g e nEI(j) n I:
([O,2TI],6)
<
m
(see (1.7)) where
6(t,t') is sufficient for the uniform convergence a.s. of the Gaussian quadratic form in (3.2). (The relationship between Theorem 3.1 and metric entropy conditions is clearly E
sup II I: g x eintll t E[O,2n] n n n
where d At,t') x
n
We now use Theorem 3.1 to obtain Theorem 1.1. Proof of Theorem 1.1. We will give the proof in the case (1.4) we will prove this theorem with X'(s,t) g
Xg(s,t) and XE(s,t) ( s , t) E [
p ~ 1 • By (1.3) and
replaced respectively by
°,2n ] 2
68
and , i(ms+ nt) a E E e I: m,n m n m
X;(s,t)
where, to simplify the notation we take finitely many of the
and
m ;;, n
if
°
'"
m,n
are non zero. Let us define and
[g'} n
and
[O,2n]2
Recall that only
[gm}
and
[E } m
E
wE 0
E,E ,E, g E g
(0)<0' ,3<x3<' ,PXP')
denote expectation on the product space
and consider
f " g (w)e L ~ a m m,n m
ims
on the
on the probability space
[E '} n
and denote their corresponding expectation operators by
EE' • Let
us fix
m,n
(0,3< ,p)
probabili ty space
(0' ,J' ,p' )
a
a
E
(s,t)
}
. as a sequence In
in the Banach space of continuous complex valued functions on
n
Let
of elements
[O,2TI] • By Theorem
3.1 we have
E , sup g s,t
1 I:
( I: a
n
m
g (w) e ims) ,int e m,nm gn
+
By Theorem 1.4
[8 ,Chapter I]
E ,sup g s
1
sup sup s Eg , t
[g'}
we can replace
1
I: ( I: a g (w)e n m m,n m
[E'}
by
n
n
(3.3). Doing this and taking expectation with respect to
E sup IX'(s,t) I
s,t
g
E sup s
1
I: ( I: n m
f
L
imS
) g'eintl n
in the last term in
gm}
we get
ims am,ng~) gme
(3.4) +
E
g
ims E ' e int) g e sup I I: ( I: a sup E m,n n m E' t m n s
Let us denote the first and second term to the right of the equivalence sign in (3.4) by
I
and
I I . By Theorem 1.4
2 1 ) 1/2 E e E sup I I: ( I: la m,n m s m n
I
E sup s In analyzing
II
(3.5)
[8 ,Chapter I]
II :s;
I: ( I: a E' ) E e ims I m,n n m m n
we repeat the steps of
E sup s,t
ims
1
I: ( I: a m,n En' e int) g e m m n
E'e E sup \ I: ( I: a m,n n t n m
int
I
:s;
C3 . 3)
applied twice
and
E sup s,t
C3. 4 )
1
x;(s,t)
1
and obtain
ims
) gm l + EE t sup E sup 1I: (I: a E 'e int) g e ims t g s m n m,n n m
1
69 Denote the first term to the left of the equivalence sign in (3.5) by second by
I'
and the
II' • Using the same argument as above we have
E sup II: ( I: a g ) E'e m,n m n t n m
I'
I: ( I: a E) C'e m,n m n n m
E sup t We use Theorem 1.4
[S ,Chapter I]
on
int
int
s;
E sup I
s,t
II'
x~(s,t) I
and obtain
' int) E e ims EE' sup EE sup I I: ( I: a m,n En e m t s m n
II'
E sup I x~(s,t) I s,t
s;
Thus we have shown that E Ilx~(s,t)11
(3.6)
s;
C E Ilx~(s,t)11 C. The reverse inequality in (3.6) is obtained by
for some absolute constant
the contraction principle. This completes the proof of Theorem 1.1.
We finally prove Theorem 1.2.
Proof of Theorem 1.2. It will be sufficient to show that
E Ilx (s,t)11 g
+
E
sup E [0, 2TI]
t
sup
+
T E C*
a.s.
II for
j
=
1 or 2
n
m
E sup sE [0,2TI]
m
Til s; 1
where as before
a
m,n
= 0
if
m
~
n
have been defined in (1.11). Indeed, Theorem 1.4
tAj} m
and where the sequences [S,Chapter I]
then clearly
implies (1.12). Following the notation of the preceding proof we show (3.7) for j
=
2
and with the left hand side replaced by
Ellx'(s,t)11 g
proof of Theorem 1.1 (cL (3.3), (3.4) and the estimate of
by (1.3). As in the I), we have
70
E sup s,t
(3.8)
I X·g (s , t) I
E sup s
II
where
is the norm on
[.]
m
n
1
m,n
E sup E • sup g g t s
+
Let us call
I: ( I: la
2 )1/2
g eimsl m
( I: a g e imS) g' e m,n m n
I I: n
int
ITt
the second term to the right of (3.8). It is plain that
C . (3.7) then follows easily from a new application a.s.
of Theorem 3.1 but this time in
C . Indeed, a.s.
II
sup
+
T E C*
IITII and by Theorem 1.4
a.s. :s; 1
E sup s
[8 ,Chapter I] ,
2 E [I: Amgm ] g m
E sup
I: ( I: a
n
t
E sup t
m
I: ( I:
n
I
g) 9' e int
m,n m
n
lam,nI2)'/2g~CLnt I
m
Appendix. Proof of (1.3) and (1.4) : the argument that we give here was shown to us by Gilles Pisier. We will first prove (1.4). It follows from Theorem 2 , [7} for
that
p;;' 1
(A. 1 )
E II
I:
m
p . Therefore by the triangle
inequality , i(ms+nt)II P E E e CEil I: a p m,n m n m
C'
p
E II
I:
m
a
E E'ei(ms+nt)II P
m,n m n
E II
I:
m
a
m,n
,)i(ms+nt)II P • , ( EE+EEe m n
m n
71
Let
8 0
be a real valued random variable uniformly distributed on
[8n } be an i.i.d. sequence. Let
sn
=
in8 en,
s~ =
[8'} n
[O,2TIJ
be an independent copy of
and let
[8}. We define n
in8' e n . It follows from the contraction principle for quadratic
forms, Theorem 1 , [7J
that (A.2) holds i f Ell
(A. 3) We replace
L: a (S S'+S'S )ei(ms+nt)II P m
(s, t)
and
expression on the right in (A.3) where
s
o
and
t
o
by
(s-s ,t-t ) o
are fixed in
0
H .
in the
[0,2TIJ • These
changes do not change the numerical value of this term, i.e.
Finally we note that by Jensen's inequality 2TI 2TI
So S0 :;" Ell
H ds dt o 0
, L: a (S S +_1_ m< n m, n m n 4TI 2
S2TIS2TI e . [ ( n-m ) so+ ( m-n ) to J ds l
0
0
dt 0
0
Thus we have obtained (A.3) and consequently (1.4). The same argument works in the Gaussian case. We introduce in place of
[S)
and
[S'} n
in (A.3) where
as defined in the introduction and take Since
g
n
G'} n
and
G'} n
and
[g' } n
to be an independent copy of
Gn }
gn
=
gn + ig'n
for
Gn } [gn}
is rotationally invariant the exact same argument as above gives (1.3).
References [lJ
A. Bonami : Etude des coefficients de Fourier des fonctions de
LP(G) • Ann.
Inst. Fourier (Grenoble) 20 , p. 335-402 (1970). [2J
C. Borell ; Tail probabilities in Gauss space. Vector space measures and applications, Dublin 1978. Lecture Notes in Math. 644 , p. 71-82 , Springer (1979) •
72 [3J
C. Borell : On the integrability of Banach space valued Walsh polynomials. Seminaire de Probabilites XIII. Lecture Notes in Math. 721 , p. 1-3 , Springer (1979).
[4J
C. Borell: On polynomials chaos and integrability. Prob. and Math. Stat. 3 , p. 191-203 (1984).
[5J
X. Fernique : Regularite de fonctions aleatoires non gaussiennes. ECole d'ete de St-Flour 1981. Lecture Notes in Math. 976 , p. 1-74 , Springer (1983).
[6J
X. Fernique : Fonctions aleatoires gaussiennes
a
valeurs vectorielles.
Preprint (1985). [7J
S. Kwapien : Decoupling inequalities for polynomial chaos. Preprint (1985).
[8J
M.B. Marcus and G. Pisier : Random Fourier series with applications to harmonic analysis. Ann. Math. Studies 101 , Princeton Univ. Press (1981).
[9J
G. Pisier : Les inegalites de Khintchine-Kahane d'apres C. Borell. Seminaire sur la geometrie des espaces de Banach 1977-78, Ecole Polytechnique, Paris (1978).
[10J
G. Pisier : Sur l'espace de Banach des series de Fourier aleatoires presque surement continues. Scminairc sur la geometrie des espaces de Banach 1977-78, Ecole Polytechnique , Paris (1978).
[11J
G. Pisier : Some applications of the metric entropy condition to harmonic analysis. Banach spaces, harmonic analYsis and probability, Proceedings 1980-81. Lecture Notes in Math. 995 , p. 123-154 , Springer (1983).
[12J
M. Schreiber
Fermeture en probabilite des chaos de Wiener. C.R. Acad. Sci.
Paris, Serie A , 265 , p. 859-862 (1967). [13J
M. Talagrand : Regularite des processus gaussiens. C.R. Acad. Sci. Paris, Serie I , 301 , p. 379-381 (1985).
[14J
D. Varberg : Convergence of quadratic forms in independent random variables. Ann. Math. Statist. 37 , p. 567-576 (1966).
RATES OF CONVERGENCE IN THE CENTRAL LIMIT THEOREM FOR EMPIRICAL PROCESSES by Pascal MASSART * universite Paris-Sud U.A. CNRS 743 "Statistique Appliquee" Mathematiques, Bat. 425 91405 ORSAY (France)
SUMMARY
In this paper we study the uniform behavior of the empirical brownian bridge over families of functions
F bounded by a function
pendent with common distribution
pl.
F (the observations are inde-
Under some suitable entropy conditions which
were already used by Kol~inskii and Pollard, we prove exponential inequalities in the uniformly bounded case where
F is a constant (the classical Kiefer's inequality
(1961) is improved), as well as weak and strong invariance principles with rates of convergence in the case where
F belongs to
L2+o(p)
improve on DUdley, Philipp's results (1983) whenever
with
°E ]0,1]
(our results v
F is a Vapnik-Cervonenkis
class in the uniformly bounded case and are new in the unbounded case).
1.
Introduction
2.
Entropy and measurability
3.
Exponential bounds for the empirical brownian bridge
4.
Exponential bounds for the brownian bridge
5.
Weak invariance principles with speeds of convergence
6.
Strong invariance principles with speeds of convergence
APPENDIX
1. Proof of the lemma 3.1. 2. The distribution of the supremum of a d-dimensional parameter brownian bridge 3. Making an exponential bound explicit.
Invariance principles, empirical processes, gaussian processes, exponential bounds.
74
1. I NTRODUCT ION 1.1. GENERALITIES. Let
(X,X,P)
be a probability space and
(x n )n>1
be some sequence of indepen-
dent and identically distributed random variables with law enough probability space
P, defined on a
rich
(0,A,Pr).
1 n Pn stands for the empirical measure L Ox. and we choose to call empirical n i~1 1 brownian bridge relating to P the centered and normalized process vn =!n (p n-pl. Our purpose is to study the behavior of the empirical brownian bridge uniformly over F, where F is some subset of L2(P).
More precisely, we hope to generalize and sometimes to improve some classical results about the empirical distribution functions on lR d (here F is the collection of quadrants on lR d ), in the way opened by Vapnik, Cervonenkis and Dudley. In particular, the problem is to get bounds for: (1.1.1)
Pr (
Ilv n II
F > t)
, for any positive
where 11.11 F stands for the uniform norm over
t,
F and to build strong uniform approxi-
mations of vn by some regular gaussian process indexed by convergence, say (b n )
F with some speed of
First let us recall the main known results about the subject in the classical case described above.
We only submit here a succinct bibliography in order to allow an easy comparison with our results (for a more complete bibliography see [26]) Concerning the real case
(d~1)
, the results mentioned below do not depend on
P and are optimal :
(1.1.1) is bounded by
C exp(-2t 2),
to Dvoretsky, Kiefer and Wolfowitz [24] (C
where ~ 4/2
C is a universal constant according according to [17]).
75 1.2.2.
The strong invariance principle holds with
~ , according to Komlos,
bn
In
Major and Tusnady [37] . In the multidimensional case (d
~
2).
1.2.3.
C(E:) exp (-(2-E:)t 2 ),
(1.1.1) is bounded by
Kiefer [34].
In this expression
for any
(:
> 0,
according to
[ cannot be removed (see [35] but al so [28]).
1.2.4.
The strong invariance principle holds with
b n
n- 2(2d-1) Log(n),
according
to Borisov [8]. This result is not known to be optimal, besides it can be improved when uniformly distributed on 1.2. 5.
If
d
=
[0,1]d
In this case we have :
2 :
(Log(n)) 2 , according to
The strong invariance principle holds with
In
Tusnady [50]. 1.2.c. If
d
P is
>3
:
1
The strong invariance principle holds with
b n
=
n-
3
2Td+TI (Log(n))2, according
to Csorgo and Revesz [14] 1.2.5 and 1.2.6 are not known to be optimal. Let us note that even the asymptotic distribution of [I\inll F is not well known (the case where
d = 2 and
P
is the uniform distribution on
[0,1]2
is studied
in [12]). Now we describe the way which has already been used to extend the above results.
v
Vapnik and Cervonenkis introduce in [51] some classes of sets - which are generally called V.C.-classes - for which they prove a strong Glivenko-Cantelli law of large numbers and an
~xponential
bOLnd for (1.1.1)
76
P. Assouad studies these classes in detail and gives many examples in [3] (see also [40] for a table of examples). The functional P-Donsker classes (that is to say those uniformly over which some central limit theorem holds) were introduced and characterized for the first time by Dudley in [20] and were studied by Dudley himself in [21] and later by Pollard in [44]. Some sufficient (and sometimes necessary, see [27] in case bounded) conditions for
F is uniformly
F to be a P-Donsker class used in these works are some kind
of entropy conditions Conditions where functions are approximated from above and below (bracketing, see [20]) are used in case
F is a P-Donsker class whenever
restricted set restricted set of laws on
X (p
P belongs to some
is often absolutely continuous with
respect to the Lebesgue measure in the applications) whereas Koltinskii and Pollard's conditions are used in case
F is a P-Donsker class whenever
P belongs to some set
of laws including any finite support law (the V.C.-classes are - under some measurability assumptions - the classes of sets of this kind, see [21]). In our study we are interested in the latter kind of the above classes. Let us recall the already existing results in this particular direction . Whenever
• F is some V.C.-class and under some measurabil ity conditions, we have:
1.3.1.
(1.1.1) is bounded by
C(F,d exp(-(2-dt 2 ) for any
E
in ]0,1], according to
Alexander in [1] and more precisely by : in [2] * where
D stands for the integer density of F (from Assouad's terminology in [3]) .
1.3.2.
2
( 1. 1.1) is bounded by
*
(~ )) exp(-2t 2) , according to Devroye in [16].
Our result of the same kind (inequality 3.3.1°)a) in the present work) seems to have been announced earlier (in [41]) than K. Alexander's one.
77
1.3.3.
The strong invariance principle holds with
b n
=n
2700(D+1) , according to
Dudley and Philipp in [23]. Now let us describe the scope of our work more precisely.
2. ENTROPY AND MEASUKABILITY. From now on we assume the existence of a non-negative measurable function such that
If I ~
F,
for any
f
F
in F.
v
We use in this work Kolcinskii's entropy notion following Pollard [44] and the same measurability condition as Dudley in [21] .
Let us define Kolcinskii's entropy
notion. Let and 2.1.
p be in
p~p)(X)
[1,+oo[
.
A(X)
stands for the set of laws with finite support
for the set of the laws making
FP integrable.
DEFINITIONS. Let £ be in
]0,1[
and
Q be in
p~p) (X) .
N~P)(£,F,Q) stands for the maximal caroinality of a subset G of F for which:
holds for any
f,g
in G with
f~g
(such a maximal cardinality family is called an
sup N~P~(.,F,Q) QEA(X) L09(N~P)(.,F)) is called the (p)-entropy function of (F,F).
£-net of
(F,F)
relating to Q).
We set
N~P)(.,F)
=
The finite or infinite quantities : inf (s>O e(P)(F) = inf (s>O F
limsup £SN~P)(£,F) < oo} £->-0 limsup £SL09(HF(P)(£,F)) < oo} £->-0
are respectively called (p)-entropy dimension and (p)-entropy exponent of
(F,F).
78
Entropy computations.
We can compute the entropy of
F from that of a uniformly bounded family as
follows
{~1(F>0)' fE n , then
Let I
N~P)(.,F) < NIp) (. ,I) For, given so
£
O(F P)
EA(X)
0 in A(X), either O(F)=O and so N~P)(.,F,O)
1, or
O(F) > 0,
and then
NF(P)( .,F, 0) -- N(P)( 1 .,I, Some other
properties of the (p)-entropy are collected in [40].
The main examples of uniformly bounded classes with finite (p)-entropy dimension or exponent are described below.
According to Dudley [20] on the one hand and to Assouad [3] on the other we have
, whenever S [3]).
is some V.C.-class with real density
d
(this notion can be found in
Concerning V.C.-classes of functions, an analogous computation and its appli-
cations are given in [45] • See also [21] for a converse.
Let d be an integer and
k
D
a
be some positive real number.
We write
S for the greatest integer strictly less than
Whenever
x belongs to
Rd and
for the differential operator
k
to
J1d ,
I kl
a.
stands for
d Ik I
k
k +· •• +k d
1
and
k
ax, 1., .ax dd Let 11.11 be some norm on Let
Rd •
a,d be the fami ly of the restrictions to the unit cube of s-differentiable functions f such that: f\
R
d of the
79
IDkf(x) I
max sup Ik 1~13 xERd
+
max su p.JDkf(x)-Dkf(y)1 - <1 Ik I=13 Xfy II x-y II a-13 -
Then, according to [36] on the one hand and using Dudley's arguments in [19] on the other, it is easy to see that: d
a
Measurability considerations.
• Durst and Dudley give in [21] an example of a V.C.-class S such that
So some measurability condition is needed to get any of the results we have in view. So from now on we assume the following measurability condition (which is due to Dudley [21]) to be fulfilled:
(M)
.(X,X)
is a Suslin space
There exists some auxiliary Suslin space from V onto F such that (x,y) ->- T(y) (x) and we say that
(V,Y)
and some mapping
T
is measurable on (XxX, X III Y )
F is image admissible Suslin via (V,T) •
This assumption is
esse~tially
used through one measurable selection theorem
which is due to Sion [47] (more about Suslin spaces is given in [13]).
2.4. THEOREM. ------Let on
X.
H be some measurable subset of Then
XxX.
We write
A for its projection
A is universally measurable and there exists a universally measu-
rable mapping from A to
V whose graph is included in
H.
A trajectory space for brownian bridges.
We set : l~(F) = {h:
We consider
l~(F)
F->-lR ; hoT
is bounded and measurable on
as a measurable space equipped with the
(V,Y)}. a-field generated
80
by the open balls relating to 11.11 a-field because
(which is generally distinct from the Borel F
l~(F)
is not separable).
This trajectory space does not depend on
P any more (as it was the case in
[20]) but only on the measurable representation
(Y,T)
of F .
From now on for convenience we set : /'-
( \I , A, Pr) "-
where
/'0
OO
(X°"x [0, 1],X
III
B([ 0, 1] ), P
00
III
Ie )
stands for the Lebesgue measure on [0,1], B([O,1]) for the Borel a-field /\-. /\,. oo on [0,1] and (Xoo,Xoo,p ) for the completed probability space of the countable prooo duct (Xoo,Xoo,p ) of copies of (X,X,p) • Ie
The following theorem points out how
l~(F)
is convenient as a trajectory
space.
n
For any a in
L i=1
a.a 1
Xi
is measurable from
\I
to
l~(F).
Moreover, setting Ub(F) = {h : F ~ R ; h is uniformly continuous and bounded on
(F,pp)}'
Ub(F)
is included in
l~(F).
Provided that
(F,pp)
is totally boun-
ded this inclusion is measurable.
L2(P)
Where 2
ap
is given the distance
Pp : (f,g)~ap(f-g), with For a proof of 2.5. see [21] (sec. 9) and [40] where it is
f~ p(f2)_(p(f))2
also shown that many reasonable families (in particular A d and the "geometrical" a,
V.C.-classes) fulfill (M) . 2.6. -----REMARK.
Since whenever
F fulfills (M) it follows from [21] (sec. 12) that IIPn-PII F-+-O a.s.
N~1)(. ,F) < sup
QEP~2) (X)
00
and therefore:
N~2) (E,F,Q) ~ N~2) (~,F) for any E in ]0,1 [ .
This implies that the local behavior of the entropy function is unchanged when taking the sup in 2.1. over the set of any reasonable law.
81
3. EXPONENTIAL BOUNDS FOR THE EMPIRICAL BROWNIAN BRIDGE We assume in this section that for some constants any
f
in F;
we set
U = v-u
and
u and
v, u < f < v for
F-u = {f-u, fEn
The fo 11 owi ng entropy conditions are considered : a) b)
d(2) (F-u) < U (2 ) e U (F-u) < 2 00
Using a single method we build upper bounds for (1.1.1) that are effective in the following two situations 1°) Observe that F.
nothing more is known about the variance over
In this case we prove some inequalities which are analogous to Hoeffding's ine-
qual ity [30]. 2°) We assume that This time our inequalities are analogous to Bernstein's inequality (see Bennett [5])
We randomize from a sample which size is equal to
rJ=mn
I n Po 11 a rd 's [44],
.;
Dudley's [20] or Vapnik and Cervonenkis [51] symetrization technics,
m=2
but here,
following an idea from Devroye [16] ,we choose a large m Effecting the change of central law : P->- PN with the help of a Paul Levy's 'V 'V type inequality, we may study P -P P -P where Pn stands for the instead of n N n randomized empirical measure. Choosing some sequence of - measurably selected
- nets relating to
PN whose mesh decreases to zero and controlling the errors committed by passing from a net to another via some one dimensional exponential bounds, we can evaluate, conditionally 'V
to
PN , the quantity
82
Randomization
Setting from [1.n]
N=nm (m into
[1,N]
is an integer), let w be some random one-to-one mapping whose distribution is uniform (the "sample w is drawn without
replacement"). The inequalities in the next two lemmas are fundamental for what follows 3.1- LEMMA. For any
in
I;
N
lR N , we set SN
L
UN = (max (1;.)) - (min (I;i)); 1
pos it i ve c ,lower bounds for- Log 1°)
2°)
n
Sn
L
CN
2
oN = and
I;i'
i =1
'V
(~
N L I;D i=1
i =1
(~
~(i )'
N L
i =1
2
I;i)
the followi ng three quantities are, for any 'V
Pr
(l~n -
2nc 2
7"N
nc 2
3°) These bounds only depend on
!;
through numerical parameters
(UN,oN)'
Bound 3°) is new; concerning 1°) (due to Hoeffding [30]) , Serfling's bound is better (see [46]) but brings no more efficiency when m is large. The proof of lemma 3.1. is given in the appendix. From now we write
for the randomized empirical process
1
n
L 8 . n i =1 xw( i )
The
inequality allowing us to study the randomized process rather than the initial one is the following:
83
'V
The random elements
lip n-PN II F and
110~IIF ~
Besides, whenever
2 P
E
p2 , the following holds: 'V
(1 - 2 ' 2 ) a En' for any positive
lip n-P II F are measurable.
> c)
Pr ( Ilpn-pIIF
and any
a
in
~ Pr ( IIPn-PNII F
]0,1 [ ,where
n'
n' > (1-a) N
c)
N-n .
=
For a proof of this lemma see [16] using Dudley's measurability arguments in [21] (sec. 12) .
Statement of the results. 3.3. THEOREM. ------The following quantities are, for any positive
1°)
a)
d(2) (F-u) U
if
+ 0n,F( 1) ( 1 b)
e(2) (F-u) U
if
= 1;
2
!...) 3 (d+n) u2
2°)
= s(~:2)
<2
if
d~2)
) exp (-2
increases from 0 to 2 so does
110~11
Suppose that a)
~
(when
~
0
(F-u)
=
F
o (1) n,F
2
2
2)
U
t k+r]
k
t
exp (-2
0n,F(1) exp (On,F(1) (0) where
and n , upper bounds for
,
2d
=
t
,
with 0
< U,
t2
u2 )
k).
then
2d , 2 (~f ~ (d) +n ( 1+ t- ) 3(d +n ) exp ( U 02 \
t2 + ..!:!..(3U+t)))
!n
b)
on,rr(1) where
exp(O
if
e0 2 )
(F-u)
=
s <2
,
r(1) (~)-~-n (!)2p-~+n + 5(!)2p+n) exp ~
n,r
_ 2~(4-d p - 4+d4-d
U
(when
0
~
0
\
2(i
increases from 0 to 2 so does 2p) •
84
The constants appearing in these bounds depend on (2 )
NU (.,F-u)
and of course on
F only through
n.
Comments.
d;2)(F) <
From section 2.2., the assumption
00
is typically fulfilled whenever
•
F is some V.C.-class with real density d. bound 1 a) is sharper than those of 1.3.1. , in another connection O(F,n) t 2nd in 1 a) is specified in the appendix.
Thus
0
)
the factor
0 )
In the classical case (i . e. bound 10
)
F is the collection of quadrants on
lR d ) ,
a) improves on 1.2.3. but is less sharp than 1.2.1. in the real case
moreover the optimality of 1°) a) is discussed in the aopendix where we prove that d-1 lim Pr (11\lnll F > t) ~ 2 ~ n->-oo i=O Suppose that F = Aa,d
then, from section 2.3. we have
respects, Bakhvalov proves in [4] that if on
[0,1]d
(2t2) i -i-!e;2)(F)
~ . In other
P stands for the uniform distribution
then: 1
Ilv n II F ~ C n2 -
a
a
surely
Thus we cannot get any inequality of the 1°) or 2°) type in the situation where eF)(F)
>2
•
The border line case ,
For any modulus of continuity ¢ , we can introduce a family of functions A¢,d in the same way as Aa,d by changing u ->- ua into ¢ and defining S as the greatest integer for which ¢(u) u-S->-O holds. u->-O It is an easy exercise, using Bakhvalov's method, to show that d
Ilv n II
> C (Log(n))Y
Aa,d -
provided that ¢(u) = u2 (log(u- 1 ))Y and P is uniformly distributed on [0,1]d 2 ) (A¢,d) = 2 and we cannot get bounds such as in theorem 3.3. Of course
ei
(*)
so, there is a gap for the degree of the polynomial factor in the bound 3.3.2°)a) between 2(d-1) and 6(d+n) •
85
But the above result is rather rough and we want to go further in the analysis of the families
Aep,1
around the border line.
Then the (2)-entropy plays the same role for
A
CI.,
concerning the Donsker
1
property as the metric entropy in a Hilbert space for the Hilbert ellipsoids concerhold~
ning the pregaussian property, that is to say that the following
:
1
is a functional P-Donsker class whenever
1
J (L09(N~2)
(E,Aep, 1)))2 dE <
00
•
o
is not a functional A-Donsker class whenever
(i)
ep(u) ~ (
1
(*)
u )2 !Log(u)[
follows from Pollard's central limit theorem in [44].
(ii) follows from a result of Kahane's in [32] about Rademacher trigonometric series.
ep ( u) = J ILo~ {u}I ' we have from [32] p. 66 that : En en(t) t + L belongs to Aep, 1 with some probability PK + 1, where n>1 KnLog{n} K->= is a-Rademacher sequence and en(t) = /2 cos (21Tnt) . In fact, if we set
Let us consider a standard Wiener process on
So that, with probability more than
(En)
L2 ([0,1]), we may write
PK ' the following holds 1
IIWII A ~ ~. L Iw(e n ) I nLog{n) ep, 1 n~ 1 By the three series theorem the series L IW(e n ) I nLog{n) almost surely and therefore W is almost surely unbounded on The same property holds for any brownian bridge some Wiener process provided that of
G.
So
Aep,1
W(1)
is some
G for
N(0,1)
f+G(f)
+
fW(1)
is
random variable independent
is not pregaussian and (ii) is proved.
(*) We write f ~ g ,when 0 < lim
diverges to infinity
1 (fg- 1 ) < lim (fg- ) <
00
86 /IAn upper bound in situation 2°) is also an oscillation conti'cl".
If we set
Go
=
{f-g
0 (f-g)
<0
n ,
, f,gE'
it is not difficult to see that:
N~~) (. ,Go+U) ~ ( NQ 2) (. ,F-u))2 U into
thus changing
situation 2°) hold with
2U
and
Go
d into
2d
instead of F,
if necessary the upper bounds in the constants being independent of
0
because of 3.4•. In particular if
•
F is a V.C.-class with real density d, we set A(o,n,t) = Pr ( Ilvnll Go
> t)
At it is summarized in [231 Dudley shows in [20] that r is small enough, 0 = 0 (ILO~(t)~ and n > O(t- ) with r
A(o,n,t) < t
whenever t
>8
Applying 3.3.2°) a) improves on this evaluation for then t A(o,n,t) ~ t whenever t is small enough, 0 = O( ILog(t) I ) and
n
~
0 ((
ILo~lt)(4)
In order to specify in what way the constant in
bound 2°)a) depends on
F,
we indicate the following variant of 3.3.2°)a) • 3.5. PROPOSITION. ----------NQ2) (E,F-U) ~ C (co d- 2d for any E in ]0,1[ and some Ilo~11 F ~ with 0 not exceeding U, then there exists some
If we assume that EO
in
]0,1 [that
i
E1 in ]0,1 [ depending only on that : Pr ( Ilvnll F > t)
~
EO
and
a constant
K depending only on
C such
2 2 K E- d (!:!U)-4d (1 + t 2 )14d exp ( t ) 1 o - 2(i + U(3U+t))
!n
From now on
L stands for the function
x-> Log(xve).
3.6. COROLLARY. --------Let (On)
• (F n ) be some sequence of V.C.-classes fulfillin9(M) with entire densities Then (with the above notations)
t ; whenever
o~
=
o( 1/(D nL(D rl )))
and
Pr ( Ilv II G n on 0~2 = 0 (In) •
> t)
-+
n+oo
0 for any positive
87
(Provided that
Dn
0({6) , such a choice of a
Ln
n does exist).
Comment.
According to Le Cam [38] (Lemma 2) and applying 3.6. the process admits finite dimensional approximations whenever
Dn =
(Lo;~n))
0
{vn(f), fE Fn} and provided
that Le Cam's assumption (Al) is fulfilled. This result improves on Le Cam's corollary of proposition 3 where
Dn = O(n- Y)
1 for some Y < 2 is needed.
Proof of 3.6.
• F be a V.C.-class with entire density
Let
D and real density
Using Dudley's proof in [20] (more deta i 1s are given in [40])
d .
it is easy to
show that, for any w > d (or w -> d if d is "achieved"), we have r~ (2) (E, F)
< K1+(1/2I Lo gEI)
1
for any
exp (2w) (1 + 21 LOgE I) w E-2w 3 D K = 2D! (2D)
E in ] 0, H , with in particular when w=D ,
So from Stirling's formula we get N;2) (E,F) tant
C . 1
~ C~ e 5D 23D E- 4D for any E in
Hence, for any
E in
] 0,
1.] 12
]0,1 [ we have
and some universal cons-
NF) (E,n ~ C~ (2e)5D E- 4D thus, applying 3.5. to the class We propose below
G yields On
another variant of
3.6. inequal ity 3.3.2°)a), providing an
alternative proof of a classical result about the estimation of densities. 3.7. PROPOSITION.
-----------
If we assume that some positive bound of
2 d0 )(F-U) = 2d <
00
and
Ilo~11 F ~
V , then there exists some positive constant
Pr ( Ilvnll F
°V,F,n(1~
> t)
(n(1 +
is, for any positive
:~))3(d+T1I;
i
with
3.3.2°)a) •
In - -
for
C such that an upper
t, given by :
n (O)_4(d+ ) exp (- 2
\i
+ ..t(CLLn (..!:!.. + o)+t)))
;;; In the situation where
uv
;;;
U is large this inequality may be more efficient than
88
Application to the estimation of densities : minimax risk.
Let
lR k ;
KM be the fo 11 owi ng kernel on
KM(y) = l/!(y'M y ) for any y where I/! and
in
lR
k
is some continuous function with bounded variation from
M is some
lR into
kxk matrix.
Pollard shows in [45] that the class 2
K~{KM(.-x), MElR k
,xElR k }
• is a V.C.-class of functions and so : N1(2) k,K)
~
CE -w
for any
E in
]0,1 [
where
C and w depend only on
k.
Now if we assume that P is absolutely continuous with respect to the Lebesgue measure on lR k , the classical kernel estimator of its density f is :
fn(X) = h- k Pn (K(·h x)) where
K is a KM with fixed
I/!
and
M so that
I
K2 (x)dx < 00
Proposition 3.7. gives a control of the random expression k xElR},o
So, if we assume that
for any E
tin
[1 + oo[
C and
(Jhk Dn) ~ T + O(n a ) T6 exp
and 6 .
[1,
+ 00[,
provided that
We choose T = O(/Cn) , thus:
Dn
nh k > 4 62 •
= sup lfn(x)- f(x)] : x
Hence, after an integration:
- --,;-2---'-T2......--~T~ 2(C + 0 (L_L_n_) + _ )
Jhk
for any T in
-f by choosing;
n
U = h- k/ 2 where C2 > llfll oo IK 2 (X)dX.
1l > h- k > C2 , we get, setting Ln -
and some positive a
f
•
Jhk
89
Provided that
f
belongs to some subset of regular functions 8 , the bias
f-f can be evaluated so that the minimax risk associated to the uniform distance on R k and to 8 can be controlled with the same speed of convergence as
expression
in [29] , via an appropriate choice of
h .
3.8. SKETCHES OF PROOFS OF 3.3., 3.5., 3.7. (More details are given in [42]) . First, by studying the class G u=o
and
Let us proof theorem 3.3 ..
v~1.
parameters such as and positive
a, and
S,I3
We set :
I.l
( = -t
Pr(N)r.)
All along the proof we need to introduce
(in ]0,1 [) ; r, m (in }j) ; a (in ]1,+oo[); q (in ]0,2])
In
We write
(' =
and
Pr (1lvnll > t)
'V
Pr ( II Pn - PNII > E')
(1 -
1)
m
(1-a)
(
.
for the probability distribution conditional on
and 11·11 instead of 11·11 F
A bound for
instead of F , we may assume that
which are all chosen in due time.
y
N = mn
(x 1 , .. • ,x N)
f-u fE F} {iJ'
~
for short.
will follow, via 3.2., from a bound for
which is at first performed conditionally on
(x 1 ,··· ,x N)
The chain argument.
Let
be a positive sequence decreasing to zero.
For each integer of 2.4).
j
a ,.-net J
A projection '1 j 2 2 holds.
PN( ( 11 j f- f) )
i
F.
J
can be measurably selected (with the help
F.
F onto
may be defined from
J
so that
'j
Then 'V
'V
II (p n - PN) So, if
0
(Id -
'1
r )11 i
L j~r+1
II (p n - PN)
is a positive series such that
(nj)
L
j>r+1
o
(,1. - 11· 1) II JJ
n·J < I.l
'V
Pr(N) ( IIPn-PNII where
A and A B
B are the
Nr IIPr L j~r+1
(N)
> (') < A + B
(x 1 , ... ,xN)-measurable va ri ab1es 'V
(I(Pn-P N) a 11rl ? (N) 'V Nj IIPr (I(Pn-P N)
> (hI) (')11 0
,
(,1r 1f j_1)I > nj c:' ) II
we get
90
A is the principal part of the above bound and
B is the sum of the error
terms. Inequalities 1°) or 2°) of Lemma 3.1. are needed to control
A according to
whether case 1°) or 2°) is investigated. B , giving
Bound 3°) in Lemma 3.1. is used to control B
<2
L j~r+1
nj = (j_1)-a and
N~ exp (J
\
,2 2 n En. \ J
4m
:5.8. 1.
T~ 1)
r
L n· < ~ holds whenever j>r+1 J the control of the tail of series 3.8.1. is performed via the following
Choosing a > 2),
r
=2
(so
+
elementary lemma 3.8.2. Lemma.
Let lji : [r, +00[
->-
lR.
Provided that
~I
is an increasing convex function, the
following inequality holds: exp (- lji(j)) ~ ~ exp (-lji(r)) d stands for the right-derivative of lji L
j~r+ 1
where
ljid
We choose S
= 1 under assumption a) and S
(~)
under assumption b) •
Proof of theorem 3.3. in case 1°). We choose
a
=
t- 2 ,
m = [t 2 J and
T. J
= Jl j-(a+S) and apply 3.1.1°), then In
A < 2 Nr e 10 exp (_2t 2 (1-2~)) Under assumption a).
Considering the type of inequality we are dealing N. ~ C t 2d }(a+1ld (instead of N. < C' t 2d ' }(a+1ld' J
J -
We c hoose,
"~ -- t- 2
an d a
=
with we may assume that for any
Max (2 , 1 + 3n 4d) ' so
A ~ 0n,F(1) (1+t 2 )3(d+ n ) exp (-2 t 2 ) and
d' > d) •
pIIIN_a.s.
91
P!liN -a.s. whenever
t 2 > 7+4d(a+l)
Now the above estimates are deterministic, so using
Lemma 3.2., theorem 3.3. is proved in situation 1°)a) • With the idea of proving proposition 3.5. note that, setting a method gives, under the hypothesi sin 3.5. , that
with
~
2 , the above
Pr( llvnll> t) is bounaed by:
K (E~1 t)4d (2+t 2 )12d exp (_ 2t 2) 1 K1 depending only on C, whenever t 2 > 7+12d.
Under assumption b).
We may suppose that We set
I.l =
N.
< exp
(C t1; jc;(a+S))
J -
t -2y , where 1;(1+2y(~))
~
n and for y ~ y( 1;) - 2 S > 1 to ho 1d, whe re
tion when a
~
+00
(Namely:
2( 1-y( 1;))
~
2( 1-y) , then we choose y( 1;)
k).
a 1arge enough
is the solution of the above equa-
So:
2 k A ~ 0n,F(1) exp (On,F(1) t +n ) exp (-2 t ) and B
whenever
n, F( 1)
0
eXfJ
(-2 t 2 )
t 2 ~ ~ + 5 + C t1; 22S+2 •
So theorem 3.3. is proved in case 1°). Proof of tneorem 3.3. in case 2°). We set
Q
~!
o
The variable
and choo'se m =<.:1 q, a ~ 2\fq ,
I.l
~ <.p-q and
,. J
2
0p •
In fact, let 8N be the
(x 1, •.• ,x N)-measurable event:
8N ~ {llo~ - o~11
where
> s}
o~(f) ~ PN(f2) - (P N(f))2 for any f in F .
Each terw of the following estimate is studied in the sequel
A'
E(AlI
In
A is this time controlled with the help of 3.1.2°), so now the
probZem is to repZace
where
= .Q.
c)
GN
and
BI
~
E (B 11 8 c) N
92 Bounding
is a r1'Oo7.e", of tUDe 1 U)
Pr (eN)
F2 = {f2, f E F}
' For, se tt lng
2
, we have
2
liON - opll~IIPN-PIIF2 + 21 PN-PII Since
N;2)
(.,F 2 ) ~
in 3.3.1°), so, choosing
N;2) s =
1
(~,F)
2 F
and
fulfills (M), we may use the bounds
2
we get: Pr (eN) ~ Co exp (- ~)
k, 12nm
The evaluation of A I and B I .
2
Ii0NII
A'
~
2
~ a +s
c
holds on
eN
thus applying 3.1.2°)
2 N exp (5 <.p2-q) exp (\ r
2 (a
2 ...:t:..._:--...,..,,-), t+ljJ1-q/2
2
+ (
gives whenever
))
In
Moreover
<2
B'
2 nE,2 LN. exp (- - 2 j~r+1 J 4 a
Now the proofs are completed as CJ. '"
Max (2,1 + 4d) 3n
large enough for
A'
and
1 - ~
2 -< p + nand S >
<2
E~2d
C
we choose
e S 0-2d
2s
)
in case 1°), choosing this time
under assumption a)
To prove proposition 3.5
(j-1l
q = (2-1;)(1 +
,,(~~r)
q=2
and
+ 1)-1 \\ritn
CJ.
to hold. CJ.
= 2, so
<.p2d (2+(jJ2)6d exp ( _ _----"t,-2--"1.".--,-,,\
\
2(i+(3+t)J'
In B whenever
<.p2
~
'
< 2 C2
-4d -4d 4d 2 12d ((<.p2_ 8 )2\ EO a <.p (2+<.p) exp \4 )
8 + 12d
Besides, using 3.8.3., we get: Pr (eN)
~
E
2 K 1
(~)-4d (~)4d
(2 +
2 2 12d ) exp (- ~)
~
2
whenever
~ ~ 7+12d,
which completes the proof of proposition 3.5
via Lemma 3.2.
Proof of proposition 3.7. We assume that
u=O
and
v=1.
Inequality 3.3.2°)a) may
be written
93
2
whenever
:L2 -> 5
IJ Defining the following sequences by induction 2 b. 1 44d+1 a. (2 5 +~ + 2 ) a·J+ 1 J nIJ IJ 2 (.1 + lb.) bj +1 J Iii Iii 3 with a ,,1 and b we call M. the following inequality: 0 o Iii J 2 -a t 2 a2 t2 (IJ 1 !_) :L > t) < Ka. t ) whenever exp Pr ( Ilv n II \ 2 2 -> 5 J \ 2(IJ 2+b. + -) IJ IJ J In The, assuming that same way as 3.3.2 C )a) Then inequalities J
1 + [~]
M. holds, it is possible to deduce M. 1 from M. by the J+ J J from 3.3.1°)a) (technical details are given in [42]) (M j ) hold by induction.
Using inequality MJ ' where and a few calculations yield proposition 3.7 . .
4. We assume that section still hold
P(F 2 ) <
EXPONENTIAL BOUNDS FOR THE BROWNIAN BRIDGE
00.
We want to show that the bounds in the preceding
for the brownian bridge.
4.1. THEOREM
If e~2)(F) < 2 , then there exists some version
Gp of a brownian bridge rela-
ting to
P whose trajectories are uniformly continuous and bounded on (F,pp) . (2) . 2 2 2 Moreover, setting l; ~ e (F) , 1f IIIJpl1 F i IJ i P(F ), an upper bound for F Pr ( IIGpII F > t) , is, for any positive t and 11 , Siven by
o
11,
F(1) exp (0
11,
or, if more precisely
F) IJ-l;-211 ( P(F 2 ))l;/2+ 11 (!)2p-l;+I1+ (!)2p+l1) exp (- t~) IJ IJ 2IJ 2d <
00,
by
4.1.1.
94
4.1.2.
where
p is defined in the statement of theorem 3.3.
Comments.
In the framework of theorem 4.1. the existence of a
regul ar
version of a
brownian bridge is an easy consequence of the proof of 4.1.1., but is of course a \-lell known result (see [18]).
Moreover the bounds in 4.1. 'ire in thi s case sharper
than the more general Fernique-Landau-Shepp inequality (see [25])
that
can be
written Pr (
IIGpII F > t) i
2
t C(a) exp (- - )
20 2
Proof of theorem 4.1. If
is countable
l'
The calculations are similar to those of the proof of theorem 3.3. but here of course a sequence of nets in
(1',1') relating to P is directly given.
Moreover
the following single inequality is used instead of Lemma 3.1. : 4.2. LEMMA:
Let
V be a real and centered gaussian random variable with variance
v2
then Pr
(IVI > s) i
s2
2 exp (- 2i) for any positive s.
The choice of parameters being the same as in the proof of 3.3.2°) (except 1·2
J
2 0 --2-
J.-2(a+s)) ,
P(1' )
d 4.1. 1. an d 4 .1 .2. are prove.
S'1nce 4 .1.1. 1S . a 1 so an
oscillation control, the almost sure regularity of Gp follows from Borel-Cantelli. The general case.
Since
(F,Pn) r
is separable
, the familiar extension principle may be used to
construct a regular version of brownian bridge on on a countable dense subset of F. this version.
l' from a regular version defined
Inequal ities 4.1.1. and 4.1.2. still hold for
95
Comment.
The optimality of bound 4.1.2. is discussed in the appendix. the polynomial factors are different in 3.3.1.2°)a)
The degrees of
and in 4.1.2. ; the reason is
that bound 3.1.3°) is less efficient than bound 4.2.
5. WEAK INVARIANCE PRINCIPLES WITH SPEEDS OF CONVERGENCE P(F 2+o) <
We assume from now that
00
for some
0 in
]0,1].
Using the results in sections 3 and 4, we can evaluate the oscillations of the empirical brownian bridge and of a regular version of the brownian bridge over F , so we can control the approximations of these processes by some
Ek-valued processes
(where Ek is a vector space with finite dimension k). The Prokhorov distance between the distributions of these two processes is estimated via an inequality from Dehling [15] allowing reasonable variations of
k with
n
Oscillations of the empirical brownian bridge over F .
n over F are controlled with the help of a truncation (the proof in this case is straightforl'.'ard ) on the one hand and of
The oscillations of from 3.3.2°)a)
V
a slight modification in the proof of 3.3.2°)b) (truncating twice) on the other hand. We
shall not give any proof of the following theorem (the reader
will find it
in [42]) .
~o
We set
=
P(F 2+o)
If we assume that
then an upper bound fJr Pr (1IvnIIF> t) given by a)
d~ 2)
If
7d
OF (1) n
(F) = 2d
t Sd
(0.1
(
exp \.-
<
00
,
si
whenever the following condition holds:
2 2 2 Ilapll F ~ a ~ P(F)
is, for any positive
t
with
such that
/ria t
~ 1,
/i~1,
2
96
5.1.1.
b)
+
(p
e (2) (F)
If
0(1)
F
lJ
-o/2
on
=
s <2
~-2ot-2+o
for any positive n
v
is defined in the statement of 3.3.) whenever 5.1.1. and the following hold n0/ 4i+ o
> 512
lJ
5.1.2.
o
Remark.
Note that Yukich in [54] also used KOlcinskii-Pollard entropy conditions to prove analogous results to theorems 3.3. and 5.1., but our estimates are sharper because of the use of randomization from a large sample
1S
described in section 3
Speed of convergence in the central limit theorem in finite dimension.
We recall below a result that is due to Dehling [151 (the first result in the same di rection is due to Yurinski i [53]). 5.3. THEOREM. ------Let
(Xi )1
be a sample of centered
lR k -valued random variables.
We write
Fn for the distribution of the normalized sum of these variables and G for the centered gaussian distribution whose covariance is that of X . 1 k Let 11.11 2 be an eucl idian pseudo-norm on lR and '12 be the Prokhorov distance that is associated to
11.11 2
If
E ( IIX111 ~+o)
= IJ
<00,
then
o
- 8 14 '12 (Fn,G) ~ K n k / 1J1/4 (1 + lL(n- o!2 k- 1 1J)1 1/ 2 ) where
K is a universal constant.
Weak invariance principles for the empirical brownian bridge.
In order to build some regular versions of brownian bridges with given projection
97
on a finite dimensional vector space (or further in section 6 on a countable product of such spaces), we need two lemmas. ~~~~~
5.3.
(Berkes, Philipp [6]) .
R1 ,R 2 ,R 3 be Polish spaces, and be some distributions respecti°2 °1 vely defined on R1xR 2 and R2xR 3 with common marginal on R2 . Then there exists a distributions on R1xR 2xR 3 whose marginals on R1xR 2 and R2xR 3 are respec° tively 01 and 02 Let
l~(F)
Remember that
is generally not separable.
The following lemma is fun-
damental to avoid this difficulty (see [23]) . The space 0 to be mentionned below is defined in Section 2. ~~~~~
5.4.
(Skorohod [48])
Let ginal is
R1,R 2 be Polish spaces and be some distribution on R1xR 2 with mar° q on R2 If V is a random variable from 0 to R2 whose distribution
q , then there exists a random variable
tribution of
(y , V)
is
y
from
0
to
R1 such that the dis-
°
Concerning our problems of construction the point in the sequel is that the distribution on
l~(F)
of a regular version of a brownian bridge is concentrated on
a separable space. Now we can state some weak invariance principles for the empirical brownian bridge with speeds of convergence. 5.5. NOTATIONS. --------From now y and [0,1] x lR+
and
B are positive functions that are respectively defined on
[0,2] y(x,y)
by: =
8 + 2~ (4+x)
and
where, as in the statement of theorem 3.3.
2(1-e(z)) B(z) = z(2-2p(z)+z) p(z)
2z(4-z) 4 + z(4-z)
98
5.6. THEOREM Under each of the following assumptions there exists some continuous version on such that Pr ( Ilv n _G(n) II F -> a n)<Sn P are defined hereunder (we reca 11 that FE L2+6 ( P) and that
of a brownian bridge relating to where
(an)
and
(Sn)
P, G(n) p
e~2) and d~2) are defined in Section 2) : a)
If
for any
d(2)(F)
=
F
2d <
< y( 6, d)
L
If NF(2) (E, F)
a' )
00
~ C
-1 d E-2d (LE) for any
E
in ]0 , 1 [
I; < 2
b)
for any
< S(I;)
L
and any positive
s.
Proof of theorem 5.6. Let of
a be an oscillation rate
F on a a-net
F(a)
We approximate Setting d into ted.
2d
Ga~
(de~endin~
relating to
~
F by vnoITa
a} , we may apply theorem 5.1. to Go
if necessary), hence the quantity
IIVn-VnoITaIIF~llvnIIG
Writing
a
1100
random variables
wh ere
B
for the Prokhorov distance associated to 11·11 F(a)
vn(a)
such that
~ 1l ( F 00
can be evalua-
Fn,a be the distribution of vn IF( a ) on the k-dimensional loo(F(a)) and G be the corresponding gaussian distribution.
Strassen's theorem [49] , there exists a probability space
loo(F(a))
(changing
a
Besides, let
vector space
on
be a projection
P
vn uniformly over {f-g , rp(f,g)
on n) and 10
n,a , Ga ) .
and
G(a)
(~'
and applying
,A' ,Pr')
with respective distributions
and two
Fn,a and
Ga
99
So, using lemma 5.3. , we may ensure the existence of some continuous version of a brownian bridge
Gp relating to
applying lemma 5.4. with rl
V: w->-vn!F(o) , we may assume that
pr( !lvn!F(ofGp!F(o) II F(o) ~ B)
with
G(o) = Gp!F(o)
p such that
Hence, noticing that
i
and then,
Gp is constructed on
B.
Ilv n IF(ofGpl F(o) I! F(o)
=
!! (v n-G p)oJ1 o !! F'
we get
Pr ( I!vn-Gpll F > 2t+B) i A+B+C where
A" Pr ( Ilvnll G a
> t)
and
C" Pr ( !IGpll G a
Theorem 5.2. is used to control k ~ N~2) (~,F)
noticing that
> t)
B (with II·!! F(o)
.
i
I!·!! 2 i Ik I!·!! F(o))
according to remark 2.6 . .
Moreover C is evaluated with the help of theorem 4.1., so the calculations are
co~pleted
via an appropriate choice of
t
and
o.
6. STRONG INVARIANCE PRINCIPLES WITH SPEEDS OF CONVERGENCE.
The method to deduce strong approximations from the preceding weak invariance principles is the one used in [43] to prove theorem 2 : the weak estimates are used locally, giving strong approximations with the help of maximal inequalities and via Borel-Cantelli lemma. Maximal inequalities.
As was noticed in [23], the proofs of the following inequalities may be deduced from the one given in [10] and in [32] . Notation.
We set
X.
J
6x. - P for any integer J
j
100
6.1.
~£~~~ (Ottaviani's inequality).
k L X. .
We set Sk" j
c ~ max k
where
=1
Then, for any positive a, the following inequality holds:
J
(1-c) Pr (max IIS k II F k a) .
> ra) < Pr ( IIS n II
F
> a)
More precisely, for symmetrical variables, the following sharper inequality is available 6.2.
~£~~~ (Paul Levy's inequality).
Let
(Yi)1
bles where
(B, 11.11)
is a normed vector space.
Pr (max Ilskll > a) k
trical then where
be independent and identically distributed B-valued random varia-
i
If we assume that
Y1 is symmeholds for any positive a ,
2 Pr (11Snll > a)
S = L y. k j=1 J
Strong approximations for the empirical brownian bridge. 6.3. THEOREM. -------
Under each of the following assumptions some sequence versions of brownian bridges relating to defined on a)
if
~
(Yj)j~1
P that are continuous on
of independent (F,pp)'
may be
such that : d2 )(F) = 2d < F
1
Iii
00
,
n
II L
j ~1
(x·-~·)II F J
a. s.
J
for any a < 2( 1Y(O,d) +y ( o-;crn (2) a' ) if, more precisely NF (c,F) < Cc- 2d (1+Ls- 1 )d for any c in ] 0 , 1[ , n 1 (X .-y.) II = O(n-y(o,d)/(2( 1+y(o,d))) ((Ln)(1/2)+d + (Ln)(5/4)+(d/2)))a.s. In II j~ 1 J J F
b)
if
e?) (F) = I; < 2 , 1 II n (X.-Y.)II
,Iii
j~1
J
J
F
"O(Ln-(6/ 2 ))
for any 6 < 6(1;) . Where
y(.,.)
and
s(.)
aredefinedin5.5.
a. s.
101
For a proof of 6.3., see [42]. Comments.
When passing from weak invariance principles to strong ones, the speeds of convergence are transformed as follows within our framework:
incase a).
in case b) Transformation (ii) appears in theorem 6.1. (under 6.3 .) from [23], it is not the case for transformation (i) in the same theot'em (under 6.4.). On the contrary transformation (i) is present in finite dimensional principles and appears to be optimal in that case: more precisely, the rate of weak convergence towards the gaussian distribution for 3-integrable variables is ran' 1S rang1ng . a bou t n- 1/ 6 (see gl'ng about n- 1/ 2 when th e ra t e 0 f s rong t convergence [39] for the upper bound and [9] for the lower bound) lin the real case Application to V.C.-classes.
Applying theorem 6.3. with 6 =1
in the case where
v
F is a V.C. -class wi th
real dens ity d, we get a speed of convergence towards the brownian bridge that is 1 O(n-O:) for any This improves on 1.3.3. but is less sharp that 1.2.4. 0: < 18+20d in the classical case of quadrants in lR d
Following an idea from Dudley in [21] (sec. 11), the study of the general empirical processes theoretically allows one to deduce some results about random walks in general Banach spaces. metric space (5,K)
As an application of this principle
and the space
ped with the uniform norm
11.11
C(5) •
00
equipped with the Lipschitz-norm:
Let
let us consider a compact
of real continuous functions on
5,
equip-
X be the space of Lipschitz-functions on
5
102
II· II L : x
->-
II x II + tfS sup 00
We write
N(E,S,K)
ds,t) > E for any
sft
Ix(t)-x(s) I (s t) K,
for the maximal cardinality of a subset in
R of S such that
R.
We may apply our results through the following choices: F=II.II L ·
F={oS,sES}and Then
(X, 1/.11
) 00
is a Suslin space (but is not Polish in general), so
fills (M) . Moreover, for any distribution Q( ( Os - 0t )2)
so
N~2)(.,F) ~ N(.,S,d . Besides
11.11
Therefore, considering a sequence tributed
00
= II.II
2 Q(F ) ,
.
F
(Xj)j~1
of independent and identically dis-
C(S)-valued random variables such that: IX1(s)-X1(t)I~Mds,t)
with
p~2)(X) we have:
Q in
~ K2(s , t)
F ful-
E(M 2+0) <
00
and
E(X~+O(to)) <
00
for one
,
for any s,t
to
in
in
S.
S, we can apply some 5.5. or
6.3. theorem to get speeds of convergence towards the gaussian distribution, whose structure depends on
N(.,S,K)
(the central limit theorem for such uniformly
Lipschitzian processes as above is due to Jain and Marcus in [31]) .
A P PEN D I X
First let us recall Hoeffding's lemma
(see (29]) .
Hoe[[ding's lemma.
Let
S be a centered and
[u,v] -valued random variable, then 2 2 E(exp(tSj) ~ exp (t (~-u)) , for any t in R •
We may assume that w is chosen as follows: drawing - with uniform distribution - a partition] = (J i )1
such that
IJil =m
103
tion -.
J. - with uniform distribu-
independently in each
. then, drawing an index w(i)
1
The following evaluations are conditional on ]
but the last bound will
not depend on J, 'V giving 3.1. 5
We set
Z
5
iln - IfN and we write
~
A for the logarithm of the conditional La-
place transform of Z Then setting
-
si
~
we have , for any
m
in
s
lR
A(s) then, since the logarithm is a concave function A(s) ~ n Log (~ where, writing
ON
s
N L
j =1
5N
exp (ri (Sj - W))) ~ n AN(~)
for the uniform distribution on
the logarithm of the Laplace transform under
ON
{S1""'SN}
of x
AN
~
x-EO (x).
5'
N
stands for
Therefore the
5'
Cramer-Chernoff transform of Z is larger than that of : - EO~n (:)
under
O~n
N
where
5'n stands for the sum of n i.i.d. random variables with common distribution Then, Hoeffding [29] and Bernstein
[5]
inequal ities yield 3.1.1°) and 3.1.2°)
In order to prove 3.1.3°) we may assume that 5N = 0 (otherwise changing 5
Sj
into
N
Sj - W) Then, applying Hoeffding's lemma to the conditionally centered random variables and setting
u. = min 1
l.(t) 1
jEJ
S. i
and
v. = max S. , we get: 1
J
jEJ. J 1
(v.-u.) LOgEJ(exp(t(s(.)-~.)))< 1 1 Xl 1 8
2
t
2
,foranytinlR.
Hence A(s)
2
1 . (~) < _s_ 1 n 2 8n
n
L
i =1
(v.-u.) 1
2
1
-
and therefore s2
2
A(s)~lfrlm0N
yielding 3.1.3°) via Markov's inequality.
s2 <2 4n
n L
L
i ~ 1 j EJ.
1
104
We use Goodman's work in [28] to give a lower bound of the probability for the supremum of a brownian bridge to cross a barrier. Notations.
of
We set I = [0,1] and write for any integer d, 1 for the element d d d lR Moreover, for any s in I , we set p(s) = sl' .. sd .
(1, ... ,1)
A.1. Theorem. Let
d be an integer and
Wd be some standard d-dimensional parameter Wiener
process, then, on the one hand Pr (sup Wd(s) < t IW d (l d ) sEI d -
(i)
for almost any real number hd(a, t)
=
1
+
a
=
at) ~ hd(a,t)
(in Lebesgue sense) and any positive
exp(2t 2(a-1))
t, where
d-1 . ( 2( ))i L (_1)1+1 2t . ~-1 I ( ) i=O 1. J-oo,1] a
and on the other hand : (i i )
Proof of theorem A.1. If d=2
the whole proof is contained in [28] .
Otherwise, proceeding exactly as in [28] yields the following inequality: Hd(a,t) = Pr(sup Wd(s) sEId where
o
~ t IWd (l d )=at) ~ J (1-exp(2t 2r)) dF t ,d_1 (a,dr) A.2. a-1
Ft ,d_1(a,r) = Pr(W d_1(s)-rtp(s) ~ t, VSEI d- 1 We want to proceed by induction.
It is enough to notice that:
A.3. Lemma.
for any integer
k, any positive a and almost every B in
lR (in Lebesgue sense).
105
Proof of A.3. W is a regular gaussian process, it is enough to show that the expeck tation and covariance functions of the processes Wk(.)+p(.) and Wk(.) are the Since
same conditionally to respectively W (l k )+a and k
Wk (l k) .
Since Wk is gaussian, E(Wk(s) IWk(lk'=y) and E(Wk(S)Wk(s' )IW k(l k )=y) respectively linear and quadratic functions of y, then the knowledge of
are
E(W~(lk) W~(s) W~(SI)) with l+m+n ~ 4, yields E(W k(S)IW k (l k )=y) = p(s)y , E(Wk(s)Wk(s')!Wk(\)=y) where
p(s)p(s')/ + (s"s') - p(s)p(s')
SAS'
A.3.
Let us return to the proof of A.l .(il Using lemma A.3., we get:
so
Then, integrating by parts, inequality A.2. becomes: 0 Hd(a,t) ~ Ft ,d_l(a,O) - lexp(2t 2r)F t d_l(a,r) la-I + 2t 2 fO exp(2t 2r)F t ,d_l(a,r) dr , a-I
hence Hd(a,t) ~ 2t 2
o
exp(2t 2r) hd_1(a-r,t) dr a-l But an easy calculation yields: Jr
O
2t 2 Jr exp(2t 2r) hd_1 (a-r,t) dr = hd(a,t) a-I Therefore A.l. (i) is proved by induction (it is shown in [33] p. 284 inequa 1ity A.1. (i) holds when d=l) . In order to proof (ii), we notice, following [7], p.84, that
Pr(W d E-/ 0 ~ Wd (l d ) converges weakly in
~£
)
C(I d ) towards the distribution of the brownian bridge
that
106
Wd - pt. )W d (1 d ) whenever
E converges to
°.
So, inequality (i) gives Pr (sup d SEI
Wd(s) - p(s)W d (1 d ) > t)
~
1 - hd(O,t)
therefore (ii) is proved. Comment. 'V
Theorem A.1. was proved by ourself (see [40] and [41]) but also by E. Cabana in [11] * In another connection, inequality A.1. (ii) ensures that some polynomial factor t 2h (d)
w,"th
h(d»d-1 _
b ds 331°)) canno tb e remove d", noun .. a an d412 ..•.
The calculations yielding 3.3.1°)a) are slightly modified here, where the entropy condition a) is replaced with a more explicit one. A.4. Theorem. If we assume that a'l
F is
[0,1]-valuedandthat -2 NF)(E,F) ~ K1+1/ Lo g ( E 1 (1+LOQ(E- 2 ))d E- 2d
then, an upper bound for
Pr (
Ilv n II
F > t) is, for any
for any t
in
E in
]O,n
[1,+00[, given by
4H(t) exp(13) exp(-2t 2 ) + 4H 2 (t) exp(-(t 2-5)(Lt)2) where
Proof of A.4. Lt 2 + 1 , then In the proof of 3.3. 1°)a) we choose a = ----2 LLt A ~ 2H(t) exp(13) exp(-2t 2 ) P!liN -a.s. B ~ 2H 2(t) exp(-(t 2-5)(Lt)2)
whenever
*
t
2
Thanks to
~
P!liN -a.s.
6+4d , yielding A.4. via lemma 3.2.
M. Wchebor and J. Leon for communicating this reference to us.
107
Comment.
Assumption a'l is typically fulfilled whenever case of
d
F
•
is a V.C.-class.
In that
may be the real density of F (if it is "achieved") or the integer density
F (see the proof of 3.6.).
REFERENCES
V-i.Me.~on,
[7]
ALEXANVER, K. (7982). Ph. V.
[2]
ALEXANVER, K. P~obab~y ~ne.qua£LtZ~ nO~ e.m~c.al p~oc.~~~ and a taw On Ue.M.te.d togMdhm. Anna£~ 06 Ptwbabildy (7984), vol. 72 ,N° 4,7047 -7 06 7.
[3]
ASSOUAV. P. Ve.lUUe. e.t
[4]
BAKHVALOV, N.S. On appMuma;te. c.a.£c.ula.t<.on 06 multipte. ~n.te.g~aU (~n RUM~n). V~tn~Q Mo~Q. Se.~. Maj". Me.R.h. MUOH. Uz. K~m. (7959) i, 3. 78.
[5 ]
BENNETT,
233-282.
G.
bt~.
~me.lU~on,Aml.
MaM. IlUt. hc.h. (7982).
lust.
FoU!U~, G~e.n(lbte. 33, 3 (7983)
P~obab~<-.ty ~ne.qua£LtZ~ 60~ ~~ Am~. Sta.t~t. A~Mc.. (7962),
J.
06
'ii,
~nde.pe.nde.n.t ,~ndom v~a
33-45.
[6]
BERKES, I., PHILIPP, W. App~ouma.t<-on the.o~e.~ 60~ ~nde.pe.nde.n.t and we.aRty de.pe.nde.n.t ~andom ve.C.to~6. Ann. P~obab~<-.ty (7979), i, 29-54.
[7]
BILLINGSLEY, P. Conve.~ge.nc.e.
[8]
BORISOV, I .S.
Ab~.tMc..t6
06
~n6e.~e.nc.e., Budap~t
06
pMbab~y me.MM~.
the. CoUoq~um on non (7980), 77-87.
Wile.y, Nw Yo~Q.
paJ1.OJrl~C. ~.ta.t<-stic.al
[9 ]
BREI MAN, L. On the. .:i:t::LU be.hav<'M On ~u~ 06 ~nde.pe.nde.n.t'Landom z. WaMc.hun. VVlW. Ge.b. (7967), ~, 20-25.
[ 70]
BREI MAN , L. P~o bab~y. Re.a~ng MM ~.
[ 77 ]
'" E. On the. CABANA,
06
.tMlU~tion de.lUay W~e.ne.~ p~oc.~~ wUh one. b~~.
Ad~ 0 n-W ~ te.y
a
v~abt~.
(7 968) •
multi~me.lU~onal pa.Mme..t~ P~ob. (7984), ~, 797-200.
J. Appt.
[ 72]
'" E., WSCHEBOR, M. The. CABANA, S~mon BoL<'VM.
[ 73]
COHN, V.L. (7980).
[74]
CSORGO, M. , REVESZ, P. A nw me.tllOd to p~ove. S~Hn type. ta.w~ 06 ~nv~anc.e. p~n~pte. II. Z. WaMc.hun. Ve.~. Ge.b. (7975), ~,
.two-p~me.te.~ B~ow~a.n b~dge..
Me.MU~e. the.o~y. BiAR.haUse.~, Bo~ton
Pub.
U~v.
(7980).
267-269.
[ 15]
VEHLING, H. umU the.o~e.~ nM ~~ on we.aRty de.pende.n.t Banac.h ~pac.e value.d ~andom v~abt~. Z. WaMc.hun. Ve.~. Ge.b. (7983), 397-432.
[76]
VEVROYE, L. BOUHd~ 6M the. u~tlotun dev {at~olU 06 J. of MUU.,'VM. ArlcU'.. (7982), ]1, 7~· 79.
[17]
INCHI HU, A unA 6Mm boand folt the. .tail p~ubab'{LUlf 06 Ko.fmogMov-SmiAnov ~ta.t{6tiC.~. The. Anna.e~ On Stctt~~.t.{,C.6 (7985), Vot.73,N°2, 827-826.
('Jl1:)~It.<.cc<£ me.a~M~.
108
[78]
DUDLEY, R. M. The. -6J..zrv., 06 c.ompac;t ,6[Lbuu 06 Hlibe.Jtt -6pac.e. and c.on:Unu1;ty 06 GauMJ..an pIWC.rv.,-6rv.,. J. Func.tiona£ AnalY-6J..-6 (7967), ~, 290-330.
[79]
DUDLEY, R.M. Me:tJUc. e.nvwpy 06 Mme. &aMrv., 06 uu wah dJ..66vce.n:Ual bounc1aJL,trv.,. J. AppM xirrii:Ltion The.OI1.y (7974), !.!!-' 227- 236.
[20]
DUDLEY, R.M. Ce.ntlW£ bliUy (7978),
umU: ~,
the.OI1.e.mil 60f1. e.mpJ..Jr.J..c.al me.MUf1.rv.,. Ann. PMbai (7979), 909-977.
p. 899-929 ; c.of1.f1.e.c.:Uon
MCLthl?Jllatic.~
[27]
DUDLEY, R.M. SaJ..nt FtoUf1. 7982 • Le.c.tuJ1.e. /l:ote.-6 J..n
[22]
• DUDLEY, R.M., DURST • EmpJ..Jr.J..c.al Pf1.oc.rv.,-6rv." VapnJ..R-Ce.f1.Vone.nw c1.M-6rv., and PoJ..-6Mn pf1.oc.rv.,-6rv.,. PMba. and Math. Stat. (WIWc1.CLw) 7, 709- 775 (7987).
n07097 •
-
[ 23]
DUD LEY, R. M., PH I LI PP, W. InvaJr.J..anc.e. p!1.J..IWJ..ptrv., 6011. -6 umil 0 6 Banac.h -6 pac.rv., value.d f1.andom e.tvne.nt-6 and e.mpJ..Jr.J..c.al pf1.oc.rv.,-6 rv.,. Z. WCLM c.hun. Ve.f1.W. Ge.b • (7 983 ), E, 509- 552 •
[24]
DVORETZKY, A., KIEFER, J.C., WOLFOWITZ, J. A-6ymptotic. mJ..nJ..max c.hCLlWc;te.f1. 06 the. Mmpte. fu.tJr.J..buUon 6unc.:Uon and 06 the. &M-6J..C.al muttinomJ..a£. rv.,.tJ..mCLtof1.. Ann. Math. Stat. (7956), ii, 642-669.
[25]
FERNIQUE,
.!.i,
x.
R(gut~(
de.
pf1.oc.rv.,-6~
ga~-6J..e.I1-6.
Inve.nt. MCLth. (7977),
304-320.
[26]
GAENSSLER, P., STUTE, W. EmpJ..Jr.J..c.al pf1.oc.rv.,-6rv., : a -6Uf1.Ve.y 06 f1.rv.,utv., 6011. J..nde.pe.nde.nt and J..de.n:Uc.illy fuWbute.d :random vaJr.J..abtrv.,. Ann. Pf1.oba. i, 793-243.
[27]
GINE, M.E., ZINN, J. On the. c.e.n.tf1.a£. UmU: the.of1.e.m 60f1. e.mpJ..Jr.J..c.al pf1.oc.rv.,-6rv.,. Annal-6 06 PMbabliUy (7984), vot. 72, nO 4, 929-989.
[ 28]
GOODMAN, V. DJ..-6.tJr.J..buUon rv.,.tJ..matJ..OI1-6 6011. 6unc.:Uonal-6 06 the. .two paJUmie.te.f1. WJ..e.nvc pf1.oc.e.~-6. Annal-6 06 PMbabliUy (7976), vot. 4, nO 6, 977-982.
[29]
HOEFFDING, W. PMbabliJ...ty J..ne.quaUtirv., 6011. Wmil 06 bounde.d Jtandom vaJr.J..abtrv.,. J. Amvc. S~t. A-6MC.. (7963), 58, p. 73-30.
[30]
IBRAGIMOV, I.A., KHASMINSKII, R.Z. On the. non-paf1.ame:tJUc. de.fM-<-ty rv.,:Umatrv.,. Zap. Nauc.ha. Se.mJ..n. LOMI 708, p. 73-87 (7987). In R~-6J..an.
[37]
JAIN, N., MARCUS, M.B. Ce.ntJr.at umU: the.of1.e.m 60f1. C(S)-value.d f1.andom vaJr.J..abtrv.,. J. Func.:Uona£ Ana£Y-6J..-6 (7975), .!i, p. 276-237.
[32]
KAHANE, J.P. Some. f1.andom -6e.!1.J..rv., 06 6unc.tiol1-6. Le.xington, MM-6. D.C. He.uth ( 7968) •
[33]
KARLIN, S. TAYLOR, H. M. A 6-<.Mt C.OUMe. J..n StOC.hMtiC. Pf1.oc.rv.,-6rv., (7971). Ac.ade.mJ..c. Pf1.rv., -6, Nw- YMR.
[34]
KIEFER, J.C. On ~ge. de.vJ..atJ..ol1-6 06 .the. e.mpJ..Jr.J..c.al d.6. 06 ve.c;tof1. c.hanc.e. vaJr.J..abtrv., and a taw 06 J...te.f1.ate.d tog~hm. Pac.J..6J..c. J. Math. (7967), ~, 649-660.
[35]
KIEFER, J.C., WOLFOWITZ, J. On the. de.vJ..atJ..ol1-6 06 the. e.mpJ..Jr.J..c. dJ..-6.tJr.J..buUon 6unc.:Uon 06 ve.c;tof1. c.hanc.e. vaJr.J..abtrv.,. Tf1.a11-6. Amvc. MCLth. Soc.. (7958), §2, p. 773-786.
109
[36]
KOLMOGOROV, A.N., TIKHOMIROV, V.M. E-~nthopy and E-~apac{ty 06 ~~ ~n 6UYlwonai ~pa~eA. AmeA. Math. So~. T!W.~t. (SeA. 2) (7967), }2,
277-364. [37]
KOMLOS, J., MAJOR, P., TUSNAVY, G. An appJWumailon 06 pCVttiai ~UJ11f.> 06 J.nd~p~nd~n-t RV'~ and .th~ ~ampt~ VF. 1. Z. WaM~hun. V~JrW. G~b.
( 7975), [38]
l!,
p. 71 7- 737 •
(7983). A f1.~ma.JtQ on ~mp~~ai m~MWleA. In A FeAMd'LU;\.t 6011. E!1.J.~h L. L~hmann J.n Honof1. 06 hL6 SJ.x.ty-H6.th BJ.f1..thday (7983), 305-327
LE CAM, L.
Wa~wof1..th, Be£mon-t,
[39]
C~60f1.nJ.a.
MAJOR, P. Th~ appJWumailon 06 pCVttiai ~UJ11f.> 06 J.nd~p~nd~n-t RV'~. Z. WaM~hun. v~.
G~b.
(7976),12,273-220.
[40]
MASSART, P. VJ..teA~~ d~ ~onveAg~n~~ ~ t~ ~L(Of1.(m~ d~ ta ~mJ..t~ ~~JU:f1.ai~ POWl t~ pf1.0~eA~M ~mp~qu~. Thu~ d~ 3~ ~yct~ nO 3545 d~ t'UnJ.v~U( d~ P~-Sud (7983).
[ 47 ]
MASSART, P.
VJ..teA~eA
d~ ~onveAg~n~~
deA pf1.0~eA~M ~mp~queA. S(!1.J.~ I, 937-940.
d~ t~ .th(of1.(m~
No.t~ aux
~~JU:f1.ai ~mJ..t~ pOWl C. R. A. S., .t. 296 (20 j~n 7983)
[42]
MASSART, P. RateA 06 ~onV~f1.9~n~~ J.n .th~ ~~nthai limU .th~OI1.em tJOI1. emp~~ai pf1.0~eA~eA (Apf1.J.,t 7985),~ubmJ..t.t~d .to Ann. I~.t. H~~[ PoJ.n~a.Jt~.
[43]
PHILIPP, W. Atmo~.t ~Wl~ J.nva!1.J.an~~ p!1.J.n~pteA va!1.J.abteA. L~UWl~ No.teA ~n Math~mailCJ.,
[44]
POLLARD, V. A c~JU:f1.ai ~mJ..t.th~OI1.~m 6011. ~mp~~ai pJWCeA~eA. Math. So~. S~f1.. A (7982), 235-248.
[45]
POLLARV, V. RateA 06 ~thong unJ.60f1.m ~onv~f1.g~n~~
[46]
SERFLING, R.J. PJWbab~y J.n~qu~eA 60f1. .th~ ~um J.n ~amp~ng wUhou.t f1.~pta~~m~n-t. Ann. S.tat. (7974), vot. 2, nO 7, 39- 48.
[47]
SION, M. On lLYU60~zailon 06 ~~ J.n .topotogJ.cai ~pa~eA. T!W.~. AmeA. Math. So~. (7960), 237-245.
[48]
SKOROHOV, A. V. Th~of1.Y Pf1.ob. Aprt.
[49]
STRASSEN, V. Th~ ~x{J.,.t~nc~ 06 pf1.obab~y m~MWleA wJ..th 9,[v~n ma.Jt9~naL~. Ann. Math. S.ta.t. (7965), ~, 423-439.
[50]
TUSNAVY,
[ 57]
VAPNIK, V.N. CERVONENKIS, A.Y. On .th~ unJ.60f1.m ~onV~f1.g~n~~ 06 f1.~V~ 6f1.~qu~n~eA 06 ~v~nM .to .theiA pf1.obab~eA. Th~of1.. PJWb. Appt.
60f1. ~UJ11f.> 06 B-vaiu~d f1.andom 709, 771-793.
ii,
J. AM~an
(7982). Pf1.~p!1.J.n-t.
J..£,
(7976),
Q,
628-632.
G. A f1.~ma.JtQ on .th~ appf1.oumailon 06 .th~ ~ampt~ VF ~n .th~ mut.tJ.~m~~,[onai ~~. P~!1.J.o~~a Math. Hung. (7977), 53-55.
!'
.
~
(7971), ]£,264-28 • [52]
YURINSKII, V. V. A ~mooiling J.n~quaLUy 6011. eAtimateA 06 .th~ Le. v y-Pf1.ohoJWv ~.tan~~. Th~of1.Y Pf1.ob. Appt. (7975), !Q, 7-70.
[53]
YURINSKII, V.V. On .th~ eAf1.Of1. 06 .th~ gaM~~an appf1.oxJ.mation 60f1. ~onvotu.t,[o~. Th~OI1.. PJWb. App!. (7977), ~, 236-247.
[54]
YUKICH, J. E. UnJ.60f1.m ~xpon~ntiai boun~ 6011. .th~ nof1.m~z~d empJ.f1.J.~ai pf1.0~eA~
I 7985) •
Pf1.~p!1.J.n-t.
MEAN SQUARE CONVERGENCE OF WEAK MARTINGALES
Mariola B. Schwarz Institut
fur Mathematische Statistik
Universitat Gottingen Lotzestr. 13, D-3400 Gottingen
[4J
In
it was shown that the mean square convergence of vector-valued martin-
gales in spaces of Rademacher type or cotype 2 is closely related to the fOllowing property of a Banach space valued martingale Radon measure space
Yf
on
(B,6)
the Borel
(B
f = (f ) : there exists a Gaussian n
cr-algebra of subsets of the Banach
B) such that
(*)
Ilx*fl!;=J
Ix*(x)1
2
B
Y (dx) f
x * E B*
for every
We give a characterization of the class of martingales satisfying not containing
in 00
"n
I: IIS(e*f)11 en' where n=l n 2 is the standard square function.
(e )
for spaces
(e ) in means of
uniformly and having an unconditional basis
convergence of the series
(*) n
is the dual basis and
Furthermore we give necessary (resp. sufficient) conditions for the
L -convergence
2
2 (resp. cotype 2).
of martingales in spaces of type
2
These conditions characterize Banach spaces of Rademacher type or cotype Throughout, Borel
B
A family
[f
n
*
denotes a separable Banach space,
cr-algebra of subsets of
[J n
respect to the filtration J
and
n
Band
(O,J,P)
B
the dual space,
B
the
a probability space.
B-valued random variables forms a weak martingale with
1 n E N} of
Pettis integrable On
S
\ n
E N}
if for evecy
n
E
N
for
(Pettis-)
A
B-valued weak martingale
[f 1 n E N}
M
such that the inequality
1 / f 1 ,; M Ilx"11
f
n
J
is
n
measurable,
k,; n (see e. g.
[ 3 J) •
is uniformly bounded if there is a constant
n
hOlds a.e. for every
x* E B*
and every
n EN.
Note that i f
x* f
=
[x* f n
f = [f 1 n
n
In
E N}
In the following
LEMMA 1. Let
is a
B-valued (weak) martingale then
is a real martingale for each
f = [f
with difference sequence (en)nE N
basis. Assume that
E N}
B
n
1 n E N} [d
n
is a
* x
E
*
x f=
~':
B
B-valued uniformly bounded weak martingale
1 n E N} •
be an unconditional basis in does not contain
in 00
Then the following conditions are equivalent
B
uniformly.
and
(e~)nE N
the dual
111
I:IIS(e* f)11
e is convergent in B 2 n b) there exists a Gaussian measure Y on (B,B) f 2 Ilx*fll~ = Ix*(x)1 Yf(dx) • a) the series
n
such that for each
x
*
E
*
B
J
B
x* f
Proof.
= (x* f )
is for each
n
Thus the square function
S
*
Let the functional
x
* E B*
*
of
x f
x* d
a real martingale with increments
n
is given by
X B* - R be given by
T : B
* * * x ,y E B T
*
is well defined everywhere in the Cartesian product
B
X B*
since
f
is of
* *
weak second order. Furthermore, T is positive (i.e., T(x ,x ) ~ 0 for every * * d ° (0 * * * * * * E B* ). x E B) an symmetr~c ~.e., T(x ,y) T(y ,x) for all X,y Since
f
is uniformly bounded we have
T(x*,/) ,;; (E I: x*(\)2)1/2 (E L: /(\)2)1/2 k
k
* 2 IIS(y* 011 2 Ilx* fl1 II y * fll 2 2
IIS(x 011 =
2 ,;; M 11x*1I II/II which means that
T
We can consider
is bounded.
T
* B
as a map of
e.g. [lJ), which states that an element the natural embedding of continuous in the Assume that
B
B-
into
*
B-topology on
** B • Using a theorem of Banach (see
into
B
u
of
belongs to the image of
i f the functional
, we show that
*
TB
B
in
u(v), v E B* , is is contained in
B.
a) is satisfied. Since
I: II S(e*f) 11 e = I: (E(I:(e*(\))2)1/2 e 2 n n n n k n n
L: (Te*(e*))1/2 e n n n
the series
B*
is convergent in
n
B
n
By theorem 2.1. in B ,
[2J,
T
is a covariance operator of a Gaussian measure
Y
On
i.e., T/(X*) =
J
2
y(dx)
B
Since Ilx* f,1122
Ix*(x)1 2
= II S(x* f)1 12 = Tx* (x* )
the implication Conversely, if
a) => b)
follows.
b) is satisfied, the series e
is convergent in
n
= I: n
B
as
(Te*(e*))1/2 e n
T
n
n
Is a covariance operator of
Yf •
o
112
ln
We recall that a Banach space does not contain Rademacher cotype r < '"
cotype
r < '" • Banach spaces of Rademacher type 2 are of some Rademacher
r 5J).
(see, e.g.,
Using Theorem 5.3. and 5.4. of THEOREM 1. Let
uniformly if it is of certain
'"
(en)nE N
[4J , Lemma 1 and the above remark we get
be an unconditional basis in
*
Band
(en)nE N
the dual
basis. Then i)
B
is of cotype 2 if and only if for every
* 2 en 2.: IIS(enOli
series
in
B
f
the convergence of the 2 is a sufficient condition for the L convergence
n
of
f.
B
ii)
is of type
2.: IIS(e * 01\ 2 e n
n
in
n
2 if and only if for every
the convergence of the series 2 is a necessary condition for the L convergence of f . 0
B
f
COROLLARY 1. In Hilbert space (and only in this space) the convergence of the series
2.: IIS(e* f)11
e is a necessary and sufficient condition for a martingale 2 n n n converge in the L2-nonn. Let
'Tr"y
be the family of all
exists a Gaussian measure
2
x
*
(B,B)
THEOREM 2. Let
for which there
such that
* [2 J
(en)nE N
p > 0 • Assume that
and the techniques of the proof of Lemma
be an unconditional basis in B
does not contain
n
1",
such that for each c 1 (p)
n
o
E B
Using Theorem 2.2. in
and
on
f
(f )
to
rB Ix*(x)1 2 yf(dx)
IIx*f1 12 = for each
Y
f
B-valued weak martingales
f
112.:11 s(e» n
11
2
enl lP
~
J
II
xil P
B,
*
(en)nE N
1 we get the dual basis
uniformly. Then there exist f E
~
Y
y(dx)
B
REFERENCES
[lJ
S. BANACH:
i2J
S.A. CHOBANYAN AND V.I. TARIELADZE : Gaussian characterizations of certain Banach spaces. J. Mult. Anal. 7, 1977.
[3J
K. MUSIAL
r 4J [5J
Theorie des operations lineaires, Warszawa 1932.
Martingales of Pettis integrable functions. Lecture Notes in Math. 794, 1979.
NGUYEN DUY TIEN : On Kolmogorov' s three series theorem and mean square convergence of martingales in Banach spaces. Theor. Prob. Appl. 24(2), 1979. W.A. WOYCZYNSKI : Geometry and martingales in Banach spaces. Advances in Prob., Dekker, 1978.
METRIC ENTROPY AND THE CENTRAL LIMIT THEOREM IN BANACH SPACES J. E. Yukich * Institut de Recherche Mathematique Avancee Universite Louis Pasteur 7 rue Rene Descartes 67084 Strasbourg, France
§l.
INTRODUCTION
The intent of this article is to study the relationship between (i) the central limit theorem in Banach spaces, (ii) the Donsker property for unbounded classes of functions, especially subsets of the Banach dual, and (iii) metric entropy with LP bracketing, p;::.l Before exposing the main results, let us first Set in Dlace the framework for empirical processes, the setting of this paper. Throughout, take (A,A,P) to be a probability space and xi' i;::'l, the coordioo nateS for the countable product (Aoo,Aoo,p ) of copies of (A,A,P). The nth empirical measure for P is defined as P (B)
n
n
I
j=l
Given a class FF(x)
n
-1
Fe
1
, BE A • {x.~B}
J
L2(A,A,P) of real-valued functions with envelope
sup I f(x) I f
valued functions on let
let F;
S:=too(F) equip
S
be the space of all bounded realwith the sup norm, i.e., for
s
I;
S
II sll:= sup Is(f)1
fEF
We note that (S, II II) is a Banach space, non-separable for F infinite. When FF is finite P a.e. the function-indexed emnirical process V
n
(f) (w):= n 1/2
I
is a random vector v n with values in S. Recall [7,8J that F is a P-Donsker class if n converges in law in S to a Gaussian proceSS Gp , indexed by F, with a.s. bounded, 0p-uniformly continuous sample paths (abbreviated BUC). Here,
v
o~f,g) *Current
:=
I
(f_g)2 dP - ([(f- g )dP)2
address: Lehigh University,Math.,Bethlehem,Penn.18015-USA
114
Gp
necessarily has mean 0 and the same covariance as
f fg
cov(Gp(f), Gp(g)) =
dP -
ff
dP
I
v
n
g dP .
Clearly, a necessary condition for F to be P-Donsker is that F be GpBUC, i.e., that the limiting Gaussian proceSS Gp can be chosen to be BUC. Define the mapping h(x)(f)
f (x)
-
f
h: A->- 5
by setting
f dP
for each x ~ A Defining the random variables Xj = h(x j ) , Dudley has shown [6J that if F is GpBUC then the p-Donsker property for F means that Xl satisfies the central limit theorem in (5, II II) , abbreviated Xl E CLT (Recall that if X is a ranoom variable with values in a Banach space B and if (Xn)neN is a sequence of independent copies of X , then X satisfies the CLT if n
L(
I
i=l
X. / /i1) 1
converges weakly to a Radon measure on
B.
5ee
[12,16J
for an exposi-
tion of the CLT in Banach spaces.) In this way we see that central limit theorems for the empirical process v n (f) , f e F , can be viewed as central limit theorems with respect to the norm II lion the Banach space 5. Throughout this paper let (B, II II ) denote a not necessarily separable Banach space , B* the dual space, and B*1 the uni t ball of B* Now a subset H of B* is ca 11 ed a norminf' subset (see [7 J) 1 for B i f and only i f II X II = sup I h (X) I for all x (' B Clearly, hE'H by the Hahn-Banach theorem, B~ is always a norming subset, a fact which We shall exploit in the second section. In this way, limit theorems in B can be viewed as limit theorems for empirical measures on B, uniformly Over a class of functions, such as Bl* ' since for feB * and x(l), ... ,x(n) e R (Ox(l)+"
.+ox(n))(f)
=
f(x(l)+ ... +x(n)) .
In particular, We note that X (with L(X) = P) satisfies the CLT in if and only if B* is P-Donsker. 1
B
115
The relationship between limit theorems for empirical processes and limit theorems for Banach space valued random variables has been studied by several authors. For example, Dudley has shown [7] that the Jain-Marcus CLT for C(S) -valued ranc10m variables [13] is in fact a consequence of Pollard's CLT [17] for empirical processes. Dudley has also shown [7] that the Fortet-l'-Iourier strong law of large numbers [IS] in separable Banach spaces is a consequence of the DeHardt-Dudley law of large numbers for a class 0 f functions [3,7]; We will return to this implication later on and will also provide a short and simpler approach. Limit theorems for vn(f) often involve a metric entropy condition on the index Set F In this article ~e will consider metric entropy wi th
LP
bracketing, defined as
Definition 1.
Given
such that
~
and
h (x)
J(f:J -C)J P P L
such that
[7]:
f,g : A-+ lR, define the bracket [f,g] := {h:A-+ lR
Fe LP (A,A,P)
[f~,f;] wi th
f(x)
.f'ollo~s
~g(x)
Let \'f "F
VXt'
A}
and
N CP) [ ] ..= N(P) [I
3 l~j~m
¢
otherwise.
(~~" F P)
such that
Now log Nfpj (E)
. { m: : = mIn
p~l
Let
3 [f-l' f+] 1
f. [fj,fj]
, E> 0 , ...
and
is referred to as metric entropy
bracketing.
We recall that metric entropy with Ll bracketing has heen especially useful in the study of empirical proceSSes; See e.g. the works of DeHardt [3], Dudley [4,7], Dudley and Philipp [8], Alexander [1], Borisov [2], Gine and Zinn [11], pyke [18], KOlcinskii [14], and Yukich [20,22, 23]. See also the recent monograph by Gaenssler [10]. In particular Nflj
Vn(f) [4,7,8]. One of the main ideas running through this article is that Ll bracketing is not a natural condition for descrihing weak convergence; L2 bracketing is a more natural choice. On the other hand L l hracketing is a more natural condition for descrihing laws of large numhers. These points will be discussed jn the following sections. has
has been used to descrihe the weak convergence o.f'
Finally, We will also USe metric entropy without bracketing which enjoyed wider use, and which is defined as follows:
Definition 2. fli F
~ l~j~m
Let with
N(P) (E,F,P)
;=
min{m:3fl, ... ,fm such that for all
JU-fj)PdP<E P }
ferred to as the metric entropy of
F
Now
10gN(P) (E,F,P)
is re-
116
Having set in place the framework for this paper, We now proceed to the main results, a summary of \,"hieh was announced in [2lJ. Section two discusses the relationsllip between (i) and (iii) of tIle first paragraph, section three studies the relationship between (ii) and (iii), and section four considers metric entropy with L2 bracketing.
§
2.
~lETRI C
ENTROPY A!\'D TIlL eLT FOR TIlE U\IT BALL 01' THE BANACH DUAL
Our starting point is the following result of Dudley [7J, which adds to Mourier's well known law of large numhers [15J.
Theorem 1. (cL section 6.1 of [7]1 Let X, Xl"" he a sequence of i.i.d. random variables with values in a separable Banach spaCe (B, II II). The following are equivalent: (i ) X sat i s fi est hest ron g 1 aw 0 f 1 a r g e n urnbe r 5 (S LDI) , (ii) E IIXII < and 00,
(i i i) N (1)
(E, B1* ,P) <
[ J
00
\J E > 0 .
I t is reasonable to ask \,-hether analogous results hold for those
satisfying the eLT The following theorems answer this query in the affirmative, showing that the study of the CLT in separahle
X
Hilbert spaceS
H
is conveniently studied through the USe of
N(2)
* , where III* [ J (E,Hl,P)
is the unit 11 all or the Hi 11,ert dual.
In fact, in the same way that the equivalence of XECLT
and
E{X}
o,
E II xii 2 <
00
characterizes, modulo an isomorpllism, separable Hilbert spaces, the following results show that the same is true for the equivalence of the condi ti ons
B*
1
is P-Donsker and inf N(2) 00 [J
( E,
Bl < * ' P) .
00
P
centered.
117
This is contained in the following results.
Theorem 2. Let X be a random variahle with valueS in (B, 1111), which need not be separable; L(X) = P Then for all P':' 1 the following are equivalent: (i)
(ii)
E
IlxII P
inf
< 00
N(P)
P
<
[J
pO
If
, and 00
is tight then the following are equivalent: (iii)
EllxllP
(iv)
N(P)
[ J
and
<00,
(E,Bl*,P)
<00
t'E>O.
Remarks Using the equivalence of (iii) and (iv) with P 1 We directly obtain the double implication (ii)~(iiii of Theorem 1 without using the SLL~ property of X . (1)
As is well known [12,16J the moment condition EIIXl1 2 <00 is in general neither necessary nor sufficient for X IC CLT ; it may also be (2)
easily Seen that the entropy conAition
.
f
N(P)
~~O'[ J
(
F)
E"P
<00
.
IS
.
neIther
necessary nor sufficient for F to be P-Donsker. The interest of Theorem 2 stems from the fact that when F is B~ these are in fact equivalent conditions and they consequently share the same properties.
The next result is essentially a consequence of Theorem 2. Theorem 3. Let X be a centered random variable with values in a Banach space (B, II II) ; LeX) = ? Consider the following properties of P: (i)
X
(ii)
B*
IC
CLT
1
is a P-Donsker class of functions,
(i i i) E Ilxll 2
< 00
,
* (E,Bl,P)
(iv)
N (2)
(vi
* inf N (2) (E,Bl,P) [ J E>O
[ J
'rIE > 0 ,
< 00
< 00
We have: (a)
If
P
is tight and
all equivalent.
B
is a Hilbert space then (i)-(v) are
118
(b)
If B is separable, then the equivalence of (i), (ii), (iii), and (v) is equivalent to the fact that B is isomorphic to a Hilbert space.
For the proof, We need only observe that when B is a separable Hilbert space, the equivalence of (i), (ii) and (iii) stems from the introductory remarks. For separable type 2 Banach spaces B we have sharp metric entropy conditions insuring the P-Donsker property for B*1 Theorem 4. Let a centered law.
(B, II II) If
be a separable type 2 Banach space and
P
inf E>O
then
B*
is a P-Donsker class. Conversely, if P is a law on any separable Banach space B and if Bi is P-Donsker, then for all 1
p < 2 ,
The proof of Theorem 4 follows at once from Theorem 2. For the first part We need only USe implication (ii)~ (i) with p = 2 ; for the second We USe implication (iii)~ (iv) with p < 2 together with the f'lct that i f XECLT then EllxllP
N[Pj
=
-
+
brackets [f.,f.] J J with
We will first show (ii)
(E,B * l , P). , j
=
~
(i).
Let
E> 0
and
By definition there is a collection of
l, ... ,m, such that given any
3l
+
fk(x) < f(x) < fk(x)
Vx "
B
and
Now B*1 is a norming subset of B* and thus for fixed x EO B We See that f(x) runs OVer all of the values between -llxll and Ilxll as This implies that for any fixed x , the sum of the f runs over B*1
119
m
I
differences of the brackets
j =1
(f:(x)-f~(x)) J
must be at least as large
J
Indeed, suppose this Here not the caSe.
as 211xll that
Then:3
X
o EB
such
Thus there exists an -llxoll < a < Ilxoll and a 0> 0 such that the open interval (a- 0 , a+o) is disj 0 int from the union of closed intervals m _ + .U [f.(xO),f.(xO)J J=1 J J
an
fEBl*
Moreover, by the Hahn-Banach theorem, there exists
such that
there exists
1< j.:::.m
This leads to a contradiction since
a
=
such that
+
fj (x) .:::. f(x) .:::. f j (x)
VX
E
R •
Thus, we have shown that m
For all
I
<
IIXII
j=l P > 1
f:(X)
-
fj (X)
J
a.s.
there exists a constant P ( (f +. - f.) (X))
J
J
Integrating this with respect to N[Pj
(s,B~,P)
completing
C . = C (p, m)
such
a.s. P
and using the definition of
shoHs that
(ii)~(i)
To show (i)=)(ii) it suffices to consider the single bracket defined by
fi (x) = -llxll
and
f~ (x)
= Ilxll
and to take
f
s = 2 Ilxll PdP (x) .
If P is a tight measure then We may show the stronger imnlication (iii)=9(iv); our proof is inspired by the proof of Proposition 6.1. 7 of [7J. Let p> 1 be fixed and note that for all s > 0 there is a compact Set K~ B such that
fBIK
IlxIIPdP<sP/4
The elements of
Bi ' restricted to
K, form a uniformly bounded
120
equicontinuous family ano hence this family is totally bounded ~or the sup norm II ilK on K by the ArzeIa-Ascoli theorem. ~ B* ' m < "" , such that Iff ~ B}* l Ilf - f. II J
for some g.
J+m
Let
J
=f.+E/4
On
J
N~Pj
(E
f , ... , f m l
< E/4
K
Then for every
Thus,
Take
g.=f.-E/4 J
K,
fE B* 1
B~,P)
J
on
K , g. (x) = - II xii
gj+m(X) = Ilxll if
J
for
Ilf - f j Ii K < E/4
< 2m < 00 , proving
x ¢K
,
then
(iii)~ (iv)
~or
for
x ¢K ,
all
j = 1, ... ,m
g.
-
J +m
and completing the Q.E.D.
proof of Theorem 2.
§3.
and
THE P-DONSKER PROPERTY AND METRIC ENTROPY is often an
The results of the above section show that appropriate tool for describing the P-Donsker property above results also suggest that
~or
the
N~P~ (E,F,P) , p f 2 , will not in
general be a satisfactory tool to describe the P-Donsker property for unbounded classes F S LP(A,A,P) . This is indeed the caSe: the following propositions show even in the preSence of an envelope condition on F, that (E,F,P) , P 2 , can not possibly give sharp resul ts. This may be seen by cons idering suitably chosen subsets of the dual to (1'.00, II 11 00 ), the Banach space of bounded functions on N+ equipped with the sup norm. More precisely, these subsets are classes of functions of the form
+
NtPj
121
F
:=
a,S
0:.
a. s . lA : s . J J
J
.
J
J
=
n,
or
0
where A. , j ~ 1 , is a sequence of disjoint subsets of A with J p. - P(A.)=j-S for some S> 1 and a. = J.a for some o < a < S - 1 J J J Since elements of F define measureS on /N+ , we may view Fa, S as a,S a subset of the dual to (tOO, II 1100)
We will consider the cases
p = 1 , P > Z and 1 < p < Z in this order; much of what follows may be found in [19J. Our f'irst proposition shows that a theorem of Dudley, which is recalled below, is far from being the "best possible". Theorem (cf. Theorem 3.1 of [5J). Suppose that F has envelope FF€ LP(A,A,PI for some p> Z Suppose that there exists y , o < '( < 1 - Z/ P an d M < such that 00
for
E
small enough.
is a P-Donsker class.
Z< p < 4
For all
Proposi tion-l.
F
Then
there exist
Donsker classes
P-
F
with
-y
FF
€
LP(A,A,P)
(E,F,P) > ZE
and
where
is any number leSS
y
than liZ. Fix
Proof. p/(p-Z) above.
and
Z
an,l a > 0 and choose , 2a + Z + , a (p Z) < 1 . Let S
Since S > pa + 1 Theorem Z.4 of [5J, F
t a.,rp-:J
J
J
<
00
such that a+,+Z< F as and let F a,S
It remains to find a lower
If the individual terms
a.p. J J
S >
'-or
boun~
Also, by 2a + 2 implies
F F € LP
it is easily verified that is a P-Donsker class since
( 1) N . [ J (E,F,P)
are greater than or equal to
F [ J ( E, , P) j:5.- j o =jo(E) , the n N (1) if ja-S ~ E and thus We may take
> 2 j o. i
-0
Now
- c-1/(S-a) -
a.p. > J J -
E
for all
if' and only
E
Let
y
= l/(B-a)=
small enough it is clear that y l/(a+,+Z) By taking a and , Q.E.D. may assume any value less than l/Z . The next proposition shows that suIts. N (1)
[ J
In particular, if
(E , F , P)
~
ZE
.y
for
Z
N (1)
[ J c1 an i f
can not provide sharp reF
is such that
(p-Z)/(p-l) < y < 1/ Z , then
F
FF
€
LP
and
may or may
122
not be a P-Donsker class of functions. Proposition 2. F
For any
2
FF~ LP
with envelope
and
an d
'( > (p - 2 ) / (p - 1) , the rea r eel ass e S < 2 E-' which do not satisfy
N[lj (E,F, P)
the P-Donsker property. Proof.
Let
2
a= (l-6)/(p-2), S= FF ~ LP
I
such that
J J
I
j':~jo
(p-2)/(p-l)
F=F
Let Now
a,S
is not P-Donsker,
for some SUI't a bl y 1 arge cons t an t jo and thus Ntlj (E,F, P) < 2'2
~E
ja-S
,= l/(S-a-l) =
p < 3 - 6.
and consider the class
2a+2
J' o = K E -l/(S-a-l)
a,p. =
j~jo
Setting
,+
6> 0
but by Theorem 2.4 of [5J, F
Now l'f then
and find
(p-2)/(p-1-6)
0 +0
and letting
K ,
We See that
Q.E.D.
, giving the desired result.
We conclude this section by exploring the relationship between P-Donsker classes and
p > 2
and
(E, F, P),
NtPj
in this order.
1
p> 1,
We consider the caSeS
We first show that
(p) 2 N[ J'P> ,
will not in general furnish sharp results for the P- Donsker property. Proposition 3. with
If and
q> 2
and
p < q/2 + 1 , then there are classes
F
2 E -, , ,> q , such that
mayor
N(q) (E,F, P) [ J
,...
F
may not be a P-Donsker class, We first show that there are P-Donsker classes
Proof.
P < q/2 + 1 large.
, such that
Let
q> 2
S = 2a + 2 + 6
N (q)
[ 1
(E , F , P)
p < q/2 + 1
and
and show that if
arbitrarily large for
0
> 2 E -'
, >q
for
Fa S
then
arbitrarily small.
with
FF
~
arbitrarily
N[qj
(E,F, P)
becomes
To See that this is
j ~jO(E)
q for all note that if the individual terms a~p. > E J Jjo Now a~n,>Eq if and only if then ( E, F , P) > 2
jqa-S~Eq
if and only
actually
50,
J J -
N[qj (E,F, P) that
F
~
jf
j::::. E-q/(S-qa) o
2 E -' ; moreOver
is P-Donsker and
On the other hand, i f F
LP
8 > 0 , a=2/q-2) ,
be fixed, let
F:~
F
,= q/s too
FF ~ LP , since N tqj
may not be P-Donsker, eVen if
(E, F, P) F
If as
,=q/(S-qa) 8 +0
then
Finally, We note
S > pa + 1 < 2 E -,
for
is of the form
, > q > 2 , then F a, S .
Indeed,
,
123
let Q
I(.~ .
a. 1A. )q dP < E: q J J
J-J o
N(q) (E: , F, P) [ ]
then
< 2.2
is satisfied whenever Thus, i f we obtain
F=Fa, 13'
j 0
Since the A. are disjoint, this inequality J a. . qa - 13+ ] . qa-a q . '::' E: J ~€, I.e., whenever J o
L
j>jo y=q/(I3-qa-l) , then
(q) N. [ ]
( c, F , P) <
2E:- Y .
Letting
Q.E.D.
y +q , completing the proof.
Finally, We consider Proposition 4.
If
N~q~ (€,F,
1
P)
wi th
1
is fixeu, 2
q (p - 2) / (p - q) < y < q/2 , then there exist classes N(q) (€ , F , P) [ ]
< 2€-y , such that
F
c( + Q
<
q
<
2 .
and F
with
p FF" L ,
may or may not be P-Donsker.
Let 1 < q < 2 and 2
Letting
y~
q/(B-ql'(-l) = q/((2-q) a+l)
at 1/(p-2)
shows that
, then
y + q(p-2)/(p-q)
, as desired.
In a similar way, the first half of the proof of Proposition 3 Let 6,a,13 be as in the proof shows that F may not be P-Donsker. of Proposition 3. As c( + 0 and 6 + 0 we see that y t q/2 , as desired. Q.E.D.
§4.
METRIC ENTROPY WITH
L2
BRACKETING
Dudley (c.f. Theorem 6.2.1 of (71) has shown that if formly bounded class of functions on (A,A,P) with
F
is a uni-
124
(1)
then
F
is a P-Donsker class.
The above sections suggest that
N[2~ (E,F,P)
is a more natural choice for the description of the PDonsker property; we are thus lead to the following Conjecture 1.
Let
Fe L 2(A,A,P) with
FF
~
2
L (A,A,P);
suppose that ( 2)
Then
F
is a P-Donsker class of functions.
Note that this conjecture should be compared with Pollard's central limit theorem [17J where Nf2j i~ replaced by a random entropy. for uniformly bounded ~ (2) is generally weaker than (1).
Also,
Now the following theorems support the conjecture and show that condition (2) is actually necessary in some cases. Recall that N(2) is given as in Definition 2. Theorem 5. Let P be a probability measure on ffi with a density f(x) such that (f(x)+f(-x)) is decreasin~ for x large; let H be the class of functions {x->-e itx It I < I} The following are equivalent: (i)
Ia11 og
(l) 7 N[ J(F-,H,P) dE <
(i i)
I 0Ao g
(2) N[ J (E,H,P) dE <
(i i i)
Ia/lOg
N(2)(E,H,P) dE <
(iv)
H
,
00
00
,
00
, and
is P-Donsker
The proof of Theorem 5 closely parallels the proof of Theorem 1 of [22J where the equivalence of (i), (ii), and (iv) is demonstrated; we do not provide the details here. Our final theorem should be regarded as a generalization of the Borisov-Dudley-Durst theorem [2,7,9J which characterizes when the class of all subsets of N+ is P-Donsker.
125
Theorem 6.
Let
m~
fm ,
1 , be a sequence of positive functions on
with disjoint support; let
I
m
Ilfmll
~
<
Ilfmllz
=
(f f;dP) l/Z
lR
and suppose
Let
00
I
F := {
f
m~A
m
The following are equivalent: (i)
I
m
Ilfmllz <
00
,
(i i)
dE<
(iii) F
00
and
,
is P-Donsker.
Proo'-, Observe that standard Gaussian processes techniques give (i)~ (iii), see [19J for details. Let us prove (i)~(ii); this will essentially be a consequence of the Borisov-Dudley-Durst theorem. Take ~ Ilfmll ~ defined by
=
\,1
<
00
and let
Q
be a probabili ty measure on
IN+
Ilfmll~ /M
Q({m}) =
Now we claim that (3)
where, as usual, the metric entropy for a class of sets is taken to be the metric entropy for the class of respective indicator functions. To show (3) we show that every bracket for the class F corresponds IN+ (Z) to a bracket for Z and vice versa. By minimality of N[ J(E,F,P}, +
a bracket [h 1~ , h. J for 1
h~
1
I m~A~
fm
and
A~
and
F will necessarily have the form +
h.
1
1
for some sets that
1
A: 1
I
+
m~A. 1
fm
A~c::A: I-
I
,
IN+
belonging to Z
Now observe
126
I
+
-
l
l
Z
(h. - h.) dP
=
r J
I+
fZ dP m
mt' A. \ A~ l
l
I
mt'A.+\ A.l
M·
Ilfmll ~
l
f (1 A.
1 _) dQ
+
A.l
l
Thus, if
+
-
Z
{ (hi -hi) dP}
f
liz
< E:
, then
f (lA: l
-
lA~)dQ l
Z
<
cZ/M , showing IN+
that every
LZ(P)
This shows
N[ ](E:,F,P) ~N[ ](E: 1M, ZN+, Q); to show the reverse in-
(Z)
E:-bracket for (1)
F
is an
Ll(Q) E: 1M-bracket for Z
Z
equality we just need to trace these steps backwards. By the Borisov-Dudley-Durst theorem, we have (4)
Combine (3) and (4) to obtain Theorem 6.
§5.
(i)~
(ii); this completes the proof of Q.E.D.
FINAL REMARKS
Having shown the efficacy of metric entropy with L Z bracketing, much still remains to be done. Apart from resolving the validity of the conjecture, it remains to be seen whether
proves useful in
studying the bounded and compact laws of the iterated logarithm (LIL). Based on the results of [ZO,Z3] it seems reasonable to make the following
127
Conjecture 2.
Let
2 FCL (A,A,P) with
FF£ L 2 (A,A,P)
suppose that
(2)
log N[ ](£,F,P) dE: <
00
•
(2)
logloglog N[ ] (E:, F, P) Then
lim n->-oo
sup
hF
!V n (f) I
<
00
a. s.
Iloglog n
Were it true, this conjecture would establish sufficient metric entropy conditions for the bounded L1L weaker than those provided by [8]. Whether
provides general rates of convergence for
vn(f)
more refined than those previously established by L l bracketing techniques (see e.g. [1,8,14,23]) is also an open question. Note added in proof:
After the writing of this paper I was informed
that Dr. M. Ossiander has obtained a proof of the conjecture announced in section 4; at the time of writing, however, I have not yet seen Ossiander's proof.
REFERENCES 1.
Alexander, K. S. (1984). Rates of growth and sample moduli for weighted empirical processes indexed by sets, preprint.
2.
Borisov, I. S. (1983). Problem of accuracy of approximation in the central limit theorem for empirical measures, Siberskij Matematicheskij Zhurnal, v. 24, no. 6, pp. 14-25 = Siberian Mathematical Journal, July issue, 1984, pp. 833-843.
3.
DeHardt, J. (1971). Generalization of the Glivenko-Cantelli theorem, Ann. Math. Statist., 42, pp. 2050-2055.
4.
Dudley, R. M. (1978). Central limit theorems for empirical measures, Ann. Prob. 6, pp. 899-929; correction 7, (1979) PP. 909-911.
128
5.
Dudley, R. M. (1981). Donsker classes of functions, Statistics and Related Topics (Proc. Symp. Ottawa 1980), pp. 341-352, New York, Amsterdam: North Holland.
6.
Dudley, R. M. (1984). An extended Wichura theorem, definitions of Dansker class, and weighted empirical distributions, preprint.
7.
Dudley, R. M. (1984). A course on empirical processes, Lecture Notes in Mathematics, no. 1097.
8.
Dudley, R. M. and Walter Philipp (1983). Invariance principles for sums of Banach space valued random elements and empirical processes, Z. Wahrschein. verw. Geb., 62, pp. 509-552.
9.
Durst, Mark and R. M. Dudley (1981). Empirical processes, VapnikChervonenkis classes and Poisson processes, Prob. Math. Statist. (WrocJaw), 1, no. 2, pp. 109-115.
10.
Gaenssler, P. (1983). Empirical processes. Institute of Mathematical Statistics Lecture Notes - Monograph Series 3.
11.
Gine, E. and J. Zinn (1984). Some limit theorems for empirical processes, Ann. Prob., 12, pp. 929-989.
12.
Jain, N. (1977). Central limit theorem and related auestions in Banach space, Proceedings of Symposia in Pure ~lathematics, vol. 31, pp. 55-65.
13.
Jain, N. and M. B. Marcus (1975). Central limit theorems for C(S) valued random variables, J. Functional Analysis, 19, pP. 216-231.
14.
Kolcinskii, V. I. (1981). On the central limi t theorem for empirical measures, Theor. Probability Math. Statist., 24, Pp. 71-82 = Teor. Verojatnost. i. Mat. Statist., 24, PP. 63-75.
IS.
Mourier, Edith (1951). Lois des grands nombres dans un espace de Banach, C. R. Acad. Sci. Paris, 232, Pp. 923-925.
16.
Pisier, G. (1975). Le the6reme de la limite centrale et la loi du logarithme itere dans les espaces de Banach, suite et fin, Sem. Maurey-Schwartz 1975-76, Expose IV, Ecole Poly technique, Palaiseau.
17.
Pollard, David B. (1982). A central limit theorem for empirical processes, J. Australian Math. Soc., Ser. A. 33, Pp. 235-248.
18.
Pyke, Ronald (1984). Asymptotic results for empirical and partialsum processes: A review, The Canadian Journal of Statistics, 12, no. 4, pp. 241-264.
19.
Yukich, J. E. (1982). Ph.D. thesis (M.I.T.)
20.
Yukich, J. E. (1984). The law of the iterated logarithm and the empirical characteristic function, preprint.
21.
Yukich, J. E. (1985). Theoreme limite central et entropie metrique dans les espaces de Banach, C. R. Acad. Sci. Paris, to appear.
22.
Yukich. J. E. (1985). Weak convergence of the empirical characteristic function, Proc. of Amer. Math. Soc., to appear.
23.
Yukich. J. E. preprint.
(1985).
Convergence of empirical probability measures,
Rates of convergence for classes of functions,