This content was uploaded by our users and we assume good faith they have the permission to share this book. If you own the copyright to this book and it is wrongfully on our website, we offer a simple DMCA procedure to remove your content from our site. Start by pressing the button below!
:(y\:£, .2) == H;-i(y,.2) Hin(YII}(y\:£, .2) == Hf(Y,.2)
for i ::;;; k - 2, for t» n - k + 2.
Proof. Since Y\:£ is an oriented n-manifold, the Poincare duality
theorem for relative manifolds and the exact cohomology sequence of the pair (Y,:£) provide isomorphisms
for n - i;;;. n - k + 2;;;. dim :£ + 2;;;. (
1 the Sobolev space W p •a is contained in continuous functions on [0,1]. We shall normalize the functions to take the value 0 at t = 0, then we can take for norm in W p•a the following expression (cf. [1]), 1 1
(1.4)
JJlu(t)/t-t'll+pa - u(t')JP , Ilullwp.adtdt P
-
o
(0 < a < 1, pa > 1).
0
2. Martingale Approximation of the White Noise We shall denote by X the Wiener space, that is the space of continuous maps from [0,1] in R", taking the value zero at t = O. The Sobolev norm for this space is still defined by (1.4) where we make appear /u(t)u(t')"... We shall denote by S; the finite subset of [0, 1] consisting of the points {kT n } , 1 0;;;; k 0;;;; 2n • We shall denote by K; the vector space of functions defined on S; with values in R". We define a map
P. Malliavin / Stochastic Differential Equations
585
obtained by restriction of x(·) to Sn' We shall denote by en the section of ({in defined by associating to an element of K; the piecewise linear function, constant in each interval defined by Sn' and such that ({in 0 en = identity. We define and denote Vn = l/Jn (X) . Then l/Jn is a linear projection of X on the finite dimensional subspace Vn. The image by l/Jn of the Wiener measure is a gaussian measure on Vn. Furthermore these projections are compatible in the sense for n'> n. If we denote by Jl- the Wiener measure on X then
(2.1) where Pn.n' is a gaussian measure supported by (Ker l/Jn) n Vn,. Furthermore denoting by H the Cameron-Martin space, that is the subspace of functions of X having their first derivative in L 2 , we have
We define a filtration :!lln of the Borel field :!llx of X:
where :!llv. is the Borel field of Vn. We define by B V' the conditional expectation relative to this filtration. Then Proposition (2.1). Given x E X define Y n = l/Jn(x). Then Y n = Boo·(x) and Y n is a X -oalued martingale relative to the filtration :!lln' which converges in the uniform norm to x. Proof. Formula (2.1). 0
Theorem (2.2). Let P be an even integer, a < ~ fixed. Then Y n is an L 2 martingale with values in wp,a.
586
P. Malliavin I Stochastic Differential Equations
In order to prove this theorem we shall use the following criterium [6J for the convergence of a martingale in a Banach space. Theorem (2.3). Let X n be a martingale with value in a Banach space B. Define p(.A) = sup{~(llx + .Ayll+ Ilx- .AyID-III/xll = 1, Ilyll = I}. Then there exists a numerical constant c such that
where
lIanlip = inf{ Cl~l p(:n)
satisfies:.
p(.A) < c.A z p(.A) < .A
if.A 1.
In particular
will imply Theorem (2.2). Proof. By the triangular inequality we have always
p(.A) ~ ~(1 + .A + .A + 1)- 1 ~ .A .
Denote p
= 2m 1 1
JJ
o
(u(t) + .Av(t) - u(t') - .Av(t,))zm
It
0
r:
t
'll+Zpa
, dt dt
-JJ .A'(u(t)- u(t,))Zm-'(v(t)- v(t'))' , - 2: (zm) I dtdt. 1 I
r
o
t- t 'IHZpa
0
P. Malliaoin / Stochastic Differential Equations
587
Denote 1
Ir =
1
dt dt'
JJ
(u(t) - U(t'»2m-r(V(t)- V(t'»' It _ t'll+2pa .
o
0
Then by the Holder inequality applied to the exponents 2m/(2m - r), 2m/r, we obtain
under the assumption
Ilull, IIvll = 1. Therefore
Therefore there exists a Ao such that for A < Ao, the following expansion, which proves the estimates of p, holds:
Suppose now
Then only a finite number of terms will be greater than 1, and neglecting a finite number of terms will not change the convergence of the martingale, we can therefore use the estimate p(A)
0
Proof of Theorem (2.2). Denote
Then h(t) vanishes on Sn-I' Denote S: = S; - Sn-I. To a point yES: we associate the intervals y + = (y, y + 2- n) and y - = (y - Tn, y). Then the collection of y +, Y - when y runs though S:, is a partition of [0, 1]. Therefore
P. Malliavin / Stochastic Differential Equations
588
For each of the integrals which appear in the second member, h(t)- h(t') is a linear function. We have the following majoration for each of these integrants (2.3) On the other hand (2.4)
with an exception when: (2.5) When (2.6)
a =
f3 we shall use the estimate /Ih(t) - h(t')/r ~ II h2~~)11 Pit- t'JP.
In the exceptional case (2.5) we shall use the estimate
Then, making the substitution in (2.2), we shall get
L
T3Pllhll~p.a ~
(11h (Y )/IP + Ilh (f3 )/IP) T 2n
-Y,fJES~
IY-f3I1+pa
1-y-fJ!>Z-"
L
+
2np.(llh(y)/IP + Ilh([3)lnrn(l-ajpTn
1-Y-fJl=P
+ L Ilh (y )IIP2nPTn(l-ajp Tn . -y
The second sum can be bounded by twice the third. We now can
P. Malliaoin I Stochastic Differential Equations
enumerate the points of S: by the expression YI Then Iy - ,B 1= 1/- 1'12- n• ,c::: 2-4PllhllP wp'a ~
~
f::.
589
(2/ + l)Tn, 0 ~ / < 2n - ' •
=
P P 2- n IIh(YI)II + Ilh(YI,)II 2npa + ~ 2- nlh( )I P2npa 1/- 1'11+pa YI .
7
1"1'
We now use the fact that the {h (YI)} are independent gaussian variables of variance Tn: where m
= dim(V) .
Therefore
we get
Efllh II~p.o] os;; c2 np (a - l /2) •
(2.7)
Now by HOlder (2.8)
Then by Lemma (2.4) we obtain Theorem (2.2). 0
In the limiting case a (2.9)
= ~ we have:
E[ sup {IIY;.II~p.ll2}]
< Cpn .
O""""n
3. The Control Transformation To every continuous piecewise ~l-function, f, we can associate the control problem [7] dv (t) ~ = Ao(vf(t» dt vf(O) = vo'
+ Ak(Vf(t»
dfk
-,
dt
590
P. Malliavin / Stochastic Differential Equations
We define in this way a map of Xl' the space of piecewise reI continuous functions defined on [0, 1] with values in R n into the space of V-valued continuous functions. We shall denote this map by cPo As Yn is a piecewise linear function then Yn E Xl' and cP(Yn ) is well defined. The manifold V is supposed to be compact. Then V can be imbedded 2 [5] in R d (where d = dim( V)). Then the space of maps from [0, 1] in V can be identified with a subset of maps from [0, 1] with values in R and d2 The Sobolev norm on Wp·a([O, 1]; IR ) induces a Sobolev distance on W p•a ([0,1]; V). Furthermore criteria of convergence for semi-martin2) gales in the Banach space wp,a([0, 1]; IR d are transferred in the same way to wp,a([0, 1]; V).
<
Theorem (3.1). (Main Theorem.) Suppose that the manifold V is compact and that the vector fields A o' A k are of class re 2. Then {cP(Yn)} is a semimartingale converging in w 4 ,a for all a <~. We will proceed here turning the discrete index in order to be able to use the resources of view will be still available in the case A E [n, n + 1], we shall define Yt(t) as the constant in each interval of S~+l such that
index n into a continuous of Ito's calculus; this point of stochastic P.D.E. For piecewise linear function,
if a E
s..
and such that denoting S: = Sn+l- S)" we have: 21/2 y t ( /3 ) are independent Brownian motions. We have, when n is an integer,
so that y# is a kind of stochastic interpolation of Y n. By abuse of notation we shall denote yt by Y),. Then YA is a martingale with values in W 4•a and we have the analogous of (2.8) (3.1) Theorem (3.1) will be proved if we show that CP(Y),) is an L 2 continuous semi-martingale with values in w 4 ,a .
P. Malliavin / Stochastic Differential Equations
591
3.1. Differential of the Control Mapping We need to compute directional derivative in the direction of h that is to differentiate VA
= cP( Y A )
in the direction h
= d YA •
We shall denote by -VA the derivative in t of Yno Differentiating the control map in A, we obtain
We introduce the resolvent matrix R(t, t') defined by
a at
(3.2)
- R A (t, t') = Q A (t)R A (t, t'),
R(t', t') = Id,
with (3.3) Then we have
(3.4) o
We obtain finally (3.5)
a2V_ J 1
-2 -
all.
o
RA(t,
aV s aV s' aV s aV s' 'rk aV s ' 2 2 ~)[ass,Ao--+ ass,A k - - Y A + 2asA k - dY Ak ] . all. all. . all. all. all.
°
592
P. Malliavin / Stochastic Differential Equations
4. Stochastic Differential Calculus on the Variable A
Denote d Y A
=
YH 00
(4.1)
E[J
• -
Y A • Then we have to prove
:i~e-IEK[llcP'(YA)dYJ~4.a]dAJ<+oo,
+00
(4.2)
E[J Il::e-IEKA[cP"(YA)(dYA,dYA)]ldA
J<+oo.
I
To estimate cP'(YA) d YA we shall make an integration by parts in (3.4) relatively to the variable ~. Denote M.(t) = cP'(YA) d YA we get (4.3)
M.(t)=Ak(v(t))dY~(t)+ J R(t,OQ(OAk(V(O)dYA(Od~ o
- J R(t,
~)«asAk)(V(~))AS(v(O)Yn(~)dYA(0 d~ = I + II + III.
o
The treatment of each term will occupy one of the following paragraphs. We finish the present paragraph by some estimate for dYA , A E [n, n + 1]:
where the G, independent variables. We then get
Lemma (4.1). Let u, v E Wp.<·
n L then uv E 00
wp,a
and
Proof. Iu(t)v(t) - u(t')v(t')1 ~ lu(t)llv(t) - v(t')/ + Iv(t')11 u(t) - u(t')I. D Lemma (4.2). Let t/J E
eel,
uE
Wp,a
then
P. Malliaoin / Stochastic Differential Equations
593
Proof. II/J(U(t» - I/J(u(t'»1 :0:;; l""'IIL~lu(t) - u(t')I. 0 Proposition (4.3). The expression (1) is defined in (4.3). There exists a constant c independent of nand p, such that lim e -I£,,'(A [II 12]
:0:;;
c(n2- nII
0 but
then by Pisier's inequality, n>Ao
We choose Ao such that
"" 2- n Ao
and we find for all A > AO , which concludes the proof.
P. Malliavin / Stochastic Differential Equations
599
References [1J R. Adams, Sobolev Spaces (Academic Press, New York, 1975). [2J R. Cairolli and B. Walsh, Stochastic Integrals in the plane, Acta Math. 134 (1975) 111-183. [3] M. Ibero, Integrales Stochastiques multiplicatives, Bull. Sci. Math. France 100 (1976) 175-191. [4J P. Malliavin, Stochastic Calculus of Variation and Hypoelliptic operator, In: Proc. Internal. Symp. S.D.E., Kyoto (1976) 195-263. [5J J. Nash, The imbedding problem for Riemannian manifold, Ann. of Math. 63 (1956) 20-64. [6J G. Pisier, Martingales with values in uniform convex spaces, Israel J. Math. 29 (1975) 326-351. [7J D. Stroock and S.R.S. Varadhan, Support of a diffusion process, Proc. Sixth Berkeley Symp. III (1972) 333-368.
J.A. BARROSO editor, Aspects of Mathematics and its Applications © Elsevier Science Publishers B.V. (1986)
601
POLYNOMIAL APPROXIMATION IN NUCLEAR FRECHET SPACES
JorgeMUnCA Instituto de Matemdtica; Universidade Estadual de Campinas, 13.100 Campinas, SP, Brazil Dedicated to Leopoldo Nachbin on the occasion of his sixtieth birthday
1. Introduction
Let K be a polynomially convex compact subset of a Frechet-Schwartz space E. By combining results of Ligocka [7], Matyszczyk [9] and the author [14], we show that if E has a basis or the bounded approximation property, then each function which is holomorphic on a neighbourhood of K can be uniformly approximated by polynomials on a suitable neighbourhood of K. This is a sharp version of the classical Oka-Weil theorem; see Weil [22] and Oka [19]. Since Mitiagin and Zobin [11] have constructed a nuclear Frechet space without a basis, and Dubinsky [6] has constructed a nuclear Frechet space without the bounded approximation property, it becomes of some importance to try to extend the aforementioned approximation theorem to arbitrary Frechet-Schwartz spaces or at least to arbitrary Frechet nuclear spaces. This is what we do next, i.e. we extend the aforementioned result to arbitrary Frechet nuclear spaces. As an application of this approximation theorem we show that if U is a polynomially convex open subset of a nuclear Frechet space, then the compact-ported topology T", introduced by Nachbin [17] coincides with the compact-open topology TO on the space :fe( U) of all holomorphic functions on U.
2. Polynomial Approximation in Frechet-Schwartz Spaces with Basis
All locally convex spaces in this paper are assumed to be complex and Hausdorff. If E is a locally convex space then we shall denote by cs(E) the set of all continuous seminorms on E. If a E cs(E) then (E, a) will denote the vector space E seminormed by a, whereas E a will denote the
602
J. Mujica / Polynomial Approximation in Nuclear Frechet Spaces
associated normed space. We shall denote by flP(E) the space of all continuous polynomials on E. We recall that a compact set K C E is said to be polynomially convex if K = K !1'(E)' where K!1'(E)
= {y E EIIP(y)l,;;;; sup IP(x)1 for all P E flP(E)}. xEK
We recall that an open set U C E is said to be polynomially convex if the set K!1'(E) n U is bounded away from au for each compact set K C U. The open set U is said to be strongly polynomially convex if the set K !1'(E) is contained in U and bounded away from aU for each compact set K C U. Every strongly polynomially convex open set is polynomially convex, and the converse holds if E is a quasi-complete locally convex space with the approximation property (see [13, Coroll. 3.3]). Moreover, the classical argument used in the proof of [13, Coroll. 3.3] can also be used to prove the following lemma, already established by Matyszczyk [9, Coroll. 2.6] in the case of Frechet spaces with the approximation property.
Lemma (2.1). Let E be a quasi-complete locally convex space with the approximation property. Let U be a polynomially convex open subset of E. Then the union of an arbitrary collection of components of U is polynomially convex as well. We shall also need the following lemma, which is essentially due to Dineen; see [5, Example 2.4].
Lemma (2.2). Let E be a locally convex space with an equicontinuous Schauder basis (en)' Let a be a continuous seminorm on E satisfying the condition (2.1)
for every x = L:=l tnen in E. Then there is a complemented subspace M; of E with the following properties: (a). M; has an equicontinuous Schauder basis and a continuous norm. (b). E = MaEB a-1(O). (c). If U is a connected pseudoconvex open set in E which has non-
J. Mujica / Polynomial Approximation in Nuclear Frechet Spaces
603
empty interior in (E, a), then U = 7T~ 1 (U n M a ) , where 7Ta : E ~ M a denotes the continuous projection associated with the decomposition E = MaEB a- 1(O ). Note that (a) and (b) are proved in [5, Example 2.4] whereas (c) follows from [18, Th. 2.1.7]. Now we can prove the first result announced in the introduction.
Theorem (2.3). Let E be a Frechet-Schwanz space with the bounded approximation property. Let K be a polynomially convex compact subset of E. Then for each function f holomorphic on an open neighbourhood of K there is a sequence (Pn ) in [!jJ(E) which converges uniformly to f on a suitable neighbourhood of K. Proof. Let f be holomorphic and bounded on an open neighbourhood U of K. By a result of Ligocka [7, Prop. 2.1] there is a polynomially convex open set V such that K eve U. We claim that there is a sequence (Pn) in [!jJ(E) which converges to f in (~(V), TO) (after shrinking if necessary). Indeed, if E has a continuous norm then our claim follows directly from a result of Matyszczyk [9, Th. 2.12]. If E does not have a continuous norm then we proceed as follows. In view of Lemma (2.1) we may assume that the polynomially convex open set V has only finitely many connected components. Then we choose a E cs(E) satisfying (2.1) and such that each connected component of V has non-empty interior in (E, a). Now we can apply Matyszczyk's theorem to the subspace M; of E given by Lemma (2.2). This yields a sequence (On) in [!jJ(Ma) which converges to f in (~(V n M a), To)' If we set Pn = On 0 '7Ta then (Pn) C [!jJ(E) and (Pn) converges to f in (~(V), TO)' as asserted. In particular we see that the sequence (Pn ) is bounded in (~( V), TO)' and hence locally bounded in V; since the space E is metrizable. Hence, after shrinking V; if necessary, we may assume that the sequence (Pn) is uniformly bounded on V. And since E is a Schwartz space we may apply [14, Lemma 5.5] to find an open set W with K ewe V such that (Pn ) converges to f uniformly on W This completes the proof. 0
v:
3. Polynomial Approximation in Nuclear Freehet Spaces We shall need the following more precise formulation of Ligocka's result [7, Prop. 2.1].
604
J. Mujica I Polynomial Approximation in Nuclear Frechet Spaces
Proposition (3.1). Let E be a quasi-complete locally convex space. Let K be a polynomially convex compact subset of E. Let stI be a directed family of seminorms which generates the topology of E. Then for each open set U:J K there exist an a E stI and a strongly polynomially convex open set Va in E; such that K C 7T:1( Va) C U, where 7Ta : E -+ E a denotes the canonical mapping. Proof. Let clos(T(K» denote the closed, convex hull of K. For each point a E clos(r(K»\ U there exists aPE g'J(E) such that JP(a)1 > 1 and SUpxEK IP(x)/ < 1. Since clos(T(K»\U is compact we can find P t , · · · , P; in g'J(E) such that SUpxEK 1~(x)1 < 1 for j = 1, ... , nand n
clos(F(K»\U C U {x E EII~(x)1 > 1}, j=t
i.e. clos(r(K»
n
n
n {x E EII~(x)I:so; 1} CU.
j=t
Choose an a E stIsuch that ~ is continuous on (E, a) for j = 1, ... , n. Then an argument in the proof of Ligocka's result (see [7, Prop. 2.1] or [13, Th. 3.1] shows the existence of a f3 E stI with f3 ;;;. a such that (clos(T(K» + N p) n
n {x E EII~(x)I:so; 1} C U, n
j=t
where N p = {x E E If3 (x ) < l }. Since f3;;;'a each ~ is continuous on (E, f3) and it follows from Liouville's theorem that each ~ can be factored in the form ~ = OJ 7Tp, where OJ E g'J(Ep) and 7Tp: E -+ E p denotes the canonical mapping. Set 0
V = (clos(F(K» + N p) n
and V p = 7Tp(clos(r(K» + N p) n
n
n {x E EII~ (x)1 < 1}
j=1
n
n {x E EpllQix)1 < 1}.
j=1
Then K eve U, V = 7Tj/(Vp ) and Vp is strongly polynomially convex, as asserted. 0 Theorem (3.2). Let E be a nuclear Frechet space. Let K be a polynomially convex compact subset of E. Then for each function f holomorphic on an open neighbourhood of K there is a sequence (Pn ) in g'J(E) which converges uniformly to f on a suitable neighbourhood of K.
J. Mujica / Polynomial Approximation in Nuclear FrechetSpaces
605
Proof. Let I be homomorphic and bounded on an open neighbourhood U of K. Since E is nuclear its topology is generated by a directed family of seminorms stJ such that E a is a separable inner-product space for each a E stJ (see [20, Prop. 4.4.1]). By Proposition (3.1) we can find an a E stJ and a polynomially convex open set Va in E a such that K C '7T: 1(Va) C U, where '7Ta: E ~ E a denotes the canonical mapping. An application of Liouville's theorem shows that the restriction of I to '7T:l(VJ can be factored in the form 1= go '7Ta, where g E JYt'(Va). Since Va is a polynomially convex open subset of the separable inner-product space E a , a result of Matyszczyk [8, Coroll. 1] yields a sequence (On) in £J>(Ea) which converges to g in (JYt'( Va)' 'To)' (Note that Matyszczyk states this result for Banach spaces with the bounded approximation property, but he only uses the completeness of the space to guarantee that the sequence of finite rank operators which converges pointwise to the identity is equicontinuous.) If we set P; = On0 '7Ta then P; E £J>(E) and (Pn) converges to I in (JYt'('7T: 1(Va 'To)' Now we can proceed as in the end of the proof of Theorem (2.3), using the Schwartz property of E to find an open set W with K ewe '7T: 1(Va) such that (Pn) converges to I uniformly on W This completes the proof. 0
»,
It follows from Theorem (3.2) that each polynomially convex open subset of a nuclear Frechet space has the Runge property, in the sense of [12] and [15]. Thus by combining [14, Th. 5.6] and [15, Th. 4.3] we obtain at once the following: Theorem (3.3). Let E be a nuclear Frechet space. Then the topologies 'To and 'T.... coincide on JYt'(U) lor each polynomially convex open subset U 01E.
In [14] the author has shown that 'To = T", on JYt'(U) when U is a balanced open subset of a Frechet-Schwartz space. In [16] the author has shown that To = T.... on JYt'(U) when U is an arbitrary open subset of a Frechet-Schwartz space with the bounded approximation property. Note that none of these three results implies any of the other two. These results extend earlier results of Barroso [1], Barroso and Nachbin [2], Schottenloher [21], Boland and Dineen [4], and Meise [10]. Finally we remark that Theorems (3.2) and (3.3) remain valid for every Frechet-Schwartz space E whose topology is generated by a directed family of seminorms stJ such that E a is a separable inner-product space for each a E stJ. These are precisely the Frechet spaces belonging to the Schwartz-Hilbert variety in the sense of Bellenot [3].
606
J. Mujica I Polynomial Approximation in Nuclear Frechet Spaces
References [1] J.A. Barroso, Topologias nos espacos de aplicacoes holomorfas entre espacos localmente convexos, An. Acad. Brasil. Cienc. 43 (1971) 527-546. [2] J.A. Barroso and L. Nachbin, Some topological properties of spaces of holomorphic mappings in infinitely many variables, In: Advances in Holomorphy, North-Holland Math. Stud. 34 (North-Holland, Amsterdam, 1979) 67-91. [3] S. Bellenot, The Schwartz-Hilbert variety, Michigan Math. J. 22 (1975) 373-377. [4) P. Boland and S. Dineen, Holomorphic functions on fully nuclear spaces, Bull. Soc. Math. France 106 (1978) 311-336. [5] S. Dineen, Surjective limits of locally convex spaces and their applications to infinite dimensional holomorphy, Bull. Soc. Math. France 103 (1975) 441-509. [6) E. Dubinsky, Nuclear Frechet spaces without the bounded approximation property, Studia Math. 71 (1981) 85-105. [7] E. Ligocka, A local factorization of analytic functions and its applications, Studia Math. 47 (1973) 239-252. [8] C. Matyszczyk, Approximation of analytic operators by polynomials in complex Bo-spaces with bounded approximation property, Bull. Acad. Polon. Sci. 20 (1972) 833-836. [9] C. Matyszczyk, Approximation of analytic and continuous mappings by polynomials in Frechet spaces, Studia Math. 60 (1977) 223-238. [10] R. Meise, A remark on the ported and the compact-open topology for spaces of holomorphic functions on nuclear Frechet spaces, Proc. Roy. Irish Acad. 81 (1981) 217-223. [ll) B. Mitiagin and N. Zobin, Contre-example Ii I'existence d'une base dans un espace de Frechet nucleaire, C. R. Acad. Sci. Paris 279 (1974) 255-256, 325-327. [12] J. Mujica, Spaces of germs of holomorphic functions, In: Studies in Analysis, Advances in Math. Suppl. Stud. 4 (Academic Press, New York, 1979) 1-41. [13] J. Mujica, The Ok a-Wei! theorem in locally convex spaces with the approximation property, In Sem. Paul Kree 1977/78 (lnstitut Henri Poincare, Paris, 1979) Exp. 3. [14) J. Mujica, A Banach-Dieudonne theorem for germs of holomorphic functions, J. Funct. Anal. 57 (1984) 31-48. [15] J. Mujica, Holomorphic approximation in Frechet spaces with basis, J. London Math. Soc. (2) 29 (1984) 113-126. [16) J. Mujica, Holomorphic approximation in infinite dimensional Riemann domains, Studia Math., to appear. [17) L. Nachbin, On the topology of the space of all holomorphic functions on a given open subset, Indag. Math. 29 (1967) 366-368. [18] P. Noverraz, Pseudo-convexite, convexite polynomiale et domaines d'holomorphie en dimension infinite, North-Holland Math. Stud. 3 (North-Holland, Amsterdam, 1973). [19] K. Oka, Sur les fonctions analytiques de plusieurs variables I, Domaines convexes par rapport aux fonctions rationnelles, J. Sci. Hiroshima Univ. 6 (1936) 245-255. [20] A. Pietsch, Nuclear locally convex spaces, Ergeb, Math. Grenzgeb. 66 (Springer, Berlin, 1972). [21) M. Schottenloher, T", = TO for domains in C", In: Infinite Dimensional Holomorphy and Applications, North-Holland Math. Stud. 12 (North-Holland, Amsterdam, 1977) 393395. [22) A. Weil, L'integral de Cauchy et les fonctions de plusieurs variables, Math. Ann. 111 (1935) 178-182.
l.A. BARROSO editor, Aspects of Mathematics and its Applications © Elsevier Science Publishers B.V. (1986)
(:1.)7
EIGENVALUES OF ABSOLUTELY r-SUMMING OPERATORS
Albrecht PIETSCH Fnedrich-Bchiller-Unioersitat, Sektion Mathematik, lena, D.D.R. Dedicted to Leopoldo Nachbin
o.
Introduction
In 1977 it was shown by Johnson-Konig-Maurey-Retherford [2] that the eigenvalue sequence of any absolutely r-summing operator with 2,,;;; r < 00 belongs to I,.. The four authors' proof (mainly based on complex interpolation techniques and Brouwer's fixed point theorem) is rather involved. Another approach due to Konig-Weis [4] turns out to be a little bit simpler. Using the concept of Weyl numbers as well as the stability of ~r (ideal of absolutely r-summing operators) with respect to the s-tensor product we are able to give a completely different and surprisingly elementary proof of the result stated above.
1. Operator Ideals of Riesz Type
Let E and F be complex Banach spaces. Then E(E, F) denotes the Banach space of all (bounded linear) operators T from E into F endowed with the norm II Til = sup{IITxlll x E U} , where U is the closed unit ball of E. We write
E= U E(E,F). E,F
Suppose that for every pair of Banach spaces E and F we have a subset
608 ~(E,
A. Pietsch / Eigenvalues of Absolutely r-Summing Operators
F) of Q(E, F). The class ~=
U
E.F
~(E,F)
is said to be an operator ideal if the following conditions are satisfied: (1). If a E E' and y E F, then a ® y E ~(E, F). (2). If S, T E ~(E, F), then S + T E ~(E, F). (3). If X E Q(E o, E), T E ~(E, F), Y E Q(F, F o), then we have YTX E ~(E, F). A non-negative function T ~ /IT/l1l defined on an operator ideal ~ is called a quasi-norm if it has the following properties: (1). IIa® ylll!l = IIalillYil for a E E' and y E F. (2). /IS + T/l1l ~ c'Il [l/SII9I + /I TlIw] for S, T E ~(.E, F), where cl!l;?; 1 is a constant. (3). IIYTXIII!l ~ 11Y11/lTlll!lllxll for X E Q(E o, E), T E ~(E, F), Y E Q(F, F o)' A quasi-Banach operator ideal is an operator ideal ~ equipped with a quasi-norm /1.1121 such that all linear spaces ~r(E, F) are complete with respect to the induced metric. In the case when c'IX = 1 the prefix 'quasi' is omitted. Let ~l' ••• ,21m be quasi-Banach operator ideals. Then the product ~m o· •• o~ll consists of all operators T E Q(E, F) which can be written in the form
where the infimum is taken over all factorizations described above. It turns out that ~m o· •• 0 ~l becomes a quasi-Banach operator ideal. The m-th power of a quasi-Banach operator ideal ~ is defined by ~m := ~ o ••• 0 ~ (m factors). To give a first example, we state that the class Sf of all compact operators is a Banach operator ideal with respect to the ordinary operator norm. Note that the famous Riesz theory holds for all T E Q(E, E) having some compact power T'". Then every eigenvalue A ~ a has a finite multiplicity which is defined to be the dimension of
A. Pietsch I Eigenvalues of Absolutely r-Summing Operators
N(A, T)
= U""
k=O
{X E
EI (A Id -
Ttx
=
609
a} .
Moreover, all eigenvalues can be arranged in a sequence (An(T)) such that the following conditions are satisfied: (1). Every eigenvalue A ¥ 0 is counted according to its multiplicity. (2). IAJ(T)I ;a.IA 2(T)1 ;a. ... ;a. O. (3). If T possesses less than n eigenvalues A ¥ 0, then An (T) = 0 (by definition). Let 0 < r < 00 and 0 < w :;;; 00. An operator ideal ~ is said to be of Riesz type Ir, W if some power ~m is contained in Sf and (An (T)) E Ir, W for all T E ~(E, E). The second condition means that the quantity II(A n(T))!I/r, w =
{i nWlr-J/An(T)/w n=1
r
w
if 0 < w < 00,
and if w = is finite. In the case when r
= w we omit the index
Remark (1.1). Note that ~2m
00,
w.
!: Sf whenever m > r.
We have the following principle of uniform boundedness [5]: Theorem (1.2). Suppose that the quasi-Banach operator ideal type Ir; w· Then there exists a constant c r, W ;a. 1 such that for all T E
~(E,
~
is of Riesz
E) .
Remark (1.3). Observe that the constant c r; w does not depend on the
Banach space E.
2. Tensor Stability of Operator Ideals
Let e(E, F) denote the Banach space of all bounded bilinear functionals z defined on the cartesian product E ' x F ' endowed with the norm
610
A. Pietsch / Eigenvalues of Absolutely r-Summing Operators
[z],
= sup{lz(a, b )11 a E
tr; b E
VOl ,
where VO and VO are the closed unit balls of E' and F', respectively. The algebraic tensor product of the Banach spaces E and F is the linear subset E®F of t:(E, F) spanned by all x
0
y: (a, b)~ (x, a)(y, b)
with x E E and y E F. Thus every element z E E0F has the form n
z=
L x;0Yi' i=l
where Xl' ... , x n E E and Yl' ... , Yn E F. The algebraic tensor product S0 T of the operators T E 2(F, Eo) is given by n
n
;=1
i=l
SE 2(E, Eo)
and
S ® T : L x;0 Yi ~ L Sx, 0 Ty;. A non-negative function z ~ Ilzlla defined on all algebraic tensor products E0 F is said to be a tensor norm if it has the following properties: (1). IIx0 ylla = Ilxlillyll for x E E and y E F. (2). lIu + vila .,-; Ilulia + Ilvll a for u, v E E0 F. (3). II(S0 T)zlla .,-;; lISllllTllllzlla for ~ E E 0 F, SE 2(E, Eo) and TE 2(F, Fo)' Given any tensor norm a, then the a-tensor product E®a F of the Banach spaces E and F is the completion of E 0 F with respect to the norm a. The a-tensor product S®a T ofthe operators S E 2(E, Eo) and T E 2(F, Fo)is defined to be the unique continuous extension of S0 T. The quasi-Banach operator ideal ~ is called stable with respect to the tensor norm a if S E ~{(E, Eo) and T E ~(F, Fo) imply S®a T E ~(E®aF, Eo®aFo) such that
where c ~ 1 is some constant.
A. Pietsch / Eigenvalues of Absolutely r-Summing Operators
611
3. Riesz Type and Tensor Stability The most outstanding moments in history of mathematics are those when we discover an interplay between two concepts which-at a first glance-have nothing in common with each other. Theorem (3.1). Suppose that the quasi-Banach operator ideal with respect to a tensor norm a such that
m:
is stable
for all T E ,)l(E, E) . If ?l is of Riesz type l.; then for all T E
~{(E,
E) .
Proof. If A =I- 0 and J.t =I- 0 are eigenvalues of T, then AJ.t is an eigenvalue of T@a T, and we have (3.1)
In order to verify this inclusion, let x E N(A, T) and y E N(J.t, T). Then there exist hand k such that and respectively. Write m
=
h
+k
- 1. Now it follows from
that
(AJ.t IdE®a E- T@a TtxQ!)y m
=
L (7 )(A IdE -
T)IT m-1x ® (J.t IdE)1 (J.t IdE - T )m-I y =
1=0
because (A IdE - Tyx =
0
for 1= h, . . . , m
0,
A. Pietsch / Eigenvalues of Absolutely r-Summing Operators
612
and for I
=
0, ... , h - 1 .
Hence x @ y E N(Aj-t, T 0« T). This proves (3.1). As an immediate consequence of the preceding observation we see that (A j (T)A/T)) can be identified with a subsequence of (An (T 0« T)). By Theorem (1.2) there exists a constant c.> 1 such that for all S E Taking F
=
E
0« E
and S = T
(i IAn (T)I')
2/, =
n=l
~(F;
F) .
0« T, we obtain
(i i IA
j
(T)Aj (T)/') 1/,
1=1 J=l
.s;
(i IAn(T 0« T)j')
l/T
.s; c,lI T0c: Tllw
n=l
.s; c,cllTlli . This implies that
If c, is chosen as small as possible, it follows that c,.s; (C,C)1/2. Thus c;«: c.
o
Remark (3.2). The idea that tensor product techniques can be used in
order to improve constants in certain inequalities goes back to B. Russo [8, Proof of Th. 1].
4. Operators of Weyl Type
We now summarize some results from [7] which will be applied in the following. Let n = 1,2, .... The n-th approximation number of any operator T E 2(E, F) is given by
A. Pietsch / Eigenvalues of Absolutely r-Summing Operators
613
an(T) = inf{IIT - Alii A E 52(E, F), Rank(A) < n}. Moreover, we define the n-th Wey/ number by
Let 0 < r < 00 and 0 < w ~ 00. An operator T E 52(E, F) is said to be of Wey/ type / r; w if (x, (T)) E Ir, w - This condition means that the quantities if 0< w <00, and if w
=
00,
are finite. The quasi-Banach ideal of these operators is denoted by 52~~~. In the case when r = w we omit the index w. Theorem (4.1). The quasi-Banach operator ideal 52~~~ is of Riesz type Ir• w • Theorem (4.2). Let 1/p + 1/q
=
52 (x)
p,U
1/r and 1/u + l/v 0
=
l/w. Then
s.! (x) C 52 (x) q,v -
'.W·
5. Absolutely r-Summing Operators Let 1 ~ r < 00. An operator T E 52(E, F) is called absolutely r-summing if there exists a constant c ;3 0 such that
for every finite family of elements
XI' ... ,
IITII'$, =
Xn
E E. Set
inf c.
The Banach ideal of these operators is denoted by 1.13,. Let E be the tensor norm defined by
A. Pietsch / Eigenvalues of Absolutely r-Summing Operators
614
where the right-hand side does not depend on the representation n
z
=
2:x;0Y;. i=1
The following result is due to LR, Holub [1]. For the convenience of the reader, we give a simplified version of his proof. Theorem (5.1). The Banach operator ideal tensor norm e such that
~,
is stable with respect to the
for all S E ~,(E, Eo) and T E ~,(F: Fo)' Proof. According to a fundamental characterization of absolutely rsumming operators [6, 17.3.2] there exist normalized measures IJ- and II on U O and VO such that
IISxl1 ~ IISII\U,
(J I(x,
a)I' dIJ- (a)
UO
r'
for x EE,
and
IITyll~IITII\U,
(J I(Y, b)I' dll(b)r'
for Y E F,
vO
respectively. Let XI"'" x n E E and Ilaoll ~ 1 and IIbol ~ 1, we have
YI"'"
Yn E F. Then, whenever
I~ (Sx;, ao)(Ty;, bo)j ~ I T(± (Sx;, ao)Y; )jl ,=1
,=1
A. Pietsch / Eigenvalues of Absolutely r-Summing Operators
615
~IITII~,(J 115(~X;(y;,b»)I/' dv(b)r' yO
~1I511\ll,IITII\ll,(J J It(x;,a>(y;,b>/' d/L(a)dv(b)r' yO Vo
,-I
This means that
11(50 T)zll. :;;;1I511~,IITII\ll, (J J Iz(a, b)/' d/L(a) dv(b)r', yO Vo
for all z E E 0 F. Since E ®. F is the closed hull of E 0 F in e (E, F), by continuous extension we obtain
1I(5®. T)zll. ~ 11511~,IITII~, (J J Iz(a, b)1' d/L(a) dv(b) yO Vo
for all z E E ®.F. Hence, given
Zl'"
.,
r'
zm E E ®.F, it follows that
~ 11511~,IITII~,sup{~ IZk(a, b)I'
Ia
E
o», bE vo}
~ 11511~, I TII~, sup{ ~I I(Zk' C>1' ICE wo} , where WO denotes the closed unit ball of (E®.F)'. This proves that the s-tensor product 5 ®. T is an absolutely r-summing operator with 115®. TII\ll' ~ 11511~,IITII\ll,. 0 The following inclusion is adopted from [7].
Theorem (S.2). Let 2:;;; r < 00. Then ~, ~ 53 ~~~.
A. Pietsch / Eigenvalues of Absolutely r-Summing Operators
616
Remark (5.3). Note that the embedding operator I from L "'(0, 1) into L'(O, 1) is absolutely r-summing.
Let
h; denote the characteristic function of the interval .:1; =
«i -1)/m, i/m) with i = 1, ... , m. Put m
m
Om = m 2, h;Q$le;,
and
J m = '" LJ e. 10. \C:;I h., I
;=1
;=1
where e, is the i-th unit m-vector. Then we have the commutative diagram I
L"'(O, 1)- L'(O, 1) m
1
i
i;
1
0m
I",(m)-I,(m) .
Since IIJml1
= 1 and IIQmll = m'", it follows that [6,11.11.8]
X n(I)m 1/, =
IIJmIlxn(I)II O; 11;:z,; xn(Im : I",(m) --+ I,(m»
;:z,; xn(Im : 12(m) --+ I",(m» = an (Im : 12(m) --'; I",(m»
Taking m = 2n, we obtain xn(I);:z,; e,n -1/'. This proves that ~, g: 2~~~ whenever 0 < w < 00. Therefore Theorem (4.1) does not imply directly that ~, is of Riesz type I,. We are now able to establish the main result of this paper which improves [2, Th. 2.3]. Theorem (5.4). (Eigenvalue Theorem.) Let l/r = l/r1 + ... + l/rm with 2 ~ rk < 00. Then ~,m 0 ' " o~,I is of Riesz type I" and we have
for all T
E~,
m
0'"
o~, (E, I
E).
A. Pietsch / Eigenvalues of Absolutely r-Summing Operators
617
Proof. From Theorems (4.2) and (5.2) we conclude that
Therefore, by Theorem (4.1), the product ~'m 0" ,ol.E'l is of Riesz type 1,,00 and so of Riesz type Is for all s > r. In view of Theorem (1.2) there exist constants c, ;;;;. 1 such that
for all T E I.E,m 0 ' " o I.E,1(E, E). We know from Theorem (5.1) that the Banach operator ideals ~" ... ,~, are stable with respect to the tensor norm E. Obviously, I .. ~, 0 ' " 0 ~, has the same property. Hence Theorem (3.1) implies that we I may take c, = 1 in the above inequalities. Letting s ~ r yields the result we are looking for. 0 OJ
Remark (5.5). The fact that I.E,.. o· .• I.E,! is of Riesz type 1,,00 was proved for the first time by H. Konig [3] with the help of elementary interpolation techniques. 0
References [1] J.R. Holub, Tensor product mappings, Math. Ann. 188 (1970) 1-12. [2] W.B. Johnson, H. Konig, B. Maurey and J.R. Retherford, Eigenvalues of p-summing and Ip-type operators in Banach spaces, J. Funct. Anal. 32 (1978) 353-380. [3] H. Konig, Interpolation of operator ideals with an application to eigenvalue problems, Math. Ann. 233 (1978) 35-48. [4] H. Konig and L. Weis, On the eigenvalues of orderbounded integral operators, Integral Equations Operator Theory 6 (1983) 70~729. [5] A. Pietsch, Eigenwertvertverteilungen von Operatoren in Banachraumen, In: Hausdorff Festband, Theory of sets and topology (Akademie Verlag, Berlin, 1972) 391-402. [6] A. Pietsch, Operator ideals (Dt. Verlag Wissenschaften, Berlin, 1978) and (NorthHolland, Amsterdam, 1980). [7] A. Pietsch, Weyl numbers and eigenvalues of operators in Banach spaces, Math. Ann. 247 (1980) 149-168. [8] B. Russo, On the Hausdorff-Young theorem for integral operators, Pacific J. Math. 68 (1977) 241-253.
J.A. BARROSO editor, Aspects of Mathematics and its Applications © Elsevier Science Publishers B.V. (1986)
619
EXISTENCE OF BEST APPROXIMANTS IN BANACH SPACES OF CROSS-SECTIONS Joao B. PROLLA Departamento de Matemtitica, Universidade Estadual de Campinas, Campinas, SP, Brazil Department of Mathematics, Center for Approximation Theory, Texas A & M University, College Station, Texas, U.S.A. To Professor Leopolda Nachbin on the occasion of his sixtieth birthday
O. Introduction
In this paper we exploit the idea of using a selection theorem similar to Michael's selection theorem to solve the problem of finding best approximants. We consider Banach spaces (and Banach algebras) L of cross-sections over some base space X, i.e. vector subspaces (or subalgebras) of the Cartesian product of families of normed spaces (or normed algebras) {Ex I x E X} endowed with the sup-norm /~sup{II/(x)lIlx E X} together with an upper semicontinuity hypothesis on the function x ~ 11/(x)ll. In such spaces we consider closed vector subspaces We L such that their cross-sections W(x) = {g(x)1 g E W} are proximinal in L(x), which we assume to fill up the fiber Ex, i.e. L(x) = Ex' The following problem was then posed by the late S. Machado (personal communication with the author): find conditions on X, Land W such that proximinality of W in L will accrue. In Section 1 we prove a selection theorem, which is then applied in Sections 2 and 3 to show that certain C({b(X, ~)-modules are proximinal in L. The author acknowledges his debt to the late Professor S. Machado with whom he had planned to write a joint paper on this subject as a continuation of their paper [9]. The author is also grateful to Dr. Jaroslav Mach for many conversations on the subject, while the latter was visiting the Department of Mathematics of the Universidade Estadual de Campinas during August 1982. Finally the author expresses his warm appreciation to all the members of the Center for Approximation Theory at the Texas A & M University, where this work was partly performed in the period January-May 1983.
620
J.B. Prol/a / Existence of Best Approximants
1. A Selection Theorem
Throughout this paper X will denote a non-empty completely regular Hausdorff space. Further properties of X will be stated when needed. For each point x in X, let us be given a normed space Ex whose norm we denote by v ~ /lv/l. The elements of the Cartesian product of all spaces Ex will be called cross-sections over X, i.e. cross-sections are functions defined on X and such that their values at a given point x lie in the corresponding space Ex. The Cartesian product of all the spaces Ex is made a vector space under coordinatewise operations, since we assume that all the vector spaces Ex are over the same field of scalars ~, where ~ denotes either the real or the complex field. By ce(X, ~) we denote the algebra over ~ of all continuous functions defined on X and taking values in ~. The subalgebra of <€(X,~) consisting of all elements of <€(X,~) which are bounded on X is denoted by ceb(X, ~). When X is locally compact, two more subalgebras will be considered. The first one is ceo(X, ~), the subalgebra of ce(X,~) consisting of all elements of ce(X,~) which vanish at infinity, i.e. those IE ce(X,~) such that, given e >0 there is a compact subset KCX such that I/(x)1 < e for all x in X not in K. It follows that <€o(X,~) is contained in ceb(X, IK). The second sub algebra of ce(X, IK) that will be considered, when X is locally compact, is <€c(X, IK), the subalgebra of ce(X, IK) consisting of all elements of <€(X, IK) which have compact support, i.e. those IE ce(X, IK) such that the closure in X of {x E XI I(x) oj. O} is a compact set. It follows that cec(X, IK) is contained in ceo(X, ~). Now returning to the general situation, the Cartesian product of all the spaces Ex is also made a module over the algebra ce(X, ~), also called a ce(X, ~)-module, by means of pointwise operation, i.e. given a function a E ce(X,~) and a cross-section I, then af is defined to be the crosssection whose value at x is the vector a(x )/(x).
Definition (1.1). A vector subspace L of the Cartesian product of all the spaces Ex is called a normed space 01 cross-sections over X, if the following conditions are satisfied: (i). For each IE L, the non-negative function x ~ II/(x )11 is upper semicontinuous and bounded on X. (ii). For each x EX, L(x) = {f(x)/ I E L} = Ex. It follows from (i) that the function I ~ sup{/l/(x )/11 x E X} is a norm on L. It is understood that L will always be equipped with this norm. In
J.B. Prolla / Existence of Best Approximants
621
particular, if L is complete under this norm, we say that L is a Banach space 01 cross-sections over X. Historically, the notion of spaces of cross-sections goes back to von Neumann, whose paper [11] was written in 1937/38, Godement [4] and Shilov [10]. Kaplansky [5] treated the case in which each Ex is a normed algebra. For a survey of the subject and its relation to Banach bundles, see [3]. We shall be interested in the following problem. Let L be a normed space of cross-sections over X, and let We L be a closed non-empty subset. Under what circumstances, assuming that for each x E X the set W(x) = {f(x)1 lEW} is proximinal in Ex, will W be proximinal in L? This problem arises in analogy with the Stone-Weierstrass theorem which implies that, when X is compact and both Wand L are Y6'(X'; ~)-modules, then W is dense in L if, and only if, W(x) is dense in Ex for each x EX. This density result is an easy corollary of the 'strong' version of the Stone-Weierstrass theorem for Y6'(X, ~)-modules which states that, when X is compact, L is a normed space of cross-sections which is a Y6'(X, ~) module, and We L is a vector subspace which is a Y6'(X, ~)-submodule, then for any IE L, dist(f, W) = sup{dist(f(x), W(x))1 x E X}. For a proof of this 'strong' version of the Stone-Weierstrass theorem see [8, Th. 1.30]. (See also [2, Lemma 4] and [1].) Proposition (1.2). Let L be a normed space 01 cross-sections over X. Let we L and I E L be given. Then, an element g E W is a best approximant to I in' W if, and only if, g(x) E ip(x) lor each x E X, where ip(x) = {h(x)1 hEW; Ilh(x) - I(x)/I ~ d} and d = dist(f, W).
Proof. Suppose g E W is a best approximant to I in W Then, for each x E X'; IIg(x) - I(x )11 ~ Ilg - III ~ d. Hence g(x) E tp (x), for each x E X. Conversely, assume that g E W satisfies the condition g(x) E ip(x) for each x E X. Then Ilg - III = sup{llg(x) - l(x)II I x E X} ~ d,
and so g is a best approximant to
I in W 0
622
J.B. Prolla / Existence of Best Approximants
To deal with the situation presented by Proposition (1.2) we will introduce some terminology.
Definition (1.3). Let L be a normed space of cross-sections over X. A carrier 01 X into L is set-valued mapping lp defined on X and such that lp(x) C Ex for each x E X. If W C L, and lp is a carrier of X into L, we shall say that lp is a W -carrier if
a
lp(x) C W(x)
= {/(x)I/E
W}
for every x E X. If tp is a carrier of X into L, a selection lor lp is a cross-section IE L such that I(x) E lp(x) for each x E X. If a selection for rp belongs to some set l¥, we may say that it is a W-selection for lp.
Remark (1.4). By Proposition (1.2) the problem of finding best Wapproximants to a given IE L is reduced to the problem of finding W-selections for the corresponding carrier ip, Obviously, to find such selections it is necessary that lp(x) ¥- 0 for each x E X. Now, this is certainly the case, if W(x) is proximinal in L(x) = Ex for each x E X. Indeed, if ux'E W(x) is a best W(x)-approximant to I(x), then
lIux -
l(x)11 ~ dist(f(x), W(x».
However, if h E l¥, then
Ilh(x) - l(x)11 ~ IIh -
III.
Hence dist(f(x), W(x» ~ d. Therefore lIux - l(x)1I ~ d, and so u, E lp(x), i.e. lp(x) ¥- 0. However, we cannot assume that for some gEl¥, g(x) = u, for all x E X. Even gEL with this property may not exist. Hence the following problem arises:
Problem (1.5). Let L be a normed space of cross-sections over X. Let we L be a closed and non-empty subset such that, for every x E X, the set W(x) is proximinal in Ex' Find sufficient conditions for W to be proximinal in L.
J.B. Prolla / Existence of Best Approximants
623
We will direct our efforts in the direction pointed out by Proposition (1.2), i.e. to search for sufficient conditions on W so that, for any given f E L, the corresponding carrier cp admits a selection. Now Michael's selection theory suggests that existence of a W-selection for tp may be obtained by appropriate lower semicontinuity with respect to W Such a concept was introduced by Prolla and Machado [9, Dei. 3.1] and will be used here. Definition (1.6). Let cp be a carrier of X into L, and let W C L. The carrier cp is said to be lower semicontinuous (abbreviated l.s.c.) with respect to W at a point x E X if, for each g E Wand r> 0 such that cp(x) n B,(g(x» :r! 0, there exists a neighbourhood V of x in X such that cp(t) n B,(g(t» = 0 for all t E V. (We used the following notation: B,(g(x» ={v E Exlliv - g(x)1I < r}.)
The carrier cp is said to be lower semicontinuous with respect to W if it is l.s.c. with respect to W at all points of X. This is equivalent to saying that, for each g E Wand r > 0, the set
is open. Definition (1.7). Let L be a normed space of cross-sections over X, and let MeL. Let A C C€(X'; IK). We say that M is a locally finite A-module if, for every collection {(fa' wa)1 a E I}, with wa EM and fa E A such that every point x of X has a neighbourhood in which all but finitely many of the fa's vanish, the cross-section x ~ La fa(x)wa(x) belongs to M. Clearly any vector subspace which is a locally finite A-module is an A-module. (Recall that an A-module, say W; is a vector subspace of L such that af E W for every pair a E A and fEW.) We can now state our main selection result, analogous to [7, Th. 3.2", part (a)~ (b)], whose proof we adapt. Theorem (1.8). Let X be a paracompact Hausdorff space, and let L be a Banach space of cross-sections over X'; which is a locally finite C€b(X, IK)-
624
J.B. Prolla / Existence of Best Approximants
module. Let we L be a closed non-empty subset which is a locally finite [0, 1])-module. Let rp be a W-carrier which is lower semicontinuous with respect to Wand such that, for each x E X, the set rp(x) is a closed and convex non-empty subset of E". Then, there exists a W-selection for rp.
~(X,
For the proof we shall need some lemmas. Lemma (1.9). Let L be a normed space of cross-sections over X and let We L. Let rp be a W-carrier of X into L which is lower semicontinuous with respect to W Let g E Wand r > O. Define a carrier l/J of X into L by setting l/J(x) = rp(x)
n B,(g(x))
for each x E X. Then I/J is lower semicontinuous with respect to W
Proof. Let w E W, e > 0 and x in the set
T
= {t E Xil/J(t) n BE(w(t))
~
0}
be given. There is some element f in W such that f(x) E rp (x), Ilf(x)g(x)/I < rand Ilf(x) - w(x)11 < E. Choose sand" such that Ilf(x)- g(x)11 < s
WD
Remark (1.10). For each vEE" we have been using the notation B,(v) to denote the set {u E E"lllu - vii < r}. Let us denote by B,(S) the set S + B,(O) C E" for every non-empty subset SeE". Lemma (1.11). Let X, L, Wand q; be as in Theorem (1.8). Then for each e > 0, there is an element g E W which is an e-approximate selection for tp, i.e. g(x)E BE(rp (x)) for each x in X.
J.B. Prolla / Existence of Best Approximants
Proof. Let e > 0 be given. For each
U(W) = {x E
W
625
E W consider the set
XI w(x) E B.«(,O(x»}.
By lower semicontinuity of (,0 with respect to W the set U(w) is open. Now, for each x E X, the set (,O(x) is non-empty and contained in W(x). Thus {U(w)1 wE W} is an open covering of X. Since X is a paracompact Hausdorff space, there exists a collection {fal a E I} of non-negative real-valued continuous functions fa : X ~ [0,1] such that (1). La fa(x) = 1, for every x E X. (2). Every point x of X has a neighbourhood in which all but finitely many of the fa's vanish. (3). For every a E I, there is wa E W such that fa(x) = 0 for all x ft U(wa ) . Define a cross-section g by setting
a
for all x E X. Then gEl¥, because W is a locally finite ee(x, [0, 1])module. Let x E X. Let I(x) be the finite subset of I such that a E I(x) if, and only if,fa(x) ¥- 0. Then
g(x) =
2:
fa(x)wa(x).
aEI(x)
For each a E I(x), we have x E U(wa). Hence wa(x) E B.«(,O(x». Since B.«(,O(x» is convex, and La E I(x) fa(x) = 1, g(x) E B. «(,0 (x». 0 Proof of Theorem (1.8). We will construct a sequence gn (n = 1,2,3, ...) of elements of W satisfying for all x E X the following two conditions: (1).
gn(x)EB p+2(gn_!(x»
(n=2,3, ...).
For n = 1, the existence of a cross-section g, satisfying (2) follows from Lemma (1.11). Let us assume that the cross-sections g!, B» ... , gn belonging to W have been defined satisfying (1) and (2) above.
626
J.B. Prolla I Existence of Best Approximants
For each x E X, define
By Lemma (1.9), l/Jn+1 is a lower semicontinuous carrier with respect to W. By (2), l/Jn+I(X) is non-empty for every x E X. By Lemma (1.11) applied once again, there is gn+1 in W such that gn+I(X) E B 2-n-l(l/Jn+I(X)) for all x E X. But then gn+i(x) E B 2-n+l(gn(X)) for all x E X, which is (1) for n + 1. On the other hand, gn+I(X) E B 2-n-l (cp (x)) for all x E X, which is (2) for n + 1. This proves that the sequence {gn} satisfying (1) and (2) can be obtained by induction. From (2) it follows that
for all n = 2,3, .... Since L is a Banach space, and W is closed, there is some element g E W such that {gn} converges to g. We clain that g is a W-selection for tp, Assume that g(x) g cp(x) for some point x E X. Since cp(x) is closed, there is some integer k such that cp(x) n B 2-k(g(X)) is empty. Since gn(x)~ g(x), there is some n > k + 1 such that gn(x) E BrH(g(X)). On the other hand, from (2) it follows that
Hence
But since cp(x)n Brk(g(X)) = 0, it follows that
a contradiction. Therefore g(x) E cp (x) for all x E X. 0
Remark (1.12). An analysis of the proof of Lemma (1.11) reveals that Theorem (1.8) remains true with the following modifications: (a). Assume X is compact. Then we may assume that both Land W are only convex ee(X, t\)-modules, i.e, given any finite number of functions ai' a2 , ••• , an in ee(x, 1<) and cross-sections 11'/2' .. ''/n in L (resp. W) such that 0,,;;; a, ,,;;; 1 and L a, = 1, then L af, belongs to L (resp. W). This
J.B. Prolla I Existence of Best Approximants
627
is certainly the case if W c L is a vector subspace and both Wand L are I'€(X; IK)-modules. (b). Assume that the carrier q; is bounded on X, i.e. there is some constant k > 0 such that sup{sup{lltlll t E q;(x)}1 x E X} ~ k ; and that, for any x E X and t E q;(X) there is some g E ~ with Ilgll ~ M, such that t = g(x). In this case, Theorem (1.8) remains true even if we relax the condition defining a locally finite module M by requiring that x ~ ~ fa (x) Wa(x) necessarily belongs to M only when the family {w a I a E I} C M is uniformly bounded. In this case we shall say that M is a bounded locally finite A-module. For example, M = I'€b(X, E) is a bounded locally finite r5b(X , ~) module, but it is not a locally finite I'€b(X, ~)-module. Moreover, in our applications of Theorem (1.8) the carrier will be tp as defined in Proposition (1.2), and q; is certainly bounded on X, with constant k = Ilfll + dist(f, W).
2. Proximinality of Certain I'€b (X; K)-ModuJes
Let L be a normed space of cross-sections over X. Let we L be a closed non-empty subset such that, for each x E X; the set W(x) = {g(x)/ g E W} is a closed non-empty subset of Ex = L(x). For each fE L, f ~ ~ let q; be the corresponding W-carrier: q;(x) = {h(x)/ h E ~ Ilh(x)- f(x)1I ~ d},
where d = dist(f, W) > O. If we define d(x) = dist(f(x), W(x», for each xEX; then d(x)~d. Hence D = sup{d(x)1 x E X}:s;; d.
Under the hypothesis above, let us denote by X(f, W) the possibly empty set of all points x E X such that d(x) = d, i.e. X(f, W) = {x E
XI dist(f(x), W(x» = dist(f,
W)}.
628
I.B. Prolla I Existence of Best Approximants
One instance in which X(f, W) ¥
0 is the following: X is compact and
we L is a vector subspace such that Awe W, where A = C€(X';
~). See Machado and Prolla [6], where we show that x ~ d(x) attains its supremum d = dist(f, W).
Proposition (2.1). Let we L be a closed non-empty subset such that for each x E X, the set W(x) is proximinal in Ex = L(x). Then, for each f E L, f ti. W, the W-carrier ({) is lower semicontinuous with respect to W at each point x ~ X (f, W). Proof. Let x ~ X(f, W) be given, i.e. d(x) = distU(x), W(x) < d. Let g E Wand r> 0 be such that
tp(x)nB,(g(x» ¥ 0. There is some wE W with w(x) E tp(x), so that lIw(x) - f(x)lI::E; d and /Iw(x) - g(x)/1 < r. Firstly, suppose /Iw(x) - f(x)/1 < d. Then, since w - f and w - g belong to L, by upper semicontinuity there is a neighbourhood Vof x such that
Ilw(t)- f(t)11 < d
and
lIw(t) - g(t)/1 < r,
for all t E V. Hence (()(t) n B,(g(t» is non-empty for all t E V. Suppose now that /Iw(x) - f(x)/1 = d. We know that d(x) < d, because x ~ XU, W). Since we have assumed that W(x) is proximinal in Ex there exists some hEW such that /If(x)- h(x)11 = d(x) < d. Now for each O::E; A ::E; 1 let gA E W be the element gA = (1- A)w + Xh. We claim that gA (x) E (() (x). Indeed, gA E Wand IlgA (x) - f(x )'I::E; (1- A)llw(x) - f(x)11 + Allh(x) - f(x)/I ::E; (1 - A)d + Ad (x) < (1 - A)d + Ad = d .
».
On the other hand, for A sufficiently small gA (x) E B,(g(x Indeed, IlgA (x) - w(x )/I::E; A/lh (x) - w(x )/1. Now if we take A > 0 so small that AlIh(x) - w(x)11 < e, where e = r -llw(x)- g(x)/1 > 0, then IlgA (x) - g(x)/I::E; Ilw(x) - g(x)11 + IlgA (x) - w(x)l/
< /lw(x)- g(x)/I+ e < r.
J.B. Prolla / Existence of Best Approximants
629
Therefore, we can take A > 0 so that IlgA (x) - f(x )11 < d and IlgA (x) g(x)11 < r and argue as in the first case: for some neighbourhood V of x, it is true that IIgA (t) - f(t)1I < d and IIgA (t) - g(t)1I < r for all t E V. Hence lp(t) n B,(g(t» is non-empty for all t E V. 0 As a consequence of Proposition (2.1), we have the following result: Theorem (2.2). Let X, Land W be as in Theorem (1.8). If W(x) every x E X, then W is proximinal in L.
=
Ex for
Proof. For any f E L, f g ~ the set X (f, W) is empty, since d (x) = 0 < d for every x E X. Now lp is a W-carrier such that lp(x) is a closed and convex non-empty subset of Ex. By Theorem (1.8), lp admits a Wselection, which by Proposition (1.2) is a best approximant to f in W. 0 Corollary (2.3). Let X be a paracompact Hausdorff space and let E be a Banach space. Let We C(fb (X, E) be a closed vector subspace which is a locally finite C(f(X, [0, 1])-module. If W(x) = E for each x E X, then W is proxim inal in (X, E).
e,
Proof. For each x E X, take Ex = E and let L = C(fb (X, E). Now C(fb (X, E) is a locally finite C(fb(X, ~)-module and it remains to apply Theorem (2.2), and Remark (1.12). 0 Proposition (2.4). Under the hypothesis of Proposition (2.1), for each fE L, f g ~ the W-carrier tp is lower semicontinuous with respect to W at all points x E X(f, W) such that for some neighbourhood V of x and some constant c > 0, d - d(t);;?: c, for all t E V, t:;e x. Proof. Let g E Wand r > 0 be given such that Ip(x)
n B,(g(x» :;e 0.
Choose 8> 0 such that 8 < d and 8 < 3- 1d-1rc. By upper semicontinuity there is a neighbourhood N C V of x such that IIf(t) - g(t)11 < d + 0 for all t E V. Now for any tEN with IIf(t) - g(t)11 ~ d, clearly lp(t) n B,(g(t» :;e 0. Fix now tEN with Ilf(t) - g(t)1I > d. Since N C V, d - d(t);;?: c. Let y(t) > 0 be defined by y(t) = I/f(t) - g(t)l/- d. Clearly, y(t) < o. Define 0 < A < 1 by
630
J.B. Prolla I Existence of Best Approximants
A = (d - d(t) + y(t»-l(d - d(t»
=(lIf(t) -
g(t)/I- d(t)r1(d - d(t» .
Since W(t) is proximinal, choose hEW with Ilf(t) - h (t)1I = d(t). Let gA E W be defined by gA = (1- A)h + Ago We claim that s. (t) E rp(t) n B,(g(t». Firstly: IlgA (t) - g(t)1I = (1- A )llg(t)- h (t)11 :so; (1- A) (lig(t):so; (1-
f(t)lI + lIf(t) - h (t)IO
A)(d + y(t) + d(t»
= y(t)(d - d(t) + y(t»-l(d + y(t) + d(t» < 8(d - d(t) + y(t)r1(d + y(t) + d(t»
< 8(d - d(t) + y(t»- 13d < rc(d - d(t) + y(t)r 1< r. On the other hand: lIgA (t) - f(t)II:so; Allf(t) - g(t)1I + (1- A)/If(t) - h(t)1I = A (d
+ y(t» + 1(1- A)d(t)
0
From the definition of A it follows that A(d + y(t» + (1- A)d(t) = do Hence l/gA (t) - f(t)II:so; d
and therefore gA (r) E
tp (r)
n B,(g(t» as claimed. 0
Theorem (2.5). Let X be a paracompact Hausdorff space, and let L be a Banach space of cross-sections over X which is a locally finite 'f5b (X , K)module. Let W be a closed vector subspace of the form W
=
{f ELI f(t)
=
O,[or all t E T},
I.B. Prolla I Existence of Best Approximants
631
where T C X is some closed subset. Then W is proximinal in L. Moreover, lor each IE L, I fit l¥, the subset X (f, W) is contained in T and, if X is compact, then d
= dist(f, W) = sup{lI/(t)111 t E T}.
Proof. Let x E X; x g T, be given. Let v E Ex be also given. There is some gEL such that g(x) = v. Since X is completely regular, there is some a E cgb(X, &(), 0::::; a::::; 1, such that a(x) = 1 and a (t) == 0 for all t E T. Then w = ag belongs to Wand w(x) = v. Thus W(x) = Ex for all x E X, x fit T. Therefore, one has d(x) = 0 < d = dist(f, W), and x fit X(f, W). When X is compact, we know that d = sup{d(x)1 x E X}. Now, when t E T, W(t) = {O} and therefore d(t) = 11/(t)ll. Hence
d = sup{d(t)1 t E T} = sup{lI/(t)II I t E T} . To prove the proximinality of l¥, by Theorem (1.8) and Proposition (2.1), only the lower semicontinuity of cp at x E XU, W) needs proof. Let IE L, I fit l¥, x E XU, W) be given. Notice that, for any given x E X we have
cp(x)={O}
if xE T,
cp(x) = {v E Exlliv - l(x)II::::; d}
if x g T.
Let g E Wand r> 0 be given such that cp (x) n BAg(x » ~ 0. Notice that, by the first part of the proof, x E T and therefore g(x) = O. Moreover, since x E XU, W) C T, d = d(x) = II/(x)II, i.e. 11/(x) - g(x)11 = d < d + r. By upper semicontinuity of 1- gEL, there is some neighbourhood V of x in X such that
II/(t)- g(t)11 < d + r
for all t E V.
(i). t E V n T. Then cp(t) = {O} and g(t) = O. Hence, cp(t) n B,(O) ~ 0. (ii). t E V and t fit T. Then W(t) = E, and cp(t) = {u E E,lllu - 1(t)11 : : ; d}. Choose wE W such that I(t) = w(t), and let gA = Ag + (1- ..\)w, where ..\ = d(d + y(t))-l, and y(t) = 1I/(t) - g(t)ll- d. Firstly: l!gA (t) - 1(t)II = ..\llg(t) - 1(t)II = ..\(d + y(t» = d.
632
J.B. Prolla / Existence of Best Approximants
Secondly: /IgA (t) - g(t)II = (1- A )IIg(t)- /(t)11
= r(t) = II/(t) Hence
g(t)ll- d < r.
e, (t) E fP(t) n B,(g(t». 0
Theorem (2.6). Let X be a paracompact Hausdorff space and let E be a Banach space. For every closed subset T C X, the closed vector subspace W
= {IE "€b(X, E)I/(t) = 0, t E T}
is proximinal in cgb (X, E). Proof. For each x E X and let Ex = E. For each / E "€b(X, E), / g W; fP is a bounded carrier and W satisfies the condition that given x E X and t E rp(x) there exists agE W with g(x) = t and Ilgll.,; Ilfll + d. By the same argument as in Theorem (2.5), fP is lower semicontinuous and then by Remark (1.12)(b), the W-carrier fP has a selection in w: By Proposition (1.2) this selection is a best approximant to / in w: 0
3. Proximinality of Closed Two-Sided Ideals In this section we shall assume that each normed space Ex is in fact a normed algebra and that the space of all cross-sections over X has been made a linear algebra by defining the product of two cross-sections / and g to be x~ /(x)g(x). If L is now a sub algebra of the Cartesian product of all the spaces Ex which is a normed space of cross-sections over X, then L is also a normed algebra. Indeed
II/gil = sup{II/(x)g(x)II' x E X} .,; sup{II/(x)11·llg(x)111 x E X} .,; 1If11·llg II , for all / and g in L. Thus, if L is both a subalgebra and a normed space of cross-sections over X, according to Definition (1.1), then we shall say that
J.B. Prolla / Existence of Best Approximants
633
L is a normed algebra 01cross-sections over X. If, moreover L is complete, then we say that L is a Banach algebra 01 cross-sections over X. Let us now consider the following condition, where L itself is assumed to be a '?6b(X, ~)-module: (3.1)
Every closed right (resp. left) ideal in L is a '?6b(X,
~)-module.
With regard to (3.1), let us point out that it is implied by the following condition (see Kaplansky [5]): (3.2)
For every 1 E L,
1 lies
in the closure of IL (resp. LI) .
When X is compact, and L is a '?6b(X; ~)-module, using the StoneWeierstrass theorem for '?6(X, ~)-modules of cross-sections (see [8, Th. 1.30]), (3.2) follows from (3.3)
For every x E X, and every v E Ex, v lies in the closure of vEx (resp. Exv) .
Clearly, (3.3) is satisfied if Ex has an identity, or an approximate right (resp. left) identity. Theorem (3.1). Let X be a compact Hausdorff space, and let L be a Banach
algebra 01 cross-sections over X which is a '?6(X; ~)-module. Let us assume that each normed algebra Ex is topologically simple (i.e. has no proper closed two-sided ideal). Then every proper closed two-sided ideal I in L, which is a '?6(X, K)-module, is proximinal in L. Proof. By Theorem (2.7) it is sufficient to prove that I has the form 1= {f ELI I(t) = 0, t E T} for some closed subset. Let
T = {x E
xl I(x) = 0, for
all 1 E I} .
By condition (i) of Definition (1.1), the set T is closed. Let
W
= {fE
LI/(t) = 0, tE T}.
Clearly, lew, let now 1 E W, and assume, by contradiction that 1 Ii 1. Since I is a '?6(X; IK)-module, and X is compact, it follows from [8, Th.
634
J.B. Prolla / Existence of Best Approximants
1.30], that for some point x EX; f(x) does not belong to the closure of I(x) in Ex' Now the closure of I(x) is a closed two-sided ideal in Ex, and therefore reduces to {O}. Hence f(x) -::I- O. Since fEW; x g T. However, we have just seen that I(x) = {O}. Hence x E T. This contradiction shows that in fact W C 1. 0 Corollary (3.2). Let X, L and each Ex be as in Theorem (3.1), and assume
that (3.3) is true. Then every proper closed two-sided ideal in L is proximinal. Proof. We have seen that, since X is compact, (3.3)=? (3.2)=? (3.1). Corollary (3.3). Let X be a compact Hausdorff space. Every proper closed
ideal in C6?(X; K) is proximinal.
References [1] R.c. Buck, Approximation properties of vector-valued functions, Pacific J. Math. 53 (1974) 85-94. [2] F. Cunningham jr. and N.M. Roy, Extreme functionals on an upper semicontinuous function space, Proc. Amer. Math. Soc. 42 (1974) 461-465. [3] G. Gierz, Bundles of Topological Vector Spaces and Their Duality, Lecture Notes in Math. 955 (Springer, Berlin, 1982). [4] R. Godement, Theorie generate des sommes continues d'espaces de Banach, c.R. Acad. Sci. Paris 228 (1949) 1321-1323. [5] 1. Kaplansky, The structure of certain operator algebras, Trans. Amer. Math. Soc. 70 (1951) 219-255. [6] S. Machado and J.B. Prolla, An introduction to Nachbin spaces, Rend. Circ. Mat. Palermo 21 (1972) 119-139. £7] E. Michael, Continuous selections I, Ann. of Math. 63 (1956) 361-382. [8] J.B. Prolla, Approximation of Vector Valued Functions (North-Holland, Amsterdam, 1977). [9] J.B. Prolla and S. Machado, Weierstrass-Stone theorems for set-valued mappings, J. Approx. Theory 36 (1982) 1-15. [10] G. Shilov, On continuous sums of finite-dimensional rings, Sbornik N.S. 27 (1950) 471-484. [11] J. von Neumann, On rings of operators. Reduction theory, Ann. of Math. 50 (1949) 401-485.
l.A. BARROSO editor, Aspects of Mathematics and its Applications © Elsevier Science Publishers B.V. (1986)
635
SOME FUNDAMENTAL METHODS IN THE THEORY OF DIOPHANTINE EQUATIONS
P. RIBENBOIM Department of Mathematics and Statistics, Queen's University, Kingston, Ontario, Canada
To Leopoldo Nachbin, with my congratulations for a prestigious career, so determinant for the development of mathematics in Brazil
o.
Introduction
Many problems in number theory lead to the solution of equations in integers or rational numbers. Diophantus of Alexandria considered many such instances 'in his six books of Arithmetics (around the year 250). This work was translated by Bachet in the 17th century and carefully studied, among others by Fermat, who commented on Diophantus problems, made interesting discoveries and proposed many difficult problems. The study of diophantine equations amounts to the search for solutions in natural numbers, or rational numbers of polynomial or exponential equations
where aj, ... , as are natural numbers (r;;': 0, example,
X2- y3= k, X 2- dy2= ±1, 3+ay3= XU - yv= 1, X 1,
S;;':
0, r + S
;;,:
1). For
X3+ y3+Z3= aXYZ, XU + yU
=
ZU,
etc.
(where X, 1'; z, U, V are unknowns). The questions to be investigated are the following: Are there solutions? If so, how to obtain the solutions, or at least, how to determine their number, or an upper bound for their number; how to estimate the size of the solutions? Usually, an equation may have some solutions which are obvious at first sight-these are called the trivial solutions, and the investigation always refers to the non-trivial solutions.
636
P. Ribenboim / Fundamental Methods in Diophantine Equations
We discuss the following three possibilities: (1). The equation has no solution (except the trivial solutions). (2). The equation has finitely many solutions. (3). The equation has infinitely many solutions. Case (1) is ascertained by reduction to absurd: by assuming that the equation has a non-trivial solution it is possible to deduce a contradiction. There appears to be three ways of showing that (2) holds: (2a). By embedding the set of solutions into a set S which is known to be finite. (2b). By showing that there exists a natural number N such that the number of solutions is at most N. (2c). By exhibiting a positive number C such that every integer in every solution of the equation has absolute value less than C. In situation (2c) there is an algorithm to determine all the solutions of the equation-it suffices to try one after the other all tuples of integers with absolute value less than C. Of course if C is a very large number, as it is often the case, this algorithm can hardly be brought to a conclusion. The situation (2b) is less satisfactory, because even if. N - 1 of the possible solutions are already known, is there still another solution? Where to search for it? For example, it was possible to show that there exists at most 10 imaginary quadratic number fields with class number 1; of these 9 were known: Q(V=t), Q(v=2), Q(v=3), Q(v'=7), Q(v=tl), Q(\/-19), Q(\/-43), Q(\/-67), Q(\/-163). The search for the last such field lasted many years, until Heegner [21] (1952)and Stark [47] (1967) proved that in fact such a field did not exist, and for this they required quite deep and sophisticated methods. (2a) is established by showing that S contains all the solutions of the equation and that if S is assumed to be infinite this would lead to a contradiction. With this argument, there is no indication of how many or how large are the solutions. This is definitely the weakest of the three situations in case (2). Case (3) may be arrived at in the following ways: (3a). By exhibiting an infinite set of solutions. (3b). By indicating a method, which may be iterated indefinitely, to produce a new solution from any given finite set of solutions. (3c). By giving a parametrization of the set of solutions S by an infinite set P of parameters; that is a bijection
P. Ribenboim / Fundamental Methods in Diophantine Equations
637
equation X 2 + y2 = Z2, it suffices to describe the set S of primitive solutions (x, Y, z), where x, y, Z are positive pairwise relatively prime integers and y is even. Then P is the set of all couples (a, b) of positive relatively prime integers and cP : P ~ S is given by cP(a, b) = (x, y, z) with y = 2ab, Now we turn our attention to the methods which are normally used to answer the proposed questions. One is immediately struck by the variety of equations, each treated in a special way, by means of particular artifices. This tends to give the impression that the theory of diophantine equations is no more than a collection of tricky arguments applied to particular equations. It is true that there are many such instances, but this is not all-and in fact there are quite interesting methods applicable to appropriate classes of equations. Fermat's method of infinite descent was applied by Fermat to show that the equations X 4- y4 = Z2, X 4+ y4 = Z4 have only trivial solutions; Lagrange used this method with the equation 2X 4 - y4 = Z2, and Euler applied it to X 3 + y3 = Z3. The idea is quite simple and the aim is often to show that the equation has only the trivial solution (say with z = 0). The method consists in producing from a solution (x, y, z) (in non-zero integers) another solution (x', y', z') with non-zero integers and [z"] < [z], Thus if (XI' YI' ZI) is assumed to be a solution in non-zero integers, by iteration of the method one obtains a sequence of solutions in non-zero integers (x 2, Y2' Z2)' (x 3, Y3' Z3)' ... with IZI/ > IZ21 > IZ31 > ... ; this leads eventually to IZnl = 0 for some n, which is a contradiction. In this paper, we shall discuss some more recent methods, which involve algebraic geometry, real or complex analysis.
1. Diophantine Approximation
In 1909, Thue [48] devised a very interesting method to treat equations like
where ao, ai' ... , an' a are integers, ao ¥ 0, n ~ 3. Moreover, he assumed that
638
P. Ribenboim I Fundamental Methods in Diophantine Equations
is irreducible, so its roots a 1, ... , an E C are all distinct and do not belong to Q. The purpose is to show that under certain conditions, (1.1) has only finitely many solutions (x, y) in integers. If F(x, y) = aoxn + a1xn-1y + ... + an_1xyn-l + anyn = a, then ao (x - a;y) = a. If a = 0 there exists a j such that x - ajy = 0, so y = 0 (because a j Q), hence the only solution is (0,0). Now we assume a 'I- o. We have
n;=l
e
n la/ IT1x-a;yl=-,,.
;=l
ao
Let
so there exists an index j, 1:;;;;, j :;;;;, n, such that Ix - ajy I:;;;;, C 1 . Let Cz = minj"k{Ia; - a k j} > O. Since for each Yo there are at most n integers x such that F(x, Yo) = a, in order to show that (1.1) has finitely many solutions it suffices to consider the set of solutions (x, y) with Iy I> 2C1/C2, and show that it is finite, For every i'l- j we have
o<~Czlyl < Cz/yl- C1 :;;;;, la;-
ajl-IYI-Ix - ajyl :;;;;'/(a; - aj)y - (x - ajy)1 = Ix - a;yl.
So
lal 2n - 1Ia/ Ix-ajy/= lao/n;"jlx-as/
1 )
> O. If aj is not real and
C3
/yln-l
'Y is its imaginary part
P. Ribenboim / Fundamental Methods in Diophantine Equations
639
and y may have only finitely many values, so there are only finitely many solutions. Thus, let a j be real. We have shown that if (x, y) is a solution with Iy I> 2C1/C2 then xly belongs to the set S(aj , n, C3 ) of all rational numbers alb, a, b E Z, b:f. 0, gcd(a, b):; 1, such that
This leads to the question of finiteness of such sets. If they are finite, then the equation (1.1) has only finitely many solutions. Let a E R, v E R, v » 1. We say that a is approximable by rationals to the order v when there exists a C> and infinitely many rational numbers alb (with b > 0, gcd(a, b) = 1) such that
°
It may be shown that if a is not approximable by rationals to the order v then the following condition holds:
(1.2)
There exists a C' >
°such that I
a -
~I > ~: '
for every rational number alb. In turn, (1.2) implies the following condition: (1.3)
For every e > 0, a is not approximable by rationals to the order v+ e.
For example, if a E Q then a is approximable by rationals to the order 1 but not to the order 1 + e (for every E > O}-this is easy to show. With the pigeon-hole principle, Dirichlet proved in 1842, [11]: Theorem (1.1). If a is an irrational number then there exist infinitely many
rational numbers alb such that
640
P. Ribenboim / Fundamental Methods in Diophantine Equations
Thus ex is approximable by rationals to the order 2 (and the constant C = 1). Hurwitz showed that the smallest possible value for the constant Cis I/VS. The immediate question is whether or not ex is approximable by rationals to an order higher than 2. Liouville proved in 1844 [31], [32] a theorem concerning the approximation of algebraic numbers. Let ex be a real algebraic number of degree d ;;;;. 2, let F(X) = aoX d + alX d- t + ... + ad E Z [X] be its minimal polynomial, a o c;i 0, gcd(a o, a l, .. . , ad) = 1. The height of ex is defined to be H(ex) = maxooO:i",Alai/}. Let 1
C(ex)
= d(d + I)H(ex)(lexl + l)d-t .
Theorem (1.2). For every rational number alb:
Therefore, for every E > 0, ex is not approximable by rational numbers to the order d + E. Thus if d = 2 then lJ = 2 is the best value of v such that ex is approximable by rationals to the order /.I. Various theorems were established with the intention of showing that for real algebraic numbers ex g Q, Dirichlet's theorem is the best possible. Thus if ex is a real algebraic number of degree d > 2 then Thue [48] proved in 1909 that ex is not approximable by rational numbers to the order lJ = 1 + ~d + E (for every E > 0), in particular, if ex has degree 3 then ex is not approximable to the order 3. Siegel [44] proved in 1921 this statement with lJ = 2Yd + E and also lJ = dltk: + 1) + k + E for 1:<:;; k :<:;; d (for every E > 0); Dyson [12] (1947), and independently Gel'fond [19] (1949), proved it with lJ = V2d + E; Schneider [43] (1949) gave a simpler proof. Finally, in 1955, [40], Roth gave the best possible theorem. Theorem (1.3). If ex is a real algebraic number, then for every E > 0 there exists a C' > 0 (depending on ex and E) such that for every rational number alb we have
P. Ribenboim / Fundamental Methods in Diophantine Equations
641
Hence, for every E > 0, a is not approximable by rational numbers to the order 2 + E. This shows indeed that for irrational real algebraic numbers, Roth's theorem is the best possible. In other words, we have the following useful transcendence criterion: if an irrational real number a may be approximated by rational numbers with order v > 2 then a is a transcendental number. It should be noted that the constant C' in Roth's theorem is not explicitly computable, contrary to the constant C(a) in Liouville's theorem. We shall return later to this point. The above theory has been generalized for the approximation by numbers of a given algebraic number field I( (which may be different from Q) but we shall not discuss it here. We take over once more the study of equation (1.1). If n ~ 3 and aj is a real algebraic number of degree n, by Thue's theorem on diophantine approximation (1909), a j is not approximable by rationals to the order n ~ 3. So the set Seai' n, C 3) , considered in the proof, is a finite set. This completes the proof of the following theorem of Thue (1909) [48]: Theorem (1.4). If F(X, Y) = a.X" + alXn-1y + ... + an_1xyn-1 + an Y" with a o, , an E Z, a o ¥- 0, n ~ 3, a E Z, if f(X) = aoX n + n-1 alX + + an is irreducible then the equation F(X, Y) = a,
i.e. (Ll),
has only finitely many solutions in integers.
This theorem may be easily strengthened by assuming only that the roots of f(X) are all distinct and a ¥- O. The method of proof does not provide any indication about the number or size of the solutions. For example the theorem applies to the equation (1.4)
a'X" + oY"
=
c.
with a, b, c E Z, a ¥- 0, c ¥- 0, n ~ 3 .
In particular, this includes the equation (1.5)
with c ¥- O.
Roth used the same method and his sharpening of Thue's theorem (1.3) to prove the much more embracing theorem:
642
P. Ribenboim / Fundamental Methods in Diophantine Equations
Theorem (1.5). Let F(X, Y) = a.X" + a1X n-1Y + ... + an_1Xy n-1+ anyn with a o"'" an E 1, ao:l- 0, n ~ 3 and assume that f(X) = F(X, 1) has distinct roots. Let G(X, Y) E 1 [X, Y], G(X, Y) :I- 0, with degree at most n - 3. Then the equation F(X, Y) = G(X, Y)
(1.6)
has at most finitely many solutions in integers.
2. Methods from Algebraic Geometry Let f E 1 [X, Y], deg(f) = n ~ 1. The geometric method to treat the equation f(X, Y) = 0 consists in the consideration of the curve defined by the above equation and the possibility of making statements about the set of points (x, y) in the curve, with coordinates x, y E 1 or even x, y E Q. Such statements will depend on certain invariants linked with the order and the singularities of the curve. A full overview of the curve includes the points at infinity and requires to work in the complex projective plane IP' 2(C), We recall that IP' 2(C) is defined to be the set of equivalence classes of triples (x, y, z) E C 3 , (x, y, z) :I- (0,0,0), under the equivalence relation: (x, y, z) == (x', y', z')
when there exists atE C, t:l- 0,
such that x'
= xt, y' = yt,
Z'
= zt.
Each equivalence class is called a point of IP' 2(C), If P is the equivalence class of (x, y, z) then x, y, z are coordinates of P. If K is any subfield of C, the point P is defined over K if it has coordinates x, y, zEK. Given f E 1 [X, Y], deg(f) = n ~ 1, we consider the homogenized polynomial F(X, Y, Z) = znf(X/Z, Y/Z), still of degree n and coefficients in 1. For any homogeneous polynomial FE C[X, Y, Z] of degree n ~ 1, if (x, y, z), (x', y', z') E C 3 are equivalent triples (distinct from (0,0,0» then F(x, y, z) = 0 if and only if F(x', y', z") = O. Thus, it makes sense to write that F(P) = 0 or F(P) =1= 0 for any point P E IP' z{C).
P. Ribenboim / Fundamental Methods in Diophantine Equations
643
If FE C[X, Y; Z] is a homogeneous polynomial of degree n;?: 1, let '"-€F = {P E P 2(c)1 F(P) = O}. '"-€F is called the (algebraic) curve defined in p 2(C) by F and F(X, Y, Z) = 0 is an equation of the curve '"-€F' Any subset '"-€ of P 2(C) is a curve if there exists F, as above, such that '"-€ = '"-€F' We have '"-€FI F2 = '"-€FI U '"-€F2 (for F t , F 2 homogeneous non-constant polynomials). A curve '"-€ is reducible if '"-€ = ~I U '"-€2 where '"-€t, '"-€2 are curves, '"-€J 'i- '"-€, '"-€2 'i- '"-€. A curve '"-€ is irreducible if it is not reducible. A first result which is easily obtained with the theory of the resultant is the following: If F, G are homogeneous polynomials, F being nonconstant and irreducible, if '"-€o:2 '"-€F then FI G. It follows that if F, G are homogeneous non-constant irreducible polynomials and '"-€F = '"-€o then F = aG with a E C, a¥- 0; moreover the curve '"-€F is irreducible. Every curve is the union, in a unique way, of finitely many irreducible curves. This allows to restrict, most of the time, the attention to irreducible curves. If '"-€ = '"-€F the order of '"-€ is defined to be the degree of F. Curves of order 1 are the lines, of order 2 are conics, of order 3 are cubics in PiC). The order of '"-€ is equal to the maximal number of distinct intersections of '"-€ with any line. '"-€ is defined over the field ~ ~ C if '"-€ = '"-€F where F has coefficients in ~; ~ may be taken as a finitely generated field extension of Q. If '"-€ is defined over K, if K is contained in the subfield l of C, let '"-€(l) denote the set of points P E '"-€ having coordinates in l. Now we discuss the singularities of an irreducible curve '"-€ = '"-€F' where F is a homogeneous polynomial of degree n ;?: 1. To begin, we note that each partial derivative of order s ;?: 0 of F is a homogeneous polynomial of degree n - s. The point P E '"-€ has multiplicity r;?: 1, when all partial derivatives of F of order at most r - 1 vanish at P, while there is some partial derivative of order r which does not vanish at P. This notion is independent of the choice of the equation of the curve. If P has multiplicity 1 it is called a simple or regular point of '"-€. If P has multiplicity at least 2 then P is called a singular point of C(j. The multiplicity of each point of '"-€ is at most equal to n = deg(F). If P is a point with multiplicity r » 1 of '"-€, every line .:£ containing P intersects '"-€ at P with 'multiplicity' at least r; there are lines .:£t, ••• , .:£, (not necessarily distinct) which intersect '"-€ at P with multiplicity at least r + 1. These lines are the tangents of '"-€ at P. P is an ordinary singular point of '"-€, with multiplicity r;?: 2 when '"-€ has r distinct tangents at P.
P. Ribenboim / Fundamental Methods in Diophantine Equations
644
For example, if r = 2 then P is called a double point. It is ordinary when r6 has two distinct tangents at P, and in this case P is called a node. P is a cusp when r6 has only one tangent at P. If r6 is an irreducible curve of order n, there is an important inequality connecting the multiplicities of its points and the order:
(2.1)
(n-l)(n-2)~
2:
rp(rp - l )
PE'€
(rp denotes the multiplicity of P E r6). As a result the number of singular points of cg is bounded above by ~(n - l)(n - 2). Thus for example every irreducible cubic curve has at most one singular point, which is a double point. One idea which is quite natural in the study of a curve r6 is to replace it by another curve r6', which is easier to study than r6, and related to cg in a way that may be explicitly described. This may be effected by means of a birational transformation. Let r6 be a curve. The curve r6' is a birational transform of r6 if there exist homogeneous polynomials A, B, C, D, A', B', C', D' E C[X, Y, Z] such that if 'if) = {P E r61 D(P) = O} and 'if)' = {P' E r6'1 D'(P') = O}, and if 0, 0' are the maps 0: r6\ 'if) ~ C6'\ 'if)', 0': cg'\ 'if)' ~ r6\ 'if) given by O(x, y, z) = (x', y', z'), O'(x', y', z') = (x, y, z) with x' =
x=
A(x, y, z) D(x, y, z )'
-----''--'--''--'--'-
A'(x', y', z') D'(x', y', z')'
B(x, y, z) y' = -----''--'--''--'--'D(x, y, z)' y=
B'(x', y', z') D'(x', y', z')'
,
z =
C(x, y, z) D(x, y, x)'
z=
C'(x', y', z') , D'(x', y', z')
then 0, 0' are inverse bijections (note that 'if), 'if)' are finite sets and that the maps are independent of the choice of the coordinates of the points). It follows that r6' is in turn a birational transform of r6, and this defines an equivalence relation on the set of curves in IP' 2(C), The noteworthy special case where x' = yz, y' = zx, z' = xy and x = y'z', y = z'x', z = x'y' is called a quadratic transformation. An important theorem states: every irreducible curve r6 is birationally equivalent to a curve of IP' 2(C) whose singular points are ordinary. Actually, by allowing curves in higher dimensional projective spaces, it may be shown that every irreducible curve is birationally equivalent to a
P. Ribenboim / Fundamental Methods in Diophantine Equations
645
non-singular curve, i.e. a curve without singular points; this is called the resolution of singularities. We note that if ee and ee' are birationally equivalent then the orders n of ee and n I of ee' need not be equal. Also the fields of definition of ee and ee' need not be the same; however each one is contained in a finitely generated extension of the other. If ee, ee' are defined over the field IK, if they are birationally equivalent and the above polynomials A, B, C, D, A', B ', C, D ' have coefficients in IK, we say that ee, ee' are IK-birationally equivalent. It may be shown that we have the equality (2.2)
(n - 1)(n - 2) -
L
rp(rp - 1) = (n ' - 1)(n ' - 2) -
n~
L
rp.(rp' - 1)
~r
when ee, ee' are birationally equivalent and all its singular points are ordinary. This common value is therefore an invariant of the irreducible curve ee (with only ordinary singular points) by birational transformation; the genus of ee is defined to be (2.3)
g=~[(n-l)(n-2)- L
rp(rp-l)],
PE~
and by (2.1) g ~ O. It will be clear that two curves may have the same genus without being birationally equivalent. All the above concepts may be found, explained in more detail, in the standard books on algebraic curves; see for example [51].
2.1. Curves of Genus 0 A curve ee is a rational curve if there exist polynomials A, B, C E C [T] (T an indeterminate) with no common zero in C such that: (i). The map 1T: C ~ IP' z{C) defined by 1T(t) = (A(t), B(t), C(t» has image equal to ee. (ii). With at most finitely many exceptions, every point of ee corresponds to a unique value of t E C. It may be shown that ee has genus 0 if and only if ee is a rational curve; actually any rational curve is birationally equivalent to a line in IP' 2(C), More interesting for our purpose is the classification of curves of genus o defined over Q, with respect to Q -birational equivalence, that is
P. Ribenboim / Fundamental Methods in Diophantine Equations
646
birational transformations given by rational functions with coefficients in Q. In 1890 Hilbert and Hurwitz [23] showed that if ~ is a curve of genus 0, order n, defined over Q, it is Q -birationally equivalent to a line !t containing (0,0,1) and defined over Q-when n is odd; or to a conic ~' defined over Q-when n is even. Thus if n is odd the set ~(Q) of points with rational coordinates is infinite. If n is even, either ~'(Q) is empty or infinite, and correspondingly ~(Q) is finite or infinite. Altogether, if it is recognized that the curve ~, associated with the diophantine equation f(X, Y) = 0, has genus 0, then the above classification indicates the number of rational points in the curve ~, hence the number of rational solutions of the given equation.
2.2. Curves of Genus 1 First we note that every irreducible non-singular cubic must have genus 1. Conversely, it may be shown that every curve with genus 1 is birationally equivalent to an irreducible non-singular cubic. So, up to birational equivalence it suffices to classify the non-singular cubic curves. Moreover, it may also be established that every non-singular cubic is birationally equivalent to one with equation (2.4)
Fa,b(X, Y, Z) = y 2 Z - 4X 3 + aXZ 2 + bZ3 with
a 3 - 27 b 2 ¥- 0, a, b E
Curves of the above type will be referred to as cubics in Weierstrass normal form and were studied in connection with doubly periodic functions. A clear exposition of the following developments may be found in Robert's excellent book of 1973 [39]. Let f be a doubly periodic meromorphic function defined over the complex plane. The set of periods of f is a lattice L, that is, the set of all linear combinations with coefficients in l. of two basic periods w, to' such that r = co'[o» g R; actually we may choose them so that Imtr) > O. The Weierstrass function associated to a lattice L is defined to be (2.5) where
L'
indicates the sum over all the elements of L different from O.
P. Ribenboim / Fundamental Methods in Diophantine Equations
647
IV L is holomorphic everywhere, except at the points of L, where it has poles of order 2, with residue O. Moreover, IV L is a doubly periodic function, with set of periods equal to L. Thus, it is an elliptic function (with respect to L). Its derivative is (2.6)
It is again holomorphic everywhere, except at the points of L, where it has poles of order 3, with residue O. It is also a doubly periodic function, with set of periods equal to L, hence also an elliptic function (with respect to L). Each series L' 1/w 2k (with k ;;. 2) converges absolutely. Let (2.7)
, 1 g2=gz
w
(2.8)
Then g~ - 27g~ ~ 0, and the function IV L satisfies the following differential relation: (2.9) It follows that the mapping z E C ~ (Z3 IvL(z), z31v~(z), Z3) E IP Z
parametrization of the non-singular cubic with equation (2.10)
We denote this cubic by CfiL • Z, Z' E C give the same point of CfiL if and only if z - z' E L. So we have a bijection T L = C/L-:;t CfiL. Since L is an additive subgroup of C, T L has the structure of abelian additive group, quotient of C by L; moreover, in T L it may be defined in a natural wayan analytic structure making it into an analytic manifold of complex dimension 1, so it is a curve which carries a structure of an abelian additive group. Thus endowed, T L is the torus associated to L. By means of the bijection 7T', this structure is transported to the associated curve Cfiv So, from every lattice we obtain a non-singular cubic curve, which is also an abelian additive group and parametrized by
648
P. Ribenboim / Fundamental Methods in Diophantine Equations
means of the Weierstrass elliptic function and its derivative-hence CffL is called an elliptic curve. We discuss the addition on CffL • If Zj + L, Zz + LET L' if P, = 7T(Zj + L), Pz = 7T(ZZ + L) are the corresponding points in Cffv then P, + P z = 7T(Z, + Zz+ L) = ({Zj + ZZ)3/1' L(Zj + zz), (z, + zz/ /I'~(z, + zz), (z, + zi)· This may be translated into the addition' theorem for the function /l'L: (2.11)
and for the function /I'~, which is obtained by differentiating the above formula. The addition on CffL has also a more classical geometric interpretation, and as such it is the basis of the secant-tangent method known in part by Bachet, and explored by Fermat. The line at infinity of IP' z(C) is the set of points defined by the equation Z = 0, that is, the points (x, y, 0), with x, y not both equal to O. The intersection of the elliptic curve Cff L with the line at infinity consists only of the point (0, 1,0) (because /l'L has a pole of order 2 and /I'~ has a pole of order 3 at every lattice point); the line at infinity is tangent to CffL at (0, 1,0). If PI' Pz, P3E eeL they are said to be collinear if either: they are distinct and belong to a line, or P z = P3 and PI lies in the tangent at CffL to the point Pz' So if Pz = P3 = (0, 1,0) then P, = (0, 1,0). With this definition, it may be shown that P, + Pz + P3 = 0 on the curve CffL exactly when PI' r; P3 are collinear. If K is any subfield of C and if the curve CffL contains a point P with coordinates in IK, the above procedure allows us to produce other points in CffL with coordinates stilI in IK. Most important for the theory of diophantine equations is the case where IK = Q. The set eeL(IK ) of points of CffL with coordinates in IK is an abelian additive group, unless it is empty. The celebrated theorem of Mordell (1922) [34], later extended by Wei! (1929, 1930) [52], [53] asserts: Theorem (2.1). If IK is a number field then CffL (IK ) is a finitely generated abelian group.
As such, it is the product of a finite group (consisting of the torsion points or division points, with coordinates in IK of CffL ) and a free abelian
P. Rlbenboim / Fundamental Methods in Diophantine Equations
649
group of finite rank. For each curve CffL it is of course important to determine the division points and a basis for the set of solutions of infinite order. Now, we discuss when two curves associated to lattices are birationally equivalent. If L, L' are lattices then Cffv CffL , are birationally equivalent if and only if there is an analytical group isomorphic between CffL and CffL' (i.e. one which is biholomorphic). This happens if and only if the lattices L, L' are homothetie, that is L = al:' (with some a E C\{O}). Let H = [z E C[ Im(z) > O} be the upper half plane. To the lattice L =? wEB? w', we associate or = w'/w E H. If or' corresponds to L' then L, L' are homothetic if and only if there exists integers a, b, c, d, with ad - be = 1 such that
ar + b or'=--. cor + d
(2.12)
In conclusion, the birational equivalence classes of elliptic curves (associated with lattices) are classified by the orbits of the upper half plane H under the action of the group of unimodular transformations given by (2.12). A set of representatives of these orbits may be chosen to constitute the fundamental domain D = {z E
n.
It still remains to assert the fact that every non-singular cubic curve in Weierstrass normal form, i.e. with an equation (for a, bEe, a 3 - 27 b2 ~ 0) , is associated with some lattice L, that is, a = g2(L), b = g3(L). This is indeed the case and may be established by means of the theory of modular forms associated with the group SLlZ) of 2 x 2 unimodular matrices with entries in ? . Summarizing, if Cff is an irreducible curve of genus 1, defined over the field ~, then it is birationally equivalent to a non-singular cubic curve Cff', in Weierstrass normal form, defined over a field ~', a finitely generated extension of ~; the set of ~' -points of Cff and Cff' differ at most at a finite subset and Cff'(~') is a finitely generated abelian group.
650
P. Ribenboim / Fundamental Methods in Diophantine Equations
2.3. Curves of Genus g> 1 A central place in the study of such curves has been occupied by the famous Modelll's conjecture (1922) [34]. This conjecture has been finally settled by Faltings in 1983 [14]: Theorem (2.2). If cg is a non-singular irreducible curve of genus g> 1 defined over the number field IK, then cg has at most finitely many points with coordinates in IK.
Faltings' work appeals to very sophisticated ideas in algebraic geometry and number theory, a combination of subjects which well deserves to be referred to as 'diophantine geometry'. The fundamental notion of height is suitably defined, with the intention to play a role analogue to the heights in number fields. The finiteness of the set cg(lK) is established by embedding cg(lK) into a set of points which have bounded height, relative to an appropriate height function. Nothing of this is easy and the proof of Faltings' settles at the same time various deep-lying conjectures in the subject. See also Deligne (1983) [10]. Faltings' theorem may be applied to Fermat's equation (2.13)
X"
+ Y"
=
Z",
Indeed the genus of the associated curve which is non-singular is equal to ~(n - 1)(n - 2) ~ 2 for n ~ 4. Thus, there exist at most finitely many points in Fermat's curve (2.13) with coordinates in Q. In other words, for every exponent n ~ 4 there exist at most finitely many triples (x, y, z) of pairwise relatively prime non-zero integers, satisfying Fermat's equation with exponent n. 2.4. Points with Integral Coordinates The original aim in studying diophantine equations is, as already stated, the determination of solutions with coordinates in 1.. The method developed by Siegel passes through the consideration of the curve associated to the diophantine equation. In this line of work, but with a purely arithmetical proof, first we note the theorem of Landau and Ostrowski (1920) [27]: the equation (2.14)
(m
~3)
P. Ribenboim / Fundamental Methods in Diophantine Equations
651
with a, b, c, dE Z, a ¥- 0, d ¥- 0, b Z - 4ac ¥- 0, has only finitely many solutions in integers. The special case with the discriminant b Z - 4ac not a square, had been settled by Thue in 1917 [49]. In particular, if m ~ 3 and k ¥- 0, then the equation
X Z - Y"
(2.15)
=
k
has only finitely many solutions in integers. This may be rephrased as follows: let ZI < Zz < Z3 < ... be the increasing sequence of integers which are squares or m-th powers. Then limi~",(zi+l - Zi) = 00. In this connection, P6lya [37] proved in 1918: If a, b, c E Z, a ¥- 0, b Z - 4ac ¥- then lim,...'" P[ ax' + bx + c] = 00; here, for every y E IR, Y ~ 1, P[ y] denotes the largest prime factor of the integer [y]. Gauss had indeed conjectured the special case that for every c ~ 1, lim,...ec P[X Z + c] = 00. In 1926 Siegel [45] proved an analogous result for the hyperelliptic equation
°
(2.16)
(n
~
3)
with ao, ai' ... , an' a E Z, a o ¥- 0, a¥- O. Finally, in 1929, using birational transformations, Siegel [46] established the far-reaching generalization:
Theorem (2.3). Let f(X, Y) E Z [X; Y] be an absolutely irreducible polynomial of degree n, let F(X; Y, Z) = Znf(X/Z, Y/Z) be the homogenized polynomial and cgF the associated curve. If cgF has genus at least equal to 1 then the equation f(X, Y) = has only finitely many solutions in integers.
°
This theorem holds also when cgF has genus 0, but at least three points (counted with multiplicities) on the line at infinity. In this respect see also Leveque (1964) [29]. It should be pointed out that Siegel's theorem gives no indication about the number or size of the solutions of the equation. A noteworthy application of Siegel's theorem is the following: If m, n ~ 3 and k ~ 1 are integers, then the equation (2.17)
X'" - Y"
=
k
652
P. Ribenboim / Fundamental Methods in Diophantine Equations
has only finitely many solutions in integers. So if Z\ < Z2 < Z3 < ... is the increasing sequence of integers which are m-th powers or n-th powers, then lim;_",(zi+\ - Zj) = 00. This follows also from a theorem of Mahler (1953) [33]: If a, b are integers, m, n ~ 2 then
(limit for max{x, y}~oo, with x, y relatively prime natural numbers). In this class of questions, there is still outstanding the conjecture of Landau: If z\ < Z2 < Z3 < ... is the increasing sequence of all integers which are powers (with arbitrary exponent greater than 1) of other integers, then lim;_",(z;+\ - z;) = 00.
3. Effective Lower Bounds for Linear Forms in Logarithms A very important and powerful method to study certain classes of diophantine equations is based on the determination of effective lower bounds for linear forms in logarithms. This study has also implications on the theory of transcendental numbers and diophantine approximation. Let n ~ 1 and consider the set of all linear forms in logarithms, which do not vanish (3.1) (bo, ... , bn, a l , · · . , an E Z; b., ... , b; "e 0; a l , . . . , an ~ 2) and such that the coefficients bo,"" b; as well as the integers al"'" an belong to given sets defined by appropriate bounds. The purpose is to show the existence of a common lower bound for the absolute values IA I of all non-zero linear forms in the set, which is expressible in terms of the given bounds. We illustrate with an example: Let n ~ 1, A ~ 2, B ~ 1 and consider the set of all linear forms A "e 0 such that bo = 0, max{aJ:S;; A, max{lb;I}:s;; B. Then for each such A we have
IA I> exp(-nB log A).
P. Ribenboim / Fundamental Methods in Diophantine Equations
653
This lower bound is rather easy to prove, but the quantity in the right-hand side is too small, i.e. nB log A is too large, in order to be useful to derive any interesting consequences. Preliminary results were obtained by Morduhai-Boltovskoi (1923) [36] who gave a good lower bound for forms bo + b, log a j (when at is an algebraic number, a j ~ 0, 1) and by Gel'fond (in 1935 [18], and again in 1949 [19]). In a series of penetrating and difficult papers (1966 to 1968, [2], as well as in later papers), Baker obtained sharper estimates for lower bounds of linear forms in logarithms, which have found important and very diverse applications. These are now linear forms of logarithms of algebraic numbers. Let log denote the principal determination of the logarithm function. For each algebraic number a let H(a) denote its height (as defined in Section 1). Given the integers n ~ 1, d ~ 1 and the positive real numbers Il, Il', B, with il > il' (and il' = e when n = 1), let d(il, il') denote the set of all n-tuples (a p . . . , an) such that (i). Each a; is an algebraic number of degree at most d, and a; ~ 0, 1
(fori=I, ...• n). (ii). If A; = max{H(a;), 4} then AI:O:;; A 2 :o:;; · · .:0:;; An' (iii). log AI' .. log An :0:;; il, log AI' .. log An_l:O:;; il' (this last condition is trivial when n = 1). Let r!lJ(B) denote the set of all (n + Ij-tuples ({3o, f3j, ... , f3n) such that (i). Each f3; is an algebraic number of degree at most d and f3; ~ (for i = 1, ... , n). (ii). max{H(f3o), H(f3,), ...• H(f3n)' 4}:O:;; B. Consider the set S = S(n, d. il, Il', B) of all linear forms in logarithms
°
(3.2) such that A ~ 0, (aI' ... , an) E d(il, il'), (f3o, f31' ... , f3n) E r!lJ (B). Baker showed (see [7]): Theorem (3.1). For every A in the above set S:
IA I> exp(-Cil log (il') log (ilB»,
°
Moreover, if f30 = and each f3; every A in the corresponding set
(1:0:;;
i
where C :0:;;
=
(16 dn ?OOn
.
n) is a rational integer then for
654
P. Ribenboim / Fundamental Methods in Diophantine Equations
IA I> exp(-ca log(fl') log B), with the same value of C indicated above. Baker's method of proofs leads to a lower bound of the form
with appropriate absolute constants 'Yo, 'YI' 'Y2' 'Y3' 'Y4' There has been much work done to improve these bounds, but this falls beyond the scope of this paper. We shall be content to quote explicitly the following variant of Baker's theorem, which was given by Tijdeman (1956) [50] and has been successfully used in applications. Given the integers n ~ 1, d ~ 1, and the real numbers A, A', B ~ e, let d'(A, A') denote the set of all n-tuples (al' ... , an) such that (i). Each a j is an algebraic number of degree at most d and a j 'i- 0, 1 (for i= 1, ... ,n). (ii). max{H(aJ, ... , H(an _ I ) , e} "'" A', max{H(an ) , e} "'" A (so A' = e when n = 1). Let fJ}J'(B) denote the set of all n-tuples (bl , ••. , bn ) such that: (i). Each b, E I, bj'i- 0 (for i = 1, ... , n). (ii). max{lbll, ... , Ibn I, e} ~ B. Consider the set S' = S'(n, d, A, A', B) of all linear forms in logarithms (3.3)
such that A 'i- 0, (a l , • • • , an) E d'(A, A'), (bl , Tijdeman showed:
• . • ,
bn ) E fJ}J'(B).
Theorem (3.2). For every A in the above set S':
IA I> exp(-C(log A'ln
2+16n
10g(A) 10g(B» ,
where C is a positive real number (depending only on n, d) which is explicitly computable. Now we describe the main applications of these results.
P. Ribenboim / Fundamental Methods in Diophantine Equations
655
3.1. Applications to Transcendental Number Theory The first noteworthy application is the following linear independence result of Baker: Theorem (3.3). Let a l , . . . , an be non-zero algebraic numbers. If log aI' , log an are linearly independent over Q then 1, , log an are linearly independent over the field of all algebraic log a l ,
numbers. From this, it may be deduced without difficulty the following corollaries: Corollary (3.4). If a l , · .. , an' 131' ... , 13n are non-zero algebraic numbers and 8 = L7=1 13j log a; of 0 then 8 is transcendental. Corollary (3.5). If n ~ 0, a
gebraic numbers then e/30a~l
l,
and 130,131' ... , 13n are non-zero ala~n is a transcendental number.. ,
an
Corollary (3.6). If aI' , an' 131' ... , 13n are non-zero algebraic numbers, if each a j of 1 (i = 1, , n) and 1,131' ... ,13n are linearly independent over Q, then afl ... a~n is a transcendental number. These corollaries contain, as particular cases, various famous transcendency results, including the theorem of Gel'fond [17], [18], obtained independently by Schneider (1934) [42]: Corollary (3.7). If a is an algebraic number, a of 0, 1, if 13 is an algebraic number, 13 fi. Q, then a/3 is a transcendental number. Thus, for example, 2v2 , V2v2 are transcendental numbers. Also e" = (-1)-; is transcendental. From Corollary (3.5), with n = 0, we have the following result which includes the theorem of Hermite (1873) [22]: Corollary (3.8). If 13 is a non-zero algebraic number, then e/3 is a transcendental number; in particular e is a transcendental number.
656
P. Ribenboim / Fundamental Methods in Diophantine Equations
And now we deduce also Lindemann's theorem (1882) [30]: Corollary (3.9).
7T
is a transcendental number.
Indeed, would 7T be algebraic, then i 7T is algebraic, hence e i " would be transcendental. Similarly:
= -1
Corollary (3.10). If a is a non-zero algebraic number such that log a ¥ 0 then 7T + log a is transcendental.
Indeed, if 7T + log a = f3 is an algebraic number then from em = -1 then -i 10g(-1) + log a - f3 = 0; thus 10g(-1), log a, 1 are linearly dependent over the field of all algebraic numbers, hence by Theorem (3.2), 10g(-1), log a are linearly dependent over Q. SO there exist m, n E Z such that m 10g(-1) + n log a = 0, hence (-l)m a n = 1 and a 2n = 1, therefore log a = 0, against the hypothesis.
3.2. Applications to Diophantine Approximation As we have already pointed out, the constant C' which appeared in Roth's Theorem (1.3), is not effectively computable. As an application of his method, Baker [4J obtained in 1968 an effective version of a theorem on diophantine approximation, which however guarantees only a weaker order of approximation by rationals. The following sharpened version with K > 1, was published in a joint paper with Stark [8] in 1971:
Theorem (3.11). Let a be a real algebraic number of degree d > 3, let K > 1. Then there exists an effectively computable real number C = C(a, K) such that
I -~l >-:-::-C~ a
b
b d - (Jog b )1- 1f<
,
for every rational number a/b. 3.3. Applications to Diophantine Equations As we have indicated in the Introduction, one of the important features of the effective determination of constants C, as in Baker's method, is
P. Ribenboim / Fundamental Methods in Diophantine Equations
657
that it allows to conclude that for certain classes of diophantine equations every integer in any solution of the equation has absolute value less than C. We review now various instances where these ideas have been applied. (1). First we deal with Mordell's equation (with k a non-zero integer) . In his book VI of Arithmetics, Diophantus of Alexandria considered the following problem 19: to find a right-angled triangle with integer sizes a, b, c such that the perimeter is the cube of an integer and the sum of the area and the hypotenuse is a square. Thus a 2 + b 2 = c', a or b is even, a + b + c = x 3 , the area is ~ab and ~ab + c = y2. This leads to y2 = x 3 + b(~a - 1) - a so x, yare the solutions of an equation of type y2 = X 3 + k with k a non-zero integer (because a 2 + b 2 is a square). Fermat considered the equation y2 = X 3 - 2 and claimed that x = 3, y = 5 is the only solution in natural numbers.' Mordell studied extensively this equation (see for example his book, (1969) [35], where he proves Fermat's claim on page 124). It follows from Siegel's theorem, or from direct considerations that this equation has only finitely many solutions in integers. By means of his lower bounds in linear forms of logarithms, Baker proved in 1968 [4]: Theorem (3.12). If x, yare integers such that y2 = x 3 + k (where k is a given non-zero integer) then
(2). The elliptic equations are those of the form (3.4)
y2= f(X) ,
1 ••• 'II faut trouver un carre qui, augrnente de 2, fasse un cube .. .' Peut-i1 avoir en nombres entiers, un autre carre que 25 qui augmente de 2, fasse un cube? Cela paraft certainement au premier abord difficile a discuter; cependant je puis prouver, par une demonstration rigoureuse, que 25 est bien Ie seul carre en tier qui soit inferieur a un cube de deux unites. En nombres fractionnaires, la methode de Bachet fournit une infinite de tels carres, mais la theorie des nombres entiers, qui est tres belle et tres subtile, n'a pas ete connue jusqu' a present, ni par Bachet, ni par aucun auteurdont j'aie vu les ecrits,
P. Ribenboim / Fundamental Methods in Diophantine Equations
658
where f(X) = aoX 3 + a l X Z + a.X + a3 with a, E I, ao 'i- a. More generally, the hyperelliptic equations are those of the form
yZ= f(X) ,
(3.5)
where f(X) = aoX" + aIX,,-1 + ... + a" with a, E I, a o 'i- a and n > 3. If f(X) has at least three distinct roots, then by Siegel's theorem the equations have finitely many solutions in integers. With Baker's method we may assert the more precise statements: Theorem (3.13). If the right-hand side of the elliptic or hyperelliptic equation has at least three distinct zeros, if (x, y) is any solution in integers then in the elliptic case
max[lx I, Iy i} < exp«106 A)IO') ,
and in the hyperelliptic case
maxj] x I, Iy I} < exp exp exp(n 10,,3 A ,,2) , where n f(X).
=
deg(f) > 3 and A is the maximum of the absolute values of
In the class of elliptic equations we have Mordell's equation already considered, and among the hyperelliptic equations we have: (3.6)
Thus if x" - yZ = k then rnaxl]x], Iy/}< exp expexp(n lO,,3/k l,,2) (3). The superelliptic equations are those of the form Y" = f(X) ,
(3.7)
where m ~ 3 and f(X) n ~2.
=
a.X" + aIX,,-1 + ... + a" with aj E Z, ao 'i- 0,
Theorem (3.14). If f(X) has at least two distinct roots and (x, y) is a solution of the superelliptic equation (3.7) then
P. Ribenboim / Fundamental Methods in Diophantine Equations
659
where A is the maximum of the absolute values of the coefficients of f(X).
This class of equations comprises many interesting instances, like (3.8)
Xn+(X+kt=zn,
(3.9)
xn+yn=(Y+kt,
with n > 3, k ¥- 0 an integer, or (3.10)
X" - Y" = k,
with max{m, n};;;:: 3, k ¥- O. If (x, z) satisfies (3.8) then maxj]»], [z < exp exp«5n)tO(n IOnA
n
)n
2 )
where
A = max{(;)lk/ i } . I.,;,i";'n
So Fermat's equation (3.11)
X" + Y" = Z"
(with n > 3),
has at most finitely many solutions in integers (x, y, z) such that y - x = k (given integer). Similarly, (3.11) has at most finitely many solutions in integers (x, y, z) such that z ~ y = k (given integer). Of course, these conclusions were obtained without appealing to Faltings' Theorem (2.2) and were first obtained by Inkeri [25] in 1976.The difference now is that there is an effective bound for the size of the solutions. As for (3.10), if m ;;;:: 3, and x" - ym = k then
(4). For Thue's equation, Baker showed in 1968:
Theorem (3.15). Let F(X, Y) E Z [X, Y] be an irreducible homogeneous polynomial of total degree n > 3, let a E Z (it may be assumed positive,
660
P. Ribenboim / Fundamental Methods in Diophantine Equations
without loss of generality). If x, yare integers which satisfy the equation F(x, y) = a then
n
maxj]r I, Iy < exp«nAYlOn)5 + (log a ?n+2) , where A is the maximum of the absolute values of the coefficients of F(X, Y). (5). In 1976, Schinzel and Tijdeman [41] applied Baker's work to exponential-polynomial diophantine equations, proving the following theorem of a novel kind:
Theorem (3.16). Let f(X) E 1 [X] have at least two distinct zeros. There exists an effectively computable number C = C(f) > 0 with the following property: If x, y, m are integers such that y 'I 0, m ~ 1 and f(x) = ym then
m
f
has at least two distinct zeros and
m~3.
(6). One of the most striking applications of Baker's ideas was achieved by Tijdeman in respect to Catalan's equation (3.12)
XU - y
V =
1,
where X, Y, U, V are unknowns; the problem is to determine solutions in integers x, y, u, v ~ 2. One solution is obvious: x = 3, Y = 2, u = 2, v = 3. Catalan [9] conjectured in 1844 that this is the only solution of (3.12). For a historical survey of the attempts to solve this equation, the reader may wish to consult my paper (1984) [38]. Tijdeman proved what is now the best result about this equation: Theorem (3.18). There exists an effectively computable real number C> 0 such that if XU - yV = 1, with integers x, y, u, v ~ 2 then x, y, u, v < C.
P. Ribenboim / Fundamental Methods in Diophantine Equations
661
Langevin [28] calculated in 1976 that C < exp exp exp exp 730 . So, even though there is, in principle, an algorithm to find all solutions (namely, to try one after the other, all quadruples (x, y, u, v) with 2:;;; x, y, u, v < C), in practice this cannot be accomplished. As we have seen, it is a common feature in all the above results that they involve bounds which are literally astronomical. A progress in the theoretical treatment should bring about a substantial reduction in the size of the constants-but this seems beyond the power of the present methods.
4. Conclusion Methods involving estimates should be expected in dealing with diophantine equations. Indeed, among the integers which satisfy a given equation, it isconceivable that a deviation of pattern may occurforthe smaller integers, but that the behaviour becomes ultimately regular. After a superposition of conditional estimates expressing different rates of growth of functions attached to the equation, each one valid from some point on it is possible to determine constants beyond which all the required conditions are fulfilled. We hope that this survey made apparent the existence of powerful methods based on estimates and geometry, which provide deep information on the integers solution of equations. A happy situation where a combination of methods of analysis (approximation) and geometry (shape) lead to a solution of diophantine equations (discrete).
References (1) e.G. Bachet, Diophanti Alexandrini Arithmeticorum Libri Sex. et de Numeris Multangulis Liber Unus (texts in Greek and Latin) (S.H. Drovart, Paris, 1621), Second ed. by S. de Fermat. 1670, with notes by P. de Fermat. [2] A. Baker, Linear forms in logarithms of algebraic numbers I, II, III, IV. Mathematika 13 (1966) 204-216, 14 (1967) 102-107,220-228, 15 (1969) 204-216. [3] A. Baker, On the representation of integers by binary quadratic forms. Philos. Trans. Royal Soc. London 263 (1968) 173-191. [4] A. Baker, The diophantine equation y2 = ax 3 + bx? + ex + d, J. London Math. Soc. 43 (1968) 1-9.
662
P. Ribenboim / Fundamental Methods in Diophantine Equations
[5] A Baker, Bounds for solutions of the hyperelliptic equation, Proc. Cambridge Philos. Soc. 65 (1969) 439-444. [6] A Baker, Transcendental Number Theory (Cambridge Univ. Press, Cambridge, 1975). [7] A. Baker, The theory of linear forms in logarithms, In: Transcendence Theory, Advances and Applications, ed. A Baker and W. Masser (Academic Press, London, 1977) 1-27. [8] A. Baker and H.M. Stark, On a fundamental inequality in number theory, Ann. of Math. 94 (1971) 190-199. [9] E. Catalan, Note extraite d'une lettre adressee a l'editeur, J. Reine Angew. Math. 27 (1844) 192. [10] P. Deligne, Preuve des conjectures de Tate et de Shafarevitch (d'apres G. Faltings), Sem, Bourbaki 36' ann. 1983/84 616. [11] G.L. Dirichlet, Verallgemeinerung eines Satzes aus der Lehre von den Kettenbriichen nebst einigen Anwendungen auf die Theorie der Zahlen, Sitzungsber. Preuss. Akad. Wiss. Berlin (1842) 93-95. Reprinted in Werke I (G. Reimer, Berlin, 1889) 633-638. Reprinted by (Chelsea, New York, 1969). [12] F.J. Dyson, The approximation to algebraic numbers by rationals, Acta Math. 79 (1947) 225-240. [13] L. Euler, Vollstandige Anleitung zur Algebra, 2 Vol., Royal Acad. Sci. St. Petersburg. 1770, also in Opera Omnia Ser. I, Vol. I (B.G. Teubner, Leipzig, 1911) 486-490. [14] G. Faltings, Endlichkeitssatze fiir abelsche Varietaten iiber Zahlkorpern, Invent. Math. 73 (1983) 349-366. [15] P. de Fermat, Ad Problema XIX, Liber VI, Arithmeticorum Diophanti, In: Oeuvres I, 333-334 (Latin) and III, 269 (French) (Gauthier-Villars, Paris, 1891, 1896). [16] P. de Fermat, Ad Problema XX, Liber VI, Arithmeticorum Diophanti: Area trianguli rectanguli in numeris not potest esse quadratus, In: Oeuvres I, 340 (Latin) and III, 271-272 (French) (Gauthier-Villars, Paris, 1891, 1896). [17] AO. Gel'fond, Sur Ie septierne problerne de Hilbert, Izv, Akad. Nauk SSSR 7 (1934) 623-630. [18] A.O. Gel'fond, Sur Ie septierne problerne de Hilbert, Dokl. Akad. Nauk SSSR 2 (1934) 1-6. [19] AO. Gel'fond, The approximation of algebraic numbers by algebraic numbers and the theory of transcendental numbers, Uspekhi Mat. Nauk 4 (1949) 19-49. (Transl. Amer Math. Soc., Ser. 1,2 (1962) 81-124. [20] T.L. Heath, Diophantus of Alexandria, A Study in the History of Greek Algebra (Cambridge Univ. Press, Cambridge, 1885). Reprinted by (Dover, New York, 1964). [21] K. Heegner, Diophantische Analysis und Modulfunktionen, Math. Z. 56 (1952) 227-253. [22] C. Hermite, Sur la fonction exponentielle, C.R. Acad. Sci. Paris 77 (1873) 18--24, 74-79,226-233,285-293. Reprinted in Oeuvres III (Gauthier-Villars, Paris, 1912). (23] D. Hilbert and A Hurwitz, Uber die diophantischen Gleichungen von Geschlecht Null, Acta Math. 14 (1890) 217-224. Reprinted in Mathematische Werke von A Hurwitz II (Birkhauser, Basel, 1933) 116-121. [24] A Hurwitz, Uber die angenaherte Darstellung der Irrationalzahlen durch Rationale Bruche, Math. Ann. 39 (1891) 279-284. Reprinted in Mathematische Werke II (Birkhauser, Basel, 1933) 122-127. [25] K. Inkeri, A note on Fermat's conjecture, Acta Arith. 29 (1976) 251-256. [26] E. Landau, Diophantische Gleichungen mit endlich vielen Losungen, new edition by A Walfisz (V.E.B. Deutscher Verlag Wiss., Berlin, 1959). [27] E. Landau and A Ostrowski, On the diophantine equation ax 2 + by + c = dx", Proc. London Math. Soc. 19 (1920) 276-280.
P. Ribenboim / Fundamental Methods in Diophantine Equations
663
(28] M. Langevin, Quelques applications de nouveaux resultats de van der Poorten, Sem. Delange-Pisot-Poitou 17" ann. 1975/76, G12. [29) W.J. Leveque, On the equation ym = I(x), Acta Arith. 9 (1964) 209-219. [30) F. Lindemann, Uber die Zahl 17, Math. Ann. 20 (1882) 213-225. [31) J. Liouville, Nouvelle demonstration d'un theorerne sur les irrationnelles algebriques, C.R. Acad. Sci. Paris 18 (1844) 910-911. [32) J. Liouville, Sur des classes tres etendues de quantites dont la valeur n'est ni algebrique ni meme reductible a des irrationalites algebriques, c.R. Acad. Sci. Paris 18 (1844) 883-885 and J. Math. Pures Appl. 16 (1851) 133-142. [33) K. Mahler, On the greatest prime factor of ax" + by", Nieuw Arch. Wisk (3) 1 (1953) 113-122. [34) L.J. Mordell, On the rational solutions of the indeterminate equations of the 3-rd and 4-th degrees, Proc. Cambridge Philos. Soc. 21 (1922) 179-192. [35) L.J. Mordell, Diophantine Equations (Academic Press, London, New York, 1969). [36] D. Morduhai-Boltovskoi, Sur Ie logarithme d'un nombre algebriquc, CiR. Acad. Sci. Paris 176 (1923) 724-727. [37) G. P6lya, Zur arithmetischen Untersuchung der Polynome, Math. Z. 1 (1918) 143148. [38) P. Riberboim, Consecutive Powers, Exposition. Math. 2 (1984) 193-221. [39) A. Robert, Elliptic Curves, Lecture Notes in Math. 326 (Springer, New York, 1973). [40] K.F. Roth, Rational approximations to algebraic numbers, Mathematika 2 (1955) 1-20. Corrigendum 168. [41) A. Schinzel and R. Tijdeman, On the equation ym = P(x), Acta Arith. 31 (1976) 194-204. [42) T. Schneider, Tranzendezuntersuchungen periodischer Funktionen, J. Reine Angew. Math. 172 (1934) 65-74. [43) T. Schneider, Uber eine Dysonsche Verschiirfung des Siegel-Thuesches Satzes, Arch. Math. 1 (1949) 288-295. (44) C.L. Siegel, Uber den Thueschen Satz, Vidensk. Selskab Skrifter (Kristiania) Ser. I 16 (1921). Reprinted in Gesammelte Abhandlungen I (Springer, Berlin, 1966) 103-112. (45) C.L. Siegel, The integer solutions of the equation y2 = ax" + bx":' + ... + k (extract from a letter to Prof. L.J. Mordell), J. London Math. Soc. 1 (1926) 66-68. Reprinted in Gesammelte Abhandlungen I (Springer, Berlin, 1966) 207-208. [46) c.L. Siegel, Uber einige Anwendungen diophantischer Approximationen, Abh. Preussischen Acad. Wiss. Berlin, Phys. Math. KI. 1 (1929). Reprinted in Gesammelte Abhandlungen I (Springer, Berlin, 1966) 209-266. [47) H.M. Stark, A complete determination of the complex quadratic fields of class number one, Michigan Math. J. 14 (1967) 1-27. [48) A. Thue, Uber Anniiherungswerte algebraische Zahlen, J. Reine Angew. Math. 135 (1909) 284-305. Reprinted in Selected Mathematical Papers (UniversitetsforIaget, Oslo, 1982) 232-253. [49) A. Thue, Uber die Unlosbarkeit der Gleichung ax 2 + bx + C = dy" in grossen ganzen Zahlen x und y, Arch. Math. Naturv. (Kristiania) 34 (1917). Reprinted in Selected Mathematical Papers (Universitetsforlaget, Oslo, 1982) 561-564. [50) R. Tijdeman, On the equation of Catalan, Acta Arith, 29 (1976) 197-209. [51) R.J. Walker, Algebraic Curves (Princeton Univ. Press, Princeton, 1950). [52) A. Weil, L'arithrnetique sur les courbes algebriques, Acta Math. 52 (1929) 281-315. Reprinted in Oeuvres Completes I (Springer-Verlag, New York, 1979). [53) A. Wei!, Sur un theoreme de Mordell, Bull. Sci. Math. (2) 54 (1930) 182-191. Reprinted in Oeuvres Completes I (Springer, New York, 1979) 47-56.
l.A. BARROSO editor, Aspects of Mathematics and its Applications © Elsevier Science Publishers B.Y. (1986)
665
CONTACT AND EQUIVALENCE OF SUBMANIFOLDS OF HOMOGENEOUS SPACES
Alexandre A. Martins RODRIGUES Instituto de Matemdtica e Estatistica, Universidade de Sao Paulo, Sao Paulo, S.P. Brazil lnstitut Fourier, Unioersite Scientifique de Grenoble, Grenoble, France
1. Introduction
Let M be a homogeneous space of a Lie group G; for an element ~ M denote the diffeomorphism defined by G. Given two submanifolds 51' 52 of M, of the same dimension p, it is a classical problem of differential geometry to find conditions on 51 and 52 for the existence of g E G such that L g(51) = 52' When this happens, 51 and 52 are said to be G-equivalent. Given two points a l E 51 and a 2 E 52' we say that 51 has G-contact of order k with 52 at the points at and a 2 if there exists agE G such that Lg(a l ) = a 2 and L g(51) has contact of order k with 52 at the point a2 • The equivalence problem of submanifolds of a homogeneous space was extensively treated by E. Cartan [1], by his method of the moving frame. One of the underlying ideas in Cartan's method is that for each homogeneous space M there exists an integer k, depending on p, such that if there exists a diffeomorphism tP : 51 ~ 52 having the property that 51 has G-cot;ltact of order k with 52 at all points x E 51' and tP(x)E 52' then 51 and 52 are locally equivalent. Let a oE M be a fixed point and H the isotropy group of ao. Using Cartan's method of the moving frame, G.R. Jensen [3] proved the existence of k, assuming the existence of local sections in the space of orbits of the action of H in the Grassmann manifold GP(T fl(JM) and also assuming regularity conditions on 51 and 52' In the method of the moving frame, one defines imbeddings (TI : 51 ~ G and (T2: 52~ G such that, for g E G, L g (5 1) = 52 if and only if (T2(52) is the image of (T1(51) by the left translation defined by g. This reduces the equivalence problem to the case where M = G and G acts on G by left translations. In this case, the problem is easily solved by means of g E G, let Lg : M
666
A.A.M. Rodrigues / Contact and Equivalence of Submanlfolds
Frobenius theorem. Ultimately, the method of the moving frame reduces the necessary integration to solve the problem to the integration of a differential system of order 1. In this paper we prove the existence of the integer k and consequently solve the equivalence problem by integrating directly a differential system of order k of finite type. This makes it easier to state the regularity conditions which the manifolds 51 and 52 have to satisfy; at the same time the regularity conditions become geometrically more meaningful. Our regularity conditions bear only on contact elements of 51 and 52 of two consecutive orders I and I - 1 whereas in the method of the moving frame the regularity conditions bear on all orders from 1 up to k. Moreover, the order of regularity for which the Theorem of equivalence (3.5) applies is not fixed for all submanifolds of M. If a submanifold does not satisfy the regularity conditions at a given order it may satisfy these conditions at a higher order. This allows to prove equivalence theorems which cannot be immediately derived by the method of the moving frame. For instance, Theorem (3.5) can be applied to curves 'Y in the Euclidean space IR 3, in the neighbourhood of a point a E 'Y where the curvature p(a) and all derivatives dp d /- 2p - (a), ... ,/=2 (a) , ds ds vanish up to some order 1- 2 but and are different from zero, s being the arc length (see Section 4). Usually, equivalence theorems are stated imposing that the submanifolds 51 and 52 have same invariants at corresponding points. This type of theorem can be derived from Theorem (3.5) by taking a complete set of invariant functions for the orbits of G in the manifold cgk,P(M) of elements of contact of order k in Theorem (3.5). It is natural to ask how generic are the regularity conditions in Theorem (3.5). Let us remark that the condition
A.A.M Rodrigues / Contact and Equivalence of Submanifolds
667
(see Section 3), depends only on the contact element S~;/ of SJ at the point a J E SJ' Hence, this question will be answered (see Corollary (3.6» if one proves the existence of an integer k such that the set of points X E eek+J,p(M) for which the isotropy group Gk+J(X) of G is discrete and h k+J(X) = 0, is dense and open in eek+J,p(M). It seems reasonable to conjecture the existence of this integer for all homogeneous spaces. In the case p = 1, it follows from a theorem of I. Kupka [6] that the set of points of eek, J(M) for which the isotropy is discrete, is dense and open in ee k, J(M), for all sufficiently high k. The equivalence problem can be posed for two immersions f, h : S ~ M where S is any differentiable manifold. f and h are equivalent if there exists g E G such that h = L g 0 f. This fixed parametrization equivalence problem has been solved by means of a differential system of finite type of higher order by l.A. Verderesi [9], see also [4], [10]. In Section 2 we state a generalization of Frobenius theorem to differential systems defined by contact elements of higher order. This theorem will be our main tool in the proof of Theorem (3.5). Section 3 is devoted to the proof of the equivalence Theorem (3.5). In Section 4 we give a necessary and sufficient condition for a submanifold SCM to be an open set of an orbit of a Lie subgroup L of G. This theorem can be generalized to characterize the submanifolds S of M which are locally invariant by the action of a Lie subgroup L of G and which are fibered by the orbits of L which meet S. We end the paper with some simple remarks about curves in Ie.
2. A Higher Order Frobenius Theorem All differentiable manifolds and maps will be considered to be of class ee"'. If M is a differentiable manifold, we shall denote by eek,p(M) the differentiable manifold of contact elements of order k and dimension p of M [2]; 7T~.: eek,p(M)~ eek',p(M), k' ~ k, will denote the canonical projection. If k = 0, the manifold eeo,p(M) is identified with M. If k' = 0, we shall use the notation 7T k : eek,p (M) ~ M instead of 7T~. The fiber of eek,p(M) over a point x E M will be denoted by ee~,p(M). Let SCM be a submanifold of dimension p. We shall denote by S~ E eek,p(M) the contact element of S at the point xES and by Sk the image of the imbedding x E S~ S~ E eek,p(M). Given two submanifolds
668
A.A.M. Rodrigues / Contact and Equivalence of Submanifolds
SI' S2 C M of the same dimension and a point x E SI n S2' by definition, SI and S2 have contact order k at x if S~x = S;.. A differential system of order k and dimension p defined over M is, by definition, a submanifold [J C cgk,P(M). An integral manifold of £l is a submanifold SCM of dimension p such that S~ E £l for every xES. An important notion associated to differential systems is the notion of prolongation. Let A HI: cgHI,P(M)"'" cgl.P(cgk,P(M» be the map defined as follows: if X E cgHI,p(M) and SCM is a submanifold such that X = cg:+I(S), a E S, then, Ak+I(X) = (Sk ).\;. where X' = 1T:+1(X) E s'. It is easy to verify that A k+1 is an imbedding of cgk+l,p(M) into cgl.P(cgk,P(M». Clearly, there is also a natural imbedding of cgl,P(£l) into cgl.P(cgk,P(M»; we shall identify cgl,P([J) with its image in cgl.P(cgk,I(M». The first prolongation of n is then defined to be the set pt] = cgHI,P(M) n cg',P(£l) [8], The following generalization of Frobenius theorem is just a geometric formulation of the existence and uniqueness theorem of solutions of differential systems of finite type [7]. Theorem (2.1). Let n C cgk,P(M), k ~ 1, be a differential system such that: (I), 1TL : £l...,. cgk-l,P(M) is an immersion. (2). The projection 1T:+ I : p£l...,. £l is surjective, Then, for all X o E n, there exists a solution of £l defined in a neighbourhood of X o = 1T k (Xo) E M. Moreover, if SI and S2 are two such solutions, there exists a V C SI n S2 which is an open neighbourhood of X o in both SI and S2' A differential system satisfying conditions (1) and (2) is called completely integrable. A proof of Theorem (2,1) will appear elsewhere.
3. k-Admissible Submanifolds
Let G be a Lie group and let M be a homogeneous space of G. The action of G on M extends naturally to an action of G on cgk,P(M). If X E cgk,P(M), X = S~, x EM, and g E G then, by definition, s :X = (g. S)~x where B: S = L/S) and g' x = Lg(x), L g being the diffeomorphism of M induced by g. Given two submanifolds SI and S2 of M, we say that they are Gequivalent if there exists agE G such that g . SI = S2; we say that they are locally G-equivalent at the points a l E SI and a2 E S2 if there are open neighbourhoods VI and V 2 of a l and a2 in SI and S2 which are G-
AAM. Rodrigues / Contact and Equivalence of Submanifolds
669
equivalent. Given an element g E 0 and a point x E 51' g makes contact of order k ~ 0 between 51 and 52 at the point x E 51 if gx E 52 and g . 51 and 52 have contact of order k at the point g' x, or, equivalently, g' 5~x = 5~gx. Clearly, if g' 51 = 52' then g makes contact of any order k ~ 0 between 51 and 52 at any point x E 5. Definition (3.1). The submanifold 5 is k-admissible if there exists a submanifold U of C{]k,P(M) and a neighbourhood A of the identity e in G such that: (1). For all g E A and x E 5, s: 5; E u. (2). U is a completely integrable differential system of order k. Theorem (3.2). Given two submanifolds 51 and 52 (of dimension p) of M, assume that (1). 52 is k-admissible for some k ~ 1. (2). There exists a continuous map 1: 51 ~ G such that for all x E 51' 1 (x ). x E 52 and 1 (x ) . 5~x = 5~(xh' Then, for any point a l E 51' 51 and 52 are locally equivalent at the points a l and a 2 = 1 (a I) . a l · Proof. Choose U and A as in Definition (3.1), with respect to 52' Since the map x E 51 ~ 1 (a 1)1 (Xfl EO is continuous, there exists an open neighbourhood V of a l such that 1(a l)1(xf l E A for all x E V Hence,
Therefore, 1 (a l ) • 51 is an integral manifold of U and
Since 52 is also an integral manifold of U, it follows from Theorem (2.1) that there exists a set V 2 which is an open neighbourhood of a 2 in 1 (a l ) .51 and in 52' Then, the neighbourhoods VI = 1(a l f l • V 2 C 51 and V 2 are equivalent. 0 We shall now give sufficient conditions for a submanifold 5 C M to be locally k-admissible. For x E 5, let Ok(X) C G be the isotropy group of and let gk (x) C G be the dimension of (x). Let be the tangent space of k 5 at the point 5: and let gxO~ be the tangent space at the point 5~ of the of the point 5~ E «5k. P(M). Put orbit
0:,
cr
,0/:
5:
670
A.A.M. Rodrigues / Contact and Equivalence of Submanifolds
Definition (3.3). A point a E Sis k-regular, k ;;;'1, if the integers dk(x) = gk(X) + hk(x), dk-I(X) = gk-'(X) + hk-I(x) are constant and dk(x) = dk-'(x),
for x in a neighbourhood of a.
Theorem (3.4). Let a E S be a k-regular point. Then, there exists a neighbourhood V of a in S which is k-admissible. Proof. Let ljIk : G x Sk..,. cgk,P(M) be the map defined by
ljIk(g, S;) = g . S; .
Denote by L;'P : cgk,P(M)..,. cgk,P(M) the diffeomorphism defined by g and let Id k : Sk ..,. Sk be the identity map. Then
Since L g x Id k and L;'P are diffeomorphisms, it follows that the rank of ljIk at the point (g, S;) E G x Sk is equal to the rank of ljIk at the point (e, S;) E G x S\ where e EGis the neutral element. Clearly, the rank of ljIk at the point (e, S;) is equal to dim G -l (x) + P - h k(x)
=
dim G + P - d k(x) .
Hence, the rank of ljIk at the point (e, S;) does not depend on x for x in a neighbourhood of a. Therefore, the rank of ljIk is constant in a neighbourhood of (e, S:) in G x s'. Let A X V k be an open neighbourhood of (e, S:) in G x Sk which is mapped by ljIk onto a submanifold U of cgk,P(M) and put V = 7T k( V k). We shall show that U satisfies both conditions of Definition (3.1) with respect to the neighbourood V and the neighbourhood A of e. By construction of U, condition (1) holds true. To prove condition (2), consider the following commutative diagram: ",'
AXVk~U
Id
X
1T~_1
A
X
I
",*-1
11T~-'
V k- I ~ cgk-l,p(M).
AAM. Rodrigues / Contact and Equivalence of Submanifolds
671
Denote by ljJ~ the map induced by ljJk on tangent vectors. The kernel of ljJ~ at the point (g, S:) E A X V\ x E V, has dimension dk(x) = l(x)+ hk(x). Similarly the kernel of ljJ~-1 at the point (g, S;-I) has dimension dk-I(x). Since (Id x 7TLh is an isomorphism, it follows that (7TZ- 1Iu)* is injective at the point ljJk(g, S~) if and only if dk-I(x) = dk(x). Every point of U is the contact element of an integral manifold of U; hence, U satisfies condition (2) of Definition (3.1).0 As a consequence of Theorems (3.2) and (3.4) we have the following: Theorem (3.5). Let SI' S2 C M be two submanifolds of the same dimension p, and let a l E SI and a 2 E S2 be two points. Assume that a 2 is k-regular and that there exists a continuous map ep : SI ~ G such that ep(al ) = a 2, ep(x)· x E S2 and ep(x)' S~x = S~(X)' for all x E SI' Then, SI and S2 are locally G-equivalent at the points a l and a 2.
Notice that, under the hypothesis of Theorem (3.5), it follows that the point a l E SI is also k-regular. Notice also that the map x E SI ~ ep(x)' x E S2 is not necessarily a diffeomorphism; in fact, this map may even be constant equal to a 2• The following corollary is an important special case of Theorem (3.5). Assume that for a point a of a submanifold SCM, hk(a) = 0, that is, the orbit of S: in C(5k. P(M) cuts Sk transversely at the point S:. Assume also that gk (a) = 0, that is, the isotropy group of S: is discrete. Then, the same conditions clearly hold in a neighbourhood of a in S. Since and it follows that a is (k + I)-regular. Hence, keeping the notations as in Theorem (3.5), we have the following: Corollary (3.6). Let a 2 E S2 be a point and let k ~ 1 be an integer such that gk(a) = hk(a) = 0. Assume that there exists a continuous map ep : SI ~ G and a point a 1 E SI such that
A.( x· )
'f'
Sk+1 Ix
=
Sk+1
2>(x)'x'
for all x E SI' Then SI and S2 are locally equivalent at the points a l E SI and a 2E S2'
672
A.A.M. Rodrigues I Contact and Equivalence of Submanifolds
Assuming stronger regularity conditions on the contact elements of 52' Theorem (3.5) can be reformulated in the following way: Theorem (3.7). Let 51' 52 C M be two submanifolds of the same dimension p and let a 1 E 51 and 52 E 52 be two points. Let W k (52) be the set of all contact elements g . 5~ for all x E 52 and g E G. Assume that there exists a k ;;.1 such that W k(52) is an imbedded submanifold of eek,p(M) (i.e. the topology of W k(5..,) is the induced topology) and a 2 E 52 is k-regular. Assume also that there exists a map
Proof. As in the proof of Theorem (3.4) let l/Jk : G x 5~ ~ W k(52) be the map l/Jk (g, 5~x) = g . 5~. Since W k(52) is an imbedded submanifold, Iii is a differentiable map. Repeating the argument in the proof of Theorem (3.2), one can show that there exists an open neighbourhood U of 5~2 in W k (52) which is a completely integrable differential system. Let g E G be such that B: 5~aI = 5~a2' Since s :57 C W k(52) and the topology of W k(52) is the induced topology, one can choose an open neighbourhood Vof a 1 in 51 such that g' yk is a submanifold of U. Then g' V is an integral submanifold of U. Since there exists a neighbourhood of a 2 in 52 which is also an integral submanifold of U it follows that g . Y and 52 coincide in a neighbourhood of a 2• 0 Corollary (3.8). Assume W k(5 2 ) is an imbedded submanifold and that gk (a 2 ) = h 3(a 2 ) = O. Assume also that there exists a map
In this Section we shall characterize the submanifolds of M which are open sets of an orbit of a Lie subgroup L of G. If 5 C M is an open set of an orbit of L then, h k (x) = p and gk (x) is constant for x E 5 and for all k ;;. 0; moreover, for sufficiently high k, every point of 5 is k-regular. The following theorem is the converse to the above statement. Theorem (4.1). Let 5 be a connected submanifold of dimension p of M.
A.A.M. Rodrigues / Contact and Equivalence of Submanifolds
673
Assume that there exists a k > 1 such that hk(x) = p and such that every point xES is k-regular. Then, S is an open set of an orbit of a connected Lie subgroup L of G.
Proof. Given a point a E S, we are going to show the existence of a neighbourhood of S: in s' which is contained in the G-orbit of S: in Cfik,p(M). Keeping the notations as in Theorem (3.4), the set A· S: = {g' S:, g E A} is open in the orbit of S:. From the hypothesis hk(x) = pit follows that A . S: and U have the same dimension, hence A . S: is open in U. Since we can assume that V k is a submanifold of U with the induced topology, it follows that A· S: n V k is an open neighbourhood of S: in s' contained in the orbit of S:. Since this holds for every a E S and Sk is connected, Sk is contained in the orbit of S:. Consider the map A: g E G ~ B: S: E Cfik,P(M) and let N = A-1(Sk). N is a submanifold of G and e E N. Let .t be the invariant distribution which associates to each point g E G the subspace .t(g) = (Lgh(fi,N) C figG. Then N is an integral manifold of .t. To prove this, let g E N and let b = g . a E S. Theorem (3.5) implies the existence of a neighbourhood VI of a and the existence of a neighbourhood V z of b such that Lg(VI) = V z. Choose a neighbourhood W of e in N such that g . S: E VI for all g E W. For gl E W we have (ggl)' S: = g'(gIS:)ELg(Vt) = V 2• Hence, ggtEN and therefore g' W is a neighbourhood of gin N. Consequently, fi~ = (Lgh(fi,N) = .t(g). Since the images of N by left translations of G are also integral manifolds of .t, .t is completely integrable. Let L be the maximal integral manifold of .t which goes through e E G. L is a connected subgroup of G and the L-orbit of S: in Cfik,P(M) has the same dimension as Hence, Sk is an open set of this orbit. It follows that S is contained in the L-orbit of a in M. Clearly, the subgroup of L which leaves a fixed coincides with the subgroup of L which leaves S: fixed. Hence, S is open in the L-orbit of a in
s:
M.D 5. Regular Points of Curves in R 3 We shall characterize, up to order 3, the regular points of a curve y in the Euclidean space IR 3, the group G being the group of rigid motions. Let a E y be a point of y, since gO(a) = 3 and gl(a) = 1, x is not regular of order 1. For a to be regular of order 2 it is necessary that gl(a) = g2(a) =
674
A.A.M. Rodrigues / Contact and Equivalence of Submanifolds
1; the condition g2(a) = 1 is equivalent to the vanishing of the curvature at the point a. Hence, it is necessary that the curvature vanishes in a neighbourhood of a. Conversely, if the curvature vanishes in an open neighbourhood Vof a, then, for x E V,
and a is 2-regular. Therefore, a is 2-regular if and only if there exists a neighbourhood of a in 'Y which is contained in a straight line. Let p, s : 'Y ~ R be respectively the curvature and the arc length of 'Y measured from a (i.e. s(a)=O). g2(a)=0 if and only if p(a):I- 0 and h 2(a ) = 0 if and only if the derivative dp/ds(a):I- O. Hence, if p(a):I- 0 and dp/ds(a) :I- 0 then,
in a neighbourhood of a and a is 3-regular. If p(a) :I- 0 and dp/ds(a) = 0 then h 2(a ) = 1. Hence, for a to be 3-regular it is necessary that dp/ds(x) = 0 in a neighbourhood of a. Conversely, if p(a)1£ 0 and if p is constant in a neighbourhood Vof a then, g3(X) = g2(X) = 0 and h 3(x) = h 2(x ) = 1 for x E V. Consequently, if p(a) :I- 0 and dp/ds(a) = 0 then a is 3-regular if and only if p is constant in a neighbourhood of a. It is well-known that the classical theorem of congruence of curves in R. 3 may not hold in a neighbourhood of a point where the curvature vanishes. The following refinement of the classical theorem is an easy consequence of Theorem (3.5). Assume that p(a) = 0 for a point a E 'Y. Then, the derivative dp/ds may not exist but the right derivative dP J + (a) = lim p(s) [ ds s....o+ S always exists. Assume that
[:~r (a)
and
do not vanish at a. Then, g3(X) = l(x) = h 3(x ) = h 4(x ) = 0 in a neighbourhood of a in 'Y and a is a regular point of order 4. From the hypothesis [dp/dst(a) :I- 0 it follows that p(x) :I- 0 for x in a neighbour-
AAM. Rodrigues I Contact and Equivalence of Submanifolds
675
hood V of a and x ~ a; hence the torsion r(x) is defined for x E V and x ~ a. Let y' C R 3 be a second curve satisfying the same conditions at the point a' E y' and denote by p', r' and s' the curvature, the torsion and the arc length of y'. Let I: y ~ y' be a diffeomorphism such that I(a) = a', y = y' 0 I, r = r' oland s = s' 0 f. Then, in a neighbourhood of a, 1 is the restriction to y of a rigid motion in R. 3. This theorem can also be deduced from the usual theorem of congruence of curves in R. 3; we state it here to give an example of a situation which is covered by Theorem (3.5) whereas the method of the moving frame cannot be directly applied.
References [I] [2] (3] [4] [5] [6] [7]
[8] [9]
[10]
E. Cartan, Theorie des groupes finis et continues et la geometrie differentielle traites par la methode du repere mobile (Gauthie-Villars, Paris, 1937). E. Ehresmann, Introduction a la theorie des structures infinitesimales et des pseudogroupes de Lie, In: Coli. Geometrie Differentielle, Strasbourg, 97-117. G.R. Jensen, Higher order contact of submanifolds of homogeneous spaces, Lecture Notes in Math. 610 (Springer, Berlin, 1977). G.R. Jensen, Deformation of submanifolds of homogeneous spaces, J. Differential Geom. 16 (1981) 213-246. M.L. Green, The moving frame, differential invariants and rigidity theorems for curves in homogeneous spaces, Duke Math. J. 46 (1978) 735-779. LA. Kukpka, On regular contact elements, J. Math. and Mech. 15 (1966) 305-313. M. Kuranishi, Lectures on involutive systems of partial differential equations (Univ. Sao Paulo, Sao Paulo, Brasil, 1967). P.J. Olver, Symmetry groups and group invariant solutions of partial differential equations, J. Differential Geom. 14 (1979) 497-541. J.A. Verderesi, Contact et congruence de sous varietes, Duke Math. J. 49 (1982) 513-515. P. Griffiths, On Cartan's methods of Lie groups and moving frames as applied to uniqueness and existence questions in differential geometry, Duke Math. J. 41 (1974) 775-814.
I.A. BARROSO editor, Aspects of Mathematics and its Applications
© Elsevier Science Publishers B.V. (1986)
677
ON THE MEASURE OF LINE SEGMENTS ENTIRELY CONTAINED IN A CONVEX BODY L.A. SANT ALO Academia Nacional de Ciencas Exatas, Fisicas y Naturales, Buenos Aires, Argentina Dedicated to Leo poldo Nachbin with admiration and friendship
Let K be a convex body in the n-dimensional euclidean space R". We consider the measure Mo(l), in the sense of the integral geometry (i.e. invariant under the group of translations and rotations of R" [6, Chap. 15]), of the set of non-oriented line segments of length I, which are entirely contained in K. This measure is related by (3.4) with the integrals 1m for the power of the chords of K. These relations allow to obtain some inequalities, like (3.6), (3.7) and (3.8) for Mo(l). Next we relate Mo(l) with the function n(l) introduced by Enns and Ehlers [3], and prove a conjecture of these authors about the maximum of the average of the random straight line path through K. Finally, for n = 2, Mo(l) is shown to be related by (5.6) with the associated function to K introduced by W. Pohl [5]. Some representation formulas, like (3.9), (3.10) and (5.14) may be of independent interest.
1. Integrals for the Power of the Chords of a Convex Body
Let K be a convex body in the n-dimensional euclidean space IR ", Let dG be the density for lines G in IR n in the sense of integral geometry [6, Chap. 12] and let i f denote the length of the chord G n K. The chord power integrals
(m ;;;':0),
(1.1)
have been well studied [6, p. 237]. If dPI, dP 2 denote the elements of volume of IR n at the points PI' P 2 E K and, denotes the distance between PI and P2 , the integrals (1.2)
r; =
J Pj.PzEK
r
dP1
/\
dP2
(m ;;;.: - (n - 1)),
678
L.A. Santalo / Measure of Line Segments in a Convex Body
have also been considered and it is known that the relation (1.3)
21m = m (m - l)J m-n-l
holds good for m > 1 [6, p. 238]. For the cases m = 0, 1 and m = n + 1 we have the simple formulas (1.4)
1
o.; F '
I n +1 = (~n(n + 1»V 2 ,
10 = 2n--1
where F is the surface area of K and V its volume. We want to calculate 1m for the sphere S, of radius r in IR ". To this end, recalling that dO = dUn - 1 /I dOn - 1 [6, (12.39)] where dUn _ 1 is the area element of an hyperplane orthogonal to 0 at its intersection point with 0 and dOn- 1 is the area element of the unit sphere at the end point of the unit vector parallel to 0, we can write dO = pn-2 dOn_2 /I dp /I dOn _ 1 and therefore we have (p being the distance from the center of the sphere to 0)
(1.5)
t;
2m - 10n _ 10n-2
=
°
J(r2- p2)ml2p n
-2
dp
o
2m - 20n-l n-2,m+n-l B(!(n - 1), !(m + 2» 2 2
=
,
where B(p, q) = T(p) T(q)/T(p + q) is the Beta function and Ok means the surface area of the h-dimensional unit sphere, i.e.
°
(1.6)
h
21T(h+l l /2
1»'
= -----:---
T(hh
+
Therefore we have (1.7)
2m - 1r m+n-l 1T n-l/2 m T(l2m )
I m(S,)=
T(~n)T(~(m+n+1» .
2. Inequalities of Hadwjger, Carleman and Blaschke for the Chord Integrals 1m
The chord integrals 1m for convex bodies in IR n satisfy certain inequalities. One of them is due to H. Hadwiger [4]:
L.A. Santalo I Measure of Line Segments in a Convex Body
(2.1) where
679
(n > 2),
Kh
denotes the volume of the n-dimensional unit ball, i.e.
(2.2) and F and V denote the surface area and the volume of K respectively. Taking into account the isoperimetric inequality (2.3)
nK~n vi-lin ~
F,
inequality (2.1) gives (2.4)
(n > 2).
In (2.1) and (2.4) the equality sign holds only for the sphere. In [2] T. Carleman proved that in the plane, n = 2, J_I = f r- 1 dP I /I dPz = 1z has its maximum for the circle (for a given surface area) and pointed out that the same proof may be extended to showing that for convex bodies in R", the integrals 1m for m = 2, 3, ... , n have a maximum for the sphere for a given volume V. Thus, taking into account (1.7) and (2.2) we have the following set of inequalities: (2.5) In :s:; 2mn-m-n+I1T(n 2l2}-mn/Z nm+n-l(r(!n))m-l ( m 2
)n v
r(!m+1) 2 + n + 1))
r(~(m
m +n - 1
,
for m = 2, 3, ... , n. In [1], W. Blaschke proved that in the plane (n = 2) and for a given area F, the integrals 1m (m ~ 4) have its minimum for the circle. The proof is also easily extendible to R n, so that, taking (1.7) and (2.2) into account, we can write the new set of inequalities (for !R. n)
for m ~ n + 2. In (2.5) and (2.6) the equality sign holds only for the sphere.
680
L.A. Santalo I Measure of Line Segments in a Convex Body
3. The Measure Mo(l) of the Line Segments of Length I Entirely Contained in a Convex Body K in IR n A line segment S of given length I in R n can be determined either by the line G which contains the segment and the abscissa t of the origin P of S on G, or by P and the point on the unit sphere 0n-l given by the direction of S. The kinematic density for sets of line segments of length I (invariant under motions in R") is [6, p. 338]: (3.1)
dS
= dG
dt
1\
= dP
dOn_I'
1\
Using dS = dG 1\ dt we get that the measure of the set of line segments S entirely contained in K is (3.2)
J (u-/)dG.
Mo(l)=
u;;'/
If PI' P2 are two points of K at a distance I, we have dP I 1\ dP2 = 1\ dl 1\ dP I (up to the sign) and therefore, since we consider the
r ' dOn - 1
measure of non-oriented segments, we have
Diam(K)
(3.3)
J
1m dP I
1\
dP2 = 2
J
Im+n-lMo(l) dl.
P!,P2EK
As a consequence of (1.3) and (3.3) we have Diam(K)
(3.4)
J
I m = m (m - 1)1 m-n-l = m (m - 1)
I m-2Mo(l) dl ,
o
which holds for m
~
2. In particular, for m
=
2 we have
Diam(K)
(3.5)
I2 = 2
and the first inequality (2.5) gives
J
Mo(l) dl ,
L.A. Santal6 / Measure of Line Segments in a Convex Body
Diam(K)
(3.6)
f
I
,.::::21-1In7T(nI2)-ln1+1In(r(~n»1/nv(n+l)1n
~
Mo(l) d
681
(n +
1)F(~(n + 1»
,
where the equality sign holds only for the sphere. For instance, for convex sets K in the plane, n = 2, we have
f
Diam(K)
(3.7)
Mo(l) dl
~ _8_ F 3/2 , 3V;
o
where F is the surface area of K. Taking into account the isoperimetric inequality 47TF ~ L 2 and the inequality of Bieberbach F ~ ~ 7TD 2, where D = diam(K), we get the following inequalities (for convex sets in the plane): D
(3.8)
f
Mo(l) dl
o
f D
L3
~-2' 37T
Mo(l) dl
~ j 7TD 3 ,
o
with the equality sign always only for the circle. From (3.4) we deduce that for every polynomial function of the form 1 = a 2u 2 + ... + ahu h we have
f
(3.9)
D
f(u) dO
= Jl"(u)Mo(u) du.
OnK,,1J
0
By Weierstrass approximation theorem, this equality holds for every function I(u) having continuous derivatives I"(u) with the conditions 1(0) = 1'(0) = O. Integrating by parts the right side of (3.9) we have the following relationship D
(3.10)
J onK"1J
I(u) dO = -
J!,(u)Mb(u) drr ,
L.A. Santalo / Measure of Line Segments in a Convex Body
682
for every function /(0') having continuous derivative /,(0') and satisfying the condition /(0) = /,(0) = O.
4. The Invariants fl(l) of Enns-Ehlers Denote by K(l, w) the translate of K by a distance I in the direction co. Enns and Ehlers [3] define fl(l) to be the volume of K n K(l, w) uniformly averaged over all directions and normalized such that fl(O) = 1. If 0" denotes the length of the chord G n K, the volume of K n K(l, w) is precisely f 1 (0" - I) dUn-t' where dUn-t denotes the area element on the hyperplane orthogonal to the line G which has the direction w. Therefore, since dG = dUn_t /\ dOn_t' where dOn_ 1 denotes the area element on the unit (n - I)-sphere corresponding to the direction w, we have U"
(4.1)
fl(l)
= _2-
o.; V
J (0" -
I) dO'n - 1
II
u2!:1
dOn-1 = _2_ 0n-l V
J (0" -
I) dG
oe l .
and thus, according to (3.2),
(4.2)
n(l) = _2- Mo(l) . On-IV
Therefore, (3.4) gives D
(4.3)
t;
=
~ m (m
- 1)On-l V
JIm- fl (l ) dl. 2
o
For instance, if m = n + 1, taking (1.4) into account, we have D
(4.4)
J In-1fl(l) dl = ~, o
°n_1
according to a result of Enns and Ehlers [3, (8)]. If a 'random secant' is defined by a point in the interior of K and by a direction (the point and direction have independent uniform distribution),
L.A. San tala / Measure of Line Segments in a Convex Body
683
the k-th moment of a random secant is (using (3.1» (4.5)
Ja D
2k(k + 1) = On-IV
k-I
M o(a) drr ,
o
Thus, according to (3.4) we have k
2
Eio: ) = - - Ik+1 , On_IV
and the inequalities (2.5) give (4.6)
2 k-k1n (kln)+I(r(!n»(k+n}/n r(!(k + 3» n 2 2 Vk1n ~ 7T(k+I)/2r(~(k + n +
E( k)..::: U
2»
,
which holds for k = 1, 2, ... , n - 1 and the equality sign holds only for the n sphere. For the sphere of radius r we have V = (27Tn/2/nr(~ n »r and therefore (4.7)
as is well-known (Bnns-Ehlers [3]). In particular (4.6) implies that of all n-dimensional convex bodies K of volume V; E(u) is maximized for the n-sphere. This proves a conjecture of Enns and Ehlers [3]. The inequalitiies (2.6) can be written (4.8)
valid for k = n + 1, n + 2, .... The equality sign holds only for the sphere. For the plane, n = 2, if F denotes the area of K, we have (4.9)
L.A. Santalo / Measure of Line Segments in a Convex Body
684
and therefore, of all the convex sets of area F, the variance
is minimized for the circle (as conjectured by Enns-Ehlers [3]). The conjecture that the variance is also minimized for the sphere if n > 2 remains open.
5. The Associated Functions A(u) of W. Pohl In this section we consider only the case of the plane, n = 2. In each line G we choose a point X(x, y) and the unit vector e(cos 0, sin 0) corresponding to its direction. Consider the differential form w = (dX, e) = cos 0 dx + sin 0 dy. Then we have dw = - sin 0 dO /I. dx + cos 0 dO /I. dy = dG (according to [6, (3.11)]). W. Pohl [5] defines the associated function A(u) to the convex curve aK by . 0
0
0
0
(5.1)
A(u) =
J aK
to =
J cos a ds, ex
where the integral of w extends to the non-oriented lines (X, e), X EO aK, that determine on the convex set K a chord of length a and in the last integral a denotes the angle between the tangent to aK and G at the point X corresponding to the element of the arc ds. A simple geometric description of A(u), at least for small values of o, is the following [5]: Let aKo be the curve envelope of the chords of K of length a. Then A(u) is length of aKo. For instance, for a circle of diameter D we have (5.2)
Notice that our A(u) is one half of that of Pohl, which considers oriented lines. Let M 1(l ) be the measure of the set of non-oriented line segments of length I such that one end point is inside K and one outside K. Then we have [5]
L.A. Santal6 / Measure of Line Segments in a Convex Body
685
(5.3)
On the other side, using the kinematic density dS (5.4)
M 1(l) = 2
J I dO + 2 J
IT
=
dO /\ dt, we have
dO
o-el
u~l
and by virtue of (3.2) we get (5.5) where F is the surface area of K. From (5.3) and (5.5) we have
(5.6)
J A(lT)dlT,
M o = 1TF-
o
and (5.7)
A(lT) =
-
Mb(lT).
The relation (5.6) can be applied to compute the measure Mo(l) of non-oriented line segments of length I ~ D entirely contained in a circle of diameter D. Namely, from (5.2) we have I
(5.8)
Mo(l) = 1TF - 1T
J(D
2
-
lT 2)1I 2 drr
o
as is well known [6, p. 90]. Integrating by parts in (3.4) and taking into account (5.7), we get (for convex sets in the plane and m ~ 1)
L.A. San tala / Measure of Line Segments in a Convex Body
686
D
1m = m
(5.9)
J um-1A(u)du, o
where D is the diameter of K. This expression for the chord integrals 1m (for convex sets on the plane) is due to Pohl [5]. For m = 1 we have D
JA(u) du
(5.10)
= 1TP.
o
For m
=
2, according to (2.5) we get the inequality D
JuA(u) du::::;; _8_ p /2 . 3
(5.11)
3V;
o
For m
=
3 we have D
Ju
(5.12)
2A(u)
du = p
2
,
o
and for m >3, D
(5.13)
J
um-IA(u) du ~
o
2m - 1 I-m/2r(1 m ) 1T 2 p(m+1)/2.
r(hm +3»
In (5.11) and (5.13) the equality sign holds only for the circle. From (3.10) and (5.7) we have
(5.14)
J anK"e
D
/(u) dG =
Jl'(u)A(u) drr , 0
which holds for every function /(u) having a continuous derivative I'(u) and satisfying the conditions /(0) = 1'(0) = o.
L.A. Santalo / Measure of Line Segments in a Convex Body
687
The relation between the invariant fl(u) of Enns-Ehlers and the associated function A(u) of Pohl, according to (4.2) and (5.6) is (5.15)
A(u) = -1TFfl'(u).
References [1] W. Blaschke, Eine isoperimetrische Eigenschaft des Kreises, Math. Z. 1 (1918) 52-57. [2] T. Carleman, Ueber eine isoperimetrische Aufgabe und ihre physikalischen Anwendungen, Math. Z. 3 (1919) 1-7. [3] E.G. Enns and P.P. Ehlers, Random paths through a convex region, J. App!. Probab. 15 (1978) 144-152. [4] H. Hadwiger, Ueber zwei quadratische Distanzintegrale fiir Eikorper, Arch. Math. (Basel) 3 (1952) 142-144. [5] W. Pohl, The probability of linking of random closed curves, In: Geometry Symp. Utrecht 1980. Lecture Notes in Math. 894 (Springer, Berlin, 1981) 113-126. [6] L.A. Santalo, Integral Geometry and Geometric Probability, Encyclopedia Math. AppI. 1 (Addison-Wesley, Reading, Mass., 1976).
l.A. BARROSO editor, Aspects of Mathematics and its Applications
© Elsevier Science Publishers B.V. (1986)
689
LES GROS PRODUITS TENSORIELS EN ANALYSE ET EN PROBABILITES
Laurent SCHWARTZ Centre de Mathematiques de l'Ecole Polytechnique, 91128 Palaiseau, 'Laboratoire de Recherche Associe au C.N.R.S. 169', France Je suis heureux de publier eet article dans Ie volume en I'honneur des 60 ans de Leopoldo Nachbin, qui est mon ami depuis plus de 30 ans.
o.
Introduction
Les produits tensoriels d'espaces vectoriels (eventuellernent topologiques) sur R ou
n voudra alorsdire R~ x n = (R+ U{oo}) x u.
690
L. Schwanz I Gros Produits Tensoriels
producits tensoriels 0Opl' me parait apporter la-dessus Ie point final; les etres sur lesquels on raisonne sont des sections-Idiflerentielles de semimartingales continues formelles) de fibres vectoriels de dimension finie G A au-dessus d'un ouvert ou d'un optionnel A de ~ U {oo} x Il; dXt E G t , fibre au-dessus de t (w etant toujours om is). Une fois comprise la notion, tout Ie calcul est remarquablement simple, et [8] est considerablement simplifie et ameliore. Ce n'est pas dommage!
1. Sections-Distributions d'un Fibre Vectoriel sur une Variete C€oo; d-Sections d'un Faisceau de d-Modules
Soit V une variete C€oo reelle de dimension N, avec ou sans bordo Soit G v un fibre vectoriel C€oo sur ~ de fibre-type G., de dimension finie; G; sera la fibre au-dessus de v E V. Une section de G u au-dessus d'un ouvert V de Vest dite cgoo si elle I'est pour toute trivialisation au-dessus d'un sous-ouvert Vide V, Gu'~ V' x G,; on notera cgOO(V; G u) l'espace vectoriel de ces sections; c'est un module (pour la multiplication) sur l'algebre cgOO(V) des fonctions reelles cgoo sur U. Si merne on appelle 2( G v ) Ie fibre evident sur V, dont la fibre en chaque point v de Vest 2(Gv) = 2 R(Gv; 0v), donc de fibre-type 2(OJ = 2 R(G.; OJ, cgOO(V; G u) est un cgOO(V; 2(Ou»-module a gauche pour la multiplication. (Si U est un ouvert de V, et G u ~ V x G. une trivialisation de G u au-dessus de V, cgOO(U; G u) = cgOO(U)0R G.; si, pour deux trivialisations au-dessus du meme ouvert V, la formule de transition V x G. ~ V x G. est (v, g) ~ (v,a(v)g), aE cgOO(U;2(Gu»= cgOO(V)0R2(GJ, la formule de transition pour cgOO(V; G u) est cp~acp, (acp)(v) = a(v)cp(v), a(v)E2(GJ, cp (v) E 0., a (v)cp (v) E G.; O. etant un 2(GJ-module a gauche, cgOO( V)0 RG. est un (C€OO( V)0 R2(GJ)-module a gauche.) En outre, (cgOO(V; GU»UE
L. Schwartz / Gros Produits Tensoriels
691
ouvert, la restriction I I V' de I a V' devra etre element de d(V'), d(V)~d(V') est un morphisme d'algebres, et (d(V))UE'll' ou 611 est I'ensemble des ouverts de V, devra etre un faisceau; autrement dit, si (U;)iEI est une famille d'ouverts de V de reunion V, si I est une fonction reelle sur V, telle que, pour tout i E I, la restriction II Vi a Vi soit dans d (VJ, alors lEd(V). On def nit alors aussitot les fonctions de d sur Vi a valeurs dans un vectoriel G. de dimension finie sur IR, par coordonnees. Elles formeront un espace vectoriel d(V; G), d(V)-module, d(V; GJ = d(V)®R G.; c'est meme un (d(V; 2(GJ) = d(V)®R 2(GJ)-module a gauche. On appellera alors d-fibre vectoriel G v sur V, de fibre-type G., la donnee d'un espace avec projections G v sur V, de structures vectorielles de merne dimension finie d = dim G. sur les fibres G v' v E V, et d'une famille de d -trivialisations, G u, ~ U; x G., U; ouverts de V, U iEI U; = V, telle que la formule de transition des tri vialisations d'indices i, j E I, soit une application (U; n V) x G. ~ (U; n ~) x G., de la forme (v, g)~ (v, a(v)g), a E d«U; n ~); 2 G ) = d(Vi n ~)®R 2(G); on exigera aussi que I'atlas soit complet, ou bien on dira que deux atlas equivalents definissent la merne structure d-fibree. Une section de G u au-dessus d'un ouvert V sur lequel G u admet une d-trivialisation G u ~ V x G. se traduira alors par une fonction I sur V a valeurs dans G.; ce sera une d -section si IE d( V) ®R G.; la formule de transition des cartes assure alors que si I'on a deux d-trivialisations sur Ie merne ouvert V, une d-section pour I'une est une d-section pour I'autre; une section sur un ouvert V quelconque sera dite sd-section si elle I'est en restriction sur tous les ouverts V' C V sur lesquels G u ' admet une d-trivialisation. Si alors I est une section de G u , si (U;)iEI est une famille d'ouverts quelconque de reunion V, si la restriction de I achaque U; est une d-section sur U;, elle l'est sur V. Si done on appelle d(V; G u) I'espace vectoriel des d-sections de G u au-dessus de V, c'est un d( V)-module, et merne un d( V; 2( G u ))-module agauche, ou 2( G u ) est Ie d-fibre evident de fibre 2(Gv ) en v E V, et (d(V; GU))UE'll est un faisceau de (d(V))UE'll modules. Remarquons que G v a canoniquement une topologie separee. Si en outre d est, par exemple, Ie faisceau des fonctions cern sur une variete V de c1asse cern, alors G; a canoniquement une structure de variee cern, la projection Gv~ Vest cern, et les sections de G v sont simplement les sections qui sont cern de V dans G v. Soit maintenant so'(V) I'espace vectoriel des distributions sur une variete oo V de c1asse ce • Ce pourra etre l'espace des fonctions generalisees ou
692
L. Schwartz I Gros Produits Tensoriels
courants de degre 0, des mesures generalisees ou courants de degre N tordus, ou d'autres. Si G v est un fibre vectoriel de dimension finie au-dessus de ~ on definit aisernent l'espace £0'(V; G v ) des sections distributions de G v . II n'est plus question de dire que T E £0'(V; G v ) est une distribution telle que, pour tout v E ~ T( v) E G v ! (On se permettra quand meme de l'ecrire souvent; c'est un abus de langage, qui aujourd'hui ne l'est plus tant il est bien controlel). Prenons par exemple pour ~'(V) I'espace des fonctions generalisees; une section-fonction f de G v' localement Lebesgue-integrable, sera en particulier une section generalisee. II y a deux definitions c1assiques de ~'(V; G v ) que nous rappellerons. La premiere par dualite. On considere I'espace '@(V; Gt0 [}~) des sections ~'" a support compact du fibre Gt0 sur V, dont la fibre en v E Vest G: 0R [}~, G: dual de G v' espace vectoriel de dimension 1 des N-covecteurs tordus; on Ie munit d'une topologie limite inductive, et .@'(V; G v ) est son dual. II est alors evident que c'est, pour la multiplication, un ~"'(V)-module, et merne un ~"'(V; 2(Gv))-module a gauche, et qu'il satisfait au theorerne de recollement des morceaux par partition de l'unite: iI y a un morphisme de restriction a un ouvert, et (.@'(U; G u )) UE'fL est un faisceau. La possibilite de definition par dualite est un bonheur, qui, comme on Ie verra plus loin, ne reussit pas toujours! L'autre definition usuelle est par les cartes. Si G u est trivial au-dessus de U, G u == U x G., ~'( U; G u ) == @'(U; GJ, espace des distributions sur U a valeurs dans G., qui n'est autre que ~'(U)0R G.; puisque .@'(U) est un ~"'(U)-module, et G. un 2(GJ-module a gauche, il est un ~"'( U; 2( G u )) == ~"'( U) 0R 2(GJ-module a gauche. Si on passe d'une premiere carte a une deuxierne au-dessus du merne ouvert U, et si Ie changement de cartes est defini par a: U x G. ~ U X G., a E ~"'( U) 0R 2(GJ, on considerera aussi, par definition, que Ie changement de cartes pour @'(U; G u) est la merne multiplication par a, operant de £0'(U)0R G. dans @'(U)0R G .. On a ainsi defini les espaces .@'(U; G u) pour tous les ouverts U sur lesquels G u est trivialisable, et de facon coherente: si (T;)iEI est une famille de sections, T; sur U;, Gu. trivialisable, si U = U i U;, G u encore trivialisable, si T; = ~ sur U; n~, il existe T E £0'(U; G u ) unique qui induise T; sur chaque U;. On definit alors .@'(U; G u), pour U ouvert que!conque, comme l'espace des sections du faisceau ainsi defini: un element T de ~'(U; G u) est, par definition, une famille coherente (T;)iEI' T; E @'(U;; G u), ou (U;)iEI est la famille de tous les ouverts de trivialisation de G u (on peut se con tenter d'une sous-famille recouvrant U). Alors, d' apres la definition meme, (£0'(U; G u)) UE'fL' ou iJIL est
rr
o;
a;
L. Schwartz / Gros Produits Tensoriels
693
la famille de tous les ouverts de V, est un faisceau de cg"'( V)-modules, de cg"'( V; 2( G u ))-modules a gauche. On met facilement sur ffi'( V; G..,) une topologie de caractere local sur V, en la mettant sur les ffi'(U;; G u .), G u trivial sur Vi' Cette fois-ci c'est la dualite entre ffi'(V; Gv ) et ffi(V; G~0n~) qui devient un theorerne. Tout cela est bien connu, et les proprietes se deduisent tan tot mieux par une definition, tantot mieux par I'autre, On peut ici encore generaliser, en supposant seulement V espace topologique separe, sd faisceau d'algebres de fonctions reelles, On remplacera ffi'(V) par un faisceau quelconque (X(V»UE'lI de (sd(V))UE'lImodules, et on definira les X( V; G u ) pour un sd-fibre vectoriel G u de dimension finie. Si G u -. V x G. est une sd-trivialisation, X( V; G u ) se representera comme X(V)0R G.; pour un morphisme de changement de cartes V x G. -. V x G., (v, g)-. (v, a(v)g), a E sd(V; 2(GJ) = sd( V) 0R 2(GJ, Ie morphisme de transition X( V) 0R G. -. X( V) 0R G. sera aussi la multiplication par a; X( V) est un sd( V)-module, done X(V)0R G. est un (sd(V)®R 2(GJ)-module a gauche, et a E sd( V) 0 R 2(GJ On connait done X( V; G u) pour tout V pour lequel G u est trivia/isable, avec la propriete de faisceau pour V = U jEt U;. On. appellera alors T E X(V; G u), pour V quelconque, une famille (1';)jE/' 1'; E X(Vi ; G u), coherente (1'; = ~ sur U; n V), pour tous les ouverts U; au-dessus desquels G u est sd-trivialisable. Par definition merne, (X( V; G u)) UE'lI est u~ faisceau de (sd( V» UE'lI-modules, et de (sd(V; 2(Gu»)uE'lI-modules a gauche. Tout cela est elementaire et d'un usage courant en topologie et geometric: les proprietes suivantes, quoiqu'impliciternent connues, Ie sont moins, et, comme nous Ie verrons au Section 2, gagneraient a I'etre, au moins en probabilites I. I Par exemple, la Proposition (1.3) figure explicitement dans [I, Chap. II, Section 2, exemple (2.8.1), p. 138], comme un exemple simple sans demonstration, dans Ie cas ou d = <e~(V) et .J{ = ~'(V), V variete <e~. Et en effet tout ce qui est ecrit ici est plus simple que les grands theoremes de la cohomologie des faisceaux! Cependant notre Proposition (1.3) repose sur Ie fait que Ie d-fibre G v est facteur direct d'un d-fibre trivial, ce qui, en fait, n'est pas evident pour V non compacte. et qui, dans les cas que nous traiterons 11 la Section 3, n'est sans doute pas vrai. Des resultats de ce genre, constamment utilises en geometric algebrique, analytique on diff'erentielle, semblent etre en dehors des idees 9iJ'(V; G v ) "" courantes en analyse; il est probable que l'isomorphisme <e~(V; Gv)Q9",'(V)~'(V) n'apporte pas grand chose aux analystes pour la connaissance de 9iJ'(V; G v ). Mais 9iJ'(V) est un dual, et l'espace .'I'Al(A) des semi-martingales, de la Section 2, ne l'est pas; la manipulation de l'espace .'I'Al(A) des diffcrentielles sur A de semimartingales continues formelIes n'est pas encore passec dans l'usage en probabilites, encore moins celie de .'I'Al(A; G A), ou 'dX E .'I'Al(A; GA)' represente I'idee intuitive dX, E G lo fibre au temps t. D'ou I'interet de la Proposition (1.3) ici. Le livre de R. Godement fl] pourra etre une bonne reference pour tout ce qui concernera les faisceaux.
694
L. Scb wartz / Gros Produits Tensoriels
1.1. Morphismes de Modules
Soient l/J* E dey; Gt), ou Gt, dit fibre dual, est Ie d-fibre evident de fibre G~ en v E V, et T E K(V; G y). On fabrique avec elles un produit sealaire l/J* T E K( V). Soit en effet G u ~ U x G. une trivialisation de G u ; l/JtE d(U)0 R G~, t; E K(U)0R G; alors l/JtTu est evident: si (gk)k=I.2..... d est une base de G., (g*k)k=1.2..... d la base duale de G~, l/Jt et T u ont des ecritures uniques l/Jt= L k l/Jkg*\ l/Jk E d(U), T u = k Lk T E K(U), et l/J*T = L k l/JkTk E K(U); c'est independant de la base, et, pour un merne ouvert U de trivialisation de G u ' independant de la trivialisation. Alors les (l/J* T)u definies pour tous les ouverts U de trivialisation de G sont coherents, done, K etant un faiseeau, elles defissent un merne element l/J*T de K(V); (l/J*, T)~ l/J*T est d(V)bilineaire. II en resulte done que T E K(V; G y ) definit un element liT de Hom",(y) (d(V; Gt); K(V», e. ad. une application d(V)-lineaire U T de dey; Gt) dans K(V). Et enfin T~ u T est une application d-lineaire de K(V; G y) dans Hom",(y) (d(V; Gt); K(V».
r».
Proposition (1.1). Supposons V si-normal (i.e. si F est un ensemble [erme de V, U un voisin age de F, it existe line [onction de deY), egale a I sur F, a support dans U). Alors lapptication T~ uT ci-dessus est bijective, et c'est un isomorphisme de dey; I£(Gy»-modules gauche: K(V; G y) = Hom",(y)(d(V; Gt); K(V».
a
Demonstration. (1). Montrons d'abord I'unicite, e. a d. que, si T E K(V; G y) est telle que, pour tout l/J* E dey; Gt), l/J*T= 0, alors T= O. Nous allons montrer plus; soit U un ouvert de trivialisation de G u ; si TE K(U; G u ) est telle que pour toute l/J* E dey; Gt), l/J*T= 0 sur U, alors T = O. Cela entrainera bien Ie resultat; si en effet T E K(V; G y ) , et si, pour toute l/J* E d( V; Gt), l/J* T = 0 sur V, done sur tout U de trivialisation, T sera nulle sur tous ees U, done sur V. Reprenons les notations precedant l'enonce du theoreme: (gk) et (g*k) sont des bases duales de G., G~, T = L k Tkg k sur U; si f3 E dey) a son support F dans U, f3g*k prolongee par 0 sur U definit une fonetion sur V, qui est dans d(U; Gt), et dans d(P, G';:<) (elle y est nulle); done dans dey; Gt), done f3g*k T = et: = 0 sur U. Soit U ' un ouvert de V, U ' C U; par la d-normalite, it existe f3 E dey), a support dans U, egale a I sur U ': done T k = 0 sur U', et T = 0 sur U'. Mais, V etant normale, les U ' d'adherence dans U ont pour reunion U (tout point a un systeme fondamental de voisinages ferrnes), done T = 0 dans U. C
695
L. Schwartz / Gras Produits Tensoriels
(2). Montrons la surjectivite; soit u: d(V; Gt)-+.N'(V), nous devons montrer qu'il existe TE.N'(V; G v) telle que u(I/J*) = I/J*T. Reprenons les notations qui precedent pour V ouvert de trivialisation de Gu . Soient {3 E d(V), egale a 1 sur V', a support dans U, et soit I/J* E d(V; Gt). Dans U, I/J* = L k I/Jkg*k; posons U({3g*k) = T~u') E .N'(V), alors, dans V', u(I/J*)
= U({321/J*) = L k U({32I/Jkg*k) = L k ({3l/Jk)
U({3g*k)
= L k {3l/JkTk
=
~kl/JkT~u,)=I/J*T(u'), si T(U')=~kT~u,)gk dans U. Done T(U')E .N'(V; G u), et u(I/J*) = I/J*T(U') sur V'; il resulte de (1) qu'une telle T(U') est unique. Si done V" a les memes proprietes que V', T(U') = T(u") sur V' n V"; done, la reunion de ees V' etant U, il existe T u E .N'(V; G u) unique telle que, 'r/I/J* E d(V; G v), u(I/J*) = I/J* T sur tout V'd'adherence dans U, done dans U. Si alors Test I'unique element de .N'(V; G v) tel que T = T u sur V, pour tout ouvert V de trivialisation de G u, u(I/J*) = I/J* T sur tous ees U, done sur v: (3). Nous avons vu que d(V; G v) est un d(V; .:t'(Gv»-module a gauche. Alors d(V; Gt) est un d(V; .:t'(Gv»-module a droite, par la multiplication transposee: si I/J* E d(V; Gt), a E d(V; .:t'(Gv I/J*a = aTI/J*; et HomJt(v)(d(V; Gt); .N'(V» redevient un d(V; .:t'(Gv»-module a gauche par u -+ au, U E HomJt(v)(d(V; Gt); .N'(V», ou aU(I/J*) = u( I/J* a). Alors (au T ) ( I/J*) = UT (1/1* a) = (1/1* a)T = 1/I*(aT) = Ua T ( 1/1*), comme des cartes Ie montrent, d'ou Ie resultat: T-+ u T est d(V; .:t'(Gv»lineaire.D
»,
Corollaire (1.2). Soit V si-norm al ; l'homomorphisme naturel de d(V)modules: d(V; Gt)-+(d(V; G v »* (d(V)-dual de d(V; G v isomorphisme: (d(V; G v »* = d(V; Gt).
» est un
Demonstration. Un element 1/1* de d(V; Gt) definit evidemment une forme d(V)-lineaire "«: sur d(V; G v ) par la multiplication ponctuelle q;-+u",.(q;)=I/I*q;, (1/I*q;)(v)=(I/I*(v)Iq;(v»o'o; il s'agit de voir que 1/1* -+ u"'. est bijective. v' v II suffit d'appliquer la Proposition (1.1) avec .N' = d lui-memo, en echangeant les roles de G et G*. 0 1.2. Produits Tensoriels de Modules
Soient G v , Hi; deux fibres sur V; on eonstruit immediatement un fibre G v ® H v , dont la fibre en v E Vest G; @R H v' II existe une applidans cation d(V) bilineaire evidente de (d(V; G v) x d(V; H v d(V; G v@ H v ), par (q;, 1/1) -+ q; ® 1/1, (q; @ 1/1 )(v) = q;(v)@ I/I(v) E
»
696
L. Schwartz I Gros Produits Tensoriels
Gv 0 R H v ; d'ou une application d( Vj-Iineaire de d(V; G V)0d(V)d(V; H v) dans d(V; G v0 H v)' Elle n'est pas en generale bijective; des (d(V; G u0Hu))UE'IJ forment en effet un fais-
ceau, alors qu'il n'en est tristement pas de merne en general des produits tensoriels, (d(V; G U)0d(U)d(V; HU))UE'IJ' qui ne forment qu'un prefaisceau. De meme, la multiplication (q;, T)~ q;T definit une application d(V)-bilineaire de d(V; G v) x .H(V) dans .H(V; G v), donc une application d(V)-lineaire de d(V; G V)0d(V).H(V) dans .H(V; G v), q;0 T ~ e/I'; elle est merne d(V; 2(G v))-lineaire, car, si a E d(V;2(G v)), a(q;T) = (aq;)T. Mais elle n'est pas en general bijective, puisque (.H(U; GU))UE'IJ est un faisceau, alors qu'en general (d(V; G U)0d(U).H(V))UE'IJ n'est qu'un prefaisceau, II faudra des hypotheses supplementaires, 1.2.1. Modules Projectifs
Un A-module M, A commutatif, est dit projectif de type fini s'il est sous-module facteur direct d'un module libre de dimension finie. Nous dirons qu'un d-fibre G v sur Vest projectif s'il est facteur direct d'un fibre trivialisable: iI existe des fibres K v , B v , tells que G v EB K v (de fibre Gv EB K; en tout point v de V) = E v == V x E.. Alors d(V; Gv)EBd(V; K v ) = d(V; E v ) == d(V)0R E., d(V)-module libre de dimension finie dim E.; done, si G v est projectif, d(V; G v ) est un d(V)-module projectif de type fini. Par contre, .H(V) n'est pas en general projectif ni de type fini; si Vest une variete ~"', .H = q;', q;'(V) n'est surement pas de type fini sur l'algebre ~"'(V), et n'est tres vraisemblablement pas projectif. Notons en passant ce qu'est un sous-es-fibre d'un d-fibre vectoriel G v , de fibre-type G. de dimension d. Un sons-fibre H; de Gc, de dimension d', est dit sous d-fibre, s'il verifie I'une des relations equivalentes suivantes: (1). Tout v E V admet un voisinage U dans V, sur lequel il existe d" d-sections de Gu, s.; i = 1,2, ... , d", telles que, 'Vv' E V, les Sj(v') engendrent n, (d"~ d'). (Ibis). Meme definition, mais d" = d', et les s,(v') forment une base de
u,
(2). II existe un atlas de d-trivialisations simultanees, c. a d. de d-trivialisations, G u; ~ u, x G., ou H U j vient sur U, x H.. Alors bien evidemrnent un sous-sr-fibre a une structure naturelle de d-fibre (definition (2)). Inversement, si G v , Hi), sont deux d-fibres, H; C G v, et si l'injection Hv~ G v est une d-section de 2(Hv; G v), ou
697
L. Schwanz / Gros Produits Tensoriels
.2(Hv; G v) est Ie fibre evident de fibre .2(Hv; G v) en v E V, alors H v est sous-es-fibre de G; (definition (lbis)). II est inutile de savoir cela pour definir une somme directe GvEB K v = B v , mais pas inutile de savoir que G v et K v sont alors des sons-fibres de B v ! Si est l'orthogonal de H v dans G v (de fibre H~ en v E V), Hvest sous-ss'-fibre de G v ssi est sous-es-fibre de G~ (definition (2)). On a
tt;
u;
done de nouvelles definitions: (3). Tout v de V admet un voisinage U, sur lequel il existe d'" d-sections de Gt" s~, i = 1,2, ... , d"', telles que H v = {s~ = O}. (3bis). Meme definition avec d'" = d - d', les s~(v) etant independantes en tout v E U. Si H v est un sous-es-fibre de G v , les morphismes d(V; Hv)~ d(V; G v )' .K(V; Hv)~.K(V; G v ) sont injectifs, et font du premier module un sous-module du deuxieme; c'est evident pour d, mais aussi pour .K, car, si G u ~ U x G., H u ~ U x H" est une trivialisation simultanee, (definition (2)), .K( U) @R H. est un sous-module de .K(U)@R G., et pour U quelconque ou V cela resulte de la propriete de faisceau. Si d est Ie faisceau des fonctions r;m sur une variete V de c1asse r;m, on sait ce qu'est une application r;m de V dans une grassmannienne; il existe alors encore une definition equivalente aux precedentes: (4). L'application v -+ H; de V dans l'espace fibre Gr G v des grassmanniennes, Gr G; = grassmannienne de G; (fibre sur V a fibres non vectorielles)', est une section r;m de Gr G v sur V.
n 1:,
Proposition (1.3). Soient G v , H v' des fibres projectifs sur V (facteurs directs de fibres trivialisables). Alors les applications naturelles
sont bijectives (Ie dernier est un isomorphisme de d(V; .2(Gv ))-m odules gauche).
a
Demonstration. Soient d'abord G v , H v , trivialisables, == VX G., VX H. Alors tout est evident, car 1 Ces proprietes ne sont pas difficiles a dernontrer, encore taut-HIe faire. On les trouvera plus au mains dans [6, demonstr. prop. (6.4)], dans [7, (2.8), (2.9)], dans [8, (3.17), (4.22), (4.23) a (4.23 quinto)]; et sans doute dans taus les ouvrages de geometric traitant de la grassmannienne,
698
L. Schwanz / Gros Produits Tensoriels
d(V; GV)®d(vld(V; H v) = (d(V)®R GJ®d(Vl(d(V)®R HJ = (d(V) ®d(vl d(V»®R (G.®R HJ = d(V)®R (G.®R HJ = d(V; Gv®Hv),
et, d(V; GV)®d(VlK(V) = (d(V)®R GJ®d(vlK(V) = (d(V)®d(V)K(V»®R G. = K(V)®R G. = K(V; G v).
Si main tenant les fibres sont seulement facteurs directs de fibres triviaux, GvEBKv=Ev=vxE., HvtBLv=Fv=VxF" alors les deux membres d(V; EV)®d(V)d(V; F v), d(V; Ev®E v), sont sommes directes de 4 espaces, Ie morphisme est somme directe de 4 morphismes, iI ne peut etre bijectif que si les 4 Ie sont; de rneme, d(V; EV)®d(V)K(V) et K(V; E v ) sont sommes directes de 2 espaces, Ie morphisme est somme directe de 2 morphismes, il ne peut etre bijectif que si les 2 Ie sont. 0
Remarque (1.4). Si G v est projectif, Ie Proposition (1.1) et la Corollaire (1.2) sont vraies sans supposer V d-normale. En effet, si G v = V X G. est trivial: Homd(v)(d(V; G*); K(V»
= Homd(Vl(d(V)®R G~; K(V»
= HomR(G~; K(V» = K(V)®R G. = K(V; G v).
Si G est seulement facteur direct d'un fibre trivial, on refera Ie meme raisonnement de somme directe de 2 applications que pour la Proposition (1.3).
Remarque (1.5). On a done, si 1'0n veut, deux nouvelles definitions de K(V; G v ) quand G v est projectif:
Dans ce cas, tout element de K(V; G v) admet des ecritures L akTk, a k E d(V; G v), T, E K(V) (puisqu'il s'agit d'un produit tensoriel); en
L. Schwartz / Gros Produits Tensoriels
699
Remarque (1.6). La Proposition (1.3) montre done que, si G v et H; sont (d(U; G U) 0 d(U)d (U ; HU))UE'Il et (d(U; G U) 0 d(U) projectifs, J{(U)bE'Il sont des faisceaux. Remarque (1.7). II est done important de savoir quand G v est facteur direct d'un fibre trivial. II est connu que c'est vrai, par exemple, pour V d-normale: (a). si Vest compacte, (b). si G v est une variete topologique. Dans chacun de ces cas, en effet, on prouve qu'il existe un d-atlas fini de G v (c'est evident dans Ie cas (a), c'est moins facile dans Ie cas (b ))2, soit G ui == UI x G,• i = 1,2, ... , m. Au-dessus de (1;, il y a done un nombre fini d = dim G. de sections qui, en chaque point v E V, engendrent G v ' soit (Sj,k)t=l' Soit (uJr=1 une d-partition de l'unite subordonnee (Vest d-normale!). Alors UjSj,k E d(V; G v ) est une section globale; comme Ljuj = 1, en tout point v E V l'une des Uj ri'est pas nulle, et les uj(v)sj,dv), k = 1,2, ... , d, engendrent G v ' Alors (v, (Xj,k)j,k)~ (v, L i.k uj(V)Sj,k(V)Xj,k) est un morphisme de dfibres du fibre trivial V x IR md sur G v' done G v est quotient d'un fibre trivial; si, en chaque v E V, on appeIIe N; Ie noyau de cette application, N; est un sons-fibre de VX IR "': N; = {(Xj,k)~k /Lj,k uj(V)Sj,k(V)Xj,k = O}; d'apres la definition (3) de la Subsection 1.2.1, N v est un sous-es-fibre de VX IR md • Si P v est Ie fibre dont la fibre Pv en v E Vest I'orthogonal de N v 1 Ces ecritures ~k ak Tk sont ce que j'ai appele dans [8] des representations tangentielles (voir par exempIe (3.1), (3,2), (3,2bis) de [8]), J'y attachais alors une grade importance. En effet, rna seuIe definition de .N'(V; G v) alors etait duale, par Homst(v) (d(V; Gt-); .N'(V», dans Ie cas probabiliste qui sera traite ici a la Section 2, avec V = A, GA , .N'(A) = yAt(A). Les representations tangentielles etaient ce qui rattachait a G v lui-merne, au lieu de G1f, et elles interviennent dans Ia Iacon d'ecrire une equation differentielle stochastique. lei tout est fait a partir de G v, ces representations sont exactement la Proposition (1.3), .N'(V; G v ) =
d(V; Gv)@st(V).N'(V). 2 Si Vest une variete topologique de dimension N, tout recouvrement ouvert admet un recouvrement plus fin pour lequel tout point est recouvert au plus N + 1 fois. On en deduit [2, Chap. III, Th. (2.6)], qu'il existe un autre atlas plus fin (llj)jEJ et une partition J = U ::; J, de J, tels que tous les llj pour lesquels j est dans un meme Jk soient deux a deux disjoints, de sorte que U j E h llj est encore une carte, et done (U j E h llj)::; est un atlas fini. C'est une propriete purement topologique, ou d n'intervient pas. Au lieu d'une variete de dimension N, on peut prendre un CW-complexe de dimension N.
700
L. Schwartz / Gros Produits Tensoriels
dans R md pour sa structure euclidienne canonique, P v est un sous-ss-fibre de V x R md (P v est Ie transporte de l'orthogonal P~ par I'isomorphisme (R md)* ~ lR md), V x lR md = NvEB G v' done G v est sous-ss-fibre facteur direct d'un fibre trivial. Par des methodes completement diflerentes, en rernplacant lR par
1.3. Fibres Boreliens Voici maintenant une situation un peu differente, qui va nous servir it la Section 2. V ne sera pas un espace topologique, mais un ensemble muni d'une tribu, Bor, appelee borelienne; d(V) sera l'algebre Bor(V) des fonctions reelles boreliennes sur V Tous les ouverts U, U ', U;, des numeros precedents seront rernplaces par des parties boreliennes de V Mais la propriete de faisceau exigera toujours des reunions denombrables: si (Vn)nEN est une suite de parties boreliennes, si fest une fonction reelle sur U = Un Un' dont la restriction a chaque U; est borelienne, elle est borelienne sur U Un fibre borelien G v , de fibre-type G., sera defini de la merne maniere, un morphisme de transition sur U etant (v, g) ~ (v, a(v)g), a borelienne a valeurs dans 2(GJ; mais on exigera toujours que tout atlas ait un sous-atlas denombrable. On en deduira des espaces d(V; G v); une section de G(V) sera borelienne si elle l'est sur chaque carte. Ensuite X sera un 'faisceau' de d-modules, (X(V))UEBor' mais n'ayant la propriete de faisceau que pour les reunions denombrables: si (TJiEI est une famille, 1'; E X(U;), coherente (1'; = ~ sur U; n ~), U iEI U; = V, et s'il existe un J C I denombrable tel que U iEJ U; = V, il existe une T E X(V) unique qui induise 1'; sur
I Si G v est un fibre vectoriel holomorphe sur une variete de Stein, done lui-meme de Stein, dont la base est de dimension N et la fibre de dimension d, il est de dimension N + d, done plongeable dans <e 2(N +d )+ 1. II n'est pas plonge comrne fibre vectoriel, mais J'ensemble des plans tangents aux fibres Ie long de la base (identifiee a la section nulJe) definit un fibre vectoriel isomorphe, dont toutes les fibres sont des sous-espaces vectoriels de <e 2(N+d)+1; G v est donc sons-fibre vectoriel holomorphe du fibre trivial V x <e2(N+d)+I. Ensuite tout sousfibre vectoriel d'un fibre vectoriel sur cette base est supplernente, done G v est facteur direct de VX <e 2(N+d)+1.
L. Schwartz / Gros Produits Tensoriels
701
chaque U;. On en deduira (K(V; GU))UEBor' qui se definit sur les cartes, et ensuite comme faisceau pour les reunions denombrables. Mais il existe ici un fait qui va tout faciliter: G v est toujours trivialisable, globalement, et peut done etre defini par une seule carte, G v ~ V x G .. En effet, soit (G u• ~ U; x GJnEN un atlas denombrable: alors V o, VI \ V o' ..• , U;\ V o\ VI' .. \ V n-I sont des boreliens disjoints au-dessus desquels G v est trivialisable, done il l'est aussi sur V, en definissant Gv~ V x G. par GU.lUol .. .IU._t ~ iU;\ V o\ ... \ Vn-I) X G. induit par G u• ~ U; X G.. Done les Propositions (1.1), (1.3) et le Corolla ire (1.2), sont eoidents. Comme la Section 2 sera Ie cas borelien traite ici, on peut se demander pourquoi nous avons mis la Section 1. C'est parce que l'analyse existe aussi, pas seulement les probabilites, et que la Section 3, probabiliste, relatif aux fibres de Stratonovitch, reviendra ala situation de la Section 1. D'autre part, G v est trivialisable, mais ses trivialisa tions ne sontpas' naturelies', aucune ne s'impose plusqu'une autre; il est trivialisable, non trivial, et on ne doit pas se le representer geometriquement comme trivial! De merne que dans Ie cas S'i = C€m, ou G v etait un fibre C€m, et la projection Gv~ V etait C€m, ici G v est canoniquement muni d'une tribu, que nous appellerons borelienne et pour laquelle la projection G v ~ V est borelienne: pour une trivialisation G u ~ V x G., c'est Ie produit tensoriel des tribus boreliennes de U et de G.; et il est equivalent de dire, pour une transition entre deux cartes G u ~ G x V., G u ~ G x V., qu'elle est (v, g)~ (v, a(v)g), a E Bor(V; 2(G,)), ou qu'elle est de cette forme et borelienne de U x G. dans lui-merne; et les Bor-sections de G u sont simplement ses sections boreliennes, pour les tribus boreliennes de U et de G u. Il va falloir encore generaliser legerement, en cessant de supposer la fibre de dimension constante, mais en la supposant de dimension finie ~ d < +00 fixe. Alors V sera suppose reunion disjointe de boreliens Vk , k = 0, 1, ... , d, et sur V k la fibre sera de dimension k (on aurait aussi pu Ie faire dans Ie cas topologique, les Vk etant alors des ouverts (et fermes) disjoints de V). Cela n'introduit evidemment aucune difficulte nouvelle, on peut toujours raisonner sur Ven sachant simplement que les raisonnements peuvent obliger a se ramener a chaque V k separernent; les trivialisations, G u ~ U X G k • supposeront toujours V dans un V k • Remarquons d'ailleurs qu'on peut trouver un fibre H v tel que G v ffi H v = B v , trivialisable de dimension fixe d, et les modules resteront projectifs de type fini.
702
L. Schwanz / Gras Produits Tensoriels
2. Sections-Differentielles de Semi-Martingales d'un Fibre Vectoriel sur un Ouvert ou un Optionnel A de IR+ U {oo} x
n
Soient (n, 0, IP', (.oT,)IER+U{oo}) (IR+ = [0, +00)), ayant les proprietes habituelles en probabilites, Soit A C IR+ U {oo} x D; it rernplacera Ie V de la Subsection 1.3. II sera optionnel ou ouvert. Sa tribu fondamentale (Bor V de la Subsection 1.3) sera sa tribu option nelle, intersection de A et de la tribu optionnelle de IR+ U {oo} x n. Une fonction ({J reelle optionnelle sur A est alors la restriction a A d'une fonction optionnelle ,p sur IR+ U {oo} x Il. On aura alors des fibres option nels G A sur A (Ia tribu optionnelle de A' x G., A' (relativement) optionnel dans A, G. espace vectoriel de dimension finie, est, par definition Ie produit tensoriel des tribus optionnelles de A et borelienne de GJ, et des espaces de sections Opt(A; G A ) , rernplacant les Bor(V; G v) ou d(V; G v) de la Section 1; Opt(A; GA ) est un Opt(A)-module, un Opt(A; £'(GA))-module a gauche. Si la fibre de GA est de dimension con stante, iI est trivialisable; sinon, il a un atlas fini. Soient g, f, deux semi-martingales continues formelles; on sait ce que veut dire g ~ f si A est ouvert; s'il est optionnel, eela veut dire 1A . g = 1A. f; cela passe aux reunions denornbrables A = Un An' tous ouverts ou tous optionnels. On appellera X, ou plutot dX; une classe d' equivalence sur A de semi-martingales continues formelles sur IR+ U {oo} x On abregera en disant que dX est une differentielle de semi-martingale sur A, et on appellera 9'At(A) l'espace de ces differentielles (note, dans [8], [7, Section 6], Opt 9'AtC(A)). Si A est IP'-negligeable, 9'At(A) = {O}. Soit ({J E Opt(A), restriction de ,pEOpt(IR+U{oo}xn), et gEdXE9'At(A); a une equivalence pres sur A, ,p dg est independante du choix de ,p, et du choix de g E dX, elle ne depend que de ({J et dX. (voir [7, 6.8]), sa classe se notera ({J dX E 9'At(A); 9'At(A) est un Opt(A)-module, pour l'integration stochastique. Si maintenant A' est un optionnel de A (done pas necessairement optionnel dans IR+ U {oo} x n, si A ne rest pas), on peut encore dire que deux semi-martingales continues formelles g, f, sur IR+ U {oo} x n, sont equiualentes sur A', si lA' d!f = lA' df dans A; une classe d'equivalence se notera dX E 9'At(A'). Si ({J E Opt(A'), dX E 9'At(A'), on a une integrale stochastique ({J dX E 9'At(A'); 9'At(A') est un Opt(A')module. On pourra raisonner ainsi sur les diverses parties optionnelles A' de A (non necessairement optionnelles dans IR+ U {oo} x n), comme sur les ouverts de V a la Section 1, ou les boreliens de Va la Subsection 1.3. Un
n.
L. Schwartz / Gros Produits Tensoriels
703
A' C A, optionnel, est dX -negligeable, si lA' dX = 0; un A' IP' -negligeable est dX-negligeable, et alors f:f.Al(A') = O. Pour a E Opt(A), a dX = 0 ssi a est dX pp.nulle, car a dX = al{a"O} dX; et l{a"O} dX = «1!a)l{a"O}) a dX Pour G A (appele G v a la Subsection 1.3), Opt-fibre sur A, on definira alors f:f.Al (A; GA)' espace des sections-difterentielles de semi-martingales du fibre G A au-dessus de A; mais il faut bien comprendre ce qu'il veut dire. Si X E f:f.Al(A; GJ, G. vectoriel fixe, ecrire X, E G. ou dX, E G., c'est la merne chose. Mais ici, pour X E f:f.Al(A; GA ) , ce n'est pas X, qui est dans la fibre G" cela n'aurait merne pas de sens puisque X n'est qu'une classe, c'est, symboliquement, dX, qui est dans G" dans le merne sens qu'a la Section 1 on se permettait d'ecrire T(v) E G; pour une distribution TE 9O'(V; G v ); c'est faux, mais c'est un lang age abusif qu'on sait controler; comme alors X, est l'integrale des X s' s ~ t, qui sont dans des G, variables, X, (qui n' existe meme pas) n' est plus nulle part. Le fait qu 'il s'agisse bien de dX, traduit Ie fait que la propriete d'Opt(A)-module de f:f.Al(A; GA ) est relative a l'integration stochastique, d(a' X) ou a dX On a des operations evidentes: si dX E f:f.Al(A; GA ) , a E Opt(A), a dX E f:f.Al(A; GA ) ; si a E Opt(A; 2(GA ; H A », a dX E fI.Al(A; HpJ; si a E Opt(A; G~), a dX E f:f.Al(A); etc.... On a des proprietes de faisceau pour les reunions denombrables: si (dX;)iEI est une famille, dXi E f:f.Al(A i; G A)' Ai option nels, U iEI Ai = A', s'il existe un J C I, denombrable, tel que U iEJ Ai = A', et si dX; = d~ sur Ai n A j , il existe dX unique sur A' qui, pour tout i, induise dX; sur Ai' ([7, (6.3ter)] etait relatif a des Ai ouverts optionnels, c'est valable, avec la meme demonstration, avec des optionnels Ai quelconques.) On peut mettre sur f:f.Al(A; GA ) une notion de convergence. Elle est 'locale' (par restriction sur des optionnels de A), done on peut se ramener au cas trivialise, et finalement a f:f.Al(A), On dira que des X n E f:f.Al(A) convergent vers 0, s'il existe (~n)nEN semi-martingales continues formelles sur IR+ U {oo} x Il, ~n E dXn , qui convergent vers 0 dans l'espace des semimartingales continues formelles (c. a d., rappelons-le, s'il existe 'Y optionnelle > 0 sur IR+ U {oo} x fl, telle que les 'Y soient dXn -integrables et que les 'Y dXn convergent vers 0 pour la topologie d'Emery des semi-martingales). Si alors an tend vers a dans Opt(A), dXn vers dX dans f:f.Al(A; G A ) , an dXn tend vers a dX dans f:f.Al(A; G A ) . On se ramene en effet au cas GA = IR. Soient alors an' a prolongeant an' a en fonctions optionnelles sur R+ U {oo} x fl; soit A l'ensemble des (t, w) pour lesquels an(t, w) tends vers a(t, w), il est optionnel; alors an dXn-
704
L. Schwartz / Oros Produits Tensoriels
a dX ;::I.1 A an dgn -l A a dg; or 1 A an converge partout vers 1A a, gn vers g done Ie 2e membre converge vers 0 dans 9'AL(R+ U {oo} x 'a), done Ie I" dans 9'AL(A) ([7, (2.5)]). Enoncons alors les Propositions (1.1), (1.3) et la'CorolIaire (1.2), dont iei la demonstration est evidente puisque 9'AL(A; G A) se ramene a 9'AL(A)@R G.:
Proposition (2.1). (1). Le dual (Opl(A; G A»* du Opl(A)-module OptlA; GA ) est Optt A; G~). (2). 9'AL(A; G A) Hom Opt(A)(Opt(A; G~);9'AL(A»= Opl(A; GA)@Opt(A) 9'AL(A), isomorphismes entre OpttA; .f£(GA»-modules a gauche.
=
Remarque (2.2). Cela donne 2 definitions supplementaires de 9'AL(A; G A). De la derniere il resulte que tout element de 9'AL(A; G A) admet des ecritures L k ak dX\ ak E Opt(A; G A), dX k E 9'AL(A), comme produits pour I'integration stochastique, d Lk(ak . X k ) = L k a k dX k ; c'est ce que nous avons appele dans [8] des representations tangentielles'. En outre L k a k dXk = 0 ssi L k a k @dXk = 0 dans Ie produit tensoriel, Si les fibres G t ont toutes la meme dimension et si on prend pour les (Xk une Opt(A)-base de Optt A; G A), l'ecriture est unique, i.e. L k a k dXk = a ssi toutes les dXk sont nulles. 2.1. Sous-Fibres Optionnels
Les sons-fibres optionnels H A d'un fibre optionnel GA se definissent comme a la Subsection 1.2.1, par des definitions legerernent modifiees: (1). H A est Opl(A)-engendre par un nombre fini de sections optionnelles de GA; (Ibis). Sur {(I, w) E A I dim Ht,w = k}, H A est engendre par k sections optionnelles independantes de G A ; (2). II existe un atlas denombrable, ou fini, ou compose d'une seule carte, sur chaque {(t, w) E AI dim G t; = k, dim H ~ = I}, et des trivialisations simultanees sur cet atlas: G ~ ~ A' x G., ou HA'~ A' X H.; H A est un sons-fibre de GA ssi son orthogonal H~ est un sons-fibre de G~. D'ou de nouvelles definitions analogues a (3) et (3bis). (4). L'application (t, w) ~ H, est une section optionnelle du fibre Gr GA des grassmanniennes Gr G (~w)' 01
01
1
Voir note 1 page 699.
01
L. Schwartz / Gros Produits Tensoriels
705
Ces proprietes de H A en font un fibre optionnel, et I'injection H A ~ GA est optionnelIe; H A est un sous-ensemble optionnel de GA (d'apres la definition (3». Inversement, si H A, GA sont fibres optionnels, H A eGA' et si I'injection est optionnelle, H A est sons-fibre optionnel; mais Ie fait que H A soit sous-ensemble optionnel de GA ne me parait pas suffire pour qu'il soit sous-fibre optionnel. Mais il y a en plus des proprietes toutes nouvelles ici, valables aussi pour la situation de la Subsection 1.3. La somme H A + K A de deux sons-fibres optionnels de G A est sous-fibree optionneIle, par la definition (1); done aussi I'intersection H A n K A = (H~ + K~).l. (Cela n'est vrai pour les cas anterieurs a la Subsection 1.3 que si I'on sait que les 'fibres' de H A + K A , H A n K A sont de dimension constante.) Si H A est sons-fibre optionnel de GA , il admet un supplementaire optionnel K A , H A E9 K A = GA' Les proprietes de somme et d'intersection sont encore vraies pour une somme + nEN H A et une intersection n n H A denombrables, car A est reunion finie "de parties optionneIles, sur" chacune desquelles la reunion ou I'intersection est finie. Enfin, si LA EO Opt(A; !£(GA; H A»), Ker LA et 1m LA sont des sons-fibres. option nels de GA et H A respectivement (ce n'est vrai a la Section 1 pour les cas anterieurs a la Subsection 1.3 que si les dimensions des fibres de Ker LA et 1m LA sont constantes)'. Si H A est un sons-fibre optionnel de GA , !J'At(A; H A ) est un sous Opt(A; !£(GA; H A», Ker LA et 1m LA sont des sons-fibres optionnels est injective, voir Subsection 1.2.1, en outre, si K A est un supplementaire optionnel de HA dans G A, Y'At(A; G A) = Y'At(A; H A) E9 Y'At(A; K A) (evident par trivialisation). Si dX EO YAt(A; G A), on dit qu'elle est tangente aHA' ou provient de H A, ou que dX; EO H" si elle est dans Y'At(A; H A), i.e. si sa composante (dans la somme directe) sur YAt (A; K A ) est nulle. Proposition (2.3). Soit dX EO 51'At (A; G A)' II existe un sous-fibre tangent essentiellement minimum d'une semi-martingale dX EO Y'At(A; G A); c'est un sous-fibre TA(X) = TA(dX) optionnel de G A, auquel dX est tangente, et tel que, si dX est tangente H A eGA' H A contienne TA(X) dX-ps. II est eoidemment unique, un ensemble optionnel dX-negligeable de A pres. Le fibre TA(dX) (defin! seulement dX-pp.) s'appelle Ie fibre tangent a x ou a dX.
a
1
Voir note 1 page 697.
a
706
L. Schwartz I Gros Produits Tensoriels
a
Demonstration I. (1). Si dX est tangente H A et K A , elle l'est aussi a H A n KA" En effet, on peut ecrire (evident par trivialisations simultanees) H A = (HA n K A) EB LA' K A = (HA n K A ) EB M A, et on a des sommes directes correspondantes pour tous les anneaux et modules consideres; en outre, H A + K A = (HA n KA)EBLA EBMA" Alors dX E gAleA; H A + K A) a sa composante sur gAleA; M A ) nulle puisqu'elle est tangente HA= (HA n KA)EBLA, mais aussi sa composante sur gAleA; LA) nulle puisqu'elle est tangente a K A = (HA n K A) EB M A , done dX E gAleA; H A n K A). (2). Le merne resultat est valable pour une intersection denombrable n n K A = KA" II existe en effet une suite de parties optionnelles An de reunion" A, sur chacune desquelles c'est une intersection finie; alors sur chaque An' dX est tangente aKA' done aussi sur A (posons G A = K A EB LA; si dXL A est la composante de dX sur LA' chaque An est dX L A -negligeable, done aussi A, done dX L A = 0). (3). II existe une classe d'equivalence VO de mesures finies ~ 0 sur A, muni de sa tribu optionnelle, qui domine dX: toute partie optionnelle dans A, vO-negligeable, est dX-negligeable. En effet, dX = Lk(Xk az', (Xk E Opt(A; GA), dZ k E gAl (A) reelle, et c'est connu pour des semi-martingales formelles reelles. La borne inferieure v de ces classes ~ 0 domine encore dX; elle est alors equivalente a dX, les parties de A v-negligeables ou dX -negligeables sont les memes. On appellera alors TA (X) une borne inferieure v-essentielle des H A eGA auxquelles dX est tangente; elle repond ala question, d'apres (1) et (2), puisqu'elle est une intersection denombrable de ces H A • On pourrait d'ailleurs garder une VO dominant v, et trouver un T~(X) borne inferieure vO-essentielle (voir [7, (2.8) a (2.9)]); il suffit de poser A = Al + A 2 , V O et v equivalentes sur AI' A 2 z--negligeable, et de poser T~(X) = TA(X) sur AI' = 0 sur A 2. 0
a
Pour l'enonce suivant, il est pratique de supprimer certains des indices A en bas:
Proposition (2.4). Soient G A , H A des fibres optionnels sur A, L: G A ~ H A un morphisme optionnel, dX E gAleA; GA); soient T(dX) et T(L dX) les fibres tangents dX et L dX.
a
(1). I [7, (2.8) generale.
T(L dX) = LT(dX) ,
L dX-ps..
a (2.9)]. La demonstration donnee ici est considerablernent plus
simple, et plus
L. Schwanz I Gros Produits Tensoriels
707
Plus precisement, comme dX domine L dX, mais ne lui est pas equicalente:
(Ibis).
r(L dX) C Lr(dX) ,
(Iter).
ri]:
dX):J Lr(dX) ,
LdX-ps. , dX-ps..
(2). L dX = 0 ssi r(dX) C Ker L, dX -ps., ou Lr(dX) = 0 dX -ps.
Demonstration. (1). dX domine LdX; car, si A' optionnel est dXnegligeable, lA' dX = 0, done IA,(L dX) = L(1A' dX) = O. Elles ne sont pas en general equivalentes, par exemple si dX,c 0 et L = O. (2). dX E gAleA; r(dX», done L dX E gAleA; Lr(dX», L dX est tangente a Lr(dX), done ril: dX) C Lr(dX), L dX-ps.. On ne peut pas ici remplacer par dX-ps.; par exemple, si L = 0, Lr(dX) = 0, mais, L dX etant nul, ril: dX), defini seulement a un ensemble L dX-negligeable
pres, est arbitraire. Ceci dernontre (Ibis). dX E gAleA; r(dX», et posons . r(dX) = (3). Partons de (r(dX) n Ker L)EB CT, somme directe de sons-fibres option nels supplernentaires. Alors L = 0 sur Ie premier, mais est inversible de 0" dans son image La = LT(dX); appelons L -1 son inverse, de La dans T(dX). Alors t:'t: vaut l'identite sur 0", 0 sur T(dX) n Ker L. X se decompose suivant la somme directe, dX = dX Ker LnT(dX) + dXu ' L dX = L dXu ' et dXu = L -I L dXu ' Alors L dX = O~ L dXu
= O~ dXu = O~ dX est tangente a Ker L n T(dX),
done, d'apres la minimalite de T(dX), T(dX) C Ker L n T(dX), dX-ps., c. ad. T(dX) C Ker L, dX-ps.. Ceci dernontre (2). (4). On a encore une decomposition en somme directe de deux sousfibres supplernentaires: LT(dX) = (LT(dX) n T(L dX»
EB p.
Appelons tt la projection sur Ie 2 0m • faeteur. On sait deja que L dX est tangente a LT(dX), par (Ibis), mais aussi a T(L dX) par definition, done a leur intersection: L dX E gAleA; LT(dX) n T(L dX».
708
L. Schwartz / Gros Produits Tensoriels
Done 7T(L(dX» = 0, ou (7TL)(dX) = O. Par (2), cela entraine que r(dX) C Ker L, dX-ps., done Lr(dX) C Ker 7T, dX-ps., ou Lr(dX) C Lr(dX) n r(L dX), dX-ps., ou Lr(dX) C ril: dX), dX-ps., ce qui est (her). 0 Remarque (2.5). Le resultat (her) peut paraitre etrange avec ce dX-ps., alors que ril: dX) n'est determine qu'a un ensemble (L dX)-negligeable
pres. Cela s'explique aisernent. Puisque dX domine L dX, A est reunion disjointe A = AI U A 2, ou dX et L dX sont equivalentes sur AI' et A 2 est (L dX)-negligeable. Alors L dX est nulle sur A 2, done r(dX) C Ker L sur A 2 , dX-ps., par (2); ou Lr(dX) = {O}, dX-ps. sur A 2• Sur A 2 , on peut modifier arbitrairement r(L dX), puisque A 2 est (L dX)-negligeable, mais il contiendra toujours Lr(dX) = {O}, dX-ps.. Corollaire (2.6). En reprenant l'identification de fa Proposition (2.1), Sl J E Opt(A; G~), dX E YAl(A; G A ) , J dX E YAl(A), alors J dX = 0 ssi r(dX) C Ker J, dX-ps., i.e. J est orthogonal a r(dX), dX-ps..
C'est Ie resultat de [8, Prop. (2.9)], mais servait de definition a r(dX). 2.2. Applications aux Semi-Martingales Differentielles de Cia sse ee 2
a l'envers;
a Valeurs dans des
dans [8], ceci
Vonetes
Soit un fibre borelien 'flv sur un ensemble V muni d'une tribu, Subsection 1.3, et soit Y un processus sur A C IR + U {oo} x Il, it valeurs dans V, optionnel, c. a d. mesurable pour la tribu optionnelle de A et la tribu borelienne de V. Alors Y definit un 'fibre image reciproque' (pull-back en anglais), GA = Y*('flv ) ' La fibre Gt,w est, par definition, Gl,w = 'flY(t,w); et Y definit un morphisme yo d'espaces fibres GAl::; 'flv' (t, w, g)~ Y°(t, to, g) = (Y(t, w), g). A l'isomorphisme de transition (/J entre deux cartes: V' x 'fl. ~ V' x 'fl., defini par (v, g) ~ (v, a (v)g), a fonction borelienne a val~urs dans !£('fl), correspondra un isomorphisme de transition A' x G.~A'xG., A '=",y- 1(V') (A' optionneI), G.='fl., (/J°Y o= YOol/J, I/J defini par (t, w, g)~ (t, to, a(Y(t, w»g) = (t, w, f3(t, w)g), ou f3 = a ° Y est optionnelle sur A' a valeurs dans ';£(G) = ';£('flJ On a fait l'hypothese que 'fly admet un atlas denombrable, alors GA aussi, et les cartes precedentes font de GA un fibre vectoriel optionnel sur A. En fait, si les fibres de 'fly sont toutes de merne dimension, il est globalement trivialisable, et GA aussi;
L. Schwartz / Gros Produits Tensoriels
709
sinon, comme les dimensions sont bornees, Cf}vet GA admettent un atlas fini. II y a alors bijection entre les sections optionnelles g de GA (g optionnelle de A dans G A' g(t, w) E G I, w= Cf} Y(~ w»)' et les processus optionnels y sur A a valeurs dans Cf}v' Ie long de Y, y(t, w) E Cf} Y(~w), que nous avons consideres dans [6] et [8], g=yoY, g(t,w)=y(Y(t,w)). L'Opt(A)-algebre Opt(A; GA) pourra se noter ici Opt(A; Cf}v; Y), Y indiquant qu'i1 s'agit de processus le long de Y. Apparemment la situation (Cf}v; Y) est plus cornpliquee que la situation G A , mais elle est plus proce de la verite geornetrique; tout Ie monde se repesente I'espace tangent a Ven un point v de V camme ayant pour 0 le point v, on identifie Va la section nulle d'un fibre vectoriel Cf}v, et ici aussi on voit y Ie long de Y plus facilement que g; mais de toute Iacon la theorie genera Ie estplus aisee sur les fibres option nels GA sur A que sur les processus optionnels a valeurs dans Cf}v Ie long de Y; c'est ce qui fait que les pages precedentes sont bien plus faciles que celles de [8]. Alors une differentielle de semi-martingale sur A, a valeurs dans GA , c. a d. un element dX de Y.Jl(A; G A), sera aussi une differentielle de semi-martingale continue formelle sur A a valeurs dans Cf}v le long de Y; et on ecrira dX E Y.Jl (A; Cf}v; Y); on imagine dX1E Cf} y t = G1, et c'est un abus de langage qu'on se permettra. La Proposition (2.1) s'ecrira ici: (2.1)
(Opt(A; Cf}v; Y))* = Opt(A; Cf}t; Y), Y.Jl(A; Cf}v; Y) = Opt(A; Cf}v; Y)Q9OpI(A)Y.Jl(A) = HomOPI(A)(Opt(A; Cf}t; Y); Y.Jl(A)) ,
comme Opt(A; 2( Cf}v); Y)-modules a gauche. Dans [8], c'est HomOpt(A)(Opt(A; Cf}t; Y); 9'.Jl(A)) qui nous avait servi de definition: dX definit I'application Opt(A)-lineaire J ~ J dX = d(J· X), de I'Opt(A)-module Opt(A; Cf}t; Y) des processus optionnels a valeurs dans Cf}t le long de Y; dans I'Opt(A)-module Y.Jl(A) des classes d'equivalence sur A de semi-martingales continues formelles, module pour l'integration stochastique. J'espere qu'on verra sans peine que tout ici est plus simple et naturel que dans [8]. Ce qui s'appelle ici Y et dX, s'appelait, dans [8], X et duo La notation du ou u venait de ce que u etait un 'operator', du E HomOP1(A)' ... , alors que les methodes employees ici montrent qu'en fait iI s'agit bien d'une differentielle dX de semi-martingale continue formelle, qui, au lieu d'etre a valeurs dans un vectoriel G., est section d'un fibre GA' Nous allons maintenant donner la signification geometrique de la [or-
710
L. Schwartz / Gras Produits Tensoriels
mule d'Lto, ou du 'principe fondamentaI' de [8, (2.1)], qui dit que, si Yest une semi-martingale continue a valeurs dans une variete V de classe ee 2 , d.Yt est 'petit vecteur 2-tangent' a Ven Y,: Proposition (2.7). Si Yest une vraie semi-martingale continue sur l' au vert A de ~+ U {oo} x va leurs dans V, oariete de classe Cfi2, il existe une dX = dyE fIJ.Jtl (A; T 2 ( V); Y) unique, telle que, pour toute carte vectorielle Cfi2 de V, V' ~ U', Y' image de Y; V' ouvert de V; U' ouvert d'un espace vectoriel de dimension finie E, done T 2 ( V') ~ U' x (E EB (E 0) E» = U' x 'if}., l' expression de dX = d Y soit ce que nous avons defini anterieurement comme d y', soit (A' = Y-!(V'»
na
-
d Y' = (
dY'
i d] Y',
Y']
)
E fIJ.Jtl(A'· E en (E 8 E» = fIJ.Jtl(A'· 'if}). , Q7 , •
Demonstration. Notons que T 2(V) aurait dfi ecrit (T 2(V»v, il est compris comme fibre sur V; personne n'objectera! Notons aussi qu'ici Y est une vraie semi-martingale continue A C IR+ U {oo} x (A ouvertj-» V; d'apres
n
la definition de Stricker-Meyer'; c'est un vrai processus, non une classe. Comme U' E E, on sait qu' Y' definit alors une classe d'equivalence sur A de semi-martingales continues formelles (formelles ne peut pas ici etre supprime) sur IR+ U {oo} x n, notee d Y'. On sait aussi que la formule d'Ito reste vraie pour de telles semi-martingales continues y', [7], en termes de classes de semi-martingales continues formelIes; d'ailleurs dry', Y'] n'existe qu'en terme de ciasse de processus a variation finie continus formels. Notons qu'on prend ici non seulement une carte de T 2(V), mais de V elle-rneme, sur U' C E. Si, au-dessus du merne ouvert V' de V, on a une autre carte, V'~ W'CF; T2(V')~ W'x (FEB (F8F» = W'x fJi., la formule de transition est u~w
= (u) ,
(u, (~»~ (w, G», '(u)
O(u) =
(0
G) =
O(u)(~),
tP"(u) tP'(u)® tP'(u») '
1 Voir [8, (2.20)], et [7, Prop. (6.5 quinto)]: Y est une semi-martingale continue sur A ouvert a valeurs dans V, s'il existe une suite d'ouverts An optionnels dans A, de reunion A, telle que, dans chaque, An, Y soit restriction d'une semi-martingale Y n sur R + U {oo} x n (non necessairement continue), et si Y est continue sur A; Y est alors optionnelle dans A. Si VeE, Y est alors equivalente, sur A, a une semi-martingale continue formelle sur R+ U {oo} x u, [7, (6.4ter)]. Pour la formule d'Ito dans ce cas, voir [7, (6.8)].
L. Schwanz / Gros Produits Tensoriels
711
et Z' = 4>(Y'). Reportons cette formule sur G A ; si A' = y-I(V'), GA , """"> A' x 1(, GA , """"> A' x fJi., cette formule s'ecrit (t, w, (p»
"""">
(t, w, G»,
avec
G) = B(¥'(t, w»(P) , ou 'Y
_
(8) -
(4)'(Y'(t, 0
w»
w»
4>"(¥'(t, 4>'(Y'(t, ® 4>'( ¥'(t,
w»
w»
)
a
(,8).
Pour avoir un systeme coherent de differentielles de semi-martingales, it faudra qu'on ait les memes formules, avec multiplication de la differentielle par BoY', ou integration stochastique. Si done dX a pour images d1] E Y,;{{(A';
on devra avoir
d~ =
gJ,
d~
E Y,;{{(A'; fJi,),
B(¥') dn, ou
4>'(¥')
d( =
(0
4>"(¥') 4>'(¥')©> 4>'(¥'») d n .
En prenant dY' d1] - dY' - ( ) - - - ~d[Y', ¥'] ,
on trouve bien dY
-
!> -
dZ' - ( -
-
dZ'
~
d[Z', Z'] ) ,
car la formule d'Ito donne (2.2)
dZ' = 4>'(¥')+ 4>"(¥'Hd[Y', ¥'], ~ d[Z', Z'] = 4>'(Y')©> 4>'(Y'H dry', ¥'] .
712
L. Schwartz / Gros Produits Tensoriels
Cela prouve bien, en vertu meme de la definition par cartes de Y.JJ(A; G A)= Y.JJ(A; V; Y), que Ies dy' des cartes T2(V)~ V' x (E EB (E 0 E» definissent une section differentielle dX; on l'appellera encore dyE Y.JJ(A; V; Y), avec l'ecriture intuitive d Y, E T 2(V; ~). 0
Remarque (2.8). Cet enonce, permis par les notations directes avec produits tensoriels, ne pouvait pas figurer dans [8], oil n'existaient que des operateurs du: Opt(A; T*2(V); Y)~ Y.JJ(A). D'autre part nous avons signale, au debut de la Section 2, que, merne si G A etait en fait trivialisable (puisque ici la dimension des fibres etait constante), if n' eta it jamais donne comme trivialise : T 2 ( V) n'est pas trivialisable en tant que fibre ~o, il Ie devient, mais tres peu intuitivernent, comme fibre borelienl Remarquons aussi qu'on processus continu optionnel Y (ou optionnel merne pas continu!) definit un espace fibre optionnel GA = Y*(T 2(V», done un espace 9'.JJ(A; G A ) = Y.JJ(A; T 2(V); Y), mais que, si Y est une semi-martingale continue sur A (une vraie, pas une classe l), elle definit en outre un element particulier de cet espace, d Y Nous avons donne Ie maximum de details honnetement 'ecrivable' de la demonstration, mais on voit bien qu'elle est triviale! (Une fois faite une telle demonstration une fois pour toutes, on peut se contenter de dire ensuite; evident par des cartes de v.) Les concepts introduits ici simplifient enorrnement les choses! Remarque (2.9). On peut prendre une application t/J de classe rg2 de V dans WElle definit une application de T 2(V) dans T 2(W), et bien evidemment, si Z = t/J(Y), on a un morphisme optionnel des fibres images reciproques: Y*(T2(V»~ Z*(T2(W», qu'on peut ecrire (T 2(V); Y)~ (T2(W); Z) (fibres 2-tangents Ie long de 1'; Z). Dans la demonstration ci-dessus, en prenant sur V et W des ouverts domaines de cartes tels que ;p envoie Ie premier dans Ie deuxieme, on a les memes formules avec 4J, e, V', E 0 (E 0 E), W', FEB (F 0 F), mais 4J n'est plus inversible, e non plus, ce dont on n'avait nul besoin dans la demonstration. Cela veut dire que t/J definit des morphismes V ~ lv, Y ~ Z, T2(V)~ T 2(W), (T 2(V); Y)~ (T 2(W); Z), Y.JJ(A; T 2( y ); Y) ~ Y.JJ(A; T 2(W); Z), et dy ~ dZ'. Proposition (2.10). La differentielle d y de fa Proposition (2.7), consideree comme operateur Opt-lineaire de Opt(A; T*2(V); Y) dans Y.JJ(A), est fa 21p(Y) = D E Opt(A; T*2(V); Y), Ip fonction reelle rg2 sur V, seule qui, associe
a]
L. Schwanz / Gras Produits Tensoriels
713
J dY = d(~(Y»,
differentielle de la vraie semi-martingale continue
~(Y).
En termes integres,
le second membre est un vrai processus, une semi-martingale continue sur A, le premier une classe d'equioalence sur A de semi-martingales continues formelles sur IR+U{oo}xfl; au lieu de~, on pourrait mettre 3, Ie 2 0me membre est l'element de la classe definie par le l". Demonstration. C'est dans [8, (2.7)]. Mais c'est devenu plus simple. D'abord d Y a cette propriete. En effet, il suffit de Ie montrer dans une carte; pour simplifier, identifions done Va un ouvert de E. Alors D 2rp = (~' ~"), D;(Y) = (rp'(Y) rp"(Y», son image par dY- ( dY ) - - ~d[Y, Y]
est donc
(~'(Y) ~"(Y»C
[dY
zd Y, Y]
)=~'(Y)dY+~~"(Y)d[Y,Y]=d(~(Y»,
par Ito. Inversement, soit dX E gAl(A; T 2(V); Y) ayant cette propriete. Alors dX et dY prennent la merne valeur sur les D2~(y), ~ E ~2(V; IR). Mais toutes les deux ont un caractere local; soit V' ouvert de -v, V' ouvert, V" C V'; si ~ E ~1(V'; IR), iI existe iP E ~2(V; IR), egale a ~ sur V". Done dX et d Y ont me me valeur sur D 2 iP (Y), donc me me valeur sur D2~(y) dans Y-\V"); V" etant quelconque, aussi dans Y-1(V'). Mais les D2~(y), tp E cg2(V'; ~), engendrent I'Opt(A')-module Opt(A'; T*2(V'); Y), A' = y- 1(V'), si V' est une domaine d'une carte de V (on prend pour ~ les fonctions coordonnees et les produits de deux fonctions coordonnees dans cette carte), done dX = d Y dans A' = y-l(V'); par un atlas denombrable, dX = dY dans A. 0 On peut remplacer gAl(A) par 'Y(A), espace des differentielles de processus a variation finie formels, Al(A), espace des differentielles de
L. Schwartz / Gras Produits Tensariels
714
martingales continues formelles; .'f'At(A) = 'V(A)EBAt(A), tous Opt(A)modules. D'ou, si G A est fibre optionnel sur A, .'f'At(A; GA) = 'V(A; GA)EBAt(A; G A); pour dX E .'f'At(A; G A), on ecrira dX = dX c + dX. Si Y est optionnel sur A a valeurs dans une variete borelienne ~ on aura des processus Ie long de Y, et .'f'At(A; <§y; Y) = 'V(A; <§y; Y)EBAt(A; <§y; Y). Si Y est une semi-martingale vraie a valeur dans ~ variete de c1asse cg2, on aura une decomposition d Y = dycEBdY; on voit, par la formule d'Ito, que d r" est tangente a T'(V) Ie long de Y, d y c E .'f'At(A; T'(V); Y), d Y~ E T'(V; Y;) selon les expressions de la Proposition (2.3), c'est pourquoi nous avons ecrit d Y", non dYc. Mais, bien sur, pour dX E .'f'At(A; T 2 ( V) ; Y) arbitraire, dX c n'est
a
pas en general tangente T 1• Si maintenant dX, dX' sont dans .'f'At(A; G A), on peut definir un crochet ~ d[XC, X'C] = ~ d[X, X'] E 'V(A; GA G A) (par trivialisation). Si Y est optionnelle sur A a valeurs dans V borelienne, et X, X' E .'f'At(A; <§v; Y), ~d[X,X']E 'V(A; <§y0<§v; Y). Supposons que <§y= T 2(V); on sait seulement que ~d[X, X'] E 'V(A; T 2(V)0 T 2(V); Y). Mais en merne temps, si 7T est la projection canonique de T\ V) sur
o
T 7T
2(V)/T\V)
c
T'(V)0 T'(V),
dX E .'f'At(A; T'(V)0 T1(V); Y) C
«a;«, T
2(V)
8 T 2(V); Y).
On dit que dX E .'f'At(A; T 2(V); Y) est crochet-stable, si 7T dX = ~ d[X, X], qui est alors dans 'V(A; T 1(V)0 T1(V); Y). On verifiera que c'est equivalent a la definition (3.3) de [8], mais elle est plus elegante, sans dualite. Cela entraine que dX c E At(A; T1(V); Y). Mais c'est bien plus precis: dX est crochet-stable ssi, dans toute carte de ~ identifiee a un ouvert d'un espace vectoriel E, dX a la forme [ ! [dZ ]], 2 d Z,Z
dZ E .'f'At(A; T'(E); Y);::; .'f'At(A, E).
(C'est bien invariant par changement de carte, car, si E changement, dZ devient cP'(Y) (0
cPl/(Y) CP'(Y)@
dZ
~
F est un tel
CP'(Y) dZ + ~CPl/(Y) d[Z, Z]
cP'(Y))(~d[Z,Z])= ( ~CP'(Y)@ cP'(Y)d[Z,Z]
),
L. Schwartz / Gros Produits Tensoriels
715
au la deuxieme cornposante est bien Ie demi-crochet de la premiere)'. Si alors Y est une semi-martingale vraie sur A a valeurs dans V, on a exactement, dans toute carte, dY- ( dY ) - - ~ dry; Y] ,
d Y est crochet-stable. 2.3. L'Expression: 'Si Yest une (vraie) semi-martingale continue sur A valeurs dans V; dYt est un petit vecteur 2-tangent a V en ~'
a
On a ecrit cette formule comme un principe intuitif; on I'a aussi ecrite dYt E T 2(V; ~). On aurait pu souhaiter avoir une formule meilleure, avec des vrais vecteurs, pas petits. II faut bien se convaincre que c'est impossible (a part Ie fait qu'une chose impossible devient parfois possible dans I'avenir!). Considerons deja Ie cas deterrniniste, n reduit a un point. et Y une courbe [0, +oo)~ V, de c1asse cg' ([0, +(0) au lieu de [0, +00], pour parler de derivee), En chaque t, existe une derivee, un vrai (pas petit) vecteur tangent Y'(t) E T'(V; ~). C'est cette image qui nous fascine; remarquons que d Y = Y'(t) dt est deja en partie un petit vecteur tangent, mais on peut faire qu'il ne Ie soit pas, si on decide que dt est Ie vecteur tangent 1 a IR, au point t. Mais supposons maintenant Y seulement localement a variation finie, et Iii on peut reprendre [0, +00] si I'on veut, ou un ouvert A de [0, +00]. Alors Y'(t) dt ne represente plus rien d'interessant, merne si on definit Y'(t) seulement Lebesgue-presque partout, car, pour V = IR, Y n'en est pas la primitive. II existe des fonctions ZO reelles a variation finie (et merne croissantes ;3 0) telles que dZ" domine d Y (par exemple domine les coordonnees de d Y dans un plongement de V dans IR 2N); on peut aussi en trouver une qui soit equivalente a d Y; soit dZ. Alors on pourra prendre des derivees par rapport a dZ ou d.Z", et ecrire d Y = H dZ = HO dZO. On trouve bien un vrai vecteur (pas petit) HI' H~E T 1(V; ~). Mais il est defini seulement dZ-pp. au dZo-pp., 1 Comme on Ie voit, c'est avec une certaine prudence qu'on ecrire gAleA; T 1(E); Y) == gAleA; E). Avec gAl (A; E). on perd de vue que c'est le long de Y; fa forrnule de changement de carte utilise C/J'(Y), C/J"(Y), et non C/J'(Z), C/J"(Z)!
716
L. Schwartz / Gros Produits Tensoriels
pas partout. En outre, Ie choix de HI" H~ depend completement du choix de Z, Z"; Z n'est pas unique, dZ est unique a une equivalence pres. La seule chose reelle est H, dZ" au H~ dZ~ E T'(V; Y,), petit vecteur, defini seulement dZ au dZo-pp.. C'est tres symbolique! Si dZ, = at dZ~, a fonction borelienne sur [0, +00], defini seulement dZo.pp., alors H~ = Ha, dZo-pp.. On peut se desesperer si I'on est pessimiste, mais il n'y a den d'autre, Par contre, si l'on prend la chose globalement, pas en un point t. mais sur [0, +00], d Y (ou d y, mais c'est a valeurs dans T'( V» existera comme element de 'Y([O, +00]; T1(V); Y) = 'Y([O, +00]; G[o.+",), G[o. +"'j = Y*(T'( V», d Y differentielle de fonction continue localement a variation finie, a valeurs dans T'(V), Ie long de Y (Elle est merne bien, si l'on veut, la 'differentielle de Y', qui est a valeurs dans V,) Et nous nous sommes permis d'ecrire d Y, E T1(V; Y,). Rien de ponctuel (valeur en t) n'existe ici vraiment, pas plus qu'une distribution n'a de valeur en un point, ou qu'une fonction de L P n'est une fonction, ou qu'une integrale stochastique H . Z n'a de valeur en un point (t, w). Si en plus Y est une semi-martingale continue (vraie) sur A a valeurs dans V, il n'y a pas une semi-martingale scalaire Z qui la domine, mais seulement un systeme fini, et on a seulement des representations tangentielles d Y; = L k Hk,t dZ~, Zk semi-martingales continues reelles, H k.: E T 2( V; Y,), H, option nels a valeurs dans T 2(V). Encore faut-il voir quand deux representations tangentielles definissent Ie merne objet; c'est exactment quand L k H, 0 dZ k est nul dans Ie produit tensoriel, ce qui revient a la Proposition (2.1): 2 dyE [f'At(A; T\V); Y) = Opt(A; T ( V) ; Y) 0 Opt(A) [f'At(A). Dans Ie cas deterrniniste ou Y est a variation finie, on se permettra bien de dire que dYE 'V(A; T'(V); Y) est la differentielle de Y; on pourra done aussi, si Y est une semi-martingale continue vraie sur A a valeurs dans V, dire que dYE [f'At (A; T 2 ( V); Y) est la differentielle de Y. II n 'y a pourtant pas vraiment de petit vecteur d Yt E T 2 ( V; Y,), mais il y a une diflerentielle de semi-martingale continue formelle a valeurs dans T 2( V) Ie long de Y. L'ouvrage [8, (3.17) a (3.17 quarto)] donne un grand nombre de proprietes de ces differentielles de semi-martingales a valeurs dans des fibres C§v Ie long de Y. En particulier, l'ecriture d'une equation differentielle stochastique sur une variete
n
L. Schwartz / Gros Produits Tensoriels
717
une IP' -martingale locale continue', equivaut a dire que, pour IP', X est une 2 T ( V); X) [8, (3.16 semi-martingale, et que d.J"~ = L(X,) dt dans 'V([O, bis)]. C'est une sorte d'equation differentielle stochastique, mais ou c'est la trajectoire X qui est connue (c'est la trajectoire canonique, XI = w,), et c'est la probabilite IP' qui est inconnue. Je montrerai dans un article ulterieur, par une methode nouvelle, comment, lorsque Lest strictement elliptique lipschitzienne, une telle equation a une solution unique IP' X, pour laquelle X o = x [p'x_p s.; la methode n'utilise aucune des methodes anterieurs de cartes, ni l'enroulement du brownien usuel de IR N comme dans Eels-Elworthy', mais on se rarnene directement a la solution d'une equation differentielle stochastique sous la forme normale, ou l'existence et l'unicite sont connues, en utilisant les ecritures glob ales ci-dessus.
n
2.4. Martingales par Rapport
a une Connexion
Cette notion a ete introduite par J.M. Bismut, mais a ete aussi, presque simultanement, ou un peu apres, etudiee par Eells et Elworthy, Darling, Ikeda, P.A. Meyer'. Tout d'abord, [8], j'ai introduit des notations pour la connexion qui ne sont pas celles de tout Ie monde; j'ai ecrit r;'j ce que l'on ecrit habituellement - r;, i : Ie fais ici la rectification, r~j sera celui de tout le monde; en particulier, pour la connexion (symetrique ou sans torsion) de Levi-Civita sur une variete riemannienne, rL = t~}, symboles de Riemann-Christoffel. Soit V une variete de classe ce 2 , i f ou rune connexion lineaire sans torsion sur Ie fibre tangent, de ciasse ceo. On sait que r n'est pas intrinseque, seul if I'est. Dans chaque carte de la variete sur un ouvert d'un espace vectoriel E de base (ek)~=l' pour x E E, F(x) est une application bilineaire symetrique de Ex E dans E, qui, a ~ = L~=I eOk' 1J = L~=I 1JkOk' fait correspondre 1 Pour les methodes utilisant les cartes, voir par exemple [1], [5, Chap. VI]. Pour la methode d'enroulement du mouvement brownien usuel, voir [3, chap. V Section 4]. 2 On pourra par exemple consulter [4, page 54J, ou l'article de R. W.R. Darling, qui donne une bibliographie; Darling dernontre une reciproque de la Proposition (2.11) (plus delicate que la proposition directe), qu'il prend comme definition d'une r-martingale. La definition de geometrie differentielle ()d f = 0, (2.14), me semble plus satisfaisante; c'est aussi celie de Paul-Andre Meyer [1], ecrite ici en termes de sections-Idifferentielles de semi-martingales) de fibres vectoriels.
L. Schwartz / Gros Produits Tensoriels
718
N
(2.3)
F(x)(~,TJ)=
L
rL~iTJjak'
k, i,j=l
Done r(x) est aussi une application lineaire de EO E dans E. La liaison de F, non intrinseque, avec 0', intrinseque, est la suivante: pour tout ~ E T1(V; x), ou (x,~) E T 1(V), O'(x,~) est une application lineaire de T 1(V; x) dans T 1(T1(V); (x, ~», done, pour les cartes E et (E x E) de T1(V; x) et T1(T1(V); x, ~), s'ecrit: O'(x, ~)TJ = (TJ, - F(x )(~, TJ» .
(2.4)
Done Ex E
r(x)(~, TJ) E
E n'a pas de sens intrinseque, mais (TJ, - r(x)(~, TJ» E
== T 1(T1(V); (x, 0) est intrinseque.
Mais il existe une application lineaire canonique r(x, 0 de T 1(T\V); (x, ~» dans T 2(V; x), definie comme suit: si (() E ~\V; IR), tp' est une fonction ~1 sur T1(V), et, pour A E T 1(T1(V); (x, ~»:
En reprenant les cartes precedentes, (X, E) E E x E: (2.6)
Alors r(x,
r(x, ~)(X, E) = E ~)(O'(x, ~)TJ)
+ (~8X) E E EEl (E8E).
a un sens intrinseque, et s'ecrira p(x )(~, TJ): pour
~,TJEE,
(2.7)
p(x)(~, TJ) = -F(x)(~,
TJ)EEl (~8 TJ)E E EEl (E8E).
On voit que p(x) est une application bilineaire symetrique de E x E dans E EEl (E 8 E), done lineaire de E 8 E dans E EEl (E 8 E), et, comme elle est intrinseque, de T 1(V; x)8 T 1(V; x) = T 2(V; x)/T1(V; x) dans T 2(V; x); si 2(V; 1T est la projection canonique de T x) sur son quotient, on voit que 1(V; 7Tp = Id, done p(x) est un relevernent de T x)O T1(V; x) dans 2(V; 1(V) 2(V). T x), p de T\V)8 T dans T Alors Im(p) est un supplernentaire S(V) de T1(V) dans T 2(V), S(V) est un fibre ~o de dimension ~N(N + 1), on a la decomposition en somme directe
719
L. Schwartz / Gros Produits Tensoriels
(2.8)
Le projecteur de T 2(V) d'image S(V) est p7T, Ie projecteur d'irnage T 1(V) est Id - p7T, et definit (): T2(V)~ T1(V), egale a l'identite sur T 1(V). La formule dormant p (2.7) permet aisernent d'obtenir les composantes dans une carte vectorielle de V (T1(V):::= E x E, T 2(V) = Ex (E tJj (E 0 E)), T 2(V)/T1(V) = Ex (E 0 E)): (2.9)
u
u
u
(J=(vh1tJj(v)s=
(U + r(X)V) (-r(X)V) 0 + V
.
et
1
()(X)(~) = U + r(x)v .
Passons aux duals, on aura des formules transposees. Le sous-espace P*(V) est I'orthogonal de T1(V), P* = Tl.L; (voir [[8], Section 1] pour les notations, notamment P* [8, (1.1)]) c'est done Ie dual de T 2/T 1, ou espace des formes bilineaires syrnetriques sur T 2 x T 1 ou des formes lineaires sur T 10T1; et Ie quotient T*2/p* est T*l. Sur une carte vectorielle, T*2(V;x)=E*tJj(E0E)*, P*=(EOE)*. La transposee r*(x) (non intrinseque), sur une carte vectorielle, est une application lineaire de E* dans (E 0 E)* = E* 0 E*; elle a naturellement, relativement a une base, les memes coordonnees r7,j' On a alors une decomposition en somme directe: (2.10)
donc (2.11) (a
(3)=(a
(3)s~+(a
(3)Tu=(a
T*(x)a)+(O
-r*(x)a+f3),
ou F" (x) a est aussi a r(x). A partir de la, on peut definir les objets lies a cette connexion: (1). Soit tp une fonction reelle cg2 sur V (ou un ouvert de V); on dit qu'elle est a-conuexe ou r-convexe, si p" D 2cp ;:,; 0; D 2cp(x) E T*2(V; x), 0
1 Dans [8]. j'ai note les elements de E E9 (E 0 E) par des matrices verticales 11 2 lignes et 1 colonne, (~). et les elements de E* E9 (E 0 E)* par des matrices horizontales, 11 1 ligne et 2 colonnes, (a (3).
L. Sch wanz / Gros Produits Tensoriels
720
p*(x)D 2cp(x) = (D2cp(x))TLl E P*(V; x), est une forme bilineaire syrnetrique sur T 1(V; x) x T 1(V; x), il est done possible de dire qu'elle est ~o. Dans une carte sur E, par (2.11), cela s'ecrira, puisque D 2 cp (x ) = (cp'(x) cp"(x)),
(2). Soit Y une semi-martingale continue (vraie, pas une differentielle) a valeurs dans V, definie sur A ouvert optionnel de IR + U {oo} x n. Elle definit dYE 9'Al(A; T 2(V); Y). Alors elle a une image par 0: T2(V)~ T 1(V), 0 dY E 9'Al(A; T 1(V); Y), qui s'ecrit, dans une carte, par (2.9): (2.13)
OdY=O(! [dY ])=dY+~r(Y)d[Y;Y] 2 d Y; Y
E9'Al(A;E),
en coordonnees, (0 d y)k = d y k + ~
N
2: rL(Y) d[y
i
,
yi) .
i.i~1
On dit alors que Y est une if-martingale, ou r-martingale, si () d Y est une differentielle de martingale, 0 dyE Al(A; T\V); Y), done, dans une carte, E Al(A; E). En utilisant la decomposition 9'Al = 'YE!1Al, d Y = dye + d.f, et sachant que dycEAl(A; T 1(V); Y), done = OdY c, eela revient a od¥ = 0, ou d¥ E 'Y(A; S(V); Y) au, dans une carte: (2.14)
- 0 (1 d Y ) - 1 OdY= ) =dY+ 2F(Y)d[Y, Y)=O. 2d [ Y, Y
Notons que la notation differentielle cache une integration stochastique; en termes integres: O'
¥= Y+~r(Y)·[y; Y).
Bien entendu, cette propriete peut s'exprimer par dualite: Y est une r-martingale ssi, \:IJ E Opt(A; T*I(V); Y), (0* J) d¥ = JO d¥ = 0; 0* : T* 1 ~ T* 2 est un relevement. On a toujours les deux decompositions: 9'Al = AlE!1 'Y, d Y = cE!1d¥, dy et T 2(V) = T 1(v)E!1S(v), (2.8), dY= dYTIE!1dYs ' Alors Y est une r-martingale ssi ces deux decompositions coincident.
721
L. Schwartz / Gros Produits Tensoriels
La relation fondamentale entre fonctions convexes et r-martingales est la suivante. La formule d'Ito, d(;(Y)) = D 2cp(Y) d
Y = cp'(Y) d Y + ~tP"(Y) d[ Y, Y],
peut s'ecrire, par un regroupement qui correspond aux deux decompositions duales intrinseques (2.8) et (2.10), appliquees a D 2cp ( Y ) et a dY: (2.15)
d;cY) = cp'(Y)(d Y + ~r(Y) dry; Y]) +~(-cp'(Y)r(Y)+cp"(Y))d[Y;Y].
Alors: Proposition (2.11). Si cp est une fonction r.g2 reelle r-convexe sur V, si Yest une r-martingale sur A, dcp(Y) est une differentielle de sous-martingale (classe d'equioalence sur A de sous-martingales continues formelles); si A = IR+ U {oo} x il, cp(Y) est une sous-martingale locale continue. Demonstration. Dans (2.15), Ie premier terme est nul, puisque Y est une r-martingale, et Ie second est ~ 0, puisque - e'T + cp" est une forme -.J bilineaire ;;,:0 sur E 0) E et d[ Y, Y] ~ O. Done dcp ( Y) ;;,: 0, qui est Ie resultat cherche'. 0 La decomposition (2.8) permet de decomposer un operateur differentiel d'ordre "';;2, section de T 2(V), en somme d'un operateur d'ordre 1, section de T1(V), et d'un operateur purement d'ordre 2, section de S(V). Supposons que r soit la connexion de Levi-Civita d'une structure riemannienne sur V. II est bien connu que Ie laplacien L1 est alors un operateur purement d'ordre 2, c. ad. L1 E S(V). Cela resulte de la formule qui donne Ie laplacien sur une carte: N
L1
= £J ""
~i=l
N
gi,i
o.s. - £J "" s"r:.a =r ',J
Fg
) kg'
'I
~i,k=l
ou gi,i aA E EO E, donc L1 E Spar (2.9) (Peu importe pour cela ce qu'est g; 1 Le fait que 'd Y est une differentielle de sous-martingale continue formelle' equivaut d y,,; 0 se voit essentiellement a [6, Prop. (3.10)].
a
722
L. Schwanz / Gros Produits Tensoriels
en fait c'est I'image 1TL1 de L1 dans T1(V) 0 T1(V), c'est la forme quadratique fondamentale sur T*( V), et L1 = P1TL1 E S( V)). Comme alors Ie brownien Y associe a ~ L1 verifie (Subsection 2.3) d = ~ L1 dt (Voir [8, (3.16)]), on a () d = 0, done Y est une r-martingale. On sait aussi que, si cp est une fonction C€2 sous-harmonique, L1cp ;;. 0, cp(Y) est une sous-martingale, par Ito; e'est une propriete differente de la Proposition (2.11), car Yest une r-martingale particuliere, Ie brownien relatif a ~L1, mais cp est plus generale qu'une fonction F-convexe (une fonction r-convexe est sousharmonique, car
Y
Y
°
et g = L ~j=l s"ajaj ;;. dans EO E). Les diverses formules precedentes montrent bien que Ie formalisme des sections-differentielles de semi-martingales de fibres optionnels GA se prete tres bien a la geometric differentielle stoehastique.
3. Fibres de Stratonovitch sur un Ouvert A de R+ U {co} x
I]
Nous introduisons ici une notion interrnediaire entre celIe de la Subsection 1.3 et celles des nurneros anterieur de la Section 1. Une fonetion reelle sur R+ U {co} x.n (voir Section 2) est dite globalement de Stratonovitch, si elle est fonetion C€l d'un nombre fini de semimartingales continues; elle est alors continue, et optionnelle. Une fonetion reelle sur A ouvert de R + U {co} x n est de Stratonovitch (sousentendu localement), si A = Un An' An ouvert relativement optionnels de A, tels que, sur ehaque An' elle soit restriction d'une fonetion glob alement Stratonoviteh sur R + U {co} x Il ; elle est encore continue et optionnelle sur A 1. On definit aisernent les fonctions de Stratonovitch sur A a valeurs dans une variete C€\ et toute image C€1 d'une fonction de Stratonovitch est de Stratonovitch. On appellera Str(A) l'algebre des fonctions de Stratonovitch sur A; si IE Str(A) et I oj. partout, 1/1E Str(A); on a la propriete de faiseeau pour les reunions denombrables d'ouverts optionnels: si A = U; An' An ouverts optionnels, si I est Stratonovitch sur ehaque An'
°
1 Dans [8, (4.0)], j'ai donne une definition equivalente: les A. n'etaient pas necessairement optionnels, mais la fonction l'etait, On trouvera a [8, Section 4] tout ce dont je me sers ici.
L. Schwartz / Gros Produits Tensoriels
723
elle I'est sur A. On definit alors aisement les Str-fibres G A au-dessus de a fibres toutes de merne dimension finie; les cartes seront GA'~ A' x G., la transition pour deux cartes au-dessus de A' etant (t, tu, g) ~ (t, w, a(t, w)g), a E Str(A '; 2(GJ) = Str A0R £'(GJ; et on exige que l'atlas ait un sous-atlas denombrable. D'ou Ie Str(A)-module Str(A; GA ) des sections Stratonovitch de G A' qui est merne un Str(A; £'(GA»-module a gauche. Soit alors .N' un faisceau de Str-modules, [aisceau seulement pour les reunions denombrables d'ouverts optionnels (relativement a A) (si (A j )jEI est une famille d'ouverts relativement optionnels de A, de reunion A', et s'il existe J C I denombrable tel que U jEJ A j = A', si (T;)iEI est une famille, T; E .N'(AJ, coherente (T; = If sur A j n A), il existe T E .N'(A '), unique, induisant T; sur A j pour tout i E 1). On construira Ie Str(A)-module .N'(A; GA ) comme d'habitude; pour A' x G., .N'(A'; A' x GJ = .N'(A')0R G., la coherence de deux cartes sur Ie rneme A' est la multiplication par la fonction a E Str(A'; £'(G,», qui assure la transition, et, par definition, un element de .N'(A'; GA')' A' ouvert optionnel quelconque de A, est une famille coherente d'elernents des .N'(A j , GAJ pour les A j qui trivialisent G A; .N'(A; G A ) est un Str(A)-module,' et un Str(A; £'(GA»-module a gauche. On prendra pour .N' Ie faisceau Y';{{ de la Section 2, mais considere comme Str-module pour l'lntegration de Stratonovitch: (a,dX)~aodX, ou (a, X) ~ a (~) X On remarque que Y';{{(A) est, de deux manieres differentes, un Str(A)-module: Str(A) C Opt(A), done il I'est pour I'integrale d'Ito; et iI I'est aussi pour l'integrale de Stratonovitch. C'est ce second cas qui nous interesse, on Ie notera SY';{{(A); c'est Ie merne IR-espace vectoriel, par Ie meme module. D'ou les SY';{{(A; G A ) , qui sont des Str(A)-modules, et des Str(A; £'(GA»-modules a gauche, pour I'integrale de Stratonovitch, avec la propriete de faisceau pour les reunions denombrables d'ouverts optionnels de A. Si G A est trivial, GA = A x G., Y';{{(A; G A ) et SY';{{(A; G A ) sont Ie merne R-espace vectoriel, pas Ie meme module; si GA est seulement Str-trivialisable, mais non trivialise, Y.Jl(A; G A ) et SY';{{(A; G A ) sont encore des IR-espaces vectoriels isomorphes (par trivialisation), mais il n'y a entre les deux aucun isomorphisme naturel, car l'isomorphisme change avec la trivialisation. Mais il n'y a maintenant plus aucune raison de croire que tout Str-fibre GA soit sous-Srr-fibre facteur direct d'un fibre trivial. A,
Definition (3.1). Un Str-fibre G A sera dit projectif s'il est facteur direct d'un Str-fibre trivialisable: G A ffi H A = E A , E A :::= A x E ..
L. Schwanz / Gros Produits Tensoriels
724
Alors: Proposition (3.2). Soit G A un Sir-fibre projectif. Alors Str(A; G A ) est un Str(A)-module projectif de type fini. Le Str(A)-dual (Str(A; G A»* de Str(A; G A ) est Str(A; G~). On ales isomorphismes de Str(A; .
a
Sf:fAl(A; G A )
= HomSlr(A) (Str(A;
G~); Sf:fAl(A»
= Str(A; G A)Q9Str(A) SY'Al(A).
Demonstration comme aux Sections 1 et 2. En dehors du cas que nous allons voir main tenant, je ne vois aucun critere simple de projectivite, Si Y est un processus de Stratonovitch sur A avaleurs dans une variete V de cIasse ~1, et si C§y est un fibre vectoriel ~1 sur V, I'image reciproque Y*(C§y) = G A est un fibre de Stratonovitch sur A. Les changements de carte, comme indique au debut de la Subsection 2.2 se font en eflet par des /3 = ~ Y, Y Stratonovitch, a de cIasse ~I, done sont de Stratonovitch. On peut alors repeter l'enonce (2.1), en rernplacant Opt par Str, et Y'Al par SY'Al; en effet, C§y etant sous-fibre facteur direct d'un fibre 0
trivial, GAl' est aussi. Proposition (3.3). Soit Y un processus de Stratonovitch sur A ouvert de IR+ U {oo} x fl, a valeurs dans une variete V de classe ~I. II existe une dijferentielle de martingale continue formelle a va leurs dans T 1(V) Ie long de Y, dye E Al(A; C§y; Y) (Opt(A)-module), unique, telle que, quelle que soit la carte vectorielle de V:V'~V', V' ouvert de V, V' ouvert de E, Tl(V')~ V'xE, ou Ya pour image Y', dye ait pour image dY'C, la compensee E Al (A; E) du processus de Stratonooitch Y' E Str(A; E). C' est l'unique element de HomOpl(A)(Opt(A; T*l(V); Y); Al(A» tel que, si J=Dtp(Y)EOpt(A;T*I(V);Y), tpE~I(V;R), on ait JdY c = Dtp(Y) d y c = d(tp(Y»C E .Jt(A), compensee du processus de Stratonooitch tp(Y) E Str(A). En termes integres: Dtp(Y). y c = (tp(Y»" , en se rappelant que tp(Y) est un Stratonovitch, mais que les 2 membres sont seulement des classes d' equivalence sur A de martingales continues formelles. Demonstration. Nous ne la donnerons pas, elle est semblable a celie de la Proposition (2.10). Juste une petite explication. Si fest reelle de Stra-
L. Schwartz / Gros Produits Tensoriels
725
r
tonovitch sur A, on peut definir sa compensee E At(A); ici At(A) est un Opt(A)-module, pour I'integration d'Ito. Puisque d y c E At(A; T1(V); Y), elle est un element de HomOpt(A)(Opt(A; T*l(V); Y); At (A», d'ou la possibilite d'ecrire D\p(Y)· y c == 'P(YY E At(A). 0
a
Proposition (3.4). Soit Y une semi-martingale continue (vraie) sur A valeurs dans V, uariete cg2. II existe une differentielle unique d Y de Stratonovitch, valeurs dans T1(V) Ie long de Y; dYE SYAt(A; T1(V); Y), telle que, pour les cartes comme a la Proposition (3.3), l'image de d Y soit d Y' E SYAt(A; E) associee Y' E Y At (A; E). C' est l' unique element de HomSJr(A)(Str(A; T1*(V); Y); SYAt(A» tel que, (S)si ] == D1'P(Y)EStr(A; T*I(V); Y), 'PE cg2(V;IR), on ait ]odY==d(Jo Y)== d('P( Y» E SYAt(A), differentietle de Stratonovitch de lasemi-martingale vraie 'P(Y) sur A valeurs dans E. En termes integres:
a
a
a
D1'P(Y)
(S) 0
A
Y - 'P(Y) ,
le 2" membre est une vraie semi-martingale reelle continue sur A, le I" une classe de semi-martingales continues [ormelles; ~ pourrait se remplacer par 3, Ie second membre est element de la classe que definit Ie premier.
Merrie demonstration que Proposition (2.10).
Bibliographie [1]
[2] [3] [4] [5]
[6] [7]
[8]
Roger Godement. Theorie des faisceaux, Pub!. Inst. Math. Univ. Strasbourg (Hermann, Paris, 1958). Dale Husemoller, Fibre Bundles (McGraw-Hill, New York, 1966). Nobuyuki Ikeda and Shinzo Watanabe, Stochastic Differential equations and Diffusion processes (North-Holland, Amsterdam, 1981). Paul-Andre Meyer, Geometric Stochastique sans larmes, In: Sem, Probab. XV, 1979/80, Lecture Notes in Math. 850 (Springer, Berlin, 1981) 44-117. P. Priouert, Processus de diffusion et equations diflerentielles stochastiques, In: Ecole d'ete de Probab. Saint-Flour 111-1973, Lecture Notes in Math. 390 (Springer, Berlin, 1974) 37-113. Laurent Schwartz, Semi-martingales sur des varietes, et martingales conformes sur des varietes analytiques complexes, Lecture Notes in Math. 780 (Springer, Berlin, 1980). Laurent Schwartz, Les semi-martingales formelles, In: Sern. Probab. XV, 1979/80, Lecture Notes in Math. 850 (Springer, Berlin, 1981) 413-489. Laurent Schwartz, Geometric differentielle du 2' ordre, semi-martingales et equations differentielles stochastiques sur une variete. Sern. Probab. XVI, 1980/81, Suppl.: Geometric Differentielle Stochastique, Lecture Notes in Math. 921 (Springer, Berlin, 1982) 1-150.
l.A. BARROSO editor, Aspects of Mathematics and its Applications © Elsevier Science Publishers B.Y. (1986)
727
HILBERT-SCHMIDT COHOMOLOGY OF WEYL SYSTEMS
Irving SEGAL Massachusetts Institute of Technology, Cambridge, MA 02139, U.S.A. Dedicated to Leopoldo Nachbin
Let (K, W) be a Weyl system over the finite-dimensional Hilbert space H, and let A be a Hilbert-Schmidt operator on K. For Z E H let J z denote the operator A ~ W(zt1AW(z)- A. Functions F on H" of the form F(ZI, ... , zn) = Jz•... Jz,A are characterized by a functional equation, together with obvious regularity properties. Given F, A may be recovered by the equation A = -lim Hoo
JF(Zl' ... , z,) dg,(ZI)' .. dg,(zn),
where dg, (z ) is the element of the canonical normal distribution of variance parameter t on H.
1. Introduction
In connection with the correspondence principle for the deduction of quantum hamiltonians from classical ones, there is often some a priori ambiguity in the resulting operators. A similar situation arises in the treatment of local products of quantum fields. On the other hand, sufficiently high commutators of these putative operators with Heisenberg variables (i.e. the operators often denoted as Pi or qj) may be unique and serve to characterize the operators modulo a precise indeterminacy, which on occasion may be removed by ancillary considerations (group invariance, vanishing of vacuum expectation values, etc.). A related purely algebraic theory was given in [1]. This left entirely open regularity questions that are essential for the complete specification of the ultimately relevant operators in Hilbert space. In Lie group theoretical terms, the treatment was infinitesimal rather than global. Hilbert space features of local products of quantum fields were developed in [2], but this treatment falls far short in generality of the general line of
728
1. Segal/Hilbert-Schmidt Cohomology of Weyl Systems
[1]. Thus [1] is a theory associated with a linear vector space and bilinear forms thereon, while [2] involves in addition a multiplicative structure. Here a general treatment is given for the integrated form of the theory of [1]. This integrated form involves the replacement of the Heisenberg commutation relations by the Weyl relations. The natural first stage beyond the algebraic one is the study of the transformation properties of the Hilbert-Schmidt operators on the representation space under the Weyl operators. The dimension of the underlying linear vector space, or physically speaking, the number of degrees of freedom, is assumed finite. Within this limitation the results are explicit and complete. They may be regarded as providing an integrated and symplectically invariant form of the Poincare lemma that includes an integral formula for the antidifferential and an explicit bound for it. It is doubtful whether the conclusions attained are valid in the case of an infinite-dimensional underlying linear vector space, which admit a plethora of inequivalent Weyl systems, unlike the finite-dimensional case, in which according to the Stone-von Neumann theorem they are all unitarily equivalent apart from multiplicity. However, in the crucial cases of the free quantum fields over a given complex Hilbert space, it will be shown elsewhere, using the present results, that the same conclusions are valid. This is probably true in the case of Fermi-Dirac fields as well as Bose-Einstein fields, i.e. in connection with the Clifford as well as in connection with the Weyl relations, but the finite-dimensional theory of the former fields is purely algebraic, and so is covered by [1]. The difference is related to the contrast between the infinite-dimensionality of the harmonic representation of the finite-dimensional symplectic group, and the finite-dimensionality of the spin representation of the finitedimensional orthogonal group. In the case of Weyl systems, although not in that of Clifford systems, the present direction can be interpreted purely mathematically as the study of the cohomology of certain unitary representations of the Heisenberg group. Only the one-dimensional cohomology has as yet found applications in quantum theory, and higher-dimensional cohomology is not treated here. As a nilpotent group, the semisimple theory [3] is inapplicable, and for greater completeness and directness, no use will be made of the general treatment given in [4] of the cohomology of unitary representations. The Heisenberg group is of course fundamental in certain mathematical and physical respects, and is appropriately treated in its own right.
1. Segal/Hilbert-Schmidt Cohomology of Weyl Systems
729
2. Quantized Forms in the Weyl Context In part to fix the notation, the concept of 'Weyl system' will be recalled. Let L be a given real linear vector space, and A a given nondegenerate antisymmetric bilinear form on L. A Weyl system over (L, A) is defined as a pair (K, W) consisting of a complex Hilbert space K together with a map W from L to the unitary operators in K satisfying the (Weyl) relations W(z) W(z') =
e(i/2)A(z, z')
W(z + z')
for arbitrary z and z' in L; and which moreover has the property that the restriction of W to anyone-dimensional subspace of L is continuous in the strong operator topology. When L is finite-dimensional, every pair (L, A) is equivalent (symplecticaIly) to one obtained from a complex Hilbert space H by taking as A(z, z') the imaginary part of the inner product (z, z'). This is called a Weyl system over H, and only such systems are treated here. When H is finite-dimensional, the Stone-von Neumann theorem implies the essential unicity of the Weyl system over H; i.e. unicity apart from multiplicity, which may be suppressed by requiring irreducibility of the Weyl system. For application to the case of an infinite-dimensional H, it is however inconvenient to ignore the multiplicity, which is infinite in this application. This involves using the relative rather than absolute trace of operators in Hilbert space, in accordance with the ideas of noncommutative integration theory [5]. Alternatively, however, it would be possible to work entirely with absolute traces and Hilbert-Schmidt norms, in place of the relative trace and the L 2 space with respect to an operator ring, at some cost in manifest invariance, generalizability, and application to other cases. Thus the W*-algebra generated by the W(z) as z varies over the underlying (physically, 'single-particle') Hilbert space H will be denoted as R(H), or simply as R when clear from the context. For an irreducible Weyl system, R consists of all bounded linear operators on K, on which there exists a unique non-trivial trace (apart from normalization, which as usual is defined by taking as unity the trace of a one-dimensional projection). Whether irreducible or not, R is *-algebraically isomorphic to the algebra B(K') of all bounded linear operators on some Hilbert space K', and thus has a unique trace in this case also, obtained by carrying
730
1. Segal/Hilbert-Schmidt Cohomology of Weyl Systems
back to R the cited trace on B(K'). The notation LZ(R) will refer to the space of all square-integrable operators with respect to this trace, and thus to the space S(K) of all Hilbert-Schmidt operators on K, when W is irreducible. The corresponding norm of an operator A will be denoted as IIAlb; thus, IIAII~ = Tr A * A. In the following, (K, W(.)) will always refer to a Weyl system over a finite-dimensional Hilbert space H. For Z E H, az will denote the operator on B(K), A ~ W(z l A W(z) - A. The operators az can be regarded as forming a symplectically invariant class of integrated forms of differentiation in the direction z, with a suitable interpretation for complex directions. Thus when H is one-dimensional and the Weyl system is in the Schrodinger representation in which K = L Z(IR I), the operation of multiplication by the given bounded measurable function b(x) is carried, for suitable z, into the operation of multiplication by b(x + y) - b(x), where y depends linearly on z. With this interpretation, involving the replacement of a function by the corresponding multiplication operator, the following result can be considered as a partial extension of the Poincare lemma for one-forms.
r
Theorem (2.1). Let (K, W) be a Weyl system over the Hilbert space H, and let A be an operator in LZ(R), where R is the W*-algebra generated by the W(z), z E H. For (z., .. , zn) arbitrary in H", let F(zl" .. , zn) be defined as azn ' " aZIA. Then: (1). F(ZI"'" zn) is symmetric in ZI' ... , zn. (2). F is continuous and bounded from H" into LZ(R). (3). Ftz.; .. " z,) is a cocycle as a function of ZI for fixed Zz, ... , Zn' where 'cocycle' means a function c/> from H to B(K) satisfying the functional equation (2.1)
c/>(Z + z') - c/>(z) = W(z
r
l
c/>(z') W(z) ,
for arbitrary Z and z' in H. Conversely, if F is a function from H" to LZ(R) satisfying (1)--(3), then there exists a unique operator A in LZ(R) such that the foregoing holds. Lemma (2.2). If A is in B(K) and c/>(z) = azA, z E H, then equation (2.1) holds. Proof. This is an immediate deduction from the Weyl relations. 0
1. Segal I Hilbert-Schmidt Cohomology of Weyl Systems
731
Lemma (2.3). For any A in L\R), conditions (1)-(3) hold. Proof. Direct computation shows that az and az' commute for arbitrary z and z', implying (1). (3) then follows from (1) together with Lemma (2.2). By virtue of the Weyl relations, the map z ~ W(z) is continuous in the strong operator topology. The continuity, relative to the unit ball, of the trace of operators of absolutely convergent trace then implies that the map z ~ azA is continuous from H into e(R). Evidently IiazA II2~ 211Alb for arbitrary A, and (3) follows. 0 In connection with the following lemmas the notion of Weyl transform will be recalled, specializing [6] to the case of a vector group. The Weyl transform of an operator A in L 2(R) is the function A# on H defined as follows. If A has absolutely convergent relative trace, then A#(z) = Tr W(z)A. Using the fact that for such A, IIAIIz = IIA#(.)IIz, the Weyl transform of an arbitrary element A of e(R) is defined almost everywhere on H by continuity of the transform in the relevant spaces. Lemma (2.4). The Weyl transform of auA is (eiA(u,z)_ I)A#(z), u being arbitrary in H and A being arbitrary in L\R). Proof. It suffices by continuity to establish the conclusion for the case when A is trace class. In this case, Tr W(u)A W(z) = Tr W(z)W(u)A = Tr e(i/2)A(z. u) W(z + u)A = e(i/2)A(z, u)A#(z + u). Thus the Weyl transform of W(u)B is the latter expression. Applying this fact twice it results that the Weyl transform of W(-u)AW(u) is eiA(u,z)A#(z), and the lemma follows.
o
Lemma (2.5). If cf> is a continuous function from H to L 2(R) that is a cocycle, then there exists a measurable function f on H such that the Weyl transform of cf>(u) is the function of z, (eiA(u,z) - l)f(z). . Proof. Let f(u,.) denote the Weyl transform of cf>(u); for each u, f(u, z) is well-defined almost everywhere on H as a function of z, Taking the Weyl transform of (2.1) and using Lemma (2.4), it results that f(u + u ', z) = f(u, z)+ eiA(u,z)f(u', z). Interchanging u and u', it follows that
732
I. Segal/Hilbert-Schmidt Cohomology of Weyl Systems
(eiA(u,Z)_l)/(u', z)= (eiA(u',z)_l)/(u, z) for any fixed u and u', and almost all z. Now if u 'I- 0, then eiA(u.z)_ 1 -,6 0 almost everywhere in z. Thus there exists a fixed measurable function I on H such that I(z) is almost everywhere equal to the foregoing expression, concluding the proof. 0 Lemma (2.6). Let I be a measurable function on H, and let F(u, z) = (eiA(u.z)_ 1)/(z). Suppose that IIF(u, .)Ib is bounded by a constant Cas u ranges over H. Then I is in L 2(H) and 11/112 ~ C.
Ilz'l < k and 11/(z)11 < k, and =0 otherwise; let Fk(u, z) = (eiA(u,z)-l)lk(z), Then /Fk(u, z)1 ~ IF(u, z)1 for almost allu and z, whence IIFk tu, .)Ib ~ C for all u and k. Now let gt denote the normal probability measure on H of variance parameter t, centered at the origin: Proof. Let Ik(Z) = I(z) if
dgt(z)
=
(_1_) 21T't
N e-llzIP12t
dm(z),
where N is the dimension of Hover C and dm (z) is the element of Lebesgue measure in H as a real Hilbert space of dimension 2N. By Fubini's theorem, the double integral If IFk(u, Z)1 2 dg,(u) dm (z) is finite, and it follows that f Fk(u, z) dg,(u) exists and is in L\H) with square norm bounded by the value of the cited double integral, and hence by C 2 • On the other hand the foregoing marginal integral with respect to dgt(u) may be evaluated in closed form as (e -tlizlF/2 - 1)Ik(z ), using the Gaussian integral
Je iA(u.
z) dg,( u) =
e -'llzl12/2 .
Thus the function of z, (e-tllzl12/2_ l)lk(z) has L 2-no r m bounded by C. But as n ~ 00, I k ~ I in LiH), which by Fatou's lemma implies that I is in 2(H) L with 11/1 2 ~ C. 0 Lemma (2.7). II IE L
2(H),
then
I. Segal/Hilbert-Schmidt Cohomology of Weyl Systems
1=
-lim
'_00
733
J(eiA(u,.) - 1)/(,) dg,(u)
Proof. By the proof of Lemma (2,6), the integral in the foregoing 2 expression is identical to the function of z, (e-'lIzI1 /2 _ l)/(z), and so 2(H) converges in L to -I as (-HfJ. 0 Completion of the proof of Theorem (2.1). Consider first the case n = I, i.e. F is a given function from H into L 2(R) that satisfies (1)-(3). By Lemma (2.5) there exists a measurable function 1 on H such that the Weyl transform of F(u) is (eiA(u,z) - l)/(z). By Lemma (2.6),f is in L 2(H). Accordingly, 1 is the Weyl transform of a unique operator A in L 2(R). By Lemma (2.4), the Weyl transform of auA is identical to that of F(u), implying that auA = F(U). The unicity of A is clear from the foregoing argument (and is also implied by the scalar character of the center of R). If now F is a given function from H" to e(R) that satisfies (1}-(3), then the foregoing applies to the function obtained by fixing z\' ... , zn-\' and setting zn equal to the variable z in H The result is a cocycle 2(R) F z ...... Zn-l , say. Thus there exists a unique element A z It···, zn-l of L such that F(z\, ... , z) = a.Azlo ... ,Zn' Moreover IIAz1.... 'zn_tllz is bounded as zl' ... , zn-\ independently range over H. It follows that A Zl.···, -n-l . is as a function of z\, ... , zn-\' one that satisfies (1)-(3). This argument has of course assumed that n> 1, but having earlier established the case n = 1, the case of arbitrary n now follows by induction. 0
Acknowledgment I thank S.M. Paneitz for a comment. This work was partially supported by the National Science Foundation.
References [1] 1. Segal, Quantized differential forms, Topology 7 (1968) 147-171. [2] 1. Segal, Nonlinear local functions of weak processes I, 1. Funct. Anal. 4 (1969/70) 404-456; II, ibid. 6, 29-75.
734
I. Segal/Hilbert-Schmidt Cohomology of Weyl Systems
[3] A. Borel and N. Wallach, Continuous cohomology, discrete subgroups, and representations of reductive groups (Princeton Univ. Press, Princeton, NJ, 1980). [4] A. Guichardet, Lecons sur certaines algebres topologiques (Gordon and Breach, Paris, 1967). [5] I. Segal, A non-commutative extension of abstract integration, Ann. Math. 57 (1953) 401-457. [6] I. Segal, Transforms for operators and symplectic automorphisms over a locally compact abelian group, Math. Scand. 13 (1963) 31-43.
l.A. BARROSO editor, Aspects of Mathematics and its Applications © Elsevier Science Publishers B.V. (1986)
735
GRAMMATICAL FAMILIES OF LANGUAGES
E.H. SPANIER Department of Mathematics. University of California, Berkeley, California 94720. U.S.A. With cordial greetings to Leopoldo Nachbin on his sixtieth birthday
O. Introduction This article is a survey of recent results about grammatical families. These families are particular sets of context-free languages and have a surprisingly rich and mathematically interesting structure. A grammatical family is a special kind of family of languages. To understand grammatical families we first need to understand what a language is and then what a family of languages is. Therefore, we begin with languages. A language is any subset of a finitely-generated free monoid, Our main concern is with context-free languages. These are generated by context-free grammars first introduced in linguistics to study natural languages [4], [5]. Later [14] they were shown to be essentially equivalent to ALGOL-like programming languages. Subsequently their theory has become a branch of computer science but one of interest to linguists and mathematicians. Context-free languages are not only those languages generated by context-free grammars but they are also characterized as those languages accepted by pushdown acceptors. Thus, they are characterized in two different ways. Other families of languages of interest having both a grammatical characterization and a machine characterization are the family of regular languages (either generated by right-linear context-free grammars or accepted by finite state automata) and the family of recursively enumerable languages (either generated by arbitrary phrase structure grammars or accepted by Turing machines). All three of these families of languages are closed under certain operations on languages (e.g. union and homomorphic image). This suggests another, more algebraic, description of a family of languages, namely as obtained from
736
E.H. Spanier I Grammatical Families of Languages
certain simple languages and closure with respect to suitable operations. This is formalized in the concept of an abstract family of languages which satisfies certain closure properties. It is known that many of these abstract families have alternate characterizations by means of an abstract family of machine acceptors but no corresponding characterization in terms of generation by a family of grammars is known. Grammatical families were introduced [9] in an attempt to provide grammatical characterizations of some families of languages. Here one starts with a particular context-free grammar and considers all contextfree grammars which can be modelled on the original one. The corresponding collection of context-free languages generated by all these grammars is a grammatical family. Grammatical families have closure properties making them algebraically tractable. It turns out that there is a second characterization of the collection of (almost all) grammatical families in terms of the family of all regular languages and suitable operations on families of languages. The fact that there are two different descriptions of the collection of grammatical families attests to their importance and is of particular significance in their study .. In order to analyze the collection of grammatical families it is useful to introduce the concept of a prime grammatical family. The definition is natural using the product operation in the collection of grammatical families. The prime decomposition theorem asserts that every grammatical family can be uniquely expressed as a minimal sum of products of prime grammatical families. This result has two important consequences. One is a general decision procedure for the inclusion of one grammatical family in another (surprising because the inclusion of one context-free language in another has long been known to be recursively unsolvable). The second is a unique canonical expression for grammatical families (a refinement of the minimal prime decomposition) which exhibits canonically all grammatical families. The paper is divided into five sections. Section 1 contains background material on context-free grammars and languages, and Section 2 contains background material on families of languages. These first two sections merely summarize known definitions and results needed in the sequel. Section 3 contains the definition and basic properties of grammatical families, Section 4 contains the alternate characterization of grammatical families, and Section 5 discusses the structure of grammatical families. Proofs are omitted.
E.H. Spanier / Grammatical Families of Languages
737
1. Grammars and Languages
Surveys of the material in this section intended for mathematicians are [8], [21]. More details can be found in [12], [17]. Let.! be a finite set and let .!* be the free monoid generated by .!. . P consists of all finite sequences a j a 2 ••• an' n ~ 0 with a, in ! for all i, including the empty sequence denoted by 1, and with multiplication defined to equal juxtaposition of sequences. The elements of !* are called strings over .! and each string W has a length Iwi ~ O. We set .!+ = the set of strings of positive length = '!*\{1}. A language over.! is a subset of !*. We are interested in methods of specifying languages over.! and, in particular, the method of specification by means of a grammar. A phrase structure grammar G = (V,.!, P, a) consists of: (1). A finite set V (the vocabulary). (2). A subset'! C V (the terminal symbols). (3). A finite set P of ordered pairs u ~ v with u a non-empty string over V\! and v in V* (the productions or rewriting rules). (4). An element a in V\! (the start variable). Elements of V\! are called the variables or nonterminals of the grammar. The grammar is used as follows. Given w, w' in V* we define w =? w' if there exist x, y, u, v in V* such that w = xuy, w' = xvy and u ~ v is in P (so that the relation =? corresponds to left multiplication by an arbitrary string x and right multiplication by an arbitrary string y of an arbitrary pair in the relation ~). We define :} to be the transitive reflexive extension of the relation =? (i.e. w~ w' if there is a sequence w = wo, Wj' ••• , wn = w' in V*, n ~ 0 such that that W j _ j =? wj for 1 ~ i ~ n. Such a sequence wj is called a derivation of w' from w). The language generated by G denoted L( G) is the set of strings over the terminal symbols which can be derived from a (so L( G) = {w in ! * Ia ~ W D. A phrase structure language is a language generated by some phrase structure grammar. It is known [11] that the phrase structure languages are identical with the recursively enumerable sets (characterized in terms of acceptance by Turing machines). To obtain new types of languages we restrict the productions of the grammar. A context-free grammar G is a phrase structure grammar in which each production has the form ~ ~ v with ~ in V\'! and v in V*. A language is context-free if it is generated by some context-free grammar. It is known [6], [20] that the context-free languages are characterized by pushdown acceptors.
738
E.H. Spanier / Grammatical Families of Languages
Although context-free languages are the most important in this article, we need to define some other special types of grammars. A phrase structure grammar G is linear (right linear) if every production has either the form g ~ wfw' (g ~ wf) or the form g ~ w where g, f are in V\$ and w, w' are in $*. The languages generated by these grammars are the linear (right linear) languages. It is known [7] that the right linear languages are identical with the regular sets (these being characterized by finite state acceptors). Furthermore, by the Kleene theorem [18] the regular sets in $* constitute the smallest collection of subsets of $* containing all finite subsets and closed under union, product, and star (for languages L, L' C $* the product LL' = {ww'l w in L, w' in L'} and L * is the submonoid of $* generated by L, i.e. L * = {I} U LULL U LLLU ...). In the examples below a, b denote different symbols. Example (1.1). {ani n > I} is a regular language. Example (1.2). {anbnl n;;': I} is a linear nonregular language. Example (1.3). {anbnIn;;,: 1}* is a context-free language which is nonlinear. Example (1.4). {anbna nIn;;,: I} is a phrase structure language which is not context-free. There is an algorithm to decide whether one regular language is contained in another [19]. It is recursively unsolvable to determine whether one context-free language is contained in another but there is an algorithm to determine whether a context-free language contains a given string [12], [17]. Furthermore, it is recursively unsolvable to determine whether a recursively enumerable set contains a given string [11].
2. Families of Languages In this section we consider properties of the collection of all languages of a given type. Detailed treatments of this material can be found in [2],
[13]. We begin by considering the Boolean operations on subsets of $*. The
E.H. Spanier I Grammatical Families of Languages
739
collection of all regular languages is closed under union, intersection, and complementation [19], the collection of all recursively enumerable languages is closed under union and intersection but not under complementation [11], and the collection of all context-free languages is closed under union but not under intersection [12]. All three collections of languages are closed under product and star but the collection of all linear languages is not closed under either [17]. A basic result in the theory of context-free languages is that the intersection of a context-free language with a regular language is a context-free language [1]; that is, the collection of all context-free languages is closed under regular intersection. Closure of a collection of languages under regular intersection is an important property closely related to preservation of the collection by means of machine mappings. Naturally, closure of a collection of languages under some operations frequently implies closure under other operations. Closure under certain operations of frequent occurrence has been singled out in the following. Given an infinite set 4", a full AFL .2 = {L} consists of a collection of such that: subsets L C (1). For each L in .2 there is some finite subset 4 L C 4", such that LC (2). There is some L in .2 with L ¥- 0. (3). .2 is closed under union, product, star, homomorphism, inverse homomorphism, and regular intersection. (Note: AFL is an abbreviation of abstract family of languages. The adjective 'full' is to distinguish these families from AFL which are only required to be closed under non decreasing homomorphisms instead of arbitrary homomorphisms. In this article we only consider full AFL.) Let !/A denote the collection of all regular languages, .2CF denote the collection of all context-free languages, and .2RE denote the collection of all recursively enumerable languages (in each case the languages are to be taken over all finite subsets of 4,,). Then !/A, .2CF' and .2R E are full AFL. On the other hand, if .21;0 denotes the collection of all linear languages (over finite subsets of 4",), then .2liO is not closed under product so is not a full AFL. To consider collections such as .2 lio it is convenient to make the following definition. A full semi-AFL (over 4",) is a collection .2 = {L} of satisfying all the properties of a full AFL except that it languages L C need not be closed under product nor star (so .2 is closed under union, homomorphism, inverse homomorphism, and regular intersection).
4:
4r.
4:
740
E.H. Spanier / Grammatical Families of Languages
Clearly every full AFL is a full semi-AFL. In addition .2lin is a full semi-AFL. Every full semi-AFL contains f!It. Since the intersection of a collection of full semi-Al-L (full AFL) over ~oc is again a full semi-Al-L (full AFL) over ~"" if .2 is a collection of languages over ~'" there is a smallest full semi-AFL (full AFL) containing .2 denoted by Y(.2) UF(.2» and called the full semi-AFL (full AFL) generated by .2. A principal full semi-AFL (full AFL) is a full semi-AFL (full AFL) generated by a single language. For example, .2lin and .2CF are both principal full semi-AFL. Since .5tCF is also a full AFL it is also a principal full AFL. Another concept useful in studying families of languages is that of substitution, which is a generalization of homomorphism. A substitution J.L from ~~ to ~~ is a function from ~~ to subsets of ~~ such that J.L(1) = {l} and J.L(uv) = J.L (u)J.L (v) for all ll, v in ~~. (A homomorphism from ~r to ~~ can be identified with a substitution J.L from ~~ to ~~ such that J.L (u) is a singleton for every u in ~n A substitution J.L is said to be finite, regular, or context-free if J.L(u) is finite, regular, or context-free, respectively, for every u in ~r. Every full semi-AFL is closed under all regular substitutions, and the family .2CF is closed under context-free substitutions.
3. Grammatical Families In this section we describe the language families of primary interest to us. They consist of the collection of languages generated by the set of all context-free grammars which are modelled on a particular context-free grammar. The formalization of a grammar modelled on another is by means of the concept of interpretation which we define next. This section is based on [9]. Let V", be a fixed infinite set and ~'" a subset of Voc such that V", and V""\~",, are both infinite. We will be considering context-free grammars whose nonterminals are in V",\~", and whose terminals are in ~",. Let G = (V, ~, P, IT) be such a grammar. An interpretation 1= (J.Lb Vb ~I' PI' lTI ) of G consists of a context-free grammar GI = (Vb ~I' PI' lTI ) and a substitution J.LI on V* such that: (1). J.LM) C VI\~I for all g in V\~. (2). J.LI(a) is a finite subset of ~~ for all a in ~. (3). J.LI (g) n J.Lr (7J ) = 0 for g # 7J in V\~.
E.H. Spanier I Grammatical Families of Languages
741
(4). al is in J-LI(a). (5). PICJ-LI(P) where J-LI(~~v)={a~yla in J-LI(g), y in J-LI(V)} and
J-LI (P) = Up in P J-LI (p). Note that every production in PI is obtained from some production p in P by replacing every occurrence of a nonterminal in p by a nonterminal and every occurrence of a terminal in p by a terminal string. Thus, every production in PI 'resembles' some production in P, and it is in this sense that the grammar G I is modelled on G. G serves as a 'master grammar' for the grammars of its interpretations. Let C§(G) = {GIl I an interpretation of G} and 5£(G) = {L(G')I G' in C§(G)}. A collection 5£ of languages (over 1,'",,) is called a grammatical family if 5£ = 5£(G) for some context-free grammar G. Example (3.1). PJl is a grammatical family; in fact PJl = 5£(G) where G = (V,.!, {a~ aa, o -« a}, a), a in .!. Example (3.2). 5£lin is a grammatical family; in fact 5£lin = 5£(G) where G = (V,.!, {a~ aa, o -« aa, a~ a}, a), a in .!. Example (3.3). Let 5£" = {0}. Then 5£" is a grammatical family because 5£" = 5£(G) where G = (V,.!, {a~ a}, a). Example (3.4). Let 5£/11 = {0, {l}}. Then 5£/11 is a grammatical family because 5£/11 = 5£(G) where G = (V,.!, {a~ I}, a). Example (3.5). Let 5£fin be the family of all finite subsets of .!:. Then 5£fin is a grammatical family because 5£f," = 5£(G) where G= (V,.!, {a~ a}, a), a in .!. Example (3.6). 5£CF is a grammatical family; in fact, 5£CF = 5£(G) if and only if G has the property that there exist ~ in V\1,' and U!' u z, u 3, u 4, Us in .!*, u6 in .!+ such that (in G) a ~ U 1 guz, g ~ u3gu4guS ' g ~ u6 •
A grammatical family 5£ is called trivial if 5£ = 5£(G) for a grammar G such that L(G) is finite. It is known that the trivial grammatical families are 5£", 5£{J}' and 5£fin' A proper grammatical family is a nontrivial grammatical family which is a proper subset of 5£CF" Both fJ'l and 5£lin are proper grammatical families. Every proper grammatical family is a principal full semi-AFL (but it is not true that every principal full semi-AFL
742
E.H. Spanier / Grammatical Families of Languages
which is a proper subset of 2 CF is a grammatical family). Therefore, the collection of proper grammatical families is a collection of principal full semi-AFL. It is this collection we shall consider.
4. Characterization of Proper Grammatical Families
We have defined grammatical families as those families of languages generated by the collection of context-free grammars which are interpretations of a fixed context-free grammar. In this section we describe a Kleene-type theorem characterizing the collection of proper grammatical families in terms of the family flJi and operations on families. This material is from [10]. Given two families of languages 2 and 2' their sum 2 EB 2' (sometimes denoted 2 v 2' in the literature) is the collection {L U L'I L in 2, L' in 2'} and their product 202' is the collection {U;~I LiL;1 n ;3 1 each L, in 2 and in 2/}. The sum operation is associative and commutative, and the product operation is associative. Furthermore, if .21 and 2 2 both contain 0, the following distributive properties are valid
L;
The sum and product of (proper) grammatical families are again (proper) grammatical families. Another operation on families of languages which preserves proper grammatical families is the full AFL operator ;j; (i.e, if 2 is a proper grammatical family so is ;j;(2». Note that each of the three operations EB, 8, ;j; applied to the collection of families {911} yields {911} (because 9ll EB 9ll = 9ll, 9118 9ll = 9ll, and ;j;(911) = flJi). Thus, these three operations are not enough operations on families to generate every proper grammatical family starting with 9ll. To get enough operations we need one more, a ternary operation fl, which we describe next. This ternary operation applied to families 5£1' 2 2, 5£3 yields the family of all languages obtained from certain special languages over a pairwise disjoint union A U CUB by applying substitutions which replace symbols in A, C, B by languages in 2 1, 2 2, 2 3, respectively. The special languages are linear languages in which terminals occurring in the right-hand side of productions belong to different sets A, C, B depending
743
E.H. Spanier I Grammatical Families of Languages
on whether the terminal is followed by a nonterminal in the right-hand side of the production, is the entire right-hand side of the production, or follows a nonterminal in the right-hand side of the production, respectively. The following definition makes this precise. A grammar G = (\I;.!, P, 0") is called a split-linear grammar if the right-hand side of every production in P is in A(.!\ V) U C U (.!\ V)B for some pairwise disjoint subsets A, B, C C.!. In this case we assume .! = A U CUB and use the notation G = (\I; A U CUB, P, 0"). Given families of languages :£., :£2':£3 let [Y(:£I' :£2' :£3) be the family of all languages 7(L) where L = L( G) for some split linear grammar G = (\I; A U CUB, P, 0") and 7 is a substitution on (A U C U B)* such that 7(X) is in :£1' :£2 or :£3 if x is in A, C, or B, respectively. A main result of [to] is the theorem that the collection of proper grammatical families is the smallest collection of families of languages containing (iJi and closed under the (unary) operation ;j;, the (binary) operations EB and 0, and the (ternary) operation [Y. It provides another description of the collection of proper grammatical families and implies that every proper grammatical family can be obtained from (iJi by a finite succession of these operations. The fact that there are two different ways of obtaining the collection of grammatical families is of primary importance in analyzing their structure.
5. Structure of Grammatical Families In this section we discuss the decomposition of grammatical families in terms of primes in a canonical way. The material is from [15], [16]. A grammatical family :£ is called prime if for every pair of grammatical families :£1' :£2 with :£ C:£I 0 :£2 either :£ C:£I or :£ C :£2· The following remarks are from [15]: Remark (5.1). If :£ is prime and :£ C:£I 0 - .. 0 :£n' n > 1, each grammatical family, then :£ C:E; for some i. Remark (5.2). Every trivial grammatical family is prime; also
(iJi
:E; a
is prime.
Remark (5.3). Since the product of proper grammatical families is a proper grammatical family, :£CF is prime.
744
E.H. Spanier / Grammatical Families of Languages
Remark (5.4). For every grammatical family 2, :#(!f) is prime. A grammatical family 2 is additively prime if for every pair of grammatical families 2 2 with 2 c 2 1 EB 2 2 either 2 C 2 1 or 2 C 2 2 , The following remarks are again from [15]:
s;
Remark (5.5). If 2 is additively prime and 2 C 2 1 EB ... EB 2 n , n ~ 1, each .!f; a grammatical family, then 2 c for some i.
s;
Remark (5.6). Since !tl EB!t2 C 2 1 0 !t2 for all families contammg the language {I}, it follows that every prime is additively prime. Remark (5.7). A grammatical family is additively prime if and only if it is a product of primes. Remark (5.8). If d is additively prime and 2 1,23 are nontrivial grammatical families, then g(!tl , d, !t3) is prime. It follows that 2 1in is prime and that !tlin 0 !tlin is additively prime but not prime. The prime decomposition theorem [15] asserts that every grammatical family is uniquely expressed as a minimal sum of products of primes. In this statement the term 'minimal' means that no prime in the representation can be deleted to obtain another representation of the given grammatical family, and the uniqueness in the statement is up to order of the summands. It follows from this prime decomposition theorem that a proper grammatical family is prime if and only if it is of the form :#(21) or g(!tl , d, !t3) where !tl , d, !t3 are nontrivial grammatical families and d is additively prime. Furthermore, these two types of primes do not overlap. The prime decomposition theorem is used to reduce the problem of determining whether one grammatical family 2 is contained in another !t' to the special case in which !t is prime and 2' is a product of primes. This special case is resolved in [16] so that the problem of containment (and hence equality) between grammatical families is decidable (see also [3]). Thus, we have the situation that for arbitrary context-free grammars 0 1 and O 2 it is undecidable whether inclusion L(OI) C L(02) holds between their languages but it is decidable whether inclusion 2( 0 I) C !t( O 2) holds between the corresponding grammatical families.
E.H. Spanier / Grammatical Families of Languages
745
The prime decomposition theorem expresses every grammatical family in terms of prime grammatical families. The prime families occurring in the representation can also be expressed in terms of simpler families so that ultimately every grammatical family is expressed in terms of '?Jl and suitable operations. However this manner of representing grammatical families in terms of '?Jl does not lead to unique expressions. It is possible to pick out certain canonical expressions which provide unique representations for every grammatical family [16]. These exhibit canonical forms for grammatical families such that the families can be enumerated explicitly. These canonical forms should be important tools for the study of other questions about grammatical families.
References [1] Y. Bar-Hillel, M. Perles and E. Shamir, On formal properties of simple phrase structure grammars, Z. Phonetik Sprach. Kommunikationsforsch. 14 (1961) 143-172. [2] J. Berstel, Transductions and Context-Free Languages (Teubner, Stuttgart, 1979). [3] M. Blattner, The decidability of the equivalence of context-free grammar forms, 20th Annual Symp. Found. Computer Sci., Oct. 1979,91-96. [4] N. Chomsky, Three models for the description of language, I.R.E. Trans. on Information Theory ITI (1956) 113-124. [5] N. Chomsky, On certain formal properties of grammars, Inform. and Control 2 (1959) 137-167. [6] N. Chomsky, Context-free grammars and pushdown storage, MIT Res. Lab. Electronic Quart. Prog. Rept. 65 (1962). (7] N. Chomsky and G.A. Miller, Finite state languages, Inform. and Control 1 (1958) 91-112. [8] P.M. Cohn, Algebra and language theory, Bull. London Math. Soc. 7 (1975) 1-29. Reprinted as Ch. XI of: P.M. Cohn, Universal Algebra (Reidel, Dordrecht, 1981). [9] A. Cremers and S. Ginsburg, Context-free grammar forms, J. Comput. System Sci. 11 (1975) 86-117. [10] A. Cremers, S. Ginsburg and E.H. Spanier, The structure of context-free grammatical families, J. Comput. System Sci. 15 (1977) 262-279. [11] M. Davis, Computability and Undecidability (McGraw-Hili, New York, 1958). (12] S. Ginsburg, The Mathematical Theory of Context-Free Languages (McGraw-Hili, New York, 1966). [13] S. Ginsburg, Algebra and Automata-Theoretic Properties of Formal Languages (North-Holland, Amsterdam, 1975). [14] S. Ginsburg and H.G. Rice, Two families of languages related to ALGOL, J. Assoc. Comput. Mach. 9 (1962) 350-371. [15] S. Ginsburg, J. Goldstine and E.H. Spanier, A prime decomposition theorem for grammatical families, J. Comput. System Sci. 24 (1982) 315-361. [16] S. Ginsburg, J. Goldstine and E.H. Spanier, On the equality of grammatical families, J. Comput, System Sci. 26 (1983) 171-196.
746
E.H. Spanier / Grammatical Families of Languages
[17] M. Harrison, Introduction to Formal Language Theory (Addison-Wesley, Reading, 1978). [18) S.c. Kleene, Representation of events in nerve nets, In: Automata Studies, ed. C.E. Shannon and J. McCarthy (Princeton Univ. Press, Princeton, 1956) 266-306. (19) M.O. Rabin and D. Scott, Finite automata and their decision problems, IBM J. Res. Develop. 3 (1959) 114-125. [20] M.P. Schutzenberger, Context-free languages and push-down automata, Inform. and Control 6 (1963) 246-264. [21) E. Spanier, Grammars and languages, Amer. Math. Monthly 76 (1969) 335-342.
747
l.A. BARROSO editor, Aspects of Mathematics and its Applications © Elsevier Science Publishers B.Y. (1986)
RELATED ASPECTS OF POSITIVITY: A-POTENTIAL THEORY ON MANIFOLDS, LOWEST EIGENSTATES, HAUSDORFF GEOMETRY, RENORMALIZED MARKOFF PROCESSES ...
Dennis SULLIVAN Institut des Hautes Etudes Scientifiques, 91440 Bures-sur- Yvette, France and City University of New York, N. Y., 10036, u.s»: Dedicated to Leopoldo Nachbin
I. Discussion of Results, Motivation and Background
The motivation of this paper is twofold. First we are trying to get a better understanding via generalization of certain phenomena attached to complete manifolds M of constant negative curvature. Secondly, we will try to show that certain general phenomena for Riemannian manifolds which are fairly standard have interesting interpretations when specialized to constant negative curvature. Let r be a torsion free discrete subgroup of isometries of the noneuclidean hyperbolic space H d+! so that M = Hd+1/r. The critical exponent 8(r) is defined using the Poincare series Ll'Erexp(-sd(x, yy)), as the Dedekind cut in s separating convergence from divergence. In a series of papers Elstrodt [8] developed a relationship between 8(r) for Fuchsian groups and Ao(M), the edge of the L 2-spectrum of the corresponding hyperbolic surface. Elstrodt used hypergeometric functions to study the resolvent of .1, derived an inequality between 8(r) and Ao(M), and treated specific examples. In another series of papers Patterson [20], [21] constructed an interesting measure on 51 and used Selberg's point-pair invariants in a spectral analysis of .1 to relate 8(r) to the Hausdorff dimension of limit sets A (r) of certain Fuchsian groups. Patterson showed 8(r) = A (r) for finitely generated groups which either have no cusps or which satisfy 8(r) E j). Earlier Akaza [1] had treated such groups without cusps and Beardon [2] had shown that the presence of cusps implies 8(r) >~, (F nonelementary).
G,
D. Sullivan / Related Aspects of Positivity
748
Patterson [21] used his spectral discussion to sharpen Elstrodt's inequality to an equality. We will give here a new proof of a generalized Elstrodt-Patterson theorem valid for all torsion free discrete subgroups of isometries of Hl d + 1• Theorem (1.1). (Generalized Elstrodt-Patterson.) If M = Hl d + 1/r , then
if 0 ~~d, if 0 ~~d, where 0
=
o(r).
Most of our proof consists of a general study of A.o(M) for an arbitrary open connected Riemannian manifold. The rest of the proof is an estimate of the A. -Green's function J; eAtp(t, x, y) dt on hyperbolic space derived using only probability and spherical symmetry and not based on knowledge of special functions. Here A.o(M) is defined to be the negative of the infimum over smooth functions with compact support of the Rayleigh quotient J [grad tp 12/ J cp2. Since the numerator is J (J
D. Sullivan I Related Aspects of Positivity
749
Finally, using the above, Ao is easily interpreted as the edge of the for the infinitesimal generator .1 of the minimal heat semiL group p(t, x, y) (Theorem (2.2». This interpretation checks with a recent theorem of Stroock [11] that for the Friedrichs extension .1, e.it has a positive kernel which is indeed the minimal heat semigroup on M. Briefly, the first step of our proof of the generalized Elstrodt-Patterson theorem is to set forth the four determinations of the same real number Ao(M) valid for any open connected Riemannian manifold M. For the second part we turn to hyperbolic space itself. We first derive the structure of the positive A-eigenfunctions of .1 on IHI d+! using Martin's famous potential theory argument. These functions form a convex cone with compact base. The extreme rays of this cone (as in Choquet theory) are precisely labelled by the points of the sphere at infinity for HI d+! the corresponding extreme functions are the multiples of the familiar yO (in the various upper space models) with 8(8 - d) = A and, it is important to note, 8 ;;;. d12, i.e. 8 = ~d + (A + ~d2)1I2. This structure is due to Karpelevich [15] in the context of general symmetric spaces. For the convenience of the reader we give a simpler discussion which is possible here in the special case of hyperbolic space. The simpler argument makes use of spherical symmetry and probability and avoids knowledge of special functions. We then develop new results relating the growth of ¢J and its boundary measure, JL(p, ¢J), (see Theorem (2.11». For example, one interesting inequality, Theorem (2.13), reads: for JL (p, ¢J )-almost all geodesic directions g 2-spectrum
(g, R);;;' R
e-(d/2)R
where R = hyperbolic distance along the geodesic. There is also an easily derived relationship between exponential upper bounds on ¢J and lower bounds on the Hausdorff dimension of JL-positive sets on the sphere, Theorem (2.15). Namely, ¢J > 0, .1¢J = A¢J and (g E A, JLA > 0) ,
implies that D(A) (the Hausdorff dimension of A) is at least
750
D. Sullivan / Related Aspects of Positivity
Finally, we come to an algebraic point. The formula relating the eigenvalue A and the exponent {j is -A = {j(d - {j) (or the more familiar -A = {j(1- {j) in the hyperbolic plane). In other words, the two exponents, symmetric about ~d, {j and B" = d - {j lead to the same eigenvalue, - A = 8(d - 8) = 8*(d - 8*) = {j8*. One manifestation of this duality is the following: let X C s' be a set of finite positive Hausdorff measure in dimension D which is one of the numbers 8 or 8*. For each point p in IHI d+! let
D. Sullivan / Related Aspects of Positivity
751
binatories) one expects the Hausdorff dimension to vary. (It is known to do so in various special families and even to vary real analytically in analytic parameters of [23], [20]) and so the unitary 'leA varies in the complementary series of G(d) (see Subsection 2.3). Thus we can interpret the complementary series 'leA of G(d) in a dynamical and geometrical context of limit sets and negatively curved manifolds. A positive Ao-eigenfunction allows one to define a modified notion of a random path on M (Subsection 2.1). For example, if M possesses a positive square integrable eigenfunction the modified random process preserves the finite measure <1>2 dy. In effect M has been renormalized to have finite volume. In the hyperbolic case of this example there are several results usually only valid for finite volume manifolds which become true in a renormalized interpretation for these infinite volume manifolds. In some statements the sphere s' of dimension d with Lebesgue measure is replaced by the limit set, its fractal dimension, and Hausdorff measure. For example, for finite volume manifolds the entropy of the geodesic flow relative to Lebesgue measure is d while for geometrically finite examples the entropy of the geodesic flow relative to Hausdorff measure is the Hausdorff dimension, [26]. To conclude, in this paper we have tried to make a synthesis of several mathematical discussions in the context offered by complete negatively curved manifolds and discrete groups.
r
2. Precise Statement of Results to be Proved Later
2.1. Riemannian Manifolds: Definition of Ao(M) Let M be an open connected Riemannian manifold without boundary. Define the real number Ao in (-00, 0] as the negative of the infimum of JM [grad <1>1 2/ JM 1<1>1 2 over smooth functions on M with compact support. First, the potential theory approach to Ao(M). Say that a smooth function on M is A-harmonic if LJ. = A, where LJ. is the Laplacian. Theorem (2.1). For each A ~ Ao there are positive A-harmonic functions on M. For each A < Ao there are no positive A-harmonic functions on M. Compare [6], [10], [19]. Secondly, we take the Hilbert space approach to Ao(M). There is a
D. Sullivan / Related Aspects of Positivity
752
canonical self-adjoint operator (also denoted .1) on L 2(M) extending the Laplacian on smooth functions with compact support. If M is complete, all self-adjoint extensions agree and .1 is unique, [12]. In the general case we take for .1 the infinitesimal generator of the (minimal) heat semigroup, f(x, t) = IMP, (x, Y )f(y) dy. Here the symmetric positive kernel p, (x, y) is defined to be the supremum (an increasing limit) over all smooth compact subregions with boundary (Ma , aMa ) of the fundamental solutions p;(x, y) for the heat equation in M; vanishing on the boundary aMa , PI(x, y) = sup p;(x, y)
and
a
(Compare [7].) Theorem (2.2). The closed L 2-spectrum of .1 contains Ao and is contained in the negative ray (-00, Ao]' Compare (Friedrichs, Stroock [11]). Corollary (2.3). For A > Ao' the symmetric kernel bounded operator on L 2 , namely 1/.1 - A.
10'" eAtp,(x, y) dt defines
a
Combining Theorems (2.1) and (2.2) we have the following spectral picture for any open Riemannian manifold: Ao ~ a and Ao separates the L 2-spect ru m from the "positive spectrum".
111111 I
..
d.
III
111111111
'L -spect ru m ' of .1 2
Ao
'positive spectrum' of .1
Example (2.4). For M the real line (or euclidean space), Ao = 0, the functions e'", a real, are a 2-harmonic and {e- ia X } are virtual L 2 eigenfunctions belonging to -a 2 as continuous spectrum. Example (2.5). For M the hyperbolic plane, Ao = -~, the posinve Aharmonic functions for -~ ~ A ~ a are related to the complementary series of SI(2, IR), see Subsection 2.3, and the virtual L 2 eigenfunctions, as continuous spectrum on (-00, -~] are related to the principal series of SI(2, R).
D. Sullivan / Related Aspects of Positivity
753
Thirdly, we have the Markoff process approach to Ao. We say that A belongs to the Green's region of M if for some pair (x, y), x ~ y,
Je-Atp,(x, y) dt <
00.
o
A variant of a classical proposition (see Section 5) is that for A in the Green's region the integral converges for all pairs (x, y), x ~ y, and defines the A-Green's function gA (x, y) which is locally integrable and satisfies (zl, - A)gA (x, y) = Dirac mass at y. So for each y, gA(X, y) defines a positive A-harmonic function on M\{y}. Theorem (2.6). For any open Riemannian manifold the Green's region consists of either (i) the open ray (A o' (0), or (ii) the closed ray [A o, (0). In case (i), fo'" e -AOIP,(x, y) dt = 00, M is said to be Ao-recurrent. In case (ii), f: e -AOIP,(x, y) dt < 00, M is said to be Ao-transient. Now we discuss situations in which positive Ao-harmonic functions are unique (up to constant multiples). Theorem (2.7). (Recurrent case.) If the Green's region is (A o' 00), i.e. I; e -AOIP, (x, y) dt = 00, then the positive Ao-harmonic functions are constant multiples of one another. Theorem (2.8). (Square integrable case.) Suppose the spectral measure of Ll has an atom at Ao• Then the Ao eigenspace of Ll is one-dimensional and is generated by a (square integrable) positive Ao-harmonic function l!Jo' Also, the integral Jo'" e-AOlp,(x, y) dy diverges so M is Ao-recurrent and any (not necessarily square integrable) positive Ao-harmonic function is a multiple of l!Jo' We note here the related statement: if any atom of the spectral measure of Ll is represented by a (square integrable) positive A-harmonic function, then A = Ao and this atom is situated at Ao- This follows directly from Theorems (2.1) and (2.2).
754
D. Sullivan / Related Aspects of Positivity
Corollary (2.9). If a complete manifold M possesses a posutue square integrable eigenfunction cP for.1, then the eigenvalue is Ao(M) and cP is unique up to a constant multiple.
2.2. Renormalization of Random Motion Given any positive A-harmonic function we can add to the usual random motion on M a force field or drift term grad log cPo Then we have a biased random motion (the cP-process) corresponding to the second order operator .1 + 2 grad log cP, which acts on functions by (cf. Section 8) f ~ ilf + 2 grad log cP . grad f. The transition probabilities for the cP-process are (e-AtcP(y)/cP(x» x p,(x, y) dy. When the cP-process preserves the constant function 1 we say that cP is complete. This amounts to the reproducing formula cP(x) =
J e-Atp,(x,y)cP(y)dy. M
(The inequality ~ is always true.) When cP is complete the cP·process also preserves the measure cP 2(y ) dy (cf. Section 8). When there is only one positive A-harmonic function up to a multiple we refer to the cP-process as the A-process, Theorem (2.10). Suppose Mis Ao-recurrent (g e-AO'p,(x, y) dt = 00). Then the Ao·process associated to the second order operator .1 + grad log cPo preserves the function 1, the measure cP~(y) dy, and is recurrent-almost every path of the Ao·process starting from any point in M enters every set of positive measure infinitely often. In the square integrable case (Theorem (2.8» the Ao-process preserves a finite measure, cP~(y) dy.
2.3. Hyperbolic Manifolds Let M be the unique connected complete simply connected (d + 1)manifold of constant negative curvature IHI d + 1• We recall the two kinds of examples of positive A-harmonic functions on IHI d +1•
755
D. Sullivan / Related Aspects of Positivity
First, consider a Borel set A in IHJd+bS visual sphere at infinity s' which has finite positive Hausdorff measure in dimension a. Define a positive a(a - d)-harmonic function cPA on IHJd+! by the rule: cPA(X) = Hausdorff a-measure of A in the visual metric on s' as viewed from x. (That ¢JA is A-harmonic follows from the discussion below.) Second, given g in Sd choose stereographic projection of the ball model for IHJ d+! to the upper half space model for IHJ d+! with g ~ 00. If y is the vertical coordinate then ¢J(x, a, g) = (y(x)Y is a positive a(a - d)-harmonic function on IHJ d+l. (In these coordinates, L1 = l (Euclidean L1) + (1- d)y May.)
Note that in these examples both a and d - a lead to the same eigenvalue A = a(a - d) = (d - a )«d - a) - d). Also A is a minimum _~d2 for a = ~d. Theorem (2.11). (i). For IHI d+!, Ao = _~d2 [17], [21]. (ii). Fix p E n--n d+!. Then every positive A -harmonic function ¢J is uniquely expressible in terms of the ¢J(. , a, g),
J cP(x, a, g) d/L (p, cP )(g) ,
¢J(x) =
Sd
where a = ~d +(,\ - AO)' I2, the ¢J(', a,~) are normalized to be 1 at p, and /L(p, ¢J) is a unique positive measure on with total mass ¢J(p) [15].
s'
The next two theorems concern the boundary measure /L (p, ¢J) and its measure class for any positive A-harmonic function ¢J. Let /L(p, ¢J, R) be the measure on the sphere S(p, R) of hyperbolic radius R centered at p, i.e. /L(p, ¢J, R) = 1/c R ' (¢J restricted to S(p, R»· spherical measure, where e-(d-a)R
cR --
{
.R e -(d/2)R
s:
Theorem (2.12). In the compactified space H d +1 U the boundary measure /L(p, ¢J) of Theorem (2.11) is constructed from ¢J as a weak limit of the /L(p, cP, R),
lim /L (p, ¢J, R) = /L (p, ¢J) . R ...'"
756
D. Sullivan / Related Aspects of Positivity
Now we consider radial limits, along hyperbolic rays (R, g) emanating from p, of a positive A-harmonic function 4> with 4>(p) = 1.
Theorem (2.13). (a). For
g outside the
closed support of J.t (p,
as R
4»,
~oo.
(b). For J.t (p, 4> )-almost all g, 4>(g, R) ~
-(d-a)R
forA>-~d2,
{~ e -(d/2)R
for A = _~d2.
(c). For all g, as Rr--»»,
Now a generalization of Fatou's theorem. Suppose 4>1 and 4>2 are positive A-harmonic functions and J.t (p, 4>1) is absolutely continuous with respect to J.t (p, 4>2) with Radon-Nikodym derivative l/J(g).
Theorem (2.14). For J.t (p, 4>2)-almost all
g
In particular if 4>1 :s;; 4>2' then J.t (p, 4>1):S;; J.t (p, 4>2) by Theorem (2.12), and the conclusion holds.
Define the exponential growth of 4> along a hyperbolic ray (R, g) from p in the direction g by . log 4>(R, g) hmsup . R ....co R By Theorem (2.13) this growth is always :S;;a = ~d + (A + ~d2)1/2. Suppose the growth is smaller,
D. Sullivan I Related Aspects of Positivity
757
Theorem (2.15). (i). The Hausdorff dimension of A is at least
(ii). In particular if ¢J is bounded, the Hausdorff dimension of any + (A + ~d2)1/2.
J.l (p, ¢J )-positive set is at least ~d
We describe the behaviour of the A-Green's function gA(X,y)= Io"" e-Atp,(x, y) dt on Hd+t, which is finite for A E [AO' 00) and only depends on r = d(x, y) for r near 00. It is convenient to include a description of the A-spherical function SA (x, y) which is by definition the unique (up to a multiple) positive A-harmonic function of x in Hd+t, spherically symmetric about y in Hd+l. These two functions are solutions of the second order differential equation in the radius R which has regular singular points at R = 0 and R = 00. Theorem (2.16). For A ~ AO' gA (x, y) and SA (x, y) generate the two-dimensional space of spherically symmetric solutions of (.1 - A)f = 0 on H d+1\{y}. The A-Green's function (fo"" e-Atp,(x, y) dt) is the small (or recessive) solution near R = 00, and the A-spherical function (fSd ¢J (x; g, a) d8(g» is the small (or recessive) solution near R = O. Thus if a = ~d + (A + ~d2)112, gA - constant· e- aR near R = 00, while SA - constant· e -(d-a)R near R = 00, except when a = ~d where SAconstant· R e-(dI2)R near R = 00.
Now let r be any discrete group of hyperbolic isometries. If r has no torsion then H d + 1/r is a complete Riemannian manifold with constant negative curvature to which the generalities of Subsection 2.1 apply. We have the generalized Elstrodt-Patterson theorem. Theorem (2.17). For M = Hd+l/r, Ao(M) satisfies
if S(r) ~~d , if str,» ~d, where S(r) is the critical exponent of r.
Recall the critical exponent S(r) is defined so that the Poincare series of F,
758
D. Sullivan / Related Aspects of Positivity
L
g(x, y, s) =
exp - (sd(x, yy»
rEF
converges for s > 8(r) and diverges for s < 8(r) where (x, y) is any pair of points in IHJd+1.
CoroUary (2.18). (Of proof) If M
=
and the Poincare series diverges at s
IHJd+1/r, Mis Ao-recurrent iff 8(r) ~ ~d = 8(r).
Now A-harmonic functions on M are just r-invariant A-harmonic functions on IHJ d+!. From the definition it follows that for any positive A-harmonic function
where Iy'l is the linear distortion of the visual metric on Sd as viewed from p, a = ~d + (A + ~d2Y/2 as before, and Y*IL(set) = lL(y(set». Thus if
r.
where 8 = a and y E Thus Theorem (2.17) yields the existence of measures on s' satisfying (2.1). Curiously, a bit more can be said about this question than the A-potential theory implies. The following theorem generalizes earlier results of Patterson and the author:
Theorem (2.19). (i). If
r is any discrete group of isometries of IHJd+1 (except
for elementary parabolic or cocompact groups) there is a finite positive measure on satisfying y* IL = /y'llllL, Y E F, iff 8 E [8(r), (0). (ii). We may further suppose that IL is concentrated on the limit set of r unless r is geometrically finite without cusps. In these latter cases (including cocompact groups) the only such measure on the limit set is the Hausdorff measure in dimension 8(r).
s'
The limit set of r is by definition the set of cluster points in s' of any r orbit in IHJd+!. The condition geometrically finite without cusps means that has a finite sided fundamental domain in IHJd+1 which does not touch the limit set.
r
759
D. Sullivan / Related Aspects of Positivity
Remark (2.20). For the elementary parabolic groups there are point measures in s' satisfying (2.1) for any 0 in [0,00) even though o(r) = ~ x (rank of parabolic subgroup) > O.
M
We mention two more theorems relating the A-potential theory of = 1HId+1/r and the Hausdorff geometry at infinity.
Theorem (2.21). (i). If T is geometrically finite and M
=
1HId+1/r then
D~~d, D~~d,
where D is the Hausdorff dimension of the limit set. (ii). M has a square integrable positive Ao-harmonic function iff D > ~d. Mis Ao-recurrent iff D ~ ~d.
r is geometrically finite. Then whether or not the Hausdorff dimension of the limit set belongs to (0, ~d) and if not its exact value in Hd, d) is determined by the A-potential theory ofM.
Corollary (2.22). Let M
=
IHI d + 1/r where
Any discrete group of isometries of the hyperbolic plane 1H1 2 is a union of geometrically finite groups. This allows a general result.
Theorem (2.23). For any complete connected hyperbolic surface S let D
denote the Hausdorff dimension of the set of those geodesics emanating from any fixed point in S which returns infinitely often to any bounded neighbourhood of that point. Then Ao(S) satisfies 1
A.o(S) = {~(D
- 1)
l D ,< '2, D~~. -.c;:
Recall G(d) denotes the group of proper motions of IHI d+l. Then G(l) = PI(2, R) and G(2) = PSI(2, C).
Now Theorem (2.21) allows a canonical geometric interpretation of the complementary series in terms of hyperbolic manifolds IHI d + 1/r and the Hausdorff geometry of the limit sets of the discrete groups r.
760
D. Sullivan / Related Aspects of Positivity
Theorem (2.24). Let 4Jo denote the square integrable positive Ao-harmonic function on M = H d +1 where is geometrically finite and the Hausdorff dimension of the limit set D = 5(r) > ~d. Then the linear span of the G(d)-orbit of 4Jo in L 2(G(d)/r) generates the member of the complementary series labeled by Ao(M) E (-~d2, 0). For example, if r has no cusps (or all cusps have rank OE;.D) then 4Jo(p), the K-invariant vector, is just the function on HI d+l which assigns the Hausdorff D-measure of the limit set ofT calculated in the metric as viewed from p.
/r
r
Remark (2.25). There are examples where deformations of one F make Ao cover the entire (spherically symmetric) complementary series, [24], [3].
3. Compact Manifolds with Smooth Boundary Let M; be a compact manifold with smooth boundary. Let p~(x, y) be the fundamental solution of the heat equation in M; vanishing on aMa (cf. [22]). The infinitesimal generator of the semigroup
f(x, t) =
J p~(x, y)f(y) dy,
defines a self-adjoint operator L1 on L 2(Ma ) extending the Laplacian acting on smooth functions vanishing near the boundary [22]. By the compactness of M; there is a discrete set of eigenvalues for L1
and a complete basis of L 2 consisting of eigenfunctions vanishing on the boundary. Since IA~I is the infimum of f M a [grad 4J1 2 / f M a 14J/ 2 over smooth functions vanishing near the boundary, any eigenfunction 4Jo belonging to A~ does not change sign (see Section 8 for an alternative argument). It follows that A~ has multiplicity 1 and 4Jo is unique up to a constant multiple. Since one may write an absolutely convergent eigenexpansion for p~(x, y),
D. Sullivan / Related Aspects of Positivity
761
(3.1) [22], one has (3.2)
where ¢J~ is the unique positive normalized zeroth eigenfunction. From the probabilistic interpretation [18] of p;(x, y) dy as the probability density of endpoints of random paths starting at x which have not hit the boundary before time t, one has from (3.2) that the probability of starting from x and hitting the boundary iJMa by time t is asymptotically 1 like (3.3)
1- constant e(A(j)1 .
Now recall the Dirichlet problem for M a . If f is a continuous function on iJMa the harmonic extension of f inside M; may be written (3.4)
f(x) =
J f(g) dJLa (x, g)
where JL a (x, g) is the probability measure associated to hitting the boundary with random paths starting from x. Now weight the hitting probability by e- Ar where T is the hitting time and A is any number> A~ . By (3.3) the resulting measure JL ~ (x, g) is well defined and finite. Again if f is a continuous function on the boundary
(3.5)
f(x) =
J f(g) dJL ~ (x, g)
defines a smooth A-harmonic function in M; with boundary values f. The classical proof of (3.4) may be modified to give (3.5) replacing ..1 by zl - A. Now recall that the generalized Poisson measures JL a (x, g) of (3.4) are equivalent for various x and that for fixed X o in M; the ratio dJLa (x, g)/dJLa (x o, g) = t/Ja (x, ~) is for g fixed a positive harmonic function
D. Sullivan / Related Aspects of Positivity
762
(which
is zero
n
on
aMa \{~} and has a pole at Similarly ~) is for ~ fixed a positive A-harmonic function on M; (which is zero on aMa \{~} and has a pole at (See Subsection 2.3 for examples.) This shows the Harnack principle for positive harmonic functions is also valid for positive A-harmonic functions, A > A~. Namely, write (3.5) as
d,u: (x, ~)Jd,u: (x o, ~) =
1/1: (x,
n
(3.6)
showing that the values of (f(~) d,u:(xo, ~» of values
f
around x are fixed convex combinations
(1/1: (x, ~» which only vary in a bounded ratio.
4. Proof of Theorem (2.1)
Now consider the directed set of all compact connected regions M; eM with smooth boundary. Since A~ (of Section 3) is the negative of the infimum over smooth functions supported on interior M of f Ma [grad ¢J2J f Ma 1¢1 2, the number Ao defined in the introduction clearly satisfies
a
and Ao > A~ for all a. Then by Section 3 there are positive A-harmonic functions on M; for any A ~ Ao > A~. By the Harnack principle described in Section 3 we have compactness with respect to uniform convergence on compact sets for those positive A-harmonic functions which are ~1 at a fixed point xo' We can form convergence subsequences of those defined for an exhaustion of M by M; and thereby prove the first part of Theorem (2.1). The second part of Theorem (2.1) follows from the fact that a positive
D. Sullivan I Related Aspects of Positivity
t\-harmonic function (4.1)
f(x)
=
f continuous on
Je-Alp~(x,
763
M a satisfies
y)f(y) dy + fe-AT d (Wiener measure), p
where p is the set of paths which hit oMa at
T
< t. So
Je-Alp~(x,y)f(y)dy.
f(x)~
This shows t\ ~ t\~ using (3.2) and completes the proof of Theorem (2.1).
5. The Green's Region and t\-Superharmonic Functions Consider the function gA (x, y) = f; e -At P, (x, y) dt and suppose gA (x, y) is finite for one pair x 'i- y. From the definition gA (x, y) is symmetric and as a function of x it is (1). The increasing limit of continuous functions (and so lower semicontinuous, f(x) ~ lim Xj"'X f(x; )). (2). Decreased pointwise by at least the factor eAI by the heat semigroup, f(x, t) = f MP, (x, y)f(y) dy. Namely, f(x, t) ~ e-Atf(x). Functions of x satisfying (1) and (2) (and not identically +(0) are called X-superharmonic. So if t\ belongs to the Green's region there is a t\-superharmonic function, is, (x, y) for each y). Conversely, suppose f is t\-superharmonic and let denote e- At (heat operator). We apply the operator equation
P;
T
(5.1)
J
p
o
to
P; = -1 Jr: ds - -1 I
A S
ds Id-
t
t
S
r; f = f
or t\ belongs to the Green's region.
J
T
0
f and deduce using (1) and (2) that either
(5.2)
t
T+t
for all x,
D. Sullivan I Related Aspects of Positivity
764
Using the fact that for smooth functions of compact support c/J
Id-
(5.3)
e: c/J ~ - (,1 -
_------'-I
t
A )c/J ,
uniformly on compact sets as t ~ 0, one obtains by duality that a Asuperharmonic function (which is locally integrable by I~ f) satisfies
P:
(5.4) in the sense of distributions. Thus -(,1 - A)I is a positive Radon measure approximated by «(f f)lt) dy, whenever 1 is A-superharmonic. Calculating the latter for gA (x, y) (as a function of x for y fixed) yields
P:
P:
1
Id ---gA (x, y) t
=
1 t
Je
-At
p.i», y) ds
o
which approaches the Dirac mass at y as t ~ O. A coroIlary is that gA (x, y) is finite for all x =I- y and defines a positive A-harmonic function on M\{y}. Another coroIlary is that if A belongs to the Green's region then for every compact K in M
lim e- A1
(5.5)
T_oo
JPT(X, y) dy = O. K
To see this choose £j ~ 0 and T; ~ 00, write g(x, y) = limT;_oo. <;-0 and use the heat equation to calculate (,1 x - A) X (gA (x, One gets two terms, the one near zero converges to the right answer, the Dirac mass at y, so the other one corresponding to 00 must go to zero. Since the convergence is that of Radon measures, (5.5) results. Besides the Green's function, positive A-harmonic functions also provide examples of A-superharmonic functions. This foIlows using (4.1) repeatedly,
f;i e-Atpl(x, y) dt,
y».
765
D. Sullivan / Related Aspects of Positivity
Pt(X, y)
= SUp p;(X, y), a
and
M
=U Ma • a
More precisely, (4.1) shows that (..\0,00) is contained in the Green's region because the second part (5.2) must hold for a positive Ao-harmonic function whenever the A of (4.1) belongs to (A o, 00). Now if A < Ao then A < A~ for some a and if A belongs to the Green's region, (5.5) implies fKe-A~tpt(x, y)dy~O as t~oo contradicting (3.3). Thus the Green's region does not contain A and must consist of either fA,oo) or (A o' 00). This proves Theorem (2.6). 6. The LZ-Spectrum of J and the Proof of Theorem (2.2)
Using the spectral theorem and the positivity of Pt(x, y) one sees immediately that if the interval fA, 00) does not intersect the LZ-spectrum of J (the infinitesimal generator of the semigroup f(x, t) = f M Pt(x, y)f(y) dy), then the bounded operator on L Z, 1/..::1 - A is represented by the positive kernel J; e-A1pt(x, y) dy. Applying the operator to a positive function with compact support shows that f;'e-Atpt(x,y)dy is finite a.e. Thus [A,oo) is contained in the Green's region. So the entire component of the complement of the spectrum containing the positive reals is contained in the Green's region. For the other inequality required by Theorem (2.2) consider the L Z_ norm of PJ = f M Pt(x, Y)f(y) dy. This is the square root of f M (J Pt(x, YI) X f(YI) dYI f Pt(x, yz)f(yz) dyz) dx. Thus, (6.1)
IIP,fIlL = ( 2
J
1(2
PZt(Yl' YZ)f(Yl)f(yz))
,
MxM
by the semigroup equation for Pt(x, y). Now consider a positive, bounded, measurable f, with support contained in a compact K in the interior of M. By (5.5) for each Yz, e -At f Pt(x, YI)f(YI) dYI ~ 0 as t ~ 00, if A belongs to the Green's region. For a set A of Yz's in K of almost full measure this convergence is uniform. Thus if g is f times the characteristic function of A we have, by (6.1), that the LZ-norm of Ptg times e- At goes to zero as t~oo. The linear span of these g is dense in L Z. It follows the LZ-spectrum of J cannot have points greater than A, for then there would be elements h in L Z so that the L 2_
766
D. Sullivan / Related Aspects of Positivity
norm of ~h would not decrease as fast as e ". This proves Theorem (2.2).
o
The corollary to Theorem (2.2) is explained by the first paragraph of this section.
7. On the Uniqueness of Positive AD-Harmonic Functions (Proofs of Theorems (2.7) and (2.8» Suppose the convex cone of positive Ao-harmonic functions is not a single ray. The base of this cone {4>1 4>(xo) = I} is convex, metrizable, and compact in the topology of uniform convergence on compact sets by the Harnack principle of Section 3. Let f and g be two different extreme points of this compact convex set so that f ~ g and g ~ f are both false and form 4> = min{f, g}. Let p;o be the operator of Section 5. From (4.8) it follows that p;of ~ f and P;Og ~ g. Thus by positivity of p;o, P;°4> ~ 4> so 4> is Ao·superharmonic (Section 5). Since 4> is not smooth 4> cannot be AD-harmonic. (There is a transversality detail here which can be treated using multiples of f and g if necessary.) Thus, P;°4> =I 4> for some t and the second case of (5.2) must hold. Thus AD belongs to the Green's region, i.e. f; e -AO'p,(X, y) dt < 00. This proves Theorem (2.7). Now suppose there is an atom at Ao for the spectral measure of ..1 on 2 L • Since ..1 - Ao is the infinitesimal generator of p;o we must have P;°4> = 4> for 4> in the AD eigenspace of ..1. In particular, IIP;Ogl/ does not approach zero as t -HXJ for a dense set of L 2• Thus by (5.5) AD is not in the Green's region. This proves the second part of Theorem (2.8). Now we give a proof that any 4> in L 2 satisfying P;°4> = 4> cannot change sign. By Theorem (2.2), p;o is a contraction on L 2 , so IIp;ol4>llb ~ 1114>llb where 14>1 is the absolute value of 4>. On the other hand,
14>(x)1
=
1P;°4>(x)/ ~ p;ol4>/(x),
so (14)I(x)? ~ (P;ol4>l(x)f Combining these two gives 14>/(x) = P;O!4>I(x) a.e.. If 4> is not entirely negative, at a generic point where 4> (x ) > 0 we have
4>(x) =
J e-AO'p,(x, Y)4>(Y) dy M
D. Sullivan / Related Aspects of Positivity
767
and
¢(x) =
1~I(x) =
f
e-AO'p,(x, y)I¢J(y) dy.
M
So ¢ = 1<1>1 a.e. and ¢ must be entirely positive. Since any ¢ does not change sign no two can be orthogonal in L 2 • This completes the proof of Theorem (2.8).
8. The ¢-Process and Completeness of A-Harmonic Functions (Proof of Theorem (2.10» It is formal that the operator defined on functions by the kernel e-At¢(y)/¢(x)p,(x, y) and on measures by duality preserves the function 1 and the measure ¢2(y)dy iff ¢(x)=IMe-Atp,(x,y)¢(y)dy (i.e. ¢ is complete in the terminology of the introduction). The differential operator or infinitesimal generator associated to this diffusion operator is [¢ 1(..1 - A)[¢] where [¢] denotes the multiplication operator by ¢. Thus [¢r 1(L1-A)[¢]f= ¢-I(L1-A)¢f= ¢-I«L1¢)·f+ ¢ . L1f + 2 grad > • grad f - It.¢f) = L1f + 2 grad log ¢ . grad f, since L1¢ = A¢. If M is Ao-recurrent and ¢o is the unique positive Ao-harmonic function (up to a multiple), then by (5.2) we must have ¢o(x) = I M e-AO'p,(x, Y)¢o(y) dy, namely the first of (5.2) holds. For otherwise, by the second of (5.2), Ao belongs to the Green's region. This proves all but the last part of Theorem (2.10). To prove recurrence we simply check the criterion for recurrence that the Green's function of the process is identically +00. For the ¢-process the Green's function is I; e-AO'¢(y)/¢(x)p,(x, y) dt which equals +00 since the ¢(y)/¢(x) factor does not matter. This proves Theorem (2.10). Now let us discuss the question of completeness for A-harmonic functions. We will give several arguments for the existence of complete A-harmonic functions which depend on auxiliary hypotheses.
r
Argument (8.1). (Fixed point property.) Let C6A note the convex cone of positive A-superharmonic functions. The heat semigroup operates on C6A • Using compactness of the base of C6A and continuity of P, (if true
768
D. Sullivan / Related Aspects of Positivity
simultaneously) we have, by the fixed point theorem, fixed rays in eeA • Taking the minimum A, namely Ao' the equation P'c/> = cc/> implies c = e -Ao' and we arrive at a complete positive Ao-harmonic function. (I am indebted to Dan Stroock for pointing out that a topology making eeA have a compact base and P, continuous for a general Riemannian manifold is not obvious.) Argument (8.2). (Minimal A-harmonic functions.) Let tleA denote the convex cone of non-negative A-harmonic functions. The base of tleA is compact by the Harnack principle of Section 3. Suppose the heat semigroup preserves tleA or that even tie = tleA n P,tleA ¥ 0 is a nontrivial convex cone with a compact base. Let 1 lie in an extreme ray of tie and let = P;f. Then I~ by (4.1) and belongs to tie. Now g = 1is non-negative and A -harmonic. If 1 = P~h, then g = P~ (h - f) so g belongs we must have g = cd and = cd since 1 is to tie. Since 1 = g + would not be A-harmonic. extreme. But C2 < 1 is impossible for then Thus 1 = 1 for any extreme ray. By linearity and Choquet, h = P; h for any h in tie. So if tie = tleA n P'tleA is closed and nontrivial it consists entirely 01 complete A-harmonic functions.
r
r r
r;
r
r
r
r
Example (8.3). If M is the interior of a compact manifold with boundary, a continuous positive A-harmonic function c/> is rarely complete. By (4.1) it is necessary that c/> vanishes on the boundary. Thus A = Ao and 4> must be proportional to zeroth eigenfunction 4>0' which is complete. Example (8.4). (Another Argument.) If M (or a covering space) has bounded geometry, that is each point is centered in a neighbourhood of fixed radius which is a bounded distortion of the unit ball in Euclidean space, then every positive A -harmonic function is complete. This follows because the constants in Harnack's principle are uniform (so a positive A-harmonic function 4> grows at most exponentially) and the heat kernel satisfies an inequality p,(x, y):s;; c e a (d (x, y »2 for i « 1 and d(x, y) ~ 1 (so p,(x, y)c/>(y)dy has little mass near infinity). Now a straightforward estimate shows that a positive A-harmonic function is complete. Problem (8.5). (Stroock and Sullivan.) Which open connected manifolds have complete positive Ao-harmonic functions? We now turn to the proofs of the theorems in Subsection 2.3.
D. Sullivan / Related Aspects of Positivity
769
9. Proof of Theorems (2.11) and (2.16) If for some A, there is a positive A-harmonic function ep on Hd +1, then we can average ep over the compact group of isometries fixing some y in H d+l. We obtain a spherically symmetric positive A-harmonic function ep(R) = S). (x, y) where R = d(x, y). Then ep(R) satisfies
(9.1) where A(R) = the area of the sphere of radius R about y, and A'(R) = (d/dR)A(R). For R near zero and infinity respectively, this equation becomes R=O:
(9.2)
R =
00:
(-
d2
dR
d
2
+d - - A) ep =0. dR
The exponential solutions near 00 are determined from the indicial equation u 2 + du - A = O. In other words if a = -u, A = a(a - d). Real exponentials result iff A ~ _~d2. Thus there are spherically symmetric positive A-harmonic functions iff A ~ _~d2. This proves Theorem (2.1l)(i). Before proving Theorem (2.11)(ii) we must prove Theorem (2.16) and analyze the A-Green's function, g). (x, y) = e -At p, (x, y) dr, Looking again at the 'equations in the form (9.2) one sees: (i). Near R = 0 there is a l-dimensional subspace of bounded solutions, the rest of the solutions have a standard Green's singularity, 10g(I/R) if d = 1 and (l/R)d-l if d> 1. (ii). At R = 00 there is a l-dimensional space of solutions asymptotic to a constant· e- aR where a = ~d + (A + ~d2)1/2. The rest are asymptotic to a constant· e -(d-a)R if a > ~d or constant· R e -(d/2)R if a = ~d. We know from Theorem (2.11) and Theorem (2.6) and the non-uniqueness of positive Ao-harmonic functions that the Green's region is [A o, (0). We know from (L1 x - A)g). (x, y) = Dirac mass at y that g). (x, y) has a standard Green's singularity at x = y, R = O. We have seen from the definition that S). (x, y) is bounded near R = 0 and therefore S).(x, y) is the small (or recessive) solution near R = O. We want to show that g). (x, y) is the small (or recessive) solution at R = 00.
f::
770
D. Sullivan I Related Aspects of Positivity
Claim (9.1). The recessive solution at R = 00 for A ~ Ao is positive for all R > 0 and has a Green's singularity at R = O. Proof of claim. The bounded solution at R formula'
= 0,
SA (x, y), has the simple
JcP(x;~,a)dO where dO is the spherical measure on Sd with y the center of the unit ball model and the cP(·; ~, a) of Subsection 2.3 are normalized at y. A special case of the calculation in the proposition of the proof of Theorem (2.13) shows that SA is a large solution near R = 00. Thus g, the recessive solution at R = 00, cannot also be recessive at R = 0 because it would then be a multiple of SA (which is large at R = (0). Thus g tends to 00 as R .... 0 and must cross SA for some smallest R = R o. At R o the Wronskian gS~- SAg' = g(Ro)(S~ - g') is negative since g(R o) = SA (R o)> 0, and S~(Ro) < g'(R o). Since the Wronskian does not change sign and S~ < 0, each of the following behaviours
Fig. 1.
is ruled out. So g > 0 and we have the picture (Fig. 2, opposite page), which proves the claim and a bit more. 0 To finish the proof that g = constant· gA (x, y) write gA (x, y) as the sup, g~(x, y) where D; is an exhaustion of H d +1 by balls centered at y,
D. Sullivan / Related Aspects of Positivity
771
Fig. 2.
°
and g:(x, y) is the A-Green's function for Da • Now cig - C2SA is zero on aDa and has the same weight singularity at R = where C1 and C2 are positive constants. So cig - C2 SA= g; (x, y). Thus g; (x, y) ~ constant· g. The constant is fixed, so gA (x, y) = sup a g; (x, y) ~ constant· g. It follows that gA(X, y) is small (or recessive) at R = 00 and must be a constant times g. This completes the proof of Theorem (2.16). Now we are in a position to prove Theorem (2.1l)(ii) by Martin's construction (1941). We sketch. the steps of this famous argument. Choose a reference point X o in IHI d + 1 and consider the quotient kA(x,y)=gA(X,y)/gA(XO'y). As a function of y, (x fixed) kA(x,y) is continuous on H d + 1 U S" with kA(x, ~)= q,(x,~, a) (normalized at x o) for ~ in s'. This follows from Theorem (2.16), a = ~d + (A + ~d2Y/2. Let q, be a positive A-harmonic function which is a limit of A-potentials
y
of Radon-measures JLn on IHI d + 1, (all are as we shall see). The measures J1- ~ = gA (xo, y)J1-n have total mass ~q,n (x) (~q,(x) + 1 for n large). So let J1be a weak limit measure in IHI d+1 U s'. Since (..1 - A)q,n = J1- n and (..1 - A)q, = 0, J1- must be supported on s'. We calculate
772
D. Sullivan I Related Aspects of Positivity
c/J(x)=li~c/Jn(x)=Ii~
JgA(x,y)dtLn(Y) y
= lim n
J k (x, y) dtL~(Y) A
y
=
Jk, (x, y) dtL (because k, (x, y) is a continuous function of y) y
=
J c/J(x,~,a)dtL(~), f
s:
since tL lives on This proves the existence part of Theorem (2.11)(ii) for a limit of potentials. We now give the classical argument to see that any A-superharmonic function I is an increasing limit of potentials. Form In = min{f, nGAXn} where Xn is the characteristic function of the ball of radius n about some fixed point and GAXn(x) = f MXn(y)gA (x, y) dy. Then In is non-negative bounded, A-superharmonic, In increases to I, and In satisfies inf T ....cc P~/n = o (the latter, since this is true for nGAXn and inf{P~/, P~g} ~ P~ inf{f, g}). Now apply (5.1) to In and let T ~ 00 to obtain
GA (1!t(fn - P: In» = lit
JP~/n ds. o
The right hand side is increasing to In as t~ 0 since In is A-superharmonic. Thus In is the increasing limit of potentials GAtL, where IL, = 1lt(fn - P: In)' This implies that I is the increasing union of potentials and completes the proof of the existence part of Theorem (2.l1)(ii). The uniqueness follows from Theorem (2.12) (which only uses the existence part of Theorem (2.11)(ii) in its proof).
10. Proof of Theorems (2.12), (2.13), (2.14) and (2.15)
To prove Theorem (2.12) we must first calculate the normalizing factor for IL(P, c/J, R) = 1!cR • (c/JIS(p, R»· spherical measure. We want
D. Sullivan I Related Aspects of Positivity
773
cP(p) = l/cR I cP/S(p,R)d8R(x), x
where d8R is the unit spherical measure on S(p, R). Write cP as an integral of the cP(·, g, a),
cP(x)
I cP(x,g,a)df-L(p,y)(g),
=
f
where f-L (p, cP) has total mass cP(p). Substituting, gives
cP(p)CR= I I cP(x,g,a)df-L(p,cP)(g)d8R(x) x
f
f
x
Thus cR is the function of R, S),(R) = Ix cP(x,g, a)d8R(x) where x= (R, g), which we have seen in Section 9 to be of the order e -(d-a)R for a > ~d and R e -(d/2)R for a = ~d. With the indicated choice of CR the total mass of f-L (p, cP, R) is cP (p). Now let I be a continuous function on ~d+l C s' and let R ~ 00. Then
l/cR I I df-L(p, cP, R) = lIc I I' cP . d8R R
= l/cR I I(I cP(x,
g, a)df-L(p, cP)(g)) d8R(x)
f
x
= I (lICRI l(x)'cP(x, g, a)d8R(x)) df-L(p, cP)(g)· [
x
Outside a disk of radius e > 0 (fixed so that I is near I(t) on this region)
774
D. Sullivan / Related Aspects of Positivity
in polar coordinates (R, ~), cP(x,~, a) is of the order e -aR. On the other hand, the integral f cP(x,~, a)dOR is larger, e-(d-a)R or R e-(dJ2)R as indicated above. Thus the inner integral is concentrated near ~ and converges to f(~) on u-.». Thus lim
R....""
J
f· dJL(p, cP, R) =
J
f dJL(p, cP),
proving Theorem (2.12). Remark (10.1). This proof of Theorem (2.12) for a > ~d was shown to me by Mary Rees who offered it as an alternative to the sketch of Theorem (2.13) for a > ~d in [25]. The questions of Mary Rees were part of the motivation from the exposition here. Now we prove Theorem (2.13). First we have a proposition asserting that no finite measure JL on s' is more diffuse than Lebesgue measure. Proposition (10.2). Let JL be a finite positive measure on u-olmost all ~ in s'.
s'. Then for
· . fJL-(~,dr) >0, Iimm r-O
r
where JL (~, r) is the JL measure of a disk of radius r centered at
~.
Proof. Let A be the set of ~ in s' so that for every 8 > 0 and ~ in A there is a sequence rj ~ 0 with JL (~, rJ ~ er1. By the covering lemma ([9, Th. 2.8.14]) there are (arbitrarily fine) coverings of A using disks of these radii (and centers on A) which fall into K = K(d) collections consisting of disjoint disks. One of these collections C must contain at least 1/K . JL (A) of the mass of JL. Thus I/K'JL(A)~"2-JL(~,rJ~£ "2-r1 c ~ e . Lebesgue measure of
s' .
Do Sullivan / Related Aspects of Positivity
So j.L(A) ~ tion.D Fix
~o
10 •
775
K· measure of Sd for any e > O. This proves the proposi-
and calculate for x = (R,
~o)
J
f
Divide the integral into 3 parts: (i) d(~, ~o) ~ e -R, (ii) e- R ~ d(~, ~o) ~ 10, and (iii) d(~, ~o) ~ 1O. Here 10 > a is a parameter and d is the spherical or Euclidean distance in the unit ball model. An elementary calculation (see [27, Section 1]) shows that for x = (~o, R) in these 3 regions
qo)'
00) (11
e-s« . II s 2a
(iii) e-aR ,
Thus
J
eaRdj.L+
(i)
J (ii)
e-
aR
l /s2a dj.L +
J
e-aRdj.L.
(iii)
The first term is comparable to eaRj.L(qo, e- R ) . The third term is at most e -aR. We treat the second term by partial integration to obtain (ignoring constants)
=
e -aR
J
j.L (qo, S )ls
2a 1
+
ds - (constant· first term) + constantts ) .
Now by the previous proposition for j.L-almost all ~o, j.L(qo' s) is eventually ~C(gO)Sd. So, II = e?" fc-R ...... j.L(qo, s)ls 2a + 1 ds is
776
D. Sullivan I Related Aspects of Positivity -(d-a)R
~ {~ e
(10.1)
-(d/2)R
if a >~d, if a = ~d.
It follows that for R large either the first term (i) is at least as large as II or the second term (ii) is of the order of II. Thus (i) + (ii) is at least as large as II which is much bigger than (iii). This proves Theorem (2.13)(b). The others are easier. We have also derived the fact that the essential contribution to (x) = (R, fo) for R large (~R (£ » and JL -almost all f o comes from the part of the integral with dU, fo) ~ £ for any e > O. This is useful for Theorem (2.14). We now write out
I(X) =
Jx (f) (x, f, a) dJL, e
2(X) =
J
1 . (x, f, a) dJL,
e
where JL = JL(p, 2) and x(f) = dJL(p, 1)/dJL(p, 2)(~)' By the above for JL-almost all ~o and for R large we only need consider the integrals for d(~, fo) < c. Now consider a set A of ~ of positive JL-measure where X(~) is approximately a. For x = (R, ~o), (x, g, a) only depends on d(g, go), as indicated above. Moreover, (x,~, a) only varies up to a constant near 1 in ratio on annuli of a definite shape around ~o (again, from the above). For each ~o in a subset Be A of full JL-measure we can choose e so that if we divide the s-disk about ~o into concentric annuli of (relative) constancy for (x,~, a) (x = (R, ~o), R > R(e» each of these annuli will be mostly filled (relative to JL) by points of A, and the JL-integral of X on each is approximately a. This follows from Lebesgue density and differentiation. Then we see that I(X) and ix) are sume of terms in approximate ratio a which is approximately X(~o)' These sets A fill up JL. This proves Theorem (2.14). Now we turn to the proof of Theorem (2.15). Let A be a set of positive JL-measure so that (R, 0 ~ e(u+E)R for e > 0 and R > R(~, c). Fixing e we can make R (~, c) independent of ~ by reducing A a little to B. Write 8 = (J + e and r = e- R • Referring to the decomposition of the integral for (x) above, we deduce that the first term is ~e~R. Thus JL(~o, r) ~ ra-~ for any ~o in B. For any covering of B by balls of radius rj centered at ~j in B we have
D. Sullivan / Related Aspects of Positivity
0< J.L (B) :s;;
777
'Z J.L (gi' r;):S;; 'Z rf-8 . i
Thus the Hausdorff (a - o)-measure of B is positive. So the Hausdorff dimension of A::J B is ~a - 0 = a - U - E for every E > O. This proves Theorem (2.15).
11. Proof of Theorems (2.17), (2.19), (2.21), (2.23) and (2.24) If M = Hd+!/r, then p~(x, y) is just LyerP,(xO, rye) where x", yO lie in H d + 1 over x,y. Thus g~(x,y)= L ye r g,\ (XO, rYe). So if Xo is not on the r orbit of yO, then g~ (x, y) has the order of the Poincare series L r exp(-ad(xO, rye)) by Theorem (2.16), a = ~d + (A + id 2Y/2. Thus g~(x,y)
778
D. Sullivan / Related Aspects of Positivity
Acknowledgments During the work on this paper I have benefitted from remarks, questions, and discussions with Phil Trauber, Mary Rees, Jon Aaronson, Henry McKean, Ragu Varadhan, Peter Lax, Dan Stroock, Dick Holley, Robert Brooks and Bert Konstant, in chronological order.
References [1] A. Akaza, Local properties of the singular sets of some Kleinian groups, Tohoku Math. J. 25 (1973) 1-22. [2] A.F. Beardon, The Hausdorff dimension of singular sets of property discontinuous groups, Amer. 1. Math. 88 (1966) 722-736. [3] A.F. Beardon, Section on Heeke Groups, Proc. London Math. Soc. 18 (1968) 491-483. [4] Robert Brooks, The fundamental group and spectrum of the Laplacian, Comment. Math. Helv., to appear. (5) J. Cheeger, A lower bound for the lowest eigenvalues of the Laplacian, In: Problems in Analysis, ed. Gunning (Princeton Univ. Press, Princeton, N.Y., 1970) 195-199. [6) S.Y. Cheng and S.T. Yau, Differential Equations on Riemannian Manifolds and their Geometric Applications, Comm. Pure. Appl, Math. (1975). [7] Jozef Dodzuik, Maximal principle for parabolic inequalities and the heat flow on open Riemannian manifolds, to appear. [8) J. Elstrodt, Die Resolvente zum Eigenwert problem der automorphen Formen in der hyperbolische Ebene I, Math. Ann 203 (1975) 295-330. II Math. Z. 132 (1973) 99-134, III Math. Ann. 208 (1974) 99-132. (9) H. Federer, Geometric Measure Theory, Ergebn. Math. Grenzgeb. (Springer, Berlin, 1969). [10] Fiseher-Colbrie and Schoen, Comm. Pure App!. Math. 33 (1980) 199-211. [Il] M. Fukashima and Dan Stroock, Reversibility of solutions to Martingale problems, Supp!. Vo!. Advances in Math. on the occasion of M. Kac's 65'h birthday, p. 2. [12] M. Gaffney, The harmonic operator for exterior differential forms, Proc. Nat. Acad. Sci. U.S.A. 37 (1951) 48-50. [13] M. Gaffney, A special Stokes theorem for complete Riemannian manifolds, Ann. Math. 60 (1954) 140-145. [14] S. Helgason, A duality for symmetric spaces with applications to group representations, Adv. in Math. (1970) 1-154. [15] F.I. Karpelevich, The geometry of geodesics and the eigenfunctions of the Laplacian on symmetric spaces, Trans. Moscow Math. Soc. 14 (1965) 48-185. [16] T. Lyons and D. Sullivan, Function theory, random paths and Covering spaces, IHES preprint (1983). [17] H. McKean, An upper bound for the spectrum of J on a manifold of negative curvature, 1. Differential Geom. 4 (1970) 359-366. [18] Malliavin, Diffusions et Geometrie Differentiable Globale, Varenna Aout 1975 Centro Internat. Math. Estivo. [19] Moss and Piepenbrink, Pacific 1. Math. 75 (1978) 219-226. [20] R, Phillips and P. Sarnak, to appear.
D. Sullivan / Related Aspects of Positivity [21] [22]
[23] [24]
[25]
[26] [27]
779
Mark Pinsky, Large deviations for diffusion processes, Stochastic Analysis (Academic Press, New York, 1978). Dan Ray and I.M. Singer, R-torsion and the Laplacian on Riemannian manifolds, Adv. in Math. 7 (1971) 145-210. David Ruelle, Hausdorff dimension of analytic repellors, J. Ergodic Th. Dynamical Systems (1982). D. Sullivan, Growth of Positive Harmonic Functions and Kleinian Group Limit Sets of Zero Planar Measure and Hausdorff Dimension Two, In: Geometry Symposium, Utrecht, Lecture Notes in Math. 894 (Springer, Berlin) 127-144. Dennis Sullivan, The density at infinity of a discrete group of hyperbolic motions, I.H.E.S. publ, 50 (1979) 171-209. Dennis Sullivan, Entropy, Hausdorff measures old and new, and limit sets of geometrically finite Kleinian groups, Acta Math. (1985). Dennis Sullivan, The ergodic theory at infinity of an arbitrary discrete group of hyperbolic motions, Ann. of Math. Stud. 97 (1981) 465-496.
J.A. BARROSO editor, Aspects of Mathematics and its Applications © Elsevier Science Publishers B.V. (1986)
781
APPROXIMATION AND REPRESENTATION OF SOLUTIONS IN LOCALLY INTEGRABLE STRUCTURES WITH BOUNDARY Francois TREVES Department of Mathematics, Rutgers University, New Brunswick, NJ 08903, U.S.A. Dedicated to Leopoldo Nachbin
o.
Introduction
The present work extends to the main results of [1] and [4] to smooth manifolds with boundary. We define a formally integrable structure on such a manifold (called j{): Such a structure is the datum of a vector sub bundle 'V of the complex tangent bundle" C:7j{ closed under commutation bracket and further submitted to the condition that its intersection with the complex tangent bundle of the boundary oAt, C:7 oAt, is a vector bundle over oAt. This is equivalent to the datum of a vector subbundle T' of the complex cotangent bundle C:7* At, closed in the sense of E. Cartan, and whose pull-back to the boundary is a vector bundle over the latter. As the codimension of oAt is equal to one there are only two possibilities: either the boundary is noncharacteristic, i.e. the bundle 'Vla.JI is transverse to it, or else the boundary is totally characteristic, i.e. Y IO.JI C C:7At. The structure is said to be locally integrable if, locally, T' is generated by the differentials of smooth functions. (This represents a change of terminology from [4], where a formally integrable structure was called an 'almost RC-structure', and a locally integrable one, an 'RC-structure'.) Any eel function h whose differential dh is a continuous section of T'is called a 'classical solution'. This is equivalent to saying .that, given any continuous section L of 'V, we have Lh = O. In Section 3 we show that, when the structure is locally integrable, any classical solution is, locally, the uniform limit of a sequence of polynomials in m 'basic' solutions z', ... ,zm (m is equal to the complex dimension of the fibres of T'). This is implied by the approximation formula (3.6), which has other consequences-for instance the unique continuation result stated as Corollary (3.4).
782
F. Treoes / Approximation and Representation of Solutions
Sections 4 and 5 are devoted to the definition of distribution solutions. Such a definition is not as natural, in manifolds with boundary, as it is when there is no boundary. Indeed the action of the vector fields that are sections of or may result in distributions that are carried by the boundary. And as a matter of fact we adopt a different definition in the case of a totally characteristic boundary, and in that of a noncharacteristic one. In the noncharacteristic case, we require that the distribution be a solution in the interior and that it be smooth transversally to the boundary (Definitions (4.1), (4.2»; in the totally characteristic case, only that its coboundary be a (distribution) section of T' (Definition (5.2». We show that these definitions are natural, in particular that classical solutions are distribution solutions. In Section 6 we establish the approximation and representation results that one expects (each is a consequence of a single formula-and the formulas in question are the same whether the boundary is noncharacteristic or totally characteristic). The purpose of the last section is to show that applying what precedes to smooth domains in complex space (or in complex manifolds) yields what one expects: the distribution solutions are those distributions which are holomorphic in the interior and grow slowly at the boundary; they are equal to finite sums of (holomorphic) derivatives of continuous functions in the closure which are holomorphic in the interior; locally they are limits of holomorphic polynomials p(z) (in a complex manifold the zj,s are local complex coordinates).
1. Formally Integrable Structures with Boundary
err
manifold with boundary Throughout this section .if will denote a tuc, Ai will denote its interior. Thus .if = Ai U e«. We shall always assume that .if is Hausdorff and countable at infinity. We write, provisionally, N = dim.if; aAi is a smooth manifold without boundary, countable at infinity, of dimension N - l. We denote by ;y.if and ;y*.if the tangent and cotangent bundles over .if respectively. They are vector bundles of fibre dimension N over the topological space .it (including its boundary!). They are smooth vector bundles, i.e. they are C(J~ manifolds with boundary. By C;y.it and C;y*.it we denote their respective complexifications. (When there is no bar on top of Ai, each of these bundles has the standard meaning, for the manifold without boundary Ai.)
F. Treves / Approximation and Representation of Solutions
783
The tangent bundle of the boundary, ;y JAt, will be regarded as a hyperplane subbundle of ;yjfla.«' Its orthogonal in ;y* jflaAl' for the duality between tangent and cotangent vectors, is the conormal bundle of the boundary, N* JAt. The latter is a (real) line bundle over JAt. Similarly, with C prefixes, when the scalar field is extended to the complex numbers. Definition (1.1). A formally integrable structure on the manifold with boundary jf is the datum of a smooth vector subbundle V of C;Yjf having the two following properties:
(1.1)
[V, V] C V, i.e. the commutation bracket of two smooth sections of V over one and the same open subset n of .it is also a smooth section of V over n;
(1.2)
(VlaAl) + C;Y JAt is a complex vector bundle over JAt.
(All sums of vector bundles are Whitney sums, i.e. they are vector sums of the fibres at each and the same point.) The restriction of the bundle V to the interior At defines a formally integrable structure on At (regarded as a manifold without boundary). Condition (1.2) is equivalent to the fact that
(1.3)
VaAl
=
(VIa.«) n C ;YaAt is a vector subbundle of C s uc.
as one checks by computing the fibre dimensions. Henceforth V shall denote a formally integrable structure on the manifold with boundary .ii. Proposition (1.2). One of the following two (mutually exclusive) properties holds true:
(1.4) (1.5) Proof. Indeed, C;Y aAt is a hyperplane subbundle of C;YAt over aAt. If (1.5) does not hold the fibre dimension of (V Ie« ) + C;Y aAt is equal to dim At. 0
784
F. Treues / Approximation and Representation of Solutions
We shall always denote by n by the fibre dimension (over C) of 'Y. If (1.4) holds the fibre dimension of 'Ya.ll is equal to n - 1; if (1.5) holds, it is equal to n. Let T' denote the subbundle of C f/*.ii which is the orthogonal of 'Y. It is closed in the sense of E. Cartan: If 4J is any smooth section of T' in some open subset n of .ii, its exterior derivative d4J is a section of the ideal generated by T' in the (complex) exterior algebra /\Cf/*.JJ. Locally, it means that J
d4J
=
L
0/ /I f3j
,
j=l
where the a j are smooth sections of T', and the f3j are smooth one-forms. The fibre dimension of T' will always be denoted by m. Of course we have m + n = dim.Jt (= N) . By duality (1.2) is equivalent to the following property: (1.6)
(T'la.Il) + CN* a.Jt is a complex vector bundle over a.Jt.
On the other hand, (1.4) is equivalent to (1.7)
T' n CN* a.JJ
=
0,
whereas (1.5) is equivalent to (1.8)
iCN* a.Jt cT'.
That either (1.7) or (1.8) holds true is evident if we note that the fibre dimension (over C) of (T'I a.Il) n CN* a.Jt is equal either to 0 or to 1. As in [4] ~e denote by TO the intersection of T' with the real cotangent bundle f/*.Jt. It is the characteristic set of 'Y, i.e. the set of common zeros in f/*.ii of all the sections of 'Y. Note then that (1.7) is equivalent to (1.9)
TO n N* a.JJ
whereas (1.8) is equivalent to
=
0,
F. Treves I Approximation and Representation of Solutions
(1.10)
785
N* aAt C TO.
In accordance with standard PDE-terminology, we introduce Definition (1.3). We shall say that the boundary aAt is noncharacteristic if the equivalent properties (1.4), (1.7), (1.9) are true and that aAt is totally characteristic if the equivalent properties (1.5), (1.8), (1.10) are true. Observe now that the natural map (1.11) (whose kernel is CN* aAt) maps T'la.Jl onto a vector subbundle of Cg* aAt, which will be denoted by T~.Jl (not to be confused with T'1a.Jl!) and referred to as the formally integrable structure induced on the boundary aAl by that of .ii. Clearly, r ail and T~.Jl are the orthogonal of one another for the duality between tangent and cotangent vectors to aAt. The boundary aAl is noncharacteristic if and only if the natural map (1.11) induces the isomorphism (1.12)
T~""'T'la.Jl·
The boundary aAl is totally characteristic if and only if the natural map (1.11) induces the isomorphism (1.13)
T~ = (T'I ail )/CN* At .
Indeed, (1.12) is equivalent to (1.7), while (1.13) is equivalent to (1.8). When the boundary is noncharacteristic the fibre dimension of T~.Jl is also equal to m, whereas when the boundary is totally characteristic it is equal to m - 1. Example (1.4). Take T' totally characteristic.
= C g*.ii, r = O.
Then the boundary aAt is
Example (1.5). Take T' = 0, 'V = C g.ii. Then aAl is non characteristic. Example (1.6). Let Al be an open subset of a complex manifold n; dime n = m. Suppose aAt is smooth, and At lies on one side of it. For
786
F. Treoes / Approximation and Representation of Solutions
each point p E.if take T~ to be the span of the differentials of the functions which are defined and holomorphic in some neighbourhood of p. Call f~ the complex conjugate of T~; as p ranges over .ii, T~ and f~ make up two vector bundles, T' and f', such that Cff*.if = T'$ 'i: Since T' n t: = 0 the conormal bundle of aJU cannot be contained in T', and the boundary is non characteristic. Example (1.7). Let x, y denote the coordinates in the plane, and take .if to be the closed upper half plane y ~ O. Take T' to be the vector bundle over.if spanned by the one-form dy + iy dx. The pull-back of this form to the boundary aJU, i.e. to the x-axis y = 0, vanishes identically, and therefore aJU is totally characteristic. It is perhaps worthwhile to give an example, related to Example (1.7), of 'something' which is not a formally integrable structure: Example (1.8). Take.if to be the upper half plane as in Example (1.7) but let now T' be the line bundle spanned by the differential dy + ix dx. The pull-back of T' to aJU vanishes at x = 0 but not when x ~ 0, and therefore is not a vector bundle over aJU.
2. Locally Integrable Structures with Boundary We continue to deal with the concepts and use the notation of Section 1. Definition (2.1). We say that the formally integrable structure T' on .if is locally integrable if every point of .if has an open neighbourhood in which T' is spanned by exact differentials. The structures in Examples (1.4) to (1.8) are all locally integrable. (Concerning Example (1.7) see Example (2.3) below.) Example (2.2). Assume.if to be an analytic manifold with boundary. The definition of such a manifold is identical to the definition in the cgoo category, save for the substitution of 'real-analytic' for 'cgoo,. Assume also that we are given an analytic formally integrable structure 'V on .if, i.e. a real-analytic vector subbundle 'V of C ffJU satisfying (1.2) and (1.3). We write T' = 'V-'-.
F. Treves / Approximation and Representation of Solutions
787
Let (V, SI, .•. , SN) be an analytic coordinate chart centered at a point of the boundary 8Al, and such that SN = 0 is an equation of the boundary in U. Use the coordinates s' to identify V to a subset of Euclidean space R N • Possibly after contracting V, we may assume that it is the upper half, defined by SN ~ 0, of the open ball Bp(O) = {s E IR Nlisl < p}. Also, suppose that the central point is the origin. Let L i (j = 1, ... , n) be analytic vector fields spanning "If over U. Their coefficients, in the basis 8/8s\ ... , a/8s N of C:Y.ii over V, are analytic functions in V, and their Taylor expansions converge uniformly in a full neighbourhood of the origin in Bp(O). We can therefore extend them, in a unique manner, as analytic vector fields to such a neighbourhood-which we take to be Bp(O). Their brackets [Li , t., J are sections of "If over V. By analytic continuation this must also be true in Bp(O) (we could have chosen them in such a way that these brackets vanish identically). This provides an extension of T' = v- to Bp(O), which we also denote by T' and which is an analytic formally integrable structure on Bp(O). But it is well known that such a structure is spanned locally by the differentials of analytic functions. Once again after contracting Bp(O) about 0 if need be, let Z i (j = 1, ... , m) be m analytic functions in Bp (0) whose differentials span T' over Bp(O) and let z' denote the restriction of Zi to V. It is obvious that dz 1, . . . , dz m span T' over U. This goes to show that every analytic formally integrable structure, on an analytic manifold with boundary, is locally integrable. In the present section we shall be interested in local representations of the structure in an open neighbourhood V of an arbitrary point of the boundary 8Al. At interior points we have already local representations, those exhibited in [4, Ch. I, Sect. 2J. Much of the argument in the present section will parallel the argument of the latter text, with the modifications entailed by the presence of the boundary. The neighbourhood V is chosen small enough that there is a non-negative function r E 96"'( V), with dr ~ 0 in V, such that (2.1)
V
n 8Al = {p
E VI rep)
= O}.
Thanks to the local integrability of T', possibly after contracting V, we can find m smooth functions Zi (j = 1, ... , m) in V such that az', ... ,dZm form a basis of T' over V n .ii. In V, the property that the boundary is non characteristic is equivalent to the property that
F. Treves / Approximation and Representation of Solutions
788
(2.2)
dZ I ,
... ,
un e«,
d.Z'", dr are C-linearly independent at every point of
whereas the property that the boundary is totally characteristic translates into the property that, (2.3)
over un a.Jf, dr is a linear combination of dZ I ,
... ,
d.Z" .
We may, and shall, assume that U is the domain of local coordinates (x 1, •.. , x", t', ... , r"), such that the differentials dr', ... , dr",
az', ... , dZ m
span the whole cotangent space C fJ*.it at each point of U. We shall assume that all the above coordinates and all the functions zj vanish at a point of U, referred to from now on as 'the origin'. But our choice of these coordinates will vary, depending on whether the boundary is noncharacteristic or is totally characteristic: When the boundary is noncharacteristic, i.e. when (2.2) holds, we may, and shall, take t" = r. When the boundary is totally characteristic, i.e, when (2.3) holds, we shall take x" = r. We shall also make use of the m + n vector fields in U, M;, L j (i = 1, ... , m, j = 1, ... , n), defined by the conditions i MhZ = 8~,
(2.4) LZ J
i
= 0'
Ll = 87
(h, i
Mit
j
= 0,
= 1, ... , m; j,
k = 1, ... , n).
It follows at once from (2.4) that all the vector fields M;, L j commute pairwise. Let Z, denote the Jacobian matrix of the Zh,S with respect to the Xi,S, and JL ~ the generic entry of its inverse, Z;I. We have (2.5)
(2.6)
Mh = Lj =
m.
a ax
L JL~(t, x)-;, i=1
am. a + L Ai(t, x)-;, at ;=1 ax
-j
where the coefficients AJ are easily determined.
F. Treves / Approximation and Representation of Solutions
789
When the boundary is noncharacteristic the vector fields (i
are all tangent to
un aAt,
= 1, ... , m; j = 1, ... , n - 1),
and so are therefore
whereas
am. a ax r j=1
L n =-+ A a '" L.Jn
(2.7)
I
-
. l
is transverse to aAt. When the boundary is totally characteristic it follows from (2.3) that (2.8)
the pull-back of the m-form dZ I vanishes identically.
II . . . II
d.Z"
to
un aAt
By (1.5), which is equivalent to (2.3), we know that all the vector fields aAt in un e« i.e.
Lj are tangent to (2.9)
a
L j = -j +
at
m-I.
L
;=1
a
a
ax
ar
A;(t, x)-; + rYj(t, x)-,
recalling that x = (x', ... , x m - l , r). Example (2.3). This is the same as Example (1.7) but we note here that the structure T' over the upper half plane y ~ 0 is spanned by dZ with Z = y exp(ix). The vector bundle "V = T Ll is spanned by the vector field (2.10)
a ax
a ay
L=--iy-.
Define the vector field (2.11)
M =
exp(-ix)~. ay
790
F. Treves I Approximation and Representation of Solutions
We have (2.12)
LZ = 0, Lx = 1, MZ = 1, Mx = 0,
and [L, M] = O. As pointed out in Section 1, in this example the boundary is totally characteristic.
3. Approximation of Classical Solutions in Locally Integrable Structures Let T' be a locally integrable structure on the manifold with boundary
.ii. By a classical solution in an open subset n of.ii we mean a cgl function h in n such that Lh = 0 whatever the continuous section L of 'Y over n. We shall reason in an open neighbourhood U of the origin (which lies on a.Ji), exactly like the one in Section 2. In U we are given local coordinates x', ... , x", t l , ••. , t", one of which is the function r whose vanishing defines the boundary in U, and the 'basic functions' ZI, ... , Z", whose differentials span T' over U. They all vanish at the origin. We shall write
It is convenient to take
U=
(3.1)
v x W,
with V (resp. W) selected as follows: When the boundary a.J{ is noncharacteristic, V is a half ball, V = {t ERn Iitl < p, t n ~ O}, while W is an open ball in x-space IR ". When a.Ji is totally characteristic, V is an open ball in t-space IR n, while W = {x E IR mllxl < p', x" = r ~ O} is a half ball. With the notation (2.5}-(2.6), given any function f E cgl(U) we have m
(3.2)
m
df= L~fdZj+ LL/dt i . i~1
i~1
And to say that hE cgl(U) is a solution is equivalent to saying that
F. Treves / Approximation and Representation of Solutions
(3.3)
dZ = dZ I
d(h dZ) = 0,
II .•. II
791
dZ" .
We define the functions in V x V (d. [4, Ch. II, Sect. 1]) (3.4)
(3.5)
ET(t, x; s, y) =
(:r
/2
[de t ZAO, OWl exp(-TQ(Z(t, x) - Z(s, y))),
for T>O, and (t,x), (s,y)E U. Theorem (3.1). There is an open neighbourhood V o = V o x W o C V of the origin and a function g E ee;(W), identically equal to one in Wo' such that an arbitrary solution hE ee 1(V) is given by the following uniform limit in
u;
(3.6)
h(t,x)=
~~J
ET(t,x;O,y)g(y)h(O,y)dZ(O,y).
W
°
Proof. Denote by l(to) the straight line segment in W joining (oriented from to to)' Note that (3.7)
a[l(to) x W]
=
°
to to
l(to) x (aw) + {to} x W - {OJ X w.
Since g is compactly supported in W, if we regard g as a function (independent of t) in V = V x lv, supp g intersects V x aw only along the subset on which r = (keep in mind that the analysis is taking place in a manifold with boundary!). But when a.JU is noncharacteristic, r = t" and W is an open ball in R ". Therefore supp g intersects a[l(to) x W] only along the two faces {to} x Wand {OJ x W And when a.JU is totally characteristic, (2.8) implies that the pull-back of g dZ to V x aW vanishes identically. Thus, in both cases,
°
(3.8)
J o(/lto)xW)
gh dZ =
J {/O}XW
gh dZ -
J {O}xW
gh dZ.
792
F. Treves / Approximation and Representation of Solutions
But by Stokes' theorem and by (3.3) the left-hand side in (3.8) is equal to (3.9)
J
h dg AdZ.
I(IQ)XW
From there on the proof is completed as in the case of a manifold without boundary ([4, Ch. II, Sect. 1]): we replace h(t, x) by ET(to, xo; t, x)h(t, x), which is obviously also a cgl solution. The variation of to is restricted to an open neighbourhood of the origin, V o C V. Then, if the diameter of W o is small enough, the first integral in the right-hand side of (3.8) converges uniformly to gh in Vo= V o x W o as T--? +00. If furthermore the diameter of Vo is small enough compared to that of W o, in which we take g = 1, the integral (3.9) converges uniformly to zero as T --? +00. It suffices then to notice that the second term in the right-hand side of (3.8) is equal to the integral under the limit sign in (3.6), whence the result. 0 Corollary (3.2). Every solution hE cgl(V) is the uniform limit in V o of a sequence of polynomials with complex coefficients in ZI, ... ,Zm. Corollary (3.3). There is a compact neighbourhood K C V of the origin with the property that to every solution h E cgl( V) there is a continuous function h" on Z(K) such that h = hOoZ in K. Corollary (3.4). If the trace on {O} x Wof the solution h E cgl(V) vanishes identically then h = 0 in V o' These corollaries embody respectively the approximation by polynomials in Z, the constancy on the fibres of the map Z and the uniqueness of the classical solutions (ct. [4, Ch. II, Sect. 2]).
4. Distribution Solutions when the Boundary is Noncharacteristic We continue to deal with the cgoo manifold with boundary ll, equipped with the locally integrable structure T. As before the fibre dimension of T is equal to m; it is convenient to write dim II = N = m + n. Our main purpose, in the present section and the next one, is to clarify what we mean by a 'distribution solution'. There is no uncertainty about
F. Treves / Approximation and Representation of Solutions
793
what it means at interior points: Let n be an open subset of .it. A section distribution h in the interior {} n At is a solution if, given any L of rover n At, we have Lh = 0 in that open set. We begin by recalling some definitions and terminology pertaining to currents on the manifold with boundary .it [3, Ch. IX]. The same terminology and notation will be used with .it replaced by anyone of its open subsets, n. Indeed, note that .it induces on n a structure of manifold with boundary an = {} n aAt. We denote by (q.it; A P) the space of smooth p-forms in .it equipped with the standard 'f1'" topology, and by 'f1;(.it; A P) the subspace of those forms that have compact support. The latter will often be referred to as test-forms. Keep in mind that their pull-back to the boundary aAt of .it does not necessarily vanish. For any compact subset K of.it we denote by 'f1;(K; A P) the space of the smooth p-forms whose support is contained in K. This space is equipped with the 'f1'" topology, i.e. the topology induced by 'f1"'(.it; A P). By definition a current of degree p (what we shall also call a p-current) in the manifold with boundary .it is a linear functional on 'f1;(.it; A N-P), whose restriction to 'f1;(K; A N-p) is continuous for every compact set K c.it. A distribution on .it is a current of degree zero on .it. The space of p-currents on .it will be denoted by 9lJ'(.it; A P), that of distributions by 9lJ'(.it). Whenever currents or distributions have compact support '1::' will be substituted for 9lJ'. Convergence of p-currents in .it is defined in standard fashion: a sequence of such currents will converge if its evaluation at an arbitrary test (N - p)-form converges (in spaces of compactly supported currents it will also be required that the supports remain inside a fixed compact set). In the present section we only deal with the case where the boundary is non characteristic. In general, a distribution on .it might not have any trace on the boundary aAt. In order to define the distribution solutions when the boundary is noncharacteristic we introduce a class of distributions in .it whose trace on aAt is well defined. We shall say that a local chart (V, 51, .•. , SN) is adapted to the boundary if SN = 0 is an equation of V n aAt in V, and if V = V' x [0, e), where Viis an open set in the space of the variable 5' = (5\ ... , SN-l) and e > O. Then V n aAt = V' x {O}.
n
err
Definition (4.1). We say that a distribution u in an open subset n of.it is smooth transversally to the boundary if every point of n n aAt is contained
794
F. Treves / Approximation and Representation of Solutions
in the domain V = V' X [0, e) C n of a local chart (V, s\ ... , SN) adapted to the boundary such that u is a C(5oo function of SN valued in the space of distributions of s' in V'. It is not difficult to show that the property that u is a smooth function of SN valued in the space of distributions of S', is independent of the choice of the defining function SN and of the 'tangential' coordinates s', j = 1, ... , N - 1. An equivalent (and more obviously invariant) definition, but which we shall not use here, is that the distribution is smooth transversally to the boundary if its wave-front set does not intersect the conormal bundle of avU [2, Cor. 8.2.7].
Definition (4.2). Assume that the boundary avU of 11 is noncharacteristic and let n be an open subset of 11. We shall say that a distribution h in n is a distribution solution if h is smooth transversally to the boundary (Definition (4.1)) and if the restriction of h to the interior n n vU of n is a distribution solution there.
Part of the justification for Definition (4.2) lies in the following result: Proposition (4.3). Assume that the boundary avU is noncharacteristic. Let n
be an open subset of 11 and let h E C(5o(n) be a distribution solution in the interior n n vU. Then h is a distribution solution in n.
Proof. We must prove that h is smooth transversally to the boundary. Let (V, s', .... , SN) be a local chart in n adapted to the boundary. After contracting V we may assume (cf. (2.7)) that h satisfies, in V n vU, an equation with smooth coefficients, (4.1) By integration with respect to
(4.2)
we derive
SN
h(s)= h(s',O)-
L
N-I
k=1
Suppose h is a C(5Q function of
J(Ak-k)(Sl,lT)dlT. ah °
SN
as
in [0, e) valued
In
the space of
F. Treves / Approximation and Representation of Solutions
795
distributions of s' in V' for some q ~ O. By hypothesis this is true when q = O. The right-hand side in (4.1) is also a 'l5 q function of SN E [0, s ) valued in £P'(V'). From (4.2) we conclude that h is a 'l5 Q+ 1 function of SN valued in £P'(V'), whence the desired result. 0
5. Distribution Solutions when the Boundary is Totally Characteristic
We recall the notion of coboundary of a p-current u on the manifold with boundary .it. The exterior derivative acting on smooth p-forms in .it is the linear operator
such that, given any smooth p-form X on .it, the restnctIon (i.e. the pull-back) of dx to the interior At is equal to the exterior derivative of the restriction of X. The exterior derivative is a differential operator, and therefore induces a (continuous) linear map, also denoted by d, from compactly supported and smooth p-forms into compactly supported and smooth (p + I)-forms. Consider then the map (5.1)
By definition, the transpose of (_l)p-l d is the coboundary operator acting on p-currents: (5.2)
We adopt the same notation when At is replaced by one of its open subsets Il. When n does not intersect the boundary the coboundary bu of a distribution u in n is the same as what can be called the exterior derivative of u, and denoted by du (and which is an extension of the same operator on smooth forms). In the sequel, most of the time we continue to use the latter notation when no boundary is present. When there is a boundary, in general the coboundary of a smooth p-form regarded as a p-current is not equal to its exterior derivative, as shown in the following
796
F. Treues / Approximation and Representation of Solutions
Example (5.1). Let
(5.3)
ii =
IR +' the half line t ;::. O. Note that the restriction map CflOO(1R i , A I)~ CflOO(1R . A I) c'
c
+'
is a surjection, therefore its transpose, 0) (IR J
(5.4)
I
~
0)'(1R I) ,
is an injection. If u is a distribution in ii its image under (5.4) is the same distribution, regarded as a distribution in the whole real line, equal to zero for t < O. The image of the map (5.4) is exactly equal to the space of distributions in the real line whose support is contained in IR +. Let now h be a Cfll function in IR+ which we regard as a distribution on ii.1f X E Cfl;(IR+; A I), we have, by definition, +00
(5.5)
+00
Jx b h Jh dX· =
o
o
An integration by parts shows that (5.6)
bh = dh
+ h(O)8(t) dt,
where dh is the exterior derivative, which is a continuous one-form on IR+, and 8(t) the Dirac distribution at the origin. We define now the distribution solutions in the case where the boundary is totally characteristic:
Definition (5.2). Suppose that the boundary a.Jl is totally characteristic and let [l be an open subset of ii. A distribution u in n will be called a distribution solution if bu is a section of T' over n. That bu is a distribution section of T' over n means that, in the neighbourhood of each point of n, there are In Cfloo sections of T', which we may take to be exact differentials dZ I , . . . , d.Z", and m distributions 5" ... , 5m such that, near that point,
F. Treves / Approximation and Representation of Solutions
797
m
(5.7)
bu =
L s, uz-. j=!
n
If does not intersect the boundary B.Jt we have bu = du, as noted above. Thus, in this case, Definition (5.1) agrees with the definitions of [4] (called there RC-distributions). Next we discuss a simple example, which may serve as a model for the case of a totally characteristic boundary:
Example (5.3). Call (t, x) the coordinates in ~ 2 and take it to be the closed half space x ~ O. Take T' to be the vector bundle spanned by dZ where Z = x e", Except for the coordinates notation this is the same structure as that of Examples (1.7) and (2.2). Consider the following distributions:
if k
(5.8)
=
1,2, ... , E,
exp(-ikt)0 8(k-1)(x) ,
=
where 1 is the function in it equal to one everywhere, and 8(k)(X) is the k-th derivative of the Dirac distribution 8(x). We claim that
k
(5.9)
=
0,1, ....
Indeed, an integration by parts shows that
But X8(k)(X)
= -k8(k-I)(X),
Since dx + ix dt
=
hence
e- it dZ, this proves (5.9).
Example (5.1) shows why Definition (5.2) is not acceptable when the boundary B.Jt is noncharacteristic: Example (5.4). Take it as in Example (5.1), and equip it with the locally integrable structure T' = o. We would like all the constant functions to be solutions, and they are according to Definition (4.2). But observe that if 1(t) denotes the function identically equal to one in it, then, according to
798
F. Treves / Approximation and Representation of Solutions
(5.6), bl = (j(t) dt, which is not the zero distribution. This shows that there is only one c:gl function whose coboundary is a section of T ', and that is the function identically equal to zero. Let n be an open subset of .it and let g be a continuous function on the boundary part of n,n n a.J,f" Then g defines a current of degree one in n, gu, whose support is contained in n n a.J,f,. It is the one-current which to any form X E c:g;(n; A N-I) assigns the value
f
gx*,
ara«
where X * stands for the pull-back of X to the boundary. Lemma (5.5). LetfE c:gl(n). Then
bf =
(5.10)
df - Ulnna.Jl)v,
where df is the one-current defined by the differential of f (which is a continuous one-form).
Proof. Let us use a local chart (U, Sl, .•. , SN) in n, adapted to the boundary (d. Definition (4.1)) and centered at an arbitrary point of o n ««. Let X E c:g;(U; AN-I) be arbitrary. We may write (5.11)
X = a
II
ds N + f3 ds
I
where a is a smooth compactly supported (N - 2)-form which involves only the ds' for j
where d'o is the exterior derivative of ll' with respect to the variable s' = (s ', ... , SN-I) alone, and ds = ds ' II ds N. Thus,
f
bf
II
X= -
Jf dX f f d'o = -
II
ds
N
+ (_I)N
Jf (:~) ds.
F. Treves / Approximation and Representation of Solutions
799
Integration by parts gives us
Jbf
II
X ==
Jd'f
II
ex
II
ds
N
Jf3
-(-It
C;)
ds -
J
ff3 ds' .
utis«
We note that d'f II ex
II
ds N == df II (ex
II
ds N
),
f3
(:~) ds == (-It- 1 df II
f3 ds' ,
whence (5.13)
Jbf
II
x==
Jdfllx- J
Uluna.ll)X*,
ura«
which is exactly the meaning of (5.10). 0 If we apply (5.10) to the function identically equal to one in li, denoted by 1 as before, we see that
(5.14) Apply (5.13) to
bI ==
f
=
-P.
1:
(5.15) ura«
And (5.10) can be written as (5.16)
bf = df + Ulnna.ll)b 1 .
Lemma (5.5) makes no reference to the locally integrable structure, and in particular does not presuppose that the boundary is totally characteristic. We call now into play the latter hypothesis: Lemma (5.6). Suppose that the boundary is totally characteristic. Then the function 1 is a distribution solution in li.
800
F. Treves / Approximation and Representation of Solutions
Proof. We look at the integral in the right-hand side of (5.15). Note that (5.17)
J X*= J8(sN)ds N "X, uru«
In the right-hand side of (5.17) the integral now stands for the duality bracket between (N - Ij-test forms and one-currents in U. The hypothesis that the boundary is totally characteristic means, according to (1.8), that (5.18) where 'Y, P are smooth one-forms in U, and 'Y is a section of T'. But then, by combining (5.15), (5.17) and (5.18), (5.19) which proves what we wanted. 0 We can now prove, in the case of a totally characteristic boundary, a slightly weaker version of Proposition (4.3): Proposition (5.7). Assume that the boundary is totally characteristic. Let h E r:gl(il) be a solution in the interior n n At. Then h is a distribution solution in il. Proof. To say that h is a solution in the interior il n Al is the same as saying that dh is a section, perforce continuous, of T' over n n Al, and therefore over the whole of il. Lemma (5.6) entails that the same is true of (h!nnu)bl. Proposition (5.7) follows then from (5.16). 0
6. Local Representation and Approximation of Distribution Solutions Same concepts and notation as in Sections 4, 5. We are going to seek representations and approximations results analogous to those valid in a manifold without boundary [4, Ch. II, Sect. 3] but now in the neighbourhood of a point of the boundary.
801
F. Treves / Approximation and Representation of Solutions
We deal with an open neighbourhood U of the kind considered in Sections 2, 3: In U we are given basic functions z'. coordinates x', t\ all vanishing at the central point, which is referred to as 'the origin' (and which lies in a.Ji). We shall also make use of the vector fields M;, Lj defined in (2.4) (and given by (2.5), (2.6)). Here also it is convenient to take U in the product form (3.1). Test-forms of degree N in U will always be of the kind
g dZ /I dt
=
g dZ I /I ... /I d.Z" /I dt l /I ... /I dr"
(g E
ee;(1i)).
Definition (6.1). Let u be a distribution in U. For each i = 1, ... , m, Mfu shall denote the distribution defined by (6.1)
I (M~u)g
For each j the formula: (6.2)
I
=
dZ /I dt =
-I
uM;g dZ /I dt
(g E
ee;( U)) .
1, ... , n, L1u shall denote the distribution in U defined by
(L;u)g dZ /I dt =
-I
uLjg dZ /I dt
(g E
ee;(u)).
In (6.1) and (6.2) the integrals stand for duality brackets between currents (here of degree zero) and test-forms (of degree N). It is clear that u ~ u ~ L1u are continuous linear maps of gy'(U) (resp. <jg'( U)) into itself. As the vector fields M;, L j commute pairwise, it follows from Definition (6.1) that
u;«
(6.3)
[M:, Mf] = [Mf, Lf] = [L1, LtJ = 0 (h, i = 1, ... , m; j, k = 1, ... , n) .
Mr, L;
To get some feeling about what these operators really are we 1 shall let them act on a function f E ee ( U) regarded as a zero-current on U. For i = 1, ... , m; j = 1, ... ,n we introduce the following (N - 1)forms (6.4)
a j = (-ly-1 dZI/I'"
(6.5)
~j =
/12 /I ... /I d.Z"
/I dt,
(-1)M+'-I , dZ /I dt I /I ... /I--,.. dt' /I ... /I dt " ,
802
F. Treves / Approximation and Representation of Solutions
where the hatted factors must be deleted. Then, if we avail ourselves of (3.2), we obtain
(6.6)
M;(fg) dZ
f\
dt = d(fga;),
Lj(fg) dZ f\ dt = d(fg{3) .
If we integrate over U and apply the Stokes theorem we get
(6.7)
-JfM;g dZ
f\
dt =
JgM;f dZ
dt -
f\
J
fgai'
uris«
(6.8)
-JfLjg dZ /\ dt JgL/ dZ =
f\
dt -
J
fg{3j'
ura«
Combined with (6.1), (6.2) Formulas (6.7) and (6.8) show that, at least when acting on zero-currents f E cgl( U), (6.9) where R i , S, are linear operators attached to the vector fields M;, L j , respectively, and to the boundary. Up to sign RJ is the zero-current whose evaluation on a smooth and compactly supported N-form X is obtained as follows: Contract X by M;, an operation which yields a smooth and compactly supported (N - l j-forrn, multiply the latter by f and integrate over un a.Jl. A similar description applies to Sj' If we use the property that every distribution is the limit of a sequence of smooth functions [3, p. 325], we derive at once from (6.9): Proposition (6.2). In the interior
un .Jl of U:
M~ = M;, L1
= Lj •
We also note that any smooth (N - Ij-form 'Y in U can be represented in the following fashion: (6.10)
'Y =
m
n
i=1
j=l
L Aiai + L B j{3j,
where the Ai and Bj are smooth functions in U. By virtue of (3.2) we have
F. Treves / Approximation and Representation of Solutions
(6.11)
dy =
{± MjA + ±LiB i} dZ i
;=1
II
803
dt.
i~l
Assume that y has compact support and evaluate a zero-current u in V at both sides of (6.11). In the notation of (6.1) we get JUdY = - J
{± (M~u)Ai + ±(L7 u)Bi} dZ 1=1
II
dt.
f~1
But vBi dZ. II dt == (v.f dt i ) II Bi{3.f ' f
uj , vi E @'(V). If we recall the definition of the coboundary operator u ~ bu in Section 5 we get the following formula, valid in our local chart, (6.12)
bu
=
m
n
j=1
i~1
2: M~u dZ; + 2: L1u dr ' .
This can be regarded as the formula dual to (3.2) (in a manifold with boundary). Among other things it shows once again that, in the interior V nAt, bu = duo Let us identify V to a subset of IR m+n by means of the coordinates Xi, t': We recall that V is then equal to the product V x W of a ball with a half ball, both centered at the origin. When a.Jl is noncharacteristic,
is a half ball, and W is an open ball. When a.Jl is totally characteristic it is the opposite: V is an open ball, and
is a half ball. In both cases we denote by V' (resp. W O ) the smallest open ball containing V (resp. W) and we write U" = va x Woo For a while it will be convenient to view all functions, distributions, forms, currents and the vector fields M;, Li as defined in subsets of U" C IR m +n. In any case, the restriction mapping
804
F. Treves / Approximation and Representation of Solutions
(6.13)
is surjective, and its transpose is an injection, (6.14) which allows us to identify !]'( U; A P) to the subspace of !]'( UO; A P) consisting of the currents whose support is contained in U. Here we shall comply with this identification. Our next step is to extend each zj as a cgoo function in the whole of UO. Notice that, thus extended, dZI, ... ,dZ m are (::-linearly independent in a neighbourhood of the origin. Possibly after contracting UO about 0 we suppose that the linear independence of those differentials holds in the whole of UO. They span a vector subbundle of C:?1* UO, which we denote by (To)' and call an extension of T' to UO. The orthogonal of (To)' will be denoted by "11'0. There is an automatic extension of the vector fields M;, L j : simply, we require that the orthonormality relations (2.4) continue to hold in UO. Of course then, L I , •• • , L; span "11'0 over UO. When acting on a distribution u with support in UO, we have: (6.15) for all i = 1, ... , m; j = 1, ... ,n. This follows from Proposition (6.2) applied to UO instead of U (applied when the boundary is empty). In passing note that (6.15) cannot really lead to confusion, since the action of the vector fields AI; and Lj on distributions in U is only defined in the interior of U, and there (6.15) is valid. The boundary un a.Jl is a hypersurface in U, defined by r = 0; it is either noncharacteristic or totally characteristic [4, Ch. II, DeL (3.3) and (3.4)]. Let u be a distribution in UO with support contained in U. When the boundary a.Jl is noncharacteristic, the property that u is a distribution solution in U is equivalent to the property that (6.16)
in U, u is a cgoo function of t" valued in the space of distributions of (r', ... , r". x) and u is a distribution solution in the interior of U (for T' or (TO)').
This is merely a repetition of Definition (4.2). But in the case of a totally characteristic boundary we have a more elegant interpretation:
F. Treves / Approximation and Representation of Solutions
805
Proposition (6.3). Assume that the boundary aAf is totally characteristic. Let u be a distribution in VO, with supp u c U. Then the following two
properties are equivalent: (6.17)
u is a distribution solution in V, for the structure T' .
(6.18)
u is a distribution solution in U", for the structure (To)' .
Proof. It suffices to note that, in U", which has no boundary, we have = du, and Definition (5.2) states therefore that, if u is a distribution solution for T'in U, then du is a distribution section of T', which is the same as (6.18). 0
bu
We can now generalize to structures with boundary the representation theorem 3.1 in [4, Ch. II], valid in manifolds without boundary: Consider the quadratic form Q'(() = (Z~(O, O)~i ,
where Z~ is the transpose of the Jacobian matrix Zx' We recall that (Z)2 = z . z. We call Q'(M#) the second-order operator obtained by substituting Mf for (j' i = 1, ... , m, in Q'(n Theorem (6.4). There is an open neighbourhood U o C U of the origin such that, given any distribution solution h in Vand any integer /.L ;;:: 1, there is a C(51" solution h , in Uo and an integer /l;;:: 0 such that, in U o,
(6.19) Proof. (1). The boundary is noncharacteristic. The proof is an adaptation of that of [4, Ch. II, Th. 3.1]. We recall that, here, the boundary is defined by t" = 0, that M I , . . . , M m , L I , ••• , L n _ 1 are tangent to the boundary (and annihilate r"), and that L; has the expression (2.6) (cf. also (2.7». It follows at once from this that, in the present situation,
(6.20)
(i
=
1, ... , m; j = 1, ... , n - 1) .
Write then
Q'(L', M) = Li + ... + L~_l + Q'(M) .
806
F. Treves / Approximation and Representation of Solutions
If U is small enough QI(L', M) is elliptic. Since h is a f(;}'" function of t" valued in the space of distributions of (t', x) (t' = (r', ... ,tn-I», after some further contracting of U we can find an integer v and a function v E f(;}1"(U) such that
(6.21)
h
=
Q'(L', M)"v
in U. By applying L, to both sides in (6.21) we get (6.22) and [4, Ch. II, Lemma 3.1] implies that, after further contraction of U, there are constants C 1 , C, > 0 such that, for each j = 1, ... , n, and every a E17, (6.23) After this observation the proof duplicates that of [4, Ch. II, Th. 3.1). The only difference is that the variation of t" is restricted to non-negative values, and this has no effect whatsoever on the argument. In this manner we reach the conclusion analogous to that of Th. 3.1, loc. cit.: namely, that (6.19) is valid with h, E f(;}1"(U) satisfying Ljh = 0 for every j = 1, ... , n. But then Proposition (4.3) implies that h is a solution. (2). The boundary is totally characteristic. We recall that, in this case, the subset U of UO is defined by the condition x" ~ O. Here, thanks to Proposition (6.2) we may apply directly [4, Ch. II, Th. 3.1), in the extended structure (TO)' over UO. This yields the representation (6.19) in U. But since supp h C U we have (6.24)
We derive [4, Ch. II, Lemma 3.1) that, in UO\U, (6.25) where get, z) is a f(;}l function of (t, z) holomorphic with respect to z in some polydisk J centered at the origin. But Ljh 1 = 0 for all j, which is equivalent to
F. Treves / Approximation and Representation of Solutions
(6.26)
d,go(t, z )/z:Z(',X) =
807
°.
Note however that the image of VO\ V under the map (t, x)~ Z(t, x) contains a totally real submanifold of ,1 of dimension m, and therefore (6.26) implies that go is independent of t. Thus, in VO\V, h 1(t, x) = go(Z(t, x». But note that go(Z(t, x) is a (smooth) solution in the whole of VO. And by virtue of (6.24) and (6.25), giz) satisfies the equation (6.27) on the image of VO\ V under the map Z, therefore in the whole of ,1. But then we have, in VO. (6.28) Since the support of h,> gooZ is contained in V, this implies (6.19), after a redefinition of h.; 0 Remark (6.5). Actually (6.28) is a substantially stronger statement than (6.19). For h t - gooZ is a eel" function in VO, not just in V, and since it vanishes identically in VO\ V this property demands that the traces on the boundary x" = 0, of all the derivatives (aj axm)i (h 1 - go° Z) of order j ~ fJvanish identically.
Theorem (6.4) enables us to extend the approximation Theorem (3.1) to all distribution solutions. Observe that the trace of a distribution solution on the submanifold t = of V is well defined. This is evident in the noncharacteristic case. In the totally characteristic case it follows from Proposition (6.2) and from the fact that the property is true in manifolds without a boundary [4, Ch. I, Lemma 3.2].
°
Theorem (6.6). There is an open neighbourhood V o = V o x W o C V of the origin and a function g E eeOO(W), identically equal to one in Wo, such that an arbitrary distribution solution h in V is given by the following distribution limit in V o:
(6.29)
h(t, x) =
!~~oof BT(t, x; 0, y)g(y)h(O, y)dZ(O, y).
F Treves / Approximation and Representation of Solutions
808
In (6.29) the integral represents a duality bracket between distributions on the manifold lv, such as h (0, y), and test m -Iorms on the same manifold, such as g(y )E (t, x; 0, y) dZ(O, y) (here (I, x) plays the role of a parameter). T
Proof. Let h, be the eel solution in (6.19). By (6.1) we have
JET(t, x; 0, y)g(y)[Q'(M#)"hl](O, y)dZ(O, y) =
J{Q'(Ms,yy[g(y )ET(t, x; s, y)]}ls~oh!(O, y) dZ(O, y),
where the subscripts in Ms,y indicate that the vector fields M; act in (s, y)-space (actually they involve only differentiation with respect to y). Therefore, by Leibniz formula,
JEAt, x; 0, y)g(y)[Q'(M#)"hl](O, y) dZ(O, y) J{Q'(Ms,y)ET(t, s, y nls=og(y)hl(O, y) dZ(O, y) + 2: J{~ET(t, 0, y)}gj(y)hl(O, y) dZ(O, y), j=! x;
=
k
x;
where the ~ are differential operators with smooth coefficients in yspace, and the gj are test-functions whose supports are contained in supp dg. But then, if the variation of (t, x) is restricted to a sufficiently small neighbourhood of the origin, the brackets
J{~ET(t,
x; 0, y)}gj(y)h!(O, y)dZ(O, y)
converge to zero, actually in the ee'" sense, as the reader will readily ascertain. On the other hand,
so that we obtain the following equalities of distribution limits:
F. Treves / Approximation and Representation of Solutions
809
h(t, x) = Q'(M) !~~ooJ E.(t, x; 0, y)g(y)h 1(0, y) dZ(O, y) =
~~ JQ'(~.x)E.(t, x; 0, y)g(y)h 1(0, y) dZ(O, y)
= !~~ooJ E.(t,x;O,y)g(y){Q'(M#)h(}(O,y)dZ(O,y), which is precisely (6.29). 0 Corollaries (3.2), (3.4) extend now to distribution solutions:
Corollary (6.7). Every distribution solution h in U is the distribution limit in Uo of a sequence of polynomials with complex coefficients in Z\ ... , Z", Corollary (6.8). If the trace on {a} x Wof the distribution solution h in U vanishes identically then h = 0 in Uo'
7. Example: Domains in Complex Space We shall apply the results of the preceding sections to an open subset em whose boundary aAi is a ~oo hypersurface. We assume that Ai lies on one side of aAi. The closure of Ai will be our manifold with boundary, .it. Thus dim.it = 2m. The locally integrable structure on .it will be that induced by the complex structure of em. By z ', ... , z " we shall always denote the natural coordinates in em; their differentials at the points of .it span the structure bundle T' c C [!f*.it. The boundary of .it is noncharacteristic. As usual we reason in an open neighbourhood U of a point in .it which we take to be the origin of em. In fact we consider an open neighbourhood UO of 0 in em and take U = U" n.it. We assume that the tangent hyperplane to aAi at 0 is the hyperplane ym = O. Therefore, U will be defined in UO by an inequality
Ai of
(7.1)
r = ym - a(x, y');;;. 0,
where a is a real-valued ~oo function in an open neighbourhood of the origin in IR. m x R. m-(, such that
810
F. Treves I Approximation and Representation 01 Solutions
(7.2)
da(O, 0) = O.
a(O, 0) = 0,
We have used the notation Xi
= Re z i, y i = 1m z i
(j = 1, ... , m) , y ,_ - ( y,1 ... , y m-l) .
Thus, = 0 is a defining equation of the boundary in U. It is convenient to make use of the coordinates (7.3) in U, and to take U in the product form
U = V X W' x [0, '0) ,
(7.4)
where V (resp., W') is an open ball centered at the origin in x-space R m (resp., in y'-space R m-l) and is a number> O. We shall write
'0
(j = 1, ... , m - 1),
(7.5)
Z'" = x m + i[a(x, y') +,] (= z"). Here and below i = by the vector fields
\/=1.
Over U the vector bundle 'Y = T.l is spanned
(7.6)
(k
Lm where
(7.7) (7.8)
a.
=--lll(X
B,
r-,
= 1, ... , m - 1), a
y) Bxm '
F. Treves / Approximation and Representation of Solutions
811
On the other hand,
a
(7.9)
a
(j
M.=-.+fJ.'ax!
J
Bx"
J
M
-
II
r:
m -
1, ... , m - 1) ,
=
a ax m
'
where
. aa
(7.10)
(j=I, ... ,m-1).
fJ.·=-IfJ.-.
ax!
!
We have: I.
(2 1)L k =
a . aa
ai k
(7.11)
- I
Bi k M m
(k = 1, ... , m - 1),
B L m =--iMm ,
a,
a sx'
Ba
M=--i-M i j
ax
(j
m
=
1, . _. , m - 1) .
As expected, M I , . . . , M m , L I , ••• , L m _ 1 are tangential to the boundary, whereas L m is transversal to it. We are now going to define an operator of integration acting on an arbitrary holomorphic function h (z) in the interior un.;(,{. For this we assume that the open balls V and W' in (7.4) are small enough that sup a(x', y) < h.
(7.12)
VXW'
Let then to be a real number such that ~'o < to<
'0- We define
ito
(7.13)
where z'
Jh(z)=-
=
J h(z',()d(,
(z', ... , zm-I), and where the integration is carried out over
812
F. Treves / Approximation and Representation of Solutions
the following path: starting at the purely imaginary point ito we follow the horizontal line to x m + ito, and after we have reached this point, we follow the vertical line to z" = x" + iy". In this we always assume that (x, y') E V x W' and that
a(x', x m , y') < v" < r«.
(7.14)
Obviously Ih is holomorphic in Un.At, and we have the identities:
(7.15) (7.16) Definition (7.1). Let v ~ 0 be an integer and let E(V) = 1 if v = 0, E(v) = 0 if v> O. We shall denote by ;J{.(U) the space of holomorphic functions in Un.At endowed with the following property:
(7.17).
There is a constant C> 0 such that, in Un.At,.
It is clear that the infimum of the constants C> 0 which can be used in (7.17). may be regarded as the natural norm of h in ;J{.(U). This norm turns ;J{.(U) into a Banach space.
Lemma (7.2). Whatever the integer v ~ 0, h ~ Ih is a bounded linear operator ;J{.(U)~ ;J{._l(U), The operator I is also a bounded linear operator from ;J{o(U) into the Banach space of bounded and continuous functions in U which are holomorphic in the interior U n .At. Proof. By the definition of I we have 10
Ih(z)=
Jh(z',~+ito)d~-i Jh(z',x o
m+i1])d1].
ym
We look at the first term in the right-hand side. Observe that, given any (x, y ') E V X W', we cannot possibly have (z ', x" + ito) E U n a.At, for it
F. Treves / Approximation and Representation of Solutions
813
would mean that to = a (x, y'), which is excluded by (7.12) and our choice of to' It follows that the first term, in the above right-hand side, is bounded in V x W' (it does not depend on r). Let us look, therefore, at the second term. Call c1J(r) the right-hand side in the inequality in (7.17)p. For any r" < ro we have to
to
J h(z', x
m
+ i7]) d7]
to-a(x o yO)
I ~ J c1J(7] -
a(x', y)) d7] =
J
c1J(s) ds.
We reach the conclusion by computing the integral at the far right. 0 Proposition (7.3). Suppose that hE :ltp(U) (v;;;. 0). There is a bounded and continuous function h, in U, holomorphic in the interior Un.J,l, such that,
in
u rv,«,
(7.18) Proof. It suffices to take hI = r+lh and combine (7.15) with Lemma (7.2).
o
Corollary (7.4). Let h be as in Proposition (7.3). Then h defines a dis-
tribution solution in U. Proof. Formula (7.18) shows that h does indeed define a distribution in U (regarded as a manifold with boundary). By Proposition (4.3) hI defines a distribution solution in U, and therefore so does h since [Lj , M m ] = 0 for all j = 1, ... , n. 0 Definition (7.5). We shall say that a holomorphic function h in .J,l grows slowly at the boundary if, given any compact subset K of .fl, there is an integer v;;;' 0 and a constant C> 0 such that
(7.19)
Ih(z)! ~ Cd(z, J.J,lr
p,
for all z E K
n .J,l .
The space of holomorphic functions in .J,l which grow slowly at the boundary will be denoted by :lti.J,l).
814
F. Treves I Approximation and Representation of Solutions
The property that a holomorphic function u in .At grows slowly at the boundary is local: it suffices to check it in arbitrarily small open neighbourhoods of points of the boundary. Theorem (7.6). The restriction mapping h ~ hl.lt is a linear bijection of the space of distribution solutions in .it onto the space 'Jesl(.At) of holomorphic functions in the interior.At which grow slowly at the boundary.
Proof. Let h be a distribution solution in .it; its restriction to Af is evidently a holomorphic function in that open subset of em; we denote it also by h. Let U be an open neighbourhood of an arbitrary point of a.At, in fact the one we have been dealing with so far in the present section (and so the central point will be taken as the origin of em). We apply Theorem (6.4): Possibly after contracting U about 0 we may assume that (6.19) holds in U, with hi a ~I solution in U. But in the interior un Af, (6.19) reads (7.20) Indeed, in un.At, (7.21)
M:
= ~ by Proposition (6.2) and
ah
M.(h oZ) = -.oZ
sz'
I
(j=I, ... ,m).
Let then l5(z) denote the distance from z E Un.At to a.At. Let K be a compact neighbourhood of the origin contained in U such that (7.22)
l5(z)
= d(z, aU)
for all z E K.
By the Cauchy inequalities we have, for all z E K
n .At,
(7.23) But the set U'E U/3z E K:
1(- z] :S;;~I5(z)}
F. Tretes / Approximation and Representation of Solutions
815
is a compact subset of V, and thus (7.23) implies, after some increasing of the constant C> 0, the validity of (7.19) and therefore, since slow growth at the boundary is a local property, that hE f1ei.it). Corollary (7.4) shows that every h E f1esl(.it) defines a distribution solution in the neighbourhood of every point of the boundary, hence everywhere in .ii. This completes the proof of the fact that h -+ h 1.« is a linear surjection from the space of distribution solutions in .ii onto f1esl(.it). That the map is injective is evident for h is smooth transversally to the boundary, and this would not be possible if h were to vanish identically in the interior .it, unless of course h = 0 as a distribution in .ii. D Corollary (7.7). Every function h E f1eiAt) has a well-defined distribution trace on the boundary a.it.
Indeed, h can be regarded as a distribution solution. Such a distribution on .ii is smooth transversally to the boundary and has therefore a trace on aAt. Theorem (6.6) applies without modification to the present situation and we shall not restate it here. Note that the space of distribution solutions carry a natural topology, that of the space of distributions in .ii which are smooth transversally to the boundary. On the other hand, the space f1esl(At) of holomorphic functions in At which grow slowly at the boundary carries also a natural topology, defined by means of inequalities (7.19). We leave the description of these two topologies to the reader, as well as the proof of the fact that the map h -+ hl.« is a homeomorphism. It is in the sense of these topologies (with .ii replaced by one of its relatively open subsets, V o) that the word 'limit' is used in the following statement: Theorem (7.8). Every point Zo of .ii has an open neighbourhood V o such that every holomorphic function h in At, slowly growing at the boundary, is the limit, in f1ei Vo), of a sequence of holomorphic polynomials (i.e. of polynomials with respect to z ', ... ,zm, with complex coefficients).
This follows at once from Theorem (7.6) and Corollary (6.7). Theorem (7.9). Suppose At connected and let X denote a totally real (embedded) submanifold of em, of real dimension m, contained in.ii.
816
F. Treves I Approximation and Representation of Solutions
If the trace of a distribution solution h on X vanishes identically then h = 0 in ii. Proof. If.;(,f n X,e 0 then, by [4, Ch. II, Th. 2.3], h = 0 in an open subset of .;(,f which is not empty, and therfore h = 0 in .;(,f and also in ii. Suppose then that XC a.;(,f, and let us reason at a point of X. We can find a coordinate chart (U, x ', ... ,x m , tI, ... , r") in ii, in general not related to the complex coordinates z! (thus now, possibly, xi ,e Re z ') of the kind used in Section 6, but such' that furthermore U n X is defined, in U, by the equations t = O. Then Corollary (6.8) implies that h = 0 in some (relatively) open neighbourhood of X n U in .;(,f, and therefore in a nonempty open subset of ii, whence again the sought conclusion. 0 All the results of the present section extend routinely to the case where is an open set, with smooth boundary, in an arbitrary complex manifold .;(,f0 (in such a case, z ', ... ,zm are local complex coordinates in an open subset of .;(,f0).
.;(,f
References [1] M.S. Baouendi and F. Treves, A property of the functions and distributions annihilated by a locally integrable system of complex vector fields, Ann. of Math. 113 (1981) 341-421. [2) L. Hormander, The Analysis of Linear Partial Differential Operators I, Grundlehren Math. Wiss. 256 (Springer, New York, 1983). [3] L. Schwartz, Theorie des distributions (Hermann, Paris, 1966). [4] F. Treves, Approximation and representation of functions and distributions annihilated by a system of complex vector fields, Centre Math. Ecole Polytechn. Palaiseau, France (1981).
l.A. BARROSO editor, Aspects of Mathematics and its Applications © Elsevier Science Publishers B.V. (1986)
817
SOME APPLICATIONS OF INFINITE-DIMENSIONAL HOLOMORPHY TO MATHEMATICAL PHYSICS
Harald UPMEIER Department of Mathematics, University of Pennsylvania, Philadelphia, PA 19104, U.S.A.
Dedicated to Leopoldo Nachbin on the occasion of his sixtieth birthday
o.
Introduction
The concepts and results of infinite-dimensional holomorphy (on Banach spaces or other types of topological vector spaces) are of importance to mathematical physics whenever the physical systems under consideration involve an infinite number of degrees of freedom. A typical example is quantum field theory where analyticity properties have long played an important role. The present article gives a survey of the use of holomorphic functions of infinitely many variables in this area, in particular for the so-called (second) quantization. After reviewing the standard particle (or Fock) quantization, we describe infinite-dimensional (anti-) holomorphic wave quantizations which are often preferable from a conceptual or mathematical point of view. The complex wave quantization of a (flat) Hilbert space (Section 2) has been studied in detail by 1. Segal [11]. An interesting generalization to 'vector-valued Planck constants' is closely related to harmonic analysis on nilpotent Banach Lie groups arising as distinguished boundaries of symmetric Siegel domains. For systems with interaction, the quantization of curved phase spaces of infinite dimension has also recently been considered, and Section 3 gives an introduction to this promising field, based on the theory of symmetric Hilbert manifolds.
1. The Particle Representation
The quantization of a free field of particles obeying Einstein-Bose or Fermi-Dirac statistics amounts to the construction of representations for the canonical commutation relations (CCR) or the canonical anti-commutation relations (CAR), respectively, by (not necessarily bounded)
818
H. Upmeier / Applications of Infinite-Dimensional Holomorphy
operators on a complex Hilbert space H. These operators are labeled by the degrees of freedom of the physical system. For the free field with an infinite number of degrees of freedom, our underlying 'phase space' will be a separable complex Hilbert space V. Let (b, v) denote the scalar product in V, being conjugate-linear in b and complex-linear in v. Let h > 0 be a positive constant and let e = ±1. Algebraically, the canonical commutation (e = 1) and anti-commutation te = -1) relations can be formulated in terms of creation operators C v (complex linear in v E V) and their adjoint annihilation operators C~ satisfying the identities (1.1)
(1.2) for all b, v E V. Here [A, B], := AB - eBA denotes the e-commutator and the identities (1.1) and (1.2) are understood to hold on a common dense invariant domain of all the operators involved. A classical example for the quantization of the canonical s-cornmutation relations is the particle representation (or Fock representation) which is based on the tensor algebra (1.3)
.oJ(V) =
ffi (@n
n=O
V)
of V; endowed with the unique scalar product (')h such that (1.3) is an orthogonal direct sum and
..------...,
for all bi' Vi E v: Let H h : = .oJ( V)h denote the Hilbert space completion of .oJ(V) and consider, for e = ±1, the self-adjoint projection 1T, on H h determined by
<1' for all VI"'" vn E v: Here a runs over all permutations of the set {l, ... , n} and 1<1':= 1 whereas (-It is the signum of a. The image
H. Upmeier I Applications of Infinite-Dimensional Holomorphy
819
Hilbert space H::= 1TE (H h ) is called the e-symmetric Fock space. By (1.4) and (1.5), we have n!(1T I0"'0bn), 1T I0"'0vn»h= h- n LueU(b l , vu(I»"'(bn, vu(n) and, in particular, n!(1T_(b I0"'0bn), 1T)v I0" . 0 Vn»h = h:" det«b;, vi»' For any pair of vectors b, v E V, the operators E(b
E(v
and n
CH1TE ( V I0'" 0 vn» = h- 1n - 1/2 L ei-\b, v) i=1
C = omission symbol) are well-defined on the common dense invariant domain 1T (.c1(V» C H: and satisfy (1.1) and (1.2). In this model, n is the number of particles and, accordingly, the creation operators Cv map 1T (0 n V) into 1T (0 n+ 1 V), whereas the adjoint annihilation operators n 1 C~ map 1TE ( 0 V) into 1TE ( 0 n- V). The l-dirnensional subspace C = 0° V of H: describes the vacuum state. In case e = -1, the operators Cv and C; are actually bounded. In general, the quantization of the canonical commutation relations (s = 1) involves necessarily unbounded operators. It is mathematically more convenient to reformulate (1.1) and (1.2) in terms of bounded operators using J-parameter groups of unitary transformations. Consider the operators E
E
E
T C+·= v .
1/2(C* v
+ Cv) '
1/2(C* - ·C+ C -v·· - 2v - C) v -1 iv'
which are self-adjoint (resp. skew-adjoint) on a common dense invariant domain and satisfy the identities (1.6)
1
[Cb, C~] = h 1m (v, b) . Id , C*v = 2- 1/2(C+v + C-) »r
»
CV = 2- 1I2(C+v - C-) »r :
820
H. Upmeier / Applications of Infinite-Dimensional Holomorphy
Now suppose that the operators C: generate strongly continuous 1parameter groups of unitary operators W~ = exp(tC:) = exp(itC:)
on the representation Hilbert space H and that the mapping v ~ W~ is strongly continuous on V. Formally, (1.6) implies
wZ W~ W~b W~v = exp
(*
1m (v,
b»),
showing that t ~ W7b W~ exp(it2 1m (b, v )/2h) is a strongly continuous J-parameter group with infinitesimal generator C/, + Hence
C:.
(1.7)
W bh+V
=
(i
)
W bh W hv exp 2h 1m (b, v) ,
for all b, v E V. Formula (1.7) is called the 'integrated' Weyl form of the CCR and can also be interpreted as giving a unitary projective representation of the additive group V on H.
2. The Flat Complex Wave Representation
As pointed out in [11, p. 323], the particle representation is not very convenient for realizing the 'canonical transformations' or for studying 'wave-like' properties of the states of the physical system. The well-known Schrodinger representation realizes the canonical transformations on a Hilbert space of square-integrable functions on a real phase space and is therefore also called the real wave representation. In this model, the operators C; are the position operators (divided by h) whereas the operators -iC: are the momentum operators. From a relativistic point of view more satisfying is the complex wave representation which makes decisive use of (anti-) holomorphic functions on a complex phase space of infinite dimension. This type of representation has first been studied in [1] (cf. also [2, p. 1138]) and the rigorous formulation in the infinite-dimensional case is due to I. Segal [11]. Suppose V is a (separable) complex Hilbert space with scalar product (b, v), complex linear in v. If E C V is a subspace of (complex) dimension n, let
wZ
H. Upmeier / Applications of Infinite-Dimensional Holomorphy
821
denote a Haar measure on E. Here vi = Xi + iYi are coordinates with respect to an orthonormal basis of E. For any positive constant h, let
v, V») dYE(V):= h- n exp ( - -h- d,uE(V) be the Gaussian probability measure on E with variance ~h. A pintegrable tame (or cylindrical) function I on V has the form 1= IE op~, where IE : E ~ C is p-integrable and P ~ is the orthogonal projection onto a finite-dimensional subspace E C V. If IE is integrable, define the expectation value Ih(l):=
JIE dYE' E
and note that I h (I) is independent of the particular representation of I since (P~h dYF = dYE whenever E C F are finite-dimensional subspaces of V. The vector space Fh of all square-integrable tame functions on V is a pre-Hilbert space under the scalar product
and the Hilbert space completion of F;, is denoted by L~(V, C). The algebra of all tame polynomials (i.e. IE is a real-analytic polynomial) is dense in L~(V, C). Unlike the finite-dimensional case, the elements of this abstract Hilbert space can in general not be identified with functions on V modulo null-functions [11]. Now fix v E V and define an isometric transformation W~ on F;, by putting (2.1)
(W~f)(b):= I(b -
2- 1/2V ) exp( Tl/2 (b~ v) -
Then
(i
)
W bh W hv exp 2h 1m (b, v) = W hb+v
(~:») .
822
H. Upmeier / Applications of Infinite-Dimensional Holomorphy
W:
showing that v ~ defines a projective representation of the additive group V by unitary operators on L~(V; C) satisfying the Weyl relations. This so-called 'regular representation' of the CCR realizes the canonical transformations in a very natural way but has two disadvantages: the elements of the representation space L~(V; C) are in general not functions on V making a 'spatial' interpretation as 'wave functions' difficult, and the representation is not irreducible, i.e. it admits proper invariant subspaces. Both disadvantages disappear by using (anti-) holomorphic functions. Let H~ (V, C) denote the closed subspace of L~ (V; C) generated by all holomorphic tame polynomials of the form f = fE p~, where fE : E ~ C is a holomorphic polynomial. By [11, Th. 3 and Cor. 3.1], the elements of H~ (V, C) have a unique representation
W:
0
(2.2)
f(v) =
L lj(v)
j:O
as an everywhere convergent series of j-homogeneous continuous polynomials lj : V ~ C such that
Ifl~=s~p Jlf(v)fdYE(v)<+oo. E
It follows that the functions fE H~(V; C) are holomorphic on every finite-dimensional subspace of V but not necessarily (strongly) holomorphic on V in the sense of [9] since the convergence of (2.2) is not always locally uniform. In general, the functions in H~(V; C) are not even continuous (d. [6]). Since the creation operators Cv associated with (2.1) should depend complex linearly on v E V (in accordance with Section 1) it is convenient to consider anti-holomorphic instead of holomorphic functions. -2 2 Let Hh(V;C)={flfEHh(V;C)} denote the closed complex subspace of L~ (V; C) generated by all anti-holomorphic tame polynomials. Then H, := H~(V, C) (but not H~(V; C)) is invariant under the canonical transformations W: since (b, v) is conjugate-linear in b. The representation obtained by restriction to H; admits a cyclic vector (the 'vacuum state' given by the constant function 1) and is called the fiat complex wave representation since V can be regarded as a flat complex infinite-dim en-
H. Upmeier / Applications of Infinite-Dimensional Holomorphy
823
sional phase space. By [11, Th. 3], the Hilbert space H, contains 'reproducing kernel functions', (2.3)
b,
v))
f~(b):= exp ( -h- = exp tb,
V)l/h ,
in the sense that
f(b)
=
for all b E V and f E Hi. Since the mapping t ~ W tv is a strongly continuous I-parameter group of unitary transformations on H h , Stone's theorem asserts that there exists an infinitesimal generator d hv := -a
at
I t=O
W tvh
,
as a densely defined skew-adjoint operator in H; satisfying W~ = exp(t d~) for all t E IR. Differentiating (201), we get
where
a~f(b) = (b~V) feb) denotes the creation operator (complex linear in v) and a~f(b) = f'(b)v
is the annihilation operator associated with v. These operators have a common dense invariant domain (the space of all tame anti-holomorphic polynomials), are adjoint to each other, and satisfy
(2.4)
824
H. Upmeier / Applications of Infinite-Dimensional Holomorphv
(2.5) Note that (2.4) and (2.5) describe the complex linear part and the complex anti-linear part, respectively, of the real-linear mapping v ~ d~.
3. Curved Complex Wave Representations
The complex Hilbert space V used in Section 2 to describe the (anti-) holomorphic wave representation of the CCR has a flat geometry reflecting the fact that the quantized boson field is free (no interaction). The quantization of systems with interaction requires more general, possibly curved, phase spaces. The problem of investigating the structure and possible quantization of infinite-dimensional curved phase spaces can be approached either algebraically (e.g. by studying (Lie algebras of) unbounded derivations on C*-algebras) or geometrically. The geometric approach studies geometric and analytic properties of infinite-dimensional manifolds, e.g. Banach manifolds or Hilbert manifolds. In addition, these manifolds should admit interesting groups of automorphisms reflecting symmetry and invariance properties of the physical system. The class of symmetric Banach manifolds fulfills this requirement in an optimal way. By definition, a (connected) symmetric Banach manifold M carries a compatible tangent norm (in general not a Hilbert norm) such that every point m EM admits a symmetry Sm' i.e, a biholomorphic isometric involution of M having m as isolated fixed point. It can be shown (cf. [14]) that every such manifold M is homogeneous under the Banach Lie group Aut(M) of all biholomorphic isometries of M. Symmetric Banach manifolds can be characterized algebraically in terms of Jordan C*algebras and Jordan triple systems [14]. Of particular importance are the symmetric Banach manifolds M_ which can be realized as a domain in a complex Banach space Z and admit sufficiently many bounded holomorphic functions. These manifolds generalize the hermitian symmetric spaces of negative curvature and include all bounded symmetric domains. As an example, the open unit ball of a (Jordan) C*-algebra Z is homogeneous under biholomorphic transformations and is therefore a bounded symmetric domain. As in the finite-dimensional situation, the manifolds M_ admit a dual symmetric Banach manifold M+ generalizing the hermitian symmetric spaces of positive curvature. In this scheme, the flat symmetric Banach manifold Z can be written as Mo.
H. Upmeier / Applications of Infinite-Dimensional Holomorphy
825
As the example of C*-algebras indicates, symmetric Banach manifolds are far too numerous to admit a complete classification. There is, however, a class of symmetric manifolds which have recently been classified [5] and which seem to provide natural models for infinitedimensional curved phase spaces: the symmetric Hilbert manifolds which are locally modelled over Hilbert spaces. In order to describe the classification of symmetric Hilbert manifolds it suffices to consider manifolds of type M_ which will be called symmetric Hilbert domains. Up to the two exceptional bounded symmetric domains in e 16 and e 27, respectively, the irreducible symmetric Hilbert domains can be classified into four types generalizing the classical domains in en. For complex Hilbert spaces E and F, let 2(E, F) be the Banach space of all bounded linear operators from E into F, endowed with the operator norm '·100' The norm ideal 2 2(E, F) of all Hilbert-Schmidt operators is a Hilbert space under the scalar product
(t, z) : = Tr(C z ) . Here ?* E 2(F, E) denotes the adjoint operator of t E 2(E, F). Note that ?* z E 2 1(E) is a trace-class operator on E. The symmetric Hilbert domains of type I have the form
i.e. M_ is the intersection of Z:= 2iE, F) with the open unit ball of 2(E, F). Note that M_ is bounded in the Hilbert-Schmidt norm if and only if r:= min{dim E, dim F} < +00. More generally, a symmetric Hilbert domain is bounded if and only if its rank r is finite. Now fix a conjugation ¢ ~ ¢ on E and put Z:= {z E 2iE, E)/ z' = sz}.
Here e = ±1 and Zl(¢) = z*(;j;). Then Z is a closed subspace of 2 2(E, E) and the domains
are called symmetric Hilbert domains of types II (s = -1) and III (s = 1), respectively. The symmetric Hilbert domain of type IV is the so-called Lie ball (of rank 2) given by
826
H. Upmeier / Applications of Infinite-Dimensional Holomorphy
where 2 is a complex Hilbert space with conjugation and z' E 2(2,
B
E
( { ,
z)v = v + 2e{vCz} + {z{?v*{}*z}
for all ?, z, v E Z. Let (?, z) denote the invariant scalar product on 2 such that (c, c) = 1 for every minimal tripotent. Then there exists a Kahler-type hermitian metric h on ME such that (3.1) for all m E ME n Z and (, z E Z. The largest connected group G of biholomorphic isometries of ME acts transitively on ME' For 'sesquiholomorphic' functions >({, z), l/J({, z) (holomorphic in z), the Poisson bracket associated with the imaginary part of (3.1) is defined as E
a> al/J al/J ~ [>, l/J](?, z):= -BE(z, z)-- - BE(z, z)-. az a{ az a? Note that al/JI az is an anti-linear functional on Z, i.e. al/JI a? E Z whereas a>I az is a linear functional on Z. For example, for triple systems Z C 2iE, F) of type I, II or III, we have ({, z) = Tr(Cz) ,
and
H. Upmeier / Applications of Infinite-Dimensional Holomorphy
827
[c/J, ljJ]«(, z) = Tr ( -oc/J (Id F + szz ") -oljJ (IdE + ez" z)) a(
OZ
- Tr ( -oljJ (Id F oz
oc/J (IdE + ez*z) ) , + ezZ*)o(
where oc/J/oz E !£(Z, C) c !£2(F, E) and oljJ/o( E Z. Hence the (irreducible, separable) symmetric Hilbert manifolds M = M. (e = ±1) can be regarded as infinite-dimensional curved complex phase spaces. In the special case dim(Z) < +00, the Kahler metric (3.1) admits a sesqui-holornorphic potential function of the form L. «(, z) = log K. «(, z) ,
where (3.2) z):= N«(, - sz) is related to is called the scalar Bergmann kernel, N the generic norm N : Z x Z ~ C (an irreducible 'sesqui-polynomial' mapping of bidegree (r, r» and g = 2 + (r - 1)s + t denotes the genus of Z, defined in terms of characteristic numerical invariants (r, s; t) (ct. [8]). The potential functions L. «(, z) are analogous to the 'potential' L o«(, z) : = «(, z) used in the construction of the flat complex wave representation. Accordingly, we are now interested in Hilbert spaces of anti-holornorphic functions f: M. n Z ~ C which are endowed with a scalar product of the form E«(,
(3.3)
(f, g)h = ch
J f(z) g(z) exp(i. L (z, z)) dJL. (z ), E
M,nz
where h is a positive constant, c, is a normalization factor and
is a G.-invariant measure on ME (JL can be formulated as
=
Haar measure). Using (3.2), (3.3)
828
(3.4)
H. Upmeier I Applications of Infinite-Dimensional Holomorphy
(f, g)h = Ch
J j(z)g(z)KE«(, zylh dj-tE(z) M,nz
=
Ch
J j(z) g(z )NE«(, z rE/~ dj-tE (Z) , M,nz
where 1/ := hlg. By (2.3), the representation space H~(V, C) of the flat complex wave representation contains 'reproducing kernel functions' of the form exp«(, z)/h) for z E V. Similarly, define (3.5) whenever h (or 1/) is an 'admissible' positive constant and (, z E ME Then the functions (3.5) have the property
n z.
(3.6)
n
if ch is suitably chosen. Let F~ denote the linear span of the functions for z E ME n z. The admissible values of 'Planck's constant' h (or 1/) are determined as follows: for E = 1, 1/ is called admissible if 1/1/ EN. In this case, the functions are anti-holomorphic polynomials on Z for every z E Z, and (3.6) defines a positive definite scalar product on F~. For E = -1, the functions are well defined and anti-holomorphic on M_ for every z E M_ since N£ «(, z) ~ 0 and M_ is simply connected. The value h (or 1/) is called admissible if (3.6) defines a positive definite scalar product on F~. These values have been determined in the finite-dimensional case by F.A. Berezin [3] and S.G. Gindikin [4]. In order to generalize these concepts to the infinite-dimensional case we first define N£ z) case-by-case. Suppose first that Z C .5£2(E, F) is of type I, II or III. Then Ie zl,,:;;; 1(1"lzl" < 1 for all (, z E M_ and hence
n
n
u.
is a trace-class operator. For type I or III, put
H. Upmeier / Applications of Infinite-Dimensional Holomorphy
N E (t, z) : = det(Id
829
E+ ee z) = exp(Tr 10g(Id + Ee z)) , E
whereas for type II (where N is given by the Pfaffian in the finitedimensional case) put N. (t, z) : = det(IdE +
Ee z
)1/2 =
exp(~Tr 10g(Id E
+
ee z)) .
For Z of type IV, define
NE(t, z):= 1 + E(t, z)+ ~(t, [)(i, z). Now define the functions t; (t) for t, z E M. n Z and admissible constants 71 (which are more appropriate in the infinite-dimensional setting) by (3.5). The notion of admissible value of 71 is defined as above: for E = 1 we require 1/71 EN, whereas for E = -1 the condition is that (3.6) induces a positive definite scalar product on the linear span F.". Using the results of [3], [4], LA. Shereshevskii [13] has determined the admissible values of 71. The Jordan theoretic invariants r, sand g (which are still meaningful for Hilbert domains) enable us to rephrase the results of [3], [4], [13] in a unified way. Theorem (3.1). Let ME be a non-fiat irreducible symmetric Hilbert manifold of rank r « +00. Then the admissible values of 'Planck's constant' 71 (including 71 = +00) are the following: (1). For M = M+, we have 1/71 EN. (2). For M = M_, we have discrete values (3.7)
1
I.
-=,].5J
71
for all integers j with O:s::; j < r, and continuous values (3.8) Note that for r = +00, (3.7) gives 1/71 E ~sN whereas (3.8) is empty. For the Lie ball of infinite dimension we have r = 2 and s = dim(Z) - 2 = +00. Hence no finite value of 71 is admissible. In all other cases, there are enough admissible values to allow the 'classical limit' 71 ~ O.
830
H. Upmeier / Applications of Infinite-Dimensional Holomorphy
Let us now indicate how the admissible values of 'Planck's constant' give rise to quantizations for (irreducible) symmetric Hilbert manifolds Me (15 = ±1) except for the Lie ball of infinite dimension. Let HT/ be the completion of FT/ with respect to the positive definite scalar product (3.6). Then the real Banach Lie group G, of holomorphic isometries of Me admits a unitary (projective) representation on HT/ of the form (W;f)(() = f(g-I()Ke (g.
0,
g . 0)'/2h tc,((, g . 0 f
= f(g-IONe ((, g . 0 )e/T/ N, (g.
0,
e h /
g . 0 f e l2T/ •
It can be shown that these representations are irreducible and pairwise inequivalent. By Stone's theorem, the Lie algebra Cfie of G e , consisting of all infinitesimally isometric complete holomorphic vector fields on Me' acts on HT/ by (unbounded) skew-adjoint operators dl for X E Cfie • Since these vector fields correspond to functions on Me via the symplectic structure, we obtain a quantization of Me depending on T/, [10]. For example, the position and momentum observables correspond to the vector fields
a az
X; := (b + dzb* z}) - E Cfie on Me' where bE Z and {zb* z} is the Jordan triple product. Note that these vector fields do not commute, even for vectors b in a real form of Z. The associated creation and annihilation operators correspond to the vector fields
a az
{zb* z}and
a az
b-
belonging to the complexification Cfif = Cfi~l' The energy observable corresponds to the 'circle vector field'
H. Upmeier / Applications of Infinite-Dimensional Holomorphy
1= 2iz -
8
8z
E CSI
831
n CS_ I
on ME generating the symmetry
about OE ME' Conversely, there exists a 'dequantization' procedure ('classical limit' 11 ~ 0) which can be described as follows: For any bounded operator a E !£(H.,) on H." the sesqui-holomorphic function
is called the n-symbol of a. The mapping (T., IS a *-homomorphism ('l1-symbol homomorphism') for a suitably defined involution and product (depending on 11) of sesqui-holomorphic functions. The correspondence principle, [2], [13], asserts that for symbol functions l/J = CT., (a) and l/J = CT., (b) of 'operator fields' a = (a.,) and b = (b.,), respectively, which are independent of 11 (and satisfy certain 'tameness' conditions, d. Section 2), the product CT.,(ab) and the commutator l1-ICT.,[a, b] of operators correspond to the product l/J . l/J and the Poisson bracket [l/J, l/J] of the symbol functions, respectively, as 11 ~ O.
References [I) [2) [3) [4) [5) [6)
(7)
V. Bargmann, On a Hilbert space of analytic functions and an associated integral transform I, II, Comm. Pure AppI. Math. 14 (1961) 187-214; 20 (1967) 1-101. F.A. Berezin, Quantization, Math. USSR-Izv. 8 (1974) 1109-1165. F.A. Berezin, Quantization in complex symmetric spaces, Math. USSR-Izv. 9 (1974) 341-379. S.G. Gindikin, Invariant generalized functions in homogeneous domains, Funct. Anal. Appl. 9 (1975) 50-53. W. Kaup, Uber die K1assifikation der symmetrischen hermiteschen Mannigfaltigkeiten unendlicher Dimension I, II, Math. Ann. 257 (1981) 463-486; 262 (1983) 57-75. P. Kree, Solutions faibles d'equations aux derivees fonctionelles II, Lecture Notes in Math. 474 (Springer, Berlin, 1975) 16-47. E.H. Lieb, The classical limit of quantum spin systems, Comm. Math. Phys. 31 (1973) 327-340.
832
(8)
[9] [10] [11] [12] [13]
[14]
H. Upmeier / Applications of Infinite-Dimensional Holomorphy
o.
Loos, Bounded symmetric domains and Jordan pairs, Lecture Notes (Univ. of California, Irvine, 1977). L. Nachbin, Topology on spaces of holomorphic mappings (Springer, Berlin, 1968). B. Oersted, A model for an interacting quantum field, J. Funct. Anal. 36 (1980) 53-71. I. Segal, The complex wave representation of the free boson field, In: Topics in Funct. Anal., Adv. in Math. Suppl. Stud. 3 (Academic Press, New York, 1978) 321-343. D. Shale and W.F. Stinespring, The quantum harmonic oscillator with hyperbolic phase space, J. Funct. Anal. 1 (1%7) 492-502. LA. Shereshevskii, Quantization based on infinite-dimensional hermitian symmetric spaces, Moscow Univ, Math. Bull. 32 (1977) 28-36. H. Uprneier, Symmetric Banach Manifolds and Jordan C*-Algebras (North-Holland, Amsterdam, 1984).
J.A, BARROSO editor, Aspects of Mathematics and its Applications
© Elsevier Science Publishers B.V. (1986)
833
ON SLOWIKOWSKI, RAIKOV AND DE WILDE CLOSED GRAPH THEOREMS
M. VALDIVIA Facultad de Matemdticas, Universidad de Valencia, Burjasot, Valencia, Spain Dedicated to Leopoldo Nachbin
This article is devoted to the study of the classes of spaces introduced by Slowikowski, Rafkov and De Wilde respectively, which are related with the closed graph theorem. Some new results on this theorem are also provided.
1. Introduction
The vector spaces we shall use are defined over the field K.of real or complex numbers. The word 'space' means 'separated topological vector space', unless the contrary is specifically stated. If A is a bounded absolutely convex subset of E, we denote by E A the linear hull of A endowed with the norm of the Minkowski functional of A. Grothendieck proved in [3] that if E is an ultrabornological space, F is an (LF)-space and T is a linear mapping with closed graph from E into F, then T is continuous. He conjectures that this theorem remains valid if we replace the (LF)-spaces by a range class containing all the Banach spaces and stable with respect to the following operations: countable topological direct sums, separated quotients, closed subspaces and countable topological products. In particular if is a non-empty open subset of the n-dimensional euclidean space, then the Schwartz space ~I(n) endowed with the strong topology belongs to this class. In what follows we shall study classes of spaces related with this conjecture.
n
2. Sfowikowski Spaces Let P and Q be countable infinite sets and let M be a non-empty subset of pO. Given a vector space E, we suppose that for every pair (p, q) where p = f(q), q E Q, fE M, there exists a linear subspace Ep.q of
834
M. Valdivia I Closed Graph Theorems
E endowed with a pseudo-metrizable vector topology defined by an F-seminorm I'/p,q. If (p q ) belongs to M we set
and we suppose this space endowed with the coarsest vector topology such that the canonical injection from E(pq) into E Pq,q is continuous for every q in Q, Then the topology of E(pq) is defined by an F-seminorm lL,» We denote (M, Ep,q, 1.l p• q ) by 8F. Following Slowikowski, 8F is an af3y-representation of E if the following conditions are satisfied: (a). If (p q ) belongs to pO and
for every non-empty finite subset H of Q, then (p q ) belongs to M. (f3). For every r in Q, (pq ) in M and every non-empty finite subset H of Q, one has that
U
Ekr,r= E.
(kq)EM
(y). Given Q = U:=I U; U; C U n+1' (p q ) in M and
where
Xn
n
E
U;
is
non-empty,
finite,
and
Epq,q
qEU.
with k
(2.1)
limsup{l2:xn+ij n
k
,=1
i=I,2, ... ,k}=O
jXn+iEEpq,q, Pq,q
for every q in Q, then there exists (P:) in S(U, (p q » such that (x,') belongs to E(p') and q
k
.
lim sup /2: xn+il n
k
;=1
= (p;)
O.
M. Valdivia I Closed Graph Theorems
The alh-representation ~ is called complete if the space complete for every (Pq) in M. Slowikowski proves the following result in [5]:
835 E(pq)
is
Theorem (2.1). Let ~ be a complete al3y-representation of a vector space E. Let G be a Banach space and let T be an injective and linear mapping from G into E satisfying the following condition: if (x n) is a sequence in G converging to x and there exists (Pq) in M such that (T(xn» is contained and converges to z in E (Pq) then T(x) = z. Then there exists (k q ) in M such that T(G) is contained in E(k q ) and Tis continuous from G into E(k q ) ' The next definition is clearly suggested by our purpose of giving a closed graph theorem in view of the previous theorem. Definition (2.2). E is a Stowikowski space if it admits a complete af3yrepresentation ~ such that for every (Pq) in M the topology on E(pq) induced by the topology of E is coarser than the topology of E(pq)' This property will be expressed in what follows by saying that the topology of E is compatible with the af3y-representation. The next result is an immediate consequence of Theorem (2.1): Theorem (2.3). Let F be an ultrabornological space. If S is a linear mapping with sequentially closed graph defined on F with values in a Slvwikowski space E, then S is continuous. The class of Slowikowski spaces contains the F-spaces and it is stable with respect to the following operations: countable topological direct sums, closed subspaces, countable topological products and continuous linear images. By Theorem (2.3), the class of Slowikowski locally convex spaces answer the conjecture of Grothendieck.
3. The Class
~o
of Raikov
Given a subset L of a linear space E, we say that a sequence (x n ) (respectively, a double sequence (xn,m» in E is finally contained in L if there is a positive integer h such that xn (respectively, x n m) belong to L if
836
M. Valdivia I Closed Graph Theorems
n ~ h (respectively, if n, m ~ h). If F is a linear subspace of E endowed with a linear topology au and the sequences (x n ) and (x n, m) are finally contained in F, we say that (xn ) (respectively, (xn,m» converges to the origin in F[au] if given any neighbourhood of the origin U in F[au], (x n ) (respectively, (x n, is finally contained in U. Let P, Q, M, Ep.q, E(pq)' 1.1p.q and 1.I(Pq) have the same meaning as in Section 2. Following Raikov, we say that (P, Q, M, Ep.q) is a :!flo-representation of the space E if conditions (a) and (/3) of Slowikowski are satisfied as well as the following condition: (8). Given any (Pq) of M, if the sequence (x,) of E is such that the double sequence (z, - x m ) is finally contained and converges to the origin in each Epq,q, then (xn ) converges to a point x in E and (x - xn ) is finally contained and converges to the origin in each E pq•q • The class of spaces which admits a :!flo-representation is called f0o. This class was introduced by Raikov in [4] to answer the conjecture of Grothendieck and to give a 'double closed graph theorem' which we shall deal with in Section 5. In what follows in this section we shall prove that the class :!flo coincides with the class of Sfowikowski spaces. In our next two propositions we consider a :!flo-representation (P, Q, M, Ep,q) in a space E. Suppose that Q is the union of {Un}:=!, with U, non-void and finite and U; C Un +! . Let (p q ) be an element of M. We suppose that n q E u; E Pq, q is endowed with the coarsest linear topology such that the injection from this space in Epq,q is continuous, q E Un' For every positive integer n, let H; be a finite-dimensional subspace of E such that
m»
u, n n
qeUn
We
let
HI' H 2 ,
· · .,
Gn
E pq• q = {O},
denote
the
u; n q e o; E pq• q • We set
Hn +1 C
n
qEU.
topological
E pq• q
•
direct
sum
of
and we provide it with the coarsest linear topology for which the injection from G into G n is continuous, n = 1,2, .... Proposition (3.1). G is complete.
M. Valdivia I Closed Graph Theorems
837
Proof. It is obvious that G is pseudo-metrizable. We take a fundamental system of circled neighbourhoods of the origin {Vn}:=j, with Vn+ j + ... + V n + 1 C Vn + I)-summands). Let (zm) be a Cauchy sequence in G. We set
«n
Then (y~) is a Cauchy sequence in H, and, consequently, converges to a point u l in this space. Since HI is a subspace of G, (x~) is a Cauchy sequence in G. We select a positive integer m j such that for n, m ~ m,
Proceeding by recurrence, suppose that for a positive integer r we have obtained a Cauchy sequence (x ~) in G with x~ E
n
qEUj
(m = 1,2, . . .),
E pq•q
(y~) in H j converging to uj (j = 1,2,
such that for n, m
~
m" and for j
=
1,2,
, r), and a positive integer m, , r,
We set X;:I E
n
Epq,q,
qE U,+1
Then (y;:') converges to a point u,+' and (X~+I) is a Cauchy sequence in G. We take a positive integer m,+, > m, such that for n, m ~ m,+j, and for j = 1,2, ... , r + 1, x~ - x~ E V'+2 .
We consider now the sequences (z m,) and (t,) = One has
(x~,
+ u, + U,_I + ... + u 1) .
z m, =x'rn, +y'm, =x 2m, +y2m, +y'm, =···=x'm, +y'm, +Y'-'+"'+y' m, m, , On the other hand,
838
M. Valdivia / Closed Graph Theorems
t, -
(3.1)
Z
m,
= U, -
y'
E VT+I
+ ... + V,+1 C
m,
+ U r-l - y,-I + ... + u 1 _ m,
,+1
yl m,
V,.
We fix an element q in Q and we take a neighbourhood of the origin W in E pq• q • We find a positive integer s such that q is in Us and V, n E Pq. q C W. If r > h > s + 1 one has .
t, - th = x~, - X~h + U, + U,_I + ... + U h + 1 , and having in mind that
and, analogously, XS
M,
=
x'm, + y'm, + Y,-1 + ... + yS+1 m, m,
,
it follows that tr - th =
XS m,
_
XS mil
+ yhmh _
yh
m,
+ yh-I _ mh
yh-l m,
+ ... +
r:_r: mh
m,
+ U, - Y rm, + U,-l - Y ,-I + • •• + Uh+ 1 - Y m, h+1 m, E Vh
h-s+1
,-h
+ ... + V h + V, + ... + V,
C V,+I + V s +1 C
v,.
On the other hand, t, - th belongs to E pq• q and therefore t, - th is in W and thus the double sequence (tn - tm) is finally contained and converges to the origin in E Pq. q from where it follows, applying condition (8), that (t,) converges to a point x in E and (t, - x) is finally contained and converges to the origin in On for every positive integer n, from where it follows easily, using (3.1), that (z m) converges to x in O. Consequently, (zm) converges to x in O. 0
Proposition (3.2). The canonical injection from 0 into E is continuous. Proof. It is enough to prove that the sequence (zm) considered in the proof of Proposition (3.1) is bounded in E. Suppose this is not true. Then
839
M. Valdivia / Closed Graph Theorems
there exists a neighbourhood of the origin U in E and a subsequence of (zm)' which we still represent by (zm) such that
m, n
=
1,2, ... , m
y!
n,
Let V be a circled neighbourhood of the origin in E such that V + V + V C U and for every positive integer n, let W n be a circled neighbourhood of the origin in E such that n
Wn+···+WnCV.
From (zm) we construct the same sequences as in the proof of Proposition (3.1) taking in this case
instead of uj - y~ E V,+l (m, n > m,; j = 1,2, ... ,r). The sequence (t,) converges to a point x in E and therefore there is a positive integer h such that tm - tn E V, m, n ;;:: h. Then, if k > r> h, one has Z
mk -
Z m, = Z mk
- tk + tk - t, + t, -
Z m,
=ykmk -uk +yk-l_ uk-I +"'+ylmk -u I +tk -tr +ur -y'm, mk ,-1 1 + U,-l-Ym, +"'+Ul-Ym, k
r
EWk+···+Wk+V+W,+···+l¥,.CV+V+VCU,
which is a contradiction. 0 Proposition (3.3). Let (P, Q, M, Ep,q) be a qgo-representation of a space E. If (Pq) belongs to M, then E(pq) is an F-space and the canonical injection from E (Pq) into E is continuous. Proof. If we take {O} = HI = Hz = . .. in our previous construction, it follows that G = E (Pq)' It is enough to apply now Propositions (3.1) and (3.2). D Proposition (3.4). Let (P, Q, M, E p • q ) be a qgo-representation of a space E. Let T be a linear mapping with sequentially closed graph from a metrizable
840
M. Valdivia / Closed Graph Theorems
Haire space F into E. Then there is an element (p q ) in M such that T(F) is contained in E (Pq) and T is continuous from F into E (Pq)' Proof. We may suppose without loss of generality that Q is the set of
positive integers. Since
U E rh l
E=
,
(rq)EM
there is an element (p~) in M such that rl(E pl, I) is a subspace of second category of F. We set = PI' Proceeding by recurrence, we suppose that for a positive integer k, we have found an element (p;) in M such that l P; = Pj' j = 1,2, . , . , k, and r (n ;=l E Pi') is a subspace of second category in F. Since
P:
k
nEpi,jCU{Erk+l,k+llrj=pj' j= 1,2, .. "k,(rq ) E M }, j=l
there is an element (p;+l) in M such that p7+1 = Pj' j = 1,2, ... , k, and rl(n;:11 Epk+l) is of second category in F. We set p~:: = Pk+l' For every}positive integer j, let {V~}:=l be a fundamental system of neighbourhoods of the origin in E Pi. j with v.. + V~+l C V~. We let {Zn}:=l denote a fundamental system of neighbourhoods of the origin in F. It is immediate that rl(nj=l V~) is a subset of second category in F, m, n = 1,2, . , .. We fix a positive integer k and we write Wn to denote the closure of rl(n;:~ V~) in F. We take x in F and we find XI in T-l(n;=l E pi ) such that X - Xl belongs to WI n Zl' Proceeding by recurrence, suppose we have found Xl' X 2, •.• 'Xm in F such that
We now find an element xm + 1 in
r
1
(n;:;n V~) such that
It is immediate that, for every positive integer q, the double sequence j ) ) is finally contained and converges to the origin in Epq,q. Consequently, the series ~ T(x) converges to a point z in E. Since the sequence (~;=l T(xj ) - z) is finally contained in E Pk' k and T(x) E E Pk
(L;::" T(x
M. Valdivia / Closed Graph Theorems
841
(j = 1,2, ...), it follows that z belongs to E pk' k : On the other hand, x = L xi in F, from where it follows that T(x) = z. Since we have taken any arbitrary k, it follows that T(x) belongs to E(p) and thus q T(F) C E (p q ). By Proposition (3.3) the graph of T is closed in F x E (p ) and E(pq J is an F-space. Applying now the Banach-Schauder theorem ft follows that T is continuous from F into E (Pq)' 0 Remark (3.5). Suppose in Proposition (3.4) that H is a non-empty finite subset and that there is an element (rq ) in M such that T(F) is contained in q E H E 'q. q' Obviously, we can count 0 in such a way that H coincides with {I, 2, ... , s}. It is immediate that (p q ) can be taken satisfying PI = rl , P2 = r 2 , ••• , P, = rs and, therefore, T(F) is contained in E(p J and q (pq ) E 5(H, (rq ».
n
Proposition (3.6). If (P, 0, M, E P. q) is a rlJo-representation of a space E, then (M, E p,q' I./ p• q ) is a complete af3r-representation of E compatible with its topology. Proof. First we shall see that condition (r) is satisfied. Suppose 0 = U with non-empty and finite and u, C Un + l • Let (Pq) be any element of M and let xn E q E U. E P , q be such that (2.1) is satisfied for q each q in Q. Let F I be the linear hull of {XI' x 2' ••• , X n , ••• }. We set HI = {OJ and, if n > 1, we take the algebraic complement, H n , of F I n q E o; E Pq.q in F I n q E U._I E Pq.q' Clearly the dimension of is finite. Let G be the space constructed above. By Propositions (3.1) and (3.2), G is an F-space and the canonical injection J from G into E is continuous. Applying Proposition (3.4) and keeping in mind Remark (3.5) we obtain an element (P:) in 5(UI , (p q » such that J(G) is contained in E(p;J and J: G ~ E(p;J is continuous. Obviously, (xn ) belongs to E(p'J' On the other hand, let V be a neighbourhood of the origin in G. We fiJd a positive integer m, a neighbourhood of the origin "'f in ~, j = 1,2, ... , m, and e > 0 such that
:=1 u,
u,
n
n
n
W I + W2+ ···+Wm+{xE
u,
n
Epq)XEG,lxlpq,q<e,qEUm}CV,
qEU",
By (2.1) there is a positive integer r> m such that
IL x k
,=1
n +i !
Pq.q
< e,
n ~ r, q E Un' k
=
1,2, ....
M. Valdivia / Closed Graph Theorems
842
Since
n > r, i = 1, 2, ... , it follows that the series ~ X n is Cauchy in G and thus converges in this space. Therefore, this series is convergent in E (p.), from which follows that q
IL x I k
lim sup n
k
n +i
i~I
=
0.
(p;)
Finally, Proposition (3.3) ensures that if (Pq) E M, E(pq) is an F-space and (M, E Pq.q' /.1 Pq.q) is compatible with the topology of E. 0 Proposition (3.7). If (M, E p,q' 1.1 p,q) is a complete a{3y-representation of E, compatible with its topology, then (P, Q, M, E p• q ) is a @o·representation of
E.
Proof. We shall see that condition (8) is satisfied. Given (p q ) in M, let (x n ) be a sequence in E such that the double sequence (xn - xm ) is finally contained and converges to the origin in each Epq,q. Given Q = U:~I {Un} where U; is non-empty and finite and U; C Un + I , we can find a subsequence (x nI) of (x,) such that
Ix nj -
x nj+l IPq. q < 2-'j
q E Ui
:
We set Yj = x mj - X mj+l (j = 1,2, ...). Then, taking any q in Q, we find a positive integer n such that qk E Un' Consequently, k
k
)2: Yn+i I . ; ~ T(n+;) < 2- n Pq.q
,~I
(k = 1,2, ...) ,
I~I
and therefore, for every q in Q, k
lim supf! n
k
~ Yn+II
,=1
Pq.q
I
Yn+i E E pq.q, i = 1,2, ... , k} = 0,
M. Valdivia / Closed Graph Theorems
843
and, applying condition (1'), there is an element (P:) in S(U\, (pq ) ) such that (Yj) belongs to E(p')q and
I~ Yn+ij k
lim sup n
k
j;\
(p') q
=
0,
from which follows that the series L Yj converges to a point Y in E(p;). Consequently, setting x = x n l - Y, the sequence (x n) converges to x in E. Suppose that (x n ) does not converge x in E. We find a neighbourhood of the origin U in E and a subsequence (zn) of (x,) such that (n = 1,2, ...) .
(3.2)
Proceeding in the same way as we did for (xn ) , we can find a subsequence (z q) of (z,) converging to a point Z in E. By (3.2), x is distinct from z. J We find now a subsequence (x r.) of (x n ) such that J
Ix
'j
- x 'j+1 IP q<. 2-j q'
q E Uj '
and such that (x,.) contains an infinite number of terms of (x nJ) and also J of (z qJ Then, proceeding again in the same way as we did for (x n) we can prove that (x,.) converges to a point u in E and, consequently, J x = U = z, which is a contradiction. Thus (x n ) converges to x in E. Fix an element r of Q. We shall see that (xn - x) is finally contained and converges to the origin in E p.;«: Suppose this is not true. We select the sets Un' n = 1,2, ... , such that, belongs to U\. We can find a closed neighbourhood of the origin V in E P,., and a subsequence (x m) of (x;) such that x mJ. - x v, j = 1,2, ... , and
e
Ix
mj
- x mj+1 IPq.P <
z! ,
q E Ut :
Proceeding as above, we find (p*) in S(U\, tr,» such that x m j - x m j + 1 belongs to E(p')q and the series ~ (x mJ - x mJ.+ I ) converges in this space. Since
844
M. Valdivia / Closed Graph Theorems
and the canonical injection from E(p;l into Ep,.r is continuous, there is a positive integer s such that s+k
"(x L.J mj -x mj+l )=x m J -x m.s+k+l EV
(k = 1,2, ...),
j~s
from which easily follows that x m, -
X
E V, which is a contradiction. 0
Theorem (3.8). A space belongs to the class @'O if and only if it is a Stowikowski space. Proof. It is an immediate consequence of Propositions (3.6) and (3.7). 0
4. Spaces with Strict Webs We say that a @'o-representation (P, Q, M, E P. q) of a space E is convex if the space E p.q» which is not necessarily separated, is locally convex. If moreover E is locally convex, we shall say that E is a strict Sfowikowski space. Let W= {C n p 2, ... ,n) be a family of subsets of a space E, where k, n l , n 2 , ••• , nk are postive integers. W is a web if it satisfies the equalities
If all the elements of the web Ware convex (absolutely convex) we shall say that W is convex (absolutely convex). The web W is of type ri, or a ri-web, if the following condition is satisfied: for each sequence (nk ) of positive integers there is a sequence (Pk) of positive numbers such that for every Ak in IK, with O~IAkl~Pk and every x, in C nlon2.... ,nk' the series L AkXk converges in E. The web W is strict if it is absolutely convex and for every sequence (n k ) of positive integers there is a sequence (pd of positive numbers such that for every Ak in IK, with a ~ IAkI ~ Pk and every Xk in C nl, n2.... , nk' the series L ;~h+1 AkXk converges in E to a point belonging to C nlo n2, .. " nh (h = 1, 2, ...). The concepts of ri -web and strict web are due to De Wilde [1]. For the next proposition we suppose that the space E has a strict web W = {Cnl.n2..... nk}. We let Q = N denote the set of positive integers and
M. Valdivia I Closed Graph Theorems
845
p = Nk, the set of all the elements of the form (n), n z, ... , nk) where k, n), nz, ... ,nk belongs to N. Let M be the subset of all the elements
k = 1,2, ... , for some sequence to denote the linear hull of Cnp2•...• nk endowed with the locally convex topology given by the Minkowski functional of this set. (Uk)
of pO such that
Uk
= (n),
n z, ... , n k ) ,
(n k ) of positive integers. We set
E(nlo ...• nk).k
Proposition (4.1). (P = r-l, Q = N, M, E p,q) is a ~o-representation of E. Proof. Obviously E P.q is locally convex. Let (Uk) be any element of po such that for every non-empty finite subset H of one has that S(H, (Uk» ~ 0. Given a subset A = {I, 2, ... ,p} of 0, let (n), (n), nz), ... , (n), n z, ... , nk)' ... , be an element of M belonging to SeA, (Uk Then U) = (n), U z = (n), n z), ... , Up = (n., n z' ... ,np ) ; it follows that (Uk) belongs to M and, therefore, (a) is satisfied. We take now r in 0, (ud in M, with Uk = (n), n z, .. . , nk), k = 1,2, ... , and a non-empty finite subset H of 0. Let m be the greatest integer in H. We take (hk) in S(H,(u k ». If r e m, then hm=(n),nz, .... nm) and, therefore,
°
».
If r> m, we take any vector x in E um• m " By the definition of web one has that there is a sequence of positive integers sm+)' sm+Z, ... such that
It follows that
and therefore
nE
kEH
Uko k
= E
um• m
CU{Eh",I(hk)E S(H, (Uk»}'
On the other hand, if x is any element of E, by the definition of web there is a sequence (mk ) of positive integers such that
846
M. Valdivia / Closed Graph Theorems
(k = 1,2, ...).
If hk
= (m),
m z,""
m k ) (k
= 1,2, ...), (hk ) belongs to M and
x is
In
E h" , ' Thus E=
U E s" "
(sklEM
and we conclude that condition (/3) is also satisfied. Finally, we take (Uk) in M with Uk = (n), n z, ... , n k ) (k = 1,2, ...), and a sequence (x k ) in E such that the double sequence (x m - x n ) is finally contained and converges to the origin in each E Uk. k : We find a sequence (Pk) of positive numbers such that for every Ak E IK, 0.:;; IA k' .:;; Pk and Zk E C nl • n2•... ,nk. the series L;=p+) AkZk converges in E to an element of Cnl.n2 ..... np. P = 1,2, .... We can find a subsequence (x p ) such that (k = 1,2, ...).
(4.1)
Then the series L (x pk - x pk+1 ) converges to a point Z in E. If x = x p I - Z, the sequence (x p) converges to x in E. We find a neighbourhood of the origin U in E and a subsequence (Yk) of (x k ) such that (4.2)
Yk
-xg U
(k = 1,2, ...).
Proceeding as above for the sequence (x k ) , we find a subsequence (y q) of (Yk) converging to a point Y in E. We find now a subsequence (x,) of (xk ) such that
(k = 1,2, ...), and such that (X'k) has infinite terms of (x p) and infinite terms of (y q) Then (X'k) converges in E to a point which coincides with x and Y, which contradicts (4.2). Suppose now that there is a positive integer s such that the sequence (x, - x) is not finally contained in E(nlo n2., .. ,n or it is finally contained but does not converge to the origin in this space. Then there are h > 0 and a subsequence (x m) such that S),s
(4.3)
M. Valdivia / Closed Graph Theorems
(4.4)
847
(k = 1,2, ...) .
From (4.4) one has that the series L;~s+1 (x nk - x n k +) converges in E to a point x n,+l - X of hC nlonz....• n, which contradicts (4.3). This proves condition (8). 0
In the next two propositions we suppose that (P, Q, M, Ep,q) is a of a space E. We shall suppose, without loss of generality, that Q = N. If Pq = g(q) for some g in M, q E Q, we take ~o-representation
(4.5) (k
= 1,2, ...),
a fundamental system of closed circled neighbourhoods of the ongm in E p .q' Let/be a mapping from Q on the family of all subsets of the form rUpl'~.I' when rand (p q ) vary in Q and M respectively. We set C nl = /(n l ) , n l E Q. Proceeding by recurrence, if we fix n l , nz, ... , nk in Q and we suppose that (4.6)
such that there is an (Sq) in M with si = Pi (j = 1,2, ... , k). Let I nl. n2•...• nk be a mapping from Q on the family of all the subsets of the form
when r is in Q and Pk+1 = g(k + 1) with g varying in M such that g(j) = Pi (j = 1,2, ... , k). We set
Proposition (4.2). The lam ily "W = C %
nz..... nk
Proof. Since
E=
U
(Pq}EM
B;I' I '
is a web in E.
848
M. Valdivia / Closed Graph Theorems
given x in E there is a (p q ) in M such that x belongs to E PI' I' We find a positive integer h such that x is in h U Pl' I. r- If m belongs to Q and f(m) = hU PI' 1.1' one has that x is in Cm and therefore E coincides with U:I=I C n l • We fix now x in the set (4.6). By condition (j3) one has k
nE q=1
pq• q
CU {ESHIok+11 (s)E M, Sj
= Pj' j = 1,2, ... , k}.
Since Cnlon2..... nk is included in E pq• q (q = 1,2, ... , k), there is an (Sj) in M such that Sj = Pj' j = 1,2, ... , k and x belongs to ESHI.k+I' We find a positive integer mk+1 such that
Hence,
If nk+ 1 is a positive integer such that fnlon2 ..... n/nk+l) is the set (4.7) it follows that x is in Cnlon2..... nHI. Consequently (k
> 1),
and therefore 'W is a web. 0 Proposition (4.3). Given a sequence (nk) of positive integers there is a sequence (Pk) of positive numbers such that for every Ak ElK, 0 ~ IAkl ~ Pk' and xk in C nlo n2..... nk' k = 1,2, ... , the series L;=h+1 Akxk converges in E to a point belonging to C nlo n2..... nh' h = 1, 2, ....
Proof. Given the sequence of positive integers (nk ) , let 1j and Pj be elements appearing in (4.6). We set
Pk
=
(r1 r2 ...k r )-1
(k = 1,2, ...).
It is immediate that (Pk) belongs to M. We take Ak in IK, a ~ IAkl ~ Pk' and x k in Cnl.n2..... nk' k = 1,2, .... We fix a positive integer h and we set
849
M. Valdivia / Closed Graph Theorems h+k+l
L
Yk =
j=h+l
(k = 1,2, ...) .
A·X· I I
We fix a positive integer q and we take the neighbourhood of the origin V in E pq• q • We find a positive integer r larger than q such that V contains U Pq.q.r' If we take s> m > r, one has Ys - Ym =
L
j=m+l
A.X. J J
E
" .LJ j=m+l
o,J C nt •... , n2•...• nj c " V, LJU Pq• q, 1 c U P ,q,e r j~m+1
q
and thus (Yk) is a sequence in E such that the double sequence (Yn - Yk) is finally contained and converges to the origin in E Pq.q' q E Q. We apply condition (8) to obtain that the series L7=h+l AjXj converges to a point z in E and h+k+l
L
(AjX)- Z
j~h+l
is finally contained and converges to the origin in each E pq• q • We fix now q in Q, 1 ~ q < h. One has h+k+l
L
h+k+! AjXj
E
j=k+l
L
j=h+l
U Pq.q.j C U Pq,q.h
,
and consequently h+k+l "L.. A·X· I I j=h+l
Since U Pq,q.j is closed in E Pq,q' j to C n).n2..... nh. 0
=
E C n). n2.....nh .
1,2, ... , it follows easily that
Z
belongs
Theorem (4.4). A locally convex space E has a strict web if and only if E is
a strict stowikowski space. Proof. It is an immediate consequence of Propositions (4.2) and (4.3) taking U Pq,q,j' j = 1,2, ... , absolutely convex in (4.5). 0
850
M. Valdivia / Closed Graph Theorems
5. On the Double Closed Graph Theorem Given the spaces E and F, a linear relation from E into F is a binary relation R whose graph, Graph(R) = {(x, y)/ xRy}, is a linear subspace of Ex F If x belongs to E and if A is a subset of E, we set R(x) = {y E Fj xRy}
and
R(A) = U {R(x)j x E A}.
The inverse relation of R is denoted by R-\ the domain of R is Dom(R) = R-1(F). The relation R is continuous if R-1(A) is an open subset of Dom(R) for every open subset A of F The relation R is open if R- 1 is continuous. Given the classes 'l: and :!Ii of spaces, we shall say that the pair {'l:, :!Ii} verifies the 'double closed graph theorem', if for every E in, 'l: and F in :!Ii, every linear relation R from E into F with closed graph and such that Dom(R) = E is continuous. In the case that R (x) consists of only one point one has a closed graph theorem, and if R- 1(y ) consists only of one point for every y in R(E), one obtains an open mapping theorem. Thus the double closed graph theorem seems to be more general than a closed graph theorem. Propositions (5.1) and (5.2) will show that this is not the case. Let 'l: be a class of (locally convex) spaces. We set 'l:d ('l:c) to denote the class of all the (locally convex) spaces such that {'l:, 'l:d} ({'l:, 'l:J) verifies the double closed graph theorem.
Proposition (5.1). If'l: is a class of (locally convex) spaces, F belongs to 'l:d ('l:J and G is closed subspace of F, then FIG belongs to 'l:d ('l:c)' Proof. Let R be a linear relation from a space E belonging to 'if} into FIG with Dom(R) = E and G(R) closed in E x (FIG). Let lp be the canonical mapping from F onto FIG. For every x in E and y in lp-l(R(x» we set xSy. The relation S from E into F is clearly linear, Dom(S) = E and Graph(S) is closed in Ex F Consequently, if A is an open subset of FIG one has that S-l(lp -1(A» = R-1(A) is an open subset of E. 0
M. Valdivia I Closed Graph 'Theorems
851
Proposition (S.2). Let 'l: and fli be classes of spaces satisfying the following conditions: (1). If FE fli and G is a closed subspace of F then FIG belongs to fli. (2). If E E 'l:, FE fli and T is a linear mapping with closed graph from E into F then T is continuous. Then if E E 'l:, FE fli and R is a linear relation from E into F, such that Dom(R) = E and Graph(R) is closed in F x F, R is continuous. Proof. We set {OJ x R(O) = H. It is immediate that R(O) is closed in F and H is a subspace of Graph(R) closed in F x F. Let 'P be the canonical mapping from F onto FIR(O). We define now a linear mapping T from E into FIR (0) in the following way: if x belongs to E, we take y in R(x) and set T(x) = 'P(y). Obviously, T is well defined and linear. Let l/J be the canonical mapping from Ex F onto (E x F)IH = Ex (FIR(O». The graph of Tin Ex (FIR(O» coincides with Graph(R)IH which is closed in Ex (FIR(O». Consequently, T is continuous. If A is an open subset of E, one has R-1(A) = r1('P(A» is an open subset of E and therefore R is continuous. 0 In [4] Raikov states that a space F satisfies the double closed graph theorem if one has next result: let E be an inductive limit of a family {E; liE I} of metrizable Baire spaces. If R is a linear relation from E into F with Graph(R) closed in Ex F and Dom(R) n E; of second category in E, for every i E I, then R is continuous and Dom(R) = E. We shall see in Proposition (5.3) that this result is equivalent to a closed graph theorem. Let 'l: be a class of spaces. We set 'l:, to denote the class of all spaces satisfying the following condition: if E E 'l:, FE 'l:, and T is a linear mapping with closed graph from E into F, then T is continuous. We say that 'l: is normal if for every E E 'l: each subspace of the completion E of E containing E also belongs to 'l:. The next result can be found in [7, p. 106] in the case of locally convex spaces. Slight modifications in the proof given there cover the more general case: (5.1)
Let E be a normal class of spaces. Let G be a dense subspace of E. Let F be an element of 'l:, and let T be a linear mapping from G into F with closed graph in E x F. If G belongs to 'l:, then G coincides with E.
852
M. Valdivia / Closed Graph Theorems
Given a class of spaces ~, let ~m be the subclass of ~, such that E belongs to ~m if and only if ElF is in ~, for every closed subspace F of E. It is easy to see that ~m is the largest subclass of ~, stable with respect to separated quotients.
Proposition (5.3). Let ~ be a normal class of spaces. Let E be an inductive limit of a family of spaces {E j liE I}. Let F be an element of ~m and let R be a linear relation from E into F such that Dom(R) n E, is dense in E; and belongs to ~ for every i E I. If Graph(R) n (E; x F) is closed in E; x F, for every i E I, then R is continuous and Dom(R) = E. Proof. Obviously we can suppose that Dom(R) is dense in E, Dom(R) belongs to ~ and Graph(R) is closed in Ex F. We proceed as in the proof of Proposition (5.2) but now T is a linear mapping from Dom(R) into F/R(O) whose graph is closed in Ex (FIR(O)). Applying (5.1), Dom(R) = E and T is continuous, whence R is continuous. 0
Remark (5.4). Raikov proves the following proposition in[4]: if the space F belongs to the class ~o then F satisfies the double closed graph theorem. Since the class of Siowikowski spaces is stable with respect to separated quotient and the class of metrizable Baire spaces is normal, to get this statement, according to Proposition (5.3) and Theorem (3.8), it is enough to prove: let E be a metrizable Baire space. Let F be a Siowikowski space. If T is a linear mapping with closed graph from E into F, then T is continuous. The proof of this result can be obtained replacing 'G is a Banach space' by 'G is a metrizable Baire space' in Theorem (2.1), which is possible modifying the proof given by Siowikowski in [5]. We conclude that [4] is closed related to [5]. Remark (5.5). The following propositions can be found in [1] and [2]: let E and F be locally convex spaces. Let R be a linear relation from E into F. If Dom(R) is of second category in E, F has a C6'-web and Graph(R) is closed in Ex F, then R is continuous. By (5.1), Proposition (5.3) and since every separated quotient of a space with C6'-web has a C6'-web [1], one has that it is a consequence of following [1]: let E and F be locally convex spaces. Let T be a linear mapping with closed graph from E into F. If E is a Baire space and F has a C6'-web, then T is continuous.
M. Valdivia I Closed Graph Theorems
853
If E is a locally convex space, we set [J}J (E) to denote the family of all
the bounded absolutely convex subsets B of E such that E B is a Banach space. Given the spaces E and F and a linear relation R from E into F, we say that R is fast sequentially closed in E X F if for every A in [J}J (E) and B in [J}J(F), Graph(R) intersects E A X E B on a closed subset. The concept of fast sequentially closed mapping can be found implicitly in Theorem (2.1), and explicitly in [2]. Let fF. be the class of all locally convex spaces such that if FE fF., E is a Frechet space and T is a linear mapping from a subspace G of F into E with fast sequentially closed graph and T( G) is of second category in E, then T(G) = E and T is open. Proposition (5.6). Let E be a locally convex space, inductive limit of a family of Frechet spaces {E i liE f}. Let F be an element of [lFs and let R be a linear relation from E into F such that Dom(R) n E, is of second category in E j , for every i E I, then R is continuous and Dom(R) = E. Proof. Obviously we can suppose E,is a Frechet space, Dom(R) of second category in E and R with fast sequentially closed graph in Ex F. We set H = R-l(O) X {OJ. It is immediate that R-l(O) is closed in E and H is a subspace of Graph(R) closed in Ex F. Let tp be the canonical mapping from E onto E/R-l(O). We define now a linear mapping T from R(Dom(R» onto E/R-l(O) in the following way: if x belongs to R(Dom(R» we take yin R-l(x) and set T(x) = q;(y). Obviously T is well defined and linear. Let '" be the canonical mapping from Ex F onto (E X F)/H = (E/R-l(O» X F. The graph of T in (E/R- 1(O» X F coincides with Graph(R)/H which is fast sequentially closed in (E/R-l(O» x F. T(Dom(R» = Dom(R)/R-l(O) is of second category in E/R-l(O). Consequently Dom(R)/R-l(O) coincides with E/R- 1(O) and T is continuous, hence Dom(R) = E and R is continuous. 0
Remark (5.7). The following can be found in [2]: let E be a Frechet space. Let F be a locally convex space with a ~-web. Let R be a linear mapping from F into F such that Dom(R) is of second category in E and the graph of R in Ex F is fast sequentially closed. Then R is continuous and Dom(R) = E.
By Proposition (5.6) to prove this result it is enough to prove the following: let F be a locally convex space with 'e-web. Let E be a Frechet
M. Valdivia / Closed Graph Theorems
854
space. Let T be a linear mapping from a subspace G of F into E. If T(G) is of second category in E and the graph of T is fast sequentially closed in Ex F, then T(G) = E and T is open.
6. The Class
~,
In this section P, Q, M, Ep.q and E(p) have the same meaning as in Section 2. We say that a space E has a ?};crepresentation (P, 0, M, E p• q ) if conditions (a) and (f3) are satisfied as well as the following condition: (77). Given any (Pq) in M, if the sequence (xn ) of E is such that the double sequence (x, - xm ) is finally contained and converges to the origin in each E pq• q , then (x n ) converges in E. We define ~, as the class of spaces having a ~,-representation. The class ~, is stable with respect to the following operations: countable topological direct sums, continuous linear mappings, closed subspaces and countable topological products. Obviously, .@O is contained in .@,. A subset A of a linear space E is linearly regular in E if it is a circled neighbourhood of the origin for some vector topology on E. We say that a web (C6' -web) "W = C nlo n2..... nk is regular if each subset Cnl.n2•...• nk is linearly regular in its linear hull. In the next proposition we suppose that the spaces E has a regular C6'-seb "W= Cnlon2 ..... nk. We set here again Q = Nand P = N\ k E Q. Let M be the subset of pO such that (Uk) is in M if and only if there is a sequence (nk) in Q such that Uk = (n" n 2 , ••• , nk) (k = 1,2, ...). Since the web "W is regular we can take in the linear hull E (nlo n2..... nk l. k of C nlo n2..... nk a sequence (C~lo n2..... nk) of absorbing circled subset such that
and we suppose that E(nl. n2 ..... nk).k is endowed with the pseudo-metrizable linear topology defined by the fundamental system of neighbourhoods of the origin
Proposition (6.1). (P, Q, M, Ep.q) is a .@crepresentation of E.
M. Valdivia / Closed Graph Theorems
855
Proof. The proofs that (P, Q, M, Ep.q) satisfies conditions (a) and (f3) are the same as given in Proposition (4.1). We take now (ud in M with Uk = (n l , n2, ••• , nk ) (k = 1,2, ...), and a sequence (x k ) in E such that the double sequence (x n - x m ) is finally contained and converges to the origin in Enk,k for each k in Q. We find a sequence (Pk) of positive numbers such that if A k E~, 0 ~ IAkl ~ Pk and Zk E C nlo ... ,n2, ... ,nk (k = 1,2, ...), the series L AkZk converges in E. We can find a subsequence (x Pk) of (xk ) such that (k
=
1,2, ...).
Then the series L (x,k - x p k+1 ) converges to a point Z in E. If x = x p 1 - Z, the sequence (Xp) converges to x in E. Proceeding as we did in Proposition (4.1) we conclude that (xk ) converges to x in E. 0 In two following propositions we suppose that (P, 0, M, E p,q) is a 9)1-representation of a space E. We suppose without loss of generality that Q is the set of positive integers. If Pq = g(q) for some gin. M, q E Q, we take a fundamental system of circled neighbourhoods of the origin in Epq,q:
(6.1) and we construct from (6.1) the sets Proposition (6.2). The family W
C nl• n2.... ,nk
= {Cnl.n2..... nk}
as in (4.6). is a regular ri-web.
Proof. It is analogous, and even simpler, to the one of Proposition (4.1). 0 Theorem (6.3). A space E belongs to the class 9)1 if and only if it admits a regular ri -web.
Proof. Apply Propositions (6.1) and (6.2). 0 Open Problem (6.4). By Theorem (6.3) every Sfowikowski space has a ri -web. We do not know if there is a space with a ri -web which is not a Sfowikowski space.
856
M. Valdivia / Closed Graph Theorems
7. Stowikowski Spaces without Convex
~-Webs
We say that a locally convex space E is conoex-Baire if given any sequence (An) of closed convex subset of E covering E there is a positive integer q such that A q has non-empty interior [7, p. 22]. We take 0< p < 1. Let Ip be the linear space of all sequences (an) in IK with L lanl P < 00 endowed with its ordinary topology. We set Ap to denote Ip endowed with the topology induced on it by II' In [7, p. 281-287] the following result can be found: (a). The space Ap is convex-Baire. Let (An) be a completing sequence in a locally convex space E, i.e. E:J An :J An+!' An ¥ 0 (n = 1,2, ...), and there exists a sequence of positive numbers (Pn) such that if An E IK, a ~ IAnl ~ Pn, and xn E An (n = 1,2, ...), then the series L AnXn converges in E. In [6] the following result can be found: if for every positive integer n, then closure of An in E has non-empty interior, then E is a Frechet space. Proposition (7.1). Ap is a Siowikowski space. Proof. The canonical injection from Ip into Ap is continuous. Since Ip is an F-space the conclusion follows. 0 Proposition (7.2). The space Ap does not have a convex
~-web.
Proof. Suppose that there is a convex ~-web W= {Cnbn2 ..... nk} in Ap • We set Anl.n2..... nk to denote the closure of Cnbn2..... nk. By result (a) above the elements of the family Anlon2..... nt having empty interior cannot cover Ap and therefore there is an element x in Ap which is not contained in the union of these sets. By the very definition of ~ -web we can find a sequence (m k ) of the positive integers such that (k
= 1,2, ...).
Obviously the sequence (Cmlom2..... mt);=I is completing and the interior of A nl.n2,···.1I1: is non-empty (k = 1,2, ...). Applying the above mentioned result of [6] we obtain that Ap is a Frechet space, which is impossible Ap being a proper dense subspace of It. 0
M. Valdivia / Closed Graph Theorems
857
References [1] M. de Wilde, Reseaux dans les espaces lineaires a serni-normes, Mem. Soc. R. Sci. Liege 2 (1969). [2] M. de Wilde, Closed graph theorems and webbed spaces (Pitman, London, 1978). [3] A. Grothendieck, Produits tensoriels topologiques et espaces nucleaires, Mem. Amer. Math. Soc. 16 (1955). [4] D.A. Raikov, Double closed graph theorem for topological linear spaces, Siberian Math. J. (Translated from Russian) 2 (1966) 287-300. [5] W. SYowikowski, On continuity of inverse operators, Bull. Amer. Math. Soc. 5 (1961) 498-500. [6] M. Valdivia, Localization Theorems in Webbed Spaces, Semesterbericht Funktionalanalysis, Tiibingen (1982) 49-57. [7] M. Valdivia, Topics in LocalIy Convex Spaces (North-Holland, Amsterdam, 1982).
1.A. BARROSO editor, Aspects of Mathematics and its Applications © Elsevier Science Publishers B.V. (1986)
859
HYPERBOLIC DOMAINS IN BANACH SPACES AND BANACH ALGEBRAS
Edoardo VESENTINI Scuola Normale Superiore, 56100 Pisa, Italy To Leopolda Nachbin on his sixtieth birthday
o.
Introduction
Any bounded domain in a complex Banach space is hyperbolic, i.e. the Kobayashi distance of the domain defines the relative topology [2, p. 92]. This result can be extended to bounded domains in a complex, locally convex Hausdorff vector space either directly [6, p. 496] or as a consequence of the fact that any locally convex, locally bounded Hausdorff vector space is normable. The question arises whether the existence of a hyperbolic domain in a complex, locally convex Hausdorff vector space imposes restrictions on the space itself. The question is answered in this paper showing that such a space is necessarily normable. 'How large' can a hyperbolic domain be? A fundamental result in the theory of hyperbolic domains was the discovery, by S. Kobayashi, that the complex plane C minus two points is hyperbolic: actually the largest hyperbolic domain in Co This fact suggests, as an example of a 'large' domain, a connected component D of the set of all points in a complex Banach algebra whose spectra are disjoint from two fixed distinct points. It will be shown that D is hyperbolic if, and only if, the Banach algebra is a function algebra on a locally compact Hausdorff space.
1. Hyperbolic Domains in LocaUy Convex Spaces For a domain D in a complex, locally convex Hausdorff vector space 'If, K D will denote the Kobayashi pseudo-distance and the Kobayashi differential pseudo-metric on D [6]. If D is the open unit disc B(O) = L1 in C, k,j is the Poincare distance w in .1 (ct. e.g. [2]). k D and
E. Vesentini / Hyperbolic Domains in Banach Spaces and Banach Algebras
860
For X oED and r> 0, B k D (x o, r) will denote the open ball with center X o and radius r for the Kobayashi pseudo-distance. If p is a continuous seminorm on 'l:, for X o E 'l: and r > 0, B/xo, r) will stand for the open ball with center Xo and radius r. The following proposition extends theorem 2 of [7]. Proposition (1.1). Let D be a hyperbolic domain in 'l:. For X o E D, let p be a continuous seminorm on 'l: such that B/xo, R) CD for some R > O. Let s > 0 be such that B k/XO' s) C Bp(x o, R). For every r with 0 < r < s there exists a constant c > 0 such that KD(X,
v)::;: cp(v)
for all v E 'l: and all x E B k D (x, r). Proof. If the conclusion is false there exist two sequences {x.} and {vv} (v = 1,2, ...) of points Xv E BkD(XO' r), Vv E 'l:, such that p(v.) = 1 and lim
v....""
Assume
KD(X v '
vJ < 1 for v
=
KD(X v '
vJ =
o.
1,2, ... , and let e v be such that
and lim .....ee e; = O. There exist a ~v E ~(.1, D) (the set of all holomorphic maps from .1 into D) and T" E C such that ~v(O) = x"'
Thus (1.1)
Let 0" = e ~2. Because
I
2u
~ ~(O) = 2~Ov
o
I
2u
e -i8 ~v(Ov e") dO =
2~Ov
0
e -i8(~.(ov e") -
~v(O»
dO,
E. Vesentini / Hyperbolic Domains in Banach Spaces and Banach Algebras
861
we have
and (1.1) yields
Since lim,,-oo e: 1/2 = +00, there exist a Vo > 0 and, for all u > vo, a gv E C, Igvl:os; 8v' such that p(CPv(gJ - xJ > 2R whenever v > V o. Thus
for all u > V o .
(1.2) On the other hand,
1 1 + 8v :OS;210g--.
1-
s,
Hence
which contradicts (1.2). 0 Proposition (1.1) (or theorem 2 of [7]) is the basic tool in the proof of Theorem (1.2). Let D be a non-empty domain in ~. If D is hyperbolic, then ~
is equivalent to a normed space.
D. There is no restriction in assuming X o = O. Since D is open, there exists a continuous seminorm Po such that B Po(O, 1) C D. For any x E Bpo(O, 1) Proof. Let
Xo E
862
E. Vesentini / Hyperbolic Domains in Banach Spaces and Banach Algebras
kD(O, x):s; k B Po (01)(0, x) = w(O, Po(x)). •
Since kD(O,x»O for all xED\{O}, we have Po(x»O for all xE Bpo(O, 1)\{0}, i.e. Po is a norm. For any continuous seminorm p on 'If, let p' be the continuous seminorm defined by p'(y) = Po(y) + p(y). Then Bp'(O, I)C Bpo(O, 1), and by Proposition (1.1) there is a positive constant c such that for all y E 'If. Since, on the other hand,
we see that Po(y);;;. cp'(y) = c(Po(y) + p(y»;;;. cp(y) ,
for all y E 'If, which shows that all continuous seminorms are bounded on Bpo(O, 1). 0
Remark (1.3). Completeness of 'If as a normed space is equivalent to sequential completeness of 'If as a locally convex Hausdorff space. For the remainder of the present section 'If will be a complex Banach space. For X oE 'If and R > 0, B(xo, R) will indicate the open ball with center X o and radius R for the norm distance on 'If. Let D be a hyperbolic domain in 'If. For X o E D, with the same notations as in Proposition (1.1), let S > be such that
°
Let x, y be two points of the closure clos B(xo, S) of B(xo, S), and let I: [0,1] -+ D be a continuous piecewise ~1 map such that 1(0) = x, 1(1) = y. For O:s; t1 < t2 :s; 1
'I
E. Vesentini / Hyperbolic Domains in Banach Spaces and Banach Algebras
863
Recall that KD is upper semicontinuous (ct. e.g. [2, Prop. V.2.6]). If 1([0, l])CB kD (xo,r), then, by Proposition (Ll),
JKD(f(t), ret)) dt ~ cJIIr(t)11 dt ~ ell y - xII .
(1.3)
o
o
If 1([0, 1]) It B k D (x o, r) there exist t l , t2 , with 0 < t l < t2 < 1, such that:
1([0, tl]) U 1([t2 , 1]) C B(xo, 3S), I(~) E
B(xo, 3S)\clos B(xo, 2S)
(j = 1,2).
Thus, by Proposition (Ll), /1
/1
JKD(f(t), ret)) dt ~ c JIIr(t)IIdt ~ cll/(t
l) -
xii> cS,
o
and similarly
JKD(f(t), ret)) dt > cS. /2
Since Ily - xli:::;; 2S, (1.3) holds then for any continuous piecewise ~I map 1:[0, l]~D such that 1(0)= x, 1(1)= y. Royden's theorem (ct. e.g. [2, Th. VA.1]) yields (1.4)
kD(x, y)~ cllx -
yll·
The Kobayashi distance k D being continuous, (1.4) implies Theorem (1.4). II D is a hyperbolic domain, then lor every X o ED there is a positive number S such that clos B(xo, S) C D and such that the Kobayashi distance k D is equivalent to the norm distance on clos Btx.; S).
At this point, the same argument devised by J.-P. Vigue in [9, p. 279] (cf. e.g. [2, Th. IV.2.6]) yields
864
E. Vesentini / Hyperbolic Domains in Banach Spaces and Banach Algebras
Theorem (1.5). If D is a hyperbolic domain and if the group of all holomorphic automorphisms of D acts transitively on D, the Kobayashi distance k D is complete. Since bounded domains are hyperbolic, Theorem (1.5) extends a theorem established by J.-P. Vigue for the Caratheodory distance on bounded domains. Let D be hyperbolic and let f be a holomorphic map of D into D having a fixed point X o ED. Let R > 0 be such that B(xo, R) C D. By Proposition (1.1) there is a positive constant c such that for all y E 'C . Thus c/ldf(xo)yll ~ KD(XO' df(xo)y) ~ KD(X O' y) ~
KB(xo,R)(X O' y) =
R-11IYII,
for all y E 'C. Hence
As a consequence, the spectral radius p(df(xo)) of df(x o) satisfies the inequality
proving thereby Proposition (1.6). If D is hyperbolic and if the holomorphic map f: D ~ D fixes a point X o E D, the spectrum of df(xo) is contained in the closure of the unit disc. The above proposition was established in [2, d. Lemma IV.2.5] for bounded domains.
2. Hyperbolic Domains in Banach Algebras In the following d will be a complex Banach algebra. For xEd, p(x)
E. Vesentini I Hyperbolic Domains in Banach Spaces and Banach Algebras
865
and Sp x will denote the spectral radius and the spectrum of x, respectively. Lemma (2.1). Let E be a hyperbolic domain in C and let D be a domain in stJ such that Sp x C E for all xED. If stJ is abelian and contains no nontrivial quasi-nilpotent elements, then k D is a distance. Proof. If Xl :;t. X 2 are two points of D, there is a character X of stJ such that X(x l ) :;t. X(x2) . Since X(x) E Sp x C E for all xED, X maps D holomorphically into E. Hence
Remark (2.2). By T.J. Barth's theorem (ct. e.g. [2, Prop. IV.2.3]) the assumption on E is equivalent to kE being a distance. The same argument as before shows that the Caratheodory pseudo-distance on D is a distance, provided that the Caratheodory pseudo-distance on E is a distance. Let K be a non-empty compact set in C\{O}, and let D(K) be the connected component of 0 in the open set {x E .911 Sp x n K = 0}. When is the pseudo-distance kD(K) a distance? Consider first the case in which K consists of one point a :;t. O. Lemma (2.3). The pseudo-distance k D({a}) is not a distance. Proof. Recall that, if .91 has no identity Sp x is the spectrum of x in the unital Banach algebra .91 xC. Hence stl will be assumed to have an identity 1. For any ~ E C, the affine function
maps C\{a} biholomorphically onto itself. For x E stl define f,(x) = e'(x - a 1) + a 1. For x E D({a}) the function ~ ~ f, (x) maps C holomorphically into stJ, and
Since fo(x)
=
x E D({a}) and a
Ii Sp f,(x) for all
~E
C, the image of C
866
E. Vesentini / Hyperbolic Domains in Banach Spaces and Banach Algebras
by the map
for all
~E
~~ f{(x)
is contained in D({a}). Hence
C. If k D({a}) is a distance, then e'(x-a1)+a1=x,
and therefore x
=
a 1, contradicting the fact that a
~
Sp x. 0
°
To investigate D(K) it will be useful to consider for any r > balanced neighbourhood of defined by C(r) = {x E
°the open
.911 p(x) < r}.
It was shown in [8] that (2.1)
(2.2)
kqr)(O,x)O::;;w(O,r-1p(x»
Kqr)(O, y)
0::;;
r-1p(y)
for all xEC(r),
for all y E .91 .
Inequality (2.1) implies that, if k qr) is a distance, .91 contains no nontrivial quasi-nilpotent elements. Such nontrivial elements always exist if .91 belongs to anyone of the following classes: (1). The algebra of all bounded linear operators on a complex Banach space of dimension greater than one [5, p. 278]. (2). Non-commutative finite-dimensional algebras [1, p. 44]. (3). Non-commutative algebraic Banach algebras [1, p. 44]. Hence the pseudo-distance k qr) is not a distance if, for example, .91 is one of the above mentioned non-commutative Banach algebras. Let a 'i- b be two points of C\{O}, and consider the domain D({a, b}). For 0< r < minl]«], Ibn, C(r) C D({a, b}) and therefore
for all x, y in C(r). Hence the pseudo-distance k D({a.b}) is not a distance if .91 belongs to anyone of the above mentioned classes of non-commutative Banach algebras.
E. Vesentini / Hyperbolic Domains in Banach Spaces and Banach Algebras
867
Since
and only if, d contains no nontrivial quasi-nilpotent elements. The condition that D({a, b}) be hyperbolic imposes heavier restrictions on d. Lemma (2.5). If there exist an r > K
°and a c > °such that
C(riO, v) ~ cllvll
for all v Ed,
then d is norm equivalent to a function algebra. Proof. Inequality (2.2) implies that
(2.3)
p(v)~
crllvll
for all v Ed.
If d is unital, d is abelian in view of a theorem of Le Page [1, p. 43]. If d is not unital, consider the algebra dO = d x
II(v, A)II = Ilvll+ IAI
(x E d, A E
obtained by adding the unit (0, 1) to d. Since (d. e.g. [1, p. 3]) ~(p(v) + lAD ~ p«v,
A»,
we have (2.4)
p«v,
A»
~ bll(v,
A)II,
with b = ~min{cr, I}. Again Le Page's theorem implies that sd", and therefore also d, is abelian. Let M and M O be the maximal ideal spaces of d and dO endowed with the Gelfand topology. For v E d, let fj be the Gelfand transform of v. Since by (2.3) and (2.4)
11811M ~ crllvll ~ crll811M ,
1I(0)IIMo~ bll(v, A)II ~ bll(V,A)IIM
o ,
868
E. Vesentini I Hyperbolic Domains in Banach Spaces and Banach Algebras
stl and stl° are norm equivalent to function algebras on the compact Hausdorff spaces M and MO (i.e. uniformly closed algebras of continuous functions containing the constants and separating the points of M and MO). Thus, stl-if not unital-is norm equivalent to a function algebra on a locally compact Hausdorff space. 0 As a consequence of Lemma (2.5) the following theorem holds: Theorem (2.6). If D({a, b}) is hyperbolic, the Banach algebra stl is norm equivalent to a function algebra on a locally compact Hausdorff space (which is compact if stl is unital).
Hyperbolicity characterizes function algebras, as will now be shown. Theorem (2.7). If stl is a function algebra on a compact Hausdorff space T then D({a, b}) is complete hyperbolic. Proof. (1). Since stl (contains the constants and therefore) is unital, and since the affine function x ~ (a - b fl(X - bI) maps D({a, b}) bi-uniquely onto D({O, I}) (the connected component of (C \{O, I})l in the open set {x E stll Sp x n {a, I} = 0}), there is no restriction in assuming a = 0, b = 1. Denote C\{O, I} by V, and let XoE D({O, I}), 8> 0. The image xo(T) being compact in V; Barth's theorem and an elementary compactness argument shows that there is a > such that
°
for all t E T. For any t E T the function x ~ x(t) maps D({O, I}) holomorphically into V. Thus, for x E B k D(lO. I}) (x o, a),
and therefore
Ix(t) - xo(t)/ <
E
for all t E T.
Hence IIx - xoli T :os;; 8 < 28, and we conclude that
E. Vesentini / Hyperbolic Domains in Banach Spaces and Banach Algebras
869
which shows that the domain D({O, I}) is hyperbolic. (2). Let {xJ (z; E D({O, I}), v = 1,2, ...) be a Cauchy sequence for k D({O. I})' Because
°
for all t E T, v;;;. 1, JL;;;' 1, for any E > there is a v o;;;' 1 such that kv(xv(t), xI' (r) < E for all JL;;;' vo, v;;;. vo, and all t E T. Since k v is a complete distance, there is a function x o: T ~ V defined by xo(t) = Iim.-. xv(t) (t E T). Being lim .....ee kv(xo(t), xv(t» = 0 uniformly for t E T, and k v being equivalent to the euclidean distance on all compact subsets of V, X o is continuous. Let K be a compact neighbourhood of xo(T) in V. There is a VI such that xv(T) C K for all v;;;. VI' The equivalence of k v and of the euclidean distance on K implies that the uniform norm Ilxo- xvii tends to as v ~ 0, showing that XoEd and that {xJ converges to X o in d. 0
°
Example (2.8). The space <en with its natural topology and coordinatewise product (gl"'" gn)('TJI' ... , 'TJn) = (gl'TJI' ... ,gn'TJn) is a function algebra on a discrete set consisting of n points. Thus, any point x E en with coordinates gl'" ., gn has spectrum Sp x = UI} U' .. U {gn}' For a E C and any a = 1, ... , n, let Pa(a) be the affine hyperplane
Then for any choice of a and b in C, a
~
b, the set D({a, b}) is given by
D({a, b}) = <en\(PI(a) U·· . U Pn(a) U PI(b) U··· U Pn(b».
By Theorem (2.7), D({a, b}) is complete hyperbolic, i.e. the complement of the 2n affine hyperplanes PI(a), ... , Pn(b) in en is complete hyperbolic (a result which follows easily from [3, Prop. 1.5]).
Remark (2.9). Inequalities (2.1) and (2.2) hold also when d is any complete locally multiplicatively-convex topological algebra over C, whose set of quasi-regular elements is open. As a consequence of this
870
E. Vesentini / Hyperbolic Domains in Banach Spaces and Banach Algebras
condition, according to lemma 5.2 of [8] the function. (2.5)
X~SpX
is upper semicontinuous on .sIl. As Enrico Casadio has kindly pointed out, the proof of the upper semicontinuity was carried out in [8, Lemma 5.2] only in the case in which the topological algebra .sIl satisfies the first countability axiom. However, the proof given in [8] can be adapted to the general case either by replacing sequences by generalized sequences (as was done by Enrico Casadio) or by the following direct argument. If the function (2.5) is not upper semicontinuous, there is an element X E .sIl and an open neighbourhood V of Sp Xo in C such that in every o neighbourhood V of Xo an element x can be found for which Sp x ~ V. The origin 0 being quasi-regular, there is a continuous seminorm Po on .sIl such that y E .sIl is quasi-regular whenever Po(y) < 1. Suppose first that for every integer N;;;. 1 there are xN E.sIl and AN E Sp xN n (C \ V) such that IAN I> Nand po(xN - x o) < 1. Because
we find that A -;..1 xN is quasi-regular provided that N}> O. This contradiction shows that there is some constant H > 0 such that Igi < H for all g ESp x and all x E V o= {yl Po(y) < I}. Let {U)jEJ be a fundamental system of neighbourhoods of 0 in V o' and let xj - X o E ~ be such that Sp xj V. For any j E J choose Aj E Sp xj n (C\ V). Because IAjl < H, the set {Aj}jEJ has an accumulation point AoE C\ V. If Ao = 0, then 0 ~ Sp x o' Therefore .sIl has an identity, 1, and X o is invertible. Let p be a continuous seminorm on .sIl such that y E.sIl is invertible whenever p(y - x o) < 1. Choose e > 0 in such a way that ep(1) < j and let j E J be such that IAj 1< E, ptx, - xo)< ~. Then
z
contradicting the fact that xj - Aj 1 is not invertible. If Ao of. 0, A01XO is quasi-regular. Let p be a continuous seminorm on .sIl such that A01 y is quasi-regular for all y E .sIl for which p(y - x o) < 1. Let (T > 0 be such that p(ry - x o) < 1 for all T E C and all y E .sIl satisfying the
E. Vesentini / Hyperbolic Domains in Banach Spaces and Banach Algebras
871
conditions IT - 11 < (T, p(Y - X o) <~. Choose e in such a way that 0 < e < IAol and such that IA - Aol < e implies lA-lAo - 11 < (T. If j E J is such that p(xj - x o) < ~ and IAj - Aol < e, then
and therefore A;I xj is quasi-regular. This contradiction completes the proof of the upper semicontinuity of the function (2.5) on d.
References [1) B. Aupetit, Proprietes speclrales des algebres de Banach, Lecture Notes in Math. 735 (Springer, Berlin, 1979). (2) T. Franzoni and E. Vesentini, Holomorphic maps and Invariant Distances, NorthHolland Math. Stud. 40 (North-Holland, Amsterdam, 1980). (3) S. Kobayashi, Hyperbolic manifolds and holomorphic maps (Marcel Dekker, New York, 1970). [4] S. Kobayashi, Intrinsic distances, measures and geometric function theory, Bull. AIDer. Math. Soc. 82 (1976) 357-416. [5] C.E. Rickart, General theory of Banach algebras (Van Nostrand, Princeton, 1960). [6] E. Vesentini, Invariant distances and invariant differential metrics in locally convex spaces, In: Spectral theory, Banach Center Publications 8 (Banach Centr. Publ., Warsaw, 1982) 493-512. [7] E. Vesentini, Complex geodesics, Composition Math. 44 (1981) 375-394. [8] E. Vesentini, Caratheodory distances and Banach algebras, Adv. in Math. 47 (1983) 50-73. [9] J."P. Vigue, Le groupe des automorphismes d'un domaine borne d'un espace de Banach. Applications aux domaines bornes syrnetriques, Ann. Sci. Be. Norm. Sup. (4) 9 (1976) 203-282.
J.A. BARROSO editor, Aspects of Mathematics and its Applications © Elsevier Science Publishers B.V. (1986)
873
THE CATEGORY OF QUOTIENT BORNOLOGICAL SPACES L. WAELBROECK Departement de Matnematique, Unioersite Libre de Bruxelles, 1050 Bruxelles, Belgium
In homage of Leopoldo's sixtieth birthday
O. Introduction The reader will find in this paper an abelian category. The objects are formal quotients of b-spaces, A b-space (E,~) is a complete convex, separated, bornological space (H. Hogbe-Nlend writes an ebccs-space), A b-subspace (F, ~F) of (E,~) is a vector subspace, with a b-structure ~F such that ~F ~ ~ (i.e. B E ~ if B E ~F' some mathematicians inverse the order, write for example ~F > ~ when ~F ~ ~ !). We study (E,~) modulo (F, ~F)' and write (E, ~)I(F, ~F)' where I must be read 'modulo'. And it is easier to write ElF rather than (E, ~)I (F, ~F)' It is a long time ago that I stumbled on q-algebras [15]. I studied elements of a commutative b-algebra d, i.e. a b-space on which a commutative associative multiplication is defined, and B, . B 2 is bounded if B I and B 2 are bounded. In one proof, I had to introduce a b-ideal (a, ~a)' where (a, ~a) is a b-subspace of (sd, ~), which is an ideal of sd, and B I . B 2 E ~a if B. E ~, B 2 E ~a' It turns out that all further proofs of that memoir could be generalized. The aim was to construct a holomorphic functional calculus. The following is a consequence of the generalization. Assume that (a l , ••• , an) and (a;, ... , a~) are two tuples of sd and f a holomorphic function such that f(a l , ••• , an) and f(a;, ... , a~) have been constructed. Then f(a l , ••• , an)- f(a;, ... , a~) belongs to the ideal generated by the a, - a;. This ideal is not always closed. If you wish, in the proof, I considered a q-algebra sd Ia. A couple of years later (1962), I had constructed the category of q-spaces, had proved that the category is abelian and had placed a q-structure on the spaces of morphisms, defined thus multilinear algebra in the category. The notes were long, more than 100 pages, and did not contain much mathematics. The results are true, and applicable to the
874
L. Waelbroeck I Category of Quotient Bomological Spaces
memoir quoted above. But, at that period, they were not applicable elsewhere. The notes were not sent to publication. These notes have been given to G. Noel. He considered b-spaces, and what I call bo-spaces, i.e. separated, convex bornological spaces. He gave another definition of the q-spaces, gave a definition of the qo-spaces (quotients of bo-spaces). It is known that a bo-space cannot always be embedded in a b-space ([6], [16]). Noel shows that a bo-space can be embedded in a q-space, more generally, a qo-space ElF can be embedded in a q-space if F is a b-space. He constructs tensor products of q-spaces, and of qo-spaces. But he shows that E I 0 q E z is not always a b-space though E I and E z are b-spaces. For example, if E I is a reflexive Banach space, then a Banach space E z exists such that E I 0 q E z is not isomorphic to any Banach space. His results are published [9], [10], [11], [12]. One remark, assuming that the approximation conjecture would turn out affirmatively, he introduced 'ultraflat' spaces. Now, I have proved that all flat spaces are ultraflat in his sense, but I cannot prove that all ultraflat spaces are flat [22]. My definition of q-spaces is announced in the Cornptes-Rendus ([18], [19], [20], [21]). Later, I observed that the category of quotient Banach spaces is a subcategory of that of vector spaces. The definitions are now published [23], [24]. But applications are more important than definitions. Unfortunately, there is more published about the definitions than about the applications. The original memoir [15] contains of course applications. Just translate the propositions in quotient language. Their proofs are not difficult any more, and the statements are probably easier to understand. J.-P. Ferrier [3], [4] and 1. Cnop [1] considered holomorphic functions having growth properties at the boundary of the domain. These functions were those I considered in the holomorphic functional calculus. And the results apply to the quotient of a b-algebra modulo an ideal that is not assumed closed. Another application is now published [26], [27]. Let d be a Banach algebra, commutative and associative, with a unit. Let a be an ideal whose hull X = XI U X z is disconnected, the X; are thus compact and disjoint. Then two ideals ai' a z exist such that a = a l n a z where Hull a i = X;. (The ideal is not assumed to be closed.) J.L. Taylor [13], [14] did not observe that formal quotients exist. His proofs are difficult because the category of Banach spaces is not abelian, and he needs homological algebra. He sketches one proof, but does not
L. Waelbroeck / Category of Quotient Bomological Spaces
875
use it, because the homology space he meets is not a Banach space. It is a quotient Banach space, the remainder of Taylor's proof holds even if the space is a quotient Banach space. His sketched proof is valid [25]. I do not follow Taylor's construction of his 'Cauchy-Weil' formula. ('Cauchy-Taylor' seems more reasonable.) Fortunately, R. Levi (Sofia) [8] has constructed the Cauchy-Taylor formula by diagram-chasing. I can now construct Taylor's operational calculus, involving (a" . . . , an) commuting operators on a quotient Banach space. (The result is not yet published.) J.W. Helton and R.E. Howe [5] consider operators T on a Hilbert space H, which are normal modulo 7,: 7, is the ideal of operators with a finite trace; T is normal modulo 7, if T* T - IT* E 7,. They construct f(T) mod 7" where f is of class cg'" on a neighbourhood U of Sp, T, the essential spectrum of T. In other words, they construct a continuous linear mapping s : cg"'( U) ~ .2(H) such that s(f· g) - s(f) . s(g) E 7, when f, g are of class cg"', s(z) = T, s(£) = T* when z is the imbedding U ~ C, and £ its conjugate, and such that s(1) = Id if 1 is the constant 1, and Id the identity operator on H. The reader feels, like the author, that this is a quotient problem. Finally, I must mention J.F. Colombeau. He has constructed an algebra ~(IJ) of 'generalized functions', with
Both ~M(~(IJ)) and .H(IJ) have natural bornologies. ~M(~(IJ)) is a b-algebra, .H(IJ) is a b-ideal. And ~(IJ) is a q-algebra. P.M. Dirac wrote a set of axioms, which seems useful in quantum mechanics. L. Schwartz has proved that, along with a few postulates necessary in the development of the theory, Dirac's axioms are contradictory. Colombeau has amended Schwartz's postulates, and constructed a model of Dirac's axioms. In his model 'renormalization', for example, makes sense. (For this author, (5 + 00) - 00 = 5 is not correct.) Colombeau is sending a paper to this book, I send the reader to the paper and to the bibliography [2].
1. Bornological Language Several mathematicians have published papers about bornological
876
L. Waelbroeck / Category of Quotient Bornological Spaces
spaces. This author's language is different from that of many other authors in the field.
Definition (1.1). A b-space (E, gJ), or by abuse of language E, is a complex vector space that is a convex, complete, separated bornological space, i.e. with a bounded structure gJ, with U BEll B = E, with B E gJ if B ~ B I U n, or B ~ LiBI + LiBz whenever B\ E gJ, B z E gJ, Li being the unit disc in C, without any non-null vector subspace of E belonging to gJ, and in which for every B E gJ there exists a E\ E gJ with B ~ B\, B\ absolutely convex, and E BI' the subspace absorbed by B\, with the norm-gauge associated to E), is a Banach space. In this paper, [!lJ will be called a complete boundedness, or a complete bornology on E. The elements of gJ will be called 'bounded', or when several boundednesses crop up in the problem, 'bounded in E'.
Definition (1.2). A b-subspace (F, gJF) of (E, gJ), or more simply F, b-subspace of E, is a complex vector subspace of E, with a boundedness gJF' such that gJF ~ gJ. Elements of gJF are 'bounded in F'. Let F) and F z be two b-subspaces of E. Then F) + F z is a b-subspace of E, its elements are the z = x + y with x E F), y E F z, its bounded subsets are the B such that B ~ E) + E z' with B) bounded in F) and B z bounded in F z. And F) n F z is a b-subspace of E, its elements are the z which belong both to F\ and to F z' and B is bounded in F\ n F z if B is bounded in F I and in F z.
Definition (1.3). Let E I and E z be two b-spaces. A bounded linear mapping u : E) ~ E z is a linear mapping, which maps every bounded subset of E\ into a bounded subset of E z. This set of bounded linear mappings is called b(E), E z). b(E) , E z) is obviously a vector space. We place on b(E), E z) the equibounded boundedness, B ~ b(E), E z) is bounded if U fEB fBI is bounded in E z whenever B) is bounded in E I • The range, the image, uE I of u : E I ~ E z is the set {uxl x E E)} with the range boundedness {uBI B E gJ EJ This is a b-subspace of E z. More generally, if F\ is a b-subspace of E I , uFI is a b-subspace of E z, its elements are the ux, x E F I and its bounded subsets are the uBI' B) bounded in F\.
L. Waelbroeck / Category of Quotient Bomological Spaces
877
Let F2 be a b-subspace of E 2, and u: E I ~ E 2 a bounded linear mapping. Then u- I F2 has as elements the x E E I with ux E F2 , and as bounded subsets the bounded subsets B of E I such that uB is bounded in F2•
Definition (1.4). The mapping u: E I ~ B 2 is bomologically surjective if uBI = B 2 as a b-space, i.e. for every y E E 2 there exists x E E I with ux = y, and for every B 2 bounded in E 2 , there exists B I bounded in E I with uBI = B 2 • Definition (1.5). A closed b-subspace F of E is a b-subspace of E, such that every bounded subset of E which is contained in F is bounded in F. The quotient vector space BJF, with the quotient boundedness is a b-space: a subset of EJF is there bounded if it is the quotient image of a bounded subset of E. The kernel Ker u of a bounded linear mapping u : B I ~ B 2 is a closed subspace of E I • And u factors
The mapping EdKer
UI ~
uBI is an isomorphism of b-spaces.
Definition (1.6). Let {EJiEI be a family of b-spaces. Then EBiEi is the direct sum of the vector spaces, with its direct sum boundedness: a subset of the direct sum is bounded if it is contained in a finite sum of sets, each bounded in one of the summands. And IIi E; is the direct product of the vector spaces, with its direct product boundedness: a subset of the direct product is bounded if all its projections are bounded.
2. q-Spaces, Strict Morphisms, and Pseudo-Isomorphisms Definition (2.1). A q-space ElF is a couple (E, F) where B is a b-space and F is a b-subspace of E. Definition (2.2). Let ElF and B'I F' be two q-spaces. A strict morphism u: B IF ~ E' IF' is induced by a bounded linear mapping u: B ~ E' whose restriction to F is a bounded linear mapping F ~ F'. Two linear mappings, u l and u2 ' both inducing a strict morphism, induce the same
878
L. Wae/broeck / Category of Quotient Bomological Spaces
morphism if and only if U 2 - u l is a bounded linear mapping E -'> F'. The space of strict morphisms ElF -'> E'I F' will be called (Tij(E IF, E'I F'). The expressions 'u induces a strict morphism' and 'u is induced by u' are useful, and will be reused. The notation (Tij will be analysed later in this paper (Sections 6 and 8). With their obvious composition, the q-spaces and their strict morphisms are a category that we shall call ij. This category is not completely satisfactory. Let G be a closed b-subspace of E. We would like E I G to be isomorphic with (EIG) IO. But the obvious strict morphism E I G-'> (EIG) lOis only an isomorphism of ij when G is complemented in E. More generally, let ElF be a q-space. Let G be b-subspace of F which is already closed in E. There is an obvious strict morphism EIF-'> (EIG) I(FIG). We would like this morphism to be an isomorphism. Definition (2.3). A strict morphism ii : £1 I F I -'> £21 F2 is a pseudo-isomorphism if it can be induced by some u : E J-'> E 2 which is bomologically surjective and such that u- JF 2 = F I as a b-space. In other words, the pseudo-isomorphism ii : E J / F I -'> E 2 / F 2 is the composition of the morphism Ell F J-'> (EI/Ker u) I(pJ/Ker u) and a strict isomorphism (EJ/Ker u) I (PIIKer u) ~ E 2 1 F2 , induced by an isomorphism EJ/Ker u ~ E 2 whose restriction to FJ/Ker u is an isomorphism of this space with F 2 •
3. Standard q-Spaces and Lemmas Definition (3.1). A b-space E is free if it is isomorphic to a direct sum of Banach spaces IlX;). A q-space ElF is standard if E is free. Lemma (3.2). For each q-space ElF we can find a standard q-space E' IF' and a pseudo-isomorphism sElF: E'I F' ~ ElF. Every b-space E is isomorphic to the quotient of a free b-space E'. Let s : E' ~ E be the quotient mapping, and F' = s-IF. Then s induces a pseudo-isomorphism s: E' IF' ~ ElF.
L. Waelbroeck / Category of Quotient Bomological Spaces
isomorphisms. The mapping U ~ S2 ° U ° SI uij(EII F I, E 21 F2)~ uij(VII VI' V 2 1 V 2)·
is
an
879
injection
Proof. The strict morphisms 51,52, U are induced by Sl' S2' u; Sl' S2 are bornologically surjective, and s~· F I = VI' S;I V 2 = F2. We want to show that u = 0 if 52 ° U ° 51 = O. If B is bounded in E I, we want to prove that uB is bounded in F 2• But Sl is bornologically surjective, a B' bounded in VI exists such that sIB' = B. We assume that 52 ° U ° 51 = 0, hence S2 ° U ° SI (B') is bounded in V 2, i.e. S2 ° u(B) is bounded in V 2. The set uB is bounded in E 2 and mapped by S2 on a bounded subset of V 2, uB is bounded in S;IV2 = F2. 0
Lemma (3.4). Let uI : E I IF I ~ E 21 F 2 and u2 : E 21 F2~ E 3 1 F3 be strict morphisms, induced by u l ' u2 • Assume that u2° u. = O. Then U I : E. ~ u;IF3 is a bounded mapping. Proof. Let B be bounded in EI' then u.B is bounded in E 2, and uiuIB) is bounded in F3 • The image of u.B by U 2 being bounded in F3 , u.B is bounded in u;IF3 • 0
Lemma (3.5). Let U: E I IF I ~ E 2 F2 be a strict morphism induced by u : E I ~ E 2. Assume that E 2 1 F 2 is standard, and also that uE I + F2 = E 2 and u- I F2 = F I , both relations bornologically. Then U is invertible in the category ij. 1
Proof. E 2 = EB 11(X;) is free. Let x E X;, and ex E 11(X;) be such that eAy) = 0 if x¥- y, eAx) = 1. The set B; = {eX}XEX; is bounded in E 2 = uE I + F2. Sets G bounded in E I, D; bounded in F2 exist such that B; ~ uG + D;. Apply the axiom of choice. Mappings c; : X; ~ G, d; : X; ~ D; exist such that ex = u ° c;x - d'p: The sets G, D; can be chosen 'completant', i.e. are absolutely convex and such that E IBj and F 2Dj are Banach spaces. The mappings d', can be 'extended linearly and boundedly', let <: II(Xi)~ Elc; c E I and d.: 11(X;)~ F 2D; c F2 be the extensions. For Xo E II(X; ), Xo = U ° c, (x") + d,(x"). It is clear that b(EB II (X;), E I) and b(1g) lPC;), F2) can be identified with b(lI(X;), £1) and b(lI(X;), F2) resp. The system of mappings c., d, determine thus c : £2 ~ E I, d : E 2~ F2 such that, for Xo E E 2,
c;,
n
n
880
L. Waelbroeck I Category of Quotient Bomological Spaces
The mapping c : E2~ E I is bounded. We also want to prove that its restriction to F2 is bounded F2~ Fl' Let B be bounded in F2, and call Id the identity of E 2 • To prove that cB is bounded in F I , it is sufficient to prove that ucB is bounded in F2, but ucB ~ -dB + B, i.e. ucB is contained i~a bounded subset of F 2• So c induces a strict morphism co, and a°c = Id. a is also invertible at left. Since u ° e + d = Id, u ° e ° u + do u = u, and ao (co a- Id) = O. So (Lemma (3.4)) co u- Id is a bounded linear mapping EI~U-IF2' but u- IF2=FI. This proves that coa= Id, a is invertible at left. 0 Lemma (3.6). Let ElF be a standard q-space, and 5: E I 1 F I ~ E 2 1 F2 a pseudo-isomorphism. Then a ~ a° 5, ij(E 1 F, E I 1 FI) ~ ij(E 1 F, E 2 F2) is a 1
bijection.
Apply again the axiom of choice. The proof is not that different from the proof above. Lemma (3.7). Let ElF be a q-space. Consider two pseudo-isomorphisms 51 : E I I F I ~ ElF, 52: E 21 F2~ ElF. A q-spaee U I V exists, and pseudoisomorphisms ~:UIV~EIIFI' t2:Ulv~E2IF2 such that 5Io~= -
I
52 ° t 2 •
Proof. The pseudo-isomorphisms 5j are induced by bornologically surjective mappings s, : E; ~ E such that s ~I F = F;. Consider the b-space E I x E 2, and the mapping e: E I x E2~ E, (xl' X2)~ SIX I - S2X2' We let U = c·IF and V = F I X F2, both U and Vas b-spaces. We see that U I V is a q-space. We let t l,12 be the restrictions to U of the projections E,xE2~E" E,XE2~E2'
The mappings t, : U ~ E; induce obviously strict morphisms 1; : U 1 V ~ E; 1 F;, and U has been constructed in such a way that sloti - S2° t2E b(U, F), i.e. 5,0 (I = 520 (2' We want to prove that the 1; are pseudo-isomorphisms; consider for example ~, prove that t l is bornologically surjective and t;IFI = V. Consider B, bounded in E I , s.B, is bounded in E. And S2 is bornologically surjective, B 2, bounded in E 2 exists such that S2B2 = siB I. Let
B = {(XI' x 2) E B I x B 2/ SIX, = S2X2} . It is clear that B is bounded in U (in particular eB = {O} is bounded in F). And t l is bornologically surjective.
L. Waelbroeck / Category of Quotient Bomological Spaces
881
We want to show that t~IFI = v: Let B be bounded in U and be such that t.B is bounded in Fl. Consider tzB. This is a bounded subset of E z. Its image by Sz is
We assumed that t.B is bounded in F I , hence (slotl)B is bounded in F. And U had been defined in such a way that
is bounded in F if B is bounded in U. So tzB is bounded in s;IF = F z, and B is bounded in F I x F z = v: We have thus proved that tl is a pseudo-isomorphism. 0 Lemma (3.8). Let u : Ell F I ~ Ezi F z be a strict morphism, induced by u, and s:EzIF2~E3IF3 a pseudo-isomorphism, induced by s:Ez~E3' bomologically surjective. Let v = sou. If E 3 = vEl + F 3 and PI = v- IF3, then E 2 = uE I + F z and F I = u- IF2, (all four relations in the bomological sense).
Proof. First, it is clear that u- I F 2 = F I ; let B c;;, E I be bounded and such that uB is bounded in F 2. Then (s ° u)B = vB is bounded in F3, i.e. B is bounded in v- I F 3 = Fl. Next, let us show that uEI + F2 = Ezo Let B ~ E 2 be bounded, then sB is bounded in E 3 = vEl + F 3. So a C bounded in E I , a D bounded in F3 exist such that sB ~ vC + D. Mappings 'P : B ~ C, IjI: B ~ D exist such that for all bE B : txpb + IjIb = sb. Let D I = {b - u'Pbj b E B}. This set is bounded in E 2 • It is mapped by s onto {sb - (s ° u )bl b E B} ~ D. So D I is bounded in S- I P3 = F 2 and B is bounded in uEI + F 2 . 0
4. Statement and Proof of the Main Result The following is the main theorem of this paper: Theorem (4.1). A category q exists, which contains ij, has the same objects as ij, and is such that the pseudo-isomorphisms are isomorphisms of q. Further, if Cat is another category, a functor (/> : ij ~ Cat extends to q if,
882
L. Waelbroeck / Category of Quotient Bornological Spaces
and only if tP(s) is an isomorphism of Cat if s is a pseudo-isomorphism of ij. The extension is unique if it exists.
We observe that the statement of the theorem determines the category q modulo isomorphism, if the category q exists. Proof. (a). For every q-space ElF we choose a standard q-space E' IF' and a pseudo-isomorphism SElF: E' IF' ~ ElF (Lemma (3.2)). Consider a strict morphism a : E J I F J ~ E 2 1F2 • Lemma (3.3) shows that the mapping ii ~ ii 0 SEIIFI is injective. And Lemma (1.6) shows that a unique ii' exists such that
We let tP(E IF)
=
E' IF' and tP(ii) = and
ij ~ ij, and for all E IF J O'ij(E; IF;, E;/ F;) is injective.
a:
It is clear that tP is a functor tP : O'ij(EJ IF I, E 2 1 F 2) ~
E 2 1 F 2,
We wish tP to become an inciusion. For all couples (E I ) F I , E 2 1 F 2) we choose a set O'q(EI I F I, E 2 1 F 2 ) ~ O'ij(EII F" E 2 1 F 2 ) , and a bijection
where
is the inclusion. Structure transport allows to place a vector structure on O'q(EI IF I, E 2 1 F 2) using the bijection, and define the composition
i.e. q is a category that contains ij. (b). Let u be a pseudo-isomorphism of ij. We want to prove that u is invertible in q. Look at ii' : E; I F; ~ E~ IF~ such that ii SE]IF] = SE21F2 ii', We prove that il' is invertible in ij. This will be sufficient to prove the result. ii' is induced by u', It is sufficient (Lemma (3.5)) to prove that uE; + F; = E;, and u'-IF~ = F;, both bomologically. But that is proved (Lemma (3.8)), SE21F2 is a pseudo-isomorphism, and ii 0 SEliF] is also a 0
0
L. Waelbroeck I Category of Quotient Homological Spaces
883
pseudo-isomorphism, induced by U «sEIIF1 = v. We assumed that vEl = E; and v-1Fz = Fi, a fortiori vEl + F; = E;. So uBi + F; = E; and u-1F; = Fl' (c). Let Cat be a category, and 1/1': /j ~ Cat be a functor. If 1/1' extends to q, let 1/1'1 be the extension. We see that 1/I'1(S) is invertible in Cat if S is a pseudo-isomorphism in /j. Let u E O"q(E IF; Ell FI ). Then a = SEIiFl 0 a' 0 sEiF' The three morphisms sEIIFI' a', and sElF are strict, their image by 1/1' is known. And 1/I'(sElF I is also determined by 1/1'. So
r
i.e. the extension is unique if it exists. (d). We want to construct the extension. In this section, we shall construct a mapping 1/1'1 : q ~ Cat. In the next section, we shall prove that this mapping is a functor. Every morphism u E O"q(E IF, E' IF') can be expressed in at least one way as
with
UI
strict, and
SI
a pseudo-isomorphism. We would put
For that, we must prove that
if
Lemma (3.7) shows that a q-space U I V and pseudo-isomorphisms, tj : U I V ~ E, IF; such that Slo t l = Sz 0 tz exist. We assumed that U I = U 0 SI' U z = U 0 sz, hence u l 0 t l = U z 0 (z. Hence
884
L. Waelbroeck / Category of Quotient Bornological Spaces
P'(UI)P'(slf l = P'(U2)P'(t2)P'(tlflp'(Slfl
=
P'(u 2)P'(t2)P(SI ° tlf l
l
l
= P'(U2)P'(t2)P(S2 ° t2f = P'(U2)P(S2f .
Thus the mapping P'I is defined. (e). We want to show that P'I is a functor. Let U; E uq(E; IF;, E;+II F;+I) for i = 1,2, and show that 1Ji'1(U2 0 u I) = P'1(U 2)P'I(UI). We choose standard q-spaces E; IF;, pseudo-isomorphisms Sj : E; IF; ~ E; I F; and strict morphisms u;: E; I F;~ E;+I'I F;+I in such a way that , H ence u, = ( ' ) «s -I " ) °SI-I UjOSj = si+IOUj' Sj+IOUj ; , U 20U I = ( S30U2°UI an d therefore
5. Second Theorem The two statements below are equivalent. Theorem (5.1). Let cP and P' be two functors from ij into a category Cat, both extending to q. Let cP I and P'I be extensions of cP and P. Let H : cP ~ P' be a functor homomorphism. Then H is a function homomorphism cP 1~ P'I' Theorem (5.2). Let Cat, and Cat, be two categories, let cP be a functor ij x Cat, ~ Cau. Assume that cP(s, objl) is an isomorphism of Cac. for all objects obj, of the category Cat, and all pseudo-isomorphisms of ij. Then cP has an extension q x Cat, ~ Cau,
They are nearly obvious, let us prove the first. The reader can prove the second. Every morphism u : ElF ~ E' IF' factors as u = u' 0 S -I where u' : E" IF" ~ E' is strict and s: E" IF" ~ ElF is a pseudo-isomorphism. H determines for the space ElF a morphism H E 1F : ep(EIF)~ P'(E IF) of Cat, and if u': E" IF" ~ E' IF' is strict, then
'F'
H E'IF' cP(u') = P'(u')H E'IP' If s : E" IF" ~ ElF is a pseudo-isomorphism, then cP(s) and P(s) are invertible in Cat.
L. Waelbroeck / Category of Quotient Bomological Spaces
i.e. H is a morphism rtJ1 -+
885
1[11.
6. The Functors a and {3
Let ElF be a q-space, Then the quotient vector space ElF is well known. Let ElF and E'I F' be two q-spaces and u : ElF -+ E' IF' be a strict morphism. Then u is induced by u 1 : E -+ E' and u1(F) c: F', so U 1 induces a linear mapping ElF ~ E'IF'. We shall let u(E IF) = ElF, and when u E mj(E IF, E'I F'), we shall let au : ElF -+ E'IF' be the associated linear mapping. It is obvious that ou is bijective if u is a pseudoisomorphism, in other words au is then an isomorphism. Definition (6.1). The functor a : q -+ E. V. described above will be called the subjacent vector space functor.
(For the author, in this paper, E. V. is the category of vector spaces and linear mappings.) The only q-spaces ElF which are isomorphic in the category q to 0 I0 (are null) are the q-spaces E IE. But it is possible to define a 'non-trivial q-structure' on 0: consider a b-space E in which at least one bounded subset generates an infinite-dimensional subspace of E. Let F be bsubspace of E, which is the same vector space as E, but with a strictly finer bornology. Then ElF is not null, but u(E IF) = {O}. Definition (6.2). Consider a set X, and a b-space E. Then the vector space {3(X, E) is the set of mappings X -+ E which map X onto a bounded subset of E. On this vector space, we place the equibounded boundedness, B 1 is bounded if {f(b)1 f E B 1, X E X} is bounded in E.
Defined in this way, {3(X, E) is a b-space. If F is a b-subspace of E, then {3(X, F) is a b-subspace of {3(X, E). We put {3(X, E IF)
= {3(X, E) I{3(X, F).
Next, let u : ElF -+ E' IF' be a strict morphism, induced by a bounded linear mapping U 1 : E -+ E'. Composition with u 1 gives a linear bounded mapping {3(X, E)-+ {3(X, E'). The bounded linear mapping U 1 induces a strict morphism: composition with U 1 defines a bounded linear mapping
886
L. Waelbroeck / Category of Quotient Bomological Spaces
{3(X, F)~ {3(X, F'). If u were 0, composition with U I would map {3(X, E) into {3(X, F'), boundedly. The strict morphism u defines in this way a strict morphism {3(X, u): {3(X, E IF)~ {3(X, E' IF'). In this way, {3(X,.) is a functor ij ~ ij. Further, if u is a pseudoisomorphism, {3(X, u) is also a pseudo-isomorphism, {3(X, .) becomes also a functor q ~ q. We want to investigate the functorial properties of f3 in function of X. We introduce a category. Definition (6.3). II is a category, whose objects are sets, and if X, Yare sets, IJ(X, Y) is the set of linear bounded mappings II(X)~ ll(Y)' If you prefer, ll(X, Y) is the set of matrices (aXY)XEX.YEY with
lIa/l =
sup x
2: laxyl <
00.
y
The composition of linear mappings is equivalent to that of matrices. Definition (6.4). Let X be a set, and ElF a q-space, then {3(X, E IF) = {3(X, E) I{3(X, F). If u : ElF ~ E' IF' is a strict morphism, then {3 (X, u) is
the strict morphism constructed above. And if a E ll(X, Y), {3(a, E IF) is the morphism {3(Y, E IF)~ {3(X, E IF) induced by the mapping {3(Y, E)~ {3(X, E), (u(Y»~ (~y aXYu(y».
Ir
It is clear that {3 is a functor x ij ~ ij, and extends to a functor lr x q~q (Theorems (4.1) and (5.2». lr is the category opposite of q.
7. The Category q is Abelian The first part of this Section is the following: Theorem (7.1). A q-space ElF is null if, and only if u{3(X, E IF) = 0 for all sets X. A morphism u : ElF ~ E ' IP' is null if, and only if, u{3(X, u) = 0 for all X. A morphism is monic if, and only if, for all X, u{3(X, u) is injective, it is epic if, and only if, for all X, u{3 (X, u) is surjective.
L. Waelbroeck / Category of Quotient Homological Spaces
887
To remind the readers who have forgotten, a morphism u: EIF-4 E' IF' is monic if v : E I IF I -4 ElF is zero as soon as u a V = 0, and is epic if v : E' IF' -4 E I IF I is zero as soon as va U = 0. In the statement of the above Theorem, two of the 'if' parts are trivial, as the reader can check. We shall prove that the condition is sufficient, the reader will check that the condition is necessary. In the proof, we shall use Lemma (7.2). Let E I = EBII(X;) befree. Then uq(EII {O}, E IF) is naturally isomorphic to IIi uf3(X;., E IF). This is obvious. Proof of Theorem (7.1). (a). If ElF is not null, a subset B exists, which is bounded in E and not in F, the inclusion B -4 E belongs to f3(B, E) and not to f3(B, F), i.e, a non-zero element of uf3(X, E IF) exists. Let u: ElF ~ E' IF' be strict, and non-zero. Let u l be the mapping inducing u. Saying that u ':ji means that a bounded subset B of E exists such that u.B is not bounded in P. Let i : B -4 E be the inclusion mapping, and f E uf3(B, E IF) the class equivalence. Then uf3(B, u)f ¥- 0, i.e, uf3(B, u) ¥- 0. Finally, let u be a non-zero general morphism. Then a pseudo-isomorphism s exists such that u a5 is strict. And of course, u as¥- 0. A set B exists such that uf3(B, u a 5) ':ji 0, hence uf3(B, u) ¥- 0. (b). Assume that uf3(X, u) is not injective. We know that uf3(X, u) can be identified with uq(lI(X) I{O}, u). We can find a v E uq(lI(X) I{O}, E IF) which is not zero, but u a V = 0, i.e. u is not monic. Conversely, let u be nonmonic. Let v be a morphism which is not zero, but u a V = 0. Assume first that both u and v are strict. Let u l and VI be bounded linear mappings inducing u and v. Since v : E I IF I -4 ElF is not zero, a bounded set B in E I exists such that vlB = X is not bounded in F. Let i: X -4 E be the inclusion mapping, and E uf3(X, E IF). Then f ¥- 0, but uf3(X, u)f = 0, so uf3(X, u) is not injective. If v is a general mapping, v ¥- 0, such that u a V = 0, we find a pseudo-isomorphism sl:E2/F2-4EIIFI such that vaSI is strict. Since v ¥- 0, we see that v 0 51 ¥- 0, but u 0 (v 0 51) = O. The proof above applies, X exists such that uf3(X, u) is not injective. Finally, let u be a general morphism. We can assume that v comes
°
r
L, Waelbroeck I Category of Quotient Bomological Spaces
888
from a standard q-space. Find a pseudo-isomorphism s: E' IF' ~ ElF with E'IF' standard. Then v = Sov 1, so UOV = (uos)ovl. We have constructed s in such a way that (u <s) is strict, so we proved that a[3(X, U - s) is not injective. The composition with u[3(X, s) is a bijection a[3(X, u)~ u[3(X, U s), which maps injective mappings onto injective mappings, and non-injective mappings onto non-injective mappings, so u[3(X, u) is not injective. (c). Assume that a[3(X, u) is not surjective for some X. If u is strict, u is induced by u l : E ~ E'. A bounded subset B' of E' exists which is not bounded in u.E + F'. The identity mapping E' ~ E' induces a morphism v : E' IF' ~ E' I(uiE + F'). This mapping is not zero, but v 0 u = 0: u is not epic. In general, u = u' 0 S-I with s a pseudo-isomorphism and u' strict. We know that composition with af3(X, s) is bijective and maps surjective mappings onto surjective mappings and non-surjective mappings onto non-surjective mappings. So uf3(X, u') is not surjective, u' is not epic, and u is not epic. Finally, assume that u is not epic, then a v: E' IF' ~ E I IF I exists, which is not zero, but v 0 u = O. Again, consider first the case where both u and v are strict. Let u l : E ~ E' and VI: E' ~ E I be bounded linear mappings inducing u, v. Since v is not zero, a bounded subset X exists such that vlX is not bounded in Fl' This set X is not equal to any U I Y + B for any Y bounded in E nor any B bounded in F', since v l o ul induces the zero mapping ElF ~ E I IFl' The inclusion mapping i : X ~ E' defines an element of u[3(X, E" F') which is not in the range of 0
uf3(X, E IF).
In the general case, when u and v are not assumed strict, we find standard q-spaces V I V and V' I V' and pseudo-isomorphisms s : V I V ~ ElF and s' : V'I V' ~ E' IF'. We lift the morphisms u, v and obtain strict morphisms u': V I V ~ V'I V', v': V'I V' ~ Ell F I , such that v' 0 u' = 0, but v' # O. We proved that uf3(X, u') is not surjective, hence u[3(X, u) is not surjective. 0 Theorem (7.3). A strict morphism u: ElF ~ E' IF', induced by up is monic if, and only if, F = u~IF'; u is epic if, and only if, E' = uE + F', both equalities being those of o-spaces. Proof. This is nearly obvious. Assume first that F # u-IF'. A bounded subset X of E exists such that uX is bounded in F' but X is not bounded
L. Waelbroeck / Category of Quotient Bomological Spaces
889
in F. Then i E u{3(X, E IF) is not zero but u{3(X, u)i = 0, where i is the equivalence class of i, inclusion of X in E. So u{3(X, u) is not injective, hence u is not monic. Conversely, assume that F = u-Ip. Let X be a set, let a E {3(X, E), ii the equivalence class in u{3(X, E IF). We see that aX is bounded in E, u l 0 a (X) is bounded in F' if u{3(X, u)ii = O. So aX is bounded in u~lF' = F. But if aX is bounded in F, ii = O. So u{3(X, u) is injective for all X, hence u is monic. Assume that E' ¥- u.E + P. A bounded subset X ~ E' exists such that X is not bounded in uE + P. Let i: X ~ E' be the inclusion and i its equivalence class in u{3(X, E'I P). This i is not in the range of u{3(X, u) since for all bounded mappings f : X ~ E, ufX is bounded in uE I , and if ul = ~ then the set C = {x - udxl x E X} is bounded in F', and u.X is bounded in u.E + F'. This shows that u is not epic. Conversely, if E' = u.E + F', let it E a{3(X, E'I F'), induced by u E {3(X, E'), and let ii E u{3(X, E" F') be the equivalence class of a E {3(X, E'). We know that E' = u.E + F' bornologically, B bounded in E and C in F' exist such that aX ~ u.B + C. Mappings f : X ~ B, g : X ~ C exist such that
and ii
= u{3(X, u)l And hence
u is epic. 0
Theorem (7.4). A morphism of q is an isomorphism if it is monic and epic. Proof. We assume that ElF is standard, every q-space is isomorphic to such; u : ElF ~ E'I F' is a strict morphism, induced by u l : E ~ E'. We assume that E' = u.E + F' and F = U~l P. We consider the q-space E EB F I IFEB F I, and the strict morphisms $1 : E EB FII F EB F I ~ ElF induced by Sl: x EB y ~ x, and $2: E EBFII FEBFI ~ Ell F I induced by S2 : x EB y ~ ulx + y. We shall prove that these morphisms are pseudoisomorphisms. It is trivial for $1: its inverse belongs to ij. To prove that 52 is a pseudo-isomorphism, we observe first that S2 is bornologically surjective, since s2(E EB P) = u.E + F ', bornologically. Assume that B is bounded in S;I F', i.e. that B is bounded in E EB F I and S2B is bounded F'. Let B I , B 2 be the projections of B in E and P. The second projection is of course bounded in F'. We must show that B I is bounded in F. We assume that F = U~I F' bornologically, B I is bounded in
890
L. Waelbroeck / Category of Quotient Bomological Spaces
E, we must still show that uJB) is bounded in P, but
and the right hand side is bounded in P. So both s, and S2 are pseudo-isomorphisms in ij. And u = S20S;) is an isomorphism of q. 0 Definition (7.5). Let ElF be a q-space, and E ' a b-subspace of E which contains F as a b-subspace. Then E ' IF will be called a q-subspace of ElF, and E IE ' will be called a quotient of ElF. The morphism E ' IF ~ ElF induced by the inclusion is the canonical injection; ElF ~ E IE ' induced by the identity morphism is the quotient morphism, or the canonical epimorphism. We observe that the set L(E IF) of q-subspaces of ElF is a lattice, and a pseudo-isomorphism E ' IP ~ ElF determines an isomorphism L(E' IP)~ L(E IF). Every isomorphism E) I FI ~ ElF being a composition of pseudo-isomorphisms and inverses of such, we see that L(E IF) and L(E, IF)) are isomorphic if ElF and Ell F) are isomorphic. Theorem (7.6). Every morphism of the category q is the composition of a quotient morphism, an isomorphism, and a canonical injection. A monomorphism is the composition of an isomorphism and a canonical injection. An epimorphisms is the composition of a quotient morphism and an isomorphism. Proof. Every q-space is isomorphic to a standard one, every morphism can be replaced by a strict one. Let u : ElF ~ E) IF) be strict, induced by u) : E~ E). We factorize u = io u'<s, where s: E IF~ E I u-)FJ is the quotient morphism, where i: (uE + F')P ~ E ' IP is the canonical injection and u':Elu-)p~(uE+P)IP is clearly (Theorem (7.4)) an isomorphism. 0 Theorem (7.7). The category q is abelian. Proof. We must prove that every morphism has a kernel, a cokernel, and is such that the natural morphism of the cokernel of the kernel into the kernel of the cokernel is an isomorphism. Let u be a morphism, which we
L. Waelbroeck / Category of Quotient Bomological Spaces
891
assume is strict, induced by U I : E ~ E'. The kernel of u is the canonical injection u-IF' IF ~ ElF; its cokernel is E'I F' ~ E' j (uE + F'), and u': E I u-IF'~ (uE + F')j F'
is an isomorphism. Let u: ElF ~ E' IF' and v: E' IF' ~ E"I F" be strict morphisms, induced by u I' VI' and such that v ° u = O. Then (u, v) is exact if, and only if, u.E + F' = v~'F" as bornological spaces. Every short exact sequence 0 ~ E I ,F I ~ E 2 1F 2~ E 3 / F 3 ~ 0 is isomorphic to a short exact sequence of the following type:
This being proved, a monic morphism E I IF I ~ E 2 / F2 is the composition of an isomorphism Ell F , ~ E' IF2 and the canonical injection E'I F , ~ E 21 F 2. An epic morphism E 21 F 2~ E 3 1 F 3 is the composition of the canonical surjection E 2 1F 2~ E 21 F' and an isomorphism E 2/ F' ~ E 3 1F 3• The original sequence was exact, E' = F'. 0
8. Multilinear Mappings
We define multilinear mappings In the category q by placing a qstructure on the vector space aq(E,1 F I, E IF), and define q2(EI I FI , E2IF2;EIF) as q(EI IFI,q(E2IF2, ElF». And we begin of course with ij. Definition (8.1). Let ElF and E' )F' be two q-spaces. Then ijl(E IF, E'I F') is the space of bounded linear mappings u : E ~ E' whose restriction to F is a bounded linear mapping F ~ F'. A subset B I ~ ql(E IF, E'I F') is bounded in this space if {uxl u E B I , X E B} is bounded in E' if B is bounded in E, and is bounded in F' if B is bounded in F. And ijO(E IF, E' IF') is the space of bounded linear mappings E ~ F', in which B I is bounded in ijO(E IF, E'I F') if {uxl u E B I , x E B} is bounded in F' when B is bounded in E. We let ij(E IF, E'I F') = ijl(E IF, E' IF') IijO(E IF, E' IF') .
892
L. Waelbroeck I Category of Quotient Bomological Spaces
This definition is coherent with the earlier definitions of a, and of aq. Also, ij is a functor ij * x ij ~ ij, clearly. Definition (8.2). By induction, we let ijl(E IF, E' IF') = ij(E IF, E' IF'),
and
We see that ij~ Iij~ = ijn' where ij~ is the space of bounded linear mappings E I x ... x En ~ E', whose restrictions to
are, for all i, bounded linear mappings into F', the space ij~ having the obvious boundedness. And ij~ is the space of bounded linear mappings of E I x ... X En into F', again with the obvious boundedness. uq(E IF, E' IF') had been defined associating standard q-spaces Ell F I, E; IF; to ElF, E'I F' and pseudo-isomorphisms s, s', The space uq(E IF, E'I F') was naturally isomorphic to uij(EII F I , E; IF;), and contains uij(E I F, E'I F'). Definition (8.3). We define q(E IF, E' IF') in a such a way that this q-space is naturally isomorphic to ij(E I/ F I, E; IF;).
In this way, q is a functor q* x q ~ q, because ij is a functor ij* x ij ~ ij. We must still 'embed' ij in q. Theorem (8.4). For all couples of o-spaces ElF, E'I F', we have a monomorphism H EIF,E'IF' : ij(E IF, E'I F')~ q(E IF, E'I F'). The system of these monomorphisms being a functor homomorphism.
Proof. We shall construct these H EIF,E'IF" and show that they are monic. The fact that they are natural will be left to the reader. For each q-space ElF, we choose a standard q-space Ell F I and a pseudo-isomorphism s: E I IF I ~ ElF. Let u: ElF ~ E' IF' be a strict morphism, induced by U: E ~ E'. The axiom of choice assures that a
893
L. Waelbroeck / Category of Quotient Bomological Spaces
commutative diagram exists: UI
Et~E;
11 U
E~E'.
V being chosen, VI is not completely determined. If V = 0, we see that Vt(E t) ~ Ker We have a mapping
s;.
T: {/(E IF, E' IF')~ i/(Etl
r; E; IF;)/b(E
p
Ker s;).
This mapping maps tt(E IF, E' IF') into ,r(Etl r; E; IFD/b(E t, Ker s;), and
i.e. T induces a monomorphism of ij(E IF, E' IF') into
This
q-space
is
pseudo-isomorphic
to
ij(Etl
r; E; IF;),
ij(E IF, E' IF'). We call this monomorphism H EIF.E'IF" D
i.e.
to
References [I] [2]
[3] [4] [5]
[6] [7)
I. Cnop, Spectral study of holomorphic functions with bounded growth, Ann. Inst. Fourier 22 (1972) 293-303. LF. Colornbeau, Some aspects of infinite-dimensional holomorphy in mathematical physics, In: Aspects of mathematics and its applications, ed. l.A. Barroso, NorthHolland Math. Stud. 34 (North-Holland, Amsterdam, 1986) 253-264. l.-P. Ferrier, Approximation des fonctions holomorphes de plusieurs variables avec croissance, Ann. Inst. Fourier 22 (1972) 67-87. l.-P. Ferrier, Spectral theory and complex analysis, Notas de Mat. = North-Holland Math. Stud. (North-Holland, Amsterdam, 1973). l.W. Helton and R.E. Howe, Integral operators: traces, index, and homology, Lecture Notes in Math. 345 (Springer, Berlin, 1973) 141-209. H. Hogbe-Nlend, Theorie des bomologies et applications, Lecture Notes in Math. 213 (Springer, Berlin, 1971). H. Hogbe-Nlend, Bomologies and functional analysis, Notas de Mat. = North-Holland Math. Stud. (North-Holland, Amsterdam, 1977).
894
L. Waelbroeck I Category of Quotient Bomological Spaces
[8] R. Levi, Notes in the Taylor joint spectrum of commuting operators, In: Spectral Theory, Banach Center Publ. 8 (Banach Center, Warsaw, 1982) 321-332. [9] G. Noel, Une immersion de la categoric des espaces bornologiques con vexes separes dans une categoric abelienne, CiR, Acad. Sci. Paris 269 (1969) 195-197. [10] G. Noel, Sur Ie complete d'un q-espace, C.R. Acad. Sci. Paris 269 (1969) 238-240. [11) G. Noel, Sur Ie produit tensoriel dans les categories qesp et qespc. CiR. Acad. Sci Paris 269 (1969) 275-278. [12] G. Noel, Produits tensoriels et platitude des q-espaces, Bull. Soc. Math. Belg. 22 (1970) 119-142. [13] J.L. Taylor, A joint spectrum for several commuting operators, J. Funct. Anal. 6 (1969) 172-191. [14] J.L. Taylor, The analytic functional calculus for several commuting operators, Acta Math. 125 (1970) 1-38. [15] L. Waelbroeck, Etude spectrale des algebres completes, Acad. Roy. Belg. CI. Sci. Mem. 7(2) 31 (1960). [16] L. Waelbroeck, Le complete et Ie dual d'un espace 'il bornes', C.R. Acad. Sci. Paris 253 (1961) 2827-2828. [17] L. Waelbroeck, Topological vector spaces and algebras, Lecture Notes in Math. 230 (Springer, Berlin, 1971). [18] L. Waelbroeck, Les quotients d'espaces bronologiques complets, CiR. Acad. Sci. Paris 285 (1977) 899-901. [19] L. Waelbroeck, Les quotients d'espaces bornologiques complets; theorie multilineaire, C.R. Acad. Sci. Paris 285 (1977) 949-951. [20] L. Waelbroeck, Les q-algebres, C.R. Acad. Sci. Paris 285 (1977) 1053-1055. [21] L. Waelbroeck, Les quotients banachiques, c.R. Acad. Sci. Paris 286 (1978) 37-39. [22] L. Waelbroeck, Les espaces de Banach plats sont ultraplats, Bull. Soc. Math. Belgique Ser. A 33 (1981) 3-6. [23] L. Waelbroeck, Quotient Banach spaces, In: Spectral theory, Banach Center Publ. 8 (Banach Center, Warsaw, 1982) 553-562. [24] L. Waelbroeck, Quotient Banach spaces: Multilinear theory, In: Spectral theory, Banach Center Publ. 8 (Banach Center, Warsaw, 1982) 563-571. [25] L. Waelbroeck, The Taylor spectrum and quotient Banach space, In: Spectral theory, Banach Center Publ, 8 (Banach Center, Warsaw, 1982) 573-575. [26] L. Waelbroeck, Holomorphic functional calculus and quotient Banach algebras, Studia Math. 75 (1983) 273-286. [27] L. Waelbroeck, Quasi-Banach algebras, ideals, and holomorphic functional calculus, Studia Math. 75 (1983) 287-292.
1.A. BARROSO editor, Aspects of Mathematics and its Applications © Elsevier Science Publishers B.V. (1986)
895
GAUSS ET LA COMPOSITION DES FORMES QUADRATIQUES BIN AIRES
Andre WElL Institute for Advanced Study, Princeton, NJ 08540, U.S.A. A Leopoldo Nachbin en temoignage de cordiale arnitie
Un des plus beaux titres de gloire de Gauss est d'avoir, dans la V" Section des Disquisitiones (Artt. 234-261, De compositione [ormarum, et Artt. 305-307), etabli la theorie de ce que nous appelons le groupe des classes de formes quadratiques binaires primitives de discriminant donne, ou encore, en termes plus modernes, Ie groupe des classes d'ideaux dans les 'ordres' des corps quadratiques. C'est la, dit-il au moment d'en aborder l'etude, 'un sujet de grande importance, auquel personne n'a encore touche' ('a nemine hucusque attactum'; Art. 234). En fait, pas plus que mainte autre grande decouverte mathematique, ce n'etait une creation ex nihilo. L'histoire en a deja ete plusieurs fois ecrite (cf. en particulier [5, pp. 25 et 41-47], et [1, Chap. I et Chap. IY, Section YI]); il ne sera pas inutile neanmoins d'en retracer les principales etapes avant d'examiner de plus pres la contribution de Gauss a cette question. (1). Soit I une forme quadratique binaire
I(X, Y)
=
AXz + BXY + Cy Z ,
ou A, B, C sont des en tiers, et ou de plus, si nous voulons suivre Gauss, B doit etre pair. Son 'discriminant' est B Z - 4AC; son 'determinant', au sens de Gauss, est (~B)Z - AC; Ie cas des formes a discriminant nul est exclu une fois pour toutes. Le 'diviseur' de I est le p.g.c.d. de A, B, C; si
ce diviseur est 1, 1 est dite primitive. Deux formes sont dites equioalentes (resp. proprement equiualentesy si elles peuvent se deduire I'une de I'autre par une substitution lineaire de determinant ±1 (resp. + 1); elles ont alors merne discriminant et merne diviseur. Les 'classes' de formes, au sens de Gauss, sont celles qui sont definies au moyen de l'equivalence propre. Soient 10' II' Iz des formes quadratiques (1.1)
(i = 0, 1,2).
896
A. Wei! / Gauss et La Composition des Formes Ouadratiques Binaires
Supposons qu'en posant:
(1.2)
on ait l'identite (1.3) alors, suivant Gauss, on dit que 12 est 'transformable en lo/r' au moyen de la substitution (1.2). Du fait qu'on a exclu Ie cas ou I'une ou I'autre des /; serait de discriminant nul, il est facile de voir que les determinants Pp.v = Pp.qv - qp.Pv ne peuvent etre tous nuls; si de plus ils sont sans diviseur commun autre que ± 1, 12 est dite composee de 10 et II. Gauss dernontre I'existence d'une telle cornposee chaque fois que les determinants de 10 et II ne different que par un facteur carre; de la iI deduit la loi de composition qui fait de I'ensemble (fini) des classes de formes primitives de determinant donne un groupe commutatif. Bien entendu il n'emploie pas Ie mot de groupe, qui ne devait faire son apparition qu'avec Galois, mais les Artt. 305-306 des Disquisitiones montrent qu'il en a eu, en ce qui concerne les groupes commutatifs finis, la notion claire et distincte. Sur ce point, d'ailleurs, comme sur beaucoup d'autres, Euler avait joue un role de precurseur (ct. [1, Chap. III, Section VI]). (2). On peut, avec beaucoup de vraisemblance, faire remonter a l'antiquite grecque quelques cas particuliers de 'composition' (ct. [1, Chap. I]); ce sont ceux qui s'expriment pour nous par les identites
auxquelles iI convient d'adjoindre la suivante:
Ces identites se rapportent respectivement aux cas
A. Wei! / Gauss et fa Composition des Formes Quadratiques Binaires
897
fo(X, Y) = !J(X, Y) = fz(X, Y) = X 2 + y 2 , fo(X, Y) = fl(X, Y)
=
f2(X, Y)
=
X 2 =+= Ny 2,
ou N est un en tier positif. Bien entendu, (2.1) est un cas particulier de (2.3), et (2.2) ne differe pas de (2.3) des qu'on admet pour N des valeurs negatives. Reciproquement, des qu'on admet les irrationnelles quadratiques, (2.3) se deduit de (2.1) en y substituant YoVN, YJVN a Yo, YI' et (2.2) s'en deduit de merne pourvu qu'on admette les 'imaginaires'. (3). On trouve des traces de l'identite (2.1) chez Diophante, au HIe siecle de notre ere; faute de textes, on ne peut remonter plus haut. Dans \'lnde, au vue siecle, Brahmagupta connait (2.2) et l'applique avec virtuosite a l'etude de I'equation dite 'de Pell'; il est plausible que cette connaissance ait pu lui venir de la Grece et qu'Archimede, peut-etre dans Ie merne contexte, en ait rencontre tout au moins des cas particuliers significatifs. Au XHIe siecle, Leonardo Fibonacci, dit Leonardo Pisano, enonce (2.1) dans son langage, sans d'ailleurs s'en attribuer Ie rnerite, et se donne la peine d'en fournir ('sans ayde d'Algebrc', comrne aurait dit Fermat) une laborieuse demonstration. Telle est ce qu'on pourrait appeler la 'protohistoire' de la composition des formes. (4). Une fois acquise la technique algebrique de Viete, ou encore celie de Descartes, toutes deux deja assez proches de la notre, la verification des identites (2.1), (2.2), (2.3) se reduit a une trivialite. Viete Iui-rneme applique (2.1) a l'etude des triangles rectangles et a la trigonometric. Au XVll" siecle, Fermat, dont la technique algebrique etait celie de Viete, fait un large usage de (2.1), (2.2), (2.3) dans ses recherches sur les formes quadratiques. Euler lui-merne, reprenant et prolongeant les travaux de Fermat, se contente longtemps d'enoncer les identites en question et d'en signaler I'importance ('notatu dignum', dit-il de (2.1) en 1749; 'theorema eximium', dit-il de meme de (2.3) en 1758) sans chercher encore ales motiver. C'est seulement dans son Algebre qu'Euler, arrive a la soixantaine, donne de ces identites la seule explication qui lui semble et nous semble satisfaisante; il I'obtient en decomposant en leurs facteurs lineaires les formes X 2 + Ny 2 (ou N peut a present etre soit posit if, soit negatif): X 2+NY2= (X+ YY-N)(X- YY-N) ,
898
A. Wei! / Gauss et fa Composition des Formes Ouadratiques Binaires
et observant qu'on a evidemment:
(2.3) s'en deduit aussitot en multipliant cette derniere relation par sa 'conjuguee' (obtenue en y substituant - V- N a V- N). Comme I'observe Euler, la merne methode permet d'obtenir une identite du type (1.2), par exemple pour les cas fo(X, Y)
=
fiX, Y)
=
aX 2 + c y
2
,
UX, Y) =
x 2 + acy2 •
Un peu plus tard, Lagrange, qui avait eu, lui aussi (independarnment d'Euler) l'idee d'appliquer a la theorie des nombres la decomposition des formes en leurs facteurs lineaires, traita, a la Section IX de ses Additions a la traduction francaise de I'Afgebre d'Euler, Ie cas plus general fo(X, Y) = !J(X, Y) = UX, Y) = X 2 + aXY + b y
2
•
II ecrit pour cela
X 2 + aXY + by 2 = (X + ~"Y)(X + g' Y) , d'ou l'identite cherchee a partir de la relation
et de la relation 'conjuguee' obtenue en substituant g' a q. (5). En I'an VI de la Republique, c'est-a-dire en 1797/98, Legendre publia son Essai sur fa Theone des Nombres, consacre a l'expose des decouvertes d'Euler et Lagrange, et des siennes propres, en arithmetique. Toute une section (la Section III de la lye Partie) y traite de la multiplication des 'diviseurs quadratiques' d'une 'formule' t 2 + au', c'esta-dire, dans notre langage, de la composition des formes primitives de determinant - a (d. [1, Chap. IV, Section VI]). Plus precisement, il se donne deux formes primitives (i=0,1),
A. Weil / Gauss et fa Composition des Formes Quadratiques Binaires
899
dont il suppose qu'elles ont merne discriminant et que ao et a l sont premiers entre eux; comme d'ailleurs il ne fait pas de distinction entre formes equivalentes, il observe qu'on peut toujours satisfaire a cette derniere condition par une 'preparation' convenable de 10 et II (c'est-adire en les rernplacant, s'il y a lieu, par des formes equivalentes). Cela fait, il donne la construction d'une forme 12 de meme discriminant que fo et II' et de formes bilineaires x 2' Y2 en x o, Yo et XI' YI' de maniere a satisfaire it l'identite (1.3). II est a noter du reste que sa definition de I'equivalence est celie de Lagrange, qui ne distingue pas entre equivalence propre et imprapre; par exemple, il ne distingue pas entre les formes I(X, Y) et I(X, - Y) ou bien f(Y, X). Chaque classe, en ce sens, est done, soit une classe au sens de Gauss, soit la reunion de deux telles classes; il s'ensuit que la multiplication, telle que la definit Legendre, est bien une loi de composition entre classes de formes, mais en general a deux valeurs, comme il apparait dans les tables qu'il en donne, par exemple pour la valeur -89 du determinant. (6). C'est en 1801 que parurent les Disquisitiones de Gauss; il en avait concu Ie projet pres de cinq ans auparavant; la redaction s'en etait poursuivie de 1797 a 1800, et I'impression, trap lente a son gre, de 1798 a 1801. Plus de la rnoitie du volume est occupee par la ye Section qui traite des formes quadratiques; a la fin de novembre 1798, il confie a son ami Bolyai qu'il en redige une quatrieme version, qui Ie satisfait mieux que les precedentes. C'est aussi a l'automne 1798 qu'il aborde la theorie de la composition des formes ('hae disquisitiones inchoatae autumno 1798', note-t-il en marge de son exemplaire). II est vrai que, des Ie debut de la V" Section, il en avait donne un premier exemple, celui qui s'exprime par l'identite:
ou l'on a pose:
f(X, Y) = aX 2 + 2bXY + cy 2 ,
identite 'facilement verifiable', dit-il, et dont le resultat de Lagrange, cite plus haut au n° (4), a pu lui donner l'idee. D'apres la merne lettre a Bolyai, il venait, au moment ou il abordait la
900
A. Wei! / Gauss et fa Composition des Formes Quadratiques Binaires
theorie en question, de passer quelques semaines des plus fructueuses a Helmstedt, chez Pfaff, ou il avait eu tout loisir de consulter la riche bibliotheque de celui-ci, ainsi que la bibliotheque locale (sans doute celie de l'Universite). Y avait-il rencontre I'Essai de Legendre? La section de I'Essai ou il est traite de la composition avait-elle attire son attention sur ce sujet, dont il allait tirer un si brillant parti? Cela n'a rien d'invraisemblable. Toujours est-il qu'il n'a, ni dans Ie corps de son ouvrage, ni dans les Additamenta ajoutes apres coup expressernent pour prendre position vis-a-vis de I' Essal de Legendre, reconnu a celui-ci Ie moindre rnerite a cet egard. Sans doute leurs temperaments scientifiques etaient-ils trop opposes pour que Gauss put se resoudre a laisser croire qu'il avait en quoi que ce soit subi !'influence de son atne, Telles peuvent etre les faiblesses d'un grand esprit. (7). La methode d'exposition de Gauss, aux Artt. 234-261 des Disquisitiones, consiste a traiter les identites (1.1), (1.2), (1.3) par la methode des coefficients indetermines, et a en derouler les consequences. Sans doute, du point de vue de la rigueur, cette methode est-elle irreprochable; elle n'en manque pas moins de transparence, et surtout elle comporte, au dire de Gauss lui-merne, des calculs si penibles qu'il n'en explicite que la moindre partie, laissant au lecteur Ie soin de verifier tout Ie reste. Aussi ce sujet demeura-t-il une pierre d'achoppement pour les lecteurs des Disquisitiones pendant un bon derni-siecle, si bien qu'en 1854 Dirichlet jugea utile d'en presenter un expose qui etait, a peu de chose pres, un retour, conscient ou non, a Legendre (d. [6]). C'est aussi la methode de Dirichlet que presente Dedekind en 1871 dans Ie X" Supplement de la 2e edition des Vorlesungen iiber Zahlentheorie de Dirichlet, redigees et publiees par lui pour la premiere fois en 1863. Dedekind introduit dans son exposition Ie mot de 'groupe', pour lequel il se refere a Galois dont il avait etudie I'ceuvre plus de quinze ans auparavant; mais ce X" Supplement contient aussi une innovation bien plus decisive, sous forme d'une premiere esquisse de la theorie des ideaux dans les corps de nombres algebriques; celle-ci, developpee plus amplee ment dans les 3 et 4e editions du meme ouvrage, devait y fournir la matiere du celebre Xl" Supplement. En particulier, Dedekind y fait voir que Ie groupe des classes de formes binaires, tel qu'il avait ete defini par Gauss, n'est pas autre chose que Ie groupe des classes d'ideaux dans l'anneau des entiers d'un corps quadratique, ou plus generalernent dans un 'ordre', c'est-a-dire un sous-anneau unitaire de celui-ci ([2, 2e ed., Sections 168-170; 3e ed., Sections 180-181; 4e ed., Sections 186-187]; d.
A. Weil / Gauss e( La Composition des Formes Ouadratiques Binaires
901
aussi l'interpretation geornetrique donnee par F. Klein dans ses notes de cours autographiees sur la theorie des nombres: [4, pp. 94-221]). (8). Jusque la, il ne s'etait agi que de formes quadratiques sur I'anneau Z des entiers rationnels; il restait a examiner ce que devenait la theorie en question lorsqu'on substitue a Z des anneaux, toujours commutatifs et unitaires, mais de plus en plus generaux. Ces recherches ont abouti, d'abord au travail [5] de J. Towber, ou il est suppose seulement que 2 n'est pas diviseur de zero dans I'anneau de base, puis enfin a celui de M. Kneser, [7], ou cette restriction est levee grace a l'emploi des algebres de Clifford, et ou la question est elucidee d'une maniere qu'il est permis de croire definitive. (9). II est temps de revenir a Gauss et a I'objet propre du present travail, qui est d'imaginer, s'il est possible, comment Gauss a pu parvenir a ses resultats sur la composition; car, merne si Dirichlet n'en avait pas fait l'observation [6, p. 107] et si Gauss Iui-merne ne I'avait pas indique a la fin de la preface des Disquisitiones, il serait clair que les demonstrations 'synthetiques' inserees dans cet ouvrage n'ont pu etre fabriquees qu' a posteriori. Heureusement nous avons sur ce point les confidences de Gauss a Dirichlet, transmises par celui-ci a Kummer puis par Kummer a Kronecker en 1846 [3, p. 98]. 'Dirichlet rn'a explique', ecrit a peu pres Kummer, 'que Gauss, lorsqu'il composait la section De compositione formarum dans les Disquisitiones, s'etait fabrique pour son usage personnel quelque chose comme mes 'facteurs ideaux', mais qu'il n'avait pu Ie faire reposer sur une base solide; voici en eftet comment il s'exprime dans une note a son memoire sur la decomposition des polynomes en facteurs lineaires, . .. Que cette note se rapporte a la composition des formes, c'est ce que Gauss lui-rnerne a dit a Dirichlet.' La note que cite Kummer figure en bas de page (Gauss, Werke, III, pp. 14-15) a propos du passage ou Gauss critique l'emploi, abusif a son avis, que ses predecesseurs, Euler, d' Alembert et autres, avaient fait des imaginaires dans leurs tentatives de demonstration de ce que nous appelons 'Ie theoreme fondamental de l'algebre'. Ces auteurs s'accordaient Ie droit de calculer avec les racines d'un polynome a coefficients reels avant d'avoir montre qu'elles peuvent s'ecrire sous la forme a + bv=I; traduit en langage moderne, cela revenait a admettre qu'etant donne un tel polynome, il existe une extension du corps de base ou il se decompose en facteurs lineaires, C'est la un point qui pour nous ne fait pas difficulte, puisqu'un tel corps peut aisernent se construire d'une rnaniere formeIIe;
902
A. Weil / Gauss et la Composition des Formes Quadratiques Binaires
mais il est vrai que, faute d'une telle justification, les racines en question demeurent a l'etat de fantomes sans consistance. Voici maintenant la note de Gauss: 'Dans une question bien differente, mais quelque peu analogue, j'aurais pu, moi aussi, m'accorder une pareille licence; cela m'aurait permis de dernontrer en quelques mots, au moyen de fictions du merne genre, des resultats qui, faute de celles-ci, se revelent fort difficiles et exigent des artifices tres cornpliques. Mais j'ai prefere m'en abstenir, et je crois que j'aurais ete peu approuve si j'avais sur ce point imite la methode des analystes. ' (10). C'est done apparemment au moyen d'une construction formelle, sans justification substantielle a ses yeux, que Gauss avait precede d'abord dans son etude de la composition. La-dessus, la comparaison, d'une part avec les travaux d'Euler et Lagrange (que Gauss connaissait a fond en 1798), et de I'autre avec ceux de Dedekind et de ses successeurs, suggerent des conjectures qui vont etre exposees a present. C'est principalement Dedekind qui nous servira de guide. Comme Gauss nous partirons des relations (1.1), (1.2) et (1.3); avec lui nous supposerons provisoirement que les a.; b.; c, sont entiers. Pour i = 0, 1,2, soit m, Ie diviseur de /;, c'est-a-dire Ie p.g.c.d. de ai' 2bi , ci ; soit d, = b7 - aici son determinant; on suppose d, ¥- 0 pour tout i. Gauss commence par montrer, par un calcul formel, que dod! est un carre, et cela en faisant voir que bob! ± Vdod! s'exprime comme un polynome en a 2 , b2, c2 et les PI' et q,.; puis il montre que d 2 divise domi et d!m~ et n'en differe que par un facteur carre. Comment etait-il parvenu aces resultats? II est permis d'imaginer qu'a I'exemple d'Euler et de Lagrange il avait commence par ecrire Ies comme produits de facteurs lineaires:
t.
(10.1)
/;(X, Y) = (aiX + ,B;Y)(a;X + ,B;Y).
On aura done en premier lieu (10.2)
Si ron pose alors
A. Wei! / Gauss et la Composition des Formes Quadratiques Binaires
903
on aura:
81 = (aJ3;- b;}(b
j -
=
f3 j a ;)
bj ( a jf3; + f3 j a ; ) - b7- (a ja;)(f3j f3;) = b7- ajcj = d.,
L'identite (1.3) implique a present que chaeun des deux faeteurs en lesquels se decompose fzCx z' Yz) doit etre, a un facteur pres independant des Xj' yj' Ie produit d'un des faeteurs de fo(xo, Yo) et d'un des faeteurs de ft(x t, YI); apres un changement de notations, on pourra done supposer qu'on a identiquement
(10.3) (aoxo+ f3oyo)(a,x l + f3, y,) = azCpoxox l + P,xoYI + PzYox, + P3YOYI)
+ f3z(%X oXI + qlxoYI + qZYOX 1 + q3YOYl) , ainsi que la relation 'conjuguee' de celle-Ia, obtenue en y substituant f3; a ai' f3j pour i = 0, 1,2. Posons de plus
et definissons de rnerne les quantites 'conjuguees'
A~.
a;,
On aura:
Al = a of31 = Pla2 + qlf3z,
(lOA)
A 3 = f3of31 = P3 a Z + q3f32 ,
ainsi que les relations conjuguees de celles-ci. (11). Pour obtenir les resultats de Gauss, ecrivons de deux manieres differentes chacun des produits ApA~. On a d'une part: (11.1)
ApA: = (p pa2+ qpf32)(pva~ + qvf3D = azppp.
+ b2(ppq. + qppJ + czqp q. + Pp)2 '
avec, eomme precedernment p p• = ppqv - qpPv. D'autre part, on a:
(11.2)
(aOa~)(alf3;) =
ao(bl + 8 1)
,
AoA~ = (aof3~)(ata;) =
(bo+ 8o)al
,
AoA; =
AoAi = (aof3~)(al f3;) = (bo+ 8o)(b t + 8 t ) , etc.
904
A. Weil I Gauss et fa Composition des Formes Quadratiques Binaires
Egalant ees expressions aux precedentes, on obtient d'abord, au moyen des AI'A~ (pour JL = 0, 1,2,3), les relations [1], [2], [3], [4] de Gauss, Art. 235. Au moyen de AoA: et AjA~ on obtient, par addition et soustraetion, puis division par 2, sa relation [5], et aussi aool = POj02 , et de merne
bobl + 00SI
=
a 2POP3 + b2(Poq3 + qoP3) + c2QOq3 ,
bOb j - SOSI = a2PIP2 + b2(PIQ2 + QIP2) + c2QjQ2,
(en posant L1 = 00SI ee sont Ifl les equations [10] et [11] de Gauss), et aussi: (11.3)
Puisque S: = d, pour i = 0, 1, 2, on deduit d'abord de ees relations que SOSI est entier, done que do et d, ne different l'un de l'autre que par un faeteur carre, puis que d 2 ne differe de eeux-ei que par un tel faeteur. Autrement dit, les relations ci-dessus entrainent que no = 00/S2 et n l = SI/S2 sont rationnels. De plus, on aura:
(11.4)
2bonl = Pm - P l2
,
2b ln o = P03+ P 12 ·
Comme on a note m l Ie p.g.c.d. de aI' C j , 2b j, il s'ensuit que Imjnol est Ie p.g.c.d. de Pm' P 13, Pm + P l2 et est done entier; de merne Imonll est entier. De plus, puisqu'on a Do = n OS2, on a do = n~d2' mido = (m lnoid2; par suite d2 divise mido; pour des raisons analogues, d 2 divise aussi m~dj' Soit maintenant k Ie p.g.c.d. de mlno et monl' ou, ce qui revient au meme, celui de Pol' P 13, P03+ P 12, POj' P23, Pm - P l2 ; k 2 est alors Ie p.g.e.d. de (m jno)2 = (do/d2)mi et de (monji = (dj/d2)m~, done Id2k 2 est eelui de domi et de dlm~. Enfin observons, comme Ie fait a peu pres Gauss, qu'on a, d'apres (11.4): 1
(Pm + P I2i - 4PmP13 = 4dln~ = 4di nonli
(Pm- P I2i - 4P OjP23 = 4d oni = 4dinonl)2
A. Wei! / Gauss et La Composition des Formes Quadratiques Binaires
905
d'ou la relation, bien connue par ailleurs: (11.5) Soit k' Ie p.g.c.d. de tous les Pp.v; comme k' divise k et que k divise 2k', on a k = k' ou k = 2k'. Si k = 2k', c'est que Po3/k' et P12/k' sont tous deux impairs; mais c'est impossible d'apres (11.5), puisque chacun des Pp.v qui figurent au second membre est multiple de k = 2k'. Done k' = k. On a ainsi acheve de verifier les quatre premieres 'conclusions' de Gauss telles qu'elles sont enoncees dans son Art. 235. (12). En fait, ce dernier raisonnement est presente autrement par Gauss; pour montrer que P03/k et P1z/k sont entiers, il ecrit:
Comme on a b~ - do = aoco, c'est la un multiple de m~; de merne bi - d, est multiple de mi, de sorte que P03P1Z est multiple du p.g.c.d. de min~ et m~ni, qui est k Z ; done 2P03/k et 2P 1Z/k ne peuvent etre tous deux impairs; ils sont done pairs, puisque P 03 + P1Z et P 03 - P1Z sont multiples de k. Deux fois, par la suite (aux Artt. 235 et 236) il est renvoye a ce raisonnement, entre autre pour faire voir que 2(b ob1 ±.:::1) est multiple de mom l ; en effet, dit Gauss, cela est vrai de leur somme et de leur difference, et leur produit est multiple de (mom1l Mais on peut ecrire:
et la relation analogue pour bi- d.; et aussi:
II est tentant, dans ces conditions, de chercher a motiver les calculs de Gauss en ecrivant:
ce qui, (12.1)
a nos yeux, se justifie du fait qu'en posant
906
A. Wei! / Gauss et la Composition des Formes Quadratiques Binaires
OJ et 0; sont les racines de l'equation
a coefficients en tiers et sont done eux-memes des en tiers (ration nels ou algebriques suivant que d, est on non un carre). (13). Gauss a-t-il pu raisonner ainsi? Cela ne semble pas impossible, etant donne qu'a la merne epoque it avait entarne ses recherches sur les 'entiers de Gauss' et aussi sur Ie corps des racines cubiques de l'unite. On peut merne aller plus loin, puisqu'au dire de Dirichlet Gauss s'etait 'fabrique quelque chose comme les facteurs ideaux de Kummer' ('etwas dhnliches wie idea/e Factoren': [3, lac. cit. D. Imaginons done une 'fiction' de plus, puisque c'est la Ie terme rnerne dont Gauss s'est servi ace propos. Pour chaque i, attribuons un p.g.c.d. 'fictif', ou 'ideal', m, it a i et f3i' et de merne un tel p.g.c.d. m; a et f3;. Si nous admettons en merne temps que les regles usuelles pour les p.g.e.d. restent valables, il s'ensuivra que mjm; est Ie p.g.e.d. de
a;
d'apres ce qui precede, on a done mjm; = m.. Si maintenant on resout par rapport it (l2' f32 deux des equations p/La 2+ q/Lf32 = AI" on obtient (13.1)
a quoi on peut adjoindre les relations 'conjuguees' de eelles-ci. lei les AI' sont, de par leur definition, multiples de lltom). Puisque k est Ie p.g.c.d. des P /LV' ces relations montrent que ka 2et kf32 sont multiples de lltom); done leur p.g.c.d. km 2 l'est aussi. De merne km~ est multiple de l~m;. Par suite 2, qui n'est autre que (km2)(km~), est multiple de
em
D'autre part, eomme m j est, pour chaque i, le p.g.e.d. des valeurs prises par /;(x, y) pour x, y entiers, l'identite (1.3) implique evidemment que mOm) est multiple de m 2- En particulier, si k = 1 (e'est Ie cas de la 'composition' proprement dite suivant les definitions de Gauss), on doit avoir m 2 = mOm).
A. Wei! / Gauss et la Composition des Formes Quadratiques Binaires
907
(14). Ce dernier resultat est la 'cinquieme conclusion' de Gauss (Art. 235), conclusion a laquelle il adjoint des relations exprimant a 2, b2, c2 au moyen des quantites PI" qp.' no, n l; nous allons verifier celles-ci en supposant pour simplifier que alcl,t O. MuItipliant par a; les relations (13.1) pour J.L = 0, v = 2, on obtient:
Divisant par P02 = aln O, et posant 3, il vient:
P~
= pp./no,
q~
= qp./n o pour J.L = 0, 1, 2,
Procedant de merne sur les relations (13.1), respectivement pour (J.L, v) = (1,3), pour (J.L, v) = (0,2) et pour (J.L, v) = (1,3), on obtient:
les relations conjuguees s'obtiennent de meme. Les relations ainsi obtenues sont semblables aux relations (10.4), avec la seule difference qu'aux coefficients entiers (PI' ; qp.)' il faut substituer les coefficients (eventuellement fractionnaires)
On peut les interpreter en disant qu'a un facteur pres la forme 10 est transformable en Idl' En les traitant precisement comme on a traite (10.4), on obtient des relations analogues a (11.4), c'est-a-dire d'abord de nouveau alno = P02 , etc., puis aussi:
ce sont Iii les formules de Gauss. (15). Passons a la reciproque, qui consiste a construire une forme 12 composee de deux formes donnees 10' II (Disq., Art. 236). Autrement dit, it s'agit, pour 10' II donnees par (10.1) pour i = 0, 1, de satisfaire aux
908
A. Weil / Gauss et la Composition des Formes Quadratiques Binaires
equations (10.4) et a leurs conjuguees de telle sorte que les Pp.v soient sans diviseur commun; iI faut done qu'il y ait, entre les quatre vecteurs (AI" A~), deux relations Iineairernent independantes a coefficients entiers, ou encore que Ie module qu'ils engendrent sur I soit de rang 2, apres quoi on satisfera aux conditions du problerne en prenant pour (a z' a~) et ({32' {3~) des generateurs de ce module. Pour simplifier, supposons ao oj. 0, a, oj. O. Supposons qu'on ait L rp.Ap. = 0, L rp.A~ = 0; multipliant la premiere relation par a~a;, on aura:
on obtient de meme la relation conjuguee. Par addition et soustraction, cela donne
reciproquernent, ces relations entrainent L rp.Ap. = 0, L r,»; = O. Pour qu'elles aient une solution non triviale, iI faut que 00 et 0, ne different que par un facteur rationnel; c'est ce qu'on avait deja trouve plus haut. Reciproquernent, s'il en est ainsi, ces equations ont deux solutions (rp.), (r~) Iineairernent independantes; il existe done bien une composee de fo et ft. En fait il y a merne deux cas a considerer, suivant qu'on aura pris 000. positif ou bien negatif; cette distinction correspond a celIe que fait Gauss entre composition 'directe' et 'inverse'; pour abreger, nous ne discuterons pas de cet aspect de la question, qui est lie a la distinction entre equivalence 'propre' et 'impropre' pour les formes quadratiques binaires. Quoi qu'il en soit, l'associativite de la composition, qui dans les Disquisitiones fait l'objet de calculs particulierernent rebutants, decoulerait immediatement de ce qui precede. (16). II est inutile d'aller plus loin; aussi bien notre propos n'etait-il pas d'ajouter quoi que ce soit aux resultats connus, mais seulement de suggerer une voie heuristique par laquelle Gauss a pu y parvenir. II nous reste a indiquer comment on peut justifier la procedure esquissee cidessus. Pour cela, un premier moyen nous est offert par Dedekind (d. [2, loco cit.]). S'appuyant sur Ie fait que, d'apres Gauss, les discriminants
A. Wei! / Gauss et fa Composition des Formes Quadratiques Binaires
909
des formes lies par une identite de composition ne peuvent differer les uns des autres que par des facteurs carres, et laissant de cote celles dont Ie discriminant est lui-rnerne un carre, Dedekind se donne un corps quadratique k = Q (VV) et se borne une fois pour toutes aux formes qui, dans k, se decornposent en facteurs Iineaires; ce sont celles dont Ie discriminant ne differe de D que par un facteur carre. Observons d'abord qu'avec les notations des n'" (10)-(12), les Z -modules libres M;, de bases respectives (a j , f3J et (a;, f3;) peuvent etre consideres comme modules sur I'anneau Z [8;] = Z [8l On a en effet:
M;,
ainsi que les relations conjuguees de celles-la, En merne temps, on voit que (a;, {3J~ (a;, {3;) determine un isomorphisme de M; sur M;, compatible (en un sens evident) avec I'automorphisme de Z [8;] determine par 8;~8;.
Notons encore qu'il n'y a presque rien a changer aux n'" (10)-(15) si I'on suppose seulement que les a.; Zb; c, soient entiers; pour eviter les fractions, il suffirait alors d'operer avec les discriminants Ad, au lieu des determinants d; De plus, dans Ie cas de la composition proprement dite, c'est-a-dire pour k = 1, on a m 2 = mom l; en consequence, pour que f2 soit composee de fa et fl' il faut et il suffit que f 21m2 Ie soit de folm o et fllm l; ce sont la les formes primitives dont sont 'derivees' les /;. Autrement dit, on ne perdrait rien en se born ant aux formes primitives; inversement, pour composer entre elles des formes primitives fa, fl' on peut leur substituer des formes derivees mofo' mdl' avec rna, m l choisis a volonte. C'est la, en substance, ce que fait Dedekind. A toute forme primitive f(X, Y) = AX 2 + BXY + Cy 2 ,
il substitue systematiquernent une forme mf, avec m tel qu'on puisse ecrire mf(X, Y) = (aX + {3Y)(a'X + {3'Y) ,
ou a, {3 sont dans Ie corps quadratique k donne une fois pour toutes, et ou a', {3' sont leurs conjugues au sens usuel. Lorsqu'i1 en est ainsi, Ie Z -module libre de base (a, {3) est un ideal, eventuellement fractionnaire, de l'anneau Z [8] engendre par I'une ou l'autre des racines 8, 8' de
910
A. Wei! / Gauss et La Composition des Formes Ouadratiques Binaires
T - BT + AC == O. Par exemple, pour pourra prendre m == A, puisqu'on a: 2
Af(X, Y) == (AX
f
donnee comme ci-dessus, on
+ OY)(AX + 0' Y) .
C'est ainsi que Dedekind peut traiter de la composition des formes sans sortir d'un corps quadratique k qu'il suppose donne a l'avance. Dans ce cadre, les calculs heuristiques des n'" (10)-(15) deviennent pleinement legitimes, pourvu qu'on identifie m., m; avec Mi , M; consideres comme ideaux de l'anneau Z [OJ]' (17). On peut rnerne aller plus loin. Seules importent en effet, dans la construction esquissee ci-dessus, quelques proprietes formelles des symboles a.; [3j, etc., qu'il est facile d'expliciter. On est ainsi conduit tout naturellement a les interpreter dans Ie cadre trace par les travaux [5] et [7] cites plus haut. Soit R un anneau, suppose commutatif et unitaire, mais sur lequel il n'est pas necessaire en definitive de faire aucune hypothese supplementaire. Suivant [7], la theorie de la composition s'etend aux 'modules projectifs' de rang 2 sur R, et aux formes quadratiques definies sur de tels modules. Pour abreger, nous ne considererons que des modules libres; c'est a ceux-ci d'ailleurs que la methode de localisation suivie dans [7] permet toujours de se ramener. Par Ie choix d'une base, une forme quadratique sur un tel module s'ecrira: !(X, Y) == AX 2 + BXY + Cy 2 ,
avec A, B, C dans R; eventuellement on pourra supposer que! est 'primitive', ce qui veut dire que l'ideal de R engendre par A, B, C est R lui-merne. A f on attachera l'anneau S == R[8] defini par 8 2 = BO - AC; c'est Ie quotient de l'anneau R[T], ou Test une indeterminee, par I'ideal de R[T] engendre par T 2 _ BT + AC. En merne temps on attachera a f deux R-modules libres M, M', de bases respectives (a, [3) et (a', [3'); on en fera des S-modules en posant a8 == Ba - A[3, a'B' == Ba'> A[3' ,
[38 == Ca, [3'8' == Ca' ,
avec ()'==B-8; 0 et 8' sont les racines de T 2 - B T + A C = O dans S.
A. Weil / Gauss et la Composition des Formes Quadratiques Binaires
911
L'isomorphisme de M sur M' (en tant que R-modules) determine par (a, f3)-+ (a', f3') est compatible avec I'automorphisme de 5 donne par (j -+ (j'; I'un et I'autre seront notes ~ -+ f. On pourrait aussi effectuer la construction de 5, M, M' au moyen de P'algebre de Clifford' C(f) de f. Celle-ci est en effet Ie R-module libre de base (1, £0, TJ, (j) muni de la multiplication non commutative definie par la table £02=
A.l,
TJe
=
(j,
eTJ = (j' = B . 1- (j ;
5 n'est autre alors que la sous-algebre C(f) = R . 1 + R . () de C(f); la 'partie impaire' c(f) = R . £0 + R . TJ de C(f), en tant que module a droite sur 5 = C (f), est alors isomorphe aM; en tant que module a gauche sur 5, c'est Ie module 'oppose' au module a droite M'. Au lieu d'adopter, comme Ie fait [7], Ie langage des algebres de Clifford, nous nous contenterons du formalisme decrit au n° (10); il consiste a poser une application bilineaire de M x M' dans 5, notee (~, TJ')-+ ~. TJ', definie par a'a'=A,
a . f3'
= (j,
f3'a'=(j',
f3' f3'
=
c.
Si ~ = ax + f3y est un element quelconque de M, avec x, y dans R, on aura f = a'x + f3'y, puis f(x, y)
= ~. f = (ax + f3y)' (a'x + f3'y).
On ecrira f(~) = ~. f, ce qui revient a identifier avec M Ie R-module sur lequel f a ete don nee. Soient a present fo, fl' f2 trois formes quadratiques; on leur associera comme ci-dessus des anneaux S, = R[(j;] et des modules M;, M;. On dira que f2 est transformable en fofl s'il existe une application bilineaire j.t de Mo x M I dans M 2, telle que I'on ait
ce qui revient aux formules (1.2) et (1.3). En transportant j.t aux modules M;, on obtient une application j.t' de M~ x M; dans M~, et, avec les notations qu'on vient d'expliquer, on pourra ecrire:
912
A. Weif / Gauss et fa Composition des Formes Ouadratiques Binaires
Supposons de plus, avec M. Kneser [7, Section 3] que les t. soient primitives et 'de noyau nul' (hypothese qui se reduit a B 2 = 4AC';' dans les cas examines precedemment): it dernontre alors [7, Th. 1] qu'il existe des homomorphismes 'Yo' 'YI de So, SI dans S2' teIs que l'on ait:
°
pour gj EM;, {; E S, (i = 0, 1); en un certain sens, cela peut s'exprimer en disant que J.L est compatible avec les structures de S;-modules mises sur les M;. Sa demonstration opere par 'localisation'; en procedant exactement de merne, on peut montrer aussi qu'on a dans ces conditions:
pour ~; E M;, 17; EM; (i = 0, 1). Cela pose, les calculs des n'" (10)-(11) deviennent legitimes, quelques details pres qui demanderaient de Iegeres modifications, chaque fois que les M;, M; sont definis comme les R-modules Iibres de bases respectives (ai' {3;) et (a;, {3;). Quant aux resultats du n° (14), ils sont consequences de l'associativite de la composition, appliquee aux trois modules M~, M o, M;.
a
References [1] A. Weil, Number Theory, An approach through history (Birkhauser, Boston, 1983). [2) P.G. Lejeune Dirichlet, Vorlesungen tiber Zahlentheorie, herausgegeben und mit Zusatzen versehen von R. Dedekind (Vieweg, Braunschweig, 1863); 2' ed. (1871); 3' ed. (1879); 4" ed. (1893). [3] E. Kummer, Collected Papers I (Springer. New York, 1975). (4) F. Klein, Ausgewiihhe Kapitel der Zahlentheorie (II), Zweiter Hauptteil, Von der Composition zu derselben Discriminante gehorigen ganzzahligen Gitter, ausgearbeitet von A. Sommerfeld und Ph. Furtwiingler (Gottingen, 1897) 94-221. [5] J. Towber, Composition of oriented binary quadratic form-classes over commutative rings, Adv. Math. 36 (1980) 1-107. [6) P.G. Lejeune Dirichlet, De formarum binariarum secundi grad us compositione, Crelles J. 47 (1854) 155-160; Werke II, 105-114. [7) M. Kneser, Composition of binary quadratic forms, J. Number Theory 15 (1982) 406-413.
l.A. BARROSO editor, Aspects of Mathematics and its Applications © Elsevier Science Publishers B.V. (1986)
913
THE STONE-WEIERSTRASS THEOREM AND A CLASS OF BANACH LATTICE ALGEBRAS Guido ZAPATA! Instituto de Matemdtica, Universidade Federal do Rio de Janeiro, 21944 Rio de Janeiro, RJ, Brasil Dedicated to Professor Leopoldo Nachbin on his sixtieth birthday
O. Introduction The Stone-Weierstrass theorem (SWT for short) is a fundamental result in analysis, and has been a source of continued inspiration, both for research and applications. To point out some trends in research, having SWT as a most stimulating starting point, we should quote subjects like C*-approximation, differentiable approximation, lattice approximation, nonarchimedian approximation, weighted approximation and uniform algebras. For additional information and references see [1], [2], [6], [21], [24], [28], [30], [34] and [36]. We notice that the generalizations of SWT usually are centered around a main feature of the theorem like the algebra, antisymmetric, lattice, module or weighted, aspect. At least this is true at the very beginning, since, for instance the weighted approximation theory, developed by Nachbin and his school, has considered several aspects, including those mentioned above. As in the related Stone-papers, here we consider both the lattice and algebra aspect, but with the additional assumption of completeness which in our approach is crucial. In fact, let X denote a locally compact Hausdorff space containing at least two points. Keeping in mind some basic properties of cgo(X) related to SWT, and also some facts concerning the representation of Banach algebras, and Banach lattices, we are led to consider the class f!F!t.s4 (X), of all strongly separating Banach lattice algebras of real continuous functions on X, vanishing at infinity.
I
The author was partially supported by CNPq, Brasil.
914
G. Zapata / Stone- Weierstrass Theorem and Banach Lattice Algebras
A natural problem to consider is the following: under which conditions does A E ~:£d(X) have the SW property, that is, a sub algebra B of A is dense if, and only if, B is strongly separating? This paper is devoted to give an answer to such question and give a precise description of the corresponding class Y"U!(X). We notice that the class Y"U!(X) is also natural in another sense. In fact, there exists a directed set F*(X) of Riesz norms on
G. Zapata / Stone- Weierstrass Theorem and Banach Lattice Algebras
915
equivalence of several properties concerning pointwise description of closures of algebras, ideals, order ideals, Stone lattices, polynomial algebras and modules over separating subalgebras, as well as dense subalgebras and dense vector lattices. Further, it gives a complete description of the class ff"W'(X). An important application concerns approximation results for cgc(X) endowed with the inductive limit topology. Finally, in Section 4 is given the proof of the Main Theorem. Part of this research was done at the University of Paderborn, West Germany, during our stay, February-March, 1984, under the CNPqGMD agreement. We would like to express here our acknowledgments to those institutions, and especially to thank Professor Klaus Bierstedt, from University of Paderborn, for his kind hospitality and some stimulating discussions. Let us finish this introduction with the information that some generating ideas of this paper were originally presented in a talk at an International Analysis Conference honouring Professor Leopoldo Nachbin. That conference was held at Universidade Federal do Rio de Janeiro during 2-6 August 1982, on the occasion of Professor Nachbin's sixtieth birthday.
1. Preliminaries
In the following, X will denote a locally compact Hausdorff space, F a real locally convex Hausdorff space and cgS(F) the set of all continuous seminorms on F. Both X and F are assumed to contain at least two points. Let cg(X, F) denote the vector space of all continuous functions from X to F. When F=IR, we write ~(X) for cg(X,IR). Under the operations defined pointwisely, cg(X) is a lattice and an algebra (lattice algebra, for short). cgo(X) and cgJX) will denote the lattice algebras of all I E ~(X) which vanish at infinity, and, respectively, supp(f) is compact. Let A C cg(X) be a vector lattice endowed with a norm 11.11. If 11/11,,::;; /lgil for all I, g E A such that Ifl,,::;; jgj, then 11.11 is called a Riesz norm on A. A lattice algebra A C cg(X) endowed with a Riesz norm /1.11 such that (A, /I ·11) is complete is called a Banach lattice algebra. 11.110 will denote the sup norm on the space of all bounded real functions on X, A non-empty subset L C ~(X) is solid if gEL and I E ~(X) such that III,,::;; Igi imply that IE L. A solid vector subspace of ~(X) is called (order) ideal.
916
G. Zapata f Stone- Weierstrass Theorem and Banach Lattice Algebras
Let PC ce(X, F) be a non-empty subset. The equivalence relation X/P defined by P on X is introduced as follows: if x, y E X, then x - y modulo X/ P if g(x) = g(y), for all g E P. If [x]p denotes the equivalence class modulo X/P then [x]p = [x]F'op by the Hahn-Banach theorem. P is separating if [x]p = {x}, for all x E X. Also, P is strongly separating if it is separating and does not vanish at any point, that is Z(P) = {x E X Ig(x) = 0 for all g E P} is empty. Further we define 5(P) as the open set {x E XI g(x);t 0 for some g E P}. Assume that PC ce(X, F) is a vector space and let Be ce(X) be an algebra. Then P is a B-module if BP C P. If A = F' 0 P is an algebra such that A ® F C P, then P is a polynom ial algebra. If x E X, P(x) = {g(x)/ g E P}.
2. The Class f!F!£s1.(X) Definition (2.1). A lattice subalgebra A C ceo(X) is called a Banach function lattice algebra on X if it is strongly separating, and if there exists a Riesz norm /1./1 on A such that (A, /I. I/) is complete. f!F!£s1(X) will denote the class of all Banach function lattice algebras on X. Examples (2.2). (1). Let u be a positive, locally bounded function on X, such that inf u > O. If ceuo(X) denotes the solid vector space of all f E ce(X) such that uf vanishes at infinity, then A = ceuo(X) E f!F!£s1(X). In fact, cec (X) CAe ceo(X) and a suitable norm is f~ /luflb·
The space ceo(X) corresponds to the case u = 1. (2). Let f.L be a positive Radon measure on X, 1:!S P < +00, A = ceo(X) n U (f.L). Then A E f!F!£s1 (X), since it is a solid vector space such that cec (X) CAe ceo(X) and a suitable norm is
The space lp (5) corresponds to a particular case. (3). Assume in addition that X is a IT-compact, noncom pact space. Let 11 E ce(X) be such that u(x) > 0, for all x E X and such that v = 1/u
G. Zapata / Stone- Weierstrass Theorem and Banach Lattice Algebras
917
vanishes at infinity. If C6iub (X) denotes the solid vector space of all f E C6i(X) such that uf is bounded, then A = C6iub (X) E fJP.2.st1(X). In fact, C6ic (X) cAe C6io(X) and a suitable norm is f~llufllo .
Also the spaces A o = C6iu o(X) and Al = A o+ R v are closed lattice subalgebras of A. Hence A o, Ai' A E fjP.2.st1(X) and the inclusions A o C Al C A are proper, since Al is not solid. Remark (2.3). We recall the basic property concerning continuity of positive linear operators from a Banach lattice to a normed lattice [28]. Consequently, if A E fJP.2.st1(X), then the Riesz norm under which A becomes complete is unique (up to an equivalence) and dominates the sup norm. Then the corresponding norm will be denoted by /I. IIA' Since we can assume that II.IIA ;?; II . 11o, it follows that II . IIA is an algebra norm, hence A is a Banach algebra too. Remark (2.4). Given A E fJP.2.st1(X) and a real function f on X, then f E A if (and only if) there exists a Cauchy sequence (In) in A such that f is the pointwise limit of Un)' Lemma (2.5). Let (Pn) be the sequence of real polynomials defined by (2.1)
(nEN,tER).
Then 0 ~ Pn (t) ~ Pn+l(t) ~ It/ on [-1, 1]' for all n EN. Further Pn (t)~ ItI uniformly on [-1,1]. Proof. The first part is clear by induction. Since Pn (t)~ ItI pointwisely on [-1,1], it is enough to apply Dini's theorem. D
Definition (2.6). (An auxiliary function.) p E C6i(R) is given by pet) = ~ + It - ~I-It -
11
(tER).
Then p = 0 on (-oo,~], p = Ion [1, +00) and O~p ~ 1. Lemma (2.7). Let A E fJP.2.st1 (X) and f E C6ic (X) n A. Then p» f E A, precisely p f E clos R [fl. 0
918
O. Zapata I Stone- Weierstrass Theorem and Banach Lattice Algebras
Proof. Let M = IIfllo and consider the sequence of polynomials defined by
qn(t) =
~+ (M +1)[ r, (~~\) - r, (~-+\) ]
(n E N, t E R),
where (Pn) is given by (2.1). Let (n E N).
Since I(t - s)/(M + 1)1 0;;; 1 for all t E [-M, M] and s E [0, 1], and qn (O)~ 0, it follows that
On ~ p
uniformly on [- M, M] .
Let En = lip f - On fila, n EN, and fix g E A+ such that g;;;: 1 on supp(f). Then 0
0
for all n, m.
II. IIA is a Riesz norm, it follows
that (On 0 f) is a Cauchy sequence in pointwisely. From Remark (2.4) we conclude that p» f E A, and it is clear that p» f E clos R [f], since On 0 fER [f]. 0 Since
A. Further, On
0
f ~ p» f
Lemma (2.8). Let A E :¥2!s1.(X), 0 ~ Be A be a lattice subalgebra, x E S(B) and Van open neighbourhood of [X]B' Then [X]B is compact, and there exists a (J E c1os(B) n "€c(X) such that (J = 1 on a compact neighbourhood of [x ]B' 00;;; (J 0;;; 1, and supp (J C V. Proof. Let K = [x ]B' There exists an fa E B such that fo(x) ~ 0, hence K is a closed set contained in the compact set {y E Xllfo(y)1 ;;;: ~Ifo(x )IJ, thus K is compact. Since K C S(B), there exists agE B such that g > 1 on K. For p given by Definition (2.6), let
Then fEB, since B is a lattice algebra. Also supp(f) is compact since g E "€o(X) and p vanishes on a neighbourhood of O. We notice also that f> 1 on K. If supp(p 0 f) C ~ we take (J = p 0 f. From Lemma (2.7) it follows that (J E clos(B) n "€c (X) and it is clear that the other conditions are fulfilled.
G. Zapata / Stone- Weierstrass Theorem and Banach Lattice Algebras
919
Now consider the case supp(p 0 f)\ V = H ¥ 0. Then for every y E H there exists a gy E B such that gy(x) ¥ g/y), since y fi! K. Let
where c is a constant such that I, (x) > 1. Then from Lemma (2.7) it follows that I y E cIos(B) n e, (X). Moreover, I y(y) = 0 and I, > 1 on K. If 8y = pOly, then 8y E cIos(B) n r5c (X) by Lemma (2.7). Further, 8y = 1 on a compact neighbourhood of K, 8y = 0 on a neighbourhood of y and 0",;; 8y "';; 1. Since supp(8y ) C supp(p f), by taking the product of a suitable finite number of the functions ()y, y E H, we obtain a function 0 with the required properties. 0 0
Lemma (2.9). Let A E g;'y:.st/(X), 0 ¥ Be A be a lattice suba1gebra, K C S(B) a compact set, VI"'" Vn an open covering 01 K such that x E K n Vi ~ [X]B C \1;, i = 1, ... , n. Then there exists ()I"'" en E clos(B) n e,(X) such that
onK, 0",;; 8i
",;;
1 and
supp(OJ C \I;
(i=I, ... ,n).
Proof. For every x E K there exists an i E {I, ... , n} such that x E \1;, hence [X]B C \I; by assumption. From Lemma (2.8) there exists an Ix E clos(B) n r5c (X) and a compact neighbourhood of [X]B such that
u,
f, =
1 on H x '
0 ~ t.« 1 and
supp(fx) C \1;.
The compactness of K implies the existence of a finite subset Fe K such that tc
c: U n.. xEF
If F; = {x E FI H, C \I;}, i = 1, ... , n, we can assume that every F; is non-empty. Otherwise, if F; = 0 we define 0; = O. Fix i E {I, ... , n], let F; = {XI' ... ,xm } , Ik = IXk' k = 1, ... , m and define
920
G. Zapata / Stone- Weierstrass Theorem and Banach Lattice Algebras
gl =
i..
g2 = (1 - f1)f2 ,
+ ... + gm' then hi E c1os(B) n cgc(X) and supp(hJ C\/;. Also, o~ hi ~ 1 and hi = 1 on U xEF; H x' since hi = 1 - (1- fl) ... (1- fm)' If hi = gl
Finally, we define
It is clear that the functions 8; verify the stated conditions. 0
Remark (2.10). The above lemma is a generalization of theorem 2.13 of [26] (case B = cgc (X)), and its proof is based on the corresponding proof of that theorem. Theorem (2.11). If A E PREsti(X), then
e,(X) C A.
Proof. Given x, y E X, x 'I y, there exists an f E A such that f(x) = 1 and f(y) = 0, since A is a strongly separating algebra. If g = lp f, where p is given by Definition (2.6), then g E A n cgc(X) (d. the proof of Lemma (2.8)). Also g(x) = 1 and g(y) = O. Let 0
B = A
n cgc (X) .
Then B is a strongly separating lattice algebra such that c1os(B) n cgc (X) = B. Also [X]B = {x} for every x E X, since B is separating. Let h E cgc (X), h 'I 0, be given. Fix 8 E B such that 0 ~ 8 ~ 1 and 8 = 1 on a compact neighbourhood K of H = supp(h). The existence of such a function is a consequence of Lemma (2.9). Given e > 0, from the compactness of H and the fact that 5(B) = X, it follows that there exists a finite number of functions f1' ... , t, E B such
G. Zapata / Stone- Weierstrass Theorem and Banach Lattice Algebras
921
that, letting
v; = {x E int KII/;(x)- h(x)/ < s}
(i= 1"",n),
H C VI U ... U Vn. By Lemma (2.9) there exist
e
l , ••• ,
en E B
such that
n
on H, o,,;;e;,,;;1
and
(i = 1, .. " n).
supp(e;)cv;
If g = :L;=I e;/;, then g E Band supp(g) C K. Since h that
= :L;=I O;h, it follows
Since E is arbitrary, given a sequence (En) such that En> 0 and En ~ 0, there exists a sequence (gn) in B such that (for all n) .
In particular, h is the pointwise limit of the sequence (gn)' Further, Ign - gm I,,;; (En + Em)e, for all n, m, whence (gn) is a Cauchy sequence in A, since II.IIA is a Riesz norm. To finish the proof that h E A, it is enough to apply Remark (2.4). 0 Corollary (2.12). If A, B E [JF2d(X), then A
nB
E [JF2d(X).
Proof. Let C = A n B. Clearly C is a lattice subalgebra of ~o(X) and it is strongly separating by the preceding theorem. Also, a suitable norm is
I ~ max{II/IIA' II/IIB}' 0 Corollary (2.13). Every A E [JF2d(X) is a Stone lattice. Proof. Given I E A we notice that Hence I /\ 1 = 1- (f - f /\ 1) E A. 0
I-I
r;
1E
~c (X)
since
I
E ~o(X).
922
G. Zapata / Stone- Weierstrass Theorem and Banach Lattice Algebras
Corollary (2.14). If X is compact, there is just one A E :Ji2d(X), viz. A = ce(X). Remark (2.15). The above Corollary allow us to give an example of a strongly separating lattice subalgebra A of ce[O,I] such that A g :Ji2d[0, 1]. In fact, it is enough to consider A = Lip[O, 1], the set of all f E ce[O, 1] such that sup{lf(x) - f(y)I/lx - y) I x, Y E [0, 1], x'" y} <
+00.
Definition (2.16). Let A E :Ji2d(X). II.IIA is a-continuous if for every decreasing sequence Un) in A, converging pointwise to 0, it holds that I, -+ 0 (that is IIfn IIA -+ 0). Examples (2.17). (1). Let A = ceo(X), Then II.IIA is rr-continuous, as a consequence of Dini's theorem. (2). Let A = ceo n u (J.L) be as in Example (2.2)(2). Then, as a consequence of Dini's theorem and Lebesgue's monotone convergence theorem, it follows that II./IA is o-continuous. (3). With the notation of Example (2.2)(3), let A = ceuo(X) + IR v. Consider an increasing sequence (en) in cec(X) such that lim en = 1 pointwise and let I, = (1 - en )v, n EN. Then In ~ fn+l for all n and lim In = 0 pointwise, but t, ..,4 O. In fact, fnu = 1 - en' hence Ilfn IIA = 1, for all n EN. Proposition (2.18). Let A E :Ji2d(X) be such that IJ.IIA is a-continuous. If B is a closed subalgebra of A, then B is a lattice. Further, if B ,., 0, given a a-compact set V C S(B), there exists an increasing sequence (q>n) in B n e,(X) such that
o~ q>n ~ 1,
for all n E N
lim
and
q>n (x) =
1,
for all x E V.
Proof. In order to prove that B is a lattice it is enough to prove that 1/1 E clos R [f] for all / E B such that Ifl ~ 1. In fact, fix any such / and let (Pn) be the sequence of polynomials introduced in Lemma (2.5). Define (n E N).
Then Un) is a decreasing sequence in A such that lim In = 0 pointwise, hence In -+ 0 by assumption. Since P; IE IR [f], it follows that 1/1 E clos 1R[f]. 0
G. Zapata / Stone-Weierstrass Theorem and Banach Lattice Algebras
923
Now, let (Kn ) be an increasing sequence of compact sets in X such that V= U n E N K n. Since K; C S(B), there exists a gn E B such that gn ~ Ion K n • For p given by Definition (2.6) let and Then hn E B since B is a lattice algebra, and it is clear that hn has compact support. Hence l/Jn E B by Lemma (2.7), and by construction 0,,;;; l/Jn ,,;;; 1 and l/Jn = 1 on K n. To finish the proof, it is enough to define (n E N).
Then it is clear that the sequence ('iOn) has the stated properties. 0
Lemma (2.19). Let A E ~2.s4(X). Then II./IA is a-continuous if and only if cgc (X) is dense in A. Also, in this case, A is solid. Proof. Assume that II.IIA is o-continuous and let I E A, I oj. O. Since S(f) is a u-compact open set, there exists an increasing sequence ('iOn) in cgc(X) such that 0,,;;; 'iOn ,,;;; 1 and lim 'iOn (x) = 1, for all x E S(I). Then 'iOJ --'> I since II - 'iOJI is a decreasing sequence in A, converging pointwise to O. Conversely, assume that cgc (X) is dense in A and let (In) be a decreasing sequence in A converging pointwise to o. Given e > 0, for every n E N there exists an I~ E cgc (X) such that
Let s, = In hence
A
I/J
Then s, E
e, (X),
0,,;;; s, , ;;; In and lin -
e;I,,;;; lin -
I~',
If hn = inf{go' ... ,gn}' n EN, then hn E cgc(X), 0,,;;; h n ""! In and In - h; ,,;;;
L 7=0 U. -
gj), hence for all n EN.
924
G. Zapata / Stone- Weierstrass Theorem and Banach Lattice Algebras
Further IIhnllA ~lIh~/21Iollh~/2I1A. We notice that (h~2) is a decreasing sequence in ~o(X), converging pointwise to 0, hence IIh ~211o ~ by Dini's theorem. Then for some no ENlarge enough:
°
Now let I E A and g E ~(X) be such that Igl ~ III. Since A is a vector lattice we can assume that ~ g ~ f Also, by assumption there exists a sequence (fn) in ~c(X) such that In ~ f. Since Illnl II g-llml II gl ~ lin - I; I for all n, mEN, it follows that (lin I II g) is a Cauchy sequence in ~c(X), which converges pointwise to g, hence g E A by Remark (2.4). 0
°
Definition (2.20). r(X) is the directed set of all Riesz seminorms on ~c(X).
We list some basic (and well-known) facts concerning the set r(X): (2.1)
Let u be a positive, locally bounded function on X, and positive Radon measure on X. Then
I ~ lIulllo
and
JL a
I ~ JL(I/J)
define elements in F(X). A seminorm of the first type is called a weighted seminorm. (2.2)
Let Y E F(X) and 1 ~ P < +00. If Yp is defined by (fE~c(X»,
then Y» E r(X). (2.3)
Assume, in addition, that X is paracompact. Let (X;)iEI be an open covering of X, and (O;);EI be a continuous partition of unity subordinated to the given covering. If (Y;);EI is a family of seminorms such that 'Yi E F(Xi ) , for all i E I, then I~
2: 'Y;(OJ)
iEI
defines an element in r(X). Further, for every 'Y E r(X) there
G. Zapata / Stone> Weierstrass Theorem and Banach Lattice Algebras
exists a weighted seminorm ')I' E r(X) such that ')I only if, X is (J"-compact. (2.4)
~
925
y' if, and
Assume, in addition, that X is not countably compact. Let j.t be an unbounded positive Radon measure on X such that the function
x ~ inf{j.t (f)1 I E Cf5c (X)+' I(x) = I} is bounded. Let ')I(f) = j.t(IIi), IE Cf5c (X), and 1 ~ P < +00, Then yp is not equivalent to any weighted seminorm ')I' E r(X). (2.5)
The topology generated by r(X) coincides with the inductive limit topology.
(For definitions, proofs and additional information see for instance [4], [7], [27] and [35].) Given ')I E r(X) let yO be the functional on Cf5(X), with values in [0, +00]' defined by
With the usual conventions about +00, it follows that yO is a sub additive, positively homogeneous functional on Cf5(X). Also it is increasing in the sense ')I0(f) ~ yO(g), if III ~ Igi.
Definition (2.21). Given y E F(X), let ~y(X) denote the solid vector space of all bounded continuous functions I on X such that y0(f) < +00. ~y (X) is endowed with the natural Riesz norm
111.11,- max{I I II0'
y0(f)} .
Also, let Cf5y (X) denote the closure of Cf5c (X) in ~y (X). When y obtain respectively, the spaces Cf5b (X) and Cf5o(X).
=
0, we
Lemma (2.22). Let y E F(X). Then ~y (X) is a Banach lattice algebra. Further, (X), Cf5o(X) n ~r<X) E fliIt.stJ(X).
e,
926
G. Zapata / Stone- Weierstrass Theorem and Banach Lattice Algebras
Proof. Since £YJ,,(X) is a solid vector space of bounded continuous functions, it follows that it is also a lattice algebra and clearly 11.111' is an algebra norm. Let (fn) be a sequence in £YJy(X) such that
L
IIlnlly < +00.
n~O
Since Illn I~ "'" Illn 111" for all n EN, it follows that the series L:=o Itn I converges in ceb (X) to a function g. Also, the series L:=o In converges in ceb (X) to a function f We claim that yO(g) "'" L:=o y°(fn)' In fact, let 0 E ceceX) be such that o"'" 0 "'" 1. Define m
(m=O,I, ...). n=O
Then gm E cec(X), gm "'" gm+l and gm ~ 8g pointwise, hence uniformly by Dini's theorem, since Og E cec(X). Let q; E cec(X)+ be such that q; = 1 on supp(8). Then y(Og - gm)"'" IIOg - gmlloy(q;)~ 0, that is y(gm)~ y(Og). But y(gm) "'" L:=o y«()ln) "'" L:=o y°(fn)' hence
y«()g)"'"
L
y°(fn)'
n=O
Since 0 is arbitrary, and y(h)= y(lhi) for all hE cec(X), it follows that yO(g) "'" L:=o y°(fn)' In particular, IE e, (X) since IE e,(X) and yO(f) "'" yO(g) "'" =0 Illn 111' . Now we will prove that 1= L :=oln in the 11.111' sense. In fact, let E > 0 be given. There exists an mo E N such that
L:
Fix m ~ m o and let hm = L:=m In. The preceding discussion shows that yO(hm)""'L:~mll/nlly and Ilhmllo""'L:=mlllnllo' hence Ilhml/y""'E. Since m is arbitrary we obtain that 1= 2.:=0 In' and the first part of the Lemma is proved. To finish the proof, it is enough to notice that ce y(X) and ceo(X) n £YJ y (X) are closed subspaces of £YJ y (X). D
G. Zapata / Stone- Weierstrass Theorem and Banach Lattice Algebras
927
Theorem (2.23). Let A E fJi.2.s4(X). Then II.IIA is a-continuous if, and only if, there exists a 'Y E reX) such that A = 'f6'/X). Proof. If A = 'f6'y(X), for some 'Y E reX), from Lemma (2.19) it follows that II. IIA is a-continuous. Conversely, assume that II .IIA is a-continuous and let 'Y = II.IIA I'lic(X)' B = e, (X). From the above Lemma, B E fJi.2.s4 (X), and II. IIA = II. liB on the dense subset (X), hence A = B by Remark (2.4). 0
e,
Remark (2.24). At this point we should notice that the requirement concerning completeness of each A E fJi.2.s4(X) was crucial in order to obtain the main results of the class fJi.2.s4(X), especially in Theorem (2.11) and Theorem (2.23). Definition (2.25). In analogy with the definition of 'f6'o(X, F), given A E fJi.2.s4(X) and a (real) locally convex Hausdorff space F ¥- 0, let A F denote the set of all f E 'f6'(X, F) such that aU) E A, for every a E 'f6'S(F). If A F is a vector space, we endow it with the locally convex topology defined by the family of seminorms (a E 'f6'S(F». Even in the case F = R, A F is not necessarily closed under addition. For instance, let X = Nand u be a sequence of real numbers such that II ~ 1 and such that v = l/u converges to O. Let
(ct. Example (2.2)(3». Notice that given a real sequence s it follows that sv E A if, and only if, s converges. Define the real sequences s' and s" by
s ~ = 1 if n E 2N, s~ =
1 if n E 4N,
s ~ = -1
otherwise,
-1
otherwise.
s~ =
Then s'o, s"v E A R since Is'vl = Is"vl = v E A. If s = s' + s", we have s2(2n+l) = 0 and S4n = 2 for all n EN, hence lsI does not converge. that is Islv g A, or equivalently, s'v + s"v gAR' We recall that a vector subspace Be 'f6'(X F) is a polynomial algebra if F' B is an algebra such that F' B ® FeB. 0
0
928
G. Zapata I Stone- Weierstrass Theorem and Banach Lattice Algebras
Lemma (2.26). Let A E fJ'2d (X) be solid and F ¥ 0 a locally convex Hausdorff space. Then A F is a polynom ial algebra such that F' ° A F = A. Proof. Let f, g E A F , C E IR and a E ceS(F). Since 0 ~ at] + g) ~ a(f)+a(g), and a(f+g)Ece(X), it follows that a(f+g)EA. Also a (c/) E A. Hence A F is a vector space, and it is clear that F' ° A F C A, since A is solid. Conversely, let h E A, v E F, v ¥ 0, and cp E F' such that cp (v) = 1. Notice that h 0 v E A p since for every a E ceS(F) holds a(h 0 v) = Ih 10' (v) E A. Therefore h = cp(h 0 v) E F' ° A F and finally
F'oA F 0 F = A0FCAF • 0
Remark (2.27). The assumption, in the above Lemma, that A E fJ'2d(X) should be solid in order to obtain a polynomial algebra A F is not too restrictive. In fact, let A ° be the solid vector space of all g E ce(X) such that Igi ~ f, for some IE A+, endowed with the Riesz norm
Then AOE fJ'2d(X) is solid and A is a closed subspace of AO. Further, A F is a closed subspace of A~, for every locally convex Hausdorff space F¥ O.
Remark (2.28). Let A E fJ'2d(X) be solid, Fa normed space. Then A F is normed. Further, A F is complete if F is complete, and conversely. Definition (2.29). Given A E fJ'2d(X), let A c denote the vector space of all complex continuous functions I on X such that 1= g + ih, where g, h E A, endowed with the norm
Then A c is a (complex) Banach space, and is called the complexification 01 A (d. [28]). We recall that a set B C ce(X, C) is sell-adjoint if IE B, for every IE B, where 1 denotes the complex conjugate of f. Also, given IE ce(X, C), let 1# denote the function (Ref, Im/)E ce(X, 1R 2 ) .
G. Zapata / Stone- Weierstrass Theorem and Banach Lattice Algebras
Lemma (2.30). Let A E
~!tJil(X)
929
be solid. Then the IR-linear mapping
is a homeomorphism. Further, a vector subspace Be A c is a self-adjoint algebra if, and only if, B"' c A R2 is a polynomial algebra.
Proof. Let f E A c and a E eeS(1R 2). Then, for some constant C> 0 it holds a(f"')";; C(iRe fl + 11m fi) .
Hence a(f"') E A, since A is solid and a(f"') E ee(X)+. Therefore f"' E A R 2 , since a is arbitrary. Clearly the mapping f~ is IR-linear and 1: 1. Also, given (f1' f2) E A R 2, notice that IfII + If21 E A and t, E ee(X), j = 1,2, hence !J E A, j = 1,2, since A is solid, thus f = t, + if2 E A c and 1"'= (f.'/2)· Further, if atu, v)= luj+ lvI, (u, v)EIR 2, then
r
(for all f E A c ) , hence the given mapping is a homeomorphism. Assume that B is a self-adjoint algebra and let B o = {Re(f)1 fEB}. Then ([19, Lemma 4, Sect. 17]) B o is a sub algebra of ee(X) such that B = B o + iBo. Hence B"' = {(g, h)1 g, h E B o} is a vector space and {cp 0 fl cp E (IR 2)',/ E Bill} == {ag + bhl a, b E IR, g, h E B o} = B o is an algebra such that B o ® IR 2 C E"'. Conversely, assume that B# is a polynomial algebra and let fEB. Then Re(f)(l,O), Im(f)(O, 1) E B#, hence = (Re(f), - Im(f)) E B# and 1E B. 0
r
3. A Generalization of the Stone-Weierstrass Theorem Definition (3.1). Given A E ~!tJil(X), and n = 0, ... ,6, we say that property S, (A) holds if, respectively, the following is true: So(A): 11.IIA is o-continuous. S)(A): A sub algebra Be A is dense if, (and only if), B is strongly separating. In this case we write A E Y'W(X). S2(A): For any subalgebra Be A and f E A, it holds f E clos B if, (and only if), f is constant on each [x lB' x E X, and vanishes on Z(B).
930
0. Zapata / Stone- Weierstrass Theorem and Banach Lattice Algebras
S3(A): For any ideal 1 C A and f E A, it holds f E clos 1 if, (and only if), f vanishes on Z(1). SiA): A vector sublattice LeA is dense if, (and only if), for every Xl' X z E X, Xl 'i- xZ' there exists agE L such that g(xJg(x z) < O. We say that L is order separating. Ss(A): For any Stone lattice Be A and f E A, it holds that f E clos B if, (and only if), f is constant on each [x ]B' X E X, and vanishes on Z(B). S6(A): A is solid and for any solid semigroup B C A and f E A, it holds f E clos B if, (and only if), f vanishes on Z(B).
Remark (3.2). Let A
= reo(X). Then So(A) is the well-known Dini property. Further, with the only exception of Ss(A) (and after a reformulation of S4(A», the remaining properties correspond to classical Stone theorems on approximation. (See [34].)
Remark (3.3). Let B C A be a sub algebra and f E A such that f is constant on each [X]B' X E X, and vanishes on Z(B). Then, given Xl' Xz E X, there exists agE B such that g(x;) = f(x;), i = 1,2. (Cf. (19, Lemma 3, Sect. 17].) Definition (3.4). Let A E ffF2.s4(X), and F'i- 0 be a real locally convex Hausdorff space. We say that property P(A, F) holds if A F (d. Definition (2.25» is a polynomial algebra and also, for any polynomial algebra PC A F and f E A F , it holds f E clos P if, (and only if), for every Xl' X z E X and a E res (F) there exists agE P such that a(f(x;)- g(x;):s;; 1
(i = 1,2) .
Remark (3.5). When X is compact and A
= ee(X), the property P(A, F) is related to a generalization of the Stone-Weierstrass theorem due to Peiczynski ([22]).
Remark (3.6). If A E ffF.2.s4(X) and P(A, IR Z) holds then, in particular, we obtain a theorem describing the closure of self-adjoint sub algebras of AI(; (ct. Lemma (2.30».
Remark (3.7). Assume that A F is a polynomial algebra. Let Pc A F be a polynomial algebra, B = F' P and f E A F satisfies the two point condition with respect to P, in Definition (3.4). Then, as a consequence of the 0
G. Zapata / Stone- Weierstrass Theorem and Banach Lattice Algebras
Hahn-Banach theorem, it follows that [X]B = [X]c1OSP' for all x E X, constant on each [X]B and vanishes on Z(B).
931
I
is
Definition (3.8). Let A E !!Ji!£.91(X), and F;;i 0 be a real locally convex Hausdorff space. We say that property Q(A, F) holds if A F is a module over A and also, for any M C A F which is a module over a separating algebra Be A and lEAp it holds that f E clos M if, (and only if), I(x) E c10s M(x), for all x E X. Remark (3.9). Assume that property Q(A, F) holds. Then, if Me A F is an A-module, then we obtain a simple description of clos M. In particular, if F = 1R 2 , this alIows us to describe the closure of ideals in A c ' through Lemma (2.30). Remark (3.10). Assume that A E !!Ji!£.91(X) is solid and F;;i 0 is a real locally convex Hausdorff space. There are closed polynomial algebras in A F which are not modules over separating subalgebras of A. Also, if dim F> I, there are A-modules in A F which are not polynomial algebras. However, if F = IR and property Q(A, IR) holds, let M be a closed B-module in A, where B is a separating subalgebra of A. Then M is an ideal, in particular, an algebra. Main Theorem (3.11). Let A E !!Ji!£.91(X) and F;;i 0 be a real locally convex Hausdorff space. Then the properties Sn(A) (n = 0, ... ,6),
peA, F) and
Q(A, F)
are equivalent. Further, they hold if, and only if, there exists ayE reX) such that A = cgy (X).
Remark (3.12). Whenever A = cgy(X), for some y E Ft X), the above Theorem gives several theorems of approximation corresponding to the properties SI(A), ... ,S6(A), peA, F) and Q(A, F). We obtain essentially four different theorems, corresponding to the properties SiA), S5(A), peA; F) and Q(A, F), since the others are easy consequences of some of these properties. In particular, when A = cgo(X), we obtain the Stone theorems on dense subalgebras, closed subalgebras, closed ideals and dense lattices. Also we obtain the Pelczynski theorem on closed polynomial algebras. Hence, Theorem (3.11) is an effective generalization of those
932
G. Zapata / Stone- Weierstrass Theorem and Banach Lattice Algebras
classical results. Further, even in the case A == ~o(X), that is in the classical situation, we obtain a new theorem through S5(A). Remark (3.13). As another consequence of Theorem (3.11), we obtain a complete description of the class [f"'JI/(X), whose elements are the Banach lattice algebras A E G;2d(X) for which the Stone-Weierstrass theorem holds. In fact, as shown by the Main Theorem, [f"'JI/(X) is described by ~}' (X), 'Y E r(X). Further, it should be noticed that the space F plays no
role in order that the properties P(A, F) and Q(A, F) hold. In fact, this is determined only by A, through the o-continuity of II.IIA' Further, if A E [f"'JI/(X) then A is solid and every strongly separating polynomial algebra in A F is dense (d. Lemma (2.26». We postpone the proof of Theorem (3.11) and give some applications, other than just splitting the above result and writing down the corresponding statements for particular cases of 'Y. Proposition (3.14). Let A E G;2d(X). Then A E [f"'JI/(X) if, and only if, for every ideal J C A c it holds that clos J == {IE
Adf== 0 on
Z(I)}.
Proof. Assume that A E [f"'JI/(X). Let J C A c be an ideal and f E A c such that f == 0 on Z(I). With the notation of Lemma (2.30) it follows that J# is an A-module and J#(x) == 0 for x E Z(I), J#(x) == 1R 2 otherwise. Hence f# E clos J#, since Q(A, IR 2) holds by Theorem (3.11). Then by Lemma (2.30) we have f E clos 1. The other inclusion is clear. Conversely, assume that the above description of closure for ideals in A c holds. Then ~c (X) + i ~c (X) is a dense ideal in A c ' hence ~c (X) is dense in A, so A E Y'W(X) by Lemma (2.19) and Theorem (3.11). 0 Proposition (3.15). Let A E [f"'JI/(X) and F ¥ 0 be a real locally convex Hausdorff space. Then A F is separable if, and only if, X is a-compact and metrizable, and F is separable. Proof. Let since it is ~c (X) and and ~c (X)
P == ~c (X) ® F. Then P is a dense polynomial algebra in A F , strongly separating. Further, P is separable if, and only if, F are separable. Since the inclusion A ~ ~o(X) is continuous is dense in ~o(X), it follows that (~c (X), II.IIA) is separable if,
G. Zapata / Stone- Weierstrass Theorem and Banach Lattice Algebras
933
and only if, 'fi'o(X) is separable, that is, if, and only if, X is IT-compact and metrizable (d. the proof of Theorem 18 in [37]). 0 Theorem (3.16). Let 'fi'c (X) be endowed with the locally convex inductive limit topology. Let B C e,(X) and f E 'fi'c (X). (i). If B is an algebra or a Stone lattice, then f E clos B if, and only if, f is constant in each [x ]B' x E X, and vanishes on Z(B). (ii). If B is a vector lattice, then B is dense if, and only if, it is order separating. Proof. According to [27, V.6], it follows that the (locally convex) inductive limit topology on 'fi'c (X) coincides with the topology defined by the norms II.I/y (d. Definition (2.21». Now it is enough to apply Theorem (3.11). D Remark (3.17). Let B 1 = 'fi';(R.) and B 1 denote the set of all f E 'fi'c (lR) such that f II is piecewise linear, for some compact interval I which contains
supp(f). Notice that B, is an algebra and B 1 is a Stone lattice such that B I n B 1 = {O}. Remark (3.18). Let r1(X) denote the set of all Riesz norms 'Y on 'fi'c(X) such that 'Y ;",11.1/0 and consider the natural equivalence relation on r1(X)
'Yl - 'Yl ~ 3C1 , C 1 > a such that CI'YI ~ 'Yl ~ C1'Yl'
By the axiom of choice there exists a set r*(X) in r1(X) which contains exactly one norm in every equivalence class, and 11.1/0 E r*(X). Let r;.(X) be the set of all 'Y E r*(X) which are equivalent to some weighted norm (d. (2.1». Then 'fi'y(X), 'Y E r*(X), describes the class :rtJI'(X) and the correspondence 'Y E I" (X) ~ 'fi'y (X) E :rtJI'(X) is 1 : 1. Further, when X is not countably compact, in particular, when X is paracompact noncompact, then the set r*(X) - r,;;(X) is uncountable (2.4). As a consequence, the existing weighted approximation theory does not apply, at least directly, to the lattice algebras 'fi'/X), 'Y E r*(X) - r,;;(X). If X is compact, it is clear that r*(X) = {II .llo}. But there exist noncompact spaces X for which the above relation also holds. For instance, let Y be a IT-compact, locally compact noncompact Hausdorff space and w E f3( Y) - Y, where f3 (Y) denotes the Stone-tech compactification of Y. Then X = f3( Y)\{ w} has the mentioned property ([4, Exercise 2, Sect. 10f Ch. III]). In particular, for these spaces, the class :rtJI'(X) reduces to 'fi'o(X).
934
G. Zapata / Stone- Weierstrass Theorem and Banach Lattice Algebras
4. Proof of the Main Theorem If we assume the equivalence of the indicated properties in Theorem (3.11) then, according to Theorem (2.23), they hold for a given A E fJ'!l!d(X) if, and only if, there exists a 'Y E r(X) such that A = cgy(X).
Now we will prove the equivalence of the mentioned properties in several steps. Step I P(A, F)~ S2(A)~ SI(A)~ So(A)~ P(A, F)
7i\
~ S3(A)~
S6(A)
(1). P(A, F) ~ S2(A). Let B C A be a subalgebra, / E A such that / is constant on each [X]B and vanishes on Z(B), and E > O. Let P = B® F, v E F, v 'i- 0 and g = / @ v. Then P C A F is a polynomial algebra such that F' 0 P = B. Further, for any two points Xl' X z E X, there exists h E B such that h (x j ) = /(x j ) , i = 1, 2, by Remark (3.3), that is g(x j ) = (h 0 v)(x;), i = 1,2. Since h 0 v E P, it follows that g E clos P by property P(A, F). Also F is Hausdorff, hence the Hahn-Banach theorem gives a cp E F' such that cp(v) = 1. If we take a = Icpl E cgS(F), then there exists a g' E P such that II/- cp g'IIA = IIa(g - g')IIA .:; E, and it is enough to notice that cp 0 g' E B, to conclude that / E clos B. (2). S2(A)~ S,(A). Let Be A be a strongly separating subalgebra. Then [X]B = {x}, for all x E X, since B is separating, and Z(B) = 0, since B does not vanish at any point. Now it is trivial that each / E A is constant on each equivalence class [X]B and vanishes on Z(B), hence /E closB. (3). S,(A)~ So(A). Let B = cgc<X). Since B is a strongly separating sub algebra of A, it is dense in A, hence II.IIA is o-continuous by Lemma (2.19). (4). So(A)~ P(A, F). By Lemma (2.19), A is solid, hence by Lemma (2.26), A F is a polynomial algebra such that F' 0 A F = A. Let P C A F be a polynomial algebra and / E A F such that, for any two points Xl' X 2 E X and a E cgS(F), there exists agE P such that a (f(x;) - g(x;».:; 1, i = 1,2. Fix a E cgS(F) and let B = F' P. We can assume that B ¥ O. By definition, B is a subalgebra of ce(X) such that B 0 Fe P, and it is clear that Be A. Also, by Remark (3.7), / is constant on each equivalence class [X]B and vanishes on Z(B). In particular, S(a 0f) C S(B). Since a 0/ E A, St« 0f) is a rr-compact set, hence by 0
0
G. Zapata / Stone- Weierstrass Theorem and Banach Lattice Algebras
Proposition
(2.18) there
exists
an
increasing
sequence
935
('Pn)
in
c1os(B) n cgc(X) such that 0 ~ 'P n ~ 1, for all n EN, and lim 'Pn(x) = 1, for all x E S(a f). Then a(f - 'Pnf) = (1- 'Pn)a f, n E N, is a decreasing 0
0
sequence in A which converges pointwise to 0, therefore Ila(f - 'PJ)IIA-+ 0, since II.IIA is CT-continuous. In particular, there exists an noEN such that letting 'P = 'P 11() we have (4.1)
Let K = supp(e ) and r > 0 be such that II'PIIA . r ~ 1. Notice that for every x E K there exists a gx E P such that gx(x) = f(x). In fact, if f(x) ¥ 0, there exists a hE B such that h(x) = 1, and gx = h ®f(x) is a suitable function. Let V x = {y E
xl a(f(y)- gAx» < r}
(x E K).
Since V x is an open neighbourhood of x, by compactness there are E K such that
XI' ••• , X n
where V; = V x; (i = 1, ... , n). Let y E K n V; and z E [y]B' Then f(z)- gXj(x;) = f(y)- gx;(x;), since f is constant on each [Y]B' therefore [Y]B C V;, for all y E K n V; (i = 1, ... , n). Notice that [Y]B = [Y]elo.B' for all Y E X, hence by Lemma (2.9) there exist 1" " , On E c1os(B) n cgAX) such that ~;=I 0; = 1 on K, 0 ~ 0; ~ 1 and supp(O;) C V; (i = 1, ... , n). Define g = ~;=J 'PO;gXj(x;). Then g E clos(B) ® F C c1os(B ® F) C clos P and
°
a('Pf - g) = a
(~ O;(f -
g x;(x;» )
I~I
n
~
L O;a(f -
;=1
g X; (x;)'P ~ rtp ,
hence (4.2)
Ila('Pf- g)IIA ~ 1.
From (4.1) and (4.2) follows lJa(f- g)IlA ~ 2. Since a is arbitrary, we conclude that f E clos clos p = clos P.
936
G. Zapata I Stone- Weierstrass Theorem and Banach Lattice Algebras
(5). SiA) ~ S3(A). Let I C A be an ideal and f E A such that f vanishes on Z(I). Notice that I is an algebra which separates the points in X\Z(1), hence [xlI = {x} for all x E X\Z(1). Then f is constant on each equivalence class [xL, x E X, and vanishes on Z(1), hence f E clos 1. (6). S3(A)~ S6(A). The ideal 1= <€c(X) is dense, since Z(I) = 0. Hence A is solid by Lemma (2.19). Let Be A be a solid semigroup and f E A such that f vanishes on Z(B). Given g E Band r E IR, let n E N be such that Irl";;::; n. Then Irgl,,;;::; Ing/, hence rg E B, since ng E Band B is solid. That is, B is a vector space. Also, given g E Band h E A, it follows that hg E B, since hg is a continuous function on X such that Ihgl,,;;::; Illhllogl and B is solid. Therefore, B is an ideal and f E clos B since f vanishes on Z(B). (7). S6(A)~ So(A). Let B = cgc(X), Then B is a solid semigroup of A such that Z(B) = 0. Thus B is dense, and it is enough to apply Lemma (2.19). This finishes the proof of the given implications in Step I. Step II. SiA) ¢:> So(A). (1). SiA) ~ So(A). Let B = cgc (X). Then B is a vector lattice such that, given any two points XI' Xl E X, XI ~ Xl' there exists an fEB such that f(x 1) = 1, f(x l) = -1. Hence B is an order separating vector lattice, therefore it is dense and we apply Lemma (2.19). (2). So(A)~ SiA). Let LeA be an order separating vector lattice. Let XI' XlE X, XI ~ Xl' and C I' C E IR. There exists agE L such that g(X1)g(X l) < 0, and we can assume that g(xl»O. Let h=alg++alg-, a!,alEIR. Then hEL and for suitable a!,a l, it follows that h(xJ=cj , i = 1,2. Let f E cgc(X)+, f ~ 0, 0 ~ K C X compact and e > 0 be given. We claim that there exists an h E L+ such that
f + i£ , ; : ; h , ;;::; f + 2£ In fact, let
X
on K.
E K. For every y E K there exists a gy E L such that gy (y)
=
f(y) + e .
We can assume that gy E L+, otherwise we take /gJ Let (y E K). Then Vy is an open neighbourhood of y, hence by compactness there
G. Zapata I Stone- WeierslTass Theorem and Banach Lattice Algebras
937
exists a finite subset Fe K such that Kc U Vy • yEF
Let h, = sup{gyl y E F}. Then h, E L+ and hxCx) = I(x) + E. Let
Wx = {u E
XI hAu)(u)+2E}
I + ~E < n,
on K. Further,
(x E K) .
Then Wx is an open neighbourhood of x, hence by compactness there exists a finite subset G C K such that Kc U xEG
w.
Let h = inf{hxl x E G}. Then h E L+ and it is clear that I on K. Fix h o E L+ constructed as above, such that
+ ~E ~ h ~ I + 28
on supp(f). Notice that supp(f) C S(h o). Let (Kn ) be an increasing sequence of compact sets in X such that U:~o K; = S(h o), K o = supp(f). Then for n = 1,2, ... , there exist h; E L+ such that
Let In = inf{h o,"" hn}, n = 1,2, .... Then In E L+ and 1~/n+l
r -r
Step III.
Ss(A)~
So(A).
(1). Ss(A)~ So(A). Let B = cgc(X), Then B is dense in A, since it is a strongly separating Stone lattice, hence So(A) follows from Lemma (2.19). (2). So(A)~ Ss(A). Let Be A be a Stone lattice. Notice that 10 II 1- b 1\ 11 ~ la - bl, for all a, b E IR, hence the mapping
938
G. Zapata / Stone- WeIerstrass Theorem and Banach Lattice Algebras
is continuous on A (d. Corollary (2.13». Therefore clos B is also a Stone lattice. Let f E clos B+, f;i 0 and assume that K = supp(f) is compact. Let x, y E X, x;i y, be given. If f(x) = f(y), let h = f(x )f. Otherwise, assume for instance that f(x) > f(y), and let r> 0 be such that f(x) > r > f(y). Define f(x)
and h =f(x)(f
A
g)+f(y)(f-f
A
g).
In any case, h E clos B, h (x) = f(x), h (y) = f(y) and supp(h) C K. Let C > 0 be given. If x E K, by compactness of K we can take the supremum of a suitable finite number of functions h as above and obtain a function hx E clos B such that hx(x) = f(x),
supp(h x ) C K,
Again by compactness of K, we can take the infimum of a suitable finite number of functions h, as above and obtain a function g E clos B such that f-c
and
supp(g) C K
(d. with the proof of So(A) ~ S4(A), (Step II (2». Fix () E ~c (X)+ such that () = 1 on K and let (cn) be a sequence in IR such that cn > 0 and Cn ~ O. From the result proved above, there exists a sequence (gn) in clos B such that If - gnl ~ cn(), whence f E clos B. Now we consider the case of an arbitrary f E clos B. In order to prove that f E clos B we can assume that f ~ 0 and IIflio = 1. Let
x, = {x E Slf(X)~~}
(n = 1,2, ...).
Then (Kn ) is an increasing sequence of compact sets in X such that
G. Zapata / Stone- Weierstrass Theorem and Banach Lattice Algebras
939
U tc, = S(f). Let (n
= 1,2, ...).
Then CPn E clos B, 0 ~ CPn ~ CPn+I' CPn = 1 on K; and sUPP(CPn) C K n+ J for all n. Since In = I A CPn E c1os(B) n ric (X)+' from the preceding case it follows that I~ E cIos B. Notice that (l- I~) is a decreasing sequence in A, converging pointwise to 0, hence I~ ~ l by the IT-continuity of II.IIA' that is lE cIos B. We have proved that clos B is an algebra. Therefore, Ss(A) follows from SiA), since So(A)=? SiA) by Step I, and [X]B = [X]c1osB' for all B CA, B 01 0. Step IV. Q(A, F)~ So(A). (1). Q(A, F)=? So(A). Let M= ric(X)tg)F. Then MCA F is a module over the separating algebra ric (X) C A. Since M(x) = F for all x E X, it follows that M is dense in AF" Let I E A, v E F, v 01 0 be given. By the Hahn-Banach theorem there exists a tp E F' such that cp(v) = 1, since F is Hausdorff. Let a = Icpl and E > 0 be given. Then there exists agE M such that
III-
cP 0 gilA = Ila(ftg) v - g)IIA ~
E.
Since tp 0 g E ric (X), it follows that II.IIA is IT-continuous, by Lemma (2.19). (2). So(A) =? Q(A, F). Let M C A F be a module over a separating subalgebra B C A, lEAF such that I(x) E clos M (x), for all x E X; and a E riS(F). Assume first that S(f) C S(B). Then S(a 0 f) C S(B) and, as in Step 1(4), there exists a tp E c1os(B) n ric (X) such that
o ~ cP ~ 1,
lIa(f -
supp(cp) C S(B).
Let K = supp(c ) and r > 0 be such that IlcpliA . r ~ 1. By assumption, for every x E X there exists a gx E M such that a(f(x)- gAx)) < r. If Vx = {y E XI a (f(y) - gx (y» < r}, then by compactness there are Xl' .•• , x, E K such that where \1;= V x j ( i = l , ... ,n).
940
G. Zapata / Stone- Weierstrass Theorem and Banach Lattice Algebras
Notice that [X]B = {x} for all x E X, since B is separating. Then by Lemma (2.9) there exist (J" ... , On E clos B n C6'c(X) such that L:=\ OJ = 1 on K, O:s:; OJ:S:; 1 and supp(O;}C V; (i = 1, ... , n). Let g = L:=I cpOjgxi' then gEclosM and a(cpl-g):s:;cpL:=\Oja(J-gx):S:;cpr, whence I/a(cplg)/IA:S:; 1 and Ila(J - g)IIA :s:; 2. Since a is arbitrary, we conclude that IE closM. Now assume that there exists Xo E X such that I(x o) 01- 0 and S(B) = X\{x o}. Fix tp E cgc(X) such that O:s:; cp :s:; 1 and cp = 1 on a compact neighbourhood K of X o' Since I(x o) E clos M(x o), there exists a goEM such that
v=
{x E int
KI a (J(x) -
go(x» < IIcpl /
lA
}
is an open neighbourhood of X o' Let 8 E cgc (X) be such that O:s:; (J :s:; 1, 8(x o) = 1 and supp(8) C V, and define II = (1- 8)/,
Then /; E A F , /;(x) E clos M(x) for all x E X and S(/;) C S(B), for i = 1,2. Thus I, E cIos M by the preceding result, hence there exist gl' gz EM such that
(i
=
1,2).
Let g = go + gl - gz· Notice that 1- g = (J\ - g\) + 8(J - go) + (g2 - 12) and a«(J(J - go»:S:; cp/llcpIIA' therefore Ila(J - dlA :s:; 3. Since gEM and a is arbitrary, we conclude that IE clos M. In Step I, we proved that S,(A), SiA), S3(A), S6(A) and P(A, F) are equivalent to So(A). Further, in the remaining steps we proved that SiA), Ss(A) and O(A, F) are equivalent to So(A), hence all those properties are equivalent and we finish the proof of Theorem (3.11).
Remark (4.1). In proving the equivalence of the properties S, (A), n = 0, ... ,6, P(A, F) and O(A, F), we did not use any result of approximation theory. Hence in the classical case, that is, when A = cgo(X), the corresponding approximation theorems for those properties (01- So(A», are genuine consequences of the Main Theorem. In particular, we obtain as corollaries the classical related results formerly proved by
G. Zapata / Stone- Weierstrass Theorem and Banach Lattice Algebras
941
Stone, Shilov, Kakutani and Peiczynski. (See [12], [22], [31] and [34].) Also it should be noticed that in the proof of the Main Theorem, the rr-continuity of II.IIA had a crucial linking role.
References [1) C.A. Akemann and J. Anderson, The Stone-Weierstrass problem for C·-algebras, In: Invariant Subspaces and Other Topics, ed. C. Apostol, RO. Douglas, B.Sz. Nagy and D. Voiculescu (Brikhauser, Basel, 1982) 15-32. [2J K.D. Bierstedt, Verallgemeinerungen des Satzes von Stone-Weierstrass, Jahrb. Uberlicke Mathematik (1975) 109-135. English trans. by the author in An. Acad. Brasil. Cienc. 49, 4 (1977) 507-523. (3) E. Bishop. A generalization of the Stone-Weierstrass theorem, Pacific J. Math. 11 (1961) 777-783. [4J N. Bourbaki, Integration, Ch. 1, 2, 3 et 4 (Hermann, Paris, 1965). (5) RB. Burckel, Characterizations of '€(X) among its sub algebras (Marcel Dekker, New York, 1972). (6) RB. Burckel, Bishop's Stone-Weierstrass theorem, Arner. Math. Monthly 91, 1 (1984) 22-32. [7J J. Dugundji, Topology (Allyn and Bacon, Boston, 1966). (8) N. Dunford and J.T. Schwartz, Linear Operators: General Theory (Wiley, New York, 1958). (9) J. Glimm, A Stone-Weierstrass theorem for CO-algebras, Ann. of Math. 72 (1960) 216-244. (IOJ A. Goullet de Rugy, Une classe d'espaces de Banach reticules, Math. Z. 144 (1975) 217-238. (11) E. Hewitt, Certain generalizations of the Weierstrass approximation theorem, Duke Math. J. 14 (1947) 419-427. (I2) S. Kakutani, Concrete representation of abstract (M)-spaces (a characterization of the space of continuous functions), Ann. of Math. (2) 42 (1941) 994-1024. (13) 1. Kaplansky, The Weierstrass theorem in fields with valuations, Proc. Amer. Math. Soc. 1 (1950) 356-357. (14) Y. Katznelson and W. Rudin, The Stone-Weierstrass property in Banach algebras, Pacific J. Math. 11 (1961) 253-265. [15J L.H. Loomis, An Introduction to Abstract Harmonic Analysis (Van Nostrand, Princeton, 1953). [16J S. Machado, On Bishop's generalization of the Weierstrass-Stone theorem, Indag. Math. 39 (1977) 218-224. (17) RE. Mullins, A converse of the Stone-Weierstrass theorem, Amer. Math. Monthly 77 (1970) 982-983. (18) L. Nachbin, Sur les algebres denses de fonctions differentiables sur une variete, C.R Acad. Sci. Paris 228 (1949) 1549-155l. (19) L. Nachbin, Elements of Approximation Theory (Van Nostrand, Princeton, 1967). Reprinted (R. Krieger, New York, 1976). (20) L. Nachbin, On the priority of algebras of continuous functions in weighted approximation, In: Symp. Math. XVII (1976) 169-183. (21) RJ. Nagel, A Stone-Weierstrass theorem for Banach lattices, Studia Math. 47 (1973) 75-82.
942
G. Zapata / Stone- Weierstrass Theorem and Banach Lattice Algebras
[22J A. Petczyriski, A generalization of Stone's theorem on approximation, Bull. Ac. Polon. Sci. 5 (1957) 105-107. [23J C. Portenier, Espaces de Riesz, espaces de fonctions et espaces de sections, Comm. Math. Helvetici 46 (1971) 289-313. [24J J.B. Prolla, Approximation of Vector Valued Functions, Notas Mat. 61 (NorthHolland, Amsterdam, 1977). [25J J.S. Pym, Notes on the Stone property in vector lattices, Arch. Math. 19 (1968) 70-78. [26J W. Rudin, Real and Complex Analysis (McGraw-Hili, New York, 1970). [27] H.H. Schaefer, Topological Vector Spaces (Springer, Berlin, 1971). [28] H.H. Schaefer, Banach lattices and Positive Operators (Springer, Berlin, 1974). [29] RH. Schaefer, Aspects of Banach lattices, In: Studies in Functional Analysis, ed. R.G. Bartle (Math. Assoc. of America, Washington, D.C., 1980) 158-221. [30] Z. Semadeni, Banach Spaces of Continuous Functions I (Polish Sci. Publ., Warsaw, 1971). [31) G. Shilov, Ideals and subrings of the ring of continuous functions, Dokl, Akad. Nauk. SSSR 22 (1939) 7-10. [32] M.H. Stone, Applications of the theory of Boolean rings to general topology, Trans. Amer. Math. Soc. 41 (1937) 375-481. [33] M.H. Stone, The generalized Weierstrass approximation theorem, Math. Mag. 21 (1947/48) 167-184; 237-254. [34) M.H. Stone, A generalized Weierstrass approximation theorem, In: Studies in Modern Analysis, ed. RC. Buck (Math. Assoc. of America, Washington, D.C., 1962) 30-87. [35) W.R Summers, Weighted locally convex spaces of continuous functions, Ph.D. Dissertation (Louisiana State University, 1968). [36J G. Zapata, Dense subalgebras in topological algebras of differentiable functions, In: Functional Analysis, Holomorphy, and Approximation Theory, ed. S. Machado, Lecture Notes in Math. 843 (Springer, Berlin, 1981) 615-636. [37) G. Zapata, On the approximation of functions in inductive limits, In: Functional Analysis, Holomorphy and Approximation Theory, ed. J.A. Barroso, Notas Mat. 88 (North-Holland, Amsterdam, 1982) 461-485.