Lecture Notes in Mathematics Edited by A. Dold and B. Eckmann
Jean Moulin Ollagnier
anvd Statistical h/lechanics
Spri nger-Verlag Berlin Heidelberg New York Tokyo
Author Jean Moulin Ollagnier Departement de Mathematiques, Universite Paris Nord Avenue J. B, Clement, 93430 Villetaneuse, France
AMS Subject Classification (1980): 20F, 28D, 54H20, 82A05, 82A25 ISBN 3-540-15192-3 Springer-Verlag Berlin Heidelberg New York Tokyo ISBN 0-387-15192-3 Springer-Verlag New York Heidelberg Berlin Tokyo Th~s work is subject to copyright. All rights are reserved, whether the whole or part of the material is concerned, specifically those of translation, reprinting, re-use of illustrations, broadcasting, reproduction by photocopying machine or similar means, and storage in data banks. Under § 54 of the German Copyright Law where copies are made for other than private use, a fee is payable to "Verwertungsgesellschaft WOW', Munich. © by Springer-Verlag Berlin Heidelberg 1985 Printed in Germany Printing and binding: Beltz Offsetdruck, Hemsbach/Bergstr. 2146/3140-543210
CONTENTS
INTRODUCTION
....................................................
I. P R E L I M I N A R Y
ANALYSIS
I.I.
Sublinear
1.2.
Compact
1.3.
Radon measures
1.4.
Extremal
1.5.
References
2. D Y N A M I C A L
........................................
functions
convex
and
sets
the H a h n - B a n a c h
theorem
.......
...................................
........................................
points
in c o m p a c t
convex
sets
................
............................................
SYSTEMS
AND AMENABLE
GROUPS
.......................
I I 4 6 10 14
15
2.1.
Dynamical
systems
2.2.
The
point
2.3.
Amenability
2.4.
References
............................................
33
3. E R G O D I C T H E O R E M S
............................................
35
4.
fixed
.....................................
V
property
and the a m e a n i n g
and a l g e b r a i c
constructions
3.1.
Invariant
linear
functionals
3.2.
Invariant
vectors
and mean
3.3.
Individual
ergodic
theorems
3.4.
The
ergodic
theorem
3.5.
References
ENTROPY
saddle
Equivalence
DYNAMICAL
4.2.
Entropy
of p a r t i t i o n s
Entropy
of d y n a m i c a l
4.4.
The
a n d the 4.5.
theorems
...........
41 47
............................
50
ergodic
52
.......................
53
systems ..............
53
................................. systems
subadditive
35
...........................
dynamical
Shannon-McMillan
References
19 30
..........................
ergodic
SYSTEMS
of a b s t r a c t
4.3.
almost
......
...............
............................................
OF ABSTRACT
4.1.
filter
15
..........................
54 59
theorem
theorem
......................
............................................
64 70
IV
5. E N T R O P Y
6.
7.
A
FUNCTION
AND
THE
5.1.
Topological
entropy
5.2.
Pressure
a continuous
5.3.
Entropy
5.4.
References
STATISTICAL
of as
MECHANICS
6.2.
Cocycles
6.3.
Phase
6.4.
Supermodular
6.5.
References
ON
of
and
and
72
.................
82
measure
measures
measures
86
................
91 96
............................
...........................................
SYSTEMS
IN
STATISTICAL
MECHANICS
7.2.
Invariant
Gibbs
measures
7.3.
Mixing
7.4.
Example:
7.5.
References
properties
and
.................
....................... equilibrium
measures
of
Ruelle's
....
.......................
...........................................
COUNTABLE
8.1.
Tiling
amenable
8.2.
Equivalence
8.3.
Rokhlin's
8.4.
References
of
AMENABLE
groups countable
lemmaand
GROUPS
...................
............................... groups
......................
hyperfiniteness of countable amenable groups
...........................................
..................................................
.........................................................
98 104
....................................
a theorem
86
..............
....................................
interactions
specifications
INDEX
85
.........................
Gibbs
local
BIBLIOGRAPHY
71
the
quasi-invariant
transitions
71
the variational principle
A LATTICE
specifications
OF
........
function,and
Invariant
EQUIVALENCE
PRINCIPLE
...........................................
Local
DYNAMICAL
VARIATIONAL
..................................
a function
6.1.
7.1.
8.
AS
105 105 106 108 112 115
117 117 124 130 138
139
145
INTRODUCTION
It can be said that for the study of dynamical the crucial property of the acting group. classical
point of view is not only natural,
applications isometries
in statistical
systems amenability
This generalization
is
of the
but is also related
to the
mechanics where the acting group consists of
of the lattice.
This text, which grew out of a "third cycle" course in Ergodic Theory and Statistical the University
Mechanics
of Paris VI in 1980, deals with both topological
measure-theoretic dynamical
which I gave together with Didier Pinchon at
dynamical
systems,
systems of statistical
The existence of the ameaning
and
the symbolic
mechanics.
filter for an amenable group shall be
proved with the use of strongly dynamical
and in particular
subadditive
system of total orders.
Several
set functions
and the special
ergodic theorems
shall be
given. The entropy theory of measure-theoretic completely
described;
dynamical
and the Shannon-McMillan
corollary of a new ergodic
theorem,
systems shall be
theorem is given as a
the "almost subadditive
ergodic
theorem." A link between topological
and measure-theoretic
be made by way of the variational continuous
principle
function on a compact Hausdorff
dynamical
systems
shall
for the pressure of a space under the action of an
amenable group. A careful
study of subadditivity
use of tiling methods
in proving
of set functions several
allows us to avoid the
important
tiling is essential when proving the equivalence
theorems.
However,
of a countable
amenable
group with Z. This proof is given in the last chapter along with Rokhlin's lemma and the proof of the hyperfiniteness
of countable
(using the tower extension argument of Connes,
amenable groups
Feldman and Weiss).
Vl I would like to express my indebtedness significant portion of the material
to Didier Pinchon,
to whom a
contained in this text is due.
I also would like to thank Jean-Paul Thouvenot for many helpful discussions which stimulated the work on this text, and Tony Frank Paschall for his assistance with the English manuscript.
Jean Moulin Ollagnier Villetaneuse,
September 1984
i. P R E L I M I N A R Y ANALYSIS
I.I. S U B L I N E A R FUNCTIONS AND THE H A H N - B A N A C H THEOREM
I.I.I.
Definition.
Let E be a real vector
space. A real function p on E
is said to be sublinear if it is both subadditive and p o s i t i v e l y homogeneous,
i.e. if the two following conditions hold:
i)
for every pair
p(x+y)
(x,y) of vectors in E
ii) for every x and every positive number
1.1.2.
Remark.
~ p(x)
+ p(y)
p(Ix) = I p(x)
It is quite clear that a linear functional
is a sublinear
function and that the least upper bound of a set of sublinear functions, if one exists,
is still sublinear.
linear functionals
Therefore,
a least upper bound of
is a sublinear function. We shall state that this
p r o p e r t y is characteristic.
1.1.3. Example.
Consider the vector space C(X) of all real continuous
functions on the compact set X. Function s, defined on this space by the formula
s(f)
=
sup f(x) xeX
is sublinear.
1.1.4.
Extension theorem
(Hahn-Banach).
Let E be a real vector
space and
p be a sublinear function on E. Let F be a subspace of E and m be a linear functional b o u n d e d above by p on F, i.e. for every x in F, m(x) Then,
~ p(x).
there exists a linear functional n on the whole space E, still
b o u n d e d above by p, w h i c h is an extension of m. The proof of the theorem e s s e n t i a l l y depends on an extension lemma, and, w h e n the dimension is infinite,
on Zorn's lemma as well.
1.1.5.
Extension
lemma.
Let E be a real vector
function on E, G a vector functional
subspace of E non-equal
p a sublinear
to E, and f a linear
on G bounded by p.
Let a be an element of E\G so that the vector greater
space,
than G. It is then possible
a linear functional
space G 8 Ra is strictly
to find an extension of f which is
on G 8 Ra and is still bounded by p.
Proof of the lemma. We look for a real number ~ such that, in G and every real number ~, the following f(x) + ~ Using homogeneity,
< p(x+~a)
we have only to verify
f(x) + a
~
p(x+a)
and
that, f(x)
The real number ~ must then be chosen greater Sup yeG
for every x
inequality holds:
for every x in G,
- ~
~
p(x-a)
than or equal to
(f(y)-p(y-a))
and less than or equal
to
Inf (p(x+a)-f(x)) xeG Such a choice is possible f(y) - p(y-a) which is equivalent Because
to
if, for every pair <
p(x+a)
in G,
- f(x)
f(x) + f(y) ~ p(x+a)
f(x) + f(y) = f(x+y)
(x,y) of vectors
~ p(x+y)
+ p(y-a).
~ p(x+a)
+ p(y-a),
the proof is
achieved. 1.1.6.
Proof of the theorem.
G is a subspace
Consider
of E which contains
the set I of all pairs
(G,f) where
F, and where f is a linear functional
on G, is bounded by p, and extends m. This set I can be ordered in the following way: (G,f) $ (G',f')
<
Set I is non-empty because the order.
>
G
G' and f' is an extension of f
(F,m) belongs
to it and it is inductive
for
According
to Zorn's
for this order. lemma 1.1.5) 1.1.7.
For such a maximal
element,
element
(G+,n)
G + is equal to E (using
and the proof is thereby obtained.
Corollary.
linear functionals Proof.
lemma, we can then find a maximal
A sublinear
function is the least upper bound of all
less than or equal to it.
We want to show that,
for a given sublinear
for every vector x in a vector
function p on E and
space E, the real number p(x)
least upper bound of all numbers
is the
f(x), where f is a linear functional
on
E bounded above by p. Then,
the inequality
On the other hand,
f(x) ~ p(x) holds.
the linear functional
given by f(x) = p(x), whole
on the subspace generated by x,
can be extended according
to theorem 1.1.4 to the
space.
1.1.8.
Example.
describing
Continuing
now example
the linear functionals
for every continuous
1.1.3 of function
s on C(X) and
on C(X) bounded by s, we find that,
function f on X, a linear functional m bounded
above by s verifies m(f)
~ IIfII, where
II II is the uniform norm.
We then have m(f) ~ s(f) ~ IIfll
and
Therefore m is a Radon measure Moreover,
if f is everywhere m(-f)
On the other hand, m(1)
~ 1
and
$ p(-f)
m(-l)
$ -I
non-negative,
$ 0
and
measures
on X.
1.1.9.
~ 0
to constant
A pseudo-norm
the identity
is a sublinear
function which moreover
p(x) = p(-x)
of linear functionals
given pseudo-norm
p.
= 1
are bounded by s.
One of the more common forms of the Hahn-Banach extension
m(1)
equal to I, i.e. Radon probability
all Radon probability measures
Remark.
I, both inequalities
are true and therefore
on X with a total mass
verifies
m(f)
on C(X) bounded by s are then positive Radon
measures
Conversely,
$ s(-f) ~ IIfll
on X.
if f is equal
The linear functionals
-m(f) = m(-f)
theorem deals with the
whose absolute value
is bounded by a
If p is a pseudo-norm,
the two conditions m ~ p and
[m[ ~ p are equivalent
and theorem 1.1.4 enables us to complete the proof.
I.I.I0.
Remark.
It is possible to consider a similar problem with complex
rather than real vector spaces. In this case, a p s e u d o - n o r m p is a subadditive positive function on complex space E that verifies,
for every vector x and every complex
number ~,
p(~x) = I~Ip(x)
Then let m be a complex linear functional bounded by p on subspace F of E. According to remark 1.1.9,
the real
an extension v to the whole space,
linear functional on F, Re(m), has such that the absolute value of v is
b o u n d e d by p. The complex function n on E, given by n(x) = v(x)
- i.v(ix),
is a complex
linear functional with an absolute value bounded by p, and is an extension ofm.
1.2. COMPACT CONVEX SETS
1.2.1.
Definition.
Let E be a real vector space and E* its algebraic dual
space. Every x in E defines a p s e u d o - n o r m Px on E* in the following way:
px(f) =
f(x)
The family of all these pseudo-norms endows E* w i t h a topology and it becomes
a locally convex topological vector space.
This topology is the restriction to E* of the product
topology on R E .
It is the coarsest topology for which the coordinate applications from E* to R, f --> f(x),
are continuous.
It is called the weak* topology on E*. 1.2.2. Proposition.
Let K(E,p) be the subspace of E* that consists of
all linear functionals on E that are bounded above by the sublinear function p. K(E,p)
is a convex subset of E*, and it is compact for the weak* topology.
Proof. results
The convexity property is evident, from Tychonoff's
The product
space of all segments
RE; and the conditions
while the compactness
of K(E,p)
theorem. [-p(-x),p(x)]
defining K(E,p)
is a compact subset of
in this product
space make it a
closed subspace.
1.2.3. Examples. The unit ball of the strong dual of a normed space is weakly compact. The unit ball of the space of Radon measures w e a k l y compact.
on a compact
space X is
The subset of this ball of all probabilities
is convex
and weakly compact.
The following
lemma is useful
1.2.4.
Let F be a finite dimensional
Lemma.
in proving
the converse of proposition real vector
its dual space, and K a convex compact
subset of F'
Every linear functional
to K(E,p) when,
f(x)
Proof.
Consider
dian structure
~
f on F belongs
Sup geK
1.2.2.
space. Let F' be for every x in F,
(g(x))
a Euclidian
structure
< , >
on F and the dual Eucli-
on F'
Call f' the orthogonal
projection
of f on K, i.e.
the unique element
of
K such that Inf geK
Let x be the vector
=
such that
f - f' = <x,.>.
For every g in K and every real number e between 0 and I, belongs
(f'+e.(g-f'))
to K and (f-f')(x)
=< IIf-f'-e.(g-f')II 2
=
(f-f') (x) +
For every strictly positive 0
< 2(f'-g) (x) + EIIf'-gll2
2e. (f'-g) (x)
+
E2 llf'-gll2
When ~ tends to 0, this formula reduces to
(g-f')(x) ~ 0 , from which
we derive f'(x) = Sup g(x) geK Therefore f(x)
is less than or equal to f'(x), and the scalar square of
f-f' is non-positive, which means
1.2.5. Proposition.
that f belongs to K.
Let K be a convex part of the dual space E* of a
real vector space E, and let it be compact in the weak* topology. Let x be a vector in E, and consider the least upper bound p(x) of all numbers
f(x), where f belongs to K. For every x in E, p(x)
So defined, and K(E,p) Proof.
function p is sublinear,
is finite.
and the two compact convex sets K
are equal.
For every x, the function
therefore bounded.
f --> f(x)
is continuous on K and
Function p is then defined on the whole space E and
it is a supremum of linear functionals.
According to remark 1.1.2, p is
sublinear. By definition, K is contained in K(E,p). To prove the converse inclusion,
consider an element f of K(E,p).
For every finite dimensional vector subspace of E, there exists according to lemma 1.2.4 a convex compact n o n - e m p t y subset K F of K whose elements give the same value as f to all elements of F. The family of all compact sets K F has the non-empty finite intersection property.
Therefore,
there exists a convex compact subset K E of K, whose
elements give the same value as f to all elements of E. There is only one element in K, which is indeed f, and the converse inclusion is proven.
1.2.6. Remark.
It is possible to deduce the result of p r o p o s i t i o n 1.2.5
from a geometrical
form of the H a h n - B a n a c h theorem.
1.3. RADON MEASURES
1.3.1.
Definition.
Consider the vector space C(X) of all real continuous
functions on a given compact Hausdorff space X. A Radon measure on X is a linear functional on C(X),
continuous for the s u p r e m u m norm given by
the formula
1.3.2. gical
IIfll = Sup xeX
Definition.
If(x) l.
Let
space and ~ a
(X,~) be a measurable
o-algebra
A positive real measure measure
of all its compact
is said to be outer regular the greatest 1.3.3.
subsets.
space,
theorem.
and let A be a positive
Then there exists
a o-algebra ~ i n
~ represents
real measure on this space
of every measurable
set is
of all open sets containing
linear functional
on Cc(X),
on X with a compact
X which contains
all Borel
support. sets, and
the following properties:
A
feCc(X)
---> ~(f)
=
f f d~ X
second,
~ is inner and outer regular and gives a finite measure
compact
sets; and,
third,
the o-algebra ~ is complete
One proof of this can be found in Rudin 1.3.4.
Remark.
theorem,
the Borel o-algebra 1.3.5.
Corollary.
correspondence,
and also, when necessary,
the restriction
Let ~ be a Radon measure on a compact Hausdorff semi-continuous
Proof.
to
space
function on X. Then the integral
is the supremum of all ~(~), where ~ is a continuous
greater
we shall
derived from a linear functional
of this measure.
X, and f be an upper ~(f)
to
for ~.
(i).
Because of the above one-to-one
call a Radon measure both the measure on C(X) by Riesz's
it.
Let X be a locally compact
real functions
a unique positive measure ~ on ~ w i t h first,
A positive
if the measure
lower bound of the measures
the space of all continuous
if the
subset of X is the least upper bound of the
The Riesz representation
Hausdorff
the Borel o-algebra.
on this space is said to be inner regular
of every measurable
measures
containing
space, where X is a topolo-
function on X
than or equal to f. For every continuous
results
from the positivity
function ~, the inequality
~(f)
~
~(~)
of ~.
Consider now,
for every natural number n, the function fn which is equal
to Sup(f,-n),
and is so upper
Sequence
(fn) then decreases
theorem,
the integral
semi-continuous to f. According
and bounded. to the monotone
of f is the limit of the integral
convergence
of fn when n
tends to infinity. We have only then to prove the results for bounded upper semi-continuous functions.
By adding a constant we can even restrict the proof to the
case of non-negative functions. To achieve such a proof,
let g be a n o n - n e g a t i v e upper semi-continuous
function on X. For every positive real number 8, g~ is the function given by
= g~ In fact,
~
l{g~n6}
6"n= 0
there is a finite number of terms in the sum that are different
from 0 since g is bounded.
The last subscript is
n O = E(Sup(g)/6).
This function is greater than or equal to g, and this inequality holds:
~(g~ - g)
<
~.~(I)
Given a positive real number ~, select a ~ less than e/2~(1).
Since u is inner regular,
every compact set K n = {g~n~}
has an open
n e i g h b o r h o o d O n such that ~ ( O n - K n) < n Then,
there exists a continuous function 0n , whose values
lie between 0
and i, and which is equal to 1 at every point of K n and to 0 at every point outside of 0 n. The positive real number n is then an upper bound for
~(0 n) Choose n less than
~(I K ) n E/2 Sup(g)
and consider the function
no 0
=
5.( i + n~I= 0n )
This function is greater inequality
u(+-g)
< ~
than or equal to g at every point in X, and the holds, which concludes the proof.
1.3.6. A version of Dini's continuous
lemma. Let (fi)iel be a set of upper semi-
functions on a H a u s d o r f f compact set X such that every pair
of elements has a common lower bound in the set, i.e. that this set is directed for the order ~. Let f be the greatest
lower bound of all these functions.
The following m i n i m a x result holds in this situation:
Sup f(x) xeX
=
Inf Sup fi(x) ieI xeX
To demonstrate this consider first the obvious inequality
Sup f(x) ~ Inf Sup fi(x) xeX ieI xeX In order to prove the converse inequality denote by a the following number :
a = Inf Sup fi(x) ieI xeX If a is equal to -~ Otherwise,
, there is nothing to prove.
the compact sets
fil(La,+=L)
are n o n - e m p t y and their family
has the n o n - e m p t y intersection property. At every point x of the intersection of all these compact sets,
the
limit function f takes a values f(x) which is less than or equal to a; and this concludes the proof.
1.3.7. Corollary.
Given a convex and compact set of Radon p r o b a b i l i t y
m e a s u r e s on a compact Hausdorff set X, and an upper semi-continuous f u n c t i o n f on X, the function ~
--> ~(f)
on K is also upper semi-
continuous. Moreover,
Sup ~(f) ~eK where
=
Inf Sup ~(~) ~ f ~eK
the infimum is taken on the set of all continuous
functions on X
b o u n d e d below by f.
Proof.
Corollary 1.3.5 implies that the function ~ --> ~(f)
of a set of continuous
functions,
is an infimum
and thereby upper semi-continuous.
The m i n i m a x result is obtained by application of lemma 1.3.6.
10
1.4. EXTREMAL POINTS IN COMPACT CONVEX SETS
1.4.1.
Definition.
A point M in a convex part C of a real vector space E
is said to be an extremal point in C if it is not a convex c o m b i n a t i o n of two other points in C.
In the following extremal points of particular compact convex sets of p r o b a b i l i t y measures will be used.
The next property,
product of the K r e i n - M i l m a n theorem,
demonstrates
which is a by-
the existence of
extremal points.
1.4.2.
Proposition.
space,
and K a convex and compact subset of E.
The set Ext(K)
Let E be a locally convex Hausdorff real vector
of all extremal points in K is not empty.
Proof. Let I be the set of all closed non-empty subsets of K, for which the so-called open-segment p r o p e r t y holds:
every open segment ] A BF meets
(]A B E
= {=A+BB, ~+~=I,~
>0,B
>0})
that
C is contained in C.
Consider the converse order of the inclusion order on I; K being a compact set, exists,
I is hence an inductive p a r t i a l l y ordered set. Then,
according to Zorn's lemma,
there
some maximal element in I.
It remains to be seen that any such maximal element reduces to a single point, which would then be an extremal one in K. Let C be an element of I w h i c h is not reduced to a point.
Given two
different points x and y of C, there exists a continuous p s e u d o - n o r m p such that p(x-y)
is strictly positive because E is a H a u s d o r f f space.
The e x t e n s i o n theorem 1.1.4 enables us to build a continuous functional
f on E such that
f(x-y) = p(x-y)
> 0.
Denote by C + the subspace of C consisting of all points reaches
linear
z at which f
its m a x i m u m value on C.
Because C is compact,
C + is not empty.
It it clearly a convex and closed
subset of C. The open-segment p r o p e r t y still holds for C +. Indeed,
consider a convex combination
M = ~A + BB, where ~ and B are
n o n - n e g a t i v e and whose sum is equal to i; and, if M belongs to C +, A and B also belong to C because the open-segment property holds for C.
11
Moreover,
the inequalities f(A) ~ of(A) + ~f(B)
imply
f(A) = f(B) = f(M)
Therefore
1.4.3.
Example.
extremal
f(B) ~ of(A) + ~f(B)
, and the points
C + is an element
and this concludes
and
A and B then belong to C +.
of I, strictly greater
than C for the order;
the proof.
The following
situation will enable us to characterize
points.
X is a compact Hausdorff (defined in 1.1.3)
space and s is the sublinear
so that K(C(X),s)
all Radon probability measures N is a vector
function on C(X)
is the convex and compact
set of
on X (1.1.8).
subspace of C(X) such that,
for every f in N, s(f)
is non-
negative. The null functional
is then bounded by s on N and it is possible,
ing to theorem 1.1.4,
to find a functional
space E whose restriction The non-empty
to N is 0.
set K(C(X),s,N)
give an integral
equal
the extension p(f)
The following
=
on X that
of N is then a convex and
this compact and convex set is characterized
by its upper bound p according Applying
of all Radon probability measures
to 0 to all elements
compact part of K(C(X),s);
accord-
bounded by s on the whole
to 1.2.5.
lemma 1.1.5 to the space N • Rf, p may be written
Sup meK(C(X),s,N)
m(f)
=
Inf s(f+g) geN
theorem gives a characterization
of the extremal
points
in
K(C(X),p). 1.4.4.
Theorem.
In 1.4.3,
the extremal
the measures m for which the subspace
points N @ R.I
in K(C(X),p)
are precisely
is dense in C(X)
for the
p s e u d o - n o r m m(l.l). Proof. i) Let us first show that a measure with the above property Consider
a strict convex combination
property holds
for m:
(=i.o2 > 0) where
is extremal.
the density
12
m = ~iml + ~2m2 The two linear functionals m I and m 2 are continuous m(I.l)
for the p s e u d o - n o r m
since
Iml(f) l
~
(1/~ 1) m(Ifl)
Im2(f) l ~ (1/~ 2) m(Ifl)
and
They are equal to m on the dense subspace N @ R.I, and therefore are equal on the whole space. Functional m is then an extremal point of K(C(X),p). 2) The converse can be proven by contraposition. If
N ~ R.I
is not a dense subspace of C(X) for the p s e u d o - n o r m m(l.l),
there exist a real continuous
function f and a strictly positive real
number ~ such that, for every g in N and every real number a,
m(l f-g-al )
>
~
>
0
Consider the linear functional 8 on
0(g + a + ~f) =
N • R.I @ R.f
given by
~=
The functional verifies on this subspace the inequality
e ~ m(I.I).
It is then possible to find on the whole space C(X) a linear functional b o u n d e d above by m(I.I) , whose restriction is 0. Denote it by ~. The d e c o m p o s i t i o n
m
=
(1/2)
((m+~) + (m-~))
is non-trivial because #(f) It remains to be shown that
is not 0. m+~
and
m-~
are elements of K(C(X),p);
but the only thing to be proven is that they are positive measures. If h is a continuous non-negative
(m + ~¢)(h)
1o4.5. Remark.
=
m(h)
The dense subspace
function on X, we get (s = jl)
+ E¢(h)
N @ R.I
>
m(lh[)
- [#(h)[
>
0
is then dense in LI(X,Q,m)
the L l - n o r m , where m is the regular measure built from the functional m by theorem 1.3.4. 1.4.6. Remark.
Compact convex sets described in 1.4.3 are simplexes and
for
13
all compact 1.4.7.
simplexes
Example.
of probability measures have such forms.
Let f be a continuous
mapping
from a compact
space Y
onto a compact set X, and let m be a Radon probability measure The set of all probability measures type described Extremal morphism
in this convex set are exactly those for which f is a
of the two measure of the quotient
the o-ideals 1.4.8.
in 1.4.3.
points
conjugacy
spaces
(Y,~,n)
Boole algebras
and
(X,~,m),
i.e. an iso-
of the o-algebras
of events by
of null sets.
Example.
Let X be a compact
A Radon probability measure
set and T a h o m e o m o r p h i s m
of X.
~ on X is said to be invariant under the
action of T if the equality function
on X.
on Y whose image by f is m is of the
~(f)
= ~(foT)
holds
for every continuous
f on X.
The set M(X,T)
of all Radon probability measures
on X, which are invar-
iant under T, is not empty and is also a convex and compact set of the type described Extremal invariant
elements
Let us prove Here,
in 1.4.3.
invariant probabilities of LI(X,~,~)
are ergodic, are constant
which is to say the only functions.
these results.
the space N is generated by the increments
belongs
to C(X).
increment It shall
Every linear
f - foT
combination ~ ~i(fi-fioT)
where
f
is still an
f - foT. suffice to verify
s(f-foT)
~ 0
Let x be a point in X at which f reaches a point the difference therefore,
(f-foT)(x)
s is non-negative
The extremal
points
in M(X,T)
its least upper bound.
is greater
At such
than or equal to 0, and,
on N. are those for which the subspace N @ R.I
is dense in LI(X,~,~ ). Consider
now the sequence of contraction
operators
A n of LI(X,~,~)
where
A n is given by the expression An(f) The sequence
=
(An(f))
n-I (I/n). ~ loT i of averages
converges
to the expectation
~(f)
for
every element of N • R.I; and, by virtue of the density of this subspace, the convergence
result also holds
for every element of LI(X,~,~).
14
If f is an invariant to u(f).
An invariant
if u is extremal 1.4.9.
element,
Remark.
An(f)
is a constant
element of LI(X,Q,u)
In the previous
space.
In Chapter
that converges
is then a constant
function
in M(X,T). example we proved that there always
at least one invariant probability under Hausdorff
sequence
This was demonstrated
2, we shall examine
a homeomorphism
by Bogoliubov
the transformation
are invariant probability measures whenever
and Krylov
groups
exists
of a compact (1).
for which there
they act on a compact
space
by homeomorphisms. To prove the existence we used a convergence ergodic
theorem.
of invariant probabilities theorem of means,
in the present
which is a special
chapter,
case of a mean
Chapt,er 3 deals further with this topic.
1.5 REFERENCES
For further Bourbaki's
study of Hahn-Banach text devoted
Riesz representation Choquet
theorem,
to topological
refer to Meyer
vector
spaces
theorem is proved by Rudin
(i) is recommended
for general
and more in-depth study of simplexes.
(i) or to
(3).
(i, chapter
information
2).
on functional
analysis
2. DYNAMICAL
2.1. DYNAMICAL
2.1.1.
SYSTEMS AND AMENABLE GROUPS
SYSTEMS
Definition.
We call a pair
when X is a compact Hausdorff
(X,G)
a topological
dynamical
system
space and G a group of homeomorphisms
of
this space. When discussing
the action of an abstract group G, we shall denote by T g
the homeomorphism (X,T)
of X associated with the element g of G and denote by
the corresponding
dynamical
system.
When G is the group Z of all relative the generator
integers,
T is simply the image of
i.
2.1.2. Example.
The first example
set, and X the product
is given by shifts.
space I G of all mappings
Endowed with the product topology of the discrete a compact Hausdorff
space which is metrizable
Given an element g of G, consider Tg(~)(a) The mappings
topologies,
X becomes
when G is countable.
the mapping T g from X to X:
= ~ (ag)
T g are homeomorphisms
group of all homeomorphisms corresponding
Let I be a finite
from G to I.
of X; and the mapping from G to the
of K, obtained by :sending every g to the
T g, is a group homomorphism.
Such a group homomorphism We shall sometimes
is called an action of G on X by homeomorphisms.
refer to T g as a translation.
Indeed,
T g is the right
translation by g-I of the graphs r($): (a.g-l,i) The set I is sometimes we have constructed 2.1.3.
Example.
e
r(Tg(¢))
<
>
(a,i)
called an alphabet,
is called a symbolic
Here is another example,
e
r(~)
and the dynamical
dynamical
system that
system.
which is as symbolic as the
16
preceding.
Given a group G, consider
the set T(G) of all total orders
on G. If t is a total order on a finite part F of G, O(F,t) the set of all total orders on G whose restriction The set of all O(F,t)
is one of the bases of a topology on T(G).
Endowed with this topology, metrizable to T(G)
T(G)
when G is countable;
of the product
Of course,
topology
is a compact Hausdorff on
<==>
x < y
For every element g in G, consider Tg(r)
(a,b)
=
The T g are homeomorphisms
2.1.4.
As in example
Definition.
of T(G)
the mapping
and mapping
is a one-to-one
An automorphism
bimeasurable mapping
Definition.
to T(G):
are called translations.
if T and its inverse
of a probability
space
from X to X that preserves
i.e. for every A in the u-algebra (T -I(A))
from T(G)
g to T g gives an action of G
A mapping T is bimeasurable
are both measurable.
the measure,
for
2.1.2 these homeomorphisms
(X,~,~)
4, the following holds:
= ~(A)
Given a probability
action of G on (X,~,~) system.
{0,I} GxG.
~(ag,bg)
mapping
2.1.5.
space, which is
this topology is simply the restriction
we identify an order T with its graph: T (x,y)=l
on T(G).
stands for
to F is t.
space
by automorphisms
(X,~,~)
and a group G, an
is called an abstract
Such an action is then a family of automorphisms
lity space, which is indexed by the elements
dynamical
of the probabi-
of G, and for which the
equality T gh
holds 2.1.6.
=
for every pair Remark.
TgoT h
(g,h) of elements
The question of modulo
spaces will be investigated In these first chapters, pological
of G. 0 automorphisms
and Lebesgue
later.
abstract
dynamical
ones and the automorphisms
systems are derived from to-
are therefore well-defined
mappings.
17
2.1.7.
Proposition.
Let
(X,G) be a topological
be a Radon probability measure of G. This means
every element g of G, the following
Then,
=
function f on X, and for
equality holds:
of the probability
space
is the o - a l g e b r a built by the Riesz representation
lar,
Indeed,
and
~ (f)
the T g are automorphisms
Proof.
system,
on X, which is invariant under the action
that for every continuous
u (foT g)
dynamical
the T g are measurable
for ~; and,
(X,~,~) where
theorem
(1.3.3).
since ~ is inner regu-
for every A in Q, (A)
Of course, compact
=
Sup KcA
~ (K)
here we take the least upper bound of the measures
of all
subsets of A.
A similar result holds for (Tg)-I(A). On the other hand,
according
to corollary
the compact set K, is the greatest continuous Then,
functions
1.3.5, ~(K),
the measure
lower bound of the measures
above the indicator
of
of all
of K.
for every element A in the o-algebra
~, the following
equality
holds: ~(A)
2.1.8.
Example.
=
~((Tg)-I(A))
Taking the topological
dynamical
duced in 2.1.2 we can define a probability Let
system that we intro-
on it in the following way.
(Pl .... ,pn ) be a n-uple of strictly positive
is I, where n is the number of elements If f is a function only depending example
the coordinates
(f) If we take another
=
real numbers whose
sum
in I.
on a finite number of coordinates
in the finite part F), ~(f)
(for
is defined by
i=n ~ F f(a).( ~ Pa. ) aeI i=l l finite part F' that contains
F, the value of u(f)
is
the same. Then the above formula defines value
1 to the constant
a positive
function
linear functional,
I, on the vector
giving the
space of all functions
that only depend on a finite number of coordinates.
18
According
to the Stone-Weierstrass
the uniform norm on C(X), can be extended
theorem,
this subspace
and the uniformly continuous
to the whole
space C(X).
is dense for
linear functional
Here we get a Radon probability
measure ~ on X. It is easy to see that ~ is invariant under tions.
Such an abstract
2.1.9.
Example.
dynamical
Consider
the action of G by transla-
system is called a Bernoulli
the dynamical
system T(G)
introduced
scheme. in 2.1.3
of all total orders on G. All O(F,t)
are open and closed
Let us define a positive which
consists
subsets
of T(G).
linear functional ~ on the subspace of C(T(G)),
of all functions
depending
solely on the restriction
of
the order to a finite part of G, by giving in a coherent way the mass of all O(F,t). It is possible
to do this by setting
The above subspace
~(O(F,t))
is dense in C(T(G))
the Stone-Weierstrass
theorem;
for the uniform norm according
and the functional
of a unique Radon probability measure still
= I/IFI! to
~ is the restriction
on T(G) which we shall nonetheless
call simply ~.
One can easily ensure that ~ is invariant under the action of G on T(G) by translations;
in fact, ~ is invariant under the action of the wider
group of all homeomorphisms According
of T(G)
2.1.10.
Remark.
variant
Radon probability measures
systems.
In the two previous
However,
probability
topological
to find invariant
topological
dynamical
2.1.12.
Radon
system.
for every action on a compact Haus-
there is at least one invariant
A group G is said to have the fixed-point
acting by affine continuous
one-to-one mappings
pact and convex subset of a locally convex Hausdorff space,
dynamical
Radon
As we saw in 1.4.9 the group Z has this property.
Definition.
if, whenever
of G.
system.
examples we were able to build in-
in an arbitrary
dorff space by homeomorphisms, probability.
dynamical
in particular
it is not always possible
measures
But there exist groups such that,
2.1.11.
induced by the permutations
to 2.1.7, we then get an abstract
this compact convex space contains The Markov-Kakutani
theorem.
property on a com-
topological
vector
at least one invariant point.
An Abelian group has the fixed-
point property. The study done in 2.3 leads to one proof of this result,
cf. 2.3.6.
19
2.1.13.
Remark.
Concerning vocabulary:
the study of dynamical
systems
b e g a n with some actions of R and Z, which represented r e s p e c t i v e l y the e v o l u t i o n of a mechanical
system through time, and the d i s c r e t i z a t i o n
b e t w e e n regular time intervals of this evolution; hence the adjective "dynamical". In this text the most significant examples of dynamical be found in statistical mechanics.
systems shall
In these examples the group G does
not connote evolution but consists of isometries.
However,
not prevent us from calling such an action a dynamical
that will
system.
2.2. THE FIXED POINT PROPERTY AND THE AMEANING FILTER
2.2.1 Definition. We call a group G amenable if it has the fixed-point p r o p e r t y described in 2.1.11, however used, 2.2.2.
cf. Greenleaf Definition.
several other definitions
can be
(i). A group is said to give an invariant version of the
H a h n - B a n a c h theorem if the following extension result holds: Let E be a real vector space,
s a sublinear function on E, and T a
right action of a group G on E by linear one-to-one mappings preserving s, i.e. for every f in E and every g in G,
s(f) = s(f.Tg).
Let
F be a subspace of E invariant under the action of G, and m a linear functional on F bounded by s and invariant under the action of G. There exists a linear functional on E bounded by s and invariant under the action of G whose r e s t r i c t i o n to F is m.
2.2.3.
Proposition.
A group G gives an invariant version of the Hahn-
Banach theorem if and only if it has the fixed-point property.
Proof. Let us first show that the fixed-point p r o p e r t y leads to an invariant version of the H a h n - B a n a c h theorem. A c c o r d i n g to the H a h n - B a n a c h theorem, cribed in 2.2.2,
there exist,
in the s i t u a t i o n des-
linear functionals on E bounded by s, whose r e s t r i c t i o n
to F is m. The set of all these linear functionals of E* for the weak* topology.
is a convex and compact subset
20
The group G acts on this set by the affine one-to-one mappings
Tg(n)(f)
=
given by
n(f.T g)
There is an invariant point in this compact and convex set, i.e. a linear functional bounded by s and invariant under the action of G on E whose r e s t r i c t i o n to F is m.
In order to demonstrate the converse,
let K be a convex and compact
subspace of a locally convex Hausdorff topological vector space E, and let T be an action of a group G on K by affine continuous one-to-one mappings. Let E be the real vector space of all continuous real functions on K, and s the usual
sublinear function on E (defined in 1.1.3).
From the left action of G on K we can deduce a right action of G on E and still denote it by T
(f.Tg)(x)
=
f(Tg(x))
Next we take the subspace reduced to the function 0 as the subspace F. Thanks
to the fixed point property,
there exists a Radon p r o b a b i l i t y
m e a s u r e on K invariant under the given action of G. Because K is a compact and convex part of E*, the center of gravity of this p r o b a b i l i t y is an invariant point of K.
2.2.4.
Example.
We can prove that finite groups are amenable.
Indeed,
if
the finite group F acts on the compact and convex set K, the m e a n value
(IlIFI). [ Tg(x) geF is a fixed point of this action for every x in K.
2.2.5.
Example.
To prove that Z is amenable,
let K be a compact convex
set and T be an affine continuous one-to-one mapping of K. Consider a point x in K and the sequence
(Mn) of affine continuous oper-
ators on K given by
Mn
=
n-i (l/n). ~ Ti
Let x' be a limit point of the sequence
x n = Mn(X).
And let p be any of the continuous pseudo-norms
that define the topology
21 of the locally convex topological
vector
space in which K lies.
Because p is continuous
p(x'-T(x'))
~
Inf Sup P(Mn(X)-T.Mn(X))
N And because
Mn(X)
- T.Mn(X)
p(x'-T(x'))
= (i/n).(x - Tn(x))
~ Inf Sup (2/n).p(x)
N The value p(x'-T(x'))
n~N
n~N
is then equal to 0 for every continuous
n o r m on E; and because E is a Hausdorff
space this means
pseudo-
that x' is an
invariant point. 2.2.6.
Remark.
property
In the above example,
we demonstrated
the fixed-point
for Z by finding a sequence of finite parts of Z, the segments
A n = {0 ..... n-l},
such that the ratio
(I/IAnl).IAnAAn T I tends to 0
when n tends to infinity. All of which 2.2.7. F(G),
leads us to the following
Definition.
definition.
Given a finite part D of G, m D is the function on
the set of all finite parts of G, defined by mD(A )
=
l{xeA,~deD,dx~A}l
Intuitively we can see that A is as invariant under the right translations by the elements 2.2.8.
Definitions.
of D as the ratio
mD(A)/IA I
is small.
A group G is said to have an ameaning
filter
if, for
every finite part D of G and every positive real number 6, there exist finite non-empty parts of G such that the ratio
mD(A)/IA 1
is less
than o. The non-empty
set of all these finite parts of G is then denoted by
M(D,6 ). It is clear that if D' contains contained
The parts M(D,~) positive
of F(G)\{~}
real number)
we call the ameaning 2.2.9.
Proposition.
amenable.
D, and if 6' is less than 6, M(D',6')
is
in M(D,~). (where D is a finite part of G, and 6 is a
are then a basis of a filter M on F(G)\{~},
which
filter of G. If the group G has an ameaning filter,
it is then
22 This proof shall be similar
to the one in example
2.2.5.
Let K be a compact and convex subset of a locally convex Hausdorff
topolo-
gical vector space, and T an action of the group G on K by affine continuous one-to-one mappings. For every finite and non-empty part of G, consider
the average operator
M A given by MA
=
For every pair
(I/IAI).
(D,~),
~ Tg geA
F(D,~)
stands for the closed subset of K, which is
the closure of the union of all images of K by the operators
M A where A
is an element of M(D,~). Because
the set of all M(D,~)
has the non-empty
is a filter basis,
finite intersection
Next let F be the common intersection
the family of the F(D,~)
property. of all F(D,~),
and let y be a point
in F. For every h in G and every continuous
pseudo-norm
p on E, the following
inequality holds: p(y - Th(y))
Then,
Inf (D,~)
Sup AeM(D,~)
Inf (D,d)
Sup AeM(D,~)
P(MA(X)
Th.MA(X))
(2/IAl).m{h}(A).p(x)
for every h in G and every p,
p(y - Th(Y))
= 0
And y is invariant under the action of G. The rest of part 2.2 is devoted to the proof of the converse of the previous result:
every amenable group has an ameaning
This theorem was first proved by E. F~Iner tained in our work 2.2.10.
filter.
(I). We give here the proof ob-
(2) which comes from the study of invariant
Definition.
An invariant
capacity
capacities.
is a real function on F(G) with
the four following properties: i)
m(~)
=
0
ii)
for every pair
(A,B),
(strong subadditivity)
m(AUB)
+ m(A~B)
~ m(A) + m(B)
23 iii)
for every finite part A and every g in G,
iv)
there exists
m(A)
= m(Ag)
(right invariance) a positive
and a, the increment
2.2.11.
Example.
the cardinal
A -->
functions m D for instance, 2.2.12.
m(A~{a})
The first example
function
Proposition.
constant K such that,
of an invariant
Properties
used to verify
than -K.
capacity is given by
as the following
proposition
examples,
shows.
For every finite part D of G, the real function m D
(i) and
(iii)
capacity.
evidently hold.
The constant
I can be
(iv).
Next we prove the decisive Therefore,
is greater
IAI. There are some less trivial
on F(G) defined in 2.2.7 is an invariant Proof.
- m(A)
for every A
calculate
strong
subadditivity
the difference
which is the integral
mD(A)
property
+ mD(B)
for the counting measure
(iii).
mD(AnB)
- mD(AUB),
on AUB of the function
IA.I(D-IAC ) + IB.I(D-IB c) - !(ANB).I(D-I(A~B)C ) - I(D-I(AuB)c ) This function
is always non-negative,
dering every possibility, 2.2.13.
Definition.
greatest
The mean value of an invariant
capacity.
capacity m is the
lower bound q(m) of the ratio m(A)/IA I where A is a finite non-
empty part of G. Due to the property ties,
as one can see by carefully consi-
and thus m D is an invariant
q(m)
2.2.14.
Remark.
whereas
the function
homogeneous
(2.2.10
(iv)) of invariant
capaci-
is finite. The set ~ of all invariant m-->
q(m)
capacities
is a convex cone,
is clearly increasing
and positively
on E.
The existence
of the ameaning
filter
is proven when,
for every finite
part D, the mean value q(m D) is equal to 0. It is clear that the following
mD ~ Therefore, point,
d~D
subadditivity
result holds:
m{d}
we must now state the result for the parts reduced to a
and show the subadditivity
property of q on the cone E.
24
Let then d be an element of G. If d generates Otherwise,
a finite group F, one has
when n tends to infinity,
(i/IFl).m{d}(F)
= 0 .
the limit of the ratio
(i/I Anl ) .m{ d} (An) where
A n = {di,i=O,..,n-l},
The subadditivity
is equal to O.
property of q is the most important point of the proof.
In order to demonstrate
it we will rely on several
But first we must state a definition 2.2.15.
Definition.
support,
Let f be a non-negative
tive coefficients,
of f as a combination
we select a special
0 = s 0 < ~i < "'" < o k
values of f in ascending The following
special
2.2.16.
of indicators
one as follows.
be the finite sequence of the different
i=k ~ (~i-~i_l) I i= 1 " (f>=i) of f is called its pyramidal
Let m be a strongly
subadditive
decomposition.
function on F(G),
and f
function with a finite support on G.
Among all possible decompositions with positive
coefficients,
est lower bound to the sum Consider
f =
of f as a combination
the pyramidal
decomposition
of indicators gives the great-
[ ~Am(A).
an arbitrary
decomposition
of f:
~ ~AIA Ael
Let J be the set of all finite parts of G which is obtained by making repeated unions
and intersections
Then J is finite and the finite parts Choose a maximal of elements
(f~i)
element among all finite
in J containing
Every indicator were not,
with posi-
order.
decomposition
Lemma.
a positive
Proof.
with a finite
equality holds:
f = This
function,
lemma.
on a set G.
Among all decompositions Let
theorems.
and prove a combinatorial
the
I A is constant
(f~i),
of elements
from I
of I.
belong to it.
strictly increasing
and call this sequence
on every non-empty
the finite part K' = (AnKi+I)UK i
set Ki+ ~ K i ;
sequences (KI,.,Kn). if it
would be strictly between K i
25
and Ki+ 1 for the order, and this would contradict the maximality of the sequence. We use an Abel transform to obtain
(~i-ai_l) m(f>~ i)
=
~km(f=~k) +
k-I ~ ~i(m(f_->~i) - m(f>~i+l)) 1
Inserting the other K i of the sequence, we see that the first sum is equal to n f(i) .(m(Ki)-m(Ki_l))
+
f(1) .m(K I)
where f(i) is the constant value of f on Ki\Ki_ 1 (on K 1 for f(1)). Replace f by the given decomposition to get the following expression of the sum relative to the pyramidal decomposition: n
A~ei ~A
~ (A(i)'(m(Ki)-m(Ki-l))
+ A(1)'m~KI)
)
Because of the strong subadditivity property of m, the difference m(Ki)-m(Ki_l) equal to i.
is less than or equal to
m(A~Ki)-m(A~Ki_l) , when A(i) is
We then get the inequality
(~i_~i_l).m(f>~i)
<
~ ~A.m(A) Ael
which is the result sought to accomplish the proof. 2.2.17. Corollary. Mapping a function f with a finite support on G to the real± number ~ (~i- ~ i - l ) ' m ( f ~ i ) gives a sublinear function on the cone C$(G), which itself consists of all non-negative functions with a finite support on G. The restriction of this mapping to the indicators is the capacity m. The correspondence between m and its sublinear extension is a linear mapping. The proof of all these results is straightforward. 2.2.18. Definition. Two functions fl and f2 on G are said to have the same variation table when, for every pair (x,y) of elements in G, the following inequality holds:
26
(fl(x)
- fl(y)
).(f2(x)
- f2(y)
belong
to C~(G),
they have the same variation
If the two functions
if and only if the inclusion of all
(fl~=i)
2.2.19.
Proposition.
additive Proof.
The canonical
(K 1 ..... Kn)
to an invariant with a finite
extension
sublinear
to m(fl)
function
The next
For every element This
- m(g),
f in CK(G)
where
invariant
Proof.
to
(fl~i)
and all
in the proof of lemma 2.2.16
invariant
on the whole
function
+ on the cone CK(G)
space CK(G)
of functions
extension
of an invariant
capacity
the least upper bound of the differences g and f+g belong
on CK(G)
and an extension
To get a non-negative
+ f-, where belongs
is
lemma deals with this.
the functions
least upper bound defines
near,
of all
capacity
table.
+ m(f2).
2.2.20. Lemma. Let m be the canonical + to the cone CK(G). m(f+g)
sequence
of m(f)
this sublinear
support.
table
set consisting
of an invariant
have the same variation
the increasing
is equal
We have now to extend
0
is a total order on the finite
The second expression
that m(fl+f2)
~
(f2~j).
when the two functions
Call
(f2~Bj). shows
and all
)
a real
to C~(G),
function which
of m.
sum f+g we have to choose
f- is the negative
C~(G).
is finite. is subli-
a g equal
to
part of f (f = f+ - f-), and where
We then look for the least upper bound of all differences m(f + + ~)
m(f- + ~)
+ with a function ~ in CK(G). The following relation holds: m(f++~)
- m(f-+})
=
(m(f++#)-m(})) m(f +)
And property
(iv) of 2.2.10
h is the counting measure:
easily gives
+
+ (m(#)-m(f-+%))
(m(+)-m(f-+~))
the following
inequality
where
27 m(f-+~)
- m(~)
The difference
m(f+g)
real function
on CK(G)
- m(g)
still denote
-K.h(f-) is then bounded
is therefore
Because m is subadditive shall
>
extends m, but for simplicity
on CK(G) , function m is invariant to be shown that m is subadditive.
of G containing
both supports
positive
c such that
of g and f+g. f + c.l A
c, the subadditivity
m(f+g+c.l A)
~
and positively It is possible
is positive. + of m on CK(G)
c.l A has the same variation 2.1.19
m(f+g)
m(g)
fl + c.l A
~
~
table as f+g, enabling
us
m(f+c.l A) - m(c.l A)
f2 + C'IA
m(fl+ f2 )
yields
to get the inequality
real number
and
to find a
m(f+c.l A) + m(g)
to use proposition
For every positive
homogeneous.
Let A be a finite part
On the other hand,
that
we
it by m.
It remains
number
and the
well defined.
this function
So defined
For such a number
by m(f +) + K.h(f-);
m(fl+
E, we can then find a function are non-negative f2 + 2c.I A)
c.l A such
and
2m(c.l A) +
Then m(fl+
f2 )
$
e + (m(fl+ C.IA)-c.m(A))
+ (m(f2+ C.iA)-c.m(A))
E + m(f I) + m(f 2) Because
this is true for every e, m is subadditive
2.2.21.
Corollary.
cone E, consisting linear
invariant
2.2.22.
Theorem.
The above extension of all invariant
functions
the right
capacities,
on the vector
from the
to the cone of all sub-
space CK(G). space of all real functions
on the group G.
Let m and n be two sublinear under
gives a linear mapping
Let E be the real vector
with a finite support
on CK(G).
translation
functions
on E, and let them be invariant
action of G on E given by
28
(f.Tg)(x) Then,
=
f(x.g -I)
for every element f of E, the inequality holds Sup ~eK(E,m+n,G)
~(f)
~
Sup ~(f) ~eK(E,m,G)
+
Sup ~(f) ~eK(E,n,G)
where the symbol K( ..... ) denotes
the set of all linear functionals:
first,
bounded by a sublinear
on a vector
finally, Proof.
space;
second,
function;
and,
invariant under a group action.
Consider
the product vector space ExE, the component-wise
action of G on it, and the sublinear
right
function u given by
u(fl,f 2) = m(fl) + n(f 2) The diagonal mapping
subspace A of ExE can be identified with E and this natural
sends u to m+n, yielding
the equalities
Sup ~(f) ~eK(E,m+n,G)
Sup ~eK(A,u,G)
=
Sup ~(f) ~eK(E,m,G) Sup ~(f) ~eK(E,n,G) Because of the fixed-point
=
Sup ~eK(Ex{0},u,G)
~ (f,0))
Sup ~eK({ 0}xE,u,G)
~((0,f))
property,
an invariant
subspace which is bounded by an invariant variant extension sublinear
~((f,f))
linear functional
sublinear
function has an in-
to the whole space which is still bounded by the given
function.
The least upper bounds in the right sides of the three equalities therefore
relative
Subadditivity
are
to the same set K(ExE,u,G).
immediately
results because
(f,f)
is the sum of (f,0) and
(0,f). 2.2.23.
on a
Theorem.
As a real function on z, q is subadditive.
Proof. To derive this result from theorem 2.2.22,
it will suffice to
identify q(m) with the least upper bound of ~(l{e }) on K(E,m,G). For every ~ of K(E,m,G), ~(l{e })
=
the following holds: (I/IAI).~(IA)
~
(I/IAl).m(A)
29
Whence the inequality q(m)
(I{ e} )
It remains to be shown that the linear functional ~ on E, given by ~(l{e }) = q(m) is bounded by m. For every f in CK(E) , using the counting measure h on G, we get ~(f)
=
h(f+).q(m)
-
h(f-).q(m)
Using the pyramidal decompostion of f+, we write h(f+).q(m)
=
~ (~i-~i_l).h(f~i)-q(m)
and then obtain the inequality h(f+).q(m)
~
~ ( ~ i - ~ i _ l ) . m ( f ~ i)
Since there exist invariant linear functionals
=
m(f +)
on CK(G) bounded by m,
the following inequality holds: Sup feC~(G),f~0
( - m(-f)/h(f)
)
< =
(m) q
The number ~(f) is therefore
less than or equal to
When c.l A tends to infinity,
this sum is the limit of
m(f +) + m(c. IA-f-) For an A sufficiently
large,
m(f +) + m(-f-).
m(c.l A) c.l A - f-
and f+ have the same variation
table. The function m then verifies m(f)
=
m(f +) + m(-f-)
which achieves the proof. 2.2.24. Remark. The fixed-point prove,
property was used for a second time to
for a negative f, the inequality
30 h(f)
q(m)
<
m(f)
This was merely for simplicity 2.2.25.
Conclusion.
Amenable
(see Moulin Ollagnier
groups have an ameaning
and Pinchon filter.
As stated in remark 2.2.14,
the subadditivity
on z remained
this has just been accomplished
to be proven;
(2)).
property of q as a function in the above
theorem.
2.3. AMENABILITY
AND ALGEBRAIC
CONSTRUCTIONS
We give now some results on the stability under
several
Amenability and direct 2.3.1.
algebraic
of the fixed-point
of abelian and solvable groups, limits,
follows
Proposition.
of their finite extensions
from this study.
If the group G has the fixed-point property,
if H is the image of G by a group h o m o m o r p h i s m Proof.
property
constructions.
and
s, H is also amenable.
Let T be an action of H on a convex and compact part of a locally
convex Hausdorff vector
space by affine continuous
one-to-one mappings.
Define an action of G on this convex set by setting Tg(x)
=
Ts(g)(x)
Because G has the fixed-point iant point under
property,
there exists
at least one invar-
this action of G, which is then an invariant
point under
the given action of H. 2.3.2.
Proposition.
A subgroup
Proof.
We use here the ameaning
of an amenable group is amenable. filter.
For every finite part D of H and every positive real number exists
a finite part A of G such that
roD(A)
=<
~. ]A[
5, there
31
Dividing A between
the left cosets of H in G that it meets,
we see that
this set A is the disjoint union of the Ai.x i = HxinA. On the other hand,
the set
{xeA,~deD,dx~A}
is the disjoint union of
the sets {xeAi,~deD,dx~A i} because D is a subset of H. The two equalities
IAI
therefore
simultaneously
= ~ IAil
and
At least one of the Aix i belongs
hold
mD(A)
to M(D,~)
=
mD(Aixi )
as well as its right trans-
late Ai, which is a part of H. We have now proven the existence 2.3.3.
Proposition.
amenable Proof.
of the ameaning
filter
for H.
Let G be an extension of an amenable
group N by an
group H. The group G is then amenable.
Let T be an action of G on K with the usual
The restriction
of this action
has some invariant points The non-empty
to the normal
subgroup N of G
in K.
subset K N consisting
convex and compact.
conditions.
amenable
Moreover,
of G because N is a normal
of all these invariant points
KIN is globally invariant under
is
the action
subgroup: -i
Tn(Tg(x))
The restriction
=
Tn'g(×)
=
Tg(T g
"n'g(x))
=
Tg(x)
to K N of the given action of G is in fact an action of
the quotient group H, i.e. the mapping T g, restricted
to KN, depends
solely on the coset of N to Which g belongs. Indeed,
if x belongs Tg-n(x)
to KN, =
Because H is amenable,
Tg(Tn(x))
Proposition.
nable groups, Proof.
Tg(x)
there is at least one invariant point in K N for
this action of H; this point 2.3.4.
=
is invariant
for the given action of G.
If G is the union of a directed family
(G i) of ame-
G is amenable.
The family
(Ki) of compact
convex subsets
of K, where K i is the
set of all points of K which are invariant under the action of Gi, has the non-empty
finite intersection
The intersection
property.
of all these K i is the non-empty
set of the points
that
32
are invariant under the whole action of G. 2.3.5.
Proposition.
Proof.
Consider
the usual
The free group with two generators
the vector
sublinear
the vector
is not amenable.
space of all bounded real functions
function
s and the action of L(a,b)
on L(a,b),
preserving
s on
space: f.Tg(x)
=
f(gx)
Let us show that there is no linear functional
on E both bounded by s
and invariant under the action of L(a,b). A linear functional
bounded by s is a positive
linear functional
with a
total mass equal to i. Call it a mean. Consider
the four following
subsets
of L(a,b):
A+
is the subset of all words beginning with a
A-
is the subset of all words beginning with a -I
B+
is the subset of all words beginning with b
B-
is the subset of all words beginning with b -I
The whole group L(a,b) of the part reduced
is the disjoint union of these four subsets
to the unit element.
For every mean on L(a,b), I
=
Moreover,
and
the equality holds
v(A +)
+
v(A-)
IA+oT a
~
1
+
~(B +)
+ v(B-)
+
v({e})
IA-
An invariant mean would then verify
~((IA+) which contradicts
+ (IA-))
~
I
and
~((IB+)
+ (IB-))
~
i
the above equality.
2.3.6.
Remark.
groups
and Z by using the constructions
All known amenable
groups are obtained
from the finite
that we have so far described
in 2.3. 2.3.7.
Conjecture.
A standard conjecture
is that a non-amenable
group
33
has a subgroup 2.3.8.
isomorphic
Proposition.
to L(a,b).
Consider
the group B, consisting
of Z with a finite support. the semi-direct product
The group G of permutations
of Z obtained
of B by the group of all translations
nable and finitely generated. solvable
of all permutations
However,
as
is ame-
G is not a finite extension of a
group.
Proof. Denote by S A the finite subgroup of all permutations
of the fi-
nite part A of Z. The group B is the direct Then,
according
limit of S A when A tends to Z.
to 2.3.4 and 2.2.4,
Call o i the transposition
this group is amenable.
of the points
i and i+l.
The group B is generated by the set of all o i where if we denote by T the unit translation, between
the following relations
•
=
°
Tl°°0°T-i
G is generated by the two elements
extension
o 0 and T. It is amenable
of an amenable group by an amenable
It remains
to be proven that the kernel
F is finite there exists
Let k be an integer The mapping homomorphism Then,
as an
(2.3.3).
group.
an integer n such that h(T n) = e F.
and K be the finite part
from S K to B given by
group
of every group h o m o m o r p h i s m h
from G in a finite group F is not a solvable Because
hold
these generators:
qi Thus,
i is in Z; and,
{0,...,k-l}
o --> ~.Tnk.~.T -nk
and its image is contained
for every k, the kernel Ker(h)
in the kernel
contains
.
is a one-to-one of h.
a subgroup
isomorphic
to
S K and is therefore not solvable.
2.4. REFERENCES
For the existence and Krylov
of invariant probability measures,
to Bogoliubov
(I).
Invariant means on groups are examined by Greenleaf Markov-Kakutani also by Bourbaki groups).
refer
theorem is discussed (3)
(I).
of course by its two authors but
(with a generalization
to the case of solvable
34
Regarding paper
the existence
of the ameaning filter,
(I) or refer to Moulin Ollagnier
see F~Iner's
and Pinchon
original
(2,7) for an alter-
nate proof and a study of locally compact amenable groups.
3. ERGODIC THEOREMS
3.1. INVARIANT LINEAR FUNCTIONALS
3.1.1. Ergodic theorem. Let E be a real vector space, s a sublinear function, and T a right action of an amenable group G by linear one-toone mappings preserving s on E. Then, for every f of E, the following holds: Sup ~(f) ~eK(E,s,G)
=
Inf AeF(G)
(I/IAI).S(gYAf.Tg)
=
lim sup (i/IAI).s( [ f.T g) M geA
Proof. Since G is amenable, there exist linear invariant functionals bounded by s on E. Denote by K(E,s,G) the convex and compact set consisting of all these functionals and let ~ be one of them. Then, for every f in E, ~(f)
=
(I/IAI).~( ~ f.T g) geA
and the two inequalities easily follow Sup ~(f) ~eK(E,s,G)
~
Inf AeF(G)
(I/nAl).s( X f.T g) geA
lim sup (I/IAI).s( ~ f.T g) M geA Consider the function p on E given by p(f)
=
lim sup (i/IAI).s( X f'Tg) M geA
So defined, p is a sublinear function and, according to 1.1.7, is the least upper bound of all linear functionals below it.
36
In order to complete
the proof,
convex compact sets K(E,s,G) The first inclusion,
we have to show the equality of the two
and K(E,p).
K(E,s,G) ~ K(E,p),
tivity property of s gives
is already proven.
the inclusion of K(E,p)
The subaddi-
in K(E,s).
Now the only thing to be proven is that a linear functional, p on E, is invariant,
i.e. equal
to 0 on the elements
bounded by
f-f.T h with f in E
and h in G. What is true for f is also true for -f and we have only to show that p(f-f.T h) is equal
to 0.
Let us calculate p(f-f.T h)
=
lim sup (I/IAl).s( M
~ (f-f.Th).T g) geA
lim sup (i/IAl).(s(f)+s(-f)).m{h}(A) M 3.1.2.
Corollary.
As the upper
limit along the ameaning
the theorem above is equal to the greatest tity on the set of all non-empty
filter used in
lower bound of the same quan-
finite parts of G, this upper
limit is
in fact a limit. 3.1.3.
Example.
continuous
Consider
the usual vector
near function
s defined
for every continuous
a right action that preserves
function
to the limit p(f) of the ergodic
3.1.4.
Corollary.
number p(f), For an upper
Proof.
on X, is equal
along M
I foT ). geA g
In the above situation,
functions
it is possible
functions
on X with values
semi-continuous
measure ~ on X, ~(f) eventually
averages
not only for continuous
semi-continuous
s on C(X).
f on X, the least upper bound of all
~(f), where ~ is an invariant Radon probability measure
(I/IAl)'s(
of all
space X, and the subli-
in 1.1.3. When acting on X by homeomorphisms,
the amenable group G induces Then,
space C(X) consisting
real functions on a compact Hausdorff
to define the
on X, but also for upper
in the interval
E-~, +~]
function f and an invariant Radon probability
is well defined by regularity;
and p(f), which is
equal to -~, is still the least upper bound of all ~(f).
Consider
the upper
the value ~ (f) to ~.
semi-continuous
function on K(C(X),s,G)
giving
37 Because
function p is non-decreasing,
corollary
1.3.7 of Dini's
lemma
gives us Sup ~eK(C(X),s,G) The converse 3.1.5.
inequality
Definition.
~(f) holds
A real
=
Inf ~eC(X),~f
and the proof
function
p(~)
~
p(f)
is achieved.
c on the set F(G) of all finite parts
of G is said to be subadditive
if c(~)
decomposition
of a finite part A as a combination
indicators following
of the indicator
of subsets inequality c(A)
Function
F(G)
Remark.
coefficients,
of
1 A = ~ ~BIB,
the
~ ~BC(B)
The subadditive there
Remark.
to 0 and if, for every
holds:
g of G, the numbers
for which
3.1.7.
of it with positive
c is said to be invariant
every element 3.1.6.
g
is equal
if, for every finite part A of G, and c(A)
functions
is a sublinear
According
and c(Ag)
are equal.
are exactly
extension
to lemma 2.2.15,
the functions
on
to the cone C~(G).
strongly
subadditive
functions
are subadditive. 3.1.8.
Lemma.
Let B be a finite non-empty
part A of G, the following I{ geG,BgcA} I The positive
<
IAI
function A B on F(G) &B(A)
=
The set
=<
hold:
I{ geG,Bg0A#@} ]
is then defined by
I{geG,BgnA#@,Bg~A}I
and the limit of the ratio Proof.
inequalities
part of G. For every finite
{geG,BgaA}
(I/IAI).AB(A) is exactly
along M is equal the intersection
where b is an element
of B; and the number
is less than or equal
to the cardinal
The set
{geG,Bg~A~}
cardinal
number
of A.
of all b-IA,
of the elements
number
to 0.
of this set
of A.
is the union of all b-IA where b is in B; and the
of this set is of course
greater
than or equal
to the one
38
On the other hand, A B is bounded by a sum of standard elements AB(A)
~
~ l{geb-IA,~b'eB beB b~B
Proposition.
g~b'-iA}l
mBb-l(A)
and the limit along M of the ratio 3.1.9.
of Z
(I/IAI).AB(A)
is therefore
0.
Let G be an amenable group and c a subadditive
invar-
iant function on F(G). Then,
the greatest
not empty,
lower bound of all averages
(I/IAI).c(A),
is equal to the limit of the same averages
where A is
along the ameaning
filter. Proof.
It is quite clear that the greatest
equal
to the upper limit along the ameaning
We must merely prove,
for every non-empty
(I/IBI).c(B) Consider
therefore
positive
combination IA
Dividing
(I/IBI).
properties
[ c(B) Bg~A
(I/IAI).I{g,BgcA}I
(I/IAI).I{g,BgOA#~, Remark.
continuous
Bg~A} I
+
allow us to write Sup c(C).[{geG,Bg~A#@,Bg~A} C=B
I
tends to i along M and the ratio
tends to 0 along M according
The use of the ergodic
function on a Hausdorff
alternate proof of the previous It is indeed possible
theorem 3.1.4 with an upper semi-
proposition. subadditive
function
space X, an action T of G on X by homeomor-
and an upper semi-continuous
finite part A of G,
to lemma 3.I.8.
compact set would have led to an
to find, for every invariant
on F(G), a compact Hausdorff phisms
I A as a
of subsets of B:
by IAI and taking the limit along M lead to the result because
the first ratio
3.1.10.
~
of the indicator
of right translates
l i Bg~A#@ Bg~A
and invariance
c(A)
finite part B, the inequality
decomposition
of indicators
(i/IBl).
filter.
lim sup (I/IAI).c(A) M
the following
=
The subadditivity
~
lower bound is less than or
function f on X such that, for every
39
c(A) Consider
=
s( [ foT g) geA
the compact Hausdorff product
action of G on Y by translations
space Y = [-~ ,c({e})]G,
and the closed invariant
the
subspace X,
defined by all the inequalities
~e n(g) g A The triple
<
c(A)
(X,T,f), where T is the restriction
and f is the coordinate
function
looked for in order to complete 3.1.I1.
Remark.
all M(D,~),
The ameaning
to X of the action of G
at the unit element e of G, is what we the proof.
filter,
that we defined with the basis of
can also be defined by other basis.
For every finite part
ml,...,m n
number ~, M(m I ..... mn;e)
denotes
of the cone E, and every positive the non-empty
real
subset of F(G) consisting
of all finite non-empty parts of G for which all differences
(l
are
less than
(I/]AI).mi(A) These
sets constitute
convergence
result
- q(mi)
a basis
of the ameaning
filter according
to the
3.1.9.
For every finite non-empty part B, function n B is given by
nB(A) So defined, F(G)
=
n B is an element of E; and the sets M(nB;e) , for all B in
and all positive
Indeed,
the following
nB(A) 3.1.12.
Remark.
arbitrarily
IB.AI
real numbers
E, constitute
a basis of M.
inequality holds with any given element b 0 of B:
-]AI
__< [BI.mB.boI(A)
Let B be a finite non-empty
invariant with A, which means
part of G. Then,
that,
B.A is
for every element M I of
M, there exists another element M 2 of M such that B.A belongs whenever
A belongs
to M I
to M 2.
Given a finite part B of G, the intersection invariant with A. In the first case, use the M(nD;c)
~ b-IA
is also arbitrarily
beB as a basis of M and observe
that
40 nD(B.A ) The following
=
nD.B(A)
inequality
I
<
then gives
(i/I B.AI ) .nD(B.A)
On the other hand,
=<
(i/l AI ) .nD.B(A)
the inequality
mD( ~ b-iA) beB which
the proof of the first result:
-I(A) mB'D'bo
< =
is true for any given element
b 0 of B, leads
quickly
to the proof
of the second result. 3.1.13.
Example.
functions
i
g G (2.1.3) with Denote
Given an element m of Z, it is possible
on the compact
Hausdorff
the increments
therefore
space T(G)
by g$ the past of g for the order 3, i.e.
in G that are strictly
The greatest
lower bound of all increments iA(T) a
=
strongly Denote
to G\{a},
the set of
-
m(a~A)
of G that do not contain
also the limit of these
a is, when A
increments
because m is
subadditive.
by ia(T)
functions
this limit;
on T (depending
as a greatest
Because
of the property
bounded
below.
lower bound of continuous
solely on the restriction
finite part of G), i a is an upper
semi-continuous
of capacities
(2.2.10
of the order
function
(iv)),
If T is the action of G on T(G) by translations, property
on
less than g in the order 3.
m({a}U(a~A))
on the set of all finite parts tends
some
of m.
all elements
simply
to define
of all total orders
to a
on T(G).
this function
the following
is
invariance
holds: ig = ieoTg
One can easily verify s( [ i ) geA b When G is an amenable
that =
m(A)
group,
the ergodic
theorem shows
that the least
41
upper bound of the values by all Radon invariant But this particular lity measure
Indeed,
probability
example
~ on T(G),
~(i e ) because
taken on the upper
=
measures
is richer;
the following
semi-continuous is equal
function
e
to q(m).
for every Radon invariant equality
i
probabi-
actually holds:
q(m)
i e is the directed
greatest
lower bound of the continuous
functions
i A and because every Radon measure is regular there exists e ' ' for every positive real number ~, a finite part A 0 such that u(i~-i e) is less
than ¢ when A contains
Then,
A 0.
for every finite part A, re(A)
Integrating
=
I geA
over T(G)
(m((g$0A)U{g})
-
for the probability
m(g$~A)
)
~ gives -I
geA
geA
We arrive at the following
(I/IAI).m(A)
inequality:
=< e
+
~(i e)
+
(I/IA]).mA0(A).~(ie~-i e)
Take the limit on A along M, and let e tend to 0 to derive q(m) As the converse the equality
3.2.
~(ie)
inequality
follows
from the ergodic
VECTORS
Description.
AND MEAN ERGODIC
In this section,
we consider
a normed vector
space
group G on E by continuous
linear mappings.
We are therefore MA(f)
the proof of
THEOREMS
(E,II If) and a right action T of an amenable one-to-one
theorem
is achieved.
INVARIANT
3.2.1.
~
interested
of a given element
in the behaviour
f of E along
of the ergodic
the ameaning
filter
averages
42
MA(f) In the general
=
(I/IAI).
case,
~ f.T g geA
the closed hull C(f)
generated by f and its images
f.T g is not compact. 3.2.2.
Proposition.
suppose
We consider
the previous
that the norms of the operators
situation
and, moreover,
T g are uniformly bounded
(e.g.
they are isometries). Then,
there exists at most one invariant element
invariant hull C(f) Proof.
of an element
By definition
f of E.
the closed convex invariant hull C(f)
of the convex set of all convex combinations non-negative
in the closed convex
is the closure
~ ~(g) f.T g, where
~ is a
real function with a finite support and an integral
to I for the counting measure Every invariant
continuous
linear functional
If i I and i 2 are two invariant vectors linear
on E is constant on C(f).
of C(f),
for every continuous
invariant
In order to conclude
that il-i 2 is actually
have to prove that,
functional
~(il-i 2) is equal
continuous
to 0
~ on E.
the null vector of E, we
for every invariant vector of E different
there exists an invariant
equal
on G.
linear functional
from O,
~ on E such that
~(i) ~ O. The uniform bound property means
that there exists a positive number K
such that the norms of all T g are bounded by K. We can then construct respect
an invariant norm N on E which is continuous
with
to the initial norm N(f)
Applying
=
Sup Ilf.Tgll ~ geG
K. Ilfll
theorem 3.1.1 produces Sup ~eK(E,N,G)
~(f)
Using these equalities p(i) According functional extension
=
=
p(f)
=
Inf AeF(G)
(I/IAI).N(
for the invariant element
N(i)
=
to the Hahn-Banach given on Ri by
[ f.T g) geA
i, the following holds:
Ilill > 0 theorem,
~(i)=p(i)
~ is a linear functional
it is possible
to extend the linear
to the whole space E so that the bounded by p; the linear functional
43
is invariant and continuous 3.2.3.
Proposition.
and, moreover,
and ~(i)
Taking the hypotheses
assuming
Proof.
of the previous
the existence of an invariant
closed convex invariant hull C(f) norm-limit
is strictly positive.
of f, this single element would be the
We have to state =
0
Because 0 is the only invariant vector of C(f-i)
every continuous
riant linear functional
on E is equal to 0 on f-i.
Restricting
to the elements
this result
3.2.4.
=
Sup ~(f-i) ~eK(E,N,G)
since the initial norm Von Neumann's
theorem.
action of an amenable
=
ameaning
filter.
lira (I/IAI).N( [ (f.T g- i)) M geA
Let H be a Hilbert
space and T a right
group G on H by isometries. averages MA(X)
The limit is the orthogonal
closed subspace H G consisting
Proof.
and appling theorem
II II is bounded by N, the proof is achieved.
For every vector x of H the ergodic
under
of K(E,N,G)
inva-
we get 0
Then,
in the
along M of the averages MA(f).
lim (I/IAI).II I (f.Tg - i)~ M geA
3.1.1,
proposition
element
of all vectors
converge
projection
along the
of x on the
of H, that are invariant
the action of G. The family of norms of all average operators
by I; and the set of all x, for which the averages the orthogonal
projection
of x on HG,
Let us first show the convergence If x belongs
to HG,
If x has the form a computation orthogonal
the property
converge along M to
is then a closed subspace
result for special
of H.
vectors of H.
clearly holds.
y-y.T h, the averages
that we just did several
projection
is uniformly bounded
converge times.
to 0 along M thanks
The null vector
to
is the
on H G of such an x because y-y.T g is orthogonal
every invariant vector. It remains
to be proven that the subspace of H generated by H G and the
increments
y-y.T g is dense in H, and therefore
subspace reduces
to 0.
that its orthogonal
to
44
Let then z be a vector orthogonal Because
z is orthogonal
It is then orthogonal 3.2.5.
Mean ergodic
to H G and to all y-y.T g.
to the increments
it is an invariant vector.
to itself and therefore equal to 0.
theorems
in L p. Let
(X,&,~,T)
be an abstract dynamical
system where T is an action of an amenable group G; let ~ be the o-algebra of all invariant
events
in ~ and p be a real number greater
than or equal
to I. Then,
for every f in LP(x,~,~),
the ergodic averages
norm along the ameaning filter; tion
E(fl~)
of f with respect
The arguments theorem.
in the L pexpecta-
to the o-algebra ~.
of the proof are very similar to those of the Von Neumann's
The conditional
expectation
is a projection
with a norm i and the average operators I. The subspace
of LP(x,~,u)
this subspace
contains
ments
where h belongs
y-yoTh,
converge
and the limit is the conditional
elements
in LP(x,a,~) to
for which the result holds is then closed;
the invariant elements of LP(x,&,~)
and the incre-
to G and y to LP(x,~,~).
In order to prove the norm-density by the invariant
operator
have a norm less than or equal
in LP(x,a,~)
and the increments,
of the subspace generated
we have to examine
two
possibilities. First,
suppose that p is strictly greater
than i and consider
the conju-
gate number p' of p (I/p + I/p' = I). An element of LP'(x,
of L p', orthogonal
Let ~ be such an element. (l~I)(P'/P).sgn(~)
when p is equal
because
the measure
3.2.6. tems,
equal
result.
functions
norm-convergence
Because ~ is orthogonal
to I, an invariant
is finite;
to the invariant
element # of L ~ belongs
it is then orthogonal
The norm-convergence
For instance,
it is possible
continuous
element element to L I
to itself and
to 0.
Counterexample.
general
is an invariant
of L p, ~ is equal to 0.
Second,
therefore
to all the y-yoTh,
,~).
of averages
in the case of topological
to construct
counterexamples
are constant
and in which, however,
of the averages
is in no way a dynamical
sys-
in which all invariant there is no
of a given function along M to a constant
function. The next theorems make this remark more explicit. 3.2.7.
Theorem
(Furstenberg).
Let
(X,T) be a topological
dynamical
system,
45
where T is an action of an amenable group G on the compact Hausdorff space X by homeomorphisms. The two following properties
are equivalent:
I)
there exists a single invariant
2)
for every continuous to a constant
Proof.
Radon probability measure
function f, the ergodic averages
function.
It is clear that the second of these two properties
first because
implies
the
the value taken by the probability measure on the function
is the same as the value taken on the constant. to the constant According
on X
converge
itself and therefore
to the Hahn-Banach
p(f)
+
p(-f)
clearly determined.
theorem,
Radon probability measure yields, =
1~is value is then equal
the uniqueness
of the invariant
for every f in C(X), the equality
0
that is to say
limM xexSUp (i/IAI).geA ~ f°Tg This is equivalent to a constant 3.2.8. ergodic
limM xexinf (i/IAl)'g~Af°Tge
to the uniform convergence
function;
Definition.
=
and hence property
A topological
if the equivalent
dynamical
properties
of the averages
I implies Froperty
along M 2.
system is said to be uniquely
of theorem 3.2.7 hold for it.
3.2.9. Definition.
A topological
gically
if there exists a point x in X whose orbit under the
transitive
dynamical
system is said to be topolo-
action T is dense in X. This property clearly implies
that all invariant
continuous
functions
are constant. 3.2.10.
Example:
a topologically
ergodie topological
dynamical
transitive,
but nonetheless
Let G be an infinite amenable group. The dynamical 2.1.2 is not uniquely ergodic because different
probability
lity measures.
vectors
not uniquely
system. all Bernoulli
lead to different
system described
in
schemes built on
invariant
Radon probabi-
46 Let us first show that the only continuous constants,
invariant
and then solve the counterexample
For simplicity's two elements
sake, consider
functions
that we mentioned
are the earlier.
the case where the alphabet I has only
and denote them by +I and -I.
Call a x the coordinate
function at the point x of G given by Ox(~)
and denote by OA the product of the coordinate
functions
= $(x)
at all points
of the finite part A. The family of all functions OA is an orthonormal space L2(X,~,u),
where the abstract
scheme built on the probability When f is a continuous
dynamical
vector
basis of the Hilbert
system is the Bernoulli
(1/2,1/2).
function the following
inequality holds:
2 I (ffoo A d~ ) AeF(G) Moreover,
if f is invariant,
~
ff2 du
the equality
= holds for every g of G, and because of the preceding
inequality,
scalar product is equal to 0 for every non-empty
the
finite part A
of G. The following
equality
According
=
then holds for every A in F(G): .
to the Stone-Weierstrass
theorem,
the °A generate
space of C(X); and the two linear functionals are the same.
a dense sub-
on C(X), and .~,
Then /f2 d~ and the continuous
=
(ff d~ )2
function f is almost everywhere
equal to a constant;
and because ~ gives a positive mass to all open sets, f is actually a constant
function.
The same arguments constants;
show that the only invariant
and the abstract
dynamical
When the group G is countable,
system
elements
(X,~,~,T)
in L 2 are the
is ergodic.
the compact Hausdorff product space X has
a countable basis of open sets consisting
of the cylinder
sets [A,A],
47
where
[A,A]
is the closed and open subset of X [A,A]
=
{$eX,~xeA E(x)=+I,VxeA\A
The set D of all points
~(x)=-l}
in X whose orbit is dense is then the countable
intersection D
=
~ [A,A]
=
{ ~ ,3geG Tg (~) O[A,A]#~}
~
( U
[A,A]
(rg)-i ( [A'~ ) )
geG
All invariant events of the previous
intersection
measure which is then equal to i because cal system. Thus, ~(D) is equal to I, which implies topological
dynamical
have a strictly positive
of the ergodicity
of the dynami-
that D is not empty;
system is then topologically
3.3. INDIVIDUAL
ERGODIC THEOREMS
In the previous
section we were interested
and the
transitive.
in the asymptotic
behaviour
of the averages (l/n). where
(X,Q,~,T)
[ foT i 0
is an abstract dynamical
system with an action of Z and
where f an element of LI(x,~,u). In particular, conditional
we showed the norm-convergence
expectation
iant events. For this very special known,
Birkhoff's
of f with respect to the o-algebra
sequence of averages
theorem,
according
for every dynamical
another convergence
~ foT i k
=
n
result is also holds
to the mean ergodic theorem.
system with an action of Z, the norm-conver-
gence still holds for the sequence of averages (l/n).
to the
of all invar-
to which the convergence
almost everywhere. But this result is far from being analogous Indeed,
of these averages
48
where
(kn) is an arbitrary
The norm-convergence
sequence of integers.
is derived from the right invariance
(M(D,6))
of the ameaning
However,
it is possible
to shift the Birkhoff's
averages by a sequence
(k n) in such a way to lose the almost everywhere To do this,
Emerson
irrationnal
rotation of the one-dimensional
The counterexample
of the basis
filter.
(2) considers
the dynamical
convergence
result.
system generated by an
torus R/Z.
that we give next has still another nature
deals with generalized
Bernoulli
3.3.1.
Let ~ be the product
Counterexample.
since it
schemes. space R Z endowed with the
product o-algebra ~ of all Borel o-algebras. Given an element
i of Z, denote by X i the random variable
to the coordinate
at the point i.
If ~ is a probability measure on
(~,00
a sequence
translate
(An ) of disjoint
are integrable
the law ~ symmetrical and their expectation
Let T be the unit shift. convergence
=
with a first moment is equal
so that the X i
to 0.
theorem in L I gives
the norm
(i/n). ~ X i ieA n
Because
the A n are disjoint, According
the random variables
to the Borel-Cantelli
a the probability
that the upper
lemma,
M n are mutually
indep-
for a given positive
limit of the M n is greater
real than
to a is equal to I if and only if the series of the probabilities
of the events
(Mn~a)
If this holds
for every a, the upper
where
finite parts of Z such that A n is a
The mean ergodic
endent.
almost
P
of the averages M n to 0 w h e n n tends to infinity Mn
or equal
a single probability
independent with the same law ~.
of the segment {0,..,n-l}.
Choose moreover
number
on R, there exists
such that the X i are mutually
Consider
corresponding
diverges.
surely infinite
limit of the sequence M n is then
and this sequence
cannot converge almost every-
to 0.
To complete
the proof of the counterexample,
cal probability
law on R with a first moment
real number a, the following ~n
series diverges:
we must construct such that,
a symmetri-
for every positive
49
(u
*n
denotes
the n-th convolution
In the following propostion, 3.3.2.
Proposition.
power of u).
we construct
this law.
For every real number ~ strictly between
the function exp(-[t[ ~) is the Fourier
1 and 2,
transform of a probability
law
on R. For the corresponding
probability u~,
the function
(Ix[) 8 is integrable
if the real number 8 is less than ~. Moreover,
this law is a stable one,
independent
random variables
homothetic
n((l-~)/~).X
This probability Proof. Because
the law of the average
with a law ~
of a variable
gives the example
it is continuous
The positive
is the same as the law of the
that we looked for in 3.3.1.
and takes the value according
type is a classical
proven in several ways. of section XXII.I. the Fourier
from the properties
of n
X with a law p~.
The function t --> exp(-[t[ ~) is of a positive
transform of a probability,
Because
i.e.
It is possible,
of the "El6ments
1 at 0, it is the Fourier
to Bochner~s
result
theorem.
in Probability for instance,
d'Analyse"
transform is one-to-one,
type.
Theory that can be to solve exercise
by Dieudonn4.
the stability of p~ comes
of the function exp(-]t]~).
The law of the average
of n independent
random variables with a law pe
is then that of the homothetic (n ((I-~)/~) ) .X
of a random variable whose
law is ~ .
If we denote by ~ the Fourier the Fubini's
transform of p~, a simple application
theorem shows that the integral (I
- Re~(t))/t
of
of
B+I
is the product of the integral
over R for the Lebesgue measure
of
(I - cos u)/u B+I
by the expectation
for p~ on R of Ix[ B
If the real number ~ strictly for the Lebesgue measure
lies between 0 and 2, the integral
of the function
over R
2
50
(i - cos u)/u B+I
is finite and the two other integrals It is then easy to see that is strictly The series
are then simultaneously
Ixl B is integrable
for ~
finite
if and only if B
less than ~. that we are interested
as the integral
for ~
in has,
for every a, the same nature
of the function
Ixl (=-II~) and therefore
diverges.
The probability ~ 3.3.3.
Remark.
amenable tion.
group,
then provides
The existence
us with the counterexample
of ameaning
for which Birkhoff's
Only partial
results,
sequences
sought.
of finite parts of an
result holds,
is still an open ques-
such as for groups with a polynomial
growth,
are known.
3.4. THE SADDLE ERGODIC THEOREM
This section is devoted pological 3.4.1.
dynamical
to the proof of a minimax
sytems; we give this result in a very general
Saddle ergodic
theorem.
subset of a real locally convex Hausdorff
following
to toform.
Let K be a convex and compact non-empty
T be a left action of an amenable one mappings
theorem related
topological
group G by affine
on K, and let f be a real function on
vector
space;
continuous F(G) x K
let
one-towith the
properties:
i)
for every x in K, the partial
function
in the sense given in definition ii)
for every finite part A of G, the partial is concave and upper semi-continuous
iii)
f(.,x)
this function
is invariant,
=
function
f(A,.)
on K
i.e. for every g in G, every
finite part A of G and every point x of K, f(Ag-l,Tg(x))
is subadditive
3.1.5
f(A,x)
51
Then,
the following
Inf AeF(G)\{~}
minimax
(iii AI ) .Sup
f(A,x)
for the subset
action
of G.
Proof.
For every element inequality f(A,x)
of all invariant
of K under
the
Sup f(A,t) teK
of these subadditive
the limits
points
x in K G and every finite part A of G, the
along M of these averages taking
(ill AI ) .f(A,x)
Sup Inf xeK G AeF(G)\{~}
holds: ~
Also the averages Because
=
xeK
where K G stands
following
equality holds:
invariant
functions
and the limits
are in the same order.
along M are equal
to the greatest
lower bounds,
the least upper bound on all x in KG, we get the inequality
in
one direction Sup xeK G
Inf (I/IAl).f(A,x) AeF(G)\{ ~}
In order
to prove
function
on F(G)
the converse defined
T(A)
=
decomposition
lower bound,
=
call ~ the subadditive
and also the limit along M, of the
is nothing
to prove.
fix a finite part B of G and consider
of the indicator
iA
inequality,
by
ratio (I/IA I).~(A). If ~ is equal to -~, there case,
Inf (I/IAI).Sup f(A,t) AeF(G)\{ ~} teK
Sup f(A,t) teK
Let = be the greatest
In any other
=<
(I/IBI).
the positive
of A:
~ IBgQA BgNA#~
For every x in K, the subadditivity
property
of f(.,x)
leads
to the
inequality f(A,x) Setting
apart
and using
~
(I/IBI).
~ f(Bg~A,x) Bg~A#~
in the sum the elements
the invariance
property
g for which Bg is contained
of f, we get
in A
52
f(A,x)
~
(I/IBI).(
X f(B,Tg(x)) Bg~A
+
% f(BNAg-I, Tg(x)) BggA
Choose an x such that f(A,x) = T(A) and call x A the center of gravity of all Tg(x) where Bg is contained in A. Because the partial function f(B,.) is concave, the following holds: (I/I AI ) .]~(A)
=<
(i/I BI ) .f(B,XA) • (i/I AI ) • I{g,BgcA} I
+
(i/I AI ). (I/I B1 ) .AB(A) .Sup ]~(C) C~B
Let y be a limit point of the x A along M. Use the upper semi-continuity ot f(B,.) and lemma 3.1.8 to derive (I/IBI).f(B,Y)
~
Moreover, according to remark 3.1.12, the intersection O(b-IA,beB) becomes arbitrarily invariant with A; and y is an invariant point. The following inequality then holds for every finite non-empty part B: Sup (I/IBl).f(B,y) yeK G
~
We conclude the proof, according to corollary 1.3.7, by replacing the ratio (i/IBl).f(B,y) with its greatest lower bound on the set of all finite non-empty parts of G.
3.5 REFERENCES
About general ergodic theorems, refer to Aloaglu and Birkhoff (I). Furstenberg (I) gives the theorem on uniquely ergodic dynamical systems. Almost every ergodic theory text gives a proof of Birkhoff's theorem; refer, for instance, to Billingsley (i). This theorem was generalized, especially to groups with a polynomial growth, by Chhtard (I), Bewley (i) and Emerson (2). The frame of the proof of the saddle ergocic theorem given here was inspired by Misiurewicz (I), and we shall have the opportunity to speak about it further in Chapter 5.
4. ENTROPY OF ABSTRACT
4.1.
EQUIVALENCE
OF ABSTRACT
DYNAMICAL
We shall classify here the objects abstract
dynamical
of an equivalence 4.1.1.
system between
Definition.
and U are actions
(X,~,~,T)
DYNAMICAL
SYSTEMS
given in the definition by giving several
two dynamical
Two dynamical
SYSTEMS
systems.
systems
(X,~,~,T)
and
(Y,~,~,U),
of the same group G, are said to be isomorphic
exist an element X' of the o-algebra ~ whose measure of the a-algebra ~ whose measure mapping
= (i.e. measurable
and Y',
such that a) X' is invariant under
the action T of G
b) Y' is invariant under
the action U of G
for every g in G,
mapping ~oT g
bimeasurable
instead of the provisory
in 2.1.4
(where X' = X and Y' = Y).
Definition.
Suppose that
distance
notion of an iso-
one of a strict isomorphism
Let ~ be a Boole algebra,
and d the corresponding
i.e.
ugo~
From now on, we shall be concerned with this particular morphism,
between X'
~ commutes with the actions, =
where T if there
is i, an element Y'
is I, and a one-to-one
together with its inverse mapping)
c) the bimeasurable
4.1.2.
2.1.5 of an
different notions
P an additive
introduced
function on ~,
on ~ (d(A,B) = P(AAB)).
(~,d) is a complete metric
of a group G on ~ by one-to-one mappings
space,
and consider
that preserve
an action T
the distance
and
the Boolean operations. This triple 4.1.3.
(~,d,T)
Example.
is said to be a dynamical
If (X,~,~,T)
is a dynamical
Boole algebra.
system,
the quotient
algebra
of the o-algebra ~ by the o-ideal Afof all events whose probability
is
54
equal
to 0, together with the distance
metric
space,
d(A,B)
= ~(AAB),
and the given action of G immediately
is a complete
induces
an action on
this Boole algebra a/~ by automorphisms. 4.1.4.
Definition.
An isomorphism between
associated with two dynamical
systems
the dynamical
Boole algebras
is called a conjugacy of the two
systems. 4.1.5.
Example.
It is evident
that an isomorphism
in the sense of defi-
nition 4.1.1 induces a conjugacy of the systems. 4.1.6.
Remark.
In questions
often only interested A condition systems
sufficient
involving dynamical
in the associated
isomorphic
unit interval
these spaces are isomorphic
together with the Lebesgue measure;
hence
We shall suppose this condition to be fulfilled whenever interpretation encounter
of the results;
and all explicit
are actually Lebesgue
The metric
ones,
together with Borel probabilities.
~/~ has no atom,
[0,I]
Boole algebras.
is that the spaces are Lebesgue
to Polish spaces
algebra
dynamical
we are in fact
to deduce a conjugacy between two dynamical
from an isomorphism
quotient
systems,
systems
i.e.
If the to the their name.
we need a point that we shall
spaces.
space a/~ corresponding
to a Lebesgue
space is separable.
4.2. ENTROPY OF PARTITIONS
4.2.1.
Definition:
The function
entropy of probability
defined on the interval ] 0 ~
extended
to the whole segment
negative
function n is strictly concave;
the value 0 at the points Function n is subadditive, numbers whose
<
If all the coordinates vector
by
n(x)
in a continuous
= -x.Log(x) function.
in particular,
can be
This non-
it only takes
0 and I. i.e. for every pair
sum is less than or equal
n (x+y)
nical base,
F0 ~
vectors.
n (x)
+
(x,y) of positive
real
to I,
n (Y)
of a vector of R n are non-negative
and if the sum of all these coordinates
is said to be a probability
vector
in the cano-
is equal
of order n.
to i, this
55
It is always vector
possible
of greater
Consider
to consider
order by setting
~
l~i~n
are added
of the properies
its value
vectors
(indepen-
is the same when null
with a given order;
strong
subadditivity
i, j and k range
coordi-
and continuous
on
it is also symmetri-
the probability
abc, and consider
property
natural
numbers
vector
and suppose
p = (Pijk) whose
also the probability
vectors
that the
order
is the product
q of order a, r of order
as follows:
q = (pi)
where
r = (Pij)
where Pij =
~ Pijk
s = (Pik)
where Pik =
~ Pijk J
Then we get the inequality
of entropy.
from I to a, I to b and I to c.
ab and s of order ac, that are defined
Pi = j!kPijk
H(p) + H(q)
~ H(r) + H(s)
We have to state
i,
,k
inequality
n(Pijk )
+
is equivalent
~n(p i) l
=<
.[ n(Pij) l,j
to the following
i,~,kPijk.Log(Pij.Pik/Pijk.Pi) where
in R n is
of n, function H is concave vectors
Let a, b and c be three positive
This
to 0.
whose
concave.
Proposition:
Consider
vectors
to a vector.
cal and strictly
Proof.
coordinates
on the set of probability
because
all sets of probability
subscripts
vectors
as a probability
n(pi)
Function H is well defined dently of their order)
4.2.2.
all supplementary
on the set of probability
H(p) =
Because
vector
the function H on the set of all probability
definition
nates
a probability
the sum only ranges
not equal
over the triples
to 0; this results
the logarithm (Pij.Pik/Pi)
because is equal
~
the sum over all triples to i.
! n(Pik) i k
one:
0
(i,j,k)
from the concavity
+
for which Pijk is
and the monotonicity (i,j,k)
of the ratios
of
56
The assumption that no Pi is equal to 0 is in fact not a restriction. 4.2.3. Definition. A finite partition of the probability space (X, ,~) is a finite set {PI' .... Pn } of events such that the measure of the intersection of Pi and Pj has a measure 0 when i and j are different, and such that the measure of the union of all Pi is equal to I. 4.2.4. Definition. A partition Q is said to be finer than a partition P if every atom of Q is contained (modulo 0) in an atom of P. This property is denoted by
Q >> P.
4.2.5. Remark. The previous relation ">>" is a preorder relation on the set of all finite partitions of a given probability space; and the equivalence classes of the corresponding equivalence relation (P>>Q and Q>>P) can be identified with the partitions of the unity between non-null idempotents in the quotient algebra ~/~. We shall identify a partition with its equivalence class for the previous equivalence relation. 4.2.6. Proposition.
For the previous order relation, every two parti-
tions have a least upper bound. Proof. The partition R, which is defined by its atoms
Rij = Pi~Qi, is
clearly a least upper bound for the set (P,Q). 4.2.7. Definition. The entropy of the partition P = (Pi) is the entropy of the probability vector (~(Pi)); this entropy is denoted by H(P). 4.2.8. Proposition. Entropy is a strictly increasing function on the ordered set of all partitions of a given probability space. Proof. Suppose Q finer than P and use the subadditivity of n to derive H(Q)
=
~ ( ~ n(~(Qj))) l Qj Pi
=> ~ n(~(Pi)) l
Equality can only hold if, for every atom Pi of P, the number n(~(Pi)) is the sum of all n(~(Qj)) where Qj is contained in Pi" Strict concavity of the logarithm then implies that this last equality is only possible when one of the ~(Qj) is equal to ~(Pi); and entropy is then strictly increasing.
57 4.2.9. Proposition. Entropy is strongly subadditive on the set of all finite partitions, i.e. verifies, for every triple (P,Q,R) H(PfQVR)
+
H(P)
__< H (PVQ)
+
H (PVR)
Proof. This proof is apparent from the strong subadditivity property of H on the set of all probability vectors (proposition 4.2.2). 4.2.10. Definition.
Given a o-algebra
6 contained in ~ a n d
a finite par-
tition P of (X,~,~), the conditional expectations E(IPiI~) constitute a partition of the unity almost everywhere. The integral of the entropy of this random probability vector is called the conditional entropy of P with respect to D and is denoted by H(PI6): H(PI~)
=
4.2.11. Proposition.
[ I n(E(ip.I~)) l
If the o-algebra~
d~
is generated by a finite parti-
tion Q, we denote by H(PIQ) the conditional entropy H(PI~Q). The following equality then results from an immediate calculation: H(PIQ)
=
H(PVQ)
H(Q)
4.2.12. Remark. The entropy H(P) can be considered as the conditional entropy with respect to the trivial o-algebra {~,X}. 4.2.13. Proposition. of the o-algebra,
Conditional
entropy is a non-increasing
i.e., if ~ contains ~,
H(P]~) ~ H(P]~).
Proof. According to definition 4.2.9, one has H(P]~)
=
[ / n(E(ip. IZ)) d~ l
=
I ] E(n(E(Ip. I~))113) d~ l
And because n is a concave function H(PI~ )
_<
~ f n(E(E(!Pil~) 16)) d~ l=
function
58
The composition property of conditional expectations gives, for every subscript i, the equality
E(E(IPil~) I~)
=
E(iPiI~)
The summation in the right-hand side of the above inequality is then equal to the conditional entropy H(PI~). 4.2.13. Theorem. Let (6n) be an increasing sequence of o-algebras contained in ~, and let 6 be the c-algebra generated by all ~n; for every finite measurable partition P of (X,~,~) the convergence result holds: H(P[6)
=
Inf (H(P[~n))
=
lim H(P[~ n) n --> =o
A proof can be found, for instance, in the work of Parry (2).
We shall need in the following some control over the concavity of the entropy. The next lemma provides us with it. 4.2.14.~ Lemma. Let (~i) be a convex combination and, for every i, let (p~)~ be a probability vector. Denote by (pj) the mean value of all these probability vectors
pj
=
i ~ ~iPj
Then the linearity defect
H(p)
~iH(P l )
is bounded by the entropy
of the probability vector (=i). i Proof. Consider the probability vector ($ij = ~iPo )" The mean value of the entropies of the p l is equal to the conditional entropy of P with respect to ~, i.e. the difference between the entropy of Q and the entropy of ~; hence the result.
59
4.3. ENTROPY OF DYNAMICAL
4.3.1.
Definition:
Consider
SYSTEMS
mean entropy of a partition.
a dynamical
system
(X,~,~,T)
where
(X,~,~)
is a probability
space
and T an action of an amenable group G on this space by automorphisms. Given a finite partition P of the space and a finite part A of the group, pA stands for the least upper bound of the partitions ments of A, the symbol PoT g representing
PoT g for all ele-
the partition whose atoms are
the inverse images of the atoms of P by the automorphism The equality of the probability comes from the invariance these two partitions The following
associated
of the measure ~ under T; and the entropies
+
is the strong subadditivity H(p A~B)
=< H(P A)
The real function on F(G), A --> H(pA), strongly
subadditive
T g.
to pA and to pAa of
are then equal.
inequality
H(p AUB)
vectors
+
of entropy
H(P B)
is then positive,
and left invariant;
non-decreasing,
this function belongs
to the
cone Z. By definition
the mean entropy of the partition
P in the dynamical
tem (X,~,~,T)
is the mean value of the previous
invariant
the greatest
lower bound of the ratio
sys-
capacity,
i.e.
(I/IAI).H(P A) on the set of all
finite non-empty parts of G and also the limit along the ameaning
filter M
of the same ratio. We denote the mean entopy of the partition P by h(P). 4.3.2.
Definition:
entropy of dynamical
The entropy of a dynamical
entropies h(P) of all finite partitions 4.3.3.
Remark.
By its very definition,
if two dynamical
4.3.4.
Remark.
systems.
system is the least upper bound of the mean
systems
of the measure
entropy is a conjugacy
are conjugate,
their entropies
The mean entropy h(P) of a partition
but the entropy of a dynamical
Sinai theorem shows that the entropy of a dynamical partition.
We have first to define the notion of a generator.
invariant:
are equal.
is always finite,
system may be infinite.
the mean entropy of a generating
space.
The Kolmogorov-
system is equal to
60 4.3.5.
Definition.
A finite measurable
ting for the dynamical
system
given by d(A,B) = u(AAB), translates
partition P is said to be genera-
(X,~,~,T)
if, for the pseudo-distance
the Boole algebra generated by P and all its
PoT g is dense in ~.
When G is countable,
a dynamical
system with a generator
is a Lebesgue
space. 4.3.6.
Proposition.
partitions
(P,Q) is a distance; Proof.
The function defined on the set ~ of all finite
of the space =
(X,~,~) by
H(PIQ)
+
H(QIP)
we call it the entropic distance.
The symmetry property of ~ is apparent.
is the triangle inequality;
we use,
of the entropy and its monotonicity H(P[ R)
Thus,
=
H(PVR)
<
H (PVQVR)
<
H(PVQ)
=
H ( P IQ)
function ~ is a distance,
therefore,
Definition.
is a measurable
H(R) -
H(R) H(Q)
+
+
4.3.8.
Definition.
H(QVR)
-
H(R)
H(QIR)
providing,
An ordered partition
mapping
point
the strong subadditivity
to calculate
we are concerned here with the equivalence 4.3.7.
The only remarkable
of course,
that we specify that
classes defined in 4.2.5.
of the probability
space
(X,O,p)
from X to a finite set I.
Given two measurable
partitions
p and q of (X,~,p)
which are ordered by the same finite set I, we denote by d(p,q)
the
number (1/2). ~ ~(p-l(i)Aq-l(i)) ieI So defined,
d is a distance on the set of all partitions
ordered by I.
61 4.3.9.
Lemma.
Let p and q be two measurable
same set I with n elements; Then the following
~ (P,Q) Here ~ stands on
[0,I]
Function
d between
distance
of P
p and q:
non-decreasing
~(x) = sup(n(t),0~t~x)
~ is concave
partitions.
the entropic
2n2.~ ((2/n2) .d(p,q) )
for the non-negative
given by
and it is constant Proof.
holds between
distance
=<
ordered by the
call P and Q the associated
inequality
and Q and the previous
partitions
on the segment
continuous
real function
.
[0,~,
it agrees with n on [ 0 , I / ~
on [i/e,l].
We look for an upper bound of the difference
H(PFQ)
- H(Q),
which
can be written H(PVQ) This expression
- H(Q)
=
~(n(~(Pj~Qj))-n(~(Qj))) j
can be bounded,
n.~(d(p,q)/n)
+
thanks
+
to the concavity
[ n(~(Pi~Qj)) i#j of ~, by
n(n-l).~(d(p,q)/n(n-l))
and then by n2.~(2d(p,q)/n 2) The same is true for the difference the demonstration
of the previously
H(PVQ)
- H(P);
all of which
stated relation
between
leads to
the numbers
6
and d. 4.3.10.
Proposition.
P is a generator
for the dynamical
if and only if the set of all partitions is dense in ~ for the entropic
coarser
system
(X,&,~,T)
than some upper bound pA
distance.
Proof. I) Let Q = (Qi' lJi
Q~ of ~, that are unions
of elements
of pA,
!
every i, ~(QiAQi ) is less than ~. Define
a partition
QJ
=
Q", with n+l elements,
QJ \
j Qk'
if
l<j
by setting
such that,
for
82
Q~+I Partition
=
x \
Q" is coarser
the comparison
than pA. Thanks
of distances,
real number ~, another than 60,
~ Q'.' l~jSn j
the entropic
to the previous
it is possible
positive distance
real number between
to find,
lemma concerning
for every positive
~0 such that, when ~ is less
the given partition
Q and Q" is
less than e. 2) Let QI be an event
in ~ and Q2 its complementary
part.
Let Q be the
partition whose atoms are Q1 and Q2 and let Q' be a partition than pA whose entropic distance to Q is less than ~. The greatest tends
lower bound of all distances
coarser
from Q1 to the atoms of Q'
to 0 with ~.
4.3.11.
Remark.
The mean entropy h(P)
is clearly
a non-decreasing
function
of P. 4.3.12.
Proposition.
For every finite non-empty
part B of g, the mean
entropy h(P B) is the same as that of P. Proof.
The ratio
(IB.AI/IAI)
the set B.A becomes
tends
arbitrarily
to 1 along
invariant
the ameaning
with A (remark
filter
3.1.12);
and hence
the result. 4.3.13.
Proposition.
is a Lipschitz
With respect
to the entropic
function with a constant
distance,
mean entropy
1 on ~.
Proof. For every finite part A of G, the repeated H((PVQ) A) Hence
-
H(p A)
~
use of subadditivity
leads
AI .(H(PVQ)-H(P))
the two inequalities h(PVQ)
=< h(P) + H(QIP)
And then the inequality h(P)
- h(Q)
;
h(P~Q)
=< h(Q)
+ H(PIQ)
that we seek ~
h(PVQ)
- Inf(h(P),h(Q))
<
Sup (H(Q] P) ,H(P] Q) )
<
6 (p,Q)
to
63
4.3.14.
Kolmogorov-Sinal
the dynamical
system
theorem.
(X,~,~,G),
If the partition P is a generator
for
the entropy of the system is equal to
the mean entropy of P. Proof.
The mean entropy is a continuous
distance,
and the subpartitions
function on
for the entropic
of the pA are dense in
; the entropy of
the system is then the least upper bound of the mean entropies particular
of these
partitions.
On the other hand,
the entropy is non-decreasing
and we need only take
the least upper bound on the set of all pA, which all have a mean entropy equal
to h(P).
4.3.15. 2.1.8,
Example.
To resume the example
the partition
open sets
of Bernoulli
({$(e) = i},iel)
is a generator
To prove this result,
notice
topology and remember
the regularity
for every Bernoulli
that the cylinder
scheme.
Radon measures.
to the mean entropy of the
at the unit element.
The mutual of H(pA);
in
sets form a basis for the
of the invariant
The entropy of such a system is then equal partition
schemes described
at the unit element of G whose atoms are the closed-
independence
of all translates
and the entropy of the abstract
of P leads to the additivity dynamical
system is then the
entropy of the probability
vector on which the Bernoulli
4.3.16.
stated the theory of entropy as an invariant
Remark.
for dynamical termine were
Kolmogorov
systems.
Until
if the two Bernoulli
scheme is built.
this time there had been no argument schemes built on (1/2,1/2)
and
to de-
(1/3,1/3,1/3)
isomorphic.
We will recall
that the spectral
noulli
in the case of the group Z (cf. Billingsley
schemes
4.3.17.
Remark.
ral question Bernoulli Ornstein
properties
After this first success
arose:
schemes
Is entropy a complete
are the same for all Ber(I)).
of the entropy theory, invariant
a natu-
in the class of all
?
solved this problem
in the case of Z: Two Bernoulli
schemes
with the same entropy are isomorphic. The essential
arguments
Shannon-McMillan Several
of the proof are the mean ergodic
theorem,
and the Rokhlin
authors have been interested
the action of an arbitrary amenable space in order to develop We presented
the
lemma.
in generalizing
these results
to
group on a standard probability
an isomorphism
the mean ergodic
theorem,
theorem.
theorem in chapter
3 and shall
speak about
64
Rokhlin's devoted
lemma in chapter 8. The end of this chapter, however,
to the Shannon-McMillan
4.4. THE ALMOST SUBADDITIVE
Definition.
theorem.
ERGODIC THEOREM
AND THE SHANNON-McMILLAN
4.4.1.
THEOREM
Given a probability
able partition P of it, the information positive
space
(X,~,~)
relative
and a finite measur-
to P is the measurable
function I(P) defined by I(P)
=
~ (-Log(n (p)) .i peP P
This function is integrable the partition 4.4.2.
will be
and its expectation
Theorem.
of an infinite of this space.
Let (X,~,~,T)
be a dynamical
system where T is an action
amenable group G. Let P be a finite measurable
The mean information
(I/IAI).I(P A) converges
filter and the limit is an invariant The limit is also the greatest of the averages all invariant
is the entropy H(P) of
P.
in Ll-norm along the ameaning
function.
lower bound of the conditional
(I/IBI).E(I(pB)I~)
partition
expectations
with respect to the o-algebra
~ of
events.
The convergence
result is the Shannon-McMillan
theorem that was proved
for the group Z. The general proof of this result for an amenable group was first given by Kieffer
(I). The characterization
lower bound and the demonstration work
(8) and essentially
4.4.3.
Definition.
elements
decompositions
be a dynamical
system.
ergodic theorem.
A family
(fA) of
indexed by F(G), is said to be almost subadditive
a positive real number C, such that, of every indicator
IA = the difference
depend on the almost subadditive
Let (X,~,~,T)
of LI(x,~,~),
if there exists
of the greatest
that we give here follow on earlier
for all positive
of a finite part A, such as
I ~BIB Bel
(fA - Bel~~ BfB) is almost negative,
i.e. the measure
65
u((fA-
[ ~BfB )+) Bel
of its positive part is less than or equal to C. 4.4.4. Definition. A family (fA) of the previous type is said to be covariant if, for every finite part A of G and every element g in G, f(Ag) = f(A) oT g
4.4.5. Theorem. Let (X,&,~,T) be a dynamical system and P a partition of the measure space. Then, the family of elements of LI(x,~,~) defined by fA = I(pA) is almost subadditive in the sense of definition 4.4.3; and the constant C can be taken equal to I. Proof. Consider a decomposition with positive coefficients:
IA
=
~ ~BIB BeF
The difference l(p A) -
~ ~BI(P B) BeF
is constant on every atom of pA and can be written (~(B(a)) ~B) I(pA) -
~ ~B I(PB) BeF
=
~ A L°g(BeF aeP
).I a
~(a)
Here B(a) stands for the atom of pB that contains a. When a real number x is greater than or equal to I, it is an upper bound for its logarithm. We can then bound the positive part of the previous difference in the following way: (~(B(a)) xB) (l(pA) _
~ ~BI(pB))+ BeF
< =
~ (BeF aePA+
). i ~(a)
a
Here the subscript a varies only in the set P+A of all atoms of pA for which the logarithm is greater than or equal to 0.
66 Adding some positive terms, we get v((I(P A) -
~ ~BI(pB)) +) BeF
<
~ A ( ~ (v(B(a)) aeP BeF
~B
))
where a now varies in the whole set of all atoms of pA. It now remains to prove that the number H (~ (B(a)) XB) ) aeP A (BeF is bounded above by 1 for every positive decomposition of a finite part A. We shall use an induction argument to prove it. The result is obviously true when A reduces to a single point; but when this is not the case, distinguish a point g in A and denote by A' the complementary part of {g} in A. Putting together the atoms of pA contained in the same atom of pA', we can write the previous sum this way:
a
~pA( H (~(B(a))XB)) = ~ A,( ~ (~(B(a')XB).~A ~ ( ~ ((B(a))~B))]) BeF a'e~ B A' '(a)=a ' geB
The sum of all coefficients ~B of the parts B containing g is equal to I; according to H~Ider's inequality every bracket CA
~ ( H ((B(a)) ~B) )] '(a)=a' geB
is bounded above by
geB
( ~ (~(B(a))) ~B) A'(a)=a'
Given an atom a' of pA' P the union of the B(a) for all atoms a such that A'(a) = a' is exactly B'(a'), where B' is the part B A' of A'. The initial expression (~(B(a)) ~B)
a~pA(BeF is then bounded above by a,~pA,(e B'eF' g (u(B'(a'))
XB'
)
67 where the B' = A'~B verify
IA' = B'~F '~B'IB' This last expression is less than or equal to i, by induction hypothesis. The proof of the almost subadditivity of the family (I(pA)) of the information functions is now complete. 4.4.6. The almost subadditive ergodic theorem. Let (X,~,~,T) be a dynamical system where the group G is amenable and infinite. Let (fA) be an almost subadditive and covariant family of elements of LI(x,~,~)
indexed
by the finite parts of G, and let (fA) verify the greatest lower bound property: the greatest lower bound K of all (I/IB[).~(f B) is finite Then the average (I/IAl).f A converges in Ll-norm along the ameaning filter. The limit is an invariant function and it is the greatest lower bound of the conditional expectations (I/IAI).E(fAI ~) with respect to the o-algebra ~ of all invariant events. A brief remark before proving the theorem - the convergence result is still true (it even becomes trivial) when the group is finite, but examples can easily be found where the limit is no longer the greatest lower bound of the conditional expectations. Proof of the theorem. The proof shall be divided in three steps. a)
Let us first show that the averages asymptotically become less than
the conditional expectations; fix a finite part B and consider the difference A(A,B) A (A,B)
=
(I/IAl).f A
(i/I BI).E(fBI~)
As a function of A, the positive part of A(A,B) tends to 0 along the ameaning filter. Indeed, consider the usual positive decomposition 1A
=
(I/IBI).
~ IBg~A Bg~A@~
68 Apply tile almost subadditivity
inequality
to the corresponding
elements
of the family to derive ~((fA - (I/IBI)" Setting apart the elements the covariance
~ f~ ^A) +) Bg~A#~ ~ g ~
< =
C
g such that Bg is contained
in A and using
property of the family, we get the inequality
~(A(A,B) +)
~
C/IA[
+
II(I/[BI).E(fB[~)
+
(I/IA[).(I/]B I) •Bg~A#~,Bg,A [IfB Ag -I °TglIl
- (I/IAI).(I/IB[).B~cAfBoTg]II
As we shall see, the three terms of the right side tend to 0 along the ameaning
filter.
The first,
the ratio C/IAI,
amenable
group G is infinite;
as regards
tends to 0 because
the second,
the mean ergodic theorem to (i/IBl).f B demonstrates to 0 along M; and as for the third, because
the application
of
that this term tends
there are only finitely many
different parts of B, and because ~ is invariant, bound ~(B)
the
there exists a common
for the norms
II(fB Ag-l)oTgIll and as the third term is bounded by (I/IA I). (I/IB I).~(B).AB(A) it also tends to 0 along M. b)
Because L 1 is a complete
the functions
space,
in order to prove the convergence
(I/IAI).f A along M, we need only verify Cauchy's
of
criterium.
Let then E be a positive real number and B be a finite part of G such that
(I/IBI).~(fB)
According
is less than
K + e/6.
to the first step of the proof,
it is possible
part D of G and a positive real number ~ such that than e/6
whenever A belongs
to M(D,~).
to find a finite
~(&(A,B) +)
is less
69 We then get an upper bound for the integral (I/IBI).u(E(fBI~))
of (I/IBI).E(fBI~)
=
(I/IBI).U(fB)
<
K + e/6 (I/IAI).~(fA)
Hence a lower bound for the integral -e/6 The integral
<
~(A(A,B))
=
of A(A,B)
~(A(A,B) +) - ~(A(A,B)-)
of the negative part of A(A,B)
and the Ll-norm of A(A,B)
+ E/6
is then less than c/3
is less than e/2.
Then, when the two finite parts A' and A" belong to M(D,6), II(I/IA'I).fA,and Cauchy's c)
(I/IA"I).fA,,II 1
condition is fulfilled.
From the right invariance
the limit f is invariant.
of the M(D,~),
the conditional
a contraction
expectations
(I/IAI).E(fAI~).
along M
expectations
on
parts of G).
Proof of theorem 4.4.2.
To complete hypotheses Indeed,
this proof, we notice that the family of the almost subadditive
this family is almost subadditive
the greatest
(I(pA)) verifies
all
ergodic theorem.
It is covariant because ~ is invariant
according
to theorem 4.4.5.
under the action of G.
lower bound property holds since all these functions
are non-negative. 4.4.8.
is
(i/IAI).f A is
(I/IAI).E(fAI~)
lower bound of the same conditional
the set of all finite non-empty
Last,
expectations
operator of L I, the limit f of the averages
(and the greatest
deduce that
expectation with repect to a given o-algebra
also the limit of the conditional
4.4.7.
we immediately
Using the first step of the proof, we see that
f is less than or equal to all conditional Because
<
Remark.
probability
We have only defined finite mesurable
partitions
of a
space.
The notion of a countable measurable
partition
is also important.
70 Such a partition is said to have a finite entropy if the series of all numbers n(~(p))
converges.
By definition,
the sum of the series is then
the entropy of the countable partition. The Shannon-McMillan theorem holds even for countable partitions with a finite entropy and there is therefore no change in the proof. 4.4.9. Remark.
For particular
sequences of finite parts,
countable amenable groups, and for special
it is possible to state Breiman's theorem thus:
The convergence of the mean information relative to a given partition not only holds in norm but also almost everywhere. Breiman's work concerned the usual means on Z. For groups with a polynomial growth,
Ornstein and Weiss,
and Derriennic all obtained significant
results.
4.5. REFERENCES
As regards conjugacy and isomorphism of dynamical lingsley
systems, refer to Bil-
(I), who gives the proofs of several properties of entropy.
On this subject,
see also the works of Kolmogorov
the article by Rokhlin The martingale
(i), of Sinai
(i), and
(2).
theorem for the conditional entropy is proven by Parry (2).
The proof of the isomorphism of Bernoulli schemes with the same entropy was acccomplished by Ornstein
(1,2). Refer also to Smorodinsky
The first generalization of the Shannon and McMillan theorem case of amenable groups is found in Kieffer
(i).
(i) to the
(i); the current author
(8)
has given an alternate proof of this extension and the characterization of the limit function as a greatest lower bound of conditional expectations.
5. ENTROPY AS A FUNCTION AND THE VARIATIONAL
5.1. TOPOLOGICAL
5.1.1.
PRINCIPLE
ENTROPY
Definitions.
Let (X,T) be a topological
dynamical
system.
Given
a finite open cover 0 of X, N(O) stands for the logarithm of the minimal number of open sets in all subcovers
of O.
For every finite part A of G, denote by 0 A the open cover of X by all sets (Tg)-l(ag) geA where the a
are the elements of O. g The set function A --> N(O A) is weakly subadditive,
i.e. whenever
A
and B are disjoint parts of G, N(O AUB)
~
N(O A)
+
N(O B)
The mean topological
entropy h(0) of the cover 0 is the upper
along M of the ratio
(I/IAI).N(oA);
The topological bound h(T)
entropy of the dynamical
(eventually
limit
and h(0) is finite. system is then the least upper
equal to +~) of the mean topological
entropies
of
all finite open covers of X. 5.1.2.
Remark.
morphism dynamical
Topological
of topological
entropy is an invariant number for the iso-
dynamical
metric entropy of abstract a topological
5.1.3.
two isomorphic
systems have the same topological
Let us state a result analogous
parating
systems:
dynamical
open cover,
Definition.
The topological
entropy.
to the Kolmogorov-Sinal
dynamical
topological
systems:
theorem for the
the topological
system is the mean topological
entropy of
entropy of a se-
if one exists.
The variational
principle
entropy of a topological
the least upper bound of the entropies
gives the following dynamical
system
of all abstract
result:
(X,T) is
dynamical
72
systems
(X,~,~,T) built with invariant Radon p r o b a b i l i t y measures.
We know that the topological In the present chapter, principle
we shall state a more general variational
for the pressure of a continuous
5.2. PRESSURE OF A CONTINUOUS
5.2.1.
entropy is the pressure of the null function.
however,
Definitions.
function.
FUNCTION AND THE V A R I A T I O N A L PRINCIPLE
Let X be a compact Hausdorff space and T be an action
of an amenable group G by h o m e o m o r p h i s m s
on X.
Denote by W the set of all open symmetrical neighborhoods A of XxX;
of the diagonal
set W is a basis of the u n i f o r m filter.
Let 6 be an element of W. A subset E of X is said to be ~-separated if any two distinct elements of E are not neighbors
of order ~.
An open cover O is said to be of order ~ if the Cartesian square of every element a of 0 is contained in ~. Because X is compact,
there exist finite open covers of X that are of
order ~. Any m a p p i n g i from a ~-separated set E into a finite open cover O of order 8, such that the image of every point of E is one of the open sets of 0 to which this point belongs,
is one-to-one.
Such mappings do exist and a 6 - s e p a r a t e d set is then finite. Let f b e a real function on X w h i c h is bounded above. For every finite subset E of X, Z(f,E)
Z(f,E)
=
stands for the sum
[ exp(f(x)) xeE
For every finite open cover 0 of X, Z(f,O)
Z(f,O)
=
stands for the sum
[ Sup exp(f(x)) aeO xea
W h e n e v e r E is a 6-separated set and 0 an open cover of order 6, there exists a one-to-one mapping from E to O, leading to the inequality Z(f,E)
~
Z(f,O)
73 The least upper bound Pl(f,~) set of all f-separated
of all numbers Z(f,E), where E runs in the
subsets of X, is finite; and the greatest lower
bound P2(f,~) of all numbers Z(f,O), where 0 runs in the set of all finite open covers of order ~, is not equal to 0. Thus the following inequality clearly holds: Pl(f,~)
$
P2(f,f)
If A is a finite part of G and f an element of W, {A stands for the following element of W: fA
=
~ (TgxTg)-I(~) geA
Here T is the initial action of G on X and TxT the component-wise
action
of G on the square XxX. If A is a finite part of G, fA is the real function on X given by fA
=
[ f°Tg geA
Denote then by pl(f) the upper limit of the ratio along the ameaning fiter. If ~' is finer than ~, the f-separated
(I/IAI).Log(PI(fA,fA))
sets are also 6'-separated
and we
get the inequality Pl(f,6)
__< Pl(f,6')
That allows us to define the function Pl on the set of all real functions that are bounded above on X by pl(f)
=
Sup pl(f,~)
=
~eW
Similarly,
p2(f,f)
lim pl(f,~) W
is the upper limit of the ratio
along M. Here also, when ~' is finer than ~, we get p2(f,~)
<
p2(f,~ ')
and P2 is defined by p2(f)
=
Sup p2(f,f) feW
=
lim p2(f,~) W
(I/[AI).Log(P2(fA,6A))
74 For every 8 in W, pl(f,6) where
~
p2(f,8)
$
exp(s(f)).h(O)
0 is any open cover of order 6.
Then for every real function pl(f) where h(T)
~
p2(f)
~
is the topological
5.2.2.
Theorem.
system
(X,T) be finite
numbers
pl(f)
Proof.
Because
is bounded
entropy
of the dynamical
entropy
system
(X,T).
function
dynamical
on X. The two
are then equal.
f is continuous,
it is uniformly
exists,
continuous
for every positive
on the compact
real number
~, an
6 of W such that - f(y)J
<
c
whenever the pair (x,y) belongs to 8. Choose an element y of W such that the composition in 8. In order
to calculate
mal y-separated
the upper bound Pl(f,y),
subsets
square yoy
simply
of X; they are y-spanning,
consider
i.e.
is contained the maxi-
the y-neighbor-
of all points make an open cover of X.
Such a cover
is of order 8; hence P2(f,~)
Applying, whose
we get
of the topological
and let f be a continuous
therefore
If(x)
hoods
above,
exp(s(f)).h(T)
Let the topological
and p2(f)
space X. There element
f on X which
<
the inequality
exp(c ) .PI (f,Y)
for every finite part A of G, this result
composition
square yAOYA
P2(fA,SA) Take the logarithm limit along
<
<
in 8A, we get
exp([A[ .E).PI(fA,YA)
of both members,
the ameaning p2(f,8)
is contained
to 8 A and chosing YA'
filter pl(f,y)
divide
them by I A[ , and take the
to get the double +
c <
pl(f)
+ c
inequality
75 Take now the limit along W to derive
p2(f) Because p2(f)
<
pl(f)
this inequality
is true for every e, the two numbers
5.2.3.
Definition.
5.2.4.
Remark.
a continous
The pressure
5.2.5.
The equivalence
function
and
Function
t gives
Deduce
is a Lipschitz
tool in studying
the
t (we can no longer
call
of the increments
~
on X, p(f+h)
is
PI(fA,~A).Sup exp( [ hoTg(x)) xeX geA
function
as a function
p(f + hoT g - h)
entropy p(0)
on C(X):
=
is finite,
f on X and the pressure
for every h in C(X)
p(f)
functions
of the pressure:
from the finite rank inequality
function with constant
is infinite,
of
of the fil-
p(f) + t(h).
If the topological
it verifies,
If p(0)
control
to the sum
nite for every continuous wing properties
function
(f,h) of real continuous
the relation
Corollary.
of the pressure
property
3) on C(X) given by
Pl((f+h)A,~ A)
5.2.6.
f is the
lim (I/IA I ).Sup ~ hoTg(x) M xeX geA
less than or equal
Proof.
is a decisive
the sublinear
the following
For every pair
of the ameaning
this property
Consider
=
function
and p2(f).
on the space C(X).
it p as we did in chapter t(h)
pl(f)
of the two definitions
is independent
as a function
Theorem.
of the continuous
of the two numbers
ter M. On the contrary, pressure
on X.
pl(f)
are equal.
common value p(f)
and,
+
function
p(f)
is fi-
has the follo-
p is non-decreasing;
i with respect
it
to the uniform norm;
and every g in G,
p(f)
is also infinite
for every continuous
function
76 The control
of the increments
responsible
for all these properties.
5.2.7. where
Variational principle.
of the function p by the function t is
Let
(X,T) be a topological
dynamical
system
the space X is compact and where T is an action of an amenable
group
on X by homeoemorphisms. The pressure p(f)
of a continuous
bound of all sums
(h(~) + ~(f)),
dynamical
to the Riesz representation
The proof is divided
in two parts:
h(~) + ~(f)
5.2.8.
<
inequality
Theorem
gical dynamical
theorem.
theorem to be given in 5.2.8;
shall be a consequence
(Variational
principle
and the
of theorem 5.2.13.
- part one). Let
(X,T) be a topolo-
system and ~ be an invariant Radon probability measure. is measurable
given by the Riesz representation
theorem,
itive real number e, there exists a symmetrical the diagonal
with respect
open neighborhood
+ ~(f)
~
p2(f,~)
on the use of the quantity R(~,~) which gives a measurement
for the overlap ratio of an open cover with respect Let us study this number before proving Definition.
a one-to-one mapping j from P into ~ such that
every atom Pi of P is contained Definition.
in the corresponding
open set j(Pi ).
Given a probability measure ~ and a finite open cover
~, the overlap ratio R(~,~) entropies
to a given probabi-
the theorem.
A finite partition P is said to be adapted to the open
cover 6 if there exists
5.2.10.
of the
of a finite open cover of X. The proof of this theorem depends
essentially
5.2.9.
~ of
+ E
Proof. We shall look for 6 as the union of the Cartesian squares
lity.
to the
and for every pos-
such that
h(~,P)
elements
~ on
the first inequality
For every finite partition P of X which a-algebra
is the entropy of the abstract
p(f)
from the approximation
converse
where h(~)
system built with the invariant Radon probability measure
X, according
results
real function on X is the least upper
is the least upper bound of all conditional
H(p,PIQ) where P and Q are measurable
partitions
adapted
to 5.
77
5.2.11.
Proposition.
cover, which means (whose elements
The overlap ratio is a subadditive
function of the
that the overlap ratio R(~,~IV6 2) of the cover ~IV~2
are the non-empty
intersections
alna2,
with a I in 6 1 and
a 2 in 6 2 ) is less than or equal to the sum of the overlap ratios R(~,~ I) and R(~,~2). Proof.
Let P be a partition adapted
a one-to-one mapping j = (jl,J2) (considered Gluing
subpartition one,
of the covers
as the sets of their open sets).
together
Similarly,
to the cover 61V6 2. There then exists
from P into the product
the atoms of P which have the same image by Jl' we get a
P' which is
adapted
we get a subpartition
to 6
I" P" adapted
to 6 2 . Because j is one-to-
P is the upper bound of P' and P".
A similar
construction
can be made for the second partition Q.
Using the subadditivity respect
H(~,PIQ) Because
property
to a given a-algebra,
=< H(~,P'IQ)
the conditional
a-algebra,
expectation with
+ H(~,P"IQ)
expectation
is also a decreasing
=< H(~,P'IQ')
By taking the least upper bounds of the subadditivity
+ H(~,P"IQ")
in both members we complete
Proof of theorem 5.2.8. Let
Let P be a finite partition
(X,T) be a topological
Because ~ is outer regular,
with respect
to the
theorem. of the atoms of P
these open sets can be chosen in such a way
than a given positive
cover adapted
dynamical
on X.
of the difference between an atom and its neighborhood real number.
The partition P, with which we started, is adapted
is measurable
an open cover by taking open neighborhoods
that the measure is smaller
of X which
given by the Riesz representation
We construct
the proof
of R(~,6).
system and ~ be an invariant Radon probability measure a-algebra
function of the
we derive H(~,PIQ)
5.2.12.
of the conditional
we get
is by the construction
of the
to it and, for every finite part A of G, the partition pA
to the cover ~A"
Let 5' be an open cover of order 6A, let R be a partition
adapted to ~'
i.e. finer than the cover 6A; and
78 Consider a mapping 0 from 6' into 6 A that sends every open set of ~' to one of the elements of 6 A that contain this open set. If j is one of the admissible one-to-one mappings from R into 6', consider the subpartition Q of R obtained by the following construction: an atom of Q is the union of all the atoms of R
whose image by 0oj is
equal to a given element of ~A" Partition Q is adapted to 6 A and we then get H(~,P A)
<
H(~,Q) + H(u,PAIQ)
Hence H(~,P A) + ~(fA )
<
H(~,Q) + R(~,6 A) + u(fA )
<
H(~,R) + R(~,6 A) + ~(fA )
According to the Jensen inequality, than or equal to Log(Z(fA,6')) H(~,P A) + ~ (fA)
the sum (H(~,R) + ~(fA )) is less
and we next obtain
=< R(~,~ A) + Log(P2(fA,6A))
Using the subadditivity of R and taking the upper limit along M, we derive h(~,P)
+
~(f)
<
R(~,6)
+
p2(f,6)
If the least upper bound of the measures of all differences between the atoms Pi of P and their open neighborhoods is less than (I/n).Sup(~(Pi)), every partition which is adapted to the cover has exactly n elements. Thanks to lemma 4.3.9 the entropic distance ~(P,Q) between two partitions, both adapted to the cover 6, is bounded above by 2n2.~(2d(p,q)/n2). Hence an upper bound for the overlap ratio R(~,6)
<
2n2.~ (2 Sup (~ (Oi\Pi)) )
The previous relation is true whenever the right-hand side is less than (I/n).Inf(~(Pi)). The proof of the first part of the variational principle is thereby complete.
7g 5.2.13.
Theorem
(Variational
open neighborhood
~ of the diagonal,
probability measure
- part two).
there exists an invariant
>
greatest
Radon
pl(f,~)
Here we use what are essentially
the saddle ergodic
For every symmetrical
~ on X such that
h(~) + ~(f)
Proof.
principle
theorem.
lower bounds
the arguments
We have to be careful,
for the proof of
however,
because
some
are no longer reached.
Let us then fix a strictly positive
real number a.
For every finite part A of G it is possible
to find a ~A-separated
set
E such that
Z(fA,E)
~
PI(fA,~A).exp(-a)
Call a A the probability measure
on X whose
support
is E and which gives
to a point y of E the mass
OA({y})
=
(I/Z(fA,E)).exp(fA(y))
Denote by ~A the average of the translates
~A
=
(I/]AJ).
of °A by the elements
of A
~ Tg(o A) geA
Then let N be a filter with the following properties: first,
N is finer than the ameaning
second,
the ratio
filter M;
(I/IAJ).Log(PI(fA,~A))
converges
to pl(f,~)
along N; and, This
third,
the probability measure
~A has a weak limit along N.
limit ~ is an invariant probability measure,
as we have just seen
(proof of theorem 3.4.1). In order to state that this limit verifies consider
a finite measurable
to the boundaries
the anticipated
inequality,
partition P such that ~ gives a measure
0
of the atoms of P.
For every atom Pi of the partition P, the real function v --> ~(Pi ), which is defined on the set of all Radon probability measures continuous
at the point ~.
on X, is
80 Moreover,
we can construct P of order ~ so that every atom of pA contains
at most one point of E, which leads to H(OA,P A) + ~A(fA)
=
Log(Z(fA,E))
~
Log(PI(fA,~A))
- a
Entropy is a strongly subadditive set function; it is subadditive, according to remark 3.1.7 and we can then exploit the usual positive decomposition
IA Subadditivity
=
(I/IBI).Bg~#¢IBg~A
leads to the following inequality
(I/[AI)'H(eA ,PA)
~
(I/IAI)" I (I/IB])'H(eA ,PBg) geA
+
(I/IAI).Log(n).(IB.A
\ {g,BgcA} I)
where n is the number of the atoms of the partition P. As a function of the probability, (I/IAI).Log(PI(fA,6A))
entropy is concave; hence ~
(I/IBI).H(~A ,PB)
+
~A(f) + (I/IAl).a
+
(I/IAI).Log~n).(IB.A
Taking the limit along N, and using the continuity of
\ {g,BgcA}l)
~ --> H(~,P B) at
the point ~, we obtain pl(f,~)
<
(I/IBI).H(~,P B) + ~(f)
Now, taking the least upper bound on the set of all finite parts B, we get the following inequality: pl(f,~)
<
h(~,P) + ~(f)
and the second part of the variational 5.2.14.
Remark. We assumed,
<
h(~) + ~(f)
principle in proven.
in the previous proof,
the existence of
finite measurable partitions of the space X which were of order ~, such that the measures for ~ of the boundaries of the atoms were O.
81
To demonstrate clearly that such partitions do exist,
take an open cover
finer than the given one. Now we still must show that it is always possible to find a m e a s u r a b l e set between a compact set and a given open neighborhood of it, with a null boundary for the measure ~. Therefore,
consider a continuous
function ~ on X whose values
lie between
0 and i, which is equal to 1 on the compact and to 0 out of the open neighborhood.
There are only countably many real numbers
1 such that the ~-measure of the set
{xeX,~(x)
If u does not belong to this exceptional
= t}
t b e t w e e n 0 and
is not equal to 0.
set, the b o u n d a r y of the measur-
able part {~ > u} is contained in the null set {~ = u} and we achieve the result we sought.
5.2.15. Example.
Consider the symbolic system
right translations,
X = IG
where G acts by
and let Y be a closed invariant subset of X.
The p a r t i t i o n at the unit point of G is also a separating open cover. The pressure of a continuous
p(f)
=
lim sup M
function f is then equal to
(I/IAI).Log( [ A Sup exp(fA(~)) aeI SA=a
For a finite part A of G, p(fA,PA) (~(fA)+H(~,PA))
)
is the least upper bound of the sum
on the set of all invariant Radon p r o b a b i l i t y measures,
if P is the p a r t i t i o n at the unit point. Employing the saddle ergodic theorem 3.4.1 gives the variational principle in this particular
5.2.16.
situation.
Remark. When we encountered an open cover ~ as a neighborhood of
the diagonal of XxX, we implicitly supposed that an open cover 5' order ~ is finer than 6 in the usual
sense,
of
i.e. every element of 5' is
contained in an element of ~. This
is m e r e l y a set-theoretic result which can be settled by an easy
induction argument:
If
A, A I ..... A n
AxA of the product
are some parts of a set S such that the part SxS is contained in the union of the AixAi,
there exists an i such that A is contained in A.. i
82
5.3. ENTROPY AS A FUNCTION OF THE MEASURE
5.3.1.
Definition.
We are now interested
in some properties
of the func-
tion ~ --> h(~), which is defined on the compact convex set M(X,T) isting of all invariant Radon probability measures space X. We suppose that the topological dynamical 5.3.2.
system is finite,
Proposition.
entropy of the topological
thus the previous
If the topological
finite and if the group
is infinite,
cons-
on a compact Hausdorff
function is bounded.
entropy of the dynamical
system is
the entropy is an affine function of
the measure. Proof. We must simply state the following equality (~i,~2)
of invariant Radon probability measures
of strictly positive representation
for the a - a l g e b r a
+ ~2.h(~2,P)
=
given by the Riesz
h(p,P)
such a partition P is also measurable
from ~i and ~2 because respect
(=i,~2)
to I, and every
theorem
~l.h(~l,P) Indeed,
on X, every pair
real numbers with a sum equal
finite partition P that is measurable
for every pair
these measures
for the a-algebras
are absolutely
continuous
built with
to ~.
The least upper bound on the set of all these partitions limit and therefore Calculate
the equality
then the difference, (I/IAI).(H(~,PA)
This positive
difference
to lemma 4.2.14,
still holds
is a directed
for the limits.
for a finite part A of G,
_ = I . H ( ~ I , P A ) _ =2.H(~2,PA))
is bounded above by (i/IAl).Log(2),
and tends to 0 when the cardinal
according
number of A tends to
infinity. 5.3.3.
Definitions.
Given an element 6 of W, a partition P is said to be
strongly of order ~ if the Cartesian of P is contained
square of the closure of every atom
in a.
The mean entropy of order 6 of an invariant Radon probability measure (or, better,
of the dynamical
bound h(~,6)
of the mean entropies h(~,P)
which are strongly of order ~.
system built on ~)
is the greatest
of all measurable
lower
partitions
83
5.3.4. Lemma.
Given an invariant Radon p r o b a b i l i t y measure ~ and an ele-
m e n t ~ of W, h(~,6)
is the greatest
lower bound of the mean entropy h(~,P)
on the set of all Borel finite partitions which are strongly of order 6 and whose atoms have a null b o u n d a r y for ~.
Proof.
Entropy calculations use only m o d u l o 0 partitions.
thus shows that the Borel partitions w h o s e atoms have null boundaries
upper
Proposition.
that are strongly of order 6 and
for ~, are dense for the entropic dis-
tance in the set of all partitions
5.3.5.
Remark 5.2.14
strongly of order 6.
As a function on M(X,T), m e a n entropy of order 6 is
semi-continuous.
Proof. This function is less than or equal to the greatest lower bound of the m e a n entropies of all Borel partitions
strongly of order ~ whose
atoms have a null b o u n d a r y for ~. This
last function is a greatest lower bound of continuous functions at
the point ~, and thus an upper A c c o r d i n g to lemma 5.3.4, --> h(~,6)
semi-continuous
these two functions
is then upper semi-continuous
function at ~. agree at ~. The function
at ~ because it is equal to
an upper semi-continuous function that e v e r y w h e r e bounds it above.
5.3.~.
Proposition.
The mean entropy h(~)
is the least upper bound of
all m e a n entropies of order 6.
Proof. The following inequality clearly holds for every 6 in W: h(~,~)
__<
h(~)
If P is a Borel p a r t i t i o n such that
h(~,P)
>
h(~)
- e/2
and if 0 is an open cover of X consisting in open n e i g h b o r h o o d s atoms of P, such that the overlap ratio R(~,O)
of the
is less than ~/2, we have,
for every partition Q adapted to O,
h(~,Q)
>
h(~)
W h e n ~ is an open cover, w i t h the partitions
-
e
the mean entropy of order 6 can be calculated
strongly of order 6 and adapted to the cover.
84 According
to remark 5.2.16,
for every atom Pi of P, the Cartesian square
of the closure Pi is included
in 5; and the cover by the Pi is finer than
the cover 5. It is then possible adapted 5.3.7.
to construct
to the covering. Remark.
Hence
The set M(X,T)
a subpartition
strongly of order ~ and
the result. of all invariant
Radon probability measures
on X is a simplex and every element ~ in it can be uniquely decomposed a gravity
as
center =
f
v
de
(v)
M(X,T) where e M(X,T) i.e.
is a Radon probability measure that give all the mass
on the convex and compact
to the subset of extremal
to the set of ergodic measures;
points
set
of M(X,T),
this is the ergodic decomposition
of ~. Entropy
is an affine
ergodic
decomposition holds
h(~)
function on M(X,T);
=
we can then suppose
for the entropy,
h(v)
/
de
that the
i.e. that
(~)
M(X,T) Thanks
to the previous
is an affine upper decomposition
results
on the mean entropy of order 5, h(.,6)
semi-continuous
function and the following
ergodic
holds for it:
h(~,6)
=
f
h(v,~)
de
(v)
M(X,T) We then get, according f
h(~)
de
* M(X,T)
to the very definition of the inner integral
(v)
>
Sup f
~
The inner integral
h(v,~)
of h(v)
functions, h(~)
>
(v)
=
h(~)
is therefore bounded below by h(~).
If we knew that entropy were a greatest continuous
de
M(X,T)
lower bound of affine lower semi-
we would get the inequality f
h(v)
dO
(v)
M(X,T) showing
that the entropy function
and that the ergodic decomposition We could achieve variational
is measurable holds
for all probabilities
for it.
this result as a corollary of the following
principle.
converse
85 5.3.8.
Conjecture:
entropy
is finite,
converse variational
the entropy of a measure
bound of the difference upper
semi-continuous
5.3.9.
Remark.
for instance
principle.
(p2(f)
~ is the greatest
- ~(f)) when f ranges
functions
lower
in the set of all
on X.
When the entropy function
is upper
in the case of closed invariant
the converse variational
If the topological
semi-continuous,
subsets
of symbolic
principle holds with continuous
as spaces,
functions.
5.4. REFERENCES
Adler,
Konheim and M c A n d r e w
cal ,entropy. Its relation formation Bowen
by Dinaburg
(I) have introduced
to the metric
(i), Goodman
the concept of topologi-
entropy was done for one trans-
(i) and Goodwyn
(i).
(i) should also be cited for his contribution
The pressure
of a continuous
the frame of statistical
to this study.
function was first defined by Ruelle
mechanics
and by Walters
(i) in
(I) in the general
case.
For the second part of the variational
principle,
given here was inspired by Misiurewicz
(i); we adapted his proof to the
case of pavable an smenable
groups
group
Consult Jacobs
in our work
(I) as regards
the proof of the ergodic
is Z and when the compact
The ergodic decomposition Denker,
(3) to reach the general
proof for
(6).
entropy when the group of integration
the proof that we have
Grillenberger
principle when entropy
and Sigmund is upper
functions
for instance,
(I) proved
is a result
in Meyer
the converse
semi-continuous
of
space is metrizable.
of affine semi-continuous
theory that you can find,
decomposition
on M(X,T).
(I).
variational
6. STATISTICAL MECHANICS
6.1. LOCAL SPECIFICATIONS
6.1.1.
Definitions.
anics on a lattice,
AND GIBBS MEASURES
In order to study some problems consider
configurations
compact metrizable Moreover
and, at every point of S, the set I
is finite.
tions of the system is the product Endowed with the product
topology of discrete
X is then a
to select a closed subspace of
on the configurations
stucture
is generated by the basis
=
topologies,
space.
X to take some constraints
VA
The space X of all configura-
space I S .
in certain cases it is necessary
The single uniform
in statistical mech-
the following model of the situation.
The set S of all sites is countable of all possible
ON A LATTICE
corresponding
into account.
to the compact topology of X
(VA,AeF(S)) , where V A is the subset of X 2
{ (~l,~2)eX 2, V x e A ~l(X)=~2(x)}
We then denote by VA(~)
the neighborhood
closed and open set of all configurations
of order V A of ~, i.e.
the
~' that agree with ~ on A.
On the other hand,
given a finite part A of S, A(~)
all configurations
that agree with ~ at all points
is the finite set of of the complementary
part of A. To simplify matters we only shall consider models with two elements
in I
which we will denote by +I and -I. The configuration
space X = {-l,+l} S is then a compact
The normalized Haar measure measure
group.
of this group is the product probability
h = (I/2,1/2) S
Denote by a consider,
topological
x
the coordinate
function at x given by
Ox($)
= ~(x)
and
for every finite part A of S, the product °A of all Ox where
x runs in A.
87 The real
functions
OA, where A runs in the set of all finite parts
are the characters It is possible uous
function
of the compact
f(A)
6.1.2.
group X.
here to use the Fourier on X, its Fourier =
~ f(~).OA(E) X
Proposition.
of S,
transform.
transform
If f is a real
f is defined
contin-
by
dh(~)
If f is a continuous
function
on X, the partial
sums
fA' defined by fA
=
I f(B).o B BoA
uniformly
converge
to f when the finite part A simply
Moreover,
the mapping
uous when the uniform structure Proof.
A --> fA' from F(S) structure
on P(S) which gives
Let us calculate fA(~)
Here n stands
to C(X),
is induced
to P(S)
is uniformly
contin-
on F(S) by the single uniform
its compact
topology.
=
[ {f f(~).CB(~) BcA X
dh(~)}.OB($)
=
f ( ~ OB(~).OB(E).f(~)) X B~A
=
[ ( ~ OB(n).f(nA$)) X B~A
dh(~)
dh(n)
configuration
~A$ of ~ and $ in X. The no-
tation A for the product
is of course related
of X with
endowed with the symmetrical
the group P(X)
of A, is equal
to S.
fA($)
for the product
A quick computation
tends
shows
to the natural
isomorphism
difference.
that the sum of the functions
OB, over all parts
to
21 A1 . IVA(T ) Here the symbol T stands The number
fA(¢)
mass
VA(¢);
from 0 because
for h.
of X.
is then the mean value of the function
h on the neighborhood different
for the unit element the Haar measure
every non-empty
f for the measure
of this neighborhood
is
open set has a strictly positive
88 Thanks tends
to the uniform
continuity
of f, fA uniformly
tends
On the other hand,
for every pair
(fA)A '
=
(A,A')
of finite parts
a continuous
to gA is a positive contraction of C(X). Then, for every positive real number E, there exists S such that Suppose
llf - fA, l[ ~ E
whenever
now that the two parts
intersection Denoting
A' contains
function
g on X
a finite part A of
A.
A{ and A~ agree on A, i.e.
that their
with A is the same set B.
by A'7 the union A~OA
IIfA~
fA~ll
If B' is the set AINA~,
[IfAy~B, The intersection corresponding
-
~
we get
2~
the contraction
fA~B,II
<
A local
continuous
all finite parts
property
of the proposition
specification
functions
leads
to
2g
of B' and A is B; each part A ~ B '
A~ and the proof i
Definition.
negative
of S, we get
fAQA'
and, given a finite part A of S, mapping
6.1.3.
to f when A
to S.
on X which
of S, and which verifies
is then equal
is a family ~ = (~A) of nonis indexed by the set F(S) of the following
three conditions:
a)
the function ~@ is equal
b)
for every finite part A of S and every element $ of X,
~'e (~)~A($ ') c)
for every pair
(A,A')
to the
is complete.
to I
=
1
of finite parts
of S, where
A' contains
and every element ~ of X,
~A, (¢)
If we assume normalization
=
~A(~).
that the functions condition
~ ~A, (~') 'eA(~ )
~A are not identically
(b) results
from t h e coherence
equal
to 0, the
condition
(c).
A,
89 6.1.4.
Proposition.
ponding
Given a local
family H = (H A ) of operators
IIA(f) (¢) All operators operator
=
~A
¢'e (¢
H A are positive
~@ is the identity;
commutation
and, notably,
~
)~A(¢')'f(E')
contractions
of the normed
these operators
space C(X),
fulfill
and
the
=
to verify
Now suppose
condition
consider
to s(f).
(b) of a local
H A is a positive
This
contraction,
function
inequality
follows
f on X, from the
specification.
in A'; in order
to prove
the commutation
the identity
HAO~A, (f) (~) summation
=
~ ~A(¢') ( ~ ~A, (¢") .f($") ¢'eA(¢) ¢"eA' (~')
can be factorized
HAOH A ,(f)(¢)
In the other
]IA,
for every real continuous
that A is contained
This double
=
that the operator
that,
is less than or equal
normalization
HA,OH A
they are projectors.
In order
relations,
the corres-
on C(X) given by
moreover,
HAOIIA,
we have only to verify S(~A(f))
7, consider
relations
AcA'
Proof.
specification
in the following
=
¢ 's ~(¢ )~A($')(
=
I.~A, (f) (¢)
direction,
we get
]IA,O]IA(f)(¢)
=
"e~'
)
way:
(¢)~A '(¢")
"f($")
)
~ ~A,(¢')( ~ )~A(~").f(¢ '') ) 'eA'(¢) ¢"eA(¢' ¢"eA'(¢)
f(¢") • ~ .... ~A, (¢ ') -~A(¢") ¢ 'eA(¢ )
~A, (f) (¢) The commutation 6.1.5.
relations
Definition.
between
Given a local
is a Radon probability
measure
the operators specification
H A are then established. ~, a Gibbs measure
~ on X which verifies,
part A of S and every continuous
function
f on X,
for
for every finite
90
(~A(f)) This means
=
~ (f)
that the local specification
version of the conditional
~ provides
probabilities
events
only depending on the coordinates
6.1.6.
Proposition.
The set G(~)
convex and compact
of
out of finite parts of S.
consisting
given specification ~ is non-empty.
us with a continuous
for ~ given the o-algebras
of all Gibbs measures
For the weak* topology,
for a
it is a
subset of the set of all Radon probability measures
on X; and it is a simplex. Proof.
Consider
mappings
the family
of the compact
(~A' AeF(S)
set K(C(X),s)
) of continuous
linear one-to-one
of all Radon probability measures
on X. A Radon probability measure ~ verifies u = u o~ A if and only if it belongs to the image by ~A of the convex and compact set K(C(X),s). , The images ~A(K(C(X),s)) are compact sets with the non-empty section property due to the commutation The elements precisely
of the non-empty
Theorem.
sets are
is convex and compact for the weak*
topology
Proof.
to remark
For every element
Sup ~(f) ~eG(~)
=
f of C(X),
the following
For every Gibbs measure ~, for every f in C(X),
<
greatest
inequality
and for every
is true:
is then less than or equal to the
lower bound of all S(~A(f)).
On the other hand, sublinear A tends
equality holds:
S(KA(f))
The least upper bound of all ~(f)
Then,
1.4.6.
Inf S(HA(f)) AeF(S)
finite part A of S, the following
~(f)
the operators.
for ~.
and also a simplex according 6.1.7.
between
of all these compact
the Gibbs measures
It is quite clear that G(~)
relations
intersection
finite inter-
because
the function on C(X), given by ~(f) this greatest
lower bound is also a directed
to S (due to the commutation
the sublinear
functionals
on C(X)
= Inf S(~A(f)) , is
relations
between
limit when
the HA).
function ~ is the least upper bound of all linear that it bounds
above.
91
It must still be shown that a linear functional above by the function ~ is a Gibbs measure First,
notice that the sublinear
the sublinear
function
function ~. A linear
on C(X) which
s is greater
functional
by function ~ is a Radon probability measure
Therefore,
~ which is bounded above function f of C(X),
is equal to 0 whenever
A' contains
the function ~ is equal to 0 for all previous
probability
~.
than or equal to
on X.
For every finite part A of S and every continuous the expression ~A,(f - hA(f))
is bounded
for the specification
~ is equal to 0 for these increments
A.
increments.
and so belongs
The
to G(~).
The proof of the theorem is now complete.
6.2. COCYCLES AND QUASI-INVARIANT
6.2.1.
Definition.
MEASURES
Given a finite part A of S, the one-to-one mapping
from the space {-I,+I} S onto itself
~A
is defined by
IA(X) • A(~)(x) The mapping
=
(-I)
.~(x)
rA is then a homeomorphism
in reversing
the configuration
of X = {-I,+i} S, and it consists
at all points of A; we call z A a finite
modification. The set ~ of all finite modifications The group ~ is Abelian;
it is generated by the rx' where x runs in S,
and this group is naturally
isomorphic
of S endowed with the symmetrical 6.2.2.
Definition.
space C(X)
is a locally finite group.
A cocycle
that verifies,
to the group of all finite parts
difference.
is a mapping V from the group
for every pair
(A,B)
~
to the
of finite parts of S,
the identity
VAA B A cocycle
V A + VBOr A
is then completely
in S, provided responding
=
determined by the functions Vx, where x runs
that these functions
to the algebraic
satisfy the following relations,
relations
between the generators
for every element x of S,
Vx + V x ° ~ x
=
0
for every pair
V x + VyoTx
=
Vy + Vxo~y
(x,y),
cor-
of the group:
92
6.2.3. Definition. A Radon probability measure on X is said to be quasi° invariant with respect to a given cocycle V if, for every finite part A of S and every continuous function f on X, the following equality holds: ~ (fo~A)
=
~ (f.exp (VA))
By an easy induction argument the verification of the previous equality can be restricted to the parts that consist of a single point. It is possible to state directly that there always exist quasi-invariant measures for any given cocycle. We do not demonstrate this result since the next proposition shows the link between cocycles and positive local specifications on the one hand, and between quasi-invariant measures and Gibbs measures on the other. 6.2.4. Proposition. Let V be a continuous cocycle. For every finite part A of S, define a function ~A by ~A
=
i/( ~ exp(VB)) BoA
The family ~ = (~A) is a local specification; moreover, all functions ~A are everywhere strictly positive. Proof. First, it is clear that
~@ = 1
In order to verify the normalization relation, calculate the sum , ~ )~A(~ ') eA(~
=
~ I/( ~ e x p ( V B ( ' ) ) ~'e~($) BoA
Because the configurations $' of A(~) are exactly the ~B(E) where B runs in the set of all parts of A, and according to the cocycle relations, we get
'e~A(~ )~A (~ ')
BaA
(I/( ~ exp(VcoTB(~)))) CcA
BoA
(exp(VB(~))/( ~ exp(VcAB(~)))) CcA
and the sum is equal to i. In order to verify the coherence relations, in A':
suppose that A is contained
93
¢'e
(~ )~A, (¢,)
=
~ (I/(B~A,exp(VB(~')))) ~'e (~)
=
C~A(I/(B~A,exp(VBoTC(~))))
Apply the cocycle relations to get
¢'e ($
)~A,($,)
=
With the coherence relations, hold for ~.
~A,(~)/~A(~) all properties of a local specification
6.2.5. Proposition. Let ~ be a positive local specification, continuous functions ~A are everywhere strictly positive.
i.e. all
The function Log(~A,O~A/~A,) does not depend on the finite part A' whenever this part contains A. Denote this function by V A. The family of all V A is then a cocycle. Proof.
First verify that V A is well defined by calculating
:A,OTA($ )
=
,AO:A(: ). (
~ :A,(: ')) 'eA(:A(~ ))
~A'(~)
=
~A(~)'( ~ ' e ~A(~ )~A'(~'))
Because the sets A(~) and A(~A($)) are the same, the ratios ~A,O~A/~ A, and ~AOTA/~A are equal; hence the existence of the function V A. To verify the cocycle relations, A and B and calculate VAA B VA~ B
take a finite part A' containing both
=
Log(~A,O~AO~B/~A,)
=
Log(~A,OTA)
=
VA
+
+ Log(~A,O~BO~A/~A,OT A)
V B =T A
6.2.6. Proposition. The mappings ~ --> V(~) and V - - > ~(V), defined in the two previous propositions between positive local specifications and cocycles, are inverse of each other. There is no difficulty in calculating the proof of this outcome.
94 6.2.7.
Proposition.
the Gibbs measures
Given a pair
(V,~), where V = V(~)
and
for ~ are exactly the quasi-invariant
~ = ~(V),
measures
for
the cocycle V. Proof.
Let us first prove that a Gibbs measure
for V; therefore,
we have to show that,
for ~ is quasi-invariant
for every element
g in C(X) and
every point x of S,
~ (g.exp (Vx))
=
~ (go~x)
Because p is a Gibbs measure
~(f)
=
for ~, we get
~(~x(f))
=
~((f + fOTx.exp(Vx))/(l
+ exp(Vx)))
Then
p((f.exp(Vx))/(l Choosing
+ exp(Vx)))
=
f = g.(l + exp(Vx)) , we get the quasi-invariance
In order to prove that a quasi-invariant measure
p((fo~x.exp(Vx))/(l
measure ~ for V is a Gibbs
=
~(( ~ fOTB.exp(VB))/( BaA
local
Remark.
specification
B~A ((f°~B)/(CcA [ exp(Vc°~B))
=
~ BcA
=
p (f)
((f.exp(VB))/(
of the Gibbs measures
that a Gibbs measure
a mass which is proportional
are interesting:
probabilities;
and the
of the energy
(we shall
of some measures
under the
to prove a result of Ruelle's).
Given a cocycle V = (V A) on X, there exists
J = (J(A)) of real numbers
the
gives to the configurations
to the exponential
the quasi-invariance
group of modifications
Proposition.
~ exp(Vc))) CCA
involves given conditional
means
use in the next chapter action a wider
~ exp(Vc))) CcA
=
Two properties
quasi-invariance
6.2.9.
relation.
for ~, let us simply calculate ~(~A(f))
~(HAf))
6.2.8.
+ exp(Vx)))
a family
indexed by the finite parts of the set S, such
95 that the Fourier
transform
of the continuous
function
V
is given by X
Proof.
ix(A)
=
-2J(A)
if xeA
ix(A)
=
0
if
Using
Fourier
the cocycle
coefficients
x~A
relations
Vx(A)
V x + VxO~ x = 0, we discover
are equal
that the
to 0 when the finite part A does not
contain x. The other
cocycle
that contains
relations
ix(A)
the existence
Imagine
Fourier
=
Vy(A)
ix(A)
-
= Vy(A).
of the family J, the "-2" being related
that there exists coefficients
the increments
Notice
for a part A
to the
interpretation.
the Fourier with
Vy(A)
-
that is to say to following
, lead,
both x and y, to the equality
ix(A)
Hence
V x + Vyo~ x = Vy + Vxo~y
coefficients that J(@)
E representing
-2J(A)
a cocycle
You would
and that on X
then find the
that we have just built.
function,
of energy
the energy,
and then build
V A = Eo~ A - E of the energy.
is not determined,
tation of an energy the differences
a function
of E are J(A);
which
is coherent
with our interpre-
only defined up to an additive
are actually
An interaction
constant;
significant.
6.2.10.
Definition.
indexed
by the set of all finite non-empty
is a family J = (J(A)) parts
of real numbers
of S, such that, for
every point x of S, the mapping
A --> J(A).ixe A is the Fourier
transform
These functions 6.2.11.
Definition.
if J(A)
is equal
6.2.12.
Definition.
J(A)
is equal
of a continuous
then verify
the cocycle
An interaction
function
on X.
relations.
is said to satisfy
Ising's
condition
to 0 for every finite part A with an odd cardinal An interaction
to 0 whenever
number.
J is said to be a pair interaction
the cardinal
number
of A is greater
if
than or
96 equal to 3. Notice that a function J on the set of all finite parts of S which verifies this condition,
is an interaction if and only if, for every point x
of S, the series J({x,y}) where y runs in S converges. This results, 6.2.13.
for instance,
Definition.
from proposition 6.1.2.
An interaction J is said to be attractive
if the
function J is non-negative. A non-negative function J on the set of all finite parts of S is an interaction if and only if, for every point x of S, the series
(J(A),xeA)
of real numbers converges. To prove this equivalence,
apply proposition 6.1.2 to the unit configu-
ration.
6.3. PHASE TRANSITIONS
6.3.1. Remark.
Certainly,
one of the most important problems
tical mechanics on a lattice is the phase transition problem,
in statisi.e. deci-
ding whether there are many different Gibbs measures for a given specification. When J is an interaction,
~J is also an interaction for every positive
real number 8. For some types of interactions, the comparison theorem:
it is possible to state
if there is a phase transition for BJ, there is
also a phase transition for ~'J~ whenever B' is greater than B. On the other hand,
the Kirkwood-Salsburg
little further on, postulates
theorem,
that we shall prove a
that in some cases there is no phase
transition for B adequately minute. Therefore, verifies orems,
the specific phase transition problem for an interaction which
the hypotheses of the comparison and the Kirkwood-Salsburg
the-
is knowing if there are several Gibbs measures for ~ adequately
large. Classical
interactions,
like attractive pair interactions,
verify
all these hypotheses. The problem of phase transition is a sizable one and we only give here the general 6.3.2.
theorems used to simplify this particular work.
Definition.
of all continuous convergent.
Consider the vector subspace A(X) of C(X) consisting functions on X whose Fourier series is absolutely
97
Define
then a real
function
]IIflll This
function
=
is less
bound of the norms
Proof.
measure
function
lity ~(f.(l + th(Vx/2))
V consists
of functions
and if the least upper quasi-
measure
on X that verifies
f + f°~x = 0, the equa-
=
~(f.(2exp(Vx/2)/(exp(Vx/2)+exp(-Vx/2)))
=
~(f.2exp(Vx)/(l
=
~(2fo~x/(l
-
then an operator
(I/]AI).
K(1)
=
0
Because
the °A generate and continous
the iden-
on the elements
OA
~ oA.th(Vx/2) xeA
a dense
subspace
on A(X) whenever
l[Ith(Vx/2) lll is finite.
verifies
(A,x) where x is in A.
K on A(X) by its values
=
+ exp(Vx)))
~(V) necessarily
= 0 for every pair
K(o A)
defined
+ exp(Vxo~x)))
0
for the specification
+ th(Vx/2)))
+ exp(Vx)))
~(2f.exp(Vx)/(l
=
norm of the operator
for V. For every point x of S,
this measure:
~(f.(l + th(Vx/2))
Define
of the norms.
= 0 holds.
we can calculate
tity ~(OA.(l
of A(X),
a subalthat the
for V.
Let ~ be a quasi-invariant
A Gibbs measure
to the product
If the cocycle
are elements
is, moreover,
]II.lll, meaning
lllth(Vx/2)II I is less than I, there is a single
and every continuous Indeed,
for the norm
than or equal
theorem.
V x such that all th(Vx/2)
than the supremum norm on C(X),
for it. Space A(X)
and a Sanach algebra
Kirkwood-Salsburg
invariant
it is finer
is complete
norm of the product 6.3.3.
I If(A) l AeF(S)
lll.III is a norm;
and the space A(X) gebra of C(X)
on A(X) by
If the condition
K is strictly
of A(X),
the operator
K is well
the least upper bound of the norms of the theorem
less than i.
is fulfilled,
the
g8
The restriction verifies
of ~ to the subspace A(X)
the equation
The operator
(I + K*)(~)
is a linear functional
(I + K*) has then an inverse
6.3.4.
of X.
in the dual space of A(X); and
there is only one solution for the previous thus,
that
= h, where h is the Haar measure equation
in this space and,
in the set of measures. Corollary.
are elements
If the continuous
of A(X),
functions V x that define the cocycle
and if the least upper bound of their norms in this
space is less than 7/2,
there is no phase transition
for this cocycle.
For every element ~ of A(X), whose norm is less than i, we can define th(~)
as the sum of a power
series;
ded by tg(lll~Ill). The conclusion 6.3.5.
Example.
Let us describe
S = Z n and the attractive
The interaction
and we can calculate
depend on x because When n is equal
constant)
J is equal
It is clear that the functions A(X),
the Ising model.
pair interaction
(where J is a strictly positive the lattice.
and the norm lllth(~)IIl is then boun-
is clear.
the norm
the interaction
6.4.1.
cocycle belong to invariant. immediately
in the Ising model
confirms
in dimension i for
B.
llIth(Vx/2)lll = 2 th(2J),
that there is no phase transition when th(2J)
which
shows
is less than I, i.e. if J
(Log3)/4.
6.4. SUPERMODULAR
INTERACTIONS
Definitions.
Denote by ~ the order on the product
nl ~ n2
t. .~
in
lIIth(Vx/2)III ; this norm does not is translation
any v a l u e of the inverse temperature is 2, we get
= J
to 0 otherwise.
to i, lllth(Vx/2)lll = th(2J), which
When the dimension
the lattice
for the pairs of neighbors
of the corresponding
that there is no phase transition
is less than
Consider
J defined by J({x,y})
VxeS,nl(X)
set X = {-I,+I} S defined by
=< n2(x )
99
It is the product
order of all natural
orders
of the factors;
it is not
a total order. A continuous decreasing
function
for this order.
monotonic
continuous
functions
belong
6.4.2.
V x + VyOT x = Vy + VxOTy
function ~
the cone of all these
on X; to be specific,
Given two different
VxoTy - V x This
Let us call then M(X)
functions
if it is non-
all coordinate
to M(X).
Definition.
relation
f on X is said to be non-decreasing
xy
=
points
x and y of S, the cocycle
leads to the equality
VyOT x - Vy
has the following
=
~xy
Fourier
transform:
A
~xy(A)
=
4J(A) . IxeA. lyeA
When all functions
+xy.Ox.Oy
to be supermodular.
This
are non-negative,
is the same as saying
of S, the function V x is non-decreasing 6.4.3.
Example.
A typical
vided by an attractive
example
the interaction that,
for every point x
on the subset {o x = -i} of X.
of a supermodular
pair interaction.
is said
interaction
is pro-
The function Sxy then reduces
to 4J({x,y})OxOy. 6.4.4. modular
Proposition.
Let A be a finite part of S and # be a local
specification.
that n 1 ~ n 2. Denote by ~+n
Let n 1 and n 2 be two elements
the configuration,
which
tion of the element
$ of {-i,+i} A and of the element measures
on {-i,+i} A, verify
the Holley relation:
for every pair
#l(~iV$2).~2(~lh$2 )
Proof. product
~ --> ~A($+nl)
(EI,$2)
We have to prove,
~
of {-I,+I} S A such
is the result
The two probability
of elements
super-
of the concatena-
n of {-I,+I} S A
and ~ --> #A(~+n2),
defined
of {-i,+I} A
#i(~i)-~2(~2 )
for every pair
(~i,~2)
of elements
of the
space {-i,+I} A, the inequality
~A((~IV~2)+nl ).~A((¢IA~2)+n2 )
>
#A($2+n2 ).~A($1+nl )
1O0
This
inequality
is equivalent
to
~A( (~ I W 2)+n i)/~A(~ l+n 1 ) According equal
to proposition
6.2.6,
to exp(VB(~l+nl)) , where
is greater points
than $1(x).
at which ~2(x)
The right-hand
= +i and $1(x)
to be demonstrated
the value
This results
B is the subset is equal
is supermodular;
Theorem.
product
[~,~
to the product
or-
of X of all configurations
of B.
of B in an arbitrary
Then, 7' is greater is non-decreasing
~,~
and V B is also non-decreasing
Let 7' and 7" be two probability
and that verify
order.
on the subset
space {-I,+I} A which give a positive
this set,
to exp(VB((~iA$2)+n2 ) .
that, with respect
term in the sum is non-decreasing
6.4.5.
of A of all
Vbl + Vb2°~ b I + • • + VbnO~bl o • • oT bn_ 1
=
where bl, .... b n are the points interaction
is
= -I.
on the subset
-i at all points
side of the inequality
for the part of A on which ~2(x)
from the decomposition
VB
Every
the left-hand
In other words,
der, V B is non-decreasing
~ A(~ 2+n 2)/~A( (~ IAE 2)+n 2 )
B stands
side of the inequality
It then remains taking
>
the Holley
than or equal
for the product
mass
relation
to 7",
measures
because
the
on [ ~ , ~ . on the finite
to every element
of
(6.4.4).
i.e.
for every function
order, ~'(f)
is greater
f which
than or equal
to ~"(f). The proof of this result
can be found in Holley
6.4.6.
to the previous
Corollary.
Thanks
the least upper bound of ~A(f) is equal 6.4.7.
to 1 at every point
of ~
= HA(f)(T) , weakly
is a Gibbs measure +
(f)
=
results,
when f belongs
at the configuration
to M(X),
T, which
of S.
Proposition • The family
where ~ ( f )
is reached
(I).
(~+A ) of Radon probability converges
when A tends
for ~ and verifies,
Sup ~ (f) eG(~ )
measures
on X,
to S. The limit ~+
for every f in M(X),
101
Proof. When f belongs functions
to the cone M(X) of all continuous
on X, the following
~A(f) (I)
=
equality holds
non-decreasing
for every finite part A of S:
S(~A(f))
And this directed family of real numbers has a limit when A tends to S; this
limit is the least upper bound of all ~(f), where ~ is a Gibbs
measure
for ~.
This family of probability measures ments
(fl
then simply converges
for all ele-
f2 ) of C(X), where fl and f2 are in M(X).
The subspace,
consisting
of these differences,
is dense in C(X)
uniform norm and the family is equicontinuous.
for the
Hence the existence
of
the weak limit ~+ is proven. For every f in M(X), ~+(f) this
infimum is equal
= Inf(s(~A(f));
according
to theorem 6.1.7,
to the least upper bound of all ~(f), where ~ is
in G(~). + . In order to verify that p is actually a Gibbs measure for =, notice + + simply that ~A(~B(f)) = ~A(f) whenever A contains B, and take the limit when A tends to S. 6.4.8. weak
Remark.
The probability measure ~- is similarly defined as the
limit of the ~A(.)(---T). It gives the least upper bound of all Gibbs
measures 6.4.9.
for the non-increasing
Proposition.
specification
continuous
functions.
There is a phase transition
if and only if the two particular
for a supermodular + measures ~ and ~
local are
different. Proof.
The condition
is obviously
sufficient because ~
and ~
are Gibbs
measures. If they agree,
proposition
6.4.7 and remark 6.4.8 show that a Gibbs
measure
for ~ takes necessary
because
this subspace
Gibbs measure 6.4.10.
values on the subspace
is dense in the uniform norm,
for ~.
Proposition.
is a phase transition
Let ~ be a local
The condition
is also necessary,
supermodular
specification.
There
for ~ if and only if there exists a point x in S
such that u+(~x ) is strictly greater Proof.
generated by M(X); there is only one
is obviously
than ~-(~x ).
sufficient.
we have only to demonstrate
In order to prove that it the following
inequality:
102
(u+ - ~-)(pA ) where PA is the positive
PA In this case,
=
<
(1/2). ~ (~+ - u-)(~x ) xeA
function
defined
~ ((°x+l)/2) xeA
the two Gibbs measures
~+ and ~- agree on all OA; because
the set consisting
of all PA generates
measures
are equal
and there is no phase
previous
proposition.
Let us then demonstrate
By an easy induction inequality,
<
subspace
transition
of C(X),
according
relation
~ (u+ - u-)(px)
argument
we can deduce
this inequality
which holds when the two finite parts
is equivalent
the two
to the
xeA
(~+ - ~-) (PAUB) This relation
a dense
the subadditive
(~+ - ~-)(pA )
following disjoint:
by
=<
from the A and B are
(~+ - ~-) (pA) + (~+ - ~-) (pB)
to
+ (PAUB - ~A - PB + i) This which uous
last inequality is equal function
6.4.11.
describing
holds because
to the product
interactions
however,
belongs
to M(X)
contin).
we shall turn our full attention
which we have just anticipated
we can say that the difference
on the point x when the interaction
the action of a geometrical
to
while
of one another
(~+ - ~-)(o x)
is invariant
under
group.
when the interaction
has the Ising property,
by the involution
Then,
the nullity
of the non-negative
rizes
the absence
of phase
articles
(PAUB - PA -PB + I),
the Ising model.
does not depend
the images
- OA - PB + i)
- pB ), is a decreasing
-(l-PA)(l-PB)
In the next chapter,
invariant
At this point,
Moreover,
the function
(I - pA).(l
on X ( the function
Remark.
translation
=< ~-(~AUB
number
transition.
that deal with this number
• which reverses
all spins.
~+(o x) completely
The numerous
call
~+ and ~- are
mathematical
it spontaneous
charactephysics
magnetization.
103
6.4.12.
Proposition.
local supermodular compact
The Gibbs measures
~
and ~ , relative
specification 7, are extremal
to a given
points of the convex
set G(7) of all Gibbs measures.
Proof. Measure ~
+
agrees on the cone M(X) with the function 7, which is
the least upper bound of all Gibbs measures sible to decompose
for ~. It is therefore
this measure because this cone generates
impos-
a dense sub-
space of C(X). A similar result holds for 6.4.13.
Remark.
not sufficient
The previous to postulate
hold for attractive 6.4.14. nite;
results
about supermodular
a comparison
theorem.
are
theorems
interactions.
Proposition.
Let X A be the product
set {-I,+I} A where A is fi-
and let E 1 and E 2 be two real functions
forms verify,
interactions
The comparison
on X A whose Fourier
for every part B of A (except perhaps
trans-
the empty part),
the
relations
Then,
for every character,
are proportional
the probability
to the exponentials
~l(OB)
measures ~I and ~2 on X A which
of these functions
__< ~2(OB )
The proof of this result can be found in Preston 6.4.15. that,
Proposition.
~
(I).
Let Jl and J2 be two attractive
for every finite part A of S, the following
Jl (A)
proposition Proof.
6.4.16.
such
J2 (A)
and ~2,A(.)(T),
probabilities
verify the order relation described
on XA,
in the
6.4.14.
A quick computation
the outside configuration proposition
interactions
inequality holds:
Then, for every finite part A of S, the conditional 71,A(.)(T)
are related by:
shows that these conditional understood
energies,
as T, satisfy the hypothesis
with of
6.4.14.
Corollary.
Let J be an attractive
Ising pair interaction.
Then,
104
if there is a phase transition
for some BJ, where B is a positive
real
number,
there is also a phase transition
greater
than B.
Indeed,
the interactions ~J and B'J verify the hypotheses of proposition + and the inequality holds at the finite rank between the ~A(Ox)
6.4.15;
for the interactions
for every B'J, where 6' is
BJ and B'J. The limit inequality
therefore
implies
the corollary. 6.4.17.
Remark.
If J is an Ising attractive
all values of the positive real parameter transition
is a non-empty
may be finite or infinite.
interval
pair interaction,
the set of
B for which there is no phase
[0,Bcl . The non-null
limit value
Bc it may or may not belong to
If it is finite,
the set. 6.4.18.
Remark.
For a given Ising attractive
pair interaction,
the phase
transition problem is deciding whether B c is finite or not, and, eventually,
calculating
this value when it is finite.
For the Ising model
in dimension 2 for instance,
devised by Peierls,
it is possible
the spontaneous magnetization
thanks to an argument
to show that for B adequately
large
is not 0.
6.5. REFERENCES
Ruelle
(I) provides
the general
ton (I), in particular, supermodular
for this chapter while Pres-
for the study of attractive
is a notion of our own and is used in our work (5).
Holley first stated his theorem
(I) by using Markov processes.
proofs were given in 1974 during the conferences
dic Theory in Rennes by Brunel
(i) and by Hansel
Phase transition
models
physics
and
potentials.
Quasi-invariance combinatorial
foundation
gives the basis
in particular
to be given specific
Direct on Ergo-
(I).
is too broad an area of mathematical
citations here.
7. DYNAMICAL SYSTEMS IN STATISTICAL MECHANICS
7.1. INVARIANT LOCAL SPECIFICATIONS
7.1.1.
Definition.
Let X be the compact m e t r i z a b l e space I S . When the
countable set S is a group G, we can consider the action of G on the space X by translations models
(described in 2.1.2)
and then study invariant
in statistical mechanics.
A local specification ~ is said to be invariant if this family verifies, for every finite part A of G and every element g of G, the coherence relation ~AoT g = 7.1.2.
~Ag"
Definition.
Let ~ be a local s p e c i f i c a t i o n on X. A Gibbs measure
for ~ is said to be invariant
if it is invariant under the action of G
by translations on X.
7.1.3.
Remark.
invariant, However,
If a local s p e c i f i c a t i o n has a Gibbs measure which is
then this s p e c i f i c a t i o n is invariant.
it is not generally true that all Gibbs measures
for an invar-
iant s p e c i f i c a t i o n are invariant.
7.1.4.
Proposition.
If the group G is amenable,
Gibbs m e a s u r e s for every invariant
Proof.
there exist invariant
local specification.
Group G clearly acts on the convex and compact set G(~) by affine
continuous
one-to-one mappings.
Since G is amenable,
there exists at
least one invariant point in this set and the invariant points are precisely the invariant Gibbs measures for
7.1.5. Definition. tion ~(V) verifies 7.1.6.
A cocycle is said to be invariant if the specifica-
is invariant.
This is equivalent to saying that this cocycle
all coherence relations VAg = VAoTg.
Remark.
The group T of translations normalizes
the group ~ of all
106
finite modifications;
and,
an invariant
to ¢ of a mapping V from the group ~ verifying
=
V a + Vboa -I
and whose restriction Definition.
ponding
(generated by T and ¢) into C(X)
the following relations:
Va. b
7.1.7.
cocycle V is the restriction
cocycle
to T is 0.
An interaction
is invariant;
is said to be invariant
this property is equivalent
if the corresto the invar-
iance of J under the right translations VgeG,VAeF(G),
7.2.
J(Ag)
INVARIANT GIBBS MEASURES
7.2.1 Remark.
= J(A)
AND EQUILIBRIUM MEASURES
The subset I(~) of all invariant Gibbs measures
for ~ is
defined in G(~) by the invariance property. We want now to define it as a subset of the convex and compact
set M(X,T)
(which can also be denoted by K(C(X),s,T)
2) consis-
according
ting of all invariant Radon probability measures The variational tion, gives 7.2.2.
principle,
when applied
such a characterization;
Theorem.
e = (ex,xeG)
Given an invariant
of continuous
to chapter
on X.
to a particular
continuous
func-
now we have to build such a function. interaction J, there exists a family
functions
on X with the following
two prop-
erties: first,
family e is coherent under the action of G on X by transla-
tions,
i.e. for every pair
and,
second,
ex(A)
Proof.
the Fourier
=
(x,g) of elements
transform of the function e
X
is
IxeA.(J(A)/IA])
Because J is invariant under
it exists,
of G, exg = exoTg;
is certainly coherent.
the translations,
such a family,
if
107
Now we have only to show that the real function on F(G) given by
A --> Ixe A.(J(A)/IAI) is actually the Fourier The Fourier The next
lemma
on X whose
transform of a continuous
function.
transform of Ox.Vx is -2J(A {x}).Ix~ A. (7.2.3)
Fourier
shows that there exists a probability measure
transform
is
A
v(A)
=
v(o A)
=
I/(I+[A I)
The function e x = -(I/2).ax.(V~(ax. Vx)) rier transform is the one given above. 7.2.3.
Lemma.
Since the Fourier
are looking for depends corresponding geable,
Fourier
transform is given by v(o A) = I/(I+IAI).
transform of the probability measure only on the cardinal
to the character,
to the permutations
this probability measure must be exchan-
is in fact necessary
according [0,~
I / 0 v k ( O A ) de(k)
as an average of power measures
(~,I-~) G. We have to find a probability such that,
=
(this
to the De Finetti theorem).
the probability measure
measure e on the segment
of X cor-
of G.
We then look for this probability Call v~
that we
number of the finite part
i.e. invariant under the action of all homeomorphisms
responding
i.e.
and its Fou-
There exists a Radon probability measure v on the compact
group X = {-i,+I} G whose Proof.
is then continuous
for every finite part of G,
I/(I+[A I)
for every natural number n, i /
(2).-1)n de(~)
=
I/(l+n)
0 The probability measure solves
the equation.
7.2.4.
Definition.
called
local energies.
2.1(~>i/2).m,
The functions
e
where m is the Lebesgue measure,
that we built
in theorem 7.2.2 are
X
transforms, energies
Formally
speaking,
i.e. as regards
Fourier
it can be assumed that the energy is the sum of the local
e x over all sites.
108
7.2.5. Theorem. Let ~ be an invariant local s p e c i f i c a t i o n on the space X = {-I,+I} G and let the group G be amenable. The e q u i l i b r i u m m e a s u r e s for a continuous function f on X are the measures for which p(f) = h(~) + ~(f). For the continuous
functions
ex, the e q u i l i b r i u m measures
are the same for
all sites of G, and are p r e c i s e l y the invariant Gibbs measures
Proof.
Because the e X constitute a coherent family,
measures
for ~.
the e q u i l i b r i u m
are the same for all of them.
Such e q u i l i b r i u m measures do exist,
according to the variational prin-
ciple, because the entropy is, in this case,
an upper semi-continuous
f u n c t i o n of the measure. We have proven elsewhere
(5) that the e q u i l i b r i u m m e a s u r e s
are the
invariant Gibbs measures.
7.3. M I X I N G PROPERTIES
7.3.1 Definition.
The asymptotic o - a l g e b r a of the compact space X = I S is
the greatest lower bound of all Borel o-algebras consisting of the events only depending on the coordinates out of a given finite part of S. This o-algebra also consists of all events that are invariant under
the
action of the group ~ of all finite m o d i f i c a t i o n s of X.
7.3.2. Theorem.
Let ~ be a local specification on X = I S. The extremal
points of the convex and compact set G(~)
are the Gibbs measures whose
r e s t r i c t i o n to the asymptotic O-algebra is trivial.
Proof. The convex and compact set G(~) has the p r o p e r t y described in 1.4.3 and its extremal points are the elements of it for which the subspace Here,
NCR. I is dense for the L l - n o r m in C(X).
N is the subspace generated by all differences
(f - [A(f)), where
f runs in C(X) and A runs in F(G). For every element ~ of G(~), of LI(X,~,~)
the operators ~A are positive
because p o s i t i v i t y extends from C(X)
For every continuous
to the whole set L I.
function f, we then get the inequality
;~(IzA(f) 1 )
=<
Z(ZA(Ifl))
=
~(Ifl)
contractions
109
and the extension of the operator On the other hand,
tends to the constant also holds result
the elements
7.3.3.
Remark.
and compact 7.3.4.
As regards
This is equivalent
to the triviality
the quasi-invariance
aspect,
of the extremal
of
the above result is points
set of invariant Radon probability measures
the abstract
this
for the given measure.
Let ~ be an invariant
an invariant Gibbs measure Then,
C(X),
of L I.
to the characterization
Proposition.
uniformly
of L I that are invariant under all operators
functions.
the asymptotic ~-algebra
quite similar
function ~A(f)
Because of the density of the subspace
is true for all functions
~A are the constant
in [email protected],
function p(f) when A tends to S; and the convergence
in Ll-norm.
In particular,
to L I is a contraction.
for every element
local
specification
for ~ which is also extremal
dynamical
system
(X,~,~,T)
of the convex
as ergodic ones. and ~ be
in G(~).
has the following
strong
mixing property: For every pair
(a,b)
of events
and every positive real number e,
there exists a finite part F of G such that the inequality l~(a~Tg(b)) holds whenever
Proof.
- p(a).~(b)I
<
E
g does not belong to F.
The partition at the unit element
of G is a generator;
we can find, for every positive real number ~, two measurable X, a' and b', depending the coordinates
therefore parts of
only on a finite number of coordinates,
in A for a' and on the coordinates
in B for b',
e.g. on such
that p(aAa')
<
e/5
and
~(bAb')
<
e/5
The following upper bound is easily calculated: Ip(a~Tg(b))
- ~(a).~(b)l
<
l~(a'~Tg(b'))
We then need only state the mixing property
for two events only depending
on a finite number of coordinates. If ~(b')
- ~(a').~(b') I + 4~/5
is equal to 0, there is nothing to prove.
110
Otherwise, the problem consists in proving that the limit, when g runs out of every finite part, of the ratio ~(a'nTg(b'))/~(b') is equal to the measure ~(a'). This results from the martingale theorem: The conditional expectation with respect to the o-algebra of all events that only depend on the coordinates out of a finite part, converges almost everywhere and in Ll-norm to the conditional expectation with respect to the limit o-algebra. The limit o-algebra is the asymptotic one; because this o-algebra is trivial, the limit of the above ratio is actually ~(a'). 7.3.5. Definition. Let (X,~,~,T) be an abstract dynamical system, and let the acting group G be amenable. The weak mixing property is said to hold for this system if, for every pair (a,b) of events, the average value (I/IAI).( ~ I~(anTg(b)) - ~(a).~(b)I) geA tends to 0 along the ameaning filter. 7.3.6. Remark. Definition 7.3.5 is clearly equivalent to the analogous statement on all pairs of elements in L2(X,~,~). The triangle inequality then shows that, for every pair (fl,f2) of elements in L2(X,~,u) , ~(fl.(I/IAl).( [ f2oT g )) - ~(fl).~(f2) geA tends to 0 along the ameaning filter. An invariant element f2 of L 2 thus verifies ~(fl.f2) = ~(fl).~(f 2) for every element fl of L2(X,Q,~); and f2 agrees almost everywhere with the constant function ~(f2 ). The only invariant elements in L 2 are then the constant functions. Thus, the weak mixing implies the ergodicity of the system. 7.3.7. Proposition.
Strong mixing, defined in 7.3.4, implies weak mixing.
Proof: elementary calculations. 7.3.8. Proposition. Let (X,R,~,T) be a dynamical system, let the acting
111 group G be amenable and suppose that the weak mixing property holds for the dynamical system. Then, for every ergodic dynamical system (Y,~,~,U), the product dynamical system (where the probability space is the product space of the two probability spaces and where the action is the componentwise action) is an ergodic system. Proof. Because of the density of the vector subspace generated by the functions f(x)g(y), we have only to prove, for every pair (fl,f2) of elements in L2(X,Q,~) that the quantity
and every pair (f3,f4) of elements in L2(Y,~,~),
(~v)((fl®f3).(I/IAl).(
~ (f2oTg)®(f4oug))) geA
tends, along the ameaning filter to ~(fl).~(f3).~(f2).v(f4). We now have to show that the following average of differences (I/IA[).( ~ (~(fl.f2oTg).v(f3.f4oug) geA
- ~(fl)~(f3)~(f2)~(f4)))
tends to 0 along the ameaning filter. Every difference in the sum can be written (~(fl.f2oTg)
- ~(fl)~(f2)).~(f3.f4oug))
+ ~(fl).~(f2).(~(f3.f4oug)
- ~ (f3) .~ (f4))
Hence the following upper bound for the absolute value of averages of differences: If312-[f412.(I/IAl) • [ l~(fl.f2=Tg) geA
- ~(fl).~(f2) I
+ l~(fl )I "l~(f2)I "I v(fB'(I/IAl )'g~A f4°Ug)e
- ~ (f3) "v(f4)[
The first term tends to 0 along M due to the weak mixing property of the dynamical system (X,~,~,T). The ergodicity of the second factor implies that the second term also tends to 0 along the ameaning filter. 7.3.9. Remark. When ~ is an invariant supermodular specification, two Gibbs measures ~+ and ~- have the strong mixing property.
the
112
Indeed,
they are extremal
Notably,
for attractive
ones in G(~) and they are also invariant.
invariant pair interactions
perty there is no phase transition
The mixing property enables us to calculate model
in dimension 2 in particular)
7.3.10.
Remark.
interaction
properties
of the dynamical
systems cor-
supermodular
than the strong mixing property.
We are interested is completely
of non-negative
numbers J(Ix-yl)
7.4.2.
If the sequence
Theorem.
in invariant
characterized
attractive
Ising pair
by the converging
series
= J({x,y}). (J(n),neN,n~2)
is sufficiently
in the sense that the series n.J(n)
phase transition condition
as
on {-I,+i} Z.
Such an interaction
decreasing,
magnetization
A THEOREM OF RUELLE'S
Definition.
interactions
(the Ising
of the correlation ~+(~x.~xg).
~÷ and ~- for an invariant
are in fact richer
7.4. EXAMPLE:
7.4.1.
The stochastic
to the measures
in some models
the spontaneous
the limit, when g tends to infinity,
responding
with the Ising pro-
if and only if ~+(o x) is equal to 0.
converges,
for J (neither for 8J, with a positive
quickly
there is no
B because the
is homogeneous).
The original
proof was done by Ruelle.
The present
the proof of this result as an illustration
section is devoted to
of the quasi-invariance
and
mixing properties. 7.4.3. Remark. equal
Consider
to E(A) = J(A).
X which consists
the "energy function" with a Fourier transform
Consider
in reversing
the homeomorphism the configurations
that are strictly less than x (for the natural The difference rx(~),
r x of the compact
order).
of energy between a configuration
$ and the configuration
is then given by its Fourier transform (Eor x - E)(A)
=
-2J(A)
=
0
if
l{yeA,y<x}l
otherwise
space
at all points y of Z
is odd
113
In the case of a pair interaction,
we get,
for y < x ~ z,
A
(Eor x - E)({y,z}) But Ruelle's increment
condition
=
indicates
then actually
-2 J(ly-zl) that this series
exists as a continuous
is convergent;
this
function on X and we are
lead to the examination
of the quasi-invariance
under
the action of a
group of homeomorphisms
of X which is wider than the group ~ of all
finite modifications. 7.4.4. r
X
Definition.
Let us give now a formal definition
of the reversal
on the left of x: rx(~)(y)
Similarly,
=
the reversal rx(~)(z)'
=
All these homeomorphisms
((_I)Iy <x ).~(y)
r X' on the right of x is defined by ((-i) Ix
of X are involutions
and commute with each
other. !
!
Moreover, the reversal rx at x is equal to rxorx+ I and also to rxorx+l; and the group U of all homeomorphisms of X which is generated by all reversals
r
and all r' then contains ~, because X
elements 7.4.5.
Proposition.
all the
the corresponding
<
the summability
=
condition
+ ®
cocycle V is the restriction
U to C(X) which verifies
Vab
system of this group.
If J verifies
n.J(n)
This cocycle
it contains
X
of a generating
the following
Va + Vb°a
to ~ of a mapping V from
cocycle relations:
-I
is completely determined
by its values
on a set of genera-
tors of U. The function V X that corresponds
Vx
=
to the reversal
~ -2J( ly-z I ).Oy.O y<x
r X is defined by
114
and the function V' x
=
In particular, the product
7.4.6.
[ -2J(ly-zl ) .o y<x~z y'°z
and for example,
=
V
+ V'or X
=
X
of all these results
Proposition.
is quasi-invariant Proof.
the reversal
T at all sites
is equal
to
rxor i and we obtain V
The proof
to r xI by
corresponding
is elementary. + u , with
The measure under
0
the action
We only have to prove,
the cocycle V as its modulus,
of U.
for every continuous
function
f on X, and
every point x of Z, the equalities + (for x)
=
u+(f.exp(Vx))
+(forl)
=
~+(f exp(~')) "
Indeed,
these reversals
We only prove it. Consider
generate
the first equality
U and the proof because
__< y
<
we get,
~+(f°~A
)
with respect
~+(f'exp(VA
n
function
of Z such that
function
f,
)) n
the continuous foT A
similar
to the action of ~ (with V
for every continuous
=
is quite
x
u+ is quasi-invariant
as its modulus)
Because
shall be achieved.
the second
then the finite part A n of all points x-n
Because
X
function
uniformly
tends
f is uniformly
continuous
on X, the
to for x when n tends to infinity.
n On the other hand,
the finite modification
product of the reversals Hence, using the cocycle
=
VA n
~A
can be written n
r x and rx_ n. relations of V, we get
Vx_ n + Vxorx_ n
as the
to
115
The quasi-invariance
property
(f.exp(V A ))
leads to
=
~+(f.exp(Vxorx_n).exp(Vx_n))
n
The uniform uniformly
continuity
of the continuous
function V x shows
We have thus proved
the following
convergence
result:
+ lim ~ (f.exp(Vx).exp(Vx_n)) n--> +® Because
that Vxorx_ n
tends to V--x when n tends to infinity.
the interaction
is invariant,
and we use now the strong mixing
=
p+(for x)
Vx_ n is the translate
property
VxoT n of Vx;
of ~+ to get
+ (for x) In order
to conclude
is equal
to i.
=
~+(f.exp(Vx)).~+(exp(Vx ))
the proof,
we still have to show that ~+(exp(Vx))
Hence we must note that the previous function, proof. 7.4.7.
and in particular
Corollary
i.e.
equal
sition
to ~-
for the constant
(proof of theorem
for the U-cocycle
equality
7.4.2).
V, it is in particular , and there
is no phase
holds
for every continuous
function
Because ~
invariant
+
i, achieving
is quasi-invariant
under
transition
the
the reversal
according
3,
to propo-
6.4.9.
7.5. REFERENCES
The characterization measures
of the invariant
for a suitable
by this author The original
continuous
proof
of the uniqueness by Ruelle
The study of attractive condition
described
invariant
way
in the last sec-
pair interactions
greater
in dimension
for which J is proportional
is then the summability
is necessarily
in a general
(5).
(i).
been done for the interactions exponent
result
as the equilibrium
is proven
in a joint work with D. Pinchon
tion was achieved
interaction
Gibbs measures
function
to n
of the series
than I. If it is greater
i has ; the
and the
than 2, Ruelle's
116
theorem shows that there is no phase transition verse temperature
stated that there is a phase transition therefore
for
a comparison with the hierarchical
= 2, known as Anderson's model, and Spencer
for any value of the in-
B. For an ~ strictly between 1 and 2, Dyson B adequately model.
large.
He used
The borderline
case,
was only solved recently by FrShlich
(I); they showed that there is a phase transition
values of the parameter.
(1,2)
for large
8. EQUIVALENCE OF COUNTABLE AMENABLE GROUPS
8.1. TILING AMENABLE GROUPS
8.1.1. Definitions. The sequence of segments in Z, (An = {0 .... n}), encountered at the beginning of this text is the prototype for all F~Iner sequences in countable amenable groups. But the elements of this sequence,
the segments An, have an interesting
property which is a decisive tool for solving some problems of ergodic theory:
they are tiling sets.
A finite part of a group,
for which it is possible to tile the whole group
with some pair-wise disjoint right translates of it, is called a tiling set. This notion is therefore in fact related to equivalence classes of finite parts under the right translations which can be called forms. Some authors,
Ornstein and Weiss in particular,
have used F~iner sequences
consisting of tiling sets in groups other than Z. Notice that, in all known amenable groups,
it is always possible to con-
struct explicit F~Iner sequences consisting of tiling sets. This means that the class of groups with arbitrarily invariant tiling sets is stable, like the class of amenable groups, under the constructions we described in section 2.3
(see Moulin Ollagnier and Pinchon
that
(3)).
But, as we said in chapter 2, no structure theorem for amenable groups is stated herein. Connes,
Ornstein and Weiss have introduced the notion of almost tiling:
with a finite number of different forms, each of them being as invariant as desired,
it is possible to cover the greatest part of the group with
almost disjoint right translates of these forms. This property is true for all amenable groups and is the subject of the next lemma.
118
8.1.2. Tiling
lemma. Let A, A and D be three finite parts of a group G
and let the unit element e of G belong Then,
there exists a partition
right
translates
of subsets of A, such that the proportion
in A for which all left translates atom,
to D.
of A, whose atoms are contained
is greater
dg by elements
Let A' be the subset of A consisting
is contained
of D are in the same
than
(I - m D ( A ) / I A I ) . ( I A I / I D - I A I ) . ( I
Proof.
in some
of all points
- m D ( A ) / I A I)
of all points
g such that Dg
in A.
By definition,
we get
IA - A'I = mD(A)
Denote by T the set of all total orders on A-~A = {geG,Ag~A~@}. Consider P(T)
the elements
gl .... gk''"
as the partition of A whose
following
A1
=
AnAg 1
A2
=
(A~Ag2)\A 1 k-i (A~Agk)\( U A i) 1
The notation P(T,x) measure
then stands
for the class of x in the partition P(~).
the uniform probability measure
~ on T, i.e. the probability
giving to every point ~ of T the mass
For a given element x in A', the probability P(~,x)
is exactly the probability
the first element This
of the
finite sequence:
"'" Ak =
Consider
of A-~A in the order T, and define atoms are the non-empty elements
I/(IA-~AI)!. that Dx is contained
that Dx is contained
in the order T such that Ag meets Dx.
last event depends only on the restriction
subset A-~D.x; probability Therefore,
in
in Ag, where g is
and its probability
of the order • to the
can be easily obtained because
for a given element g being the first in A - ~ the probability
is equal
of the event that we are searching
the ratio l{g,AgcDx}I/I{g,Ag~Dx#@}l
the to
is equal
to
119
i.e.
to the ratio
(i
Summing
mD(A)/IAI).(IAI/ID-I.A
-
I)
on all x in A', we get the expectation
E( [ iDxcP(~,x )) xeA
~
for
E( ~ IDxcP(~ x)) xeA' ' (IA'I).(IAI/ID-~AI).(I
The proportion positive
(I
There
of all x of A such that Dx is contained
function
of T whose expectation
is bounded
mD(A)/;AI).(IA I/ID-I.AI).(I
-
then exists
the above number.
an order T for which The corresponding
expected inequality. triple (A,A,D).
in P(T,x)
is a
below by
- mD(A)/IA ])
this proportion
partition
Such a partition
- mD(A)/IAI)
P(T)
in greater
thus verifies
is said to be adapted
than the
to the
8.1.3. Iterated tiling lemma. Let A, A 1 . . . . , An, and D 1 .... , D n be finite parts of a group G such that
eeD I c D 2 ~ There
then exists I) 2)
a family
for every right
... c D n (PI ..... Pn ) of partitions
subscript
translates
for every i from 1 to n-l, tion of Pi' i.e.
3)
the proportion is contained ponding
(i
Proof.
i between
of subsets
Consider
1 and n, the atoms
of Ao l the partition
Pi+l is a subparti-
the atoms of Pi+l are unions
of the points
of Pi are
of atoms of Pi
x of A for which every set D.x l containing x of the corres-
in the atom Pi(x)
partition
-
of A such that
m D (A)/i n
Pi is greater
than
i=n AI. ~ (IAil/ID?llAil).(l i=l
the finite probability
space
- mDi(Ai)/IAil)
(T,~), which
of the (Ti,~i) where T i is the set of all total orders the uniform probability on it.
is the product
on A?~AI and ~i is
120
!
For every n-uple
• = (~i ..... ~n ), consider
partitions
where every P! is built as it was in the proof of the
of h
preceding lemma. by setting
A new sequence
p
Pn-I P1
Conditions
of partitions
=
p, n
=
p,
=
Pi V P2
n
Call A' the subset of A consisting
E
i=n ~ ({TeT, i=l
where
Pi(T,x)
built
from the n-uple
stands
By construction
is then defined
of all x such that DnX is contained of the event
Dix(Pi(~,x)}) x of the partition
Pi(~),
~.
of the sequence
of all Pn from the one of the P'n' the
defined
i=n j=n ( ~ ({reT,
as the following
J
of D i :increases, definition
intersection:
D~x~P'(T,x)})) J
i=l j=i
Since the sequence
of
by such a sequence.
for the atom that contains
event E can be equivalently
get the following
(PI,...Pn)
for an x in A', the probability
=
(PI ..... Pn)
n-I V Pn
1 and 2 are then fulfilled
in A; and calculate,
the sequence
this expression
can be simplified
to
of E:
i=n E
({TeT,Dix c P~(r ,x) })
=
i=l
Because
the factors
get the probability
of the product
space T are mutually
independent,
we
of this event as the product
i=n 1 H (I - mD(Ai)/IAil).(IAil/ID.7~.Ail).,,,,, ~ , i=l l Conclude
as in the previous
A family of partitions tiling
lemma by summing
that verifies
and is said to be adapted
over all x in A'
the inequality
is called
to the given 2n+l-uple.
an iterated
121
8.1.4. Definition.
Denote by T O the subset of all total orders on G which
are isomorphic to the natural order of Z, i.e. the orders without a first element, without a last one, and for which there are only finitely m a n y elements b e t w e e n two given ones. Set T O is a subspace of the Hausdorff compact space T of all total orders on G, and it is invariant under the action of G d e s c r i b e d in 2.1.3. When the amenable group G is countable,
this set is not empty.
From now on, we shall work under the a s s u m p t i o n that G is countable. N o w we want to know if there exists, lity measures
among all invariant Radon probabi-
on T, a p r o b a b i l i t y measure ~ such that ~(T 0) = I, when
the countable group G is amenable.
Because T O is not compact, we cannot
apply the fixed-point p r o p e r t y to get the answer.
Since every Radon p r o b a b i l i t y measure on T is inner regular, we get
(T 0)
=
Sup ~ (K) KcT 0
w h e r e K runs in the set of all compact subspaces of T contained in T O .
R e v e r s i n g the order in which we take the least upper bounds, we get
Sup ~(T 0) ~eK(C(T),s,G)
=
Sup Sup u(K) KCT 0 ~eK(C(T),s,G)
But the indicator of a compact set is an upper semi-continuous
function,
and we can use the ergodic theorem 3.1.3 to get the least upper bound of all invariant p r o b a b i l i t y measures on such a compact set.
Next, we have to study the compact sets contained in T O .
8.1.5.
Definition.
Given an element • of TO, d r stands for the function
on GxG w i t h integer values defined by
d (g,h)
=
i +
=
0
I{keG, g~k~h or h~k~g}[
if
g
=
if
g # h
h
It is a distance on G.
8.1.6. Proposition.
Let d be a distance on G with integer values.
K d of all total orders such that d
The set
is less than or equal to d is compact.
122
Proof. equal
Consider the function dg,h on T with values to the n u m b e r
is
of elements between g and h plus one if this number
is finite and to += otherwise.
This function is lower semi-continuous.
The set of all total orders such that dg,h(T) d(g,h)
in N where dg,h(T)
is less than or equal
to
is then a closed subset of T.
The set K d is the intersection of all these closed sets when g and h run in G, and it is therefore compact.
8.1.7.
Theorem.
Let G be a countable amenable group.
Then,
there exists
a Borel probability, w h i c h is invariant under the right translations,
on
the set T O of all total orders on G that are isomorphic to the natural order of Z.
Proof.
It shall be sufficient to find,
a distance d on G with integer values
for every positive real number e, such that the following holds for
the c o r r e s p o n d i n g compact set Kd:
Sup ~ (K d) ueK(C(T),s,G)
>
I -
We then find an invariant Radon p r o b a b i l ~ y measure on T that gives the whole mass to the union of all K d. This union is p r e c i s e l y the set of all total orders for which there are only finitely m a n y elements b e t w e e n two given ones.
On the other hand,
thanks to an easy invariance argument,
the set of all
total orders with a first or a last element has a measure 0 for every invariant Radon p r o b a b i l i t y measure on T. The difference is the set T O of all orders that are isomorphic to the order of Z. Thus we must look for distances on G with integer values,
in particular
ones that have the following form:
d(g,h)
=
d(g) + d(h)
if
w h e r e d is a integer function on G equal
Take a converging sequence
(l-e i)
>
to 0 at e.
(~n) of p o s i t i v e real numbers
such that
i=l
g#h
1 -
E
less than i
123
and an increasing
sequence
(D n) of finite parts,
containing
the unit
element e of G, whose union is G. Because G is amenable,
there exists,
for every subscript
i, a finite
part A i of G such that
(IAil/IDi~Ail).(I
- mD!Ai)/IAil)
>
I - ~i
i
Define
Next,
therefore
the function d in the following way:
d(e) =
0
d(g) =
IAII - i
if
geDl\{e}
d(g) =
IAnl
if
geDn\Dn_ I
I
denote by K n the following
Kn
=
compact
subset of T:
{meT, VgeD n d (e,g)
Apply to the 2n+l-uple
(A,AI,..,An,DI,..,D n) the iterated
We then get an adapted
iterated
Now construct
the following
tiling
tiling
lemma.
(PI .... ,Pn ) of A.
order m on G:
-
let the restriction
of T to every atom of PI be arbitrary
-
get the restriction
to an atom of P2 by setting an arbitrary
order on the set of all atoms of PI contained taking the lexicographical -
in it, and then by
order on it
and so on in such a way that the atoms of all partitions intervals
for the built order on A
decide that all elements An easy computation
out of A are less than those of A.
then shows that the ergodic average
(III^I).geAI IKnoTg(m) is greater
Pi are
than
(i - m D
i=n ( A ) / I A I ). ~ (I - e i ) n i=l
124
Taking the limit along the ameaning filter for the part A, and applying the ergodic theorem 3.1.3, we get
Sup ~(K n) ueK(C(T) ,s,G) The compact set K d contained Applying now corollary
in T O is the intersection
of all K n.
of K d is greater
than or equal to i - e.
the proof of the theorem.
8.2. EQUIVALENCE
8.2.1.
i=n ~ (i - ~i ) i=l
1.3.7, we deduce that the least upper bound of all
invariant Radon probabilities This completes
>
OF COUNTABLE
GROUPS
Definitions.
Endowed with the product topology of discrete
topologies,
the set H G of
all mappings from the countable Polish space.
group G in the countable group H is a
Its subspace A(G,H),
of all elements
consisting
for which the image of
the unit element of G is the unit element of H, is a closed subspace, and thus a Polish space for the induced topology. Let g be an element of G. The right translation by g does not preserve the subspace A(G,H). We must now define a mapping T g from A(G,H) following
definition Tg(a)
to itself by giving it the
on the graphs: =
{(glg-l,hlh-l),
(gl,hl)ea}
where h is chosen in such a way that (e,e) belongs h is the image of g by a. Hence, using the functional Tg(a) (g ')
notation, =
to Tg(a),
i.e. that
T g is defined by
a(g'g) .a(g) -i
It is easy to verify that the mappings
T g are continuous
and that they
constitute
an action of G on the Polish space A(G,H) by homeomorphisms.
We denote,
too, by A(G,H)
the corresponding
dynamical
system.
125
The subset I(G,H)
consisting
of all one-to-one
elements
closed and invariant under the action of G. Once again, stands
for the corresponding
The set B(G,H) mappings
But,
B(G,H)
also denotes
(a G6 in a Polish
in this system,
possible
I(G,H)
is
also
system.
is the subspace of I(G,H)
from G onto H. This subspace
of G. Once more, system
dynamical
of A(G,H)
consisting
of the one-to-one
is also invariant under the corresponding
the action
standard
dynamical
space is a Polish space).
the two groups play symmetrical
to define an action of H on B(G,H)
roles and it is
in a way very similar
to
that of G. It is not obvious, bility measure
however,
that there exists
for this dynamical
an invariant
system when G is amenable.
dard space is not a compact one and it is not possible point property because
Borel probaThe stan-
to use the fixed-
the convex set of all Borel probability measures
is also not compact. The next proposition makes
clear the symmetry between
the roles of the
two groups when acting on the set B(G,H). 8.2.2.
Proposition.
Let B(G,H)
be the set of all one-to-one mappings
from G onto H, giving to the unit element of G the unit element
of H as
its image. Consider
the action T of G on this space given by Tg(x) (g ' )
=
x(g'g) .x(g) -I
and the action U of H on it defined by uh(x)(g ') Then,
=
triction of the product Moreover,
Proof.
where
g = x-l(h)
for the topology of a Polish space on B(G,H),
mappings under
x(g'g).x(g) -I
topology of discrete
ones,
obtained
as the res-
the T g are continuous
and the U h are Borel ones. every Borel probability measure
the action of G, is also invariant The continuity
on the space which is invariant under
that of H.
of the T g is straightforward.
In order to state the Borel m e a s u r a b i l i t y
of the transformation
U h, we
126
need only show that, elements
for every pair
This makes
obvious
O ({x,x(g)=h geG
that this set is a Borel
the invariance
for the cylinder A because
set because
it is a F
O
in the
=
.o. and X(gn)=h n}
sets generate
the Borel o-algebra.
U ({x(glg)x(g)-l=hl}~...~{X(gng)X(g)-l=hn}N{x(g)=h}) geG equality where
(uh)-I (A)
=
Using now the invariance
the union is disjoint:
0 (Tg) -i (A~{ x(g-l)=h-l} ) geG of ~ under T, we obtain
~((uh)-I(A))
8.2.3.
=
~ geG
=
,(A)
(Afl{x(g-l)=h-l})
Remark.
The mapping
from B(G,H)
ment of B(G,H)
its inverse
mapping
Nevertheless,
to the Borel
tions becomes
the standard structure,
is invariant
is invariant under
Remark.
the actions
space
fact,
giving
to every ele-
is not a continuous it is a Borel
B(G,H)
the postulated
and forget symmetry
one°
isomorphism. the topology
of the two ac-
clear.
is then an easy corollary
probability
to B(H,G),
as image,
and this is an important
if we consider
leading
this property
set, we get
the following
8.2.4.
and x(g'g)=h'.x(g)})
of ~ under U, we must only verify
{x(gl)=h I and
these cylinder
(uh)-I(A)
This
as follows:
sets
=
For such a Borel
Thus,
the set of all
space B(G,H).
To state
Hence
in GxH,
such that uh(x)(g ') = h' can be decomposed {xeB(G,H),uh(x)(g')=h '} =
Polish
(g',h')
under
of the previous
proposition
one of the two actions
that a Borel
if and only if it
the other one.
A probability
measure
on B(G,H)
T and U of G and H, is ergodic
which
is invariant
for one action
under
if and only if
127
it is ergodic
for the other.
Indeed,
the measurable
because
the orbits under the two actions
8.2.5.
Definition.
if there exists
invariant
sets are the same for the two actions
Two countable
groups G and H are said to be equivalent
a probability measure
on the Polish space B(G,H) which
is invariant under the two previously This relation
is clearly a reflexive
term "equivalence" 8.2.6.
Theorem.
is a Borel
by proving
the transitivity
tensor product
Proof.
one. We justify the
of this relation.
Let G, H and K be three countable
groups.
on B(G,H)
Suppose that
and v a Borel
invar-
on B(H,K).
the probability measure ~ v
actions
defined actions of the groups. and symmetrical
invariant probability measure
iant probability measure Then,
are the same.
of ~ and v under
on B(G,K) which
is the image of the
the composition
is invariant under
from the product
space B(G,H)xB(H,K)
the
of G and K on B(G,K). The composition mapping
B(G,K),
defined by (x,y) --> z = xoy,
Borel o-algebras,
In order to state the invariance of G on B(G,K),
is measurable
allowing us to define
the image ~ v
of the measure ~ v
it is sufficient,
with respect
to
to the
on B(G,K). under
as we have previously
the action V
noted,
to prove
the equality ~ v ( ( V g)-l(A))
when the Borel
A
We can write
=
set A is a cylinder
=
the following
equalities:
n
(vg)-l({z,z(gi)=ki})) 1
set
{z(gl)=kl}~... N{Z(gn)=kn}
n
~v(~
~v(A)
= ~v(~({z,z(gig)z(g)-l=ki})) 1 n = ~ ~v(({z,z(g)=k})~A({z,z(gig)=kik})) keK 1
128
Using the definition of the measure ~ v with the tensor product ,®v, the above number is equal to the following sum: n
~ ~ {~(({x,x(g)=h}) keK heH (hi)eH n
O N({x,x(gig)=hih})). 1 n (({y,y(h)=k}) ~ ~({y,y(hih)=kik}))} 1
Employing the actions T and U, we see that this sum is equal to n
~ ~ {~((Tg)-l(({x,x(g-l)=h-l}) keK heH (hi)eH n
0 ~({x,x(gi)=hi}))). I n ~ ~({y,y(hi)=ki})))} 1
((U h)-l(({y,y(h-l)=k-1}) Next u s i n g t h e i n v a r i a n c e we find that the number
of u u n d e r T and the i n v a r i a n c e
o f v u n d e r U,
n ~ v ( ~ ( v g ) -I ( { z , z ( g i ) = k i } ) ) 1 is equal to n
k~eK h~eH (hileH n {~(({x,x(g-l)=h-l}) A n({x,x(gi)=hi})).l n v (({y,y(h-l)=k -I}) ~ ~({y,y(hi)=k i}))} I The following equality then results from the o-additivity product: n
~ v ((vg) -I (N({ z ,z (gi) =ki} ) ) I
of the tensor
n
=
~ v (N({ z,z(gi)=ki} ) ) I
and the "convolution product" is then invariant under the action V of G on B(G,K). 8.2.7. Proposition. All countable amenable groups are equivalent sense of definition 8.2.5.
in the
Proof. Since the involved relation is an equivalence one, we need only state the equivalence of any amenable group with a fixed one, Z. The measurable space B(G,Z) can be imbedded in the compact space of all total orders on G by identifying an element x of B(G,Z) with the total order on G, which is the image by x of the natural order of Z.
129
This embedding
is clearly a one-to-one mapping
space T O of T. In addition, two measurable B(G,Z)
this mapping
from B(G,Z)
is an isomorphism between
spaces and gives a conjugacy between
and the action of G by right translations
The existence results
of an invariant
from the existence
onto the subthe
the action of G on
on T O .
Borel probability measure
on B(G,Z)
then
of a similar measure on TO, which we proved
in section 8.1. 8.2.8.
Proposition.
an invariant
Given two equivalent
countable groups G and H, and
Borel probability measure ~ on B(H,G),
for every element m of the cone E(G),
we can construct,
a real function m
on the set F(H)
of all finite parts of H by setting m (A)
f
=
m(x(A))
d~(x)
B(H,G) This function m Proof.
belongs
to the cone Z(H)
and we get q(m ) ~ q(m).
These results are easily obtained by integration,
invariance 8.2.9.
follows
from the invariance
Proposition.
Given two equivalent
countable
groups G and H, and
an element m of the cone E (G), denote by i e the upper function on the compact
set T(G)
and the right
of ~.
semi-continuous
of all total orders on G defined
example
3.1.13;
denote by Je the upper
semi-continuous
defined
in the same way from the invariant
in
function on T(H)
capacity m , built
from m
with the invariant measure ~ on B(H,G). Let ~H be a Radon probability measure
on T(H) which is invariant under
the action of H by right translations. Consider
the measurable mapping
where x(T)
(x,r) --> x(r)
from B(H,G)xT(H)
to T(G)
is defined by (a,b)e~
This mapping
-~ 3
(x(a),x(b))ex(T)
sends the tensor product ~®~H to a probability measure ~G
on T(G). Then, Proof.
the integral
of Je for ~H is equal
The following
ie(~) fT(G)
equality results
d~G(T)
=
f
to the integral
of i e for ~G.
from the very definition of ~G:
i (x(T)) B (H,G)xT (H) e
d~ (x) d~H(~)
130
The integral
for p of ie(X(~))
is equal
to je(~)
and the result is
obtained. 8.2.10.
Corollary.
A countable
one is also amenable. amenable
group,
The mean value of an invariant
and the mean value of the invariant
it on another equivalent Indeed,
group which is equivalent
group,
if H is an amenable
to an amenable
capacity
on an
capacity built from
are equal.
group and,
if G is equivalent
=
=
to H, we get,
for every element m of t(G),
q(m)
<
=
q(m )
This shows first that q(m)
~H(Je )
is equal
invariant probability measures according
to example
3.1.13.
to the least upper bound of all
of ie, and,
The equality
8.3. ROKHLIN'S LEMMA AND HYPERFINITENESS
8.3.1. space
First, (X,~,~)
recall Rokhlin's
lemma.
is said to be aperiodic
from 0, the measure
~G(ie)
then that G is amenable q(m) = q(m ) results.
OF COUNTABLE AMENABLE
A transformation
GROUPS
T of a probability
if, for every integer n different
of the set of all points
invariant under T n is equal
to 0. If the standard dynamical says that,
system
for every positive
n, there exists a measurable
(X,~,~ ,T) is aperiodic,
is greater
disjoint and the measure
of their
than 1 - E.
We cite here the strongest result was achieved by Connes, If R is a countable Lebesgue
lemma
integer
set F in X such that the images Ti(F), where
i runs between 0 and n, are m u t u a l l y union
Rokhlin's
real number E, and every positive
space,
in the field of hyperfiniteness,
amenable measurable
this equivalence
single transformation; of the probability
which
Feldman and Weiss:
this means
equivalence
relation
on a
relation can be generated by a that there exists an automorphism
space such that the graph of R is the set of all
(x,Tn(x)) where x runs in X and n in Z, up to null sets obviously.
T
131
We shall restrict ourselves in the proof of this result to the case of the action of an amenable group on a p r o b a b i l i t y space w i t h an invariant p r o b a b i l i t y measure.
8.3.2. Definition.
A standard dynamical
system is a dynamical
isomorphic to an action by Borelian automorphisms preserves
system
on a Polish space that
a Borel p r o b a b i l i t y measure.
The following measurable
section theorem is useful
to the proof of an
analog of Rokhlin'lemma.
8.3.3. Measurable section theorem. Let X be a Polish space and R be an e q u i v a l e n c e relation on X. Suppose that all equivalence classes are closed and that the saturated set of every open set is m e a s u r a b l e with respect to a given o - a l g e b r a Then,
that contains all open sets.
there exists a m e a s u r a b l e part of this o - a l g e b r a
that meets every
e q u i v a l e n c e class in one and a single point.
The proof can be found,
8.3.4.
Definition.
(X,~,~)
for instance,
o b v i o u s l y the unit one),
g
(I).
An action T of a group G on a p r o b a b i l i t y space
is said to be aperiodic
I
in Bourbaki
= {xeX,
if, for every element g of G (except
the m e a s u r e of the m e a s u r a b l e
set
Tg(x) = x}
is equal to 0. 8.3.5.
Definition.
Given an aperiodic action of a countable group on a
standard probability space
(X,~,~), denote by R the equivalence r e l a t i o n
whose graph is a m e a s u r a b l e part of XxX
(x,y)eR
<
3
~geG,
Tg(x) = y
8.3.6. Definitions. Under the previous hypotheses,
a tower is a symmetrical and transitive
m e a s u r a b l e r e l a t i o n on X whose graph is contained in R, and whose every class
is finite. The basis of the tower t is then the m e a s u r a b l e
of all points of X that verify
(x,x)et.
set t
132
Thus,
the restriction
Denote
then by t(x)
relation
of a tower to its basis is an equivalence
relation.
the class of a point x of ! for the equivalence
t.
We shall make no distinction between and shall only consider non-trivial not a null
towers towers,
that agree almost everywhere i.e. towers whose basis is
set.
Our aim is now to build towers with a basis as large as possible, such that the forms of G corresponding
to the classes
and
are as invariant
as possible. 8.3.7. Lemma dynamical
(separation
system.
Proof.
there exists
We need only repeat
several
part A' of A of positive
of F are m u t u a l l y disjoint.
times the following (f,f')
argument with all
is any pair of different
ele-
of F:
fixed point,
transformation
it is possible
tive measure,
last result
to a theorem,
an aperiodic
in A,
Let
to its image
of T to A is conjugate with the identity.
(I), such a transformation
Theorem.
included
are disjoint.
the proof of which can be found for instance
then almost everywhere 8.3.8.
any
for every event A with a posi-
part of the standard space A is equal
by T, the restriction Billingsley
to find,
space without
can be proved by contraposition.
If every measurable Thanks
of a standard
another event A' with a positive measure
such that A' and T(A')
and
,u,T) be a standard aperiodic
a measurable
T fo(Tf') -I, where
If T is a measurable
This
(X,
translates by the elements
transformations ments
Let
For every finite part F of G and every event A with a
positive measure, measure whose
lemma).
is isomorphic
equal to it, which contradicts
(X,~,u,T)
be a standard dynamical
action of a countable
amenable
in
to the identity and the aperiodicity. system where T is
group G by automorphisms;
let F be a finite part of G and A be a measurable
part of X with a
positive measure. For every positive contained
real number e, there exists
in A and which verifies
I
the following
a tower t whose basis inequality:
{u({xet,
Tf(x)eA and
(x,Tf(x))~t})
+
~({xet,
Tf(x)eA and
(Tf(x),x)~t})
}
feF <
E .~ (t)
is
133
Proof. Consider, for every non-empty finite part A of G, the following measurable function whose integral is equal to the measure of A: (I/{A{). I IAoTg geA For every element f of F, the integral of the function (IIIAI). IAoTg. iAoTfg geA, fg~A is coarsely bounded above by mF(A)/IAI . Thereupon follows {{xeAy,Tf(x)eA and Tf(x)~Ay}{
d~(y)
X <
(mF(A)/IAI).(I/~(A)). f IAyqA{ X
d~(y)
and the analogous result f l{x~Ay,xeA,Tf(x)eAY ~A}I d~(y) X <
(mF(A)/IAJ).(I/~(A)). ~ {AyNA{ X
dr(y)
Choose the finite part A such that mF(A)/{AI is less than ~.~ (A)/21 FI . The measurable part B of all y in X such that the sum {{xeAyAA,Tf(x)eA and Tf(x)~Ay}[
+ {{x~Ay,xeA,Tf(x)eAnAy}I
is less than e.IAy~AI , therefore has a strictly positive measure. Using the separation lemma, it is possible to find a measurable part C of B with a strictly positive measure and whose translates by all elements of A are disjoint. The tower t is then defined in the following way. Its basis is the set (A.C)~A, and two points in this basis are related if the single element of C of which they are translates by an element of A is the same. This tower answers the question. 8.3.9. Corollary. If the basis of the tower t has not a measure equal to i, and if t verifies the inequality X ~({xet,Tf(x) et and (x,Tf(x))~t}) feF
__< E.~ (t)
134 then there exists a tower t', whose basis
is disjoint
the tower t", union of t and t', still verifies Proof.
from t such that
the previous
inequality.
We need only apply theorem 8.3.7 to the complementary
part of !.
The computation
is elementary.
8.3.10 Theorem.
For every finite part F of G and every positive
number e, there exists
real
a tower t such that the sum
u ({x,(x,Tf(x))~t}) feF is less than E. This means
that it is possible
to find towers
that almost
contain the graphs of the T f. Proof.
Consider
towers
:
the following
strong order relation on the set of all
t I is less than t 2 if there exists
t 3 such that t 2 is the
disjoint union of t I and t 3. The set of all towers u({xet,
that verify Tf(x)et and
(x,Tf(x))~t})
<
E.u(t)
feF is inductive hand,
for the previous
order.
Corollary
8.3.8 shows,
that this set is not empty and, on the other hand,
element has X for its basis extendable. 8.3.11.
Such a maximal
Corollary.
on the one
that a maximal
since a tower with another basis is strictly element verifies
There exists
the predicted
an increasing
sequence
inequality.
of towers whose
union is R. Proof.
Let Fn be an increasing
sequence
is G, and let (E n) be a converging For every positive
of finite parts of G whose union
sequence of positive
integer n, the previous
real numbers.
theorem gives a tower tn that
verifies
U ({x' (x'Tf (x))~tn }) feF Denote greater
~
~n
n
then by t' the tower which is the intersection n than
or
equal
to
n.
of all t for m m
135
We get the following
f The
sequence
elements
inequality:
e[ ~({x, F n of
all
contains
t'
(x,Tf(x))~tn })
n
the
is
<
[ Sm
=
n
an increasing
graphs
of
all
one
Tf,
and
where
the
f runs
union in
of
G, u p
all to
its null
sets. 8.3.12.
Corollary:
Let (X,~,u,T)
the Rokhlin's
be a standard aperiodic
action of an amenable Then,
lemma for countable
countable
for every triple
n two positive
dynamical
amenable
groups.
system where T is an
group G.
(D,~,n), where D is a finite part of G, and ~ and
real numbers,
there exists a finite set I of pairs
(Ai,Ai),
with the following properties: I)
for every i in I, A i belongs
2)
the images Tg(Ai) of the Ai, where i is in I and g in Ai, are disjoint (up to null sets) and the measure of their union is greater
to M(D,~)
and A i to the o-algebra
than 1 - n.
Proof. Let L be a maximal
tower such that the sum
u ({x, (x,Td(x))~t}) deD is less than 6.n. The equivalence
relation
section theorem.
t verifies
all hypotheses
Let A be a measurable
disjoint union of all measurable
section.
of the measurable
Set A is the countable
sets AA, where A is a finite part of G,
that are defined by AA The previous
=
{xeA,
integral
AeFIG)fA
On the other hand, I
=
(x,y)et <
9
geA, y=Tg(x)}
is then equal to
(d[eDmd(A))
du(x)
the following I [A[ .~ (AA) AeF(G)
decomposition
holds:
136
Hence the inequality
AeF(G)
mD(A).V (AA)
<
6.~. AeF(G)
IAJ .~ (AA~
and the upper bound
[
A ~M(D,~ ) Then,
there exists a finite
M(D,~)
such that the finite
This achieves 8.3.13.
the aperiodic
Let
everywhere
n
set of finite parts of G that all belong sum of all IAJ .V (AA)
Let
(X,~,v,T)
is greater
be a standard dynamical
action of a countable amenable a bimeasurable
transformation
orbits almost everywhere
Proof.
<
to
than 1 - n.
the proof of the predicted result.
Theorem.
There exists whose
fAr.~(A A)
group G.
S of the probability
space
agree with those of T.
(t n) be an increasing equal
system where T is
sequence of towers whose union is almost
to R.
Apply the measurable
section theorem
to the equivalence
relation t n on
its basis. We then get a measurable
set A n which meets every equivalence
class in
a single point. Because
the sequence of relations
is increasing,
the saturated part of
A n for tn+ 1 is equal to the basis of tn+ I. Applying the measurable section theorem to the restriction the part An, we can find An+ 1 contained With this decreasing
sequence
of tn+ 1 to
in A n .
of measurable
sections,
we can construct
some order relations. First fix a reference stands
total order on G. For n greater
than I, An(X)
for the single finite part of G such that the equivalence
of x for the restriction Denote by Ak(X)
of tn on An_l,
tn(X)NAn_l,
class
is An.(X).
the single element of A k related by tk with x, and write
=
TgI(A l(x))
with gleAl(Al(X))
Al(X)
=
Tg2(A2(x))
with g2eA2(A2(x))
An_l(X)
=
Tgn(An(X))
with gneAn(An(X))
137
The order T n is defined on the basis related by tn for the lexicographical Two points
t of t by comparing -n n order.
two points
related by tn have the same image by A n and are characterized
by the n-uples (gn,...,gl). Thus, we never put any new element between
two elements
once they have
been related. In the limit, we obtain a measurable which
order relation T on the orbits
is a total one in the sense that two equivalent
points
are com-
pared by the order. For almost every x in X, there are only finitely many points between x and Tf(x). In order to achieve transformation 8.3.14.
Definition.
countable
the proof of the theorem,
Two aperiodic
actions
(X,
groups are said to be orbitally
measurable
consider
S that sends a point to its successor
,~,G) and
equivalent
invariant part X' of X with a measure
iant part Y' of Y with a measure
the measurable
for the order ~. (Y,
,v,H) of two
if there exist a
i, a measurable
i, and a bimesurable
invar-
mapping between X'
and Y' such that the image of the orbit of every point
is the orbit of
its image. 8.3.15.
Remark.
phism between
The isomorphism the orbital
An equivalence
the o-algebras
in the previous
of invariant
of these o-algebras
sense gives an isomor-
events.
is then a necessary
condition for
equivalence.
Let's give Dye's result
in the ergodic
case, where
these o-algebras
are
trivial. 8.3.16. valent
Theorem.
Two ergodic actions
in the sense of definition
of Z on Lebesgue
The proof can be found in a paper by Dye 8.3.17.
Corollary.
Indeed,
hyperfiniteness
is amenable
ergodic
is transitive.
systems where the coun-
are equivalent.
gives the equivalence
standard action of Z, and Dye's since equivalence
(i).
All standard aperiodic
table group of transformations
spaces are equi-
8.3.14.
of such a system with a
theorem allows us to conclude
the proof
138
8.3.18.
Remark.
amenable
Conversely, allows
It is simple enough to show the hyperfiniteness
group action on a standard the existence
space from Rokhlin's
of a single transforamtion
to go from the usual Rokhlin's
for amenable groups.
It is therefore
invariant
described
capacities
of an
lemma.
with the same orbits
lemma to the version that we gave sufficient
to use the transfer
of
in section 8.2.
8.4. REFERENCES
For the results As regards
present author's Our note
about almost tiling,
tiling particular
consult
joint work with D. Pinchon
(4) concerns
amenability
The proof of the hyperfiniteness relation was done by Connes, Rokhlin'
refer to Ornstein and Weiss
groups,
of groups
for example,
that are equivalent
Feldman and Weiss
ergodic theory because of its fundamental to Conze
(5).
(4) and the
(3).
of an amenable
lemma is shown in his exposition
tions; refer,
their paper
countable
to Z.
equivalence
(I).
paper
(I) and in many books on
character
for so many applica-
(i).
A precise reference
for the measurable
Bourbaki on general
topology,
chapter
section theorem is the work of 9, page TG IX 70 of the last
edition. Dye's article
(I) gives different
ness and the orbital
equivalence
equivalent
The comparison between isomorphism (I, pp. 66-73).
definitions
of ergodic systems and conjugacy
of hyperfinite-
is stated there.
is made by Billingsley
BIBLIOGRAPHY
(I)
ADLER, R. L., KONHEIM, A. G. and McANDREW, M. H., Topological entropy. Trans. Amer. Math. Soc. 114 (1965), 309-319.
(2)
ADLER, R. L. and MARCUS, B., Topological entropy and equivalence of dynamical systems. Memoirs of Amer. Math. Soc. n°219 (1979).
(1)
ALAOGLU, L. and BIRKHOFF, G., General ergodic theorems. Annals of Mathematics Vol. 41 n° 2 (1940), 293-309.
(1)
BATTY, C. J. K., An extension of an inequality of R. Holley. Quarterly J. Math. Oxford (2), 27 (1976), 457-461.
(1)
BEWLEY, T., Extension of the Birkhoff and Von Neumann ergodic theorems to semigroup actions. Ann. I. H. P., Vol VII, n°4 (1971), 283-291.
(1)
BILLINGSLEY, P., Ergodic Theory and Information. and mathematical s t a t i s t i c s (1965).
(I)
BIRKHOFF, G. D., Proof of a recurrence theorem for strongly t r a n s i t i v e systems. Proc. Nat. Acad. Sci., Vol. 17 (1931), 650-655.
(1)
BOURBAKI, N., El~ments de math~matique. Topologie g#n~rale. Hermann (1971).
(2)
BOURBAKI, N., El~ments de math~matique. Integration. Hermann (1965).
(3)
BOURBAKI, N., El~ments de math~matique. Espaces vectoriels topologiques. Hermann (1966).
(1)
BOWEN, R., Entropy for group endomorphisms and homogeneous spaces. Trans. Amer. Math. Soc. 153 (1971), 401-414.
(1)
BREIMAN, L., The individual ergodic theorem of information theory. Ann. Math. Stat. 28 (1957), 809-811 and 31 (1960), 809-810.
(1)
BRUNEL, A., Une preuve ~l~mentaire du lemme de Holley, in Th~orie Ergodique, Rennes 1973/1974, Lecture Notes in Mathematics 532, Springer-Verlag, Berlin, Heidelberg, New-York (1976). 15-18.
(1)
CHOQUET, G., Lectures on analysis I , I I , I I I .
(1)
CHATARD, J., Applications des propri~t~s de moyenne d'un groupe localement compact ~ la th~orie ergodique. Th~se de 3~me cycle. Paris (1972).
(1)
CONNES, A. and KRIEGER, W., Measure Space Automorphisms, the normalizers of t h e i r f u l l groups, and Approximate Finiteness. Journal of functional analysis. 24 (1977), 336-352.
Wiley series in probability
W.A.Benjamin Inc.(1969).
140 (2)
CONNES, A., FELDMAN, J. and WEISS, B., An amenable equivalence relation is generated by a single transformation. Erg. Th. and Dynam. Sys. i (1981), 431-450.
(1)
CONZE, J. P., Entropie d'un groupe ab~lien de transformations. Z. Wahrscheinl i c h k e i t s t h e o r i e verw. Geb. 25 (1972), 11-30.
(1)
CORNFELD, I. P., FOMIN, S. V. and SINAI, Y. G., Ergodic theory. Springer Verlag, Berlin, Heidelberg, New York (1982).
(1)
DELLACHERIE, C., Quelques commentaires sur le prolongement des capacit~s, in S~monaire de Probabilit~s V. Lecture Notes in Mathematics 191, SpringerVerlag, Berlin, Heidelberg, New-York (1971). 77-81.
(1)
DENKER, M., GRILLENBERGER, C., SIGMUND, K., Ergodic Theory on compact spaces. Lecture Notes in Mathematics 527. Springer-Verlag, Berlin, Heidelberg, NewYork (1976).
(I)
DERRIENNIC, Y., Un th~or~me ergodique presque sous-additif. The Annals of Probability Vol. 11N ° 3 (1983), 669-676.
(i)
DIEUDONNE, J., El~ments d'analyse. Gauthier-Villars. (1975).
(1)
DINABURG, E. I . , The relation between topological entropy and metric entropy. Soviet Math. Doklady 11 (1970), 13-16.
(i)
DIXMIER, J., Les moyennes invariantes dans les semi-groupes et leurs applications. Acta Sci. Math. Szeged 12 (1950), 213-227.
(i)
DOBRUSHIN, R. L., Existence of phase transitions in models of a l a t t i c e gas, in Proceedings of the 5th Berkeley Symposium, 73-87. Univ. of California Press (1966).
(1)
DYE, H. A., On groups of measure preserving transformations I. Amer. J. of Math. vol. 81 (1959), 119-159.
(2)
DYE, H. A., On groups of measure preserving transformations I I . Amer. J. of Math. v o l . 85 (1963), 551-576.
(I)
DYSON, F. J., Existence of a phase transition in a one-dimensional Ising ferromagnet. Comm. Math. Phys. 12 (1969), 91-107.
(2)
DYSON, F. J., Non existence of spontaneous magnetization in a one-dimensional Ising ferromagnet. Comm. Math. Phys. 12 (1969), 212-215.
(3)
DYSON, F. J., An Ising ferromagnet with discontinuous long-range order. Comm. Math. Phys. 21 (1971), 269-283.
(1)
ELSANOUSI, S. A., A variational principle for the pressure of a continuous Z2-action on a compact metric space. Amer. Jour. of Math. 99 (1977), 77-106.
(1)
EMERSON, W. R., GREENLEAF, F. P., Covering properties and F~Iner conditions for locally compact groups. Math. Zeitschr. 102 (1967), 370-384.
(2)
EMERSON, W. R., The pointwise ergodic theorem for amenable groups. Amer. Jour. of Math. 96 (1974), 472-487.
(3)
EMERSON, W. R., Averaging strongly subadditive set functions in unimodular amenable groups I. Pac. J. of Math. 61 (1975), 391-400.
141
(4)
EMERSON, W. R., Averaging strongly subadditive set functions in unimodular amenable groups I I . Pac. J. of Math. 64 (1976), 353-368.
(5)
EMERSON, W.R., Characterization of amenable groups. Trans. Amer. Math. Soc. 241 (1978), 183-194.
(1)
FELDMAN, J. and LIND, D. A., Hyperfiniteness and the Halmos-Rokhlin theorem for nonsingular abelian actions. Proc. of the Amer. Math. Soc. 55 (1976), 339-344.
(1)
FOLLMER, H., On entropy and information gain in random f i e l d s . Z. Wahrscheinl i c h k e i t s t h e o r i e verw. Geb. 26 (1973), 207-217.
(1)
F~LNER, E., On groups with f u l l Banach mean value. Math. Scand. 3 (1955), 243-254.
(1)
FRIEDMAN, N.,A. Introduction to Ergodic Theory. Van Nostrand Reinhold mathematical studies (1970).
(i)
FRITZ, J . , Generalization of McMillan's theorem to random set funcions. Stud. Sci. Math. Hung. 5 (1970), 369-394.
(1)
FROHLICH, J. and SPENCER, T., The phase transition in the one-dimensional Ising model with I / r 2 interaction energy. Comm. math. Phys. 84 (1982), 87i01.
(1)
FURSTENBERG, H., S t r i c t ergodicity and transformation of the torus. Amer. J. of Math. 83 (1961), 573-601.
(2)
FURSTENBERG, H., The structure of d i s t a l flows. Amer. J. of Math. 85 (1963), 477-515.
(1)
GALLAVOTTI, G., Ising model and Bernoulli schemes in one dimension. Comm. math. Phys. 32 (1973), 183-190.
(1)
GEORGII, H. 0., On K-systems and b e r n o u l l i c i t y in s t a t i s t i c a l mechanics, in Th~orie Ergodique, Rennes 1973/1974, Lecture Notes in Mathematics 532, Springer-Verlag, Berlin, Heidelberg, New-York (1976). 216-227.
(1)
GOODMAN, T. N. T., Relating topological entropy and measure entropy. Bull. London Math. Soc. 3 (1971), 176-180.
(I)
GOODWYN, L. W., Comparing topological entropy with measure theoretic entropy. Amer. J. of Math. 94 (1972), 366-388.
(1)
GREENLEAF, F. P., Invariant means on topological groups. Van Nostrand mathematical studies (1969).
(2)
GREENLEAF, F. P., Ergodic theorems and the construction of summing sequences in amenable l o c a l l y compact groups. Comm. in pure and appl. Math. XXVl (1973), 29-46.
(3)
GREENLEAF, F. P. Group structure and the pointwise ergodic theorem for connected amenable groups. Adv. in Math. 14 (1974), 153-172.
(1)
HAJIAN, A., ITO, Y. and KAKUTANI, S., Full groups and a theorem of Dye. Adv. in Math. 17 (1975), 48-59.
(1)
HALMOS, Po R.
Ergodic Theory. Chelsea publishing company (1956).
142
(1)
HANSEL, G., Sur les in~galit~s de Holley, in Th~orie Ergodique, Rennes 1973/1974, Lecture Notes in Mathematices 532, Springer-Verlag, B e r l i n , Heidelberg, New-York (1976). 143-151.
(I)
HOLLEY, R., Remarks on the F.K.G. i n e q u a l i t y . Comm. Math. Physics 36, (1974), 227-231.
(I)
JACOBS, K., Ergodic decomposition of the Kolmogorov-SinaT i n v a r i a n t , in Proc. Internat. S)nnp. in Ergodic Theory, Academic Press, New-York (1963), 173-190.
(I)
JEWETT, R. I . , The prevalence of unique e r g o d i c i t y . J. of Math. and Mech. 19 (1970), 717-729.
(1)
KIEFFER, J. C., A generalized Shannon-McMillan theorem for the action of an amenable group on a p r o b a b i l i t y space. The Annals of Proba. 3 (1975), 10311037.
(2)
KIEFFER, J. C., A r a t i o l i m i t theorem for a strongly subadditive set function in a l o c a l l y compact amenable group. Pac. Jour. of Math. 61 (1975), 183-190.
(i)
KOLMOGOROV, A. N., A new i n v a r i a n t for t r a n s i t i v e dynamical systems. Doklady Akad. Nauk SSSR 119 (1958), 861-864.
(i)
KRIEGER, W., On unique e r g o d i c i t y , in Proceedings of the 6th Berkeley Symposium, v o l . I I , Univ. of C a l i f o r n i a Press (1972). 327-346.
(2)
KRIEGER, W., On non-singular transformations of a measure space I. Z. Wahrscheinlichkeitstheorie verw. Geb. 11 (1969), 83-97.
(3)
KRIEGER, W., On non-singular transformations of a measure space I I . Z. Wahrschein]ichkeitstheorie verw. Geb. I I (1969), 98-110.
(4)
KRIEGER, W., On entropy and generators of measure-preserving transformations. Trans. Amer. Math. Soc. 149 (1970), 453-464.
(5)
KRIEGER, W., On exhaustive o-algebras of ergodic measure-preserving transformations. Z. Wahrscheinlichkeitstheorie verw. Geb. 20 (1971), 75-82.
(I)
KRYLOV, N. et BOGOLIUBOV, N., La th~orie g~n~rale de la mesure dans son a p p l i c a t i o n ~ l'~tude des syst~mes de la m~canique non l i n ~ a i r e . Ann. of Math. 38 (1937), 65-113.
(I)
LEDRAPPIER, F., Principe variationnel et syst~mes dynamiques symboliques. Z. Wahrscheinlichkeitstheorie verw. Geb. 30 (1974), 185-202.
(1)
LIND, D. A., The structure of skew products with ergodic group automorphisms. Israel J. of Math. 28 (1977), 205-248.
(i)
McMILLAN, B., The basic theorems of information theory. Ann Math. Stat. 24 (1953), 196-219.
(1)
MEYER, P. A., Probabilit~s et p o t e n t i e l . Hermann (1966).
(1)
MISlUREWICZ, M., A short proof of the v a r i a t i o n a l p r i n c i p l e for a Z~-action on a compact space. Ast~risque n° 40 (1977), 147-158.
(1)
MOULIN OLLAGNIER, J. et PINCHON, D., Un probl~me de minimum pour le couplage de deux p r o b a b i l i t ~ s . C.R. Acad. Sci. Paris 287 (1978), 153-156.
143 (2)
MOULIN OLLAGNIER, J. et PINCHON, D., Une nouvelle d~monstration du th~or~me de E. F~Iner. C. R. Acad. Sci. Paris 287 (1978), 557-560.
(3)
MOULIN OLLAGNIER, J. et PINCHON, D., Groupes pavables et principe v a r i a t i o n nel. Z. Wahrscheinlichkeitstheorie verw. Geb. 48 (1979), 71-79.
(4)
MOULIN OLLAGNIER, J. and PINCHON, D., Dynamical systems of total orders, in Ergodic Theory, Lecture Notes in Math. 729. Springer-Verlag, Berlin, Heidelberg, New-York (1979). 153-162.
(5)
MOULIN OLLAGNIER, J. et PINCHON, D., Mesures de Gibbs invariantes et mesures d'~quilibre. Z. Wahrscheinlichkeitstheorie verw. Geb. 55 (1981), 11-23.
(6)
MOULIN OLLAGNIER, J. et PINCHON, D., The variational p r i n c i p l e . Studia Mathematica LXXII (1982), 151-159.
(7)
MOULIN OLLAGNIER, J. et PINCHON, D., Filtre moyennant et valeur moyenne des capacit~s invariantes. Bull. Soc. Math. France 110 n°3 (1982), 259-277.
(8)
MOULIN OLLAGNIER, J . , Th~or~me ergodique presque sous-additif et convergence en moyenne de l ' i n f o r m a t i o n . Ann. Inst. Henri Poincar~ Vol. XIX n ° 3 (1983) 257-266.
(I)
NEVEU, J., Calcul des probabilit~s. Masson et Cie (1970).
(1)
ORNSTEIN D. S., Bernoulli shifts with the same entropy are isomorphic. Adv. in Math. 4 (1970), 337-352.
(2)
ORNSTEIN D. S., Ergodic theory, Randomness and dynamical systems. Yale mathematical monographs. Yale University press (1974).
(3)
ORNSTEIN D. S. and WEISS, B., F i n i t e l y determined implies very weak Bernoulli. Israel j . of Math. 17 (1974), 94-104.
(4)
ORNSTEIN. D. S. and WEISS, B., The Kakutani-Rokhlin theorem for solvable groups. Preprint (1976).
(5)
ORNSTEIN D. S. and WEISS, B., Ergodic theory of amenable groups I. The Rokhlin lemma. Bull. of the Amer. Math. Soc. 2 (1980), 161-164.
(6)
ORNSTEIN D. S., and WEISS, B., The Shannon-McMillan-Breiman theorem for a class of amenable groups. Preprint (1981).
(1
PARRY, W., Zero entropy of distal and related transformations, in Topological Dynamics (Gottschalk, W. H. and Hedlund, G. A. editors). Amer. Math. Soc. Colloquium Publications n°36 , Providence (1955).
(2
PARRY, W., Entropy and generators in Ergodic Theory. Benjamin (1969).
(3
PARRY, W., Topics in Ergodic Theory. Cambridge University press n°75 (1981).
(i)
PRESTON, C. J., Gibbs states on countable sets. Cambridge University Press n° 68.
(2)
PRESTON, C. J., Random fields. Lecture Notes in Math. 534. Springer-Verlag, Berlin, Heidelberg, New-York (1976).
(i)
ROKHLIN,V. A., Selected topics from the metric theory of dynamical systems. Amer. Math. Soc. Trans. (2) 49 (1966), 171-240.
144
(2)
ROKHLIN, V. A., Lectures on the entropy theory of measure-preserving transformations. Russian Math. Surveys 22 (1967), 1-52.
(i)
RUDIN, W., Real and Complex Analysis. McGraw-Hill (1966).
(i)
RUELLE, D., S t a t i s t i c a l Mechanics - Rigourous results. Benjamin (1969).
(2)
RUELLE, D., Thermodynamic Formalism. Encyclopedia of Mathematics and i t s applications n°5. Addison-Wesley (1978).
(3)
RUELLE, D., S t a t i s t i c a l mechanics of a one-dimensional l a t t i c e gas. Comm. math. Phys. 9 (1968), 267-278.
(i)
SHANNON, C., A mathematical theory of communication. Bell System Tech. J. 27 (1948), 379-423 and 623-656.
(I)
SHIELDS, P., The Theory of Bernoulli Shifts. The University of Chicago press (1973).
(2)
SHIELDS, P. and THOUVENOT, J. P., Une remarque sur les isomorphismes de schemas de Bernoulli qui pr~servent certains facteurs. Israel J. of Math. 21 (1975), 173-176.
(I)
SINAI, Y. G., On the notion of entropy of a dynamical system. Doklady Akad. Nauk SSSR 124 (1959), 768-771.
(2)
SINAI, Y. G., Weak isomorphism of transformations with invariant measure. Soviet Math. Doklady 3 (1962), 1725-1729.
(1)
SMORODINSKY, M., On Ornstein's isomorphism theorem for B e r n o u l l i - s h i f t s . Adv. in Math. 9 (1972), 1-9.
(1)
STRASSEN, V., The existence of p r o b a b i l i t y measures with given marginals. Ann. of Math. Stat. 36 (1965), 423-439.
(I)
THOUVENOT, J. P., Convergence en moyenne de l'information pour l ' a c t i o n de Z2. Z. Wahrscheinlichkeitstheorie verw. Geb. 24 (1972), 135-137.
(2)
THOUVENOT, J. P., Quelques propri~t~s des syst~mes dynamiques qui se d~composent en un produit de deux facteurs dont l ' u n est un schema de Bernoulli. Israel J. of Math. 21 (1975), 177-207.
(1)
WALTERS, P., A variational principle for the pressure of continuous transformations. Amer. J. of Math. 97 (1975), 937-971.
(2)
WALTERS, P., Ergodic theory, introductory lectures. Lecture Notes in Mathematics 458, Springer-Verlag, Berlin, Heidelberg, New-York (1975).
(3)
WALTERS, P., An Introduction to Ergodic Theory. Graduate Texts in Mathematics n°79. Springer-Verlag, Berlin, Heidelberg, New-York (1982).
(I)
WIENER, N., The ergodic theorem. Duke Math. J. 5 (1939), 1-28.
(1)
ZIMMER, R. J. , Orbit equivalence and r i g i d i t y of ergodic actions of Lie groups. Ergod. Th. & Dynam. Sys. 1 (1981), 237-253.
INDEX
abstract dynamical systems adapted 5.2.9 almost subadditive
2.1.5
4.4.3
almost subadditive ergodic theorem ameaning filter
4.4.6
2.2.8
amenable group 2.2.1 aperiodic group action
8.3.4
asymptotic -algebra 7.3.1 attractive interaction 6.2.13 automorphism of a probability space B
Bernoulli scheme
2.1.8
Birkhoff's theorem cocycle
2.1.4
3.3
6.1.2
conditionnal entropy 4.2.10 conjugacy 1.4.7, 4.1.4 covariant
4.4.4
Dini's lemma 1.3.6 dynamical Boolean algebra entropic distance
4.1.2
4.3.6
entropy of a dynamical system entropy of a partition
entropy of probabilty vectors equilibrium measures 1.4.1
finite modification fixed point property
4.2.1
7.2.5
equivalence of countable groups extremal point
4.3.2
4.2.7
6.2.1 2.1.11
8.2.5
146
generator
4.3.5
Gibbs measures Hahn-Banach Holley'
6.1.5
theorem
relation
information
1.1.4
6.4.5
relative
invariant
cocycle
invariant
interaction
Invariant
local specification
invariant
7.1.7
2.1.7
Ising model isomorphism
6.2.11
of two abstract theorem
Kolmogorov-Sinal local energies
dynamical
4.3.14
7.2.4 6.1.3
mean entropy of a partition mean entropy of order
capacity
section theorem
mixing properties
8.3.3
3.2
open segment property orbitally equivalent ordered partition
2.2.13
7.3
norm ergodic theorems
1.4.2 dynamical
systems
4.3.7
isomorphism
theorem
4.3.16
5.2.10
pair interaction partition
4.3.1
5.3.3
mean value of an invariant
overlap ratio
systems
6.3.3
theorem
local specification
Ornstein's
2.2.2
6.3.5
Kirkwood-Salsburg
N
theorem
6.2.10
Ising interaction
measurable
7.1.1
version of Hahn-Banach
interaction
4.4.1
7.1.15
invariant measure
M
to a partition
6.2.12
4.2.3
phase transition
6.3.1
pressure of a continuous
function
5.2.3
8.3.14
4.
147
pyramidal decomposition
2.2.15
quasi-invariant measures Radon measures
6.2.3
1.3.1
Riesz representation theorem
1.3.3
Rokhlin's lemma 8.3.1 Rokhlin's lemma for countable amenable groups saddle ergodic theorem
3.4
separated 5.2.1 Shannon-McMillan theorem
4.4.2
standard dynamical system strongly of order
8.3.2
5.3.3
strongly mixing 7.3.4 strongly subadditive functions subadditive functions sublinear function
2.2.10
3.1.5
I.I.I
supermodular interactions
6.4.2
symbolic dynamical systems T
2.1.2
tiling sets 8.1.1 tiling lemma 8.1.2, 8.1.3 topological dynamical systems
2.1.1
topological entropy 5.1.1 topologically transitive dynamical systems total orders tower
3.2.9
2.1.3, 8.1.4
8.3.6
uniquely ergodic topological dynamical systems variation table
2.2.18
variational principle W
8.3.12
weak mixing
7.3.5
5.1.3
3.2.8