Lecture Notes in Mathematics Edited by A. Dold and 13. Eckn'lann
534 Chris Preston
Random Fields
Springer-Verlag Berl...
34 downloads
1148 Views
5MB Size
Report
This content was uploaded by our users and we assume good faith they have the permission to share this book. If you own the copyright to this book and it is wrongfully on our website, we offer a simple DMCA procedure to remove your content from our site. Start by pressing the button below!
Report copyright / DMCA form
Lecture Notes in Mathematics Edited by A. Dold and 13. Eckn'lann
534 Chris Preston
Random Fields
Springer-Verlag Berlin. Heidelberg New York 197 6
Author C h r i s Preston King's C o l l e g e C a m b r i d g e / G r e a t Britain
Library of Congress Cataloging in Publication Data
Preston, Christopher J Random fields. (Lecture notes in mathematics ; 534) Bibliography: p. Includes index. 1. Stochastic processes. 2. Measure theory. 3. Statistical mechanics. 4. Equilibrium. I. Title. II. Series: Lecture notes in mathematics (Berlin) ; 534. QA3.L28 vol. 534 [9A274] 510'.8s [519.2] 76-26664
AMS Subject Classifications (1970): 28A35, 6 0 G X X , 60K35, 8 2 A 0 5
ISBN 3-540-07852-5 Springer-Verlag Berlin 9 Heidelberg 9 New York ISBN 0-38?-0?852-5 Springer-Verlag New Y o r k . Heidelberg 9 Berlin This work is subject to copyright. All rights are reserved, whether the whole or part of the material is concerned, specifically those of translation, reprinting, re-use of illustrations, broadcasting, reproduction by photocopying machine or similar means, and storage in data banks. Under w 54 of the German Copyright Law where copies are made for other than private use, a fee is payable to the publisher, the amount of the fee to be determined by agreement with the publisher. 9 by Springer-Verlag Berlin 9 Heidelberg 1976 Printed in Germany Printing and binding: Beltz Offsetdruck, Hemsbach/Bergstr.
Preface
in the last decade there has been a lot of mathematical iuterest in models from classical e~ailibrium statistical mechanics; these notes describe some of this work. ~hey are concerned, in particular, with the properties of equilibrium states defined in terms of conditional probabilities. ~lis way of defining equi-. libri1~a! states is due to Dobrushin, Lanford ~ d
Ruelie; the formulation given
here is due mainly to F611mer. The approach taken will be fairly abstract, and will be done using the language and basic techniques of probability theory. It ~Ti]_l thus be assumed that the reader has some fs~iliarity ~ith things like standard measure theory, conditional expectations, the martingale convergence theorem, ~id probability kernels. Some of the deeper results will be obtained using standard Bore] spaces, but no previous ~ o w l e d g e of such objects will be required. These notes were written between 1974 and the present; the first six sections were written in the academic year 1974-75, while the attthor was a Fellow of Brasenose College, Oxford. The rest was written ~ i l e
the aubho~~ was
Fellow of King's College, Cambridge. The material has been much influenced b~" conversations with Harls ~611mer over the last three years, and many thanks are due to him.
Chris Preston King's College, Csmbridge, February, 1976.
RANDOM FIELDS
Section
i.
Introduction
. . . . . . . . . . . . . . . . . . . .
Section
2.
Random
and specifications
Section
3.
Existence
of Gibbs
Section
4.
Invariant
specifications
Section
5.
Lattice
Section
6.
Continuous
Section
7.
Specific
Section
8.
Some thermodynamics
Section
9.
Attractive
fields
models
models:
states
..........
33
..............
46
point processes
........
gain . . . . . . . . . . . . . . .
................
specifications
ii
..............
. . . . . . . . . . . . . . . . . .
information
1
..............
59
87
111
137
160
References . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
192
Index
198
. . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
I.i
i. Introduction The intention of these notes is to set up a framework within which much of the equilibrium behaviour of a lot of models from classical statistical mechanics can be described. The main reason for doing this is to determine those features which are c o , o n
to almost all models from equilibrium stat-
istical mechanics. This will emphasise which kinds of behaviour are "general" as opposed to being consequences of a specific model.
(The type of behaviour
we consider will thus be qualitative rather than quantitative.) This approach will also help to unify subjects which are usually treated separately; in particular, the framework will cover both lattice and continuous systems. It will turn out that the framework looks a lot like the study of stochastic processes which are indexed by an arbitrary partially ordered set (rather than tile usual indexing by some subset of the real line). T~his suggests the possibility of adapting ideas and techniques from the theory of stochastic processes in order to give results about statistical mechanical models. In particular, we will use the language of probability theory, and will assume that the reader is familiar with the basic definitions and results of this theory (as found for example in Chapters I and II of Meyer (1966)). The approach that will be taken is based on the work of Dobrushin (1968a),(1968b),(1968c),(1969), These
Lanford and Ruelle (1969), and Ruelle (1970).
papers deal with the definition of equilibrium states for various
models and the properties of the equilibrium states; the reader is stongly encouraged to study them. The set-up which will be considered in these notes is a natural generalization of the Dobrushin-Lanford-Ruelle work, and is due to F~llmer (1975a),(1975b). A lot of the material which we will present is adapted from some earlier notes (Preston (1974a)) which dealt with a special case of Follmer's work.
1.2
2
Perhaps
the best way to start out is to describe a fairly simple
example and to use this to lead into the general lattice model, given as follows. a set of sites
(or locations);
d > I (where
Zd
are described
by a set
t ~ S
Y
, and that 0
and
the presence of a particle;
denoted by
+
and
in a positively cartesian
, with
+
(resp negatively)
product
X
=
~
S
S
is a
representing
will be
Zd
for some
space that have
there is an entity whose states
Y
suppose
is finite.
i , with
0
that all the For example,
representing
or alternatively (resp
set
in d-dimensional
Yt ; and for simplicity
have two points, denoted by i
and in most models
At each site
copies of the same space
and
There is a countable
denotes the set of points
integer coordinates).
set-up. The example
Y
are could
the absence
the two points could be
- ) representing
charged
Yt
when the entity is
state. The basic phase space is the
Yt ' which we will regard as a compact Hausd-
t g S orf space
(giving
ogy). Let
~
X
the product
topology and each
denote the Borel u-field on
Y
space, the evolution being governed by a Hamiltonian statistical
Hamiltonian which probability
mechanics
measures
on
for the model).
context
is due to Dobrushin,
above).
Equilibrium
right conditional
(X,~)
by formnlas our example,
as its phase
of some kind. O n e o f
represent without
the
from the
the equilibrium explicitly
const-
The way that this is done in the present
Lanford and Ruelle
(in the papers mentioned
states will be those probability
probabilities,
X
is to determine
states of the system (and to make this determination ructing the dynamics
topoi-
X .
Suppose that there is some system evolving with
first aims of equilibrium
the discrete
measures which have the
with these conditional
involving the Hamiltonian.
probabilities
given
We will explain what this means for
but to do thi~ we need some notation.
3
For each subset
A
of
S
let
1.3
X(A) =
Yt ' considered as a
~ tEA
compact Hausdorff space and let PA
be the projection from
thus a sub-o-field of observable)
F
from inside
X
_F (A) denote the Borel a-field on --o onto
X(A), and let
and it represents A . Note that if
denote the finite subsets of
S, and
is the set of events that are measurable
F(A) = pAI~Fo(A)). F(A) is
the events that are measurable A~
for
B C S
A ~ C
then
we put
from outside
A ~ C
probability measure
on
o~
and
y E X(S-A)
F A = F(S-A),
function of
Since for each
{o K}
F g F (A) ~o
y , a probability measure
probabilities
(i.I)
FA
A . Lanford and
the Hamiltonian will define a
as their (appropriate)
oY(F) i~ p
conditional
will be an F (S-A)-measurable =o .
will have the right conditional
if and only if
PA(P)
The equations
so
will
(X(A),F (A)), and the equilibrium states will be --o
those probability measures which have probabilities.
(or
F(A) C F(B). C
In terms of the above notation the hypothesis of Dobrushin, Ruelle is that for each
X(A). Let
=
I ~
dPs-A(P)(Y)
for all
A g ~ .
(i.i), although usually written in a different form, are some-
times referred to as the DLR-equations. The probability measure
a~
should be thought of as the equilibrium
state for a system evolving in the "finite vessel" A, governed by the same Hamiltonian,
but with the configuration
Since
is a finite set we can identify
X(A)
ect to counting measure), and usually where for
w g X(A)
gA(y,w)
outside
O~
O~
%
constrained
to be
y .
with its density (with resp-
will take the form
z-lexp gA(y,.)
is the energy of the configuration
w
in
X(A)
,
1.4
given that the configuration outside make
~
a probability measure,
A
thus
is Z =
y , (and where ~--
Z
is chosen to
exp gA(y,w) ). In the ease
w ~ X(A) when
Y
consists of the two points
ification of gA(F,G) =
X(A)
~
0
and
i , and there is an obvious ident-
with the set of subsets of
~(B)
(for
F~S-A
, GcA
A A , g
), where
will take the form
~ : ~.
> R
is the
BGFUG B~A # interaction potential corresponding to the Hamiltonian. At this point the reader is probably asking why the above definition is the right one. This is a very hard question to answer satisfactorily. The only real way of knowing that the definition of an equilibrium state is correct is to construct the appropriate dynamical system and check that the supposed equilibrium states are exactly the invariant states for this system. However, it is extremely hard to construct the dynamical systems corresponding t o m o s t models, and only recently has much been done on particular cases of this problem (see for example Lanford (1975) , Marchioro, Pellegrinotti and Presutti (1975)). The only justification we will make is that the definition is intuitively appealing, that most physicists would agree it is the correct one, and that the results which can be proved show that these equilibrium states have the properties which, based on physical experiments,
they should have.
For the model we have described above there are two ways of looking at the probability measures satisfying (i.i). Firstly, they can be regarded as those measures whose distributions
inside any finite set are in equilibrium
with respect to the distribution outside tile finite set. This is because PS_A(D)
is the distribution outside
A , and thus the right-hand side of (i.i)
represents the equilibrium state (inside side
A
is constrained to be
A ) of a system where the state out-
pS_A(D). Secondly, it is not too difficult to
5
check that the measures satisfying
1.5
(i.I) are e~actly those probability measures
which can be obtained as weak limits of sequences of the form
with
An
increasing to
kind of "thermodynamic
S
and
Yn g X(S-An)
Yn OA n
. This corresponds
, n ~ i ,
to taking some
limit": letting the vessel expand and at the same time
changing the boundary conditions.
This second way of looking at things will
perhaps help to convince any physicist
that the present approach does not really
differ from the "classical" way of doing things in equilibrium statistical mechanics. There is a better way (at least mathematically) model described above. For each
(1.2)
that is for each
X
and
F g ~
(1.4)
(where
~A(.,F)
if
probability measure
(1.5)
where
if
~A
~A
for all
is a probability kernel, and for each x g X
~A(X,.)
also satisfies:
F g ~ ;
and is
0
otherwise).
In terms of the
HA
a
will satisfy (i.i) if and only if
Ep( XF I ~A ) =
Ep( f I ~ )
by
~A(.,F) = XF ;
x c F p
)R
the
: Ps_A(X)XW s F }) ,
Then
It is clear that
then
HA : X x ~
is ~-measurable,
is ~A-measurable
F C ~A
XF(X) = i
X(S-A)xx(A)).
~A(',F)
is a probability measure.
(1.3)
define
PS_A(x) OA ({ w e X(A)
~A(x,F) =
(where we identify
A E ~
of formulating
HA("F)
p-a.e,
denotes the conditional
for all
A g ~ , F g ~ ,
expectation
(with respect to
p )
6 of
f
given the o-field
(1.6)
D(Ff]G)
1.6
; thus (1.5) can be rewritten as
=
.I zA(x,F) d~(x)
for all
A ~ C , G s FA , F g F .
G
The measures
{o~}
given by the Hamiltonian will satisfy a consistency
condition, which in terms of the
(1.7)
if
(where
~% zA
AC%
e ~
then
ZA
can be written as
~% ~A = ~%
'
is the probability kernel defined by
(z~ ~A)(x'F)
=
I ~A(Y'F) ~%(x,dy)
.
Note that, regardless of any physical interpretation, (1.7) is a natural assumption to make if the
7A
are to be interpreted as conditional probabilities,
since it just corresponds to the usual rule
E ( f I~
which holds
~-a.e. if
) = E ( E ( f J ~A ) I ~
AC%
because then
) ,
F=]C F=A 9
Now in the formulation of the lattice model just given the particular structure of the model plays very little part. All that we needed was a set equipped with a o-field C, ~C~A
a collection whenever
~ , an index set
{~A}AsC AC~
~
of sub-o-fields of
, and a collection
X
with a partial order denoted by ~
{~A}Ag~
which are decreasing in that of probability kernels sat-
7
1.7
isfying (1.3),(1.4) and (1.7). Given this, the problem is then to find out all about the probability measures on If
~ = {~A}AeC
satisfies
(X,~)
satisfying (1.5).
(1.3),(1.4) and (1.7) then we w i l l c a l l
a specification ; the corresponding equilibrium states (those probability measures satisfying (1.5)) will be denoted by
G(V)
and called the Gibbs states
with specification ~ . These notes are a study of specifications and their Gibbs states (although in fact we will relax (I~4) in most of what we do). This set-up includes ~ n y
models from continuous statistical mechanics, where usually
will be the bounded Borel subsets of R d (d-dimensional Euclidean space), and X
will be a subset of the integer-valued measures on
Rd
(representing config-
urations of particles in space). Although in the definition of a specification we only required
~
to
be any partially ordered index set, in practically all models from statistical mechanics
~
will be a collection of subsets of some underlying space. As in
the lattice model we will start with a space
S
(usually
on whether we have a continuous or lattice model), S o-field again
~
, and for each
(X,~)
so
~
~A
or
Zd
depending
will he equipped with a
we will have a sub-O-field
~(A) of
~
(where
is the basic phase space), representing the events that are meas-
urable from inside o-fields
A g ~
Rd
A . The o-fields {~(A)}A~ ~ will be increasing, and the
in the definition of a specification will be of the form
will be some suitable subset of
that are measurable from outside
~ , and
~A
~(S-A);
will represent the events
A .
As mentioned before, a useful way of thinking about specifications is in terms of generalized stochastic processes: the kernels
~ = {~A}Ae~
regarded as the transition kernels of a process indexed by case
~
would be some subset of
can be
~ . (In the usual
R , representing the time axis, and for
L ~
8
_~
1.8
would be the events determined by the process up to time
-t .) Some of the
basic problems which will arise with specifications will be concerned with the analogue of constructing the entrance boundary for a stochastic process. There are three parts to these notes. The first is Sections 2, 3 and 4 ; in which the basic properties of specifications are worked out. The second is Sections 5 and 6, which deals with exs~ples of specifications.
Section 5 deals
with lattice models, and studies the model we have already introduced (but with Yt
an arbitrary set). Section 6 is about continuous models, and in particular
about poin'~ processes. It is possible ! and maybe desirable) to read Sections 5 and 6 before the first part of the notes. The third part comprises Sections 7, 8 and 9, and treats some particufLar topics (which are mainly concerned with lattice models ). Let
G(~_~
be the set of Gibbs states for a specification
fundamental question is whether or not
~(~)
V . The most
is non-empty; and in Section 3 we
give sufficient conditions for Gibbs states to exist. Given that Gibbs states do exist for
~g , and noting that
we would i~ke to know whether
G(V)
is clearly convex set of probability measures,
G(Y_) has any extreme points, and whether
can be represented in terms of its extreme points.
G(Y0
(There are reasons for doing
this other than it being the reflex action of any mathematician who has had a course in functional analysis. The extreme points of
G(V)
should represent the
pure phases of the system, and these states should have particular properties which we would like to deduce from our framework. ) In Section 2 we characterize the extreme points of space then of them.
G(V)
G(V)
; and also show that if
(X,~
is a standard Borel
has extreme points, and a nice representation exists in terms
(The existence of extreme points and the representation are due to
9
1.9
FSllmer (1975a), and the proof is based on Dynkin's construction of a Martin boundary, which is given in Dynkin (1971a) and (1971b).) In Section 4 we consider the situation where there is a group of bijections acting on
X , and where the specification is in some sense invariant under the
group. It is then natural to study those Gibbs states which are also invari~ut.
(It
is not necessarily true that the Gibbs states are al~omatically invari~nt, and their failure to be so corresponds to the physical phenomenom of symmetry breakdown.) In most of the models the bijeetions on on the underlying space
S ; for example, if
X S
group usually taken is the group of translations.
are induced from a group acting is either
Zd
or
Rd
then the
The representation obtained
in
Section 2 will be used to give a representation of the invariant Gibbs states. The material in Sections 7, 8 and 9 deals mainly with lattice models, though Section 7 also includes results applicable to the models of Section 6. In Section 7 we give a variational characterization of invariant Gibbs states in terms of the specific information gain of one random field with respect to another. This follows an approach of F~llmer (1973) to proving more physical variational principles. We apply the results of Section 7 to give, in Section 8, a version of the Gibbs variational principle: the invariant Gibbs states are exactly those invariant states which minimise the specific free energy. This provides a generalization of a result of Lanford ~.d Ruelle (1969). Also in Section 8 we do some thermodynamics and, again following Lanford and Ruelle (1969), identify the invariant Gibbs states with the tangent functionals to the pressure (in a suitable B~uach space of interactions). In Section 9 we look at what could be considered generalizations of ferromagnets, or attraetivo interactions. We introduce this by putting a partial order
70
on
i. i0
X , and looking at specifications which ~ave certain properties with respect
to the order. We show that~ under suitable conditions, there exists more than one Gibbs state at exactly those places where there is a jump discontinuity in the density. Those readers who consider the present setting too abstract, and would like to consider alternate or preliminary material containing more specific examples, should look at the papers of Dobrushin, Lanford and Ruelle already referred to, and also at Spitzer (1971a), (1973), Lanford (1973), Georgii (1972), (1973), (1975), Sullivan (1973) and Preston (1974b).
These papers also many import-
ant topics which we hardly consider at all here, in particular the problem of determining when there is exactly one Gibbs state. Finally, it is worth noting that there are connections between the theory of specifications and topics in quantum field theory. The interested reader should look, for example, at Nelson (1973), Dobrushin and Minlos (1975), and Royer and Yom (1975).
Note: Professor Dynkin has just pointed out to me that the proof of Theorem 2.2 (which gives the representation in terms of extreme points) can be made much simpler using a general result from Dynkin (1971b), and the notion of a support system (see Dynkin and Juskevic: Controlled Markov processes and their applications, Moscow 1975, pages 306-307).
2.1
2. Random fields and specifications
This section works out the basic properties of specifications and their Gibbs states. We start with a set set
~
X , a o-field
F
of subsets of
which is partially ordered. The order on
(rather than o-field of
C
~
will be denoted by
used in the introduction). For each
% , denoted by
X , and an index
A E ~
~ ,
we have a sub-
~ A ' and we assume that the o-fields
{FA}AEC
are
decreasing, i.e.
(2.1)
X
F % C__ F ~
whenever
A _<
represents the basic phase space,
F
the set of all observables~ and the
~A
are subsets of observables, usually associated with regions of some underlying space. We will assume that the partial order is directed upwards (i.e. if AI, A 2 g ~
then there exists
A ~ ~
with
ably generated (i.e. there is a sequence then
A < A
for some
AI ~ A {An}n~ I
and from
A 2 ~ A ) and is count~
such that if
A g
n ).
n
Some notation is needed. If
~
is a sub-o-field of
denote the set of probability measures on r A : P~_F) i.e.
>
P(FA)
c P~_F) and
A g ~
for all
A ).
P~B)
then let
~ g P(~F) , F g ~ A " Elements of
P(~_F) will be
(sometimes the term stochastic fields is used instead). If then we can regard
ervable with respect to side
A E ~
then let
be the mapping got by restriction to the smaller o-field,
rA(~)(F) = ~(F)
called random fields;
(X,~) ; if
~
rA(~)
as the distribution of
~
obs-
A ,(which in most cases will mean observable from out-
2.2
12
As in the introduction we define a collection of probability kernels = {~A}AeC
to be a specification
(2.2)
~A(',F)
(2.3)
~A(.,F) = XF
(2.4)
~
If
~
~A
is
=
if
F__A-measurable for all
for all
7%
F g FA , A E ~ ;
whenever
A ~ % .
is a specification then we define
cation V
to be those random fields
(2.5)
A g ~ , F g ~ ;
E ( XF i ~A ) = ~A (',F)
~(~
, the Gibbs states with specif-
~ g P(~_F) for which
~-a.e.
for all
Let us consider for a moment the kernels this will give us some insight into conditions the kernels define
~ = {~A}AgC
tA : P~FA)
tile kerneI
~A
tA(V )
>P(~_F)
by letting
on the measure
~ s P(F_A)
= ~A(X,F)
~
satisfy:
tA(V) = ~ A
A g ~
we can
' where the action of
is given by
dr(x)
for all
F
which gives uS a probability measure
when we only know what is happening on the sub-
FA " With this interpretation
{tA}Ag ~
(2.3) and (2.4). Suppose that
J
as a predicting mechanism,
on the whole of
~-field
the
tA
as acting on measures;
just satisfy (2.2); then for each
(~A) (F) Think of
~A
J g ~ , F g % .
it would be natural to assume that
2.3
13
(2.6)
=
r A tA
(2.7)
if
A < %
(2.6) just says that
identity: P~F A)
--~P~FA)
then
tA rA t%
tA(~)
is an extension of
(2.7) says that the extension from
=
~A
for all
A g ~ ;
t% .
~
to the larger o-field;
is consistent with that from
~
9
The connection between this and the definition of a specification is given by the following proposition.
Proposition 2. !
Let
satisfying (2.2); let only if ~
be a collection of probability kernels
~ = {~A}Ag~ tA(~) = ~ A
satisfies (2.3), and
" Then {tA}Ag ~
{tA}AE ~
satisfies (2.6) if and
satisfies (2.7) if and only if
satisfies (2.4). Proof
(2.6) says that
tA(~)(F) = v(F)
for all
~ g P(F=A) , F e ~ A
; and
tA(~)(F) = I ~A(x,F) dv(x) . Thus for (2.6) we need
I for all F e~A
F g~A
~A(X,F) d~(x)
=
I XF(x) dv(x)
' ~ e P(_FA) ; while for (2.3) we need
~A(',F) = X F
; these conditions are clearly the same. Similarly, if
e P(F_F A) , F g ~
then
(tA rA t% (V))(F)
=
I~A(Y'F)
d(rA t%(~))(y)
A~
for all and if
2.4
14
=
I~A(Y,F) I~%(x,dy)d~(x)
=
II ~A(Y,F) ~%(x,dy) dr(x) ,
and thus for (2.7) we need
I I ~A(Y'F)~%(x,dy)d~(x)
for all
F g ~ , v s P~%)
Still thinking of
=
I~%(x'F) dr(x)
, which is the same as (2.4).
tA
82
as a predicting mechanism, the natural random
fields to look at are those which predict themselves, i.e. those which g P_(~
tA rA(P) = ~
for all
D s P~J
for
A s ~ . Writing this out in full we get that
is self-predicting if
(2.8)
D(F) =
I~A(x'F) d~(x)
for all
A g~ , F g~ .
Since by the definition of conditional expectation (2.5) is the same as
(2.9)
~(G~F)
=
I ~A (x'F) dp(x)
for all
G g~A
" A ~
, F ~
;
G
it is clear that any Gibbs state is self-predicting (just by putting
G = X
in (2.9)). The converse of this is true, thus one of the first results we shall prove is that Let
~ g ~_~)
~ = {~A}Ag~
if and only if it satisfies (2.8). be a specification and let
E~)
denote the set of
probability measures satisfying (2.8). We will hence show that
E(~
= G(V) ,
but assuming this fact for the moment we can obtain some simple but useful
15
properties of follows that
G(V) . ~or
A ~ ~
tA rA(~) = ~
~A
G~)
~A = tA(P-~-A)) ; by (2.6) it easily
if and only if
Also from (2.7) we have that of showing that
let
T%~
~A
~ g~A
whenever
' and thus
E(~) =
~
~A "
A ~ % . Therefore the problem
is non-empty amounts in most cases to showing that the
are compact in some suitable topology.
introduction, where
2.5
X
(In the example considered in the
was compact, we need only show that the
~A
are closed
in the weak topology, and this becomes a continuity requirement on the Hamiltonian.) We will more or less use this approach in Section 3. This section will be mainly taken up with the study of the extreme points of
G~
. We assume that we have a specification for which Gibbs states exist
(leaving the existence problem for the next section). After showing that E~VJ = G ~ ) , fields in
G(V)
field, and 0
or
we will characterize the extreme points of that have trivial tail-field.
~ g P(~)
( F=~ =
G(~ ~
has trivial tail-field if for all
as those random ~A
F g F=~
is the tail-
~(F)
is either
I . Sometimes the tail-field is called the set of observables at infinity,
for example in Lanford and Ruelle (1969).) This characterization of the extreme points of
~_~)
can he found in various degrees of generality in Dobrushin
(1968a), Lanford and Ruelle (1969), Georgii (1973) and Preston (1974a),(1974b). Out of this result comes the fact that the elements of the tail-field, in the sense that if all
F ~ =ooF then
~i' D2 E G(V)
G(V)
with
are determined on
NI(F) = ~2(F)
for
~I = ~2 " It also follows that distinct extreme points are
mutually singular. The main result that will be proved about
G(V)
is that if
(X,F)
is
a standard Borel space (which means that there exists a complete separable metric space
Y
such that
~
is ~-isomorphic to the Borel subsets of
Y ; see
16
2.6
for example Parthasarathy (1967), page 133) then each element of
G~)
~_V)
has extreme points and
has a unique representation in terms of the extreme
points. (This is of course all under the implicit assumption that
G~)
is non-
empty.) This result comes from Follmer (1975a), and the method of proof is based on Dynkin's construction of an entrance boundary for a stochastic process (see Dynkin (1971)). Before we start proving results it will be convenient to slightly generalize the definition of a specification. This will be mainly because of the applications to continuous models. A well-known problem which arises in elementary probability is what to do about the definition of the conditional probability of A
given
B
when the probability of
B
is zero. An analogue of this occurs
when trying to construct specifications corresponding to some models: for certain
x g X
measure
and
~A(x,-)
A c ~
there is no natural way of defining the probability
because
x
ably be overcome by defining
is an "impossible" event. This problem could prob~A(x,.)
in a fairly arbitrary way (but making
sure that (2.2),(2.3) and (2.4) still hold: but this might be difficult); or in some models it can be overcome by replacing
X
with a suitable subset of
We will do something like the latter, but will keep the original set the "impossible"
x
we let
~A(x,.)
X .
X . For
be the zero measure, and then only look at
Gibbs states which give zero probability to the set of "impossible" events. The above will be formalized in the following way. Suppose that for each A g ~
we have
~A : X x ~
RA g ~ A >R .
R = {RA}Ag C
if:
(2.10)
~A(x,.)
(representing the "possible" event in
We will call
~ = {~A}AsC
~ A ) and
a specification with respect to
is a probability measure for each
x g RA , ~ C C ;
17
(2.11)
~A(x,F) =
(2.12)
~A(',F)
(2.13)
~A(',F) = X F N R A
(2.14)
~% ~A
Again we define
0
for all
2.7
x s RA , A g ~ , F g ~
is ~A-measurable for all
=
G(~
~
if
;
F g ~ , A e ~ ;
F g _FA , A g C ;
whenever
A ~ % .
, the Gibbs states with specification ~ , to be those
g P(F__F) for which
(2.15)
E ( XF [ F A ) = ~A(',F)
~-a.e.
for all
A g C , F g F .
(2.15) is the same as (2.5) and thus also the same as (2.9). Putting in (2.9) shows that if
~ a _G(V)
then
~(R A) = i
for all
G = F = X
A g C ; thus the
Gibbs states automatically give zero probability to the "impossible" events. Note that we get our original definition of a specification back if we take RA = X
for all
A ~ C .
Again define provided
tA
by
~ g P~_FA,RA) , where
tA(~ ) = v~ A ; if P~_FA,RA) =
~ ~ P~_FA) then
{ ~ g P(FA)
: ~(R A) = I } . (2.7)
still holds, and so does (2.6) provided P~_FA) is replaced by T A = tA(P~FA,RA)) _E~) = ~
; again we have
T A , where as before
~ = CA rA(~) E~)
tA(~ ) g P(~_F)
P~FA,R A) . Let
if and only if
D g T A , thus
denotes the random fields satisfying
AgC (2.8"). Also, once more we have
_T2~~
_TA
quence of this is that to show
~ g E_~)
whenever
A < 2~ ; an important conse-
it is sufficient to show that
~ g TA n
18
for all
n ~ I , where
{An}n> 1
Some notation: if
~
2.8
is any sequence that generates the order on
is a sub-o-field of
the set of non-negative finite measures on extend
tA
tA(~) = VZA if
f
to be an affine mapping from for all
9 ~ M+(~'A) . If
%
~
then let
C .
M+~_B) denote
(X,B) ; it will be convenient to 4
M IF_A)
to
M+(~__F) by letting
is a non-negative finite measure and
is non-negative and integrable with respect to
X
then
f%
will be the
measure defined by
? (fX)(F) =
~
f d%
F
Now let us fix a specification
~ = {~A}Ae~
with respect to
~ = {RA}Ag ~ ,
and where it is necessary to make sense we will always assume that G(V) is nonempty, tA
Lemma 2.1
will denote the mapping defined above in terms of
If
A g C , ~) g M+~F_A )
tA(XF~) =
Proof
From (2.6) we have
XFtA(v)
and
F g FA
zA "
then
9
rA tA(Xp~) = XF O RA9
and thus
tA(XF~) (X-F) = (XFV) (X-F) = 0 .
Therefore
XX_FtA(XF~) = 0 ; similarly
XFtA(Xx_F~) = 0 . Now
tA(~) = tA( XFV + XX_F V ) = tA(XF9) + tA(Xx_F~)
,
and thus
XFtA(~) = XFtA(XF~) + XFtA(Xx_Fv) = XFtA(XF~)
19
2.9
= ( i - XX_F )tA(XFV ) = tA(XFV ) - XX_FtA(XF V) = tA(XFV ) .
Proposition 2.2
G~)
= E~)
, i.e., ~ s G(V]
if and only if
satisfies
(2.8).
Proof
It is clear that
G E~A
and
Lemma 2.1
E(yJ ~
F g ~ . Since
G(V)
~ g EQVJ
, thus take
we have
XG~ = XGt A rA(~) ~ tA(XGrA(~))
~(Gf~F) = (XG~)(F) = tA(XGrA(~))(F)
and writing this in terms of
~(G~F)
= I~A(X'F)
I ~A(X,F) d~(x)
~A
and let
A e ~ ,
tA rA(~} = D , and thus by . In particular
,
gives
d(XGrA(~)l(x)
,
~ e E~)
= IXG(X)~A(X'F)
which is what we want.
drA(~)(x)
82
G
We will now look at the extreme points of s _G~)
which cannot be written as
DI # ~2 " Recall that the tail-field
D = ~(~i+~2 ) F
G~) with
was defined by
, i.e. those elements ~I' ~2 ~ G ~ ) F
=
~
and
F A , and
AeC we are going to show that if
~
has trivial tail-field.
Lemnm 2.2
If
with respect to
Proof
~ g G~)
is an extreme point of
then
if and only
The proof of this will proceed via three lemmas.
A g ~ , ~ Z M+~_FA)_ and v
G(V)
f
is non-negative and integrable
tA(f~ ) = ftA(~ ) .
For simple functions it follows from Lemma 2.1 by linearity;
for den-
20
eral
f
just use the usual approximation by simple functions and the monotone
convergence theorem.
Lemma 2.3
If
respect to
Proof
2. i0
~
82
A E ~ , p c P~_~) and then
We have
tA rA(gp) =
g
is non-negative and integrable with
Ep( g I ~
) tA rA(P) 9
rA(gp) = Ep( g I ~A ) rA(P) , as this is really just the def-
inition of the conditional expectation. Thus from Lemma 2.2 we have
tA rA(gP) =
Lemma 2.4
Let
respect to
only if
Let
~
and let
I g d~ = I
g = Ep( g I F
Lemma 2.3 Thus
p E G~)
~ , with
an F -measurable
Proof
tA(Ep( g I ~A )rA(P)) =
)
with
A g C ; then
g
Ep( g ] ~A ) tA rA(P) "
be non-negative and integrable with
(thus
gp g P_~J). Then
p-a.e, then
g = g
if and only if
g = E ( g I FA )
g = E ( g I --F A )
tA rA(gp) = g p . But by
~-a.e. because
gp c G(_V) then
cA rA(D) = ~
p-a.e. If
g = Ep( g I -FA )
g = E ( g I F=~ )
take an increasing sequence
i
n
)
gD g _G~) .
p-a.e, for all C
~ E G~.
A g C .
is countably generated,
p-a.e. (For example, one way to see this is to
{An}n>_l
martingale convergence theorem to
n >=
since
g = EB( g I F
F=oo~F A ; hence
This, together with the assumption that the order on implies that
if and
D-a.e.).
g~ g T A
if and only if
Conversely, if
gp E G ~
p-a.e. (or, equivalently, if and only if there exists
tA rA(gp) = Ep( g I F A ) CA rA(P) , and
gB g _TA
82
generating the order on
C , and apply the
{EB( g I FAn )}n>l ' using the fact that
21
2.11
The characterization of the extreme points of
G(V)
is one of the sev-
eral facts that are simple consequences of Lermna 2.4.
Theorem 2.1 only if
~
(I) If
~ g G~)
G(V)
~i' ~12 c G ~ )
~I(F) =
~i
~
is an extreme point of
G~)
with
and
~2
are determined on the tail-field, in that if ~2(F)
for all
F e ~=~
are distinct extreme points of
then
G~)
~i = ~2 "
then
~I
and
are mutually singular, even considered as measures on the tail-field; thus exists (4) If
F g F==~ with
H(V) ~
H~_) = { ~
if and
has trivia], tail-field.
(2) The elements of
(3) If
then
M+~_F_)
~2
there
~I(F) = i , ~2(F) = 0 .
is the cone generated by
: ~ ~ O , ~ ~ G(_V) }) then
H(V)
G(V)
in
M+(~F
(that is
is a lattice in its own order.
(For a definition of this see for example Choquet (1969), Volume II, page ]60. This result tells us that if G~
G(_V)
is compact in some suitable topology then
is a Choquet simplex; and we could thus use all the representation theory
for such simplexes. However, we will not take this approach.)
Proof with to
(I)
If
D
is not an extreme point then we can write
~i' ~2 g ~ ) ~
and thus we have
with respect to F= -measurable. > 0
and
~i # ~2 ' g
with
g
is non-negative and integrable
D
~
is
{ x ~ X : g(x) > ~ }
is not trivial on the tail-field. Conversely,
does not have trivial tail-field, so there exists
O < ~(F) < 1 . Put
g
is not ~-a.e. a constant; thus for some
O < ~({ x E X : g(x) > ~ }) < i ~ But
is in the tail-field, so suppose that
~I = g~ ' where
)
is absolutely continuous with respect
~ ; and therefore by Lemma 2.4 we can assume that Since
we have
~I # ~2 " ~I
~ = 89
~ = ~(F)
and let
~i =
F g F
~-I(xF~)' ~2 = (I-~)-I(Xx-F ~);
22
then by Lemma 2.4 we have =
~i
(2)
Let
then
~i
2.12
~I' ~2 g ~(~) and of course
+ (i-~)~2 ' and from this it is clear that ~I' ~2 g ~ ) and
P2
with
~I(F) = ~2(F)
~i # D2 " Now
~
is not extreme.
for all
F ~ F=~~ . Let
are absolutely continuous with respect to
~i = gl ~ ' ~2 = g2 ~ ' and by Lemma 2.4 we can assume that F=m-measurable. then
If
FI, F 2 g F=o~
Therefore
Thus
gl = g2 Let
(4)
Let
0
and
- g21 d~
=
~t-a.e., i.e.
i . Thus
Vl' ~2 c H ~ )
and
g2
are
I ( gl - g2 ) d ~
I
FI
F2
=
( gl - g2) d ~
0 .
~I = ~2 "
; but by (I)
~I
and
, let
D2
~ =
~I(F)
G(V)
. By (2) there exists
and
~2(F)
can only have the
are mutually singular on
~i + ~2 ' and let
~I = gl ~ ' ~2 = g2 ~ " By rescaling the measures ures, it follows from L ~ m ~
gl
~I(FI) = ~2(FI), ~I(F2) = ~2(F2).
be distinct extreme points of
~I(F) # ~2(F)
~ , so we can write
, F 2 = { x g X : gl(x) < g2(x)}
~I(FI) - ~2(FI) - ~I(F2) + ~2(F2)
~i' ~2
F E F=~o with values
and thus by hypothesis
llgl
=
(3)
F I = { x g X : gl(x) > g2(x)}
~ = 89
gl' g2
F=oo .
be such that
to make them probability meas-
2.4 that we can assume that
gl
and
g2
are
+
F -measurable.
Put
~
= max{gl,g2}~
~ ~
; then by Lemma 2.4
= min{gl,g2}~
+
we h a v e
+
~) , ~ - a HQV)
~- = ~I A ~2
and i t
is easily
checked that
~
= ~1 v ~2
and t h a t
82
Because of (I) in the above theorem it is worthwhile noting a fairly
,
23
2.13
standard result about random fields with trivial tail-field. We will say that a random field
~
has short range correlations if given any
then there exists
A s ~
such that for all
F g ~
and
6 > 0
G ~A
I~(FNG) - ~(F)~(G)I <
Proposition 2.3 (This can be found for example in Lanford and Ruelle (1969).) Suppose that of
~
F
is countably generated (i.e. there exists a countable subset
such that
~ = o(_A) , the smallest o-field containing
~ ). Then a
random field has trivial tail-field if and only if it has short range correlations.
Proof
If
~
has short range correlations then clearly
field, because if
F ~ F=~~
l~(F) - ~(F)~(F) I < 6 either
0
or
then
F e~A
for all
A e ~
and thus
i . (This part of the proof does not require
then there exist
has trivial tail-
6 > 0 , and this can only happen if
generated.) Conversely, suppose that
Fn g ~ A
for all
~
F ~ ~ , {An}n> I
~
~
~(F)
is
to be countably
does not have short range correlations;
increasing and generating the order on
~ ,
, such that n
lira { H(FflFn) " ~(F)H(Fn) lr+oo
Since ball in
F
is countably generated L (X,~,~)
LI(x,~,~)
0 .
is separable, and thus the unit
is sequentially compact in the weak-*-topology.
there exists a subsequence
lim J-~
} #
{nj}j> I
~ ( F n ~ G) = J
~ G
It is easily seen that we can take
and
g dD
g
g ~ L~(X,F,~)
for all
to be
Therefore
such that
G g ~ .
F -measurable and since we have
2.14
24
I
g dp-
p(F)\ g dD
F
g
=
cannot be p-a.e,
tail-field.
equal to a constant.
G~)
Therefore
p
does not have trivial
that there might be some kind of representation
in terms of its extreme points,
though in this generality we have no
way of showing that any extreme points exist. However, is a standard Borel space (i.e. there exists a complete Y
0 ,
82
Theorem 2.1 suggests of
#
lira { p ( F ~ F ) - I~(F)~(Fn ) } j_><~ n. 3 3
~
such that
Parthasarathy
F
is isomorphic
as a o-field
for
~(_VO
example that is likely to occur
exists.
(X,~)
(X,F)
separable metric
space
to the Borel subsets of
(1967), page 133) then we can prove that
and that a representation
if we assume that
G(V)
Y ; see
has extreme points
It is worth noting that in any
will certainly be standard Borel. The
next theorem gives us the analogue of an entrance boundary.
Theorem 2.2 (The hypothesis If
(X,~)
+ R
for each
F ~ ~
(2.17)
for each
x e X
of
;
for any
z(-,F)
p s ~(V)
(2.20)
and
if
~ e P_(~
}
then
F e ~
is needed in this theorem.) kernel
then
;
~(x,-)
is an extreme point
we have
) = z(-,F)
if we denote tile measure
A(x) = { y g X : ~Y = x
is F=~o-measurable
the probability measure
E ( XF I ~
(2.19)
is non-empty
with the properties:
(2.16)
(2.18)
~(~)
is a standard Borel space then there exists a probability
: X x F
~(Y9
that
~(x,.)
p-a.e.
by
x
;
and for
x g X
11x(A(x)) = i ;
D e G(V)
if and only if
Dn = p .
let
25
2.15
Before proving this theorem we will show that it enables us to construct a representation of alence relation on
G~
in terms of its extreme points. Let
X
defined by
onding quotient space and
p
B
mapping from
G(V)
Proposition 2.4
Suppose that
~
(X,~)
~
G~V)
{z} g L
Fo = p-l~) also if
A(x)~ on
F
and if
(X,L)
and let
o
p
X
then let
F
==oo
(X,_B~ then let
p(~)(F) = ~(p-l(F))
, we can consider
p
.
as a
(X,F_~o) .
and the probability measures on
F~
p
(X,L).
z g X .)
F
and for any
==co
F e F=~ ~
then
~(-,F) = Xp
then
A(x)~F
= ~ ). Let
; define a probability measure
By (2.8) we have
that sends an element
defined by
for all
; then
x ~ F
E P~_)
be the corresp-
is as constructed in Theorem 2.2. Then
==co
F=~~
X
X
be the equiv-
correspond under this mapping to the point masses
(X,_F_~) . (Note that
Let
onto
to probability measures on
The extreme points of
Proof
X
= ~y ; let
is a probability measure on
are determined on
is an affine bijection between
on
~
be the probability measure on G~)
x
is a o-field of subsets of
= { F C X : p-l(F) g ~ } ; if
Since elements of
if
the map from
to its equivalence class. If
P(U)
x ~ y
~
~
on
F e =F
(since if V
~(-,F) is x g F
then
be a probability measure
(X,F=~ ~
by
~(p-l(G)) = ~(G)
be defined by
~~
=
o
g G~)
I ~(x,F) d~(x) .
and it is easily checked that
is onto. Now by (2.20) we have
D(F) =
~(x,F) J
d~(x)
p(o)
= v ; hence
2.16
26
F s F , p g _G ~ )
for any
, which shows that
thus it immediately follows that
p
p
is determined on
F O ==co ; and
is one-one. The rest of the proof is clear. 82
We now start the proof of Theorem 2.2; the proof will be via a number of lenmms, the first of which gives us a crucial fact (and does not depend on
(X,~)
being standard Borel).
Lemma 2.5
Let
all
we have
A g C
g
be bounded and F-measurable, and let
rag -~ E ( g I F A )
where
Proof
N-a.e. ,
~rAg is the function defined by
We need to show that if
I ~Ag dP F (since it is clear that
~Ag
=
g dP
=
I F
=
=
,
P g ~(~)
I ~ A g d(XFP) =
g d(P~A)
then
is ~A-measurable ). Note that by Lemma 2.1 we have
(XFP)~A = XF(P~A), and that since
I ~ A g d~ F
(TAg)(x) = ~g(y) ~A(x,dy) 9
F g ~A
~ F
p g G(V) . Then for
I F
The next results require
P~A = p " Therefore
I g d(XFP)~A
= I g dXF(P~ A)
g dP
(X,~)
to be a standard Borel space, so let
us suppose for the rest of the section that this is true. We will use the foil-
27
2.17
owing fact about such spaces (see, for example, Parthasarathy if
~
(1967), page 145):
has uncountably many atoms then there exists a countable field
such that
~ = o~A)
and such that if
~ :A
~ R + is bounded then
~C ~
is a
measure (i.e. countably additive) if and only if it is finitely additive. Note of course that if
~ : A
> R * is a measure with
~(X) = 1
/
Caratheodory extension theorem and the monotone class theorem quely to an element of Now if particular
F
~
then by the ~
extends uni-
P~_F) .
has only countably many atoms then so does
F
and in
is conntably generated. In this case Theorem 2.2 is easy to
prove, and we leave it as an exercise for the reader to do so. (This case, in fact, never occurs in any of the models we will consider.) Thus in the following we suppose that
~
has uncountably many atoms.
Fix a countable field also f i x an i n c r e a s i n g = N
FA
9 For
A
having the properties described above, and
sequence F e A
{An}n.>l
I
define
lim ZA (x,F) n ->~176
otherwise.
O
I = { x g X : lim z^ (x,F) n -+~176
UN ^ n>l m>n
Lemma 2.6
Proof
I e F
if the limit exists,
n
=
~o(X,F)
R~ =
~ ; thus
n
A~C
Let
g e n e r a t i n g t h e o r d e r on
exists for all
F ~ A }~Roo , where
n
m
and
It is clear that
~(I) = i
for any
D g G(V)
.
I~ e ~=~ , and by (~.9) we have
~(Roo) = I for any
28
e G(V)
. For
F g A
I (F) e F=~, and thus that
V(I
(F)) = 1
let
I (F) = { x g X :
I g F for any
since
theorem we have
~(I (F)) = i .
82
2.7
Proof
If
x e I
then
By the definition
additive
on
~
; but if
~o(X,FIUF
=
2)
=
of
~
In fact
x
>R
I F=~o )
To(X,.)
Tx
denote
=
because
on R=o~
we will I
and thus
To(X,.)
on
also denote To(X,X)
2.8
is some arbitrarily
zI
.
is finitely
82
if
x g I ,
if
x ~ I ;
chosen element
has the properties:
ol
~
; since
x
this extension
= i . Now define
by
\,(F)
Lemma
; thus
then
~l(X'F) =
~
A
Zo(X'Fl) + ~o(X'F2 )
the measure
zX(F)
(where
~-a.e.
lira zA ( x ' F I U F 2 ) rr+oo n
to a measure
e P(~)
iT1 : X x F
2.5 and the martin-
on the field
FIeF 2 = ~
}; then
n
then let
extension
= E ( XF
we need only show that with
exists
We now need only show
. But by Lemmm
is a measure
lim { WA (X'Fl) + nA (x'F2) }
x g I
unique
, F g A
To(X,. )
lira ~A (x,F) n +oo n
is countable.
lim ~A (~ rr+OO n
FI, F 2 ~ ~
n
If
A
~ e G(V)
gale convergence
Lemma
2.18
P~_F) ).
has a by
x
If
29
2.19
(2.21)
for each
x g X
Zl(X,-)
is a probability measure;
(2.22)
for each
F g F
nl (-,F)
is _F -measurable;
(2.23)
for any
~ e _G(V)
and
F E F
~I(-,F) = E ( XF I ~
Proof
(2.21) holds by definition;
FgA
and hence hold for all
Let
I~
=
Lemma 2.9
Proof
)
~-a.e.
(2.22) and (2.23) are certainly true if
F e ~
by the monotone class theorem.
82
{ x g X : ~l(X,.) e ~(~) } .
I~ g F
and
~(I ~
= i
for all
~ e G~V) .
From (2.8) and the monotone class theorem we have that
x g I~
if and
only if
(2.24)
~l(X'F) = I ~An(Y'F) ~l(X,dy)
for all
n~
i , F g ~ .
Since both sides of (2.24) are F -measurable functions of only countably many equations, we get (2.23) shows that if
I ~l(X,F) d~(x)
G E F=~
=
G
I G
and thus (2.24) holds for
and
I
I ~ g F==~ . Now a simple calculation using
~ e GQVJ
then
~A (y,F) ~l(X,dy) n
~-a.e.
x , and there are
x e X ; hence
dD(x)
,
~(I ~
= i .
So far the proof has been fairly straightforward,
82
and has followed an
2.20
30
obvious path. Now comes a trick, due to Dynkin, which enables us to complete the proof. Let It = { x s I~ : ~l(y, -) = ~l(X,')
Lenmm 2.10
Proof
1% g F
==co
It =
~ FgA
, and
for
~(1%) = i
x e I~
then
x g I~
c
G~)
.
=
x g It(F)
we have
for
Zl(X,.)-a.e. y s X } .
if and only if
I { ~l(X,F) -~i(Y,F )
but since
for all
It(F) , where
It(F) = { x e I ~ : ~l(Y,F) = ~l(X,F)
Let
~l(X,')-a.e. y s X } .
}2 ~l(X,dy)
I ~I(Y'F) ~l(X'dy)
0 ;
=
nl(X'F)
and thus
I{ ~I(X,F ) _ ~[l(Y,F) }2 ~[l(X,dy)
=
I { ~I(X'F)2 - 2~I(X'F)~I(Y'F) + ~I(Y'F)2 } ~l(X'dy)
I
~I(Y,F) 2 ~l(X,dy)
- ~l(X,F) 2 .
Ther'efore
It(F) = { x g I ~ :I~I(Y'F)2 ~l(X'dy) = ~l(X'F) 2 } ; and in particular
It(F) g ~
. Now if
~ g_G(V)
then
31
2.21
I I~I(Y,F)2 ~l(X,dy) d;J(x) - I1[l(X,F)2 dM(x)
But
=
I~I(Y,F)21~I(X,dy)
=
I~I(Y'F)2 d~(y)
d~(x)
- I~[l(X,F)2 d~(x)
- I~l(X'F)2 d~(y)
I ~I(Y'F)2 ~l(X,dy) -
~I(X'F)2 ~ 0
=
0 .
for all
x E I~ (and thus for
~-a.e.
x c X ) since we have written this above as the integral of something squared. This shows that
~(It(F)) = I .
Lenmm 2.11
x e I*
Proof
If
Let
x g It
Therefore
~l(X,F)
define
~
E ( XF
is either
~
: X • F
I
is an extreme point of ~ ( ~
for
Hi(x,') . If
F=~ ) =
~I(.,F) = ~l(X,F)
field. Thus by Theorem 2.1
Now
~l(X,.)
and write
~I(-,F) =
but we also have
then
82
0
or
~l(X,.)
~R
XF
~-a.e.
.
F c F=~ then we have
~-a.e.;
and thus
I , and hence
~l(X,F) = XF
~l(X,-)
~-a.e.
has trivial tail-
is an extreme point of G ( ~
.
82
by
~l(X,F)
if
x E It ,
7rl(Y,F)
if
x s It ;
~(x,F) =
where
y
is some arbitrarily chosen element of
It . It is a simple matter to
32
check that
2.22
?r satisfies all the requirements of Theorem 2.2; and thus the proof
of this theorem is now complete.
3.1
3. Existence of Gibbs states
In this section we will find sufficient conditions on a specification ensure that
G~)
p g G(V)
with
~n g ~Q~) such that
~ = o~)
to
is non-empty. The techniques we will use also give suffic-
ient conditions to ensure that that given
~
G~V)
is sequentially compact (in the sense
' n ~ 1 , then there exists a subsequence ~(F) =
lim ~n (F) j~ J
for all
F g ~ , where
{nj}j> 1 ~
and
is a field
which we will define below).
It is worth pointing out that there exist specifications defined in a natural and reasonable way for which there are no Gibbs states. We will briefly describe an example of this. Consider the lattice model described in the introduction, but now suppose that
Y
is countably infinite. Let
and we will consider the basic phase space
X
S
be the integers,
to be the phase space (i.e. the +
~ample space) of a Markov chain with state space
Y . Let
Q : y x y
>R
be
a stochastic matrix; then there is an obvious way to try and define a specification using the conditional probabilities defined by
Q . Suppose that
all its entries strictly positive and that all its iterates
Qn
Q
n > 1
has are
finite. Then there are no problems in defining these conditional probabilities, and so we get a specification defined it, hence and
Q
RA = X
~
(and this is a specification as we originally
for all
A g ~ ). However, if
Y
is the transition matrix for a random walk (i.e.
function of
y-z ) then Spitzer (1974b) has shown that
is the integers,
Q(y,z)
is only a
~(~_V) is empty. This
example is considered in more detail in Section 5, along with a generalization of this result due to Kes~en (1975). As at the end of the last section we will work with standard Borel spaces, but now add some more structure to equipped with a partial order
~
(X,F) . Let
N
be an index set
which is directed upwards and eountably
34
3.2
generated. We will suppose that we have a family
{$}eCN
of sub-d-fields
of
satisfying:
(3.1)
if
(3.2)
~
~ ~ ~
then
~
C
~
;
is the smallest o-field containing
(3.3)
(X,_~)
(3.4)
if
U
~e
is a standard Borel space for each
e I =< Co.
...
and
An
is an atom of
e g ~ ;
_~
for
n > i , and if
n
AI ~
A2 ---~ ..., then
~ n
An
# ~ .
& 1
We are thus assuming that
(X,~)
is the projective limit of the spaces
{(X,B6)}6E ~ ; these assumptions will make
If we are in the situation where from outside
(X,~)
F = ~(S-A) =A
A , then we will usually have
measurable from inside
a standard Borel space.
N = ~
, the events measurable
and
~A = ~(A)
, the events
A 9 In such cases we will almost always have (3.1),(3.2),
(3.3) and (3.4) holding.
(For example, it is certainly true for the lattice
model considered in the introduction.) The important technical fact that follows from our hypotheses following
(3.5)
(see Parthasarathy
if
DO g P B~)
and
is the
(1967), Theorem 4.2, page 143):
{DO}OE N
are consistent
all
e < D , F r _~ ) then there exists
all
e c N , F E ~
D g P~')
(i.e.
be(Y) = D~(F)
such that
for
D(F) = D0(F)
. (Note that by (3.2) and the monotone class theorem
for D
unique. )
Let
A =
U
~
; thus
A
is a field generating
~ ; if we are in
is
35
the case when
~ = ~
and
~ A = %(A)
then
3.3
A
is called the field of local
observables.
Theorem 3.1 Suppose
(3.6)
Let
~ = {~A}AgC
be a specification
with respect
to
= {RA}A~s
that
for each
A E C
there exists A
g A n > i , with = ' =
n
AI ~
A2~
"'"
=
n>
1
snd that for some
x g
(3.7)
given
e ~ N
6 > 0
such that if
~
and
RA
we have
y > 0
then there exists
F g ~
with
, A E ~
, and
~(F) < ~
then
, and
e P_(Be) , A S ~
~(x,F)
< y
for all
~ ~ A .
Suppose also that
(3.8)
given
F ~ ~
~-measurable all
Then
funstion
f : X
y > 0 >R
then there exists
such that
O ~ N
I ~A(Y,F) - f(Y)
and a
I < Y
for
y g RA .
G(_V_)
is non-empty.
Note that for our original definition holds trivially,
and (3.8) just says that for
in the uniform closure of that are B~-measurable slightly stronger
of a specification
B(~)
for some
than (3.7) is
A g ~
and
(when F g ~
, the set of bounded functions e c N
RA = X ) (3.6)
from
~A(-,F) X
to
. Note also that a condition which is
is R
9
36
(3.9)
given
@ g N
then there exists
~ g P~B@)
{~%(x,.)}~>__A , considered as measures on continuous with respect to that if
F e ~8
with
~
then
and
(X,~),
(i.e. given
m(F) < 6
3.4
A g ~
such that
are uniformly absolutely
y > 0
there exists
~%(x,F) < y
for all
6 > 0
such
% ~ A ).
(3.9) should remind the reader of the hypotheses of the Dunford-Pettis theorem (see, for example, Meyer (1966), Page 39), and thus (3.7) and (3.9) can be regarded as compactness criteria.
The basic idea for the proof of Theorem 3.1 is simply to choose a suitable ~ountable subse~
~
of %
with
Z=
o~),
to ehoose
which (3.7) holds, and to find an increasing sequence
lim ~A (x,F) n+~ n ion to
~
exists for all
of an element
The choice of have either (i) ~
~
V
x ~
{An}n> 1
[~
RA
such that
F E D ; then to prove this limit is the restrict-
of
P(~) ; and finally to show that
depends on how big the
B@
many atoms. By (3.1), if (ii) holds for some
~ ~ G~)
are. For any
has a countable number of atoms, or (ii) _~ 8 s N
@ e N
e ,
Then, as in Section 2, for each
=
8 g ~
{An}n> I
there exists a countable field
e s N 9 ~0
and such that finite additivity implies countable addit-
ivity for bounded set functions on and
O g ~ ,
@ g N .
We first give the proof for the case when (ii) holds for all
~O = 0 % )
we
then it holds for all
we can, without loss of generality, ~ssume that either (i) holds for all
such that
9
has uncountably
> 8 ; and since we will only be interested in what happens for "large"
or that (ii) holds for all
for
~
. Choose increasing sequences
which generate the orders on
~
and
~
{@n}n>l
respectively. Let
37
D =
~
_~
n>
i
, and thus
~ = o(D)
3.5
. Now fix
x g
n
~
R%
satisfying
(3.7);
AEC
by replacing
{An}n> 1
if necessary with a subsequence,
lim ~A (x,F) n ->~~ n
exists for all
clearly finitely additive on
F e _D 9 Let
Ao
we can assume that
~(F) = lim ~A (x,F) rc->~176 n
; then
and hence countable additive.
~
Thus
is
D , res-
n tricted to
A@
, has a unique extension to a probability measure on
(X,Bo) m
m which we denote by the
Dm
n__> I
~m " (Note that we have no a priori reason to suppose that
are consistent, let
,
although
in fact it follows from the next lemma.) For
~n = ~A (x,-). n
Lemma 3.1
For any
m=> i
and
F g Be
we have
pro(F) = lim ~n(F)
.
m Proof
Let
y > 0
and choose
A E C
, ~0 g P ( ~
)
and
~ > 0
as given by (3.7)
m (with
e = O m ). Now by standard measure
theory we can find
such that
A g ~ m
both
Pm(AAF) < y
such that
A
and
00(AAF) < ~ (where
> A . Then for n ~
I ~ m (F) - Vn (F)
we have
I
I Pm (A) - ~n (A) + ~m (F-A) - Vn(F-A)
<
I Pm (A) - Vn(A)
.
- Pm (A-F) + ~n (A-F)
I + ~m (FAA) + Vn(FAA)
But by definition we have
Pm(F) = lim ~n(F) rr+oo
Choose
~ o
o
=
n > n
AAF = ( A - F ) U ( F - A ) ) .
82
<
I
I ~m (A) - ~n (A)
lim Vn(A ) = p(A) = Dm(A)
, and hence
I + 2y .
no
3.6
38
Lemma 3.2
for all
There exists a unique
F g A . (Thus
Proof
B
P ~ P(F_F) such that
is an extension of the
By Lemma 3.1 the measures
{Pm}m>l
= Bm
for all
m~
defined above.)
are consistent. Clearly we can ex-
tend this sequence to get a consistent family Be
P
B(F) = lim ~A (x,F) n
{p0}6g~
with
p o g P(B_~)
i . Therefore by (3.5) there exists a (unique)
and
B e P F(~_
m
such that
B(F) = Pm(F)
for all
, m~
F c ~
i . Hence from Le~mm 3.1 we get
m
B(F)
= lira ~ A
(x,F)
.
82
n-+~176 n
Lemma 3.3
B(RA) = i
for all
A ~ C 9
Proof
Using the notation of Section 2
~) g T A
then
Now
v(RA) = 1 . Thus if
~)n = ~A (x,.) e_T A II
that if
B(RA)
=
n => no
i
TT[ ~
~ g T%
then
and thus, given any
_TA
whenever
~)(RA) = 1 A e =C
A < % ; also if
for all
there exists
A < 7[ . no
such
n
then
~)n(RA) = 1 . Therefore by (3.6) and Lemma 3.2 we have
.
Lemma 3.4
B g G~V_) .
Proof
B(~)
Let
denote the set of bounded F--measurable functions from
X
R , considered as a Banach space with the uniform norm; let B~A) = { f g B ~ )
Then
: f
is ~ - m e a s u r a b l e
W
=
{ f e B~__F):
I f dB =
W
is a closed subspace of
for some
lim I f rr+oo
e ~ N } ; and let
d~)n } .
B~F) , and since Lemma 3.2 just says that
to
39 XF e W
for all
F ~ A , we certainly have
y > 0 ; then by (3.8) there exists for all
nl > no
such that
I I
and hence
A
f ~ B~A)
W . Let
such that
:A(Y'P) dD(y)
- I
: ~A(y,F) d~(y) =
n~
> A
~(R A) = i ; also
i ~ f dD - I f d~n i < y
"rrA(Y'F) d~n (y)
A E C , F E A
and
I ~A(Y, F) - f(Y) I < "f ~)n(RA) = i
for all
f E W , so there exists
for all
n>
nl 9 Thus for
n>
nI
I < 3y ;
lim I 7rA(Y'F) d~)n(Y) , rr+oo
(i.e. nA(',F) E W ).
then
I ~A(Y,F) dgn(Y) =
: nA (y'F) ~An(X,dy) =
ZAn(X,F)
~ ~A(y,F) d~(y) = D(F) . We have shown this equality
Therefore by Lemmm 3.1
for all
B(~ C
y g R A . Now from the proof of Lenmm 3.3 we have
n__>_ n o , and by Lemma 3.3 we have
But i f
3.7
F g ~ ; by the monotone class theorem it must hold for all
hence by Proposition 2.2
~ ~ G(V) .
F g ~ , and
82
This completes the proof in the uncountable atom case; so now suppose that for each
@ E ~
_~
has countably many atoms. Let now
set of all finite unions of atoms of before and such that
additive on
_~e
B@ ; let
D(F) = lim ~ A (x,F) n +~o n
denote the
{0n}n~ I , {An}n~ I , ~
exists for all
and thus extends to a measure on m
A0
~0
F E ~ . ~
be as is finitely
which we denote by m
~m "
40
~m(X) < I ; the thing that could go wrong is that we might
It is clear that have
~m(X) < I . However,
Lemma 3.5
Proof
~m(X) = i
y > 0
Let
3.8
the next lemma shows that this does'nt happen.
for all
m~
I .
A s ~ , ~ g P(B=9 )
and let
and
6 > 0
he as in (3.7)
m
(with
@ = 8m ). We can find
F g ~O
such that
~(F) => I - 6
and tbus
m
W%(x,F) ~ I - y y > O
for all
% ~ A . Hence
is arbitrary we get
~m(X) = i .
~m(X) ~ ~m(F) ~ i - y ; and since 82
The rest of the proof in this case is the same as before; we leave the reader to check the details. Note that the proof of Theorem 3.1 shows that if (3.6) and (3.8) hold then for any
x e
~
RA
satisfying
(3.7) we can find
~ ~ G~V)
and an
AsC increasing sequence
{An}n> 1
such that
D(F) = lim H A (x,F) ~
for all
F e ~
We should point out that in a lot of the examples we will consider Section 6 (continuous models) hold (for suitable we can construct
(3.6) and (3.8) will not hold; however,
x e X ) and thus Lemmas 3.1 and 3.2 still work. ~ ~ P~F~
with
~(F) = lim w A (x,F) n->oo n
before, but we have to use different methods
.
n
for all
to show that
(3.7) will
This means
F g ~
D e ~_~J
in
just as
. We will
look at this at the end of the section. If we assume a bit more than the hypotheses show that
G(~
is sequentially
compact
of Theorem 3.1 then we can
in the sense mentioned
before.
41
Theorem 3.2
Let
~ = {ITA}Ag~
3.9
be a specification with respect to
~ = {RA}Ag ~ 9
Suppose that (3.6) and (3.8) hold and that
(3.10)
6 > 0
given
such that if
Then given e G~)
Proof
O e N
and
y > O
F e ~
x)n s G ~ ) such that
then there exists
with
~ G~
then
D(F) = lim Vn.(F) j~o j
for all
F g A =
for all
{n.}j> I
, A e C
nA(y,F ) < y
, n > i , there exists a subsequence
Once we have got a subsequence
~(F) = lim 9 (F) j-~o nj
for all
{nj}j~ 1
and y g X .
and
F g A .
and
such that
~ e P(FO
then the same proofs as before will give us
; thus we need only look at the analogue of Lemma 3.1. Consider the
uncountable atom case; let sequence
{nj}j> 1
{Sn}n> I
such that
~A(y,F) < y
for all
and
~
~(F) = lim v
=
for all
00(F) < 6
0J e P ~ )
j-~o
y e X
then
nj
~(F) < y
be as before and choose a sub(F)
exists for all
for all
F e ~ . Now if
v e ~ A ; and since
~n e ~ %
A ~ ~ , n ~ i , the proof of Lemma 3.1 goes through just as before.
Similarly, in the countable atom case Len~na 3.5 has the same proof.
If (3.10) holds then (3.7) holds for any theses of Theorem 3.2 we have
G(V~
82
x g X . Thus under the hypo-
is non-empty, provided
~
RA # ~ 9
A condition which is a bit stronger than (3.10) is
(3.11)
given
{~A(Y")}y
O ~ N
then there exists
m s P%)
g X ' considered as measures on ( X , ~ )
continuous with respect to
m .
and
A g ~
such that
, are uniformly absolutely
42
3. i0
We now consider a more general situation which will be applicable
to
the continuous models to be dealt with in Section 6. As we have already mentioned, in most of these models
(3.6) and (3.8) will fail to hold,
and so we need some-
thing else to put in their place. Let
_~_
denote the set of countable intersections
(thus (3.6) is the requirement
that
RA E ~
be an increasing sequence of elements
Theorem 3.3
Let
~ = {WA}AEC
of
for each
~
of elements
A e ~ ). Let
of ~=
~
,
{Un}n>=l
.
be a specification.
Suppose that for some
x e X
(3.7) holds and also
(3.12) all
given
A e ~
~ > 0
then there exists
m ~ i
such that
WA(X,U)
~ i - 8
for
.
Suppose also that
(3.13)
given any
such that
A e ~ , F e ~ , ~ > 0
IwA(Y,F) - f(Y) l < 8
and
for all
m~
y e U
i
then there exists
, (where
B(A)
f e B(~)
is the set of
m
bounded f ~ c t i o n s
Then
~(~)
Proof ~(F)
that are 4 - m e a s u r a b l e
for some
8 e ~ ).
is non-empty.
As before we use (3.7) to find = lira ~A (x,F) n~
for alL]
A
increasing and
n
~ ~ P(F__) such that
F s A_ 9 (Note that (3.12) implies that
x c
n
~
RA.)
AeC
We will show that
~ ~ _G(~) . Put
~n = ~A (x,.)
. Let
A e C
, F ~ A , and
6 > 0 ;
n
choose for all
m > 1
such that
n => i , and since
~A(X,%) Um e ~
>= 1 - 6
for all
A e C . 'lhus ~n(Um) >__ 1 -
this implies that
~(U m) => 1 - ~ . There
43
exists
f a B(A )
assume that
n >n O
I
such that
[~A(Y, F) - f(Y) l < 6
0 <= f ~ 1 . Let
then
3. ii
n
o
Ip(F) - Pn(F) I < 6
wA(Y,F) dp(y) - ~(F) I
he such that
and
for all
A < A ~ n
I I f d~ - I f
y E U m , and we c~n
and also such that if o
d~n I < 6 . If
6 + ~(x-~m) + I If d~
n n>= o
then
.(F)I
-
<
26 + I If d~- p(F)I ~ 36 +' ~f dPn- ]J(F)I
<
4~ + Pn(X-%) + I lwA(Y,F ) dPn(y) -p(F)J
<
56 + I IwA(Y,F)~ A (x,dy) - p ( F ) i n
56 + I~A (x,F) -p(F) l ~ 66 . n
Thus
p(F) = IwA(Y'F)
dp(y)
for all
A e ~ , F e A , and hence
p s ~(~)
.
In most of the models in Section 6 we will have a condition stronger than (3.12) holding, and also something stonger than (3.7)- These conditions are:
(3.14)
given any
F ~
then
Um
6 > 0
then there exists
lira inf WA(X,F) ~> 1= Ae C
m >= i
for all
such that if
x e U
, where
F e A U
given
8 s N
that if
F e ~
with
and
y > 0
~(F) < 6
then there exists then
~
%
;
m~l
=
(3.15)
=
with
~ e P(~)
lira sup WA(X,F) < y A~C
and
for all
6 > 0 x e U=
such
44
(3.13),(3.14)
and (3.15) imply a compactness
~(U ) = i . %~is is because such measures,
Lemma 3.6 m~
i
propel~y for those
U s ~(~)
(3.14) and (3.15) give us uniform estimates
with
for all
as seen by the next two lemmas.
Suppose
such that
~
satisfies
D(U m) ~ I
Proo_____f Let
U e G(VJ
If
then we have
F DUm
3.12
- 6
with
(3.14). for all
Then, given any ~ s G(V)
w(U ) = i . Given
6 > 0
lim inf WA(.,F) => i - ~
6 > 0 , there exists
with
~(U ) = i .
let
m > i
~-a.e.
be as in (3.14).
, and since
Ass ~(F) =
IWA(X'F)
d~(x)
it follows
~(U ) > 1 - 6 , because m
U
~
Suppose
~ e P(~)
for all
V and
~ e G(V)
satisfies ~ > 0
with
Note that if
Theorem 3.4
~
(3.15).
~n(U ) = 1
Then, given
such that if
F s ~
% s N with
and
~(F) < ~
is the measure given by Theorem 3.3 then it has
for each ~(U ) = i
y > 0 , there then
~(F)
82
Suppose that (3.13),(3.14)
e G(_V) with
Proof
82
w(U ) = i .
Proo___f Same as Lemma 3.6.
with
e A..
~(F) ~ i - ~ . Thus
m
Lemma 3.7 exists
from Fatou's lemma that
and (3.15) hold for
V . Let
n . Then there exists a subsequenee such that
~(F) = lim ~n.(F) J+~ 3
~(U ) = i .
for all
~n e G(V)
{nj}j>_ 1
and
F s A .
Using Lemma 3.7 and the proof of Theorem 3.1 we can find a subsequence
< y
45
{n.} > j j=l
and
I~ E P(_F) -
such that
Lemma 3.6 and the proof of Theorem
3.13
~(F) = lim ~n (F) j~ j 3.3 show that
for all
~ E G(V)
F e A
. Then
; and it is clear that
4.1
4. Invariant
specifications
Let us look again for a moment st the lattice model described Suppose S
that
S , the set of sites,
is a group, and each
t e S
{Tt(x)} s = xt~ s , where at site
s E S
xs
acting on
t
: X
X . Let
Po~F)
integers. ~X
Then
by
x (i.e. the value
x ). We thus get a group
under these transformations.
will also be invariant;
T
the s th. coordinate of
in the configuration
that are invariant
Z d , the d-dimensional
induces a transformation
denotes
F_-measurable transformations
specification
is now
in the introduction.
denote
{Tt}t e S
of
the random fields
In most physical models the
for the lattice model this will mean that
T t (Y) ~
t
= ~
for all
A e ~
and where we consider
Tt
, y e X(S-A)
, t e S ; where
also as a mapping from
X(A)
A+t = { s+t : s e A }, to
X(A+t)
denote those Gibbs states with respect to this specification One might think that
~
being invariant would imply that
that any Gibbs state is automatically and the existence iant corresponds
of non-invariant
invariant.
. Let
that are in
G_o(_V) = G ~ )
However,
_Go~V ) Po~_F) .
, i.e.
this is not the case,
Gibbs states when the specification
to the physical phenomenom of symmetry breakdown.
is invar-
(Examples of
this are given in Section 5.) In the case considered happen that invariant
is countably
( Y
finite)
G(V)
is closed in the weak topology.)
infinite then this need not happen.
Consider
ainly gives a specification which is invariant. the number of invariant Q
But Kesten
However,
if
the example given
at the beginning of Section 3 in terms of the stochastic matrix
holds if and only if
it will always
Gibbs states will exist whenever any Gibbs states exist.
(This is true, anyway, when Y
in the introduction
Q . This cert-
(1975) has shown that
Gibbs states is either one or none, and that the former is positive recurrent.
On the other hand Spitzer
has given examples for which Gibbs states exist with
Q
null-recurrent.
(1974b)
47
4.2
Inmost models in classical statistical mechanics there will be a group of bijections of the phase space (usually induced by the group of translations acting on the underlying space). The specifications that arise are also, in a natural way, invariant under the bijections. Thus, as in the lattice models just considered, we have the problem of analysing the invariant Gibbs states. We will do this in this section. Let
H
be a group of ~-measurable bijeetions from
define what it means for a specification H-invariant then we will look at
~
X
to
to be H-invariant.
~o~V) = G(V)~ Po~_F) , where
the set of H-invariant random fields.
X . We will If
~
is
Po~_F) denotes
( II will be fixed for this section, so
we will not bring it into the notation.) We will characterize the extreme points of
G_~O
: under reasonable hypotheses they will be exactly the H-ergodic
elements of G~O
G o (~
. In the case when
(X,~)
is a standard Borel space and
# ~ , Po~F) # ~ , then we will show that either
every element of
G~)
~ ~ G~
and
is non-empty, or
is mutually singular to every element of
this is uniform in the sense that there exists all
Go~)
~(F) = 0
for all
be non-empty whenever it can be. If
F E ~
such that
Po~_F~ (and ~(F) = I
~ g Po_(~ ). This shows that ~o(V_) # ~
it in terms of its extreme points, provided
for
G (V_O will
then we get a representation of
Po_~J
has a representation in terms
of the H-ergodic random fields. At the end of the section we give sufficient conditions to ensure that ~o~(~
is non-empty. Some notation: let
4 -1
are ~-measurable;
if
~ : X---~X ~
be a bijection such that both
is a sub-G-field of
~_B) = { B g % : ~'I(B) g B } ; for
~ ~ PCB)
let
~
~
and
then let ~
g p(~(B))
be given by
48
(~v)(F) = ~(~J-I(F)). p f : X
>R
Let
B = ~(B)
~ : X • ~---gR
F e ~ , and ~
~(x,')
: X x F
>R
~ ~ {~A}Ae~
~OV = p . A function
f = f-~ ; similarly
B ~ ~
will be
.
be a kernel (i.e.
is a measure for each
I[(-,F)
is ~-measurable for all
x s X ); we can define another kernel
by
~(x,F)
If
will be called ~-invariant if
will be called ~-invariant if
called ~-invariant if
4.3
= ~(~-l(x),~-l(F)).
is a specification with respect to
~ = {RA}Ag ~
then
~A
also has the properties:
(4.1)
~A(',F)
(4.2)
~A(-,F)
We will say that e~
such that
is ~(~A)-measurable for all
= XF~(RA)
if
F e ~(FA)
A e
~(~.~) C
~
~A
and
(~)~A
measurable from outside
A , then
~(A) e C
implies that
V
F A = F(s-A)
"
, the events that are
S . In this case we will have
~(A) = { ~(t) : t e A } . The bijection
for all
~-invariant is that
=
then there exists
~J will usually be induced from a bijection
: S------~S of tile underlying space ~(F%) = --~(A) ' where
;
.
~ is ~-invarisnt if given any
If we are in the situation where
that
F e~
~
A e C , and thus the natural definition of
~I[A = I:~(A) for all
will be such V
being
A e C . This condition certainly
is ~-invariant as we have defined it above.
It is important to note that if
V
is ~-invariant then we must have
49
Lemma 4.1
If
such that
Proof
that
TA
% g ~
with ~
is ~-invariant then, given any
~(T%) C
Let
p E~
~
~ g ~ , there exists
be such that
~i = t%(~) , and
~(F%) ~
~A
~ E P(F~,R%)
and
(~%)wA = ~%
; suppose that
. Then a simple calculation shows
= tA(1) , where
Proposition 4.1
If
~(~)(~(x),F)
~
Now let
H
d~(x)
is ~-invariant and
Immediate from Lemma 4.1.
p ~ G(V)
then
r
be a group of F-measurable bijections from
denote the H-invariant elements of
Po~)
~(~)n
g ~_V)
.
X
X ; we
82
will call an object H-invariant if it is ~-invariant for all
G(%) =
to
~ E H . Let
P~F) , and for a specification
~
G(V)
and
~V)
~ e Po~_F) then we will say that
~-a.e. for all
D(F)
~
, for
is convex, we will first look at the extreme points of
. To do this we need some definitions: if
g P F(F) o --
let
is non-empty.
Since G o ~~
Po~)
.
Suppose from now on that we have an II-invariant specification which
% g
9
I(F) =
Proof
4,4
f
f : X
~R
is F--measurable
is p-almost H-invariant if
~ c H ; we also apply this term to
F g ~
if it holds for
is called H-ergodic if, given any D-almost H-invariant
is either
0
or
i .
f-~ = f
F g ~ , then
It is well-known (and easily checked) that the
H-ergodic random fields are exactly the extr~ne points of
Po~_F) .
XF .
50
If
H
is countable then we can leave out the
definition of is either
0
4.5
D
being H-ergodic;
or
i
thus in this case
for all H-invariant
"D-almost" D
in the
is H-ergodic if
~(F)
F g F . (We leave the reader to check
this.)
Proposition 4.2
~ g ~o(V)
is an extreme point of
given any D-almost H-invariant
F g F=F=F=F=~, then
particular any H-ergodic element of
Proof
g : X
Let
g~ g P (F) O
>R +
~V)
D(F)
is either
g
or
I . (in
is extreme.)
be bounded, F--measurable and with
if and only if
0
~ g dD = I . Then
is D-almost H-invariant, and thus by Lemma 2.4
--
gD g ~ _ V )
if and only if there exists
D-almost H-invariant and with
g = g
g : X
>R +
When way: let
~
H
with
g
~[
is countable the above result can be stated in a more elegant
denote the o- field of all H-invariant elements of
is an extreme point of is also true if
F==o-mea~urable,
D-a.e. The result thus follows in exactly
the same way as the proof of Theorem 2.1 (i).
H
G_o~_V) if and only if,
H
G ~)
if and only if
D
is trivial on
is separable in the sense that
H
~ ; then
D
F=~on I . (This
has a countable subgroup
such that H-invariance is the same as H -invariance.) O
O
In many cases we can improve on Proposition 4.2 because, if enough, then
F C ~
F C F== ( i n that
F
~
is nice
being D-almost H-invariant implies that we almost have is in tile completion of
F=~~ with respect to
speaking, this would say that, up to null sets, ~ C separates
H
if there exists a sequence of G-fields
D ). Loosely
F=~ . We will say that ~i ~
~2 C-
...
~
H
4.6
51
such that
A C C
~
there exists
Lemma 4.2
~ U n> i
is the smallest o-field containing
~ e H
Suppose that
with
H
~n
separates
) C_ F A .
~ , P e Po~_F) and
F--measurable and u-almost H-invariant. Then there exists and
~
n ~ i ,
, and for any
f :X ~
)R
with
is bounded,
~ = f
D-a.e.
F -measurable.
==co
Proof (The idea of this proof is due to F~llmer and can be found in Georgii (1973).) We want to show that LI(x,~,D) norm. Given
I] ED( f I FA ) -
But since
f
f = E ( f I F=~o )
y > 0 , there exists
ED( f i F=oo ) II < y
is D-almost H-invariant and
D-a.e. Let
A e ~
and
n ~ i
IIED( f I ~
p g P (F) O
g H
and
II
II
denote the
such that
) - f II < y 9
it follows that for any
--
we have
f - ED( f I B
and thus for any
II f -
E (f
IF
II ED( f I ~ % )
)
=
f - E ( f I ~(_B--n) )
~ ~ H
) II < II f -
ED( f I _Bn ) II +
) - E~/( f I F A ) II +
< 2y + II ED(f I ~(B))-ED(~
Now choose
~
D-a.e.,
such that
~-Bn) ~
IIEp(i [~CB~)-Ep(f i-FA)
ED( f I FA ) - EA( f I ~
) If
F A) II
~A ; then by Jensen's inequality we have
II ~ IlED(f ]~_B~n))-f ]I
4.7
52
and thus II ~ ( f
II Ep( f I ~ ) - f I[ [F)-f
Theorem 4.1
If
< 3y . Since
II = o , i e
H
separates
y > 0
f = E ( f i~
~
then
p C_Go~_V~
)
is arbitrary we get p-a.e.
82
is an extreme point if and
only if it is H-ergodic.
Proof
This is immediate
from Proposition
If we are in the case when
4.2 and Lemma 4.2
~ A = ~(S-A)
, and each
from a bijection
@ : S---~S
, then we will usually
An E ~
increasing
to
with
A , % a~
An
, there exists
~ g H
S .
Then
~
such that
is the finite subsets of
A~(%)
S , and
In the language of functional separates
=C
then
=oG (V)=
G ~V_) are compact lar models,
that
~ : X x F
)R
We want to show that
follows by assuming H
(X,~)
= ~(x,F)
that
is separable~
H
if, given any
= ~ . For example,
this will
Theorem 4.1 says that if
This result,
P o ~)_
H
and
for some more particu-
(1969), and Georgii
(1973).
is a standard Borel space, and in Theorem 2.2. Let
for all
F g ~
I(H) g F=~ , and to do this we need
by only countably many conditions.
~
Po(F)_ , at least when
is the kernel constructed
~(x,F)
~ B = =F(An) , where
is the group of translations.
can be found in Lanford and Ruelle
I(H) = { x g X :
that
H
in some suitable topology.
Let us now again suppose that
is induced
at the beginning of the section,
analysis,
is a face of
take
~ ~ H
I] will separate
clearly hold for the lattice mode] considered where
82
, ~ g H } .
I(H)
to be determined
For simplicity we will achieve this in what
is countable,
as we defined
though it would be enough to ass~Ime
it above.
(For example,
in most models
in
53
4.8
continuous statistical mechanics the group will be taking the countable subgroup to be
R d , and this is separable,
Qd , the points in
Rd
having all ratio~al
coordinates.)
Theorem 4.2
Suppose that
H
is countable. Then
(4.3)
I(H) ~ F=~~ , is H-invariant, and
(4.4)
G~V)
(4.5)
if
= ~
if and only if
~ g Po~F)
with
~ c G ~_(~ ; if
(Note that if
V
~
v(l(H)) =
~(I(H)) = I
D(F) = ~ ( x , F )
then
p(l(H)) = I
0
and
for all
for all
p
p g G~)
V g Po(~)
;
;
is defined by
d~(x) ,
is H-ergodic then
~
is H-ergodic then by (4.3)
is an extreme point of v(I(H))
is either
0
G~_V) or
.
i .)
We will break the proof of this theorem up into a number of lemmas. Note that we are not assuming
H
separates
~ , so the extreme points of
G l~)
need not be H-ergodic.
Lemma 4.3
For
~ g H
define
I(~) = { x g X :
Then
I(~) E_F_~ and
Proof by
~
Let
~
~(x,F) = ~ ( x , F )
p(l(~)) = i
for any
be a countable field with
in the definition of
for all
D g ~)
~ = o(A)
F g F } .
9
; then we can replace
1(9) , and thus we need only show that for any
54
F ~ ~
{ x g X : ~(x,F) = ~J~(x,F) } g _ F
for any
~ e G~V) . But
~(',F)
and
~(x,F)
dr(x)
, and that
~(',F) = ~ ( ' , ~ )
~-a.e.
~b~(.,F) are F -measurable functions, and
so the set on which they are equal is in
I
4.9
F
==co
= IXG(X)~(~-I(x)'~-I(F))
. Now let
~ e G(V) , G ~ F=oo ; then
dr(x)
G
I{X G
=
~(',~-I(F))}o~-I
~XG.~(.,~-I(F))
d(~-lV)
dv
I
=
~(., _I(F) ) d(~-Iv)
~-I(G)
= (~-Iv)(@-I(G)~ ~-I(F)) ,
by (2.18); since by Proposition 4.1
~-i
g G(V)
, and also
-I
(G) e F
But (~-Iv)(~-I(G)~-I(F))
=
= (~-Iv)(~-I(G~F)) = v ( G n F )
] ~(x,F) d~(x) ,
(again by (2.18)).
G
Thus
,~ ~ ( x , F ) d r ( x )
=
G
~ ~T(x,F)d]J(x)
for a l l
and hence
~(',F)
Lemma 4.4
I(H)
Proof
x g I(H) , ~ ~ H ; we want to show that
Let
G c F=oo,
G = ~(',F)
~-a.e.
82
is H-invariant.
~(x) g I(H) , i.e. we need
55
~(~(x),F)
= ~(~(x),F)
~w(~(x),F)
4.10
for all
~ e H , F g F . But
= ~(~-l~(x),~'l(F))
= (~-l~)li(x,~-l(F))
= w(x,~-l(F))
(since
= ~-l~(x,~-l(F)) =
~(~(x),F)
.
x g l(H) )
(again since
x g I(H) )
82
Together, Lemmas 4.3 and 4.4 give us (4.3), since
I(H) =
N
I (~')
9
~EH
Lemma 4.5 p = ~
Let
H
be countable,
let
9 r Po(~)
V(I(H)) = I , and let
, i.e.
P(F) = I ~(x,F) d r ( x )
Then
with
9
~ g G=o~V_) .
Proof
It is immediate from (2.8) and the fact that
x g X
that
p g G~)
. If
~ g H
then
(~p)(F) = p(~-l(F)) = IZ(x,~-I(F))
= I{~7~(',F)}'~ dV
=
~ I(H)
=
I~'(x,F)
~(x,F) d~)(x) =
z(x,-) e G(V)
I l(n)
~
dv(x) =
d(~v)(x)
=
1[(x,F) dv(x)
(~(x),F) d~(x)
I~(x'F)
=
p(F)
d~(x)
.
for all
56
Thus
D e P
CF) O
have
is an easy consequence
with
and thus
82
--
(4.4) g Po~_F)
9
4. Ii
~(l(H))
~o~V)
> 0
then
# ~ . Conversely,
~ g Po(~_~)
and
~(I(H))
of Lemma 4.5, because ~
if
E ~o(~_V) , where Go(~)
if there
exists
~ = {~(I(H))}-IxI(H)V
# ~ , then for any
~ g G~V)
we
= i . We have now only the second part of
(4.5)
left to prove.
Lemma 4.6
If
H
is countable,
if
V g P (F) O
and
~ = ~g
Proof
Let
F e F
,
then
~
G g F==~
with
F
is
an
extreme
be ~-almost
H-invariant
point
of
H-invariant;
and
is H-ergodic
V(Z(H))
with
= 1
--
G ~V)
since
~(G) = ~(F)
.
H
is countable
. (We can take
there
F =
~
exists
~(G)
, for
~gH example.)
If
~ g H
~(~(x),F)
Thus,
as
~(x,-) field,
x g I(H)
= ~(~(X),~(F))
V(I(H))
H-ergodicity
and
= I , we have
of
~
implies
is an extreme point and
F g F
; so
~(x,F)
since
= ~-I~(x,F)
7[(.,F)
that of
then,
: ~(x,F)
is ~-almost
~(" ,F) G~
~(F) = F , we have
is v-a.e,
for all
.
H-invariant;
hence
the
equal to a constant.
x g X , thus has trivial
can only take the values
0
or
But tail-
i . Therefore
==r
either or
~(-,F)
= I
~-a.e.
i . Thus by Proposition
The proof g P~_F_) then
~
or 4.2
~(',F) ~
= 0
v-a.e.,
and hence
is an extreme
point of
of Theorem 4.2 is now complete. = ~
that we can consider
if and only if ~
~ g ~C~)
as an affine mapping
Recall
~(F) G=~V)
from
.
(2.20)
. Thus T h e o r e m
from
is either
0
82
that if
4.2 tells us
{ v g po~_~) : ~(I(H))
= 1 }
57
to
G ~)
4.12
; this mapping fixes the points of
sends extreme points
G ~)
, and is thus onto, and it
to extreme points.
In all reasonable
situations
there will exist a representation
of
P (F) O
in terms of the H-ergodic measures. like this: if
E
denotes
the H-ergodic
an affine bijection
T : P(E,E)
subsets of
P(E,~)
E , and
denotes the point mass at exists;
let
E
The representation
~P (F)
denotes ~
elements of where
will usually be something
Po~_F) then there will be E
is a suitable o-field
the probability measures
for any
a g E , then
= { ~ g E : v(I(H)) = 1 }
on
E
(E,~)
and suppose that
E
of
. If
T(6 ) = ~ . Suppose
O
that
T
g E . Let O
= { E ~ D : D g E }
==O
--
and consider
the mapping
O
=
~.T
: P(~o,E)
It is not hard to show that extreme points.
E
is affine, onto, and sends extreme points
to
From this it is a simple matter to obtain a representation
in terms of (Eo,~o)
~C=o ~)- "
P(Eo,E=o)
, where
under the equivalence
(Eo,~)
relation
We will now give sufficient
is the quotient
, where
conditions
~ ~ 6
if
to ensure that
of
space got from E(~ ) = E(6~)
_Go~_V)
.
is non-
empty. Let us return to the situation of Section 3; thus we have an increasing family of o-fields H
{Be}ee ~
be a group of bijections;
(4.6)
if
8 s N
and
~ g H
satisfying
(3.1),(3.2),(3.3)
and (3.4). Again let
we will suppose that
then thei-e exists
~ g N
such that
~(Be) ~
B~
.
4.13
58
Theorem 4.3
Let
H
be a countable abelian group; let
= {WA}AeC
be an
=
H-invariant specification with respect to (3.10), and with
Proof
Let
n
~ =
~
RA # @ 9 Then
~ = {RA}Ag C
~_V)
satisfying (3.6),(3.8),
# ~ .
~O ; then by (4.6) we have
~ A ) C_ ~
for all
@ E H .
0 c
Since each and
H
and
~o
on all
--~O is countably generated, the order on
is countable, we can find a countable field )C
(X,~)
=oA
for all
~ e H . Let
topologised so that
F g A
is countably generated,
=oA C. =A
such that
=F = ~ )
M(X,P)= denote the finite signed measures
~n----~p
. This topology makes
~
M(X,F)
if and only if
~n(F)
>p(F)
for
a linear topological space (it is
==O
Hausdorff since the elements of M(X,F) are determined on 3.2 tell us that
G~)
is compact.
If
). Theorems 3.1 and
is a non-empty, sequentially compact subset of
but it is not hard to check that G~)
A
~ E H
then
M(X,~)
M(X,~) ,
has a countable base for its topology, so
~ : M(X,%)------~M(X,_F~
inuous linear map, and by Proposition 4.1 we have
~))
is clearly a cont~
G(V_) . The result
thus follows from the Markov-Kakutani fixed point theorem (see for example Dunford and Schwartz (1958), V. 10.6, page 456).
82
Finally, we leave the reader to check that the hypotheses of Theorem 4.3 can be weakened, and that satisfying (3.13),
_Go(~)
is non-empty if
~
(3.14) and (3.15) (provided
is an H-invariant specification U
~ r ).
5.1
5. Lattice models
So far we have developed a genera], theory of random fields,
specifications,
Gibbs states. The time has come to examine some particular models. ion we will look at lattice models, introduction Let
S
be a countable
suppose that at the site
about the
set, representing
t e S
in the
Yt )'
a collection
of "sites",
we have an entity whose possible
described by the points of a set
In this sect-
and will study the model described
(but now making no assumptions
and
Yt " The cartesian product
and
states are
-~
repre-
Yt
t g S sents all possible configurations phase space F =
X . Suppose that
of the system,
Yt
and this will be our basic
comes equipped with a o-field
F t , the product o-field on
X . For
AC-S
let
=tF
Tr ~ ,
X(A) =
t ~ S
_F_o(A) =
; let
teA
~
Ft
, and let
PA
be the projection
from
X = X(S)
onto
X(A)
.
t gA Let
__F(A) denote the sub-o-field
depend
on
the coordinates
C let
in
F
observable)
, thus
FA
from outside
the order by
~
for
A s ~
let
finite subsets of
consists of the events
A . Since
(rather than by
we have a o-finite measure
consisting of those events which only
A , i.e. F(A) = (pA)-I(Fo(A)).
will denote the (non-empty)
F A = ~F(S-A)
measure);
of
mt ~A
C
~ on
S ; and for
A e C
that are measurable
(or
is ordered by inclusion we will denote
used in Section 2). Suppose for each (Yt,~t)
t e S
(which we will use as a reference
denote the product measure
~--[
~t
on
tea (X(A) '~o (A)) . We will start by assuming absolutely
continuous with respect
that we llave a specification to
~A
for each
A g ~
and
for which
o~
y g X(S-A)
is (where
60
OK
5.2
is the measure described in the introduction). This will impose conditions
on the Radon Nikodym derivatives of the
oK
(in order that (1.3),(1.4) and (1.7)
hold), and we will compute what these conditions are. From this comes a representation of the derivatives in terms of what, in a physical model, would correspond to a potential. We then reverse the procedure, and starting with a suitable potential we construct a specification. The properties of these specifications are then worked out in terms of the properties of the potentials. In this section we will only deal with specifications as we originally defined tbem (i.e. kernels satisfying (1.3),(1.4) and (1.7)). This is mminly for the sake of simplicity, and it is left as an exercise for the reader to see what modifications are necessary in the more general case. In fact, the more general case will be covered by what we do in Section 6. Some notation: it will be convenient to ~ i t e
(for
x E X , A~
X(A)
whenever
Let function on
S ), and in fact to write y E X(B)
{fA}Ag C X(A)
with respect to
with
YA
xA
instead of
for the projection of
be a collection of ~-measurable functions, fA(yx')
for fixed
y $ X(S-A)
, will represent the density o f
~A " (Here we have identified
~A(X,F) =
~
y
onto
B ~ A .
X
with
X(S-A)xX(A)
that we will continue to do.) We thus want to try and define
(5.1)
PA (x)
fA(Xs_A×w ) d~A(W)
, as a oK
, something
~A : X x ~
,
G
where
G = { w g X(A)
kernel we need:
:
Xs_AXw g F } . For this to make
~
a probability
)R
by
61 (5.2)
fA ~ 0
(5.3)
for all
5.3
A c~ ;
I fA(yxw) d~A(W) = i
for all
y ~ X(S-A) , A e ~ .
Suppose that (5.2) and (5.3) hold; it is then easily checked that (1.3) and (1.4) hold. Thus for ~ = {~A}Ae~ (1.7) holds; i.e. whether
to be a specification depends on whether
~%~A = ~% whenever
A C-% E ~ . A simple calculation
shows that (1.7) will hold if and only if
[
I fi(Xs-%Xw) I f%(xS-%xw%-Axz) d~A(z) d~i(w) F
(5.4)
I
f%(Xs_~Xw) d~7~(w)
for all A C_ % g C , x g X , F g _F_o(Ti ) .
F
In particular, (1.7) will hold provided
(5.5)
f%(x) =
fA(x)I f%(Xs_Axw) d~A(W)
for all
x g X ,A ~ % E~ .
(The converse of this is almost true, in that (5.4) implies (5.5) holds for ~-a.e. x% .) We can in fact replace (5~
by a simpler condition, given by the
following result.
Lerm~a 5.1 I
Suppose (5.2) and (5.3) hold. Then (5.5) is equivalent to if A ~
~~
and
x
~ g X with Xs-~ = ~S~A
(5.6) L
f%(~)fA(x) = f%(x)fA(~) .
then
62 Proof
5.4
Suppose that (5.5) holds and let A ~ % s C , x , ~ g X with
Xs_A = KS_A . Then
f~(~)fA(x) J~(Xs_A• %(w)
:
an~ thus f%(~)fA(x) = f%(x)fA(~)
But if
I f%(Xs-AXW) d~A(W) = 0
F(x)f~(~) ]f~%_~xw)%(w) provided
then
,
I f%(Xs-AXW) d~(w) > O .
f%(x) = f%(x) = 0 , and so we still have
f%(x)fA(x) = f%(x)fA(x) . Conversely, suppose (5.6) holds. Let
A~%~C,
xsX;
if w,z C X(A) then
fA(xs_Axw)f%(Xs_Axz)
Now by (5.3) there exists
by (5.3) we have that
w
s x(A)
f%(xs_AXw)fA(xs_AXZ) .
such that
I fTi(xs_Axz) dc0A(z)
fA(xs_A•
exists. Therefore
= j" fA (x)f% (xS_A• =
I fT[(x)fA(Xs-A•
d~A(W)
f~(x)
dmA (w)
.
Suppose from now on that we have a collection fundtions satisfying (5.2),(5.3) and (5.6). Let defined in terms of fA(x) > 0
for all
{fA}Ag C
> O , and thus again
{fA}Ag C
~ = {~A}Aa~
of =F-measurable
be the specification
using (5.1). If, instead of (5.2), we have
x E X ~ A E C , then ~e can obtain a representation of
63
{fA}As C
5.5
in a form which should be recognizable to anyone who is familiar with
the formulae of statistical mechanics. Fix
X
= { x e X :
x
and
b
b e X
as a reference point and let
differ at only finitely many sites } ;
O
thus
x g Xo
if and only if
Proposition 5.1 exists
a unique
Xs_ A = bs_ A
Suppose that function
for some
fA(x) > 0
V : X
-#R
for all with
A e =C .
x e X , A g ~
V(b)
= 0
and
. Then there
such
that
0
exp{V(x)} (5.7)
fA(x) =
for all
x e X
~exp{V(Xs_A
XW) } d~A(w)
Proof
Let
x g X
let o
A g C
~
with
Xs_ A = bS_~
o
and let
% e C
~
with
~
%~
A 9
Then by (5.6) we have
f%(x){f%(b)} -I
and thus we can define
(5.8)
Clearly
V(x) =
V
=
fA(x){fA(b)}-i
by
log{fA(x)} - log{fA(b)}
V(b) = 0 . Now let
x e X
, A s C O
Xs_ % = bs_ %
and
% ~
and choose
=
A . Then
exp{V(Xs_Axw)}
and thus by (5.5)
for any
=
,% -I f (Xs_AXw){f (b)} ,
A g ~
with
% s C
Xs_ A = bs_ A .
such that
64
fA(x)
exp{V(Xs_A•
=
Therefore
(5.7) holds.
Xs_ A = bs_ A
= {f~(b)}-IfA(x)
d~A(W)
{f~(b)}-lf~(x)
=
If
5.6
exp{V(x)}
f~(Xs_A•
d~A(w)
.
is such that (5.7) holds and if
V
x g X
with
o
then we must have
fA(x){fA(b)}-l =
exp{ V(x) - V(b) }
V(b) = O , V
Thus, together with the condition
.
is uniquely determined
Now although the above result tells us the form a specification
by (5.8). ~I
is likely
to take, in practice we really want the converse of this. We will usually be given a function
V : X
> R , and we want to define a specification
o
of it. It is easily checked that if fA : X
> R
o
then the
fA
V : X
is defined by (5.7) satisfy (5.2),(5.3)
~R
o
with
The function relative b
to
to
b , i.e.
V
and (5.6) , but of course with
V(x)
from
replacing
X~ X~
to
X ; and
to ensure that this can be done. energy of the system
is the work done to change from the configuration
x . In statistical
physics
as a sum of terms due to various
erent sites. We will mimic
(5.9)
V
fA
in (5.7) looks like the potential
the configuration
usually ~ i t t e n
on
V(b) = O , and if
(assuming that the integrals make sense),
X 9 The problem is thus to extend the definition of we will now look at conditions
in terms
the potential
interactions
this here, and suppose that
V(x) =
~
CA(XA)
,
V
energy is
between the diff-
can be written as
65
~A : X(A)
where
(5.10)
Let
~
) R
is ~o(A)-measurable
~A(w) = 0
if
w{t } = b{t }
5.7
and s u c h t h a t
for some
t ~ A 9
= 0 . Note (5.10) implies that the sum in (5.9) is only over a finite
number of non-zero terms for each convergence.
~A
x g X ~ , thus there are no problems about
can be regarded as the potential due to the IAl'body interac-
tion between the sites in
A . In fact, the assumption that
as in (5.9) is only a measurability checked that, if we define
(5.11)
~A(z) =
~A
~
assumption.
V
can be written
This is because it is easily
by
(-I)[A-AIV(bs_AXZA)
,
A CA
then (5.9) and (5.10) hold.
(Furthermore,
this
~A
is the unique solution to
(5.9) and (5.10).) Thus the only question that arises is whether (5.11), is
F(A)-measurable.
and only if for each
Now for
(5.12)
#(x)
It is not difficult to see that this will happen if
A g ~
A ~ C
V(bs_A •
define
=
~
: X(A)-
A :X o
g
)R
)R
by
~A(XA ) ;
A~(A)
where
~A ' defined by
~(A) = { A g ~ : A h A
V(Xs_Axw)
# ~ ) . Then we have
- V(Xs_AXbA)
and thus we can rewrite (5.7) as
= gA(xs_Axw )
is F o(A)-measurable.
68
5.8
exp{ gA (x) } (5.!3)
fA(x) = l expigA(Xs_AXw)] dmA(W)
But if the
~A
are nice enough then we can use (5.12) to define
x c X , and hence use (5.13) to define
fA(x)
for all
ileal I
=
sup
for all
x g X .
Suppose until further notice that the reference measures Let
gA(x)
t
are finite.
i~A(y) i
yeX(A)
Proposition 5.2
Suppose that for each
t g S
II~AII
~._
is finite.
Ac~({t}) (Note that
AE~({t}) just means the sum is over all finite
Then for each X , and if
A g ~
fA : X
A
containing
(5.12) defines a bounded, ~-measurable function )R
is defined by (5.13) then
{fA}Ag C
A
g
t .) on
satisfy (5.2),(5.3)
and (5.6).
Proof
We have
~-- iCA(XA)l = ~. AaZ(A)
t g A
~__
1 --I+A(XA) I lAnAI
AgZ({t})
i0A
=< t 8 A~ AgZ({t})
thus
~__
li:it by
~A(XA)
converges absolutely and uniformly. If we denote the
gA(x) , then clearly
gA : X
>R
is bounded and F=-measurable; we
67 can thus define
fh : X---->R using (5.13). It is inmlediate that (5.2) and (5.3)
hold; to show that (5.6) holds for exp{gA}; and if
=
Hence
fh
A C-% E ~ , x , x E X
exp{gT~(x)}/ exp{gT~(x)}
=
5.9
=
amounts to showing that it holds for with
then
exp{ gT[(~) _ g%(x) }
exp [ ~ { ~A(~A)- ~A(XA)}] ~AgZ (%)
exp{ gh(~) _ gA(x ) }
Xs-h = XS-A
=
exp{g~(~)}exp{gA(x)}
=
exp [ ~ { CA(~A) - CA(XA) }] As Z (h)
exp{gA(~)} / exp(gA(x)} .
=
exp{g%(x)}exp{gA(~)}
.
82
We now have an explicit method of constructing specifications in terms of the functions V
{~A}AgC . If
a potential (with base point
call
{~A}Ac~
V : X~ b ); if
)R
with
V(b) = 0
~A
is defined by (5.11) then we will
the interaction potentials corresponding to
satisfy the hypotheses of Proposition 5.2 and specification, then we will say that We will sometimes write states with potential
~(V)
~
instead of
~ = {~A}Ac~
then we will call
V ; if
{~A}Ac~
is the corresponding
is the specification with potential G~)
, and call
~(V)
V .
the Gibbs
V .
Conditions like (5.2),(5.3) and (5.5) probably first occur in Dobrushin (1968a). The proof of Proposition 5.1 is adapted from Preston (1974b); a similar kind of result can be found in Sullivan (1973). The condition on the
{~A}AcC
given in Proposition 5.2 corresponds to one which is often used in statistical physics; see, for example, Ruelle (1969), Lanford and Ruelle (1969).
68
Let
{~A}AeC
5. i0
satisfy the hypotheses of Proposition 5.2, and let
the corresponding specification.
We will show that
used in the previous sections; in particular
G(V)
~
be
satisfies the hypotheses will be non-empty (provided
we have the right standard Bore! structure). Suppose from now on that
(Yt,~t)
is a standard Borel space for each
then
{BA}AE~
~A = ~(A)
satisfy (3,1),(3.2),(3.3) and (3.4) (with ~ = ~ ).
Proposition 5.3
Proof
t e S ; thus if we let
G~V)
is non-empty.
We will show that (3.7) and (3.8) of Theorem 3.1 are satified. Let
and let
NA > 0
be such that
IgA(x) I ~ NA
for all
x a X . If
F g ~(A)
A g then
G
where
G = PA(F) ; and thus for any
~A(X, F)
Define
~ g P(B__ A)
holds with For
~
x ~ X , F E ~(A)
we have
exp{2NA}~A(PA(F))/ ~A(X(A)) 9
by letting
~(F) =
mA(PA(F))/ WA(X(A)) ; then clearly (3.11)
A = e . But (3.11) is stronger than (3.7), and so (3.7) also holds.
A , % g ~
define
g~ : X-
g~(x) =
~
~R
by
~A(XA)
A~Z(A)
A~% then
g~
is ~(%)-measurable, and
g~-----> A gA
uniformly as
~---->S
. Hence, if
69
F ~ =F(Ao)
for some
h%(x) =
Ao g =C and
h%
5.11
is defined by
[ I exp [gTl(xs_Axw) ( w ) l -} dmA i A
[~
exp{g%(xs_AA Xw) } d0~A(w)1 F O
where
Fo = { w E x(A) : xS_A•
g F } ; then
is a simple matter to verify that Therefore (3.8) holds.
h~
,,,
h%
is =F(AUAo)-measurable , and it
> ~A(.,F)
uniformly as
~
~S .
82
The calculation in the above proof in fact shows that (3.11) holds; and thus
G~)
has the compactness property given by Theorem 3.2. Recall that this
gives us the following: if {n.}.>. j
j
and
D g G(V)
~n ~ ~-~-) , n ~ i , then there exists a subsequence
such that
~(F) = lim ~n (F) j-~o j
l
for all
F g A , where
t._,J %(A). A~__c
A =
The most important examples of lattice models are when d ~ I , and when all the
(Yt,~t)
are copies of the same space
S = Zd (Yo,~)
this case there is an obvious way of defining an F-measurable bijection T
t
: X
>X
for each
(5.14)
Tt Ts
(5.15)
To(X) = x
(5.16)
Tt(~(B)) = =~(B+t)
where
= Tt+ s
t r Zd
sucb that
for all
for all
B+t = { s+t : s g B } .
s, t g Z d ;
x g X ;
for all
B e_ Z d
,
t g Zd
for some . In
70
We just let
(Tt(x)) s = Xs+ t , (i.e. the
(s+t) th. coordinate of d
5.12
s th. coordinate of
Tt(x )
is the
x ). There is also an obvious way of constructing
~
Z -znvariant specifications, i.e. specifications that are invariant under the group
{Tt}tEzd
same measure
~o
t ~ Z d ; then where
: start with reference measures
{~A}AcC
(Yo,_Fo)_ , choose
b g X
d . will be Z -znvariant if
~
(5.17)
where
on
~
~t
that are all copies of the
such that
bt
is independent of
is defined in terms of
{~A}AcC ,
d are Z -invariant in the sense that
~A(X) =
(x+t)s+t
Proposition 5.4
~A+t(x+t)
for all
A g ~ , x s X(A) , t g Z d ,
= xs
Suppose that
{~A}AE~
X
satisfy (5.17) and that
]I~AII
Ag~({t}) is finite for all
t ~ Zd 9 I f
=V i s t h e c o r r e s p o n d i n g
specification
then
G (~_V)
d is non-empty; i.e. Z -invariant Gibbs slates exist.
Proof
From the proof of Proposition 5.3 we know that (3.7) and (3.9) hold. Thus
the result follows from Theorem 4.3.
82
The proof of Theorem 4.3 also tells us that, under the hypotheses of Proposition 5.4, _Go(~V)
is sequentially compact in the sense described after
Proposition 5.3.
Note that
{Tt}tgzd
separates
Theorem 4.1 the extreme points of ~(V=)
~ , (as defined in Section 4), thus by
G=o(~) are exactly the zd-ergodic measures in
. Note also that all of the above remains true if
S
is any countable
7~
5.13
abelian group. Before looking at some specific examples of lattice models, we will consider one more general topic. This will concern finite range interactions, and more particularly, nearest neighbour interactions. We will say that a function
f : X-
>R
is ~-measurable if
f
is ~(A)-measurable for some
(Such functions are sometimes called tame.) Let
V
in terms of
has finite ran g_e if
{fA}Ae C
as before. We say that
~-measurable for each
A e ~ . If
V
~
A e ~
then there exists
(Recall that and
% e ~
~(B) = { A e ~
;I~AI I
such that if
: A~B
is finite for all
be a specification defined
is a potential, with
ponding interaction potentials, then we say that
V
{~A}AgC
A g ~(A)~(S-%)
then
fA
is
the corres-
has finite ranse if, given
# ~ } .) Note that if
A e ~
A e ~ .
~
V
IleAl !
then
~A = 0 .
has finite range is finite for all
Ae~({t}) t e S , since there are only a finite number of non-zero terms in this sum.
Proposition 5.5
Let
x e X , A e ~ ; let 5.1. Then
V
ll@Al I
tial
V , then
For
V
be the potential with base point
finite for all ~
A e ~ , and if
V
~
b
fA(x) > O
for all
given by Proposition
is any finite range potential is the specification with poten-
has finite range.
t e S
""
A e C
have finite range, and suppose that
has finite range. Conversely, if
with
Proof
~
let
7~ e C
be such that
f~t~~
is ~(At)-measurable.
If
t
and
% =
~
A t , and
A e C
is such that
A e ~(A)N~(S-%)
tea will show that
~A = O . From (5.8) and (5.11) we have
, then we
72
~A(y)
= ~
5.14
(-l) IA-Bi{ log fA(bs_BXY B) - log fA(b) }
B~-A
Let
s 8 AlIA , t s A~l(S-%) , and for
= BkJ{t} , B = BD{s,t}
(5.18)
B C- A - {s,t}
let us write
B = B~{s}
; then
fA(bs_BxYB)fA(bs_~Xy~)
I
(_l)iA_Bilog
~A(y ) =
fA(bs_~XY~)fA(bs_~XY ~)
B e_ A-{s,t}
fA(bs_BXY B)
f{S}(bs_BXY B)
But from (5.6) we get
and fA(bs_~Xy~)
fA(bs-BxYB) " fA(bs_gXyg) t ~ ~
-
f{S}(b S ~Xy~)
f{S}(bs-BxYB)
f{s} ;
and since
is F(%)-measurable,
and
f{S}(bs_gXy ~)
we have
f{S}(bs_BxYB) = f{S}(bs_~Xy~)
, f{S}(bs_~Xy~) = f{S}(bs_~Xy~)
.
Thus every term in the sum on the right-hand side of (5.18) is zero, and hence ~A(y) = O . Conversely, suppose that finite for all implies
A g C ; let
A e C
V
is a finite range potential with
and choose
~A = 0 . By (5.12) we have
% g C
gA(x) =
such that
IleAl i
A g__Z(A)fIZ(S-%)
~A(XA) , and therefore
g
A~_Z(A) A~ is ~(%)-measurable.
Let
~
Thus by (5.13)
fA
is ~(%)-measurable.
82
be a finite range specification corresponding to a finite range
,
73
potential
V
with
[[~A[ [
finite for all
5.15
A e ~
proof tells us that the "range of interaction" Hore precisely,
for
~(r(A))-measurable, A e ~(S-~(A))
Proposition
A e C
let
and let
implies that
5.6
For all
r(A)
r(A)
. An examination of the above
of
~
is the same as that of
be the smallest
be the smallest
set such that
set such that
fA
V .
is
A e ~(A)
and
~A = 0 . Then we have
A e ~
r(A) = ~(A)
; also
r(A) =
~
r({t})
.
t g A
Proof
The proof of Proposition
(5.19)
r(A) C_.
r(A)
~_.
5.5 tells us that
I ~
I r({t})
for all
A e C
.
tEA
In particular we have
r({t}) = r({t})
r(A I) ~
r(A2)
whenever
Thus if
t e A
then
t e S . Now it is clear that
AIC- A 2 ; and by (5.5) the same thing is true for
r({t}) ~
r({t})
for all
~
r(A)
r(A) C
tEA
, and hence by (5.19)
~(A) ~
~
r({t})
G . Assume that
G
has no multiple
and that there are only finitely many edges incident s
and
~A , the boundary of
t
~
S
are
edges, no loops,
to each vertex;
then we will call them neighbours.
For
if there is A ~
S
define
A , by
~A = { t e S - A : t
SAg
.
example of the above result is when the points of
the vertices of some graph
(Note that
.
tEA
An important
an edge between
r
if
A e ~
is the neighbour
of some
s e A } .
.) Call a finite range specification
~
nearest
74
neighbour
(with respect to
(i.e. if
r(A) C
if
r
= 0
whenever
A~A
whenever s , t s A
Proposition 5.7 potential
V
if and only if
V
Pro_____o~ Suppose
G ) if for each
A
is not a simplex of
with
s # t
then
II@AI I
t ~ r({s}) = r({s})
with
s # t
A
V
is a simplex with
AUrA .
D ( F 2 ~ F 3) > 0 ~(A)
nearest neighbour G
if
are neighbours).
~
is nearest neighbour
A g ~
s , t
then
with
not a simplex;
not neighbours. Hence
A g ~({s}) V
A
, A ~ ~(S-r({s}))
is nearest neighbour. If A C AUrA
; thus
82
g G(V)
then
A g _C and
;j is often called a
F 1 g F(A)
, F 2 ~ F(~A)
,
then
~.'(FI~ F 2 ~ F 3 ) D ( F 2)
says that
t
V
is a simplex of
A c ~ . Then
and
A E ~(A)
is nearest neighbour and
F3 g ~(S-(AD~A))
Thus if
and
; and since
Markov random field. This is because if and
( A
CA = 0 . Conversely, suppose that
r(A) = ~(A) C-
If
C
is nearest neighbour, and let
t ~ ~{s}
and
s
finite for all
s , t E A
A ~ ~
is ~(At.JSA)-measurable,
be a finite range specification corresponding to a
there thus exist
we must have
fA
is nearest neighbour.
~
and so
A ~ ~
) ; call a finite range potential
Let
with
5.16
=
D(F2f~ F 3 ) ~ ( F I N F 2 ) 9
we can write this as . ~(FIIF 2 ~ F 3) = ~(FIIF2)
is independent of
~(S-(A~$A)),
given
~(~A)
, which
9 This is an
analogue of the usual Markov condition (though here it is spatial, rather than temporal). Proposition 5.7 is a form of a much over-proved result, versions of which can be found in Averintsev
(1970), Spitzer (1971), Harmmersley and Clifford
(1971), Suomela (1972), Sherman (1973), Preston (1973) and Grimmett
(1973).
,
75
5.17
Now for some specific examples of lattice models.
I. The Ising model
This is supposed to be a model for a d-dimensional magnet.
Let
S = Z d ; at each site
and
- ; thus each
Yt
t E Zd
there is a "magnet" Which has two states,
is a set with two points in it. We could let
+
Yt = {+'-}
but it will be more convenient (for the author) to let
Yt = {O,I} . This allows
us to interpret
as the site being empty
1
as the site being occupied, and
O
(and so we could consider this as a lattice gas rather than a magnet). For A C. Z d
we will make the obvious identification of
of subsets of
A
(i.e. we identify
y e X(A)
with
define potentials we will take the base point to be X~
can be identified with
potentials
{~A}Ag~ ; if
~ . Let %r'A
e ~
V : ~---~R
X(A)
with
~(A)
{ t e A : Yt = i } ). To (i.e. all
O's ), thus
be a potential with interaction
then by (5.10) we have
~A(%) # O
= A ; thus we can regard the interaction potentials as a potential by letting
, the set
~(A) = ~A(A). This gives us the formulae connecting
V
only if ~ : ~-
and
~
JR in
the form
V(A) =
~
~(A)
A~A
In terms of t g Zd
~I
,
~(A) =
~
(-I) IA-AIv(A) .
A~-.A
~
~(A) I
the hypotheses of Proposition 5.2 become that for all must be finite. Let uS assume that this is true, and thus
teAgC = ~(V)
is non-empty. The main problem for this model is to determine for which
potentials we have If
V
I~(V) I = 1 , i.e. exactly one Gibbs state with potential
is zd-invariant then we also want to know if
,=,IGo(V) I = I ; and whether
V
~
76
G (V) = G(V)
. These
problems
5.18
are extremely
diificult;
the results
that are kno~r
~O
often have
involved
and very clever proofs,
some of the results
(5.20)
(Dobrushin
and much is not known.
in order to give some idea of the complexity
(1968c))
Suppose
< I
there exists
'~', (IAI-1) I~(A)I 5 ~/
state
of the situation.
such that
for all
2e
We will
t E Zd .
tcAaC Then
i~(V) i = i . (This result
says that the interaction
Suppose
A
all
(5.21) with
between
the sites
from now on that the potential g ~
, t E Zd
(Lebowitz
IAI ~
means
and Martin-L~f
diam(A)
= max{
d
.
is Z -invarlant,
(1968)) Im-nl
(1972))
l(v)l :
that the interaction
(Ruelle
is weak.
i.e. ~(A+t)
for
= ~(A)
9
2 9 Then
(5.22)
S = Z d .) The condition
does not depend on having
I
Suppose
if and only if
is attractive,
Suppose
~(A) _>_ 0
lo(V)l :
for all 7% E
i . The condition
C on
or that we have a ferromagnet.
d = i , and for
: m , n g A } . Then
~
that
A C.Z
I~(V) I = 1
let provided
l~(A) Idiam(A)
O~A~s is finite.
This says that in one dimension
able interactions.
(5.23)
(Dyson
we have
i~(v)1
= 1
for most reason-
However:
(1969))
Suppose
d = I ,
r
= 0
for
]A I ~ 3 , #({O,n})
= n -~
77
for
n > O , and
~({0}) =
- ~_
5.19
n -~ , (where
n>
~ > i ). Then
a < 2 . (Note that from (5.22) we have
[~(V) I = I
if
#(A) = O
if
, . Z d -invarlance
~({O,t})
for all
is determined by
~({O})
and
(5.24) (Ruelle (1971) , Lebowitz and Martin-Lof for all
t # 0 . Then
]~(V) I = 1
2~({O}) +
~
> I
if
a > 2 .)
Now we will only consider pair potentials, i.e. ~
[G(V)]
1
IAI ~ 3 . By the 0 ~ t g Zd
(1972)) Suppose that
~({O~t}) ~ 0
provided
#({O,t})
#
0 .
t#O
In terms of magnets (and we are again in the ferromagnetic case) the condition 2~({0}) + ~
~({O,t})
=
0
means the absence of an external field; thus
t#O I~(V) I = i
in the presence of a non-zero external field.
(5.25) (Ruelle (1971) , Griffiths (1964), Dobrushin (1965)) for all
t # 0 , that
2~({0}) + ~ ' - - "
~({O,t})
=
Suppose
~({O,t}) ~ O
O , and that for
% > O
V%
t#O is the potential corresponding to I~(V~)I > 1 . Further, if if
~({O,t}) ~ y > O
d ~ 2 , then there exists
we also have
%~ . If
p = p(~)
l~(V%)l > i for all
such that
l_G_~(Vp)I > 1 ). The parameter
%
and
t E Zd
~ > ~ with
then llti] = i , and
l~(Vp) l > 1 , (and by (5.21)
is inversely proportional to the
temperature, thus the result says that a ferromagnet, in the absence of an external field, exhibits spontaneous magnetization at low enough temperatures.
78
If
V
is as in (5.25) with
5.20
for
~({O,t}) ~ y > O
Iltll = i , and if
d > 2 ,
then letting
c
we have
=
0 < %c(~)
A particular
(5.26)
d = 2
inf{ ~ > O : I~(V%)I
< m
and
I~(V%)]
i
if
O
otherwise,
~({o})
the actual value
to be a major unsolved
For
(Griffiths
repulsive
VI
(5.28)
d > 2
(1964),
~
interaction
e
is known.
the value
=
1
-
=
~ > %c
(Dobrushin
is unknown,
Dobrushin
(1965))
Let
corresponding
(or an anti-ferrom~o.gnet). G (VI) # ~(VI)
(1968c))
Suppose
and the case
result
d = 3
~
to If
be as in (5.26), 1} . If d > 2
. (This is about
~ < O and
of
is con-
=
then we have a
the simplest
that
~({0})
otherwise~
-dx
+
and for any
~ < -I
if lltll=l, =
O
-d .
(It comes from a famous
breakdown.)
~({o,t})
if
problem.
be the potential
I~(Vx) [ > 1 , and also symmetry
, ..[G(V%)I= > 1
~c
IInil = 1
of
sidered
let
% <
llt~ = I ,
=
(1944).)
s R
if
that
Onsager
(5.27)
= I
> 1 }
case of this is:
Suppose
~({O,t})
For
(r
p
.
(~)
then
example
of
c
79
If
% > O
d > 2
then from (5.24) we have
then there exists
M > 0
5.21
[G(V) I = 1
such that if
whenever % < -M
O ~ 0 . However,
then
IG(V) I > 1
for some
For other results on the Ising model the reader should look, for example, Ruelle
(1969) , Gallavoti
2. A model with
S
(1972) and Georgii
an infinite tree
is much simpler
We will again have some
N ~ 1 , where
vertex.
TN
(1974a) and Preston Yt = {O,I}
deleted.)
involved,
for each
u
and
I ~({s,t})
N+l
graph which becomes disconnected
are homogeneous;
the
S = TN
for
edges incident to each when any edge is
on this graph, and will
thus there are only two parameters
v , where
v
if
s
and
t
are neighbours,
=
~({s})
O
The corresponding
The results about
t g S , but now
We will look at nearest neighbour potentials
assume that the potentials
it is
(1974b).
is the infinite tree with
(A tree is a connected
than the Ising
can be made with it. However,
still complex enough to exhibit some interesting behaviour. model are taken from Spitzer
in
(1973).
This model
model, and fairly explicit constructions
if
= u
otherwise,
potential will be denoted by
we want to know for which values of
u
and
v
V(u,v)
. As with the Ising mode]
do we have
l~(V(u,v))[
= 1 .
For this model a comp]ete answer can be given. A crucial fact about this model seems to be that there is an explicit method of constructing we can construct
a lot of random fields;
at least one e]ement of
enough so that for each
~(V(u,v))
.
u ,
v
80
Let
5.22
O < p < i , O < q < i , and consider the matrix
as the transition matrix of a Markov chain with the two states be the unique stationary distribution (i.e. subgraph isomorphic to X(A) =
--~-
Yt
p l-p ) l-q q
M =
O
~M = ~ ). Now if
and
A ~. T N
Z , then we can define a probability measure
corresponding to
M
and
~ :
~A
1 ; let is a
~A
on
is the stationary distr-
t ~ A ibution of the doubly infinite chain, as a measure on its sample space.
It is
not hard to check that these measures are consistent, and they uniquely define a random field an element of
~(p,q)
on
~(V(u,v))
X . A simple calculation then shows that
~(p,q)
is
, where
=
logIIl-q~N+l
I l-p~]
(5.29)
= logI (l-p)(l-q) Pq 1 Let sends
F : (O,I)• (p,q)
to
~R 2 (u,v)).
r
be defined by (5.29) (thus
of more than one point for some l~(V(u,v)[ > I 9 In the case that
and only if
s30
(u,v) v ~ O
G(V(u,v))
. If
xl x
F-l(u,v)
consists
then we will have of course (an attractive interaction), the results
(1972) can be used to show that
lY-l(u,v)[ > 1 . But
I~
is the map that
is easily seen to be onto, thus as stated above,
we can construct at least one element of each
of Lebowitz and ~ r t i n - L ~ f
F
II'-l(u,v)[ > 1
l~(V(u,v))I > i
if
if and only if the equation
81
has more than one solution w i t h
x > 0 . If
(5.30) has only one n o n - n e g a t i v e
solution;
then there exist values solutions.
of
u
for w h i c h
Note that the case
N = I
5.23
N = 1
or
but if
fN+l~ 2 ~ I , then
exp(v) ~
N > i
and
(5.30) has either
2
/N+i~ 2 ~N-~I
exp(v)
>
or
non-negative
3
has already been covered by (5.22),
since
T1 = Z . If F(p,q)
v < O
= (u,v)
{ (u,v)
(a repulsive
with
: v < O }
by
v < O
interaction)
if and only if
{ (p,q)
q x
then
IF-l(u,v) l = 1 ; also
p + q < 1 . Thus w e can r e p a r a m e t r i z e
: p + q < I } . Define
N
+
~ : R+ -
~R +
by
(l-q)
~(x) (l-p) x N
Using a construction showed that if
similar
p + q < I
only if the equation
p
to the one used to get
and
~o~(x)
+
(u,v) = F(p,q) = x
and
if and only if
(u,v) = F(p,q), ~(x) = x
then
~o~(x)
In both cases we therefore l~(V(u,v))I
positive solved,
solution.
= i
though the answer
3. Countable
state Markov
of some countable
set
= x
solution,
Y
for each
if and
(namely
x = i ).
if solution
and this happens
surprisingly
S = Z
result:
~o~(x)
the p r o b l e m of when
Let
solution
= I
if and
solution.
have the following
is perhaps
chains
IG(V(u,v))[
(197~a)
has only one positive
if and only if the equation
For this model
Spitzer
p + q ~ 1 . This is b e c a u s e
has only one positive
only if (5.30) has exactly one positive
then
, then
has only one positive
In fact the above still holds when p + q ~ I
(5.27),
= x
if
(u,v) = F(p,q) has exactly one
I~(V)[ = i
has thus been
complicated.
and suppose
that
Yt
is a copy
t E Z , We will look at nearest n e i g h b o u r
82
5.24
specifications
in this case (with
Z
neighbours
n
n+! ). We will see how the Gibbs states
of
being
n-I
and
considered
with Markov chains having state space
Y
as a graph in the usual way, the
(and regarding
axis). The results stated are taken from Spitzer ional material can be found in F~llmer Put the o-field measure. X
Let
{fA}Ag C
g X , A g ~
also that
. ~
fA
iance, thus
~(Y) satisfy
Choose a base point V
A
gives that
V
is of the form ~
for all
b g X
b
other than a potential.
f{O}
{n}
or
by
, where
probability x
t g Z .
~{0}
x , z c u
t e Z
5.1, and let
and
define
f
be
5.7 cont-
the proof of this
. The Z-invariance ~{0,i}
(for some {~A}As~
Then, although Proposition
{n,n+l}
Thus of
~A = 0 V
therefore
" to represent
X,Z
x = W_l , y = w 0 , z = w I
is ~({-l,O,l})-measurable).
the conditional ing state was
For
{fA}Ag c . Suppose
for all
given by Proposition
In the present case it will be convenient
since
and
= ~
t
for all
, and that we have Z-invar-
A g C , x g X
such that
fA(x) > 0
is a nearest neighbour potential.
is determined by
fx,z(y) = f{O}(w)
given by
that the reference measure was finite,
result still shows that unless
and suppose that
for all A g ~
interaction potentials.
ained the hypothesis
(1975). Addit-
(1975b).
(5.3),(5.6),
be the potential
the corresponding
time
Y , and use counting measure as the reference
is ~ ( A ~ A ) - m e a s u r a b l e = fA(x)
as the discrete
(1974b) and Kesten
will denote the specification
fA+t(x+t)
S Y ). Let
on
Z
Lie in
: Y----~R
state is
and the next state will be
z . Let
by something
by
(and this is well-defined
Thus, for any
that the present
V
~ g G~)
, fx,z(y)
is
y , given that the precedQ : Y • Y
-~R
be given
5.25
83
(5.31)
{f~,y(~) }-If~,y(X)
Q(x,y) =
Then a simple calculation shows that
(5.32)
fx,z (y)
where
=
{Q2 (x,z) }-lQ(x,y)Q (y,z)
Q2(x,z) =
Q(x,w)Q(w,z) . Thus
Q
determines the
f
X~Z
, and
wgY vice versa. But it is not difficult to see (using (5.8) in the proof of Proposition 5.1) that we have
*{o}(X)
I
=
log [
{Q(~,~) }2 ]
(5.33)
:IQ(x'Y)Q(~'~)] ~{O,l}(X,y)
Thus
Q
determines
V
=
log [Q(~,y)Q(x,~)J
and hence
V
as
fA
of
X .) Note that one consequence of
{fA}AE~
is ~(AUSA)-measurable this means that
is determined by We will use
Q
(since
Q
V
V
determines fA
fA
on
X
O
, and
is determined on the whole
being determined by the
f
X~Z
is that
f{O} .
to describe the specification. It is easily checked that
has the properties:
(5.34)
Q(x,y) > O
(5.35)
Qn(x,y) < ~
for all
for all
matrix multiplication, i.e.
x , y g Y ;
x , y g Y , n ~ i , (where
Qn(x,y) = ~ Qn-l(x,z)Q(z,y) zgY
Qn
).
is defined by
84
Thus our specification defines a matrix sely, if
Q
Q
5.26
satisfying (5.34) and (5.35). Conver-
is a matrix satisfying (5.34) and (5.35), then defining
V
through
(5.33) gives us a specification for which (5.32) and (5.33) hold (but not necessarily (5.31)). Let
F
denote the set of all functions
satisfy (5.34) and (5.35). Then to each ~(Q)
Q g F
Q : y x y.
~R
which
there corresponds a specification
~ and the specifications we get this way are exactly the ones satisfying the
conditions we started with. This correspondance is not, however, one to one; but it is not hard to show that if exists
k > 0
and
v : Y
Q , Q g F
~R
, with
Q(x,y) =
If
then we will say that
~(Q) = ~(Q)
~(Q) = ~(Q)
v(y) > 0
Q(x,y)v(y) %v(x)
(5.36)
then
for all
for all
Q
and
Q
Q
are equivalent.
Q
~
by the matrix
Q
is
behaves like the transition matrix for a M a r k o v
chain (recalling the interpretation of However,
y g Y , such that
x , y 8 Y .
The reason for representing the specification because (5.32) suggests that
if and only if there
fx,z(y)
as a conditional probability).
is not usually going to be stochastic;
(though in many cases it will
be equivalent to a stochastic matrix). Suppose
P g F
and aperiodic. P
is stochastic; then by (5.34)
P
has to be irreducible
will thus have a stationary probability distribution if and
only if it is positive recurrent. If
P
is positive recurrent, and
~ : Y~
.>R
is the unique stationary distribution, then we can define a Z-invariant random field
~ = D(P)
by
n
~({ x : x k : tO , Xk+ I = E1 ,..., Xk+ n : ~n })
:
~(~0 ) ~
P(~j-I'~j ) "
j=l
We call
~(P)
the stationary Markov chain defined by
P . It is easily checked
5.27
85
that
~(P) g G(V(P))
. The following facts are known:
Y
(5.37) (Dobrushin (1968a); see also Spitzer (1971)) Suppose Q ~ F
then there exists a unique stochastic
equivalent; and
is finite. If
such that
P g F
~(P)
~(~_V(Q))I = I_Go(~(Q))I = I , with
P
and
Q
are
being the single Gibbs
state. Further, we have
P(x,y)
where
%
Q(x,y)r (y) ~r(x)
=
is the largest eigenvalue of
for all
Q
and
x,
r
ygY
,
is the corresponding right
eigenvector.
Suppose from now on that
Y
(5.38) (Kesten (1975)) If
is infinite.
Q E r
latter occurs if and only if Moreover, in this case
P
Q
then
I~_V(Q))I
is either
0
or
I . The
is equivalent to a positive recurrent
is unique, and
D(P)
P E r .
is the single Z-invariant Gibbs
state.
(5.39) (Kesten (1975))
Suppose
P 8 r , and suppose there exists
Q s F 6 > O
is not equivalent to a positive recurrent and
m > 1
such that for all
x g Y
m
~ .
Qn(x,x) > 6 .
n=l
Then
G~V(Q))
is empty. In particular, this will hold if
Y = Z
and
Q
is
equivalent to the transition matrix of a random walk; a case proved in Spitzer (197~b).
86
(5.40)
(Spitzer
(1974b))
then both
l~(g(Q))]
I~o~(q))l
= 1 .)
(5.41)
(Spitzer
I~V(P))I
= i
(1974b))
If and
Q e F
is equivalent
I~C~(Q))I
There exists
= ~
then it cannot be equivalent
Note that in this case w e could not have then any translate of Z-invariant).
p
is still in
to a positive recurrent
are possible.
a null-recurrent
= ~ . In this case w e also have
null recurrent
5.28
GoQV(P))
(By (5.38) we [lave
P g F
is empty,
such that since if
to a positive recurrent IG(~_V(P))I = 1 , b e c a u s e
~C~(P))
P g F
, (but of course
if D
P
is
P g F . P g ~(~(P)) cannot be
6.1
6. Continuous models: point processes
We noted that the Ising model, considered in the previous section, regarded as a model for a lattice gas; where at each site presenec,
and
0
the absence of a particle
1
could be
represented
the
(or molecule of gas). A more realis-
tic model for a gas would have the molecules lying in
Rd
rather than
Z d : we
will now generalize the Ising model to include this case. We will thus be concerned with the distribution of configurations of particles, with the particles lying in some underlying space point processes. applications Let arbitrary,
S . Such probability measures are often called
It should be clear that this kind of framework will have wide
in describing phenomena from physics, biology, and other fields. S
denote the space in which the particles lie; we let
though in practice this space will usually be
comes equipped with a o-field
(6.1)
if
A I , A 2 e_C
(6.2)
if
A g ~
(6.3)
there exist
~
, and let
then
and
~
S
be fairly
R d . Suppose that
be a subset of
~
S
such that
AIUA 2 g C ;
A g ~
with
A C.% A
then
A n g =C , n => i , such that
A g ~
S =
;
A n , and such
~_~ n > I
that if
A g C
then
A ~
A
for some
n .
n
Usually we will have
S = Rd
Borel sets. Our basic space on
(S,~)
(6.4)
S X
the Borel subsets of
and
C
the bounded
will be the set of all integer-valued measures
satisfying:
x(A) < oo
Rd
for all
A c C .
x
88
(By integer-valued
we mean taking values in the set
thus made the assumption particles
lying in
that if
A g ~
Already an implicit
(6.5) all
is separable,
~.~
nm 6x
1
Xl,X2,...
in any
i.e.
This is justified because we now add
~
is countably
generated,
and
{t} g ~
for
, where
A g ~
6x
is
the
elements of
S
; also
6
~ 6
x
which has
if
y
~
: B ~
n
m
A } , considered
defined by
replaced by X(A)
"A g C
FA = ~(S-A)
for
x
m
and
measures
with
Ar
on
, and
B g S
~A
= x(B) A ~ ~
for
" let
'
for
x.'s 3
F(A)
, where
{~A}AgC
. For
(A,SA)
A E ~
let
A , and let
which satisfy
(6.4)
F (A)
be the o-field
{ x e X(A)
: x(B) = m } ,
be the o-field of subsets of
PA : X
to
m = 1,2, . . . .
A"). Let
generated by sets of the form
~(A) = (pA)-l~o(A)) PA(X)(B)
of
, nm e {0,1,2,...}
as a o-field of subsets of
denote the se~ of integer-valued
m = 0,i,...
x
x ~ y . This measure clearly corresponds
We now need to define the o-fields
of subsets of
mass at
with only a finite number of
at
"A g C"
consists exactly of elements
point
particles
~A = { B g ~
given by
X
m
are distinct
the configuration
with
has been made between configurations
(6.4) and (6.5) it is easily seen that
m>
(with
measures.
S .
t g S .
the form
X(A)
at the same point of
i.e.
that
(S,~)
Under
and
.) We have
then there can only be finitely many
identification
and integer-valued
the hypothesis
{0,i,2,...,~}
A . Note that we have not excluded multiple occupancy,
there can be more than one psrticle
of particles
6.2
~X(A)
x g X , B c S A . Now let
X
is the projection F = ~(S)
and
. This gives us a collection of o-fields with the right
,
89
properties,
with
from outside
~A
representing
the events
6.3
that are observable
A .
As in Section 5 we would like methods of constructing the models we now want to consider it will be necessary definition of a specification to (2.14)).
(i.e. as a collection
are actually hard spheres of diameter configuration
in which two particles
that in other models a catastrophic
satisfy
specifications.
For
to use the more general
of kernels
satisfying
(2.10)
There are two reasons for this. The first is that in some models we
want a priori to exclude certain configurations;
conditioned
(or measurable)
(in fact most realistic models),
than
any
r . The second is
certain configurations
have
and so we do not want to define probabilities
on them. These configurations
(6.4), the number of particles
are usually
in
A
those that, although
increases
effect of one of these configurations
force at all points of
if the particles
r , then we do not want to consider are closer together
effect on the system,
and tile resultant
for example,
S . (Of course,
too quickly as
they
A~S
,
is to produce an infinite
the set of such configurations
would have
zero measure with respect to any probability measure having some physical meaning.) The best way to handle these situations ized definition
appears
to be to use the general-
of a specification.
We take the same approach as used in Section 5. First we work backwards and get a representation
for certain specifications
starting with a suitable potential, Some notation: projection of
PB,A(X)
Lemma 6.]
from
X(B)
if onto
. An important
Let
A , B E S
we try and constuct
A , B g ~ X(A)
in terms of potentials;
. If
with
A~
x g X(B)
B
a specification.
then let
A~B
= @ . Then
PB,A
denote the
then we will write
technical fact about our present
with
X(AUB)
then
xA
instead
set-up is:
is isomorphic
to
90
X(A) x X(B) we have
Proof
If
under the mapping
F (AUB)
= _FFo(A) • ~o(B)
.
then we write
A g ~
for the corresponding
suppose we have a finite measure
Suppose also we are given
R A ~ ~f~ , and that
denotes the zero measure, RA = PS-A(RA ) Let
then
{fA}Ag C
0 g RA
RA c ~o(S-A)
and
element ef
for all
I fA(xs-AXY)
(X(A),~o(A)).
A g ~ Since
(where RA~
~A
0 ' if
9
functions. We can, as in
~A : X x ~----->R
dwA(Y)
on
R A = (Ps_A)-I(RA)
be a collection of Z-measurable
~A (x'F) =
wA
i.e. the absence of any particles).
Section 5, attempt to define a kernel
(6.6)
yXz
. For each
we put
(XA,X B) ; also, under this isomorphism
Easy exercise.
y g X(A) , z g X(B)
X(A~B)
x ~->
6.4
by
'
G
where
G = { y e X(A)
X(S-A) x X(A) . For
(6.7)
fA > 0
(6.8)
~fA(zxy)
(6.9)
fA(x) = 0
If (6.7),(6.8)
: xs_AxY e F } , and where we have identified ~A
X
and
to satisfy (2.10) and (2.11) we need
for all
A ~ C ;
d~A(y) = 1
if
for all
z g RA ' A g C ;
x ~ RA , A ~ C .
and (6.9) hold then
~A
also satisfies
(2.12) and (2.13)
; thus
6.5
91
for
~ = {~A}Ae~
(2.14), i.e.
to be a specification with respect to ~ = {RA}AE~
~%wA = w%
whenever
A~%
we need
g ~ . So far this looks much the same
as in Section 5; however, in showing that (1.7) is equivalent to (5.4) we needed tile lact that
~
= ~AX~_A
, and it would ~mke things a lot easier if this was
still true in the present case. We will call the measures
{~A}AsC
independent if whenever
A1 , A2 a
~AlXWA2
(under the identification of
with
AI~A 2 = ~
with
X(A I) • X(A 2) ). Then the same computation which connected (1.7) and (5.4)
then
WAIUA 2
now gives us that, if the
=
{~A}AEc are
X(AIUA2)
independent, then (2.14) will hold
provided
(6. IO)
f~(x) = fA(x) I f%(Xs-AXw) d~~
for all
x g X , A~%
g C .
Again, the converse is almost true in that (2.14) implies that (6.10) must hold for
~-a.e. x~ . Therefore, given independent
{0~}~g~
and
{fA}A~~
satisfying
(6.7),(6.8),(6.9) and (6.10), we can construct a specification ~ = {~A}Ae~
by
means of (6.6). This would be fine provided we have a way of constructing suitable independent defining
{~A}As~
{ ~ A } A ~ . Luckily it turns out that the most natural way of (in terms of a given measure on
(S,~)) mmkes them independent.
After we have obtained a representation of our specifications we will go through the construction of these measures. Suppose then we have
{~A}Ag~
independent and
{~}Ag~
satisfying (6.7),
(6.8),(6.9) and (6.10). Note that if (6.7),(6.8) and (6.9) hold, then a slight modification of the proof of Len~na 5.1 sho~s that (6.10) is equivalent to
92
if
A~%
g C
and
x
~ ~ X
6.6
with
Xs_ A = Xs_ A
then
(6.11) f%(~)fA(x) = f%(x) fA(~)
and
(6.12)
if
A C%
g ~
and
fA(x) = O
((6.9) and (6.12) show that
R
then
cannot be arbitrary if any
exist.) We will obtain a representation for = { y g X : y(S) < ~ } , thus only finitely many particles,
XF
~ 0
{fA}Ag C
are to
{fA}Ag C . Let
consists of those configurations containing
(equivalently, X F
which are supported in some element of exp(-~)
f%(x) = O .
consists of those elements of
~ ). We will make the convention that
.
Proposition 6.1
Suppose that
unique function
V : XF
fA(o) > 0
>[-~,~)
with
for all
A g ~ . Then there exists a
V(O) = O , and such that
exp{V(x) }
(6.13)
fA(x) =
for all
x ~ Z~R
A .
l exp{V(Xs_AXw)} dmA(w)
Proof
This is the same as the proof of Proposition 5.1.
82
The above all looks very much like the lattice models of Section 5. This is not surprising, since we could actually formulate our present situation as a lattice model. The assumptions about sequence
A n , n => I , of elements of
C
are such that there exists a disjoint
=C , with
S =
U n ~ i
An . Lemma 6.1 then
X
93
shows that
X
6.7
is the cartesian product of the
esponding product of the
~o(An)
X(A n) , and that
~
is the corr-
. The model we are looking at in this section is
the same as this lattice model. However,
the present model is, in most cases, much
worse behaved than the models considered in the previous section. As in Section 5 we would like to reverse the procedure that gave us Proposition 6.1, and construct a specification from a given potential. will first construct some independent
In order to define
~A
However, we
{~A}AgC .
it will be convenient,
for
A g ~ , to write
in a different way. By (6.4) there can only be finitely many particles we can write
X(A)
as the disjoint union
~
in
X(A)
A , so
Xn(A ) , where
n ~ O Xn(A) = { x s X(A)
: x(A) = n } . Xn(A)
involving exactly
n
( n
particles,
so
consists of all configurations
Xn(A)
looks something like
in
A•215215
times), except that in the product each configuration occurs up to
(since, for example, in
A•
($,n)
and
(N,~)
~n(A)
denote the product of
n
copies of
ting of a single point); put an equivalence relation (~l,...,~n) ~ (nl,...,nn) ~i = No(i)
for
alence relation;
then
In(A)
is equipped with the o-field which we denote by which we call
A ~
if there exists a permutation
i = l,...,n . Let
In(A)
n!
(with on a
~ = N
"~O(A)
~Tn(A) of
~A " so we can give ~ n ( A )
consis-
by defining
{l,2,...,n}
Xn(A)
. Now
I(A)
with
A
the product o-field,
S_n(A) , This induces a o-field on the quotient space let
or
be the quotient space under this equiv-
is isomorphic in a natural way to
B (A) . Finally,
times
both represent the same config-
uration, and this occurs either once or twice, depending on whether not). Let
A
be the disjoint union
D
In(A)
~n
n>
In(A)
O
, '
94
and give
l(A)
morphic to
the o-field
B(A)
6.8
generated by the
B (A) . I(A)
X(A) ; moreover, it is not hard to see that, under this isomorphism,
(x(A),Fo(A))
is isomorphic to
(I(A),B(A)).
Now there is a natural way of constructing measures on we start with a measure on %(A) < ~ For
for all
n ~ I
(S,S)
:
A g ~ , and let
the product measure
induces a measure
%~n)
let %A
%~n)
=
%A
Lelmna 6.2
Proof
{%A}AgC
(S,S)
on
if
with %
to
(A,~A) .
(~n(A),~(A)) ; let
%~0)
denote
. Then any combination of
. The particular combination
defined by
n! n>
exp{-%(A)}
times)
(In(A),~(A))
(I(A),~(A))
exp{-%(A)}
(Note that the factor
(I(A),~(A))
denote the restriction of
(I0(A),_Bo(A))
which we will use gives us a measure
%~
be a measure on
on the quotient space
will give us a measure on
(6.14)
%
%AX%AX...x% A ( n
the point mass on the single point in the
is thus iso-
0
makes
%A
a probability measure.)
are independent.
We leave this as an exercise.
82
In probabilistic terms we have done no more than (almost) constuct the Polsson point process corresponding to measure
(pA)
-i
* (XA)
on
% : if we identify the measure
(X,F(A)) , then
Section 3); thus if (3.4) holds (with unique
%
g P(F)
such that
PA ( % )
* {~J~}Ac~
C =N = ~A
%A
with the
are consistent (as defined in
, F(A) =_B A ), then there exists a for all
A g C . %
is called the
95
Poisson point process
corresponding
to
6.9
% .
We will now start w i t h a potential
V : XF
and from it try and define a specification. on
(S,~)
x ~ y
as above.
means
there exists
that
x
z E X
(6.15)
There
is a natural
with
V(x) =
~(y)
place any restrictions (5.11).
This would
of
and
V
to
V , because
~
that
V
~
, where
x ~ y
can be w r i t t e n
if
as
that in S e c t i o n 5 this did not
could be defined V
V(y) = -co . ~
in terms of
did not take the value However,
V
using
-~o , but if
it is not that difficult V(x) = - ~
will be called the in__teracLion potential
conditions
are going to be put on
w e deal w i t h will be measurable.
embedding
iA
: X(A)
~oi A
~
corr-
in order that w e can p r o c e e d
The first assumption we have to make is to ensure
everything
~X F
: X(A)
The next question
like
(thus, as measures,
can be w r i t t e n as in (6.15) if and only if, whenever
w i t h the construction.
we w o u l d
is a measure
V .
Various
(6.16)
y
~(0) = 0 . Recall
still be true here if
y ~ x , then
esponding
k
= 0 )
,
it does then things are a bit more complicated. to check that
' where
V(O)
x
with on
(with
X , w h i c h w e denote by
x + z = y ). We suppose
~
~ : XF-----~[-c~
~A = %A
order on
is a s u b - c o n f i g u r a t i o n
y<
where
Let
~[-~o,~)
For each
. We will always assume
>~-oo,=)
that arises
~exp{V(xs_AXw)}
A g ~
there is an obvious
that
is F=o(A)-measurable
is the existence
d~A(w)
that
for each
A g ~
of the integrals
to exist for all
x c Xy
and
.
in (6.13):
A g ~
.
96
6. i0
The standard condition which is used to make this happen is to assume the potential is stable (see for example the account in Ruelle (1969), Section 3.2). The potential for all
V
is called stable, if there exists
n ~ i , x a XF(n)
Proposition 6.2
If
V
, where
N c R
V(x) ~ nN
such that
XF(n) = { x ~ XF : x(S) = n } .
is stable then
lexp{V(Xs-A•
d~A(W)
exists for
all X ~ X F , A ~ C . Proof
Let
we have
x a X F , A E ~ , and suppose that
xs_AxY a XF(n+m ) , and thus
the definition of
~A(Xn(A)) =
V
V(xS_A•
being stable). Let
x(S-A) = m . Then for ~ (n~)N
~ = exp{-X(A)}
(where
N
y c Xn(A) is given by
; then noting that
~ { X ( A ) } n , we get
I exp{V(Xs_Axw)} d~A(W)
s
(expmN ) {X (A )expN }n
~ " {exp(n+m)N}WA(Xn(A)) n~=O
=
m exp{ m N +
X(A)expN }
<
~
82
n&O
We have defined
xA
as a measUre on
(A,~A)
for
x e X , A s S ; but
it will be convenient (to avoid more notation) to also regard it as a measure on (S,S) (i.e. as an element of Similarly, we will also regard A ~
X ); thus we let xA
XA(B) = x(A[~B)
as an element of
X(B)
for
for all A , B E S
B e~ with
B ; it will be clear what use of the symbol is being made in any particular
9
97
6. ii
circumstance. Let
V
be a stable potenti~.; in terms of it we want to construct a
specification, which entails defining
{RA)Ae ~
and
{fA}A~~ . Now corresponding
to (5.12) and (5.13) we can, at least formally, write (6.13) as
exp{gA(x)} (6.17)
fA(x) l exp{gA(xs_Axw) } d~A(W)
where
gA(x) =
~
@(y) ; since this just amounts to multiplying the
r
y<x
yCA)>o numerator and denominator in (6.13) by
exp{-V(Xs_A)} . Of course, this need
make no sense, since the integral could be zero, but it suggests the approach we will take in defining
g~:A x-
fA
For
A
~ e J~ with
A ~
~
define
~[-~,~) b~
glCx)=
0(yl; y ~x~ z(A)~0
(where a sum over the empty set is taken to be zero).
A g~
is clearly ~-measur-
able (and is in fact ~(~)-measurable). Let
R Ao
=
{ x s X : lim g~ (x) A+S
exists
}
~{
x g X : g~(x) A = -~
(where the limit in the first term is required to be finite);
for some
A ) ,
6.12
98
and define
g
A lim g~(x)
: RA~
+
o
R A = { x c X : xS_A•
a RA
such that
+ and
put
for all
gA(xs_A•
w s X(A)
=< N m
. N o w let
, and there exists
for all
w ~ Xm(A)
N e R
, m --> i } ;
-
R A = R A n R A , where
RA = { x a X : r
> -~
for all
y c ~,
with
y < Xs_ A } .
+
Clearly
R A ~ ~i
L e m m a 6.3
; note that
+ x g RA
If
The existence
b e i n g stable implies that
X F ~-. R A .
then
exp{-~(A)} s
Proof
V
lex~{gA(xs- A•
of the integral
d~A(w) < ~
follows
of Proposition
6.2. The lower b o u n d comes
exp gA(Xs_A•
= i
and
mA({O})
from the estimates
in the p r o o f
from the fact that
= exp{-~(A)}
(with
0
the zero m e a s u r e
in
x(A) .
-
L e m m a 6.~
If
Proof
A~-~
Let
A ~-- A e ~
e ~
and
then
RA~R~
+
+
e-_ R A .
x ~ X , z e X(A)
; if
it is easily seen that
(6.19)
g~(Xs_A z) = g~(~s_A•
+ g~(Xs_A)
E ~
with
A~A
then
99
6.13 ~
Suppose that
x s RAaR~
w ~ X m (A) , m => i ; let also we bays
+ S
g~
~ let
N
be such that
m ~ = x(A-A)
. Since
(xS_A) < ~ . Thus if
g A ( x s _ ~ ' ) a Nm
x s RA
z s Xm(A)
for a l l
we have
g~(x S A ) > -~ ;
xS_A•
o e R A , and letting
then
in (6.19) we get
~
A g
(XS_A>;z) = g~(XS_A•
- gA(xs A) ~
<= N(mo*~) + IgA(~)1
<= { Nm o + IA(~s_A)l }m
;
+
and hence
x s RA .
Proposition 6.3
82
S~aDpose that
V
is stable and define
exp
fA : X------~R
gA(x) if
~exp{gA(xs_AXw)] (6.20)
xeR
A ,
d~A(W)
fA(x) =
0
[Then
by
{/t}Ae~
satisfies (6.7),(6.8),(6.9) and (6.10); and thus defines a
specification via
Proof
otherwise.
(6.6).
It is clear that
{~}Aa~
satisfies (6.7),(6.8) and (6.9). Instead of
checking (6.10) we will show that (6.11) and (6.12) hold. We first look at (6.12). Suppose bilities:
A~A (i)
s C
and
~(x)
x @ R , (ii) A
= O
for some
_ x c I{A , but
x e X ; then there are three possi+ ~ ~ R A , and (iii)
x s R A , but
100
6.14 ~
gA(x)
....
If (i) then either
x ~ R~ , or
x ~ RUA and
gA(x)
= -~
; if' (ii)
+
then by Le~na 6.4
x ~ R~ ; and if (iii) then
all cases we have
f~(x) = 0 , ~ d
A f~A
e ~ , x , ~ c X
with
x
of the above equation gA(x)
X
are zero;
if
= -~ , and so we can assume
greater
than
zero.
; we want
= fA(x)fA(R)
and
gA(x)
gA(x)
= -~ . ~lus
We now look at (6.11);
R A n R ~ , since otherwise = -~
then,
fA(~),
fA(x),
both sides
as above, we w o u l d have ~(x)
and fA(~)
are all
~
gA(~) + gA(x) = g^(x) + gA(~) .
But this holds
e C
since for shy
with
~ A
we have
~
r w<=~
wS_y
w(~-)>o
w(.~)>o
_>---" w
s
w(A)>o
ws w(A)>O
r
+ 7_~ w
w(A)>O
wS_~ w(A)>O
r
let
to show that
Thus what we need to show becomes
-
in
.
are in
that
and
(6.12) holds.
xS-A = xS-A
fA(R)f~(x)
We can assume that both
hence
x ~ R~
+
~
r
w 5_Ys_A
w(]O>O
w s ~S_A ~(~)>0
(writing
x~=y
,
101
6.15
~
=
4(x)
Let us now look at some examples; what follows is mostly Ruelle Borel
(1970) 9 Let
S = R d , =S
sets. We will take
(S,~)
and
a
X
r
is translation
with
@
t o be
is a p o s i t i v e
r , by which we mean that
be the B o r e l subsets
real number.
r
invariant.
= 0
m
R d , and
denotes
~
from
the bounded
Lebesgue m e a s u r e
on
Suppose that we have a pair p o t e n t i a l
whenever
x(S) > 2 ; and suppose
Then there e x i s t s
even (i.e. ~(u) = r
[
am , where
of
adapted
6 e R
and
@ : R+
also that .~(-~,~]
), such that
r
= 6
if
r
= -@(Ul-U2)
x(S) = i ;
(6.21)
[
to
from
by
x(S) = 2 , where
the two points
Note that changing X
if
~
{exp(6-B)}X am
to
to
6
Let
and
V
assume that
denote the p o t e n t i a l
u2
are
x .
has exactly the same effect in (6.6)
. Thus we can, and will,
(exp6)am).
in
uI
6 = 0
as c h a n g i n g
(by c h a n g i n g
defined in terms
of
X r
(6.15). The first thing we must now do is to find conditions
that
V
Section
is regular. 3.2,
The conditions
on
~
to ensure
that w e state are taken f r o m Ruelle
(and proofs of the assertions
em~ be found there).
V
w i l l b e reg-
ular if any of the following hold:
n
(6.22)
~
n
~
~(~i-~j)~ o
for all
=
n > 1 , Ul,...,u n ~
i = l ~ = l
(6.23)
~ >= ~ ,
where ~
is positive
definite;
(1969),
Rd
6.16
102
(6.24)
there exist
0 < rI < r 2 < ~
42 : [r2,~)-------~(0,~)
il
td_l~l(t ~ 9
, with
dt
=
~
@i ' @2
i
,
0
and if
r
i~f
'
decreasing, with
td-l@2(t) dt < ~
;
>-~
, *(~)hr
for
lul ~ r 2 9
0&lul
for all
& r I , and
Rd
I@(u) l ~ l~2(lu I)
~
~i : ~ 0 ' r l ~ - - - ' ~ ( O ' ~
r2
u~:
If
and
is upper se~Licontinuous then (6.22) is equivalent to
V
being stable.
(6.24) is the most useful in practice; it is due to Dobrushin, Fisher and Ruelle. Physically it says that the interaction has to be both sufficiently repulsive at short distances, and to die off quickly enough at large distances. If
V
is stable then we get a specification from Proposition 6.3. We
would like to know when
G(V)
of
let
R d , and for
e ~ N
satisfy (3.1),(3.2),(3.3)
is non-empty. Let B
==~
N
denote the compact subsets
= =F(e) . It is not hard to show that
{B0}es N
and (3.4); tbus we are in the set-up of Section 3. For
most potentials the hypotheses of Theorem 3.1 fail; but there is one class of potentials for which everything is fine. This is the extreme case of an interaction being repulsive at short distances, when there exists ~(u) = ~
for
r > 0
such that
0 < lul < r . If this happens then we have a hard--core potential,
and this will describe situations in which the particles are hard spheres of diameter
r
(with the points in our configurations representing the centres of
the spheres). Suppose that
V
is a hard-core potential, that
inf us R d
and that
@(u) > -~ ,
103
(6.25)
there exists
and a positive
s > r
6.17
I~(u)l <__%(lul)
~ td-l~o(t ) dt < ~ such that
~o : [ s , ~ ) - - - - - ~
decreasing
for a l l
lul
>s
.
S
Then clearly
(6.24) holds, thus
a specification (3.6),
is regular and so Proposition
V . It is fairly easy in this case to show that
(3.8) and (3.11). %]n~s by Theorem 3.1 we have
Theorem 3.2 Rd
V
~(Y)
has the sequential
acts in a natural way on
X , and
d . . and the fact that R -invarlance set of points
in
Rd
~
property.
Further,
is Rd-invariant.
coordinates)
V
satisfies
is non-empty~
and by
the group
Thus by Theorem 4.3
d 9 is the same as Q -znvariance
having rational
For proofs of these results,
compactness
G(=VJ
6.3 gives us
we get
(where
~(V)
and much more about the hard-core
Qd
is the
is non-empty. case, see Ngu~en
and Zessin (1975).
If the potential
is not hard-core
then things are much harder,
most cases (3.6) and (3.8) will fail to hold. However, then (3.7) holds with
x = 0 , and for suitable
if
{Um)m> I
r
satisfies
(3.12)
(with
and in (6.24) x = 0 ),
(3.13),
(3.14) and (3.15) will also hold. We will look at how these facts are
proved.
All the hard estimates needed are taken from Huelle
(1970).
Note that since we are dealing with a pair potential we have
y s xR
y ~ xA
y(A)>0 Let
k
> 0 , B n > 0 , with n
~
~
v g x~
y(A)~(~-A)=z k
+ ~ n
as
n + ~ ; and for
n > i
let
104
W n = { x s X : x( An-An_l)
=< ~n } ' where
at the origin
A 0 = @ ). Let
(and where
6.18
is the cube with
An W =
~ ~
side
W n , (thus
W
2k n
centered
restricts
the
n~l number
of parSicles
@2 : [ r 2 ' ~ ) to the whole
Lemma 6.5
in various
regions
of space).
)(0,~)
; it will be convenient
R
r
of
by defining
Suppose
@
satisfies
In (6.24) we are given a function
to extend the domain
= @2(r2)
(6.247
and
for all
~
of' this
function
t < r2
r
< ~
for all
n>l ~ 0 . Then for any
Proof
Let
C
e A , ~ ~ C for some on
be a cube, then
)[ ~ C
for which
such that
An ~ o
W
R Ao
centered
and
g~
at the origin,
I~-nl >__ r 2 ; suppose
then it will happen
{ x ~ X : gA(x)
x s W
A c C
that
for all
C A ~
)gA
uniformly
with
C OA
has side C . Thus
= -~ ) , and so we need only concern
gA( x) ~ -~ C ; let
for all
n => no
and
~ e =C . Let ~ ~ A~
no
--
An ; suppose
r
and thus
- g~(x)
< m~n
2c . If A g~[ ~ g
A
A g~(x)
ys y (A )=y (Am+l-A m )= i
that
if
= -~
uniforlmly
with those
be the smallest
y%x~ y(A)=y(~-~):l
IA
W .
and such that
ourselves
Then from (6.26)
g~(x)
on
A g~(x)
integer > -~
105
Therefore,
if also
x c W
then (since
AIx) I
<
x(A) ~--~
@2
6.19
is decreasing)
@2(km-C+r2)~m+l
.
m > n
But
x(A) <_ ~
~n ; and thus
gA
~gA
imiformly on
W .
82
n>n o
Lemma 6.6
Proof
If the hypotheses of Lemma 6,5 hold then for any
Let
x s W , then by the proof of Le~rma 6.5 we have
w ~ X(A) . Also, for
A ~C
(where
g~(Xs_AXw ) = g~(Xs_A• A
w(A)
C
A c ~
W C_ R~ .
xs_Axw E R oA
for any
is the same cube as in Lemma 6.5), we have
A - gC(Xs_A•
@2(kn-c+r2)~n+l
A + gC(Xs_A•
+
A gC(xS_AXW)
.
n ~ I
A gC(Xs_AXW) = gA(x c _Axw) , and since (6.24) implies stability, there exists
But N >_ 0
such that
gAc(xS,AXW) ~ N{x(C-A)+w(A)}
difficult to check that x(C) <__ ~ n<
gn
and
N
for all
can he chosen independently
inf ~(u) > -~ ). Thus, since u
n o
exists
~l
N
such that
of
, (and it is not
x e W , because
x(C-A) <__ ~
~n ' there
n>n =
A g~(Xs_A•
can b e c h o s e n i n d e p e n d e n t l y
w e X (A) , and hence m
w ~ X(A)
_<_ Nm
of
x s R+ . A
for all
~
x s W ). T h e r e f o r e 82
o
C , w ~ Xm(A) (and note that
gA(xs_Axw) ~ Nm
for all
106
As before _let A =
[J
> -~
D
~
aud
W
are in
B(A~_) denote the set of bolmded
F(@) , an0 let
functions that are F( O )-measurable for some D = { x r X : r
6.20
for all
y c ~.
0 r N . Let
with
y < x ) . It is clear that both
.
Lelm~a 6.7
Suppose treat the hypotheses of Ler~la 6.5 hold. Then for any
F r A
5 > 0
all
Proof
m~d x
E W ND
For
-there exists
f E B(~)
such that
A s ~ ,
I~A(X,F) - f(x) I < 6
for
,
A r ~
with
A ~ A
define
h~ : X
>R
by
A
exp gS(xS_A• d,~A(w) A exp g~(Xs_A•
where
F = { z r X(A) : Xs_A•
to show that x r Wf~D
h~---~WA(.,F)
s F } . Then uniformly on
d,~A(W)
h~ s B(A) , and so it is sufficient WglD . But
WnD~-
R A , thus if
then
A ~ exp g (Xs_A•
~A(W)
A[ ~ exp g ~xS_A•
d~A(W)
~A(x,F) =
and since
Q ~ F (A)
S exp g~(Xs_A• A
d~A(W) ~ exp{-~(X)}
we need only show that for any
107
A I exp g~.(xs_A• %(,.)
> ]' exp gA(xs_,x,.,) %(w)
G
uniformly that
G
in
x s W~D
lexp(~)
lexp
. Using the uniform
- exp(q) I ~ l~-qlexp
A
(max{~,q})
from Lenma 6.6 and the fact
, we get
- exp g (~sA•
=< Ig~(Xs_A•
- gA(xS_Axw) I exp{~w(A)}
6(X)~(A)exp{Nw(A))
I
estimates
A
g~(Xs_A•
The proof
6.21
is thus complete,
w(A)e:>mp{iYfw(A)}
, where
6(X)'--'~O
as
~--'~S
9
because
dtOA(W )
~
=
m exp{Nm}
{I(A) )m ml"
<
~
82
m>=l
Let now
kn = n
An-An_ 1 ). S i n c e
~2
6 n = (2n) d - (2n-2) d , (so
is decreasing
< ~"
@2(n-~)nd-i
and
for all
and
It)2(t)td-1
> 0 , and thus
is the volume
of
we h a v e
~J2(n-~)~n+l
< ~
for all
n>l
e > 0 . For
n > 1 , m > 1
let
Un, m = { x E X : X(An-An_l)
I
9 Then by Lemma 6.7 the
Le~ma 6.8 exists
dt < ~
~.~
n > 1
6n
The
In :~ I
U
m
satisfy
} , and let
(3.13).
m
satisfy
such that
U
< m6n
(3.12) with
~A(0,Um)
~ i - ~
x = 0 , i.e. given any for all
A s ~ .
8>0
there
108
6.2~ ~n
Prcof
Fix
n ~ I
for the moment; we can write
Cj , where tile
An-An_ I = j=l
C. J
are disjoint, and where the closure of each
C. J
is a cube with unit side. Now
by Ruelle (]-970), Corollary 2.8~ we have there exists only on
~'2 ) such that for any
wA(O,E(M))
<
A e ~
and
p > 0
and
q ~ 0 (depending
M > 0
exp{-(pM2-q)~n }
,
n where
E(M) = { x ~ X :
{x(Cj)) 2 > M2Sn } . B u t
j=l ~n U
=
n~m
{xcX:
n < mS n } 3 {
x(Cj)
x E X :
j=l
{x(Cj)}2 < m28n j=l
(by Schwarz's inequality)
=
and thus
x
-
S(m)
;
IrA(O,Un,m)~ i -- exp{-(pm2-q)~n }
9 Therefore, since
WA(O,D)
= 1
we have
~A(O,~) & 1 - ~
e~{-(pm2-q)Sn }
n>l
and clearly
~
exp{-(pm2-q)~n )
-7 0
as
n--~
.
82
n ~ i
Lemma 6.9
(3-7) ho]ds for
x = 0 ; i.e. given
8 s I~ and
y > 0
then there
,
109
exists
m ~ P(~0) , A ~ ~
and
6 > 0
w~(O,F) < y
for all
Proof
e ~ N ; we can write
Let
6.23
such that if
F ~ _~
with
~(F) < 6
then
A O A .
X
as the disjoint union
~
Xn(e) , where
n>O Xn(8) = { x s X : x(8) = n } ; and we have measure to put on
(X,~)
(%1
is
(pc)
-1
Xn(e) s B=~ . The natural reference
*
(le) ; and
(x o) = exp{-x(e)}
~
~
,
n>O
(n)
where
~e
WA(O,')
= (ps)-l(l~n)){ " Let
as a measure on
(X,~)
A e =C
with
A~
8 ; let
m => i . Consider
, and look at the restriction of this measure to
X (e) . Theorem 0.2 of Ruelle (1970) shows that this has a bounded density with m respect to
~e(m) (and the bound is independent of
Lemma 6.8 shows that there exists
on than
~
> 0 , with n ~
A ). Further, the proof of lim ~ = 0 n n-~
e ), such that the probability (with respect to n
particles in
e
is less than
a
n
(and depending only
WA(O,.) ) of finding more
. From this it easily follows that (3.7)
n
holds with
im! ~8(m)
~ = ~
for suitably chosen
6
m~d
n
m = 1
Pr_~osition 6.4
If
@
satisfies (6.24) then
!(X)
is non-empty.
82
110
Proof
This follows i ~ e d i a t e l y
6.24
from Theorem 3.3, using Lemmas 6.7, 6.8 and 6.9.
82
The proof that (3.14) (resp. (3.15)) holds is much like the proof of (3.12) (resp. (3.7)). The proofs are, of course~ much harder; and a more careful analysis of Rueile's estimates is needed. We leave the interested reader to make this analysis. Given that (3.14) and (3.15) hold we have that the conclusion of Theorem 3.4 is valid, i.e. we have the sequential compactness property for those for which
~(U ) = i , where
U
=
~ m>
D e ~(~)
Um . (Those random fields with
p(U ) = 1
i
correspond to what Ruelle calls the te___m~ered measures.) Finally, the remarks after Theorem 4.3 show that if
@
satisfies (6.24) then translation invariant Gibbs
states exist, ~ud the translation invariant tempered Gibbs states also have the sequential compactness property.
It is possible to show that Gibbs states and translation invariant Gibbs states exist, and that the tempered Gibbs states are sequentially eompact~ under less restrictive conditions than having a pair potential given by
~
satisfying
(6.24). Ruelle's estimations are done for a class of interactions which contain what we have considered as a special case, and the proofs we have given also work for this class of interactions (the superstable, lower regular interactions). details we again refer the reader to Ruelle (1970).
For
.~. S~ecific information ~ain
We have described a framework which is supposed to have something to do with the equilibrium behaviour of models from statistical mechanics, and as we stated before this c~u only really be justified by constructing the appropriate dynamical systems and looking at their equilibrium states. However, some justification for these definitions would be obtained if we could show that the Gibbs states we have defined satisfied suitable variational principles. For example, for the Ising model (as we have defined it in Section 5)~ given by a zd-invariant potential satisfying the hypotheses of Proposition 5.2, Lanford and Ruelle (1969) have sho~m that the d
.
Z --~nvarlant Gibbs states are exactly those zd-invariant random fields which minimize the specific free energy. ~lis is a variational principle which was first formulated by Gibbs, and is generally accepted as being the right one for this kind of model. Lanford and Ruelle's result has been given other proofs, and extended to more general lattice models (with
Yt = {0,i}
replaced by an a r b i t r a ~
finite set)
in Thompson (1973), Holley (1971)~ Spitzer (1971b) (for 1-dimensional models), and F611mer (1973). Fr
showed that the variational principle can be restated in
information theoretic terms (an approach which was also used by Sullivan (1973)). For zd-invariant random fields h(~,v) , of h(~,v) = 0
~
with respect to
if and only if
~
B , v
he defined the specific information gain,
v , and proved that if
v
was a Gibbs state then
was also a Gibbs state.
In this section we will show that F611mer's result goes through in our present set-up provided the specification is nice enough. We will define the specific information gain, h(~,v) , for H-invariant randcm fields, with
H
~ , v s Fo(~) (where
a suitable group), ~ d
Po (F) =
are the
show that if
v s G(V)
112
and
~
satisfies certain conditions, then
7.2
h(v,v) = 0
if and only if
V c G(V)
.
To give an idea of what we are going to do let us state the results for the lattice models considered in Section 5. Suppose that
S = Z d , that the
are all copies of the same space
~
(Yo,F=o) , and that
(Yt,~t)
is a specification given
by a potential satisfying the hypotheses of Proposition 5.2; suppose that zd-invariant in the sense of (5.17). If subsets of
Z d ) and
(X,~(A)) ; if
p e P(~)
~ , ~ ~ P(~)
then let
A e ~ (and in this case ~A
~
is
is the finite
denote the restriction of
~
to
then 3et
d~
if
ZA
is absolutely continuous
t d~ A 1
~A(V,9) =
with respect to
~A ' and the integral
exists, otherwise.
Let
1 --
hA(~t,9) =
)
IAI show that if
if and only if
of cubes in
.
.
D , 9 s Po(~)
with respect to
the existence of energy.
.
lim hA(B,~) = 0 A
lim hA(B,~ ) A
cubes), then we denote the limit by ~
A
We will
v E G~V) , then
.
(where the limit is taken over the set
~ ).
If for
gain of
IA.I denotes the cardinality of
~ , ~ s P (F) (the zd-invariant random fields), and O
e G(~)
where
'
h(~,v)
h(w,~)
exists (where the limit is over all
and call it the specific information
~ . in the next section we consider the question of
and also its interpretation in terms of the specific free
113
7.3
For simplicity we will start by working with what looks like a general lattice model, (though it will include most of the models considered in Section 6). At the end of the section we will indicate the modifictions required for other models. Until further notice (where
d ~ i
meaning as
~
will denote the non-empty finite subsets of
is fixed), ordered by inclusion, i.e.
A ~" A . Again we have a basic set
we are given a ~-field
~(A)
measurable from inside
A ), such that
A e ~
we let
~
will assume that
= ~(zd-A)
of subsets of
, 8rid take
Z = o (~Z(A)
I
Tt : X - - - - ) X
=
now has the same
X , and suppose that for each
ACZ d
X (representing the observables
~(AI) c ~ ( A 2 ) ~(Z d)
whenever
AI~A
to be the basic o-field
2 . For ~ . We
.
We will further suppose that for each bijection
A ~ A
Zd
t ~ Zd
we have an F=-measurable
such that
(7.1)
Ts Tt
Ts+ t
for all
(7.2)
T0(x) = x
(7.3)
Tt(Z(B)) = Z(B+t)
where
B+t = { s+t : s ~ B } .
for all
s , t s Zd ;
x e X ;
for all
B~Z
d
t e Zd
Most of the objects we will look at will be translation invariant, i.e. zd-invariant in the sense of being invariant under the group
Some notation: if P(~(A))
; if
~ e P(Z d)
A~Z then
d ~A
{Tt}taZd .
then we will often write
P(A)
will denote the restriction of
instead of ~
to
~(A)
,
114
i.e.
PA e P(A)
we have
is such that
Pzd_A = rA(~)
then we will write ect to
~A(F) = B(F)
. If
kl, k 2
k I << 12
X 2 , and if
7.4
for all
are finite measures
to mean that
kl << 12
F s ~(A)
kI
; thus for
A r
defined on the same o-field
is absolutely continuous with resp-
dk I ---- will denote the Radon-Nikodym derivative
then
dk 2 of
kI
with respect to
Let
12 .
p , v a P (Z d)
and
A s C ; define
0
log
~ dp dv A /
if
PA << VA
and the integral exists,
otherwise.
I Let
hA(~,v) =
lira
hA(~,v)
~
)
where
IAI
denotes the cardinality of
exists then we denote the limit by
h(p,v)
A ; if
and call it the s~eeific
AtZ d information gain of limit to exist if
p A
with respect to
v . In fact it is too much to expect the
ranges over all elements in
C , so we will restrict the
above limit to be taken only over the set of all "cubes" in ~e mem~ any element of
{ (hi,..
C
9
C , where by a cube
that is a translate of a set of the form
, n a) E Z d : I n i l
for some non-negative integer
<__ N , i = 1 , 2 . . . . .
d }
N . It is not hard to check that
~A(~,~) _t 0 (and
we will verify this below), and thus we have
h(H,v) k_ 0 (if the limit exists); we
are interested in the consequences
h(~,~)
of having
: 0
115
7.5
For the first result of this section we will require our specifics~ions to satisfy a condition which looks something like (3.8); and for the lattice models of Section 5 it will be equivaAent to (3.8). As usual let
~ =
~(A)
, and then
Ac~ the condition we will need is:
(7.4)
given any
A~zd-A
A s ~ , F e A
and
and an ~(~)-measurable function
Another way of putting this is: let from
X
to
R
that for each If
V
> 0 , then there exists
BA(~)
g : X---~R
and
F e A
ZA (',F)
such that
with
IZA(-,F)-g I < 6 .
denote the set of bo'muded functions
which are F(~[)-measurable for some A e C
A s ~
A ~ zd-A ; then (7.4) says
is in the uniform closure of
BA(A)
.
is a specification defined in terms of a potential satisfying the hypotheses
of Proposition 5.2 then the proof of Proposition 5.3 shows that
V
satisfies (7.4).
Our results will thus include the specifications considered by Lanford and Ruelle (1969), FSllmer (1973), and Sullivan (1973). Of course, (7.4) does not include the models from Section 6, but we will be able to deal with these separately after having dealt with specifications satisfying (7.4).
Theorem 7.1
Let
p , v ~ Po(Z d)
V = {~A}AsC
with
be a specification satisfying (7.4); let
v s _G(V) , and suppose that
the lim inf is taken over the set of all cubes in
Let ~A << VA
~ , v
lira inf hA(~,~) = 0 , where A C . Then
~ e _G(_V) .
be as in the statement of the theorem. By hypothesis we have
for arbitrarily large cubes
A , and thus this must hold for all dP A
(since if
~A << VA
and
~ C_A
then
p~ << v~ ); let
gA :
. Now if d~ A
As
C
7.6
116
HA(~,v) < ~
then we can write
I l~
where
r
= (log (
is strictly convex, HA(P,V)
< ~
for ~ ~ 0
gA dPA = ~(log gA)gA dvA = ~ r
> 0 , r and
= 0 . Let
~(~) = 0
~(~) = r
if and only if
,
- (~-i) ; then
~ = 1 . Thus if
then
~A(~,~) = ~(gA) d~=
Ii dp +
Ii dv
dv
~{~(gA) - (%-l)}
Therefore in partiaular
HA(~,v) ~ 0 ; also by reversing the above steps we get
that if
then
I W(gA) d9 <
Now if
A~A
HA(W,v) < ~
and again
then it is easily checked that
HA(~,v) = ~ ( g A
g~ = EV(gAI~(A))
thus by Jensen's inequality
Therefore for all
HA(U,v )
is an increasing function of
A ; thus we must have
A e ~ (since it is true for arbitrarily large cubes). Let
A , ~ c ~
with
A~A
= r
and define
qA,A : X
~R
by
) dv .
,
and
7.7
117
if gz(x) r o
gAu~(x) / g~(x)
,
%,~(x) 0
Thus
qA,A
9-a.e. on
otherwise.
is ~(AUA)-measurable, { x r X : g~(x) = 0 }
(7,5)
Let
A , A s C
gAUA
= 0
we have
gA qA,A = g A l A
Lemma 7.1
and since it is easily checked that
~-a.e.
with
Aria = @ ,
and
let
F
r
~(AUA)
.
Then
~(F) = F
Proof
~(F)
Lemma 7.2
Let
=
I i dWAU~ F
=
I q~,~ g~ d~A~J~ F
A r C
lira ~+zd-A Proof
Let
F
A C_ Zd-A
and
=
~ gAk)~ d V A U ~ F
=
I qA,~ g~ dv " F
F r A . Then
I g~ dv = IWA(',F) du 9 F
with
A e C
and large enough such that
F s F(Ak] A) . Then
118
Let
~ > 0 , then using (7.4) there exists
~(A)-measurable
function
as above~ but also with
=A(.,F)
-
g : X
)R
A ~
=A(.,F)
=
-
=
{~A(.,F)
Ev(XFI~(A))
- gt
and an a
~
9 ~et
- ~({~A(.,F)
- g}i%(~))
.
~-a.e., and hence we can find a version o f
E (XFIZ(A)) I ~ 26
such t h a t t h i s h o l d s everywhere ( j u s t by c h o o s i n g a v e r s i o n such t h a t
IE~(XFIZ(A)) - gl s 6
everrwhere).
Therefore
F
Putting together Lemmas 7.1 and 7.2 gives us that if
A e ~
and
F E
then
p(F)-
~A(',F)dB
=
lim A+zd-A
~ {qA,~ - l}g~ dv . F
But by Proposition 2.2 and the monotone class theorem we have only if
be
Ev(~A(.,F)]~(~))
~A('~F ) - g + g - Ev(~A(.~F) IZ(A))
I~A(.,F) -
A~zd-A
I~A(', F)
such that
=
Thus
with
A ~ A ; then
~(xFIZ(~))
- g}
7.8
p ~ G(V)
if and
119
~(F) = I~A(''F) a~
~erefore
for all
7.9
A ~ g,
F g A 9
the proof of Theorem 7.! reduces to showing that
lim
~ (qA,A - l}g~ d~
Z+z~-A
=
O
for all
A e ~ , F e ~ ;
F
and since we know that the limit exists it is sufficient to show that
(7.6)
lim inf ~ ~lqA, ~ ~ llg ~ d~
:
0
for all
A s __C .
~.+zd-A We will in fact show that
(7.7)
P
lim inf
~Y(qA,~)g~ dv ~
=
0
for all
A e ~ ;
~§
since the following lemma tells us that (7.6) is a consequence of (7.7).
Lemma 7.3
Given
y > 0
ability space, and if
~(f)
Proof
dl < 6 , then
there exists f : Y
~If-l,
.>R
~ > 0
such that if
is any prob-
is non-negative, B--measurable, and such that
dl < y .
This is a simple estimation, comparing the functions
we leave the reader to do it.
(Y~B,X)
82
We now set about establishing chat (7.7) holds.
W(~)
and
I$-iI
;
7.10
120
Lamina 7.4
Let
A , A e C
with
A(~. = @ . Then
d~
Proof
=
i~l~
Lemma 7.5
qA,~ dw
Let
=
~l~
AI, .... An g C
qA,~ d ~ A ~
I {l~
be disjoint, and for
k rk = ~
=
qA,~}gA~)~ dVAk)~
k = 1,...,n
put
n Aj ; l e t
A e ~
with
j=l
D
j-=l
A. C_ A J
Then
n
~(qA k=2
Proof
r
k' k-I
)gr
dv .
k-i
This follows in~nediately from Lemma 7.4 on writing
n
Hrn(~,v)
=
HAl(P,v) + Z {Hrk(~'v) - Hrk_l(v,v)} , k=2
and also using the fact that
HA(~,v ) > H r (~,v) .
82
n
Note that so far we have not used the facts that
W
and
v
are translation
121
7.11
lim inf hA(~,~) = 0 . These hypotheses are now brought in to A
invariant, and that
complete the proof.
Lemma 7.6
For all
A e ~
we have
P lim inf J ~(qA,~)g~ ~§
Proof D A
Let
A a ~
=
O
O
> 0 ; we need to show that if
end
e ~
then there exists
d~
d~
Let
with
{An}n> I
~
zd-A
and
A
is any cube with
A~)~--~A , and such that
be a sequence of elements of
~
with the
properties:
(7.8)
the
(7.9)
for any
translate of
An
are disjoint;
n ~ i
An = A2n-lk)A2n
A ; also if
An
is a translate of
is translated back onto
A
A then
and A2n
A2n
is a
will go onto
A ; N~
J
(7.10)
if
N. = jd
then for any
j > 1
~J
Ai
is a cube.
i=l
(This just amounts to partioning appropriate copy of
A
Zd
into cubes congruent to
A , putting the
into each of them, and then enumerating the pieces in the N.
J
right way.) Let
Cj =
~ i=l
Ai ; then using the fact that
HA(I~,~)
is an increas-
122
ing function of
exists
n ~ 1
A , and that
that
such
7.12
lim inf hA(~,v ) = 0 , it is easily seen that $here A
h C (B,V)
< ~/[AI
9 By Lemma 7.5 we have
n
h c (~,~) ~
V,~) +
n
~(qA
ICnl
k=2
r
)gr
k' k-i
a~ k-i
k where as before
rk = ~
A i . But
ICnl NnlZI :
, ~a
thus
i=l
HAI(~'v)
Therefore
+
~_~ W(qA ,I" )gl" dv k=2 q k k-i k-i
there exists at least one
k
<
~N n
with
2 ! k < 2N
n
and with
k
even, such
that
I Let
D
put
~ = D-A
~(q^ r )gr dv "~k'~k-i ~k-i
be the translate
of d
rk
.
<
6 .
that takes
then the Z -invarlance
of
~
Ak
onto
and
v
A ; then
D D A , and if we
implies that
P dv _
The proof of ~ e o r e m assume that the specification
gFk_ I
7.1 is now complete. was zd-invariant,
Note that we did not have to
though in practice we will have to
123
assume this in order to know tbat
G (V) ==o --
7.13
is non-empty,
except for (7.4) the set-up we have considered in this section
Now
includes
the models from Section 6. To see this, just partition
Rd
side; then in a natural way these cubes are indexed by
Z d ~ thus we can write them
as
Z d ) we associate
{Cj}j~zd
set
1
. Then to each
I C. J
of
A e ~ (a finite subset of
up into cubes with unit
R d . It is not difficult to check that (7.4) holds
the sub-
for a hard-
t sA core potential
given by
~
satisfying
potential with finite range (i.e. r >
if
(6.25); and it clear
= 0
for all
0 ); but it will not always hold for a potential
Irl ~ r ~ , for some
given by
@
satisfying
O
(6.24). However,
the only use made of (7.4) is in the proof of Lemma 7.2, and an
examination
of this proof gives us (using the notation of Section 6):
Proposition
7.i
(6.24); let that
in
Let
~ , v ~ Po(~)
lim inf hA(~,v) A R d . Then
Proof
~
be a specification with
given in terms of
~(U ) = v(U ) = i , and with
r
satisfying e G(V)
. Suppose
= 0 , where the lim inf is taken over the set of all cubes
~ e G_(V) .
Looking at the proof of Lemma 7.2 we see that the only time (7.4) is used
is to show that if
A e C
and
F e A
then
A+zd-A
It would thus be sufficient
to show that for each
exists a version,
E (XFIF(A))
g~ , of
with
A s C
0 < g~ < i
with
A C_ zd-A
and such that
there
124
lim
g~(x)
=
WA(X~F)
7.14
for all
x s U
~+zd-A Let
g~(x)
where
=
IWA(XA•
dPA(V)(z) '
A = zd-(Ak)A) , PA
N~ : X(A)•
>R
with respect to
is the projection from
X
onto
X(A) , and
is a version of the Radon-Nikodym derivative of
p~(~)•
. It is easily checked that
E (XFIF(A)) , and since we can assume that
~N~(y,z)
g~
P d (~) Z -A
is a version of
dPA(~)(z) = i
for all
y s X(A) , we have
IWA(X,F) - g~(x)I
II~A(X'F)- ~A(XA• But for any
as
m > I
A--@zd-A
we have, uniformly for
dPA(V)(z) z s U
m
. Thus we would be done if, given any
, that
WA (x;•
)hA (x'F)
6 > 0 , there exists
m~l
such that
lim inf A+zd-A
I U
N~(x~,z) dPA(V)(z)
~
i -
m
We leave the reader to check that, using Puelle's estimates given in Section 6, it is possible to choose
N~
with the above property.
82
125
7.15
We will now look for situations in which the converse of Theorem 7.1 is true; i.e. when we have
(7.11)
if
~ , ~ e G o(~)
then
lim hA(~,~) = 0 A
o
We will be mainly interested in situations which apply to the models of Section 5, though at the end we will consider the set-up of Section 6. The first case we look at is quite simple, but it gives the idea of why (7.11) might hold. We will say that A~
~ = {WA}Ae ~ A s ~
has bounded ranse if there exists
with
Is-tl > r
Z(AUA)-measurable
P~_~oposition 7.2
for all
Let
(7.12)
there exists
(7.13)
~(AU~)
Then
for all
such that if ~A(.,F)
is
F s ~(A) .
such that
= ~{~(A)UZ(~)}
lim hA(~,~ ) = 0 A
s ~ A , t e zd-A , then
~ , v e ~(~) , where
N
r > 0
~
hA(B,v) ~ N
for all
has bounded range. Suppose that
for all
A e ~ ;
A , ~ s ~
(where the limit is taken over the set of all cubes in
~ ).
(Note that we do not assume that the measures or the specification are translation invariant.)
(7.13) is a condition which is satisfied in all the examples we have considered; we will thus assume that it holds for the rest of this section. An examination of the proof of Lemma 7.2 shows that, with (7.13), we can replace the hypothesis that (7.4) holds in Theorem 7.1 by the slightly weaker hypothesis:
7.16
126
(7.14)
if
A s ~ , F ~ ~(A)
~(A)-measurab!e
function
and
8 > 0
g : X
>R
then there exists
such that
, rather than for all
F e ~ .) If
~
and an
]WA(',F) - gl < 8 .
((7.14) differs from (7.4) in that we only have to approximate F e ~(A)
A ~- zd-A
WA(',F)
for
has bounded range then of course
(7.14) holds, and thus the conclusion of Theorem 7.1 is valid. The crux of the proof of Proposition
7.2 (and the key to understanding when (7.11) should hold)
is the next len~r.a.
Lemma 7.7
Let
A , A e ~
Z(A)-measurable
Proof
Since
F e F(A)
for all
with
F g Z(A)
A~A
= @ , and suppose that
. Then for any
HA~j~(N,v) ?_ H~(~,v)
~ , v e ~(X)
we can assume that
~A (. ,F)
is
we have
H~(~,v) < = . Let
, F e F_.(A) ; then
~(Ff~;F) -- I • d~ = [ wA(x,F) d~j(x) = ; ~h(X,F) g~(x) d'o(x)
=
f~
g~ dv
(by Lemma 2.2).
FnP
Thus
P ~(A) =
~ g~ d~
for all
A a ~ , where
A
n
]2 = {
~) k=l
(.~IknFk) :
F k e =F(A) , Fk r F(A)
; (Fkf~Fk)gi(Fjf~F j) = ~ if k#j }.
127
But
D
is a field, and by (7.13) we have
p(A) =
~ g~ dv
for all
A e F(AkJA)
7.17
~(AUA)
; i.e.
= q(D)
. Therefore
~tA~A << VA~)A
and
gAUA
= gA "
A Hence
log g A U l )
d# = ~ ( l o g g~) du = H~(~,v)
The proof of Proposition cube of side
b
(where
bounded range), and let . Then
WA(-,F)
7.2 follows easily from Le~ma 7.7: let
b > 2r , w~th A
.
r
as in the definition of
be the cube of side
is ~(A--A)-measurs~le
for all
b-2r
~
A
having
having the same centre as
F e F(A) , and thus
H~(~,V) = HK_A(~,~ ) . Therefore
I -AI h~(~,v) =
~
I~1
bd-(b-2r) d h~_A(~,v )
2dr =<
b
and hence
h~-A(~'~)
hd
2&r h~_A ) , (~'l.l,v
< ~
-
lim h;(~,v) = 0
'N b
o
We must now find out when (7.12) holds.
Lemma 7.8
(7.15)
Suppose that there exists
w{t)(x,F) ~ ~w{t}(Y,F)
a
for all
>
I
be a
such that
x , y ~ X , F a ~({t})
, t e Zd .
128
Then for any
Proof
p
, v c G_(V) , A c C
First note that for emy
i/~ & g{t} A a ; so
Let
thus
M(F) <__a~{t}(x,F)
=
we have
t e Zd
hA(P,v) ~ l o g
we have
a .
~{t} << v{t}
and
h{t}(P,v) A log ~ . We will show by induction on
t E Z d , A a :=.C with
~(FaF)
7.18
t ~ A . For
for all
F c =F({t}) let
x c X . Let
I w{t}(x,F) d~(x) <_~ I
IAI
that
M(F) = sup ~{t}(x,F) x~X
F c F({t}) , F e F(A) ; then
M(F) dP(x) =
:
M(F)gA(x) d~(x)
~up gA(x) I M(F) d.(x)~ sup gA(x) I ~{t}(x'F) ~(x) xcX
=
and thus
~(A)
sup gA(x) s xe X
IAI
x~X
e sup gA(x) v ( F ~ F ) xg~
~
c sup gA(x) v(A) xgX then
;
for all
B { t } ~ A << v ( t } U A
and
A g ~(tt}UA)
. If
g { t } ~ A Ae(IA[+l)
estimate gives the other inequality.
Combining Proposition 7.2, Lemma 7.8 and Theorem 7.1 we get:
9 A similar
129
Proposition
7.3
some
. Let
if
e > i
Suppose that
lim hA(M,v) A
M , v e P
= 0
~
7.19
has bounded range and that
(Z d)
with
v e G(VD
~
concerning the existence of the Radon-Nikodym
A , A e C
such that for each
with
F e F(A)
A~A
In
so that there are no problems
derivatives
. Our estimates will be based on how close
Let
if and only
doesn't have bounded range.
what follows we will suppose that (7.15) holds,
Le~mla 7.9
M e G(V)
for
(where the limit is tahen over all cubes).
We will now see what c~] be done if
HA(p,v)
. Then
(7.15) holds
~
and the finiteness
of
is to having bounded range.
= r , and suppose
there exists
we can find an F(A)-measiG, able function
GA,A > l h : X----)R
with (~A,~)-'lh
Then for any
]J , v e G(j~)
~AU~.(!J,~))
Proof
Let
<= WA(',F)
<
~A,~h 9
we have
- ix(]J,,))
<__ 2 log (~A,A
F e F=(A) ~ F e F__(A) , and let
h : X
%R
be as in the hypotheses.
Then
~ WA(,,F)dla < c~A,~~h dM = ~A,~.~hg.~ dv
.(F~)
<
((~A,~)2I-~A(. ,F)g~dv
=
(CLA,~)2 I g~ dv
130
Thus
gAUA
~
(~A,A)2
7.20
and a similar estimate gives us
gA
gAUA
>-- (~A,A)-2gA
Therefore HAUA(}J'v) - H~(IJ'v)
Proposition 7.4
~ l o g / g A U ~ } d~~
=
Suppose that
~
<
2 l~
~A, ~ "
82
is such that (7.15) holds for some
e ~ i ;
and that
(7.16)
there exist
such that if
A
71
To :
Z+
is a cube of side
ing the same centre as urahle
,
h : X--~R
> Z
n
A , and if
+
and
with
A
Yl(n)
lim " n-~ n
= O
,
is the cube of side
Y2(n)
lim---7--- = 0 , n~ n n+Tl(n)
hav-
F ~ ~(A) , then we can find an ~(A-A)-meas-
with
{exp(-T2(n))}h ~ ~A(.,F) ~ {exp T2(n)}h .
Then for any
~ , v e G=(~). we have
lim hA(~,v) = 0 A
(with the limit taken over
cubes).
Proof
Let
H~(M,v)
But
A , A
<
HA(~,v)
be as in (8.16). Then by L e n a
H~_A(p,v) + 2T2(n)
< H~(p,v)
<
7.9 we have
2dy](n)nd-llog
c~ + 2T2(n)
, and thus
1 hA(%,,v)
<
--
=
IAI
}[~(~,v)
< =
{2d log ~} Yl(n~ n
+
2 T2(n) nd
.
"
131
7.21
Combining Proposition 7.4 and Theorem 7.1 we get:
Theorem 7.2 any
Suppose that
, ~ s G(V)
v ~ ~(~)
and
~
satisfies
(7.14), (7.15) and (7.16). Then for
lim hA(~,~ ) = 0 . Conversely, if A
lim inf hA(V,v ) = 0 , then A
, v s P (Z d) o
with
v c i(X) , then
p , v c Po(Z d) , with
# s ~(X) 9 In particular, if
lim hA(W,v) = 0 A
~ g(i)
if and only if
9
(As before, the lim and lim inf are taken over the set of all cubes.)
Suppose now that we are in the situation of Section 5, with with
(Yt,F=t) all copies of the same space
of potentials that the
{r
CA
(Yo,Fo) . Let
~
S = Z d , and
be defined in terms
satisfying the hypotheses of Proposition 5.2, and suppose
are zd-invariant in the sense of (5.17); thus
~IIr
I
is
Act tea independent of noted that (7.15) with
~
t s Z d , and we will denote this n ~ e r
by
N . We have already
satisfies (7.14), and a simple calculation shows that a = exp(2N) . Furthermore,
~
satisfies
(7.16) holds, and we can choose
Yl
to
~l(n) be any function with
lira ~ n~ n
Y2 (n)
= 0 ,
lira yl(~i) = ~ ; and then define n~
I leall
=
Y2
by
9
OsAeC diam(A)>Yl(n)
(where
diam(A) = max{ Is-tI : s,t e A }). We leave it to the reader to 'check
what the F(A-A)-measurable
function
h
has to be. Thus for this case the
132
7.22
hypotheses of Theorem 7.2 hold, and so we get the result we stated at the beginning of the section, i.e.
Proposition 7.5
with
/
Let
II@AI I
V
be defined in terms of zd-invariant potentials
finite. Let
~ ' ~ ~ Po (Zd)
with
{r
~ s G(V) . Then
0~A~ lim hA(P,v) = 0
(with the limit over cubes) if and only if
V e ~(~) 9
A
Now consider the set-up of Section 6, with
V
defined in terms of
satisfying (6.24). The converse of Proposition 7.1 is in fact true, i.e. if , ~ ~ G(V)
with
,(U)
= ~(U)
= i
then
lira hA(P,~) = 0 . However, the best A
w&y to show this is to explicitSu write down projections of
p
and
v
hA(~,v) (in terms of
onto the outside of
@
and the
A ), and then estimate directly
how small this quantity is. This is quite hard to do, and requires the use of more estimates from Ruelle (1970). We leave the interested reader to attempt it. Let us now look at a slightly more general situation than that considered in most of this section. We want to include the set-up of Section 5~ but now with S
being any regular lattice (or graph) rather th~n just
have an underlying space , and that for each
S (not necessarily countable), equipped with a G-field
A s ~
we are given a G-field,
(where we make no particular assumptions about ~(A) C ~ ( B )
; let
Z d . Suppose that we
~ = F(S) . We take
C
F(A) , of subsets of
X ). Assume that if
to be some subset of
A C B
if
AI , A 2 E C
then there exists
A s C
with
then
S , ordered by
inclusion, and make the assumptions:
(7.17)
X
AI[)A2~A
;
133 (7.18)
if
(7~
there
for some
For
A I, A 2 s C
exists
with
A
AIC
A2
7.23
then
A2-A I ~ C ;
e C , n = 1,2,... ~
n
such that if
then
A C_A
n
n .
A e ~
let
~A = ~ ( S - A ) .
We will suppose
that
We will need a group of transformations of S=-measurable
hijections
((A) ~ ~ . Suppose transformation
from
T~ : X -
T~Tq = T~q
(7.21)
T (x) = x
(7.22)
T~I(~(B))
(7.24)
m
< ~
H-invariaut
for all
{T~}~eH
. If
hA(~,v)
by
fields,
~ v e P(S)
o~(A)
acting on
such that if
there
1 .
X . Let A e ~
is associated
and
H
be a group ~ e H
then
an F=-measurable
~ , O c H ;
x ~ X , where
for all
a measure
is H-invariant,
random
~=
in such a way that
=F(CI(B))
Again we will write
S
( e H
for all
suppose we have
m(A)
>X
to
for all
e
(7.23)
S
that to each
(7.20)
Finally,
A e C ~
m
A e C
i.e.
P(A)
on
is the identity
B E S,
(S,S)
in
H ;
~ e H .
such that
;
m((-l(B))
= re(B)
for all
( e H , B e S_ .
instead of
P(F(A))
; Pc(S)
will
i.e. those random then we define
fields HA(~,~)
1
h A (]J ,'o ) -
e
,'o )
invariant
denote the
under the group
as before;
we now define
134
7.24
The condition (7.4) still makes sense in the present setting (provided replaced by
Zd
is
S ); thus to give the analogue of Theorem 7.1 we need only find out
what corresponds to "cubes". The property of cubes which was used was that they pack together nicely, and to measure this in our present set-up we define, for
max{ n : there exist
nH(A,A ) =
~i' .... $ n e
H
such that
are disjoint subsets of
Thus
nH(A,A )
is the number of translates of
A
~I(A) ..... ~n(A) A } .
that will pack into
A ; by
m(~) (7.2~) we have
nH(A'A) ~ m(A) " Let
upwards, generates the order on say that
~
is adapted to
H
if there exists
A
~ > 0
~ . (Note that
m(~)
"volume" of
~
which is directed H . We will
such that for all
A e
m(A)nH(A,A) ~
~ ~
be a subset of
~ , and is closed ~ider the action of
m(A)nH(A,~) lira inf
~
-
-
is the fraction of the
m(~) which can be filled by disjoint translates of
set of cubes in the case
S = Zd
then we can take
A . If
~
is the
a = i .) The new form of
Theorem 7.1 now becomes:
Theorem 7.3
Let
_~ = {WA}Ae C= satisfy (7.4); let
and suppose that
adapted to
Proof
H . Then
lim inf hA(P,v) = 0 , where
~
~ , 9 e Po(S) is a subset of
with ~
u e G(V), which is
~ s G(V) .
This is the same as the proof of Theorem 7.1 as far as Lemma 7.6. The proof
of Lemma 7.6 is easily modified, and we leave it as an exercise for the reader.
82
135
7.25
To see when the converse of Theorem 7.3 holds we look at the proof of Proposition 7.4. Let us again suppose that (7.13) holds. Note that although it is not hard to see what the analogue of (7.16) is, there is no real ~ a l o g u e (7.15) unless
S
Propgsition 7.6
(7.25)
is countable. Mimicking the proof of Proposition 7.4 gives:
Let
there exist
~
be a specification with the property that:
Tl ' Y2 : ~
~R
with
= such that if
A c ~
and such that if
F c F(A)
]~ , v ~ G(V)
(7.26)
Then
A e C
with
0 , lim
re(A) A ~ A
Ac~ and
then there is an F( A-A )-measurable
<
WA (-,F)
<
Y2(A) -
m(~-A) < Yl(A) , h : X----~R
w~th
{exp Y2(A)}h .
N
for all
such that
AI , A2 c ~
and
A c C
of the form
A2 C AI .
lira hA(~,v) = 0 9
The above setting gives us the analogue of Proposition 7.5 when replaced by any suitable regular graph. H and
0 ,
re(A)
with
there exists
A = A]-A 2. ~ with
YI(A) =
l i m ~
Ac~
then we c~m find
{exp(-Y2(A))}h
Let
of
~
Zd
is
is then the symmetry group of the graph,
is a set of reg1~Lar shapes for the graph. However, it is worth pointing out
that things can go wrong for some graphs: let ered in Section 5, and let
H
S
be the infinite tree
TN
be the group of all graph automorphisms of
considTN .
(The random fields we explicitly constructed in Section 5 are clearly H-invariant. ) Let
~
be a specification defined in terms of a nearest neighbour, homogeneous
7- 26
136
potential
V ; let
C = { A(n,t)
: n > i
A(n,t) = { s s S : d(s,t) < n ] , and from
s
to
t . Then although
~
t ~ S }
d(s,t)
where
is the length of the shortest path
is not quite adapted to
H , a slight modification
of the proof of Lemma 7.6 still gives us Theorem 7.3. However, shows that the converse of Theorem 7.3 is false in this case: have H-invaria~t
~ , v ~ G(_V)
with
a direct computation it is possible
to
liln hA(~l,v) > 0 . Now it is easily checked
ac_c. that (7.26) holds that fails.
Tnis is not very surprising,
corresponding point in
(since in this case we even have (7.15)), thus it must be (7.25)
A c C
A , thus
the right properties.
must include
because if we are given
]A-A I > 89 I , and so it is not possible to choose The problem is that the boundaries
g_ap~ which has some physical meaning, similar to that which happens n
d-i
for
then the
at least all points that are neighbours
same order of size as the sets themselves.
of order
A s ~
Zd
of sets in
Tais "pathology"
~
Yl
to some with
are of the
will not occur for any
and typically we would expect behaviour where a cube with
nd
points has a boundary
8.1
8. Some thermodynamics
In Section ect to
7 we defined
h(~,u)
, the specific informatioh
~
with resp-
v , but we did not show that the limit defining this quantity
existed. We
now tackle this problem by showing that if under suitable conditions
(8.1)
where
h(~,~)
f(~,~)
ect to
e s p o n d i n g to existence
P(~)
= f(~,~)
is
+ P(~)
free
energy of
a n u m b e r w h i c h can be i n t e r p r e t e d
~ . The existence
of the limit f(~,~)
h(~,~) and
w
with resp-
as the pre s s ure
corr-
can then be got from the
P(~)
.
approach also s11ows us to restate the results of Section 7 as a
form of Gibbs' variational principle: and only if equality
~ c __G_o(V)_ then,
;
t o some k i n d o f s p e c i f i c
of the limits which define
9
and
we have
corresponds
~ , and
~ e Po (Zd)
gain of
~ s (~)_
if and only if
, then from (8.1) we get
h(~,v) ~ 0 , with equality f(~,~) ~ -P(~)
~ s ~ (~) ; i.e. the translation
are exactly those translation free e n e r ~ .
if we have
invariant
again with
invariant
Gibbs states
random fields which minimize the specific
As mentioned in Section 7 this generalizes
result of~ Lanford and Ruelle
if
(Lanford a~_d Ruelle
FS~Lmer's
(1969), F611mer
proof of a (1973)).
Most of this section deals with the set-up of Section
5 (although we will
start as in Section 7), and in particular with specifications
defined in terms of
potentials which satisfy the hypotheses
of Proposition
5.2. In theory the models
of Section 6 could be handled the same way, but the technical overcome are many:
difficulties
to
the reader is invited to try.
Suppose we are in the situation
of Section 7. Thus for each
AC
Zd
we
138
have a o-field
~(A)
8.2
of subsets of a basic space
empty finite subsets of
Z d ) we put
~A = ~(zd-A)
X , and for
A s ~ (the non-
. We make the same assu~#tions
about the o-fields as we did at the beginning of Section 7. Usually in physics the specific free energ~ is defined by is the specific energy of entropy of
~
f(~,~) = e(~,~) - s(~) , where
with respect to
~ , and
s(~)
e(~)
is the specific
p . We will start by looking at what corresponds to
s(N) , and show
that the limit which defines it exists. We define a special kind of random field: if, given any
A I , A2 ~ ~
with
A I ~ A 2 = @ , and
a P(Z d)
is called independen-~
F I s ~(A I) , Y 2 s ~(A2) , then
k ( F l ~ F 2) = k(Fl)k(F 2) . (This says that events depending on disjoint s ~ s e t s Zd
of
are independent.) In the situation of Section 5 the natural (and only) w~j
of getting independent measures is as product measures; the Poisson point processes defined in Section 6 are also independent.
For all the models we are interested
in there is at least one independent measure. We will thus define the specific entropy by putting
s(p) = -h(p,k) for some suitable independent
k e Po(Zd).
The first result of this section shows that this is at least well-defined.
Theorem 8.1
Let
~ , k r P (Z d) o
exists, and equals
with
k
independent. Then
h(~,k) = lim hA(~,k) A
sup hA(P,k ) (with the limit and sup taken over the set of all A
cubes). Note: We must allow
Proof
(8.2)
Define
h(p,k)
to take the value
D : ~
~[0,~]
D(A) ~ D(A)
whenever
by
A C_A
+~ .
D(A) = HA(PSk)
. Then we have
, (using Jensen's inequality, as in Section 7);
8.3
139
(8,3)
D
_ invar{ant,
is trmnslation
Suppose we also knew that
(8.4)
if
A I ....
D
A s C ~ n =
i.e.
=
D(A+t)
was superadditive,
are ~ s j o i n t
Ai
D(A)
for all
,
A e C
Z d
t ~
i.e.
then
>
D(A i) . i=l
Then by a well-known be complete.
Since
*
(see for example
it is so simple,
1 lim --- D(A)
that
result
exists
in Ruelle
let us show how i sup - -
and equals
IAI
(1969))
(8.2),
D(A)
the proof would
(8.3)
and (8.4)
(where the limit
imply
and sup are
A IA I
taken over the set of all cubes). For
A , A g C
let
n(A,A)
be the maximum number
which will fit into
A . If
A
exists
a cube
A
n(A,A)
> (1-6)
--
o
such that
. Let
if
Let
A
A
A with
n = n(A,A)
of
D
~
Ai
above,
, then we can find
a translate
and we are given
is any cube with
--
D(A)
A . Then by
D(A i)
=
nD(A)
and let
~
lim
A
of
A
then there
then
o
over cubes),
D(A)
_>= D
. q~us
--
! --- D(A)
exists
meaning
be any cube with subsets
Ai
smd
and equals
of
if
d = += ).
A O
A
D(A)
l
. Put each
Ai
and (8.4)
> (1-6)--
= d .
o
A , with
; and by (8.3)
D(~) > -
-I . Therefore
~ > 0
(with the sup taken
AI,...,A n , disjoint
(8.2)
translates
I^I
= >__ (l-~)(d-~)
A ~
D(A) > (d-~)iA I (with the obvious
be the cube defined
o
A
d = sup
IAi choose a cube
is a cube,
of disjoint
I^I
D(A)
.
140
8.4
We now need to show that (8.4) holds, and clearly it is sufficient to show i% in the case
n = 2 . This is given by the following lemma.
Lepta 8.1
Let
~ , k c P(Z d)
with
k
independent, and let
A , A ~ C
with
A ~ A = @ . Then
HAO~(~,X)
Proof
We can assume that
easing f'0mction of
A
A ~A(~,~) + ~Z(~,~) 9
HAU~(~,I)
we also have
< ~ , and thus since
HA(P,k)
HA(~,k) < ~ , H~(p,l) < ~ . As in Section 7
dP A let
gA -
. Then we have dk A
HAkj~(p,%) - HA(P,%) - H~(p,%)
=
f l o g (gALl-----~A)dp
gA g%
gAh} A dk
=
I r
dk
gA gA I
~gAg~/ A A dl +
But
I gA~A
dl
=
i gAg~ dl
Therefore
{gAu~-gAg~ } ~
i , and also since
=
IgA
is an incr-
d l l g~ dX
1
=
is independent we have
i .
141
HAU~(~,X)-
8.5
IY (gAUAI dl _>_ 0 .
HA(U,X)--H~(~,X)=
gA g~ !
Suppose now that we have some independent regard as a fixed reference measu.re. cation,
and let
that for each
v c Go(~) A E ~
Let
~
be a translation
. Suppose that for all
there exist
X s P (Z d) , which we will o
~A ' BA s R
A e ~
with
invariant
specifi-
v A << X A , and further
0 < eA ~ 8A
such that
dv A ~A
<
~
BA "
dl A
Then for any
~ s Po(Z d)
HA(U,v)
we have
=
HA(V,X)
HA(U,v)
-
< ~
log
if and only if
--
HA(P,X ) < ~ , and
dv
dXA #
'Thus, from Theorem 8.1 , in order to show that
h(~,~)
exists we need only show
that
lim - -
log
A IAI
--
d~
d~A/
exists; we would also like to identify this limit with e(~,V)
the specific energy of
corresponding
to
~
with respect to
V , and
V . Note that if we could show that
i
lim---
log
--
1
-{e(~,V) P(V)
+ P(V=)} , with the pressure
142
ex•
8.6
with all the terms uniformly bounded, then by the dominated convergence
theorem
h(~,v)
would exist, and the existence of
particular properties of
h(p,v)
would not depend on any
p (other than it being translation invariant).
We will first see when this limit exists in the situation of Section 5. Let
(Yo,~_~,~o) be a probability space, ~ d
a copy of
have
for each
t ~ Zd
let
(Yt~F_t,wt) be
(Yo,~,a~o) . We will use the notation of Section 5; in particular we
X = ~
Yt"
~=
t E Zd (X,~)); thus
~
F----t" Let
~ =
~
t s Zd
~t
(as a measure on
t s Zd
~ s P (Z d) , and it is clear that
~
is independent. Choose
O
bo ~ Yo
and let
b = {b t}
(where
bt
is the appropriate copy of
as a base-point. Suppose we have a specification
~
bo ); use
defined in terms of
b
{r
which are tr~islation invariant (i.e. they satisfy (5.17)), and for which
IleAl I
is finite (i.e. they satisfy the h~otheses of Proposition 5.2).
OEA~
~ider these conditions we will show that
c G (~)), and hence that Let (writing
(8.5)
S
{~}AE~ for
h(p,v)
1 --
EA[
log ( dvA I
\ d% /
exists for all
be defined by (5.13). Then for
converges nicely (for any
e P o ( Z d)
, 9 ~ O~o(V) 9
A ~,,~(__v)
~e
have
Zd )
---(x) =
fA(ys_A• A) a~(y)
d~A
(in that the right-hand side is a version of the Radon-Nikodym derivative, and it
8.7
143
is the version we will always use). For
(8.6)
A c ~
define
JA : X - - - - ~ R
exp V(bs_AXXA )
JA(x) =
l exp VCbs_Axw) d~A(W)
and let
=
WA
li Aff
A~Ar162 AaCs-A)r
The next len~ma gives us an important estimate. Lemma 8.2
Let
v e G(V)
exp(-2WA)
Proof
xsX
exp(2WA)
(Ys_AXXA) dvCy) / JA(X)
9
We have
I Now
~
, A c C . Then for all
exp gA(ys_AXXA)
#~(yS_~•
dr(y) =
I exp gA(ys_A•
exp gA(ys_A•
d~(y)
I
/ exp V(bs_Axw)
=
dZOA(W)
exp {gA(ys_A•
- V(bs_Axw)}
9Thus
A n Aye A ~(S~A)@@
exp(-w A) s
exp gt'(y_R A•
/ ex~ V(bS_A•
<= ex~(WA) .
The result therefore follows by a simple comparison of terms.
82
by
8.8
144
For
B e P "-(Z d) , A e C
define
o
(8.7)
eA(~, Z)
V(bs_A• A)
=
IAI
d~(x)
~,
a~d
(8.8)
PA(__V)
=
--
log
exp V(bs_Axw)
d~A(W )
IAI
Thus
--
].og{JA(X)}
du(x)
=
- eA(U,V) - PA(_u
IAI
=
,
-
and hence by Lemma 8.2 we get
I (
i
(8.9)
d~A~}
d]J +
eA(~,_V)
+
P4(X)
I
<
--W
A .
But:
i lim - -
Lemma 8.3
A
Proof
For any
A e ~
II~AII A DA~ A a (S-A)#r
Let
K
=
~ueh
that
WA
=
0
we have
l z z IAnAIiI%II <=
:
t ~ A
teAcC A r~(S-A):/r
I1%11~given
L
(provided the limit is taken over cubes).
IAI
ileal!
~
Oci~s A n(S-A )~ o
~ > 0
<
t e A
there exists a cube
. For ~ y
t e Zd
let
AO
II~AII-
t~Aes A t~ (S-A)qr
with centre
Ao(t )
0
be the translate
145
of A
e
A
that has centre
o
= A - A
"~ t
t ; for
~
tmAe_C_=
i ~hus
--
let
=
A
m
= { t e A : A (t)C. A } , and let o
. Then
m
~ II~AII a AI'~(S-A)yr
~ A
A E C
8.9
WA
IAI
<
6 + K
(
6 + ~ ~ A
t
~ E A
m
e
IAml . But clearly
lim
--
A
IAI
IAI i lim - -
and thus
WA
II~AII <__ alAml+KIAel .
tEAc~
IAml )
i -
=
over cubes),
t
=
0 .
=
] (with the limit
82
IAI
In order to show that
h(p,~)
Lemma 8.3, Kid ~ h e o r e m 8.1, that
h(p,~)
exists then its value
look at
lira eA(~,~)
(8.1o)
. For
~t(x)
exists we n e e d only show,
l i m eA(P,V) A
and
lim PA(~) A
does not depend on w h i c h
t e Zd
=
define
__
~t : X
CA(XA)
using
exist.
~ c G=o(~)
)R
(8.9), Note that if
is used.
Let us
by
9
IAI
Clearly the sum in (8.10)
function.
of
From (5.17)
converges
absolutely,
it follows that if
and
c p ""~zd) o
r
is a bounded,
then
~
@t dp
F=--measurable
is independent
J
t e Zd . Let us suppose from n o w on that
for the specifications of Gibbs states.
we are considering,
(Yo,F_o)
is a standard Borel space.
there are no problems
Thus,
about the existence
146
Lemma8.4
If
~ E P (Z d) o
then (with the limit taken over cubes)
limA eA(~'V)
=
-
Ir
an 9
V(bs-A• = A~~ . A CA(WA) = t~e~A ~_ 1 ~ t E A ~ A IAU
Proof
Thus
8.1o
I V(bs_A• A) -
~
CA(WA) "
Ct(x) i
tEA
I t EA
! ~A(XA) I = IAI
tEA
I
A n(S-A)~r
- -
A~A~
CA(XA ) I
<
WA 9
IAI
A n (S-A)#~
But
tEA
I eA(~,v) +
~0d~
I =<
i -IAi -W
A ;
and therefore the result follows from Lemma 8.3.
Let us put
e(w,~)
with respect to
Lemma 8.5
P(~)
Proof
=
lim eA(~,~) , and call A
e(~,y)
the specific energy of
~ .
lim PA(~) A
exists (over cubes), and is finite. We denote the limit by
~Id call it the pressure corresponding to
Let
82
p E Po(Z d) , ~ e G ( ~ )
~ .
; then by (8.9)
147
I hA(P,v)
- hA(P,~)
Thus by Theorem
exists
- eA(P,V----) - P A ( ~ ) I
and equals
putting
U = v
though the limit might be
K
l l|
=
hence
PA(V)
~
K
for all
for
entropy
~ ; we have
thus
UA = ~A
Theorem
if
If
(8.11)
0 ~h(~,v)
hypotheses
Proof follows
(8.1i)
as
=
A ~ C .
82
~
s(~)
gives
- h(~,m)
e(p,V_.) - s ( p )
h(B,v)
h(~,v)
~(~)
< ~
= e(~,~)
= 0 , that
then
+ P(~)
hA(~,e)
h(~,v)
because
if'
s(p)
the specific
if and only if
= 0
for all
A ~ ~ ,
exists,
and
.
for some
the previous
if and only if
> -~
exists
ana
s(~) = 0
< ~ , (and also in this
+ P(~)
+~ ).
~ = ~ .)
B e ~o(V~
together
lim PA(~) A
is finite,
, and call
then by Theorem 8.1
s +~ , but if then
hA(V,~)
--
A E ~ , and hence
=
+ PA(~)}
(which could have the value
-~ ~ s(~) ~ 0 . (Note that
~I~is is just collecting because
=
lira {hA(U,v) A
in fact the limit
D e Po(Z d) , v e G ( ~ )
h(p,v)
We have
since
V(bs_^Xw)
s(~) = 0
for all
8.2
WA 9
+ e(w,V)
we get,
~ E Po(Z d) ,
= ~ , because
h(u,~)
+~ . However,
, the
Let us put, of
<
8.1, Le:~aa 8.3 and Lerama 8.4 we have
(over cubes),
In particular,
8.11
=V
satisfying
case
results.
s(~)
the same
> -~ ).
The last part
s(B) > -~ , and putting
v =
in
148
Note
8.12
7~though we have restricted ourselves to looking at limits over c~oes, in
fact the limits exist under much weaker hypotheses;
for example "in the sense of
van Hove": for details the reader should look in Ruelle (1969)~ Gallavotti and Miracle (1967), and Robinson and Huelle (1967).
Combining Theorem 8.2 and Proposition 7.5 we immediately get a version of Gibbs' variational principle:
Proposition 8.1
For
p ~ Po(Z d)
let
specific free energy of
~
equality if and only if
p c G=o(V) .
f(p,~) = e(~,~) - s(p)
with respect to
This variational principle involves varying
(f(p,~)
V ). Then
f(p,V__J> - P ( V )
B s P (Z d) o
with
V
is the , with
fixed; we will
now look at some other variational results in which it is the specification
V
that is ~aried. In particular we will identify Gibbs states with tangent functionais to the pressure. Most of what follows is adapted from Lanford and Ruelle (1969) and Ruelle (]967). If we a~.e going to vary
V
then we need a suitable class of specifications;
the obvious class to use in our present situation is those specifications given by potentials satisfying the hypotheses we have assumed for
set of potentials
V 9 Xo------@R
such that
V . Let
B_
denote the
V(x) = _ _ Z CA(XA ) , with
{r
A~C translation invariant a~id
2
= I ICAII
finite. B
can clearly be regarded as a
OcAcC
real vector space; and in fact as a normed vector spaee~ with the norm of
V ,
149
denoted by
IIVll , given by
~__
8.13
IleAl I . Furthermore, it is not difficult to
0~h~ see that each
B
is complete as a normed vector space, i.e.
V e B
there corresponds a specification
e(~,V) , P(V) , ~(V) , etc. instead of
B
is a Banach space. To
~ ; it will be convenient to write
e(B,~) , P(~) and
G(~) .
We can thus consider the pressure, P , as a function from will show that
P
A
to
R . We
is continuous and convex; and it is therefore natural to look at
the tangent functionals to ionals. If
B
P . Let us recall some basic facts about tangent funct-
is any real Banach space, if
space of continuous linear maps from uous and convex, then
u e A
g(a + &) - g(a) ~ u ( K )
A
to
A
denotes the dual of
R ), and if
g : A
is called a tangent functional to
for all
& e -A- . Let
Ta
A_ (the Banach )R
g
at
is contina e _A if
denote the set of tangent funcw
tionals to
g
at
a ; then
Ta
is a non-empty, compact, convex subset of
A
(the compactness being in the weak topology). Now if
~ ~ Po(Z d)
then it is easily checked that
Ile(~,')II a i , since by Lemma 8.4
~ Go(V)
e(~,v+w)- s(~) hence
Ie(B,V) I = I ~ @ 0 dB I ~ llVII ). Further, if
then Proposition 8.1 tells us that
the pressure at
V , (since if
a-P(V+W)
and
W ~ B
-e(~,-)
e(.,V)
- s(.) = -P(V)
tangent functionals are obtained this way, i.e. if V
then
u = -e(~," )
show that in this representation
B
is a tangent functional to
then
P(V+W) - P(V) ~ -e(~,V+W) + e(p,V) = -e(p,W)
the pressure at
e(~,. ) e B_ (and
for some
;
). We will show that all u
is a tangent functional to
Z e G(V)
. Furthermore, we will
is unique by showing that
-e(~%" ) = -e(v, ")
8.14
150
only if
U = v .
Theorem 8.3
Define
-e : P ( z d ) - - - - ~ B
by
(-e)(u) = -e(~,,)
. Then
-e
is
O
injective,
and for any
of tangent ft~ctionals
V s ~ to
P
the image of at
C_~o(V) under
-e
is exactly the set
V .
We start the proof of the theorem by looking at some of the properties the pressure.
Before doing this,
proof of Theorem 8.3 is that if subset of
U = { V e B : I ~ ( V ) I = i } , then
U
of the
is a dense
B . (This will be proved as Theorem 8.4.)
Lemma 8.6
P : B ------*R
IP(h) - P(v2)l
Proof
let us just mention that one consequence
of
is continuous,
~
A simple estimation,
IPA(VI) -
Lermna 8.7
P~(v2)I
P : B_-------~ R
then
P(tV I + (l-t)V2)
Proof
We will show that,
and exponentiating,
~id in fact
ilvl- v211
which is left to the reader,
a l l v I - v211
~r
is convex;
0 < t <__ i
i.e. if
shows that for any
and
VI , V2 e B
tP(u I) + (l-t)P(V 2) .
for any
A e C_ ,
PA
is convex. Multiplying by
we need to show
l exp{tVl(bs_Axw)+ (l-t)V2(bs_AxW)}dmA(W)
(8.12) <
Ace
[~ e~ vI(bs-A• d~A(w)It [I expV2(bS-A• ~A(~) ](l-t)
IAI
151
But if we put
p
i/t , q = ] A l - t )
=
t
~ then
8.15
p
and
q
are conjugate exponents,
emd thus (8.12) holds by applying Holder's inequality to the functions
fl
and
f2 ' -where
fl(w) = exp{tVl(bs_A•
P~o~Oosition 8.2 -
Proof
,
f2(w) = exp{(l-t)V2(bs_AxW)}
The mapping
-e : F (Z d)
-
O
We want to show that
~i = P2 " For
V e B
let
V ; then by Lemna 8.4
{r
{,~V}VsB
to be able to s e p a r a t e the elements of
show that
is injective.
for all
V s B
ilrLDlies
denote the interaction potentials determining
-e(~,V) = I C Y dp , where
for some
82
--
-e(Pl,V) = -e(P2,V)
The problem is thus to show that
and F(A)-measurable
)B
9
implies
=
0 c A s ~ - IAI CA(xA)V .
is a sufficiently rich set of functions
P (zd). Let 0
A e ~ ) with
-e(Pl," ) = -e(p2 , ')
r
f(x) = 0
f ~ B(~) ( i . e . f whenever
x0
: f dp I = I f dp2 " Let
is bounded
= b0
. We will
CA : X(A) ----->R
be given by
CA(w) = [A I ~
(-l)IA-Alf(wAXhB_ A) ;
A= A
then
f(x) =
0cA•c C - -1
SA(XA)
(with only finitely many non-zero terms), and
IAl
CA(W) = O
whenever
wt = bt
for some
t e A 9 Take
A e ~
with
O s A
and
8.16
152
define
{r
by
r
if
A = A+t
for some
t e Zd
cA(W)= { 0
Let
V
otherwise.
be the potential corresponding to
-i-
A~F
~A(XA ) ~ where
F
~I
and
; then
V e B_ and
is the set of translates of
IFI--IAI ). By hypothesis we have translation invariance of
{r
U2
A
that contain
~ V d ~ l - - ~ V d~2" and using the this gives us
I%~(xa)d~1(~)= ~(xa) d~e(~). Doing this for all
ation gives ~
A e ~
with
0 e A , and taking the appropriate linear combin-
~ f dw1 = I f d~2 " In particular, again using translation invar-
lance, this will hold if
n -(8.13)
f(x) = - - ~ ft.(xt. ) i=l i l ~.-measurable, and 1
,
where
ft. : Yt.-i l
ft.(bt.) = 0 1 1
for some
R
is bounded,
i .
n But if
and
gi
ft.(bt.) @ 0 l l satis•
for all
i , then we can write
(8.13). ~nerefore
f = c +~ gi ' with i= I
I f d~l = I f d~ 2
for all
f
c ~ R
of the form
~s3
8.17
n
f(x)
= 7~
ft.(Xt.
i=l hence
m
) ' with m
ft. : -ft.:= m m
9R
F=b.-meastLrable and bomqded; a~,d i
DI = ~2 "
The technical result
(due to Gallavotti and Y~racle (1967), and Lanford
and Robinson
(1968)) which enables ~s to deduce that all tangent functionals
of the form
-e(~,*)
Prol~osition 8.3
is the following:
Let
A
be a real separable Banach space, and let
be continuous and convex. For als to
g
at
are
a ; let
a s -A
let
Ta
g :A
.~R
denote the set of tangent function-
U = { a c _A : ITal = l } . Then
U =
~
U n , where
n>l each
U
is s~l open dense subset of
A ; in particular
U
is a dense subset of
n
A . Further,
--
an ~ U then
with u
Proof
if
n
a s _A an--~a
~u
smd
u
is an extreme point of
such that if
un
Lanford and Robinson (1968).
4
is the tangent functional to
~nd ~ r a c l e
~
an
(1967), for the second see
be a closed separable subspace of
B . P -e
restricted to
as a mapping from
~0
is
Po(Z d)
(though this mapping need not be injective m~ymore).
Proposition
Then
at
82
still continuous and convex, and we can consider to
g
(weakly).
For the first part see Gallavotti
Let
then there exist
Ta
8.4
u=-e(~,')
Let for
V c ~O sor~e
and .
u c ~0
e ~(V)
.
be a tangent functional to
P
at
V .
154
Proof
Let
written as
'1'
8.18
be the set of t~ugent functionals to
-e(~,.)
for some
u(W) = I r W dIJ for all
p s G(V)
W c 4
. If
u ~ T
P
at
with
V
which can be
u = -e(p,.)
" But is is easily seen that for any
W e B
B(A) , the bounded ftm_ctions from
are ~(A)-measurab]e for some
A e C . Thus by Theorem 3.2 (and the separability of
) we have
T
is (weakly) closed. T
u
if
u
Pn e ~o(Vn)
then
-e(Pn,')
u n = -e(lJn," ) . Thus
{nj}j> I =
and
u(W) =
D c Go(V )
that
is sn extreme tangent functional.
is extreme then by Proposition 8.3 there exists 4n e ~
a unique tangent functional, u n , to
R
is clearly convex, therefore it is suffic-
ient to prove the proposition in the case when If
to
@W
is in the uniform closure of
4
X
then
P
at
V n , and
such that ~here is
Vn - - - ~ V , U n - - - ~ u
is a tangent functional to
P
at
. But
V n , and hence
lim I @ W dw n . Suppose we could find a subsequence
such that
p(F) = lira Pn (F) j~ j
for all
F e A ; then we
would have P lira ~ @ W dPn. j+~ J j
P
-e(#,W)
=
\ ~ W dp
=
=
u(W)
(for a l l
W e ~O )
and the proof would be complete. ~nis fact is given by the next lemma.
Lemma 8.8
Let
V
exists a subsequence
, V e B_ with {nj}jk 1 _
all
Proof
and
Vn
~V
~ e G(V) =
snd let
82
Pn ~ =O(Vn) " Then there
such that
~(F) = lim ~n.(F) j~
for
J
FE~.
Let
{ITin)}Ac C (resp. {WA}AeC ) be the specification corresponding to
Vn
155
8.19
(resp. V ). It is not hard to show that for any ITA(n)(',F) ~
WA(" ,F)
if we have
uniformlJ~ as
~ s Po(Z d)
with
n--~
A s C
and
F e A
we have
Thus, as in the proof of Le~na 3.4,
p(F) = lim ~n.(F) j+~ J
for all
F e A , then we have
s G(V) . The rest of the proof is like Lemmas 3.1 and 3.2 (using estimates similar to those in the proof of Proposition 5.3), and is left to the reader.
82
We can now complete the proof of Theorem 8.3 by finding a nice separable subspace of
B . For st~pose there exists a closed separable subspace -e : P (Zd)
such that the mapping
~
is injective. Let
V s B
~0
of
B
and
u
be
O
a tangent f~uctional to by
.~
and
V . ~l
P
u(W) = -e(~,W)
injective
~ ~2
that
g c ~i " Since
for all
-e : Po(Z d)
u(W) = -e(D,W)
W e ~2
~I
>~
W e 4
~F = o ( A )
and
for some
~ = ~ ; thus in particular
There exists a closed separable suhspace
We need to find
for all
for all
u(W) = -e(~,W)
-e : Po(Z d)
Proof
be the closed suhspace of
is the closed subspace spanned by
we must have
Le~ma 8.9
~i
is unique. But we must have
able, and so --B2~ ~0
V ; let
B
spanned
is separable and so by Proposition 8.3 there exists
such that
because if
at
>~
~ e --o ~ (V)
is
for all
W s B ;
W
~2
then
is separ-
~ E C_~o(V) ; and as
u(W) = -e(p,W) .
~O
of
B
such that
is injective.
~0
such that
BI " ~2 e Po (Zd)
and
~ @W d~l = ~ r
imply
~i = ~2 " There exists a countable field
; for
A e ==C let
_~A__ C
d~2
==oF such
~ ( A ) C. =-~(A) be the field generated by
156 p r o d u c t s o f elements o f B --OO
~
, thus
8.20
~o(A)__ i s c o u n t a b l e and
consist of those potentials in
B
=oF(A) = o ( A ( A ) ) .
which are determined by
{r
Let
with
n
CA : X(A)------~R
of the form
~
akXAk , with
ak
rational and
Ak e A_o(A) ,
k=l and such that there exists then
CA = 0
~i = ~2
. B
--OO
whenever
A E ~
so that if
A e ~
with
0 ~ A
and
A - A ~ @ ,
is countable, and the proof of Proposition 8.2 shows that
~W
be the closed span of
d~l = ~ , W
B --00
d~2
for all
W g --oo B . We can thus let
B --o
82
The proof of Theorem 8.3 is now complete. It is worth noting that if is finite then Let
B
Proposition 8.3 it follows that
U
B
is separable then from Theorem 8.3 and
is dense in
B ~ and furthermore that
dense G6 (i.e. a countable intersection of open sets). If
Theorem 8.h space of
Proof
B
U
U then
O
is separable, ~Id in this case the above proofs can be simplified.
U = { V e B : IG_o(V) I = 1 } ; if
we still have
Y
is dense in
~ . If
is a dense G~ in
Clearly the second part implies
separable suhspaee of
_B with
~
~
then by Proposition 8.4 and Ler~aa 8.9 tar,gent functional to
P
at
is a
B i s not separable then
B .
is a dense subset of Uf%B_I
U
V (in
4
U
is any closed separable sub-
~i
B1 .
is dense in (with
~
I_G_o(V)I = 1
B . Let
~
be any closed
as in Lemma 8.9). If
V E
if and only if there is one
w B_2 ). Thus by Proposition 8.3
U~B2
is a
157
dense G 6 in
~2 " Now if
apply the above with
~2
~i
8.21
is any closed separable subspace of
the closed span of
~
and
~i "
B
we can just
82
We have seen that the pressure is a convex function; let us now see to what extent it is strict]@ convex. Suppose there exist
VI , V2 E B
and
t ~ (0,i)
such that
(8.14)
P(tV I + (l-t)V2)
=
By convexity (8.14) holds for all VI
and
tP(Vl) + (l-t)P(V2) .
t
with
0 ~ t ~ i , i.e.
P
is linear between
V 2 . One would thus expect that the tangent functionals to
tV I + (l-t)V 2
do not depend on
G ( t V 1 + (l-t)V 2)
P
at
t e (0,i) ; and thus by Theorem 8.3 that
does not depend on
t e (0,i) . We will show that this is true,
but will deduce it from Proposition 8.1.
Le~ma 8.10 let
Let
V 1 , V2 e B
and suooose that (8.14) holds for some
~ e G3(tV I + (l-t)V2) . Then
Proof
Write
U e G_o(VI)~Go(V2)
t E (O,l) ;
.
V = tV I + (l-t)V 2 ; then by Proposition 8.1 we have
t{e(~,V I) - s(~)} + (l-t){e(p,V2) - s(~)}
=
e(B,V) - s(~)
=
-P(V)
=
-tP(VI) - (I-t)P(V 2) .
thus t{e(~,V I) - s(u) + P(V1)} + (l-t){e(u,V 2) - s(p) + P(V2)}
But for
i = 1,2
we have
e(~,V i) - s(u) + P(V i)
~
=
0 .
0 ; and hence
158
e(~,Vi) - s(u)
=
-P(Vi)
. ~ ~ ( V l ) n C = o ( V 2) 9
Le~a
8.11
Let
(8.15)
~I
if
Proposition
_Go(tV I + (l-t)V2)
be a s~ospace of
V1 , V2 s B
8.5
If
strictly convex on
Proof
is independent of
with
B
V1 # V2
then
is a subspace of
B_I
t s (0,i) .
82
with the property
B
~ ( V I ) ~ A ( V 2) = @ .
satisfying (8.15)
then
is finite and
~
o
gives positive weight to each point of
not hard to see that
Y
8.6
a-field on
Yo ' mld that
Suppose that
G ). Let {r
then it is o
satisfies
Proposition
determined by
is
82
o
empty open set
P
~i "
Immediate from Lemma 8.11.
Y
8.1, we get
and suppose that (8.14) holds for some (and thus all)
This follows easily from Lemma 8.10.
Let
If
agsin using Proposition
82
VI , V2 e ~
t E (0,I) . Then
Proof
. Therefore,
8.22
~o ~l
Yo
is a topological space~ that
=oF
is everywhere dense (i.e. So(G) > 0 be the subspace of
for which
given the product topolo~).
(8.15). More generally we have:
CA : X(A)
TT1en B1
satisfies
B ;R
is the Borel
for every non-
consisting of those potentials is continuous
(8.15).
(where
X(A)
is
159
Proof
Let
V e ~i
easily checked that Wrlting of
~
S
for
and
Zd
: X if
with respect to
X(A) . Now for any
if
G C.X
~
to
A
pA(B)
fA
is the projection of
is everywhere dense, and ~
, thus
p~(p)
x).
is the Radon-Nikodym
P~(H)
p
onto
has a strictly
is everywhere dense. Tmerefore
is non-empty and open then there exists a non-empty open
i = 1,2
fl
by (5.13). It is
is everywhere dense, since from the definition of the product topology
let
~ ~(VI)~(V2) both
A e ~ , then
0~AXPs_A(W) , where
Go a _A 9 Suppose there exist For
and
p e ~(V)
A c ~
V
is continuous (with the product topology on
~R
positive density with respect to mAXPs_A(~)
be given in terms of
{fA)As C
?
8.23
and
(f~)A~
V I ' V2 ~ ~i
A
A ~ f2 A fl
for some
Vi
and
A A fl = f2
x e X : f~(x) @ f~(x) }
with
~ ( V I ) ~ ( V 2) r @ .
by (5.13); let
A e C (since
are versions of the Radon-Nikodym derivative of
~AxPs_A(D) , and so
everywhere dense.
VI @ V2
be defined in terms of
. We must have f2
with
Go C_G
V I @ V 2 ); but p
with respect
~AxPs_A(H)-a.e. This is a contradiction because
is non-empty and open, and we know that
~AXPs_~(p)
82
The strict convexity proofs given here have been adapted from Griffiths and Ruelle (1971) (where the case
Y
finite is considered). O
is
9.1
9. Attractive specifications
If we are given a specification
~
(and let us suppose that
G(V)
is non-empty)
then one of the most important problems to solve is to determine whether or not there iS exactly one Gibbs state. Even in what would appear to be very simple cases this is extremely hard (as the results in Section 5 show). The situation in which there has been most success in solving this problem is with the Ising model of a ferromagnet (as described in Section 5). In this case the existence of certain inequalities
(the FKG, GHS and Griffiths inequalities, see Fortuin,
Kastelyn and Ginibre (1971), Griffiths, Hurst and Sherman (1970), and Ginibre (1970)) help to simplify things. ~hese inequalities come from the fact that there is an "attractive interactmon " " involved with a ferromagnet. In this section we will generalize this by introducing the notion of attractiveness in terms of an order structure on the underlying space
X . We will look at specifications that
have proprrties with respect to the order which can be thought of as corresponding to attractive interactions. Let us first consider conditions for there only to be one Gibbs state in the general set-up of Section 2. Let simplicity we will assume that
~ = {~A}A ~
RA = X
for all
be a specification,
and for
A a ~ (though no problems arise
if this assumption is not made); we are thus really in the situation of Section i. As before, let to
R . For
B(~)
denote the set of bounded F__-measurable functions from
f ~ B(~) , A e ~
recall that we defined
wAf : X
~R
by
(~Af)(x) = If(Y) ~A(X'~) ; it will be con~zenient to sometimes write
WA(x,f~
instead of
(wAf)(x) . Let
X
9.2
161
~A(f) =
sup ~A(x,f) x~X
Lemma 9.1 tion of
Proof
~ (f)
, ~A(f) =
inf ~A(x,f) x~X
is a decreasing function of
A ; ~A(f)
is an increasing func-
A .
Let
A ~ A E ~
with
A ~ A ~ then
x~X
<__ sup xgX
Similarly
Let
-
~(f)
w+(f) =
Lemma 9.2
xaX
I +~A(f;' ~ ( x , d y )
> wA(f)
lim ~ ( f )
If
f e B(~)
.
=
w (f) .
A
82
,
~-(f) =
and
lim hA(f) .
p E ~(V)
then
~-(f) < If d~ <= ~+(f) .
Proof
Hence
If
A c C
then
~ = ~WA ' and thus
Ifd=f(y)IA(x0y)dCxl=IA(x,fl(x) ! wA(f)
<
f dp
<
§ wA(f;
.
82
162
A subset
pl,
W
~2 g P ( F ) a n d
I,e.Tm~a 9.3
Let
of
B(F)
will be called a deter~ining class if
I f d~l
W
9.3
=
i'f dp 2
for all
be a determining class. If
f ~ W
imply
pI
=
P2
"
fsW
w+(f) = ~.-(f) for all
then there is at most one Gibbs state.
Proof
Immediate from Lemma 9.2.
82
Now suppose that we are in the set-up of Section 3 ; so we have a family {-~)SaN
of sub-o-fields of
let
F
B(A)
satisfying (3.1),
(3.2),
be the functions in
(3.3) and ( 3 . 4 ) . Let
B(_F_) that are _~-measurahle
ee__N for some
0 ~: N 9
Lemma 9...4
Suppose
f e B(A) . Then
V
(3.8) and (3.11); and
w-(f) < ~+(f)
IG(_~)I > 1 . (Note that, since we are assuming
just says that for any
B(~)
satisfies
A s C , F e A
~A(',F)
for some
R A = X , (3.8)
is in the uniform closure of
.)
Proof
We can find
x n ~ X , such that
{An}n>=l ~+(f) =
increasing and generating the order on
_C_ , and
lira w A (Xn,f) . By a slight modification of the n-~= n +
proof of Theorem 3.2 there exists
that
p (A) = +
lim wA (Xn.'A) j-~o n. j J
~
c G_(_V) and a subsequence
for all
A ~ =~ . Hence
I
{nj}j> 1
such
h d~ + = lim ~A (Xn.,h) j-x~ n. O J
163
for all
exists
h e B(A) , and in particular
p- e G(~)
Proposition 9.1
with
Suppose
determining class. Then
Proof
z-(f) =
~
9.4
~+(f) = I f d~+ " In the same way there
I f d~- , and thus
satisfies
I~(~)I = i
(3.8) and (3.11), and
if and only if
Combine Lemmas 9.3 and 9.4.
I~(~)I > i .
82
W~B(~)
~+(f) = w-(f)
is a
for all
f s W .
82
To give our definition of an attractive specification we need an order on
X . Let
~
be a partial order on
such that
(9.1)
~
is measura01e,
(9.2)
~
is directed both upwards and downwards,
there exist
Let where
= {WA}As ~
Proposition
{(Xl,X2) ~ XxX : x I ~ x2} s F xF ;
x3 , x4 s X
BI,(Z) (resp. B+(s f : X--~R
i.e.
X
such that
is increasing if
Suppose
(3.11). Then there exists
xI , x 2 ~ X
f(x) ~ f(y)
~
if
whenever
B(~) (resp. B(A__j ),
x ~ y . A specification
WA f c B+(_F_J for all
is an attractive specification
~+ c ~(~)
then
x 3 ~ x I ~ x4 , x 3 ~ x2 ~ x 4 .
) denote the increasing functions in
will be called attractive
9.2
i.e. given
such that
w+(f) = ~ f J
f s B+(F)
, A s ~ 9
satisfying (3.8) and dB +
for all
164
for all
f r B+(~) . Similarly there exists
for aZl
f s B+(A) .
Proof
Let
9.5 ~- a G(V)
~n(F)---~(F)
of
for all
F r A
where
A
F = ~(A=o) . By the proof of ~ e o r e m M(X~F)
w-(f) =
M(X,~_) be the space of finite signed measures on
ized (as in the proof of Theorem 4.3) so that
and
with
. For each
f e B@(~)
we are trying to show that
~n---~B
I
(X,_F) , topolog-
if and only if
is some countable field with 4.3
G(V~
f d~-
A~C
A
is a non-empty compact subset
let
~
U(f)
is non-empty. Each
U(f)
is closed
f E ~+(A) (since
f e B(~) ), so by the compactness of
finite intersection is non-empty. ating the order on
G(V)
we need only show that any
Choose an increasing sequence
C ; then for any
(since the limit is monotone). Let
f r B+(A)
fl ....
we have
fme
{An}n> I
~+(f) = lira w A (f) n -~eo n
B+(A) ; we can find =
n
1 , 1 < j < m ~ such that
for each
n > i
choose
w+( f~ ) =
xn s X
so that
n
done by (9.2)). Since
thus for
V
is attractive we have
i ~ j <= m
~+(f.)~
=
lira ~A (Xn'f~) " n+~
n
< x =
for n
x (~) s X n
lira WAn (xn( j ),f~) n-~ x~'
gener-
for
'
i =< j =< m . Now
1 < j < m (which can be =
=
wA (Xn'fj) >----~A t'x(J),fj)n and n n
9.6
165
Just as in the proof of Lemma 9.4 we can find such that
]J(A) =
lim w A (Xn.,A) j-~= n. O J
for all
e G(V)
~nd a subsequence
{nj}j~ 1
A e A . We thus have for
m
that
~+(fj) =
~ fj dz , and thus
~ e ~
U(fj) .
82
j=l
Theorem 9.1
Suppose
(3.11), and that
~
B+(~)
e G_(_V) such that
~- ~ G_(~) with
is am. attractive specification is a determining
w+(f) =
~-(f) =
f dz +
I f d,-
satisfying (3.8) and
class. Then there exists a unique
for all
for all
f ~ B+(A) , and a unique
f e B+(A)
. Furthermore,
IG(~)I = i
+
if and only if
Proof
~
= ~- .
The existence of
~+
and
~-
is given by Proposition 9.2; their unique-
ness follows from the definition of a determining class, and the last part follows from Proposition
9.1.
82
We will assume from now on that we have an attractive specification
V
B@(~)
is a determining class, and that
satisfying (3.8) and (3.11). ~+ (resp. ~-
will be c~'~lled the high density
(resp. low density) Gibbs state. It is clear from
the uniqueness of
that they are extreme points of
~+
and
B-
_G(V) .
In order to check if there is only one Gibbs state it would appear that we would have to see whether
w+(f) = w-(f)
for all
f c B+(~)
. However,
in most
9- 7
166
cases we only have to check this equality over a much smaller set of functions. This follows from the next lemma, which in the case of the Ising model is due to Lebowitz ~nd Martin-Lbf
Lemma
Proof
9.5
Let
E = { f s B+(A)
We first rescale
> 0 , such that if
~+(h) = w-(h)
du+ :
]
(l-f~176
f
: w+(f) : ~r (f) } . If
and
g : there exist ~
fo = ~f + Y ' go = 6g + 6
is the same as
on the other hand
(f~176
(1972).
l h d~ + = l h dz-
~'og~ : (l-fo)(l-go)
I (f~176
dP+ ~
d~-
(since
I (l-f~176
, and so
fg
W all
Let
is a determining f s W .
W
be a subset of class. Then
then
(for
with
fg ~ E .
~ > 0 ,
0 ~ f o ~ go ~ i . Since
h ~ B+(A) ) it is clear
' go r E ), and also
O
(since
-(l-fo)(l-g o) s ~+(A)
fog ~ a E . Therefore
= (~6)-l(fogo - ~fo - Ygo + Y~) "
•neorem 9.2
6, y, 6 r R
then
+ (fo+go) - i , and
f
dp-
f , g e E
)" Thus
fg e E , since
82
Bt(_A)
IG__(~)I = i
such that the algebra generated by if and only if
~+(f) = ~-(f)
for
167
Proof B
Let
W~
be the algebra generated by
is the vector space spanned by
B§
O
9.8
W ; let
Bo = { f-g : f ' g e B+(A)_ }.
, and it is not h a r d to see that =-
Wo C
Bo . Let
space of
B
Eo = { f E Bo : ~ f
. If
f , g ~ E
O
with
dp-),
so
Eo
is clearly a sub-
f = fl - f2 ' g = gl - g2
and
O
fl' f2 ' gl" g2 e B+(~) Lemma 9-5
d~ + = I f
fg ~ E
, then
. Thus
E
o
and this implies
fg = flg I + f2g 2 - flg 2 - f2g I , and so by is an algebra, hence if
W C_ E
0
~
= ~
then
W
0
C_ E 0
0
82
Before looking at examples of attractive specifications let us consider what happens when the attractive specification is H-invariant, where group of ~ m e a s u r a b l e
bijections from
H-invariant if, given any r
~
~
Le~:a 9.6
and
If
~f+(f) = ~ + ( f ~ )
Proof
Let
(r
~
~ ~ H
= ~A
X and
to
A ~ ~ , ~hen there exists
and
~w~
@ c H , f e B(~)
.
A c ~ . Choose
~nus
But by the definition of
A E ~
"
= ~-(fo~)
r ~ H , f E B(~)
is a
X . As in Section 4 we will call
is H-invariant, then for any
, ~-(f)
H
we h a v e
A e ~
so t h a t
we have
with
9.9
168
+" ~A(i) ~ w (fo~) ~ ~+(fo~)
and therefore have
w+(f) ~ ~+(for
w+(f) ~ ~+(fo@) same way.
. Replacing
, and hence
f
by
for all
serves the order on
all
and
r
by
r
A ~
we
gives us
. The other equality is got the
82
fo~ s B(~)
for all
fo~
~+(f) = w+(fo~)
We now add the assumption that that
. Since this holds for all
f s B(~)
X , in that if
@(A) C_
~
for all
, @ s H . We will also assume that x , y s X
with
@ c H . From this assumption we immediately
f s B@(~)
@ e H . This implies
x ~ y get that
then
r
H
pre-
~ r
fog s B@(~)
for
, @ E H .
+
Theorem 9.3
If
~
is H-invariant
I~(~)I = 1 if and only if
then
l~(~)I = 1
are H-invariant.
and (where
Also
C-o(=V) is the set of H-invariant
Gibbs states).
The second part follows immediately
Proof For s_ny
f ~ B+(A)_ , @ ~ H
f d~ I'+
Since
B%(A)
=
~+(f)
=
from Theorem 9.1 and the first part.
we have
fo~J ~ B+(A)
~+(fo~)
=
is a determining
Ifo~
, and thus by Lermna 9.6
d~ +
class, this implies
=
f d(r +)
+
=
@~+
.
; and hence
~
+
is
H-invariant.
Finally,
note that if
V
is H-invariant,
Theorem 9.2 in that the hypotheses H-invariant
algebra generated by
about W
W
then we can improve on
can be reduced to having the
a determining
class.
(In the most important
169
9.10
examples we will look at this will mean that
W
can consist of a single f ~ e -
tion. )
We will now look at attractive specifications in the setting of Section 5. Let
Y
be a Polish space (i.e.
o
metric space), let order on
Y
(9.3)
be the Borel ~-field on
is closed, i.e.
<= thus satisfies (9.1), and each
t e S (where
be a copy of A C
S
S
X(A)
o
, and let
<
be a partial
{ (Yi'Y2) ~ Yo•
(Yo,%)
: Yl =< Y2 }
is closed in
Yo • o
is clearly a standard Borel space. For
is any countable set, not necessarily
Z d ) let
(Yt,F__t)
the product order, i.e.
{xt}te A <= {Yt}teA
if
xt < Yt
t e A . This order again satisfies (9.1), (9.2) and (9.3) (with the
product topology on
A =
Y
(Yo,F=o) . We again use the notation introduced in Section 5. For
we give
for all
can be topologized as a complete, separable
o
satisfying (9.2) and
o
~
F
Y
U
F(A)
, and
X(A) ). We take
B(A)
N = __C and let
B A = F(A)
; thus
consists of the bounded functions that are
AEC
~(A)-measurable for some
A s _C .
One more assumption about
(9.4)
given f
YI' Y2 e Yo
with
Y
o
and the order
Yl @ Y2
<
will be made; this is
then there exists
bounded, continuous, increasing and with
f : Yo
)R
with
f(yl ) @ f(y2 ) .
The most important cases we will be concerned with are when
Y
o
is either
R
or
170
some closed subset of
R (with the usual ordering);
all the above conditions.
and 48.) Also note that if (9.2),
Lepta 9.7 functions
generated by
Proof
holds.
{Y } n
(This follows
B§
class.
that are ~({t})--measurable is a determining
W
and let
C
o
X
then by the Stone-Weierstrass are uniformly
f e W F
o
of
. Take arly X
with
F
Let
~
dense in
by
Bf(A) :-
denote the continuous
algebra containing the constants, is a compact subset of
C(F)
(1965)~ pages
30
these conditions.
W
consists
of those
t ~ S ,-then the algebra
the algebra generated by o Let
W
functions
and
C(F)
in
W
denotes the continuous
theorem the restrictions . Let
B , ~ c P(~)
X
with
o
. C
o
is an
functions
of the functions ?f
d~ =
~f
in d9
X . If on
F , C
o
for all
is Polish there exists a compact subset
~(F) > i - ~ , v(F) > i - ~ 9 For any
X-F
(resp. ~ ) denote the restriction
W
be the algebra gen-
o
and (by (9.4)) separating the points of
8 > 0 , then since
F
is com-
o
sequence of spaces
if
for some
Y
class.
is contained in the vector space s p u m e d
F
Furthez~ore,
The second part implies the first because
erated by
clearly satisfy
o
from Nachbin
is a finite or countable
is a determining
W
Y
(9.3) 8nd (9.4) then their product satisfies
B§ in
these
It is worth noting that if (9.3) holds and
pact then (9.4) automatically
satisfying
9.11
f E W
o
we have
X-F
of
p
(resp. v ) to
F
(as a meastu-e
9,12
171
on
F ); then the above application of the Stone--Weierstrass theorem gives us
t ]~ f dp -
If
IIp-~li
~ 26
and thus
II~-~ll
d~ I
~ 48 , and hence
Let
(where
281]f11~
[Ik[]
> = ~ .
for all
f e C(F) ,
i s the total variation o f
X ). Therefore
82
WI be any subset of
separate the points of ated by
<
W
such that the continuous functions in
Wt
X ; the proof of Lemma 9.7 sho~rs that the algebra gener-
W l is a determining class.
In order to give examples of attractive specifications in this setting it will be convenient to start with a special case and assume (until further notice) that :
(9.5)
Yo
is totally ordered, i.e. if
YI' Y2 E Yo
then either
Yl =< Y2
or
Y2 ~ Yl ;
(which really amounts to assuming that x , y r X(A)
(xvY)t
Thus
with
=
Y
o
is a subset of
x = {xt)ta A , y = {Yt}tcA , then define
max{xt~Yt} ,
(x~Y) t
=
R ). If
A ~
S
and
xVy , x^y r X(A)
by
min{xt,Yt} 9
X(A) , with the product order, is a lattice. As usual~ we will only look at specifications given in terms of interaction
potentials { @ A } A e ~
such that
~
IleAl I
is finite for all
t e S (and thus
taAa~ (3.8) and (3.11) hold). The inequality which gives us examples of attractive
172
9.13
specifications is the following:
Theorem 9.4
~ f l d~A
Let
=
A a C
I f2 d~A
and
=
fl' f2 : X(A)
for all
fl.>_ 0 , f2 >= 0 , and
x , y ~ X(A) .
h : X(A)---'~R
Then for any bounded, measurable, increasing
Proof
with
i . Suppose that
f:(xvY)f2(xAy) h fl(x)f2(Y)
(9.6)
~R
we have
We defer this to the end of the section. The proof we will give is taken
from Preston (197hc). The inequality is a generalization of a result of Holley (1974), which in turn was used to give another proof of the FKG inequalities (Fortuin, Kaste3@n and Ginibre (1971)).
Let
V(x)
=
{r
~
82
V : X
be as above and let
)R
be given, as always, by
0
CA(XA ) "
Ac~
Theorem 9.5
(9.7)
Suppose that
V(xVy) + V(xAy) ~ V(x) + V(y)
Then the specification corresponding to
Proof
Let
{~}Ae~
for all
V
x , y e X
O
is attractive.
be defined by (5.13). If
h
B§
ana
AE~
then
9.14
173
~rA(X,h)
Let
=
x , y a X
J h(z) WA(X,dz)
with
f~(w) = fA(xs_~• Then
g
y ~ x
=
h(xS_A•
and define
~ f2(w) = fA(ys_A•
d~A(W) .
fl' f2' g : X(A)
~R
by
, g(w) = h(Xs_AXw) .
is bounded and increasing~ and
~A(x,h)
= ~ g(W)fl(W) ~^(w) ,
~A(y,h) : ~hCYs_A•
(w) d~A(w) ~ ~g(w)f2(w)~A(w)
Thus the proof would be complete if we could show that
fl
and
f2
satisfy (9.6).
But this amounts to showing that
(9.8)
where
exp gl(UVv) exp g2(u^v)
~
exp gl(u) exp g2(v)
for all
gl(u) = gA(Xs_A• ) , g2(u) = gA(ys_Axu ) , and where
gA
u , v e X(A) ,
is defined by
(5.12). (This is because the terms which occur from the denominator in (5.13) c~ncel out.) (9.8) is of course equivalent to
(9.9)
and ~f
gl(uVv) - g](u)
x , y e X
o
~
g2(v) - g2(u^v)
for all
u ~ v e x(A) ;
then
g~(uvv) - gl(u)
= V(Xs_A•
g2(v) - g2(u^v)
= V(Ys_a•
- V(xs_A• - V(Ys_a•
Thus from (9.7) we have (9-9) holds provided
x ~ y ~ X
, 9 9 It therefore holds in o
174
general~ since
gA(x~Xhs_~)~
~ gA(x)
9.15
as
A--~S
.
In practice we would like conditions on the V
satisfies (9.7). But note that
the sense that for each
(9.lO) If
~a(uv~)
IAI
= 1
+
V
CA
which would ensure that
will satisDy (9.7) if each
CA
does, in
A s
~a(u^~)
CA(U) + ~A(v)
then (9.10) always holds; for
for all
[A I > i
u , v ~ X(A) .
a useful sufficient condition
is given by
Proposition 9.3.
Let
A s
s~%8 for esch
negative ~Id increasing; let
~A(y)
then
CA
Proof
let
=
c §
~ tea
let
CA : X(A)----+R
:
Yt-----~R
be non-
is defined by
ht(Yt) ,
c = 0 , and will proceed by induction on
then there is nothing to prove~ so assume
B=A-
ht
satisfies (9.10).
We can assume that
[A I = i
e s R . If
t e A
{t) . Define
~B : X ( B ) - - ~
by
IA[ > i , take
~B(Z) =
]~
[A[ . If t e A
and
hs(z s) , so by
sgB the induction hypothesis
~B
satisfies (9.10). Let
loss of generality we can assume that
~A(xvz) + ~A(xAy) =
x , y E X(A) , and without
Yt ~ xt " Then
~A(x) - ~A(y)
ht(xt){ C B ( ~ V Y B ) - CB(:~B)} - ht(Yt){$B(YB) - CB(XBAYB)}
9.16
175
Now as
CB CB
is increasing, thus
r
satisfies (9.10) we have
Therefore since we also have
) ~ CB(XB)
and
CB(YB ) ~ CB(XBAYB ) ; and
CB(XBVYB) - CB(XB) ~ CB(YB ) - CB(r~AYB)
0 ~ ht(Y t) ~ ht(xt)
CA(xvy) + CA(xay) - CA(X) - CA(y)
.
we get
~
0 .
82
Oi" course, the hypotheses of Proposition 9.3 are by no me,is necessary for
CA
is when
to satisfy (9.]0). One case that is worth looking at a bit more closely IAI = 2 . For this it is easily seen that the condition
removed from the hypotheses of Proposition 9.3. Further, if an open inter~al, and
CA : Yo • o
)R
ht ~ 0
IAI = 2
and
can be Yo
is
is differentiable, then it is not diff-
icult to check that (9.10) is equivalent to having
DI2@ A ~ 0
(where
DI2
denotes
the mixed partial derivative).
Let
p : Y
o
~R
be bounded, continuous and strictly increasing (i.e.
~(yl ) < p(y2) if Yl < Y2 )" and for t e S
let
Pt : X
)R
be given by
Pt(x) = P(Xt) . The proof of Lemma 9.7 shows that the algebra generated by is a determining class, and thus, by Theorem 9.2, w+(pt ) = w-(p t)
for all
IG(V__)I = i
t E S (or, equivalently, for all
if and only if
t c S
Pt d~+ = ~ Pt d~'- ). An important way of looking at this is as follows:
If
~. ~ P(F)
then we can regard
Yo-valued stochastic process prob.{Xtl ~ F 1 ,..., X t
B
{Xt}te S
as defining the joint distribution of a indexed by
~ F n} = ~({x e X : x n
S , (with
e F I ,..., x t tl
{Pt}teS
e Fn}) ). n
176 Let
{Xt}tc + S (resp. {Xt}tE S-
9.17
) be the stochastic process given by
~+
(resp. U- );
then we can tell whether these processes are the same just by comparing the distrX"+ t
ibutions of the individual random variables
and
X
~
for each
t ~ S
(some-
thing which is far from true for two arbitrary processes).
Now suppose that
S = Z d , and that
{r
in that they satisfy (5.17) 9 By ~ e o r e m 9.3
D
are translation invariant, +
end
~-
are translation invar-
iant; further, since the smallest zd-invariant algebra containing as the algebra generated by if
{Pt}tezd , it follows that
P0
I~(~)l = i
is the same if and only
w+(p 0) = w-(p 0) . A consequence of this is that the variational principle
considered in ~heorem 8.3 is much simpler in the present sit,~ation. We will now look at this. We are assuming that instead of
S = Z d (thongh for convenience we will write
Z d ) and that we have a specification
satisfying (5.17), with
~
IleAl I
~
defined in terms of
finite, ~nd with
V
S {r
satisfying (9.7).
OEAE~ Let
p : Yo
(where For
b
~ a R
>R
be bo~]ded, continuous, strictly increasing, and with
is the base point); for define a potential
v~(x)
t E S
V~ : X ~
= V(x) + ~ ~ _
define ~R
Pt : X
"~R
by
by
~t(x) ;
tgS
thus
VX
corresponds to the interaction potentials
{r
, where
p(b) = 0
Pt(X) = P(Xt)-
177 X CA
r
It is clear that
VX
=
CA
=
r
if
9.18
IAI ~ 2 ,
+ Xp(E)
.
satisfies all the conditions we have assumed for
can be thought of as adding en external field to field proportional to the pressure,
V
(with the strength of the
k ). We are going to prove that
as a function of
X , is differentigole
V ; VX
i~(~)i = i at
if and only if
X = 0 . Thus unlike the
general case considered in Section 8, where we had to look at all the tangent functionals to the pressure in the Banaeh space the one-dimensional
subspace
{ VX : X e R }
A s C , x E X , X s R
For
=
exp gXtxs_Axw)
^
(where
g~
is defined in terms of
PA'x(V~ )
(thus
PA(~)
Lemma 9.8
_l_
IAI
log
~_ PA,x(V~)
Calculus.
82
=
V X ).
I
B .
{r
dLOA(W) ,
X
by (5.12)); and let
ZA'x(V~) >
defined in (8.8) is the same as
kernels corresponding to
Proof
=
of
let
A,
I ZA,x(V X)
~ , here we only have to look at
~_~
PA,b(V)).
WA,x(x,Pt)
(where
OrA,X }AE___
are the
9.19
178
Lcm~a 9.9
For each
A ~ ~
choose
x(A) s X . Then
P(Vk) =
lim PA~x(A)(V)) A
,
provided the limit is taken over cubes.
Proof
From the definition of
only show that
P
we have
p
+
, p-
~+(~)
where
p~
; so we need
: R ~-~R
=
easily enough
II
by
Po d.~ ,
p-(~)
=
PO d~X
;
(resp. ~X ) is the high density (resp. low density) Gibbs state corres-
ponding to and o n l y
lim PA,b(Vx) A
lim {PA,xtA~(Vx) ~ i - PA, b(Vk)} = 0 , and this fo!l~r A
from the estimates used in Section 8.
Define
P(V k) =
V x . Then
if
p-(X)
Proposition 9.4
p- < p+
= p+(k)
If
and, as we have already noted,
IG=(Vx)I = i
if
.
P(V~)
is differentiable, as a function of
X , at
X = ko
I ~ ( V x 11 = 1 .
then
0
Proof
Without loss of generality we can assume that
~
O
= 0 . For each
+
c~n choose
x(A) s X
such that
WA(X(A),Pt ) ~ WA(Pt) - I/IA I
Then by Lemma 9.8 we have
amaXPA'x(*;(vx) Ix=o >_ IAI
) - z/IAI}
for all
Aes teA.
we
179
= IAI
l/l~l}
{'~ (%1
Choose a sequence of cubes
and 9.8
f
~,+(%1
Am + S
and let
9.20
I/IAI
o+(o1
fro(X) = P A m , x(Am)(Vx)
is convex and differentiable; by Lemma 9.9
m
a -- P(Vx) aX
.
(If
fm(y)~--)f(y) f
p
P(Vx)
I~
as
R
is
m---) ~
is differentiable at
- 1/IAml
< p-(O) , and since
an
open interval and
for all
are differentiable at
fA(O) ~ p + ( O )
. By Len~mas 8.7
~P(Vx)
as
X = 0 . Let
IX=O ; then by a standard fact about convex functions we have
p : lim f~(0)
and
f (l)
9
m
m---~ ~ , and by hypcthesis
P =
I/IAI
: I ---OR
y r I ; and if for some
Yo ' then f'(yo ) = lim f'(yo ) m
, and hence p
f
< p+
p ~p+(0)
we must have
are convex, with
Yo r I
all the
f
m
") But
. The same proof also gives us p (0) = p+(0) . Therefore
l~(v) l = 1 .
An immediate consequence o f Proposition 9.4 is that { is at most countable; since
P(Vx)
, being a convex function of
to have a derivative at a countable number of points. of Propostion 9.4, and show that if
X ~ R : I~(Vx)l > 1 }
I~(V X )I > 1 o
We now look at the converse
then
P(V l)
is not different-
-
iable at
X = I
. This will be done by identifying
i , can only fail
p
+
and
p
as respectively
O
the left- and right-hand derivatives of
-
Lemma 9. I0
p
P(V X) .
+
, p
-
are increasing functions, p
is upper semi-continuous.
+
is lower semi-continuous,
p
180
Proof
Since
PA,x(Vx)
9.21
is convex, Lenlma 9.8 shows that
9
p
+
WA,k(x,Pt)
IAI
is increasing (as a function of
and
--
k ), and from this it easily follows that
p
are increasing. Now
p+(1)
I
=
+ P0 d~x
Therefore to show that for any
A e C
so we have
p
=
+
+
f (~) x
=
AEC
it is sufficient to show that
is a continuous function of
_
=
sup xEX fx(X)
=
the increasing directed set defined by
~X
+ inf ~A,x(p0) .
=
is upper semi-continuous
WA,x(po )
+ k(p O) WA,
+ wx(P0)
~
lira x fx(X)
k . Let
fx(k) = ~A,l(X,Po);
(where the limit is taken over
<= ). Some elementary calculus shows
{~A,x(X'PoPt) - ~A,~(X,Po)~A,~(x,Pt)}
tEA
and thus
I~-- fx(X)I ~ 21A i Ilpil~ . Therefore {fx}xEX 8l
family, and thus
+
WA,x(p 0)
lower semi-continuous.
is an equicontinuous
is continuous. The same method gives us that
p-
is
~I
P(Vk) , being convex, has a left-hand and a right-hand derivative at each point. Let us denote them by
Pr_r~o~osition 9.5
Proof'
P
=
p(1)
It is easi]~u seen that
p(1)(X)
,
P+
p(1)
and
=
p(r)(x)
respectively.
P(r)
and
p(r)
are increasing, and
p(l)
is
181
lower semi-continuous,
p(r)
exists a countable set
E ~
9.22
is upper semi-continuous. R
such that if
I @ E
Theorem 9 . 6
]~(V% )I : 1
if and only if
9.h there
then
p-(~.) = p(1)(1) = P (r)(1) = +(~)
From this ~nd Lemma 9.10 the result follows.
By Proposition
.
82
P(VI)
is differentiable
at
I = 1~
0
Proof
This is just Propositions
I f we consider
p+
and
9.4 and 9.5.
p
82
as derivatives of the pressure with respect
to an external field, then we can interpret them as "densities", says that
I~(Vl) I > i
discontinuity
occurs at exactly those points 9 I
in the density.
and Theorem 9.6
where there is a jump
Thus, in physical language, we have more than one
Gibbs state at exactly those places where there is a first order phase transition.
For
and
A e ~
let
+ p (I) = WA,l(p0)
,
pA(1) = WA,l(p0) -
; thus
p~ + p+
pA + p-
Theorem 9.7
12(Vl )I = i
if and only if the functions { P ~ } A e ~
are equicont-
O
inuous at
1 0
continuous at given any all
G
; or, equivalently,
1~
~ > 0
whenever
(Functions
if and only if the functions
gs
then there exists Ix - Yl < Y -)
are called equicontinuous y > 0
such that
{PA}AeC at
x e R
are equiif,
Iga(x) - g~(Y)l < ~
for
182
Proof
+
Since
PA
p
+
is continuous at
~
o
+ p_ A
~
R
the fl~ctions
PA
X
at
PA +
p+
, it is easy to
is the exact condition to ensure
o
. The result thus follows from Theorem 9.6.
A particular case of Theorem 9.7 is: I
+
is continuous and increasing: and
check that the equicontinuity of the that
9.23
~i
Suppose that on some interval
are all either convex or concave (or that the
PA
are
all either convex or concave). Then the hypotheses of Theorem 9.7 hold for any ~o
I , so we must have
, -.I~(V~)I = i
for all
~ c I . We will now look at an
important example of this. (This example will include the Ising model for a ferromag~et.) Let and let
tO o
Y
o
= R
with the usual order, let
be an even probability measure on
under the reflection {r
~'-~-~
). Take
0
F =o
be the Borel q-field on
(Yo,~) -
(i.e.
tO o
R ,
is invariant
as the base point. Consider a potential
given by
r
= 0
if
IAI @ 2 ,
Js-t = Jt-s ' and
~
r
(~'q) = Js-t ~q , where
IJtl
Js-t e R
finite. Then, although we do not have
t @ o
(-N,N)
for some
~_
IIr
0~A~
finite, it is quite easy to check that if and (3.1l) hold. (If in fact
with
~
o
tOo
dies off quickly enough then (3.8)
has compact support, then we can replace
N > 0 , and in this case we would have
~
IIr
I
R
by
finite.)
0~AE~ We will assume that p : Y
o
----~R
by
Jt ~ 0 , because this implies that (9.7) holds. Define
p(~) = ~
( p
is not bounded, but ot~ proofs still work if
tO
O
183
dies off quickly enough.)
9.24
It now follows from a result of Ellis
(1975) that there +
exists a class of even probability measures
Z , such that if
is concave on
me s Z
{ ~ : k > 0 } . Therefore if
then
~o e Z
then
l~(Vk)l = 1
for all
k > 0 . But we leave the reader to check that performing the reflection at each site amounts to changing IG(Vk) I =
= i
for a].l
IG(v~)i= > l
is if
,
Z mass at
Vl
to
I < 0 . Thus if
pA
6~-~
V_k , and thus we would also have
~o s Z
then the only possibility for
~=0
includes the case when
~o = ~2{ 6 §
~ . In this case we can take
Y
+ ~ - 1 } ' where
= {-i,+i}
8~
is the point
, and this gives us the Ising
O
model. The above result is then the one described in (5.24), and in this situation Ellis' result becomes the GHS inequalities Other measures in
Z
(1970)).
include measures which have densities with respect to
Lebesgue measure of the form zing constant, ~id
(Griffiths, Hurst and Sherman
U(z)
U(z) =
ek z
e exp(-U)
, where
c
is the appropriate normali-
is an entire f~iction with expansion
2k
, with
ak ~ 0
for
k ~ 2 , and
aI
real .
k > i
We have assumed that
Y
is totally ordered, but it should be clear that O
everything can be adapted to the case when
Yo
is a finite product of totally
ordered spaces. The condition
(9.7) on a potential
V
ification attractive.
is the product of
factors then Theorem 9.6 has
If
Y
n
will still make the spec-
O
to be changed to looking at the derivative of the pressure in an appropriate n-dimensional
space. We leave the reader to sort out the details.
184
9.25
Let us return for a moment to the general situation considered at the beginning of the sectibn. Suppose that (9.1) and (9.2), and
~
+l
has a partial order
~
satisfying
is an attractive specification satis~'ing (3.8) and
(3.11). Suppose also that we will denote by
X
and
X actually has a maximum and a minimmn element, which -1
respectively.
what we have just considered whenever
Y
(For example, this will happen in
is compact; in particular with the O
Ising model.) Then for
~(f)
f E B@(~)
we have
: ~A(+l,f) , ~(f)
: ~A(-1,f).
In most reasonable cases this will imply that for all
~+(A)
=
lim WA(4.I,A) ,
Ac~
B-(A)
=
A e
lim ~A(-I,A)
-
Aes
(In order to prove this we would need some density property of
B (A) O
where
Bo(~)
is the space spanned by
Bo(~) = B(~) .) In the above situation we thus have
limits over all of
~
B(~)
~
--
and
~
as actual
(rather thsr~ just over subsequences) of measures of the
form ~A(x,.)
We will end the section by giving a proof of Theorem 9.4. Instead of proving this result, we will in fact prove the following: assigning
Y
(recall that we are
is totally ordered) O
Theorem 9.8
Let
Theorem 9.4; for
A s C i = i, 2
and let
fl" f2 : X(A) -----+R ~i
,
Bi(A) . For the Ising model we in fact +
have
in
"-
satisfy the hypotheses of
be the probability measure that has density
185
fi
with respect to
9.26
~A " Then there exists a probability measure
(X(A)•215
on
such that
(9.~)
v(F•
= ~l(F)
for all
F ~ F (A) ;
(9.12)
~(X(A)•
= ~2(F)
for ~ l
F s ~(A)
(9.13)
~(~ (x,y)
: y i x )) = 1 .
(9.11) and (9.12) say that the projections of factors are because if
~i
and
~2
h : x(A)
v
;
onto the first and second
respectively. Theorem 9.8 easily implies Theorem 9.h )R
is bounded and increasing, then putting
E = { (x~y) : y ~ x ) we have
I hlxl
hIyl d I ,yl = I m(xl -
d Ix,yl
0
E
(since
h(x) - h(y) ~ 0
if
(x,y) ~ E ). In fact the existence of
v
satis-
fying (9.11), (9.12) and (9.13) is equivalent to having
l h dB I
~
I h d~ 2
for all increasing bounded
h .
This follows from Strassen (1965) , Theorem ii.
We prove Theorem 9.8 by induction on IAI = i . Let
gl' g2 : Yo - 9
R
IAI . First consider the case when
be non-negative, measurable, and with
186
I
gl da~o
=
by projecting
I ~
g2 d~o
=
i . Let
o(B)
=
y xy , thus for O 0
~o ( { Y ~ Yo : ( Y ' Y )
Define a probability measure
6
on
(y xy ~F xF ) o o~o~o
be the measure on
o
onto the diagonal of
O
9.27
B E F xF
=0 ~0
.
E B })
(YoXYo,FoX~o)
got
by
-1 =
,,,in{glI./.g
{yl %
+
,
+
where
gl(x)
=. {gl(x)
(Note that since
- g2(x)}
+
,
g2(y ) = {g2(y ) - gl(y)}
gl + g2 = g2 + gl
I ~2(z) d~ o (z)
and thus the definition of
~
=
we have
I-
d~~
gl (z)
is symmetric in
gl
'
and
g2 " If the integral is
zero then we leave out the second term in the definition of
Lem~a 9.11
Let
g
be as above, and for
measure having density
gi
with respect to
onto the first and second factors are
Proof
Y1
This is a simple computation.
Lemma 9.12
Suppose for all
i = 1, 2
x , y e Yo
let
6 .)
Yi
be the probability
o~~ . Then the projections of and
Y2
respectively.
82
with
gl(x)g2(y) ~ gz(y)g2(x) .
y ~ x
we have
187
Ther~
~({ (x,~) : . ~ , s
Proof
It
is
sufficient
suppose there exist gl(y) > g2(y)
})
=
to
show t h a t
x , y
.
with
, g2(x) > gl(x)
gl(x)g2(y)
which contradicts
1
9.28
gl(x)g2(y)
y < x
a~d
= 0
unless
gl(y)g2(x)
y < x . Thus
> 0 9 Then we have
, and hence
< gl(y)g2(x)
the hypotheses
,
of the lemma.
82
Together Le~nas 9.11 and 9.12 give a proof of Theorem 9.8 lot the case IAI ~ i ; the construction Suppose now that i = i, 2
let
has density
p(~i ) gi
of
~
will also be useful for the general case.
IAI > 2 , let
t E A
denote the projection
with respect to
gi (x)
=
I
and put
of
~A ' where
fi (xx~) d~t(()
Di
onto
A = A - {t} . For X(A)
gi : X(A)----)R
. Then
p(~i)
is given by
"
Yt
~e
proof of &~eorem 9.8 can proceed by induction because of the next lemma.
Len~na 9.13
If
g](uVv)g2(uAv)
Proof
Let
fl(xVy)f2(xAy)
> gl(u)g2(v)
t'or all
G = { (~,n) e YtxYt
L = { ((,q) ~ YtxYt
> fl(x)f2(y)
u , v e X(A)
: q < ~ } ,
: ~ < q } , Then
for all
x , y e X(A)
then
.
E = { (~,q) e YtxYt
: $ = q } ,
188
gl(Uvv)g2 (uAv)
=
I I
9.29
fl(UVv'~)f2 (uAv''l) d~t(~)dmt(q)
G~E~L
=
I I fl(UVv'~)i'2 (uAv'~) d~t(~) dmt(n) E + I I {fi(~vv'~)f2 (u^v'~) + fl(uvv"~)f2 (u^v'~)} ~ t (~) ~ t (~) "
G Similarly
gl(u) g2 (v) = ~I fl(u'~)f2(v'q) E +
{fl(u~)f2(v,q) li G
d~t(~) @~'t(~)
* fl(u,n)f2(v,~)}
d~t(~) dmt(q) 9
But by hypothesis ~e have
f1(uVv,~)f2(uav,~) ~ fl(u,~)f2(v,~)
,
and thus we can ignore the terms involving integrations over
E . The proof of
the lemma would therefore be complete if we could show that
fl(uvv,~)f2(u~v,n)
+ fl(uvv,n)%(~v,~)
fz(u,~)f2(v,n)
whenever
+ fl(U,n)f2(v,~)
q < ~ . Let us write
a
=
f l C u V v , ~ ) f-2 C 1 .~ v , q l . . ,
e
=
f1(u,~)f2(v,n)
,
d
b
=
=
f z C u v v , q ) f_2 ( ~ _v , ~ ) _
fl(u,n)f2(v,~)
.
189
It is easily c h e c k e d that if 8nd
ab ~ cd . W e want,
f r o m L e m m a 9.1k.
L e m m a 9.14 and
ab ~
Proof
w e get
of course, to show that
a, b, c, d
cd . Then
a > 0 . Now
n < ~ then by hypothesis we have
a = 0
b e n o n - n e g a t i v e real numbers w i t h
then
c = d = 0
w h i c h gives
aa + ab ~ ac + a d . D i v i d i n g by
what m o d i f i c a t i o n s
aa + cd ~ ac + ad , and since a
82
i = i, 2
define
by
Let
=
fi(u,~)/gi(u)
u , v a X(A)
Fl(U,~Vr,)F2(v,~^n)
Clear.
cd ~ ab
gi > 0 . (We leave the reader to see
9
is thu~s the R a d o n - N i k o d y m d e r i v a t i v e of
Proof
gives the result.
are n e e d e d if this is not the case.) For
Fi(u,~ )
L e ~ n a 9.15
a >__ c , a ,> =d
and the result is true; w e can thus assume
For s i m p l i c i t y let us assume that
F.I
a + b ~ c + d , a n d this follows
a + b >_ c + d .
(a-c)(a-d) ~ 0
F.i : X(A)---~ R
a ~ c , a ~ d ,
82
Let
If
9.3o
with
~i
w i t h r e s p e c t to
v ~ u . Then for all
>__ Fm(u,~)F2(v,n)
p(~i)•
.
~ , ~ e Yt
9
82
N o w suppose that T h e o r e m 9.8 is true for all sets w i t h c a r d i n a l i t y less than
]A I . Then b y L e p t a 9 . 1 3 there exists a p r o b a b i l i t y m e a s u r e
~
on
190
(X(A)•
such that the projections of
second factors are
Define
9.31
p(~l )
~nd
p(p2 )
Q~ R : X(A)xX(A)xYtxY ~
Q(u,v,~,n)
=
'~R
onto the first ~ d
respectively, and
~({ (u,v)
: v ~
}) = :
by
min{Fl(U,$),F2(v,q )}
;
+
R(u,v,$,n)
whcre
=
S(U,V)
S(u,v)-l{Fl(~,~)
= I{F2(v'q)
- F2(~,n )} {F2(v,n)
FI(U'~))+ dwt(q) " Let
-
(YtxYt,~,xF__--t) got by projecting probability measure
=
Lemma 9.!6
Proof
v
v
on
Q ~x~ t
- Fl(U0~)} + ,
~t
onto the diagonal of
(X(A)•
+
~t
(A)xK(A))
be the measure on
YtxYt . Define a
by
R 0•
satisfies (9.11) and (9.12).
This is a straightforward calculation.
Finally, the proof of Theorem 9.8 is completed with:
Lemma 9.17
Proof
~
satisfies (9.13).
We need only show that
follows from Lemmas 9.12 ~ d
R(u,v,~,q) = 0
9.15.
82
if
v < u
and
~ < q , and this
191
9.32
It is worth noting that Theorem 9.8 gives the following generalization of the FKG inequalities:
Theorem 9.9
that
Let
f : X(A)---@E
f(xVy)f(xay) ~ f(x)f(y)
measure that has density
be non-negative, with
for all
J f d~A = i , and s u c h
x , y ~ X(A) ; let
f with respect to
~A " If
~
be the probability
g , h : X(A) ----~R are
bounded, _Fo(A)-measurable and increasing, then
~ g h dP >_~ ~ g d~ l h d~
Proof
If
c ~ R
then ~ g h a11 -
is unchanged if f2 = f ' and
g
I g dlJ l h ~
is replaced by
g + c ; we can thus assume that
g > 0 . Let
fl = gf{ ~gf d~A}-I ; then
fl(xvY)f2(x^y) = { ~ gf do~n}-ig(xvy)f(xvy)f(x^y)
T h ~ by Theorem 9.8 we bare
Igh d~
For
Y
=
I g d~ Ibfl d~A
~
I g d~lhf2 d~A
=
~ g d ~ l h d~ 9
82
finite, Theorem 9-9 was first proved by Fortuin, Kastelyn e~ud Ginibre 0
(197].). In the generality given here it can be found in Cartier (1972).
R.I
Refere~ees
Averintsev, M.B. (1970). On a method of describing discrete parameter random fields. Probability Beredachi Informatsii, 6, 100-109. Cartier~ P. (1972). Mecaulque' " statistique et mnegalltes" ~ ' 9 de correlatlon," " Sem~naire Bourbaki, No. 4312 1972/73. Choquet, G. (1969). Lectures on Analysis. Edited by J. Marsden, T. Lance a~id S. Gelbart. New York: Benjamin Inc. Dobrushin~ R.L. (1965). Existence of a phase transition in the two-dimensional and three-dimensional Ising models. Theory of Prob. and its Appl., i0, 193-213. Dobrushin, R.L. (1968a). Description of a random field by mearls of conditional probabilities and the conditions governing its regularity. Theory of Prob. and its Appl~, 13, 197-224. Dobrushin, R.L. (1968b). Gibbsia~ random fields for lattice systems with pairwise interactions.
~nct.
Anal. and its Appl., 2, 292-301.
Dobrushin, R.L. (1968c). The problem of uniqueness of a Gibbsian random field and the problem of phasetransition.
Funct. Anal. and its Appl., 2, 302-312.
Dobrushin, R.L. (1969). Gibbsian random fields. The general case. Funet. Anal. an~ its App].~ 3, 22-28. Dobrushin~ R.L. and Minlos, R.A. (1975). Construction of a one-dimensional quantum field via a continuous Markov field,
to appear.
Dunford, N. and Schwartz~ J.T. (1958). Linear Operators I. New York: Interseience. Dynkin, E.B. (1971a). Entrance and exit spaces for a Markov process. Actes Congres Intern. Math. 1970, Tome 2, 507-512.
198
R. 2
Dynkin, E~B. (1971b). The initial and final behavio~r of trajectories of a Markov process. Russiarl Math. Surveys, 26, 4, 165-185. Dyson~ F.J. (1969). Existence of a phase transition in a one-dimensional ising ferromagnet. Comm. Math. P.hys., 12, 91-107, 212-215. Ellis~ R.S. (1975). Concavity of magnetization for a class of even ferromagnets. Bull. American Math. Soc., 81, 925-929. F~llmer, H. (1973). On entropy and information gain in random fields. ,Z. Wahrs. verw. Geb., 26, 207-217. FBllmer, H. (1975a). Phase transition and Martin boundary. Sem. Probabilites Strasbourg, IX,
to appear.
FDllmer, H. (1975-b). On the potential theory of stochastic fields. Invited address: 40th. Session, ISI, Warsaw, Poland. Fortuin, C.M., Kasteleyn, P.W. and Ginibre, J. (1971). Correlation inequalities on some partially ordered sets. Comm. Math. Phys., 22, 89-]03. Gallavotti, G. (1972). Instabilities and phase transitions in the Ising model. A review. Rivista del Nuovo Cimento, 2. Gallavotti, G. and Miracle-Sole, S. (1967). Statistical mechanics of lattice systems. Comm. Math. Phys., 5, 317-323. Georgii, H.-O. (1972). Phasen~bergang i. Art bei Gittergasmodellen. Springer Lecture Notes in Physics, Vol. 16. Georgii, H.-O. (1973). Two remarks on extremal equilibrium states. Comm. Math. Phys., 32, 107-118. Georgii, H.-O. (1975). Stochastiche Felder t~id ihre Andwendung auf Inter~_ktions systeme,
to appear.
194
R. 3
Ginibre, J. (1970). General formUlation of Griffi~hs inequalities. Comm. Math. Phys., 16, 310-328. Griffitbs, R.B. (1964). Peierls' proof of spontaneous magnetization in twodimensional Ising ferromagnets. Phys. Rev. A136, 437-438. Griffiths, R.B., Hurst, C.A. and Sherman, S. (1970). Concavity of magnetization of an Ising ferromagnet in a positive external field. J. Math. Phys., ii, 790-'95. Griffiths, R.B. and Ruelle, D. (1971). Strict convexity ("continuity") of the pressure in lattice systems. Coram. Math. Phys., 23, 169-175. Grim~nett, G.R. (1973). A theorem about random fields. BUll. Lond. Math. Soc., 5, 81-84. Hammersley, J.M. and Clifford, P. (1971). Markov fields on finite graphs and lattices,
in,published.
Holley, R. (1971). Free energy in a Markovian model of a lattice spin system. Comm. Math. Phys., 23, 87-99. Holley, R. (1974). Remarks on the FKG inequalities. Corm. Math. Phys., 36, 227-31. Kesten, H. (1975). Existence and uniqueness of col~mtable one-dimensional Markov random fields,
to appear.
Lanford, O.E. (1973). Entropy and equilibrium states in classical statistical mechanics,
in Statistical Mechanics sm.d Mathematical Problems, Springer
Lecture Notes in Physics, Vol. 20. Lsnford~ O.E. (1975). Time evolutions of large classical systems,
in Dynamical
Systems, Theory and Applications, Springer Lecture Notes in Physics, Vol. 38. Ls-uford, 0.E. and Robinson, D.W. (]968). Mean entropy of states in quantum statistical mechanics. J. Math. Phys., 9, 1120-1125.
195
R. 4
Lanford, O.E. and Ruelle, D. (1969). Observables at infinity and states with short range correlations in statistical mechanics. Comm. Math. Phys., 13, 194-215. Lebowitz, J.Lo and Martin-LSf, A. (1972). On the uniqueness of the equilibrium state for Ising spin systems.
Comm. Math. Phys., 25, 276-282.
Marchioro, C., Pellegrinotti, A. and Presutti, E. (1975). Evolution for v-dimensional statistical mechanics. Cozm~. Math. Phys., ~0, 175-186. Meyer, P.-A.
(1966). _robablllte~ P 9 9 t et Potentiel. Paris: Hermann.
Nachbin, L. (1965). Topology and Order. Van Nostrand Mathematical Studies, No. 4, Princeton, N.J. : D. Van Nostr~id Company, Inc. Nelson, E. (1973). The free Markov field. J. Funct. Anal., 12, 211-227. Nguyen, X.X. and Zessin. H.N. (1975). Punktprozesse mit Weehselwirkuug. 0nsager, L. (1944).
to s~ppear.
Crystal Statistics L. A two dimensional model with an order-
disorder transition. Phys. Rev., 65. Parthasaratl~v, K.R. (1967). Probability measures on metric spaces. New York: Acadermi e Press. Preston, C.J. (1973). Generalized Gibbs states and Markov random fields. Advances in Appl. Prob., 5, 242-261. Preston, C.J. (1974a). Notes on randon fields, Preston, C.J. (1974b).
unpublished.
Gibbs states on countable sets. Cambridge Tracts in
Mathematics, No. 68, London: Cambridge University Press. Preston, C.J. (1974c). A generalization of the FKG inequalities. Phys., 36, 233-241~
Cormm. Math.
196
R. 5
Preston, C.J. (i974d). An application of the GHS inequalities to show the absence of phase trsnsitionsfor Ising spin systems. Comm. Math. Phys., 35, 253-255. Robinson, D.W. and Ruelle, D. (1967). Meson entropy of states in classical statistical mecha~lics. Comm. Math. Phys., 5, 288-300. Royer, G. and Yom, M. (1975). R6presentation int~grale de mesures quasiinvariantes sur C(R,R).
to appear.
Ruelle, D. (1967). A variational formulation of equilibrium statistical mechanics and the Gibbs phase rule. Comm. Math, Phys., 5~ 324-329. Raelle, D. (1968). Statistical mechanics of a one-dimensional lattice gas. Comm. Math. Phys., 9, 267-278. Ruelle, D. (1969). Statistical Mechanics: Rigorous results. New York: Benjamin Inc. Ruelle, D. (1970). S uperstable interactions ir~ statistical mechsnics.
Co~.
Math.
Phys., 18, 127-159. Ruelle, D. (1971). On the use of 'small external fields'
in the problem of
symmetry breakdown in statistical mechanics. Armals of Phys., 69, 364-374. Sherman, S. (1973). Markov random fields and Gibbs random fields. Israel J. Math., 14, 92-103. Spitzer, F. (1971a). Random fields and interacting particle systems. Lectures given to the 1971 l.~IAAS1~zmer Seminar, Math. Soc. America. Spitzer, F. (1971b). A variational characterization of finite Markov chains. Annals Math. Star., 93~ 61-74. Spitzer, F. (1973). Introduction aux processus de Markov ~ param~tre clans Z Springer Leet~re Notes in Ms.thematies, Vol. 390.
197 Spitzer, F. (1974a). Markov random fields on an infinite tree.
R. 6 to appear.
Spitzer, F, (1974b). Phase transition in one-dimensions~l nearest neighbor systems. to appear. Strassen, V. (1965). The existence of probability measures with given marginals. Annals Math. Stat., 87, 423-439. Sullivan, W.O. (1973). Potentials for almost Markovian random fields. Comm, Math. Phys., 339 61-74. Suomela, P~ (1972). Faetorings of finite dimensional distributions. Commentationes Phsico-Mathematicae, 42, 1-13. Thompson, C. (1973). Ph.D. Thesis, Cornell University.
I.l
Index
anti-ferromagnet
5.20
base point
5.9
boundary
5.15
Choquet simplex
2. ii
countably generated q-field
2.13
cube
7.4
determining class
9.3
DLR-equations
1.3
dynamical system
1.4
entrance boundary
1.8,2.6,2.14
equilibrium state
1.3,1.4
external field
5.19
extreme point
2.6,2.11,4.2 5.18,5.19
ferromagnet first
order
phase
transition
FKG-inequalities
9.22 9.1,9.13,9.32
GHS-inequalities
9.1,9.24
Gibbs state
1.7,2.2,2.7
high density
9.6
invariant
4.2
low density
9.6
Gibbs' variational principle
7.1,8.1
Griffiths inequalities
9.1
Hamiltonian
1.2, 1.4
H-invari an ce
4.4
1.2
199
infinite tree model
5.21
interaction attractive
5.18,5.22,9.1
repulsive
5.20, 5.23
superstable
6.24
Ising model
5.17
lattice model
1.2,1.3,5.1
local observables
3.3
Markov chain
3.1, 5.23
measures independent
6.5, 6.8
tempered
6.24
neighbour
5.16
observables at infinity
2.5
point process
6.1
Poisson
6.9
Polish space
9.10
potential
5.9, 6.9
finite range
5.13
hard-core
6.16
interaction
1.4, 5.9, 6.9
nearest neighbour
5.16, 5.21
pair
5.19, 6.15
stable
6.i0
pressure convexity of, strict convexity of,
8.i, 8.5, 8.i0 8.14 8.22
probability kernel
1.5
projective limit of standard Borel spaces
3.2
1.3
200
quantum field theory
i.i0
random field
2.1
H-ergodic
4.4
independent
8.2
Markov
5.16
random walk
3.1, 5.27
separable o-field
6.2
short range correlations
2.13
simplex
5.16
specific energy
8.2, 8.5, 8.10
specific entropy
8.2, 8.11
specific free energy
7.2, 8.1, 8.12
specific information gain
7.2, 7.4, 8.1
specification
1.7, 2.2, 2.6
attractive
9.1, 9.4, 9.13
bounded range
7.15
finite range
5.13
nearest neighbour
5.15, 5.23
spontaneous magnetization
5.19
standard Borel space
2.5, 2.14
stochastic field
2.1
stochastic process
1.7
symmetry breakdown
4.1, 5.20
tail-field
2.5, 2.11, 2.13
trivial
2.5, 2.9
tangent functional
8.13
thermodynamic limit
1.5
van Hove convergence
8.12