This content was uploaded by our users and we assume good faith they have the permission to share this book. If you own the copyright to this book and it is wrongfully on our website, we offer a simple DMCA procedure to remove your content from our site. Start by pressing the button below!
2 are identical. Axiom 3. The function m is upper semi-continuous with respect to the distribution metric. That is to say, for every e > 0, there exists a 5 = (5(s, a, >), such that if \\(a, <j>), (/?,i/>)||. < S then m{fi,\p) < m(a, >) + e. This last condition tells us that if the two measures (/a'^)*(/«) and (/ ' T( V ) a r e t n e same, then m{a,4>) = m(fi,ip). Hence the value m is well defined on those measures on GG which arise as such an image, and we can write /; V '\c = <£, so that ) is a bounded rearrangement) and Q : X —* T,Q is a finite partition. Given any e > 0 there is a 5 > 0 so that in any other free and ergodic G-action (Xi,^\,fi\, T"1) with (1) ||(T*,<2 WgM)),{T;\Qi (2) * 0, <5i = 5\(a,(j)',e,ri), from Axiom 3 and from Theorem 3.0.17. Thus if '(x)}) < 2^(F)||a(x, (/)(x))||oo. Proof See Lemma 4.3 and Corollaries 4.4 and 4.5 of [43]. 'NJr\ and define 0" = 0,-i/> for all i. We can assume ju({x : \p(x) =fc x}) < 2~N~10. The rearrangements $'( are B&B with base sets F" = ^"'(f,-). The base set FJV+I may not lie in the various F", i < N. This is really all we lack. For each x G FJV+I consider the return-time block for F'^ containing it, and the return-time blocks just preceding and following this one. Be sure that /I(FJV+I) is so small that the set of all points in these blocks has measure less than 2~N~U. Modify each 4>\ to )'(, i < N by making it the identity on these blocks. Now modify the sets F", i < N on these blocks so that all the new return-time blocks are still long enough for the collapsings on them to still be m, 2~' — 2~-'-B&B, so that they are nested and all contain the new O point in FN+i. The following result now follows rather directly from this corollary. It is complex to state, but the basic picture is easy to understand. Theorem A.I.13. For each N and finite list of bounded rearrangements (a,4>\),(a,4>2),---,(a,4>N) with J(a,0,-) = 0 there is a value i,0-2) -> 0 and all ( a i , 0 , ) and (a2,4>~1) are bounded we will conclude a\ ~ a.\. In particular,
Later on we will find ourselves in the situation where the rearrangement (a, 4>) is well-defined pointwise and the measure ).i is allowed to vary. In this case we will be more specific and write m/((a,<£) or mai/,($i,$2)Lemma 2.2.1. Let m be a size. The identity map
is uniformly continuous.
22
Definitions and Examples
Proof Let 0i,02 € F. As
Lemma 2.1.8 tells us that for any <5i > 0, there exists 5 > 0, such that if ||0i,02||? < 5 then ||(a0,,0j02),a0i,id)||, < Sx. Fix an arrangement a0 and 0o = id. Let e > 0. Now select (5] = <5(e,ao,id) from Axiom 3. It follows that if ||0i,02lls < <5 then ||(a0i,0[0 2 ),a0i,id)||. = ||(a0i,0]0 2 ),ao,id)||. < 5u and hence
• Definition 2.2.2. Let a £ r f , {0,} £ F and m be a size. Define F a to be the equivalence classes of elements ofY at an ma distance zero. Thus (F a ,m a ) A
A
is a metric space and we let (Ta,ma) be its completion. That is to say, F a consists of sequences {0,} which are mx-Cauchy, modulo the equivalence relation {0,} ~ {y>,} if ma(0,-, xpi) -> 0. We write the elements of Fa as (0,-)a. Lemma 2.2.3. Fixing an arrangement a and size m the map 0 —> <x0 from V to si is well-defined as a map Fa —> si and extends to a uniA
formly continuous map from (Fa, ma) —> (si, ||-, •|li)- We refer to this map as P m,a((0i)a ) = lim,a0/. Proof As the map 0 —> a0 is uniformly continuous F a —• si it extends to the completion of F a (remember, si in the ||-,||i-metric is complete).
a Thus if (0,) a € F a then a0, —» j3 for some /J in j / . Hence we will at times abbreviate this element (0,) a G F a as j6, indicating that it is a lift to F a of the arrangement p. The arrangements in the range of P,m are a first cut toward the mequivalence class of a. The subset which is the actual equivalence class may be somewhat smaller.
2.2 Definition of a size and m-equivalence
23
Next, we see that since m satisfies Axiom 3, right multiplication in the full-group F is an isometry. Specifically, we have the following Lemma. Lemma 2.2.4. Let a be any arrangement. For all 0o € F, the map 0 —» 00o is an ma-isometry, that is to say: mM\ 0o, 020o) = m«(#i, 4>2)Proof For all 0o, 0i S F, we have that ||(oc0o,0o'0i0o),(a,0i)||. = ° Thus, by Axiom 3, for any 0o,0i,02 G F,
That is to say, m a (0i0 o ,020o) = «a(0i,02)-
•
It is not necessarily the case that left multiplication is continuous. In fact, left multiplication need not preserve the equivalence classes of the pseudometric ma. For example, consider the size
For G = Z and 0, = Tf, we have that m°(id,0,) = 0, but m°(0,00,) = m°(id, 00i 0~'). Here one can prove that m^(id, ip) = 0 if and only if y; = r t a . So if m°(0,00i) = 0 then 0 0 1 0 " ' = 0 7 Y 0 - ' = 7£. Thus, (using ergodicity and freeness), fc = 1 and 0 commutes with T". Hence 0 is itself T" for some n, yet the full group is much larger than this. We do have something akin to semi-continuity of right multiplication. Suppose ||a0,,/?||£, -* 0. Then, for a.e. x, we have that a(0,-0,(x),0,-02(x))
- /?(0,(x),02(x)),
and so (a0,0i,(0f0i)~'(0,02)) = (a0,0i,0]~'02) converges in distribution to (/?0i,0f'02). Axiom 3 now tells us that limma(0i0,,0(02) < m/i(01,02).
(#)
I—>O0
Definition 2.2.5. H'e saj' t/iat m is a 3 + size if whenever {0,} is ma-Cauchy and hence ||a0,-, /?||i —> 0, t/ie inequality (#) is o/i equality. i
In particular, this will be true if Axiom 3 is replaced by the following:
24
Definitions and Examples
Axiom 3 + . Let a be any ordering,
Proof Let {$,-} be an ma-Cauchy sequence, (representing the class (0,-)a). Fix j and using Lemma 2.2.4 compute that
Since {),} is ma-Cauchy, for any e > 0, there exists / such that for all i,j > I, ina((f)i,(l)j) < e. Thus, for all j > I,
The result follows.
D
In particular, this tells us that the action of the full-group F on {f^mx) is minimal, i.e. every orbit is dense. For a size m, we would like to define a notion of m-equivalence between two arrangements a and /?. Within the context of rearrangements, a natural candidate would be to say that two arrangements a and fi are m-equivalent if P is in the range of PmA. That is, there is a sequence of rearrangements (a, 4>>) with a$,- —» fi in L1, such that {<£,} is ma-Cauchy. The problem with this definition is that on the face of it the "equivalence relation" is not symmetric. More precisely, it is not clear that, in general, the wa-Cauchiness of {<£,} will imply m^-Cauchiness of {0~'}The following theorem describes this situation more precisely.
2.2 Definition of a size and m-equivalence
25
Theorem 2.2.7. Suppose a is an arrangement, m is a size, {>,} is ma-Cauchy and a>, —• /? in L1. The map i
P : (T,mp) -> (f a ,m a ), given by
is a F-equivariant contraction, so that
Hence P extends to a T-equivariant contraction
P :(fp,mp)-»(f a) ma ). The following are equivalent: (1) (2) (3) (4)
id € Range(P); P is onto; P is an isometry; {4>~1} is mp-Cauchy.
Lastly, if m is a 3 + size, then for all fi in the range of PmA, P is an isometry. Proof That P is a T-equivariant contraction we verified earlier (in (#)) as a consequence of Axiom 3. This certainly implies that P extends to a F-equivariant contraction P : (tii,mp) -»• (f a ,m a ). Moving on to the four equivalent statements: if id e Range(P) then there exists y E (tp,mp) such that P(y) = id. Suppose y = (xpt)p. To show that P is onto, we need only show that P maps onto F. Let
Compute that l\mma{P(\pi
i i—> coy—+oo
= limlimma((i,y>i, id) i>ooy>oo
26
Definitions and Examples
since F acts isometrically,
= limma(P(y;,), id) 1—>00
= 0, since P(y) = id. Hence if id s Range(P) then F £ Range(P) and P must be onto. Thus (1) and (2) are equivalent. Next, we argue that if id e Range(P) then the sequence {0, r '} must be m/?-Cauchy. Suppose P{y) = id, where y = (tp,)/;. The fact that P is an equivariant contraction implies that
To see this, simply compute that Y\mma{(j)j,\p~[) = l
= \imma(P(ipi),P({p)) l-»0O
= 0. Define
g :(r,ma) by
Exactly as for P, argue that Q is a F-equivariant contraction, so that mp(Q(a),Q(b))<ma(a,b). Hence Q extends as a F-equivariant contraction Q :{r,ma)^(r,mp).
2.2 Definition of a size and m-equivalence
27
Since y G (t,mp), by Lemma 2.2.4, we see that Q({vr1)*) = idThe above argument (applied to Q) now shows that
In particular this implies that {0j~'} is m/j-Cauchy. Thus, since Q is a contraction, for (f> G F, mp(Q(P(cl>)),cl>) = 0.
Hence for all <> / G {tp,mp), we see that
Thus, for a,b £ (Yp,mp), mp(a,b) = mll{Q(P(a)),Q(P(b))) <mx(P(a),P(b)) <mp(a,b), so that P is an isometry. If P is an isometry, then of course, {>"'} is m^-Cauchy, by Lemma 2.2.4. Hence, id e Range(P). This completes the proof that statements (l)-(4) are equivalent. If m is a 3 + size, then P is directly seen to be an isometry for all fi in the range of PmA. • Definition 2.2.8. There are three natural levels now on which to define m-equivalence classes. The first is the most functorial, as a subset of ta we can set
G fa : afa -» 0 and (^p
G
Second, we can consider the relation on arrangements given by Em(a) = {P\a ~P} = P,,,,a(£,,,(a)). Third, we can consider the category of free and ergodic G-actions T and S and say S is in-equivalent to T if there is an arrangement ft G Em(ar) with T^ conjugate to S. We indicate all three of these relations by the symbol ~. Thus we will write (0,) a ~ (i/j,-)a, a ~ fl and T ~ S.
28
Definitions and Examples
We investigate the first two of these. We will show that Em(a) is a dense G$ subset of Ta directly by exhibiting it as a countable intersection of open sets. We will also show that each equivalence class Em(a), as a subset of the arrangements, can be endowed with a natural wvmetric making it a universal Gs- We obtain this latter result by showing the map A
A
Em(oC) —> £,,,(a) is obtained by considering £m(<x) modulo a natural group of isometries of £,,,(a). Our first task is to see that on the level of arrangements ~ is an equivalence relation. It then follows automatically for free and ergodic actions. We begin by putting a natural metric on Em(a). As PmA is continuous from fa —y #/, for any P in the range of PmA, its pull-back P^liP) will be a closed set. We can use the Hausdorf metric on these closed sets to put a metric on the equivalence class Em(a). More precisely, for P\,Pi € Em(a), define
At this point it is not quite evident that this is a metric and not just a pseudometric. We have the following trivial consequence to Axiom 2. Lemma 2.2.9. For every e > 0, there exists a 8 > 0, such that ifm{a., /?) < 5 then \\a,P\\[ < e. That is to say, the identity is uniformly continuous from m to |K-||i. Hence m(a.,f5) is a metric. In particular, a ~ P if the map P, defined in Theorem 2.2.7, is an isometry. We can now put this together in a simple form: Lemma 2.2.10. There exists a T-equivariant isometry P : (F, ma) —> (F, nip), with PmypP = PmA, if and only if a ~ p.
Proof The existence of such a P follows from Theorem 2.2.7 if a ~ p. Conversely, suppose such a P exists. As PmA(id) = a, we see that Pm,p(P(id)) = a. Now P(id) = <#-')/, e (f/j.mp). Thus, P^d^)^ = a.
2.2 Definition of a size and m-equivalence
29
For any j , we compute that ]
'
(id)),
—• 0 in j , by Lemma 2.2.4. Thus p-'(id) = (4>j)a e (ta,ma),
and id e Range(P).
•
The following theorem is now evident. Theorem 2.2.11. The relation ~ on srf is an equivalence relation. From this we see that ~ breaks si into disjoint equivalence classes on each of which we have defined a metric. Our final step is to see that relative to this metric each of these classes is a Polish space. We begin with the classes Em(a). Theorem 2.2.12. The set Em(a) is a G$-subset ofrx. ma-separable the ma-topology on Em(a) is Polish. Proof
As the full-group is
For any cf> and \p in the full-group and e > 0 let
&(<j),\p,s) = {(#,-)« e fa : a
It remains to see that the sets &(4>,\p,£) are open in fx. Suppose (4>i)a e &(4>,ip,s) and hence there is an e > 0 with
, \p) < ma(((/),)o,0, ((t>i)a\p) + e + e.
30
Definitions and Examples By Axiom 2, Lemma 2.1.8 and Axiom 3 there is a do > 0 such that if
then ||jS,jS'||i will be sufficiently small to imply that mp>((t>,ip) < As
making sure that <50 < e/3 we will have mir((t),\p)
<mli((t>,\p)-e/3
) + e + 2e/3 and /?' e G(<j>,\p,e).
D
Suppose that ^i = ($•)« and ^2 = (^)« are in £,,,(a) with a0- —> y5 and a0? —> j3. That is to say, PmA(P\) = PmAfc) — ft- This means that for all 4> and ip that
Begin the definition of an m^-isometry /g g by setting
The above calculation implies that !-„ % is an ma-isometry where it is defined, and, as the fi\(j) are dense in fa, 1% g will extend to an isometry of fa. Notice that this makes /g g commute with the action of e full group on fa. Lemma 2.2.13. For all /?i, fa in Em(a) with Pm,x{P\) P
Proof
=
PmAfc),
w
e have
I- - — P
As PmA is equivariant with the action of the full-group,
PmAiK-h{h
= H = PmAMY
This now extends to all of £,,,(a) as the fi\(f) are dense.
•
2.2 Definition of a size and m-equivalence
31 A
Definition 2.2.14. Let J consist of those ma-isometries of Fx commuting with the action of the full-group and satisfying P
I— P
This is a complete and separable metrizable space under pointwise convergence. Notice then that for any ft\ G Em(a) and / e / , setting ft = l(ji\), we will have ft € Em(a) and
Furthermore, as right multiplication by elements of the full-group is an ma-isometry of F a ,
is a constant on fa. We now argue that the orbits of J are closed sets. Suppose /,(/?o) A
A
converges to some ft. Then in particular the /,(/?o) will be m^-Cauchy. But the above remark implies that /,-(/?) is ma-Cauchy for all fi and in particular converges to some /(/?) G Fa. That is to say, /, —> / uniformly and we conclude that / e / and all orbits are closed. Lemma 2.2.15. Using the infimum metric, the space t^/J separable metric space.
Proof
is a complete
To see that the infimum is a metric suppose
then of course ley
and there will be a sequence /,(ft) —> ft. But this says /,(ft) is i A
A
and hence /,-(/?) is m^-Cauchy for all ft That is to say, the /, converge uniformly to another I e J implying ft = /(ft). Thus the infimum is a metric on Ta/J. Separability follows from the fact that f;, is separable. • We now show that Em(a) is also a Gs-subset of j / by showing that it embeds as Em(u)/J which we show to be a residual subset of ta/J. We achieve the embedding by lifting /? e Em(a) to P^la(P) n £,,,(a) which we note maps to a singleton in Va/J. It follows directly from the definitions
32
Definitions and Examples
that this is a continuous embedding. This implies Em{a) is a universal G$, that is to say a G$ -subset of any metric space in which it is embedded. Lemma 2.2.16. The sets 0(0, tp,e) of Theorem 2.2.12 are Hence
^-invariant.
is a Gs-subset. Proof
Remember that
&(4>, v ,e) = {/? : P^CP) = P and For / e / w e have both Fm?a7 = Pm>a and / is an ma-isometry. These combine to say that if ft = l(fi) then PmA{P') = P and
It is now clear that
It is obvious that if (9 is an ./-invariant open set then (91J is open, and that the collection of ./-invariant sets form a Boolean algebra, with moding out by ./-equivariant with the Boolean operations. This completes the result. D Perhaps the best heuristic to take away from this Section is the image of the m-equivalence classes as foliating the set of arrangements. Each leaf of this foliation is metrized by its ma as a Polish space and T acts on each leaf minimally and isometrically.
2.3 Seven examples Having developed the axiomatics of m-equivalence we now give a list of examples to indicate the range of equivalence relations that can be brought under this perspective. In [43] a number of examples and classes of examples are discussed. Some of those are quite speculative. The Appendix of this work demonstrates how to bring all those examples under the umbrella we open here. The examples we discuss in this Section are those which are most obviously significant and directly related to classical issues in ergodic theory. As part of this discussion we give
2.3 Seven examples
33
some general principles that underlie many of these examples as the beginning of we expect a fruitful study of what a size might look like in general. Many examples of sizes have the common feature of being integrals of some pointwise calculation of the distortion of a single orbit. To make this precise we first review some material about bijections of G. Remember that 38 is the space of all bijections of the group G with the product topology, ^ is the space of bijections fixing id and we metrized both with a complete metric d. The group G can be regarded as a subgroup of 38 acting by left multiplication, (g(g') = gg')- The map H : 38 —> 10 given by H(q) = qq(\d)~[ is a contraction in d. Also G acting by right multiplication conjugates 38 to itself giving an action of G on 38. (Tg(q){g') = q(g'g)g~l.) We view this action by representing an element q e 38 by a map f : G —> G, /(g) = (g)g~'. Those maps / e GG that arise from bijections are a G$ and hence a Polish space we call F. The map q —> / is obviously a homeomorphism from 38 to F. For / e F let Q(f) be the associated bijection and for q e 38 let F(q) be the associated name in G c . The action of G on 38 in its representation as F is the shift action <xg(/)(g') = /(g'g)- Any rearrangement pair (a, >) then gives rise to an ergodic shift invariant measure on this Polish subset of GG and any ergodic shift invariant measure is an ergodic action of G with a canonical rearrangement pair. The probability measures on a Polish space are weak* Polish [57] and hence the invariant and ergodic measures on this Polish space are weak* Polish. We will now define a general class of sizes that arise as integrals of valuations made on the bijections q'd. Definition 2.3.1. A Borel D : 38 —> K + is called a size kernel if it satisfies: D(q)>0; D(id) = 0; D(q(id)-lq-lq(id)) = D(q); D(ql(id)q2qYl(id)ql) < Dfo,) + D(q2); for every e > 0 there is a 5 > 0 so that if D(q) < 5 then d(id,H(q))<s; (6) the function fi —> f D(q(f))d[i is weak* continuous on space of shift invariant measures \i on the Polish space F.
(1) (2) (3) (4) (5)
Note: an element of G regarded as an element of 38 acts by left multiplication.
34
Definitions and Examples
The complex form of conditions (3) and (4) arise from the following considerations. When an orbit is viewed as a copy of G the base point x sits on the identity element of the group. When acted on by some rearrangement the identity moves, i.e. the point x now is based at a different point in G. Hence it is necessary to view both q~l and q as based at this new origin when they act. Writing it out explicitly on an orbit we have the identities
and
and now conditions (3) and (4) become
and
For size kernels D, defined solely in terms of H(q), these two become even simpler as
and
For a size kernel D we define
mD{a,(j>)=
ID(q«'
We call such an mD an integral size. In condition (6) on D we could have asked for only upper semicontinuity and still obtained that mD was a size. All examples though are continuous here so we ask for the stronger condition and obtain a stronger conclusion: Theorem 2.3.2. For D a size kernel, mD is a 3 + size.
2.3 Seven examples
35
Proof First note that
i
and so
and from the above identity and condition (3),
Condition (3) gives symmetry as
and m® is a pseudometric on F. Axiom 2 of a size follows directly from condition (5). Condition (6) is precisely that Axiom 3 should hold. • Examples 1 & 2 (Conjugacy and Orbit Equivalence). These first two examples are the extremes of what is possible. For one the equivalence class will simply be the full-group orbit and for the other it will be the entire set of arrangements. Both of the pseudometrics d(q, id) and d(H(q), id) are easily seen to be size kernels and so both
m\a,(j>)
=
|| (a, 0), (a, id) C
are 3 + sizes. As d makes 38 complete, relative to m1 a class of sequences ((/>,-)„ e F a iff (j>j —> (f> in probability. Thus a ~ /} iff /? = oup i.e. they differ by an element of the full-group and the equivalence class of a is exactly its full-group orbit. Note in particular that T a and T^ will be conjugate actions. To tie this relation into our work here notice that in Chapter 7, where we define the notion of an m-finitely determined action, this
36
Definitions and Examples
definition reduces to Ornstein's classical characterization of the Bernoulli actions as the finitely determined actions for m1. As for m°, for any a and fi one can use the Ornstein-Weiss Rokhlin Lemma (Lemma 2.1.5) to construct a sequence of (/>, with <x$; —• fi in L' with the sequence >, an ma-Cauchy sequence. Thus all arrangements are m°-equivalent. Dye's Theorem [10] and the Theorem of Connes, Feldman and Weiss [6] now tell us that any two ergodic actions of G are m°-equivalent. We tie this example into our work. First a reminder of the distribution topology. For G a countable and amenable group and £ a finite labeling set, the space of probability measures on £ G forms a compact metrizable space. For any measure-preserving action of G and £ valued partition the map from points to £, G-names will project the invariant measure to a measure on £ G and will make a pseudometric space of such processes (pairs of actions and partitions). For two such pairs (T,P) and (S,Q) let \\(T,P),(S,Q)\\, be some metric giving this weak* or distribution pseudotopology. We state a lemma concerning ergodic actions of G. Lemma 2.3.3. Let G be a countable and amenable group, Ta a free and ergodic action of G on the standard space {X,2F,\i) and P a finite H-valued partition of X. For each e > 0 there is a 5 > 0 so that for any other free and ergodic action S^ of G on (Y,$, v) and partition Q : Y —> £ satisfying: (1)
\\(T«,P),(SP,Q)\\t<5
and for every 5\ < 0 there is a 0 in the full-group of fi and a partition Q' of Y with (a) m%S, <£)<£(b) v(QAQ') < s and (V)
\\(T',P),(Sl>*,Q:)\\.<5i.
We leave the proof as an exercise for the reader. A version of this fact for Kakutani Equivalence is found in lemma 4.3 of [38]. The reader can use this as an outline of to how to proceed. One concludes from this that all ergodic actions of G are weakly m°-finitely determined (see Definition 7.2.5) and, as m° is a 3 + size, Theorem 7.2.6 now implies all ergodic actions are m°-f.d. giving an alternate albeit elaborate proof of Dye's theorem using our machinery. Before we continue to other examples we make a few general observations concerning size kernels. First we can w.l.o.g. assume that all size kernels are bounded by 1, as replacing D by the supremum of D and 1 will maintain the axioms and will not change the associated equivalence
2.3 Seven examples
37
relation. Notice next that one evaluates the size mD of a rearrangement by calculating / D(Q(f))d/i(f) where fi is some shift invariant measure on F. Suppose Fo £ F is a shift invariant set with H(FQ) = 1 for all shift invariant probability measures \i. Assume as well that FQ contains the identity and if it contains F(q) then it also contains F(q~l). Notice that changing D outside of Fo will have no effect on the evaluation of mD. In particular if D is initially only defined on Fo, is bounded by 1 there and satisfies the axioms of a size kernel (where applicable), then if we set D(Q(f)) = 1 for / ^ Fo we would extend D so as to be a size kernel. We now give an explicit example of such an Fo. Suppose G is Z" and BN = [—N,N]" is the standard Felner sequence of boxes. For shift invariant measures on GG the pointwise ergodic theorem holds along this sequence BN- For each / £ F set A M ( / ) to be the upper density of the set {v\f(v) £ BM} calculated along the sequence of sets BN as N s oo. Let Fo consist of those / for which lim A M (/) = 0. The pointwise M->co
ergodic theorem tells us that this set has measure 1 for all shift invariant measures. Hence when working in Z" one need only define a size kernel D on such / . Notice that for such / one will have |im
N->C»
#{i3 € BN\Q(f)(v) £ BN}
= Q
#BN
We describe a class of examples that take advantage of these observations and this choice for an FQ.
Example 3 (Kakutani equivalence). The development of Kakutani equivalence in Z" can be found in [9] and a complete development of the equivalence theorem for it in [16]. What we present here is an approach that brings this example into our context. For this example let G = Z" and BN = [—N,N]n be the standard Folner sequence of boxes centered at 0. We begin with a metric on Z" given by x{u,v) = min(|K«/||M||) - (S/||t51|)|| + | (assuming 0/||0|| = 0). What is important about T are the following two properties: (1) T is a metric on Z" bounded by 1; and (2) w and v are x close iff the norm of their difference is small in proportion to both of their norms.
38
Definitions and Examples
For h e % set BN(h) = {v e mapped into BN by /i). Now set
BN|/I(2)
e BN} (those elements of
= sup N
&
N
ieBN(h)
and /<() = k(H(q). Lemma 2.3.4. The function K is a size kernel. Proof That k(h) = k(h~]) is a calculation as in fact this equality holds already for each TV. That k{h2 o h\) < k(h2) + k(h\) is also true as it is true for each TV before taking the sup N . That K satisfies the first five conditions of a size kernel is now direct. We get (3) and (4) from the observation that K(q) only depends on H{q). As x is a metric, for K(q) to be small H(q) must fix a large (finite of course) number of vectors v. To obtain (6) suppose [i is some shift invariant measure on GG, hence supported on FQ. For q € FQ as TV —> oo we have \BN(H(q))\/\BN\ —> 1. Moreover for each TV this value is continuous and so its expected value relative to fi is weak* continuous. For any fixed D, {q\H(q)(id) = v} is a clopen set and hence its measure is weak* continuous in p.. As i; varies these sets form a countable partition of GG and so for any e > 0 there is an No and a neighborhood U of j.i so that for v e U also invariant and TV > No, letting h = h(q(f)),
I
ueBN(h)
For each N < No the calculation
ieBN(h)
is continuous and hence the supremum of these values for N < No is, continuous and so its integral is weak* continuous in fi. It follows that in some sub-neighborhood U' £ U we will have a variation of at most e in the value jK(q{f))dv. • The use of T here is not the usual calculation taken to construct Kakutani equivalence, but noting that for T to be small simply means the distance between two vectors is small relative to their lengths makes it clear that it is equivalent to earlier presentations. One finds without
2.3 Seven examples
39
mK
much effort that a ~ /? iff for a.e. x
Although the vocabulary of [9] is somewhat different it is shown there that this is equivalent to saying: Proposition 2.3.5. For every e > 0 there is a (/> e F and a subset A with fi(A) > I — 6 so that for all x,y £ A, x{a4)(x,y),tx(x,y))<£. For G = Z this implies that T^ and T& induce the same map on A and hence Ta and T^ are evenly Kakutani equivalent in the classical sense. In [9] the converse of this is proven, i.e. this is precisely even Kakutani equivalence in Z and a broad exploration of this equivalence relation in Z'' is made connecting it to Katok cross-sections of Rd actions. As mK is entropy-preserving we know the Bernoulli actions are mKfinitely determined and hence there exist mK -finitely determined actions. By Theorem 7.2.6 they are characterized by the condition of being weakly mK -finitely determined. Notice that for actions of Z this precise fact is proven in lemma 4.3 of [38]. Examples 4 & 5 (a equivalences). Once more take G = Z" and choose a vector a = {<xi,a2,...,a,,} of nonzero real numbers. Set A : K" —• T" to be A(vi,...,vn)
= (vi/ai,V2/<*2,...,vn/<xn)
mod Z"
and on T" use the natural metric p ( M 2 ) = ||e 2 *' v '-e 2 *' T ; 2 -'||. Notice that for p o A to be small means the two vectors differ approximately by a vector {«iai,...,n,-a,-} where the ;i, are integers. For q e 88 set k(q) = p(A(q(0)),A(0)). A is not a size kernel as it fails to satisfy (1) although it does satisfy both (2) and (3). To obtain a size kernel all we need do is add to A some other size kernel. We currently have two choices giving the two size kernels Dsfa)
=
d(H(q),id) + A(q)
K&(q)
=
K(q) + A(q).
and
40
Definitions and Examples
(Notice it makes no sense to use d(q, id) as adding on A(q) would add no further restriction to the already minimal equivalence class.) Both of these examples give interesting equivalence relations. The second has been well studied under the name of a-equivalence (see [8] and [48]). Because of the standard use of a to represent the parameter of this relation we will use /? to represent an arrangement throughout the discussion of these two examples. The first example, D&, has not been discussed in the literature so we present a brief discussion here. What we obtain is a refinement of simple orbit equivalence that splits the ergodic actions into a countable list of equivalence classes characterized spectrally. Remember a function / : X —> C is an eigenfunction of the ergodic action T with eigenvalue 1 if / is of norm one and
Fixing i 0 = ( l / a i , l / a 2 , . . . , l / a , , ) , those values (kuk2,...,kn) € Z" for is an eigenvalue for T form an additive which (k\/a\,k2/<X2,...,kn/an) subgroup we will call As(T). Two ergodic actions U and V of Z" are mDi equivalent iff A 5 ([/) = Aj(K). In particular we see that there are at most countably many mDi-equivalence classes. We will indicate the proof of parts of this characterization, leaving much to the reader. Proposition 2.3.6. If fix "~ /?2 then Aa(T^) = A 5 ( 7 % Proof Suppose X = {k\/a\,ki/a.2,...,kn/a.n) is an eigenvalue for the eigenfunction / of T^'. We now compute that for all v
= I f ° Tp' HJ
,,
„
B+/?(v^(x))/j(r|'1(x)0(rflw))
< \\p{A{q*'*(d)),A(6))h
Thus if /?i m~ fo, for all v we will have
- f
2.3 Seven examples
41
By the mean ergodic theorem there must be an / with
veBN
and by the above,
We know / must have constant norm and as long as it is not identically 0> / / I / I will be an eigenfunction for T"1 with eigenvalue I As all TM are conjugate to T^1 we can assume mDi(fS\,fi2) < 1/2, forcing / ^ 0.
•
Lemma 2.3.7. Suppose T^ is a free and ergodic action of Z" on (X, J*, /i) with As(T^) = {6}, and P : X -» Z is a finite partition. For each e > 0 there is a 5 > 0 so that for any other free and ergodic action Sy on (Y,(S, v) and partition Q : Y —> Z satisfying:
an>> value 5\ < 0 t/iere is a 0 in the full-group of Sy and a partition Q' : Y -» Z wit/i (a) m D »(y,0)<£ (b) v(QAQ') < £ and (l')
^
We once more leave a complete proof of this copying lemma to the reader. We do point out the ingredient used to obtain (a) beyond the construction of Lemma 2.2.3. For a fixed consider the group rotation on T" given by (xi,...,x n ) —> (xi + a[,...,xn + a,,) mod 1. This is not necessarily ergodic but all its ergodic components are conjugate to some group rotation we call (Rz,Z) where Z is a compact subgroup of T". To say A 5(T^) = {6} is equivalent to saying R& x T& is ergodic. Partition Z into sets of diameter less than e/2 by a partition H. Consider now H V P,Bjv-names arising from the action of R& x T^. The pointwise ergodic theorem tells us that if we fix a choice of h £ H and cylinder set C in the process (T$,P) then for N large the relative density of C just at indices of an H\/ P,BN-name whose H term is h will be very close to n(C). Fixing the value N, if {Sy,Q) satisfies (1) for a small enough 5 then this same fact will be true for H V g-names relative to v (even if R$ x Sy is
42
Definitions and Examples
not ergodic). The full-group element 4> will now be constructed on some Rokhlin tower of size BM, (M » N) in the action Sy by overlaying the Sy, Q names with a template R&, //-name and constructing
2.3 Seven examples
43
Here is what is known in two and higher dimensions. In [47] it is shown that any R" action can be represented as a special sort of "Markov" tiling suspension of an action of Z" where the tiles are rectangles whose length in dimension k is either only 1 or 1 + a* (we assume all a/< are irrational and > 0). We say two actions U and V of Z" are a related if they arise as such sections of a common W action. We say they are evenly 5-equivalent if the proportion of the space occupied by each of the tile shapes is the same for both representations. The argument in [8] extends to higher dimensions to show that if U and V are evenly a-related then they are mKi-equivalent. In two dimensions Sahin [48] shows that the converse is true, i.e. if two actions U and V are mKi-equivalent then they arise as Markov tiling sections of a common K" action. It remains open however whether any R" action for which U is such a section must also have V as such a section. Our last two examples exhibit another general context in which a restricted orbit equivalence relation can arise. Suppose that to each arrangment a we can assign a subgroup Fg of the full-group F with the equivariance property that FQ* = (^"'Fg^. In particular the choice of subgroup does not change when we perturb a by an element of its subgroup. What interests us are those /? reachable as limits of sequences of rearrangements a>, where
44
Definitions and Examples
#",_! are atomic with a fixed number of atoms k, and that each atom has a constant mass l//c,. We call such a filtration uniform. Next we assume that the 3Fi decrease to the trivial algebra. A filtration with this property is called exact. One natural way for such a filtration to arise is from an action of a group of the form G = Y^=i Z/r n Z. What matters here is that G is the increasing union of the finite groups Hi = Yl'n=\ Z/r n Z. If we have a measure-preserving and free action of this group and we set 2Fi to be the algebra of H\ invariant sets then we obtain a uniform reverse filtration. It is exact iff the action is ergodic. Conversely, given any uniform and exact reverse filtration, using the Rokhlin decomposition of each successive #",- over J ^ - i , we can place on the space an action of G for which the filtration is obtained as this list of invariant sub-algebras. The action of G here is not unique and this leads to a natural relation: we say two actions of G are Vershik related if conjugate versions of both of them can be placed on the same space, giving rise to the same reverse filtration. Notice in particular that the two actions will be orbitequivalent and what characterizes the particular orbit equivalence is that it preserves the orbits of all the subgroups //,. For the purpose of our discussion it will be useful to assume only that G is the increasing union of finite abelian groups //, without assuming that each is cyclic over its predecessor. Such a G is countable and amenable. Notice that for any increasing subsequence {_/,} we could define //, = Hjt and get another representation of G as an increasing union of finite subgroups. These distinct representations will give distinct values for the vector f = {\H{\, \H2/H\\,...} and so we can represent a choice for such an increasing subsequence of subgroups by its vector f. This is consistent with the usage when //,///,_! is cyclic of order r,-. We say two actions of G are Vershik r-related if they are Vershik related for the choice of subgroups H; determined by the values of f. We describe Vershik relatedness indexed by the choice of subsequence r a s a family of restricted orbit equivalences on G. Although one can use a size kernel here we follow [19] and give the size directly. Notice that for a fixed arrangement a and choice for f the full-group F contains closed subgroups F0' a consisting of those cj> which preserve the Tx orbits of all //,-. (This is equivalent to saying that either q*'^ or equivalently h^ permutes cosets of //, for all i and a.e. x.) If we have a sequence of rearrangements a>, converging to some fi where all the (pj £ rf^ then T a and T1* will have identical H, orbits for all i and hence be Vershik f related in this very strong sense. To define a size giving this relation, for a and r fixed, we first calculate the distance some
2.3 Seven examples
45
is from the subgroup FrA as its m\ distance: c?(a,4>)= inf n{x\4>{x) (This is not the definition of c> given in [19] but is equivalent by the Flattening Lemma proven there.) One can now define the family of sizes
Heicklen proves this to be a size but does not show it to be 3 + . We will not present the details showing that it is in fact 3 + . This can be done either by suitably expanding Heicklen's argument or by showing that the sizes mr arise from size kernels. Heicklen's conclusion is that two actions a ~ /J iff there is a ip G F so that Tai' and T'1 have identical Ht orbits for all i. As Ta and T0"" are conjugate (by \p of course) T01 and T^ are f related and if two actions are f related they can be realized as two such actions. The family of sizes exhibits two very interesting properties. The first is due to Vershik who proved a lacunary isomorphism theorem for such groups [58]: for any two actions U and V, if the /-,- are chosen to grow rapidly enough, then the two actions are f related. Notice that the allowed choices for the sequence //, are partially ordered under containment. As one goes further out in this partial order more and more actions become equivalent and Vershik's result says any two will become equivalent once one is far enough out in this net. The second very interesting property follows from this. Vershik has shown that for very slowly growing sequences, like r, — 2 for all /, the entropy of a G-action is an invariant of r-equivalence. On the other hand, Vershik's lacunary isomorphism theorem tells us that for some choices of r, entropy definitely is not an invariant. As we learn in Chapter 5 a restricted orbit equivalence either preserves entropy or generically in the ma topology an action has zero entropy. Vershik [59] conjectured and proved the sufficiency and Heicklen [19] proved the necessity of the following characterization of the boundary between these two regimes: The size mr is entropy-preserving iff
Example 7 (Entropy as a Size). We discuss this example only for actions of Z although the ideas extend to general countable amenable groups. The
46
Definitions and Examples
results described here are found in [46]. Before examining this example in detail consider the following observations. Two major goals of this current work are to demonstrate: (1) A size m is either entropy-preserving in that two equivalent actions have the same entropy, or entropy-free in that residually in each class actions have zero entropy. In the first case we say an action's m-entropy is its entropy and in the latter that its m-entropy is always zero. (2) Each size possesses a family of distinguished classes, characterized by their m-entropy, called the m-finitely determined classes. Any two m-finitely determined actions of the same m-entropy are mequivalent. Notice that this implies the possibility of two sizes m for which all actions are finitely determined, one that is entropy-free and one that is entropy-preserving. Dye's theorem, here done via the size m°, shows that there is an entropy-free size for which all actions are m-finitely determined. What the example we now discuss shows is that the other size also exists, relative to which two actions are equivalent iff they have the same entropy. The size at its base will simply be the entropy of the rearrangment itself. We make this precise as follows. Note g(«,0)(x) = a(x,(j)(x)) takes on countably many values and hence can be regarded as a countable partition g ^ j of X. Set Fg to be those <> / for which g ^ j is finite. It is not difficult to see that Fj-j is a subgroup and moreover FQV) = I/;~'FQ1/; as g(av,,v,-^VI)(v~'(A-)) = g(«,^)(x). It is shown in Theorem 4.0.2 that the Fg are all ml dense in F. For <> / e Fg one can use the entropy of the process h(Tx,g{a,
2.3 Seven examples
47
The form of the size makes it reasonable to believe and easy to prove that m''-equivalence will be entropy-preserving. A more subtle combinatorial argument leads to the reverse conclusion as well, that any two ergodic actions of equal entropy are in fact /n''-equivalent. A Non-Example. Hoffman and Rudolph [20] have presented an isomorphism theory for measure-preserving endomorphisms that can be viewed as an extension of the methods here but cannot be viewed as an application of restricted orbit equivalence as we develop it. This work grows naturally from the Vershik equivalence theory described in Example 6. Consider the standard example discussed there, {0,1,...,p— 1 }N with uniform Bernoulli product measure (1/p,..., l/p). In Example 6 this gives the standard filtration where all k, = p. Here we consider in addition the shift map, giving a p - 1 Bernoulli endomorphism. We call it "uniform" as all p inverse images of a point are equally likely. An endomorphism conjugate to this standard one would also have to be uniformly p— 1 in that almost every point must have p inverse images and all must be equally likely. This standard example has entropy logp so this also would be true of any endomorphism conjugage to it. What is shown in [20] is that for the class of uniformly p — 1 endomorphisms of entropy logp there is an isomorphism theory completely analogous to that of Ornstein for Bernoulli automorphisms and following the same outline as our work here. We will describe enough of [20] to indicate why the theory is parallel and why the results here simply do not apply. Let T acting on (X, 3F, \i) be an ergodic and uniformly p— 1 endomorphism of entropy log p. For a.e. x € X we can consider the set of all inverse images T~i(x), x > 0 organized as a p-ary tree rooted at x. The points in T~->(x) are at "level / ' of the tree and each x\ € T~'(x) is connected by one edge to T(x\) e T~j+l(x). We refer to a map from the nodes of such a tree to itself that preserves the edges as a "tree automorphism" and between two such trees as a "tree isomorphism". A conjugacy between two uniformly p — 1 endomorphisms will give tree isomorphisms between the trees of inverse images attached to matched points. For P a finite partition of X label each node of a tree of inverse images by the set in P containing it. Given two such labeled trees of inverse images one can ask how closely they can be matched by a tree isomorphism. To be more precise, each node at level j is given a mass of 2~J so that the set of nodes at each level has total mass 1. Relative to this weighting one seeks to match the nodes of two labeled trees by
48
Definitions and Examples
a tree isomorphism minimizing the proportion of nodes in the trees with mismatched labels. This minimum is called the t distance between the two labeled trees. This extends in a standard fashion to provide a f distance between measures on labeled trees and hence between uniformly p — 1 F-valued stationary stochastic processes. Notions of t finitely-determined and t very weakly Bernoulli follow directly and can be shown to be equivalent. Both these are true of the standard example and are conjugacy invariants. Finally, one can give a natural weak* Polish space of joining measures for this theory, called the "one-sided joinings", and show that for the t finitely determined endomorphisms those one-sided joinings which arise from conjugacies are a residual subset. This development follows the outline of our work here and is in fact much simpler both because it is just a single example of an equivalence relation and because it is a "zero entropy" theory. Example 6 of Vershik equivalence was brought under the restricted orbit equivalence umbrella by the choice of an action whose orbit structure mirrored the fibers of the filtration. Here one can also find such a group action. Construct the natural orbit relation setting x\ ~ xj if there are j \ > 0 and ji > 0 with T-'1 (xj) = T-' 2 ^)- The equivalence classes are organized naturally as a complete p-ary tree (each node has p + 1 edges attached to it, p going back in time and 1 going forward). These naturally organize as the orbits of a free group on p involutions where two points are connected by an edge in the tree iff they are interchanged by one of these p involutions. This is not an amenable group so our work here does not apply. This action also is not measure-preserving. It is in fact an amenable action of type IIIi/ p . As such it could be given as an orbit of a Z action but this would lose the tree structure essential to the conjugacy theory of the original endomorphisms. What is needed to make this nonexample an example is to lift the work here to non-singular amenable actions of groups that are not necessarily amenable. This non-example gives evidence for, and an approach to, such a generalization.
3 The Ornstein-Weiss Machinery
In this section, we describe the constructive tools we will need in order to continue with our work. From the beginnings of the Ornstein approach to constructive ergodic theory and in particular the Isomorphism Theorem it has been understood that there are three basic tools necessary to work constructively with dynamical systems: a version of the Rokhlin lemma; a version of the Ergodic theorem; and a version of the Shannon-McMillan theorem. It has also been understood for some time that a natural context in which all these results hold is that of locally compact and amenable groups. The results described here are lifted almost verbatim from the seminal work of Ornstein and Weiss on this subject [37]. We include them here to provide the reader with ready access to them and as we vary their statements slightly in places. Since we will consider only countable discrete amenable group actions, we do not need the most general form of their results. Thus, for clarity, we have stated these results in the context of discrete group actions. Furthermore, we have opted for a classical description of entropy, using finite partitions and name-counting techniques. Two notions of essential invariance of finite subsets F £ G are central to [37]. Definition 3.0.1. Let d > 0. Let K £ G be a finite set. A subset F £ G is called (6, K)invariant if
f(KK~lFAF) To say that a set F is sufficiently invariant means that there exists a 5 > 0 and a finite set K s G such that F is (S,K)-invariant. 49
50
The Ornstein-Weiss
Machinery
is sufficiently invariant is to say To say that a list of sets F\,F2,---,Fii that there exists a 8 > 0 and a finite set K £ G, such that, setting FQ = K, for each j G {1,2,...,k}, the set Fj is (d,Fj-\)-invariant.
We now describe our version of the Ornstein-Weiss quasi-tiling theorem. Definition 3.0.2. A finite list of sets H\,H2,.-.,Hk £ G, with id € H,, for all i, is said to \,2,...,k, £-quasi-tile a finite set F £ G if there exist "centers" Cjj, i = j = 1,2,...,/(/), and subsets Hjj £ H, such that:
(1) (2) the HJJCJJ £ F are disjoint;
(3)
and
( y y )
Theorem 3.0.3 ([37]). Given e > 0, there exists N = N(e) such that is any suffiin any countable discrete amenable group G, if H\,...,H^ ciently invariant list of sets, then for any £> £ G that is sufficiently invariant (depending on the choice of H\,...,H^), D can be e-quasi-tiled by H\,...,Hs-
This theorem is the essential content of Theorem 6, 1.2 [37]. Our definition of e-quasi-tiling is slightly different; weaker in that we do not ask that H,c,j n Hkcuj = 0, /' ^ k, and stronger in that we require HjjCjj £ F. Obtaining the latter from Theorem 6, 1.2 [37] is easy if D is sufficiently invariant and N is fixed. We have described this result, as the picture it gives makes much of [37] more accessible. We will not go further into the development of families of tilings, which are the essential tools of their proofs. Rather, we will move on to state their principle results. In particular, we will discuss their version of the Rokhlin and strong Rokhlin lemmas, which can be regarded as dynamical versions of the tiling theorem. Suppose (X,£8,n) is a standard probability space. Suppose I is a measure-preserving free action of G on X. For a finite set F £ G and measurable subset / l e ^ with /i(A) > 0, consider F x A £ G x X. As a measure on F x A, put the direct product c x \i of counting measure c and fi. Consider the map T : F x A —> X given by T(g, x) = Tg(x). On each level set g x A, T is 1-1 and measure-preserving. On any fiber set F x x, T is again 1-1. We definitely do not expect T to be 1-1 on F x A. It is clear, though, that T is non-singular and, at most, # F to 1.
The Ornstein-lVeiss Machinery
51
In particular, if T is y'-to-l at x e X, then
Rokhlin lemmas concern the degree to which maps T, as above, can be made 1-1. In particular, within a set F x A, one can look for large subsets S on which T is 1-1. We will ask that S be large in a rather strong sense. For S £ F x A, we get a counting function, defined on X, given by
Of course (cx/i)(S)= / cs{x)dii(x). JA
Set c(S) = min cs(x). Definition 3.0.4. We say that F x A maps an £-quasi-tower if there exists a measurable subset S S F x A such that: (1) T\s is 1-1; and (2) c ( S ) > ( l - e ) # F . The e-quasi-tower itself is T(F x A) £ X. Notice that we may always assume T(S) = T(F x A). Note that if there exists an S s F x A, such that T is 1-1 on S and (c x JI)(S) > (1 -e 2 )(c x /i)(F x /I), then there must exist an A' £ /I, with /((/I') > (1 — e)fi(A), such that cs(x) > (1 — e)#F, for all x e A'. Hence F x A' maps to an e-quasi-tower. We now state our version of the Ornstein-Weiss Rokhlin lemma. It is only a minor modification of Theorem 5, II.2 of [37]. Theorem 3.0.5 ([37]). Suppose G is a discrete amenable group. For any e > 0, there exist 5 > 0, K <= Q and N = N(s) such that for any sequence H\,..., HN of(5,K)-invariant subsets ofG, and any free measurepreserving G-action T = {Tg}geC> acting on (X,3S,^t), there exist sets A\,...,AN e & such that: (1) each Hi x A-, maps to an s-quasi-tower £%, in X; (2) for i ^= j , 3#i n 3kj = 0; and (3)
52
The Ornstein-Weiss Machinery
A collection of sets of the form {H,- x Aj}f=l satisfying (1), (2) and (3), we call an e-Rokhlin tower. As we indicated earlier, Ornstein and Weiss prove this for a slightly different notion of e-quasi-tower. We commented above that using e2 in their result gives the e in ours. Their statement differs slightly in another respect. They partition each Ai further into sets Ajj with T actually 1-1 on each H, x A,j. In fact, for any set A £ 3S, and finite set H £ G, A can be partitioned into a countable list of sets Aj with T 1-1 on each H x Aj simply because T acts freely. Hence this added structure is automatic. Theorem 3.0.6 ([37]). Given any finite partition P of X, one can select the sets Aj in Theorem 3.0.5 with
Proof This result is Theorem 6, II.2 of [37]. Again, we have stated a slightly strengthened version. The important observation from Theorem 6, II.2, is that setting "e"= e/10, one obtains N and {S,K) for Theorem 3.0.5. one is now at liberty to choose Then for ((5,X)-invariant sets H\,...,HN, a finite partition Q, which we set to be V T.-i(P). geytf, g Theorem 6, II.2 contains an extra parameter 5 with Aj ±5 Q. But as Ornstein and Weiss point out following the proof, by slightly shrinking the Ai, one obtains strict independence. • We now describe the entropy of an ergodic, measure-preserving Gaction. We use a name-counting approach, as described in [44]. Because we are considering only discrete amenable group actions, the entropy function we describe appears simpler than that described by Ornstein and Weiss in [37], where they consider names that take values in some compact metric space. Here we will count names that take values in some finite partition (or state space). In fact, though our approach appears to be different, the entropy function we describe is the same as that described in [37]. Definition 3.0.7. A finite partition of a set X is a map P from X to some finite symbol space. We will write the symbol space as Up which is simply
The Omstein-Weiss Machinery
53
the range of P. By the cardinality # P we simply mean the cardinality of Up. Later we will consider partitions where the nature of the state space is important, but for considerations of entropy it is simply their cardinality that will come into play. Nonetheless it is convenient to use symbol spaces as, for example, the partition P V Q is then easily described as the tensor product of the two partitions. We can topologize the E-valued partitions of a fixed space X with an L1 metric ||Pi,P 2 ||i = fi{x;P\(x) ^ P2(x)}. This is usually referred to as the partition metric. We can also topologize the partitions taking values in I with a distribution topology. This is actually a pseudo-metric topology. As we saw when we considered the distribution topology on rearrangements, this is a weak*-topology on measures. To any finite partition P labeled by S we can consider the map to names. P(x) = {P(Tg(x))}geG
e ZG.
As Z c is compact and metrizable the dual of the continuous functions is precisely the space of Borel measures. The shift action a of G acts on names in S G and P conjugates the action of T to that of a. Hence P*{[i) is a (T-invariant Borel probability measure. Hence we can pseudotopologize the "process" (that is to say, a G-action (X,tF,ji,T) and partition P (usually abbreviated to just (T,P)) with the weak*-topology on P*(n). We give an explicit metric giving this topology as follows. For a Folner sequence {F,-}, for each ;', consider the set of finite names Nj = {n : Fj —• Z}. For each i and n S N, define the cylinder set associated with such a name by C(n) = { « 6 l c : h(g) = n(g) for all g e F,-}. All such sets are clopen and so their characteristic functions are continuous. Furthermore, the finite linear combinations of such characteristic functions are uniformly dense in the space of continuous functions. Hence to say m; —> m weak* for Borel measures on {1,...,JV} G is equivalent to saying mi(C{n)) —» m(C(n)) for all names n e iV, and all i. For each i we can define a pseudometric on the Borel probability
54
The Ornstein-Weiss Machinery
measures on L c by
It is an easy calculation to show that this is a pseudometric and is uniformly bounded by 1. (There are natural reasons for the coeffiecient 1/2 which we will not go into here.) Notice that as F, is nested and increasing, the values ||mi,m2||' are non-decreasing in i. Define a metric on the probability measures on IP by \\m\,ni\ | Our earlier discussion makes it clear that this metric gives the weak*topology on Borel measures. By pulling it back to \\(T,P),(T',P')\\.
=
we get an explicit metric giving the distribution topology on processes. We now begin our description of entropy. Let P : X —» Z P be any finite partition of X. As earlier we lift P to a map P : X —> I p given by (P(x))(g) = P(Tgx), for all g e G. We call P(x) the T,P-name of x. For any finite set F s G, we may restrict P to F to get a map PF : X -> Z£. We say that P F (x) is the 7",P, F-name of x. Let {F,} be a nested Folner sequence with {F,} s G and id e F,, for all i. For each fixed F,-, there are at most # F # f | possible T , F , F,-names covering X. Let ^i G J/,(X). Let e > 0. Starting with the names of least ^-measure, remove as many as possible, in such a way that the /(-measure of the union of the remaining names is still greater than 1 — e. Let S(T,P;Fj,s) be the collection of remaining names. Let N(T,P ;F,,e) = #5(T,P;F,,e).
Notice that N(T,P;Fj,e) < §P*Fi. Define
Certainly /i(T,F;F,,e) < log2(#F). Define h{T,P) = lim liminf £—•0
I—>CO
h(T,P;Fhe).
The Ornstein-lVeiss Machinery
55
Lemma 3.0.8. For T ergodic, P a finite partition and for any e' with e' < e < 1, we have
limsup h(T,P;F,,s') < liminf h(T,P;F,-,e). Proo/" Let e > 0, with e < 1. Let e' < e and let /? < e'. In the following, for brevity, we use the notation {F,}, even after passing to a subsequence. Select a subsequence of {F,} such that lim/i(T, P;F,,e) = lim inf/i(T,P;F,,e) = /j(T,P;e). /—»OO
(—>00
We may assume that this subsequence is sufficiently invariant. By Theorem 3.0.3, this means that these Felner sets may be used for quasitiling. To be more specific, use Theorem 3.0.3 to select K so that if F\,...,FK is a sufficiently invariant sequence, then for any sufficiently F\,...,FKinvariant set F, F may be ^-quasi-tiled by Fix such sufficiently invariant F\,...,FK
so that for all / = 1,...,K,
Fix F, sufficiently invariant to be ^-quasi-tiled by F\,...,FKeach i = \,...,K, we have that
There must exist centers c,j, i = l,...,K, Fjj ^ F, such that: (1) | F , - j | > ( l - / ? ) # F / , for
Then for
j = 1,...,/(/), and subsets
j=l,...,/(0;
(2) the FJJCJJ c f are disjoint; and
(3) # ( U F , j C u ) > ( l - ^ ) # F . We want to count the "good" names across F. Specifically, we want to
estimate N{T,P;F,r\). There exists a set D £ X with /.t(D) > 1 — n/e such that for x 6 D, TcIJx) > (1 V
'J
We will consider names of points x in D. The number of such T,P,Fnames of points in D is bounded by the product of three terms, as follows: The first term is the number of ways the sets F|,...,Fj< could arise, which is bounded above by the number of subsets of size at most >7#F
56
The Ornstein-Weiss Machinery
in a set of size # F . Standard Stirling's formula estimates show that this term is bounded above by
where, as usual, H(a) = - a log2 a - (1 - a) Iog2(l - a). The second term is the product over all i,j of the number of possible good names across each FJJCJJ. This is bounded above by 2(''' r ' P;£)+ To )#f . Finally, the third term is the number of possible names outside the tiling, which is bounded above by #P"#F. Putting this all together, the number of T,P, F-names covering all but n of X is
N(T,P;F,n)<2r where l0g2
Jt F )
P + 1 + h(T, P;e)].
This holds for all sufficiently invariant F and thus
h(T,P;F,n) < H{n) + n\og2#P + ±+h{T,P;e). Letting n —> 0, since t\ < &', we have that
lim sup h(T,P; Fh e')
which completes the proof.
D
Note that the preceding argument shows that the definition of entropy is independent of the choice of Folner sequence. Define
h(T) = sap{h(T,P);P £0>). The Ornstein-Weiss version of a Shannon-McMillan type theorem, in our situation, reads as follows ([37], Section II.4, Theorem 5). Theorem 3.0.9. Suppose (X,&,(i, T) is an ergodic G-action (G a discrete amenable group). Suppose P is any finite partition of X. For any s > 0, ; / F £ G is sufficiently invariant, there exist T,P,F-names P\,Pi,...,Pk such that:
The Ornstein-Weiss Machinery
57
(2) k< (3) for all i, ji(P,) < It will be helpful to restate this in a slightly different form: Corollary 3.0.10. Suppose (X,39,fi, T) is an ergodic G'-action (G a discrete amenable group). Suppose P is any finite partition of X. For any E > 0, if F £ G is sufficiently invariant, there exist T,P,F-names P\,?2,...,Pk such that: (1) (2) for all i, n(Pi) = 2-
\imh(T,P;Fj,E). i—>oo
Proof Theorem 3.0.9 tells us that for all e > 0, there exists / such that for all i > I and 6 < 1, N(T,P;Fh5) lies within
so that h(T,P;F,,5) lies within h(T,P)±s.
•
Note that the corollary implies lim lim h(T,P;Fi,s)
= h(T,P).
£—>0 /—>OO
Notice that the entropy of a process h(T,P) is actually a function of the image measure P*(^) as it only depends on the measures of finite names. Thus we can write h(Ji) instead of h(T,P) where jl = P*(/<)For our subsequent work, we will need to know that not only entropy,
58
The Ornstein-Weiss Machinery
but conditional entropy, as a function of this measure is upper semicontinuous in the weak*-topology. This is a very well-known fact. In particular Weiss has shown that it is precisely the amenable group actions for which it is true. As the proof is not available elsewhere we include the counting argument that leads to it. As with the Shannon-McMillan theorem in this context, it is a direct consequence of the Ornstein-Weiss quasi-tiling theorem, through its corollaries, the mean ergodic theorem and the Shannon-McMillan theorem. Definition 3.0.12. For P : X —• HP and Q : X —> Eg two finite partitions of a free and ergodic G-action {X,8F,\i, T) we define the conditional entropy as follows. For any finite set F £ G, finite partition R and x e X let
be the cylinder containing x. Now let N(T,P\Q;F,e)(x) be the minimum number of elements of\JeF Tg-i(P V Q) it takes to cover all but a fraction e in measure of the cylinder C®(x). Note: If Q is the trivial partition this is N(T,P;F,E). Set ,e)(x) =
-^=log2(N(T,P\Q;F,e)(x)).
Lemma 3.0.13. The functions h(T,P\Q;F,s)
are all bounded by
lo g2 (#Z P x # I e ) . For {Fj} a Folner sequence and any e < 1, h{T,P\Q;Fu8){x)
-> h(T, PVQ)-
h(T, Q)
in distribution and hence in L\. Proof From Corollary 3.0.10 for any £ > e > 0, if F, is sufficiently invariant, then for all but e2 in measure of the x € X,
and We call these the "good" Q and P V Q atoms, respectively.
The Ornstein-Weiss Machinery
59
Hence for all but 2e of the x S X, C® is a good Q atom and is all but a fraction I in measure covered by good F V g atoms. Thus any subset of all but a fraction e of C{?(x) must contain at least 2 (/i(7-,Pve)-/i(7',e)-2c)#F^ 1
_ g _ g)
elements of VgSF, V ' ( ^ v 2) ( l ^ e " S 0 0 ^ " atoms still in the set). Furthermore, as e < e, all but a fraction e is indeed covered by fewer than 2(/i(r,pve)-/i(r,e)+2e)#F,
atoms of Vggf, Tg-i(P V Q) (the good atoms). Hence once F,- is sufficiently invariant, for all but e of the x S X, h(T,PvQ)-h(T,Q)-2e
h(T,PVQ)-h(T,Q).
Theorem 3.0.15. For any e > 0 and bound B there are values <5 > 0 and are a 5,K-invariant N and a finite set K € G so that if H\,H2,-..HN sequence of finite sets in G and T is a free and ergodic G-action on the probability space (X,^,fi) and P and Q are finite partitions of X with both # Z P and #Zg < B satisfying \\h(T,P\Q;HhS)h<E for all i = l,...,N
then h(T,P\Q)<E+e.
Proof Fix a value ei, which we will set later. Choose 5 > 0, K c G and N so that for any 5,K-invariant sequence H\,...,HN in any free and ergodic action (X,^,n, T) and finite partitions P and Q we can constuct
60
TTie Ornstein-Weiss Machinery
an ei-Rokhlin tower (Theorem 3.0.6) {//, x A{) with i4,lVgeH(Tr,(Pvg). Also require that for all i, 1 forcing
For each i = 1,..., N let G, consist of those x for which C,pv^(x) is one of the N(T,P\Q;Hj,d) atoms we know can cover all but a fraction S of C,e(x). As G, is V geH , Tg-i {P V Q)-measurable,
Define a function =
flo g 2 (iV(T,P|e;//,^))(x)
if x G ^ n C i and
10
otherwise.
Notice that N
/ Ndti=Y^n(Ai)
/ \og2(N(T,P\Q;Hi,d))d^i
The mean ergodic theorem (theorem 3, II.1 of [37]) now tells us that for all F sufficiently invariant, for all but ei of the x e X the following three conditions hold. First,
* ' geF
from the above calculation. Applying it to two characteristic functions we also ask that second, #{g G F : Tg(x) is in the tower image } > (1 — 2ei)#i ? and third, F : Tg(x) G U,/l,} < //(u,-/l,)(l + e , ) # F < e,
The Ornstein-Weiss Machinery
61
Call the set of points in x that satisfy all three of the above conditions D(F). We also ask that F be sufficiently invariant that for all i=
\,...,N,
e.
This now implies that for all but Js\ of the x e X,
H(Cf{x) n D(F)) > n(C?(x)){\ We now estimate the number of atoms of VgeF Tg-i(P V Q) it takes to cover C$(x) fl D(F). For x' G C®(x) n D(F) consider the set of values g € f such that: (1) Tg(x) e UiAj n G, (i.e. Tg(x) is the center of a "tile"); and (2) Hjg £ F for all i (i.e. the entire tile centered at g lies in F). The number of ways to select the set of values g to be such centers is
Having split according to where the centers lie, we now further subdivide according to how each center is assigned to one of the potential tiles Hj, that is to say according to which At n G, actually contains the center g. The number of ways to do this is bounded by
Fix a collection of choices for centers, and for the tile //, each is the center of. Let them be c\,i, C2,i, • • •, Cf(i),i € At n
Gj,
and let x' G Cf n D(F) be a point for which these list the actual g's satisfying (1) and (2) above. The set U,J//,C,J covers all but a fraction 3ei of F as it contains all g G F with Tg(x') in the tower image, except for those too near the boundary of F to put the full slice into F. We also know that if g G F and Tg(x') G At n G, and Htg £ F then
62
The Omstein-Weiss Machinery
g = cij for some j . Hence
= — log2 (n,j(#of choices for the P V Q name on
HJCJJ))
> — log2 (#of choices for the P V g name on U,j //,-c,j). The number of possible PVQ names that might occur on the remainder of F outside of U/J//,C,J is certainly at most
Combining these estimates we conclude that for all but JE\ of the x G X, -^-
\og2(N(T,P\Q;F, + 6e, log2(B) + £(l + £ , ) + e,.
Standard Stirling's formula estimates tell us that if «i is small enough then Iog 2 (/C)/#F < e/3. If ti is small enough, depending only on N, B and E, all the other error terms in this estimate can be forced to contribute less than e/3 as well. We conclude that once F is sufficiently invariant, then for all but Je\ of the x e X,
The previous Lemma (3.0.13) now guarantees the conclusion.
•
Definition 3.0.16. Notice that as with entropy the calculation of conditional entropy depends only on the measure Ji = (P~VQ)'(ii) on (Up x T.Q)G. AS a function of such a-invariant and ergodic measures m we write this conditional entropy as
The Ornstein-Weiss Machinery
63
Theorem 3.0.17. For any two finite symbol spaces Up and Eg the function /I(V,S/>|SQ) on the a-invariant and ergodic measures on (Ep X Z Q ) G is upper semi-continuous.
Proof To begin, as we can always take the direct product of a non-free ergodic G-action with a Bernoulli action, getting a free and ergodic Gaction, Theorem 3.0.15 holds for any ergodic action, whether free or not. Next, fix a a-invariant and ergodic measure v. Suppose that h(v,P\Q) = E and by Lemma 3.0.13 we know that for any £ < 1, for any Felner sequence F,-,
in L'(v). Let e > 0 be chosen, and let B bound the cardinalities of both P and Q. Using e/3 as "e" in Theorem 3.0.3 we obtain a 5, K and N. Among the terms of the Folner sequence {F,} we then will be able to pick a list H[,H2,.--,HN that forms a 8,K-invariant sequence and for which
h(v,P\Q;HhS)<E
+ e/3.
There are only finitely many clopen sets C^ v ^ and so once v' is sufficiently close to v weak* we will have h(v,P\Q;Hh5)<E Theorem 3.0.15 now implies that if v' is this close weak* to v, then
h(v,P\Q)<E+e. This completes the result.
D
We can apply this result to rearrangements with a simple observation. Suppose (X, 3F, \x, Ta) is a free and ergodic action of G, and P and Q are two finite partitions of X. For any arrangement /? we will have a corresponding map to names (PVQ)P(X) and hence a measure
= {P(T/(x)),e(T/(x))}geG
64
The Ornstein-Weiss Machinery
It is little more than an observation that the map fi —> £(/?, P V Q) is continuous as a map from the Hvlli -topology to the weak*-topology. With a bit more thought one is easily convinced that it is uniformly continuous in the metrics we have established. (This latter remark will not be critical for us.) Corollary 3.0.18. For P and Q two finite partitions of the free and ergodic G-action (X,!F,n,T), the conditional entropy h(Ta,P\Q) is upper semicontinuous as a function of a in the \\;-\\\-topology. That is to say ifa.j —• a in Ll then
4 Copying Lemmas
Copying lemmas play a pivotal role in Dye's Theorem [10], Vershik's Lacunary Isomorphism Theorem [58] and most significantly in Ornstein's Isomorphism Theorem for Bernoulli shifts [32]. The Burton-Rothstein version of this last result puts them in an even more central role as the rest of the argument becomes soft analysis. We are generalizing from this Burton-Rothstein perspective of course, making the core of our equivalence theorem rest on category. Copying lemmas will play a pivotal role for us in two contexts. First in the equivalence theorem they will play the same role as always, allowing one to copy partitions, and in our case full-group elements, from a joining of two systems into one of the two. We will also use a copying lemma as a basic tool in our development of m-entropy to show that sizes are either "entropypreserving" or "entropy-free". These two applications have one fact in common. As indicated above one must copy not only partitions but full-group elements. Hence we will have to investigate how one does this. These copying lemmas will also have one very real difference. For use in the equivalence theorem one will want the copied process to have as much entropy as one can hope for, but for use in the entropy theory, one will want the copy to kill as much entropy as possible. Hence we really must give two copying lemmas. They will be parallel in structure, but in assigning names in one case we will try to make the assignment as close to 1-1 as possible, and in the second, as far from 1-1 as possible. We have tried to make our copying lemmas "modern", avoiding the unnecessary use of Hall's Marriage lemma and making no minimum entropy-bound assumption on the image system. We do more here than is essential for our work, hoping to provide a tool-box of copying lemmas. The basic technical lemmas are Theorems 4.0.5 and 4.0.13. We establish 65
66
Copying Lemmas
various corollaries of these that state explicitly the precise technical versions we will use. Let R = {Hi x Aj}y=i be an e-Rokhlin tower for T (see Theorems 3.0.5 and 3.0.6). For x G A\ we set S(x) = {x' G T(Hj x {x}) : x has a unique preimage in /f, x {A,}}. These are the slices through the tower and are the atoms of a measurable partition of the tower image, given by {x' G T(L){Hj x Aj)) : x' has a unique preimage in U (H, x At)}. i
i
We set g(x) = {g € H, : Tg(x) G S(x)}, the slice viewed as a subset of //, ^ G. Suppose Z is some finite labeling set and n, : A\ —* £Hf assigns to each x G A, a L,//,-name, that is to say, an element in HHi. We want to describe the notion of painting the names «, onto the tower {H, x ^4,}^,. For any x' G S(x) and x G A, there exists g G H, such that x' = Tg(x). Set P(x') = n,(x)(g), thus defining a map P : tower image —• £. To complete its definition we must extend P outside the tower image. We will discribe how to do this, depending on the circumstances. If no explicit description is given, then any extension will do. As described twice earlier, first in Section 2.1 when discussing rearrangements, and later in Chapter 3 when developing entropy, for a choice of symbol space S, finite or countably infinite, we can place two measures of closeness (one a metric, the other a pseudometric) on I-valued partitions. These generalize what we have already described as the L' and distribution topologies on rearrangements and on finite partitions. Our description here is parallel to that given earlier. If PUP2 : X - > I then ll^i, Pi Hi = ^5Z//(Pr l (no)AP 2 - | (« o )) = /i({* : P\M £ P2(x)}) = f
d(Pi(x),P2(x))dn
where d is the discrete 0,1 valued metric on E. For P : X —> £ any
Copying Lemmas
67
partition, we can define a map P : x —> Z G given by P(x) = {P(T g (x))} geG . This is just the map taking x to its name. If £ is finite, then ZG will be compact in the product topology. If £ is countably infinite, adjoining a single point at infinity, it becomes compact, and again we can regard I G as compact. Hence the space of probability measures on EG is a compact and convex metric space in the weak*-topology. The distribution pseudotopology on Z-valued processes is this weak*topology on the Borel measures P'in)- It will be useful to have in mind a way to verify that two processes are in fact close in distribution. We lift to this context the distribution metric described earlier for both rearrangements and finite partitions. For F any finite subset of G, the map P F : x —» {P(T g (x))} g6 f maps X —> L F which is an at most countable set. Thus P^(/() can be regarded as simply a vector of masses on E F , and one can calculate the £\ distance between these vectors, \(TUPI),(T2,P2)\F
= ||P,>F(/ii),P2,Fte>)lli-
For Ft some increasing sequence of finite sets, exhausting G (for example a Felner sequence), we can define
_vl ;=i
Notice that as Ft £ Fi+l, |(r l ,P,),(T 2 ,P 2 )| F j .
<\(T{,P0,(T
Thus ||(r,,P,),(T 2 ,P 2 )||. < |(T,,P,),r 2 ,P 2 )| F , + l/2''
for all i.
We now proceed to demonstrate our copying lemmas. It will be useful to restrict our attention to finite partitions. If P is a finite partition, let Z/> be its range, that is to say, its labeling set. For a rearrangement (a, (j>) we have already associated a countable partition
Notice that /"•* is precisely the Ta, g ^ - n a m e of the point x. We say a
68
Copying Lemmas
rearrangement is bounded if the partition g(a^) is finite, which is equivalent to saying the values a(x, >(x)) lie in some finite subset of G.
Definition 4.0.1. Given a tower {//,- x i , ] ^ , we say a rearrangement (a, $) respects the tower if for any x G A\,x' G S(x), (p(x') is also in S(x), and for any other x1, outside the tower image, 4>{x') = x'.
A rearrangement (a, (j>) that respects a tower {//, x Aj}^=i will be bounded, and hence g ^ j will be a finite partition. Furthermore any 4> which respects a tower will be of finite order as its cycles are at most slices through a tower.
Theorem 4.0.2. For any rearrangement (a, 0) and s > 0, there exists K Q G so that ifH\,...,HN are (^K)-invariant and if{HjXAj}f=i is an ^-Rochlin tower for T*, then there exists (j)' G FG(&) so that (1) (<x,(j>') respects the tower {//, x / l ; } ^ , and ( 2 ) llg(«^),g(a,^)Hl < £In particular the rearrangement (a, (j)') is bounded and
Proof Choose K finite so that /x({x : g(«,0)(x) e K}) > 1 — | . Suppose H\,...,HN are (f,K)-invariant and {H-, x ^ } * , is an |-Rokhlin tower for r \ We identify a set of good points C in each slice of the tower. For x € Ah x' 6 S(x), we say that x' G C if also 0(x') G S(x). Notice that for x G Ai,
S(x) \ C £ {x : either (a) a>(x',4>(x'))(tKor (b) a(x',<£(x')) G K,x' = Tga(x) for some g G //,-, but(x(x',0(x'))g^H ; , or (c) 0(x') e T(H, x A{) but (/>(x') doesn't have a unique preimage in Hi x A{}.
Copying Lemmas
69
Hence we compute that
>J
U S{x))-J
>(\--\-
U S(x)\c)
n{x' : (a) holds} - n{x' : (b) holds} - H{x' : (c) holds}
~ I) ~ \ ~ & { ^
T
?(T(Hi x A,))AT(H, x A,)}
i
-n{(j){x)
: (c) holds}
>l-e. For x' S C define (j)'(x') = >(x'). Then for each slice S(x) we have defined $' : C D S(x) -» S(x). Furthermore #{x' G S(x) : >' is not yet defined at x'} = #{x' G S(x) : x is not yet in the range of <£'}. Next, extend cf>' to uT(H,- x A ) \ C so that for each x, 4>' is a 1-1 map from {x' G S(x) : ^>' not yet defined at x'} onto {x' G S(x) : x' not yet in range of >'}. This can be done measurably, since these two sets, as functions of x, were measurably chosen. Finally, outside of U^[T(H, x /!,), define >' = identity. This constructs
The following result will fill the gap for us between copying partitions and copying full-group elements. Theorem 4.0.3. Suppose (a, (j)) is a rearrangement of the free and ergodic G-action (X,!F,\i, Ta). Given any e > 0 there exists 5 > 0 (depending on (<x,(j>) as well) so that if P' : X[ —> G is a partition of the free and ergodic G-action {X\,^\,n\,T^) satisfying:
\\{Ta,gM)),{T?,P')\\.
70
Copying Lemmas
then there exists ft in the full-group of Tj*1 with \\P',g(au
||(a,<£),(«,,<£')II. < £ as this will be implied if the "e" and " 5 " in Theorem 4.0.3 are small enough. Proof As P' maps X\ -» G, we can construct an associated element /*, e GG by
The only issue is that H(fXl) might not be a bijection of G a.e. If it were then ft{xi) = T^p,,x Ax\) would already be the full-group element we are after. Also note that if we construct ft with
then we will have ll^',g(a,^,)lll <£• To begin, select K ^ G so that
X = {x:a(x,
> 1 - e/4.
Select J C i S G with id e /Ci so that for any g ^ K{, id ^ Xg/C (i.e. Now choose 5 > 0 so small that if
then: (i) Setting Ax = {xx e X\ : P'(xi) e K}, we will still have
and: (ii) Setting G = {xi e X\ : for some xeX, have jUi(G) > 1 —e/4.
/"^|K,
= /. Vl k,} we will
Copying Lemmas
71
Set G] = Ai n G and /.i\(G\) > 1 — e/2. We claim that the map
is 1-1 on G|. To see this, suppose for x\ and x', = Tj^(xi) e G\ we have
This translates directly to the identity *i = rr,H(/,,) which is to say
As both x\ and x\ are in G\, both /.v,(id) and /.Vl(g) are in K and we conclude that g must lie in K\. But now since for some x € X we have fx
\K,
=/X,I/CI.
we get that
But for all x, h^ is a bijection of G, and so g = id. On the set G\ set >'(xi) = 77^, (V|)(xi). This is a 1-1 map, as we just saw, and consists piecewise of elements of T,0". Hence it must be measure preserving. This means that the two sets C\ and 4>'{G\)C have the same measure, which is in fact less than E/2. It is a standard fact, and also a rather simple exercise using the Ornstein-Weiss Rokhlin lemma and mean ergodic theorem, that in a free and ergodic system like the Tj*1 system there will always be full-group elements taking any one set to any other, as long as the two sets have the same measure. This means that 4>' can be extended to a full-group element that takes G\ to <j>'(G\)c, finishing the result. • To any e-Rokhlin tower R = {H, x A}/Ii for T, we can associate a tower partition UR by
for all other x. Lemma 4.0.4. For any e > 0 and N, there exists 6 < j such that if R = {Hi x /4,-}£L, is a 6-Rokhlin tower with #//, > | , for all i, then H(UR) < e.
72
Copying Lemmas
Proof
First notice that, for all i,
Each A, is partitioned by a choice for S(x) into at most ( | ^ . ) sets. Thus
< -25NH2S) + jLj J2 ±
( #|)
if 5 is chosen small enough.
•
We now prove the first basic copying lemma. The proof follows fairly standard lines, but a number of parts in the statement are a bit different from the usual. We have tried to tie together in this one result all of the copying type arguments we will need. In the corollaries following the proof we will state the special cases we will use. Theorem 4.0.5. Suppose (X,^,n,T) is a measure-preserving ergodic Gaction and P : X —>T,p and Q : X —» Eg are two finite partitions. For any e > 0 there is a 5 > 0 satsifying the following. Suppose (X\,2?\,\i\, T\) is another measure preserving and ergodic G-action, and Q\ : X —> "LQ is a partition satisfying
(1) UTuQMT.Qn.KS. (1')
Then there exists a partition P' : X\ —> EP satisfying: \\{T,PVQ),(TuP'VQi)\\.<e;and
(2') Proof Fix e. Let i0 = [Iog2(2/e)] + 1 so that if
\(T,PVQ),{TuP'VQi)\Flo<e/2 we will have obtained (1'). Let a = min{ju(/) : /*(/) > 0 and / is an atom of
V Tg-,(P V Q)}.
Let no be the number of elements in V Tg-i(P V Q). Let
= (JL\
a
Choose K s G with F, o £ K, choose 5 and N so that, applying Theorem 3.0.5, for any (<>,K)-invariant sequence H\,...,HN, we can
Copying Lemmas
73
construct an e-Rokhlin tower, R = {//,• x /!,}£!, in any free ergodic G-action (in particular, {X\,3F\,\i\,T\) once it is chosen). Fixing N, we will place further requirements on the sets H\,...,HN, as follows: (1) for any £-Rokhlin tower R = {H, x -4,-}^,, by Lemma 4.0.4,
H(UR)<±; (2) for any atom / € V Tg-i (P V Q), by the mean ergodic theorem,
(3) letting Q(P)
=
{/' e V Tg^(P) : n(f) = 2-{ht-T-p)±&*H>} and
= {/"e V rg-,(e):M/") = gen
the //, are sufficiently invariant that the Shannon-McMillan theorem (Corollary 3.0.10) gives /<(C,(P) n C,(g)) > 1 - e2; and (4) the Hi are so invariant that ^_2 Set D,- = | / e \f Tg-, (PVQ):
/e
for some x € / and all
v
By (2), above, we know that /x( U / ) > 1 — g2. Define
£.- = /" e C,(2) : Mf n (.U /)) > (1 - %(/") and /6D,
( u J €C,(r)
One calculates that //( /;U /") > 1 - 3s.
74
Copying Lemmas For / " e Et, let
»/(/") = { / ' e C , ( P ) : / = / " n / ' e D,-}. We know that for each / " G £,,
We want to assign to each / " G £, an atom / ' G >?(/")• This assignment need not be 1-1, but we want to control how close it is to being 1-1. Choose a model non-atomic measure space (M,,v) of total mass
Partition M, into pieces labeled by sets / ' n /", / " G £,, / ' G ;?(/"), each of mass /((/' n / " ) - In this model, a set / ' is just a union of all sets labeled / ' n / " , and similarly for / " . Choose an integer N, with
log2(JV,) = \h(T,Q)-m\n(h(T,P),h(T,Q))
+j ± ^
(Note that this requires #//, > 10/e.) Refine each atom / ' n / " in the model into N, sets of equal mass, labeled {1,...,N,}. Let f'j be the union of pieces whose first label is / ' and third label is j G {1,...,N,},
f'j=fUE(f'nf"nj). We calculate that v(fj) < 2-WT''')±™)#i/'Nrl
for all / " G Q(Q). e Making sure that
TX#W;
1U
(W > log ( - j we see that v{f'j) < e v{f"). \
e
/
Select a 1-1 map p:Ei^{f'J:j
= l,2,...,Nl,
f G Q(P)}
with the condition that p(f")C\f" j= 0 and such that v(Domain(p)) is of maximal size. Let £/,- = Domain(p). We claim that v([/,•) > 1 — 4e. If not, then there is a collection
Copying Lemmas
75
Uf of atoms / " G £,-, not in the domain of p, with v(t/, c ) > I. Let V = {/j G Range(p)}. Since p is 1-1 and each v(p(/")) < lm{f"), we see that v(F) < e. Hence there must exist some / " <£ [/,- and / j £ K such that / " nf'j £0 and p could be extended to / " by setting p{f") = f'y This contradicts the maximality of I/, completing the claim. If p(f") = f. where f'} = (U, (/' n / " n j), then define
the T, £/>,//,-name of/'. We are now ready to select 5. For any free and ergodic G-action (Xu^unuTO and partition Q, : X, -» Z e , for any / " G V Tg-,((2), there will be an atom / " G V Tljg-i(Qi) with the same ZQ,//,-name. g£W,
Choose 5 so small that \\{T,Q),(TUQ\)\\. < $ will imply that for all //,-, and all f" G V Te-,(0) with u(f") > 0 we will have
Suppose (X\,SF\,\L\, T\) is a free and ergodic G-action and Q\ a partition of A"i satisfying (1) of Theorem 4.0.5. Construct an £-Rokhlin tower {H, x Ai}?Ll in X\ such that
Partition each Aj according to the atoms / " G V T\,g-i(Qi), and assign to each /4,-n/" the name q(/")- A fraction of at most e of each A, may lie in atoms f'{ whose //,-, F-names do not occur in the (T, Q) process. Assign to them any P-name you wish, still calling the assignment q{f"). Paint these names on the towers as described earlier. This gives a partition P' : X\ —> Zp of the tower image. Extend this partition outside the tower image, in any measurable fashion. Further refine P' by also assigning to the points in a slice S(x\), x\ G /", the index j if p(/") = / j . Call this refinement P : X , ^ Z P x { l , . . . , sup (N,)}. 1<(
To verify conclusion (1') consider the set Z = {x', : for all g G F,o, T,)g(x',) e S(x,), for x, G At with xi G /',' where / " G (£,- n £/,•)}.
76
Copying Lemmas
That is.
z
= U
LJ(n T "- l(S(x)))
As Z has measure within e of the corresponding set in
For any atom/G V T lg -i(i"VQi), let / e V Tg-,(F Vg) be the atom geF
'o
'
geF,0
of the same name. For x\ e Z we then have that x', G / iff x\ = Ti g (xi), for some g G H-,,x\ G /'/, such that the name of (/",(/")) G ( I e x I P ) H ', in (Sg x Ep)H| restricted to the indices (F,og) is the T,EQ X E/>,F,0-name of/. For x, G /'/, / " G Ui and / G V Tg^(P V Q\) we compute that #{x', £ / n S ( x , ) : Tlg(x') G S(x,) for all g G F,o} =
#HMf)±s)
± #{g 0 G W,- : Ti|Wi.Xi4i. is not 1-1 at some (ggo,x,),g G F,o} ± #{go G Hj : gg0 £ Hi for some g G Fio}
Since ^,(Z) > 1 - 5e > 1 - ^ ( / ) , for all / G V
T
g-'(P
v
60,
l/'(/) — (*i(f)\ ^ -6e^(/)> which is more than enough for (1'). To obtain the entropy estimate (2'), we calculate a lower estimate for h(T\,P) and an upper estimate for
h(TuP\P') = h(TuPV P') - h{TuP'). Let P be the partition of X\ defined by p(jCi)=fp(f) \*
ifxie/l/n/.Jel/,otherwise.
Now for x'i G A,i n / i , / G Uj, the T],P,Hi-name of x\ can differ from only on Ht\g(x\). Thus an atom of V ^i,e-'(^)U, contains at most #P £ # H / atoms of P. Hence we see that N
H(P\ v r u -,(n f i vp)) < Y,MAi)WHi\og2(#P) < i=l
since £JI, fH^Ai)
< 1 + e and log2(#F) < log(n0) < «o-
,
Copying Lemmas
11
Now Q\ is measurable with respect to the span of
\fTUr,(P),
VT1>g-,(nR) and \J Tlig-,(Q),
geG
geG
geG
where f2i(*i)
if x\ is outside the tower image
4
.. . . .
\* if xi is in the tower image. Since h{TuTlR) < -^ and h(TuQ) < Hbnitower image)) + (1 -^,(tower image))H(g) < ^ , we see that h(TuP) > MTuGi) - fi> a n d s 0 h(TuP)>h(TuQ)-j. To obtain an upper estimate of h(T\,P\P'), let / be the partition defined by / ( if x , e / i ' n / ! , , / " € ( / , • , p(/") = / | [_* otherwise. Obviously P is V T lg -i(P' V/)-measurable, and so geG
,,P|P') = h(TuP)-h(T,P')
< h(TuI) <
But
N <~ i=i
<~[KT,Q)-min(h(T,Q),KT,P)]. Thus P') = h(Ti,P yP')-h{TuP\P') >h(TuP)-h{TuP\P') j-j-h(T,Q)
+ min{h(T,Q),h{T,P))
78
Copying Lemmas
We now state a version of Theorem 4.0.5 with an entropy bound on the process (T\,Q\). Corollary 4.0.6. Suppose (X,^,fi,T) is a measure-preserving ergodic Gaction and P : X —> Y.P and Q : X —• Eg are two finite partitions. For any T\) is £ > 0 there is a 5 > 0 satsifying the following. Suppose (X\,^\,fi\, another measure preserving and ergodic G-action, and Q\ : X —• Eg is a partition satisfying (1) (2)
UTi,Qi),(T,Q)\\.<5and h(Tl3Qi)>h{T,Q)-*.
Then there exists a partition P' : X\ —> £p satisfying: (1') | | ( 7 \ / > v e ) , ( r , > P ' V Q 1 ) | | . < e ; a n < *
(2') h(TuP')>mm{h(T,P),h(T,Q)}-e. Proof Use e/2 in Theorem 4.0.5, ensure that 5 < e/2 and (2') of Theorem 4.0.5 easily gives (2') of the corollary. • We now state two particular cases of this corollary of interest to us. Corollary 4.0.7. Suppose (X,^,n,T) is a free and ergodic G-action, P : X —> Zp and Q : X —• Eg are two finite partitions. Suppose ffi £ J^ is a T-invariant sub-a-algebra on which T still acts freely, and Q is !/Cmeasurable. For any e > 0 there exists a partition P' : X —> £p that is 34?-measurable satisfying: (1) ||(7\Pve),(r,P've)||.<e;flnd
(2)
h(T,P')>min(h(T,P),h(T,Q))-e.
Proof Just let (T\,Q\) be ( T , g ) restricted to the factor action on J f in Corollary 4.0.6. D Corollary 4.0.8. Suppose (X,!F,n,T) is a free and ergodic G-action and P : X —• Up and Q : X —> HQ are two finite partitions and e > 0. Let (Y,(S,v,S) be a Bernoulli action with h(S) + h{T,Q) > h{T,P)-s/3. There exists a 5Q (which we assume to be < e/3) satisfying the following. Suppose (X\,2F \,\i\, T\) is any other free and ergodic G-action, and Q\ : X\ —» ZQ is a finite partition with (2)
h(TuQi)>h(T,Q)-e/6.
Copying Lemmas
79
Then there exists a partition P' : X\ x Y —> Sp so that in the product action (X\ x Y ,fF x cS^i\ x v, Ti x S) we conclude (3) 11(7, xS^VP'MT.gvP)!. <6flnd (4) Proof Let B : 7 —• Zfi be a finite and independent generator for the Bernoulli process. Consider the free and ergodic G-action T x S with respect to product measure ju x v. Consider on it the pair of partitions T :X xY ^I.P, ~P(x,y) = P(x), and Q : X x Y -»• E e x I B , Notice that for any free and ergodic G-action T) with partition Q\ : X] —> Eg we have
Thus applying Corollary 4.0.6 to the G-action T x S with partitions P and g, and e/3, there exists 0 < (50 < e/3 so that if
then there will exist a partition P' : X\ x Y -*l,p with ||(T, x S , e , V B V P ' ) , ( T x S , e v P ) | | . < £ . This distribution distance though is at least as large as ||(T, x S , Q i V P ' ) , ( 7 , Q V P ) | | . which is (3). We also have from Corollary 4.0.6 that h(T] xS,P')>min{h(T,P),h(Ti xS,Q] VB)}-e/3. As h{T{ x S,Qi V B) > h{T x S,Q\J B) - 5 > h(T,P) - 2e/3, this latter minimum is at least h(T,P) — s giving us (4) and the result. D The following Corollary to Theorem 4.0.5 states that if the partition P had arisen from a rearrangement then the new partition P ' could also be chosen to have arisen from a rearrangement. Corollary 4.0.9. Let (a, (p) be any G-rearrangement. Suppose Q : X —> SQ
80
Copying Lemmas
is a finite partition. For any e > 0, there exists d > 0 so that the following holds. Suppose (X\,2F\,[i\, T*1) is a free and ergodic G-action, and Q\ : X\ -» Eg is a finite partition satisfying
Then there exists (f>' in the full-group of T*1 such that
g(«,) is a bounded rearrangement. Hence gM) : X -» G given by g(a>^(x) = a(x, 4>(x)) is a finite partition. Let S^ £ G denote its set of labels. Let £ > 0. Notice that if e' < —— where i > Iog2(2/e) then for any 2#r partitions R and K' with ||/?,/?'||i < s' we will have \\R,R'\\. < e/2. Apply Theorem 4.0.3 to obtain a <5i < e' so that if \\(T«,gM)),(T?\P')\U
<<5i
then there will be a (/>' in the full-group of T*1 with
Apply Theorem 4.0.5 to see that there exists a <5 < e/2 so that for any free and ergodic (X\,^\,n\, T"1) and finite partition Q\ : X\ —> ZQ satisfying
then there will be a partition P' : Xi —> Z^ s G such that
ll(ng(«,«vQ),(r1a'>p'v<2i)||.<5,. Theorem 4.0.3 now tells us there is a / in the full-group of T"1 with
and hence ||(T 1 a ',P'vQ,).(r 1 a i s g( a i ,*')Ve 1 )||. < £ / 2 and the result follows.
•
Corollary 4.0.10. Suppose (X,tF,n,T*) is a free and ergodic G-action, Q : X —• Eg is a finite partition and
Copying Lemmas (1) (2)
81
\\(T',Q),(T?,Qi)\\.<5and h(T*\Q])>h(T«,Q)-E/6.
For (Y,'2?,v,S) a Bernoulli G-action with h(T,Q)>h(T**,Q)-e/3, let U°" = (T°" x S) be the direct product action. Then there exists 4>' in the full-group of Ua' such that (3)
(4) ||( a i ,f),(a,^)||. <£, and (5) Proof To begin, notice that (T a *,Q) and ( T a , g o 0" 1 ) are identical in distribution. Furthermore, notice that, given any eo > 0 (whose value we will set later), there exists EI > 0 so that if
then
(AM xv)(Q,of-'Ae')<eoThis is simply the observation (explored in more detail in Lemma 4.0.11 below) that there is a coding from the Q V g(a,0)-name of a point to its Q o 0~'-name, and applying this coding to the Qi V g ^ ^ - n a m e will construct the Q\ o 0'~'-name, which must agree most of the time with the Q'-name, if £] is small. Given e2 > 0 whose value will be set later, choose 8 with 82 > S > 0 so small that (1) and (2) (using P = Q o <^~' in Corollary 4.0.8) imply there is a partition Q' : X\ x Y —> Eg with (1') (2') h(U"\Q!) > h(Tx,Qo ,"')- 2e2 = Given £1 from above, there is an £2 so that (1') and Corollary 4.0.9 imply there is a (j)' in the full-group of U"1 with \\(U"\Q, V Q' V g(a^)),(T",Q and hence
V (Q o «T') V g M ) ) | | . < £ ,
82
Copying Lemmas If EO is small enough, then
can be made < a (which is (3)) and
> h(Ta+, Q) - 2e2 - e0 log(#Z e ) - //(so) can be made > h(Tart>,Q) - e (which is (5)). Lastly, as
if ei is small enough,
which is (4).
•
Lemma 4.0.11. Suppose (X, 3F, \i, Tx) is a free ergodic G-action, cj> is in its full-group with g(a,
Vg (a| , 0l) )||. < 5 we will have ^
Proof Remember that in general the two processes (S^^Ro (SP,R) are identical in distribution and so (2) can be written
tp) and
We now show that the T",(Q Vg(,^-i)) o ^"'-name of a point is a bounded code from its T a , Q V g(a^)-name. That is to say there is a finite set K £ G with (Q V g( a ^-i)) o 0~' measurable with respect to VKeK ^ a -i(2 v g(«,0))- We complete the result by noting that one now can apply this same code to the process (T*l,Q\ Vg(ai,0,)) to obtain the partition (Q, V g ^ ^ o ^ 1 and the result.
Copying Lemmas
83
We see this finite code by first noting
where (^"'(x) = T ^ j M where ^ ( ^ " ' ( x ) ) = z(x)~l. This value z(x) is a finite G-valued partition, taking values in the inverses of the values taken by g(a.0). Thus z is obtained as a finite coding from the Ta,g(ajf,y name of x. By searching out over this finite list of values for the solution to g(a4)(T?(x)) = z" 1 . Thus g(a0,0-i) o 0~'(x) is a bounded code of the Ta,g(ai0)-name of x. As Qo(p~l(x) = 2(Tf(v) (x)), and z is bounded, this is a finite code from the T", Q V g(a^)-name of x as well. Notice that the code to z(x) is canonical in that its dependence is really just on the function / " ' * | K and not on the measure space giving rise to this particular function (see Lemma 6.3.5 for more on this). Hence precisely the same code will apply to the T*',Q\ Vg( a ,^j-names. • We now state a rather complicated result that is really little more than a series of observations from our current vantage, but states exactly what we will need in the equivalence theorem. Theorem 4.0.12. Suppose (X, 2F, /.i, T a ) is a free and ergodic G-action, P and Q are finite partitions. Suppose ffl is a Ta-invariant a-algebra on which the action of Tx is still free, and Q is M' -measurable. Suppose also that 4>\,(j)2,. • .,4>i,4>i+\ are all finite-valued elements in the full-group of T a and that h(Ta,Q)>
h(Tx
Then there exist full-group elements
(1) \\(T*,P V Q V VJi 1 g ( a , W ),(T«,P' V Q V Vj=,' g(a,0;))l|. < e; and (2) h(Tx
84
Copying Lemmas
Using Lemma 4.0.11 for any 62 > 0, if <5i is small enough we will have
Remembering that (T^, Qoy) and (Ta, Q) are identical in distribution, we know that h(Ta
h(Tcut"+l,P)-s/2.
Now applying Corollary 4.0.6 with
"T"
=
we can copy P into J f as a partition P'. Thus for any e > ^3 > 0 if 62 is small enough, we will obtain
(T^,
(pve, v g ^ ^ w T ^ n . < «53.
More importantly though we get the entropy estimate: h(T<^,P')
> mm{h{T*'t"+\P\h(T<^,Qo
(j)'l+{)}
-E/2.
As
^+',Qo4>',+])
= h(T\Q)
> h(T°*'+\P)
we obtain (2). Continuing, for any £4 > 0 if ^3 was chosen small enough, by once more applying Lemma 4.0.11 we can pull back into the T" and T,°" systems with ||(T a , P V Q V g M / + l ) ) , (T a , P ' V (2 V g ( ^; + l ) )||. < 54Now for the easy part. In Corollary 4.0.7, with " P " = \/'=i g(a,
Copying Lemmas
85
with
||(Ta,F V QVgMl+l)v\J
gMl]), 1=1
1=1
If Ss is small enough, applying Theorem 4.0.3 to each of the P[ we can modify them to become g(ai,^) obtaining (1) of the Theorem. • We now develop a copying lemma that moves in precisely the opposite direction from Theorem 4.0.5 and its corollaries. There the principle intent was to maintain entropy as high as possible. Here it will be to decrease entropy as much as possible. This will play an essential role in our work on m-entropy. The proof follows exactly the same lines as that of Theorem 4.0.5 except that where we worked to make the assignment of names q as close to 1-1 as possible, here we will work to make it as far from 1-1 as possible. is a free and ergodic G-action and Theorem 4.0.13. Suppose (X,^,fi,T) P and Q are two finite partitions and ffl is a T invariant sub-a algebra of 2F on which the action is still free and for which Q is $t'-measurable. Suppose (1) h(T,Q\P) = h(T,QVP)-h(T,P) there is a partition P' e $C with (2) \\(T,PyQ),(T,P'VQ)\\.<eand (3) h(T,Q)-h(T,P')>e.
> e > 0. Then for any e > 0
Proof The proof is so parallel to that of Theorem 4.0.5 that when appropriate we will simply refer to the analogous facts in that proof. Fix e > 0 with e < h(T,P V Q) - h{T,Q) and let i0 = [Iog2(2/e)] + 1. We will obtain \(T,PVQ),{T,P'vQ)\Fio <e/2 implying (2). (See the beginning of this section for definitions.) Let a = min({^(/) : /*(/) > 0 and / is an atom of VgeFfo T g -i(P V Q)})
86
Copying Lemmas
and let n0 be the number of elements in \/geF. <
Tg-\ (P V Q). Let
sa 10no #Fi0
be so small that N M Choose a finite subset K £ G with Fio £ K and values 5 > 0 and JV so that Theorem 3.0.5 will guarantee that for any (c),K)-invariant sequence of sets H\,...,HN we can construct an e-Rokhlin tower R = {//,• x /I,}!!, in any free and ergodic G-action. Fixing N we will place further requirements on the sets follows:
H\,...,HN,
as
(i) for any e-Rokhlin tower R = {//,• x /4,-}-^,, using Lemma 4.0.4 we want H(UR) (ii) for any atom / G V Tg-i{P V (2), by the mean ergodic theorem,
(iii) letting Q(P V Q) = {/ G V g6H| T g -i(P V and
the //, are sufficiently invariant that the Shannon-McMillan theorem (Corollary 3.0.10) gives / /(C,(Pve)nC,(P))>l-e
and (iv) the H, are so large and invariant that < e2 and 2
4
;
87
Copying Lemmas Set Dj = l] e \f
Tg-i ( P V g ) : for some I E /
and all
fe V Tg-,(Pv0,
<
By (ii) we know
Define
n(f" n (Uj S D . n C ( ( P v e ) n (U/'ec,(P))) One calculates /t(U/»e£(/") > 1 — 2e2. For / " € £, let »/(/") = {/' G C,(P) : / = / " n / ' G D, n C , ^ V Q)} Hence for each / ' e £,
and setting
we have
Finally let G,(P) = {/' G Q(P) : M/' n C , - ) > ( l and one calculates > (1 - 2e) - e4 > 1 -
3E.
88
Copying Lemmas For any / ' e Gj(P) notice that (I
»
• f' nnJ f" — c ru i 1; — > -J
= (1 > (1 -3e)2 ( e + 4 e / 5 ) # H ". As in Theorem 4.0.5 attempt to make an assignment p :£,--> G,(P) with p(f") G >?(/") but here we push p as far away from one-to-one as possible. More precisely, suppose inductively p has been defined on a collection of atoms D c £,. For the first step just assume D is empty. If n(D) > 1 — 4e stop. If not, then we must have
and for some / ' G G,(P) we must have
n(f n (u r e D /") c n (uf6Gi(P)/')) > Hence for this / ' we must have
#{/" : /' n /" e G, /" ^ D} > %T 2(e+7e/10)#Hi_
For this / ' and any / " with / ' Pi / " e G,- but / ^ D we will have / ' 6 rj(f"). Extend p to these / " by setting
Pif") = f. Notice that this forces p to always be at least (2(c+7e/'10'*Wi) -to-one where it is defined. This assignment can proceed until n(D) > 1 — 4e. We now proceed as in Theorem 4.0.5. Construct an e-Rokhlin tower R = [Hi x Ai}?Ll measurable with respect to the sub-algebra ^C with
Partition each At according to the atoms / " G V gsH/ T g -i (Q). For any / " G D, we have assigned a name p(f"). For any / " ^ D, assign to / " a single name constant at all indices. Paint these P-names on the tower as described at the beginning of the section to construct the partition P'.
Copying Lemmas
89
Outside of the tower image assign all points to the same symbol of £/> as was used to assign names to / " ^ D. That (2) now holds for (T,P' VQ) is precisely the same as in the proof of Theorem 4.0.5 so we do not repeat it here. All we need argue is that
h(T,P')
elements. This value is of course
Let W be the partition w
>
*
on
Qi
otherwise
It is easily seen that P' is measurable with respect to ^f(T, W). Hence
h(T,P')
+ H{YlR)
< h(T,Q) - e - s/W < h(T,Q) - e.
D Corollary 4.0.14. Suppose (T,^,ii,Ta) is a free and ergodic G-action, Q is a finite partition of X and 4> is in the full-group of Ta satisfying (1) h(T«, Q\Qo r ' ) = h(T,QV(Qo
,-')) - h(T, Q o ^ ) > e.
For any e > 0 there exists ft in the full-group of Tx with (2) (3) (4)
90
Copying Lemmas
Proof Apply the argument in Lemma 4.0.11 to Theorem 4.0.13 instead of Theorem 4.0.5. As we are after an upper bound on the entropy of h(Ta,Qo(j)~]) as opposed to a lower bound, as obtained in Corollary 4.0.8, there is no need for the extra Bernoulli shift. Of course we set T\ = T, Q = Qi and P = g o f . •
5 m-entropy
We define the m-entropy of T* to be the infimum of entropies of T&, over arrangements [i in the m-equivalence class of a. Although our earlier work has made it essentially obvious, we will show that m-entropy is upper semi-continuous in ma and hence this infimum is obtained on a dense G,5 subset of the equivalence class E,,,(a). As in earlier work [25], [43], the main goal of this section is to show that for a fixed size m, either the m-entropy of Ta is zero, for all arrangements a, or the m-entropy is the usual entropy for all arrangements a. We will say that each size m is either entropy-free or entropy-preserving. Let m be a fixed size. Let a be a G-arrangement. Recall that £,,,(a) = {P\a ~ /?} is the m-equivalence class of a. Definition 5.0.1. Define the m-entropy of T a to be
We first argue that this m-entropy is attained residually in Em(a). The following lemma tells us that if two arrangements are m-close then they are, in fact, close in distribution. Lemma 5.0.2. For every s > 0 there exists <5 > 0 such that if m(aj)<8 then, for any partition P,
Proof This follows from Lemma 2.2.9 as if P(T*{x)) f P(TJ!(x)) then in particular <x(x, Tj,'(x)) =f= g. D 91
92
m-entropy
Corollary 5.0.3. Fix any G-arrangement a and partition P. Let e > 0. There exists 5 > 0 such that ifm(a,p) < S (i.e. if P £ Bs{a)), then + s. h{T'l,P)
This is a direct consequence of Lemma 5.0.2 and Theorem 3.0.17.
D Theorem 5.0.4. The set of P e Em(a) with h{T^) = hm(Tx) is a dense Gs in Em(a). As Em(a) is Polish, this infimum is achieved residually. Proof
Fix a and P. Let
If h(TP) = hm(Ta) then P e < V for all e > 0 and finite partitions P. On the other hand, let {P,} be a countable family of partitions labelled by finite subsets of N, dense in the L1-metric on partitions. As h(Tll,P) is continuous in the L1-metric, if P € 0 !/&,/>. for all k and i then we will have
for all P and hence h(T^) = /i m (T a ). All we need to see is that the sets (P^P are open. This, though, is just upper semi-continuity of entropy h(p) for (7-invariant and ergodic measures on Zp, Theorem 3.0.17. D Our goal now is to show that if ever /zm(Ta) < h(Ta), for some a, then for this size m and for any G-arrangement /?, we have hm(T^) = 0. Note that this P need not be in the m-equivalence class Em(a) or even on the orbits of the same free and ergodic action. Although our path to this is somewhat technical we gain a lot of insight into the relation between a size and its entropy along the way. First we show that if entropy can be lowered by moving to another m-equivalent arrangement, then relative to a partition, it can be lowered by an element of the full-group. Lemma 5.0.5. Let e > 0. Suppose, for some a, hm(T°)
< h(T*) - e.
Let P be any partition. For any e > 0, there exists <j> such that m(a, >)<£ and h(Ta'l>,P)
m-entropy
93
Proof As the full-group acts minimally on £,,,(a) and preserves entropy, those /? e Em(cc) with h(T") - e are a dense set. Thus those /? with
are a dense and open set. On the other hand, those a(j> with m(a, >)<£ are dense in Be(u), the ball of radius e in m about a. Intersecting these two sets yields the result. • Next we show that if entropy can be lowered, then it can be lowered on an independent and identically distributed (i.i.d.) process. (We are following the argument of [25] and [43] here. Knowing that the mentropy of a Bernoulli action is less than its entropy will allow us to conclude this for all ergodic systems of positive entropy.) Lemma 5.0.6. Let a be any arrangement. Suppose there exists e > 0 such that hm(Ta) < h{Tx)-e. Let {UP,P) be any i.i.d. process with h(Up,P) = h(Ta). For any e > 0, there exists
Fix a e st with h(Ta) > 0. Suppose e > 0 such that
hm(Ta)
n > 0, so that hm(Ta) < h(Ta)
-e-n.
At this point, as in both [25] and [43] we seem forced to introduce a circularity in that we want to apply Sinai's Theorem to say we can obtain a full-entropy i.i.d. factor of Ta. This is of course a known theorem, so no circularity is really obtained. We also note that even within the confines of our work here no circularity is obtained as for the size m of conjugacy, m-entropy is of course entropy and the Sinai's Theorem (our Theorem 7.3.3) for this strongest of equivalences will hold without further ado concerning m-entropy. Applying the Ornstein-Weiss, Sinai Theorem [37] we see that there exists a partition P' such that h(Ta,P') = h(T") and (Ta,P') has the
94
m-entropy
same i.i.d. distribution as (C/^,P), that is to say \\{Ta,P'),{Ull,P)\\,
= 0.
By Lemma 5.0.5 for any e > 0 there exists a full-group element >' with m(a, <j)') < 8 such that m(a, ft) < | and
If, in fact, this 0' were measurable with respect to V T^^P')
then
geG 8
would have a "version" on (U^,P), which would complete the proof. In general this need not hold. We need to "copy" 4>' into this process. Without loss of generality, we may assume that
then both m{<x, < and
Apply Corollary 4.0.9 with "Q"= P' and "(j>"= (/>' to select 4>, measurable with respect to M* = V T^iP') such that
... ._,vn(r,g M v, Hence h(TMt',P')
D
Next, we see that if the entropy of a process can be perturbed down by arbitrarily small (in m) elements of the full-group, then the m-entropy of the action is in fact less than the standard entropy. This provides a converse to Lemma 5.0.6 Lemma 5.0.7. Let e > 0. Suppose, for some partition P and arrangement a, there exists {<£;} £ r with m(a, <£,•) —* 0 and
h(T*
m-entropy
95
Proof To begin, as the assumed inequality above is strict, it still holds for some e' > e. By Theorem 3.0.17 conditional entropy is upper semicontinuous with respect to the distribution pseudometric. Thus for any finite partition Q, we have that l i m s u p h(Ta
l i m s u p (h(Ta
+
h(T*,Q\P)).
/—*00
Thus, by hypothesis, for any finite partition Q, lim sup h(Ta
a strict inequality. In particular this tells us that for any finite partition Q there is a partition R = P V Q refining Q and elements of the full-group 4>i with m(a., (/>,) —• 0 and lim sup fi(Ta*', R) < h{Ta, R) - e. Let Pj be a countable list of finite partitions dense in L1 in the space of all partitions taking values in N. We will define a sequence of sets. Let Oj consist of those /? e Em(a) for which there is some partition R refining Pj and h(Tls,R)
=
h(Ta,R')-e'.
I—>CO
This implies that once i is large enough a^>i/->; S Oj and ucj) is an accumulation point of Oj and these open sets are dense.
96
m-entropy
We know Em(a) is Polish in the m-topology and hence these sets have non-trivial intersection in Em(a). For any /? in this intersection, and for all partitions P, there will be a refinement R of P with
It follows easily that
• Lemma 5.0.8. Let a be any arrangement. Suppose there exists e > 0 such that hm(Ta) < /i(Ta)—e. Let /? be any arrangement on any free and ergodic orbit relation for which h(T^) > h(Ta). Then hjjt) < h(T»)-e. Proof Apply the Ornstein-Weiss Sinai Theorem [37] to obtain a partition P such that (TP,P) is i.i.d. with h{Tp,P) = h(T«). By Lemma 5.0.6, there exists a sequence {>,} £ p such that m(fi, 0,) —> 0 and, for every i,
h(Tl)
•
The next result gives some insight into the relation between a size m, rearrangements and entropy. This is precisely where we use the copying lemma Theorem 4.0.13 by way of Corollary 4.0.14. Theorem 5.0.9. Suppose Ta is an ergodic G-action, P is a finite partition, e > 0. Suppose {),} £ F with (i) m(a, (f)i) —* 0, and i
(ii)h(Ta,P\Po
(iv) h(T\Poxp^) = h(T««\P) < h(Ta,P)-e. Proof
Given e > 0, we will construct \p with
m(a,xp) < e and ^(T^.P) <
h(T\P)-e.
m-entropy
97
Choose 0, with m(a, >,) < e. Axiom 3 tells us that there is a d > 0 so that if || (a, >,), («, V) II. <<5 then we will still have m(a,\p) < e. Corollary 4.0.14 tells us we can construct a \p with h{Tav,P)
= h(Ta,P
oxp~l)<
h(T*,P)
- e
and with || («,<£,), (a,
•
The preceding result can be read as saying that if one can find fullgroup elements of arbitrarily small m-size that move a partition in a way visible to entropy, then one can actually lower entropy and as we are about to see, push entropy to zero. This result will be central to the rest of our proof of the dichotomy of entropy-preserving and entropy-free sizes. Lemma 5.0.10. Let e > 0. Let T\ and T2 be two ergodic G-actions such that h(Ti x T2) > 0 and hm{Tx x T2) < h(T{ x T2) - e. Then either hm(T\) < h{Tx) - f or hm(T2) < h(T2) - f. Proof Let a] and <x2 be two G-arrangements with T\ = T°" and 1 2 T2 = T * . Associated with T°" x T1*2 is a G x G-arrangement a such that Tx = T"' x T"2 Thus, according to the hypotheses, we have a G x G-arrangement a such that h(T") > 0 and /i m (r a ) < h(Tx) - e. Apply Lemma 5.0.6 to see that for any i.i.d. process (U^,P), if hiU^P) = h(Tx) then for any e > 0, there exists (f> in the full-group of UIS with m(/?, (f>) < s and /i((/W, P) < h{Ufi, P) - e. For i = 1,2, let (Up',Pi) be i.i.d. processes with h(U^,Pi) = h(Ta>). Let P = Pi x P2, p be such that Up = U11' x [/&. Then for the (f> found above m(f},4>) < e and /i(l/W,P, x P2) < h(UP,Pi x P2) - e. Note that, in general, U^ is not a direct product. The above calculation together with basic entropy facts, imply that (writing 2 to mean the trivial algebra) one of the following must hold; either (1) hiU^Pt (2)
x2|(P1x2)o0-')>f,or
98
m-entropy
Without loss of generality, suppose (1) holds. By Theorem 5.0.9, given any s > 0, we could have selected e in such a way that there exists y; in the full-group of Up with m(/?, \p) < I and
P, x2)
x2)
Apply Corollary 4.0.9, with "Q"= P, x 2 and ^ = V g6C l/J_,(P, X 2) = Vgect/f'^CP,). Thus for any (5 > 0 there exists y; in the full-group of U\' /^)
V(P, x 2)),(l/f', g ( / w V(P,))||. < .5.
By Axiom 3, we may select 5 so small that also m(fi\,\p) < E. Furthermore, by the upper semi-continuity of entropy, for any IJ we may select 5 so small that /)(t/f"' ) ,Pi) < h(Ul!i',Pi x2) + /j. Thus, in fact, the 5 may be selected so small that we still have P\)
.
In particular, for any e > 0, we have produced an element y; in the full-group of t/f1 such that m(pitw) < e and /?(t/f"' ) ,P l ) < f i ( l / { \ p , ) - f. Now, Lemma 5.0.7 implies that /im(T,) < /?(Ti) — | as T| must have a factor conjugate to the Bernoulli shift (/f1. • We are now ready to complete the proof that there are two kinds of sizes - those which are entropy preserving, in which case the m-entropy is simply entropy, and those which are entropy-free, in which case the m-entropy is always zero. Theorem 5.0.11. Let e > 0. Suppose there exists some G-arrangement a such that h{Ta) > 0 and hm(Ta) < h(Ta)-e. Then for any free and ergodic G-action S'1, hm(Sp) = 0. Proof Suppose for some arrangement /?, hm(T^) = B > 0. By Theorem 5.0.4, we may select y G £,„(/?) such that /?(S'O = hm(Sp) = B > 0. In particular, h(Sy) = hm(Sy) = B > 0. Let (U,P) be any Bernoulli process with h{U,P) = B. Select k so large (but finite!) that the /c-fold product (U x ••• x U,P x ••• x P) has entropy h(U x • • • x U,P x • • • x P) > h(Ta). By Lemma 5.0.8, hm(U x • • • x U) < h(U x • • • x U) — e. Applying Lemma 5.0.10 inductively, (In other words, we can lower k — 1 times, we see that hm(U) < hiU)—^. the entropy on any Bernoulli process.) Since h(Sy) = h(U), Lemma 5.0.8
m-entropy
99
now implies that h,,,(Sy) < h(Sy) — ^ r . But this is a contradiction, since hm{Sy) = h(Sy). Hence, we must have that hm(Sp) = 0 universally. • This completes our technical work on m-entropy. Definition 5.0.12. Let m be a given size. (1) If hm(Ta) = h(Ta), for all arrangements a, then m is called an entropy-preserving size. (2) If hm{T*) = 0 for all arrangements <x, then m is called an entropyfree size.
6 m-joinings
6.1 Polish topologies Our objective here is to produce a topological model for arrangements and rearrangements. We seek a single action of G by homeomorphisms of a topological space on which there is a canonical notion of a base arrangement and a rearrangement of it given as continuous functions. We want this model to be sufficiently large that any example of a measure space, an arrangement and rearrangement will be conjugate to this model space for a particular choice of measure. Thus to move from one measurable example to another is to choose a different invariant measure on the model. This will allow us to topologize the set of all rearrangements by a weak*-topology on this space of measure. To accomplish this, the topological space must be rather elaborately defined, and in particular cannot be compact. Sufficient to our purposes though they will all be Polish spaces in that the topological space is metrizable and, for some choice of metric, is complete and separable. Usually we will give explicit metrics for these spaces, and quite often they will not be complete for that metric, but will always sit as residual subsets of their completion, as this is a universal property independent of the choice of metric. We will rely in an essential way on the results of Varadarajan [57] that the space of probability measures on a Polish space is again a Polish space in the weak*-topology of convergence on bounded continuous functions. Recall the following definition. Definition 6.1.1 ([5]). A topological space (X,T) is called a Polish space if it can be metrized as a complete and separable metric space. 100
6.1 Polish topologies
101
Polish spaces are functorially quite robust. Of particular interest to us, are those listed in the following lemma. Lemma 6.1.2. (i) Any countable set in the discrete topology is a Polish space. (ii) Any countable product of Polish spaces (in the product topology) is a Polish space. (iii) Any open, closed or Gs subset of a Polish space is a Polish space. Proof
See [5].
•
One fact about Polish spaces, which will be important for our work, is that a Polish space is a universal G$, that is to say, a Gs-subset of its completion, relative to any metric giving the topology. In particular, we have the following lemma from [5]. Lemma 6.1.3. Let X be a Polish space. Then a subspace of X is Polish if and only if it is a Gs in X. Proof See [5].
•
Definition 6.1.4. Let J/(X) be the space of real valued Borel measures on the Polish space (X,T) (we suppress the topology in the notation). Let J/\(X) be the probability measures in J/(X). Theorem 6.1.5 ([57]). In the weak*-topology of convergence on bounded continuous functions, J(\ is again a Polish space. Proof Theorems 14 and 17 of [57] give this result. Note: "topologically complete and separable" is equivalent to Polish. • We find the following picture useful to understand this result. Lemma 6.1.6. The space (X,r) is Polish if and only if it is a residual subset of some compact metric completion. Proof If (X,T) is a residual subset of some compact metric space, it is separable, and, as any Gs subset of a complete metric space is topologically complete, we have one direction of the proof. For the other,
102
m-joinings
suppose (X, d) is a complete and separable metric space. Let x, be a countable and dense subset of X. Define a map h{x) = {d(xi,x)}f=x. It is easy to check that h is 1-1 and a homeomorphism to its image in MN with the product topology. For each x, and j > 1 define an open set (9,j £ RN by 0 y = {y : \d(xk,Xi) - yk\ < 2~j for all k < max(/,y)}, that is to say, all vectors y which are within 2~! of /)(x,) on coordinates For any x e X there is an / with d{x,xt) < 2~J and hence for all k, \d(xk,Xi)-d{xk,x)\
< 2~J and so h{x) G (9U.
Thus (9j = (J(. Ojj is an open cover for the range of h, and range(/i) c Suppose j; e p| (5y. Then in fact there must be a sequence i(j) with
(it is easy to see that such an intersection is at most a single point). Thus in particular for any j > 1, \d(XiU),XiU))-yiU)\
= \yi{j)\ < 2~J
and as we know for any / > ma\(i(j),j), \d(xi{j),xi{f))-yiU)\
<2~f < 2~J,
\d(xiUhxi(n\<2-J+l. This implies x,^ is Cauchy in X, converging to some XQ. For any k>0, if sup(y, i(j)) > k then \d{xk,xiU))-yk\ < 2~}. As j -* co, d(xk,xnk)) —> d(xk,x) and so h(x) = y and range(/i) = f|; ^7 ' s a G^ subset of MN. Now RN can be compactified as a residual subset of [0, l]n and hence (X,d) can be embedded as a G$ subset of this compact metric space. • Lemma 6.1.7. If(X,x) is a Polish space and Y cr X is a Gs subset (and hence Polish) then Jl.(Y) consists of the subset of J/{X) of measures supported on Y with the inherited topology.
6.1 Polish topologies
103
Proof This is a direct consequence of Theorem 2.7, page 368 of [21], that if m, -* m in JI\X) and g is bounded and continuous m-a.e., then
/•
f
Jx
Jx
/ gfl/'i -» /
The above two lemmas tell us that we can view a Polish space as a residual subset of a compact metric space, and its space of Borel measures as just the subset of Borel measures on the compactification, with the inherited weak*-topology. Most importantly, this is independent of our choice of metric compactification. To bring dynamics into the picture we say (X, T, {T g } geG ) is a Polish G-action if {X,T) is a Polish space and {Tg}gec is a n action of G on X by homeomorphisms. For example, if G is countable and (X,z) is a Polish space, then the shift action of G on XG is a Polish G-action. If G is countable notice that, for this example, there is a metric compactification of X to which the action of G extends as homeomorphisms (if {C,d) is some metric compactificication of X, then CG is a metric compactification of XG). Lemma 6.1.8. If(X,T,{Tg}g<=G) >s a Polish G-action, G a countable group, then X has a metric compactification to which {Tg}g6G extends as homeomorphisms. Proof Embed X in XG by mapping x to {Tg(x)}g€c- Call the map p. Note that p conjugates {T g } gS r to the shift action {ag}gec- It is easy to see that p is a homeomorphism from X to its range. Let C be some metric compactification of X, and d some metric on CG giving the product topology. The pull-back d o p of d to X via p is a metric giving the topology T. The completion of X with respect to this metric will embed via p isometrically to the closure of X in C'1, a compact metric space. As p(X) is cr-invariant, so is its closure. • For (X, T, {T g } geG) a Polish G-action, G a countable and discrete group, for any point x e X, the set l(x) = {g e G : Tg(x) = x} is a subgroup. Similarly, for any g € G, {x e X : Tg(x) = x} is a closed subset. For any subgroup W s G w e can define XH = {x : H = I(x)}. Lemma 6.1.9. For any subgroup H £ G, {x G X : I(x) S H) is a closed subset, and XH is a G& subset of it. In particular each XH is a Polish space.
104
m-joinings
Proof Certainly {x G X : I(x) £ H) = f]geH{x closed set. Now
e X : Tg(x) = x} is a
XH = {x e X : I(x) s / / } \ U {x e X : Tg(x) = x} a closed set with a countable union of closed sets removed, hence
a G <5 .
•
Lemma 6.1.10. Let {X,x,{Tg}geG) be a Polish G-action. The subset of J/\(X) consisting of all {Tg}^-invariant measures is a (perhaps empty) closed and convex subset of Jl\{X). The extreme measures are the ergodic measures for the action and every invariant measure is an integral ofergodic and invariant measures. Proof If (C,d) is a metric completion to which {T g } g s G extends homeomorphically, then in Jl\(C) the invariant measures are a compact and convex subset. Hence the intersection with Jt\(X) is relatively closed. As J/.\{X) is a convex subset of Jl\(C), we obtain convexity for the invariant measures. If we write an invariant measure fi as an integral of extreme invariant measures in Jl\(C) (as we know we can, this is just the ergodic decomposition) H=
v dn(v)
where n is a probability measure supported on the ergodic measures in J/.\(C), it is an easy calculation that n-a.e. v must be supported on X, that is to say must be in Jl\(X) as /.i is. Hence n must be supported on the extremal measures in J/t(X). It is clear that if \i is ergodic for the action on C it is ergodic for, its restriction to X is. • This implies that the space of invariant probability measures for a Polish G-action is a Polish space. We let J/S(X) represent the invariant (stationary) measures, and J/.e(X) the ergodic ones, suppressing the other terms of (X, T, {r g } gE c) unless they are not obvious. Corollary 6.1.11. Suppose (X,x,{Tg}gec) is a Polish G-action, G a countable and discrete amenable group. The space of ergodic and invariant measures J/e(X) in the weak* topology is a G^-subset of' Jls{X) and hence is a Polish space.
6.1 Polish topologies
105
Proof It is sufficient to prove this for (X, T) = (C, d) a compact metric space where it is a standard consequence of the mean ergodic theorem. • The next definition and theorem are not precisely to our purpose, but indicate a bit of the impact of this point of view, and are a first case of the general machinery we are building. Definition 6.1.12. Suppose (X\,t\,T^) and {Xi,*!, T^) are two Polish Gactions, G a countable and discrete amenable group. Suppose fii and HI, respectively, are ergodic and invariant measures for these actions. The joinings of'n\ and \xi will consist of all invariant measures for the Polish action (X\ xX2,ti xi2,{Tg' x Tg2}gSc) which project to n\ and \ii, respectively, on the two coordinates. We write this set of measures as J(T\, T2) suppressing all the space variables. Theorem 6.1.13. The space J(T\, T2) is a closed, convex subset of Jls(X\ x X2). Any joining \i G J{T\,T2) will be an integral of ergodic measures in J(e(X\ x X2) which must themselves be joinings. The ergodic joinings themselves are a G$ subset of' J(\(X\ xX2) and hence are a Polish space. Proof As in earlier arguments, we can again assume X\ = C\ and X2 = Ci are compact spaces by extending the actions to some metric compactification. In the context of compact spaces, the fact that any joining is an integral of ergodic joinings is well known (see for example [44]). As these ergodic joinings project to \i\ and \x2 they are supported on X] x X2. As J(T\, T2) is a closed subset of Jl\{C\ x C2), and the ergodic joinings are the intersection J(T\, T2) n Jle{X\ x X2), an intersection of a closed set and a Gs, the ergodic joinings form a G$. • We are now ready to introduce the explicit Polish G-actions we will use to model arrangements and rearrangements, and ultimately to define the notion of m-joinings for m a size. In this section we will construct a natural Polish space modeling a pair a and ft of G-arrangements, that is to say a Polish topological space on which all such pairs can be modeled by invariant measures for a fixed orbit-equivalent pair of G-actions. This is a dramatic shift in perspective. It is analogous to shifting from the consideration of all Z-valued partitions of some fixed space and Gaction to the space of all cr-invariant measures on E G . In the latter there is only one partition of real interest, the time-identity partition. Analogously in our case there will be only a distinctly limited and
106
m-joinings
precisely encoded set of arrangements and rearrangements of interest on the spaces constructed. These will not be arrangements and rearrangements in the precise sense we have given earlier, as they will not sit on a single measure space. Rather, they will be topological analogues. There will always be a base arrangment which we will write as a corresponding to the base action of G defined on the space. This basic action will be some generalized "shift" action which will be called a = <J* if it is truly a shift on a product space, or S = S" otherwise. There will also always be some second arrangement (we are modeling changes of arrangements after all) which we will label /? with corresponding actions a$ or S^. At times we will define the actions S a and S^ as maps before discovering that they are orbit-equivalent. Furthermore, for most of our work no measure will be specified. In the end though, two such actions will be orbit-equivalent, but may not be free. Hence the actions cannot be said to give rise to arrangements a and /}. At such points we ask the reader to regard this notation as formal and to look for later clarification of what precisely a and j? represent. Later we will also consider full-group elements intrinsic to the construction. In these models there will first be just one full-group element which we will write as <j>, and later a sequence of them written $,. Whenever we use this bold-face notation we are indicating the canonical elements of the construction and not an arbitrary element of a full-group. Our topological models will contain all pairs of arrangements, and later all rearrangements, and convergent sequences of rearrangements, not as distinct arrangements on one G-action, but as distinct invariant measures on one canonical construction. As usual G is a fixed, countable and discrete amenable group, and F,some Folner sequence in it. We will stick to certain notational conventions that should help the reader follow some of the more technical twists we take. For a G-action (Z,&, fi, T a ), the symbol p will always represent the function from a point z to its name in Z G , i.e. p(z) = {T*(z)}geG- The symbol q will always represent a standard involution of the spaces involved. The symbols n (n\ or 7i2) will always represent the projection of a product onto one of its coordinates, i.e. Z\ x Zi onto Z\ or Zj_. We will develop more elaborate topological representations of a G-action than simply the shift map on Z c . Under these circumstances we will us L to represent a labeling of the action by some more elaborate coordinates. Remember that to any two arrangements a and /? and x G l w e can associate a bijection \tf : G —> G which describes the manner in which /?
6.1 Polish topologies
107
arranges an a orbit. Similarly, to any rearrangement pair (a,$) and point x s X we associate a map /"•* which describes how > moves points on the orbit of x as seen by the a-arrangement of that orbit. The actions T a and T!i (or T0"*) act on the spaces of such functions in an equivariant fashion by acting on the point x. It is these actions on the corresponding functions that we wish to model topologically. Keeping this in mind will make our definitons and calculations more transparent.
6.1.1 Overview of the topology on m-joinings Before beginning the explicit construction of these models we give a brief and vague description of each to give an overview of their structure: 1. We start with
108
m-joinings
Polish space of measures and models all m-equivalences via explicit choice of a rapidly converging sequence of rearrangements. 9. To build a space of m-joinings of two actions T,"1 and T%2 we first model them as acting as the shift map on spaces Z,G and Z2G. We then consider the subset of Jim(Z\ xZ 2 ) where the action given by a projected to Z,G is T,0" and the action given by fi projected to Z2G is 7 2 2 . We call this Polish space of measures Jm(T*\ T2a2). Now to the actual construction of the models.
6.2 Modeling pairs of arrangements (
Let S be the set of all bijections of G fixing the identity, that is to say, the collection of all maps h : G —> G that are 1-1 and onto and with /i(id) = id. These form a group under composition. We can topologize 3? as a subset of GG, putting on GG the product topology as a product of discrete spaces. It is an exercise that the group action of composition is jointly continuous, and the map q : h —• h~x is a homeomorphism of <S. This is our initial version of the involution q. In Section 2.1 we gave an explicit metric d for the space (S making it a complete metric space. Recall that this metric was invariant for the involution q : h —» h~x. It is useful to notice that <& is a zero-dimensional space, and so in C3 x
or
6.2 Modeling pairs of arrangements
109
Theorem 6.2.1. The collections of maps {Sg}g6G and {S^}gec ore both G-actions on <$ by homeomorphisms. As S*(h) = Sf^ih), they are orbitequivalent. They also are conjugate as the map q conjugates S* to S|. Proof
The following two calculations explain why we have G-actions: S,-oS-(*)(*) = S'(S« (*))(*)
and for S^ showing the conjugation of S* to S{! will suffice: q(S*(h))(k) = (S"g(h)Tl(k) =
h-[(kh(g))g-1
As G is a group this implies that all the maps S* and S£ are 1-1. As the topology is the minimal one making all h —> h(k) continuous, to verify continuity of S" is just to verify continuity of h —• S*(h)(k) = h(kg)h(g)~\ which has the same level sets as h —> h(kg) which is continuous. That SJ! is continuous is the same argument. D It is evident that the action of S* is not free (in particular, the identity bijection is a fixed-point of the G-action). Hence we cannot speak of arrangements associated with the two actions in the form of maps from the orbit relation to G. We almost can though up to the isotropy subgroups of points, and we do this by defining two a/most-arrangements by
a{h,s;(h)) = gli(h)
and
flM/W)
= ghW
where Ii(h) is the isotropy subgroup of elements of the S a-action fixing h, and I2(h) is the subgroup of the S^-action fixing h. It is easy to see these are well-defined on pairs of functions on the same orbit. Notice that S*h(h)(h) = S*(h) and so we can write:
Zm
P(h,sH{g)(h)) = h(g)!2(h).
If we regard the orbit of h as identified bijectively by
and similarly identified bijectively by S'(h) ->g/2(/i) e G/I2{h),
110
m-joinings
then the map which carries this one identification to the other is the map
gl\(h)-+h(g)I2(h). Lemma 6.2.2. The bijection It is a group isomorphism from I\(h) to Ii{h) and moreover, for any g G I\(h) and k e G, h(kg) = h(k)h(g). Proof Notice g G 7,(/i) iff h{k) = h{kg)h{g)~x for all k. This may be rewritten as h(kg) = h(k)h(g). We have already seen that /7(g/i(/?)) = Hg)h(h) and this completes the result. • Thus h is well defined as a map from G/I\{h) -> G// 2(/i). As such, h is precisely the map which rearranges the S* and S11 orbits, identified on these spaces by the two actions. We should point out here that even though on the orbits of S" we may only have an almost-arrangment, that is to say a map on G/I\(h), the value h itself tells us how to rearrange an entire copy of G, not just the quotient space. This observation was in fact the origin of the construction. We wanted an action which modeled topologically all possible ways that a second arrangement arranges the orbits as viewed from a first arrangement. Our goal now is to produce a topological dynamical system on which the invariant Borel measures model all possible pairs of arrangements on any given measurable action. Notice that we are close to having just that. If (X,£F,[i) is a standard probability space, with an ergodic measure-preserving orbit relation (9, and two G-arrangements a and /? on it, then the map L : x -» hf
maps
X^(S
with the arrangement a identified with the a/most-arrangement a, and similarly /? with p. That is to say, L conjugates Ta to S a and T' ; to Sp. Furthermore L'(n) e J/e(<&). Thus the weak*-topology on Jte{<3) can be used to topologize spaces with pairs of arrangements. This is an inadequate structure so far because of a lack of 1-1-ness and of course the lack of freeness of the orbits of S". For example, if a = /? then L'(fi) will be a point mass on the identity. Even if L*(fi) is supported on free orbits, the map L* will be far from unique, as it depends only on the way the orbits are rearranged and this may very well only generate a subalgebra of SF. To remedy this, let Z be a Polish topological space and let a be the shift G-action on ZG. For any h € 3? define a map r/, : ZG —> ZG by
6.2 Modeling pairs of arrangements
111
That is, ffg(Oi(z)) = ^(/,) ( T ''- | (g)( l )
which implies that the map
conjugates the action of S* x ag to that of Sj! x ff/,-i(g). Suppose (X, t") is a free G-action by homeomorphisms of a Polish space X, and L : X —> 2? is a continuous map factoring T a onto S". Then the action of S' on 5? lifts canonically to X as the action
Notice that, in this case, / ^ = L(x) G 0. We hope the reader will understand our continuing use of the notation a and /} even though the space has changed. The simplest case of this situation is where a is the G-shift on Z G , Z some Polish space and t a = S a x a on some Polish Y c # x ZG, projecting to S a on the first coordinate. By Lemma 6.1.9 the set of all free orbits in (S x Z G is such a 7 . Lemma 6.2.3. The maps T^ are homeomorphisms of Y forming a Polish G-action with the same orbits as Ta. Proof The only issue here is to see that the f£ are homeomorphisms and map Y to Y. Just notice that if h and h' 6 CS are close enough, then h(g) = h'(g). Thus if I and z' G Z c are close enough T^(y) and T£ ( g ) (/) will be close. We already know that ( 7 / ) - ' = tf_, and they are invertible. As T^ preserves T" orbits, it must map Y to Y. • This justifies our use in this context of the two names T* and Tp as they are orbit-equivalent, and we can now take a and p as the two corresponding arrangements. As Y consists of free orbits these are actually arrangements, short of the fact that arrangements were originally defined only on measure spaces, and here we have no specified measure. We have already seen the calculation that implies
hf.=h
and
hi? = h~K
112 a
m-joinings
Recall that the map q : (h,z) -»(h~ l ,r h (z)) is an involution conjugating
f to f'. Notice that the invariant (and ergodic) measures for f" and " f are identical. Hence q* acts as an involution on both Jls(Y) and Jf,e(Y). A standard probability space is, by definition, a Borel measure on a compact metrizable space. Hence any free G-action on such a space (Z,!8),n, {Tg}gec) can be mapped to Z G by p : z —> {Tg(z)}gsG , conjugating the action of T to the shift map a. This map will carry /< over to a measure supported on the Polish space of free a orbits. If we have two arrangements of these orbits by a and /? (and we may as well assume T = Tx), then we get a map L : Z -> <$ x ZG given by
This map conjugates Ta to T a and T /; to fP. It is supported on free orbits, i.e. maps to the Polish space Y. This map is 1-1 and so carries /.i to a measure in Jie(f) faithfully modeling the original system. If we were to interchange the roles of a and ft in this construction, the new image measure would simply be q* of the original one. In this construction the choice of compact metric space Z is arbitrary as any two non-atomic standard probability spaces are, up to null-sets, the same. What it provides us with is a fixed space on which to model pairs of arrangements topologically. Suppose we have two free and ergodic G-actions (Z,,^, / ( 1 ,{T g } g 6 G )
and
(Z 2 ,#,w,{S g } g 6 C ).
We can define the notion of an orbit joining of these two. Let p\ and pj be the maps from Z\ and Zi to their orbit names in Z,G and Z2G. Definition 6.2.4. Consider the Polish space ^ x ( Z ) y.Z{f and construct on it, as above, the two actions T a and T^ corresponding to the arrangements a. and p. Let Y be its space of free orbits and n\ be the projection of Y to Zf, i = 1,2. Any measure p. £ J/e(T) with the property that
and
is called an orbit joining of the two free processes.
6.3 Modeling rearrangements
113
Notice that if we reverse the order of the two processes, then the interchange of the Z\ and Zi coordinates in Y will take an orbit joining in the Z\,Zi order to one in the Zi,Z\ order. It is worthwhile to note that the space of orbit-joinings is not empty as it contains d\& X H\ X \ii, corresponding to the usual notion of joining and a will equal P, /t-a.s. More generally any joining of the two actions can be extended to an orbit joining by appending a point mass on id e (S.
6.3 Modeling rearrangements The two notions of a joining and an orbit joining of two G-actions are the extreme cases of what we really need, a restricted orbit joining. To define this we must bring sizes into the structure, and to do this we must bring into the model the behavior of elements of the full-group. This follows a path quite analogous to the development of <S. For / € G c we have already defined H by H(f)(g) = f(g)gf(id)~l. Notice that H(f) is a map G —• G fixing the identity. In all the following calculations it is important to remember that ~' placed by a bijection of G is the inverse bijection, and by an element of G is the inverse element. This is particularly regretable in regards to H(f)~l(g) and H(/)(g)~' which have distinct meanings. Notice that for fh{g) = h(g)g~\ H(fh) = h and hence H is onto. Definition 6.3.1. Let $ = {/ £ GG : H(f) G <$], that is to say, the set of maps f : G —> G with g -> /(g)g/(id) - 1 a bijection. Topologize 8& with the product topology on GG. Notice that the map H from 0t into (S is continuous. (In fact the map H is continuous as a map GG —> GG.) Also notice that if / e ^ , then
S*(H(f))(k)=H(f)(kg)H(f)(g) -i =
f(kg)f(g)~l=H(ag(f))(k).
That is to say, the map H conjugates the action of a on GG to the action of S a on ^. As dynamics on 01 we take the restriction of a to this (j-invariant subset. The pull-back //, of h to ^ described above is not equivariant. If it were, all arrangements would be cohomologous. Lemma 6.3.2. 01, is a Gs subset of GG.
114
m-joinings
Proof We already know that (8 is a Polish space, hence is a Gs subset of GG. As 3ft is the pull-back by a continuous map (H) of this Gs, it also
is a Gs subset of GG.
•
In <3 x 3#. we can restrict to the ftx = S°txa invariant subset of all (//(/),/). We will refer to this as the graph of H even though the coordinates are interchanged. As H is continuous, this set is homeomorphic to 3ft,. For / e ^?, we can define a map on the graph of H by
which is to say,
aL(ts(H(f),f),
= f(g) =
That is, <j>(fg(H(f),f)) = 0(//(<7g(/)),ff g(/)) and is well defined on the graph of H, even on non-free orbits. The following simple lemma explains how & models rearrangements. Lemma 6.3.3. Restricted to the graph of H, which is f "-invariant, the almost-arrangements at and /} satisfy
As the graph of H is homeomorphic to 91, the same identity holds on itself. Proof
This is just the following calculation: 1
P(f*(H(f)J), t?(H(f)J)) =
• G
On the graph of H as a subset of 3? x G , we can act by the orbit equivalent action "f, which as we have seen is simply Tx
6.3 Modeling rearrangements
115
as rj rearranges coordinates and / just flips values at each coordinate to their inverses. Thus letting
I commutes with f*, ft and q. Lemma 6.3.4. On the graph ofH in (S x (%,, the map ql conjugates T^ to tx, i.e. qlfV = t'ql. Proof The only thing to check is that ql preserves the graph of H. For / G @ set
/(g) =/(#(/)"'(g)r' = (/7/(/)(/)(g)r' which is to say
We can calculate: H(f) o (H(f)(g)) = / ( g r ' t f (/)(g)/(id) = g, and H(f) = (H(f))~l. This calculation also shows that
as H of the right-hand-side is H(f)~l and the right-hand side agrees with / at the identity. (Two /'s with the same image under H that agree at the identity must be the same.) Now notice
qi(H(f)J) = /(//(/), rH(f)(f)) = Thus ql preserves the graph of H and hence conjugates T^ to T" on this Polish space. It is an easy calculation to show that f = f making the D map / —> / a continuous involution conjugating a = aa to a^. Notice in particular that we now know
or equivalently that <M/) = aH(f)(g)(f)-
116
m-joinings
Lemma 6.3.5. For any / £ £% there is a unique solution z = z(f) to the equation f(z)z = id. It is z(f) = /(/(id)) = H(/)(/(id)) = //(/)"'(/(id)). This value now satisfies °z(f)(f) = * " ' (/)• More generally, ' (g))) =
'
Proo/ We know H(f)(g) = /(g)g/(id)~' is 1-1 and onto. Hence there is a unique z with H(f)(z) = /(id)" 1 . This is the z we want. Notice that = ff/(id)(/) so 0(/)(g) = /(g/(id)). Letting
we calculate that
ztfY1 = f'W)) = /(z(/')/(id)). Notice that z = /(id)" 1 satisfies z"1 = /(z/(id)) and as this has a unique solution, z(/') = /(id)" 1 . Now we see:
To calculate z more explicitly, notice that H(/)(/(id)) = /(/(id)) and /(//(/)(/(id))) = /(/(id))" 1 and so z(/) = /(/(id)). But then symmetrically z(/) = /(/(id)). Using the conjugation H(ag(f)) = Tg(H(f)) we conclude
=
Tg(H(f))-[(f(H(f)(g)))-
Notice that this gives the natural symmetry: ff(M(id))
= z(f)
and
/(id) = H(/)(z(/))
H(f)(f (id)) = z(f)
and
/(id) = tf(/)(z(/)).
and
D
6.3 Modeling rearrangements The action of f'
117
on the graph of H pulls back to an action a^ on 01
Defining a map q by §(/) = f,q conjugates a$ to cr = a". On 5? of course ft = <x<j> where <j>{f) — Of^)(f). Our work above tells us that <x(/, >"'(/)) = z(/) and more generally that
Notice that on the graph of H we actually have two conjugations of a" to <jt. One is the map / —* f, the global symmetry we have been following throughout our construction. The other is the explicit full-group element tj> pulled back from the graph of H to $k. This latter conjugation acts as the identity on orbits of a and hence will preserve all invariant measures. The former conjugation, as we have seen, interchanges the roles of a and p. As in the case P =
and a Z-name p(z). Let L'(z) = (/ z ,p(z)) € Yo £ St x Z G . As expected the diagram below commutes:
We also see that setting /? = a<£, /iz = H(fz)
(the bijection associated
118
m-joinings
with / 2 ) , the following diagram commutes:
(f\P(z)) Note that L is the map discussed preceding Lemma 6.2.3 and in this case associated to the pair of arrangements (a, /?) = (a,a^>). We now begin construction of a topological model for a sequence of rearrangements converging to a second arrangement
«0 ; -» p. Remember that for a measure-preserving G-action T* and sequence of rearrangements {$,•} with
we are only obtaining If^'1 —> h^ in L l . Given such a sequence one can always find a subequence i/;, with d(hafVi,if/)
—> 0 for /j-a.e. x. It is this
notion of pointwise convergence we will topologize. What we are going to see in this construction is that it is essential to introduce invariant measures to reacquire the natural symmetry of
(xfa -> /? and /?0~' -> a. Definition 6.3.6. Consider the space 3#n with the shift a acting simultaneously on all copies of Sft,. Recall d, the metric defined in Lemma 2.1.6 making e S a complete metric space, and for which q : /i —* /?"' acted isometrically. Let 0t. consist of those sequences {/,•} € 0in for which the sequence {//(/,)} is d-Cauchy, i.e. converges to some h £ (S. This is a a-invariant subset of ^?N . The map { o H : {/,•} —> h is a a-equivariant Borel map from tk to ( S. mapping the sequence {/,} to the limit point h = /(//(/,)). Notice that £ o H conjugates a acting on 3ft, to S" acting on CS. The set 3ft, is not so useful to us, most especially as the map {/,-} —> h is not continuous. Let Sft,\ £ Sft, consist of those sequences {/,} G M for which d(H(f,),H(fj))
< l/i+l/jfor
all i,j > I,
that is to say are uniformly Cauchy at this prescribed rate at indices beyond I.
Let k =
k
6.3 Modeling rearrangements
119
Notice that to say lim sup (H(/,),//(/,-)) < 1 / / for all i > I is equivalent to saying
for all j > i > I and this in turn is equivalent to d(//(/,V(//(/*))) £ l// for all i> I. Lemma 6.3.7. Each 5k\ £ <MN is closed but not necessarily a-invariant. The map { o H is continuous on each hi and as any {/,-} e 0). contains a subsequence in hi, the image under f, of each 8/l.\ and h are the same. (Note: this image is of course all oj'(S.) Proof As H is continuous, ®H : 3#n -> ^ N is continuous. In (Sn, for each value /, {{/?,} : d(hj,hj) < \/i+ \/j, j > i > /} is closed, hence its inverse image, hi, in ^ is closed. As to H on 0). is the uniform limit of the continuous maps //, : {/_,-} -+ //(/,•), it is continuous. The possible lack of (7-invariance for 3/t. is simply that S" need not be a c/-isometry on (S. D For a convergent sequence of elements ht -* h in <8, clearly h~x —> h~l. Thus /(/;-') = £{hi)-\ As q is a rf-isometry, the map qN : {/,} -^ {/,•} is a homeomorphism of each hi, which projects by HN to pN on S?N. It is sad, and also obvious, that <jN is not the conjugation of the convergent sequence ct(j> —> /} to ^^~' —> a, that is to say, /(;>' ^ /,. The reason is that /, describes how
P({fi},<({//})) = Jim a^({/,},
ffg({/,-}))
= Mm ff (/,-)(g) = /(//(/ j ))(g) = /i(g). Define {/,} by
Notice that /,- depends on both /, and the limit of the //(/,•), and is
120
m-joinings
the element of Sk describing how >"' rearranges the ^-arrangement of the orbit of {/,}. Lemma 6.3.8. For
«{/.-}, M M ) ) = W//))(g) and *,({/,-} we have
Hg) = fl^tf/,}), #"'(<#{/.•}))) = Mz«-1(g) where, as before, h = t?(H(fj)) is the limit of the H(fj). Proof
Notice
)))
(^(/))g
D
Corollary 6.3.9.
Froo/ This is a direct calculation from the definition of /,.
D
Corollary 6.3.10. On each k\, the map
is continuous. Proof Fix g S G and a value i. All we need to show is that /,(g) is a constant in some neighborhood of {fk}. Remember that
where, as always, h = £(H{fk)). We have seen that the map {fk} -» h is continuous on each 0),\. We already know that the map / —• / is
6.3 Modeling rearrangements A
121
A.
continuous. Hence the map / —• z(/) = //(/)(/(id)) is continuous, i.e. is constant in some neighborhood of each / . The map {fk} —• h~\g) is continuous and so is constant in some neighborhood of each {fk}. Hence {fk} —> <x"_,, ,(/,•) is continuous as also will be
Hence this choice of element in G is constant in some neighborhood of each {fk}- Continuity of the map h on each J?/ now tells us that {fk} —> 7i(g) is constant on some neighborhood of each {fk} which is to say, is continuous. • For actions of Z it is relatively easy to construct sequences {/,} with H(fj) convergent but with //(/,) failing to converge along any subsequence. In particular
H(Ji(g)) = h(z(at.Hg) and 1
(fi(H(f i)(g))).
Hence the stability of the sequence //(/,) in a finite region around the identity is controlled by the stability of h along the moving sequence of values
This is one reason of course that the z(/) were introduced, but it also indicates the path we will take in order to force the convergence of the P<j>Y' back to a. In doing this the fact that our point spaces are Polish will be lost, and we will instead move to spaces of invariant measures supported on these point spaces which will remain Polish. Define a sequence of sets: F, = {{fk} e », : diHia^ifdlHia^ifj))) < l/i+l/j for all y >
J
> /}.
Notice that as F; E 3&i, the map q is continuous on F/. As
122
m-joinings
Lemma 6.3.11. The set Fj is a closed subset of' 0t\. For {/^.} e F/, the sequence {/t} S 0?,, and one can consider the map t o H oq : {fk} ^
f(H{q{{fk}))).
On each F; this is a continuous map, and moreover on each F; nZ/(F/) it is a continuous involution with £{H{]h)) = t{//(//•))"'• Proof The map /, —> z(/,) is continuous and hence is a constant in some neighborhood of {//J. That F; is closed is now simply that it is the intersection of a collection of closed sets. For the second part, this is most easily calculated in the vocabulary of arrangements, i.e. by writing points on the orbit of {f^} = xo as x or y, setting a(x,y) = g where aAx) = y, <j>j(y) = afMxo.y)){y) and P(x,y) = lima#,(x), >,(}>))• Thus ti%*a) = H(ag{f,)) and fora(x 0 ,y) = go,
hf(g) = P(y,
We also know that
So saying ( /(//(a r(/j) (/ / )),H(cr,(/,.)(/ 7 )))
<
l/i+l/j
is to say
Letting j —> oo in this last expression implies:
As d is c/-invariant,
Fix a g G G and let y be such that P(xo,y) = g. Then <x(xo, i>) = which we will call g,. Thus ^ x o ) ( g i ) = ^^-'(A-O),^-'()')), an element of G we will call g2.
6.3 Modeling rearrangements
123
From our statement above, once i is large enough, depending of course on gi,
and so for / this large,
Thus H(Ji)(g) = li>*(g) = S(H(fk))-l(g),
once / is large enough. As
{/*} and {/,} = q({fk}) e k,,
are continuous, finishing the result.
D
The definition of the F\, from a topological dynamical perspective, is perhaps un-natural, but as we will quickly see, very easily understood in the context of shift-invariant measures. Consider the tx-invariant and ergodic measures Jt,,(&). This is nonempty as any constructed sequence of rearrangements (a, {0,}) with <x(/>, converging in L1 rapidly enough, in some ergodic measure-preserving G-action will project to 5ft. via the map z —> {L'(a, <£,)(z)} and this map will carry the invariant measure for the action to Jt.e(0)). Remember k, = {{/,} : d{H(f,),H(fj))
\/j
for all / > / > j},
a closed subset of 0),. For any standard probability space (X,!F,\i) x —» h\(x),h2(x) e (S we can make the calculation
and Borel maps
Notice that this is actually a calculation on CS2 as it can be written
where
On eS2, the map (/i|,/i2) -> d(h\,h2) is continuous, so the map H -> f d(huh2)dn, is continuous.
taking J/(>$2) -^ [0,1]
124
m-joinings
Definition 6.3.12. For {fj} s ik define functions RiJ =
maxd(H(fi),H(fj)), J>j>i
an increasing sequence in J, and its pointwise limit
Our above observations tell us the Rjj are continuous, and as d is at most one, Ri is a bounded Borel function. Notice that if (X, &,\i, T a ) is a free and ergodic G-action, and <x(j>j converges to /? in L1 then for any sequence of values r\i > 0 we can select a subsequence ipi of 0, with
/
•
Just as in the previous discussion, this is actually a calculation in Jle{
where
Definition 6.3.13. Let Jt. = {fi e Jt.e(k)
: f Ridn < (2 ;+2 0~' for all /}.
Note the strict inequality here. It will play an essential role, and create the need for some subtlety in the following argument.
Certainly for n € J/., H{{{fk} : d{H(f,),H(fj)) < i/i for all j > i}) > \ -
2~^
and hence
Thus, as tj>{ is ^/-preserving,
This is where the role of invariant measures comes into play. If \i{iki) is close to one then ju(F/) is also automatically close to one. Hence for
6.3 Modeling rearrangements
125
/.i-a.e. {fk}, {fk} G F; for all sufficiently large /, and so for /* G JL, for all i,
Jd(H{fi),S{H(fk)))dn < jRidn < ( 2 ' + V Thus by restricting to measures in JL our measures will all be supported on the limsup of the F/'s. This is an ¥„ and hence not a Polish space, but as our interest has shifted to spaces of measures this will not matter. We call this set F, i.e. u, n,>, Fi = P. Lemma 6.3.14. JL is a Gs subset Proof
ofJle(&).
For each i < J, {/< : / Rij d\x < (2' +2 i)~'}
is a closed set of measures and as R\j / j R, pointwise, Jf = {/z : I Rtd^i < {2i+2i)-{ for all i) is a closed set of measures containing JL. Restrict the topology and the discussion to this subset. For n G JL and i fixed, there is a value e,(^) > 0 with Rj d\x < (2 l + 2 i)~' — 6,(/<).
Choose J = J{i,y) so large that for all /i' G J/', f Rj dfi' < (2 7 + 2 J)-' < e(/i)/2. As Rij(i,ft) is continuous, for some neighborhood (9(i, /<) of /<, for /.i' G @{i,n), we will still have
Hence (using the fact that \i G Jf)
f
' f
J
~J
'"/('''<)
•"
Let &i = L>i,ej/,&(i,i.L), an open set. For any /.i' G (9, we have
jRidfi<(2i+2i)-1
126
m-joinings
(a strict inequality) and so Jl. ^Ufte,ff.0i s i . .
• For any \i € Jl. we know that /J(F;) > 1 — 2'. On each F; the map q is continuous. These two facts imply that q* acts continously on the space of measures Jl. q' : Jl. -> Definition 6.3.15. Set
Jl.. = {/( € Jl. : q'(n) G „//.}. Since for /< e V/.. we have ^<(F; n^(F/)) > 2~ / + 1 , and on each F; C\q(Fi), q acts as a continuous involution, q' acts as a continous involution on Jl... Lemma 6.3.16. Jl.. measures. Proof
is a Gf, subset of Jl. and hence is a Polish space of
As q* acts continuously on Jl., Jl' = {ft € Jl. : f Ridq'(n) < (2 ;+2 /)~1 for all /}
is a closed subset of Jl. containing Jl... We restrict the discussion and the topology to this closed subset. For y! & Jl',
for all /. For n e Jl.* and / fixed, there is a value
J
E,(/<)
> 0 with
Ridq\n)<{2i+2iY'-zi{ii).
Choose / = I(i,n) sufficiently large so that for all fi' £ Jl',
and further,
6.3 Modeling rearrangements
127
Choose a neighborhood @(i,fi) so that for \.( e G{i,y) we still have /
RUM
dq'in') = f
(Ru{i4l) o q) dii < (2 i + 2 /)-' -
8i(/t).
As all the R,y are bounded by 1 we conclude
Setting ©,- = U/ie.y/. &{i,j.i), an open set, for /<' £ (9,-,
a strict inequality. Hence V/..
£ Dj&i S
V/...
D To help set the current situation in mind we state the following corollary. Corollary 6.3.17. The map q is of order two on each {f^} € F, (a set given full measure by any measure in Jl,,») and ~q' is a continuous involution on Jl... Moreover the diagrams below commute: p toH
•• F
"ft(oH
(
S
<S
•• "fl»*
(ol
Jle(<3)
Jl,. is not empty since for any G-action {Z,^C,n, T a ) and sequence of full-group elements (/>,-, with
in Lx there is a subsequence \pt = >,(,) so that defining an image of the subsequence in ^?N by
then L;vy(/i) e JL..
128
m-joinings
Proof All but the last statement have been proven earlier. For the last part of the corollary, just notice that if
is sufficiently small, then
for a set of points x of ^-measure strictly greater than 1 — (2' +4j)~ 1, and similarly for those x with
It is now a relatively simple calculation to show that L*am{y) G Jl...
•
As we did before with (S and 01., we can again extend 01 to 0t x Z G and consider the a-invariant, ergodic and aperiodic measures which project to an element of Jl... We call these measures J/...(Z). The action of £ o H and q extend to this expanded space, as before, by rearranging the Z G name precisely as the GG-name is rearranged. Any measure /.i £ Jl..(Z) will project, by the trivial extension of (oH to Z-names, to Jle{fB x Z G ). Furthermore the continuous involution q will extend naturally to the Z-names and the following diagram commutes.
*-x.ii'j
- F
Corollary 6.3.18. The space J/...(Z) is a Polish space, q acts on it as a continuous involution and £ o H* projects it continuously into J/e(Z). Proof
6.4 Adding sizes to the picture
129
6.4 Adding sizes to the picture We now bring sizes into the picture developed in the previous Section. On the space ZG x ^ we have two canonical arrangements a and /} and a canonical sequence of elements 0, with
In order for a size m to provide an evaluation on a rearrangement {fttf)h(j)Jx<j)j) we must specify a measure making the action of a* free and ergodic. On the other hand, if we are given such a measure we can immediately make the evaluation of the size m of the rearrangement. We include this dependence of the evaluation on /.i by writing
Definition 6.4.1. Let m be a G-size and Z a Polish space. Define
J/'S(Z) = {n e J/,.(Z) : lirnsupm^a^r 1 ^,.) < l/ifor all /}. ./-•CO
Let Jtm(Z) = {n € Jl..{Z)
: both n and q(fi) are in J/'£(Z)}.
Notice that for any G-action (Z,^,fi, T a ) and any sequence of rearrangements a
is a Gs subset of J/..(Z)
and hence is a Polish
Proof For convenience let D be a metric on J(,»{Z) making it a complete metric space. For each /.i s J?Q(Z) and / there is an e,(/i) > 0 with limsupm^aoc/);,^" 1 ^) < 1/;-£,(/<)• Set n,(n) = [8/e,00] + 1. For ft e J/'$(Z), for all i, j and k,
130
m-joinings
and taking a lim sup in k,
This implies that in fact lim sup m;i(a>,,>~'^) must be a limit. More importantly to us, for all j > n,-(^) we must have
(Otherwise the lim sup would be too large.) Choose radii r ; (^) < 2~' so that if D([i,n') < r,(/<) then by Axiom 3, for all i = 1,2,...,/ and each j = n,(/i)
Define open sets
0,= (J Obviously
To see the other containment, suppose \i s For each / then there must exist /<; e ^//^(Z) with
We can then find a subsequence /(/) > / so that B
r,(i+1)(/i,,i+1))(^/(i+l)) S ^ ( ^ ( A ' / O ) ) -
From this, for all k > i, and y > «,(/(/(/)), setting / = «,(/(;(,)),
As /i,(A) e Br,m(/i,(i))(/'/(i)) and also is in V/g'(Z), this value is < 1// - 3e,-(W(0)/4 + \/j + 1 / / < \/i - 36/ (W(o)/4 + e,(/(;( = l/i - e/(/u/(i))/2. Fixing i and j in this, letting k —> oo and again applying Axiom 3, we conclude
Examining the lim sup as j —> oo we are forced to conclude that
6.4 Adding sizes to the picture
131
Definition 6.4.3. We say /.i G J/., belongs to Jlm if for some Polish space Z and jiti G Jl,"\Z), the projection of \i\ onto its §t, coordinates is /<. Corollary 6.4.4. A measure /<, G J/e(ZG x &) is in J/"'(Z) if and only if its projection on Sft, is in Jlm. Moreover, Jlm is a G$ subset of J/., and hence is a Polish space. Proof The first part of the corollary is simply the observation that Axiom 3 tells us that m,,,^,-, 0~'0/) depends only on the projection /.i of /(i on 0).. The proof of Theorem 6.4.2 defined its open sets G\ using only Axiom 3 and these values, hence it applies equally well to J/."' as a subset of Ji... • Lemma 6.4.5. For /< G Jt™(Z) we know a<j>i —> P relative to the measure —* h*'P on the support of n and {$,•} is mtuI-Cauchy. Such a n is [i as hf in J/.'"{Z) if and only if {ij>~{} is m^-Cauchy, which is true if and only if is mT!.{^-Cauchy. Proof Only one piece of this is less than direct and it requires a little calculation. In particular, we must show that if /.i G Jl™{Z) and {$,~'} is m;,^-Cauchy, then q*(fi) is in J/Q(Z), that is to say {»"'} is m^-Cauchy at a fast enough rate. To see this, let (>,) = j? in the m/vx-completion of the full-group. That /i G J/.'^{Z) says simply that for all i,
If {$"'} is also known to be m^-Cauchy then by Theorem 2.2.7, setting ((/>"') = a in the mlt^-completion of the full-group, we will have
Hence for all i, lim sup m / ( (a^p <j>~x<j>j) = lim sup m ^ ^ a ^ , , (j)~l(f)j)
forcing q'(n) into J/%(Z).
D
Definition 6.4.6. To /j, G ^//g(Z) we can assign a value we will call m(f.i) defined by wi(/() = /??/i a (id, /?) = lim /?J,I(«, >,). '
I—>CO
132
m-joinings
If in fact \x G J/m{Z)
then we will also have m{fi) = m(
Corollary 6.4.7. The function m acting on V/g1 (Z) is upper semi-continuous. Proof Axiom 3 tells us that for each
'>m
k
p ( w ( a , 0 f ) + 2/;')) < lim co
k
_
i-.co
m
a 6.5 More orbit joinings and m-joinings Earlier we defined an orbit joining of (Zu&unuT?)
and ( Z 2 , ^ 2 , ^ 2 , T f )
as an S" x a x a invariant and ergodic probability measure /i on $ x Zf x Z2G whose projection on Zf was |(i mapped by p\ to the space of Zpnames, and for which q'(j-i) was ;<2 mapped by p\ to the space of Z2 -names. We can now refine this idea to include a fixed sequence of rearrangements leading from the one action to the other in the joining. Definition 6.5.1. Suppose (Zu&uHuT?) and {Z1,&1,\i1, T f ) are two free ergodic G-actions on Polish spaces. A C-orbit joining is a measure p. e J(.»{Z\ x Z 2 ) with n*i(P) = P*\(H\)
and
n'2(q(p.)) = p^im)-
We write this set of measures as Jc(T^, T^1). This is a closed subset of J/.. as L* acts continuously on J/,.. It is non-empty as it contains all ergodic joinings of the two systems, extended by a point mass supported on the sequence {id} in Sk. The C in this definition can be taken to stand for Cauchy, or constructive, or convergent, or completion as you will, as what is modeled here is not only a pair of orderings a and /J with projections to the two original actions, but an explicit sequence of full-group elements 0, with a^>,- —> j$.
6.5 More orbit joinings and m-joinings
133
We now want to see that JQ{T*[ ,T%2) as a topological space, is a conjugacy invariant of the pair of actions, that is to say, does not depend on the topology on Z\ and Z2 but merely on the measure-algebra structure on them. This takes us on a small digression. For (Z,^,y) a regular probability space, let D(Z) be the set of all real-valued Borel functions with ||g||oa < 1- Topologize D(Z) with the L'(/i)-metric making D(Z) a complete separable metric space. (As usual one identifies two functions here if they are a.s. equal.) Let {Z\,fF\,fi{) and (Z2,S?'2,H2) be two standard probability spaces and (£(n\,H2) consists of all Borel measures on Z\ x Zi which project to \i\ and HJ respectively on the two coordinates, topologized with the weak*-topology. We refer to measures in #(^1,^2) as couplings of j.i\ and 1.12.
On D{Z\) x D{Zi) place the metric
For p, e (€{n\,n2) define Fp : D{Z\) x D(Z2) -» R by F[t(gi,g2) = / gi ®gidti. On {Fp} place the Tychanoff topology of pointwise convergence. Lemma 6.5.2. (i) The functions {Fft} are a uniformly equi-continuous family and are uniformly bounded, hence precompact. The Tychanoff topology is metrizable as D(Z\) x D(Zi) is separable. (ii) The map p. —> Fp is a homeomorphism from ^ ( 1 , ^ 2 ) to {Fp} and hence the set {Fp} is compact. Proof
That the Fp are uniformly equi-continuous is simply that
I^(gi.g2)-^(gi,g 2 )l = \Jgi ®g2d{i- Jg[ ®g'2dp\ < \J{&\ - gi) ® g2 dP\ + \Jg[ ® (g2 - g'2)dU\
134
m-joinings
Both directions of this are easy. If Fp{ —> Ff, then for all continuous functions g\ ® g2, r
*
f
gi ®g2fl/i; -> / g\ ®gidii. But then j gdp\ —> J g dp for any g in the closure in the uniform topology of the linear span of all such continuous gi ®g2. This uniform closure is precisely all continuous functions on Z\ x Z 2 and so £,• —• p weak*. Now suppose pj —> p weak*. Then / gi ® g2 dpi —> J g, ® g2 dp for all pairs of continuous functions (gi,g2) € D(Z\) x T){Zi). The continuous functions are dense in each D(Z,), and so uniform equi-continuity of the • family {Ff,} tells us that Ff,t —> F;-,. Corollary 6.5.3. Suppose (Z,,J^,,^,) and ( Z / , ^ - , ^ ) , i = 1,2, are regular spaces, and O,- are two measure-preserving invertible Borel maps defined f.i,a.e. to fi'j-a.a. ofZ[. Then the map p —> (Ot x (£>2)"(P) is a homeomorphism /rom «?(/*,,/i2) to Proof
The map (gi,g2) —* (gi o $ i , g 2 ° ^2) is a n isometry taking
Hence it takes {Fp : p' e ^(fi\n'2)} to some uniformly equi-continuous family of functions on D(Z\) x D(Z2). We want to show that this family is exactly {F# : p € ^(fn,^)}Where they are defined these maps are bounded and linear. Hence on pairs of continuous functions (gi,g2), each image function F/Vo(d)| xO 2 ) is the restriction of a measure. As the family of functions is uniformly equi-continuous, and the continuous functions are dense in D(Z\) x D(Zi), each Fp> o ( Fp o (Oj x O2) takes {Fp> : p' € ^(fi\,n'2)} to {F-fl : p e #(^i,^ 2 )}- Invertibility of the •S.'s makes this map invertible. It is very easy to see that this map is continuous in the TychanofT topology. Lemma 6.5.2 now completes the result. • This corollary already tells us that the space of joinings of two actions, as a topological space, is independent of the choice of regular Borel model we choose for the two actions, as the space of joinings is a closed subset of the space of couplings of the two measures. To show this same fact for orbit-joinings, C-orbit joinings, and later m-joinings (m a size), we must discuss the extra baggage these spaces carry in the form of the Polish spaces ^ and
6.5 More orbit joinings and m-joinings
135
For regular spaces (Z,,^,,^,), i = 1,2, and Z3 a Polish topological space, consider D{ZX) x D(Z2) x d ( Z 3 ) where Ci(Z3) is the space of all continuous real-valued functions of supnorm at most 1 (the unit ball in the continuous functions). On this space put the metric Il(gl,g2,g3),(g{,g2>g3)ll = llgl-g'llll + Ilg2-gilll + Ilg3-g3llooThis makes D(Z\) x D(Z2) x C\(Zi) a complete metric space. Let %!(n\,ii2,Zi) consist of all Borel measures on Z\ x Z2 x Z 3 whose projections on Z\ and Z 2 are ^1 and \.ii respectively. For p € ^(1.11,1.12, Zj) let
be given by
Topologize the space of all such F;-,'s with the Tychanoff topology of pointwise convergence. Lemma 6.5.4. (i) The family of functions {Ff, : /i G (€{^i\,H2,Zi)\ is uniformly equicontinuous and metrizable hence precompact. (ii) The map /i —» F;-, /s a homeomorphism between (€{\i\,^i,Z3) a;jrf {F;-,}. Hence {F,-,} ;';? t/ie Tychanoff topology is a Polish space. Proof That the family {F;,} is uniformly equi-continuous is virtually identical to the same argument in Lemma 6.5.2, as is showing that /i, —> /t iff F/i, ^^ F/, pointwise. The lemma is finished by observing that ( €{n\,\.i-i,Z{) is a closed subset of the Polish space of all Borel probability measures on Z\ x Z2 x Z3 and hence is a Polish space. D Corollary 6.5.5. Suppose {Z\,^\,n\,T^) and (Zi^i^^T^1) are two free measure-preserving and ergodic G-actions on regular probability spaces. Remember the space of orbit-joinings of T,°" and Tp is the space of fa invariant measures p. on IS x Zf x Z2G with n\(p) = p\(j.i\) and n'2(q*(p)) = This Polish space of measures in the weak*-topology is an isomorphism
136
m-joinings
invariant of the two original actions. More precisely, any Borel conjugacy between G-actions gives a canonical homeomorphism between the spaces of orbit-joinings. Proof
Consider the map
This is a homeomorphism of <§ x Z G x Z2G, and carries any orbit joining p to a measure in <&(ii\,n2,'&). As t~l(p\(zl),p2(z2),h)
=
{p\(zi),rf-,(p2(z2)),h),
(t~1)' is a homeomorphism of <€{\i\,\ii,'S') to its range. Those p. e ( &(H\,H2,'&) with {t~l)'(p) a r a -invariant measure form a closed set homeomorphic (by (t"1)*) to the space of orbit-joinings. If O, and <J>2 are conjugacies of the original two actions to some other versions (Z/, 3F\,[(i, S*'), then the map (Pi(zi),P2(z2),h)
->(Pi(®i(zi)),p2(
will give a t-equivariant map between their spaces of couplings, and hence (as in Corollary 6.5.3) a homeomorphism between their spaces of orbit-joinings. • Corollary 6.5.6. The space of C-orbit-joinings of two ergodic G-actions, as a topological space in the weak*-topology, is a conjugacy invariant, in that any pair of conjugacies of the joined actions gives a canonical homeomorphism between their spaces of C-orbit-joinings. Proof This is only slightly different from Corollary 6.5.5. Let i and d>2 be a pair of conjugacies of the two joined actions to "primed" versions. Define a homeomorphism of
=
{p\{z\),rnH(fl))(p2{z2){fi})).
For p a C-orbit joining, t*(p) e <$(n\,fi2,i%). For the conjugacies
p2(O2) x id : Zp x Z 2 c x « - » (Z[)G x (Z2)G x
which will lift to a homeomorphism from #(//[, |i2,i%) to ($(ii\, n'2, ffi) that is £*-equivariant. Hence (pi(
6.5 More orbit joinings and m-joinings
137
Definition 6.5.7. For ergodic and free G-actions (Z\,^i,n\,T"') and 2 {Zi, 2F i, \i2, Tj ), an m-joining of the two actions is a measure p. G Jtm(Z\ x Zj) whose projections satisfy n\(p) = p\(n\) and We write this space of measures as Jm(T^', T^2), suppressing the other components of the dynamical systems as long as they are obvious. Notice again that the space of m-joinings is not empty as it contains all ergodic joinings, extended via a point mass on {id} to a measure in J(»». As m(a,id) = 0, this measure is in Jim(Z\ x Zi). The following result follows directly from the definitions of the spaces involved. Theorem 6.5.8. The set Jm(T*', Tj2 ) is a Polish space as it is precisely the intersection of Jim{Z\ x Zi) and the space of C-orbit-joinings of the two systems, both of which are Polish subspaces of Jl*,{Z\ x Zi). As the topological space of C-orbit-joinings is a conjugacy invariant of the two actions joined, so too is the topological space of m-joinings. Proof Everything except perhaps the last remark is direct. To clarify the last remark, in Lemmas 6.5.5 and 6.5.6 we saw that conjugacies of actions would lift to a canonical homeomorphism of their orbit-joinings and C-orbit-joinings. As this homeomorphism acts as the identity on the ^-coordinate, it will preserve the subspace of m-joinings. • Theorem 6.5.8 has been the goal of our entire journey from the beginning of Chapter 6. Knowing that the space of m-joinings is a Polish space now allows us to apply a category approach to the equivalence theorem. This is what we now proceed to do. Notice that if (Z,J*, p., T a ) is a free and ergodic G-action with fullgroup elements {$,-} with
then of course there will be a /? with a
have a pair of measure preserving maps n\ : Z —> Z\ and 712 : Z —> Z2 satisfying n\T* = T^%i and n^T® = T^^-i- Then this G-action will give rise to an m-joining via the map Q : Z —»{Z\ x Zi)G x M where <2(Z) =
138
m-joinings
We refer to such a G-action, sequence of full-group elements and projections m and TT2 as an m-joining in the loose sense in that although it is not itself an element of Jm(Tj*', J^1) it gives rise to one directly via the image Q*(/(). The topology on Jm(T*\ T^1) puts a pseudotopology on these m-joinings in the loose sense. Extending this one step further, if Ta and T11 are two orbit-equivalent G-actions on (Z,^,p,) and the maps n\ : Z —> Z\ and 712 : Z —> Zi conjugate the action of T a to Tj*1 and T& to T^2 respectively, then we refer to (Z, SF, p.), the pair of arrangements a and ji and the factor maps %\ and %i as an m-joining of Tj*1 and Tp in the very loose sense. In this situation we can always interpolate full-group elements taking a to fi rapidly enough to give an m-joining in the loose sense and hence an m-joining. This last notion of an m-joining in the very loose sense is the most natural one for what one would heuristically call an m-joining. The additional structure we have given is simply to place it in a Polish topological setting.
7 The Equivalence Theorem
7.1 Perturbing an m-equivalence
In proving the equivalence theorem, starting from one free and ergodic G-action, with two m-equivalent arrangements, we will need to construct a second one which will be, in a sense we now make precise, a perturbation of the first. The aim of this will be to show that a certain open set of m-joinings is in fact dense under appropriate hypotheses. This density will be shown by demonstrating how to "perturb" any given m-joining into the given open set. In this section we will develop the technical facts we will need for such perturbations. Recall that when we omit the subscript x on an expression / ^ or h*Js we are regarding them as maps from X to M or <S. Suppose (X,^,i.i, T a ) is a free and ergodic G-action and {(/>,•} is a sequence of elements in the full-group of Ta for which (/"•*' ® f^1
® • • • )'fi G J/m.
The following discussion will be relative to this fixed action and sequence of full-group elements. Under these conditions we know that a0, —• /? where a ~ /?. We describe now what we mean by an / 0 , J , (5-perturbation of this G-action. It is perhaps more correct to call it an m-perturbation, but as we will assume m is a fixed size from here on, we omit this. Throughout the rest of this chapter it will be convenient to have a metric for the weak*-topologies on various spaces of measures. Let D represent such a metric in all cases. Definition 7.1.1. Fix values /o and J and consider the measure jx = (f^
® fa^
® • • • ® f'K 139
®/°*)V e
J/e{i%'0+]).
140
The Equivalence Theorem
Suppose we have a second free and ergodic G-action (X\,^\,n\,T*1) with /o+l elements in its full-group $',,$',,..., (/)[ +1 , and a further sequence of full-group elements {rpt} satisfying: (i) D(Ji, {fau^ (ii) (/*#<>+'•»»
® • • • ® /ail*'o+' Yni)<8; f V ^
and
(iii) m((/a"/>''o ^Fe re/er to SMC/I an action and collection of full-group elements as an Io, J, ^-perturbation of the original G-action and sequence in the full-group. Notice that in such an IQ, J,<5-perturbation the sequence of rearrangements ai(j)io+\xpi will converge in i to an arrangement we will call fi\, and u\ and Pi will be m-equivalent. What interests us is the precise sequence of full-group elements
taking a\ to Pi. We will write this sequence as {4>"). Notice that if we drop to a subsequence of the \pt in an Io,J,3perturbation, we will still have an Io,J,S -perturbation. What we will show in the rest of this section is that given a G-action (X,2F,\i, Ta) and sequence of full-group elements {0,} with
then for any IQ, if J is large enough and S small enough, any Io,J,dperturbation will still give
To begin to see why, notice the following calculation: For all i
J > ;o +
b'i, >;0+i) + <5,
f° r a ll J > h + 1 sufficiently large.
7.1 Perturbing an m-equivalence
141
Lemma 7.1.2. For any free and ergodic G-action (X,tF,n,Ta) group elements
and full-
and value h, there exists 5 > 0 such that for all J sufficiently large, any IQ,J,5-perturbation (dropping to a subsequence of the {ip,} if necessary) will satisfy:
and \<j>"jl)
lim sup m/;, (>",• ; c o
Proo/ As ( / ^ ' ® • • • )'n G ^//m we have, for all i, limsupm a (0,,^) < l/i
and
lim sup m(s(0j~1, )]"') < 1//. y-.oo
Hence, for all i, both of these lim sup's are in fact limits converging to ma((j)i,((j)j)a) and m^(<£~', (0J1)/?) both of which must be < 1//. Hence there is an ei > 0 so that for all i < IQ we have
Thus for all 7 sufficiently large, and i = 1,..., IQ, mMhfpj) < 1/i-eiBy Axiom 3, there exists S > 0 such that if
£>Gu,(f"^' ®---®.f' Vi)V 1 )<<s then all of the finite list of inequalities '"ot, (<#,>/„+! ) < l / ' - S l ,
( = l,...,/0 + l
will still hold. As long as 8 < s\/2, the calculation made preceding this lemma implies m^{(f)\,4>\0+\Wj)<m^,
+ mai^+i(id,xpj)<
and for / = l , . . . , / 0 lim sup mai ()",-, <j>"j) <
\/i.
1/i — e,/2
142
The Equivalence Theorem
For i = I0 + 1, limsupm ai (#" o+1 ,0j) = ma^'> (id,y>;-/o-i) < 5. y^co
°
Making sure that 8 < l/(/o + 1) we obtain this term. For i > IQ + 1, as lim sup mai (<#', fij) = lim sup m^; j—>oo
y—>oo
+|
{xpt-h-i, Wj)
°
by dropping to a subsequence of the \pi we can ensure that this value is < 1//. As we assume that the sequence (/>'( achieves an m-equivalence between oti and j8i, the other set of strict inequalities now follows automatically from Lemma 6.4.5. • We now want to obtain the same fact for pointwise convergence, that a small enough perturbation will not leave Ji*.. The first step is to notice that from Axiom 2 we get the following: Lemma 7.1.3. For all 5 > 0 there exists <S, > 0 and if (Xx,^x,n\, Tj*1) is a free and ergodic G-action, with §\ + 1 and {xp,} in its full-group with e £m(<xi)
then for a subsequence of the y)j we have both u and
Proof Axiom 2 tells us that if 5X is sufficiently small then for all i sufficiently large, '/j a| 0/o+i' a'*''o+i V)| ,id)(fyfi
<
We can drop to a subsequence of the ip, with
5/4.
7.1 Perturbing an m-equivalence
143
for /<|-a.e. x. Hence by the Lebesgue dominated convergence theorem,
Thus by omitting sufficiently many initial terms ip, we can obtain
for a subset of X of measure > 1 — 5/4. As d < 1, restricting to this tail of the y;, sequence we obtain [sup(d{ha"t>'>o+»*"t''>o+
As the sequence {(/;,} realizes the m-equivalence between <x\(f)'Io+l and Pi, we also have
and the second inequality of the lemma follows symmetrically.
•
Lemma 7.1.4. Suppose (X, HF,\i, T01) is a free and ergodic G-action, such that {(/»,•} is in its full-group with
Given any £ > 0 and Io, there exists S > 0 and J > Io so that if (Xi,!Fi,\x\, Tj*1), 0',,...,0' / o + 1 anrf {i/;,}/o/-m an I0,J,S-perturbation of the first system, then we can conclude: (a) for all i
J
RiJo+i{h«^,...,ha>'a^)dm
(b) /sup,-d(/i"l>ai*''o+'«" >/iai-" 1^o
(c) for all i < Io + 1,
< JKj (d)
g;
144
The Equivalence Theorem
Proof To begin, for any 5[ > 0 and IQ, there is a d > 0 such that for all J sufficiently large, perhaps dropping to a subsequence of the i/>,, we will have: (i) (/«••
(ii) / s u p ^ r t - H ' ^ o + ' ^ i d ) ) ^ , <«5,;and (iii) To obtain (i) just notice that h^1 -> h^ and the map ff taking 01. to 2/ is continuous. The second two statements follow directly from Lemma 7.1.3. The proofs of (a)-(d) will follow, except for one small step, from (i)-(iii) with b\ small enough and J large enough. One can calculate that for any two G-arrangements a and ft and full-group element
Represent a point in Sth x $ as (f[,f'2,---,f',0J'),
and define
and notice that the maps
are all continuous. It follows that
Partition ^?'° x ^ into a countable collection of clopen sets according to the values
(/;)(gA^(g;) H(F{f\,nr\gj),H(f\)-l{gj)^{gj),
and
j = 1,..., [ln(8/e)] + 1.
Label the partition elements C\, C2, Notice that for any two points (/;,...,f' I a ,O and (f'{,...,f'/0,t") which belong to the same Ck we will have
d(H(F(f[, £')), H(F(f;, £"))) < e/8 and d{H{f\),H{f'/))<e/i, i = 1 /o-
7.1 Perturbing an m-equivalence Choose a finite list
C|,...,CK
145
with
/t(u£L,(/•"•*' ® •' • ® fx'K ® ^ r ' ( C f c ) ) > 1 - e/8. As the Q are clopen, we can choose 6] so small that (i) implies (/«'•# ® • • • o / 0 " ' ^ ® fcai'ai0'o+')V1(u£=1q[) > 1 - e/4. Select an index iV > [ln(8/e)] + 1 so large that for all gy, = 1,..., [ln(8/e)] + 1 and (/'„...,f', '„...,f',aaJ') € UkQ all of the values ,)(gj), //(/,')"' (gj),
nr'igMigj), and r\gj) are indexed as some g,,, n < N. (Remember, all of these values are constant on each set Cu) Set e = \/N, be sure that ee 3
and (ii) now implies / d(hai
Define a set A by A={x\
: d(hx\ ' 0 + l '' '0+l , i d ) < e
and
d(h!,,'xaV' ' '0+1 ',id)
< e for 7 = l , . . . , / 0 and all i, hence d(hXl '0+1' ,id) < e and d{h^,^'
', id) < e for 7 = 1,..., / 0 ,
and we further require d(h'l(>+." _ id) < e and d{h^'^Vl, id) < e for all /}. As there are / 0 + 3 inequalities to satisfy to lie in A, each of which holds on a set of measure at least 5\/e (remember that f.i\ is preserved by all the >|) we conclude that H(A)> l - e / 8 . Let
and H\{A') > 1 — 3e/8.
146
The Equivalence Theorem
The four inequalities come in two pairs (a),(b) and (c),(d). The latter two are more difficult so we focus on their proofs. We begin with (c). For [ln(8/e)] + 1 we will have all x\ &A', i = l,...,/o and j = 1
—».vi
n.vi
I.vi
VgyJ
as both the pre- and post- functions in this composition act on gj and its image as the identity. Taking the inverses of all the bijections in this calculation, we find the same identity holds there. That is to say:
Examining the (/0 + l)'th term and j= I,..., [ln(8/e)] + I,
and again the same identity holds at these g7- for the inverse maps. Hence for xt e A' and all i = 1,..., / 0 + 1,
Calculating (c),
J RUa+ [
^
A'
O
l
+ j.
The map
fl, H(f'Io+i))),...,
H(F(f'h+l,
H(f'h+l))))
is continuous from ^ ' 0 + 1 —> [0,1] and so if d is small enough we will
7.1 Perturbing an m-equivalence
147
obtain from (i) of Definition 7.1.1 of an / 0 , J,<5-perturbation that this calculation is
As hP/^ -> id for all x, if J now is sufficiently large, this is < f RUh{h'm",..
•, / W + ' , h»*) d/x + s
which is (c). To obtain (a) we can omit the first part of the argument for (c) and just notice that for i = 1,...,/Q + 1, the maps
taking ^?' 0+1 —> [0,1] are continuous and so from (i) of the definition of an /o, ,<5-perturbation, if § is small enough,
ijo+i{H;...,h\H)
+
and now if J is chosen sufficiently large, this will be
which is (a). To demonstrate (d) notice that for j = 1,..., [ln(8/e)] + 1, and x\ e A' that
again as the pre- and post- composing functions act as the identity on gy and its image. Also as before the same identity holds for the inverse bijections at gj. We also have n
-vi
vgj) - «v,
For all A'I w e k n o w t h a t /7X, '
VTO' +I(-VI)
V|
Vl
^ g7 '^
' 0 + '(gj) = g ; once / is large e n o u g h .
148
The Equivalence Theorem
Drop to a sufficiently distant tail of the sequence tp, so that on a set A" with /i\(A") > 1 — e/8, for x\ G A" we have *'0+l(gj) = gj for
C
Mj=l,...,N.
It follows that for xi e A' n A" we have ^x,
° (gy) = 'i.v,
(gj),
and the same identity holds on the inverses of these bijections at gy. (Note: it is this last remark that required us to use N in the definition of A" and not [ln(8/e)] + 1.) This implies for all x, e A'nA", for all i and for 1 < j < [ln(8/e)] + 1 that K
(gj) = hxt
(gj)
and the same identity holds for the inverses of these bijections at gy. Hence
f e
e
For (b) we follow similar but easier lines. Notice that for j 1,..., [ln(8/s)] + 1 and xi e A' that
and the same identity holds at gy for the inverse bijections. Now integrate just as for (d) to obtain (b). D Theorem 7.1.5. For (X, 3F, /<, Ta) a free and ergodic G-action and fullgroup elements (j), with
and /o, there is a 5 > 0 such that for all J sufficiently large, any perturbation
IQ,J,S-
7.1 Perturbing an m-equivalence
149
(by dropping to a subsequence of the xp-,) will satisfy
as well. Proof Having already proven in Lemma 7.1.2 that if this measure is in Jl,. it will be in J(m all we need to show is (a) //? i (/i ai ' ai ^,/i ai ' ai * > '-+i,...)^i < (2' +2 /)~' and As (/"•*' ® /"•*' ® • • • )'m G •///"' we already know that there must exist an g| > 0 so that for all i = 1,..., IQ,
To demonstrate (a) for i = 1,..., / 0 , notice
<
JRija+l(h'^',...
+ i
Applying Lemma 7.1.4 with "e" = ej/3, make sure (5 is at most the "<5" obtained there, and for all J sufficiently large we obtain from (a) and (b) of Lemma 7.1.4 that this calculation is
<
JRUo+]{h"**>,...,fc^'o,
(as < ( 2 ; + l i T ' — BI/3 < (2f+2i)~1. For / > /o + 2,
I RW"*",...)dn\
= I Ri(till"+'>*"'l"-'°-\...)dii\
2e,/3
150
The Equivalence Theorem
and as ai<^>;0+i !/;,_/„_, —• fju by dropping to a subsequence of the y;,- we can ensure this is
f su J
i
This completes (a). Obtaining (b) follows parallel lines using (c) and (d) of Lemma 7.1.4. For / = l , . . . , / 0 + 1,
+ f sup d(hPlM'w,
//'^""rVT
from (c) and (d) of Lemma 7.1.4 and that /?0~' —• a. But this is
For i > /o + 2 we calculate
f Ri(hMl^T\...)dm
=I
and since /JiyJ|l/0_i#'^+i -* «i, by dropping to a subsequence of the just as for the previous case, we can ensure this is
D
7.2 The m-distance and ni-finitely determined processes Definition 7.2.1. Suppose (Z,!F,n, Ta) is a free and ergodic G-action and P : Z —> S is a finite labeled partition (that is to say, S is a finite labeling set). We refer to (Z,^,/.i,Ta,P), as a 2,-valued process. We will usually a abbreviate this by just the pair (T ,P) as long as there is no confusion. We consider two 1,-valued processes to be identical if they give rise to the same a-invariant measure on Y,G via the usual map of a point in Z to its name {P(T a (z))} g e C e S c .
7.2 The m-distance and m-finitely determined processes
151
If it is not necessary, we will also suppress the £ and just refer to (T01, P) as a process. This is in keeping with the usual vocabulary of the isomorphism theory of Ornstein and Weiss. Definition 7.2.2. Given two ^-processes (T*\P\) the ifi-distance between them by m(T?',Pl;T«\P2) inf
and (T*2,P2), we define
= (£({(z,,z 2 ) : P,(z Ud ) £ P2(z2,id)}) + m(/i)).
That is to say, we consider all m-joinings of the two actions, and among them look for the one which simultaneously matches the two labeled partitions on the two processes as closely as possible, and is as small an injoining as possible. For \i an m-joining of two processes we abbreviate U{{(h,h)
: P,(z1>id) ± P2(z2,id)}) by £(P,AP 2 ).
We refer to the evaluation fi(P\AP2) + m(p) as the m-evaluation at /i. Lemma 7.2.3. The m-distance is a metric on the space of all 1,-valued processes. Furthermore, for any e > 0, the set of m-joinings jx with , AP 2 ) + m([i) < m(T?>, P,; T?,P2) + 8 is open. Proof That m is symmetric follows from the fact that q' interchanges the roles of the two processes in an m-joining but does not alter the calculation of the infimum. The existence of the diagonal joining certainly tells us that the mdistance between a process and itself is zero. For the other direction of this, suppose m{T^,P\; T^^i) = 0. Let /t, be a sequence of m-joinings of the two actions with the m-evaluation converging to zero. Consider the projections of these measures to their S G x Z c -names alone, dropping all other coordinates. Certainly then, the measures must be converging to a measure supported on the diagonal. On the first coordinate it must be p\(f.i\), the projection of ^ onto the space of Z-names. As m(/i,) —• 0, by Axiom 2 we must also have that the projections of the /(, to measures on (S are converging to a point mass on the identity bijection. That is to say, the /i, converge on IP x S G x (S to a joining (not just an orbitjoining) of the two processes. As it is supported on the diagonal, the
152
The Equivalence Theorem
two processes must be identical. For the triangle inequality, notice that if we have two m-joinings, £, of (T*',P{) and (T«2,P2) and fa ° f (T?\P\) with (T*\Pi), then we can construct the relatively independent coupling of jUj and fa over their common (T,ai,Pi)-factor. An ergodic component of this (fa) will be an ergodic and free action with three m-equivalent arrangements, (versions of ai,a2 and 0C3), and three S-valued partitions (versions of P\,Pi and P3). In this fixed system, m^3j0(1 is a metric on its m-equivalence class of arrangements, and the calculation fa(Pj,Pj) is a metric on Z-valued partitions. Projecting fa to just its Z 2 x Z 3 -coordinates and partitions, £3 be mapped directly to an m-joining of the two processes {T^fPi) (T^,Pi). The m-evaluation for this m-joining can be pulled back evaluated on fa where it will be maximized by the sum of the m-calculations for the original two m-joinings fa and fa.
can and and two
That the set of m-joinings that get within s of the m-distance is open follows from an observation. Knowing that the space of m-joinings is a conjugacy invariant means we can assume here that the maps Pi and P> are continuous (i.e. the partitions are into clopen sets) as this is simply a different choice of model. This means the m-evaluation is an upper • semi-continuous function of the m-joining p.. We now define the notion of an m-finitely determined process. To readers familiar with the isomorphism theory this is lifted directly from the corresponding notion there. Hence it will be automatic that for m an entropy-preserving size, the Bernoulli processes are m-finitely determined, giving a basic class of examples. In this context it is more natural to speak of m-finitely determined processes of zero m-entropy and of positive m-entropy. The Bernoulli processes provide examples of m-finitely determined processes of positive m-entropy. One must examine case by case whether m-finitely determined processes of zero m-entropy exist. The simplest example is, of course, that no free and ergodic finitely determined actions of zero entropy exist. Having defined the m-finitely determined property, we will first show that it is an m-equivalence invariant. We will show more, that in fact any process that sits as a factor of an m-finitely determined process is again m-finitely determined. Hence one can speak of an m-finitely determined G-action as one for which every partition is m-finitely determined. Finally we will show that among the m-finitely determined actions, m-entropy is a complete invariant of m-equivalence, i.e. that any two in-
12 The m-distance ami m-finitely determined processes
153
finitely determined actions of the same m-entropy are in fact m-equivalent. We do this by showing that if {X,$F,\i, Ta) is m-finitely determined, and (X\,^i,n\,T"') is any other free and ergodic G-action with hm(T*') > hm{Ta), then in the space of m-joinings Jm(T*, Tj*1), those // for which the full-group elements {>,} and the first coordinate algebra 3F \ are /i-a.s. SF\ -measurable, form a dense G§ subset. If the two actions happened to both be m-finitely determined and of the same m-entropy, then, simply intersecting the two residual subsets, we see that m-equivalences between them not only exist but form a dense Gg, subset of their space of mjoinings. Before stating the definition of m-finitely determined, following our earlier convention, fix a metric D giving the weak*-topology on the space of Borel measures on a sequence space ZG where Z is some finite labeling space. Also remember, for P : X —> Z, we let p(x) = {P(T*(x))}gec be the Ta, P-name of the point. Again as a reminder, we define a pseudometric
Definition 7.2.4. We say a T-valued process (Ta,P) is m-finitely determined (abbreviated m-f.d.) if for any e > 0 there is a S > 0 so that if (T, a| , Pi) is any other H-valued process satisfying: (1)
\\(T\P),{T«>,Px)\\.<5and
(2)
h?'P m(T?',P l)>hm(T«,P)-d, l) then
(3) m(T, a
A few comments are appropriate. To begin, condition (2) is a bit disingenuous in that if m is entropy-preserving, then it should read
h(T?,Pl)>h(Ta,P)-5 and if m is entropy-free, then (2) is no condition at all. We have mentioned earlier that among positive m-entropy processes (that is to say, positive entropy processes and entropy-preserving sizes in), the Bernoulli processes give examples of m-f.d. processes. This is simply because for the f.d. processes of Ornstein and Weiss (1) and (2) imply ^-closeness, which is to say, the existence of a joining ft with
Such a joining of course extends via a point mass on {id} to an m-joining with m(p) = 0. Although this definition is well-suited to the completion of a general
154
The Equivalence Theorem
equivalence theorem it suffers from being perhaps difficult to verify for any particular process. For 3 + sizes we can use a more easily verified condition. In its definition we use the notion of an ergodic lift of some G-action T on a standard space (X,3F,\i). By this we mean an ergodic G-action T on some (X,^,^) which factors by some map n onto T. Notice that the full-group of T then lifts as a subgroup of the full-group of f. Further, if T = Ta then any other arrangment /? of the orbit of T lifts as well. Both of these observations follow from the fact that all functions on X, in particular / ° ^ , q"^ and ha'P lift to X through the projection of x to n(x). The same of course applies to say that any partition P of X can be regarded as a partition of X. Definition 7.2.5. For Tx a free and ergodic G-action we say the Z valued process {T*,P) is weakly m-finitely determined (weakly m-f.d.) if for each e > 0 there is a 5 > 0 such that for any other free and ergodic G-action TQ° and Z valued partition PQ with (1) ||(r<,P),(r o a ',Po)||. <5and (2) hm(T?,P0)>hm(T«,P)-S and each 5\ > 0 there is an ergodic lift T*1 of Toa°, a H-valued partition P\ of X\ and a 0i in the full-group of Tj*1 with (a) //,(/> 0 APi)<8and (b) m(ai,0 t ) < e,
for which (1') \\{T*,P),{T^,Pi)\\.
7.2 The m-distance and m-finitely determined processes
155
and there is a p, 6 Jm{T*a, Ta) with ,l 2 ) : -Pi(zi,id + ^fejd)}) + m(A) < e. The action
As the sequence n
in both distribution and m-entropy. Hence for this lift of TQ° we can select (j) = (j>i for some (' and Pi = P2(z2,id) and obtain (a), (b), (1') and (2'). To verify (ii) notice that the weakly m-f.d. condition is set to be used inductively moving from conditions (1) and (2) to (1') and (2') with (a) and (b) measuring the size of successive perturbations. Thus if (Ta,P) is weakly m-f.d. and e > 0, then for any (TQ°,P0) satisfying (1) and (2) of the definition one can obtain a succession of ergodic lifts, partitions Pt and full-group elements 0,. To simplify notation we can assume all these ergodic lifts sit inside one maximal ergodic lift we call T^°. From such an inductive application of the definition we obtain for an initial step: m(uo, >i) < e and /i(P0AP,)<£ but after this first step, for i > 1
where the values e, are at our disposal to choose. Hence we can force
with m(ao,P) < e and
with fi(P0AP) < s.
156 As (r o
The Equivalence Theorem 5o0i
a
,F,) —> (T ,P) in distributions we will have
If m is a 3 + size then Theorem 2.2.7 ensures f$4)~x —• So and that So, /? and the sequence <£,- form an m-joining in the weak sense of TQ° and T a allowing us to conclude that
a Our next goal is to see that the m-f.d. property is actually a property of a sub-c-algebra, not just a partition. More precisely, for any process (Ta,P), let Jf(Ta,P) be the a-algebra
geG a
the smallest T -invariant cr-algebra relative to which P is measurable. What we want to demonstrate is the following. Theorem 7.2.7. If(Ta,P) is m-f.d., and Q : X ^> ZQ is any finite ^ measurable partition, then (Ta,Q) is m-f.d. We will argue this assuming m is entropy-preserving. The entropy-free case follows precisely the same lines, but without the need for any entropy estimates. The result will follow from the next two lemmas. Lemma 7.2.8. Suppose (TX,P) is m-f.d. and Q : X partition with
—>Y,Q
is another finite
Then {T*,Q) is m-f.d. Proof We prove this for m an entropy-preserving size. Fix s > 0 and choose Ko £ G, a finite subset, so that there is a "coding"
satisfying M({x
Choose £o <
in JL
: Co({P(Tg*(x))}g6Ko) + Q{x)}) < e/10. small enough so that on any orbit space for any
7.2 The m-distance and m-finitely determined processes
157
pair of m-equivalent G-arrangements a\ and 1*2 with m(ai,a2) < £0, by Axiom 2 we will have
for all but at most e/10 in measure of the points x. Use eo in the definition of m-f.d. for the process (Ta,P) value do- Choose e < (5o/lO with
to obtain a
and e < Now once more choose finite codes c\ and c2 (much more accurate than Co) c\ : (£g)
—* Zp and c2 : (£/>)
—» ZQ,
K a finite subset of G, satisfying (a) M{x : c,({e(T g a (x))} geK ) = P(x)}) > 1 - e , and letting P(x) = c,({g(7 (b) /.i({x : c 2({P(Tgx (x))}ge«) = > 1-e. For any Sg-valued process (T"',Q\) let
Choose 5 > 0 so that if
(1) 11(7,0,(^,2011. <<5 then (1') ||(T a ,n(7Y l ,/',)l|. < \\(T",P),(T",P)\\. +
\\(T",P),(Tfl,Pi)\\.<8o,
and furthermore both H({xi • Co({>i(T^(x,))}g6K) = e.(x,)}) > 1 - e/5 and /ii({xi : c 2 ({P,(T^(xi))} g6K ) = Qi(xi)}) > 1 - 2e.
158
The Equivalence Theorem
Hence - H(2s)
Thus if (2) h(T?,Qi)
> h(Ta,Q) - 5 = h(T\P)
(2') h(T^l,Pi) > h(Ta,P)-50,
- 5 then
and hence
m(T«l,P1;Ta,P)<e0.
(3')
Let p. be an ergodic joining of T* and T"' for which the m-evaluation is less than eo, that is to say, for which /((Pi A P ) < e0 and m{p) < e0. We conclude that A(GiAe)
m(T?<,Qi;T«,Q)<s, and hence (T a , Q) is m-f.d.
D
Lemma 7.2.9. Suppose (Ta,P) is m-f.d. and Q : Z -+ T.Q is 3f(T*,P)measurable. Then (Ta,Q) is m-f.d. Proof From Lemma 7.2.8, we know that (Ta,Q\/ P) is m-f.d. From Corollary 4.0.8 of the copying lemma, for any t>o > 0 there exists 8 so that if (T, ai ,Qi) satisfies (1) \\( (2)
h(T?',Qi)>h(T\Q)-80/6,
then for (Y,
7.2 The m-distance and m-finitely determined processes
159
there is a partition Pi of^i x 7 satisfying (1) ||(Ta- x S , e , V F , ) , ( r a , e v P ) | | . <<50and (2) /i(T,°"
xS,QlVPi)>h(T*,QVP)-50.
This implies that m(Ta<
xS,QiVPi;Ta,Q\/P)<e.
Thus restricting the m-joinings to the processes {T\',Q\) obtain
and {Ta,Q) we
m(T?',Qi;T«,Q)<e, completing the proof that {Ta,Q) is m-f.d.
•
Definition 7.2.10. If(X,^,n, T a ) is m-f.d. for all finite partitions P we say a T is m-f.d. Notice we now know this will be implied if(Ta,P) is m-f.d. for a generating partition P. We now show that m-f.d. is an m-equivalence invariant. Theorem 7.2.11. Suppose (X, J*, [i, T' ; ) is m-f.d. Then for any a. with a ~ (5, Ta is also m-f.d. Proof
Fix a partition P and g > 0. Suppose {<£,-} s F with a<^>, —> /?. Set m
P = (0/)« ' n the ma-closure of F. By Theorem 4.0.2 we can assume all the (a, (f>i) are bounded rearrangements. Choose >/ so that
Letting a = (0~')/j in the m/(-closure of F, we have \a) = ma(<^/,^) < E/3. As T/'fr' is conjugate to T/:(, it is m-f.d. In particular (T / J fr',P) is an m-f.d. process. Hence there exists <5 > 0 so that for any process (Xu&uM.Tp.Pi) satisfying (1) (2) M77"Pi) > hm(Tl>+T\p)-& then (3)
M{T?,Pl;T?+T',P)<el.
160
The Equivalence Theorem
As ma(\A,B(b7x) < e/3, there are w, £ FG(O) with aip, -> B
supm(a, \pi) < E/3. i
By Lemma 5.0.7 we know
\imint hm(T««\P)>hm(TW,P), i—>oo
(remember, /im = h as we are assuming the size is entropy-preserving) and certainly
Select xp from among the ip,- so that (1) ||(7a">;JP (2)
hm(T^,
Let Q = P V P o v - ' , and as (T°">',P) and (T a ,P oip" 1 ) are identical in distribution,
Choose a value <5i so that if
then we still will have
m{a\,y') <e/3. We proceed now as if m were an entropy-preserving size. If m is entropy-free, just replace the use of Corollary 4.0.11 with Corollary 4.0.9, without the need of the extra Bernoulli factor to obtain entropy. By Corollary 4.0.11 there is a (5o so that if (1) | | ( r i a ' , P ) , ( T a , P ) | | . < 5 0 a n d (2) h(T?',P)>h(T,P)-d0 then for (Y/$, v,S) a Bernoulli shift of sufficient entropy, there is a xp' in the full-group of U"< = T,0" x S so that: (3) (4) (5)
\\(U^>',Pl),(T«v,P)\\.
7.3 The equivalence theorem
161
From this we can conclude that
and so
as m(ct\,ip') < e/3. Hence
m(Tax\Px;T«,P)<£ as ma(id, jgipf') < e A completing the proof that Ta is m-f.d. with respect D to any partition P.
7.3 The equivalence theorem We now develop the background for and prove the equivalence theorem, that any two m-finitely determined G-actions of the same m-entropy are mequivalent. Let (X,&,\x, T") and (Xi,^\,m, T*1) be two free and ergodic G-actions. We already know that the space of measures J ^ T " 1 , T a ) is a Polish topological space in the weak*-topology. What we intend to show is that if both of these actions are m-finitely determined and they have the same m-entropy then in fact those m-joinings that arise from m-equivalences between the two actions are a residual (dense G&) subset
ofJm{T?\Ta). Stated in this form, the equivalence theorem will follow directly from a corresponding "Sinai theorem" whose structure we now develop. Consider the subset Sm s Jm{T*',Tx) denned as follows: <$m{T*\ Ta) = {pe Jm(T?\
Ta) : with respect to /i,
all /, and hence $, are /^(J^O-measurable and
For p. to belong to Sm means p. can be thought of as follows. There are elements in the full-group of Tj*1 for which (/«'•*' ®/"••** ® ... )Vi
eJfn.
Hence a\4>i —> P, and there is a measure-preserving map r\ : X\ —» X m
with rjTf = Tan. This really is, very loosely speaking, no more than saying ct\ is m equivalent to an arrangement /? for which T[ has Ta as a
162
The Equivalence Theorem
factor. To actually obtain the joining in Sm one must select a sequence of full-group elements that achieves the m-equivalence, and then perhaps drop to a subsequence to guarantee they map fi\ into Jlm. To see that any p € Sm gives rise to this situation, just notice that from the definition of Sm, one can project the full-group elements (/>,• from Xf x XG x fa to Xi, as well as the identification of 8F as a subalgebra of &\. Notice further that to have both p e gm(T*\Ta) and q'(p) e gm(Ta, Tj*1) is to say that p arises from a pair of m-equivalent arrangements whose corresponding free and ergodic G-actions are conjugate to T"1 and Ta respectively, i.e. the two actions are m-equivalent, and p is a detailed description of one such m-equivalence between them. is a Gs subset What we want to show first is that J/m(T"l,T") x of Jm(T"', T ) (perhaps empty). To complete the equivalence theorem what we will show is that if T* is m-f.d., and hm(T*') > h,,,(Ta), then
gm{T?\ Ta) is a dense subset oU,»{T*\ Ta). To that end, for / e N, P : X —> S P , a finite partition, and £ > 0 let
<9(I,P,e)={jieUT?,T"): (i) there are 0- / = 1,...,/ in the full-group of T"1 with H{y = (P,(X,),P(A-),{/,}) : ^ , ) ± 4»{y)}) < e and (ii) there is a P' : X, ->• I P with
That is to say, an m-joining belongs to 0(1,P,e) if, loosely speaking, the first / full-group elements (f>i in the joining can be approximated by full-group elements measurable with respect to $F\ (the first coordinate algebra) and the partition P, which is measurable with respect to $F (the second coordinate u-algebra), can be approximated by a partition in !F\ (the first coordinate algebra). Lemma 7.3.1. The sets (9(1,P,E) are open subsets ofJm(T*>, Ta). Proof Let p € (9(1,P,E). AS the topological space Jm(T*',Tx) is a conjugacy invariant of the two G-actions, we can assume that the elements >•, i'• — I,...,I and the partition P' are all continuous functions of X\ (in detail this is simply asking that a countable collection of sets be assumed clopen). Similarly we can assume that P is a continuous function of X. Now both (i) and (ii) of the definition of (9(I,P,l/j) are simply asking that the Ll(p) norms of a finite collection of continuous characteristic
7.3 The equivalence theorem
163
functions are < e. Thus there will be a neighborhood of p in the weak*topology on which both (i) and (ii) will continue to hold. • Lemma 7.3.2. Let P, be a countable collection of finite Borel partitions of X, dense in the space of all Borel partitions taking values in N. Then
and hence is a G$. Proof
One containment follows directly from the definition, that
To see the other containment, let p 6 U/0(/,P,-, \/j). There will be a partition P( of X and elements >•, i = I,...,I in the full-group of Tj" with P/ =Pj,
/i-a.s. and
4>\ =4>h
i=l,...,I,
/i-a.s.
Hence for all p e r\UJ0{I,Pj, \/j), P/ =P,,
/i-a.s. and
0) =$,-,
/i-a.s. for all i.
As the collection of sets in p(^) that are also /(-a.s. also in p\(S^\) forms a ^-complete u-algebra which we have just shown contains all of the Pj, p(^) is /(-a.s. a sub-c-algebra of p\{3F{). This proves the other containment
D To complete the equivalence theorem we show that if {X,SP',/<, Ta) is with an m-f.d. free and ergodic G-action, then for any (X\,^\,i.ii,T*1) hm(T?[) > hm(Tx), then each of the sets (9(1,P,s) is in fact dense in J,,,{T?\Ta). Having already shown in Theorem 6.5.8 that Jm{T*\Ta) a is a Polish space, we immediately conclude that Sm(T^, T ) is a dense subset of this space m-joinings. We refer to this proof of denseness as the "Sinai theorem" of our work as it embeds the m-f.d. action T,*1 as a factor of some T11 where a ~ /?.
164
The Equivalence Theorem
Theorem 7.3.3. Suppose (X, 2F, n, T a ) is an m-f.d. free and ergodic Gaction, and (X\,!F\,n\, Tj*1) is a free and ergodic G-action with hm(T?l)>hm(T«). Then for any I € N, finite partition P and e > 0, the set (9(1, P,s) is dense in jmI(i (Tai[ in
a , i T \).
Proof Let p. e Jm(T"',Tx) be some fixed m-joining, and let n be an open neighborhood of/i in Jm(T?\T«). Notice that in Jm{T*\T?) the arrangement a represents ai and /? represents a. We need to show that (9(I,P,s)nn is not empty. is a conjugacy invariant, we can assume both X and As Jm(Ti',Ta) ^ i are 0-dimensional spaces, that is to say have dense families of clopen sets, including the F-measurable sets. Hence we can find partitions Q of ^ i and P' of X, P' a refinement of P, a value /o € N and a value eo, E > e0 > 0 so that if fa e Jm{T?>, Ta) satisfies h (a)
\\{o*,QonxV
P' on2v\J
gMd)ti,
then fa e n. We will construct fi\ satisfying this as well as
(b) fa e(9(Io,P',eo)<=&(I,P,£\ To obtain fi\ we will construct an /o,./,c>-perturbation of the free and ergodic G-action
((Xi xX)G
xk,^,faa").
This perturbation will be the m-joining we want in the loose sense. We then just map it by Q* into the space of m-joinings. By Theorem 7.1.5 there exists <5i > 0 such that for any J\ sufficiently large, any !o,J\,S\-perturbation of the above system will still project to a measure in J/m. Our choices for 5 and J will be at least this small and large respectively, and hence all we need see is how to also obtain both (a) and (b) above via a perturbation. As [Ta,P') is m-f.d., there is a S2 > 0 so that if T2*2 is some free and
7.3 The equivalence theorem
165
ergodic G-action with: (1.0)
\\(T?,Pfr(T<,P')\\.<82and
(2.0) hm(T?)>hm(T«,P>)-52 (3.0)
then
m(T?,Pl;T',P')<5l.
As <x
hmj,(c«*J,P')jhm(T\P').
Select Ji larger if necessary to ensure: (1.2) \\(o*'i,P'),(Ta,P')\\. < ^2/2 and (2.2) h^o"**. ^') > U ^ a , P') ~ WAs hm(Tfl) > hm{Ta), select afinitepartition Q" of Xi refining Q' with
Consider the free and ergodic G-action
(1.3) (2.3)
\\{T*^,P\),(T«,P')\\.
< S2 from (1").
hm(T*'*''a+l,Pl)>hm(Ta,P')-52
and hence we will have (3.3)
m(T 1 a "'' lo+1 ) P,;r a ,P')<<5i-
We also require of the copying that h
(a)'
||(^,(2"oniVP'o^2v\/g(a,«),v 1=1
/o+l
( T j " , Q " v P ; v V «(«.,*!))II- < £ o -
166
The Equivalence Theorem
Now (3.3) tells us there is an m-joining /i0 of TJ""*'0+1 and T« with (1) /i o (P|AP')<<5, < e o a n d (2) m(jk) < Si.
We can lift to this m-joining of r " " ^ ' and T* the full-group elements 4>\, and the arrangement a.\ yielding the sequence of G-actions
The m-joining now comes equipped with its canonical full-group elements >, with a$, —> /? and for which rimsupm;-,o(a,0,-) < <5i i—>oo
which is to say (ii) m ai 0, o+lA ((^ 1 -) ) id)<(5i. x.k,g&,fo,oai) Notice that (i) and (ii) tell us that the G-action ((X] xX)G with full-group elements (f>\,...,(j)'lfj+i and {(/>,} form an 7o,Ji,<5i-perturbation of the original m-joining jui with its canonical list of full-group elements. In particular we know by Lemma 7.1.5 that dropping to a subsequence of the 0, if necessary we will have
Examining this perturbation more closely, consider the two projections:
These satisfy:
=m, ai
7r,(T = Tf'Tt, and 7 t 2 ^ = 1
a
Tan2
and hence this is an m-joining of Tj* and T in the loose sense. Let
Ai = G*(Ao) be the corresponding m-joining in Jm(T*\ Ta). Notice that (/>•' = ty for i = l,...,/o are all rcf'J^i-measurable, and as /io(PiAP') < so, we will have (b), that /i, efi>(/o ,P',eo). Statement (a') implies directly that /ii satisfies (a) and the result is proven. •
Appendix
Our intent in this appendix is to provide a linkage to the two previous papers the authors have written on restricted orbit equivalence. We say that an r-size is a size function m as defined in this current work, in Section 2.2. (The "r" denotes "rearrangement".) We will see that the notion of m-equivalence developed in [25] (where m is a p-size, where "p" denotes "permutation") is subsumed by our work here. On the other hand, we will not quite be able to show this for the work in [43]. What we will see though is that a slight strengthening in the definition of the equivalence relation associated with a 1-size, (size as defined in [43]) will make it possible to describe the equivalence relation as a restricted orbit equivalence in the sense we describe here. As we will see, this change will have no effect on the examples described in [43], and the m-f.d. systems for the original equivalences are unchanged by this strengthening. Whether some equivalence classes of arrangements are possibly changed for some 1-size, we do not know. As we now consider the definition in [43] to have been a very preliminary, perhaps unrefined, attempt to axiomatize the notion of restricted orbit equivalence, we have not pursued this issue further. Our main interest here is to bring the examples, in particular examples 3 and 4 (referred to as mv and m^) under the umbrella of our work here. We break this work into two sections. First we will handle the notion of a size, here called a 1-size (for 1-dimensional) used in [43] for actions of Z. This will be our most detailed section. Then we will discuss the notion of size first put forward in [25], here called p-sizes, for actions of Z'1. In so doing we will also consider a preliminary axiomatization we have given for discrete amenable group actions, as it lies somewhere between that of [25] and what we have discussed here. What we do here is rather technical, and as it is perhaps not of great 167
168
Appendix
interest to the general reader, we will not provide a great deal of motivation. We also will assume that an interested reader has a copies of [25] and [43] in hand to refer to.
A.I 1-sizes The notion of a 1-size, as set up in [43] begins at the level of permutations n of intervals of integers (i, i + 1,.. .,j), and lifts from here to bijections of Z via the definition m(/) = liminfm(7t/i(lj)) I—>— CO
j-yco
where 7t/,(,j) is the "push-together" of /|(,j), i.e. the permutation of (/,_/) that reorders points exactly as /|(,j) does. In [43] a bijection of Z is indicated by a function / . To any such bijection / there corresponds a unique bijection fixing 0 that reorders points exactly as / does:
h(n)=f(n)-f(0) (as we are in Z we use additive notation). Axiom (ii) (page 7 of [43]) requires that m be "stationary", i.e. as a calculation on a permutation it should depend only on the number of terms and the way they are reordered, not on where the block (1,7) is placed. Thus the calculation m(f) is translation invariant (regarding / as an element of Z z ), m(h) = m(f). This then implies, using our notation, that m(h) = m{S'{h)) as S'(h){n) = a(f)(n) - a(/)(0). The value of the 1-size distance between a pair of arrangements (m(oci,a2)) (called orderings on Z) is now set to be the a.s. constant value of m(hax>'«2).
Notice that we could have written this as m(a
a2)=
fm(hy
Written this way, we see this can be viewed as the integral with respect to various invariant measures of the Borel function m(h) on (S. Two arrangements are defined to be m-equivalent (written ai ~ 0C2) in [43] if there are full-group elements (/>, and m(<xi >,-, 012) -> 0. The given m is not a metric (soon we will see how easily it can be made one) but taking it to be one, what we are doing is taking the w-closure of the set of all <xi0 in the full-group.
A.I \-sizes
169
This approach is most different from our current one in that it evaluates a distance between arbitrary arrangements rather than just rearrangement pairs (a, >). This is best understood by the introduction of the collapsing of one arrangement onto another, which converts a pair of arrangements into a rearrangment. This has a serious fault, that the collapsing of a pair a, a<j> will produce a new pair a,<x0' where (j) and >' are not necessarily close in L1. This leads to a serious difficulty in obtaining Axiom 3 for the r-size we construct. This is more than just technical, as Axiom 3 plays a pivotal role in all of our development. It did as well in the development of [43], in particular in trying to develop the m-f.d. notion. This is the motivation for the consideration there of dividing rearrangements, and the restrictive definition of an m-joining, requiring the two processes to be covered by a third in which they sit linked by a bounded rearrangement. Lying behind this is the fact that if (a, >) is in fact bounded then collapsings (a, (/>') can be chosen such that 0 and ft are L1-close (Corollary 4.5 of [43]). This problem that a collapsing of a rearrangement is not necessarily a small L1-change is also hidden in the interplay between the definition of m(a\,a.2) via collapsing of blocks and the requirement that m satisfy an approximate triangle inequality. For these to coexist for the same notion of m forces severe restrictions, and is in some sense the reason we can show all of our examples are equivalent to r-sizes, and also the reason that a-equivalence [8] comes from a p-size but not a 1-size. We introduce two ideas to provide the linkage we wish to demonstrate between 1-sizes and r-sizes. Definition A.I.I. We say a rearrangement (a, 0) is bounded if the function /"'* = a(x, (j)(x)) is bounded (i-a.s. We say a rearrangement (a, (j)) is blocked and bounded (abbreviated B&B) over a subset F if the return-time rF(x) = min(n > 0 : Tjf(x) G F) is bounded, and cp acts as a permutation of each return-time block (x,T l «(x),...,r« (vH (4x6F. Lemma A.1.2. Those <> / with (a, >) bounded form a subgroup of the fullgroup and Theorem 4.0.2 tells us that those (p with (a, (j)) B&B over some set are \\-,-\\*-dense in the full-group. Definition A.1.3. Axiom (v) of a l-size tells us that for any s > 0 there are S and N so that if (a, (f>) is B& B over some set F with
170
Appendix
(1) rF{x) > N for all x G F and (2) m(7t.v,(o,rf(x)-i)) < <5 tfen m(a,a(j)) < s. Any {cc,(j)) B&B over a set F satisfying (1) and (2) we will call m,
H-B&B.
Definition A.1.4. We say a sequence of rearrangements (a, (ft-) is nicelyblocked over F[ if each of the rearrangements (a(ft,(^>~'
the next full-group element (ft acts as a fixed power of Ta. That is to say, for each x € F, there is a value j(x) and for all x' in the return-time block overx, 4>i+\ (x1) = Tf{x)(x'). We say the sequence of rearrangements is m-nicely-blocked over F, if for all i < j , (a^j, (ft"1 <£_,•) is B&B over Ft and is in fact m, 2~' - 2~j-B&B. Lemma A.1.5. / / ( a , 0,) is nicely-blocked over the sequence of sets Fi and /<(F;) —> 0 then a0, is converging in L 1 to an arrangement fi. Furthermore i
the sequence (/?,0f') is also nicely-blocked over the sequence of sets 0,(F,) and /?0~' —> a in L]. We get no symmetry like this for m-nicely-blocked i
sequences as m is only assumed symmetric for arrangements and not for permutations in that knowing m(n) is small does not a-priori imply that m(n~]) is small. Definition A.1.6. For m a {-size we say a.\ ~ ai nicely if there is an mnicely-blocked sequence of rearrangements (pi with m{a\<j>i,a.2) —> 0. i
This relation cannot be assumed to be either symmetric or transitive. Definition A.1.7. We say a l-size m is a l+-size if for any e > 0 there is a 5 and for any rearrangement (a, $) with m(a, a>) < 5, and any s\ > 0 there is a bounded
A.I l-sizes
171
Proof Note: In these arguments we will assume the reader has access to [43]. The two examples (mo and mm) and two classes of examples (mv and m^) all have the common feature that m(ai,c<2) can be defined without reference to collapsing. For mo this is done in Lemma 2.6 of [43], for mv in Lemma 2.10 of [43], for m^> in Lemma 2.14 of [43], and notice for mm, if mm(a.\,a2) < 1 then a\ = OLI. Modifications of their proofs give the arguments we seek. We really only need consider mv and m^. Suppose we consider a = ai and a0 = <X2. Theorem 4.0.2 gives a method to modify (j> by less than any preassigned amount in L1 to a 4>' so that (a, >') is bounded. We show that given any e > 0 there is a 5 so that if the original m(a, a<£) < <5 then the new value in(a,a
or
is at least half the size of #(/"•*'((«, «0)A(M,,,)).
172
Appendix
As \p(n)/\p{2n) is required to be bounded away from zero in n, we obtain the result. For ni0, work from the characterization of m^ in Lemma 2.14 of [43]. Note that we will represent full-group elements here by fy and <^>' to avoid misunderstanding with the use of 0 as a parameter of m. To begin, fix 1 /4 > e > 0 and select a value Bo so that li({x:\*(x,4>(x))\
Next choose a value B2 so that 20fli" Thus: ..//.»
.
SOCjd) / /
Ll\ i A • / v
p
D
\ \
i—
/
D
D
D
|
D
\~l \
^-^ 1
\ \ —*^ 15 ^* 1 / / ^= V — ^ 2 — *^ 1 " 2 "~r" " 1 / i / - ^ ^ —
Choose N so large that iVg/20 > B2 + B\. In Theorem 4.0.2 be sure the Rokhlin tower has height H > N and covers all but E / 1 0 in measure of X. Apply the construction of Theorem 4.0.2 to this tower. That is, map those points x in the tower whose image under <j> throws the point out of the tower block containing x to those points in this tower block that do not have ^-preimages in it. On points outside the tower, we replace <j> with the identity, to create the new full-group element 4>'. It is a direct calculation from our choices for Bo,B\,B2,N and H that H({x : (f>(x) ^ >(x)}) < £. To see that m^(a,a0') is still small, we assume m$(a.,a(/>) < 5 < 1/4 and let A be the set given by Lemma 2.14 of [43]. Remove from A all points that are (1) outside the tower, (2) within e.H/5 of either end of a tower block, or for which Call the remaining set A. It follows from our estimates that H(A') > fi(A) — s.
We wish to show that this set now can be used in Lemma 2.14 of [43]
A.I 1 -sizes
173
to show that m$(a.,a.<j>') < 3d + e. We do this by showing that for any x e A' and any x' in the orbit of x, that <j>(\a(x,x')\)
< 5 + e/3.
-1
Note: if \a - 1| < d < 1/2 then |a - ±| < 3d and if \a-l-\< d then a - 11 < d. We only need consider those x' with 0(x') ^ 0'(x') as otherwise we already have the estimate. Hence we can assume |a(x,x')| > B\. Suppose x' is such a point. There are two possibilities: (1) 4>'(x') = x' or (2) x' lies in a tower block, but is thrown out of it by (j>, and hence x" = <J>(x') lies in this tower block, but ^>~'(x") does not. To understand (1) notice
'(x), 0'(x')) = a(x, x') + a(4>'(x), x). As \a(4>'(x),x)\ < Bo and |a(x,x')| > B\, we conclude: -1 To understand (2) we describe the case when all three points x,x' and x" lie in the same tower block. When they do not, the estimates improve. Once more there are three possibilities: (1) Both |a(x,x')| and |o#(x),x")| > EH/10, or (2) |a(x,x')| < both \a((j){x),x")\ and EH/10 or (3) |a(0(x),x")| < both |a(x,x'))| and In case (1) we will have that
EH/\O.
A x)[
[f ' , must lie between e/10 and 10/e, |o#(x),.v")|
both numerator and denominator are larger than B\ and we conclude that !«(*, x')|)
- 1 < e/10.
Cases (2) and (3) will be handled in a parallel fashion. For (2), we will have > etf/10 but |c#(x),x")| < H and so: 1>
a(x,x')|
e
|a(x,x')|
174
Appendix
Thus
4>(Hx,x')\)
H\a(x,x')\) > (1 - (5)(1 - e/10) > 1 - 5 - e/10.
For (3) use the same argument as for (2) but from the point of view of (j)~\ i.e. replace x with 0(x) and x' with x" and 0 with 0~' throughout. • This completes the calculation and the proof. Having seen that the examples of [43] are l +-sizes, we now want to explain the link between nice m-equivalences and l +-sizes. To begin we now remind the reader in more detail of the construction of collapsings. This is carried out on pages 47-48 of [43]. By a collapsing a2 of a2 on a\ one means the construction of a full-group element (f> (S2 = a\4>) so that (oi\,
«,:«„*,-„-{ "f «»>"-» °'. That is to say, 0T2 is obtained by pushing together or collapsing the image points /*'' a2 ((0,...,rf{x) — 1)) into a consecutive block. It is convenient to allow the collapsing to act as the identity on some of these blocks. Such a collapsing is called an g, K-collapsing if (in our vocabulary)
Note: Having 8 in the definition in [43] will give an £ in this definition. Having an e in this definition will give J£ in the [43] definition. Definition A. 1.9. //) [43] the notion of an e,,e,,K -collapsing was also given where one also asked that m(a 1,0:2) < fi-
We give here a stronger definition for this notion than the one actually used in [43]. Notice that an E,K-collapsing is a B&B rearrangement. If it is in fact an m,s-B&B arrangement we call it an F.,S,K-collapsing (see Definition A.1.3). This in particular forces m(a\,u\4>) < s. Lemma A.I.10 (Lemma 3.2 of [43]). Given any s > 0 there is a 5 so that if m(a.\,a.2) < 5 then for all s,K
on a\.
there exists an s,s,K-collapsing
0.2 of'0.1
A.I i-sizes
175
Proof Although [43] only claims to obtain its definition of an e,E,Kcollapsing, it does so by obtaining a collapsing satisfying our Definition A. 1.9. • Collapsings behave particularly nicely with respect to bounded rearrangements (or more generally what are called dividing reorderings in [43]). We recall a few simple facts about rearrangements of Z. Recall that a.(f> = aft iff >' = Tj o (f> (we are assuming ergodicity universally). Next recall that for (a, (f>) a bounded rearrangement, <x(x, (j)(x)) d[i
is always an integer, which we call J(a, (j)) (the average translation induced by (j)). Thus we can replace (j) by 4>' = T^j^^cj) to obtain a rearrangement (a, (/>') with J(a, (j)') = 0 and a0 = a>'. As the calculation of m for a rearrangment is m(a, a
•
We now need a little trick which we will discuss in more detail later under the notion of a convergence criterion. For now we need just a very special case. Lemma A.I.12. Suppose (a, 0,), / = l,...,N is a finite sequence of blocked and bounded rearrangements which for all i < j , (a>,-, 0~'0y) is m, 2~'—2~iB&B. Assume further that they are nice in that the base sets F,- over which they are blocked are nested and on any return-time block of F,- the rearrangement (a4>i,4>i^l4>i+\) acts by a constant translation. (This is saying that the finite list is the beginning of a potentially m-nicely-blocked sequence.) There is then a <5/v depending on this finite list so that for any bounded rearrangement (ot,(j)N+i) with J(a,4>N+\) = 0 and
176
Appendix
we will be able to construct a new list (a, $(•), ; = l,...,N + 1 of blocked and bounded rearrangements, satisfying all the conditions of the original list but with N replaced by N + 1 and with, for all i = 1,..., N + 1,
H{{x : Ux) + #(x)}) < 2- /v " 9 . Proof Notice that if 5 is small enough, we can obtain cc0'N+1 as a collapsing of GC0JV+I on a0jv with all its conditions, and its base set F^+\ as small as we like. Set \p =
so that if(a,4>j) is an infinite sequence of bounded rearrangements all with J=0 satisfying m(a0j,a0 ( - + i) < (5,((a,0!),...,(a,0,))>
then there is an m-nicely-blocked and bounded sequence of rearrangements (a, 4>\) with
for all i. This will mean that there is a P with m(a0J,/?) -> 0 and hence that i ~ p. Furthermore we will see that there is a \p in the full-group with
A.I {-sizes
177
which is to say the <x0, are converging to an arrangement fi\p that is inequivalent to a. Proof We will construct inductively using Lemma A. 1.12. At stage N of the induction we will have constructed a sequence of rearrangements (a,(pf), i = \,...,N. These will satisfy the hypotheses of Lemma A.1.12. The value d^ will be the value SN of this Lemma. Supposing m(a(j)N,a(j)N+\) < SN, let 0/v+i<^'<^iv be the " $ # " of the lemma, as (a0jv>>/v')N+i) and (a.4>'N,4>'N~x4>N+\
implying the full-group elements <^f converge in N to the full-group element we call <\>\ with
K{x : Ux) + #(x)}) < ii{{x : cf>)(x) ± <M*)}) +
2'^.
Our inductive construction makes
and so summing backwards to i = 1 we see that for all /
H({x :
which is summable. This tells us that the 4>'~X (j); are L'-Cauchy in the full-group, hence converging to some \p. Thus a0, = a<^J0p'), converges in s4 to the arrangement jSip finishing the result. •
178
Appendix
The following corollary now indicates the link between l + -sizes and m-nicely-blocked sequences of rearrangements. Corollary A.I.14. Suppose m is a l+-size and a ~ fi. There is then an arrangement y and full-group elements ip,- and tpf, i = 0 , 1 , . . . , so that the rearrangements
and
are both m-nicely-blocked and bounded and both m(aip,-, y) and m(/?ip(', y) tend to zero in i. Proof
W e will begin by c o n s t r u c t i n g sequences ip,- a n d ip,' with m{a.\pi,ip"1tp,+i)
< <S,-((ayj0, Vo"1 >i )>•••>(«Vo, v V V i ) )
and
mWit xp'-X v ; + 1 ) < <5,((M, Vo"' vi), • • •, Wo> Vo"' V,')) and
Theorem A. 1.13 now completes the result by giving $>,• and y;, that are m-nicely B&B with both
and as m(a\pi,fi\p\)—> 0, y = yiy; for some ip. Setting y>(' = t/>,t/> is all we need do. To construct ip, and ip(' we will alternately add terms to each of the sequences ip,- and tpf. Moreover, at each stage all the full-group elements in both lists that we have already constructed will be slightly perturbed by right multiplication by an L'-small full-group element. As we do this to all terms, we do not change the relations among them, and as these perturbations will converge in the full-group, the two sequences will converge to sequences in the full-group. The two full-group elements ipo and ip'o are in fact the total perturbations. In this process we will not index over this sequence of perturbations but will always just indicate ip, or ipf, noting that they have been perturbed but have kept their names. To begin, let both ip0 and ip'o be the identity (they will, as indicated above,
A.I
l-sizes
179
change). The choice of \p\ is arbitrary, but choose it so that (xipo,
Choose y;2 so that both (1) m(cnp2,Pv\) < 8\ and (2) (a\po, v-'cT' VJ2) is bounded. Do this by first obtaining (1), then perturbing both tpo and ipi by right multiplicating by a full-group element that is within some e2 in L1 of the identity to obtain (2). To construct i/J2> a PPly Theorem A. 1.13 to the a side to get
52 = <52((a<£o, <£o '>!)> («<£(), >o''fo))Choose i/;2 so that both (1) m(/?t/;2,ay;2) < <52 and (2) (/?y; 0 ,ip 0 ~y) is bounded. Do this by first obtaining (1), then perturbing the list \p'o, ip\ by right multiplying by a full-group element that is within some e2 in L1 of the identity to obtain (2). To finish we just continue this induction alternating sides and using successively the bounds <5JV of Theorem A.I.13. Making the e,- summable will force convergence of the perturbations, and hence existence of the I/;,- and i/;j as described. • This rather long effort has now established the role of nicely blocked sequences of rearrangements. The Fundamental Lemma (Lemma 4.8, page 100) of [43] actually describes an inductive step of precisely the same form as this result. The sequences of arrangements (ao,ai,...,a n and a'o,a\,...,a'm) constructed there are not directly assumed to be m-nicelyB&B but the mass of structure in the definition of an (n, /^-approximation says we could in fact collapse all the terms and get a sequence that is.
180
Appendix
This observation is not going to matter much to us as boundedness is all that will really matter in our discussion of m-f.d. actions. We point this out here just to indicate the parallelism. We proceed to construct an r-size associated with every 1-size which, for l + -sizes will have precisely the same equivalence classes, and universally the same finitely determined classes. Both these results will be based on the fact that for a sequence of rearrangements a$,- —> P and /?(/>"' —> a i
i
in L1 which are bounded in both directions, convergence for the 1-size and the corresponding r-size will be equivalent. Let m be a 1-size. Our first simple task is to bound m. For a permutation n, define m'(0) = min(m(7t), 1). Lemma A.I.15. If m is a l-size, then in1 as defined above is as well. Moreover, this new l-size has the same equivalence classes as m. Proof It is quite simple to check the six conditions of [43]. Notice that using m1 the requirement that m(a2,a3) < 1 in axiom (vi) can be completed omitted, i.e. one has: (iv) given any e > 0 there is a <5 > 0 so that if m1 (1x1,0(2) < <5 then for any a3, m'(ai,ot3) < m{{a2,ai) + e. To see that the equivalence classes are unchanged, just remember that for oti ~ (X2 means there are full-group elements $,• with m(ai0,-,a2) -» 0. 1
We observe that m'(oc,/?) = min(m(a,/?), 1)- Thus m(ai0,, a2) —* 0 is equivi
alent to m1 (ai>,-, 0E2) —> 0. Thus the two equivalence relations agree. i
Our next step is to replace ml with a metric. To do this we must leave behind permutations. We use a standard trick for creating a metric. For any arrangement a\ define a function: F ai : s/ -» [0,1]
by
Fai(a2) = ml(an,a2).
Set
It is easy to see that m2 is a metric. The only non-obvious point is that
•
A.I [-sizes
181
the map a\ —> F a , is 1-1. To see this just note that the only zero of this function is at <x\. Lemma A.I.16. Given any e > 0 there is a 5 > 0 so that: (a) If ml(a\,a2) < d then m2(a 1,1x2) < £ and (b) if m2(a\,ct2) < S then ml(a.\,U2) < e. Proof Statement (b) is trivial as m2(a\,a2) = s\ip(\ml(tx\,<X3) — ml(a2,a.i)\) > m'(ai,a 2 ). 013
For (a), using (iv) stated above for ml, if m'(ai,a2) < 5 then for all 0C3, |m(ai,a 3 ) —m(a 2 ,a 3 )| < £ which is what we want.
D
Lemma A.I.17. For m a l-size, <x\ ~ a2 if and only if 1x2 is in the m2-closure of the full-group orbit of tx\. Proof
To say m(<xi0,,a2) —> 0, as we just saw in Lemma A. 1.16 is
equivalent to m2(a.\4>i,0.2) -* 0.
•
i
We need to take two more steps in modifying m to become an r-size. At present m2 satisfies Axioms 1 and 2 of an r-size (Axiom 2, that (r,m2) —> (F, II, ||*) is uniformly continuous follows from Axiom (iv) of a l-size, Lemma A.I.16 and a bit of thought). As we indicated earlier Axiom 3 is not clearly associated with the conditions of a l-size and acquiring it will take a little two-step. Definition A. 1.18. For m a l-size let m\aua2)
= inf(/n2(a 1>a 20) + /*({* : 4>(x) ± x}) : > € FG(0)).
Note: we use (i({x : $(x) ^ x}) rather than ||0,id||" as it has better equivariance properties. In particular relative to the metric /.t({x : (f)\(x) =fc (j>2(x)}) the full-group is a Polish metric space and this metric is invariant under both right and left composition. Lemma A.1.19. For m a l-size, m3 is a metric and for any arrangement a, the m-orbit closure of {cup} is the same as the n?-orbit closure. Moreover n? satisfies Axiom 2 of an r-size.
182
Appendix
Proof To check the triangle inequality, suppose m3(ai,a2) = a and m3(a2,o<3) = b. For any e > 0 there exist 0i and 02 with a <m2(ai,a2<£i) + yu({x : 0,(x) ± x}) + e, b <m2{a2, a2(j)2) + M{* • fa(x) ^ x}) + e
Thus 3
2
+ K{x :fa(l>\(x)^ x})
>') + M{* : <^'(x) 7^ *}) < e a n d hence m2(oti,c(200') < e and the in2 orbit closure contains the m3. The other direction is trivial. As m2 satisfies Axiom 2 of an r-size, and ^<({x : cp(x) ^ x}) is an r-size, 3 • m satisfies Axiom 2 as well. Definition A.1.20. Let m4(a,4>)=
limsup
m 3 (a',a'0')-
Our next task is to show that m4 is an r-size. Lemma A.1.21. For m a l-size and a an arrangement m\(4>\,4>2) = m4(a
This is a job for a copying lemma. By Corollary 4.0.9, for any d > 0 there is a <5| > 0 so that if ||(a, &),(a', # ) | | . <5U then there is a 0', in the full-group of Ta' with both
||(a,0,),(a',0',)IU < ^
and
||(a01,0r'02),(a',oc'0r102)ll- < S.
A.I
\-sizes
183
Note: This is stretching Corollary 4.0.9 a bit. To see that this stretch is acceptable notice that as
we will have both
||(a, fa),(<*', >',)II.-0
and
Thus use finite partitions that approximate the two possibly infinite partitions g(a,02) an^ £(«',<£') a s Q ar>d 6i m Corollary 4.0.9. Continuing, for any e > 0 we can choose 5 and d\ small enough that m4{a,(l>2)<m3(a',a'(t>2) + 8, m 3 (a',a'^',) < in4 {a,^>\) + E, and
mWi.fr'^) < ^(a^.^r'^ + e. We conclude m4(a, >2) < m\a, 4>i) + m4(a0i> >?'02) + 3« to complete the result.
D
Theorem A. 1.22. For m a l-size, m4 is an r-size. Proof Lemma A. 1.21 says we only need to verify Axioms 2 and 3. Lemma A. 1.19 tells us m3 satisfies Axiom 2 and as
we conclude m4 does as well. The definition of m4 is such as to make Axiom 3 direct. • We will complete our work now by showing that for any l-size, if ai ~ a.2 by an m-nicely-blocked sequence of rearrangements then a, ~ a2. Suppose (a, 0) is B&B over a set F and hence rF is bounded. For each x e F let n'x be the permutation of (0,1,..., rF(x) — 1) with
0 ( T ? W ) = Tfc{J)(x), 0 < j <
rF(x).
This is a finite list of permutations. Add on to this all identity permutations of finite blocks (0,...,N — 1) and call the resulting collection of permutations U(4>, F). This is an infinite set of permutations only because we included all the identities.
184
Appendix
Lemma A.1.23. Suppose (a, >) is blocked and bounded over a set F, and for some N, rF(x) > N for all x € F. Then for any e > 0 there is a S so that for any (a.1, ft) with \\(a,(j)),(a',ft)\\.<S there is a ft' with n({x : <j)'(x') j= ft'{x')}) < £ and {a1, ft') is blocked and bounded over some set F' with 2. rF(x')>N
for all x! € F'.
Proof As rp is bounded let rF < B and choose 4M = [10J3/e] + l. Choose 5 so small that knowing
implies that for all but e/10 in measure of the x' € X' there is an x € X with
f*'HT*(x)) = ?•* {Tf{x%
0<j<M.
Call a choice for this point x(x'). We can and do choose the point x(x') to depend only on the list of values fa'^'(x'), 0 < j < M. Hence there are only finitely many points of the form x(x'). Let Q be the finite partition of X' into sets according to the value /"''*'(x) if it is < B in absolute value, and the complimentary set. Construct a Rokhlin tower for T01' with base C and height M, covering all but e/10 of X' with M-l
C 1 \ / 7%.(Q). Let Co £ C consist of those x' for which there is no value x(x'). Hence
KCo) < toKQ. Partition the remainder of C according to VJIQ' Tf'7(Q) which is the same as to say, according to the value x(x'). Call these sets C\, C2,..., Q . Let x, be the value x(C,), i > 1. For each x, let 0 < j \ < f2 < • • • < j ' m < M
be those indices with T|(x,-) G F. Any two successive values j \ < j \ + x are at least N and at most B apart.
A.I l-sizes
185
To start a definition of F', put
u u ri^ K /(/)-i
i=l r=2
in F'.
To start a definition of <£", for each x' £ C, and each block of points
just set 0" = (f>. Thus, as this is a block between two occurrences of F' and the functions P'4"{x') and f-^(x,) agree on (0, M - 1), we have
For all other points x', those in the tower over Q or outside of the tower, or below level j'2 or above level 7j,(,)_i — 1 in the tower over C,, set
if m is a l+-size
then whenever oti ~ a2,
we also have <x\ ~ 0(2 and any T which is m-f.d. is mA-f.d.
186
Appendix
Proof Corollary A. 1.24 tells us that both {a.\,^>i) and (oe2, $ f ' ) must be m4-Cauchy, completing the result. To finish the other observations, notice that any nicely-blocked sequence of rearrangements is bounded in both the forward and backward directions. Corollary A. 1.14 tells us that we can interpose a y between a.\ and a2 that will be m4-equivalent to both of them. That the notion of m-f.d. implies m4-f.d. is simply that the joining demanded in the definition of m-f.d. in [43] has to be a bounded rearrangment and hence m4-small. • We now want to see that if ai ~ a2 then a.\ ~ <x2. This may seem trivial now as we know that any m4-Cauchy sequences of rearrangements will be m3-Cauchy. The problem is that we do not know that just because (a\,(j)j) is m-Cauchy, and ai>, —> a2 in L1 that then m(ai0;,<X2) —• 0. The way we handle this issue is much like what we already did for the other direction; we introduce convergence criterion. Definition A.1.26. Suppose (X,d) is a metric space and m : X x X —> K + . We say a sequence of functions s
. v"
On . A
>. in>+ —> IK
is a convergence criterion for the function in if for any sequence of values {x,,} in X with
there is a (necessarily unique) point x G X with
Lemma A.1.27. For m a {-size, and (a)m the m-equivalence class of an arrangement a, we know m2 is a metric on (a),,,. Relative to the metric space ((a)m,m2), there are convergence criteria for both m and m2. Proof This construction just abstracts what is done in both Theorem 3.1 and Lemma 4.8 of [43]. To construct the convergence criterion for m, select sets A\,A2,... and bounds K\ < K2 < • • • inductively with (1) n(AN) > 1 - 2-
so that
A.I l-sizes
187
(2) [-N,N] £ / r " ( H W x ) , m , ; n ( x ) ] ) <= [-K N ,K N ] and < m(a,, a/v ) + 2-( w + ". (3) m(nfrUniN{xlmiN(xy)) That m(a,,ajv) = m{f*uaN) ^-a.s. and that / ^ is a bijection tell us we can select such bounds K^ and sets A^. Choose 5N(a\,...,au) so small that if m(aN,aN
+ 1) <
5N(a.\,...,aN)
then for all i < N, (a) m(a,,a /v+ i) < m(a/,a N) + 2" (iV+l) and (b) 0({x : fTaN+'\[-KN,KN] = id}) > 1 - 2-{N+2). That we can do this is a consequence of Lemma 2.6 concerning mo and Axiom (v) both of [43], which requires that all l-sizes essentially dominate moSuppose u\,a.2,--- satisfy m(af+i,(Xj) <
Let K\,Ki,...
and A\,A2,...
Aj = {x:xe
5i(a.i,...,<Xi).
be the associated sets and bounds. Let
AN, N>j
and /_*"•«"+' \[-KN,KN]
= id, JV > j } .
Notice that /i(A7-) > 1 — 2~K For x £ A-}, for each N > j and i < N we will have values n,yvM <-N [-N,N]
<0
< mhm(x),
<= fr»([
and further, for all k > N, f?'at+l\[-Kk,Kt]=id. As the Kk's increase, by composing we see x
+
\[-KN,KN] = l d
and so for all j > N, .v
l[nj,w(.v),mi,jv(x)] — J.v
I K N (*),'",,« (*)]•
As [—N,AT] ^ [i;,iv(x),m,iv(^)] we conclude that there is a 1-1 function / . v with Jv
—* ]x.i-
188
Appendix
As [—N,N] £ / " ' ' " ^ ( K / v M ^ y v M ] ) w e conclude that /.v?, is a bijection. Moreover it is a calculation to see that JX
J X,J — J X,f
Hence defining an arrangement
P(x,T?{x))=fx,lU), we have
p(x, T?(x)) = fxJ(j). It remains only to show that m(a,-,)S) —> 0 which is equivalent to To see this just notice that for x e Xy, rc > j and i < N
2- ( " +1) < 2"\ Letting N —> co we conclude that m(f*'P) < 2~' and we are done. As m1(ai,a2) = m(ai,a.2) once m(ai,a2) < 1 a convergence criterion for m will be one for m' as well, and as m' and m2 are uniformly related, there is a convergence criteria for m2. Lemma A.1.28. For m a l-size and (a)m the m-equivalence class of a, relative to the metric space ((a) a ,m 3 ) there is a convergence criterion for m\ Proof To construct a convergence criterion for m3 select inductively, for a list ai,...,ajv, full-group elements (J)\,...,4>N SO that
!)) + n({x : cpi(x) =/= fcix Let SH be the convergence criterion for m2 and now define <5^(oti, <x2,..., ocjv) = min Suppose now that a, satisfies
Then both 2
...,a,),)
and
•
A.2 p-sizes
189
We conclude that there must be a /? S (a),,, with
/nWf,0)->O and a
•
i
Theorem A.1.29. For m a l-size, if a ~ /? then a ~ /?. Proof The "trick" here is exactly the same as that in Corollary A. 1.14. We give a brief sketch. Restrict the discussion to the metric space ((a),,,4,m4). Remember m4 is the "geodesic" distance, the infimum over sequences of full-group elements leading from one to the other in both directions. What we construct now are two sequences of full-group elements y;, and y;j so that (a, y;,) and (/?, vj) both satisfy the convergence criterion for m3 and
This of course completes the result as both ay;,- and ft\p\ will be converging to a common y to which they are both m3- and hence m-equivalent. The construction follows the same lines as that in Corollary A. 1.14 by successively pulling in from one side and then the other by full-group elements close enough in m4 < nv" to be sure to obtain the next term of the convergence criterion on the other side at the next step. •
A.2 p-sizes In a preliminary version of our work on sizes for discrete amenable group actions we put forward an axiomatization that lies between what we have actually set out here and the axiomatization we used in [25]. What we will do in this section is to state that axiomatization, show that a "size" in those terms gives rise to a size in satisfying our axioms here having precisely the same equivalence classes. We then will present the axiomatization used in [25] and show that a size in those terms (which we call a p-size) gives rise to a size in terms of our older axiomatization for discrete amenable groups. Here is that preliminary axiomatization: a size m, as defined in a
190
Appendix
preliminary version of this work, is a function m from the space of G-rearrangements to R+ that satisfies the following five axioms: Axiom Bl. Given E > 0 there exists 5 > 0 such that if ||$|| p . w . < d then
m(a,
||a,a0||p.w. <£. Axiom B3. Given e > 0 there exists d > 0 such that if m(a, >) < 5 then )~l) < s. Axiom B4. Given e > 0 there exists d > 0 such that if m(cc>, y;) < 5 then m(a, (j)ip) < m(a, 0) + e. Axiom B5. Given E > 0 and rearrangement (a, <j>), there exists 5 = <5(e,a,0) > 0 such that if ||(a,0),(j8,y;)||. < 5 then m(fi,ip) < m(a,0) + e. Suppose m is a size function, in the sense that it satisfies the above five (old) axioms. We will construct a new size function m2 which satisfies the current three axioms, giving the same full-group topology, and the same completion. That is to say, the m-Cauchy sequences in the full-group are exactly the »?2-Cauchy sequences in the full-group. The two steps we take are precisely analogous to those taking m2 to m4 in our work on 1-sizes. It is worth pointing out here that Axiom Bl and Theorem 4.0.2 tell us that we have the analogue of a l + -size in that we can always perturb elements of the full-group by a small amount in m to be bounded. Note that the size function m is a conjugacy invariant, meaning that
-] 4>\p). To each arrangement a and full-group element
/c;o :r->E+ by letting
Axioms Bl and B3 imply that k^ is uniformly continuous with respect to the L1 topology on the full-group. For 0i and 4>2 m the full-group, define
A.2 p-sizes
191
and define
mW)=||fc? d> fcJ|| sup and so
Clearly, /«£(-,•) is a pseudo-metric on the full-group. Axioms Bl, B3 and B4 imply that mx and m\ give rise to equivalent topologies on the full group. In particular, any sequence that is ma-Cauchy is also m^-Cauchy, and vice versa. Axiom B2 tells us that the identity map on the full-group is uniformly continuous from the m' to the L'-metric. All that remains is to obtain Axiom 3 of an r-size, i.e. upper semicontinuity with respect to distribution. We do this in exactly the same way as for 1 -sizes. Define m2 : {G-rearrangements} —> R + by letting m2(a,(j>)=
limsup
tn}(fi,\p).
Also define
That in2 is a pseudometric follows exactly as in Lemma A. 1.21 relating in3 to m4 for 1-sizes, as the copying lemma (Corollary 4.0.9) applies in precisely the same way. Lemma A.2.1. ml and m\ are equivalent metrics on the full-group. That is to say, given any e > 0 there is a 5 so that (1) ifmla((f)],(t)2) < S then m ^ i , ^ ) < e and (2) ifmlW^fc)
< <5 then »4(0i,0 2 ) < £•
Proof To see this just note that one can rewrite Axiom B5 as: Given any (a, (/>), lim sup
m(P, xp) < m(a,
192
Appendix
As we know that m and m' satisfy (1) and (2) above we conclude the lemma. • Notice here that we could have used less than Axiom B5. To conclude that m2 was equivalent to m all we really needed is the following. Axiom B5'. Given any e > 0 there exists 5 so that for any (a, 0) with m(a, (j)) < S then limsup
m(fi,y) < s.
In [25], we presented a framework for restricted orbit equivalence for actions of Z1'. A size function was initially defined on permutations of boxes in Z''. Such a size was required to satisfy six different axioms. This function was then extended to be defined on rearrangements, (a, 4>), in many ways analogous to what happened for a 1-size. We will avoid all the problems we had in dealing with 1-sizes here in that rather than "collapsing" by "pushing-together" (something that does not make much sense beyond Z1 ), we "fill in", analogous to what we did in the proof of Theorem 4.0.2. Let FI,, be the set of all permutations of the box Bn (in Z''), and let n = U,,n,,. According to [25], a p-size (p signifying "permutation") is a function m : n —> [0,1] that satisfies the following axioms. Axiom Cl. (Continuity near the identity) (a) m(id) = 0 for all identity permutations, and (b) for all £ > 0 there is an N and 8 > 0 so that for n > N, n S FI,,, and if |B,,| then m(n) < e. Axiom C2. ("Lumpiness" of small permutations) For all NQ and e > 0, there is an N\ and 5 > 0 so that if n > N\ and n e Tln with m(n) < 3, then \{v s Bn; for some v0 £ BNo,n(v + dp) ± TT(») + ^o}l To make n{v + VQ) well defined we extend n outside Bn as the identity. In such contexts we will always make this choice.
A.2 p-sizes
193
Axioms Cl and C2 are partial converses to one another. Axiom Cl tells us that if a permutation moves very few points, then it must be small. Axiom C2 tells us that if a permutation is small it must move most points in large rigid lumps. Axiom C3. (Continuity of inverses near zero) For all s > 0 there is an N and 5 > 0 so that for n > N and n G FT,, with m(n) < 5, then m(n~x) < e. Axiom C4. (Continuity of composition near zero) For all e > 0 there is an N and 5 so that for n> N and n\,ni G IT,,, if m(7r2) < 5, then m(n2n{) < m(n\) + e. Axioms C3 and C4 simply tell us the kind of uniform continuity we will want of the group operations within each n,,. There are two other natural operations within n and that is the concatenation of permutations on disjoint blocks, and the rearranging of blocks left invariant by a permutation. One of these builds a new permutation on a larger block from smaller ones, the other changes a given permutation to a new one. Our last two axioms concern continuity of such operations. Axiom C5. (Continuity of concatenation near zero) For all e > 0 there is an No and 5 > 0 so that if ti\,ni,...,rik>No ifd\,V2,...,ik G Zrf are such that:
and
(a) Aj = Bn; + dj are disjoint and for some n;
(b) u^/1,- = Bn and ' U ^ ' fil > 1 -8: (c) n £ n,, is such that for all i, n(Ai) = A; and setting n\ G IT,,, to be n,(v) = n(v + Vj) — if we have (d) m(7t,-) < d for all i; we then must have that m(n) < E. The picture of what Axiom C5 requires is that if Bn is almost completely covered by translates of various B,h each of which is fixed by n and if when 7t is viewed as a permutation of each translate it is small, then n itself must be small. Axiom C6. (Continuity of rearranging large lumps near zero) For all s > 0 there is an N and d > 0 so that if N, > N and and Ki,K2,...,KszBn
194
Appendix
(a) the Kj = BNl + dj are disjoint and
(b) there exist ti, S Bn, i = 1,..., s so that Kt + u, = K[ are also disjoint and £ Bn, (c) the permutations n,n' € FT,, are such that n(Kj) = Kj and n'(K-) = K'j and for v £ Kt n'(v + M() = n(v) + ui, (i.e. n acts on Kj exactly as n' acts on K-) and, (d) m[n) < d we then must also have m(n') < E. This last axiom says that smallness of a permutation is a local property. More precisely, if you have a small permutation (in m) that maps a lot of sets K, to themselves, where the K, are all translates of a large box BJV,, and if you simply rearrange where these boxes /C, happen to lie in B,, then the permutation you obtain will still be small. Suppose now that m is a p-size as defined by these six axioms. Suppose a is an arrangement and <j> e FG{(9). Let x e X and let f^ be the associated bijection of 7Ld. Recall that for /i-a.e. x e X for any e > 0 there is an N(x,s) so that for all n > N the permutation n*'f agrees with /*,f all but £ of the "time", i.e. on all but a subset of Bn of density at most s. Define: wV v (a,0) = min max{m(7t), ten,,
H— \tsn\
} and
mx(a,(f>) = liminf m,,v(a, $). Finally define the p-size m on rearrangements by m(u, d>) = ess inf mJoc,
sex
Theorem A.2.2. If m is a p-size as defined above, then m satisfies Axioms B1-B4 and B5' and hence the corresponding m2 is an r-size giving the same equivalence classes. Proof As the axiomatization given in Axioms B1-B5 was derived from the work in [25] we can just about read off everything we want directly. Axiom Bl is precisely the conclusion of Lemma 2.3 of [25]. Axiom B2 is
A.2 p-sizes
195
simply Lemma 2.4 of [25] reworded a bit using the blocks Bn as a Felner sequence. Axioms B3 and B4 are precisely the conclusions of Lemmas 2.5 and 2.6 of [25]. To see that in satisfies Axiom B5' takes a bit of thought. It is essentially embodied in Axiom C5. To see this first notice that although in is defined as a liminf in n, Theorem 2.1 of [25] tells us that if this liminf,, is small then the ih(a,(j)) = limsup,, is also small. More precisely, for any <5i there is a 5 so that if m(a,4>) < <5 then m(a,(j)) < 5\. This means once n is large enough, for all but K/10 of the x S X, there is a permutation n(x) of Bn so that (1) m(n(x)) < 5\ and (2) #({S e Bn : 7t(x)(S) + /£*(«}) < e#B B /10. Suppose a permutation n of a large box BN agrees with some n' on all but a fraction 8\ of BN, and that we obtain n' by covering all but <5i of the BN by disjoint translates of Bn and permuting the points in each by a permutation of m-size less then S\ and applying the identity to the rest of the points. Combining Axioms Cl and C5, if 5\ is small enough, we will have m(n) < fi/2. Combining the ergodic theorem, the strong Rokhlin lemma and a bit of thought one sees that if (/?,!/)) is close in distribution to (a, cj)) then there will be large blocks BN in the orbit of a.e. x' where n^ will be as described as n is described above. We conclude that mx{/}, \p) < £ and the result follows. •
Bibliography
[1]
L. M. Abramov, On the entropy of a flow, Dokl. Akad. Nauk. SSSR 128 (1959), 873-875. [2] W. Ambrose, Representation of ergodic flows, Ann. of Math. 42 (1941), 723-739. [3] W. Ambrose and S. Kakutani, Structure and continuity of measurable flows, Duke Math. J. 9 (1942), 2 5 ^ 2 . [4] M. Brin and G. Stuck, Introduction to dynamical systems, unpublished manuscript, contact the authors. [5] D. L. Cohn, Measure theory, Birkhauser, Boston, Mass., 1980. [6] A. Connes, J. Feldman, and B. Weiss, An amenable equivalence relation is generated by a single transformation, Ergodic Theory Dynamical Systems 1 (1981), no. 4,431-450. [7] I. P. Cornfeld, S. V. Fomin, and Ya. G. Sinai, Ergodic theory, Springer-Verlag, 1982. [8] A. del Junco, A. Fieldsteel, and D. J. Rudolph, a-equivalence: a refinement of Kakutani equivalence, Ergodic Theory Dynamical Systems 14(1994), no. 1,69-102. [9] A. del Junco and D. J. Rudolph, Kakutani equivalence of ergodic Z" actions, Ergodic Theory Dynamical Systems 4 (1984), no. 1, 89—104. [10] H. A. Dye, On groups of measure preserving transformations I, Amer. J. Math. SI (1959), 119-159. [11] J. Feldman, New K-automorphisms and a problem of Kakutani, Israel J. Math. 24(1976), no. 1, 16-38. [12] A. Fieldsteel, Factor orbit equivalence of compact group extensions, Israel J. Math. 38 (1981), no. 4, 289-303. [13] A. Fieldsteel and N. Friedman, Restricted orbit changes of ergodic TLd actions to achieve mixing and completely positive entropy, Ergodic Theory Dynamical Systems 6 (1986), no. 4, 505-528. [14] A. Fieldsteel and D. J. Rudolph, Stability of m-equivalence to the weak Pinsker property, Ergodic Theory Dynamical Systems 10 (1990), no. 1, 119-129. [15] , An ergodic transformation with trivial Kakutani centralizer, Ergodic Theory Dynamical Systems 12 (1992), no. 3, 459-478. 196
Bibliography [16] [17]
[18] [19] [20]
[21] [22] [23]
[24] [25]
[26] [27] [28]
[29] [30]
[31] [32] [33] [34] [35] [36] [37]
197
J. R. Hasfura-Buenaga, The equivalence theorem for Z^-actions of positive entropy, Ergodic Theory Dynamical Systems 12 (1992), no. 4, 725-741. B. Hasselblatt and A. Katok, Introduction to the modern theory of dynamical systems, Encyclopedia of Mathematics and its applications, vol. 54, Cambridge University Press, 1995. D. Heicklen, Bernoullis are standard when entropy is not an obstruction, Israel J. Math. 107 (1998), 141-155. , Entropy and r-equivalence, Ergodic Theory Dynamical Systems 18 (1998), no. 5, 1139-1157. C. Hoffman and D. Rudolph, Uniform endomorphisms which are isomorphic to a Bernoulli shift, preprint submitted to Annals of Mathematics. K. Jacobs, Measure and integral, Academic Press, New York-London, 1978. S. Kakutani, Induced measure preserving transformations, Proc. Imp. Acad. Tokyo 19 (1943), 635-641. J. W. Kammeyer, A classification of the isometric extensions of a multidimensional Bernoulli shift, Ergodic Theory Dynamical Systems 12 (1992), no. 2, 267-282. , A classification of the finite extensions of a multidimensional Bernoulli shift, Trans. Amer. Math. Soc. 335 (1993), no. 1, 443-457. J. W. Kammeyer and D. J. Rudolph, Restricted orbit equivalence for ergodic Z'1 actions I, Ergodic Theory Dynamical Systems 17 (1997), no. 5, 1083-1129. A. Katok, Monotone equivalence in ergodic theory, Izv. Akad. Nauk. SSSR Ser. Mat. (1977), no. 1, 104-157, 231. , The special representation theorem for multi-dimensional group actions, (1977), 117-140, Asterisque, No. 49. A. N. Kolmogorov, A new metric invariant of transient dynamical systems and automorphisms in Lebesgue spaces, Dokl. Akad. Nauk SSSR 119(1958), 861-864. , Entropy per unit time as a metric invariant of automorphisms, Dokl. Akad. Nauk SSSR 124 (1959), 754-755. W. Krieger, On the Araki-Woods asymptotic ratio set and non-singular transformations of a measure space, (1970), 158-177. Lecture Notes in Math, Vol 160. D. Lind, Locally compact measure preserving flows, Advances in Math. 15 (1975), 175-193. D. S. Ornstein, Bernoulli shifts with the same entropy are isomorphic, Advances in Math. 4 (1970), 337-352. , Factors of Bernoulli shifts are Bernoulli shifts, Advances in Math. 5(1970), 349-364. , Two Bernoulli shifts with infinite entropy are isomorphic, Advances in Math. 5 (1970), 339-348. , Ergodic theory, randomness and dynamical systems, Yale University Press, 1974. D. S. Ornstein and P. Shields, An uncountable family of K-automorphisms, Advances in Math. 10 (1973), 63-88. D. S. Ornstein and B. Weiss, Entropy and isomorphism theorems for actions of amenable groups, J. Analyse Math. 48 (1987), 1-141.
198
Bibliography
[38]
D. S. Ornstein, B. Weiss, and D. J. Rudolph, Equivalence of measure preserving transformations, Mem. Amer. Math. Soc. 37 (1982), no. 262. [39] K. Petersen, Ergodic theory, Cambridge University Press, 1983. [40] S. Polit, Weakly isomorphic maps need not be isomorphic, Ph.D. thesis, Stanford Univ., 1974. [41] V. A. Rohlin, On the fundamental ideas of measure theory, Amer. Math. Soc. Translation 1952 (1952), no. 71, 1-54. [42] A. Rothstein and R. Burton, Isomorphism theorems in ergodic theory, unpublished manuscript. [43] D. Rudolph, Restricted orbit equivalence, Mem. Amer. Math. Soc. 54 (1985), no. 323. [44] , Fundamentals of measurable dynamics, Oxford University Press, 1990. [45] D. J. Rudolph, A two-valued step-coding for ergodic flows, Math. Z. 150 (1976), no. 3, 201-220. [46] , Classifying the isometric extensions of a Bernoulli shift, J. Analyse Math. 34 (1978), 36-60. [47] , Markov tilings ofR" and representations o/'K" actions, (1989), 271-290. [48] A. Sahin, Tiling representations of K2 actions and a-equivalence in two dimensions, Erg. Th. & Dyn. Sys. 18 (1998), no. 5, 1211-1255. [49] E. Sataev, An invariant of monotone equivalence that determinines the factors of automorphisms that are monotonely equivalent to the Bernoulli shift, Izv. Akad. Nauk SSSR Ser. Mat. 41 (1977), no. 1, 158-181, 231-232. [50] P. Shields, The theory of Bernoulli shifts, University of Chicago Press, 1973. [51] Ya. G. Sinai, On the concept of entropy for a dynamical system, Dokl. Akad. Nauk SSSR 124 (1959), 768-771. [52] , On a weak isomorphism of transformations with invariant measure, Mat. Sb. 63 (1964), 23-42. [53] A. M. Stepin, The entropy invariant of decreasing sequences of measurable partitions, Funkcional. Anal, i Prilozen 5 (1971), no. 3, 80-84. [54] C. E. Sutherland, Notes on orbit equivalence; Krieger's theorem, vol. 23, Univ. of Oslo Lecture Notes, 1976. [55] J.-P. Thouvenot, Quelques proprietes des systemes dynamiques qui se decomposent en un produit de deux systemes dont l'un est un schema de Bernoulli, Israel J. Math. 21 (1975), no. 2-3, 177-207. [56] , On the stability of the weak Pinsker property, Israel J. Math. 27 (1977), no. 2, 150-162. [57] V. S. Varadarajan, Measures on topological spaces, Amer. Math. Soc. Transi, Ser. 2 48 (1965), 161-228. [58] A. Vershik, A theorem on lacunary isomorphism of monotonic sequences of partitionings, Funkcional. Anal, i Prilozen 2 (1968), no. 3, 17-21. [59] , A continuum of pairwise nonisomorphic dyadic sequences, Funkcional.
[60] [61] [62]
Anal, i Prilozen 5 (1971), no. 3, 16-18.
, Theory of decreasing sequences of measurable partitions, St. Petersburg Math.' J. 6 (1995), no. 4, 705-761. J. von Neuman, Proof of the quasi-ergodic hypothesis, Proc. Nat. Acad. Sci. USA 18 (1932), 70-82. P. Walters, An introduction to ergodic theory, Springer-Verlag, New York, 1982.
Index
y/, 14 a-equivalences, 9, 39 a-related, 42 A, 39 Abramov's formula, 7, 9 amenable group, 10 arrangement(s), 13, 24, 106 almost-, 109, 114 complete L1 metric on, 17 def, 13, 14 L1 metric on, 17 8$, 33
Belinskaya's theorem, 10 Bernoulli shift, 6 V(m,L<2), 133 cocycle, 13 conjugacy, 35 Connes, 13 copying lemmas, 5, 8, 65-90 first basic, 72 coupling(s), 133 cylinder set, 53 D(/(l,/(2), 19 Da, 39
del Junco, 9 Dye, 5, 6, 8 theorem, 36, 65 £,„(«), 27, 28, 31 £,„(«), 27 »., 161 e(a,tp), 46 endomorphisms, 47 Bernoulli, 47 uniformly p — 1, 47 entropy, 5, 7, 8 m-entropy, 91
-free size, 99 -preserving size, 99 as a size, 45 classical, 54-64 conditional, 58-64 upper semi-continuity of, 63, 64 entropy-free, 5 equivalence m-, 20 equivalence class(es) m, 27 ergodic, 13 ergodic theorem, 5, 10 examples of sizes, 32 a non-example, 47 a-equivalences, 39 conjugacy, 35 entropy, 45 Kakutani equivalence, 37 orbit equivalence, 35 Vershik equivalence, 43 F, def, 33 Fi, 121 /-metric, 7 F, 125 {//}. " 9 /"•*, 15
fx4, 15 Feldman, 7, 13 Fieldsteel, 9, 10 finitely determined, 8 7,48 m-, 7, 10, 150 Folner sequence, 16 free action, 13 Friedman, 10 full-group, 20 completion, 22 def, 14
199
200 full-group (cont.) equivariant contraction of, 25 isometry of right multiplication, 23 pseudometric ma on, 21 strong L' distance on, 18 weak V distance on, 18 T, 14, 21
f\,, 22, 24, 25, 28 'S, 16, 17, 33, 107, 108 g(a.<£), 67 H(f), 15, 113 h(T,P), 54, 57 h(T,P\Q), 59 " 15 /,"/ 15 lh,(T*), 91 Hasfura-Buenaga, 9 Heicklen, 6, 43, 45 History and references, 5 Hoffman, 11 J, def, 31 invariant (<5,K)-, 49 sufficiently, 49 isometry ma, 23 J(T 1 ,T 2 ), def, 105 JC(T*',T*2), 132 J m (T, ai ,T 2 a2 ), 108, 137 joining, 8 m-, 9, 137 in the loose sense, 138 in the very loose sense, 138 overview of the topology, 107 C-orbit, 132, 136 def, 105 one-sided, 48 orbit, 112, 113, 135 restricted orbit, 113 K, def, 38 K5, 39, 42 K-systems, 6 Kakutani equivalence, 7-9, 11, 36, 37 even, 9 Katok, 7, 9 cross-section, 39 Kolmogorov, 5 Krieger, 5
Index Lebesgue space, 13 loosely Bernoulli, 42 J/(X), def, 101 .,//'", 131 J/m(Z), 129 J/'${Z), def, 129 J/](X), def, 101 „//,,, 107 J/e(X), def, 104 J/S(S), def, 104 m, def, 21 mentropy, 91 equivalence, 9, 24, 27, 29, 32 equivalent -27 finitely determined, 152, 153 m-joining, 137 in the loose sense, 138 in the very loose sense, 138 m(a,P), 28 m(/i), def, 131 m°, 35 m',35 mK, 39 nf, 46 m D 5, 40 mr, 45 ma, 21
ma-Cauchy, 22, 24 ma-isometry, 23 ">a(0l,<W, def, 21 "V, 21, 129 „//., 107, 124, 125 J/e(&), 123 J/.., 107, 126
Jf..{Z), 128 J/"', 107 joining m - , 100 mdistance, 151 evaluation, 151 non-singular, 11 1-size, 9 orbit equivalence, 9, 35 def, 13 resticted, 8 restricted, 5, 10 orbit joining, 112 orbit relation, 13, 14 non-singular, 11 Ornstein, 6, 7, 10, 49
A5(T),40 IIPi,^2IIi, 6 6
P, 25, 28
Index P, 29 Pm.a, 22, 24, 28 P
general usage, 106 partition finite, 52 metric, 53 perturbation
k,J,S; 139 def, 140 Polish action, 103 Polish space, 29, 32, 33, 100-105 and residuality, 101 definition, 100 examples, 101 of measures, 101 residuality, 103 subspaces of, 101 Polit, 6 pre-Bernoulli, 12 process, 53 processl-valued, 150 pseudometrics ma, 20 family of, 20 a., 20 q, 120 V, 126 if*, 15 quasi-tile, 50 <;-, 50 quasi-tiling theorem, 50 quasi-tower maps an <:-, 51 Rit 124 Rij, 124 3f, 118 Sfi, 118 S>, 107 def, 113
k, 118
@N, 118 rearrangement(s), 13, 24, 106 complete L1 metric on, 20 def, 14 distribution pseudometric on, 19, 21 metrics on, 16 respects the tower, 68 References History and, 5 residual subset, 8 restricted orbit equivalence, 5 reverse filtration, 7 def, 43 exact, 44
201
standard, 7 uniform, 44 Roadmap to the text, 3 Rokhlin lemma, 5, 10, 13, 50, 51 Ornstein-Weiss, 51 Rokhlin tower e-,52 Rothstein, 8 Rudolph, 11 S", 108 S ' , 108 Sahin, 43
Sataev, 7 semigroup, 11 Shannon-McMillan theorem, 10 Ornstein-Weiss, 56 Shields, 6 shift action, 15 <j ? , 15
Sinai, 5 Sinai theorem, 6, 8, 161, 163 Ornstein-Weiss, 93 size, 20 3+, 24, 27, 34 axioms, 21 entropy-free, 91, 97, 99 entropy-preserving, 10, 91 def, 99 examples of, 32 integral size, 34 size kernel, 36 def, 33 Stepin, 7 T, P-name, 54
T,P,F-name, 54 f distance, 48 Thouvenot, 10 relativized theory, 11 tower image, 66 painting names on, 66 partition, 71 slices, 66 tree automorphism, 47 tree isomorphism, 47 Vershik, 6-8 equivalence, 43 lacunary isomorphism theorem, 45, 65 related, 44 weak isomorphism, 8 weak Pinsker property, 10, 12 weak*-topology, 18, 53, 54, 67 weakly infinitely determined, 154 Weiss, 7, 10, 13,49 Zd, 10