WORD PROBLEMS I1 The Oxford Book
Edited by
S. I. ADIAN
W. W. BOONE
7he Steklou Mathematical Institute
University of Illinois Urbana, USA
Moscow, USSR
G. HIGMAN University of Oxford Oxford. England
1980
NORTH-HOLLAND PUBLISHING COMPANY A M S T E R D A M . N E W YORK . O X F O R D
@ NORTH-HOLLAND PUBLISHING COMPANY - 1980
All rights reserved. No part of this publication may be reproduced, stored in a retrieval system, or transmitted, in any form or by any means, electronic, mechanical, photocopying, recording or otherwise, without the prior permission of the copyright owner.
ISBN: 0 444 85343 X
Published by: North-Holland Publishing Company - Amsterdam . New York . Oxford Sole distributors for the U.S.Aa n d Canada: Elsevier North-Holland, Inc. 52 Vanderbilt Avenue New York, N.Y. 10017
Library of Congress Cataloging in Publication Data
Main entry under t i t ) e : Word problems 11.
(Studies i n l o g i c and t h e foundations o f mathematics V-
,851.
This book grew o u t o f t h e working conference 'Decis i o n problems i n algebra' held i n Oxford t h e summer o f 196.'' Bibliography: p. 1. Groups, Theory of--Congresses. 2. G6del's I. Adian, S. I. 11. Boone, theorem--Congresses. W i l l i a m W. 111. Higman, Graham. IT. Series. ~ 7 1 . ~ 8 6 5I2l.22 79-1P76 ISBN 0-444-85343-X
PRINTED IN THE NETHERLANDS
;
Dedicated to the memory of Kurt Godel (1906-1978) in awe and affection
In angusto vivebamus, si quicquam esset cogitationibus clausum. Seneca, Letters 55.11
INTRODUCTION This book grew out of the working conference “Decision Problems in Algebra”, held in Oxford the summer of 1976, under the auspices of the Science Research Council of the United Kingdom. This work is a sequel to the volume “Word problems: decision problems and t h e Burnside problem in group theory”, which itself was the result of a similar working conference held in Irvine the summer of 1969. The Oxford conference was organized by t h e editors of this volume. The secretary of the conference was Donald J. Collins, without whom the entire endeavour could not have been successfully carried out. What is to be said about t h e present book? Like t h e Irvine book, a major intention is that it serves as a means of entry for the reader into the field of word problem. For this reason we have included various surveys; but, moreover, it is hoped that various articles which, while on the very borderline of advancing ideas, have still been presented in such a way as to be highly accessible to the working mathematician. The field of “Word problems” would seem to have flourished between Irvine and Oxford. Unsolvability results have been sharpened in various ways. Thus, as explained in an expository article herein, various classical decision problems about groups have been attacked, but with the class of groups considered now restricted to some familiar variety; and in certain cases this has led to new unsolvability results e.g., in the case of solvable groups. Now, too, the full story is known about the existence of word problems of the various finer degrees of Post. But positive results have not been lacking either. Indeed, we include an article solving t h e conjugacy problem for matrices with integer entries. (This was problem 22 of t h e Irvine book.) A short article on algebraically closed groups appeared in t h e Irvine book, but in subsequent years, this has become a large area of inquiry both with regard to decision problems and with regard to purely algebraic questions. An in-depth study and a shorter paper are included. Much the same thing could be said about a focus of interest on simple groups within the word problem field, and we include an article on this matter also. Small cancellation theory and generalizations (surveyed in the Irvine book) have truly come of age. We include two papers in the general area, one of which solves a well-known problem of Kurosch and Bjarni J6nsson. vii
viii
Introduction
A brief list of open questions is given as well. We call t h e reader’s attention to the note by J.L. Britton, “Erratum: The existence of infinite Burnside groups” in which he says that a mistake occurs in his article in the Irvine volume. This mistake was first noted in the book “The Burnside problem and identities in groups”, by S.I. Adian (Nauka, Moscow, 1975), page 4. The existence of infinite Burnside groups of large odd exponent was proved in a joint article by P.S. Novikov and Adian in 1968. The much stronger proposition of the announcement of Novikov in 1959 must be regarded as not proved. We wish to thank all the mathematicians who have helped us by refereeing individual articles, but to single out here, by name, only Gerhard Hesse for his especially valuable help in this regard. Originally, this volume was intended as a Festschrift to mark the seventieth birthday of Kurt Godel. Now, sadly, we can only dedicate it to his memory. Sergei I. Adian William W. Boone Graham Higman
S.I. Adian, W.W. Boone, G . Higman, eds., Word Problems I1 @ North-Holland Publishing Company (1980) 1-16
MODULAR MACHINES, THE WORD PROBLEM FOR FINITELY PRESENTED GROUPS AND COLLINS’ THEOREM StAl AANDERAA and Daniel E . COHEN University of Oslo and Queen Mary College, London
We shall define a class of machines which we call modular machines, related to Minsky machines [15] (these were called r-machines in the lecture by the first author on which this paper is based). These machines act on NZ,the set of pairs of natural numbers, with a very simple transition function. It will then be almost immediate that any function computable by a modular machine is partial recursive. On the other hand, modular machines are defined in such a way that Turingcomputable functions are computable by modular machines (which can also be regarded as a new way of Godelising Turing machines). This provides a new proof that Turing-computable functions are partial recursive. It also provides an easy proof of the normal form theorem for partial recursive functions, since t h e data for a modular machine, being numerical in nature, can easily be encoded by a natural number with t h e decodings being primitive recursive. There will be a modular machine M,, whose halting problem is unsolvable. Readers are invited to construct their own proof of t h e undecidability of elementary number theory using this machine M,, and to see how it compares in difficulty with standard proofs. (Various results on the degrees of halting, word and confluence problems for modular machines, similar to known results for Turing machines, are proved in [9] but are not needed in this paper.) However the main aim of this paper is not to apply modular machines in computability theory and logic, but to give some of their applications in group theory. In this and t h e following paper t h e grouptheoretic results are all known, but the new proofs are very much easier than any previous proofs. We prove the following theorems.
Theorem A (Novikov-Boone). There is a finitely presented group whose word problem is unsolvable. Theorem B (Collins). For any recursively enumerable unbounded truth table degree, there is a finitely presented group whose word problem has that degree. 1
S.Aanderaa, D.E. Cohen
2
Theorem A was first proved by Novikov [16] and Boone [2] independently. The original proofs were somewhat complicated and lengthy, but were later simplified. In particular, Britton [5] made very important simplifications by proving results about HNN extensions. Lemma 4 of his paper is now referred to as Britton’s Lemma and is a very important tool in combinatorial group theory, which will be used in our papers also. Our proof of Theorem A has t h e same basic idea as Britton’s, but is very much easier because of the use of modular machines rather than Turing machines. Other proofs of Theorem A are known. It can be proved using Higman’s embedding theorem [0, 131. A very different proof from any of the others has been given by McKenzie and Thomson [14]. A weaker form of Theorem B, with “unbounded truth-table degree” replaced by “Turing degree” was proved independently by Boone [3, 41, Clapham [6], and Fridman [ l l , 121. Collins [lo] was able to improve Boone’s proof to obtain Theorem B itself. In $1 we develop the theory of modular machines. Theorem A is proved in 02. In 54 we prove Theorem B, having obtained the relevant degree results for modular machines in 03. The paper following this used similar techniques to prove the Higman embedding theorem. 01. Modular machines
We shall follow Turing and consider Turing machines as defined by quintuples q,a,a,q,D (where a,, a, are letters, q,, qs are states, and D is one of the two symbols L, R ) instead of the more common definition by quadruples due to Post. The two definitions are equivalent, a non-moving quadruple q,aa ’qs being replaced by a quintuple q,aa’q,s,R (where q,s. is a new auxiliary state) together with quintuples qrsaxxqsLfor all letters x. We shall use the word configuration rather than “instantaneous description”. We write uqau for the configuration with uau on the tape (u , u being words, a being a letter), t h e machine being in state q scanning letter a. Let T be a Turing machine. We regard its alphabet as consisting of the natural numbers 0 , 1 , . . . ,n (where 0 is the blank) and its states as consisting of n i- 1 , . . . , m - 1 and possibly 0. Take a configuration . . . b l b o q a c , , c I. . Define u, u by u = C. b m ’ , u = C c,rn Minsky [ 151 represents this configuration by (u, u, a, q ) E N4. It can also be represented by either of the pairs (urn + a, urn + q ) or (urn + q, um + a ) . We shall use whichever pair is most convenient at the time, sometimes using one pair and sometimes the other for t h e I.
3
Modular machines I
same configuration. Notice that both mappings from configurations to pairs are recursive. Also P ( T ) = {(a,p ) E N Z ;(a, p ) represents a configuration} is recursive and the map sending (a,p ) to t h e corresponding configuration is partial recursive with domain P ( T ) . Let qaa’q‘R be a quintuple of T. It is easy to see that (with u, u as above) one element of N Zcorresponding to the next configuration is (urn’+ a’m + q ’ , u ) . Similarly if qaa‘q’l is a quintuple of T then (u, urnz + a ’ m + 9’)is a pair corresponding to t h e next configuration. This motivates the definition of a modular machine. Definition. A modular machine M consists of an integer m > 1, an integer n with 0 < n < m, and quadruples (a, b, c, R ) and (a, b, c, L ) such that 0 S a, b < h,0 S c < m ’, and, for each a, b, at most one quadruple begins with the pair a, b.
p
A configuration of M is an element (a,p ) of N 2 . Write a = um
+ b, where
+ a,
0 S a, b < m. If n o quadruple begins with a, b we call (a,p ) terminal. If (a, p ) is not terminal we say (a, p ) yields ( a ’ , p ’ ) ,written ( a , p ) 3 (a‘,p‘), if either M has a quadruple ( a , b, c, R ) and a ’ = urn2+ c, p’ = u or M has a quadruple (a, b, c, L ) and a ’ = u, p’ = urn2 + c. If ( a I ,PI) 3 (a2, p2) 3 * . * 3 (at, pk) we write ( a , ,p l ) + ( a k , p k ) . We define a (partial) function gM : N --+ N 2 by g M ( a ,p ) = (a’, p’) iff (a, p ) + (a’,p ’ ) and (a‘,p ‘ ) is terminal. The class of functions { g M ; all M } is rather strange. For if gM is somewhere defined, there is a pair a, b such that n o quadruple begins with a, b. T h e n g M ( a , p ) = (a,P) if a = a (mod m), p = b (mod m). We shall use t h e integer n (which has played no part as yet) to define input and output functions. The name “modular machine” is given because t h e action of M on a pair depends on its class modulo m. For any r E N we can write r .uniquely as r = C i b,n’, 1 S b, S n (for r = 0, take the empty sum). In a Turing machine with alphabet 0,1,. . . , n, this means that b, . . b,, is the tape description corresponding to r. Let iM : N 4N 2 be given by iMr = (C b,mI, n + 1). Write a E N as C c,m’, 0 6 c, < m , and take y with c, = 0 but c, # 0 for j < y. Define uM : N 2 + N by u M ( a p, ) = X:a,n’-I. Then uMgMMiM is a partial function from N to N , which we call the function computed by M. We shall associate with a Turing machine T a modular machine M such that M and T compute the same function. We shall use a slightly unusud definition of the function computed by T. The class of Turing computable functions is (without using their characterisation as partial =
urn
4
S. Aanderaa, D.E. &hen
recursive functions) easily seen to be the same for both definitions. The reader who prefers a more usual definition should change the definition of the input function iM and the output function uM accordingly. Let T be a Turing machine with alphabet 0, 1,.. . , n, and states n + 1,. . .,m - 1 and (perhaps) 0. Each r E N has a tape description as above. The output of a configuration will be the integer corresponding to the portion of the tape lying strictly between the scanned square and the first blank to its left. The function fT : N + N computed by T is defined by fTr = s if T when started in state n + 1 on the rightmost square of the description of r ultimately halts in a configuration with output s. Note that, if convenient, we may modify T so that whenever it halts the scanned square is blank. We now define a modular machine M associated with T. M will have the integers m, n previouslyAe6ned for T.M will have two quadruples (a, q,a’rn + q’,R (or L))and (4,a, a‘rn + q’,R (or L)) corresponding to each quintuple qaa’q’R(or L ) of T. The account preceding the definition of modular machines explaining how to associate members of N 2with configurations of T makes it clear that M simulates T. Precisely, if (a,p ) corresponds to C then (a,p ) is terminal iff C is terminal, while if C 3 C’ then (a, p ) 3 (a’,p ‘ ) where (a’,p ’ ) is a pair corresponding to C’. The definition of the functions computed by T and M now makes it clear that T and M compute the same function. The numerical nature of M makes it obvious that the function computed by M is partial recursive, so we have a proof that Turing computable functions are partial recursive. We could also obtain the normal form theorem fairly easily by a Godel numbering of modular machines. ) Define H f l ( M )to be 0 if (0,O) is not terminal for M and H f l ( M = {(a,p ) ; (a, p ) + (0,O)) if (0,O) is terminal. It is obvious that H o ( M ) is r.e. For any r.e. set S there is a Turing machine T such that fT is the (partial) characteristic function of S. Further we may assume that if f T r is defined then T halts on a blank tape (see [18], where T is constructed to simulate the action of a single-register machine computing the function). If M is t h e modular machine associated with T,plainly H o ( M ) is not recursive if S is not recursive. A stronger result is proved in 03. 02. Unsolvability
We begin with an account of HNN extensions and Britton’s Lemma
Modular machines I
5
for the reader who is unfamiliar with these topics. A knowledge of free groups and free products will be assumed. Let A , , A _ , (for i in some index set Z which need not be countable) be subgroups of a group H. Let cp, : A, + A - , be isomorphisms. Let G be the free product H * * ( p , ) (i E I). Let N be the normal closure in G of the set of elements p;la,pi(cp,a,)-lfor all i and all aiE A,. The quotient group GIN is called t h e HNN extension of H with stable letters p , and associated pairs of subgroups A, and A - , ; it is usually written as ( H , p , ; p ; ' A , p , = A - , ) .Any element of G can be written as
. hn-lp::hn, where n 2 0 , h ,,..., hn E H and hop::hI
* .
,..., E , = + I . If for some r we have i, = i,,', E , + = 0 and either E , = - 1 and h, E A,, or E , = 1 and h, E A _ , ,we say this expression has a pinch (more precisely, that it has a p,,-pinch). In such a case t h e same element of GIN is represented by an expression with fewer p-symbols (since we can replace p ; ' a p , with a E A, by cp,a without altering the image) and we call this process pinching out. We may now state Britton's Lemma (for a proof see [7] or [S]). Britton's Lemma. A n y non-trivial element of G lying in N has a pinch. The following results are immediate consequences (and are all we need in this section; the full force of Britton's lemma is used in 54). (I) H embeds in the HNN extension (by t h e map H -,G + G I N ; we regard this as inclusion). (11) If K is a subgroup of H such that cp, (K n A , ) = K r l A _ , for all i then H n ( K ,p a(all i ) ) = K. For it is easy to check that if a pinch in G can be applied to an element of t h e subgroup ( K , p , (all i ) ) of G the hypotheses ensure that the new element still lies in this subgroup. Hence any element of t h e subgroup ( K , p , )of GIN can be represented in a form without pinches. By Britton's Lemma it cannot lie in H if any p , occurs, i.e. if it is in H it is in K. (It also follows that the map from the HNN extension of K with associated pairs K n A, and K fl A _ , to the group ( K , p , )is an isomorphism. This part is used in [O], not in this paper.) (111) In t h e group ( H , p ; p - ' A p = A ) , the isomorphism being t h e identity, p - l h p = h (with h E H) o n l y if h E A . Let A be the group ( t , x, y ; x y = y x ) = ( t ) * ( x , y ; x y = y x ) , which is also an HNN extension of the free group ( r , x ) with stable letter y . Let T = ( t ) ^ (the normal subgroup of A generated by t ) and let t ( r , s ) = y -'x-'tx'y '. We easily obtain the following properties.
S. Aanderaa. D.E. Cohen
6
(i) T is free with basis { t ( r , s)}. This follows from the KuroS subgroup theorem, but is more easily proved directly by expanding a product of elements t ( r , s). (ii) T f l ( t ( i , j ) ,x"', y " ) has basis { t ( r , s); r = i mod rn, s = j mod n } . These elements are plainly contained in the subgroup, and any element of ( t ( i , j ) , x " , y " ) is ux'"'yp" for integers a, p and some element u of ( t ( r , s); r = i mod m , s = j mod n ) , whence the result. (iii) The subgroups ( t ( i , j ) , x " ' , y " )for an m , n # O and any i , j are all isomorphic. For this subgroup is a conjugate of ( t , x " , y " ) = ( t ) * (x"', y " ) which is isomorphic to A. (iv) Let rn, n, rn,, n l # 0 and i,j, i l , j l arbitrary. Then the map r ( i , j ) H t ( i l , j l ) , x"' H y" H induces isomorphisms of ( t ( i , j ) , x " ' , y " ) with ( t ( i l , j l ) , x ' " ~ , yand " ~ ) of T n ( t ( i , j ) , x " , y " ) with T f l (t(il,jl), ~ ' " y1",~ ) This . is immediate from (iii) and (ii), t h e latter isomorphism sending t ( u m + i, urn + j ) to ((urnl+ i l , unl + jl). Now let M be a modular machine with quadruples (a,,b , , c , , R ) for i E Z and (a,, b,, c,, L ) for j E J. Let T ( M )= ( t ( a ,p ) ; (a,p ) E Ho(M)). Let B ( M ) be t h e group (A, r,, 1,; r;'t(a,, b,)r, = t(c,,O), r;'x"r, = x m Z , r;'y"r, = y, l;'t(u,, b,)l, = t ( 0 , c,), l~'x"'1,= x, lJly"1, = y m 2 for all i E I, j E J). By (iii) this is an H N N extension of A with stable letters r,, 1,. (v) Let (a, b, c, R ) be a quadruple of M with corresponding stable letter r. Then under t h e isomorphism of the associated subgroups their subgroups T ( M ) n ( t ( a b, ) , x " , y " ' ) and T ( M ) f l ( f ( c , O ) , x m 2 , are y) isomorphic. For by (iv) t h e isomorphism sends t(a,p ) to t ( a l ,PI) where a = a, p = b mod rn and (a,p ) 3 ( a I PI). , Then (a,p ) E H o ( M ) iff ( a l ,PI)E H o ( M ) .The similar result holds for L. (vi) A fl ( T ( M ) ,r,, 1, ; all i, j ) = T ( M ) . This follows from (11), using (v). (vii) ( T ( M ) ,r,, 1, ; all i, j ) = ( t , r,, 1, ; all i, j ) . We show r(a, p ) E ( t , r,, 1,) for (a,p ) E H o ( M ) by induction on the length of the computation. If (a,p ) is terminal we have a = /3 = 0 as (a,p ) E H o ( M ) .Let (a,p ) j ( a I PI), , and let a = a, p = b mod m where (a, 6, c, R ) is quadruple with corresponding stable letter r (similarly for L and I ) . Then t ( a l ,PI)E ( t , r,, 1,) by induction and r - ' t ( a , P ) r = t ( a l ,PI), giving the result. ~"'1,
y"l
Theorem 1. Let G ( M )= ( B ( M ) ,k ; kt = tk, kr, = r,k, kl, = ljk, all i,j). Then G ( M ) is finitely presented and H o ( M ) is Turing reducible to the word problem for G ( M ) . Theorem A follows immediately, since we can choose M with H o ( M ) unsolvable .
Modular machines 1
7
Proof. G ( M ) is plainly finitely presented. Also G ( M ) is an HNN extension of B ( M ) with stable letter k. By (111), k f ( a ,p ) = t(a,P ) k iff t ( a ,p ) E ( t , r,, 1,). By (vii) and (vi) this holds iff (a,p ) E H o ( M ) , proving the theorem.
T h e most natural way of showing the existence of finitely presented groups whose word problem has arbitrary r.e. Turing degree would be to reduce the word problem for G ( M )to H o ( M ) .G(M)is nice enough for this to be possible, using the Bokut normal form; this is done in an unpublished paper by K. A. Kallorkoti. W e shall prove a stronger result in $4. 93. Modular machines revisited It is frequently inconvenient to have two configurations of M corresponding to one of T. Suppose q is a state such that no quintuple ends with qL.Then if a configuration with state q occurs in a computation the corresponding element of N Zis (urn + q, urn + a ) , with the previous notation. Consequently we need only associate this pair (and not the pair (urn + a, vrn + 4 ) ) with the configuration. Also corresponding to a quintuple qaa’q’R (or L ) we need take only one quadruple in M , namely (4, a, a’m + q ’ ,R (or L)). Similarly if no quintuple ends with qR. In particular if T is a directed-state machine, i.e. n o state q occurs both in a quintuple ending with qR and in one ending with qL, then we need only associate one pair with each configuration. Any Turing machine can be replaced by a directed-state machine computing the same function. If qo is a state of the Turing machine T, and there is a quintuple beginning qoaodefine H o ( T )to be 0; if there is no such quintuple then the configuration Co,of a blank tape in state qO,is terminal, and we define H o ( T ) as {C;C + Co}. We call the state qo special if the following conditions hold: no quintuple ends with qoR, the only quintuples ending with qoL are q*aaqoL for all a and some fixed state q*, and n o quintuple ends with q*L. A s above, in this case we only take one pair corresponding to configurations in states qo or q * , and only take one quadruple of M corresponding t o each quintuple starting with qo or q * . For convenience when regarding states as integers, we take q* as rn - 1. Many complications occur because 0 may stand for the letter a. or the state qo; requiring qo t o b e special is one way of managing the complications.
8
S. Aanderaa. D.E. Cohen
Theorem 2. (i) Let T be a Turing machine with special state qo and let
M be the associated modular machine. Then H o ( T ) and H , ( M ) have the same many-one degree. (ii) Let d be an r.e. many-one degree, not that of N . Then there is a modular machine Md with Ho(Md)of degree d.
Proof. (i) (0,O) is the unique pair corresponding to Co. Since M simulates T, if (a,p ) corresponds to C we have C E H o ( T )iff (a, p ) E Ho(M).Hence Ho(T) is many-one reducible to Ho(M), the function showing the reduction being C (a, p ) where (a,p ) is chosen to be (um + a, um + q ) if there are two possibilities. The set P ( T ) of pairs corresponding to configurations of T is recursive. Let P * ( T )= {(a,p ) ; (a, p)+(a’, p ‘ ) for some (a’, p’)E P ( T ) } . Plainly H o ( M )C P * ( T ) . If P * ( T ) is recursive we can define a total recursive function 7c from N 2to configurations of T as follows. If (a,p ) $Z P * ( T ) let ~ ( ap ), be qoaoa,(which is not in Ho(T),since it is terminal if Ho(T)#O). If ( a l p ) €P * ( T ) ,let ( a ’ , p ’ ) be the first pair in the computation starting from ( a , @ which ) lies in P ( T ) and let ~ ( ap ), = C‘, the configuration corresponding to (a’, p’). Then C’E H o ( T )iff (a’, p’) E H o ( M ) iff (a,p ) E Ho(M).Hence (a,p ) E H o ( M ) iff ~ ( ap ), E Ho(T), as required. Since P ( T ) is recursive, to show P * ( T ) is recursive it is enough to show the following. (a,p ) E P * ( T )- P ( T ) iff p = b modm where 0 s b s n, a = um i + ‘ + qm + m ‘ - 1 for some q with n < q < m and some i >O, and, if i is the largest integer such that a can be written as above, then (um + q , p m ’ ) E P ( T ) . We prove this by induction on the number of steps needed to obtain an element of P ( T ) starting from (a, p ) . Let (a,p ) 3 (a’, p ’ ) where (a’,p ’ ) satisfies conditions similar to the above. The quadruple which applies to (a,p ) cannot correspond to a quintuple ending with q * R (recall that q * is represented by t h e integer m - 1, and no quintuple ends with q * L ) . For such a quadruple leads to a pair whose first entry is congruent mod m * to am + m - 1 where 0 S a S n, but a ’ is not of this form. The only other possibility is that in (a’,p ‘ ) the state-symbol is not m - 1 (occurring in a’) but is 0 (occurring in p’). This means that the quadruple applied to (a, p ) was ( m - 1 , b, bm, L ) for some b with 0 s b s m. Hence p = b mod m and a = a ’ m + m - 1, so (a, p ) has the required form. We must start the induction by showing that if ((a,@) (a’,@’))€ P ( T ) then either (a,p ) E P ( T ) or (a,0) has the stated form.
+
Modular machines I
9
Suppose the quadruple that applies is (9, a, a ‘ m + 9’, L ) (other cases are similar). If 9 ’ # 0, and C’ is t h e configuration corresponding to ( a ’ , p ‘ )it is easy to see that C ’ =W,q’ba‘W, for some letter b and tape descriptions WI, W,. Then (a,p ) E P ( T ) , corresponding to the configuration WlbqaW2.If, however, 9’ = 0, when t h e quadruple can only be (m - 1, a, am, L ) for some a, in addition to this case there is another one. We may regard 9 ’ as being the letter a. rather than a state. Thus we may have a‘ = um + 91 for some state 91, and p‘ = um ; as ( a ’ , p ’ ) EP ( T ) neither u nor u can be congruent modm to an integer strictly between n and m. In this case a = a r m + m - 1 and p = u, giving t h e stated form of an element in P * ( T ) - P ( T ) . (ii) H f l ( M )is not many-one equivalent to N for any machine M, since if Ho(M)‘#0 then (m,O) fZ H,,(M) as n o quadruple begins with 0,O. Also we can find M with H f l ( M = ) 0. By (i) it is enough to find a Turing machine Td with a special state such that H[)(Td)has many-one degree d for any other d. The work of Borger [ l ] and Overbeek [I71 show that H o ( T )can be chosen to have any r.e. many-one degree (except that of N) for a Turing machine T defined by quadruples. This is explicit in [ I ] for a programmed Turing machine. Replacing program lines by the appropriate quadruples we get a Turing machine by quadruples whose configurations are bijective with those of t h e programmed machine. The work in [17] is already expressed in terms of quadruples; Lemma 5 of [17] shows that the halting problem may have any degree. If we add to the routine given there further subroutines clearing all the arguments it is easy to see that the resulting machine T has H f l ( T )of the required degree. From t h e Turing machine T defined by quadruples we get, as explained earlier, a Turing machine T’ defined by quintuples by adding auxiliary states 9,s, (indeed, a simpler set of auxiliary states may be used here; the full force of this labelling is only needed for other problems). Any configuration C of T can be regarded as a configuration of T’ and C E H,(T) iff C E Ho(T’).T‘ has other configurations corresponding to t h e auxiliary states. If C’ is such a configuration then C’ C, where C can be regarded as a configuration of T, and C‘E Ho(T’)iff C E Ho(T’)iff C E Ho(T). Hence Ho(T’)and H o ( T ) are many-one equivalent. Finally we obtain Td by adding states to T’. Let H,,(T’)be defined in terms of the state 96. Obtain Td from T‘ by adding states 9 * , 9,) and quintuples 9*aaqoL for all letters a and the quintuple 9,koao9*R. Plainly a configuration C of T‘ is in Ho(T’)iff it is in Ho(Td). A configuration of Td in states 9 * or 90 is in Ho(Td) iff it consists of a
+
10
S. Aanderaa, D.E. Cohen
blank tape. Thus H o ( T d )and H,,(T') are many-one equivalent, as required. Remarks. 1. H,,(T) and H , , ( M ) are special halting problems. By contrast to Theorem 1, the general halting problem for M can have any r.e. Turing degree greater than that of T. A full account of these results is given elsewhere [9]. 2. Instead of choosing t h e state n + 1 as initial state and t h e state 0 as halting state we may, once having ensured that the machine only halts scanning 0, make 0 into t h e initial and final state (so no quintuple begins 00 but there is a quintuple beginning O i for 1 c i c n). This is useful in t h e proof of Higman's embedding theorem, and the details are given in t h e paper following [O].
84. Truth-table degrees
Let M be a modular machine. We shall associate with M a finitely generated group whose word problem has the same truth-table degree as H o ( M ) .There are several ways of doing this; indeed t h e group G ( M ) of 02 is suitable. We shall use a different group, partly as an illustration of t h e techniques that can be used. It is convenient to regard M as acting on Z 2 instead of on N 2 ; H o ( M ) is unaltered by this change, as one may prove by induction on the length of the computation that if (a, p ) E H,,(M) with a,p E Z , then a, /3 3 0. Let G , ( M )= (r,, k, for a,p E 2 ; r,k, = k,ra for (a,p ) E H,(M)). We sometimes write r, k for to, k,, and r ( a ) ,k ( p ) for r,, k , to avoid complicated subscripts. GI(M) may be regarded as an HNN extension with stable letters { k , } of t h e free group with basis {r,} or as an HNN extension with stable letters { r a } of the free group with basis { k , } . It follows immediately that r,k, = kpra in GI(M) (if and) only if (a,p ) E H o ( M ) . y - ' k , y = k p e l for all Let G 2 ( M )= (GI(M),x, y ; x - ' r a x = a, p E Z). Since {r,} and { k , } are bases for free subgroups of GI(M), we see that G 2 ( M )is an HNN extension of GI(M)with stable letters x, y . As before let M have quadruples (an,b,, c,, R ) for i E I and (a,,b,, c,, L ) for j E J. We obtain G,(M) by adding new generators ri(i E I ) and I, (j E J) to G 2 ( M )and corresponding relations r;'x"r, = xm2, r;'y"r, = y , r;'r,r, = r,,, r;'ker, = k,, for all pairs ((u, p ) to which (a,,b,, c,, R ) applies and yields (a,, PI) (more simply we have
Modular machines I
11
r;'t(um + a,)r, = t(urn2+ c,), r;'k(um + b,)r#= f ( u ) ) and the similar relations for l,, interchanging x and y. W e shall write G I , G2, G3 instead of GI(M), G t ( M ) , G 3 ( M ) .W e shall prove shortly that G3 is an HNN extension of G2. From the two ways of regarding GI as an HNN extension of a free group we obtain both t,-pinches and k,-pinches. As G2 is an HNN extension of G I we also have x - and y-pinches. Notice that if a word in the generators of Gz has an x-pinch this means that it contains a ~ E = ? 1 and u is a word in { f a , k,} which equals subword X - ~ U Xwhere in G I a word in { t m } ;this does not mean that u freely equals a word in it-). (i) If a word in the generators of G2 has an x-pinch then either it has an x-pinch of the form X - ~ U X ' with E = 2 1 where u is a word in { t < , }or it has a k,-pinch for some p (and similarly for y). For, as already remarked, it must have a subword X - ~ U X where ~ u is a word in { f a , k,} which equals in G I a word u in { f a } . Either uu-' freely equals 1 or uu-l has a k,-pinch for some p in which case SO does u (as no k, occurs in u ) . (ii) G3is an HNN extension of Gz. To check the isomorphism conditions we have four cases to consider, one of which is the following (the others are similar). Let W be a word in the symbols X, Y, T,, K,. Let (a, b, c, R ) be a quadruple of M. If W(x"', y"', t @ m + a ) , k (qm + b)) = 1 in G2 then W(x"', y, t @ m 2+ c), k (4)) = 1 in Gz. We use induction on the length of W, which we may assume freely reduced. If W is non-trivial, W(x "',y "',f (prn + a ) , k (qm + b)) must contain a pinch. By (i) W freely equals either WIX-'UXpW2 where U is a word in {T,} and E = 2 1 or W freely equals WlK,'UK,W2 where U is a word in those T, for which @m + a, qm + b ) E H o ( M ) (or a similar case with the roles of X and Y and of Tp and K , interchanged). In the first case let W' = WIU' Wz where U' is obtained from U by replacing each T, by T,,,; in the second case let W' = WlUW2. I n both cases we find that W(x", y"', t @ m + a ) , k(qm + b)) = W'(x"',y",t(pm + a ) , k ( q m + b ) ) and that W(x"",y,t(pm2+c),k(q))= W'(x"', y, t @ m 2+ c), k ( q ) ) , so the result follows by induction. The process of repeated HNN extension is called a Britton tower. (iii) G , is finitely presented. It is finitely generated, with generators t, k, x, y, r,, I,, since f a = x-"tx" and k, =
[email protected] infinitely many relations involving r, (and similarly 1,) are consequences of four of them, namely r;'x"'r, = X"'', r , 'y"'r, = y, r , Ix "~tx"~r, = x-'atxc,, r;ly-bikyber, = k. The remaining relations include tk = kt. However they are all consequences of this
12
S.Aanderaa, D.E. Cohen
relation and the relations involving r, and f,. This follows by induction on the number of steps in the computation from (a, to (0,O)since if (a,p ) 3 (a,, p,) using the quadruple (a,,b,, c,, R ) then the relations involving r, show that r, conjugates the commutator of x-"tx" and y - B k y Pinto the commutator of x-"1txU1 and ~ - ~ i k y @Hence 1. the former is trivial if the latter is. (From this presentation we find a much easier expression for G,as a Britton tower, G , 2 (t, x , y, r,, 1,) 2 ( t , x , y ), the latter being free. This is easily seen to be a Britton tower, G,being obviously finitely presented, and the construction being analogous to that of 02. The construction given, however, makes questions of truth-table degrees easier to look at .) Let W be a word in the generators of G,.By a reduction of W we mean one of the following: (I) Free reduction; also replacing x p . x q by x p + q if p + q # O and deleting x " . x-,; also similar reductions for y . (11) If W freely equals UxPt:V where U does not end in x e , replacing W by U t : - g p V ,and similarly for y and kp. (111) If W freely equals Ur-'WorV where r is a stable letter corresponding to (a, b, c, R ) and Wofreely equals a word in x m , y m , t(pm + a ) , k(qm + b ) (for all p , q E Z ) , replacing W by V W ,V where W 1is obtained from Wo by replacing x",y",t(pm + a ) , k(qm + b ) by xm2,y, t(pm * + c), k (4);also similar replacements corresponding to r - l , 1 and I-'. (IV) Replacing UkitZV by UtZkEV or vice versa for all integers a, p. A reduction is called correct if it is of types (I), (11) or (111) or if it is of type (IV) with (a,p ) E Ho(M).A reduction sequence starting from W is a sequence W = W , , W2,..., W, of distinct words such that W,+, is a reduction of W, for all p ; a correct reduction sequence is one in which all reductions are correct. Define the weight of W to be the sum of the following: the number of occurrences of r,, kp, r,, I, and their inverses, the number of occurrences of powers of x and y , and the number of times a power of x precedes (not necessarily immediately) some t : (for E = +1) and similarly for y and k,. Plainly, reductions of types (I), (11), and (111) reduce the weight while type (IV) reductions preserve it. Since there is a bound on the number of type (IV) reductions which can be applied consecutively to W without getting a repetition, we see that only finitely many reduction sequences can start with W (indeed, there is a recursive function of W bounding their number). Because we apply type (111) reductions only when Wo freely equals a
6)
Modular machines I
13
word in the relevant generators, and apply type (IV) reductions for all a, p, it is clear that the finite set { W ' ;W' is a reduction of W } depends recursively on W. The finite set {W'; W' occurs in a reduction sequence starting from W } will also depend recursively on W, as will {(a, p ) E 2'; a type (IV) reduction involving fa, k, occurs in some reduction sequence starting from W } .This set will be used to construct a truth-table depending recursively on W. (iv) If a word in the generators of G , has a pinch it has a correct reduction sequence ending in a word of less weight. Such a sequence will be called a weighr -reducing sequence or w.-r. sequence. By (i) if there is an x- or y-pinch either a type (11) reduction applies or there is a f a - or k,-pinch. Suppose there is a k,-pinch for some Po (the other case is similar). Then there is a subword k;;Uk;, where U is a word in those r, with (a,Po)E H,(M). Type (IV) correct reductions replace this subword by Uk;,'kE, to which a type (I) reduction applies. Let r be a stable letter corresponding to (a, b, c, R). Suppose the pinch is the subword r-'Wr (the other cases being similar). Then W equals in G 2a word in x'", y"', r,, k, for a = a mod m, p =_ 6 mod m. If W freely equals such a word we may apply a type (111) reduction to the original word, while the original word has a w.-r. sequence if W has. We use induction on the length of W to show that if W equals such a word and has no w.-r. sequence then it freely equals such a word. Suppose W has no w.-r. sequence. Then W has no pinch since it only involves x, y, r,, k,, and the result has been proved for such words. Let W equal W , in G2, where W , is a word in x"', y'", fa, k, for a = a, p = b mod m. We may assume W , has no pinch, since pinching out produces a shorter element in the same subgroup. Let U and U , be the largest final segments of W and W 1involving only r, and k,. Suppose there is cancellation in U I U - ' .Then, for instance, W = W'r, where a. = a mod m and W' is shorter than W. Then W' has no w.-r. sequence since W has none, and W' = W,t;i. Hence, inductively, W' freely equals a word in x"', y"', t ,k, (a = a, p = b). Then W will also freely equal such a word. Suppose there is a k,-pinch in U I U - ' but no cancellation. Since U and U , have no pinches, W and W , must freely equal W'kiV and WlkEV, where V, V, are words in { a } and p = b (since k, occurs in W J . The pinch now tells us correct type (IV) reductions take us from W to W'Vk;. This must be freely reduced, since it has no w.-r. sequence as W has none (and free reduction reduces weight). But then W'V has no w.-r. sequence, equals in Gz a word of the required form, and is shorter than W. Hence it freely equals such a word (by induction on length). Since W'V is freely reduced and V involves only
14
S.Aanderaa. D.E. Cohen
both W' and V, and hence W itself, must be as required. Similar aruments apply to a t,-pinch. As W ,W - ' = 1 in Gz it must have a pinch. By (i) the only possibility left is that W and W , freely equal W'xqU and W:xp"U1where U, UI are words in { t a } (or similarly for y and k p ) . U must be empty, else W has a type (11) reduction; we may also assume UI empty since a type (11) reduction applied to W, gives a word of the same form. If q = p m , as before W ' , and hence W, is as required. If q # pm then WIX~~-~W must - ' have a pinch which can only be a pinch of a term x * I in x ~ m - q with a similar term in Wl or W - ' . In the first case (by (i) as W, has no pinches) a further type (11) reduction could have been made in W,; we may choose W, so that this is impossible. In the second case, for the same reason, a type (11) reduction could be made in W, contrary to hypothesis.
{t,},
Lemma 3. Let W be a word in the generators of G,. If W = 1 in G , there is a correct reduction sequence starting at W and ending at 1. Conversely, i f there is such a sequence then W = 1 in G,.
Proof. Each correct reduction is a consequence of the defining relations so W = 1 if there is such a sequence. If W = 1 in G , and is non-trivial it must have a pinch. By (iv) there is a correct reduction sequence from W to W , where W , has smaller weight than W. This sequence shows that W , = W in G3,and the lemma follows by induction on weight. The next lemma is used only for the Higman embedding theorem [O] but its proof fits in naturally here. Lemma 4. Let S be a subset of Z . If a word W equals in G , a word in ( k , t, ( a E S ) ) there is a correct reduction sequence from W to W , where W , freely equals a word in ( k , t, ( a E S ) ) .
Proof. It is enough to show that if W does not freely equal such a word then it has a w.-r. sequence. For if it has a w.-r. sequence ending in W' then W' = W in G , and by induction on the weight there is a correct reduction sequence from W' to such a W , , which gives a correct reduction sequence from W to W,. If W involves x,y,r,, or 1, it must have a pinch since it equals an element of GI. By (iv) it then has a w.-r. sequence. Suppose W is a word in (t,, k p (all a,p ) ) which equals in G I a word W , in (k,t, (a E S)). If W has a pinch it has a w.-r. sequence so we
Modular machines I
15
may assume it has no pinch. We may also assume that W , has n o pinch, since pinching o u t gives an element of t h e same subgroup. Since W ,W - ’= 1 in G , it must contain a cancellation or a pinch. We now proceed as in (iv). Theorem B is an immediate consequence of Theorem 2 and Theorem 5 below. Theorem 5 . G , ( M ) is a finitely presented group whose word problem has the same unbounded truth-table degree as H,(M).
Proof. We have already shown that G , ( M ) is finitely presented. H o ( M ) is one-one reducible to t h e word problem for G , ( M ) since ( a ,p ) E H o ( M ) iff x - ” t x ” and y - P k y P commute in G,. We show how the word problem for G3 is truth-table reducible to H o ( M ) .Let W be a word in t, k , x, y , r,, 1,. Our truth-table will have question columns corresponding to all pairs ( a ,p ) occurring in type (IV) reductions in some reduction sequence starting at W. A line in the answer column will be “Yes” iff there is a reduction sequence ending in 1 all of whose relevant questions are answered by “Yes”. This is a recursive construction, by the remarks on reduction sequences. This construction gives a “Yes” somewhere in the answer column iff there is a correct reduction sequence starting at W and ending at 1. By Lemma 3 this holds iff W = 1 in G,. This proves Theorem 5. The crucial point that leads to truth-table reducibility rather than Turing reducibility is that we make reductions in a general situation, and only decide after all reductions are made whether or not they have been made correctly. If w e had partial recursive functions which allowed us to make correct reductions, but (for instance) required us to determine if ( a ,p ) E H , ( M ) or not before determining if kp’t,kp could be replaced by t, t h e procedure would only give Turing reducibility.
References [l] E. Borger, A new general approach to the theory of the many-one equivalence of decision problems for algorithmic systems, Z . Math. Logik Grundlagen Math. (to appear). [2] W.W. Boone, The word problem, Annals of Math. 70 (1959) 207-265. [3] W. W. Boone, Word problems and recursively enumerable degree of unsolvability A sequel on finitely presented groups, Annals of Math. 84 (1966) 49-84.
16
S. Aanderaa, D.E. Cohen
[4] W.W. Boone, Word problems and recursively enumerable degrees of unsolvability. An emendation, Annals of Math. 94 (1971) 389-391. IS] J.L. Britton, The word problem, Annals of Math. 77 (1963) 16-32. [6] C.R.J. Clapham, Finitely presented groups with word problems of arbitrary degrees of unsolvability, Proc. London Math. SOC.(3) 14 (1964) 633-676. [7] D.E. Cohen, Residual finiteness and Britton’s Lemma, J. London Math. Soc. (2) 16 (1977) 232-234. [8] D.E. Cohen, Combinatorial group theory; a topological approach, Queen Mary College Lecture Notes, 1978. [9] D.E. Cohen, Degree problems for modular machines (to appear). [lo] D.J. Collins, Truth-table degrees and the Boone groups, Annals of Math. 94 (1971) 392-396. [ 111 A.A. Fridman, Degrees of unsolvability of identity in finitely presented groups, Soviet Math. 3 (1962) 1733-1737. [I21 A.A. Fridman, Degrees of unsolvability of the word problem for finitely defined groups (Izdatel’stvo Nauka, Moscow, 1967). [13] G. Higman, Subgroups of finitely presented groups, Proc. Roy. SOC.A 262 (1961) 455-475. [14] R. Mckenzie and R. J. Thompson, An elementary construction of unsolvable word problems in group theory, in Word Problems, Studies in Logic and the Foundations of Mathematics (North-Holland Publishing Co., Amsterdam, 1973). [15] M.L. Minsky, Recursive unsolvability of Post’ problem of ‘Tag’ and other topics in the theory of Turing machines, Annals of Math. 74 (1961) 437-455. [16] P.S. Novikov, O n the algorithmic unsolvability of the word problem in groups, Trudy Mat. lnst. Steklov 44 (1955). [ 171 R. Overbeek, The representation of many-one degrees’by decision problems of Turing machines, Proc. London Math. Soc. (3) 26 (1973) 167-183. [ 181 J.C. Shepherdson and H.E. Sturgis, Computability of recursive functions, J. Ass. Computing Machinery 10 (1963) 217-256. And also the immediately following paper in this volume. 0,s. Aanderaa and D.E. Cohen, Modular machines and the Higman-Clapham-Valiev embedding theorem, in Word Problems 11: The Oxford Book, Studies in Logic and the Foundations of Mathematics (North-Holland Publishing Co., Amsterdam, 1979).
S.I. Adian, W.W. Boone, G. Higman, eds., Word Problems I1 @ North-Holland Publishing Company (1980) 17-28
MODULAR MACHINES AND THE HIGMAN-CLAPHAMVALIEV EMBEDDING THEOREM Stal AANDERAA and Daniel E. COHEN University of Oslo and Queen Mary College, London
In this paper we use the modular machines introduced in [2] to give simple proofs of the following theorems. Theorem A (Higman [5]). A n y finitely generated recursively presented group can be embedded in a finitely presented group. Theorem B (Valiev [lo]). Let C be a finitely generated recursively presented group. Then there is a finitely presented group containing C whose word problem. has the same unbounded truth-table degree as the word problem for C. Since Higman’s original proof of Theorem A, several other proofs have appeared [ l , 3, 4, 6, 91. Our proof is closest in spirit to that of Aanderaa [l]. As in [2], the use of modular machines instead of Turing machines greatly simplifies the proof. Higman also showed (which is not difficult) that any finitely generated subgroup of a finitely presented group is recursively presented. He also observed that the existence of a finitely presented group with unsolvable word problem follows, by Theorem A, from the existence of a recursively presented group with unsolvable word problem, and that such a group is easy to find. Theorem B was first proved by Valiev [lo]. A weaker result, with Turing degree replacing truth-table degree, was proved earlier by Clapham [4]. Readers interested only in Theorem A should omit 01. They may omit 02 if they give their own proof of the following fact, shown in the middle of 92. For any r.e. set S of words in the alphabet {a,,. . . , a,} there is a Turing machine T on ao,a,, . . . ,a, such that if W is not empty W E S iff the configuration with tape description W in state qo scanning the last square of W is in Ho(T).(Note the unusual features: qo is both the initial and the final state, and the last square of W is scanned.) The proof of Theorem A is contained in parts (i)-(iv) of 03. 1 1
18
S. Aanderaa, D.E. Cohen
91. Some reduction lemmas
Let X be a (finite or countable) set of generators of a group H, and the natural map from the free group F = F ( X ) to H. A map f from H to H is called recursive if there is a recursive cp : F + F with n-cp = fn-. If X is infinite, changing the ordering of X could change recursiveness; in the applications there will always be a natural ordering of X . If X is finite, any homomorphism cp from F to F is recursive; if X is infinite, cp is recursive iff its restriction to X is recursive. n-
Lemma 1. Let A be a subgroup of H, f : H + A a recursive map which is the identity on A. Then the membership problem for A is many-one reducible to the word problem for H. Proof. Take recursive cp : F + F with n-cp = fn-. Let w E F. If T W E A, n-cpw = fn-w = T W , while if n-w = n-cpw we have n-w E fH C A. Hence n-w E A iff n- (w-'cpw) = 1, which reduces the membership problem for A to the word problem for H. Lemma 2. Let A, ( i = ? I ,..., k n ) be subgroups of H, fi : H + A - , recursive maps which restrict to isomorphisms A, -+ A _ , (the restrictions of fz and f-l being inverse). Let H * be the HNN extension ( H , p , ; ~ ; ' A Q= , A-#, i = 1,. . . , n ) . Then the word problem for H * is truth-table reducible to the join of the word probiem for H and the membership problems for A,, i = ? 1,. . . , k n. Proof. Denote p;' by p-#. Let cp# : F + F be maps with ~ c p ,= fin-. (Note that fi and cp, need not be homomorphisms.) W e use the methods of 93 of [2]. Let w be a word in X and the p , ( i = 1,. . .). A reduction of w is one of the following: (I) delete a subword p - # p # ; (11) delete a subword p-,up,, where u E F; delete the whole of w if wEF; (111) replace a subword p - p p , by cp,u, where u E F. A reduction is correct if it is of type (I), of type (11) with the deleted subword equalling 1 in H, or of type (111) with TU E A,. A (correct) reduction sequence is a sequence w,, w2,.. . . ,wk where w,+' comes fOm w, by a (correct) reduction, j = 1,. . . , k - 1. Plainly there is a bound to the length of a reduction sequence starting at w, and there are only finitely many reduction sequences starting at w. W e associate a truth-
*
Modular machines I1
19
table recursively with w as follows. There will be a question column m = l?” to each (sub)word u which is deleted in a type (11) reduction in a reduction sequence starting from w ; there will be a question column “ I T U E A,?” for each u occurring in a type (111) reduction in some reduction sequence starting with w. In t h e answer column a line receives the answer “Yes” if the questions answered by “Yes” o n this line include all the questions in some reduction sequence ending in 1. This means that there is a “Yes” in the answer column iff there is a correct reduction sequence from w to 1. If there is such a correct reduction sequence then w = 1 in H*, since each correct reduction corresponds to a relation of H*. If w = 1 in H * either w E F and ITW = 1 or w contains a subword p-,pl or w contains a pinch p-#up, with u E F and ITU E A,. In each case a correct reduction can be performed, yielding a word w ’ with w’ equalling 1 in H * and w ‘ either trivial or with fewer p-symbols than w. Inductively, w’ and hence w itself, has a correct reduction sequence ending in 1. Hence w = 1 iff the truth-table for w has a “Yes” in t h e answer column, giving the result. “
Note that Lemmas 1 and 2 may be generalised slightly. In Lemma 1 we only require a recursive map cp with ~ c p wE A for all w and ncpw = TW if T W E A ; then TQ is a map from F to A which need not induce a map from H to A. In Lemma 2 we may take maps ‘pi such that ITQ,FC A-, and pi induces the isomorphism from A, to A _ , (but need not induce a map from H to A - , ) .
Lemma 3. Let A, ( i = 2 1,..., * n ) be subgroups of H . Let cp, : F + F be partial recursive functions such that cpa induces an isomorphism from A, to A _ , (and cp-, induces the inverse isomorphism). Let 0, be a , that ITW E r 0 , w if recursive map from F to finite subsets of I T - ’ Asuch r w E A,, and 0,F dom cp,. Then rhe word problems for H and H * = ( H , p , ; p;’A,p, = A-,, i = 1 , . . . , n ) are truth-table equiualent.
Proof. The proof combines and slightly generalises Lemmas 1 and 2. Replace in the proof of Lemma 2 t h e reductions of type (111) by reductions of type (111’): replace P - ~ U P , by cpiu for some u E Oiu. As Oi is recursive and cpiu is defined for u E Oiu it is clear that { w ’ ; w ’ is a reduction of w } is a finite set depending recursively on w. The set of all reduction sequences starting from w is finite and depends recursively on w as do the finite sets { u ; u is used in some reduction of type (11) in a reduction sequence starting from w } and (u, u ) ; the pair (u, u ) is used in a type (111’) reduction in a sequence starting from
20
S. Aanderaa, D.E. a h e n
w. A reduction of type (111’) is correct if m = PU. Build a truth-table as before with columns corresponding to “mu = l?” for u in a type (11) ) for reduction in some sequence starting from w and ‘ ‘ ~ ( u u -=~ l?” pairs (u, v ) in such a type (111’) reduction. As before this provides a truth-table reduction of the word problem for H * t o the word problem for H. The word problem for H is trivially one-one reducible to that for H*. In Lemma 2 the existence of partial recursive isomorphisms from Ai to A-i would enable us to perform pinches and so Turing reduce the word problem for H * to the join of the word problems for H and the membership problems for Aki in H. The crucial feature that gives truth-table reducibility is that we have functions from H to A-i. These enable us to perform reductions first and check correctness later. Many similar results can be proved, in particular versions of Lemmas 2 and 3 for amalgamated free products. We will simply prove some results on solvable problems.
Lemma 4. Let H, A, and H * be as before. Suppose the isomorphisms from Ai to A-i are partial recursive. Zf the membership problems for Aei in H are solvable, then the membership problem for H in H * is solvable. Zf, in addition, the word problem for H is solvable, so is the word problem for H * . Remark. Frequently we are given partial recursive maps from Yi to A _ , which will always extend to partial recursive maps from (Y,) to A+. If P ( Yi) = Ai, and ker P is r.e. (i.e. H is recursively presented) this defines a partial recursive map from .rr-’Ai to A - , which is what we need.
Proof. Any element of H * can be written as the product of elements of H and the stable letters (and their inverses). We can see if there is a subword p-,up,. If so, as the membership problem for A, is solvable we can determine whether or not u E A,. If u E A, we can replace p-,up, by the corresponding element of A _ , (as the isomorphisms are partial recursive). This enables us to obtain recursively a reduced form for any element of H*. An element of H * equals 1 in H * iff its reduced form lies in H and equals 1 in H. Lemma 5. Let A1, A2 be subgroups of HI, H2.Suppose there is an isomorphism from A , to A2 which is partial recursive with partial recursive inverse. Let the membership problems for A t in H,and for A 2
Modular machines I1
21
in H , be solvable. Let K , be a subgroup of H I with A 1C K I and the membership problem for K I in HI solvable. Then the membership problems for HI, K , , and ( K l ,H,) in the amalgamated free product ( H I ,H,: A , = A,) is solvable. Proof. As in Lemma 4 we recursively find reduced forms for elements of the amalgamated free product. An element lies in HI iff its reduced form lies in H I (or in A,). Because K, 2 A , an element lies in (K1,H,) iff its reduced form consists of elements alternately from K1 and H,. 02. Turing machines
Let S be an r.e. subset of N. It is proved by Overbeek (Lemma 5 of
[7]) that there is a Turing machine (defined by quadruples) on the alphabet {0,1} whose halting problem is many-one equivalent to S. A
glance at the construction shows that the machine halts on qlOl' iff r E S ; also there is a halting state (i.e. a configuration is terminal iff its state is the specified one). It is easy to check that if S # N and we add further subroutines clearing all registers and a final subroutine moving to the next blank square to the right (this latter is convenient but not necessary) we obtain a machine T I with halting state qo such that H,,(TI)(the problem of halting with a blank tape) has many-one degree that of S, while r E S iff qlO1' E H o ( T I )(and T I does not halt on qlO1' if r & S). The final subroutine (move to next blank right) ensures that there is only one quadruple involving qo, which is qOOqo for some state
4.
Routine technical arguments (e.g. [8]) extend this result to an arbitrary alphabet. Le., if S is a proper r.e. set of word in the alphabet { a , , .. . , a,,} then there is a Turing machine T I on the alphabet {ao,a , , . . . ,a.} with the following properties: TI has a halting state qo; only one quadruple involves qo and this is qaoaoqofor some state q ; H o ( T I )is many-one equivalent to S; if TI halts on q l a o W (where W is , q l a o W E H o ( T Iiff ) a word in u , , ..., a,,) then q l a o W E H o ( T I )and W E S. For more details look at the discussion in [S] of machines on an arbitrary alphabet and translate the subroutines in Overbeek's machine (and his con,struction of machines computing total recursive functions) accordingly. We wish to consider Turing machines defined by quintuples. These are obtained by adding auxiliary states corresponding to non-moving quadruples. We shall also need another modification. Let T; be as above. Let T be defined by adding auxiliary states q :
22
S. Aanderaa, D.E. Cohen
for each state q. and two further states q’, q”. T h e quintuples consist of q,a,a,q,R (or L ) corresponding to the quadruple q,a,q,R (or L ) , q,a,a,qTR corresponding to q,alalqs,and for all s and i quadruples q fa,a,q,l, and finally quintuples qoa,a,q’l for i # 0, q’a,a,q’L for i # 0, q’aoaoq’’R and q”a,a,qIL for all i. Note that qo is special in the sense of Theorem 2 of [2] and the remarks preceding it. If T is started in state q,, scanning the last square of W, and W is non-empty (if W is empty the tape is blank and the configuration is terminal) it will reach the configuration q,aoW; from this it will not halt if W tif S but will halt on a blank tape if W E S. A configuration in state q, (for r # O ) will lie in H n ( T ) iff it is in H,(T,) (note that TI only halts on a non-blank square if the configuration we start with is in state q,,). If the configuration C is in state q f then C j CI with C , in state 4.; for s # O we have C E H n ( T ) iff C, E Ho(TI)(again because TI halts on blank squares the computation for T does not continue after it has simulated the computation for TI). For configurations with state qn scanning a blank square or in state q : immediately to the right of a blank we can tell whether or not they are in Hn(T).For any other configuration C we can recursiv,ely find a configuration CI with C + C , and C , in state q l (all configurations between C and C, being in states qo, q’ or 4”). Then C E H o ( T )iff C , E Hll(Tl).These remarks show that H n ( T )is many-one equivalent to Hn(Tl). So we have found t o any proper r.e. set S in the alphabet a l ,. . . ,a, a Turing machine T defined by quintuples such that (i) H,,(T) is manyone equivalent to S and (ii) W E H,,(T)iff the configuration in state qn scanning the last square of W lies in H,(T) (the holding also for W empty). This machine is used in the next section, S being the set of relators of a recursively presented group. 03. The embedding theorems Let C be a non-trivial finitely generated recursively presented group. Let C be generated by c , , . . .,c,. W e regard a word in the letters cf’, . . . , c:’ as a positive word in the letters c,,. . . , c2,, where c,+~ denotes c;’. Let W be the r.e. set of all relators of c. By 92 we may take a Turing machine T whose alphabet is cl,. . . , cz. and co (blank) such that H o ( T )is unbounded truth-table equivalent to W and such that a word w is in W iff the configuration with tape description w in state q,, scanning the last square of w lies in H , ( T ) (note that w = 1 corresponds to the blank tape). As remarked in the introduction for
Modular machines II
23
the proof of Theorem A only the second of these two properties is needed, and readers may supply the proof of this property themselves if they do not wish to read § 2 . Let M be the modular machine associated with T. Each word w defines an integer a, by regarding w as the representation in m-ary notation of a with cl,. . ,cZn being taken as the integers 1,. . ., 2 n . The set I of integers represented by a word is recursive. For a E Z let W, denote the corresponding word. In particular wo = 1 and, for 1 C i C 2n, we have wi = ci and w,,+, = w,ci. If a E Z then (a,O)E H o ( M ) iff w, E W by the construction of M from T. It was shown in 94 of [ 2 ] that the group G 3 = G 3 ( M )constructed there is finitely presented. We never refer to the quadruples of M explicitly so we can use letters a, b, c for generators of groups without fear of confusion. The notation used is chosen to be as close as is convenient to the notation in the proof of Theorem A by Aanderaa [l]. Let H3=C*G,. Let H4= ( H 3 ,bi ( i = 1,. . . ,n ) ; br'qb, = cj ( i , j = 1,. . . , n ) , b;'kbi = kc;' (i = 1,. . . , n ) ) . Observe that if bn+i denotes b;' then we have b;'cjbi = c, and bi'kb, = kc;' for i , j = 1,. . . , 2 n . (i) H4 is an HNN extension of H,. For the subgroups associated with b, are both C*(k), and the relevant map is a homomorphism which plainly has an inverse. LetLH,=(H4,d;d-'kd = k ) and let Hs=(H,,p;p-'kp = k, p-'r,p = r,w,(b)d for all a E I ) . Here if w is a word in the ci, w(b) denotes the corresponding word in the b , ; other similar notations will be used. (ii) H , is (obviously) an HNN extension of H4. Also H6 is an HNN extension of H,. We must show that the subgroups (ra, k (a E I ) ) and (rawa(b)d, k (a E I ) ) are isomorphic, with the named generators corresponding. The latter plainly maps to the former by the homomorphism of H,which maps G , identically and kills all the other generators. From the construction of G3 as a Britton tower we know that the former has defining relations r,k = kr, for all a such that a E Z and (a,0) E H o ( M ) .Thus we have a homomorphism of the former to the latter provided r,w,(b)dk = kr,w,(b)d for all such a. If a E Z we have w, (b)dk = kw, (bc)d since k-'bik = bici. Also w, (bc) = w, (b)wa (c) since bici = cibi for all i, j. Since w, ( c ) = 1 iff w, E W iff (a,0) E H o ( M ) the required relations hold. The last group needed is H7= (H6, a ] ,. . .,a z n ;a;'rai = ti, a;'xai =
S. Aanderaa. D.E. Cohen
24
x"', a;'pa, = p , a;'da, = b,d, a;'b,a, = b, (i = 1,. . . , 2 n ; j = 1,. . . , n)).
Notice that a;'b,a,
=
b, will also hold for j
=
n
+ 1,. . . ,2n.
(iii) H7is an HNN extension of H,. The corresponding associated subgroups which we must prove isomorphic are (for given i) K = (r, x, b,, d , p (j = 1,. . . ,n)) and L = (r,, x m , b,, b d , p ) . Recall that if B is a subgroup of a group A, and A * is any HNN extension of A, then the group generated by B and the stable letters will be an HNN extension of B provided B meets the associated subgroups suitably (see, e.g., the remarks following Britton's Lemma in [2]). Also an HNN extension with trivial associated subgroups is just a free product. It follows from these remarks that, since ( t , x ) n ( k ) is trivial (and hence so is ( r , x ) r l ( C * ( k ) ) ,the subgroup (r, x, b, 0' = 1,. . . ,n ) ) is free on these generators. In turn we find that (r, x, b,, d (j = 1,. . . ., n)) is free on these generators. We next show that ( t , x, b,, d
) n ( k , taw, ( b ) d (a E I)) = ( a w , ( b ) d (a E I)).
For the former clearly contains the latter. Let h be in the former. In the homomorphism from Hs to G 3 which is the identity on G,and kills the other generators h maps to an element g E (r, x ) n ( k , r, (aE I)) which equals (r, (a E I)) from the construction of G, (indeed this follows from the expression of G,as an HNN extension of G1).Since h is the image of g under the map sending k to k and C, to r,w,(b)d we find that h is in the required subgroup. From this and other similar properties we can identify K as the HNN extension with stable letter p of the group freely generated by {r, x, b,, d } , the defining relations being (where r, = x - " t x " ) p-'r,p = raw,(b)d for all a E I. Similarly L is an HNN extension with stable letter p of the group freely generated by r,, x m , b,, d, the relations being p-'r,p = t,w,(b)d for all a E Z with a = i mod m. Since w U m + , ( b=) w, (b)b,,the isomorphism condition is now easily checked. (iv) H7is finitely presented. Since G , is finitely presented it is clear that H7is finitely generated and is presented by: (I) the relations of C, i.e. w , ( c ) = 1 for all a with a E Z and ( a , O ) E H,(M), (11) p-'t,p = r,w,(b)d for a E I, and (111) a further finite set of relations, namely, the relations of G,, all the relations not in (11) which have been explicitly given in the cogstruction of H,, and the relation p-'rp = rd (which also occurs in (11)). We first show that the set of relations (I) follows from (11) and (111).
Modular machines I1
25
From (111) we have, for all a E I, both k-'p-'t,pk = p-'k-'t,kp and k-'t,w, (b)dk = k-'t,kw, ( b ) w , ( c ) d . Hence, using (11), we find that w , ( c ) = 1 if k-'t,k = t,. But by definition of G,, this holds as a consequence of the defining relations of G , provided (a,0) E H,(M). This gives the result. Next we show that (11) follows from (111), proving H , is finitely presented. We first show inductively that, for (Y E I, w i ' ( a ) t w , ( a ) = t, and w,'(a)dw, (a) = w, ( b ) d are consequences of (111). For these are true by definition if 0 < a < 2n. Since /3 > 0 is in I iff /3 = am + i for some a E I and 1S i S 2 n , it is enough to show that if these are true (a) = for a then they are true for am + i (1 s i =s2n). Now warn+, w,(a)a,, SO w 2 , + , ( a ) t w , , + , ( a = ) a ; ' w 2 ( a ) t w a ( a ) a ,= a;'r,a, (inductively) = a;'x-"tx"a, = x-Pma;'ra,x"m = tam+,. Similarly w 2 + , ( a ) d w a m + , (=a a) ; ' w i ' ( a ) d w , ( a ) a , = a;'w,(b)da, = w,(b)b,d = warn+,( b ) d Hence, as a consequence of (111) we have, for all a E I, p-'r,p = p-'w;'(a)rw,(a)p = w:'(a)p-'tpw, ( a )= wi'(a)tdw, ( a )= raw, ( b ) d , i.e. (11) follows from (111). This completes the proof of Theorem A, since H,, being obtained by a Britton tower (i.e. sequence of HNN extensions) from C*C,, will contain C. We now proceed to the proof of Theorem B. We use the same construction as above, and show that at each stage the degree of the word problem is preserved (by using the lemmas of 01). If H, H' are any groups we write W P ( H ) = W P ( H ' ) if H and H' have word problems which are unbounded truth-table equivalent. (v) WP (H3)=W P ( C ) . For by 04 of [2] the word problem for G , is truth-table equivalent to H , ( M ) which is many-one equivalent to H o ( T ) by Theorem 2 of [2]. Hence W P ( G 3 ) =WP(.C)by the construction of T. The result now follows from the analogue of Lemma 2 for free products (the details of this are left to the reader). (vi) WP(H,) = WP(H,).
As already remarked, the associated subgroups are both C * ( k ) .The isomorphisms plainly lift to partial recursive functions. We may now apply Lemma 3 (in the weak form where the maps 8 map to elements rather than to subsets). The relevant maps send c, to c, (j= 1 , . . . , n),
map all the finite set of generators of G , except k to the identity, and send k either to k or to kc;'.
26
S. Aanderaa, D.E. &hen
(vii) WP(H,)= WP(H,). This is a very easy application of Lemma 3, the map 8 being the homomorphism which kills all the generators except k. (viii) WP(H,) = WP(H,). For the only time we use the full force of Lemma 3. Let F,, F3 be the free groups on the generators of H , and G3, IT : F , + H , the projection and p : F5+ F3 the retraction (i.e. maps the generators of F. to themselves and maps the generators of F, not in F3 trivially). Let U and V be the subgroups (k,t, (a E I)) and (k,taw, ( b ) d (a E I)) of H,. Let cp be the homomorphism from the subgroup (k,t, (a E I)) of F, into F5 defined by cpk = k, cpt, = t,w,(b)d (here t, is the element x - " t x a of F,, and the homomorphism cp exists because the subgroup is free on the stated generators). As I is recursive, cp is partial recursive and pcpw = w if w E dom cp. Plainly cp and p induce the isomorphisms between the associated subgroups U and V. Refer to the discussion of reduction sequences in §4 of [2]. It is shown in Lemma 4 of that paper that if w E F3 with ITW E U then there is a correct reduction sequence (in the sense of that section) from w to w ' where w ' is in the subgroup (k,t, (a E I))of F3; also ITW = ITW' since the reductions are correct. For w E F3 let Bw be the subset consisting of all words w ' E (k,t, (a E I)) which occur in some reduction sequence starting at w. By the above ITW E d w if ITW E U. Also B is plainly recursive. Extend 8 t o F5 by defining Ow t o mean Bpw for any w E F,. Then 6 has the required properties provided ITW = ITPW whenever ITW E U. This certainly holds if w E F3, since then pw = w. If w @ F3 but ITW E U there must be a pinch in w . Let w ' be obtained from w by pinching out the relevant subword. It is easy t o check that pw' = pw. Since I T W ' = ITW it follows by induction on the number of generators not in F3 which occur in w that ITW = ITPW,as required. Finally we must define B ! with ITW E d ' w if ITW E V. W e notice that Bw Cdomcp and define B'w to be pew. If ITW E V then ITPW E U. Hence we can find w ' E Ow = Bpw with ITW' = rrpw. W e have rrcpw'E ITB'W so we need only show I T P W ' = ITW. Since m p w ' E V and p induces an isomorphism from V to U we need only show rpcpw' = ITPW,which is true since pcpw' = w'. (ix) WP(H,)= wP(H6). The isomorphisms between the.associated subgroups K and L lift t o partial recursive maps. By Lemma 3 it will be enough to find recursive maps from H6 to K and L which are the identity on K and L. It will
Modular machines II
27
be enough to find a subgroup HL of H6 containing K (and hence L ) such that there is a retraction from H6 to H i and to show that the membership problems for L in K and K in HL are solvable. For then a map (which is not a homomorphism) from HL to K can be defined to be the identity on K and trivial outside K,and this gives a map from H6 to K by composing with the retraction. Let Hk be the subgroup of H6 generated by r, x, d, p , b, (1 S j S n) and all the r - and I-symbols (which are among the generators of G3). We obtain a retraction from H6 to HA by killing the remaining generators. This retraction sends those defining relators of H6 involving only the generators of HL to themselves and sends all other relators to the trivial relator. It follows that HL has as defining relators those (for HL is defining relators of H6 involving only the generators of certainly a quotient of the group with the same generators and only these relators, but in addition H6 and so HL maps to this latter group). In particular HA is an HNN extension of K with stable letters the r and I-symbols; each associated subgroup is (ru,xy)for some integers u, U.
The required results have been proved by Kalorkoti (unpublished) using the Bokut normal form. The messy proof given here does not seem to admit much simplification. It follows from Lemma 4 that the membership problem for K in H L is solvable if t h e membership problems for ( t u , x " ) in K are solvable for all u, u. The membership problem for (tu,x u ) in (r, x, b,, d ) is clearly solvable, since the latter is free on t h e named generators. Hence it is enough to show that the membership problem for ( t , x, b,, d ) (which we denote by K') in K is solvable. Now K is in turn an HNN extension of K' with associated subgroups (r,(a E I)) and (r,w,(b)d (a E I ) ) . Using Lemma 4 again it is enough-to show the membership problems for these in K' are solvable. Now an element of K ' is in the first of these iff (i) it is a word in t and x only, (ii) its exponent sum in x is zero, so that it can be written as a word in r, (all a ) , and (iii) when so written it involves only those t,with a E I. Since I is recursive, this is solvable. With the notation of part (viii) an element w of K ' is in the second subgroup iff pw is in the first subgroup and w = cppw, showing that this problem is also solvable. We still have to show that the membership problem for L in K is solvable. Now K is t h e amalgamated free product of the free group (I, x ) with (t,, (all a ) , d , p, b, (1 S j s n)), the amalgamated subgroup being (r"). Since (t,, d, b , , p ) is an HNN extension of the free group (a,d, b,), Lemma 5 and the previously proved results show that the
28
S. Aanderaa, D.E. Cohen
membership problem for the subgroup L ' = (r, (all a),x'", b, (1C j C n), d , p ) in K is solvable. Since ( x ' " , r, (all a)) is free on x'" and r,, (1 =su =sm ) we may , d , p ; r;lxmpx-mr, = x " p d - ' w 2 + , ( b ) x - present L' as ( r , , , ~ "b,, (1 c j C n, 1 d u m, all u ) ) . Thus L' is an HNN extension of the free group (x'", bj, d , p ) with several stable letters, and so is the amalgamated free product of HNN extensions with one stable letter. Since L is the subgroup of L' with the stable letter rj, it follows from Lemma 5 that the membership problem for L in L' is solvable if the membership problems for the base group in each extension are solvable. A further application of Lemma 4 tells us that this holds if the membership problems for ~"'"px-"'" (all u ) and ~ " ' " p d - ~ w ; ~ , , ( b ) x - "(' u" fixed, all u ) in the free group x'",b,,d,p are solvable. These problems are seen to be solvable in the same way as for r, and r,w,(b)d in K'. These results complete the proof of (ix) and so prove Theorem B.
References [l] S. Aanderaa, A proof of Higman's embedding theorem using Britton extensions of groups, in Word Problems. Studies in Logic and the Foundations of Mathematics (North-Holland Publishing Co., Amsterdam, 1973). [2] S. Aanderaa and D.E. Cohen, Modular machines, the word problem for finitely presentcd groups and Collins' theorem, in Word Problems 11: The Oxford Book, Studies in Logic and the Foundations of Mathematics (North-Holland Publishing Co., Amsterdam, 1979 (i.e. this volume)). [3] C.R.J. Clapham, Finitely presented groups with word problem of arbitrary degrees of unsolvability, Proc. London Math. SOC. (3) 14 (1964) 637-676. [4] C.R.J. Clapham, An embedding theorem for finitely generated groups, Proc. Londoi Math. SOC. (3) 17 (1967) 419430. [5] G. Higman, Subgroups of finitely presented groups, Proc. Roy. SOC. A 262 (1961) 455475. [6] R.C. Lyndon and P.E. Schupp, Combinatorial Group Theory, Ergebnisse der Mathematik, Bd. 89 (Springer, Berlin-Heidelberg-New York, 1977). [7] R. Overbeek, The representation of many-one degrees by decision problems of Turing machines, Proc. London Math. SOC. (3) 26 (1973) 167-183. (81 J.C. Shepherdson and H.E. Sturgis, Computability of recursive functions, J. Ass. Computing Machinery 10 (1963) 217-256. [9] M.K. Valiev, On a theorem of G. Higman, Algebra i logika, 8 (1969) 93-128. [lo] M.K. Valiev, On polynomial reducibility of the word problem under embedding of recursively presented groups in finitely generated groups, Lecture Notes in Computer Science, vol, 32 (Springer, Berlin, 1975), pp. 432438.
S.I. Adian, W.W. Boone, G. Higman, eds., Word Problems I1 @ North-Holland Publishing Company (1980) 29-53
MALCEV’S PROBLEM AND GROUPS WITH A NORMAL FORM L.A. BOKUT’* Institute of Mathematics, Novosibirsk
Introduction
The following problem was known as Malcev’s problem.
Does there exist an associative ring R, without zero divisors, which is not embeddable in a skew field whose multiplicative semigroup R * of non-zero elements is embeddable in a group. This problem occurred to A.I. Malcev in connection with his wellknown work o n embedding rings in skew fields and semigroups in groups (see (191). In 1966 at the International Congress in Moscow it emerged that three authors had, independently, constructed examples of rings which solve Malcev’s problem (see [ 5 ] , (131, [IS]). The solution given in Bokut’ (51 to this problem leads to the notion of a group with standard basis, a notion which provides a uniform approach to certain well-known constructions in group theory. This paper begins with a brief account of the solution in [5] to Malcev’s problem and then goes on to provide the detailed definition of a group with standard basis, together with some examples. 61. Malcev’s problem
Let k = GF(2) be the prime field of characteristic 2. Let 0 be the semigroup with generators
a,, b,, c,, s,, t,, v, where 1 d i
S
4, 0 d j d 1, and with defining relations
(1)
a,s, = c,vo, b,s,+l= c,uI, b,t,+,= a,t,
where 1 s i s 4 and ss = sl,
tr = t l .
With D.J. Collins, Queen Mary College, London. 29
30
L.A. Bokut'
It is easy to show that the ring kQ contains no zero divisors and is not embeddable in a skew field. (The latter follows from the fact that if kQ C D, where D is a skew field, then the equality ( u ; ' u , ) = ~ 1 holds and hence u o = u I , which is impossible.)
Theorem 1. The semigroup ( k Q ) * is embeddable in a group.
The proof of this theorem appears in Bokut' [S], [9]. The idea of the proof is as follows. Since Q is given byhomogeneous relations, the ring kQ can be embedded in the ring kQ of infinite series. Then, to prove Theorem 1, it suffices to prove that t h e semigroup (kQ)*is embeddable in a group. Now it follows from the results of [2] that the semigroup has a presentation of the form
(m)*
(2)
(kQ)*= (r,s ;PIX#,
= p k x k p , xtpt = pix:)
where r is the group of units of the ring kQ (i.e. t h e set of all series with non-zero constant term), 92 is a set of atoms of kQ consisting of one atom from each class of associated atoms (see [14] for terminology), t h e elements p with subscripts lie in \n and the elements X with subscripts belong to the group r. We consider the corresponding group of quotients
given by the same generators and defining relations as the semigroup (2). To establish Theorem 1 it is necessary to prove that two words in the alphabet of t h e semigroup (2) are equal in the group (3) if and only if they are equal in the semigroup (2). The proof of this last assertion is carried out in the following manner. Firstly we enlarge the initial system of defining relations of the group (3) by adding relations of the form X,~q,X,~klp;'X~~IPLX:!;/ p I' = pk y;k/p L' y;/ pi y$/p y'
yt,kl
where p E 2 , X , Y E r. These additional relations are consequences of the initial defining relations and so we may write (4)
G((kQ)*)= g p K 9 ;@)
where Qi is the enlarged system of relations. Then we prove that every group word in the generators f U S has a normal form, which is defined in terms of the presentation (4). It turns out that the process
Malceo's problem and groups with a normal form
31
whereby a positive word in f U % is reduced to normal form is carried out by means of transformations in the semigroup (2). This enables one to prove Theorem 1. 92. Groups with standard basis
The process of constructing normal forms in the group G ( ( k Q ) * ) , given by (4), can be formalized and, in particular, the definition of a group with standard basis can be formulated. This was done in t h e papers [3], [4], and [6] of Bokut'. These papers also contain proofs that Novikov's groups ?I,,,, (see [20]) and Boone's groups G ( T , q )(see [lo]) are groups with standard basis. This makes it possible to obtain many well-known properties of these groups (see [ l ] and [12]) in a uniform and relatively simple way. Furthermore, with the help of the same method, it was possible (Bokut' [7]) to prove that for any recursively enumerable Turing degree of unsolvability there exists a finitely presented group (of the kind ?I,,a) whose conjugacy problem is of t h e given degree. This last result was obtained, independently, by Collins 1151. We begin with the definition and some properties of groups with stable letters, following, by and large, P.S. Novikov's work [21]. If 2 is an alphabet, then by a 2-word or a word in C (on 2, from 2 ) we shall always understand a group word constructed from this alphabet. For a fixed presentation G = ( 2 ;@) of some group we write X E G if X is a 2-word. Graphic equality of words will be denoted by the sign E and definition or denotation by the sign e. Let d = (2;@) be the group with generators 2 and defining relations @. The group
G = (2,'$; @, A p m , = pn,B,, i E I) (5) where 2 r l '$ is empty, p,,, pn, E '$ and A,, B, are 2-words is the group with stable letters '$ and base group (?. Groups with stable letters were first considered by G . Higman, B.H. Neumann and H. Neumann in their well-known paper [22]. Thus the group G is often called an HNN extension of the group d. (We do not at present assume any conditions which ensure, for example, that d is naturally embedded in G although such conditions are often incorporated in the definition of HNN extension.) The relations (6)
A ;'pn, = pm,B;',
iEI
are, of course, valid in G. We say that the letters pm and p . are related
32
L.A. Bokut'
if the relations (5) and (6) contain, among them, a sequence of equalities A :,"pn,* = p n,* l B2, . . . A :,'p n,, = p n,{)B
r,'
7
-
where p m 2 p n , *and p. K p " , ? . When pm and p,, are related via t h e above sequence, we use the notation
Bpdm B :,I . . . B :,* In other words we use ?Ipmpnand Bpmp, as variables for pairs of words of the form (7). Obviously we always have \ ) I p _ p ~=mp n 8 p n P n . In t h e book [21], t h e words ?Ipmpnand Bpsmin (7) are referred to as occurring in the correspondence L,. We shall call the words (7) connected. For brevity we write ?Ipmpne A f; . . . A f;,
(7)
?IP"z ?IPd", BP" e BPd", Y p ; p ; '
e BPGm,BP"lPrn1 e ?IP&.
By the extension system of relations of t h e group G we mean the system of relations in (5) enlarged by the equalities
B ; ' p , '= pLfA;',
iE
I.
We now turn to the notion of individuality of letters. In using the extension system of relations of G, we perform transformations
XUY + X V Y where either U = V or V = U is a member of the extension system of relations (or else a trivial relation of the kind v f ' = 1, pp-' = 1, u E Z,p E Is). We say that the individuality of the letters in X and Y is preserved by the transformation. Further, if, for example, U K A,pm, and V K p , B , or U K B l ' p i , ' and V K p G l A ; ' we also say that pm, and pn,, or pi,' and p;f have the same individuality. We define individuality for letters occurring in a sequence of transformations of the extension system of relations by transitivity. (For brevity we call such sequences extension sequences.) We give three lemmas, which are explicitly or implicitly in Novikov [21], and which we shall use later. Lemma 1. Ler there be given an extension sequence of the group G
w p : v + w l p : , v I ~ * " - * Wkp:,vk where p : , p ; , , . . . . ,p z kF p L have the same individuality. Then for some connected words ?Ipk; and Bpk; the equalities w k
hold in G.
=
w?Ip;p;,
v, = 82 .v pq-
Malcev's problem and groups wirh a normal form
33
Lemma 2. If in any extension sequence of G which starts with the word Wp;'Up;V the letters p i a and p ; (strictly, letters of the same individuality as p i e and p 2) cancel one another, then U = ?lpap:in G, for some '?Ipap:.. For the third lemma we need a definition. The system @ of stable letters of the group G is called regular if for any p E @ and any two connected words ?I, and '23, the condition
?I,
= 1 in
G
if and only if '23,
=
1 in
G,
is satisfied. (This is often called the isomorphism condition for G and is assumed as part of the definition of HNN extension.)
Lemma 3 (Britton's lemma). Let @ be a regular system of stable letters of the group G, with base group G and let W be a ( 2 U P)-word. I f or W contains W = 1 in G then either W is a 2-word and W = 1 in a subword of the form
pieup:,
(8)
where U is a 2-word and for some !I',;
u = ?I,;,,
(8')
in
G.
A (2 Ug)-word is called $3-reduced if it has no subword (8) for which (8') is satisfied. We now embark on the definition of a group with standard basis. (Our account is a little different from that in [4].) Let G = ( 2 ;@) be a group. Let us suppose that the given presentation of G satisfies the conditions: SB 1) 2 = UZa,0 S a < 7,where 7 is an ordinal and t h e sets S,, are non-empty and mutually disjoint. SB 2) @ = 0 S a < 7 and, for every a, all the relations in @a are of t h e form
u@=,
A p . = p,B
(9)
where p,,p, E 2- and A, B are non-empty words in Up<,2,+(We allow the possibility that @a is empty and our definition requires @,, to be empty.) We introduce the following notation:
z,
=
u
20,
P
-
G, = Ga+i.
=
u
@
@p,
G" = ( Z a ; 6 , , ) ,
34
L.A. Bokut'
e,.
Then G, has stable letters 2, and base group The group Go is free on Z,,. Thus G is obtained from a free group by a sequence of HNN extensions (taking ascending unions at limit ordinals). We shall see later that for Novikov's groups ?Ip,p, and Boone's groups G ( T , q ) t h e number T is finite. For t h e group (4), and for other groups in [ 5 ] and [S], T is infinite. We distinguish in every word A and B, which appears in a relation (9) one letter (positive or negative). Thus every relation (9) is assumed to be in t h e form
(10)
AixAzp.
=pJiyBz
where x and y are the distinguished letters of A and B respectively. With each relation (10) we associate the following words of G , :
where x ' , x are both stable letters of some Go, p < a , y ' , y are both stable letters of some G,, y < a and x , y are the distinguished letters in (10). We now define, inductively, the sets of words C,, 0 s a < T : C 1) The set C,, is the set of all freely reduced words in X,,. C 2) Suppose, for some a 3 0 that the sets C,, p < a have been defined and that C, consists of words of G,. Then the word (12)
W 4 Rop : : R l . * . p 2Rs
-
where s 2 0 (when s = 0, W 4 R,,), p,,, E 2, and R, is a word in Z,, belongs to C, if and only if the following conditions are satisfied: (a) W is a (freely) reduced word; = C,, 0 s i s s; (b) R , E (c) W contains no subword equal to a word (11) which is associated with a relation (10) from @., We understand condition (c) to signify that W contains, for example, n o subword x'Ep. where E is a x,-word and E = BxpxA2 in G, (and similarly for other words from (11)). Clearly the sets C, form an ascending sequence
c, up,,
coc CI c . .. c c, c . * .
We write C = UC,, 0 s a < T. We call elements of C normal words. We say that the group G = ( 2 ;@) is a group with standard basis if in addition to SB 1 and SB 2, the following condition SB 3 holds: SB 3) For every a, 0 < a < T, the set C, is a basis of the group G,,
35
Malcev's problem and groups with a normal form
that is, every word X E G, is equal to one and only one normal word C(X)E c,. We often call C ( X ) the normal form of X . From condition SB 3, it follows that C is a basis of the whole group G. We note that any subword of a normal word is normal and that every letter p , p - ' , p E 2 constitutes a normal word. In addition, if Rp', where R E p E Z,, is normal and U E then Rp'U is normal. Furthermore, if Rp' and Up;' are normal words, where R , U E p , p I E 2, and U $1, then Rp Up y' is normal. In verifying condition SB 3 for our later examples we shall use t h e following lemma.
c,,
c,,
c,,
Lemma 4. Let the conditions S B 1 and SB 2 be satisfied for the group G = ( 2 ;@). Let the relations of this presentation be fixed in the form (10). Then condition S B 3 is satisfied if and only if the following three conditions are simultaneously satisfied : SB 4) each word X E G, is equal (in G,) to some normal word C(X)EC,, o s a < 7 ; S B 5 ) the set 2, is a regular system of stable letters for the group G , with base group G,, 1s a < 7 ; SB 6 ) if Rp:, Tpf E C,, where p,, p, E Z,,R, T E and, for some ?lPTp7,R = T1'Ip,,,in Gczthen i = j and R = T (in G,).
c,
Proof. Let condition SB 3 be satisfied. T h e n obviously SB 4 is satisfied. We verify SB 5. T o this end we pick some a z= 1, p E Z<, and connected words ?fp, Bp and prove, for example, that if 1'1, = 1 in G,", then Bp= 1 in G,. Then, in G,, t h e equality p = p C ( B p )holds, since Bp= C(Bp)in G,. Both sides of this equality are normal. Thus c(BP)z1,i.e., Bp= 1 in G,. We verify SB 6. We have Rp: = Tp;C(Bp;p:) and this is an equality between normal words. From the uniqueness assertion in SB 3 it follows that i = j and R = T in G,,. Conversely, let us suppose that conditions SB 4-SB 6 are satisfied. We prove SB 3 by (transfinite) induction on a. Certainly C,,is a basis of the free group Go. Suppose we have proved, for every p < a , that C, is a basis of GLI. We prove the same assertion for the index a. Let us note first of all that the normal words of G, are, by our conditions, 2,-reduced. Suppose, to t h e contrary, that a normal word contains a subword p;'Rp:, where p,, p, E 2,, R E G,, and R = ?)lp:p7in Gc,.T h e n the words
L A . Bokur'
36
Rp: and p ; , which are certainly normal, satisfy t h e hypotheses of SB 6 whence i = j and R = 1 in By the inductive assumption R E 1 and this contradicts the fact that a normal word is freely reduced. Let us now consider two normal words of G,
c,.
W n R o p : ' R ,. . - p ; ' R , ,
c,,
W , n S , q : ' S ,- q : s ,
where 1 3 0 , r SO, R,,S, E p,,q, E Z,,E,, 6, in G,. Then we have S ; l q ~ ' .~..q -6 l ' S ; ' R o p ; '.. .pjlR - 1
= ? 1.
Suppose W = W ,
I -
in G,. By virtue of Britton's lemma and the previous remark, we deduce that either 1 = 0 = r and RaKSo,or that 1, t > O and E , = S I , Ro = So%,,:lq;, . From SB 6 it follows that p l Eq, and Ra = So in i.e. R o 9 S o .The proof of this assertion we need is completed by induction o n 1.
em,
We make some remarks that we shall use later on. Remark 1. Let (Y > 0 and suppose that for each /3 < a t h e set C, is a basis of Gp.We consider a.reduced word W of the form (12) in which every R, is normal. If W is not normal, then by condition C 2 it contains, for example, the subword x 'Epn,
c,
where E E and E = Bx.,A2in C?, (for some Bxsx). Since E is ) . condition C 2 is equivalent to the normal and so E E C ( B ~ . , A ~Thus condition : C 2') a reduced word W of the form (12) belongs to C, if and only if every R, is a normal word of G, and W contains n o subword of the form x 'C (B.,A2)pn,
(13)
y 'C(B,,,B,)p; I ,
x r - l C ( B x , - i x;')pm, -~A
y
C(By .~ y - IB ;' ) pm', 1
corresponding to a relation (10) from @., Remark 2. In Go, each word is transformed to its normal form by means of cancellations. Let us suppose that for some a > O , it is already proved, for every /3 < a , that C is a basis of G and that rules for transforming words R E G, into normal form have been formulated. Then the following constitute rules whereby a word
W K RopL:.. . p;R,
Malcev's problem and groups wirh a normal form
31
of the group G,, where s S O , p E Z,, R, E G,, is transformed into normal form: (a) transform each R, into normal form, 0 s i c s ; (b) make all possible cancellations of letters p., in the word obtained from (a); (c) exclude from the resulting word the leftmost occurrence of a word (13); (d) return to (a). We understand condition (c) to mean that one has to apply to the word under consideration a transformation of, for example, the kind x ' C ( B ~ ~ , A-+ ~ %,,,A ) P ~ ;'p,B.
Similar transformations are to be used for the other words from (13). We call the application of transformations (a)-(d) the process of reducing W to normal form in G,. In general this reduction process may not terminate in a finite number of steps. To realise the application of the given process in a finite number of steps it is necessary and sufficient that in reducing each word Rp'., where R E G,, p. E Z,,only finitely many transformations (a) and (c) are applied. Remark 3. The process described for reducing a word W of G, to its normal form is some extension sequence of G, beginning with W. Therefore we can refer to t h e individuality of letters p F , p E 2, in this sequence of transformations. We shall say that p F is cancelled (uncancelled) according as a letter of t h e same individuality as p F is cancelled (remains uncancelled) when W is reduced to normal form. We note that Lemmas 1 and 2 can both be applied to the extension sequence of transformations whereby a word is reduced to normal form. 03. Novikov groups
In this section we shall prove that a Novikov group ?Ip,, is a group with standard basis. We give the definition of such a group ?Ip,,, following [20]. Let I
. . , a,,
5 = (a],.
q l , . . . ,q,, r j , . . . , rA, l I , .. . , l A )
where n and A are positive integers. We also consider the alphabet 5' = ( a ; , q:, r:, I : , 1 sj s n, 1 s i S A}. Further we fix a system (A,,B,),1 s i c A, of pairs of non-empty positive words in the alphabet
38
L.A. Bokut'
{ a l , .. .,a n } .Then we denote by ?lp,m = G the group given in t h e alphabet 2=BU3'U{p,,p,}
by the following system of defining relations (the bold face italic letters are those which we distinguish in the relations)
(14)
1. q,a, = aiq,q,
1'. a;q: = q:q:a;
2. r,r,al = a,r,
2'.
3. l,a, = all,
3'. 1:a;
r:a; = u;r:r: = a;l:
4. A : p l A , = q:l:pllt4,
5. r:plrz = p l
6 . B,p,B := r,l,pzl:r: 7. q,p24:
= p2
where i = 1,2, .. . , A , j = 1 , 2 , . . ., n and ( a , ; . . a,,)'= a t . . . a:!. We partition 2 and @ into disjoint subsets: z o = { q , , r , , +4 # ,+r ,l,S i S A } ,
-Zz= {la,I:, 1 i Qo = 4,
S
A 1,
21={uJ,u;,lSjSn}
Z3= { p l ,p 2 ) ;
@, = {(14.1),(14.2), (14.17, (14.2+)}
= {(14.3),(14.3+)},
@d
= ((14.4) - (14.7)).
We view all the relations of the corresponding group G,, 0 S cr as being in the form (15)
AixAzp,
6 3,
= pmBiyB2
where p.,p, E 2, and A l x A r , B I y B z are 2,-l-words. In each case x and y in (15) are the bold italic letters in (14) making the obvious modifications to (14.4)-(14.7) so that they have t h e form (15). We define the sets C,, 0 < cr < 3, as described in the previous section, and prove that C, is a basis of G,, 0 S a S 3. We shall use, in turn, the process of reducing to normal form given in Remark 2. The set Co consists of all reduced words in &. It is clear that Co is a basis of Go. The set Cl consists of all reduced words of t h e form
-
w x Uox f ' UlX ;*
. . x ;"u, *
where k 3 0 (if k = 0 then WK U,,), x, E S1, U, E C,, and W contains no subword from the following list:
Malcev's problem and groups with a normal form -2
(16)
39
-1
q:a,,q, a , ,q,a;', rfa,, r:'a,, rra;'
q +2 i a +, ,qT-ia;,q?a;-',
r?a;, rTa;-', r:-2a;-'.
Let us verify conditions SB 4-SB 6 for LY = 1. We begin with SB 5 . For the connected words 21a, and Bal we have =
V ( q t ,rf),
8 , = V(q:,r r )
where V(q,,r:) is a word in the arguments ql,rf, 1 s i s A and V ( q f ,r,) is the same word but in the arguments q:, r,. If V(q,,rf) is reduced then, clearly, V ( q : , r , ) is reduced. Hence SB 5 follows. Next we show that every word of G I is equal to some normal word. By Remark 2 it suffices to prove that the process of eliminating subwords of the form (16) from a word Rp; R E C,, p. E 2' is finite. Suppose, for example, that p i n a,. In the transformations q:a, + a,qf' and rfa, + a,r,, the length of the word R is decreased. In the transformations r;'a, + r,a,r;l, the length of R does not increase and the number of negative letters in R is decreased. Therefore after finitely many such transformations Ra, will be reduced to normal form. The argument is similar when p',Zu;', a:, u;-'. Let us now verify SB 6. Let, for example, Ra, and Ta, be normal words and let R = TV(q,,r f ) , where V = V(q,,r f ) is a reduced word. Suppose that V i l . If T V is reduced then R ends in q : or rf' which is impossible since Ra, is normal. Otherwise if V begins with q : then T ends with q;' and thus Ta, is not normal. Similarly if V begins with rf', and rf' can be cancelled in TV then Ta, is not normal. So suppose only r : can be cancelled between T and V. If V $ r f Ethen Ra, is not normal while if V I r f ' then either Ra, or Tu, is not normal according as E = - 1 or E = 1. This condition is verified in an analogous way in the other cases. Now let us consider the group G2.From the equalities
?Il,= V ( a , ) , B,, = V ( a , ) ,
?XC =
V ( a T ) , BC
=
V(a;),
it follows that {l,, l:} is a regular system of stable letters. Then, by C 2' (in Remark 1) t h e reduced word
WnUox;'UI * * . x : u , where k 2 0 , x, E Z2, U, E CI,is normal if and only if it contains no subword from the list: (17)
a , V ( q f ,r,)Z:, a:V(q:, rT2)l?,
a;'V(q,, rf)Z: aT-'V(q:', r:)l:',
L.A. Bokut'
40
where the V's are reduced words. To verify SB 4 in this situation it suffices, by Remark 2, t o note that, for instance, after the transformation
R l f sR'ajV(qf,ri)lf+ R'V(qi,rT)r;a, has been performed, R'V(qi,r f ) has fewer occurrences of the letter a, than R. Let us verify condition SB 6. Let Rli and TI, be normal words and is *a - non-empty *a~ reduced let R = TV(aj),where V ( a , ) K u , ~ , ' a ~ ~ word. We consider the process of reducing TV to normal form. If a;,' is not cancelled in this reduction process then none of a;,',. . .,a;: will be cancelled. By Lemma 1 it follows, writing E = E, and j = js, that R must end in a;E where E = Ba;. We see immediately that Rli is not normal. If u f : is cancelled in the reduction process, then, by Lemma 2, T ends in a,"E, where E = Ba;=l,and this contradicts the normality of TIi.Thus V-1 and SB 6 is verified. Finally we consider G,. By C 2', the reduced word
w zu0X;'u'- x ?Uk, * *
where k 3 0 , xi E Z3and Ui E C2,is normal if and only if it contains no subword from the list:
rtEppt,
r:pi',
I ;V( a ;)PI, (18)
qfpz,
q?pi',
1;' V( a, ) p i',
I+-lC(V(af)q'-'At)pi, IiC(V(aj)qiA;')p;', IV(a,)p2,
l+-'V(at)p;',
1 ;' C ( V(a,)r ;'Bi)pz,
I + C ( v ( a f ) r + B+-')p;I,
where 1 s i s A, 1 s j s n and the V's are reduced words. The validity of SB 4 for the group G, follows from Remark 2 and the fact that eliminating subwords (18) from the word Rp:, where R E G2,either decreases the number of letters I f , I + " in R or leaves fixed the number of these letters but decreases the number of letters r:, r:', qf, 97' in R. Now we verify SB 5 . The relations of G, which involve p , and p 2 can be presented in the form
E'piE = pi,
F'zF'
=PZ
x p * * . x:,". It remains where E, F are words in P and (x y,' . . x::)' to note that the mapping U + U' defines an antiisomorphism of the group with generating set 1" and defining relations (14.1)-(14.3) and the and defining relations (14.1+)-(14.3+). group with generating set
a+
Malcev's problem and groups with a normal form
Before verifying SB 6 we consider the elements more detail these are
am= V(BY1r,f8, q,),
'BPI, ?Irn, Bm.In
BPI= V(A,q;'l;',r ; ' )
?Ip,= V ( A : - ' q : l : , r:), (19)
41
'Brn= V(B:rT-'f:-',q:-')
where we may assume that in the words V there are no adjacent inverse pairs ( A:-'q;l:)-'(A ;-'q,f;)',r:-'r;', . . . ,(q;-')-'(q:-')'.We q,). Obviously this is normal if it consider, for example, am= V(B;'r,f#, contains no occurrences of I,. On the other hand, if it does contain occurrences of f, then it is, in fact, {l}-reduced. Suppose, for example, that V contains the subword
f ;'r;IB, W(q,)B and r;'B,W(qf)B;'r,= V , ( a f ) where , W and V , are reduced. The normal forms (in G I ) of r;'B,WB;'r, and V , must coincide graphically and so C(r;lB,WB;'r,)contains no occurrences of letters 4,. But this happens only when W 3 1 and then V contains an inverse pair, which we ruled out above. Now, to verify SB 6, suppose that Rp2 and Tp2 are normal and that R = TV(B;'rJ,,q,),where V contains n o adjacent inverse pairs (B;'r,l,)-'(B;'r,l,)~, q;'qf. If V K V ( q , ) ,then, since neither R and S can end in q f , it follows that V K l . Let V contain an occurrence of a letter I : . We take the leftmost of such occurrences and suppose that this I f is cancelled in reducing TV to normal form. Two cases arise: 1) E = 1. Then, by Lemma 2, we have
T3T,I;'T2,
T2€ GI,
V K Vi(qJ)B;'r,l,V2
where T2V,(qJ)B;'r, = V3(aJ) in G I . It follows that T2KC(V3(aJ)r;'.B,V4(q,)), where V3 and V 4are reduced words. The last equality shows that Tp2 is not normal. (If V4Z1,then Tp2 contains q;pz while if V4E1,then Tpt contains l;'C(V3(aJ)r;'B,)p2.) 2) E = - 1. Here we have Tz€ GI,
T K TIl,T2,
V K Vi(q,)l;'Vz
where T 2 V l ( q , ) =V3(aJ). Thus T , = V3(aJ)V4(qJ), with V,, V 4 reduced and again Tp2 is not in normal form. We conclude that the leftmost letter 1: of V(B;'r,l,,q,)is not cancelled in reducing TV to normal form. But then, by Remark 3 and Lemma 1, the word C ( T V ) H R ends in one of the following words: 18
v3(aJ) v4( qJ
)?
I;'
c(v,(
a,)rrlBt
v4(q,1)
42
L.A. Bokut'
Arguing as above we show in both cases that Rpz is not in normal form. The other cases occurring in the verification of SB 6 (pEKpG',pl,p71) are dealt with in an analogous way. This completes the proof of the following theorem. Theorem 2. A Nouikou group ?Ip,, is a group with standard basis. Corollary (See [l]). The word problem in any Nouikou group ?Ip,, is algorithmically soluable.
To establish the corollary it suffices to inspect the list (16)-(19) of words which we eliminate in reducing to normal form and be convinced that the question of recognising whether an arbitrarily given word belongs to the indicated list is algorithmically solvable. In fact the only words in this list whose explicit form is not indicated are those words, for example, of the type C ( V ( a , ) r ; ' B 0 )However, . a normal word T is of this indicated form if and only if C ( T B ; ' r , ) K V ( a , ) which fact one can always determine. W e derive from Theorem 2 the result of Novikov [20] concerning the conjugacy problem which may be unsolvable in a group '%p,h. W e recall the definition of a certain calculus introduced by Post. A Post system K is a pair K. = [Zn; GO]where 20 is a finite alphabet and @n = {(A,,B,); 1 s i s A } is a finite set of pairs of words from the alphabet Zo.By elementary transformations of K we mean the following transformations of words in the alphabet Zo: (20)
X-X,
A,X-XB,,
XB, + A , X
where (A8, B , )E Go. Two words U and V (from Zo) are called equivalent in K, written U W, if there exists a sequence of elementary transformations which transforms U into W. It is well known (see the references to Post and Markov in [20]) that there exists a Post system with unsolvable problem of equivalence. With each Novikov group '%, one can associate a Post system
-
K ( 2 ) = [ { a l , . . ,a"};(A,,B#), 1S i
S
A]
Novikov's first theorem. Let X and Y be positive words from { a , 1 n } . If BP,X2lIh = Y in 21P,,, then X Y in the corresponding Post system K(2l).
-
Proof. From the given condition we have
ic
Malcev's problem and p u p s with a normal form
(21)
43
V(A , q : 'l: ',r,)X(a,)W(B;'r,l,,q,) = Y(a,)
where V and W are suitably reduced (see (19)). If V 4 V (r , ),then W I W ( q , )- otherwise the normal form of the left-hand side of (21) will contain letters I:, which is obviously impossible. In this case V Z 1 and W K l because Y is in normal and contains no letters r;, qf. Then X K Y and so X Y in K(?I). Now let V and W contain letters I:. Then the last letter 1: of V has to be cancelled (in reducing the left-hand side of (21) to normal form) with the first letter I;' of W . There are two alternative cases. 1) E = - 1. We have
-
(22)
V K V ,A,q;I 1;I V2(r, ),
W K W,(q,) B ;'r,l,W 2
and, by Remark 3 and Lemma 2, we obtain (23)
V2(rJ)XWl(q,)B;'rt = U(a,).
As the normal form of the left side of (23) cannot contain q, or r, we have W , S l , X n X I B , and V2(rl)Xr,= X I in G I . Substituting in (22) and (23) we obtain
V , ( A , q ; ' K 'r,)A,Xl , W'(B;'r,l,,q , ) = Y where V , and W'Eq;'IX~'W2 contain fewer occurrences of letters 1: then do V and W respectively. (Here IXII is the length of X I . ) 2) E = 1 . In the same way as above, we show that X 4 A , X I ,and from (22) there follows the equality V I X I BW, , = Y where V , and W , contain fewer occurrences of letters I:. The proof is completed by an obvious induction on the number of occurrences of letters 1: in V.
Novikov's second theorem. Let X and Y be positive words from {ai,1 S i G n } . Then the words p I X p 2 X +and p l Y p 2 Y +are conjugate in the group ?I,,, if and only if X Y in the corresponding Post system K (%).
-
Proof. Write E E p lX p 2 X+and D 4 p l Y p 2 Y +and assume that E is conjugate to D in %plm. Let {&, 1 6 k S N } be the set of all words obtained from E by cyclic permutation of letters. Among all normal words U satisfying (24)
D = UEkU-',
for some k we choose a word Uo in which the number of occurrences of p ; , p ; is least. We claim that this chosen Uo contains no occurrences , U2 is pf-free, then of p ; , p ; . Suppose otherwise; if UoK U l p f U 2where we have
L.A. Bokut'
44
D = Uip :UzE,U;'p yEUY1.
(25)
Take E = - 1; as D contains only positive occurrences of p l and p2, Britton's lemma applied to (25) yields (26)
Ek KE{p,E:,
U2E
= 21p8.
Substituting in (25), this yields
D
=
UIBpiE;E{p,Bj,lUil
and C(UIBp,)contains fewer occurrences of p f than Uo.This contradicts our choice of Uo. Now we consider (24) with U K Uo.So we have (27)
I/oEku~'=piYPzY+
Since Uo has no occurrences of p ; , p ; we can, without loss of generality assume that Ek KplXp2X'. Then Britton's lemma applied to (27) yields
Bp,X21,= Y
-
in 21plh. It follows from Novikov's first theorem that X Y in K(2l). Y in K(?I). Inductively it suffices to Conversely suppose X consider the two cases:
-
XKA,X,,
Y K X , B , and X F X I B , , Y K A , X , .
Let, for example, the first of these occur. Then in 21pl, we have
UplA,XipzX;A; U-' = piXiB,pzB TX: where U K(r:)-21xfi11:-1q;-1A :. The second case is dealt with similarly and so the theorem is proved. Novikov's second theorem shows that the equivalence problem for the system K(clI) is reducible to the conjugacy problem for the group ?IPlh. The converse reduction was established in [7] and the following result proved.
Theorem 3. For any finitely presented semigroup IZ one can construct a Post system K ( l 7 ) such that 1 ) the word problem for Il is equivalent (in the sense of Turing reducibility) to the equivalence problem for K ( l 7 ) , 2) the equivalence problem for K ( l 7 ) is equivalent (in the sense of Turing reducibility) to the conjugacy problem in the Novikov group ?Iplh for which K ( l 7 ) is the corresponding Post system.
Malceu's problem and groups wirh a nonnal form
45
From this theorem it follows that there exist finitely presented groups with conjugacy problem of arbitrary recursively enumerable Turing degree of unsolvability. As noted earlier, this last result was obtained independently by Collins [15]. 94. Boone groups
In this section, we shall prove that a Boone group G(T,q), as defined in [lo], is a group with standard basis and, using this fact, we shall derive some properties of such groups. By a special semigroup, we understand a semigroup T with generators Sb, 9.
( b E B,
A)
and defining relations
~ , = r , ,i s i c ~ where 2,and r, are special words, i.e. words of t h e form Sq.S', s and s' are (positive) {Sb}-WOTdS.
where
Let T be a special semigroup with a distinguished letter q1= q. We construct, for the pair ( T , q ) ,the following group G ( T , q ) .This group is generated by t h e set of elements I
=;
=(sb,qo,k,t,x,y,I,,r,,b E B , a € A , l S i S N }
and has the following defining relations (the bold face italic letters are to be distinguished): 1.
Sby
= yysb,
XSb =
2. Sbla = yltysb,
risb
sbxx
= sbxr,x
3. 2,= l,T,r,, (28)
4. ti' = f i t ,
5 . r,k = kr,,
ry
= yt
xk
=
kx
6. (q-'tq)k = k(q-'tq),
where b E B, 1 s i c N and Z,, r, are t h e words from the defining system of equalities of the semigroup T. We add to this system of relations t h e equalities
4'. l,V(y)r = rl,V(y), (29)
6'. q-'tqV(r,, x)k
=
kq-'tqV(r,, x),
LA. Bokut'
46
where V ( y ) and V ( r i , x )are reduced words. The system of relations (28) and (29) we shall denote by @. Since each relation in (29) is easily seen to be a consequence of the relations (28), (8;@) is still a presentation of G ( T , q ) ,though now one with infinitely many relations. In our present approach the adjunction of these extra relations is essential to the definition of normal words. Let us consider the following partitions of 8 and @:
E2= {Ii, r,, 1 s i s N }
3 0
= {x, y l ,
81= { S b , b E B},
8 3
= {qa,u E A } ,
5 4
= {t},
8 5
={k},
@I
= {(28.1)},
@2
= {(28.2)},
@5
= {(28.5), (29.6')).
@o= @3
4,
43, = {(28.4),(29.4')},
= {(28.3)},
These partitions define the groups G,, 0 c a c 5, as explained in Section 2. (For reasons of convenience we change notation somewhat; instead of 2 we write 8,and so on.) We take all relations from @,, 0 < a s 5 to be given in the form A 1 Z A2pn = pmB1 U B Z
where p n , p mE E,, AIzA2and BluB2 are 8,-l-words, and z, u are distinguished letters in (28) and (29). Let us construct the sets C,, 0 s a 6 5 , and prove, for each a, that C, is a basis of G,. The group Go is free and Co is defined to be the set of reduced words in { x , y } . The group G I was essentially considered in the previous section. We note only that C1is the set of reduced words W 4 U o x f ' U l . - - x ; ' Uwhere k, k 2 0 , U,ECo, x , E ~ l = { ~ b } ~ which contain no subwords
y2sb,y-lsb, Y's;', X E S b , X-'sb1,Xsbl. (30) Now we consider G2. By virtue of C 2', the set C2 consists of all the reduced words W E Uox UI . . . x ;"U,, where k 3 0, U, E CI, x , E 9,= {Z,, r,}, containing no subwords sbv(y,X 2 ) I : , (31)
sbV(y,xZ)xE1r,,
sbl
v(y',X)y'f!;' s i l v ( y 2 ,x ) r f
where V, Vy', Vx'1 and V are reduced words, b E B and 1 s i N . Condition SB 4 is verified in a routine way - elimination of subwords (31) from Rp:, where R E C1,p,, E Z2,leads to a decrease in the number of letters s;' in the word R . Condition SB 5 follows from the fact that
Malccv's problem and groups with a normal form %Il
=
V(y-'sb)= y s y - ' ,
a,l = V(sbX)= xsx-',
47
81, = v(ysb)= y-'sy
B,, = V(sbX-')= x-'Sx
where S is a reduced {sb}-Wordand the mapping sb + sb x +x - ' , y + y - l induces an automorphism of the group GI. Condition SB 6 is verified in a routine way. For example, let 7'li and Rl, be normal with R = TB,,, i.e. R = TySy-' where S is a reduced word. Let S nsi: ,:s k 3 1. We consider two cases. 1) In reducing TySy-' to normal form, the letter s 2 K s i is cancelled. Then by Lemma 2 and Remark 3, we obtain
---
TZT1s;'Tz, Tz€ Go, TzKC(Bs,*y-')
This contradicts the fact that 7'h is normal. 2) In reducing TySy-I, the letter sb: is not cancelled. Then in reducing this word to normal form no letter of the word S is cancelled, and thus, by Lemma 1, the word R EC(TySy-') ends in s;C(Bay-'), where b = bk, E = &k. This contradicts the fact that RI, is normal. Other possibilities (p;K l;', ri, r ; ' ) are dealt with in an analogous way. Let us consider the group G,.Writing 2,and r, in the forms
2,I 2,I q ~ i z ,
r,Irt,qrn1rh
respectively, we rewrite the relations (28.3) as
(28.3') Then, if
(32)
X,'Wilqm, = q,,,S2r;1r;i'. and B,, '%,m,m
are connected words, we have
n v(2;,'1,r,J,
B,_
V(Z,dr;:)
where we take a reduced word V ( x , )in some new variables x,, 1 S i C N,and '21qmqeand a,&_are obtained from V ( x , )by substituting 2;,'l,I',l and 2,2r;1r;:, respectively, for x,. We shall call V ( x , )the diagram of the words (32). Let 21qmqm have diagram V(x,),where V ( x , )is reduced and nonempty. Then the projection of 21qmqm on the alphabet { I , } is reduced and non-empty. This projection is a homomorphism from Gzto the free 1 in Gz.By a similar argument the group on {I,} and hence 1 in G2and SB 5 holds for G,. connected word B,# We note that the words aqmqn, Bqmqm are uniquely defined by the and sequence of letters lf,r: which appear in them. Specifically if B, have diagram V ( x , )I x * x 2, then we have
r,' - -
(33) where
'21qmqm~ ? l ( I f ; ., . .,l;;),
BqAm FB(r;ei,. . .,r;..),
L.A. Bokuf'
48
and
For Gs, the verification of condition SB 4 is straightforward. We note only that C, consists of all reduced words w K uoq 2 U1* q kuk, where k 3 0 , U, E C,, which contain no subwords
ItC(y-'syrJqm,, (34)
r ;1 ~ ( x s -1r;;)q x
;:,
I;'C(ysy -'Z 1)qn,
r , c ( x +sxz ;;)q
associated with the relations of (28.3'). The verification of SB 6 is also straightforward. For example, let Rq, and Tq. be in normal form, where R, T E Gzand R = T'uqmqm = T'u(I:;,. . . ,I:;). Suppose 'uqmqn f 1; if 1:; is cancelled in reducing T21qmqn to normal form, then, by Lemma 2, the word T ends in I;'C(ySy-'s,ll) or in I,kC(y-'SyI',ll), according as E' = 1 or E' = - 1. In either case Tq. is not normal. If 1:; is not cancelled, then R ends in I;'C(ySy-'2,J or I,kC(y-'SyI'ikl)and Rq, is not normal. For the group G4we have 'ur = V(Ly),
'8, = v(Ly)
where V(l,,y ) is a reduced and hence normal word. The set C4consists of all reduced words W K Uot'lUl * * * tekUk,where k a 0 and U, E C,, which contain no subwords
(35)
y v ,
lC(y-'SyV(y))t', I;'C(ySy-'V(y))t'
where S = k 1 and S, as usual, is a reduced {sb}-word. Condition SB 5 is clear because 'u, = 23, while condition SB 4 follows from the fact that elimination of subwords (35) from Rt', R E C,, either decreases the number of letters I : in R or decreases the number of letters y' in R. To verify SB 6, consider, for example, Rt and St where R = S a , = SV(I,,y ) , and V(l,,y ) is reduced. Suppose V(I,,y ) f 1; if V(I,,y ) K V ( y ) then either S or R ends in y 6 which is impossible. If V ( l , , y )involves some If, then considering, as usual, the first and last of the occurrences of these letters, we deduce that Rt or St contains a subword (35), which is impossible when Rt and St are normal. Finally we consider Gs. Bearing in mind that we added relations (29.6'), strictly we have
%k = w ( q - ' t q v ( r #x), , r,, x ) = 8
k
;
Malcev's problem and groups wirh a normal form
49
of course we can write W(q-'rqV(ri,x ) , ri, x ) = W'(q-'rq,r,, x). The set C5consists of all reduced words W K UokelUl. . . k'SU, where s 2 0 and Ui E C4,which contain no subwords
rfk', X'k',
r"C(W(li,y)qV(ri,x))ke
where 6 = & 1. The verification of SB 4, SB 5 and SB 6 for G5is similar to that for G4 and we omit these straightforward arguments. We do, though, note precisely why the additional relations (29) are needed. Let 2 be a special word of T and suppose that 2 = q in T, Then Z'rZk = k 2 - ' r 2 in G (see [lo] or Lemma 6 below). If we had used only the relations (28) in our definition of normal form, then we would be forced to regard 2-'r2k as normal since we are not able, in general, to show that Z = V(li,y)q in G. Since kZ-'r2 would also be normal, SB 6 would fail. The relations (29.6') are added since, when 2 = q in T, one can prove that 2 = V(l,,y)qW(ri,x ) in G.,. Our conclusions above amount to the following result.
Theorem 4. Any Boone group G ( T , q ) is a group wirh standard basis. We shall now apply Theorem 4 to show that the word problem for G ( T , q ) is equivalent to the problem of equality with q, for special words, in the semigroup T. For this we determine a number of properties of G ( T , q ) (see [12], [17]).
Lemma 5. The word problem for rhe group G4 is solvable.
Proof. We consider, in turn, the groups Go, 0 s a s 4 and verify that, for each a, the basis C, is effective in the sense that the process of reducing words to normal form is effective. Since Go is free, its basis Co is clearly effective. The effectiveness of CI and C2 follows from the fact that the words (30) and the words (31) are explicitly defined. By contrast the words (34) are not given in an explicit way. Suppose though, that we are given a normal word T. To decide, for example, whether or not there exists a word S such that TKC(y-'SyTiI),it suffices to decide whether or not C(yTT;;'y-')is an {sb}-wordand this can be done effectively. So C3 is effective. In a similar way, we can decide of a normal word T of C , whether or not TKC(y-'Sy"),for some reduced {sb}-wordS. For this occurs if and only if S is the projection of T on the alphabet { s b } and C(S-'yT)Ey".This shows that C, is effective and the lemma is proved.
50
L.A. Bokut'
Lemma 6. Let 2 and 2'be two special words of T. Then 2 = 2' in T if and only if there exist words V(lz,y ) and W(ri,x ) such that (36)
2
=
V(li,y)2'W(r0,x )
in the group G,. Proof. Let 2 = 2'in T. Inductively, it suffices to consider t h e case when 2 is transformed into 2' with the aid of one transformation of the type 2,+ or r,+ 2,. Suppose, for example, that 2 KS12,Szand 2'KSlf,Sz.Then, applying relations (28.1)-(28.3) we see that 2 = V(l,,y)X'W(r,,x)in G3,for some words V(l,,y ) , W(r,,x). We now prove the converse. Let the equality (36) hold. We suppose that
r,
2ns1qns2, 2 ' n S ; q m S ; , v ( l ' , y ) ~ y " . l ~ ~ . . . l ~ y " h , and that V is reduced. By our assumption, the normal form of
(37)
y " . 1 : ; . * *1:;s;q,
is of the form SlqnU,where U E GI. However, by Remark 2, the normal form of (37) is obtained by eliminating words (34) which correspond to relations (28.3) of G ( T , q ) .Since each such elimination represents the application of a relation (28.3'), accompanied by use of the relations of G I except insertions of letters I,, we obtain
. 1 2 Y " k S I = SI?l(Q,.. . . ,If;) in GI, where 2l(l:;, . . .,l:;)Z!!lqmqmand the diagram of ?lqmqm is reduced. (38)
y".Q
*.
From (36) and (38), it follows that
(39)
B(r;EI,.. . , r ~ ) s ; w ( r ' = , xsz. )
From (39) and Britton's lemma (or the algorithm for reducing to normal form) we have
W(r,,x)Kx"V:;..* r:;xm~. We prove that 2 = 2' in T by induction o n k. If k = 0, then 2 K2'. Let k > O and suppose, for example, that &k = 1. Then from (38) we have
I,y"'SIqm = I,y-'Syftklqm where qmZqmfk and S is a positive {s,,}-word. From a first reduction of the right-hand side of (36), we obtain
(W
2 = y v : ; . - - i ~ , l y " k - l s 2 , , , q n , k ~ r ~ r , 1 r , : s : x m-k.r ~i . p
.
Malcev's problem and groups with a normal form
From (39), with
El,
51
= 1, it follows that = xs~x-i
r,:s:xmk
where S' is a positive word, and thus S:KT,,,S'. In this way we obtain, from (40),
2 = y % l : l . . l'k-ly"k-ISZ 4 ~ B k 2 X m ~ - l Irk -eI k - l* * . r , : # X m O (41) k-l I d nnk and Z'ZST,,,q ,,kI',kzSr.The inductive assumption applied to (41) yields 2 = SZ,klqn,kZ,kzS' in T and so Z = 2' in T as required. Lemma 7. The problem of equality of an arbitrary word W of G , with a word of the form V ( Ly)ZW(ri,x),
where
Z is special,
is solvable.
Proof. If suffices to consider a normal word 0 4 Q1qmQzwhere Q,, Q2E Cz. Suppose that (42)
Q = V(l,,y)ZW(r,,x )
where Z KZ'q,Z'' is special and V, W are reduced. Among all equalities (42), for our given Q, we select one for which the (1,)-length of V is least. Then we have (43)
Q IC ( V Z W ) IQiqmC(gqmqmZ"W(rt,x))
where Q 1K Ql(l,, y , s b ) contains no letters sbl, 8q,qm 9 U(Z,zr;lK') and the diagram of this word is reduced. It follows from (42) that
Q1Vlq&"
= VZ'
where '?Iqnqm I LJ(Z;:U',l). We claim that BqmqnZ"W(r,, x ) is (1,)-reduced. Let us suppose that this is not so and that, for example, Bqmqn ends with Z,zr;lr;;. Then W ( r , , x ) begins with x'r,, say W K x ' r , W l ( r , , x ) ,8t'Kr,zSand Sx' = xSX-'. It follows from (43) that, when reducing VZ'q. to normal form, the first transformation will be
i,y-lslyr,lqn+ y ~ ~ y - ~ ~ , ~ q ~ , ~ , ~ r ; ~ r ; : , that is, V KVl(l,,y ) 4 y k , Z'FSlr,l and y k S 1= y-'S,y. Then (42) gives
Q = V I l ykSJ',~qnI"2Sx% Wl = vlyk'sIZ,Iqn,Z,zsx'wl
52
L.A. Bokut'
and V l y k 'has smaller (1,)-length than V . This contradicts our choice and so our claim is proved. W(ri,x ) is {r,}-reduced, the characterisation of Bqmq., in Since BqmqmZ" the form of the sequence of letters rT occurring therein, remains in the word C(B,P"W(r,, x ) ) K Q2 which we know. Thus Q2KRRorT; * rzRk, k P 0 and for some 1, 0 d 1 d k, B(r:;, .. .,r?) has the form Bqmq,, for some qm. Furthermore the words
--
1,
C(Ql~lqmqm
c(aAQ2)
contain no letters s i ' . These last conditions can be easily checked having found the normal forms of the respective words. Thus the lemma is proved.
Corollary 1 ([ll], [17]). The Turing degree of unsolvability of the word problem for the group G (T,q ) coincides with the degree of unsolvability of the problem of equality with q for special words in the semigroup T. Proof. The word problem for Gs is reduced to the process for transforming words of Gs into normal form. Since the word problem for G4 is solvable the process is reduced to the problem of whether or not a word 0 of the group G3 is equal to a word of the form V ( Qy)qW(ri,x). By Lemmas 6 and 7, this is reducible to the problem of equality with q for special words in T. To prove the converse reduction, it suffices to note that 2 = q in T if and only if Z - ' t Z k = k Z - ' t 2 in Gs. One part of this assertion follows from Lemma 6 and the other part from applications of Britton's lemma. Since there exist special semigroups in which the problem of equality with q for special words has arbitrary Turing degree of unsolvability we have the following result.
Corollary 2 ([ll], [16], [17)). For an arbitrary recursively enumerable Turing degree of unsolvability, there exists a finitely presented group (namely a Boone group G ( T , q )for some semigroup T ) with word problem of the given degree of unsolvability. In conclusion we should like to express our thanks to Professor W.W. Boone at whose invitation this article was written.
Malceo’s problem and groups with a normal form
53
References [I] S.I. Adjan, On the work of Novikov and his pupils concerning algorithmic questions in algebra, Trudy Matem. Inst. imeni V.A. Steklova AN SSSR 133 (1973), 23-32. [2] L.A. Bokut’, Factorisation theorems for some classes of rings without zero divisors, 1. Algebra and Logic 4, No. 4 (1965), 25-52; 11, Ibid. 5 , No. 1 (1965), 5-30. [3] L.A. Bokut’, On a property of Boone’s groups, Algebra and Logic 5 , No. 5 (1966), 5-23; 11, Ibid. 6, No. 1 (1967), 15-24. [4] L.A. Bokut’, On Novikov’s groups, Algebra and Logic, 6, No. 1 (1967), 25-38. [5] L.A. Bokut’, On the embedding of rings in skew fields, Doklady AN SSSR 175, No. 4 (1967), 755-758. [6] L.A. Bokut’, Groups with a relative standard basis, Sib. Mat. Z. 9, No. 3 (1968) 499-521. [7] L.A. Bokut’, The degrees of unsolvability for the conjugacy problem for finitely presented groups, Algebra and Logic 7, No. 5 (1968) 4-70; 7, No. 6 (1968) 4-52. [8] L.A. Bokut’, Groups of fractions of multiplicative semigroups of certain rings, I, Sib. Mat. Z. 10, No. 2 (1969) 246-286; 11, Ibid. 10, No. 4 (1969) 744-799; 111, Ibid. 10, No. 4 (1969) 800-819. [9] L.A. Bokut’, On Malcev’s problem, Sib. Mat. Z. 10, No. 5 (1969) 965-1005. [lo] W.W. Boone, The word problem, Ann. of Math. 70 (1959) 207-265. [I I ] W.W. Boone, Word problems and recursively enumerable degrees of unsolvability. A first paper on Thue systems, Ann. of Math. 83 (1966) 520-571. [ 121 W.W. Boone, Word problems and recursively enumerable degrees of unsolvability. A sequel on finitely presented groups, Ann. of Math. 84 (1966) 49-84. [13] A.J. Bowtell, On a question of Malcev, J. Algebra 7 (1967) 126-139. [ 141 P.M. Cohn, Free rings and their relations (Academic Press, London, New York, 1971). [ 151 D.J. Collins, Recursively enumerable degrees and the conjugacy problem, Acta Math. 122 (1969) 115-160. [16] C.R.J. Clapham, Finitely presented groups with word problems of arbitrary degree of unsolvability, Proc. Lond. Math. Soc. Series 3, 14 (1964) 633-676. [I71 A.A. Fridman, Degrees of unsolvability of the word problem for finitely presented groups (Nauka, Moscow, 1967). [ 181 A.A. Klein, Rings nonembeddable in fields with multiplicative semigroups embeddable in groups, J. Algebra 7 (1967) 100-125. (191 A.1. Malcev, Selected works, Vol. I (Nauka, Moscow, 1976). [20] P.S. Novikov, Unsolvability of the conjugacy problem in the theory of groups, Izv. Akad. Nauk SSSR, Ser. Mat. 18 (1954) 485-524. [21] P.S. Novikov, On the algorithmic unsolvability of the word problem in group theory, Trudy Mat. Inst. Steklov No. 44 (1955). [22] G. Higman, B.H. Neumann and H. Neumann, Embedding theorems for groups, J. Lond. Math. Soc. 24 (1949) 247-254.
S.I. Adian, W.W. Boone, G. Higman, eds., Word Problems 11 @ North-Holland Publishing Company (1980) 55-69
DECISION PROBLEMS FOR RING THEORY L.A. BOKUT’ Institute of Mathematics, Novosibirsk
In my talk I shall discuss the following questions concerning the theme of the title: 1. The word problem for Lie algebras. 2. Embeddings in simple associative algebras. 3. Associative algebras with one relation. 4. Finite approximation of algebras. 5. Free Lie algebras. 6. Alternative, Jordan, and Malcev rings. I have not included the results on free associative algebras and associative division rings in my report, but have presented here some well-known theorems by P. Cohn, J. Bergman, A. Macintyre and others worth analysing in some detail. I do not touch upon constructive rings, because the theory of these rings is part of the general theory of constructive models (A.I. Malcev, Yu. L. Ershov and others). 91. The word problem for Lie algebras
The first results on the word problem for finitely presented (f.p.) algebras ( a , ..., a n ;f l = 0,. . . . ,fm = 0)s
in the given variety of linear algebras X belongs to A.I. h k o v (1950). He proved that if X is the class of all (non-associative) algebras over a field k, then the word problem in X is algorithmically solvable. A.I. SirSov published two papers in 1962. In the first he pointed out one simple algorithm “exclusions of the leading words of the defining relations” which solves the word problem in the class of all the algebras and in the class of commutative ( x y = y x ) or anti-commutative ( x y = - y x ) algebras. In the second paper SirSov obtained the first result on the word problem for Lie algebras - he proved the solvability of this problem for Lie algebras with one relation
(at, . ., an ; f = 55
56
L.A. Bokut'
In the same paper SirSov proposed a composition method, which I will say more about below. The status of the word problem for arbitrary f.p. Lie algebras was settled in 1972. I was able to prove
Theorem 1. Let k be an arbitrary field. There exists an f.p. Lie algebra over field k with unsolvable word problem. I shall speak briefly about the idea of the proof. Let M be a recursively enumerable non-recursive set of natural numbers. Then the Lie algebra
( a , b, c, a l ,c l ;ab"c.= a l b ; c l ,n E M ) , where ab"c = ( . * . ( a b )* * b ) c has unsolvable word problem. In this way, it suffices to prove that this (or some similar) Lie algebra is embeddable in a finitely presented Lie algebra. This is proved by means of Matijasevich's theorem on the existence of a Diophantine representation of the set M and uses the following simple analogy of one of Higman's lemmas.
Lemma. Let A be an f.g. Lie algebra and let B be a Higman subalgebra A. Then the algebra ( A ,A'"; b = b"), b E B),,,
is embeddable in an f.p. Lie algebra. Definition. B <,A if there exists the following diagram: 3K,L(K-f.p., L-f.g., A, L c K , A n L = B).
In this way it suffices to prove that every subalgebra (ab"c,n E M ) is a Higman subalgebra of free Lie algebra (a, b, c). At t h e present time a new proof of the theorem has been obtained
Decision problems for ring theory
57
by G.P. Kukin. Kukin has essentially simplified the construction of my paper and obtained the following proof. Let
n=(u,u;A,=B,,lSiSn)sgp be a semigroup with unsolvable word problem. Let us consider t h e Lie algebra
L ( n ) = ( x , y , u , v , a ; x u= x u = y u = y u = O , ax = au, ay
=
au, a A , = aB,, 1 S i
S
n)L,e
where, for example, auuu*= ( ( ( a u ) u ) u ) u T. h e n Kukin’s lemma is:
Lemma. Let A, B E (u, u ) ~ Then ~ ~ .
B
One part of this lemma ( 3 ) is clear. If, e.g., A = A ’ A I A ” , = A‘B,A’’, then in L ( n ] because of t h e equalities
wxu
=
wux, wxu
=
wvx, wyu = wuy, wyu = wuy
we have:
a A ’ A , A ” = a A , A ’ ( x , y ) A ’ ‘ = a B I A ’ ( x ,y ) A ” = a A ’ B , A ” . In fact, the word problem for IZ is Turing equivalent to t h e one for L(IZ), so that we have
Theorem 2 (G.P. Kukin). For any r.e. Turing degree of unsoluability there exists an f.p. Lie algebra whose word problem is of the given degree. By means of Theorem 1 I obtained
Theorem 3. A n y Markou property CY of f.p. Lie algebras is algorithmically unrecognizable. The problem of being isomorphic to any particular f.p. Lie algebra is algorithmically unsolvable.
Of a number of open algorithmic problems for Lie algebras, the following problem of A.I. SirSov has t h e greatest interest: Is the word problem solvable in the class of solvable Lie algebras of given solvability step n 3 3? For n = 2 t h e word problem is solvable. The analogous problem for groups was solved by V.N. Remeslennikov (1973, n 3 5 ) .
L.A. Bokut'
58
12. Embeddings into simple associative algebras The interest in embeddings into simple rings and algebras is supported by some internal questions (the study of simple algebras) as well as by external ones - comparison with theory of groups. I can distinguish at least three theorems, which, in this respect, have great influence. These are Higman-Neumann's theorem (1949) about embedding any group into a simple one, the Boone-Higman theorem (1974) on the algebraic characterization of f.g. groups with solvable word problem (f.g. < simple r.p. s f.p.), and P. Hall's theorem (1973) about embedding a group into a simple one, generated by four "abitrary" groups. I n 1962 I proved for associative rings and algebras t h e following Theorem 1. A n y arbitrary associative algebra A can be embedded into a simple associative algebra ?I, in which any equation of the type xay = b, 0 # a, b E ?I is solvable. I f A is an algebra with a recursive basis, then 'u is r.p. with solvable word problem. l f A is a ring, then it is embeddable in a simple ring if and only if it has characteristic p 3 0 0, is a prime number, or 0). Recently I proved the two following sharpenings of this theorem. Theorem 2. Let A, K,, Kz, K 3 be arbitrary associative algebras over field k, I A ( s / K , * K ~ * K , I ,d i m K I * K z * K , a I k ( (here * isthe operation of free product of algebras). Then the algebra A is embeddable in a simple associative algebra ?I of the type ?I = (K,, Kz, K3). If A, K I , Kz, K, have recursive basis, then ?[ also has recursive basis (and solvable word problem ). In particular, an arbitrary countable associative algebra is embeddable in a simple associative algebra with 3 generators. Let K be an algebra over a field k, dim K = a 3 N,,. Let us say that the algebra K satisfies condition (a), if K
=
U K("), K(")CK("+I)dim K("+I)/K(")= dim K(n)= a. n =I
Theorem 3. Let A, K,, . . . , K, be associative algebras over a field k, dim K, = a a 1 k 1, N,,, 1 s i s 4, dim A s a and algebras K,, 1 C i 4 satisfy condition ( a ) . Then the algebra A is embeddable in a simple associative algebra ?I of the type
?I
=
K , + . . . + K.,.
Decision problems for ring theory
59
In particular, algebras K , with trivial multiplication satisfy the condition of t h e theorem. So, every algebra A is embeddable in a simple one which is t h e sum of t h e four algebras with trivial multiplication.
Theorem 4. Eoery associative algebra can be embedded in a simple algebra ?[ = N , + N 2+ N3, when the N , are nilpotent.
0. Kegel (1964) proved, that if B = N , + N2, where N , , N 2 are nilpotent, then B is a nilpotent algebra. Problem. Is every r.p. associative algebra embeddable in an f.p. associative algebra? The same question arises for Lie algebras. As for embeddings into simple non-associative algebras and Lie algebras, there are the B. Neumann and P. Cohn theorems and my theorem as well (1962) on the embedding of an arbitrary Lie algebra into an algebraically closed Lie algebra. . I will say a few words concerning the methods of proof of the revious theorems. As I have already pointed out they follow from irsov’s work (1962) for Lie algebras. Let us reformulate the definition and the lemma by Sirsov for associative algebras. Let F = k ( X ) be a free associative algebra. The semigroup X can be linearly ordered (namely, lexicographically); denote by f the largest word of element f E k(X). Let us define a composition cf, g ) w of elements f, g E F with respect to a word w E (X)sgp.Let
!
w
=
fu
=
vg, l ( w ) <
r(f)+
l(g).
Then
cf, g ) w = pfu We have
f
= a f + .. . , g =
pg + * * * .
-
cf,
Let S if
- aug,
< w. F = k(X).The set S is closed with respect to the composition, g ) w
cf, g ) w =
-
a,a,s,b,,a, E k, s, E S, a,s,b,< w
for every composition of elements f, g E S.
Sidov’s Lemma. If S is a subset of free algebra F = k ( X) closed with respect to the compositions, then in the associative algebra
L.A. Bokut'
60
( X ;s, = 0, s, E S ) , all the words from X which contain no subwords Si,siE S will serue as its base. 03. Associative algebras with one relation The word problem for (associative) algebras with one relation was dealt with by several authors. S.I. Adyan (1960) proved the algorithmic solvability of the word problem for semigroups (hence, the same for algebras) with one relation of the type A = 1 and with the irreducible relation A = B as well. In t h e paper by J. Levin and T. Levin (1968) the word problem was solved and there was proposed t h e Freiheitssatz for associative algebras (x,~
1
,
.. ., y n ;
= 0)
with one relation, satisfying the conditions: 1) F/I d ( u ) is an algebra without zero divisors, where F = k ( x , y , , . . . ,y,,), 2) t h e least homogeneous part of element u doesn't lie in k ( y l , .. . . ,y , ) (i.e. it contains x ) . In the paper by W. Dicks (1972) t h e solvability of the word problem was proved for associative algebras with one relation u = 0 satisfying t h e condition V u E F(uu E F"uF j u E FuF),
where F" is the set of polynomials with zero free members. I would like to give a more detailed description of the paper by V.N. Gerasimov (1976). Let F = k(X)be a free associative algebra with 1 over a field k , d ( x ) t h e ordinary degree function, and F" the set of homogeneous elements of degree n 3 0 (FO = k ) . When considering the word problem it turns to be quite useful to have Theorem 1 (V.N. Gerasimov). Let f be an arbitrary homogeneous element of algebra F. Then the lattice subspaces
Lat (F"fF",n, m
3 0)
generated by subspaces F"fF", are distributive. Let us fix notations: 6 is the largest homogeneous part of element h E F, A = (6, h E H } , H C F. We consider a connection between Theorem 1 and the word problem. Let f be an element of the algebra F and g E Id(f). Then in F there exists an equality
Decision problems for ring theory
61
where uA, uA are homogeneous elements. Let's assume that f satisfies the condition (2)
fF n Ff
=m f
(here F is, of course, the set of all homogeneous elements of the algebra F). Condition (2), for example, is satisfied, if f has no proper two-sided divisors:
f=xu=vxjx-1
or x - f .
From Theorem 1, using (2), one can easily derive the existence of representation (1) of the bounded degree
d(UA) f d ( % ) d ( g ) - d ( f ) . That means that the question of whether the element g belongs to the ideal Id(f) is reduced to the problem of the solvability of a finite system of linear equations (because the subspace Cn+mhd(g)-dV) F"fF" is finite-dimensional). In this way, we obtain Corollary 1. I f f is an element of the free algebra k(X), the largest homogeneous part of which contains no proper two-sided divisors, then in the algebra ( X f;= 0) the word problem-is solvable. This result contains Dicks' theorem mentioned above. In an analogous way one can prove Corollary 2. Let f E k (X), xoE X,f EZ k (X),{xo}), .f, - the largest homogeneous part of the element f with respect to xo and suppose that has no proper two-sided divisors. Then the set X,{xo}generates a free subalgebra in ( X f;= 0).
7%
A special calculus is constructed to prove Theorem 1 in V.N. Gerasimov's paper with axioms and deduction rules which are as follows
where d,d l ,d2are finite subsets of subspaces of a special kind. Two element sets {Ll,L2}of subspaces (which on account of the modularity generate the distributive lattice) are the axioms of this calculus. The d2generate the deduction rules are such that if initial sets d,, distributive lattice of subspaces, then the conclusion set d has the
62
L.A. Bokur'
same property. As a result, it is proved that any finite set of subspaces F"fF" is deducible; this is how the distributivity of the initial lattice is proved. Problem. Let f be an arbitrary element of free algebra F = k(X).Is the lattice Lat (F"fF",F',n, rn, I 2 0) distributive in the following sense: there exists an n = n u ) 3 2, such that the identity V f l (X:=l V,) = X=l( V n Z i z i V)) holds in the lattice? From a positive solution of this problem immediately follows a positive solution of the word problem for associative algebras with one relation. 04. Finite approximation
An algebra A over k is called finitely approximable, if it can be approximated by finite-dimensional algebras. It is easy to observe that any recursively presented finitely approximable algebra over a constructive field has solvable word problem. The study of finitely approximable associative algebras was started in A.I. Malcev's work "Representations of infinite algebras" (1943). An algebra A over k is called representable, if A is embeddable in an algebra of matrices over any extension K of the field k. The representability of the algebra is equivalent to its finite approximability by algebras of bounded dimension. Of the results obtained by Malcev in this paper I shall point out only one: Every f.g. commutative algebra is representable and therefore is finitely approximable. Let 92 be any variety of associative algebras. Let us call the variety %' locally representable (1.r.) (respectively, locally finitely approximable - l.f.a.), if m y f.g. algebra from 92 is representable (finitely approximable). Because of this work by Malcev a natural problem arises: to describe (in terms of identities) 1.r. and, separately, 1.f.a. varieties of associative algebras. The first progress towards the solution of this problem was made by V.T. Markov in 1975. Markov proved that the variety of Engel algebras "x, Y I,.
. . ., y 1 = 0
turns out to be 1.r. (here [x, y ] = x y - y x ) . The problem formulated above was finally solved by A.Z. Ananin. H e proved
Decision problems for ring theory
63
Theorem 1 (A.Z. Ananin). Let k be an infinite field and let 2Jl be a variety of associative algebras over the field k. Then the following conditions on taZ are equivalent (1) zlll is Lr., (2) zlll is l.f.a., ( 3 ) in taZ an identity of the kind [[xlYl,...,Ylr"[[t,u],...,u] =o
holds. The same varieties have several more remarkable properties: Conditions (1)-(3) are equivalent to the following conditions as well. (4) (A.Z. Ananin, I.V. Lvov) n is a locally weak Noether variety (i.e. in any f.g. algebra from taZ t h e maximal condition for two-sided ideals holds). It is clear that for the variety of associative commutative algebras over k Condition 4 is the Hilbert basis theorem. (5) (Yu.N. Malcev) mZ is a local Hopf variety (i.e. any f.g. algebra from taZ is Hopfian). The variety of commutative algebras is local Hopf. This was proved by A.I. Malcev in t h e work mentioned above. Let us call a variety zlll representable if every algebra from taZ is representable in a matrix algebra over a commutative algebra. J. Levin proved (1972) that the variety roZ([x, y ] [ z ,t ] = 0 ) is representable.
Theorem 2 (A.Z. Ananin). Every nilpotent variety is representable. This is a solution of one of L. Small's well-known problems, which h e proposed at a conference o n ring theory (Utah, 1971). Theorem 3 (A.Z. Ananin). Every group ring with a polynomial identity (PI - group ring) is representable. This is a solution of a problem in Passman's book in group rings.
15. Free Lie algebras In the theory of infinite-dimensional Lie algebras the theory of free Lie algebras and free products of Lie algebras occupies the main place at present. The foundations of this theory and some effective methods used in papers by other authors were created by A.I. SirSov. As in
64
L.A. Bokut'
group theory, the study of free Lie algebras and free products of Lie algebras is connected very closely with decision problems. 1. Bases of free Lie algebras The first method for constructing bases of free Lie algebras was given by M. Hall (1949). The general scheme of the construction of such bases was published by A.I. SirSov in 1962. This scheme was applied in my work (1964), in which bases of free polynilpotent Lie algebras were constructed. Namely, let n,, . . . ,nk be positive integers, L = k(X),. be a free Lie algebra, R = L / ( ( ( L " I ) I ) - . be the free polynilpotent Lie algebra corresponding to the sequence ( n , ,. . . , nk). In the work mentioned, a basis 9 = 9' U 9"of the algebra L is constructed such that 9' is a basis of the ideal (((L"1)T).. .)"k and the images of the words from W'form a basis in the algebra R. So, R has a recursive basis. Among all the bases proposed by A.I. SirSov, the most useful basis is the so-called SirSov basis of right words (1958). The basis has been used in SirSov's work to prove the following assertion. .)"k
Theorem 1 (A.I. SirSov). A n arbitrary Lie algebra with countable dimension is embeddable in Lie algebra with 2-generators. 2. The intersection of finitely generated subalgebras of free Lie algebras The following question was open for a long time: Is the intersection of two finitely generated subalgebras of a free Lie algebra again a finitely generated subalgebra? The analogous property of groups was proved by Howson (1954). Recently this question was solved positively by G.P. Kukin. Theorem 2 (G.P. Kukin). Let L = k(X),,be a free Lie algebra ouer an arbitrary field k. Then its f.g. subalgebras form a sublattice of the lattice of all subalgebras.
In the case k has characteristic p >0, this property was proved earlier in Kukin's work (1972). 3. The semigroup of the varieties of Lie algebras Let k be a field of characteristic 0. Let rLle be the semigroup of Lie varieties. In 1967 V.A. Parfenov proved the following theorem, which is analogous to the well-known theorem by Neumann and Smelkin for groups :
Decision problems for ring rheory
65
Theorem 3 (V.A. Parfenov). The semigroup rLle of varieties of Lie algebras over a field of characteristic zero is a free semigroup with zero and unit. 4. Subalgebras of free products of Lie algebras Let L , i E J be Lie algebras over field k , let L = I l f E J L be i their free product. For a long time there existed the Kuros-Witt conjecture that any subalgebra A C L has the following form:
A=F*n*(AnLi) where F is a free Lie algebra. This conjecture was refuted by A.I. SirSov in 1962. Theorem 4 (A.I. SirSov). The Kuros- Witt conjecture is not true. The counter-example is the following pair:
L = ( a ) * ( b , c ; c b= b ) , A
= (ba,ca).
The problem stated by A.I. SirSov on the description of the subalgebras of free products of Lie algebras was solved by G.P. Kukin (1972) who proved the following theorem: Theorem 5 (G.P. Kukin). If A C L
[
= II:,,Li
then
A = F * n * ( A"Li)]/N iEJ
where the generating elements of ideal N can be written out in terms of the multiplication table of algebras Li. In particular, if A f l Li = 0 , then A is a free Lie algebra; the Cartesian subalgebra of the algebra L (the kernel of the natural homomorphism 11* Li + C L i ) turns out to be the free subalgebra, etc.
5. Solvable Lie algebras with 2 generators Recently G.P. Kukin proved the following theorem, which was open for a long time: Theorem 6 (G.P. Kukin). A n y solvable countable -dimensional Lie algebra with solvability length n is embeddable in a solvable Lie algebra with 2 generators and solvability length n + 2.
L.A. Bokut’
66
This result is analogous to the well-known theorem of Neumann for groups (1959). As in the case of groups, the result can be formulated for Lie algebras from an arbitrary variety. 96. Alternative, Jordan and Malcev
rims
Besides the classes of associative and Lie rings, the classes of alternative, Jordan and Malcev rings play a significant role in ring theory. Let us recall that the definitions of these classes are given by the following identities: ( x , x , y ) = 0 , ( y , x , x ) = 0 (alternative rings), (here ( x , y , z ) = ( x y ) z - x ( y z ) is the associator),
xy
=
yx, (x’y)x = x’(yx)
(Jordan rings),
x 2 = 0, J(x, y , x z ) = J ( x , y , z ) x
(Malcev rings)
(here J ( x , y , z ) = ( x y ) z + ( y z ) x + ( z x ) y is the Jacobian). Each of these classes of rings has a very rich internal theory and many connections with other fields of mathematics, so that the impo;tance of studying these classes is clear. In this section I shall briefly dwell upon the following questions, without any claim of completeness or depth. 1. Simple rings. 2. Finitely generated rings. 3. Free rings. 1. In the case of alternative and Malcev rings a situation takes place which is opposite to that mentioned in 93 for associative algebras and Lie algebras. Namely, in the case of alternative rings there is a wellknown theorem, following from the work of Kleinfeld, that any simple alternative ring is either associative, or is the (8-dimensional) Cayley-Dickson algebra over its own centre. An analogous situation takes place for Malcev algebras.
Theorem 1 (V.T. Philippov). Any central simple Malceu algebra is either a Lie algebra or 7-dimensional algebra A which is adjoint to the Cayley-Dickson algebra A . (e’
Theorem 1 was obtained earlier for finite-dimensional Malcev algebras by E.N. Kuzmin (1968). For Jordan rings the following question is still open: Is any simple
Decision problems for ring theory
67
Jordan ring either a finite-dimensional algebra over its own centre or a special Jordan ring? 2. In the case of alternative rings any 2-generator ring is known to be associative (Artin). The following theorem shows that not every countable alternative ring is embeddable in a 3-generator alternative ring. Theorem 2 (G.V. Dorofeev). A n y 3-generator alternative ring satisfies the non-trivial identity ( [ x ,y ] [u,v ] ,z , t ) = 0 where [x, y ] = xy - yx, a 0 b = ab + ba. 0
An interesting question was whether there exists a natural number n such that every countable ring is ernbeddable in an n-generator alternative ring. Recently this question was solved by I.P. Sestakov. H e has proved Theorem 3 (I.P. Sestakov). There exists no natural number n, such that any countable generated alternative algebra is embeddable in an n generator alternative algebra. Sestakov has also proved that t h e same theorem is true for Malcev algebras. Theorem 4 (A.I. SirSov). A n arbitrary Jordan algebra over k with 2 generators is special. Here k is a commutative ring of operators containing 1 and the element 1/2; a Jordan algebra embeddable in an algebra A' = ( A ,a b = f ( a b + b a ) ) adjoint to an associative algebra A is called special. Thus, not every countably generated Jordan algebra is embeddable in a Jordan ring with 2 generators. 3. I shall point out t h e following two theorems on free alternative rings. The first of them deals with t h e question, which is still open, about t h e existence of a simple effective basis in such rings. 0
Theorem 5 (A.I. SirSov). In a free alternative ring / c ( X ) ~the , , set of r2-wordsfrom X forms a linear system of generators of this ring (this system is dependent). Every word of the kind ((x,,x,J.. . x,") (with the right order of brackets) is called a r,-word from generators X = { x , } ;r2-words are
68
L.A. Bokut'
-
rl-words from rl-words, i.e. the words of the kind ((ulu2) ur), where ui are r,-words. Theorem 6 (I.P. Sestakov). The set of nilpotent elements of a free alternative ring k ( X ) A , , ,where k is an integral domain, forms an ideal which coincides with the quasi-regular radical of this ring. There exists an algorithm for answering the question of whether an element f E k (X),,, is nilpotent.
It is known (Kleinfeld and Humm), that for I X I 3 4 the quasiregular radical of the ring k ( X ) A l is t non-zero (for I X I = 3 this question is still open). K.A. ievlakov (1972) proved that any element of the quasiregular radical of a free alternative ring is nilpotent. References [l] ,A.Z. Ananin, Locally finitely approximated and locally representable varieties of associative algebras, 14th All-Union Student Conference, Novosibirsk, 1976, p. 15. [2] L.A. Bokut, Embedding of Lie algebras in algebraically closed Lie algebra, Algebra and Logic 1, No. 2 (1962) 47-53. [3] L.A. Bokut, Some embedding theorems for rings and semigroups, I, Sib. Mat. Z. 3, No. 4 (1963) 500-518. [4] L.A. Bokut, Bases of free polynilpotent Lie algebras, Algebra and Logic 2, No. 3 (1963) 5-18. [5] L.A. Bokut, Unsolvability of some algorithmic problem in the class of associative rings, Algebra and Logic 9, No. 2 (1970) 137-144. [6] L.A. Bokut, Unsolvability of the word problem and of the subalgebra of finitely presented Lie algebras, Izv. Akad. Nauk Ser. Matem. 36, No. 6 (1972) 1173-1219. [7] L.A. Bokut, Unsolvability of some algorithmic problems for Lie algebras, Algebra and Logic 13, No. 2 (1974) 145-152. [S] L.A. Bokut, Embeddings in simple associative algebras, Algebra and Logic 15, No. 2 (1976) 117-132. [9] V.N. Gerasimov, On distributive lattices of subspaces and the word problem for associative algebras with one relation, Algebra and Logic 15, No. 4 (1976) 384-435. [lo] G.P. Kukin, Primitive elements of free Lie algebras, Algebra and Logic 9 (1970)
-
458-472.
[ l l ] G.P. Kukin, De carte subalgebras of free Lie products of Lie algebras, Algebra and Logic 9 (1970) 701-713. [12] G.P. Kukin, Subalgebras of free products of Lie algebras, Algebra and Logic 11, No. 1 (1972) 59-86. [13] G.P. Kukin, On the embedding of solvable Lie algebras in solvable Lie algebras with 2 generators, Algebra and Logic 14, No. 4 (1975) 414-421. (141 I.V. Lvov, Maximality condition in algebras with identical relations, Algebra and Logic 8 (1969) 449-454. [15] A.I. Malcev, Representations of infinite algebras, Matem. Sbornik 13 (55) (1953) 263-286.
Decision problems for ring theory
69
[16] A.I. Malcev, Algorithms and recursive functions (Moscow, Nauka, 1%7). [17] A.I. Malcev, Introduction in the theory of algebraic systems (Moscow, Nauka, 1972). [18] Yu. N. Malcev, Some properties of the varieties of the associative algebras products, Algebra and Logic 11, No. 6 (1972) 651-670. [19] V.A. Parfenov, Varieties of Lie algebras, Algebra and Logic 6 (1%7) 61-73. [20] V.A. Parfenov, On one of the properties of ideals of free Lie algebras, Algebra and Logic 10 (1969) 940-944. [21] A.I. Sidov, Subalgebras of free Lie algebras, Matem. Sbornik 33 (75). No. 2 (1953) 441452. [22] A.I. Sidov, On free Lie rings, Matem. Sbornik 45 (1968) 113-122. [23] A.I. Sidov, Some questions of the theory of rings close to associative rings, Uspeki Matern. Nauk 13, No. 6 (1958) 3-20. [24] A.I. Sidov, Some algorithmic problem for &-algebras,Sib. Math. J. 3, No. 1 (1%2) 132-137. (251 A.I. SirSov, Some algorithmic problems for Lie algebras, Sib. Math. J. 3, No. 2 (1%2) 292-296. [26] A.I. Sidov, On one of the hypotheses of the theory of Lie algebras, Sib. Math. J. 3, NO. 2 (1962) 297-303. [27] A.I. Sidov, On the bases of free Lie algebras, Algebra and Logic 1, No. 1 (1%2) 14-19. (281 A.I. Sidov, Some algorithmic questions for solvable Lie algebras, Uspeki Matem. Nauki 17 (1962) 228. [29] A.I. Sidov, On special J-Rings, Matem. Sbornik 38, No. 2 (1956) 149-166. [30] A.I. Sidov, On some of non-associative nil-rings and algebraic algebras, Matem. Sbornik 4, No. 2 (1957) 381-394. [31] G.V. Dorofeev, Alternative rings with 3 generators, Sib. Math. J. 4, No. 5 (1963) 1029- 1048. [32] K.A. kevlakov, Quasi-regular ideals in finitely generated alternative rings, Algebra and Logic 11, No.2 (1972) 140-161. [33] E.N.Kuzmin, Malcev algebras and their representations, Algebra and Logic 7, No. 4 (1968) 48-69. [34] V.T. Philippov, On zero divisors and nil-elements in Malcev algebras, Algebra and Logic 14, No. 3 (1976) 204-214. 1351 1.P. Sestakov, Radicals and nilpotent elements of free Alternative algebras, Algebra and Legic 14, No. 3 (1975) 354-365.
S.I.Adian, W.W. Boqne, G. Higman, eds., Word Problems I1 @ North-Holland Publishing Company (1980)71
ERRATUM: THE EXISTENCE OF INFINITE BURNSIDE GROUPS J.L. BRITTON Queen Elizabeth College, London
A mistake in my article [3] was pointed out by S. Adian [ 2 ] . However I have not investigated the nature of this mistake: I believe that what is now primarily needed is a much simpler proof, in which no attention is paid to t h e value of the lower bound A, of the result in question; namely, that there exists an A such that for all m 2 2 and all odd integers n > A the Burnside group B ( m , n ) on m generators and with exponent n is infinite. (This was proved originally by Adian and Novikov [ l ] and subsequently by Adian [ 2 ] . )Conceivably, such a proof might contain ideas from [I], [ 2 ] and [3]. It is possible that such a proof would also throw light on the outstanding unsolved problem in the field: is B ( m , 2 ' ) infinite for any k 2 I? However I am inclined to pessimism since I believe that the answer is negative. Reference [I] P.S. Novikov and S.1. Adjan, On infinite periodic groups, Izv. Akad. Nauk SSSR, Ser. Mat. 32 (1968) 212-214, 224-251, 709-731. [2] S.I. Adian, The Burnside problem and identities in groups, Nauka (1975). [3] J.L. Britton, The existence of infinite Burnside groups, in Word Problems: Decision problems and the Burnside problem in group theory, Studies in Logic and Foundations of Mathematics (North-Holland Publishing Co., Amsterdam, 1973).
71
S.I. Adian, W.W. Boone, G. Higman, eds., Word Problems I1 North-uolland Publishing Company (1980) 73-80
ON SEMIFIR CONSTRUCTIONS P.M. COHN Bedford College, London
$1. Introduction
Free ideal rings or firs (the definition is recalled in §2) form an analogue of principal ideal domains, to which they reduce in the commutative case, and to obtain examples of firs one can use the weak algorithm and its variants. Semifirs form a wider class and it is less easy to construct semifirs (that are not also firs) directly. A possibility is to form direct limits or ultraproducts of firs, but a more interesting way is to start from n-firs, for the examples so obtained show that semifirs do not form an elementary class. This is in sharp contrast to the commutative case, where semifirs correspond to Bezout domains (an elementary class). After describing the necessary background in $52-3 (including a 'small cancellation' theorem needed to construct n-firs) we examine the ultraproduct construction more closely. It turns out that a countable ultraproduct of n-firs for increasing n (with a non-principal ultrafilter) is always a semifir but never a fir, except in trivial cases. On the other hand, an uncountable ultraproduct may well be a genuine fir. $2. Firs, semifirs, n-firs Throughout, all rings are associative, with a unit-element 1 which is inherited by subrings and preserved by homomorphisms. Mostly our rings have a commutative field k as coefficient domain, i.e. they are k -algebras. A free k-algebra on a set X, k ( X ) can be defined by a universal property, although often a more pragmatic definition (in terms of a normal form for its elements) is preferred. If we examine the sense in which relations in k ( X ) can be described as trivial, we are led to the following Definition. Let R be a ring; a relation (1)
x.y =xlyl+***+x"y,=o 13
(Xi,YiER)
74
P.M. Cohn
is said to be trivial if for each i = 1,. . ., n either x , = 0 or y , = 0 . If there is an invertible n x n matrix P over R such that the transformed relation xP-’ . Py = 0 is trivial, t h e relation (1) is said to be triuialized by P. It turns out that every relation in k(X)is trivializable in this way; this is proved by first showing that k(X)possesses a ‘weak algorithm’, an analogue of the Euclidean algorithm to which it reduces for commutative rings (cf. [S], Chapter 2). Next it is shown that in a ring with a weak algorithm all relations are trivializable. We shall not repeat the proofs, but to elucidate the connexion we quote the following result from [4] (cf. [S], Chapter 1):
Theorem 1. Let R be a non-zero ring and n 2 1, then the following conditions are equivalent : (a) every relation (1) with at most n terms is trivializable, (b) any right ideal of R generated by m d n elements which are right linearly dependent ouer R can be generated by fewer than m elements, (c) any right ideal on at most n generators is free, of unique rank. A non-zero ring satisfying these equivalent conditions is called an n-fir. By (a) this condition is left-right symmetric while (b) shows that the class of n-firs is elementary, i.e. it can be defined by a single elementary sentence. Thus for any n 2 1, both the class of n-firs and its complement are closed under ultraproducts (cf. [ 5 ] , Chapter 5 ) . It is easily checked that I-firs are just (non-commutative) integral domains; for increasing n the n-firs form smaller and smaller classes until we reach semifirs, defined as n-firs for all n. From Theorem I we see that a ring R is a semifir if and only if every finitely generated right ideal is free, of unique rank. Thus semifirs can be defined by an infinite set of elementary sentences, so t h e class of semifirs is closed under ultraproducts. As we shall see later, its complement is not closed under ultraproducts, so the class of semifirs is not elementary. By contrast, in t h e commutative case a 2-fir is already a semifir (a commutative 2-fir is a Bezout domain, i.e. a domain in which every finitely generated ideal is principal), so the class of commutative semifirs is elementary. Now a right fir ( = free ideal ring) is defined as a ring in which every right ideal is free, of unique rank. Unlike semifirs right firs are n o longer left-right symmetric (cf. [S], Chapter 2), so we must also define left firs, and by a fir we shall understand a left and right fir. In the
O n semifir consmctwm
75
commutative case firs reduce to principal ideal domains, while semifirs reduce to Bezout domains, as noted earlier. This remark already shows that t h e class of firs is not closed under ultraproducts, and so cannot be defined by a set of elementary sentences. Examples of firs exist in profusion; they include (a) principal ideal domains (even non-commutative), (b) free algebras over a field, (c) free products ( = coproducts) of skew fields. Semifirs are a little harder to come by; as in the commutative case we can obtain them as direct limits of firs, or also as ultraproducts of firs. But now we can also form ultraproducts (or direct limits) of n-firs, for increasing n. For this method to be of use we shall need a source of n-firs; this is provided by a result to which we now turn. 83. A ‘small cancellation’ theorem for rings Our aim in this section is to describe a result which enables one to recognize from certain ring presentations that t h e ring defined is an n-fir. By definition this just means that every relation (1) of at most n terms is trivializable, and one would expect a condition for this to be expressed by saying that all the defining relations can be expressed as relations of N > n terms with little interference, so that no nontrivializable relations of S n terms can be deduced by cancellation. Such a theorem was proved in [7] (see also [6] for a special case). To state it we need some definitions. Let k be any commutative field and let R be a k-algebra, generated by a set U ; then R can be expressed as a homomorphic image of the free k -algebra F = k( U) : R = F / N . On F we define a filtration u by assigning arbitrary positive integer values u ( u ) to each u E U, putting u ( u I * u,) = u(ul) + . . . + u ( u , ) and for any f = C a,m E F writing
where m runs over all products of u ’ s and a, E k. If the homomorphism F + R is denoted by a H a * , then the filtration u can be defined on R by writing for r E R, We now turn to t h e defining relations. If one of them is linear, C a,ui + /3 = 0 (ai,/3 E k), we can use it to eliminate one of t h e generators, because k is a field. So we may assume that there are no linear relations. Now relations of higher degree may be expressed as
P.M. Cohn
76
relations of degree two, by introducing further generators. Thus the defining relations for R may all be taken to be of the form
C N
(2)
I
~ i yi
b = 0,
where xi, yi E U and b is an expression in the generators with no term having xi as left factor. We shall assume that all defining relations are of the form (2) with the same value of N ; this can always be achieved by incorporating supernumerary terms in b. Now each element f of R can be expressed as a linear combination of products of the generators. Such an expression is said to be in reduced form for the suffix 1 if no term contains a factor x l y l for any of the defining relations (2). Any f E R can be expressed in reduced form by writing down any expression for f and then applying the move
arising from (2), for all the defining relations, wherever possible. Under suitable conditions a reduced form for the suffix 1 for f (not necessarily unique) is reached after a finite number of moves. If for each f E R there is just one reduced form we shall call it the normal form of f and denote it by [ f l l ; we also say then that a normal form for the suffix 1 exists. In a similar way we can define a reduced and a normal form for the suffix v ( v = 2 , . . . , N ) by using in place of (3) the move
Such a normal form will be denoted by [f].. Now we can state the result:
Theorem 2. Let R be a k-algebra generated by a set U containing a family of distinct elements (xi,) ( i E I, v = 1 , . . . ,N ) , another family of distinct elements (y,) (jE J, v = 1,. . . ,N ) and possibly other elements z h ( h E H ) . These generators are assigned positive integer values in any way such that v(xi.) = 1 and v(y,) is independent of v. Assume that R has a complete set of defining relations indexed by some subset of I x J :
where bij is an expression of value v(bii)d 1+ v ( y p j )in U and it has no term of this value with any xi,, as left factor. Moreover, assume that a normal form exists for each v = 1,. . . , N, satisfying
O n semi@ consmcctions
n
N , If f g is in normal form for v and v ( g ) > 0, then #or any h E R, the terms of highest value in f [ g h I yare in normal form for v. Then R is an n-fir, for all n C N The proof consists in verifying that R satisfies a truncated form of the weak algorithm ((N - 1)-term weak algorithm, cf. [7]). In spite of its complexity the condition is a very natural one, easy to check in most concrete cases. 04. Direct limits and ultraproducts
It is clear that any direct limit (over a directed system) of firs is again a semifir; more generally, a routine argument shows the truth of
be a direct system of rings. Suppose that for Theorem 3. Let {R,, each n 2 1 there exists A, E I such that R, is an n-fir for all a > An, then + lim R, is a semifir. This result enables one to construct semifirs from n-firs, but there is no guarantee that the limit will be non-trivial; it may well reduce to 0 or to the ground field k unless precautions are taken. For ultraproducts the situation is rather better. We recall that in a fir each element not zero or a unit can be written as a product of a finite number of atoms ( = unfactorable elements) and any two complete factorizations of c have the same number of factors; this is called the length of c, written l(c). Moreover, the factors can be paired off into pairwise similar ones; the precise condition need not concern us here, it is expressed by saying that the ring is a unique factorization domain, or UFD for short. In the commutative case this reduces to the usual definition (cf. [8], Chapter 3). We also recall that a semifir is a UFD if and only if each non-unit # 0 can be written as a product of a finite number of atoms. Theorem 4. Let {R,}I be any family of rings and R = ll R,/D their ultraproduct with respect to a non-principal ultrafilter D. If for each n 2 1, the set {a E I R, is an n-fir} is in D, then R is a semifir. If moreover, I is countable, then R is either a skew field or not a fir.
I
Proof. That R is a semifir is clear from the definition of ultraproduct, because n-firs form an elementary class. Now let I = N say; if the set { n E N R, is a field} is in D, then R is a field. Otherwise there exists
1
P.M. Cohn
78
N o € D such that R. is not field for all n E No. Let a. E R. be neither 0 nor a unit for n E No and a. = 1 for n 6Z No, and consider a = (a:) E nR.. The image ti of a in R is a non-unit, and it can be written as a product of n non-units, for each n. Hence R is not a UFD and a fortiori not a fir. For uncountable index sets the situation can be very different, especially if we have an wl-complete ultrafilter. Here w I denotes the first uncountable cardinal, and for any infinite cardinal d an ultrafilter D on a set I is called d-complete if the intersection of any set of fewer than d members of D belongs to D. This is the case if and only if, for any partition of I into fewer than d parts, one of the parts belongs to D (cf. [3] p. 180). Theorem 5 . Let { R , } , be an uncountable family of f i r s and let D be an wl-complete non-principal ultrafilter on I, then the ultraproduct R = Il R,lD is a unique factorization domain. Proof. As noted earlier, in a fir each non-zero element a has a finite length l ( a ) . Given a = ( a , ) E n R,, put P. = {a E I1 l ( a , ) = n } , P, = {a E I a, = 0 } , then the P's form a partition of I into w < w1 sets, hence one of them, say P,, lies in D, and a is a product of nl atoms (or a = O ) . Thus R is a semifir in which each non-zero element has finite length, i.e. R is a UFD. In the commutative case this result shows that an ultraproduct of an uncountable family of principal ideal domains with an wl-complete non-principal ultrafilter is again principal. The same method of proof will show that under the conditions of the theorem R has ACC. (i.e. the ascending chain condition on ngenerator right ideals) for any n, but it is not known whether R is necessarily a fir. If however, each R, is a ring with weak algorithm, then R will again have weak algorithm and hence be a fir.
I
95. Examples
With the help of Theorem 2 it is easy to give examples of (n - 1)-firs that are not n-firs. We simply take 2n generators x , , y , with the defining relation xly,+ * *
*
+ x.y.
= 0.
This is not trivializable (otherwise every n-term relation would be
O n semifir constructions
79
trivializable and every ring would be an n-fir!). But the conditions of Theorem 2 are satisfied, so we have an (n - 1)-fir Llsay. If we take the ultraproduct of LI,Lz, .. . with a non-principal ultrafilter, we obtain a ring which is an n-fir for all n, i.e. a semifir. Thus semifirs do not form an elementary class. A slight modification shows that the class of rings embeddable in skew fields is not elementary. This problem, raised by Mal’c,ev [lo] was answered in [9] using the compactness theorem. Here is a more direct proof: Let R,-, be generated by 2(nZ+ n ) generators aiA,bAi ( i = 1 , . . . , n, A = 1,. . . , n,+ 1) with the defining relations (4)
In matrix terms we have an n x ( n + 1) matrix A and an (n + 1 ) x n matrix B such that AB = I, BA = I. Thus R.-l does not have invariant basis number (cf. [6]) and so is not embeddable in a skew field. But the relations (4) satisfy the conditions of Theorem 2 (cf. [6, 7]), hence R.-l is an ( n - 1)-fir. Now the ultraproduct of all the R. (with a nonprincipal ultrafilter) is a semifir, and by the results of [8], Chapter 7, is embeddable in a skew field. Therefore the class of rings embeddable in skew fields is not elementary, although, by general results of universal algebra (cf. [5, lo]) it can be defined by an infinite set of sentences. This can also be seen from the explicit form of the embeddability conditions given in [8]. In the last example we saw that when a relation such as ab = 1 is interpreted as a relation between n x n matrices, an (n - 1)-fir results. This idea has been made precise and proved generally by G.M. Bergman [2]. His result (referring to square matrices only) may be stated as follows: Let R be any k-algebra; if we take any presentation of R and interpret all relations as relations between n x n matrices, we obtain an n x n matrix ring over an (n - 1)-fir. Bergman gives two proofs, one based on results in [l], the other based on [7]. In this way one obtains a family of n-firs (for all n ) from any given k-algebra; in general there is no way of making this family into a direct system, but by taking ultraproducts one can again obtain semifirs. References [l] G.M. Bergman, Modules over coproducts of rings, Trans. Amer. Math. SOC. 200 (1974) 1-32.
80
P.M. Cohn
[2] G.M. Bergman, Coproducts and some universal ring constructions, Trans. Amer. Math. Soc. 200 (1974) 33-88. [3] C.C. Chang and H.J. Keisler, Model theory, Vol. 73, Studies in logic (North-Holland Publishing Co., Amsterdam, 1973). [4] P.M. Cohn, Free ideal rings, J. Algebra 1 (1964) 47-69. (51 P.M. Cohn, Universal algebra (Harper and Row, New York, London, Tokyo, 1965). [6] P.M. Cohn, Some remarks on the invariant basis property, Topology 5 (1966) 2 15-228. [7] P.M. Cohn, Dependence in rings 11. The dependence number, Trans. Amer. Math. SOC. 135 (1969) 267-279. [8] P.M. Cohn, Free rings and their relations, LMS monographs No. 2 (Academic Press, London and New York, 1971). [9] P.M. Cohn, The class of rings embeddable in skew fields, Bull. London Math. Soc. 6 (1974) 147-148. [lo] A.I. Mal’cev, Algebraic systems (Springer, Berlin, 1973).
S.I. Adian, W.W. Boone, G. Higman, eds., Word Problems I1 @ North-Holland Publishing Company (1980) 81-85
CONJUGACY AND THE HIGMAN EMBEDDING THEOREM Donald J. COLLINS Queen Mary College, London
The Higman embedding theorem asserts that if a finitely generated group has a recursively enumerable set of defining relations, then it can be embedded in a finitely presented group. Subsequently, Clapham showed that t h e Higman construction preserves the solvability of the word problem in the sense that if the original finitely generated group has solvable word problem, then so does t h e finitely presented group in which it is embedded. Our purpose here is to show that the analogue of Clapham's theorem for the conjugacy problem is false. Let C = (cl,c 2 , .. . ,c , ; r = 1, r E R ) be a finitely generated group with a recursively enumerable set R of defining relations. It is convenient to assume, as we may, that R = R ( c , ) is the normal subgroup of the free group F ( c , )= F(cl,c2,.. . , c , ) consisting of every relation of C. As proved by Higman [4], C can be embedded in a finitely presented group G in the following manner. Let F ( a , ) and F ( b , ) be free groups isomorphic to F ( c , ) . From these form the amalgamated free product
H = ( F ( a , ) * F ( b , ) ; R ( a , )R=( b , ) ) . Then H can be embedded in a finitely presented group L. Next form the direct product L x C and finally the HNN extension
G
=
(L x C,r ; t-'a,r = a,, r-'b,t = b,, i
=
1,2,.
. . ,n ) .
Then G is, in fact, finitely presented. We fix t h e above notation and are ambiguous, leaving the context to clarify, in our use of notation for an element of a free group and its image in a factor group.
Proposition 1. Let x ( a i )E F ( a i ) and suppose (a) x ( a i ) is cyclically reduced in F ( a , ) , (b) x ( a i ) is nor a proper power in F ( a i ) , (c) x ( a i ) @ R ( a i ) . 81
82
D.J. Collins
Then for any u(ci), the subgroup of C generated by x ( c i ) contains u ( c i ) if and only if x(a,)u(ci)t-'and x(a,)t-' are conjugate in G. Proof. Suppose x(a,)u(c,)t-' and x(a,)t-' are conjugate in G. Then by Collins' Lemma (see [ 5 ] ) , they must be conjugate by an element w(a,, b,). The conjugacy is then valid in the group
( H x C, t ; t-'a,t
=
a,c,, t - ' b , t
=
b,, i
=
1,2,.
. . ., n ) .
From the equation w(a,,b,)-'x(a,)u(c,)t-'w(a,, b,) = x(a,)t-' we obtain, via the relations t-'a,t = ax,,
(1)
t-'b,t =
b,,
w(a,,b,)-'x(a,)w(a,,b,) = x ( a , )
in H,
(2)
U(C,)Z(Ci)=
1
in C, where w (a,c,,b,) = w(a,,b,)z(c,)in H X C. As x ( a i ) R ( a i ) it follows from (1) and the structure of H as amalgamated free product that w(a,,b i )E F(a,). Then from (a) and (b) in the hypotheses, we see that, for some integer m, w(ai,6 , )= x(a,)"' in F(a,). This means that z ( c # = ) X ( C , ) " ' in C as required. Conversely suppose that u(c,)x(ci)"'= 1 in C. A simple calculation verifies that x (a,)-? (a,) u (c, ) t - ' x (a,)" = x (a,) t - ' .
Corollary 2. Let C have unsolvable power problem and solvable order problem. Let C be embedded in G via the Higman embedding. Then G has unsolvable conjugacy problem. Proof. Let X be t h e set of all cyclically reduced words of F ( c , ) that are not proper powers in F ( c , ) and do not lie in R ( c i ) . Since C has solvable order problem this is a recursive set. On the other hand, since C has unsolvable power problem there is no algorithm to decide of an arbitrary element z of F(c,) and an arbitrary element x of X whether or not z lies in the cyclic subgroup of C generated by x. By Proposition 1 G must have unsolvable conjugacy problem. We give another demonstration of the way in which the Higman embedding fails to preserve the solvability of the conjugacy problem.
Conjugacy and the Higman embedding theorem
83
Proposition 3. Let xk, y k , k = 1,2,3,4 be elements of F(c,), none of which lies in R ( c , ) . Let w = w(a,,b,) be an element of the free product F ( u ~*)F(bi). If w - 1 ~ l t ~ z t - I ~ 3wt ~=4yltyzt-ly3ry4r-1 t-l
in G then, in C, x I = y l , z-'xzz = y 2 , x3 = y 3 and
z-'x4z = y 4 ,
where w(aici,b i )= w ( a , b i ) z ( c i )in H x C. Proof. The postulated equality must hold in the HNN extension (H x C, t ; t-'ait = sic,, t-'bit = bi, i
=
1 , 2 , . ..,n ) .
Using t as stable letter, we see that x4w(aici,bi)y;' lies in the subgroup (sic,, bi). But then, clearly, x4w(aic,bi)y;' = w(aic,bi). This gives Z - I X q Z = y 4 and
w (ai,b i ) - 1 ~ l t ~ Z (ai, r - Ib ~i )= 3 ~yltyzt- l y 3 . From this we obtain x 3 w ( a ,bi)y;' = w ( a , b i ) and hence x3 = y 3 and w-lxltxzt-lw = y l f y 2 f - l . A similar argument gives z-'x2z = y z and X I = y1.
Corollary 4. Let C have solvable conjugacy problem but suppose there is no algorithm to decide of two arbitrary pairs u I , u1 and u2,u2 of elements of F ( c i ) whether or nor there exists z in F ( c i ) such that z-'ulz = u1 and z-'u2z = u2 in C. If C is embeddable in G via the Higman embedding then G has unsolvable conjugacy problem.
Proof. Let x and y be elements of F ( c i ) such that x # y in C and x # 1, y # 1 in C (i.e. x f y mod R ( c i ) and x, y 6Z R ( c i ) ) .For any two pairs ul, uI and uz, uz of elements of F ( c i ) , none of which lies in R ( c i ) , it follows from Proposition 3 that if ~ t u ~ f - ~ and y f ~ xtulf-1ytu2f-' ~ f - ~ are conjugate in G then there exists z in F ( c i ) such that z-'uIz = u1 and z-'u2z = uz in C. Conversely, if such a z = z ( c i ) exists then the given words ~ t u ~ t - ~ y t and u ~ txtult-'ytuzt-' -~ are easily seen t o be conjugate in G by z ( a i ) . Since C has solvable conjugacy problem it follows that even when the assumption that none of ul, u2, ul, u2 lies in R ( c i ) is added there is still no algorithm to decide whether or not there exists z such that z-'ulz = ul and z-'u2z = u2 in C. Thus G must have unsolvable conjugacy problem.
D.J. Collins
84
Example. We specify a group in which t h e hypotheses of Corollary 4 are satisfied. Let C have presentation (abandoning our systematic notation):
a l , a2, b l , b2, p l , p2, 41, 92,
C I , CZ,
dl, d2,
T I , TZ, S I , SZ;
p;’b’,alb’,p’, = r;’d’,cld{r’, p;’bia2bipi = r;’dic2dM q;’q;’bf(l)a I bf(” I q;q;= s~’s~’d~“’cldf“’s;s; q ;‘q ;‘b{“’a2b{(“q;q ; = s 7’s;‘d {“’c2d{(‘)s;s ; where i and j range through the positive integers and f is a one-to-one recursive function of t h e positive integers with non-recursive range. Then C has solvable conjugacy problem and there exists z such that
z-‘b’,alb:z= d’,cld’, and
z - ’ b i a 2 b := dic2di
if and only if j lies in t h e range of f. We shall not give the arguments for this since they are standard and rather similar to t h e kind of argument given by Miller on pp. 46-54 of [ 5 ] . We conclude with some general remarks. There seems at present to be no hope of establishing t h e analogue of Clapham’s theorem. T o prove Higman’s theorem one firstly achieves a kind of logical embedding in a finitely presented group (in our notation C is logically embedded in L since w ( a , )= w ( b , ) in L if and only if w ( c , )= 1 in C). Then, in Paul Schupp’s graphic phrase, one performs the Higman “rope-trick’’ to achieve the actual group embedding. It follows from the author’s paper [3] that at least for Aanderaa’s version [l] of the Higman theorem, solvability of the conjugacy problem can be achieved for t h e logical embedding section of the construction. But, as we have seen, there are substantial difficulties in dealing with t h e “rope-trick’’ part. Furthermore, these difficulties seem to be more or less inevitable given t h e structure of t h e proof and probably a wholly new strategy will be needed to avoid them. For the present the most that can be hoped for is t h e isolation of conditions on C that are necessary and sufficient for the preservation of the solvability of the conjugacy problem in t h e Higman embedding.
References [I]
s. Aanderaa, A
proof of Higman’s embedding theorem using Britton extensions Of groups, in: W.W. Boone, F.B. Cannonito and R.C. Lyndon (eds.), Word problems (North-Holland, Amsterdam, 1973).
Conjugacy and the Higman embedding theorem
85
[2] C.R.J. Clapham, An embedding theorem for finitely generated groups, Proc. Lond. Math. Soc. 3, 17 (1967) 419-430. [3] D.J. Collins, Representation of Turing reducibility by word and conjugacy problems in finitely presented groups, Acta Math. 128 (1972) 73-90. [4] G. Higman, Subgroups of finitely presented groups, Proc. Roy. Soc. Series A, 262 (1%1) 455475. [S] C.F. Miller, On group-theoretic decision problems and their classification (Princeton, 1971).
S.I. Adian, W.W. Boone, G. Higman, eds., Word Problems I1 @ North-Holland Publishing Company (1980) 87-100
SOME SOLVABLE WORD PROBLEMS Trevor EVANS* Emory University, Atlanta, Georgia
Introduction
This is a survey of some algorithms which have been used to solve decision problems (mainly word problems) in various varieties of algebras, e.g., lattices, commutative semigroups, quasigroups. O u r interest is in algebraic properties which imply the existence of such algorithms. There is n o attempt to be encyclopaedic but instead the emphasis is on examples of algorithms which have a universal algebra flavor. In Section 1 we discuss the connection between embedding of partial algebras in a variety and the solvability of the word problem for finitely presented (f.p.) algebras in t h e variety. In Section 2 we consider algorithms based on finite separability properties. Finally, we look at some aspects of normal form theorems. We will work within a finitely presented variety V, i.e., a variety defined by a finite number of finitary operations and a finite set of identities, and by an algebra, we will always mean an f.p. algebra in such a variety.
81. Partial algebras Let V be an f.p. variety and let V* be t h e variety with the same operation type as V but defined by the empty set of identities. For example, if V is t h e variety of commutative semigroups, V* is t h e variety of groupoids. If R is t h e set of operations in a V-algebra, by a partial V*-algebra 9 = (P,R ) we mean a set P of elements and, for each n-ary operation f in 0,a mapping f : S + P where S C P". By a partial V-algebra, we mean a partial V*-algebra 9 = ( P , R ) such that the following conditions are satisfied. (i) Insofar as the identities of V apply to 9, they are satisfied. That * This research was supported in part by U.S. NSF Grant MCS76-06986.
88
T. Evans
is, if u ( x l ,x 2 , x 3 , . . . ) = v ( x I x, 2 , x 3 , . . . ) is one of the defining identities of V and if for any elefnents a r , a 2 , a ,.... , of P substituted for the variables in u = v, the partial 0-operations of 9’ allow both u(al, a2,a , , . . . ) and v ( a , ,a2,a 3 , .. . ) to be computed, then the identity u = v holds for these values. (ii) The partial operations defined in 9’ and the defining identities of V do not allow us to extend the domain of the partial operations of P. In other words, if a substitution of elements of P for the variables in an identity u = v of V enables us to compute one side as a, and the other as f ( a l ,a2,a 3 , .. .), for some f E 0 and a l ,a 2 , a 3 ,... in P, then f ( a l ,a2,a 3 , .. . ) = a, is already given as a value of the partial operation f . We say that the embeddability problem is solvable for V if there is an algorithm for deciding whether any finite partial V-algebra (actually, we could just as well say “any finite partial V*-algebra”) can be embedded isomorphically in a V-algebra. The following theorem is proved in [4]. Theorem. The embedding problem is solvable for the variety V if and only if the word problem is solvable for V.
This theorem is useful in solving the word problem for two reasons. (i) In the most favorable case, any finite partial V-algebra can be embedded in a finite V-algebra. This occurs more frequently than one would expect. (ii) We may be able to give reasonable conditions on V or on partial V-algebras which imply a test for embeddability. For example, V may have the property that if a finite partial V-algebra is embeddable at all, then it can be embedded in a finite V-algebra. If V has the property that any finite partial V-algebra can be embedded, then the algorithm for solving the word problem for an f.p. V-algebra goes as follows. Let d be a V-algebra given by generators a l , a2,a 3 , .. . , and relations r, (ar,a2,a s , .. . ) = r : ( a r a2, , a 3 , .. .), i = 1,2,3,.. . . Let w , , w2 be two words in the generators of d. We wish to decide whether w I = w2 in d. We begin by introducing new generators b l , bz, b 3 , .. . for every word in the a, which occurs as a subword of r,, r:, w 1 or w2. This enables us to rewrite the defining relations of d so that each relation is either of the form b, = b, or f(b,,b,, bk,.. . ) = b, where f is an operation of d. Now, by direct applications of the identities of V and the defining relations of d, we alternately remove redundant generators and introduce new relations of the form f(b,,b,, bk,. . . ) = b,. We arrive at a
h
e solvable word problems
89
presentation for d which has the form of a partial V-algebra 9. In doing this, the b,, b, corresponding to wl,w 2 may have been indentified, in which case we know that w1= w 2 in d.On the other hand, b,, b, may remain distinct in 8. Since 9 can be isomorphically embedded in a V-algebra, bi# b, in the V-algebra freely generated by 8 and so, isomorphically, w1# w 2 in d.For more details see [3], [4]. The above procedure solves the word problem for the varieties of (i) lattices (ii) quasigroups, loops and many subvarieties of these which arise in the study of combinatorial designs [17]. In a number of papers (see, for example, [13]) Gluhov and Gvaramija have given conditions on varieties of quasigroups (and of other algebras) which imply that any finite partial algebra in the variety can be finitely embedded. We can differentiate between the different kinds of embeddability of finite partial algebras in a variety V by the following conditions. El: Any finite partial V-algebra can be finitely embedded. E2:Any finite partial V-algebra can be embedded (although not necessarily in a finite V-algebra). E,: If a finite partial V-algebra can be embedded, then it can be embedded in a finite V-algebra. Clearly, El @ E2A E,. One of the simplest examples of a variety V with property E2but neither El nor E , is given by binary operations x y , x\y and the identity x . ( x \ y ) = y . For details, see [17]. The situation is clarified by the following theorem [17]. (See also [2], [7].)
Theorem. A variety V has the property E , if and only if every f.p. V-algebra is residually finite. An f.p. residually finite algebra has a solvable word problem. (See [17], although the result is apparently first due to Malcev.) We repeat briefly the simple argument. Let d be an f.p. residually finite Valgebra and let w l , w z be words in the generators of d.We enumerate all consequences of the defining relations of d.If w 1= w 2 in d,it will appear in this enumeration. We also enumerate the finite V-algebras which are homomorphic images of d.If w , # w 2 in d,then in one of these homomorphic images of d w 1 and w 2 will map onto distinct elements. Combining these enumerations, we have a procedure which, in a finite number of steps, will decide whether w , = w 2 in d. In the table below we list these embeddability properties for f.p. algebras in some familiar varieties. We note that the free algebras in all of the listed varieties other than inverse property loops and modular lattices are residually finite and hence have solvable word problem. It is not known whether free modular lattices are residually finite, i.e. have property E,, nor is this known for inverse property loops.
T. Evans
90
W.P.
Variety
E,
E,
E, solvable
Groups, rings semigroups, modular lattices
no
no
no
no
no
yes
Quasigroups, loops, lattices
Yes
yes
yes
Inverse property loops
?
yes
?
Non-associative rings
no
no
?
Abelian groups, commutative rings and semigroups
For commutative rings and semigroups, residual finiteness (or E,) at present provides the only procedure for solving the word problem for f.p. algebras. Problems. 1. Prove that free modular lattices are not residually finite. 2. Prove that f.p. non-associative rings are residually finite. 3. Prove that finite partial inverse property loops can be finitely embedded. (Inverse property loops are one of the few varieties of loops which have been studied in detail but for which the finite embeddability property has not been verified.) 82. Finite separability properties
An algebra d is residually finite if, for any two disjoint finite subsets SI, S 2 of d , there is a homomorphism of d onto a finite algebra such that the image of SI and Sz are disjoint. There are numerous generalizations of this concept. If x is an element and Y a subalgebra of d , then d has the finite separability property (abbreviated to f.s.p.) if, for any x fZ Y there is a homomorphism a onto a finite algebra in which x a fiiYa. If Y,, Y2are subalgebras, then d has the subalgebra property if for any such disjoint Y,, Y2,there is a finite separability Y2are disjoint. homomorphic image of d such that the images of Y,, If d is a group, then it has the conjugacy separability property if for
Some solvable word pmblems
91
any elements x, y not conjugate in d, there is a finite homomorphic image of d in which the images of x and y are not conjugate. The relationship of such finite separability properties to algorithms for solving corresponding decision problems is obvious. We have given one example earlier - residual finiteness implies solvability of the word problem - a similar proof shows that finite separability of an element and an f.g. subalgebra implies solvability of the generalized word problem. Free groups have the finite separability property for f.g. subgroups (M. Hall [14]), hence have solvable generalized word problem. So do free semigroups and free commutative semigroups (these are easy to show). In fact, free algebras in any variety defined by balanced identities (each variable in a defining identity u = u occurs exactly the same number of times in u as in u ) . Surprisingly, however, free rings do not have the finite separability property. Let 9 be the free ring on one generator x and let Y be the subring generated by 2x,2x2+ x. Then Y# 9, i.e. x E Y.However, in any homomorphism a of 9 onto a finite ring, x a E Ya.This result is due to K. Mandelberg [17]. The generalized word problem for free rings is apparently an open question. However, one can show, using methods similar to those of Mihailova [19] for groups, that the generalized word problem is unsolvable for F @ F, the direct sum of two copies of a free ring. Let S = (gl,gz, . . . ,g, ; u I = u,, u z = u2, u, = u,) be a semigroup with an unsolvable word problem and let F be the free ring generated by g,, gz,. . . , g,. In the direct sum F @ F, let S * be t h e subring generated by (u,, u,), i = 1,2,. . . , n and (g,,g#), i = 1,2,. . . , m. It is easy to prove that if u, u are monomials in F with coefficient one, then (u, u ) E S * if and only if u = u in S. It follows that t h e G.W.P. is unsolvable for F@E For the problem of deciding for a V-algebra d whether two f.g. sub-algebras have a non-empty intersection, we appeal to t h e subalgebra separability property. For quasigroups (and various subvarieties) we can use the fact that any finite partial quasigroup can be finitely embedded to show that t h e variety of quasigroups has the subalgebra separability property [17]. There is an interesting contrast for this property between free semigroups and free commutative semigroups. Let 9 be an f.g. free semigroup and YI, Y2be disjoint f.g. subsemigroups of 9.By a standard result in the theory of finite automata, Y,and Y2can each be expressed as the union of congruence classes of some congruence of finite index o n 9.(See, for example, [ll].) The intersection of these two congruences is a congruence 8 of finite index. In the quotient
92
T.Evans
semigroup S / O , the images of 9,and YZwill be disjoint. Now let sd be the free commutative semigroup generated by g , , gz and let 58,, 58, be the subsemigroups generated by {glgz,g , g : } and {gl,g:gz} respectively. Although 9 ,n 58z = 0 in d,in any homomorphism of .d onto a finite semigroup the images of 58, and BZintersect (Garrett [121). We conclude this section with a decision problem which is known to be solvable [16] but for which it would be interesting to know whether there is an algorithm based on a finite separability property. Let 3 be a group. If, for any x, y in % such that x and y are not conjugate in 3, there is a homomorphism a of 3 onto a finite group such that x a , ya are not conjugate in 3-, then 3 is conjugacy separable or residually finite with respect to conjugacy. Stebe [23] has shown that free groups are conjugacy separable. If we describe conjugacy separability in groups as finitely separating elements and conjugacy classes, it suggests the following generalization. Let JB be an algebra and 3 a group of automorphisms of d. Does d have the finite separability property with respect to orbits of 3? A particularly interesting case of this concerns free algebras. Let 9,b e an f.g. free algebra in'a variety V. Consider all homomorphisms of 9, onto n-generator finite algebras which are free in some subvariety of V. If an element x E 9,maps onto a primitive element in each such finite relatively free image, is x primitive in S,,? (Here, primitive means an element of some free generating set.) We are actually asking here whether 9,has the finite separability property with respect to the orbits of its automorphism group. No examples of varieties V appear to be known in which F . ( V ) does not have this property but on the other hand only a few rather trivial examples, such as semigroups and groupoids, are known of varieties in which the free algebras d o have this property. Motivation for t h e above questions comes from the following theorem [17]. Theorem. Let 9,be a residually finite free V-algebra which also has the f.s.p. If wl,w z , .. . ,w, in Snmap onto a free generating set for every homomorphism of 9,onto a finite relatively free V-algebra, then w I ,w z , .. . , w, is a free generating set of 9,. From this, we obtain by a familiar argument Theorem. If 9"is a free V-algebra on n generators which is residually finite and has the f.s.p., then there is an algorithm for deciding whether a set of n elements of 9,is a free generating set.
Some solvable word problems
93
This theorem applies to groups, semigroups, groupoids, quasigroups, loops, lattices. It is tempting to conjecture that if each f.p. algebra in a variety V is residually finite (so that it has a large number of finite homomorphic images) then any two f.p. V-algebras having the same homomorphic images are isomorphic. This is the case for abelian groups unfortunately no other non-trivial example of this situation is known. A result of Pickel [21] states that if H ( d ) is the set of finite homomorphic images of an f.g. nilpotent group d,then there are only a finite number of f.g. nilpotent groups 93 such that H(46)= H ( d ) . However, examples are known (Remeslennikov [22], Dyer [2]) of nonisomorphic f.g. nilpotent groups d,93 such that H ( d ) = H(93). Let us call an f.p. algebra in an f.p. variety Vfinirely derermined if the set of its finite homomorphic images determines it uniquely (to within isomorphism). A similar argument to that used to show residual finiteness implies solvability of the word problem shows that the isomorphism problem is solvable for f.p. V-algebras if every f.p. Valgebra is finitely determined. However, as we remarked above, few examples of this situation are known. We conjecture, o n rather flimsy evidence, that the varieties V of quasigroups with the property that every finite partial V-quasigroup can be finitely embedded d o have every f.p. algebra finitely determined. The variety of lattices is also another good candidate for this property. A curious connection between t h e properties we have been discussing is that if the isomorphism problem is solvable for f.p. algebras in a variety V and if every such algebra is hopfian, then the word problem is solvable for f.p. V-algebras. Let d be an f.p. Valgebra and u, u words in the generators of d. Let d,be the algebra obtained from d by adding the additional defining relation u = u. Since d is hopfian, u = u in d if and only if d and I, are isomorphic. Problems. 1. Solve the generalized word problem for free rings. 2. Prove that free lattices (or f.p. lattices) have the f.s.p. and the subalgebra separability properties. 3. Prove that w is a primitive element in an f.g. free group 9" if and only if w maps onto a primitive element in every homomorphism of 9" onto a relatively free finite group. Prove corresponding theorems for free loops and free quasigroups. 4. Let d,93 be two f.p. groupoids (or f.p. algebras in a variety V defined by the empty set of identities). Prove that if d, 46 have the same sets of finite homomorphic images, then d, 46 are isomorphic.
94
T. Evans
5. The same as Problem 4, for lattices. 6. It is known that residual finiteness of f.p. algebras in a variety V is not sufficient to prove t h e property described in 4 above for f.p. V-algebras. Find finite separability properties on V which d o imply this property.
93. Normal form theorems We recall the usual procedure for solving the word problem in a free group. Consider the set of all words in the generators and their inverses. We introduce contractions and expansions of words (deleting and inserting subwords g g - ’ , g ’ g ) and define two words to be equivalent if one can be transformed into t h e other by a finite sequence of expansions and contractions. It is easy to prove that every for which no equivalence class [ w ] contains a unique word contractions are possible, and that starting from any word in [ w ] and applying contractions, we eventually arrive at This word EJ is called the normal form (or reduced form) of any word in [ w ] .Clearly, when we have such a theorem we have a procedure for solving the word problem for F. There are numerous varieties to which we can apply this method or some variation of it. The general setting is as follows. Let d be an algebra (gl, g,, g,, . . . ; rl = r ; , r2 = r:, . . . ) in a variety V defined by identities u, ( x l , x z , x 3 , . . . ) = u:(xl,x2, x,, . . .), i = 1 , 2 , 3 , . . . . The elements of d are represented by words w ( g , ,g,, g,, . . . ) built up from t h e generators and operations of d.Two words u, u‘ are connected by an elementary transformation u W- u’ if one can be transformed into the other by an application of either a defining relation r, = r: of d or a defining identity u, = u : of V. Words u, u’ are equivalent u = u‘ if one can be transformed into the other by a finite sequence of elementary transformations. The equivalence classes [ w ] are the elements of d and the word problem for d consists of finding a method (or showing that one does not exist) which will decide when two words lie in the same equivalence class. We may represent the situation by a graph. The points correspond to words and connecting segments to elementary transformations. The maximal connected components of the graph are the classes of equivalent words. We turn the graph into a directed graph by classifying the elementary transformations as either contractions or expansions. We write u + u’ in the directed graph if we can get u’ from u by a contraction. We attach a “weight” to each point, usually
* *.
Some solvable word problems
95
called the length l ( u ) of the corresponding word. These weights can come from any well-ordered set although most frequently one uses the non-negative integers and we require that if u + u’ then I ( u ) > l(u’). Our aim in doing this is to have in each connecting component exactly one point of minimum weight such that any directed path from any other point in the component ends at this “sink”. This point will correspond to the normal form of every word in its equivalence class. There is a general theorem of M.H.A. Newman [20] which enables us to achieve this desired end once we have assigned directions and weights in the graph. We require the resulting graph to satisfy the following confluence property (sometimes called the Diamond Lemma). If u + u’, u + u” are any two contractions of u, then there is a point u”’ + u’”. such that there are directed paths u ’+ * * + u”’, up’+
-
I
J
If this condition is satisfied it is an easy matter, using induction on the length l ( u ) of u, to show that any sequence of contractions starting at a word u ends a unique normal form fi of u. A simple induction on the number of elementary transformations connecting two words w, w’ also enables us to show that words w, w‘ are equivalent iff they have the same normal form. Thus, to use this approach to solve the word problem for algebras in a variety V we have to find an appropriate length function on words and we have to characterize the contractions of words so that we can prove that the confluence condition is satisfied. Once this is done, we may decide whether two words in an algebra are equivalent by comparing their normal forms. Actually, what we have described is the bare bones of just one normal form approach t o solving the word problem. In putting flesh on the bones, we may have to (even in the most direct application of this method)
T. Evans
%
(i) rewrite the defining relations so that they are in some special form (or consider only algebras whose relations can be put in this special form) (ii) add redundant identities to the defining identities of V. Such adjustments (and others) may be necessary in order to obtain a confluence condition for the contractions. However, the above approach, with the refinements described, is sufficient to give a solution of the word problem for f.p. algebras in many different varieties. Some examples are given below. Example 1. If V is a variety not satisfying any laws, then the word problem for free V-algebras is trivial. For f.p. algebras in such a variety we rewrite the defining relations in the form f(gl, g,, g,, . . . ) = g k (see [3]). Now, with the obvious definitions of length and contraction, the confluence condition holds. Example 2. Let V be the variety of groupoids defined by the identity
x y . y z = y, for all x, y , z. Let F be a free V-algebra. If, using only the reduction x y . y z + y, we try to prove that the confluence condition holds, then we fail but we do discover that the identities x ( x y . z ) = x y and ( x . y z ) z = y z are consequences of x y .y z = y .
We now try to verify the confluence condition using the contractions
x y . y z + z, x ( x y . z)+ x y , ( x . y z ) z + y z where x , y, z are any words in
the generators. This time, we are successful. This example illustrates a situation which occurs quite often - the confluence condition (if it holds) “generates” its own proof. Example 3. Let V be the variety of quasigroups defined by three binary operations x y (multiplication), x \ y (left-division), x / y (rightdivision) and the identities x . ( x \ y ) = y , (XlY1.Y = x x \ ( x y ) = y, (XY )/Y = x .
Let d be an f.p. V-algebra generated by g,, g, g, . . . . In order to obtain the confluence property and hence, a normal form theorem, we first rewrite d in terms of generators a , , az,a s , .. . and defining relations of the form aiai = a k . Then we add the redundant identities x l ( y \ x ) = y, (YlX)\Y = x
to the defining identities of V and the redundant relations ai\ak = a,, ak/ai= ai for every defining relation aiai = ak.
Some solvable word problems
97
Now we use as contractions aia, + at, ai\ak+ ah at / a ,+ a , where aiaj = at is a defining relation and x ( x \ y ) + y, ( x / y ) y + x, x \ ( x y ) + y, ( x y ) / y + x , x / ( y \ x ) + y , ( y / x ) \ y + x , where x, y are any words in the
generators. The confluence condition holds and a normal form theorem results. See [ 5 ] for the use of this in studying the structure of f.p. algebras in various varieties of quasigroups and loops. Actually, our original description of how we solve the word problem for free groups is not an example of the normal form approach we have described - it illustrates a generalization. When, in a free group, we define a word to be a string of generators and their inverses such as glg;'g3g3g2', the associative law (and the identities ( x y ) - ' = y - l x - ' , (x-')-' = x ) are sneaking in through the back door. Formally, we should define a word to be, for example, ((glg;')g3)(g2g;')-'. In writing glg;'g3g3g;' as a "word" we are identifying all words which are equivalent by using group identities which preserve length. In other words, the confluence condition using xx-'+ 1, x - ' x + 1 as reductions, is being applied to equivalence classes of words, rather than individual words, each equivalence class consisting of all words connected by the elementary transformations ( x y ) z f;*x(yz), (x-')-'-x, (xy)-'f* y - ' x - ' . The general set-up is this. We have a variety V and a V-algebra d. With what seems to be the appropriate definition of length of a word, it may happen that some of the V-identities preserve length and so cannot be used to shorten a word. We proceed by first considering an equivalence relation on words using only these length-preserving identities. We define contractions so that if u + u' and u u, then there is a contraction u + u' where u ' - u ' . If this can be done, we have essentially defined contractions on ( )-equivalent classes of words. If we can now prove a confluence theorem for these contractions applied to the ( )-equivalence classes, then providing that we can recognize when words are ( - )-equivalent, we have a solution of the word problem. An example of this procedure is used in [9] where the length-preserving identities are essentially generalized commutative laws for the operations. We conclude this section with an example from group theory. The result is familiar - the normal form theorem for free products with an amalgamated subgroup - but the elementary transformations and length function used are not the familiar ones. Let A, B be groups with subgroups A', B1 which are isomorphic. We will identify A', Bl according to the isomorphism and regard A, B as groups sharing a common subgroup C = A n B. We construct a group A *=B, the generalized free product, as follows. The generators
-
-
-
-
T. Evans
98
of A *cB are the elements A U B and the defining relations are the multiplication tables of A and B. By a word, we will mean a finite sequence x I x 2 * . x , of elements of (A U B )- { e } ( e is the common neutral element of A and B). 1 denotes the empty word. By an elementary transformation of a word we mean either a contraction or its inverse, where contractivn is defined as follows. We begin by choosing fixed left coset representatives for the subgroup C in the groups A and B. W e take 1 as the coset representative of C and denote by x * the left coset representative of C in A or B to which x belongs. Hence, x = x * c for some c in C. Let w = x 1 x 2* x, where x , E ( A U B )- { e } . (i) If x, belongs to A - C or to B - C, then replacing x, by x t c where x , = x : c in A or B, is a contraction of w. (ii) If adjacent elements x,, x , + ~in w belong t o the same group A or B, and x,x,+l = x = x * c in that group, then replacing X , X , + ~ by x * c is a contraction of w. Thus, contractions have the effect of replacing elements of A - C and B - C by coset representatives and of shifting elements of C to the right. A reduced word in the free product of A and B with x I c where the x t are amalgamated subgroup C has the form x : x f coset representatives from A and B, and c is either empty or in C. We now introduce a length function with the right property. We define the empty word to have length (0, 0). If w = x I x 2 . . . x,, we assign as its length an ordered pair of non-negative integers (p,q). Each x , in C is given p-weight 1, each x , which is a coset representative is given p-weight 3, all other x, are given p-weight 5 . W e define p to be the sum of the p-weights. To each x, which is in C we assign a q-weight the number of x,’s in w which are coset representatives and lie to the right of x , in w. W e define w to be the sum of the q-weights of w. We order lengths lexicographically. It is a routine matter to check that the confluence condition holds and hence we obtain the usual normal form theorem that every element in the generalized free product can be written uniquely in the form x : x f - . . x t c . Despite the simplicity of the procedure described in this section, it is enormously versatile. There is an extensive survey for other applications of the Diamond Lemma in a forthcoming expository paper by G. Bergman. P. Hall’s expository paper [15] is a detailed study of an application of this technique. A number of examples are given in Knuth and Bendix [18].
-
Some solvable word ptvblems
99
Problems. 1. In the paper by Knuth and Bendix referred to above, there is a universal algebra version of the algorithm we have described for free algebras in a variety satisfying certain conditions. Can one extend the results of Knuth and Bendix to apply to varieties where some of the identities preserve length? 2. Is there a normal form theorem of the type we have described for free lattices? The usual solution of the word problem (Whitman [24]) involves explicit use of the order relation. The same question applies to f.p. lattices. 3. A version of the algorithm we have described for solving the word problem is used by the author (Trans. Amer. Math. SOC.1963) to solve the isomorphism problem for some varieties of loops and quasigroups. Is there a universal algebra version of this? References
RADCLIFFE
[l] B. Banaschewski and E. Nelson, On residual finiteness and finite embeddability, Algebra Universalis 2 (1972) 361-364. [2] J.L. Dyer, On the isomorphism problem for polycyclic groups, Math. Z. 112 (1%9) 145-153. [3] T. Evans, The word problem for abstract algebras, J. London Math. Soc. 26 (1951) 64-71. [4] T. Evans, Embeddability and the word problem, J. London Math. Soc. 28 (1953)
76-80.
[5] T. Evans, On multiplicative systems defined by generators and relations, I. Normal form theorems, Proc. Cambr. Phil. Soc. 47 (1951) 637-649. [6] T. Evans, The isomorphism problem for some classes of multiplicative systems, Trans. Amer. Math. Soc. 109 (1963) 303-312. [7] T. Evans, Residual finiteness and finite embeddability. A remark on a paper by Banaschewski and Nelson, Algebra Universalis 2 (1972) 397. [8] T. Evans, Identities and relations in commutative Moufang loops, J. of Algebra 31 (1974) 508-513. [9] T.Evans, A decision problem for transformations of trees, Can. J. Math. 15 (1%3) 584-590. [lo] T. Evans, Some connections between residual finiteness, finite embeddability and the word problem, J. London Math. Soc. (2). 1 (1969) 399-403. [ I l l E. Engeler, Introduction to the Theory of Computation (Academic Press, New York, 1973). [12] R. Garrett, Some problems concerning residual finiteness in algebras, Ph.D. Thesis, Emory University (in preparation). [I31 A.A. Gvaramija and M.M. Gluhov, A solution of the fundamental algorithmic problems in certain classes of quasigroups with identities, Sibirsk. Mat. Z. 10 (1969) 297-317 (Russian). English translation; Siberian Math. J. 10 (1969) 211-224. [I41 M. Hall, Jr., Coset representations in free groups, Trans. Amer. Math. Soc. 67 (1949) 421432. [IS] P. Hall, Some word-problems, J. London Math. Soc. 33 (1958) 4824%.
100
T. Eoans
P.J. Higgins and R.C. Lyndon, Equivalence of elements under automorphisms of a free group, Mimeographed Notes. Queen Mary College, London. C. Lindner and T. Evans, Finite embedding theorems for partial designs and algebras (Les Presses de L’Universite de Montreal, 1977). 1181 D. Knuth and P. Bendix, Simple word problems in universal algebras, Proc. Conf. o n Computational Problems in Abstract Algebra, Oxford. 1967. 1191 K.A. Mihailova, The occurrence problem for direct products of groups, Dokl. Akad. Nauk SSSR 119 (1958) 1103-1105 (Russian). [20] M.H.A. Newman, On theories with a combinational definition of “equivalence”, Ann. Math. 43 (1942) 223-243. [21] P.F. Pickel, Finitely generated nilpotent groups with isomorphic finite quotients, Trans. Amer. Math. Soc. 160 (1971) 327-341. [22] V.N. Remeslennikov, Finite approximability of groups with respect to conjugacy, Sibirsk. Mat. Z. 12 (1971) 1085-1099. [23] P.F. Stebe, A residual property of certain groups, Proc. Amer. Math. SOC.26 (1970) 37-42. [24] P. Whitman, Free lattices, Ann. Math. 42 (1941) 325-330.
S.I. Adian, W.W. Boone, G. Higman, eds., Word Problems I1 @ North-Holland Publishing Company (1980) 101-139
SOLUTION OF THE CONJUGACY PROBLEM IN CERTAIN ARITHMETIC GROUPS Fritz J. GRUNEWALD University of Bielefeld (FRG)
The purpose of this paper is to give an algorithm which decides for two matrices T, f E GL, (Q) whether there exists a matrix X E GL, (Z) such that: XTX-'=
f'
This is problem 22 of the problem section of [l]. The algorithms we give can all be transformed into quite practical computer programs. They have as a general feature that one can always easily compute a bound for the number of steps necessary to complete the computation. To explain our results we make the definitions: Definition. Let KI C K 2 be two commutative rings with 1 . For T E GL, ( K z ) define:
1
%,,(T)= { X E GL, (KJ XTX-'
=
T}
GL,(K,) being the group of invertible n x n matrices with entries in the ring K , . n is a natural number. We of course consider GL,(KI) as being contained in GL,(Kz). For T, f € GL,(K2) we write T
--K,
if and only if there is a X E GL, ( K , ) with XTX-'=
f.
We write SL.(Kz) for the group of n x n matrices with entries in K Z of determinant 1. We prove: Theorem A. Given two matrices, T, f E GL, (Q), there is an algorithm for deciding whether : 101
102
EJ. Grunewald
T-zf
If the answer is “yes” the algorithm constructs a conjugating matrix. Another algorithm does the following:
Theorem B. Given two matrices T, f E GL. (Q), there is an algorithm for deciding whether there is a matrix X E SL. (2) such that
XTX-’= If the answer is ‘‘yes’’ the algorithm produces such an X . We get Theorem B from Theorem A together with the following:
Theorem C. Given T E GL.(Q), there is an algorithm for computing a finite set of generators of Cez(T). q Z ( T )is an arithmetic group in the sense of [3]; a main theorem of this paper says that q Z ( T )will be finitely generated. As generalizations we mention some results on general and special linear groups over rings of algebraic integers. Let K be an algebraic number field, that is a finite extension of the field of rational numbers. We assume that K is given by a field generator 8 together with its minimal polynomial over Q. Let OK be the ring of algebraic integers in K. By algorithm A6 listed in the last section of the paper it is possible to compute a Z-basis for OK together with a multiplication table for it. Almost a word for word translation (using OK instead of Z) gives now: Theorem A’. Given T, f E GL. (K), there is an algorithm for deciding whether: T-oKf If the answer is ‘‘yes’’ the algorithm computes a conjugating matrix. A little more involved is:
Theorem B’. Given T,f E GI, (K), there is an algorithm for deciding whether there is a matrix X E SL. (OK)such that: XTX-’ =
f.
If the answer is ‘‘yes’’ such an X may be computed.
n t e conjugacy problem in arithmetic groups
103
Theorem B’ follows from: Theorem C‘. Given T E GL,(K), there is an algorithm for computing a (T). finite set of generators of gOK The groups GL, (OK),SL, (OK),goK(T) are again arithmetic groups in the sense of [3] as may be seen by “reduction to the ground field” in [24] Chapter I. Hence they are all finitely presented by [3], [3a]. Our algorithm therefore has some group-theoretic meaning. Presentations of some of them are computed in [21], [13], [14]. A more thorough look into the methods used here even leads to an effective algorithm for constructing a finite presentation for the g o K ( T ) for a matrix T E GL, (K). As an application to another type of arithmetic group we mention: Corollary A. Given T, -i. E PGL, (K), there is an algorithm for deciding whether there is an X E PGL, (OK)with
XTX-’ =
f
in PGL, ( K )
and similarly : Corollary B. Given T, -i. E PGL,(K), there is an algorithm for deciding whether there is an X E PSL. ( O K ) with :
XTX-’=
-i.
in PGL,(K).
Our method allows one to deduce Theorems A-C for Z replaced by an order in a (explicitly given) ring of algebraic integers or for an S-arithmetic ring if S is an explicitly given finite set of primes of the algebraic number field in question. The proof again is an almost word for word translation. It is also clear that Theorems A’-C‘ decide the conjugacy problem in every subgroup of finite index in GL.(0) or SL”( 0 ) . As a further generalization we mention the possibility of applying a quite similar procedure to Sp2,(Z) or to O,U, n ) where f is an ndimensional integral quadratic form, or to the group of matrices having entries in an order in a (explicitly given) finite dimensional skew field over Q, whose reduced norms lie in the unit group of an order in the center of the skew field. Details will appear elsewhere. The particularly nice group-theoretic structure of GL2(Z) and SL(Z) as amalgamated products of finite groups gives a more group-theoretic procedure for the decision problems in Theorems A and B (for 7’’ f € GL(Z)!). For this see for example [12], [13], [27], and for generalizations [ 111.
F.J. Grunewald
104
We have not tried to write out very effective algorithms, a lot of them depend highly exponentially on the data. But for dimensions 2 and 3 it is possible to modify the procedures (because there are not SO many unipotent Jordan matrices) to actually obtain not too inefficient computer programs. In the first part of the paper we give a number of effective reduction steps which will reduce our algorithm to a couple of algorithms known from algebraic number theory. These will be listed at the end of the paper. I wrote this paper while doing a project on quite similar questions with Daniel Segal. I thank him very much for his advice. I also thank Professor J. Mennicke for help with an argument.* A. Reductions Before beginning, I want to mention that we often use a form of Gauss’ algorithm: Given a finitely generated free abelian group N with Z-basis n,,. . . ,n,. Let { f l , . . . ,fs} and { g , , . . . , g l } be two sets of elements in N , given in terms of t h e basis n l , . . ., n,. Let:
. ,fs}> Nz = ({gi,. . . ,g o ) Ni = ( { f i r . .
be the subgroups generated by these two sets. It is then effectively possible to compute the index:
1 NI: (NIn N2)I and if it is finite to give a set of coset representatives for: N , I N ~n N ,
For this see e.g. [ 5 ) . We will not always mention the use of this algorithm. We add some further definitions: If R is a ring and E a subset of R we put ( E ) ~for the ideal generated by E. We often leave the subscript R.
* (Added September 1977): After this paper was submitted in May 1977, Professor Adian informed me in September 1977 that similar results have been independently obtained by A. Sarkisjan.
105
77ae conjugacy problem in arithmetic groups
Step I Let n be a natural number. We are given two n x n matrices T, f with integer entries and both of determinant # 0. We hand these to algorithm A, which returns:
f=g+o
T=S+U
where S, S are rational semisimple matrices and U, nilpotent matrices. Furthermore we have:
S U = US
and
0
are rational
S f i = I%.
These decompositions are clearly unique. We now find a scalar matrix k . E n where
En =
(1
:)
* a .
is the n-dimensional unit matrix and where 0 # k E Z, such that:
kE, . S, kE. . U E M,, (Z) kE,,S, kEnU E M, (Z). Put now:
TI = kE,, T
fl= kE, f
S , = kE. * S
S,=
UI= kEnU
U ,= kE,o
1
*
kE,
*S
We find then:
Lemma 1. (1) fa ~ , - ~ f , . ( 2 ) S , , 3, are semisimple with integer entries. ( 3 ) U , , ir, are nilpotent with integer entries. (4) TI= S1+ U , ; 7 3, + 0,. ( 5 ) S , U , = U , S , ; s,u,=U$,.
TI
These statements are obvious. Because of the uniqueness of t h e decomposition T = S + U we get:
Lemma 2. TI- z f,e (1) 3X E GL. (Z) with XSIX-'= 3, and ( 2 ) 3Y E %'z(S,) with Y X U I X - ' Y - '= 0,. So, to decide the conjugacy problem we are essentially left with the two tasks:
106
F.J. Grunewald
I. Given two semisimple matrices S , , 3, with integer entries and nonzero determinant, decide whether S I and if t h e answer is "yes" to produce a conjugating matrix X . 11. Given a semisimple matrix 3, with integer entries and two nilpotent matrices U , , U z with integer entries and:
-zsl
s,u,= u,s,; s,u2= uzs, decide whether there is X E %,(s,) with XUIX-' =
u2.
We now define for any n x n matrix T with integer entries a Z[x,,]module, which as a Z-module will be isomorphic to Z". The operation of xo o n Z" is defined by: XoU
=
Tv'
for any vector v E Z". Here u' is t h e transpose of v. We call this module Z [ T ] . We obviously have:
Lemma 3. Let T I ,TIE Mn(Z), then TI modules.
--z
Z[ TI] Z[ TI] as Z [ X O ] -
In Step I1 we shall reduce our problems I and I1 and the problem in Theorem C to t h e following two questions: Let K be an algebraic number field and O C OK an order in t h e ring of integers of K (i.e. 0 is a subring of finite index in OK). Ia. Let M, M be two explicitly given Z-torsion free, finitely generated
O[~ ] / ( ~ t , - m o d u ldecide e s , whether: M=M
as O[y]/cy~-rnodules. IIa. Given a Z-basis for M --. Z", compute a Cfinite) set of generators for the group of O[y]/,,~,-module automophisms, viewed as a subgroup of GLn (Z). Step 11.
This step gives reductions of our problems to more number-theoretic questions, which are then dealt with in Step I11 and Step IIIa. We first compute the minimal polynomials min(SI) and min(3,) by algorithm A*. Both will be monic and will have integer coefficients. We assume that:
m e conjugacy problem in arirhmeric groups
107
P ( X )= min (sI) = min (3,). Otherwise t h e answer to our original question would be “no”. We now split P ( x ) by algorithm A,,: P ( x )= P l ( X ) . * .P , ( x ) ,
(1)
P , ( x ) being Q-irreducible, monic with integer coefficients for i = 1,. . . ,r. O u r matrices being semisimple, the P, will all be distinct. Put:
n
Pi@)= P / ( x ) jZi
for i = 1,. . . , r. The polynomials p,(x) are coprime, so algorithm A1 will give us polynomials Ql, . . . ,Q, with integer coefficients and a natural number A such that:
QIPl+ . . . + Q,P, = A
(2)
We now define:
R = Z [x ]/(rw)) where ( P ( x ) ) denotes t h e ideal generated by P ( x ) . The ring R decomposes: R
(3)
Z [ X ] / ( P , , ,X, , * . . X Z[x]/(rAx))
and we put 0, = Z [ x ] / , , , , ) for i = 1,. . ., r. The polynomials P I , .. .,P, being Q-irreducible, monic with integer coefficients, the rings 0, are rings of algebraic integers. We then find a natural number 1 such that:
ul = 0:= 0. We define on Z [ T , ] resp. Z [ f , ] now R[y]/(,~)-modulestructures by: xv = S l V ’ yv
=
u l v ‘ for all v E z [ P , ]
with t h e corresponding definition for Z [ f,]. By Lemma 1 this definition , Z[T,]. leads to a ring operation of R [ y ] / ( , i on Putting everything so far together we get:
Lemma 4. The following two statements are equivalent: (i) Z[ T I ]= Z [ PI] as Z[xo]-modules (ii) Z[ T I ]= Z [ PI] as R [y]/(,?-modules. We now decompose ZITl] and Z [ F , ] according to the decomposition (2). For this put:
F.J. Grunewald
108
M, = p, ( x ) . Z[ TI]
M,= F,(X).Z[Ti]. The M, and 9, are a variant of Gauss' and M,.Equally we on these new bases.
clearly R[y]/(,~,-submodulesof ZITI] and Z['fl]. By algorithm we are able to find Z-bases in both M, find a representation for the operations of x and y Put then:
+ M, C Z [ T i ] M = M I+ * +M,C Z [ T 1 ] M = Mi i - .. * * .
Lemma 5. (i) M, 6 f are R [y]/,,~,-submodulesof Z[T,] resp. Z[ TI]. (ii) M,, M,are annihilated by P,(x) for i = 1,. . ., r. (iii) 1 Z[ TI]/M 1 S A " and 1 Z[ 'f.,]/~, 1 S A ". (iv) M, n (C,,,M,) = (O), n (C,,, = (0). So M and A? are direct sums of their submodules M, and d,. (v) M, is a characteristic submodule of M for i = 1,. . . , r (i.e. normalized by any automorphism of M ) .
a
G)
Proof. Lemma 5 follows from our decomposition (2) and t h e fact that the polynomials P, (x). are distinct and Q-irreducible. We may consider then M,, &Ij as explicitly given O,[y]/(,~,-modules. Furthermore M and M are given explicitly as R [y]/(,~)-modules.
Lemma 6. Let cp : Z[ TI]+ Z[ TI] be a n R [y]/(,l)-module isomorphism then : cp(M)=
M.
Proof. For any u E ZITI] we have: cp
( R (x)
*
0 )=
Po( X I . cp (u ),
hence cp even induces R[y]/(,~)-rnoduleisomorphisms of the M, and
A?,.
We now first decide whether the nicer R[y]/(,~)-modulesM and $2 are isomorphic. The following is clear:
Lemma 7. The following two statements are equivalent: (i) M = h a s R [y]l(y~)-modules. (ii) Mi = hia s Oi [y]/,,~,-modulesfor all i = 1,. . . ,r.
7he conjugacy problem in arithmetic groups
109
We now go with our modules M,, together with their rings O , [ Y ] / ( ~toI , Step I11 of this paper. We are supplied there with explicit 0, module isomorphisms: f o r i = 1 , . . . , r.
~,:M,+M,
By our lemmata so far we are able to construct an R[y]/(,~,-module isomorphism: y:M+M by putting the information about t h e y, together. If t h e application of Step 111 to a pair of modules M,, M, yields that these are not isomorphic as 0,(y]/,y~,-modules,then our matrices T, f will certainly not be integrally conjugate. We put now:
fi = A . Z [ f l ]
N = y(AZ[Ti])
We note that: and
A Z [ f l ]= k
&f
Y(AZ[TI])Cy ( M ) = A?.
N and N may be viewed as given by Z-bases in terms of a fixed Z-basis of Id. Lemma 8. The following statements are equivalent: (i) Z[ T I ]= Z[ f,]a s R [y]/(yymodules. (ii) 3 a n R [y]/(,l,-module automorphism K : M + M such that K(N) = N. Proof. Assume we have an R [y]/(,~,-moduleisomorphism cp
:z[Tl]+Z[fl]
then q ( M ) = M has already been observed. Furthermore we have: cp ( A
Z[ TI])= A Z[ ?I].
We take K = cpy-' Then K is a R[y]/,,+-module automorphism of M with K ( N )= fi. To prove the reverse direction we put cp = KY.Then cp : M + A is a R [ y ] / ( y ~ ~ - m ~isomorphism d~le with cpy-I(N) = fi this means: cp(AZITI])= AZ[fl].
110
F.J. Grunewald
We define:
(4)
cp0(u)= h - l c p ( W
for all LJ E ZIT1]. cp,, is evidently an R[y]/(,~,-moduleisomorphism of Z[T,] and Z [ f , ] . The formula (4) even gives a recipe of how to compute cptr from an explicitly given K. We are now left with our nice R[y]/,,l,-module and two explicitly given abelian subgroups N and both of finite index in and we have to decide whether there is a R[y]/(,~,-module automorphism: ( 5 ) with
K
:M-*M
K(N)= N.
To do this and to prove Theorem C, C' we first construct a set of generators for the group of R[y]/(,l,-module automorphisms f of A?. Our module has a chosen Z-basis, this enables us to view f as a subgroup of GL,(Z). We will give finitely many matrices for t h e generators of f. Let f, be the group of 0,[y]/(,~,-module automorphisms of 0,. By our previous lemmata we know:
M=@M, i=l
and each M gis a characteristic R[y]/~,~,-submodule of M. So:
f =f
x
' ' '
x
f,
But from Step IIIa we know a finite set of generators for each individual f,. We then compute the index:
IM:(Nn
= hz
Our Z-basis of M gives us an explicit isomorphism: M/AzM
(z/A*Z)n
and a homomorphism:
f
+
GLn (Z/A2z)
Let f ( h J be the image and f[Az] be the kernel of this homomorphism. We know a set of generators of f ( A z ) in GL,,(Z/,,=) and we want to list all elements of f ( h z ) . We may assume to be in
111
7'he conjugacy problem in arithmetic groups
possession of the order ( S (A;)!) GL, (Z/A2z). We then employ:
and a multiplication table of
Lemma 9. Let G be a finite group of order m , and H c G generated by yl, . . . ys. One can find all elements of H amongst the products of length c m formed from the y l , . . . , y s .
.
Proof. To compute t h e length one adds up t h e number of occurrences of t h e generators y,. We claim that each element of H has an expression of length s m in terms of t h e generators y l , . . . , y.. This is so because any word of length > m in the y, may be shortened in G ; for among the first m + 1 initial segments of such a word, at I'east two must take the same value in G. To finish off o u r decision problem (5) put Obviously we get: of N resp. A in (Z/A2Z)n.
fl and A
for t h e images
Lemma 10. The following statements are equivalent: (i) There is an R [ ~ ] / ( ~ t , - m o dauromorphism ule
with
K1M-M
K(N)= A
(ii) There is an element
with
y Ef(Az)
y ( N ) = N.
Being in possession of all elements of f ( A z ) we may decide (ii) of this lemma. Any preimage of a y satisfying (ii) is then suitable as a K in (i). The preimage is found as a word in the generators of f. To prove theorems C, C' we still have to find a set of generators for the group of R[y]/(,+module automorphisms of Z [ f , ] which comes by our lemmata to the same as giving a set of generators for Vz(Fl) = Vz(F). By our formula (4) in Lemma 8 this comes to the same as giving a finite set of generators for the group of R[y]/(,)I module automorphisms of Id which leave N invariant. We call this group f,,CP. Again we want matrices in GL,(Z)Z P as generators for Po = {& E P &(& = lir}. By our previous argument we find all elements E l , . . .,E, E P ( h J which satisfy:
I
F.J. Grunewald
112
E(N) = N. . . . , E , is a set of preimages in f for them. The ei can be found as words in the generators of r'. Secondly we can effectively solve the word problem in f ( A 2 ) by reduction modA2. This enables us to define a set of coset representatives in terms of the generators of r' for the quotient: together with an effective coset representative function. Algorithm A, will then construct a finite set of generators vl,. . ., v. of f[A2] as words in the generators of r'. We then note that: f[Az]
C fo
and find: Lemma 11.
Po is generated
by
el,
.: . , E ~ 7, 7 1 , . . . vS.
We have proved now theorems A, C modulo the following Steps I11 and IIIa.
Step I11 Let P ( x ) be a monic, Q-irreducible polynomial with integer coefficient?, of degree m. Put:
0 = Z[X]/(P(l)) Let 1 be a natural number. Furthermore let MI, fit be two 0 [ y ] / ( , 1 , modules which are Z-free of rank n. This step decides whether MI= fi,as B[y]/~,~,-modules. If the answer is "yes" an isomorphism with respect to a given pair of Z-bases in MI, fit is exhibited. is a ring of algebraic integers in an In this situation 0 = Z[X]/(~(,), algebraic number field K = Q [ X ] / ( ~ (which ~ ) ) is the quotient field of 6. Our fixed Z-bases in MI and n"ll enable us to define K-vector spaces QMI and Q d 1 . Both QM1 and Q f i l carry an induced K[y]/(,,structure. We are going to investigate certain nice subgroups of MI resp. fit;for this we mak,e the following definition.
Definition. Let R I be a ring (commutative with 1). An R,[y]/(,l,-module where the di are M is called standard of rype { d o , .. . , nonnegative integers; if:
The conjugacy problem in arithmetic groups
113
as R,[y]/,,,-modules. We put in short M = R l { d o , . . , d I - , } .For an Rl[y]/(,~-moduleM, we put M ' = {kernel of the endomorphism y' of M}, with the convention M" = (0). We mention now the following easy characterization of standard modules:
Lemma 12. Let M be an Rl[y]/(,~-module.The following two conditions are equivalent : (i) M is standard. (ii) is finitely generated free as R,-module for 1 Q i Q 1. M'IyM1+l+~l-l
Proof. (i) j (ii) is obvious. (ii) j (i). For each i there is a free R1-submodule Fn of M, of rank r ( i ) say, such that M ' = (yM'+'+ MI-') @ F,.
In particular, M' for each i,
=
F', @ MI-', and an inductive argument shows that
M'= E@yE+,~...@y'-'F',@M'-'. Therefore by a trivial induction:
@j y'E.
M' = &I ,=I
But
y'E = 0.
,=o
y'E = (Rl[y]/(yL))'('), as Rl[y]/(,~)-modulesince E C M ' implies
Thus M is standard of type { r ( l ) ,. . . ,r(1)) as claimed. It is also clear that explicit bases for the free R1-modules M ' / ( y M ' " @ MI-') can effectively be lifted to R1-bases of the-F,'s, and thus yield an explicit isomorphism of M onto the standard module of the correct type. We now go back to our original situation. Let OK be the full ring of algebraic integers in K. Then 6 C OK and algorithm As computes a Z-basis { w l , . . . ,on}for OK together with the index: f =(OK
:61.
F.1. Grune wald
114
We have MI C Q M , and Q M , carries a K-vector space structure. Hence it makes sense to say of an abelian subgroup N of MI that it is OK-stable, i.e., OKNC N. We now make the following definition. Definition. An abelian subgroup N of MI is said to be good of level A, A being a natural number, iff: (i) ( M I : N I S A . (ii) N is OK-stable. (iii) N is y-stable. (iv) N is a standard OK[y]/(yl,-m~dule. Lemma 13. It is effectively possible to compute a natural number A, such that the set of good subgroups of level A, in MI is not empty. Proof. From linear algebra we know that any K[y]/,,~,-moduleis of standard type. By a simple algorithm based on Lemma 12 we are able to pick a K-basis inside MI for t h e standard K[y]/,,ymodule Q M , . So we have: (6)
O{dn,. . . ,dr-i} C Mi C QMi = K { d o , .. . , dr-il
if {d,,, . . . ,d r - , } is t h e type of t h e K[y]/(,~-moduleQ M , . By Gauss' method we compute the index A., of the copy of O{do,. . . ,dr-,} in MI. We then multiply our previous K-basis by A;' an.d get:
(7)
MI C B{do, . . . , dr-I} C O K { d o , . . . ,dr-I} C QMi
=
K { d o , . . ., d1-11
: MI1" . t". Here n is the Z-rank of MI Put now: A, = I O{do,.. . . , and the index 1 O{do, . . . . , d r - , }: M I ]is supposed to be computed from
(7)-
It would have been possible to define t h e notion of good submodules and to find a As without referring to t h e K[y]/(,i-module structure of QMI and an embedding (7). For this one must use Steinitz's theory in [18], [19]. This would have been more natural, but more complicated. Lemma 14. There are no more than 2(A3)"good subgroups of level A s in MI (or MI). It is effectively possible to make a list of all of them: {NI,.. . ,Ns},
each one together with an explicit OK[y]/(,~,-module isomorphism :
. . .,d f - l } .
yi : Ni + OK{do,
{ d o , .. .,d r W lis} here the type of the K[y)/(,i-moduleQMI.
77ae conjugacy problem in arithmetic groups
115
Proof. The first claim follows from the fact that for any good subgroup N of level A, in MI we have: AsM1 S N.
There order of 1 MI : A,M,( is A;. We then list all abelian subgroups of index S A,. These will be only finitely many, again less than 2(*3)".Let No be one of them, given by a Z-basis in terms of our Z-basis of MI. To test whether N o is OK stable it is enough to check whether:
wA"C No for our Z-basis of OK. The same method decides whether No is y-stable. We end up with a finite list of OK[y]-submodulesof MI of index A,. Each one is (say) given by a Z-basis and the operation of OK[y]on it is given in terms of this Z-basis: Let N be one of them. By algorithm A we compute Z-bases for t h e groups N ' , i = 0 , . . .,1. So we get a set of generators and relations for the groups
N' l y N '
+
1 +N'- 1,
The OK-module structure on them is given by a multiplication table for the w,, i = 1 , . . .,m, our Z-basis of OK. We hand these objects to algorithm Als. It decides whether the
N'/yN'+l+N' 1
are free OK-modules. If they are free we get an OK-basis for each of them. The second part of the proof of Lemma 12 shows how to put this information together to obtain an explicit OK module isomorphism y : N + O K {do,. . .,&1}. Let now { N I , .. . , Nsl}and {fil,.. . ,fi%} be the sets of good subgroups of level A, of MI and fhI.y,, are t h e OK[y]/(y~,-module isomorphisms +#
and
y : Nt
oK{dO, *
9, : fi,
OK
*.
df-I}
{do, . . . , df-l}.
We may clearly assume that the types of the K[y]/(y~)-m~dules QM1 and Q 2 , are the same, otherwise t h e answer to our question would be "no". For each i = 1, ..., s1 let: At
=
A3MI C N ,
and correspondingly for i
=
1,. . . , sz put:
116
F.J. Gnrnewald
di = A 3 M l C fii. The A, resp. di are subgroups of
Ni resp.
N,of finite (known) index.
Lemma 15. The following two statements are equivalent: (i) There is an O[y]/,,,-module isomorphism : y : M1-,MI.
(ii) The set of good subgroups of M Iof level A3 is not empty. There are i , j with 1 6 i 6 sI and 1 6 j 6 s2 and an OK[y]l(,~-module isomorphism : ~:N,+fi
with K
(Ai) = d,.
Proof. (i) j (ii). Let y : MI+ dlbe an 0-module isomorphism. Let N C MI be an OK[y]/,,~-submoduleof MI. We claim that Y :N+Y(N) is an OK-module isomorphism. For this let w E OK and u E N. We put:
oK.MI GQM,.
w = w .y(u)- y ( w u ) ~
Because
toKC 0 we get: tw = ( t w ) y ( u ) - y ( ( t w ) u ) = 0.
Hence w = O . Put now: fii
= y(Ni).
By the above remark that:
fi, is a good
subgroup of
MI. It
is then clear
~ ( A M I =) Asy(Mi) = A M I . (ii) j(i). Let define:
K
: Ni -P
fi, be the OK-module isomorphism.
We
Y ( V )= A ; ' K ( A ~ U )
for u E MI. By our assumption we have: K ( A ~ u= ) A3w
for a w E MI.Hence y defines an isomorphism of abelian groups between MI and MI. y is clearly compatible with the B[y]/,,~,-module structures on both sides.
m e conjugacy pmblem in arithmetic groups
117
Having our lists of good subgroups
{N, ,..., N,,} and {Al,..., &J in MI and dltogether with the isomorphism yi, f i we end up with the following decision problem: (8) Let A, d^ be two abelian subgroups of OK{do,. .., & I } of finite index, given in terms of a Z-basis of &{do,. . ., decide whether there is an OK[y]/(,~-moduleautomorphism: with
Y : oK{do,. . . ,di-i}+ o ~ { d o ., .. ,d1-J y ( A ) = d.
The decision procedure we shall describe will certainly exhibit such an automorphism, if there is any. We write r = ToK{do,. . . , d r - l }for the group of OK[y]/(,~-module By extending our Z-basis of O K automorphisms of M = OK{&, . . ., we think of r as a subgroup of GLn(Z). We are first going to construct a finite set of generators for r by giving certain matrices in GLn (Z). For each i between 1 and 1 we find an BK-freesubmodule F, of M of OK rank di such that: Mi = (,Mi+'
+ M i - ' )@ F,
1 s i s 1.
We put: I
n;i=@ECM i=l
We obviously have:
&!@.
n;i
..@
&-I
as OK-modules. Now let a E M. Then there is a unique set of elements Po,.. . ., PI-]E such that: (9)
Q!
=
2 Y'(P")
"=-0
under the convention y o = id,. Let f = GL,(OK) x * x GL4-,(OK).l= operates naturally on &f. Let y € f and a E M . Put then:
118
EJ. Grunewald
using expression (9). Elementary arguments show that now y is an OK[y]/cyi-m~dule automorphism of M. In fact we have defined a group monomorphism: Q1:f-r
W e shall identify f with its image in r. Any OK[y]/cyg-moduleautomorphism y of M induces an OK-module automorphism of each: Mi/(yM'+l+M1-l)
Let C be the subgroup of on all of these quotients. Furthermore put:
r which
induces the identity automorphism
I
fi = @ Homo, (E,(yM"' + MI-')). ,=I
We now define a map rpz of fi t o C. For this let f E HomoK(E,(,MI+' + MI-')) and a E M, we put: rpz(f)(a)
=
3 Y '(P.)+ 3 Y
"-0
"-0
' ( f ( P Y ) )
using the expansion (9) for a. By f ( & ) we ith projection of Py E lii. This means rpz(f) O n e easily sees that the so defined rpzcf) is automorphism of M. If 1 > 1 then rpz is not But we have:
understand f applied to the centralizes all F, with i # j. an OK[y]/cy~-module a group homomorphism.
Lemma 16. (i) rpz is a surjective map from H to C. (ii) Let h , , . . . , h, be a set of generators for the abelian group H, then the rp2(hl),. . . ,rpz(hS)generate C. (iii) is the semidirect product of C by f i f f is assumed to be contained in r via rp,.
r
Proof. (i) Any OK[y]/cyi)-moduleautomorphism of M is determined by its values on A? C M ; if y E C then for P E E ,
v(P)=P+At(P) with A, ( P ) E (yM'+' + MI-'). A, is clearly in Hom, (F, (yM'+' + M'-')), and y = rp2(AI,. . . , A , ) . of
(ii) Let AUto,(M'/Me-k-f)be the group of OK-modules automorphisms M ' / M ~ - kfor - ~ i = 2,. . . , I ; k = 1,. . . , i - 1. Each element of C
7he conjugacy problem in arithmetic groups
119
induces an element of this group. One easily sees that the composition map: C + AutoK( M i / , w - k - l ) Homo,(E, M i - k )
4
Homo, (E,(yMi+'+ Mi-')) for each k = 1,.. . i - 1 is a group homomorphism. (i) together with an
obvious induction under the assumption that the automorphism in question already centralizes all F, with j > i, finishes then (ii). (iii) is obvious.
Both E and (yMi+'+ M i - ' ) are free OK-modules. It is then easy to describe a finite set of generators for the group fi effectively. By algorithm AI9we may construct a finite set of generators for f. The maps cpl, cp2 are then explicit enough to deduce together with Lemma 16 a set of generating matrices for f in GL,(Z). We now go back to our decision problem (8): The first thing is to determine the index: As=(OK{do7...,dr-1}:(dnd)l.
Our Z-basis of OK{ d o , .. . ,dl-l}gives us an explicit reduction isomorphism: OK{& .. .
9
df-l}/AlO,( ~ , . . . d. ,.- , )
2,(z/AsZ)"
of abelian groups. Let T(A,) the image of f in GL,(Z/,,,) via this homomorphism. Because we know a finite set of generators for f we know a set of generators for f ( A , ) . We may assume to be in possession of a multiplication table for the finite group GL, (Z/+). The order of this group is certainly less than (A:)! So by an_ argument used resp. d be the images before we may list all elements of f ( h s ) . Let We then obviously have: of A or d^ in (Z/AsZ)n.
a
Lemma 17. The following statements are equivalent: (i) There is an OK[ y ] / , , ~ - m o d u automorphism fe : ~ ~ K ~ d O 7 ~ ~ ~ 7 ~ f - l } ~ ~ K { d O ~ ~ ~ ~ ~ d f - l }
with
y ( A ) = d^
(ii) There is an element K
E f(A5)
-
with K
(a)= A.
C GL. (UA+)
120
F.J. Grunewald
By the above remarks we know all the elements of f ( A , ) . There are not many things which are then easier than to decide the second condition of Lemma 17. Having found a K satisfying this condition, any preimage of K in f will then serve as y. y may be found as some expression in t h e generators of r. Step IIIa Let P ( x ) be a monic, Q-irreducible polynomial of degree m with integer coefficients. Put:
6 = Z[X]/,P,,)) Let 1 be a natural number. Let M i be an O[y]/,,l,-module which is Z-free of rank n, with a fixed Z-basis. Let f l be the group of B[y]/,,l,-module automorphisms of M I . The Z-basis in MI allows us to view f I as a subgroup of GL. (Z). Step IIIa gives a finite set of generators for f lC GL. (Z). We use t h e terminology of Step 111. We first compute a natural number A ? such that t h e set of good subgroups of level A 3 in M2 is not empty. We then compute a list: {Nl, . . . ,Ns,}
o f all of them, together with explicit OK[y ]/,,l,-isomorphisms: i = 1,. . . ,S I
y, : N, +-6K{d,,,. . . , d1-l)
where {d,,,. . . ,d , - i } ,is the K-standard type of Q M I . Put again: A, = AXMi C N , . By the method described in Step 111 we decide for each pair: (Ni,N , )
2 s i =ssI
whether there is an B[y]/,,~)-moduleautomorphism: with
E,
: M i + Mi
E,
: (NI) = N , .
Step I11 then gives us a finite list of { E ,,..., E , } , to get:
Lemma 18. {E
fl
E fl
is generated by the
E,’S
.. . , E , }
{ E ~ ,
I E ( N I ) = Nil.
Proof. This follows from Lemma 15.
which we renumber to
together with :
f i e conjugacy problem in nrirhmeric groups
121
Using our isomorphism: 71:
NI4
. . df}
OK{&,.
7
We end up with the problem: Give a set of generators for t h e group:
1
Tz = { E E r & ( A l )= A l }
(10)
where r is the group of OK module automorphisms of and A , is a subgroup of finite index in OK{&, . . . , O K { & , . . ., At t h e end of Step I11 we showed how to give explicitly a finite set of generators for r. Knowing a set of generators for r2on a Z-basis of we could rewrite the corresponding O[y]/,,~,-module OK{ d o , .. . , automorphisms of M I by the method described in the proof of Lemma 15. and To deal with problem (10) we again put M = OK{&, . . . , compute:
I M : A l l = h6
-
We put r[h6] = k e r { r GL, (Z/+)}. This homomorphism is defined by use of our Z-basis in M.The was called r(h6). Knowing all elements of image of r in GL,(Z/,) r(h6) we give a set of coset representatives for:
rlmi
in r. Because we are able to solve the word problem in r/r[b] effectively by reduction modh6 we have an effective coset representative function required by algorithm As. This algorithm then turns our set of generators for r into a set of generators { T ~. ,. . ,q v } for r[h6]. Next let Al be the image of A l in (Z/-)". Knowing all elements of r(h6) we make a list of: {E
E r(A6)
1
E
(51) = 31)
This is a finite list with no more than (A:)! members. Let l, . ., . ,5. be a set of preimages in of all elements of this set. We are able to find them as expressions in terms of the generators of r. Any element y of r [ & ] now satisfies:
r
r(Ai)=Ai and we obviously get:
Lemma 19.
r2is generated
by
771,.
. . ,vU, 51,. . .-5..
EJ. Grunewald
122
This finishes the problem of Step IIIa.
Proof of Theorems A‘, C’ A glance through Steps I and I1 shows that everything stays correct if one replaces Z by OK.One has to keep in mind that we consider OK to be given by a Z-basis together with a multiplication table for this Z-basis. Steps I11 and IIIa may then be used as stated to obtain a proof of Theorems A’ and C’. Proof of Theorems B, B’ We give our matrices T, f to the algorithm described in Theorem A and decide whether there is an X E GL. (Z) such that:
XTX-’
=
?
If there is none the answer to our question is “no” too. If the answer is “yes” we construct such an X by the same algorithm and compute the determinant of X . If det X = 1 then we are in good shape. Otherwise we find the following two equivalent statements: (i) 3Y E SL.(Z) with Y T Y - ’ = f. (ii) 3Y E Cez(T)with det Y = - 1. Then we construct a set of generators ll,. . . , Ls for the group Wz(T) by Theorem C and look whether:
det (6,) = - 1 for a generator 5;. The answer to our original question is then “yes” if and only if det (&) = - 1 for at least one i. The proof of Theorem B’ is slightly more involved. We again assume to have found an X E GL. (OK)with:
XTX-‘=
f
by Theorem A’. Put u = det X .
We then construct a set of generators and put u, = det
6,
{ 3 ] ,. . . ,& } of the group W,(T),
for i = 1,. . . ,s
and V = ( u l , . . . , u s )C 0:. We then get the following two equivalent statements: (i) Y E SL.(OK) with Y T Y - ’ = f. (ii) u E V. To decide the second condition we express u and ul,. . . , us in terms of
f i e conjugacy problem in arithmetic groups
123
our set of generators for 0: by the second part of algorithm Ae. Having done this we apply Gauss' method. Proof of Corollaries A, B Take two preimages To,foof T and
a
= det
To;
ci
= det
f
in GL,(K) and put:
fo
We factorize then the polynomial:
over K. By this method we find all zeroes b,, . . . , b, of this polynomial in K. For each zero put:
6, = and
(; ;#)
T,=T6,
...
i
=
1, ..., r
i = l , ..., r.
Then we find the following equivalent statements: (i) 3 X E P GL. ( O K )with XTX-' = f in P GL, (K). (ii) 31 S i S r and X E GL, (OK)with XT,X-' = f0. Similarly: (i) 3 X E P SL, ( O K )with XTX-' = f in P GL, (K). (ii) 31 S i S r and X E SL, (OK)with X T X - ' = fo. The second conditions may then be decided by applications of the algorithm from Theorems A', B'. B. Some algorithms
Let K be a finite extension field over the rational numbers Q. We consider K to be known if we are given a Q-irreducible polynomial P ( x ) of degree m with rational coefficients and are told that K is generated as a field by a zero of P ( x ) , i.e.,
K = C?[x]l(P(x)) as fields. By a suitable transformation x+Ax
AEZ
we may assume that P ( x ) has integer coefficients and highest coefficient 1. Hence all zeroes of P ( x ) are algebraic integers. We write
EJ. Grunewald
124
OK for the full ring of algebraic integers in K . We consider OK to be } 0, in terms of the field known if we have a Z-basis { w , , . . . , u r nfor generators I , x , . . . , x ' " - ' , i.e., w , = COl+ w, = Corn
CllX
+
+
* * *
+ c,,x +
*.
a
G - I . 1 .
+ Crn-l.,
Xm-I
* Xm-l
with c,- E Q. One of the following algorithms will compute such a 2-basis for OK. A lot of t h e following algorithms are taken from Steinitz's papers [19], [20]. We often have only written out why t h e reasoning of [19], [20] is effective. For the algorithms given in section A of this paper we need only A,,-A6 and AI7, Aln,A,, of the algorithms listed below. Algorithm A,, Let K be an algebraic number field and let P ( x ) E K [ x ] be a polynomial with coefficients in K. Algorithm A, splits P ( x ) into irreducible factors:
P ( x )= Pl(X)
-
* * *
*
P,(x).
P, ( x ) are K-irreducible for i = 1,. . . ,r. If P ( x ) is monic with coefficients in t h e ring of algebraic integers OK of K , then each P i ( x ) will be monic with coefficients in OK. For this w e use Kronecker's algorithm as described in [ 9 ] . Algorithm A , Let K be an algebraic number field. Given 1 polynomials PI,.. . , PI E K [ x ] , this algorithm decides whether P I , .. . , P I are coprime and if the answer is "yes" it produces polynomials Q,, . . . , QI E K [ x ] such that:
OIP, + . . . + QIPI = 1. If t h e polynomials P I , . . , PI have coefficients in t h e ring of algebraic integers OK of K and if they are coprime, algorithm A l will compute t h e polynomials Q,, . , . , Ql with algebraic integer coefficients together with an element A E OK such that
OlP,+ . . . + QIPI = A. A repeated application of Euclid's algorithm in K [ x ] will do t h e job of A,.
7 h e conjugacy problem in arithmetic groups
125
Algorithm A2 Let K be an algebraic number field and T E GL, (K) for a natural number K. Algorithm A z computes the characteristic polynomial char(T) of T together with the minimal polynomial min(T). Knowing the characteristic polynomial char(T) we factorize it by algorithm Ao. In the list of all divisors of char(T) we then find the polynomial of minimal degree which is satisfied by T. Algorithm A, Let K be an algebraic number field. Let T E GL, (K). Algorithm A, computes a semisimple matrix S and a nilpotent matrix U both with coefficients in K such that:
and
T=S+U
su= us
We first compute t h e minimal polynomial min(T) of T by algorithm A2. By successive factorisation of min(T) we construct a multiplication table for the normal extension field K O of K which is generated by all zeroes of min(T). By an algorithm of linear algebra we find y E GL, ( K O )such that yTy-' = J, where J E GL,(Ko) is in Jordan normal form. How to decompose: with
J = S o + Uo
UoSo= UOSO
where Uo is nilpotent and SO is semisimple is then evident. We put:
S
= y-'S0y,
u = y-'u,y.
A Galois-type argument then shows that both
S, U E M. (K). For all this see [4] Chapter 7, 05. Algorithm A, Let M = Z" be a free Z-module with basis. Let A be an n x n matrix with integer entries representing an endomorphism with respect to this basis. Algorithm Aq computes a Z-basis for the kernel of A.
126
EJ. Grunewald
By elementary row and column operations (over the euclidian ring Z!) we find element P, Q E SL,(Z) such that
PAQ = D where D is a diagonal matrix. Algorithm A, (Reidemeister-Schreier) Given a finite set of generators g,, . . . ,g, for a group G and a subgroup H S G of finite index together with a set of coset representatives Y and an effective coset representative function
@:G+Y As computes a set of generators for H. We use the method described in [6] Lemma 7.2.2. Algorithm A6 Let K be an algebraic number field with integeral generator 19. Let 6 be the subring of K generated by 4, this is the same as the Zsubmodule of K generated by 1,6,6',. ..,I?,-'where rn is the degree of the minimal polynomial of 6. This algorithm computes a Z-basis WI
=
Cll6
COl+
w, = Corn
+
* .
+ CI,6 +
.+ C m 1 6 m - l * * *
+ Cmm6m-l
Ci.1
EQ
for the ring of algebraic integers OK of K. Furthermore the algorithm gives the index Let
1 6 K : 6I = t.
Tr:K+Q
be the trace of the field K. The trace of an element a of K is the trace of the linear automorphism, which is induced by multiplication with a. It ,can therefore be computed, for example, with our field basis 1,6,.. ., a"-'. Let
BK( a ,p ) = Tr(a * p ) be the trace form induced on K. If we first compute the matrix of BK with respect to 1,6,.. . ,a"-', we are able to give a complementary basis d l , .. . , 4, for 1,6,. . . , $?"-I with respect to BK. Let d be the Z-module generated by d,,. . .,4,. Then by [ 5 ] Chapter 11, 02, Theorem 6:
OK
d
7he conjugacy problem in arithmetic groups
127
Next we list a set of coset representatives a I ,. . .,a,for 6/O. Let a,,. . . ,a, be those which are integral. This can.be checked by computing the characteristic polynomials for the linear maps corresponding to the 0's on K. Because t h e existence of an algebraic integer in a coset b/O would imply that any element of the coset is integral, we find OK
=(o,al,...,a,)
We then compute by Gauss' method a basis for OK and the number t. Algorithm A, Let K be an algebraic number field with ring of algebraic integers OK. Let n be a natural number. This algorithm lists a set of elements AI,. . .,A, E OK with such that for any A E OK with
1 OK : A O K 1 c ~t we can find a unit u E OK such that
A =A~u for some i E (1,. . . ,r } . Here we use the algorithm described in the book of Borevich-Shafarevich [ 5 ] Chapter 11, 55.4. Algorithm As Let K be an algebraic number field with ring of algebraic integers OK and let OE be the unit group of OK. (i) The first part of this algorithm gives a set of generators:
5,u1,. . . , us for the unit group of OK. ul,... , u s are a set of fundamental units and 4' is a root of unity. (ii) Given a unit u E 0: in terms of our Z-basis of O K : u = alwl
+ - + a,w, * *
ai E Z
the second part finds f , e l , .. . ,e, E Z with: u = 5fu;1.. . u >
(i): Let 9 be a generator of K over Q ; we first compute t h e discriminant d of K , for example by obtaining a Z-basis for OK. By [7]
128
EJ. Grunewald
928.1 the order of 5 is a divisor of 2 )d 1. By algorithm A. we then decompose the polynomial: X2ldl - 1
in K into irreducible factors. By this we find generators for the torsion group of 0:. T o find a set of fundamental units { u l , . . . , u s }we employ the algorithm described in [ 5 ] Chapter 11, 05. 3. (ii): For the second part of this algorithm we have to explain Dirichlet's method in more detail. For a description of this method see [ 5 ] Chapter 11, 05. We want to prove: Let u E 0: be a unit given in terms of t h e chosen Z-basis of OK. There is an effectively computable number c E N such that
(*I
u E { 5 c w ; ~ *. .
. . u > Jle, I s c, Vi = 0 , . . . , s }
where 5, u , , . . . ,us is t h e set of generators computed in (i). For this let al, . . .,a,, be the distinct embeddings of K into R and T ~. ., ., T , ~ , f l , .. . , f,zbe the embeddings of k into C which do not map K entirely into R. Put:
L : K,{O)-, R'I+'~ L : a +(In l a l ( a ) l , .. . ,In lurl(a)l,lnl ~ ~ ( a ). l. .,In ~ , Irr2(a)I2) L is a homomorphism of the multiplicative group K,{O} into t h e additive group of R'l+'z. The image of 0: under L is a discrete lattice of dimension rl + r2 - 1 in R'l+'z. The kernel of L is exactly the group of roots of unity contained in 0:. By the proofs of Lemma 1 and Theorem 2 of (51 Chapter 11, 05 it is clear how to find effectively a rational number c I# 0 such that:
where:
B ( C J ~~ ( 0 ; =) (0) B ( c ) = {x E R I " .
IIlx I1 = +d/xxIcc }
is the (rI + r2)-dimensional euclidian ball of radius c. By approximating all zeroes of a minimal polynomial of a field generator of K in C up to the first decimal of their real and imaginary parts, we find for our explicitly given unit u E 0: a rational number c2 such that u €5 B ( C 2 ) .
In (*) we may then clearly put:
(
o =max 2 ) d ) ,c2+ 1) CI
m e conjugacy problem in arithmetic groups
129
To finish this algorithm we spread out each element of the finite set mentioned in (*) in terms of our Z-basis of OK and look which element is equal to u.
Algorithm As Let K be an algebraic number field, with ring of algebraic integers OK. Given r ideals al, . . . ,a, by Z-bases, which are pairwise coprime 1.e.: (ai,ai) = OK
for i # j
and elements e l , .. . ,8, E OK. We give here an algorithm for determining an x E OK such that x=8,modal x
= 8, moda,.
We first determine a Z-basis for the ideal a = a1fl .
- . fl a,.
This has as a consequence, that we explicitly know the order of OK/a and a multiplication table for the ring: OK//8.
In this finite ring we check for one element after the other whether it satisfies the corresponding conditions. The Chinese Remainder Theorem makes sure that we find an f in 6 / a satisfying our congruences. As x we may take any preimage of f.
Algorithm Alo Let K be an algebraic number field, with ring of algebraic integers OK. Let a be an ideal of OK given by a set of ideal generators or, which comes to the same (because we know a Z-basis of O K ) , by a Z-basis. Here we compute Z-bases for prime ideals pl, . . . ,p, and natural numbers e l , . . .,e, such that: a = p71. . . . pfr
.
By Gauss’ method we know the order of the ring 4 / a . We are also , .We are given a multiplication table by reduction of our table for 6 then able to decompose:
F.J. Grunewald
130
O K I a = R1 X . . . X R,
effectively into a direct product of Artin local rings R,. Let p, be t h e maximal ideal of R, and e, be t h e least natural number such that p? = (0). Let: ~ p ,: O K
+
R,
be the projection homomorphism, which we may consider to be known explicitly. Put pr = Cp;l(pt). By algorithm A5 we are able to compute a set of generators for the abelian groups p,. By Gauss’ method we reduce these sets to Z-bases. Algorithm A l l Let K be an algebraic number field, with ring of algebraic integers OK. Let a, b be two ideals given by Z-bases. We decide here whether there is a c E K (cZ0) with c a = b.
If the answer is “yes” we determine such a c. In particular we decide whether ideals are principal, and in case they are, we give generators for them. We determine first a A E Z with: AbCa. A c satisfying c a = b would then satisfy:
Aca= AbCa.
Hence we get Ac E OK. On t h e other hand putting: A o = la:Ab/
we get:
I
OK
I
: ACOK = A ~ .
We now take the list of nonassociate elements yl, . . . ,ys with )OK:Y,OKI=AO
computed by algorithm A, and check whether one of the numbers A-ly,, i = 1,. . . , s, satisfies: A-’y,a = b.
f i e conjugacy problem in arithmetic groups
131
Algorithm A12 Let K be an algebraic number field, with ring of algebraic integers OK.Let K be the class number of OK and JK the ideal class group of OK. This algorithm computes K together with ideals a,, . . . ,a, which lie in distinct classes. Furthermore a multiplication table for JK is given. Let m be the degree of K and D be the absolute discriminant of OK. From the proof of [ 5 ] Theorem 3 of Chapter 11, 96 it follows that each ideal class of OK contains an ideal a with
We then list all subgroups of OK of index s c . There are not more than 2(Cd)of them. For each such subgroup we check whether w,NCN
i = l , ..., m
for our Z-basis of OK. This leaves U S with a list of all ideals in OK with index at most c. This gives a first upper bound for K . By algorithm A l l we deduce from our list a set of inequivalent ideals together with a multiplication table for JK. Algorithm A1, Let K be an algebraic number field, with ring of algebraic integers OK. Let a, b be two ideals of OK. This algorithm finds a number c E K such that c b C OK and c b is coprime to a. In other words, given an ideal class of OK and an ideal a we want to find an element of 6 which is prime to a. We first split a by algorithm Alo into prime factors: a = p;i.
...
- pf,
Then we determine by algorithm A,, an ideal bo in the ideal class inverse to the class of b. Again by application of algorithm A,,, we find: bo = p{i . . p{rq;i . . . q3 Here t h e f, might be zero. We then find elements: 0, E OK for i and with
8, E
$8;
8,
pfs+l
1,.
for i = 1 , . . . , r
vg€OK
f o r i = I , ..., s
7,Eq:
for i
=
=
1,..., s.
. . ,r with:
132
E l . Grunewald
Then we determine by algorithm As a solution of the congruences: z
= O1 mod
z
= 8, modp!." = v 1modq?
z
z = 9, modq:.
The ideal z OK is clearly divisible by bo hence we find an ideal bl such that:
z * OK
= bob1
bl may be determined by: b i = { v E OK IvboCZ ' O K ) .
But bl then lies in the ideal class of b and by the choice of our congruences it is prime to a. Algorithm AI4 Let K be an algebraic number field, with ring of algebraic integers OK. Let A = (a,,)be an n x m matrix with coefficients from O K such that:
and
I
(a,, 1 d i d n ; 1 c j s m ) = O K
rgKA 3 2
Algorithm A14determines a vector x = (xI,.. .,x.) with entries in OK such that: with
XA = y = (yi,. . ., y m ) (yli.. ., y m > = OK.
The rank condition for A allows us to assume that: d = a l l a 2 2 -uZ1u12
is not zero. We first compute a Z-basis for the ideal d . OK and decompose it by algorithm A; dOK = p;' ' . ' p:',
For each ideal p. we find an entry a , , , , , , , ,such that: a I(v).l(u)
P..
f i e conjugacy problem in arithmetic groups
133
By algorithm As we solve the congruences: if p = i ( v )
else
mod p.
for p = 1,. . ., n and v = 1 , . . . , r . NOWput: x ’ = ( x : ,..., x:)
and y ‘ = x ’ A = ( y I ,..., y A ) .
a22and azl not being simultaneously 0 (because of (11)) we may furthermore assume that:
y;zo. We apply algorithm Alo again to find:
y;OK = pi,. . . p!rq;, . . . q:: where the fi might be zero, but where none of the q, is a p.. We now find an a E OK with: (12)
v = 1, ..., s v = l , ..., r
a =lmodq, a=Omodp,
It then follows that: ad$Omodq, a d =Omodp,
v = 1, ..., s v = 1,..., r.
Then we find a number b E OK such that:
y { + abd
+ 0 mod q.
v = 1,. . . , s
This is possible because of (12). It is then obvious that the ideal
( y : + abd, y 4)
has only prime divisors from the set {pl,. . . ,p,}. Now put: c r = ba2* and
r = ba2,.
We make the further definitions: x,=x~+aa, for v
= 3,. . . , n
x2=x;+7a,
x,=x:
and
y = xA = ( ~ 1 , . . . , y m ) .
Because of our choice of a the congruences (11) are still valid for the Xi.
EJ. Gruncwold
134
Hence n o p. for v = 1 , . . . ,r divides (contains) the ideals ( y l , . . .,ym). On the other hand an easy computation shows: yl = y l + abd
and
y2 = y;,
so by the above every prime division of (yl, . . . , y m ) is amongst the PI,. . . ,p.. Thus ( y i , . . . ,ym)= OK. Algorithm A1, Let K be an algebraic number field, with ring of algebraic integers OK. Let A = (ali) be an n x rn matrix with coefficients in OK such that r g K A G n - 2.
:. ,x,)
Here we find a vector x and
(xi,. . .,x n )
=
= (xl,.
with entries in OK such that
OK
X A = 0.
We first find two linearly independent solutions with entries in OK: Put:
a = (a,. . .,a")
p r = (Pi,.. . ,PA)
( p ' )= (pi,. . ., P 3 By algorithm AI3we find in the ideal class of ( p ' ) an ideal (a) = ( a l , .. . ,a.)
and
coprime to (a).In other words we find a c E K such that: CP
= : P I , . . . ,C P :=:
P n
a which is
EO K
and (a)is coprime to ( p ) where:
P
= (PI,. . ., P n ) .
By algorithm AI4we then find yl, y2 E OK such that:
satisfies: (x,, . . . ,x,)= OK. Algorithm Al, Let K be an algebraic number field with ring of algebraic integers OK. Let x = (xI1,x I 2 , .. . ,xln) be a vector with entries in OK such that: (xll,.
. ., X I " )
=
OK,
7he conjugacy problem in arithmetic groups
135
Algorithm AI6 finds a matrix
with det X = 1. We assume we have constructed the matrix X up to the rth row
with r < n - 2 under the hypotheses that: (619
* . ., 6s) = O K
where 61,.. . ,6, are the r x r subdeterminants. Here s = (:). There is a n x (,:J matrix A ( r ) whose entries are the 6, with appropriate signs attached, such that the column:
( r l , .. . ,z.). A ( r )
has the ( r + 1)X ( r + 1) subdeterminants of
as entries. The rank of A is n - r. As long as r S (n - 2) we may therefore employ algorithm A I 4to find a ( r + 1)th row such that the ( r + 1)x ( r + 1) subdeterminants still generate t h e unit ideal. For the linear algebra involved see [4] Chapter 3. By this method we find X up to the last row. Let S , , . . . , be t h e (n - 1) x (n - 1) subdeterminants of the so found ( n - 1)X n matrix. We know that:
(61,. . .
9
&+I)
=
OK-
By writing out a basis for the abelian subgroup generated by oi6, ( i = 1,. . . ,m ; j = 1,. . . , n ) inside OK we find a solution of the equation: xn161 + * ' * + XnnS. = 1 i = 1,. . . ,fl xni E O K Algorithm AI7 Let K be an algebraic number field with ring of integers OK.Let -~
136
F.J. Grunewald
ul,. . .,anE 62 for natural numbers n, m. This algorithm decides . is free. If the whether the.OK-module M generated by the u l r..,a,, answer is “yes” an OK-module basis is computed in terms of the (TI,.
. .,an.
Let r be the K-dimension of the vector space generated by the uI,. . . , a nBy . successive application of t h e algorithms Al, and AI6 we effectively find a set of generators: for M
rl,.. . ,T , + ~
in terms of the u I , . .,a,,. We then form the matrix: all,.
x=(
i
. . ,a1.n i
)
. . ,a r + 1 . m ~. ., . ,T , + ~ X . has rank r. Let * X be the left
a+1,1,.
whose rows are the T adjuncted matrix to X. Its entries are certain r x r subdeterminants of X , with appropriate signs attached. * X has ( r + 1) columns and many rows and satisfies:
*XX = 0 X having rank r. X has nonzero entries. For each row (XI,.. .,x,!+~)of * X we test by algorithm All whether is a principal ideal.
(x I, . . . ,
Let the answer be “yes” for the row (xi,. . .,x;+,) and put: cxi = x :
for i = 1 , ..., r + l
for a generator c found by algorithm A l l . Then: and
(XI, * * .
9
-&+I)
=
OK
x,71+ . . . + xr+17;+1= 0
This follows from the corresponding properties of the xi. By algorithm Alh we may then effectively find a set of generators for M consisting of r vectors. Then M is free and we have found an OK-basisin terms of the ul,. . . ,an. Suppose now that n o row of * X generates a principal ideal. We claim that then M will not be free. For this see [19] Theorem 27. Algorithm Al, Let K be an algebraic number field with ring of algebraic integers OK. Let B be an explicitly given finitely generated abelian group
7he conjugacy problem in arithmetic groups
137
equipped with an OK-modulestructure, which is given on a set of generators of B. Algorithm Ale decides whether B is a free OKmodule. If the answer is “yes” an OK-modulebasis is computed. We first test whether B is a free Z-module by Gauss’ method. This condition is clearly necessary for B to be a free OK-module. B being a free Z-module we pick a Z-basis. Then we have: B = Z” C Q“ = Q B and Q B carries a K vector space structure. We pick then in Q B a K-basis such that: B C OZC K‘ = Q B . Then we express our Z-basis for B as vectors in OZ and use algorithm An.
Algorithm AI9 Let K be an algebraic number field with ring of algebraic integers OK.Let n be a natural number. This algorithm lists a finite set of generators for GL. (OK). If for a given n we would have computed a finite set of generators y l , . . . ,y, for SL. (OK) we would get a finite set of generators for G L . ( b ) by adding matrices
where u runs through a set of generators for the unit group of OK which is computed by algorithm As. It suffices then to compute a set of generators for SL.(OK): (1) n = 2, OK has infinitely many units. The main result of Wasserstein’s paper [25] says that SL(OK)is then generated by
G
-;) (; y),.-.,(; ”;) 9
where wI,. . .,wm is a Z-basis of OK. (2) n = 2, OK is imaginary quadratic. In this case Swan’s paper [21] even gives a presentation for !%(OK). It can be checked that the constructions of [21] are all effective. See the remarks in section 8 of this paper.
138
F.J. Grunewald
These two cases exhaust SL2 because of Dirichlet’s theorem o n units (see [51). (3) n 2 3 . By [O] and [I51 Corollary 16.3 it follows that SLn(O;O is generated by elementary matrices. It is quite easy to extract a system of generators for SLn(OK)n 3 2 from the paper [lo] of A. Hurwitz. The method used there can also be seen to be effective. A feature of all these methods is that one can give an a priori bound for the number of generators needed for SLn(OK)in terms of n, the discriminant of OK,the degree of K, and the number of complex valuations of OK. References [O] H. Bass, J. Milnor and J.-P. Serre, Solution of the congruence subgroup problem for
SL, ( n 3 3) and Sp,. (n 2 2), Publ. Math., IHES 33 (1%7). (11 W.W. Boone, F.B. Cannonito and R.C. Lyndon, Word problems (North-Holland Publishing Comp., Amsterdam, London, 1973). [2] A. Borel, et al., Seminar on algebraic groups and related finite groups, Lecture Notes in Mathematics 131 (Springer Verlag, Berlin-Heidelberg-New York, 1%9). [3] A. Borel and Harish-Chandra, Arithmetic subgroups of algebraic groups, Annals of Mathematics Vol. 75, No. 3 (1962). [3a] A. Borel, Arithmetic properties of algebraic groups, Proc. Int. Congr. Math., Stockholm (1962). [4] N. Bourbaki, Algbbre, Chap. 3 and Chap. 7 (Hermann, Paris, Deuxibme Edition, 1958). [5] Z.I. Borevich and I.R. Shafarevich, Zahlentheorie (Birkhauser Verlag, Base1 u. Stuttgart, 1966). [6] M. Hall, The theory of finite groups (Macmillan Comp., 1973). [7] H. Hasse, Zahlentheorie (Akademie-Verlag, Berlin, 1%9). (81 E. Hecke, Vorlesungen iiber die Theorie der algebraischen Zahlen (Akademische Verlagsgesellschaft, Leipzig, 1923). [9] G. Hermann, Die Frage der endlich vielen Schritte in der Theorie der Polynomideale, Math. Annalen 95 (1926) 736788. [lo] A. Hunvitz, Die unimodularen Substitutionen in einem algebraischen Zahlkorper. Nachrichten von der k. Gesellsc.haft der Wissenschaften zu Gottingen (1895). [ll) R.D. Hurwitz, On the conjugacy problem in a free product with commuting subgroups, Math. Annalen, Vol. 221 (1976). [ 121 W. Magnus, Noneuclidian tessalations and their groups (Academic Press, 1974). [13] W. Magnus, A. Karass and D. Solitar, Combinatorial group theory (Interscience Publishers, 1966). [14] Charles Miller, 111, On group-theoretic decision problems and their classification, Annals of Math. Studies, No. 68, Princeton University Press (1971). [15] J. Milnor, Introduction to algebraic K-theory, Annals of Math. Studies, No. 72, Princeton University Press (1971). [16] M. Newman, Integral matrices (Academic Press, 1972). [17] I. Schur, Uber Gruppen linearer Substitutionen mit Koeffizienten aus einem algebraischen Zahlkorper, Math. Annalen, Vol. 71 (1912).
fie conjugacy problem in arithmetic groups
139
[ 1x1 A. Seidenberg, Constructions in algebra, Transactions of the Am. Math. Soc.,Vol. 197 (1974).
[ 19) E. Steinitz, Rechteckige Systeme und Moduln in algebraischen Zahlkorpern I, Math. Annalen, Vol. 71 (1912). 1201 E. Steinitz, Rechteckige Systeme und Moduln in algebraischen Zahlkorpern 11, Math. Annalen, Vol. 72 (1912). 1211 R.G. Swan, Generators and relations for certain special linear groups, Advances in Math. 6 (1971) 1-77. 1221 0. Tausky, Matrices of rational integers, Bull. Am. Math. Soc. 66 (1960) 327-345. 1231 B.L. van der Waerden, Eine Bemerkung uber die Unzerlegbarkeit von Polynomen, Math. Annalen, Vol. 102 (1930). 1241 B.L. van der Waerden, Moderne algebra I + 11, 2nd edition (Springer Verlag, Berlin, 1937). [2S] L.N. Wasserstein. On the group SL, over Dedekind domains of arithmetic type (Russian), Mat. Sbornik, Vol. 89 (131) No. 2(10) (1972). 1261 A. Weil. Adkles and algebraic groups, Inst. Adv. Study (1961) (notes polycopies). 127) M.J. Wicks, Presentations of some classical groups, Bull. Aust. Math. Soc., Vol. 13 (1975).
S.I. Adian, W.W. Boone, G. Higman, eds., Word Problems I1 @ North-Holland Publishing Company (1980) 141-155
ALGEBRAICALLY CLOSED GROUPS: EMBEDDINGS AND CENTRALIZERS Kenneth HICKIN* and Angus MACINTYRE** Michigan State University, East Lansing, MI 48824, USA Yale University, New Haven,
W W ,USA
90. Introduction
Here we present several results on t h e algebraic structure of a.c. groups. We refer t h e reader to [2: Chapter IV, §8], [3], and [6] for discussions of t h e basic properties of a.c. groups. Theorem 1. No a.c. group can be embedded into a finitely generated subgroup of itself. This suggests the following problem: Is there an o-homogeneous group (in the sense of Jonsson) which can be embedded into a f.g. subgroup of itself? If A and B are groups, A C B means that A is a subgroup of B. Theorem 2. Suppose G is a countable a.c. group. There is a set % of 2-1 a.c. groups, each of power w I and m-o-equivalent to G, with the property: each U E % has a subgroup U,,= G such that, for all U # V E % and for all countable B and D with U,,C B C U and V,C D c V, neither of the centralizers CU(B)and Cv(D) is embeddablp in the other. Hence distinct members of 91 are not isomorphic. If the C.H.is assumed, then we can arrange that distinct members of % are mutually non-embeddable. In [ 5 ] Shelah and Ziegler give a complete solution to the spectrum problem for a.c. grdups using stability theory. Our proof of Theorem 2 The first author gratefully acknowledges the support and splendid hospitality of the Wanvick Symposium on Infinite Groups and Group Rings during the preparation of this paper in May of 1978 at the University of Wanvick. ** Both authors were aided in this research by NSF grant MCS77-07731. 141
142
K. Hickin, A.
Maclntyre
shows that, at least in the case of wl, extremely elementary techniques involving centralizers suffice to construct the maximum number of CQ-wequivalent a.c. groups which are not isomorphic for obvious reasons. In the remainder of this section G denotes an arbitrary a.c. group. MIN denotes the minimum condition on subgroups.
Theorem 3. Suppose A C G satisfies MIN and A is the intersection of finitely many f g . subgroups of G. Then, for every M with CG( A )C M C G , there exists H C A such that
(the normalizer of H in G ) .
CG( H )C M C NG ( H )
Zn particular, if A is finite, then there are only finitely many subgroups of G containing the centralizer of A. Corollary. If A C G is finite and characteristically simple, then NG ( A ) is a maximal proper subgroup of G.
Proof. Suppose NG ( A )C M C G. By Theorem 3, CG (H)C M C NG (H) for some H C A. Since G is a.c., every automorphism of A is induced by an element of N G ( A ) . Hence, H is characteristic in A and either H = l , M = G o r H = A , M=NG(A). A more complete characterization of the maximal subgroups of countable a.c. groups will be given in a future paper by the first author. If P C G and 9 acts on P, we define N Z ( P ) = { x E NG ( P ) x induces a 9-automorphism on P}. Aut (P) is the automorphism group of P. Further information on the subgroup lattice between C G ( A )and G is given by
I
Theorem 4. Suppose B,C G , 1 C i c n, are fig. and B,f l . . r l B, has MIN. Suppose 23,C Aut ( B , ) , 1 C i d n, and let K = the largest subgroup * n B, which is inoariant under 9 ,U . . . U 93“. Then, of B ,fl.
I
N ~ ~ . . . . . ” J= ( K( )N ~ ( B , 1) s i s n ) . In the special case of centralizers Theorem 4 becomes
Lemma 1. Z f B , C G , l d i d n , aref.g ., t h e n C G ( B I n . . . n B , ) = ( C G ( B i )1(s i C n ) . Proof of Theorem 3. Induct on the subgroups of A. Suppose C G ( A ) C
Embeddings and centralizers
143
M C G. If M C No (A), there is nothing to prove. So, suppose x E M - NG(A).We have Ao=A n A " < A since A f A' and A has MIN. Now, M 3 (CG(A), X ) 3 (CG(A),CG(A')). Put A = Dl n * * n D, (DiC G f.g.). Thus, M 3 CG (Di)and M 3 CG(DI) for all 1 < i < m. Since A. = DIf DYn l * fl D, n D i , we have by Lemma 1, M 3 CG(Ao). By induction we now conclude that CG(H)C M C NG ( H ) for some H C A".
-
Next we consider the normal structure of normalizers.
Theorem 5 . Suppose A C G is f.g. and x E NG ( A ) - A. Then CG(A) C X ~ G ' ~ Thus ) . CG(A)/CA(A) is simple. ( x c is the subgroup generated by all the c-lxc, c E C ) . It is well known that if A C G is centerless and f.g., then CG(A) is a.c. and m-o-equivalent to G. If CA(A)# 1, it is in many cases easy to see that CG(A)/CA(A)has a pair of isomorphic f.g. subgroups which are not conjugate-in it. For example, suppose CA(A) is not divisible and let x E CA(A) have no pth root. Choose z, y E CG(A) such that y p = x and z p = 1, z & CA(A). It is easily checked that (yCA(A)) and (zCA(A))are not conjugate in CG(A)/CA(A) since the former 'cosets have no elements of order p . Next we note that Theorem 3 fails if A is infinite cyclic.
Proposition 1. Suppose Z C G is infinite cyclic. Put c = U{C,(Z")In 2 11, N = U { N ~ ( Zn 3~ I), )) and M = {x E G IZn Z x # 1). Then C a N a M, M is maximal in G, C is simple, N / C has order 2, and M / N is free abelian of rank w. Proof. Clearly M # G and its maximality follows from Lemma 1 since x $Z M implies Z n Z" = 1 whence ( M , x ) 3 (CG( Z ) , CG(Z')) = G. The simplicity of C follows from Theorem 5 : if 1 # x E C, there exists m 3 1 such that x E CG( Z " ) - Z" for all multiples n of m . Thus, CG( Z " ) C X ~ C ( ~ for " ) such n which gives C C x c . Clearly N / C has order 2 and N a M . It remains to consider M / N . For each pair ( m , n ) of positive integers, there exists T = T ( m , n ) E M such that T-'Z"'T = Z". We will view T as an element of M / N . Thus, 7 ( n ,n ) = 1 , T(n, m ) . T ( m , p )= ~ ( n , p )and , T(nm, n p ) = T(m,p)(mod N ) for all n, rn, and p . Thus,
K. Hickin,A. Maclntyre
144
I
M / N is generated by { T ( a ,b ) a, b are relatively prime} and multiplication in M / N is given by T(a,
( y $)
b ) . ~ ( cd ,) = 7 -
3
where A is t h e least common multiple of b and c. We check easily that M / N is also generated by { T ( l , p ) ( pis prime} and that these generators have infinite order, commute, and are independent. Finally, we will show that a basic construction of group theory can be performed in every a.c. group. If A is a f.g. group, W(A) denotes the words in some finite generating set of A which are relations of A ; W-(A) denotes the corresponding set of words non-trivial in A. If S and T are (coded) sets of natural numbers, S S eT is the relation “S is enumeration reducible to T” [4: p. 1451. Theorem 6 . Suppose A, B C G are f g . and there exists an f . g . F C G such that W-(B) SeW(F). Then, the restricted wreath product A wrB is embeddable in G. In particular, this is true i f B has a solvable word problem.
Corollary. If A C G is f.g., there exists a f.g., perfect, centerless group D such that A C D C G. Proof. Let P # 1 be a finite perfect group. Since P * P is SO-universal [2: Th. 11.31, there is a group H = ( P I ,P J with P, = P such that A C H. Since G is a.c., we can assume WLOG that A C H C G. Now P I *Pf has a solvable word problem; so, by Theorem 6, D = H w r ( P , * P&is embeddable in G. D is f.g., perfect, and centerless.
Notice that the group D of the corollary cannot always be chosen simple or even subdirectly irreducible. For example, if every f.g. subgroup of G is recursively presented [3: Th. 81 and A has an unsolvable word problem, D cannot be so chosen; and also, if S is a finite non-abelian simple group, then S wr A is not embeddable in G because S wrA is subdirectly irreducible. This same reasoning shows that for arbitrary G, the converse to Theorem 6 is also true: namely, if S wr B is embeddable in G, this group is an adequate choice for F since W-(B)<, W ( S w r B ) since S wr B is subdirectly irreducible.
Embeddings and centralizers
145
91. Proof of Theorem 1
We will need the General Higman Embedding Theorem [C.F. Miller 111 (unpublished) and M. Ziegler [6]]. Suppose the group K has a presentation ( X , t., ( n E 0 ) : W ( A ) U R ) where X is a finite generating set of the group A, R seW(A), and the relations R are consistent ouer A (and may involve r’s and x’s). Then K is a subgroup of a f.g. group H which has a similar presenrarion ( X , sI,.. .,s,, : W(A) U F ) where F is finite ; furthermore, {t, 1 n E w } is a recursive subset of H. And we will also need Lemma 2. Suppose S, T C w, w - S seS and T is r.e. in S. Then T % S. This is proved by a simple double enumeration argument. To prove Theorem 1, suppose G is an a.c. group, A C G is f.g. and cp : G + A is an embedding. From the inclusions cp ( A ) C cp (G) C A and the fact that G is a simple group it follows that W c p ( A 1) s
(1)
e
W(A 1
because for all x, y E W-(cp(A)),we have x E y *. Let P and R be disjoint subsets of w which are r.e. in W(A), but such that there is n o set X C w recursive in W ( A ) with P C X and R C w - X. Such sets are obtained easily by relativizing the construction given in [4: p. 941. Define a group r with generators
(2)
1
c,, u,, u,,
c,
=
and
d ( n E w ) and relations
1 for all n E P, c > c > = d for all n E P.
From (l), Lemma 2, and t h e fact that P and R are r.e. in W(A), it follows that w ( r ) seW(A 1.
So, t h e General Higman Theorem implies that r is a subgroup of an f.g. group H with relations W ( A ) U F, F finite, and that t h e generators (2) form a recursive subset of H. Since G is a x . there is a homomorphism p : H + G such that p ( d )# 1 . From the inclusions p ( r ) C p ( H )C G 6 A , the first of which is recursive, we have as in (1) that W-(p seW(A). So,
(r))
K. Hickin, A. MacIntyre
146
C = { n E w 1 p ( c , , )# l } s eW(A), and, trivially, C is r.e. in W(A). Likewise, W(p (f)) 6,W(A) and w - C is r.e. in W(A). Hence C is recursive in W(A). Since p ( d )# 1, we have R C C and P C w - C contrary to the W(A)-recursive inseparability of P and R.
62. Proof of Theorem 2 From Ziegler [6] we need Lemma 3. If G is a.c. and A C G is f . g . , then A in G.
X
A is embeddable
Lemma 4. Suppose G is a countable a.c. group, H C G, and X,, C G, n 3 1, are isomorphic to G. Then, there exist embeddings cp : G -+ G and J, : H + G such that (1) c p ( G ) @ J , ( H )exists in G and ( 2 ) for all n 3 1, Cc(cp(Xn)) = cp(G(X))@ J,(H).
1
Proof. Let H = u { H , i 3 1) where H, C H , + ,is f.g. and possibly H = H , for some n. Let G = U{G, i 2 1) where G, C G , + ,is f.g. We construct q and J, inductively. Suppose we have defined cp(G,) and + ( H a )so that c p ( G , ) @ J , ( H , )exists in G. At the next step it is possible (A) to extend J, to H,,, so that S = c p ( G , ) @ + ( H , + ,exists ) in G and (B) given elements g l , . . . , g, E G - S and x,, . . . ,x , E ( G 8 +-, G , )r l ( u n a , X n )to, extend cp to G , + ,so that (a) q ( G , + , ) @ J,(H,+Jexists in G, and (b) no cp (x,), 1 s j s 9,commutes with any g,, 1 S j S p. (A) and (a) can be done because G , + , @H , + , is embeddable in G and G is homogeneous, while (b) can be done because we need only force finitely many inequations, which hold in the free amalgamated product ( S , x l , .. . ,x , ) * ~ ( S , g l , . . . . ,g,), to hold in G.
I
We claim that (A) and (B) suffice to build the desired properties of the embeddings cp and J,. Note that, by Theorem 1 , no X,, is contained , Gi for in a f.g. subgroup of G, so that every X,,intersects G i +infinitely many values of i. Hence, by choosing the sets { g , , . . .,g , ) to eventually exhaust G - (cp(G)@J , ( H ) ) and the sets { x , , . . . , x , } to intersect every X,, for infinitely many values of i, we will guarantee that, for all n 3 1,
147
Embeddings and centralizers
yielding conclusion (2) of the lemma. Proposition 2. Let G be any countable a.c. group and let H ( ( Y ) ,(Y < w l , be groups embeddable in G. There exists an a.c. group G * of power w1 which is m-w-equivalent to G and there exists GoC G * with Go= G such that, for all countable S with GoC S C G *, there exists p < w1 such that
CG.(S)=A x
(n
H((Y)
/3GO
where A C IIos.
Go< G , < . . . < G, < . . . ,
(G,
G,
<~
1
)
as follows. Let G o = G. Having constructed the chain u p t o Go we apply Lemma 4 letting H = H ( a ) and letting {G, y a} b e the . collection { X " } , thus obtaining embeddings cp : Go + Go and 4 : H ( a ) + Go satisfying (1) and (2) of Lemma 4. W e now extend Go to G,+, in such a way that the embeddings cp(Go)< Go and Go < Go+, are isomorphic, say by an isomorphism f : Go+ G,,,, and we identify H ( a ) with f ( J I ( H ( a ) )
I
L
I
Proof of Theorem 2. W e have much freedom t o specify the groups H ( a ) in Proposition 2.
K. Hickin, A. Maclnzyn
148
If S is any set of primes, let H ( S ) be a rank 1 torsion-free abelian group such that, for all primes p and for all 1 # x E H ( S ) , x is infinitely divisible by p if and only if p E S. Since the additive rationals are embeddable in every a.c. group, so are all the groups H ( S ) . Let '21 be a collection of infinite sets of primes such that 1'211 = w1 and for all S # T E '21, S fl T is finite. Let SB be a collection of subsets of '21 such that every member of SB is uncountable 193 I = 2 9 and for all O f R E 93, 0 fl R is countable. If Q E 9, let G * ( Q )be the group G * of Proposition 2 obtained by putting H ( a ) = H ( S , ) where {S, a < w , } is some list of Q. We define '21 = { G * ( Q ) Q ( E a}. Suppose U # V E '21, say U = G * ( Q )and V = G * ( R )where Q # R SB. Let Uo and Vo be the subgroups Go of Proposition 2 of U and V respectively. Suppose UoC B C U and VoC D C V with D and B countable. By Proposition 2, we have
1
Cl= C " ( B ) s Ai
X
and
1
-=
1
where Q = {So a < wl}, R = {T, a < wl}, p, y w I , and A I ,A Zare countable. We must check that neither of these centralizers is embeddable in the other. Suppose f : Cl + C2were an embedding. Then, for all p d a < wl, f ( H ( S p ) )can have non-trivial projection on H(T6), y 6 6 < w I only if S, C T6; but, since Q fl R is countable, this is possible for at most countably many values of a,which forces f(CJ to lie in a countable subgroup of C2,an impossibility. If the C.H.is assumed, we use the second part of Proposition 2. Let U # V be as above. We must show that U and V are nonembeddable. Suppose f : U + V were an embedding. Since Go= f ( Uf,)= G, Proposition 2 implies that
But also,
Cv (f( Uo)) A 2 X
(?
Cfd-f(Uf,))=A , x
(n
@<7-=-
H(So))
with the notation as before. Since the latter centralizer is contained in the former, we have a contradiction to the non-embeddability established above.
Embeddings and centralizers
149
93. Proof of Theorem 4 and Lemma 1 We will use the notation of Theorem 4 and we also define
H = ( B ,..., ~ B.)CG, A = ~ ~ n . - n ~ . , % = 91**..*9n,
and if U 3 H, then
N” = ( N > ( B ~I )1 s i s n ) .
Lemma 5. Assume the hypotheses of Theorem 4. There exists a group J 3 H such that, for all a E %, there exists y E N, such that 1 y I = W, ( y ) n H = 1, H n H Y= K, and y induces a on K (by conjugation). Proof. Let J have the presentation (H, r ( i , a ) where a E 9, and 1 S i s n : W ( H ) and “ t ( i ,a ) induces a and B, by conjugation”). J is a (multiple) HNN extension of H and we will use Britton’s Lemma (B.L.). Put Q, = ( t ( i ,a ) a E 9 , )C J and Q = (Q, 1 S i S n ) . Note that Q C N,. Q is free on the generators { r ( i , a)}and Q n H = 1 by B.L. Also,
I
1
K = n { A q1q E Q}. (1) Suppose w = u1* uk is a product such that adjacent factors belong to different Q,’s. Using B.L. in J, we check easily that (2)
[
for all h E H, h E H iff h E H for all initial segments s = ~ l . . . u, ( l s i s k ) of w.
Suppose a E % is given as in the hypothesis. We will construct y as ..., u k + l of Q to be a product of a sequence of elements u,, determined inductively. We define u1as follows. Let 1 # q, E Q,, 1 d i d n and put 1 , = t ( i , L,).Define s1 = q1 and inductively s,+~ = s,lr+ls;lq,+l, 1 s i d n, and define u I= s,,. We claim
(3)
H nHUT’C
A.
To prove this, assume inductively
if h E H - ( B l n . - . n B , ) , then hsa6fH, which is obvious if i = 1 from B.L. Suppose
K. Hickin, A. MacZntyn
150
h E H - B1n
- - - f l Bi+land h'i+lE H; this implies h &
or else (4), is contradicted using (2).
Thus, h'l E H by (2). If h'l then h'll~+ltir H by B.L. and hence h'i.1 fZ H by (2). So, we have h'd E Bi+land hence h'i'i+l= h'i, whence h =h H contradicting ( 5 ) and proving (4)i+1.Thus (4)" holds and this proves (3). Suppose ul,. . . ,ui have been selected such that the last letter of a, 1 S j S i, belong to distinct Qk's.Put 7ri = u1 * * * ui. and the first of Note that '~';'~i+l
(6)
e
Li = H
n H "L1C A
by (3) and (2).
Suppose there exists a E Li - K . Thus a-4 fZ K and by (1) there exists q E Q such that a"@6Z A, and we can arrange that a"dq bZ H (by replacing q by qr where 1 # r E C?, if u"fl E H - B,).Putting m+l= q,
we have a fZ Li+l(see (6)). So, by (6) and'(2) we have A 3 L 1 > * * * > L i > L i + l > . * * >K.
Since A has MIN, we must have for some k, L, = K. Finally we Q so that y = ? T k u k + + I induces Q on K (we can begin u k + l choose with some l i to prevent unwanted cancellation). is the set-theoretic union of two groups, An amalgam d = = B U C,which intersect in a common subgroup A = B f l C. gp,(d) is the free product of €3 and C with amalgamated subgroup A.
d
Lemma 6. Suppose K C H C J, y E J with y 2 fZ H, y normalizes K , and H f H l Y= K. Let SP = 'Y' be a symmetric amalgam ( - : J + is an isomorphism which fixes H elementwise). Put 3 = gp*(SP). (i) There exists t E X , such that 1 t 1 = m, t normalizes K, ( t )n H = 1, and =
(t)KYJ generates gp,(SP')
in 91, and (ii) Suppose X C Aut (K), y induces a X-automorphism on K and for all a E X, there exists ye E J - H which induces a on K. Then, t can be chosen in ( i ) so that it induces any a E X on K. Proof. To prove (ii), put t = yyy,yg-' where p E X is chosen so that will induce a on K. (To prove (i) we would use y in place of yp.)
t
Embeddings and centralizers
151
Clearly It I = m since y 2 @ H and ( t ) r l H = 1. We have ( r ) K r lJ = K because every z = t:k E r'K with i f 0 involves y implying z $Z J. Hence the amalgam d'does exist in g p , ( d ) . Let w = po(t"k1)pi(t"kz)p2.. . (r'"k.)p,, where p, E J - K, i, # 0, and k, E K, 1 =sk S n (and possibly P o or p . do not occur) be a reduced product of members of d'.We must show that if n 2 1 (i.e., some t occurs), then w f 1 in ?I. This will be so provided not too much reduction in ?I occurs in segments of the form t 'ik,p,t'1
+
1.
To check this we can ignore k, E K by consolidating it with p, and consider subsegments of the possible forms tpr, rpr-', r-'pt, and
t-'pr-'.
All cases being similar, we consider only t h e first. We have
YY%YF-'PYYYPYY-'. If p E J - H , this segment is already an &?-reduced product. If p E H - K , then u = y - ' p y E J - H since by hypothesis H r l HF-' =K; and, making the replacement u = y - ' p y in rpt yields an &-reduced product. Thus, the middle letter of every r occurring in w survives in'the &?-reduced form of w, which implies w # 1 in ?l and finishes t h e proof of Lemma 6. tPt
=
Lemma 7. Suppose K C B C G where G is a x . and B is f.g. If Y l C Aut (K), then N Z ( K ) is generated by elements of infinite order.
Proof. Suppose x E N E ( K ) . Since B is f.g., there exists c E G with lc I = co such that (c)@ (x, B ) exists in G (e.g., by Lemma 3). Now x and c + x induce the same automorphism on K and c + x has infinite order; so we only need to show that C G ( K )is generated by elements of infinite order. Suppose x E CG(K). With c as above, we have c and c + x centralize K, have infinite order, and generate x, so we are done. Lemma 8. Suppose & = "YR, M C Q, and dl= M Xgenerates p g p , ( d , ) in Q. Then, dz= '"wR generates gp,(dZ) in gp,(&).
Proof. A reduced product in gp,(&) can be written as r l r I* r n r n where ri E R - P and each ri is a reduced product in gp,(dl) (with
K. Hickin, A.
152
Maclnryre
n,, r, optional). We have 7ri 6Z P for all i, otherwise the product is not reduced. Hence, this equals a reduced product in g p , ( d ) .
Proof of Theorem 4. We will use t h e notation of Lemma 5. By Lemma 7, N g ( K ) is generated by elements of infinite order. Let x be such an element. We must show that x ENG. Let F = ( a,,a z ,a,) be free on {aI, az,a 7 }and let FK be t h e semidirect product such that a z and x induce on K the same automorphism and aI, a? centralize K . Put
with x = ( Y I ( Y Z ( Y 7 . First, we claim (*)
[
the amalgams d, = (L.,)Ky(x.H) ( i = 1,2,3) generate g p , ( d , ) in g p , ( d ) .
To prove this, use Lemma 8 with M = ( a , ) K . Since ( a , ) K f l ( x ) K = K in FK, we must show that
generates gp,(d',) in FK. Since K is normal, this is so iff a, and x generate ( a , ) * ( x )in F, which is easily checked. Second, we claim there is a group P 3 g p , ( d ) such that aiE N p ( i
=
1,2,3)
and hence x E N p :
To obtain P we first use Lemma 5 to get J 3 H and y E N , such that ly I = w, (y) f l H = 1, H n H Y= 4, and y = ya can be chosen to induce any a E X o n K. Now apply Lemma 6 to obtain
( f , ) fl H = 1, 1, and a, induce t h e same automorphism such that I f, 1 = on K, t, EN,,, and (r,)K3?' generates its free product in J,. Thus, ( ' j K Y H generates its free product in J , , which is, by (*), isomorphic to the subgroup of g p , ( d ) generated by (a>KY under the map H = H and t, * a , . So, we can amalgamate g p , ( d ) with the J, via these isomorRhisms to obtain P = ( J , ,Jz, J,) 3 g p , ( d ) = ( H , F). Finally, by making direct use of the algebraic closedness of G and QJ,
Embeddings and centralizers
153
the existence of P in (**), we can obtain P with ( H , x ) C P C G such that x E Np C NG, as required. To obtain P, use (**) to express x = z1. . * z, where z , E N%~)(Bn(,)). So, the relations {x = f l. . . fe and “2, induces the same automorphism on B,(,)as does z a ” (1 s i s E ) } (which can be taken to be a finite set because each B, is f.g.) are consistent over (H,x). So, there are elements fl,. . . , f, E G which satisfy them and we take
P
= ( H , x, 51,.. . , fc).
Proof of Lemma 1. In the case of centralizers, in the proof of Theorem 4, we have Bi= 1 (1 s i S n), X = 1, and hence K = Bl fl... f l B, and the hypothesis of MIN is not needed in the proof of Lemma 5 in this case, and we obtain Lemma 1. 04. Proof of Theorem 5
In the notation of Theorem 5, suppose x E NG ( A )- A. Put C = CG ( A). In the group with presentation (A,x, (Y : (Y centralizes A ) let f = ( Y - I X ( Y * X - I ( YThen -I. If 1 = 03, 5 E x C p ) and , ( Z ) @ A exists. Since G is a.c., there exists z E G such that ) z1 = CQ, z E x‘, and ( z ) @ A exists in G. We will show that (*)
{
C is generated by elements w such that 1 w I = 03 and ( ( z ) * ( w ) ) @ Aexists in G.
First note that C is generated by elements c such that I c I = Q), (c)@ A exists in G, ( z ) n ( A , c ) =1. To see this, let u E C and choose u so that I u 1 = 00, and ( u ) @ ( A ,z, u ) exists in G. The elements u and v + u satisfy the requirements on c above and generate u. To prove (*), let c be a generator of C mentioned above, let F = (5, be a free group, and put
p)
J = gP*
( ; @ A Q % c J ) ) c =@)@A
(p),
Since ( E ) , and ( z ) all intersect the amalgamated subgroup trivially, we see that ( G ) * ( z ) @ Aand ( p ) * ( z ) $ A exist in J. Since G is a.c., there exist a,p E G such that c = ap and ( ( a ) * ( z ) ) @ Aand ( ( P ) * ( z ) ) @ Aexist in G (because these groups are embeddable in
K . Hickin, A. Maclntyre
154
G). This proves (*) by taking
(Y
and p as values of w, thus generating
c and hence all of C. Now, for every w E C satisfying (*), there exists T E C such that T-’z= ~ w, 17 = 2, and T E C. Thus C is generated by elements of t h e form z T E xc. This concludes t h e proof.
I
85. Proof of Theorem 6 We shall deduce Theorem 6 from the following result due essentially to Belegradek [ l ] who proves it for recursively presented groups. Proposition 3. Suppose A and B are f . g . groups with generating sets X and Y. Let ( X ) and ( Y ) be the free groups on X and Y . B is embeddable in every a.c. group which contains A iff W ( B ) % W ( A ) and there is a function 9 S cW ( A ) whose domain is ( Y ) and whose range consists of certain sequences in ( X ) (which we denote 9 ( y ) = (xb ,..., x; ,... )) such that, for all y E ( Y ) , y E W ( B ) i f f x ; E W ( A )for all n 2 1.
I
By 9 S c W ( A )we mean that the set of pairs { ( y , x;) y E ( Y ) , n 3 1) is enumeration reducible to W ( A ) . The “if” part of Proposition 3 - which we will use - is proved in a manner similar to the proof of B.H. Neumann that every group with a solvable word problem can be embedded into every a.c. group [2: Chapter IV, §8] and the method can also be gleaned from Belegradek’s proof for r.p. A and B ; but one must use the General Higman Embedding Theorem ($1). Let us assume the hypothesis of Theorem 6. We can also assume that A , B C F and hence that W ( B )ScW ( F ) as well as W - ( B )S cW ( F ) because of obvious properties of enumeration reducibility. Let X , P, and Q be finite generating sets of F, A , and B respectively and put Y = P U Q. So, Y generates A wrB. Since A , B C F, we can assume Y C X. Using Proposition 3 we need only find a function 9 s cW ( F ) whose domain is ( Y ) and such that, for all y E (Y), 9 ( y ) = ( x i , . . . ,x;‘’)) (with x ; E (X)) and y E W ( A w r B ) iff x : E W ( F )for all 1 S i S m ( y ) . We will now describe an algorithm for computing 9 given an enumeration of W ( F ) . Suppose y E (Y). Using the fact that A w r B = A B B is a semidirect product we can collect P’s and Q’s in y to obtain
Embeddings and centralizers
y =p;‘.
. . p>w
where
155
pgE P ; w , w , E (0).
As we enumerate W ( F ) we can also enumerate W ( B ) and W - ( B ) since these sets are seW ( F ) . Thus, w will be enumerated either in W ( B )or in W - ( B ) in finitely many steps. In the latter case, we have y FZ W ( A w r B) and we put 9 ( y ) = ( w ) . In the former case, we likewise determine for all 1 c i, j s n that either w,w;I E W ( B ) or E W - ( B ) in finitely many steps. In A wrB, p 7 and p 7 will commute provided w , # w,. So, the above information allows us to rearrange y as follows: y = p ; ’ . . . p > = u y , . . . UZrn where u, E (P), u, E (Q), and, if 1 s i f j s m, then u , # u, (each u, is one of the original w , ) .
Since the factors up,,1 s i =Sm, belong to distinct direct factors of A B in A wrB, we have y E W ( A w r B ) if and only if { u , , ..., u,,,}C W ( A ) , so we put 9 ( y ) = ( u I , .. . , u,,,).
Note added in proof
The paper “Maximal subgroups of a.c. groups” mentioned in 00 has been subrmtted for publication. Theorem 3 is generalized in it to effectively embedded subgroups, and applied to study properties of Ziegler’s construction [ 5 ] .
References [ I ] O.V. Belegradek, On algebraically closed groups, Algebra and Logic 13 (1974) 135-143. [2] R. Lyndon and P. Schupp, Combinatorial group theory (Springer-Verlag, 1977). [31 A. Macintyre, On algebraically closed groups, Annals of Mathematics 96 (1972) 53-97. [4] H. Rogers, Theory of recursive functions and effective computability (McGraw Hill, 1967). [ 5 ] S. Shelah and M. Ziegler, Algebraically closed groups of large cardinality, to appear. [6] M. Ziegler, Algebraisch abgeschlossene Gruppen, to appear.
S.I. Adian, W.W. Boone, G. Higman, eds., Word Problems I1 @ North-Holland Publishing Company (1980) 157-214
SMALL CANCELLATION THEORY OVER GROUPS EQUIPPED WITH AN INTEGER-VALUEDLENGTH FUNCTION Bernard M. HURLEY University of Zam bia, Lusaka, Zambia
01. Introduction
In this paper, small cancellation theory is developed over a class of groups equipped with an integer-valued length function satisfying certain “natural” axioms. Such groups can also be characterised by the fact that they are equipped with a normal form (for elements) satisfying certain other axioms. Since free groups, free products with amalgamation, and HNN groups all possess a normal form of the required type, our theory includes the “classical” small cancellation theories of Lyndon [13], Schupp [19] and Sacerdote and Schupp [18] However, the class of over the above mentioned classes of groups.groups that we shall consider (we call them NFS groups) is much wider than this and is closed under a fairly large number of constructions, as will be shown in 93. As with the “classical” theories, there are three “cases” of our theory. The observant reader will notice that we only apply one of the “cases” in 08, whereas we develop the theory of all three in 05. We make no apology for this. We consider the mathematics of 95 to be interesting mathematics in itself. The applicatiqns of 08 have been selected according to two criteria. The first being that the result in question can be proved without too much extra work. The second being that the result is of interest. The applications include various embedding theorems, a characterization of the solubility of the word problem for groups, and proofs of the residual infinite simplicity and the infinite height (in the sense of Pride [16]) of various groups. It is obvious, from the results of 03 and 97, that many of the theorems of 08 could be followed by a corollary( giving a vast list of groups satisfying the theorem (e.g. we could follow Theorem 8.10 with a corollary to the effect that non-abelian free groups, certain HNN groups, etc.. . are residually infinite simple). Many writers would do this, but we do not see the neccessity for it. 157
B.M. Htcrley
158
Many interesting applications have been left out of 98, in view of their failure to satisfy our first criterion. Perhaps the most interesting being the use of the theory to prove a version of Higman’s Embedding Theorem. In 99 we have given a list of such applications. These, perhaps, should be regarded as “signposts” to things to come. Of course, whether or not one needs signposts depends on who one is. (Anglo-Romany has no word for signpost.) T o t h e reader unfamiliar with small cancellation theory, we recommend Schupp [20] for an incomplete, but clear, introduction to the “classical” theories. Having read this paper, he/she may be surprised that we d o not impose any length restrictions in our Cconditions. Our use of special sequences, in 95, makes this unneccessary . We have tried to strike a balance between formality and informality. If the reader finds t h e construction of the diagram, in $ 5 , too informal, he/she should bear in mind that, written out formally, it would involve a four-fold induction. Notation : We believe that most of our undefined notation is standard. However, we draw to the reader’s attention that, according to context,
n=
[
proof ends here, proof omitted, proof to occur later in text.
It is natural to think of elements of a group that possesses one of our axiomatic normal form structures as “words”, by analogy with the words of a free group. We shall often use the word “word” in this loose sense. When we use it in the “strict” sense of a reduced word on a set S of letters, we shall speak of a “word o n S”, unless the context makes it obvious what we mean. -
82. NFS groups, NFS groups and length functions
In general terms a length function o n a group G is a function 1 g 1 satisfying a “reasonable” set of axioms. Length functions on groups were first studied axiomatically by Lyndon [12]. He imposed five axioms, of which t h e following two seem to capture the “essence” of the notion of a length function. (The numbering is Lyndon’s.):
G + R, g
Small cancellation theory
A4.
159
d ( x , y ) < d ( x , z ) implies d ( y , z ) = d ( x , y ) ,
where d ( x , y ) = f { J x J + l y I - J x y - ’ ( } .Chiswell [4] and Hoare [8] have studied length functions satisfying these two axioms. A length function is normalised if it satisfies: Al’.
IlI=O.
If g I+ 1 g I is a length function G + R satisfying A2 and A4, then the function g llg (1, defined by ((g(1 = ( g1 - 1, is a normalised length function that still satisfies A2 and A4 (see the comment after [8] Proposition 5 ) . Thus there is no loss of generality in considering only normalised length functions. Geometric small cancellation theory was first developed by Lyndon [13] over free groups and free products. Later Schupp [19] developed the theory over free products with amalgamation, and Sacerdote and Schupp [I81 developed it over HNN groups. It is a striking fact that in every case the group under investigation possesses a “natural” length function satisfying Al’, A2 and A4. Moreover this length function is always well-based in the sense of Hurley [lo]. That is, in addition, it is integer-valued and satisfies the extra axiom: N1*.
G
= gp{x E G : I x
I s 1).
Intuitively, the function d ( x , y ) measures the amount of cancellation in the product x y - ’ . Thus, for instance, we say that no cancellation occurs in this product if d ( x , y ) = 0, and that almost no cancellation occurs if d ( x , y ) S 1. However in order to develop a small cancellation theory we need to do more than merely give a numerical value to the cancellation in a product, and, in each of the above mentioned cases, a central role is played by the appropriate Normal Form Theorem, whereas the length function only appears as a way of simplifying some of the “small cancellation” hypotheses. Abstract normal form structures (NF structures) on groups were first studied by Hurley [9], however we shall use the axiomatisation of Hurley [lo]. We can associate a length function in a natural way with each NF structure on a group. This leads us to the Main Theorem of [lo].
Theorem 1.1. A function G -+Z, g H ( g1 on a group is a well-based length function (i.e. it satisfies N1*, Al’, A2 and A4) if and only if it is the associated length function of some NF structure on G. 0
B.M. Hurley
160
We shall now briefly explain our axiomatisation of the normal form. First we need some definitions in which we shall suppose G is a fixed group, X is a fixed subset of G,satisfying 1 bZX, and B is a fixed subgroup of G called the base group. Definition. Let S = S(G,B, X) be the set of all sequences u = (ui);ET', where n 3 0, such that u21+1 E B, for j = 0,. . . , n, and uziE X U X-', for j = 1 , . . . , n. We call n the length of such a sequence, and denote it by u 1. We define the value of the sequence to be 6 = u l u 2 .. un.
I
If u = (0;)E S and for some 1 s j s n - 1, u21uzl+1u21+2 E B, then the sequence r of length n - 2 defined by: 7, =
[
if l s i s 2 j - 2 ~ ~ , - ~ u ~ , u ~ , + 1ifu i2= , 2j + 2-u1Z I + 3 if 2j s i < 2n - 4, u,+4 u#
clearly lies in S and satisfies 7 = 6.We say that r is obtained from u by means of a pinch between u2,and u ~ , + ~ . If u = (a,)E S and for some 1 s j s n, uzlu21+lu21+2 E B(X U X-')B, = bgc for some g E X U X-' and some 6, c E B and then u21u21+1u21+2 the sequence p of length n - 1 defined by:
PI =
I
if u21-lb if g if cu21+3if u , + ~ if u,
lsis2j-2 i = 2j - 1 i = 2j i = 2j + 1 2j+2sis2n-2,
clearly lies in S and satisfies p = 6. We say that p is obtained from u by means of an amalgamation between u2,and u ~ , + ~ . A (G, B,X)-sequence u is reduced if n o pinch or amalgamation can be performed between any two elements of u.The set of all such sequences shall be denoted by RS(G,B , X ) or just RS. Intuitively, RS consists of normal forms for elements of G. If u E RS and g = 5, then we say that u represents g . Clearly if u E S then there exists r E RS such that 7 = 5.It follows that to ensure that every element g E G has a normal form (i.e. that every element of G is represented by some member of RS),we need merely impose the axiom:
N1.
G = g p W , B).
IT
We say that a, 7 E S are parallel, written u 1) T , if, either I u 1 = 1 = 0 and 6 = 7, or 1 u 1 = I T 1 = n > 0 and there exist h, E B for i = 1 , 2 , . . . ,, 2 n such that ul= T 1 h l , u,= h ;!'T,h, for 2 S i S 2n, and
Small cancellation theory
161
IT^,,+^ = h ; , ! ~ ~It~ is + easy ~ . to see that if u 11 T then 5 = .T. The right sort of uniqueness for normal forms turns out to be uniqueness “up to parallelism”. This can be obtained by imposing the axiom:
N2.
If a,T E RS(G, B, X) and 5 = ?, then u I(T.
Definition. A triple ( G , B , X ) ,consisting of a group G, a subgroup B, and a subset X , satisfying 1 ex,that satisfies axioms N1 and N2 is called an NFS group (NFS = normal form structure). Alternatively we say (GI B, X) is an NF structure o n G. Given an NFS group (G, B, X), we define the associated length function in the “obvious” way. It easily follows from N1 and N2 that any two normal forms for (i.e. sequences in RS representing) an where element g E G have the same length, and so we define Ig = ( u ) u is such a normal form. An obvious question to ask is “Can distinct N F structures on a group G give rise to the same length function?”. The answer is “Yes”. Following [lo], we call such structures isometric. If (G, B, X) is an NFS group, then it is trivial to check that (G, B, XI).where Xl = B(X U X - ’ ) B , is also an NFS group and that these structures are isometric. Clearly these structures are the same if and only if .(G,B, X ) satisfies:
I
F.
X
= B ( X UX - ’ ) .
An NFS group satisfying F shall be called a full NFS group or an group, and, correspondingly, we speak of full NF structures or F F structures. We easily see that XI = {x E G : I x I = l}, and it follows that (G,B,X& always full. Moreover, by [lo] lemma 2.3, this is the unique NFS group isometric to (G, B, X ) . Thus F is a sort of normalisation axiom. We shall work entirely with NFS groups; by the usual abuse of language we shall speak of the NFS group G. The reader who wishes to reformulate our results in terms of NFS groups will find it a routine, if somewhat messy, matter. It is easy to see that any u E RS, in an NFS group, is parallel to one of the form (1, xl, 1, xz,. . ., x,, 1) if n 2 1, and is equal to one of the form ( h ) if n = 0. If u has value g E G, then we say that, according to case, g = xlxz * * * x, or g = h, in normal form, and we abbreviate the sequence t o (x,, xz, .. . ,x,) or ( h ) . Finally, we note that there are alternative axiomatisations of the NFS groups. Consider the following set of axioms:
N3.
If u E RS and 5 = 1, then u = (1).
162
B.M. Hurley
N3*.
If u E S and 6 = 1, then either T E S - RS such that u 11 7.
N4.
If g, h, gh E XI but hk 6Z B U XI, then ghk 65 B U XI.
N4*.
If g, h, hk E XI but gh
EB
(T
=
(1) or there exists
U XI, then
ghk
EB
U XI,
where, as before, X1= B(X U X - ' ) B , and so, when F is in force, we may drop the suffix 1 in N4 and N4*. By [lo] Lemma 2.2, N3 N3*, N4 ~3 N4* and N2 e N3 & N4, giving us various axiomatisations of the NFS groups.
-
63. Examples of NFS groups
-
The purpose of this section is to provide examples of NFS groups. It is to be hoped that these examples will help motivate the remainder of the paper. We shall also show that, together with the appropriate homomorphisms, the NFS groups form a category JWW,in which constructions analogous to free product, free product with amalgamation and HNN extension are possible. We 'first show that certain familiar constructions yield NFS groups.
-
Lemma 3.1. In the following cases, (G, B,X ) is an NFS group: (i) G is any group, B is any subgroup of G and X = G - B. (ii) G is the free group freely generated by the set Y,B = 1 and = Y u Y-I. (iii) G = (H * K : B1= B2), a free product with amalgamation, B = BI and X = ( H - B ) U ( K - B). (iv) G = (t, B : t-'At = C ) an HNN extension, and X = B{t, t-'}B.
x
Note on Proof. We feel that the proof of this result may safely be left to the reader. The following remarks may be useful. Parts (ii), (iii) and (iv) are restatements of well known normal form theorems (Britton's Lemma, see [3], in part (iv)). However an alternative way of proving the lemma would be to note that the natural length function satisfies N1*, Al', A2 and A4, in each case, and then to apply Theorem 1.1.
-
Definition. If (G1,B1, XI) and (G,, B2,X , ) are NFS groups, then by an F S homomorphism (p : (G1,B1, XI)+ (G2,B2,X2), we mean a group homomorphism (p : G1+ G2 satisfying Bl = (p-'(B2)and XI = (p-l(X2).
Our next result, whose proof isroutine and therefore omitted, sums up the more trivial properties of NFS homomorphisms.
163
Small cancellation theory
-
-
Lemma 3.2. (i) The NFS groups together with the NFS homomorphisms form a category, composition being composition of mappings. (ii) Let (G2,B2,X2) be an NFS group and let cp : G I+ G2 be a - Let B1= cp-'(B2) and XI = cp-'(Xz). Then (group) homomorphism. (GI,BI,XI) is an N a r o u p if and only if G I = gp(Xl, B1). When this is the case, cp is an NFS homomorphism. (iii) Let (GI,B I ,XI) and (G2,B G 2 ) be NFS groups. Then a homomorphism cp : GI-+ G2 is an NFS homomorphism if and only if Igcp I = l g 1 for all g E G (i.e. i f n d only if cp is length preserving). (iv) Let (GI,BI,XI) be an NFS group and let cp : G I + G2 be a (group-imorphism. Let B2= cp(B,) and X2= q ( X , ) . Then (G2,Bz,X 2 ) is an NFS group if and only if ker(cp)S B1. In this case cp is an NFS homomorphism. 0
-
Problem 1. Let (GI,B I ,XI) be an NFS group and let (o : G I+ Gz be a (group) homomorphism that is not an epimorphism. Under what conditions isitpossible to construct an NFS structure on Gz that makes cp an NFS homomorphism? A necessary condition is ker(cp) 6 B,.
-
-
Notation. We denote the category of NFS groups and NFS , homomorphisms by N 9 Y . We often use the suffix N to indicate that we are working in this category. Thus, for example, to indicate that two NFS groups are isomorphic in N99,we often write G I= N G2 instead of (GI,B,, XI)= (G2,B2,X2).
-
Definition. Suppose (GI,BI,XI) a n d x 2 ,Bz,XZ) are NFS groups and G I sG2. Then we say GI is a sub-NFS-group (a subgroup,) of G2, and write (GI,B1,X s (G2,B2,X2) (or G1SNGz), if the natural injection G I + G2 is an NFS homomorphism (i.e. if B1= G Ifl B2 and XI = G In Xz or, by Lemma 3.2(ii), if the length function on Gz extends that on GI). This relation is clearly transitive. Warning. sNis not the same relation as
<
of (10).
We now come to the free product in N99.It is easy to see that this is a co-product in the category theoretic sense.
Lemma 3.3 ( F r s r o d u c t for 6%groups). Let (GI,B1,XI) and (G2,B2,X2) be NFS groups. Let G = GI*, B = B1* BzS G and X = B(XI U X,)B, then (G, B, X ) is an NFS group. Moreover Gi SNG for i = 1,2.
B.M. Hurley
164
-
Proof. In this proof we deal with four NFS groups: (Gi,B,, X i ) for i = 1,2, (G,B,X ) and G equipped with the free product (f.p.) NF structure. We shall use J y 1 to denote either of the Gilength functions (note that these functions agree on GIfl G2= 1, and so n o ambiguity arises from this), and we put D(x, y ) = ${I x I + 1 y I - 1 xy -'I}. We denote the f.p. length function by IxJo,and we put Do(x,y)=4{~x~,,+lylo-Ixy-1~o~.
For all g E G define:
It is immediate that B = {x E G : J x1 = 0}, X = {x E G : ( x I = 1) and that the function g I+ J g1 satisfies N1*,Al' and A2. So, by Theorem 2.1 andthe remarks following the statement of axiom F in 42, ( G , B , X )is an NFS group if and only if this length function satisfies A4. We check A4 in its equivalent form: d(x, y ) 3 rn and d(x, z ) 3 rn implies d(y, z ) 3 rn. Where, as usual, d(x, y ) = t{lx 1 + Iy 1 - Ixy-'I}, and where rn is a real number. Suppose then that we have rn € R and x, y, z E G satisfying d ( x , y ) 3 rn and d(x, z ) a rn. If Do(x, y ) < Do(x,z ) then, by A4 applied to the f.p. length function, we see Do(y,z ) = Do(x,y). If this number is an integer then we see x = xIu, y = ylu and z = zIu,for some x l , y,, z I ,u E G, where n o cancellation occurs with respect to the f.p. structure in any of these products, neither does cancellation occur in the products: y,z;' and xly;' (note that we take the statement "No cancellation occurs in the product xy" to mean that no cancellation or amalgamation occurs in this product; see the remarks following the statement of axiom N1* in $2). We now see that d(y, z ) = I u I = d(x, y ) 3 rn, as required. If Do(y, z ) is half an odd integer, then we see that x = xlx2u, y = yly2u and z = zlz2ufor some xI,x 2 , yl, y2,zl, z2, u E G, no cancellation occurring with respect to the f.p. structure, where xz, y2,z2,x2y;l and y2z;l are all 'not equal to 1 and all lie in the same factor of the free product. We now see: d(x,y) = t u x I + I Y
I-
I} = t{lXIX2U I + J Y I Y 2 U I - Ixl(X2Y;')y;'I) = t { l X l I + IIx2ll+ I I + I IlY2ll+ I A ! I - 1x11 - l b 2 Y ;'It = I U I + D(x2, y2). IXY
Yl1+
- I Y 211
165
Small cancellation theory
I I
Similarly, d ( y , z ) = u + D ( y 2 , zz) and d ( x , z ) s 1 u 1 + D(x2,z2), with an inequality in the last case because xZz;I = 1. Thus we see that D(x2,y 2 ) 2 m - 1 u 1 and D(xz,z2) 2 m - I u 1. But we are working with the.length function associated with one of the G , and this satisfies A4. So we see that D ( y z , zz) 3 m - I u 1, and hence d ( y , z ) = I u I + D ( y 2 , z2)3 m , as required. A similar calculation works if D o ( x , y ) > Do(x,z), and so we may suppose D o ( x , y ) = D o ( x , z). If this number is an integer, then it is obvious that d ( y , z ) 2 d ( x , y ) = d ( x , z ) 3 m . If Do(x,y ) is half an odd integer, then we can find x , , x z , y l , y 2 , z l , z 2 ,u E G such that x = x l x z u , y = y l y z u , z = zlzzu, no cancellation occurring w.r.t. the f.p. structure, such that x z , y 2 and z 2 all lie in the same free factor of G, and such that d ( x , y ) = 1 u 1 + D(xz,y z ) and d ( x , z ) = 1 u I + D ( x z , z2). Applying A4 in this free factor, we see d ( y , z ) 3 I u 1 + D ( y z , zz) 3 m , as required. It is immediate from our definitions that Gi SNG for i = 1,2. 0
,
Notation. In the situation of Lemma 3.3, we write:
(G,B, X ) = (GI,BI, Xi) * (G2,Bz, Xz) = GI *N G2.
-
The product, unlike the co-product, of two NFS groups is not always what one might expect it to be. For example: (C3,1,C3- 1 ) X ( C z , L C z - 1)=(1,1,0).
-
Moreover the product of two NFS groups does not always exist. We leave it to the reader to verify that (S,,, 1, S 6 - 1) and (C2x C3,1, Czx C3- 1) have no product in N 9 Y . We have used C, and S. to denote, respectively, the cyclic group of order n and the symmetric group on n letters. Our next result, which is a trivial consequence of Lemma 3&), can be used to show that the product of (H, H,O) with any other NFS group always exists and has the “expected” form.
-
Lemma 3.4. If (G,B,X ) is an NFS group and H is =group, then G x , H = ( G x H, B x H, ( B x H ) X ( B X H ) ) is an NFS group. Moreover G SNG X , H. 0 Definition. Let ( G , B , X )be an NFS group. Then we say that H 6 , G is a tidy subgroup, of G if whenever x , y E X and x y E H U X , then x , y E H (i.e., H is tidy if it is impossible to amalgamate into it).
-
Lemma 3.5 (HNN construction for NFS groups). Ler cp : ( K , C, Y)+ (L,D, Z ) be an NFS isomorphism between tidy
B.M. Hurley
166
subgroupsN of (Go,Bo,Xo). Let G = (Go,r : t-’Kt = Krp), B = gp(Bo, T ) and X = BXoB. Then (G,B, X ) is an NFS group. Moreover Go %G.
Proof. Axioms N1 and F are obviously satisfied by (G, B , X ) . If x E X then we can write x = bfc where f E Xo, and b, c are words in t and the generators of B,. If xl, xz, . . .,x, E X satisfy r 3 1 and xlxz * * x, = 1 then, to prove that G satisfies N3*,we must show that this sequence is parallel to a non-reduced sequence. We d o this by induction on the number of r e that occur in any of the words x, after as many as possible have been “pinched” out, using Britton’s Lemma. If no t ‘ OCCUE in any of the x,, then x, E X o for all i. Applying N3* in Go,we see that the sequence is non-reduced in Go and so a fortiori, it is non-reduced in G. If t‘ do occur in some of the x,, then write x, = b,f,c,,as above, and consider the equation: blf1CibzfzCz. . * 6 3 , = ~ 1.~ (A) Now our calculations are taking place within an HNN group. So, by Britton’s Lemma, a “pinch” can be performed between two t-symbols in the word on the left-hand side of (A). But, by hypothesis, no such “pinch” can be performed inside any of the x,, and this leaves US with two cases to consider.
-
Case Z. For some i, a “pinch” can be performed between some t’ in c, and r-‘ in b,+,. Suppose E = 1 (the other case is similar). We can write x, = f , t c and x , + = ~ b ~ - ’ f , +where ~ , b, c E B and bc E L. Thus bc E 0, and since rp is an NFS homomorphism, there exists d E C d Bo such that d = tcbr-’. So the sequence (xl,. . ., X , - ~ , X , , dx,+’,.. . ,x,) has product 1, is parallel t o the original sequence, but contains less t-symbols. By induction, this sequence is parallel to a non-reduced sequence, and hence so is the original sequence. Case ZZ. A “pinch” can be performed around some product of the , and ~ , b , =+ fr-’f; ~ form ef,c,bt+lf,+l b,f,f, where we have ~ , - ~=bPt”e and where, e, 5,f, f E B and i < j (here we write co = 1 = b,+,, if necessary). Suppose E = 1 (the other case is similar). The word around which we are pinching lies in L and contains no t-symbols. Since L C , Go,we may suppose e = C, = b,+l= * * = c , - ~= b, = f = 1, and xk E 2 for k = i , . . . ,j. Thus in G:
---
(xl, xz,. . . ,xr)[[(xl,. . . , x,-~,x,-le, tx,,
. . ,x,r - 1 ,fx,+’,
[[(xl,. . . ,rx,t-’, t ~ , + ~ t -.’.,. ,tx,r-’, = (Xl,.
. . ,(x,)rp-l,(x,+l)rp-l,.
x,+z,. . . . ,x,)
. . .,x,)
. .,(x,)rp-l,.
. .,xr).
167
Small cancellation theory
Since this last sequence contains less t-symbols than the original, by induction, it and the original are parallel to a non-reduced sequence. To check N4, suppose g , h, k , gh E X but hk B U X . Conjugating by an element of B if necessary we may suppose g = g l b where g l E X o and b E B. It follows from gh E X that h can be written h = b-'h,c where c E B and h , E Xo. Writing k = dklf with d, f E B and klE Xo, we see that ghk = g l h l ( c d ) k l f .If ghk E B U X then it must be possible to eliminate, by pinching round k l and g l h l all the t' in cd. Note that the assumption that K and L are tidy ensures that, if it is possible t o pinch around the product g l h l , then it is possible to pinch around g l and h l individually. In this process, we may have to introduce t-symbols to the left of the product g l h l . Eliminating all the t' from cd, in this way, we obtain el(g2h2)kzfl where e l ,f l E B and g,, hz, kZ E X o . At this stage, we have (gzh2)k2E Bo U X o . Now if h2k2bf Bo U X o then N4* applied to (Go,Ho,X o ) would give g z h z k z 6Z Bo U X o and hence h z k 2 E Bo U X o . But then hk = b-'hlcdklf = b-1elh2k2fl E B U X , contrary to assumption. Hence we have ghk 6Z B U X and N 4 is proved. It is now easy to see that Go S N G. 0 Notation. On the situation of Lemma 3.5, we write:
(G,B, X ) = ((G,B,X ) , t : t-'(K, C, L)r = (L,0, X)) = N$Y(G, t C ' K t " L ) .
-
Lemma 3.6 (Free product with amalgamation for NFS groups). For i = 1,2 suppose (Hi,Ki, Y , ) is a tidy subgroupN of (Gi,Bi,X i ) . Suppose that H 1sN H2. Let G = ( G 1* G z: H , = H2), let B be the subgroup of G generated by the B,, and let X = B ( X , U X2)B. Then (G,B, X ) is an NFS group. Moreover Gi cN G for i = 1,2.
Proof. Let
(d2,B2,xz) be an =-isomorphic
(E, C, Z ) = N$Y(Gi
*N
copy of G 2 .Set:
d2, t : t-IHit = N R ~ ) .
Identifying G2with rdzt-', we may consider G to be a subgroup of E. It is routine to check that -B = G fl C and X = G fl 2. Now by Lemma 3.2(ii), ( G , B , X ) is an NFS group. It is easy to see that Gi S N G for i = 1,2. 0 Notation. In the situation of Lemma 3.6, we write:
( G , B , X ) = ( ( G I , B , , X , ) * ( G z , B z , X , ) (: H I , K I Y, I ) = N ( H ~ , KY2)) z,
N$Y( G 1 * GZ :H =N H ) .
B.M. Hurley
168
It easily follows from the next result, whose routine proof shall be iterate the HNN extension and free product with omitted, that we can amalgamation for NFS groups to obtain (with an obvious extension of notation): and:
N%Y(G, r,, ( p E M ) : t;'K,t,, =NK,,v,, ( p E M ) ) ,
Lemma 3.7. Let 2 be a n o n - e m m c h a i n (with partial order be a family of NFS groups satisfying: {(G, B, u
Then
S).
Let
+ G, SNG,.
urEz G, = (UG, UB, UX,) is an NFS group. Moreover: 7
N
G, S
N
U G,
oez
for all T E 2. 0
Suppose that B is a group generated by a non-empty family {Bp},,EM of subgroups. Suppose that {G,,},EM is a family of groups such that, for each p EM, G,, contains a subgroup B, isomorphic to Be. We define the structure-product of the G, over B to be:
If M = {1,2} and B = B1x B2, then the structure product of G I and G2 over B is easily seen to be the free product with commuting subgroups Bl and B2,which is denoted by (GI* G 2 :[B1, B2]). For the usual definition of this, and the other two products below, see Magnus, Karrass and Solitar [14] pp. 220-222. For our purposes, we could take these products to be defined in terms of the structure product. If Hi ( i = 1,2) are groups containing subgroups Ci ( i = 1,2), and we set G I= HIX C2,and G 2= H2x CI, let Bi ( i = 1,2) be the obvious copy of C1x C2,in Gi, and let B = Bi ( i = 1,2), then the structure product of G I and G 2 over B is the free product of HI and H2with centralised subgroups C1and C2 (in this last example, the structure product we take is, of course, just a free product with amalgamation) and is denoted by (HI* H2: [HI,C2],[H,, C , ] ) .We define H @ G = ( H * G : [ H ,Z(G)], [G, Z ( H ) ] ) ,and call it the free product of H and G with centralised centres. The properties of this product are studied in [14] Example 36 p. 223. The most important being H @ G = G @ H, K @ ( H @ G ) = ( K @ H ) @ G , and Z ( H @ G ) = Z ( H ) x Z(G). We
Small cancellation theory
169
can clearly generalise this product to the free product with centralised centres of an infinite family { G , } p E M of groups. However there are two ways of doing this depending on whether one takes an unrestricted or a restricted direct product of the Z ( G , ) at the appropriate point in the construction. We choose the restricted product of the Z ( G , ) , and denote the product of the G, by B F E M G ,It. is easy to see that Z ( @ G , ) = XZ(G,,), where, of course, the product on the right-hand side of this equation is a restricted direct product. Now observe that all these products have been defined in terms of the free product with amalgamation, and the (restricted) direct product. From Lemmas 3.4 and 3.6, the first of these constructions is always possible in the category N99,while the direct product is possible (in a reasonable sense) in certain cases. It follows that, in certain cases, these products are also possible in N99. We sum up the situation with a series of Lemmas, the proofs of which are easy to construct using the results of this section and are therefore omitted. Note that when working “in N99”,we must replace each group theoretic concept with the corresponding conceptN in N99.
Lemma 3.8. The structure product construction works in N99. 0 As would be expected, we denote the structure product in N99 by STRN(G,,B). It is easy to see that, if we can construct a structure product of a family {G,}aEMover B in the category of groups, then, assuming the NF structures (G,, B,, G,, - B,) and (B, B, 0) on the groups concerned, we can make a corresponding construction in N99. If we work out the resulting structure on STR(G,,B), we obtain the next result.
NF
Lemma 3.9. (sTR(G,,B),B, B(U,,~(G, - B , ) ) B ) is an group. 0
NF
NFs
NF
We call the structure of Lemma 3.9 the natural structure on the structure product. The next result is obtained by combining structure product with direct product in N9.9’. We cannot expect a completely general result because direct products are awkward in N99.
-
Lemma 3.10. Let Gi ( i = 1,2) be NES groups, and suppose Hi ( i = 1,2) has the FF structure (Hi,Hi,0). If Hi S,., Gi ( i = 1,2), then both the free product of the Gi, with the Hi commuting, and the free product of the Gi, with the Hi centralised can be constructed in N99. 0
170
B.M. Hurley
NF
Starting with the structures (Gi,Hi, Gi - H i ) , and working out the resulting structure on the two types of product, we see:
Lemma 3.11. Suppose that either G = (GI* G 2 :[HI,H,]), or G (GI* G2: [ H I ,G2],[ H 2 ,GI]), then ( G , H i x H 2 , ( H 1 x
-
H2)((Gi-iii)U(G2-Hz))(HIX
=
H2))
is an NFS group. 0
NF
We call the structures of Lemma 3.11, the natural NF structures on t h e respective products. We remark that the idea of defining a normal form on a free product with commuting subgroups, that is naturally related to its structure, is not new. For instance, the normal form constructed in a free product with commuting subgroups in [14] Example 35 p. 223 gives rise to an N F structure whose corresponding N F structure is the one given in the lemma. Moreover, a very similar type of normal form has been used by B. Baumslag [l]. Combined with t h e Conjugacy Theorem (Theorem 4.2), structures can be used to tackle the conjugacy problem. Hurley (unpublished) has used this approach to find neccessary and sufficient conditions, on groups Gi ( i = 1,2) and their subgroups Hi (i = 1,2), for G = (GI* G z :[HI,H2]) to have soluble conjugacy problem. The result of Hurwitz [ l l ] , which states that if the Gi are free and the Hi are finitely generated then G has soluble conjugacy problem, follows easily from the former result. Since t h e free product (of two factors) with centralised centres is a special case of the free product with centralised subgroups, we can apply Lemma 3.10, and then iterate the construction, using Lemma 3.7, to obtain our next result. Since we will use this construction explicitly in $8, we have calculated the resultant structure.
NF
NF
-
Lemma 3.12. Let {(G,,B,,X,)}pEMbe a family of NFS groups. Suppose that, for all p E M, Z ( G , ) s B,, then the free product of the 6, with centralised centres can be constructed in N . 9 9 , and shall be denoted by @,EM W )G ~ * Explicitly, we have :
0 G p = ( @ G w ,u
u
u G - B d ) ( u BJ). 0 We close this section by remarking that, although the NF structures
P E M W
,EM
pEM
B”(
,EM
&)(
*EM
PEM
of Lemmas 3.9, 3.11 and 3.12 look complicated, this is, in part, the fault of the notation. A little thought reveals that structures are entirely natural, and almost “what one might expect”.
171
Small cancellation theory
-
94. NFS groups: The Torsion Theorem,
The Conjugacy Theorem and The Centre
In this section (G, B, X) is a fixed NFS-group and g H ( gI shall be its length function. We say that g E G is cyclically reduced (c.r.) if ( g 1 s Ix-lgx I + 1 for all x E G and is strictly cyclically reduced (s.c.r.) if I g l s l x - ' g x l for all x E G. Lemma 4.1. (i) g E G is c.r. if and only if either g E B or g can be written g = x l * x , ( r 3 1) in normal form where if r 3 2 then (XJ, 1 # 0. (ii) g E G is s.c.r. i f and only i f either g E B or g can be written g = xl * * x, ( r 3 1) in normal form where if r 3 2 then I x,xI1 = 2 and i f r = l then I t - ' g t ( 2 1 f o r a l l t E G . (iii) If g is c.r. but not s.c.r. then there exists t E X such that tgt-' is s.c.r. 1
Proof. (i) Suppose g E G is not c.r., then I g 13 2 and there exists k E G such that I kgk-'I S 1 g I - 2. By the corollary to [lo] lemma 3.3, k k2 B. Thus we can write g = x l * x, ( r 5 2) and k = cI . . c, (s 3 1) both in normal form. If 1 call# 0 then by [lo] lemma 2.1 (i) we have I kg I 3 r + s - 1 and so by [lo] lemma 3.3, we have 1 kgk-I 1 3 I(kg1 - 1 k 1 ) = r - 1 contradicting 1 kgk-'( s r -2. Thus 1caI( = 0 and similarly 1 XF ;' 1 = 0. Hence I x,xl I = 0. Conversely, if g = x1 * x, ( r 3 2) in normal form and 1 x,xl 1 = 0, then g is clearly not c.r. (ii) Suppose g E G is not s.c.r., then Ig 15 1 and there exists k E G such that 1 kgk-'1 C Ig 1 - 1. If J g1 = 1 then there is nothing to prove, so suppose 1 g I b 2. If g is not c.r. then from (i), Jx,xl1 = 0. If g is c.r. then by (iii) there exists t E X such that tgt-' is s.c.r. Thus either 1 t x l 1 = 0 and I x , t - ' ) = 1 or J t x l J= 1 and ) x , t - ' ( = 0. In either case (x,xl(= 1. Conversely if g = x l . . * x, ( r 3 1) in normal form and either r = 1 and I t-lgt 1 = 0 for some t E X or r 5 2 and 1 x,xl I s 1, then g is clearly not s.c.r. (iii) Suppose g E G is c.r. but not s.c.r., then ) g 12 1 and there exists k E G such that I kgk-')= Ig I - 1. Since 1 k 13 1, we can write g = X I * * * x, ( r 3 1) and k = cl * * c, (s b 1) both in normal form and consider the reduction of the product c1c2;* c a l x 2 . XF;' * . . c;'c;' to normal form. First suppose r = 1, then g = x1E X. If (c,gc;'l b 1, then, by lemma 2.l(i) of [lo], we see 1 kgk-'I b 2 )k I - 1 2 1, contradicting our choice of k. Thus we have 1 c,gc;'I = 0 = ( gI - 1 with c, E X as required. Now suppose r 3 2 . If / c a l l = Ix,c;'l= 0 then (c.gc;') = 1 g I- 2 contradicting our assumption on g. If either I c a I1 = 2 or 9
-
-
-
-
-
B.M. Hurley
172
(x,c;'J = 2, then 1 kgk-'( 3 I 1 kg 1 - 1 k - ' ( 1 = g contradicting our assumption o n k. If (c,xI(= lx,c;'( = 1, then by [lo] lemma 2.l(i) we see I kgk - I I = 2 1 k I + 1 g I - 2 2 1 g 1 contradicting our assumptions on k. Thus one of the pair 1 c,x,(, (x,c;'l is 0 and the other is 1. Hence c,gc;'( = ( g 1 - 1 and 1 cgc;') is s.c.r. and c, E X as required. 0
I
-
Theorem 4.2 (The Conjugacy Theorem for NFS Groups). Let (G, B, X ) be an NFS group. Suppose g E G is s.c.~.and suppose g g' where g' = p l p 2 p,(r 3 2) in normal form with (p,pI1 = 2, then g can be obtained from g' by cyclically permuting the elements p , , p2, . . . ,p, and then conjugating by an elemgnt of B.
-
Proof. From Lemma 4.1, g' is s.c.r. and hence ( g 1 = Jg'l= r. Since g g', there exists w E G such that g = wg'w-'. If 1 w 1 = 0, then w E B and we are done so suppose 1 w I = s 3 1. We can write w = c l c 2 .. c, in normal form. If 1 cspll = ( p ~ ;=~1,) then putting gl = p,c;', h , = cspl, k l = p;', we see g , , h,, k l E X , h , k , = c, E X but g l h , = p,pl fZ B U X. So by N4*,in the presence of F, we conclude p, = glh,kl fZ B U X , contradicting p, E X. Thus at most one of lcspl(and Ip,c;'l is equal to 1. Suppose (cspl(= 1. If Ip,c;') = 0 , then )c,g'c;'( = ( g ' l - 1 contradicting g' being s.c.r. If lp,c;'( = 2, then, by [lo] lemma 2.1(i), we have ( g 1 = I wg'w-'I = 21 w I + (g'l- 1 > lg'l contradicting ( g l = ( g ' l . Thus Icspll f 1 and similarly (p,c;'(# 1. Since ( c s p l= J (p,c;'( = 0 contradicts g' being s.c.r. and (c,pl(= Jp,c;'J = 2 , is 0 and contradicts J g J= Ig'(, we see that one of the pair ( c s p l ((p,c;'( the other is 2. Suppose (cspl(= 2, the other case is similar. Putting g " = p , p l - * * p , -and l w ' = wp;', we see g = w'g"w'-' and ( w ' ( = ( w ( - l . By induction g can be obtained from g" by cyclically permuting p o p l , . . . ,p,-, and then conjugating by an element of B. 0
-
-
Corollary 4.3. Let k E G be s . c . ~ .with 1 k I z=2 and let g E G be any conjugate of k , then there exist w, k ' E G such that g = wk ' w - ' with no cancellation occurring in this product and such that: (a) If I g 1 = I k 1 (mod 2) then k' is a s.c.1. conjugate of k and hence can be obtained from k as in Theorem 4.2. (b) If 1 g 1 f k 1 (mod 2) then k ' is a C.T. conjugate of k that is not s . c . ~ .and hence, by Lemma 4.l(iii) can be obtained from some s.c.~. conjugate of k by conjugating by an element of X .
I
Proof. Let w be an element of maximal length such that g = wk'w-' for some k ' E G with no cancellation occurring. If k ' is not c.r., then ~ ~that = 2k ' ,= x l . . . x , ( r * 2 ) in by Lemma 4.1(i), and ~ k ' ~ ~ = ~wek see
Small cancellation theory
173
normal form with /x,xlI = 0. This contradicts the maximality of the l . Ik'(= length of ( w l . Since k ' - k , w e s e e ( k J < J k ' J s k + But l g / - 2 ) w J = l g I ( m o d 2 ) .So that if ( g I = ( k I ( m o d 2 ) ,then ( k l = l k ' J and k' is s.c.r., while if 1 g 1 f 1 k I (mod 2 ) , then 1 k 1 + 1 = I k'l and k' is c.r. but not s.c.r. 0 The next result is an immediate corollary of the last.
Corollary 4.4. Let k E G be s.c.~.with 1 k 122 and let g E G be any conjugate of k , then there exists w E G such that g = wkw-' and 1 w I s KIg 1 + I k I - 1). 0 No analog of this result holds if 1 k 1 = 0. Consider an HNN extension of a direct product of a countably infinite number of copies of Z by an element t that has the effect:
t-'(n,, n2,n3,.. . )t = (0,n l , n2,n3,. . . ).
-
If we equip this group with the usual (Britton's Lemma) NFS structure, then we see that for any choice of k # 1 with I k 1 = 0 , and for any integer N, there exists a conjugate g of k with Ig 1 = 0, but such that wkw-' = g implies 1 w 13 N. The situation for 1 k I = 1 is summed up in the next result. Its short proof shall be omitted.
Lemma 4.5. Let k E G be s.c.~.with 1 k 1 = 1 and let g E G be any conjugate of k , then there exists w E G wuch that g = wkw-' and 1 w 1 S f{1 g 1 + 1). Moreover one of the following two cases obtains. (a) l g 1 is odd and no cancellation occurs in wkw-'. (b) W e can write w = wla, no cancellation occurring, such that 1 a 1 = 1, I ak 1, 1 ka-' 1 3 1, I aka-' 1 = 1 or 2 according as 1 g 1 is odd or even, and, putting u = aka-', no cancellation occurs in wluw;'. 0
-
Theorem - 4.6 (The Torsion Theorem for NFS groups). Let (G, B, X ) be an NFS group. Then every element of finite order in G is a conjugate of an element of finite order in B U X. Proof. Let g be an s.c.r. element of G. It follows from Lemma 4.l(ii) g have t h a t , i f g G B U X , t h e n ( g " J = n J g J , f o r a l l n b l , a n d s omust infinite order. Thus every element of finite order is conjugate t o an s.c.r. element, which, having finite order itself, must lie in B UX. 0
-
Before we study the centre of an NFS group we need a new concept.
B.M. Hurley
174
Definition. Let B be a group and let h E B. By an nth root of the inner automorphism x H hxh-' of B we mean an automorphism (p of B satisfying:
bp = b
and xqo" = bxb-', for all x E B.
By a root inner automorphism of B we mean an automorphism that is an nth root of some inner automorphism for some n 2 1. Remarks. Plenty examples of root inner automorphisms can be given. Inner automorphisms and automorphisms of finite order are root inner. If B is the free group freely generated by the set {a, b, c } , then the automorphism of B that has the effect:
a H bc-la-', b H b
and c H aca-'
is root inner (it is a square root of x H bxb-'), but is neither inner nor of finite order. It should also be noted that, if cp is an nth root of x H bxb-' and an m th root of x H cxc-', then it is a (pn + qm)th root of x H (bPcq)x(bPcq)-'. It follows that, if cp is root inner, there exists n 2 1, such that the integral multiples of n are precisely the numbers m for which (p is an rnth root. If qo is an nth root of an inner automorphism I, where n 2 1, but is not an mth root for any m satisfying 1< m < n, then we call (p a primitive nth root of I. The remainder of this section shall be devoted to the proof of our next result.
-
Theorem 4.7. Let (G,B, X ) be an NFS group. Then one, and only one, of the following three cases obtains. (i) Z ( G )G B. (ii) Z ( G ) C B, and X = G - B. (iii) G is the split extension of B by an infinite cyclic group ( t ) , say, such that theautomorphism x H t-'xt of B is root inner, and G has the usual HNN NFS structure (G,B, B{t, t-'}B). Moreover, in this case:
Z ( G )= 0%)x ( Z ( B )n CB( t ) ) , where b E B, n 3 1 and t is a primitive nth root of x
Before we start on
I+
bxb-'.
the proof of Theorem 4.7, we note that if the
NFS group (G,B, X ) has a factor or an extensionN of the form - -groupN (G,8,G - 8 ) or of the form (G,B, B{t, t-'}8),where d is a split
extension of B by ( t ) , then G itself has one of these two forms. - This implies that taking preimagesN or extensionsN of non-trivial NFS groups
Small cancellation theory
175
preserves the property Z ( G )C B. We prove t h etheorem via a number of lemmas, in all of which, ( G , B , X )is a fixed NFS group.
Lemma 4.8. If x, y E G satisfy x y = yx, then either (i) there exists t E G such that: / txt-ll = 1 tyt-'( = 1, or (ii) there exist g, w E G, h, h' E B and j , k E Z such that:
x
=
ghg-lw',
y = gh'g-'w* and ghg-', gh'g-' and w commute in pairs.
Proof. We first show: (*) If x, y E G commute, E = ? 1 and 77 = 0 or f 1 then the pair (x, y ) satisfies (i) or (ii) if and only is the pair ( x e ,x " y ) does. By the symmetry of the situation and the form of (i) and (ii), it is
sufficient to prove (*) only in one direction and only in t h e case E = 77 = 1. Suppose ( x , y ) satisfies (i). Then 1 txt-'( = 1 tyt-'1 = 1, for some t E G. If t x y r - ' / = 0 then (x, x y ) satisfies (ii) while if 1 txyt-'/ = 1 then it satisfies (i). Suppose then that 1 txyt-'1 = ( t y x t - ' (= 2. Applying N3 to the equation ( t x t - ' ) ( t y t - ' ) ( t x t - ' ) - ' ( t y t - ' ) - ' = 1, we see tyx-'t-' = h E B for some h and t-'ht = yx-' commutes with x. Since xy = r-'htx2, we see that ( x ,xy) satisfies (ii). If ( x , y ) satisfies (ii) then trivially (x, x y ) also satisfies (ii). Thus (*) is proved. We now prove the lemma by induction on 1 x I + 1 y I. It is trivial if Ix I + I y 1 s 2 so suppose I x I + 1 y I > 2. By symmetry we may suppose ( x 1 C I y 1. Again the lemma is trivial if 1 x 1 = 0. If 1 x I = 1, then, by our assumptions, we can write y = c I c 2 .. * c, (s 2 2 ) in normal form. If (xcll= 0, then Ixy 1 = 1 y I - 2 so by induction (i) or (ii) holds for ( x ,x y ) and thus by (*), (i) or (ii) holds for ( x , y ) . If 1 xcI 1 = 1, then ( x c I ) c 2 . c, and c1c2* ( G X ) are both normal forms for x y so by N3, c-'xcI E B and hence (ii) holds for ( x , y ) . If I xcI 1 = 2, then xclcz * * c, and L1c2 . CJ are both normal forms for x y and so by N3, ( x - ' c I1 = 0. But then (i) or (ii) holds for ( x - ' , y ) and so by (*), (i) or (ii) holds for ( x , y ) . If 1 x 1 2 2 , then we can write x = c 1 c 2 * * - c( ,r 2 2 ) and y = d l d 2 . . . d , (s 2 r ) both in normal form. Since xyx-'y-' = 1, either (c,d;'( = 0 or else 1 c;'d;'J = 0. By (*), we may assume 1 c,d;'( = 0. If 1 c,cl(< 2, then I c j c ; ' ) < x and I c,yc;'( C y, thus by induction (i) or (ii) holds for the pair ( c j c ; ' , c,yc;') and thus (i) or (ii) holds for (x, y ) . If 1 c,cl 1 = 2, then no cancellation occurs in either product xy or yx. Comparing normal forms (i.e. using N2) in the equation x y = yx, we see ( x - l y 1 = s - r. By induction (i) or (ii) holds for the pair ( x , x - ' y ) and so by (*) holds for the pair ( x , y ) . 0
I
-
- -
B.M. Hurley
176
We define a pseudo-proper power (p.p.p.) to be an element of the form: x - ' h x w k where h E B ; x, w E G ; k 3 2 and x-'hx and w commute.
or 1 x " I = I n I for all n E 2. (ii) Suppose I x 1, 1 y 1, I x y 1 3 1 , but are not all equal to 1, xy = yx and x is s.c.r., then y and x y are s.c.r. and there exist w E G, h, h ' E B and j , k E Z such that: Lemma 4.9. (i) If 1 x 1 = 1 , then either ( x )C B U X
x = hw',
y = h'w',
w is s.c.r. with 1 w z J= 2 )w 1, and h, h' and w commute in pairs. (iii) Let x E X be s.c.r., then either or :
CG(X)CBu x , CG( x ) = ( x ) x ( B n CG( x )) and
1 x 2I = 2.
Proof. (i) If ( x ) B U X, then there exists r 3 1 such that x'+' 6Z B U X. Let r be chosen to be as small as possible. If E B, then write b = x'-'. N o cancellation occurs in (bx)x, and so no cancellation can occur in xx. Thus r = 1. If x'-' E X, then writing g = x-'+', h = x and k = X ' and applying N4, we see that xx = ghk 6Z B U X, and again r = 1. By the minimality of r, there are n o other cases and it is now easy to see that no cancellation occurs in any product of the form xxx . . . x. Thus 1 X " 1 = 1 n 1 1 x 1 for all n E Z. (ii) We first prove: (**) Suppose I u 1, 1 u 1, 1 uu 1 3 1, but are not all equal to 1, and uu = uu, then u is s.c.r. (j u is s.c.r. @ uu is s.c.r. We first note that, by symmetry, we need only prove that if u is s.c.r. then so is uu. Suppose then that u, u and uu satisfy the hypotheses of (**), and that u is s.c.r. If 1 uu 122, then the first and last letters of uu are the same as, respectively, the first and last letters of u. This implies that uu is s.c.r., so suppose 1 uu 1 = 1. From [lo] lemma 2.4 and our assumptions on u and u, the following cases can occur. Cpse I. u = u-'uI, where 1 u l ( = 1, and where no cancellation occurs in this product. Since uu = uu, we have that u-'u-'ul = u - ' u l u - ' , and no cancellation occurs on either side of this equation because u is s.c.r. Comparing normal forms for u-'ul and ulu-', we easily see that there exists b E B and r 3 1 such that u = ui'b, u = u;+'b-', where ulb = bul. We easily see that uu = uI, and that 1 u : l = 2. If uu is not s.c.r., then, XI-'
Small cancellation theory
177
by Lemma 4.l(iii), there exists t E X such that t - ' u l t E B. It follows that t-'u:f E B, from which we conclude that there exist c, d E B satisfying u1 = fc = d t - ' . But now u: = dc E B, contradicting u:l = 2. Hence uu is s.c.r. Case ZZ. u = uIu-', where lull = 1, and where no cancellation occurs in this product. This case is similar to Case I. Case 111. u = u l z - ' , u = zul, where no cancellation occurs in these products, and where I ul( = lull = I u l u l \= 1. Since we cannot have I u I = I u I = ( u u l = 1, we conclude that I z ( 3 l . Now uu = uu, and so Cases I, I1 and I11 can occur with the roles of u and u reversed. We may assume Case I11 obtains, for otherwise we could argue as before to obtain that uv = uu is s.c.r. Now we easily see that there must exist f, g E G such that u = ulfu;', u = u;'gul, with no cancellation occurring in these products. We now see that u - ' u = uu-', with no cancellation occurring because u is s.c.r. W e can also see that u I = I z 1 + 1 = I u 1, and so, comparing normal forms, u = u-'b for some b E B f l CG( u ) . But now uu = b E B, contradicting 1 uu 1 = 1. Thus this final case cannot occur. Thus (**) is proved. Now suppose that x and y satisfy the hypotheses of (ii). By.(**), x, y and x y are s.c.r., and so, by symmetry, we may suppose that ( x 13 2, and that either 1 y 1 3 2, or 1 y 1 = I xy 1 = 1. By Lemma 6.2, there exist g, w E G,'h,h ' E B and j , k E Z, satisfying x = ghg-lw' and y = gh'g-'w', where ghg-', gh'g-' and w commute in pairs. We assert that w' 6Z B, for, if not, then we see that x = g ( h g - ' w ' ) = (w'gh)g-', both products being normal forms, and hence x = gh"g-' for some h " E B, contradicting x being s.c.r. We also assert ghg-'E B, for, if not, then, since I x 132, we can apply (**) to the pair ghg-' and w', to obtain the contradiction that ghg-' is s.c.r., but does not have length 0. It now easily follows from our assumptions on x and (when 1 w I = 1) (i) of the lemma, that w is s.c.r. and satisfies 1 w21 = 2 )w 1. Applying a similar argument to y, we see that w' 6Z B, and that, if ( y 1 = 2, then gh'g-'€ B, so that, in this case, we may assume g = 1 and (ii) is proved. It remains to prove (ii) when y I = 1 x y 1 = 1, and it is clear that it is sufficient to show that gh'g-'E B. Suppose this is not so, then we have gh'g-', w' E X , for, if not, we could apply (**) to this pair to obtain the contradiction that gh'g-' is s.c.r. But now we can obtain the same contradiction by observing that )gh'g-'x 12 1, and then applying (**) to gh'g-I and x. Thus gh'g-'€ B, and the proof of (ii) is completed. (iii) Suppose x E X is s.c.r. If CG( x ) B U X, then x commutes with some element y with ( y 1 2 2 . Since ( x y I = 1, we can write x = hw', y = h'w', as in the conclusion of (ii). Since I w21= 2 )w I and I x 1 = 1, by
I
I
I
178
B.M. Hurley
(i), j = + - 1. We may clearly suppose j = 1. Writing h" = h'h-*, we see that h " E CG(x) f l B and that y = x k h " . Hence we see CG(x) = ( X ) X ( B n C G ( X ) ) . Since l x Z l = ( h z w Z ( = 2 (iii) , is proved. 0 Our next result is an immediate corollary of part (ii) of the lemma. Corollary 4.10. If ghg-'w' is S . C . ~ .pseudo proper power of length at least 2, then ghg-' E B, so that we may suppose that g = 1, and that no cancellation occurs in the product hw'.
Proof of Theorem 4.7. It is easy to see that cases (i), (ii), and (iii) of the theorem are mutually exclusive, so that at most one case obtains. We shall assume that we have an NFS group (G, B, X), for which case (i) does not hold, i.e. for which Z ( G ) & B , and shall prove that one of the remaining two cases obtains. Let x be an element of Z ( G ) - B of smallest possible length. Clearly every element of Z ( G ) is s.c.r. and so x is s.c.r. If ( x ( = 1, then, by Lemma 4.9(iii), either G = CG(x) C B U X , when clearly X = G - B and (ii) holds, or G = C G ( x ) = ( x ) X ( B fl c G ( x ) ) = ( x ) X B, and we easily see that (x) is infinite cyclic and that X = B{x,x-'}B. In this case, x induces the identity automorphism on B, which is clearly root inner, and since the centre of G obviously is (x) x Z ( B ) , case (iii) obtains. Now suppose 1 x I 3 2. Since G # B, X is nonempty, and so we can choose t E X. Clearly ( x t 1 z= 1, so, by Lemma 4.9(ii) and (iii), t is s.c.r., x E CG( t ) = ( t ) x ( B n CG( t ) ) and ( t ) is infinite cyclic. Thus there exists b E B such that x = t"6, where ( n ( = ( x I. Replacing t by r-', if necessary, we may assume n is positive. If s E X , then a similar argument shows that x = sene,for some c E B and E = & 1, in normal form. Comparing these two normal €orms for x yields s = (td)', for some d E B. So G = gp(t, B). Now suppose e E B, then te E X , and so x = (te)'"f, for some f E B and E = & 1. If E = - 1, then te = ht-', for some h E B, and so ( x I = I(te)-"fl = I he(te)-'"-Z'f)d n - 2, contradicting ( x ( = n. Thus E = 1, and, comparing the two normal forms (et)"-'ef and t"-'b for t - ' ~ ,we see that et = th, for some h E B. But e E B was chosen arbitrarily, and so G is a normal extension of B by (x), and, since (x) is free, it is a split extension. It is now easy to see that X = B { t , t-'}B. Now, x = t"b E Z ( G ) , and so for y C B, we have t-"yt" = byb-'. Since b E B n CG(t),we have t-'bt = b, and so t induces an .nth root of y H byb-'. To see that it induces a primitive nth root of this automorphism, suppose t-"yt" = cyc-' and t-'ct = c, for some c E B, some m 3 1, and all y E B, then we see
Small cancellation rheory
179
t"c E Z ( G ) - B. By the minimality of the length of x = t"b, n S m, as required. It is now easy to see that the centre of G is of the required form, and so case (iii) obtains. 0 85. Small Cancellation Theory over
NFS groups
In this section, we shall assume familiarity with Lyndon (131, and we shall concentrate on the points of difference between Lyndon's theory and ours. For the whole of this section, (G, B,X) shall be a fixed NFS group. Definitions. A set R of c.r. elements of G is symmetrised if whenever r E R so is r-' and so is any c.r. conjugate of r. (From now on R is a fixed symmetrised set.) An element p E G is a piece (or an R-piece) if there exists rI # r2 E R such that rl = p s l , r2 = ps2 with almost no cancellation (see §2) occurring in both products. (Note that we do not require all the elements of R to be s.c.r. and that if R # 0, then any element of B is a piece.) We now state our small cancellation conditions, of which all except
T: are restatements, for NFS groups, of corresponding conditions, from [13], for free products. C @ ) : No element of R is a product of fewer than p pieces. C'(A): If r E R has the form r = ab with almost no cancellation occurring, where a is a piece, then 1 a 1 < A 1 r 1. It is easy to see that C'(A) with A s l/(p - 1) implies C@). Definition. If w E G and w = uxu, with no cancellation occurring in this product, and x EX, then we say that x is a letter occurring in w. If rl, r2 and 13 are s.c.r. elements of R, then, in at least one of the products rlrz, rzr3, 1311, no cancellation occurs. If a l , az and a , are letters occurring in s.c.r. elements of R, then T: alazas# 1. Ti: If r l , . . . , r5 are s.c.r. elements of R , with each r,ri+' # 1, then there exists a G such that each r l = aria-' is in R, and in at least one of the products r : ~ i l +no ~ , cancellation otcurs. Ti: If a ] , az, a,, a4 are letters occurring in s.c.r. elements of R and a l a z , aza3,a3a4& B, then alaza3a4# 1. We now take conditions T: and T: to be the obvious analogs of T: and T:.
Ti:
B.M. Hurley
180
We now state the main result of this section. It is an analog, for NFS groups, of Theorems I and IV of [13].
Theorem 5.1. Let (G,B , X ) be an NFS group, R be a symmetrised subset of G, N be the normal closure of R in G, and w be a nontrivial element of N. Under the additional hypothesis (9
C(6),
it follows that w contains some r from R with three pieces missing: w and r can be written w = bac and r = axIx2x3with almost no cancellation occurring in these products, where x1,x2and x3 are pieces. Under the hypotheses (ii)
C(4), T3 and T:,
or (iii)
C(3),
Tf,Tf, i = 3,4,5,
it follows that w contains some r from R with two pieces missing. 0
It is pointed out by Lyndon [13] p. 225 that, in the free product case, the next result is not quite an immediate corollary of the last. 1 C 1 x 11 + Ix21+ Ix3I. This is because, for example, we may have 1 xIx2x3 However corresponding considerations strengthen the hypothesis C’(A), for free products as opposed to free groups, in a way that more than compensates for this. A similar argument clearly works for general NFS groups.
Corollary 5.2. Let (G,B, X ) , R, N and w be as before. If we have (i)
C‘(A)
for some A d 1/5,
I
then we contains a part a of some r in R, with I a > ( 1 - 3A)l r I. If we have either (ii)
C(A),for some A
S f ,Ti
C‘(h), for some A
d 4,
and
T:,
or
(iii)
and
T:, Tf, i
= 3,4,5.
then w contains a part a of some r in R, with 1 a 1 > ( 1 - 2A)l r I. 0
-
To prove Theorem 5.1, we must first construct diagrams over NFS groups, and then show that these diagrams have all the “usual” properties. We turn to this construction now. It is essentially the same
Small cancellation theory
181
as that used by Lyndon [I31 for diagrams over ordinary free products, and that used by Schupp [19] for diagrams over free products with amalgamation. Let R be a symmetrised subset of the NFS group (G, B, X), satisfying (i), (ii) or (iii) of Theorem 5.1. With certain sequences pl,p 2 , . . . ,p , of conjugates of elements of R we shall associate a diagram M(pl,p 2 , . . . ,p , ) which will be a connected, simply-connected, oriented planar map with a distinguished vertex 0 E M’, the boundary of M. M will be labelled by a function cp into G satisfying: (1) If sl, . . . ,s, are, in order, the edges in the boundary cycle of M beginning at 0, then, writing w = pl * * p,, we have w = cp(sl).. * cp (s,) with almost no cancellation occurring in this product of labels. (2) If D is any region of M and e l , .. . ,ei are the edges in a boundary cycle S of D, then cp ( e l )* . . cp ( e i ) is equal to a c.r. conjugate of one of the pi, and almost no cancellation occurs in this product. By a weak diagram for a sequence pl, . . .,p,, we mean a diagram M satisfying (2), but only a weaker form of (l), namely that the product of the boundary labels, taken in their “correct” order, is equal to w = pl * * p,,. Suppose the (weak) diagram M‘ can be obtained .from the (weak) diagram M by: (a) choosing a new origin 0’, not necessarily lying on the boundary M ’ of M, for M’, (b) deleting boundary edges ending up with a, possibly disconnected, diagram MI’, of which 0’ is a boundary vertex, and (c) deleting the connected components that d o not contain 0‘, then we say M‘ is a subdiagram of M. It is important to notice that, in (b), we not only allow deletion of edges on the boundary of the original diagram M, but we also allow deletion of edges that become boundary edges in virtue of other edges having been deleted. We say a (weak) diagram is reduced if the product of the boundary labels (taken in their “correct” order) of any subdiagram is equal to 1 if and only if that subdiagram is a tree. We say a sequence p , , p2,.. .,p . of conjugates of elements of R is minimal if no shorter such sequence has the same product. 9
Lemma 5.3. A ( w e a k ) diagram for a minimal sequence is reduced. Here we impose no cancellation hypotheses on R .
Proof. Let M be a non-reduced (weak) diagram for a minimal sequence. Then M contains a subdiagram M’ the product of whose boundary labels is 1, and which is not a tree. We can now divide M into diagmms A?’ and M” as shown in Figure 1. The product of the boundary labels of M“ is clearly the same as the product w of the boundary labels of M. Since f i r is not a tree, M” contains less regions
182
B.M. Hurley
than M, and so must be a diagram for a shorter sequence. This contradicts M being a diagram for a minimal sequence. 0 Exactly as in [13], we may prove the following lemma. Lemma 5.4. If M is a reduced (weak) diagram, then the label on an interior edge is a piece, provided the boundary path of every path of every region is a simply connected closed path. 0 We recall the initial construction of the diagram M, over a free product F as described by Lyndon [13]. In this case vertices are divided into two classes, primary and secondary. The labels on every edge belong to a free factor X i of F, with the labels on edges meeting while the labels on at primary vertices belonging to different factors Xi, edges meeting at secondary vertices belong to the same factor. Moreover in traversing any path through the diagram, one comes across, alternately, primary and secondary vertices. We shall use an analogous initial construction; however, following Schupp [ 191, we allow 1 to be a label. The precise relationship between the two classes of vertices and the labels on the edges in our initial construction shall be as follows: (a) The label on any edge has length 0 or 1. (b) If vl, v2, v3 are vertices of M joined by edges El = ( v , , v2) and E2 = ( v 2 ,v,) and v I is primary, then 1q(E1)cp(E2) Cl1. Moreover if v l , v2, v3 are consecutive vertices of either the boundary path of some region or the boundary path of the whole diagram (starting at 0), then Q(EI)Q(E2)I= 1. (c) Suppose vi ( i = 1 , . . ., 5 ) are consecutive vertices of the boundary path of some region, v 1 is primary and Ei= (vi,v ~ + ~ for) ,i = 1 , . . . ,4, then 1 Q (El)cp (E,)Q(E,)Q(&)I = 2. Unless we are dealing with a weak diagram, we require an analogous condition to obtain when the vi are consecutive vertices of the boundary path of the whole diagram (starting at 0).
I
Small cancellation theory
183
We introduce two ways of adjusting the labels on the edges of a diagram. (i) Suppose the vertices u, (i = 1,. . . , r) are joined to t h e vertex u by edges E, = (uz,u ) (i = 1,. . . , r ) with labels cp(E,) (i = 1,. . . , r). Then we perform an adjustment of the first kind if, for each i, we change the label on E, to cp(E,)h,where h is a fixed element of B. (ii) Suppose the primary vertices u, (i = l , . . . , r ) are joined to the secondary vertex u by edges E, = (ua,u ) ( i = 1,. . . ,r ) with labels cp(E,) (i = 1,. . . , r). Then we perform an adjustment of the second kind if, for each i, we change t h e label on E, to cp(E,)(cp(E1))-'. Figure 2 illustrates an adjustment of t h e second kind. It is easy to see that adjustments of both kinds preserve properties (a), (b) and (c) of the labeling. Moreover, the precise set of quintuples of boundary points for which the analog of (c) fails remains unchanged by these adjustments. The product w of t h e labels on the boundary arc is unchanged by any adjustment with u # O . If u = 0, then an adjustment of the first kind changes w to K ' w h , for some h E B, and, since we shall take care to make 0 a primary vertex, adjustments of the second kind will have no effect on w. l B = 0, for if R # 0 then any element of B is a piece, and Now R r so, by Lemmas 4.3 and 4.5, any conjugate p of an element of R can be written as p = wrw-' with no cancellation occurring and r E R. Note that, since every element of R is c.r., if p = wfrfwf-Iwith no cancellation and r f E R , then ( w I = I w f l . W e d e f i n e s t ( p ) = I w / . We say a sequence p l , p 2 , . . . ,p . of conjugates of elements of R is special if it is minimal, and, for i = 1,. . . , n, p i p ; . . p : = p I p 2* * p , , with p : a conjugate of something in R for 1 9 j 9 i, implies st @,) =zst @:). Our initial construction of a diagram M ( p l , p 2 , .. . , p n ) for a special sequence is as follows. If n = 0, then the diagram consists in the
x.
P?,rn.."
01
S.arm.rr "art."
".,I..
Figure 2.
184
B.M. Hurley
Tq'f 0
0
Figure 3.
primary vertex 0. If n 3 1, then ,...,pn-l is special sequence, and so we may assume, inductively, that we have constructed a diagram M(pl,.. . . , P , , - ~ )for it. We also assume, inductively, that t h e origin is primary, and that, when all vertices of degree 2 have been deleted, the diagram is, according as R satisfies (i), (ii) or (iii) of Theorem 5.1, a (3, 6), (4, 4) or (6, 3) map. Here, by a ( p , q ) map, we mean one in which each interior vertex has degree at least p and each interior region has degree at least q. We now form the usual 'lollypop' diagram for p,,, making the origin of this diagram primary, and form a weak diagram for pl, p 2 , . . . ,p,, by indentifying t h e origins of the two diagrams. We now form the diagram M ( p l , p 2 ,.. . . , p . ) from this diagram by a process of 'cancellation' o n the boundary, which we shall describe fully below. Before doing so, we note that, in such a process, only edges on the boundary of the region of the 'lollypop' diagram can become identified with an edge on the boundary of some region of M(pl,.. . . ,P , - ~ ) ,for otherwise we could clearly construct a sequence p i , p i , . . . . , pL of conjugates of elements of R, with plp;...pA= pIp2* * .p . and st (p:) < st (p"),contradicting pl,p 2 , . . . . ,p,, being special. Informally, this says that the diagram M(pl, . . .,p , ) will 'look like' one of t h e three diagrams of Figure 3, and t h e same is true of all the intermediate diagrams formed during the cancellation process. In Figure 3, D is the region from t h e 'lollypop' diagram for p,, and u is the only vertex, if any, at which our cancellation process can take place. To see how the cancellation process works, let M * be an intermediate diagram formed during the process. Then there is precisely one quintuple of consecutive vertices ui (i = 1 , . . . ,5) of the ) , i = 1,. . . ,4, for which boundary path joined by edges Ei = (ui, u ~ + ~for
Small cancellation theory
185
1, with u1 primary. Since M * 'looks like' Icp(El)cp(E2)cp(E3)cp(E4)1 one of the diagrams of Figure 3, with u3 = u, we see that u2 f u4. (Our sole reason for using only special sequences in our construction is that it enables us to force this inequality.) We now transform the diagram as follows. (i) If 1 cp (El)cp(Ez)cp(E,)cp(E4)I= 1, then because u2 # u4, it is possible to force cp(E2)= cp(E,)-', by making adjustments of the first kind. Having made such adjustments, we identify u2 with u4 and E2 with E;', and then we make another adjustment of the first kind, if necessary, to restore the value of the product of the labels on the boundary. Now no further cancellations are possible on the boundary, and M ( p , , . . . , p , ) has been constructed. (ii) If 1 cp(El)cp(Ez)cp (E,)cp(E4)I= 0, then we assert ul f u5. For, if not, then we could construct a reduced diagram whose boundary consisted of a single edge with a label of length 0, and which would be (for reasons which will become clear below), according to case, a (3,6), (4,4)or (6,3) map. Corollary 2.4 of [13] now implies that some element of R is a product of two pieces, a contradiction. Since we also have uz # u4, it is possible to force cp(E,)= cp(E4)-'and cp(Ez)= cp(E3)= 1, by means of adjustments of both kinds. Having made such adjustments, we identify uz with u4, E2with Eq', u1 with us and El with ES', and again we make a final adjustment of the first kind, if neccessary. We now see that the diagram, so obtained, 'looks like' one of the diagrams of Figure 3, and, if further cancellation is possible on the boundary path, then it takes place at u I = us, which 'looks like' the vertex u of Figure 3. It is clear that, when we identify two primary vertices in this cancellation process, we obtain a primary vertex. To see that identifying two secondary vertices yields a secondary vertex, consider the situation of Figure 4, where we wish to show, for instance, that ( c d - ' (s 1. We know ( c a - ' ( ,(ad-'ls 1, and if either of these is zero, then we are done. Suppose then that ( c a - ' (= lad-'l= 1, but ( c d - ' ( = 2. Then, putting g = ca-', h = ad-' and k = d, we see that g, h, k E X but gh B U X. So, by N4*,c = ghk $Z B U X, which contradicts our assumption that all labels have length 0 or 1. Thus Icd-ll S 1, as required. We now wish to show that the cancellation process preserves the property of being, according to case, a (3,6), (4,4)or a (6,3) map, when all vertices of degree 2 have been deleted. We easily see that the T-conditions have the effect of excluding interior vertices of the 'wrong' degrees, except that T,' does not exclude that the diagram should contain an interior vertex P of degree d(P) = 4. However
186
B.M. Hurley
Figure 4.
suppose that in the passage from M * to M * * , the diagram obtained by ‘cancelling’ at u3, we have ‘created’ such a vertex. We wish to use a similar construction to that of Lyndon [13] pp. 224-225 to eliminate this vertex. However, with Lyndon’s notation, in our case, Ti only allows us to suppose azalE B, which implies x1 = hx,, for some h E B. However, the vertices PI and P , must be distinct, for otherwise we should have an impossible subdiagram, which is both a (6,3) map and has label of length 0 on its boundary. Hence it is possible to force x , = x 3 , by means of an adjustment of the first kind, and then to eliminate P as in (131. We now show that the boundary of any region of our final diagram M(pl,.. . . , p . ) is a simple closed path. Suppose not, then there exists a region D such that its boundary D’ is not a simple closed curve, but that the submap L obtained by taking 0 ’ and all of M interior to D contains no other such region. Let 1 be a loop contained in D ’ , but not the whole of D , such that the subdiagram K obtained by taking 1 and the whole of M interior to 1 contains as few regions as possible. We see that at most one vertex uo, say, of 1 lies in D ’ . It follows from our T- and C-conditions that every region of K has degree at least p and every interior vertex of K has degree at least q (i.e. K is a [ p , g ] map), where, according to case, (p, q ) = (3,6), (4,4) gr (6,3). K contains more than one region, for otherwise C ( p ) is contradicted, and by [13] corollary 2.4, must therefore, contain more than one vertex. Then, by [13] corollary 2.2, Z.’(p/g + 2 - d ( u ) ) s p , where this sum is taken over the boundary vertices of K. But the T-conditions imply that only uo can make a positive contribution to this sum, which gives us the contradiction that Z:’(p/4+ 2 - d ( u ) ) < p. All the hard work in the proof of Theorem 5.1 is now over. We merely now need to remark that it is obvious that every w in N, the normal closure of R, can be written as a product of a special sequence,
Small cancellation theory
187
and the theorem now follows in the same manner as Theorem I of Lyndon [13]. It is also possible to use corollary 2.4 of [13], and our construction of a diagram, in a similar way to prove our next result.
Lemma 5.5. Let (G,B, X ) , R, N and w be as before. Suppose further that w is not conjugate to an element of R. If we have 6)
C’(A), for some A
S
1/5,
then some s.c.r. conjugate of w contains two disjoint subwords, a l and az, such that, for i = 1,2, each ai is a part of some ri in R, with 1 ai 1 > (1 - 3A ) 1 ri I . If we have either (ii) or (iii)
C’(A), for some A
S ! , Ti
C’(A), for some A
S;,
and Ti,
T:,Tf,
i
= 3,4,5,
then some s.c.r. conjugate of w contains two disjoint subwords, a l and a2, such that, for j = 1,2, each a, is a part of some rj in R, with I a, I > (1 - 2A)(rj I. There is, of course, a non-metric version of this last lemma, which the interested reader might like to formulate for himself/herself. We end this section with a few remarks on our results. Firstly it is clear that, by making appropriate adjustments of the second kind to our final diagram, the subwords a of w, in Theorem 5.1 and Corollary 5.2, and the two subwords ai ( i = 1,2) of an s.c.r. conjugate or w, in Lemma 5.5, can be taken to consist of a consecutive sequence of ‘letters’ from the normal form of w, or, respectively, the s.c.r. conjugate of w. However, we cannot take a (respectively ai, for i = 1,2) to consist of a consecutive sequence of ‘letters’ from a normal form for r (respectively ri, for i = 1,2), without allowing for amalgamations to take place when we express it as a subword of w (respectively an s.c.r. conjugate of w ) . Our second remark concerns the strength of the C-conditions. Suppose R satisfies C’(A), and r E R has the form r = ab with almost no cancellation occurring, where a is a piece. By definition, there exists r ’ E R, which can be written r’ = ab’ with almost no cancellation occurring, with r# r’. This fact can be illustrated by the two region diagram of Figure 5. If r is not s.c.r., then, provided I a 1 # 0 , we can make an adjustment of the second kind at the vertex v, which, obviously, is secondary, to make r s.c.r., while leaving r‘ c.r. and
B.M. Hurley
188
Figure 5 .
leaving the length of a constant. We thus see that C‘(A) implies that if r E R and r is not s.c.r. and r = ab, almost n o cancellation occurring, and a is a piece, then 1 a < r 1 - 1). Our third remark concerns the formulation of Corollary 5.2 and Lemma 5.5. For example, in Corollary 5.2(i), to conform with the ‘usual’ way of formulating such results, we have written l a 1 > (1 - 3A)(r 1. But it is clear that a stronger result holds, namely, that we can write r = ab, almost no cancellation occurring, with I b < 3 A I r 1. We shall have occasion to use this stronger form of our results in the next section. The easiest way if doing this, while avoiding undue circumlocutions, is simply to agree that, when we say that a is more than a of r, what we mean is that r = ab, for some b, almost no cancellation occurring, where b 1 < (1 - a ) [r 1. This convention has the advantage that, if a is more than half of r, then I b < 1 a Our fourth remark concerns the applications of the results of this section. We shall only apply Case (i) of the various results; we have already given our reasons for including the other cases. The results are so fundamental to the subject, and shall be used so often, that it would become tiresome to refer to them by name at each application. Accordingly, we shall not do so, but we d o not anticipate that this will cause the reader undue difficulty. We shall, in fact, be mainly concerned with proving embedding theorems, for which purpose, we shall, for the most part, need Corollary 5.2, or, to be precise, a trivial corollary of it. We leave the reader to judge whether or not, given Corollary 5.2, the next result is worth stating at all. However, if it is to be stated, it must clearly be called a theorem.
I 4
I
I
I
I.
-
Theorem 5.6. Let (G,B, X) be an NFS group, R be a symmetrised subset of G, N be the normal closure of R in G, and w be a non-trivial element of N . If we have C‘(A), for some A S 115, and R contains no element of ( B UX)’, then G I N embeds B U X . Proof. N contains no element of (B U X)’.(Note that if r E R and ( r (S 5 , then every R-piece contained in r has length 1 . ) 0
189
Small cancellation theory
06. Control over torsion, conjugacy and the centre in 9th and 10th groups
-
In this section, ( G , B , X ) is a fixed NFS group, R is a fixed symmetrised subset of G, N is the normal closure of R in G, and F is the factor group GIN. Lemma 6.1. Suppose R contains no p.p.p. Zf x m a E R, where m, ( x m aI 3 2, x is s.c.~.with Ix 1 3 1, and almost no cancellation occurs in the product ( x " ) a , then ax # xa, and x and x " - I are pieces. Proof. Suppose ax = xu. Since R contains no p.p.p., 1 a 15 1. Applying Lemma 4.9(ii), x = hw' and a = h w ', for some h, h ' E B, w E G and j , k E Z - { 0 } , where w iss.c.r., ( w 2 1 = 2 ( w (and , h , h ' a n d w commute in pairs. Now x m a = (hm'h'k)wm'+kis not a p.p.p., and so mj + k = & 1. Obviously, ~ j ~ , ~ k ~ and , / w so ~ ~ l ,
d ( x " , a - ' ) = l { m l j l l w ( +Ik IIw I- Imj
+ k 1 1 w /}a1,
and so cancellation occurs in ( x m ) a ,contrary to hypothesis. Thus a x # x a , and so x " - ' ( x a ) # x " - ' ( a x ) E R and x ( x " - ' a ) # x ( x " - * a x ) E R, with almost no cancellation occurring in any of these products. Hence x and x"-' are pieces. 0 Corollary 6.2. Suppose R satisfies Cf(1/5), and contains no element of ( B U X)' and no p.p.p. Suppose x E X , b E B, m 2 1 and 1 x ' J = 2, then x"b# 1 in F. Proof. Since F embeds B U X , we may assume m 3 2. Since b E B, it is a piece. If x"b E R, then, by the lemma, it is a product of three pieces, contradicting Cf(1/5). Thus, if x"b = 1 in F, then some s.c.r. conjugate of x"b contains two disjoint subwords, each of which is more than 2/5 of some element of R. From this it easily follows that some element of R has the form ~ " awith , almost no cancellation occurring, and a being a product of at most three pieces. If 1 a = 0, then, as above, this element of R is a product of three pieces, contradicting C'(1/5). Thus I a 3 1, and so, by C'(1/5), lxma I 3 6 , and so Ix" 13 2. From the lemma, x and x"-' are pieces, and so x m a is a product of 5 pieces, contradicting C'(1/5). Hence x"b# 1 in F, as required. 0
I
I
Corollary 6.3. Suppose Z ( G )P B, then, if R satisfies C'(1/5) and contains no element of ( B U X)' and no p.p.p., then R = 0.
190
B.M. Hurley
Proof. By Theorem 4.7, if Z ( G ) e B, then either X = G - B, when there is nothing to prove, or every element of G - B has the form x"b, where x E X , b E B, m 3 1 and ( x ' ) = 21x I. Thus, in the latter case, by Corollary 6.2, F embeds G, and so R = 0. 0 Theorem 6.4 (Torsion theorem for 9th groups). Let (G, B, X ) , R, N and F be as above. Suppose R satisfies Cr(1/9), and contains no element of ( B U X)' and no p.p.p. Then every element of finite order in F is conjugate in F to an element of B U X . Hence F contains an element of finite order n i f and only if G does. Proof. Let w have finite order n in F, and let C be the set of all elements of G whose images in F are conjugate, in F, to w. Let z E C have minimum possible length. If z 13 2, then z is s.c.r. and no c.r. conjugate of z can have the form ab, almost no cancellation occurring, with a more than half an element of R. Since R contains no p.p.p., z " @ R, and so contains more than 5/9 of some element of R. Replacing z by an s.c.r. conjugate, if necessary, we may suppose Z" = ub, no cancellation occurring, where u is more than 6/9 of some element of R. Since u is not a subword of z, by [lo] lemma 2.4, u = zmt and z = ts for some t, s E G, no cancellation occurring in either product. Thus some r E R has the form r = uu = (ts)"rv, with almost no cancellation occurring in the product uu and no cancellation anywhere else. If m 2 2 , then, by Lemma 6.1, (ts)"-' and ts are pieces, and we easily see that t is also a piece, which implies that u is the product of three pieces contradicting its being more than 6/9 of an element of R. If m = 1, then r = tstu and t is a piece unless tstu = tuts. But, in this case, either u = suI,no cancellation occurring, when, by Lemma 6.1, ts, and hence t, is a piece, or s = us1, almost no cancellation occurring, when again t is a piece. Thus t is less than 1/9 of r, and, since tst is more than 6/9 of r, we see that u is less than 3/9 of r. Hence:
I
But z = ts = (tu)-' in F, which contradicts the minimality of 1 z Hence 1 z 13 1, and the theorem is proved. 0
1.
Lemma 6.5. Suppose R satisfies Cr(l/lO), contains no element of ( B U X ) ' , and contains no element that is conjugate to its own inverse. Then for each g E G, there exists g E G such that g = g in F, and, for all s, t E B U X , if g-'sg = t in F then g-'sg = t in G.
Small cancellation theory
191
Proof. We prove the stronger result (***). (***) Suppose R satisfies the hypotheses of Lemma 6.5. If g E G, then any element g E gN of shortest possible length satisfies the conclusion of the lemma. Suppose (***) to be false. Let g E G be an element of smallest possible length such that there exists g, of smallest possible length in gN, that falsifies (***). Clearly g has smallest possible length in gN = gN, g-'sg = t in F if and only if g-'sg = t in F, and, by the minimality of the length of g , 1 g 1 = I g 1. Thus we may assume g = g. Note that here we are taking a new choice for g ; we d o not claim that, by taking g to be our original g , the conclusion of the lemma would not hold. Now there exist s, t E B U X such that g-lsg = t in F, but not in G. It easily follows that ( g 1 s 2 . Write g = g l * . . g , ( r 3 2 ) in normal form. We assert I g;'sgll, I g,tg;'( s 2 . Suppose, for instance, that g;'sgl = sIE B U X. Then h = g 2 * g , must be a shortest element of hN, and, by the minimality of g , since h-'s,h = t in F, this equation also holds in G. But this implies the contradiction that g-'sg = f in G. Hence 1 g;'sgl 1 3 2, and, similarly, 1 g,tg;') 2 2 . The word g-'sgt-' must have an s.c.r. conjugate that can be written, in normal form, as g ; ' . . . ( g ; ' s ) g l * * (g,t-') or as one of the eight similar forms obtained by allowing s to amalgamate with g l instead of g ; ' , or by allowing t - ' to amalgamate with g;' instead of g , or by allowing both s to amalgamate with g l and t - ' to amalgamate with g ; ' , or by allowing less amalgamations. Call this s.c.r. conjugate of g-lsgt-', w. If w E R, then some c.r. conjugate of w can be written ga, with almost no cancellation occurring, and some c.r. conjugate of w - ' can be written gb, with almost no cancellation, but 1 g I S w 1, and so, by C'(l/lO), w w - ' , contrary to hypothesis. Thus w $Z R, and since w = 1 in F, some s.c.r. conjugate of w contains two disjoint subwords each of which is more than 7/10 of some element of R. At least one of these subwords does not "involve" the letter t - ' , and so there exists an element u E R, 7/10 of which has the form:
-
-
u = g;'
* * *
g;'sgl
*.
*
gk,
where, at most, an amalgamation occurs between g;' and s or between s and g , , but not at both places. From the minimality of g, neither g l * gj nor g l . . . g k is more than half an element of R, and it easily follows that both have length > 1/10]u I. By C'(l/lO), u u-I, contrary to hypothesis. This contradiction proves (***). 0
- -
-
Corollary 6.6. If R satisfies the hypotheses of Lemma 6.5, and contains
192
B.M. Hurley
no p.p.p., then F embeds the centraliser of every s.c.r. element of length 1, and, if x is such an element, then C F ( x )= CG( x ) , with the “obvious” identification. Proof. It is immediate from t h e lemma that if x E X is s.c.r., then C,(x) is the image of C, (x). If C, (x) C B U X, then it is embedded in F, and the result follows. If CG(x) B U X , then, by Lemma 4.9(iii), every element of CG( x ) has t h e form x”b, where m E Z and b E B. Since 1 x z (= 2, it follows from Corollary 6.2, that F embeds C, (x), and the result holds. 0 The next result is an immediate corollary of Lemma 6.5, but, in view of its content, we label it a theorem. Theorem 6.7. Let (G,B, X ) , R, N and F be as above. Suppose R satisfies C’(l/lO), and contains no element of ( B U X ) * and no element that is conjugate in G to its own inverse. Then any two elements of B U X are conjugate in F i f and only if they are conjugate in G. 0 Theorem 6.8. Let (G,B, X), R, N and F be as above. Suppose R satisfies C’(l/lO), and contains no element of (B U X)’, no p.p.p., and no element that is conjugate in G to its own inverse. Then Z ( G ) is embedded in F, and, with the “obvious” identification, Z ( G )= Z ( F ) . Proof. If R = 0, then the result is trivial, so suppose R # 0. By Corollary 6.3, Z ( G ) cB, and so is embedded in F. But z E Z ( F ) if and only if z-’sz = s, for all s E B U X , and so, by Lemma 6.5, Z ( F ) must be the image in F of Z ( G ) , and the theorem is proved. 0 97. SSC groups
-
In this section, we consider a class of NFS groups, each member of which has many “nice” small cancellation quotients. In the next section, we shall be concerned mainly, but not exclusively, with quotients of groups from this class. We start with some definitions.
-
Definitions. Let R be a symmetrised subset of the NFS group ( G , B , X ) . By the basis number b ( R ) of R we mean the number of equivalence classes in R under the equivalence relation “ r is conjugate to s or s-”’. A basis for R is a set of representatives for these classes. If S is a basis for R, then Card(S) = b ( R ) S Card(R), but, in general, we need not have b ( R ) = Card(R).
193
Small cancellation theory
-
Definition. Let K be an infinite cardinal. Then an NFS group ( G , B , X ) is an SSC-K group, if G I = K and it contains a symmetrised set R satisfying C’(1/6),b ( R ) = K , and the centraliser condition (C.C.) below. C.C. Every element r E R has an s.c.r. conjugate r‘ satisfying
I
CG(r’) n B
= Z(G).
-
An SSC group is defined to be an NFS group that is an SSC-K group for some K . Before we give examples of SSC groups, we need a concept introduced by Schupp [ 191. Definition. Let A be a subgroup of the group H. Let {xl,x2}be a pair of distinct elements of H, neither of which are in A. We say that {x,,x2}is a blocking pair for A in H if the following two conditions are satisfied: (i) x f x p & A , l s i , j G 2 , E = k l , S = ? l , u n l e s s x f x ~ = l , (ii) if a E A , a # 1 , then x f a x p @ A ,l s i , j s 2 , E = k l , S = * I . Theorem 7.1. (i) A non-abelian free group of rank r, with the “usual” NFS structure, is an SSC-K group, where K = max (r, No). (ii) A free product of two or more non-trivial groups is an SSC-No group, with the single exception of the group C2* C2, provided it has order No. (iii) A free product with amalgamation ( A * B : H = K ) , where H A, K B, H = K , A # H and B contains a blocking pair over K , is an SSC-No group, provided it has order No. (iv) A n H N N group ( H ,t : t-’At = A+), where : A B is an isomorphism between subgroups of H and I H 1 = No, is an SSC-No group under any of the following conditions: (a) There exists x E H, x bZ A, x $Z B such that x-’Bx n A = 1. (b) There exist x, y E H such that x - ’ A x f l A = y-’By n B = 1. (c) There exists m 2 1 such that C,,,= D,,, = 1, where Co= A, Do = B, and, for n 2 0 , Cn+,= A n +(Cn) and Dn+, =B f +-‘(Dn). l (v) A group that can be presented with at least three generators and only one relator can be equipped with an NFS structure in such a way that it becomes an SSC-No group. In all the above cases, the SSC groups have a trivial centre.
+
-
Proof. These are the “classical” cases. The existence of the appropriate symmetrised sets follows from the results of [13],[18] and [19].The fact that these groups have trivial centre follows easily from the
B.M. Hurley
194
construction used in each case. Moreover C.C., which now has the form: “Every element r E R has an s.c.r. conjugate r‘ satisfying CG( r ’ ) r l B = l”, is easy to check. For example, in case (iii), if k E K, k # 1, then k cannot commute with either element of the blocking pair { x I , x z }in B over K, and so t h e group has trivial centre. If R is t h e symmetrised set constructed in [19] (to be precise, R is the symmetrised set of [19] 04 with C = l), then for r E R, let r’ be an s.c.r. conjugate of r whose first letter in its normal form is xT’. If r’ commutes with k E K, then cancellation takes place in r’kr’-’, which implies that x ; k x : E K, for some E = 1, 7 = & 1. From the definition of a blocking pair, we now see k = 1, and so C . C . holds. We remark that Murasugi [15] first noticed that the one relator groups of (v) have trivial centre, however, Sacerdote and Schupp [18] show that they are all HNN groups of type (iv)(a), and, in the present context, this is t h e easiest way of seeing that they have trivial centre. 0
*
Lemma 7.2. Let (G,B, X ) be an SSC-K group. Let real numbers A, n > 0 be given. Then G contains a symmetrised set R satisfying C’(A), b ( R ) = K , C.C., I r > n for all r E R, and containing no p.p.p. and no element conjugate to its own inverse.
I
Proof. By definition, G contains a symmetrised set R I satisfying C’(1/6), b ( R ) = K , and C.C. Partition a basis SI for R 1into pairwise disjoint triples. To each such triple U = (s,,s2,SJ, say, we associate a word W ( U ) , which we define to be an s.c.r. conjugate of:
n M
,=I
{SIS2(SIS3)’),
where M is some large number. We now take the set of all W ( U )to be a basis for a symmetrised set R. Now in each W ( U ) ,less than 1/6 of each s, is lost through cancellation either at the beginning or-at the end of this s,, which leaves at least 2 as a subword of W ( U ) . But since each s, can be identified by as much as 1/6 of it, it is clear that we can force R to satisfy all the required conditions, except, possibly, C.C., merely by making a large enough choice for M. To see that we can force R to satisfy C.C., consider how cancellation can occur on the left-hand side of the equation W ( U ) - ’ h W ( U ) =h, which we shall assume holds for some h E B. Lemma 2.l(i) of [lo] tells us that, as soon as an amalgamation takes place, the left-hand side becomes reduced, which implies that no amalgamations take place. But this implies that some a E G exists such that when we write W ( U )= uav
Small cancellation rheory
195
and r = ab, n o cancellation occurring in either product, with a more than 1/6 of si E U, and r an s.c.r. conjugate of si,then an equation of the form k , a = ak2, with k , , k Z E B, and k, a conjugate of h, must hold. Replacing si by an s.c.r. conjugate, if necessary, in the definition of W ( U ) , we can write W ( U )in t h e above form where r satisfies C G ( r )n B = Z ( G ) . Since a is not a piece and kla = akz, we see that k, must commute with r. Hence k l E Z ( G ) , and, since h is conjugate to k,, h E Z ( G ) . 0
Remark. By a similar argument to that used above, we see that if R 1 satisfies C.C. and contains no element of X , then any s.c.r. element r E R I satisfies C G ( r )n B = Z ( G ) , and so no special choice of a c.r. conjugate for si in the definition of W ( U )is needed, in this case, for the conclusion of the lemma to hold. The next result is easy to see and its proof shall be omitted.
-
Lemma 7.3. Let R be a symmetrised set, in the NFS group (G,B,X ) , satisfying C'(1/6), C.C., 1 r 1 2 2 for all r E R, and containing no p.p.p. and no element conjugate to its own inverse. Then, for all r E R, CG ( r ) f l ( B U X ) = Z ( G ) . 0 Corollary 7.4. If M is a normal subgroup of the SSC group (G,B, X ) and M d B, then M d Z ( G ) . Proof. By Lemma 7.2, G contains a symmetrised set R satisfying the hypotheses of Lemma 7.3. Let r E R and rn E M. Then r-lrnr = m , E M. Since r cannot be an R-piece and mrm = rm ,rn E R, we see m = m , , and so m E C,(r) f l B = Z ( G ) . 0 It is, perhaps, worth remarking that the last result can be strengthened to the statement that any bounded normal subgroup of an SSC group lies inside the centre, where we say S C G is bounded if S C (B U X)", for some n.
Lemma 7.5. If (G,B, X ) is an SSC group, then for any subgroup M Z ( G ) , G / M is an SSC group and Z ( G / M )= Z ( G ) / M . In particular, G / Z ( G ) is an SSC group with trivial centre. Proof. Let us write a bar over symbols to denote image in = G/M. It is obvious, from Lemma 7.2, that X # G - B, and that we d o not have X = B { t , t-'}B, with G a split extension of B by an infinite cycle
B.M. Hurle y
1%
x)
(t). It follows that (G, B, does not have either of these two forms, and so, by Theorem 4.7, Z ( G )S B. Let R be the symmetrised subset of G constructed in the proof of Lemma 7.2. Let p E G, and suppose that is an I?-piece. (Note that, since the map G + G is length preserving, R is a symmetrised subset of G.) Then there exist a, b E G and rl # rz E R, such that rI = p a (mod Z ( G ) )and r, = pb (mod Z ( G ) ) , and it follows that p is an R-piece. Thus R satisfies C'(1/6) if R does. Now suppose r E R and h E B. If h E CG(T) n B, then r-lhr = hz, for some z E Z ( G ) . If R satisfies C'(1/6), then z = 1. Since R was constructed as in the proof of Lemma 7.2, we may assume it satisfies the hypotheses of Lemma 7.3. Thu? h E Z ( G ) and R satisfies C.C. Thus G/M is an SSC group. Since Z ( G / M )C B, the inverse image, in G, of Z ( G / M ) lies in B, and hence, by Corollary 7.4, lies in Z ( G ) . Hence Z ( G / M )= Z ( G ) / M . 0
Lemma 7.6. Let (G,B , X ) , A, n be as in Lemma 7.2, let R1 be a symmetrised subset of G satisfying C'(1/6) and C.C., and suppose T is a finite subset of G - 1. Then there exist symmetrised subsets R, and R, of G, both satisfying the conclusion of Lemma 7.2, such that (i) every subword a or r E R,, satisfying 1 a I S A 1 r 1, contains every element of T as a subword, (ii) R, C gp(R,, T ) and (iii) R, C gp(R,, T) fl (t)").
(n,,
Proof. Let T = {tl, t,, . . . ,t s } . By Lemma 7.2, there exists a symmetrised subset R * of G satisfying C'(1/20), b ( R * )= K , C.C., J r (>20(ti 1 for all r E R * and all ti E T, and containing no p.p.p. and n o element conjugate to its own inverse. From the form of the construction of R * , we see R * E gp(RI). It follows from Lemma 6.1, that there exists, for each i = 1,. . . , s and each E = ? 1, rn ( i , E ) 3 1, such that, for all r E R * , r(t:)m(i.')rcontains t ; as a subword, but
Now, we construct R, from R *, by first partitioning a basis S * for R * into pairwise disjoint triples. If U = (sl, s,, s3) is such a triple, we first form the modified triple U' = (si, s2, ss), where s; = S l t ; " ( l . l ) S , t ~ ( Z 1s1 )
. ..
t~(s.l)slt;m(l.-l)
st..
.
t ; m ( s . - 1 ) s1.
We now construct R,, by a similar construction to that of Lemma 7.2, but using modified triples U' instead of U. It is clear that we can make R, satisfy (i) and (ii). We illustrate the construction of R, in the case T = {t}. The general case follows from an obvious induction argument. The construction is
Small cancellation rheory
197
similar to the construction of R2, but we take our modified triples to be U ”= ( s y , s;, s‘& where s’i = s;’tsi. Since R * contains no element conjugate to its own inverse, we see that this works. Obviously R 3 C gp(Rl, t ) n 0)“. 0 We now come to our most important “New symmetrised sets for old” construction.
Lemma 7.7. Let (G,B, X ) be an NFS group, and suppose A, p are real numbers satisfying 0 < 3A < p < f. Let R , be a symmetrised subset of G, which satisfies C‘(A), and which contains no element of length less than 2 + 2/(p - 3A), no element conjugate to its own inverse, and no p.p.p. Suppose S , is a basis for R,. Let cp : S , + B U X be a function such that, for all s E S , , (scp)s is c.r. and CG(s) n ( B U X ) C CG(scp). Let S = {(scp)s : s E S , } , and let R be the symmetrised closure of S. Then R satisfies C ’ ( p ) , and contains no element of length less than 2 / ( p - 3A), no element conjugate to its own inverse, and no p.p.p. Proof. Since the length of an element of G is decreased, at most, by 2 when it is premultiplied by an element of B U X , it is clear that R contains no element of length less than v = 2 / ( p - 3 A ) . Now suppose we can write r, = pa, r2 = pb, almost n o cancellation occurring, with r , # r2 E R. By the comments following Corollary 5.2, we may suppose r, to be s.c.r. Suppose 1 p 1 3 3A I r , I 2, then it is clear from the construction of R that p = qlflq2f2q3, with no cancellation occurring, with f , , f 2 E B U X, and with q,, q2 and q3 being subwords of r , and r2 that were not “disturbed” during the construction of R from R,. Clearly at least one of the qi is not an R,-piece. This qi identifies an element s E S , , from which r, and r2 were both constructed, and it is clear from the fact that R, contains n o element that is conjugate to its own inverse that scp occurs in r, and r2 t o the same power. Thus taking inverses of both elements, if necessary, r, = s1(scp)s2 and r2 = s3(scp)s4, where sIs2= s3s4 and s2sI= s4s3= s. It follows from these equations that s;IsI commutes with s. Write h = s;IsI. There exists x E X such that x-lsx is s.c.r., and so, from Lemma 4.9(ii) and the fact that R , contains no p.p.p., we see that x-lhx E B. Thus 1 h 13 2. If 1 h 1 = 2, then from hs = sh, we see that s = xux-’, with n o cancellation, and is not c.r., contradicting R , being symmetrised. Thus h E CG (s) n ( B U X ) C CG ( ~ c p ) . Thus r , = s , ( s ~ ) s=~ ~ 3 (h~ c p ) s Z = s3(scp)hs2= s3(scp)s4 = r2. Thus if r , # r2, then we must have:
+
198
B.M. Hurley
Thus R satisfies C'(p). Suppose R contains some element conjugate to its own inverse, then we easily see that this element has two subwords that are mutually inverse, have not been disturbed in the construction of R from R l , and are too long to be R1-pieces. Thus R , contains an element that is conjugate to its own inverse, contrary to hypothesis. Thus no element of R is conjugate t o its own inverse. Suppose R contained a p.p.p. Then S I would contain an element of t h e f o r m bxm,where b E B U X , m a 2 , I b x " I a 3 , x iss.c.r. and ( x 13 1. If almost no cancellation occurs in b(x"), then, by Lemma 6.1, x m - l is an R1-piece and C'(1/6) fails for R,. If cancellation occurs, then either m 3 3, x " - ~is an R1-piece and C'(1/6) fails for R , , or m = 2, and b ( x 2 )= x i * ' * x,xlx2 . x, in normal form, where x = x1* x, and x i x i ' E B, and again C'(1/6) fails for R,. But this contradicts A < 1/6. Thus R contains no p.p.p., and the lemma is proved. 0
-
Remarks. When applying Lemma 7.7, it is usually easiest not to compute exact values for A and p, but to think of the lemma as showing that if we wish to construct R, satisfying "strong enough" cancellation conditions, then it is merely necessary to impose "strong enough" conditions. However, we have also imposed conditions on the mapping 9. The condition that (scp)s be c.r., for all s E S I , is nearly redundant, and one could d o without it altogether by insisting that p and A be small enough. In any case it does not seem to lead to any difficulties in practice. The condition that CG(s) r l (B U X ) C CG(scp), is, by Lemma 7.3, automatically satisfied if R satisfies C.C. and G is an SSC group. However, if G is not an SSC group, then it requires careful checking. 88. Applications
Application I. Simultaneous quotient groups Definition. Let 2 = { L j } j Ebe J a family of groups, then, by a simultaneous quotient group of 2, we mean a group that is a homomorphic image of each Lj, if J # 0, while, if J = 0, we allow any group to be a simultaneous quotient group of 2. Theorem 8.1(A). Let K be an infinite cardinal. Let 2 = { L j } j Ebe J a family of SSC-K groups, with IJ 1 S K . Suppose G is any group satisfying 1 G 1 d K . Then G can be embedded in a simultaneous quotient group Q of 2. 0
Small cancellation theory
199
Before we prove Theorem 8.1(A), we prove a technical lemma. Before stating the lemma, we will define some notation that will be of use throughout this section.
-
Definition. We say that a symmetrised set R, in an NFS group satisfies [A, n ] ' , for A, n > 0 , if it satisfies C'(A), and contains no p.p.p., no element conjugate to its own inverse, and no element of length less than n. We say it satisfies [A, n ] , if it satisfies [A, n]' and C.C., and we say it satisfies [A, n, K ] , if it satisfies [A, n ] and b ( R ) = K .
Lemma 8.2. Let J be a non-empty set, { K , } , E J be a family of infinite cardinals, let L, = (L,,B,, X , ) be an SSC-K,group, for each j E J, and let R, be a symmetrised subset of L,, with basis S,, and satisfying [ A , n, K , ] , for each j E J, where A 6 1/6 and n > 12. Let
F
=
@ IEJ
(N)
Li,
-
the free product with centralised centres of the L,, with the NFS' structure induced on it by the L,. Let R be the symmetrised closure of U , R , in F, and let S = u , S , . Then S is a basis for R, and R satisfies [ A , n, E, K , ] in F. Proof. We first prove the lemma in the case Z ( L j ) =1, for all j E J. In this case, F is simply the free productN of the SSC groups Lj. The base group or F is simply B = * Bp It is clear from Corollary 4.3 that any element of R can be obtained from one in u R j by conjugating by an element of B. Let p be an R-piece, then there exist p u f p u E R, with almost no cancellation occurring in these products. Let pu = b;'rlbl and p u = b;'r2b2 where b l , b2E B, rl E Ljcl,and r2 E Lj(*),for some j(l),j(2)E J. It is clear that we may assume that when bi is written in free product normal form, it does not end in a letter from Ljci,.Now let $aibe the first letter in the (Lj(i),Bjci),Xj(i,)-normal form for ri. If ( p 1 2 2 , in F, then a;'b1b;'a2E B, and from the free product normal form for this element, and the fact that ai E Bjci,,we see that bl = b2 and j(1) = j(2)= j , say. Thus premultiplying p by an element of B, we may assume, if ' ( p1 3 2, that pu = rl and p u = r2 where rl, r2 E Rj. A similar argument applied to the products up and up, shows that, postmultiplying by an element of B, we may assume that p E L p It is now clear that R satisfies C'(A), at least as far as pieces of length at least 2 are concerned. But our assumption n > 12 ensures that shorter pieces cannot give us any trouble. It follows from the above argument that p E F, with J p1 3 2, is an
200
B.M. Hurley
R-piece if and only if there exist b, c E B, u,u E F, u',u ' E L,, for some j E J, and an R,-piece p ' such that p ' u ' f p ' u ' E R,, almost no cancellation occurring, and p = b-lp'c, u = c-'u'b and u = c-'u'b. Using this, it is easy to see that R contains n o p.p.p., and no element conjugate to its own inverse. Clearly S is a basis for R , and the remainder of [A, n, 2 K , ] is easy to check. The general case is proved as follows. We write L, = L , / Z ( L , ) ,and F = *,€, (N)L,. The homomorphisms L, + L,, induce a unique homomorphism F F, which is clearly length preserving, and hence, an N9.Y-homomorphism. We write for the image of Y under this homomorphism, for any Y. As in the proof of Lemma 7.5, R, satisfies [A, n, K , ] , and clearly has basis $. From the Z ( L , )= 1 case, it is clear that R satisfies [A, n, C K , ] , and has basis Since R is the inverse image, in F, of R, it clearly must have all the desired properties, except, perhaps, C.C. But C.C. follows from the fact that every element of R is conjugate to an element of R,, for some j E J. 0
-
s.
Proof of Theorem 8.1(A). If J = 0, then t h e result is trivial, so suppose J f 0. Let H = (G *N F : [G,Z ( F ) ]= l), where G has the NFS structure (G, 1, G - l), and F is as in Lemma 8.2. We pick a sufficiently small A > O , and a sufficiently large n > O , and we let S, be a basis for a symmetrised set R, satisfying [A, n, K ] in each L,. By Lemma 8.2, s = U S , is a basis for a symmetrised set R in F satisfying [A, n, K ] . Clearly R is also symmetrised in H and satisfies the same condition. Now, for all j E J, let T, = uk+,€J (&. U xk U G), where & xk are as in Lemma 8.2. Then 1 T, I S K = 1 S, 1, for all j E J, and so there is an onto map (p, : S, T,. Let (p : S + F be the map whose restriction to S, is cp,, for all j . Since R satisfies C.C., and (scp)s is clearly c.r. for all s E S, we see from Lemma 7.7, that the set S * = {(s(p)s : s E S } , is a basis for a symmetrised set R * in H, satisfying [ p , m,K ] , for some convenient p, m, provided only we choose A, n "correctly". If we choose them so that p s 1/6, and m 3 6 , then letting N * be the normal closure of R * in H, H / N * embeds everything in H of length 0 or 1. Hence H / N * embeds G. However, it is clear, from the definition of S*,that given the obvious presentation for H / N * , we can remove from it, by means of Tietze transformations, all generators the generators of L,, for some fixed, but arbitrary, j E J. It follows that H / N * is a simultaneous quotient group of 2, and so the theorem is proved with Q = H / N * . 0
-
Before we turn to other versions of Theorem 8.1, we note the following corollary.
20 1
Small cancellation theory
Corollary 8.3. A n y countable group G can be embedded in a countable group Q with the property that, for any word w(a, 6, c ) in ( a ,b, c ) , the free group on {a,b, c } , there exists a generating set {x,, y,, 2,) of Q such that w(x,, y,, 2,) = 1, in Q. Proof. By Theorem 7.1 (v), the three generator one relator groups are SSC-N,, groups. There are clearly No of them, and so we can apply the theorem to embed G in a simultaneous quotient Q of them. This group clearly satisfies the conclusion of the corollary. 0 It is well known that any group of order K , where K is an infinite cardinal, can be embedded in a simple group of order K . We vary t h e proof of Theorem 8.1(A) as follows. (a) Replace the family K , of SSC groups, by 9 = 9 U F,, where F. is the free group of rank K . a simple group of order K (b) Replace the group G by containing G. (c) Add to S * all elements of the form:
c,
where g is some fixed element of G, M is some “large” number, and t is any element of some basis of F,, to form the set S * * . (d) Let R * * be the symmetrised closure of S** in H, and let N** be the normal closure of R * * in H, and put Q**= H / N * * . (e) Let N be a normal subgroup of Q** that is maximal with respect to the property that g 6Z N, where g is t h e element used in (c), and let Q = Q * * / N . Then we see that, by choosing M large enough, we can make R * * satisfy any condition [ p , m, K ] that R * satisfies. By requiring p S 1/6 and 1 m I > 6, we can ensure that Q** embeds and since N** 2 N*, where N * is constructed as in the proof of Theorem 8.l(A), Q** is a simultaneous quotient of the family 9. Now Q embeds for, if not, then N n C?# 1, and since is simple g E n N, contradicting t h e definition of N. Finally, Q is simple. For, let P be a non-trivial normal subgroup of Q, then the inverse image P** of P in Q** contains g, and since N must contain all the elements of H introduced in (c), we see that t E P**,for every generator t of F,, and since P** is a quotient of F., P**= Q * * , and so P = 0.Thus we have proved the following version of Theorem 8.1.
c,
c
c,
Theorem 8.1(B). In Theorem 8.1(A), we may assume Q to be simple. 0
202
B.M. Hurley
If we let K = No,and we suppose a two generator group to occur among the L,, we obtain the following result, which has also been proved by Schupp [21], Goryushkin [6], and Hurley [9]. Corollary 8.4. Every countable group can be embedded in a 2 generator simple group. 0 In the proof of Theorem 8.1(A), we can replace G by any generating set for G in the definition of the sets T,, and the conclusion still holds. But note that, if this is done, then no use has been made of the relations between these generators. It follows that, if M a G, then M' n G = M, and so G / M is embedded in Q / ( M o ) . We express this property by saying that G is an E-subgroup of Q. If we ensure p s 1/10, and m 123, then since the embedding of G in H is Frattini, the embedding of G in Q is also Frattini, by Lemma 6.5. From the constructions of 03, we see that the group H c o n c c t e d in the proof of Theorem 8.1(A), can be given the additional NFS structure (H,2, Y),where Y = Z ( ( G - 1) U (Li - Z(L,))))Zand Z is the (restricted) direct product Z = x i e , Z ( L i ) .It follows from Theorem 4.7 that, if J # 0, then Z ( H ) = Z. From Theorem 4.6, it follows that every element of H of finite order is conjugate to one of the form gz, where g E L,, for some j E J, and z E Z, and both have finite order. If we ensure that p < 1/10, and I m I > 3 , we can apply Theorems 6.4 and 6.8 to the construction of Q from H to obtain the following version of Theorem 8.1.
I
(u,,,
Theorem 8.1(C). In Theorem S.l(A), we can assume that G is an E subgroup of Q, and that the embedding of G in Q is Frattini. Moreover, i f J # 0, we may assume that Z ( Q )= X j e , Z ( L i ) the , restricted direct product of the Z ( L j ) , and that any element of Q of finite order is conjugate, in Q, to an element of the form gz, where g E Li, for some j EJ, and z E Z ( Q ) . 0 If, in the proof of Theorem 8.l(A), we assume Z ( L i )= 1, for all j E J, and we replace H by H' = H x,A, where A is an abelian group satisfying / A 1 < K , and T, by Ti = T, U A, then Z ( H ' )= A, and Q is still a simultaneous quotient of 2 that embeds G. If we ensure that p < 1/10, and 1 m I < 3, we see, from Theorem 6.8, that Z ( Q ) = A. If we do not have Z ( L , )= 1, for all j E J, then, by Lemma 7.5, L,/Z(Li) is an SSC group with trivial centre, and since any simultaneous quotient of these groups is also a simultaneous quotient of the Li, we obtain the following version of Theorem 8.1.
Small cancellation theory
203
Theorem 8.1(D). In Theorem 8.1(A), we can assume that Z ( Q )= A, where A is any abelian group satisfying / A =sK . 0
I
The following corollary is immediate.
Corollary 8.5. Every countable abelian group is the centre of a two generator group. 0 Application 11. Embeddings into the join of given groups. For this application, it is convenient to work with countable groups. However, the results can clearly be generalised so that they work for groups of higher cardinality.
Theorem 8.7. Let {G,},eM be a family of at least two groups, and let G be any group. Suppose 1 M /,I G 1 s No, and 1 G, I S No,for all p E M. Suppose each G, contains a proper subgroup B,, such that, for all, except perhaps one, p E M , G, contains a blocking pair over B,. Finally, suppose there exists a group B, generated by a family of subgroups { C , } , , , satisfying C, = B,, for all p E M, and let SB denote the amalgam, obtained from the G, and B, by setting B, = C,, for all p E M. Then there exists a group Q, that embeds the amalgam SB and the group G, and is generated by any pair of groups (G,, G w ) where , p#vEM.
Proof. The proof is very similar to the proof of Theorem 8.l(A), and so we confine ourselves to a few remarks. In the structure product STR (G,, B), with its natural NFS structure, for every pair p, v E M, we can construct a symmetrised set R,, satisfying [A, n,No],for convenient A, n > 0 , with a basis S,," that lies entirely inside G, U G , This is possible because at least one of the groups G, or G, contains a blocking pair over B, or B , and the construction is similar to that used in a free product with amalgamation. We now proceed as in the proof of Theorem 8.1(A), but we replace the use of Lemma 8.2 by the construction described above. 0 v)
Variants of this theorem can be proved, as in the last application. In fact analogs of Theorems 8.1(B), (C) and (D) hold, where, in case (C), we replace Z ( Q ) = X Z ( L j )by Z ( Q )= 1. We leave their formulation and proof to the reader. (Note that, in case (B), a little extra work is needed to ensure that d is embedded in the simple,group Q.) It is also possible to combine Applications I and 11.
B.M. Hurley
204
Application 111. Decision problems in group theory.
Theorem 8.8. A necessary and sufficient condition for a finitely generated, recursively presented, group G to have soluble word problem is that there exists a non-trivial finitely presented group H, such that G can be embedded in every simple factor group of H . Proof. Suppose we have a finitely generated group G with soluble word problem. By Theorem I of Boone and Higman [2], we can embed G in a simple group S, which, in turn, can be embedded in a finitely presented group K. As i s p e l l known, we may assume K to be generated by two elements, a and b, say. Let ( u , u ) be the free group of rank 2, and form the free product L = K * ( u , v ) . Then the set S consisting of the four words:
n 80
a
,=I
( u v i ) ,b
n 80
i=l
n ( s u ~ ) s u -n~ ~( s~u~~, ) s u - ~ ~ ~ ~ , 80
80
( U U ~ + ~ ) , i=l
i=l
is a basis f o r g m m e t r i s e d subset R of L, equipped with an appropriate NFS structure, where s is some fixed element of S - 1. We let N be the normal closure of R, and put H = L / N . H is visibly finitely presented, and, since R satisfies C’(1/6), embeds K and hence embeds G. Moreover, we easily see that every non-trivial quotient group of H embeds G, and so, in particular, every simple quotient group of H contains a copy of G. Now suppose we have a finitely generated group G, with presentation (gl, g 2 , . . . ,g, : K ) , which can be embedded in every simple quotient group of the non-trivial finitely presented group H = ( x l , . . . ,x, : R). We let u I ,u 2 , .. . , v, be variables. If L # 1 is a group, then we say p # 1 E L is consistent in L if ( p ) = # L, and otherwise, we say it is inconsistent in L. We define a, possibly finite, ascending sequence N o d N , d N 2 of finitely generated normal subgroups of H as follows, where we suppose we are given an enumeration U of all s-tuples of words on { x l , .. .,x,}. We define No = 1. For i 3 1, suppose N,-] has been defined. Let ( w , , w2,..., w.) be the ith element in the enumeration U. There are three cases. (a) If there exists a word u ( v , ) such that u ( g , ) = 1, in G, but u ( w , ) is inconsistent in H / N , - , , then define N, = N,-,. (b) If (a) does not obtain, but there exists a word u ( v , ) such that u ( g , ) # 1, in G, but u ( w , ) is consistent in HIN,-,, then define N, to be the normal closure, ,in H, of N,-, and u ( w , ) . (c) If neither (a) nor (b) obtains, then Nk is undefined for k z = i .
205
Small cancellation theory
Suppose this sequence were infinite, then, since H is finitely generated, and the N, are proper subgroups of H, there is a maximal normal subgroup M of H containing all the N,. But H cannot be embedded in the simple quotient group H I M of H, because, for every s-tuple ( w l , .. . ,w s ) , of words on {x,, . . . ,x,}, either there exists a word u ( u , ) such that u ( g , ) = 1, in G, but u ( w , ) # 1 in HIM, or there exists a word u ( u , ) such that u ( g , ) # 1, in G, but u ( w , ) = 1 , in HIM. This contradicts our assumption that G can be embedded in every simple quotient group of H, and shows that the ascending sequence of normal subgroups of H is finite. Let Nk be the last member of the sequence { N , } .And write = H / N k . From the definition of the sequence, if ( w ~wz, , . . . , w , ) is the (k + 1)th element of V , then: { u ( u , ) : u ( g , ) # l in G } = { u ( u , ) : u ( w ,is) inconsistent in
R}.
Since R is finitely presented, the set on t h e right-hand side is r.e., by an obvious diagonal enumeration, and hence the set on the left-hand side is r.e. Since G is recursively presented, { u ( u , ) : u(g,) = 1 in G} is also r.e., and so must be recursive. Hence G has soluble word problem. 0 We remark that, using the techniques of Application I, we may assume that H is a simultaneous quotient group of any finite family {Lj},EI of finitely presented SSC groups. Problem 2. Can the hypothesis that G be recursively presented be removed from Theorem 8.8? The proof that G has soluble word problem in Theorem 8.8 is nonconstructive, and there seems to be n o obvious way of replacing it with a constructive proof. Thus there seems to be no reason why an analog of Theorem I11 of Boone-Higman [2] should hold, in this case. Indeed, it seems rather unlikely that there should be such an analog. This situation prompts us to ask the next question. Problem 3. Does there exist a non-trivial finitely presented group H with the property that every finitely generated group G with soluble word problem is embeddable in every simple quotient group of H?
Our Theorem 8.1(B) is a generalisation of Theorem I of Schupp [21]. It is possible to use our theory to obtain a generalisation of his Theorem 11. The proof is a simple adaption of Schupp’s proof, using
206
B.M. Hurley
methods from the proof of Theorem 8.1. We therefore content ourselves with a statement of this theorem. For the definitions of the terms used in the theorem, we refer the reader to Schupp’s paper.
Theorem 8.9. Let A be a finitely presentable group which has a finitely presentable quotient group L which is an SSC group. Let rr be any finite presentation of A, and let P be a Markov property which is possessed by the trivial group. Then the problem of determining P restricted to quotients of rr is insoluble. Also the generating problem for A x A is insoluble. 0 Application IV. Residual simplicity. Theorem 8.10. A n SSC group with trivial centre is residually infinite simple. Proof. We actually prove the following stronger statement.
Let K be an infinite cardinal. Suppose L is an SSC-K group, with trivial centre, and let G be a group. Suppose I G I,[ L I K . Then, if P is any finite set of non-trivial elements of L, there exists a simple quotient group Q of L, containing a copy of G, such that p f 1, in Q, for all p E P. Proof of above statement. By Lemma 7.6, L contains a symmetrised set R, satisfying [A, n, K ] , for some convenient A, n > 0 , which is contained in We embed G in a simple group G, of order K , and we let H = d * L * F., where F, is the free group of rank K . We proceed, as in the proof of Theorem 8.1(B), to construct a symmetrised set R * , in H, satisfying [ p , r n , ~ ]for , suitable p , m . We may suppose R * contains (i) words of the form gr, where r E R and g is a generator of d or F., (ii) words that express the generators of L in terms of those of F,, and (iii) words that ensure that all non-trivial quotient groups of Q * = H / N * , where N * is the normal closure of R * in H, contain a copy of G. We chose p , m so that G is embedded in Q*, and, as before, construct a simple quotient group Q of Q*. Now Q is a simple quotient group of L, containing G. If p = 1, in 0, for some p € P, then r = 1, in Q, for all r E R, and hence g = 1, in Q, for all generators of F.. But Q is a quotient group of F,, and so this would imply Q = 1, a contradiction. Hence p # 1, in Q, for all p€P. 0
npEp(p)L.
Small cancellation rheory
207
Application V. The height of SSC groups. We assume familiarity with Pride [16], and we shall freely use the terminology of Pride’s paper. Theorem 8.11. Countable SSC groups have infinite height.
Proof. Suppose G is a countable SSC group. Let So,S,,. . . be an infinite sequence of countable groups. By Theorem 8.9(A), we can construct an infinite group 0 that is a simultaneous quotient of the family {G}U {Si * Si)T=,,. For n = 0,1,2,. . ., define Q, to be the group whose presentation is obtained from the obvious presentation (i.e., the presentation constructed in the proof of Theorem 8.9(A)) for Q by deleting the generators of Si * Si, for i 2 n, and deleting all relators that involve these generators. It is clear that Q, is a simultaneous quotient of the family {G} U { S , * Si}yZd, and that Qn+lis a homomorphic image of Q,, for all n. It is immediate from [16] Theorem 2.1, and the definition of S , which follows [16] corollary 2.1.1, that G 3 Qo t QI3 Q2 ?=. . .?=Q, ?= . . . . The theorem will be established if we can show # On, for all n. We note that, for some choice of the groups Si, On+, that the construction of Q can be done in such a way that it does not depend on the defining relations of the Si, and so, for all i, we can choose Si to be a countable simple group that is not contained in Qi. (Note that by Theorem 8.1(B) there are 2”. two generator infinite simple groups, and Qi can only contain Nosuch groups.) Since Q,+, is a homomorphic image of S, * S,, it is clear that any non-trivial homomorphic image of a subgroup of finite index in must contain a copy of S,, and so cannot be contained in 0,. By [16], Theorem 2.1, we have Q n + l # On,for all n, as required. 0 If G is a countable SQ-universal group, then this can usually be established by arguments that use small cancellation theory and the fact that SQ-universality is a large property, in the sense of [16]. This prompts us to ask the following question. Problem 4. Do all finitely generated SQ-universal groups have infinite height? We remark that Hurley (unpublished) has constructed a countable SQ-universal group that is minimal, in the sense of Pride [16]. However, a finitely generated SQ-universal group cannot be minimal, because it must have a just infinite quotient group.
B.M. Hurley
208
Application VI. The rank of certain direct products. Definition. If G is a finitely generated group, then by t h e rank r ( G ) , of G , we mean the minimum size of a generating set for G. Theorem 8.12. If G i s a countable SSC group, and C is any countable group, then C can be embedded in a quotient group 0,of G , of rank 2, such that, for all non-trivial finitely generated groups B,
r ( B )+ 1, if B is free or cyclic, B ) = ,.(B), otherwise.
[
r(Q
Remark. The theorem says that usually r(Q x B ) = r ( B ) . It is clear that an exception to this rule must occur when B is cyclic. If B is free, then r(Q x B ) = r ( B ) would imply that Q x B was free, which contradicts the fact that a free group cannot be decomposed into a non-trivial direct product. Hence an exception must also occur when B is .free. Thus the value of r ( Q x B ) given by the theorem is always as small as possible. Proof. Let uI, ur, u 3 , . . . be an infinite list of variables, and let U be an enumeration of all pairs (r, w ( u , ) ) ,where r 3 2 is an integer, and w ( v , ) is a reduced word on {ul, u 2 , . . . , u,} that both begins and ends with vT'. Note that we do not require w ( u , ) to involve all the variables in u I , u 2 , . . . , u,. Let R be a symmetrised subset of G satisfying [1/30,30, No].Let S be a basis for R. Since Card(S) = No,we can write it as a disjoint union
s = I3 s,, =o a
where Card(So)= N o , and, for i = 1,2,. . . ,Card(S,) = r,, where (r,,w , ( v , ) )is the ith element in the enumeration U. We suppose that each S, is given a standard ordering (s,,s2,. . . ,s,,). For each i = 1,2,. . . , we define the set T,(u,)of words on { u , , u 2 , . . ., u,,} to be the set of all words of the form:
for all k larger than 1 w,(u,)I. Note that the r,,k are reduced as written. We now write T,(u,) as a disjoint union T:(u,)U T';(u,),where Card(T:) = r,, Card(T',')= No, and we equip both sets with a standard ordering.
Small cancellation theory
209
We now suppose that cl, czrc,, . . . is a generating set for C, and that . . . is a generating set for G consisting of elements of length 0 and 1. We set g,, gz,g,
H = C * G * ( i g= l K ( i ) ) , where K ( i ) is a free group with basis { u , , ~u ,, . ~. ., . u, ,,}. We now let R * be t h e symmetrised closure, in H, of all words of the forms: (a) cqlzqr where z, is t h e 4th word in the standard ordering of So, (b) u;:z,, where z , is the 4th word in the standard ordering of S,, (c) u;:t,, where t, is the 4th word in the standard ordering of T ( u , . , ) , and (d) g i l t , , where r, is the 4th word in the standard ordering of T':(u,.,). By an argument similar to that used in the proof of Theorem 8.1, we see that R * satisfies [1/10, 10,No], in H. Thus, if N * is the normal closure of R * in H, then Q = H / N * embeds C. As before, Q is a quotient group of G. We must now show that Q satisfies the conclusion of the theorem. We first note that, from the construction, Q is a simultaneous quotient group of the family {P,}:=l, of groups, presented as follows:
P, = ( v , , u z , . . ., u,, : vi'r, (t, is the 4th word in the standard ordering of T:(v,),1 c 4
r,)).
Let us fix some i 2 1, and write (r, w ( v , ) )= (r,, w,(u,)). Let E be the one relator group with presentation
E = (xi, ~
..,X, ;~ ( x , ) ) .
2 , .
Let Y be t h e split extension of E by t h e free group with basis {vl, v 2 , . . ., u,}, with the v, acting as follows: v;'x,u, = x,'x,x,, for all i , j = 1 ,. . . , r. Let J = E { v ? , . . . ,u"}E. Then, by Lemma 3.2(ii), ( Y , E , J )is an NFS group. Note, however, that it need not be an SSC group. Let R * * be the symmetrised closure, in Y, of all words of t h e form v;'?,x,, where, for the fixed i, and for q = 1 , . . . ,r, the q th word in the standard ordering of T:(u,)is r,. We leave it to the reader to check that R * * satisfies C'(l/lO), in Y . Note that, as well as doing the usual calculations, one must check that the elements r E R ** are not, themselves, R **-pieces. This amounts to showing that, if b, c E E and r E R **, then br = rc implies b = c. This is very easy t o do, but since, for instance, the symmetrised closure of {fixl} does no? satisfy this condition, it is non-trivial.
210
B.M. Hurley
Let N * * be the normal closure of R * * in Y , and set D = Y / N * * . . Since the r, are words in the v,, we can eliminate the x, from the obvious presentation for D, by means of Tietze transformations, and hence, we see, r ( D )s r. Let P' be the normal closure, in D, of w ( u , ) , and let D' = D/P'. From their definitions, we see, r, E P', in D, for q = 1,. . . , r. It follows that v, = x,, in D', that D ' = E, and that P' fl E = 0. Where, since R ** satisfies C'(l/lO), E is a subgroup of D. But note that E a D, and, since r, E P', for all q, D = gp(P', E ) . It follows that D = P'x E. Setting x, = 1, in the presentation for D, and deleting the x,, we obtain the presentation which we used to define P,, for our fixed i. Thus D = P, x E. We now let i vary, and accordingly add the suffix i to E defined above. We have shown r(P, X E , ) s r,, for all i = 1,2,. . . . Now let B be any non-free group of finite rank r 2 2 . For some i, B is a quotient group of E,, and r, = r. Since Q is a quotient group of P,, we see that Q x B is a quotient group of P, x E,. Hence r ( Q x B ) s r(P, X E,)< r, = r ( B ) . Now suppose B is cyclic or free, and let it have rank r. For some i, B is a quotient group of E,, with w ( v , )= u l , and r, = r + 1, and we conclude r ( Q X B ) < r ( B )+ 1. Since the value of r ( Q x B), given in the statement of the theorem is as small as possible, the above inequalities can be strengthened to equalities. To see that r ( Q ) = 2, we merely need to note that it is a homomorphic image of the two generator group Q x (x, y : x 2 ) , that Q f 1, for it contains a copy of C, and that Q is not cyclic, for cyclic groups do not satisfy the conclusion of the theorem. 0 We remark that, as with Theorem 8.1, one can prove variants of Theorem 8.12. In particular, a version of the theorem holds in which Q is simple. The next result is a corollary of this version of the theorem. Proposition 8.13. There exists a family 9 of simple groups satisfying : (i) Every countable group C can be embedded in some member of 9, and (ii) if S , , S 2 , .. . ,S, E 9,where n 3 1 is finite, then S , x S2 x . . x S, is a two generator group.
69. Concluding remarks The theory bas many applications which, for reasons of space, we have been unable to include in 08. We shall mention a few of these here, without going into details.
Small cancellation theory
21 1
One construction we have hardly used is the HNN construction in the category N9.9'. Indeed, the only reason we included it in $3 is that it provides the quickest way we know of proving the results we need about structure products. One of the difficulties with this construction is that it does not, in general, preserve the property of being an SSC group. However, it can be used, for instance, to prove a version of Theorem 8.1, in which all elements of Q of the same finite order k are conjugate to one another. Moreover, it is possible to exploit the pathologies that can occur when using the HNN construction in N9.9'. For instance, observe that, if R is a symmetrised set in an SSC group G, and R * is its symmetrised closure in an N 9 Y HNN extension G *, then there is no obvious reason why R and R * should not satisfy the same C- and T-conditions, and yet also satisfy b ( R *) < b ( R ) . This observation has lead us to a new proof of the well-known Embedding Theorem of Higman [7). The construction is rather nice and allows us to prove results such as the following.
Theorem 9.1. There exists a finitely presented group G, with soluble word, power and order problems, with a subgroup F, which is free of rank 2, and a recursive functional 0 from the natural numbers to words on the generators of G, such that: (i) if Re is the r.e. subset of F with r.e. index e, then, in G, F f l @(e)" = RP, and so ( G : @(e)) embeds (F: Re),and (ii) the embedding of ( F : Re) in ( G : @(e)) is Frattini, and preserves the Turing degrees of the word, power and order problems. 0 Theorem 9.2. Every recursively enumerable c -degree contains the word problem of some finitely presented group. 0 Theorem 9.3. A necessary and sufficient condition for an abelian group A to be the centre of some finitely presented group is that it should be recursively presentable. 0 Remarks. We refer the reader to Rogers [17] for the definitions of r.e. index and c-reducibility, and to Collins [ 5 ] for the definitions of the power and order problems. For the definition of r.e. index of a subset of the free group F, of rank 2, to make sense, we, of course, assume some Godel numbering of F into N to be given. In Theorem 9.3, by recursively presentable we mean, of course, possessing a presentation with an infinite, but recursive, set of generators a l , a 2 , .. . , and an r.e. set rl, rz, . . . of relators on these generators.
212
B.M. Hurley
A topic we have not mentioned is t h e study of symmetrised sets, - that are fixed, setwise, by some satisfying small cancellation conditions, group of automorphisms of an NFS group. A typical example of a result obtainable by such study is the following.
Theorem 9.4. Let H, K be countable groups, neither of which can be decomposed into a non-trivial free product. Suppose further that 1 H 1 3 2, 1 K I 2 3 , and that K is not an infinite cyclic group. Then all normal extensions of H * K are SQ-universal. 0 We remark that P. Hall (unpublished) has constructed an example of a countable SQ-universal group G, that has a normal extension (actually a split extension by an infinite cyclic group) that is not SQuniversal. The reader will notice that Theorem 9.4 gives us no information about what happens when both H and K are infinite cyclic groups, and so we ask the following question.
Problem 5. Are all normal extensions of a free group of rank 2 SQuniversal? Another topic we have not mentioned, is that of small cancellation towers. That is, possibly infinite, towers G IS G2C . * . of groups, where Gi, for i 3 2 , embeds Gi-l, by a small cancellation theory argument. We remark that a group with the same property as the group of P. Hall, mentioned above, can be constructed by means of such a tower, that the countable, minimal (in the sense of Pride [16]), SQ-universal group, mentioned after the proof of Theorem 8.11, was constructed by such a tower, and that Problem 7 of Pride [16] has been settled, in the negative, by a small cancellation tower argument. Finally, a small cancellation tower construction can be used to prove the following.
Theorem 9.5. Every countable group C can be embedded in a nonsimple countable group G that is isomorphic to all its non-trivial quotient groups. 0 We shall say a word about the future of small cancellation theory. It is widely believed that it should be possible to iterate small cancellation arguments. That is to “do” small cancellation theory over small cancellation groups. We believe that this should be possible, in the near future. It is possible that such a development would have some relevance to the question of whether or not every group with soluble word problem can be embedded in a finitely presented simple
Small cancellation theory
213
group, the answer to which we believe to be “No”. The reader who thinks of using Theorem 8.8, in some way, to obtain the answer “Yes” to this question had better be warned that one can prove a version of this theorem, in which every non-trivial quotient group of H has insoluble word problem, in which case, it can have n o recursively presented simple homomorphic image. Returning to t h e non-iterated theory, we have a feeling that constructions of t h e type used in the proof of Theorem 8.12 might have some relevance to disproving a famous conjecture in Topology. And now a word about the history of t h e results in this paper. The idea of doing the “classical” cases of small cancellation theory in a uniform way has its origins in my (for the remainder of this section, for the sake of naturalness, the pronoun “I” shall be used instead of the conventional mathematical “we”) M.Sc. dissertation [9]. This was conceived, written and typed within the space of three weeks, and so contains many errors. However, it does contain a correct proof (The first, I believe, but then, what does it matter who proves things first?) of the fact that every countable group can be embedded in a two generator simple group. The theory remained dormant, for some time, mainly because I got bored with it. My interest revived, when, at the Oxford meeting, I learned of the existence of the axiomatic lengthfunction theory, and it seemed to me that there might be some connection between this theory and my theory of normal forms in groups. As to the value of this work, I am unsure. Dividing things, roughly, into “reality” and “waxworks”, most Mathematics falls within the category of “waxworks”, and if one thinks that then, presumably (apologies to Lewis Carroll) one ought to pay. Perhaps one can do no better than t o quote Wittengenstein “Wenn es einen Wert gibt, der Wert hat, so muss e r ausserhalb alles Geschehens und So-Seins liegen”. 110. Acknowledgements
Knowledge is not gained in isolation, and my ideas on this subject have benefited from discussion with many people, and I would find it impossible to compile a complete list. However certain people need special mention. The idea of doing small cancellation theory over NFS groups originated in my M.Sc. dissertation, and I must thank my (long suffering) supervisor Professor G . Higman. Incidentally, it was h e who started me thinking about small cancellation theory by showing me a paper of Schupp. I thank Professor R. Lyndon, who read an earlier draft of this paper and made many detailed suggestions as to how it
214
B.M.Hurley
might be improved. I must thank Professor W.W. Boone, without whose encouragement, this paper might never have been written. Finally, I must apologise to the other contributors to this volume for the late appearance of the final version of this paper.
References [I] B. Baumslag, Residually free groups, Proc. London Math. Soc. 17 (1967) 402-418. [2] W.W. Boone and G. Higman, An algebraic characterization of groups with soluble word problem, J. Austral. Math. Soc. 18 (1974) 41-53. [3] J.L. Britton, The word problem, Ann. of Math. 77 (1963) 16-32. [4] I.M. Chiswell, Abstract length functions in groups, Math. Proc. Camb. Phil. Soc. 80 (1976) 451-463. [5] D.J. Collins, The word, power, and order problems in finitely presented groups, In: Word problems (North-Holland Publishing Co., Amsterdam, 1973) pp. 401-420. [6] A.P. Goryushkin, Imbedding of countable groups in 2-generator groups, Mat. Zametki. 16 (1974) 231-275. (71 G. Higman, Subgroups of finitely presented groups, Proc. Royal Soc. London, Ser. A 262 (1961) 455-475. [8) A.H.M. Hoare, On length functions and Nielsen methods in free groups, J. London Math. Soc. 14 (1976) 88-92. [9] B.M. Hurley, Embedding theorems for groups, M.Sc. dissertation (Oxford, 1972). [lo] B.M. Hurley, On length functions and normal forms in groups, Math. Proc. Camb. Phil. Soc. (to appear). [ I l l R.D. Hunvitz, On the conjugacy problem in a free product with commuting subgroups, Math. Ann. 221 (1976) 1-8. [12] R.C. Lyndon, Length functions in groups, Math. Scand. 12 (1963) 209-234. [13] R.C. Lyndon, On Dehn’s algorithm, Math. Ann. 166 (1966) 208-228. [I41 W. Magnus, A. Karrass and D. Solitar, Combinatorial group theory (Wiley, New York, 1966). [IS] K. Murasugi, The center of a group with a single defining relation, Math. Ann. 155 (1964) 246-251. [16] S.J. Pride, The concept of “largeness” in group theory, in this volume, pp. 299-335. [17] H. Rogers, Jr., Theory of recursive functions and effective computability (McGrawHill, New York, 1967). [I81 G.S. Sacerdote and P.E. Schupp, SQ-universality of HNN and I-relator groups, J. London Math. Soc. 7 (1974) 733-740. [19] P.E. Schupp, Small cancellation theory over free products with amalgamation, Math. Ann. 193 (1971) 244264. [20] P.E. Schupp, A survey of small cancellation theory, In: Word Problems (NorthHolland Publishing Co., Amsterdam, 1973) pp. 569-589. [21) P.E. Schupp, Embeddings into simple groups, J. London Math. Soc. 13 (1976) 90-94.
S.I. Adian, W.W. Boone, G . Higman, eds., Word Problems I1 @ North-Holland Publishing Company (1980) 215-245
ON ISOMORPHISMS OF DIRECT POWERS J.M. Tyrer JONES Department of Pure Mathematics and Mathematical Statistics, Cambridge University
81. Introduction
In [2] Corner shows that, given any positive integer n, there exists a countable abelian group H such that H' = H sif and only if r is congruent to s modulo n. (Where H' denotes the direct sum of r isomorphic copies of H.) In this paper, we show that similar results can be obtained for finitely generated non-abelian groups. For the case n = 1 it is sufficient to construct a non-trivial finitely generated group isomorphic to its own direct square. This was done in [3], and the method of construction used there can easily be adapted to the needs of the general case. For given n > O we wish to construct a group G such that (i) G G"" (ii) G g G ' for l < r < n + l . The fact that G' = G " if and only if r is congruent to s modulo n follows easily from (i) and (ii). For ease of notation, we shall take the given integer to be n - 1, so that we need to construct a group G satisfying (i)' G = G" (ii)' G g G' for 1 < r < n. The 'building blocks' of our construction are t h e simple groups constructed by Camm in [l]. We give a brief description of these groups at t h e beginning of $2, but will assume a general familiarity with the notation and methods of calculation used by Camm. We will make frequent use of the structure theorems for amalgamated free products and HNN extensions. For the terminology of amalgamated free products we will follow [4], and for that of HNN extensions [ 5 ] . General structural results to be found in either of these two sources will be utilised without further comment.
82. The construction A = ( a , p ) and B = ( b , q ) are two-generator free groups. Z is t h e set 71 C
J.M.T. Jones
216
{ 2 1, 2 2, 2 3 , . . . } and p, a, T are permutations of I. Let U be the subgroup of A (freely) generated by the set {a'pP'":i E I } and V be the subgroup of B generated by the set {b'q"'":i E I}. Set g, = alpp(') and h, = b'q"'", and form the free product R of A and B amalgamating U with V under the isomorphism g, h,(,),i = 21, 2 2 , 2 3)... . In [l], Camm shows that if p, a and 7 are suitably chosen, then the group R is simple. Her definitions specify the effect of p, a and 7 on some of the positive integers. The definitions elsewhere can be made arbitrarily, subject to one technical condition, which states essentially that I p ( i ) l , J a ( i ) (and I ~ ( i ) lmust not be very much bigger than l i l . It is easy to see that if, for a given integer t (however large), the negative integers are divided into 'blocks' B 1= { - 1, - 2, - 3 , . . . - t } , B 2= { - ( t + l), - (t + 2), . . . - 2t}, B , = { - (2t + l), - (2t + 2), . . . - 3 t } etc., and p, a and T are so defined that for each r = 1,2,3,. . . p ( - rt) = a( - rt) = T( - rt) = - rt and p, a and T fix each B, setwise, then Camm's conditions are satisfied, and a simple group R results. We now proceed to the details of our construction. W e are given an integer n 3 3 , and wish to construct a group G satisfying conditions (i)' and (ii)' of $1. W e divide the negative integers into blocks of size 9n, and define permutations p, a and T of the negative integers by p ( - 9nt + r ) = - 9nt
+ pl(r) a( - 9nt + r ) = - 9nt + a l ( r ) 7(-9nt+r)= -9nt+7,(r)
O c r c 9 n - 1 , r = l , 2 , 3 ,...
where p l , ul, T~ are those permutations of the set {0,1,. . . 9 n - 1) defined by pl = (13,14), u1= ( 5 , 6 ) , = (9n - 10,9n - 9). By using Camm's definitions on the positive integers, we obtain a simple group R = A *"="B. The subgroup A = ( a , p ) of R is free, and so we may form an H N N extension H of R by adding a generator c and relations c-'ac = a, c-lpc = p", viz. H = ( a ,p , b, q, c : a ' p = bT(')qm('), i = * 1 , 2 2 , * 3 ,..., c - ' a c = a , c - ' p c = p " ) . We next construct a group T = ( d , e,f : d-'fd = f', e-lde = d ' ) as follows: Let (f) be an infinite cyclic group, and form an H N N extension (d,f : d-lfd = f') of (f). The element d in this group has infinite order, and so we may make a further H N N extension by adding a generator e and the relation e-'de = d 2 . The resulting group we call T.
217
On isomorphisms of direct powers
Britton's Lemma shows that the subgroups ( b - ' a , c ) of H and (e,f) of T are both free groups of rank two, and so we may form the free product of H and T amalgamating ( b - ' a , c ) with (e,f) according to the relations
b-'a = e
and
c = f
The resulting group S has presentation
S = ( a , p , b , q , c , d : a ' p P " ' = b T c ' ) q ~ (*1,*2,*3 '),i= ,..., c - ' a c = a , c-lpc
= p " , d-lcd = c2,
( b - ' a ) - ' d ( b - ' a )= d 2 ) .
We now form the Cartesian product C = II:,,S, of copies of S indexed by N = {1,2,3,. . .}. Elements of C will be written as infinite vectors. The corresponding direct product D = II,,,S, is the subgroup of C consisting of all those elements whose projection on S, is the identity for all but finitely many i E N . We shall denote by H,, T,, A,, etc. the subgroups of S, corresponding to the subgroups H, T, A, etc. of s. G is defined to be the subgroup of C generated by the elements x = (b-'a, b-'a, b - ' a , . . .),
Y = (P, P, P , . * * ), z = (P9ap-', p "ap-", ~ ~ ' a p. . -. ),~ ~ ,
,... ), u = ( d , d , d ,... ). u =(c,c,c
We shall show that G satisfies conditions (i)' and (ii)' of 91.
63. The isomorphism In this section we prove that the group G defined above satisfies condition (i)' of $1. Theorem 3.1. G is isomorphic to G". Proof. It is easy to check (by using the definitions p(1)= a(1) = ~ ( 1 ) =1, specified by Camm, and the definitions of p, u and T on the negative integers given in $1) that the following equations are consequences of the amalgamation relations of R, and hence hold in S. (1) b - I a ( P 9 n l - 9 k a p -9n1+9k+I )a-lb = P 9 n r - 9 k a p - 9 n r + 9 k + I for all k = 0 , 1 , ... n - 2 , a n d a l l t s l .
218
J.M.T. Jones
(2) b - l a (pgnl-Y(n-l)ap-'hr+%n-I)+l )a-Ib = p % T - % n - I ) + I 131. (3) b - 1 a (PYnl-14aP-9nf
2
a p
-%tt+%n-l)+l
for all
for all t 3 1. -yn'+4qfor all t 3 1. Using equations (1) and (2) we deduce that )a-lb
= P%-13ap-ynf+15
(4) b-la(pYnf-xap-Ynt+4)a - lb = p'"l-3ap
[ x - i , z - l=~@lnap-', i , i,... 1 , ~ ~ ~ + ~ ~ ~ ~ i,p1Hn+1nap-18n-9,i - ~ ~ - ~ , 1 , 1 ,...I , . . . =
z I say.
Equation (3) gives [X-l,yYn-24 - I Yn+24 ]=@,1,1, ... l , p , l , l , ... l , p , l , ... ) = y l say zIy and equation (4) gives [yYn-llz
;ly9n+13
, x-'1
=
(q,I,l,. . . 1,q, 1, 1, . . . l , q, 1,.. .).
Since b-'a = qp-l, we deduce that G also contains
xI= (b-'a, 1,1,. . . 1, b-la, 1,1,. . . 1, b-'a, 1,. . .). The relations [ d , b-'a] = d and [c, d ] = c of S show that the elements u I =
and
[u,xI]=
(d, 1,1, ... 1,d, 1,1, ... l , d , l , . . . )
uI= [ u , u l ] = (c, 1 , l )... l , c , l , l , ... l , c , l , ...)
also lie in G. Let G I be t h e subgroup of G generated by xI,y , , z I , u1 and u I . Then G I is t h e projection of G on the subgroup C , of C consisting of all those elements of C whose projection on S, is trivial for all i f 1 mod n. By considering the commutator [x-I, Y ' " - ~ zi'y-"'+'] and proceeding as above, we deduce that G also contains its projection G2 on C2 (the set of all elements of C whose projection on S, is trivial for all i f 2 mod n ) , and similarly for eath i, 1 i n. Clearly, G = G I X G2 x . * * X G,. The groups G,, 1 s i s n, are all isomorphic, and so it only remains to prove that G I s G. Now
GI = (
~
1
~
z~I , 1 u ,I , v I )
= (XI,YI, y?-10Zly;9n+9, U I ,
But
= (Xl,Yl, Uly~-~oZly;yn+9U;'yl,it,, Ul).
y~~-10fly;9n+Y
so
V')
- (p9.up-9-, 1,1,. . . l,plH"ap-IH", 1, 1,. . . 1,p27nap-27n, 1,. . . )
On isomorphisms of direct powers
lyy -loz ,y ;Yn
and
+Yu
;I
= (pyap-y,1,1,. Yn-lOZly;Yn+Y
uly 1
219
-1
u1
y1
= (p9ap-8,1,1,.
. . l , p'8ap-'8,1,1,. . . 1, pZ7ap-Z7,1,. . .) . .1,p'8ap-'7,1,1,. . .1,p2'ap-26,1,. ..).
The isomorphism betwen G and G I is now clear. This completes the proof of Theorem 3.1. 04. Some preliminary lemmas
We wish to show that G 8 G' for 1 < r < n. Since G = G", it is clear that each of G and G' is both a subgroup and a homomorphic image of the other. Consequently many of the standard methods of demonstrating that two groups are not isomorphic are not applicable in our case. For this reason we must proceed by direct calculation. We first prove Lemma 4.1. G contains the direct product D = II,,,Si. Proof. As in 43, we see that t h e following relations hold in S. ( 5 ) b - I a (p9nr-(9k -8)ap -Ynr+(Yk -8)+ I)a -1b = p 9 n r - ( Y k - 8 ) a p - Y n ~ + ( 9 k - 8 ) + 1 for all 121 andQSkSn. (6) b - 1 a ( p - 1 a p 2 ) a - 1=b ~ - ~ a ~ p ~ q - ' . (To prove (6), we have to use some of Camm's definitions for p, u and T on t h e positive integers, namely, p ( l ) = U ( l ) = T(1)= 1,
p(2)=7(2)=2, a ( 2 ) = 3
p ( 3 ) = 3 , U(3)=7(3)=4,
p(4)=5, V(4)=2, T(4)=3
p ( 5 ) = 4 5 ) = 4 5 ) = 5,
p ( 6 ) = 4 6 ) = ~ ( 6=) 6.)
From (5) and (6) it follows that [x-l,y-loz-ly'o] = @-3a3p6q-5p-2a-1p,1,1,. . . l , . . .).
The first entry is a non-trivial element of the subgroup R 1of SI. But the projection of G on Sl is the whole of SI, and R , is simple, so G contains elements (g, 1,1,. . . l , 1,. . . ) for each g E R. In particular, G contains the element h = (b-la, 1,1,. . .1,1,. . .), and hence t h e elements [u,h ] = (d, 1,1,. . . 1,1,. . . ) and [u, [u,h ] ]= (c, 1,1,. . .1; 1,. . . ) also lie in G. Thus G contains t h e whole of Sl. Similarly, by considering [ ~ - l , y - ' ~ z - l y we ~ ~ ]see , that G contains S2. Repetition of the argument shows that G contains Si for all i 3 1, and hence that G contains D. 0
220
J.M. T. Jones
Lemma 4.2. S is centreless. Proof. By t h e amalgamated free product structure of S, we have
Z ( S )c Z ( ( b - ' a ,c ) ) . But (!.-'a,c ) is a two-generator free group, and so has trivial centre. 0 Lemma 4.3. Zf N , and N 2 are normal subgroups of S which intersect trivially, then one of NI and N 2 is trivial.
Proof. Suppose that neither NI nor N 2 is trivial. We first show that one of them lies in t h e amalgamated subgroup K = ( b - ' a ,c ) of S. Suppose N , p K. Then N , must contain a cyclically reduced element of S of normal form length greater than one. For suppose not. If NI contains a cyclically reduced element g, say, of length one, then g lies in one of the factors H, T of S, but not in K. Let x be any element of the other factor not lying in K. Then the commutator [x,g] is a cyclically reduced element of length four, lying in N , . So suppose that every cyclically reduced element of N , lies in K. NI K, so there is an element ghg-'(g E S, h E K ) , of N , which does not lie in K. By taking a suitable conjugate of this, we find that NI contains an element which lies in a factor, but not in K , and t h e result then follows as above. Let sIs2---sm be a cyclically reduced element of N , , written in normal form with m > 1. Without loss of generality, we may assume s1 E H and s, E T. Since IH : K I # 2, we may find an element g E H \ K such that gsl $Z K. Thus g~,s~---s,g-~ E N1 and is not cyclically reduced. Consequently N , contains both cyclically reduced and non-cyclically reduced elements. Let x be any element of N,. Since N , and N 2 intersect trivially they generate their direct product, so x commutes with both a cyclically reduced and a non-cyclically reduced element, and thus, by a length argument, x E K. Thus N 2 C K, as require$. Restricting our attention to H, we see that H has a non-trivial normal subgroup N 2 lying entirely within K = ( b - ' a ,c ) . We shall show that this cannot happen. Let g = (b-'a)"lcaI(b-'a)".c~---(b-'a)",c Br E N2.
Then bg6-I also lies in N2, and we have an equation of the form
6 (b-'a)"ic6 ___ (b-'a)"rcarb-'
= (b-'a)'ic *z(b-la)%c* 2 - - -
There are, without loss of generality, two cases to consider:
(b-'a)'sc **.
On isomorphisms of direct powers
22 1
(i) p,# 0 and (ii) g = (b-la)”]. In case (i), an application of Britton’s lemma to the HNN extension H of R shows that one of b-’ and b-’(b-la)-ys lies in A, and the normal form theorem for R shows that neither case is possible. In case (ii), Britton’s lemma shows that b(b-’a)”lb-’ = (b-’a)?I, which again is impossible by the normal form theorem in R. Thus N 2 = 1 , as required. 0 We are now ready to prove the important Proposition 4.4. If G = A x B, then there is a partition {PI,P2}of N such that A = GPI,B = Gp2,where Gpdenotes the set of all g E G such that the projection of g on S, is the identity for all i P. Proof. Suppose there exists an i E N such that A and B both have non-trivial projection on S,. Then A 3 a = ( a l ,a2,.. .a,, . . . ) with a, # 1 . By Lemmas 4.1 and 4.2, there exists an element g, E S such that [ g , , a , ] # I ,a n d g = ( l , l , ... l , g , , l , l ,. . . ) E G . Now l # [ g , a ] = (1,1,. . . l , [g,, a , ] ,1,. . . ) E A since A is normal in G. Thus A has a non-trivial intersection with S,, A fl S, = NI, say. Similarly, B has a non-trivial intersection with S,, B nS,= N 2 say. But then NI and N 2 are non-trivial normal subgroups of S,, which intersect trivially, and this contradicts Lemma 4.3. Thus there are disjoint subsets PI and P2 of N such that A has non-trivial projection on S, if and only if i E PI and B has non-trivial projection on S, if and only if i E P2.Since S, c G for each i, and since G = A x B, we see that {PI,P2} is a partition of N. Now A C GPlr B c Gp2and every element g of G can be written in the form g = ab with a E A and b E B. Hence A = GPlrB = Gp2as required. 0
85. The admissible partitions {PI,P2} Since G 2 0,several decompositions of G as a direct product of the form G = S‘ x G suggest themselves. Other direct decompositions were obtained in 03. In this section, we prove that the only decompositions of G as Gplx Gp2can be obtained by making repeated use of these two methods of decomposition. In particular, we prove Proposition 5.1. Let P be a subset of N. Then G = Gpx GNiP if and only if either (i) P is finite or
J.M.T. Jones
222
(ii) There exist integers k , m and t, t c n', and congruence classes CI,Cz,. . . C, mod n' such that P = X U Cl U CzU . . . U C,, where X is a subset of {1,2,. . . m } . The methods used are those of direct calculation, using the normal form theorems for amalgamated free products and HNN extensions. We begin with a trivial observation. Lemma 5.2. The subgroup of G generated by x, u and v is isomorphic to T under the map x -+ e, u +f , v -+ d. We now attempt to derive similar descriptions of other subgroups of G, in particular, of t h e subgroup generated by x, y , z and u. T o this end, we introduce new elements of C as follows: For any g E C, let g, denote the projection of g on S,. For each positive integer k , and each integer I, 1 S I S n', define new elements x ( I , k ) , y ( I , k ) , u ( l , k ) and v(I, k ) of C by x(I, k ) , = y ( l , k ) , = u(I, k ) , = v(I,k ) , = 1 if i f I mod n'. x ( I , k ) , = b-'a, y ( l , k ) , = p , u(I, k ) , = c, u ( l , k ) , = d if i = I mod n'.
For each k , I as above, and each integer s 3 1, define an element
z ( l , k , s ) of C by
z(I, k , s ) , = 1
if i f I mod n'
z(I, k , s), = p'n'fap-9n1f+1if i = n * ( t - 1)+ 1.
The proof of Theorem 3.1 shows that G contains the elements x ( l , k ) , y ( l , k ) , u(I, k ) , v(I, k ) and z(I, k , s ) for all I, k , s as above. Let C 1 . k be t h e subgroup of c consisting of all those elements of c whose projection on s, is trivial for all i f 1 mod n', and let D { , k be the corresponding direct product. The proof of Lemma 4.1 can easily be adapted to show that for each I, k , s t h e group L I . k . s = ( x ( I , k ) , y ( I , k ) , z(I, k, s ) ) contains the direct product DI.'. We will use bars to denote cosets of D1.k in L I . k . s . Lemma 5.3. The group ~ , . k . ~ l D ~is, ka free product of free groups A l . k . s = ( j ( I ,k ) , f ( I , k, s ) ) and B 1 , k . s = ( % ( I , k ) j ( I , k ) , f ( I , k, s ) ) with the subgroups ~ I . k . ~ = ( j ( I , k ) - i f ( I , k , s ) j ( I , kif12,13,14mod9n, )i, j(1, k)-IZ+Ynfz(I, k , s ) j ( l , k)13-9"f,y(I, k)-I3+'"'-z ( I , k , s ) j (I, k )14-9nr, j (I, k )- 14+9nr z(I, - k , s)f(l, k)I2-'"', I E Z ) , and v f . k . s = ((f(1, k ) j ( 1 , k ) ) - i f ( I ,k , s ) ( f ( I , k ) j ( l , k ) ) ' , i f 4 , 5 , 6 m o d 9 n , (a(/, k)ji(I, k))-4+9"f2(I,k , s)(f(l,k ) j ( I , k))s-9"f,
On isomorphisms of direct powers
223
@ ( I , k)f(I, k))-'+''''f(I, k , s)(f(I,k ) j ( I , k))6-'n', ( X ( I , k ) y ( I , k))-"""'f(I, k , s)(X(I,k ) j ( I , k))4-'n', t E Z ) amalgamated under the isomorphism j ( I , k ) - ' f ( l , k , s)f(I, k ) ' = (%(I, k ) j ( l , k ) ) - ' f ( I , k, s)(X(l, k)f(l, k ) ) '
for all i f 4 , 5 , 6 , 1 2 , 1 3 , 1 4 , 9 n- 11,9n - 10,9n - 9 mod 9n j(1, k ) - 4 f Y n t - z(I, k , s ) j ( I ,k)4-'n' = (f (I,
j ( I , k )-S+YW
z ( I , k, s )f ( I , k )5-'n'
k
~ ( lk , s ) j ( l ,k)'-'"'
=
j (I,
)-6+Ynl-
=
k )f ( I , k ))-4+'n'f ( I , k , s)(i I, ( k ) j (I, k ))5-'n'
(3( I , k ) j (I, k ))-'+'"'f
( I , k , s)(X ( I , k ) j ( I , k ))4-'n'
(3(I,k ) j ( I , k))-'+yn'f(I,k , s)(.?(I, k)f(l, k))6-'n'
j ( I , k ) - l 2 + Y n l - z(I, k , s ) j ( I ,k)13-'n1 = (X ( I ,
k )?(I, k ))-12+'"'Z ( I , k, s )(X ( I , k ) j (I, k ))"-'"'
jj ( I , k )- 13+0nl-z(I, k , s ) j ( I ,k)I4-'"' = (%(I,
k ) j ( I , k))-'4+'"'2(I, k , s ) ( f ( l , k ) j ( I k,))'@'"
j ( I ,k ) - l 4 + Y n l -z(I, k , s)f(I, k)'*-'"' = ( X ( I , k ) j ( I , k))-13+'n'f(I,k , s)(i(l, k ) j ( I , k))I3-'"' j
( I , k )-(Yn-
I I )+Ynl-
z(I, k , s ) j ( l , k)'n-ll-9n' = ( X ( l , k ) j ( I , k))-('n-ll)+'n'~ ( 1 k, , s) *
(X ( I , k )? ( I , k ))-If ( I , k , s )(f( I , k ) j ( I, k j(1, k
)-(Ym-lll)+Ynl-
~ ( lk , s ) j ( I ,k)yn-l"-yn'
= (a(/,k)f(I, k))-('"-l")+'n'z ( I , k, s)-'.
(3( I , k ) j ( I , k ))'" j ( I , k ) - ( Y n - Y ) + Y m l - z ( I , k , s ) j (I, k )'n-'-9nr
- I"-'n'
- (%(I, k ) j i ( / , k))-(9n-10)+ynfz(l,k,s)*
(X ( I , k ) j (I, k ))-'Z( I , k , s )(X ( I , k )f ( I , k ))'n-'-'n' Proof. An equation w ( 3 ( I ,k ) , j ( I , k ) , f ( I , k , s)) = 1 holds in L 1 , k . s if and only if t h e projection of the corresponding word w(x(I, k ) , y ( I , k ) , z(I, k , s ) ) o n S, is trivial for all but finitely many i E N. Thus it is easy to check that the listed equations hold in L 1 . k . s . To show that the subgroups k i 1 . k . s and B 1 . k . s are free, we need only note that for any i = I mod n' the projections of Al.k.s and B,.k.son S, are the free subgroups A, and B, of S,. Hence i j , . k . s and are free on the listed generators, and the given equations represent an isomorphism between U,.k.sand Thus L,.k..is a homomorphic image of t h e rgquired amalgamated free product.
vl,k.s
v,.k.s.
224
J.M.T. Jones
Let g = K s ~ s ~ - - - s , be an element of L 1 , k . s written in normal form with respect to the relations of the amalgamated free product, so that is a word o n the generators of U 1 . k . s and the S, are powers of j(1, k ) and Z(1, k ) y ( l , k ) taken alternately. If we can prove that g # 1 unless r = 0 and K = 1, then the result will be established. Let J ( 1 , k)'"IZ(I, k , s)J(l, k)'"2 be any generator of 0 l . k . s and consider t h e corresponding word w = y(1, k)'"lz(I, k , s)y(l, k)". of L1.k.$.Now w, lies in t h e subgroup R, of S,, and it is easy to check (by taking each generator in turn) that w, also lies in t h e amalgamated subgroup U, of R, for all sufficiently large i = 1 mod n k . Now K involves only finitely many generators of 0 l . k . . and consequently the projection h, of h on S, lies in U, for all sufficiently large i E 1 mod n k . It follows that, for all sufficiently large i = 1 mod n k , g, is a word of R, of normal form length r. Thus g = 1 in L 1 . k . s if and only if r = 0 and 6 = 1, as required. We next consider the subgroup of G generated by the elements
x(1, k ) , y(1, k ) and z ( l , k , s ) when s is allowed to vary through a finite
set. We prove
Lemma 5.4. Let CY and p be fixed positive integers. Then the groups Ll,k . 0 0 S a S s S p generate their free product amalgamating the subgroup (Z(1, k ) , j ( 1 , k ) ) . Proof. The group fi,,,,,,,, generated by t h e L 1 , k . s is clearly a homomorphic image of the required amalgamated free product. We note that any element g of f i , , k , - . O may be written in the form = w(X(1, k ) , J ( 1 , k ) ) f i i i i z - - - i i r
where each 6, is a non-trivial word on t h e generators of U , , k . s , for some s = s ( i ) and where s(i + 1)# s(i) for any i = 1 , 2 , . . . r - 1. We need only show that g = 1 if and only if r = 0 and w ( 3 ( f ,k ) , J ( f ,k ) ) is the trivial word. As in the proof of Lemma 5.3, the projection of u I u 2 - - - u , on Si lies in the amalgamated subgroup U, of R, for all sufficiently large i = 1 mod n k . A length argument in R, then shows that if g = 1, w must be the trivial word. Thus it is sufficient to consider words of the form fi,iiz---iir. Suppose z.i,iiz-r-ii, = 1 with r > 1, and set i = n k ( t- 1)+ 1. The projection of u, on S, is a word in finitely many elements from among p - m @9n~rap-9nsr+l )p", m # 12, 13, 14 mod 9n,
P -12+9nm
@9nslap
-9n
+
%I I
lP
13-9nm,
P - 1 3 + 9 n m @9"'lap
-9n'l+l
>P14-9nm,
225
On isomorphisms of direct powers
- I4+Ynm (p 9nslap -Yn*r+ 1 )p12-ynm, m E Z. Now u I u 2 - - - u , E D, so there P must exist some j , 1 s j s r - 1, such that for infinitely many values of i = 1 mod n k , all adjacent p-symbols in the projection of U , U , + ~on S, cancel. Suppose that ii,, written as a word on the generators of u , , k , s ( , ) ends in f(l, k)'Z(l, k, s)=f(I,k)", y, 6 E Z , E = f 1, and s = so'), and ii,+l begins in f(l, k)"Z(l, k, s')''f(l, k)"', y', S'E 2, E ' = f 1, and s ' = s ( j + 1)# s. The projections of these segments on S,, i = n k ( t - 1)+ I, are p'(py"rap-yn"+1)ep6 and v'(pYn.'rap -9n*',+ I ) 'p"'. Thus one of the following must hold for infinitely many values of t : (i) - 9n't + 1+ 6 + y ' + 9n"t = 0 (ii) - 9n't + 1 + 6 + y ' + 9n"'t- I = O (iii) - 9n"t + 6 + y ' + 9n"'t = O (iv) - 9n't + 6 + y + 9n"'t- 1= 0. Rewriting these, we see that one of the following must hold for infinitely many values of t : (i)' 9 t ( n s ' - n ' ) + 1+ 6 + y ' = O (ii)' 9 t ( n s ' - ns)+ 6 + y' = 0 (iii)' 9 t ( n S ' - ns)+ 6 + y ' - 1 = 0. But 6, y ' , s and s' are fixed with s f s', so each of these equations can hold for at most one value of t. Thus ti,&--- 1. Since u , . k . s is free, the case r = 1 is easily dealt with, and so the structure of & & , k . m , @ is established. 0 E
We are now in a position to investigate the structure of the subgroup
(x, y, z, u ) of G. We prove
Lemma 5.5. Suppose w is an element of (x, y, z, u ) written in the form w = u"Ig,u"2g*---U('rg, where each gi E (x, y, z ) . Set k = C:=, 1 ai1 + 1. Then there exists an integer m, and congruence classes C , ,C2,.. . C, mod n k such that the projection of w on Si is trivial if i > m and i 61 C1U C2U * . U C,, and non-trivial if i > m and i € C , U C2U . . . U Cr.
-
Proof. The projection w, of w on S, lies in H,, and H, is an HNN extension of R,. It follows that if Z:=, a,#0, then w , # 1 for all i E N, and the result is proved. Thus we may assume C : = , a ,= O . It is clear that there exists a word
G
= G ( x ( l , k ) , y ( l , k ) , z ( I ,k , k ) , u ( l , k ) )
such that $, = w, for each i = l mod n k . Thus it is sufficient to prove
226
J.M.T. Jones
that for each I = 1,2, ..., n k , there is a positive integer mf such that if S,# 1 for some i > mf , then S,# 1 for every i > rnf such that i 3 I mod n k . We may write S in the form
GJ= ~ ( lk)"lglu(l, , k)U2gz---U(I,
k)"rg,
where each g, is a word in x ( I , k ) , y ( l , k ) and z(I, k , k ) . Now the following relations clearly hold in G: (1) u ( I , k ) - ' z ( I , k , s ) y ( I , k ) - ' u ( I , k ) ' =z ( l , k , s + r ) y ( l , k ) - ' for alJ r S O . (2) u ( l , k)'z(I, k , s ) y ( I , k ) - ' u ( I , k)-' = z ( l , k , s - r ) y ( I , k ) - ' for all O < r < s - 1. (3) u ( I , k 1-'Y (1, k )u (1, k = Y ( I , k 1". We will call a word in n ; i , k . , , Z k reducible if it contains a subword of the t Y Pe (i) ti(/, k ) - ' g P ( I , k ) with g E r ; i , , k , l , Z k - l or (ii) f i ( l , k ) g l ( l , k ) - ' with g E N J . k . 2 . Z k = ( f ( l , k , s ) j ( l ,k ) - ' , j ( f ,k ) " 2sss2k). Otherwise the word is called irreducible. Lemma 5.4 shows that we can always tell of- a given word of n ; i J , k , I . Z k whether or not it lies in -
M L I C . IOr . ~N - I, k , , x .
By using the images of relations (l), (2) and (3) in G / D we see that any reducible word can be replaced by a word equal to it, and involving fewer appearances of f i ( 1 , k ) . Thus any word can be replaced in a finite number of steps, by an equal irreducible word. We apply this process to 6.The choice of k ensures that the resulting reduced word contains no subword of the form ti(/, k)-'gP(f, k ) with g E f l f . k . l . 2 k or of the form # ( I , k ) g P ( l , k ) - ' with g E N f , k : l . Z k . Let V be the reduced word thus obtained. Suppose that V still involves an appearance of i i ( l , k ) , but that the projection of u o n S, is trivial for infinitely many i = 1 mod n k . It follows from the structure theorems of H , that u contains a subword of the type (i) u ( l , k ) - ' g u ( l ,k ) with g, lying in A, for infinitely many i = l m o d n k or (ii) u ( I , k ) g u ( I , k ) - ' with g, lying in ( a , p " ) for infinitely many i = 1 mod n k . By Lemma 5.4, g may be written in reduced from as w ( i ( l , k ) , j ( l ,k))ii1ii2---&, and for sufficiently large i = 1 mod n k , the projection of g o n S, is equal to the projection of w ( x ( I , k ) , y ( I , k ) ) u l u 2 - - - u r on S,. The projection of each u, on S, lies in the amalgamated subgroup U, of R, and so it follows from the normal form theorem for R, that w ( x ( l , k ) , y ( l , k ) ) = y ( l , k ) " for some integer s. Thus g E n ; i J , k . ] , , k - l and so case (i) leads to a contradiction.
O n isomorphisms of direct powers
227
Case (ii) therefore holds, and the projection of y ( l , k ) s u l ~ 2 - - - ~on, S, lies in (a, p " ) for infinitely many i = f mod n '. Now the projection of z(f, k, s)y(f, k)-' (1 C s s 2 k ) on S, lies in ( a , ~ "for ) all i, and so we see that, by writing g as a word in j ( I , k ) and ? ( I , k , s ) y ( l , k)-', 1 C s s 2 k , we may conclude that g lies in N r , k . 2 . 2 k contradicting the fact that V is irreducible. Thus if 0 contains an appearance of U ( I , k), then u, is non-trivial for all but finitely many i = f modnk. If, on the other hand, V contains no appearance of U ( I , k ) , we may use Lemma 5.4 to write V in normal form, and hence we may deduce that there is an integer mr such that, if u,# 1 for some i > mr then u, # 1 for all i >' mr such that i = f mod n k . This completes the proof of Lemma 5.5. 0
Lemma 5.6. Let w E ( x , y , z, u ) be written in the form w = u u ~ g I u u * g z - - - u a ~where g, each g , E ( x , y , z ) , and let k = C : = , l c r , ) + I . Let G be the corresponding word of ( x ( I , k ), y ( I , k ), z (f, k, k ), u ( I , C)), and let 77 = k"ol(I,k ) v % ( f , k)'z---u -(f, k)S"'h;, (&, = 2 1, h E n;il,k.1,2k)be the reduced form of 6.Then the projection u, of u on S, lies in (b-la,c ) for infinitely many i = 1 mod n k i f and only if there is an element 6 E D, and a word g = g ( u ( f ,k ) , x ( f ,k ) ) such that u = (g(u(I,k ) , x ( I ,k ) ) . Proof. We proceed by induction on m. If m = 0, then V E n;il,k.l,Zkand the result follows from Lemma 5.4, and t h e structure theorems of H,. Suppose m > O . A slight adaptation of Lemma 5.4 shows that we may write 6, in t h e form k,,, = ii,i&----ii,w(f(f,k ) , j ( f ,k)) where each U, is a word on the generators of u l , k , $ for some s = so'), and where s(j + 1) # s (j ) for each j = 1,2,. . . r - 1. Now the projection of u on S, lies in (b-la, c ) for some suitably large i = f mod n k . An application of Britton's lemma in H, th en shows that k, = iiliiz---ii,j(f, k)@f(I,k)' for some p, y, and that, if E, = f 1, the projection of u I u z - - - u r y ( I , k)@ on S, lies in ( a , ~ " As ) . in the proof of Lemma 5.5, we see that filGz---Li,j(l, k ) P E g i . k , i , z k - i if &, = - 1, and ~ ~ i i ~ - - - i ik)@ , j (E ~ ,N l , k . 2 , 2 k if E, = + 1. Hence E(f, k)'-h;, may be rewritten in t h e form ~ u ( kI) ,F m f ( f , k)', where g E n;if.k.l~Zk. We may, therefore, rewrite B in t h e form
-
-
6 = k,,u(f, k)'lk,E(f,
k)'.---k
-
,-rE(I, k ) € m f ( f ,
k)'.
The projection of u on S, then lies in ( b - ' a , c ) if and only if the same is true of 0 = L o u ( / ,k)'nk,u(I,k ) E 2 - - - k , - l , and the result follows by induction. 0
228
J.M.T. Jones
We now take a general word in (x, y, z, u, u ) . G is clearly a homomorphic image of the free product of the groups (x,y, z, u) and (x, u, u ) with the subgroup (x, u) amalgamated. Any element of G can be written in the form w = g l h , g 2 h 2 - - - g l , where each giE (x, y, z, u) and each hi E (x, u, u ) \ ( x , u). By Lemma 5.2, the projection of hi o n Si does not lie in (b-la, c ) for any j = 1 , 2 , . . . r. If r = 1, and w = g,, then the result of Lemma 5.5 applies. Otherwise the normal form theorem in S, shows that if wi = 1 for infinitely many values of i, then there is some j such that t h e projection of g, o n Si lies in ( b - ' a , c ) for infinitely many values of i, and so the result of Lemma 5.6 applies. We are now ready to prove Proposition 5.1. Let P be a subset of N . Then G = G p X GN\p if and only if either (i) P is finite or (ii) there exist integers k , m and t C n', and congruence classes CI,C 2 , .. . C, mod n k such that P = X U CIU C2U.. . U C, where X is a subset of { 1,2, . . . m }. Proof. Suppose G = G px GN\P. Then G p is a homomorphic image of G, and, as such, is finitely generated. Each of t h e generators of G p lies in G, and thus may be written as a word on t h e generators x, y, z, u, u of G. We have seen that any such word may be written in the form g l h l g n h 2 - - - g l , where each g, E (x,y, z, u) and each h, E (x, u, u ) \ ( x , u). Each g, can be written in the form U ~ ~ W ~ U " ~ W ~ - - - U ' "with ~ W , each w, E (x,y, z ) . We set k, = C:=, la, 1 + 1 and for each of t h e finitely many generators g of G , we set k, = max k,. Let k = max k,, where t h e maximum is taken over our set of generators for Gp.We now split each g l h , g 2 h z - - - g l , into a product of n' commuting words on the generators x ( l , k ) , y ( l , k ) , z(1, k , k ) , u ( l , k ) , u ( l , k ) , 1 S 1s nk...Let w = iI6,g2&---g,k, be any such word, and note that 6, E ( x ( l , k ) , u ( l , k ) , u(l, k ) ) \ ( x ( l , k ) , u(l, k)). Lemma 5.6 shows that either t h e projection w, of w on S, is non-trivial for all sufficiently large i = I mod n k , or some g, can be written in the form g, = &(x(l, k ) , u(l, k ) ) where 6 E D. But if -this be t h e case, may be rewritten in the form giKig;h;---gkh A with conditions as above, and with m < r. Repeated application of the above enables u s to conclude that there is an integer m such that, if w, # 1 for some i > m, then w,# 1 for all i > m such that i = 1 mod n k . We deal thus with each generator of Gp.Since P is equal to the set
.
O n isomorphisms of direct powers
229
of all those i E N such that the projection of g on Si is non-trivial for some generator g of G , t h e required result follows. 0 96. Equations in S Suppose G = G'. Proposition 4.4 shows that there is a partition {PI,Pz, .. . P,} of N such that each Gplis isomorphic to G, and G = Gp,x . * * x Gp,. Let cp : G + Gpl be an isomorphism. Then cp(x), cp(y), cp(z), cp(u), cp(u) generate G , and satisfy any relations satisfied by x, y, z, u, u. If i E P, then t h e projections cp(x),, cp(y),, cp(z),, cp(u),, cp(v), of cp(x), cp(y), cp(z), cp(u), cp(u) on S, must generate S, and must satisfy any relations satisfied by x, y, z , u and u. We will use t h e structure theorems of S to establish the possible values of cp(x),, cp(y),, cp(z),, cp(u), and cp(u), and will then use the ) results of 95 to reconstruct the elements cp(x), cp(y), p(z), ~ ( u and cp(u) cf G,. We will then know which maps from G to G , can be isomorphisms, and can use this information to deduce that G = G , if and only if the number t of congruence classes mod n k given in Proposition 5.1 is congruent to 1 mod n - 1. This shows that G $ G' if 1
c:=,a,.
Lemma 6.1. If 1 # w E K
= ( b - ' a , c ) ,rhen w
+
H
~
for any m #
m
*
f 1.
Proof. Suppose 1 # w E K, and suppose w - H w"', m # 1. Without loss of generality, w may be taken to be a cyclically reduced element of the (free) group K. Since H is an HNN extension of R by c, a consideration of c-length shows that w is c-free, that is w = ( b - l a ) " for some a # 0 . Thus w E R, and w is a cyclically reduced element of normal form length 2 a in t h e amalgamated free product R. Thus w is not conjugate in R to an element of A = ( a , ~ )It. follows that the conjugacy w w"' is achieved within R. But this is clearly impossible, by a length argument. 0
-
230
J.M.T. Jones
Lemma 6.2. Suppose that w # 1 is a cyclically reduced element of the free group (e, f ) and w - T ~ m for some m # 2 1. Then w = f" for some a, and if g-'wg = w"' then either g = d s for some p > 0, or g may be written in the form d sf 'd' for some integers p, y, 6 satisfying p > 0, 6 S 0, and either 6 = 0 or y is odd, and p > 16 Moreover, m = 2@in the first case and 2s+a in the second.
I.
Proof. As T is an HNN extension of To= (f, d ) by e, a consideration of e-length yields w =f" for some a # 0. Now To is an HNN extension of (f) by d, and so a d-length argument shows that w is not conjugate in To to a power of d. It follows that t h e conjugacy w w"' is achieved in To. Now To has presentation (d, f : d-'fd = f'). The conjugates d'fd-', r 2 0 , of f generate a locally cyclic group which is the normal closure o f f in T,,. Thus it is easy to see that any element with p 3 0 , and so, making of T,, may be written in t h e form d'f'd' careful use of t h e relation d-lfd = f2, we may write g in one of t h e forms (i) g = dP, p # 0 or (ii) d @f 'd' with p 3 0, S c 0, and either 6 = 0 or y odd. In case (i), g-'wg = d-@f"da,and it is clear that this is a proper power of w only if p > 0, in which case g-'wg = f'"" = w2'. In case (ii), g -' wg = d -'f -'d -Of "d @f'd'. Since p 3 0, this is d -'f ""d '. Now 6 S O , so t h e result is a proper power of w if and only if p > ( 61. In this case d-'f'"d6 = f'"-'a' ". This completes t h e proof. 0
-
Lemma 6.3. Suppose 1 # w E TI,, and w + T f " for any a. Suppose further that w - T ~ m with m # 51. Then w LITd" for some a#O. Moreover, if g-'d"g = dam,g E T, then either g = e p for some p > O , or g = ePd'e' for integers p, y, 6 satisfying p > 0, 6 < 0, and either 6 = 0 or y is odd and /3 > 16 1. Moreover, m = 2' in the first case, and 2s-'6' in the second. Proof. As in Lemma 6.2, w may be written in the form d"fPd' with w"'. But a + y # O . A consideration of d-length shows that w w - I ~ w " ' and , SO we must have w -T,d" for some a. Suppose g E T satisfies g-'d"g = d"", m # 5 1, and g is written in e-reduced form as woe"lwle"~w2---e"~w,where each wi E TI,.Clearly r # 0, and so w i l d " w o E (d). Suppose wo = dPf'd' with y # 0. We must have f-'d"f' E (d), and so it follows by a consideration of d-length in To that f-'d"f' = d", that is, d"f'd-" = f', and this relation clearly does not hold in To. Hence w o E (d), and repetition of the argument shows that w, E ( d ) for each i = 1,2,. . .r. Hence g E (e, d ) and, since the relation e-'de = d 2 holds in (e, d), we may use the method of Lemma 6.2 to write g in one of the forms (i) es, p # 0, or (ii) ePd'e'
O n isomorphisms of direct powers
231
with p 3 0, 6 < 0, and either 6 = 0 or y odd. The remainder of the proof is identical to the corresponding part of Lemma 6.2. 0
Lemma 6.4. Let w = d"f @d'with T to an element of ( e ,f).
CY
+ y # 0.
Then w is not conjugate in
Proof. Suppose w is conjugate in T to a cyclically reduced element g of (e, f). A consideration of e-length shows that g = f 6 for some 8 f 0 . Let K ' w h = f 6 and suppose that h is not e-free. By considering the last e-reduction in the word h-'wh, we see that f " - % , d Afor some AZO.But a consideration of d-length in To shows that this is impossible. Hence h is e-free, and d"fadT-%,f6, and a consideration of d-length again yields a contradiction. 0 Lemma 6.5. Suppose w E ( e ,f ) and re( w ) # 0. Let g be an element of T such that g-'wg E (e,f). Then g E (e,f).
Proof. Without loss of generality, we may take w to be a cyclically reduced element of the free group (e,f). We first show that g cannot be a power of d. We may write w in e-reduced form as w = e'~f"~eC~fo2---e'~f"~, where each E, = -+1. (Here the a, are permitted to take the value zero, but if a, = 0, then E , + ' # - E , . ) If d-@wdPE (e,f), an application of Collins' Lemma shows that we have an equation of the form d-@e ---e'rf "rd = e 'I f 71---e ',f 1, 'if"i
that is, d
'1f"1
_ _ _ e ef%d pf-yre
-er
___ f-71e
-cI
= 1.
Now f"dPf' is equal in To to a power of d if and only if it is equal to d P (by d-length in To).Moreover, e-'dPe' is equal to a power of d if and only if. it is equal to d2'@(where, if E = - 1, this should be interpreted to mean that p is even). Consequently our equation is possible if and only if it results in d-Pd2"P= 1 where Y = El-, e i . But this is dP""-l) = 1, which is possible only if 2"*("'= 1. Since a,( w ) # 0, this cannot occur. An application of Collins' Lemma now shows that the only cyclically reduced conjugates of w lying in (e,f) are those achieved by cyclically permuting w. Thus if g-'wg E (e,f) there is an element h E (e,f) such that g-'wg = h-'wh. It remains to show that if g commutes with w then g E (e,f). Suppose g is written in e-reduced form as g = goe"1glee2g2---ee,g, where each E, = & 1 and each gi E To. We will assume that as large as
J.M.T. Jones
232
possible an initial segment of g of the form goe'lgleez--- g,e'm lies in (e,f), viz. go,gl,. . .g, are powers of f, g,+l is not, and if g can also be then at least one of written in reduced form as &eel~leez---ee~& is not equal in To to a power of f. If g E (e,f), we are go,gl,.. finished, so assume not. We have g = hd"f@d'k where
h = goe'lgle"2---g,eEm+l E&,f) gm+i
=
d"f@d' (f)
k = eEm+2gm+2eEm+3--e Erg,. Case (i). k E ( e , f ) . Let w , and w 2 be the reduced forms of h-lwh and kwk-' in the free group (e,f). Then ( d " f @ d T ) - I w l ( d " f @=d w2. Y ) We have already established that conjugation by a power of d cannot transform a nontrivial element w of (e,f) with u , ( w ) # O to another element of (e,f), and so, without loss of generality, we may take (Y > O , y < O and p odd. Let w , = f"oe@lf"I---e@-f"" and w 2 -- fyoeS,fYI---esfYiwhere each p, and each 6, = 2 1. We have (d"f@d')-'(f"oe @if"i---e (d"f@d7 ) (f-y'e-9 ___ e-*if-Yu)= 1. @.fun)
An application of Britton's Lemma yields that p. = 6, and that fad"fedrf-'~lies in ( d ) if Pn = - I, or in ( d ' ) if Pn = + I. Since a > O and y
neither a nor y is zero. Case (ii). k We now assume that as large as possible an end segment of k of the form eZa+2gs+2---e',g.lies in (e,f). We then have k = k,dAf'd"k2where k l = e'm+zgm+2---gse's+~
e(e,f).
gs+i= d A f " d "@ (f)
kz = e'~+2g~+~---e'rg, E (e,f). If k , contains no occurrence of e, then this case reduces to case (i), so we may assume that k l involves e. Let w l , w2 be the reduced forms of K ' w h and k2wk;' in ( e , f ) . We then have
(dAf"d")-'k;'(d"fPdY)-lWI(d"fed ' ) k i ( d A f ' d "= ) ~
2
,
so that, when w , and w 2 are written as words in (e,f), as in case (i), we have
( d A f ' d " ) - ' k;l(d"f@d")-If"oe@i _ _ _ e @"fa,( d a f @'d) k I(dAf'd")f-?e-"t_ _ _ e-%f-' = 1.
O n isomorphisms of direct powers
233
An application of Britton's Lemma shows that e-reduction must take place in one of the subwords (i) e@mf"m(d"ffPdr)ee-+2 or (ii) e 's+i(d'fed ")f-"e If e-reduction occurs in (i), we have Pn = - E,,,+' and f"m(d"f@d') lies in ( d ) if E,,,+'= + 1 or in ( d ' ) if E,+'= - 1. Now the assumptions on the form of g show that we have a 3 0, y S 0, and either cr = 0, E,+' = - 1 and y is odd, or a > 0, and either 2" does not divide /3 or E,+' = - 1 and a + y is odd. If E,,,+' = - 1, then f"n(d"f@d')E (d'), so we have d"f2""~fBdT E (d'). This is possible only if 2"an+ /3 ='O and a + y is even. Since either /3 is not divisible by 2" or cr + y is odd, we see that this case is impossible. If E,+' = + 1, then f"-d"fadrE ( d ) , so d"f2""m+@dY E ( d ) , and this happens only if 2"an+ /3 = 0. But, since E ~ + ~+=1, 2" does not divide p, so this case too is impossible. Thus cancellation occurs in (ii). As above, we must have E , + ~= 8, and (d^f'd")f-'t lies in ( d ) if E , + ~= - 1 or in ( d ' ) if E , + ~= + 1. The assumptions on the form of k show that eirher E , + ~= - 1, A 3 0, u < O , and 2'"'does not divide p, or E , + ~= + 1 and either u = 0 and A is odd, or A 30, u < 0, and either 2'"' does not divide p or A + u is odd. If E , + ~ = - 1, we have (d'f'd")f-'n E ( d ) , that is d'f'-''""d" E ( d ) , which can happen only if p - 2'"'y = 0. since u # 0 and 2'"' does not divide p, this is impossible. Thus E , + ~= + 1, and we have (d'f'd")f-'~ E ( d ' ) , that is dAf"-Zl"'rd" E (d'). This forces p -2'"'y, = 0, and A + u even, contradicting the conditions on A, p and u. Thus the assumption that g fZ (e,f) leads to a contradiction, and so the result is established. 0
Corollary 6.6. If w any m # & l .
=
( b - ' a ) " d P ( b - ' a ) 'with
(Y
+ y#O,
then w + S w m for
Proof. Since b - ' a = e, a consideration of b-'a-length shows that w 4 T w m . Suppose g = glg2---g, is an element of S, written in normal form with respect to the amalgamated free product structure of S, and suppose that g satisfies g - l w g = w'",m # & 1. If r = 1, then g = gl E H \ K , and ( b - ' a ) " d P ( b - ' a ) ' lies in K. An application of Britton's Lemma in T shows that /3 = 0. Since (Y + y # 0 , a consideration of normal form length in R shows that (b-'a)"+' is not conjugate in R to an element of A. Thus, if w - H ~ m , then the conjugacy is achieved in R. But a length argument in R shows that this is not the case. Thus r # 1, and so w is conjugate in T to an element of K. Let w I be a cyclically reduced conjugate of w lying in K. Collins' Lemma shows that w I can be obtained from w by a cyclic permutation of w,
234
J.M.T. Jones
followed by a conjugation by a power of d. Since this process cannot introduce any c-symbols, it follows that w 1 = (b-'a)"+?.Now w 1-s w;", and we have already seen that this conjugation cannot be achieved by an element of normal form length 0 or 1. Suppose h = hlht---h, is an element of S in normal form such that h - ' w l h = w;". If h l lies ir! T, then an application of Lemma 6.5 shows that h , E K , contradicting the assumption that h is in normal farm. Thus h l E H, and a repetition of the argument used in the case r = 1 shows that ab-l,(h;'wlhl) = *(a + y). A further application of Lemma 6.5 now shows that hzEK , and this contradiction completes the proof. 0 Suppose we have a set of elements x , y , z, u, u of S which together generate S, and which satisfy the following relations: (i) u - l y u = y " (ii) u-luu = u z (iii) X - I U X = u z (iv) X - I Z X = z (v) x ( y - l z y ) x - l = y - ' z y (vi) ~ ( y - ~ ~ z y ~ ~=) yx- -1 -3I ~ y 1 4 (vii) y - " z y " = ( x y ) - ' z ( x y ) " . Lemma 6.7. x , y, z, u, u are all non-trivial. Proof. From (iii), x = 1 implies u = 1; from (ii), u = 1 implies u = 1; from (i) u = 1 implies y n - l = 1. The structure theorems for amalgamated free products and HNN extensions show that S is torsion free, and so y n - ' = 1 implies y = 1. From (vi), z = 1 implies y = 1. Thus if any of x , z, u, u is trivial, then y is trivial, so we need only show that y # 1. Suppose y = 1. Equation (vii) then gives X - ~ Z X ' = z. But x commutes with z by (iv), and so x = 1 . But we have already shown that x = 1 j u = 1 j u = 1, and so ( x , y, z , u, u ) = (2). But S is not cyclic, and so we have a contradiction. 0 Lemma 6.8. Not all of y , u, u lie in H . Proof. Suppose y , u, u all lie in H. Modulo an inner automorphism by an element of H, we may assume that y is c-cyclically reduced, and since y - H ~ n it follows that y E R. Now u - H ~ z , and so u is conjugate in H to an element of R. Suppose u is not c-free. T h e n u may be written in reduced form as g - l w g where w E R, and g begins in a power of c. Now w - ' ( g y g - ' ) w = ( g y g - I ) " , and an application of Britton's Lemma shows that gyg-' E R. Thus, modulo an inner
O n isomorphisms of direct powers
235
automorphism, we may assume that both y and u lie in R. Now y - R y n and so a length argument in R shows that y lies in a conjugate of a factor (A or B ) of R. But A and B are free, and n o non-trivial element of a free group is conjugate in that free goup to a proper power of itself. Thus, taking, if necessary, an inner automorphism by an element of R, we may assume that y lies in the amalgamated subgroup U of R. Now u cannot lie in a factor, but we have u-'uu = u 2 , and if u is not c-free, this forces u to lie in a conjugate of A = ( a , p ) . Thus if u is not c-free, we may write u in reduced form (with respect to the structure of R ) as g-'tg where t lies in a factor but not in U , and 1 # g = glg2---g, where g , lies in a different factor from t. Putting this into the equation u-'yu = y", we obtain t-'(gyg-')t = (gyg-I)". This is possible only if gyg-' lies in U. But then gyg-' and t lie in the same factor, contradicting the fact that the factors are free. Thus u is c-free. But then u - R ~ 2 , so u lies in a conjugate of a factor of R, and t h e same contradiction goes through. 0 Lemma 6.9. Modulo an inner automorphism of S, we have y E H \ K , x,zEH, uEKand VET.
Proof. By relation (i) y -sy". Thus, by a length argument, y lies in a conjugate of a factor ( H or T) of S, so, modulo an inner automorphism of S, we may assume that y lies in a factor. By relation (ii), u - s u 2 , so u also lies in a conjugate of a factor. Suppose u does not lie in a factor. Then we may write u in the form g-'tg where t lies in a factor, but not in K, and g = glg2---g, in normal form, with g l in a different factor from t. By relation (i), t(gyg-')t-' = (gyg-I)", and so a length argument shows that gyg-' lies in K . Lemmas 6.1 and 6.2 show that modulo an inner automorphism we may take y = c o for some a, and u = t = dPc'd6 for some p, y, 6 with p + S # O . By Lemma 6.4, u does not lie in a conjugate of K, but relation (ii) says u - ' u u = u 2 , and so a length argument shows that u E T. Lemma 6.3 may then be applied to deduce that u - T (b-'a)^d''(b-'a)"with A + j~ # 0. But relation (iii) gives u - s u z , and this contradicts Corollary 6.6. Thus u lies in a factor. If y lies in a conjugate of K , t h e same contradiction results, so y does not lie in a conjugate of K , and so u must lie in the same factor as y. Suppose that factor is T. Without loss of generality, y may be assumed cyclically reduced, and so the equation y - T y " implies by a length argument in T, that y E To. NOW y does not lie in a conjugate of K , and so Lemma 6.3 shows that without loss of generality we may take y = d" for some a, and u =
236
J. M. T. Jones
( b - ' a ) @ d ' ( b - ' a ) for * some p, y, 6 with p + 6 # 0 . But u-luu = u 2 , and so we have again contradicted Corollary 6.6. .Thus we may assume without loss of generality that y and u lie in H, and y does not lie in a conjugate of K. Suppose that z is not cyclically reduced. Then, modulo an inner automorphism by an element of H, we may assume that z begins and ends in an element of T Now relation (vi) shows that z - s z y , and so a length argument yields an immediate contradiction. Exactly the same argument shows that z Sr T \ K , and so either z E H or z may, without loss of generality, be written in reduced form as z = sIs2---sn with s, E H and s l E T. Suppose that z fiZ H . Now z - s z y , and so a length argument shows that s,y fiZ K . Since y Sr K , it follows that y - l z y is reduced but not cyclically reduced. By relations (iv) and (v), x commutes with both a cyclically reduced and a non-cyclically reduced element of S. Hence, by a length argument x E K. Now relation (iii) gives u - s u z , so u lies in a conjugate of a factor. Suppose u does not lie in a factor. T h e n we may write u in the form u = g-'tg where t lies in a factor, but not in'K, and g = g,g,---g, in normal form, where g l lies in a different factor from 1. We then have t-'(gug-l)t = (gug-I)', with u E H. As in previous cases it follows that gug-' lies in K , and we may use Lemmas 6.1, 6.2 and 6.3 to deduce that u - - S ~ ' , u - s d s and x - s ( b - ' a ) " d a ( b - ' u ) ' with a + y Z 0 . But x E K , and so, as in t h e proof of Corollary 6.6, we have x - K (b-'u)*'+' with a + y Z 0 . In particular u b - l , ( x ) # 0 . But z - l x z = x , and so s ; ' x s I E K. But sl E T, so by Lemma 6.5, sIE K , a contradiction. Thus if z Sr H, then g = 1. Moreover, if u lies in a conjugate of K , the same contradiction results. It follows that u lies in the same factor as u, that is u E H. Hence y , u, u, x all lie in H, contradicting Lemma 6.8. Thus z E H. We next consider x. If x is not cyclically reduced we may suppose, without loss of generality, that x begins and ends in an element of T. Since z commutes with x, a length argument forces z E K . By relation (vii) z -szxy and, as y G K, a length argument gives a contradiction. The same contradiction results if x E T \ K . Suppose x 6Z H. We may then write x in normal form as x = sIs2---s, where, without loss of generality, s. E H and sIE T. Now relation (iii) shows that u lies in a conjugate of a factor. Suppose as before that u is written in reduced form as u = g - l t g . Assume first that g # 1. As in previous cases we deduce that u c', u d " and x (b-'a)"d@(b-'a)'with a + y # O . Thus x is conjugate to an element of T contradicting the fact that x is cyclically reduced of length greater than one. Thus u lies in a factor. If u lies in a conjugate of K, the same contradiction results. Thus u does not lie in a conjugate of K, and so u must lie in H. We now have all
-
-
-
On isomorphisms of direct powers
237
of y, u, u lying in H, and this contradicts Lemma 6.8. Hence x must lie in H. We now have all of x , y, z, u lying in H. If u lies in a factor, but u does not lie in a conjugate of K, we are forced to conclude that u lies in H, contradicting t h e fact that x, y, z, u, u generate S. Thus if u lies in a factor, then that factor is T, so u must lie in K, and we are finished. Suppose therefore that u does not lie in a factor. As usual, u may be written in the form u = g-'tg, where t lies in a factor, but not in K , and g = gl---g, in normal form, with g l in a different factor from t. We may further assume, by taking an inner automorphism by an element of H if necessary, that g, E T. Repeating t h e argument we have used before, we now obtain gug-' - K c ' , gug-' - T d s , gxg-' - T ( b - ' a ) " d P ( b - ' a ) 'where a + y # 0. We have, in particular, x = g - ' l g where l E T, and f l b - l u ( l ) # O . Since d " is not conjugate in T to an element of K (Lemma 6.4), we also have t E T, and g l E H. The equation x = g - l l g when written out in full yields
g;'g;!'---g;'lg'gZ---g,
= x.
Since g, E T, and x E H, we conclude that x E K. Also gl E H, and l E T, so 1 E K, and, as in the proof of Corollary 6.6, we deduce that 1 = (b-'a)p+r with Q + y # O . Now g i ' l g , lies in K , and a repetition of the argument used in t h e proof of Corollary 6.6 shows that vb-b ( g i ' l g ' ) # 0. But then Lemma 6.5 shows that g2E K, contradicting the fact that g is in normal form. This completes the proof of Lemma 6.9. 0
Lemma 6.10. Modulo an inner automorphism of S, x = b-la, u = c, u d, y = a"psa-" for some a, p and z E U.
=
Proof. By Lemma 6.9 we may take y E H \ K , x, z E H, u E K and u E T. An application of Lemma 6.2 shows that, without loss of generality, u = c ' and u = d"ced' for integers r, a, p, y with a + y = 1. Lemmas 6.3 and 6.4 then show that u - T d and x - T (b-'a)'dd'(b-'a)"for integers A, p, u with A + u = 1. Since x E H, x must lie in K, and hence, by the argument of Corollary 6.6, x = b-'a. Now u = d"csd' and ( b - ' a ) - ' u ( b - ' a )= u2, so an application of Britton's Lemma in T shows that p = 0, and so u = d. Next consider z. We have Z-'XZ = x, and z E H. Since b-'a is not conjugate in R to an element of A, we must have z E R. The structure of R as an amalgamated free product then shows that z = g(b-'a)* for some integer k and some element g of U such that g commutes with b-'a. Now y E H and y -"y", so we may write y in
238
1.M. T. Jones
reduced form a s h - ' w h where w is an element of R, a n d either h = 1 or h begins in a power of c. Suppose h # 1. By relation (vi), we have z - H ~ y ,that is g(b-'a)' -Hg(b-la)'h-'wh -nhg(b-la)'h-'w. By a c-length argument, we deduce that g(b-'a)' E A, and so, by length in R, k = 0. But relation (vii) shows that we also have z - H ~ ~ ayn,d applying the above argument t o this equation yields a contradiction. T h u s we must have y E R. Since c-'yc' = y", we further deduce that y E A, a n d hence y can b e written in the form y = a"IpPia"2pP2---a".pP'. T h e equation c-'yc' = y " then yields affjpPi"'aa 2 p @ 2 " ' _ _ _
a"sp@s"'= (a"1pP1a~2pP2-_-a~*pPI)~
where, in t h e case r < O , each p, must be divisible by n"'. Since A is free, a n d r Z 0 , this equation is possible only if r = 1 a n d y = a--pPa" for some integers a a n d p. W e now have x = b - ' a , y = a-"p@a",z = g ( b - ' a ) k ( g E U ) , u = c and u = d. It remains to prove that k = 0. By relations (vi) a n d (vii) z - R ~ yand z - R ~ ~ which y , yield
g ( b- ' a )'
-
R
g ( b- ' a )'a -"p ' U "
-
R
g ( b- ' a )'+'a -"p ' a ".
A length argument in R proves that this is not possible unless k = 0, and the result then follows. 0
Lemma 6.11. If a = 0 , then p
=
1.
Proof. T h e relation z - R ~ gives ~ y g -Rgb-lapB. Since g E U, a length argument in R shows that upP E U, that is p = 1. 0
Lemma 6.12. Suppose that the elements x = 6-la, y = p, u = c, u = d and z = pn~aP~pL.2a@z--pa-aPmp"-+lE U generate S. Suppose also that for each i = 1,2,. . . m , a,# 0 , p, # 0 and a, > max (18n, p z ) ,and that the following relations hold between x, y and z : (i) y-'zy' = (xy)-'z(xy)' for 0 s i s 9n - 1, i# 4,5,6,12,13,14, 9n - 11, 9n - 10, 9n - 9. (ii) y 4 z y 4= ( ~ y ) - ~ z ( x y ) ' . (iii) Y - ~ Z Y=~( ~ y ) - ~ z ( x y ) ~ . (iv) y-6zy6 = ( ~ y ) - ~ z ( x y ) ~ . (v) y - 1 2 ~ y = 1 3( X ~ ) - ~ ~ Z ( X Y ) ~ ~ . (vi) y -13zyI4 = ( x y )-14z( x y )I4. (vii) y-I4zy = (xy)-13z (xy)13. (viii) y - ( 9 n - 1 1 ) Z y 9 n - l l = (XY)-(9n-11)~ (XY )-Iz (XY )9n- In. (jx) , , - ( 9 n - l 0 ) ~ ~ 9 n - 1 0 = (xy)-(9n-10)z )9n-in. -(9n-9)Zy9n-9 = (XY ) - ( 9 n - 1 n ) ~ (XY)-'z (XY )9n-9. (x)
On isomorphisms of direct powers
239
Proof. We first note that for each pair of integers ( i , j ) there exists a unique integer k = k ( i , j ) such that p ' a J p klies in U. Moreover, pia~p
k
= (ap-'(-i)p-i)-l(aj-p-l(-i) = (br(p-l(-i)) m r ( p - I ( - i ) )
4
P )-I@
p(j-p-I(-i))
TG-p-I(-i))
1 4
-Wp-l(-i))
)
= qi'bJ'qk'
for some integers i ' , j ' , k ' . We set B1= pnia8ipk(ni.8i), B 2= P2p k ( - 2 - k ( - , . P , ) . P 2 ) , etc., so that z = BIB2---Bm, and we call each Bi a block of z. Now for each integer r, there exists a unique integer s, and blocks BI, B;,. . .BA such that y-'BlB2---Bm = BIB;---BAy'. (We.have BI = p ~ l - ~ a 8 1 p k ( ~ , - ~ . ,8 1etc.). ) Similarly, using t h e relations a ipp(i)= b T ( i ) q m ( iz) , may be written as a product of m blocks B l , f i 2 , ..Bm, . say, each of type q i ' b " q k ' , and for each integer r there exists a unique integer s and blocks BI, 8;,. . . BA such that ( X ~ > - ~ B , ~ , - -=- ~ , , , BIB;---BA(xy)'. It follows that an equation of the form y - ' z y " = ( x y ) - ' z ( x y ) " ' can only hold if y-"Bl = ( x y ) - ' B I ( x y ) ' y " for some integers t and u. Let a I = 9nt - k , where 0 s k < 9n. The restrictions o n ai and pi ensure that, in calculating y-'BI for any 0 S r < 9n, the only values of p , u and T used are those on the negative integers, that is, those for which definitions were made in 01. An inspection of the relations (i) to (x) made in the light of the above comments, shows that k = O is the only possibility. Now relations (viii) and (ix) yield z y 9n-1 - 1 = ( x y ) (y -(9n-l0) zy 9n -lo) ( x y )-I
Pa2-k(-l.81)a
X(Y
-(9n-11)
= (x y ) - ( 9 n -
I
-I
( x y )9n-
- (y-(9n-ll)zy9n-ll =
-(9n - lo)z - 1
y z-I
.
I1
-(9n-10)zy 9 n - 11)
and, similarly, relations (i), (ix) and (x) yield (
-(9n - 9 ) z y
9n-9)x
-I
) (y -(9n-8) zy 9n-8) ( x y ) - ' - ( x y )-(9"-9)* ( x y )9n -9
= (x,,
Yn-lO)-l
J.M.T. Jones
240
The left-hand side of each of these equations is a product of m blocks, whereas the right-hand side of the second equation is P -(yn - lo) BIB2---B, p-1BIB2---B,py"-Y.Thus some cancellation between blocks must take place, and we obtain a series of equations of the form B,,,p-'BI= p'i for some integer r l , Brn-,p'1B2= p'z for some integer r2, etc., the cancellation continuing until we have obtained a product of m blocks. Suppose m > 1. After cancellation, we obtain an expression beginning with a block of the form p - ~ n + ~ o + o , a p l k(-yn+in+ol.pl) , and so it follows that X ~ - ( ~ " - 'must ) B ~ take P the form p - Y n + l O + o , a P l p k ( - Y n + l O + o ~ ' P l ) p ' qfor s some integers r and s. Now a 1= 9nf, and the expression X ~ - ( ~ " can - ~ )be B evaluated ~ for each possible congruence class of P I mod 9n. The only congruence classes for P I which give an answer in the correct form are P I = 0 or - 1 mod 9n. Suppose P I = - 1 mod 9n, and consider the equation
(y -(Yn-
I I)zy
Yn-I
-1
=
-(Yn-Ifl)
z
-I
YZ-lYYn-l1.
We have already noted that Brnp-'BI= p'i for some integer T I , so B i ' = p - l B l p - ' l , and the left-hand side of our equation must begin in a block - 9 n + 9 + a l a P I P k ( - Y n + Y + a , . P , ) and it is easy to check that for PI = - 1 mod 9n, this is not the case. Thus there are integers t l and k,, t l > max (2, k l ) such that B,= Ynl, a 9nk,p -9nrl+Ynk , . We next consider B 2 . Arguments similar to those used above show that, if m > 3, there are integers f 2 > 2 + t l - k, and k 2 < f 2 - t l + k, such that B,= pynr2aynk2p-ynr2+ynk2 . We continue in this way for as many blocks as possible, and note that we also have equations of the form B,p-'BI = p'l, B,-lp'~B2= p'2, etc. The form of B l , B2, etc. shows that r, = r2 = * * = - 1. Thus we have proved that if m > 1, then either (i) m is even and z = BIBZ---B:,~-'(BIB,---B~,)-'~, where for each i = 1,2,. . . ,4m B,= 9maa 9nk,p -9n18 +9nk for some integers I, > 2 and k,, or (ii) m is odd and z = B I B 2 - - - B ~ ~ m - l , B ~ ( , + l ~ p ~ ' ( B l B z - - - B ~where ( m - I )€or ) ~ 1each p i= 1,2, . . .i(m - I), B,= p9nraa9nk,p-9*n+9nk for some integers 1, > 2 and k,, and B;(,+,) = p9'"a'ps for some integers r, s, r. In case (i) an easy calculation shows that y-4zy4 = ( ~ y ) - ~ z ( x y and )~, since x y = q # 1, relation (ii) fails to hold. Thus m is odd, and we have case (ii). Again we consider the equation ~ y - ' ~ " - ~ ' z y ~ " -=~ x - ' -(Yn-IO)zy -Izy 9"-9 . The right-hand side of this equation is 1
1
On isomorphisms of direct powers
p -(vn-v)B i B z - - - B t(,,, - I )B,(:
+I
) p-'B!(,
+
24 1
-I - I))-'pYn-* l)p (BIBZ---Bj(,,,
and this must be a product of m blocks. It follows that k . B;(,+l,p-lB+(,,,+I,p IS a block for some integer k , that is pVnra rp s- I + Y w a r p + k is a block. This is possible only if s = - 9nt + 1, and this can occur only if r = 1. Thus Bi(,,,+l) = p9"'ap-'n''', as required. The case m = 1 is easily dealt with, by similar methods, and results in = p'"' ap-'"'+I for some t > 2 . I
97. The possible isomorphisms between G and Gp = @Vnap-Vn+I,plXnap-~~n+I , . . . ,p '"lap-yn'+ I , . . . ). L~~ = u - l z y - I u y Then x, y , 2, u, u generate G, and, by projecting onto each S, (i E N), it is easy to check that the following relations hold in G : (i) u - l y u = y " (ii) u-luu = u 2 (iii) x - I u x = u 2 (iv) x - l i x = i (v) x ( y - ' i y ) x - l = y - ' i y (vi) x(y-"iy")X-' = y - ' 3 i y l 4 (vii) y-'iy'= ( ~ y ) - ~ i ( x y ) ' . Suppose P N is such that G = Gpx GN,P,and cp : G --* Gp is an isomorphism. cp(x), q ( y ) , cp(i),cp(u), cp(u) generate Gp and satisfy the , ~(i),, above equations. It follows that for each i E P, ~ ( x ) , v(J),, cp(u), and cp(u), generate S, and satisfy t h e above equations. Their values are, therefore established, within an inner automorphism of S,, by Lemma 6.10. Let CI,Cz,. . . C, be the congruence classes of n k found in Proposition 5.1, and suppose t > O . Let C, be the congruence class of I, mod n k . For each of the listed generators g of G, q ( g ) is equal modulo D to a product of commuting words cpl,(g) on the elements x(I,, k ) , y(I,, k ) , z(I,,k , k ) , u(I,,k ) and u(L, k ) . We restrict our attention to one of these words, with 1, = 1, say. Now t h e results of $5 show that, provided k is chosen sufficiently large, a necessary and sufficient condition for cpI(g), to lie in T, for all but finitely many i = I mod n k is that there is an element 5 E D, and a word w on the generators x(I, k ) , u(I, k ) and u(I, k ) such that q c r ( g= ) (w. The results of 86 then show that there is a word gr on x(I, k ) , y(I, k ) , r(l,k , k ) , u(I, k ) and u(I, k ) such that VI(X)=
5ig;'x(I, k)gr
C P , ( U )= S z g ; ' U ( l
k)gr
1.M. T. Jones
242 (pi ( u ) =
t3g ; ' u ( I k )gr
where t,,t2,t 3 ED. We n e x t consider the word g l ( p l ( y ) g ; ' . By the results of $6, t h e projection of this element o n S, is equal to a-"pP aa (some a , P ) for all sufficiently large i = 1 mod nk. The results of §5 force gl(pl(y)g;' = t4y(1, k ) @ for some t 4 E D. It then follows by Lemma 6.11 that /3 = 1. Now consider g l ( p l ( i ) g ; ' . Lemma 6.10 shows that t h e projection of this element on S, must lie in U, for all sufficiently large i = 1 mod n k . We conclude, from the results of $5 that g , ( p , ( f ) g ; ' = t5uIu2---ur where t5E D and each y is a word on finitely many of the generators of u l , k . . , , and where s,+' # s, for each ] = 1 , 2 , . .. r - 1. We now project 5 5 ~ 1 ~ 2 - -onto - ~ , S, where i is a very large integer congruent to 1 mod nk, to obtain a word of the form p a ~ a P ~ ~ u ~ a P ~ - - - p o ~ a P - p n ~ + ~ where t h e a, and P, satisfy the conditions of Lemma 6.12. It can easily be checked (by projection o n each S,, i E N ) that the elements x, y, z of G satisfy equations (i) to (x) of Lemma 6:12, and hence so also do (p,(x),, (p, ( y ) , , (p, (i),for each i = 1 mod n '. We may, therefore, use Lemma 6.12 to obtain information about the projection of ~ 5 u I u ~ - - - u , o n S, for all sufficiently large i = 1 mod nk. We see that g l ( p l ( i ) g ; ' can be written in t h e form g,(p1(i )g
=
= ~ ~ ( W ~ W ~ - - - W , , , )kY) '( "I ,' ~ ( lk, , s ) Y ( ~k)-'"'y(I, ,
k ) - ' ( w i ~ *. .. w , , , ) - ' Y ( ~k, )
where t h E D, and each w, is a word of the form y(1, k)9n'i(z(1,k, s,)y(I,k)-')9"kny(1,k)-y"'~+y"k~ for some integers s, t, k,, s,, t,, with, for each i = 1 , 2 , . . . rn - 1, either s,+, # s, or s,+' = s, and t,+i # t, - kt. Set glcpr(i)g-' = h. Now x(1, k ) , y ( l , k ) , u(1, k ) , u(1, k ) and h generate G,, and so there is a word w on these generators such that z(1, k , k ) = w(x(1, k ) , y ( l , k ) , u(l, k ) , u(1, k ) , h ) . For all suitable t, let h ( ' )be the element obtained from h by replacing each occurrence of z(1, k , r ) (any r ) in h by z(1, k, r + t ) . The analysis of $5 can now be applied to the group ( x ( l , k ) , y ( l , k ) , u(1, k ) , u(l, k), h ) . Analogues of Lemmas 5.3 to 5.6 with h taking the place of ~ ( 1 k, , k ) and h") taking the place of z(1, k , k + t ) can be obtained (a certain amount of care being necessary about t h e range of values that t is allowed to take). It thus becomes clear that the equation z ( l k , k ) = w(x(1, k ) , y (1, k ) , u ( l k ) , 4 1 ,
k), h )
is possible only if h = t 6 y ( I ,/ ~ ) ~ " 'k, z (sl),y ( l , k)-"" for some t 6 E D, and some integers s, t.
On isomorphisms of direct powers
243
We now put together the results we have achieved so far to obtain
Proposition 7.1. Suppose P is an infinite subset of N such that G = Gpx GN,Pand suppose cp : G + Gpis a n isomorphism. Then there exist integers k , m , and t n k and congruence classes C I ,C2,. . . Cl mod n k such t h a t P = X U C I U C 2 U . . . U C ,w h e r e X C { l , 2 , ... m } , and, if C, is the congruence class of 1, mod n', there exist elements g, of Get, integers s,, t, and elements &, &, e4, e5 of D such that
el,
Moreover, each s, is greater than or equal to two. Proof. Only the inequality for s, remains to be proved. Let s, = min s, and suppose that s, = 1. The equation 9nt, - 9n'v = 9nt, - 9nS:rlmay be rewritten as t, - t, = ns,-'r - r l , and so it has a solution for the pair (r, rl). Moreover, if (r, r l ) is one solution, then ( r + i f rrI + r'n'd-'!) is another. Now G , C G, for each i = 1,2,. . . t, and consequently (ignoring a few terms of the direct product) we have Gc, Gp.There is therefore a word w on cp(x), cp(y), c p ( f ) , cp(u) and cp(u) whose projection on S, is trivial for all sufficiently large i fiZ C,, but non-trivial for infinitely many i E C,. But a consideration of the equation 9nt, - 9 n ' ~= 9nt, - 9nSlrlshows that the projection of w on the r t h term of Cl is a conjugate of the projection of w on the r,th term of C,, and this pattern is repeated for all sufficiently large r. Thus the projection of w on S, is trivial for all but finitely many i E C,, and we have our contradiction. 0 This establishes the form of all possible isomorphisms between G and Gp.We will use this information in the next section to show that G Z G' for any 1 < r < n.
J.M.T. Jones
244
88. The non-isomorphisms We are now ready to prove the final theorem. Theorem 8.1. G $ G' for any r such that 1 < r < n. Proof. Suppose that G = G px GNiP,and that G = Gp. We first show that P is infinite. Suppose not. An easy adaptation of Proposition 4.4 shows that given any finite integer k , the only possible non-trivial decomposition of S' as a direct product is of the form S' = S' x S'-' for some r, 1 zs r < k. It follows that S' can be decomposed into a direct product in only finitely many different ways. But Lemma 4.1 shows that G can be factored as a direct product with an arbitrarily large number of factors. Thus G $ S' for any k. Hence P is infinite, and so the result of Proposition 7.1 holds. We will prove that the number t of congruence classes mod n k given in Proposition 7.1 is congruent to 1 mod n - 1. We proceed by induction on t. If t = 1 the result is proved, so assume t > 1. Now the proof of Theorem 3.1 shows that the normal closure in G of the commutator [ x - ' , z - ' ] is a subgroup of G isomorphic to G. q induces an isomorphism between this subgroup and some subgroup of G, We will consider the form of this induced isomorphism, and hence establish the basis of an inductive proof. Now z = u i y - ' u - ' y , and so we have
q ( r )= 5 6
fl gll(u(l,,k ) ~ ( 1 , , k ) ~ " ' ~k,s,)y(I,, z ( 1 , , k)-9"7(I,,k)-'u(1,k)-'~(l,k))g,' ,=I
Since s, 3 2 for each i, this is 56n:=,gI,(y(1,,k)"iz(I,, k , s , - J y ( l , , k)"i)g,'. We now consider the projection of cp([x-', z - ' ] ) on S, for large i = 1, mod nk. If t, is not congruent to one modulo n, cp([x-', z-']), = 1. Consequently, if each of the t, satisfies t,f 1 mod n, then ~ ( [ x - ' 5, ' 1 ) lies in D, and so cp induces an isomorphism between G and S k for some k . We have already seen that no such isomorphism exists, and so there are some integers i, 1 c i I, such that t, = 1 mod n. Set R = uI,-,(,,)C,. Then, if we ignore a few terms of the direct product, cp induces an isomorphism between G and GR. We investigate the form of this isomorphism. The calculations made in the proof of Theorem 3.1 show that if 4 is the induced isomorphism, then there exist elements vl, qt, v3, v4, q 5 fo D such that
On isomorphisms of direct powers
245
Similarly, (as in Theorem 3.1) t h e normal closure of (y9n-9Zy-(9n-9) ) -I ] in G is another subgroup isomorphic to G. If we apply t h e above arguments to this subgroup, we obtain similar results for those congruence classes C,for which t, = 2 mod n, and, continuing in this way, we derive similar results for those congruence classes C, for which t, = r mod n, for each r, 1 < r < n. Our inductive hypothesis allows us to conclude that the classes C, can be divided into n sets, each containing a number of classes congruent to 1 mod n - 1. Thus t is congruent to ( n x 1) mod n - 1, that is, t = 1 mod n - 1. Now suppose that G = G ' with 1 < r < n. Then there is a partition {PI,P2,.. . P,} of N such that G = GP,x GP2 x . . . x Gp,,and each Gp,= G. By the above reasoning, each set P, consists of a subset of D, and a number t, of congruence classes modulo n k (where k is taken sufficiently large to be the same for each GP,),such that t, = 1 mod n - 1. Thus the total number of congruence classes i n N = PI U P2U... UP, is congruent to r x 1 = r mod n - 1. But the total number is n k ,which is congruent to 1 mod n - 1. Thus r = 1 mod n - 1, contradicting 1 < r < n. Thus, G f G' for any r such that 1 < r < n, and t h e result is established. [x-l,
References [l] R. Camm, Simple free products, J. London Math. SOC.28 (1953) 66-76. [2] A.L.S. Corner, On a conjecture of Pierce concerning direct decompositions of Abelian groups, Proc. Colloq. Abelian Groups (Tihany, 1963) (Akademiai Kiado, Budapest, 1964) pp. 43-48. [3] J.M. Tyrer Jones, Direct products and the Hopf property, J. Australian Math. SOC. Vol. 17, Part 2 (1974) 174-196. [4] W. Magnus, A. Karrass and D. Solitar, Combinatorial group theory, Pure and Applied Mathematics, Vol. 13 (Interscience Publishers, 1966). [5] C.F. Miller, 111, On group-theoretic decision problems and their classification, Annals of Maths. Studies, No. 68 (Princeton University Press, 1971).
S.I. Adian, W.W. Boone, G . Higman, Word Problems I1 @ North-Holland Publishing Company (1980) 247-254
CANCELLATION THEORY IN FREE PRODUCTS WITH AMALGAMATION Roger C. LYNDON* University of Michigan, Ann Arbor, Michigan
The technical details of the original talk appear elsewhere [27], and need not be repeated here. Instead, we first discuss the general context of the problem under consideration, and then give a brief summary of the specific results obtained. 01. Cancellation theory in group theory
There are two principal methods, admittedly related, in the study of free groups, free products, free products with amalgamation, HNN extensions, and similar group constructions. One is the method of cancellation arguments, already used extensively by Nielsen [31, 32, 331. The second, equally old but less systematically used until recently, derives from topology, and is often described as using covering space arguments. One could say that cancellation arguments are based on nothing more than the sheer group axioms, while covering space arguments employ elaborations of Cayley’s observation that every abstract group can be realized as a group of transformations of a suitable object. For general subgroup theorems, covering space arguments give by far the shortest and most transparent proofs. The most elegant theory at present seems to be that of graphs of groups, or groups acting on trees, introduced by Serre and Bass [43]. The fundamental theorem of this theory can be developed without topology, as is done by Serre and Bass, although a very simple proof can be given, using only a minimum of the most elementary topology, following the ideas of Tretkoff [MI. Once the fundamental theorem is known, no further topology is needed to obtain immediately a subgroup theorem for graph products of groups, which contains all the main classical subgroup theorems. * The author gratefully acknowledges support of the National Science Foundation 247
248
R.C. Lyndon
Chiswell [4] has used this theory to obtain a fairly direct and transparent proof of the Grushko-Neumann Theorem, which is comparable in its simplicity (and presumably related) to the topological proof by Stallings [45]. A number of the more technically specialized theorems concerning subgroups of groups of the sorts under consideration have been given improved statements and proofs by Cohen [7, 81, exploiting t h e implicit interpretation of the Serre-Bass theory in terms of graphs of cosets. The theory has other applications, as noted by Serre, notably to the structure of linear groups, especially PSL2’s, where it is closely related to a fruitful method of Behr [ l , 21. There remains, nonetheless, an area where cancellation arguments appear indispensable. Cancellation arguments themselves can be divided into two kinds. There are those, exemplified historically by the work of Dehn [9], that are concerned typically with the question of membership in the normal subgroup defined by a given set of elements; of this theory, which contains most of small cancellation theory, we shall say no more here. The remaining area is exemplified by the problem of characterizing the elements of a subgroup generated by given elements. A first example is Nielsen’s proof of his subgroup theorem. For this limited purpose his method is hardly preferable to that of Schreier [a] (which can be interpreted topologically), and is certainly inferior to the standard simple covering space argument; but it is very fruitful for other purposes. First, it gives immediately a set of generators for the automorphism group of a finitely generated free group (Nielsen [31, 32, 33]), and so also for G L ( n , Z ) . A most important direct outgrowth of this method is the theorem of Whitehead ([46, 471; see also Rapaport [36], Higgins and Lyndon [17]), giving an effective criterion for two finite sequences of elements from a free group to be equivalent under automorphism. From this has grown in turn McCool’s [28, 291 recovery, in simple form, of Nielsen’s finite presentation of the automorphism group of a finitely generated free group (and with it of GL(n, Z)), as well as McCool’s [30] finite presentations for stabilizers in this group, and, consequently, for the classical mapping-class groups. In another direction, Nielsen transformations and their generalizations play an important role in the study of low-dimensional topology. In the original setting of free groups, the main role of cancellation theory lies in transforming a given finite sequence of elements into one that is Nielsen reduced. A Nielsen reduced sequence is then a basis for the subgroup it generates. (An elegant refinement of t h e Nielsen reduction process was presented at this conference by Hoare [18].) This cancellation argument was formulated axiomatically by Lyndon [22],
Cancellation theory in free products
249
using length functions, to obtain a uniform and sharper form of the Nielsen-Schreier and Kurosh subgroup theorems, as well as a very down-to-earth proof of the Grushko-Neumann Theorem ([23]; see also Cohen [6]). Chiswell [5] has recently shown a close connection between the theory of length functions and the Serre-Bass theory. Cancellation theory has long been applied to free products, free products with amalgamation, and HNN extensions, to the extent of providing normal form theorems for such groups (Britton’s Lemma, in the case of HNN groups), and relative solutions of the word and conjugacy problems. But the role of Nielsen transformations and the status of Nielsen reduced sets had received only small, though important, consideration. In 1970 Zieschang [54] initiated an extension of the full theory, with the necessary modifications, to free products with amalgamation. The essential difficulty in such an extension is that a Nielsen reduced set, even with any reasonable modification of the definition, cannot be expected to be a basis for a free group..However, Zieschang managed to pinpoint the special configurations that must occur in the exceptional cases. By these means he, later together with Peczynski and Rosenberger [35], managed to determine the ranks of all Fuchsian groups. This method was used also by Rosenberger [37] to analyze the situation in which an analog of the Grushko-Neumann Theorem fails for free products with amalgamation. The work to be described here uses Zieschang’s formalism to obtain a proof of a version of the theorem of Karrass and Solitar [19] on subgroups of a free product with amalgamation. Although the proof is admittedly lengthy, we believe that the detailed information obtained is instructive and should be of use in other applications. Indeed, the result as obtained contains the main content of Zieschang’s theorem. We mention only one application that we have immediately in mind. Recent work of Edmunds and others [3, 10, 11, 12, 13, 14, 151 refines earlier work on cancellation theory in free products to extend significantly work of Lyndon, McDonough, Newman, Rosenberger, and Zieschang [21, 24, 25, 26, 38, 39, 51, 521 on solutions of quadratic equations in free products, as well as work of Schupp [41, 421 and Wicks [48, 49, 501 on this and on the closely related endomorphism problem: given two elements w and w ’ in a free group F, when is there a map from F into itself carrying w into w’? A suitable analogous result for free products with amalgamation should put in a more general setting the result of Nielsen ([34]; see also Zieschang [53]) on lifting automorphisms of surface groups. The theory under discussion already gives some promise of revealing to what extent Edmunds’ work carries over to free products with amalgamation.
R.C. Lyndon
250
42. Cancellation arguments
We sketch below, omitting details, the derivation of a form of the Karrass-Solitar Subgroup Theorem for free products with amalgamation, using the method of Zieschang. Let G = * A {G, : A E A}, the free product of groups G A with a common subgroup A amalgamated. Every element of G either belongs to A or can be written in the form w = u1 u,, n 3 1, where each ui is in some GA, no ui is in A, and no successive ui, ui+,are in the same GA.We define the length of an element w in A to be 1 w 1 = 0; otherwise w determines the number n uniquely, and we define I w l = n. We write u , . . u, = 11, * * u, to mean that equality holds and that the right member is reduced in the sense that 1 u1 * * u, I =
---
-
I
u11+ * * *
-
+ 1 u. I .
Following Zieschang, we make constant use of the fact that every element of G can be written in the form
U'phq-',
IpI=lql, J h l S l .
Here u uniquely determines pA, qA, and A h A . If 1 h 1 = 0 we may choose h = 1. If p A = q A we may refactor, writing u =ph1p-', (h,l = 1. Paralleling the usual development of the Nielsen theory for free groups, we define a well-ordering of the cosets A p such that (1) if l P l < l 4 l t h e n p A < q A , (2) ifp =plp2, q = q l q 2 , Ipll = lqll,Ipzl= /q21, and p l A < q A , then pA
U={u=php-':JhI=l}.
U = U{G:: A E A , g E G } .
We write
V
= { G : :A E A , g
E G}.
Making the obvious generalization of the concept of tree product introduced by Karrass and Solitar [19], we show that the set U is the tree union of a graph r whose vertices are the sets in the family h., n 2 1, all I hi 1 = 1, and % U { A } .In fact, if C = Gfofor g = h l h , G,, we define C o= A g and C ' = G:;"'h".If C = G A , we define C o= C' = A. Then we complete the definition of the graph by joining each pair of vertices C and C' by an edge which we take to be (a replica of) Co.It is easy to see that r is in fact a tree, and that U
- -
r
Cancellationrheory in free products
251
is essentially the disjoint union of the vertex sets C with the edge sets Co amalgamated. It is now not difficult to show that G is the quotient of the tree (now in the category of graphs of groups rather than of product of sets) modulo a set of conjugation relations u " = w, for u, u, w in U. We are now ready to relativize these ideas to a subgroup G * of G. We define V * to be the set of C* = C n G * for C E V, and we define U * =uV*and N = G p U * . The map V + V * induces a map T+T*, where r* is a tree with vertex sets C * and edge sets Co*.We have, as before, (3) U * is the tree union of T * , (4) N is the quotient of the tree product of r* modulo the conjugation relations. Further, let 9 C V * be the set of all C * = Gf-'n G * for which Ap is minimum under conjugation of C* by elements of N. Then 9 spans a subtree A of r*,and (5) N is the tree product of A . To find a complement F of the normal subgroup N of G * we follow Federer and J6nsson [16]. We define an order g l < gz on elements of G * to hold if ( g l ( < ( g 2or( if ( g l ( = ( g zwith ( g;''p,hq;', g ; ' = P*hzqi', I p i ( = I p z ( = ( q i ( = ( q z ( ,J h i J = ( h z ( cpiA l , SqiA, pzA sqzA, and either plA C pzA or else plA = pzA and qlA < q2A.Now we define
r
S = { g :g
kZ G p ( N U { h : h < g })}.
We find then that (6) S is a basis for a free group F, (7) G * = FN, (8) F n N = l . In short, G * is a split extension of N by t h e free group F. In other words, G * has a presentation with generating set U U S and with defining relations a set of relations among the elements of Lr sufficient to define N, together with a set of relations u " = w, for u, w E N and u E S, sufficient to define the action by conjugation of F on N. The details of the proof hinge on a series of unfortunately rather complicated lemmas, which can all be combined into one. In this lemma we consider a finite sequence (T = ( u l , . . .,u.) of elements of U U S U S-'. We show, roughly, that if the product is trivial, u1* u, = 1, then the sequence (T can be reduced to the empty sequence by a succession of steps of two kinds: first, elementary Nielsen transformations (associated in the usual way with a pair or triple of consecutive terms of the sequence), including deletion of a
-
252
R.C. Lyndon
term ui = 1; second, application of the permissible defining relations mentioned above to replace a pair or triple of adjacent terms by another with the same product. As noted, full details are given in [27].
References [I] H. Behr, Uber die endliche Definierbarkeit von Gruppen, J. Reine Angew. Math. 211 (1962) 116122. [2] H. Behr, Uber die endliche Definierbarkeit verallgemeinerte Einheitengruppen, J. Reine Angew. Math. 211 (1962) 123-135. [3] R.G. Burns, C.C. Edmunds and E. Formanek, The equation s-'r-'sr = u - ' u - ' u u in a free product of groups, Math. Z. 153 (1977) 83-88. [4] I.M. Chiswell, The GrushkeNeumann theorem, Proc. London Math. Soc. 33 (1976) 385-400. [ 5 ] I.M. Chiswell, Abstract length functions in groups, Math. Proc. Cambridge Philos. SOC.80 (1976) 451463. [6] D.E. Cohen, Groups of cohomological dimension one, Springer Lecture Notes 245 (1972). [7] D.E. Cohen, Subgroups of HNN groups, J. Austral. Math. Soc. 17 (1974) 394-405. [8] D.E. Cohen, Finitely generated subgroups of amalgamated free products and HNN groups, J. Austral. Math. Soc. 22 (1976) 274-281. [9] M. Dehn, Uber unendliche diskontinulerliche Gruppen, Math. Ann. 72 (1912) 116144. [lo] C.C. Edmunds, On the endomorphism problem for free groups, Comm. Algebra 3 (1975) 7-20. [ l l ] C.C. Edmunds, Some properties of quadratic words in free groups, Proc. Amer. Math. SOC.50 (1975) 20-22. [I21 C.C. Edmunds, Products of commutators as products of squares, Canad. J. Math. 27 (1975) 1329-1338. [13] C.C. Edmunds, A short combinatorial proof of the Vaught conjecture, Canad. Bull. Math. 18 (1975) 607-608. [14] C.C. Edmunds, On the endomorphism problem for free groups 11, Proc. London Math. Soc. 38 (1979) 153-168. [IS] C.C. Edmunds, On words of minimal length under endomorphisms of a free group, Math. Proc. Camb. Phil. Soc. 83 (1978) 191-194. [16] H. Federer and B. Jbnsson, Some properties of free groups, Trans. Amer. Math. SOC.68 (1950) 1-27. [17] P.J. Higgins and R.C. Lyndon, Equivalence of elements under automorphisms of a free group, J. London Math. SOC.8 (1974) 254-258. [18] A.H.M. Hoare, On Nielsen methods in free groups. [19) A. Karrass and D. Solitar, The subgroups of a free product of two groups with an amalgamated subgroup, Trans. Amer. Math. SOC.150 (1970) 227-255. [20] A. Karrass and D. Solitar, Subgroups of HNN groups and groups with one defining relation, Canad. J. Math. 23 (1971) 933-959. [21] R.C. Lyndon, The equation u 2 b 2= c 2 in free groups, Michigan Math. J. 6 (1959) 155-164. [22] R.C. Lyndon, Length functions in groups, Math. Scand. 12 (1963) 209-234. 1231 R.C. Lyndon. Grushko's theorem. Proc. Amer. Math. SOC.16 (1965) 822-826.
Cancellarion rheory in free producrs
253
[24] R.C. Lyndon, On products of powers in groups, Comm. Pure Appl. Math. 26 (1973) 781-784. [25] R.C. Lyndon, T. McDonough and M. Newman, On powers of products in groups, Proc. Amer. Math. Soc. 40 (1973) 419420. [26) R.C. Lyndon and M. Newman, Commutators as products of squares, Proc. Arner. Math. Soc. 39 (1973) 267-272. [27] R.C. Lyndon and P.E. Schupp, Combinatorial Group Theory, Ergebnisse der Mathernatik, Bd. 89 (Springer, Berlin-Heidelberg-New York, 1977). [28] J. McCool, A presentation for the automorphism group of a free group of finite rank, J. London Math. Soc. 8 (1974) 259-266. [29) J. McCool, On Nielsen’s presentation of the automorphism group of a free group, J. London Math. Soc. 10 (1975) 265-270. [30]J. McCool, Some finitely presented subgroups of the automorphism group of a free group, J. Algebra 35 (1975) 205-213. [31] J. Nielsen, Die Isomorphismen der allgemeinen unendlichen Gruppe mit zwei Erzeugenden, Math. Ann. 78 (1918) 385-397. [32] J. Nielsen, Uber die Isomorphismen unendlicher Gruppen ohne Relation, Math. Ann. 79 (1919) 269-272. [33] J. Nielsen, Die Isomorphismengruppe der freien Gruppen, Math. Ann. 91 (1924) 169-209. [34]J. Nielsen, Untersuchungen zur Topologie der geschlossen zweiseitigen Flachen, Acta Math. 50 (1927) 189-358. [35] N. Peczynski, G. Rosenberger and H. Zieschang, Uber Erzeugende ebener diskontinuierlicher Gruppen, Invent. Math. 29 (1975) 161-180. (361 E.S. Rapaport, On free groups and their automorphism, Acta Math. 99 (1958) 139-163. [37] G. Rosenberger, Zum Rang- und Isomorphieproblem fur freie Produkte mit Amalgam (Habilitationschrift, Hamburg, 1974). [38] G. Rosenberger, Anwendung der Nielsenxhen Kurzungsmethode in Gruppen mit einer definierenden Relation, Mh.Math. 84 (1977) 55-68. [39] G.Rosenberger, Produkte von Potenzen und Kommutatoren in freien Gmppen, J. Algebra 53 (1978) 416422. [40]0. Schreier, Die Untergruppen der freien Gruppen, Abh. Math. Sem. Univ. Hamburg 5 (1927) 161-183. [41] P.E. Schupp, On the substitution problem for free groups, Proc. Amer. Math. Soc. 23 (1969) 421-423. [42] P.E. Schupp, Quadratic equations in groups and cancellation diagrams on compact surfaces, in this Volume. [43] J.-P. Serre, Arbres, amalgames, SL, AstCrisque 46 (Soc. Math. de France, 1977). [44] M. Tretkoff, A topological approach to HNN extensions and the theory of groups acting Qn trees, preprint. [46] J.R. Stallings, A topological proof of Grushko’s theorem on free products, Math. Z. 90 (1965) 1-8. [46]J.H.C. Whitehead, On certain sets of elements in a free group, Proc. London Math. SOC. 41 (1936) 48-56. [47] J.H.C. Whitehead, On equivalent sets of elements in a free group, Ann. of Math. 37 (1936) 782-800. [48] M.J. Wicks, Commutators in free products, J. London Math. SOC. 37 (1962) 4 3 3 4 . [49] M.J. Wicks, A general solution of binary homogeneous equations over free groups, Pacific J. Math. 41 (1972) 543-561.
254
R.C. Lyndon
[SO] M.J. Wicks, A relation in free products, Proc. Conf. Canberra 1973. Springer Lecture Notes 372 (1974) 709-716. [51] H. Zieschang, Uber Worte SflS;z.. . S p in einer freien Gruppen mit p freien Erzeugenden, Math. Ann. 147 (1962) 143-153. [52] H. Zieschang, Alternierende Produkte in freien Gruppen I, 11, Abh. Math. Sem. Univ. Hamburg 27 (1W) 13-31; 28 (1%S) 219-233. [53] H. Zieschang, Uber Automorphismen ebener diskontinuierlicher Gruppen, Math. Ann. 166 (1966) 148-167. (54) H. Zieschang, Uber die Nielsensche Kiirzungsmethode in freien Produkten mit Amalgam, Invent. Math. 10 (1970) 4-37.
S.I. Adian, W.W. Boone, G. Higman, eds., Word Problems I1 @ North-Holland Publishing Company (1980) 255-259
REPRESENTATIONS OF AUTOMORPHISM GROUPS OF FREE GROUPS W. MAGNUS and C. TRETKOFF Polytechnic Institute of New York, Couvan Institute of Mathematics, New york
Let F,, denote t h e free group of (finite) rank n, let A, denote its automorphism group and A z its group of automorphism classes, i.e., , I . is the group of inner the quotient group A n / I nwhere automorphisms. A finite-dimensional faithful linear representation of A f over the complex numbers C is well known. In fact, A = GL(2, Z), t h e general linear group of degree 2 over the integers. N o finite-dimensional faithful linear representations over any field are known for A *, if n > 2 or for A,. We conjecture that they d o not exist, but we have not been able to prove this. However, we shall contribute to the problem and we shall try to explain why it may deserve some interest. We shall show first that it would suffice to prove our conjecture for Az. We have Theorem 1. A 2 is isomorphic with a subgroup of A *, for all n > 2.
Since A :+, 3 A z , it suffices to prove this for n = 3. Let a, 6, c be free generators of F3. Let a be an automorphism of F2 with free generators a, 6. Then a also acts o n A , as an automorphism by defining a ( c )= c. But unless a is the identity automorphism, its action on A, is never that of an inner automorphism. Next, we observe that t h e obvious methods for proving our conjecture d o not apply. The groups A. are residually finite according to a theorem of Baumslag. See [l]. The groups Al: are residually finite according to E. Grossman [2]. Therefore, Malcev’s Theorem (see [4]) according to which finitely generated linear groups over a field are residually finite, cannot be used to prove our conjecture. The same remark applies to Selberg’s Theorem [5] according to which a finitely generated finite-dimensional linear group over C has a torsion free subgroup of finite index. Indeed, this is most easily seen to be satisfied by Az which has a subgroup S of finite index with t h e following 255
256
W.Magnus, C.'Iketkoff
property: S is the extension of a free group Zz of rank two (the group of inner automorphisms of Fz) by another free group of rank 2, namely the group generated by the elements represented by the matrices
c 3'(; 3
of GL(2,Z). This remark also shows that Az cannot contain ''large'' abelian subgroups the existence of which might contradict the existence of a linear representation. We now prove
Theorem 2. Z f Az has a finite-dimensional faithful linear representation R over C , then at least one of the irreducible components of R represents Az faithfully. Proof. Let RA, A = 1,. . ., I , be the irreducible components of R. (Of course, R need not be fully reducible, i.e., R need not be the direct sum of the RA.) Assume first that none of the R , represents Zz faithfully, and let K , be the normal subgroup of ZZ represented by the identity in R,. Let (KA, K , ) denote the normal subgroup generated by commutators of elements of K , with elements of K,. Then (K,, K , ) C KA f l K,. Since Zz is free of rank 2, it follows that the intersection K of all of the K, is free of rank 3 2 . On the other hand, K would be represented in R by supertriangular matrices which would form a solvable group. Therefore R itself could not represent ZZ faithfully. Suppose now that, in RA,an automorphism a # 1 would be represented by the identity. Let I ( w ) denote the inner automorphism which maps any element f in Fz onto w-'fw, where w € Fz.Since a-'Z(w)a = I ( a w ) where aw is the image of w in Fz under the action of a, we would have that, in R,, the inner automorphism
I ( a w ) Z - ' ( w )= Z ( ( a w ) w - ' ) would be represented by the identity. This can happen only if ZZ is not represented faithfully in R, since there does not exist any automorphism a # 1 which maps every w E Fzonto itself, and since the center of F2 is trivial. We shall now analyze the possibilities for unitary representations over C of A2.We note that there exist such representations of finite quotient groups in which both the image of ZZ and of its quotient group are non-solvable. See Stork [6].
A presentation of A, can be given as follows: Let x, y be free generators of the free group F,. We present the group A2 of its automorphisms as follows: Generators. PI, uI,uz,U,V, U*,V*,defined as:
-
P: x + y , y + x ,
u1:
u: x +xy, u*:x + x ,
v:x + yx, y y. v*:x - x , y - x y .
y + y, y+yx,
x+x-',
y+y,
u2:x - x ,
y-y-'.
If w E F2 is any word in x, y, the inner automorphism Z ( w ) is defined by Z ( w ) : x +w-'xw, y +w-'yw.
We denote Z ( x ) , Z ( y ) respectively by X, Y and we have (1)
x = v*-1u*, y = v-'u.
For any a E Az, we have (2)
a-'Z(w)a =Z(a(w))
where a ( w ) is the image of w under the action of a . The defining relations for Az can be put into the form [3, p. 1621
(5)
P2= a:= a:= 1, UlU2 = U2U1, PU'P = uz PUP = u*,PVP = v* UIUUI = v-',U,U*U, = v*-'
(6)
u2uuz=
(7)
u,vu,= v-I,u1 V*UI =
(3) (4)
(8) (9)
U-I, UIU*UI =
u*-'
v*-' U-'u*V-'=U'P,uv*-'v = uv = vu,U * V *= V*U*.
u2P
As an easy consequence of (l), (4), (8), we find
(10)
u-lxu = x u ,
a relation which also would be derived from (2) and from the definition of u. To build up a unitary representation of A,, we start with the maximal abelain subgroup generated by U and V. We can diagonalize U and V simultaneously and we may assume that U (which, according to (lo), is conjugate with U-I) has in its main diagonal successively in certain multiplicities the distinct eigenvalues
+ 1, - 1,
Al,
A;’,
. . . , A r , A;’
We call any matrix commuting with U an w-matrix. An w-matrix consists of blocks whose sizes equal the multiplicities of the respective eigenvalues. We call blocks belonging to eigenvalues AZ and A;’ ( 3 = 1 , . . . ,r ) “conjugate blocks.” By using the proof of Clifford’s theorem on page 5 in [7], we find Lemma 1. The subgroup of AZ generated by U, V,ul,uzhas unitary representations which can be put into monomial form, with U, V, in diagonal form. The permutation associated with the monomial matrix u2 exchanges conjugate blocks in any w-matrix. Relations (lo), (6), (7) show that (11)
x-’ux= v =
u 1
U-Iul = u1u2Uu;’u;’.
Therefore xu1u2
=
R
is an w-matrix, and by using (l),(9,(6) we find
(Xu,)’= (OU2)’ = O 2= 1. (13) These remarks prove Lemma 2. If U has simple eigenvalues in a unitary representation of A2, then X’ commutes with U (since R is diagonal) and therefore (10) shows that Y X Y = X and that the representation cannot be faithful. We can sharpen Lemma 2. Let el and 5’ be the permutations associated with the monomial matrices representing u1and uz.Then we have Theorem 3. Let Il be the permutation group generated by Sl, Sz and by the permutations rr which exchange only eigenvalues of U which have the same value. If Il is imprimitive with the eigenspaces of U as a system of imprimitivity, then the unitary representation of A2 is not faithful. It should be noted that the permutations 7~ together with 6’ certainly generate an imprimitive group which satisfies the conditions of Theorem 3 and that el commutes with E2.That is, 6’is given and Sl is not entirely arbitrary.
Representations of automorphism groups
259
Proof. We show that a system of imprimitivity of II is also a system of imprimitivity for the group generated by X and Y. To see this, we observe that R is of order 2. Therefore the group ring generated by the unitary matrices v,,vz,R can also be generated by diagonal matrices and the matrices in Z7. Therefore X is imprimitive if and only if II has this property. But if a power of X commutes with U, we have, as in Lemma 2, a relation
X"
=
(XYy
which cannot hold in a faithful representation since, in Az, X and Y freely generate a free group. Acknowledgment Work on this paper was supported in part by a grant of the U.S. National Science Foundation. References [ l ] G. Baumslag, Automorphism groups of residually finite groups, Proc. London Math. Soc. 38 (1963) 117-118. [2] E. Grossman, On the residual finiteness of certain mapping class groups, J. London Math. Soc. (2) 9 (1974) 160-164. [3] W. Magnus, A. Karrass and D. Solitar, Combinatorial group theory (Dover Publications, 1976) pp. 162-165. [4] A.1. Malcev, On the faithful representation of infinite groups by matrices, Amer. Math. Soc. Translations, Ser. 2, Vol. 45 (1965) 1-18. [5] A. Selberg, O n discontinuous groups in higher dimensional symmetric spaces, Int. Colloq. Function Theory, Tata Inst. Fundamental Res., Bombay (1964) pp. 147-164. [6] D. Stork, Structure and applications of Schreier coset graphs, Comm. Pure Appl. Mat. 24 (1971) 797-805. [7] B.A.F. Wehrfritz, Infinite linear groups (Springer Verlag, New York, 1973).
S.I. Adian, W.W. Boone, G. Higman, eds., Word Problems I1
@ North-Holland Publishing Company (1980) 261-295
ON REDUCIBLE BRAIDS James McCOOL* a p t . of Mathematics, Uniwrsity of Toronto, Toronto, Canada
11. Introduction The Braid Group B. ( n 3 1 ) may be defined as t h e abstract group o n the generators vl, uz,, . . , C T , - ~ with defining relations cr,q = up,
(1.1)
if I i
-
j
1 3 2, I s i, j
sn
-I
and ~r,a,+~rr, = U,+~U,U,+~,1 s i
(1 - 2 )
S
n -2.
We denote by 93 the set of all ordered pairs ( p , r ) , where r is a and positive integer and p is a word on the generators ul,a 2 .... , their inverses, and we call 93 t h e set of (algebraic) braids. In [ I ] J. Birman has defined a braid (p, r ) to be reducible if p is conjugate in B, to an element of the form ya;ll, where y is a word on u *I I , . . . , a?!2,and has asked how one can recognise, given a braid (p, r ) , whether or not it is reducible. The interest of this question stems from a result of Markov, which says that in order to provide an algorithm to determine, given any two links, whether or not they are combinatorially equivalent, it is sufficient to describe an algorithm to determine, given any two algebraic braids, whether or not they are combinatorially equivalent; here braids (p, r ) and ( P ’ , r ’ ) are said to be combinatorially equiualenr if there is a sequence of braids (p, r ) = (PI,rl), (&, r2),. . . ,(&, r.) = (p’, r’), such that each braid in the sequence can be obtained from its predecessor by applying one of t h e following operations: A,
replace ( P , r ) by ( y , r ) if p and y represent conjugate elements of
B“ A,
replace (p, r ) by (pa:’, r + l), or, if p = pa:!l in B, where p is a word on a?, . . . ,a,f12,replace (p, r ) by ( p , r - 1). * Research supported by a grant from the National Research Council of Canada. 261
J. McCool
262
A full account of Markov’s result is given in [l]. The restricted question of finding an algorithm to determine when two given braids (p, r ) and (7, r ) are equivalent under A, operations, i.e. the conjugacy problem for B, has been solved by Garside [3]. The investigation of reducible braids would seem to be a natural next step in the study of the general problem of combinatorial equivalence of braids, since to say a braid ( p , r ) is reducible is to say that it is joined to a braid ( y , r - 1) by a single application of operation A2 (preceded by an application of A,). The aim of this paper is to describe an algorithm to determine, given a braid (p, r ) , whether or not (p, r ) is reducible. In addition, we shall show that if (p, r ) is reducible to (y, r - l), i.e. if p’ = yu:!, in B, for some conjugate p’ of p, then y must belong to a determinable, finite set of conjugacy classes of B,-,. 811. Statement of results
In order to state our results precisely, we begin with a brief description of some of the results obtained by Garside [3]. A word W on u?,. . . ,u:!, is said to be positive if it is a word on u,,. . . , u,-~only, i.e. no inverse of any generator occurs in W. Of particular interest are the positive words and
A,
~
A,
As-IAs-2*
1
~ + 2U, ’
* *
A,,
(where = denotes equality of words). When we are considering B,, the words An-, and A. will usually be denoted by A and A, respectively. A positive word P is said to be prime to A if there does not exist a positive word Q such that P = A Q in B,. Garside showed that each element y of B. has an expression of the form y = A ‘ P for some integer k and positive word P prime to A. Moreover, if A ‘ l P I is another such expression for y, then k = kl,and so P = PI.The integer k is called the power of y, and such an expression is called a standard form for y. For y E B. we denote by (e, ( y ) the set of conjugates of y of power i . Garside proved that %,(y) is finite for each i, that there is a maximum value of i, called the summit power of y, such that % , ( y ) is non-empty, and that elements p, y of B. are conjugate if, and only if, they have the same summit power t and (e, ( y ) = %?, ( p ) . Garside also provided an algorithm to determine, given y E B,, the summit set (e, ( y ) of y.
On reducible braids
263
We now observe that if r, s are integers such that 1 S r S s S n - 1 and B,,,is the subgroup of B, generated by the set {u,, . .,us}, then it follows easily from the usual geometric definition of B , that the map uI+ u,,uz+ u,+~, . . . ,us-,+l+ usextends to an isomorphism from B,-,+2 to B,,,(this may also be established algebraically by rioting that the case for was proved by Chow [2], and then using Garside’s automorphism 22 of B,, defined in I11 below, to derive the general case from this). This observation enables us to identify the groups B z , .. . ,B,-z with the subgroups B1.l,. . . , B l , n - of 2 B,, respectively. We say that an element y of B, is positively reducible to y ’ ~ ” if- ~ Our main result is there is a conjugate y ’ of y such that y ’ ~ , , -E~ Theorem A. Let y E B, be positively reducible to Y ’ U , , - ~ ,and let y have ) C,(y) summit power t. Then t C - 1 and there is y l E V,-z(y) U % , - ~ ( yU such that ylun-lE B.-l, and ylun-I is conjugate in to y’~,-~. Now given y E B, it is easy to see, by using Garside’s results and Theorem 2-7 of [l], that the finite set Vf-z(y) U V,-,(y) U V,(y) can be computed, and that it can be determined if any element y l of this set is such that Y ~ U , ,E - ~Bn-l.Thus, given y E B,, it can be determined if y is positively reducible; moreover, there is only a finite set of conjugacy classes of containing an element of the form Y ’ U , - ~ , where y ’ is a conjugate of y, and this set can be determined. for some conjugate y ’= P y P - ’ If y E B. is such that y’u;!,E of y, we say that y is negatively reducible to y ’ u ; ! , . Now PyP-’a;ll E if, and only if, (u,-l~)y-’(u,-l~)-’u.-lEi.e. y is negatively reducible to y ’ u ; ! , if, and only if, y-’ is positively reducible to (y’a;ll)-l. Now obviously y is reducible if, and only if, y is either positively reducible or negatively reducible. We thus have Corollary B. Let y E B,. Then it can be determined i f y is reducible. If y is reducible and V is the set of conjugacy classes in B.-I which contain elements to which y is reducible, then V is finite, and can be determined.
OIII. Notation and preliminary results Let U, V be words on u?,.. .,u:!,.We write U = V if U and V are equal as words, U = V if U and V represent the same element of B., U V if U and V represent conjugate elements of B.. If P = XIX2 X, is any word, where each Xi is a generator or the
-
264
1.Mecool
inverse of a generator, then by the reverse of P, revP, is meant the word X,Xl-l* * * XI. We have rev QP = rev P rev Q. (1 Q i d n - l), extends to an We note that the mapping u,+ automorphism 52 of B. of order two. We shall write X for 9 X when this notation is more convenient. with defining Let S . be the abstract semigroup on uI, u2,.. . , relations (1.1) and (1.2). We write X = Y to mean X and Y are .. We note that if positive words which represent the same element of S X = Y then rev X = rev Y . If X = U V we say that X begins with the positive word U,and ends with the positive word V . The diagram D ( W ) of the positive word W is defined to be the set of positive words which can be obtained from W by a succession of applications of the operations replace a subword u,u,by up,, if 1 i - j
(3.1)
1 3 2,
replace a subword u,u,u,by u,u,u,,if 1 i - j
(3.2)
I = 1.
We note that D ( W )is finite, and consists of all positive words which are equal to W in S .. We have the following results, due to Garside [3].
Theorem 1. S. is embedded in B. under the identity map on the generating set ul,. .. ,
x
Thus, for positive words X and Y, we have X = Y if, and only if, =
Y.
Lemma 2. Let X, Y be positive words in B,. If uix= U k Y , then (a) X = Y i f k = i. (b) x = u k z , Y = uiZ, for some 2,if I k - i 1 zz 2. (c) x = UkqZ, Y = uiukz, for some Z , if I k - i 1 = 1. If Xui = Y u k , then (d) X = Y if k = i. = Zo,, for some Z , i f 1 k - i 1 3 2. (e) X = zuk, (f) x = z u i v k , Y = zukui, for some Z , if I k - i I = 1. Lemma 3. In B., u,A, = Atu,-l
if 1 < s
a&, = A,U,+~
if 1 Q s < t c n - 1..
ss t d
n - 1,
Lemma 4. Let XI,Xz,. . . , X I be generators of B, which commute with ui, and such that no Xiis ui. We have
On reducible braids
265
(a) Zf uiP = X l X 2 . XfQ, then Q begins with ui. (b) Zf Pai = Q X , X , * X f , then Q ends with a.
--
Lemma 5. (a) Zf ui+lP= AiQ in B., then Q begins with U ~ + ~ U , . (b) Zf Pai+l= Q(rev Ai), then Q ends with uiui+l. Lemma 6. In B., (a) XA = Ag, AX = X A , for any word X . (b) A = = rev A . (c) A begins and ends with ui,for each i E {1,2,. . . ,n - 1). (d) Zf r s n - 1 and W is a positive word which begins (ends) with ui, for each i E {1,2,. . . ,r } , then W begins (ends) with A,+l. (e) Zf W = A V and W = PQ, then, for each i E {1,2, .-..,n - l}, either P ends with ui or Q begins with ui. (f) Zf A = IF, then for each i E {1,2, . . .,n - l}, either Z ends with ui or F begins with a,but not both.
a
The next result, due to Birman [l], is of great importance for our work.
Theorem 7. Let /3 = A “ P be in standard form in B.. Zf /3 is not in summit form (i.e. m is less than the summit power of p ) then there are positive words Z, F, R with A = ZF, such that (a) P = FRZ if m is even, (b) P = FRf if m is odd. We shall also need the following result from [l].
Lemma 8. Let A = ZF. Then A = Ff = fp = FZ. We now introduce two notational conventions which facilitate the statements of the results to follow. Let i, j be non-negative integers. We define a ( i ;j ) to be the product aiui+l - .ujif 1 d i d j d n - 1, and to be the identity element of B. otherwise; p ( j ; i ) is defined to be the product U , U ~ - ~ * al if 1d i d j d n - 1, and to be the identity element otherwise. We shall write Q = Q ( r , s) to indicate that the word Q involves (at most) the generators a, .. ,a,,where 1 d r d s d n - 1. In a context where we have established that Q = Q ( r , s), and i is a positive integer such that i + s - r d n - 1, we shall denote by Q(i, i + s - r ) the ..,a, by ui, . . ,ai+,-, word obtained from Q by replacing a, a,+l,.
-
--
266
J. McCool
respectively, and by Q(i + s - r, i ) the word obtained from 0 by replacing u,,u,+~, .. . , us by a,+,-, . . ,ui respectively. For example, if Q = Q(r, s) then 6 may be written as Q ( n - r, n - s), and if s < n - 1 then we may write AQ(r, s) = Q ( r + 1, s + l ) A . Finally, in this section, we note that, in view of the isomorphism the results listed above may be exhibited earlier from Bs-,+2to B,.,, applied to the subgroup B,.s. To give just one example, we have An-2(1, n - 3), and from (a) of Lemma 6 we have u i A , - 3 ( 2 , n - 3 ) = A , - 3 ( 2 , n - 3 ) u , - i - 1 , if 2 S i S n - 3 .
OIV. Further preliminary results We now obtain a number of results which are required for the proof of Theorem A. Lemma 9. Let X , Y be positive words in B, with X = Y. Then a generator ui occurs in X if, and only if, it occurs in Y.
Proof. This follows immediately from the definition of D ( X ) . Lemma 10. Let i, r be integers with 1 s i s r s n - 2. We have (a) If P a ( i ; r ) = Qu,,, in B,, then P = Z a ( i ; r + l), for some Z. (b) If P ( r ; i ) P = C,+~Q in B,, then P = P ( r + 1 ; i ) Z , for some Z. (c) I f P P ( r + l ; i + l ) = Q Q ai,n & , t h e n P = Z p ( r + l ; i ) , f o r s o m e Z . (d) If a ( i + 1; r + l ) P = uiQ in B., then P = a(i, r + 1)Z, for some Z.
Proof. We note that (b), (d) follows from (a), (c) respectively using the fact that X = Y implies rev X = rev Y, while (c) follows from (a) by applying the automorphism 92. Thus it is only necessary to prove (a). We note that (a) is true if r = i, by part (f) of Lemma 2. We suppose that (a) holds for i, r as stated, with r < n - 3, and that P l a ( i ;r + 1 ) = Qlu,+z. Applying (f) of Lemma 2 again, we see that P l a ( i ;r) = Qzu,+lu,+z and QI= QZu,+2u,+l, for some Q2. Hence, by (b) of Lemma 4, we have PI= P2ur+2for some Pz, and we then see that P z a ( i ;r) = Qzu,+l.It now follows, from our assumption that t h e result holds for i, r, that P2 = Z a ( i , r + 1) for some Z, so that PI= Za ( i ;r + 2), as required. Lemma 11. Let i,j,r be integers with 1 s i sj s r s n - 1. (a) If P a ( i ; r ) = Qui, and j < r, then P ends in uj+,. (b) If P p ( r ; i ) = Qui, and j > i, then P ends in uj-l.
On reducible braids
267
(c) If a ( i ;r ) P = a,Q, and j > i, then P begins with a,-1. (d) If P ( r ; i ) P = a@, and j < r, then P begins with q + , . Proof. As in Lemma 10, we need only prove part (a). By Lemma 4, we have Q = Q , a ( j + 2; r ) for some Q,, so that P a ( i ;j + 1 ) = Qlaj.Hence, by Lemma 2, P a ( i ;j ) = PIu,+,a, for some P I , and so P a ( i ;j - 1 ) = as Plaj+l. Applying Lemma 4 again, we see that P ends in required.
Corollary 12. Let W be a positive word and r a non-negative integer. (a) If W A { a A } ' ends in a, and 1 s i < n - 1 , then W ends in a,+,. (b) If W{AA}' ends in a, and 1 S i < n - 1, then W ends in mi. Proof. These are immediate consequences of Lemma 11.
We note, comparing with Lemma 10, that each part of Corollary 12 has three variants obtained using t h e operations rev and 2.We leave it to the reader to state these variants. Lemma 13. Let U, V , W be positive words with W = UV. Suppose that U ends only in an-,,V = V ( l ,n - 2) and W ends in a,, for some i E {1,2,. . . , n - 2). Then V ends in a,. Proof. We must clearly have L ( V )3 1, where L ( V ) is t h e length of the word V. Now if each generator in V is distinct from, and commutes with t h e generator a,,then by Lemma 4, U ends in a,. Since this is not the case, we must have either V ends in a,, or V = V,a,+rV2, where E = 5 1 and each generator in V 2is distinct from and commutes with a$. In the latter case, we have W = UV,a,+.V2=Ka, say, and by Lemma 4 we'see that K = K1V2for some K , , so that UVlal+r= Kla,. We then have, by Lemma 2, that UV, = K 2 a l + r a gfor , some K 2 . Now if L ( V )= 1, then V , = 1 and U must end in a,, which is a contradiction. Hence L ( V )> 1 and we have, inductively, that V , ends in a,, V ,= V3a,say. We now have UV3= K2u,+.,and i + E f n - 1 , since a,+,occurs in V. Hence, inductively, V 3must end in a,+E, V3= V4u,+,say. We then have V = V , C T # +=~V4u,+,a,u,+.V2 V~ = V4a,u,+,V2u,, as required.
We leave it to the reader to state t h e three variants of Lemma 13 obtained using rev and 2.
J. McGwl
268
Lemma 14. Let W be a subword of {AA}", m > 0. Then D ( W )= W. Proof. This follows immediately from the definition of D ( W ) . As a consequence of Lemma 14, we note that A{AA}' (r PO) begins only with uland ends only with
Lemma 15. Let X , Y be positive words in B.. (a) Z f X Y = a ( 2 ; n - l ) a ( l ; n - 2 ) , then forsome k,j with Ocjs n - 2 , l c k c n - 1 and k - j a l , wehaue Y = a ( k + 1 ; n - l ) a ( j+ 1 ; n -2). X r a ( 2 ;k ) a ( l ; j ) , (b) If X Y = P(n - 2; 1)P(n - 1 ; 2), then for some k , j with 2 s k d n, l s j c n - 1 and k - j P 1 , wehaue
X = P(n - 2 ; j ) p ( n - 1 ; k ) ,
Y = P(j
- 1 ; 1)P(k - 1;2).
Proof. W e need only prove part (a). W e let 9 be the set of words of the form UIV 1U 2 V 2 . . UsVs,where s d n - 2 and
-
U 1 U 2 . . U,= ( ~ ( 2n; - l ) ,
-
V l V 2 . . V,E ( ~ ( 1n; - 2),
V, = a(pj ;pj+i - l ) , for l s i d s , with 2 = A 1 < A 2 < - . . < A , + l = n, 1 = p l < p 2 < - . . < p s + I = n - 1, and Ui
(Ai
;A i + l - l ) ,
( i = 1,2,. . . ,s). - (pi+I - 1)5 2 (4.1) W e claim that Ed is the diagram of W = a ( 2 ; n - l ) a ( l ;n - 2). W e note firstly, from (4.1), that UiV,= V,Ui whenever 1 s j d i - 1. Thus, if UIVl UsVsis in 9, then
-
u,v,*.* u,vs= u,u2..*usv,v2**. v,= w, so that Ed C D ( W ) . Now if 2 E 9 it clear that no operation of type
(3.1) can be applied t o 2,while any application of (3.2) t o 2 will result in an element of 9. It follows that Ed = D ( W ) . Hence if W = XU, then there is an element U 1V, U,V, of Ed such that X Y = U 1VI * . U,V,, and the result follows easily from this.
--
Lemma 16. Let P be a positive word in B., with P prime to A. Then Pun-lhas power zero or one, and the latter occurs if, and only if, P ends in a ( 2 ; n - l)Am-l. Proof. Suppose Pun-lhas power at least one, Pun-1= SA say. Then, by
On reducible braids
269
(e) of Lemma 6, P ends in each of a], u2,.. . , so that by (d) of Lemma 6, P ends in Am-l, P = QA.-, say. Thus QA.-lun-l = SA, and using Lemma 8, we have QA.-lan-l = Sa(2; n - l)Am-la,,-l, so that Q = S a ( 2 ; n - 1). Hence P ends in a ( 2 ; n - l)A,,-], as required. = ZA. Now Z cannot Conversly, if P = Zcu(2; n - l)An-l then Pan-1 contain A, since P is prime to A, and therefore Pa,,-1 has power one. This proves the Lemma. Lemma 17. Let Q be a positive word, and r a non-negative integer. We have : (a) {Aa}'AQ contains A if, and only if, Q begins with (b) {aA}'aQ contains A if, and only if, Q begins with Proof. Part (b) follows easily from (a) and the observation that a positive word P contains A if, and only if, P contains A. To prove (a), we observe firstly that if Q begins with then so that {Aa}'AQ contains A. Q = A,-IQI say, and A = Conversely, suppose that W = { A a } ' A Q contains A, so that W begins with each of a2, ma,.. . , Using (a variant of) Corollary 12, we see that Q must begin with each of u I , a,..., 2 and so Q begins with A n - I . Lemma 18. Let V be a positive word such that some generator a,does not occur in V. Then A 'V has summit power r, for each integer r. Proof. We know, by Lemma 9, that V cannot contain A, so that A ' V is in standard form. Suppose that A ' V has summit power greater than r. Then, by Theorem 7 there exist positive words Z, F, Y with A = IF and V = FYI*, where I*= Z if r is even, I*= 1 if r is odd. Since ai does not occur in V, it does not occur in F or I*. However, we have A = Z*F if r is even, and, by Lemma 8, A = FZ* if r is odd. Since a, occurs in A, we obtain a contradiction from Lemma 9. Lemma 19. Suppose that WV(2, n - 1) = U p ( s ;1) in B., where 0 S s S n - 1, and let r be the greatest integer such that W ends in P ( r ; l), W = W l p ( r ; 1) say. Then there exists I 3 0 , with s + I s r, and positive words Hi = Hi (2, s + i - l), Ki = Ki (s + i + 1, n - l), 1 G i s 1 + 1, such that ,
and U
=
V 7 H i + i K i + i ~ ~ + r H i K i ~* ~ * *+ Hr -~iK N , + I H ~ K I
Wlp(r; s + 1 + I ) H ; + I K i + i a ~ + i - i H ; K i ~ * + i - ~* *(HX2aSa,+iHIKi, ~~+i-i.
where H : = Hi (1, s + i - 2), and Hi= 1 if 2 > s + i - 1.
270
J. McCool
Proof. If s = 0 then we may take 1 = 0, HI= 1 and K , = V. Thus we may assume that s 3 1. We consider firstly the case that some word in D ( V ) is of t h e form Hl(2,s ) K , ( s + 2, n - 1). It then follows from Lemma 4 that U = UIKl for some U , , so that WH, = UIP(s;1). Now applying Lemma ll(b), we see that U , = TH,(l, s - l), for some T. Hence W = TP(s;l), and the result holds with 1 = 0. We now suppose that n o element of D ( V ) is of the form H I K I ,with H , K , as above. It follows that there must be a word in D ( V ) of the form Vl(2, n - 1)uS+,Hl(2, s)Kl(s + 2, n - 1). As in the previous case, we then see that U = UIHIKI,for some Ul, so that w V ~ u , +=, UIP(s;1). From Lemma 5(b) we now obtain UI= U2u,us+l, for some U2, SO that U , p ( s ;1) = Uzusus+lp(s; 1) = U 2 p ( s+ 1; 1)us+,, and therefore WV, = U , p ( s+ 1; 1). Since L ( V I )< L ( V ) an inductive argument now completes the proof of the Lemma.
Corollary 20. Suppose that /3 ( r ;1) V(2, n - 1) = U p(s ; l), where 1 s r, s s n - 1. Then s c r and A-'U has summit power - 1. Proof. From t h e Lemma we have s
+ 1S r
and
U = P ( r ; s + 1 + l)H~+lKl+lus+~-lus+l . . . H;K2usus+lHIK1
+ 1 + l)H~+lus+l-lus+lHi' - . H ~ ~ , u , + , H ~ K l +. IKK,l = H;+lus+l-lH H~ ; uu, H s$+( r,; ~s 2 + ~l)Kl+lKl ~ K,. =P(r;s
*
*
* *
Now writing Z, = H ~ + l u ~ + l - l * .H H;u,H:, ~ u s +and l-2~ Z, = P ( r ; s + l)Kl+,K1 K1, we have U = ZIZ2.Noting that ZI= Zl(l,n - 2), Z2= Z2(2,n - l), since 1 c r, s < n - 1 and s + 1 r, we see that Z22, is a positive word on u,,. . . ,u,,-,. Since A-'Z2gIis conjugate to A-IU, it follows from Lemma 18 that A-'U has summit power - 1.
--
We pause to observe that B. is the trivial group if n = 1, and is infinite cyclic if n = 2. From this, it is easy to see directly that Theorem A holds if n = 2. We shall therefore assume, in what follows, that n 3 3.
Lemma 21. Let r be a positive integer. Then (a) A:-, is in standard form in B.. (b) A -*r+'{AA}r-'Ais the standard form of A ;?',+I in B.. (c) A -,'{AA}' is the standard form of A i?', in B..
O n reducible braids
271
Proof. Part (a) follows immediately from Lemma 9. Now if k is a positive integer, it is easy to prove, by induction on k, that i-k-1
W'(A)]
and therefore
Now IIP-k-192i(A)is prime to A, by Lemma 17, and this prove? the result. Lemma 22. Let P be a positive word in B. which is prime to A, and let r be a positive integer. Then: does not occur in P. (a) P E Bn-l if, and only if, (b) A'P does not belong to B,-l. (c) A-"P E B,-l if, and only if, P = {AA}'Q, where Q is a positive word in B,-l which is prime to Moreover, if P has this form, then A,?lQ is a standard form of A-"P in B,,-l. (d) A-"+'P E B,-, if,. and only if, P = {AA}'-'AQ, where Q is as in (c). Moreover, if P has this form then A,?,+;"Ois a standard form of A-''+'P in Bn-l. Proof. Suppose that, for some integers k and s, we have A *P = AL1Q, where Q is as in (c). If not both k < O and s < O hold, then it is easy to obtain a contradiction, using Lemma 9 and the fact that P is prime to A. In particular, this establishes (a) and (b). Now suppose that A 'P = A i-lQ, with k,s < 0. Then A *P = A'{IIPe-,-, Bi(A)}Q, and from Lemma 17 it follows that the right-side of this equality is in standard form. Hence we must have P = {l-IP=-s-l wi(A)}Q, as required. Conversely, if P has this form then A 'P = A i-IQ. This proves the Lemma.
#V. Proof of Theorem A The theorem is an immediate consequence of the following two Lemmas. Lemma 23. Suppose that A-"-'M is in standard form in B. ( r b O ) , and A-2'-1Mun-l E B.-l. Let t be the summit power of A-"-'M. We have: (a) t = - 1 if r = 0.
272
J. McCool
(b) Zf r 2 1 then either t s - 2r + 1 , or there is a conjugate A -2rM1of A-"-'M such that A-"M, is in standard form, A ~2'Mlu,-l E En-,and A -Z'Mlun-l is conjugate to A-2'-1Mun-lin En-'. Lemma 24. Suppose that A-"M is in standard form in B, ( r 3 l), and A MU^-^ E En-,. Let t be the summit power of A-"M. W e have: (a) t s - 1 i f r = l . (b) If r 3 2 then either t s - 2r + 2, or there is a conjugate A -'"'MI of A-"M such that A-2"'MI is in standard form, A-2'+'MIun-l E En-' and A ~ 2 r + 1 M l uisn nconjugate -l to A -2rMun-lin
In order to prove Lemma 23, we shall require the following two results. Lemma 25. Let p = A -2rA{AA}'-' V ( 2 ,n - 1 ) be in standard form in B. ( r 3 l ) , and suppose that p has summit power t > - 2r. Then (a) t = - 1 i f r = l . (b) Zf r 3 2 we have V ( 2 ,n - 1 ) = X ( 3 , n - 1)V1(2,n - 1 ) Y ( 2 ,n - 2 ) for some X , Y , V, with Y ( 2 , n - 2 ) X ( 2 , n - 2) = An-,(2, n - 2). Also, p A -2r+1A{AA}'-2ul V , ( 2 ,n - l ) , and the latter word has power -2r+1.
-
Lemma 26. Let 4 = A -2r+1A {AA}'-'ul V l ( 2 ,n - 1 ) be in standard form in B. ( r 3 2 ) , and suppose that 4 has summit vower t > - 2r + 1. Then V 1 ( 2 n, - 1 ) ends with a ( 2 ; n - 1).
The proofs of Lemmas 25 and 26 will be given after that of Lemma 23. Similarly, the following two results will be required for the proof of Lemma 24, and their proofs will be given after that of Lemma 24. Lemma 27. Let p = A -2r+1{AA }'-I V ( 2 ,n - 1 ) be in standard form in B, ( r 3 2), and suppose that p has summit power t > - 2r + 1. Then V ( 2 ,n - 1 ) = X ( 2 , n - 2 ) V l ( 2 ,n - 1 ) Y ( 2 ,n - 2 ) for some X , Y,V , with V l ( l ,n - 2), and the P X = A,-,(2, n - 2). Also, p A-2r+2{AA}r-2un-l latter word has power - 2r + 2.
-
Lemma 28. Let 4 = A -2r"{~A}r-2un-I V l ( l ,n - 2) be in standard form in B. ( r 3 2), and suppose that 4 has summit power t > - 2r + 2. Then V , ( l ,n - 2) ends with ( ~ ( 1n; - 2), and t = - 1 if r = 2.
We now give the proofs of these Lemmas.
On reducible braids
273
Proof of Lemma 23. Suppose that A-"-'M is in standard form in B. (r 3 0), and that A-2'-1Mun-iE Z3,-,. We suppose, moreover, that A-2'-1M has summit power t > - 2r - 1. The plan of t h e proof is as follows. We begin by establishing that A-2'-'Man-, has power - 2r - 1. We then have, by Lemma 22, Since Man-,= {Aa}'AQ(l, n - 2), for some 0 which is prime to t > - 2r - 1 we also have, by Theorem 7, M = FRf for some F, R, Z with IF = A, so that FRfan-, = {Aa}'AQ(l, n - 2). We are then able, using the results of t h e previous sections, to analyse the contributions that Q(1, n - 2) makes to F and f respectively, and this in turn gives us a description of R. Now A -2r-1M = A -"-'FRf A -2rR, and the description obtained for R enables us to complete t h e proof. As was noted above, since t > - 2r - 1 we have M = FRf, where ZF = A. Also, A -"-'FRf A -"R, and A -"R has power - 2r, since R is prime to A. We note, by Lemma 16, that A-"-'Man-, has power - 2r - 1 or - 2r. We suppose that its power is - 2r, and consider firstly E Bn-,, we then have, by Lemma 22, the case r > 0. Since A -2'-1Mun-l that FRfu,-, = A{aA}'Q(I,n - 2), where Q is prime to A,-,. Hence, by Lemma 3, FRfu,-, = AQ(1, n - 2){AA}'. However, since A ends in u,-,, this would imply that FRf contains A, which is not the case. Thus A-2'-1Mu,-, has power - 2r - 1 if r > O . Now suppose r = 0 and A-'FRfun-, has power zero. From Lemma 16 it follows that F R f = Sa(2; n - l)A,-,, so that, in particular, FRf ends with each of u,, uz,. . ., It follows that f cannot end with u,-,, since otherwise, by Lemma 6, FRf would contain A. Now A = IF, so that, by Lemma 6, F must begin with uI. We have A-'FRfan-, = Q(1, n - 2), where Q is a positive word, and therefore
-
-
FRfun-,= AQ(1, n - 2)- a(1; n - l)An-lQ(l; n - 2) = A.-,a(l;
n - l)Q(l,n - 2)
= A,-,Q(2,
fl - l)Cy(l; n - l),
so that FRf = A,-,Q(2, n - 1)a(1; n - 2). Now An-l begins with each of a 2 ,u3,. . . , un-,, so that FRf begins with each of a l , u2,. . . , un-'and hence contains A. This contradicts t h e fact that M is prime to A. Thus we have established that A-2'-'Man-i has power - 2r - 1. Since A - 2 ' - 1 F R f ~ nE - , B,-,and has power - 2r - 1, we have, by Lemma 22, that FRfu,-, = {Aa}'AQ(l, n - 2), where Q is prime to An-,. Hence FRfun-, = {Aa}'Q(2, n - 1)A, so that (5.1)
FRf
= {AA}'Q(2,
)t
- l ) a ( l ; n - 2).
We now observe that if FRf begins with some a, (i > I ) , then, by (a
1. McCool
274
variant of) Corollary 12, O(2,n - l)a(l; n - 2) must begin with ui, and hence, by (a variant of) Lemma 13, O(2, n - 1) must begin with ui. Also, if FRf ends with some a,(i C n - 2) then, by Lemma 11, {AA}' must end with ui+l,so that, by Lemma 13, O(2, n - 1) must end with ui+t.
We now write F = FIF2,where FI= F1(2,n - 1) and F2 does not begin with any of u2,u3,. . .,u.-~.We also write Z = Ill2,where Z2 = Z2(3,n - 1) and I, does not end with any of u3, u4,. . . ,u.-~.From the observations of the preceding paragraph we see, firstly, that Q(2, n - 1) = F101(2, n - 1) for some O1= Q42, n - l), and secondly, since f2= Z2(n - 3, l), that O1= Q2(2,n - 1)Z2(n- 2,2) for some O2= Q2(2, n - 1). Thus we have
O(2, n - 1) = F1(2,n - 1)02(2,n - l)Z2(n- 2,2). We can now write (5.1) as
F2Rfl= {AA}'Q2(2, n - 1)a(1; n - 2).
(5.2)
In order to 'solve' this equation for R, we seek descriptions of F2 and I,. We begin with F2. We have
fp1p2= UA,-lp2 U,since fpl ends with each of
A for some ends with
A
= ZFIF2 =
Hence
ul,u2,.. . , c,,-~, and therefore
An-IF2U7 A.-IP(n - 1; l), . . A
7
so that p20= P ( n - 1; l), and therefore we have F2 = a(1; I) and U = a(I + 1; n - l), for some I with 0 d 1 d n - 1. We now consider ZI. Since Z, does not end with any of u3, u4,.. .,u , , - ~we , know that either (a) f, ends with both and u,,-~, or (b) 1, ends only with c,,-~, or (c) ZI = 1, or (d) f, ends only with u.-~.We shall consider each of these four cases separately, and show that the Lemma holds in each case.
f, ends with both and u,,-~. We have A = Z,Z2F= f,f# = flAn-zX
Case ( a ) .
for some X, since
CT,,-~
f2pbegins with each
of uI,uz,. . .,
Hence
A = XflA,,-2=a(1; n - 1)a(1; n - 2)A.-2, so that XI, = a(1;n - l)a(l; n - 2). Since f, ends in both we have 1, = Vu,,-2u.-lu,,-zfor some V, so that
and
On reducible braids
275
Xfl = X ~ u , - ~ u . - ~ =u ,a(1; - ~ n - l)a(1; n - 2) = a(2; n - 2)a(1;
n -2)~.-~,
and therefore X V = a(2; n - 2)a(1; n - 3). Applying Lemma 15(a) (with n - 1 in place of n ) we obtain R = a ( 2 ; k ) a ( l ; j ) and V = a ( k + l ; n - 2 ) a ( j + l ; n - 3 ) f o r s o m e k,j with O S j s n - 3 , 1s k n - 2 and k - j 3 1. Hence i,= a ( k + 1; n - 2)aG + 1; n - l)un-2. Before using the descriptions obtained for F2 and I, to study (5.2), we obtain a description of 12(3,n - 1)F1(2,n - 1). This will be used firstly to show that 1 = j , and then will be used again later in the proof of the Lemma for case (a). We have
A = Z2FIF2f, = X f l A , - 2 = d.-2a(2; k ) a ( 1 ;j)!,, so that
(5.3)
143, n - 1)F1(2,n - l ) a ( l ; I ) = A,-2(3, n - l)a(2; k ) a ( l ; j ) ,
= A,-2(n - 1,3) = A,-,(3, n - 1). We wish to show that 1 = j since and that
(5.4)
Z2(3,n - 1)F1(2,n - 1) = A,-2(3, n - 1)a(2; k).
Suppose, firstly, that 1 = 0. Then u Idoes not occur in the left side of (5.3), and hence cannot occur on the right, so that j = 0. Now suppose that 12 1. Then u,occurs on the left side of (5.3), so it must occur on the right, and therefore j 3 1. We now apply (a variant of) Lemma 13 to (5.3) as follows. The word a(1; I ) on the left begins only with ul, while the word A,-,(3, n - l)a(2; k ) on the right does not involve ul; it now follows, from Lemma 13, that An-,(3,n - 1)a(2; k ) is a beginning subword of L(3, n - 1)F1(2,n - l), Z2(3,n - 1)F1(2,n - 1)= A,-2(3, n - l)a(2; k ) X say. Similarly, since a(1;j ) on the right begins only with ulrit follows that A.-,(3, n - l)a(2; k ) = Z2(3,n - 1)F1(2,n - 1)Y, for some Y. Hence YX = 1, and since YX is a positive word we have Y = 1, X = 1. This establishes (5.4), and 1 = j follows. We may now rewrite (5.2) as (5.5)
a ( l ; j ) R a ( k+ 1; n - 2)aO + 1; n - 1) = {A,4}'Q2(2,n
- l ) a ( l ;n - 3).
Now, since j c n - 3, the right side of (5.5) must end in u,-~, so that, by Lemma 4, {A,4}rQ2(2,n - 1) must end in un-l. Since AA ends only in ul, it follows, using Lemma 13, that Q2(2, n - 1) ends in gn-1, Q2(2,n - 1)= Q42, n - l)un-, say. We then have
. I McCwl .
276
a ( l ; j ) R a ( k+ 1 ; n - 2 ) a G
(5.6)
+ 1 ; n - 2)
= { A A}'Q3(2, n - 1)a( 1 ; n
- 3).
Since the right side of (5.6) must end in u ~ . .we ~ ,see, using Lemma 10, that { A A } ' Q 3 ( 2 ,n - 1 ) must end i n a(1;n - 2 ) . Now if r = 0 this is impossible, so we must have r 3 1 . Since { A A } ' Q , ( 2 , n - 1 ) must end in a(2; n - 2) we see, using Lemma 13, that Q42, n - 1) ends in a ( 2 ; n - 2), 0 4 2 , n - 1) = Q4(2, n - l)(r(2; n - 2) say. We then have { A A } ' Q 4 ( 2 ,n - l ) a ( 2 ;n - 2 ) = { A A } ' ~ ' A Q 4 ( 1 , f l - 2 ) p ( f l1-; 2 ) a ( l ; n - 2 ) ,
and (5.6) yields Ra(k+l;n-2) = a(j
+ 1; n - 1){,&4}r-'04(1, n - 2 ) P ( n - 2 ; 2 ) a ( 2 ;n - 2 ) a ( l ; j ) .
Since j s k - 1 we have a ( 2 ; n - 2 ) a ( 1 ;j ) = n ( 2 ; k ) a ( k + 1; n - 2 ) a ( l ; j ) = a ( 2 ;k ) a ( l ;j ) a ( k
+ 1 ; n - 2),
so that
R
=
00' + 1 ; n - J){AA}r-'Q4(l,n - 2 ) P ( n - 1 ; 2 ) a ( 2 ;k ) a ( l ;j ) .
We conclude the proof of the Lemma for case (a) by finding a conjugate A-''M, of A-"R which has t h e required properties. We have A-"R - A - 2 ' A { A A } ' - ' Q 4 ( 1 , n - 2 ) p ( n - 1 ; 2 ) a ( 2 ; k )
- A-"A{AA}'-'P(n - 1; n - k + l)Q4(n - 1 , 2 ) a ( l ;n - 2 ) 5
A -'.MI
say. NOW ~ - ~ ' ~ ~ c r , - ~ = ~ - ~ ~ { A ~ ) ~ p ( n - 2 ; n - k ) ~ , ( n - 2 , 1 ) = A i Y l p ( n - 2 ; n - k ) Q 4 ( n - 2, l), so that A-Z'Mlan-l E Bn-l. to A - 2 ' M l ~ m - l Next, we check that A - 2 ' - ' M 5 n -is ~ conjugate in We have
A--2'-'M~m = A,2'L1Q(1, -l n - 2) =
A iCL'Fl(1, n - 2)Q41, n - 2 ) a ( ;n - 2)Z2(n- 3 , l )
- A ,?';'p(n =
(using (5.4))
- 2; 1)Z2(2,n - 2 ) F l ( l
n - 2)Q4(1, n - 2 )
A i?L'p(n - 2 ; 1)A,-42, n - 2 ) . 1; k - 1)Q4(1, n - 2 )
277
On reducible braids
= A ,?,cy(l;
- A ;?,/3(n =
k - l)Q4(l, n - 2) - 2; n - k)Q4(1, n - 2)
A -Z'Mlu.-,,
and the two conjugacy relations above hold in J3-,. Finally we must show that A-"M has power -2r. Suppose not. Then MI must contain A, so that, by Lemma 17, P ( n - 1; n - k + l)Q,(n - 1,2)a(1; n - 2) must begin with An-,(n - 1,2). Applying Lemma 13, we then have that P ( n - 1; n - k + 1)Q4(n - 1,2) begins with A,-,(n - 1,2), a n d so P ( n - 2; n - k)Q4(n - 2 , l ) must contain An-,. This implies that a(1; k - 1)Q4(1,n - 2) contains A,-,. However, we have Q(2, n - 1) = F,(2, n - 1)Q4(2,n - 1)a(2; n - l)Z,(n - 2,2), and applying Lemma 13 to (5.4), we see that F,(2, n - 1) = Xa(2; k ) for s o m e X = X(2, n.- l), so that
Q(l, n - 2) = X(1, n - 2 ) a ( l ; k - 1)Q4(1, n - 2 ) a ( l ; n - 2)Z2(n - 3,l). Since Q(1, n - 2) is prime t o A m - , , so therefore must be a(1; k - l)Q4(l, n - 2). This contradiction proves t h e L e m m a for case (4. . Case (b). f, ends only with a,-1. W e have A = ZIZzF= I1An-, Y , for = a(1; n - l)Ao-l, and therefore 9 = some Y , so that A = PfIAn-, cy(l;s), f I = a ( s + l ; n - l ) , f o r s o m e s w i t h O s s s n - 2 . Since A = ZzFIFrf,= d , - , a ( l ; n - l), we now have -
12(3, n - 1)Fl(2,n
-
1
l ) a ( l ; I ) = A.-,(2, n - l ) a ( l ; s).
As in t h e previous case we now see, using L e m m a 13, that 1 = s and
(5.7)
Z2(3, n - 1)F1(2,n - 1) = An-,(2, n - 1) = A,-2(3, n
- l)a(2; n - l),
and applying Lemma 13 again, we have (5.8)
F,(2, n - 1) = W a ( 2 ;n - 1 )
for some W = W(3, n - 1). W e now consider (5.2), which may b e written c y ( I ; s ) R a ( s +1 ; n - 1 ) = { A ~ } r Q 2 ( 2 , n - l ) a ( l ; n - 2 ) . Now s G n - 2, so the left side above ends with urn-,, and from Lemma 10 it follows that {Aa}'Q2(2, n - 1) ends with a(1; n - 1). This is
J. McGwl
278
impossible if r = 0, so we must have I 3 1. We then see that Q2(2,n - 1) = Q3(2,n - l)a(2; n - l), and then we have R =a(s
+ I; n - I ) { , ~ A } ~ - ' Q ~n( I-, 2 ) ~ ( -n I; 2)a(2; n - 1)a(1; s).
We now find a conjugate A-"M1 of A-"R which has the required properties. We have A-"R
- A-"a(2;
n - l)A{AA}r-1Q3(l,n - 2)P(n - 1;2)
=A-2'A{aA}r-'a(l;n-2)Q,(1,n-2)p(n-1;2)
- d-2rA{AA}r-'p(n- 1;2)Q,(fl - 1,2)a(1;
?I2)
-
= A -''MI say, and A-2'Mla.-1 = A ,?'$(n We have, using (5.7),
- 2; l)Q3(n - 2,l).
A-2'-'M~n-I= Ai?'L'Q(l, ~t- 2) =A,?';-'Fl(l,n-2)Q3(1,n-2)a(l;n-2)Z2(n-3,1)
- A ;?;;-'P(n- 2; 1)Z2(2,n - 2)Fl(l,n - 2)Q3(l, n - 2) = A,?la(l; n
-A
,?'l@(n
- 2)03(1, n - 2)
- 2; l)Q,(n - 2, l),
Hence A-2'-1Man-l is where the conjugation relations hold in conjugate to A-2'Mla.-1 in To complete the proof of the Lemma for this case, we need to show that A-2'MI has power -2r. If this is not so, then MI must contain A, so that P ( n - 1; 2)Q3(n - 1,2)a(1; n - 2) must begin with A.-42, n - l), and therefore P ( n - 1;2)Q3(n - 1,2) must begin with A.-,(2, n - 1). This implies that a(1; n - 2)Q3(1,n - 2) contains A,,-'. However, Q(1, n - 2) = Fl(l, n - 2)Qs(l, n - 2)a(l; n - 2)Z2(n - 3 , l ) =
W(2, n - 2)a(l; n - 2)Q3(1,-.n
2)a(l; n - 2)Z2(n- 3, l),
and Q(1, n - 2) is prime to An-l, so that a(1; n - 2)Q3(l, n - 2) must be prime to A,-I. This proves the Lemma for case (b). Case ( c ) . ZI = 1. We have
A = Z2(3,n - 1)F1(2,n - 1)F2= & - l a ( l ; n - 1) P ( n - 1 - 1; 1)An-1(2,n - l ) a ( l ; I ) , so that
On reducible braids
Z2(3,n - 1)Fl(2,n
279
1)- P ( n - 1 - 1; 1)An-42,n - 1).
-
Since uIdoes not appear on the left side of t h e above, we must have 1 = n - 1, and so Z2(3,n - 1)F1(2,n - 1) = An-1(2,n - 1). We now have, from (5.2),
a(1; n - l)R = {A,4}'Q2(2, n - l ) a ( l ; n - 2). If r = 0 then, by using 'rev' of Corollary 20, we obtain a contradiction. Hence we must have r t 1, so that
R
= ,4{A,4}'-lQ2(2,
n - l ) a ( l ; n - 2),
and A -2'Run-1= A i?'lQz(l,n - 2). Now A -2'-IMun-I= A ;?;'Q(l, n - 2) =
A i?'ilFl(l,n
- A,?'i1Z2(2,n =
A ;!'lQ2(1,
-
2)Q2(1, n - 2)Z2(n - 3 , l )
2)F1(1,n - 2)Q2(1, n - 2)
n - 2).
Since A -"R has power - 2r and is conjugate to A -*'-'M, this proves the Lemma for case (c). Finally we consider Case ( d ) . 1, ends only with of ZI.We have A
We obtain, as before, a description
= ZiZzF = Ild.-z~,-lH, A , . *
A
for some H, so that A
= An-zan-lHI1 = An-zP(n -
1; l)P(n - 1;2),
and therefore HII= P ( n - 2; l ) P ( n - 1; 2). Hence by Lemma 15, H = P ( n - 2; a ) P ( n - 1; b), ZI= P ( a - 1; I)P(b - 1;2) for some a, b with lSaSn-l,3GbSn andb-asl. Using A = An-$3(n - 1; a ) p ( n - 1; b)Zl = 1$,F2Z1,we obtain
12(3,n- 1 ) F 1 ( 2 , n - I ) a ( l ; 1 ) = A n - 2 ( 3 , n - 1 ) ~ ( 1 ; n - a ) a ( l ; n - b ) = An-2(3, n -
l)a(2; n - b + l ) a ( l ; n - a ) ,
since b - a 5 1. As in the previous cases, we see that 1 = n - a and that, for some W = W(3, n - l), (5.9)
Z2(3,n - l ) W = An-2(3,n - l), F1(2,n - 1) = Wa(2; n - b + 1).
We may now rewrite (5.2) as
280
J. McCool
a(1; n - a ) R a ( n- a 7 {AA}rQ2(2, n
+ 1; n - l ) a ( n - b + 1;2)
- l ) a ( l ; n - 2),
so that a(1;n - a ) R a ( n - a
+ 1; n - 1) = {Aa}'Q2(2, n - l ) a ( l ; n - b).
If now follows, since (n - a + 1)- ( n - b) 3 2, that {Aa}rQ2(2rn - 1) ends with a ( n - a + 1; n - l), and this implies that Q42, n - 1) = 0 4 2 , n - l ) a ( n - a + 1; n - l), for some Q3. We then have a(1; n
- a ) R = {Aa}'Q3(2, n - l ) a ( l ; n
- b).
Now if r = 0 we conclude from Lemma 19 that n - b 2 n - a, which contradicts the fact that b - a 3 1. Hence we must have r 3 1, and
R
=a(n-a
+ 1; n - 1)a{AA}'-'Q3(2, n - l ) a ( l ; n - b).
We now observe that
A-"R
- A-''a{Aa}'-'a(2;
n - b + 1)Q3(2, n - l ) a ( n - a
+ 1; n - 1)
A -"A{AA}'-'V(2, n - 1) say. We show that this latter word has power -2r. To see this, note that, by (5.9),
Q(2, n - 1) = F1(2,n - 1)Q42, n - l)12(n - 2,2) =
W(3,n - 1)a(2; n - b
+ 1)Q3(2,n - I ) a ( n - a + 1; n - 1)12(n- 2,2),
and the desired conclusion follows easily from t h e fact that Q(1, n - 2) is prime to An-'. Thus we have shown that A -"-'A4 is conjugate to A-"A{Aa}'-'V(2, n - l), and that t h e latter word has power - 2r. Now from Lemma 25 we see that if r = 1 then A-"-'M has summit power t = - 2 or t = - 1. Thus we may suppose that r 3 2. Now if t s - 2r + 1 there is nothing to prove, so we suppose that t > - 2r + 1. Applying Lemmas 25 and 26 we obtain
V(2, n - 1) = X ( 3 , n - 1 ) V,(2, n - 1) Y(2, n - 2) = X(3,
n - 1)V2(2,n - l)a(2; n - 1)Y(2, n - 2)
(for some V,) 7
X(3, n - 1) V2(2,n - 1)Y(3, n - 1)a(2; n - 1)
=
v42, n - 1)a(2; n - 1)
say, where Y(2, n - 2)X(2, n - 2) = A.-2(2, n - 2).
On reducible braids
281
We now obtain the desired A-2'M1.W e have
-
A-~'-'M A - ~ ~ A { A A } ~ - -~ 1) v(~,~ = A - ~ ~ { A A } ~ - ' A v ,n( ~ -,1)a(2; n
= A-2r{AA}r-'V3(1, n -/2)&(2;
- 1)
n - 1)
-A-*'A(AA}'-IV~(~,~I -2)P(n-1;2)
-A
{AA l r - l v3(n - 1,2)a (I, n - 2)
= A -"M1 say, and A -2'M1un-l= A ;!" V3(n - 2,l). We check that A-2'M1 has power -21. If not, then V3(n - 1,2)a(l, n - 2) must begin with A,-'(2, n - l), so that V3(n - 1,2) must begin with A,-'(2, n - l), and so therefore must v3(2, n - 1). This {AA}'-' V(2, n - 1) has powef - 21. contradicts the fact that A Hence A-"M1 has power - 21. To complete the proof of this case, we show that A -2'MIu.-lis conjugate to A-2'-'Ma,-l in E L l . We have
-2ra
A-2'-'Mun-l = A;f;-'Q(l, n - 2) =A
;?;'Fl(l,
n - 2)Q3(l, n - 2)a(n - a ; n - 2)Z2(n - 3 , l )
- A ;?i'Z2(2, n - 2)Fl(1, n - 2)Q3(1, n - 2)a(n - a ; n - 2) = A ; 2 _ ; - ' A , - 2 ( 2 , n - 2 ) a ( 1 ; n - b)Q3(1,n-2)a(n - a ; n l - 2 ) = A ;?;'An-2(2,
n - 2)V(1, n - 2)
= A,?~'A,-2(2, n - 2)V3(1, n - 2 ) a ( l ; n - 2)
-A
I'!;
-A
v3(1, n
-
2)
V3(n- 2, l),
as required. This concludes the treatment of case (d). In the consideration of the above cases, we have shown that if r = 0 then t = - 1, and that if I 3 1 then (b) of the Lemma holds. Thus we have established that the Lemma follows from Lemmas 25 and 26. Proof of Lemma 25. We have p = A -"A{AA}'-' V(2, n - 1) is in standard form, r 3 1 and p has summit power t > - 21. We thus have A{Aa}'-'V(2, n - 1)= FUZ for some F, U, Z with ZF = A. We write F = FlF2where FI3 Fl(l, n - 2) and Fz does not begin with any of W e also write Z = Z1Z2, where Z2 = Z2(2,n - 1) and Zl does ul, .. .,
J. McCool
282
not end with any of u2,.. . It is then easy t o see that V(2, n - 1)= F1(2, n - 1)V2(2,n - 1)Z2(2,n - 1) for some V2. We now obtain descriptions of F2 and ZI. We have A = Z , Z 2 F = Z1A.-1(2, n - 1)W for some W, so that A
= A,-,(2,
n - l)Wfl = An-42, n - l ) a ( l ; n - l),
sothat f , = a ( j ; n - l ) f o r s o m e j WlAn-IF2for some W,, so that A
= An-,F2W, =
with l G j C n . A l s o , A = Z F , F 2 =
A,-IP(n - 1; l),
and therefore F2 = P ( n - 1; k ) for some k with 1 c k c n. We note that not both F2 = 1 and ZI = 1 can hold, since otherwise we would have A = Z2(2, n - l)Fl(l, n - 2) = F , ( l , n - 2)Z2(n - 2, l), which would imply that does not occur in A. We consider separately the cases (a) F2# 1 and Z, # 1, (b) F2 = 1, ZI # 1, (c) F2# 1, Z, = 1.
Case ( a ) . F2 # 1 and ZI # 1. We have j A
1..
n - 1, k C n - 1 and
,. = P ( n - 1; k ) a ( j ;n - 1)f2PI. C
7 F2Z1Z2FI
From Lemma 6 it follows that k # j . We consider separately the subcases j > k, j < k.
Subcase (1). j > k. We note that A
= A,-2(2,
n - 2)&(2; n - 1)
= A,-2(2,
n - 2)P(n - 1; k ) P ( k - 1; l)a(2; j
'-
1)a(j;n - 1)
= A,-2(2, n
- 2 ) a ( l ; j - 2)P(n - 1; k ) a ( j ;nl- 1)P(k
=a(n -k
+ 1; n - 1)A,-2(2, n - 2 ) a ( l ; j
- 1; 1)
-2)P(n - 1; k ) a ( j ;n - l),
so that
12(2,n - l)Fl(l, n - 2) = a ( n - k
+ 1; n - 1)A,-2(2, n - 2)a(1; j - 2).
Using Lemma 13, we now see that
Z42, n - I ) = a ( n - k
+ 1; n - 1)Y,
Fl(l,n - 2) = Xa(1; j - 2), for some X(2, n - 2) and Y(2, n - 2) with YX We now observe that
V(2, n - 1) = X(3, n - 1)a(2; j - 1)V2(2,n - l ) a ( n - k
= A,-2(2,
n - 2).
+ 1; n - 1)Y[2, nl--2)
283
O n reducible braids
is a description of V(2, n - 1) of the required form, with V42, n a(2; j - 1)V2(2,n - l ) a ( n - k + 1; n - 1). Now if r 3 2 we have A -"A{AA }'-I V(2, n
-
=
n
-
-
1)V,(2,n
-
1)Y(2, n - 2)
2)X(2, n - 2)A{AA}r-' V'(2, n - 1)
A-2'A,-2(2, n - 2)AV1(2,n
-
l){AA}r-'
- A -"A
{AA }'-'AAn-2(2, n - 2)A V'(2, n - 1)
=A
{AA } r - 2 ~ cv,(2, + I n - 1)
=A
1) =
1)
= A-2rA{AA}r-'X(3,n
- A -"Y(2,
-
-"+'A{AA}r-2alVI(2, n - 1).
Thus to establish the Lemma in subcase (l), when r 3 2 , we need only check that this last word has power - 2r + 1. If this is not so, then, since A{Aa}'-'a, ends only in ulr V1(2,n - 1) must begin with A.-42, n - 1). This implies that V(2, n - 1) contains An-'(2, n - l), which contradicts the fact that p has power - 2 r . We now suppose that r = 1. We then have V(2, n - 1) = FUZ, so that AV2(2, n - 1) = a ( n - 1; k)UP(n - j ; l), and therefore P(k - 1; 1)V2(2,n - 1 ) = U @ ( n- j ; l), where 1 5 k < n - 2 - 2 5 j 5 n - 1 and k < j . Now if k = 1 we obtain a contradiction, since u 1would then occur in U p ( n - j ; 1) but not in P ( k - 1 ; 1)V2(2,n - 2). Hence k 3 2 , and it now follows from Corollary 20 that A-'U has summit power - 1. Since 1 A-'U, this concludes the proof of the Lemma for Subcase (1).
A
-
Subcase (ZZ). k > j . We have A
= AA,-r = a(1; j - 1)a(j;
n
- 1)P(n - 2; k - 1)P(k -2;
= P ( n - 1; k ) a ( l ; j = a ( 1 ; j - l)P(n -
l)a(j; n - 1)P(k
- 2;
l)A.-2(2, n - 2)
1)6.-2(2, n - 2)
1; k)a(j; n - 1)P(k - 2; 1)An-2(2,n - 2)
= P ( n - 1; k ) c y ( j ; n -
1)p(k -2; 1)~¶,-~(2, n - 2)P(n - 1; n - j
+ l),
so that
i2F1= P ( k - 2; 1)A.-2(2, n - 2)P(n - 1; n - j + 1). We then obtain F , ( l , n - 1 ) = Xa(1; j - l), 142, n - 1) = a ( n - k + 2; n - l ) Y for some X,Y with YX = An-2(2,n - 2). If r
32
J. McCool
284
the result then follows as in Subcase (1). If r = 1 then we have P ( k - 1; 1)V2(2,n - 1)= U p ( n - j ; l), as in Subcase (I), where 1 j n - 2 and 2 k n - 1; an application of Corollary 20 now shows that r = - 1.
Case ( b ) . Fz = 1, ZI# 1. We have A
=
ZIZzFl = P ( n - j ; 1)Z2Fl= P ( n - j ; 1)An-,(2,n - l ) a ( l ; j - l),
so that 142, n - l)FI(l, n - 2)= An-,(2, n - l ) a ( l ; j - 1) = ( ~ ( 2n; -
1)An-2(2,n - 2)a(l; j - l),
and if r 2 2 this gives the desired result, as in t h e previous case. If r = 1 then we have AV2(2,n - 1) = U p ( n - j ; l), so that Vz(l,n -2)P(n - l ; n - j
-
+ 1 ) = U.
Hence A - ' U A - ' a ( l ; j - l)Vz(l, n - 2), and j d n - 1 since I , # 1, so that, by Lemma 18, A - ' U has summit power - 1, as required.
Case ( c ) . F2# 1, ZI= 1. We have
A
= Z2FIF2 7 I2F,P(n -2;
= &(n
-k
k)
+ 1; n - l)A.-IP(n
- 2; k ) ,
so that Z2(2,n - l)Fl(l, n - 2) = a ( n - k =a(n-
k
+ 1; n - 1)An-,
+ 1; n - 1)A.-2(2, n - 2)a(l; n - 2).
As before, this establishes the result if r 3 2. If r = 1 we have AV2(2, n - 1) = P ( n - 1; k)U, so that U = P ( k - 1; 1)V2(2,n - 1) and A-'U A-'V2(2, n - l ) a ( n - k + 1; n - 1).
-
Now k d n - 1 since F2# 1, and the result follows from Lemma 18. This concludes the proof of Lemma 25.
Proof of Lemma 26. We have 4 = A - 2 ' + ' A { A a } r -V42, 2 ~ ,n - 1) is in standard form, r B 2 and 4 has summit power t > - 2r + 1 . We therefore have
A { A A } r - 2VI(2, ~ l n - 1)
FWf,
for some F, .W, Z with A = IF. We write F = FIF2,where F , = F,(2, n - 2) and F2 does not begin
On reducible braids
285
with any of u l ,u2,. . . , un-z.Now alVl(2, n - 1) must begin with F1(3, n - l), V,(2, n - 1) = F1(3,n - 1)V2(2,n - 1) say, so that A{AA}'-2alV2(2, n - 1) = F2Wf.Now if F2 begins with u,,then alV2(2,n - 1) must begin with u2.However, this would imply that V2(2,n - 1) begins with a2a1, which is impossible. Hence either F2 = 1, or F2 begins only with am-1. We also write Z = Z1Z2, where Z2 = Z2(1, n - 2) and ZI does not end u 2 , .. . , un-z. We then have V2(2, n - 1) = V42, n - l)i2, with any of ul, for some V,. We now find descriptions of F2 and ZI.W e have A = ZFlF2= UA,-IF2for some U, so that A.-,A = An-IF2fi, and F2 = P ( n - 1;j ) for some j with 1 s j s n. Also, A = ZIZ2F= ZIA,-IH for some H, so that H i , = A and f, = P ( k ; 1) for some k with 0 k n - 1. We note that not both ZI= 1 and Fz = 1 can hold, since otherwise we would have A = Z2(1, n - 2)Fl(2, n - 2), which is impossible. We consider separately the cases (a) I , # 1 and F2 # 1, (b) F2# 1, ZI= 1, (c) Fz= 1, ZI # 1.
Case ( a ) . ZI# 1, F2# 1. We have 1 S j , k
A
= Z2FlFzf, = Z2Flp(n-
S
n - 1. Now
1;j ) p ( k ; l),
so that j # k . If j < k then A has a subword P(n
1; j ) P ( k ;j + 1)Po'; 1)- P ( k - l ; j ) P ( n - 1;j)Po'; I),
which is impossible. Hence k < j. Now if j > k A
+ 1 then
j > 1 and
+ 1)P(k; l)P(n - 1; j ) P G - 1; 1) = a ( n - j + 1; n - l)A...2P(n - 2; k + l)P(n - l ; j ) P ( k ; 1) = a ( n - j + 1; n - l)A.-2P(n - 2; k + 1)F2fl, = A,-$(n
- 2; k
so that Z2(l,n - 2)Fl(2, n - 2) = a ( n - j
+ 1; n - 1)An-$(n
- 2; k
+ 1).
However, occurs on the right side above, but not the left, which is impossible. Hence we must have j = k + 1. Since j = k + 1 we have A = Z2F,A,and therefore
Iz(1, n - 2)Fl(2, ~t- 2) = An-I
=
P ( n - 2; 1)Am-2(2,n
- 2).
Applying Lemma 13, we see that Z2(l,n - 2) = P ( n - 2; 1)Z, for some Z = Z(2, n - 2), so that 1, = ( ~ ( 2n; - l ) Z ( n - 2,2) = Z ( n - l,3)a(2; n - 1). Since Vl(2, n - 1) ends with f2, this proves the required result.
286
3. McCool
Case ( b ) . 1 1 = 1 ,Fz#l. We have k = 0 , l s j s n - 1 and
Iz(l,n - 2)Fl(2, n - 2 ) = UA.-,
= a ( n -j
+ 1; n - l)An-l.
Now a,,-, does not occur in IzFl, and so we must have j IzFl = An-l, which yields the result as in Case (a).
Case ( c ) . II# 1, F2= 1. We have 1 < k
Iz(1, n - 2)F1(2, n - 2) = A,-IH
=
1 and hence
n - 1, j = n and A,-iP(n - 1; k
+ l),
which is impossible unless k = n - 1. Thus LF, = Am-l,and the result follows. This concludes the proof of Lemma 26.
Proof of Lemma 24. Suppose that A-"M is in standard form, r a 1, and A -"M has summit power t > - 2r. We then have M = FRI for some F, R, I with A = IF. We note that A-"FRI A-"+IR, and that A-'"'R has power - 2r + 1. We show, firstly, that A-2'Mu,,-I has power -2r. If this is not the case then, by Lemma 16, it has power - 2r + 1, and by Lemma 22 we have FRIU,,-~= A{Aa}'-'AQ, where 0 = Q(1, n - 2) and Q is prime to A,,-,. We then have FRIu.-, = A{Aa}'-'Q(2, n - 1)A, so that FRI must have power contains A. This is a contradiction, and so A - 2r. We now have, by Lemma 22, that FRIun-, = {aA}'Q(l, n - 2), where Q is prime to Hence
-
FRI = a{Aa}'-lQ(2, n - l ) a ( l ; n - 2). We write F = Fl(l, n - 2)F2, where Fz does not begin with any of u,,. . . , v,,-~, and I = 1112(1,n - 3), where II does not end with any of ul,. . . ,c.-~.We then have Q(2, n - 1)= F1(2,n - 1)Q1(2,n - 1)Iz(2,n - 2) for some Q1, so that
(5.10)
FzRIl= A{AA}'-IQl(2, n - l ) a ( l ; ~t- 2).
We now obtain a description of Fz.We have A = IFIFz=UAn-IF2 for some V,so that A = A.-1F20= An-lfl(n - 1; 1). Hence F2= P ( n - 1; I) for some 1 with 1 S 1 d n. We now consider separately the four possibilities for I,: (a) II ends in both and (b) ZI ends only in (c) ZI = 1, (d) I I ends only in
Case ( a ) . I , ends in both and u n - 2 . We have A = Z l I 2 F = IIA.-zH, for some H, so that A = f i I I A n - z = a(1;n - l)a(1;n - 2)A,-*, and
287
O n reducible braids
~ n - 2)a(l; t~ - l)un-2. NOW therefore f i Z l a(1; n - l ) a ( l ; n - 2 ) a(2; z1= WU,-2un-lu,-2 say, so that fiW = a(2; n - 2)a(l; n - 3). Applying Lemma 15 we obtain fi = a(2;k ) a ( l ; j ) and W
a(k
+ 1; n - 2)a(j + 1; t~ - 3),
forsome k, j w i t h O s j s n - 3 , l ~ k ~ n - 2 a n d k - j * 1 , s o t h a t Z I = a ( k + 1; n - 2)a(j+ 1; n - 1)3,-2. We now obtain a description of Zz(l,n - 3)Fl(l, n - 2). We have A
7ZIZ2FiF2
=a(k
+ 1; n - 2)a(j + 1; n - I)U,-~A.-,P(~- 2; n - k ) P ( n - 1; n - j),
so that Z2(1,n - 3)Fl(l, n - 2)P(n - 1; 1 ) = A.-2P(n - 2; n - k ) P ( n - 1; n - j). Using Lemma 13 we see tht 1 = n - j, so that Z2(1,n - 3)Fl(l,n - 2) = A,-$(n
- 2; n
- k),
and, using Lemma 13 again, we have
Fi(1, n - 2) 7 Z(1, II - 3)P(n - 2; n - k ) , for some Z. We now have, from (5.10), that
P ( n - 1; n
- j)Ra(k
+ 1; n - 2)a(j + 1; n - 1)
= A{AA}'-IQ1(2, n
- l ) a ( l ; n - 3).
Since the left side of this ends in u , - ~we , have Ql(2, n - 1) = Q2(2,n - l ) ~ "say, - ~and P ( n - 1; n - j ) R a ( k = A{AA}'-IQ2(2,
+ 1; n - 2)aG + 1; n - 2) n - l ) a ( l ; n - 3).
Since the left side of this ends with (242, n - l)a(2; n - 2) say, and
we have Q2(2,n - 1)=
P ( n - 1; n - j)Ra(k + 1; n - 2) ={AA}~-*Q~(I, n - 2 ) ~ ( -n 1;2)a(2; n - 2)a(1; j ) = {AA}r-1Q3(1, n - 2)P(n - 1;2)a(2;k ) a ( l ; j ) a ( k
+ 1; n - 2),
so that
P ( n - 1; n - j ) R = {AA.)-'Q3(1,n - 2 ) P ( n - 1;2)a(2; k ) a ( l ; j ) . This equation may be solved directly for R if r a2, while if r = 1 a solution may be obtained using Lemma 19. However, in order to avoid
J. McCool
288
separate consideration of these two cases, we proceed somewhat differently. We have A -"M = A -"FRI = A -"FIP(n - 1; n - j)RZIZ2 = A-"FI{~A}'-'Q3(1, n
- 2)P(n - 1;2)a(2; k ) a ( l ; j ) a ( k + 1; n - 2)
(Yo' + 1; n - l)an-2Z2
- A-"Z2Fl{~A}'-'Q3(l,n - 2)P(n - 1;2)0(2; n - 2)a(l; n - l ) ~ . - ~
- A-"a(l;
t~ -
l ) ~ ( t~ l ;- 2)ZzFl{AA}'-'Q3(1, n - 2)P(n - 1;2),
and substituting the expression obtained for Z2F1, we have
A-"M
- A-"a(l; =A
n - l ) a ( l ; n - 2)A.-z/3(n - 2; n - k){aA}'-' Qd1, n - 2)P(n - 1; 2)
- ~ ~ + ' { A A }-~2;- n~ -~ k( )~a 3 ( 1 ,n - 2 ) ~ ( n 1;2)
- A - " + ' { A a } ' - ' a ( 2 ; k ) Q 3 ( n - 1 , 2 ) a ( l ; n -2) A -Z'+IMI say, and A -"+lMlan-l= A i?,+'a(l;k - 1)Q3(n - 2,l). We now show that A-"MU~-~and A-2'+'MIan-Iare conjugate in B,-]. We have
A-"Man-I
= Ai?'lQ(l,
n - 2)
= A i?IFl(l, t~ - 2)Q3(1, n
- 2 ) a ( l ; n - 2)Z2(1, n - 3)
- A if;a(l;n - 2)Z2(1,n - 3)Fl(l, n - 2)Q3(l, n - 2) = A i?',+IP(n- 2; n - k)Q3(1,n - 2)
-
A i?';'a(l; k - l)Q,(n - 2, l), as required. Next we check that A-"+'MI has power - 2r + 1. If this is not the case, then MI contains A, so that a(2; k ) Q 3 ( n - 1,2)(1; n - 2) begins with An-'(2, n - l), and therefore a(2; k ) Q 3 ( n - 1,2) also begins with An-1(2,n - 1). However, this would imply that Ql(l, n - 2) contains An-,, since
01(l,n-2)=F~(l,n-2)03(1,n-2)a(l;n-2)Zz(l,n-3) = Z(1, n - 3 ) P ( n - 2; n - k)Q3(I, n
- 2)a(1; n - 2)I2(1, n - 3).
This contradiction shows that A-"+'MI has power - 2r + 1. It remains to show, in Case (a), that r = - 1 if r = 1 . Now when r = 1 we have
On reducible braids
289
A-'M 1 = A - ' a (2; k ) Q 3 ( n - 1 , 2 ) ~ ~ (n1-2) ; -A-'p(n-
1;2)a(2;k)Q3(n- 1,2),
so that A-'MI has summit power - 1, as required. Case (b). ZI ends only in un-'.We then have A = Z l Z 2 F = ZIAn-lH for some H, so that f i Z l = A, and therefore ZI = a ( s ; n - 1) for some s with l ~ s ~ n - 1 . Now A = ZlZ2FlF2 = ZIA,-lP(n - 1; n - s + l), so that
Z 2 ( 1 , n - 3 ) F ~ ( l , n - 2 ) ~ ( n - 1 ; 1 ) = A n - ~ P ( n - 1 ; n - ~1). + From this we conclude that 1 = n - s + 1, so that
141,n - 3)F1(1,n - 2)
A,-'
An-2/3(n- 2; l),
and therefore FI(l,n - 2) = J(1, n - 3 ) P ( n - 2; l), for some J. We now see, from (5.10), that
P ( n - 1; n
-s
+ l)Ra(s; n - 1 ) = a{Aa}'-'Q1(2, n - l ) a ( l ; n - 2).
Since the left side of the above ends in Q2(2,n - l)a(2; n - 1) say, and then
we have 0 4 2 , n - 1) =
P ( n - 1; n - s + l)R ={(aA}'-'Q,(l,n-2)P(n-l;2)a(2;n - l ) a ( l ; ~ ' - l ) . We now observe that
A-"M = A-"FRZ
= A-"Fip(n
- 1; n - s + 1)RZIZZ
= A-2rFl{~A}'-'Q2(1, n - 2)P(n
,
- 1;2)a(2;n - l ) a ( l ; s - l ) a ( s ; n
- A-2'a(1; n - 1)Z2FI{~A)'-'Q2(1,n - 2)P(n - 1;2)a(2; n - 1) = A-2r+'{AA}r-'Q2(1,n - 2)P(n - 1;2)a(2; n
- A-2"'{Aa}'-'a(2;
- 1)
n - 1)Q2(n - 1,2)0(1;n - 2)
A -2r+1M1
say, and A-2'+'Mlun-l= A ;!',+'a(l;n - 2)Q2(n - 2,l). Now A 2'lQ(l,n - 2) = A 2'lFIQ2(l, n - 2)a(1; n - 2)Z2
- A,?'T.;''p(n - 2; l)Q2(1, n - 2) - A ;!',+'a(l; n - 2)Q2(n - 2, l),
so that A-2'Mun-I is conjugate in to A-2'+'MIun-I. Now it is easy to see that A-2'+'M1 has power - 2r + 1, so to
-
1)Z2
J. McCool
290
complete the proof of the Lemma in Case (b) we only have to examine what happens when r = 1. In that situation, we have A-'M1 A-'P(n - 1 ; 2 ) a ( 2 ;n - l)Q2(n - 1 , 2 ) , so that t = - 1, as required.
-
Case (c). Z,= 1. We then have
Z2(l, n so that
- 3)Fl(1, n - 2)/3(n
- 1; I ) = A
=a(n- 1
+ 1; n - l)A.-lP(n
- 1; l ) ,
Z2(1, n - 3)Fl(1,n - 2 ) = a ( n - 1 + 1; n - 1)An-2,
and therefore 1 = 1. We may now rewrite (5.10) as
R
= {AA}'-'Q1(2,
n - 1)a(1;II - 2).
Hence A - Z ' + I R ~ n=- lA ;?,+'QI(l,n - 2 ) . Now
A-2'M~n-I = A,?'Q(l, n - 2 ) = A ;!',F1QI(l, n
- 2)ZZ- A ,!',+'Q,(l,n - 2).
Noting that if r = 1 then A - ' R has summit power result is established for Case (c).
- 1, we see that
the
Case (d). ZI ends only in We then have A = Z1Z2F= ZlA.-2~n-,H, for some H, so that A = fiZlAn-2cn-l = a ( 2 ; n - 1)a(1;n - 2)A.-zun-l, and therefore f i Z l = a ( 2 ; n - l ) a ( l ; n - 2). From Lemma 15 it follows that I, = a ( a + 1; n - l ) a ( b + 1; n - 2 ) and fi = a ( 2 ; a ) a ( l ;b), for some a , b with l S a S n - l , O S b S n - 3 a n d a - b a l . = Z,ZzFlF2, so that Hence A = Z l A n - 2 ~ n - l H
Z2(l, n - 3)Fl(1,n - 2 ) P ( n - 1; I )
= A . - , p ( n - 1 ; n - a ) P ( n - 1; n - 6 ) = An-$(n - 2; n -
since a - b
2
b - 1)P(n - 1; n - a ) ,
1. It then follows that 1 = n - a, so that
Z2(1, n - 3)FI(1,n - 2 ) = An-2P(n- 2 ; n - b - l),
and therefore FI(l,n - 2 ) = K ( 1 , n - 3 ) P ( n - 2 ; n - b - 1) and Z2(l, n - 3 ) K ( 1 ,n - 3) = for some K. We may now write (5.10) as
p ( n - 1; n - a ) R a ( a + 1; n - l ) a ( b + 1; n - 2 ) F A{AA}'-'QI(2,
n - l ) a ( l ; n - 2),
On reducible braids
291
so that
/3(n-l;n-~)Ra(~+l;n-l)~A{AA}'-'Q,(2,n-l)~(l;b). Since a - b
1, it then follows that
3
Ql(2, n - 1) = Q2(2, n - l ) a ( a
+ 1; n - 1)
say, and R
/3(n - u - 1; l){AA}'-'Q2(2, n - l ) a ( l ; b).
We then have A-"+'R
- A-"+'P(n
+ 1; n - 1) = A-2r+'{AA}'-'/3(n - 1; n - b)Q2(2, n - l ) a ( a + 1; n - 1)
say. We note that
- 1; n - b){AA}'-'Q2(2, n - l ) a ( a
A -''+'{AA}'-' V(2, II - 1)
Q(1, n - 2) = K(1,
n - 3 ) P ( n - 2; n
- b - l)Qz(l, n - 2 ) a ( a ; n - 2)12(1,n - 3)
and that Q(1, n - 2) is prime to Am-'. From this it follows easily that A -'"'{AA}'-'V(2, n - 1) has power - 2r + 1. Now if r = 1 then A-'V(Z, n - 1) has summit power - 1. Hence we may assume that r 2 2, and t > - 2r + 2. Applying Lemmas 27 and 28, we obtain V(2, n - 1) = X(2, n - 2) V1(2,n - 1)Y(2, n - 2) = X(2,
n - 2)V2(2, n - l ) a( 2 ; n - 1)Y(2, n - 2)
= X(2, n - 2) V42, n - 1)Y(3, n - 1)a (2; n - 1)
v3(2, n - 1)a (2; ? -I1) say, and ?X = A,-42, n - 2). W e now obtain the desired A -"+'MI. We have A - z r + 1 { ~ A } r - l v (~t 2, 1) = A - 2 r + 1 { ~ A } r - 1 ~n3 (-2i)a(2; , n - 1) =
A - 2 r + 1 ~ { A ~ } r - 2 ~-32)Aa(2; (i, n - 1)
- A-2r+'{AA}r-'V3(l,n - 2)P(n - 1;2) - A - ~ ~ + ' { A A } ~ -- ' i~, 2, () ~a ( i ; n - 2) A -"+'MI say, and A-Z'+'Mla.-I= A 2',+'V3(n - 2 , l ) .
I. McGwl
292
Now
A-"Mun-l
= Ai?'lQ(l, ~t- 2)
= A ;FIK(l, n
- 3)V(1, n - 2)12(1,n - 3)
- A ;!'lZz(l, n - 3)K(1, n - 3)V(1, n - 2) = A ,!'lAn-zV3(1,
-A
I;?'
n
- 2)(w(l;
- 2)
~t
V3(l, n - 2),
so that A-z'Mun-l is conjugate to A-Z'+lM1u,,-l in &-,. It remains to establish that A-"+'M1 has power - 2r + 1. If this is not the case, then V3(n - 1 ,2 )a(l; n - 2) must begin with A,,-,(2, n - l), and so therefore must V3(n - 1,2). However, this would imply that A-2'Mu,,-I has power greater than - 2r in Eln-,. Hence A -"+'MI has power - 2r + 1. The concludes the proof of Case (d) and thereby establishes the Lemma. Proof of Lemma 27. We have p = A-zr+1{AA}'-1V(2,n - 1) is in standard form, r 3 2 and p has summit power t > - 2r + 1. We then have {AA}'-'V(2, n - 1 ) = FWf for some F, W, I with A = IF. We write F = F1(2,n - l)Fz, Z = ZIZz(l,n - 2), where Fz does not begin with It is any of u 2 , .. . , C T - ~and ZI does not end with any of u l , .. ., then easy to see that V(2, n - 1) = F1(2,n - 1)V2(2,n - 1)12(n- 1,2), for some Vz. We obtain descriptions of II and F2. We have A = 1112F=IIAn-lU for some U,so that A,-IUfl = A, and therefore f, = P ( k ; l), for some k with 0 S k S n - 1. Also, A = ZFIFz= HAn-1(2,n - l)Fz for some H, so that F 2 H = A and F 2 = a ( 1 ; j ) , forsome j with O s j s n - 1 . We note that not both Z, = 1 and Fz = 1 can hold, since otherwise A = F,(2, n - l)Zz(n - 1,2), which is impossible. We consider separately the cases (a) F2 # 1 and ZI # 1, (b) ZI # 1, F2 = 1, (c) ZI = 1, F2# 1.
Case ( a ) . We have A = 12FIFzfl = ZzFIa(l;j ) p ( k ; l), so that j # k . We consider separately the subcases (I) j > k , (11) j < k. Subcase (I). j > k . Then A
= P(n -j
- 1; 1)A,-2(2, n - 2)P(n - 1; k
+ 2 ) a ( l; j ) p ( k ;l),
so that Zz(l, n - 2)Fl(2, n - 1) = P ( n - j - 1; 1)An-z(2,n - 2)P(n - 1; k
+ 2).
293
On reducible braids
It now follows that there exist X(2, n - 2), Y(2, n - 2) with Z2(l,n-2)=p(n-j-1;1)Y(n-2,2), F1(2,n-l)= X(2, n - 2)P(n - 1; k + 2) and Y(n - 2,2)X(2, n - 2) = A,-42, n - 2), and therefore
V(2, n - 1) = X(2, n - 2)V1(2,n - 1)Y(2, n - 2), where V1(2,n - 1) = P ( n - 1; k + 2)V2(2,n - 1)a(j + 1; n - 1). To establish the Lemma for the subcase under consideration, we Vl(l, n - 2), and that this need to show that p A -2"2{~A}r-2an-, latter word has power - 21 + 2. We have
-
p = A -2'"{AA}'-1X(2,
n - 2)V1(2,n - 1)Y(2, n - 2)
= A-"+'XA{AA}'-'VI(l,
- A-"+'AY(n =
n - 2)AY(2, n - 2)
- 2,2)XA{AA}'-ZVl(1,n
- 2)
A -2'c1Aan-1 V1(l,n - 2){dA}'-'
- A -2r+z{AA}'-2an-IVl(l, n - 2).
Now if this last word has power greater then - 2r + 2, then VI(l, n - 2) must contain An-l, which would imply that p does not have power - 2r + 1. Hence A -ZrfZ{~A}'-2an-I V1(l,n - 2) has power - 2r + 2, as required.
Subcase (IZ). j < k. Then A
=P(n
- j -2; 1)An-2(2,n -2)P(n - 1; k
+ l ) a ( l ; j ) P ( k ; l),
so that
IzFl= P ( n - j - 2; 1)An-42, n - 2)P(n - 1; k
+ l),
and the result follows as in the previous subcase. Case (b). II# 1, F2 = 1. We have IzFi = An-lU = An-iP(n - 1 ; k
+ 1)
=a(l;n-2)An-2(2,n-2)P(n-l;k and the result follows as above.
+l),
Case (c). ZI= 1, F z # 1. Then IzF1 = HAn-l(2, n - 1) = P ( n - j - 1; 1)An-42, n
and the result follows.
-'2)P(n - 1;2),
1. McCool
294
This concludes the proof of Lemma 27. Finally, we have Proof of Lemma 28. We have + = A --2"2{AA}r-2a.-l Vl(l, n - 2) is in standard form, r 3 2 and has summit power t > - 2r + 2. W e Vl(l, n - 2) = FWI, for some F, W, I with therefore have {AA}'-2an-l A = IF. We write F = F l ( l , n - 3)F2, where F2 does not begin with any of uI,. . . , an-3.We then have Vl(l, n - 2) = F I ( l , n - 3)v2(l, n - 2) for V2(1,n - 2) = F2WI. Now if F2 begins with some V2, so that {AA}r-2an-l then an-l V2(1,n - 2) begins with so that V2 begins with ~ , - ~ a . -which ,, is impossible. Hence either F z = 1 or Fz begins only with We also write I = 1,12(1,n - 2), where 1, does not end with any of aI, . . . ,an-2. We then have V2(1,n - 2) = V3(l, n - 2)12(1,n - 2), for some V,. We now describe I, and F2. W e have A = II12F= l l A n - , U, for some U , so that Uf,= A and f, = P ( j , 1) for some j with O S j S n - 1. Also, A = l F l F 2 = HAn-,F2,for some H, so that F2H = and F 2 = P ( n - 1; k ) for some k with 1 S k S n. We note that not both I,= 1 and Fz = 1 can hold. We consider separately the cases (a) I,# 1, F2 # 1, (b) II= 1, F2# 1, (c) II# 1, F2 = 1.
+
a
Case (a). 1 1 # 1 , F2#1. We have 1 S j S n - 1 , 1 S k S n - 1 and A
= I2FIFzfl = I z F , P ( n -
1; k)p(j; l),
so that k # j . If j > k then we obtain a contradiction as in Lemma 26. If k > j + l then
A
= A,-2P(n
- 2; j
+ l)P(n - 1; k ) P ( j ; 1)P(k - 1; l),
so that
+ 1; n - l)A.-2/3(n - 2; j + 1). k = 1, but since k > j + 1 we cannot
I2(1, n - 2)Fl(l, n - 3)= a ( n - k This, however, is impossible unless have k = 1. Hence k = j + 1. We now have A = I2Fla,so that
12(1,n - 2)F1(1,n - 3) = a ( 1 ; n - 2)A.-2, and therefore Z2(1,n - 2) = a ( 1 ; n - 2)X(1, n - 3) = X(2, n - 2 ) a ( l ; n - 2), for some X. This shows that VI(l, n - 2) ends with a ( 1 ; n - 2).
On reducible braids
295
To complete the proof of this case, we note that when r = 2 we have u , - , V 3 ( l , n - 2 ) = F 2 W Z 1 = ~ ( n -l ; k ) W a ( n - j ; n - l ) , so that V3(l, n - 2) = P ( n - 2; k)Wa(n - j; n - l), which is impossible. Hence Case (a) cannot occur when r = 2.
Case ( b ) . ZI= 1, F2# 1. We have j
=0
and 1 s k < n - 1. Now
Z2(l, n - 2)Fi(l, n - 3) = HAn-]7 c*(n - k
+ 1; n - l)An-i,
so that we must have k = 1, and Z2Fl= A,-I. If r z 3 then this proves the result, as in the previous case. If r = 2 we have a,,- V3(1,n - 2) = F2 W =
a W,
so that V3(1,n - 2) = P ( n - 2; 1) W, and therefore W = W(1, n - 2). Now A -2a,-l Vl(l, n - 2) A - I W, and A - I W has summit power - 1, as required.
-
Case (c). ZI # 1, F2 = 1. We have 1 s j < n - 1 and k
= 0.
Now
12(1,n - 2)Fl(1, n - 3) = An-IU= A , - ] P ( n - 1; j + l),
so that we must have j = n - 1 and Z2Fl= A,-]. If r 2 3 this establishes the result. If r = 2 then we have an-I V2(1,n - 2) = WA, so that u1V2(n - 1,2) = Wa.However, from Corollary 20 we see that this is impossible. This concludes the proof of the Lemma. References [ l ] J.S. Birman, Braids, links and mapping class groups (Princeton University Press, 1975). [2] W.L. Chow, On the algebraic braid group, Ann. of Math. 49 (1948) 654-658. [3] F.A. Garside, The braid group and other groups, Quart. J. Math. Oxford 20 (1969) 235-254.
S.I. A d i ~W.W. , Boone, G. Higman, eds., Word Problems I1 @ North-Holland Publishing Company (1980) 297-298.
ON SOME GROUP PRESENTATIONS* B.H. NEUMANN Instintte of Advanced Studies, me Australian National University, Canberra
Let p, q, r, s, r, u be integers, and denote by G(p,q, r, s, r, u ) , or G for short, the group with three generators a, b, c and defining relations
a P b= bas, bqc = cb‘, c’a = ac”. It is known that G(1,1,1, - 1, - 1, - 1) is infinite [2, 021 and that is trivial (Higman [I], see also [2, 923)). If
G(1,1,1,2,2,2)
(*)
2spslsJ,
2sqsJr1,
2 ~ r ~ l u J ,
then it can be shown that G is infinite. The method consists of constructing a “normal form” for the elements of G. Though the idea is simple, the technical details are complicated. The groups G(p, q, r, p + 1, q -+ 1, r + 1) have no proper subgroups of finite index, and thus no non-trivial finite epimorphic images. The proof follows one of Higman [ l ] (see also [2], 023). On the other hand, if one of 1 s - p 1, I t - q 1, 1 u - r I is different from 1, the group is easily seen to have non-trivial finite factor groups. All the presentations considered have solvable word problem; this is an immediate consequence of the fact that there is a normal form, which is readily computable, for the elements of the groups presented. The question with which the investigation started is, whether G(2,2,2,3,3,3) is finite (and then necessarily trivial) or infinite; and the author is indebted to Dr A.M. Biunner for reminding him of it. This group is infinite. By contrast it follows from recent, more general results of Post 13) that G(2,2,1,3,3,3) is finite, and G(2,2,1,3,3,2) is * Most of this work was carried out at the Conference, but not formally presented there; it was only shown privately to a few interested participants. I t was reported to the Conference on finire groups and permutarion groups at the Mathematisches Forschungsinstitut Oberwolfach, Germany, in August 1976. 297
298
B.H. Neumann
trivial. The author is grateful to Dr Michael J. Post for permitting him to quote his unpublished results. A more detailed account is to be published in the Canadian Journal of Mathematics. References (11 Graham Higman, A finitely generated infinite simple group, J. London Math. SOC.26 '
(1951) 61-64.
[2] B.H. Neumann, An essay on free products of groups with amalgamations, Phil. Trans. Roy. SOC.London, Ser. A 246 (1954), 503-554. [3] Michael J . Post, A note on finite three-generator groups with zero deficiency, to be published.
S.I. Adian, W.W. Boone, G. Higman, eds., Word Problems I1 @ North-Holland Publishing Company (1980) 299-335
RADCLIFFE
THE CONCEPT OF “LARGENESS” IN GROUP THEORY Stephen J. PRIDE* Open University, Milton Keynes
91. Introduction 91.1. Summary of results
There are many group-theoretic properties which are traditionally thought of as measuring the “largeness” of a group - for example, having cardinality greater than or equal to K for some infinite cardinal K , being SQ-universal, having a free subgroup of rank 2, having a properly ascending chain of normal subgroups. There are also many groups which are traditionally thought of as being “large” - nonabelian free groups and Fuchsian groups for example. At the other end of the scale there are groups, which although infinite, are still considered to be rather “small” - infinite groups in which every nontrivial normal subgroup has finite index for instance. The aim of this paper is to develop a theory which gives precision to these vague notions of “largeness”. This theory provides a framework in which to discuss several well-known, but until now, unrelated ideas. It also gives a different viewpoint, which leads to new results. By examining several properties 9 which are traditionally thought of as indicating the “largeness” of a group, one finds that they all satisfy three basic conditions, and moreover, it is these conditions which account for the fact that the property is a measure of “largeness”. (LP 1) If G has 9 and K maps onto G then K has 9. (LP 2) If H s G with H of finite index in G then G has 9 if and only if H has 9. (LP 3) The trivial group 1 does not have 9. A group-theoretic property satisfying (LP 1)-(LP 3) will be called a large property. In 02.1, partly for illustration, and partly for use in later discussion, several well-known large properties are listed. In 02.2 it is shown how to associate with any infinite group A a large property 2 ( A ) called the large property generated by A . This notion enables one to compare two infinite groups G, H. Thus G is said to be larger than H (written * Present address: University of Glasgow. 299
S.J. Pride
300
H < G ) if G has 6P(H); G and H are said to be equally large if G has 9 ( H ) and H has 9 ( G ) ;and G is said to be strictly larger than H if G is larger than H, but G and H are not equally large. If G is larger than H then any large property enjoyed by H is also enjoyed by G ; and if G and H are equally large then they have the same large properties. The relation of larger than is a quasi-order, and one would hope that groups traditionally thought of as being “large” would be “high up” in this quasi-order, and groups traditionally thought of as being “small” would be “low down”. These two extremes are examined in Sections 3 and 4 respectively. If G is a finitely generated group then it is easily shown that F2, the free group of rank 2, is larger than G. Thus G is “as large as it can possibly be” if it is equally as large as F2. In §3.2 it is shown that many finitely generated groups which are traditionally thought of as being “large” are actually equally as large as F2. In particular, this is verified for: (i) Fuchsian groups (Theorem 3.2); (ii) one-relator groups with torsion on at least two generators (Theorem 3.3); (iii) certain arithmetically defined linear groups PSL(2,
1, SL(2, 1, GL(2, 1.
Some remarks are also made concerning small cancellation groups. The verification that one-relator groups with torsion on at least two generators have 6P(F2)provides a new and straightforward proof that these groups are SQ-universal. Also in 83.2, an example is given of a finitely generated one-relator group B which, although having several well-known large properties, nevertheless is not equally as large as F2. To be precise, it is shown that if B = ( a ,t ; t-’a2t = a3) then no subgroup of finite index in B maps onto a free group of rank greater than 1 (Theorem 3.4). In 03.3 the question of which free products have 9 ( F 2 )is discussed. Let G = H * K with H , K # l . If ( H I = I K ( =then 2 G iscyclic-byfinite and is “small”. On the other hand, if max{ 1 H I , IK I} > 2 then G has many well-known large properties. However, it is often quite a complicated affair to establish that G has such properties, and this leads one to think that perhaps G need not necessarily be “naturally large” in the sense of having 9 ( F 2 ) .This turns out to be the situation. It is shown in 83.3 that G has 8 ( F 2 ) if and only if either: ( i ) one of H, K has 9(F2);or ( i i ) both H and K have non-trivial finite homomorphic images fi, E,with max { I fi 1, I R I} > 2 (Theorem 3.7).
Largeness in group theory
301
Section 4 turns to “small” groups. Firstly, in 54.1, several concepts of “smallness” are discussed. Thus, an infinite group G is said to have height m if whenever one has groups G I , . . .,G, with G, = G and Gi+l strictly larger than G, for i = 1,. . . , r - 1, then r m. Also, G is said to satisfy Max- < (Min- < ) if there is no infinite sequence of groups G l , G Z ,... with G larger than G, and Gi+lstrictly larger than Gi (Gi strictly larger than Gi+l)for i = 1,2,. . . . Related to these two latter properties are chain conditions on certain subgroups of G (see Theorem 4.1). In $54.2, 4.3, 4.4 consideration is given to a class of infinite groups which are traditionally thought of as being ‘‘small’’ - namely, infinite groups in which every non-trivial normal subgroup has finite index (the so-called just-infinite groups). Let A denote the class of minimal groups with respect to the quasi-order relation of larger than. It is easy to show that any infinite group in which every non-trivial subnormal subgroup has finite index lies in A. However, just-infinite groups which have non-trivial subnormal subgroups of infinite index need not be minimal. Nevertheless, if G is a just-infinite group satisfying the maximal condition on subnormal subgroups then G has finite height (see Theorem 4.2). For just-infinite groups not satisfying the maximal condition on subnormal subgroups, one finds that such a group is either minimal or does not satisfy Min- < (see Theorem 4.3). An example is given in 54.4 of a just-infinite group lying in A but not satisfying the maximal concition on subnormal subgroups. In 04.5 questions concerning extensions are raised. In particular, are the properties “having finite height”, Max- < , Min- =S extension closed? The main result (Theorem 4.4) is that the answer is negative for the first two properties (it remains unresolved for the third). In fact, it is shown that there is a split extension of one minimal group by another which does not satisfy Max- < . In contrast to this result it is also shown that i f MI,.. . ,M. are minimal groups then MI x * . * X M. has height less than or equal to n (see Theorem 4.5). Although poly-minimal groups d o not in genela1 have finite height, it can be shown that for certain classes 8 of minimal groups, poly-9/ groups do have finite height. In particular, poly-cyclic groups, and groups with a composition series, have finite height. These results will be obtained in a ganeral setting in a sequel to the present paper.* The work presented in this paper raises many questions, some of which are stated in the text below. *
“Groups of finite height”, J. Austral. Math. Soc. (Series A) 28 (1979) 87-99.
302
S.J. Aide
61.2. Notes to the reader The reader who wishes to get the main results and ideas of the paper, but is not interested in details should read as follows: 02.1, 02.2, 03.2, 43.3 (up to the end of the statement of Theorem 3.7), 04.1, 04.2 (up to Problem 6), 04.5 (up to Problem 7). The casual reader should note that = does not signify isomorphism. 61.3. Acknowledgements This paper has benefited from conversations with D. Goldrei, A.J. Wilkie, and most especially with I.M.S. Dey. Visits to New York University and the University of Illinois at Urbana in Spring 1976 were also very beneficial. I thank these universities for their hospitality, and the Overseas Research Committee of the Open University for financial support. Some of the work in this paper was carried out while I was attending the Conference on Word and Decision Problems in Group Theory and Algebra, Oxford, 1976. The work of the organizers and the financial support of the SRC is gratefully acknowledged. 61.4. Notation Throughout the paper the text book [14] by Magnus, Karrass and Solitar will be used for matters relating to combinatorial group theory. In particular, the techniques of the Reidemeister-Schreier method as described in [14, 02.31 will be used in several places, and in such computations the notation of [14, 02.31 will be employed. Additional notation and background material will be needed as follows. Let G be a group. If A is a subset of G then sgp A will denote the subgroup of G generated by A. If {H, : i E I } is a set of subgroups of G then sgp{Hi : i E I } will denote the subgroup of G generated by the elements of the Hi. Let H be a subgroup of G and let g E G. The normalizer of H in G will be denoted by N G ( H ) .The conjugate subgroup {g-lhg : h E H } of H by g will be denoted by H E .The intersection of all the conjugates of H in G will be called the core of H (in G ) and will be denoted by Cor(H). Clearly Cor(H)G G. The index of H in G will be denoted by J G : H ( If. J G : H l < w then, as is well-known (and easily established), H has only finitely many distinct conjugates under G and so J G:Cor(H)I
Largeness in group theory
303
If 9 is a group-theoretic property then i 9 will denote the negation of 9 The properties of satisfying the maximal (minimal) condition for subgroups, normal subgroups, subnormal subgroups will be denoted by Max, Max-n, Max-sn respectively (Min, Min-n, Min-sn respectively). The integers will be denoted by Z, and the rationals by Q. The cyclic group of order 2 will be denoted by Zz. 62. Large properties 62.1. Well-known examples of large properties For illustration, and also for use in later discussions, several wellknown large properties of groups will now be listed. For all of these examples it is an easy matter to verify that (LP 1) and (LP 3) hold. However, it is not always straightforward to verify (LP 2), and for several of the examples information will be given relevant to the verification of (LP 2). Many large properties arise as negations of well-known properties. It is not difficult to see that if 9 is a group-theoretic property which is (*)
closed under taking homomorphic images, normal subgroups of finite index, extensions by finite groups,
then -19 is a large property. (Conversely, if 9 is a large property then -19satisfies (*).) To see that (LP2) holds let G be a group with a subgroup H of finite index. If G has -I 9 then Cor(H) has -I 9 since G/Cor(H) is finite. Thus H has -19 since Cor(H)s!H. Conversely if H has -I9 then Cor(H) has -I 9 (by what has just been shown), and so G has -19 since Cor(H) is normal and of finite index in G. It follows immediately from the above that if 9 is a group-theoretic property such that: (**)
finite groups have 9 ; 9 is closed under taking subgroups, homomorphic images, extensions;
then 1 9 is a large property. Many interesting large properties arise this way. It is perhaps worth commenting that if 9 is any large property then no finite group can have 9.This is a trivial consequence of (LP 2) and (LP 3). Example 2.1. The property % ( K ) of having cardinality greater than or equal to K , where K is an infinite cardinal.
304
S.J. Pride
Example 2.2. The property % ( K ) of not being generated by less than elements, where K is an infinite cardinal.
K
This is the same property as % ( K ) except when K is No. To verify that (LP 2) holds, let G be a group with subgroup H of finite index. Obviously, if H can be generated by less than K elements then so can G, since a generating set for H together with the elements of a transversal of H in G constitutes a generating set for G. On the other hand, if G has a generating set of cardinality less than K , .then so does H [14, Theorem 2.71. Example 2.3. SQ-universality. A group G is said to be SQ-universal if every countable group is embeddable in a quotient of G. For a verification of (LP2) see [17] or [7, Lemma 91.
Example 2.4. The property P ( K )of having a free subgroup of rank where K is a non-zero cardinal.
K,
Notice that, since a free group of rank 2 has free subgroups of every countable rank 9(2), 9(3), . . . ,9(No) are all the same property. Example 2.5. Having a free subsemigroup of rank non-zero cardinal.
K,
where
K
is a
Example 2.6. Satisfying no non-trivial law (i.e. generating the variety 0 of all groups). Example 2.7. Non-amenability. A group G is said to be amenable if there is a finitely additive translation-invariant measure p on the set of all subsets of G such that p ( G ) = 1. The property of amenability satisfies the conditions (**) - see Theorems 1.2.4-1.2.6 of [6]. Example 2.8. Having exponential growth. The concept of a group having exponential growth has been defined in the literature for finitely generated groups (see [15]). In order to ensure that (LP 1) holds it is necessary to extend the definition to infinitely generated groups by local considerations. Thus, a group G
Largeness in group theory
305
r
will be said to have exponential growth if there is a finite subset of elements of G such that the following holds: if y r ( r ) ( r 30) denotes the number of distinct elements of G which can be expressed as words of length less than or equal to r in the elements of r, then there are constants a, p with a > 0, p > 1 such that y r ( r ) 3 a p ‘ for r = 0 , 1 , 2 , .. . (i.e. yr grows exponentially). Suppose G is a group with a subgroup H of finite index. Obviously if H has exponential growth then so does G. Conversely, suppose of G such that yr grows exponentially. Let there is a finite subset g I+ g be a right coset representative function for G modH, and let A be the subset of H consisting of all elements
r
--I
ua. ua ,
where U is an arbitrary representative and a E f‘. Then it follows from [14, pp. 89-90] that if g is an element of G expressible as a word of length less than or equal to r in the elements of r, then g = hg, where h is an element of H expressible as a word of length less than or equal to r in the elements of A. This implies that y r ( r ) S I G : H 1 y,, ( r ) , so that H has exponential growth. Example 2.9. Let 9 be a group-theoretic property which is closed under taking homomorphic images; then the property of not being locally 9-by-finite is a large property.
To verify that (LP 2) holds, suppose H is a subgroup of finite index in a group G. Obviously if G is locally 9-by-finite then so is H. Conversely, suppose H is locally 9-by-finite, and let r be a finite subset of G. Let A be as constructed in Example 2.8. Then sgpA has finite index in sgpr. Since sgpA is L?-by-finite, so it sgpr. Thus G is locally 9-by-finite. Example 2.10. The properties i Max-sn, i Min-sn.
i
Max,
iMin, i Max-n, 1Min-n,
The properties Max, Min satisfy the conditions (**). (See [ l l , Lemma 1.E.11 and [19, Lemma 1.48, Corollary] in this connection.) The properties Max-n, Min-n, Max-sn, Min-sn satisfy the conditions (*). (See [19, Lemma 1.48, Corollary] and [26] in this connection.) There are several group-theoretic properties which are not known to be large properties, although they “ought to be”. Two of these are mentioned here. In [25] Wiegold discusses the class 8!? of groups G with the property
S.J. Pride
306
that whenever G acts transitively on a set R, there is some element of G displacing every element of R. Problem 1. Is the property of not lying in
I a large property?*
In [2] Chiswell discusses the concept of length functions on groups. O n e can define a length function on any group by setting the length of every element equal t o 0, or by setting the length of the identity equal t o zero and the length of every other element equal t o 1. However, there are various meanings one can attach t o a “non-trivial” length function. For example, one can require the existence of at least one element g such that the length of g 2 is greater than the length of g . Problem 2. With some interpretation of “non-trivial”, is the property of having a “non-trivial” length function a large property?
82.2. Large properties generated by groups Let A b e an infinite group. Then one can associate with A a large property 9 ( A ) , called the large property generated by A, defined as follows. A group G has 9 ( A ) if and only if there is a chain of groups Go,G I , .. . , G, with Go= A, G. = G, and where for i = 1,. . . , n either
Gi is a subgroup of finite index in GiF1,
(2.1)
or Gi-l is a subgroup of finite index in Gi,
(2.2)
or
(2.3)
G, maps onto Gi-,.
It is clear that Z ( A ) actually is a large property. Theorem 2.1. Let A be an infinite group. (i) A group G has 9 ( A ) if and only if there are groups B, C with B of finite index in A, C of finite index in G , and C mapping onto B. (ii) It can be assumed in ( i ) that either B S A or C a G. finite
G-2F e\:if index
I B
Added in proof. The answer is “No” (P.M. Neumann).
Largeness in group theory
307
Proof. First consider (i). Obviously if the conditions are satisfied then G has 8 ( A ) . Conversely, if G has 2 ( A ) then there is a chain of groups Go,G I , .. . ,G, with Go= A, G. = G, and where for i = 1,. . . , n one of (2.1)-(2.3) holds. It will be shown by induction on n that the conditions stated in (i) are satisfied. The result is trivial if n = 0. Suppose n > O . Since G,-, has 2 ( A ) the inductive hypothesis guarantees that there are groups B,,C , with B, of finite index in A, C , of finite index in G,-, and C , mapping onto B,by a homomorphism 8 say. Case ( i ) . G, is of finite index in G,-,. Let C = C , r l G,. Then C has finite index in both C , and G.. Let B = 8 ( C ) . The fact that C has finite index in C , implies that B has finite index in B,.Thus B also has finite index in A. Case ( i i ) . G,-, is of finite index in G,. Simply take B = B I and C = C , . Case ( i i i ) . G, maps onto G,-,. Suppose JI : G, + G.-,is the homomorphism. Let C = JI-'(C,).Then C is of finite index in G.. Let B = B,.Then 84 maps C onto B. Now consider (ii). If C is not normal in G one can replace C by Cor(C) and replace B by the image of Cor(C) under the homomorphism C + B. On the other hand, if B is not normal in A one can replace B by Cor(B) and replace C by the inverse image of Cor(B) under the homomorphism C + B. This completes the proof of the theorem. Considerable'use will be made of Theorem 2.1 in the sequel. For the present, the following simple corollary is noted.
Corollary 2.1.1. Let G and A be infinite groups and suppose G has no proper subgroups of finite index. If G has 8 ( A ) then A has a (necessarily normal) subgroup G of finite index, where G is n homomorphic image of G. In particular A has a subgroup of finite index which has no proper subgroups of finite index. The first part of this result is an immediate consequence of Theorem 2.1 (i). The second part follows from the fact that no homomorphic image of G can have a proper subgroup of finite index.
308
S.J. Pride
The concept of the large property generated by a group is useful in that it gives a way of comparing two groups. Let G and H be infinite groups. It will be said that G is larger than H (written H G ) if G has Y ( H ) ; G and H will be said to be equally large (written G = H) if H < G and G < H; and it will be said that G is strictly larger than H (written H < G ) if H < G but H is not larger than G. It is easy to see that H < G if and only if 9 ( G ) implies 9 ( H ) . Obviously the relation of larger than is a quasi-order and the relation of equally large is the induced equivalence relation. Another trivial but important fact is that if 9 is any large property, and if H has 9,then every group larger than H also has 9.Thus two groups which are equally large have the same large properties. The referee has pointed out that an alternative way of viewing the relation < might be profitable for future work. Define two groups to be equivalent if one is a subgroup of finite index in the other, and let fi denote the equivalence class of G. Then G =S H if and only if some Ho in is a pre-image of some Go in &.
93. “Large” groups 93.1. Large properties of non-abelian free groups
P and F be free groups of ranks r, s respectively with 2~r<s. (i) If s is infinite then P < F. (ii) If s is finite then P = F.
Theorem 3.1. Let
Proof. First consider (i). Since there is a homomorphism of F onto F it follows that P =S F. To show that P < F it suffices to find a large property enjoyed by F but not by I? Clearly %(s) (Example 2.2) is such a property. To prove (ii), let F2 denote the free group of rank 2 freely generated by x, y . The normal closure in Fz of {x”, y } ( n 2 1) has finite index and is free of rank n + 1 by Schreier’s formula [14,Theorem 2.101.Thus Fz has subgroups of finite index isomorphic to P and F, so F2= P 1:F, as required. This proves the theorem. It is now easy to describe the large properties of non-abelian free groups.
Largeness in group theory
309
Corollary 3.1.1. Let F be a free group of rank r 3 2, and let 9 be a large property. (i) If r is infinite then F has 9 if and only if there is an r-generator group with 9. (ii) If r is finite then F has 9 if and only if there is a finitely generated group with 9. Proof. (i) is straightforward. For (ii), note that if there is a finitely generated group G with 9 then there is a free group P of rank greater than or equal to 2 mapping onto G and therefore having 9. Thus if r is finite then F has 9 since F = ?I Taking 9 to be Y ( A ) in the above result one obtains: Corollary 3.1.2. Let F be u free group of rank r 2 2. (i) If r is infinite then F is larger than euery r-generator group. (ii) Zf r is finite then F is larger than every finitely generated group. 93.2. Groups which are equally as large as F2 It was seen in 03.1 that F2, the free group of rank 2, is larger than every finitely generated group. Consequently, a finitely generated group is as large as it can possibly be if it is equally as large as F2. The aim of this section is to show that many groups traditionally thought of as being “very large” are actually equally as large as F2. It is clear that any group which has a non-abelian free subgroup of finite index has Y ( F 2 ) .The structure of such groups has been determined [3], [lo], [22]. For future reference it should be noted that the class includes all free products A * B with A, B finite and [ A1 3 2 , 1 B 13 3. It also includes all HNN groups, where the base H is finite and the associated subgroups K-l, K ,are properly contained in H. 1. Fuchsian groups
Theorem 3.2. All finitely generated Fuchsian groups are equally large as F2. Proof. The following argument is adapted from [17]. A finitely generated Fuchsian group G has a presentation with generators
a l , . . .,a,,, b,,. . ., b,, c I , .. .,c,,, e l , . . .,e,,
S.J. Pride
310
and defining relations i = 1, ..., r,
e:= 1 [ a , ,bl]
*
-
[a,, b P ] c *l . cqel . * e, = 1,
where ni 3 2 for i = 1,.. . , r, and where
By a theorem of Nielsen, Bundgaard and Fox [8, p. 681, G has a subgroup H of finite index which can be presented with generators UI,.*.
9
urn, V I , . . ., Urn, X I , .
9
XI,
and defining relator [UI,v , ] *
* *
[u,,
V,]XI
*
XI =
1.
Here (m,I ) # (O,O), (0, I), (0,2), (1,O). If 1 > O then H is free of rank greater than 1, while if 1 = 0 then H maps onto F2 (put u , = * = urn= v3 = . . . = v, = 1). Thus in either case H, and therefore G, has 2’(F2). 2 . One-relator groups There are two well-known heuristic principles in the theory of onerelator groups: (i) one-relator groups which do not satisfy a non-trivial law behave “like” free groups; (ii) one-relator groups with torsion which do not satisfy a non-trivial law behave “more like” free groups (G. Baumslag). The following two theorems further illustrate, and in fact give a certain substance, to these two principles.
Theorem 3.3. Every one-relator group with torsion which does not satisfy a non-trivial law has 2‘(F2). Theorem 3.4. There is a two -generator torsion -free one -relator group B which does not satisfy a non-trivial law but which is such that B < F2. It follows immediately from Theorem 3.3 that if G is a one-relator group with torsion which does not satisfy a non-trivial law then G has every large property enjoyed by F2. In particular, G is SQ-universal. The SQ-universality of one-relator groups has been investigated by Sacerdote and Schupp in (201. They obtain results for torsion-free groups as well as groups with torsion, but their methods (small cancellation theory over HNN groups) are much more complicated than those needed to prove Theorem 3.3.
Largeness in group rheory
311
As well as the two heuristic principles mentioned above, there is a third principle, namely that one-relator groups on at least three generators behave "more like" free groups. One is led to ask, Or to state the therefore, whether such groups actually have 9(F2). question in an alternative form: Problem 3. Does every one-relator group on at least three generators have a subgroup of finite index mapping onto Fz? In this connection it is worth remarking that Stallings [23] has shown that for any positive integer n there is an n-generator one-relator group which does not map onto F2. A question related to the above has been raised by A.M. Macbeath (private communication): is there an integer d > 0 such that whenever a group G can be presented with r + d generators and r relations, then G has 2(F2)?*(See [17,p. 5 ) for a similar question concerning SQ-universality.) Proof of Theorem 3.3. Let G = (t, a, b, . . . ; R " ) ( n > 1). Then G does not satisfy a non-trivial law provided G has at least two generators [9]. Now by [20, Lemma 4.11 it can be assumed that u,(R) = 0. Let a, = t-'at', b, = t-'bt', . . . ( i E Z). Then R can be rewritten as a word P in the a,, b,, . . . . There is an integer m 3 0 such that if a, occurs in P then 1 i 1 C m. Now it is easy to'see that G has presentation with generators
t , a - , , . . . , a,,,, b, ( i E Z), . . ., and defining relations
P", t-'a,t
=
a,,,
( i = - m , . . .,m - l),
t-'b,t = b,,, ( i E Z), . . . . The result will therefore follow from the Lemma 3.1. Let H be a group which has a presentation with generators t, a-,,
. . . , a , , bi ( i E Z), . . .
and defining relations This question has now been answered affirmatively (with d
= 2).
See "Groups with
two more generators than relators" by Benjamin Baumslag and Stephen J. Pride, J.
London Math. Soc. 17(2) (1978) 425426.
S.J. Pride
312
V, W,. . ., t-'a,t = a,,'
t-'b,r
=
( i = - m,. . . , m - l),
b,,, ( i E Z ) , . . . ,
where m 3 0 , and where V, W, . . . are t-free words. Suppose that there is an integer n > 1 such that for i m each of aa, ( V ) ,aa, ( W ) ,. . . is divisible by n. Then H has 2 ( F 2 ) .
I I
To prove this lemma consider the factor group K of H obtained by imposing the additional relations: a:= 1
(lilsm),
b, = 1 ( i E Z ) ,..., [a,,a,]=1
(lil,ljl~m).
Then V, W, ... are superfluous as defining relators in K , and so K is an H N N group with base the direct product of the finite cyclic groups sgp{a-,}, . . . ,sgp{a,}, associated subgroups sgp{a-,, . . . , a m - J , sgp{a-,+l,. . . ,a,}, and stable letter t. Thus K has a non-abelian free subgroup of finite index (see above) and therefore has 2 ( F z ) . Consequently, H has 2 ( F 2 ) , as required. This proves the lemma, and Theorem 3.3 follows. Before going on to prove Theorem 3.4 it is worth remarking that Lemma 3.1 can be used to show many groups with more than one relator have 2 ( F z ) . For example:
Theorem 3.5.* Let G be a group with presentation of the form
(t, a, b,. . . ;R",S", .. .), where n > 1. Suppose that a, ( R )= a,( S ) = * . = 0 , and furthermore there is an integer K 2 0 such that the number of t-symbols in each of R", S", . . . does not exceed K . Then G has 2 ( F 2 ) . The requirement in this theorem that there be a bound on the number of t-symbols in the relators R", S",. . . cannot be dispensed with in general. For consider the group Go with generators a, t and * 1 thank A.H.M. Hoare for pointing out a mistake in my original formulation%f this theorem.
Largeness in group theory
313
defining relators [ WI, Wz]", where WI, W2 range over all words in a, t. Then all the conditions of Theorem 3.5 are satisfied except that there is no bound on the number of r-symbols in the relators. Moreover, it is clear that Go does not have the large property 9 ( 2 ) , and therefore certainly cannot have Y ( F z ) . Lemma 3.1 can also be used to show that many torsion-free onerelator groups have Y ( F z ) .For example, it is easily shown that if G = ( a , t ; a " ~ t ' ~ . . . a "where ~ t ' ~ )E T ,= O and where the a, have a common factor greater than 1, then G has Y ( F z ) . (On the other hand, if the a, do not have a common factor greater than 1 then G need not have Y ( F z ) ,as the group B in Theorem 3.4 shows.) Proof of Theorem 3.4. Let B = (a, t ; t - ' a 2 t = a'). It will be shown that B < Fz. By Theorem 2.1 it must be shown that n o normal subgroup of finite index in B can map onto a subgroup of finite index in Fz. Suppose N a B, 1 B : N 1 finite. Since t and a have infinite order in B, some positive power of t and some positive power of a must belong to N , otherwise 1 B : N 1 would be infinite. Suppose I " , u p ( n , p >0) belong to N , and assume p is the smallest integer i such that a ' E N. Let M be the normal closure of {t",a P } ,so that M N. Let B = B / M . Then
B = ( a , t ; t - l a z t= a 3 , f n , . p ) . Now the minimality of p guarantees that ( 2 , p ) = (3,p) = 1. For example, if p = 24 then a 4 = a-Zqt-lazqt E N , and 0 < q < p. It now follows that sgp{a} is normal in B. Moreover, the minimality of p guarantees that a has order exactly p in B. Thus every element of can be expressed uniquely in the form
(3.1)
t 'a'
B
i < n, 0 c j < p ) .
(0
In particular, M has finite index in B. Thus in order to show N does not map onto a subgroup of finite index in Fz it suffices to show that M does not map onto a subgroup of finite index in Fz.Now a subgroup of finite index in Fz is free of rank greater than 1 and therefore maps onto F2. Consequently, it is enough to show M does not map onto Fz. The elements in (3.1) constitute a Schreier system for M in B, and so M is generated by the elements --I
(3.2)
t'a'a. t'a'a ,
(3.3)
t'a't. t'a't
--1
,
314
S.J. Ride
where 0 S i < n, 0 S j < p. An element (3.2) is freely equal to 1 unless --1
j = p - 1. Let'a, denote the element t'aP-la. t'aP-la ( = t'aPt-'), and let y denote the element t". Then it is easily shown that the following relations hold:
a:, . . . ,
(3.4)
a;
(3.5)
y - l a r y = a:.
=
=
a;-l,
Suppose M maps onto a free group F. Then the relation (3.5) holds in F and so, in F, y and a . commute. Thus, in F, a. has finite order and must therefore define the identity. Consequently, by (3.4), all the a, define the identity in E It follows that the factor group M of M obtained by putting all the a, equal to 1 still maps onto F. Let s(i, j ) denote the generator (3.3) of M. Now the defining relators of M are obtained by rewriting the relators 'a"ta y
t a -2a
-at
-7
(0 S T < n, 0 S a < p ) of B in terms of the s(i, j ) and the a,. Thus the defining relations of &f on the generators s(i. j ) are S(T, ~ ) s - ' ( T a ,
+ 2) = 1,
where 0 d 7 < n, 0 =scy < p, and where LY + 2 is reduced modulo n to lie between 0 and n - 1. Since (2,p) = 1 it follows easily that for T = 0, ..., n - 1 S (7,o) = S ( T , 1 ) = * * * = S (7,p - 1).
But t't.t't-I is freely equal to 1 unless T = n - 1. Consequently, M is infinite cyclic generated by s ( n - 1 , O ) ( = y ) , and so F is of rank less than 2. This compietes the proof of Theorem 3.4. 3. Arithmetically defined linear groups Let d be a positive square-free integer, and let I d be the ring of integers in the quadratic field Q(d-d). Fine and Tretkoff [ 5 ] have investigated the SQ-universality of the groups PSL(2, I d ) , SL(2, I d ) , GL(2, I d ) in case I d is a Euclidean domain - that is, when d = 1,2,3,7,11. Their work carries over to show that these groups have 2(Fz). Since SL(2, I d ) maps onto PSL(2, I d ) , and since, for the values of d under consideration, SL(2, I d ) has finite index in GL(2, I d ) , it suffices to show PSL(2, I d ) has ~ ( F Z ) . For d = 1,2,3,11 the arguments of Lemmas 1 , 2, 4 of [5] can easily
Largeness in group theory
315
be modified. Consider the case d = 7. It is shown in Lemma 3 of [5] that PSL(2,17) has a subgroup G of finite index with presentation
(x,r,s, w ; x 3 = r 3 = s 3 = 1, w-’[s-’,r]w = [s-’,r], [r-l,x-’] = [s-I, r ] , w-’sw Setting s = x
=
= x-’).
1 in G gives the group
(r, w ; r’
=
G
with presentation
1).
Since has 6p(F2)it follows that PSL(2,17) has 2?(F2). Fine and Tretkoff also show that for all d, t h e subgroup E(2, Id) of SL(2, Id) generated by the transvections is SQ-universal [5, Theorem C]. Their argument is easily modified to show that E(2, Id) has 6p(F2). R. Zimmert [28] has shown that for many values of d, SL(2,Id) has Fz as a homomorphic image. For such values of d it is obvious that sL(2, I d ) has 6p(F2). It follows from [l],[16] that for n 3 3, SL(n, Id) does not have 6p(Fz).
4. Small cancellation groups Let F be a free group and let r be a symmetrized subset of F which satisfies t h e small cancellation conditions C ( p ) , T ( q ) for ( p , q ) one of (6,3), (4,4), (3,6). Let G be t h e quotient of F by t h e normal closure of r. Then G is called a small cancellation group. The reader unfamiliar with small cancellation theory should consult [21]. Small cancellation groups tend to be “quite large”. For example, it has been shown by Collins [4] that if r satisfies C(4) and T(4) then, with a few trivial exceptions, G has the large property 9(2). Recently it has been shown by M. al-Janabi in his PhD thesis at London University that if r is finite and satifies C(6), then again, apart from a few trivial exceptions, G has %(2). Al-Janabi has also established that if r is finite and satisfies C(7) then G is actually SQ-universal whenever it has 9 (2 ). One is led to ask the following question. Problem 4. D o “most” finitely presented small cancellation groups have 2?(Fz)? I suspect that the answer is negative (see the comments in t h e second paragraph of 03.3 in this regard). It should be remarked that many small cancellation groups can be shown to have 6p(F2) using Lemma 3.1. The following lemma is also useful.
316
S.J. Pride
Lemma 3.2. Let G = ( a ,b, x, y , . . . ;R, S, . . . ). Suppose there are integers p,q greater than 1 and not both 2 such that all the a-exponents in R, s,. . . are divisible by p and all the numbers c b ( R ) ,u b ( S ) , . . . are divisible by q. Then G has 2?(Fz). Proof. Let d be the image of G obtained by putting a p = b q = 1, and = (a, b ; a p ,b q ) , so that d has setting x,y,. . . equal to 1. Then 2?(F2).Consequently G has 6p(F2). Finitely generated infinitely related small cancellation groups need not have 2?(F2),even though they can be "quite large". Roughly speaking, this is because there is considerable scope to "impose relations which destroy largeness".
Theorem 3.6. Let u be a number greater than or equal to 5, and let F be the free group on a, b. There is an infinite symmetrized subset r of F satisfying C'(l/u) such that the factor group G of F by the normal closure N of r is such that G < F2. The group G is SQ-universal. Proof. The construction of G is similar to the construction given in [18,§2]. Let n > 2 v and let a I ,. . . , a, be numbers satisfying 1 a,1 > 1, C a,= 1, I a , J # I a , Ifor i # j . Let a = m a x { ) a , ( : i= 1, ..., n } , and suppose 2a/XIa,I < l/v. Let r be the symmetrized closure of the following elements of F: ab"1ab"2.. . ab"., (3.6)
+
a"lbPa"2bP* . . a"nbP,p= a + 3, a 4,.. . . (3.7) Then it is easily shown that pieces relative to r are subwords of (b"lab"~+1)*', (a"lbPao~+l)*', (bPa"nbP)"for i = 1,. . .,n (where subscripts are calculated modulo n ) , and it thus follows that r satisfies C'(l/v). Now G has no non-trivial finite homomorphic image. For if G maps onto with C? finite, then in d,b P = 1 for some p 2 (Y + 3, and so it follows from the relators (3,6), (3,7) (which of course hold in that a = b = 1. Thus is trivial. Now since F2 does not have a subgroup of finite index which has no proper subgroups of finite index, it follows from Corollary 2.1.1 that G
c
c
c),
Largeness in group theory
317
. . a".b"", ka"ib"'Za"2b"'2.. . a"mb"'2.
ha"ib""a"2b"".
Then it is easily shown that s satisfies C'(l/v), so the quotient of L by the normal closure K of s is a small cancellation product of (a), ( b ) , C. In particular, C is embedded in L / K . It is easy to see that L/K is a quotient of G. This completes the proof. 93.3. Free products Let G = H * K with ( H ( , I K ( # l If . J H I = I K ( = 2then G is a finite extension of the infinite cyclic group, and is therefore a minimal group. On the other hand, if max{l H 1, I K I} > 2 then G has several wellknown large properties, and is usually thought of as being "quite large". For example, it has been known for a long time that G has 9(2). More recently, it has been shown that G is SQ-universal ([21, Theorem 51 [7, Corollary 9.21). Now verifying that G has large properties is not always easy - see for example, t h e two proofs of SQ-universality. This leads one to think that although G tends to have large properties, perhaps it is not always the case that G is "naturally large" in t h e sense of having 2'(F2). This turns out to be so.
Theorem 3.7. Let G = H * K with IH I,/ K 1 # 1. Then G has 2'(F2) if and only if one of the following holds: (3.8)
either H or K has 2'(F2);
(3.9)
H, K have non-trivial finite homomorphic images H, K respectively, with max { 1 R I,( l? I } > 2.
Obviously (3.8) is a sufficient condition for G to have 2'(F2).T o see that (3.9) is a sufficient condition, note that G maps onto R * K, and A * l? has 2'(Fz), since it has a non-abelian free subgroup of finite index. In order to establish the necessity of the conditions (3.8), (3.9), it is just as easy to work in a more general situation. Theorem 3.8. Let G be a group generated by two subgroups H,K. Let A be an infinite group with the following properties:
(3.10)
if B s A, B # 1, then B has proper subgroups of finite
index ;
318
(3.11)
S.J. Pride
if B S A with ( A : B ( finite then B is not a homomorphic image of a subgroup of Z2* Z2.
Then G has Y ( A ) only if either H or K has Y ( A ) ,or H, K have non -trivial finite homomorphic images H, K respectively with max{I I? I , [ K I } > 2. The necessity of the conditions (3.8), (3.9) obviously follows from Theorem 3.8, taking A = F2. The rest of this section is devoted to proving Theorem 3.8. The proof proceeds by a number of lemmas. All groups referred to in these lemmas will be as in the statement of Theorem 3.8.
Lemma 3.3. Suppose H has no proper subgroups of finite index. Let C be a subgroup of G with ( G : C 1 finite. Then C = sgp{N, K , } , where N has no proper subgroups of finite index, and K I =z K with ( K : K , ) finite. Proof. If g E G then H 8 n C is of finite index in H 8 and is therefore equal to H8.Thus H 8 S C, and so C contains the normal subgroup N of G generated by H. Now N has no proper subgroups of finite index. For suppose L S N, ( N : L 1 finite. For g E G, L r l H 8 is of finite index in Hg, and so is equal to H 8 . Thus H 8 S L for all g E G, and so N S L. Consequently N A L. Now let K I = C f l K. Then K I is of finite index in K , and it is easily shown that C = sgp{N, KI}.
Lemma 3.4. If H has no proper subgroups of finite index and K does not have Y ( A ) then G does not have Y ( A ) . Proof. Assume, by way of contradiction, that G has Y ( A ) .By Theorem 2.1, G has a subgroup C of finite index, with C mapping onto a subgroup B of finite index in A. Now by Lemma 3.3, C = sgp{N,K,} with N having no proper subgroups of finite index, and K I < K , ( K : K I1 finite. The homomorphism of C onto B carries N, K I to subgroups N, K,respectively, and B = sgp{N, K,}.Since N has no proper subgroups of finite index, it follows from (3.10) that N = 1. But then B = El, which implies K,,and therefore K , has Y ( A )- a contradiction.
Lemma 3.5. Suppose H, K have subgroups NH, NK respectively each of index 2, and h a t H, K have no proper subgroups of any other finite index. Let N be rhe normal subgroup of G generated by NH, NK.Then N
Largeness in group theory
319
has no proper subgroups of finite index. A n y subgroup C of finite index in G contains N , and C I N is a homomorphic image of a subgroup of
ZZ* z,.
Proof. First note that NH, NK are the unique proper subgroups of finite index in H, K respectively. To see that N has no proper subgroups of finite index suppose L S N with I N : L 1 finite. Then for g E G, L f l N L is of finite index in N f , and is therefore equal to N f , . Thus N S C L. Similarly N g S L. It now follows that N C L, so N = L. Now suppose C is a subgroup of G of finite index. Then for g E G, Cr l N f , is of finite index in N S , and so N f , S C. Similarly N k C C. Thus N C C. Now G I N is a homomorphic image of Z2*ZZr and so C I N is a homomorphic image of a subgroup of Zz*Z2. Lemma 3.6. Suppose H, K have subgroups NH, NK respectively each of index 2, but have no proper subgroups of any other finite index. Then G does not have 2 ( A ) . Proof. Assume, by way of contradiction, that G has 9 ( A ) .By Theorem 2.1, G has a subgroup C of finite index, with C mapping onto a subgroup B of finite index in A. Let N be the normal subgroup of G generated by NH and N K . Then N s C by Lemma 3.5. The homomorphism of C onto B carries N to a normal subgroup N,and there is an induced homomorphism of CIN onto BIN. Now since N has no proper subgroup of finite index (by Lemma 3.5), neither does N. It therefore follows from (3.10) that N = 1. Thus B is a homomorphic image of a subgroup of Z2* Z2(again using Lemma 3 . 9 , and this contradicts (3.11). This proves the lemma. It is now an easy matter to prove Theorem 3.8. Suppose G has 2?(A).If, say H, does not have a non-trivial finite homomorphic image then it follows from Lemma 3.4 that K has 2 ( A ) . If both H and K have non-trivial finite homomorphic images then it follows from Lemma 3.6 that not both of la(, can be 2.
a,
84. “srna~~” groups
84.1. Concepts of “smallness” Throughout this subsection G will denote an infinite group.
320
S.J. Pride
The height of G is defined to be sup{r : r is a non-negative integer, there are groups
G = G I , .. . ,G, with G, < . * . < GI}. As will be seen, groups of finite height tend to be "small". Notice that G is a minimal group (that is, lies in A ) if and only if it has height one. Notice also that the property of having height greater than or equal to m (where m may be a)) is a large property. It is consistent with the above definition t o define the height of a finite group to be zero. Example 4.1. Let A ( n ) be a free abelian group of rank n 2 0 . Then A ( n ) has height n. This is trivial if n = 0. Assume n > 0. Using Theorem 2.1 together with the general theory of abelian groups, it is easily established that if H < A ( n ) then H = A ( m ) for some m with 0 < m C n. Clearly, A (1) < A (2) < * * < A (n). It will be shown in fact that if O < s < r S n then A ( s ) < A ( r ) . If, on the contrary, A ( r )< A(s) then by Theorem 2.1 and general results on abelian groups, A ( r ) would have a subgroup D of finite index, with D free abelian of rank less than r. As is well-known, this is impossible. A proof can be given as follows. Let A ( r ) = CIx * * * x C,, where each C, is infinite cyclic generated by x , say. Let B, = sgp{C, : j < i } , and let
E, = {p : p > O , there exists u E B, such that uxPE D}. Since C, r l D has finite index in C,, E, is non-empty. Let pa = min E,. Then there exists u, E B, such that u,xP'E D. Now the u,x: obviously freely generate a subgroup. It will be shown that they generate D, so D has rank r, contrary t o assumption. Suppose h E D. Then h = wx:' where w E B,. Now p, divides m,. For if d = @, m,) then d = Lye, + pm, for certain integers a, p, and so ( w a x ! ? " " ) ( u ~ x ~D p ~) € that is, w @ u ; x I ED. Thus d = p , by minimality. It now follows that wx?= w'(u,x:)'r for some w ' E B, and some integer 1,. Now w ' = ux::;' with u E and a similar argument shows m,-, that ux,-, - ~ ' ( U , - ~ X : : ; ) ' ~ - I where v ' E B,-,. Continuing in this way one can express h as a product of the u,xP'. Example 4.2. If A(No) is a free abelian group of countably infinite rank then A (No) maps onto A ( n ) for all n > 0, so A (No) has infinite height.
Largeness in group theory
32 1
The group G is said to satisfy Max- < (Min- < ) if there is no infinite sequence of groups Go,G , ,G 2 , . . with Gi < G and Gi < Gi+, < G i )for i = 0, 1,2,. . . . Related to these two properties are chain (Gi+] conditions on certain subgroups of G, defined as follows. A subgroup A of G is said to be almost normal if it is normal in some subgroup of finite index in G. Thus A is almost normal if and J If A is almost normal then G ( A )will denote only if G : N G ( A )<m. . maximal and minimal conditions o n almost the group N G ( A ) / AThe normal subgroups will be denoted by Max-an and Min-an respectively. Min-an are all large Note that -I Max- < , -I Min- 6 , -I Max-an, i properties.
I
Theorem 4.1. If G satisfies Max-an then G satisfies Min- < . Proof. Suppose * * * G,+l< G, < . . . < G2< G I= G, and that almost normal subgroups N , < N 2< * < N , have been found with G ( N i )= Gi for i = 1,.. .,r. Since G,+]< G ( N , )there is a subgroup HIN, of finite index in G ( N , )mapping onto a subgroup A of finite index in G,+]. Thus there is a normal subgroup N,,, of H with N, < N,+land HIN,,, isomorphic to A . Thus G(N,+,)= G , + ] . It is worth pointing out that if A , B are almost normal subgroups of G with B properly contained in A and 1 G :A I = m, then G ( A )< G ( B ) . One cannot conclude in general that G ( A )< G ( B ) ,for otherwise the converse of Theorem 4.1 would be true. However, as the following example shows this is not the case.
- -
Example 4.3. Let S be a non-trivial finite non-abelian simple group and let I be a countably infinite set. For i E I let Si be a copy of S, and let C be the restricted direct product of the Si. It will be shown that if L < C then L has a subgroup of finite index isomorphic to C ; consequently L = C, and so C is a minimal group. However, C satisfies neither Max-an nor Min-an (in fact, it does not satisfy either Max-n or Min-n). Let N be a normal subgroup of C and let z E N. Suppose the ith coordinate s of z is non-trivial, and let x E S,. Then [z, x ] E N and [ z ,x ] = [s, XI.Since [s,x ] # 1 for at least one x, it follows that S, fl N # 1. Thus S, f l N = S,, or in other words S, s N. It is now easy to see that if J is the subset of I consisting of all those i for which some non-trivial element of S, occurs as the ith coordinate of an element of N , then N = sgp{S, : i E J } .
322
S.J. Ride
Now suppose L 6 C. By Theorem 2.1, C has subgroups M, H with M 9 H 9 C, 1 C : H 1 finite and H / M isomorphic to a subgroup of finite index in L. Using the result of the previous paragraph, it is easy to show that both H and H I M are isomorphic to C. 64.2. Just-infinite groups As mentioned in 91.1, a group is said to be just-infinite if it is infinite but has every proper quotient finite. Just-infinite groups have been investigated by several authors [12, 13, 24, 271. One has the feeling that such groups should be “quite small”, and the aim of the next three subsections is to examine these groups in terms of the notions introduced in §4.1. Following Wilson [27], let g2denote the class of groups in which every non-trivial subnormal subgroup is of finite index. It is easy to see that an infinite g2-group G is a minimal group. For suppose H < G. By Theorem 2.1 there are groups B, C with B of finite index in H,C normal and of finite index in G, and C mapping onto B by a homomorphism 8 say. Let K be the kernel of 8. Then K is subnormal in G and therefore must be trivial since 1 G : K I = 03. Consequently, 8 is 1 : 1, and so H has a subgroup of finite index mapping onto a subgroup of finite index in G. Thus G < H . A just-infinite group G which lies outside B2need not be minimal. Nevertheless if G satisfies the maximal condition on subnormal subgroups then G still has finite height. To be more precise: Theorem 4.2. Let G be a just-infinite group which satisfies Max-sn. Then G has a subgroup of finite index which is a direct product MI x * * x M,,, where the Mi are pairwise isomorphic subnormal subgroups of Q. The M, are g2-groups and are therefore minimal. If H < G then H = M I x . * . x M, for some 1 < r < n. Moreover
M I< M , X M 2< *
*-a
< MIX
* *
. X M,,.
Thus G has height n. For just-infinite groups not satisfying Max-sn the following theorem applies. Theorem 4.3. Let G be a just-infinite group not satisfying Max-sn. Then either G lies in .dl or G does not satisfy Min- < . An example will be given in 94.4 of a just-infinite group not satisfying Max-sn which is minimal. I do not have an example of a
Largeness in group theory
323
just-infinite group not satisfying Max-sn which also does not satisfy Min- < . Probably an infinite wreath product WrZ-S-,,, where t h e S-,, are non-isomorphic finite perfect simple groups, would provide such an example, but I have been unable to verify this. The proofs of Theorems 4.2, 4.3 will rely heavily on results on the structure of just-infinite groups obtained by Wilson in [27]. In particular, frequent use without explicit mention will be made of Wilson’s result [27, Proposition 31, that a just-infinite group has no non-trivial finite subnormal subgroups. Before discussing the proofs of Theorems 4.2, 4.3, some consequences will be noted. Corollary. Let G be a just-infinite group. Then G lies in A if and only if either G lies in g2or G does not satisfy Max-sn. Proof. If G satisfies Max-sn and lies in A then it follows from Theorem 4.2 that G has a $?h2-subgroupH of finite index. This implies that G itself is a !32-group. For let K be a non-trivial subnormal subgroup of G. Then K f l H is subnormal in H. Now since 1 K : K f l H 1 < w, K n H cannot be trivial, else K would be finite. Thus ( H : K n H I < w , and so ( G : K ( < w as , required. If G does not satisfy Max-sn then Theorem 4.3 applies. Corollary. If ?!/ is a class of groups which is closed under taking infinite homomorphic images, and i f each infinite group in ?!/ has a just-infinite quotient, then every group in ?!/ n A is equally as large as some justinfinite ?!/-groupwhich either lies in g2or does not satisfy Max-sn.
For if K lies in ?!/ f l A then K has a just-infinite quotient G with K = G, and t h e result then follows from t h e previous corollary. The last corollary applies, for example, if ?!/ is taken to be t h e class of groups satisfying Max-n. It also applies if 9 is taken to be t h e class of finitely generated groups [19, Lemma 6.171. In this connection the following question naturally arises. Problem 5. Do there exist finitely generated just-infinite groups not satisfying Max-sn?
The last corollary also suggests t h e following, admittedly rather vague, question.
324
S.J. Pride
Problem 6. What finiteness conditions on an infinite group G ensure that G has a just-infinite quotient? It is worth pointing o u t that not every minimal group is equally as large as some just-infinite group. For instance, let C be the group in Example 4.3. Then any group L = C has a subgroup of finite index isomorphic to C. Thus L cannot satisfy Max-n (since subgroups of finite index in groups with Max-n also have Max-n [26]), and so L certainly cannot be just-infinite. Some aspects of t h e proofs of Theorems 4.2, 4.3 will now be discussed. Other parts of t h e proof of Theorem 4.2 will be given in 04.3, and the majority of the proof of Theorem 4.3 will be given in 04.4. As Wilson remarks (27, p. 3741 the structure of a just-infinite group G depends very strongly on its Baer radical. (Recall that the Baer radical of a group is the subgroup generated by the cyclic subnormal subgroups.) If G has non-trivial Baer radical then [27, Theorem 21 G is a finite extension of a free abelian group A of finite rank. (Thus G satisfies Max, and therefore certainly Max-sn.) In this case one takes the M, in Theorem 4.2 to be the cyclic factors in a decomposition of A. The argument given above in Example 4.1 shows that all the required conditions are satisfied. For groups satisfying Max-sn which have trivial Baer radical t h e argument is not quite so straightforward, and some preliminary results are needed. These results also apply to groups not satisfying Max-sn, and will be required in the proof of Theorem 4.3. So for t h e rest of this section GI,will denote an arbitrary just-infinite group with trivial Baer radical. Now it is shown in [27, Theorem 3B] that every subnormal subgroup K of GI, has a near-complement C. (Recall that C is a nearcomplement to K in Go if C is subnormal, sgp{K, C}= K x C, I Go:K x C 1 < 00.) As the following lemma shows, this implies that in discussing those groups H with H < Go it suffices to concentrate on subnormal subgroups of Go.
Lemma 4.2. (i) lf H < G,, then H is equally as large as some subnormal subgroup of GI,. (ii) lf L, M are non-trivial subnormal subgroups of G,, with L S M then L < M. Proof. (i) If H < GI, then, by Theorem 2.1, there is a normal subgroup A of finite index in GI,and a homomorphism 8 of A onto a subgroup
Largeness in group theory
325
B of finite index in H. Let K be the kernel of 8. Then K is subnormal in Go and so K has a near-complement C in Go. Since ( G o : K x C ( < m ,/ A : K x ( C f l A ) ( < w , and so A / K has a subgroup of finite index isomorphic to C fl A. Thus H = C f l A. (ii) Let D be a near-complement for L in Go. Then M n ( L x D ) = L x ( M n D ) , a n d I M : L x ( M n D ) I < m . Since L x ( M f l D ) m a p s onto L, it follows that L < M. Now in [27, $4.21 Wilson defines a congruence on t h e lattice of subnormal subgroups of Go as follows: for K , L subnormal in Go, K L if and only if K n L has finite index in both K and L. The quotient of the lattice of subnormal subgroups of Go by is called the structure lattice of Go. Its elements are congruence classes [ K ] of subnormal subgroups of Go.
-
-
-
Lemma 4.3. Let K , L be non-trivial subnormal subgroups of Go. If [ K ]= [ L ] then K = L.
-
This is a trivial consequence of the definition of . It is shown in [27, $4.21 that t h e structure lattice is Boolean. If Go satisfies Max-sn then t h e structure lattice is finite [27, Theorem 41. If Go does not satisfy Max-sn then the structure lattice is infinite, isomorphic to the lattice of all closed-and-open subsets of Cantor’s ternary set [27, Theorem 61. Now to return to t h e proof of Theoreni 4.2. Suppose G is a justinfinite group satisfying Max-sn, and assume G has trivial Baer radical. Since the structure lattice of G is finite it must be isomorphic to the lattice of all subsets of a set of n elements for some positive integer n. By investigating the significance of the integer n, Wilson obtains t h e following description of G [27, Proposition 71. The group G has a subnormal %-subgroup M e2G such that the normal closure of M in G is t h e direct product MI x . . . x M,, of the distinct conjugates of M under G. If K is subnormal in G then K M,, x . . . X M,, for some subset { i l , .. ., i,} of { l , . .. , n } (and so t h e mapping { i , , .. .,i,}+ [M,, x . . . x M,,] is an isomorphism from t h e lattice of subsets of { I , . . . , n } onto the structure lattice of G). Now using the above result together with Lemma 4.3, it is easily seen that if K is a non-trivial subnormal subgroup of Go then K = M , x . . . x M, for some 1 S r < n. Taking account of Lemma 4.2 (i), it thus follows that the proof of Theorem 4.2 will be completed once the following lemma has been established.
-
Lemma 4.4. If 1 < p < q < n then M , x . . . x M, < M I x
* *
x Mq.
326
S.J. Ride
This lemma will be proved in $4.3. It is interesting to note that the results described in the next-to-last paragraph can be summarized as follows: if K, L are non-trivial subnormal subgroups of G, then K = L i f and only if [ K ] , [ L ] have the same height in the structure lattice of G. Thus the < relation is intimately connected with the order relation in t h e structure lattice. 94.3. Proof of Lemma 4.4 Let H = MIX * x M,,L = MI x
* * X M,, and suppose by way of contradiction that H = L. Then by Theorem 2.1 there would be a subgroup A of finite index in H and a homomorphism 8 of A onto a subgroup B of finite index in L. Let A, = A fl M , ( i = 1 , . . . , p ) . Then A, x x A, S A. Moreover, since for each i, I M, : A, I < m, A l x . . . x A, is of finite index in H . Let B, ( i = 1 , . . . . p ) be the image of A, under 8. Then sgp{B,, . . . ,B,} is of finite index in B. Note that for i # j , each element of B, commutes with each element of B, (and so in particular B, s sgp{BI,. . . , B,} for i = 1 , . . . , p ) . Now by [24, Theorem 21, A, ( i = l , . . . , p ) lies in B2.Consequently either B, is finite or the restriction of 8 to A, is an isomorphism onto B,. By relabelling, if necessary, it can be supposed that B, is infinite for i S r, and B, is finite for i > r. Then sgp{B,, . . . ,B,} has finite index in sgp(B,,. . . , B p } ,and so has finite index in L. Let C be the core of sgp{BI,. . . ,B,} in L. Then I L : CI <m. Let C, = C n B, ( i = 1 , . . . , r ) . Then IB, : C, I < m so that sgp{C,, . . . , C,} is of finite index in L. Moreover, since B, Q sgp{B,,. . . ,B r } , C, C, so that C, is subnormal in L and therefore also in G. Note that each C, lies in B2,by [24, Theorem 21. Now for 1 S i < j S r, C, n C, is abelian, and normal in C,. Consequently, C, f l C, is an abelian subnormal subgroup of G, and so C, fl C, = 1, since G has trivial Baer radical. Thus sgp{C,,. . . , C,} = c,x . * . x c,. For each M, (1 S j S q ) there is a Ca,,such that M, f l C,,,# 1. For if M, fl C, = 1 for all 1 s i S r then M, fl sgp{C,,. . . ,C,}= 1 by [27, Lemma 14(b)], contradicting the fact that sgp{CI,. . . ,C,} has finite index in L. Now if j # 1 then C,,, # C,(/,.For if C,,, = C,,,,then I C,,, : M, n C,,,( I C,,,:MI fl C,,,(< m since C,,, is a B2-group, and so (M, fl C,,,)fl (Ml f l C,,,) would be of finite index in C,,). But (MI fl C,,,)r l ( M Ifl C,,))= 1, so that C,o,would be finite, a contradiction. It now follows that the mapping j i ( j ) is 1 : 1, which is impossible since r < q. This contradiction shows that H < L, as required.
Largeness in group theory
327
04.4. Proof of Theorem 4.3, and an example The aims of this subsection are firstly to provide a proof of Theorem 4.3, and secondly to give an example of a just-infinite minimal group not satisfying Max-sn. Proof of Theorem 4.3. Throughout the proof implicit use will be made of the fact that every subnormal subgroup of a just-infinite group has only finitely many distinct conjugates (by [27, Theorem 3)). Let G be a just-infinite group not satisfying Max-sn and suppose there is a minimal group M < G. It will be shown that M = G. By Lemma 4.2 (i) it can be assumed that M is a subnormal subgroup of G. Moreover, by [27, Lemma 161 it can be supposed that there is no subnormal subgroup L of G with M =Z L and 1 < I L : M ( < m. Let M I , .. . , M . be t h e distinct conjugates of M under G. Now M cannot lie in %. For if M lay in 5% and g , g ’ E G with M 8 fl M 8 ’ # 1, then IM8 : M 8 fl MB’I I M 8 ’ :M g n M8‘1< 00, and so Mg = M8’ b y [27, Lemma 161. Consequently, MI,..., M, would intersect trivially in pairs, and so sgp{MI,.. . , M,} = MI x * * . x M. by [27, Lemma 141. But this would imply that G satisfied Max-sn by [27, Theorem 41. Now suppose, by way of contradiction, that G is not a minimal group. Then it will be shown that there is a non-trivial subnormal subgroup D of G such that D r l M , = 1 for i = 1,. . . ,n. This implies (using [27, Lemma 14 (b)]) that D r l sgp{MI,.. . , M.} = 1, and so sgp{M,, . . . , M,} has infinite index in G. But since sgp{M,, . . . ,M.} is normal, this contradicts t h e fact that G is just-infinite. To obtain D, a sequence of subnormal subgroups D I , . . , D, of G will be constructed satisfying the following: for m = 1 , . . . ,n (i) D, is non-trivial; (ii) M , fl D, = 1 for i = 1 , . . . , m ; (iii) there is a subnormal subgroup B, of G with B, -- M,
B, no,
= 1 , J G:sgp{B,,D,}l<m.
One then takes D to be D,. The following fact will be required: if A, B are two groups with A = M, B = M then A x B -- M. T o prove this, first note that since M is not a ’&-group, M has a non-trivial subnormal subgroup K of infinite index. Let C be a near-complement for K in G. Then ( G : K x C (< m , so I M : K x ( C f l M ) I < m . Note that since ( M :KI = 03, C n M is infinite. Now it follows from Lemma 4.2(ii) that K < M and C n M < M. Consequently, K = C n M = M , since M is minimal.
328
S.J. Pride
Now it is clear that A x B --. K x (C fl M), and since K x (C fl M) has finite index in M, it follows that A x B = M, as required. The groups DI,. . . , D, are obtained as follows. For D , choose any near-complement of MI in G. Suppose 1 S m < n and that D , , . . . ,0, have been obtained. Let L be a near-complement of M,,, f D, l in G. Then 1 G : (M,,, fl D,) x L I < 03, and so ~ D m : ( M , + , n D , ) x ( L n D , ) ( Let < ~ .D , + , = L n D , . NOWD,,, is non-trivial. For suppose not. Then M,,, r l D, would be of finite index in Dm, so that M,,, r l D, = 0,.Consequently, D, = M,+,(since M,,, fl D, < M,,, by Lemma 4.2(ii) and M,,, is minimal). However, if B, is as in (iii) then sgp{B,, D,} = B, X Dm(using [27, Lemma l4(a)]) and so, by what was shown in the previous paragraph, B, x D, = M. However, B, x D, = G. This contradicts the assumption that G is not minimal. Consequently, D,,, is non-trivial, and so (i) holds. Obviously Dm}. l Since (ii) holds. T o see that (iii) holds let B,,, = sgp{B,, M,+, f B,, M m + fl , D,, D,,, intersect trivially in pairs, it follows from [27, Lemma 14(a)] that sgp{B,+,, D m + J= B, x (M,,, fl D,) x Dm+,,and so, since sgp{B,,D,)= B, x D,, sgp{B,,l,D,+l} has finite index in G. Moreover, since B,,, = B, x (M,,, fl D m )and , since M,,, r l D, is either trivial or equally as large as M,,, (using Lemma 4.2 (ii) and the fact that M,,, is minimal), it follows from the previous paragraph that B,,, = M. This completes the proof.
Example 4.4. Let S be a non-trivial finite perfect group and for i = - 1, - 2,. . ., let Si be a copy of S. Let G = Wri<-,S , . Then it is shown in [27] that G is just-infinite (see pp. 376-377) but G does not satisfy Max-sn (see pp. 378-379). It will be shown here that G lies in A. Suppose H 5 G. By Theorem 2.1 there is a subgroup A of finite index in G mapping, under a homomorphism 8 say, onto a subgroup of finite index in H. Now A contains the base group of some segmentation of G. To be more precise, there is an integer m > O such that G = GI t Gz, where
GI = Wr S,, ic-m
G2= Wr S , , -mSicO
and where A contains the base group D of GI1 GZ. Note that GI is isomorphic to G and so D is a finite direct product of isomorphic copies of G.
329
Largeness in group theory
Let K be the kernel of the restriction of 8 to D. Note that K is subnormal in G. Since ID : K 1 = t ~ ,K must intersect at least one of the factors of D trivially. Thus D = G I x L with K f l G I = 1. By [27, Lemma 14(b)], sgp{L,K}n G I= 1, and so D I K maps onto G I . Thus G = G I< D I K -- H. Consequently, H = G, and so G is minimal. 84.5. Extensions It is not difficult to see that if A is an almost normal subgroup of a group G, and if 8 is a homomorphism of G, then 8 ( A ) is almost normal in 8 ( G ) .In addition if H S G then H fl A is almost normal in H . It thus follows from [19, Lemma 1;48] that the properties Max-an and Min-an are extension closed. Moreover, suppose a group G has a normal subgroup N such that every properly ascending chain of almost normal subgroups in N has length at most r, and every properly ascending chain of almost normal subgroups in GIN has length at most s ; then every properly ascending chain of almost normal subgroups in G has length at most rs. To prove fhis latter result* consider a chain A , S A z C . almost normal subgroups of G. Then
AiNIN
d
AZNIN
d
*
S
*
d
A,,+,of
A,+iN/N,
and by the condition imposed on GIN, A,NIN = . * . = A,+,N/Nfor some i . Also, since A, fl N 6 . . . s A,,, r l N , there exists i S j < 1 S i + r such that A, fl N = Al fl N, by the condition imposed on N. Since A,N = AIN and A, C Al one now has: A, = A, n A,N =
A, n A ~ N
=
A l ( A ,fl N )
=
Ai(Ai n N )
= A,.
Now it was shown in 04.1 that there is a relationship between the almost normal subgroups of an infinite group G and the groups H with H 4 G. It is, therefore, natural to ask whether results analogous to those stated above apply for the relation < . For example are M a x - d and M i n - d extension closed? Is the property of having finite height extension closed? The main aim of this subsection is to show I thank B.A.F. Wehrfritz for supplying me with this argument.
330
S.J. Pride
Theorem 4.4. The properties Max- < and “having finite height” are not extension closed. In fact, there is a split extension of one minimal group by another which does not satisfy Max- S .
In contrast to Theorem 4.4 the following result will also be proved in this subsection. Theorem 4.5. Let M I , .. ., M,, be minimal groups, and let G = M , x . . x M,. Let H < G. Then H = Mi,x . . . x Mi, where { i l , .. . , i , } c { l , .. . , n}. In particular G has height less than or equal to n. It should be mentioned that the height of G in Theorem 4.5 can be less than n. For example, if M is a minimal just-infinite group not satisfying Max-sn then M x M = M , as can be seen from the proof of Theorem 4.3. The following question remains open.
Problem 7. Is Min-
< extension closed?*
The rest of this subsection is devoted to the proofs of Theorems 4.4 and 4.5. For Theorem 4.4 the following construction is required. Let I be a non-empty set and for i E I choose a non-trivial group A,. For (i, n ) E I x Z let A ( i , n ) be a copy of A,, and let B be the restricted direct product HE,,,A (i, n ) of the A (i, n ) . Then B = IIZ, B,, where B, = I I f E Z A ( in, ) . Now for each i there is an automorphism 8, of B, obtained by permuting the factors according to the permutation of Z defined by n n + 1. Let 8 be the unique automorphism of B whose restriction to B, is 8, for i E I, and form the split extension of B by an infinite cyclic group ( t ) . If G is (isomorphic to) the resulting group it will be convenient to write G = [A, ( i E I ) , t ] . Define the rank of G to be the cardinality of I.
Lemma 4.5. Assume each A, is perfect. Let N e G and suppose btPE N with b E B and p # 0. Then B N . Proof. For each i there exixts no such that the (i, no) coordinate of b is 1. Let a l , a z E A ( i , n o )Then . t-Pb-l~lbtP = r-PaltP EA(i,no+p), and so [[btP, a , ] ,U Z ] = [ a l ,a*].But [ [ b t P ,all, az]E N, and since a l , a2 are arbitrary it follows that N contains the derived group of A ( i , no), and Added in proof. The answer is “No” (B.Hurley).
Largeness in group theory
33 1
therefore the whole of A (i, no) since A, is perfect. Conjugating A (i, no) by powers of t then establishes that A(i,n ) s N for all n E Z. Thus B,S N , and since i was arbitrary, B S N as required. Lemma 4.6. Suppose each A, is perfect and let H be a subgroup of finite index in G. Then H = sgp{B, t " } for some q > 0. Moreover H = [C,( i E I ) , s] where C, = A f. In particular, the rank of H is the same as the rank of G. Proof. Now there is a normal subgroup N of finite index in G containing H. In particular, t P E N for some p # 0, and so it follows from the previous lemma that B s N s H. Thus H = sgp{B, t'} for some q > O . Now let s = t", and for (i, n ) E I x Z let C(i,n ) = A ( i , n q ) x A ( i , nq + 1 ) X - . . X A ( i , nq + q - 1). T h e n for all n E Z , C(i,n ) is isomorphic to A f , and it is readily seen that H = [C,( i E I ) , s ] .
Lemma 4.7. Suppose A, = n?S, ( i ) ( i E Z), where each S, ( i ) is a nonabelian simple group. Let G be an infinite homomorphic image of G. Then G = [fly S, (j) (j E J ) , f], where J C I and where for each j E J, f, is a non-empty subset of A,. In particular, the rank of is less than or equal to the rank of G. Proof. In the copy A(i,n ) of A,, the copy of S, ( i ) will be denoted by S,(i,n), so that A ( i , n ) = n ? , S , ( i , n ) . Some of the following calculations are similar to those in Example 4.3 but are repeated in detail here for convenience. Let N be the kernel of the homomorphism G - G . It follows from Lemma 4.5 that N s B. Consider the subgroups N n A ( i , n ) . It will be shown that these generate N . Let b be an element of B belonging to N , and suppose that the (i, n ) coordinate, a say, of b is non-trivial. Now A (i, n ) = n?,S, (i, n ) . Suppose the A th coordinate, s say, of a is non-trivial. Let z E S, (i, n ) . Then [b,z ] E N and [b, I ] = [s, 21. Since [s, I ]# 1 for at least one z, it follows that SA(i,n ) < N. Consequently, a E N, and so b E sgp{N n A (i, n ) : (i, n ) E I x Z},as required. Now similar calculations to those in the previous paragraph show that N fl A (i, n ) is the subgroup generated by S, (i, n ) with A ranging over some subset ei of A. This subset ei depends only on i, and not on n. For if S A ( i ; n S ) N n A ( i , n ) , then conjugating by r p shows that S,(i, n + p ) s N n A ( i , n + p ) .
332
S.J. Ride
It is now not difficult to see that one can take J to be the subset of I consisting of those elements i for which Si# A , and then put & = Ai'\6Ji for j E J. This proves the lemma. Proof of Theorem 4.4. In the above construction take Z = {1,2,. . .}, and take all the Ai to be isomorphic copies of some finite non-abelian simple group. Then B is minimal (see Example 4.3), and G is a split extension of B by the minimal group ( t ) . It will be shown that G does not have Max- < . For p = 1,2,.. . let G, = [Ai(i = 1,. . . , p ) , t ] . Then G I + Gz< < G. It will be shown that G, < G,+,. Suppose, by way of contradiction, that G,+,< G,,. Then by Theorem 2.1, G, would have a subgroup H, of finite index mapping onto a subgroup H,+] of finite index in GP+,.Now the rank of H, would be p, by Lemma 4.6. Moreover, H, = [Af(i = 1,. . . , p ) , t ] for some q. Consequently, by Lemma 4.7, H,+, would have rank less than or equal to p. On the other hand, H,+, would have rank p + 1, by Lemma 4.6. This contradiction shows G, < GP+,,as required. Proof of Theorem 4.5. The proof is by induction on n. If n = 1 the result is trivial. Suppose n > 1, and let H =S G. By Theorem 2.1 there is a subgroup A of finite index in G mapping, by a homomorphism 8 say, onto a subgroup of finite index in H. Let Ai = A n Mifor i = 1,. ..,n. Then I M i : A i ) < w a n d s oI G : A , x . - . x A , ( < ~Notice . that 8(Al X . * . X A,) has finite index in H. Let N be the kernel of the restriction of 8 to A , x * x A,. For i = 1,. . . ,n let
Ni = { a : a E Ai, there exists u E A, x
. . . x-Ai-,x A,+,x
-
*
x A. such that au E N}.
Then Ni Ai. Case 1. For some i, 1 Ai : Ni 1 < w.
Assume for definiteness that 1 A, : N , 1 < 03. Then (A2x . . X A,)N/N has finite index in (A, x * * x A,)/N. For let z,, . . . ,z, be coset representatives of N , in A,, and let au E A , X . X A, with a E A,, u E Az x * * x A,. Then a = zia' for some zj and some a ' € N , . Moreover, since a ' € N , , there exists u E A 2x * X A, such that a ' u E N . Thus au = ~ ~ ( a ' u ) ( u - and ~ u )so auN = z j ( u - ' u ) N . Consequently (A, x * x A,)/N = U:(zjN)((A2 X * X A.)N/N).
-
-
- -
-
- -
Largeness in group theory
333
It now follows that Since
(A1 X
* * .
X
A,,)/N 2: (A2X .. . X A.)/N n(A2X
* * .
(A2X
* * *
X
A,,)/Nn (A2X
X
* * *
X
A,,) 6 M2X
0
.
*
X
A").
M,,,
the inductive hypothesis now applies to give the required result.
Case 2. For all i, / A i: Ni I = w. NOW N sgp{NI,N2,. . . ,N,,}.Thus ( A lx * * * x A,,)/sgp{NI,.. .,N.} < ( A l x - . . x A . ) / N But . ( A I x ~ ~ ~ x A n ) / s ,..., g p {N.} N l is isomorphic to A I / N Ix . . . x A,,/N,,,and since for each i, A i / N i Mi, it follows that AJN1X * * * X A,/N,, = G. Thus H G. This completes the proof. 2 :
2 :
BS. A final question Let 9 be a large property and let % be a class of groups. Say that 9 is large for 9 if every %-group with 9 has infinite height.
With the possible exception of SQ-universality, none of the properties listed in $2.1 is large for finitely presented groups. In fact, for all the properties except SQ-universality there are finitely presented minimal groups with the property. On the other hand, finitely generated SQ-universal groups cannot be minimal (since finitely generated groups have just-infinite quotients). Now SQ-universality is generally agreed to be quite a good measure of largeness for countable groups. Can this be made more precise? Problem 8. Is SQ-universality large for finitely presented, finitely generated, or even countable groups?"
Of course, SQ-universality is not large for all groups since there are simple (and hence minimal) SQ-universal groups. References [ l ] H. Bass, J. Milnor and J.-P. Serre, Solution of the congruence subgroup problem for SL, ( n Z 3 ) and Sp,. ( n Z 2 ) . Inst. Haute Etudes Sci. Publ. Math. 33 (1%7) 59-137. [2] 1.M. Chiswell, Abstract length functions on groups, Math. Proc. Camb. Phil. Soc. 80 (1976) 451463. Added in proof. B. Hurley has constructed a countable SQ-universal minimal group.
334
S.J. Pride
[3] D.E. Cohen, Groups with free subgroups of finite irldex, Conference on Group Theory, Springer Lecture Notes No. 319, 26-44. [4] D.J. Collins, Free subgroups of small cancellation groups, Proc. London Math. SOC. ( 2 ) 26 (1973) 193-206. [5] B. Fine and M. Tretkoff, The SQ-universality of certain arithmetically defined linear groups, J. London Math. SOC. (2) 13 (1976) 65-68. [6] F.P. Greenleaf, Invariant means on topological groups, Van Nostrand Mathematical Studies No. 16 (Van Nostrand-Reinhold Company, New York, Toronto, London, Melbourne, 1969). [7] P. Hall, On embedding a group i n t o the join of given groups, J. Austral. Math. SOC. 17 (1974) 434-495. [8] A.H.M. Hoare, A. Karrass and D. Solitar, Subgroups of infinite index in Fuchsian groups, Math. Z. 125 (1972) 59-69. [9] A. Karrass and D. Solitar, Subgroups of HNN groups and groups with one defining relation, Can. J. Math. 23 (1971) 627-643. [lo] A. Karrass, A. Pietrowski and D. Solitar, Finite and infinite cyclic extensions of free groups, J. Austral. Math. SOC.26 (1973) 458-466. [ 1 I] O.H. Kegel and B.A.F. Wehrfritz, Locally finite groups, North-Holland Mathematical Library, Vol. 3 (North-Holland, Amsterdam, London, New York, 1973). [I21 D. McCarthy, Infinite groups whose proper quotient groups are finite I, Comm. Pure Appl. Math. 21 (1968) 545-562. [13] D. McCarthy, Infinite groups whose proper quotient groups are finite 11, Comm. Pure Appl. Math. 23 (1970) 767-790. [14] W. Magnus, A. Karrass and D. Solitar, Combinatorial group theory: Presentations of groups in terms of generators and defining relations (Interscience Publishers, New York, London, Sydney, 1966). [I51 J. Milnor, A note on curvature and fundamental group, J. Differential Geometry 2 (1968) 1-7. [16] J.L. Mennicke, Finite factor groups of the unimodular group, Ann. of Math. (2) 81 (1965) 31-37. [17] P. Neumann, The SQ-universality of some finitely presented groups, J. Austral. Math. SOC.16 (1973) 1-6. [I81 S.J. Pride, On quotients of certain countable groups, Bull Austral. Math. SOC.,16 (1977) 225-228. [ 191 D.J.S. Robinson, Finiteness conditions and generalized soluble groups, Parts 1 and 2, Ergebnisse der Mathematik und ihrer Grenzgebiete, Bands 62 and 63 (SpringerVerlag, New York, Berlin, 1972). [20] G. Sacerdote and P.E. Schupp, SQ-universality in HNN groups and one relator groups, J. London Math. SOC.(2) 7 (1974) 733-740. [21] P.E. Schupp, A survey of small cancellation theory, in Word Problems (NorthHolland Publishing Company, Amsterdam, London, 1973). [22] G.P. Scott, An embedding theorem for groups with a free subgroup of finite index, Bull. London Math. SOC.6 (1974) 30&306. [23] J.R. Stallings, Quotients of the powers of the augmentation ideal in a group ring, in Knots, Groups and 3-Manifolds, Annals of Mathematics Studies No. 84 (Princeton University Press, Princeton, 1975). [24] C. Tretkoff, Some remarks on just-infinite groups, Commun. Algebra 4 (1976) 43-49.
Largeness in group theory
335
(251 J . Wiegold, Transitive groups with fixed-point free permutations, Arch. Math. 27 (1976) 473-475. [26] J.S. Wilson, Some properties of groups inherited by normal subgroups of finite index, Math. Z . 114 (1970) 19-21. [27] J.S. Wilson, Groups with every proper quotient finite, Proc. Cambridge Philos. SOC. 69 (1971) 373-391. [28] R. Zimmert, Zur SL, der ganzen Zahlen eines imaginar-quadratischen Zahlkorpers, Inventidnes Math. 19 (1973) 73-81.
S.I. Adian, W.W. Boone, G . Higman, eds., Word Problems I1 @ North-Holland Publishing Company (1980) 337-346
ALGORITHMIC PROBLEMS FOR SOLVABLE GROUPS V.N. REMESLENNIKOV and N.S. ROMANOVSKII Institute of Mathematics, Nooosibirsk
$1. Introduction
Classical algorithmic problems - t h e word problem, conjugacy problem, and isomorphism problem - arose in the theory of groups from topolocal considerations and were formulated for t h e first time at the beginning of this century by Dehn. In the middle fifties it was shown that these problems have negative solutions in the class of all groups. However, for important classes of groups such as nilpotent and solvable groups, these problems remained unsolved. At t h e present moment a number of interesting results have been obtained in this field, and this paper is intended to survey them. Let us recall the main definitions and t h e formulation of t h e classical algorithmic problems for t h e class of all groups. We call the pair ( X , R ) t h e presentation of group G, where X is a non-empty set and R is a set of t h e words in t h e alphabet X U X - ' . The group G is a factor group of the free group with the base X by the normal subgroup, generated by the set R . With this understanding we shall write G = (X, R ) ; further, X is called the set of the generators of group G, and R - the set of defining relators. If t h e sets X and R are finite, we say that group G is finitely presented. We shall also consider the case when X is finite and R is recursively enumerable. In this case G is called a recursively presented group. Let G be a given finitely presented (recursively presented) group. The word problem, conjugacy problem and membership problem are the problems of establishing the existence of algorithms to decide respectively (i) whether or not an arbitrary element (word) of group G equals 1; (ii) whether or not two arbitrary elements from G are conjugate in G ; (iii) for a given arbitrary finite set { u , u , , . . . ., u,} of elements of G 337
338
V.N.Remeslennikou, N.S. Romanouskii
whether or not the element u belongs to the subgroup generated by UI,. . . , V".
Finally, let = {GiI i E N } be a recursive class of finitely presented groups. The isomorphism problem for the class C#I consists in establishing the existence of an algorithm to decide for arbitrary i, j whether or not the groups G, and G, are isomorphic. P.S. Novikov [lo] and W.W. Boone [27] constructed examples of finitely presented groups, for which the word problem was solved negatively. This implied the negative solution of the conjugacy and membership problems. Aided by this result, S.I. Adian [l] proved that the isomorphism problem in the class of all groups also has a negative solution. In addition, algorithmic unsolvability was proved for a number of other problems, such as the recognition of the following properties: nilpotency, finiteness, simplicity or unity of a given group presentation. Let H be a subgroup of the finitely presented group G, generated by the elements u I , . .. , u r nThe . set of all the words from the u, is recursively enumerable, as is the set of all the words from G equal to 1. The intersection of these two sets forms the recursively enumerable set of the presenting relations of group H in the generators u I , .. . ,u,. Therefore, the finitely generated subgroup of the finitely presented group is recursively presented. H igman (301 showed that the converse statement is also correct; i.e. every recursively presented group is embeddable in some finitely presented group.
92. The setting of the problems
Thus we see that the majority of algorithmic problems has a negative solution in the class of all groups. Therefore, it is natural to study algorithmical problems with additional restrictions on t h e groups considered. In a number of papers, such restrictions were on the form of the defining relations, e.g., Magnus' theorem [32], proved in 1932, and which deals with the solvability of the word problem for groups with one defining relation. However, we shall consider restrictions having a natural group-theoretic character, such as nilpotency, polycyclicity and solvability. Here, two approaches to such problems are possible. The first approach consists in the algorithmic problems being considered for a finitely presented group (or a set of groups) under the assumption that such group turns out to be, say, solvable of given steps of solvability;
Algorithmic problems for solvable groups
339
i.e. it is assumed that t h e condition of solvability results from the defining relators. In t h e second approach, groups which are finitely presented in a given variety are studied. This means t h e following: Let V be a variety given by a finite set of identities. We say that a group G is given in the variety V by generators X and relations R, if it is a factor group of t h e free group of the variety V with the base X by the normal subgroup, generated by the set R. Finitely presented group and recursively presented group in the variety V are defined in the obvious way. Let us introduce the corresponding designations FP, RP, FPV, RPV for the classes: finitely presented, recursively presented, finitely presented in the variety V, recursively presented in the variety V. In t h e language of these classes t h e first of the formulated approaches leads to the study of groups from F P n V, if the additional condition consists of membership in t h e variety V. In the second approach groups from FPV are studied. Evident inclusions take place:
F P n V c F P V C R P n V=RPV. For some varieties these relations may be made more exact. If V is t h e variety of groups of nilpotency class c, then it is well known that FP f l V = FPV = RPV. where 1 3 2 For the variety of 1-step solvable groups, denoted by d’, strict inclusion takes place: F P n V 3 the inclusion is strict FPV < RPV, as it was shown by A.L. Shmelkin [23], who proved that the free metabelian group is not finitely presented in d’. Recalling Higman’s theorem about the embedding of recursively presented groups into finitely presented ones, we may formulate a few which acquire an important meaning, problems for the varieties d’, taking into account V.N. Remeslennikov’s theorem, which we shall deal with below.
340
V.N. Remeslennikov, N.S. Romanooskii
Problem 1. Can any RPd'-group, where I >3, be embedded into an FPdm-groupfor suitable m ? Problem 2. Can any FPdI-group, where I >3, be embedded into an FP r l d"-group for suitable m ? Problem 3. Can any RPd-group, where I > 3 , be embedded into an FP rl A -group for suitable m ? For I = 2 these problems have been solved positively, since G. Baumslag [25] and V.N. Remeslennikov [13, 151 have shown that any finitely generated metabelian group can be embedded into a finitely presented metabelian group. 63. Classical algorithmic problems for solvable groups Before supplying the formulations of the main results obtained, we shall take note of the connection of the algorithmic problems with the finite approximatability of the groups. Let T be some group-theoretic predicate. We say that the group G is finitely approximated with respect to the predicate T , if for any collection of elements which are not related under T, there exists a homomorphism of group G to a finite group H such that the images of the elements considered are not related under T in H. Let us consider, for instance, the predicate: to be equal to the element 1. Groups which are finitely approximated with respect to this predicate are called residually finite. Let G be a group finitely presented in the variety V. Let the variety V be given by a finite set of identities. If G is residually finite, then the word problem can be positively solved in G. I.e., the set of words representing 1 is recursive. It is evident that this set is recursively enumerable. Therefore, it is enough to show that its complement is also recursively enumerable. Actually, because of the residual finiteness the complement may be enumerated by searching through all possible homomorphisms of group G to finite groups. Similarly, if the group G is residually finite with respect to conjugacy or membership, then in it the problems of conjugacy and membership are solved positively. Let us consider now the word problem for groups in d'.By Hall's theorem [29] finitely generated metabelian groups are residually finite, therefore, the word problem for groups finitely presented in the variety d 2 ,is solved positively. On the other hand, V.N. Remeslennikov's
Algorithmic problems for solvable groups
341
theorem [14] states that for 1 > 5 there exists a group finitely presented irl the variety d’, for which the word problem is solved negatively. To be more precise, for each I a specific representation of this group can be obtained. Since FP n d’< F P d ’ for 1 > 3, we are led to the following question:
Problem 4. Does there exist a solvable finitely presented group with unsolvable word problem? Let us note that a similar question was formulated by S.I. Adian, who asked if there exists a group with unsolvable word problem which satisfies some nontrivial identity. It is clear that an affirmative answer to problem 2 and 3 gives an affirmative answer to problem 4. The cases 1 = 3, 4 are not covered by V.N. Remeslennikov’s theorem. Hence, we have the following question:
Problem 5. Is the word problem solvable for groups from F P d 3 or FPd4? As for the conjugacy problem, we know still less about it. It follows from V.N. Remeslennikov’s theorem that for 1 > 5 there exist examples of groups finitely presented in the variety d’, for which the conjugacy problem has negative solution. However, this problem remains uninvestigated for solvable groups of 2, 3 and 4 steps. It should be noted that it cannot be solved by using the finite approximation method, because M.I. Kargapolov and E.I. Timoshenko [5], and Wehrfritz [35],have constructed examples of finitely generated metabelian groups which are not residually finite with respect to conjugacy. As for the membership problem, there exist two main results. On the one hand, N.S. Romanovskii [18] proved that it is solved positively for metabelian groups. On the other hand V.N. Remeslennikov [14], for 1 3 4 , has constructed an example of a group which is finitely presented in the variety d‘,and with a finitely generated subgroup for which the membership problem is solved negatively. Hence, we have the following question:
Problem 6. Is the membership problem solvable for groups in F P d 3 ? By using V.N. Remeslennikov’s theorem, for each I2 7, there was for which there constructed in [6] a group G finitely presented in d’,
342
V.N.Remeslennikov, N.S.Romanovskii
exists no algorithm, deciding for any group H finitely presented in d' whether or not it is isomorphic to G. By this example the isomorphism problem for F P d ' for 1 3 7 is solved negatively. Hence, we have the following
Problem 7. Is the isomorphism problem solvable for metabelian groups? We recall that in the class of all groups the isomorphism problem for a unit group is solved negatively. The analogous statement is wrong for the variety d', because there G # 1 if and only if G / G ' # 1. Therefore, the problem is reduced to the same problem for abelian groups, where, as is well known, it has a positive solution. We may also remark that for groups from F P d ' there exists an algorithm to decide whether a given group is finite or not, and in the positive case to find its multiplication table. This can be proved by induction on the solvability step of the group. It should be noted that, for solvable groups, there does not exist a general theorem on the algorithmic undecidability of group-theoretic properties, similar to the theorem of Adian-Rabin. Perhaps, we can prove such a theorem if we can solve the following:
Problem 8. Is there an algorithm which decides if groups in d', 1 > 2, are nilpotent? Now a few words about the methods of proof of the majority of theorems of this kind. They use the so-called Magnus embedding of a free solvable group, the essence of which lies in the following. Let 5 be a free solvable group of step 1 and rank n and T the left free module of rank n over the ring Z[F,]. Then the free solvable group Fl+Iof the step 1 + 1 and rank n can be embedded into the group of matrices M of the type (6 :), where f E R and t E T.We may consider the group M as the wreath product of the free abelian group of rank n with R. The Magnus embedding turns out to be convenient, for instance, in the case that N is a normal subgroup of E + , ;for in this case the embedding may be extended to the normal of group M,which represents the semidirect product of subgroup the subgroups consisting of unitriangular and diagonal matrices such n F,+,= "181. that Thanks to the Magnus embedding many questions concerning solvable groups can be replaced by corresponding questions about modules over group rings of lesser solvability step. In particular, a
Algorithmic problems for solvable groups
343
number of algebraic problems for two-step solvable groups become problems for free modules over the ring Z[%',,. . . ,Zn]. 94. Free solvable groups
As has been mentioned above, free solvable groups of finite rank are recursively defined in the class of all groups, and evidently finitely defined in the corresponding variety YI'. By the use of the Magnus embedding for a free solvable group, the positive solution of the word problem is easily established. Matteus [33] showed that in the wreath product of groups A and B the conjugacy problem is solved positively, if it is solved positively in A and B, and additionally, the membership problem for cyclic subgroups is positively solvable in B. Making use of this fact and of the Magnus embedding, M.I. Kargapolov and V.N. Remeslennikov [3] positively solved the conjugacy problem for free solvable groups by induction on the solvability step. Later on, V.N. Remeslennikov and V.S. Sokolov [16] established the finite approximatability of those groups with respect to conjugacy. R.A. Sarkisjan [19] proved that the conjugacy problem is positively solved for free polynilpotent groups. The next problem is still open. Problem 9. Is the membership problem solvable for free solvable groups? This problem is closed connected with the following. Problem 10. Is there an algorithm for solving finite systems of linear equations over Z[F,]? Here, F. is a free solvable group with solvability length 1. Let us also note that in an absolutely free group there exist algorithms (for instance, Nilsen's method), to decide if a given finite set is a basis. A.F. Krasnikov (7) has found a similar algorithm for free solvable groups. The class of groups with one defining relation in the variety a' is the closest to free solvable groups. We know little about these groups so far. Let us still note that N.S. Romanovskii [17] proved the analog of Magnus' Freiheitssatz for a group with one relation in the variety a', and the word problem with some restrictions on the single relation has been solved [18]. However, the general case remains open:
V.N.Remeslennikou, N.S.Romamuskii
344
Problem 11. Is the word problem solvable for groups with one defining relation in the variety ‘ill’1,3 3? 05. Algorithmic problems for nilpotent groups
In 1955 A.I. Mal’cev [8] showed that the word problem can be solved positively for nilpotent groups. He also showed [9], that finitely generated nilpotent groups are residually finite with respect to membership, hence the membership problem is solvable. Blackburn [26] has positively solved the conjugacy problem for nilpotent groups by the same method. M.I. Kargapolov and his students [4] raised and solved the following algorithmic problems for $3-power groups over a those of word, conjugacy, membership, separating of “good” ring ‘0-periodical part, intersection, the description of subgroups in the terms of relations. Thus, we see that in nilpotent groups the picture is diametrically opposed to that in the class of all groups we have the following:
a:
Problem 12. Is the isomorphism problem solvable for nilpotent groups? This problem cannot be solved by the method of finite approximation, for at present we know many examples of nonisomorphic finitely generated nilpotent groups with the same families of finite homomorphic images. Concerning this we should note Pickel’s theorem [34]. H e showed that there can only exist a finite number of groups with the same family of finite homomorphic images in the class of finitely generated nilpotent groups. It is highly probable that problem 12 will be answered negatively, hence, we have another interesting problem: Problem 13. Is the isomorphism problem solvable for groups with one defining relation in the variety of nilpotent groups with given nilpotency class? Such an algorithm exists for class 2-nilpotent groups [20]. 06. Algorithmic problems for polycyclic groups
It should be noted at once that any polycyclic group is finitely presented, so it is possible to discuss algorithmic problems for
Algorithmic problems for soloable groups
345
polycyclic groups. All results known to us here were obtained by means of finite approximatability. Initially, Hirsch [31] showed that polycyclic groups are residually finite. Then A.I. Mal’cev [9] showed that they are finitely approximated with respect to membership. Finally, V.N. Remeslennikov [12] and Formanek [28] proved the finite approximatability of polycyclic groups with respect to conjugacy. The isomorphism problem for polycyclic groups is still unsolved.
References [ I ] S.I. Adjan, Insolvability of some algorithmic problems of group theory, Trudy Moskovoskogo Matem. ob-va 6 (1957) 213-298 (Russian). [2] M.I. Kargapolov, Finite approximatability of supersolvable groups with respect to conjugacy, Algebra i logika 6, No. 1 (1967) 63-68 (Russian). [3] M.I. Kargapolov and V.N. Remeslennikov, Conjugacy in free solvable groups, Algebra i logika 5, No. 6 (1966) 15-25 (Russian). [4] M.I. Kargapolov, V.N. Remeslennikov, N.S. Romanovskii V.A. Roman’kov and V.A. Churkin, Algorithmic problems for 0 power groups, Algebra i logika 8, No. 6 (1969) 643-659 (Russian). [5] M.I. Kargapolov and E.I. Timoshenko, O n the problem of finite approximatability with respect to conjugacy of metabelian groups, 4 Vsesoyuznij simpozium PO teorii grupp (Tezisy dokladov, Novosibirsk 1973) (Russian). [6] A S . Kirkinskii and V.N. Remeslennikov, The isomorphism problem for solvable groups, Matematiceskie zametki 18, 3 (1975) 437-443 (Russian). [7] A.F. Krasnikov, On the bases of free solvable groups, to appear (Russian). [8] A.I. Mal’cev, Two remarks on nilpotent groups, Matematiceskij sbornik 37 (1955) 567-572 (Russian). [9] A.I. Mal’cev, On homomorphisms to finite groups, Ucenye zapiski lranovskogo ped. instituta 18 (1958) 49-60 (Russian). [lo] P.S. Novikdv, On the algorithmic unsolvability of the word problem in the theory of groups, Trudy matematiceskogo instituta AN SSSR 44 (1955) 1-144 (Russian). [ I l l V.N. Remeslennikov, Conjugacy of subgroups in nilpotent groups, Algebra i logika 6, No. 2 (1967) 61-76 (Russian). [I21 V.N. Remeslennikov, Conjugacy in polycyclic groups, Algebra i logika 8, No. 6 (1969) 712-725 (Russian). [ 131 V.N. Remeslennikov, O n finitely presented groups, 4 Usesoyuznyj simpozium PO teorii grupp, Novosibirsk 1973, pp. 164-169 (Russian). (141 V.N. Remeslennikov, An example of a group, finitely presented in the variety ?I5, with the unsolvable word problem, Algebra i logika 12, No. 5 (1973) 577-602 (Russian). [IS] V.N. Remeslennikov, Investigations of infinite solvable and finitely-approximated groups, Doctoral dissertation, Novosibirsk, 1974 (Russian). [ 161 V.N. Remeslennikov and V.G. Sokolov, Some properties of Magnus’ imbedding, Algebra i logika 9, No. 5 (1970) 566-578 (Russian). [I71 N.S. Romanovskii, The freedom theorem for groups with one defining relation in
346
V.N.Remeslennikou, N.S. Romanooskii
the varieties of solvable and nilpotent groups of given steps, Matematiceskij sbornik 89, No. 1 (1972) 93-99 (Russian). [18] N.S. Romanovskii, On some algorithmic problems for solvable groups, Algebra i logika 13, No. 1 (1974) 2 6 3 4 (Russian). [19] R.A. Sarkisjan, Conjugacy in free polynilpotent groups, Algebra i logika 11, No. 6 (1972) 694-710 (Russian). [20] N.F. Sesekin, On the classification of metabelian (two-step nilpotent) torsion free groups, Ucenye zapiski Ural’skogo universiteta 1 9 (1965) 27-41 (Russian). [21] V.G. Sokolov, An algorithm for the word problem for a class of solvable groups, Sibirskij matematiceskij iurnal 12, No. 6 (1971) 140.5-1410 (Russian). (221 E.I. Timoshenko, Some algorithmic problems for metabelian groups, Algebra i logika 12, No. 2 (1973) 232-240 (Russian). [23] A.L. Shmel’kin, Wreaih products and varieties of groups, Izv. AN SSSR, ser. mat. 29 (1965) 149-170 (Russian). [24] G. Baumslag, Wreath products and finitely presented groups, Math. Z. 7.5 (1961) 22-28. [2.5] G. Baumslag, Subgroups of finitely presented metabelian groups, J. Austral. Math. Soc. 16 (1973) 98-110. [26] N. Blackburn, Conjugacy in nilpotent groups, Proc. Amer. Math. Soc. 16 (1965) 143- 148. [27] W. Boone, The word problem, Ann. Math. 70, No. 2 (1959) 207-265. [28] E. Formanek, Conjugate separability in polycyclic groups, J. Algebra 42, No. 1 (1976) 1-10, [29] P. Hall, On the finiteness of certain solvable groups, Proc. London Math. Soc. 9 (1959) 595-622. [30] G. Higman, Subgroups o n finitely presented groups, Proc. Roy. Soc. London A262 (1961) 455-475. (311 K.A. Hirsch, On infinite solvable groups, IV, J. London Math. Soc. 27 (1952) 81-85. 1321 W. Magnus, Das Identitats problem fur Gruppen mit einer definierenden Relation, Math. Ann. 106 (1932) 295-307. [33] J. Matteus, The conjugacy problem in wreath products and free metabelian groups, Trans. Amer. SOC. 121 (1966) 329-339. [34] P.F. Pickel, Finitely generated nilpotent groups with isomorphic finite quotients, Trans. Amer. Math. Soc. 160 (1971) 327-341. [35] Wehrfritz, Two examples of solvable groups that are not conjugacy separable, J. London Math. Soc. (2). 7 (1973) 312-316.
S.I. Adian, W.W. Boone, G . Higman, eds., Word Problems I1 @ North-Holland Publishing Company (1980) 347-371
QUADRATIC EQUATIONS IN GROUPS, CANCELLATION DIAGRAMS ON COMPACT SURFACES, AND AUTOMORPHISMS OF SURFACE GROUPS Paul E. SCHUPP* The University of Illinois at Urbana -Champaign
91. Introduction
In recent years cancellation diagrams have become a fundamental tool of combinatorial group theory, and we assume familiarity with the basic facts about such diagrams. (See, for example, [9], [lo], or [15].) The basic inspiration for this paper is a remark of Roger Lynaon who once stated that “There is an obvious intuitive connection between the question of commuting elements and cancellation diagrams o n t h e torus, which has yet to be made precise.” It turns out that the connection can indeed be made precise, not only for the basic commutator, but for any quadratic word. In terms to be defined precisely below, we show that a “non-trivial” solution of a “quadratic equation” W = 1 in a group G = (X: R ) induces a “non-trivial” cancellation diagram on a compact surface S defined by an endomorphic image of W. If we assume that R satisfies a suitable small cancellation hypothesis, such a diagram cannot exist, and we can thus conclude that all solutions of W = 1 in G are “trivial”. Consequences of this result are discussed below. It turns o u t that the present methods also yield a very simple proof of the important theorem of Nielsen [I41 that if G is the fundamental group of a compact surface then all t h e automorphisms of G are induced by automorphisms of the corresponding free group. It is a pleasure to acknowledge many helpful conversations with C.C. Edmunds during the research leading to this paper. We now turn to definitions and precise statements of results. Let @ = (a,,a z ,. . . ) be a free group of countably infinite rank. We shall use the letters a, p, y, etc. to denote generators or inverses of generators of @. A word W E @ is quadratic if every generator of @ * This research was supported by the U.S. National Science Foundation and the National Research Council of Canada. 347
348
P.E. Schupp
which occurs in W occurs exactly twice. Each quadratic word W defines a closed compact surface as the quotient space of a polygon whose sides are labelled, in counterclockwise order, by W. (We assume that the reader is familiar with the proof of the classification theorem for compact 2-manifolds; see, for example, Massey [12].) It is wellknown that if W is a quadratic word, then there is an automorphism of @ sending W to one of the canonical forms [ a I PI] , * * * [a8, p,] or a:. . * a;. (The proof of t h e classification theorem tells one how to construct the automorphism.) We say that W is orientable (non-orientable)if the surface defined by W is orientable (non-orientable). It is well-known that W is orientable if and only if for every generator ai which occurs in W, the two occurrences of a, have opposite signs. We define the Euler characteristic of W, written x( W ) , to be the Euler characteristic of the surface defined by W. To calculate the Euler characteristic of W, we write W counterclockwise around the boundary of a polygon in the plane, and then count the numbers V, E, and F of vertices, edges, and faces after identifying each pair of edges labelled by the same generator, and set x( W )= V - E + F. If W contains n generators, then there are n edges after identification. If W is a canonical form it is easy to verify that there is only one vertex. Since there is exactly one face, we have x( W )= 2 - n for W a canonical form containing n generators. We shall often use this observation. Let F = (X) be a free group, and let G = (X; R ) be a quotient of F. We shall always assume that the set R of defining relators for G is symmetrized, that is, all elements of R are cyclically reduced and R is closed under taking cyclic permutations and inverses. Let N be the normal closure of R in F, so that G = F / N , and let n- : F+ G be the natural map. We want to investigate solutions of quadratic equations in G. Within the set of all solutions we distinguish those solutions which arise in an obvious manner from solutions in the free group F. Definition. Let W ( a l ,. . .,a,) E @. A solution of the equation W = 1 in G is an n-tuple of words ( a l , . . , a , ) on X" such that W ( a l , . . , a , ) = 1 in G. We say that the equation is quadratic if W is quadratic. We say that the solution ( a l , .. . , a,) is free if there exist words b l , . . . ,b, in F such that n-(b,)= a, for i = 1 , . . . ,n, and W ( b l , . .,b,) = 1 in the free group F. (Note in which groups the equations hold.) For example, since two elements of a free group commute if and only if they are powers of a common element, a non-free solution of ( Y ~ ( Y ~ ( Y ; I ~= ; ~ 1 in G is a pair ( a l ,a z ) representing elements of G which commute but which are not powers of a common element.
Quadratic equations in groups
349
For complicated quadratic equations, exact descriptions of all possible solutions are not known even in the free group F. There is, however, one general result which we shall find very useful. This is the “rank formula” of Lyndon [ l o ] which states that if W ( a , ,. . . , a,) is a canonical form involving n generators and ( a , ,. . . , a,) is a solution of W = 1 in F, then the rank of the subgroup H = Gp{al,. . .,a,} generated by the entries in the solution must satisfy the bound rank(H) S [ n / 2 ] (where, as usual, [ m ] denotes the greatest integer not exceeding m ) . W e shall give a simple proof of the rank formula in the next section. In studying solutions of a particular equation W = 1 it turns out that we must consider a larger class of equations. W e shall denote by % ( W ) the set of all quadratic words of @ which are endomorphic images of W. (If W is orientable then %( W ) consists of all orientable quadratic words which have Euler characteristic greater than or equal to x ( W ) . If W is non-orientable, %( W ) consists of all non-orientable quadratic words of characteristic greater than or equal to x ( W ) , and all orientable quadratic words with Euler characteristic strictly greater than
XW.1
We now turn to the definition of a cancellation diagram M on a compact surface S. First of all, M is a tessellation of S and there is a function assigning to each oriented edge e of M a label + ( e ) in the free group F such that + ( e ) # 1 and + ( e - ’ ) = + ( e ) - ’ . Let e be an unoriented or “geometric” edge of M. If e is on the boundary of two distinct regions D , and D z , we say that e occurs with multiplicity one on the boundaries of DI and D,. If e occurs only on the boundary of a single region D, we say that e occurs with multiplicity two in the boundary of D. If D is any region of M , the degree of D, written d ( D ) , is the number of unoriented edges, counted with multiplicity, in the boundary of D. Let R be a symmetrized.subset of F. W e say that M is an R diagram if for every region D of M there is a closed path 6 with the following properties. All the edges in 6 lie in dD, and for each pair { e , e - ’ }of oriented edges lying in the boundary of D, the total number of occurrences of either e or e-’ in S is equal to the multiplicity with en say, which the geometric edge e occurs in aD. Further, if S = e l and +(el) = c, E F, then the product r = c I . c, is cyclically reduced without cancellation and is an element of R. If we are given one such S then, since R is symmetrized, any path which is a cyclic permutation of S or S-I also has the same properties. W e shall assume then that, when we have an R-diagram M, there is specified for each region D of M a class of paths, called distinguished boundary cycles of D, such
+
350
P.E. Schupp
that each path has t h e properties of S above and the class is closed under cyclic permutations and inverses. We now recall the basic “non-triviality” condition on R -diagrams. If there are elements rl = cbl and r2 = cb2 of R with r2# rl then c is called a piece. In the plane, we say that an R-diagram is reduced if the label on every interior edge is a piece. If M is an R-diagram which tessellates a compact surface S, we say that M is reduced if the label on every edge is a piece. Geometrically, this means that if e is an edge on the boundaries of regions DI and D 2(not necessarily distinct) and if Si is the boundary cycle of Di beginning with e, then +(Sl)# +(S2). (In the case that D 2= D1, S1 and S2 are boundary cycles of D1beginning with distinct occurrences of e.)
Figure 1.
The basic connection between non-free solutions of quadratic equations and reduced diagrams is given by the following.
Theorem 1. Let W E @ be quadratic, and let G = ( X ;R). If W is nonorientable, assume that R contains no proper powers. If G admits a non-free solution of the equation W = 1, then there is a reduced R diagram on a surface S defined by some U E %( W). The above theorem requires n o hypothesis on R except the exclusion of proper powers in the case where W is non-orientable. In this generality, we cannot hope to say much about whether or not all solutions of specific equations are free. In recent years, small cancellation theory has become an important part of combinatorial group theory, and it turns out to be precisely the tool needed here in order to obtain sharp results. We now assume that the reader is familiar with the basic notation and hypotheses of small cancellation theory. (See, for example, [9], [lo], or [16].) We need here mainly an understanding of the hypotheses, for we shall actually use very few results.
Quadratic equations in groups
351
Theorem 2. Let W E @ be a quadratic word with x = x ( W) S 0. Let G = ( X ;R ), and assume that if W is non-orientable then R contains no proper powers. Suppose that R satisfies any one of the small cancellation hypotheses C(7- 6x), or C(5 - 4x) and T(4), or C(4- 3x) and T(6). Then all solutions of W = 1 in G are free. We turn to a consideration of some of the consequences of the theorem. Solutions of quadratic equations W = 1 where the Euler characteristic of W is non-negative have previously been investigated by a variety of ad hoc methods.
Remark 1. Let W be a 2 ,which defines the projective plane. Since, of course, x ( W ) = 1, the statement of the theorem does not apply. We shall see from the proof however, that if we assume that R contains no proper powers and satisfies C(6), or C(4) and T(4), we can conclude that G has no elements of even order. It is fairly well known that if R contains no proper powers and satisfies one of the stronger metric hypotheses, either C’(b) or C’(t) and T(4), then G is, in fact, torsionfree. These results were established by Greendlinger [5] and Soldatova [18]. It has been strongly conjectured that the torsion result is still true assuming only C(6), or C(4) and T(4), but the methods here get only “half way” towards this goal. Seymour [17] has shown that if R is finite, contains no proper powers, satisfies C(4) and T(4), and no piece has length greater than one, then G is torsion-free. Chris Chalk [l] has recently proven that if R contains no proper powers and satisfies C(7), or C(5) and T(4), then G is torsion-free. Remark 2. Let W be aPa-’P-’, which defines the torus. If we assume that R satisfies C(7), or C(5) and T(4), we conclude that two elements of G commute only if they are powers of a common element. This result was established by Greendlinger for R satisfying C(!)[6], and subsequently [7] strengthened to C’(i). Comerford has extended these results to small cancellation quotients of free products. Remark 3. Let W be either aPaP-’ or a2P2,which define the Klein bottle. We call an element of G real if it is conjugate to its own inverse. Since the only real element in a free group is the identity, if we assume that R contains no proper powers and satisfies C(7), or C(5) and T(4), then we conclude that G has no real elements except the identity. Equivalently, we conclude that whenever square roots exist in G, they are unique. Comerford [2] established that if R contains n o even powers and satisfies C’(a), or C’(a) and T(4), then G
352
P.E. Schupp
has n o non-trivial real elements. Earlier, Gowdy [4] had proved that if R contains no proper powers and satisfies C'(k), then G has n o real elements. Remark 4. Let W be a 2 p Z y z which , defines the non-orientable surface with Euler characteristic - 1. It is a well-known result of Lyndon [8] (and an immediate consequence of his rank formula), that if a 2 b Z c = 21 in the free group F then a, b, and c are all powers of a common element. Thus if we assume that R contains n o proper powers and satisfies either C(13), or C(9) and T(4), the same result holds for G = ( X ;R ) .
For more complicated quadratic equations W = 1 we lack explicit descriptions of the solutions even in the free group F. We do, however, transfer the rank formula to all those groups G = (X; R ) for which we can prove that all solutions of W = 1 are free. Remark 5. Another interesting consequence of the theory is the following result. Let W ( a l , . .,a,) be a canonical form involving n generators, and let S be the surface defined by W. The standard presentation of the fundamental group G of S is G = ( x l , . . . ,x, ; W ( x , ,. . ., x.)). Let V be any quadratic word involving k generators where k < n. Then all solutions of V = 1 in G are free. It is tempting, in view of the known results on torsion elements and real elements, to attempt to weaken the hypothesis that R contains no proper powers to assuming only that R contains no even powers. The following example, due to Comerford, shows that this cannot be done in general, regardless of how good a small cancellation condition is assumed. Let G = ( x,..., ~ x ~ - ~ ; ( x : . - . x : - ~2k+1 ) )
where g > 2 . Let W be a ; . - - a i Then . (xl,..., X ~ - ~ , ( X ; - - * X ~ - , ) is ~ )a solution of W = 1 in G. The subgroup generated by the entries in the solution is all of G, and has rank g - 1. Thus the solution is non-free by the rank formula. The above presentation satisfies an arbitrarily prescribed cancellation condition C ( m ) if k is chosen large enough. It is an important theorem of Nielsen [14] that if G is the r) where r is fundamental group of a compact surface S, say G = ( X ; one the canonical relators, then every automorphism of G is induced by an automorphism of the free group F = (X). (See also Zieschang, Coldewey, and Vogt [19].) Let r = W(xl,. . . ,xm). If 8 : G + G is any
Quadratic equations in groups
353
endomorphism of G, then, of course, t h e n-tuple (8(xl), . . . , 8(x.)) is a solution of W ( a I ,... , a,)= 1 in G. Assuming that 8 is one-to-one, the rank of the subgroup generated by (8(x,), . . . , 8(x,)) is n, the rank of G, and we thus have a non-free solution. Let R be the symmetrized subset of F generated by r. It turns out that if x ( S ) < O , then the possible tessellations of S by reduced R -diagrams are severely limited; indeed, such a tessellation must have only one region. We shall use this fact to deduce that the n-tuple (8(xl), . . . , 8(x,)) is obtained from (xl, . . . ,x,) by Nielsen transformations, and thus 8 is induced by an automorphism of F. 02. Quadratic words and equations
In this section we collect the results which we need about quadratic words and equations in free groups. We assume that the reader is familiar with Nielsen transformations and the fact that they generate the automorphism group of a free group of finite rank. (See [lo] or [ll].)Let @ = ( a l ,az,. . .), and let W ( a l , . . , a,)€ @ be quadratic. Let F = (X) be a free group, and let u = ( a l , .. . , a,) be an n-tuple of elements of F. If we substitute ai for a, in W, we obtain the element W(al, . . . , a , ) = w € F. Thus ( a l , .. . , a,) is a solution of the equation W ( a I ,...,a,)= w in F. We need to consider the process of performing certain attached transformations, which will be defined by the list below, to the triple W, w, and u.The transformations will, in each case, consist of applying an endomorphism of @ to W to obtain a quadratic word U(pl,. . .,p k ) , applying an inner automorphism of F to w to obtain u, and replacing the n-tuple u = (al,.. .,a,) by a k-tuple 7 = ( b l , .. . ,b k )such that U ( b l , .. . ,b,) = u. For ease of notation we adopt the following convention. We use a,p, y,. . . to denote a letter of @, that is, a generator of @ or its inverse. If we are considering a particular transformation, we shall write W(. . . ,a,., . ,p, . . . ) displaying only the generators which are involved in the transformation. In defining an endormorphism of @, t h e endomorphism is understood to fix all generators not explicitly mentioned in the definition. Given a tuple ( . . . , a , . . . ,b , . . . ) it is understood that the element of F denoted by a Roman letter is substituted for the corresponding Greek letter. If we replace the tuple (. ..,a , . , . ,b, . . .) by a new tuple (2,.. .,a , . . ., b, . . . ) it is understood that all entries not explicitly mentioned are the same in the new tuple as in the old, and that new letters denote new entries in the tuple. We write a = bc to mean that a is freely reduced as written, that is, there is no cancellation between b and c.
P.E. Schupp
354
Given a quadratic W ( a l , ..,a,) . and a tuple u = (a,,...,an) with W ( a , ,. . . ,a,) = w, a transformation of any of t h e following types is said to be attached to the triple (W, w, u). (1) Suppose that ap occurs in W ( . . .,a,.. . ,p,. . . ) and that a = alz, b = z - ' b , where z # 1. Obtain U ( [ ,. . . , a , . . . , p,. . . ) from W by applying the automorphism of @ defined by a + ap, p + p - ' p where [ does not occur in W. Replace the n-tuple ( . . . , a , . . . , b,. . . ) by t h e (n + 1)-tuple (2,. . . , a , , . . , ,b , , . . .). It is clear that U is quadratic and that U ( z ,. . . , a , , . . . , b , , . . .) = w. Leave w unchanged. (2) Suppose that ap or p - l a - ' occurs in W. Obtain U ( . . .,a,. . . , p,. . . ) from W by applying t h e automorphism of @ defined by a + ap-', and replace the tuple ( . . . , a , . . . , b,. . . ) by the tuple ( . . ., a , , .. . , b,. . . ) where a , = ab. Note that t h e new tuple is obtained from the old by a Nielsen transformation. (3) Suppose that a occurs in W and that a = alz. Obtain U ( [ , .. . , a,. . . ) from W by applying the automorphism of @ defined by a + a[ where [ does not o c p r in W. Replace the n-tuple ( . . . , a , . . . ) by the (n + 1)-tuple (2,. . . , a l ,. . .). Leave w unchanged. (4) Suppose that W begins with a. Obtain U from W by applying the conjugation W + a-'Wa, and do not change the tuple ( . . . ,a , . . . ). Replace w by a-lwa. ( 5 ) Suppose that a occurs in W ( . . . , a,.. . ,p, . . . ) and that a = 1. Obtain U from W by the endomorphism of @ defined by a + 1. Replace the n-tuple ( . . . , a , . . . , b,. . .) by the (n - 1)-tuple (. . . , b,. . .). Leave w unchanged. We call a transformation of type ( 5 ) a singular transformation. Suppose that we are given a quadratic word W and a solution u = (a,,. . . ,a,) of W ( a , ,. ..,an) = w in F. We say that the equation V ( p , ,. . . , p k ) = u and t h e solution 7 = ( b , ,. . . , b k ) are derived from W, w and a by a sequence of attached transformation if there is a sequence
(W, W ,u)= (Woi ~
0 ~ ,o
) +
*+
(Wn, w n , a n ) =
(U, u, 7 )
where each (W,,,,w , + , ,a,+l) is obtained from (W,,w,, a,)by a transformation attached to (W,, w,,a,). If a E F, then la I will denote the length of a relative to t h e fixed basis X of F. We define the length of an n-tuple by adding the lengths of the entries, I ( a , , .. . , a , ) [ = Z:=, 1 a, 1. Note that if we derive the tuple ( b , , .. . , b,) from the tuple (a,,. . .,a,) by an attached transformation of ~ ,an)[. type (l), then I(bl ,..., b , ) / < ( ( a,... Let W E @, say W = a,,* * * a,m.We say that the n-tuple (a,,. . . , a , ) (of elements of the free group F) is cancellation-free in W if no a, = 1, and if w = W ( a , ,. . . ,a,) = a,, . . . a," is cyclically reduced as
Quadratic equarions in groups
355
written. In other words, no cancellation occurs, even cyclically, between any of t h e a,. Thus ( x y x , y x y ) is cancellation-free in azpz,while (xy-I,y x ) is not cancellation-free in a’@’. Lemma 1. Let W ( a ,..., I a , ) € @, let ( a ,, . . . , a n ) be an n-tuple of elements of F, and let W ( a , ,. . . , a n )= w. Then there is a triple ( U ( p ,..., , pk),(bl, . . . , b k ) , u )derioed from ( W , ( a l ,..., a n ) , w )by a sequence of attached transformations such that u is a cyclically reduced conjugate of w and ( b , , . . . , bk) is cancellation-free in u. Proof. Suppose that a,a,occurs in W while a, = a : r and a, = z - l a ; with z # 1. Then apply the transformation of type (1) where U is obtained by applying t h e automorphism a, -+a,( and a,+ ( - l a , and the tuple ( . . . , a,, . . . , a,, . . . ) is replaced by the tuple (2,. . . , a : , . . ., a ; , . . .). In this case the length of the new tuple is less than the length of the old tuple. If any of the entries ak = 1, apply the appropriate singular transformation sending a k + 1. If W ( a , ,. . . , a,) is not cyclically reduced, a conjugation of type (4) brings us to t h e case considered above. Since each transformation of type (1) produces a tuple of shorter length, a finite number of transformations must lead to a U and ( b , , .. . , b k ) which is cancellation-free in u.
Lemma 2. Let W be a quadratic word of @, and let V E @ be any quadratic word which is an endomorphic image of W. Then a cyclically reduced conjugate of V is obtainable from W by a sequence of attached transformations. Proof. By assumption, there are words Y,, . . . , Y,,of @, not necessarily By Lemma 1, some cyclically quadratic, such that V = W (Y , ,. . ., Yn). reduced conjugate V* of V is obtaina,ble by a cancellation-free substitution ( Z , , .. . , Z k )into tr(pI,.. . , p k ) where U is derived from w by a sequence of attached transformations. Since both U and V are quadratic and the substitution is cancellation-free, a generator which occurs in some Z , can occur only in that Z , . By transformations of type (3) we can “subdivide” each pi into a product of (Zi1 generators. If the result is U * ( y , ,..., y m ) then V* is obtainable from U * by a letter for letter substitution and thus differs from U * only by a permutation of the generators and their inverses. Such a permutation is obtainable by a sequence of attached transformations. 0
The following lemma is well-known.
356
P.E. Schupp
Lemma 3. Let W E 4j be a quadratic word. If U is obtained from W by an attached transformation which is an automorphism of 0,then ,Y( U ) = x ( W ) . If U is obtained from W by applying one singular transformation then x ( W )s y, ( U )S y, ( W )+ 2. In view of Lemmas 1 through 3 we have the
Corollary. I f W is quadratic and U is a quadratic endomorphic image of W , then x ( U ) a , y ( W ) . Let F be a free group. Recall that a sequence y l , . . . , y , of elements of F is said to be Nielsen-reduced if the following three conditions hold. (Nl) Each y,# 1. (N2) More than half of y , does not cancel in either of the products y,-,y, or Y , Y , + ~ . (N3) No y , cancels exactly half in both products y,-,y, and y , ~ , + ~ . A set S of elements of F is said to be Nielsen-reduced if every sequence y I , . . . , y , of elements y, E S" such that n o y,+, = y;' is Nielsen reduced. If T is a finite subset of F, then a finite number of Nielsen transformations lead from T to a set S which is Nielsen reduced. (For a fuller discussion see [lo].) We shall need the following relativization of the concept of being Nielsen reduced. Let W ( a l , .. . , a,)E @, say W = a :,'* . * a : I where each E , = k l . Let ( a l , . .. , a , ) be an n-tuple of elements of F. We say that (a,,. . . ,a,) is Nielsen-reduced relative to W if the particular sequence a:,', . . . ,a:; satisfies conditions (Nl) through (N3). In short, we consider only the products which result from substituting the a, into W. Transformations of types (2) and ( 5 ) effect Nielsen transformations on the tuples involved. It is easy to see that given any W ( a , , a , ) E @ and any n-tuple ( a l , .. . , a n ) we can, by a sequence of attached transformations, of types (2) and ( 5 ) , obtain u(&,. . . ,p k ) and tuple (b,, . . . , b k )which is Nielsen-reduced relative to U. Lemma 4 (Lyndon's Rank Formula). Let W E @ be a quadratic word which involves n generators and which is of minimal length among its images under automorphisms of @. Let F be a free group, and let (a,, . . . , a n ) be any n-tuple of elements of F such rhar W ( a , ,. . . ,a,) = 1. Then the rank of the subgroup H generated by { a l , .. . , a.} is less than or equal to [ n / 2 ] .
Quadratic equatwns in groups
357
Proof. It is well-known that any canonical form has minimal length among its automorphic images. Let W * be the canonical form which is an image of W under an automorphism of @. Since W is of minimal length under automorphism, 1 W*l = I W I , and thus W * involves n generators. Hence, x ( W )= x ( W * )= 2 - n. There is a sequence of attached transformations of types ( 2 ) and ( 5 ) leading from W and ( a l , .. . ,a.) to U ( p l , .. . , p k ) and a tuple ( b , ,. . ., b k )which is Nielsen reduced relative to U and with U ( b l , .. . , b,) = 1 . Now the product of elements in a Nielsen reduced sequence y,, . . .,ym with m 3 1 cannot be equal to the identity. We conclude that U = 1. Let d be the number of singular transformations applied in obtaining U from W. Since a singular transformation is applied only when an entry in the tuple is the identity, it is clear that rank(H) z s ( n - d ) . Since x ( W )= 2 - n, and x(1) = + 2, in going from W to U the Euler characteristic is raised by n. According to Lemma 3, each automorphism preserves the Euler characteristic, while one singular transformation can increase the Euler characteristic by at most two. Thus the number d of singular transformations must be at least n / 2 if n is even, and at least ( n + 1)/2 if n is odd. Hence, rank(H)S(n - d ) S [ n / 2 ] . 0 As usual, let F = ( X ) be a free group, and let G = (X; R ) be a quotient group of E Let W ( a , ,. . . , a , ) € @, and let ( a l , .. . , a , ) be a non-free solution of W = 1 in G. In the next section we shall transform given equations and solutions, and we shall need to know that we preserve the non-freeness of the solutions. All the situations which will be encountered are covered by the following lemma.
Lemma 5. Suppose thar ( a l , .. . , a , ) is a non-free solution of W ( a l , . ., a , ) = 1 in G. Suppose thar there are words Z, ( P I , .. ., p k ) of @, and a k-tuple of ( b , ,. . . , b k )of words of F such that a, = Zi(bl,.. . , b k ) in G for i = 1,. . . , n, and U ( p , ,. . . , P k ) = W ( Z l ,. . . , Z n ) in @. Then ( b l ,. . . ,b , ) is a non-free solution of U = 1 in G .
Proof. If the solution ( b ] , . . , b , ) of U = 1 in G were free, there would be words b : , . . . , b : such that each b f = b, in G and U ( b : , .. . , b : ) = 1 in F. But Z , ( b : , . .., b : ) = a, in G and
W ( Z l ( b : ,..., b : ),..., Z . ( b : , .. ., b : ) ) = U ( b : , . . . , b : ) = 1 in F. This contradicts ( a l , .. , , a n ) being a non-free solution of W=l. 0
358
P.E. Schupp
%3.Cancellation diagrams on compact surfaces
We now assume that the reader is familiar with the basic facts about cancellation diagrams in the plane. (See Chapter V of [lo].) Let F = (X) be a free group, let R be a symmetrized subset of F, and let G = (X; R). Let N be the normal closure of R in F, so that G = F / N , and let T : F + G be the natural map. Let W ( a , ,. . . , a , ) E 4j be a quadratic word, and suppose that there is a non-free solution ( a l , .. ., a , ) of W = 1 in G. Note that W ( a l , .. ., a , ) is a non-trivial element of N. We consider the following
Choice. Among all U E 8( W ) and all tuples of words (bl, . . . , b k ) which are non-free solutions of I/ = 1 in G, pick a pair U and ( b l , . ..,b k ) such that (1) The number, rn, of regions in t h e minimal connected simply connected R-diagram M in t h e plane with boundary label U ( b l , . . , b k ) is as small as possible. (2) Subject to (l), the length of the free group element U ( b l , .. . ,bk) is as small as possible. (3) Subject to (1) and (2), t h e length l(bl, ..., b k ) l of t h e solution tuple is as small as possible. . and ( b l , . . . ,b k ) refer to t h e pair In the next sequence of lemmas, U chosen above, and M is a minimal connected simply connected R diagram in the plane with boundary label U ( b l , .. . ,b k ) . Lemma 6. The tuple ( b , , . . . , b k ) is cancellation-free in U. Proof. If cancellation occurs we can, by an attached transformation as in Lemma 1, obtain V ( y l , .. . , y m ) and a shorter tuple ( c l , .. . , c”). Since the free group element u = V(cl, . . . , c,) is a conjugate of u = U(b1,.. . , b k ) , the minimal R-diagram with boundary label u still has only rn regions. This contradicts our minimal choice. 0 By using transformations of type (3) we can “subdivide” the variables so U ( p , ,. . . ,& ) is such that each entry bi in ( h , . . , b k ) is a generator or the inverse of a generator of F. We now assume that this has been done.
Lemma 7. The boundary cycle of M is a simple closed path. Proof. Suppose that M is not bounded by a simple closed path and let K be an extremal disk of M. (See [lo] or [13].) Both K and M \ K
Quadratic equations in groups
359
contain regions. Since each letter of U is replaced by a letter from XI',the label o n t h e boundary of K is a substitution instance of V ( p , , , ..,&) . where some cyclic permutation of U has the form VZ. First suppose that V is quadratic, so that for each p, which occurs in V, both of t h e occurrences of p, are in V . Then both V and Z are quadratic. Substituting the appropriate b, into V and Z yields solutions of the quadratic equations V = 1 and Z = 1 in G. If both of these solutions were free, then ( b , ,..., b k )would be a free solution of U = 1 in G, which is not the case. On t h e other hand, if at least one solution is non-free, say that of V = 1, then there is an R-diagram, namely K, with fewer than m regions verifying that V (b , ,., . .,b,")= 1 in G. This contradicts t h e minimality of our choice. We are left with the possibility that some generator p occurs exactly once in V. Write V = V l p V 2 .Since V(b,,,. . . , b,[)is t h e boundary label of an R-diagram, it follows that V( b , ,., . . , b,,)= 1 in G. Let u1= V2(b,,, . . . , b,,),and u 2 = V 2 ( b , , ., . ,b,[).Solving for 6 we have b = u ; ' u ; ' in G. Obtain U * from U by replacing p by V ; ' V ; ' and replace ( . . . , b , . . . ) by ( . . . , u ; ' v ; I , . . . ). Obtain the R-diagram MI from M by removing t h e extremal disk K and reattaching K by sewing, in the appropriate orientation, the edge labelled by p in K to t h e edge labelled by p in M \ K . Now M 2 has the same number of regions as MI, and a boundary label of MI is u * = U * ( .. . , u;Iu;', . . . ). But u * is shorter than u. This contradicts the minimality of our choice, so we must conclude that M is bounded by a simple closed path. 0 Since t h e label on the boundary of M is a cancellation-free substitution instance of U, we can regard aM as a polygon whose boundary "has t h e form U". We can then identify the edges in aM to
M
Figure 2.
360
P.E. Schupp
obtain an R-diagram, which we also call M , on the surface S defined by U. Let D be a region of the diagram M in the plane, and let 6 = e l * * e, be a boundary cycle of D. Considering D as a region of the diagram M o n the surface S, a distinguished boundary cycle of D will be the sequence e l * et after identification.
Lemma 8. The diagram M is still reduced on the surface S provided that, if W is non-orientable, we make the additional assumption that R contains no proper powers. Proof. Recall that M is reduced if, for any edge e on the common boundary of regions D1and D 2(which may coincide), the labels +(al) and +(az) are not the same, where Si is the boundary cycle of D i which begins with e. Since the diagram M was reduced in the plane, we need only consider the edges identified according to U. Suppose that the identification of two edges e , and e2 o n the boundaries of two distinct regions makes the diagram not reduced. (See Figure 3 which pictures the orientable case.)
M
Figure 3.
Obtain the diagram M' in the plane by removing the two edges e l and e2. Now M' has boundary label equal in G to U ( . . . , b,. . .), for if b is the label on the edge e, and the boundary cycle S of D has label +(a) = bh E R, then we have replaced b by h-I. The diagram M' has fewer regions than M, contradicting the minimal choice. It is conceivable that two edges of the same region D are identified in such a way that the diagram becomes not reduced on the surface S. If the identification of the two edges is an orientable identification (in the obvious sense that the edges are labelled by b and b - ' ) then the diagram being non-reduced means that the label r on D is conjugate to r-' in the free group F, which is impossible.
Quadratic equations in groups
361
Figure 4.
There remains the possibility of a non-orientable identification making the diagram not reduced. (See Figure 5 . )
Figure 5 .
Let D have boundary cycle e l v l e 2 q 2where c#J(e,)=b = c$(ez), and c#J(v2) = z. The diagram being non-reduced on S means that bybz = bzby in the free group E Thus y b z y - ' b - ' z - ' , which is equal to the commutator ( y b ) ( z b ) ( y b ) - ' ( z b ) - 'is, equal to 1 in E Since this implies that yb and z b are powers of a common element, the label r = bybz must be a proper power, but we have assumed that this is not the case.
c#J(vl)= y ,
We have completed the proof of Theorem 1 and now consider small cancellation theory. In our constructions so far, it has been convenient to assume that edges are labelled by generators or their inverses. We now want to d o t h e opposite, eliminating as many unnecessary vertices as possible. If u is a vertex of a diagram M, either in the plane or on a surface, we use d ( u ) to denote the degree of 11, that is, the number of edges incident at u, where, if both endpoints of an edge are a? u, we count t h e edge twice.
Lemma 9. Let M be an R-diagram on a compact surface S, where M is obtained by {dentifying the boundary of an R-diagram in the plane as in the proof of Theorem 1. (Thus S is not the sphere.) W e can modify M so
362
P.E. Schupp
that M has no vertices of degree less than three unless S is the projective plane. If S is the projective plane, we may assume that M has at most one vertex of degree two. Proof. Consider t h e diagram M in the plane before identification. Since M is an R-diagram with cyclically reduced boundary label, M has no vertices of degree one. If u is an interior vertex of degree two, say u separates edges e l and e2, we can simply delete u and combine e l and e2 into a single edge e with label 4 ( e ) = 4(el)4(e2). Suppose that there are successive edges e l and e2 in dM separated by a vertex u which has degree two in the plane, and where the labels on e l and ez correspond to successive occurrences of a and p in U with p # a?'.Now u will have degree at lest three after identifying the boundary edges of M unless the edges e3 and e4 corresponding to the two other occurrences of a and p in U are situated so that the end of e , is the beginning of e4. This means that in U (or a cyclic permutation of U ) the occurrences of a and p are in two subwords of the form (ap)%'.We can replace these occurrences of ( Q P ) ~ by ' y" where y is a new variable not occurring in U . Delete the vertex u, combining e l and ez into a single edge e with label 4 ( e ) = +(el)4(e2). Similarly, combine the two edges corresponding to the other occurrence of ( a p ) = l into a single edge. (In short, all that we have done is to eliminate an unnecessary subdivision.)
Figure 6.
Iterate the above process, eliminating as many vertices of degree two as possible. The only situation not covered is that of a vertex u separating edges e l and e2 corresponding to successive occurrences of a letter a in U. Now u will certainly have degree at least three after the identification of dM unless e l and ez are all of dM. In this case, U is a2,the surface S is the projective plane, and u is the only exceptional vertex. 0 From now on, when we consider a reduced R-diagram M tessellating a compact surface S, we assume that M has no vertices of
Quadratic equatwm in groups
363
degree less than three; except for the possibility of one exceptional vertex in the case of the projective plane. Since M is reduced, the label on every edge is a piece and t h e small cancellation hypotheses have their usual geometric meanings. (See [9] or [ l o ] . )If R satisfies the condition C ( k ) , then no element of R is a product of fewer than k pieces, and thus d ( D ) a k for every region D of M. If R satisfies the condition TO') and u is a vertex of degree at least three, then, in fact, d ( u ) 3 j . Thus every vertex of M has degree at least j , except possibly the one exceptional vertex in the case that S is the projective plane. The basic tool of small cancellation theory is the summation formula given below. We use the following convention on summation signs: if M is a fixed diagram, CM [ p - d ( D ) ] denotes the sum of [ p - d ( D ) ] for all regions D of M, while E M [q - d ( u ) ] denotes [q - d ( u ) ] summed over all vertices u of M. We shall usually omit the subscript naming the diagram.
Lemma 10 (The summation formula). Let M be a tessellation of a compact surface S. Let p and q be positive integers with l l p + l/q = 112. Then
Proof. Let M have V vertices, E edges, and F regions. (This is our only departure from using F to denote a free group.) Euler's formula is
(1)
x ( S )= V - E
+ F.
Since we have counted degrees with the appropriate multiplicity,
(2)
2E
=
2 d ( u )= c d ( D ) .
Let n be a positive real number. We have
(3)
2(n + l)X(S)= 2(n + l)V - 2(n + l)E + 2(n + 1)F.
Using (2), we can eliminate E from (3), (4)
2(n + l ) X ( S )= 2(n + l ) F -
2 d ( D ) + 2(n + l)V - n c d ( D ) .
Since V is the number of vertices and F is t h e number of regions,
P.E. Schupp
364
Let p = 2(n + 1) and q = 2(n Substitution in (5) yields
+ l)/n. Then
n = p / q and l/p
+ l/q = 1/2.
which is the desired formula. 0 We can now prove Theorem 11. Suppose that we start with a quadratic equation W = 1 where x = x ( W ) is not positive; and a nonfree solution of W = 1 in the group G = (X; R). If W is nonorientable we assume that R contains no proper powers. Then there is a reduced R-diagram M on a surface S defined by an endomorphic image U of W. Thus x ( S ) = x ( U )a x ( W ) . Consider the summation formula with @, q ) = (6,3).We have
6x(W)"6x(S)=C [6- d(D)]+2 C [3- d ( ~ ) ] . M
M
The basic idea is simply to assume a good enough small cancellation condition to force the right-hand side to be less than p times x ( W ) . Supposing that S is not the projective plane, all vertices of M have degree at least three. Since M has at least one region, the equation (*) is impossible if R satisfies the small cancellation condition C(7- 6 x ) . If S is the projective plane, x(S)= + 1, and we have
6=
(**)
c [6 M
+ 2 C [ 3 - d (U )].
- d(D)]
M
Since there is at most one vertex of degree two, this equation is impossible as long as R satisfies C(6). In summary, if R satisfies C(7- 6 x ) then the required diagram M cannot exist. We conclude that there cannot be a non-free solution of W = 1 in G. By letting @, q ) = (4,4), we see that it is sufficient to assume that R satisfies C(5 - 4 x ) and T(4). If we set (p, q ) = (3,6),we see also that it suffices to assume that R satisfies C(4- 3 x ) and T(6).
64. Automorphisms of surface groups Let W ( a I,..., a,) b e o n e o f thecanonicalforms[al,az]...[a.-l,a,] or a:. . at. Let F be the free group on X = { x l , .. . ,x,,}, let r = W ( x , ,. . . ,x,,), and let R be the symmetrized closure of r in F. We say that (X; R ) is the standard presentation of the fundamental group G of t h e surface S defined by W. We first observe some facts about the standard presentations.
-
Quadraiic equaiions in groups
365
Lemma 11. Let ( X ;R ) be the standard presentation of the group of the surface S defined by W(crI,.. . , a n ) .Then R satisfies both of the cancellation conditions C ( 2 n ) and T ( 2 n ) . Proof. We prove t h e lemma in t h e case where R is the symmetrized closure of x i . . . x'.; t h e orientable use is analogous. T o verify C ( 2 n ) it suffices to observe that any two-letter word, of the form x: or X , X , + ~ , has at most one ocurrence in any element of R. Thus a piece consists of only one letter. To verify T ( 2 n ) our assumption is that we have elements T I , . . .,r, of R with each r,+' # r ; ' , r h # r;', and that there is cancellation in each of the products r l r z , r2r3,. . . , rh-'rh,rhrl. Now h > 2 and one of r I or rz must be of t h e form (xf . . . x'.xX:. ~ f - ~ ) and * ' the other of the form X : - ~ X , ) ~ ~By . symmetry, we may suppose that rl = x i * * x'.. (X,X:+~ Then we must have rz = X;'X,!~ * * * x;'x;'. Consequently, r3 = XZ.xl2 . .. x i - , . Continuing thus, we see that it takes at least 2n steps to obtain an r h which can cancel against r l . Thus h 2 2 n . 0
Lemma 12. Let G = ( X ;R ) be given by the standard presentation of the fundamental group of the surface S defined by a canonical form W ( a I ,...,a,)where W inuolues n > 2 generators. Let V ( P l , . . , P k ) be a quadratic word inuoluing k generators where k < n. Then all solutions of V = 1 in G are free. Proof. The canonical form V* to which V is equivalent under an automorphism of @ involves at most ( n - 1) generators. Thus x ( V ) s 2 - ( n - 1) = 3 - n. If G admits a non-free solution of V = 1, then there is a reduced R-diagram M on a surface S * defined by an endomorphic image of V: Using t h e summation formula with (p, q ) = (4,4) we have
4(3-n)<4x(S')=
M
[ 4 - d ( D ) ] + C[ 4 - d ( u ) ] . M
First assume that S * is not the projective plane. Since R satisfies C ( 2 n ) and T(2n) the degree of every region and every vertex is at least 2n, so every term in t h e right-hand side of (*) is non-positive. Since M has at least one region and one vertex equation (*) implies that
4(3- n ) < ( 4 - 2 n ) + ( 4 - 2 n ) , which says that (12- 4 n ) s ( 8 - 4 n ) , a contradiction. If S * is t h e projective plane, then 4 x ( S * )= 4 and there is at most one vertex of
P.E. Schupp
366
degree two, so the equation (*) clearly cannot hold. Thus assuming the existence of a non-free solution of V = 1 yields a contradiction. 0
Lemma 13. Let S be a surface defined by a canonical form W ( a I , .. . ,a,) where n 3 3 . Let ( X ;R ) be the standard presentation for the group G of S. If M is a reduced R-diagram which tessellates S, then M has only one region. Proof. Since W is a canonical form, x ( S ) = x( W) = 2 - n. The summation formula with @, q ) = (4,4) says that 4(2- n ) =
M
2
(4- d ( D ) ) + M (4- d ( u ) ) .
Let k be the number of regions of M. Since R satisfies C(2n) and T(2n) and M has at least one vertex, we have or Since n
4(2 - n ) S k (4 - 2n) + (4 - 2n), Os(k-1)(4-2n). 3 3,
this implies that (k - 1) G O . 0
We note that in the case n = 2 where the surface is the torus or the Klein bottle, there are reduced R -diagrams with arbitrarily many regions tessellating the appropriate surface. In the case of the torus, when R is the symmetrized closure of xyx-'y-', the minimal R diagram with boundary xy'x-'y-' has k regions, and xy'x-'y-* is the result of a cancellation-free substitution into aPa-'P-'. We now turn to a proof of Nielsen's theorem on automorphisms of surface groups. Let S be a surface defined by a canonical form W ( a , ,. . ., a,) which involves n generators. Let G = (X; R ) be the standard presentation of the fundamental group of S. If n = 1, the theorem is trivial. If n = 2 and S is orientable, then G is the free abelian group on two generators, and the theorem is easy to prove. (See [ll].) The proof which we give is valid for n at least three, which we now assume. (Whether S is orientable or non-orientable makes no difference.) Let 8 : G --., G be an automorphism of G. The rank of t h e group G is n, so the n-tuple (O(x,), . . . , 8 ( x , ) ) is a non-free solution of W = 1 in G. Let F be the free group on X. In discussing Nielsen transformations on tuples of elements of F one fixes a well-ordering < of the words of F which is compatible with length; if I u I < 1 u 1 then u < u. (See [lo].) The ordering < , induces a lexicographical well-ordering, which
Quadraric equarions in groups
367
we also denote by < , o n n-tuples of elements of E This ordering is used to decide whether or not to perform Nielsen transformations which leave t h e lengths of the entries in an n-tuple fixed. Suppose that we are given a quadratic word U ( p , ,. . . ,p.) E @ and an n-tuple T = (61, ..., 6,) of elements of F. Note that an attached transformation of type (4)does not affect the entries in T, but only cyclically permutes U. We say that (61,. . . ,6,) is cyclically Nielsen reduced relative to U if (61,. . . , 6,) is Nielsen reduced relative to all cyclic permutations of U . In studying t h e authomorphism 0 we shall find it convenient to work with all automorphisms 0’ which are equal to 0 followed by an inner au tomorph ism. Let Kl(W, 0 ) consist of all pairs (W, cr) where cr = ( a l , .. . , a.) is an n-tuple of words of F, and there is an automorphism 0’ of G differing from 0 by an inner automorphism such that a, = 0’(x,) in G for i = 1,. . . , n. Let X ( W , 0 ) be t h e set of all pairs ( U ,T ) such that there is a sequence
(wl,V l ) +
‘
’ --* (wk? gk
)
=
(u,7 )
where ( WI,al) E XI(W , 0) and each ( W , , , ,c , +is ~) obtained from (W,,u,)by an attached transformation of type (2) or (4). Note that in the above definition if a pair (V, T ) is in N ( W, 0), t h e tuple T is derived from t h e tuple cr of a pair in XI(W, 0) solely by Nielsen transformations in the free group E We have, however, the following Observation. Suppose that ( U ,T ) E X ( W , 0 ) with T = (61,. . . , 6 “ ) . If T ’ = ( 6 ;,..., 6 ; ) where 6 : = 6, in G, i = 1,..., n, then ( U , T ’ ) E N ( W , ~ ) .
To prove t h e observation, suppose that ( U ,T ) is derived from an element (W, cr) in XI(W, 0 ) by a sequence of transformations N I ,. . . , Nk. We shall write transformations on t h e right, so ( U ,T ) = (W, u ) N I . . * N k . Now ( U , nil . . . Nil is a member (W, u’)of XI(W , 0). So ( W, a’)NI . . . N k = (U,T’),verifying that ( U ,7’)E X ( W, 0). We now make essentially t h e same minimal choice as in the proof of Theorem 1, except that we use the ordering < in the third condition. Choice. Among all elements of K ( W , 0 ) pick a pair U and (61,. . . ,6.) such that: (1) The number of regions in the minimal connected simply connected R-diagram M in the plane with boundary label U(61,. . . ,6,) is as small as possible.
368
P.E. Schupp
(2) Subject to (l), t h e length of t h e free group element U ( b 1 , .. . , b.) is as small as possible. (3) Subject to (1) and (2), the n-tuple ( b l , ..., b n ) is as small as possible in the ordering K . For the remainder of this section, we assume that U and ( b l , . . . , b,) are chosen as above, and that M is a minimal connected simply connected R-diagram in t h e plane with boundary label U ( b l , .. . , b,). If u = ( a l , .. . ,a , ) is the tuple in a pair ( WI,a )E Xi(W, O ) , then the rank of the subgroup of F generated by { a l ,..., a , } is n. Since ( b l , .. . , b,) is obtained from such a tuple by a sequence of Nielsen tranformations, the rank of t h e subgroup generated by {bi, . . . ,b.} is also n. In particular, n o b, = 1. Also, applying a fixed inner automorphism of F to each b, cannot yield a tuple of shorter length, for this would violate our minimal choice. Exactly similar to Lemma 6 we have Lemma 14. The tuple ( b l , .. . , b n ) is cyclically Nielsen reduced relative to u.
Let U = p,,,. . ., p,2, in @. It follows that t h e sequence b,,,. . . , b,2m satisfies the Nielsen conditions (Nl) through (N3). In particular, there will be a "middle part" of each b, which is not cancelled from either occurrence of b" when freely cyclically reducing U(b1,.. . , bn). (See [lo] or [ll].) We thus write each b, = c,bTd, where b: f 1, n o letter from b f is cancelled from either occurrence of b" when freely reducing U ( b l , . . ,b n ) ,and each letter of c, or d, is cancelled in one of the occurrences of b:'. If all t h e c, and d, are empty, the tuple ( b l , .. . , b,) is cancellation-free in U, but we now have to take care of the situation where some cancellation occurs. We call t h e generators p, which occur in U primary variables. We introduce, by successive transformations of type (l), new auxiliary variables l, 7,etc. in one-to-one correspondence with the letters cancelled in the c, and d,. When the elements substituted for two auxiliary variables cancel, say in a product 67 where some y is substituted for 5 and y - I is substituted for 7,we perform the transformation 77 + l - I . Such a transformation affects only the auxiliary variables. If the letter substituted for p cancels in the product (p, we perform the substitution /3 + f ' p . We thus arrive at a quadratic word U * ( p 1 , ..,pn,C1,.. . .,&,) and a tuple T * = (b:, .. . , b * , , y l , .. . , y , ) where (i) U * is obtained from U by an automorphism A of @ of the form p, + C,p,D, where each C, and D, is a word on t h e auxiliary variables,
Ouadratic equations in groups
369
(ii) the b f are those in the decomposition b, = c,bfd,, and each y , is a generator of F or its inverse, (iii) t h e tuple T * is cancellation-free in U * and U * ( b f , .. ., b z , y , , . . . , y , ) = U ( b l , ..., b.) in F. Note that U is obtainable from U * by deleting all t h e auxiliary variables. The following lemma is easily proved by induction on t h e number of transformations needed to obtain U * from U. Lemma 15. Let U * be obtained from U as described above. Let V * ( p , ,., . . , p,,,&, . . . , I & ) be a quadratic subword of a cyclic permutation . b f , y,,,...,y,,)= 1 in G. Let V be of U * , and suppose that V * ( b f , , .., (the subword of a cyclic permutation of U ) obtained from V * by deleting all the auxiliary variables. Then, substituting the original b, for the primary variables in V , we have V(b,,,. . . , b,,)= 1 in G. Further, if Z * contains all the primary variables in U * , then Z * is U * . We turn to the analogue of Lemma 7. Lemma 16. A n y boundary cycle of M is a simple closed path.
Proof. If the boundary of M is not a simple closed path let K be an extremal disk of M. Let K have boundary label u I where u l u 2is a cyclic permutation of the boundary label u of M. First suppose that U I = Z * ( b f , ,. . . , b f , ,Y,,,. . . , Y,,) where Z*(P,,, . . .,PI,,L,,. . . , l,,)is a proper quadratic subword of a cyclic permutation of U*. Since u I is the boundary label of a simply-connected R-diagram in t h e plane, u I = 1 in G. By Lemma 15, Z(b,,,. . . , b,,)= 1 in G, where Z is obtained from Z * by deleting t h e auxiliary variables. Since Z involves t primary variables, and the rank of the subgroup generated by {b,,,. . . ,b,,} must be t, (b,,,. . . , b,,) is certainly a non-free solution of Z = 1 in G. But since t < n, this is impossible by Lemma 12. Since the above situation cannot occur, there is some variable y (either primary or auxiliary) so that part of the result of substituting the ec:ry in t h e tuple T * corresponding to y appears in u I and part appears in uz. Then, exactly as in the proof of Lemma 7, we may remove and reattach K , obtaining an R-diagram. with the same number of regions but with a shorter boundary label u’. We have replaced one entry in the tuple T * by a word representing the same element of G. Calculating the corresponding change in T, we replace T by T ’ = ( b l , . . . ,b:) with each 6: = b, in G. By our previous observation,
370
P.E. Schupp
( U ,7')E X ( W , 0 ) and U ( b l , .. . , b:) = u ' . But this contradicts our minimal choice. 0 We now know that the boundary of M is a simple closed path labelled by a cancellation-free substitution instance of U * ( p , ,. . .,pn,C,,.. . ,[,). Since U * is the image of U under an automorphism of 0,the surface defined by U * is the surface S defined by U. Identifying t h e edges in the boundary of M according to U*, we obtain an R-diagram M on the surface S. Exactly as in the proof of Lemma 8 we have Lemma 17. The diagram M is reduced on S. Since R is the standard presentation of the fundamental group of the surface S defined by the canonical form W o n n 2 3 generators, the diagram M can have only one region D by Lemma 13. The boundary label r of D is a cyclic permutation of W ( x , ,. . . ,x n ) * ' . Since r = U * ( b t ,. . . ,b*,,y l , . . . ,y , ) , no cancellation occurs, and each b : # 1, we conclude that U * contains n o auxiliary variables. Thus U * = U. Further, each b, is some x , t l , and up to a possible relabelling of the p, and their inverses, U is a cyclic permutation of We1. We next want to conclude that each c, and d, is the identity. T o illustrate this, we d o an example. (Although we are assuming that n 2 3 , our example is on two generators.) Suppose that U is plpzp;lp;l and that d l # 1. Suppose b t = x 1 and b f = x,. Substituting b, and bz into U , we have that the product
( ~ i ~ i d i ) ( ~ z ~ 2;'xd zi l)c(idl ) ( d2 -x' z
c2
)
and the indicated occurrences of x I and cyclically reduces to x z are not cancelled. It follows that cz = d ; ' , d , = d l , and c1 = d,. Thus each b, is of the form d i l x , d lwith d l # 1. In the general case, we see that assuming some c, or d, is not the identity forces each b, to have t h e form z x t l z - ' with z # 1. But this violates our minimal choice. We conclude that each b, is some x t ' . Now ( b l , .. . ,b,) is obtained by a sequence of Nielsen transformations from a tuple ( a l , .. . , a , ) where a, = O'(x,) in G for some automorphism 0 ' of G differing from the given 0 by an inner automorphism. Hence there is an automorphism J,' of t h e free group F sending x , to a,. Correcting by the inverse of inner automorphism we see that there is an automorphism J, of F with J , ( x , ) = 0 ( x , )in G, that is, 0 is induced by an automorphism of F. Tracing through the effect of J, on W , we also see that J, sends W to a conjugate of W"
Quadratic equations in groups
37 1
References [ l ] C. Chalk, Ph.D. thesis, University of Nottingham, 1977. [2] L.P. Comerford, Real elements in small cancellation groups, Math. Annalen 208 (1974) 279-293. [3] C.C. Edmunds, O n the endomorphism problem for free groups, Comm. Algebra 3 (1975) 1-20. [4] S.O. Gowdy, O n Greendlinger’s eight-groups, Ph.D. thesis, Temple University, Philadelphia, 1971. [5] M. Greendlinger, On Dehn’s algorithms for the word and conjugacy problems with applications, Comm. Pure Appl. Math. 13 (1960) 641-677. [6] M. Greendlinger, A class of groups all of whose elements have trivial centralizers. Math. Zeit. 78 (1962) 91-96, (71 M. Greendlinger, Problem of conjugacy and coincidence with anticenter in group theory, Siberian Math. Journal 7 (1966) 626-640. [8] R.C. Lyndon, On the equation a 2 b 2= c 2 in free groups, Michigan Math. J. 6 (1959) 89-92. [9] R.C. Lyndon, On Dehn’s Algorithm, Math. Ann. 166 (1966) 208-228. [ 101 R.d. Lyndon and P.E. Schupp, Combinatorial group theory (Springer-Verlag. Heidelburg, 1977). [I11 W. Magnus, A. Karrass and D. Solitar, Combinatorial group theory (Wiley, New York, 1966). [12] W. Massey, Algebraic topology, an introduction (Harcourt, Brace and World, New York, 1967). [13] C.F. Miller and P.E. Schupp, The geometry of HNN extensions, Comm. Pure Appl. Math. 26 (1973) 787-802. [ 141 J. Nielsen, Untersuchungen zur Topologie der Geschlossenen Zweiseitigen Flachen, I, 11, 111, Acta Math. 50 (1927) 189-358; 53 (1929) 1-76; 58 (1931) 87-167. [I51 P.E. Schupp, On Dehn’s AlgoYithm and the conjugacy problem, Math. Ann. 178 (1968) 119-130. [I61 P.E. Schupp,.A survey of small cancellation theory, in Word Problems, edited by Boone, Cannonito and Lyndon (North-Holland Publishing Company, Amsterdam, 1973). [I71 J.K. Seymour, On conjugate powers in small cancellation groups, Ph.D. thesis, University of Illinois, Urbana, 1974. [18] V.V. Soldatova, O n groups with a 6-basis, for 6 < 1/4, and a single additional condition, Siberian Math. J. 7 (1966) 504-511. [I91 H.Zieschang, E. Vogt and H.-D. Coldewey, Flachen und ebene diskontinuierliche Gruppen, Springer Lecture Notes, 122 (1970).
S.I. Adian, W.W. Boone, G. Higman, eds., Word Problems 11 @ North-Holland Publishing Company (1980) 373-394
ON A PROBLEM OF KUROSH, JONSSON GROUPS, AND APPLICATIONS Saharon SHELAH* Institute of Mathematics. The Hebrew University, Jerusalem, and University of Wisconsin, Madison
Abstract. We prove some results in group theory in a model theoretic spirit. (i) We construct Jonsson groups of cardinality 84, and other cardinalities as well. This answers an old question of Kurosh. (ii) Our group is simple with no maximal subgroup; so it follows that taking Frattini subgroups does not commute with direct products. (ii) Assuming the continuum hypothesis, our group is not a topological group, except with the trivial topologies. This answers a quite old question of A.A. Markov. In the construction we use small cancellation theory. We try to make the paper intelligible to both group theorists and model theorists. Only a knowledge of naive set theory and group theory is needed.
80. Introduction
We first give the background, state the results, and then explain the proof. Schmidt asked whether infinite groups with no infinite proper subgroups exist; there has been much work in Schmidt groups, see e.g. [16]. Kurosh generalized this question to the following: Does there exist a group of cardinality N, which has no proper subgroup of the same cardinality? Later Jonsson asked the same question for any algebra; so now an algebra with no proper subalgebra of the same cardinality is called a Jonsson algebra. Chang and Keisler [ 5 ] in their list of open problems, repeat Kurosh’s question in this terminology: is there a Jonsson group of cardinality N,? McKenzie [ l l ] proved that, for almost any cardinal A, every Jonsson semi-group of cardinality A is a group. Much work was done on the following question: for which A The author would like to thank the United States-Israel Binational Science Foundation for partially supporting his research, by Grant 1110, and the U.S. National Science Foundation for similar support under Grant 144-H 747. 373
374
S. Shelah
is there a Jonsson algebra of cardinality A (with countably many operations, of course). See e.g. [5]. Magidor and Malitz showed, from a general theorem, that if there is a Jonsson group in some regular cardinal, and V = L (or even OM,)holds, then there is a Jonsson group of cardinality NI. The main body of the paper is a proof of the following result:
Theorem A. There is a Jonsson group of cardinality N, and also of cardinality A + when A + = 2^. The groups we construct can serve as counterexamples for some problems; more properties of these groups are stated in Theorem 1, and more hold by the construction. It has been asked whether the operation of taking the Frattini subgroup commutes with direct products. Now T ( G ) ,the Frattini subgroup of G, is the intersection of all proper maximal subgroups. It was well known that any simple group G with n o maximal subgroup will provide a counterexample (because T ( G )= G, but T ( Gx G ) = the diagonal subgroup = { ( a ,a ) : a E G}). Our groups serve as examples.
Theorem B. (1) There is a simple group (ofpower NI) with no maximal subgroup. ( 2 ) Taking Frattini subgroups does not commute with direct products. Problem. Does Theorem B hold for countable groups? A.A. Markov [IS] asked about the existence of an untopologizable group; i.e., a group which admits only the discrete topology. A.A. Markov [19] and Podewski [12] reported this question and proved that for any Hausdoff indiscrete topologizable group, any finite set of inequations which has at’least one solution has at least two; and that for countable groups this condition on the group is necessary and sufficient for the existence of a Hausdorff indiscrete topology for the group. (The demand of “Hausdorff” is quite natural.) It is still unknown whether countable groups not satisfying this condition exist. Podowski [I21 also gives a sufficient condition on a not necessarily countable group G (any system of less than I G 1 inequations which has at least one solution has at least two) for the existence of such a topology. He also deals with other algebras. Previously Hanson 17) gave an untopologizable groupoid and Arnautov [2], [3] proved results o n rings similar to those of Podewski. Kertesz and Szele [S] showed that for abelian groups a nontrivial topology always exists.
O n a problem of Kurosh
375
Theorem C . Assume CH (i.e. 2"0= Nl). Then there is an untopologizable group of cardinality NI. In fact, every countable subgroup of this group is topologized, thus answering negatively a question of h k o w s k i . Of course we can replace NI by any A' = 2^. Bokut [4] asked whether every algebra M over a field K (M not necessarily associative) of infinite dimension can be represented as U.,,M,,, M,, a subalgebra over K,dim(M,,) = dim(M,,+l,M.) = A for every n.
Theorem D. There is a group G of cardinality N1 such that for every field K, the group-ring K [ G ]is not the union of a strictly increasing chain of length w. Note that Sabbagh [13] had proved the result for modules and groups; but they were of cardinality A = A"o; but his group can serve as well, and we can easily change it to cardinality N1. Koppelberg and Tits [9] have shown that no complete Boolean algebra is the union of a strictly ascending chain of subalgebra of length w. I would like to thank G. Hesse wholeheartedly for pointing out the incorrectness of 2.11 as it was stated (it was needed for Theorem 2.9) and suggesting a proof of 2.9 avoiding 2.11. The error was that if K \ H contains torsion elements we cannot get L C m d L *so * we added in 2.11 the hypothesis "all groups are torsion-free" and added a proof to 2.11 (which was left to the reader in the first version). Hesse's proof is added too. (Note that by his method in 2.9 for any uncountable set of elements I of the group, and element a, a belongs to the subgroup generated by three elements instead of two, but the length of the word is shorter.) We shall first present small cancellation theory. We prove everything except the main theorem. We also slightly improve an application from [14]. We shall prove (in Theorem 2.1) that for A ' = 2^, A >No, there is a Jonsson group of cardinality A' satisfying some other conditions. Then we show that with small changes our proof works for A = No. Next with less details (Theorem 2.9) we prove there is a Jonsson group of cardinality N1 (without assuming CH). Now Theorems A, B, follow immediately from 2.1, 2.9; we then give the (short) proof of theorems C, D, and we finish the paper by making a few remarks.
376
S. Shelah
I would like to thank Paul Schupp wholeheartedly for explaining small cancellation theory to me, and for checking the proof. Let me try to explain the proof of Theorem 1. We construct the group M as t h e union of an ascending chain of length A’, of groups of cardinality A, say Ma(a< A ’). For definiteness we assume that the set of elements of Mu is A (1 + a ) ( = the set of ordinals < A (1 + a)).We try to prevent the existence of large proper subgroups G. In stage a Mu is already defined, and we are defining we have a list of proper subgroups of Ma of cardinality A { S , : p a},we look at them as approximations in some a (0) s a to a G (i.e. as G fl and try not to let them “grow”. More specifically, we want that for every a E M a +-,Mu, and p s a, the subgroup of M e + ,generated by S, and a includes M,. This means that if G is a proper subgroup of M , M,,,,C G, S, = G fl M,,,,, then G is disjoint to M a + ,- M,. If we choose S, ( p < A + ) so that every subgroup of M of cardinality A appears (and this can be done by 2* = A + ) this scheme works, provided that we can define M a + ,from Mu.We construct M a + ,by a series of approximations of smaller cardinality L, ( p < A ) ( L , ascending and continuous, of course) and let H, = L, f l M , (of course u , H , = Ma, U,L, = M a + , ) .So we have to amalgamate L,, H,+, over H,, faithfully. The free product of L,, H P + ,with amalgamation over H , satisfies this; =p M, is easy; but we but we have tasks to fulfil. To ensure U P C A H also have to ensure that each b E Ma belongs to (a, S , ) for a E L, - H,, y s a. There are A such tasks, so in each amalgamation we can deal with one such task only. We can choose H,,, such that b E H,,,, and S , fl H,,, is “quite” large. But how to do this? We want something like free amalgamation, with an extra relation, saying that a word in a and some x E S, is equal to b. Small cance]lation theory is just the right theory. However one needs some hypothesis on Q and x. For this we make the induction assumption that H , is a malnormal subgroup of L, (see “Notation”, just below, for a definition), and we use the fact that IH, 1 < A = to find suitable x, y E S, f l H , + , . Now a, x, y satisfy a variant of the blocking pair condition, so small cancellation theory works. For A = N,, (see 2.7) we should replace usually “of cardinaltiy < A ” by “finitely generated”. The main point is that a set S C M,, which is not included in a finitely generated subgroup of Mu may be included in a finitely generated subgroup of some M,, p 3 a. However our construction prevents this possibility, as shown in fact by 2.8, which is useful also in replacing O, by CH in Macintyre [lo] for algebraically closed groups (see [17]). Theorem 2.9, which says that in power NI Jonsson groups always
O n a problem of Kurosh
377
exist, demands more changes, but we give fewer details and the ideas are essentially t h e same. Notation. G, H, K, L, M will be groups. G G H means G is a subgroup of H. If G < H, x E H, then we call x malnormal over G (relative to H, of course) if G, G" are disjoint, except for the unit element which we ignore where G' = {xgx-' : g E G}. If G 6 H and every x in H - G is malnormal over G, then H is a malnormal extension of G, and G a malnormal subgroup H and we write G S m H . If G 6 H , x E H then the righr, left, double coset of x over G in H is xG, Gx, GxG resp; and belonging to t h e same right (or left, or double) coset, is an equivalence relation; and inequivalent elements have disjoint right (left or double) cosets. We do not distinguish strictly between a group and its set of elements. J A1 is t h e number of elements of A, i.e., its cardinality. For X C G, (X) is t h e subgroup of G generated by X. We call x , y E H good fellows over G if x, y E G - H with I?' Gy "G. Note that among any three elements of G - H with distinct double cosets at least two are good fellows over G. Note also that for good fellows x,y ( G x f l G ) n ( G y f l G ) =0; and that G itself is a double coset. The relation of not being good fellows is an equivalence relation. XI'
11. Free products with amalgamation Definition 1.1. Suppose H, K, L are groups, K n L = H (for notational simplicity only). We now define the free product with amalgamation of K, L over H, denoted by L* = K * H L as follows: i) Let F, be the free group generated by t h e elements K U L. ii) Let N , be t h e normal subgroup of F , generated by {glg2g3:gl, g2, g 3 EK , glgz= g;' or g,, gz,g 3 E L, g lg 2= gill. iii) Let L * = F , / N , . Now K, L, H have natural homomorphisms into L* : g I-+ g l N , . Fact 1.2. These homomorphisms are embeddings, and the intersection of the images of K and L is the image of their intersection, H (see e.g. [41). We shall not from now on distinguish between g and g l N , , and we shall call an element of F , a word, and of K U L a letter. It follows that every element of L * which is not in H is equal to a product of the form g,. * . g, such that for each 1, gr E L U K - H, and gr E L iff
S. Shelah
378
gl+lE K. Such a product will be called a canonical representation. For
g E H, g itself is the canonical representation. A subword of (a canonical representation) g,. . * g, is gr * * * ,g, 1 s 1 s m s n. In general
canonical representation is not unique but
Fact 1.3. If g E L * has the canonical representations g l . g, g: . * . g!,, then: (i) n = m. (ii) gl E K iff g: E K . (iii) There are h,, . . . , h,-, E H such that letting h, = h, = e, for every 1, g: = hrgrh;:,. Definition 1.4. If g E L * has a canonical representation g l * * * g, n will be called the length of g and denoted by Ig I. Definition 1.5. The canonical representation g, * g, is called weakly cyclically reduced if n is even, or n = 1 or g,gI 6Z H (equivalently, g l . . * g, has n o conjugate of length < n - 1). Notice that this is a property of the element. Small cancellation theory The aim of this theory in this context, is as follows: We have in L* = L * “ K , a set R of words which are “long and complicated”, which we want to make equal to the identity without “hurting” L, K , and short words in general. More accurately, we want to divide L * by the normal subgroup N of L * generated by R, and want that N will be disjoint to K , L and moreover will not have “short” elements. For simplicity R will always be a set of weakly cyclically reduced elements. Definition 1.6. (1) The symmetrized closure of R is obtained from R by the following operations: (i) Add the inverses of the elements in R. (ii) Add the conjugates (in L*) which are weakly cyclically reduced. (2) R is symmetrized if it is equal to its symmetrized closure. (3) A part of a cyclically reduced word is a subword of a weakly cyclically reduced conjugate of it. A part of R is a part of one of its elements.
-
Claim 1.7. If R = {g, * * g,}, n even, g, * g. a canonical representation, the symmetrized closure of R consists of the following elements: for each 1 =S 1 Q n, gr = g:g:, g:E K @ gr E K, the words:
379
On a problem of Kurosh
-.
g,gl . g,-,g: and (gi)-'g;J1 . . . g;'g;' . . g;+'&;)-'. (Note that we write the elements in canonical form except that maybe g f E H and then we multiply it with its neighbor.) gfg,,,
*
Definition 1.8. A symmetrized R satisfies the condition C(O)( 0 a real positive number < 1) if whenever g * = g" . . * g I , g * = g l * g, are canonical representations of elements of R, g * # g-d and g""' . . . g 'gl * gf(,, E H (hence this hold for any 1 s l(0)) then l(0)< On, l ( 0 )< Om and n, m > (l/O) (this means g *, (g * ) - I have n o common segment of this length. Clearly l(0) depends o n g * and g, and not on the representation).
--
-
Main Theorem 1.9. If L * = K * H L, R L * symmetrized and satisfies C'(l/k), k 3 6 , N the normal subgroup of L * that R generates, w = g l . . . g. E N in a weakly cyclically reduced canonical form, then w has a part wo which is part of some w1E R, and 1 w o J2 [ ( k - 3 ) / k ]I w11. Now we have to find suitable R. Schupp [14], [15) suggests, as a nice sufficient condition for existence, that L (or K ) contains a blocking pair { x , y } over H, which means a pair of malnormal elements over H which are good fellows except that possibly x = y - ' . Then for a a E L \ H the symmetrized closure of {axayax(ay)*ax(ay)'** . ax(ay)"} satisfies C'(l/lO). We notice two weaker conditions: (1) there is a malnormal a E L over H, and two good fellows X, y E K - H ; (2) there is a malnormal a E L, and x # y E K - H (this is weaker than (1) but the word contains inverses). So we can somewhat improve Theorem 10, p. 582 of Schupp (14) (see [15]). Theorem 1.10. Let L * = L * H K be a free product with amalgamation where L # H, K # H, and in at least one of them there is a malnormal element over H. Then L * is SQ-universal except when L, K, H are cyclic groups of order 2, 2, 1 resp. (SQ-universal means every countable group can be embedded in a quotient group of K ) . Now we present this more accurately. We denote by w ( - f ) = w ( x I , .. . , x,) a sequence composed of the letters x I , . . . , x,. For every group (or serfiigroup) G and a , , . . .,a, E G t h e meaning of w ( a l,..., a,) is clear. Definition 1.11. w ( i ) = w ( x I,..., x , ) = x ' - . . . . x k , where X I E { x l , . . . ,x,} and we stipulate x i = x i when i = j mod k (for latter use).
380
S. Shelah
(1) We call w ( 2 ) n-random if (i) whenever 1,s p , q s k , and ( V i ) (1 < i s k / n + x p + ' = x q + ' ) then p = q, (ii) for every p, 1 s p , q s k , for some i, 1 s i < k / n and x p + ' = x4-1
( 2 ) We call w ( f ) strongly n-random if (iii) if 1 s p , q s k , u 1 . . . u,, u I * * u. sequences of x , ' ~ , u,, x q + e ( I + t ) = E , 6 E (1, - 1) then for some 1, 1 s 1 < k / n - n, x P + ' ( ' + ' ) = u, for every i, 1 s i < n, except when an apparent contradiction arises (that is 6 = E , and for some i , j , 1 < j s n, p + 6i = q + ~ j u, , # v: ( S O IP - 9 I
gr+2,= a 6 , ga+2i+l =( Y ) ~ when x
~ = +x2 ~
when t h e index of g is < rn and > 2 , except possibly once (because of z). Of course E , 6 and p depends on g, so we shall write E ( g ) , S(g), and Pk). Suppose now g * , g , , I ( O ) are as in 1.8, and let h, = g' . - . g l , g I * . . g t E H for I d I(0); so hr+,= g'+lh,gl+l,h, = e ; and note g' E K gr E K (otherwise I ( O ) = O ) . Except for at most three I's (1, and one
O n a problem of Kurosh
38 1
exception for g, and one for g*) g' E L 3 [g',gI = a f ' ] and g' E K .$ [g,, g' E {x*', y*'}]. There are 1 < l(1) < l ( 2 ) S l ( O ) , l ( 2 ) - l(1) 3 1(0)/3, such that no 1, l(1) < 1 s l(2) is exceptional. So for every such 1, if g' E K , (as x, y are good fellows and g'hl-lglE H), then gr = x t 1 iff g' = x * ' . This means for 1(1)/2 < i < 1(2)/2, x ~ ( ~ ) - ' ~ ( ~=~ Ip(g.)+s(s.)i-(=(g')-E(s.))/21 (note g * was written in a reverse order). By the (4/8)-randomness of w(x1,x2) this implies S ( g * ) = - S ( g * ) and g ' = (gr)-' for 1 < 1 < rnin{m, n } . If gigl # e, then for every 1 < l ( O ) , hl# e (as a conjugate of an element # e is # e ) and for some 1, g' = a s so gI = a-' and hl+,= ashla-*,so by the malnormality of a, hr+lSr H, contradiction. So necessarily g' = gi', hence n = m, g " = g,- 1 , so g * = g;'. So we finish. (2) We proved as before and let u E {x, y } , u # x-'. Suppose S(g *) = S ( g . ) = 1 for simplicity. But now by strong (4/8)-randomness we can find 1,1(1) < 1, 1 + 2 < l(2) such that g ' = a, gl = a - ' , g"' = x, g r + l= u, g'+*= a, gl+2= a - ' . As h, = g'hl-lgrE H, a rnalnormal over H , clearly hl-l,hr E H so hl = hl-l = e (otherwise hr fZ H by a being malnormal over H ) SO hl+l=X U # e SO hi+2= a(xu)a-' $! H, contradiction. In the other cases we get similar contradictions except in the desired case. 92. The theorems
Theorem 2.1. Suppose A is an infinite uncountable cardinal and A ' = 2A. ( 1 ) There is a Jonsson group of cardinality A + . ( 2 ) Moreover this group is a Jonsson semigroup (i.e. has no proper semigroup of the same cardinality), is simple, and there is a natural number no such that for any subset S of the group of cardinality A', any element of the group is equal to the product of no elements of S. Proof. We will note some facts on groups, th e n we describe the construction, and at last prove it works.
Fact 2.2. Suppose H is a subgroup of K and A a subset of K such that either (i) 3 ( H 1 2 < I A1, or (ii) H is included in a finitely generated subgroup of K and A is not included in a finitely generated subgroup of K . Then there are x, y E A which are good fellows over H. Proof. Say x = y if x or x - ' E HyH. Now = is an equivalence relation o n A - H. Then x, y are good fellows over H iff they are not
) + ~ ( ~ * )
S. Shelah
382
=-equivalent. Thus if no x, y E K are good fellows over H, A - H is contained in the union of at most 2 double cosets: HaH, Ha-'H ( a any element of A - H); which contradicts (i), and contradicts (ii) as if H is contained in a finitely generated subgroup of K , so is ( H a H )u ( H a - l H ) u H. Fact 2.3. Suppose H = K r l L (so H is a subgroup of K and of L), and H s,,, L, and L * = K *HL (i.e. L* is the free product of K and L with amalgamation over H). Then K G,,, L*
Proof. Let g E K, g # e, p = w I w 2* * w. E L* - K (a canonical form, the w ' s are letters; so each wi belongs to L - H or K - H , and successive letters d o not belong to the same one). Note that n=1 w IE L - H as p 6Z K. We should prove that q = pgp-' = W I . . . w & w ; ' . . . w;' does not belong to K. (i) If w. E L - H, g E K - H, q is already written in canonical form (of length 2n + 1 > 1) hence q 6Z K. (ii) If w. E L - H, g E H, then w,,gw ;' E L - H by the malnormality of H in L, hence q has the canonical form w 1 w 2 . -w * . - l ( w ; l g w n ) w ; l l . . - W ; ' W ; ' and w ; ' g w . E K . So q g K. So we can assume w. E K - H, so as p $2K, n > 1. (iii) If w,,gw;' 6Z H, then w 1* ~ . - ~ ( w , , g w ~ ' ) w ~w;' l ~ is a canonical form of q hence q E K. (iv) If g l = w,,gw;'E H, then clearly gl # 1, and as n > 1, we are reduced to case (ii). We exhaust all possibilities, thus finish.
+
--
-
We shall need the following fact about small cancellation theory over free products with amalgamation. Fact 2.4. L * = K *HL where H 6, L. Suppose that x, y E K - H are good fellows over H and let a E L - H, z E K . Let r = r(a, x, y, z ) = z - ' ~ z y a x a ( y a ) ~ x a ( y a ) ' . x a ( y a ) @ " , and let R be the symmetrized closure of r. Let N be the normal closure of R in L*, and let L**= L * / N . Then (i) the natural map r : L * + L** embeds H and K, so that their intersection does not increase, (ii) K s,,, L**.
-
Remarks. In fact r has to be a long enough random word consisting of instances of x a , y a ; then multiplied by z - ' ; but group theorists like this particular r. "Random" means that no two distinct "large" (e.g. of
On a problem of Kurosh
383
length at least 1/100 of that of r ) segments of r are the same, even if we allow to inverting the order (but a segment and its inversion are considered distinct). We could have replaced "good fellows" by "with distinct double coset representations" but then the random word should consist also of instances of ax-', ay-', so later we should not get a Jonsson semigroup.
Proof. For (i) see 1.13 (1) and then 1.9. To see that (ii) holds, let e f k E K . Suppose u k? K . We can assume that u does not contain more that half of an element of R. Write u = u 1 u 2 ~u. * -in reduced form in L*. If u-'ku = k ' E K, we have the equation u - ' k u ( k ' ) - ' = e in L** and thus the left-hand side contains a part of more than 7/10 of an element of R by length. This cannot occur in u-l nor in u (as they do not contain more than half of a generator), so this occurs around k (or (k')-'), and at least 2/10 of it lies on each side of k. This means that for some v (initial segment of u ) u-'ku is a part of more than 1/10 of a member of R. But no element of R contains a long subword of the form c-lgc where g has length 1. Thus the above equation cannot hold. If the part is around (k')-' , the proof is similar. The construction 2.5 (for Theorem 2.1). As we have assumed 2* = A' the number of subsets of A' (as a set of ordinals) of cardinality exactly A is 2* = A', so let {S, : a < A'} be a list of them. We now define an increasing sequence of groups Mu(a< A + ) such that the set of elements of Ma is A ( l + a ) . We define them such that (i) Mu is a malnormal subgroup of Mu+I. (ii) For every y S a and a E Ma+' - Mu,if S, C Ma then the subgroup (S, U { a } >includes Mu.In fact, every element of Ma is a product of length no elements from S, U { a } (where no is the length of the r in Fact 2.4). (iii) For limit S, M~= Uac8Ma. We do the induction step later, and now let us prove that M = u,,,+M, is as desired. Suppose by contradiction that M has a proper sub-semigroup, M*, of cardinality A'. Clearly, for some a(0)< A + , S = M *r l Ma(o)has cardinality A, and let a E M - M * ,for some a ( l ) , a E Ma('). Now S is a subset of A' of cardinality A, hence for some a(3), S = &). Let a = max{a(O), a(1), ( ~ ( 3 ) )Since . M * has cardinality A + , it is not included in Mu and hence for some b E M *- Ma.So necessarily for some p 2 a, b E MB+'- MB. Thus by (ii) (p, a(3) here includes stands for a,y there) the subgroup generated by b and
384
S. Shelah
and hence a belongs to it. But b E M*, S,(9 C M*, hence a E M*, contradicting its choice. This finishes the proof of 2.1(1). As for part 2 of Theorem 2.1, M is a Jonsson semi-group as just proved; moreover checking how the group generated by b and S,(3) includes M, (by (ii)) we see that we have proved that if S is any subset of M of cardinality A', any element of M is equal to the product of no elements of S, here no is the length of the word r from Fact 2.4 and is a fixed natural number. From 2.1(2) only the simplicity remains, but if a E M, a not t h e unit, for some a , a E Ma,so for every /3 a'cy a has a conjugate in M,+l-M, (as M,+I is a malnormal extension of M,, for each b E MScl- M, we have bab-'E M,,, - M,). Hence the set of conjugates of a has power A + , so the subgroup they generate is the whole group M. In other words t h e normal subgroup a generates is M, so M is simple.
Ma.Therefore it includes
The induction step 2.6. So M a is given and we should define Ma+,. Let < A } be a list of t h e elements of Ma.Let {(a,, y,, b,): /3 < A } be a - Ma ( M a + ,is not list of all triples (a, y, b), b E Ma,y S a, a E yet defined, but - Ma is A, = (5 : A(1 + a )S 5 < A(l + a + 1))) and each triple appearing A times. (This can be done by cardinality considerations.) Now we shall define groups Lo, H, (/3 < A ) (more = uPc,L,. accurately L;, H ; ) and later let Ma+, We define them such that: (a) IL,I
up
O n a problem of Kurosh
385
apply (d). Part (iii) and the condition on the set of elements of holds by (b) and (c).
Statement 2.7. The proof works for A = No, too, the only changes being: (A) In the construction 2.5 in (ii) we have to say “if S, C Mu is not included in a finitely generated subgroup of Mu”. (B) In the induction step 2.6 at the end we have to use Fact 2.2(ii) instead of 2.2(i); and in (a) we should assert that L,, Hp are finitely Z Mu,5 $Z Lo} is infinite. generated and ( 5 < w ( 1 + a + 1): 5 f (C) In the construction 2.5 when we prove the construction works, we have to define S = M * fl Mace, somewhat more carefully. First of all we have to choose it so that it is not included in a finitely generated subgroup of Mu,, which can be done by a Lowenheim-Skolem argument. However, a priori, maybe in M, this is no longer true; but the following fact closes the gap. Fact 2.8. Suppose Mu(a < 6 ) is an increasing continuous chain of groups, and Mu+,is defined from Mu as in 2.6 for A = No,i.e. Mu+,= u B < w L ;L, ; f l Mu = H;, HE is a finitely generated subgroup of Mu (or even is included in such a subgroup) Mu = u p < w H ; .Then if a < 6, and G a subgroup of Mu not included in a finitely generated subgroup of Ma,then G is not included in any finitely generated subgroup of M6 (of course, we can replace group by algebra of some fixed kind). Proof. We prove there is n o finitely generated subgroup of M , ( p S 6) which includes G, by induction on p. For p S a this is a hypothesis, for p limit it is easy. For p + 1, suppose a l , .. . , ak generates some such subgroup. Then for some n a l , . . . ,at are in L!, hence G C ( a , , .. . , a k )f l MuC L! r l Mu = H,, so G is included in a finitely generated subgroup of M,, contradiction.
Theorem 2.9. (1) There is a Jonsson group of cardinality N,. (2) This group is simple, and is a Jonsson semigroup. Proof. We first we state a definition and a fact.
Definition 2.10. (1) When H C L, x E L is made over H if for n, m natural numbers, h , , h2E H, hlxnh2= x”’ implies n = m, h , = h2 = e. (2) H is a mad subgroup of L, and L is a mad extension of H, if every x E L - H is mad over H; and we write H L.
cmd
S. Shelah
386
Fact 2.11. Let L* = L *"K,and H SmdL, all of them countable torsion-free. Let {(b,,,c,,, z,,): n < w } be a list of all triples (b, c, z ) , b E L - H, c E K - H, and z E K. We can find natural numbers k ( n ) (n < w ) such that the following holds. c , ~ ; ( ~ " + ~:) n) z<; w~ } Let R be the symmetrized closure of {r(c,bt(2"), where r is strongly 4000-random and N the normal closure of R, and L ** = L *IN. Then (i) R satisfies the small cancellation condition C'(1/1000), (ii) the natural maps L * + L** embed K and L, and the intersection of their images is the image of H, (iii) K SmdL **, (iv) L** is torsion-free. Proof of 2.11. For each n, the classes HbkH (0 < k < w ) are distinct (hence disjoint), because H is a mad subgroup of L. A similar assertion holds for Hb;*H (0 < k < w ) . Hence we can define by induction on n, k ( n ) > 0 such that 2 n + p # 2m + q, n,m natural # bZkL(2m+q) numbers, p = 0,1, q = 0,1, h , , h2E H implies hlb:k(2n+p)h2 or, equivalently, b ~ k ( 2 n + p )ik*(2m+q) h 2 b # hi. Let us check each part: (i) We use the notation of Definition 1.8, and let g" * . g 1 g l* * g, = h, E H, so ho = e, g"+lh,g,+l = h,+l, so by the n , g*, g ,. are choice of the k ( n ) , for some r. = r(cnbt(2n),cnh~(2n+'))z-i cyclically reduced conjugations of r,,, r;' respectively or of r;', r. respectively. By the strong 4000-randomness we get the desired contradiction (like 1.13). (ii) Follows from (i). (iii) Suppose x E L** - K, k l E K, k 2 E K, and n, m are distinct natural nnmbers > 0, and klxmk2= x n and we shall get a contradiction. Among the representations of x we choose one y-'zy, y = y l * * y, z = z i . z, (where y l . . * z I . * . E K U L ) with smallest p + 2q (p, q natural numbers). So y;'. * y -l1 z1 * zqyl* * y, is in canonical form except that we may put together y;lzi,z,yl, and z I - * - z ,is weakly cyclically reduced. [Otherwise q > 1 is odd, z,zl E H,so x = (y;i.. . y;1z;1)((zqz1z2)z3. . Z , - ~ ( Z.,.~ y,) ~ so we can let y' = * y, z' = (z,zlz2)zs * z,-~). (Note z,zlz2E L U K - H)so p' = z,yl . p + 1, q' = q - 2, contradicting the minimality of p + 2q).] Similarly q > 1 implies q is even; y,, ..., z l * * 4:* H. A s x E L * * - K clearly q 2 1 . Note: 9
-
-
-
-
387
On a problem of Kurosh
A) y,y-', z cannot contain more than half of a word from R. (As then we can decrease p without changing q or vice versa.) B) y - ' z , z y cannot contain more than 9/10 of a word from R. By symmetry suppose z,+'. . * zqyl* y, is a subword of a word w from R, of length > 91 w 1/10, so w.1.o.g. w = z,+' * z q y I . * y,t ( t a word 1 t 1 s 1 w I/ 10). Clearly
-
. . y ; I z , . . . zqyl . y, = (y;'.. . yy:l)(y;'. y;')(zi = . yY+!l)[(y;'. y;')(zl
x = y;'.
* .
* *
(y;l..
y;'
' '
* * *
zt)(zt+I* .
. zq)(yl.
.
* * yj)(yj+I*. yp)
* * * zl)t-'](yJ+l. ' * yp)*
So we get a representation with y ' = y,+' * * * y,, z ' = . . . y ;'zI . . 2,t-I. So p ' = p - j , q' = j + i + It - ' l , hence
-
(p
+29)-
@'+ 29') 3 (p
+ 2 q ) - (p - j + 2j + 2i + 2lt-Il)
=2(q-i)-j-2It-l/ 3 2(q - i ) - j - 21 w
1/10.
But p + 29 was minimal, hence s p ' + 2q', hence j 3 2(q - i) - 21 w 1/10. But also j + (q - i) 3 91 w 1/10, hence
3j /2 = j
I
+ j / 2 3 j + ( q - i ) - I w 1 / 1O 3 9 1 w 1 / 10 - 1 w I/ 10 = 4 1 w I/ 5 .
So j > w 1(4-2/5.3)= 1 w 1(8/15) > I w 1/2, contradiction. C) zz cannot contain more than 9/10 of a word from R. So suppose z , + ~ * z,zl * * z, is a subword of a word w from R of length > 9 ) w 1/10. So w.1.o.g. w = z , + ~ * z,zl * z, t ( r a word so I t 1 < 1 w I/ 10). Assume first j < i. so x = y , ' . . .y;'zl.. . z q y l . . . y,
--
= (y;'.. = (y;l..
a
-
y ; ' ) ( z ; l . . . 2;:1)(2,+1..
*
. y ; ' z ; ' - . . z;:l)(~-lzJ+l
Z,)(ZI
* * *
* * *
Z,)(Z,+l.
* *
Z,)(Yl*
* *
y,)
Z,)(Z,+1*** z,y1..* y p ) .
So let y ' = z,+' - - * z q y l .. ym z ' = t - ' ~ , * * z,. Then x = (y')-'z'y', and p ' s p + ( q - i), q ' s I t - ' J + i - j Q 1 w 1/10 + i - j . Again O3(p+2q)-@'+2qf) s @+ 2 q ) - @ + (q - i ) + 2 (w I/lO+2(i - j ) ) =q
- i +2j - 1 w p5.
So q - i s 1 w ) / 5 - 2j, but by the choice of w, i, j , t clearly (q - i ) + j s 9 1 w I / 10, hence q - i 3 9 1 w 1 / 10 - j . Those two inequalities
388
S. Shelah
imply I w ) / 5 - 2j
j SO).
1/10 - j , which is a contradiction (as I w I S 1,
3 9 )w
Now we are left with the case j 2 i, but then by the strong randomness of r, j - i s I w )/1000, so replace j by i - 8, and repeat the same argument. D) In B) and C) we can replace 9 )w 1/10 by 1 w )(9/10 - 1/1500).
We have assumed n # m > 0 , k l x m k 2= x". We shall get weakly cyclically reduced forms of word which is e, and using 1.9 and A)-D) get a contradiction. Case 1. q = 1. Notice z I E K - H .$ y l E L - H by p + 2q minimality. Then X " = y p- I - . y ; l Z ; y ' . - y , . If k l = k 2 = e we get z 7-" = e and as K, L are torsion free, n = m. If k l = e # k 2 , m > n w eg e t y p ' . * . y ; ' ~ ; i - ~ y ~ P. . 2- =y ke = y p ' . . . Y ; ' z " - " Y ~ .* . yP-I(ypk2). As before, zT-"#e. As y l . - . z l . . . are in K U L - H , if k 2 E H , z"-"tZ H then the word is in canonical form. If k 2 E H, z"-" E H , y l E L - H, by t h e madness condition w = y,'...y;'(y;'z"-"yl)y2...y p-l(ypk2) is in canonical form ( z m - " # e as K is torsion free) when p > 1, and (y;lzrn-"ylh2)EL - H is in canonical form when p = 1. When k2 $Z H we have a similar situation. So in all cases in t h e word w is not e in L * . As it is e it should be long, SO we can make it weakly cyclically reduced by small changes and we get easy contradiction by t h e strong 4000-randomness. The case k l , k 2 # e is similar. Case 11. q > 1 hence q is even. We get that
e
-
k l x m k 2 x - "= k l y , ' . . y ; ' z l . . . z , . . . z I z,yl * . . ypk2 y p ' . . . y -I ' z'l- 1 . . . z i t . . . . z;'y,. . . yp = d f w * *
=
The word is not weakly cyclically reduced only, possibly in k l , k2,y;'zI,z4yI,y;'zq', z;'y,. But if kl,k z # e, the needed changes involve few letters (much less than a hundred) so we ignore them. Let w * be the weakly cyclically reduced form. If k , = k 2 = e we get e = x"-" = z I . z, . z l . z,. If k l = e # k 2 -
e
= Xm-"k2=
a
y;' .
y;'z,
. . -2, . .
e z , .
. .z q y l .
*
- y,k2.
Both cases are easier, and we leave them to the reader. So by 1.9 there is t = tl t, a part of w * and of a word w from R I t 1 a 997 1 w I/ 1000.
---
O n a problem of Kurosh
389
If for some natural number j > 1 w 1 / 1000, 1 < i < i < i + j < 9, t contains 2, . . . z,+, from two copies of z, we get a contradiction to t h e strong 4000-randomness of r. Similarly if t contains z1 * z,+,, 1 z -,+, . . . z ;'. Also if for some j > 1 w 1 /4000, 1< i < i + j < p, t contains y ;+!,* . y ;', and y , * . . y , + , (or two copies from one of them) we get a contradiction. But t is a part of w * , so looking at it we can see that if it intersects two among the four copies of y, y - ' in w * with length > I w 1/4000, we get a contradiction to the above. With the one left its intersection has length =G I w ) / 2 (by A). Necessarily the sum of its intersection with z", z - " is Z= (w1(99/100-3/4000- 1 / 2 ) > 4 9 6 ( w ( / l O O O so with one of them, e.g. z'", it is a 2 1 w ( / 1 0 . If the length of z is < I w I/lOO, then for some i, 1 < i < 1, for every (or t,+,-J contradicting the strong j , i S j < i + 1 w 1/10 t, is equal to 4000-randomness of r. So the length of z is > 1 w ) / l o o , hence t cannot contain two copies of z (see above). So summing our observations we have the following possibilities only : a) t is contained in y - ' z z , and is not disjoint to y - l . So its intersection with the second z has length < I w ( / 1 0 0 0 , SO y - ' z contains a subword of w of length > (91/100 - 1/100))w I = 91 w 1/10, contradiction to (B). b) r is contained in zzy and is not disjoint to y . The same contradiction. c) r is contained in zz, so it is z:z,+1* . . Z , Z l . . . z,-1z:.
We get contradiction to (C). d) t is contained in z z y k 2 y - ' z z (or similarly with k l ) . We can get similar contradictions. (iv) Left to the reader. Proof of Theorem 2.9. The construction is as in 2.5, but the list {S, : y < A'} is n o longer necessary; and we make the changes mentioned in 2.7 and assume all groups are torsion-free. The main point is that in the induction step 2.6 we use Fact 2.11 rather than Fact 2.4. So in 2.5, condition (ii) is replaced by (ii)' For every a E Ma+, - Ma,for an S C Ma which is not included in Mas (a, S ) . Let us prove that t h e a finitely generated subgroup of Ma, definition of Ma+l(see 2.6) satisfies this. So let b E Ma,and we should prove b E (a, S ) . Clearly for big enough n, a E L., b E H.; and we can find m 3 n such that
S. Shrlah
390
Sf (H,,,+, l - H,)# 0 (as Ma= u,,H,, but for no m is S C H,,,). Now - If,,,).As we have used Fact 2.11 in La+], choose c E S n (H,,,+l b E (a, c ) ; so we finish.
Proof of Theorem C. We assume 2*0 = N,, and let M be the group is a non-trivial topology which makes M a from 2.7. Suppose topological group. Choose a neighborhood Uo# M of the unit, and define inductively neighbourhoods U, of the unit such that Un+lC U,, and x , y E 3 xy E U,. If U,,, is uncountable, every element of M is the product of no elements of it and hence belongs to Uo, so U o = M. Contradiction. If U,,, is countable, it is a subset of Ma for some a < ol. Choose x E M - M a ;xU,,,x-’ is necessarily open, so U,,, n xU,,,x-’ is a neighbourhood of the unit; but by the malnormality condition (Ma SmM,see 2.5(i)) this intersection contains the unit alone. So any singleton is an open set, so any subset of M is open, a contradiction. is either {M,0} or the discrete topology. We can conclude that
v
Proof of Theorem D. Let M be the group of Theorem 2.9. Clearly we cannot find a strictly increasing sequence of sub-semi-groups M,, M = u.,,M, (as some M , is necessarily uncountable, so M , = M). Hence if K is a field, K ( M ) the group-ring, K ( M ) = U.,,A., then M = (A,, n M). Now A,, fl M is a sub-semi-group of M. Clearly M E A,, so A, n M is strictly increasing and get a contradiction to the previous observation.
u,,,
Additional information 2.15. Simplicity of Jonsson groups Macintyre has shown that in fact “almost” any Jonsson group is simple. More exactly, there is no Jonsson abelian group, hence M has centre Z ( M ) of power < [ M I . For any a E M - Z ( M ) , its centralizer again (is by its choice # M, hence) has cardinality < ( M I ; so the number of conjugates of a is 1 M 1, so the normal subgroup it generates is M. So M / Z ( M ) is a Jonsson group, and is simple. 2.16. Jonsson groups with centre This naturally raises the question whether there are Jonsson groups with a non-trivial centre. We can repeat the proofs and constructions of 2.1, 7, 9 by starting with an abelian group Z , IZ 1 S A, and change the definitions and requirements accordingly. So all groups will extend
O n a pmblem of Kumh
391
Z, and Z will be in their centre; H =%L means a E L - H, b E H Z j aba-' 62 H, etc. S o the generalization will be easy. 2.17. On torsion Jonsson groups The groups we constructed are torsion free, and moreover satisfy x" = y" x = y for n# 0. We may like to build torsion Jonsson groups. Now free products with "good" amalgamation for some torsion groups (i.e. the given groups and the result are torsion) exists (by Adian [l]) but not with small cancellation, so we can only hope.
+
2.18. Jonsson group in Nz The proof of Theorem 2.9 works also for A = Nz without any CH but for any N,, we need more complicated amalgamations, and the situation is not clear.
Appendix Another proof of Theorem 2.9 avoiding 2.11. This proof is due to G. Hesse. Lemma. Let H , K , L be groups such that K n L = H and H S,,, L, and let 0 be a subset of H x (K\ H ) X ( L \ H)* satisfying : (i) If (h, a, b, b') E 0,then b, b' are good fellows over H. (ii) If ( h l ,a', bl, bl) and (h2,az, bz,b;) are different elements of 0, then at least one of the pairs bl, bz and bl, b; is a pair of good fellows. Suppose L * = K * H L , w ( x l , x z )is the word xlxzxlx:...xlx?, R C L * is the symmetrized closure of {h-'w(ba, b ' a ) ((h,a, b, b')€ a},N is the normal closure of R in L * and L** = L*/N. Then: (1) R satisfies C'(l/lO), and the natural map 7r : L*+ L** embeds K and L, so that their intersection does not increase. (2) K s , L * * . (3) I f a E K \ H , b E L \ H a n d O S m < n € o , r h e n (ab)",(ab)"are good fellows over K in L * . (4) If b,, bz E L \ H are good fellows over H, then b l , bS are good fellows over K in L**. Proof. (1) Let g * = g" * * * g', g,. = gl * . . g, be canonical representations of elements of R such that g*g, # e, and let g' * g 'gl * * gr € H for some 1 S min{m, n } =: p. An easy computation shows m, n E { k , k + l}, where k = 6640. We may assume that there are
-
-
392
S. Shelah
elements (hl, a , , b l ,bl) and (hz, az,bz,b;) of 0,canonical representations h;'w(blal,bIal)= u l * * * Uk and hi'w(bzaz,b h z ) = V, * V,, p , q E (1,. ..,k } and 6, E E { - 1,1} such that (a) g ' = uf-&, g, = VG+e,(1 c i < p ) , (b) g'g" * * * g' = U ; - ~ U : - & - * u:-+ g , * * * g,,gl= V;++ * * * VG+akVG+L, where u, = u, and V, = V, when i = j mod K. Case 1. p # q or E = 6. Then by (i) and the choice of w we can find i E (2,. . .,664) such that g' = u;-& and g, = VG+e,are good fellows over H. Hence g'hg, H for every h E H, and we have 1 C i d 664 C p /lo. Case 2. p = q and E = - 6. We find i , j E (2,. . . ,664) such that w.l.0.g. g ' = u& = bye, g, = V;+.,= b;, g' = u&= b!-' and g, = V;+,= bl-e. If we assume 12664, then none of the pairs b,, bz and bI, b; is a pair of good fellows. By (ii), ( h , , a , , b,, b : )= (hz,az,bz,b;) and w.1.0.g. u, = V,( i = 1,. . . , k). Choose u E {2,3} and b E L \ H with g' = b-', g . = be. Since by assumption b-'(g"-' * g'g, . . . g,-,)b" E H, by the malnormality of b over H we have g "-I * g'g, . . g V - ,= e. Therefore by (a) and (b):
--
-
-
g * g * = g" = g"
.. .g'g'-'. . . g " g y . ..g,-,g,. . . g , . . . g'g' . . . g, . . .g . . . gigm . g'g, . . - g,g, . g')-'(g,. . . gV-,)-' = e,
= (g v-I
')-Ig
"-1
* *
*
-
*
g"-l(gl . . . gJ'
-(gu-l..
a contradiction. The embedding property follows now from 1.9. The proof of (2) is exactly the same as the proof of Fact 2.4(ii). (3) Suppose n, n E w, rn # n, E E { - 1, I}, a , , az E K and (ab)"a,(ab)""az= e in L**.Let w = g l . * . gr be a canonical representation of the element (ab)"al(ab)''"azof L *. Obviously there is at most one i E { l , . . . , l } such that gi, b are good fellows over H in L. Since m # n, we have 1 2 1. Thus by (1) and the Main Theorem 1.9, g , . . . gr contains a long subword W o which is also a piece of some w , E R. It follows from (i) that there are at least two i E (1,. . .,I } such that gi, b are good fellows over H in L, a contradiction. The proof of (4) is left to the reader.
Proof of Theorem 2.9. As before M is constructed as the union of an increasing continuous chain ( M , a < N,) of countable groups, where no M, is finitely generated If M , has already been constructed, we choose a strictly increasing chain ( H , n E w ) of finitely generated
I
I
O n a problem of Kurosh
393
I
subgroups such that H o = {e} and M, = u { H , n E w } . By induction, we now define a strictly increasing chain (L, n < w ) of groups with the following properties: (a) I L n ) = N o . (b) L, n M, = H <+,, L.. (c) If a E Hn+,\Hn, b E L,\H,, 0 s rn < n < w, then (ab)", (ab)" are good fellows over H,+, in L.,,. (d) If b,, bz E L, \H, are good fellows over H,, then b,, bz are good fellows over H,+, in L n C l . In order to define L.+,, let ((h,,a,, a:, b : ) i E w ) be an enumeration of all (h, a, a', b') such that h E H,, a E H,+,\H,, a ' € H, \H, and b ' E L, \H, for some m < n. Using the induction hypothesis and the Lemma above, one can define a sequence (b, i E w ) of elements of L,\H, such that: (1) b E L n \ H n * (2) b, is a product of elements of {a;, b ; } . (3) b,, b ; are good fellows over H . (4) If j < i then 6, and b, are good fellows over H . By induction hypothesis "H, 6,L," and by the lemma, the symmetrized closure R of {h;' * w(b,a,,b h ) i E w } satisfies C'(lIl0) in L i + l : =H,+,*HnLn, and we may define L n + l : =L : + , / N , where N is the normal closure of R in L i , , and w.l.0.g. Ln+,n M, = Hn+,.The desired properties of L.,, follows from the lemma. This concludes the = n Ew}. main step of our construction, and we put Ma+, Suppose now that S is an uncountable subset of M = L { M , a < N,} and h is an element of M. Then there is an a < N, such that S fl (M,+,\ M,) # q and S fl M, contained in no finitely generated subgroup of M,. Choose m < n < w and a, a', b ' E S such that h E H,, a E Hn+,\Hn, a ' € H, \ H , and b ' E L, \H,, where the groups H,, L, are as in the construction of M,,,. By construction, h is a product of elements of {a, a', b'} in L.+, and hence in M.
1
1
I
1
u{Ln
I
References (11 S.I. Adian, Periodical products of groups, Trudy Mat. Inst. Steklov Vol. 142 (1976) (dedicated to 85th birthday of I.M. Vinogradov). 121 V.I. Arnautov, Non-discrete topologizability of infinite commutative rings, Soviet Math. Dokl. 11 (1970) 1307-1309. (3) V.I. Amautov, Nondiscrete topologizability of countable rings, Soviet Math. Dokl. 11 (1970) 423426. [4] L.A. Bokut, Imbeddings in simple algebras, Algebra and Logica, 1976, No. 2.
S. Shelah C.C. Chang and H.J. Keisler, Model Theory (North-Holland Publishing Co., Amsterdam, 1973). P. Erdos, A. Hajnal and R. Rado, Partition relations for cardinal numbers, Acta Math. Acad. Sci. Hungar. 16 (1%5) 93-1%. J. Hanson, An infinite groupoid which admits only trivial topologies, Amer. Math. Monthly 74 (1%7) 586-589. A. Kertsz and T. Szele, On the existence of non-discrete topologies in infinite abelian groups, Publ. Math. Debrecn 3 (1953) 187-189. S. Koppellberg and J. Tits, Comptes Rendus, 279, serie A, 1974, p. 583. A. Macintyre, Existentially closed structures and Jensen's principle, Israel J. Math. 25 (1976) 202-210. R. McKenzie, On semigroups whose proper subsemigroups have lesser power, Algebra Universales 1 (1971) 21-25. K.P. Podewski, Topologisierung algebraischer strukturen, a preprint. G. Sabbagh, Sur les groupes qui ne sont rkunion d'une suite croissante de sousgroupes proper, C.R. Acad. Sc. Paris Vol. 280, 766-Series A. P.E. Schupp, A survey of small cancellation theory, in: Word problems, Decision problems and the Burnside problem in group theory (Boone, Cannomito and Lyndon eds. (North-Holland Publ. Co., Amsterdam, 1973) pp. 569-590. P.E. Schupp, Small cancellation theory over free products with amalgamation, Math. Anna1 193 (1971) 255-264. S.P. Strunkor, Subgroups of periodic groups, Soviet Math. Dold 7 (1%8) 1201-1203. S. Shelah, Ekentially closed groups in W,, Submitted to Bull. of the Greek Math. SOC.
A.A. Markov, On free topological groups, C.R. (Doklady), Acad. Sci. USSR, (N.S.) 31 (1941) 299-301. Bull. Acad. Sci. USSR Ser. Math. [Izvestya Akad. Nauk., SSSR] 9 (1945) 3 4 4 (Russian, English summary). English transl., Amer. Math. SOC. Translation No. 30 (1950) 11-88; reprint Amer. Math. SOC. Transl. (1) 8 (1962) 195-273. [19] A.A. Markov, On unconditionally closed sets, Matematiceskii zbornik, 18 (60). No. 1 (1946) 3-28 (Russian, English summary).
S.I. Adian, W.W. Boone, G. Higman, eds., Word Problems I1 @ North-Holland Publishing Company (1980) 395400
THE WORD AND TORSION PROBLEMS FOR COMMUTATIVE THUE SYSTEMS H. SIMMONS Depariment of Mathematics. Universiiy of Aberdeen, Aberdeen
In this note I give a proof of t h e following result. Theorem. There are primitive recursive algorithms W P A , T P A such that for each commutative Thue system A = ( X ;R ) and words U, V on the alphabet X of A , (a) W P A will decide whether or not U = V in A and if so will produce an A-transformation of U to V. (b) T P A will decide whether or not there is an integer k 5 1 such that U k= 0 in A (where 0 is the empty word.) and i f so will produce such a k and an A-transformation of U k to 0.
In more concise but less precise language this shows that the word problem and t h e torsion problem for finitely presented Commutative semigroups are uniformly solvable in a strong sense. The solvability of this word problem has been known to some people for some time (for instance it follows easily from the residual finiteness of f.p. commutative semigroups), however it has not yet achieved the doubtful status of ‘folklore’. In fact the problem has been discussed recently in the Queries column of the Notices AMS, (see [ O ) ) . The response by Schein in [ O ] gives several references to earlier work by Russian authors on the algebraic structure of commutative semigroups and related decision problems. In the late 1950s S. Kochen also found a positive solution to the word problem for commutative semigroups. The method given here (which is essentially the same as the one sketched by.Bergman in [0]) was shown to me (around 1964) by J. Shepherdson, and is derived from an argument used in [3]. A similar method is used in [4] to solve the corresponding problem for commutative rings. I am grateful to W. Boone for suggesting that, because of its simplicity, t h e method ought to be written up. 395
H.Simmons
3%
Let A = (X; R ) be some fixed (but arbitrary) commutative Thue system. We will describe algorithms which solve t h e word problem and torsion problem for A. By inspection of these algorithms it will be clear that they are primitive recursive and uniform in A, and so we will obtain the above result. Let = { X I , . . . , x.}
x
and let the relations of A be
uI= v,,..., u,=v,
(so that U ,,..., U,, V,,..., V, are words on the alphabet X). We may regard xI,.. . ,x. not only as t h e generating letters of A but also as the indeterminates in the polynomial ring D = Q [ x l , . . . ,x,], and hence each word W o n X may be construed as a monomial in D. (The empty word 0 is construed as the identity 1 of D.) This means we may consider the polynomials
f,=
u,- v,( . . . (f , = u,- v,
of D (and the corresponding ideal I= Cf, . . .,f , ) of D ) . Now let U, V be arbitrary words on X and let f = U - V be t h e corresponding polynomial of D. The following lemma reduces the word problem for A to a problem concerning polynomials of D. Reduction lemma. Let A, f , , . . . ,f,, d,U, V, f be as aboue. (a) The following are equivalent. (i) U = V in A. (ii) f E JQ in D, that is there are polynomials g , , . . . ,g , of D such that (1)
f =glfl+.*-+g,f,
holds. (b) There is a primitive recursive algorithm which, when given polynomials g , , . . . ,g, of D such that (1) holds, will produce an A transformation of U to V.
Proof. In this proof it is convenient to use ' = ' for equality of words in A and ' = ' for equality of polynomials in D. Since A is a Thue system (not just a semi-Thue system) we may assume that A is symmetric, that is, for each relation Ui = Vi of A, Vi = Ui is also a relation. In other words, for each i €{l,..., r } there is some j E ( 1 , . . . r } such that Vi = Uj, Ui = V,. This means that U = V
Commutative
7 % systems ~
397
in A if and only if there is a sequence i,,. . . is of members of (1,. . . , r } and a sequence W,, ..., W, of words on X such that
u = WIU,, w, v,,= wru,, Wr v,2= (2)
= w,u,*
w,v,s= v.
Let us first prove (a: (i) j (ii)). Suppose U = V in A so there is a situation (2) above. Then in D we have
u = w,u,,
w, v,, = Wlf,, + wru,, = WIJ, +
= Wlf,, + WZj, + w2v#2 = W,J, + ’ . ’ + WSJ, + v -glf,
+ .. . + g,f, + v
where g,, . . . ,g, are suitable polynomials built up from t h e monomials W,, . . . , W,. Notice that these polynomials g l , . . . ,g, have integer coefficients. This verifies (ii). Now let us prove (a; (ii) 3 (i)) and (b). Suppose we are given polynomials g,, . . . ,g, such that (1) holds. Thus we can find an integer m 3 0 and polynomials h , , . . . ,h, with integer coefficients such that (m
+ 1)f = h l f l +
-
a
*
+
h,f,.
This gives us an equality
where W,, . . . , wk are monomials (words on X ) and m l , . . . ,mk are integers. Here and below the range of C and the range of the index j is {1,2,.. . , k}. Since A is symmetric we may assume that each mi 2 0 . Thus we can rewrite this equality as
H.Simsnons
398
In the following argument we should remember that the empty word is treated as the identity of D. Consider the monomial U occurring explicitly on the left of (3). Since all the coefficients are positive U must occur explicitly on the right of (3). so either U = V (in which case we are done) or there is some j (say j = 1) such that U = W,U,,. But then we have (4)
mu+ w , v , , + ~ ~ ~ w , v , , ~ ( ~ + I ) ~ + ~ ~ ~
where m; =
[tn;-l
ifj=1
if j # 1.
Now consider the monomial W,V,, occurring explicitly on the left of (4). As above W,V,, must occur explicitly on the right of (4), so either W ,V,,= V (and we are done) or there is some j (say j = 2) such that W, V,,= W,U,,. But then we have
for some suitable positive integers m Y , . . . ,m i . Now consider the monomial WzVi2occurring explictly . . . Clearly this procedure can continue for no more than m I + . . . + mk steps, and when the procedure stops we have built up an A transformation (2). ! This completes the proof of the lemma. Before we continue with the proof of the Theorem we note that M. Rabin has observed (in a letter to Shkpherdson) that the above argument with Hilbert's basis theorem implies that each finitely generated commutative semigroup is finitely presentable. To see this let (x,,.
. ., x m ;{Ui = V , : i E I})
be a presentation of such a semigroup A (so the index set I may be infinite). As above let D be the polynomial domain Q[x,, .. . ,x.] and let d be the ideal of D generated by { U i - V, : i E I}. Then as in the lemma, for each two words U, V on {xl ,.... x.}
U = V in A e U - V E d.
Commutative 7 % systems ~
399
But (since D is noetherian) d is finitely generated and so there is some finite J C I such that {Ui - V, : i E J } generates d.Thus we see that (X
, , . . . , x n ; { U i =V i : i € J } )
is a finite presentation of A . Now that we have the reduction lemma t h e algorithm WPA (of the theorem) is constructed easily using the following proposition of G. Hermann [l]. A more readable proof of this proposition can be found in [2].)
Proposition. There is a primitive recursive function m ( . , . ) such that for each polynomial f, f , , . . . ,f , of D = O[x,,.. . ,x.] of degrees at most d, if there are polynomials g , , . . .,g, satisfying (1) then there are such polynomials of degrees at most m ( d , n ) .
To complete the proof of the theorem we must obtain the algorithm TPA. This algorithm is constructed easily using WPA and t h e following lemma.
. . . ,f , be polynomials of D = O [ x l , .. . ,x, 1 of degrees at most d. Let d be the ideal of D generated by f , , . . . ,f , and let rn = m ( d + 1 , n + 1) + 1. Then there is an integer k 3 1 such that f k E d if and only i f f “ E d.
Lemma. Let f , f , ,
Proof. Suppose there is some integer k with f k E 1.We introduce a new indeterminate y and consider the polynomials 1- Yf,fl,.
..
,fr
of D [ y ] .These polynomials have no common root (since f k E a)and so, by the Nullstellensatz, there are polynomials h, h , , . . . ,h, of D [ y ] such that 1 = h ( 1 - y f ) + hlf, + * *
(5 )
*
+ h.f,.
The above proposition allows us to assume that each of h, h , , . . . ,h, has degree at most m ( d + 1, n + 1) = m - 1. We now replace y throughout ( 5 ) by y-‘ and, after multiplying up by y m , we obtain
y”
=
h’(y - f ) + h ; f , + *
* *
+ h:f,
where h ’, h ;, . . .,h are some suitable polynomials of D [ y 1. The desired result is now achieved by substituting f for y.
400
H.Simmons
This concludes the proof of the theorem. When I explained the above proof to Boone h e suggested that a similar technique might solve the word problem for commutative semiThue systems. I have been unable to decide whether or not this is so. However, for such systems there is a corresponding reduction lemma. The crucial observation is that a semi-Thue system may not be symmetric (for otherwise it is a Thue system) and so, in t h e above proof, we must assume that the integers mi are positive. This means that t h e given polynomials g l , . . . ,g, must have positive coefficients. Thus a positive solution to the following problem will show that the word problem for commutative semi-Thue systems is solvable. Problem. Is there an algorithm which, given polynomials f, f l , . . . ,f, of D = O[xl,.. . ,x,], will decide whether or not there are polynomials g,, . . . ,g, of D, each with positive coefficients, such that (1) holds.
Added 10 October 1977: I am grateful t o the referee A.L. Semenov for pointing out that the word problem for commutative semigroups is solved also in [ 5 ] . The method used in [ 5 ] gives an algorithm which is faster than the algorithm given here. References [ O ] American Math. SOC.Notices 23 (1976) 80, 126, 223. [ I ] G. Hermann, Die Frage der Endlich Vielen Schritte der Theorie der Polynomideale, Math. Ann. 95 (1926) 736 and 788. [2] A. Seidenberg, Constructions in algebra, America1 Math. SOC.Transactions 197 (1974) 273-3 13. [3] J.C. Shepherdson, Inverses and zero divisors in matrix rings, London Math. SOC. Proceedings (3) 1 (1951) 71-85. [4] H. Simmons, The solution of a decision problem for several classes of rings, Pacific J. Math. 34 (1970) 547-557. (51 E. Cardoza, R. Lipton and A.R. Meyer, Exponential space complete problems for Petri nets and commutative semigroups, 8th Ann. ACM Symp. on Theory of Computing, May 1976. pp. 50-54.
S.I. Adian, W.W. Boone, G. Higman, eds., Word Problems I1 @ North-Holland I Publishing Company (1980) 401441
EMBEDDINGS INTO FINITELY GENERATED SIMPLE GROUPS WHICH PRESERVE THE WORD PROBLEM Richard J. THOMPSON Universityof California, Berkeley
The main result of this paper is that a finitely generated group has a solvable word problem if and only if there is an embedding of the group (such that two elements not conjugate in the original group will not be conjugate in the embedding group) into a finitely generated simple group which is a subgroup of a finitely presented group. It is known that a finitely generated simple group is a subgroup of a finitely presented group if and only if it has a recursively enumerable set of defining relations, and that it has a recursively enumerable set of defining relations if and only if it has a solvable word problem. Boone and Higman recently obtained a result similar to t h e given result, but without any reference to conjugacy and with no requirement that the simple group be finitely generated. The construction used was quite different from those used here, the simple group being obtained as an increasing union of groups, making it infinitely generated. It should be noted that t h e constructions of 92 and 93 can be appiied in fact to arbitrary groups or to arbitrary countable groups (or groups which are just isomorphic to permutation groups of countable sets) respectively, so that the fact that these embeddings preserve t h e solvability of the word problem is only an incidental consequence of their well-behaved character. I n 94 the constructions of the previous sections will be put together to obtain the main result and some subsidiary results (including some old ones not concerned with the word problem); outside of $4 the only role recursiveness will play is in our remarking that certain procedures are effective (where we will rely on Church’s thesis). 00. Preliminaries
The notation used will be much as in [ 6 ) .Thus the set of all natural numbers is denoted by o,while every natural number n is considered to be identical with t h e set {0,1,. . . , n - 1) and (given two sets X and 401
402
R.J. 7kompson
Y )yX denotes the set of all functions from Y into X. However, here we will be concerned with o2(the set of all infinite binary sequences of length w ) and its subsets, rather than with "w. By "2 is meant the set of all finite binary sequences; thus it is the union of the collection of all sets "2, for n E w. The Greek letters p, a,and T will be used to designate finite binary sequences, and the letters a, p, and y will be used to designate infinite binary sequences. The length of a E "2 will be t h e unique n E w such that a E "2, and for this n we can write a = (a(O), a ( l ) ,. . . ,a ( n - 1)) (where no confusion results, this may be wrttten as (a(O)a(l)- .. a ( n - 1)) or even a(O)a(l) * * a ( n - 1)); if 7 is a sequence of length m, the concatenation of a with T will be the sequence (a(O),. . . , a ( n - l), T(O), . . . ,T ( m - l)), which will be designated by U"T (or even by UT, when no confusion results). In fact, for p E "2 U "'2 and a E "2 of length n the sequence a n p will be the function such that ( a " p ) ( i ) = a ( i )for i < n and ( a n p ) ( i )= p ( i - n ) for i 2 n, with a n p E "2 if p E "2. For p E 32 U "2 the formula a C p will be used to assert that for some Y E 52 U "2, a n y = p ; we will say that a is an initial part of p (or p begins with a).The notation a'"' will be used for the sequence obtained by concatenating a with itself n times (a"may be used when the context is clear). Thus (Ol5O2) is (011100). In the composition of permutations (and, unlike [6], of functions in general) we will take ( f g ) ( i ) to be the same as g ( f ( i ) ) . If x and y are two elements of a given group, their commutator is the element [x, y ] = x y x - ' y - ' ; so [x, y ] = 1 if and only if x and y commute (here "1" denotes the identity element of the group). The empty set is 0.
-
Definition 0.1. Fr is the set of all non-empty subsets K of "2 such that for every finite binary sequence a and a E "2, ana E K if and only if a E K. The elements of Fr will be called the finitely replaceable sets.
For K E Fr, the interval in K determined by a is {a E K : a C a}; when K is clear from the context this set will be designated by [a]. Definition 0.2. For K E Fr, Pa(K) (the partition group of K ) is the group of permutations of K whose members are the permutations F such that for every Q E K there are a,T E 3'2 with a C a and, for every P E K, F(unP)= T ~ P .
It is not hard to justify this definition by showing that if F and G satisfy the defining conditions then so do the permutations FG and F - ' . Let us say that a set 2 "2 is a partition set for K E Fr provided
Emkddings which prcseruc rhe word problem
403
that every a E K begins with exactly one element belonging to 2 (in u E 2,form a partition of K). other words, provided the intervals [a], For example, {(0),(1)) is a partition set for every K E Fr.
Theorem 0.3. For K E Fr let Z0,XI, J, be any triple where Z0 and Z I are partition sets for K E Fr and J, is a one-one map from Z0 onto ZI. Then the formulas F ( u " P ) = J,(u)"p (for u EZoand p E K ) define a permutation F which belongs to Pa(K). Moreover, every member of Pa(K) can be put in this form, with the further property that for any u, T E " 2 such that for every y E " 2 F ( a " y ) = T"Y there exists some p C u with p E .Z0. Proof. The first statement is clear from Definitions 0.1 and 0.2. On the other hand, suppose F E Pa(K), where K'E Fr. Then we let $'(a, T) (for u,T E " 2 ) mean that F ( a " P ) = ~ " whenever p p E K. We put
Z0={a:For
all p , p = u if and only if for some
T,
J , ' ( ~ , T )& p C
0).
As J,' is a function (taking +'(a, T ) and J,'(u)= T to have the same meaning), if we let J, be the restriction of J,' to So, and X I be the range of 4, we can see that So,2,, J, and F are related as required by the theorem.
Note that " 2 E Fr. Because of the compactness of the Cantor space " 2 , we have: Proposition 0.4. Every partition set of " 2 is finite. Proposition 0.5. For K E Fr, the finite partition sets of K coincide with the partition sets of " 2 . Proof. First, every partition set of " 2 is finite, by Proposition 0.4, and from the definition, every partition. set of "2 is a partition set of K. On the other hand, if 2 is a finite partition set of K E Fr there will be some n E w such that every element of 2 is of length G n. Given a E " 2 there is some T E " 2 of length n such that a = ~ " for p some p E " 2 . Now pick y E K. As K E Fr, T"Y E K so that for some u E 2, u C T " Y . AS u is of length G n and T is of length n, u C T and so u c ~ " =p a. So 2 is also a partition set of "2. Proposition 0.6. If 2 is a finite subset of " 2 such that [u] f l [ T ]= 0 whenever a,T E 2 and u f T , then 2 can .be extended to a finite partition set 2*for " 2 .
R.J. 'Ihompson
404
Proof. Given 2 as in the proposition, let 2*= 2 U { p : For every p ' , if p' C p and for every a E 2,[a] r l [ p ' ] = 0, then p' = p, and for every a E 2, [a] n [ p ] = 0). If p, v E 2 * and p # v, then [ p ]n [u ] = 0. This is clear from the hypothesis concerning 2 and the definition of 2 * , unless p, v & 2. But, in that case, if [ p ]fl [ u ] # 0 then either p C u or else u C p, which requires p = u by t h e definition of 2 * . As 2 is finite n. there is some n E w such that every element of 2 is of length Now, for every p E "2 of length n, there is some p ' E 2 * such that p ' C p. (If [a] n [ p ] # 0 for some a E 2 we can choose p' = a,as every member of 2 has length C n ; otherwise we can take p' to be that n [ p ' ] = 0 for every initial part of p of smallest length such that [a] a E 2.) This (with the earlier result) implies that every element of 2 * has length < n so that 2*is finite, and also implies that for every a E "2 there exists p ' E 2*with a E [ p ' ] (as a = p"p for some p E "2 and p E 32 of length n, and if p ' C p then a E [ p ' ] ) . Thus 2 * is a finite partition set. Proposition 0.7. If K E Fr and F E Pa(K) is not the identity, then for some a, T E "2 such that [ a ]n [ T I = 0 (in K ) for all a E K, F ( a " a ) = are partition sets T"CL Further, if 2,,, ZI, is any triple where 2,, and for K, is a one-one map from 2,, onto Z I , and F ( p " P ) = + ( p ) " p for every p E Z0 and p E K , then u and 7 can be chosen so that a E Z0, T E XI, and +(a) = T.
+
+
Proof. In view of Theorem 0.3, it is enough to prove the last statement. If F is not t h e identity then for some a'€XI,, a'# +(a'). If, in K, [a'] n [+(a')] = 0, then we can set a = a',T = +(a'). Otherwise, either a'C +(a')or $(a')C a' If +(a') C a',t h e n as +(a')# a' we have for y E K that exactly one of +(a')"(O)"y and + ( a ' ) " ( l ) " y 6elongs to [a']in K. As 2,, is a partition set for K, t h e other of these belongs to some interval [ p ] in K, where p E 2,) is such that $(a')C p, but p # a'.Then we can take a = p, T = + ( p ) , because [ + ( p ) )fl [+(a')] = 0 in K (as is one-one and Z I is a partition set for K ) and [ p ]C [+(a')] in K (as + ( a ' ) C p ) so that [ p ]n [ + ( p ) ]= 0. If a'C +(a'), the same argument applied to F-l shows that for some p E Z I , [ p ] n [ + - ' ( p ) ]= 0, and we can take a = + - ' ( p ) , T = p.
+
Proposition 0.8. I f K E Fr and 7 is the mapping with domain Pa("2) such that, for F E Pa ("2), 7 (F) is the restriction of F to K, then 7 is an isomorphism from Pa("2) into Pa(K). Proof. Given K E Fr and 7 as in the proposition, by Theorem 0.3 and
Embeddings which preserw the word problem
405
Proposition 0.5 we see that 17 maps Pa("2) into Pa(K). For a E K and F, G E Pa("2) we have (as F ( a ) E K if a E K , since K E Fr) that ( F G ) ( a ) =G ( F ( a ) ) =t 7 ( G ) ( F ( a ) ) =t7(G)(t7(F)(a))= (77(F)77(G))(a), so that 77 is a homomorphism. If F E Pa("2) is not the identity then by Proposition 0.7 there are a,T E "2 such that (in "2) [a] fl[ T I = 0 and . y E K we have (as for every p E "2, F ( a " p ) = ~ ~ Byp choosing K E Fr) both a n y E K and ~~y E K , so that ( v ( F ) ) ( a " y )= F ( a " y ) = ~ ~ while y ,c + " y # ~ ~ (or y the intervals [a]and [ T I of "2 would not be is an disjoint), so that v ( F ) is not the identity, and therefore isomorphism. For K E Fr, F t ( K ) will designate the set of all mappings belonging to P a ( K ) which correspond to a triple So, Z l , (I, as in Theorem 0.3 such that S,, (and thus X I ) is finite. From Theorem 0.3, Proposition 0.5, and Proposition 0.8 it is now apparent that Ft (K ) is a group (with composition of functions as t h e operation), which will be called the group of finite table permutations of K , and that Ft (K) is isomorphic to Pa("2). More precisely: Proposition 0.9. If K E Fr and 7 is as in Proposition 0.8, then 77 is an isomorphism which maps Ft ("2) = Pa("2) onto Ft(K) in such a way that for F E Pa("2) corresponding to a triple Z0, Z l , (I, as in Theorem 0.3 where Z,, and Z l are partition sets for "2 (and thus are finite) 7)( F ) corresponds to the same triple. As a consequence of Proposition 0.9, Ft (K) = Ft (K') for K, K ' E Fr and we can identify elements of Ft (K) and Ft (K') which correspond to the same triple Z,,, Z l , (I, (with Z,, and Z,finite). We conclude this section by pointing out some facts concerning permutation groups which will be needed later:
Lemma 0.10. Suppose that M = M , U M 2 , M I n M 2 = 0, and g and g' are permutations of M which both map M , onto M I and M 2 onto M2. Then : (i) If g ( x ) = x for every x E M I , while g ' ( y ) = y for every y E M2, then gg' = g'g. (ii) If for every x € : M I , g ( x ) # x and g ' ( x ) # x, while for every y E M 2 g ( y ) = y and g ' ( y ) = y, then for every permutation h of M such that h-'gh = g', h also maps M I onto M I and M 2 onto M2. Proof. (i): This follows by examining the cases z E M I and z E M 2 to show that (gg')(z) = (g'g)(z) for every z E M. (ii): Suppose z E M z but
R.J. Thompson
406
h ( z ) E MI.Then ( h - ’ g h ) ( h ( z ) ) =( g h ) ( z ) =h ( g ( z ) ) =h ( z ) (as g ( z ) = z, since z E M2),but g ’ ( h ( z ) ) # h ( z ) (as h ( z ) E MI).On t h e other hand, suppose z E MI but h ( z ) € M2.Then ( h - ’ g h ) ( h ( z ) ) =( g h ) ( z ) = h ( g ( z ) ) # h ( z ) (as g ( z ) # z , since z E MI), but g ’ ( h ( z ) ) =h ( z ) (as h ( z ) E M2).So h maps M2 into M2 and MI into MI, thus MI onto MI and M2 onto M2. 81. About table permutations
Throughout this section K will be a fixed element of Fr. A table will be a triple XI),Zl, JI as in Theorem 0.3; the table being called “finite” if X l l and 2,are finite. A line (which will be called a regular line in $3) will be an object u + 7 corresponding to an ordered pair (u, 7) where a,7 E Y 2 . We will say that the line u + 7 belongs to the table T just in case u E Zl), 7 E Zl and +(a)= 7, T being the triple Zll, Zl, JI. If T is a finite table it can be given just by listing all of the lines belonging to it; elements of Ft (K) will usually be given by listing the lines of a corresponding table. The left partition set of a table XI,, XI, JI is XI,; the right partition set is XI. As a practical matter, it is useful to be able to multiply members of Ft (K) by “composing” corresponding finite tables. If F, G E Ft (K) have t h e tables Zl,, Zl, JI and X I , X2,JI’ corresponding, t h e table XI,, X2,$4’ will evidently correspond to FG. Definition 1.1. If u + 7 is a line, then the direct extracts of u + 7 are u0+70 and u1+71. We immediately have: Proposition 1.2. l f the table r‘ is obtained from the finite table r by replacing a line belonging to r by its direct extracts, then f and r’ correspond to the same element of Ft ( K ) . Proposition 1.3. Suppose that the finite table Zl,, Zl, JI and m,n E w are such that the lengths of the longest sequences in Zl) and X I are =sm and s n respectiuely. Let Z:,and 2 : be, respectiuely, the set consisting of all binary sequences of length m and the set consisting of all binary sequences of length n. Then there exist partition sets Z and 2 and one-one functions JI’ and JI’ such that the tables Z& Z;, JI’ and Z:, Z:, JI* correspond to the same element of Ft(K) as Zl,, Z l , JI. Proof. There are only finitely many binary sequences of length less
Embeddings which preserw the word problem
407
-
than or equal to m or n ; thus by appealing to Proposition 1.2 we can, if Zll# ZA, choose a E Zo of length < m and replace a + ( a )in Zll, X I , by its direct extracts and repeat this process to obtain Z:,, Z:,+' - if Zl # ZT we can similarly start with T E Z l of length < n and replace +-'(T)+ T by its direct extracts, repeating this process to obtain Z:, ZT,+*
+
A table obtained from another table by an iterated process of replacing a line by its direct extracts will be called a refinement of the other table. Proposition 1.3 shows that if t h e finite tables Zll, Z,, and Z2, Z3, 8 correspond to F, G E Ft ( K ) we can, by choosing n E w such that every element of Z l U Z2 is of length < n, and taking 2 ; = 24 to be the set of all sequences of length n, find tables Z:), 2 { ,4' and Z;, Z;,8' which are refinements of &, ZI, and Z r , 23, 8 respectively, so that Z:), 2.4, +'8' corresponds to FG. Note that t h e process described in the proof of Proposition 1.3 is effective, SO that the word problem for Ft ( K ) is solvable (when t h e elements are given by finite tables): a finite table Z0, Z l , corresponds to the identity element if and only if is t h e identity, while we can effectively find a finite table corresponding to t h e product of two elements of Ft( K ) given by finite tables. Of course, if the finite table Zn, Z l , corresponds to F E Ft(K), then X I , &, I,!-' corresponds to F-'.
+
+
+
+
+
Theorem 1.4. For K E Fr, Ft ( K ) is finitely generated, with the elements E (given by the table O+OO, 10+01, 11+ l), D (given by the table O+O, 10- 100, 110- 101, 111- ll), P o (given by the table O-+ 10, 10-0, 11 -+ ll), r 1(given by the table 0-0, 10+ 110, 110+ 10, 111- l l l ) , and P? (given by the table 0-0, 10- 11, 11- 10) as one set of generators. Proof. First we show that E and D generate the subgroup P of Ft ( K ) consisting of all elements of Ft ( K )which are order-preserving, the order in question being the lexicographic order on K (that is, the ordering 4 such that, for a, p E K, a Q p if and only if a = p or for some p E "2 we have a E [PO] and p E [pl], p possibly being the empty sequence; thus K is simply ordered by 4).Note that whenever Zo and ,XI are two finite partition sets of the same cardinality there exists exactly one function 3/ such that ,Xo, Z l , J, is a finite table corresponding to an element of P; for a , T E $2 such that [a]n [TI = 0 either a Q p for every a E [a]and p E [TI or p 4 a for every a E [u] and p E [TI, so that (indicating the former case by u Q T and the latter with uo4 alQ * * case by T Q a ) if Zo has as elements a o r . .. ,
408
R.J. Thompson
--
4 an-]and Sl has as elements T ~. ., . ,T.-] with T~Q T~ Q 4 T " - ~ ,then $ must be the mapping defined by $(ai) = T ~ for , 0 6 i < n. Now let A" be (0) and let A, be (1"')O) and A *. be (1'")) for n E w with n # 0. For n E w let be {Ao,. . . ,An, A *,+ It is clear that is a finite partition set of cardinality n + 2. Let P* be the subset of P consisting of all elements of P corresponding to a finite table So, S , , where So is for some n E w. (In fact, P* is a subsemigroup of P.) From their definitions, E and D belong to P* and P. Let Wo=E and let W,,,+l= E"DE-" for m E w, so that W , = D. Now, for n 3 rn + 1, W,,, corresponds to the finite table Z,8 where S is the range of 8 and 8 ( A k ) = A k for k < m , e(A,,,)=h,,,O, e(A,,,+I)=A,,,l, d ( A k + i ) = A k for n > k > m , and @ ( A t+,)= A This can be shown for E and D by refining the tables which were used to define them, and for W, with m 2 2 by induction on m, as EW,,,E-' is easily computed. For p E P * , p belongs to the subgroup of P generated by E and D. This can be shown by induction: First, there is only one element of P * , t h e identity, which corresponds to a finite table whose left partition set is To. Now let us suppose that n 1 and that p corresponds to a finite table Zl, while every element of P* corresponding to a finite table whose left partition set is for some j < n belongs to the group generated by E and D. Let us choose (+ E 2 , such .that (+ is of maximum length. As n 2 1, (+ is either P O or p l for some p E "2 of length 2 1. Further, both p0 and p l belong to S , (since 2 , is a partition set and (+ is of maximum length). Case 1: p l is + ( A Z + , ) . Then, p 0 must be +(A,,), and if we define the function with domain rn-,by setting $ ' ( & ) = + ( A k ) for k < n and +'(A t) = p, and put 2 ; equal to the range of +', p will correspond to the finite table Zi, 4' (for replacing p0 and p l in ZIby p still leaves a finite partition set), and thus belongs to the subgroup generated by E and D. Case 2: p l is $(A,,,+l) for some m < n. Then, p 0 must be $(A,,,), and we define the function with domain by setting + ' ( A k ) = + ( A k ) for k < m, +'(A,) = p, $ ' ( A k ) = + ( A k + l ) for n > k > m, and +'(At) = +(At+,), and put 2;equal to the range of +', so that 2;is a finite partition set. If p * is the element of P* corresponding to the finite table r,,-,, S ; , +', then p * must belong to the subgroup generated by E and D. But it is now easy to compute that p = W,,,* p * , so that p also belongs to the subgroup generated by E and D. (Replacing the references above to the group generated by E and D by references to the semigroup generated by the W,,,, m 30, we see that this semigroup includes P* and so coincides with it.) Now, every r E P* corresponds to a finite table So,ZI, in which So and 2 , both contain n 2 elements for some n E w . Let p be t h e
r,,
r.
+
r,
r,,
z.
r.,
+,
r,
+'
r,,-,,
+'
r.-,
+
+
409
Embeddings which preserve rhe word problem
unique element of P which corresponds to a finite table f,,211, 5 and let 9 be the unique element of P which corresponds to a finite table f., ,XI, v. From p , 9 E P it follows that p , 9 E P*. Further, the element p - ' q corresponds to t h e finite table XI,, X I , 5 - l ~(as p - ' corresponds to the finite table 2,),f,,t-'), and thus must be equal to r, as both p - ' q and r belong to P (so that 5 - l ~is equal to +). As p and q belong to the group generated by E and D, so does p - ' q = r, and we conclude that P is generated by E and D. The definition of TI,, r I ,and r: has been given in the statement of the theorem. For m > 0 , let r,+,= Er,,,E-' and r:+, = E7r:E-I. Then for n > m, r,,,corresponds to the finite table f,,,f",8' where for k # m, m + 1 we have e ' ( h k ) = Ak with O'(A,) = A,+, and O'(A,+l) = A,, while r t corresponds to a finite table f,,f,, 0" where e " ( A k ) = Ak for k < n with O"(A,) = A t + I and e"(A *,+ I ) = A. For r ,this is true as inspection shows: t h e rest follows by computation and induction. Now, every g E Ft ( K ) corresponds to a finite table -XI,, Sl, in which -XI, and 2, both contain n + 2 elements and n > 0. As before, let p and 9 be the unique elements of P corresponding to finite tables f., -XI,, 6 and f., XI, v. Then ,$+v-' is a permutation of f, so that f,, f., (+v-' is a finite table corresponding to an element g ' E Ft (K). Consequently, as the tables involved can be directly multiplied to show that g = p-'g'q, to show that g belongs to the group generated by E, D, rlj,r I ,and r f it is enough to show that g' belongs to this group. But ,$+q-' is a permutation of f,,and is thus equal to a product of transpositions, in fact of transpositions which transpose A, and A,,, for some i < n or transpose A,, and A This means, in turn, that g' is a product of the elements r,,i < n, and r t . So g' and consequently g belong to t h e group generated by E, D, rlj,r l ,and r f ,which thus coincides with Ft (K).
+
Proposition 1.5. For K E Fr, Ft (K) is generated by two elements. Two such elements are r : r l D and r o r l m : . F t ( K ) is also generated by the four elements E, D, ro,and r f ,aud by the three elements E, D, and ror fro.
Proof. As E, D, rll,r i ,and r f are a set of generators by Theorem 1.4, and r l .= m O r f D - ' r l , D r f r oas , a computation will show, Ft(K) is generated by E, D, ro,and r f .In fact, it is generated by E, D, and r , l r f r las j , rlr = D - ' E ( r l , m : r n ) D ( m n m f r o ) D - ' and E r f= D ( r , , r f r , , ) D - ' E ( r o r f m 1 ~The ) D .last reduction is more lengthy. Let g l be r t r l D and let gz be rllrlr?. Put g 3 equal to (g:gl)3 and g4 equal to g:glg;glg2. and then set g5 equal to g3g4g3g:g3.(So g s = rt.)
R.J. 7kompson
410
Computation then shows that T ? = g2gsg:gl and D = g,T:gl and E = g:Dg2D.
TI)=
g : g s g : , while
Now we turn to results connected with simple subgroups of Pa(K). Proposition 1.6. For K E Fr there is for every non-identity element g E Pa(K) some h E Ft(K) such that h-'gh corresponds to a table 2 : . 2 ? , $ * such that $*((0)) = (10).
Proof. Let g be a non-identity element of Pa(K). Then there will be some table 2,),,El, $ corresponding to g ; we can assume that (0) 6Z 2,) and ( I ) 6Z Co,since a line 0- p could be replaced by lines 00- p 0 and 01 + p 1, and 1 p' could be replaced by 10- p'0 and 1 1 + p ' l without changing t h e corresponding element of Pa(K). By Proposition 0.6 there will then be some finite partition set 2 with a, T E 2, where a E ,El) and T E ,El have been chosen according to Proposition 0.7 so that $(a)= T and [a]f l [ T ]= 0. As a # T , 2 has cardinality 3 2 . In fact, 2 must have cardinality 3 3 , as if 2 had cardinality equal to 2 then we would have 2 = {@),(I)}, and so a = (0) or a = (I). Thus 2 has the same cardinality as some f., n 2 I (where f, is as in the proof of Theorem 1.4). Because n 2 1, (0) = A,, E f. and (10) = A I E f,. We can take $' to be some one-one mapping of 2 onto f, such that $ ' ( a )= (0) and $ ' ( T ) = (10). Then there is h E Ft ( K ) corresponding to t h e finite table 2, f., $'. Now, for p E K , ( h - ' g h ) ( ( O ) " p=) ( g h ) (h-'((O)"P))= ( g h ) ( a " P ) = h ( P p ) = (lo)"& so by Theorem 0.3 there will be a table 2 $ , 2 : , $ * corresponding to h - ' g h such that for some p C (O), p E 2 : . Thus (as h-'gh, like g, is not t h e identity) (0) = p E 2 : and we must have $*((0)) = (10).
-
Proposition 1.7. For K E Fr, if H is a group juch that Ft ( K )C H C Pa(K) then every non-trivial normal subgroup of H contains rhe elemenr of F t ( K ) corresponding to th.e finite table 00-000, 010-001, 011 -01, 1000- 100, 1001 1010, 101 1011. 11 1 I .
-
- -
Proof. Suppose H is as in the proposition, and N is a normal subgroup of H with g E N and g not equal to t h e identity. By Proposition 1.6 there will be some j E N corresponding to a table 2 : , 2 ? , $ * such that (0)E 2 : , (10)E 27, and $*((0))= (10). Let f E F t ( K ) correspond to the finite table 0-0, 100- 1000, 1010- 1001, 1011- 101, 11 11. Then, jfi-' is t h e element corresponding to t h e finite table 00-000, 010+001, 011 +01, 1- 1. For, given any p E K we have j((l)"p) 6f [lo] so that ( j f j - ' ) ( ( l ) " p= ) j - ' ( f ( j ( ( l ) " p ) )=)
-
Embeddings which preserve the word problem
411
j-'(j((l)"P)) = (I)"@. On the other hand, if (T is (0), (lo), or (11) and (T* is (OO), (Ol), or (1) respectively, then (ifi-')((O)"a"p) = Cf,-')((lO>"a"P)= j-'((lO>"a*"P) = (O)"a*"P. AS f E F t ( K ) c H, [ j , f ] E N, and it is easily computed that [j,f] = (ifi-')f-' corresponds to the finite table given in the proposition. Proposition 1.8. For K E Fr, if H is a group such that Ft ( K )C H C Pa(K) then Ft (K) is a subgroup of every non-trivial normal subgroup of
H.
Proof. Let H be as in the proposition. It is enough, in view of Proposition 1.7, to show that Ft (K) coincides with the normal subgroup N of Ft (K) generated by the element k of Ft (K) which is mentioned in Proposition 1.7. Now, let g , be D - ' E D - ' E - ' D . Then, as k E N , [k,gl]= k g l k - ' g ; ' E N. Computation shows that k g l k - ' g ; ' = D -'ED - I EDE -'D * = ( D-2E ) ( D- I EDE - ') ( E - I D ') ; D - I EDE - I is conjugate to this and so belongs to N. Next we observe that D - ' E N as D - ' = (D.rrT.rr"E)-'(D-'EDE-')(D.rrTik,,E), and E - ' D = (.rra.rrl.rrT)-'(D~'EDE-')(.lrl,.rrl.rrT) so that E - ' D E N. So D E N and E = ( ( E - ' D ) D - ' ) -E' N. Thus, since (.rr,,.lrTE)-'E(.lr0r~E)E N there N . So, as .rri'D.rro~ N, follows .rro.rrT = E(no.lr:E)-'E(.rr,,.rrTE)~ .rro.rrT.rr,, = E-'(.rr,,.rrT)D-'(.rrn'D.lro)E-'(.rr[,.rrT)D-' E N. Since, by ~ , lgenerators of Ft(K), N must Proposition 1.5, E, D, and . r r ~ l ~ Tare coincide with Ft (K).
Corollary 1.9. For K E Fr, Et ( K ) is a simple group. Proof. Take H
=
Ft(K) in Proposition 1.8.
Theorem 1.10. For K E Fr, is H is a group such that Ft ( K )C H C Pa(K), then Ft (K ) is a subgroup of the commutator group of H , and this group is a subgroup of every non-trivial normal subgroup of H. Proof. Let H be as in the theorem. As Ft (K) is a non-Abelian group, so is H. Thus the commutator group of H is a non-trivial normal subgroup, and so has Ft(K) as a subgroup by Proposition 1.8. Now let N be a non-trivial normal subgroup of H, so that Ft(K) C N by Proposition 1.8. Given f , g E H, by Proposition 1.6 there will be h,,,hl E Ft(K) such that hi'fh,, and h ; ' g h , correspond to tables Zb, 2 { , and Z:, z:, respectively with (0)E Zb, (10)E (0)E z f , (10)E Z?,and +'((O)) = (10) = +*((O)). Let x = hi'fhll.rroand y * = h;'gh,.rr,,. Then, for every /3 E K, the restriction of x to [(O)] is the identity mapping, as x ((0)"P) = .rro((hi'fh,J((O)"P1) = . r r ~ ( ( l O ) ~=P(O)"P, )
+'
+*
zi,
412
R.J. Thompson
while x permutes t h e elements of [(l)]; similarly, t h e restriction of y * to [(O)] is t h e identity map. Note that r o r : E corresponds to t h e finite we then have, for every table 0 + 1, 1-0. If we set 7~ = rI,r?:E p E K, ( r - ' y * r ) ( ( l ) " p )= ( y *r)((O)"p) = r ( ( 0 ) " p )= (l)"p, so that the restriction of r - ' y * r to [(l)] is t h e identity mapping, while r - ' y * r permutes t h e elements of [(O)]. If we let y = r - ' y * r , we then have by Lemma 0.10 (i) that x y = yx. So [f,g ] = fgf-'g-' = f g y - ' x - ' y x f - ' g - ' = (fx-')(x(gy-')x-')(y(xf-')y-')(yg-') E N, N being a normal subgroup of H,provided that ( f x - I ) , ( g y - ' ) E N (so that (xf-I), ( y g - I ) E N ) . But f x - ' = fr;'h;'f-'h,, = ( f ( r ; ' h i ' ) f - ' ) h o E N (since .rri'hi', h o E Ft(K)G N) and gy-' = g.rr-'.rri'h;'g-'h,a = (g(.rr-'ni'h;')g-')hl.rr E N (since .rr-'.rri'h;', h l a E F t ( K ) c N). As f and g were arbitrary elements of H,we conclude that t h e commutator group of H is a subgroup of N, and the last part of the theorem follows. As a matter of act, for H as in t h e last theorem the commutator group of H is simple; this result is not needed later however, the following corollary being sufficient.
Corollary 1.11. For K E Fr, if H is a subgroup of Pa ( K) a n d H is equal to its commutator group, then the subgroup of Pa(K) generated b y H U Ft (K) is simple. Proof. Let H be as in the proposition, and let B be the subgroup of P a ( K ) generated by H U F t ( K ) , so that F t ( K ) c B C Pa(K). Then t h e commutator group of H is a subgroup of t h e commutator group of B, so that H is a subgroup of the commutator group of B, and Ft (K) is a subgroup of the commutator group of B by Theorem 1.10. Thus B is equal to t h e commutator group of B, and from Theorem 1.10 it now follows that B is simple. 92. Splinter groups Definition 2.1. Suppose K E Fr and Y is a group of permutations of a set L. Then S p ( Y , K), the splinter group corresponding to Y and K, is t h e group of permutations of L x K generated by the set Y = {g : g E Y } , where g ( ( x , a))= ( g ( x ) ,a ) for g E Y, x E L, and a E [Ol] and where g ( ( x , a))= ( x , a ) for g E Y, x E L, and a 6Z [Ol], and t h e set Ft ( K ; L ) = {F : F E Ft (K)}, where F ( ( x , a))= ( x , F ( a ) ) for all x E L and a E K.
Embeddings which preserve the word problem
413
Note that both Y and Ft ( K ;L ) are subgroups of Sp(Y, K ) , and that Y = Y and Ft ( K ;L ) = Ft ( K ) . The name "splinter group" refers to the way in which t h e action of an element of Sp(Y, K ) on a subset of L x K whose projection on K is contained in a sufficiently small interval in K is determined by a single element of Y; see also Proposition 2.4. Throughout the rest of t h e section we assume that Y is a group of permutations of a set L and that K E Fr. Proposition 2.2. For every g E Y, if F,, FzE Ft ( K ) are given by the finite tables 00-000, 010+001, 011-01, 1 + 1 and 00-00, 01-010, 10+011, 11+ 1 respectively, then [g,F,]= Proof. We can show that ( [ g ,F , ] ) ( ( x ,a ) )= ( F ; ' g F 2 ) ( ( x ,a ) ) for arbitrary x E L and a E K by checking that this holds in the four separate cases of a E [OO], a E [OlO], a E [Oll], and a E [l ]. Theorem 2.3. Sp( Y, K ) is equal to its commutator group. Proof. This follows from Proposition 2.2, which implies that Y is contained in the commutator group of Sp(Y, K ) , and the fact that Ft ( K ;L ) Ft ( K ) , which is simple (and so equal to its commutator group, as i t is non-Abelian) by Corollary 1.9.
Let us now define a splinter table to be a quadruple 20,XI, $, 5 such that Z:,,, Z,, $ is a finite table and 5 is a mapping of Z:,, into Y. By the permutation of L x K corresponding to the splinter table XI,, Z I , $, $, is meant that permutation Q of L x K such that for cr E Z,,, a E K , and x E L, Q((x, a " a ) )= ( ( 5 ( a ) ) ( x ) ,$ ( a ) " a ) . Proposition 2.4. Sp( Y, K ) is equal to the set of permutations of L x K corresponding to splinter tables. Proof. First of all, every splinter table corresponds to an element of Sp(Y, K ) . For, given a splinter table Z,,, Zl, 4, 6, t h e corresponding permutation of L x K is the product of the permutations corresponding to the splinter tables Zll, Z,,, $*, 6 and XI,, El, $, (*, where $ * is the identity mapping on Zll and 5* is the mapping whose domain is Z l , and whose range consists of a single element, the identity of Y. The permutation corresponding to the second table evidently belongs to Ft ( K ;L ) . As for the permutation corresponding to the first table, if Z,,= { u l , .. . , u"},take for 1 S i s n the mapping ti to have domain Z:,,
414
R.J. 7hompson
with < , ( a , )[((T,) = but with & ( a , being ) t h e identity of Y for j # i. Then this permutation will be equal t o t h e product o f permutations corresponding to splinter tables C,,, XI,, Q*. 5, for 1 S i S n (of course, by Lemma 0.10 (i) these later permutations will commute), s o it will be enough to remark that such p e r m u t a t e a r e , when ,Vl, has cardinality greater than 1, equal to elements FT'{(a,)E of Sp(Y,K) where t ( a i ) E and E is an element of Ft(K;L) which maps L x [Ol] onto L x [a,]. When C,,has cardinality 1, the permutation corresponding to C,,, Z,,, Q*,5, will also corespond to {(O), (l)}, {(0),(1)},Q', [i, where Q ' is the identity on { ( O ) , (l)}, and [{((O)) and [{((l)) are both equal t o that element of Y which is the range of tl; this reduces t o the previous case. On the other hand, the elements of Ft(K; L ) correspond t o those C,,Q, [ where Z,,, C,,4 is a finite table and t h e splinter tables C,,, range of 5 is the identity element o f Y, while g E corresponds t o the splinter table {(OO), (01). (I)}, {(OO), (Ol), (I)}, Q*,[ * where Q * is the identity map and [*((()I)) = g with [ * ( ( O O ) ) and [*((I)) being equal to the identity of Y. So to show that every element of Sp( Y , K ) and F t ( K ; L ) corresponds t o a splinter table it is enough (with being closed under inverses) to show that t h e product of permutations corresponding to splinter tables also corresponds to a splinter table. Now, if a E Z,, the splinter tables C,,, C,,Q. [ and Z,',Xi, , Q', 5' correspond to the same permutation when Z'b, Xi, Q' is obtained from Z,,, C,,II, by replacing the line (T Q(a)by its direct extracts, and [' is obtained by setting 5 ' ( p ) = [ ( p ) for p # (TO, ( T I , with [ ' ( a ( )=) ['(al) = [(a). So splinter tables can be refined, just like finite tables. Thus, by refining the right partition set of one splinter tableand the left partition set of another splinter table until they are equal, we can multiply the two splinter tables: for the product of the permutations corresponding to the splinter tables C,,, C,,Q, [ and Z,, Zz,Q', 6' is Z2,$4'. (*, the permutation corresponding to the splinter table Z,,, where ( * ( p ) is, for p E Z,,, [ ( p ) - ( Q [ ' ) ( p ) . (The inverse of the permutation corresponding to Z,,, Z l , Q, (,, is the permutation Z,,, I,!-', where ( , ( p ) = ([ll($-l(p)))-' for p E corresponding to Z,, 21.)
v
-
el,
Definition 2.5. An embedding of a group G into a group H is a Frattini embedding just in case, whenever g, g ' E G are conjugate in H . they are conjugate in G. Theorem 2.6. Suppose that only the identity element of Y has any fixed points. Then the embedding of Y irito Sp(Y, K ) is a Frattini embedding.
Embeddings which preserve rhe word problem
415
Proof. Suppose a, b E Y and g E Sp( Y, K ) are such that g-'iig = b. If a = b, obviously ci and 6 are conjugate in Y. If a # b, then both d and b are distinct from the identity permutation of K x L and map K x [OI] onto itself but have no fixed points on this set (because, for x E L, a ( x ) , b ( x ) # x ) , although they are pointwise fixed o n t h e rest of K x L. Thus Lemma 0.10 (ii) applies and g must map K x [Ol] onto K x [OI]. By Proposition 2.4 there will be some splinter table Z(,, Z l , $, 6 corresponding to g. In fact (because we can refine a splinter table if necessary) we can assume that there is some a E Zl, such that (01) C m. As g maps K x [Ol] onto K x [Ol] it follows that (01)C +(aj.If we put h = 6(a)we have for all x E L and p E K that (g-'iig)((x,$ ( a ) " P ) ) = ( a g ) ( ( h - ' ( x ) on@>) , = g((a(h-'(x))a , " P ) ) = ( h ( a ( h - I ( x ) h+ ( a ) " P ) , while b ( ( x ,+(a)"p)) = ( b ( x ) ,+ ( m ) " p ) , so that for all x E L, b ( x ) = h ( a ( h - ' ( x ) ) ) =( h - ' a h ) ( x )and consequently b = h - ' a h . As h E Y, a and b are thus conjugate in Y, so ii and 6 are conjugate in Y. Proposition 2.7. If the word problem for Y is solvable, so is the word problem for Sp(Y, K ) . Proof. We can appeal to Proposition 2.4 to see that the word problem for Sp(Y, K ) is solvable if we have an effective means of multiplying and inverting splinter tables and determining whether or not a splinter table corresponds to the identity mapping of K x L onto K x L. This last determination can certainly be made effectively if the word problem for Y is solvable, for a splinter table Zo, Z l , $, 6 corresponds to the identity mapping of K x L onto K x L just in case XI = Zll, $ is the identity mapping of Z(, onto Z(,, and 6 is the mapping whose range consists of the identity element of Y . The last half of t h e proof of Proposition 2.4 shows that splinter tables can be multiplied and inverted effectively in terms of Y (the word problem being solvable for Ft(K; L ) = F t ( K ) ) .
Although no proofs will be given here, it is worth remarking that Sp(Y, K ) is such a well-behaved extension of (an isomorphic image of) Y that it has a finite presentation if Y has a finite presentation. On the other hand, Sp(Y, K ) has the interesting property (which is not hard to prove) that the countable weak direct product of a collection of subgroups of Sp(Y, K ) is always isomorphic to a subgroup of Sp(Y, K ) . This property is also possessed by F t ( K ) ; in the former case Proposition 2.4 can be used, while finite tables are used in the latter case.
416
R.J. Thompson
03. The main construction Throughout this section J will denote the set of all positive integers, and K O will denote the subset of " 2 consisting of every (Y E "2 such that there is, for every n E w, some m > II with cr(m)# a ( n ) . (Thus K,, consists of every sequence belonging to " 2 which is not "eventually" either 0 or 1.) It is easy to see that K u EFr. Note that every P E Ko is equal to (I)'"(O>'"(I>"P' for some unique i E w, j E J, and P ' E KIJ;by t h e index of p will be meant j , and by the parity, the integer j - i. For t h e remainder of this section A will be a fixed group of permutations of J, possibly satisfying these conditions: C1: The word problem for A is solvable. C2: Given g E A and j , k E J it is always effectively decidable whether or not g o ' ) = k. C3: If g E A is such that { j E J : g o ' ) 2'j } is finite, then g is t h e identity of A. C4: If g E A is not the identity of A, then go') # j for every j E J. For g E A let g * be t h e function with domain K,, such that, for P E K,, with parity not equal to 0, g * ( p ) = p, while for (Y E KOand m EJ, g*((1)""(0)""(l)"a)= (l)~cm)n(0)R(m)n(l)"(Y. As g is a permutation of J , g * is a permutation of K,, which maps the set of elements of parity 0 onto itself. Further, in view of Definition 0.2, g * E Pa(&). If we set A * = {g* : g E A } it is evident that (taking the group operation for A * to be functional composition) we have A = A *, with A * a subgroup of Pa(K,,). In order to deal with t h e subgroup of Pa(&) generated by A * U Ft (KO) using objects finitely expressible in terms of A we have devised A-tables. Thus t h e reader should be careful to note that t h e constructions to be given in connection with A-tables are effectively obtained, granted that C1 and C2 hold. Let F i ( A ) be the set of all functions whose domain is a finite, non-empty subset of J and whose range is a subset of A. For b E F i ( A ) let ( ( b ) = { ( m + l , g ) : ( r n , g ) E b}; then ( ( b ) E F i ( A ) . For i E w , by ("'(b) will be meant ((('(b)), ("(b) being b.
Definition 3.1. An element b of F i ( A ) will be called n-bolsrered, for n E w, just in case for every (m, g ) E b and every j E J such that j 3 m we have g ( j ) + n 3 m.
Embeddings which preserve the word problem
417
Lemma 3.2. Suppose that b E Fi ( A ) is n - bolstered. Then : (i) For n' 3 n, b is n'-bolstered. (ii) For i E w, ( ' ( b ) is ( n + i)-bolstered. (iii) For every s E J there is some r E J , r 3 s, such that ( ' ( b ) is ( n + r - s)-bolstered.
Proof. W e obtain (i) a n d (ii) directly from Definition 3.1. For (iii), given s E J let us first set m * equal t o the largest element in the domain of b. As the range of b is finite, and every g E A is a permutation of J, there must b e some r 3 s such that for every x 3 r we have g ( x ) 3 m * + s for each g in the range of b. T h u s if ( m ,g') E ( ' ( b ) we will have r < m s m * + r so that, for every j 3 m , g ' ( j ) + ( n + r - s ) 3 ( m * + s ) + ( n + r - s) 2 m * + r 3 m. This yields (iii). Note that, given s in Lemma 3.2 (iii), r can b e obtained effectively when C2 holds: for each g belonging to t h e range of b we compute in succession g(l), g ( 2 ) ,. . . until every positive integer less than m * + s has appeared (which must happen as we are dealing with permutations); then r can b e taken to b e t h e smallest integer which has not been used a s an argument in o u r computations. By a regular line is meant what was called a line in $1; in this section a line will b e either a regular line or an A-line,-defined as follows: Definition 3.3. An A -line is a quadruple a, b, T , n such that a, T E "2 and b E F i ( A ) , while n E w is such that b is n-bolstered; it will b e written as a, b + T , n. Definition 3.4. (i) T h e line function corresponding to a regular line a+ T will b e that function x with domain [a]such that x ( a " P ) = ~ " p for p E K,,. (ii) T h e line function corresponding to an A-line a, b + T , n is that function x with domain [a]such chat for p E K O with parity not p , for p E KO belonging to the domain of b, x ( a " P ) = ~ ~ ( l ) " "while such that p has parity m (where m belongs to the domain of b ) SO that p = (I>""'"(0)"(1)"p' for some p ' E KO and k a m , x ( a " P ) = n( 1 ) ( R ( k P - m )n(0)8(k"'(1)"p', where g = b ( m ) . (Observe that from k 3 m and b being n-bolstered it follows that g ( k ) + n 3 m.) T h e left sequence of a line a + T or a, b + T,n will b e a,a n d t h e righr sequence, T. Definition 3.5. A partial ruble is a finite set of regular lines a n d A -
418
R.J. 7Xompson
lines such that all of the left sequences of these lines are distinct and the intervals in K o they determine are all disjoint. Definition 3.6. The left sequence set of a partial table is the set consisting of all the left sequences of lines belonging to it. Definition 3.7. An A-table is a partial table whose left sequence set is a finite partition set for K,,. By the function defined by a partial table or A-table will be meant the union of the line functions (which is again a function, as the left sequences of t h e lines determine disjoint intervals in Kll). For an A table this corresponding function will have a domain equal to K,, (the left sequence set being a finite partition set for K,,); as a line function is always a one-one function an A-table will correspond to a one-one function just in case the ranges of the line functions (for the lines belonging to t h e A-table) are disjoint. Note that, if a regular line appears in a finite table corresponding to an element of Ft ( K , ) ,this element of Ft (Kl,) will be an extension of t h e line function corresponding to the regular line, which gives: Proposition 3.8. If an A-table and a finite table have the same lines belonging to them (which will thus all be regular lines) then they correspond to the same element of Ft ( K O ) . Proposition 3.9. For every A-table having only regular lines and corresponding to an element of Pa(K,,), this element belongs to Ft (Kll). Proof. An A-table has a left sequence set which is a finite partition set; if all its lines are regular it can only define an element of Pa(Ko) if the set of t h e right sequences of its lines is a finite partition set of the same cardinality as its left sequence set and the set of its lines. In this case there is a finite table with the same lines as the A-table. From t h e relevant definitions we directly obtain: Proposition 3.10. For g E A, i f b = ((1,g)) then b E Fi(A), b is 0bolstered, and the A -table with the lines (0) +(0) and ( I ) , b + (I), 0 corresponds to g * E A *. Proposition 3.11. For k E J and u,T E e2, if b E Fi( A) is n-bolstered for n E w (and so ( n + k)-bolstered) then a,b + 7, n + k and a,b + ~ " ( 1 ) ~n. determine the same line function.
Embeddings which p r e s e w rhe word problem
419
The direct extracts of a regular line have already been defined, in Definition 1.1. For A-lines we make the following definition: Definition 3.12. The direct extracts of the A -line u,b + 7,n are, where m * is the largest element in the domain of b, the regular line u"(0)"'+l-+7n(l)nn(0)m*+I, and for 0 < m S m * the regular line u"(O)""(l)-+ 7n(l)nn(0)mn(l)(when m does not belong to the domain O ) ~ ( m~ ) ~ ( l ) of b ) or the regular line u"(O>""(l)+ ~ ~ ( l ) ( ~ ( ~ ) + ~ - ~ ) ~ ((when belongs to the domain of b and b ( m )= g ) together with the A-line a"(l),((b)-+ 7,n + 1. (Noting that ( ( b ) is ( n + 1)-bolstered by (ii) of Lemma 3.2.) Definition 3.13. An extract of a line is either the line itself or a line obtained from it as the last in a series of direct extracts, starting from the given line. From Definition 1.1 or Definition 3.12, using Defin'ition 3.5, etc., we obtain: Proposition 3.14. The function determined by the partial table consisting of the direct extracts of a line is the line function corresponding to the line.
Corollary 3.15. The line function corresponding to a line is an extension of every line function corresponding to an extract of the line. Definition 3.16. Suppose n E w and b, c E Fi(A). Then t h e n-alignment of b and c, or al(n, b, c), is the function whose domain is the union of the domains of b and ["(c), while (al(n, b , c ) ) ( r n ) = ( b ( m ) ) .( ( l " ( c ) ) ( m )for ) m belonging to both domains, (al(n, b , c ) ) ( m ) = ( l " ( c ) ) ( m )for m belonging to t h e domain of [ " ( c ) only, and (al(n, b, c ) ) ( m )= b ( m ) for m belonging to the domain of b only. (Thus al(n, b , c ) € F i ( A ) . ) Further consideration of the separate cases of the definition shows: Proposition 3.17. Zf b, c E Fi(A), n, n' E w, b is n-bolstered, and c is n'-bolstered, then al (n, b, c ) is (n + n')-bolstered. Definition 3.18. Suppose p, a, 7 E "2, a'nd b, c E Fi(A), with b being n bolstered and c being n'-bolstered. Then the (symbolic) composition of lines is stipulated to be such that:
R.J. Thompson
420
(1) ( a + ~ ) . ( ~ + p ) = ( a - + p ) . (2) (a+ 7 ). (7,b + p, n ) = (a,b + p, n ). (3) (a,b + T , n ) * ( T p ) = (a,b + p, n ) . ( 4 ) (a,b + T , n ) . (7,c p, n ' ) = (a,al(n, b, c)+ p, n -+
-+
+ n').
Proposition 3.19. The line function corresponding to the composition of two lines is the same function as the composition of the two line functions which correspond to the lines. Proof. This is clear enough for the cases (l)-(3) of Definition 3.18, using the definition of line function. For (4) we have to consider the effect of the two compositions on a"P, where /3 E K O ;there are four subcases depending on whether or not the parity m of /3 belongs to the domains of b or l " ( c ) , and each is easily checked. Just as in §1, an A-table (or partial table) obtained from another A-table (or partial table) by an iterated process of replacing a line by its direct extracts will be called a refinement of the other table; if, in this process, we also allow the replacement of a line a, b -+ T , n + k (where k E J ) by t h e line a, b -+ ~ " ( l ) n~ ,when b is n-bolstered, we will call the result a semi-refinement. Appealing to Proposition 3.14 and Proposition 3.11 we now see that: Proposition 3.20. If an A-table (or partial table) is obtained from another A -table (or partial table) as a refinement or semi-refinement, it corresponds to the same function. Definition 3.21. T a ( A ) is the subset of Pa(&) consisting of all elements of Pa(&) which are defined by A -tables. Definition 3.22. If a E " 2 is equal to p "(1) for some p E " 2 it will be said to be (Z)-ending;a line whose right sequence is (1)-ending will also be said to be ( 2 )-ending. Proposition 3.23. Every extract of an A-line with left sequence a is either a regular line or has a left sequence equal to ~ " ( 1 for ) ~ some k E w. Proof. This follows because, given an A-line with left sequence a, the only direct extract which is an A-line has ~ " ( 1 ) as its left sequence while all extracts of regular lines are regular lines.
Embeddings which preserve rhe word problem
42 1
Lemma 3.24. Suppose a,u'E " 2 with u'C u while a', b --* T, n is an A line. If u is either (1)-ending or else, for some p E " 2 of length t, is equal to p"(O)"'+' for some m E J which is greater than every element of the domain of b, and u'C p in the second case, then there exists exactly one line which is a n extract of this line and has u as a left sequence.
Proof. The intervals in K,, determined by the left sequences of the direct extracts of a regular line or A-line are all disjoint; as these left sequences are distinct from the left sequence of the line itself (but are extensions of it), it follows that there can be at most one line which is an extract of a given line and has a given left sequence. It remains to show that whenever u, u', b, T, and n are as in the lemma there is some line with u as its left sequence which is an extract of the A-line u',b + T, n. Let u" be the element of " 2 of maximum length such that u"C u and u" is either u' or is equal to ~ ' " ( 1 ) ' for some s E J. Then a''is the left sequence of an A-line which is an extract of the A-line u',b + T, n, being either this line itself or, for some s E J , the line u'"(l)', l ' ( b ) - , T,n + s. So if a"= a, the lemma holds. Otherwise, ~ " " ( 0C ) u..Then, if u is (1)-ending, there will be a regular line which is a direct extract of the A-line with left sequence d" and which has u as an extension of its left sequence; this regular line will then have an extract which has u as its left sequence, and t h e lemma will again hold. Finally, if u is equal to pn(0)"'+' for some p E " 2 of length t and m E J which is greater than every element of the domain of b, while u'c p, then u"C p (since u"C a ) so'that if u" is ~ ' " ( 1 ) ' for s E J then s < t and thus m + s m + t so that m + t is greater than every element of the domain of l ' ( b ) , just gs m is greater than every element of the domain of b ; consequently, there will be (as u " C p and ) u )a direct extract of the A -line w i t h left sequence a'' also ~ " " ( 0C which is a regular line and has u as an extension of its left sequence, and so has an extract whose left sequence is u, so that the lemma holds in this case also. Lemma 3.25. Suppose f I is an A -table or partial table. Then, for every k , m E J there is a n A -tGble or partial table, respectively, which is a semi-refinement of r ,'and in which every line has a right sequence of length 2 . k and is either (1)-ending or else is a regular line with a right sequence equal to P ~ ( O ) ~ +for ' some t E J and p E "2 with length t. Proof. First we show that there is a semi-refinement f 7 of r , in which every A-line has length 3 k and is (1)-ending. Suppose that, for n E J , there are exactly n A-lines in f l which are either not (1)-ending or are
422
R.J. 7hompson
of length less than k. Let a, b + 7 , j be one such A-line. By (iii) of Lemma 3.2 there is some r E J with r z=k such that l ' ( b ) is (j+ r - k)-bolstered. For 1 c m C r let f,,,,, be obtained from f,,,by replacing the A -line belonging to f,,,which is an extract of a, b + 7, j by its direct extracts. Then r,+l has ~"(1)'.l ' ( b ) + ~ ,j + r as one of its A-lines (with this being an extract of a, b + 7 , j ) and has exactly n - 1 other A-lines which are either not (1)-ending or are of length less than k. Now, f,+, is a refinement of fl,and if f;+l is the A-table resulting from replacing an(l)', l ' ( b ) + 7 , j + r by ~ " ( 1 ) ' . l ' ( b ) + ~ ~ ( lj + ) r~ -, k then f;+,will be a semi-refinement of f,,,and f,.Note that f;+Ihas only n - 1 A-lines which are either not (1)-ending or are of length less than k. Proceeding from f;+,we can continue to refine t h e table in the same way until we obtain a semi-refinement f f of f l in which all A-lines are (1)-ending and of length 3 k. (Of course, if there are n o A-lines in f,which are not (1)-ending or are of length less than k, we Now f';can be obtained from f T by replacing each take f T = f,.) regular line a + 7 of f f by t h e 2' extracts consisting of all the lines a n y + T ~ such V that v E "2 is of length k. T h e n f';is a semirefinement of f ? and so of f I, while every line of ft;is of length 3 k and is (1)-ending if it is an A-line. Finally fl is obtained from f; by replacing each regular line ar--,7'by the regular lines (where t is t h e length of 7') a r n ( l ) + ~ ' " ( l ) ,a'"(0)"""(1)-+ ~~"(0)'"'"(1) (for each m r such that 0 < m' < m + t ) , and arn(0)"'+'-+ ~'"(0)'"+~; f will be a semirefinement of f: (and so of f,)with all t h e required properties.
Theorem 3.26. The set T a ( A ) is (with functional composition as the operation) a subsemigroup of Pa(K,,).
Proof. Suppose that F and G are elements of T a ( A) , with f l and f2 being A-tables which define F and G respectively. Let k be t h e least integer greater than the lengths of all left sequences of f2,and let m be the least integer greater than every element belonging to t h e range of some b E Fi(A) occurring in an A-line of f2.By Lemma 3.25 there will be an A-table f ;which is a semi-refinement of f l and has only lines, with right sequences of length a k, that are (1)-ending or else are regular lines with a right sequence equal to p ,(0)'"+' for some t E J and p E ?2 with length s t. By Proposition 3.20, r I also defines F. Now we will define an A-table f3 (as t h e "composition" of the two A-tables f I and f2)as follows: we replace each line a 7 or a, b + T, n of f I by the composition of this line with an extract of a line of fzto obtain a line with a as its left sequence. Given a line of f I, say with left sequence a and right sequence T, it follows that, as . -+
Embeddings which preserw rhe word problem
423
the left sequences of f2are a partition set for K,,, and all of them are of length < k, while T is of length 3 k, that T is an extension of some left sequence T ' of r2. Case 1: T ' is the left sequence of a regular line ~ ' + pof f2.Then let p E "2 be such that T = ~ ' ~and p ,replace the line of f I which has u as its left sequence and T as its right sequence with t h e composition (according to (1) or (3) of Definition 3.18) of this line and t h e extract ~'"+ p p " p of the line ~ ' + p . Case 2: T' is t h e left sequence of an A -line T', c + p , n' of f2.By Lemma 3.24 there will exist exactly one line with T as its left sequence which is an extract of T ' , c + p, n'. Let the line of f I which has u as its left sequence and T as its right sequence be replaced by t h e composition (according to (2) or (4) of Definition 3.18) of this line of f I with the extract of T',C + p,n' which has T as its left sequence. Now, we obtain in this way an A-table r3with t h e same number of lines as f I and the same set of left sequences. If H is t h e function defined by f3 then t h e restriction of H to an interval [u], where u is a left sequence of f3 and so of f I, is evidently a line function equal by Proposition 3.19 to t h e composition of t h e restriction of F to [u]and a function which is, by Corollary 3.15, a restriction of G (because it is the line function corresponding to a line which is an extract of a line of f2).Thus t h e restriction of H to an interval [u], u being a left sequence of f3,is a function of which FG is an extension. It follows that H itself has FG as an extension; as they have t h e same domain, H is equal to FG, and thus the theorem holds.
Theorem 3.27. If A satisfies condition C3, then an A-table f will define the idenity element of Pa(K,) (that is, the identity permutation of K,,) if and only if every right sequence of a regular line belonging to f is the same as its left sequence and, for every A-line a,b -+ T,n of f, u = ~"(1)"and the range of b consists of only one element, the identity of A.
Proof. The sufficiency of t h e requirement concerning f is clear from the definitions of t h e line functions of regular nnes and A-lines. The necessity is clear enough, as regards regular lines, since a regular line U - + T corresponds to a line function which maps [a] onto [TI. For an A-line u,b + T, n of an A-table defining the identity element of Pa(&) we must have u = ~"(1)"as, for m E J sufficiently large, [a"(O)'"] will be mapped onto [ ~ ~ ( 1 ) " ~ ( 0by ) "t'h]e line function corresponding to u,b + T,n. Then, in view of Definition 3.4, this line function will be the identity mapping on [u]only if every g E A belonging to t h e range of b is equal to the identity of A, as if m E J
424
R.J. nompson
belonging to the domain of b is such that g = b ( m ) is not equal to the identity of A then by C3 there will exist k 3 m such that g ( k ) # k and thus [~"(1)"-""(0)"(1)] will be mapped onto [.n( 1y k ) + n - m n(0)p(k)n(l)], which must be distinct as g ( k ) # k. From the next result t h e significance of Theorem 3.26 and Theorem 3.27 becomes apparent.
Theorem 3.28. The subgroup of Pa (KII)generated by A * U Ft ( K O )is a subsemigroup of T a ( A ) . Proof. As every inverse of an element of A * U Ft(K,,) belongs to this set, the subgroup of Pa(&) generated by A * U Ft(K,,) is t h e same as the subsemigroup of Pa(Kll) generated by A * U Ft (KJ,and thus, because of Proposition 3.8 and Proposition 3.10, is by Theorem 3.26 a subsemigroup of T a( A ) . Theorem 3.29. Suppose that A satisfies conditions C1, C2, and C3. Then the word problem for the subgroup of Pa(K,,) generared by A * U Ft (I&) is solvable (for a presentation appropriately related to the presentation for which A satisfies C1 and C2). Proof. Given that A satisfies condition C3, Theorem 3.27 will apply, and we will be able to effectively determine whether or not an element of the subgroup of Pa(&) generated by A * U Ft (&) which is given in terms of an A-table is equal to the identity element (of Pa(&) and so of the subgroup) using condition C1. In view of Proposition 3.10 and Proposition 3.8 we can effectively find A-tables which correspond to given elements of A * U Ft ( K J ; as this set is closed under inverses, and t h e proof of Theorem 3.26 shows that there is an effective procedure (relative to a solution of the word problem for A, and an effective method of determining for j , k E J and g E A whether or not g o ) = k ) which yields an A-table corresponding to the composition of two elements of T a ( A ) given by A-tables, we have a procedure that effectively determines whether or not an element of the subgroup of Pa(&) generated by A * U Ft ( K , , )is equal to t h e identity element, so that the word problem is solvable.
Theorem 3.30. Suppose that A satisfies condition C4. Then the embedding of A * into the subroup of Pa(&) generated by A * U Ft ( K O ) is a Frattinis embedding.
Embeddings which preserve the word problem
425
Proof. Let B be the subgroup of Pa(Ko) generated by A * U Ft ( K O ) , and let M be the set of all p E K,, with parity equal to 0. Suppose g, h E A and f E B are such that f - ' g * f = h *. If g = h, obviously g * and h * are conjugate in A *. If g # h then both g * and h * are distinct from the identity permutation of K,, so that they both map M onto itself but have no fixed points on this set (by C4 and t h e definition of g * and h *) and are pointwise fixed on t h e rest of K,,. Thus Lemma 0.10 (ii) applies and f must map M onto M. Now, by Theorem 3.28, f is defined by some A-table f(,. Then there is some semi-refinement rl of f,,(also defining f ) which has at least two lines, thus possessing a unique line with (for some r E J ) a left sequence a = (l)', this unique line being (1)-ending. (After first obtaining, if necessary, a refinement of f,,with more than one line, t h e unique line with a left sequence of the form (1)" for some s E J is either replaced by its direct extracts if it is a regular line, or if it is an A-line an appeal is first made to Lemma 3.25.) Whether the line with a as its left sequence is a regular line or an A-line, the corresponding line function will map [ a ]into [ T I , where T is the right sequence of the line. If T is not (1)' for some s E J , the parity of every element of [TI will be the same (as T is (1)-ending), so that f either maps [ a ]= [(l)']into M or maps it into the complement of M in K,,. But this is impossible, as [ ( l ) ' ]r l M # 0 and [ ( l ) ' ] is not contained in M, so that the image of [ a ]under f cannot lie wholly within either M or its complement. Thus T is (1)' for some s E J. If T is the right sequence of a regular line, then s = r, as we can choose (Y E K,, such that (Y has parity equal t,o r, so that (+"a has parity 0 and belongs to M and thus the value of f for una,which is T ~ ( Ywith parity r - s, also belongs to M. Further, if T is the right sequence of an A-line a, b + T,n then s = r - n, as we can choose a E K,, such that a has parity equal to r, so that aria has parity 0 and belongs to M , and thus the value of f for v n a , which has parity equal to r - (s + n ) by Definition 3.4, belongs to M. Now we remark that there is some u E A such that for every (Y E K,,, and for every k 3 r , f ( ( l)k"(o)kn(1)"a)= ( l)u(k)n(0)u(k)n( 1)"CY. If the line with a as its left sequence is a regular line (i.e., if it is ( l ) ' + ( l ) ' ) we can take u to t h e identity element of A (so that u(k)= k for all k E J), and if v is the left sequence of an A-line a, b + T , n we can take u (in view of Definition 3.4) to be the identity element of A when r does not belong to the domain of b, and b ( r ) when r
belongs to the domain of b. Then, u - l g u = h. For otherwise u-lguh-' will not be the identity permutation and so, by C4 (in fact, by C3
426
R.J. 'Ihompson
alone), as u, g, and h are all permutations there will be some m E J such that ( u - l g u h - ' ) ( m ) # m while for f = u - ' ( m ) (thus ( g u ) ( t ) # ( u h ) ( t ) )we have f 3 r and g ( t ) 3 r. For such an m and corresponding t we would then have, for every cr E KO,
( f - ' g *f)((l)"~""(o)U't'n(l)na) = ( g *f)((l)'"(O>l"(l>"a) = f(( I ) g ( t ) n ( 0 ) s ( r ) n ( 1)"a)
0u ( m ) l n a # 1h(u(t))Oh(u(t))l"cr -
lu(g(r))
=
h *(( l)u(t)n(0)u(r)n( l)"a),
so that f-'g *f# h *, contrary to hypothesis. Thus if g * is conjugate to h * in B, g is conjugate to h in A and so, as A = A *, g * is conjugate to h * in A * . The results that are really needed for 04 have now been obtained, but the question naturally arises as to whether or not T a ( A ) is actually a group - in particular, the subgroup of Pa(&) generated by A * U Ft(K,,). It actually is the case that T a ( A ) is a subgroup of Pa(K,,), the one generated by A * U Ft(K,,). This makes both T a ( A ) and A * appear less arbitrarily chosen, t h e exact form of t h e definitions not being so important.
Theorem 3.31. The subgroup of Pa(Ko) generared by A * U Ft ( K , , )is equal to T a ( A ) .
Proof. Let us say that the A-multiplicity of a regular line is 0, the A-multiplicity of an A-line a,b + T,n is equal to t h e cardinality of the domain of b, and the A-multiplicity of an A-table is t h e sum of t h e A-multiplicities of its lines. Let B be t h e subgroup of Pa(K,,) generated by A * U Ft (K,,); t h e theorem follows (in view of Theorem 3.28) once it is shown that. every element of T a ( A ) belongs to B. Note that if p E $2 is non-empty there will be a finite table f,,with the line p + (1); if-FIE Ft (K,,)is defined by this table then, for g E A, G = F1g* E B will be defined by t h e A -table obtained from f,,by replacing t h e line p + ( l ) by the line p , { ( I , g ) } + ( l ) , O = (p+(l)).((l),{(I,g)}~(l),o). Now suppose F E Pa(KI,) is defined by an A -table f I with A multiplicity d 1. Then, if f,has A -multiplicity 0, F E Ft (KO) C B, by Proposition 3.9; otherwise f I will have t h e single A -line: (1)
a,{ ( m , g
+ 7,
n.
Embeddings which preserve the word problem
427
If necessary, f l can be refined to ensure that T E "2 is a non-empty sequence. Case 1: n 3 m - 1. As { ( m , g ) } is ( m - 1)-bolstered, if n is ( m - 1)+ j for j E J then line (1) can be replaced by the line a , { ( m , g ) } + ~ ~ ( l y , m - 1 which will define the same line function. Thus we can assume n = m - 1. As noted above there will be G ' E B defined by an A-table f2whose only A-line is (2)
7,((1, g-%+
(I), 0.
Since n = m - 1, the composition of lines (1) and (2) is, by (4) of Definition 3.18: (3)
a , { ( m 9 g g - ' ) } - - + ( 1 ) , m- 1.
By refining f,we can get an A-table f I of which (1) is still the only A-line while every other line has a right sequence equal to the left sequence of some extract of a line of f2,this extract being a regular line in view of Proposition 3.23 and the fact that (1) is a line of f I while F E Pa(&). Composing the A -tables f I and f2we obtain an A-table f3which defines FG' and has (3) as its only A-line. As the same line function corresponds to the regular line a+(l)"' as to (3) (because gg-' is the identity permutation on J), by replacing (3) with this regular line in f, we obtain an A-table f; also defining FG', but with only regular lines. So FG'E B, and F = (FG')(G')-'€ B as G' E B, as required. Case 2: n < m - 1. Let f4 be a finite table with one of its lines being T - , ~ ~ ( l ) ( " - ' ) - Then, ". where G"E Ft(K,,) corresponds to f4,FG" will be defined by an A-table whose only A-line is cr,{(m,g)}-,~n(l)(m-')-n, n (the composition of ( 1 ) and T + ~ ~ ( l ) ( " ' - ' ) - " according to (3) of Definition 3.18), which defines the same line function as u,{ ( m , g ) } - * ~ ,m - 1. So there is an A-table f, which has this line as its only A-line and which defines FG"€ Pa(K,,); by case 1 we have FG"€ B so that, as G"EB, F = (FG")(G")-'E B. Thus every F E Pa(&) defined by an A-table with A-multiplicity C 1 belongs to B. Now let us assume that p 2 1 and every element of Pa(&) defined by an A-table of A-multiplicity S p belongs to B. Let F E Pa(&) be defined by an A-table f6 of A-multiplicity p + 1. Then f6 will have an A-line; let (4)
a,b + 7, n
be such a line, and choose m to belong to the domain of b. As g = b ( m ) is a permutation of $ there will be some w E J such that
428
R.J. ntompson
for k 3 w, g - ' ( k ) 3 m. Then X = { j E J : j < rn + w and g o ' ) 3 rn + w } = { j E J : rn j < rn + w and g ( j ) z rn + w } and the set Y = { j E J :j 3 rn + w and g o ' ) < rn + w } have the same finite cardinality and there is a one-one mapping 6 of X onto Y.For j E X, let IiG) be u"(1)'-""(0)'"( 1) + u"(l)eo)-mn(0)eo)n( l),
which is a regular line. Then there is a partial table r* whose lines consist of the lines lie) for j E X, together with t h e A-line (5)
u"(l)", {(rn
+ w, g - ' ) } - +
u,w.
Let 2 be the set of all left sequences of r6 except a, let 2 * be the left sequence set of r*,and let 2" = 2 U 2 *; as r6 is an A -table and the left sequence of (5) and of every line li(j) for j E X is an extension of a, Proposition 0.6 and Proposition 0.5 apply and there is a finite partition set which is an extension of 2(,.Note that we can arrange that the added sequences of this partition set 2' are all left sequences of some extract of line (4) (which will be, by Proposition 3.23, a regular line, as it cannot have a"(1)' for a left sequence for any j E w, since ~ " ( 1 ) " belongs to t h e parition set Z,). Let r7be the A-table with left partition set 2' which i s a n extension of 2 * by regular lines with the same left and right sequences. The A-multiplicity of r7 is 1, with (5) the sole A-line; thus the function H defined by r7 belongs to B as it belongs to Pa(&) (as inspection shows that the partial table 2 * determines a one-one function whose range equals its domain). 6 yields the A-table Tx:First, each line of r 7 Composing r7 with r whose left sequence belongs to 2 is replaced by the line of r6 with the same left sequence. Second, every regular line of r7 with a right sequence u such that u C u' is replaced by the composition of this line, according to (1) of Definition 3.18, with the unique regular line which is an extract of line (4) and has u' as its left sequence (this extract exists as, for j E X , the line liu) has a (1)-ending right sequence, and the other regular lines have right sequences equal to their left sequences, chosen to be left sequences of extracts). Third, letting b' be the function with the same domain as l " ( b ) , and agreeing with it except that b'(m + w ) = g-lg, the line (5) of r7 is replaced by ~ " ( l ) ' " ,b'+ T , n + w. (6) The A-table r, defines the function HF and has exactly the same A -lines as r6,except that the A -line (5) is replaced by the A -line (6). If m + w is t h e only element belonging to the domain of b', (6) can be replaced in rxby the regular line u"(1)" -+ ~ " ( l ) "and + ~ otherwise , b' can be replaced in (6) by the b " E F i ( A ) which is like b' except for not
Embeddings which preserve the word problem
429
having m + w as an element of its domain. In this way we obtain an A -table r9,also defining HF, which has A -multiplicity less than that of r, and r6.So HF E B and F = ( H - ’ ) ( H F )E B as H E B. Hence every element of Pa(&) defined by an A-table of A-multiplicity p + 1 belongs to B. Thus, by induction, every element of T a ( A ) belongs to B, and so B is equal to Ta(A). 14. Conclusions
In this section, as in $3, J will be t h e set of all positive integers. It was shown by Rabin (in [8]) that if a finitely generated group has a word problem which is solvable with respect to one presentation o n a finite set of generators then the word problem is solvable with respect to every presentation o n a finite set of generators. Thus we can say that the word problem for a finitely generated group is solvable or unsolvable without making reference to a specific presentation (on a finite set of generators). As only countable groups will be discussed in this section, it will be tacitly assumed that all the presentations referred to are on a finite or countably infinite set of generators. In the case where a presentation is given on a countably infinite set of gernerators of a group G there will be a function A whose domain is J and whose range is a set of generators of G, with A 0‘) ,being, for j E J, the element of G correspodning to the j-th element of an enumeration of t h e generators of the presentation. Note that A need not be a one-one function. From the proof of Proposition 4.1 it will be apparent that the word problem for t h e presentation depends in fact only on G and A. The ordering of the generators of t h e presentation (when there are more than finitely many of these) is important, however, so it will be assumed that with every presentation on a countably infinite set of generators there is also “given” an ordering, of the. same order type as J, of the generators; when the presentation has only finitely many generators we will assume that a simple ordering of these is also provided, but here we rely on the fact that the specific finite simple ordering assigned does not affect the Turing degree of the word problem (as this is explicated in the proof of Proposition 4.1). Proposition 4.1. Given a presentation of a countable group C there exists an isomorphic group C‘ which is a group of permutations of J such that only the identity element has a fixed point. Further, C’ can be construed as given by the same presentation as C in such a way that for j , k E J
430
R.J. 7%ompson
and a word w on the generators of the presentation, with g E C’ corresponding to w, it can be effectively determined whether or not g ( j ) = k relative to a solution of the word problem for the given presentation (of C and C’).
Proof. It will be convenient to make the obvious adjustments enabling us to speak of recursive functions on J instead of w. Let F be a free group with the same number of independent elements required to generate it as t h e number of generators in the presentation of the countable group C.The words on the generators of F (which will include the empty word) form a countably infinite set (even if F is finitely generated), so there is a one-one mapping 6 of this set onto J. The words of length one on F correspond either to the generators or their inverses; let K ,be the subset of J which is the image under 6 of the set of words corresponding in this way to the generators. Now, 6 can easily be chosen in such a way that K , is a recursive subset of J, while 4 and 8 are recursive functions, where 4 and 8 are the unique binary and unary functions from J into J such that for words u and v on the generators of F we have + ( t ( u ) , 6 ( v ) )= [ ( u u ) and 8 ( [ ( u ) )= [ ( u ’ ) where uv is the concatenation of the words u and v and u ’ is the inverse of t h e word u (obtained by reversing the order of u and then replacing generators by their inverses and vice-versa - the empty word being its own inverse, of course). The generators of the presentation of C can now be identified with the elements of K,,with the first generator of t h e presentation identified with the smallest element of K,,the second generator identified with the second-smallest element of K,,etc. Words on the generators of the presentation are thus implicitly identified with the image under 6 of corresponding words on the generators of F. Let K 2 be the set of all elements j E J such that the word on the generators of the preseitation with which j is identified corresponds to the identity element of C. Note that K 2 depends only on the choice of generators of C and the ordering given to the generators. We can take the Turing degree of the word problem for the presentation to be the Turing degree of K,.A recursive permutation of the ordering given the generators will not result in a change in the Turing degree of K2;in particular, the Turing degree will be independent of the ordering if the presentation has only finitely many generators. Using K2,the group C’ of permutations of J can be defined. First of all, let us define t h e equivalence relation = on J by setting j = k for j , k E J just in case the two words (on the generators of the presentation) with which they are identified correspond to the same
Embeddings which preserw the word problem
43 1
element of C. Then we have + ( e ( j ) , k ) € K Z if and only if j = k . As 9 and 8 are recursive functions, we can determine effectively (relative to KZ)whether or not j = k for j , k E J. Note that for every element of C there are infinitely many distinct words on t h e generators of the presentation, and so elements of J, which correspond to it; thus we can now effectively determine (relative to K 2 ) the smallest, second-smallest, third-smallest . . . members of J corresponding to a given element of C (given by specifying a word on the generators of the presentation and so an element of J - to which it corresponds). We now define the function r by letting r ( j ) be, for j E J, the unique n E J such that j is the nth smallest element of the equivalence class it determines. For g E C we let 6 ( g ) E C’ be t h e function with domain J such that, for j E J, ( S ( g ) ) ( j ) is the unique element i E J such that r ( i ) = r ( j ) and i belongs to the equivalence class determined by + ( j , k ) , where k is the smallest element of J corresponding to g. Note that is a congruence relation relative to = , while for i, j , k E J we have + ( + ( i , j ) , k ) = + ( i , +(j,k ) ) . For g, h E C we have for every j E J (where k l , k z , and k 3 are respectively the smallest elements of J corresponding to gh, g , and h ) ( S ( g h ) ) ( j ) - + ( j , k l ) - + ( j , + ( k z , k 3 ) ) (as k l = + ( k z , k 3 ) ) = +(+kkz), k 3 ) = +((S(g))W7 k 3 ) - ( 6 ( h ) ) ( ( 6 ( g ) N hwhile r((S(gh))O’))= T o ’ ) = r ( ( 6 ( g ) N )) = r ( ( ~ ( h ) ) ( ( ~ ( g ) K Thus, i ) ) ) . ( 6 k h ) ) U )= ( S ( h ) ) ( ( a ( g ) ) ( j )for ) every j € J and so C‘ is a homomorphic image of C. Note that S ( g ) only has a fixed point if g is the identity element of C, as for j E 7 (with k E J being t h e smallest element of J correspomding to g) (6(g))(j) = j if and only if j = +(j, k ) , which holds if and only if k E Kz. Thus C’ is isomorphic to C and can be given by the same presentation - and as it can be effectively determined, relative to K z , whether or not for i, j , k E J we have both r ( i ) = r ( j ) and i a member of the equivalence class determined by +(j, k ) , the same applies to t h e determination of whether or not g ( j ) = i for g E C’.
+
Lemma 4.2. Given a presentation Q of a countable group G there exists a group V with a presentation Q’ such that V is equal to its commutator group, G is a subgroup of V, and the embedding of G in V is a Frattini embedding, while Q’ is obtained from Q by adjoining 2 generators and further relations, with the word problems for them Turing equivalent. Proof. Given Q and G, we see by Proposition 4.1 that G is isomorphic to a group Y of permutations of J such that only t h e identity element of Y has any fixed points; by Theorem 2.6 and Definition 2.1 we have Y = t where P is a subgroup of Sp(Y,”2)
432
R.J. 'Thompson
whose embedding into Sp(Y,"'2) is a Frattini embedding. Further, it appears from the proof of Proposition 2.7 that the word problems for the presentation Q of Y (and G and Y) and the presentation Q' of Sp( Y, "'2) obtained by adjoining to Q two generators corresponding to of Ft(Y;"'2) (these will the elements ( r : r l D ) " and (rorlrTTT)" generate Ft ( Y ; "2), in view of Proposition 1.5 and t h e fact that Ft( Y; "'2) = Ft("'2)) and all the relations valid in Sp( Y, "'2) will have the same Turing degree. We now just choose V so that there is an isomorphism of Sp(Y,"2) onto V whose restriction to Y is an isomorphism of Y onto G. AsV=Sp(Y,"'2), V is equal to its commutator group by Theorem 2.3.
Lemma 4.3. Given a presentation S of a countable group V equal to its commutator group there exists a simple group W with a presentation S' such that V is a subgroup of W, the embedding of V in W is a Frattini embedding, and S' is obtained from S by adjoining 2 generators and further relations, with the word problems for S and S' being Turing equivalent. Proof. Given S and V, we see by Proposition 4.1 that V is isomorphic to a group A of permutations of J satisfying condition C4 of $3 (and so C3) and conditions C1 and C2 relative to a solution of the word problem for the presentation S of V. Let A * be defined as in 43, and . let B be the subgroup of Pa(Ko) generated by A * U Ft ( K O )Then A = A * and the embedding of A * into B is a Frattini embedding by Theorem 3.30. Further, it appears from the proof of Theorem 3.29 that the word problems for the presentation S of A (and V and A * ) and the presentation S' of B obtained by adjoining to S two generators corresponding to the elements r : r l D and rorlrTTTof Ft(Ko) (these will generate Ft ( K O ) ,in view of Proposition 1.5) and all the relations valid in B have the same Turing degree. We choose W so that there is an isomorphism of B onto W whose restriction to A * ,is an isomorphism of A * onto V. As W = B, W is simple by Corollary 1.11, taking H there to be A *, equal to its commutator group as A * = V. Theorem 4.4. Given a presentation Q, of a countable group G there exists a simple group H with a presentation Qz such that G is a subgroup of H , the embedding of G in H is a Frattini embedding, and Q2 is obtained from Q1 by adjoining 4 generators and further relations, with the word problems for Q1 and Q2 being Turing equivalent. Proof. Given Q1 and G we obtain Q' and V by Lemma 4.2, adjoining
Embeddings which preserve the word problem
433
two generators; as V is equal to its commutator group we can apply Lemma 4.3 (with Q‘ 9s S) to obtain Q2 (that is, S’) and the simple group H (that is, W), adjoining two more generators. The word problem for Q, is Turing equivalent to that for Q’, which is Turing equivalent to that for Qz, so the word problems for Q, and Q2 are Turing equivalent. As t h e embedding of t h e subgroup G of V into V is a Frattini embedding, and t h e embedding of the subgroup V of H into H is also a Frattini embedding, G is a subgroup of H and its embedding into H is a Frattini embedding.
Lemma 4.5. Given a finitely generated simple group H , the word problem for H will be solvable if and only if there is a presentation of H on a finite set of generators with a recursively enumerable set of defining relations. Proof. If the word problem for H is solvable, there will evidently be a recursively enumerable set of defining relations for H: after choosing a finite set of generators for H we can take the set of defining relations to consist of those words on the generators which correspond to t h e identity element of H ; this set is recursive. But o n the other hand, if there is a presentation of H on a finite set of generators with a recursively enumerable set of defining relations there must be a recursive enumeration of those words on the generators of the presentation which correspond to the identity element of H. Let u s choose a fixed word wo on the generators of the presentation which does not correspond t o the identity element of H. Then (because the defining relations of the presentation can be recursively enumerated) there is a recursive enumeration of those words on t h e generators of the presentation such that their (individual) adjunction to the defining relations determines a group in which wo corresponds to the identity element. As H is simple, every word on the generators of the presentation will occur in one of these two enumerations, but not both; thus t h e word problem for H is solvable.
If we start with a countable group which is not finitely generated, it is necessary to supplement Theorem 4.4 in order to obtain an embedding of the group in a finitely generated simple group. Using the Higman-Neumann-Neumann embedding technique we can embed a countable group in a finitely generated group (Rotman’s book [9] serves as a reference for t h e particular proof used here for Proposition 4.6).
434
R.J. 7lompson
Proposition 4.6. Given a presentation Q of a countable group C on an infinite set of generators, there is a group C' generated by 3 elements such that C is a subgroup of C', the embedding of C in C' is a Frattini embedding, and the word problem for C' is Turing equivalent to the word problem for the presentation Q.
Proof. Let the generators of the presentation Q be, in order, g l , g 2 , .. . ,g,, . . . ; call the set of these generators f. Let F be the free group on the two (independent) generators a and b, and let G I be the free product C * F of C and F. A subgroup U of F with infinitely many free generators can be chosen so that there is a recursive set of words on a and b corresponding to t h e (independent) generators of U :let these words be u I ,u 2 , .. . , u., . . . . For i E J let wi be the word giui. Let GI and G',' be, respectively, t h e factor groups of G I obtained by putting all t h e elements of C and F, respectively, equal to t h e identity. Let W be the subgroup of G I generated by the wi, i E J ; then W is isomorphic to U via the mapping q defined by setting q ( w i ) = ui for i E J. (For in GI the images of t h e generators of W are independent free generators of t h e image of U.) As F is a free group and GI is a free product, G I has a presentation QI whose generators are, in order, a, b, g l ,g, . . . , g n , . . . and whose defining relations are exactly the defining relations of Q. We now let C' be t h e group given by the presentation Q' obtained by adjoining the generator t and the defining relations t-'wit = ui for i E J to the presentation Q1.Note that C' is generated by { a , b, t } , for in C' we have t-'giuit = ui and thus gi = tu,t-lu;' for i E J. As q is an isomorphism we can apply Britton's Lemma to conclude that G I is a subgroup of C' (that is, words on r U { a ,b } corresponding to distinct elements of GI remain so for C') and that a word w on f U { t , a, b } in which t or t - I occurs can only be equal to the identity element of C' if it contains (where u and u' are words on f U { a ,b } ) either a subword t-lut or t u f t - ' , where u is equal in G I to an element of W, and u' to an element of U (so the subword is equal to an element of U or W in C'). Now, given words p and q on f not corresponding to the identity element of C, if p and q are conjugate in C' there will be a word s with a minimum number of occurrences of t and t-I such that s-lps = q in C', and thus s-lpsq-' is t h e identity element of C'. By Britton's Lemma there will be a subword of s-lpsq-' of the form t - ' u t or t u r f - ' (with u and u' as above) if s is not a word on r U { a , b } . But the subword cannot be a subword of s-' or s (as it could be replaced in s-I or s by a word on f U {a,b } equal to it in C', and s would not be minimal in terms of number of occurrences
Embeddings which preserve rhe word problem
435
of t and t - ' ) , nor can it contain p as a subword and thus be conjugate to p (because an element of U or W distinct from the identity will have, in GI, an image distinct from the identity, no element of C except the identity element can be conjugate in G I to an element of V or W). As these are the only possibilities for a subword, s must be a word on r U {a, b } so that elements of C conjugate in C' are conjugate in G I . And if two elements of C are conjugate in G I they are conjugate in C, since if they were conjugate in G I they would be conjugate in GY. Thus the embedding of C in C' is a Frattini embedding. By using the normal form for free products (and the fact that F is a free group) we observe that a freely reduced non-empty word on U {a,b } will be equal to the identity in G I only if there is some nonempty subword on r which is equal to the identity in C. This allows us to reduce the word problem for Q, t o that for Q, so their Turing degrees are the same. Similarly, Britton's Lemma allows us, relative to a solution of the word problem for Q1,t o either effectively determine that a word on r U {r, a, b } is not equal to any word on r U {a, b } or else obtain (by replacing subwords t-lvt or tv't-' by words on r U {a,b } ) a word on r U { a ,b } t o which it is equal in €' and thus in G I (because we can effectively determine of a word on r U { a ,b } what element of V,or W, it could be equal to in GI by examining the factor group GI). So the word problems for Q, and Q' have the same Turing degree. Finally, C' has a presentation Q" on the generators t, a, and b whose defining relations consist of the defining relations of Q with each gi,for i E J, replaced by the word tuit-Iu;I. That Q" is a presentation of C' can be seen from the fact that by defining gi as tuit-'u;' for i E J we obtain the relations t - ' w i t = ui for i E J as well Because we can effectively eliminate as the relations of Q (and 0,). the generators belonging t o r from words o n r U {r, a, b } by using the equality gi = tuit-Iu;I holding in C', it follows that the word problems for Q' and Q " have the same Turing degree. Thus, so d o the word as Q" is a presentation of C f on a finite problems for Q and 0"; number of generators, the word problem for C' is Turing equivalent to the word problem for the presentation Q.
r
The construction in the proof of Proposition 4.6 could be continued so as to reduce the number of generators to two, but (for finitely generated groups) it is more convenient here to rely on a result which can, for example, be extracted directly from the main theorem of [7]:
Proposition 4.7. Every countable group G has a Frattini embedding into
436
R.J. lhompson
a two-generator group H, with the Turing degree of the word problem for H being the same as that for G, if G is finitely generated. Proposition 4.8. Given a presentation Q of a countable group C there is a group C' generated by 2 elements such that C is a subgroup of C', the embedding of C into C' is a Frattini embedding, and the word problem for C' is Turing equivalent to the word problem for the presentation Q. Proof. If Q is a presentation of C on finitely many generators, we can use Proposition 4.7. If Q is a presentation of c on infinitely many generators, we can put Proposition 4.6 and Proposition 4.7 together, much as was done with Lemma 4.2 and Lemma 4.3 to obtain Theorem 4.4. The main theorem of Higman's paper [5] gives a characterization of finitely generated groups with recursively enumerable sets of defining relations. We state it in the following form: Proposition 4.9. A finitely generated group can be embedded in a finitely presented group if and only if it has a presentation on a finite set of generators with a recursively enumerable set of defining relations. Applying Proposition 4.9 to Lemma 4.5 yields: Proposition 4.10. A finitely generated simple group has a solvable word problem if and only if tt can be embedded in a finitely presented group. From Theorem 4.4 it follows that every finitely generated group can be embedded in a finitely generated simple group (by adjoining at most four extra generators) as was proved originally by Philip Hall; Theorem 4.11 below shows that, as a crude estimate (the best result will be given later), there will always be a Frattini embedding of a countable group into a simple group generated by no more than six elements. (In [4], Hall shows that every countable group can be embedded in a simple group with three generators; A.P. Goryushkin reduced the number to two in [3], and P.E. Schupp in [ I l l showed that these two generators could be chosen to be elements of order two and order three.) Theorem 4.11. Given a presentation 0 of a countable group C there is a Frattini embedding of C into a simple group H which is generated by 6 elements and has a word problem with the same Turing degree as the
Embeddings which preserw the word problem
437
word problem for the presentation Q. Further, if the word problem for Q is solvable, there will exist a finitely presented group K such that H is a subgroup of K and K, like H, has a solvable word problem. Proof. In Clapham’s paper [2] it is shown that when a finitely generated group possesses a presentation o n a finite set of generators with a recursively enumerable set of defining relations it can be embedded in a finitely presented group in a way that preserves the Turing degree. So, in view of Lemma 4.5, the last part of t h e theorem follows from the rest. As for the rest, we put Proposition 4.8 and Theorem 4.4 together (in the same way that Lemma 4.2 and Lemma 4.3 were put together to obtain Theorem 4.4). Corollary 4.12. For any Turing degree of unsolvability, not necessarily recursively enumerable, there exists a simple group, generated by 6 elements, whose word problem has this degree. Proof. For any Turing degree of unsolvability there is a presentation Q of a countable group C which has this degree. Then by Theorem 4.11 the appropriate simple group H exists. At this point it is easy to show that all t h e results obtained by Boone and Higman in [ l ] concerning groups still remain valid when “simple group” is replaced’by “finitely generated simple group.” For instance, when this replacement is made in Theorem I‘ and Theorem IV of [ l ] we obtain stronger results which are weakened versions of Theorem 4.11 (weakened by failing to specify the number of elements required to generate it and omitting the requirement that t h e embedding be Frattini). When the “Corollary to Theorem IV” is strengthened by this replacement it becomes a weaker version of Corollary 4.12. The same replacement applied to Theorem I11 of (11 yields t h e first sentence of t h e next theorem: Theorem 4.13. A necessary and sufficient condition that any recursively enumerable class r of finitely presented groups have a uniformly solvable word problem is that there exist a finitely generated simple group H, and a finitely presented,group K , such that for each member G of r, G is a subgroup of H, and H is a subgroup of K. Further, it can be required that the embedding of G in H be a Frattini embedding for each member G of r, while H is generated by 6 elements. Proof. As the condition is stronger than that of the condition of
438
R.J. 7Xompson
Theorem 111 of [l], the proof of sufficiency given there (of a constructive form, Theorem III’, in which t h e replacement of “simple” by “finitely generated and simple” also produces a weaker result) applies here also. As regards necessity, we note as in [ I ] that if has a uniformly solvable word problem then the obvious presentation of the free product GI- of the members of r has a solvable word problem, and Gr contains each member of r as a subgroup. Further, t h e embedding of each member G of r i n t o GI-is a Frattini embedding (we examine, as in t h e proof of Proposition 4.6, the factor groups obtained by putting all the elements of G I .not belonging to the given G equal to t h e identity). Applying Theorem 4.11 to Gr and its presentation, we obtain t h e required H.
r
Without the use of Proposition 4.8 or Proposition 4.6 we can obtain Theorem 4.14 and (using exactly t h e same proof) Theorem 4.15. (When “simple group” is replaced by “finitely generated simple group” in Theorem I of [ I ] the result is Theorem 4.15.) Theorem 4.14. A finitely generated group G has a solvable word problem if and only if there is a Frattini embedding of G into a finitely generated simple subgroup of a finitely presented group. Proof. When there is an embedding of G into a finitely generated simple subgroup H of a finitely presented group K, H will have a solvable word problem by Proposition 4.10, and as G is a finitely generated subgroup of H, so will G. Since G has a presentation on a finite set of generators, the other part follows from Theorem 4.4 and Proposition 4.10. Theorem 4.15. A necessary and sufficient condition that a finitely generated grodp G have a solvable word problem is that there exist a finitely generated simple group H, and a finitely presented group K , such that G is a subgroup of H, and H is a subgroup of K . An algebraic characterization of finitely generated groups with solvable word problems is provided by Theorem 4.15, just as it is provided by Theorem I af [l]. Using only the construction of §3 as a basis we could prove a version of Lemma 4.3 in which V is not required to be equal to its commutator group and W (instead of being required to be simple) is only required to be a group such that every proper factor group is Abelian. This would lead to a proof of the variant of Theorem 4.15 obtained by replacing “finitely generated
Embeddings which preserve rhe word problem
439
simple group H” by “finitely generated group H whose proper factor groups are Abelian.” Boone and Higman ask in [l] whether or not every finitely generated group with a solvable word problem is embeddable in a finitely presented simple group. If so, t h e finitely generated subgroups of finitely presented simple groups would consist precisely of those finitely generated goups which have a solvable word problem. The construction of 93 (unlike that of §2) does not preserve finite presentability (because applied to finitely presented groups with unsolvable word problems it yields a finitely generated simple group); something like the piecemeal approach (depending on a genetic characterization of the recursive function which solves t h e word problem for the group to be embedded) which Higman (in [ 5 ] ) and Clapham (in [ 2 ] )employ is probably needed for a positive answer. The question also arises as to whether or not a finitely generated simple group which is sufficiently well-behaved to have a solvable word problem must have a solvable conjugacy problem. The answer is given by Proposition 4.16, and follows from the fact that there exist finitely generated groups with solvable word problems and unsolvable conjugacy problems. Proposition 4.16. There exists a simple group generated by 6 elements which has a solvable word problem but an unsolvable conjugacy problem. Proof..Starting with a finitely generated group G with a solvable word problem but unsolvable conjugacy problem we obtain by Theorem 4.1 1 a Frattini embedding of G into a simple group H which is generated by 6 elements and has a solvable word problem. Since G is a finitely generated subgroup of H, H cannot have a solvable conjugacy problem, as then we would be able to determine effectively whether or not two elements of G were conjugate in H - and so in G. Although the proofs cannot be given here, there are some significant observations that can be made regarding the number of generators required for the simple groups refcrred to in the theorems above. First of all: Remark 4.17. For K E Fr and Y a finitely generated permutation group, Sp( Y,K) is generated by 2 elements of finite order.
The proof that Sp( Y,K ) can be generated by three elements relies on Proposition 2.4 (further reduction depends on a careful examination
440
R.J. Thompson
of the generators of Ft ( K ) ) . Note that Proposition 4.7 follows from Remark 4.17 (and the proof of Lemma 4.2), and Proposition 4.6 is only required when we deal with infinitely generated groups. So the construction of $2 can serve as a partial substitute for the Higman-Neumann-Neumann embedding technique. Consideration of Theorem 3.31 eventually leads to: Remark 4.18. When A is a group of permutations of J generated by k 3 2 elements, two of which are of finite order, then T a ( A ) can be generated by k elements, two of them of finite order. It is now a straightforward matter to show that, using Remark 4.17 and Remark 4.18, we can replace 6 by 2 in Theorem 4.11, Corollary 4.12, Theorem 4.13, and Proposition 4.16 (with these two generators both being elements of finite order); in addition, we can require that the simple group generated by two elements be such that the countable weak direct product of a collection of subgroups is always isomorphic to a subgroup. This last part depends on the following remark: Remark 4.19. When A is a group of permutations of J , the countable weak direct product of a collection of subgroups of T a ( A ) is always isomorphic to a subgroup of T a ( A ) . George Sacerdote’s paper [ 101 has been included in the references below because it uses Theorem 4.14 to give (among other characterizations) an algebraic characterization of groups with a solvable conjugacy problem. References [ I ] W.W. Boone and G. Higman. An algebraic characterization of groups with solvable word problem, J . Aust. Math. Soc. 18 (1974) 41-53. [2] C.R.J. Clapham. An embedding theorem for finitely generated groups. Proc. London Math. Soc. (3) 17 (1967) 419430. 1-71 A.P. Goryushkin, Imbedding of countable groups in 2-generated simple groups (in Russian), Mat. Zametki 16 (1974) 231-235. The English translation appears in Math. Notes 16 No. 2 (1975; for 1974) 725-727. 141 P. Hall. On the embedding of a group in a join of given groups. J. AUSI.Math. Soc. 17 (1974) 434495. [S] G. Higman, Subgroups of finitely presented groups, Proc. Roy. Soc. London, Ser. A. 262 (1961) 455475. 161 Ralph McKenzie and Richard J . Thompson. An elementary construction of unsolvable word problems in group theory. in: Word problems. ed. W.W. Boone.
Embeddings which preserve the word problem
441
F.B. Cannonito, R.C. Lyndon, (North-Holland Publishing Co., Amsterdam, 1973) pp. 457478. [7] C.F. Miller and P.E. Schupp, Embeddings into Hopfian groups, J. Algebra 17 (1971) 17 1- 176. [S] M.O. Rabin, Computable algebra, general theory and theory o f computable fields. Trans. Amer. Math. Soc. 95 (19M) 341-360. [Y] J.J. Rotman, The theory of groups, an introduction (second edition, Allyn and Bacon, Boston, 1973). [lo] George S. Sacerdote, The Boone-Higman theorem and the conjugacy problem. Preprint version; for a later version see J. Algebra 49 (1977) 212-221. [ I 1 1 P.E. Schupp, Embeddings i n t o simple groups. J. London Math. Soc. (2) 13 (1976) 0-94.
S.I. Adian, W.W. Boone, G. Higman, eds., Word Problems I1 @ North-Holland Publishing Company (1980) 4 4 3 4 8
LAWS IN FINITE ALGEBRAS M.R. VAUGHAN-LEE Mnthematical Institute, Oxford
In the first part of this talk I shall describe a method which has been used with great success by several authors to show that certain classes of finite algebras have finite bases for their laws. (Here I am using the word ‘algebra’ in the sense of Universal Algebra.) In t h e second part of the talk I shall give some examples of finite algebras which do not have finite bases for their laws. If A is a finite algebra then t h e variety generated by A, VarA, is a locally finite variety. Any locally finite variety is generated by its critical algebras, a d so the critical algebras in Va r A play an important role. (A finite algebra is critical if it is not contained in the variety generated by its proper subalgebras and proper homomorphic images.) A locally finite variety V is Cross if (a) V has a finite basis for its laws, (b) V has only finitely many critical algebras. Finite groups, rings, Lie rings, and finite algebras in varieties all of whose algebras have distributive congruence lattices, all generate Cross varieties. The basic method of proving that a finite algebra A genergtes a Cross variety is to find a finite set r of first order sentences with the following properties. (1) Every algebra in V a r A satisfies r. (2) Every finitely genetated algebra satisfying r is finite. (3) There-are only finitely many critical algebras satisfying r. Then because V a r A is axiomatized by its laws, it follows from (1) that each sentence in r is a consequence of t h e laws of A. Since r is finite this implies that the sentences in r are all consequences of a finite set V of laws of A. V determines a finitely based variety V of algebras containing VarA, and (2) and (3) imply that V is Cross. The fact that VarA is Cross follows from the result proved below that a subvariety of a Cross variety is Cross. In most applications the sentences in r are laws, but this is not necessary, and is not the case in the proof that a finite algebra generates a Cross variety if it lies in a variety all of whose algebras have distributive congruence lattices. The power of t h e
4
443
444
M.R. Vaughan-Lee
method is that it is not necessary to find an explicit basis for the laws of A, but only necessary to find a finite set of first order sentences which imply t h e existence of a Cross variety containing A. The following theorem shows the strength of property (b) of a Cross variety. Theorem. I f V is a locally finite variety then the following conditions are equivalent. (b) V has only finitely many critical algebras. (c) V has only finitely many subvarieties. (d) V satisfies the maximal and minimal conditions on subvarieties. The equivalence of (b) and (c) is well known (see L’vov [ l ] ) . The maximal condition o n subvarieties is equivalent to the condition that every subvariety (including V) is generated by a finite algebra. The minimal condition on subvarieties is equivalent to t h e condition that every subvariety is finitely based as a subvariety, that is every subvariety is determined by the laws of V together with a finite set of additional laws. Thus the result that subvarieties of Cross varieties are Cross follows from the fact that (b) implies the minimal condition on subvarieties. Proof of Theorem. Every subvariety of V is generated by its critical algebras, that is by some subset of the critical algebras of V. Thus (c) follows from (b). Trivially (c) implies (d). To show that (d) implies (b) we will assume that V is a locally finite variety which satisfies the minimal condition on subvarieties, but which has infinitely many critical algebras. We will then prove that V fails to satisfy t h e maximal condition o n subvarieties. First some notation: if A is an algebra then ( 0 s - l ) A is t h e set of proper factors (or sections) of A. A is critical if A @ V a r ( 0 S - l ) A . We will show that if V has a subvariety U such that V has infinitely many critical algebras A with Var(QS - l ) A 3 U, then V has a subvariety W with this property which properly contains U. By assumption V has infinitely many critical algebras, and if A is any algebra then V a r ( 0 S - l ) A contains t h e trivial variety determined by the law x = y. It follows by induction that V has a strictly ascending sequence of subvarieties. Now let U, W be subvarieties of V, and let U be a proper subvariety of W . Since the subvarieties of V satisfy the minimal condition, U is determined by t h e laws of V together with a finite set of additional laws. Let this additional set consist of n-variable laws. Then W must
Laws in finite algebras
445
fail to satisfy some n-variable law of U . So W contains an n-generator algebra B,say, which does not lie in U. Then V < U v V a r B s W. Now there are only finitely many n-generator algebras in V, and so this implies that there are only finitely many subvarieties of V which are minimal with respect to strictly containing U. Let these be W , ,W2,. .., W,,,. Now suppose that A is a critical algebra in V and that Var(QS - l )A 3 U, but that Var(QS - I)A$ W, for any i = 1,2,. . . , in. Then Var(QS - l )A = V , which implies that A is n generator. But V has o n l y finitely many n-generator algebras, and so if V has infinitely many critical algebras A such that Var(QS - l ) A 3 U then for some i, 1 s i < rn, V has infinitely many critical algebras A such that Var(QS - l ) A 3 W,. This completes the proof of the theorem. Note that the proof actually gives that (d) implies that V has only finitely many S-critical algebras. (A finite algebra is S-critical if i t is not contained in t h e variety generated by its proper subalgebras.) This theorem, and the success of the method I have just described, suggest that the two properties (a) and (b) of a Cross variety may be closely related. However the examples below show that this is not the case. The first is a finite algebra which generates a variety satisfying (a) but not (b), and the second is a finite algebra which generates a variety satisfying (b) but not (a). The examples are non-associative algebras over an arbitrary finite field F. (A non-associative algebra over F is a vector space over F with a bilinear product.) From the point of view of Universal Algebra, non-associative algebras are very well behaved. They are unital, that is every algebra contains the unique one element algebra { O } . They have regular congruences, that is two congruences of an algebra are equal if and only if the corresponding congruence classes of 0 are equal. Most important of all, they have permutable congruences and their congruence lattices are moduJar. Finite associative algebras do generate Cross varieties, but Polin (21 has given examples of finite n o n associative algebras which do not have finite bases for their laws. The first of the examples below is a finite algebra A such that Var A does not satisfy the minimum condition on subvarieties. Thus Polin's examples and this example show that the variety generated by a finite non-associative algebra need not be finitely based, and may have subvarieties which are not finitely based even us subvarieties. First we establish some notation. A left normed convention is used: thus abc denotes ( a b ) c . We define ab" inductively by a b " = a, ab" = (ab"-')b. Thus ab' = ( a b ) b and a b 3= ((ab)b)b. F is an arbitrary finite field, and we let IF1 = q.
M.R.Vaughan-Lre
446
The first example is an algebra A with basis a, b, c, d, ab, abc, abd, abcb, abcd, abdb, abdc, abcdb over F. The products of basis vectors are all zero, except for those of the form xy, x E {a, ab, abc, abd, abcb, abcd, abdb, abdc, abcdb}, y E {b, c, d } . These products are given by the following table.
a ab abc abd abcb abcd abdb abdc abcdb
b
C
d
ab ab abcb abdb abcb abcdb abdb abcdb abcdb
0 abc abc abdc abc abdc abdc abdc abdc
0 abd abcd abd abcd abcd abd abcd abcd
It is straightforward to check that the following three laws are a basis for the laws of A (as a non-associative algebra over F). XI(X2X3)
= 0,
XIX2X3X4X5 = XIX2X4~X3XSr XIX2X1X4
= XIX2X3X4.
We show that V a r A does not satisfy the minimum condition o n subvarieties by finding a finite algebra B E Var A which does not have a finite basis for its laws. Furthermore it can be shown that V a r B has only finitely many subvarieties. This implies the existence of a finite algebra C E Var B with the following properties. (1) C does not have a finite basis for its laws. (2) Every proper subvariety of V a r C is Cross. (3) V a r C has only finitely many subvarieties and only finitely many critical algebras. The algebra B has a basis a, b, c, ab, ac, abc, acb, abcb over E All products of these basis vectors are zero except for those of the form xy, x E {a, ab, ac, abc, acb, abcb}, y E {b, c}. Products of this form are given by t h e following table.
Laws in finite algebras
447
b
C
a ab ac abc acb
ab ab acb abcb acb
abcb
abcb
ac abc ac abc abc + acb - abcb abc
The following laws a r e a basis for the laws of B. XI(X2X3)
= 0, XIX2X4X3XSr
xIx2x3x4xs=
x1x2xqx4 = XIX2X3X4,
u,=O,
i=1,2,3,4,5,
w, = 0,
i = 6 , 7 , . . . , where
UI
:= x l x : + ' -
u2:= x l x f x : u3:=
x1x:x3x4-
XIX2X2,
x1x2xq-
x1x;x.1+
xIxsIx4x.1-
u4:= x I x 2 x . 1 x 4 + x I x . 1 x 4 x 2 +
- xIx3x2x4-
XIX2X3,
XIXZX.1X4+ XIX2X4X.1, XIX4X2X3
x1x2x4x1-
XIX4X3X2,
u5:= xIx2x3x4xs - xIx3x2x4xs-xIx2x3x5x4+xIx3x2xsx4,and
w, := XIX2X3XOX7 *
* * x,x4x5
- XIX2X3XhX7
' '
- XIXlX2XhX7
* * * x,x4x5
. x,x5x4 + x I x 3 x 2 x h x 7 '
* '
xgX5x4,
for i = 6 , 7 , . . . .
Note that the first three laws of this basis form a basis for the laws of A, a n d so B E Var A. It is straightforward to check that B satisfies these laws, but it is extremely tedious to check that they d o form a basis f o r t h e laws of B. However there a r e no real difficulties. T h e main step is to notice that it is sufficient to consider laws of the form u = 0 , where u is a linear combination of terms of t h e form XlX,X, * . * Xk with i,j,. . . ,k 3 2. To show that B does not hbve a finite basis for its laws we proceed as follows. It is not difficult to show that for i = 1,2,3,4,5 any
448
M.R. Vaughan-Lee
consequence of u, = 0 is of the form u + u = 0, where u = 0 is a law of A, and u is a (non-associative) polynomial whose degree is the same as the degree of u,. Similarly for i = 6,7,. . . any consequence of w, = 0 is of the form u + u = 0, where u = 0 is a law of A and u is a polynomial of degree i. Now if B has a finite basis for its laws, then for some integer n greater than the degree of u, for i = 1 , 2 , 3 , 4 , 5 , B has a basis consisting of the laws of A together with u, = 0 for i = 1 , 2 , 3 , 4 , 5 and w, = 0 for 6 s i < n. But then w. = 0 must be a consequence of these laws, and so, by t h e above remarks, w, can be expressed in the form u + u where u = 0 is a law of A and u is a polynomial of degree at most n - 1. Clearly we may assume that u and u only involve the variables x,, x z , . . . , x,. Now, for i = 1 , 2 , . . . , n, let T, be the endomorphism of the free non-associative algebra over F generated by xl, x2,. . . ,x. which maps x, to 0 and maps the other generators to themselves. Let p be the endomorphism (1 - .rr,)(l- r2).. . (1 - T,).Then if u is any polynomial of degree less than n, up = 0. So w, = wnp = up + up = up. This implies that w, = 0 is a law of A. Howeverit is easy to see that w, = 0 is not a law of A for any i 3 6 , and so B cannot have a finite basis for its laws. Let V be the subvariety of Var A determined by the laws u s= 0, w, = 0 for i = 6,7,. . . . Then the argument given above shows that V does not have a finite basis for its laws, and hence that V a r A does not satisfy the minimum condition o n subvarieties. It is possible to prove that any subvariety of V is determined by 4-variable laws. It follows that V has only finitely many subvarieties, and since B E V this implies that Var B has only finitely many subvarieties. However since V has only finitely many subvarieties it follows from the theorem that V is itself generated by a finite algebra. Thus the example B is, in a sense, redundant, but I have included it as it seems more satisfactory to have a specific example of a finite algebra which does not have a finite basis for its laws, rather than just to have proved the existence of one. References [ I ] I.V. L'vov, Varieties of associative rings, Algebra i logika 12 (1973) 269-297. [2] S.V. Polin, The identities of finite algebras, Sib. Math. J. 17 (1976) 1356-1366.
S.I. Adian, W.W. Boone, G. Higman, eds., Word Problems I1 @ North-Holland Publishing Company (1980) 449-576
ALGEBRAISCH ABGESCHLOSSENE GRUPPEN Martin ZIEGLER Mathematical Insitute, Technical University, Berlin
Introduction
We characterize algebraically closed groups up to partial isomorphism by “recursion-theoretic” invariants and determine their properties by looking at these invariants. In t h e course of our investigations we obtain most of the known results* about algebraically closed groups (in [21, 22, 14, 15, 2, 31) from more general theorems. We started from the problems of A. MacIntyre in [14, 15, 171 and solve many of them. We survey some of our results: A non-trivial group M is algebraically closed if every finite system of equations with coefficients from M, which is solvable in a supergroup of M, has a solution in M. By the theorem of Higman, Neumann, and Neumann (“every isomorphism of subgroups is extendable to an inner automorphism of a supergroup”) one can conclude: Algebraically closed groups are w-homogeneous, i.e. every isomorphism of finitely generated subgroups is extendable to an automorphism (I, 1.10). w-homogeneous groups M are determined up to partial isomorphism (u.t.p.i.) by their skeleton Sk(M) (the class of all finitely generated groups which are embeddable in M) (I, 2.10). So in the study of properties which are compatible with partial isomorphism, it is enough to study the skeletons of algebraically closed groups. First we characterize those classes X of finitely generated groups which occur as skeletons of algebraically closed groups. X has to satisfy the following conditions (I, 3.8): (U) If G E X and H is a finitely generated group, which is embeddable in G, then H E X. * This paper - my “Habilitationsschrift” at the Technische Universtitat Berlin - was finished in its original version in June 1976. I here omit the last chapter of that version, which dealt with the construction of uncountable groups, because of the great progress made in this area since then; see e.g. [34] and [35]. 449
450
M. Ziegler
(JEP) If H, G E X,then there is F E X,in which both H and G are embeddable. (AA) Every finite system of equations with coefficients from G E X which is solvable ifi a group containing G has a solution in a supergroup of G which belongs to X. B.H. Neumann showed that we can disregard the finitely generated groups with solvable word problem ([22], cf. [27]): Every finitely generated group with solvable word problem is in the skeleton of every algebraically closed group. MacIntyre proved the converse: If G is in the skeleton of every algebraically closed group, it has solvable word problem. And moreover, if G is in the skeleton of every algebraically closed group which contains the finitely generated group F, the word problem of G is Turing-reducible to the word problem of F ([15]). It was asked in [14] whether this theorem has a converse. The answer is negative ([3], 111, 3.5.3). We define a new notion of reducibility S * , stronger than Turing reducibility, and prove: for all finitely generated groups F, G the following are equivalent (111, 1.8.1): (a) for all algebraically closed M, F E Sk(M) 3 G E Sk(M). (b) W ( G ) s *W ( F )( W ( G )stands for the word problem of G). S * induces an equivalence relation =* (on t h e subsets of the set of natural numbers), which partitions 9 ( w ) into *-degrees. V, the set of *-degrees, is - like the set of Turing degrees - an upper semilattice in a natural way. “(b)+(a)” of 111, 1.8.1 shows that the skeleton of an algebraicall closed group M is already determined by W , ( M )= { W ( G ) / = *[ G E Sk(M)}. The following characterization of these subsets Z of V, which are of the form W , ( M ) for an algebraically closed group M, turns problems about algebraically closed groups into recursion theoretic problems. (Note that there is u.t.p.i. at most one M s.t. I = W , ( M ) (111, 3.12).) There is an algebraically closed group M s.t. I = W , ( M ) iff the following conditions hold: (i) If b E I, a s b, then a E I. (ii) If a, b E I, then sup(a, b ) E Z. (iii) If (A v B)E X for a r.e. horn class X and A E UI,then there is C E U I s.t. (A v C) E X. We call I C V satisfying (i), (ii), (iii) a closed ideal. (A v B is the disjoint union of A and B, A, B C w. R.e. horn classes - our recursion theoretic substitute for finite systems of equations - are defined from r.e. sets {(nb, . . . n:,) i E w } of finite sequences of natural numbers by X = { X C w ( V i ~ : E X A . . . A ~ : , E X ~ ~Cf. ~ E 111, X } . 3.8.) The main result of Chapter I1 is: For every r.e. horn class X and every natural number m, there is a finite system S ( & i ) , which is
1
Algebraisch abgeschlossene Gmppen
45 1
solvable in a supergroup of a group generated by g,, . . .g, if the word problem of ( g l , .. . gm) is contained in X (11, 2.6). As a first application of (111, 3.12) we give in 111, 4.1 a recursion theoretic proof of 111, 1.8.1 (a)+ (b): Let a, b E V. If for every closed ideal Z C V,a E Z j b E Z, then b s a. We prove the following three theorems in 111, 4 using the equivalence of algebraically closed groups and closed ideals: 1. Let the word problem of F be a complete set, then there is a smallest algebraically closed group M which contains F, i.e. M is embeddable over F in every algebraically closed supergroup of G (cf. 111, 2.2). Such an M exists uniquely. 2. M,, the countable algebraically closed group whose skeleton consists of all finitely generated groups with r.e. word problem, is minimal, i.e., isomorphic to each algebraically closed subgroup. 3. There are 2-1 algebraically closed groups of power wl, which are pairwise not partially isomorphic. All algebraically closed groups satisfy the same (elementary) Vzsentences (i.e sentences of the form V.? 39 cp(x, y ) , where cp is a quantifier-free formula). In [14] MacIntyre exhibited two algebraically closed groups and a V,-sentence which holds only in one of them. One of the problems in [14] was the question whether there is a V,-sentence by which one can distinguish algebraically closed groups. What are the elementary properties of algebraically closed groups in terms of their associated closed ideal? We assign to W , ( M ) the omodel o(M)= (w, + , * , W , ( M ) , E). We speak about w ( M ) in the language L z - the elementary language of second-order number theory. It is shown in Chapter IV that the properties of w ( M ) which are expressible by Lz-sentences, correspond to the elementary properties of M: For every Lz-sentence p there is an elementary sentence cp, which holds in an algebraically closed group M iff p holds in o ( M ) , and vice versa (IV, 2.4). We have a good control over the mutual dependence of he complexity of p and the complexity of 9. E.g., for every r.e. set A there is a V,-sentence cp, which is true in an algebraically closed group M iff A W , ( M ) . cp is false in M,,. If A is not recursive, there are also (by [lS]) algebraically closed groups where cp holds. This solves the mentioned problem from [14]. There are in fact 2”-many algebraically closed groups which satisfy pairwise different V,-sentences (IV, 3.5.1). (This was also proved in [MI.) By this method we can answer a further question from [14]. Which properties of algebraically closed groups are definable by elementary formulas? [ 141 contains some examples of “infinitary notions”, which
u
u
M. Ziegler
452
are in algebraically closed groups elementarily expressible. There is, e.g., a V2-formula cp(xo,. . .x n ) , which holds in M for a l , .. . a, iff the group generated by a , , . . . a, is simple (cf. IV, 3.1.1). In all w-homogeneous M the L*-properties of w(M) correspond canonically to t h e “arithmetical” properties of M, which are described in I11 in two ways: As the properties which are obtained from “recursive” properties by quantification, and as the properties which are expressible in L*, a weak second order logic with “in-built number theory”. In algebraically closed groups all arithmetical properties are “elementary”. The examples in [14, 21 are easily seen to be arithmetical properties. This Characterization was independently given in [33]. The algebraically closed groups which we construct in this paper e.g., in the proof of I, 3.8 (characterization of the skeletons of algebraically closed groups) or in the proof of 111, 1.8.1 - are built up using step-by-step methods, not by “forcing in model theory”. On the contrary we used our methods to study generic groups. In V, 1 we give a simple description of the w-generic groups. An algebraically closed group M is a-generic iff w ( M ) is an elementary substructure of (w, + ,* ,9 ( w ) , E ). V, 2 gives a new approach to finite generic groups: We define the forcing relation “game theoretically”, in a way which clarifies the connection with the step-by-step methods. The usual syntactic definition of forcing is now Lemma V, 2.6. The results of IV (“arithmetical properties are elementary”) allow us to prove a simple characterization of the generic groups, in which the forcing relation and elementary sentences are not used. A group M is generic iff every “dense”, arithmetical family of finite systems of equations has a member which has a solution in M (V, 3.8). In Chapter VI we study the propertie’s of W , ( M ) , for generic M. A family 9 of r.e. horn classes is “dense” if 2 n US#0 for all nonempty r.e. horn classes 2. B C w is said to be “subgeneric” if B belongs to the union of every arithmetical dense family of r.e. horn classes. Theorem (VI, 3.3) is: An algebraically closed group M is generic iff W , ( M ) contains only subgeneric sets. We list four theorems, which we prove by this characterization: (VI, 3.3) For every generic group M there is a unique b E V s.t. W , ( M )= { a E VI a S b} (the b’s which occur in this way can be easily characterized). (VI, 3.4.4) Countable generic groups are minimal. (VI, 4.4.2) Every generic group M contains a finitely generated group F s.t. F * F & Sk (M).
u
453
Algebraisch abgeschlossene Gruppen
(VI, 2.10) A V3-sentence, which holds in some algebraically closed group, holds in all generic groups. The last theorem answers a question from [14]: Is there a V,sentence true in generic groups which does not hold in m-generic groups? It is enough to note that every V,-sentence which holds in w-generic groups, holds in all algebraically closed groups (IV, 3.6.2). C.F. Miller I11 proved many of our theorems (e.g. 11, 3.10, 111, 1.8.1) independently and at about the same time. I thank Susanne Ziegler and Robert Fittler for their help and encouragement. Bezeichnungen Eine Ordinalzahl ist die Menge aller kleineren Ordinalzahlen. w, die kleinste Limesordinalzahl, ist die Menge der naturlichen Zahlen. a + 1 ist der Nachfolger der Ordinalzahl a. Wenn K eine Kardinalzahl ist, ist K + die Nachfolgerkardinalzahl. Statt w + schreiben wir w , . Die Kontinuumshypothese (CH) sagt 2" = w , . ( A 1 ist die Machtigkeit der Menge A. A ist abzahlbar, wenn ( A 1 = w. Die Potenzmenge von A bezeichnen wir mit P ( A ) . 1st K eine Kardinalzahl, bedeutet PK(A) {X C A ( X ( < K } . AB ist die Menge der Funktionen f : A + B von A nach B. Fur C C A ist f[C] = {f(x)( x E C } . A ist der Vorbereich Vb f[A] = N b f der Nachbereich. idA ist die Funktion {(x, x ) xvonf, E A}. f setzt g fort, wenn g C f. f l c bezeichnet die Einschrankung f fl (CXNbf) auf C. Wenn f injektiv ist, ist f-' die Umkehrfunktion von f. 1st der gemeinte Vorbereich klar, schreiben wir Ax x * * . fur die Funktion, die jedem x . * x * * . zuordnet. fg ist die Funktion A x f ( g ( x ) ) . Eine Funktion f konnen wir als die Familie cf(x)( x E V b f ) auffassen. 1st a eine Ordinalzahl, nennen wir (ap /3 < a ) eine Wohlordnung von {aB /3 < a}.9B:= U { @ B/3 < a}.Mit 6 meinen wir eine endliche Folge ( b o , .. . ,bn-') aus yB.n ist die Lange l(6) von b. Wir schreiben haufig kurzer 6 E B. Wenn g E "C, meinen wir mit g(6) die Folge (g(bo),. . . g(bn-')).6"(c) ist die Folge ( b o , .. . ,bn-',c ) . Elemente von "B bezeichnen wir mit 6. 1st = eine Aquivalenzrelation auf A, ist A / = die Menge der Aquivalenzklassen a / =, a E A. C trennt A und B, wenn A C C und
I
r
-
1
1
1
~nc=0.
Wir betrachten L-Strukturen a, 23... fur eine abzahlbare Sprache L. ( L ist eine Menge von Konstanten, Funktionszeichen und Relationszeichen. AuSer im ersten Kapitel wird L immer die Sprache L G der "Gruppentheorie" sein: L G enthalt eine Konstante e fur das neutrale Element, ein zweistelliges Funktionszeichen fur die
-
M. Ziegler
454
Gruppenmultiplikation und das einstellige Funktionszeichen-I, die Inversenbildung.) Wir unterscheiden vielfach die Struktur nicht von ihren Grundbereich, z. B. in Schreibweisen wie X f l ? ! = 0, x E ?I, ii E 2I, f[2I]. Mit 2I C B meinen wir aber, daB 2I Unterstruktur von B ist. 9f < B heiBt: 2I ist (isomorph) in B einbettbar. Ein Homomorphismus v :2I+ 2 ist “uber C”, wenn v auf C identisch operiert. Wir schreiben dann v : B und ?I < c B, wenn es eine Einbettung von 2I in B uber C gibt. ?I=cB sagt, daB es einen Isomorphismus von 2I und B uber C gibt. Aut(2I) ist die Automorphismengruppe von ?I. id, ist der triviale Automorphismus. Wenn B C a, bezeichnet (B) die von B erzeugte Unterstruktur. Entsprechende Bedeutung haben (B,C), (B, C ) , (ii), (B,ii), (6,ii), (a, i E I ) , . . .usw. Wir erweitern jedes L durch Hinzufugen einer Menge C = {c, i E w } von neuen Konstanten zu L ( C ) . Wenn ii €?I, wird ?I zur L(co,. . .~ , - ~ ) - S t r u k t u(r8 ,ii), in der c, durch a, interpretiert wird. Fur c, schreiben wir dann a,. ((ii), a‘) kurzen wir mit (6)’ab. v :(ii)+ (6) ist also genau dann ein Homomorphismus von (ii)’ nach (6)’,wenn v ein Homomorphismus von (ii) nach (6) ist, und v ( i i ) = 6. Entsprechend meinen wir mit (B,a)’ die L(E)-Struktur ((B, i),(z) usw. a und b haben den gleichen Typ uber C, wenn ( a ) ’ = =(b)’ (kurz fur (C, a)’s C(C, b)‘). Der Sprache L ( L ( C ) )ordnen wir die Menge L,, (L,,(C)) der elementaren L- (L(C)-) Formeln zu. Die Formeln bauen sich aus den atomaren Fomeln tl f 2 und R(rl,. . . t , ) mit 3x, A , i auf ( t , L- ( L ( C ) - ) Terme). Die Formeln von L-., erhalt man mit der zusatzlichen Bildungsregel “ist S eine Menge von Formeln, so sind auch A S und V S Formeln”. Diese Schreibweise benutzen wir, wenn S endlich ist, auch, um L,,-Formeln zu beschreiben. Statt A{cp,( i E I} verwenden wir auch A I E I q , .Wenn S leer ist, ist A S = T und V S = I (“wahr” und “falsch”). Wenn man nur hochstens abzahlbare Konjunktionen und Disjunktionen zulaBt, gelangt man zu den Formeln von L-.,. DaB in der L,,(C)-Formel cp hochstens die freien Variablen xo, . . .x. (und die neuen Konstanten co,. . . c k ) vorkommen, drucken wir durch cp(x0,. ..,x,) (cp(co,.. .ck, xo,. .. x,)) aus. Fur das Resultat der Ersetzung der x, durch Terme r, (und der c, durch t : ) schreiben wir cp (to,.. .) (cp ( t &. . . t o , . . .)). Entsprechend benutzen wir die Schreibweise S(xo, . . .) fur Formelmengen. Wenn 6 E 2I, bedeutet 2I I= cp(a’), daB (a, i)I= q ( 6 ) (oder ( 8 , i i ) C cp(E)). t(ii), fur einen L-Term t(Z), bedeutet genauer f(G)%. 1st f(2) eine Folge von L-Termen, ist die Folge to(Li), tl(ii), . . . .
1
1
r(i)
455
A lgebraisch abgeschlossene Gruppen
Die Menge S ( i ) von Formeln ist in !?1 erfullbar, wenn '?I I= S ( G ) fur geeignete ii aus !)I. Wenn T eine Menge v o n Aussagen ist, ist {cp T t- cp} die Menge der logischen Folgerungen von T. ?I = B heiBt, daB 91 und 23 elementar aquivalent sind, !?I < B,dal3 91 elementare Unterstruktur von % ist. Eine 3,-Formel +b hat in pranexer Normalform die Gestalt 3 i 1 V i 2 3 . . Qi.cp, cp quantorfrei. Dann ist -I t,b eine V,,-Formel. 91 B meint: alle 3,-Aussagen gelten genau dann in 91, wenn sie in B gelten. !?I ist 1-elementare Unterstruktur von %, wenn 91 C 93 und fur alle ii € 1 ' 1 jede 3,-Formel aus L ( t i ) , die in '8 gilt, auch in 9[ gilt, ? I K l B . Es sei n u n L = L G .T G ist die Menge der Gruppenaxiome. Gruppen sind Modelle von T G .Wir schreiben . . . k G * fur * * U T G t- . . . . Eine Gleichung (Ungleichung) ist eine Formel der Form ( i ) t l ( 2 ) t2(2),ti L(C)-Terme. Relativ zu T G ist jede Gleichung aquivalent zu einer Gleichung der Form t ( X ) = e. Weiter k o n n e n wir Terme mit Wortern identifizieren; das sind endliche Folgen von x, und x;'. Wenn W ein Wort in x ,,,... xk ist und gl,,...g, Elemente einer Gruppe G sind, bedeutet W(g,,,. . . g,) - wenn k s m - W ( g o , . . g k ) , und - wenn m < k - W(g,,,. . . g,, e, e,. . .). Gl(G, 2) ist die Menge aller Gleichungen aus L ( g ) ,Ungl(G, g) die Menge aller Ungleichungen aus L ( g ) , die in G gelten. Diag (G, g ) := Gl(G, 2) U Ungl( G, 2). Wenn 90,. . . ,q,, Aussagenvariable sind, nennen wir die Formel A q,, + (das bedeutet q1A q2 A * A q,, + 90 bzw. q 1A q2 A (ql A A q,, + I)eine positive bzw. negative aussagenlogische Implikation). Eine (positive bzw. negative) Implikation bekommt man, wenn man in einer aussagenlogischen Implikation die Aussagenvariablen durch Gleichungen ersetzt. ist die Godelnymer von (9) Vermoge einer Godelnumerierung iibertragen wir rekursionstheoretische Begriffe auf L,, (C). z. B. heil3t eine Menge von Formeln rekursiv aufzahlbar (r.a.), wenn die entsprechende Menge von Godelnummern r.a. ist. Fur Worter nehmen wir eine besondere Godelnumerierung. Wir wahlen eine rekursive Bijektion von yw und w. Dadurch ist jede naturliche Zahl s Godelnummer einer Folge (s(O), s(l), . . . s ( l ( s )- 1)). Dem Wort W = x 2 . . . x t , E~ E (1, - l}, ordnen wir die Folge mo,. .. m,, zu, wobei mi = 2ki ( = 2ki + l), wenn E~ = 1 ( = - 1). Die Godelnummer von W ist die Godelnummer s der Folge m o , .. . m,,.Wir schreiben W = V,. Wenn g' E G, definieren wir W ( G ,g) = {i V, (g) = e}. Wenn G endlich erzeugt ist, und g und 6 zwei Erzeugendensysteme von G sind, sind W(G, 2) und W(G, 6 ) rekursiv isomorph (W(G,2) =, W(G, 6)).Wir schreiben dann einfach W(G) fur W ( G , g ) ,das Wortproblem von G.
I
-
-
--
(v
1
-
456
M. Ziegler
G hat losbares Wortproblem, wenn W ( G )rekursiv ist. G ist rekursiv prasentiert, wenn W ( G )rekursiv aufzahlbar ist. Fur eine Untergruppe U von G definieren wir W(G U, g):= { i V,(g)E U } . Mit einer rekursiven Bijektion n, m (n, m ) ( 2n, m ) identifizieren wir w 2 mit w. Wir konnen so eine Teilmenge von w auch als zweistellige Relation auffassen. (a, b, c , . . . ) bedeutet ( . . ((a, b ) , c ) . * -). Die disjunkte Vereinigung A v B von A, B ist 2 A U (2B + 1) d. h. { 2 x I x E A } U { 2 x + l l x E B } . A v B v C v - - -bedeutet (( . . . ( ( A v B ) v C ) v . * . ). Wir verwenden Godelnumerierungen i w D,, D : , Df aller endlichen Mengen, aller Mengen mit hochstens einem Element, aller Mengen mit hochstens zwei Elementen. W, bezeichnet die z-te r.a. Menge, z ist ein Z?-Index von W,. A ist r.a. in B, A CraB, wenn fur ein z E w A = { x ( ~ u , w ( x , u , w )W € , A D , C B A D , C ~ \ B }A . ist TuringB. A ist m reduzierbar auf B, A S T B , wenn A % B und w \ A Sra reduzierbar (1-reduzierbar) auf B vermoge f, wenn f eine (injektive) rekursive Funktion ist und A = { x ( f ( x ) EB } , Schreibweise: A S m B , A SIB. Eine 2;-Menge A C w k ist eine Menge der Form { ( x l , .. . x k ) I 3 v l v v 2 3 * . QV.R ( v l , . .x t ) } fiir eine rekursive Relation R. A ist n;,,wenn w k \ A 2 ; ist, und A,", wenn A 2; und n: ist. A ist arithmetisch, wenn A 2 ; ist fur ein n E w. 4'"' ist die vollstandige Z$Menge aus [24]. 9"= {(n, k ) k E 4'")).Entsprechend definiert man n; (Z:,arithmetische) Teilmengen von w k X (P(w))'", (in [24] Zt), Z7:)). A ist implizit arithmetisch definierbar in B, wenn { A } = { Y C w ( Y ,B ) E R } fur ein arithmetisches R C (P(w))'. = Zu jeder Teilmenge Y von P ( w ) gehort das w-Modell ( w , 9) (w, + ,. , E ). Wir beschreiben w-Modelle mit L2-Aussagen. L 2 hat Variable v I , . . . w,,. . . fur naturliche Zahlen, X , Y,2,. . . fur Elemente von Y und fur jedes n E w eine Konstante. Aus den atomaren Formeln rl = rz, rl E X (r, Zahlterme) baut man die Formeln von L 2 mit 7 ,n , I u , und 3X, auf. R ist analytisch, wenn R in (0, P ( w ) ) mit einer L2-Formel definiert werden kann. A ist analytisch in B, wenn A = {x (B,x ) E R} fur ein anaIytisches R. Die L2-Formeln ohne gebundene Mengenvariable beschreiben gerade arithmetische Eigenschaften. D a die Gultigkeit einer solchen Formel cp nicht von Y abhangt, iibernehmen wir haufig cp in die Metasprache und schreiben q ( A ) statt (w, * . )I= cp(A). Wir nennen eine solche Formel X;-Formel, wenn sie eine ZZ-Menge definiert. Wir denken uns alle arithmetischen Relationen als Relationszeichen in L z aufgenommen.
1
1
1
I
1
Algebraisch abgeschlossene Gruppen
457
Eine L2-Formel der Form 3G,,XlVG2,X 2 .* . Qtj,,,XnR fur ein rekursives R n e n n e n wir eine 2 :-Formel. Entsprechend definiert man fiL-Formeln. Wenn A eine Teilmenge der Gruppe G ist, bezeichnet ( A ) Nden von A erzeugten Normalteiler. G ist genau dann einfach, wenn fur . ist die v o n (von den Elementen von) S alle a, b f e a E ( b) N Fr(S) erzeugte freie Gruppe. Fr(6) ist Fr({bo,. . . bn-l}). G wird von der Menge von Gleichungen S(g) prasentiert, wenn - falls S(g) = { W ( g )= e W E A } - (G, 6) isomorphes Bild von (Fr(g)/( W(g)l W E A ) Ng) , ist. Insbesondere wird also G von den 2 erzeugt. Wenn H, F Untergruppen von G sind, ist "H .F direkt", wenn H F = H x F (inneres Produkt) ist. Das heiat H n F = E (E ist die triviale Gruppe) and H C Z'( F ) ( Z'( F ) ist der Zentralisator von F in G). G sei Untergruppe von H, E Wir fassen das amalgamierte Produkt H * G F als Obergruppe von H und F auf. Das amalgamierte der Produkt der Obergruppen F; von G, j E J ist * G j E J}. @X ist die direkte Summe aller Gruppen aus 9, wir schreiben auch BG,,G oder X,,,F,. Z ( G ) ist das Zentrum von G, [g, h ] der Kommutator g h g - l h - ' . r, = Axgxg-' ist der zu g gehorende innere Automorphismus von G.
1
{cI
I. Algebraisch abgeschlossene Gruppen als o-homogene Strukturen
Im ersten Abschnitt zeigen wir, daB sich jede Gruppe in eine algebraisch abgeschlossene (a.a.) Gruppe einbetten laBt (1.3). D a jede 1-elementare Unterstruktur einer a.a. Gruppe wieder a.a. ist (1.5.2), findet man zu jeder abzahlbaren .Gruppe eine abzahlbare a.a. Obergruppe (1.6). Das zeigt, weil es 2" nicht-isomorphe endlich erzeugte Gruppen gibt, daB es auch 2" nicht-isomorphe abzahlbare a.a. Gruppen gibt. Um die Isomorphietypen der abzahlbaren a.a. Gruppen zu bestimmen, beachten wir, daB a.a. Gruppen w-homogen sind (2.1): Jeder partielle Isomorphismus endlich erzeugter (e.e.) Unterstrukturen laBt sich zu einem Automorphismus fortsetzen (1.9). Abzahlbare w-homogene L-Strukturen (fur eine Sprache L ) sind bis auf Isomorphie eindeutig durch ihr Skelett - die Klasse aller einbettbaren e.e. L-Strukturen - bestimmt (2.11.1). Allgemeiner :Zwei w-homogene Strukturen sind genau dann partiell isomorph (oder Lm,uaquivalent), wenn sie das gleiche Skelett haben (2.10.1). Wir konnen
458
M. Ziegler
also (zumindest theoretisch) Eigenschaften w-homogener Strukturen, die nur vom “partiellen Isomorphietyp” abhangen, am Skelett erkennen. So ist z. B. (wenn L die Sprache der Gruppentheorie ist) eine w-homogene L-Struktur genau dann eine a.a. Gruppe, wenn ihr Skelett aus Gruppen besteht und algebraisch abgeschlossen ist, d. h. daB jedes uber einer Gruppe des Skeletts losbare Gleichungssystem in einer Obergruppe aus dem Skelett losbar ist (3.7). Um das Studium der w-homogenen Strukturen auf das Studium der Skelette zuruckzufuhren, mussen wir die Klassen der e.e. Strukturen kennzeichnen, die als Skelett vorkommen. (3.5) sagt: Eine Klasse X endlich erzeugter Strukturen (in der hochstens w , viele verschiedene Isomorphietypen vorkommen) ist genau dann Skelett einer w homogenen Struktur, wenn X “unterstruktur-abgeschlossen” ist (d. h. mit einer Strcktur auch jede in sie einbettbare e.e. Struktur enthalt), die “joint embedding property” und die Amalgamierungseigenschaft besitzt. Dieser Satz liefert uns ein einfaches Verfahren zur Konstruktion a.a. Gruppen. Wir bekommen z. B. eine (dadurch eindeutig bestimmte) abzahlbare a.a. Gruppe deren Skelett gerade aus den e.e. Gruppen besteht, deren Wortproblem rekursiv aufzahlbar ist (3.9.2). Im vierten Abschnitt konstruieren wir mit den Methoden von 3 zu jeder abzahlbaren a.a. Gruppe eine partiell isomorphe Gruppe der Machtigkeit wl. In 4.6 und 4.7 werden einfache Bedingungen fur w homogene Strukturen angegeben zu einer Struktur der Machtigkeit w 1 partiell isomorph zu sein. Wir zeigen auf drei verschiedene Arten, daB a.a. Gruppen diesen Bedingungen genugen: in 4.3, 4.9, 4.10. 1. Einfache Konstruktionen algebraisch abgeschlossener Gruppen Ein Korper heiBt algebraisch abgeschlossen, wenn fur jedes nichtkonstante Polynom p E K [ X ] - ein Polynom also, das in einem Oberkorper eine Nullstelle hat - die Gleichung p ( x ) = 0 eine Losung hat. Hilberts Nullstellensatz zeigt, daB in allen algebraisch abgeschlossenen (a.a.) Korpern sogar jedes endliche System von Gleichungen
pI(xI, . - . x , ) = O , . . * , ~ n ( ~ l ,..* x , ) = O 9
3
@I,.
. . ,pn E K [ X l , .
a])
das in einem Oberkorper von K eine Losung besitzt, schon in K losbar ist. D a man nicht erwarten kann ein ahnliches Resultat fur Gruppen zu erhalten, definiert man:
Delinition 1.1 (Scott, Neumann). Eine nichttriviale Gruppe G ist algebraisch abgeschlossen, wenn fur jede endliche Folge von Wortern
Algebraisch abgeschlossene Gruppen
459
Wl(yll,.. . ,y,, xo, . . . ,xs), . . . W,(yo,. . . ,x,) und Koeffizienten go,. . .,g, € G das Gleichungsystem (“mit Koeffizienten aus G”) { Wl(g0,.. . ,g, xll,.. . ,x.)
=
e, . . . , W, (gll,.. . ,x,) = e } (=: S(g,i ) )
in G losbar ist, wenn es in einer Obergruppe von G losbar ist. (E, die triviale Gruppe, hat diese Eigenschaft, gilt aber per definitionem nicht als algebraisch abgeschlossen. Eine Losung von S(g, i )in G ist eine Folge von Elementen f € G, auf die alle Gleichungen von S zutreffen, fur die also G I= A S ( g , j ) . Statt “a.a. Gruppe” sagen wir auch “a.a.Gr.”, und fur “in einer Obergruppe von G losbar” “uber G losbar”.) Wenn F und H Obergruppen von G sind, sind sie uber G in das amalgamierte Produkt F * G H einbettbar. Wir schliefien daraus Bemerkung 1.2. Ein Gleichungssystem mit Koeffizienten aus G C F ist uber G losbar gdw. es uber F losbar ist.
In [26] wurde der erste Satz iiber a.a. Gruppen bewiesen: Satz 1.3. Jede Gruppe ist Untergruppe einer algebraisch abgeschlossenen Gruppe. Beweis. Sei G eine Gruppe und ( S , ] p E a \{O}) eine Wohlordnung aller endlichen Gleichungssysteme mit Koeffizienten aus G, die uber G losbar sind. Wir definieren rekursiv eine aufsteigende Folge von Gruppen Go, p < a : Go:= G ; fur p > 0 sei G, eine Obergruppe von U{G,(y < p } , in der S, eine Losang hat (vgl. 1.2). Wir bilden G + : =U{G,I p < a } und iterieren diese Konstruktion: G o : =G ; G“+’:=(G”)’. Die gesuchte Obergruppe von G ist U{G‘‘I n < w } . 0 Folgerung 1.4. Z u jeder Menge X von Gruppen, gibt es eine a.a.Gr., in die jedes Element von X einbettbar ist. Beweis. Man wende 1.3 auf
BGEtKG an. 0
Wir geben noch zwei andere Konstruktionsverfahren fur a.a. Gruppen an. Lemma 1.5. 1. Die Vereinigung einer aufsteigenden Kette von a.a. Gruppen ist algebraisch abgeschlossen. 2. Jede l-elementare Unterstruktur einer a. a. Gncppe ist algebraisch abgeschlossen.
460
M. Ziegler
Wir werden spater sehen, daB sich 2 umkehren 1aBt: a.a. Untergruppen sind 1-elementare Unterstrukturen. Zum Beweis von 1: Ein endliches Gleichungssystem, das iiber der Vereinigung einer Kette a.a. Gruppen losbar ist, besitzt schon eine Losung in jedem Kettenglied, in dem alle Koeffizienten des Systems liegen. Z u 2: Sei G < I M a.a. Wenn S ( g , i ) iiber G losbar ist, ist S auch iiber M und also in M losbar. Aus MI= 3 i A S(2, i) folgt wegen der Voraussetzung G I= 32 A S(g, i ) :Es gibt eine Losung von S in G. 0
Folgerung 1.6. 1. A sei eine Teilmenge der a.a.Gr. M. Dann gibr es eine a.a.Gr. A C N C M mir 1 NI = 1 A I + o. 2. Zu jeder Gruppe G und jeder Kardinalzahl K 2 1 GI gibt es eine a.a.Gr. M 3 G, K = / M I . (Aus 4.5.2 wird sich ergeben, daB a.a. Gruppen nicht endlich sind.) 1 bekommt man aus 1.5.2 mit Hilfe des Satzes von Lowenheim Skolem, der sogar ein A C N < M, 1 N I = I A I + o liefert (siehe auch 2.11.5). Nach 1.3 ist G in beliebig groBen Gruppen enthalten, also folgt 2 aus 1. 0 In jeder Machtigkeit gibt es also a.a. Gruppen. Uberabzahlbare a.a. Korper fester Charakteristik sind durch ihre Machtigkeit (bis auf Isomorphie) eindeutig bestimmt. Es gibt abzahlbar viele abzahlbare a.a. Korper. Wieviel a.a. Gruppen gibt es in jeder Machtigkeit? Im Laufe dieser Untersuchung werden wir zeigen, daB es in jeder iiberabzahlbaren Machtigkeit 2-1 a.a. Gruppen gibt. (Tatsachlich gibt es in jeder Machtigkeit K genau 2" nichtisomorphe a.a. Gruppen, siehe [341-)
Satz 1.7 [22]. Es gibt 2" abzahlbare a.a. Gruppen. Der Satz folgt aus
Lemma 1.8 [19] (siehe auch 111, 3.5.2). Es gibt 2" uon zwei Elementen erzeugte Gruppen. Denn jede abzahlbare Gruppe enthalt nur abzahlbar viele von zwei Elementen erzeugte Untergruppen. Jede von zwei Elementen erzeugte Gruppe ist aber in einer abzahlbaren a.a.Gr. enthalten Andererseits gibt es iiberhaupt nur 2" abzahlbare Gruppen. 0 Die Losbarkeit eines endlichen Gleichungssystems S mit Koeffizienten g E M in der a.a.Gr. M hangt nach 1.2 nur von ( g ) (bzw. den Obergruppen von ( g ) ) ab. Der nachste Satz zeigt, daB alle
Algebraisch abgeschlossene Gruppen
Eigenschaften von 2 in M nur vom Isomorphietyp von M ist “w-homogen”.
461
(2) abhangen:
Satz 1.9. In algebraisch abgeschlossenen Gruppen laPt sich jeder Isomorphismus z wischen endlich erzeugten Untergruppen zu einem inneren Automorphismus fortsetzen. Z u m Beweis verwenden wir Lemma 1.10 [ 8 ] . Jeder Isomorphismus zwischen zwei Untergruppen einer Gruppe lafit sich zu einem inneren Automorphismus einer geeigneten Obergruppe fortsetzen.
Sei nun M algebraisch abgeschlossen, g E M und p : (2)- M eine Einbettung. In einer Obergruppe von M gibt es einen inneren Automorphismus 7, 3 p. f ist also eine Losung des Gleichungssystems {xg,x-’ = p ( g , ) i = 0 , . . . . , r}. Wenn f ’ eine Losung in M ist, leistet 7,. das verlangte.
1
Im nachsten Abschnitt untersuchen wir beliebige Strukturen fur eine (abzahlbare) Sprache L, die ein Analogon von 1.10 erfullen. 2. a-homogene Strukturen Definition 2.1 (siehe [7]). Eine Struktur $%I ist w-homogen, wenn sich jeder Isomorphismus zwischen endlich erzeugten (e.e.) Unterstrukturen “schrittweise” fortsetzen lafit; d. h. wenn 91 e.e., ?I Cm, p :?1+m eine Einbettung ist, gibt es fur jedes a E \JJ7 eine Fortsetzung p c v : (?I, a)-+s%I. Bemerkung 2.2. Nach 1.10 sind a.a. Gruppen w-homogen. 1st rol w-homogen und ii E YA, so ist auch (die L(E)-Struktur) (m, ii) w-homogen. Lemma 2.3. 1 . $YR . ist genau dann w-homogen, wenn fur jedes e.e. ?I C ‘%I jede endlich erzeugte Oberstruktur von ?I, die in lm einbettbar ist, iiber ?I in 2J2 einbettbar ist. 2. Die Vereinigung einer aufsteigenden Kette von w-homogenen Strukturen ist w-homogen. 3. A sei Teilmenge der unendlichen w-homogenen Struktur m. Dann gibt es eine w-homogene Struktur A C ?I C lllz mit (?[ ( = ( A( + w.
M.Ziegler
462
Beweis. 1. Sei 9Jl w-homogen, e.e. '? CI W,2l C 23 e.e. und p :23 +9Jl eine Einbettung. Wir setzen den Isomorphismus p &, :p [a]+ '3 fort Dann ist vp : 2 3 - ~2172 die gesuchte Einbettung. Sei zu v : p [23]+92. umgekehrt die Bedingung von 1 erfult und p : l?l+9Jl eine isomorphe Einbettung einer e.e. Unterstruktur, a E 92.Wir wahlen eine Oberstruktur p [a]C 23 mit einem Isomorphismus p C v :(a,a ) + 23. 9' 3 ist vermoge v-' in W einbettbar. Also gibt es eine Einbettung T : 23 +ciurl 9Jl. ~p : (a, a ) + W ist die gesuchte Fortsetzung von p. Wir uberlassen dem Leser den Beweis von 2 und 3. 0 w-homogene Strukturen sind bis auf "partielle Isomorphie" durch ihre endlich erzeugten Unterstrukturen eindeutig bestimmt:
Delinition 2.4 ([ll]). 1. Zwei L-Strukturen W,8 sind partiell isomorph - W = p 8 -, wenn es eine nichtleere Familie I von Isomorphismen zwischen Unterstrukturen von 9Jl und '% gibt, die folgende Eigenschaften hat: (a) fur alle 7r E I, a E 9Jl gibt es v 3 T , v E I, a E Vb v. (b) Fur alle T E I, b E W gibt es v 3 T , v E I, b E Nb v. 2. 9Jl ist partiell isomorphe Substruktur von 8 - 9Jl<<, 8 -, wenn C 8,und fur alle i E 9Jl (n,ii) sP(8,ii). 3. W heiBt partiell in 8 einbettbar - m<, 8 -, wenn es eine nichtleere Familie I von Isomorphismen von Unterstrukturen von 1111 und % gibt, die die Eigenschaft (a) von 1 hat. Naturlich sind isomorphe Strukturen partiell isomorph und einbettbare Strukturen partiell. einbettbar.
Beispiel 2.5. Wenn W und 8 partiell isomorph sind, so ist mit W auch 8 o-homogen. Denn wenn p : % 2)23 ein Isomorphismus zwischen endlich erzeugten Unterstrukturen von '% und a E 8 ist, wahlen wir nach l(b) ein 7r E I mit (%, 23, a ) C Nb T. Weil W o-homogen ist, ~ : T-'[%] + p-'[23] zu einer konnen wir den Isomorphismus 7 r - l ~lr-qa1 Einbettung p' : 7r-'[(%, a ) ]+ 272 fortsetzen. Mit (a) wahlen wir ein T C Y E I mit p'[7r-'(%,a ) ] C Vb v. Die gesuchte Fortsetzung von p ist vp 'v-'
((PI,d.
Die eben eingefuhrten Beziehungen lassen sich einfach durch infinitare Sprachen beschreiben: Satz 2.6 ([ll]). 1.
2.
m
% gdw.
Ws,, 8 gdw. YR=m,u8.
m<-,- %.
A lgebraisch abgesch lossene Gncppen
463
3. W<, % gdw. $9Jl j 300,y9. 92 bedeutet, dap alle Existenzaussagen von L,,,, die in Y ’ JI (W 3 gelten, auch in \n gelten.) Folgerungen 2.7. “=,,”, ‘ ‘ < p ” , “<,” sind transitive Relationen. A u s W, =, <, %, =,, %* folgt 5112, <,, 912. Wenn 92 Vereinigung der Folge a < y ) ist, und nnz, <,, W, fur a < p, ist fur alle a ma <,, 92.
(maI
a
Der nachste Satz ist wohlbekannt.
$nz,
92 und % seien hochstens abziihfbar. Dann ist Satz 2.8. 1. gdw. W = %. 2. Wenn mZ hochstens abzahlbar ist, ist Z1IZzP9 gdw. W zu einer partiell isomorphen Substruktur von % isomorph ist. 3. Wenn W hochstens abzahlbar ist, ist W<, \n gdw. W < 92.
Wir deuten einen Beweis von 2 an. Es ist klar, daB aus W < , 92, 111 folgt. Sei andererseits W=,, ‘31 und (a, n E w ) eine Aufzahlung von W, Z eine Familie von partiellen Isomorphismen, die und 111 partiell isomorph macht. Wenn man in jedem Schritt 2.4.l(a) anwendet, gewinnt man eine Folge vo C v, C . * . von Isomorphismen aus Z, a, E Vb v,. u : = v, ist eine isomorphe Einbettung von $9Jl in 111. Da vn aus Z ist, ist (W; ao,. . . ,a,) =,, (%; v(ao),. . . , v(a,)). 0
1
u,,,
Deenition 2.9 (vgl. [14]). Das Skelett Sk(W) einer Struktur ist die Klasse aller endlich erzeugten, in W einbettbaren Strukturen. Wenn rC eine Klasse von Strukturen ist, bezeichne 11 X 11 = 1 X/,(die Zahl der verschiedenen Isomorphietypen in X. Den nachsten Satz kann man als Verallgemeinerung des Satzes uber die Isomorphie aller abzahlbaren homogenen und (Sk(W) - ) universellen Strukturen auffassen. Fur a.a. Gruppen findet er sich in ~41. Satz 2.10. 1. W und % seien o-homogen. Dann ist (a) W s , , 111 gdw. Sk(W) = Sk(%). (Esist dann sogar (W, ii) =,, (111, 61,wenn (5)’= (6)’.) (b) Wenn W C ‘32, W <,, 111 gdw. Sk (W) = Sk(%). 2. Sei ‘2l w-homogen. Dann ist W<,% gdw. Sk(W)CSk(%). Beweis. 2. Sei W<, 111 und ‘2l E Sk(W). Wenn p % in W einbettet, wahlen wir ein v E I mit p [fl] C Vb v. Das zeigt, daB ‘2l< %. Sei umgekehrt Sk(W) C Sk(111). Fur Z nehmen wir die Menge aller
464
M. Ziegler
Isomorphismen zwischen e.e. Unterstrukturen von W und 91. Aus der w-Homogenitat von ,32 folgt 2.4.l(a). Denn sei '?l C W, T : '?l+ %, T E I und a E W . Es gibt eine Oberstruktur '23 von T[%]und eine Fortsetzung T':(g,a ) + '23 von T. D a '23 E Sk(W), ist nach Voraussetzung '23 E Sk(%). Wir haben also, weil ,Jz w-homogen ist, mit v = PT' ist die gesuchte 2.3.1 eine Einbettung @ : '23+,,l~~llW. Fortsetzung von T : v E I, a E Vb (v). l(a). Wenn W s P8,ist a<,% und \9Jl<<, %, also wegen 2 Sk('E) = Sk(%). Wenn Sk(W) = Sk(%), so hat das im Beweis von 2 definierte I auch die Eigenschaft 2.4.l(b). l(b) folgt aus dem Zusatz von l(a). Folgerungen 2.11. 1. Y.R und 2 seien hochsten abzahlbar und w homogen. Dann isr W = \n gdw. Sk (W) = Sk (92). 2. W sei hochstens abzahlbar und w-homogen. Dann lafit sich jeder Isomorphismus zwischen e.e. Unterstmkturen z u einem Automorphismus von 2R fortsetzen. (1.10 zeigt, dafi man hier fiir a.a.Gr. die Abzahlbarkeit nicht vorauszusetzen m up.) 3. W sei hochstens abzahlbar und 91 w-homogen. Dann ist \E< 91 gdw. Sk(9X) C Sk(%). 4. % sei aufsteigende Vereinigung der Folge (YJL a < p ) . !?J& sei whomogen und fiir alle a < p Y.Ra s P5% Dann ist !lJ&=p %. 5 . A sei Teilmenge der unendlichen w-homogenen Struktur '9Jl. Dann gibt es ein A C % K P W mit ( % I= IA 1 +JISk(W)((+w.
1
3. Kennzeichnung der Skelette w- homogener Strukturen Fur die Untersuchung der Eigenschaften w-homogener Strukturen, die sich auf partiell isomorphe Strukturen ubertragen, genugt es nach dem Vorangehenden, die Skelette w-homogener Strukturen zu betrachten. Nicht jede Klasse endlich erzeugter Strukturen ist Skelett. In 3.2 geben wir zwei notwendige Bedingungen (U, JEP) an, die fur abzahlbare Klassen auch hinreichend dafiir sind, Skelett zu sein. Die Skelette w-homogener Strukturen haben zusatzlich die Amalgamierungseigenschaft (AE). Hier bekommt man sogar: (3.5) Eine Klasse e.e. Strukturen, von hochstens der Machtigkeit wl, ist genau dann Skelett einer w-homogenen Strukture, wenn sie (U, JEP, AE) erfullt. Unter Voraussetzung der Kontinuumshypothese sind dadurch also die Skelette w-homogener Strukturen vollstandig gekennzeichnet. Zum SchluP geben wir eine Anwendung dieser Methode auf a.a.Gr.: Eine w-homogene Gruppe ist genau dann a.a., wenn ihr Skelett die Eigenschaft (AA) hat. (Wir betrachten Strukturen fur eine abzahlbare Sprache L.)
Algebraisch abgeschlossene Gmppen
465
Definition 3.1. Eine Klasse X endlich erzeugter Strukturen erfullt 1 die Bedingung (U) - X ist Unterstruktur-abgeschlossen - wenn X mit jeder Struktur 'u jede Struktur enthalt, die in ?I einbettbar ist. 2 X erfiillt (JEP) - joint embedding property - wenn es fur alle ?I, 8 E X ein B aus X mit 'u < und 23 < 0 gibt. Weiter sol1 X nicht leer sein. Satz 3.2. 1. Fur jede Struktur 2 ' R hat Sk(1172) die Eigenschaften U und JEP. 2. X erfiille ( U ) und (JEP). Wenn )IX (1 < w, gibt es eine Struktur '537 mit X = Sk(2X). Zusatz: Fur alle mZ ist IISk(m)((s min(2",
2R, das hochstens abzahlbar ist.
lfozl+
w ) . In 2 finder man ein
I
Beweis. 1 ist trivial. 2: (41, n < w ) sei eine Aufzahlung von Elementen von X,unter denen alle Isomorphietypen von X vorkommen. Wir setzen 23" := ?Io und wahlen fur 23n+lE X eine Oberstruktur von Sn, in die ?In einbettbar ist; dazu brauchen wir (JEP). Wegen (U) ist 31 das Skelett von m:= U{an 1 n <w}. 0
2R ist weder durch X eindeutig bestimmt, noch kann es im allgemeinen w-homogen gewahlt werden. Wir definieren daher Definition 3.3. Eine Klasse von endlich erzeugten Strukturen X hat die Amalgamierungseigenschaft (AE), wenn es zu jeder Einbettung v : ?l+ 23 von zwei Strukturen aus X und jeder Oberstruktur ?I C ?I' E X eine Fortsetzung v C p : ?X'+ 23'€ X gibt. Bemerkung 3.4. 1 . Die Klasse 3 aller endlich erzeugten Gruppen erfullt (AE). Man identifiziert ?I vermoge v mit einer Untergruppe von 23 und setzt 23' = ?I' *:1123. 2. Wenn 0 # X C 3, folgt (JEP) aus (U) und (AE). Satz 3.5. 1. Wenn 9R . w-homogen ist, hat S k ( m ) die ( A E ) .
2. Wenn IIXIIs w , und X die Bedingungen ( U ) , (JEP) und ( A E ) erfiillt, gibt es ein w-homogenes '9Jl mit X = Sk(\%).
Zusatz. Aus 2.11.5 folgt, dab sich bei 2 ein finden lafit.
mit
lml sIIX)I+w
Beweis. 1. ?I C a' seien aus Sk (m) und v : ?I + 23 E Sk (m) eine
M. Ziegler
466
Einbettung. Wir konnen ?l’Clllz und 23 C iD2 annehmen. D a YJt w homogen ist, gibt es eine Fortsetzung v C p : 2l’+lllz. Wir wahlen fur 23’:= p [?1’] E Sk(Z0z) und haben so (AE) nachgewiesen. Wir fassen beim Beweis von 2 zur Vereinfachung 0 als endlich erzeugte Struktur auf, die durch 0 in jede Struktur eingebettet ist. (JEP) wird so ein Sonderfall von (AE). Wir brauchen zwei Hilfssatze: Hilfssatz 1. Sei \Jz hiichstens abziihlbar, S k ( 8 ) C X,‘2l e x . , 2l C %, v : ?I + 23 E X eine Einbettung. Dann gibt es % C %’, 1 %’I w, Sk (9’) C X und eine Einbettung p : 23 + %’, p v = id,. Beweis.
Y2 sei
Vereinigung der endlich erzeugten Strukturen ‘2l =
?loC ?I, C . . . . Wir definieren eine Folge 23 = BoC Bl C . * * von
- -
Strukturen aus X und Enbettungen v = v f l Cv 1C * , vi : H i+ B i , indem wir, um 23i+l und zu erhalten, (AE) auf 21i C und vi i < w } und p ‘ = vi i < w } . anwenden. Dann setzen wir %” = Wir setzen nun p ‘ zu einem Isomorphismus p “ : %’+ 8’’ einer Oberstruktur von \Jz fort. p :p’%’ leistet das verlangte. (23- a < w , ) sei eine Folge von Elementen von X,unter denen alle Isomorphietypen von X vorkommen.
u{8,/
u{ I
1
Hilfssatz 2. Sei \Jz hiichstens abruhlbar, Sk(%) C X,p < w , . Dann gibt es ein %@3 9,1 %@I d w, Sk (YIP)C X, sodab es fur alle Einbetrungen v : ?l+ Be uon e.e. ?l C 9 ( a < p ) eine Einbettung p : Ba+ %378 mir p v = id,l gibt.
I
Beweis. (vn : a,, + Ban n E w ) sei eine Aufzahlung aller dieser Einbettungen. Wir konstruieren eine Kette % = 3, C %, C * * * S k ( 8 , ) C X hochstens abzahlbaren Strukturen, indem wir den Hilfssatz 1 auf 9. und v, : ?fn -23;” anwenden, und so konstruieren. Bei diesem Schritt erhalt man auch die Abbildung p. : 23an + gn+, C !TI@:= n < w } mit pnvn = idvcm.
u{9.1
Beweis von 3.5.2. Wir definieren 2, = 0 (s. Bem. zu Beginn des p < a})= wie im zweiten Beweises) und - fur a < wI - 9,= Hilfssatz. a:=U{%, a < wl} ist nach 2.3.1 w-homogen. 0
1
(u{%,1
Man kann sich leicht uberlegen - oder aus dem nachsten Satz folgern -, daB jede Gruppe, die zu einer a.a.Gr. partiell isomorph ist, selbst algebraisch abgeschlossen ist. Es mu0 also moglich sein, die a.a. Gruppen als w-homogene Gruppen mit bestimmten Skelett zu charakterisieren. Dazu definieren wir
Algebraisch abgeschlossene Gruppen
467
Definition 3.6. Eine Klasse X von endlich erzeugten Gruppen ist algebraisch abgeschlossen (erfullt AA), wenn jedes endliche Gleichungssystem mit Koeffizienten aus F E X,das uber F losbar ist, in einer Obergruppe von F losbar ist, die zu X gehort. Satz 3.7. Eine o-homogene Gruppe ist genau dann algebraisch
abgeschlossen, wenn ihr Skelett algebraisch abgeschlossen ist.
Beweis. Wenn M a.a. ist, folgt (AA) fur das Skelett von M sofort aus 1.2. Sei umgekehrt M w-homogen und erfulle Sk(M) (AA). Das endliche Gleichungssystem S mit Koeffizienten aus der e.e. Untergruppe F C M habe eine Losung in der e.e. Gruppe G 3 F. Wegen (AA) konnen wir G E Sk(M) annehmen. Nach 2.3.1 ist G uber F in M einbettbar und also S in M losbar. M ist also algebraisch abgeschlossen. 0 Folgerungen 3.8 (2 ist im wesentlichen Th. 17 aus [14]). 1. Aus M a.a. und N = p M folgt N a.a. 2. X C 3 ist genau dann Skelett einer abzahlbaren a.a. Gruppe, wenn X abzahlbar ist und (U, JEP, A A ) erfiillt. Die Gruppe ist durch 3Y eindeutig bestimmt. 3. X C % ist genau dann Skelett einer a.a.Gr. von hochstens der Machtigkeit wl, wenn X hochstens die Machtigkeit w 1 hat und (U, JEP, AA ) erfii‘llt. 4. (CH) X C % ist genau dann Skelett einer a.a.Gr., wenn X (U, JEP, A A ) erfiillt. Beweis. 1 folgt sofort aus 2.2, 2.5 und 2.10.1. Um fur 2, 3 und 4 die Satze 3.5 und 3.7 anwenden zu konnen, mussen wir zeigen, dal3 aus JEP und AA die Amalgamierungseigenschaft folgt. Wir nehmen dazu G C G’ aus X und eine Einbettung v : G +F E X. JEP gestattet es G’C F anzunehmen. Wie im Beweis von 1.9 finden wir ein endliches Gleichungssystem S mit Koeffizienten aus F, das f erfult, wenn q 3 v. Weil X algebraisch abgeschlossen ist, ist S in einem F C H’E X Iosbar, die Losung sei f’. Dann ist v C ~ ~ F +1 H’ ~ die : gesuchte Fortsetzung von v. 0 Zwei Beispiele 3.9. 1. 3 - die Klasse der endlich erzeugten Gruppen - hat die Eigenschaften U, J E P und AA. Wenn Sk(M) = 3 - M ist “w-universell” - und M o-homogen ist, ist M also algebraisch abgeschlossen. Mit (CH) liefert uns der letzte Satz w-universelle,
468
M. Ziegler
homogene Gruppen. Wir brauchen aber weder CH noch den letzten Satz, denn 1.4 verschafft uns eine w-universelle a.a.Gr., 2.11.5 sogar eine a.a.Gr. der Machtigkeit 2". 2. Xo, die Klasse der endlich erzeugten Gruppen mit rekursiv aufzahlbarem Wortproblem, hat die Eigenschaften U, JEP und AA. Denn nach dem Satz von Higman ([9]) besteht XI,gerade aus den endlich erzeugten Untergruppen endlich prasentierter Gruppen. .X(, erfullt also U. Wenn F durch das endliche Gleichungssystem S(f) prasentiert wird und G durch T ( g ) , wird F * G durch S ( f ) U T ( 2 ) prasentiert, also gilt JEP. Um AA zu zeigen, sei S ( x i ) ein endliches Gleichungssystem, das uber F E X(, losbar ist. Man kann annehmen, daB F - durch T ( f )- endlich prasentiert ist. S ( $ g ) U T(f) prasentiert dann eine Obergruppe von F, in der S - von 2 - gelost wird. Zu X o gehort also eine eindeutig bestimmte abzahlbare a.a. Gruppe Mo mit Sk(Mo)= XI,. MI,wurde zuerst in [I41 konstruiert. 4. w- homogene Strukturen der Machtigkeit w , Wir haben zu abzahlbaren Skeletten abzahlbare w-homogene Strukturen und zu Skeletten der Machtigkeit o,w-homogene Strukturen der Machtigkeit w , konstruiert. Gibt es uberabzahlbare w homogene mit abzahlbaren Skelett? Diese Strukturen waren zu w abzahlbaren Strukturen partiell isomorph. Nach einem Satz von Kueker ([13]) besa0e dann die abzahlbare Struktur 2"' viele Automorphismen. Wir untersuchen zunachst die Zahl der Automorphismen abzahlbarer o-homogener Strukturen. 4.2 charakterisiert auf einfache Weise die abzahlbaren w-homogenen Strukturen mit 2" Automorphismen. In 4.3 zeigen wir, daB abzahlbare a.a.Gr. der Bedingung von 4.2 genugen. Die Aussage von 4.3 stellt sich fur beliebige abzahlbare w -homogene Strukturen sogar als aquivalent zur Existenz einer partiell isomorphen Erweiterung der Machtigkeit w , heraus. Im letzten Teil dieses Abschnitts geben wir noch zwei andere Moglichkeiten an, zu abzahlbaren a.a.Gr. w,-machtige partiell isomorphe Gruppen zu konstruieren. In [34] wird gezeigt, daB jede a.a. Gruppe zu Gruppen beliebig grol3er Machtigkeit partiell isomorph ist. Der folgende Satz findet sich in ([13]).
Satz 4.1. Eine abzahlbare Struktur 2Jl besitzt genau dann 2"' viele Automorphismen, wenn es zu jedem endlich erzeugten 3 C 9.R einen nichttrivialen Automorphismus von 9.R uber 3 gibt. 2Jl besitzt sonst nut hochstens abzahlbar viele Automorphismen. Folgerung 4.2. Eine abzahlbare w-homogene Struktur 2Jl besitzt genau
Algebraisch abgeschlossene Gruppen
469
dann 2" Automorphismen, wenn es zu jedem ?I E Sk(2R) ein (?l, a, b E (?I, b)'. Sk(!JJl) gibt, so dafi a # b und (W,a)' Algebraisch abgeschlossene Gruppen haben die in 4.2 geforderte Eigenschaft, es gilt sogar:
G seien e.e. Untergruppen der a.a. Gruppen M und f 2 ' F. Dann gibt es einen inneren Automorphismus T von M uber F mit f b2 d G 1 . Satz 4.3. F C
Zum Beweis brauchen wir das folgende Lemma Lemma 4.4. F sei Untergruppe von G. Dann gibt es einen inneren Automorphismus T einer Obergruppe von G mit T [ G ]n G = F und 7 ( F = idF. Beweis. G' sei eine isomorphe Kopie von G uber F, G n G ' = F. Wir bilden H = G *FGr und setzen p : G G F G ' zu einem inneren Automorphismus T einer Obergruppe von H fort. 0 Seien nun f , F, G , M wie in 4.3. Unser Lemma liefert einen inneren Automorphismus T~ einer Obergruppe N 3 M , mit rglF=idF und f E T~ [GI. Nach eine weiteren Anwendung von 4.4 haben wir in einer Obergruppe (0.E. N ) einen inneren Automorphismus Th iiber T~ [ G ) mit Th cf) # f. Wenn G = ( g o , . . . , g.), F = (f", . . . ,fm)> gilt in N gfog-' = f o . . . gf,,,g-' = fm, h (ggog-')h-' = ggog-' . . h (gg,,g-')h -' = gg,g-' und h f K ' # f. Wir beniitzen nun 1.3.1 des nachsten Kapitels: Weil also das endliche System von Gleichungen und Ungleichungen 1
s(S, g, f, x, y ) = {xfix-'
=
fi I i s m 1
U{y(xgix-')y-'= x g , x - ' ( i S n )
u { 1YfY - ' =f ) in einer Obergruppe von M (durch g, h ) gelost wird, besitzt es auch eine Losung g ' , h' in M. T = T ~ hat . die gewunschten Eigenschaften. 0 Folgerungen 4.5. 1. F C G seien endlich erzeugte Untergruppen der a.a. Gruppe M, S fl F = 0 eine endliche Teilmenge von M. Dann existiert ein innerer Automorphismus r iiber F mit S fl T [ G = ] 0. 2. Algebraisch abgeschlossene Gruppen sind nicht endlich erzeugt.
M.Ziegler
470
3. Jede abzahlbare a.a. Gruppe besitzt 2” Automorphismen ([14]). Wir beweisen 1 durch Induktion uber 1 S 1. Dazu nehmen wir ein f E S. Wenn (F,f ) n ( S \(f})# 0, folgt aus 7 IF = idF und T[Gn ] ( S \V}) = 0, daS T [ G ]r l S = 0. In diesem Fall folgt also die Behauptung aus der Induktionsvoraussetzung. Sonst wahlen wir einen inneren Automorphismus 7 uber F mit f & T [ G ]und einen inneren ] \(f})= 0. Dann ist Automorphismus T iiber (F,f)rnit r [ ( ~ [ G ] , ff l)(S m [ G ]f l S = 0. 2. Ware M e.e. miiSte es fur f E M\ E einen inneren Automorphismus geben mit f aT [ M ] . 3. Sei F endlich erzeugte Untergruppe der a.a.Gr. M. Da M nicht e.e. ist, gibt es f E M\F. Wahle einen Automorphismus 7 uber F, f & ~ [ ( F , f )Natiirlich ]. ist ~ # i d M . 0 Wir haben ein einfaches Kriterium fur die Existenz partiell isomorpher Strukturen der Machtigkeit ol. Die Voraussetzung der oHomogenitat ist eigentlich nicht notig, vereinfacht aber Formulierung und Beweis. (a)+(b) folgt aus Lemma 34 ([14]).
Lemma 4.6. W sei abzahlbar und w-homogen. Dann sind aquivalent (a) Es gibt % W,% = W. (b) Es gibt % s PW,181= ol.
s
Beweis. (a)+ (b) ([14])Wir definieren eine aufsteigende Folge W = * %Jla * a E w1 von zu W isomorphen Strukturen. Wa+l existiert wegen (a). Fur Limeszahlen nehmen wir WA= U{%Jlaa < A}. Aus 2.11.4 und 2.8.1 folgt WA= W.% := u { W a a < ol}hat die gewunschten Eigenschaften, wieder wegen 2.11.4. (b)+ (a) Man kann % voraussetz,en. Sei a E %\W. 2.11.5 liefert ein wegen 2.8.2 W<<, abzahlbares %’ rnit %Jl U { a }C %’ < p %. Also ist W z %’ und W58’. 0
%sW1 5
--
-
I
I
Der nachste Satz zeigt, daS die in 4.3 beschhebene Eigenschaft a.a. Gruppen sogar noch starker als (b) von 4.6 ist.
Satz 4.7. Dann
W
sei abzahlbar und o-homogen, S eine Teilrnenge von
sind aquivalent :
W.
(a) Fur alle e,e. Unterstrukturen % C 8 uon W rnit S n %= 0 gibt es einen Autornorphismus T uon W u b e r % rnit ~ [ 8 S] = 0. (b) Jede zu S disjunkte e.e. Unterstruktur uon 2J2 ist in einem %CW enthalten rnit % = ID1, % n S = 0.
Algebraisch abgeschlossene Gruppen
471
Beweis. (b)+ (a) ?1C B seien wie in (a). Fur wahlen wir '% C 8 wie in (b). Es gibt eine Einbettung v : B + 8. Weil 8 w-homogen ist, konnen wir uIH= idv, annehmen (2.3.1). Fur das gesuchte T nehmen wir eine Fortsetzung von v (2.11.2). (a)+(b) 2l sei zu S disjunkte e.e. Unterstruktur von YJl. Wir stellen 2R. als Vereinigung einer Folge '3 = 210 C * * . 211C . . . von e.e. Unterstrukturen dar, und definieren eine Folge vo C v 1C * * von Einbettungen v, :!Xi +YJl mit v,['%,I fl S = 0. Fur vo nehmen wir die Inklusion von 2l in 1Dz. Sei vi definiert und p eine Fortsetzung auf 21i+l. (a) angewandt auf p [ai]C p [21i+l]gibt einen Automorphismus T von mit Tl,lul,l = idelvrdund S fl ~ [ [ai+,]] p = 0. Setze v i + ]= ~ p . v:= i < w } . 8 := v[%!] erfullt (b). 0
u{viI
Folgerungen 4.8. 1. Sei F e.e. Untergruppe der abzahlbaren a.a. Gruppe M und S eine endliche zu F disjunkte Teilmenge von M. Dann gibt es F C N C M, N fl S = 0, N = M. Uber keiner endlich erzeugten Gruppe gibt es also eine minimale a.a. Gruppe. 2. ([14]) Jede abzahlbare a.a. Gruppe ist partiell isomorphe Substruktur einer a.a. Gruppe der Machtigkeit w l . 3. Es gibt 2" verschiedene a.a. Gruppen der Machtigkeit w l . (3 gilt, weil es 2" abzahlbare a.a.Gr. gibt. Siehe 111, 4.2.) Bemerkung 4.9. Wir werden spater sehen, da13 das Skelett einer algebraisch abgeschlossenen Gruppe mit F und G auch F x G enthalt. Also ist, wenn M a.a. und F E Sk(M), Sk(M X F) = Sk(M). 1st M abzahlbar, kann man daher M X F isomorph in M einbetten (2.11.3). Auf diese Weise wurde in [14] 4.6 (a) fur a.a.Gr. nachgewiesen. Es gibt eine dritte Moglichkeit in einer abzahlbaren a.a. Gruppe eine isomorphe echte Untergruppe zu finden: Satz 4.10. 1. Der Zentralisator Z'(F) einer endlich erzeugten
Untergruppe F einer a.a.Gr. M hat dieselbe Machtigkeit wie M. 2. Wenn das Zentrum von F trivial ist, ist Z'(F)sPM.
Bemerkungen 4.11. 1. Wenn Z ( F ) = E, ist Z'(F) also algebraisch abgeschlossen, hat dasselbe Skelett wie M und ist, wenn M abzahlbar ist, zu M isomorph. 2. Wenn FZ E und Z ( F ) = E, ist Z'(F)S M. D a man jede endliche Gruppe in M einbetten kann, gibt es immer solche F in M (siehe 11, 1.3.3). 3. Wenn Z ( F ) # E, ist Z ( F ) nicht algebraisch abgeschlossen. Denn
472
M. Ziegler
ein f E Z ( F ) \ E ist auch in Z ’ ( F ) und die Ungleichung zwar in M , nicht aber in Z ’ ( F ) losbar. (1 beantwortet eine Frage aus (171.)
x f + f x ist
i
Wir beweisen zuerst ein Lemma Lemma 4.12. F und G C H seien e.e. Untergruppen der a.a.Gr. M. Wenn F X G C M, gibt es einen inneren Automorphismus T von M iiber G, so dab (F, T [ H ] = ) F x T[H]. Beweis. Wir werden in 111, 1.5.2 beweisen, dal3 mit F und H auch F x H in M einbettbar sind. Also betten wir vermoge p F X H uber F x G in M ein (M ist w-homogen) und wahlen T so, dal3 T~~ = idc und T [ H ]= p [ H ] . 0 Beweis von 4.10.1. Wir unterscheiden zwei Falle: (a) In M gibt es 1 MI viele Elemente rnit verschiedenem Typ uber F. d . h . E s g i b t A C M , ~ A ~ = ( M ~ , f u r a l l e a # b (a)’ZF(b)’.Nach EA 4.12 gibt es eine isomorphe Kopie F‘ von F, so daB F x F C M. Da M w-homogen ist, gibt es auch IM I viele Elemente mit verschiedenen Typ uber F‘; sei B C M, IBI = ] M I , fur alle a # b E B (u)’gF(b)‘. Wir wenden nochmal 4.12 an und finden zu jedem a E B ein f a E M rnit (f,,)’=F((a)’ und ( F , F ’ , f , )= F X ( F ’ , f a ) .Die fa, a E B sind also alle verschieden und aus Z’(F). (b) Wenn (a) nicht eintritt, gibt es nach dem Schubfachprinzip fur alle K < I M I mindestens K + Elemente vom gleichen Typ uber F. Sei also fur alle a, b E A ( a ) ’ = ,(b)’ und ( A1 = K + . a E A sei fest gewahlt und fur jedes b E A ein innerer Automorphismus T , ~iiber F rnit T , J ~ ) = b. Die f b sind alle verschieden und aus Z ’ ( F ) . Also ist fur jedes K < ( M I I Z ’ ( F ) ( ~ K + . Beweis von 4.10.2. Sei G endlich erzeugte Untergruppe von Z ’ ( F ) und G C H E S k ( M ) . Wir zeigen, daB H uber G in Z ’ ( F ) einbettbar ist. Damit ist sowohl nach 2.3.1 die w-Homogenitat von Z ’ ( F ) gezeigt, als auch S k ( Z ’ ( F ) )= Sk(M). Daraus folgt mit 2.10.1 die Behauptung. Weil Z ( F ) = E, ist aber (F, G ) = F x G ; H ist also wegen 4.12 uber G in Z ’ ( F ) einbettbar. 0 11. Auflosbare Gleichungssysteme
Ein Gleichungssystem S(E, 2 ) sei gegeben. Wir betrachten die Klasse X
Algebraisch abgeschlossene Gruppen
473
aller Gruppen G und Folgen von Parametern 2 E G, fur die S ( g , X) uber G losbar ist. Im ersten Abschnitt zeigen wir, daB man auf diese Weise gerade die Klassen der Form X = {(G,g ) G I= Z ( g ) } - fur eine Menge Z von positiven Implikationen - erhalt. Die X,die sich durch r.a. Mengen von Implikationen axiomatisieren lassen, gehoren auf diese Weise zu endlichen Gleichungssystemen, wie wir im zweiten Abschnitt nachweisen. Das hat zur Folge, daB man r.a. Gleichungssysteme in vieler Hinsicht wie endliche Gleichungssysteme behandeln kann. So ist ein r.a. System von Implikationen in einer algebraisch abgeschlossenen Gruppe genau dann erfullbar, wenn es in einer Obergruppe erfullbar ist (2.7). Der Begriff der “Aufzahlungsreduzierbarkeit” gestattet es, die Ergebnisse des zweiten Abschnitts zu relativieren. Wir geben im dritten Kapitel zunachst eine rekursionstheoretische (3.4) dann eine gruppentheoretische Charakterisierung der Aufzahlungsreduzierbarkeit (3.10). 3.10 ist eine “Relativierung” des Satzes von Higman, der die Gruppen mit r.a. Wortproblem als die e.e. Untergruppen endlich prasentierter Gruppen beschreibt: Die Gruppen, deren Wortproblem auf das Wortproblem von G aufzahlungsreduzierbar ist, sind gerade die e.e. Untergruppen endlich “uber G ” prasentierter Gruppen. Dann untersuchen wir die Auflosbarkeit von endlichen Gleichungssystemen, bei denen wir einen Teil der Koeffizienten “fest” in einer Gruppe G interpretieren. Wie irn zweiten Abschnitt entsprechen dann endliche Gleichungssysteme Mengen von positiven Implikationen, die auf das Wortproblem von G aufzahlungsreduzierbar sind (z. B. 3.12).
I
1. Gleichungssysteme und positive Implikationen Wir betrachten zwei Beispiele
I
Beispiel 1.1. S(&, 6,x ) = {xuix-’ = bi i = 0 , . . . , n}. Wenn (z, 6 E G ist nach I, 1.10 S genau dann uber G losbar, wenn ((z)’z(6)’. Das ist aquivalent mit der Gultigkeit der folgenden Menge von Implikationen in (G; (z, 6)
~ ( i i6, )= { ~ ( i i=) e + ~
( 6= e) 1 w Wort in xo,. . .,x,}
u { ~ ( 6= e) -+ ~ ( i i =) e 1 w Wort in xo, . . . ,x,}.
Beispiel 1.2. Wir betrachten umgekehrt eine Implikation u = e -+ b = e. Das nachste Lemma gibt eine Gleichung an, die gen.au dann in einer Obergruppe losbar ist, wenn die Implikation gilt.
M. Ziegler
474
Lemma ([14]). Die Gleichung ~ ~ a, ( bxo, , xl)= b G [ [ a ,xo],x l ] ist genau dann iiber G losbar (a, b E G ) , wenn G I= a = e + 6 e. ( I n [ 141 wird die einfachere Gleichung b = xoax;lxlax i1benutzt.) Beweis. Wenn die Gleichung in einer Obergruppe losbar ist, folgt naturlich b = e aus a = e. Sei umgekehrt b = e oder a # e. 1. Fall a = b. Sei G ' = G x ( a ' ) fur ein a' rnit der selben Ordnung wie a. In einer geeigneten Obergruppe wahlen wir rnit 1.10 ein go rnit goa-'g;' = a' und ein g , mit g,(aa')g;' = a'. Dann ist [a,go]= aa' und [[a,go], g l ] = aa'(a')-' = a = b. 2. Fall a # b. Sei H beliebig und g o € H\E. Weil a # b und also a # e, haben b-'[a,go] und [a,go] in G * H unendliche Ordnung. Wir finden daher in einer Obergruppe von G * H ein g l mit gl([a,go])g;' = b-'[a, g o ] . Dann ist [[a,go],g l ] = [a,go](b-'[a,go])-' = b. 17 Folgerungen 1.3. 1. Jedes endliche System von Gleichungen und Ungleichungen mit Koeffizienten aus einer a.a. Gruppe M, das uber M liisbar ist, ist in M liisbar ([21]). 2. Algebraisch abgeschlossene Gruppen sind existentiell abgeschlossen ; d. h. M ist 1-elementare Unterstruktur jeder Obergruppe. 3. Endliche Gruppen sind in alle a.a.Gr. einbettbar ([26]). 4. Lokalendliche abzahlbare Gruppen sind in alle a.a. Gr. einbetrbar. 5 . Algebraisch abgeschlossene Gruppen sind einfach ([21]). Beweis. 1. Sei
2€ M
und
~ ( ix') ,= { W , (g, x')
e
I i < n } u { 7 ui(8, x')= e I i < m )
ein System von Gleichungen und Ungleichungen, das in einer fur Obergruppe H von M losbar ist. a sei aus M \E und H I= S ( i , geeignete h'E H. Nach dem Lemma gibt es fur jedes i < rn Elemente in einer Obergruppe F von H mit FI= bl(u, Ui(g,&),$). ES gibt also auch in M eine Losung 6',f: des Systems von Gleichungen
a)
5
{ W , (i, 2)
e
I i < n } u {Ll(a, ui(g, i ) ,y i ) I i < m 1.
Daraus folgt, daB Ui (f, @) # e fur alle i < m. 2. Sei M C G, ~ ( y eine ) Existenzformel, 2 E M, G t= ~ ( g )Jede . Existenzformel ist logisch aquivalent zu einer Formel der Form 3x'v { A Si (9,x') i < n } , wobei die Si endliche Mengen von Gleichungen und Ungleichungen sind. Eines der G1. & Ungl. systeme Si(i,x') ist also in G und daher wegen 1 in M losbar. ~ ( ggilt ) also auch in M.
I
475
A lgebraisch abgesch lossene Gruppen
If
3. F sei eine endliche Gruppe. Das endliche System S((x, E F ) )= {-I x, i x, f, g E F, f f g } U {x,x, = x,, g E F } von Gleichungen und Ungleichungen ist in F und also auch in M x F (uber M ) Iosbar. F ) eine Losung in M ist, ist hfg, eine isomorphe Wenn (g, Einbettung von F in M. 4. Wenn G lokalendlich ist, ist nach 3 Sk(G) C Sk(M) fur alle a.a. M. 1st G abzahlbar, folgt G < M aus I , 2.11.3. 5. Wenn N ein nichttrivialer Normalteiler der a.a.Gr. M ist, finden wir a E M \ N und 6 E N \ E. Nach dem Lemma gibt es in einer Obergruppe - und also in M selbst - eine Losung g der Gleichung ~ ~ (b,4i , ) .Dann gilt aber in M / N Ll(aN,bN, g N ) . Daher muS, weil bN = eN, aN = eN sein - in Widerspruch zu a & N. 0
If,
I
If€
Wir wollen zeigen, daS sich fur jedes Gleichungssystem S die Klasse der Gruppen uber denen S losbar durch eine Menge von positiven Implikationen axiomatisieren laat. Wir beniitzen dabei nur - im folgenden Lemma -, daS T" eine universelle Horntheorie ist, d. h. daB Unterstrukturen direkter Produkte von Gruppen wieder Gruppen sind.
Lemma 1.4. Sei Z eine Menge von Zmplikationen und cp,,, . . . ,cpn eine Folge von Gleichungen. Wenn Z t" cp,, v . . . v cp,, ist fur ein i S n Z t Gcp,. Beweis. Sonst gibt es fur alle i S n eine Gruppe G, und Interpretationen der Konstanten aus S g, E G mit (G,,g,)+ cp, und (G,,g,)+Z. Dann ist aber x:=,(G,,g,)l= Z und X:=o(G,, g,)F qoo v . v ~ n . 0 *
a
Folgerung 1.5. Z sei eine Menge von Implikationen, U eine Menge von Ungleichungen, +,, . .., q I , ... ,q, ( n ,m E w ) Gleichungen. Wenn IU U tG A * * * A -+ q Iv * * . v cp,, gibt es ein y, E U mit I U{x}tc+, I \ . . * + oder es ist I tGICIlA . * . A 4" + q i fur ein i s m.
+, +,,
+,,
Satz 1.6. Z u jedem System von Gleichungen S(2,Z) gi6t es eine Menge I ( E ) von positiuen Implikationen, sodajl fur alle E G S ( g , x') genau
dann uber G losbar ist, wenn G I= I @ ) .
476
M.Ziegler
(Wenn S ein System von Gleichungen und Ungleichungen ist, leistet eine Menge Z von Implikationen das gewunschte.)
I
Bemerkung 1.7. Man kann fur Z die Menge {cp cp positive Implikation aus L(c'), S(E, 6) t G q } nehmen. Beweis. Z ( E ) sei wie in 1.7. S ( i , 2 ) sei uber G nicht losbar. Dann ist Diag(G, 6) U S ( i , d) nicht erfullbar, vgl. I, 1.2. Es gibt also eine endliche Menge J C Diag(G, g), fur die J U S ( i , d) nicht erfullbar ist. Sei J = { 1cp,, . . .-I cp,, q I , . . $,,,} fur Gleichungen cpi, $;. Es ist S(g, 6)kG A . . A $,,, + 'pi v . . . v cpn. Nach 1.5 ist S ( & d) tG A . A $,, + cp, fur ein j S n. Die positive A . . . A $,, + cp, gehort zu I. Also G # Z ( g ) . (Wenn S Implikation A . . . A $,., += auch Ungleichungen enthalt, kann Z auch enthalten.) I7
-
Der nachste Satz kehrt (1.6) um:
Z(c) %on positiven Zmplikationen gibt es ein Gleichungssystem S(Z,i ) ,so dajl fur alle 2 E G S ( g , 2 ) genau dann uber G losbar ist, wenn G I= Z(g).
Satz 1.8. Z u jeder Menge
Der Satz folgt sofort aus dem nachsten Lemma
Lemma 1.9. 1 ([14]). Fiir jedes n kann man eine Gleichung L , ( C ~ , . . . ,c,, i ) angeben, so dajl fur alle g' E G L, (g, 2 ) genau dann iiber G losbar ist, wenn G I= g l = e A * * A g, e 4go = e. 2. 9 sei eine Familie von Gleichungssystemen mit Koeffizienten aus G. Wenn keine Variable in zwei Gleichungssystemen von 9 uorkornmt, ist US genau dann iiber G losbar, wenn jedes System aus 9 iiber G losbar ist. Beweis. 1. Fur n = 1 ist 1 die Behauptung von 1.2. Sei L , ( C ~ , .. . , c,, 2 ) aquivalent zur Gleichung t ( c o , .. . ,c,, i )= e. Dann nimmt man fur L , , + ~ ( C ~., . . ,~ , + ~ ,92) ,d i e Gleichung Ll(t(co,.. . c,, i ) , c n + ]9). , Die so rekursiv konstruierten L, haben die geforderte Eigenschaft, weil ci = e + co e dasselbe bedeutet wie ~,~e+(c,-~~e+(...(c~=e+~~=e)...)). (Wenn L ~ ( C c,, ~ , 2 ) wie in 1.2 die Gleichung t ( c l , 2 ) A
co ist, kann man so die Darstellung von ([14]) fur L, bekommen: coA t ( c I ,21) * * t(c,, in), der man die gewiinschte Eigenschaft leicht ansieht. Unsere kompliziertere Darstellung zeigt, wie wenig man die
Algebraisch abgeschlossene Gruppen
477
Gruppeneigenschaften braucht, um das Lemma aus 1.2 zu schlieaen.) 2. Sei 9 = { S , ( i ,i , )j E J } . Wenn dann S, in F, 3 G gelost wird, ist US in * G { F , j E J } losbar. 0
1
1
2. Endliche Gleichungssysteme und rekursiv aufzahlbare Mengen von positiven Implikationen Wir haben gesehen, daR Gleichungssysteme, die in Obergruppen gelost werden sollen, Mengen von positiven Implikationen entsprechen. In diesem Abschnitt wird gezeigt, daB auf diese Weise endliche Gleichungssysteme gerade zu rekursiv aufzahlbaren Mengen von positiven Implikationen gehoren. Satz 2.1. Jedem endlichen Gleichungssystem S(c', i ) kann man eine rekursiv aufzahlbare Menge von positiven Implikationen Z ( E ) zuordnen, so dalJ fiir alle 2 E G S ( & i ) genau dann uber G losbar ist, wenn G t Z(i).
I
Beweis. {cp E L,(E) FGcp} ist r.a. Also ist auch Z(c) = {cp (c') tC A S(E, d)+ cp (Z), cp positive Implikation} rekursiv aufzahlbar. 0
1
Bemerkung 2.2. Die Zuordnung S H Z ist effektiv: der Glidelnummer von S wird rekursiv ein 2Y-Index von Z zugeordnet (genauer: ein 2:Index der Menge der Godelnummern der Elemente von I ) . Beim Beweis von Satz 1.8 haben wir jeder positiven Implikation aus Z ein eigenes Gleichungssystem zugeordnet. Wenn Z unendlich ist, bekommen wir also so ein Gleichungssystem mit unendlich vielen Variablen. U m zunachst diese Schwierigkeit bei der Umkehrung von 2.1 zu uberwinden, zitieren wir aus ([S]): Lemma 2.3. Es gibt eine Folge von Wortern Ni (x, y ) , so dap fiir alle Folgen (gl i < w ) aus einer Gruppe G das Gleichungssystern {Ni(x, y ) g, i < w } uber G losbar ist.
1
I
Bemerkung. In ([20]) ist
N (x, Y ) = [[X, Y 2i+'1, X I .
Folgerungen 2.4. 1 ([S]). Jede abziihlbare Gruppe ist in eine von zwei Elementen erzeugte Gruppe einbetrbar. 2. Jede endlich erzeugte Untergruppe einer a.a.Gr. ist in einer von zwei Elementen erzeugten Untergruppe enthalten. 3. Zu jedem (endlichen, r.a., rekusiven) Gleichungssystem S(2,X') gibt
M. Ziegler
478
es ein (endliches, r.a., rekursives) Gleichungssystem S'(E, xl, x2) - mil nur zwei Variablen - das genau dann in einer Obergruppe losbar ist, wenn S in einer Obergruppe losbar ist. Beweis. 2. Wenn g' E M a.a., ist (g') C ( h l ,h z )C M ; wobei h l ,h2 eine Losung von {N, (xl, x2) = gi i c n } ist. 3. Setze einfach S'(E, xl, x2) = S(Z, No(xl, x2), . . . ,N, (xl, XZ), . . .). 0
1
Bevor wir in 2.6 Satz 2.1 umkehren, behandlen wir noch einen Spezialfall: Fur jedes n ist
{ w ( E ) = ~w(d)=.el + w Wort
in xO,..., x,}
eine r.a. Menge von positiven Implikationen, die auf a', 6 E G genau dann zutrifft, wenn (6)' homomorphes Bild von (2)' ist. Das nachste Lemma ordnet also dieser Menge ein endliches Gleichungssystem zu.
Lemma 2.5. Man kann fur jedes n ein endliches Gleichungssystem H. (E, d, x, y ) angeben, so dap fur alle 2,6 E G (bo,.. . ., b,,)' genau dann homomorphes Bild von ( a o , .. .,a,)' ist, wenn H,(& 6 x, y ) uber G losbar ist.
.. .
a',6 E G und ( a & , a k)' eine isomorphe Kopie von ( a o , .. . , a , ) ' . (bo,. ,b,,).' ist genau dann homomorphes Bild von ( a o , . ,a,,)', wenn in ( a ; ,. . . ,ab)x G ((a&bo),. , ( a ; , ha))'und
Beweis. Sei
..
..
..
(ao,.. . ,a,,)' isomorph sind. In diesem Fall finden wir in einer
geeigneten Obergruppe H von (a;, . . .,a 3 X G Elemente f , g mit guog-' = a;, .. ., g a d - = a und faof-' = gaog-'bo, .. .,f ~ , , f - '= gang-lbn. Das Gleichungssystem H , (ti, 6,x, y ) := {xaix-' = yaiy-'bi i s n } U (ya,y-'b, bjyaiy-' i, j s n } ist also in einer Obergruppe von G losbar, wenn (bo,.. .,b,,)' homomorphes Bild von (ao,...,a,,)' ist. Wenn andererseits f , g eine Losung von H, (& 6,X, y ) , sind, gilt fur jedes Wort W(x0,.* xn),
I
I
-
f W ( 2 ) f - l = WCf~of-', -..,f a n f - ' )
. .,ga,g--'b,,)
= W(gaog-'bo,. = = AUS
~ ( i= i )e folgt
W(gaog-', .. .,gag-') W(bo,. ..,b,)
gw(ii)g-'w(6). also ~ ( 6= e. ) KI
Satz 2.6. Z u jeder rekursiv aufzahlbaren Menge I ( E ) von positiven Implikationen rapt sich ein endliches Gleichungssystem S(E, x') angeben,
A lgebraisch abgesch lossene Gruppen
so dap fur alle G I= Z ( g ) .
g E G S ( g , 2 ) genau
419
dann uber G losbar ist, wenn
Beweis. Mit Hilfe von 1.9 verschaffen wir uns zunachst ein r.a. Gleichungssystem S’(E,E), so daB S’(g,Z) genau dann uber G losbar ist, wenn G I= Z(g). 2.4.3 zeigt, daB wir annehmen konnen, daB S’ nur zwei Variable hat: S’(E, x I ,x2). H sei die Gruppe, die von den Relationen S’(Cr, bl, b2) in den Erzeugenden cf, b l ,b2 prasentiert wird. Da fur ein Wort W ( j ,x I ,x2) W(ii,bl, b2)= e gdw. S’(ii, bl, b2)bG W ( i ,bl, b2)= e und S’ rekursiv aufzahlbar ist, ist das Wortproblem von H rekursiv aufzahlbar. Nach dem Satz von Higman ([9]) IaBt sich H in eine endlich prasentierte Gruppe F einbetten. cf, bl, b2,cf‘ seien die Erzeugenden von F und R ( 6 , bl, b2,ti’) das definierende Gleichungssystem. Sei n die Lange von E. Das gesuchte Gleichungssystem ist S(E, XI, Xz, 9, Z I , Z Z , ii, u1, u2) := := R ( ~ , z I~ , 2 C), U H n + ~ ( j Z, I , 2.2; C X I , ~ 2 U;I , VZ).
Wenn in einer Obergruppe M von G M I = S(g, h l , h z , g’,h ;,h i , f,f:,fk) fur h . . . ,f ; E M, ist (g‘, h I,h i , f)’homomorphes Bild von ( F ;Z, b,, bz, Z’) und (2, hl, hz)’ homomorphes Bild von (g’,h I, hi)’. (6 h,, hz)’ ist also homomorphes Bild von (H; Z, b l ,b2) und daher gelten die definierenden Relationen von H auch in (g, h l , h2)’.h l , h2 sind also eine Losung von S’(g, x l , x2), das bedeutet G I= Z ( g ) . Sei umgekehrt G I= Z ( g ) und daher MI= S’(& h l , hz) fur h l , h2 aus einer Obergruppe M von G. (g, h , , hZ)’ist also homomorphes Bild von (H; Z, b,, bz). Also gibt es in einer Obergruppe von M x F eine Losung bl, bz; i,hl, h ~U I, ,UZ). f L f : von h,, hz, Z, b,, bz,Z‘, f : , f ; ist dann eine Losung von S ( 6 , xl,. . .,u2). 17 Als erste Folgerung aus 2.6 bekommen wir, daB die “bzgl. endlicher Gleichungssysteme” a.a. Gruppen auch bzgl. “r.a. Gleichungssysteme” algebraisch abgeschlossen sind. Wir haben sogar Satz 2.7. I ( & 2) sei ein rekursiv aufzahlbares System uon Zmplikationen
mit Koeffizienten 2 aus der algebraisch abgeschlossenen Gruppe M. Wenn Z ( # , i ) in einer Obergruppe (- uber M -) erfullbar ist, ist Z(g,2) schon in M erfullbar.
Beweis. Wir nehmen go# e an und verfahren wie beim Beweis von 1.3.1. Wir ersetzen jede negative Implikation
M. Ziegler
480
W l ( g ,2 ) = e
A
. . . A W n ( &2 ) = e +
aus I durch die positive Implikation W,(g,i )= e
A *.
*
A
W . (g, 2 )
e -+go = e.
So k o n n e n wir annehmen, dal3 I nur aus positiven Implikationen besteht; weiter kommen wir in I wegen 2.3 mit zwei Variablen xl,xz aus. Mit 2.6 wahlen wir ein endliches Gleichungssystem S(g, c , , c2,j ) , so daB fur alle g, h ,, h , E H S(g, h I , h2,j ) genau dann uber H losbar ist, wenn H I= I ( g , h ,,h,). Wenn also h , , h z aus einer Obergruppe H von M I erfullen, ist S(g, h , , h,, 7) in einer Obergruppe von H losbar. Das Gleichungssystem S(g, x,, x,, j ) ist daher uber M und somit in M losbar. Wenn h I, h i , f’ eine Losung in M ist, erfullt h I, h ; das System Z(g, x,, x,). 0 Der nachste Satz verallgemeinert 1.3.3: endliche Gruppen lassen sich durch ein endliches System von Gleichnngen und Ungleichungen beschreiben. Die Gruppen, die sich vollstandig durch eine r.a. Menge von Gleichungen und Ungleichungen beschreiben lassen, deren Wortproblem also - wie das Komplement des Wortproblems - r.a. ist, sind gerade die Gruppen mit rekursiven (- “losbarem” -) Wortproblem. Wir werden spater sehen, daB Gruppen mit nichtlosbarem Wortproblem sich sogar in folgendem Sinn nicht durch ein r.a. System von Gleichungen und Ungleichungen beschreiben lassen: Es gibt fur jede solche Gruppe eine a.a.Gr., in die sie nicht einbettbar ist (111, 1.8.2). Satz 2.8 ([22]). Endlich erzeugte Gruppen m it losbarem Wortproblem
sind in jede algebraisch abgeschlossene Gruppe einbettbar.
Beweis. G = (g) habe losbares Wortproblem, M sei algebraisch abgeschlossen. Die beiden Systeme von Gleichungen und Ungleichungen { W ( i )= e W Wort mit W ( g )= e } , {i W ( 6 )= e W ( g )# e } sind rekursiv. Ihre Vereinigung wird in M x G durch g’ gelost. Nach 2.7 gibt es eine Losung g“ in M. Es ist dann (g’)’= (G, g). 0
1
I
Bemerkung 2.9 ([22, 271). Wenn man den Beweis von 2.8 bis zum Beweis von 2.7 zuruckverfolgt, erkennt man, daB man fur G (die Gruppe mit losbarem Wortproblem), ein endliches System T ( i ,y’) von Gleichungen und Ungleichungen findet, das losbar ist und fur dessen Losungen (H,ii, 6) immer (ii)’ = (G,g) gilt. Solche Gruppen heiBen
481
Algebraisch abgeschlossene Gruppen
absolut endlich (a.e.) prasentiert. Endlich erzeugte Gruppen mit Iosbarem Wortproblem sind also a.e. prasentierbar. Umgekehrt has jede a.e. prasentierte Gruppe losbares Wortproblem. Denn W ( 2 )= e gdw. T(E,6 )tGW(E)A e und W ( 2 )# e gdw. T(Z,6 )kG i W ( E )= e. Also ist sowohl das Wortproblem, als auch das Komplement des Wortproblems von G r.a. Das Wortproblem ist rekursiv. Folgerungen 2.10. 1. Abzahlbare Gruppen mit lokal losbarem Wortproblem sind in jede a.a. Gruppe einbettbar. (“lokal losbares WP” bedeutet: jede e.e. Untergruppe hat losbares W P ) . 2. Abziihlbare abelsche Gruppen sind in jede a.a. Gruppe einbettbar. (1 folgt aus 2.8 und I, 2.11.3. Abelsche Gruppen haben lokal losbares Wortproblem, denn jede e.e. abelsche Gruppe ist direkte Summe von endlich vielen zyklischen Gruppen.) Es gibt natiirlich nach I, 1.3 a.a. Gruppen, die e.e. Untergruppen mit nicht-losbaren Wortproblemen enthalten. Daruber hinaus gilt: Satz 2.11 ([14]). Jede a.a. Gruppe enthiihlt eine e.e. Untergruppe mit
nicht-losbarem Wortproblem.
Beweis. A und B seien zwei disjunkte r.a. Teilmengen von w, die nicht durch eine rekursive Menge zu trennen sind ([24]). Das GI. & UngLsystem S(x, y ) = { N ,(x, y ) t e i E A } U {-I Ni (x, y ) f e i E B } ist - weil A r l B = 0 - losbar nach 2.3 und rekursiv aufzahlbar. Wenn a, b eine Losung von S in der a.a.Gr. M ist, hat (a, b ) nicht-losbares Wortproblem, denn {i Ni(a, b ) = e } trennt A und B. 0
I
I
1
Bemerkung 2.12 ([lS]). Wenn man den Beweis von 2.11 zuruckverfolgt, erhalt man ein endliches losbares System von Gleichungen und Ungleichungen, dessen Losungen immer eine Gruppe mit nichtlosbarem Wortproblem erzeugen. Satz 2.7 gestattet uns eine Verscharfung von I, 4.10.2. Definition 2.13. F ist r.a. erzeugte Untergruppe von M, wenn Elemente g E M und eine r.a. Menge W von Wortern existieren, sodaB
I w E w.
F =({W(t)
Satz 2.14. F sei eine r.a. erzeugte Untergruppe der a.a. Gruppe M. Wenn
F triviales Zentrum hat, ist M = p Z ’ ( F ) .
M. Ziegler
482
Beweis. Wie mann aus dem Beweis von I, 4.10 sieht, genugt es, folgende Verscharfung von I, 4.12 zu zeigen: “Sind G C H endlich erzeugte Untergruppen von M und F x G C M, so gibt es einen inneren Automorphismus 7, uber G mit (F, T~ [HI) = F x T,,[HI”. Beweis. Weil Z ( F ) = E, brauchen wir a bloB so zu wahlen, daB 7 . [ H I c Z ’ ( F ) . Wenn F C (d’)C M und F = ({ W(g’) W E W } )fur eine r.a. Menge von Wortern W, H = ( K ) und G = ( d) , muB a also gerade das r.a. System
I
1
g, i < n ) u { w ( i ’ ) x h , x - ’ xhix-’w(i’)
{xg,x-’
1 w E w i <m )
losen. Da wir aber ein 7 mit den gewunschten Eigenschaften in einer Obergruppe von M finden, folgt die Behauptung aus 2.7. 0 Damit beenden wir zunachst die Reihe der Folgerungen aus 2.7. Wir geben noch eine Verbesserung von 2.10.2 an: Satz 2.15 (vgl. [lo]). Jede abzuhlbare abefsche Gruppe, die nicht endlich
erzeugt ist, ist in jede abzahlbare a.a. Gruppe als maximal abelsche Gruppe einbettbar.
Beweis. G sei abzahlbare abelsche Gruppe, nicht e.e., M a.a. und abzahlbar. (g, i < w ) sei eine Folge von Erzeugenden von G rnit g,,, 6Z (g,,, . . . ,g,); ( m , i < w ) erne Aufzahlung von M. Wir konstruieren eine Folge (a, i < w ) aus M, so daB ( a o , . ., a , ) ’ s ( g o , . .., g , ) ’ fur alle i . a,, wahlen wir beliebig rnit ( a o )= (go). Seien ao,. . . , a, bereits konstruiert. ( g o , . . . ,g,,,) ist in M einbettbar, also gibt es wegen der w-Homogenitat von M ein a E M, so daB ( a o , . . a,, a)’= ( g o , . . . g,+J’. 1. Fall m, E (an,. . ., a,): Setze a,,, := a. 2. Fall m, 6Z (an,. . . ,a , ) : Das System xaox-’ = an,. . .,xa,x-’ at, -I xax-’m, m,xux-’ besitzt dann eine Losung in einer Obergruppe von M .,,,)(ao,.. . , a , ,a ) . Wenn f eine Losung in M ist, setzen wir a,+l:= faf-’. Es ist dann (go,. . . ,g,+,>’=( a o , .. . ,at+,)’und a r + l m , #m,a,+,. Das Erzeugnis (6)der a,, i < w ist die gesuchte maximal abelsche Untergruppe von M. Denn, wenn m, & (G), so ist m a + ’# a,+’m,. 0
I I
1
3. Aufziihlungsreduzierbarkeit Wir untersuchen algebraisch abgeschlossene Gruppen als w homogene Strukturen anhand ihrer Skelette. Wir haben gesehen, daS alle endlich erzeugten Gruppen mit losbarem Wortproblem im Skelett
Algebraisch abgeschlossene Gruppen
483
jeder a.a. Gruppe vorkommen. Das beim Beweis dieses Satzes letztlich verwendete Gleichungssystem, das eine Losung in der a.a. Gruppe haben muBte, hatte keine Koeffizienten. Wenn wir Koeffizienten aus einer endlich erzeugten Untergruppe zulassen, k o n n e n wir die Existenz von Losungen “komplizierterer” Systeme von Implikationen im Skelett algebraisch abgeschlossener Gruppen - die die Koeffizientengruppe enthalten - erschlieBen. Dieser Abschnitt prazisiert den EinfluB vorgeschriebener Koeffizienten auf die “Ausdruckskraft” endlicher Gleichungssysteme. Wie im letzten Abschnitt endliche Gleichungssysteme r.a. Mengen von Implikationen entsprochen haben, entsprechen n u n endliche Gleichungssysteme mit fester Interpretation (eines Teils) der Koeffizienten Mengen von Implikationen, die aufzahlungsreduzierbar auf das Wortproblem der “Koeffizientengruppe sind. Definition 3.1 ([24]).A, B seien Teilmengen von w . A ist aufzahlungsreduzierbar auf B - A S e B -, wenn es ein z E w gibt, so daB fur alle n E w, n E A gdw. 3i ((n,i ) E W,A D,C B). Wir schreiben dann auch z : A s ~ Boder A = W : [ B ] . Ein erster Zusammenhang mit Sraist: A SraB gdw. A S eB v ( w \ B ) . “ i : A S , B ” ist eine 11‘:-Relation.
Beispiel 3.2. Wenn T eine Menge von Aussagen einer Sprache LL.ist, ist {cp E LLW T t- cp} S e T. Denn T t- cp gdw. 3i (D, t cp A D, C T ) .
I
Beispiel 3.3. U sei r.a. erzeugte Untergruppe der e.e. Gruppe G. Dann ist W(G U ) s eW(G). Denn wenn G = (g) und (0.E.) U = (W(g) W E W ) fur eine r.a. Menge von Wortern, ist W(g) EU gdw. ex. Worter w,, . . . , wk E W und W ( x l , .. . ,x k ) mit
1
I
w(g)= W(w,(g), ..
.> wk
(2)).
Diese Gleichung 1aBt sich aber durch W-’W(W,, . . . , wk)(g) E GI(G, g) ausdrucken. Im Gegensatz zu Craist Se transitiv: Aus A S eB, B SeC folgt A S e C. (Durch A =< B : e A S eB, B S e B , erhalten wir also eine Aquivalenzrelation.) Daraus folgt (siehe 3.1) daB A S e B , B C 3 A Sra C. Diese Eigenschaft charakterisiert S e:
(Ohne Beweis.)
484
M. Ziegler
Lemma 3.5. Fiir B C w sind aquivalent : (a) Fiir alle A C w, A s r a Bj A s e B , (b) w \ B S , B . Bemerkungen 3.6. 1 . Es gibt B mit w \ B &B, z. B. wenn B r.a. aber nicht rekusiv ist (siehe auch 111, 2.5.2). 2. w \ B S e B gilt zum Beispiel, wenn B C w x w Funktion ist: (x,y)Ew\B ~ 3 y ( ( 3 u D , = ( ( x , u ) } ~ u # ~ ) ~ D , C B )
Beweis. (a) -+(b) ist klar, weil w \ B SraB. (b)+ (a) Wenn w \ B s c B , folgt aus A Sra B A S , B v ( w \ B ) S , B v B =,,,B. Also ist A S e B .
0
Urn die Gruppen zu charakterisieren, deren Wortprobleme aufeinander aufzahlungsreduzierbar sind, verallgemeinern wir in der folgenden Definition den Begriff “endlich prasentiert”. Definition 3.7. G C F seien endlich erzeugte Gruppen. G werde durch g, F durch g, f erzeugt. F ist endlich prasentiert uber G, wenn es eine endliche Menge S(g,f) von Gleichungen gibt, so daB F durch GI(G, 2) U S(g,f) prasentiert ist. Beispiel 3.8. Wenn H endlich prasentiert ist, ist G * H endlich uber G prasentiert. Denn wenn H durch S ( 6 ) prasentiert wird, wird G * H durch GI(G, 2) U S ( $ ) prasentiert. Lemma 3.9. 1. Wenn F uber G endlich prasentiert ist, ist W ( F )S~ W ( G ) . 2. Wenn F C G endlich erzeugt’sind, ist W ( F )S,,, W ( G )und also erst recht W ( F )seW(G). Beweis. 1 . Es ist W ( x2) = e gdw. Gl(G, 2) kG A S ( g , f)-. W ( xg ) = e. Die Behauptung folgt nun aus 3.2. 2. F sei von f = erzeugt; dann ist W ( f )= e gdw. W ( o ) ( g ) EGl(G,g).
o(2)
Satz 3.10. F und G seien endlich erzeugte Gruppen. Dann ist W ( F )seW ( G )gdw. sich F in eine uber G endlich prasentierte Gruppe einbetten lapt. Wenn G = E, so sind die endlich iiber G prasentierten Gruppen gerade die endlich prasentierten Gruppen und A S eW ( G )gdw. A r.a. ist. So erhalt man fur G = E aus 3.10 den Satz von Higman ([9]).
A lgebraisch abgesch lossene Gruppen
485
Beispiel. G ‘ sei isomorphe Kopie von G = (2). Dann ist in G * G‘, W(2, 2’) = e gdw. Gl(G, g) U Gl(G’, g’) t W(& g’) A e. Gl(G, g) und Gl(G’, 2’) sind rekursiv isomorph, es folgt also W(G * G’) S eW(G). G * G‘ mu13 daher in eine endlich uber G prasentierte Gruppe einbettbar sein. Tatsachlich ist G * G’ in G * Z einbettbar. Beweis von 3.10. Die eine Richtung des Satzes folgt aus 3.9. Fur die andere Richtung nehmen wir z : W ( F ) s . W(G) an. F werde von f, G von 2 erzeugt. Wir haben also fur alle i V , (f)= e gdw. 3u ((i, u ) E W, A D, C W(G, 2)). Das bedeutet, dal3 die r.a. Menge von positiven Implikationen
I(& f):= { A { V,(g)
I
e j E
1
0,) + Vi (f)A e ( i , u ) E W, }}
in G X F gilt. Nach 2.6 findet man fur I ein endliches Gleichungssystem S ( g , f X), das genau dann in einer Obergruppe losbar ist, wenn I gilt. Weil S(g, y, i )uber G losbar ist, prasentiert Gl(G, 2) U S ( g , y, i ) eine Obergruppe (g, f ,R ) von G. S ist so gewahlt, dal3 (2, f ’ ) != I ( & Daraus folgt W((f)’) C W((f)l). Weil andererseits Gl(G, g) U S ( i , f 2 ) in einer Obergruppe von G x F losbar ist, ist (F,f)homomorphes Bild von (f)‘. Daraus folgt W((f)’)C W((f)’). Das ergibt zusammen W((f)’) = W((f)’), also (f)r = (f)’. F ist also in die endlich uber G prasentierte Gruppe (g,f,R) einbettbar. 0
p).
Um auch die Ungleichungen der “Koeffizientengruppe” zu behandeln, brauchen wir noch den folgenden Begriff:
Definition 3.11. A ist 1-aufzahlungsreduzierbar auf B - A s L B -, wenn es ein z E w gibt, so daB fur alle n E w, n E A gdw. 3j, i ( ( m , j ,i ) E W, A DiC B A Di C w \B).Wir schreiben dann z : A G i B oder A = WI[B]; das ist ein @-Relation zwischen A, B und z. Der nachste Satz relativiert 2.7.
X) bzw. J(f X) seien Systeme uon posiriuen bzw. negativen Zmplikationen m it Koeffizienten aus der algebraisch abgeschlossenen Gruppe M. Fur G E Sk(M) sei Z seW(G) und J sb W(G). Wenn dann Z(f 2 ) U J(f i ) uber M erfiillbar is?: ist Z(f i )U J(f X) auch in M erfullbar.
Satz 3.12. Z ( f
M. Ziegler
486
1
{ A {Vi(g) L e i E D,}
A
I
A { i V,(g) = e j E D!,.}+ cp
l(7,u, u ' ) E Wz.}.
K ist ein r.a. System von Implikationen, denn, wenn q l , .. . , cpn Gleichungen sind, bzw. $* eine positive bzw. negative Implikation, so sind cp, A * * * A ( p n + $, und cpl A A cpn-l A i cp. + t+bZ
-
wieder zu Implikationen aquivalent. Offenbar ist K - in allen Gruppen, die G enthalten - mit Z U J aquivalent. Wenn also I U J iiber M erfiillbar ist, ist K iiber M erfiillbar. Nach 2.7 hat K eine Losung in M, die also auch eine Losung von Z U J ist. 0 Wenn man den Beweis von 3.12 zuriickverfolgt, erhalt man eine relativierte Version von 2.6:
Lemma 3.13. Z ( g , E ) bzw. J ( g , E ) seien Mengen uon positiuen bzw. negativen Zmplikationen und Z SeW ( G ) ,J W ( G ) .Dann gibt es ein endliches System S(6, K, i ) ,sodap fur alle h' aus einer Obergruppe H uon G S(g, h, i )genau dann uber losbar ist, wenn H t= I ( g , h') U J ( g , Z). G# E oder J = 0, geniigt fur S ein Gleichungssystem, sonst nur eine Ungl. in S. Es genugen zwei Variable.)
==:
Folgerung 3.14. U sei Untergruppe der endlich erzeugten Gruppe G. G Untergruppe der a.a. Gruppe M, F au? dem Skelett uon M, W ( G U )S , E Dann gibt es fur jedes f E M \ U ein h E M mit Thlu = idu und Th cf) # f (nach 11, 3.3 ist W ( G U )6,W ( G )f u r endlich erzeugtes U,in diesem Fall folgt die Behauptung aber auch aus I, 4.3).
1
I
Beweis. Solch ein innerer Automorphismus existiert in einer Obergruppe von M (I, 4.4). D a wir aber die gewiinschte Eigenschaft von h durch folgendes G1. & Unglsystem ausdrucken konnen { x v , ( ~ ) x - ~ G~ , ( g ) )W(G i ~
I
1U)}U{~X~X-~L~},
das m -reduzierbar auf W ( G U ) und daher aufzahlungsreduzierbar auf W ( F ) ist, folgt die Behauptung aus 3.12. 0 111. Rekursionstheoretische Kennzeichung algebraisch
abgeschlossener Gruppen Im letzten Kapitel haben wir gesehen, daB jedes System von Implikationen, das iiber einer a.a. Gruppe erfiillbar ist, und das
Algebraisch abgeschlossene Gruppen
487
aufzahlungsreduzierbar ist auf das Wortproblem einer Gruppe G E Sk(M), auch in M erfullbar ist. M enthalt also alle Gruppen F, die durch solche Systeme beschreibbar sind. Das sind gerade die F, deren Wortproblem auf das Wortproblem von G “*-reduzierbar” ist, (1.4). Im ersten Abschnitt zeigen wir, daB auch die Umkehrung gilt: Wenn die e.e. Gruppe F in alle a.a. M mit G E Sk(M) einbettbar ist, ist W ( F ) *-reduzierbar auf W ( G ) (1.7). Im zweiten Abschnitt grenzen wir den Begriff der *-Reduzierbarkeit gegen naheliegende starkere und schwachere Definitionen ab. Dann “berechnen” wir die “*-Grade” einiger e.e. Gruppen. Nach 1.3 ist klar, daB man das Skelett einer a.a. Gruppe kennt, wenn man weiB, welche *-Grade die Wortprobleme der e.e. Untergruppen haben. Da a.a. Gruppen bis auf partielle Isomorphie durch ihr Skelett vollstandig bestimmt sind (I, 2.10), entsprechen also die a.a. Gruppen gerade bestimmten Mengen von *-Graden. Im dritten Abschnitt charakterisieren wir diese Mengen mit rekursionstheoretischen Begriffen. Wir brauchen dabei, daD jeder *Grad das Wortproblem einer endlich erzeugten Gruppe enthalt und ein Aquivalent fur endliche Gleichungssysteme: r.a. Hornklassen (3.8) und (3.10). Auf diese Weise wird die Theorie der abzahlbaren a.a. Gruppen ein Teil der Rekursionstheorie. Im vierten Abschnitt geben wir einige Beispiele fur diese Beschreibung a.a. Gruppen. 1. Gruppen, deren Wortprobleme aufeinander *-reduzierbar sind
Definition 1.1 (A und B seien Teilmengen von w). 1. A ist *-reduzierbar auf B - A <* B -, wenn A S e B und w\A C:B. Wenn A = W : [ B ]und w \ A = W:.[B],schreiben wir ( z , ~ ’A ) : C * B. 2. A ist +-reduzierbar auf B - A & B -, wenn es eine rekursive Funktion f : w + 9-( w ) gibt, so daB fur alle n n E A gdw. f ( n )C B. Wir schreiben dann f : A s+B. 1.2. Wir bemerken, daB A c,,, B j A & B .$ A s*B j A S T B . Spater werden wir sehen, daB sich keiner der Pfeile umkehren 1aBt (3.5,2.10, 2.4).
Beispiel 1.3 (siehe 11, 3.14). U sei Untergruppe der e.e.Gr. G. G Untergruppe der a.a.Gr. M, F E Sk(M), W ( G V )C * W ( F ) .Dann gibt ‘es ein h E M mit U = cf E G hf = fh}. Denn, wenn
I
und
I ( i , x ) = {xVi(i)x-’
I
I
I
V, (6) i E W ( G U ) }
M. Ziegler
488
J ( g , x ) = { 1 x v i ( g ) x - ' =v , ( g ) I i gW ( G
1
1 u)),
I
muB h gerade I U J erfiillen. Da I SeW ( G U ) und J S : W ( G U ) , ist (2.1) I S eW ( F ) und J Si W ( F ) . Die Behauptung folgt nun aus 11, 3.12 denn nach I, 4.4 ist I U J in einer Obergruppe von M erfiillbar. Der nachste Satz verallgemeinert 11, 2.8, denn wenn F losbares Wortproblem hat, ist W ( F ) s * W ( E ) . Satz 1.4. F und G seien endlich erzeugte Gruppen, W ( F ) S * W ( G ) .
Dann ist F in jede algebraisch abgeschlossere Gruppe einbettbar, in die G einbettbar ist.
I
Beweis. F sei von f erzeugt und I ( i ) = { W ( i )& e W ( f )= e } , J ( i ) = {-I W ( i ) e W ( f )# e}. Offenbar ist I S eW ( F ) und J cl W ( F ) , daraus folgt I SeW ( G ) ,J S : W ( G ) .Da I U J in der Obergruppe M x F von M losbar ist, gibt es nach 11, 3.12 eine Losung f € M. Naturlich ist ( f ' ) = F.
I
Folgerungen 1.5. 1. Z u jeder e.e. Untergruppe F der abzahlbaren Gruppe H gebe es ein G aus dem Skelett der a.a. Gruppe M mit W ( F )S * W ( G ) .Dann ist H in M einbettbar. (Das verallgerneinert 11, 2.10.1.) 2. Sei M a.a. Wenn F, G E Sk (M), ist auch F x G E Sk (M). 3. Sei M a.a., G e.e. Wenn F,, ..., F,, E S k ( M ) und W ( G )S * W ( F ,x * x Fn), ist G € Sk(M).
--
Wir beweisen 2: Sei F, G C H und H = (F, G ) . Es ist W ( F x G )S + W ( H ) . Denn W ( xg ) E G l ( F x G ;f, g') gdw. W(f7 e ) = e und W ( e ,g) = e gdw. { W ( xe), W ( e ,g')} C G l ( H ; f, g'). Bemerkung 1.6. 1. Wir nennen F = ( f ) absolut endlich (a.e.) uber G = (2) prasentiert, wenn es ein endliches G1. & Unglsystem S ( g , 7, i )gibt, das uber G losbar it und fur dessen Losungen (g, f', K ) immer ( f ' ) ' ~ (f)' ist. (Man sieht leicht; wenn G# E, braucht man fur S nur ein Gleichungssystem; F ist uber E a.e. prasentierbar, wenn F a.e. prasentierbar ist.) Wenn G Z E und W ( G )r.a., ist F genau dann a.e. uber G prasentiert, wenn G und F so in eine endlich prasentierbare H einbettbar sind, daB jeder Homomorphismus auf H, der auf G injektiv ist, auch auf F injektiv ist ([3]). Denn, wenn S(g, h^) H endlich prasentiert, zeigt S, daB F a.e. uber G prasentierbar ist. Wenn andererseits F durch S a.e. uber G prasentiert wird, und S' endlich eine
x
A lgebraisch abgesch lossene Gruppen
485,
Obergruppe von G prasentiert (G hat r.a. Wortproblem), prasentiert S U S' eine Gruppe H mit den gewunschten Eigenschaften. 2. Wenn W ( F ) s * W ( G ) ,ist F uber G a.e. prasentierbar. Das GI. & Unglsystem S bekommt man, wenn man auf die Mengen I und J, die im Beweis von 1.4 vorkommen, 11, 3.13 statt 11, 3.12 anwendet. (Wir haben damit eine Verallgemeinerung von 11, 2.9.) Da naturlich, wenn F uber G a.e. prasentiert ist, F in jede a.a. Gruppe einbettbar ist, in die G einbettbar ist, erhalten wir so einen anderen Beweis von 1.4. Umgekehrt ist W ( F ) S * W ( G ) ,wenn F uber G a.e. prasentiert ist. Wir lassen den sehr einfachen Beweis weg, weil dieses Resultat aus dem nachsten Satz folgt.
E , G,,i < w seien endlich erzeugte Gruppen. Wenn fur alle n, m W(F,) # * W(Gox . x G,,,),gibt es eine abzahlbare algebraisch abgeschlossene Gruppe, in die alle G,, aber kein einbettbar ist.
Satz 1.7.
(Fur F = E, G ([31).)
-
=
Gi mit r.a. Wortproblem, folgt 1.7 aus Th-1.1 in
Folgerungen 1.8. 1. Fur e.e. F, G sind iiquivalent: (a) W ( F ) S *W ( G ) , (b) F ist in jede a.a. Gruppe einbettbar, in die G einbeftbar ist. ( Z u m Beispiel ist also W ( F )S * W ( G ) ,wenn F C G. Das folgr aber naturlich auch aus 11, 3.9.2.) 2. Die e.e. Gruppe F hat genau dann losbares Wortproblem, wenn F in jede a.a. Gruppe einbeftbar ist ([15]). 3. Wenn W ( F ) P T W ( G ) ,gibt es eine a.a.Gr., in die G aber nicht F einbeftbar ist ([ 151). Beweis. (Wir werden in 4.1 noch einen anderen Beweis angeben.) Die G, seien von i,erzeugt. Wir konstruieren ein (unendliches) G1. & Ung1.s stem S ( & i )in Koeffizienten go,gl,. . ., = i,das mit D = vertraglich ist. Diag(bicuGi,B)= Diag(@,,,Gi;go,. . Die algebraisch abgeschlossene Gruppe M wird dann eine von g' und einer Losung h' von S U D erzeugte Gruppe sein. Ti, i < w sei eine Durchzahlung aller endlichen Gleichungssystemen = Wp', . . . , W::, i < w mit Koeffizienten g' und Variablen i und f! W',) sei eine Durchzahlung aller endlichen Folgen von Wortern W ( &2). erzeugt. Wir konstruieren S als Vereinigung einer Die E seien von aufsteigenden Folge Si von endlichen G1. & Unglsystemen. Wir setzen So = 0 und unterscheiden 3 Falle.
u,,,
M. Ziegler
490
(i) n = 3 i : (a) Wenn T, U S, U D widerspruchsvoll ist, setze S.+l = S,. (b) Wenn T,(2,y’) U S, U D widerspruchsfrei, gibt es x k , x k + l , . . . ,x k + o = so daB T,( i ;X k , . . .,X k t r ) U S, u D widerspruchsfrei ist. Setze T, (2; x k , . * * X k + r ) u sn. (ii) n = 3i + 1: Wahle x k , . . . ,&+n, genugend groB, so daB s,,+, = { w:” A x k + , j c n,} u S, u D widerspruchsfrei ist. (iii) n = 3i + 2: Sei (k,m ) = i und Diag(Fm,f , )= R ( f m ) . s + 1 sei die die Liinge von fm. Mit i ’ kurzen wir x k ,..., x k + s ab. (a) Wenn S,, U D YGR(x”),wahlen wir ein cp E R ( i ’ ) mit S,, U D Y ” q und setzen S,,, = S, U { i cp}. (b) Der Fall S, U D k G R ( i ’ )kann nicht eintreten: Wenn hochstens in S, vorkommen, wurde S, U go,. . . Diag ,,,G,; go,. . .,&-,) k R(x”) sein. Wir schreiben G fur (@,,, G, ;2 0 , . . . ,g’,-,). Fur alle W ist W(Tm) E G1(Fm,j m ) gdw. Diag(G) kG A S, + W(i’) = e. Da Diag(G))CG A S, +, ist also wegen 11.1.5 dies aquivalent zu G l ( G ) kG A S, + W(i’) = e. Daraus folgt W(F,) ceW(G). Fur alle W ist W(Jm) $Z Gl(F,, f m ) gdw. Diag(G) U S. kG i W(i’) = e. Wegen 11.1.5 also gdw. es ein ,y E Ungl(G) mit W(x’) A e gibt. Daraus folgt GI(G) U S, U { x }kG i w \ W(Fm)C i W(G). Also zus_ammen W(Fm)S * W(G), in Widerspruch zur Voraussetzung. Damit ist die Konstruktion der S, abgeschlossen. h‘ sei eine Losung von D U S in der von g’, h’ erzeugten Gruppe M. Offenbar ist M Obergruppe der direkten Summe der G,, alle G,’sind also in M einbettbar. Wir zeigen, daB die Fm nicht einbettbar sind. Sei p : F, + M eine Einbettung .und p ( f , )= W’O’(8,L). Im Konstruktionsschritt 31 + 1 wurde sichergestellt, daB es h‘‘ = h k , . . . , h k + , , gibt, mit W(J)(g, 6)= 6‘. Wenn i = (k,m),wurde im Konstruktionsschritt n = 3i + 2 ein cp(i’) in S aufgenommen rnit Fm jd cp(f,,,). Da also MI= cp(i’), kann p keine Einbettung sein. Wir weisen noch nach, daB M algebraisch abgeschlossen ist. Sei dazu TI(&6,i )ein endliches Gleichungssystem, das in einer Obergruppe von M losbar ist. Dann tritt bei der Konstruktion an der Stelle 3i der Fall (b) ein. Mit der dortigen Bezeichung ist dann hk,. . .,h k + r eine Losung von T’ in M. 0 3
1
2. *-Grade 2.1. Aus A SLB S * C folgt A Si C; zusammen mit der Transitivitat von ceergibt sich also die Transitivitat von S * . Wir konnen deshalb
Algebraisch abgeschlossene Gmppen
40 1
sinnvoll die Aquivalenzrelation =* einfuhren: A =* B gdw. A <* B < * A , "A und B haben denselben *-Grad". V:= P ( w ) / = * ist die Menge der *-Grade. Da jeder *-Grad abzahlbar ist, ist I V ( = 2". Wir ubertragen C * auf V : A / = * c B / = * gdw. A <* B. (V, C ) ist ein Halbverband mit kleinstem Element +/=*. Es ist namlich, wie man leicht uberlegt, sup(A/=*, B/=*)= (A v B)/=*; +/=* besteht gerade aus den rekursiven Mengen. Aus A =* B s e C =* D folgt A C e D . Wir konnen also sinnvoll fur a, b E V a C e b durch A s e B definieren, wenn A E a und B E b. Ebenso hangen die folgenden Definitionen nicht von der Wahl der Reprasentanten ab: (Sei A E a E V, B E b E V ) a c r a b ,wenn A Crag;A S e b , wenn A c.B; A C * b, wenn A C * B ; A =* b, wenn A =*B ; a < ba, wenn A 6 *Bz (vgl. 2.6).
+
Definition 2.2. 1. A heifit vollstandig, wenn V B C w, B C e A B CIA. 2. a heifit vollstandig, wenn fur alle b s e a 3 b a (a, b E V ) . (Wenn A r.a., ist A vollstandig, gdw. A 1-vollstandig im Sinne von ([24] S. 82) ist.) Lemma 2.3. 1. Ist A vollstandig und A E a E V, so ist auch a vollstandig. 2. Fur jedes B gibt es ein vollstandiges B" mit B" =< B. B uist bis auf rekursive Zsomorphie eindeutig bestimmt. 3. Z u jedem b E V gibt es genau ein vollstandiges b" E V mit b z eb". Wenn B E b, ist b" =* B". 4. A C , B gdw. A " 6 i B " .
I
Beweis. (2). Setze B " = {(x, y ) x E W',[B]}.Offenbar ist B" S e B . Wenn W, = {(x, u ) ) )D, = {x}}, ist x E B gdw. (x, y ) E B". Also B C1B" 3 B %B". Sei A C eB", also auch A C eB und daher A = W : [ B ] .Aus x E A gdw. ( x , z ) € B" folgt A C , B". Die anderen Teile des Lemmas folgen daraus leicht. 0 In den nachsten drei Lemmas vergleichen wir 6 *mit zwei schwacheren Begriffen, die selbst starker als 6, sind. Aus A 6 *B folgt (a) A C : B, w \ A 6 ;B und (b) A 6,B, w \ A SraB. Sowohl aus (a) als auch aus (b) folgt A C T B (d. h. A C , , B, w \ A CraB). Aber weder aus (a) noch aus (b) folgt C * . Lemma 2.4. Fur eine Teilmenge A von w sind aquivalent :
M. Ziegler
492
(a) fur alle B folgt B (b) o \ A S c A .
S*A
aus B Si A und w \ B Si A.
Bemerkung 2.5. 1. (a) bzw. (b) sind aquivalent zu (c): fur alle B ist B STA gdw. B S * A . 2. Wenn A vollstandig ist, gilt w \ A g , A . Beweis. (a)+(b): Da w \ A S I A , A S i A folgt aus (a) o \ A S * A insbesondere w \ A S e A . (b)+(c): Sei B STA. Aus (b) folgt mit 11, 3.5 B S e A und o \ B S e A , also erst recht B < * A . Beweis von 2.5.2: Wenn A vollstandig ist, ist A " =, A. Es geniigt also w \ A " $.A zu zeigen. A ' sei wie im Beweis von 2.3.2. Aus ~ \ A " s . A folgt A : = { x) ( X , ~ ) ~ A ~ ) S , ~ 3 \ Aii~S ~ SA, .Sei A z : A s e A . Dann ist f u r a l l e x x E A gdw. ( x , z ) E A U . A l s o ( z , z ) e A " gdw. ( z , z ) E A " . 0 Definition 2.6. A und B seien Teilmengen von w. 1. A S = { ( m,, . . . ,m.) 3i mi E A). 2. A * = {S E s o 13i < I(S) s ( i ) E A). 3. B <:A gdw. ex. z fur alle x E w x E B gdw. 3u,w((x,u,w)E W, A D . C A A D ~ , C O \ A ) .
1
Lemma 2.8. A sei eine Teilmenge von w. Dann sind aquivalent (a) F u r a l l e B C w B S , A h w \ B S 3 A + B < * A . (b) A * = * A . (c) w'\A&S:A. (Fur rekursive aufzahlbares A wurde 2.8 in [3] bewiesen.) (Wenn (a) bzw. (b), (c) gilt, folgt B < * A aus B <,A und w \ B <,aA.) Beweis. (a)+(c): folgt aus A J S e A und o \ A 3 < $ A . (b)+(a): Aus w \ B <:A folgt w \ B S f A z S I A y Y * A 3 o \ B <:A. (c)+(b): Wir mussen Yw\AY<:A zeigen. Sei z :(w2\AZ)<:A, also fur alle x,y E w (x,y)$ZAZgdw. 3u,w((x,y,u,w)E W , h D . C A , D!,,Cw\A). Wenn n die Lange von s E Yw ist, folgt aus der letzten Beziehung durch n - 1maliges Anwenden: s @ A W gdw.
3u,w(3ml,..., mn-2[(3u',w'(s(O),s(l), u ' , w ' ) ~W, AD.,CD. h D ! , , , C { m l } )
Algebraisch abgeschlossene Gruppen A
( 3 U ‘, W ’ ( m1,
S(2), U ’, W ’ ) E
w, A D,. c D.
A
493
D L. c {m}) A ..‘
~ ( 3 u ’ , w ’ ( m , - , , s ( n - 2 ) , u ’ , w W, ’ ) ~A D , , c D . A D $ . c ( ~ . - ~ ) ) A ( 3 u ‘ ( m , - 2 , s ( n - 1 ) , u ’ , w ) EW, A D . . C D , ) ] A DC, A A D L C ~ \ A )
Aus dieser Darstellung erkennt man leicht *w \ A *
Lemma 2.9. Zu jedem B C w gibt es ein A mit B G I A und w’\A3&A. (Wir werden in VI sehen, dap fur “vollstandig subgenerische” Mengen w 2 \ A?$: A . ) Beweis. Wir konnen annehmen, daB B eine Menge von ungeraden Zahlen ist. Wir definieren zwei aufsteigende Folgen Po C PIC * * No C N , C N 2 . . . von endlichen Mengen von geraden Zahlen, so daB N, n P, = 0 fur alle i. Fur A werden wir B U u { P , i < w } nehmen. Wenn N , und P, schon definiert sind, unterscheiden wir zwei Falle: (a) Es gibt a, b E 2w; u, w E w so daB (a, b, u, w ) E W, oder ( b , a , u ,w ) E W, und (P, U { a } U D , UB)n(N, U D L U ( ( 2 w + l ) \ B ) ) = 0. Wir wahlen dann solche a, b, u, w und setzen P,,,= (P,U { a }U D,,)n 2 w und N , , , = ( ( N , U D $ )r t 2 w ) U (2k) fur ein 2k @ N, U P,+i. (b) Wenn der Fall (a) nicht eintritt, wahle einfach N, E N,,, C 2 0 und setze P,,, = P,.Weil B = A n ( 2 w + l ) , ist B sIA . Annahme: i : w 2 \ A 2 S ; A .D a wir die N, bei jedem Schritt vergrol3ert haben, gibt es a ’ # b”aus 2w \(A U N # ) .D a also (a’, b’) 6Z AS, gibt es u’, w’ mit ( a ’ , b ‘ , u ’ , w ’ ) EW, und D , . C A , DL C w \ A . D a a ‘ e D L . o d e r b’ D t . , erfullt a’, b’, u ’ , w’ oder b‘, a ’ ,u ’ , w’ die Bedingung von (a) bei der Konstruktion von P,,,, N,,,. a, b, u, w seien wie dort gewahlt. Es folgt dann, weil D, C B U Pa+,und D$C ((2w + l ) \ B ) U N,,,, daB (a, b ) oder (b, a ) A S ,also a, b fZ A . Es ist aber a E P,+lC A.
-
I
e
e
In ([24]) wird definiert: A s o B gdw. es ein z gibt, so daB fur alle x, A , gdw. 3 n ((x, u ) E W, A D t C w \ B ) . Fur rekursiv aufzahlbare A und B stimmt sQmit S * iiberein. In ( [ 3 ] )wird S o fur r.a. Mengen fur ahnliche Zwecke benutzt wie in dieser Arbeit S * . Nach ([24]) gibt es r.a. A, B mit A s T B und A SOB. Nach 2.8 folgt daraus wie in ( [ 3 ] )B z p o B , also Ba$*B. Wir bemerken ohne Beweis:
x
$?!
Lemma 2.10. 1. Fur jedes nicht rekursive A gibt es ein B rnit A s*B und nicht A S +B. 2. z und z ‘ seien aus w. Wenn es zu jedem B ein A mit
M. Ziegler
494
(2,z’) : A d *B ( z , 2 ‘ ) : A 6*B
gibt, gibt es ein rekursives f : w + ’9- ( w ) mit gdw. f : A G +B fur alle A, B. (Vergleiche VI, 3.6.) Folgende Verscharfung von 2.12 ist zugleich eine Verallgemeinerung des Satzes von Boone und Higman in ([4]):
Fur eine endlich erzeugte Gruppe sind aquivalent : (a) w \ W ( G )6, W(G ). (b) Es gibt eine endlich iiber G prasentierte Gruppe F und eine einfache Gruppe K mit G C K C E (ohne Beweis) Wir zeigen, wie das Hauptresultat von ([4]) folgt: “G hat losbares Wortproblem gdw. ex. endlich prbentierte Gruppe F und eine einfache Gruppe H mit G C H C F”. Wenn G losbares Wortproblem hat, ist w \ W ( G )d eW ( G ) .Wir wahlen H und F wie in (b). W ( F ) ist dann r.a. Also ist F in eine endlich prasentierte Gruppe einbettbar. Wenn es umgekehrt eine einfache Gruppe H und eine endlich prasentierte Gruppe F gibt mit G C H C F, folgt aus (a), daB w \ W ( G )6 , W(G).Weil G C F, ist aber W ( G )r.a. Also ist auch w \ W ( G )r.a. und somit W ( G )rekursiv.
*-Grade einiger Wortprobleme
Lemma 2.11. Fur endlich erzeugte Gruppen F, G ist W ( F X G)/=*= sup( W(F)/=*,W ( G ) / = * ) .
x
Beweis. F, G seien durch 2 erzeugt. Wir ordnen jedem Wort Vi (Z,y^) die Worter Vi (i,6 ) = V , . ( i )und Vi (6, 9 ) = V,.(y^)zu. Es ist dann i E W ( F x G ; E g )gdw. {2i’,2i”+ 1)C W ( F )v W ( G ) .Also ist W ( F x G )<*sup( W(F)/=*, W(G)/=*).Umgekehrt ist natiirlich W ( F )c*W ( F x G ) , W ( G )6*W ( F x G). Daraus folgt die Behauptung. 0 Zusammen mit 1.5.3 und 1.7 haben wir damit
Folgerung. F, Go,.. ., G, seien endlich erzeugt. Dann ist W ( F ) s * s u p ( W ( G o ) / = ,..., * W(G,)/=*)gdw. in alle a.a.Gr., in die all Gi einbettbar sind, auch F einbettbar ist.
Algebraisch abgeschlossene Gruppen
495
Lemma 2.12. Wenn die e.e. Gruppe G einfach ist, ist w \ W ( G )seW ( G ) . Beweis. Sei G von g erzeugt und go# e. Dann ist f # e gdw. go im Normalteilererzeugnis von f liegt. Aus der Beschreibung W ( g )# e gdw. ex. W ,,..., W,, W mit g o = W ( W 1W W ; ' , ..., W,WW,')(g) erkennt man, dal3 w \ W ( G )5,W ( G ) . Folgerung. Wenn G, F e.e. Gruppen sind, G einfach, ist W ( F )sTW ( G ) gdw. F in alle a.a. Grupgen einbettbar ist, in die G einbettbar ist. Lemma 2.13 ( F , G seien endlich erzeugte Gruppen). 1. Wenn je zwei nicht triviale Normalteiler von F nicht-trivialen Schnitt haben, ist W(F)"=* W ( F ) . 2. Wenn F, G # E, ist W ( F * G )=* ( W ( F )v W(G))Y 3. ( H # E habe losbares Wortproblem) Es sind aquivalent (a) W(F)J =* W ( F ) . (b) W ( F * H) s*W ( F ) . (c) Fiir alle a.a. Gr. M, F < M j F * H < M. (d) Fiir alle a.a. Gr. M und alle G, F < M, W ( G )C * W ( F )3 F*G<M. (2.13 enthalt Resultate von' ([3]) iiber Gruppen mit r.a. Wort problem.) Beweis. 1. Eine Folge a , , . . .,a, E F enthalt genau dann nur Elemente # e, wenn der Schnitt der von den a, jeweils erzeugten Normalteilern nicht-trivial ist. Also ist (wenn F = (f)) W , ( f )# e, W 2 ( f# ) e gdw. ex. uI,..., u ~0, ,,..., 0"Wl, W 2mit e # W l ( U I W I U ;,..., ' UkwlU;')(J)= W2( W2U;',.. . , 0,W2U;')(f).Das zeigt, daD w \ W ( F ) ' S ; W ( F ) . 2. Zunachst has F * G die Eigenschaft von 1: Den Fall 1 FI = 1 G I = 2 brauchen wir nicht zu behandeln, weil d a m die Behauptung von 2 ohnehin gilt. Sei F 3 3. Ein Element h = f o g l f l g.f, ( n E w, f , E F\ E, g , E G \ E ) von F * G heil3t F-Element. Jedes Element h' von ( F * G)\ E laat sich eindeutig in der Form (i) h oder (ii) gh oder (iii) hg oder (iv) glhg; oder (v) g schreiben, wobei h ein F-Element und g, g ;, g ; E G \ E sind. Jedes Element h ' aus ( F * G)\ E sind und einem F-Element konjugiert, denn, wenn f E F\ E und im Fall (iii) f # f;' und im Fall (ii) f # f n ist, wenn h' nicht selbst F-Element ist, fh'f-' ein F-Element. Wenn h,, h2 aus ( F * G ) \E sind und ahla-',bh2b-' F-Elemente, g E G \ E, so vertauschen g(ah,a-')g-' und bh2b-' nicht. Der Kommutator dieser beiden Elemente liegt dann
u;'
-
M. Ziegler
496
sowohl im Normalteilererzeugnis von h , als auch im Normalteilererzeugnis von hz und ist nichttrivial. Also ist W ( F * G ) " = * W ( F * G ) . Wegen F , G C F * G ist W ( F ) ,W ( G )S * W ( F * G ) . Also ( W ( F )v W ( G ) ) "c* W ( F * G)" <* W ( F * G ) . Weil in F * G nur die Gleichungen gelten, die aus den Gleichungen von F und G folgen, ist W ( F * G ) C , ( W ( F )v W ( G ) ) .W ( f ,g ) ist # e in F * G gdw. eine , . . . , W,,, U. existiert mit W ( xg ) = Folge von Wortern W 1 U1, wl(f)Ul(g). . . W n ( f ) U n ( gund ) W l ( f )U , , ( g ) ,. . . , U. ( g ) # e (etwas ( W ( F )v W ( G ) ) ,d. h. vereinfacht). Also ist w \ W ( F * G )Sra W ( F * G ) cfW ( F )v W(G))". 3. (b)t*(c) folgt aus 1.8.1. (a)+ (d) W ( F * G ) c *( W ( F )v W(G))-" <* ( W ( F ) ) *c*W ( F ) . (d)+(c) W ( H ) e *W ( F ) . (b) (a) W ( F ) " e*( W ( F )v W ( H ) ) " c* W ( F * H ) S * W ( F )6 *W(F)". 0 +
Das nachste Lemma zeigt, daI3 die Voraussetzungen in 1.3 notwendig sind.
Lemma 2.14. U C F, G seien Gruppen. F, G endlich erzeugt. Dann sind aq uiva lent : (a) ( W ( F U ) v w ( F ) ) S * w(G). (b) In jedes a.a. M 3 G ist F zweimal einbettbar: F, F ' C M mit F f l F ' = U, F = u F ' . (c) In jedes a.a. M 3 G ist F so einbettbar F C M, dalJ es ein h E M gibt mit U = { f E F h f = fh}.
I
I
Beweis. (a)+(c) folgt aus 1.3 und 1.4. (c)+ (b): Sei G, F C M und fur ein h E M, U = {f E F hf = f h } . Ein g E M erfullt genau dann das rekursive System von Implikationen (F sei von f erzeugt)
1
I
{hW ( j )= W ( f ) h+ x W ( f ) x - ' = W ( f ) W Wort }
I
u { x w 1 ( J ) x - 'G w2(J)+ h w2(J) A w 2 ( j ) h wl,W , Worter}, wenn T~.~ = idu und F n T~ [ F ]= U. D a es solch ein g nach I, 4.4 in
einer Obergruppe-von M gibt, gibt es nach 11, 2.7 auch eine Losung g' in M. Setze F ' = T ~ , [ F (b)+(a): ]. Wir werden spater (3.4) sehen, daI3 es zu jeder Menge A C w eine e.e.Gr. H gibt mit W ( H )=* A. Sei W ( H )=* W ( F U).Sei M eine a.a.Gr., die G enthalt und sei also F, F'C M mit F G ~ F ' F, f l F' = U. Sei F = ( f ) und vermoge des
I
497
Algebra isch abgesch lossene Gruppen
Isomorphismus F' = (f'). Dann ist fur jedes Wort W ( f )E U gdw. W ( f )= W ( f ) .Also ist W ( F U )G I W ( ( F ,F'); f). Daraus folgt W ( H )<* W ( ( F ,F'); f, f ) und daher nach 1.4 H < M. H liegt also in jeder a.a. Gruppe, in der G liegt, das bedeutet nach 1.8 W(H)<* W(G). 0
1
Im folgenden Lemma berechnen wir, wie "schwierig" es ist, ein F' mit F ="F', F n F' = U in einer a.a. Gruppe zu finden, indem man F * [ , F einbettet. Lemma 2.15. U sei eine Untergruppe der e.e. Gruppe E Dann ist W ( F *,F) =* W ( F )v ( W ( F U ) ) ?
I
Beweis. F sei von f erzeugt und F' = (f') eine isomorphe Kopie uber U. Zunachst ist W ( F * [ , F ' ) W ( F )v W ( F U ) . Denn in F*,F' gelten genau die Gleichungen, die aus GI(F, f ) U GI(F', f ) U {~ ( = j ) ~ ( fE )U )u T~ folgen. Aus dem Normalformsatz fur das amalgamierte Produkt sieht man: w(f,f:)#-egdw. ex. w I , ..., wk,U I , _ . .Uk,so , dal3 w ( f , f ) = w ~ ( f ) u ~.(. fwk' )(f)r(, . ( f ) und Wi (f), Ui (f') U. (Etwas vereinfacht) oder ex. W mit W ( ,f') = W ( f )# e. Also ist o \ W ( F *"F') sl W ( F ) v ( W ( F U))-w.Das ergibt zusammen W ( F * , F ' ) s * W ( F ) v ( W ( F (U))?. Weil F C F *"F' ist W ( F )s*W ( F *"F'). Weil F n F' = U folgt aus dem letzten Lemma, dal3 W ( F 1 U ) G * W ( F * , F ' ) und daher W ( F U ) s seW ( F *"F). Wir unterscheiden zwei Falle: (a) F : U = 2. Dann ist W ( F U )= W ( F / U ) rekursiv, und es folgt W ( F U)" v W ( F )=* W ( F )s*W ( F * " F ' ) . (b) F : U > 2 . D a n n g i b t e s i n F * u F ' f u r a l l e g E F \ U g ' E F ' \ U , h E (F*,F')\ U ein x E F*"F', so dal3 [ h ,xg'gx-'1 6! U. (Wenn zum Beispiel h = kk', k E F\ U, k ' E F'\ U, wahle y E F\(U U U g ) und y ' E F'\(U U ( k ' ) - ' U ) und setze x = y'y.) Fur g E F sei g' das entsprechende Element beim Isomorphismus (f)' (f')'. Dann ist g U gdw. g'g-' # e. Weiter ist go, gl $Z U gdw. ex. x E F *,F' mit [g&;',xg;g;'x-'] # e usw. Also g o , . . . ,g, U gdw. ex. xI, x z . . . x, E F *,F', so dal3
1
w(JI) 1
e
1-
I
1
1
e
e
- . [[g~g;',xlf~f;'x;'], xZf;f;'~;'], AISO ist w \ W ( F 1 w(F*"F'). [ [ e
. . .,I,xnfiffnlxi'] # e.
Satz 2.16. M sei eine algebraisch abgeschlossene Gruppe. Es sind
iiquivalent
M. Ziegler
498
(a) Fur alle G C F und G C H aus Sk(M) mit F r l H = G gibt es einKESk(M) mitFUHCK. (b) Wenn H, F E Sk(M) und G eine e.e. gemeinsame Untergruppe von H und F isr, ist H *G F E Sk(M). (c) Fur alle G E Sk(M) und e.e. H ist W ( H ) G .W ( G )3 H E Sk(M). Bemerkung. Fur algebraisch abgeschlossene Gruppen M rnit der Eigenschaft von 2.16 gilt: Jede Gruppe N rnit Sk(N) C Sk(M) ist in einer a.a. Gruppe N' enthalten, die zu M partiell isomorph ist. Insbesondere gibt es beliebig, groi3e zu M partiell isomorphe Gruppen. Beweis der Bemerkung: S(g^,i) sei uber N losbar. G sei von den g erzeugt. Wenn H von S uber G prasentiert wird, ist W(H)=S,W ( G ) und also H E Sk (M). Man sieht leicht, dai3 Sk ( N * G H) C Sk ( M ) . Auf diese Weise kann man N algebraisch abschliei3en. Beweis. (b)+(a) is klar. (c)+ (b). Aus 11, 3.3 folgt W ( H G ) seW ( H )und W(F G )6 =W(F).Wie im Beweis von 2.15 folgt W(H *OF)seW ( H )v W ( F )=* W ( H x F ) . Weil H x F E Sk(M), ist also H*GFESk(M). (a)+ (c). Wenn (a) gilt, ist W(F G )6 *W ( F )fur alle G C F aus dem Skelett von M.Das folgt aus 2.14. Wir brauchen eine Konstruktion, die uns noch spater nutzlich sein wird: c, b seien Erzeugende einer Gruppe 8 . Wir definieren bk = c-'bck fur k E Z und ai = bi+lbob;21b;1 fur i E o.8 sei durch die Relationen caiG ac, aia,A aiai, a;& e, aib= bai, i, j E u,prasentiert. Aus den Relationen fOlgt Uibk = b&, bk+i+:bk= aibkbk+i+lfur alle i E 0, k E 2.Man sieht leicht, dai3 man jedes Element von 3 in der Form c'b;; b;: schreiben kann. Daraus gewinnt man eine Darstellung jedes Elementes * . . ainb;; * b = ; mit von H in der Form c 'aai, i l < i 2 < * * - i nk, l < k Z < . . . k , , k , n i E Z , n , m Eo, n , # 0 . In ([29]) wurde (implizit) gezeigt, dai3 diese Darstellung eindeutig ist (siehe auch ([3])). 23 sei die von den ai erzeugte Untergruppe von 8. '?Bliegt im Zentrum von 8 und ist direkte Summe der zyklischen Gruppen der Ordnung 2 ( a i ) .Jede Untergruppe von 23 ist Normalteiler von 8 . Sei nun G endlicherzeugte Untergruppe von M and W ( H )6,W ( G ) .Wir konnen G f E annehmen. Dann gibt es eine rekursive Funktion f : 0 2 + 9- ( w ) mit
I
I
I
-
i E W ( H ) gdw. ex. k rnit f ( i , k ) C W ( G ) .
Algebraisch abgeschlossene Gruppen
499
Wir indizieren die ai doppelt vermoge einer rekursiven Bijektion zwischen w und w 2 . Es gibt also eindeutig j , k rnit = ai, und umgekehrt kann man zu j , k genau ein i mit a,= aj,kfinden. Die Untergruppe N von B sei erzeugt von
{ai,j * a i l l f ( i , j ) C W ( G ) , f ( ik, ) c W ( G ) I . Wir betrachten a / N . Die Erzeugenden cN and bN nennen wir E und & Ebenso bezeichenen wir aiN mit iii und biN rnit 6. Wenn k # O oder n,#O, und k l < k t < * ist C k i i i l * . . iiim6;;* * 6;; ! i i BIN. Wir nehmen nun eine isomorphe Kopie 5%’ von a, die von c’ und b’ erzeugt wird. A bezeichne die von den a: fur i E W ( H )erzeugte Untergruppe von 23’.Wenn a : E A, gibt es ein k E w mit f ( i , k ) C W ( G ) ;wir identifizieren a: mit i i , k (Wenn f ( i , j ) C W ( G ) ,ist di,k = iii.j) und bilden das amalgamierte Produkt BIN *a; B’. Das folgende System von Gleichungen und Ungleichungen in den Die Menge der “Unbekannten” 6, c, b‘, c ’ gilt offenbar in B / N *A 8’. Gleichungen ist aufzahlungsreduzierbar auf W ( G ) ,die Menge der Ungleichungen 1-aufzahlungsreduzierbarauf W ( G ) .Nach Satz 11, 3.15 gibt es eine Losung des Systems in M. Die ai, a:, 6k*,6;. fassen wir als Worter in c, 6, c f , b ’ , auf. ( i , j , k, i l , . . . in E w ; k l < k 2 < * . * < k , E Z)
0. ai,ks a: wenn f ( i , k ) C W ( G ) . 1. a 2 -- e, cai = aic, aiaj A up,, crib =hi. 2. ickai;..ai~6;;...b;;=aa: wenn k # O oder n,#O. 3. i aj,kai,* - * aim=a: wenn i l , . . . in# (i,k ) , und f ( i , k ) l t W ( G ) . 4. i a:, . . = a: wenn i 6i { i l , . . . i n } . Wir andern unsere Bezeichnungen und bezeichnen mit b, c, b’, c f eine Losung des Systems in M. Wir zeigen, dab i E W ( H )gdw. a : € (b, c ) . Daraus folgt dann die Behauptung mit W ( H )6, W((b,c, c’, b‘) (b, c)) und nach unserer Bemerkung am Anfang W((b,c, c’, b’) (b, c ) )6*W((b,c, c’, b’)) aus 1.4. Sei i E W ( H ) .Dann gibt es ein k rnit f(i, k ) C W ( G ) .Wegen 0 ist a : = aik E ( b , c ) . Sei umgekehrt a : € (6, c ) . Weil 1 gilt, ist a : = ckui,- - - a,b;; - - * b;: fur geeignete k, i l . . ., nl .. . ,k , - * * . Aus 2 folgt k = 0 und m = 0, also fur a : = ai,* . aim.Wegen 3 kann man schreiben a : = ajI,,]* * * f ( j , m , ) C W ( G ) ( r = 1,.. . n ) . Aus 0 folgt a : = a/,* - ai’.. Nach 4 mu0 i E Ql,. . .jn} sein, d. h. i E W ( H ) . 0
I
1
3. Ideale von *-Graden und a.a. Gruppen Definition 3.1. Wir ordnen jeder Gruppe M die Menge W , ( M ) C V der *-Grade aller endlich erzeugten Untergruppen von M zu. X C V
500
M. Ziegler
ist durch UX eindeutig bestimmt: X W , ( M ) schreiben wir W * ( M ) .
u
= {a E
I
V aC
UX}. Fur
Aus 1.4 schlieBt man sofort
Lemma 3.2. M sei algebraisch abgeschlossen. Dann ist fur alle e.e. F F E Sk(M) gdw. W ( F )E W * ( M ) . Mit diesem Lemma konnen wir I, 2.10/2.11 fur a.a. Gruppen anders formulieren. Satz 3.3. M, N seien Gruppen, M algebraisch abgeschlossen. 1. N < p M gdw. W , ( N ) C W , ( M ) . 2. Wenn N abzuhlbar ist, N < N gdw. W , ( N ) C W , ( M ) . Sei auch N algebraisch abgeschlossen. 3. N = p M gdw. W , ( N ) = W , ( M ) . 4. Wenn N und M abziihlbar sind, N = M gdw. W , ( N ) = W , ( M ) .
W , ( M ) sol1 an die Stelle des Skeletts w-homogener Strukturen treten. Ebenso, wie wir in I, 3 untersucht haben, welche Klassen als Skelette vorkommen, wollen wir jetzt die Teilmengen von V bestimmen, die von der Form W , ( M ) fur ein algebraisch abgeschlossenes M sind. Zunachst zeigen wir, daB alle *-Grade als Wortproblem einer endlich erzeugten Gruppe vorkommen. Satz 3.4. Fur alle A C w gibt es eine von zwei Elementen erzeugte Gruppe F mit A C ,W ( F )< + A .
Dieser Satz wurde (unabhangig) in ([3]) und [29]) bewiesen. Beweis. % sei wie im Beweis von 2.16. A bezeichne jetzt die von den ai, i E A erzeugte Untergruppe von '23. Setze F = %/AT.Weil c "ai, * * biz E A gdw. { i l , .. . in}C A ist W ( F )6+A. Weiter ist f : A C ,W ( F ) fur f ( i ) = Godelnummer von ai. Wir fassen ai dabei als Wort in b,c auf. 0
-
Folgerungen 3.5. 1. Fur alle a E V gibt es ein e.e. F mit W ( F )=* a. 2. Es gibt 2" von zwei Elementen erzeugte Gruppen. 3. Es gibt W ( F )6, W ( G )und eine a.a.Gr. M mit G < M , F A M. (Wiihle w \ A & A , W ( G )=* A, W ( F )=* w \ A , M nach 1.7.) 4. Es gibt e.e. F < M a.a., so dap fur kein HZ E F * H < M. (Wiihle A mit A* & * A (2.9), W ( F )= * A , wende 2.13.3 an.)
Algebraisch abgeschlossene Gruppen
50 1
Bemerkung 3.6. In [29] und [30] wird gezeigt, dal3 man Satz 3.5 nicht verscharfen kann, indem man s +durch scharfere Reduzierbarkeitsbegriffe - zum Beispiel s,,,- ersetzt. Es gibt ein nichtrekursives r.a. A, so daa fur keine endlich erzeugte Gruppe F mit nicht losbarem Wortproblem W ( F )%I1 A, [30]. ( s b r r : beschrankte wahrheitstafel Reduzierbarkeit.) Lemma 3.7. Wenn M algebraisch abgeschlossen ist, ist W , ( M ) ein Ideal in (V, s). Beweis. Sei b E W , ( M ) , also b =* W ( G )fur ein G E S k ( M ) . Wenn a s b fur ein a E V, wahlen wir ein e.e. F mit W ( F )=* a. Dann ist W ( F )<* W ( G ) und also nach 1.4 F < M, also a = W ( F ) / = *E W , ( M ) . Wahle fur a, b E W , ( M ) F, G E Sk(M) mit W ( F )E a, W ( G )E b. Nach 1.5.2 ist F x G < M und nach 2.11 ist sup(a, b ) = W ( F x G ) / = *E W , ( M ) . 0 Sicherlich gehort nicht jedes Ideal, z. B. {0/3*}nach 11, 2.11, zu einer algebraisch abgeschlossenen Gruppe. Wir wollen jetzt geeignete Bedingungen fur das Ideal definieren. Dazu brauchen wir einen “rekursionstheoretischen Ersatz” fur endliche Gleichungssysteme uber Gruppen: r.a. Hornklassen. Definition 3.8. 1. Eine Klasse X C 9 ( w ) heiBt Hornklasse, wenn (a) X ist abgeschlossen in der Produkttopologie von 2”. (b) X ist abgeschlossen unter Durchschnitten, d. h.
~ d c ~ + n d E ~ .
2. Eine UY-Klasse X heil3t rekursiv aufzahlbare (r.a.) Hornklasse, wenn X durchschnittsabgeschlossen ist. Bemerkungen und Definitionen. Die Hornklassen sind die Klassen der Form gW = {X E 9 ( w ) 3u,w ( u , w )E W A D, C X A D C w \X} fur W C w. Die Relation A E gW (zwischen A und W) ist arithmetisch (UY). Die r.a. Hornklassen sind von der Form gW fur rekursiv aufzahlbare W C w. Statt gw,schreiben wir 2,. Die Relation X E ,%, (zwischen X und i ) ist ZI:. i ist eine Godelnummer von gi. Einer Hornklasse X C 9 ( w ) ordnen wir die Hornklasse %”’ = {(X,, . . .Xn) (XI v v X n )E X}zu. Damit konnen wir weitere Hornklassen bilden: wenn X eine Hornklasse ist und A1,. ..An-]E 9 ( w ) , definieren wir die Hornklasse %‘[AI,..., A n - l ] = { X EB ( o ) l ( A 1 ,...,A n - l , X ) E X n } Typische .
1
1
- -
M. Ziegler
502
Beispiele fur r.a. Hornklassen sind W : [ X ]C Y (damit meinen wir { ( X , Y)I W : [ X ]C Y } )und Y n W : [ X ]= 0. Umgekehrt ist jede r.a. Hornklasse von der Form (0) n W : [ X ]= 0. Zu Hornklassen gehoren Mengen von aussagenlogischen Implikationen. Wir betrachten Formeln, die sich aus den Aussagenvariablen q(O), q(1), . ..,q(i), . . . i C w, und Konstanten T , I mit A , v, i , 3aufbauen. A C w erfullt die Formel cp, wenn bei der Wahrheitswertbelegung, die q ( i ) wahr zuordnet, wenn i E A, und falsch, wenn ifZA, cp den Wahrheitswert wahr bekommt. Wir schreiben A I= cp; und fur 9 C 9 ( w ) und eine Menge T von Formeln 9 I= T, wenn A I= cp fur alle A E 9, cp E T. Zur Hornklasse gW definieren wir die Menge von Implikationen
I
%'w = { A (40') j E D.)+
I
1
V { q ( i ) i E D:} (u, w ) E W ) .
(&, ist dann eine r.a. Menge von aussagenlogischen Implikationen.)
Wir haben A E gW gdw. A I= %'w. Man sieht umgekehrt leicht, daB man jede Menge von Implikationen in dieser Form schreiben kann und eine r.a. Menge von Implikationen in der Form 2%. Hornklassen % L C ( 9 ( w ) ) " ordnen wir entsprechend eine Menge von Implikationen in den Aussagenvariablen ql(i),q 2 ( i ) ,.. .,q"(i), i C w, zu, so daB fur alle A 1 , .. . , A . C w (A1,.. . , A n )E %Lgdw. (A1,.. . ,A.)I= %'L.Dabei bedeutet (A1,.. .,A,,)I=cp, daB cp bei der Wahrheitswertbelegung: "q'(i) H wahr gdw. i E Aj" den Wert wahr bekommt. Auf diese Weise gehoren die r.a. Systeme von Implikationen in den Aussagenvariablen q'(i), j = 1,.. . ,n, i < w zu den r.a. Hornklassen X". Ein System von (gruppentheoretischen) Implikationen Z entspricht einer Hornklasse X fur die (G, g') I= Z gdw. W ( G ,g') E X. Im nachsten Lemma fassen wir einige nutzliche Eigenschaften der Hornklassen zusammen. Wir vermerken dabei analoge Satze uber Gruppen, die wir fruher bewiesen haben. In einigen Fallen sind die gruppentheoretischen Satze Spezialfalle der Ergebnisse uber Hornklassen. Dazu benutzt man, daB die Klasse der Wortprobleme von Gruppen eine F.a. Hornklasse ist (3.9.9).
Lemma 3.9. 1. Der Durchschnitt einer Familie von Hornklassen ist eine Homklasse. Der Durchschnitt einer r.a. Familie von r.a. Hornklassen ist eine r.a. Homklasse (d. h. fur alle r.a. A ist i E A } eine r.u. Homklasse ). 2. 9 sei eine Familie von Homklassen. Wenn der Schnitt von je endlich vielen Mitgliedem von 9 nichtleer ist, ist der Schnitt von 9 nichtleer.
n{giI
503
Algebraisch abgeschlossene Gruppen
-
3. Wenn XI= q ( n l )v * * v q ( n k ) ,gibt es ein i mit X I= q ( n i ) (siehe 11. 1.4). 4. { X X [ X ]# 0) ist eine Hornklasse (siehe 11, 1.6) Wenn X r.a., ist auch " X [ X ]# 0 ' r.a. (11, 2.1). { i gi#0) ist IT?-Menge. 5 . X sei r.a. und A C w. Dann ist n X [ A ]<,A (11, 3.2). Weiter ist w \ u X [ A ]st A. 6. B < * A gdw. {B}.=X [ A ] f u r eine r.a. Hornklasse X (111, 1.6.2). 7. Die Klassen der Form X [ A ]f u r r.a. X sind gerade die von der Form { X X I= I U J } , wobei I bzw. J Mengen von positiven bzw. negativen Impfikationen rnit I <=A,J <:A sind (11, 3.13). 8. gA, C gBist eine l7:-Eigenschaft von A, B. 9. Wn:= {( W ( F ;f o , * * , f n ) F uon f erzeugt} ist eine r.a. Hornklasse {(x, n ) x E wn} ist X. 10. s seine eine endliche Teilmenge von w, A C w. Wenn i E w, bezeichnen wir mit g ( i ) die Godelnummer des Wortes in xo, xl,. . . , das aus Vi(xo, xl,. . . ) dadurch hervorgeht, dap man die Variablen x, mit j fZ s durch e ersetzt. Wir schreiben A 3. s f u r g - ' [ A ] . (Wenn F durch f o , . . . erzeugt ist, ist also W ( F ;f o , . . .) .1 n = W ( ( f o.,. . ,fn-]>).) {(X,Y)I X = Y .1 s} ist eine r.a. Hornklasse II? in X , Y und der Godelnummer von s.
I
1
1
I
I
I
1
Beweis. 1. Fur eine Familie (U, i E I ) ist n{gu8i E I } = %utu,,iEI). 2. folgt aus dem Kompaktheitssatz der Aussagenlogik. 3. Wenn die Behauptung von 3 nicht stimmt, gibt es fur jedes i ein Ai E X rnit Ai eCq(ni)- (also ni fZ Ai).Dann ist A 1n A2n... n A, E X und A t f l - - -n Ak F q ( n l )v . - .v q ( n k ) .Wid. 4. T sei die Menge aller Implikationen cp in den Aussagevariablen q l ( i ) , i < w, X 2 + cp. (Wenn X = %,, ist T r.a.) Aus (A,B)E X z folgt (A,B)I= T, und daher A I= T. Sei andererseits X [ A ]= 0. Die Horn klasse X 2 n n { { ( X ,Y)I a E X>I a € A ) n n { { ( X ,Y)I b fZ XI1 b fZ A ) ist leer. Nach 2 gibt es a',. . .,ak E A, b l , . . .,b,,, fZ A rnit %''I= q l ( a l )A A q'(ak)+ q l ( b l )v - .. v q'(b,,,). Eine Anwendung von 3 liefert ein i mit Z21= q l ( a l )A * * A ql(ak)+ ql(bi).Diese Implikation ist in T. Also ist A F T. Wenn gW = T, ist somit { X X [ X ]# 0) = gW. Die letzte Behauptung gilt, weil { i gif I}l7: ist. 5 . Definiere A + = ( q ' ( a ) l a E A }und A - = { i q ' ( a ) l a $ Z A } . Dann ist n % i [ A ] = { n % ' f U A ' U A - k q ' ( n ) } . Nach 3 ist das das gleiche w i e { n ( & U A + t q ' ( n ) } .Also n%i[A]S.A. n $ Z U g i [ A ]gdw. 2 f U A ' U A - k i q ' ( n ) . Das gilt nach 3 gdw. ex b $ Z A rnit X f U A ' U {i q ' ( b ) }k i q ' ( n ) .Daraus folgt die Behauptung.
--
-
1
I
I
M. Ziegler
504
6. Wenn (m,n ) : B S * A, ist {B}= { X WC,[A] C X A X n W!,[A] = 0). 1st umgekehrt { B } = %[A], so ist B = n X [ A ]= U%[A]. 7. Wenn %?,[A]gegeben ist, wahle fur I bzw. J die Menge der positiven bzw. negativen Implikationen in q2(j), j < w, die aus %': UA+ bzw. ,@ U A + U A folgen. Die gewunschte Eigenschaft folgt nun leicht aus 3 wie in 4. Wenn z : I S e A , z ' : J <:A, wahle 2, so, daB 2?: die Vereinigung von { A 0:- cp u ) E W2} und { A ( D : U ( w \ D : . ) - ) - cp I ( v , u , u ' ) E Wz.} ist. (Wir fassen dabei I und J als Mengen von Implikationen in den Variablen qz(i), i E w auf.) 8. Man erkennt die Richtigkeit der Behauptung aus %a C gB gdw.
I
,?(I
Vq E
2€3
t-cp.
9. Fr(go,. . . ,g,,) sei die freie Gruppe in den Erzeugenden g. Wn ist die Menge aller A C w mit (a) Wenn V, (go,.. . ) = e (in Fr(go, . . . ,g")), ist i E A. (b) Wenn i, j E A und V, (gel,. . . ,8.) V;'(g(l,. . . ,g n )= Vk (go,. . . ,g.) (in Fr(go, . . .)), ist auch k E A. (c) Wenn i E A und fur ein h E Fr(go,. . . ,go) h V, ( g ) h = V, (g), ist j E A. Denn W(F) erfullt naturlich (a), (b), (c). 1st anderseits (a), (b), (c) erfullt, ist N:= { V, (2) i E A } ein Normalteiler von Fr(g). Sei ( F ; f o , .. . ,fn) = (Fr(g)/N; g o . N, . . . ,g, . N). Aus der Definition von N folgt W(F, f ) 3 A. Wenn umgekehrt j E W(F,f), folgt V,(g)E N und daher V,(g) = V, (g) fur ein i E A. Wegen (c) (h = e) folgt j E A. Also ist W(F,f ) = A. 10. g ist rekursiv. Das System T von aussagenlogischen Implikationen {qI(n)+q*(g(n))l n E w}U{q*(g(n))-ql(n)) n E w } , fur das A = B 4 s gdw. (A, B ) C T, ist also r.a. 0
1
Definition 3.10. Ein Ideal I C V heil3t abgeschlossen, wenn es zu jedem A E I und jeder r.a. Hornklasse X,fur die %[A] # 0, ein B E UI mit B E % [ A ]gibt.
u
Bemerkung 3.11. 1. X C 9 ( w ) sei nichtleer. 3t ist genau dann Vereinigung eines abgeschlossenen Ideals, wenn (a) Fur alle A, B E X ist A v B E X ;und, wenn %[A] # 0 fur ein r.a. X,gibt es ein C E X mit C E X[A 1. oder (b) Fur alle A, B E X und r.a. X rnit X[A, B ]# 0 gibt es ein C E X mit C E X[A, B ] . 2. Wenn I abgeschlossen ist, C E UI, A SeC, B Sic,gibt es, wenn A fl B = 0, ein D E UI, das A und B trennt.
505
A lgebraisch abgeschlossene Gmppen
Zum Beweis von 1 benutzt man 3.9.6 und uberlegt, daB es ein r.a. X gibt, sodal3 %‘[A,B ]= { A v B} fur alle A, B. 2 gilt, weil “ W : [ X ]C Z A 0 = Z n W : . [ X ] ”eine r.a. Hornklasse ist. Es ist fraglich, ob aus der Eigenschaft 2 allein die Abgeschlossenheit von I folgt.
1. Wenn M a.a.Gr. ist, ist W , ( M ) ein abgeschlossenes Ideal. 2. Die abzahlbaren abgeschlossenen Ideale von V sind gerade die Mengen der Form W , ( M ) fur eine abzahlbare a.a.Gr. M. 3. Es gibt genau dann eine algebraisch abgeschlossene Gruppe M der Machtigkeit w 1 mit W , ( M ) = I, wenn 1 I I rs w I und I ein abgeschlossenes Ideal ist. 4. (CH) Die Mengen der Form W , ( M ) fur eine a.a.Gr. M sind gerade die abgeschlossenen Ideale von V. (Fur alle M ist I W , ( M ) l S min(2”, 1 MI + w ) . )
Satz 3.12.
(3.12) folgt aus dem nachsten Lemma: 1. Wenn M a.a. ist M w homogen (I, 1.10). Wegen I, 3.2.1/3.7. erfullt Sk(M) daher U, JE P und AA. Aus 3.2 folgt, daB Sk(M) = S( W , ( M ) ) , also ist nach 3.13 W , ( M ) ein abgeschlossenes Ideal. 2, 3, 4 erhalt man aus dem Lemma mit I, 3.8. Dazu uberlegt man, daB aus 3.5.1 folgt, daB fur jedes M Sk(M) = S ( I ) W , ( M ) = I. Fur 3 benutzt man, wenn Z abzahlbar ist, I, 4.8.2.
+
Lemma 3.13. Fur I C V sei S ( I ) die Klasse aller e.e.Gr. F mit W(F)/=* E I. Dann erfullt S ( I ) genau dann U, JEP und AA, wenn Z ein abgeschlossenes Ideal ist. Beweis. S ( I ) erfulle U, J E P und AA. Wir zeigen, daB U I 3.11.1(a) erfullt. Sei dazu A , B E UI. Wir finden ein abzahlbares J rnit A, B E UJ C U I , fur das S ( J ) U, JEP, A A genugt. M sei - nach 13.8.2 - eine a.a.Gr. mit Sk(M) = S ( J ) . D a - wegen 3.5.1 W , ( M ) = J, ist J Ideal (3.7). Daher ist A v B E UJ und also A v B E UZ. Fur die r.a. Hornklasse X sei X [ A ]# 0. Wir wahlen ein G C M rnit W ( G )=*A . Wir beschreiben % [ A ]durch zwei Mengen I’ bzw. J’ von aussagenlogischen Implikationen rnit I’< = A und J’
I
506
M. Ziegler
I
f l , f 2 EM. Wir setzen C = { i Ni(jl,f2) = e } . Dann ist C E X [ A ] und C S 1W((fl,f2))E Also C E UI.
UJ.
Sei Z ein abgeschlossenes Ideal. Wir zeigen, daB S ( I ) U, J E P und AA erfullt. U Wenn G C F und W(F)/=* E I, folgt wegen W(G) S * W(F) W(G)/=* E Z aus der Idealeigenschaft von I. JEP Wenn W(G), W ( F ) E UI, ist auch sup(W(G)/=*, W(F)/=*)E I. Nach 2.11 also F X G E S ( I ) . AA Fur die von fo, . . .,f n - l erzeugte Gruppe F sei W(F) E U I und S(x i )ein endliches Gleichungssystem, das in einer Obergruppe G von F durch g o , . . . ,g , gelost wird. Dann ist W(G: f , g ) E { X IXE W,,,, A W ( F ; f ) = X 4 n A Q C X } , wenn S(x 2) = { Vi i )= e i E a}.Nach 3.9 ist die eben beschriebene Menge von der Form X [ W(F)] fur eine r.a. Hornklasse X . Wegen der Abgeschlossenheit von I gibt es ein C E UI, C E X [ W(F)]. Weil C E W,,,,, ist C = W(H;fl, . . . ,fn‘-l, h o , .. ,,h , ) fur eine Gruppe H. Aus W(F) = C 5. n folgt (F, f ) = (fl,. . .,fn‘-,)’. Wir konnen also H als Obergruppe von F auffassen. Da Q C C ist h o , .. . , h, eine Losung von S ( x x’) in H. D a C E UI, ist H E S ( I ) .
(x
1
4. Anwendungen Durch 3.3 und 3.12 kann die Theorie - zumindest der abzahlbaren - algebraisch abgeschlossenen Gruppen auf die Theorie der abgeschlossenen Ideale zuriickgefuhrt werden. Zum Beispiel ist das Analogon von Satz I, 1.5.1 “Jede aufsteigende Vereinigung von abgeschlossenen Idealen ist ein abgeschlossenes Ideal”. (Ein Analogon zu I, 1.5.2 - und dafur zunachst den Begriff “elementare Unterstruktur” fur Ideale - werden wir im nachsten Kapitel kennenlernen.) Als weiteres Beispiel beweisen wir Satz 1.7 noch einmal, ohne uber Gruppen zu sprechen.
I und die hochstens abzahlbare Teilmenge Q von V seien disjunkt. Dann gibt es ein hochstens abzahlbares abgeschlossenes Ideal I C J, J n Q = 0.
Satz 4.1. D a s hochstens abzahlbare Ideal
Wir fuhren den Beweis nur in dem einfachen Fall, daB I Q 1 = 1 und I ein Hauptideal ist. Wir brauchen vier Hilfssatze. (1) Sei Xl r.a. und {B}#Xl[A]. Dann gibt es ein r.a. X 2 3 A, so dab fur alle C E X 2 {B}# Xl[C]. Beweis. Wenn B E X,[A], gibt es u, u‘, sodaB D, C A v B, D:,C w \ A v B und fur alle A’, B’ D. C A ’ v B‘, D:.C w \ A ’ v B’=$ B’6Z Xl[A’]. Setze
A lgebraisch abgesch lossene Gruppen
507
X ~ = { X ~ ~ W ~ D2 ,w C n ~~: .Xc 2, ( w \ x ) } . Wenn B, B ' E X 1 [ A fur B # B', sei b E B \B' oder b E B'\B. X 2 : = { X ( X 1 [ X ] n { Yb E Y } # 0 , X l [ X ] n { Y I b ! ZY } # 0 } ist nach 3.9.1 und 3.9.4 eine r.a. Hornklasse.
1
(2) Wenn fur ein r.a. X , %''[A]# 0, und fur alle C, D C w, (C,D)EXz[A]C j = B , ist B S * A .
Beweis. Nach 3.9.4 ist X : : = {(X, Y ) I X [ X , Y ] # 0) eine r.a. Hornklasse. Die Behauptung folgt nun aus 3.9.6, weil {B}= X l [ A ] . (3) Fur ein r.a. X sei X [ A ] # 0 und C P * A . Dann gibt ein B E % [ A ] mit C P*A v B.
-
Beweis. Wir konstruieren eine Folge X o 3 X13 X 2 3 * * von r.a. Hornklassen, fur die X,[ A ]# 0, und wahlen dann B E n { X ,[ A ] i < w } nach 3.9.2. Fur Xo nehmen wir X. Sei X,bereits konstruiert. Ware fur alle D ' E X,[ A ] {C) = 2, [A,D'], wurde aus (2) C S * A folgen. Es gibt also ein D E [ A ] mit {C}# 2, [ A , D ] . Wir wenden (1) an und bekommen ein r.a. X , , so daO (A,D) E X: und fur alle (A',D')E X i , { C } # %',[A',D'].Wir setzen nun Xn+l= 2, n X,. Wenn dann B E n { X , [ A ] i < w } , ist fur kein i < w {C} = .%,[A, B ] = 2, [ A v B ] . Nach 3.9.6 bedeutet das C # * A v B.
1
I
(4) Wenn A1< * Az, gibt es zu-jedem i ein j , so daB
Ri[ A 1 = ] %?, [A2].
Beweis. Sei - nach 3.9.6 - { A 1 = } % [ A 2 ]fur ein r.a. X.Wahle fur gj[[Az] - die nach 3.9.1 und 3.9.4 - r.a. Hornklasse { X % [ A ]n { Y X E .%i [ Y ] }# 0).
I
I
Beweis von 4.1. Sei Q = { C / = * }und Z von {Ao/=*}erzeugt. Wir konstruieren eine Folge A, S * A16 *AzC * * von Mengen mit C P*Ai. A, sei bereits definiert. Wenn i = (k,n) und % [A,,]# 0, wahlen wir nach (4) j E w, so daB %j [ A i ]= %';, [A,,].Nach (3) gibt es ein B E Xj[ A i ]mit C P*Ai v B. Setze Ai+l= Ai v B. Wenn die Voraussetzungen nicht gelten, setzen wir Ai+l= Ai. Das von den Ai erzeugte Ideal hat die gewunschten Eigenschaften. 0
- -
Lemma 3.95 liefert uns eine neue Methode zur Konstruktion abgeschlossener Ideale (und damit algebraisch abgeschlossener Gruppen):
M. Ziegler
508
Lemma 4.2. 1. Fur das Ideal I C V gelte: a E I, b S e a 3 b E I. Dann ist I abgeschlossen. 2. Sei Q C V. Dann ist Q : = { a ex. b,, . . . , b. E Q, a sesup(b,,. . ., b.)} ein abgeschlossenes Ideal.
I
Beweis. 1. Wenn A E U I und X [ A ] # 0 fur eine r.a. Hornklasse X, ist nach 3.9.5 n X [ A ]E X [ A ]fl UI. Fur 2 uberlegt man, daI3 aus Ai = s e B i ,i = 1,2 A , v A 2 S , B Iv B 2folgt. Folgerung. Es gibt 2-1 viele (nicht partiell isomorphe) alg. abgeschlossene Gruppen der Machtigkeit w , . Beweis. Man uberlegt leicht, daI3 es ein X C P ( w ) gibt mit 1 XI = 2", so dal3 fur alle A # A , , . . . , A , E X A # , A , v .. .&A,. W z n also Q und R verschiedene T e i l E n g e n von 3% sind, ist Q/=* # R / = * . 1st 1 Q 1 = w , , so auch Q/=* I = w , . Wir haben also so viele abgeschlossene Ideale der Machtigkeit w , wie Teilmengen von X der Machtigkeit wl. Die Behauptung folgt also aus 3.12. 0
I
Definition 4.3. Sei A C w. Wir bezeichnen mit MA die - eindeutig bestimmte - abzahlbare algebraisch abgeschlossenzGruppe mit W * ( M A )= {B B S e A } . (Es ist also W,(MA) = {A/=*}.)
1
Satz 4.4. 1. MA = Me gdw. A " =, B". A '/=* ist das gropte Element von W +. (MA). 2. Sei F endlich erzeugt und W ( F )=* A (Fur jedes A gibt es solch ein F nach 3.5). Dann ist MA die - dadurch eindeutig bestimmte kleinste algebraisch abgeschlossene Gruppe, in die F einbettbar ist ; d. h. 0.E. ist F C MA, und wenn p : F +- M eine Einbettung in eine a.a. Gruppe M ist, gibt es eine Fortsetzung p C v : MA +- M. 3. Wenn W(F)/=* vollstandig ist, gibt es eine kleinste a.a. Gruppe, die F enthalt. 4. M0 ist minimale a.a. Gruppe; d. h. jede a.a. Untergruppe von MB ist zu M0 isomorph. 5. In jeder Gruppe MA gibt es eine endlich erzeugte Untergruppe, in die jede e.e. Untergruppe von MA einbettbar ist. "MA besitzt eine universelle e.e. Untergruppe ". Bemerkungen 4.5. Z u 1: Nicht jede abzahlbare a.a. Gruppe, deren zugehoriges Ideal Hauptideal ist, ist von der Form MA (z. B. endlich
A igebraisch abgesch lossene Gruppen
509
generische Gruppen, vergleiche 4.4.5 mit V, 3.4). DaB W , ( M A )ein groBtes Element besitzt, folgt auch aus 5. Zu 3: Nach I, 1.3 ist jede Gruppe in eine a.a.Gr. einbettbar. Nicht jede e.e. Gruppe ist in eine kleinste a.a.Gr. einbettbar. Denn wenn z.B. I= losbares Wortproblem hat, ist F < Me. Sk(Me) enthalt ein G mit nichtlosbarem Wortproblem. Nach 1.8.2 gibt es eine a.a.Gr. M' in die G nicht einbettbar ist. F ist in M' einbettbar (1.8.2), es gibt aber keine a.a.Gr. M, die sowohl in M0 als auch in M' einbettbar ist. Denn wegen 4 ware M = MO,das widerspricht G < Me. Zu 4: M0 is offenbar die in I, 3.9.2 konstruierte abzahlbare a.a. Gruppe Mor deren Skelett die Klasse X o aller e.e. Gruppen mit r.a. Wortproblem ist. Wir werden spater noch andere minimale a.a. Gruppen kennenlernen: die endlich generischen Gruppen. Wie wir in der letzten Bemerkung gesehen haben, ist M0 nicht in alle a.a. Gruppen einbettbar. Zu 2, 3 und 4 vergleiche man I, 4.7.1: 4.7.1: Jede abzahlbare a.a.Gr. ist zu einer echten Untergruppe isomorph, die eine vorgegebene e.e. Untergruppe enthalt. Zu 5 : Man findet, wegen 11, 2.4.2, sogar eine universelle e.e. Untergruppe, die von zwei Elementen erzeugt wird. Andererseits ist eine Untergruppe einer a.a.Gr. schon universelle e.e. Untergruppe, wenn sich jede von zwei Elementen erzeugte Untergruppe in sie einbetten laat. Es is fraglich, ob alle abzahlbaren a.a. Gruppen, die eine universelle e.e. Untergruppe besitzen, von der Form MA sind.
Beweis von 4.4. 1. MA = M e gdw. W * ( M A )= W * ( M B )gdw. A z e B gdw. (2.3.4) A " = , B " . Nach 2.3 ist fur alle C, C S , A gdw. C S * A " . 2. F ist in MA einbettbar, weil A " s e A . Wenn F in die a.a. Gruppe M einbettbar ist, ist A " E W * ( M ) und daher W * ( M A ) C W * ( M ) .Aus 3.3.2 folgt MA C M ; man kann MA sogar uber F in M einbetten, weil M o-homogen ist. 3. folgt aus 2: Setze A = W ( F ) . Wenn W ( F ) vollstandig ist, ist nach 2.3 W ( F ) = * A". Fur 4 brauchen wir das folgende Lemma, das auch noch spater nutzlich sein wird.
Lemma 4.6 [32]. Es gibt zwei disjunkte rekursiv aufzahlbare Mengen A, 8, so daj? jede rekursiv aufzahlbare Menge, die A und B trennt, vollstandig ist. (A und B sind also insbesondere nicht rekursiv trennbar.) Nun zum Beweis von 4.4.4. Sei M a.a. Untergruppe von Mo. W * i M ) besteht nur aus r.a. Mengen. 3.11.2 und 3.12.1 lehren, dal3 A und B
M. Ziegler
510
von 4.6 durch ein - also r.a. - C aus W * ( M )getrennt werden. C ist also vollstandig und W , ( M )= W,(M,,) ist das von C / = * erzeugte Ideal. M = Mo folgt nun aus 3.3.4. Wir beweisen 4.4.5: Wahle ein G = ( g l , .. . ,g.) mit W ( G )=*A. ( S , (g, xI,x2) i < w ) sei eine effektive Durchzahlung aller endlichen Gleichungssysteme mit zwei Variablen. Wir schreiben Gl(G, f ) in der Form I(gl,.. . ,g “ ) . Wir zerteilen die Folge der Terme N, (yl, y 2 ) in eine Folge N ? ) ( y , ,y2), . . .,N(,‘!2(ylr y2), i < o,von endlichen Folgen von n + 2 Termen. H = (hl, h2) sei die durch die Gleichungen
I
U{Z(NY(hl,h2), . . . ,N!?(hI, hz)) u
u s , ( N P ( ~h2), ~ ,. . .,~ ( , ‘ ) (h2), h ~~, ( , ‘ i ~ h2), (h~ NL(~,, , h2))l i < W I
prasentierte Gruppe. Offenbar ist W ( H )% W ( G )und daher H < MA nach 3.3.2. Wenn F < MA, ist W ( F ) s ,W ( G )und daher F einbettbar in eine endlich uber G prasentierte Gruppe F‘ (11, 3.10). Wir konnen F’ als von zwei Elementen erzeugt annehmen. F’ sei durch S, (g, f l , f2) uber G prasentiert, F’= (8, f l ,f2). Dann ist F < F’= ( N ? ) ( ~h2), I , . . .,N‘,‘!2(hl,h2))C H : H ist die gesuchte universelle e.e. Untergruppe von MA. 0 IV. Elementare Aussagen Eine Theorie besitzt genau dann einen Modellbegleiter, wenn die existentiell abgeschlossenen Modelle dieser Theorie eine elementare Klasse bilden. Der Modellbegleiter ist dann die Theorie dieser Klasse ([6]).So ist die Theorie der a.a. Korper der Modellbegleiter der Korpertheorie. Halt man die Charakteristik fest, ist diese Theorie vollstandig. Wir werden zeigen, daB die Klasse der a.a. Gruppen nicht elementar ist (die “Gruppentheorie” bestzt also keinen Modellbegleiter). Eine Klasse X von L-Strukturen ist genau dann elementar, wenn sie mit einer Struktur auch jede elementar aquivalente enthalt und kompakt ist, d. h. wenn jede Menge von Aussagen (aus L,,), deren endliche Teilmengen jeweils ein Modell aus X haben, ein Modell aus X hat. Die Klasse der a.a. Gruppen erfullt beide Bedingungen nicht. Im ersten Abschnitt geben wir zu jeder a.a. Gruppe eine elementar aquivalente Gruppe an, die nicht algebraisch abgeschlossen ist. (Also kann sogar keine Klasse von a.a.Gr. elementar sein.) Wir Lienutzen dabei, daB sich in a.a.Gr. “ a E (b)” elementar beschreiben la&, d. h. gibt, so daB fur alle a, b aus einer daB es eine Formel cp (x, y) aus
A lgebraisch abgesch lossene Gruppen
511
a.a.Gr. M a E (b) gdw. MI= ~ ( ab). , Aus der Nicht-kompaktheit der Klasse der a.a.Gr., die wir im dritten Abschnitt zeigen, folgt sofort, daB es unendlich viele paarweise nicht elementar aquivalente a.a.Gr. gibt, insbesondere ist also die Theorie der a.a.Gr. nicht vollstandig. Beim Beweis nutzen wir aus, daB sich in a.a. Gruppen ein groBes Fragment von LEI.”elementar beschreiben lafit (2. Abschnitt). Wir geben dieses Fragment in ersten Abschnitt auf zwei Arten an: Als schwache zweite Stufe L* mit “eingebauter Zahlentheorie” und als die Eigenschaften von Gruppenelementen, die sich durch Quantifizierung aus “rekursiven” Eigenschaften von Gruppenelementen ergeben, die “arithmetischen” Eigenschaften. Das beantwortet die Frage aus ([ 141) nach einem Fragment von Lq+ mit “rekursiver” Konjunktion, das in a.a. Gruppen elementar beschreibbar ist. Diese Antwort ist in gewissem Sinn naturlich. Denn die mit L* ausdruckbaren Eigenschaften a.a. Gruppen entsprechen gerade den durch Lzbeschriebenen elementaren Eigenschaften des zu jeder a.a. Gruppe M gehorenden o-Modells w ( M ) = (0, + ,. , W * ( M ) ,E ) (1. Abschnitt. Das gilt eigentlich fur alle w-homogenen Gruppen). Im zweiten Abschnitt zeigen wir dann - und hier wird die algebraische Abgeschlossenheit erst wirklich gebraucht - daB sich die elementaren Eigenschaften von M und w ( M ) gegenseitig bestimmen. Das entspricht dem im letzten Kapitel begonnenen Programm: die Untersuchung a.a. Gruppen wird zuriickgefuhrt auf die Untersuchung der zugehorigen abgeschlossenen Ideale. Nicht jede Aussage aus LZl.”laBt sich in a.a.Gr. elementar beschreiben. Denn zwei abzahlbare L,,,-aquivalente Strukturen sind isomorph; es gibt aber elementar aquivalente, nichtisomorphe abzahlbare a.a. Gruppen. Aber auch ein anderes Extrem kommt vor: abzahlbare a.a. Gruppen, die sich eindeutig (unter allen anderen abzahlbaren a.a. Gruppen) durch eine elementare Aussage charakterisieren lassen. Zum Beispiel gibt es fur jede arithmetische Menge A eine elementare Aussage Q, so daB fur jede abzahlbare a.a. Gruppe M MI= Q gdw. M = MA (3. Abschnitt). Wir erhalten so explizit abziihlbar viele nicht-elementar aquivalente a.a. Gruppen. Aus der Existenz abzahlbar vieler unabhangiger r.a. Turinggrade gewinnt man abzahlbar viele bzgl. a.a. Gruppen unabhangige 3,-Aussagen. Also auch 2” viele a.a.Gr., die paarweise nicht 3,-aquivalent sind. (Das lost ein Problem aus ([14]).) Und verscharft I, 1.9: es gibt 2” viele nichtisomorphe abzahlbare a.a. Gruppen. Diese Beispiele aus dem dritten Abschnitt zeigen, wie modelltheoretische Probleme fur a.a. Gruppen auf rekursionstheoretische Probleme zuriickgefuhrt werden konnen:
512
M. Ziegler
Arithmetische Eigenschaften von M werden in LZ-Eigenschaften von w ( M ) ubersetzt, LZ-Eigenschaften von w ( M ) ubersetzt in elementare Aussagen uber M. So laBt sich z. B. durch eine elementare Aussage ausdrucken, daB W , (M) ein Hauptideal ist. Diese Ubersetzung geschieht mit den Methoden von Kapitel 11, beruht also auf dem Satz von Higman, dessen Beweis so kompliziert ist, daB die Ubersetzung kaum effektiv ist. Im vierten Kapitel zeigen wir, daB man w ( M ) auf naturliche Weise in M interpretieren kann. Wir geben die interpretierenden Formeln explizit an, die dann eine eff ektive Ubersetzung von L*-Aussagen in elementare Aussagen uber M gestatten. 1. Arithmetische Eigenschaften
8.8.
Gruppen
Lemma 1.1 ([6]). Wenn U ein nichttriuialer Ultrafilter auf w ist, und M eine a.a. Gruppe ist, ist M " / U nicht algebraisch abgeschlossen. Beweis. Nach I, 4.4 (siehe auch I, 4.3 oder 11, 3.14) ist fur Elemente b o , .. . ,b, von M bo aus der von b,, .. .,b, erzeugten Untergruppe, gdw. Mi= En(bo,.. . ,b,). Dabei ist En(xo,. . .) die Formel aus L G (siehe [141), V X (XXI
= XIX
A XXz A X2X A '
* *
A XX,
X,X + XXo
= XoX).
(Da wir keine anderen Sprachen betrachten werden, schreiben wir in Folgenden fur L o - die Sprache der Gruppentheorie - einfach L.) M " / U ist w,-saturiert, alle unendlichen definierbaren Teilmengen sind also uberabzahlbar. Als Obergruppe einer algebraisch abgeschlossenen Gruppe besitzt auch M " / U ein Element bl von unendlicher Ordnung. Die von b, erzeugte Gruppe ist nicht definierbar; also auch nicht durch Ei(xo, h). 0 Die Klasse der algebraisch abgeschlossenen Gruppen ist also nicht elementar. Wir geben spater noch einen anderen Beweis dafur an, indem wir zeigen, daB die Klasse der a.a.Gr. sogar nicht kompakt ist (3.5). Wir haben im dritten Kapitel gesehen, daB das Ideal W , ( M ) die algebraisch abgeschlossenen Gruppe M bis auf La. -Aquivalenz eindeutig bestimmt. Wir wollen nun zeigen, daB entsprechend der "elementare Typ" von W , ( M ) den elementaren Typ von M bestimmt. In welcher Sprache beschreiben wir W , ( M ) ? Es genugt - vermutlich - nicht die elementaren Eigenschaften des Halbverbandes ( W , ( M ) , S ) zu betrachten. Wir ordnen daher jeder Teilmenge I von
Algebraisch abgeschlossene Gruppen
513
V das w-Modell (w, + , . , UZ, E ) - fur das wir kurz ( w , U Z ) schreiben - zu; auf diese Weise gehort also zu W , ( M ) das a-Model1 w ( M ) =: (w, W ( M ) ) .Die elementare Sprache fur diese w-Modelle ist L2, die elementaren Eigenschafte von W , ( M ) sind also die Eigenschaften, die sich mit L2-Aussagen uber w ( M ) beschreiben lassen. Ein abgeschlossenes Ideal zu sein ist zum Beispiel eine elementare Eigenschaft von W , ( M ) . Denn 0 # X C P ( w ) ist genau dann Vereinigung eines abgeschlossenen Ideals, wenn ( O , X ) k V x ,Y V U ~ ~ Z I , Z ~ [VZY~A(%:,[x]#0-*z,E =X
%:,[x]].
Zur Rechtfertigung dieser Schreibweise erinnern wir daran, daO “A = B v C ” , % “,! [ A ] # 0”, “ C E [A]” arithmetische Eigenschaften von A, B, C, n sind, die somit in L z ausdruckbar sind. Wir nennen diese (w, X) algebraisch abgeschlossen. Mit L z konnen wir die elementaren Eigenschaften der partiellen Ordnung (I, < ) beschreiben. Denn “A <* B” ist eine arithmetische (2;) Relation. Zum Beispiel ist das Ideal I ein Hauptideal, gdw. die L2-Aussage max: 3 X V Y Y <* X in (w, U I ) gilt.
Formel p ( x o , . . . ,x,) kann mann eine L2-Formel p ( X ) angeben, sodap fur alle ao,. . . , a, aus einer a.a. Gruppe M Mi=p((ao,...) gdw. w ( M ) l = p ( W ( ( a ,... o a,)’)).
Satz 1.2. Z u jeder L,,
Folgerungen 1.3. Fur alle algebraisch abgeschlossenen N, M 1. Wenn w ( N ) = w ( M ) , ist N = M. 2. Aus w ( N )< w ( M ) und N C M folgt N < M. 3. Wenn N elementar in M eingebettet werden kann, ist jede Einbettung von N in M elementar. ( 3 gilt fur alle w-homogenen Strukturen.)
Beweis. Genau genommen hangt p noch von der Wahl der x o , . . . ab. (Die xi mussen nicht alle frei in p vorkommen.) Wir verzichten auf diese Genauigkeit, weil, wenn p ( x o , .. . ,x , ) = ~ ( X O.,. . ,X,, Xn+I,. . . , x k ) und (a) MI= p(ao,...,a,) gdw. w(M)l= p,(W((ao,.. . a,)’)),ist Mi= p ( a o , .. .a t ) gdw. w(M)t= p l ( W ( ( a o ,. . &>’) & ( n -t 1)). Wenn andererseits gilt (denn man (b) Mkp((ao,. . . , a k ) gdw. w(M)i=pz(W((ao,...ar,>’)), setzt einfach alle a,+],. . . , at gleich e ) M ! = p(ao,.. .,a,) gdw. w ( M ) i = pz(W((ao,.. . ,an>’)). Wir konstruieren nun p durch Rekursion uber den Aufbau von p.
M. Ziegler
514
Wenn p ( x l l ,. . . ,x , ) atomar ist, also 0.E. von der Form V,( x o , . . . ,x , ) = e, ist fur alle a l l , . . , a, E M M != p(all,.. . ) gdw. i E W((ao,. . . , a“)’).Wir nehmen daher fur p ( X ) die LZ-Formel i E X. Fur’ p l “ p z wahlen wir p , A pzr fur 1p -I p. Weiter ist (wir brauchen eigentlich nur den Fall k = n ) M k 3x., . . . xk p(aIl,.. . , an-,,X,, . . . ,x k ) gdw. ex. U,, . . . Uk E M w ( M ) k ~ ( ~ ( ( U I I. ., ,. a , ) ’ ) )gdw. es cine von all,.. . , ak erzeugte Obergruppe F von ( a l , , .. . a , - , ) gibt mit F E S k ( M ) und w ( M ) ! =p(W((ao,.. . , a k ) ‘ ) ) , gdw. ex. w E W*(M), W J, n = W ( ( a o ,... , a,-,>),W E wk mit w ( M ) k p ( W ) . Wir nehmen also fur 3 x , . . . xkp die L’-Formel 3 Y (Y E w k A Y .1 n = A p ( Y ) )- man beachte: “ A E wk”und “A .1 n = B” sind arithmetische Eigenschaften (Zl‘i). 0
x
Bemerkung 1.4. Wir haben beim Beweis von 1.2 nicht wirklich die algebraische Abgeschlossenheit von M gebraucht, sondern lediglich die w-Homogenitat. Auch auf die genaue Definition von W * ( M )kommt es nicht an; weil 111, 1.4 und 111, 3.2 nicht mehr gelten, ist fur beliebige w-homogene Gruppen W * ( M )ohnehin nutzlos. Wir haben aber fur alle p ( x o , . . . ), a,,,. . .E M, M w-homogen, Y C P ( w ) mit S k ( M ) = {FE 3 W ( F )E Y}(das gilt z. B. fur Y = W * ( M )oder Y = { W ( F ) F E Sk (M)}),daB M I= p (ao,. . .,a, ) gdw. (w, 9) P (w((an,. . . >)).
I
Die Sprache L *
1
Der Satz 1.2 gilt nicht nur fur Formeln p aus LW.+Wir geben eine starkere Sprache an, die sich in a.a. Gruppen immer noch in Lz Formeln ubersetzen IaBt. Wir werden sehen, daB sich auch umgekehrt L z in L * ausdrucken 1aBt. L * ist eine Art “schwacher zweiter Stufe”. Es gibt drei Sorten von Variablen: Zahlvariable uo, u,, . . . , wo,w , , . . .; Variable fur Gruppenelemente xo, x l , . . . , yo, zo,. . . und Variable a, 7,.. . fur endliche Folgen von Gruppenelementen. (Wir werden hier nur den Fall L = L G behandeln. Viele der folgenden Ergebnisse hangen von dieser besonderen Wahl von L nicht ab.) Zahlterme werden mit + und . aus den Zahlvariablen, den Konstanten 0, 1 fur die Zahlen 0, 1 und /(a), /(T),. . . gebildet. (f(a) gibt die Lange der Folge a an.) Individuenterme (fur Gruppenelemente) setzen sich mit und aus den Variablen xo, . . . zo, . . . , der Konstanten e und a ( t )fur eine Folgenvariable a und einen Zahlterm t zusammen ( a ( n ) ist das n + 1-te Folgenglied, wenn n < l(a).Sonst = e.) Atomare Formeln sind tl t2 und gl A g2 fur Zahlterme ti und Individuenterme gi. Die Formeln von L* bauen sich aus den atomaren Formeln mit A , -I und drei Arten von Quantoren
s1s
Algebraisch abgeschlossene Gruppen
auf: 3 u , 3 x und 3 ~ Es . ist klar wie die Formeln von L in einer Gruppe interpretiert werden sollen. Man kann die Aussagen von L * auch als elementare Aussagen uber ein dreisortiges System ((w ; + , ( M ; - I , e ) , ” M ) ( M eine Gruppe oder allgemeiner eine L-Struktur) mit Funktionen I: “ M + w (Lange der Folge) und AaAua(u) :“M x w + M (u-tes Folgenglied) auffassen. Mit L * laBt sich “a E ( b ) ” in jeder Gruppe ausdrucken: durch a ) ,
a ,
3~ ( ~ ( 0A) b
Vuo((r(u0+ 1) A a ( u o ) b v u ( u 0 + 1) A e )
A
A 3Ul(U(Ul)
A U V U(Ul)
A
U-I)).
Wir konnen L * als Teilsprache von L,,., ansehen: Jede L*-Formel cp(xo,. . . ,x.) ist gleichbedeutend mit einer Formel $(xo,. . .) aus L,,,,. Man gewinnt 4 aus cp, indem man beim Aufbau von cp die Quantoren 3 u p ( u ) durch V { p ( n ) n E w } und 3 a p ( a ) durch V { 3 x o , . . . ,x,,p(xoxl~ . x,) n E w } ersetzt. Zwei partiell isomorphe Gruppen erfullen also dieselben Aussagen von L *. Mit einem ahnlichen Beweis wie bei 1.2 zeigt man
1
1
XU,. . . , X,; UO,. . .Urn) kann man eine L2-Formel p ( u o , . . ., U k , %,. . . ,w- , ;X ) angeben, so dab fur alle n o , .. . , nk E w und a o , .. .,a, und b o , .. .,b, aus einer a.a. Gruppe M M b p(no, . . . , nk ;ao, . . . ,4,;60,.. . , 6 , ) gdw.
Satz 1.5. Z u jeder L*-Formel p ( u o , .. . , uk;
~ ( M ) pI (=n o , .. . , nk,l(K0), . . . l(6, ); w((ao,. . . ,a,, 60,.. . ,h )I). Folgerung. Zu jeder L*-Aussage p kann man eine L2-Aussage p angeben, sodab fur jede a.a.Gr. M Mi= p gdw. w ( M )I= 6. Wie in 1.4 gilt 1.5 in entsprechender Formulierung in allen w homogenen Gruppen. Wir wollen zeigen, daB L * und L 2 fur a.a. Gruppen dieselbe Ausdruckskraft haben. Dazu untersuchen wir zunachst die Ausdruckskraft von L *. Eine Eigenschaft P einer Folge no,.. . , a o , .. .,b o , .. . von naturlichen Zahlen, Elementen einer Gruppen und endlichen Folgen von GruppenElementen nennen wir rekursiu, wenn man - durch einen Algorithmus aus einer gegeben Folge no,.. .,a o , .. . , b o , .. . “ausrechnen” kann ob sie die Eigenschaft P in einer Gruppe M hat oder nicht. Der Algorithmus operiert mit der Folge der Gruppenelemente, indem e r z. B. aus zwei Gruppenelementen durch Multiplikation ein drittes bildet oder indem e r pruft ob ein bei der “Rechnung” produziertes Gruppenelement = e ‘ ist, oder nicht. Folgende Prazisierung bietet die Rekursionstheorie an:
M. Ziegler
516
P(vo,.. .,x o , . . . , co,.. . ) ist rekursiv, wenn es eine rekursive Relation R c w fx ” ( w ) gibt, soda8 fur alle no,. . . E w, ao,.. . , bo,. . . aus einer Gruppe M P- in M auf no,. . . ,ao,.. . , bo,. .. genau dann zutrifft, wenn R ( n o , .. ., l(bo),.. . ; W ( ( a o , . . ,bo,. . .))). Aus den rekursiven Eigenschaften erhalten wir die arithmetischen Eigenschaften durch Quantifzierneung. 2 ; und ZI; Eigenschaften sind die rekursiven Eigenschaften. Wenn P(v,. . .) ein ZI: (2:) Eigenschaft ist, ist “nicht P” eine 2: (I?:) Eigenschaft, und “ex. uo P”, “ex. x P” und “ex. a P” 2z+l.Man sieht leicht, da8 man jede 2 ; Eigenschaft P(vo,.. .xo, . . . ao,. . .) in der Form 3 ~w I~V T,~w, 2 3 ~w3 ~ , * Q,T, R ( v o , .. . wl,. . . l(ao), . .. 1 ( ~ ~. .).,; W((xo,... ao,.. . T ~.,. .))) schreiben kann, wobei R aus ZI: bzw. 2: (je nachdem ob 0, ein Allquantor oder ein Existenzquantor ist) ist. Ein 2: bzw. ZIY-Index von R ist ein 28,Index von P. Der nachste Satz stellt einige Eigenschaften der eben definierten Hierarchie zusammen, deren Beweise man leicht aus den Beweisen der entsprechenden Satze der Rekursionstheorie gewinnt. A
A
A
--
Satz 1.6. 1. Wenn P1 und Pz 2 : sind, sind PI A P2 und P1 v P2 2:. 2. Wenn P Z:, so is? P auch 2:+1und
3. Wenn P 2 :, ist (Vuo< v l P ) 2:. 4. V,(a)= e, I(a) uo sind rekursiu. 5 . Wenn R ( v o , .. . wo,. . ;X ) eine 2:-Relation ist, ist R ( u o , . . l(ao),. . ; W((ao,.. .))) eine 2: Eigenschaft. 6. Wenn P(uo) 2: und f : w + w rekursiu, is? auch P(f(uo))2;.
.
.
.
7. Wenn P(xo) Z :, ist P(V,(a)) auch 2:. 8. Fur jedes n gibt es eine hniuerselle 2: Eigenschaft 0 : die 2;Eigenschaft mit dem Index i trim auf uo,. . .xo,. . .ao,.. . genau dann wenn Q auf i, uo,. . .xo, .. .a. * * zutrift.
-
ZU,
Folgerung aus 8. Die Konjunktion einer “arithmetischen Familie ” uon 2: Eigenschaften ist arithmetisch. (Dabei wird eine arithmetische Familie durch eine arithmetische Menge uon 2 8,Zndizes beschneben.) Beispiel 1. “Algebraisch abgeschlossen” ist ZI;. Zum Beweis definieren wir G l ( A ) := {Vi(xo,.. .) = e i E A}. “Gl(A) U T widerspruchsfrei” ist eine ZIY-Eigenschaft von A, T. M ist genau dann algebraisch abgeschlossen, wenn es die folgende arithmetische Eigenschaft hat:
I
V a Vu (“Diag((a)’) U Gl(D,) U T G widerspruchsfrei”+ +3 7 0 ,
c
W((U,T)1))
Mit Hilfe von 1.6 rechnet man leicht nach, daB diese Eigenschaft ZI2” ist .
Algebraisch abgeschlossene G r u p p n
517
Beispiel 2. Die Eigenschaft MAX: “. . . besitzt eine universelle e.e. Untergruppe” (siehe 111, 4.4.5) ist 2%MAX laBt sich durch 3 a 0 V a l 3a2 ( a 2 E (a,,) A ( a 2 ) ’(a,)’) s ausdrucken. Daraus folgt die Behauptung, weil, wie man leicht iiberlegt, u E ( 7 ) (d. h. es gibt . . l o , I ] , . . . soda8 V*(T)= a ( O ) , V,,(T)= a(l),. . . ) 27, und (a)’= ( 7 ) ’ (d. h . W((a)= ) W ( ( 7 ) ) )IIf ist. Satz 1.7. Die Formeln uon L * beschreiben gerade die arithmetischen
Eigensc h afte n.
Folgerung. Zwei partiell isomorphe Gruppen haber dieselben arithmetischen Eigenschaften. Aus Beispiel 1 folgt also noch einmal die Folgerung I, 3.8.1. Beweis. Zunachst ist klar, daB die Formeln von L * arithmetische Eigenschaften beschreiben, denn offenbar beschreiben die atomaren Formeln von L * rekursive Eigenschaften und durch Anwenden von 3 u , 3x, 3a, A , -I gehen arithmetische Eigenschaften in arithmetische Eigenschaften iiber. D a die arithmetischen Eigenschaften aus 2 f Eigenschaften der Form R ( u o , .. . l(ao), . . . ; W((xo,. . . a o . . ))) - R 27 - durch Anwenden von 3u, 3x, 3u,i hervorgehen, brauchen wir nur zu zeigen, daB die ZtEigenschaften der angegebenen Form sich durch L *-Formeln ausdriicken lassen. Eine Zy-Relation R ( u , X ) ist aber von der Gestalt 3u,w ( S ( u , u, w ) A D, C X A D, C w \ X ) fur eine rekursive Relation S. Da sich rein zahlentheoretische Formeln schon in L* befinden, miissen wir noch zeigen, daB D, C W ( ( a ) )und D, C w \ W ( ( a ) )in L* beschreibbar sind. Diese Aussagen sind aber aquivalent zu V v E D, VU(a) A e und V u E D, i Vu(a) = e. Also geniigt es zu zeigen, daB V v ( u )= e zu einer L*-Formel aquivalent ist. V, ( x o * ) ist eine Folge von Variablen x , und x L’, die wir als eine Folge von 2n und 2n + 1 beschreiben. s ist die Godelnummer dieser Folge. Wenn ao,a ] ,. . . eine Folge von Gruppenelementen ist, ist also V, ( a o , .. . ) = e gdw. es eine Folge bo, b l , . . . von Gruppenelementen gibt (die “Zwischenergebnisse bei der Becechnung” von Vs) sodaB bo = e und b,(s)= e und fur alle j < l ( s )
-
-
bJ+l
=
{ b,b, .. aa,,
;I,
wenn so’) = 2n wenn so’) = 2n + 1.
Diese Darstellung zeigt, daB V, (a,,,. . . ) = e in L * ausdriickbar ist. 0 Wir konnen nun 1.5 in 2 des nachsten Satzes umkehren. 1 ist eine Verscharfung von 1.5.
M. Ziegler
518
2 : Eigenschaft P ( u o , .. . xo, . . . an * .) lupt sich eine .$!,+,-Formel P(u,,,. . ._wo,. . . , X ) uon L 2 angeben, so dap fur alle rill, . . . E o und a l l , .. b,,, . . . aus einer a.a. Gruppe M
Satz 1.8. 1. Z u jeder
(*)
P genau dann auf rill,. . . a , , , .. . b-0,. .: zurrifft, wenn o ( M ) C P ( n o , .. . l(bl,),.. . W ( ( a o , .. . bo,. . . >)).
2. Z u jeder 3;-Formel S ( u o , .. .XI,,. . . ) aus L z gibt es eine ZEEigenschaft S(u,,,. . . ,uo,. . . ), sodup fur alle no,. . . und bo,. . . aus einer a.a. Gruppe M (**)
genau dann a u f rill,-. . . &, . . . zutriflf, wenn o ( M ) I = S ( n l l , .. . W ( ( b l J ) ., . . ).
S
Bemerkung 1.9. 1. 1.8.1 gilt auch fur o-homogene M und fur - statt w ( M ) - ( w , Y)fur irgendein Y C 9 ( w ) mit S k ( M ) = {F W ( F )E Y}. 1.8.2 gilt fur beliebige M, fur w(M) braucht man nur ein ( w , 9)mit z. B. - Y = { f - ' ( W ( F ) ) F E Sk(M)}, f rek. 2. Eine 2 : Eigenschaft nennen wir eine .$: Eigenschaft, wenn sie in , , V.T ~w,z ,37.,,W Z * &,T.-I, W,-I der Form 3 ~w ~
1
1
-
R(ul,,.. . w ! . . . /(uo),. . . l(Ti), . . . ; w((Xo,.. . a n , .
. . 7 1 , .. . >))
darstellbar ist, wobei R eine L",'bzw. 2';Relation ist, je nachdem, ob ein Existenz- bzw. Allquantor ist. M A X ist ist z. B. eine Eigenschaft. Der Beweis von 1.8.1 zeigt dal3 es zu jeder .$: Eigenschaft P eine .$;-Formel P aus L 2 gibt, fur die (*) gilt.
s:-
Beweis. 1. Induktion uber n : Wenn P 2; Eigenschaft ist, existiert P mit (*) nach Definition der rekursiven Eigenschaften. Die 2 : + , Eigenschaft P IaBt sich in der Form 37, w i S ( u o , .. . w,xo,. . .x,, an,.. . u k , T ) fur ein 2: S schreiben. Z u S gehort aus L z mit (*). Wie im Beweis von 1.2 sieht man, eine .$A+, Formel dal3 die Formel
s
P(U0,.
. . wo,. . .,X ) =
= 3Y
3 w 3 i, j ( Y E W, A
X = YJ(wu+...wk+n+l)~
s(uo,. . . w, wo,. . . wk, i, Y )I\
1
j = wo+ . . . wk
+n+i)
die gewunschten Eigenschaften hat. 2. Wenn f die rekursive Funktion im Beweis von 111, 3.4 ist, ist
Algebraisch abgeschlossene Gruppen
519
1
W * ( M )= {f-’(W ( F ) ) F E S k ( M ) } fur alle a.a. Gruppen M. (Das folgt aus 111, 1.4 und 111, 3.4.) Wir beweisen 2 durch Induktion uber n. Wenn S rekursiv ist, beschreibt S ( u o , .. . W((ao)), . . .) eine rekursive Formel S ( v o , .. . Xo,. . .) Formel aus L 2 1aDt sich Eigenschaft. Eine in der Form 3 Y o , . .3w i P ( u o , .. . w, Xo,.. . Yo* ) schreiben, P A. Dann ist auch P’(u0,.. . W , Xo, . . . Y o , . . ) = P ( u ~.,. . W , XO,. . .f :I( YO), . . .) 2:. Zu P’ gehore die 2: Eigenschaft p’, so da13 (**) fur P‘, P‘ gilt. Sei M a.a. no,.. . E w und A,. . . aus M. Dann ist w ( M ) I =S(no,.. . W((&,)), . . .) gdw. ex. m und co,.. . E M mit w(M)I= i P(no,.. . m, W((bo)), . . .f - ’ ( W((co)),. . . ) gdw. die Z:+J Eigenschaft 3 w 3 ~. . .~i p’(vo,. , . . , w, ao,. . . T ~.,. .) auf no,.. . bo, . .. zutrifft. 0
e:,,
-
2. Elementare Aussagen Wir zeigen, daD sich in algebr. abg. Gruppen die L2-Eigenschaften von w ( M ) durch elementare Aussagen uber M ausdrucken lassen. Im Hinblick auf den letzten Satz sind also alle arithmetischen Eigenschaften von M elementar ausdruckbar. Wir betrachten spezielle f: formeln von L2: Wenn A r.a. ist, n 3 2 ist S Y X ) die Formel 3 Y , V Y 2 . . . O n - 2 Y n - 2 ( 1 ) 3 w EA(X, Y1,..., Yn-z)E %‘:-I. Wenn n ungerade ist, (ist Qn-2 ein Ex.-quantor) steht das Negationszeichen; wenn n gerade ist, steht kein Negationszeichen. Wenn A endlich ist, ist Sf(X) die n?-Formel von L 2 X E
U{gi 1 i E A } .
Satz 2.1. 1. Fur jede 3,-Formel p ( x o , . . . x , ) (aus L,, n 3 1 ) gibt es ein r.a. bzw. endliches A C w, so dap f u r alle ao,. . . a, aus einer a.a.
Gruppe M
(*)
MI=p(ao,... 4,)
gdw.
w ( M ) I = S : ( W ( ( a,... o a,))).
2. Fur alle m E w, n 3 1 ltnd r.a. bzw. endliche A gibr es eine 3, Formel p ( x o , . . . x , ) rnit (*). Folgerung. Z u jeder 3.-Aussage p gibt es ein r.a. bzw. endliches A C w rnit MI= p gdw. w ( M ) I =S t ( w ) (*) fur jede a.a. Gruppe M. Umgekehrr gibt es f u r alle n 3 1, r.a. bzw. endliche A C w eine 3,-Aussage p mir (*). Beweis. 1. n = 1: Die Existenzformel p ( x o , . . .x , ) hat die Gestalt 3 y 0 , . . . y, ( A S o ( ~ o , .. . y o , . . . ) v A S i ( x o , . . .) v * . * v A S, ( ~
. . ))
0 , .
M. Ziegler
520
wobei die S, endliche Systeme von Gleichungen und Ungleichungen sind. Fur ao,. . . a, E M ist MI= cp ( a o , .. .) gdw. eines der Gleichungssysteme S, ( a o , .. . y o . . . ) in M - und, wenn M a.a., uber M - Iosbar ist. Fur jedes i gibt es eine r.a. Menge von Implikationen I,(co,.. .c,,,), soda6 - fur alle u o , .. .,a, aus einer Gruppe M S, ( a o , .. . yo,. . .) genau dann uber M Iosbar ist, wennn MI= I,( a o , . . ). (Denn in 11, 1.6 ist die Menge I von Implikationen r.a., wie in 11, 2.1.) MI= I,(ao,...) kann man auch mit W((a,, . . .)) E X,fur eine r.a. Hornklasse X,= g,, ausdrucken. Wir setzen also A := { n o , .. ., nk}. n = 2: Die 32-Formel hat die Gestalt 3 y o , . . . yq 1cp (xo, . . . yo * . . ) fur eine Existenzformel cp. Wir wenden den eben bewiesenen Fall an und . g,,, SO dal3 fur alle u o , .. . bo,.. . aus bekommen r.a. Hornklassen gm7.. einer a.a.Gr. M
MI= cp(a0,.. . bo,.. .) gdw.
W((a0, . . . bo,. . . ) ) E %,U.
* .
U %”,.
M l = p ( a _ o , . . . ) gdw. w ( M ) I = 3 Y ( Y E Wm+q+l A Y 3. rn + 1= W((uo, ...)) A Y $Z Xmu . U g,,) folgt daraus wie im Beweis von Satz 1.2. Fur alle u o,.. . u k , w o , .. . wk E o ist {Y(D,,,,U-.. UD., c Y A ( D ~ , u - u - -D:J n Y = 0) eine r.a.
-
Hornklasse X , deren Index wir mit einer rekursiven Funktion g aus uo,. . . wk berechnen: X = gg(”,,, w k ) . W ist genau dann nicht in gm U * u g,,, wenn es (uo, w,) E W,, . . . ,(uk, W k ) E W,,, gibt mit W E X,,, B sei die Menge aller g(uo, . . w k ) fur (uo, wo)E W,, . . . ,(uk, w k ) E w,,,.B ist r.a. Fur jedes w E o ist {(x,Y)JY E eine r.a. Hornklasse, deren Wm+q+I A Y 3. m 1 = X A Y E gW} = Index rekursiv von w abhangt. Es ist MI= p(ao,.. .) gdw. o(M)C3 w E B (3Y E % , ( w ) [ W((a0, . . .))I). Weil o ( M ) abgeschlossen ist, gibt es gznau dann ein w E w*(M) W E g h ( w ) [ W((a0,. . . )I7 wenn X h ( w ) [ W((a0,. . .))] # 0. Nach 111, 3.9 ist die Menge aller W’ mit %$,(w)[ w‘]# 0 eine r.a. Hornklasse %h,(,,,). Der Beweis von 111, 3.9 zeigt, daB wir h ‘ rekursive annehmen konnen. Es ist also M I = p ( a o , ...) gdw. w(M)I=3 w E B ( W((ao,. . . )) E %hr’(w)). Wenn wir A := h ’ [ B ] setzen, gilt (*)* (Dieser Beweis zeigt auch, wie man, statt wie im Beweis fur n = 1 rnit den Methoden von Kapitel 11, rnit den Satzen uber abgeschlossene Ideale und r.a. Hornklassen arbeiten kann.) n 2 3 . Der Beweis geht durch Induktion uber n. Die 3, Formel p(xo, . . .x,) habe die Form 3y0,. . . yq 7 cp(xo,. . . yo,. ..) fur eine 3,-] Formel cp. Die Induktionsvoraussetzung liefert eine r.a. Menge A ’, so da6 fur alle ao,. . .,bo,. . . aus einer a.a. Gruppe M M I= cp (ao,. . . bo,. . . ) gdw. w ( M )C Sf:,( W((ao, . . . bo,. . .))). Also ist wie friiher M I= p(ao,...) gdw.
.-
.
Wk).
+
52 1
Algebraisch abgeschlossene Gmppen
w ( M )l=3 Y ( Y E
A
Y
.1 m + 1 = W((ao,. . .)) A i S
Y)).
Wir schreiben das in der Form u ( M ) l =S(W((ao,. . .))) fur eine LzFormel S ( X ) . Wir zeigen, daD S ( X ) fur ein r.a. A C w aquivalent in allen w-Modellen - zu S t ( X ) ist. 1. Fall: n gerade. Jedem w ordnen wir die r.a. Hornklasse
2);:
=
{(x,y, Y I ,. . . ,
1yE
yn-3)
Y
%,+,+I
A
.1 m + 1 = x
A
( Y ,Y , ,. . . , Y,,-,)E
%:-*I
zu. Wir finden ein rekursives g. Offenbar ist S ( X ) aquivalent zu
3YVY,3Y,...Q.-3Yn-33wE A ’ ( X ,Y, Yl, ... Yn-3)E%:;,). Nun sieht man, daB diese Formel fur A := g [ A ’ ]zu Sf aquivalent ist. 2. Fall: n ungerade. Die r.a. Hornklasse {(x, Y E w,,,,, A Y 4 m + 1 = X I sei %kfur ein r.a. W. Wir ordnen effektiv jedem k eine r.a. Hornklasse %:&I) zu. g also rekursiv. Wenn k = 2s, sei %:&f = {(X,Y, Y , ,. . . Yn-4 ( Y , Y , ,. . . Yn-3) E %:-’I. Wenn k = 2s + 1 und s = (u, vo) sei
Y)I
1
%;GI)
=
{(x,Y, Y , , .. . Y
I
~ - D, ~ )c x v Y
A
D Ln~(Xv Y )= 0).
Dann ist S ( X ) aquivalent zu
3 Y VYI 3YZ. * * Qn-3Yn-31 3k E (2A’U (2W + l ) ) ( X , .. . Y,,-J E %:&I). Wir wahlen also fur A g[2A’U (2W + l)]. 2. f sei wieder die rekursive Funktion aus dem Beweis von 111, 3.4, fur die W * ( M )= {!-I( W ( F ) ) F E Sk(M)} (wenn M a.a.Gr.). n = 1: Sei A = { n o , .. . , nk}. Jede der Hornklassen %,, gehort zu einer r.a. Menge von Implikationen Zi(co,., . c,), sodaB fur alle a o , .. . a, aus einer Gruppe M M I= Zi ( a o , .. .) gdw. W ( ( a o ,...)) E %,,. Aus 11, 3.13 (fur G = E) oder aus 11, 2.6 folgt, daB es endliche Systeme Si(ao,.. . yo,. . .) gibt, die genau dann in der a.a. Gruppe M losbar sind, wenn die Zi auf die a o , .. . zutreffen. Also ist w(M)l= Sf(W((u,,.. .))) gdw. &fb 3y0,. . .( A so(&, . . . y o * *.) V . * * V A & ( U o , . . . Y o , . . .)). n = 2: Sei A r.a. Jedem w E w ordnen wir effektiv ein endliches System S, (co, . . . , c,, yo, yl) zu, sodaB fur alle ao,.. . aus einer Gruppe M ( W((uo,. . . )) E %, gdw. S , ( a o , .. .yo, y l ) iiber M losbar ist, S, besteht aus Gleichungen und hochstens einer Ungleichung (11, 3.13). Dann ist fur alle a o , .. . aus der a.a.Gr. M u ( M ) t = S t ( W ( ( a o ,...))) gdw. es ein w E A gibt mit M != 3 y 0 3 y I S , ( a o , .. . yo, yl)). Weil die S , hochstens eine Ungleichung enthalten, ist Z(co,.. .do,dl) = {i A S , (co,. . . do, d , ) w E A } eine r.a. Menge von Implikationen. Damit gilt w(M)I= S t ( W ( ( a o ,...))) gdw. ex. bo,& E MFZ(a0,.. . bo, b l ) .
I
I
M . Ziegler
522
Wir wahlen ein endliches System S(c,,,. . . d,,, d , , 2 ) von GI. & Ungleichungen, das genau dann in M losbar ist, wenn Z gilt. Also ist w ( M ) C SA( W ( ( U ,.,., . ))) gdw.
M I = 3yo, yi V i 1 A S(ao,.. . , yo, yi, 5)
n 2 3 : Beweis durch Induktion uber n. Sei A C w r.a. S;;'(X)ist bis S t - I ( X v Y ) . Man auf Umbenennung gebundener Variable 3 Y i rechnet leicht nach, dal3 fur jedes w E w
{(z, Y , ,. . . Y ,
-.3)
I ( ( zJ m + 1) v f-'(zJ {m + 1, m + 211,
Y I ., . . Y n - JE $ty}
eine r.a. Hornklasse ist, deren Index g(w) rekursiv von w abhangt. Sei A ' = g[A], und p(xo,. . . xmt2)eine 3,-,Formel, sodaB fur alle aus der a.a.Gr. M M I = p(al~, . . . ) gdw. a,,,. . . w ( M ) I =St:,( W ( ( a , ,,,. .))). Dann gilt fur alle a o , .. . a, aus der a.a.Gr. M m(M)I=Sf( W ( ( a , ,.,. .))) gdw. es ein W E W * ( M ) mit w ( M ) + i S ; f - l ( W ( ( a , l , . . . ) ) vW) gibt gdw. ex. Qrn+l,am+zEM w(M)t= 1St-,(W ( ( a , ,.,. .)) v f - ' (W((a,,,+,, a,+J))) (nach 11, 3.4) gdw. ex. U , + ~ , ~ , + M ~ Emit w ( M ) C i S ~ ~ l ( W ( ... ( a a,+2))) o, gdw. ex. a,+l,am+2EMM I = i p ( t ~ i i , . . . ~ a m gdw. +~) M I= i I x m + i , ~ m + 7 2 ~p (a,,,. . ., a m , x m + 1, X m + Z ) . 3x,+,, x ~ i +cp (x,,, ~ . . . , xm+J ist die gesuchte 3, Formel. Beweis der Folgerung: Man wahlt a. = e, . . . ,a, = e, dann ist W((U,l,. . .)) = 0. 0 Wir zeigen nun, daB jede Lz-Formel aquivalent zu einer S f Formel ist. U m n moglichst klein zu halten, klassifizieren wie die L2-Formeln noch etwas genauer. Zunachst definieren wir die fi; bzw. 2: Formel P;f(X) fur ein r.a. A (bzw. endliches A ) als die Negation von St. Ebenso wie in 2.1 die 3,-Formeln zu den S : gehoren, gehoren dann die V,-Formeln von L,, zu den P ; f .
Definition 2.2. Eine S1-Formel von L 2 ist eine Formel der Gestalt S A ( X 1 , . . . ,Xk)= 3 4 W ( X I , . . . Xm, U , W ) E A A D" C ( X I v X z v . . . v X k ) ~ D f , C w \ ( X 1 ~fur ~ ~ein ~ vr.a. X kA. ) Wenn m = 0, sagt diese Formel gerade ( X I , . . ,xk) 2:. 1st k = 0, reduziert sich die Formel auf (xl,. . . ,x,) E A. Sonst driickt diese Formel, wenn A = Wi, ( x l , . . . x,) E W : [ X ,v . * * v X2]aus. Die S, Formeln haben die Gestalt 3 Y:, . . . Yi13y:, . . . y f , V Y : - . . Qn-lY;-' (1) cp (XI,
. ..
9
xm,
y ;,* . * yy,,;:, XI,.. .
7
* *
- Yn-'Qn-lyY-' '"-1
xk,y:7
* *
yrrn::)
* * *
Y n-1 3"-l
Afgebraisch abgeschlossene Gruppen
523
wobei Q eine S,-Formel ist. Wenn n gerade ist (wenn also Qn-l = 3 ist), steht das Negationszeichen. Wenn n ungerade ist, ist Qn-, = V und das Negationszeichen fallt weg. Die P. Formeln sind die negierten S, Formeln. Im nachsten Satz bedeutet Q E S, bzw. E P,,, daB es eine S,,-Formel bzw. P,,-Formel gibt, die in allen algebraisch abgeschlossenen oModellen zu cp aquivalent ist. Satz 2.3. Mit cp:(xl,. . . xm,X I , .. . x k ) bzw. t,b:(xli.. . X k ) seien S,, bzw P,-Formeln bezeichnet, i = 0 , l . 1. Fur alle n ist (i) 3 X , ~ o "3x1(p,", , cp,"vvYES. und VX~slro", Vx~cLo",Jlo"~cLYEpn. (ii) i q o " E P , , it,b,"ESn. (iii) Q,"E Sn+l,P,+l und +o"ES,+,,P,+I. 2. Fiir alle n 3 2 ist (i) Q ~ " A Q Y E Sund , , ~o"vt,b;EP,,. (ii) Fiir ein r.a. A ist qo"(xl,.. .,X k ) uquivalent mir Sf({x,} v * * v {x,} v X 1 v * * v x k ) . $o"(xl,.. . ,x k ) ist fur ein r.a. B mit P f ( { x l }v * * v x k ) iiquivalent. Wenn m, k = 0 meinen wir S ; f ( o ) , bzw. P f ( o ) . Umgekehrt, wenn A r.a., ist S;f({x,} v * * v x k ) E S, und P^({XI}v ' * . v x k ) E P n . 3. (i) V X 1Q A € S1 und 3 X I t,bAE PI. (ii) Jede endliche Konjunktion (Disjunktion) von Formeln aus S1 (PI) ist fiir ein endliches A iiquivalent zu P?({xl} v * * v X k ) (Sf({xl} v * v x k ) ) . Umgekehlr sind diese Formeln iiquivalent zu endlichen Konjunktionen (Disjunktionen) aus S1 (PI). 4. Fiir alle n 2 1 ist jede 2 ; Formel aus Sn+l und jede fit-Fonnel aus pn+l*
-
-
-
Bemerkung. Wir brauchen die algebraische Abgeschlossenheit von (w, 9)nur in 2(ii), 3(i). Es genugt sonst in l(i) fur 3 X 1cp A € S1 und V X , J/AEP , und in 4 fur n = 1, daB in den betrachteten w-Modellen
Y)Y alle endlichen Mengen enthalt. Diese Bedingung reicht auch in 2(ii) zusammen mit "A v B E Y gdw. A, B E 9"' aus, urn zu zeigen, daB cpo" - in allen diesen Modellen - aquivalent zu einem S~+,({x1}v * ) ist. Ohne die algebraische Abgeschlossenheit bekommt man in 3(i) nur V X , Q A E Pz. (0,
- .
Beweis. l(i) DaB 3 x 1~ 0 E" S,,, ist klar fur alle n, ebenso folgt 3x1 cp;E S,,fur alle n a 2 sofort aus der Definition. Sei s A (xl,. . .,X k )eine S,-Formel wie in 2.2. Wir formen die Formel urn und erhalten zwei r.a. SMengen A', A" soda0 s A aquivalent ist zur Disjunktion der Formeln
M. Ziegler
524
3u1, uz, w (XI, . . . xm, I , u2, w ) € A ' A
Du, c (xzv * .
v
x k ) h
D,
CW\XI
und 3U1, Uz, W (XI,. . . X,
U I ,Uz, W ) €
c X I A D t,
A " A D,, C (x2V . ' ' V
x k ) A D, c xi A D k CW\(xzA * * * h x k ) .
Aus dieser Darstellung sieht man, dal3 es genau dann "ein XI gibt mit s A (x, X I , .. . )", wenn es "ein endliches XI mit s A " gibt gdw. " ~ U I U, Z ,W ( x ' , U I ,U Z ,W ) E A ' A
D,, C (XZV ' ' . V
x k ) A
DLC W \ D y "
(Wahle XI= D,!) oder A "A
" ~ U I Uz, , W ( x ' , U1,Uz, W ) E Du,
c (XI v
*.
*
v
x k ) A
DLC
\(xZv .
'
v Xk)"
Setzt man nun (Wahle dann XI = 0,).
B
= {(i ,u I , w )
1 3u
zr
w '( x', u uz,w ') E A ' A D k C o \ D, I
A
D +!. = 0
oder 3uz(x, ul, uz,w ) E A"}, so sind in allen algebraisch abgeschlossenen o-Modellen s B ( i ,Xr,. . . x k ) und 3XlsA aquivalent. Denn jede endliche Menge ist als rekursive Menge durch eine Hornklasse beschreibbar und somit in jedem a.a. o-Modell enthalten. Durch Ubergang zu den negierten Formeln erhalten wir aus dem D a s A v s B zu s A U B Vorangehenden, daB f.a. n VXI $o", Vxl $o"EP,,. aquivalent ist, folgt zunachst 9 ;v cp E SI und $A A $; E PI.Weiter ist s A A s B aus Pz. Denn wenn C
= {(2x0,S)I x o E w , S E
A ) U { ( 2 x o + 1, S)( x o E W , S E B )
ist sA(xI,. . .) A sB(xl,. . .) aquivalent zu Vxosc(xo,x I , . . .). Wir haben also cp; A cp E P2und $A v $; E Sz. Durch Induktion uber n zeigen wir, daB f.a. n 3 2 c p o " ~cp;ES,,, c p o " ~cp;ES,,, $o" v J/; EP,,, G ,I A J/; EP,, (Damit ist dann auch 2(i) bewiesen). Fur i = 1,0 sei cp = oder abkurzend cp = 3XI$;-I. Dann ist cpo" v cp y 3 X I , .. .x l . . . (90" A c p ; ) aquivalent zu 3x1($0"-' v $;-I) 3x1,xi(+:-' A $;-I( * * . xi * * * ))). Nach Induktionsvoraussetzung ist v +;-' ($$-I A $?-I) zu einer P,-,-Formel oder, falls n = 2, zu einer S,-Formel (P,-l-Formel) aquivalent. Daraus folgt sofort, daB cpo" v cp Y (.yo"A cp 7 ) aus S, sind. Die anderen zwei Behauptungen folgen durch Ubergang zu den negierten Formeln. l(ii) folgt sofort aus den folgt durch Einfiihren Definitionen. l(iii) cpo"EP,+lund $$E "blinder Variable". Wir zeigen den Rest der Behauptung durch Induktion. Sei A = {x' I3y R ( i , y)} fur ein rekursives R. Dann ist R ( x , , . . . x,, u, w, y ) A 0,c ( X , ) A D k c w \ (X, ) aquivalent zu
r
r
---
--
525
A lgebraisch abgesch lossene Gruppen
+.
einer PI-Formel s A ist aquivalent zu 3u,w, y+ und also aus S,. Pz. Es sei nun qZ = 3Xl +;-I. Nach Daraus folgt, daB auch Induktionsvoraussetzung ist +:-I E P,, also cpZE Ebenso folgt +o"EP"+I. 3(i) Wenn A r.a. ist, konnen wir x I , .. . x, effektiv den Index g ( i ) einer r.a. Hornklasse R:(*)zuordnen, soda0 das Zutreffen von 1 S A (i, X I , .. . ,x k ) durch ( X I , . . ,x k ) E oder XkE %g(i)[Xlr.. . Xk-11 beschrieben werden kann. Weiter finden d r fur gdw. jedes 2 eine r.a. Hornklasse gk;), so daB ( X I , .. . ,x k - I ) E 2;;; Rg(:)[XI,. . . , x k - I ] # 0. D a (w, 9')algebraisch abgeschlossen ist, ist also 3xk i s A zu ( x ~. .,&.I ) E 2;;;) aquivalent. Dazu gibt es aber wieder u, w ) (u , w ) E Wh(*)} - fur das ein r.a. B - B = {(i, -I S " ( i , XI,. . . ,& - I ) zu (XI,. . .) E %[G\ \aquivalent ist. Wir haben gezeigt, daB 3xk E PI.Daraus folgt vxk q,',E S,. Fur 2(iii) und 3(ii) uberlegen wir, daB die PI-Formeln gerade die LZ-Formeln sind, die fur eine r.a. Hornklasse X aquivalent zu ({xl}, . . . ,x k ) E %'"+,' sind. Denn wenn wir fur ein r.a. A die PI s A ( x l r . . ,X,, X I , .. . X,') betrachten, ist Formel i
1
+,')
{ ( Y l v * ' * vY m VX1V*'.VXk)IVxIEY l * " V x m E
Y m 7 S A ( X 1 ,
... Xk)}
die gewunschte Hornklasse. 1st umgekehrt X gegeben, so ist fur alle .. . x k ) ( ({xJ,. . .x,)E x ~ +eine ' ) r.a. Hornklasse %:(,), x l , . . .x, {(x,, deren Index rekursiv von xI, . . .x, abhangt. Wie im Beweis von 3(i) bekommen wir ein r.a. A mit i s A ( i , X 1 , .Xk) . . gdw. ( X I , . . ) € %!&). 3(ii) folgt unmittelbar aus dem eben Bewiesenen. Eine Variante der letzten Uberlegung zeigt, daB fur jede r.a. Hornklasse X und jedes r.a. B x o E B A ({xl}v . . . v {x,} v XI v * * v Xk)E %%, aus PI ist. Daraus folgt der zweite Teil von 2(ii): Fur alle r.a. B 1st Sfl({xl}v . . . v Xk) aus S,. Die Umkehrung beweisen wir durch Induktion uber n 3 2 . n = 2: Aus 3(i) folgt, daB man cpt in der Form sC(xI,. . .,x,, y l r . . .,ymX I , .. .x k ) schreiben kann, C r.a. 3y,, . . . ys i Man findet leicht ein r.a. B, so daB s c( i , y 1 ,.. . , y s , x ) und s"(i,(yl,. . . ,y,), 2)aquivalent sind. cp; ist also aquivalent zu 3y i ~"( y, i2). , Nun sehen wir wie fruher, daB es eine rekursive Funktion gibt, fur die i s B ( i ,y, 2)gdw. ({x,}, . . .,{x,}, X I , .. .,x k ) E %:&?. Setzt man A = g [ w ] , ist dann cpi aquivalent mit Sf({xl}, . . . X I , .. .). Entsprechend folgt die Behauptung fur $6. n > 2: Sei cp; = 3 y I , . . Y, g y l , . . . Y ~ + ; - ~ ( X. .~x,, . j , xl,. . .x k , V ) . Da (w, 9')alle einelementigen Mengen enthalt, ist - in diesen w-Modellen - g j + ~ ' ( i j, , X,P) aquivalent mit
526
M. Ziegler
. A z,# 0 A v y l E z,. . . vy, E Z s + , ! - l (y,X, i, P)). In der Klammer steht eine P,-,-Formel, denn Z,#0 ist aus S1.Die P). Man sieht leicht, da13 es eine P,-lFormel nennen wir +(i,g,%, Formel + Y - ' ( i , X , Y) gibt, sodaB +(i, z,X,P) und 3 2 (ZI# 0 A
* *
-
+ y - ' ( i , X, Z , v * * v Z, v Y , v * * v Y,) aquivalent sind. D a in 'gdw. A v B E 9',ist QO" algebraisch abgeschlossenen (w, 9') A, B E 9 aquivalent zu 3 Y+y-'(i,X, Y). Wenn nach Induktionsvoraussetzung +Y-' fur ein r.a. A mit Pf-'({x,} v * . . v X I v . . . v Y) aquivalent ist, sind Q Z und Sf({xl} v . . . v X I v . . - ) aquivalent. 4. Wir brauchen nur zu zeigen, daB alle $;-Formeln aus S2 sind. Eine :-Formel Q (i,%) hat 0.E. die Gestalt
3 P 3 U ,W ( i , U,W ) E A A r.a.
A
D. C ( X IV . . * Y IV .. * )
h
D, c W
\ ( ~ V I
.
{ ( X I , . . Y1)l D, c ( X IV - * . Y , V
* *
* ) A
D, t w\(xl
V
* * *
YI V
* * *
),
.. . Y , V - . . ) }
ist eine r.a. Hornklasse, deren Index rekursiv von u und w abhangt. aus PI,so daB Q mit 3 P 3u,w (2, u, w ) E Es gibt also ein A A + ( u , w, 2, P) aquivalent ist. D a (i,u, w ) E A aus S1 ist, folgt die Behauptung aus 1 und 2. 0
+
Folgerungen 2.4. 1. Fur jede S,- (Pm-) Formel p ( X ) , n 2 2, m E w, gibt es eine 3.- (Vm-) Formel p (aus L,,), sodap (*)
fur alle ao,. . . a,,, aus einer a.a.Gr. M M t= p ( a o , .. .) gdw. w (MI t= cp ( w ( ( a o ,. . . , am>)I.
Und umgekehrt. 2. Fur jede S,- (P,-) Aussage Aussage p (aus L,,), sodap (**)
fur alle a.a.Gr. M M C
Q,
Q
n 3 2 , gibt es eine 3,- (Vn-)
gdw. w(M)l= p.
3. Zu jeder Konjunktion (Disjunktion) uon S , - (PI-)Formeln Q ( X ) bzw. Aussagen cp gibr es eine V,- (gl-) Formel bzw. Aussage p, sodap (*) bzw. (**). Und umgekehtt 4. Zu jeder 2;- (I?!,-) Formel Q ( X ) bzw. Aussage Q gibt es eine (V,+,-) Formel bzw. Aussage p mit (*) bzw. ( * * ) ( n 3 1). 5 . Z u jeder L2-Formel Q ( X ) bzw. Aussage Q gibt es eine Formel bzw. Aussage p aus L,, mit (*) bzw. (**); und umgekehrt. 6. Zwei a.a. Gruppen M, N sind genau dann elementar aquivalent, wenn w ( M ) und w ( N ) (bzgl. L2)elementar uquiualent sind. Wenn M Untergruppe von N ist, ist N genau dann elementare Erweiterung uon M, wenn w ( N ) elementare Erweiterung von w ( M ) ist.
Algebraisch abgeschlossene Gruppen
527
7. Jede st-Eigenschaft von Gruppenelementen ist in a.a. Gruppen zu einer 3,,,-Formel aquivalent. 8. Jede Z;-Eigenschaft uon Gruppenelementen ist in a.a. Gruppen zu einer 3,+2-Formel aquivalent. 9. Jede arithmetische Eigenschaft von Gruppenelementen ist in a.a. Gruppen einer elementaren Formel aquivalent. 10. Jede Formel von L * ist in a.a. Gruppen einer elementaren Formel iiquiiralent.
2.4.9 und 2.4.10 wurden unabhangig von [33] gefunden. Die Methode von [33] ist allgemeiner anwendbar, liefert aber nicht unsere genaue Angabe des Prafixtypes. In [33] wird bewiesen: In einer Struktur ‘?I gilt genau dann 2.4.9 oder 2.4.10, wenn es eine L,-Formel q ( x , y, 6 )gibt, mit der man jede endliche partielle Funktion f C A x A in der Form f = {(a,b ) ‘?I I= q(a, b, E)} fur ein c‘ E A schreiben kann.
I
Zwei Beispiele 2.5. 1. Dies Beispiel sol1 zeigen, wie genau man mit Hilfe von 2.3 L 2-Formeln klassifizieren kann. “Algebraisch abgeschlossen” druckt sich durch VX, Y Vv,[ 32,,-2, = X v Y A (%‘JX] # 0+ 3Z2E %‘JX])]aus. 2,= X v Y, XJX] # 0 und Z2E %‘JX] sind aus PI.Wegen 2.3.3(i) sind also auch 3 2 , 2,= X v Y und 3 Z 2 2, E .%‘u,[X] aus PI.Der Ausdruck [ . * .] ist nach 2.3.2(i), aus P2. Also ist VX, Y Vv, [ . . ] aus P2.“a.a” ist also aquivalent zu P f ( w ) . Dies ist eine arithmetische Aussage, deren Zutreffen in (w, 9’) nicht von 9’ abhangt. (Naturlich erfullen alle a.a. w-Modelle “a.8.”) 2. Die L2-Aussage max 3 Y VX X C * Y ist, wie man leicht nachrechnet, eine s;-Formel, entspricht also einer %-Amsage. Nun sind aber x E W:[Y], x E W : . [Y] und x E X, x $Z X aus S,.Daher ist +:= (x E X @ x E W:[Y]) A (x $Z X - x E W:.[Y]) und somit auch (2,z’) : X S * Y d. h. Vx+ aus P2. max ist also aus Ss, und entspricht daher einer elementaren 3,-Aussage max, die genau in den a.a. Gruppen M gilt, fur die W , ( M ) ein Hauptideal ist. 9
Um arithmetischen Eigenschaften moglichst einfache elementare Formeln zuzuordnen, definieren wir:
Definition 2.6. Eine Eigenschaft P ( u o , .. .,xo,. . .,ao,. . .) von naturlichen Zahlen (vo,.. .), Gruppenelementen (xo,. ..) und endlichen Folgen von Gnilppenelementen heifit StEigenschaft, wenn es ein r.a. A gibt, sodal3 fur alle no, E o und ao, go aus einer Gruppe M P genau dann auf no,. . . a o , .. . go,. . in M zutrifft, wenn s A ( n o , ., ., I(h0),. .. , W((ao,... bo, .. .))). PtEigenschaften sind
--
-
528
M. Ziegler
Negationen von Sf-Eigenschaften. 1st P eine S i bzw. Pi-Eigenschaft, so ist Vvo,. . .Vxo,.. .Vuo,. . .P bzw. 3 u o , .. . 3 x o , . . . 3uo,.. .P ein eine Pi+,,bzw. S i + , Eigenschaft. Der nachste Satz folgt leicht aus 2.3, 2.4 und den Beweisen von 1.2, 1.8 und 2.1. E S ; bedeutet: zu einer SE-Eigenschaft aquivalent. “Aquivalent” heiBt in 1 und 2: in allen Gruppen aquivalent; in 3, 4: in allen a.a. Gruppen aquivalent. Satz 2.7. ( P und P ’ sind Si-Eigenschaften.) 1. Fiir alle n ist 3 v o P , P v P ’ E SE; 1P E P i ; P E P i + , . 2. Fiir alle n 2 2 ist P A P’, 3 x o P, 3uoP E SE. 3. Fiir n = 1 ist VxoP, V u oP E Sf.
4. (i) Jede endlich Konjunktion uon Sf-Eigenschaften ist zu einer V1Formel uquiualent. (ii) Fiir alle n 2 2 ist P zu einer 3,-Formel aquivalent.
Beispiele 2.8. 1. “aOE ( a l , .. .,a,)” ist Sf-Eigenschaft (siehe Beispiel 2 nach 1.6), und daher in a.a.Gr. durch eine V,-Formel (z. B. En von 1.1) beschreibbar. (Z)‘= (it)’ ist Pf und durch “ex. Innerer Automorphismus . . . (eine V,-Formel) beschreibbar. 2. MAX ist, wie man aus der Darstellung im Beispiel 2 nach 1.6 sieht, Sf und daher durch eine %-Aussage beschreibbar: In ([14]) ist diese Aussage explizit angegeben. ”
3. Anwendungen Satz 3.1. 0. “(ao,.. . a,,,) unendlich” ist in a.a. Gruppen durch eine 3,-
Formel ausdruckbar (siehe auch [2]). 1. Dap (ao,.. . a,,,) einfach (nicht nilpotent) ist, ist in a.a.Gr. jeweils durch eine V2-Formel ausdruckbar ([14, 21). 2. “(ao,.. . a,,,) hat losbares Wortproblem” ist in a.a, Gruppen mit einer g3-Formel ausdruckbar. 3. ([2]) Sei G eine endlich erzeugte Gruppe mit losbaren Wortproblem. Dann ist in a.a.Gr. “(aO,... a,,,) ist in G einbettbar” durch eine 3 2 Form el a usdriick bar. Beweis. 1 (ao,.. .,a,,,) ist genau dann einfach, wenn es fur alle i = 0, 1,. . .m und Worter W, fur die W ( 6 )# e, Worter W, W , , .. . W, gibt mit ai = W ( W’WW;’, . . . W,WW;’)(cZ) (siehe auch den Beweis von (111, 2.12)). Aus dieser Darstellung erkennt man, daB “(ao,... ,a,,,) einfach” eine P:-Eigenschaft ist.
Algebraisch abgeschlossene Gruppen
529
Die Klasse der n-nilpotenten Gruppen ist gleichungsdefiniert, durch D a “(af,,. . . a,,,) erfullt 9,” eine Py-Eigenschaft von eine Gleichung 9,. n, an,. . . ist, ist “nilpotent” eine S!-Eigenschaft. 0. (an,. . . a,,,) ist genau dann endlich, wenn es Worter W f , ,. . . W k und Zahlen n,.,, ( i , j S k ) , gibt mit Wl,(ii)= a,,,. . . W,,,(ii)= a,,, und W,W;’(ii)= Wn,,,(ii) f.a. i,j S k. Das ist aber eine Sf-Eigenschaft. 2. Wir mussen zeigen, daR “ X rekursiv” aus S , ist. Da aber cp = ( u E X e= u E W z )A ( u $Z X e= u E W z . )Pz-Formel ist, ist 32, Z’VUCp E sj. 3. G sei von g erzeugt. Dann ist “(al,,. . . a,,,)< G” aquivalent zu: “ex. Worter Wo,.. . W,,,, sodaR fur alle W W ( a f , , . . a,,,)= e gdw. W ( W 0 , .. . W,,,)(g)= e (in G)”. Wie man sieht, ist das eine S;Eigenschaft. 0 Wir haben schon gesehen, daR die a.a. Gruppen M, deren zugehoriges Ideal W , ( M ) ein maximales Element hat, durch eine 3,Aussage und die a.a. Gruppen, die eine universelle endlich erzeugte Untergruppe besitzen, durch eine %-Aussage beschrieben werden konnen. Eine noch kleinere Klasse von a.a. Gruppen, die Klasse aller Gruppen, die partiell isomorph zu einer Gruppe MA sind, A C o,ist ebenfalls “elementar”:
Es gibt eine %-Aussage cp, die in einer a.a. Gruppe M genau dann gilt, wenn es eine Menge A gibt, so dafi M = p MA. cp gilt also in einer abzahlbaren a.a.Gr. M genau dann, wenn M isomorph zu einer Gruppe MA ist.
Satz 3.2.
Beweis. Wir mussen zeigen, daR die L ’-Formel
3x [(VZ 3 Y Y
=
w:[x]) A ( V Y 32 Y = w:[x])
aus S, ist. Dazu uberlegt man, daR Y
=
W : [ X ]aus Pz ist.
0
Die abzahlbaren a.a. Gruppen, deren Ideal ein Hauptideal ist, sind in gewisser Weise w-kategorisch: Jede abzahlbare a.a. elementare Erweiterung ist wieder zur Gruppe selbst isomorph. Denn sei M abzahlbar a.a. und W , ( M ) von B/=* erzeugt. Wenn N eine a.a. elementare Erweiterung von M ist, ist w ( N ) elementare Erweiterung von w ( M ) . Also gilt in o(N)ebenso wie in w ( M ) die L2-Aussage V Y Y S * B . D a W , ( N ) also auch von B erzeugt wird, ist W , ( M ) = W , ( N ) d. h. M s P N . Aus I, 4.8.1 folgt, daR es immer echte elementare Erweiterungen einer a.a. Gruppe gibt. Naturlich gibt es a.a. Gruppen mit nicht isomorphen abzahlbaren a.a. elementaren
M. Ziegler
530
Erweiterungen. Denn MI sei eine Gruppe mit uberabzahlbaren Skelett B. eine w-universelle a.a. Gruppe (I, 3.9) - und M eine abzahlbare elementare Unterstruktur. N sei eine abzahlbare elementare Unterstruktur von MI, die M enthalt, aber ein groBeres Skelett als M hat. Dann sind (I, 1.5.2) M and N algebraisch abgeschlossen und M < N. Aber naturlich nicht M = N. Der nachste Satz zeigt, daB die MA fur arithmetisches A sogar in folgendem Sinn "w-kategorisch" sind: Jede zu MA elementar aquivalente a.a. Gruppe ist zu M partiell isomorph. Dann geben wir noch ein Beispiel fur eine w-kategorische Gruppe, die nicht von der Form MA ist, deren Ideal noch nicht einmal Hauptideal ist. Wenn 9'" die Menge aller arithmetischen Mengen ist, ist (w, 9")nach 111, 4.2.1 algebraisch abgeschlossen. 9'"ist abzahlbar, also gibt es bis auf Isomorphie genau eine abzahlbare a.a. Gruppe Ma mit W * ( M a = ) 9'".
- z.
Satz 3.3. 1. Wenn A implizit arithmetisch definierbar ist (also z. B. arithmetisch ist), gibt es eine Aussage q A aus L,, die genau in den a.a. Gruppen gilt, die zu MA partiell isomorph sind. 2. Wenn A durch eine 2; (IZi)-Relationimplizit definiert ist, finden wir f u r q A eine 3,-Aussage rnit n = max(k + 1,5) ( n = max(k +2,5)). Wenn A eine A!-Menge ist, finden wir fur q A eine 3,-Aussage, n = max(k + 1,5). Mo ist durch eine V4-Aussage beschreibbar. 3. Es gibt eine V,-Aussage, die genau in den a.a. Gruppen gilt, die zu Ma partiell isomorph sind.
I
Beweis. Wenn { A }= { X R ( X ) } fur ein arithmetisches R, ist M genau dann partiell isomorph zu M A (d. h. W * ( M )= {B B <,A})wenn w(M)I=3 X ( R ( X )A V Y 3 z Y = W : [ X ]A V z 3Y Y = W : [ X ] ) .Da diese Aussage eine L2-Aussage ist, folgt 1 aus 2.4.5. Wenn man ausnutzt, daB Y = W : [ X ] aus Pz ist und daB, wenn R 2: (n;)), R ( X ) aus Sk+] ( P k + l ) , berechnet man leicht, daB diese Aussage aus S,,fur die angegebenen n ist. Der erste Teil von 2 folgt damit aus 2.4.2. Wenn A eine A: Menge ist, ist X = A d. h. V u ( u E X u E A ) aus pk. Wenn wir in der oben angegebenen Aussage R ( X ) durch A = X ersetzen, erhalten wir eine Aussage aus S, fur n = max(k + 1,5). Nach 111, 4.4.4 (und 111, 3.3.2) ist eine a.a.Gr. M genau dann zu Mo partiell isomorph, wenn W * ( M )nur aus r.a. Mengen besteht. Denn wenn W * ( M )nur aus r.a. Mengen besteht, ist eine (jede) abzahlbare a.a. Untergruppe N von M in M,, einbettbar. Daraus folgt N = M o und daher W , ( M o )C W , ( M ) , d. h. W , ( M ) = W , ( N ) . 9' enthalt genau dann. nur r.a. Mengen, wenn ( w , Y ) I = V X 3 z ( X = W z ) .D a X = W, aus P2,ist diese Aussage aus Pa. Damit ist 2 bewiesen.
1
f,
53 1
Algebraisch abgeschlossene Gruppen
3. Die vollstandigen Z:-Mengen 0‘“’sind implizit II‘:-definierbar: Es gibt eine 17: Relation R, so da13 fur alle n {+‘”)} = { X R(n,X)}([24] S. 344). Die arithmetischen Mengen sind die auf ein 4cn) I-reduzierbaren, oder die auf ein 4‘”’aufzahlungsreduzierbaren. Also ist W * ( M )= 9” gdw. w ( M )I= Wz 3 X R ( z ,X) A V Y 32 3X ( R ( z ,X) A Y Sex). Diese Aussage ist, weil R aus P , und Y s e x aus Sz ist, aus Ps. 0
1
Wir haben unendlich viele nicht elementar aquivalente a.a. Gruppen gewonnen, z. B. sind, wenn A arithmetisch ist und A f,B, MA und ME nichtelementar aquivalent. Wenn n < m, so gilt z. B. in M+cm, die 34-Aussage “+‘“‘)E W*( * . .)”, nicht aber in M+(,,,.(“4‘m)EW * ( .. .)” ist in a.a. Gruppen durch eine g4-Aussage ausdruckbar, weil das gerade w ( M ) I =3 X R ( m , X ) - fur R wie im Beweis von 3.3.3 - bedeutet. R ist f l y und also E P,.) Algebraisch abgeschlossene Gruppen, in denen verschiedene 34-Aussagen gelten, wurden zuerst in ([14]) konstruiert; dort wurde zum Beispiel die g4-Aussage MAX untersucht. Bei den bis jetzt betrachteten Gruppen, gilt MAX in den MA,nicht aber in M a oder ([14]) in a.a. w-universellen Gruppen. In M a oder in w universellen Gruppen gilt noch nicht einmal max, denn es gibt keinen maximalen arithmetischen *-Grad und keinen maximalen *-Grad (weil es keinen maximalen arithmetischen T-Grad und keinen maximalen T-Grad gibt). Es blieb in ([14]) offen, ob es 3,-Aussagen gibt, durch die sich a.a. Gruppen unterscheiden. Wir konnen jetzt leicht eine solche Aussage angeben: Wenn A eine r.a., nicht-rekursive Menge ist, ist “ A E W*( * )” -J,-ausdruckbar; denn die LZ-Aussage3X X = A ist aus S3. Wegen 111, 4.1 (oder 111, 1.8.2) gibt es algebraisch abgeschlossene Gruppen M, fur die A nicht in W * ( M )vorkommt. Andererseits gilt aber “A E W * ( . *)” in allen ME, B C w. Im nachsten Satz zeigen wir auf ahnliche Weise, daB es sogar 2” a.a. Gruppen gibt, die sich paarweise durch 3,-Aussagen unterscheiden. Aus dem Vorigen ergibt sich die Nichtkompaktheit der Klasse aller algebraisch abgeschlossenen Gruppen auf folgende Weise: Wir betrachten die Menge der 3,-Aussagen “c$(’”)€ W*( * * fur m E w und die %-Amsage “ex. m 4““’6Z W * (. . . )”. (Diese Aussage ist 3*, weil M genau dann diese Eigenschaft hat, wenn (fur das R aus dem Beweis von 3.3.3) u ( M ) l =32 V Y -I R ( z , Y ) . )Diese Menge ist off ensichtlich nicht erfullbar; je endlich viele dieser Aussagen sind aber in einer a.a. Gruppe gultig. Denn in M+(”,gilt “ex. m 4‘“‘’ W * (* . * )” und fur alle i S n “4(i)EW * ( * . )”. Wir geben im nachsten Satz nicht erfullbare, aber endlich erfullbare Mengen von Aussagen kleineren Prafixtyps an. Es gibt abzahlbar viele rekursiv aufzahlbare Mengen, deren
-
e)”
-
M. Ziegler
532
Turinggrade paarweise unvergleichbar sind (1241 S. 175). Auf ahnliche Weise kann man eine Folge (Ai i E w ) von r.a. Mengen konstruieren, deren Turinggrade "unabhangig" sind: Wenn i k? { i t , .. . , in},ist A, jdTAi,v A, v . * * v A,".(Das folgt auch aus ([31]).) Die Konstruktion liefert sogar eine r.a. Familie, d. h. {(x, i ) x E A,}ist r.a. Wir wahlen uns fur jedes i eine 3,-Aussage aus L,, (pi, die genau dann in einer a.a. Gruppe gilt, wenn A, E W * ( M ) .32 V Y Y # A, ist eine L2Aussage aus S4. Es gibt also eine 34-Aussage p aus Lw,wdie genau dann in einer a.a. Gruppe giIt, wenn es ein Ai gibt, das nicht in W * ( M )vqrkommt. Wir wahlen uns noch eine 3,-Aussage 4, die fur gibt. eine a.a. Gruppe aussagt, dal3 es ein i mit Ai E W*( *
I
1
a
)
Satz 3.4. 1. Die Folge der 3,-Aussagen cpi, i E w ist in a.a. Gruppen unabhangig: Fur jedes B C w gibt es eine a.a. Gruppe M mit B =
I
{ i ~ i c p =i } . 2. Sei T , = { c p i i E w } und cp die oben definierte %-Amsage. Dann hat zwar jede echte Teilmenge von TIU {cp} ein algebraisch abgeschlossenes Modell, es gibt aber keine a.a. Gruppe M mit MI= T I U{p}. 3. Sei Tz = { i (pi i E w } und I(I die oben definierte 3,-Aussage. Dann hat jede echte Teilmenge von T2U {cp} ein a.a. Modell, T2U {rp} aber nicht.
I
1
Folgerung 3.5. 1. Es gibt 2" viele a.a. Gruppen in denen paarweise verschiedene 3,-Aussagen gelten. 2. ([16]) Es gibt 2"' viele verschiedene elementare Typen von a.a. Gruppen (vgl. [28]). 3. Die Klasse der algebraisch abgeschlossenen Gruppen ist nicht kompakt.
Beweis. 1. Sei B C w und I das von den Ail=*,i E B erzeugte Ideal. Z ist abzahlbar. Wenn j 6Z B, ist A,/=* nicht aus I ; denn sonst gabe es i,, . . . in aus B mit A, S * A,, v . . . v Aimund somit A, STAi, v . * * v A,", was der Unabhangigkeit der A i / = T widerspricht. 111, 4.1 liefert uns nun ein hiichstens abgeschlossenes abzahlbares Ideal J, Z C J, das kein Ail=*, j k ? B enthalt. Nach 111, 3.12.2 gibt es eine abzahlbare a.a.Gr. M A mit w*(M;) = J. ES gilt B = { i A, E w*(M;)}. 2. Es ist klar, dal3 keine algebraisch abgeschlossenen Gruppe TIU { a } erfullt; denn cp sagt ja gerade "es gelten nicht alle (pi". Da z.B. Mo ein Modell von T , ist, genugt es zu zeigen, dal3 fur jede echte Teilmenge S von w {pi i E S } U {cp} ein a.a. Modell besitzt. Solch ein Modell ist aber - in der Bezeichung des Beweises von 1 - M 5
1
1
A lgebraisch abgeschlossene Gruppen
533
3. 4 sagt, da13 nicht alle i c p , gelten, es gibt also kein algebraisch abgeschlossenes Modell von T. U ( 4 ) .Sei S eine Teilmenge von w . Dann ist M;,,,, ein Modell von T., wenn S = w, und von { - I ~ ~ , ( ~ E s ) u wenn { + ) ,~ f w .
Bemerkungen 3.6. 1. 3.5.1 ist optimal: In allen algebraisch abgeschlossenen Gruppen gelten dieselben 3.-Aussagen (siehe [ 141). Es gilt sogar fur jede iI2-Formel cp (x,,, . . . x”): Fur alle a.a.Gr. M , N , a , , , .. . a, E M, b,,, . . . b, E N mit (a,,,. . .)’=(b,,,. . . )’ ist M k p(ao,.. . ) gdw. N I= ~ ( b , ,.,. .). Das ist leicht direkt zu beweisen. Man kann das Problem aber auch in L z ubersetzen. Nach 2.1 gehort zu cp ein r.a. A, fur das wir w(M)I= S A ( W ( ( a o , . .))) gdw. w ( N ) I =SA( W((b,,,. . . ))) zeigen miissen. SA ist aber eine Formel ohne Mengenquantoren (Z!)und W ( ( a , , ., . .)) = W(b,,,. . .)). 2. Fur Mengen von 3,-Aussagen ist die Klasse der a.a. Gruppen kompakt. Es gilt sogar: Jede 3,-Aussage, die in einer a.a. Gruppe erfullbar ist, gilt in jeder a.a. w-universellen Gruppe. D a eine a.a.Gr. M genau dann w-universe11 ist, wenn W * ( M )= B ( w ) , mussen wir wegen 2.4.2 gerade zeigen, dal3 jede S,-Aussage, die in einer a.a. Unterstruktur von (a, B ( w ) ) gilt, auch in ( 0 ,B ( w ) ) gilt. Wir zeigen allgemeiner: “ ( w , Yl)C ( w , Y2) seien zwei algebraisch abgeschlossene w Modelle, p ( X ) aus S3, B E Y,. Dann folgt aus (0, YI)+ p ( B ) , dal3 (w, Y 2 ) tp ( B ) . ” Beweis: Wegen 2.3.2(ii) kann man p ( X ) in der Form Sp(X) fur ein r.a. A annehmen. Wenn es also ein C E 9, mit (w, 9,) I= i 3 w E A (B, C )E 2;gibt, so erfiillt C auch (w, Y2) i= -I 3 w E A (B, C)E 2;d. h . (w, Y2)I= S f ( B ) . 3. Die Klasse der a.a. Gruppen ist auch bzgl. V,-Aussagenmengen kompakt - 3.5.3 ist also optimal. Wir werden namlich spater zeigen, dal3 jede V,-Aussage, die in einer a.a. Gruppe erfullbar ist, in jeder endlich generischen Gruppe gilt. Die bis jetzt konstruierten 3,-Aussagen waren alle schon in Mo gultig, also auch (man vergleiche 2 der letzten Bemerkung) in allen MA, A C w, und Ma. Wir werden jetzt 3,-Aussagen angeben, die erfullbar sind, in M a aber nicht gelten. Ein Baum ist eine Menge von Godelnummern von endlichen Folgen von naturlichen Zahlen, die mit der Godelnummer einer Folge auch die Godelnummern der Anfangsstucke dieser Folge enthalt. Ein unendlicher Zweig eines Baumes ist eine Teilmenge der Form {Godelnummer von ( f ( O ) , . . . ,f(n - 1)) n E w } fur eine Funktion f:w-.w.
I
534
M.Ziegler
Satz 3.7. Z u jedem r.a. Baum T kann man eine 3,-Aussage qT
angeben, die genau dann in einer a.a. Gruppe M gilt, wenn T einen unendlichen Zweig aus W * ( M ) besitzt.
Beweis. k sei die Godelnummer der leeren Folge. Fur Godelnummern von Folgen bedeute sIunvs2,daB s1 und s2 keine gemeinsame Fortsetzung haben; sIanf sZ bedeute, daB s1 ein echtes Anfangsstiick von Sz ist. Die folgenden Formeln sind aus 23,: u E Y u E T, k E Y, ( u o u n v u l ~ u o ~ Y ) + u l U~ ~Y E,Y + 3 u l E Y u o a n f u l , (uoanf u1 A u1E Y)+ u o E Y. Also sind Y C T, Y) = Vul, uo((uounv u1 A uoE Y)+ u1 6! Y), J12( Y) = Vuo(uoE Y + 3 u I E Y uoanf ul)'und +3( Y) = Vuo,ul((uOanfu l A u1E Y)+ uoE Y). Formeln aus Pz. B ist ein unendlicher Zweig von T gdw. $,(I?) gilt, wobei II, die P2 Formel Y C T A k E Y A tJl(Y) A &(Y) A +h3(Y) ist. Wenn M a.a.Gr. ist, gibt es also genau dann einen unendlichen Zweig aus W * ( M ) ,wenn w ( M )C 3 Y I)( Y). D a 3 Y +( Y) eine S,-Aussage ist, folgt nun die Behauptung aus 2.4.2. 0
-
Folgerungen 3.8. 1. Es gibt eine 33-Aussage, die in M a nicht gilt, aber auf alle a.a. w - uniuersellen Gruppen zutrifi. 2. Die V3-Theorie der a.a. Gruppen ist eine uollstandige IIkMenge. D. h. die Menge der Godelnummern der V3-Satze, die in allen a.a. Gruppen gelten, ist eine uollstandige Il:-Menge. 3. Die elementare Theorie der a.a. Gruppen ist uollstandig ZZ:. 4. Die elementare Theorie der a.a. Gruppen lapt sich nicht durch eine Menge uon V2- und %-Amsagen axiomatisieren. Beweis. 1. T sei ein rekursiver Baum, der einen unendlichen Zweig besitzt, aber keinen arithmetischen Zweig. qT hat die gewunschten Eigenschaften (siehe ([24] S. 419)). 2. Aus ([24] S. 396) folgt, daB die Menge der Godelnummern (d. h. der ,Z:-Indizes) der r.a. Baume ohne unendlichen Zweig eine vollstandige ZZ:-Menge 9 ist. Wir haben in 3.7. jedem r.a. Baum T eine %-Amsage pT zugeordnet. T hat einen unendlichen Zweig gdw. es eine a.a. Gruppe gibt in der qT gilt. Die Godelnummer des Baumes T gehort also genau dann zu 9, wenn i q T in allen a.a.Gr. gilt. D a die Zuordnung T QT effektiv ist und leicht injektiv gemacht werden kann (injektiv auf den Godelnummern), ist somit 9 l-reduzierbar auf die V,-Theorie der a.a. Gruppen. 2 und 3 sind also nachgewiesen, wenn wir zeigen konnen, daB die Theorie der a.a. Gruppen Il: ist. Man kann jede abziihlbare Teilmenge
Eigenschaft R , ( X ) von Mengen zuordnen, so daB fur alle A C w (w, A)+ cp gdw. R, ( A ) . Dabei ersetzen wir einfach uberall in cp den Quantor 3 X - - - X . . .durch 3 n - - - { i( ( i , n ) E A } . - - .cp gilt nun in allen a.a. w-Modellen, wenn fur alle A R..,. ..,( A ) . Die LZ-Theorie der a.a. w-Modelle ist also IZ:. Wegen 1.2 ist also auch die elementare Theorie der a.a. Gruppen IZ:. 4. Man kann jeder %-Aussage p effektiv eine r.a. Menge A zuordnen, sodaB p in einer (allen) a.a. Gruppe(n) gilt, wenn S?(w). Ebenso ordnet man jeder Vz-Aussage i S ? ( w ) zu. Die g2-Theorie der a.a. Gruppen ist also Zi (sogar vollstandig Z!),die Vz-Theorie II!. Die Menge der Folgerungen aus der Vz U %-Theorie der a.a.Gr. ist 2:; es gibt also sogar V3-Aussagen, die in allen a.a.Gr. gelten, aber nicht aus der Vz U g2-Theorie der a.a.Gr. folgen. 0
Bemerkung. Wir werden spater sehen, daB die 3,-Theorie der a.a. Gruppen vollstandig 2: ist.
{sc, 1
I
0, {Sc.,)c,dEM})=w(M).
I
4. Eine Interpretation von o(M)in M Im ersten Satz dieses Abschnitts betten wir w ( M ) in M ein. (M a.a.Gr.). Dann geben wir Formeln an, die diese Einbettung elementar beschreiben, und bekommen so einen neuen Beweis fur 2.4.5. Diesen Satz beweisen wir am SchluB noch einmal, indem wir die Formeln von L * in (in a.a.Gr.) gleichbedeutende elementare Formeln ubersetzen (siehe 2.4.10). 2.4.5 folgt dann mit 1.8.2 und 1.7. Da Z losbares Wortproblem hat, ist Z in jede a.a.Gruppe M einbettbar (11, 2.8). Es gibt also in M Elemente a unendlicher Ordnung. l a ) sei die von a erzeugte Halbgruppe {a" n E w } . Durch a" @a"' = an+"' und a " 0a m= an'"'definieren wir zwei Operationen auf ( a ) .Fur alle c, b E M setzen wir s c . b = { a "E l a ) (c ( a " b a - " )= (a"ba-")c}.(Das entspricht der Konstruktion in ([28]).) Satz 4.1. ( l a ) , @,
I
Beweis. Wenn g c , b = { n a" E s , . b ) ( = { n c ( a " b a - " )= (a"ba-")c)), mussen wir zeigen, daB W * ( M )= b c, b E M } . Einerseits ist klar, daB fur alle c, b E M $ , b S 1W ( ( a ,b, c ) ) und also g c , b E W * ( M ) .Sei
M. Ziegler
536
(a) in M einbettbar. Wir erhalten so ein b E M, so daB a, b (a, b) frei erzeugen. Wir setzen d; = a "ba "", Die Folge (d n E Z) ist frei und jedes Element W(a, b) von (a, b) = F HiBt sich eindeutig in der Form ZW'(d), z E Z, schreiben. Wenn U die von {dn E A} erzeugte Untergruppe von Fist, ist W(a, b) E U gdw. z = 0 und in W'(xo, ... ) nur die x, mit n E A vorkommen. Also ist W(F U) ~+ A. Nach III, 2.14 gibt es ein c EMmit U = {f E Fife = ct}. Es ist dann n E A gdw. d; E U gdw. cd; = d.c gdw. n E 0
In In I
a
s.;
Satz 4.2 (siehe [28]). Wir definieren die folgenden Formeln
=x 2) II -, E,(x, x 2) Y2 =
ipo(x): = 3 WI (WIXW~1 ip ,(y I, Y2> z) : = y I
•
(siehe Beweis V, 1.1)
Z
ip . (x, YI, Y2, z):= 3w (wxw "
= y, II WY2W-1 = z) V(y, = ell z =e)
ipw(x,y):= 3YI'" y43w,··· W4Y= WI' W2' W3' W4" 4
/\ (E1(y"x)
II
ip.(x,y" y" w.)
i=J
ip,(y, WI, W2):= W1(YW2y-l)
=(yW2y-I)W1
Dann gilt fiir aile a.a. Gruppen M und a EM 1. a hat genau dann unendliche Ordnung, wenn M 1= ipo(a). Wenn a unendliche Ordnung hat, ist fiir aile b, c E M, n, mEw. 2. bEl a) gdw. M 1= ipw (a, b) _ 3. c = anEBa m gdw. Ml=ip+(an,am,c) 4. c=anOa m gdw. Ml=ip.(a,an,am,c) 5. anESc.bgdw. Ml=ip,(an,c,b). Beweis. 1. a hat unendliche Ordnung gdw. (a 2) == (a) und (a 2) eine echte Untergruppe von (a) ist d. h. a (a 2). 3. ist trivial. Wenn ani e, gibt es in Meinen inneren Automorphismus T, der a in an iiberfiihrt. Es ist dann T(a m ) = a":", Aus dieser Uberlegung folgt 4. (jl+ und cpo definieren sogar Addition und Multiplikation auf (a) und mach en so (a) isomorph zum Ring der ganzen Zahlen. In diesem Ring sind aber die natiirlichen Zahlen die Summen von vier Quadraten. Das beweist 2. 5 ist trivial. 0
e
Foigerung 4.3 (siehe 2.4.5). Man kann fiir jede L 2-Formel p(v, X) eine Lw.w-Formel p(y, w, w', x) angeben, so daft fiir aile a.a. Gruppen M, ii E w, a, 6, c E' M, wenn a unendliche Ordnung hat,
w(M)1= p(ii, S"".'1" SCI,b" ... ) gdw.
M 1= p(a nll, an" ... ,
c, 6, a).
A lgebraisch abgeschlossene Gruppen
537
Die Aussage p gilt also genau dann in w ( M ) , wenn MI= 3 x ( i ; ( x ) A qn(X)). Beweis. Wir definieren i; induktiv: Wir kiinnen dabei annehmen, dal3 p nur atomare Formeln der Form u I + U? A u3, u I . u Z = u3 und Oil E X o enthalt. Wir setzen dann (Ul
und
+ uz = u3)" = cp+(y,,yz, y 3 ) , ( U l . uz = 4" = cp .(x, Y l . y z 7 y 3 )
(~n€X~i)"=cp,(yi,,wii,w6),(1~)"=1b~ ( p A p')" = b A b', ( 3 x o p ) ' = 3W o , WI!@ (3Vii
p ) " = 3yn(b
A
(x, y o ) ) .
c ~ w
0
Wir wollen eine Ubersetzung von L *-Formeln in elementare Formeln angeben. Dazu ubersetzen wir zunachst L * in eine n e u e Sprache L**. Diese Ubersetzung nutzt aus, dal3 wir in algebraisch abgeschlossenen Gruppen jede endlich Folge bo, . . . , b, in der Form Nl,(c,,cz),. . . Nn(cI,cz) schreiben konnen. L * * hat Variable uo, . . . fur naturliche Zahlen und x, y, . . . fur Gruppenelemente. Zahlterme werden aus 0 und den Zahlvariablen mit + und . gebildet. Individuenterme aus e, den Variable fur Gruppenelemente durch "Gruppenmultiplikation", und wenn g ein Individuenterm und f ein Zahlterm ist, ist g' ein Individuenterm (mit der Interpretation g . . . g -t mal). Die Formeln bauen sich wie ublich aus den atomaren Formeln g l = g z und t l fz ( g , Individuenterme, t, Zahlterme) auf. Offenbar beschreiben die Formeln von L * * arithmetische Eigenschaften von Gruppenelementen. Also ist nach 1.7 (in allen Gruppen) jede L **-Formel durch eine L *-Formel ausdruckbar. In a.a. Gruppen gilt auch die Umkehrung:
- -
Lemma 4.4. Z u jeder L *-Formel cp(uo,.. . X I ) , . . . a,,... ) gibt es eine L * * Formel cp * ( u o , .. . w,,,. . . xi], . . . zo, . . .z6,. . . ), so dap fur alle a.a. Gruppen M, an,. . . € M, Co,. . . E M, ~ 6 ,... E M und n o , .. . , m o , .. . E w M C cp (no, . . . an, . . . ,(Ni(Co, c ; ) i < mo), . . . ) gdw. M I = cp * ( n o , . . m,,, . . . an,. . . en,. . . CC . . . ).
I
Beweis. Wir ersetzen in cp die Folgenvariable a,durch die Variablen z,, z:, w, mit der Idee, dal3 l(a,)= w, und a, = N o ( Z , , 2:). . . Nw,-l(z,,2:). Wir nehmen an, daB die atomaren Formeln von cp nur entweder rein "zahlentheoretisch" sind oder atomare Formeln von L,, oder von der Form a,( v , ) = x, oder l(a,)= u, sind. Fur zahlentheoretische atomare Formeln und atomare Formeln setzen wir cp * = cp. Wir definieren
538
M. Ziegler
x,) v (w, G u, A x,= e ) und (u, (u,) = x,)* = (u, < w, A N,,(z,,z:) (l(a,) = u,) = w, A u,. ( N , ( x , y ) ist der Individuenterm [[x, y2"+'],x ] aus L * * . ) Wir gewinnen nun cp * durch die rekursive Definition ( i q ) * = i c p * , (cp A +)* = c p * A +*, (3u,cp)* = 3u,cp*, (3x,cp)* = 3x,cp* und (3a,cp)*=3w,3z,,z:cp*. 0 Satz 4.5. Z u jeder L**-Formel cp(uo,... X I ) , . . . ) kann man eine LW,-Formel 6(yll,.. . x(,, . . . x ) angeben, so dap fur alle n o , .. . E w und
a, ao,a , ,. . . aus einer a.a. Gruppe M, a uon unendlicher Ordnung, M t= cp (no,. . .a(),. . . ) gdw. M != . . a l l , .. . a ) . Die Aussage cp gilt also genau dann in M , wenn MI= 3 x ( @ ( x ) A cpo(x)). $(a"~l,.
Folgerung. Z u jeder L*-Aussage kann man eine in algebraisch abgeschlossenen Gruppen aquiualente elementare Aussage angeben. Beweis. Wir ubersetzen zuerst die Formel x = x,: Nun ist aber, wenn a unendliche Ordnung hat, b" = c gdw. (6,c)' ein homomorphes Bild von (a, a")' ist. In 11, 2.5 wurde explizit ein endliches Gleichungssystem H z ( d l ,d z ,d l , d,, z,, z2) angegeben, das in einer Obergruppe genau dann losbar ist, wenn ( d ? ,d4)' homomorphes Bild von ( d , , d2)' ist. Wir setzen also (x: = x,). = 3 z l ,z2 A Hz(x,y,, x, x,, z , , z2). Wir konnen annehmen, dal3 cp sonst nur noch atomare Formeln der Form u, + u,= Uk, u, * u,= u k und x, .x, = Xk enthalt. Wir setzen (u, + u, = U k ) . = cp+(y,,y,, y k ) , = (x, y,, y,, y k ) und (x, . XI A x k ) " = (x, . XI x k ) . Wir (u, * 0, definieren dann rekursiv uber den Aufbau von cp ( i c p ) ^ = 1 4 , (cp A +)^ = @ A 4, (3x,cp)^ = 3x,@ und (3u,cp)^= 3y, (4 A cpW(x,y,)). Man sieht leicht, dal3 die so definierten @ die gewunschte Eigenschaft haben. 0 V. Generische Gruppen
"Forcing in der Modelltheorie" wurde in ([l, 14, 15, 23)) benutzt, um a.a. Strukturen (a.a. Gruppen) zu konstruieren. Wahrend ein grol3er Teil dieser Konstruktionen durch die einfachen "schrittweisen" Methoden der Kapitel I und I11 ersetzt werden kann, (siehe z. B. I, 3.5, I, 4.7, die Konstruktion von Mo (I, 3.9), 111, 1.7) sind generische Gruppen weiterhin wegen ihrer relativ einfach zu bestimmenden elementaren Eigenschaften von Interesse (siehe z. B. V, 3.4, VI, 2.9, VI, 3.3.7, VI, 4.1(2), 4.4(2), 4.5(2)). Im ersten Abschnitt von V behandelen wir die unendlich-generischen Gruppen. Das bekannte Resultat, dal3 die unendlich-generischen
Algebraisch abgeschlossene Gruppen
539
Gruppen gerade die elementaren Substrukturen der a.a. w -universellen Gruppen sind, fuhrt zu einer einfachen Kennzeichnung: (1.5) Beim Ubergang zu w ( M ) , dem zur a.a. Gruppe M gehorenden w-Modell, entsprechen die unendlich-generischen Gruppen gerade den elementaren Unterstrukturen von (w, 9 ( w ) ) . Dieser Satz macht die Untersuchung der m-generischen Gruppen sehr leicht. Wir erhalten z.B., wenn V = L, eine kleinste a-generische Gruppe M“. Das Skelett von M” besteht aus den e.e. Gruppen mit analytischem Wortproblem (1 .S). Die Untersuchung der (endlich-) generischen Gruppen beginnen wir im zweiten Abschnitt. Wir geben eine neue Definition der Erzwingungsbeziehung Il- : Eine Bedingung p (eine endliche Menge von atomaren und negiert atomaren Aussagen aus L G ( C ) ,die mit T G vertraglich ist) erzwingt die Eigenschaft d,wenn im dem Spiel, in dem zwei Spieler I und I1 abwechselnd Elemente einer aufsteigenden Folge von Bedingungen p = p o C p , C p 2 C . . * wahlen, I eine Strategie hat, die gewahrleistet, daB - wie auch I1 spielt - die von U{p, i E w } beschriebene Gruppe die Eigenschaft d hat. Dieser Zugang hat den Vorteil, daB keine syntaktischen Uberlegungen vorkommen. Man kann z. B. formulieren: O!= “ . . . ist generisch” (2.4). Aul3erdem wird der Zusammenhang mit den “schrittweisen” Methoden klar. Aus 2.6 folgt, daB fur elementare Eigenschaften d unsere Definition rnit der ublichen Definiton von IF iibereinstimmt. Im dritten Abschnitt bestimmen wir die durch “Vxo,. . . x, S?( W((xo,. . . x,)))” ausdruckbaren Eigenschaften generischer Gruppen. (Das sind fur r.a. A gerade die durch V,-Aussagen beschriebenen elementaren Eigenschaften.) Als Folgerung bekommen wir den Satz aus ([14]), daB in generischen Gruppen nicht MAX gilt: es gibt keine universelle e.e. Untergruppe. SchlieBlich geben wir noch zwei Charakterisiereungen der generischen Gruppen, o h n e modelltheoretische Begriffe zu benutzen: Die Gruppe M ist generisch, wenn jede “dichte” arithmetische Menge von endlichen GI. & Ungl.systemen ein losbares System enthalt (3.8). Auf ahnliche Weise kann man die endlich erzeugten Untergruppen der generischen Gruppen (die “subgenerischen” Gruppen) charakterisieren. Die generischen Gruppen sind dann die a.a. Gruppen, deren endlich erzeugte Untergruppen subgenerisch sind (3.11, 3.12). Im VI Kapitel werden wir die generischen Gruppen durch rekursionstheoretische Eigenschaften des zugehorigen w-Modells kennzeichnen.
I
540
M. Ziegler
1. Unendlich generische Gruppen
Algebraisch abgeschlossene Gruppen sind “grol3e” Gruppen. Ein Gleichungssystem, S ( a , i ) , das uber (5)losbar ist, ist schon in der Gruppe selbst losbar. Wir werden einige Verstarkungen dieser Definition behandeln. Die starkste derartige Forderung ist, dal3 jeder Isomorphietyp einer endlich erzeugten Obergruppe von (ii) schon in M realisiert wird. Diese M sind gerade die a.a. w-universellen Gruppen. Denn aus I, 2.3.1 und I, 3.9.1 folgt Lemma 1.1. Eine Gruppe M ist genau dann a.a. und w-universell, wenn sich fur alle ii E M alle endlich erzeugten Obergruppen von (Z) uber ( a ) in M einbetten lassen.
Weil es 2‘” endlich erzeugte Gruppen gibt, hat jede w-universelle Gruppe mindestens die Machtigkeit 2‘”. Aus I, 1.4 folgt, daR sich jede Giuppe in eine a.a. w-universelle Gruppe einbetten 1aBt (man wahlt in I, 1.4 3% so, daR jede endlich erzeugte Gruppe zu einer Gruppe aus X isomorph ist). I, 2.11.5 zeigt, daR man zu jedem G eine a.a. w-universelle Obergruppe M finden kann mit I M 1 = max (2”, 1 G I ). Wir suchen abzahlbare a.a. Gruppen, die ahnlich “groB” sind wie a.a. w-universelle Gruppen. Die endlich erzeugten Untergruppen (a,,,. . . a,) sollen in diesen Gruppen dieselben elementaren Eigenschaften haben wie in w-universellen a.a. Gruppen. Diese elementaren Eigenschaften hangen nur vom Isomorphietyp von (a,,,. . . )’ ab; denn aus I, 2.10.l(a) folgt: Wenn M , N a.a., w-universe11 sind, ii E M , 6 E N, so ist (M,ii) = p (N, 6), also auch ( M , ii) = (N, 6), falls (ii)’ = (6)’. (Wenn M, N beliebige a.a. Gruppen sind, gilt nur ( M , ii) E ~ * ( N6), (IV, 3.6.1).) Fur endliche erzeugte Gruppen F und ii E F konnen wir also sinnvoll definieren: (cp ( 2 ) eine Formel aus Lqw)F IF cp (a) gdw. cp (ti) in einer (allen) a.a. w-universellen Obergruppe(n) von F gilt. Mit folgendem Schema konnen wir durch Rekursion uber den Aufbau von cp “berechnen”, ob F IF cp (a): Lemma 1.2. 1. Wenn cp quantorenfrei ist, ist F It cp ( a ) gdw. F I= cp (a). 2. Fb 3icp(u, 2 ) gdw. es eine e.e. Obergruppe G 3 F und 6 E G gibt rnit G IF cp (ti, 6). 3. F It cp ( a )A +(ti) gdw. F I t cp (6)und F IF +(a). 4. F I t i cp ( t i ) gdw. nicht F It cp (a).
Algebraisch abgeschlossene Gruppen
54 1
Folgerung. “(ii) It cp(&)” ist fur jedes cp eine analytische Eigenschaft von W ( ( i i ) ) .Das beweist man leicht durch Induktion iiber den Aufbau von cp, ist aber auch auf folgende Weise einzusehen: Nach IV, 1.2 gibt es eine L’-Formel cp(X), so dap fur alle a.a. w-universellen Gruppen M und alle ii E M MI= cp(Z) gdw. (w, B ( w ) ) C + ( W ( ( 6 ) ) )Die . genaue Angabe von 6 in IV, 2.1.1 zeigt, dap - wenn cp eine 3,-Formel ist “ ( a ) l tcp(&)” eine ZL-z-Eigenschaft von W ( ( i i ) )ist ( n 2 3). Definition 1.3. M heiBt unendlich generisch (m-generisch), wenn fur alle ii E M und alle Formeln cp(2) MI= cp(u) gdw. (ii)Itcp(u). Satz 1.4. 1 . N sei Untergruppe der m-generischen Gruppe M. N ist genau dann m-generisch, wenn N elementare Unterstruktur von M ist. 2. N ist genau dann m-generisch, wenn N elementare Unterstruktur einer a.a. w-universellen Gruppe ist. 3. m-generische Gruppen sind algebraisch abgeschlossen. 4. Jede Gruppe G is in eine m-generische Gruppe der Machtigkeit 1 G 1 + w einbettbar (es gibt also 2” viele abzahlbare m-generische Gruppen ) . 5 . M ist genau dann a-generisch, wenn fur alle ii E M und alle q ( i , j ) , fur die (ii)Il- 39cp(ti, j ) , 6 E M existiert mit (6, 6 )It cp(ti,6). 6. Alle m-generischen Gruppen sind elementar aquivalent. Folgerung aus 6. m-generische Gruppen erfullen nicht max, es gibt also auch keine universellen endlich erzeugten Untergruppen. Jede %Aussage, die in einer a.a. Gruppe gilt, gilt in allen a-generischen Gruppen. (Man vergleiche dazu IV, 2.8.2, die Bemerkung nach dem Beweis von IV, 3.3 und IV, 3.6.2.) Bemerkung zu 5. Wegen IV, 3.6.1 gilt in allen a.a. Gruppen M, ii E M, fur alle g2-Formeln cp ( 2 ) M I= cp (Z) gdw. ( t i )IF cp (ti). Umgekehrt sind die Gruppen mit dieser Eigenschaft algebraisch abgeschlossen. Ahnlich wie in 5 gilt: Die Gruppe M ist genau dann algebraisch abgeschlossen, wenn fur alle ti E M und alle 3,-Formel cp (2, j ) , fur die ( i i ) I t 3 j p ( i i 7 j )6 , E M existiert mit (ti,6)bcp(ti,6). Beweis. 1 folgt unmittelbar aus der Definition. 2 folgt aus 1, weil a.a. w-universelle Gruppen m-generisch sind und jede Gruppe in eine a.a. w-universelle Gruppe einbettbar ist. 3 folgt mit I, 1.5.2 aus 2. 4: Man bettet G zuerst in eine a.a. w-universelle Gruppe ein. Die gesuchte Gruppe ist eine elementare Unterstruktur von geeigneter Machtigkeit, die G enthalt und mit dem Satz von Lowenheim-Skolem gewonnen
542
M. Ziegler
wird. 6 folgt sofort aus der Definition. 5: M sei Untergruppe der m-generischen Gruppe N. M ist unendlich generisch gdw. M < N gdw. (Nach einem Kriterium von Tarski) fur alle q ( i , j ) , ii E M
N I= 3yq(ii,j ) j ex.
EM
N I= ~ ( i i6). ,
Daraus ergibt sich die Behauptung. 0 1.4.2 liefert eine sehr einfache Beschreibung der m-generischen Gruppen. (Man vergleiche auch [28].) Satz 1.5. 1. Eine algebraisch abgeschlossenen Gruppe M ist genau dann
m-generisch, wenn w ( M )< ( w , 9 ( w ) ) . 2. Sei ( w , 9')< ( w , 9 ( w ) ) . D a m gibt es eine m-generische Gruppe M mit w ( M )= (0, 9')und I MI = 1 91.Wenn 9'abziihlbar ist, ist M eindeutig bestimmt.
Beweis. Eine a.a. Gruppe M ist genau dann w-universell, wenn W * ( M ) =9 ( w ) . 1 folgt so aus 1.4.2 und IV, 2.4.6. Sei Y wie in 2. (0,9 ')ist a.a. (vgl. Bem. vor IV, 1.2). Sei (G, i E I ) eine Familie von e.e.Gr. mit Y/=* = { W(G,)/=* i E I } . Setze No = @ (G, i E I). Da Y/=* ein Ideal ist, ist W * ( N o ) =9'.Sei N, schon konstruiert mit W * ( N , )= 9.S J ( i J ) j, E J seien alle endlichen Systeme von Gleichungen uber N,, I J 1 = I 9'1.Wir bilden das amalgamierte Produkt aller freien Losungen N,,, = * N , ( ( N ,* F r ( i J ) ) / ( S J ( i J ) )jNE J). Wegen 1.6.4 ist W * ( N , + J =9 Setze M = u{N, i E w } . 0
1
1
1
1
1
Folgerungen 1.6. 1. Wenn M m-generisch ist, ist
Th(M)=,Th(w,9 ( w ) ) .
2. F und G seien endlich erzeugte Gruppen, W ( F ) analytisch in W ( G ) .Dann ist F in jede m-generische Gruppe einbettbar, in die G einbettbar ist (ugl. 111, 1.4). 3. M a ist in jede m-generische Gruppe einbettbar. 4. U C F, U C G seien e.e. Untergruppen der m-generischen Gruppe M. Dann ist F * " G < M. Beweis. 1. Nach IV, 2.4.5 lassen sich die Aussagen uber M und w ( M ) eff ektiv ineinander ubersetzen. 2. gilt, weil, wenn (w, Y)< ( w , 9 ( w ) ) , jede Menge, die analytisch in einem Element von Y ist, selbst Element von Y ist. 3. D a alle arithmetischen MTngen analytisch in W ( E ) sind, ist fur jede m-generische Gruppe N W * ( M a )C W * ( N ) . Die Behauptung folgt nun aus 111, 3.3.2.
543
Algebraisch abgeschlossene Gruppen
4. Wenn W ( F )=zeW ( G ) ,ist W ( F ) analytisch in W ( G ) .4 folgt also aus 2 und 111, 2.16. Der letzte Satz gestattet uns eine weitere Charakterisierung der generischen Gruppen, die zeigt, daR m-Generizitat eine naturliche Verstarkung der Algebraischen Abgeschlossenheit ist.
m-
Satz 1.7. Die Gruppe M ist genau dann a-generisch, wenn es zu jeder analytischen Klasse 9 C P ( w ) und fur alle 6 E M, fur die es 6 in einer Obergruppe uon M gibt mit W ( ( 6 ,6))E 9,Z E M mit W ( ( 6 ,E ) ) E %; gibt.
Bemerkung. Nimmt man fur die 9 in 1.6 nur r.a. Hornklassen, charakterisiert die Bedingung von 1.6 gerade die a.a. Gruppen.
SO
Beweis. Sei die Bedingung fur M erfullt. Wir zeigen, daR M die Eigenschaft von 1.4.5 hat. Sei 6 E M und (6)It 3jjcp(a, 9). (6, d ) It c p ( i , 6 ) ist eine analytische Eigenschaft von W((ii,6)).Da fur eine Obergruppe (6, E ) 3 (2) (ii, E ) It cp(& E) und man wegen der Amalgamierungseigenschaft der Klasse aller Gruppen E aus einer Obergruppe von M wahlen kann, existiert nach Voraussetzung ein 6 E M mit (ii,6)IF cp (i,6 ) . Wir zeigen nun, daR umgekehrt alle m-generischen Gruppen die angegebene Eigenschaft haben. M, 9 C 9 ( w ) und 6 seien wie in 1.7. %; werde durch die L2-Formel cp (X) beschrieben, d.h. A E %; gdw. (o, P(o))!=cp(A). DaB fur eine Obergruppe (ii, 6) von ( 6 ) 6 = ak,. . . a,) W((ii,6))E 9 bedeutet (wenn ii = a o , .. . ( o , P ( w ) ) ! = 3 Y ( Y J k =W((ii))hYEW,hcp(Y)).Nach 1.5ist w ( M ) C 3 Y ( Y J k =W ( ( i i ) ) ~ Y E W , ~ c p ( Y ) ) . S e i a l s o ( M ist ohomogen, man beachte den Beweis von IV, 1.2) E E M mit o(M)!= cp(W((ii,E))). Dann ist wieder wegen 1.5 (o,P(o))l=cp(W((ii,E))),d.h. W ( ( i , E ) ) E9. 0 Satz 1.8 ( V = L). 1.
Es gibt eine kleinste m-generische Gruppe M o :M o ist in jede m-generische Gruppe einbettbar. ( W * ( M o )besteht gerade aus den analytischen Mengen.) 2. Fur e.e. F, G sind iiquiualent: (a) W ( F ) ist analytisch in W ( G ) , (b) F ist in jede m-generische Gruppe einbettbar, in die G einbettbar ist (ugl. 111, 1.8.1). Bemerkungen. M o ist durch die angegebenen Eigenschaften eindeutig bestimmt. Es gibt keine kleinste a.a. Gruppe (111, 4.5 zu 3). ( V = L ) W ( F ) analytisch gdw. F in alle m-generischen Gr. einbettbar.
M. Ziegler
544
Beweis. 1. Wie in 2 folgt aus V = L, daB (w, d )< (w, B ( w ) ) - d die Klasse der analytischen Mengen. Mn sei nach 1.5.2 die abzahlbare agenerische Gr. rnit W * ( M n= ) d. Aus 1.6.2 folgt, daB M” in jede mgenerische Gruppe einbettbar ist. 2. Aus V = L folgt: R sei eine n!,-Relation zwischen Mengen. Wenn fur ein A C w { X R ( X , A ) } # 0, gibt es ein B, das n!,+, in A ist und R ( B , A ) erfullt. Wenn nun W ( F )nicht analytisch in W ( G )ist, ist nach dieser Bemerkung fur X = { X X analytisch in W ( G ) }( w , X ) eine elementare Unterstruktur von ( w , B ( w ) ) . Nach 1.5.2 gibt es ein w-generisches M mit W * ( M )= X. F ist nicht in M einbettbar. Damit ist (b)+ (a) bewiesen. (a)+(b) folgt aus 1.6.2. 0
1
1
2. Forcing in der Gruppentheorie Wir haben fruher (111, 1.7) eine algebraisch abgeschlossene Gruppe M, die eine vorgegebene e.e. Gruppe G mit nicht-losbarem Wortproblem nicht enthalt, auf folgende Weise konstruiert: M war eine Losung der Vereinigung einer aufsteigenden Folge po(E) C p l ( E ) C . . . von endlichen Mengen von Gleichungen und Ungleichungen. Wir muBten bei der Wahl der p , dafiir sorgen, daB alle endlichen Gleichungssysteme S, (t,i ) ,die ev. spater in einer Obergruppe von M losbar sind Losungen bekommen und daB, wenn G = (g), fur jede endliche Folge (E) von Termen (G,g’) nicht isomorph in (M, (2)) eingebettet werden kann. Wir numerieren diese Forderungen E, i = 0,1,. .. und wahlen, wenn p,-l schon vorgegeben ist, p, so, daB, egal wie weiter konstruiert wird, M die Forderung E erfullt. Da wir dabei nicht ausnutzen, daB p,-l “richtig” gewahlt war, ist es moglich, mit dieser Konstruktion noch mehr Forderungen zu erfullen. Die Sicherheit mit der dieses Verfahren zum Ziel fuhrt, kann man auch dadurch beschreiben, daB man sagt, daB I ein Spiel der folgenden Art immer gewinnen kann. B sei die Menge aller Gleichungen und Ungleichungen ohne freie Variable aus L ( C ) (die Menge der basischen Aussagen), P die Menge aller endlichen Teilmengen von B, die mit T G konsistent sind. Die Spieler I und I1 wahlen abwechselnd Elemente einer aufsteigenden Folge p o C pl C * von Elementen aus P. Sei d eine Klasse von Teilmengen von B und eine Folge (ein Spiel) C p l C * . . gewahlt. i E w } E d,ist “d gewonnen”. Eine Gewinnstrategie von Wenn I fur d ist eine Folge (fi i E a),f, : P + P, mit V i , p f, (p) 3 p , so daB in allen Spielen p o C pl C . . . , in denen I nach dieser Strategie spielt, d.h. fur alle i > 0 f, ( p 2 , - l ) = p z , und f o ( 4 ) = Po, d gewonnen ist. Ahnlich definiert man Gewinnstrategien von I1 fur d :eine Folge von
u{p,I
1
Algebraisch abgeschlossene Gruppen
545
Funktionen f,],f l , . . . von P nach P, so dal3 in allen Spielen poC p l C . . ., fur die f.a. i P ~ , = . ~fg(pz,),d gewonnen ist. I1 hat eine Gewinnstrategie fur d gdw. I fur jedes p E P eine Gewinnstrategie (f, i E w ) fur d rnit fo(4)= p hat (f “beginnt mit p”). p ist eine (I-) Gewinnposition fur d , wenn I eine Gewinnstrategie fur d hat, die rnit p beginnt. Wir schreiben dann p ltd (p erzwingt d).p E P ist eine 11-Gewinnposition fur d , wenn es ein p’ 3 p mit p ’ t d gibt. .d heiBt determiniert, wenn jedes p E P Gewinnposition fur d oder 11-Gewinnposition fur B(B)\ d ist (p kann nicht gleichzeitig Gewinnposition fur d und 11-Gewinnposition fur B ( B ) \ d sein. Alle d,die wir spater betrachten, werden determiniert sein).
1
Bemerkung. Es gibt zwei andere Begriffe fur “Gewinnstrategie” fur d, eine starkere und eine schwachere. Eine starke Gewinnstrategie von I fur d ist eine Funktion F : st‘ --* P mit p, C F ( ( p o , .. . ,p,)) f.a. i und p(,,. . . ,p, aus P, so dal3 jedes Spiel po C p, C . . rnit pz, = F ( ( p o , .. . , P ~ , - ~f.a. ) ) i d gewinnt. Eine schwache Gewinnstrategie von I fur d ist eine Funktion f : P-, P mit Vp f(p) 3 p, sodal3 cf i E w ) eine Gewinnstrategie von I fur .d ist. Aus jeder Gewinnstrategie von I fur d,($ i E a),erhalt man eine starke ) (pZt-,)setzt. Man kann Gewinnstrategie, indem man F ( ( p o , .. . ,P ~ , - ~=) f, sich n u n uberlegen, dal3 man aus einer starken Gewinnstrategie eine schwache Gewinnstrategie gewinnen kann. Die drei Begriffe sind also aquivalent. +
1
1
Beispiele. 1. Sei G eine e.e. Gruppe mit nicht-losbarem Wortproblem und d die Klasse aller Teilmengen von B, die mit T G konsistent sind und fur die jedes Modell, das von den Interpretationen der ci erzeugt wird, algebraisch abgeschlossen ist und G - bis auf Isomorphie nicht enthalt. Der Beweis von 111, 1.7 zeigt, dal3 4 It d. 2. cp sei eine atomare Aussage von L ( C ) . Dann ist 4 II- {G c B cp E G oder i cp E G}. Denn wenn I1 0 C pl wahlt, so kann I p2 3 p l so wahlen, dal3 cp E p2 oder i cp E p2. (Bei der Gewinnstrategie von I kommt es also nur auf f l an.) Statt p I t { G C B I . . . G . . . } s c h r e i b e n wirauch p I I - “ - - . G . * . ” .
I
Lemma 2.1. d,do, d l ,... seien Teilmengen von s ( B ) , r, p , q E P. 1. Wenn p Ik d,gibt es ein G E d,G U T G konsistent, p E G. 2. p u q P (p,q unvertraglich) gdw. p It “ q f G”. “ q C G ” ist determiniert.
M. Ziegler
546
3. Aus p Il- P ( B ) \ d folgt V q 3 p q Y d.Wenn d determiniert ist, ist auch 9 ( B ) \ d determiniert und p Il- P ( B ) \ d gdw. V q 3 p q Y d. 4. p Ik i E w } gdw. fur alle i p IF d,. Wenn alle d,determiniert sind, ist auch I i E o} determiniert. 5 . A ~ VSq 3 p 3 r 3 q 3i E w r IF d,folgt p IF U{d#i E w } . Wenn i E w } determiniert und alle d,determiniert sind, ist auch
n{d# I n{d,
1
u{d,1
Vq 3 p 3 r 3 q 3 i E w r l t - d ,
gdw. p l l - U { d , l i i E w } .
6. Wenn p C q, p Ik d,so auch q Il- d.
u{p,1
Beweis. 1. Man gewinnt G als i E w } aus einem Spiel po C p , C . . . , bei dem I seine bei p beginnende Gewinnstrategie fur d anwendet. 2. Wenn p und q unvertraglich sind, ist jede Strategie, die bei p beginnt, eine Gewinnstrategie fur “ q JZ G”. Wenn p und q vertraglich sind, kann I1 auf die Wahl po = p von I mit der Wahl p U q antworten. Also ist p J$“ q z G” und p ist 11-Gewinnposition fur “ q C G”. “ q g G” ist also determiniert. Nach 3 ist also auch “ q C G ” determiniert . 3. Sei p Il- P ( B ) \ d und q 3 p. Wenn I eine bei q beginnende Gewinnstrategie fur d hat, und I1 nach dieser Strategie spielt, gewinnt jedes Spiel p C q C p2C * d.Wid. Sei d determiniert, p Y P ( B ) \ d . Wenn also I po = p wahlt, kann I1 p , = q 3 p so wahlen, daB q keine 11-Gewinnposition fur 9 ( B ) \ d ist. q ist dann I-Gewinnposition fur d. 4. Jede Gewinnstrategie fur i E w } ist auch eine Gewinnstrategie fur jedes d,. Sei umgekehrt fur jedes i ( f t l j E w ) eine die mit p beginnt. ( ( i k , j k ) l k E o)sei Gewinnstrategie von I fur d,, eine Durchzahlung von w 2 mit j o = 0 und V k 3 k ’ > k (ill*=j k + 1, i k = ik,). (fa1 k E o)ist dann eine Gewinnstrategie von I fur i E w } , die bei p beginnt. Denn wenn I diese Strategie beim Spiel p = po C p , C . . . angewendet hat, gibt es fur jedes i eine Teilfolge . * C p,, C f ~ , , C) . * . Die Vereinigung p = po C p,, C f;@,() C C der p,, liegt aber in d,. 5 . Sei V q 3 p 3 r 3 q 3i E w r Il- d,. Dann hat I die folgende Strategie fur U{d, i E d}:I beginnt rnit p ; wenn I1 q 3 p wahlt, wahlt I ein r 3 q mit r Il- d,fur ein i und spielt mit einer Strategie fur d,, die bei r beginnt, weiter. Also ist p Il- U{d, i E w } . Der Rest von 5 folgt aus 3, 4 und der Darstellung U{d, i E w } = g ( B ) \ ( n { S ( B ) \ d , i E w } ) . 6. Wenn (f, i E w ) eine Gewinnstrategie von I fur d ist, die bei p beginnt, ist (f: i E w ) rnit f A = q und f: = f i sonst eine Gewinnstrategie von I fur d,die bei q beginnt.
n{d,1
n{d,I
I
I
1
1 I
I
Algebraisch abgeschlossene Gruppen
547
Beispiel 2.2. Wenn G C B, bezeichnet MG die von den Gleichungen in G prasentierte Gruppe. 1st G U T G konsistent, gelten alle Aussagen von G in MG. co,... d o , ... seien in MG durch go,... ho,... interpretiert. Es ist dann
0 It “MG ist algebraisch abgeschlossen”. Nach 2.1.4 mussen wir zeigen, dai3 fur jedes endliche Gleichungs- & Ungl.-system S(E,X’j 0It“Wenn S ( g , i ) uber MG losbar ist, hat S(g,X’)eine Losung in MG”. I hat folgende Gewinnstrategie: pl sei von I1 gewahlt; wenn pl U S(E, i ) (mit T G )konsistent ist, wahlt I p 2 = pl U S(E, 6) fur Konstanten 6, die in S und pl noch nicht vorkommen. Auf die weiteren Zuge von I kommt es nicht mehr an. X sei eine Klasse von determinierten Teilmengen von P(B). Wir nennen G C B X-generisch, wenn G Ergebnis eines Spiels ist, bei dem “um jedes Element von X gespielt wurde”. Das sol1 bedeuten:
Definition 2.3. G C B heiBt X-generisch, wenn G U T G konsistent ist und fur alle d E X G E d gdw. 3 p C G p It- d. Lemma 2.4. Wenn X abzahlbar, ist 011 “G ist X-generisch ”. (Also gibt es nach 2.1.1 X-generische Mengen.) Beweis. In Anbetracht von 2.1.4 mussen wir zeigen: Fur alle d E X, 1. 0lt- “ G E d es gibt p C G p It d” und fur alle p mit p It d, 2. 0lt “Wenn p C G, ist G E d”. 1. Wir geben eine Gewinnstrategie von I an: Wenn I1 pl wahlt, und es ein p 3 pl mit p k d gibt, setzt I p z = p (in dem Spiel ist dann offenbar “es gibt p C G, p I!- d” gewonnen). Wenn es solch ein p nicht gibt, ist pl C “G $Z d”.I spielt nun eine Gewinnstrategie von “G $Z d ” , die mit pl beginnt. 2. Sei pl gegeben. Wenn pl und p vertraglich sind, ist pl C p , U p I!- “G E d”.Sonst ist p , It- ‘‘pg G”. 0
+
Lemma 2.5. X sei eine Klasse uon determinierten Teilmengen von ??(B). Wenn d E X, sei auch P ( B ) \ d E X. Dann ist fur alle p E P und d E X p It- d gdw. fur alle X-generischen G p C G 3 G E d. Beweis. Wenn p It- d,G X-generisch, p C G, d E X , ist nach Definition G E 1.Sei andererseits p F SQ. Dann gibt es ein q 3 p mit
M. Ziegler
548
q It- $P(B)\d. D a q It- “G ist X-generisch”, gibt es ein X-generisches G, das q enthalt. D a Y ( B ) \ d E X,ist G @ d. 0 Wir interessieren uns fur elementare Eigenschaften von Gruppen, also fur jede Aussage cp E L ( C ) fur die Klasse A, = {G C B MG I= cp}. Wir schreiben p It (o fur p It- A,. In folgendem Lemma geben wir an, wie man p It- cp rekursiv “berechnen” kann.
I
Lemma 2.6. cp, Ic, seien Aussagen aus L ( C ) , p, q, r E P, konstante L ( C ) -Terme. 1. Wenn cp quantorenfrei ist, ist p Ik cp gdw. p kG cp. 2. plt-icp gdw. V q 3 p qIIccp. 3. p It- cp A cp gdw. p It- cp und p Ik Ic,. 4. p It cp v cp gdw. V q 3 p 3 r 3 q r Ik cp oder r It5. p It VXcp ( i )gdw. fur alle I p I t cp ( I )gdw. fur alle E p It- cp ( E ) . 6. 0lk “Es gibt c, MG I= t = c”. 7. Alle A, sind determiniert. 8. W e n n p ( ~ ) t t - c p ( ~i)s, t p ( l ) ~ t - c p ( IC) f a l l s p ( l ) E P ) .
+.
Folgerung. “ p I t cp (E)” ist fur jedes cp eine arithmetische Relation zwischen (den Godelnummern v o n ) p und E. M a n rechnet leicht nach, dap, wenn cp E V., n z=1, ist, diese Relation n”,,ist. Beweis. 1: Wenn p kG cp und das Spiel p o C pl C * * mit p beginnt, ist i E w } E A,. Also ist p It- A,. Sei p )cGcp. Dann gibt es ein q 3 p cp und die mit q kG i cp, also q It- “ G @ A,”. Daraus folgt p Determiniertheit von A,. 2, 3, 4, 7 und der erste Teil von 5 folgen sofort aus 2.1. 6: Sei p , E P. Wenn c weder in pl noch in t vorkommt, ist pl U { t l = c } E P. Aus pl U { t = c)’lt- t = c folgt die Behauptung. Wir zeigen den zweiten Teil von 5: Sei fur alle E p I1 cp(E) und I eine endliche Folge von konstanten Termen. Nach 6 ist dann p It “Es gibt c mit MG I= = ; E A cp (E)”. Daraus folgt p It cp ( I ) .8: Es ist klar, dal3 fur eine Bijektion T :C+Cp(E)lkcp(E) gdw. p ( ~ ( E ) ) l t c p ( ~ ( E ) ) .Sei nun p(E)It- c p ( E ) und p ( I ) gegeben. q E P sei eine beliebige Erweiterung von p ( r ) . Fur neue Konstanten d ist p ‘ = q U {to+ do, t l d l , . . . } U p ( 4 ) aus P. Die Bemerkung am Anfang zeigt p’I1 cp(d) und daher p’lt- cp(t). 0
u{p,I
Wir nennen G endlich generisch, wenn G X-generisch fur cp Aussage aus L(E)}. Das bedeutet also
X = {A,
I
Definition 2.7. 1. G C B heiDt endlich generisch, wenn G U T G
Algebraisch abgeschlossene Gmppen
549
konsistent ist und fur jede L(C)-Aussage cp MG C cp gdw. es ein p C G mit p IF cp gibt. 2. Eine Gruppe M heiBt (endlich) generisch, wenn fur alle Formeln cp(E), ii E M MI= c p ( i ) gdw. es 6 E M und ein p(E, d ) E P gibt rnit MI= p ( i , 6 ) und p(E, d ) IF cp(Z). Lemma 2.8. 1. G C B sei mit T G konsistent. Dann ist G genau danil endlich generisch, wenn MG generisch ist und fur jede atomare Aussage cp G t-cp oder G I - i c p . 2. (gi i E w ) sei ein Erzeugendensystem der Gruppe M und G die Menge aller Aussagen aus B, die in ( M ,g’) gelten. Dann is; M genau dann generisch, wenn G endlich generisch ist. 3. p(E, d ) IF cp ( E ) gdw. fur jede generische Gruppe M ii, 6 E M MI=p(i,6)+ MCcp(i).
1
Beweis. 1. G sei generisch, ii E MG, MG C c p ( i ) . D a MG von den Interpretationen der c‘ erzeugt wird, gibt es Terme F(d), Konstanten d aus C,die in MG durch h‘ interpretiert werden, mit ii = F(h’). D a MG I= cp(t(d)),gibt es ein q ( d , d r )C G rnit q IF cp(l(d)). Naturlich ist Ma I= q ( i , i‘). Wenn E neue Konstanten aus C sind, ist p = q U {to A _co, tl = cl,. . . } aus P. Dann ist p IF c p ( E ) und MG I = p ( i , i , h ’ ) . Wenn M G - C p ( i ) , p(E)IFcp(E) folgt MG I= c p ( i ) . Denn cp (2). D a aber p sonst gibt es 6 E Me, q(E, d ) E P, MG I= q(i,6), q IF i und q vertraglich sind, wurde p U q IF (1 cp A cp)(c) folgen. Wid. Sei nun umgekehrt MG generisch und erfulle G die genannte Bedingung. Sei p C G und p IF cp. Dann ist MG C p und somit MG C cp. Sei umgekehrt MG C cp. Dann gibt es ein q(E, d ) IF cp mit MG I= q(E, 6 ) fur 6 E M. Wir konnen annehmen, daB die d von den E verschieden sind. Es sei 6 = f ( j r )fur die Interpretationen j ‘ der Konstanten E’. Aus 2.6.8 folgt q(E, F(E’))IF cp. D a MG C q(E, ?(I?’)),ist G I-q(E,F ( E f ) ) . Also gibt es p C G rnit p t- A q(E, t(E’)). p IF cp. 2. Folgt aus 1, weil M = MG. 3. Die eine Richtung der Behauptung folgt sofort aus der Definition. Wenn p(E, d)JV cp(Z), gibt es ein endlich generisches G mit p C G und MG bi’ c p ( E ) . Es ist aber MG C p. CI
3. Endlich generische Gruppen In diesem Abschnitt behandeln wir zuerst einige einfache Eigenschaften generischer Gruppen. Dann geben wir verschiedene Charakterisierungen der generischen Gruppen. Satz 3.1. 1. Alle generischen Gruppen sind elementar aquivalent.
550
M. Ziegler
2. Generische Gruppen sind algebraisch abgeschlossen. 3. Die Theorie der generischen Gruppen ist rekursiu isomorph zu 0 c. ([28]). Beweis. cp sei eine Aussage aus L,,, p(E) It- cp. Wir zeigen @IFQ ; Es gibt sonst q ( d ) E P, q ( d ) l F i c p . Wenn fur die Permutation .rr von C T ( E ) von d disjunkt ist, sind p(.rr(E)) und q vertraglich. Aber q Up(.rr(E))It- i c p A cp. Wid. Damit ist 1 bewiesen. 2. Das endliche GI. & UngLsystem S ( 6 , i ) sei uber der generischen Gruppe M losbar. Wenn S keine Losung in M hat, gibt es 6 EM, p(E, d ) It- V f i A S(E, i ) ,M C ~(6,s). Da S ( & i )in einer Obergruppe von (5,6) Iosbar ist, sind p(E, d) und S(E, E’) fur neue C’ vertraglich. Es ist aber p C p U S(E, E‘) It- 32 A S(E, if). Wid. 3. Beim Beweis von 1 haben wir gesehen, dal3 eiae elementare Aussage genau dann in allen generischen Gruppen gilt, wenn sie von 0 erzwungen wird. Also ist Q genau dann aus Tf, der Theorie der generischen Gruppen, wenn die arithmetische Aussage “0 IF cp” gilt. Daraus folgt T’<,@*. Umgekehrt lal3t sich nach IV, 2.4.5 jeder arithmetischen Aussage eine gruppentheoretische Aussage zuordnen, die genau dann in einer - allen - a.a. Gruppen gilt, wenn die C , T’. 0 arithmetische Aussage wahr ist. Also ist, wegen 2, 0-” Folgerung 1. Die arithmetische Eigenschaft R (xo, . . . ) triflt genau dann in einer generischen Gruppe M auf ao,. . . zu, wenn es bo,.. . E M und p(E, d ) E P mit M t= p ( i , 6) und p IF “go,. . . in MG haben die Eigenschaft R” gibt. Das folgt aus IV, 2.4.9, weil generische Gruppen a.a. sind. Folgerung 2. Keine generische Gruppe enthalt eine endlich erzeugte Untergruppe, deren Wortproblem arithmetisch, aber nicht losbar ist. Denn (Beispiel 1 uor 2.1) es gibt eine generische Gruppe, die eine solche e.e. Gruppe mit nicht-losbarem Wortproblem nicht enthalt. Da also auch alle elementar aquivalenten a.a. Gruppen (siehe IV, 3 ) diese Gruppe nicht enthalten, folgt die Behauptung aus 1.
Keine der Gruppen MA, M a und keine m-generische Gruppe ist also generisch. Mit dem folgenden Satz konnen wir viele arithmetische Eigenschaften generischer Gruppen herleiten. Satz 3.2 (siehe [14]). Ai, i E w sei eine Forge uon Teilmengen von w, m E w. Dann sind aquivalent :
55 1
Algebraisch abgeschlossene Gruppen
(a) Es gibt eine generische Gruppe M, sodap f u r alle i E w und alle ao,. . . a,,, E M gilt S t ( W ( ( a o , .. . a,))) (ugl. IV, 2 fiir die Definition). (b) Jedes endliche Gleichungs- & Ungleichungssystem S ( i , y), das losbar ist, hat fur alle i eine Losung ii, 6 in einer Gruppe H, so dap S 3 ( W ( ( a o ., . . a, >). (c) Fiir alle i gibt es eine a.a. Gruppe M, so dap fiir alle an,.. . a,,, E M S$( W((a,,. . . a,,,))) gilt. Zusatz. Wenn alle A, arithmetisch sind, sind ( a ) , ( b ) , ( c ) iiquiualent zu (d). Fiir alle generischen Gruppen M, alle i E w, a(),.. . E M gilt SPa(W((ao,.. . a, ))I. Beweis. Wenn die A, arithmetisch sind, folgt (a)@(d) weil “f.a. ao,. . . Spt( W ( ( a o., . . )))” arithmetische Eigenschaften sind, die sich auf elementar aquivalente Gruppen ubertragen. (a)+ (c) ist klar. Um (c)+ (b) einzusehen, geben wir uns ein i E o und ein endliches G1. & Unglsystem S(X,j) vor. S hat eine Losung ti, 6 in der a.a.Gr. M, wo S$( W ( ( a o , . .))) gilt. . . ) ) € 2”’’ (go,. . . (b)+(a): Wir zeigen 01t“f.a. i ex. n E A, W((go,. sind die Interpretationen der co, . . . in MG). Das bedeutet
V i E w V p 3q 3 p 3 n E A, q IF “W((g0,.. .)) E 2,”’. Sei also p(c0,.. . c,,,, d ) E P und i € w gegeben. Wegen (b) (S(X,9 ) = p(xo,. . . y)) gibt es ao,. . .6 E H mit HI= p(ao,.. . 6 ) une ein n E A, mit W ( ( a o ., . . a,)) E X,,. Es gibt ein endliches System S ( a o , . ..Z) von GI. & Ungl., das uber H genau dann losbar ist, wenn W((ao,. . .)) E 2“ (siehe Beweis von IV, 2.1.2, n = 1). q = p U S(co,.. . d ’ )(fur neues d r ) ist aus P, weil S(ao,. . . ) uber H losbar ist. Andererseits ist q Ik “ W((g,,,. . . )) E %,,”. Wie in 2.6.8 folgt nun 0 t “f.a. ao,. . . aus MG, f.a. i Sgh( W((a,, . . .)))”. Da auch
0 It “G generisch”, gibt es ein generisches G, so da8 MG (a) erfullt. 0
Folgerung 3.3 (Bezeichnungen wie in 3.2). Aus 1 und 2 folgtjeweils, dap (a), (b), (c) des letzten Satzes rutrefen. 1. Fiir alle i E w und Erzeugende ao,. . . a, einer Gruppe rnit r.a. Wortproblem gilt S:h( W ( ( a o ., . . a,,,))). 2. Kein A, ist durch eine r.a. Menge uon T,,,:= { n W, C 2-1 trennbar.
I
Beweis. Wenn 1 gilt, trifft fur jedes i (c) von 3.2 auf MI, zu; denn alle e.e. Untergruppen von MI, haben r.a. Wortproblem.
M. Ziegler
552
2. Wir zeigen (b). Sei also i E w und S ( i , 9 ) gegeben. Zu S gehort eine r.a. Hornklasse X,die genau dann W ( ( a , ,.,. . a , ) ) enthalt, wenn S ( & 9 ) in einer Obergruppe losbar ist. Wir setzen D = { n ,%, n %‘ n W, = 0). D ist r.a. Wenn wir S als losbar annehmen, folgt D n T, = 0. Es muB also A i f D sein. DaB fur ein n E Ai ,%, f l X n W, # 0 bedeutet aber, daB es eine Gruppe ( a o , . . a,) gibt mit W((ao,. . .)) E % n 2, d.h. Sf( W ( ( a o ,...))) und S(ao,.. . ) losbar uber ( a o , .. .). 0
1
Beispiel 3.4 ([14]). Generische Gruppen erfullen MAX nicht (siehe IV, 1.6 Beisp. .2). Beweis. E sei die Menge aller endlichen Systeme S(xo, . . .) von G1. & Ungl., die hochstens eine Ungleichung enthalten. Jedem S E E konnen wir effektiv eine r.a. Hornklasse gnzuordnen, die genau dann das Wortproblem einer Gruppe ( a o , . . a,) enthalt, wenn S in ( a o , .. .) sei die nicht losbar ist; n berechne sich durch g ( S ) rekursiv aus S. in die Menge der losbaren Systeme von E . Da jedes System von in jeder jeder a.a. Gruppe losbar ist, muB auch jedes System von universellen e.e. Untergruppe einer a.a.Gr. losbar sein. In generischen Gruppen gibt es keine e.e. Untergruppen mit dieser Eigenschaft: Wenn a o , .. . a, Elemente einer generischen Gruppe sind, gibt es ein S E mit W ( ( i i ) E ) ,%g(s). Wenn A = g [ ~ , ] heil3t , das Sf(W((ao,. . .)). Um das zu zeigen, zeigen wir, daB A nicht durch eine r.a. Menge von T, getrennt werden kann, und wenden dann 3.3.2 an. Nehmen wir an, D sei r.a. und trenne A von T,. Da g E \ E~ in T, abbildet, ware dann r.a. Das widerspricht der Unlosbarkeit des aber = g-’(D) d.h. Wortproblems fur Gruppen. 0 Die generischen Gruppen sind eine “elementare” Klasse von a.a. Gruppen:
Satz 3.5 (siehe [33]). Es gibt eine rekursiue Menge uon elementaren Aussagen, die genau dann in einer a.a. Gruppe gelten, wenn die Gruppe generisch ist.
Bemerkung. Da alle generischen Gruppen elementar aquivalent sind, ist das - von der Rekursivitat abgesehen - dasselbe wie: Wenn N = M a.a. und N generisch, ist M generisch.
1
Beweis. Wir wahlen eine Godelnumerierung (pi i E w ) von P. Dann ist fur jedes cp{i [ p ikcp} arithmetisch. ‘‘(M,fi)t=p’”ist in M eine
Algebraisch abgeschlossene Gruppen
553
arithmetische Eigenschaft der a,,, . . . und i. Also ist V i ( c p ( i ) o 3 u3~"( . . . ; i C, p~" )" ~ p " t c peine ) arithmetische
Eigenschaft von Gruppen. In a.a. Gruppen sind diese arithmetischen Aussagen durch elementare Aussagen beschreibbar. Wir nehmen fur jedes cp die zugehorige elementare Aussage. Wir geben jetzt andere Charakterisierungen fur generische Gruppen. Lemma 3.6. Die Gruppe M ist genau dann generisch, wenn es fur alle GEM, p E P m i t ( M , i i ) / = p u n d a l l e c p , $ a u s L ( C ) e i n q E P u n d 6 E M m i t p C q u n d (M,i,6)Cqgibt, sodaJ?pItcpvq 3 q l t c p o d e r q It $ und p It3xcp(x) j ex. c E C q IF cp(c). Beweis. Die eine Richtung folgt sofort aus der Definition. Wenn die Bedingung erfiillt ist, zeigen wir durch Induktion uber den Aufbau von cp E L ( C ) , daJ? fur alle i E M, p E P ( M , ii) t= p , p It cp 3 (M, ii) C cp. Wir konnen annehmen, daB cp sich aus atomaren und negiert atomaren Formeln mit A , v , Vx, 3 x aufbaut. Fur quantorenfreie cp ist die Behauptung klar (siehe 2.6). Der Beweis ist beim Induktionsschritt fur A , Vx sehr einfach und bei v ahnlich wie bei 3 x . Sei (M, ii)t= p und p IF 3 x Q (x). Nach Voraussetzung gibt es q 3 p , 6 E M, rnit (M, ii, 6 ) C q und c E C rnit q It cp(c). Nach der Induktionsvoraussetzung ist ( M ,Ci, 6)C Q (c). Also gilt 3 x Q (x) in ( M ,ii). Wir mussen noch zeigen, daB es, wenn ( M ,ii) I= cp, ein p E P 6 E M rnit p It cp und ( M , ii, 6)k p gibt. Nun ist aber 0ll- Q v i c p . Also gibt es 6 E M und 0 C p rnit ( M , i i , 6 ) t = p u n d p l t c p o d e r p t i c p . Aus p It i cp wurde aber ( M , ii, 6)F cp folgen, also ist p It cp. 0 Definition 3.7. Eine Familie 9 von endlichen Gleichungs- und Ungleichungssystemen mit Koeffizienten g o , .. . ,g, aus einer Gruppe G heiBt dicht uber G, wenn zu allen . . ,gk E G und jeder endlichen Menge S(go,. . . gk) von Gleichungen und Ungleichungen, die in G gelten, es ein F ( g o , .. . , g n , 2 ) aus 9 gibt, sodaB S U F U T G widerspruchsfrei ist. Bemerkung. 9 ist also dicht uber G, wenn fur kein q E P, g E M (M,g ) t= q und q It i V FESr 3 2 F . Beispiel. Wenn das endl. G1. & UngLsystem S' uber G Iosbar ist, ist {S'} dicht.
In der nachsten Charakterisierung der generischen Gruppen kommt die Erzwingungsrelation nicht mehr vor.
554
M. Ziegler
Satz 3.8. Eine Gruppe M ist genau dann generisch, wenn es fur jede
arithmetische, uber M dichte Familie 9 ein F E 9gibt, das eine Losung in M hat. Beweis. Die Bedingung sei erfullt. Mit 3.6 zeigen wir, daB M generisch ist. Sei ( M , i)t= p ( E ) und p(E)It- 3xcp(x). Die Familie von endlichen Systemen von Gleichungen und Ungleichungen uber M 9 : = { q ( t i , i ) l q ( E , d ) E P ,p C q , pIkcp(d’), d ‘ E C ) ist arithmetisch. Sei a” E M und M I = S(ti, ti‘) fur eine endliche Menge S(ti, ti’) von Gleichungen und Ungleichungen. Dann sind p und S(E, E’) vertraglich und es gibt ein q 3 p U S(E, t’)mit q IF cp(d’) fur ein d ’ E C. 9 ist also das eine Losung 6 E M hat. Das dicht und es gibt ein q ( 6 , i ) aus 9, wurde gerade in 3.6 verlangt. Der Fall cp v I,/I wird ebenso behandelt. Sei M generisch und $ ( g o , . . . g , ) eine arithmetische Familie von endlichen GI. & Unglsystemen. M I = 3 i V { A F(g, i ) F E 9)ist eine arithmetische Eigenschaft von M , g o , . . . ; wenn M, g o * diese Eigenschaft nicht hat, gibt es p ( c o ,. . . ,ck) E P g n + , ,. . . g k E M mit M k p ( g 2 . ..gk) und p IF “MG b v i A {-I A F ( E , i ) / F E 9)”. Also iSt fur alle d E C und alle F E 9p Ik -I A F(Z, 2). Das bedeutet aber, daB p ( g o , . . . gk) U F(go,. . . g,, i )U T G fur alle F E 9 inkonsistent ist. 9 ist also nicht dicht. 0
I
-
Nach 3.5 ist jede zu einer generischen Gruppe partiell isomorphe Gruppe wieder generisch. Die generischen Gruppen miissen sich also unter den a.a. Gruppen durch ihr Skelett auszeichnen lassen. Dazu definieren wir:
Definition 3.9. Ein Gruppe G heiBt subgenerisch, wenn jede arithmetische Familie 9 ( g o , .. . g n ) von endlichen Systemen von Gleichungen und Ungleichungen uber G, die in folgendem Sinn “dicht” ist:
Zu jedem endlichen G1. & Unglsystem S(go,. . . gk, y), das so daB uber G losbar ist, gibt es ein F ( g o , .. .g., 2 ) aus 9, F U S U T G widerspruchsfrei ist. ein Mitglied besitzt, das uber G losbar ist.
Bemerkung 3.10. Nach 3.8. ist eine a.a. Gruppe genau dann generisch, wenn sie subgenerisch ist. Aus I, 1.2 folgt daB jede Untergruppe einer subgenerischen Gruppe wieder subgenerisch ist. Satz 3.11. Eine algebraisch abgeschlossene Gruppe ist genau dann
generisch, wenn jede endlich erzeugte Untergruppe subgenerisch ist.
555
A lgebraisch abgesch lossene Gruppen
Beweis. 3.11 folgt aus 3.10. Denn eine Gruppe ist, wie man leicht sieht, genau dann subgenerisch, wenn jede e.e. Untergruppe subgenerisch ist. Folgerung. Jede a. a. Untergruppe einer generischen Gruppe ist generisch. Satz 3.12. Jede abzahlbare subgenerische Gruppe isf Untergruppe einer
generischen Gruppe.
I
Beweis. Es sei G = {g, i E w } subgenerisch. Wir denken uns die Menge der Konstanten C disjunkt zerteilt in {g, i E w } und eine unendliche Menge C’. Wir konstruieren eine aufsteigende Folge p o C p , C . von Elemeden aus P, die mit Diag(G, g ) U T G vertraglich sind. Die generische Obergruppe M von G wird dann eine von den Interpretationen der c E C erzeugte Losung von Diag(G, g ) U u{pi i E w } . $,, i E w sei eine Numerierung aller Aussagen aus L ( C ) . Sei pi schon gewahlt und t+bi = (p1 v cpz (oder = 3xcp(x)). In t+bi mogen nur die Parameter g o , .. . ,g, vorkommen. Wir betrachten die arithmetische Familie 9 = {q(go,. . . g,, i ) q 3 p,, q Il- 14, oder q IF cpl oder q IF cpz (oder q IF cp(c) fur ein c E C)}. 9 ist im Sinn von 3.9 dicht. q(go, . . . g,, i )E 9 sei uber G Iosbar. Wir setzen dann p i + ,= q(go, . . . E ’ ) U pi fur neue Konstanten c“ aus C’. Das so konstruierte M erfullt offenbar die Bedingungen von 3.6.
I
1
I
Bemerkung. Wir werden sehen, daB fur generische M W*(M) abzahlbar ist, G ist also genau dann partiell in eine generische Gruppe einbettbar, wenn G subgenerisch ist und W*(G) hiichstens abzahlbar. Wir werden im nachsten Kapitel die endlich erzeugten subgenerischen Gruppe auf einfache Weise durch ihr Wortproblem charakterisieren. Das direkte Product zweier subgenerischer Gruppen ist i.A. nicht subgenerisch. Problem. 1st W*(G) fur subgenerische G immer hochstens abzahlbar? VI. Generische o-Modelle Wir bestimmen die generischen Gruppen durch die zugehorigen w Modelle. Zuerst geben wir eine einfache rekursionstheoretische Beschreibung der subgenerischen Gruppen: Eine Menge von naturlichen Zahlen heiBt subgenerisch, wenn sie Element der Vereinigung jeder “dichten” Familie von r.a. Hornklassen ist (1.1). Die
M . Ziegler
556
e.e. Gruppe F ist subgenerisch gdw. W ( F ) subgenerisch ist (1.11). D a mit der Menge G auch jedes H mit H S * G subgenerisch ist, ist die a.a. Gruppe M genau dann generisch, wenn W * ( M )nur aus subgenerischen Mengen besteht (2.4). Wir nennen ein w-Modell daher generisch, wenn es algebraisch abgeschlossen ist und nur subgenerische Mengen enthalt. Die generischen Gruppen entsprechen dann gerade den generischen w-Modellen. Im zweiten Abschnitt definieren wir eine Erzwingungsrelation fur w Modelle: eine r.a. Hornklasse X erzwingt die L’-Formel cp(X), wenn fur alle generischen ( w , 9’) und alle A E Y n %‘ (w, 9’)t= p ( A ) (2.7). Wenn man n u n ausnutzt, daB r.a. Hornklassen endlichen Gleichungssystemen, L ‘-Formeln Formeln aus L generische w Modelle generischen Gruppen entsprechen, und bemerkt, daB (2.7) dem Lemma V, 2.5 entspricht, sieht man, daB unsere neue Erzwingungsrelation die Ubertragung der Erzwingungsrelation von V auf w-Modelle ist. Aus (der Definition) V, 2.7 kann man also schliessen : In einem generischen w-Modell ( w , 9’) gilt cp(A) genau d a m , wenn eine r.a. Hornklasse X mit A E X cp(X) erzwingt (2.8). Wir beweisen 2.8 aber nicht auf diese Weise, sondern direkt, ohne die Resultate von V uber generische Gruppen zu benutzen. Ebenso zeigen wir “nocheinmal” direkt die Existenz generischer w -Modelle (1.3, 2.5). Das zeigt weitere Moglichkeit generische Gruppen einzufuhren (namlich analog zur Einfuhrung der generischen w Modelle): Man definiert generische Gruppen durch V, 3.8 und die Erzwingungsrelation analog zu 2.7. Die Theorie der generischen Gruppen 1aBt sich dann wie in VI entwickeln. Als erste Anwendung zeigen wir, daB alle in a.a. Gruppen erfullbaren V,-Aussagen in allen generischen Gruppen gelten (2.10). Man kann also generische Gruppen “klein” nennen, denn V,-Aussagen, die in einer a.a.Gr. gelten, gelten auch in jeder a.a. Untergruppe. In diesem Sinn sind w-generische Gruppen “groB”: Eine V,-Aussage, die in einer w-generischen Gruppe gilt, gilt in allen a.a. Gruppen. Durch diese “Kleinheit” werden die generischen Gruppen nicht charakterisiert. Denn eine zu einer generischen Gruppe elementar aquivalente a.a. Gruppe ist zwar generisch (V, 3.5). Fur jedes n gibt es aber eine nicht-generische a.a. Gruppe, in der alle 3,-Aussagen gelten, die in generischen Gruppen gelten (3.5). Der Beweis von 3.5 benutzt schon das Hauptresultat des dritten Abschnitts: Zu jeder generischen Gruppe M gibt es eine (bis auf rekursive Isomorphie eindeutig bestimmte) Menge B C w, so daB W * ( M )= {A C w A <, €3). Die so vorkommenden B (die also M bis
zu,
I
Algebraisch abgeschlossene Gruppen
557
auf partielle Isomorphie eindeutig bestimmen) lassen sich sehr einfach rekursionstheoretisch beschreiben: Es sind (bis auf rek. Isomorphie) die subgenerischen Mengen, die die rek. untrennbaren r.a. Mengen A, B von 111, 4.6 trennen (3.3). Dieser Satz fuhrt also die Untersuchung der generischen Gruppen zuruck auf die Untersuchung dieser Mengen (der “v.s.g.” Mengen). Fur generische Gruppen M ist also W , ( M ) ein Hauptideal. Wir zeigen in 3.7, dal3 W , ( M ) immer unendlich ist. Im vierten Abschnitt zeigen wir, daB es in W , ( M ) sowohl *-Grade a mit a z $ a als auch nicht-rekursive *-Grade b mit bz b gibt (4.4, 4.5). Fur a wahlt man dabei den maximalen *-Grad in W , ( M ) . D a cz arithmetisch in c ist, bedeutet das gerade nach 3.6, dal3 a ; fZ W , ( M ) und bS€ W , ( M ) ; denn, wenn d arithmetisch in c E W , ( M ) ist, ist (fur generisches M) d E W , ( M ) gdw. d < c (3.6). Es gibt also in jeder generischen Gruppe M eine endlich erzeugte Untergruppe F, sodal3 fur kein H # E F * H nicht in M einbettbar ist. Es gibt allerdings auch immer e.e. Untergruppen F mit nichtlosbaren Wortproblem, so dal3 fur alle endlichen H F * H und alle F * F, F * F * F, . . . in M einbettbar sind. Das steht im Gegensatz zu 4.1: Wenn M generisch ist und G E W * ( M ) ,ist w \ G E W * ( M )gdw. G rekursiv ist. Daraus folgt das Resultat aus ([14]), daB alle e.e. einfachen Untergruppen einer generischen Gruppe losbares Wortproblem haben. 1. Subgenerische Mengen Die im folgenden betrachteten “subgenerischen” Mengen, entsprechen den Wortproblemen endlich erzeugter subgenerischer Gruppen.
Definition 1.1. 1 . A C w sei arithmetisch. Die “arithmetische” Familie {gi i E A} von r.a. Hornklassen heiBt dicht, wenn es zu jeder nichtleeren r.a. Hornklasse X ein i E A mit 2,f l X # 0 gibt. 2. Eine Menge G von natiirlichen Zahlen heiBt subgenerisch, wenn es in jeder arithmetischen dichten Familie von r.a. Hornklassen eine Hornklasse gibt, die G enthalt.
I
Beispiele 1.2. 1. Jede rekursive Menge ist subgenerisch. Denn, wenn G rekursiv ist, ist { G } eine r.a. Hornklasse. Wenn 9 eine dichte Familie von Hornklassen ist, ist also { G } n d.h. G E UP. 2. Jede subgenerische arithmetische Menge ist rekursiv.
u9#0,
I
9 ={ { X i
I
E X } l ifZ G}U{{X i @ X}l
i E G}
ist, wenn G arithmetisch ist, eine arithmetische Familie von r.a.
558
M. Ziegler
Hornklassen. G ist in keinem Element von 9 enthalten; wenn G subgenerisch ist, ist 9 also nicht dicht. Es gibt also eine r.a. Hornklasse X,nicht-leer, die von US disjunkt ist. Weil US = 8 ( w ) \ { G } , folgt X = {G}. G ist also rekursiv. Aus dem nachsten Satz folgt, daD es nichtrekursive subgenerische Mengen gibt. Satz 1.3. Jede nichtleere r.a. Hornklasse enthalt eine subgenerische
Menge.
I
Beweis. (9, i E o)sei eine Aufzahlung aller arithmetischen dichten Familien von r.a. Hornklassen, Xn eine nichtleere r.a. Hornklasse. Wir konstruieren eine absteigende Folge Xn 3 XI3 X 2 3 . . . von r.a. nichtleeren Hornklassen, so daD C US,, i = 0,1,2 * . * . Wenn X, gegeben ist, finden wir ein X E 9, mit X n X , # 0. Wir setzen X,+, = X,n 2.Nach 111, 3.9.2 ist 0.Jedes Element des Durchschnitts ist subgenerisch. 0
n,&%#
Bemerkungen 1.4. 1. 1st Bi, i E o eine Folge von nichtrekursiven Mengen und Xn eine nichtleere r.a. Hornklasse, gibt es es eine subgenerische Menge aus Xn, die von allen Bi verschieden ist. Denn man kann im Beweis von 1.3 in der Aufzahlung der 9,fur jedes j eine Familie S,! von r.a. Hornklassen mit US: = 9 ( w ) \ { B i } vorkommen lassen (siehe 1.2.2). 2. D a es nichtleere r.a. Hornklassen gibt, die keine rekursiven Mengen enthalten, gibt es nichtrekursive subgenerische Mengen. (Eine solche Hornklasse ist z.B. {X X trennt A und I?} fur die A,I? aus Lemma 111, 4.6.)
I
Fur diese Hornklassen gilt sogar Satz 1.5. Jede nichtleere r.a. Hornklasse, die keine rekursiven Mengen
enthalt, enthalt 2" subgenerische Mengen.
Beweis. Die nichtleere r.a. Hornklasse Xohabe keine rekursiven Elemente. (95 i E w ) sei eine Aufzahlung aller arithmetischen dichten Familien von r.a. Hornklassen. Wir konstruieren fur jede endliche Folge u E "2 eine nichtleere r.a. Hornklasse Xu mit (a) u C 7 3 Xu 3 Xn (b) Wenn u E "+l2, ist X-C USn,(c) Xun(n)f l = 0. Dazu wahlen wir, wenn u E "2, 2-gegeben ist, eine nichtleere r.a. Hornklasse X C US,,n Xu.Weil Xn keine rekursiven Mengen enthalt,
I
Algebraisch abgeschlossene Gruppen
559
gibt es in X zwei verschieden Mengen, von denen die eine sagen wir k enthalt - und die andere nicht. Wir setzen X v n ( o ) = { X ( k E X A X E X }und X m n ( , ) = { X ( k e X A X E % ’Fur } . jedes f E “ 2 , wahlen wir ein Gf E u{X,ji1 i E w } . Die Gf sind subgenerisch und alle verschieden. 0 Bemerkung. Wir konnen wieder erreichen, daR alle Gf nicht in einer vorgegebenen Folge von nichtrekursiven Mengen vorkommen. Definition 1.6. Sei d C 9 ( w ) . Eine r.a. Hornklasse X erzwingt d ( X IF d), wenn alle subgenerischen Mengen aus X in d sind. Statt X Ik{X * . . X . . . } schreiben wir XI1 “ * * * X * .
1
a’’.
Beispiel 1.7. Xo, XI seien r.a. Hornklassen. Dann ist XolkXl gdw. XOC XI. Beweis. Sei XI= %,+, und A E X o \ X l . Dann gibt es (u, w ) E W mit D, C A und DLC w \ A . Die r.a. Hornklasse X = 5Yo f l { X D, C X A DLC w \ X } ist nichtleer, und disjunkt zu XI. Eine subgenerische Menge aus X ist in Xo, aber nicht in XI, also XouC 2 , . Die andere Richtung ist trivial. 0
I
Satz 1.8. 1. d c 9 ( w ) sei arithmetisch, G subgenerisch. Dann ist G E .d gdw. es eine r.a. Hornklasse mit G E X, X IF d gibt. 2. R ( u o , .. .,X ) sei eine arithmetische Relation. Dann ist giIF “ R ( n o , .. . ,X ) ” eine arithmetische Relation zwischen i, no,n l ,. . . .
n{di1
Bemerkung. 3. Fur jede Familie di,i E Z gilt X IF i E I } gdw. fur alle i E I X Ik di.4. Wenn d t 9 ( w ) die Behauptung von 1 erfullt, gilt X I F P ( w ) \ d gdw. fur alle O # X r C X X ’ Y d . Beweis. 3 ist klar. Wir zeigen 4: Wenn X IF P ( u ) \ d , ist fur alle X ’ C X X’IF 9 ( w ) \ d . Also, wenn X ’ # 0, X’Fd.Sei umgekehrt X Y 9 ( w ) \ d . Dann gibt es eine subgenerische Menge G E X n d. Wegen 1 gibt es ein X”Ik d,das G enthalt. Dann ist aber 0 # X ’ =
x n xrrIF d.
(a) Aus 1.7 folgt, daR 1.8.1 und 2 fur r.a. Hornklassen d.h. R ( u , X ) = X E 2” gelten. Fur eine Relation R ( v o ,. . . X ) formulieren wir 1 noch einmal: 5 . “Fur alle n o , .. . E w und alle subgenerischen Mengen G ist R ( n o , .. . G ) gdw. es eine r.a. Hornklasse X mit G E 5Y und X Ik “ R ( n o , .. . X ) ” gibt.”
M. Zuglcr
560
(b) Wenn R(uo,. ..X ) 2 und 5 erfullt, erfullt auch i R ( u o , ... X ) 2 und 5 . Denn aus 5 folgt 4: 2,P “ 1R(no,. . .X)” gdw. V j (%J # 0 A gJC 2,) 3 %, Y “R(no,.. .X)”. Das ist, wegen 2 und 111, 3.9 arithmetisch. Also erfullt i R 2. Sei G subgenerisch und i R (no,.. . G ). Die Familie { 2, 2,k “ R(no,. . .X)” oder 2,IF “7R (no,. . .)”} ist dicht (- denn wenn X Y “7 R (no )” ist 2’k “R (no,. . .)” fur ein 0 # X’C X).G ist also in einem Element R ( n o , .. . G), gibt es daher ein dieser Familie enthalten. Weil i 2 IF “1R (no,.. . )”, G E 2.Damit ist auch 5 nachgewiesen. (c) Wenn R ( u o , .. .X ) 2 und 5 erfullt und B arithmetisch ist, erfullt auch Vu,,E B R(uo,u l , . ..X ) 2 und 5. Denn aus 3 folgt, daS 2,k “Vu0 E B R(uo,nl, . . .X)” gdw. fur alle n , , E B %, It- “R(no,n l , . . .X)”. Weil fur R 2 vorausgesetzt war, folgt aus dieser Darstellung, daB VuoE B R(uo,.. . ) 2 erfullt. Sei G subgenerisch und V u o EB R(uo,n,, .. . G). 9 sei die Familie aller r.a. Hornklassen 2,die fur ein noE B ‘‘R1 ( n o , .. .X)” erzwingen oder ~ ‘ V u 0 R ( u o.,..X)” erzwingen. Da i R nach (b) 5 erfullt, gibt es, wenn %jV“R(no,...X)” ein 0 # X‘C X mit X ’ k “ i R ( n o,... X)”. Die Familie 9 ist also dicht und wegen 2 und (b) arithmetisch. Sei fur ein X E 9,G E X. Dann ist X F “VuoR(no,.. .X)”. Satz 1.8 folgt aus (a), (b) und (c), weil man jede arithmetische Relation aus r.a. Hornklassen 0 durch Anwenden von ‘‘1und ’’ “Vu0 E B . ” erhalten kann.
I
--
-
Der nachste Satz entspricht V, 3.2. Wir bestimmen die Erzwingungsrelation fur eine einfache Klasse von Relationen. Dazu erinnern wir an die Bezeichnungen von IV, 2 und V, 3.2: G erfullt genau dann S?(X), wenn G E i E A } . G ist also subgenerisch gdw. G jedes S?(X) erfullt, fur alle arithmetischen A, fur die { g , i E A } dicht ist.
u{%I
I
Satz 1.9. 1. Sei A arithmetisch. Dann ist ~ ( w ) ~ - ‘ ~ S ? ( Xgdw. ) ’ * die Familie {gi i E A } dichr ist. 2. Die Relation LL%k F S?(X)’* ist 17;(in k, j ) .
I
Beweis.. 1: 9 ( 0 i) “S?(X)” gdw. V X # 0 X Y “1 S?(X)” gdw. V X # 0 3 i E A XY“X$Z%i”gdw. V X # @ 3 i E A 3 0 # X ’ C X X ’ k % , gdw. {gi i E A } dicht. 2: Die Relation ist iiquivalent zu:
I
~ x ( % n % ~ 3# @i W, j~ xngi#O). Daraus erkennt man leicht die Gultigkeit der Behauptung.
0
A igebra isch abgesch lossene Gruppen
561
Foigerung 1.10. A sei arithmetisch. Wenn S ? ( B ) fur alle r.a. B, ist 9(0) IF ‘‘SP(X)”. Beweis. Jede r.a. Hornklasse # 0 enthalt eine r.a. Menge. Wir wollen zeigen, dal3 endlich erzeugten subgenerischen Gruppen die e.e. Gruppen mit subgenerischen Wortproblem sind. Der Beweis beniitzt das folgende Lemma: Lemma 1.11. ( G C w ) G ist genau dann subgenerisch, wenn jede arithmetische Familie 9 von r.a. Hornklassen mit :
Wenn die r.a. Hornklasse X G enthalt, ist X n G E US erfdlt.
u S# 0.
Beweis. Eine Richtung ist trivial. Sei G subgenerisch und 9 eine Familie von r.a. Hornklassen mit der angegebenen Eigenschaft. 9‘sei die Familie aller r.a. Hornklassen X mit X C US oder X n US = 0. 9’ ist arithmetisch und dicht. Sei X E 9‘mit G E X.Dann ist &9 n U S Z 0 , also X C U S und daher G E U S . 0 Satz 1.12. G sei endlich erzeugte Gruppe. G ist genau dann
subgenerisch, wenn W ( G ) subgenerisch ist.
Beweis. Sei G subgenerisch, G erzeugt von go,. . . g,,. 9 sei eine arithmetische dichte Familie von r.a. Hornklassen. Jedem Xi E 9 ordnen wir ein endliches System von Gleichungen und Ungleichungen Si(co,. . . c,, x‘) zu, so daB Si (ho,. . . ,i )genau dann iiber einer Gruppe H losbar ist, wenn W ( H ;h o , .. . ) € Xi. Wir erhalten so eine arithmetische Familie $ ( c o , . . . c.) von GI. & Unglsystemen. Wir zeigen, daB g ( g o , .. . gn) dicht im Sinn von V, 3.9 ist. Sei also S(go,.. . g,, 9 ) iiber G losbar. Zu S gehort eine r.a. Hornklasse X,so daB S ( h o , . . . , 9 ) genau dann iiber einer Gruppe losbar ist, wenn W ( H ;ho,. . .) E X. D a W ( G )E X n Wn,gibt es ein Xi aus 9 mit X n W, n Xi# 0. Sei W E X r l W, r l Xi. Dann gibt es ein H = ( h o , .. . h , ) rnit W ( H )= W. Sowohl Si(lao,.. . i) als auch S(h0,. . . Y ) sind iiber H losbar. si(go,. . . i )U S ( g o , . . . y) U T G ist also widerspruchsfrei. Da G subgenerisch ist, ist ein Si (go,.. . i) aus $ ( g o , . . . ) iiber G losbar. Also ist W ( G )E Xi. Sei umgekehrt W ( G ) subgenerisch. W ( G ) hat also die Eigenschaft von 1.11. %(go,.. . gn) sei eine arithmetische Familie von endlichen G1. & Unglsystemen Si(g,,,. . .g,, i), dicht im Sinn von V, 3.9. Zu jedem Si(c,, . . .i )gehort eine r.a. Hornklasse Xi, sodal3 fur alle ho,. . .E H
M.Ziegler
562
Si ( / t o , . . . 2 ) genau dann uber H losbar ist, wenn W ( H )E Xi. 4 sei die Familie der so gewonnen Hornklassen. 4 ist arithmetisch. X sei eine r.a. Hornklasse, die W ( G ) enthalt. Das endliche G1. & Unglsystem S(co,. . . y) sei in einer Obergruppe losbar gdw. das Wortproblem zu X gehort. S(go,. . . y ) ist also uber G losbar. Fur ein Si aus 9 ist also Si (co, .. . n') U S(c,,,. . . 1)U T G widerspruchsfrei. Es gibt also ein H = (ho,. .. h"), uber dem Si ( / t o , . . .) und S(ho,.. .) losbar sind. Daher ist W ( H ) E Xi n X # 0. Wir haben gezeigt, daB 9 die Bedingung in 1.11 erfullt. Also ist W ( G )E Xi fur ein Xi aus 9. Si (go,.. .n') ist somit uber G losbar. 0
Folgerung. Wir haben einen neuen Beweis fur die Folgerung 2 von V , 3.1: D a arithmetische subgenerische Mengen rekursiv sind (1.2.2), haben alle endlich erzeugten Untergruppen generischer Gruppen losbares oder nicht-arithmetisches Wortproblem.
2. Generische o-Modelle Wir werden sehen, daB die generischen w-Modelle gerade die zu generischen Gruppen gehorenden w -Modelle sind. Definition 2.1. Ein o-Model1 ( w , 9 ' ) heiBt generisch, wenn es algebraisch abgeschlossen ist und Y nur aus subgenerischen Mengen besteht.
1st also M a.a. und w ( M ) generisch, so ist M generisch. Zur Konstruktion generischer w -Modelle brauchen wir das folgende Lemma. Eine Folge (Ao,.. .,A,) von Mengen heiBt subgenerisch, wenn A,, v A , v * v A, subgenerisch ist.
- -
Lemma 2.2. 1. Wenn ( G , H ) subgenerisch ist, sind auch G und H subgenerisch. 2. Ist fur eine r.a. Homklasse X und eine subgenerische Menge G X [ G ]# 0, gibt es ein subgenerisches (G,H ) E Xz. Anmerkuig. Wenn G und H subgenerisch sind, ist i.A. (G,H ) nicht wieder subgenerisch. 1.3 folgt aus 2.2. Beweis. 1. Wir zeigen, daB G subgenerisch ist. Sei 9 eine arithmetische dichte Familie von r.a. Hornklassen. Wir ordnen jedem X aus 9 die r.a. Hornklasse 2' = {(X, Y ) X E X} zu. Die so erhaltene Familie ist dicht. Denn wenn Xi # 0, gibt es, weil 9 dicht ist, ein X E 9 mit {X 3 Y (X, Y )E Xi}r l X# 0. Dann ist Xi n %'* # 0. Es gibt nun ein X E 9 mit (G, H ) E 2'.Also ist G E X.
I
I
Algebraisch abgeschlossene Gruppen
563
1
2. (9: i E w ) sei eine Aufzahlung aller arithmetischen dichten Wir konstruieren eine Familien von r.a. Hornklassen X 2C (B(w))~. absteigende Folge X 2 = X i 3 Xi 3 . . . von r.a. Hornklassen mit 2,[GI # 0 und C Us:. Wir brauchen dann H n u r aus dem Schnitt aller X,[GI zu wahlen. Fur die Konstruktion verwenden wir: “c 2 .9 sei eine arithmetische, dichte Familie. Xi eine r.a. Hornklasse rnit Xo[G]#O.Dann gibt es ein X ? E 92mit ( X , , f lX I ) [ G#0.” ] Beweis. Wenn Xzeine Hornklasse ist, bezeichnen wir mit P ( X 2 )die Hornklasse {X 13 Y ( X , Y )E X 2 } . 9 sei die Familie aller P ( X i fl Xi) fur X; aus 9’. 9 erfiillt die Bedingung von 1.11: Sei X eine r.a. Hornklasse, die G enthalt. g2flXi ist nicht-leer (siehe 1). Also gibt es ein Xi E 9 mit g2f l Xi n X:# 0. Das bedeutet X n P ( X i f l X:)# 0. Nach 1.11 gibt es also ein 2%’; aus 9 rnit G E P ( X i n X?), wie gewunscht. 0 Bemerkung 2.3. Man kann 2.2 auf drei Arten verscharfen (2’r.a. Horn klasse). 1. Sei G subgenerisch, X [ G ]# 0 (B, i E w ) eine Folge von B, C w mit B, $ * G. Dann gibt es ein subgenerisches (G, H) E X rnit B, jd* G v H fur alle i (siehe 1.4.1 und 111, 4.1 Bew. 3). 2. Sei G subgenerisch, X [ G ]# 0, X [ G ]enthalte kein B mit B S * G. Dann gibt es 2” viele H rnit (G,H) subgenerisch und (G, H) E X 2 (siehe 1.5). 3. Sei (G, i E w ) eine Folge von G, C w. Fur alle i E w sei (Go,.. . G,) subgenerisch. Wenn %‘[Go]# 0, gibt es ein H, soda0 (Go,H) E X 2 und fur alle i E w ( H , Go,.. . G,) subgenerisch ist.
1
1
Folgerung 2.4. 1. Wenn H G * G und G subgenerisch, sind auch (G,H ) und H subgenerisch. 2. Die a.a. Gruppe M isr genau dann generisch, wenn w ( M ) generisch isr.
Weil man {H}in der Form X [ G ]schreiben kann, folgt 1 aus 2.2. Wenn M generisch ist, mussen wir fur 2 zeigen, daf3 alle H aus W * ( M ) subgenerisch sind. Es gibt aber ein H E S k ( M ) rnit H S * W ( G ) .D a G subgenerisch, ist W ( G ) subgenerisch. A u s V, 3.12 konnen wir nun schlieBen, daB jede subgenerische Menge G zu einem generischen w-Modell gehort. Denn, wenn F eine e.e.Gr. rnit W ( F )=* G ist, ist W ( F ) und damit F subgenerisch. Wir betten F in die generische Gruppe M ein. G gehort dann zu dem generischen w-Model1 w (M). Wir beweisen dieses Resultat noch einmal mit 2.2, o h n e auf V zuriickzugreifen.
M. Ziegler
564
Satz 2.5. Jede subgenerische Menge ist in einem generische w-Model1 enthalren. Beweis (siehe auch den Beweis von 111, 4.1). Sei Go subgenerisch. Wir konstruieren eine Folge von Mengen G,, i E w fur die (Go,. . . G,) subgenerisch ist. G, sei bereits definiert und i = ( k , n ) . Wir setzen m = min(i, k ) + 1. Wenn g,,[Go,.. . G,] leer ist, setzen wir G,+]= G,. Sonst schreiben wir %?,,[Go,... G,] in der Form g,,,[Go,... G,]. Nach 2.2 gibt es ein G,+]E %,,,[Go,.. . G,], fur das (Go,.. . G,, G,+l)subgenerisch ist. (w,{G, i E w } ) ist generisch. 0
1
Bemerkung 2.6. 1. Aus 1.5 folgt rnit 2.5, daB es 2" viele generische w-Modelle gibt. (Das ist die genaue Anzahl, denn, wie wir sehen werden, ist jedes w-Model1 abzahlbar.) Es gibt also 2" viele abzahlbare generische Gruppen ([15]). 2 . (siehe 2.3.1). 1st G subgenerisch und B,, i E w, eine Folge von Mengen mit Bi$ * G, gibt es ein generisches w-Modell, das G aber kein Bienthalt. 3. I C " ( w ) sei hochstens abzahlbar, fur alle Go,.. . G, E I sei (Go,.. . G,) subgenerisch. Dann ist I in einem generischen w-Model1 en t halt en. Definition 2.7. Die r.a. Hornklasse X"+'erzwingt die L2-Formel v(X0,. . . )), wenn fur alle generischen (0,9')und p(X0,. . . X,,) (X"+'!+ An,. . .A,, E 9, mit (An,. . . A,,) E g"",( o J , ~t=) Q ( A ~ ,... An). Bemerkungen. 1. Fur arithmetische L2-Formeln (also ohne gebundene Mengenvariable) stimmen 2.7 und 1.6 uberein: X " + lI t Q (Xo, . . . ) (im Sinn von 2.7) gdw. fur alle subgenerischen (An,. . . A , ) aus % " + I q ( A o , . . .A,) gilt. (Das folgt aus 2.5.) 2 . Q(X0,. . . X,) enthalte nur die Variablen Xn,. . . X,, m < n. Wenn %';+'ItQ(X0,. . . . X,,) und = {(Xo, . . . X m ) ax,,, * * X,, (XO, . . . ,X,) E X,""}, ist dann XY+'IF v(X0,. . .Xm). (Das folgt aus der algebraischen Abgeschlossenheit der gen. w gdw. ex. A,+], . . . A, E Y mit Modelle: (An,. . . A,)€ (An,. . . A,) E %?;+I.)
1
-
Satz 2.8. cp(X0,. . . X,) und $ ( U o , . . .Xo,. . .X,) seien L2-Formeln. 1. Fur alle Ao,. . .A , aus einem generischen w-Model1 (0,9')ist (w, 9') t= rp (Ao,'. . .) gdw. es eine r.a. Hornklasse mit (Ao,. . .) E X"+'und %"+I It Q ( X o ,. . .) gibt.
Algebraisch abgeschlossene Gruppen
565
2. @;+'It- + ( n o , .. . X o ,. . .) ist eine arithmetische Relation zwischen i, no,nl * a .
Der Beweis geht durch Induktion uber den Aufbau von cp wie der Beweis von 1.8. Wir behandelen nur den Fall "VX,,". Zuvor bemerken wir noch, daB, wie in 1.8, aus 1 folgt C X,"" 3. X:+'ll- i cp (X,,,. . .) gdw. fur alle 0 # XY+*Jvcp(X0,.. .). Wir nehmen an, daB 2.8 fur cp(uo,.. . X o , .. . X n )und i cp(uo,.. . ) gilt und zeigen die Gultigkeit von 2.8 fur VX,,cp(uo,.. .). DaB 2 gilt, folgt aus ({x,,,. . .x,,)(x,,,. . . x,,-~) E x")= gn+') 4. X" Il- V X , cp (X,,,. . .) gdw. %"'+' It cp (X,,, . . .X n ) (klar). Wir zeigen, daD 1 gilt. Sei ( w , Y ) generisch, A,,, . . . E Y und (w, Y) I= VX,,Q (Ao,. . . X , , ) .Wir gebrauchen die Abkurzung P ( X n + ' )fur Die Familie die r.a. Hornklasse {(X,,,. . . X,,-l) 3X,, (X,,,. . .X,,)E X"+'). IF cp (x,,, .. .x,,))u { P ( x " + ' ) ~ti cp (x,,,. . .x,))ist 9 = (2" arithmetisch und dicht (weil 3 fur i cp gilt). ( A o , . . A n - l )ist subgenerisch und daher in einem Element von 9 enthalten. Wenn (A,,, .. .) E P ( X " + ' ) ,gibt es aber ein A,, aus Y mit (A,,,. . .A n )E Xn+'. Also mul.3 ( A o ,. .An-Jin einem X" mit 2?"" It cp(X0,.. .X , ) liegen. 0
1
I
I
I xn+'
Folgerung. A sei Element der beiden generischen w-Modelle (w, X ) , i = 0 , l . Dann ist fiir jede Lz-Formel cp(X) (w, Yo)+ cp(A) gdw. (w, 9")I= Q ( A ) . Insbesondere gelten in allen generischen w-Modellen dieselben L *-Aussagen. Die L2-Formel cp(X) "spricht nur vom Isomorphietyp von X" wenn fur jede rekursive Bijektion 7~ : w + w P ( w ) IF cp(X)t,cp(.r(X)).Die r.a. Hornklasse X ist vollstandig, wenn es fur alle r.a. Hornklassen 0 # X i C X i = 0 , l rekursiv isomorphe Ai E X i gibt. Lemma 2.9. X sei vollstandig, cp(X) spreche nur uom Isomorphietyp uon X . Dann ist X Il- cp ( X ) oder X Il- i cp ( X ) . Beweis. Sonst gibt es 0 # Xi C 2,XoIt- cp ( X ) , XIIt i Q ( X ) . Fur eine vertraglich. Es rekursive Bijektion 7~ sind Xound X3= { X T ( X )E XI) Q ( ~ ( x Wid. )). 0 ist aber X o - nX3It- cp ( X ) A i
I
und B trennen (111, Beispiel. 5' sei die r.a. Hornklasse aller X , die 0# Xi C V, sind nX,,und n X l Elemente von X,, bzw. XI, r.a. und rekursiv isomorph. 5' ist also vollstandig.
4.6). Wenn dann
M. Ziegler
566
Satz 2.10. 1. Eine P,-Aussage (siehe IV, 2.2), die in einern a.a. w Model1 gilt, gilt in allen generischen w - Modellen. 2. Eine V,-Aussage, die in einer a.a. Gruppe gilt, gilt in allen generischen Gruppen. 3. Die 3,-Theorie der generischen Gruppen ist vollstandig 2:, n 3 3. 4. Die 3,-Theorie der a.a. Gruppen ist vollstandig 2;. In IV, 3.6.2 haben wir gezeigt, daB eine V,-Aussage, die in einer a.a. Gruppe gilt, auch in allen a.a. Untergruppen gilt. In diesem Sinn zeigt also 2.10.2, daB generische Gruppen “klein” sind. Beweis. Nach IV, 2.3.2(iii) ist jede P,-Aussage aquivalent zu V X S : ( X ) fur ein r.a. A . Wenn diese Aussage in dem a.a. w-Model1 ( w , Y ) gilt, gibt es fur jede r.a. Hornklasse X # 0 ein B E X f l Y und ein i E A mit B E %#.{gi i E A } ist also dicht. Nach 1.9 ist also P ( w ) IF S:(X), d.h. P(w)IkVXS2A(X). Damit ist 1 bewiesen. 2 folgt aus 1 mit IV, 2.1.1, 2. Mit IV, 2.2 ubersetzt man jede Zt-Aussage uber w ( M ) , in eine (fur a.a. M ) aquivalente 3,-Aussage uber M, n 3 2 . Also laBt sich jede Zf-Menge 1-reduzieren auf die 3,-Theorie der generischen Gruppen. Um 3 zu beweisen, mussen wir noch zeigen, daB die 3,-Theorie der generischen Gruppen 2: ist. Das folgt aus dem nachsten Lemma und IV, 2.2. (Zur Definition von S t ( X ) , P t ( X ) siehe vor IV, 2.1 und IV, 2.2.)
I
Lemma 2.11 ( n 3 3 ) . Die Relation
“2,‘IFP.W,(X,)”
ist
nf in k , j.
Beweis. Aus der Behauptung folgt, daB “ 2 k Ik S,“,(X,)” eine ist aquivalent zu 170,+l-Relation ist. Denn V i (2, n 2 k # 0 j(2# n %~)PP?(X)). Fur n = 2 ist das 1.9.2, unser Induktionsanfang. 2.11 gelte fur n. Aus der Darstellung g k IF P Z , ( X , ) gdw. k !% IF V X s.W,(X, ~ v x , )gdw. { X v Y x E %k} IF s.W,(X,) erkennt man, daB dann 2.11 fur n + 1 gilt. 0 ‘‘a
a’’
1
3 gilt nun, weil man jeder 3.-Aussage ein j zuordnen kann, derart, daB die Aussage genau dann zur 3.-Theorie der generischen Gruppen ) { w } Y P>(Xo). gehort, wenn in allen generischen ( w , Y ) I= S ~ J ( Wd.h. 4. Eine 3,-Aussage gilt genau dann in allen a.a. Gruppen, wenn ihre Negation nicht a.a. erfulbar ist, d.h. (nach 2) wenn die Negation in generischen Gruppen nicht gilt. Die V,-Aussagen, die in generische Gruppen gelten, bilden eine vollstandige Zi’; Menge. Also bilden die V,-Aussagen, die nicht gelten, eine vollstandige 2$Menge. 0
Algebraisch abgeschlossene Gruppen
567
Mo, die abzahlbare a.a. Gruppe, deren Skelett gerade die e.e. Gruppen mit r.a. Wortproblem sind, ist zwar nach 111, 4.4.4 minimal, aber nicht klein im eben angegebenen Sinn: Es gibt V,-Aussagen, die in generischen Gruppen, aber nicht in Mo gelten. Denn fur jede r.a. Menge B gibt es eine 3,-Aussage cp, die genau dann in einer a.a. Gruppe M gilt, wenn B E W * ( M )(siehe IV, 3.4 Beweis). Wenn B nicht rekursiv ist, ist nach der Folgerung 2 aus V, 3.1 i c p in allen generischen Gruppen gultig. Andererseits ist immer MoI= I/. In 3.5 werden wir zeigen, daD es a.a. Gruppen gibt, die “klein” sind (die alle VrAussagen, die in generischen Gruppen gelten, erfullen) und nicht generisch sind. 3. Die Ideale generischer Gruppen Fur generische Gruppen M zeigen wir: W , ( M ) ist ein Hauptideal. Fur A, B E W * ( M )ist A 6’B gdw. A in B. W , ( M ) ist unendlich.
6*B
gdw. A arithmetisch
Definition. 3.1. G C o heiBt vollstandig subgenerisch (v.s.g.), wenn G subgenerisch ist und H E “Ir fur ein H =, G (s. Beispiel zu 2.9). Aus 1.5 folgt, daD es 2” v.s.g. Mengen gibt. Lemma 3.2. ( H , G) sei subgenerisch, G v.s.g. Dann ist H G. Vergleiche 2.2.1 und 2.4.1. Aus 3.2 folgt: Z u jedem v.s.g. G gibt es ein subgenerisches H, so daB (H, G ) nicht subgenerisch ist; denn es gibt nur abzahlbar viele H mit H 6, G. Beweis. Toti sei die Menge aller i, fur die f i - die i-te partiell rekursive Funktion - total und injek4iv ist. Toti ist arithmetisch. Jedem i E Toti ordnen wir effektiv die r.a. Hornklasse %g(i) = {(X, Y) fi : X 6, Y}zu. Dann wahlen wir ein r.a. C, sodan {(X, Y ) (Y $Z “Ir} = i E C } . Wir setzen A = C U g[Toti]. B1,B2 seien r.a. Wenn fur kein i E C (B1,B z )E gi, ist B2aus Sr, also vollstandig r.a. Fur ein i E Toti ist dann fi :B IC B2.Wir haben S$(BI,B z ) gezeigt. Aus 1.10 folgt 9(0)~IlS f ( X , Y).Das sollte bewiesen werden. 0
I
u{%’i I
Satz 3.3. 1. In allen generischen o-Modellen gibt es eine
rekursive Isomorphie - eindeutig bestimmte v.s.g. Menge. 2. Wenn (a, 9)generisch ist und B E Y v.s.g., ist Y = { X x <* B } )= {X x CIB}.
I
I
- bis auf
568
M. Ziegler
1
3. ( w , { X X s1B } ) ist genau dann generisch, wenn B v.s.g. ist. 4. (w, Y:)seien generisch, Bi E Y: v.s.g. Dann ist fur jede LZ-Formel, die nur uber den Isomorphietyp uon X spricht, (w, 9’o)C cp(Bo) gdw. (w, 9’1) I= cp (BJ. 5 . 1st (w, Yo)C (w, Y,),( w , Yo)algebraisch abgeschlossen, (w, 9,) generisch, so ist 9, = Yo. 6. Generische w-Modelle sind abzahlbar. 7. In generischen w-Modellen gilt m u (siehe Beispiel uor IV. 1. 2).
Beweis. 1. Das generische w-Model1 (w, 9’)ist algebraisch abgeschlossen. Es gibt also ein B, E 9’fl V . B1ist v.s.g. Wenn B z E 9’ v.s.g., ist (B1, B z ) subgenerisch und nach 3.2 B 1 B z und B z S lBz. 2. Sei B v.s.g. aus dem generischen ( w , 9’).In allen a.a. w-Modellen B) subgenerisch ist {X X S1B } C { X X S * B} C 9’.Sei C E 9’.Da (C, ist, ist C s1B. 3. Wenn ( w , { X X s1B}) generisch ist, gibt es ein v.s.g. C S lB, (B, C) ist subgenerisch, also B s ,C. 1st B v.g.s., gibt es ein generisches (w, 9’)mit B E 9. AUS 2 folgt Y = { X x s ,B ) . 4. folgt aus 2.9. 5 . Yo,9, seien wie in 5 . Da (0, Yo)a.a., gibt es ein B E 7f fl Yo Weil B E Y,, ist B v.s.g., Y ,c { X x s1B ) c yo. 6 und 7 folgen aus 2. 0
I
1
I
1
I
Folgerungen 3.4. 1. Wenn B v.s.g. ist, gibt es eine - eindeutig bestimmte - abzahlbare a.a. Gruppe M B mit W * ( M B = ) { X X S 1B}. M B ist generisch. 2. Wenn B1und B1 v.s.g. sind, ist MBl = MB2 gdw. MBl < Ma. gdw. B1 Bz gdw. B1 s*Bz. 3. In jeder generischen Gruppe M gibt es eine e.e. Untergruppe G, deren Wortproblem u.s.g. ist. Es ist dann M z pMW‘G’. 4. Eine generische Gruppe ist partiell isomorph zu jeder a.a. Untergruppe. Abzahlbare generische Gruppen sind also minimal. 5 . Generische Gruppen erfullen die %-Amsage max (siehe IV, 2.5.2). 6. Es gibt zwei e.e. subgenerische Gruppen, die nicht gemeinsam in eine generische Gruppe einbettbar sind.
1
1
Beweis. 1. Wenn B v.s.g., ist ( w , { X X S , B}) a.a. und abzahlbar. Wegen 111, 3 (3.4 und 12.2) gibt es eindeutig eine abzahlbar a.a. Gruppe M mit W * ( M )= {X X S1B}. Weil w ( M ) generisch ist, ist M generisch (2.4.2). 2. M B =~ M E gdw. MB1< MB2 folgt aus 4. Aus B1= 1 BZfolgt
1
Algebruisch abgeschlossene Gruppen
569
naturlich MBl = ME.. Daraus folgt B1S * Bz.Wenn El 5*Bz ist (Bl,B2) generisch, also B 1 EZ. 3. B E W * ( M )sei v.s.g. Es gibt ein G E Sk(M) mit B S IW ( G ) . W ( G )ist dann v.s.g. und B W(G). 4. folgt aus 3.3.5, 5 aus 3.3.7, 6 aus der Bemerkung zu 3.2. 0 Nach V, 3.5 lassen sich die generischen Gruppen unter den a.a. Gruppen durch eine elementare Satzmenge auszeichnen, es gilt aber: Satz 3.5. Fur kein n gibt es eine Menge von 3,-Aussagen, die genau
dann in einer a.a. Gruppe gilt, wenn die Gruppe generisch ist. Beweis. (Skizze) Sei n gegeben. (a) Es gibt ein m E w mit: Fur jede S,-Aussage Q aus L z gibt es eine 2; Eigenschaft R ' ( X ) , sodaB fur alle B C w ( w , { X X S , B})F Q gdw. R ' ( B ) . Wir nennen B C w k-subgenerisch (k-s.g.), wenn fur jede dichte Zi-Farnilie 9 von r.a. Hornklassen B E Wir schreiben X It.+R ( X ) , wenn R ( B ) fur alle k-s.g. B aus 2. Aus dem Beweis von 1.8 erkennt man. (b) Es gibt ein k E w, so daB fur alle 2;-Eigenschaften R ( X ) und alle r.a. Hornklassen X Xbk R ( X ) gdw. X b R ( X ) . Mit einer leichten Anderung des Beweises von 1.3 kann man zeigen: (c) Es gibt eine arithmetische k-s.g. Menge B E 5'. Es sei B}. Wenn wir n a 4 annehmen, ist (w,sP) a.a. Denn sP= { X X wenn cp die P , (also auch S , ) - Aussage "a.a." ist, ist nach 3.3.3 5' It R ' ( X ) . Da 5' IFk R ' ( X ) , ist R ' ( B ) . Wir finden nun eine a.a. Gruppe N mit W * ( N ) = 9 '.N ist nicht generisch, weil B arithmetisch und nicht rekursiv ist. Wir zeigen aber, daB alle 3.-Aussagen p, die in generischen Gruppen gelten, in N gelten. Zu p gehort nach IV, 2.4.2 eine S, Aussage Q, die genau dann in w ( M ) gilt, wenn p in der a.a.Gr. M gilt. Es ist also, wenn p in allen generischen Gruppen gilt, Q in allen generischen w-Modellen, also allen (0,{ X X 6, G}) fur subgenerisches G E V, gulti . Aus 5' I t R '( X ) folgt aber Yf ItnR ( X ) und R ' ( B ) . Also ist (0,{ X f X 6 ,B } )I= Q und somit N I= p. 0
I
u9.
1
I
Der nachste Satz schlie5t die Lucke zwischen 1.2.2 (Jede arithmetische subgen. Menge ist rekursiv) und 2.6.2 (Wenn G subgenerisch ist, und H P*G, gibt es ein generisches o-Modell, in dem G,-aber nicht H liegt). Satz 3.6. (G, H ) sei subgenerisch. Dann sind uquivalent
(a) H ist in G implizit arithmetisch definierbar,
M. Ziegler
570
(b) H s*G, (c) H s+G (falls G # w ) . Beweis. (c) + (b) + (a) ist klar. (a)+ (b): Sei R ( X , Y )arithmetisch und {H}= { Y R ( G , Y ) } .Nach 1.8 gibt es eine r.a. Hornklasse X z rnit (G, H) E X z und Z21tR ( X , Y). Wenn H $* G, gibt es ein H ' # H mit (G, H ' ) E X z . Wegen 2.2 konnen wir annehmen, daB ( G , H ' ) subgenerisch ist. Dann wurde aber R (G, H') folgen. Wid. (b)+ (c): (G, H) sei in der r.a. Hornklasse X z , fur r, s E w sei X 2It (r, s) : X <* Y. Wir wahlen g E w \ G. Wir definieren W ' = { u ( i , u ) E W , } und bilden die r.a. Hornklassen XPf= {(X,Y ) i fZ W f [Y ] }und fur 'edes u E W ' X:,, = { ( X , Y ) D, JZ Y}. Wenn (A, B )E Xf n E W ' } , ist i fZ W : [ B ] und i fZ W : [ B ] , also (r, s ) : A $* B. Fur jedes i ist also
I
I
1
I
n{X:.lx
xZn n{x:, 1
(*I
E
w i }n 2; = 0.
Wenn C E X 2f l XPf und fur u E W ' C,, E X 2 n X:n Xt,,ware C r l n{C, x E W ' } im Durchschnitt (*). Es gibt also eine Funktion f : w + Pw( w ) mit: f ( i ) = D. fur ein u E W ' mit X 2 n Xf n X:,= 0 oder = { g } und X 2n Xf = 0. Wir konnen f rekursiv wahlen. Es ist f:HS'G. 0
1
Folgerung. G und F seien e.e. Untergrupgen der generischen Gruppe M. Wenn W ( G ) implizit arithmetisch in W ( F ) definierbar ist, ist W ( G )<* W ( F ) . Gruppen mit losbarem Wortproblem sind in jede generische Gruppe einbettbar. Wenn eine v.s.g. Menge denselben *-Grad wie das Wortproblem der endlich erzeugte Gruppe G hat, ist G in genau eine abzahlbare generische Gruppe einbettbar. Gibt es e.e. Gruppen mit nicht Iosbarem Wortproblem, die in verschiedene (nicht partiell isomorphe) generische Gruppen einbettbar sind? Diese Frage ist aquivalent rnit der Frage, ob die Ideale W , ( M ) fur generische M nur aus zwei Elementen bestehen. Denn wenn B v.s.g. und fur ein G . < M a G f * B, also B P*W ( G ) , gibt es eine generische Gruppe N rnit G < N und B tif W * ( N ) (2.6.2). Satz 3.7. Fur alle generischen Gruppen M ist das Ideal W , ( M )
unendlich. Zum Beweis definieren wir:
Algebraisch abgeschlossene Gruppen
571
Ein *-Grad a wird durch die r.a. Hornklasse X beschrieben, wenn X # 0 und jedes r.a. Element von X aus a ist. Der *-Grad einer vollstandigen r.a. Menge wird z. B. durch 5’ beschrieben. Wir brauchen ein Lemma:
Lemma 3.8. a , bzw. az seien durch X , und X z beschrieben, und al P a z . Wenn dann (G,M ) subgenerisch ist, G E Xl, H E X2, ist G P*H. Beweis. Sei G C * H und also fur ein rekursives f f : G 6 +H (3.6). Nach 1.8 ist X’tl-f :X 6 +Y fur eine r.a. Hornklasse mit (G, H)E X 2 . Es gibt r.a. (A,B) E X 2 mit A E al und B E az. Es ist also f : A P+B. Sei i E A + f ( i ) C B fur ein i E w. (A,B) ist in der r.a. Hornklasse X i : = { ( X ,Y)l i E A t * i E X A f ( i ) C B - f ( i ) C Y } . Also X Z nX i Z 0 . Es kann aber keine subgenerischen ( C , D ) in X Z nX i geben. Wid. 0 Satz 3.7 folgt nun aus dem nachsten Lemma: Wir wahlen fur jeden der *-Grade eine beschreibende r.a. Hornklasse X und ein Element l W * ( M ) . Diese Elemente haben dann alle verschiedenen aus X r *-Grad.
Lemma 3.9. Es gibt eine absteigende unendliche Forge von beschreibbaren *-Graden. Beweis. Wir brauchen aus der Rekursionstheorie: I ([24]). Jede nichtrekursive r.a. Menge 1aBt sich disjunkt in zwei r.a. Mengen zerlegen, die unvergleichbare Turinggrade haben (S. 175, Sacks). I1 ([24]). Es gibt maximale r.a. Mengen R. D.h. R ist r.a., w \ R unendlich, fur alle r.a. A folgt aus R C A, daB A \ R oder w \ A endlich ist (S. 235, Friedberg). Wir benutzen nur eine Folgerung aus I: 111. Jede nichtrekursive r.a. Menge A ‘ lupt sich disjunkt in zwei nichtrekursive r.a. Teilmengen A, B mit A <*A’ zerlegen. Beweis. Wir finden nach I A und B mit unvergleichbaren Turinggraden. A und B sind somit nichtrekursiv. Weil A r.a., ist A < = A ’ . w \ A < : A ’ gilt, weil x $Z A gdw. x E B oder x $Z A ’ . Also A < * A ’ . Ware A ’ S * A , wurde B < * A folgen. Wir wenden nun I11 sukzessive auf die maximale r.a. Menge R an und bekommen zwei Folgen AP Bi von nichtrekursiven r.a. Mengen .. .A. = mit R = A,U B,,, u B.+, .. . , R * > A : > A ; > A 2 . .. . Alle An/=* sind beschreibbar:
572
M. Ziegler
I
Setze X = { X A,, C X A X n ( B oU B,.. . U B,) = 0). Offenbar ist A,, E X. Sei B E X r.a. Dann ist B U B,, U . . . U B, 3 R . Ware w \ ( B U Bo U . . . U B,) endlich, ware Bo U . . . U B, rekursiv, also A , = * R . Also ist ( B U B o U . - . U B . ) \ R = B \ A , endlich. Daher A.=*B. 0 Folgerung. Es gibt eine endlich erzeugte Gruppe mit nicht-losbarem Wortproblem, die in zwei generische Gruppen M , N M g PN einbettbar ist.
Kommen in jeder a.a. Gruppe unendlich viele *-Grade vor? 4. Drei Eigenschaften generischer Gruppen Die Turingreduzierbarkeit unterscheidet sich von der *-Reduzierbarkeit dadurch, daB zwar fur alle A w \ A < , A und A:<,A, im allgemeinen aber nicht w \ A < * A und A a G * A .Wir zeigen ( G C w ) : 1st G nichtrekursiv und subgenerisch, so ist w \ G # * G. 1st G v.s.g., so ist GZ $ * G. Es gibt nichtrekursive subgenerische G mit GZ s* G.
Satz 4.1. Wenn (G,w \ G ) subgenerisch ist, ist G rekursiv. Beweis. Sei ( G , w \ G ) subgenerisch und G nichtrekursiv. Nach 1.8 gibt es eine r.a. Hornklasse 2’ mit (G,w \ G )E X z und X zIF ( Y = X \ w ) A “ X nicht rekursiv”. Aus 1.7 folgt X zC { ( X , Y ) Y = X \ a}.Wenn A, B r.a. und ( A ,B ) E X z , sind A und B rekursiv (also auch subgenerisch). Daraus folgt Xzll.c“X nicht rekursiv”. 0
I
Folgerung. 1. Wenn G subgenerisch ist urrd w \ G GeG, ist G rekursiv. 2. ([ 141) Jede endlich erzeugte einfache Untergruppe einer generischen Gruppe hat losbares Wortproblem. ( A u s IV, 3.1.2 folgt, dajl sich diese Eigenschaft der generischen Gruppen in a.a. Gruppen durch eine V,-Aussage ausdriicken lupt.) Beweis. 1. Wenn w \ G C e G , ist w \ G s* G. Nach 2.4.1 ist daher mit G auch (G,w \ G ) subgenerisch. 2 folgt aus 1 weil nach 111, 12 fur einfache F w \ W ( F )CeW ( F ) . ( E s gibt nichtrekursive subgenerische G , fur die auch w \ G subgenerisch ist.) Definition 4.2. Eine r.a. Hornklasse 2,heiBt ordnend, wenn fur alle n , m 3?,k q ( n ) + q ( m ) oder 3?,k q ( m ) + q ( n ) . ( X ist also genau dann
Algebraisch abgeschlossene Gruppen
ordnend, wenn oder B C A.)
X (bzgl.
C ) eine Kette ist: fur alle A, B E
573
X
ist A C B
Lemma 4.3. 1. Wenn G E 2, X ordnend, ist G,fS * G. 2. Wenn G subgenerisch, Gz S * G, gibt es ein ordnendes X , G E
X.
Beweis. 1. 2,sei ordnend und G E 2,. Dann ist ( x , y ) $Z GZ gdw. x $Z G und y $Z G gdw. 2, k q ( x ) + q ( y ) und y $Z G oder %, ! - q ( y ) + q ( x ) und x $Z G. Das zeigt, daB w \ G z s i G . 2. Sei G subgenerisch und w \ G z= W : [ G ]Es . gibt eine r.a. Hornklasse G E X,X Ik o \Xz = W : [ X ] .X ist ordnend. Denn sei n , m E w und & Y q ( n ) - + q ( m ) .& Y q ( m ) + q ( n ) . Dann gibt es subgenerische Mengen G I ,G 2 E X mit n E G I ,m $Z G I , m E G2, n $2 G2.D a (n, m ) E qf,ist (n, m ) $Z W:[G,](j = 1,2). Also ist G In G2E X f l { X (n, m ) $Z W : [ X ]A n $Z X A m $Z X } = : Xo. Fur alle A E X o ist aber w \ A ; # W ! [ A ] Wid. . 0
I
Satz 4.4. Wenn G v.s.g., ist (Gz, G ) nicht subgenerisch. Folgerungen. 1. Wenn G v.s.g., H =* G , ist Hz $* H. 2. In jeder generischen Gruppe M gibt es eine e.e. Untergruppe F, sodap f u r kein HZ E F * H in M einbettbar ist. Beweis der Folgerungen. 1. Aus 2.4.1 folgt GzP* G. Wenn A =* B, ist aber immer auch A z = * B z . 2. Man wahlt F so, daB W ( F ) v.s.g. ist. Nach 111, 2.13.2 ist W ( F ) a s *W ( F * H). Z u m Beweis von 4.4 konstruieren wir zunachst eine r.a. Menge B, die in keiner ordnenden r.a. Hornklasse liegt. Wir wahlen dazu eine rekursive Relation R, so daB fur alle i, n, m aus w &, l k q ( n ) + q ( m ) gdw. 3 k R ( k , i, n, m ) .
1
+ 1) A V j < k i R (j,i, 2i + 1,2i))} U (2i + 1 I3j (Vk S j i R ( k , i, 2i,2i + 1 ) A R ( j , i, 2i + 1,2i))}
B := (2i 3 k ( R ( k , i, 2i, 2i
B ist r.a. Sei %#ordnend. Wir wahlen k (ev. = m) minimal mit R ( k , i,2i, 2i + 1 ) und j minimal mit R(j, i, 2i + 1,2i). D a 2, ordnend ist, ist k oder j # m. Wenn k s j , ist 2i E B, 2i + 1 $2 B, #%' !- q(2i)+ q(2i + 1). Wenn j < k , ist 2i + 1 E B, 2i$Z B, &, !- q(2i + 1 ) + q(2i). In beiden Fallen folgt B $Z %,. Sei nun G v.s.g. und (Gz,G ) subgenerisch. D a G z S T G ,folgt aus
M. Ziegler
574
3.6, daB GZS* G. G ist also in einer ordnenden r.a. Hornklasse X enthalten. Wir konnen 0.E. G E 7 f annehmen. Sei nun A r.a. und A E 2’n 7 f . Dann ist A vollstandig, und es ist f : B c ,A fur eine rekursive Funktion f . B ist dann ein Element der ordnenden Hornklasse {X f[X] E X}.Wid. 0
I
Satz 4.5. In jedem generischen ( w , 9’) gibt es ein nichtrekursives G E 9’ mit G Z E 9’. Folgerungen. 1. In jedem generischen (w, 9’) gibt es ein niclitrekursives G mit G 3 c * G. 2. In jeder generischen Gruppe M gibt es eine e.e. Untergruppe F mit nichtlosbarem Wortproblem, sodup alle F * F, F * F * F, . . . ,F * H fur e.e. Gruppen H mit losbarem Wortproblem in M einbettbar sind. 3. Es gibt nicht-rekursive subgenerische G, fur die auch w\G subgenerisch ist. Beweis der Folgerungen. D a G 3 S T G ,ist GZ c*G, wenn (G,G3) subgenerisch ist (3.6). Also gilt 1. 2 folgt aus 111, 2.13.2, weil W ( F * F), W ( F * H ) c*W ( F ) Y .Man wahlt F < M mit W ( F ) = * G fur ein G E W * ( M )mit G3<* G. Aus 111, 2.8 folgt G * = G. 3 Man iiberlegt leicht: Wenn G in einer r.a. ordnenden Hornklasse ist, ist G subgenerisch gdw. w \ G subgenerisch. Beweis von 4.5. = sei die r.a. Aquivalenzrelation von 4.6. Wir identifizieren w rnit Q’ und definieren die r.a. Hornklasse X : = { X C Q ’ ( O E X ,I E X , V r E X V s ( ( s < r v s = r ) - + s E X ) } . ist offenbar ordnend, nichtleer (denn O / = E X)und enthalt keine rekursive Menge. Wenn (w, 9’)generisch, wahlen wir G E X E 9’.Dann ist G3<* G und also G Z E 9’.
x
Lemma 4.6. Auf der Menge Q’ der rationalen Zahlen aus dem Interval1 [0,1] gibt es eine r.a. Aquivalenzrelation = mit (i) Die Aquivalenzklassen sind konvex, d.h. r < s < t, r = t 3 r = s. (ii) O f 1. (iii) Es gibt keine rekursive Teilmenge B C Q’ mit 0 E B, 1 6! B,f.a. r,sEQ’, r E B , s=r j sEB. Beweis. n ( n , ,n2)sei eine rekursive Bijektion von w rnit w z . f : w + Q” sei eine partiell rekursive Auswahlfunktion fur R ( n , r, s): 1 r E W,,, s E W,. D.h. wenn gdw. n E w , r , s E Q ’ , r < s < r + 2”+‘’
Algebraisch abgeschlossene Gwppen
575
f ( n ) = (r, s), ist R ( n , r, s); und wenn R(n, r, s), ist f(n) definiert. J = {[r, s ] C Q ' (ex. n f(n) = (r, s)} ist eine r.a. Menge von abgeschlossenen Intervallen. Wir defininieren r = s gdw. es gibt .lo,. . .J k E J mit r E Jo, s E J k und fur i = 0, 1 , . . . k - 1 JiflJi+]# 0. = ist offenbar r.a. und die Aquivalenzklassen sind konvex. Weiter ist 0 f 1, denn die J E J konnen 0, 1 nicht uberdecken, weil ihre Gesamtlange kleiner als -+-+...-+... 1 1 1 = 1 ist. 2 4 2"+l Sei nun B C Q', B rekursiv, B = W,,, o \ W,. Wenn 0 E B und 1 1 6 B gibt es ein r' E B, s' 6 B mit r' < s' < r'+ 2 n + ~. f(n) ist also definiert und sei = (r, s). Dann ist r E B,s Damit ist das Lemma bewiesen. I7
B und [r, s] E J d.h. r = s.
Literatur [l] J. Barwise und A. Robinson, Completing theories by forcing, Annals of Mathematical Logic, Vol. 2 (1970) 119-142. [2] O.V. Belegradek, Definability in algebraically closed groups, Mathematical Notes of the Academy of Sciences of the USSR, Vol. 16, No. 3 (1974) 813-816. [3] O.V. Belegradek, Uber algebraisch abgeschlossene Gruppen, Algebra i Logika 13, No. 3 (1974) 239-255 (Russian). [4] W.W. Boone und G. Higman, An algebraic characterisation of groups with soluble word problem, J. Austral. Math. Soc. 18 (1974) 41-53. [5] C.C. Chang und H.J. Keisler, Model Theory (North-Holland Publishing Co., Amsterdam, 1973). [6]P. Eklof und G. Sabbagh, Model-completions and modules, Annals of Mathematical Logic, Vol. 2 (1970)251-295. [7] R. Fraisst, Sur I'extension aux relations de quelques proprietts des ordres, Ann. Sci. Ecole Norm. Sup. 71 (1954) 540-542. [8] C. Higman, B.H. Neumann und H. Neumann, Embedding theorems for groups, Journal of the London Mathematical Society, Vol. 24 (1949)247-254. [9] C. Higman, Subgroups of finitely presented groups, Proc. Roy. Soc. London (A) 262 (1961)455-475. [lo] J. Hirschfeld und W.H. Wheeler, Forcing, arithmetic, division rings, Springer Lecture Notes in Mathematics Vol. 454 (1975). [Ill C. Karp, Finte quantifier equivalence, in: The theory of models, Proc. of the 1963 Symposium at Berkeley (North-Holland Publishing Co., Amsterdam, 1%5). [12] H.J. Keisler, Forcing and the omitting types theorem, preprint. [13] D.W. Kueker, Definability, automorphisms and infinitary languages, in: Syntax and semantics of infinitary languages, Springer Lecture Notes in Mathematics, Vol. 72, Berlin, 1968. [14] A. MacIntyre, On algebraically closed groups, Annals of Mathematics Vol. % (1972) 53-97.
576
M. Ziegler
[15] A. MacIntyre, Omitting quantifier-free types in generic structures, Journal of Symbolic Logic Vol. 37 (1972) 512-520. [16] A. MacIntyre, On algebraically closed division rings. [ 171 A. MacIntyre, Martin’s axiom applied to existentially closed groups, Mathematica Scandinavica, Vol. 32 (1973) 46-56. [18] C.F. Miller 111, The word problem in quotients of a group, unpublished. [19] B.H. Neumann, Some remarks on infinite groups, Journal of the London Mathematical Society, Vol. 12 (1937) 12&127. [20] B.H. Neumann und H. Neumann, Embedding theorems for groups, J. London Math. SOC. 34 (1959) 465-479. [21] B.H. Neumann, A note on algebraically closed groups, Journal of the London Math. SOC. Vol. 27 (1952) 247-249. [22] B.H. Neumann, The isomorphism problem for algebraically closed groups, in: Word Problems (ed. by W.W. Boone et al.) (North-Holland Publishing Co., Amsterdam, 1973). [23] A. Robinson, Infinite forcing in model theory, Proc. of the Second Scandinavian Logic Symposium, Oslo, 1970 (North-Holland Publishing Co., Amsterdam, 1971). [24] H. Rogers, Theory of recursive functions and effective computability, New York, 1%7. [25] P.E. Schupp, Some reflections on HNN extensions, in: Proc. of the Second Intern. Conference on the Theory of Groups, 1973, Springer Lecture Notes in Mathematics, Vol. 372 (1974) 611-632. [26] W.R. Scott, Algebraically closed groups, Proc. of the American Math. Soc., Vol. 2 (1951) 118-121. (271 H. Simmons, The word problem for absolute presentations, Journal of the London Math. Soc., Vol. 6 (1973) 275-280. [28] W.H. Wheeler, Algebraically closed division rings, forcing and the analytical hierearchy, Diss. Yale Univ. 1972. [29] M. Ziegler, Gruppen n i t vorgeschriebenem Wortproblem, Math. Ann. 219 (1976) 43-51. [30] M. Ziegler, Ein rekursiv aufzalbarer btt-Grad, der nicht zum Wortproblem einer G ~ p p behort, e Zeitschrift fur Math. Logik und Grundlagen der Math. 1976. [31] G.E. Sacks, Degrees of unsolvability, Ann. of Math. Studies No. 55, Princeton, 1%3. (321 R.M. Smullyan, Theory of formal systems, Ann. of Math. Studies, Princeton, 1961. [33] M.Y. Trofimov, Definability in algebraically closed systems, Algebra i Logika (14) 1975. [34)S.Shelah und M.Ziegler, Algebraically closed groups of large cardinality, Journal of Symbolic Logic 44 11979) 130-140. [35] A. MacIntyre, Existentially closed structures and Jensen’s principle, Israel J. Math. 25 (1976) 202-210. [36] K. Hickin, A Maclntyre, Algebraically closed groups: Embeddings and centralizers, Proc. Oxford Logic Conf., Word Problems 11, this volume, pp. 141-155. (37) K. Hickin, Maximal subgroups of algebraically closed groups (preprint). [38] S. Shelah, Existentially closed groups in N, with special properties, Bull. of the Greek Math. Soc. (submitted). [39] S. Shelah, Existentially closed groups in continuum (preprint). [40]O.V. Belegradek, Elementare Eigenschaften algebraisch abgeschlossener Gruppen (in Russian), Fund. Math. XCVIII (1978) 83-101.
S.I. Adian, W.W. Boone, G . Higman, eds., Word Problems I1 @ North-Holland Publishing Company (1980) 577-578
PROBLEMS P.M. COHN Bedford College, London
The following is a list of open problems relevant to the topic of the working conference Decision Problems in Algebra. Some were obtained in discussion with others at this conference, some elsewhere. 1. (a) Let K be a skew field and A an n x n matrix over K. Show that there is an extension field L of K and a E L such that A - aZ is singular (i.e. a is a ‘singular eigenvalue’ of A). (b) If moreover K is a k-algebra, where k is an algebraically closed commutative field, and only extensions that are k -algebras are admitted, show that A has a non-zero singular eigenvalue unless A is triangularizable over k. Note: For 2 x 2 matrices (a) and (b) have been verified, cf. P.M. Cohn, Skew field constructions, LMS Lecture notes, No. 27 (Cambridge University Press, 1977). There it is also shown that the conditions are needed. 2. As a counterpart to 1, if K is an infinite skew field with finite centre k and A is an n x n matrix over K , does there exist a E K such that A - a1 is non-singular? Note: If K is finite, such a may not exist (e.g. for a suitable diagonal matrix A). If k is infinite, the reduction theory of matrices over K [ t ] shows that such a always exists in k. 3. (Asked by A. Macintyre) Find an existential sentence about skew fields which is true in an existentially closed skew field K precisely if K has characteristic 0. Note: In the commutative case one cannot have such a sentence because the class of algebraically closed commutative fields admits ultraproducts (so a suitable ultraproduct of algebraically closed fields of finite characteristic ,will be algebraically closed of characteristic 0). If one does not insist on an existential sentence one can proceed as follows: The sentence w(X)
: 3y,Z (Xy = y X 2 A
X2Z
= ZX2 A
XZ
# ZX
A
y # 0)
(due to W.H. Wheeler) expresses the fact that (in an existentially closed field) x is transcendental over the ground field. The sentence 577
Problems
578
B ( x ) : 3 y ( x y - y x = 1)
due to M. Boffa and P.V. Praag, does t h e same in characteristic 0. In finite characteristic, B ( x ) holds if and only if x is either transcendental or purely inseparable over (but not in) the ground field k. Now Wheeler also has a sentence W ( x , x ’ ) which expresses (in an existentially closed field) that x , x ’ are independent commuting indeterminates, and B , ( X ) :3 y , Z ( w ( Z ) A
1w
(X,2 )A
X I = ZX A
YZ = Zy
A
Xy - YX = 1)
expresses the fact that for some z, purely transcendental over k, x is algebraic but not separable over k ( I ) . So 3 x B l ( x ) holds precisely when K has characteristic # 0. 4. (Asked by S. Shelah) Let ( K , ) be a family of skew fields, all containing E as subfield, and denote by P = * B KA their coproduct ( = free product) over E. If a single relation U I U Z .. . U, =
1,
U, E KA,, A, #
A,+I
is imposed, under what conditions is the resulting ring P (i) an integral domain, (ii) embeddable in a skew field? Note: J. Lewin and T. Lewin (J. Algebra, 52 (1978) 39-74) have proved that if a group with a single defining relation is torsion free then its group algebra is embeddable in a skew field. 5 . (Asked by S. Eilenberg) Is every finitely generated group which is embeddable in a finitely presented semigroup also embeddable in a finitely presented group? Note: V.L. Murskii (Mat. Zametki 1 (1967) 217-224) has proved the following analogue of Higman’s embedding theorem: A finitely generated semigroup is embeddable in a finitely presented semigroup if and only if it is recursively presented. The proof of Murskii’s embedding theorem is much easier than that of Higman’s embedding theorem and the above question is of interest because a positive answer obtained without using Higman’s result would yield a new proof of Higman’s embedding theorem. Of course the answer to question 5 is “Yes” - for this follows by Higman’s result trivially. 6. (Asked by E. Formanek) Does every skew field contain a free fVegmup of rank two? Note: For finite-dimensional division algebras this result was proved 6 y J. Tits, J. Algebra 20 (1972) 250-270. 7. (Asked by Y. Zalcstein) Is every finitely generated periodic subgroup of t h e multiplicative group of a skew field finite?