(7.17)
by our condition on the r:ltional part of y. Since .0 = {I, ay} (see the proof of Lemma 1) and A cD, we easily obtain that k is divisible by a; that is, k = as with integral s. Since m = N(A) = P(l/a) (corollary of Lemma 1), then (7.18) We shall show that the representation of the module A in the form A
= as{1, y},
(7.19)
where a, s, and y satisfy (7.18), (7.15) and (7.17), is unique. Indeed, if = a ISI{1' yd, where ai' Sl' and YI satisfy the same requirements, = al SI and hence {I, y} = {I, yd. By the corollary of Lemma 1 we thus have a = a l and hence also S = SI' Further, since y in {1, y} satisfies (7.15) and (7.17), it is uniquely determined; that is, y = YI' Conversely, for given m choose a and S so that (7.18) holds. If band c satisfy the conditions as{l, y} then as
b2
-
4ac
= D,
(a,b,c)=I,
-a";; b < a,
(7.20)
then, with y given by (7.16), the module A = as{1, y} will be contained in its coefficient ring .0 = {I, ay} and its norm will be a 2s2(l/a) = m. Thus to obtain the module A we need to find all four numbers S > 0, a > 0, b, c satisfying conditions (7.18) and (7.20). If we can devise an algorithm for solving the question of strict similarity of modules of the field R( .jd), then, after listing all modules A c .0 with norm m, we can determine those which are similar to the module M- I • By Theorem 5 this will yield all solutions of (7.13). The following assertion easily follows from Theorem 5. Theorem 6. Let m be a natural number. Then m is represented by some primitive binary quadratic form with discriminant D if and only if there is a module A with norm m contained in the order .0 with discriminant D, with .0 the coefficient ring of A. This is in turn equivalent to the existence of
Sec. 7]
REPRESENTATION OF NUMBERS BY QUADRATIC FORMS
integers s> 0, a> 0, b, c satisfying the conditions: m (a, b, c) = 1, -a ~ b < a.
= as 2 , b2 -
145
4ac
= D,
In case D is the discriminant of a maximal order 0, the second assertion of Theorem 6 is simplified. Namely, we have Theorem 7. Let D be the discriminant of a quadratic field (that is, the discriminant of a maximal order). In order that the natural number m = as 2 , where a is square-free, be represented by some primitive binary form with discriminant D, it is necessary and sufficient that the congruence
x 2 == D (mod 4a)
(7.21)
be solvable. The proof of Theorem
I
is left to the reader.
7.7. Similarity of Modules in Imaginary Quadratic Fields In the case of an imaginary quadratic field R( /d), d < 0, there is a particularly simple method for solving the problem of similarity of modules. The geometric representation of a number rx E R( .jd) by a point in the space 91 2 (see Section 3.1) coincides with the usual representation of complex numbers in the complex plane. The points of a full module Me R( .jd) correspond to the points (or vectors) of some full lattice in 91 2 • The lattice which corresponds to the module M will also be denoted by M. The effect of multiplying all points of the lattice M by the complex number ~ l' is to rotate the lattice M by an angle arg ~ and to expand it by a factor I~I, so similar lattices M and ~M are also similar in the sense of elementary geometry. All subsequent results will be based on this simple fact. The question of similarity of lattices in the plane is solved by constructing a special basis for each lattice, called a reduced basis. A reduced basis rx, /3 consists of a shortest nonzero vector rx and a shortest vector /3 which is not collinear with it (and satisfies some further conditions). We now show that in any lattice M such a pair of vectors rx and /3 always forms a basis. For if this were not the case, then M would contain a vector ~ = urx + v/3 with the real numbers u and v not both integers. Adding to this vector a certain integral linear combination of rx and /3, we may clearly assume that lui ~ t and Ivl ~ t. If v l' 0, we must have I~I ~ IPI, which contradicts the inequality
°
I~I
< lurxl + lu/31 ~ -!-/rxl + tl/31 ~ 1/31.
If v = 0, then I~I = lurxl ~ 11rxl < Irxl, which violates the choice of rx. Our assertion is proved. If rx is any shortest vector of M and /3 is any shortest vector among those not
146
NUMBERS BY DECOMPOSABLE FORMS
[Chap. 2
collinear with ct, then the length of the projection of fJ on ct does not exceed tlctl. For among the vectors of the form fJ + nct (n an integer) there clearly is one for which the length of the projection is ~ tlctl. On the other hand, the vector of the form fJ + nlY. with shortest length is also the one with shortest projection. We now consider the set of al1 nonzero vectors in M with shortest length, and let w denote the number of such vectors. Since if IY. is in this set - ct also is, the number w is even. Further, the angle between two shortest vectors IY. and ct' cannot be less than rr/3, since otherwise the vector ct - ct' would be of shorter length. Hence w ~ 6 and we have the fol1owing possible cases: w = 2, w = 4, w =6.
{3
FIG. 1
FIG. 2
We now construct a reduced basis for the lattice M. If w = 2 we take as ct either of the two shortest vectors. There may be two or four vectors in the set of vectors, noncol1inear with ct, of shortest length (see Figures I and 2). As fJ we choose that one for which the angle q> between ct and fJ in the positive direction (counterclockwise) is smal1est. If w = 4 or w = 6 we take as reduced
{3
FIG. 3
FIG. 4
Sec. 7]
REPRESENTATION OF NUMBERS BY QUADRATIC FORMS
147
basis a pair of vectors of shortest length such that the angle between rt and P in the positive direction is as small as possible. It is easily seen that the reduced basis ofa lattice is uniquely defined up to a rotation which takes the lattice into itself. In the cases w = 2, and w = 4 and n/3 < q> < n/2 (see Figure 3), there are two reduced bases, which are obtained from each other by a rotation of angle n. For w = 4, q> = n/2 (Figure 4) we are dealing with a square lattice with four reduced bases, which can be obtained from one another by rotations through angle n/2. Finally in the case w = 6, we have six reduced bases, and they are transformed into each other by a rotation through angle n/3 (Figure 5; the circle is divided into six equal parts, since the angle between shortest vectors cannot be less than IT/3). Using the concept of a reduced basis, we can easily solve the question of similarity of lattices in the plane.
FIG. 5
Theorem 8. The lattices M and M, in 91 2 are similar if and only if their reduced bases are similar (that is, are transformed into each other by a rotation and an expansion).
P and rt" PI be reduced bases of the lattices M and MI' If M I , then ~rt, ~P clearly is a reduced basis for MI' As we have seen, this basis can be obtained from the basis rt" P, by a rotation. Therefore there is a number '1 (which is a root of unity of degree I, 2, 3, 4, or 6) such that '1~rt = rt" '1~P = Pl' Hence the basis rt" P, is obtained from the basis rt, P by rotation through the angle arg('1~) and expansion by a factor 1'1~I, so that they are similar. The converse is clear. We now turn to the description of the set of classes of similar modules of an imaginary quadratic field. Let M be any module in R( /d), d < 0, and let rt, P be any reduced bases for M. We pass to the similar module (l/rt)M = {I, ')'}, where')' = P/rt. The basis {I, ')'} here is also reduced. From the definition of a reduced basis it easily follows that')' satisfies
Proof. Let ct., ~M =
NUMBERS BY DECOMPOSABLE FORMS
148
Imy > 0,
--!- < !y/ > I
(7.22)
Rey ~-!-,
if
-
1<
[Chap. 2
(7.23) Re y < 0, (7.24)
Definition. The number y of an imaginary quadratic field is called reduced if it satisfies conditions (7.22), (7.23), and (7.24). The module {1,y} is called reduced if y is reduced.
Geometrically, y is reduced if it lies in the region r described in Figure 6 (the indicated part of the boundary, including the point i, is included in r; the rest is not).
_-'-_.1..-----'-_-'-_-'-
o
-I
FIG.
-_x
6
Theorem 9. Each class of similar modules of the imaginary quadratic number field R( /d), d < 0, contains one and only one reduced module. Proof. We have already shown that each class contains a reduced module.
We need only show now that distinct reduct modules cannot be similar. We first show that if y = x + yi is reduced, then the numbers I, y form a reduced basis for the lattice {I, y}. We must show that y is the smallest of the vectors of the lattice {l, y} which do'not lie on the real line, that is, that Ik+/yl ~ IYI for any integers k and / =f. O. Since Ixl ~ 1,
Ik ± yl2 If
III ~
=
(k
± X)2 + y2 ~ x 2+ y2 = lyl2.
2, then
Ik + [y12
~ [2 y 2
> 2y 2 > x 2+ y2
=
IYI2,
which proves our assertion. Now let y and YI be two reduced numbers. If the modules {I, y} and {I, yd are similar, then by Theorem 8 the bases {I, y}
Sec.7]
REPRESENTATION OF NUMBERS BY QUADRATIC FORMS
149
and {l, yd are similar. But this is possible if and only if Y = Yl' Theorem 9 is completely proved. To solve the question of the similarity of modules in an imaginary quadratic field we must have an algorithm for finding the reduced module similar to a given module. Such an algorithm is formulated in Problem 24. Two given modules M 1 and M 2 are similar if and only if their reduced modules coincide. Remark. In the proof of Theorem 9 we never actually used the fact that the module under consideration was contained in some imaginary quadratic field. The assertion of the theorem hence is true for any lattice in the plane: any lattice in the complex plane is similar to one and only one lattice of the form {l, y}, where Y is some number of the domain r, which is described in Figure 6. By Lemma 2, which is applicable to arbitrary lattices in the plane without any changes, two lattices {l, y} and {I, A} are similar if and only if A and yare connected by
A = ky m"
+I + 11'
kll-ml=±I,
with rational integers k, I, m, n. Two such nonreal complex numbers are called modularlyequimlent. Hence we have shown that every nonreal complex number is modularly equivalent to one and only one number of the region r. The region r itself is called the modular domain. Its points are in one-to-one correspondence with the set of classes of similar lattices in the plane. The question of similarity of planar lattices is connected with many important questions in the theory of elliptic functions. A field of elliptic functions is given by its period lattice, and two such fields are isomorphic if and only if their corresponding period lattices are similar (see, for example, C. Chevalley, "Introduction to the Theory of Algebraic Functions of One Variable," 1951). Hence the points of the modular domain r are in one-to-one correspondence with isomorphism classes of fields of elliptic functions. Consider now the classes of similar modules which belong to some fixed order D with discriminant D < 0. Let the module {l, y}, y E r, belong to the order D. If we use the notations of Lemma 1 and write y in the form
y
=
JIDI
- b+ i
2a
'
then conditions (7.23) and (7.24) yield - a
~
b < a, ~
c?a
for
b
c>a
for
b > 0.
0,
(7.25)
150
NUMBERS BY DECOMPOSABLE FORMS
[Chap. 2
Hence to find a full system of reduced modules of an imaginary quadratic field which belong to the order with discriminant D, we need only find all triples of integers a > 0, b, c which satisfy (7.25) and also the condition
D = b2
4ac,
-
(a, b, c) = 1.
(7.26)
By Theorem 3 of Section 6 the number of such triples is finite, a fact that can be directly verified from the inequalities
jDI = 4ac
- b 2 ~ 4a 2
Ibl~a<
-
a2
= 3a 2 ,
IDI
J
3'
for given D, so that there are only finitely many possibilities for a and band hence also for c. Example 1. We shall find the number of classes of modules which belong to the maximal order of the field R() -47). Since here D = -47, then Ibl ~ a < )47/3. Since for odd D the number b is also odd, we have the following possibilities: b = ± I, b = ± 3. In the second case we would have to have b 2 - D = 56 = 4ac, ac = 14, 3 ~ a ~ c, which is impossible. If b = ± 1, then b 2 - D = 48 = 4ac, so that
a=l,
c=12;
a = 2,
c = 6;
a
= 3, c = 4.
Since the case b = a = 1 must be excluded, the maximal order of the field R( 47) has five classes of similar modules. Each class contains a reduced module {I, y}, where y is one of the numbers
J-
1+
iJ47
± 1 + iJ4i
± 1 + iJ4i
4
6
2
In the preceding section we remarked that the existence of an algorithm for determining the similarity of modules in quadratic fields allows us to solve equations of the form (7.13). Example 2. We shall find all numbers in the module M = {13, 1 + 5i} with norm 650. In this case the coefficient ring is the order .0 = {I, 5i} with discriminant D = - 100. Since N(M) = 13, we must first enumerate the modules A cD, which belong to the order .0 and have norm m = 650/13 = 50. From conditions (7.18) and (7.20) we have the following possibilities: (1)
s = 5, a = 2, b = -2, c = 13;
(2) s=l,
a=50,
b=10,
(3) s = I,
a = 50, b = -10, c = I;
(4) s= I,
a= 50,
b= -50,
c=l;
c= 13.
Sec. 7]
REPRESENTATION OF NUMBERS BY QUADRATIC FORMS
151
For each of these four cases we form the module A of the type (7.19) and find the reduced module similar to it: I
+ 5i}
10{1'-2- ,
- I -+- i} -----w= (- 5 + 5i){l, 5i}, I + i} 50 { I, i l l = (5 + 5i){l, 5i},
50 { I,
{ 5 i}
{I
+ = lOi I, - + 501'10 2 5i} - . We also compute the reduced module for M- 1 : M
_ 1
f I - 5i}
I - 5i { 13
I
+ 5i}
=\1 - - = - - 1 , - -
\'
13
2'
We eliminate the ~odule A in cases (2) and (3), since it is not similar to M- 1 • In cases (I) and (4) the equality A = ~M-l holds for ~ = 5 + 25i and ~ = - 25 + 5i. Since D contains only two units, ± I, the module M has four numbers with norm 650: ±(5 + 25i) and ±( -25 + 5i). We have thus also established that the equation 13x 2 + 2xy + 2y 2 = 50 has four integral sol utions: x
= 0,
x = 2,
y = 5;
x = 0,
y = -5;
y = -I;
x
=
= I.
-2, y
Example 3. Which natural numbers are represented by the form' x 2 + / ? The discriminant of the form is D = -4. Let D = {I, i} be the order with discriminant - 4, which is contained in the field R( J=1). Since conditions (7.25) and (7.26) are satisfied only by a = c = 1, b = 0, only one reduced module belongs to the order D. This means that al1 modules which belong to the order D are similar, and hence every binary form with discriminant -4 is equivalent to the form x 2 + y2. But equivalent forms represent the same numbers, and hence by Theorem 6 the form x 2 + y2 represents the number m if and only if there is a module A c: D which belongs to the order D and has norm m. If such a module exists, then for some s, a, b, c we have m = as 2 ,
D = -4 = b 2
Here the number b must be even, b Z2
=
-
4ac,
(a, b, c) = I.
2z, where z satisfies
== -1 (mod a).
(7.27)
NUMBERS BY DECOMPOSABLE FORMS
152
[Chap. 2
Conversely, if (7.27) holds for some a = mls 2 ; that is, if Z2 = -1 + ac, then, as is easily seen, (a, 2z, c) = I, and hence there is a module A c: .0, belonging to the order .0, and with norm m; that is, m is represented by the form x2 + y2. It is weIl known that the congruence (7.27) is solvable if and only if a is not divisible by 4 and not divisible by any prime number of the form 4k + 3. Since a must contain all prime factors which Occur in m with even exponent, we obtain that m is represented by the form x 2 + y2 if and only if prime numbers of the form 4k + 3 occur in it only with even exponent.
PROBLEMS
1. Find fundamental units for the fields R) vi 19) and R( vi 37). 2. Show that if d cO I (mod 8) (positive and square-free), then a fundamental unit for the order {I, vi d} is also a fundamental unit for the maximal order of the field R(vld). 3. Show that if the discriminant of some order I: in a quadratic field is divisible by at least one prime of the form 4n -+- 3, then any unit of I: has norm -+- I. 4. Let the rational integer m> 1 not be a perfect square. Show that in the continued fraction expansion of the sequence of entries has the form
vim
(hereql+1 =q,_I, i= 0, .__ , s - I). 5. Under the same assumptions, show that if P,I Q, is the convergent (s again denoting the period), then P, -+- Q, is a fundamental unit of the order {I, V;;;) (in the field
vm
R(v-;;;».
6. Let the modules M 1 and M 2 of a quadratic field have for coefficient rings the orders e'l and 1:'2 (using the notation of Section 7.2). Show that the product M I M 2 belongs to the order C" where I is the greatest common divisor of II and 12. 7. For any natural number f let \l, denote the group of modules in a given quadratic field which belong to the order C, (see Section 7.4). Show that if d is a divisor of f, then the mapping M -+ M Cd (M E 1\,) is a homomorphism from 11, to the group 11 d •
e
8. Let be a number of the maximal order I: = (I, w) of a quadratic field which is relatively prime to f. Show that the coefficient ring of the module M = {f,fw, 0 is 1:, and that M I: = C. Further, show the converse, that is, that any moduli: M which belongs to the and satisfies the property = 6 is of the form M = if, Iw, for some E D order which is relatively prime to f.
C,
Me
e)
e
el and e2 be two numbers of ~ which are relatively prime to f Show that el) ~ if. Iw, e2) if and only if Sel e2 (mod!) for some rational integer s.
9. Let {f, fw,
==c
10. Show that if M I and M 2 are any two full modules of a quadratic field (not necessarily belonging to the same order), then
11. Let h denote the number of classes of similar modules belonging to the maximal
Sec. 7]
REPRESENTATION OF NUMBERS BY QUADRATIC FORMS
153
order 0 of a quadratic field, and let h, denote the number of classes of similar modules belonging to the order 0, (of index f). Show that
where rtJ(f) is the number of residue classes in 0 modulo f which consist of numbers relatively prime to f(rtJ is analogous to the Euler ep-function), and is ~he index of the group of units of the order 0, in the group of units of the maximal order D. 12. A number I' of a real quadratic field is called reduced if it satisfies 0 < I' < 1 and its conjugate satisfies 1" < -I. If I' is reduced the module {I, y} is also called reduced. Using the notation of Lemma 1, show that the number I' is reduced if and only if
e,
-b+ YD < 2a < b+YD.
O
Deduce that the number of reduced modules which belong to a fixed order of a real quadratic field is finite. 13. Let I' be an irrational number of a real quadratic field such that 0 < I' < I. Set 1'1
1
=
-(sgn 1") - - n, I'
where the rational integer n is chosen so that 0 < 1'1 < I. Show that after a finite number of transformations {I, y} ---> {I. Y1}, the module {I, y} is transformed into a similar module which is reduced. Hence every class of similar modules (in the usual sense) of a real quadratic field contains a reduced module. 14. Let I' be a reduced number of a real quadratic field. Since sgn 1" = -I, the mapping I' ---> 1'1 of the preceding problem takes the form 1 1'1 = - - n, I'
Show that the number 1'1 is also reduced. It is called the right neighbor of the number 1', and I' is called a left neighbor of 1'1 . Show that any reduced number has one and only one left neighbor y. 15. Starting from a reduced number yo of a real quadratic field, we construct the sequence of reduced numbers 1'0,1'1,1'2' .. , , in which each number is the right neighbor of the preceding one. Show that there exists a natural number m such that yo = I'm, that is, that the sequence is periodic. If m is the smallest possible such integer, then the numbers yo, 1''' ... , Ym-1 are distinct. Such a finite sequence of reduced numbers is called a period. Show that two reduced modules {I, y} and {I, y*} are similar (in the usual sense) if and only if the reduced numbers I' and 1'* belong to the same period. 16. Find the number of classes of similar modules belonging to the maximal order of the field R( Y 10), 17. Show that all integral solutions of the equation 17x 2
+ 32xy + l4y 2= 9
are given by ±(l5 + 6v2)(3
(for all integers n).
+ 2v2)n =
±[17xn + (16 + 3Y2)y.]
NUMBERS BY DECOMPOSABLE FORMS
154
[Chap. 2
18. Which of the modules {I,
vIs},
{2,1
+ vB},
{3,
{35, 20 + vB}
viS},
of the field R(V 15) are similar to one another? 19. Find a full system of representatives for the classes of strictly equivalent primitive forms with discriminant 252. 20. What is the number ofclasses of properly equivalent primitive forms with discriminant 360? 21. Which prime numbers are represented by the forms x 2 + 5y 2 and 2x 2 + 2xy + 3y 2? 22. Find the integral solutions of the equations: 5x 2
+ 2xy + 2y 2 = 5x 2
(2)
2y 2 = 3,
-
80x 2
(1)
26,
-
y2
= 16.
(3)
23. Show that the equations
+ 34xy + 22y 2 = 2 5x + 16xy + 13y 2 =
13x 2
23,
(1)
23
(2)
have no integral solutions. 24. Let y be a number of an imaginary quadratic field which satisfies 1m y> 0, -t < Re y"; t, but which is not reduced. Set y, = O/y) + n, where the rational integer n is chosen so that -t < Re y,"; t. If y, is not reduced, determine y2 = -(1/y,) + nl analogously, etc. Show that after a finite number of steps the module {I, y} is transformed into a similar module {I, y,} which is reduced. 25. Determine the coefficient rings of the modules {ll, 6 + 2iV2},
{2, 1 + iV2}, {4, iV2},
{2, iV2j.
Which of these modules are similar? 26. S'J.ow that all modules which belong to the maximal order of the field R( V=43) are similar.
CHAPTER
3
The Theory of
Divisibility
In Chapters 1 and 2 we saw how the solution of number-theoretic problems led to the consideration of broader questions in the theory of algebraic llumbers: Thus to find the integral representations of a rational number by a full decomposable form, we had to study the theory of units in orders of algebraic number fields. Many problems of number theory lead to another important question in the arithmetic of algebraic number fields, the question of decomposition of algebraic numbers into prime factors. In this chapter we shall construct a general theory of the decomposition of algebraic numbers into prime factors and will apply this theory to several problems in number theory. The results which we shall need from the theory of rings are given in the Supplement, Section 5. These results, along with the theory of finite extensions of fields which has already been used in Chapter 2, form the algebraic tools for this chapter. The problems of factorization are very closely connected with Fermat's (last) theorem. Historically, it was precisely the problem of Fermat's theorem which led Kummer to his fundamental work on the arithmetic of algebraic numbers. Therefore we shall start with an exposition of the first results of Kummer on Fermat's theorem as an introduction to the general theory of decomposition of algebraic numbers into prime factors. 155
156
THEORY OF DIVISIBILITY
[Chap. 3
1. Some Special Cases of Fermat's Theorem
1.1. The Connection between Fermat's Theorem and Decomposition into Factors ,1
The proposition, stated by Fermat, is that the equation
x"+y"=z" has no nonzero solutions in rational integers, x, y, z when n > 2. It is clear that if Fermat's theorem is proved for some exponent n, then it is also automatically proved for all exponents which are multiples of n. Since any integer n > 2 is either divisible by 4 or by some odd prime, we may limit our consideration to the cases where n = 4 or n is an odd prime. For n = 4, an elementary proof was given by Euler. We thus consider only
x'
+ y' = Zl,
(1.1)
where the exponent 1 is an odd prime number. We may clearly assume that the numbers x, y, z in (1.1) are relatively prime. For those values of 1for which a proof of Fermat's theorem has been found, the proof is usually divided into two parts: first, it is shown that (1.1) has no solution in integers x, y, z, which are not divisible by I, and second, that (1.1) has no solution in integers x, y, z precisely one of which is divisible by I. These two are called the first and second cases of Fermat's theorem. From the extant proofs of various cases of Fermat's theorem we can deduce that the principal difficulties in the first and second cases of Fermat's theorem are roughly the same, although the techniques used in the first case are more simple. Here we consider only the first case of Fermat's theorem. The connection between Fermat's theorem and the problem of the decomposition of algebraic numbers into prime factors is given in the following simple observation. If' denotes a primitive lth root of 1, then (1.1) may be written 1-1
TI (x + 'k y ) = Zl.
(1.2)
k=O
If a product of pairwise relatively prime rational integers is an lth power, then each of the factors is an lth power. The factors on the left side of (1.2) belong to the algebraic field R(O of degree 1 - lover R. (It is easily seen that the polynomial t ' - 1 + t ' - 2 + ... + t + 1, 1 a prime, is irreducible over the field of rational numbers; see Problem 6 or Theorem 1 of Section 2, Chapter 5.) Consider in the field R(,) the order .0 = {I, " ... , ,'- 2} [by Theorem 1 of Section 5, Chapter 5, .0 is the maximal order of the field R(Ol. Assume that in the ring.o factorization into primes is unique. Then for any IX ED, IX # 0, we have a factorization
Sec. 1 ]
SOME SPECIAL CASES OF FERMAT'S THEOREM
157
where e is a unit of the ring .0, the prime numbers 7tl' '" , 7t r are pairwisenonassociate, and the exponents aI' ... , a r are uniquely determined (see Section 2.2). Then every prime 7t which occurs in the factorization of Zl occurs with an exponent divisible by I. But we shall show below that when we are dealing with the first case of Fermat's theorem, the numbers x + y (k = 0, 1, ... , I - 1) are pairwise relatively prime. Hence if we represent x + y as a product of prime factors, each prime will occur with an exponent divisible by I. This means that, up to a unit factor, x + y is an lth power and, in particular, (1.3) x + = ea l ,
,k
'k
'k
'y
where e is a unit of the ring .0 and ex E .0. Since I is odd, we may also write (1.1) in the form Xl
+ (_Z)l = (_y)l,
and analogously we obtain (1.3')
Equations (1.3) and (1.3') lead, in a fairly easy manner, to a contradiction. When this is done, we shall prove that (1.1) has no solution in integers x,)I, z not divisible by I (under the hypothesis made on the ring .0). After this introduction we now establish some auxiliary facts concerning the ring .0.
1.2. The Ring Z[,]. Lemma 1. In the ring .0
= Z[,] the number I - , is prime, and I has the
factorization 1= e*(1 -
0 /- 1
(1.4)
where e* is a unit in .0. Proof. In the decomposition tl-
1
+ t l - 2 + ... + t + I
= (t - O(t - ,2) ... (t _
,1-1),
set t equal to I. Then (1.5)
If ex = r(O is any number of the field R(O [here r(t) is a polynomial with rational coefficients], then the numbers (1
~
k
~
I - I)
(1.6)
are the images of ex under the isomorphisms of the field R(O into the field of all complex numbers. In the terminology of Section 2.3 of the Supplement,
I 158
I
[Chap. 3
the numbers (1.6) are the conjugates of a, and thus N(a) = particular, for s ¥= 0 (mod I), we have I-I
N(1 - 'S)
=
TI (1 -
k=1
TIL: ~ r(,k).
In
I-I
,kS)
=
TI (1 -
,k) = I.
k=1
,2, ... ,
From this it follows that 1 - " 1 1 - , I - I are primes in the ring .0. Indeed, if 1 = afJ, then N(a)N(fJ) = I, and then either N(a) = 1 or N(fJ) = 1; that is, one of the factors is a unit (Theorem 4 of Section 2, Chapter 2). Taking norms in the equation
'S
'S,
we obtain N(E s ) = 1, and thus Es is a unit in .0. Hence the numbers 1 _ for s ¥= 0 (mod I), are associate with 1 - ,. The decomposition (1.4) now follows from (1.5) and (1.7). Lemma 2. If the rational integer a is divisible by 1 - , (in the ring .0), then it is also divisible by I. Proof. Let a = (1 - ')a, where a ED. Taking the norm of both sides, we obtain d- I = IN(a), where N(a) is a rational integer. Since 1 is prime, then a is divisible by I.
,I ,
i
Lemma 3. The only roots of 1 contained in the field R(,) are those whose degree divides 2/.
i
j
I
~
I
II ~ :
,
1
I
III.
I
THEORY OF DIVISIBILITY
!
:
r/I' I:
i~ j
I
Proof. Any root of 1 in R(,) clearly lies in the maximal order. By Theorem 2 of Section 3, Chapter 2, the set of all roots of I in R(O forms a finite cyclic group. Let m denote the order of this group and let 1'/ be any primitive mth root of 1. Since - , belongs to R(O and is a root of degree 21 of 1, m is divisible by 2/. In Section 2 of Chapter 5 (corollary of Theorem 1), it is proved that the degree of the field R(I'/) over R equals cp(m), where cp(m) is Euler's function. Set m = l'mo, (mo, I) = 1 (r ~ 1, mo ~ 2).
Since R(I'/) is contained in R(O, and the latter field has degree 1 - 1, then
From this inequality it follows that r = 1 and cp(m o) = 1. Since cp(mo) mo ~ 2 only when mo = 2, m = 2/, and Lemma 3 is proved.
= 1 for
Lemma 4 (Kummer's Lemma). Any unit of the ring.o is a product ofa power of' with a real unit.
Sec. 1 ]
SOME SPECIAL CASES OF FERMAT'S THEOREM
159
Proof. Let
e=a O+a l '+···+a l _ 2,1-2=rm
(aiEZ)
be any unit of .0. It is clear that the complex conjugate e = r(C l) = r(,I-I) is also a unit of .0. Consider the unit 11 = eje E .0. By (1.6) any conjugate of 11 has the form
Since r(,k) and r(C k ) are complex conjugate, then Iak (ll) I = 1 (k = 1, ... ,1- 1). By Theorem 2 of Section 3, Chapter 2, 11 is a root of 1, and then by Lemma 3, 11
= ±,o.
We shall now show that the plus sign always occurs on the right. For otherwise we would have ·Consider this equation as a congruence in the ring .0 modulo A = I - ,. Since' == 1 (mod A), all powers of' are congruent to 1 modulo ).. and we have e ==
e == a o + al + ...
+a l - 2 = M (mod A),
which means that M == -M (mod A), or 2M == 0 (mod A). By Lemma 2 2M == 0 (mod I),
M== 0 (mod I),
M == 0 (mod A),
so that e == 0 (mod A),
which contradicts the fact that e is a unit of the ring .0. Thus e=
'''e.
We now take an integer s so that 2s == a (mod I). Then e = e can be written in the form
,2s
'0 = ,25 and the equation
This shows that the unit '1 = ens is real. Hence we have represented e as the product of and the real unit '1, and the lemma is proved.
'5
Lemma 5. Let x, y, m, n be rational integers, m "I- n (mod I). Then x + ,my and x + y are relatively prime if and only if x and yare relatively prime and x + y is not divisible by I.
,n
160
THEORY OF DIVISIBILITY
[Chap 3
Proof. If x and y have a common divisor d> I, then x + ,my and x are both divisible by d. If x + y is divisible by I, then x + ,my and x have a common divisor I - , (which is not a unit). Indeed,
x
+
,my =
X
+Y +
+ ,ny + ,ny
(,m - I)y
= (x + y) - (l - Oemy == 0 (mod I - O. Thus the necessity of both conditions is proved. To prove their sufficiency we shall show that there exist numbers ¢o and '10 in .0 such that
Consider the set A of all numbers of the form
. "
,
:
where ¢ and '1 independently run through all numbers of .0. It is clear that if a and fJ belong to A, then any linear combination a¢' + fJ'1' with coefficients ¢', '1' ED also belongs to A. We need to show that I belongs to A. From
!
(x (x
I
+ ,my) _
+ ,myKn -
(x
(x
+ ,ny ) = ,m(l
+ ,nyKm =
- C-m)y = ,men_m(l - Oy,
_,m(l - ,n-m)x
= _,men_m(l - ')x,
we conclude that (l - Oy and (I - Ox belong to A (since ,men_m is a unit in the ring .0). Since x and yare relatively prime, there exist rational integers a and b such that ax + by = I, and therefore
j' I
I, i!
Further,
'- !
l'
IfI: .
!
I
i
I:I, I,I
I I I i!
I I,
I
and thus x + YEA. Since I is divisible by I - " then lEA. But we are also assuming that x + y and I are relatively prime. Hence for some rational integers u and v we have (x + y)u + Iv = I, so that lEA. Lemma 5 is proved. 1.3. Fermat's Theorem in the Case of Unique Factorization Theorem 1. Let I be a prime integer and let' be a primitive Ith root of I. If decomposition into prime factors is unique in the order .0 = Z[,] = {I, " ... , ,1- 2} of the field R(O, then the equation
has no solution in integers x, y, z not divisible by I.
,I
Sec. 1 ]
SOME SPECIAL CASES OF FERMAT'S THEOREM
161
Proof. The prime 3 will play a special role in our proof, so we consider the case f = 3 separately. We shall show that not only the equation x 3 + y3 = Z3, but also the congruence
has no solution in integers not divisible by 3. For assume that there is such a solution of this congruence. But from the congruence x 3 + y3 == Z3 (mod 3) it easily follows (from the little Fermat theorem) that x + y == z (mod 3); that is, Z = x + Y + 3u, and hence x
3
+ y3 == (x + Y + 3U)3 ==
x 3 + y3
+ 3x 2Y + 3xy2 (mod 9),
and 0== x 2y
+ xy2 =
xy(x
+ y) ==
xyz (mod 3).
Thus one of the numbers x, y, z is divisible by 3, and our assertion is proved. Now let f ~ 5. We prove the theorem by contradiction, assuming that for some rational integers, x, y, z, pairwise relatively prime and not divisible by f, we have x' + l' = Zl, which we also write in the form (1.2). Since x + y == x' + l' = Zl "I- (mod f), and x and yare relatively prime, then by Lemma 5, all the numbers x + (ky (k = 0, 1, ... ,f- 1) are pairwise relatively prime. Then, as has already been shown in Section 1.1, from the unique factorization of the numbers appearing in (1.2) it follows that
°
x
+ (y = eal,
(1.3)
x - (z = ela l ' ,
(1.3')
where e and e l are units in the ring .0. We have already remarked that (1.3) and (1.3') lead to a contradiction. We now show that this contradiction even arises from the corresponding congruences modulo f in the ring .0. Let a = a o + al ( + ... + a ,_ 2(1-2 with ao , ... , a l - 2 rational integers. Then al == ao l + all(l
+ ... + al_ 2(1(l-2) == M
(mod I),
where M = ao + al + ... + a l -2' By Kummer's lemma the unit e can be represented in the form e = (5'1, where '1 is a real unit. Hence from (1.3) we obtain the congruence
with the real number ¢ ED. We may also write this congruence in the form
C
5
(x
+ (y) == ¢ (mod l).
(1.8)
We now note that for any a ED the complex conjugate fi. also belongs to .0. If we have the congruence a == f3 (mod f), then a - f3 = fy, so that fi. - /3 = fy
THEORY OF DIVISIBILITY
162
[Chap. 3
and hence Ci. == /3 (mod I). Passing now from the congruence (1.8) to its complex conjugate, we obtain
'S(x
i
.1
:I
But ~
= ¢,
+ ely) == ~ (mod I).
(1.9)
and therefore from (1.8) and (1.9) it follows that
eS(x
+ 'y) == 'S(x+
ely) (mod I),
or (1.10) It is clear that a number of .0, represented in the canonical form ao + al ' + .. , + a l _2,'-2, is divisible by I if and only if all coefficients ao , ... , a, -2 are divisible by I. If the exponents
s - 1,
s,
) I.
I'
I ~
ii
-s,
I-s
(1.11)
are pairwise-noncongruent modulo I and also noncongruent to 1- 1, then the number on the left side of the congruence (1.10) is in canonical form and hence all its coefficients are divisible by I. Thus in this case x == 0 (mod I) and y == 0 (mod I), which is impossible, since x and yare relatively prime (and also not divisible by I). Consider the case when the left side of (1.10) is not in canonical form, that is, when one of the integers (1.11) is congruent to 1- 1 modulo I, or two of them are congruent modulo I. One of the exponents (1.11) will be congruent to I - 1 modulo I only in the following cases:
j
.! :
J
f!
Ii
;
~"
s
I s-1
-s
1- s
---- -- - ---
2
Ii
/- 1
/- 2
1
0
/ -1
0
1
I
1
0
/- 1
0
2
1
/-2
/- 1
, I:
i'I
I
J
I
!i
We see that in each of these cases only one of the exponents is congruent to 1- 1 (since I ~ 5). To write the left side of (1.10) in canonical form, we must use the equation
I'
i
!i
If
Substituting this expression in the left side of (1.10), we replace the term with exponent I - 1 by a sum of the monomials 1, " ... , ,'- 2 each with coefficient ± x or ±y. Since the number of these terms is equal to I - 1 ~ 4 (since I ~ 5), after we combine terms in which the exponent of' is the same, there will be at least one term in which the coefficient is ±x or ±y. But this again would
Sec.1]
SOME SPECIAL CASES OF FERMAT'S THEOREM
163
imply that x == 0 (mod I) or y == 0 (mod I), which is impossible', since we have assumed that x and yare not divisible by I. We now need only consider the case when two of the exponents in (1.11) are congruent modulo I. The congruences s == s - 1 (mod I) and -s == 1 - s (mod I) are clearly impossible. If s == -s (mod I) or s - 1 == 1 - s (mod I), then we have s == 0 (mod I) or s == 1. (mod I), and we have again the cases considered above where s - 1 == 1- 1 (mod I) or -s == 1- 1 (mod I). In the remaining (equivalent) possibilities s == 1 - s (mod I) and s - 1 == -s (mod I), we have s == (l + 1)/2 (mod I). In this case the congruence (1.10) takes the form (x - y)(l+ 1)/2
+ (y
_
X)((I-l)/2
== 0 (mod I).
Since the left side of this congruence is in the canonical form [the exponents (l + 1)/2 and (l - 1)/2 are neither congruent to each other nor to I - 1], it follows that x == y (mod I). Analogously, we deduce from (1.3') that
x == -z (mod I). Then from the congruences x + y == x' + y' = Zl == z (mod I) it follows that 2x == -x (mod I) or 3x == 0 (mod I). Since 1# 3, x == 0 (mod I) and we again have a contradiction. This completes the proof of Theorem 1. By using more subtle arguments involving the integers of the field R«(), Kummer showed that if the prime I satisfies the conditions of Theorem 1, then the second case of Fermat's theorem also holds for the prime I. We shall generalize Theorem 1 to a wider class of exponents in Section 7.3. For this wider class of exponents we shall prove the second case of Fermat's theorem in Section 7.1 of Chapter 5. We make some remarks about Theorem 1. Remark 1 . The main part of the proof of the theorem is the verification of the impossibility of certain congruences modulo I. Of course it does not follow from this that the congruence x' + y' == Zl (mod I) is impossible, since this congruence is equivalent to x + Y == z (mod I), which always has solutions in integers not divisible by I. Moreover, it can be shown that, for example, when 1= 7, the equation x' + y' = Zl, when considered as a congruence, has, for any modulus, solutions not divisible by 7. Thus the proof of the unsolvability of (1.1) is achieved first by using unique factorization in the ring Z[(] to obtain Equations (1.3) and (1.3'), and then by applying the theory of congruences to these latter equations.
164
THEORY OF DIVISIBILITY
[Chap. 3
Remark 2. It is clear that the methods which we have applied in this section to the solution of Fermat's theorem can also be applied to analogous problems, by using other algebraic number fields instead of the field R«() (Problem 2). Remark 3. If we wish to apply the theorem to some particular prime I, we discover that this cannot be done, since we have no means for determining whether factorization into primes is unique for the field R(O.
Hence we come to the following two basic problems of number theory: (1) In which algebraic number fields K is decomposition into prime factors unique? (2) What are the arithmetic properties of those fields K in which decomposition into prime factors is not unique?
c
It
r
r
z e
PROBLEMS
1. Show that the congruence x 3 + y3 == Z3 (mod 52) has no solution in rational integers x, y, z not divisible by 5. Let w be a primitive cube root of I. Assume it known that decomposition into prime factors is unique in the field R(w). Show that the equation x 3 + y3 = 5z 3 has no solution in rational integers x, y, z not divisible by 3. 3. Let I be a prime number, ~ a primitive Ith root of I, x and y rational integers, and d the greatest common divisor of x and y. If x + y ~ 0 (mod I) set I) = d, and if x + y == 0 (mod I) set I) = d(l - ~). Show that I) is a common divisor of the numbers x + ~my and x + ~.y which is divisible by all other common divisors of these numbers. 4. Show that in the order {l,~, ... , ~1-2 of the field R(~) a product 1Xf3 is divisible by I - ~ if and only if IX or f3 is divisible by I - ~. 5. Using the concept of congruence of integral polynomials (Section I.l of Chapter I), show that 1'-1+ ... +1+ I ==(1_1)1-1 (modI).
6. Show that the polynomial 11- 1 + ... + 1 + I is irreducible over the field of rational numbers by considering congruence of integral polynomials modulo 12.
2. Decomposition into Factors
2.1. Prime Factors In Section 1 we saw how a problem of number theory can reduce to a question of decomposition into prime factors in some order of an algebraic number field. We shall see other such examples later. We now consider the general problem of decomposition into prime factors. In order to speak of decomposition into primes, we must be dealing with a
I
v
e f tl v
n a
E r
o 2
p
fI
Sec. 2]
DECOMPOSITION INTO FACTORS
165
fixed ring .0, the elements of which we are decomposing into factors. We formulate our problem in the general case where .0 is any commutative ring without divisors of zero and possessing a unit element. In the future these conditions will be assumed without special mention. Definition. An element n: of the ring.o, nonzero and not a unit, is called prime if it cannot be decomposed into factors n: = exfJ, neither of which is a unit in .0.
Thus an element is prime if it is divisible only by units and associates. In some rings there are no prime elements and hence not every element of a ring can be represented as a product of primes. For example, let .0 be the ring of all algebraic integers. Any ex # 0, which is not a unit, has the factorization ex = -J~ -J~, in which both factors lie in .0 and are nonunits. Thus every nonunit of.o has nontrivial factorizations and there are no prime elements in .0. For examples of rings in which decomposition into prime factors is always possible, consider orders in algebraic number fields (it is these rings which will interest us most). We shall call prime elements in orders prime numbers. Theorem 1. In any order .0 of an algebraic number field K, every nonzero element which is not a unit can be represented as a product of prime numbers. Proof. By Theorem 4 of Section 2 of Chapter 2, the units of .0 are characterized by having norm ± 1. We prove the theorem by induction on the absolute value IN(ex) I of the number ex ED. If the number ex is itself prime, there is nothing to prove. Otherwise ex = fJy, where fJ and yare numbers of.o which are not units, so that
I < IN(fJ) I < IN(ex)I,
1 < IN(y)1
< IN(ex)l.
By the induction assumption, fJ and yare products of prime numbers of the ring.o. But then since ex = fJy, the number ex is also a product of prime numbers of the ring.o. Hence Theorem 1 is proved.
2.2. Uniqueness of Factorization We assume now that in the ring .0 decomposition into prime factors is possible, and we turn to the question of the uniqueness of such factorizations. Definition. We shall say that decomposition into prime factors in the ring .0 is unique if for any two decompositions
I I
166
THEORY OF DIVISIBILITY
[Chap. 3
the number of factors is always the same (r = s) and for suitable indexing of the factors, the prime elements 7t; and 7t;' are associate (i = I, ... , r). In the decomposition a = 7t 1 ••• 7t" associate prime elements can be made equal by multiplying by a suitable unit. We may then group equal factors into powers and obtain a factorization a
= 1>7t/'
... 7t m
km
,
in which the prime elements 7t 1 , ... , 7t m are pairwise-nonassociate and I> is a unit of the ring D. In case factorization is unique, the prime elements 7t 1 , ••• , 7t m are determined up to associates and the exponents k l , ... , k m are uniquely determined. The classical example of a ring with unique factorization is the ring of rational integers. It is far from true that decomposition into prime factors is unique in all rings. Thus the result of Problem I shows that among orders in algebraic number fields, unique factorization can occur only for maximal orders. Unique factorization for the ring Z of rational integers follows from the theorem on division with remainder, which asserts that for any a and b # 0 of Z there exist integers q and r, such that a = bq + rand r < Ibl. If in a ring.c there is an analog of division with remainder, then we can prove uniqueness of factorization in .0 just as in Z. 1 I'
Definition. We say that the ring .0 has the division with remainder property if there is a function Iiall on nonzero elements a E .0 which takes nonnegative integral values and is such that the following conditions hold:
I I
I
iI
(I) If IX # 0 is divisible by fl, then lIall ~ IIflll· (2) For any elements IX and fl # 0 of .0, there exist I' and p such that
l
! I
a = fly + p, where either p = 0, or Euclidean.
I
\, !
I:
Consider the proof of unique factorization in the ring of rational integers.
I ,
Ilpll < IIflll. The ring .0 itself is then called
j
It uses, in addition to the general properties of rings, only the theorem on
division with remainder. Therefore, by translating this proof, we obtain the following result. Theorem 2. In every Euclidean ring, factorization into primes is
unique. Con sider as an example the maximal order .0 of the quadratic field R( -J~). We shall show that .0 has the division with remainder property with lIall = N(IX). Let a and fl # 0 be arbitrary numbers of .0. Then
~=u+vFI,
Sec. 2]
DECOMPOSITION INTO FACTORS
167
where u and v are rational numbers and we choose rational integers x ann y so that
lu - xl If we now set y
1-.
~
Iv - yl
~!-.
= x + y -J-::}, p = a - fJ y, then from N
a)y = (u (P-
x)
2
+ (v
2
- y) ~
4:1 + 4:1 <
1
we obtain
and this proves our assertion. From Theorem 2 we concl ude that in the maximal order of the field R( -J - 1) factorization into primes is unique. In the same fashion we can prove uniqueness of factorization for certain other rings (see Problems 3, 4, and 7). It must be noted that there are rings in which factorization is unique which are not Euclidean. An example is the maximal order of the field R( -J -19). It folIows from Problem 6 that this ring does not have the division with remainder property. But it wilI folIow from Problem 11 of Section 7 that factorization is unique in this ring. Consider the maximal order of the real quadratic field R( -Jd). We obtain a division algorithm with remainder by using the absolute value of the norm only when d is one of the folIowing sixteen numbers: 2,
3,
5,
6,
7,
11,
13,
17,
19,
21,
29,
33,
37,
41,
57,
73.
2.3. Examples of Nonunique Factorization It is not difficult to construct examples in which the maximal order of an algebraic number field will not have unique factorization. Consider, for example, the field R( -J - 5). As was shown in Section 7.2 of Chapter 2, the numbers of the maximal order of this field are of the form a = x + Y where x and yare rational integers, and then N(a) = x 2 + 5y 2. In the ring.o the number 21 has the factorizations
-J=S,
(l)
21=3·7,
(2)
21 = (1
+ 2~)(1
- 2J~- 5).
We claim that alI terms on the right in (I) and (2) are prime. Suppose, for example, that 3 = afJ, where a and fJ are nonunits. Then since 9 = N(afJ) = N(a)N(fJ), we must have N(a) = 3. But this is impossible since the equation x 2 + 5y 2 = 3 has no integral solutions. In precisely the same manner we
THEORY OF DIVISIBILITY
168
[Chap. 3
could prove that the numbers 7, I + 2 -J - 5, I - 2 -J - 5 are also prime. Since the quantities ;I ± 2....; - 5 1±2J~ 3
I ill
7
I'
,!
! '
! I
I
are not contained in the ring .0, the numbers 3 and 7 are not associated with I + 2 -J - 5 and I - 2 -J - 5. Hence we see that the ring .0 contains numbers which allow essentially different decompositions into prime factors. The example of nonunique factorization in the maximal order of the field R( -J - 5) is not an unusual exception. Many such examples are easily found (see Problems 10 and 11). It might be thought that the phenomenon of nonunique factorization which we have discovered in algebraic number fields would make it impossible to construct a complete theory of the arithmetic of these fields, and that this would dash our hopes for deeper applications to the problems of number theory. But this not the case. In the middle of the last century, Kummer showed that, although the arithmetic of algebraic numbers was radically different from the arithmetic of the rational numbers, it could be developed in great depth, allowing strong applications to number-theoretic problems. The basic idea of Kummer is that if the maximal order .0 does not possess unique factorization, then the nonzero elements of .0 can be mapped into some new set, in which multiplication is defined and in which factorization into primes is unique. If a is any nonzero element of .0, then its image (a) under this mapping will factor uniquely into a product of primes, but these primes will lie not in the ring but in the new set. Unique factorization, in the sense of Kummer, is restored by virtue of the fact that some prime numbers (perhaps even all of them) are mapped onto nonprime elements of the new set, and therefore their images factor in a nontrivial fashion. Thus, in the example of the maximal order of the field R(-J - 5), there must exist objects VI' V2' P3 , V4 such that 3=PIP2,
7=P3P4,
1+2F5=P1P3,
1-2J-5=P2P4
(in these equations we do not distinguish between numbers and the new objects which correspond to them). The decompositions (1) and (2) now reduce to the decompositions
which differ only in the order of the factors. Kummer himself called these new objects ideal numbers. Now they are called divisors. In Section 3 we give a systematic exposition of the theory of divisors.
Sec. 2]
DECOMPOSITION INTO FACTORS
169
PROBLEMS
1. Show that if in the order 0 of the algebraic number field K decomposition into prime factors is unique, then 0 is the maximal order of thefieldK. And show, in general, that if 0 is a ring in which factorization into primes is unique, then 0 is integrally closed in its field of fractions. 2. Show that if an element a ¥= 0 of a Euclidean ring is divisible by {3, and a and (3 are not associate. then Iiall > 11{311. 3. Let \JR be a lattice in the complex plane, the points of which represent the numbers of the maximal order 0 of an imaginary quadratic field. Show that we obtain an algorithm for division with remainder in 0 by using the norm N(a) if and only if the translates of the unit disc (without boundary) by all vectors of the lattice \JR completely cover the plane. 4. Show that in the maximal order of the imaginary quadratic field R( Vd), an algorithm for division with remainder is obtained by using the norm if and only if d is one of the values -I, -2, -3, -7, -II. 5. Letd < 0 be square-free and not equal to -I, -2, -3, -7, - I I. Show that the norm of any integer of R( v'd), except 0 and ± I, is greater than 3. 6. Show that, except for the five fields indicated in Problem 4, the maximal order of an imaginary quadratic number field is never a Euclidean ring. (Hint: Carry out the proof by contradiction. Assume that there is a function Iiall on the elements of the maximal order 0 which satisfies the conditions given in Section 2.2. Among the numbers of 0 which are not units, choose y so that Ily [1 is as small as possible. Then any a E 0 will be congruent modulo y to one of the numbers 0, I, - I.) 7. Show that there exists an algorithm for division with remainder in the maximal order of the field R( v2t
v-
8. Show that in the maximal order of the field R( I) every odd rational prime p of the form 4k + 3 remains prime, while every odd rational prime p of the form 4k + I factors into p = 7T7T', where7T and 7T' are nonassociate primes. Find the decomposition of the number 2 into prime factors. 9. Let 0 be a ring with unique factorization. Show that for any two numbers a and {3 of 0 (not both equal to zero), there is a common divisor 0 which is divisible by all common divisors of a and {3 (0 is called the greatest common divisor of a and (3). 10. Show that in the maximal order of the field R(v -6) the following are essentially different prime factorizations:
55 = 5· II = (7 + v - 6)(7 - v - 6), 6=2'3= -(v-6)2. 11. Show that in the maximal order of the field R(V -23) the following are essentially different prime factorizations: .
6=2'3=
l+v-231-v-23 2 2'
27 = 3·3·3 = (2 + v - 23X2 - v - 23). Find all possible factorizations of the number 8 in this ring.
170
I:
[Chap. 3
3. Divisors
3.1. An Axiomatic Description of Divisors We consider an arbitrary commutative ring .0 (with unit element and without divisors of zero), and we shall try to clarify the idea mentioned in Section 2.3 of mapping the nonzero elements of the ring .0 into some new domain, in which decomposition into prime factors is unique. Our theory must clearly consist of two parts: the construction of some set :?fl of new objects in which decomposition into prime factors in unique, and the determination of the mapping of the nonzero elements of the ring.o into the set [0. We start with the first part. In order to be able to speak of decomposition into prime factors in [0, we must have an operation of multiplication defined in:?fl; that is, we must associate to each pair of elements of [0 a third element, their product. We shall require that this operation be associative and commutative. A set with such an operation is called a commutative semigroup. We shall further require that the set [0 contain a unit element, that is, an element e such that ea = a for all a E [0. In a commutative semigroup [j} with unit e we may speak of divisibility of elements; an element a E:?fl is divisible by b E [0 if there exists aCE [0 such that a = be (we also say that b divides a). An element P E [0, distinct from e, is called prime if it is divisible only by itself and by the unit e. We further say that the semigroup [0 has unique factorization into prime elements if every element a E [0 can be represented as a product of prime elements
11
I
THEORY OF DIVISIBILITY
" I,'
I
I, ,I!:
a = PI ... Pr
I'
(r
~
0),
! :
I
I
I
and this decomposition is unique up to the order of the factors (for r = 0 this product is set equal to e). Thus uniqueness of factorization implies that e is the only invertible element (divisor of e) in the semigroup:?fl. It is clear that a semigroup with unique factorization is completely determined by its set of prime elements (essentially by the cardinality of this set). As a simple example of a semigroup with unique factorization we may take the set of all natural numbers under the operation of multiplication. In a semigroup with unique factorization, any two elements have a greatest common divisor (a common divisor which IS divisible by all common divisors of the two elements), and also a least common multiple. Two elements of [0 are called relatively prime if their greatest common divisor is equal to e. We note some elementary properties of divisibility in [0: If a product ab is divisible by e and a is relatively prime to e, then b is divisible bye; if e is divisible by the relatively prime elements a and b, then e is divisible by their product ab;
·t
Sec. 3]
DIVISORS
171
if a product ab is divisible by a prime element p, then at least one of the factors is divisible by p. We now pass to the second part of the theory, the conditions which must be satisfied by the mapping from the ring .0 to the semigroup f:0. Let .0* denote the set of all nonzero elements of the ring.o. Since we have assumed that .0 does not have divisors of zero, the set .0* is a semigroup under the operation of multiplication. Suppose that we have a mapping of the semigroup.o* into the semigroup :?fl which has unique factorization. We denote the image of an element ex E .0* by (ex). It is clear that we can use the semigroup f:0 to study the multiplicative structure of the ring .0 only if under the mapping ex ~ (ex), the product of two elements in .0* is mapped onto the product of their images in f:0, that is, only if (ex{3) = (ex)({3) for all ex and {3 in .0*. Hence we must assume that the mapping ex ~ (ex) is a homomorphism of the semigroup .0* into the semigroup f:0. If ex is divisible by fJ in the ring .0, it will then follow that (ex) is divisible by ({3) in the semigroup f:0. In order that divisibility in .0 should closely correspond to divisibility in :?fl, we shall also demand the converse: If (ex) is divisible by ({3) in f:0, then ex is divisible by {3 in .0. We shall also say that the element ex # 0 of.o is divisible by the element a E f:0, and shall write alex, if (ex) is divisible by a in the semigroup f:0. We shall suppose 0 to be divisible by all elements of f:0. If ex E .0*, the set of all elements of.o which are divisible by ex is closed under addition and subtraction. It is natural to assume that this property is preserved for divisors a of the semigroup f:0. Our last requirement is that f:0 not contain any "unnecessary" elements. By this we shall mean that distinct elements of f:0 must not divide precisely the same elements of .0*. We thus give the following definition. Definition. By a theory of divisors for the ring.o we shall mean the giving of some semigroup f:0 with unique factorization, along with a homomorphism ex ~ (ex) of the semigroup .0* into f:0, satisfying the following conditions:
(1) An element ex E.o* is divisible by {3 E.o* in the ring .0 if and only if (ex) is divisible by ({3) in the semigroup f:0. (2) If ex and {3 of.o are divisible by a E f:0, then ex ± {3 are also divisible by a. (3) If a and b are two elements of:?fl and the set of all elements ex E .0 which are divisible by a coincides with the set of all elements {3 E .0 which are divisible
by b, then a = b. The elements of the semigroup :?fl are called divisors of the ring .0, and divisors of the form (ex), ex E .0*, are called principal divisors. The unit element e of the semigroup is called the unit divisor.
THEORY OF DIVISIBILITY
172
[Chap. 3
Condition (1) in the definition of a theory of divisors clearly implies the following assertion: The equality (a) = ({3) holds if and only if a and {3 are associate in the ring .0. In particular, units e of the ring .0 are characterized by (e) = e. We shall denote a theory of divisors for the ring .0 by .0* --+:?fi. Our definition of a theory of divisors only fixed what we shall mean by such a theory. It does not at all guarantee the existence or uniqueness of the homomorphism .0 --+ [0. In the next section we consider the question of the uniqueness of a theory of divisors, assuming that one exists, and in Section 3.3 we indicate an important necessary (but not sufficient) condition for existence. The existence of a theory of divisors for maximal orders in algebraic number fields will be proved in Section 5 (Theorem 3 implies that such a theory does not exist for nonmaximal orders). 3.2. Uniqueness Theorem 1. If a ring.o has a theory of divisors, then it has only one. More precisely, if we have two homomorphisms .0* --+ [0 and .0* --+ [0', satisfying all requirements of the definition, then there is an isomorphism [0 ~ :?fi' under which the principal divisors in :?fi and :?fi' which correspond to a given element a E .0* are identified. Proof Let.o* --+ [0 and .0* --+ [0' be two theories of divisors for the ring .0. Let P E [0 and P' E:?fi' be prime divisors. Denote by Pand p' the sets ofelements
of the ring which are divisible by P and by P' (with respect to the theory .0* --+ [0 for P, and with respect to .0* --+ [0' for p'). We now show that for any prime divisor p' E [0' there is a prime divisor p E [0 such that V c V'. Assume that this is not the case, that is, that p ¢ p' for all prime divisors p E [0. From condition (3) it easily follows that any divisor must divide a nonzero element of the ring.o. Choose in D an element {3 ¥- 0 which is divisible by p', and decompose the divisor ({3) E [0 into prime factors: ({3) =
p/l ... p/r
(Pi' ... , Pr are prime divisors of the semigroup [0). Since we have assumed that Pi ¢ p', then for each i = 1, ... , r there is an element Yi ED which is divisible by Pi but not divisible by p'. The product Y = Yl kl ••. Yr kr is divisible
by Plk 1 ••• p/ and this means, by condition (1), that Y is divisible by {3 in the ring.o. But then Y must be divisible by p'. Thus we have a contradiction, since the product Yl kl ••• y/ r cannot be divisible by p', since p' is prime and does not divide any of the Yi. Hence for any prime divisor p' E [0' there is a prime divisor p E [0 such that r
,
Sec. 3]
DIVISORS
173
pcp'. By symmetry, there is a prime divisor q' E [0' for which i1' c p . We shall show that q' = p', and hence i1' = P = p'. Indeed, by condition (3) there is an element ¢ in .0 which is divisible by q' and not divisible by q'p'. If we assume that q' ¥- p', then the element ¢ will not be divisible by p', which is impossible since q' c p'. Since (for given p' E [0') there is one and only one prime divisor p E :!fi such that p = p' [condition (3)], we obtain a one-to-one correspondence p ...... p' between prime divisors of [0 and prime divisors of [0'. This correspondence can clearly be extended (in. a unique manner) to an isomorphism :!fi ~ :!fi'. Namely, if PI ...... PI" ... , p, ...... p/, then
We now need only show that under this isomorphism, the divisors (a) E [0 and (a)' E [0' (for given a E .0*) correspond tD one another. Let p E:!fi and p' E [0' be corresponding prime divisors, and assume that they occur in the factorizations of (a) and (a)' with exponents k and I, respectively. From condition (3) it follows that there is an element nED which is divisible by p and not divisible by p2. Since p = p', the element n is also divisible by p'. The principal divisor (n) hence has the form (n) = pb, where b is not divisible by p. Now choose in .0 an element w which is divisible by bk and not divisible by bkp. Since p does not divide b\ then w is not divisible by p or by p'. Consider the product aw. Since a is divisible by p\ and w is divisible by b\ then aw is divisible by pkb k = (n k), and by condition (I) we have aw = nk'1, '1 ED. But p'ln, and hence aw is divisible by p'\ and since p' ,r w, then p'kl a . This means that in the factorization of the divisor (a)' E [0', the prime divisor p' occurs with exponent not less than k; that is, I ~ k. But by symmetry also k ~ I, and thus k = l. We have thus shown that if (a)=PIk1 ... p/r and PI ...... PI" ... 'P, ...... p/, then (a)' = PI,k p/k r , and this means that under the above isomorphism :!fi ~ :!fi', the principal divisors (a) E:!fi and (a)' E:!fi' correspond to each other. If the ring .0 has unique factorization, then we can easily construct a theory of divisors .0* -. [0, and in this theory all divisors will be principal. Indeed, break up the set of all nonzero elements of.o into classes of associate elements, and consider the set [0 of all such classes. For a E .0*, denote by (a) the class of elements associate with a. It is easily seen that under the operation of multiplication (a)(fJ) = (afJ), the set [0 becomes a semigroup with unique factorization, and that the mapping a -. (a), a E .0*, defines a theory of divisors for the ring .0. [The prime divisors in this theory are just the divisors of the form (n), where n is a prime element of .0.] By Theorem I any theory of divisors for this ring coincides with the one just constructed. Assume now the converse, that we have for some ring.o a theory of divisors .0* -. [0, in which all divisors of:!fi are principal. We now show that an element l
...
THEORY OF DIVISIBILITY
174
[Chap. 3
n :1= 0 of the ring .0 will be prime if and only if the corresponding divisor (n) is prime. Indeed, if (n) = P is a prime divisor and y divides n in the ring .0, then the divisor (y) must divide P (in the semigroup ~) and then, since P is prime, either (y) is equal to P or to the unit divisor c. In the first case y is associate with n, and in the second case y is a unit in .0, and this means that n is a prime element of the ring .0. Now let (a) be neither prime nor the unit divisor. Then (a) is divisible by some prime divisor P = (n), and a is divisible by the prime element n and is not associate with it. Hence a cannot be prime. We have shown that if every divisor is principal, then the element n is prime if and only if the divisor (n) is prime. Let a be any element of .0*. Ifwe have the factorization
(3.1)
(a) = P1 ... Pr
in Pr
!!)
(the prime divisors Pi are not necessarily distinct), and if PI then in the ring .0 we have the factorization
= (11: r ),
= (n 1 ),
... ,
(3.2) where e is a unit of the ring.o. Since any factorization of the form (3.2) induces a factorization of the form (3.1), we must have unique factorization in the ring .0. We have obtained the following result. Theorem 2. In order that the ring .0 have unique factorization, it is necessary and sufficient that .0 have a theory of divisors .0* -+ ~ in which every divisor of ~ is principal.
3.3. Divisors and Integrally Closed Rings We have already noted that not every ring has a theory of divisors. The existence of a homomorphism a -+ (a) which satisfies the requirements of a theory of divisors imposes strong restrictions on a ring. One such restriction is given in the following theorem. Theorem 3. If the ring .0 has a theory of divisors, then .0 is integrally closed in its quotient field K. Proof. Assume that the element
e of K satisfies an equation (a 1
, ... ,
an ED),
e
Ii I
I I ,'
1'1
but does not belong to .0. We represent it in the form = alP, where a E .0 and PE .0, and decompose the principal divisors (a) and (P) into prime factors. Since a is not divisible by Pin the ring .0 (we have assumed that rt .0), (a) is
e
Sec. 3]
DIVISORS
175
not divisible by (fl) [by condition (1)]. This means that some prime divisor p occurs in (fl) with greater exponent than in (a). Let p occur in (a) with exponent k ~ O. Since (fl) is divisible by pH I, we deduce by condition (2) that the right side of a"= -alfla"-I- ... -a"fl" is divisible by pk"+ I. But p occurs in (a") = (a)" with exponent kn, and thus a" is not divisible by pk"+I. This contradiction shows that ~ E ,0, and Theorem 3 is proved. Another necessary condition for the existence of a theory of divisors is given in Problem 1. Since the only orders in algebraic number fields which are integrally closed are the maximal orders, only maximal orders can possibly have a theory of divisors.
3.4. The Theory of Divisors and Valuations We now turn to the question of the practical construction of theories of divisors. We first assume that a theory of divisors ,0* -+ £0 exists for the ring ,0, and then proceed to clarify how this theory could be constructed. Taking an arbitrary prime divisor p, we can construct with it a function vp(a), which is similar to the p-adic valuation with respect to a prime p which was constructed in Chapter 1. Namely, for any a#-O of ,0, by vp(a) we denote the power to which p enters in the factorization of the principal divisor (a) into prime factors. Clearly, vp(a) is characterized by p.p(a)la
and
p.p(a) + l,r a.
Since zero is divisible by arbitrarily large powers of p, it is natural to set vp(O) = 00. It easily follows from the definition that vp(afl) = vp(a) vp(a
+ fl)
~
+ vp(fl),
min (vp(a), vp(fl))
(3.3) (3.4)
[for the proof of (3.4) we must use condition (2)]. The function vp(a) can be extended to the quotient field K of the ring ,0 in such a way that (3.3) and (3.4) still hold. For any ~ = a/fl E K (a, fl E D) set vp(~)
= vp(a) - vp(fl).
The value of vp ( ( ) dearly does not depend on the choice of the representation of ~ in the form ~ = a/fl. It is now easily verified that (3.3) and (3.4) still hold for the extended function vp • We shall now see what values the function vp(a) takes as a ranges through K.
176
I,
[Chap. 3
TH EORY OF DIVISI BI L1TY
Since the divisors P and p 2 are distinct, by condition (3) there is an element y E D which is divisible by p but not by p2. For this element we have vp(y) = 1. But then vp(/) = k for any integer k. Hence the function vp(C() takes on all rational integral values. Definition. Let K be any field. A function v(C(), defined for C( E K, is called a valuation of the field K, if it satisfies the following conditions: (1) v(C() takes on all rational integral values as C( ranges through the nonzero elements of K; v(O) = 00; (2) v( C(fJ) = v(C() + v(fJ); (3) v(C( + fJ) ~ min (v(C(), v(fJ)). We can now say that every prime divisor p of the ring D determines a valuation vp(C() of the quotient field K. It is easily seen that distinct prime divisors determine different valuations. For if p and q are distinct prime divisors, then by condition (3) the ring D contains an element y divisible by p and not divisible by q. But then vp(y) ~ 1 and vq(y) = 0, and hence vp i= vq . All valuations of the field K of the form vp clearly satisfy
vp(C()
~
0
for all
C(
E
D.
(3.5)
In terms of valuations we can give a simple expression for the factorization of the principal divisor (C(), C( E D*. The prime divisors which enter into this decomposition are characterized by vplC() > O. Then we have (C()
=
TI
Pivpi(a),
(3.6)
i
where Pi runs through all prime divisors for which vpi(C() > O. We thus see that the semigroup f0 of divisors and the homomorphism D -+ g are completely determined by the set of all valuations vp of the field K which correspond to prime divisors p. For the set of all divisors and the operation of multiplication are determined as soon as the set of all prime divisors is known (each divisor is a product of prime divisors with nonnegative exponents, and when divisors are multiplied the corresponding exponents are added). But the prime divisors are in one-to-one correspondence with the valuations vp ' Finally, the homomorphism D* -+ f0 is determined from (3.6). This means that the concept of a valuation can be used as a foundation for the construction of a theory of divisors. We shall proceed to develop this idea. We must first answer the following important question: How can we characterize the set 91 of valuations of the field K which must be taken to construct a theory of divisors for the ring D? The product (3.6) can contain only a finite number of factors. Hence, for any fixed C( E D*, the condition vp(C() = 0 must hold for almost all valuations of the set 91 (by" almost all" is meant" for all but a finite number ").
Sec. 3]
DIVISORS
177
From (3.5) we see that for all v E 91 we must have v(C() ~ 0 if C( E ,o. Conversely, assume that for some ~ ¥- 0 of K we have v(~) ~ 0 for all v E 91. If we represent ~ in the form ~ = C((fJ (C(, fJ E D), then we have v(C() ~ v(fJ) for all v E 91. But this means that the principal divisor (C() is divisible by the principal divisor (fJ)· Condition (l) now implies that C( is divisible by fJ in the ring ,0; that is, ~ E ,o. We hence have a second necessary condition: The set ofvaluations 91 must be such that v(C() ~ 0 for all v E 91 if and only if C( is an element of the ring ,o. We now give another necessary condition for 91. Take any finite set of valuations v1 , ••• , Vm of 91 which correspond to the prime divisors P1' ... , Pm' If k 1 , ••• ,km are fixed nonnegative integers, we consider the divisor 0 = p/ ' ... Pm km • From condition (3) it follows that the ring ,0 contains an element C(j which is divisible by OJ = 0P1 '" Pi-1 Pi+1 ... Pm and not divisible by 0iPi (l ~ i ~ m). Consider the sum
Using condition (2), we easily find that C( is divisible by p/' and not divisible by p/'+l. Hence the set 91 must satisfy the following condition: For any valuations Vb •.• , Vm of 91 and for any nonnegative integers k b ••• , kit. there exists an element C( in the ring ,0 for which vj(C() = k j (l ~ i ~ m). The necessary conditions which we have found. on 91 will now be shown sufficient to construct a theory of divisors for the ring ,o. To prove this, take a semigroup q,& with unique factorization, with the prime elements in one-to-one correspondence with the valuations of the set 91. The valuation v E 91, which corresponds to the prime element P E q,&, will again be denoted by vp ' By the first and second conditions, for any C( E ,0* the product (3.6) will make sense [the exponents vplC() are nonnegative and almost all of them are zero]. Since v(C(fJ) = v(C() + v(fJ) the mapping C( -+ (C() will be a homomorphism from ,0* to q,&. From the second condition it easily follows that C( is divisible by fJ in the ring ,0 if and only if v(C() ~ v(fJ) for all v E 91. Hence condition (l) is satisfied. Condition (2) is implied by the inequality v(C( ± fJ) ~ min (v(C(), v(fJ)). If 0 and b are two different elements of q,&, then some prime element P occurs in their factorizations with different exponents, say, k and I. Let k < I. By the third of the above conditions there is in ,0 an element C( which is divisible by 0 and for which vp(C() = k. But then C( is not divisible by b. This shows that condition (3) is also satisfied. Hence the homomorphism ,0* -+ q,& gives us a theory of divisors for the ring ,o. We formulate the results which we have obtained. Theorem 4. Let ,0 be a ring with quotient field K, and let 91 be a set of valuations of K. In order that the valuations of 91 induce a theory of divisors on ,0 it is necessary and sufficient that the following conditions hold:
I
'I'
I
II, ,
178
THEORY OF DIVISIBILITY
[Chap. 3
(1) For any rx #- 0 of ,0, v(rx) = 0 for almost all valuations v E 91. (2) An element rx of K belongs to ,0 if and only if v(rx) ~ 0 for all v E 91. (3) For any finite set of distinct valuations Vl' ... , Vm of 91 and for any set of nonnegative integers k l , ... , k m , there is an element rx E ,0 for which
I'
Henc.e the construction of a theory of divisors for the ring ,0 is reduced to the construction of the corresponding set 91 of valuations of its quotient field K. We shall not enter here into the determination of those integrally closed rings for which a theory of divisors can be constructed (see, for example, the book "Modern Algebra" by van der Waerden, Section 105, Ungar, New York, 1950). In Section 4 we show that if 0 is a ring with quotient field k and ,0 is the integral closure of 0 in some finite extension field K of k, then if 0 has a theory of divisors, so does D. Since the ring Z has a theory of divisors (being a ring with unique factorization), then we will have proved that there is a theory of divisors for the maximal order of any algebraic number field. The set of valuations 91 of the field K which must be taken to construct a theory of divisors depends essentially on the ring ,0, and, in general, this set will not consist of all valuations of the field K (Problem 6). It can even happen (Problem 7) that condition (1) of Theorem 4 will not hold for the set of all valuations of the fieldK. We now show, however, that in the case of the ringZ of rational integers, we must take all valuations of the field R of rational numbers (we shall see in the future that this is also true for maximal orders of algebraic number fields). To each prime number p E Z (that is, the prime divisor of the ring Z) there corresponds the valuation vp of the field R, the value of which is given for the nonzero rational number x
= pm_a b
(3.7)
(a and b integers not divisible by p) by
vix) = m.
(3.8)
This valuation vp is called the p-adic valuation of the field R [it is clear that the valuation (3.8) coincides with the p-adic valuation of the field R of p-adic numbers; see Section 3.2 of Chapter 1]. Theorem 5. Every valuation of the field of rational numbers is of the form vp for some prime p.
Sec. 3]
DIVISORS
179
Proof. Let v be any valuation of the field R. Since
v(l
+ ... + 1) ~
min (v(l), ... , v(l))
= 0,
then v(n) ~ 0 for all natural numbers n. If v(p) = 0 for all primes p, then we would also have v(a) = 0 for all a¥-O of R, which is impossible by condition (l) of the definition of a valuation. Hence for some prime p we must have v(p) = e > O. Suppose that for the prime q ¥- p we also had v(q) > O. Then from the equation pu + qu = 1 (u and u rational integers) we obtain 0= v(pu
+ qu) ~
min (v(pu), v(qu)) ~ min (v(p), v(q)) > O.
This contradiction shows that v(q) = 0 for all primes q except p. Hence v(a) = 0 for all integers a not divisible by p. For the rational number (3.7) we thus have v(x)
=
mv(p)
+ v(a) -
v(b)
=
me
=
evix).
Since the valuation v must take on all integral values, e = 1 and hence v = v ' p Theorem 5 is proved. Note that Theorem 5 could easily have been deduced from Theorem 3 of Section 4, Chapter 1, the second part of the proof of which we have essentially repeated above. We conclude this section by considering another special case. Assume that for some ring ,0 we have a theory of divisors ,0* -+ f0 with only finitely many prime divisors Pl' ... , Pm' Denote by v1 , ••. , Vm the corresponding valuations of the quotient field K. By condition (3) of Theorem 4 for any divisor a = Pl k, .,. Pm k m (k i ~ 0), there is an element rt. E ,0 for which V1 (rt.) = k b '" , vm(rt.) = k m. But this means that the divisor a coincides with the principal divisor (rt.). Thus all divisors of f0 are principal, and the ring ,0 has unique factorization (Theorem 2). If Pl = (n 1 ), •.. , Pm = (n m ), then the elements n 1 , '" , n m constitute a complete set of pairwise-nonassociate prime elements of the ring ,0 and every element rt. E ,0* has a unique representation in the form
where c; is a unit of the ring ,o. The prime elements n 1 , characterized by
•.. ,
n m are clearly
forj ¥- i. We have obtained the following result. Theorem 6. If for some ring ,0 we have a theory of divisors with only a finite number of prime divisors, then ,0 has unique factorization into primes.
180
THEORY OF DIVISIBILITY
[Chap. 3
PROBLEMS
1. If the ring D has a theory of divisors, show that every element of D has only a finite number of (nonassociate) factors. 2. Show that in any theory of divisors any divisor is the greatest common divisor of two principal divisors. 3. Let K = k(x) be the field of rational functions over a field k and let cp be some irreducible polynomial in k[x]. Every nomero rational function u of K can be written in the formu = cp'(f/g), wherefandg are polynomials in k[x] which are not divisible by cpo Show that the function V,p, given by v",(u) = k, is a valuation of the field K. 4. If f and g are nonzero polynomials of k[x] of degrees nand m, and u = f/g set v*(u) = m - n. Show that the function v* is a valuation of the field K = k(x).
E
k(x),
5. Let v be a valuation of the field k(x) such that v(a) = 0 for all nonzero a in k. Show that v is either of the form v", (for some irreducible polynomial cp E k[x]) or else v = v* (see Problems 3 and 4). 6. If we set D = k [x], determine the set SR of valuations of the fieldK = k(x) which satisfies the conditions of Theorem 4. Further, determine the set SR for the ring D' = k[l/x]. 7. Let K = k(x, y) be a field of rational functions in two variables over the field k. For any natural number n set x. = x/y"o A nonzero rational function u = u(x, y) E K can be represented in the form
J(x., y) u = u(x.y", y ) = y - ( - - ) , g x., y where the polynomialsfandg are not divisible by y. If we set v.(u) = k, show that thefunction v. is a valuation of the field K. Further, show that the valuations v. (n ;;. l) are all distinct, and that for all of them v.(x) > O. 8. Formulate and prove an .. Eisenstein irreducibility criterion" for polynomials over any ring 1) with a theory of divisors. 9. Show that if a ring D has a theory of divisors, then its quotient field K has algebraic extensions of all degrees. 10. Letfbe a nonzero polynomial in the ring D = k[x, y] of polynomials in two variables over the field k. Denote by (ii f) the smallest degree of a monomial which appears in fwith nonzero coefficient. Show that the function ii can be extended to a valuation of the field of rational functions k(x, y). Denote by ~I the set of all valuations of the field k(x, y) which correspond to irreducible polynomials of the ring D, and let SRI be obtained from SR by adjoining ii. Which of the conditions of Theorem 4 are not fulfilled for the ring D and the set SRI of valuations?
4. Valuations
Theorem 4 of Section 3 reduces the problem of constructing a theory of divisors for an integrally closed ring D to the determination of a set of valuations of the quotient field K which satisfy the conditions of the theorem. We turn to a systematic study of valuations.
Sec. 4]
VALUATIONS
181
4.1. Simple Properties of Valuations
From the definition of a valuation of a field K (Section 3.4) we immediately obtain v(±l)=O v( -C()
= v(C()
v(~) = v(C() v(C(n) v(C(,
= nv(C()
+ ... + C(n) ~
(f1 # 0),
v(f1), nEZ,
min (v(C(,), ... , v(C(n».
Now assume that v(C() # v(f1). If v(C() > v(f1), then v(C( + f1) ~ v(f1). On the other hand, since f1 = (C( + f1) - C(, v(f1) ~ min (v(C( + f1), v(C(», so that v(f1) ~ v(C( + f1). Hence v(C(
+ f1) = min (v(C(),
v(f1»
if v(C() # v(f1).
(4.1)
By induction we obtain v(C(,
+ ... + C(n) = min (v(C(l),
... , v(C(n»,
provided that the minimum value of V(C(l), '" , v(C(n) occurs only once. Definition. Let v be a valuation of the field K. The subring D. of the field K consisting of all elements C( E K for which v(C() ~ 0 is called the ring of the valuation v. The elements of D. are called integral with respect to the valuation v.
It is clear that all three conditions of Theorem 4 of Section 3 are fulfilled for the ring D and the set 91 consisting of the single valuation v. Hence the ring D. has a theory of divisors with a single prime divisor. From Theorems 3 and 6 of Section 3 we obtain the following results. y
Theorem 1. The ring D y of the valuation v of the field K is integrally
closed in K. Theorem 2. The ring D has (up to associates) a single prime element n, and any element C( # 0 of D. has a unique (for fixed n) representation in the form C( ~ E:n m , where E: is a unit in D y (m ~ 0). y
The prime element n is clearly characterized by v(n) = 1. In the ring D y , as in any ring, we can consider congruences with respect to the elements of D. (see the Supplement, Section 4.1). Since congruences
I
THEORY OF DIVISIBILITY
182
[Chap. 3
modulo associate elements are equivalent, the ring of residue classes modulo the prime element 1t does not depend on the choice of 1t but is completely determined by the ring D •. We denote this ring of residue classes by L. and will now show that it is a field. For if rx E D. and rx ;f. 0 (mod 1t), then v(rx) = 0 and this means that rx is a unit in D •. Then rx has an inverse ~ and rx~ == I (mod 1t), since rx~ = 1. The field L. is called the residue class field of the valuation v. 4.2. Independence of Valuations
Let the ring ,0 have a theory of divisors ,0* -+ g&, and let PI' ... ,Pm be distinct prime divisors of g&. By Theorem 4 of Section 3 there correspond to these prime divisors valuations VI' ... , Vm of the quotient field K, and these valuations are independent in the sense that there exist elements in K on which they take on any given set of values k l , ... , k m • For if we set k / = max (Q, k j ), i = 1, ... , m, and k;" = min (0, k;), then by condition (3) of Theorem 4 of Section 3 we can find elements rx and fJ in ,0 fur which vj(rx) = k/ and vj(fJ) = -k;", and then for the quotient ~ = rxlfJ we will have vl~) = k j (1 ~ i ~ m). We now show that this property of independence does not depend on the fact that the valuations V j corresponded to prime divisors in some theory of divisors, but is true for any finite set of valuations. Theorem 3. If VI' rational integers k l ,
are distinct valuations of the field K, then for any k m there exists an element ~ E K for which
, Vm ,
vIG)
= kl ,
... ,
vm(~)
= k m.
Let ,01'"'' ,om denote the rings of the valuations VI' ... 'V m and set = n;':.1 ,oj' Conditions (1) and (2) of Theorem 4 of Section 3 are clearly fulfilled for the ring ,0 and the set 91 consisting of the valuation SV I , ... , V m • From the formulation of Theorem 3 we see that condition (3) also holds, and hence the ring ,0 has a theory of divisors with a finite number of prime diyisors. Thus Theorem 3 implies that for any finite set of valuations VI' ... , V m of the field Kwe have a theory of divisors for the ring ,0 = n~= I D. From Theorem 6 of Section 3 we then derive the following result.
,0
Corollary. If
are distinct valuations of the field K with rings 0 ,0 i is a ring with unique factorization. Further, each nonzero element of ,0 has a unique representation in the form rx = e1tlk, .. , 1t mkm , where e is a unit in ,0, and 1t 1, ... , 1t m are fixed prime elements of ,0 characterized by ,01' .. , ,
VI' ... , V m
,om' then the intersection ,0 =
Vj(1tj)
= 1,
nr=
v/1t j)
=0
(j ¥- i).
Sec. 4]
VALUATIONS
183
= I, the assertion of the theorem is contained in the definition of a valuation. Assume that m ~ 2 and that the case of m - I valuations has already been proved. We show that then there do not exist rational integers C I , ... , Cm , not all zero, for which Proof of Theorem 3. For m
(4.2) for all nonzero ~ E K. Assume the contrary, that is, that (4.2) does hold. Among the coefficients at least two must be nonzero and have the same sign [otherwise there would be only two nonzero coefficients, say, C1 and C1 , with CI > and C1 < 0, and then from CI V1 (~) + C1v1(O = we would obtain vI(O = eVl(~) with positive e, and this is possible only for e = I and VI = v1 ]. Changing the numeration, if necessary, we can write (4.2) in the form
°
°
(4.3) where at least one of the rational coefficients OJ is negative. By the induction hypothesis there exist elements P and P' in the field K such that V
j(P)
vj(P)
for all i
= 0,
v;(P')
=I
= I, vj(P') =
°
if
OJ
~
if
OJ
< 0,
0,
= 2, .. , , m. Then (4.4)
Consider the sum P + p'. Since one of the numbers v j(P) and vj(P') (i = 2, .. , , m) equals and the other equals I, then vj(P + P') = min (vj(P), vj(P')) = 0. From the relation (4.3) we therefore obtain v1(P + P') = 0. On the other hand, from (4.4) we obtain v j(P + P') = min (v j(P), VI (P')) < 0.
°
This contradiction proves that (4.2) is impossible. Let ,0 now denote the intersection of the rings of the valuations V 1 , '" , Vm , and let E denote the group of units of this ring. Let n 1 , ... , n m denote prime elements of ,0, numbered so that v;(n;) = I (i = 2, ... , m) (recall that for the case of m - I valuations, Theorem 3, and hence also its corollary, are assumed). We now show that the valuation VI cannot be identically zero on the group E. Any element ~ E K* can be written in the form
where c; E E, k j = v;(~) (2 ~ i ~ m). If v1(c;) we would obtain Vl(~) = k1v1(nl) + '" which can also be written in the form
=
°for all
+ kmv1(n m),
(4.5) c;
E
E, then from (4.5)
!
184
THEORf OF DIVISIBILITY
[Chap. 3
where the rational integers ai = vlni) do not depend on ~, and this contradicts the fact that a relation of the form (4.2) is impossible. Hence the group E contains elements on which the valuation Vi is nonzero. Choose an element y in the group E on which the valuation Vi takes its smallest positive value I. It is clear that all values of Vi on E are divisible by I. We shall show that 1= 1. If all the values a 2 = Vi (n 2 ), •• , , am = Vi (n m) were divisible by I, then we would deduce from (4.5) that all values Vl(~) of the valuation Vi are divisible by I, which is possible only for I = 1. Consider the case where not all a j are divisible by I, say, a 2 is not divisible by I. Consider the element
where s is an integer chosen so that the number
satisfies the inequality 0 < Ii < I. It is clear that v1 (ct) i = 2, ... , m. Set e = y + ct. Since vle) = min (v;(y), vj(ct)) same time,
= Ii
and Vj(ct) > 0 for
= 0 for all i = 2, ... , m, then e E E. But at the
which contradicts the choice of y. This shows that the case when not all ai are divisible by I is impossible, and hence 1= 1. We may now assume that the prime elements n j (2 ~ i ~ m) of the ring.o are chosen so that vlni) = a j = O. For each ni can be replaced by n;' = niy-a" for which v1(n/) = a i - ajvi(y) = O. Setting nl = y, we have obtained a system of elements nl' n2' ... , n m , for which vj(ni) = I and Vini) = 0 for j #- i. If now k 1 , ... , k m are any integers, for the element ~ = nl k , ... n mk ", we have
Theorem 3 is proved. From Theorem 3 we easily deduce the following stronger result. Theorem 4 (Approximation Theorem). If Vi' ... , Vm are distinct valuations of the field K, then for any elements ~1' ... , ~m of K and any integer N, there exists an element ~ E K for which l
185
VALUATIONS
Sec. 4]
Proof. Choose in K elements ct 1, ... , (j #- i) and set ct k ~ = __ 1 _ ~1 1 ct/
+
ct m
such that Vj(ctj) = -1, v/ct i ) = 1 ct k
+ ... + _ m _k 1 + ct m
~m'
Since Vj(ctn #- 0 = vj(l) for all natural numbers k, then by (4.1) the value of vp + ct/) equals 0 for i #- j and equals -k for i = j, so that ct.k ) Vj ( 1 +' ct k = k
for
i #-j
and
Vj(~) = k. 1 + ct j
i
Hence V/~ -~) ~
It is clear that
~
min (k i
+ V/~i))'
will satisfy the theorem provided k ~ N - min v/
O.
i,j
4,3, Extension of Valuations
Let k be a field and K a finite extension of k. If v is some valuation of the field K, then by restricting v to the field k we obtain a function which clearly satisfies conditions (2) and (3) in the definition of a valuation (Section 3.4). The first condition may not be satisfied; that is, the values ofv on the elements of k may not exhaust the group Z. But v cannot be identically zero on k. If this were the case, then the field k would be contained in the ring of the valuation v, and since that ring is integrally closed (Theorem 1), then K would also be contained in it, and this is impossible. Thus v(a), a E k*, takes on both negative and positive values [if v(a) < 0, then v(a- 1 ) > 0], Let p denote any element of k on which v takes its least positive value v(P) = e. Then for any a E k* the value v(a) = m is divisible bye. For if m = es + r, 0 ~ r < e, then v(ap-S) = m - se = r, so by the minimality of e we have r = 0, Now setting (a
E
k*),
vo(O) =
00,
(4.6)
we obtain on k a fUQction vo, which takes all integral values and which consequently is a valuation of the field k. Definition. Let K be a finite extension of the field k. If the valuation Vo of the field k is related to the valuation v of the field K by (4.6), then we say that Vo is induced on k by the valuation v, and v is an extension of Vo to the
I
I I,
;I
186
THEORY OF DIVISIBILITY
[Chap. 3
field K. The uniquely determined integer e which appears in (4.6) is called the ramification index of v with respect to V o (or with respect to the subfield k). Note that in this defl.nition when e > 1, the term extension of a valuation does not correspond to the usual concept of the extension of functions to larger domains of definition. It follows from the above that every valuation v of K is induced by a unique valuation Vo of k. The converse assertion is also valid, that is, for any valuation V o of k there exists an extension to K (which is, in general, not unique). The proof of this fact is fairly difficult, and we shall give it in the next section. First, we consider some properties of extensions of a given vo , assumin'g that such extensions exist. Let k eKe K' be a tower of finite extensions, and let vo , v, v' be valuations of the fields k, K, K'. It is clear that if v is an extension of V o with ramification index e, and v' an extension of v with ramification index e', then v' is an extension of V o to the field K', and the ramification index of v' with respect to V o is equal to ee'. It is also easy to see that if v and Vo are induced by the valuation v', then v is an extension of V o ' Lemma 1. If K is a finite extension of the field k of degree n, then any valuation V o of the field k has at most n extensions to the field K. Vm be distinct extensions of V o to the field K. By Theorem 3 we can find elements ~l' ... , ~m for which v;(~i) = and viO = 1 for j # i. We shall show that these elements are linearly independent over k. Consider the linear combination
Proof. Let v:, ... ,
°
with coefficients aj of k, not all zero. Set k = min (vo(ad, ... , vo(a m)), and let i o be such that VO(aio) = k. Denoting by e the ramification index of Vio with respect to k, we have
and therefore
vIO(Y) = min (vio(a 1 ~ d, ... , vio(am~m)) = ek, so that Y # 0, which proves our assertion. From the linear independence of the elements ~l' ... , ~m over the field k, it follows that m ~ (K: k), and this means that the number of extensions Vi is not greater than n. Lemma 1 is proved.
Sec.4]
VALUATIONS
187
Assume now that Vb •.• , vm are all extensions of a fixed valuation V o of a field k to a finite extension K. Let 0 denote the ring of the valuation Vo and ,0 its integral closure in the field K, and let Db ... , ,om denote the rings of the valuations VI, ... , V m • Since 0 c Db and the ring ,oj is integrally closed in K, then ,0 c ,oj for i = I, ... , m, and hence
Later we shall see that equality actually occurs here. If this is so, then by the corollary of Theorem 3, ,0 has unique factorization with a finite number of nonassociate prime elements. Since the nonassociate prime elements 1tl' ... , 1t m of the ring ,0 are in one-to-one correspondence with the valuations VI' •.• , V m , we obtain a method for constructing valuations of K which are extensions of the valuation V o ' So assume that we know that the ring ,0, the integral closure of the ring of the valuation Vo in the field K, has unique factorization with a finite number of nonassociate prime elements. From Theorem 6 of Section 3, this assumption is equivalent to the existence in ,0 of a theory of divisors with a finite set of prime divisors PI' ... , Pm' We shall show that then the valuation Vo has precisely m extensions to the field K, these being the valuations VI' .•. , Vm of the field K which correspond to the prime divisors PI' ... , Pm . Let p be any prime element of the ring 0 of the valuation Vo [that is, any element of k such that vo(p) = 1], and let 1t 1, .. , , 1tm be a complete set of nonassociate prime elements of the ring ,0 [numbered so that v j (1tJ = 1]. Since oeD, the element p has a factorization p
= e1t I e,
'"
1tm e~
(4.7)
in the ring ,0, with nonnegative exponents ej (e a unit in D). Now if a is any element of k* and vo(a) = s, that is, a = pSu, where u is a unit in 0, then in ,0 we have
(4.8) If e j = 0, then Vj would be identically zero on k*, and we saw at the start of this section that this is impossible. Hence ej > O. Formula (4.8) now implies that each of the valuations V j (i = 1, ... , m) is an extension of V o to the field K. We also obtain that ej, the ramification index of V j with respect to vo , is given by (4.7) Assume now that V is an extension of the valuation Vo to the field K. Since 0 is contained in the ring of the valuation v, so is its integral closure ,0, that is, v(a) ~ 0 for all rx E ,0, and this means that v(e) = 0 for all units e E D. If V were distinct from VI' •.. , V m , then by Theorem 3 there would be a unit e of the ring ,0 such that v(e) i= O. Hence V must be one of the V j •
,
I'
1:
188
1
1<
Ii
THEORY OF DIVISIBILITY
[Chap. 3
Hence every extension of the valuation Vo to the field K is one of the valuations VI' ... , V m • By condition (2) of Theorem 4 of Section 3 we also obtain that the integral closureD of the ring 0 in the field K consists of all elements a E K for which v;(a) ~ 0 for all extensions vi' If we again denote the ring of the valuation Vi by ,o i , we can state this last result as m
,0=
n,o i .
(4.9)
i= I
We have shown that to guarantee the existence of extensions of the valuation to the field K and to give a complete description of all extensions, it suffices to verify that the ring ,0 has unique factorization (with a finite number of nonassociate prime elements). Vo
4.4 Existence of Extensions
Let, as before, k be a field with a valuation vo , 0 the ring of the valuation vo , and p a prime element of the ring o. We denote the residue class field of the valuation Vo by :[0' For each element a EO we denote the corresponding residue class modulo p by a. We then have a = [j in the field :[0 if and only if a == b (mod p) in the ring o. Now let K be a finite extension of k and let ,0 be the integral closure of l) in K. Lemma 2. If the number of elements of the residue class field :[0 of the valuation Vo is not less than the degree of the extension Kjk (in particular, if the field :[0 is infinite), then the ring ,0 is Euclidean and hence has unique factorization. The ring ,0 then contains only a finite number of pairwisenonassociate prime elements.
Proof. We define a E K* a function
Iiall =
Iiall, by setting 2>o(NK/k a ).
It is clear that this function satisfies IlaPIl = Ilall'IIPIl (a, PE K*). If a E ,0*, then lIall is clearly a natural number. We must show that for any pair of elements a and P i= 0 of ,0 there exist ~, p E ,0, such that a
= P~ + p,
(4.10)
where p is either zero or else IlpH < IIPII. If a is divisible by Pin the ring ,0, that is, a = Py, where y E ,0, then (4.10) holds with ~ = y and p = O. Assume that a is not divisible by P, that is, that the element y = ap- l does not belong to ,o. Letf(t) = t n + c1t n - 1 + .,. + Cn (Ci E k) be the characteristic polynomial of the element y with respect to the
Sec. 4]
VALUATIONS
189
extension Kjk. Since y f. ,0, not all the coefficients Ci belong to o. If mini" i,.n VO(Ci) = -r < 0, then all coefficients of the polynomial q>(t) = p'f(t) will belong to the ring 0, and at least one of them will be a unit in o. We now replace all coefficients of q>(t) by the corresponding residue classes modulo p. Since the leading coefficient of q>(t) is p', which is divisible by p, we obtain a polynomial ip(t) of the ring :[o[t] of degree ~n - 1, where not all coefficients are zero. Since we have assumed that the field :[0 contains at least n elements, there is an element a E 0 for which the residue class ais not a root of ip(t). This means that q>(a) "I- (mod p); that is, q>(a) is a unit in the ring o. We now compute lIy - ail· The characteristic polynomial of y - a equals f(t + a), and therefore
°
N K/k(Y - a) = ( - l)"f(a) = ( - l)"q>(a)p-',
so that
Ily-all =2-'< 1, Ilet-aPil < IIPII. Hence (4.10) is satisfied if we set ~ = a, p = et - ap. We have shown that ,0 is a Euclidean ring, and thus by Theorem 2 of Section 2 it has unique factorization. Let n be any prime element of the ring ,o. Since for any et E ,0*, its norm NK/k(et) is always divisible by et, then NK/k(n) = plu is divisible by n (u is a unit of 0, f ~ 1). But since n is prime and factorization into primes is unique, the element p is also divisible by n. Hence, if p has the factorization
in the ring ,0 (e a unit in D), then the prime elements n l , plete set of nonassociate primes of ,o. The proof of Lemma 2 is complete.
... ,
n m form a com-
We now turn to the proof of the basic results of this section. Theorem 5. Any valuation V o of the field k can be extended to any finite extension K of k. Theorem 6. Let 0 be the ring of the valuation Vo , and let ,0 be its integral closure in the field K. If Vi> ... , Vm are all extensions of the valuation Vo to the field K, and ,01' ... , ,om are their rings, then m
Theorem 7. Under the same notations, the ring ,0 has unique factorization, and the set of valuations of K which correspond to prime elements of ,0 is precisely the set of all extensions VI' ••• , Vm of the valuation Vo to the field K.
I,
,1
: ,I
190
[Chap. 3
THEORY OF DIVISIBILITY
If the prime elements 1r1> ... , 1r m of the ring ,0 are ordered so that v;(1rj) = I, and if the prime element p of the ring 0 has the factorization (c; a unit in D),
then ej is the ramification index of the valuation
Vi
with respect to
Vo'
Proof. If we assume that Theorems 5 and 6 have already been proved, then
i
,I
, I
'
by the corollary of Theorem 3 the ring has unique factorization (with a finite number of nonassociate prime elements), and hence all results obtained in the second half of Section 4.3 are valid. But these results are precisely the contents of Theorem 7. Theorems 5 and 6 will be proved by induction on the degree n of the extension Kjk. For n = I, there is nothing to prove. Let n> I and assume that Theorems 5 and 6 have already been proved for all extensions of degree
Sec. 4]
VALUATIONS
191
L' denote the ring of the valuation vo', a prime element of the ring 0', and the residue class field of 0' modulo p'. Two elements a and b of 0 are congruent modulo p' (in the ring 0') if and only if they are congruent modulo p in the ring o. Hence those residue classes of 0' modulo p' which contain an element of 0 form a subfield of L' isomorphic to LO' Having in mind this uniquely defined isomorphism LO -+ L', we shall assume that LO C L'. Since the element 0 is the root of a polynomial with coefficients in 0 and with leading coefficient 1, then 0 E 0' (since 0' is integrally closed). Let 0 denote its residue class in L'. The equation ep(D) = 0 can be reduced modulo p' and gives us ep(l}) = O. Since ep was chosen to be irreducible over the field LO' the powers T, 0, ... , l}n-2 are linearly independent over LO' This means that the field L' contains qn - I elements (q is the number of elements of the field LO)' But now (K': k')
=
(K': K)(K: k)
(k': k)
~
(11 - l)n 11 -
1
= n.
Since q ): 2 and n ): 2, we have
Since the number of elements in the residue class field L' of the valuation vo' is not less than (K': k'), vo' extends to a valuation v' of the field K'. Since v' is an extension of Vo to K', the valuation v, induced by v' on the field K, is also an extension of the valuation Vo (see Section 4.3). Theorem 5 is proved. To complete the proof of Theorem 6 we first show that vo' is the only extension of Vo to a valuation of the field k'. Assume that vo" is another extension of the valuation Vo to the field k'. By Theorem 3 the field k' contains an element y such that vo'(y) = 0, vo"(y) > O. Since the powers 1,0, ... ,on-2 form a basis for k' over k, the element y can be represented in the form y
= l(c o + cIO + .. , + Cn_2 0n - 2 ) = pk rx,
where all coefficients C j belong to 0 and at least one of them is a unit in o. We saw above that 0 E 0' and the classes 1,0, ... , on-2 of L' are linearly independent over LO' Hence the residue class Ii
= Co + CIl} + ... + Cn_20n-2
is nonzero (since at least one of the coefficients C j is nonzero). This means that rx is not divisible by p' (in the ring 0'); that is, vo'(rx) = O. Analogously we obtain vo"(rx) = O. Comparing the conditions vo'(y) = 0 and vo'(rx) = 0 with y = pk rx, we see that k = 0, and hence vo"(y) = vo"(rx) = O. But this contradicts the choice of y. Thus the valuation Vo has only one extension to the field k'. Since Theorem 6 is assumed valid by induction for the extension k'lk, the ring 0' of the valuation vo' coincides with the integral closure of the ring 0 in the field k'. Let ,0' denote the integral closure of the ring 0 in the field K'.
THEORY OF DIVISIBILITY
192
[Chap. 3
Since 0' c ,0' and the ring ,0' is integrally closed in K' (Supplement, Section 4.3), then ,0' is also the integral closure of the ring 0' in the field K'. Let VI" ••• , v: be all extensions of the valuation vo' to the field K', and let ,01" .. " ,0: be their rings. Since Theorem 6 holds for the extension K'lk' (by Lemma 2), then r
D'=nD/j;1
(4.11)
The set of valuations V/ is also the set of all extensions of the valuation Vo to the field K'. Equation (4.11) thus can be considered a proof of Theorem 6 for the extension K'ik and the valuation vo . Let VI, .,. , Vm denote all valuations of the field K which are induced by one of the valuations v/' and let ,01' ... , ,om denote their rings. If v/ is an extension of V; , then clearly ,0/ n K = D i • Noting that the intersection ,0' n K coincides with the integral closureD of the ring 0 in the field K, we have m
,0
= ,0' n
K
=n j~
(,0/ I
n K)
=n i; I
Di •
(4.12)
Suppose now that there is an extension V of the valuation V o to K different from VI' ••• , Vm • Then by Theorem 3 there would be an element rx in K for which v1(rx) ~ 0, ... , vm(rx) ~ (and hence rx E D) and v(rx) < 0. But this would contradict the fact that ,0 must be contained in the ring D. of the valuation v. Thus VI' ..• , Vm are the only extensions of the valuation Vo to the field K. Formula (4.12) coincides with the assertion of Theorem 6.
°
PROBLEMS
t~ " I 1. Show that an algebraically closed field has no valuations. 2. Let K = k(x) be a field of rational functions over the field k and let v be the valuation of K corresponding to the polynomial x - a (a E k). Show that the residue class field ~v of the valuation v is isomorphic to k. Show further that two elements f(x) and g(x) of the ring lie in the same residue class if and only if f(a) = g(a). 3. Let K = k(x) be a field of rational functions over the field k of real numbers, and let v be the valuation of K corresponding to the irreducible polynomial x 2 + I. Find the residue classfield ~v of the valuation v. 4. Let (I, and (12 be the rings of the valuations v, and V2 of some field K. Show that if (I, c (12. then v, = V2' 5. Find the integral closure of the ring of 3-integral rational numbers in the field R(V - 5) and determine all extensions of the 3-adic valuation v, to this field. 6. For all prime numbers p find all extensions of the p-adic valuation V p to the field R(V=!) and determine the corresponding ramification indices.
Sec. 5]
THEORIES OF DIVISORS FOR FINITE EXTENSIONS
7. Let KI k be a normal extension and Vo a valuation of the field k. Show that if extension of Vo to the field K, then all extensions have the form v'( IX) = v(a( IX»
193 v
is any
(IX E K),
where a runs through all automorphisms of Klk. 8. Let k be a field of characteristic p. Let Klk be a purely inseparable extension. Show that a valuation Vo of the field k has only one extension to the field K. [The extension Klk is called purely inseparable if every element of K is a root of degree p' (s ~ 0) of some element of k.]
9. Let k = ko(x, y) be a field of rational functions in x and y over some field k o . In the field of formal power series k o {t} (see Problem 7 of Section 4, Chapter 1, or Section 1.5 of Chapter 4) choose a series g(t) = L:=o cot O (co E ko) which is transcendental over the field of rational functions ko(t) [the existence of such series follows from the fact that the field k o {t} has higher cardinality than the field ko(t), and hence higher cardinality than the set of elements of ko{t} which are algebraic over ko(t)]. For a nonzero polynomial f = f(x, Y)E ko[x, y], the seriesf(t, g(t» will be nonzero by choice of g.1f tOis the smallest power of t which OCcurs in this series with nonzero coefficient, set vo(f) = n. Show that the function Vo (after suitable extension) is a valuation of the field k and that the residue class field of the valuation is isomorphic to the field k o .
5. Theories of Divisors for Finite Extensions
5.1.
Existence
Theorem 1. Let the ring 0 with quotient field k have a theory of divisors f0 which is determined by the set 91 0 of valuations. If K is a finite extension of the field k, then the set 91 of all valuations of K which are extensions of 0* ---+
valuations of 91 0 determines a theory of divisors for the integral closure D of the ring 0 in the field K. Proof. By Theorem 4 of Section 3 we need only verify that the set 91 satisfies
all conditions of that theorem. We first verify the second condition. For any valuation v E 91 and any a EO we clearly have v(a) ~ O. This means that 0 is contained in the ring of the valuation v. But then by Theorem I of Section 4 the integral closure of the ring 0 in the field K is also contained in the ring of the valuation v. In other words, v(C() ~ 0 for all C( E D. Conversely, let C( E K be an element such that v(C() ~ 0 for all v E 91. Let t r + a 1 t r - 1 + ... + ar denote the minimal polynomial of C( with respect to k. Let Vo be any valuation of k belonging to the set 91 0 , and let VI' ... , V m be its extensions to the field K. Since V1 (C() ~ 0, ... , vm(C() ~ 0, then by Theorem 6 of Section 4 the element C( lies in the integral closure in K of the ring of the valuation V o . But in this case all the coefficients ai' ... , a r must lie in the ring of the valuation Vo (see the Supplement, Section 4.3); that is, vo(a l ) ~ 0, "', VO(a r ) ~ O. Since this holds for all V o E 91 0 , the coefficients a1 , '" , ar belong to 0, and hence C( ED.
194
THEORY OF DIVISIBILTY
[Chap. 3
We now tum to the first condition. Let rx ED, rx #- 0, iL1.d let a, be determined as above. Then for all but a finite number of valuations vo of 91 0 we have vo(a,) = 0. Hence for all but a finite number of valuations v of 91 we have v(rx-l)=a,-l(rx'-l+.··+a'_l))~O, and this means that V(rx) =0. Hence v(rx) = for almost all v E 91. Only the third condition now remains to be verified. Let Vl' .,. , Vmbe distinct valuations of 91 and kl> ... , k m be nonnegative integers. Let VOl' ... , VOm be the corresponding valuations of 91 0 (the VOi are not necessarily all distinct). Expand the original set of valuations to the set Vl' ... , Vm, Vm+l' '" , V s ' consisting of all extensions of the valuations VOi to the field K. By Theorem 3 of Section 4 there is an element y in the field K for which
°
, I
vl(y) = kl> ... , vm(y) = k m,
Vm+l(}') = 0, ... , v.(y) = 0.
If this element y belongs to the ring ,0, just set rx = y. Assume that y does not belong to D. In this case denote by Vl', ... , v: the valuations of 91 which take negative values on y: and by VOl" ... , vo: the corresponding valuations of 91 0 (various vo/ also may be equal). Since each of the valuations vo/ is different from each of the valuations VOi, in 0 there is an element a such that vOi(a) =
°
(1 ~ i ~ m),
vo/(a) = / (1 ~j ~ r),
where / is taken equal to max (/1' ... , I,). Set rx = ya. Since v/(rx) = v/(y) + v/(a) ~ -/j + vo/(a) = -/j + / ~ 0,
then rx E D. Thus in any case we have in the ring ,0 an element rx such that vl(rx) = kl> ... , vm(rx) = k m , so that condition (3) of Theorem 4 of Section 3 also holds for the set 91 of valuations. The proof of Theorem I is complete. We apply Theorem I to the case of algebraic number fields. The maximal order ,0 of the algebraic number field K is, as we have seen, the integral closure in K of the ring Z of rational integers. Since Z has a theory of divisors (since it has unique factorization), then by Theorem I ,0 also has a theory of divisors. By Theorem 5 of Section 3 the theory of divisors for Z is induced by the set of all valuations of the field R of rational numbers, and since every valuation of the field K is an extension of some valuation of the field R we find that the theory of divisors for the ring ,0 is induced by the set of all valuations of the field K. We hence have the following theorem. Theorem 2. If ,0 is the maximal order of the algebraic number field K, there exists a theory of divisors ,0* ---+ g&, and this theory is induced by the set of all valuations of the field K.
Sec. 5]
THEORIES OF DIVISORS FOR FINITE EXTENSIONS
195
5.2. Norms of Divisors Let 0 be a ring with theory of divisors 0* --+!!J o and with quotient field k; let K be a finite extension of k, with .0 the integral closure of 0 in K, and let .0* --+!!J be a theory of divisors for the ring D. In this paragraph we establish a connection between the semigroups of divisors !!J o and !!J. Since 0 cD, elements of 0* correspond to principal divisors both in !!J o and in !!J. To distinguish these principal divisors, if a E 0*, we denote the corresponding principal divisor in !!J o by (a)k' and for any CI. E .0* we denote the corresponding principal divisor by (Cl.h. We have an inclusion isomorphism of semigroups 0* --+ .0*. Since any unit of the ring .0 which is contained in 0 is a unit of the ring 0, this inclusion induces an isomorphism (a)k --+ (a)K' for a E 0*, of the semigroup of principal divisors of the ring D. We now show that this isomorphism can be extended to an isomorphism !!J 0 --+!!J (which will not be onto). Theorem 3. There is an isomorphism of the semigroup !!J o into the semigroup !!J, which on principal divisors coincides with the isomorphism (ah --+ (a)K' a E 0*.
The isomorphism !!J o --+!!J is clearly characterized by the commutativity of the diagram 0*
--+
.0*
! !!J 0
! --+
!!J
that is, by the fact that the two composite homomorphisms 0* --+ .0* --+ !!J and 0* --+!!J o --+!!J coincide (the vertical homomorphisms denote the homomorphisms of the multiplicative semigroups of the rings onto the semigroups of principal divisors). Let p be any prime divisor of the ring 0, vI' the corresponding valuation of the field k, and V~l' "" v~m all extensions of vI' to the field K (\.l31' ... , \.l3m are the corresponding prime divisors of the ring D). Let el ... , em denote the respective ramification indices of the valuations V~l' ." , v~m with respect to vI" Since v~,(a) = e;vp(a) for all a E 0*, then the factor p'p(u) of the principal divisor (a)k E!!J o will become (\.l31 e, ... \.l3m em yp(U) in the principal divisor (a)K E!!J. This means that the isomorphism from !!J o to!!J defined by the mapping (5.1) (for all p) satisfies the requirements of Theorem 3. It is easily seen that the isomorphism !!J o --+!!J, satisfying the requirements of Theorem 3, is unique (Problem 5).
) , ,
I \'
~I
I! :
"'
196
THEORY OF DIVISIBILITY
[Chap. 3
By means of the isomorphism !!J o --+!!J we can identify the semigroup !!J o with its image in!!J. But prime divisors in!!J o will not, in general, remain prime in !!J. For by (5.1) each prime p has the decomposition (5.2) in the semigroup !!J. Using the embedding !!J o --+!!J we may speak of the divisibility of divisors of 0 by divisors of .0. From (5.2) we see that a prime divisor p of the ring 0 is divisible by a prime divisor 'l.' of the ring .0 if and only if the valuation v~ is an extension of the valuation vp • It is further clear that relatively prime divisors in !!J o remain relatively prime in !!J. Definition. Let 'l.'lp. The ramification index e = e~ of the valuation v~ with respect to the valuation vp is also called the ramification index of the prime 'l.' relative to p (or relative to k).
The ramification index is thus the largest natural number e such that 'l.'el p. If ct. is any element of .0, its norm N(ct.) = NK/k(ct.) lies in o. The mapping ct. --+ N(ct.), ct. E .0*, is a homomorphism from the multiplicative semigroup .0* to the semigroup o. Since the norm of any unit of the ring .0* is a unit in 0, this homomorphism induces a homomorphism (ct.)K --+ (N(ct.))k of the semigroup of principal divisors of the ring .0 to the semigroup of principal divisors of the ring o. We shall show that this homomorphism can be extended to a homomorphism of the entire semigroup !!J to !!J o . Theorem 4. There is a homomorphism from the semigroup of divisors !!J --+ !!J 0 , such that
to !!J 0 , N:!!J
(5.3) for any ct. E .0*. We can express (5.3) by saying that the diagram
is commutative. For a fixed prime divisor p E!!J o we denote the ring of the valuation vp by op and its integral closure in the field K by .op . By Theorem 7 of Section 4 the prime divisors 'l.'l' ... , 'l.'m of the ring .0 which divide p correspond uniquely to a complete set of pairwise-nonassociate prime elements 7t 1 , ••• , 7t m of the
Sec. 5]
THEORIES OF DIVISORS FOR FINITE EXTENSIONS
ring .01" The correspondence element ct. E K, if
197
'l.'i +-+7ti has the property that for any nonzero (5.4)
where e is a unit of the ring .01" then ki
=
v~,(ct.).
Let 'l.' be one of the prime divisors 'l.'i which divides 1', and let corresponding prime element of the ring .01' . Set
(5.5) 7t
be the (5.6)
It is clear that d~ does not depend on the choice of 7t. Taking the norm in (4) and comparing with (5) and (6) we obtain the relation vp(N K/k(ct.))
=
Ld~v~(ct.)
(5.7)
~Ip
('l.' runs through all prime divisors of the ring .0 which divide 1'). We can now construct the desired homomorphism N: ~ ~ ~o . A divisor m: = 'l.'t At •• , 'l.'r A r of the semigroup ~ can be conveniently written as an infinite product
m: = TI 'l.'A(~) ~
over all prime divisors 'l.' of~, in which, however, only a finite number of the exponents A(~) are nonzero. [A('l.') equals Ai' if ~ = 'l.'i' and equals zero if the divisor 'l.' is not one of 'l.'t, .,. , ~r'] We can write divisors of the ring 0 in an analogous fashion. If (ct.)K is the principal divisor corresponding to a nonzero element ct. of .0, then we have the representation (5.8) From (5.7) we see that if (N(ct.))k
= TI
pe(p),
(5.9)
I'
then e(p) must satisfy e(p) = Ld~v~(ct.).
(5.10)
~Ip
This suggests the following definition. Definition. Let ~r = TI~'l.'A(~) be a divisor of the ring D. For any prime divisor l' of the ring 0, set a(p)
=
Ld~A('l.').
~Ip
198
THEORY OF DIVISIBILITY
[Chap. 3
The divisor TIp pQ(p) of the ring 0 is called the norm ofthe divisor'll with respect to the extension Klk and is denoted by NK/k('ll), or simply by N('ll). Since A('l.') equals zero for almost all 'l.' (that is, all but a finite number of a(p) also equals zero for almost all p, and hence the expression TIp pQ(p) actually is a divisor of the ring o. From the definition it is clear that
'l.'), then
N('llIB) = N('ll)N(IB)
for any two divisors'll and IB of D. The mapping'll --+ N(~l) is thus a homomorphism of the semigroup .0 to the semigroup ~o. In the case of a prime divisor'll = ~ we clearly have (5.11)
!
j'
i
In view of (5.10), the norm of the divisor (5.8) equals the divisor (5.9) and hence we have proved the existence of a homomorphism N: ~ --+ ~o, which satisfies the requirements of Theorem 3. As in the case of the isomorphism ~o --+~, it can be shown (Problem 4) that the homomorphism N: ~ --+ ~o is uniquely determined by condition (5.3). One of the central problems of the theory of divisors is to determine a rule for the decomposition of the prime divisor p of the ring 0 into prime divisors of the integral closureD of 0 in some finite extension field. In the general case this problem is still unsolved (however, see the end of Section 8.2). Each decomposition (5.2) is characterized by the number m of prime divisors and by the various ramification indices e i = e~;. The natural numbers ei' however, cannot be taken arbitrarily (for a given extension Klk). For they are related to the numbers d~ [see (5.6)] by the formula
L d~e~ = n = (K:k),
(5.12)
~Ip
for the proof of which it suffices to apply formula (5.7) to the case of a prime element p of the ring op [recall that v~,(p) = e;]. 5.3. The Degree of Inertia
The definition of the homomorphism N: ~ --+ ~o depended on the numbers d"" which were defined in a rather formal manner in (5.6). We now clarify the deep arithmetical significance of these numbers. Let 'l3lp. Let op and .o~ denote the rings of the valuations vp and v~, and p and 7t the prime elements in these rings. Since for elements a and b of op the congruences a == b (mod p) in the ring op) and a == b (mod 7t) in the ring .0", are equivalent, each residue class in op modulo p is contained in a single residue class modulo 7t in .o~. This determines an isomorphic embedding of
Sec. 5]
THEORIES OF DIVISORS FOR FINITE EXTENSIONS
199
the residue class field ~p = op/(p) of the valuation vI' into the residue class field ~~ = D~/(7t) of the valuation v~. Using this isomorphism we assume that ~p c ~~. For any ~ E D~ we denote the residue class modulo 7t which contains ~ by ~. The subfield ~p of the field ~'ll then consists of those residue classes of the form a, where a E 01' . Let the residue classes WI' ... , wm of ~~ (Wi E D~) be linearly independent over the field ~p' We now show that then the representatives WI' ... , W m of these classes are linearly independent over the field k. Assume that this is not so, that is, that for some coefficients a i E k, not all zero, we have
Multiplying this relation by a suitable power of p, we may assume that all ai lie in the ring 01' and that at least one of them is not divisible by p. Passing now to the residue class field ~~, we obtain
aiw i + ... + amwm= 0, in which not all coefficients ai E ~p are zero. This contradiction proves our assertion. From the linear independence of WI' ... , W n over the field k it follows that m ~ n = (K: k). Thus the residue class field ~~ is a finite extension of the field ~p for which
Definition. Let the prime divisor I.l3 of the ring D divide the prime divisor p of the ring o. The degree f = f~ = (~~: ~p) of the residue class field of the valuation v~ over the residue class field of the valuation vI' is called the degree of inertia of the prime divisor I.l3 relative to p (or relative to k).
As in Section 5.2 we denote by D p the integral closure of the ring 01' in the field K. In analogy with the definition of a fundamental basis of an algebraic number field, we make the following definition. Definition. A basis WI' ... , W n of the extension K/k is called a fundamental basis for the ring D p relative to 01' if all its elements lie in D p and every element IX E D p is represented by a linear combination
(5.13)
II
I I,
\
with coefficients
0i
in
01' .
We shall see below that in the case of a separable extension K/k, a fundamental basis for the ring D p (for any p) always exists. On the other hand, by Problems 11 and 12, for nonseparable extensions K/k it may occur that the ring D p has no fundamental basis relative to 01' .
I
r
/
200
[Chap. 3
THEORY OF DIVISIBILITY
The value of the concept of a fundamental basis is indicated in the following theorem. Theorem 5. Let 'l.' be a prime divisor of the ring D which divides p, and let n be a prime element of the ring D p which corresponds to it. If the ring D p has a fundamental basis relative to 01" then f~
=
d~
= vp(N K/k(n).
Proof. The prime element n
E D p is clearly also a prime element of the ring We shall show that each residue class ~ of the ring D~ modulo n contains a representative in D p ; that is, for any ~ E D~ there is an element IX E D p such that ~ == IX (mod n).
D~.
Let 'l.' = 'l.'1' 'l.'z, ... , 'l.'m be all prime divisors of the ring D which divide p. By Theorem 6 of Section 4, Y E D p if and only if v~,(Y) ~ 0 for all i ='1, ... , m. Hence the element IX must satisfy V~(~ V~.cIX) ~
-
0
IX)
~
1,
(i = 2, ... , m),
and the proof of its existence is given by Theorem 4 of Section 4. Now let W 1 , ... , W n be a fundamental basis of the ring D p relative to 01" By the above, every element of ~~ can be represented in the form a1w1 + ... + anwn, where ai E 01" and hence ai E ~p. This means that the residue classes W1' ... , wn generate ~~ as a vector space over ~p • Iff = (~~: ~p) = f~ , then we can choose from among themf elements which are linearly independent over ~p' Let these be W1' ... , wf . It is clear that the congruence a1w1
+ ... + afwf == 0 (mod
n),
with ai EOI" holds in the ring D p if and only if ai == 0 (mod p), p being a prime element of the ring 01' . Since each of the residue classes Wj E ~~ forj = f + 1, ... , n can be expressed in terms of W1' ... , wf' then f
w j ==
for some b js from
01"
2: bjsws(mod n) s=j
(j
= f + 1, ... , n)
Set for i = 1, ...
ej =
f
-
2: bjsws + Wj
s= 1
for j = f
,f,
+ 1, ... , n.
Sec. 5]
THEORIES OF DIVISORS FOR FINITE EXTENSIONS
201
It is clear that el , ••• , en also form a fundamental basis of .0" relative to 0" (since all W s can be expressed in terms of es with coefficients in 0,,). Each element ef + l , ... , en is divisible in the ring .0" by n, and therefore the congruence
holds if and only if al == ... == af
.=
0 (mod p).
Consider the set 9R of all elements of the ring .0" which are divisible by n. By what was just proved, the set 9R consists of all linear combinations of the elements (5.14) with coefficients in 0". On the other hand, it is clear that 9R also coincides with the set of all linear combinations of the elements (5.15) with coefficients in 0". Let C denote the transition matrix from the basis (5.14) to the basis (5.15). Since every element ne j can be expressed in terms of the basis (5.14) with coefficients from 0", then det C is an element of 0". By symmetry this also holds for det C- l . Hence det C is a unit in the ring 0,,; that is, v" (det C) = o. If we multiply the first f columns of the matrix C by p, then we clearly obtain a matrix A = (a i ) , for which
ne
n
i
=
L aije
j •
j=l
Therefore, N K/k(n)
= det A = pf det C,
so that
and Theorem 5 is proved. Theorem 6. If the extension Kjk is separable, then.o" always has a fundamental basis relative to 0" .
Before starting the proof of this theorem we note that it is analogous to the proof of Theorem 6 of Section 2, Chapter 2. Since every element of K, after multiplication by a suitable power of a prime element of the ring 0", becomes integral with respect to 0", the extension Kjk
I
~
: !
!
THEORY OF DIVISIBILITY
202
[Chap. 3
has a basis, Ct l , .,. , Ct n , all elements of which lie in .0" . Consider the dual basis *, ... , Ctn * (see the Supplement, Section 2.3; here we have already assumed that Klk is separable). If Ct E .0" and
Ct l
,j,
~I I II i
(5.16)
!' '
:!
Ii
I
,I
!
Ii
where Ci E k, then Ci = Sp (CtCtJ, and this means that C i EO" (since CtCt i ED,,). For each s = 1, ... , n we consider in the ring .0" those elements which when expressed in terms of the basis Ct l *, ... , Ctn * have the form
1
,i' I' 'I
I
I I
csCt.*
+ ... + CnCt n *
(5.17)
and we choose among these an element Ws
= CssCt.* + ... + CsnCt n*
such that v,,(cs ) ~ vicss) for all coefficients Cs of elements of the form (5.17) of .0". It is clear that c"~ #- 0 for all s, so that the elements WI' ... , W n of .0" are linearly independent over k. Let Ct be any element of .0". If we represent it in the form (5.16), then CI = clla l , where al EO", by choice of WI' For the difference Ct - a l WI we have the expansion
and here C2 ' = c 22 a 2 , where a 2 E 0" by choice of W2 • Continuing this process n times we finally arrive at the expansion (5.13), in which all coefficients a i belong to 0". The basis W j is hence a fundamental basis relative to 0", and Theorem 6 is proved. From Theorems 5 and 6 and formula (5.12) we easily obtain the following assertion. Theorem 7. If the extension Klk is separable and p is a fixed prime divisor of the ring 0, then the ramification indices e~ and degrees of inertiaf~ of the prime divisors 'l.' of the ring .0 which divide p are connected by the relation
L e~f~ = n = (K: k).
~I"
Hence for separable extensions Klk formula (5.7) can be written in the form (5.18) Remark. For nonseparable extensions, Theorem 7 is no longer necessarily valid. However, the inequality L~I" e~f~ ~ n always holds (see Problem 13). It can further be shown that, in general, f~ ~ d~ .
Sec. 5]
THEORIES OF DIVISORS FOR FINITE EXTENSIONS
203
5.4. Finiteness of the Number of Ramified Prime Divisors Definition. The prime divisor p of the ring 0 is called ramified in the ring .0 if it is divisible by the square of some prime divisor of the ring .0, and is called unramified otherwise.
Hence p is unramified if and only if all ei in (5.2) are equal to 1. Under the assumption that the extension K/k is separable, we shall obtain an important condition for p to be unramified. Assume that the ring.o p contains some primitive element efor the extension K/k, such that the discriminant D(f) of its minimum polynomial f(t) is a unit in 01" We now show that in this case the powers 1, e, ... , en-I, where n = (K: k) form a fundamental basis for the ring .01' over 01" Let w 1 , ... , W n be any fundamantal basis for .01" and let C be the matrix of transition from the basis Wi to the basis ej • Then D(f) = D(1, e, ... , en- 1) = (det C)2 D(W1' ... , wn) [see the Supplement, formula (2.12)]. Since D(f) is a unit in 01" and both terms on the right belong to 01" then det C is a unit in 01" and hence 1, e, ... , en - 1 is also a fundamental basis. Let p be a prime element of the ring 01' and ~p the residue class field of the valuation vI" For any polynomial g(t) with coefficients in 01' we denote by g(t) the polynomial obtained by replacing all coefficients of g(t) by their residue classes modulo p. Since the discriminant D(J) E ~p of the polynomial J(t) E ~p[t] is equal to the residue class modulo p of the discriminant D(f) E 01" then by our assumption the discriminant D(f) is nonzero. Hence, all factors in the decomposition (5.19) into irreducible factors in the ring ~p[t] are distinct (here ij5i is some polynomial of op[t]). If we denote the degree of ij5i by d;, then we clearly have d1
+ ... + dm = n = (K:k).
(5.20)
Theorem 8. If the discriminant of the minimum polynomial f(t) of a primitive element e E .01' is a unit in 01' , then the prime divisor p is unramified in .0 and the prime divisors 'l.'i of the decomposition p
= 'l.'1 .. , 'l.'m
can be put in one-to-one correspondence with the irreducible polynomials ij5; E ~p[t] of the decomposition (5.19) in such a way that the degree of inertia fi of the prime divisor 'l.'i coincides with the degree d i of the corresponding polynomial ij5;(t). i 1,
204
THEORY OF DIVISIBILITY
[Chap. 3
Proof. Let g(t) be any polynomial of op[t]. We shall show that if the polynomials 9 and qJi are relatively prime in the ring ~p[t], then the elements g(O) and CPi(O) are relatively prime in the ring D p • For then there exist polynomials u(t), v(t), and /(t) in the ring op[t] such that g(t)u(t)
+ CPi(t)V(t) = I + p/(t).
If g(O) and cp;(8) were divisible in the ring D p by some prime element n, then since nip (Theorem 7 of Section 4), from the preceding equation (for t = 0), it would follow that nil. This contradiction proves our assertion. Since the irreducible polynomials qJi are distinct, then the elements CP1 (0), ... , CPm(O) are pairwise relatively prime. Assume that cp;(O) is a unit in D p , that is, that cp;(O)~ = I, ~ E D p • Since 1,0, ... , on-1 form a fundamental basis for D p over op, then ~ = h(O), where h(t) E op[t]. The equation cp;(O)h(O) = I implies that cpi(t)h(t) = I + l(t)q(t), where q(t) E op[t] [since the leading coefficient ofl(t) equals I]. Passing to the residue class field ~p we obtain qJih = I + qJ1 ... qJmq, and we again have a contradiction. Hence none of the elements CP1(O), ... , CPm(O) are units in D p • For each i choose in D p a prime element n;lcp;(O). Since we have proved that the cp;(8) are pairwise relatively prime, the prime elements nl' ... , n m are pairwise-nonassociate. Let 'l.'1'"'' 'l.'m denote the corresponding prime divisors of the ring .0, and 11' ... ,1m denote the degrees of inertia of these divisors. In the residue class field ~~i of the valuation V~i' the residue classes I, {}, ... , {}d,-l are linearly independent over ~p (d i is the degree of qJ J For if there is a polynomial g(t) E op[t] of degree < d i for which g({}) = 0, then the element g(O) is divisible by ni in the ring D p and hence g(O) and CPi(O) are not relatively prime. But we saw at the beginning of the proof that then g(t) must be divisible by qJ;(t), and this can happen only if all coefficients of g(t) are zero. We have shown that (i
= I, ... , m).
Comparing this inequality with (5.20) and considering Theorem 7, we see that 'l.'1' ... , 'l.'m are the only prime divisors which divide p, that their ramification indices ei all equal I, and that d i =/;. This proves Theorem 8. Finally, we note that since cp;(e) is divisible by n i but not divisible by any other prime element n j ' then n i can be determined as the greatest common divisor in the ring D p of the elements cp;(O) and p. Corollary. If the extension Kjk is separable, then there are only finitely many prime divisors p of the ring 0 which are ramified in D. Let 0 be any primitive element of the extension Kjk which is contained in D. The discriminant D = D(l, 8, ... , en -1) is an element of 0*. If p,r D,
I'IIi.
Sec. 5]
THEORIES OF DIVISORS FOR FINITE EXTENSIONS
205
then by the theorem p is not ramified in D. Thus only those prime divisors of the ring 0 which divide D can be ramified in D.
PROBLEMS
1. Let 0 be a ring with a theory of divisors. k its quotient field. and k eKe K' a tower of finite extensions. Let C and C' denote the integral closure of the ring 0 in the fields K and K'. If 1.l3' is any prime divisor of the ring C'. denote the prime divisor of the ring C which is divisible by 1.l3' by W. and the prime divisor of the ring 0 whiCh is divisible by 1.l3' by \.1. Show that the degree of inertia of 1.l3' relative to k equals the product of the degree of inertia of 1.l3' relative to K and the degree of inertia of I.l3 relative to k. Formulate and prove an analogous assertion for the index of ramification. 2. Let the ring 0 with quotient field k have a theory of divisors with only a finite number of prime divisors, and let the prime divisor \.1 correspond to the prime element p of the ring o. Show that the residue class ring o/(p) is isomorphic to the residue class field ~" of the valuation "". 3. Let "" be a valuation of the field k. 0" its ring. K/k a finite separable extension.C" the integral closure of the ring 0 in the field K. and WI • .•.• W n a basis for the field Kover k. all elements of which lie in the ring D". Show that if the discriminant D(wl> .... w n ) is a unit in the ring 0". then W, • ... , W n is a fundamental basis for the ring C" over 0". 4. Show that the homomorphism N : §J ->- §Jo. satisfying the conditions of Theorem 4. is unique. 5. Show that the isomorphic embedding §J 0 -+ §J. satisfying the conditions ofTheorem 3, is unique. 6. Let a be a divisor of the ring o. Considering it as a divisor of the ring C (using the embedding §J ->- §Jo) show that (n = (K:k».
7. Let K/ k be a separable extension of degree n. Show that if the divisor a of the ring 0 becomes a principal divisor of the ring C. then an is a principal divisor of o. 8. Let K/k be separable. Show that the norm NK/.(a) of a divisor a of the ring C is the greatest common divisor of the principal divisors (NK/.(ex» •• where ex runs through all elements of C divisible by a. 9. The polynomial f(t) = t n + alt n -I + ... + an with coefficients in the ring 0 is called an Eisenstein polynomial relative to the prime divisor \.1. if al> .... an are all divisible by i • ' and an. while divisible by \.1. is not divisible by \!'. If the ring C contains a primitive element 8 for the extension K/k of degree n, with the minimum polynomial of 8 an Eisenstein polynomial relative to \.1. show that \! is divisible by only one prime divisor 'll of the ring C and that (the degree of inertia of I.l3 relative to \.1 hence equals 1). 10. Under the same hypotheses show that the basis 1.8..... 8n -I is a fundamental basis of the ring £" relative to 0". 11. Let k o be any field of characteristic p and k = ko(x. y) a field of rational functions in x and y over the field k o . Consider the valuation "0 of k, which was defined in Problem 9
I
I
THEORY OF DIVISIBILITY
206
[Chap. 3
of Section 4, where for the series g(l) E k o {I} [transcendental over ko(t)] we take a series of the form
By Problem 8 of Section 4 there is a unique extension of the valuation I' to the purely inseparable extension K = k({/y) of degree p over k. Show that the ramification index of I' relative to 1'0 equals I, and that the residue class field of the valuation I' coincides with the residue class field of the valuation 1'0 (under the inclusion isomorphism). It now follows from Theorem 5 and (5.12) that the ring C of the valuation 1', which is the integral closure in K of the ring of the valuation 1'0, does not have a fundamental basis relative to o. 12. Under the same assumptions as in the preceding problem, give a direct proof (without involving Theorem 5) that C has no fundamental basis over o. 13. Let be a ring with a theory of divisors, k its quotient field, K/ k a finite extension of degree n, C the integral closure of in K, \1 a prime divisor of the ring 0, '13" ... , '13m the prime divisors of the ring C which divide \1, e" ... ,em their ramification indices, and flo ... ,f.. their degrees of inertia relative to \1. For any s = I, ... ,m we denote by a'll' the residue class in the field L'll' which contains ct E C'll,' Choose elements W" E Cp (I ~ i ~ f,) so that the classes <.O,/'ll, form a basis for L'll,/L p and also so that I''ll}w,/) ;:" eJ for j # s, 1 ~ j ~ m. Prime elements of the ring Cp , corresponding to the divisors '13" ... , '13.. are denoted by 7T" ... , 7T... Show that the system of elements
°
°
°
(s
=
I, ... , m; i
=
I, ... ,f,; j
=
0, I, ... , e, - 1)
(*)
is linearly independent over k. Hinl: Consider linear combinations
with coefficients from op, at least one of which is a unit in op. Let I'p(c'%Jo) = 0, where jo is chosen so that I'p(c'o/J) > 0 for all j ~ jo and all i. Then I''l',o(ct) = jo.
14. Show that if the extension K/ k is separable, then the system (*) forms a fundamental basis for Cp over op. 15. Show that if the extension K/ k is separable, then for any ct E Cp we have
..
SPK/.(ct)P = ~ e,SPEm IE (a'll,). s= 1
.... s
P
16. Let f(l) be the characteristic polynomial of the element ct E Dp relative to K/k. Taking the corresponding residue classes in L p , we obtain a polynomial 1(t) E L p [I]. For s = I, ... , m let cp,(I) denote the characteristic polynomial of the element a'll, E L'll' relative to the extension L'll,/L p • Generalizing the preceding problem (for separable K/k), show that i(t)
=
cpl(l)el ... cp..(I)e...
17. Let K/k be separable. For each \1 choose in the ring over op. Set
°a fundamental basis ctlo ... ,ct.
Sec. 6]
DEDEKIND RINGS
207
Show that the integer d" ;;" 0 is almost always zero. The integral divisor bK1k = II\.ld"
" of the ring 0 is cal1ed the discriminant of the extension K/k (relative to the ring 0). 18. Show that the prime divisor \.l of the ring 0 does not occur in the discriminant bK1k (that is, d" = 0) if and only if \.l is unramified in 0 and the extensions ~\jJs/~" (s = 1, ... , m) are all separable. 19. Let the ring 0 have a fundamental basis w" ... , w. over o. Show that the discriminant bK1k coincides with the principal divisor (D(w" ... , w.».
6. Dedekind Rings 6.1. Congruences Modulo Divisors
We consider a ring .0 with quotient field K for which there exists a theory of divisors .0* -.~. Definition. We say that the elements modulo the divisor a E ~, and write IX
if the difference
IX -
IX
and f3 of the ring .0 are congruent
== f3 (mod a),
f3 is divisible by a.
In the case of a principal divisor (/1) the congruence IX == f3 (mod (/1)) is clearly equivalent to the congruence IX == f3 (mod /1) in the sense of the definition of Section 4.1 of the Supplement. We indicate some elementary properties of congruences which easily follow from the definition. (1) Congruences modulo a can be added and multiplied termwise. (2) If a congruence holds modulo a, then it also holds modulo b for any divisor b dividing a. (3) If a congruence holds modulo a and modulo b, then it also holds modulo their least common multiple. (4) If an element IX E .0 is relatively prime to a [that is, if the divisors (IX) and a are relatively prime], then from the congruence IXf3 == 0 (mod a) it follows that f3 == 0 (mod a). (5) If IX divides both sides of a congruence modulo a, and IX is relatively prime to a, then we may cancel IX from the congruence. (6) If p is a prime divisor and IXf3 == 0 (mod p) then either IX == 0 (mod p) or f3 == 0 (mod p).
I
! I,
I, 1 ,
,
208
II I
THEORY OF DIVISIBILITY
[Chap. 3
It follows from property (l) that the residue classes of the ring .0 modulo a given divisor 0 can be added and multiplied. It is easily verified that under these operations the set of residue classes becomes a ring. It is called the ring of residue classes modulo the divisor 0 and is denoted by .0/0. Property (6) can then be interpreted as saying that for a prime divisor l' the ring .011' has no divisors of zero. Assume now that .0 is the maximal order of an algebraic number field K. The divisors of the ring .0 we call in this case the divisors of the field K. Since every divisor 0 of the field K divides some nonzero number a E .0, and the number a in its turn divides some natural number a [for example, IN(a) I is divisible by a], then for each divisor 0 there is a natural number a which is divisible by o. By property (2) numbers in distinct residue classes modulo 0 remain in distinct classes modulo a. Recalling now that in the order .0 the number of residue classes modulo a is finite (actually equal to an, where n is the degree of the field K; see the proof of Theorem 5 of Section 2, Chapter 2), we obtain the following theorem.
Theorem 1. For any divisor class ring .0/0 is finite.
0
of the algebraic number field K, the residue
Let l' be any prime divisor of the field K. The corresponding valuation vp induces on R the p-adic valuation vp for some prime p. Since vp(p) = 1, then vp(p) > 0; that is, p == 0 (mod 1'). If the prime number q is different from p, then vp(q) = 0, and therefore vp(q) = 0; that is, q ;ji 0 (mod 1'). The residue class ring .011', being finite and without divisors of zero, is a finite field (Supplement, Section 3). Since for any a E .0 we have pa == 0 (mod 1'), then the characteristic of this field is p. Hence we have Theorem 2. Any prime divisor l' of an algebraic number field divides one and only one rational prime p. The residue class ring .011' is a finite field of characteristic p.
A theory of divisors for an algebraic number field hence has the property that the residue class ring modulo a prime divisor is a field. In general, this is not the case. For example, in the ring of polynomials k[x, y] in two variables over a field k the residue class ring of the prime divisor (x) is isomorphic to the ring of polynomials k[y] and hence is not a field. The residue class ring .011' is a field if and only if the congruence a~ == 1 (mod 1') is always solvable when a ¥= 0 (mod 1'). Hence only under this assumption can we expect to construct a completely adequate theory of congruences in the ring D.
Sec. 6]
DEDEKIND RINGS
209
6.2. Congruences in Dedekind Rings Definition. A ring .0 is called a Dedekind ring if it has a theory of divisors .0* --+ ~ and for every prime divisor l' E ~ the residue class ring .0/1' is a field. Examples of Dedekind rings, other than the maximal orders of algebraic number fields, can be obtained by taking the integral closure of the polynomial ring k[x] in a single variable in a finite extension of the field of rational functionsf(x) (Problems I and 2). The valuation ring D. of any valuation v is also a Dedekind ring (see Section 4.1), as is any ring which has a theory of divisors with only a finite number of prime divisors (Problem 3). Lemma 1 . If .0 is a Dedekind ring and Ct E .0 is not divisible by the prime divisor 1', then the congruence Ct~ == 1 (mod pm) is solvable in .0 for any natural number m. Proof. For m = 1 the congruence is solvable by the definition of a Dedekind ring. The lemma will be proved by induction on m. Suppose that for some ~o ED we have Ct~o == 1 (mod pm). Choose an element w in the ring .0 for which vp(w) = m. The principal divisor (w) has the form (w) = pm a, where a is not divisible by p. Choose an element Y E .0 for which vp(Y) = 0 and y == 0 (mod a). The product y(Ct~o - 1) will be divisible by pm a = (w), and hence y(Ct~o - 1) = Wll with Il E D. We now try to solve the congruence Ct~ == 1 (mod pm+l), taking as ~ an element in the form ~ = ~o + WA, where A is to be chosen suitably in D. Since y(Ct~ - 1)
=
y(Ct~o - 1)
+ YCtwA = w(1l + YIXA)
and W == 0 (mod pm), then we shall achieve our goal if A satisfies the congruence AIXY == - Il (mod 1'). But since Cty is not divisible by 1', this congruence is solvable. Hence there is an element ~ ED for which y(Ct~ -1) == 0 (mod pm + 1) and since vp(Y) = 0, dividing by y, we obtain Ct~ - 1 == 0 (mod pm+l). Lemma 1 is proved. l
Theorem 3. If PI' '" , Pm are distinct prime divisors of the Dedekind ring .0, and PI' ... ,Pm are any elements of .0, then there is an element ~ inD which satisfies
II I.
! ~ == (k 1 ,
... ,
Pm (mod Pm
I
km )
k m are any natural numbers).
Proof. For each divisor
(i
= 1, ... , m)
I
Ii
I Ii , ,
II
210
THEORY OF DIVISIBILITY
[Chap. 3
we can find an element Ct. i E .0 which is divisible by 0 1 but not divisible by PI' Lemma 1 guarantees that we can solve the congruence Ct.i~1 == f3i (mod p/ i ) in ~ i E D. It is easily seen that the element
satisfies the requirements of the theorem. Theorem 4. If Ct. # 0 and 13 are elements of a Dedekind ring .0, then the congruence Ct.~
== 13 (mod 0)
(6.1)
is solvable if and only if 13 is divisible by the greatest common divisor of the divisors (Ct.) and o. Proof. We first assume that the divisors (Ct.) and
0 are relatively prime, and will show that in this case the congruence (6.1) is solvable for any 13. Let I km o = p/ '" Pm = P/'Oi' where PI' '" ,Pm are distinct prime divisors. By Lemma 1 for each i = 1, ... , m in the ring .0 there is an element ~;' such that Ct.~;' == 13 (mod p/ ' ). By Theorem 3 we can find for each i an element ~I for which ~i == ~/ (mod p/') and ~i == 0 (mod oJ It is now clear that the sum ~I + ... + ~m = ~ will satisfy the congruences Cf.~ == 13 (mod 1'/') for i = 1, ... , m, and hence will also satisfy (6.1). We now prove the theorem in general. Let b = Pill ... Pm 1m be the greatest common divisor of the divisors (Ct.) and o. If (6.1) holds modulo 0, then it also holds modulo b, and since Ct. == 0 (mod b), then we must also have 13 = 0 (mod b). This proves the necessity of this condition. Assume now that 13 is divisible by b. By Theorem 3 of Section 4, there is an element J1. E K for which
(i=l, ... ,m).
(6.2)
We shall show that we can choose J1. so that also (6.3)
!
I
~I II
I:
I, I
Ii
for all prime divisors q, distinct from Pb ... , Pm' Suppose that J1. does not satisfy condition (6.3), and let ql' ... , qs be the prime divisors, different from 1'1> ... , Pm' for which Vqj(J1.) = -r j < O. Choose in .0 an element Y such that Vqj(Y) = r j (l ~j ~ s) and vP1(Y) = 0 (l ~ i ~ m). It is clear that the element J1.' = J1.Y satisfies both conditions (6.2) and (6.3), and our assertion is proved. Let the divisor b be determined by 0 = db. If J1. satisfies conditions (6.2) and (6.3), then the element Cf.J1. belongs to .0 and is relatively prime to b. Since we have assumed 13 divisible by d, then f3J1. also belongs to D. We have already
Sec. 6]
DEDEKIND RINGS
211
proved that then there exists an element ~ in the ring D such that CtI1~ == {311 (mod b). For i = 1, ... , m we have vp,(Ct~ - {3)
and this means that
~
=
Vpi(CtI1~
-
{311)
+ Ii
~ k i - Ii
+ Ii =
k i,
satisfies (6.1).
6.3. Divisors and Ideals In this section we show that there is a one-to-one correspondence between divisors and nonzero ideals in a Dedekind ring. For each divisor 0 denote by a the set of all elements of the ring D which are divisible by o. It is clear that a is a nonzero ideal of the ring D. Theorem 5. In a Dedekind ring D the mapping 0 -> ii (0 E .@) is an isomorphism of the semigroup of divisors .@ onto the semigroup of all nonzero ideals of the ring D.
We first verify the following lemma. Lemma 2. If Ct l , ••• , ct. s are any nonzero elements of the Dedekind ring D and b is the greatest common divisor of the principal divisors (ct. l ), '" , (ct. s ), then any element Ct E D which is divisible by b can be written in the form
The proof )fthe lemma is by induction on s. For s = 1 the lemma is obvious. Let s ~ L. Let bl denote the greatest common divisor of the divisors (Ct l ), ... , (Cts - l )· Then b is the greatest common divisor of the divisors b l and (ct..). Let Ct be divisible by b. By Theorem 4 the congruence Cts~ == Ct (mod b ) has a l solution ~ E D. By the induction hypothesis there are elements ~1> ••• , ~s-l in the ring D such that Ct - ~Cts = ~ICtI + '" + ~s-ICts-I' Lemma 2 is proved. Proof of Theorem 5. By condition (3) of the definition of a theory of
divisors the mapping 0 ...... a takes distinct divisors to distinct ideals. Let A be any nonzero ideal of the ring D. For each prime divisor l' set
a(1') = min vp(Ct). aEA
It is clear that a(1') will be nonzero for only a finite number of prime divisors 1'. Hence the prodJlct 0 = 1'0(1'), in which l' runs through all prime divisors for which a(1') # 0, is a divisor. We shall show that 0 = A. Let ct. be any element of o. It is clear that we can find a finite set of elements ct. l , ... , Ct s in A such that a(1') = min (vp(ct. l ), ... , vp(Cts)) for all 1'. This means that the divisor 0 is the greate:,' common divisor of the principal divisors (Ctd, ... , (ct. s)' By Lemma 2
TIp
I !
II
I 1 i
I
:
I
I
THEORY OF DIVISIBILITY
212
[Chap. 3
the element rx can be represented in the form rx = ~1 rxl + ... + ~srxs with coefficients ~i ED. It follows that rx E A, and hence that G c A. Since it is clear that A C G, we obtain A = G. We have thus proved that a -+ a is a one-to-one mapping of the set of all divisors of the ring .0 onto the set of all nonzero ideals of D. We shall now show that this mapping is an isomorphism, that is, that for any two divisors a and b we have
ab = abo
(6.4)
Denote the product ab by C. Since C is a nonzero ideal of .0, there is a divisor c such that c = C. We must prove that c = abo Let the prime divisor I' enter into the divisors a and b with exponents a and b. Then min vvCy) = )lEe
min vp(rx{3) = min vp(rx) rIEo,peb
aEa
+ min vp({3)
=
a
+ b.
pel,
Since this is true for all prime divisors 1', then c = ab and (6.4) is proved. From the fact that the mapping a -+ a is an isomorphism, it follows, in particular, that the set of all nonzero ideals of the Dedekind ring .0 form a semigroup with unique factorization under the operation of multiplication. To construct a theory of divisors in Dedekind rings (in particular, in the maximal order of an algebraic number field), we could take the semigroup of nonzero ideals for the semigroup ~. The image of the element rx under the homomorphism .0* -+ ~ would then be the principal ideal (rx) generated by this element. This construction of a theory of divisors is due to Dedekind. I
.~
6.4. Fractional Divisors If we construct a theory of divisors .0* -+ ~ for the ring .0, then we obtain some information on the structure of the semigroup .0*. It is natural to try an analogous procedure with the multiplicative group K* of the quotient field K. To do this we need to extend the concept of a divisor. Following an established tradition, we shall reserve the term" divisor" for this broader concept, and will call divisors in the earlier sense "integral divisors. " Definition. Let .0 be a ring with a theory of divisors, with quotient field K, and let 1'1' ... , Pm be a finite system of prime divisors. An expression
(6.5)
with integer exponents k 1 , ••• , k m (not necessarily positive) is called a divisor of the field K. If all the exponents k i are nonnegative, then the divisor is called integral (or a divisor of the ring D). Otherwise it is called fractional.
Sec. 6]
DEDEKIND RINGS
213
It is sometimes convenient to write a divisor (6.5) as a formal infinite product (6.6)
over all prime divisors p, in which almost all exponents a(p) are zero. Multiplication of divisors is determined by the formula (Q pQ(P»)(Q pb(P»)
=Q
pQ(p)+b(p).
For integral divisors this definition coincides with the definition of multiplication in the semigroup :?2. It is easily seen that under this operation the set of all divisors of the field K is an Abelian group, which we shall denote by fh. The unit element of this group is the divisor e for which all exponents a(p) in (6.6) are zero. Since every nonzero element ~ E K is the quotient of two elements of .0, it follows from condition (I) of Theorem 4 of Section 3 that for all but a finite number of the valuations vp ' which correspond to the prime divisors p, we have vi~) = O. We denote this finite set by vP1 ' ... , vp~' The divisor
is called the principal divisor corresponding to the element ~ E K*, and is denoted by (~). When applied to elements of the ring .0, the new concept of a principal divisor coincides with the previous one (Section 3.4). By condition (2) of Theorem 4 of Section 3 the principal divisor (~) will be integral if and only if ~ belongs to D. From the definition of a valuation (Section 3.4) it easily follows that the mapping ~ --+ (~), ~ E K*, is a homomorphism K* --+ fh of the multiplicative group of the field K to the group of divisors ;g. By Theorem 2 of Section 3 this homomorphism maps onto the entire group jJ) (that is, is an epimorphism) if and only if .0 has unique factorization. The kernel of this map clearly is the group of units of the ring .0, and this means that for elements ~, 11 of K* we have (~) = (11) if and only if ~ = 118, where 8 is a unit of the ring D. We now define a concept of divisibility for arbitrary divisors. Let 0 = nppQ(p) and b = n P pb(p) be two divisors (not necessarily integral). We say that 0 is divisible by b if there is an integral divisor e such that 0 = be. In other words, o is divisible by b if and only if a(p) ~ h(p) for all p. For any 0 and b set d(p) = min (a(p), b(p)). Since the rational integer d(p) is equal to zero for almost all p, then the product b = np pd(p) is a divisor. The divisor b is called the greatest common dh'isor of the divisors 0 and b (0 and b are both divisible by band b is divisible by every common divisor of
, I
I
I
214
THEORY OF DIVISIBILITY
[Chap. 3
o and b). The least common multiple of the divisors 0 and b is defined analogously. The element CI. E K is cal1ed divisible by the divisor 0 = TIp 1'a(p), if CI. = 0 or the principal divisor (CI.) is divisible by o. In terms of valuations this is characterized by vp(a) ~ o(p) for al1 p. The correspondence of the preceding section between integral divisors and ideals of a Dedekind ring can be extended to fractional divisors, providing the proper generalization of the concept of ideal is used. As in Section 6.3, we denote the set of al1 elements of the field K which are divisible by the divisor 0 by a (these elements may now be nonintegral). From condition (3) in the definition of a valuation (Section 3.4) it fol1ows that if CI. and P are divisible by 0, then CI. ± P are also divisible by o. This means that the set a is a group under addition. Further, if CI. E a and ~ E .0, then the product ~CI. also belongs to o. We now verify the fol1owing formula: (y)o = yo
For the element
(6.7)
(YEK*, oED).
~
is divisible by (y)o if and only if any of the fol1owing hold: for all 1'; vp(~/Y) ;?; a(p) for all 1'; ~/y E 0; ~ E yo [here a(1') denotes the power to which p appears in the divisor oj. It is clear that for any divisor we can find an element y E .0* such that the divisor (y)o is integral. FOP'1Ula (6.7) shows that for such a y we will have yo c D.
vpW ;?; vp(y) + a(1')
I I
I 1
!
I I r
: 1:: !I I
~I
I I
I
iI I
II
il
Definition. Let D be a Dedekind ring with quotient field K. A subset A c K, containing at least one nonzero element, is called an ideal of the field K (relative 1.0 D), if it satisfies: (l) A is a group under the operation of addition. (2) For any CI. E A and any ~ E .0, the product ~CI. lies in A. (3) There is a nonzero element y of the field K such that yA s D. The ideal A is called integral if it is contained in .0 and otherwise is called fractional. An integral ide'al in K is clearly just a nonzero ideal in D. If A and B are two ideals of the field K, then by their product AB we mean the set of al1 elements y E K which can be represented in the form y
=
Cl.1Pl
+ ... + Cl.mPm
(m;?; 1)
Cl. i E
A,
Pi E
B
(l ~ i ~ m).
It is clear that the product of two ideals of a field K is again an ideal of the field K. (When the ideals are integral, the definition of product coincides with the usual notion of the product of two ideals in a commutative ring.) We have already verified that for any divisor 0 of K, the set a is an ideal in K. Assume that for two divisors 0 and b we have a = b. Choose a nonzero element y so that the divisors (y)o and (y)b are both integral. From formula (6.7) we
Sec. 5]
DEDEKIND RINGS
215
have (y)a = (y)b, so that (y)a = (y)b and hence a = b. Hence the mapping a ..... a is a monomorphism. Now let A be any ideal of the field K. If the element l' i' 0 is chosen so that yA c .0, then yA will be a nonzero ideal of the ring .0, and hence by Theorem 5 there exists an integral divisor c such that c = yA. Set a = C(y)-I. Then yA = (y)a = ya, so that A = a. Thus each ideal of the field K is the image of some divisor under the mapping a ..... a. If a and bare two divisors, then, taking elements l' i' 0 and 1" i' 0 so that (y)a and (y')b are integral divisors, we have [from Theorem 5 and formula (6.7)]
dab = (y)a'(y')b = (y)a'(y')b = ya'y'b = yy'ab, so that ab = abo The mapping a ..... a hence is an isomorphism. It follows that the set of all ideals of the field K is a group under multiplication. The unit element in this group is the ring .0 = e. The inverse of the ideal a will be the ideal a-I. We formulate this generalization of Theorem 5. Theorem 6. Let .0 be a Dedekind ring with quotient field K. For every divisor a, denote by a the set of all elements of K which are divisible by a. The mapping a ..... ais an isomorphism of the group of all divisors of the field K onto the group of all ideals of the field K. This mapping takes integral divisors to integral ideals and conversely.
I,
r J
I PROBLEMS
1J
i
1
1. Show that the ring k [x 1of polynomials in one variable over a field k is Dedekind. 2. Let 0 be a Dedekind ring with quotient field k. Show that the integral closure C of the ring 0 in any finite extension of the field k is also Dedekind. 3. Show that any ring which has a theory of divisors with a finite number of prime divisors is Dedekind. 4. Show that a system of congruences
I
I Ii I ~
=
in a Dedekind ring is solvable if and only if IXI IXj (mod 01), i #- j, where hI) is the greatest common divisor of the divisors 01 and OJ. 5. Let C be a Dedekind ring and 0 a divisor of C. Show that the set of those residue classes in Cj 0 which consist of elements relatively prime to 0 is a group under the operation of multiplication. 6. Let f(x) be a polynomial of degree m with coefficients in the Dedekind ring D, with not all coefficients divisible by a prime divisor~. Show that the congruencef(x) == 0 (mod ~) has at most m solutions (noncongruent modulo ~), in C.
l' I
[Chap. 3
THEORY OF DIVISIBILITY
216
7. Let (; be a Dedekind ring, ~ a prime divisor of D, and f(x) a polynomial with coefficients in D. If for some element et E D we have feet)
show that for every
m;;;'
II
.1 !
I I
I'
f'(et)~
2 there exists an element f(O
I! .
= 0 (mod ~), == 0 (mod
~m),
g in
0 (mod
~),
the ring 0 such that
g:="" et (mod
~).
8. Show that in a Dedekind ring every ideal is either principal or is generated by two elements. 9. Let (; be a Dedekind ring with quotient field K. Show that under the isomorphism a ->- a of the group of divisors of the field K onto the group of ideals of the field K, the greatest common divisor of divisors corresponds to the sum of the corresponding ideals, and the least common multiple of divisors corresponds to the intersection of the corresponding ideals. (By the sum A + B of the ideals A and B we mean the set of all sums IX + {J, where IX E A and (J E B.) 10. The ring D = k[x, y I of polynomials in two variables over the field k has unique factorization and hence has a theory of divisors. Show that the ideal A = (x, y) of the ring l: which is generated by the elements x and y does not correspond to any divisor. 11. Show that if 0 is a ring with a theory of divisors D* ->- ~ in which every nonzero ideal of D is of the form a (where a E ~), then D is Dedekind. 12. Let D be a ring in which the nonzero ideals form a semigroup with unique factorization under multiplication. Show that 0 is Dedekind. 13. Let £: be a Dedekind ring with quotient field K. If A and B are ideals of the field K (relative to C), we say that A is divisible by B if there is an integral ideal C such that A = Be. Show that A is divisible by B if and only if A c B. 14. Let 0 be any ring with a theory of divisors and let ~ be a prime divisor. Show that the set ~ of all elements IX E l: which are divisible by \J is a minimal prime ideal of the ring O. (An ideal P in a ring 0 is called prime if the quotient ring OJ P has no divisors of zero, that is, if the product of any two elements of D, which do not lie in P, does not lie in P. The prime ideal P is called minimal if it does not contain any other prime ideal except the zero ideal.) 15. If C is a ring with a theory of divisors, show that any nonzero prime ideal P of D contains a prime ideal of the form j), where ~ is some prime divisor of the ring D.
l
7. Divisors in Algebraic Number Fields
'j, :
7.1. The Absolute Norm of a Divisor
1I
I
By Theorem 2 of Section 5 the maximal order D of any algebraic number field K is a ring with a theory of divisors. Further, we saw in Section 6.1 that the residue class ring Dip modulo a prime divisor p is a finite field, and hence that the ring D is Dedekind. Consider the algebraic number field K as an extension of the field of rational numbers R Coffinite degree). Since the divisors of the ring Z are in one-to-one correspondence with the natural numbers, we can assume that the group of
Sec. 7]
DIVISORS IN ALGEBRAIC NUMBER FIELDS
217
all divisors (integral and fractional) of the field R coincides with the multiplicative group of positive rational numbers. In Section 5.2 we defined the concept of the norm of a divisor of the ring D relative to a given extension Kfk. If a is a divisor of the order D of the algebraic number field K, then we call the norm N(a) = N K/R(a) the absolute norm of a. We extend the concept of absolute norm to fractional divisors by setting
N(~)
=
lJ
N(m) ,
N(lJ)
where m and lJ are integral divisors. The mapping a ..... N(a) will then be a homomorphism from the group of all divisors of the field K to the multiplicative group of positive rational numbers. The absolute norm of a principal divisor (~), ~ E K*, equals the absolute value of the norm of the number ~: N«~))
=
IN(~)I.
Indeed, if ~ is integral this is just (5.3). If ~ = rxf Pwith integral rx and
N(rx)
N(e)
(7.1)
P, then
IN(rx)1
= N(P» = IN(P)I = IN(~)I.
The degree of inertia f of a prime divisor l' of the field K relative to R is called the absolute degree of inertia of l' (or simply the degree of 1'). The ramification index e of the divisor l' relative to R is called the absolute ramification index of p. If l' divides the rational prime l' and if l' has degree f, then by (5.11), N(p) = p'.
(7.2)
Let PI' ... , Pm be all prime divisors of the field K which divide p, and let el' .,. , em be their ramification indices. Then in the field K we have the decomposition By Theorem 7 of Section 5 the ramification indices ei and degrees fi of the divisors Pi are connected by the relation (7.3) Theorem 1. The absolute norm of an integral divisor a of the algebraic number field K is equal to the number of residue classes in the maximal order D modulo a. Proof. We first prove the theorem for a prime divisor p. Let p be the rational prime which is divisible by p. The degree of inertiaf of the divisor l' (by the
I I Ii ! I
I;
I.
[Chap. 3
THEORY OF DIVISIBILITY
218
definition of Section 5.3) equals the degree of the residue class field ~1' of the valuation v 1' over the residue class field ~1' of the valuation v 1" Since L1' clearly consists of p elements, ~1' is a finite field with pI elements. Hence it suffices to show that the fields ~1' and DIp are isomorphic, that is, that the inclusion isomorphism DIp --+ ~1' maps the field DIp onto the entire field L1' • To do this it suffices to show that for any ~ E K for which v1'(O ~ 0, there exists an element IX E D such that v1'(~ - IX) ~ 1. We denote by ql' qs all those prime divisors of the field K for which Vqi(~) = -k i < O. By Theorem 3 of Section 6 there is an element "I in the order D such that 00"
"I == 1 (mod p), "I == 0 (mod q/i),
I;
O=l,oo.,s).
It is clear that IX = y~ ED and V1'(IX - ~) ~ 1. Hence Theorem 1 is proved in the case of prime divisors. To prove the theorem in general it suffices to prove that if it holds for integral divisors a and b, then it also holds for their product abo By condition (3) of Theorem 4 of Section 3 there is an element "I #- 0 in the maximal order D such that alY and the divisor ("I) a -I is relatively prime to b. Let lXI' ... , IX, [r = Na)] be a complete set of residue-class representatives of D modulo the divisor a, and PI' ... , Ps [s = N(b)] be a complete set for b. We shall show that then the rs numbers (7.4) form a complete system of residue-class representatives modulo abo Let any number of D. For some i (1 ~ i ~ r) IX
==
IXi
(mod a).
IXi
(mod ab).
IX
be
Consider the congruence y~
==
IX -
(7.5)
Since by choice of "I the greatest common divisor of the divisors ("I) and ab equals a, and IX - IXi is divisible by a, then by Theorem 4 of Section 6 this congruence has a solution ~ E D. If ~ == Pi (mod b) for some j (1 ~j ~ s), then y~ == yP i (mod ab). Along with (7.5) this shows that IX
==
IXi
+ yP i
(mod ab).
We have proved that every residue class modulo ab has a representative of the form (7.4). We now must show that the numbers (7.4) are pairwise-noncongruent modulo abo Let IXi
+ yP j == IXk + "1131
(mod ab).
I Sec. 7]
DIVISORS IN ALGEBRAIC NUMBER FIELDS
II
219
Since this congruence also holds modulo a, andy == 0 (mod a), we obtain == Ct. k (mod a), and this means that i = k, and we obtain
Ct.!
y(f3j - 13,)
== 0 (mod ab).
(7.6)
Let the prime divisor p occur in the divisors a and b with exponents a and b > O. Since vl'(y) = a, it follows from (7.6) that vl'(f3 j - 131) ?: b. Since this is true for all prime divisors p which occur in b with positive exponent, then f3 j == 131 (mod b), so thatj = I. Hence the numbers (7.4) for a complete set of residue representatives modulo abo The number of residue classes of the ring .0 modulo ab hence equals rs = N(a)N(b) = N(ab). Theorem I is proved. If a is any divisor of the field K (integral or fractional) as in Section 6.3, we denote by a the ideal of the field K, consisting of all Ct. E K which are divisible by a. Let the number "I be chosen so that ya c D. By the corollary of Theorem 2 of Section 2, Chapter 2, the set ya is a module of the field K (a submodule of the ring D). But then the ideal a also is a module of the field K. If Ct. E n, Ct. #- 0, and W l , ... , W n is a basis for the ring .0, then all the products Ct.W l , ... , Ct.W n lie in a, and hence a contains n = (K: R) linearly independent (over R) numbers of the field K. Hence, for any divisor a, the ideal a is a full module of the field K. Its coefficient ring will clearly be the maximal order D. Conversely, if A is a full module of the field K, whose coefficient ring is the maximal order .0, then A fulfills all the conditions of being an ideal (Section 6.4). Thus the set of all ideals a coincides with the set of all full modules of the field K which belong to the maximal order D. In Section 6.1 of Chapter 2 we introduced the concept of the norm of a full module of an algebraic number field. We can therefore speak of the norm of the ideal a. We shall show that the norm of any divisor coincides with the norm of its ideal: (7.7) N(a) = N(a).
i
I.
1
I I J
For integral divisors this follows from Theorem I of this section and Theorem I of Section 6, Chapter 2. If the divisor a is fractional, then we can find a "I E K* such that the divisor ("I -l)a = b is integral. Then by Theorem 2 of Section 6, Chapter 2, we have N(a) = N(b)IN(y)1 = N(b) IN(y) I = N(yb) = N(yb) = N(a),
and (7.7) is proved for all a. As a simple application of the concept of norm we give a more precise estimate for the number w(a) of nonassociate numbers in the maximal order
I I
1!
1
i
220
THEORY OF DIVISIBILITY
[Chap. 3
I
whose norm has absolute value equal to a (in the proof of Theorem 5 of Section 2, Chapter 2, we showed that w(a) ~ a"). Let ljJ(a) denote the number of integral divisors with norm a. Since the numbers a and p are associate if and only if the principal divisors (a) and (p) are equal, then from (7.1) we have w(a)
~
ljJ(a).
We will find an estimate for (a). Let a
=
p/t ... p/.,
where the Pi are distinct primes. If N(a) = a, then a = at ... as' where a i consists of those prime divisors V which divide Pi' By formula (7.2) and the multiplicativity of the norm, we have N(a i ) = p/', and this means that ljJ(a) = ljJ(p/ t ) ... (V/ s ). It therefore suffices to obtain an estimate for ljJ(pk). Let Vt, ... , Vm be the distinct prime divisors which divide p, and letft, .. · ,fm be their degrees. Since N(VtXt ... VmXm)
, I·
I'
the problem reduces to the determination of all solutions of the equation ftXt
°
I'
i II :1
'II,
I, ,f I'
= pJtXt+ '" +JmXm
+ ... + fmxm
= k
in nonnegative Xi' Since ~ Xi ~ k, then the number of solutions cannot exceed (k + 1)m. But m ~ n = (K: R), and thus ljJ(a) ~ «k t
+ 1) '"
(k s
+ 1))".
The expression in parentheses on the right equals, as is well known, the number r(a) of all divisors of a. We have hence obtained the estimate w(a)
~
ljJ(a)
~
(r(a))".
(7.8)
To compare our estimate (7.8) with the previous estimate w(a) ~ a", we note that for any e > 0, the quantity r(a)/a' converges to zero as a--+ 00. 7.2. Divisor Classes
f, .
I'
1
I
I
,I
I: il
Definition. Two divisors a and b of the algebraic number field K are called equivalent, and we write a - b, if they differ by a factor which is a principal divisor: a = b(a), CI. E K*. The set of all divisors of K which are equivalent to a given divisor a, is called a divisor class and denoted by [a].
In the terminology of group theory, the equivalence a - b denotes that the divisors a and b belong to the same coset of the subgroup of all principal divisors, and the divisor class [a] can be defined as the coset of the subgroup
Sec. 7]
DIVISORS IN ALGEBRAIC NUMBER FIELDS
221
of all principal divisors which contains the divisor a. We clearly have [a] = [b] if and only if a ~ b . For any two divisor classes [a] and [b] set [a]'[b]
= [ab].
It is easily verified that this definition is independent of the choice of a and b in these divisor classes, and that under this operation the set of all divisor classes becomes a group, the divisor class group of the field K. The unit element is the class [e], consisting of all principal divisors. The inverse of the class [a] is the class [a-I]. In the terminology of group theory the divisor-class group is the factor group of the group of all divisors by the subgroup of principal divisors. The divisor-class group, and particularly its order, is an important arithmetic invariant of the algebraic number field K. If the number of divisor classes equals I, then this means that every divisor is principal, which is equivalent to the maximal order of the field K having unique factorization (Theorem 2 of Section 3). The question of whether the algebraic integers of the number field K have unique factorization is hence a part of the problem of determining the number of divisor classes of this field. We shall now show that this number is always finite. Theorem 2. The divisor class group of an algebraic number field is a
finite group. Proof From the definition of equivalence of divisors it easily follows that
the divisors a and b are equivalent if and only if the corresponding ideals a and Ii are similar (in the sense of similarity of modules; Section 1.3 of Chapter 2). The partitioning of divisors into classes of equivalent divisors hence corresponds to the partitioning of ideals of the field K (that is, of full modules whose coefficient ring is the maximal order of the field K) into classes of similar ideals. By Theorem 3 of Section 6, Chapter 2, the number of classes of similar modules with given coefficient ring is finite. Thus the number of classes of similar ideals, and the number of classes of equivalent divisors, are also finite. Remark 1. Theorem 2 was obtained as a simple corollary of Theorem 3 of Section 6, Chapter 2. The proof of the latter theorem was based on geometric considerations, in particular, on Minkowski's lemma on convex bodies. Hence, the proof of Theorem 2 is also based on Minkowski's lemma. Remark 2. From the proof of Theorem 3 of Section 6, Chapter 2, we can deduce the following strengthening of Theorem 2. In each divisor class
222
[Chap. 3
THEORY OF DIVISIBILITY
of an algebraic number field K of degree n = s + 2t, ther.e exists an integral divisor with norm (2jrrY where D is the discriminant of the field K (that is, the discriminant of the ring of integers of the field K). Let [b] be any divisor class. Then there is an ideal A = ab -1, similar to the ideal r;-=t, for which A ::::J .0 and (A : D) ~ (2jrrY (see the proof of Theorem 3 of Section 6, Chapter 2). Since the ideal A contains .0, its inverse will be integral: A = a=t with integral a. From a -1 = aP it follows that a(a) = b, that is, that the integral divisor a is contained in the class [b], and here (Problem 2)
v'IDI,
v'1DT
N(a)
N(e)
-
_
= N(a_1)=(a-1:e)=(A:.o)~
(2)1 !InI ; ylDj.
Theorem 3. If the divisor-class group of the field K has order h, then the hth power of any divisor is principal. Proof. The assertion of the theorem is a simple corollary of elementary
group theory. The order of every element of a finite group divides the order of the group. Let a be any divisor. Since [a]h is the unit element of the divisorclass group, then [a h ] = [e], and this means that the divisor a h is ·principal.
1
I
I' I
I
I
,I,
I:
I: f !
'I' ,
j
Corollary. If the number h of divisor classes of the field K is not divisible by the prime number I, and if the divisor a' is principal, then a is also principal.
Since 1 and h are relatively prime, we can find rational integers u and v such that lu + hv = I. Since the divisors a' and a h are principal (the first by assumption, and the second by Theorem 3), it follows that a1u and ahv are also principal. But then so is the product a1u + hv = a. By Problem 20, every algebraic number field K can be embedded in a larger algebraic number field K, so that every divisor of the field K will be principal in K. We cannot, however, assert that every divisor of the field K is principal. Moreover, it has recently been shown (by Golod and Shafarevich) that there exist algebraic number fields, for example, K= R(v'-3·5·7·IC·-i3·17·19), which are not contained in any extension field with h = I. The following question is still open: Are there infinitely many algebraic number fields with h = I? Examination of tables show that such fields occur rather frequently (see the tables of h for real quadratic fields and totally real cubic fields). For certain classes of fields (for example, for quadratic and cyclotomic fields, see Chapter 5) formulas for the number of divisor classes have been found, but in the general case little is known about h and the divisor-class group. Among the few general theorems about the number h is the theorem of Siegel and Brauer, which asserts that for all fields with a fixed degree n,
Sec. 7]
DIViSORS IN ALGEBRAIC NUMBER FIELDS
223
the number h of divisor classes, the regulator R, and the discriminant Dare related by the following asymptotic formula:
In (hR) InJID/
for IDI .....
--.,...,= ..... 1
OCJ
(*)
[R. Brauer, On the zeta functions of algebraic number fields, Am. J. Math, 69, No.2, 243-250 (1947)]. Since for imaginary quadratic fields the regulator of the field is equal to 1, then it follows from (*) that as IDI --+ 00, so does h --+ 00. In particular, we may deduce that there are only a finite number of imaginary quadratic fields with h = 1. In tables we see nine imaginary quadratic fields with h = 1 (their discriminants are - 3, - 4, - 7, - 8, -11, -19, - 43, - 67, -163). It is known that there is at most one more imaginary quadratic field with h = 1. It is not known whether or not it exists. In the general case we can say almost nothing from (*) about the behavior of the number h, since we know very little about the value of the regulator R.
7.3. Applications to Fermat's Theorem The results of the preceding section allow us to prove the validity of Theorem 1 of Section 1 for a much wider class of exponents I. Theorem 4. Let 1 be an odd prime and let ( be a primitive Ith root of 1. If the number of divisor classes of the field R(O is not divisible by I, then the first case of Fermat's theorem holds for the exponent I. Proof. Assume that, contrary to the theorem, there exist rational integers
x, Y, and z, not divisible by I, and satisfying the equation Xl
We may further assume that ring of integers of the field
X,
+ yl = Zl.
y, and z are pairwise relatively prime. In the
Reo our equation can be written in the form 1-[
n (x +
(k y )
=
Zl.
kzO
Since X + Y == Xl + yl = Zl == z (mod I) and z is not divisible by I, then x + y is also not divisible by I. Then, as we proved in Lemma 5 of Section 1, for m t= n (mod I) there are numbers ';0 and rJo in the ring Z[(] such that
+ ("y)';o + (x + (mY)rJo = 1. divisors (x + (ky ) (k = 0, 1, ... ,I (x
Hence the principal 1) are pairwise relatively prime. Since their product is an Ith power [of the divisor (z)], then each of them separately must be an lth power. In particular, (x
+ (y)
= ai,
224
THEORY OF DIVISIBILITY
[Chap. 3
where a is an integral divisor of the field R(O. Since we have assumed that the number of divisor classes of the field R(O is not divisible by I, it follows from the corollary to Theorem 3 that the divisor a is principal; that is, a = (a), where a belongs to the maximal order .0 = Z[(] of the field R(O. From the equality
it now follows that
x where
G
+ (y = ea l ,
is a unit of the ring D. Analogously we obtain
x - (z = (a 1 ED,
~
•
;
. j I
i
I'
I·
. I
I
I
I.
r i
I
Glal1
Gl a unit in D). We have reached equations which, as was shown in Section 1.3, lead to a contradiction (in that part of the proof of Theorem 1 of Section 1 unique factorization was not used). Hence Theorem 4 is proved. Those odd primes I for which the number of divisor classes of the field R(O, (I = I, is not divisible by I, are called regular primes, and all others are caned irregular. By very beautiful number-theoretic and analytic arguments Kummer obtained a fairly simple criterion (which we shan present in Section 6.4 of Chapter 5), which allows one to check easily whether a given prime I is regular or not. Using this method it can be verified that among the prime numbers < 100 only three, 37, 59, and 67, are irregular, and all the rest are regular. To show how much broader the class ofexponents for which Theorem 4 holds is than the class for which Theorem 1 of Section 1 holds, we note that among the prime numbers < 100, only for the first seven, 3, 5, 7, 11, 13, 17, 19, do we have unique factorization in the ring .0 := Z[(] where (I = 1. In his first paper Kummer stated the hypothesis that the number of irregular primes was finite. He later retracted this and hypothesized that regular primes occur twice as frequently as irregular ones. With the aid of electronic computers, it has been shown that of the 550 odd primes ~ 4001, there are 334 regular ones and 216 irregular ones. A table of an irregular primes ~ 4001 is given at the end of the book. Jensen (see Section 7.2 of Chapter 5) showed that the number of irregular primes is infinite. It is not known whether there are infinitely many regular primes; there are no indications that there are only finitely many. The first case of Fermat's theorem for the exponent I is also connected with the number ho of divisor classes of the field R«( + C 1) = R[2 cos (2njl)]. It is easily seen that R«( + C 1 ) consists of all real numbers of the field R(O. Vandiver showed that if the number ho of divisor classes of the field R«( + (-1) is not divisible by I, then the first case of Fermat's theorem is valid for the exponent I [H. S. Vandiver, Fermat's last theorem and the second factor in
Sec. 7]
DIVISORS IN ALGEBRAIC NUMBER FIELDS
225
the cyclotomic class number, Bull. Am. Math. Soc. 40, No.2, 118-126 (1934)]. It is not known if there exist prime numbers / for which the number h o is divisible by /. It has only been verified that there are none among numbers ~ 4001. We note here some other results which bear on the first case of Fermat's theorem. Wieferich showed that the first case of Fermat's theorem is valid for all primes / such that 21- 1 1= 1 (mod /2) [A. Wieferich, Zum letzen Fermatschen Theorem, J. Math. 136,293-302 (1909)]. To indicate the strength of this result we note that only two prime numbers / ~ 200, 183, namely, 1093 and 3511, satisfy the condition 2 1- 1 == 1 (mod /2) [Erna H. Pearson, Math. Camp. 17, No. 82, 194-195 (1963)]. However, it is not known whether there are infinitely many such /. Several other authors have shown that the first case of Fermat's theorem holds for all/ such that ql-1 1= 1 (mod /2) for some prime number q ~ 43 (D. Mirimanoff, H. S. Vandiver, G. Frobenius, F. Pollaczek, T. Morishima, J. B. Rosser). This has made it possible to verify the first case of Fermat's theorem for all prime numbers <253, 747, 889 [D. H. Lehmer and Emma Lehmer, On the first case of Fermat's last theorem, Bull. Am. Math. Soc. 47, No.2, 139-142 (1941)]. 7.4. The Question of Effectiveness
Up to this time we have avoided the question of the practical construction of a theory of divisors for a given algebraic number field K. Since all divisors are determined once we know all prime divisors, and the prime divisors are determined by the valuations of the field K, our question reduces to the effective construction of all extensions to the field K of the valuation v1' of the field R for any fixed p. In addition to enumerating the prime divisors, it is important to have a finite algorithm for computing the number h of divisor classes of the field K. For only then will the results of the preceding section concerning Fermat's theorem have any real value. In this section we shall show how to construct all extensions of the valuation v1' and how to compute the number h, both in a finite number of steps. Let 01' be the ring of the valuation v1' of the field R (that is, the ring of pintegral rational numbers, see Section 3.2 of Chapter 1) and .01' its integral closure in the field K. Every number E .01' is the root of a polynomial t k + a1tk-1 + ... + ak with p-integral coefficients ak' If m is a common de= a will be a root of the polynomial nominator for all ai' then the number t k + mal t k -1 + ... + mkak with coefficients in Z; that is, it will lie in the ring of integers .0 of the field K (the maximal order). The converse assertion also holds: If a E .0 and if the rational integer m is not divisible by p, then aim E .01' . Thus the ring .01' consists of all numbers of the form aim, where a E .0 and the rational integer m is not divisible by p. Choose some fundamental basis
e me
I
I I
226
THEORY OF DIVISIBILITY
[Chap. 3
of the field K (that is, a basis for the ring .0 over 2). Then we have shown that the number ~ E K, which has the representation
W 1 , ... , W n
(a j
E
R),
will lie in the ring .01' if and only if all a j are p-integers. By Theorem 7 of Section 4 our first problem (that is, the construction of all extensions of the valuation v1' ) reduces to the determination of a complete system of pairwise-nonassociate prime elements rr 1 , .,. , rr m of the ring .01" Once the elements rr j have been" found, then for any ~ E .01' * we can easily obtain the factorization (7.9) where '7 is a unit in .01" To do this we divide successively by each of the rr j until the quotient would not lie in the ring .01'; at some stage we obtain a quotient '7 which cannot be divided by any of the rr j and hence is a unit in .01' . Since each element of K is the quotient of two elements of .01' (even of .0), the representation (7.9) can also be found for any ~ E K*. But this determines the valuations Vi> ••. , Vm of K which are extensions of v1" The ramification indices of these valuations are found in the factorization p = err 1 e, ••• rr me ", (e a unit in .01'). Let rr be any prime element of the ring.o 1' . Since rational integers which are not divisible by p are units in .01' , we may assume that rr E .0. For any a E .0 the number rr + p2 a = rr [I + (p2/ rr)a] is associate with rr, since the factor 1 + (p2/ rr)a lies in.o 1' and is not divisible by any ofthe prime elements rr 1 , ... ,rr m • Thus a complete set of paiiwise-nonassociate prime elements in .01' can be chosen from the system of numbers
where 0 ~ Xj < p2 (i = 1, ... , n). Since the set of all such numbers is finite, the set of prime elements can be found, and the valuations V 1 , ... , V m determined, in a finite number of steps. To find the degrees 11' ... ,fm of the prime divisors Vi> ... , Vm ,corresponding to the valuations V 1 , ... , V m , we can use Theorem 5 of Section 5. By this theorem for each prime element rr i E .0 of the ring.o 1' we have N(rr i )
= pf i a,
where the rational integer a i~ not divisible by p. Hence the degree /; of the prime divisor Vi isjust the exponent with whichp occurs in the rational integer N(rrJ, We now turn to our second question, the effective computation of h, the number of divisor classes.
I' :
Sec. 7]
DIVISORS IN ALGEBRAIC NUMBER FIELDS
227
In a remark after Theorem 2 it was noted that every divisor class contains a divisor a for which (7.10) (see also Problem 9). Let (7.11) be all integral divisors of the field K which satisfy (7.10). The number of such divisors is finite, since there are only finitely many divisors with given norm in K [for fixed a, from N(Pl k, ••• p,kr) = a we easily deduce bounds on the prime numbers p which are divisible by the Pi' and on the exponents kil. To determine the number of divisor classes we must find in the set (7.11) a maximal set of pairwise-nonequivalent divisors. To make this effective, we must have a practical method for determining whether Or not two given divisors are equivalent. Let a and b be two integral divisors. Choose in K a number p #- 0, which is divisible by b, and consider the divisor ab -1(P). The divisors a and b are equivalent if and only if the divisor ab -1(P) is principal. Hence we need to be able to determine whether or not a given integral divisor is principal. Denote the norm of such a divisor by a. In Section 5.4 of Chapter 2 we showed that we could find, in a finite number of steps, a finite set of numbers (7.12) with norm ±a, such that any rt E.o with norm ±a is associate with a number of (7.12). If the divisor a is principal; that is, if a = (rt) with rt E .0*, then IN(rt) I = a, and hence for some i (I ~ i ~ r) we shall have a = (rt;). Hence if we have already found the system (7.12), then to determine if the given divisor is principal, we need only compare it with each of the principal divisors (rt 1 ), ... , (rt,). Hence we have shown that the number h can be computed for the field K in a finite number of steps. The determination of the decomposition of the rational prime p into a product of prime divisors is often more easily done by considering the norms of k-nomials of numbers (k ?: 2). To describe this method we need some auxiliary results. Let 0 be an integral primitive element of the algebraic number field K of degree n. Then the index of the order.o' = {I, 0, ... , O·-l} in the maximal order.o is called the index of the number O. Lemma. If the prime divisor P does not divide the index k of the number 0, then any number rt E .0 is congruent modulo P to some number of the order .0' = {I, 0, ... , on- 1 }. 1
I
!
>
228
THEORY OF DIVISIBILITY
[Chap. 3
Since p,r k, then kx == 1 (mod 1') for some integer x. Set y = kxCt.. Since kCt. ED', then also y ED', and Ct. == Y (mod 1'). Corollary. If l' does not divide the discriminant D' = D(l, 0, ... , on-I), then any integer Ct. E .0 is congruent modulo l' to some number of the order .0' = {I, 0, ... , on-I}.
1I
,
From the formula D' = Dk 2 , where D is the discriminant of the field K [Lemma 1 of Section 6, Chapter 2, and (2.12) of the Supplement], it follows that if l' does not divide D', then also l' does not divide k. Assume now that the rational prime p does not divide the index of the integer E K. Let l' be a prime divisor with degreefwhich divides p, and let {} be the residue class of 8 modulo p. By the lemma the residue-class field .011' is generated by the residue class {} containing 0. If Xl' ..• , X J independently run through a full system of residues modulo p (in the ring Z), then among the numbers
°
y
=
Xl
+ X 20 + ... + x f Of - 1 + of
there will be one and only one which is divisible by p. Computing the norms N(y), we can easily determine which yare divisible by some prime divisor which divides p. If, for example, for f = 1, we find s numbers y whose norms are divisible precisely by the first power of p, then we have found s prime divisors of the first degree which divide p. Assume now that all prime divisors of first degree which occur in p have been found (so we have a set of numbers PI' ... , Pu with norms pa;, p,r a;). Now settingf = 2, we isolate those numbers y whose norm is divisible by p2. Dividing by the numbers Pi already found, we can eliminate those y which arose from prime divisors of degree 1, and if after this N(y) = p2(ble) (be,p) = 1, then y is divisible by a prime divisor of degree 2. If by this method we can find all prime divisors of degree 2 which divide p, then we takef = 3, and so on. Of course, for large n the computations will generally be large; for n = 3 and n = 4 we can often achieve our goal quite quickly. Some refinements of this method are given in Problems 25 to 27.
j:
I
Example 1. We shall factor the numbers 2, 3, 5, and 7 into prime divisors in the field of fifth degree R(O), where OS = 2. The discriminant D(l, 0,0 2 ,8 3 , ( 4 ) equals 24 5 5 , and hence only the primes 2 and 5 can divide the index of 0. By Problem 15, the number 2 does not occur in the index. Since 0 2 = 2, then 1'2 = (0) is a prime divisor of the first degree, and we have
2
= 1'/.
From
N(O)
= 2,
N(O
+ 1) =
3,
N(O - 1)
= 1
(7.13)
Sec. 7]
DIVISORS IN ALGEBRAIC NUMBER FIELDS
229
it follows that only one prime divisor of first degree divides the number 3, namely, 1'3 = (0 + 1), and p/,j'3 by Theorem 8 of Section 5. Further, N(O
+ 2) = 2'17,
N(O - 2)
= -2·3'5.
(7.14)
The second of these equations shows that the number 5 has a prime divisor 1'5 of degree I, and since 0 - 2 = (0 + 1) - 3 is divisible by 1'3' we have (0 - 2) = 1'21'31'5' The number 0 - 2 satisfies the equation (0 - 2)5
+ 10(0 -
2)4
+ 40(0 -
2)3
+ 80(0 -
2)2
+ 80(0 -
2)
+ 30 = 0.
By Problem 9 of Section 5 we have
5 = 1' 5 5. The result of Problem 15 also shows that 5 does not divide the index of the number 0, and this means that the ring of all integers of the field R(O) coincides with the order {l, 0, 02, 03 , 04}. Combining (7.13) and (7.14) with N(O + 3) = 5'7 2, N(O - 3) = -241, we see that there are two possibilities. Either the number 0 + 3 is divisible by the square of a prime divisor (which divides 7) of first degree, or it is divisible by a prime divisor of second degree (which divides 7). But for the number 0-4 = (0 + 3) - 7 we have N(O - 4) = -2'7'73, and hence the first possibility holds. This means that 7 has one (and only one) prime divisor of first degree 1'7' where 1'/17. To determine whether 3 and 7 have prime divisors of second degree, we consider trinomials of the form 02 + Ox + y. We have N(02
+ xO + y) = 2x 5 +.y5 -
IOx 3 y
+ IO xy 2 + 4.
(7.15)
Substituting for x and y the values 0, I, -1, we obtain nine numbers, none of which is divisible by 9. This means that no prime divisor of degree 2 divides 3. By formula (7.3) there is now only one possibility for the decomposition of the num her 3: 3 = 1'31'3', where 1'3' is a prime divisor of fourth degree. Now if x and y take the values 0, ± I, ± 2, ± 3 in (7.15), then ofthe49 numbers which arise only oneis divisible by 7 2 : N(02 + 20 - 3) = 5'7 2 • 2
But 0 + 20 - 3 = (0 + 3)(0 - I), and therefore we have only the square of the divisor 1'7 , so that also for 7 we have the factorization 7 = 1'71'/,
I,
i ,
where 1'/ is a prime divisor of fourth degree.
I I I
I
230
THEORY OF DIVISIBILITY
[Chap. 3
Example 2. Consider the cubic field R(O), 03 - 90 - 6 = O. Since D(I, (), ()2) = 3 5 '2 3 , by Problem 15 only 2 can divide the index of 0 (it can
be shown that the order {I, (), 02} is maximal, but we shall not need this). By Problem 9 of Section 5 we have the decomposition 3
= 1'/.
From
N«()) = 6,
N(O
+ 1) =
-4,
N«() - 1) = 14,
(7.16)
we conclude that the number 2 has at least two prime divisors of first degree, 1'2 and 1'2': (7.17) (0 - 1) = 1'2'1'7 (that there are only two would follow from the maximality of the order {I, 0, 2 }, for then 2 would not divide the index of the number 0). But from the equation «() - 1)3 + 3«() - 1)2 - 6(e - 1) - 14 = 0
e
'II
i.
!
L
I :1, ,
we see that 2 is divisible by 1'/2, and hence (7.18) The norms (7.16) and also
N(O
+ 2) =
-4,
N(O - 2) = 16
(7.19)
are not divisible by 5. This means that 5 has no prime divisor of first degree. Since the field is cubic, it follows that the principal divisor 5 is prime. To decompose the number 7, we must also consider the norms
N«()
+ 3) = 6,
N(O - 3) = 6.
Since there is only one norm divisible by 7, then 7 has only one prime divisor of first degree. Since 1'/ ,(7, we must have 7 = 1'71'/, where 1'/ is a prime divisor of second degree. In the process of decomposing rational primes into the products of prime divisors by our method of examining the values of the norms of integers, we have also obtained a series of equivalences among divisors. These equivalences allow us to reduce the number of divisors of the system (7.11) from which we must choose a maximal set of pairwise-nonequivalent divisors to determine the number h. Thus, in Example 2, by Problem 9 the system (7.11) consists of integral divisors with norm ~ (3 !/3 3 ) ..}3 5 • i 3 < 10, that is, of the divisors 3 1,1'2,1'/,1'3,1'/,1'2,2,1'21'/,1'21'3, P2'p3, 1'7,1'2 ,1'/1'2',2,1'2'3,1'/. (7.20) !'
I
Sec. 7]
DIVISORS IN ALGEBRAIC NUMBER FIELDS
231
It follows from (7.18) that P2,2 - 1 and P2 - 1 (l being the unit divisor), and then from (7.17) and (8 + 3) = P2'P3 that P3 - I, P2' - I, P7 - l. Hence all divisors of the system (7.20) are principal and we have h = 1 for the field R(8), 82 - 98 - 6 = O. Sometimes (for small discriminants) the system of divisors (7.11) consists only of the unit divisor. In these cases we obtain h = 1 without further computations. For example, for the field R(8), 8 3 - 8 - 1 = I, the discriminant of the basis, I, 8, 8 3 equals - 23, so by Problem 8 of Section 2, Chapter 2, this basis is fundamental and - 23 is the discriminant of the field. By Problem 9 there is an integral divisor in each divisor class of the field R( 8) with norm
J-23 < 2,
43!
~ ~ 33
and this means that in the field R( 8) every divisor is principal. For quadratic fields the number of divisor classes can also be computed by the theory of reduction, considered in Problems 12 to 15 and 24 of Section 7 of Chapter 2. PROBLEMS
1. Show that in an algebraic number field of degreen the number !f1(a) of integral divisors with given norm a does not exceed the number -r.(a) of all solutions to the equation XIX2 •. , x. = a (x" ... , x. independently taking all natural values). 2. Let a and b be two divisors of an algebraic number field (integral or fractionaI), with a and b the corresponding ideals. Show that if a is divisible by b, then
(6: a)
=
(Nab-I).
3. Show that in any two distinct divisor classes there exist relatively prime integral divisors. 4. If a is an integral divisor of an algebraic number field, let 9l( a) denote the number of residue classes modulo a which consist of numbers relatively prime to a (this generalizes Euler's function). Show that if the integral divisors a and b are relatively prime, then 5. Prove the formula 9l(a)
=
N(a)
n (I - ~»), N(IJ p
in which IJ runs through all prime divisors which divide the integral divisor a. 6. Show that for any integer IX which is relatively prime to the integral divisor a, we have IX",(a)
== I
(mod a)
(this generalizes Euler's theorem). Further, show that for any integer of an algebraic number field, IXN(P) == IX (mod IJ) (this generalizes the small Fermat theorem).
IX
and prime divisor IJ
,
I
,i I
232
THEORY OF DIVISIBILITY
[Chap. 3
7. Prove the formula
:E 9l( c) =
N( a),
<
where the sum is taken over all divisors c which divide the integral divisor a (including e and a). 8. Let g" .. , ,g, (s =N(\J) - 1) be a system of residues for the prime divisor \J, not divisible by \J. Show that then f, .. , g, == -1 (mod \J) (Wilson's theorem). 9. Let K be an algebraic number field of degree n = s + 21 and discriminant D. Use Problem 2 of Section 6, Chapter 2, to show that in every divisor class of K there is an integral divisor a with
(4)' SviDI.
N(a),;;;:;;:
I
Sec. 7]
19. Let a divisor. Sho.... 20. Show every divisor 21. Let KI divisors in K and hence pia 22. Show tion 2, Chapt, 23. Let K of the form IX (z a rational ir 3x 3 + 4y 3 + ~ Hinl: Assu where g is an i
10. Show that for the quadratic fields with discriminants 5, 8,12,13, -3, -4, -7, -8, -11 the number of divisor classes is 1. Here"
11. Show that the number of divisor classes of the field R( V - 19) equals 1.
12. Show that the ring of integers of the field R(~), where ~ is a primitive fifth root of 1, has unique factorization. 13. Show that the number of divisor classes in the field R( V - 23) is equal to 3. 14. Let K" K and K 3 be the three cubic fields described in Problem 21 of Section 2, Chapter 2. Show"that the number 5 remains a prime divisor in the fields K, and K 2 , and in the field K 3 it factors as a product of three distinct prime divisors offirst degree: 5 = \J\J'\JH. Further, show that the number 11 factors as a product of three distinct prime divisors in the field K 1 , II = q q' q", and that 11 remains prime in K 2 • (It foIlows that the fields K" K2 , and K 3 are distinct.) 15. Let the primitive element E K be the root of an Eisenstein polynomial relative to the prime number p. Use Problem 9 of Section 5 to show that p does not divide the index of the number e. 16. Let the prime number p be less than the degree n of the algebraic number field K. If there is an integral primitive element in K whose index is not divisible by p, show that p cannot factor in K as the product of n distinct prime divisors of first degree. 17. Use Problems 18 and 19 of Section 5 to show that a rational prime number is ramified in the algebraic number field K (that is, is divisible by the square of a prime divisor) if and only if it divides the discriminant of the field K. 18. Letf(x" .,. , x n ) be a quadratic form whose coefficients are integers in the algebraic number field K, and let 8 be its determinant. Let \J be a prime divisor which does not divide 2 or 8. If IX is an integer of K not divisible by \J, set (IX/\J) = + 1 if the congruence ==IX (mod \J) is solvable, and (IX/\J) = -1 otherwise. If N is the number of solutions of the congruence f(x" ... , x n) == 0 (mod \J) show that N = N(\J)n - I , if n is odd,
e
~:
!, j i
I
e
N = N(\J)n -,
(1
)n/2
+ ( - \J
8)
N(\J)(n -2)/2 (N(\J) - 1)
if n is even.
=
2-
[N(v) = 5'5 3 ]
Section 5, Ch; 17 and 22, anI setting g = u -
where cI>, 'Y, a of the six valu (and in 3-adic; 24. Let a a
d= ab 2 > 1. S follows:
Hinl: In the
e
25. Let be polynomial 9l(1 that, modulo p
where 9l" ... , ' Show that the I
:Chap.3
luding e
Use integral
t D.
I
-7,
Sec. 7]
DIVISORS IN ALGEBRAIC NUMBER FIELDS
19. Let a be a divisor of the algebraic number field K, such that am = (a) is a principal divisor. Show that the divisor a becomes principal in the field K(m~Fi). 20. Show that for any algebraic number field K there is a finite extension K/ K such that every divisor a of the field K becomes principal in the field K. 21. Let Kbe a cubic field andp a prime which factors as a product of three distinct prime divisors in K: p = \)\)'\)". If IX is an integer in K with Sp(IX) = 0 and \)\)'IIX, show that \)HI IX and hence pllX. 22. Show that the field R(8), 8 3 = 6, has only one divisor class. [By Problem 24 of Section 2, Chapter 2, the numbers 1,8,8 2 form a fundamental basis for the field R(B).] 23. Let K be the cubic field K = R(B), 8 3 = 6. Show that there is no number IX #- 0 of K of the form IX = X + y8, x and y relatively prime rational integers, for which N(IX) = 10z 3 (z a rational integer). Deduce that the equation x 3 + 6y 3 = IOz 3 (and hence also the equation 3x 3 + 4y 3 + 5z 3 = 0) has no nontrivial solution in rational integers. Hint: Assume that the number IX exists and show that it must have the form IX = IXog3, where g is an integer of the field K and IXo is one of the following six numbers: AfL, AfLE, AfLE 2, AV, AVE, AVE 2.
-s,
Dot of 1,
Here A=2-8 [N(A) = 2]; fL=8-1 [N(fL) = 5]; v=(8 2 +8+1)2=13+S8+38 2 [N(v) = 5'5 3 ]; and E = 1 - 68 + 38 2 is a fundamental unit of the field K (Problem 4 of Section 5, Chapter 2). For the proof use Problem 21, applied to the number IX8, Problems 17 and 22, and also prime factorizations in the field K of the numbers 2, 3, and 5. Further, setting g = u + v8 + w8 2 , write
:ction 2, K 2 , and
= \)\)'\)H.
visors in K" K 2 ,
lative to l1e index
IX = IXoe =
if Hint: In the case d
.lgebraic >t divide ==IX s of the
e
+ 'Y8 +
n8 2 ,
where <1>, 'Y, and n are integral cubic forms in the variables u, v, and w. Show that for any of the six values of IXo the equation n(u, v, w) = 0 has only the trivial solution in rational (and in 3-adic) numbers.] 24. Let a and b be natural numbers which are square-free and relatively prime, and let d = ab 2 > I. Show that in the field R(f/d) the number 3 factors into prime divisors as follows: if do;t ±I (mod 9),
field K. w thatp
ramified r) if and
233
d== ±I (mod 9).
== ± 1 (mod 9) consider the norms w
1
=
"3 (I
+a
~
~ ab 2
+T
N(w - 1), N(w), N(w
+ I),where
~ a 2 b),
a=±l, T=±l, aa==Tb==1(mod3).
25. Let 8 be an integral primitive element of the algebraic number field K, with minimum polynomial ep(t), and let p be a rational prime which does not divide the index of 8. Suppose that, modulo p, we have the factorization
where ep" ... , epm are distinct irreducible polynomials modulo p with degrees Show that the decomposition of the number p in the field K takes the form
I" ... .fm.
234
THEORY OF DIVISIBILITY
[Chap. 3
where the distinct prime divisors \)" ... , \)m have degrees II> ... ,1m and where lptCO) "'" 0 (mod \)1) for i = I, ... , m. Hint: Use the fact that every integer of K is congruent modulo \)1 to a linear combination (with rational integer coefficients) of the powers 0' (s ;;;. 0). 26. Let 0 be an integral primitive element of the field K, and p a prime number which does not divide the index of O. For any rational integer x, show that the number 0 + x is not divisible by any prime divisor which divides p and is of degree greater than I. Further, show that 0 + x is not divisible by the product of any two distinct prime divisors which divide p. 27. Under the same assumptions, let \)" ... , \), be distinct prime divisors which divide p, with degrees I" ... ,Is, and let r < I, + .., +Is. If xo, ... ,xr - , are any rational integers, show that the number or + xr_IOr -' + ... + Xo is not divisible by the product \)1 ... \),.
8. Quadratic Fields
In this section we consider the somewhat simpler theory of divisors for the case of quadratic fields. We start with a description of the prime divisors. , I
I'
8.1. Prime Divisors
Ii I;
Since every prime divisor divides one and only one rational prime, to give a complete description of the set of all prime divisors it suffices to show how each rational prime p factors as a product of prime divisors. For quadratic fields it follows from (7.3) that there are only three possibilities for the numbers m,li' ei: (1) m = 2, it =12 = 1, et = e 2 = 1; (2) m = 1, 1= 2, e = 1; (3) m = 1, 1= 1, e = 2. Corresponding to these we have the following types of decomposition: (1) p = 1'1", N(p) = N(p') = p, l' #- 1"; (2) p = 1', N(p) =p2; (3) p = lJ2, N(p) =p. Our problem is to discover what determines the type of factorization for any prime p. The answer will be easily derived from Theorem 8 of Section 5. In Section 7.1 of Chapter 2 we showed that every quadratic field has a unique representation in the form R( .jd), where d is a square-free rational integer. First, let p be an odd prime. If p does not divide d, then it also does not divide the discriminant of the polynomial x 2 - d, a root of which generates the field. We then deduce from Theorem 8 of Section 5 that p has a decomposition of either the first or the second type, depending on whether the polynomial x 2 - d is reducible modulo p or not. This in turn depends on whether d is a quadratic residue modulo p or not.
Ii
I
j
I
Sec.B]
If pld, then d It follows from
QUADRATIC FIELDS
235
= pd1 , where d 1 is not divisible by p, since d is square-free.
that all prime divisors which divide p occur with an even exponent in its factorization, which is possible only for the third type of decomposition. Thus for odd p we have the first, second, or third type of decomposition in the following three cases, respectively: (1) p,r d, (dip) = 1; (2) p,r d, (dip) = - 1; (3) pld. Note that since the discriminant D of the field R( .Jd) is either d or 4d (Theorem 1 of Section 7, Chapter 2), then in each of these conditions we could replace d by D. The case p = 2 remains. Assume first that 2,r D. By Theorem 1 of Section 7, Chapter 2, this means that D = d == 1 (mod 4). It is clear that R( .Jd) = R(co), where co = (- 1 + .J D)/2. The minimum polynomial of co is
1-D x 2 +x+ _ _ 4
(8.1)
Since the discriminant of the basis 1, co is odd, we obtain from Theorem 8 of Section 5 that the number 2 will have either the first or the second type of decomposition, depending on whether the polynomial (8.1) is reducible or not. But the polynomial x 2 + x + a is reducible modulo 2 if and only if 21a. Thus for 2,r D we obtain the first and the second types of decomposi tion in the respective cases D == 1 (mod 8) and D == 5 (mod 8). We now show that if 21 D, then 2 always has the third type of decomposition. If 21d, then d = 2d', 2,r d', and from
just as in the case of odd p, we see that 2 has the third type of decomposition. If 2,r d, then d == 3 (mod 4) (Theorem 1 of Section 7 of Chapter 2) and we have (1 with the integer et = (1
+ d)/2 +
+ .JJ)2 = 2et
.JJ relatively prime to 2, since its norm
is not divisible by 2. Thus we again obtain the third type of decomposition for 2. We have obtained the following theorem.
THEORY OF DIVISIBILITY
236
[Chap. 3
Theorem 1. In a quadratic field with discriminant D the prime number p has the decomposition
p = p2,
N(p) = p,
if and only if p divides D. If p is odd and does not divide D, then p
= pp', p #- p',
N(p)
=
p=p,
N(p')
=
P
N(p)=p2
for (;) = 1; for
(~)
= - 1.
If 2 does not divide D [and hence D == I (mod 4)], then p #- p',
2 = pp',
N(p) = N(p') = 2 2
= p,
N(p)
=4
for D = I (mod 8); for D = 5 (mod 8).
;I t
'Ii
I'
8.2. Rules of Decomposition
Theorem 1teIls us that the type of decomposition of the odd prime p is determined by the residue of D (or d) modulo p, and even by the Legendre symbol (Dip) = (dip) as a function of p. It is natural to ask if the theorem can be reformulated so that the decomposition depends on the residue of p with respect to some modulus (the modulus depending only on the field). To achieve such a formulation, we use the reciprocity law for the Jacobi symbol. It is weIl known that the Jacobi symbol (clb) is defined for odd c and positive odd b, relatively prime to c. The reciprocity law for this symbol states that
(~)
= ( - I)[(b - 1)/2]' [(c -
1)/2]C:I)
(the proof for c < 0 is easily reduced to the positive case). Let p be any odd prime. If d = D == I (mod 4), then
(~) = (~)
= (-
1)[(p - 1)/2]' [Cd -
1)/2](1~1)
=
C~I)'
(8.2)
since (d - 1)/2 is even. If d == 3 (mod 4), then
(f) = (~) = ( - I)[(P-l)/21'[(d-l)/21C~I) = ( -
I)(P-l)/2C~I)'
(8.3)
since (d - 1)/2 is odd. FinaIly, for d = 2d', 2,td', we have
(f)
=
(~)
=
e)(~)
= (_1)[(P -1)/Sl+[(P-l)/2 1[(d'-I)/2 1C:'I)' 2
(8.4)
Sec.B]
QUADRATIC FIELDS
237
The value of the Jacobi symbol (plldl) [or (p/ld'l)] depends only on the residue of p modulo jdl (or Id'l). If d == I (mod 4), so that d = D, then (Dip) depends only on the residue of p modulo Idl = IDI. If d == 3 (mod 4), so that D = 4d, then (Dip) depends not only on the residue of p modulo Idl, but also on the number (_1)(P-l)/2, that is, on the residue of p modulo 4; hence (Dip) depends onthe residue ofp modulo 41dl = IDI.Finally,if d = 2d',D = 4d=8d', then (Dip) depends on the residue of p modulo Id'l, (_1)(P-l)/2 depends on the residue ofp modulo 4, and ( - I )(p2 - 1 )/8 depends on the residue ofp modulo 8. Hence in this case the value of (Dip) depends on the residue of p modulo 81d'l = IDI· Thus in all cases the type of decomposition of the prime number p depends only on its residue modulo IDI, so that all prime numbers having the same residue have the same decomposition. This conclusion, which is completely nonobvious a priori, is the most important property of the decomposition rules for prime numbers in quadratic fields. To make this new form of the decomposition rule more clear, we introduce a new function. For all x relatively prime to the discriminant D, we set
X(x)
=
( _
(1:1)
for d == 1 (mod 4),
1)(X-l)/2(~)
for d == 3 (mod 4),
Idl
( -
2
1)[(x -1)/8]+[(X-l)/2]' [(d'-l) I2J
C;,,)
(8.5)
for d = 2d'
[in case d == 2,3 (mod 4) the expressions (_1)(x-l)/2 and (_1)(x 2 -1)/8 make sense since the discriminant D = 4d is even and so x is odd]. In the arguments above which showed that for odd p the value of (Dip) depends only on the residue of p modulo IDI, we never used the fact that p was prime. Hence by the same arguments it follows that X(x) depends only on the residue of x modulo 1DI. Further, it is easily verified that if (x, D) = I and (x', D) = I, then X(xx') = X(x)X(x'). This means that the function X can be considered as a homomorphism of the multiplicative group of residue classes modulo 1DI, relatively prime to IDI, to the group of order 2 consisting of + I and - I. If we extend such a function by giving it to the value 0 on all numbers not relatively prime to D, it is called a numerical character. Definition. The numerical character X with modulus IDI, where the value of X(x) for x relatively prime to D is given by (8.5), is called the character of the field R( .Jd).
Returning to (8.2), (8.3) and (8.4), we see that the decomposition of an odd prime p which does not divide D will be of the first type if X(p) = + 1 and of
238
IiiI ~
Ii
H
II. II
II
I'
[Chap. 3
the second type if X(p) = -1. This result remains true for p = 2. For if 2,r D, then D == 1 (mod 4) and this means that X(2) = (2f1D!), which equals + 1 for D == 1 (mod 8) and - I for D == 5 (mod 8). Hence we have the fol1owing new formulation of the rule for decomposition in quadratic fields.
I
~f ' ,
Ii
THEORY OF DIVISIBILITY
Theorem 2. If X is the character of the quadratic field R( .jd), then the decomposition of a prime pin R( /d) is given by the conditions: p
= pp', p #' p', p=p,
P
= p2,
N(p)
= N(p') = P
N(p)=p2, N(p)
= p,
if X(p) = 1; ifX(p)
= -1;
if X(p) = O.
AI1 rational integers are partitioned into three sets, depending on the value of X. Each of these sets consists of certain residue classes modulo IDI. By Theorem 2 the type of decomposition of p depends only on which of the three sets contains p. A law of decomposition like that in quadratic fields, where -the type of decomposition depends only on the residue of the prime p with respect to a certain fixed modulus, also occurs for certain other fields. This is the case, for example, for cyclotomic fields (see Section 2.2. of Chapter 5). But it is far from being the case in general. Since the knowledge of such a law of decomposition al10ws us to solve many number-theoretic problems (see, for example, the fol1owing section and Section 2 of Chapter 5), it would be interesting to know for precisely which fields we have such a simple law of decomposition. The answer to this question leads into class field theory. It can be shown that any such field is a normal extension of the field of rational numbers, the Galois group of which is Abelian. Among such fields lie, of course, quadratic fields, which have a cyclic group of order 2 as Galois group. The simplest examples of non-Abelian fields are cubic fields whose discriminant is not a perfect square. An example is the field R(O), where (J3 - 0 - 1 = O. Hence for this field there does not exist any integer M such that the type of decomposition into prime divisors of the prime number p depends only on the residue of p modulo M. Class field theory solves much more general problems than those we have mentioned. It al10ws one to describe the law of decomposition for prime divisors of an arbitrary algebraic number field k in some extension K/k, provided that the Galois group of this extension is Abelian (we spoke above of the special case when k = R). Class field theory has many number-theoretic applications. It al10ws us to carryover the theorem on quadratic forms with rational coefficients, proved in Chapter 1, to the case of quadratic forms with
i
240
THEORY OF DIVISIBILITY
[Chap. 3
of strict divisor classes is also finite, and is related to the number h of divisor classes in the usual sense by ford
.'
li = h
for d > 0,
N(e) = -1;
li = 2h
for d > 0,
N(e) =
+ 1.
Theorem 4 of Section 7 of Chapter 2, when applied to modules which belong to the maximal order of the field R( Jd) with discriminant D, can be reformulated as follows: the strict divisor classes of the quadratic field R( Jd) are in one-to-one correspondence with the classes of properly equivalent primitive binary quadratic forms ofdiscriminant D (which are positive definite if D < 0). We shall try to apply the results of Sections 8.1 and 8.2 to the question of the representation of numbers by binary forms. By Theorem 6 of Section 7 of Chapter 2 the natural number a is represented by some form of discriminant D if and only if there is an integral divisor of the field R( Jd) with norm a (the norm of a divisor coincides with the norm of the corresponding module). Now we can use Theorem 2 to characterize all numbers which are norms of divisors. By this theorem the norm N(p) of a prime divisor p equals the prime number p if x(p) = 0 or ifX(p) = 1, and equals p2 if X(p) = -I. Hence the number a is representable in the form N(o) for some integral divisor 0 = pQ(p) of the field R(Jd) if and only if all prime
n p
factorsp, for which X(p) = - I, occur in a with even exponent. By using the Hilbert symbol (Section 6.3 of Chapter I) we can put this result in somewhat different form. We compute (a, Dip) for all primes which do not divide D. Let a = lb, where b is not divisible by p. From the properties of the Hilbert symbol we obtain
C'pD) (a'2 D ) = (_
=
(b'pD) (;) k =
(*) k
=
1)[(b-')/2].[(D-')/2]+k[(D2_
X(p)k
)/8]
= (-
for p
* 2, p,t D;
l)k[(D2_
)/8]
'
'
= X(2/
for p
= 2, 2,t D
[for p = 2, 2,t D, we have used the fact that D == 1 (mod 4)]. This formula proves the second part of the following theorem. Theorem 3. A natural number a is represented by some binary form of discriminant D if and only if every prime p, for which X(p) = - I, occurs with even exponent in the prime factorization of a. This in turn occurs if and only if
D) =+1
a, (- p
forallp,tD.
Sec.B]
QUADRATIC FIELDS
241
Since the integers a and ab 2 are either both represented or both not represented by forms of discriminant D, we may limit our consideration to squarefree numbers a. If p #- 2, p,r D, and p,r a, then we know that (a, Dip) = + 1. Hence Theorem 3 only imposes a finite number of conditions on the number a, and these conditions only involve the residues modulo IDI of the prime divisors of the square-free number a. Theorem 3 could have been deduced easily from Theorem 7 of Section 7 of Chapter 2. We gave a proof based on Theorem 2 to point out the connection between the question of the representation of numbers by forms of discriminant D and the question of the decomposition into factors in the corresponding quadratic field. This result is not all that we might wish to obtain. We would like to have a criterion for the representation of the number a by forms from a given class of properly equivalent forms, and Theorem 3 only gives us a condition for the representability of a by forms from some class. The following question then arises: Can we partition the classes of forms into nonintersecting collections, so that, for any a, all forms which represent the number a (if any exist) are contained in the same collection? Such a partition was found by Gauss. It is connected with rational equivalence of quadratic forms. Definition. We say that two primitive binary quadratic forms with discriminant D belong to the same genus if they are rationally equivalent.
Since integrally equivalent forms are certainly rationally equivalent, all forms of the same class lie in the same genus. Hence each genus is the union of certain classes. It follows that the. number of genera of forms (for given discriminant D) is finite. In Section 7.5 of Chapter I we defined the invariant eij) for a nonsingular binary rational form/, where p is a prime number or the symbol 00. In the case of a primitive formjwith discriminant D, the determinant equals -tD, and therefore
a, eif)= ( p
D) ,
where a#-O is any number which is rationally represented by the form f Let G be any genus of forms. Since all forms of G have the same invariant, we can set wherejis any form of G. Let a be any nonzero number represented by the form f By the second assertion of Theorem 3 we have eij) = (a, Dip) = 1 for all primes p which
I
IIi !
ji
242
THEORY OF DIVISIBILITY
[Chap. 3
do not divide D. Further e 00(/) = I, since in the case D < 0 we are considering only positive-definite forms. Hence for any genus G of forms with discriminant D we have (8.6) for p,t D and p = 00. Hence each genus G is uniquely determined by the invariants ep(G), where p runs through all prime divisors of the discriminant D. We can now give conditions for a number to be represented by some form of a fixed genus. Theorem 4. Let a be a natural number and G be a genus of forms with discriminant D. In order that a be integrally represented by some form of G, it is necessary and sufficient that
(
a, D) P =ep(G)
for all primes p. Proof. The condition is clearly necessary. If for some a we have (a, Dip) = ep(G) for all p, then by (8.6) [(a, D)lp] = I for all p,t D. Theorem 3 implies that a is represented by some form j of discriminant D, and since ep(j) = [(a, D)lp] = ep(G), then j belongs to the genus G. The theorem is
proved. The assertion of Theorem 4 is interesting in that the condition for the representability of a by some form of the genus G only involves the residue of a modulo IDI [assuming that a is represented by some form of discriminant D, that is, that [(a, D)lp] = I for all p,t D]. In the case when each genus consists of one and only one class, Theorem 4 gives us an ideal answer to the question of the representation of numbets by binary forms. In the general case this result cannot be improved, in the following sense. Suppose we take a set S of classes of forms, which is not the union of some genera. Then there does not exist any modulus m such that the representation of a number a by some form of our set S depends only on the residue of a modulo m. In particular, if a genus consists of several classes, then it is not possible to characterize the numbers represented by forms of one of those classes in terms of the residues of the numbers for some modulus. These facts can be proved by class field theory. The proof has the following flavor. The representation of a prime number p by some form from our set of classes S can be interpreted in terms of the type of splitting of this prime into prime divisors in some field L. The field L will have an Abelian Galois group over the rational field if and only if our set S is a union of genera [H. Hasse, Zur Geschlechtertheorie in quadratischen Zahlkorpern, J. Math. Japan 3, No. I, 45-51 (1951)].
Sec.B]
QUADRATIC FIELDS
243
We now investigate the question of the number of genera. Let PI' ... , Pt be all prime divisors of the discriminant D. By (8.6) every genus is uniquely determined by the invariants ei = ep,(G). These invariants cannot be arbitrary, since, iff E G and the number a#-O is represented by f, we have el
...
et =
TI eiG) == TI (a, D) p
p
p
=1
[see (7.17) of Chapter I; the product is taken over all prime numbers P and the symbol C()]. We now show that the relation el ... et
=I
(8.7)
between the numbers ei = ± I is not only necessary, but also sufficient, in order that these numbers be the invariants of some genus G. Denote by k i the power to whichp divides D (k i equals I for all Pi #- 2 and equals 2 or 3 for Pi = 2). For i = I, ... , t choose an integer ai' not divisible by Pi, such that (ai' Dip;) = ei' and then determine a by the system of congruences a == a i (mod p/') (l ~ i ~ t). For any a which satisfies these congruences we have (by the properties of the Hilbert symbol)
(a~~) = (a~iD)
=
ei.
Our problem is then to find, among all such values of a, one. for which (a, Dip) = I for all p,t D. We use here the theorem of Dirichlet on prime numbers in arithmetic progressions (Section 3 of Chapter 5). Since the set of all such values of a is a residue class modulo ID I = TIp/' consisting of numbers relatively prime to D, we may choose among them a prime valueq, by Dirichlet's theorem. We then have
(q~~) = (a~~) = ei; (q~D)=1 (q~D)
forp,tD,p#-2
and
p#-q; "/ 1,1
= (_
1)[(q-I)/2][(D-I)/2]
= I
for2,tD.
The relation TIp (q, Dip) = I then yields e l ... et(q, Dlq) = I, so it follows from (8.7) that the value of the symbol (q, Dlq) is also 1.
I,
:'
i I
II!
ii, " 11
II'II..
I
':1
I,'
I'
I
il
I
111
,fll
t
[Chap. 3
THEORY OF DIVISIBILITY
244
Thus there exists a natural number a (which is also prime) such that
a, D) = e (1 ~ i ~ t) (--p;j
and
(a,p D) = 1
for p,t D.
By Theorem 3, a is represented by some formf with discriminant D. If this form belongs to the genus G, then (1 ~ i ~ t).
This proves our assertion on the existence of a genus with given invariants [satisfying, of course, (8.7)]. Since there are 2,-1 possible choices of e i = ± 1 which satisfy (8.7), the number of genera of forms of discriminant D also equals 2' - I •
:I
Theorem 5. Let PI' ... ,P, be" the distinct prime divisors of the discriminant D of the quadratic field R( .jd). For any choice of the values ej = ± 1 (1 ~ i ~ t) such that e l ... e, = 1, there is a genus G of forms of discriminant D for which ep,(G) = ej' Hence there are 2,-1 genera of forms of discriminant D. Remark 1. The theory of genera of forms, given in this section when the discriminant coincides with the discriminant of the maximal order of a quadratic field, can also be developed for forms with discriminant Df2.
I
I:
'
Remark 2. If every genus of forms with negative discriminant Df2 consists of a single class, then there is a simple formula (Problem 18) for the number of representations of an integer relati vely prime to f by a fixed form of discriminant Df2. A table of the known values for the discriminant Df2 < 0 with each genus consisting of a single class is given at the end of the book. It is not known if this table is complete. It has been proved that the number of such discriminants is finite. For the even numbers Df2 in this table the numbers _ !Df2 were found by Euler, who called them convenient numbers. They were used by Euler to find large prime numbers because of the following property: If a and b are relatively prime numbers whose product ab is a convenient number and if the form ax 2 + by 2 represents the number q in essentially only one way (with relatively prime x and y), then the number q is prime (see Problem 19). For example, the difference 3049 - 120y2 is a square only when y = 5, and this means that the number 3049 is represented by the form x 2 + 120y2 in only one way: 3049 = 7 2 + 120· 52, and hence is prime. By this method Euler found many primes which were very large for those times. It is clear that for larger convenient numbers, the work involved in proving uniqueness is less.
Sec.B]
QUADRATIC FIELDS
245
8.4. Genera of Divisors
The results on genera of forms obtained in Section 8.3 allow us to draw some conclusions on the structure of the group of divisor classes (in the strict sense) of a quadratic field. We carryover the concept of genus to divisors. By Theorem 6 of Section 6 every divisor a (integral or fractional) corresponds to a unique ideal a, which consists of all numbers of the field which are divisible by a. For a quadratic field every basis {IX, fl} of the module a, which satisfies condition (7.10) of Chapter 2, corresponds to a primitive form f(x, y) =
N(lXx + fly) N(a) .
(8.8)
If we pass to another basis of the module a [which also satisfies (7.10) of Chapter 2], the form f will be taken to a strictly equivalent form. Hence by (8.8) the divisor is associated with a whole class of strictly equivalent forms. This mapping sets up a one-to-one correspondence between classes of divisors in the narrow sense and classes of strictly equivalent forms of discriminant D, which was already remarked at the beginning of Section 8.3. Definition. Two divisors of a quadratic field are in the same genus if their corresponding classes of forms are contained in the same genus of forms (that is, are rationally equivalent).
Since divisors which are strictly equivalent correspond to the same class of forms, then each genus of divisors is a union of classes (in the strict sense) of divisors. A genus of divisors, corresponding to the genus of forms G, will also be denoted by G. By the invariants ep(G) of a genus G of divisors, we mean the analogous invariants for the corresponding genus of forms. We then have the formula (8.9) where a is any divisor of the genus G. For by definition we have ep(G) = (a, Dip), where a is some nonzero rational number which is represented by the form f(x, y) given by (8.8). But the form N(lXx + fly) represents all squares of rational numbers, so in particular it represents N(a)2. Hence,f(x, y) represents N(a), which proves (8.9). The genus of divisors Go, all invariants of which equal I, is called the principal genus. The divisors of the principal genus are characterized by the condition (N(a), Dip) = I for all p. Hence the principal genus is a subgroup (with respect to the operation of multiplication of divisors) of the group of all divisors. Further, any genus G of divisors is a coset of aGo of the subgroup
246
THEORY OF DIVISIBILITY
[Chap. 3
Go, where a is any divisor of the genus G. The set of all cosets of the subgroup Go is a group, the factor group of the group of all divisors modulo the subgroup Go . Hence we can consider the set of all genera as a group. It is called the group 01 genera. By Theorem 5 the order of the group of genera is 2'-1, where t is the number of distinct prime divisors of the discriminant D. We now characterize the genera of divisors in terms of divisors, without mentioning forms.
"
Theorem 6. Two divisors a and a l of a quadratic field belong to the same genus if and only if there is an element of positive norm in the field such that N(a l ) = N(a)N(y).
fl} and {lXI' (7.10) of Chapter 2. Then the forms
Proof. Choose bases {IX,
f(x, y)
=
N(IXX
fld for the ideals 0 aJld 01 which satisfy
+ fly)
N(a)
,
correspond to the divisors a and a l . By Theorem 11 of Section 1 of the Supplement, the forms I and II are rationally equivalent if and only if there is a nonzero rational number which is represented by both of these forms. But this would mean that N(~)
N(~I)
N(a)
N(ad
--=--
(~, ~I
'*
0),
and the assertion of the theorem follows. Divisors of the principal genus have the following important characterization. Theorem 7. The divisor a belongs to the principal genus if and only if it is strictly equivalent to the square of some divisor. Proof. Suppose that the divisor a belongs to the principal genus. Since the
unit divisor belongs to the principal genus, it follows from Theorem 6 that there is a number y for which N(a) = N(y). Replacing a by the equivalent divisor a(y -I), we may assume that N(a) = 1. Now write a as a product of prime divisors. Here we distinguish between those prime divisors l-\ for which their exists another prime divisor p;' with the same norm (the first type of decomposition in the terminology of Section 8.1), and all other prime divisors qj a = f1 p/;p/b; f1 q/J. i
j
Sec.B]
247
QUADRATIC FIELDS
Since N(Pi) = N(pn = Pi and N(qJ = q/j (where have
'j
equals I or 2), then we
TI pt + TI q/jCj = 1 i
b
i
;
j
[since N(a) = 1]. Since the primes p j and qj are all distinct, h j = cj = 0, so that a = TI ptip/-Ui.
-OJ
and
i
But PiP;' =P;, so that p;,-l ~ Pi' from which it follows that
(here the sign ~ denotes strict equivalence of divisors). Conversely, if a ~ b 2, that is, a = b 2(1X), with N(IX) > 0, then N(a) = N(fJ), where fJ = N(b )IX, and then, by Theorem 6, a belongs to the principal genus. Theorem 7 is proved. Let (t denote the group of classes of strictly equivalent divisors. If we map each class C E (t to that genus G which contains the class C, we obtain a homomorphism of the group of classes onto the group of genera. Its kernel is the set of all classes which are contained in the principal genus. By Theorem 7 the class C' is contained in the principal genus if and only if it is the square of some class of (t. Hence the kernel of the homomorphism of the group (t onto the group of genera is the subgroup (t2 which consists of all squares C 2 of classes C E (t. Using an elementary theorem on homomorphisms in group theory and the fact that the group of genera has order 2'-1, we arrive at the following result. Theorem 8. The factor group (t/(t2 of the group of classes of strictly eq uivalent divisCt~·s by the subgroup of squares has order 2' - 1, where I is the number of distinct prime numbers which divide the discriminant D of the quadratic field.
The value of Theorem 8 lies in the information which it gives on the structure of the group (t. By Theorem I of Section 5 of the Supplement, the group (t can be decomposed into the direct product of cyclic subgroups. From Theorem 8 it easily follows that precisely I - I of these subgroups have even order. In particular, we obtain the following fact. Corollary. The number of classes of strictly equivalent divisors of a quadratic field is odd if and only if the discriminant of the field is only divisible by one prime.
il
~~
II'
248
THEORY OF DIVISIBILITY
[Chap. 3
Such fields are R( ~=1), R( ~2), R( ~ - 2, R( ~P>, with p of the form + 1, and R(~ -q) with q of the form 4n + 3. This fact is the basis for the little we know about the structure of the divisor class group.
4n
PROBLEMS
I
I'
1. Let X be the character of the quadratic field with discriminant D. Show that X can be expressed in terms of the Hilbert symbol by the formula x(a) =
TI (a, D) PID
(a, D) = 1.
P
2. If Y is any integer of a quadratic field which is relatively prime to the discriminant D, show that the congruence x 2 "" N(y) (mod IDi) is always solvable. 3. Let G be the group of residue classes of rational integers (mod IDi)which are relatively prime to D, and let H be the subgroup of those classes which contain the norm of some integer of the quadratic field with discriminant D. Show that the index (G: H) is equal to 2', where t is the number of distinct primes which divide D. 4. Under the same notations as Problem 3, let H* denote the subgroup of G consisting of all residue classes which contain the norm of some integral divisor of the quadratic field with discriminant D. Show that (G : H*) = 2. 5. If Y is any number with positive norm of the quadratic field with discriminant D, show that for all p,
(N(~,
D)
=
I.
6. Let a and b be integral ideals which are relatively prime to D. Show that a and b belong to the same genus if and only if for some integer y we have N(a)
~
N(y)N(b) (mod IDI).
7. If the discriminant of a real quadratic field is divisible by only one prime, show that the norm of a fundamental unit is -1. 8. Show that the automorphism a: iX -+ ixu of the quadratic field R(Vd) (not the identity automorphism) induces an automorphism a -+ aU of the group of divisors for which (ix U) = (ix)U for all ix #' O. What is the behavior of this automorphism on prime divisors? 9. The automorphism a of the group of divisors (Problem 8) induces an automorphism a: C -+ C U of the group of classes of strictly equivalent divisors. Namely, if a E C, then C U is that class which contains aU. The class C is called invariant if CU = C. Show that a class C is invariant if and only if C 2 is the principal class. 10. Show that the subgroup of the group of classes of strictly equivalent divisors which consists of all invariant classes is of order 2,-1 (t is the number of distinct primes which divide the discriminant).
Sec.B]
249
QUADRATIC FIELDS
11. If f3 is an element of a quadratic field with N(f3) = 1, show that there exists an such that
IX
IX"
N(IX)
> 0, f3 = ± -. IX
12. Show that every invariant class C contains a divisor a for which a" = a. 13. Let~" ... , ~t be the distinct prime divisors which divide the discriminant D. Show that each invariant class C contains precisely two divisors of the type (k=O, 1, ... ,1).
14. The subgroup of all invariant classes contained in the principal genus is a direct product of cyclic subgroups of order 2. Let Q; be the group of classes of strictly equivalent divisors, and recall the definition of the invariants of a finite Abelian group (Supplement, Section 5.1). Show that the number of cyclic components of the above subgroup equals the number of invariants of Q; which are divisible by 4. 15. Show that the number of positive integers r which divide the discriminant Dare square-free, and satisfy
C'pD)
=
for all p,
1
is a number of the form 2". Show further that the number of invariants of the group Q; which are divisible by 4 equals u - 1. 16. Let m be a natural number which is relatively prime to the index f of the order 0 , in the maximal order of the quadratic field R( v'd). Show that the number of modules in R(v'd) which have coefficient ring 0" are contained in 0 / and have norm m, equals the number of integral divisors of the field R(v'd) with norm m. 17. Show that the number of integral divisors of the quadratic field R( v'd) with norm m equals
where X is the character of the field R( v'd) and r runs through all natural numbers which divide m. 18. Let gl(X, y), ... ,g,(x, y) be a full system of pairwise nonequivalent positive primitive quadratic forms with discriminant Dp < [D being the discriminant of the maximal order of the field R( v'd»), and let m be a natural number which is relatively prime to f Show that the number N of all representatiohs of the number m by all the forms g" ... , gs is given by
°
N= x
where
2: x(r),
'I'"
for D= -3,/= 1; for D
=
-4,/= 1;
for Dj2 < -4. 19. Let g(x, y) be a positive form with discriminant Dp < -4 and let q be a natural number relatively prime to Df2. Assume that every genus of forms with discriminant Df2 consists of a single class. If g(x, y) = q has precisely four solutions in integral, relatively prime x and y, show that q is a prime.
I
250
THEORY OF DIVISIBILITY
[Chap. 3
20. Let hI be the number of classes of similar (in the usual sense) modules of a quadratic field which belong to the order f) I (Problem 11 of Section 7, Chapter 2). Show that
./
where X is the character of the quadratic field and P runs through all primes which divide f 21. Show that a prime number is represented by the form x 2 + 3y 2 if and only if it has the form 3n + 1. 22. Show that the form x 2 - 5y 2 represents all prime numbers of the form IOn ± 1 and does not represent any primes of the form IOn ± 3. 23. Show that the natural number m is represented by the form x 2 + 2y 2 with relatively prime x and y if and only if in the representation
ex is either 0 or I, and each odd prime Pi is of the form 8n + 1 or 8n + 3. 24. Show that there exist quadratic fields (both real and imaginary) with arbitrarily large numbers of divisor classes. 25. Let P" ... ,Ps be the distinct prime numbers which divide the discriminant D of the quadratic field R(Vd). From
(P';)D)
=
(-I)"')
(l ~ i,j ~ s)
we obtain a matrix (al) whose elements lie in the field of residue classes modulo 2. Let p denote the rank of this matrix [over the field GF(2)]. Show that the number of invariants of the group of classes of strictly equivalent divisors, which are divisible by 4, equals s - p - 1. 26. Let P and q be prime integers, with P oF 2 and q ~ P (mod 4). Show that the number of divisor classes of the field R(V -pq) is divisible by 4 if and only if (qjp) = 1. 27. Let p" ... ,Ps be distinct prime numbers of the form 4n + 1, and let d = PI'" Ps "" 1 (mod 8). Show that every genus of divisors of the field R(V -d) consists of an even number of classes. 28. Let E be a fundamental unit of the real quadratic field R(Vd), whose discriminant is not divisible by any prime number of the form 4n + 3. Show that if the principal genus of divisors of the field R( Vd) consists of an odd number ofclasses of strictly equivalent divisors, then N(E) = - 1. 29. Let p be a prime number of the form 8n + 1. Show that the number of divisor classes of the field R(V ~p) is divisible by 4. \
CHAPTER
4
Local Methods
In Section 7 of Chapter 1 we gave a proof of the Hasse-Minkowski theorem on the representation of zero by rational quadratic forms. Both in the formulation and in the proof of this theorem we had to embed the field R in the p-adic fields R p , and in the real field Roo, that is, in all completions of the field R. A method of solving problems in number theory by use of the embeddings of the ground field in its completions is called a local method. Such methods have important number-theoretic consequences, not only when applied to tie field of rational numbers, but also when applied to algebraic number fields. Local methods are also instrumental in the study of algebraic function fields. In thi~ chapter we describe the basic facts that hold for local methods for any ground field, and then make an intense application to prove some deep results on the representation of numbers by nonfull decomposable forms (Section 1.3 of Chapter 2). We refer particularly to the theorem of Thue, which states that the equationf(x, y) = c, where f(x, y) is an integral homogeneous irreducible polynomial of degree ~ 3, has only a finite number of solutions in integers. Thue himself proved this theorem by using the theory of rational approximations to algebraic numbers. A proof based on local methods was given by Skolem. Actually Skolem's proof involves putting a small restriction on the polynomialf(x, y). On the other hand, his proof has the advantage of allowing a general approach to the problem of the representation of numbers by a fairly wide class of nonfull decomposable forms. We return to this point in Section 6.4. The basic idea in Skolem's method can be clarified in the following simple example. We shall try to establish the finiteness of the number of solutions in 251
i
'I
1
j, 252
LOCAL METHODS
[Chap. 4
integers to the equation (0.1) where c and d are integers, and d is not a cube. Consider the cubic field R(O), where 0 = We can now write (0.1) in the form
Vd.
N(x
+ yO) = c.
(0.2)
Hence the problem reduces to finding all numbers in the nonfull module {t, O} of the field R(O) with given norm. We embed the module {I, O} in the full module {t, 0, 02}, which coincides in this case with its coefficient ring.o. The solutions of (0.2) are hence numbers IX E .0 with norm c, for which in the representation IX = X + yO + Z02 the coefficient z equals O. But we have already solved the problem of finding all numbers with given norm in a full module (Theorem I of Section 5, Chapter 2). In this case we have s = I, t = I (since the polynomial x 3 - d has one real root and two complex roots). Hence there is a unit e E .0 with norm + I, and a finite set of numbers j,t1' ... , j,tk each with norm c, so that every IX E .0 with norm C has a unique representation in the form j,tje" for some i = I, ... ,k and some rational integer u. To prove that (0.1) has only a finite number of solutions, it will suffice to show that only a finite number of the numbers J1.ie" have the form x + y(}. Along with the field R«(}) consider its conjugate fields R«(}') and R(O") and, for every IX E R(O), denote by IX' E R«(}') and IX" E R(O") the corresponding conjugates. If we set j,te" = x
+ yO + Z02,
then, taking conjugates, we shall also have
jie'" = x j,t"e"" = x
+ y(}' + Z(},2, + yO" + ZO,,2.
From these three equations we can find an expression for z. It will have the form
where Yo, YI' Y2 are certain (nonzero) numbers of the field K = R«(}, (}', 0"). Solutions of (0.1) hence lead to solutions of the equation (0.3) in the rational integer u. Since this equation contains only one unknown, it is natural to expect that it will have only a finite number of solutions. But it is not at all simple to give a proof. Skolem's method is based on the consideration of the left side of (0.3)
Sec. 1 ]
FIELDS COMPLETE WITH RESPECT TO A VALUATION
253
as an analytic function F(u) in the p-adic domain. If (0.1) has an infinite number of integral solutions, then F(u) has an infinite number of integral zeros. In Section 3.4 of Chapter 1 we saw that the collection of p-adic integers is a compact set, and hence the function F(u) would vanish at an infinite sequence of points which converge to a limit point (in its domain of definition). In the theory of analytic functions of a complex variable it follows from the uniqueness theorem that such a function is identically zero. The proof of this fact translates word-for-word to the case of p-adic analytic functions. Hence the function F(u) must be identically zero and we obtain a contradiction. Already in this example the p-adic numbers introduced in Chapter 1 are insufficient. Since the numbers Yo, Yl' Yz, e, e', en in (0.3) are algebraic numbers, we must develop a theory, analogous to the theory of p-adic numbers, with the rational field R replaced by an arbitrary algebraic number field k and the prime number p replaced by a prime divisor p. This leads us to Section 1.
1. Fields Complete with Respect to a Valuation 1.1. The Completion of a Field with Respect to a Valuation
In Section 4 of Chapter 1 we showed that to every prime number p, that is, every prime divisor of the rational field R, corresponds the p-adic metric qJp of the field R, the completion of which is the field R p of p-adic numbers. The definition of the metric used no properties of the field R other than the existence of the p-adic valuation vp [see formula (4.1) of Chapter 1]. Therefore, the construction of analogous completions can be carried out for any field k, provided we have a theory of divisors in it. If P is any prime divisor of the field k, and v = Vp is the corresponding valuation, then by taking any real number p with 0 < p < 1, we can define a metric qJ = qJp on k by setting qJ(X)
= pv(x)
(x
E
k).
(1.1)
Then, following the construction of Section 4.1 of Chapter 1, we may form the completion k = kp of the field k with respect to this metric. [The fact that the function (1.1) is a metric is easily verified.] The field kp is called the p-adic completion of the field k. The completion k = kp clearly does not depend on what theory of divisors we have in mind for the field k. It is completely determined by the single valuation v = vp ' Hence we shall also call it the completion of k with respect to the valuation v. In this section we study such completions and their finite extensions. Let Kbe the completion of the field k with respect to the valuation v. We now show that the valuation v can be extended in a unique fashion to a valuation
254
LOCAL METHODS
[Chap. 4
V of the field k. In Section 4.1 of Chapter 1 we saw that the metric cp of the field k [see (1.1)] can be extended to a metric iP of the field k, so that if cx E " and cx = lim n-+ an, where an E k, then iP(cx) = lim n-+ cp(an). But in this case zero is the only limit point of the set of numbers cp(a), a E k, and hence the sequence {cp(a n)} must either converge to zero (if a = 0), or it must remain constant from some point on (if a #- 0). Hence the sequence {v(a ll )} converges to infinity if a = 0 and eventually becomes constant if a #- O. We can therefore set et)
et)
v(a) = lim v(a n ). It is now easily verified that the function v(cx) (whose value clearly does not depend on the choice of the sequence {ann is a valuation of the field k, with v(a) = v(a) for all a E k. It is also clear that the metric iP of the field kisrelated to the valuation v by (a E k).
It will be convenient later to express convergence in the field k in terms of the valuation v instead of the metric iP (just as was done for the p-adic numbers in Section 3.4 of Chapter 1). Let 0 be the ring of the valuation v, that is, the ring of all a E k such that v(a) ;:. 0 (Section 4.1 of Chapter 3). We now show that the closure 0 of the ring 0 in the field K coincides with the ring of the valuation iI (if A c: k is any set, by the closure A of A we mean the set of all elements of k which are limits of sequences of elements of A). If a E 0, then a = limn -+ an' where an E 0, so that v(a) = limn -+ v(an);~ O. Assume, conversely, that v(a);:. O. Since a is the limit of a sequence of elements of k, then for any natural number n we can find an element a E k such that iI(a - an) ;:. n. Then a = limn _. an' where et)
et)
et)
v(a n) = v(a - (a - an)) ;:. min (v(a), (iI(a - all)) ;:. 0; that is, an E o. Our assertion is proved. By Theorem 2 of Section 4, Chapter 4, the ring 0 has, up to associates, only one prime element n, which is characterized by v(n) = 1. It thus remains a prime element in the ring 0 [since v(n) = 1]. Let ~v and ~v denote the residue class fields of the valuations v and v (see Section 4.1 of Chapter 3). Since elements of 0 are congruent modulo n in 0 if and only if they are congruent modulo n in 0, we have an isomorphism from the field ~v to the field ~v. On the other hand, for any a E [) there is an element a E 0 such that v(a - a) ;:. I; that is, a == a(mod n). This means that the mapping ~v -+ ~v is an isomorphism onto the entire field ~ •. Because of this isomorphism we shall frequently identify the field ~v with ~v.
Sec. 1 ]
FIELDS COMPLETE WITH RESPECT TO A VALUATION
255
1.2. Representation of Elements by Series In this section we assume that k is complete with respect to the valuation v [that is, that it is complete under the metric (1.1)]. We shall call the ring 0 of the val uation v the ring of integral elements (or integers) of the field k. We denote some fixed prime element of the ring 0 by n. The residue class field ~ of the valuation v will also be called the residue class field of the field k. Everything that was said about p-adic series in Section 3.4 of Chapter 1 clearly remains true for series in the field k. In particular, Theorem 8 of Section 3, Chapter I, is valid. Taking arbitrary integers CJ.n(m ~ n < (0), we consider the series (1.2)
n Since v(CJ.nn ) = v(CJ. n) + n ~ n, then CJ.nn n --+ 0 as n --+ 00; that is, the general term of the series (1.2) converges to zero. Hence the series (1.2) converges and its sum is some element of the field k. It is now natural to ask if every element of k has a representation in the form (1.2), and, if so, if there is a canoniCal representation such as that obtained for p-adic numbers (Theorem 10 of Section 3, Chapter I). The answer will be affirmative. We choose in the ring 0 some complete system of residues modulo n. We assume that 0 E S, that is, that the class of elements of the ring 0 which are divisible by n is represented by O.
,
'
:' "j
I, l!
'f 1;1
r:
,.
; '! ~
1
Theorem 1. Let k be a complete field under the valuation v, with 0 the ring of integers of k, n a prime element of 0, and S a complete system of residues (containing 0) of the ring 0 modulo n. Then every element CJ. E k has a unique representation as the sum of a series 00
CJ.=
i 'a.n L., I , i=m
where a i
E
S (m
~
(1.3)
i < (0).
:J. = 0 we have the representation 0 = Li~OO·ni. Let CJ. #- O. If IrI v(o:) = m, then v«(xn- ) = O. The element CJ.n- mis congruent modulo n to some nonzero element of S, say, to am' Since CJ.n- m - am =n~, where ~ E 0, then
Proof. For
Assume that for some n > m we have found the representation
rI ii
[I
I'. t:.' ~ :'
,
i. .
i
256
LOCAL METHODS
[Chap. 4
where ai E S (m ~ i ~ n - 1), '711 E o. Choose all E S, so that '711 == all (mod n). Since '711 = all + '711 + 1 TC, where '711 + 1 E 0, then we have the representation m a. = amTC + ... + a"n" + '711+ I n" + 1 for a.. We continue this process indefinitely. Since v('7lrr") ~ n, then '7l1rrn -+ 0 as n -+ (f), and hence a. = I~m aiTC i. If not all coefficients ai in (1.3) are zero, then we may assume that am "# O. In this case v(a m ) = 0, since all elements of 0 which are not divisible by n are units. Then m v(tma;nj) = v(amn ) = m. From this it follows that the representation for a. for some a. "# 0 we have two representations: 00
= 0 is unique. Assume that
00
,j a.= "L.,.ajTC j = "L.,.ajTC
i=m
i=m'
If am"# 0 and am" "# 0 in these representations, then it follows that m = m'. Suppose we have already established that ai = a;' for m ~ i < n (n ~ m). Multiply the equation I~lIaini = I~lIa/TCi by n- II . Turning to'congruences modulo n we find that all == an' (mod n), and since both all and a.' lie in S, then all = all" This proves Theorem 1. Note that in the case when k = R p , n = p, and S = (0, 1, ... ,p - 1), Theorem 1 reduces to Theorem 10 of Section 3 of Chapter 1. Corollary. Using the notations of Theorem 1, every integral element a. has a unique representation in the form (all E S).
E
k
(1.4)
It is easily seen that Theorem 9 of Section 3 of Chapter 1 is valid for series in the field k. Hence convergent series in k can be multiplied by the usual method in analysis. In particular, we can treat series of the form (1.3) as a power series in n. But when we operate with series of the form (1.2) using the rules for power series we must keep in mind that we will obtain series of the form (1.2), in which the coefficients a. 1I do not belong to the system S. We can translate the obtained series into the form (1.3) by replacing in turn each coefficient a. 1I by its residue all E S, where a. 1I = all + nYII' adding at each stage the element YII E 0 to the following coefficient.
Remark 1. The representation (1.3) clearly depends on the choice of the system S. Among all such systems there is, in many important cases, a .. best" system which has the property of multiplicative closure, or even is a subfield of the field k (see Problems 7 to 11).
Sec. 1 ]
257
FIELDS COMPLETE WITH RESPECT TO A VALUATION
Remark 2. These results generalize the analogous facts for p-adic fields (Section 3.4 of Chapter 1). We must warn that Theorem 6 of Section 3, Chapter 1, is no longer valid for arbitrary fields complete with respect to a valuation. It holds only for those fields k where the residue class field 1: has a finite number of elements. The same is the case for Theorems 1 and 2 of Section 5, Chapter 1 (when F is a form with coefficients in the ring 0). But Theorem 3 of Section 5, Chapter I, carries over word-for-word to the case of an arbitrary field k which is complete with respect to a valuation. In the future we shall use the corollary to this theorem in the following form: If F(x) is a polynomial with integral coefficients in k and there is an integer E k for which F(e) == 0 (mod n) and F'(e) "I- 0 (mod n), then there is an integral element () E k for which == () (mod n) and F«(}) = O.
e
e
1.3. Finite Extensions of a Field Complete with Respect to a Valuation Let k be complete under the valuation Vo • Then k has algebraic extensions of all degrees (Problem 9 of Section 3, Chapter 3). Let K be an extension of k of degree n. By Theorem 5 of Section 4, Chapter 3, there is a valuation v of K which is an extension of Vo ' Our goal is to show that in this case v is unique and that K is complete under v. Let L be a subspace of K, considered as a vector space over the field k, and let Wi> ... , W s be a basis for Lover k. Each element a. of L has a unique representation in the form
(ai
E
k).
(1.5)
If vO(a i) ~ N (i = I, ... ,s), then, by the properties of valuations,
v(a.)
~
min v(aiw;)
~
eN + min v(w;),
where e denotes the ramification index of v relative to Vo (see Section 4.3 of Chapter 3). Conversely, we shall show that if the element a. E L is very small under the valuation v (recall that" small" elements are characterized by large values of the valuation), then all the coefficients a i in (1.5) will be small under the valuation Vo ' More precisely, this means that for any N we can find an M such that whenever v(a.) ~ M, then VO(a.i) ~ N (i = 1, ... ,s). For s = 1 the assertion is clear. We prove the general case by induction on s. Let s ~ 2, and assume that the assertion is false, that is, that there exists a number Nand elements a. E L so that the value of v(a.) can be made arbitrarily large, but for which there is at least one coefficient a i with vo(a;) < N. We may clearly assume that this inequality always holds for the first coefficient al' Hence for each natural number k we choose an element a. k E L for which V(a.k) ~ k + eN and the coefficient a l (k) in the decomposition
(a/ k ) E k),
[Chap. 4
LOCAL METHODS
258
{Pd, where + b1(k lw1 + ... + b/klws •
satisfies vOCal (kl) < N. Consider the sequence Pk
= a:kal(kl-I =
WI
(1.6)
Since V(Pk) = v(a: k) - evo(al (k l ), then V(Pk) > k.
The differences s
PH I - Pk
= L (b/ H
Il - b/kl)w i
i= 1
all lie in the subspace of dimension s - 1 which is spanned by the elements Wl' ... , w s ' and they satisfy
V(Pk)) > k;
that is, V(PH I - Pk) -+ 00 as k also have (for i = 2, ... , s)
-+
00. Then by the induction hypothesis we
v(b?+ Il - b/ kl ) -+ 00
for k -+ 00.
Hence since the field k is complete (see Theorem 7 of Section 3, Chapter 1) the sequence {b?l}k"= I converges to some element b i E k. Passing to the limit in (1.6) as k -+ 00 and noting that Pk -+ 0, we obtain WI
:[
I I i
I.
I i
Of!
I
t
+
b1Wl
+ ... + bsw s = 0,
which contradicts the linear independence of the elements WI' ... , W s over the field k. This contradiction proves our assertion. We now take for L the whole field K. If the sequence {a:d of elements of K is a Cauchy sequence, that is, if V(a:HI - a: k ) -+ 00 as k -+ 00, then by what we have just shown, the sequences {a?l}k"= 1> which arise from the decompositions a:k
=
al(k)wI
+ ... + a.(k lwn
(a/ kl
E
k)
(1.7)
(here WI' ... , W n is a basis for Kover k), will converge in the field k. Then the sequence {a:d will also converge. This shows that K is complete with respect to the valuation v. In addition, we see that convergence in K, relative to the valuation v, is completely determined by convergence in k (relative to the valuation yo). From this fact it easily follows that there is only one extension of the valuation Vo to the field K. For suppose that v and v' are two different extensions. By the independence of valuations, there is an element a: E K, for which yea:) > 0 and v'(a:) = O. The sequence {a: k } converges to zero relative to the valuation v, but does not converge relative to v' [since v'(a: k+ I - a: k ) = v'(a: - 1) does not converge to infinity]. This contradicts the fact that convergence in K is determined by the valuation vo . We have hence obtained the following theorem.
Sec.1]
FIELDS COMPLETE WITH RESPECT TO A VALUATION
259
Theorem 2. Let k be complete with respect to the valuation Vo , and let K be a finite extension field. The valuation Vo has a unique extension v to the field K. K is complete with respect to v. If Wl' ... , W n is any basis for Kover k, then a sequence {O:k} of elements of K is convergent if and only if each of the sequences {a/ kl } (1 ~ i ~ n), which are determined by (1.7), converges in k.
1.4. Integral Elements
We now study the relationship between the ring 0 of integral elements of the field k, complete under the valuation Vo , and the ring.o of integral elements of the finite extension K of k. Since the valuation Vo has only one extension v to the field K, then by Theorem 6 of Section 4, Chapter 3, the ring .0 (the ring of the valuation v) is the integral closure of the ring 0 in the field K. Hence for any 0: E .0 the norm N(o:) = NK1k(0:) belongs to 0, and then the norm N(e) of any unit e of the ring .0 will be a unit in the ring o. Now let 0: ¢ .0. Since 0:- 1 ED and is not a unit, then N(o:-l) = N(o:)-1 belongs to 0 and is not a unit in o. But in this case N(o:) = (N(o:)-l)-1 does not belong to o. We have proved the following theorem. Theorem 3. Let 0: be an element of the finite extension K of the field k, complete with respect to a valuation. Then 0: is an integral element if and only if N KIk(O:) is integral in k. Corollary. An element e E K is a unit in the ring .0 if and only if its norm N(e) is a unit in the ring o.
The rings
0
and .0 can be considered as rings with a theory of divisors. Let
V and ~ denote the (only) prime divisors of these rings. The degree of inertia
f of the divisor
~ relative to p, that is, the degree of the extension (~ : ~o) of the residue class field ~ of the field K over the residue class field ~o of the field k, is also called in this case the degree of inertia of the extension K/k. Analogously, the ramification index e of the divisor ~ relative to p is called the ramification index of the extension K/k. If no and n are prime elements in o and .0, then we know that
(1.8)
where e is a unit of .0. Let So be a complete system of residues in the ring 0 modulo no. As before, we assume that 0 E So. It is easily seen that if the residue classes WI' ... 'WI of ~ form a basis over ~o, then the set S which consists of the linear combinations (1.9)
260
[Chap. 4
LOCAL METHODS
where aI' .,. ,al independently run through all elements of So, is a complete system of residues in the ring .0 modulo 1L Definition. A basis (}1' .. , , (}n for the field Kover k is called afundamental basis if all (); are integral and for any integral a. E K all coefficients a; in (a; E k)
are integral in k. Theorem 4. Let k be complete with respect to the valuation Vo and let K be a finite extension with ramification index e and degree of inertia! Let ~o and ~ be the residue class fields of k and K, and let n be a prime element of the ring of integers of the field K. If WI' ... 'WI are residue classes of the field ~ which form a basis over the field ~o, then the system of elements (i
= 1, ... ,f;
j
= 0, 1 ... , e - 1),
(1.10)
is a fundamental basis for the extension Klk. Proof. We first show that the elements (1.10) are linearly independent over k. Assume, on the contrary, that we have
where ai j are elements of k, not all equal to zero. We may assume that all aij are integers and that at least one of them in a unit in 0 (ifthis'is not so, multiply by a suitable power of the prime ,element no EO). Let jo (0 ~ jo ~ e - 1) be the smallest index for which there exists an io (1 ~ io ~f) with aiojo a unit in o. Hence ifj < jo, then vo(aij) ~ 1 for all i. Since 1 aijoWi '# 12,0 the sum 1 aijow; is not divisible by n, and hence for the element
IJ=
IJ=
I
Y=
L1aijowi njo
i=
we have
On the other hand,
If j <jo, then If j > jo, then
Sec. 1 ]
FIELDS COMPLETE WITH RESPECT TO A VALUATION
261
Hence v(y) ~ min v(aijw j 1t j ) > jo. j*jo
This contradiction shows the linear independence of the elements (1.10) over the field k. Now let r:J. be any element of .0. By the corollary of Theorem 1 we have r:J.
==
~o
+ ~11t + .. , + ~e_l1te-1
(mod
e 1t ),
where ~ j are elements of some fixed system S of residues, which we may take to consist of numbers of the form (1.9). Since 1to and 1t e are associate in .0 [see (1.8)], then congruences in .0 modulo 1to and modulo 1t e are equivalent. Hence we have f.-I 0 )W 1t j ( mo d 1to, ) r:J. - " L, " L,(aij i i= 1 j=o
=
and this means that
Analogously,
Continuing this process indefinitely, we obtain a sequence of equations fe-I _"
r:J... -
"(n)
L, L, a ij W i 1t i= 1 j=O
j
+ 1t Or:J. n + b
(a..n+lE.o,
For fixed i andj we have an infinite sequence {aiJ}. Consider the series 00
"L,
a!n)~ n IJ "0 .
n=O
Since the aWare integers, this series converges and its sum element of k; that is, aij E o. We shall show that
aij
is an integral
(1.11 ) By the construction of the elements
r:J. 1 , r:J. 2 , •••
from which it follows that the difference
we have
262
LOCAL METHODS
[Chap. 4
is divisible by non (in the ring .0). Since this holds for all n, this difference must be zero and (1.11) is valid. If f3 is any element of K, then for some m, the element f3nom will be integral. Representing it in the form (1.11), we see that f3 is a linear combination of the elements (1.10) with coefficients in k. Hence the system (1.10) is a basis for K over k, and since for integral a. E K all coefficients aij in (1.11) lie in 0, this basis is fundamental. Theorem 4 is proved. Since there arefe elements in the basis (1.10), we also have the following result. Theorem 5. The product of the ramification index and the degree of inertia is equal to the degree n = (K : k); that is,
fe= n.
Set NK/k(n) = nomu, where u is a unit of the ring o. Taking the norm of (l.8), we obtain
!
I
II
where v is also a unit in o. It follows that n = me (and v = 1), and hence m = f Hence the degree of inertiaf of the extension Klk could also be defined by (1.12) where n is a prime element of the ring of integral elements of K. From this it easily follows that for any a. of the field K we have (1.13)
I I
'I
if I
I I
!
I
Note that Theorem 5 and (1.12) are both immediate corollaries of Theorem 5 and (5.12) of Section 5, Chapter 3. Definition. If e = 1, the extension Klk is called unramified. If e Klk is called totally ramified.
= n, then
It follows from Theorem 5 that for unramified extensions the degree of inertia coincides with the degree of the extension. For totally ramified extensions the residue class fields ~ and ~o coincide; that is, every integral element of K is congruent modulo n with an integral element of k. It can be shown (Problem 12) that if the residue class field ~ of the field K is separable over the residue class field ~o of the field k, then there is a uniquely determined intermediate field T, such that the extension Tlk is unramified and KITis totally ramified. The field Tis called the inertiafield of the extension
Klk.
Sec. 1 ]
FIELDS COMPLETE WITH RESPECT TO A VALUATION
263
1.5. Fields of Formal Power Series
Fields of formal power series are fields, complete with respect JO a valuation. These fields are constructed in the following manner. Let k o be any field. The set 0 of all formal power series of the type (1.14) becomes a commutative ring with unit when the operations of addition and multiplication are defined as is usual for power series. This ring has no divisors ofzero, and the units are precisely those series (1.14) in which ao -::j. O. The quotient field of 0 is called the field offormal power series in t over the field k o . It is denoted by ko{t}. Just as for the field of p-adic numbers (Section 3.3 of Chapter 1), every nonzero element ~ of the field ko{t} has a unique representation in the form
where m is some integer (positive, negative, or zero). Setting v(~) = m for ~ -::j. 0 and v(O) = 00, we obtain a valuation v, and the field ko{t} is easily shown to be complete under v. The ring of the valuation v coincides with the ring 0 of series of the type (1.14). As prime element in 0 we may take t. Since two series of the form (1.14) are congruent modulo t if and only if their initial terms coincide, then each residue class of modulo t contains a unique element of k o . Hence the residue class field lo of the field k o{ t} is canonically isomorphic to the field k o . It is easily seen that the field of formal power series ko{t} is a completion of the field of rational functions ko(t), under the valuation corresponding to the irreducible polynomial t of the ring ko[t] (see Problem 7 of Section 4 of Chapter 1). Since k o c k o{t} and k o ~ lo, the characteristic of a field of formal power series coincides with the characteristic of its residue class field. It can be shown that this property characterizes formal power series fields among all fields which are complete under a valuation. Namely, if k is complete under a valuation and its characteristic equals the characteristic of its residue class field, then k contains a subfield k o , where the elements of k o form a complete system of residues modulo the prime element 71:. But for such a system of residues, the operations with series (1.3) reduce to the usual operations with power series, and hence k will be a field of formal power series in n with coefficients from k o . The proof of the existence of the subfield k o in the general case is rather difficult, and we shall not give it. (Two special cases, for which the proof is relatively easy, are indicated in Problems 7 and 11.) If k o' is an extension of k o , then ko'{t} is an extension of ko{t}, and if k o' /k o is finite, then k o'{t} /k o{t} is also finite and has the same degree. Another
LOCAL METHODS
264
[Chap. 4
method for constructing finite extensions of the field ko{t} is to map it isomorphically into the field ko{u}, with t --+ un (n a natural number). If we identify k o{ t} with its image under this mapping, that is, if we set t = un, then ko{u} will be a finite extension of ko{t} of degree n. It is clear that ko{u} is obtained from ko{t} by adjoining an nth root of t. For fields of characteristic zero, all finite extensions can be reduced to these two types. Theorem 6. Let k o be a field of characteristic zero. If K is a finite extension of the field k = ko{t}, with ramification index e, then K is a subfield of an extension of the form ko'{u}, where k o' is a finite extension of k o and ue = t. ~o and ~ denote the residue class fields of k and K, let f denote the degree of inertia of Kfk, let n be a prime element of K, and for any ~ E K let ~ be its residue class in~. The elements of the field k o , as we have seen, form a natural system of representatives for the residue classes of ~o. We first show that there is a subfield S of K which contains k o and is a complete system of representatives for the residue classes of~. Since any finite extension of a field of characteristic zero is simple, then ~ = ~o(~), where ~ is some residue class of ~. Let F be the minimum polynomial of ~ over ~o. If we replace all coefficients of F (which are residue classes of ~o) by the corresponding elements of k o , we obtain an irreducible polynomial F over k o of degree f, for which
Proof. Let
F(~)
== 0 (mod n)
and
F'(~)
:f. 0 (mod n).
By the second remark at the end of Section 1.2, there is an element () in the field K with () = ~ and F«(}) = O. Consider the subfield S = k o«(}) of the field K. Since () is a root of an irreducible polynomial over k o of degreef, then (S : k o) =f, and every element of S has a unique representation in the form ao+a 1() +"'+G f - 1(} f- 1
(
GiE k) '0'
the corresponding residue classes modulo n are given by ii o + ~ = ~o(~) and (~: ~o) =f, these linear combinations run without repetition through all residue classes of~. This shows that the elements of the subfield S (which is a finite extension of the field k o) form a complete system of representatives for the residue classes of~. By Theorem 1 the field K is the field of formal power series in n with coefficients in S; that is, K = S{n}. Theorem 6 would be proved (in a stronger form) if we could show that n can be chosen so that it is an eth root of t. However, it is not always possible to choose n in this way, and therefore we must pass to some finite extension k o' of the field of coefficients S. By (1.8) we have Since () = iil~
~,
+ ... + iif_l~f-l. Since
I
=
nee,
(1.15)
Sec. 1 ]
FIELDS COMPLETE WITH RESPECT TO A VALUATION
265
where E is a unit in the ring of integral elements of K. Let IX denote that element of S for which IX == E (mod n) and denote by k o' the field S(~a) (if IX = y' for some YES, then k o' = S). The field of formal power series K' = ko'{n} clearly contains K as a subfield and is a finite extension of k. We show that it can be represented in the form ko'{u}, where u' = t. Consider the polynomial G(X) = xe - E. Since in K' we have G(y) == 0 (mod n)
and
G'(y)
"I- 0 (mod n),
where by y we denote the root ~a., then in K' there exists a unit 1'/ with 1'/ == y (mod n) and 1'/' = E (here we again apply the remark at the end of Section 2.2). Now replace the prime element n of the field K' by the element u = n1'/. Then K' can also be considered as the field of formal power series in U over the field k o', that is, K' = ko'{u}, where u' = t by (1.15). The proof of Theorem 6 is complete. Remark. Theorem 6 is no longer valid for arbitrary finite extensions of formal power series fields k = ko{t} of characteristic p "# O. However, it is easily seen that it remains true in the case of extensions Kfk, for which the residue class field ~ is separable over ~o and the ramification index e is not divisible by p.
PROBLEMS 1. A nontrivial metric rp of a field k is called discrete if the only limit point for the set of values rp(x), x E k, is zero. Show that any discrete metric is induced by some valuation v of k by (1.1). 2. Let k be complete under a valuation and let K/k be a finite extension with a fundamental basis 8" ... ,8•. Show that the elements
also form a fundamental basis for Kover k if and only if all a'J are integral and the determinant det(a'J) is a unit in k. /
3. Using the notations of Theorem 4, let oc
=
e-l
L L a'Jw,TTJ (al) E k) 1=, J=O
be any element
of K. Set m = min vo(a'J)' Show that if jo is the smallest value of the indexj for which there exists i = i o with vo(a'oJo) = m, then v(oc) = jo + em, where v is the valuation of the field K. 4. Show that every element of the field of formal power series k o {t}, not lying in k o • is transcendental over k o .
266
LOCAL METHODS
[Chap. 4
5. Under the assumptions of Theorem 6, show that the subfield S c K, which contains k o and is a complete set of representatives for the residue class field of K, is uniquely determined. 6. Let k be algebraically closed and of characteristic zero, and let k = k o {t}. Show that k has one and only one extension of degree n, for all natural numbers n, namely, k(':jt) (that is, show that any two extensions of degree n are isomorphic under an isomorphism which is the identity on k). 7. Let K be complete under a valuation with residue class field 1: of characteristic zero. Show that K contains a subfield S which is a full system of representatives for the residue classes of 1:, and hence that K = S {1T}, where 1T is a prime element of K. (Use the fact that any field can be obtained from the prime field by a pure transcendental extension followed by an algebraic extension.) 8. Under the assumptions of Problem 7, assume also that the> residue class field 1: is algebraic over the prime field. Show that the subfield S is then unique. 9. Let K be complete under a valuation with residue class field 1:. If 1: is a perfect field of characteristic p (which means that every element is a pth power, and the map which takes every element to its pth power is an automorphism), show that there is a unique" multiplicatively closed" system S of representatives of the residue classes of 1:, so that if IX E Sand f3 E S, then IXf3 E S. (Dete!:.mine IX E S, representing the class (E 1:, by IX = lim IX. P·, where IX. belongs to the class
e-..)
10. Under the same notations, assume that 1: is a finite field with pf elements. Show that in the field K the polynomial t Pf - t factors into linear factors and that the set of its roots is a multiplicatively closed system S of representatives of the residue classes of 1:. 11. Assume that the field K of Problem 9 also has characteristic p, and that its residue class field 1: is perfect. Show that then the multiplicatively closed system S will also be additively closed, so that it will be a subfield of K, and K = S {1T}, where 1T is a prime element of K. 12. Let k be complete under a valuation and let K be a finite extension. Assume that the residue class field 1: of K is separable over the residue class field 1: 0 of k. Show that among the intermediate fields L, k c L c K, which are unramified over k, there is a largest such field T (which contains all other intermediate fields which are unramified over k). Verify that the residue class field of T coincides with 1:, and that the degree (T: k) equals
(1: : 1: 0 ). 13. Let f(X) = x m + a,xm -1 + ... + a m be an irreducible polynomial over a field complete with respect to a valuation. If the constant term am is integral, show that all other coefficients a" ... , am -1 are also integral. 14. Let ~ be a primitive root of degree p' of 1 (s ~ 1). Show that the degree of the field over the field R p of p-adic numbers is (p - I)p' -1, and that the extension Rp(~)/R p is totally ramified. Rp(~)
p-l
15. Let ~ be a primitive root of degree p of 1. Show that Rp(~) = RpCv' - p). 16. Let k be complete under a valuation, K/k a finite extension, and 1: and 1: 0 the residue class fields of K and k. Show that if 1:/1: 0 is separable, then K/k has a fundamental basis consisting of powers of a single element (that is, D = ()[8], 8 E D, where D and 0 are the rings of integral elements of K and k). Hint: Show that if 1: = 1: 0 (8), then a representative 8 E D can be chosen so that f(8) is a prime element in D. Here f(t) E o(t) is chosen so that i(t) E 1: 0 (t) is the minimum polynomial of the element 8 E 1:.
Sec. 2]
FINITE EXTENSIONS OF FIELDS WITH VALUATIONS
267 <X>
17. In a field complete under a valuation, show that the infinite product II (I
+ an),
11=1
an # -1. converges if and only if an -+ 0 as n -+ 00.
2. Finite Extensions of Fields with Valuations
Let k be a field with a valuation vp and let Kjk be a finite extension. The ring u = up can be considered as a ring in which we have a theory of divisors with a unique prime divisor p. By Theorem I of Section 5, Chapter 3, if.o is the integral closure of the ring u in the field K, then .0 has a theory of divisors with a finite number of prime divisors ~1' .... ~m (all of which divide p). Let ~ be a prime divisor of the ring .0 and let K~ be the completion of the field K with respect to the valuation v~. Those elements of K~ which are limits of elements of k form a subfield, which is topologically isomorphic to the completion k p of the field k with respect to the valuation vp ' Using the embedding k p --+ K~ we may assume that k p is a subfield of the field K~. Let K = k(r1.1' ... , r1. r ). The elements r1.i E K also belong to K~, and since they are algebraic over k, they are also algebraic over k p • Hence the extension k p(r1. 1, ... , r1. r )jk p is finite (with degree not exceeding the degree of Kjk), and then by Theorem 2 of Section I the field k p(r1. 1• ... , r1. r ) is complete. Every element of K~ is the limit of a sequence of elements of K, and since K c k p(r1. b ... , r1. r ) and k p(r1. b ... , r1. r ) is complete, then K~ c k p(r1.b ... , r1. r ). Since the reverse inclusion also holds, we have K~ = k p(r1. 1, ... , r1. r ). We have proved that the extension K~jkp is finite and that (K~
: kp)
~
(K: k).
Since the residue class fields of the valuations vp and v~ coincide with the residue class fields of the completions k p and K~ (see Section 1.1), then the degree of inertiaf~ of the divisor ~ relative to p coincides with the degree of inertia of the extension K~(kp . It is also clear that the ramification index e~ of the divisor ~ relative to p coincides with the ramification index of K~(kp . By Theorem 5 of Section I the numbers f~ and e~ are related to the degree n~ = (K~ : k p ) by f~e~
=
n~.
For the rest of this section we shall assume that the extension K(k is separable, and we will study the relations between the various completions K~ , ... , K~n of the field K under all extensions of the valuation vp ' Let OJl, ... , OJ n be a basis for the extension K(k. If for some r1. E K, all the coefficients aj in the representation (2.1)
[Chap. 4
LOCAL METHODS
268
are small relative to p (that is, small relative to the valuation vp), then the element a. will clearly be small relative to each of the prime divisors ~s' The following converse also holds.
lemma 1. For any integer N there exist an M such that whenever vIll.(a.) ~
M for all s
= 1, ... ,m, then all coefficients a j in (2.1) satisfy
vp(a j ) ~ N.
Proof. Let
Wi *, .,. , W n * be the dual basis of the basis Wi> ... , W n (here we are already using separability; see Section 2.3 of the Supplement). Then aj
= SPK/k(a.W/) = Sp a.w/.
Let es be the ramification index of ~s relative to p and p be a prime element in the ring up of the valuation vp ' so that es = vlll. (p). Set
M
= max (esN
- vlll,(w/)).
s.j
If vlll,(a.)
~
M for all s, then for fixed j we have
vlll.(a.w/) ~ esN = Vllls(pN), and this means that a.w/ = pN y, where vIlls(Y) ~ 0 (1 ~ s ~ m). By Theorem 6 of Section 4 of Chapter 3, the element y belongs to the integral closure of the ring up in the field K, and therefore Sp y E up , that is, vp(Sp y) ~ 0, so that vp(a j )
= vp(Sp(a.w/)) =
Vp(pN Sp y) ~ N,
and Lemma 1 is proved. Corollary. If a sequence {a.d of elements from the field K is a Cauchy sequence relative to each of the prime divisors ~s (s = 1, ... , m), then all the sequences {all}:;' l ' defined by
a.k = a /k l wl + .. ,+
an (k l wn
(a /k) E k),
are Cauchy sequences relative to p. Now consider the completions Kill,' ... , K Illm of the field K with respect to each of the prime divisors ~l' ... , ~m and form the direct sum Kill, EEl .,. EEl K Illm , which we denote by K p • The elements of this direct sum are sequences e = (el' ... , em), where el E Kill' ... , em E K Illm • We define addition and multiplication in K p component-wise, so that K p becomes a ring. For any y E k p , set y(el' ... , em) = (yel, ... , yem)' The ring K p then becomes a vector space over the field k p • If we denote the degree of Kills over k p by ns , then the dimension of K p over k p is given by (2.2)
Sec. 2]
FINITE EXTENSIONS OF FIELDS WITH VALUATIONS
269
There is a natural way of defining convergence in the ring K p • We say that the sequence {(e 1(kl, ... , em (k»))~ 1> e/ k) E K~s' converges to the element (e 1, ... , em) if for each s the sequence g/k)} converges to es relative to convergence in the field K~s' It is easily seen that, under this definition of convergence, the operation of multiplication of elements of K p by elements of k p is continuous. In other words, if "I = limk-+ 00 y(k), y(k) E k p , and = lim k-+ 00 ~):(k) ,~):(k) E K p' then lim y(k)e(k) = "Ie. (2.3~
e
k-+
00
We now define an imbedding K .... K p , setting
a=
(ex, ... , ex)
E
Kp
,
(ex
E
K).
Since K c K~. for all s, the sequence (ex, ... , ex) is an element of K p • It is clear that the mapping ex .... a defines an isomorphism of the field K into the ring K p • We denote the image of K under this isomorphism by K. To avoid confusion we note that the components of the product
ya = ("lex,
... , "lex)
which in this representation look identical, actually may be different, since the product "lex depends on the field K~. in which it is taken, even when "lex E k p • Theorem 1. If WI' ... , W n is a basis for the separable extension K/k, then is a basis for the ring K p as a vector space over k p •
(01' ... , (On
Proof. We first show that K is everywhere dense in K p ' that is, that every element of K p is the limit of a sequence of elements from K. Let = (el' ... , em) be any element of Kp , es E K~s (s = 1, .. , , m). Since K is dense in K~s ' then for any natural number k there exists an element (ex?» E K for which v~.(es exs(k» ~ k. By Theorem 4 of Section 4, Chapter 3, there is an element ex(k) in the field K for which V~.<ex(k) - ex/ k» ~ k for s = 1, ... ,m. The element ex(k) satisfies v~.(es - ex(k» ~ k (s = 1, ... , m),
e
and this means that the sequence {a(k)};,= 1 of elements of K p converges in the ring K p to the element Represent each element ex(k) in the form
e.
ex(k)
= a 1(k)W 1 + ... + an (k)w n ,
(a /k)
E
k).
Since the sequence {ex(k)} is a Cauchy sequence relative to each prime divisor ~s, then by the corollary of Lemma 1 the sequences {ap)} are all Cauchy sequences relative to p, and hence they converge in k p • Set "I j = lim k-+ oo a/ k ) (j = 1, ... ,n). Since for any a Eke k p and any E K p ,
e
ae = de,
(2.4)
[Chap. 4
LOCAL METHODS
270
then II
II
12k = L il/k)Wj = L ap)w j . j=1 j=1 Passing to the limit as k
--+ 00
and considering (2.3), we obtain II
~
= lim 12k = LyjWj . j=1
k-oo
This proves that the elements wj generate the vector space K p • We still need to show that they are linearly independent over k p • Let
· k'IS d ense In . k p' t hen Yj = l'Imk=oo SInce a:(k)
aj (k)
= a l (k)W I + .,. + all(k)w
Then lim 12(k) = lim" L, a J(k)W.J
, ;
:I II
ii'
;! . J.. ,
k .... oo
h ,were II
E
aj (k) E
K.
=" ,,·W. = 0
k .... oo j
L, IJ j
k . Set
J
•
This means that the sequence {a:(k)} is a null sequence in K relative to all the prime divisors ~. (s = 1, ' .. ,m). Then by the corollary of Lemma 1 all the sequences {all} in k are null sequences relative to p, and this means that 'II = 0, .. , , 'III = O. The proof of Theorem 1 is complete. Remark. In terms of the tensor products of algebras, Theorem 1 shows that the algebra K p over the field k p is isomorphic to the tensor product K flh k p , that is, that it can be obtained from K (regarded as an algebra over k), by extending the ground field from k to k p •
We have shown that ~he dimension of the vector space K p over k p equals n = (K: k). On the other hand, this dimension is given by the sum (2.2). Since n. = nllls = elll.JIlls' we arrive at
L elllflll = n ~
(~ running through all prime divisors of the ring -0). Hence we have obtained another proof of Theorem 7 of Section 5, Chapter 3.
Theorem 2. Let q>(X) denote the characteristic polynomial of the element a: E K, relative to the separable extension Klk, and let q>Ill(X) denote the characteristic polynomial of a: relative to KIlllk p • Then q>(X)
= TI q>Ill(X), III
Sec. 2]
FINITE EXTENSIONS OF FIELDS WITH VALUATIONS
271
Proof. Consider the linear mapping ~ --+ a~ of the vector space Kp to itself.
If a.W k =
I?= 1 ak,w
ak1
"
E
k, then by (2.4), rXWk =
L ak,w, . I
This means that the characteristic polynomial of our transformation coincides with the characteristic polynomial of the matrix (a k1 ), that is, coincides with q>(X). We now take another basis for Kp over k p • Let PS j (j = 1, ... ,ns) be any basis for the extension K'(j.lk p (s = 1, '" ,m). If we denote by PSj that element of K p which is zero in all components except the sth one, and whose sth component equals PS j ' then the set of all elements Psj
(s=I, ... ,m;j=l, ... ,n,)
(2.5)
is a basis for the ring K p over k p • Let n.
(S)P sl' a. Psj - " L. Yjf 1= 1
so that q>dX) is the characteristic polynomial of the matrix (yj,), It is now easily seen that the matrix of the linear mapping ~ --+ a~ with the basis (2.5) will be a block-diagonal matrix with the blocks (yj,) on the main diagonal. Theorem 2 follows immediately. For elements a. E K we introduce the concepts of local norm N'(j(a.) and local trace SP'(j(a.): N'(j(a.) = N K'(lIkp(a.), From Theorem 2 we deduce that N K/k(a.)
= TI '(lip
N'(j(a.),
SPK/k(a.) =
I
SP'(j(a.)·
(2.6)
'(lip
From the first of these formulas and (1.13) we deduce vp(NK/k(a.» = If'(jv'(j(a.),
(2.7)
'l1IP
which we proved by other methods in Section 5 of Chapter 3.
e,
Theorem 3. Let K(k be a separable extension with primitive element so that K = k(e), and let q>(X) be the minimum polynomial of e over k. Let p be a prime divisor of k, and let
q>(X) = q>1(X)", q>m(X)
by the factorization into irreducible polynomials in kp[X]. Let \.01' ... , \.Om be the prime divisors of the field K which divide p. There is a one-to-one correspondence between the divisors \.Os and the factors q>.(X), such that the polynomial q>.(X) which corresponds to \.Os coincides with the minimum polynomial of the element e E K'(j, over the field k p .
272
Proof. Since q>(X) is the characteristic polynomial of () relative to K(k, by Theorem 2 we have the factorization q>l(X)", q>m(X), where q>j(X) is the characteristic polynomial for () relative to K~)kp' Hence the factors q>.(X) correspond to the prime divisors ~s' But we saw in the preceding section that K~. = k p «(}), () EKe K~., and therefore each of the polynomials q>.(X) is irreducible over k p ' and Theorem 3 is proved.
Remark. Let 0 be any ring with a theory of divisors (and quotient field k) and let V be a prime divisor in o. If K(k is a finite separable extension, then Theorem 3 gives a description of all prime divisors of the integral closure .0 of 0 in K which divide V (at least it gives their number m and the products e~f~)·
3. Factorization of Polynomials in a Field Complete with Respect to a Valuation
In view of Theorem 3 of Section 2 it is important to have a method for factoring polynomials into irreducible factors in a field complete under a valuation. In this section we shall show that in such fields the decomposition of a polynomial with integral coefficients is completely determined by its decomposition modulo some power of the prime element. lemma. Let
be a subring of the field k, and let g(X) and h(X) be polynomials of degree m and n with coefficients in o. If the resultant p = R (f, g) of the polynomials f and 9 is nonzero, then for any polynomial/(X) E o[X] of degree ~m + n - 1 there exist polynomials q>(X) and IjJ(X) in o[X] of degrees ~ n - 1 and ~ m - 1 such that 0
L aru s + r+s=i L brvs = pc;
(i
= 0, 1, ... , m + n
- 1).
(3.1)
r+s=i
Proof. Set
'I'
n
m
g(X) =
L a;X m- i,
L b;X n-;,
h(X) =
i=O
m+n-l
l(X) =
;=0
q>(X)
=
"-1
L UiX n- 1 - i,
L
Cixm+n-l-i,
i=O
IjJ(X)
;=0
=
m-l
L ViX m- 1 - i.
i=O
To determine the m + n unknowns U o , ... , Un-I; V o , ... , V m - b we equate the coefficients of like powers of X in (3.1), obtaining a system of m + n equations
L
r+s=i
1
1
I 1,: I,! ! :1
I
[Chap. 4
LOCAL METHODS
.• ,,'
Ii
aru.
+
L brvs =
r+s=i
pc;
(i
= 0, 1, ... , m + n - 1).
Sec. 3]
FACTORIZATION OF POLYNOMIALS
273
The determinant of this system equals
(3.2)
.....
n
m
(with zeros everywhere else); that is, it equals the resultant p = R(g, h). Since p "# 0, this system has a unique solution, and since all the constant terms PC i are divisible by p, then the values of U i and Vi will belong to the ring u. The lemma is proved. Now let k be complete under the valuation v, with u the ring of integral elements of k and n a prime element of u. Two polynomials f(X) and h (X) of u[XJ are called congruent modulo n\ and we write f(X) == h (X) (mod nk ), if this congruence holds for the coefficient of each power of X. Theorem 1. Let f(X) E u[XJ have degree m + n. Suppose that there exist polynomials go(X) and ho(X) in u[XJ of degrees m and n such that (1) f and goho have the same leading coefficients; (2) the resultant R(go , h o) is nonzero; and (3) if r = v(R(go, ho)), then
(3.3) Then there are polynomials g(X) and h(X) in u[XJ of degrees m and n, for which f(X) = g(X)h(X), g(X) == go(X),
h(X) == ho(X) (mod nr+l)
and the leading coefficients of g(X) and h(X) coincide with those of go(X) and ho(X), respectively. Proof. For each k ~ I we construct by induction polynomials
({>k E
u[XJ
of degree ~ m - 1 and t/Jk
i
I
, I
•I I i
[Chap. 4
LOCAL METHODS
274
o[X] of degree ~ n - 1 so that the polynomials gk =go + nr+1({J1 + + nr+k({Jk' r 1 hk = ho + n + t/Jl + + nr+kt/Jk
will satisfy
E
f== gkhk (mod n 2r +H 1).
I
(3.4)
Assume that we have constructed the polynomials ({Jl' ... , ({Jk-l and t/Jl' ... , t/Jk-l with the required properties, so that f = gk-lhk-l + n 2r +kl, (3.5) where leX) E o[X]. The polynomials go and gk-l' as well as h o and hk- b have the same leading coefficient, so leX) has degree ~m + m + n - l. Further, gk-l == go, hk- 1 == ho(mod nr+1) and therefore R(gk-b hk- 1) == R(go, ho) (mod nr+1), which means that V(R(gk-l' hk- 1)) = r. By the lemma there exist polynomials ({Jk and t/Jk in o[X] with degrees ~m - 1 and ~n - 1, for which nrl=gk_It/Jk+hk_l({Jk'
(3.6)
We shall show that (fJk and t/Jk satisfy our requirements. Since gk = gk-I
+ nr+k({Jk' hk- 1 + nr+kt/Jk'
then by (3.5) and (3.6) f - gkhk = n 2r +kl_ n r+k(gk_lt/Jk
+ hk- 1(fJk) ~ k + 1).
2r 2k n 2r +2k q,kt/Jk = - n + (fJkt/Jk'
so (3.4) also holds (since 2k Now consider the polynomials 00
g(X) = go
'I'
+
00
L nr+k({Jk'
heX) = ho +
k=1
L nr+kt/Jk>
k=1
whose coefficients (except the leading ones) are defined as the sums of convergent power series. Since 9 == gk and h == hk (mod n Hr + 1), then gh == gkhk (mod n r+H I), so that by (3.4)
f== gh (mod n r + H1 ).
Since this holds for all k, thenf = gh, and Theorem 1 is proved. Remark. From the proof of Theorem 1 it easily follows that if go and ho satisfy the condition f== goh o (mod n S ~ 2r + 1, instead of (3.3), then 9 and h can be chosen so that S
),
Sec. 3]
FACTORIZATION OF POLYNOMIALS
275
We consider an important special case of Theorem 1. The polynomial f(X) E u[X] will be called primitive if at least one of its coefficients is a unit in u. Let 1: be the residue class field of the ring u modulo the prime element 71:. If we replace the coefficients off by the corresponding residue classes in 1:, we obtain a polynomial j with coefficients in the field 1:. Assume that in the ring 1:[X] the polynomial jhas a decomposition (3.7) in which the factors iJo and no are relatively prime. We may choose polynomials 90 and ho in the ring u[X] so that the degree of 90 coincides with the degree of iJo and the degree and leading coefficient of the polynomials f and 90 h o coincide. Consider the resultant R(90, h o) of the polynomials 90 and h o , that is, the determinant of (3.2). If we replace each entry in this determinant by the corresponding residue class modulo n, we obtain the resultant ·R(iJo, no) of the polynomials iJo and no (here the leading coefficient of no may be zero). The resultant R(iJo, no) is nonzero, since by choice of 90 the leading coefficient of iJo is nonzero, and the polynomials iJo and no are assumed relatively prime. (Recall that the resultant of two polynomials is zero if and only if the two polynomials have a common divisor, or both of them have leading coefficient zero.) Hence R(90, h o) ¥= 0 (mod n)p that is, v(R(90, ho)) = r = O. Equation (3.7) means that f == 90ho (mod n). Thus we see that 90 and h o fulfill all the conditions of Theorem 1 (with r = 0), and we have the following result. Theorem 2 (Hensel's lemma). Let f(X) be a primitive polynomial with coefficients in the ring u of integral elements of a field complete under a valuation. If in the residue class field 1: the polynomialjE 1:[X] has a factorization (90' h o E u[X])
with iJo and no relatively prime, then there exist polynomials 90 and u[X] such that
ho in
f(X) = 9(X)h(X),
with iJ = iJo' n = no, and the degree of 9 equal to the degree of iJo . With the aid of Theorem 1 we can solve the problem of the factorization of polynomials over a field complete under a valuation. We need consider only polynomials with integral coefficients and leading coefficient 1 (if the leading coefficient of a polynomial of degree n in u[X] is a, then we can multiply the polynomial by (/'-1 and take aX as a new variable). Since Gauss's lemma on the factorization of polynomials with integer coefficients also
1,1 '
II
I i
,I
I
,I
, I
I
" I
.' t
,
.
II ,
276
LOCAL METHODS
[Chap. 4
holds for the ring o[X], then every irreducible factor of f(X) with leading coefficient 1 will lie in 0 [X]. If the polynomialf(X) does not have multiple roots (in any finite extension of the field k), then its discriminant D(f) = ± R(f, 1') is nonzero. Let d = v(D(f)), and consider any factorization f==
~
in which the leading coefficient of each
;[. Ii
f(X)
= gl (X) ... gm(X),
(3.9)
in which the polynomial gs E o[X] has the same degree as
m as large as possible, then f has a factorization of the form (3.9), in which each gs is irreducible and has the same degree as the corresponding
(3.10)
into irreducible polynomials in the ring ~[X]. Therefore we have the following Corollary. Let f(X) E o[X] have discriminant D(f) which is a unit in 0, and let the decomposition of J into irreducible polynomials in ~[X] be given by (3.10). Then there exist polynomials gl' ... ,gm in o[X] such thatf = gl ... gm and 91 = iP1' ... , 9m = iPm· PROBLEMS
1. Let k be complete under a valuation, K/k a finite separable extension with ramification index e, 0 and D the rings of integral elements of k and K, and 7To and 7T prime
Sec. 4]
METRICS ON ALGEBRAIC NUMBER FIELDS
277
elements in these rings. Show that if Cl ED is divisible by 7T, then SPK/.(Cl) is divisible by 7To. Deduce from this that SPK/k(7T 1 _eC) co. Apply Problems 12 and 16 of Section 2, Chapter 2, to this case to show that, if e > 1, then for any 8 ED with characteristic polynomial f(t), ['(8) is divisible by 7T. 2. Let k be a finite extension of the field of p-adic numbers, with ramification index e over R p and let 7T be a prime element of k. Assume that k contains a primitive pth root of 1, and that e is divisible by p - 1 (Problem 14 of Section 1). Show that any integral m I Cl E k, for which Cl c= 1 (mod 7T + ), where m = pe/(p - 1) = ps = e + s, is a pth power of some element of k. [Use the fact that if f3 = 1 + 7T e + k y (y integral), k> s, p = 7T e e - I , then f3 c= (l + 7Tkye)P (mod 7T e +k+I). Then apply Problem 17 of Section 1.) 3. Under the conditions of Problem 2 assume that Cl is congruent to 1 modulo 7Tm but is not a pth power in k. Show that k(r;/rx)/k is an unramified extension of degree p. [Find the characteristic polynomial f(t) of the element y = 7T -saj~ - 1) and verify that !'(y) is a unit; now apply the last assertion of Problem 1.) 4. Retaining the conditions of Problem 2, assume that Cl E k is integral and satisfies h the conditions: Cl == 1 (mod 7T ), Cl c= 1 (mod 7T h 1, (h, p) = 1, h < m = ep/(p - 1). Show that Cl is not a pth power in k and that the extension k(r;l~)/k is totally ramified. (Consider the exponent with which the prime element of the field k(J;j <0 occurs in 1 - Cl = n:~~ (l - ~ iJ;j Cl), where ~ is a primitive pth root of I.)
4. Metrics on Algebraic Number Fields
4.1. Description of Metrics In Section 4.2 of Chapter 1 we gave a description of all possible completions of the field R of rational numbers, these being the p-adic fields R p and the real field Roo· We now do the same for any algebraic number field k. As we saw in Section 1, each prime divisor p of the field k corresponds to the p-adic completion k, that is, to the completion under the metric
+ ... + <po(an)
If the metric
L1
i=
for all ¢ E k. But this bounded values.
IS
impossible, since a nontrivial metric never takes
278
[Chap. 4
LOCAL METHODS
By Theorem 3 of Section 4 of Chapter 1, the metric
L
and since all powers ak (k ~ 0) of a also lie in SJ p ' we must have
+ (3n/.
But then I =
+
I,
and we have a contradiction. Hence there is only one prime element n" for which
279
METRICS ON ALGEBRAIC NUMBER FIELDS
Sec.4]
on the field R can be extended to the field C in only one way, namely, as the metric I~IP, where 1~1 denotes the absolute value (modulus) of the complex number~. Let ljJ be some extension. Then we claim that ljJ(¢) = 1 for all ~ E C with I~I = 1. Otherwise we would have some ~ E C with 1 and I~I = 1. Taking a natural number n and setting ~" = IX + f3i (IX, f3 E R~:J, we obtain ljJ(~")
::;; ljJ(lX)
+ ljJ(f3)ljJ(i) ::;; 1 + ljJ(i),
since ljJ(lX) = IIXI P ::;; 1, and analogously ljJ(f3)::;; 1. But this is impossible, since ljJ(¢)" > 1 + ljJ(i), if n is sufficiently large. Now let ~ be any nonzero complex number. We have just shown that ljJ(~/lm = 1. Hence
which is what was to be shown. Every algebraic number field k of degree n = s + 2t (see Section 3.1 of Chapter 2) has n different embeddings in the field C of complex numbers (s real, and t pairs of complex conjugates). Let a be any such embedding. If for any ~ E k we set
= la(~W,
then the function of the field k under this metric we have the metric iP which is the only continuous extension of
is topologically isomorphic to the real field Roo. Ifwe denote by a the (unique) topological isomorphism of R to Roo, then for any')' E R we shall have iP(')') = la(')'W. Take in k a primitive element 0, so that k = R(O), and letf(X) be its minimum polynomial over R. Thenf(X) factors over the real field into s linear and t quadratic terms. Hence in the field R we have the decomposition f(X)
= (X
- 0,) .. , (X - 0s)(X 2
+ PIX + q,) ...
(X
2
+ PIX
+q,).
Since f(O) = 0, then 0 must be a root of one of these polynomials. Assume first that 0 = 0;. Since 0 E R and thus K = R(O) c R, then the isomorphism a : R -+ Roo induces a real embedding a: k -+ C, such that if ~ E k, then
=
ij5(~)
=
la(~W·
Hence the metric is topological1y isomorphic to the real field.
= R;
280
[Chap. 4
LOCAL METHODS
a
Now let be a root of one of the quadratic terms. In this case (R(O) : R) = 2, and hence the isomorphism a: R -+ Roo can be extended in two ways to an isomorphism a : R(O) -+ C. The induced mapping a : k -+ C is clearly a complex embedding of k in the complex field C. We have shown that there is only one metric on C which coincides with the metric IIX\P on Roo, namely, the metric 1r,IP, r, E C. Hence for any ~ E k, we have qJ(~) =
o o a o s b
P
that is, qJ = qJrr for the complex embedding a. The field l(", [which coincides with R(O)] is topologically isomorphic to the field of complex numbers. Hence we have proved the following theorem. Theorem 1. Any nontrivial metric qJ of the algebraic number field k of degree n = s + 2t coincides either with a p-adic metric (0 < p < 1,
S
~ E
o
s
g r
4
k),
corresponding to a prime divisor P, or with one of the s form (0 < p ::::; 1, ~ E k),
+t
metrics of the
n d
where a is an isomorphism of the field k into the field C of complex numbers. Definition. The completion k p of the algebraic number field k under the metric qJp is called the field of p-adic numbers. From Theorem 1 it follows that every completion of an algebraic number field is either a p-adic field, the field of real numbers (for s > 0), or the field of complex numbers (for t > 0). To emphasize the analogy between the metrics qJp and qJrr of the algebraic number field k of degree n = s + 2t, we introduce s + t = r new objects PI,oo' ... , Pr.oo' called infinite prime divisors, which correspond to the metrics qJrr' Ordinary prime divisors, distinct from the infinite ones, are then called finite prime divisors. The infinite prime divisor P = Pi.oo is called real if it corresponds to a metric qJrr with a real embedding a, and is called complex if the corresponding metric qJrr = qJij comes from a pair of complex-conjugate embeddings a and a. In the case of the rational field R there is a unique infinite (real) prime divisor p 00 , which we introduced in Section 7.2 of Chapter 1 and denoted by the symbol 00 Those prime divisors PI' .. , , Pm of the field k, which correspond to extensions of the p-adic valuation vp to k, are the divisors of the number p (considered as a divisor of the field R). In an analogous manner, we call the divisors PI.oo, '" , Pm, oc divisors of p 00' since the corresponding metrics are extensions of the metric Ixl P on the rational field.
v c
I c
s (
t a
f s
Sec. 4]
METRICS ON ALGEBRAIC NUMBER LINES
281
The ring K p , which we considered in Section 2, when specialized to the case of an extension klR and a rational prime p, is denoted by k p and consists of all m-tuples (¢1' ... ,¢m), where ~i E k p ,' The dimension of the ring k p as a vector space over the p-adic field R p is equal to n = (k : R) (Theorem 1 of Section 2). In an analogous fashion we can construct the ring k poo ' consisting of all (s + t)-tuples (¢1' ... ,¢" ¢s+l' ... , ¢s+t), where ¢i (I ~ i ~ s) belongs to the field of real numbers, and ¢s+t (1 ~j ~ t) to the field of complex numbers. The ring k poo ' being a vector space of dimension n = (k : R) over the real field Roo, clearly coincides with the ring Es.t , which we considered in Chapter 2, which was of such great interest in the study of the group of units and classes of modules of the algebraic number field k. The ring k poo will again playa large role in Section I of Chapter 5. 4.2. Relations between Metrics
For any prime divisor p of the field k (finite or infinite) we introduce the normed metric <{Jp' determined by special choice of p. If p is a finite prime divisor, then the normed metric <{Jp is determined by 1 ).p(~) «Jp(¢)= ( N(p)
(¢Ek),
where N(p) is the norm of the divisor p. For an infinite real prime p, which corresponds to the real embedding a : k -+ C, set (¢
E
k).
Finally, if p is an infinite complex prime divisor, corresponding to the pair of complex embeddings a and a, then the normed metric <(Jp is given by «Jp(¢) = la(¢)1 2 = la(¢)1 2 = a(¢)a(¢).
Note that the last function la(¢W, is not, strictly speaking, a metric in the sense of the definition of Section 4.1 of Chapter 1 since the triangle inequality (4.2) does not hold. However, since la(¢)1 2 is the square of a metric, it can also be used to define convergence in the field k, and therefore we shall consider it a metric. For any ¢ i= 0 of k we clearly have only a finite number of prime divisors p, for which «Jp(¢) i= 1. Therefore, the formally infinite product Dp<{Jp(¢) makes sense. Theorem 2. For any ¢ i= 0 of the algebraic number field k, the values of the normed metrics satisfy (4.3)
(p runs through all prime divisors of the field k, finite as well as infinite).
282
[Chap. 4
LOCAL METHODS
Proof. Let P and P' denote the products of all (fJp(¢), taken over the infinite and the finite primes, respectively, so that the left side of (4.3) equals PP'. From the definition of the normed metric for infinite p, we have
P=
I) laWI = II) a(~)\ = IN(¢)I
(here a runs through all n = s + 2t embeddings of k in the field C). On the other hand, by formula (7.1) of Chapter 3, the norm of the principal divisor (¢) = Dp pVpw (here p runs through all finite prime divisors) equals
INWI = N(I) pvpw) =
II N(pYpw = ~"
which proves the theorem.
PROBLEMS
1. Let r:p" ... , r:p, (r = s + I) be the metrics of the algebraic number field k, of degree s + 2/, which correspond to the infinite prime divisors. Show that for any i = 1, ... , r there exists a number g in k for which
n
=
(j# i).
Show that the metrics r:pl> .. , , r:p, define different notions of convergence on k. 2. Show that every relation of the form
(g E k*) between the normed metrics r:pp of an algebraic number field k is a consequence of the relation (4.3), that is, show that this will hold for all g E k* only if there is some integer m with m p = m for all lJ.
5. Analytic Functions in Complete Fields 5.1. Power Series
Let k be a complete field with valuation v. We have already studied some properties of series in k (see Section 1.2 of this chapter and Section 3.4 of Chapter 1). We know that the series 1 an converges in the field k if and only if an -+ 0, as n -+ 00; that for convergent series the operations of addition, subtraction, and multiplication by a constant can be carried out termwise; and that the order of terms in a convergent series may be changed and the sum remains the same. From this it easily follows that if we take all products aibj of terms of the two convergent series I~l a i = sand IJ=l b j = t, and
I:;"
Sec. 5]
283
ANALYTIC FUNCTIONS IN COMPLETE FIELDS
write them in any order, then the resulting series will converge and its sum will be st. We note, for future use, a simple theorem on double series. Recall that the double series (5.1)
is said to converge to the sum s, if If=lIJ=l aij ....... s as m, n .......
00.
The series
are called the repeated series of the double series (5.1). Theorem 1. If, for any N, for all but a finite number of pairs (i,j) we have v(aij) > N, then the double series (5.1) converges and its sum equals the sum of each of the repeated series, which also converge. If we form a simple series of all the terms of the double series (5.1) in any way, then the simple series will also converge, and to the same sum.
The proof of this theorem is completely elementary, and is left to the reader. Any series in k of the form 00
f (x) =
•
I anxn = a 0 + a 1x + ... + anxn + "', n=O
(5.2)
with an E k is called a power series. If (5.2) converges for x = X o E k, then we claim that it converges for all x E k with v(x) ~ v(x o). For any such x we have n v(an~) ~ v(anxo ), and therefore the terms allxn also converge to zero as n ....... 00. Thus if we set 11 = min v(x), where x runs through all values of k for which (5.2) converges, then the region of convergence will consist of all x for which v(x) ~ 11 [or (5.2) will converge for all x]. If we have two power series f1(X) = I:'=o an~ and f2(X) = I::': 0 bn~, then by their product we mean the power series obtained by formal multiplication, that is, the series h(x)=I:'=ocnxn, where Cn = Ii+j=naibj' Let the series f1(X) andf2(x) converge for v(x) ~ III and v(x) ~ 112' It is then clear that the series h(x) will converge for v(x) ~ maX(1l1' 112), and that its sum will equal f1(X)fix). A power series f(x) is a continuous function of x in its region of convergence. Indeed, all terms an~ for n ~ 1 can be made as small as desired by taking x to be sufficiently small. Hence f(x) ....... ao = f(O) as x ....... 0; that is, the function f(x) is continuous at the point x = O. Now let C be any value in the region of convergence of the series f(x). Replacing each term an~ by the
i II
284
I
expression an(c + y)n, expanding each term and taking the sum, we obtain a power series fc(y). For all values of y from the domain of convergence of f(x), we have
I
LOCAL METHODS
:1 I
i
I
I
I I i
l
+ y) = fc(Y)·
fCc
.'1
[Chap. 4
(5.3)
It now follows that fc(y) -+ fc(O) as y -+ 0, and hence f(x) -+ f(c) as x -+ C, so f(x) is continuous at x = c. A function f(x), defined on some domain in a complete field with valuation,
and represented on this domain by a convergent power series, is called an analytic function. Consider a power series g(y) = b1y
+ ... + bnyn + ...
without constant term. We claim that it is possible to substitute g(y) for x in a power series f(x), and obtain a series F(y) in y. For if (5.4) then F(y) = a o + cllY
+ (C 12 + C22)y2 + .,. + (c ln + C2n + ... + cnn)yn + .,.
Theorem 2. (On Substitution of Series in Series). Let the series f(x) converge for vex) ~ Jl. If the above series g(y) converges for some y E k and v(bmy"') ~ Jl for all m ~ 1, then the series F(y) also converges (for this value of y) and
F(y)
= f(g(y».
Proof. Consider the double series
(5.5) From (5.4) we have cnmym =
L
anba,ya, ... banyan.
a1 • ...• a"~
1 al +···+an=m
Let N = min v(bmym). Then
Since N = v(xo) for some X o and for x = x o , the series f(x) converges; then v(a n) + nN = v(anxon) -+ 00, and this means that v(cnmym) -+ 00 as n -+ 00 uniformly for all m. Further, for fixed n the series (5.4) converges (being the
Sec. 5]
ANALYTIC FUNCTIONS IN COMPLETE FIELDS
285
product of convergent series), and therefore v(cnmym) -> CIJ as m -> CIJ. This proves that the double series (4.5) satisfies the conditions of Theorem 1. By this theorem both repeated series for (4.5) converge and have the same sum. We now need only note that and and Theorem 2 is proved. In the next two sections we shall also consider analytic functions of n variables, that is, functions which can be represented as power series, f(x , •.• , x ) = La.! ....nXI·! ... X • n. n
n
l
aI, _..• a:,,~O
Suppose that the seriesf(x t , ... , x n ) converges in the region in n-dimensional space over k consisting of all points with v(xJ ~ N (i = 1, '" ,n). If c = (c t , ... , cn ) is a point of this region, then, just as in the case of one variable, we easily obtain
+ C\>
f(x i
+ cn) = fc(x t ,
... , X n
for all points of the region vex;)
~ N
... ,
x n ),
[fc also converges for vex;) ~ N].
5.2. Exponential and Logarithmic Functions In this section we assume that k is a finite extension of the p-adic field R p • We denote by the valuation of k by v, the ramification index over R p bye, and a prime element of the ring of integral elements of k by n. Consider in k the power series exp x = I log(l
X
X
Z
xn
+ -l! + -2! + ... + -n! + ... '
+ x) = x
X
Z
__
2
(5.6)
n
x + ... + (_I)n - t _ + ...
(5.7)
n
We shall find the region of convergence of the series (5.6). Since the prime number p occurs in n! with exponent [nip] + [nlpz] + "', then yen!) = e([!2J P
+
[nzJ P
+ ... ) <
en
f ~P = ~, P-
k= I
1
and hence n
v(X ) = nv(x) _ yen!) > n(v(x) _ _ e_). n! P- 1
(5.8)
1
I I
286
LOCAL METHODS
[Chap. 4
If vex) > ej(p - 1), then v(x"jn!) -+ 00, and the series (5.6) converges. On the other hand, if vex) ::;; ej(p - 1), we have for n = pS,
v(:~) = nv(x) -
e(ps-1
= nv(x) _
+ ... + P + 1)
e n - 1 = n (V(X) _ _ e_) p-l p-l
+ _e_ p-l
::;; _e_ , p-l
and hence for such x the general·term in (5.6) does not converge to zero. This shows that the series (5.6) converges precisely for those x for which vex) ~ K, where K=
[_e1+ p-l
1.
Formal multiplication of the power series exp x and exp y is easily seen to give the series exp(x + y), and hence for vex) ~ K and v(y) ~ K we have the formula exp (x
+ y) = exp x·exp y.
(5.9)
We now turn to the series (5.7). If vex) ::;; 0, then v(x"jn) does not converge to infinity as n -+ 00, and hence for such x the series (5.7) does not converge. Now let vex) ~ 1. If n = pan1 , (n 1, p) = 1, then pa::;; nand yen) = ea ~ e(ln njln p), so that v(X") n
=
nv(x) _ yen)
~ nv(x) _
e In n , ~p
and this means that v(x"jn) -+ 00 as n -+ 00. Hence the series (5.7) converges if and only if vex) ~ 1. If vex) ~ 1, then the element 8 = 1 + x is a unit in the ring 0 of integral elements of the field k, with 8 == 1 (mod n). Conversely, if a unit 8 satisfies this congruence, then it has the form 8 = 1 + x, where vex) ~ 1. We can such a unit of the ring 0 a principal unit of the field k. The series (5.7) hence defines a function log 8 on the multiplicative group of an principal units of the field k. We shan show that for any two principal units 81 and 82 , we have the formula (5.10)
Let 81 = 1 + x, and
82
= 1 + y, and let
v(y) ~ (x), so that y
=
tx with t integral
+ x)(1 + y) = 1 + (t + l)x + tx 2 • We shan consider the expression (t + l)x + tx 2 as a power series in x, for which an terms lie in the region of convergence of the series log(l + z). Since the (l
Sec. 5]
ANALTYIC FUNCTIONS IN COMPLETE FIELDS
formal substitution of this expression in the series 10g(1 + 10g(1 + tx), then by Theorem 2 10g(1
287
+ z) gives log(l + x)
+ (t + I)x + tx 2) = 10g(1 + x) + log(l + tx),
which proves (5.10). The formal substitution of the series (5.7) in (5.6) and of the series exp(x - I) in (5.7) give us the following formal identities: exp log (I log
+ x) = I + x; exp x = x.
(5.11) (5.12)
Since these are formal identities, to verify them we can assume that x is a complex variable and use the theorem on substitution of series in series for complex power series (see, for example, K. Knopp, "Elements of The Theory of Functions," Sections 41 and 45, Dover (New York, 1952). To see under what conditions the formal identities (5.11) and (5.12) can be considered as equations in k, we turn to Theorem 2. By this theorem (5.11) will hold provided the terms of the series log(l +x) satisfy v(x"(n) ~ K. For n = I this gives us v(x) ~ K. But if v(x) ~ K, then v(x"(n) ~ nK ~ K for I :s;; n :s;; p - I and v(X") _ K
n
~ (n
_ I)K _ V(II)
> (n _ I) _e_ _ e In n p-I
~p
= e(n
- 1)
In p
(~
_
p - I
~) ~ 0
n - I
for n ~ p ~ 2 [here we are using the fact that the function In t(t - I) for t ~ 2 is monotone decreasing]. Hence (5.11) is valid under the condition v(x) ~ K. Further, under this condition, v(log(1 + x)) ~ K. We turn to formula (5.12). It follows from (5.8) that if v(x) ~ K, then every term of the series exp(x - I) is contained in the region of convergence of the series 10g(1 + x), and this means that (5.12) holds for all x for which exp x is defined. We denote by A the additive group of all x E k for which v(x) ~ K, and by M the multiplicative group of units of the form 8 = I + x, X E A. We have shown that the mapping 8 -+ log 8 (8 E M) is a homomorphism from the group M to the group A. We now show that the mapping x -+ exp x is a homomorphism from A to M. In view of (5.9) we need only show that v(x"(n!) ~ n for all x E A and all n ~ I. Let p" :s;; n < p" + 1. Then
vG~) - ~ (n - I)K - e([~] + [;2] K
~
(11 - I)e en p" - I p- I p" p -·1
~O,
+ ... +
[~])
[Chap. 4
LOCAL METHODS
288
which is what is required. Formulas (5.11) and (5.12) now show that the mappings log: M -+ A and exp: A -+ M are both one-to-one and are inverses of each other. Hence we have the following result. Theorem 3. The mapping x -+ expxis an isomorphism of the additive group of all numbers of the field k which are divisible by nrc (K = [e(p - 1)] + 1) onto the multiplicative group of all principal units e which are congruent to 1 modulo nrc. The inverse isomorphism is given by e -+ log e [for e == 1 (mod nrc)].
The mapping e -+ log e is, in general, not an isomorphism on the whole group of principal units (Problem 5). Also, the value of log e is not necessarily integral. In real analysis one also considers the exponential function aX = eX In a. Its analog in the field k is the function r(
= exp(x log 1"/),
(5.13)
where 1"/ is a principal unit of the field k. This function is defined provided that K - v(log 1"/). Therefore if 1"/ == 1 (mod nrc), then I"/x will make sense for all integral x of k, and the value of I"/x will satisfy I"/x == 1 (mod nrc). If 1"/ == 1 (mod nrc) and x and yare any integral elements, then we have the formulas
v(x) ~
PROBLEMS 1. Let f(x) be an analytic function in the region v(x);;o}-L (in a complete field with valuation v). If f has an infinite number of zeros in the region v(x);;o }-L, show that f is identically zero. 2. Let k be a field of characteristic zero, complete under a non-Archimedean metric r:p (Problem 4 of Section 4, Chapter I). Assume that the metric r:p satisfies r:p(p) < 1 for some rational prime p. Show that the region of convergence of the series logO + x) is the set of all x for which r:p(x) < I, and the region of convergence of the series exp x is given by r:p(x)
<
p- ;,jr:p(p).
3. Under the same conditions, determine the regions of convergence of the series x2"
CD
sin x
=
:E ("=1
I)" -1
-1
(21/ - I)! '
cos
X =
CD x 2" ~ ( - I)" (2 ) , .
"- 0
n .
4. Find the error in the following proof of the irrationality of the number1T. The number is the smallest positive number for which sin 1T = O. Let 1T be rational. Since 1T> 3, the numerator of 1T must be divisible either by an odd prime p, or by 2 2 (in the latter case set p = 2). From this it follows that the series sin x and cos x converge in the p-adic field R p 1T
Sec. 5] for
ANALYTIC FUNCTIONS IN COMPLETE FIELDS
x = "TT.
289
But in view of the formula sin (x + y)
it follows from sin
"TT =
=
sin x cos y + cos x sin y
0 that sin
n"TT=
0
for all natural n. The function sin x thus has an infinite number of zeros in its region of convergence. But then, by Problem I, it would be identically zero, which is a contradiction. 5. Let k be a finite extension of the p-adic field R., and let e be a principal unit of k. Show that log e = 0 if and only if e is a root of degree p' (s;;. 0) of I. 6. Keep all notations of Section 5.2. The principal units e, for which e """ I (mod TT'), form a multiplicative group, which we call M •. The integers of k which are divisible by TT' form an additive group A•. Show that for k;;. K the mapping e -+ log e is an isomorphism of the group M. onto the group A. (the inverse mapping being x -+ exp x, x EA.). 7. In any complete field with valuation, show that the region of convergence of a power series f(x) = a.x· is contained in the region of convergence of its derivative f'(x) = na.x· Give an example in whieh f and f' have different regions of convergence (with the field k of characteristic zero). 8. Show that in the ring of 2-integral rational numbers the sum
2::= 0
2::=0 -I.
2· 22 23 2+2"+3+ ... +-;;can be made divisible by any given power of 2, by taking n large enough. 9. Show that all coefficients a. of the series 2
X'
E.(x)=exp ( x+ -
p
) + x· 2 -+ ...
p
are p-integral rational numbers. (Hint: Show that the number
equals the number of elements in the symmetric group of nth degree which have order a power of p. Use the theorem which says that if d divides the order of the finite group G, then the number of elements U E G which satisfy the equation ud = I is divisible by d.) 10. Show that E.(x)
=
TI
(I - x'") -p(.. l/r.
(m,p)=!
(m runs through all natural numbers relatively prime to p; f-L(m) is the Mobius function). 11. Let TJ be a principal unit in a finite extension field of the field R., and let x be a p-adic integer. Choose a sequence of natural numbers {a.} which converges to x. Show that lim._ oo TJa. exists and that it is independent of the choice of {a.}. Further, show that the function TJa = lim TJa. '_00 of the p-adic integer x coincides with the function (5.13).
II' ~ 'I :1 "I
,I ,I ,
LOCAL METHODS
290
[Chap. 4
6. Skolem's Method
In this section we study the application of Skolem's method to equations of the form
I
(6.1 )
·1
:1
where F is an irreducible, decomposable, nonfull form (Section 1.3 of Chapter 2), and c is a rational number. This method is based on some simple properties of local analytic manifolds over p-adic fields, which will be proved in the next section. An example which illustrates the idea of Skolem's method was given at the beginning of this chapter.
I
6.1. Representation of Numbers by Nonfull Decomposable Forms
i i ,i
,I
In Section 1.3 of Chapter 2 we saw that (6.1) can be written in the form
,>; .:\
(6.2)
'1"
or ,"·
I' 1
if l'
N(rx)
= a,
(rx EM),
(6.3)
where Ill' ... ,11m are numbers of some algebraic number field k, and M = {Ill' ... ,11m} is the module generated by these numbers (a is a rational number). Replacing, if necessary, the form F by a form integrally equivalent to it, we may assume that the numbers Ill' ... ,11m of the module M are linearly independent over the field R of rational numbers. Since Mis nonfull, m < n = (k: R). In Chapter 2 we saw how to find all solutions to (6.3) when M is a full module of k. It is thus natural to embed M in a full module M, and to use the methods of Chapter 2 to find all solutions of the equation N(rx) = a, rx E M, and then to pick out those solutions which lie in M. It is clear that any module of k can be embedded in a full module. To do this it suffices to extend the linearly independent set Ill' ... ,11m to a basis Ill' ... ,Iln of the field k and to set M = {Ill' ... ,Iln}· If all rx E M for which N(rx) = a have already been found, then we shall obtain all solutions of (6.3) if we can isolate those solutions for which in the representation
rx = the coefficients X m+l
X m + 1, ... 'X n
Xllll
+ ... + xnll n
are equal to zero. To express the conditions
= ." = X n = 0 directly in terms of rx, it is convenient to use the dual basis
III *, ." , Iln * (see Section 2.3 of the Supplement). Since the trace Sp Iljlli * is 0 for i t= j and 1 for i = j, then Xi = Sp rxlli* (1 ~ i ~ n). It follows that the
Sec. 6]
SKOLEM'S METHOD
291
numbers IX E M which lie in the submodule M are characterized by the conditions Sp IXlli* = 0
(i = m
+ I, ... , n).
(6.4)
By Theorem 1 of Section 5, Chapter 2, all solutions of the equation N(IX) = a, IX E M, can be written in the form (1
~j ~
k),
(6.5)
where "11' ... ,Yk is a finite set of numbers of the module M with norm a; is a system of independent units of the field k; and U 1 , •.. , Ur are arbitrary rational integers. From (6.4) we see that every solution to (6.3) corresponds to a solution in one of the k systems of equations
Cl' ... , Cr
(i = m
+ 1,
... , n)
(6.6)
in rational integers Ul> '" , Ur (here "I is one of they j)' Let K be an algebraic number field which contains all fields conjugate with k, and let 0"1' ... , O"n be all isomorphisms of k into K. Since Sp ¢ = 0"1(0 + ... + O"n(O for any ¢ E k, then the system (6.6) can be written n
L O"/yp/)O"iCt)"' ... O"icr)"' = 0
j=1
(i
=
m
+ 1,
... ,n).
(6.7)
It is clear that to prove the finiteness of the number of solutions to the equa-
tion (6.3) it suffices to show that each system of the form (6.7) has only a finite number of solutions in rational integers U 1 , '" , u.' Remark. The set of all numbers of the field k of the form cl"' ... Cr "', where u1 , ... , U r run through all rational integers, is a multiplicative subgroup of k which we shall denote by U. The solutions of (6.3) then coincide with numbers of the intersections
M n yjU
(j
= I, ... ,k).
(6.8)
Instead of the set (6.8) we may also consider the similar set "I j -1 M n U. Then the problem of finding all solutions to (6.1) is reduced to the problem of finding the intersection of a module and a multiplicative subgroup of the field k. We note also that we may replace the module M by the vector space L (over R) which is spanned by Ill> ... , Ilm' For yjU c M and L n M = M, so L n "I j U = M n Yj U. 6.2. The Relation to Local Analytic Manifolds
The idea of Skolem's method is that in some cases we can prove the finiteness of the number of solutions of (6.1) by proving that the system (6.7)
I
. I··
292
,1
[Chap. 4
LOCAL METHODS
has only a finite number of solutions even when the variables U 1, ... , U r take on ~-adic integral values (that is, they take integral values in the completion K",), where ~ is any prime divisor of the field K. After such an extension of the domain of possible values for the variables, we may consider the set of all solutions of the system (6.7) as a local analytic manifold in r-dimensional space, and then apply properties of such manifolds. When we allow that variables U 1, ... ,Ur in (6.7) to take ~-adic values, we encounter the obstacle that the exponential function SU = exp(u log s) is only defined for all integral ~-adic u when s == 1 (mod ~") (I( is an integer which depends only on the field K",; see the end of Section 5). We avoid this difficulty in the following manner. By Problem 6 of Section 7, Chapter 3, there exists a natural number q such that any integer rx E K which is not divisible by ~ satisfies
rx q == 1 (mod Each exponent
(6.9)
in (6.5) can be written
Ui
,: ; I
~").
Ui
= Pi
and hence the unit s =
+ qv
SI u,
...
j ,
(0::;; Pi
< q,
Vi
EZ),
s/r has the representation (l
= 1, ... , qr),
where fJ I is one of the q' numbers (0::;; Pi < q).
Hence we obtain a new representation for numbers rx of the form (6.5), in which Si is replaced by Si\ and the finite set of numbers "I j by the set of numbers "IjfJ I • Since the Si are units, then the congruence (6.9) holds for all of the numbers a j(sJ, and hence the functions a /Siqt are defined for all ~-adic integers U E K",. We have proved the following result. Lemma 1. After making new choices, if necessary, for the numbers "Ij and Si in (6.5), the functions aj(sJu are defined for all integers of the field
K",. In the future we shall assume that this condition holds without special mention. We turn to the system (6.7). In view of (5.9) and (5.13), we can put this system in the form n
LA j;!
, Ii
ij
exp L/u!, ... , u.)
=0
(i
=
rn
+ 1,
... , n),
(6.10)
Sec. 6]
SKOLEM'S METHOD
293
where r
Lj(u l ,
... ,
2: Uk log aiek),
ur ) =
k=1
A ij
=
aj(yp/).
Since the left side of (6.10) consists of power series which converge for all integral U 1 , ... , Un and hence represent analytic functions, then the set of all solutions to (6.10) can be interpreted as a local analytic manifold (in a neighborhood of any solution) in the sense of the definition of Section 7. The system (6.10) consists of n - m equations in r variables. It is natural to expect that the manifold defined by this system will consist of only a finite number of points, provided that n - m ~ r. Recall that the number r comes from the Dirichlet theorem on units, and r = s + 1 - 1, where s is the number of real, and 1the number of pairs, of complex embeddings of the field k in the field of complex numbers. Since n = s + 21, then n - m ~ r if and only if 1 ~ m - 1. In the simplest interesting case m = 2, and the condition reduces to 1 ~ 1. This means that there should be at least one pair of complex embeddings of k. This case leads to Thue's theorem, and will be considered in the next sections. Assume that the system (6.10) has an infinite number of solutions (ulS' ... , urs ), S = 1,2, .... Since the ring of ~-adic integers is compact (see Theorem 6 of Section 3, Chapter 1, and the second remark at the end of Section 1.2 of this chapter), we can choose from this sequence a convergent subsequence, the limit of which we denote by (u 1 *, ... , U r *). It is clear that the point (u 1 *, ... , ur*) is also a solution of (6.10), that is, that it lies on the manifold determined by this system, and that in any neighborhood of this point there are an infinite number of points of the manifold. We now change variables by the formula ~-adic
(1::;;;::;;r).
The system (7.10) then becomes
2:" A~ exp L/v
l , ... ,
vr )
=0
(i
=
m
+ 1, ... , n),
(6.11)
j= I
where A i/
= Aij expLj(ul*'
... , ur *).
The constant terms of the series on the left of(6.11) are all zero. We denote by V the local analytic manifold (see Section 7) determined by (6.11) [in the neighborhood of the point (0, ... , 0)]. Since this manifold does not consist of a single point (any neighborhood of the origin contains an infinite number of points of the manifold), then by Theorem 2 of Section 7 the manifold V
294
LOCAL METHODS
[Chap. 4
contains an analytic curve; that is, there is a system of formal power series
(not identically zero and without constant terms) with coefficients from a finite extension of K<JJ' such that the series (6.12) identically satisfy the relations n
L1A;j exp Pit) =
0
(i
j;
=
m
+ 1, ... , n),
Hence we have the following result. Theorem 1. If the equation (6.1) has an infinite number of solutions, then at least one local analytic manifold of the type (6.11) [for some I' = I'j and some point (u 1 *, '" , u:)] contains an analytic curve.
This theorem is the heart of Skolem's method. It reduces the question of the finiteness of the nnmber of solutions to (6.1) to the proof that the systems (6.11) do not have solutions in formal power series, that is, that the corresponding local analytic manifolds do not contain analytic curves. Note that there are n - r linear relations on the n series Pit) defined by (6.12): n
LB j;l
1j
Pit) =
o.
(1 ::;; i ::;; n - r),
since they are linear combinations of the r series Wk(l). Thus the existence of an analytic curve on V implies the solvability [in power series P;(t) without constant term] of the system n
L A;j exp Pit) = 0 j;l
(m
+ 1 ::;; i
::;; II),
(6.13)
n
L BijP/t) = 0 j;l
(1 ::;; i ::;; n - r = t
+ 1),
in which both groups of equations are linearly independent. [The linear independence of the equations of the first group follows from the fact that the determinant det a/I'Il;*), whose square is the discriminant of the basis I'lli*' is nonzero, and hence the rank of the matrix (Ai) (m + 1 ::;; i ::;; n, 1 ::;;j::;; n), and, consequently, of the matrix (Aij*)' isn - m.] If we assume that n - m ~ r, then the total number of equations in (6.13) will be ~n.
295
SKOLEM'S METHOD
Sec. 6]
6.3. Thue's Theorem
Thue's theorem states that if the formf(x, y) = aox· + a 1 x"-l y + + a.y· in two variables with rational integral coefficients is irreducible and has degree n ~ 3, then the equation (6.14) f(x,y) = c 00.
has only a finite number of solutions in integers. Since forms in two variables are always decomposable, and when n > 2 they are nonfull, then the equation (6.14) is a special case of (6.1). Here m = 2, so to apply Skolem's method we must have t ~ I; that is, the equation f(x, I) = 0 must have at least one complex root. In such a case we shall say that the formf(x, y) has complex roots. We shall prove Thue's theorem by Skolem's method under this assumption. In other words, we shall prove the following assertion. Theorem 2. If the formf(x, y) has integral coefficients, is irreducible, has degree ~ 3, and has at least one complex root, then the equation f(x,y)
=c
has only a finite number of solutions in integers. Proof. We assume that the coefficient a o of x· equals I [otherwise we can multiply (6.14) by ao·-] and replace aox by x]. Setk = R(8), K = R(8], ... ,8.),
where 8 = 8J, 82 ,
00'
,
8. are determined by the decomposition f(x, I)
= (x + ( 1 ) " , (x + 8.).
For eachj = I, n we denote by aj the isomorphism of k to K which takes 8 to 8j ' Since f(x, y) = N(x + y8) (N denoting the norm of kj R), then (6.14) can be written in the form (6.3), where by M we mean the module {I, 8}. Hence in this case /1] = I, /12 = 8 (m = 2). Assume that the equation (6.3) has an infinite number of solutions IX = x + y8 in the module M = {l, 8}. Then for some I' = I'j E k, an infinite number of these solutions will be of the form (6.5), where the independent units e1' ... , e, of k satisfy Lemma I. The exponents u 1, ... , u, in (6.5), corresponding to each solution IX, will satisfy the system (6.10). We choose among the so that the corresponding points solutions IX a sequence lXI' IX2 , 00.
,
00.
(s=I,2,oo.)
(6.15)
converge to some point (u] *, ... , u:). In Section 6.2 we saw that the local analytic manifold V, defined by (6.11), contains an analytic curve (01(1), (O,(t), and for any such curve on V the series (6.12) satisfy some system of the form (6.13). The rest of the proof of Theorem 2 is based on the following important auxiliary result. 00'
,
296
[Chap. 4
LOCAL METHODS
Lemma 2. Let there be given a system of equations
L• a
ij
exp P j = 0
j= 1
(6.16)
•
L bijP
j
= 0
j=l
in which each group of equations is linearly independent. If n1 = n - 2, n2 ~ 2 and if the system has a solution in formal power series P1 (t), ... ,p.(t) without constant term, then Pk(t) = Pit) for at least two distinct indices k and j. [The coefficients aij and b ij , as well as the coefficients of the series Pit), lie in some fixed field of characteristic zero.] We give the proof of this lemma below, but now we show how the lemma implies Theorem 2. By Lemma 2, for any curve Wl (t), ... , wr(t) on V, Pk(t) = Pit) for at least two distinct indices j and k; that is, (6.17)
I
In r-dimensional space consider the points (Vl' '" ,vr ) of the manifold W which are determined by
1
:.~; 1
I
J
i:l '.'1·: :1 t
It follows from (6.17) that any curve which lies on the local analytic manifold V also lies on W. But then by Theorem 3 of Section 7, V c W; that is, all points of the manifold V, contained is some sufficiently small neighborhood of the origin, also belong to W. But on the other hand, we shall now show that among the points (Vis' ... , vrs ) E V, which are obtained from the points (6.15) by U is = u i * + Vis and which converge to the origin, only a finite number lie on the manifold W. This contradiction will prove Theorem 2. Let a = x + yO and a' = x' + y'O be two points of the sequence {as}, for which the corresponding points of V lie in the manifold determined by L k = L j • If a = yet"! ... er"' and Ui = u i * + Vi' then Ui
=
Ui*
+ Vi'
u/a) = u/y)u/e 1)"!· .. · U/er)"'·u/et)"! ... U/e r)"'
=
Cj
exp L/v 1 ,
... , Vr )
(with cj independent of a) and analogously uk(a) = Ck exp L k (v 1 ,"',
Vr),
SKOLEM'S METHOD
Sec. 5]
297
so that oia) uk(a) --=--
In precisely the same fashion we find that uia') = uk(a') cj Ck
From the last two equations we obtain x+Y(}j X+Y(}k = , x' + y'(}j x' + y'(}k
so that (xy' - X'Y)«(}k - ()) = 0,
and since ()j ¥-
(}k,
then xY' - x'y = 0.
This means that x + Y() = d(x' + y'(}) for some rational d. Taking norms and using the fact that N(a) = N(a'), we obtain d n = I, so that d = ± I, and a' = ±a. Thus each of the n(n - 1)/2 manifolds given by L k = L j , whose union is W, contains not more than two points of V which correspond to numbers of the sequence {a.}. Then W contains at most n(n - 1) such points. Thus any neighborhood of the origin contains points of V not lying on W, so V (as a local analytic manifold) is not contained in W, which contradicts our earlier conclusion that V c W. As we have noted, this contradiction proves Theorem 2. Proof of Lemma 2. Since the first group of equations linearly independent (and n1 = n - 2), we can, after changing the numbering if necessary, express exp Pi (i = I, ... ,n - 2) in terms ofexp Pn - 1 and exp Pn :
exp Pi = a i exp P n -
1
+ hi exp Pn •
(6.18)
If a i = 0, then from the absence of constant terms and the equation exp Pi = hi exp Pn , we deduce that hi = I and Pi = Pn • Hence we may assume that all a i are nonzero. Set
(i=I, ... ,n-l) and assume that all Qi are nonzero. By (6.18) we have exp Qi
= a i exp
Qn-l
+ hi'
(6.19)
298
LOCAL METHODS
[Chap. 4
so that by differentiation with respect to t (Problem 10) we obtain Q;' exp Qi = aiQ~-l exp Qn-1'
(6.20)
From (6.19) and (6.20) we deduce
Q;'=Q~-lexpQn-1
il !
,
!
I
ci
(i = 1, ... , n - 2),
1
+ exp Qn-1
(6.21)
where Ci = biai-1. We now use the second group of equations of (6.16). By assumption there are at least two linearly independent equations in this group. Hence we can find a nontrivial relation among the Qi : n-1 I diQi = 0. i= 1 Differentiating this identity and replacing Q;' by (6.21), we obtain
,
Qn-1 exp Qn-1 and since Q~-l ¥-
(n-2 I
di i= lei + exp Qn-1
+
dn - 1 ) = 0, exp Qn-1
°and exp Qn-1 ¥- 0, then n-1
d.
I
I
i=l Ci + exp Qn-1
=0
(6.22)
(here we take Cn- 1 = 0). We claim that (6.22) can hold only if the rational function n-1
d.
I-' i= 1 C + Z
(6.23)
i
is identically zero. Assume that it is not, that is, assume that (6.23) equals rp(z)fljJ(z), where rp(z) ¥- 0. Then since rp(exp Qn-1) = 0, the nonconstant formal power series exp Qn -1 is the root of a polynomial, which contradicts the assertion of Problem 4 of Section 1. It is clear that the function (6.4) can vanish identically only when Ck = cj for at least two distinct indices j and k. Since C = ba, we then find from (6.19) that
from which it easily follows that Pk = Pj • Lemma 2 is proved. Remark. Skolem's method allows us to prove that (6.14) has only a finite number of integral solutions. But it does not give an algorithm for finding
Sec. 6]
SKOLEM'S METHOD
299
these solutions. The reason is as follows. After proving that the system (6.7) has only a finite number of 'P-adic solutions, it is easy to find an algorithm for the computation of the coefficients in the expansions of these solutions in power series of the prime element. However, there is no algorithm which allows us to judge from a finite number of coefficients whether or not we are dealing with a rational solution. This defect is shared by all known proofs of Thue's theorem. Even when the equation (6.14) is of third degree, there is no known algorithm for finding all integral solutions, or even for determining if there exist any solutions. 6.4. Remarks on Forms in More Variables The following question now arises: Under what conditions does an equation of the type (6.1) with a nonfull decomposable form have only a finite number of solutions in integers? Such equations sometimes have an infinite number of solutions. As an example consider the equation x4
+ 4 y4 + 9z 4
-
4X 2 y2
-
6X 2 Z 2
-
12 y2 z2
= N(x + yJ"2 + zJ3) = 1
[the norm is taken in the extension R( .)2, .)3)/R]. This equation has two infinite sets of solutions, given by the formulas
x
+ y.)"2 =
±(l
+ .)2)"
(z = 0),
x
+ z.)3 =
±(2
+ .)3)"
(y
= 0).
The reason for this is that, setting z = 0 or y = 0, we obtain from our form the square of a full form: (x 2 - 2y 2? or (x 2 - 3Z 2 )2. This occurs because the module {l, .)2, .)3}, which corresponds to our form, contains full modules of smaller fields, namely, {l, Ii} c R( .)2) and {l, .)3} c R( .)3). We describe a general class of forms with analogous properties. We write (6.1) in the form (6.3) and consider the vector space L (over R) which is generated by the numbers of the module M. The module M is called degenerate if the corresponding space L contains a subspace L' which is similar to some subfield k' c k, where k' is neither the field of rational numbers nor an imaginary quadratic field. We show that for a degenerate module the equation (6.3) has an infinite number of solutions (at least for some a). For if L' = yk' (y E k) and M' = L' (\ M, then y- 1 M' is a full module of the field k'. By the assumptions on the field k' the number of fundamental units in any order is nonzero, and therefore the equation (6.24) has an infinite number of solutions (provided it has at least one solution). Set
300 al
LOCAL METHODS
= Nk/R(y)d, where r = (k : k').
[Chap. 4
Since
Nk/R(~Y) = (Nk'/RWYNk/R(y) = a
and ~y E M' c M [for any ~ which satisfies (6.24)], then the equation Nk/R(f/) = al' f/ E M, has an infinite number of solutions. The basic conjecture on equations of the form (6.1) is that any such equation has only a finite number of integral solutions, provided that the associated module is not degenerate. Apparently the only known approach to this hypothesis lies in Skolem's method (application of which, as we have seen, requires the additional restriction that t ~ m - I). The basic stage at which the condition m = 2 was used in the proof of Theorem 2 was Lemma 2. The generalization of Lemma 2 to the case nl + n z ~ n (instead of nl = n - 2 and n z ~ 2) is apparently the principal obstacle to a proof of the above hypothesis (in the case t ~ m - 1). Skolem proved this generalization in the case n = 5, nl = 2, n z = 3 and thus deduced that the number of solutions to (6.1) is finite when n = 5, m = 3, t = 2 [T. Skolem, Einige Satze uber p-adische Potenzreihen mit Anwendung auf gewisse exponentielle Gleichungen, Math. Ann. 111, No.3, 399/424 (1935)]. This indicates the validity of our hypothesis in the case n = 5 (under the condition t ~ m - I; the nondegeneracy of the module does not arise here since the field k has prime degree and hence has no subfields). The validity of the hypothesis was proved for m = 3 (and hence under the restriction that t ~ 2) by Chabauty [C. Chabauty, Sur les equations diophantiennes liees aux unites d'un corps de nombres algebriques fini, Ann. Mat. Pura Appl. 17, 127/168 (1938)]. His method, however, avoids consideration of the system (6.16) by introducing other more refined techniques. Hence the generalization of Lemma 2 remains unproved even in the case n l = n - 3 (except for the case n = 5 considered by Skolem).
PROBLEMS
1. Let the series f(t) = ao + alt + a 2t 2 + for all p-adic integral values of t. If
...
withp-adic integral coefficients converge
(k = 2, 3, ...),
show that the equationf(t) = 0 has precisely one solution in p-adic integers if I'p(ao) ;;. I'p(a l ), and has no solution in p-adic integers if I'p(ao) < I'p(a l ). 2. Let d> 1 be a square-free natural number, and let (a, b) and (a, b) be two nontrivial (distinct from (1,0» integral solutions of the equation
x3
+ dy 3 = 1.
Sec.6] Set
13 =
SKOLEM'S METHOD
a + b-tld and
13, =
a,
+ b, -tid in
. .
the cubic field K
301 =
R(-tld). Show that then
13 =13,
for rational integers U and v, at least one of which is not divisible by 3. 3. Keeping the notations of the preceding problem, assume that d", ± 1 (mod 9). Then in K we have the decomposition 3 = \)' (Problem 24 of Section 7, Chapter 3), and hence the completion Kp of the field K is of degree 3 over the field R, of 3-adic numbers. Assuming that v", 0 (mod 3), set t = u/v. Show that the number t, considered as a 3-adic integer, is a root of the equation
2:'" ant n =
(*)
0,
11=2
where an = (1 /n!) Sp«log 1])n), 1] = 13'. (Here Sp denotes the trace for the extension K p / R, .) Show that the series in (*) converges when t is any 3-adic integer. (Hint: Show that Sp(log 1]) = 0 and Sp 1]' = 3, 1], = 13 ,'.) 4. Show that the coefficients an in (*) satisfy v,(a2) = v,(a,) = flo + 3, v,(an) > flo + 3 where flo = 1I,(a'b'd) (II, being the 3-adic valuation). [Hint: Use the fact that if 1] = 1 + 3x, x = ab-tld 13, then
log 1]
== 3x -
for n> 3,
9 - x 2 + 9 x' (mod 34 +U),
2
and also the fact that the trace of any element of the ring 0,[ -tI d] is divisible by 3 (0, is the ring of 3-adic integers).] 5. Using Problems 1 to 4, show that the equation x' + dy' = 1, with d '" ± 1 (mod 9). has at most one nontrivial solution in rational integers. 6. Prove the assertion of the preceding problem in the case d == ± I (mod 9). [Hint: Recall that the number 3 factors in K = R(-tId) in the form 3 = \)2 q (Problem 24 of Section 7, Chapter 3), and carryover the considerations of Problems 3 and 4 to the direct sum K, = Kp lOB K q (see Section 2). The logarithmic function on K, is defined just as on a field; the series converges for all g = (a, f3) E K" where a and f3 are principal units in Kp and K q • The trace Sp(g) is defined as the trace of the matrix of the linear mapping f gt (f E K,), and therefore for any elements of i (Section 2) it coincides with the trace of the corresponding number of K.] 7. Let the seriesf(t) = ao + a,t + a2t2 -!- ... have p-adic integralcoefficients,andconverge for all p-adic integral values of t. If an is a p-adic unit and a, == 0 (mod p) for all s> n, show that the equation f(t) = 0 has at most n solutions in p-adic integers. 8. Let the sequence of integers -)0
Uo, Ul, ... , U II
,
•••
(**)
satisfy the recurrence relation Un = a 1 Un _, + '" + amUn_ m (am # 0) with rational integer coefficients a" ... ,am' Assume that the polynomial g;(x) = x m - a,x·' - I - ... - a., has no multiple roots. Show that there exists a natural number M with the following property: For each residue class modulo M, either all Un (with n in that class) are equal, or no number occurs infinitely often among the u. (that is, show that the sequence (**) is either periodic or else assumes any given value only a finite number of times). [Hint: Use the formula u. = A,a,· + ... + Ama mn [a, are the roots of (g;(x)] and the fact that for any prime p and natural number M the function aiM. = exp (x log aiM) will be an analytic function for all p-adic integral values of x.]
Sec. 7]
LOCAL ANALYTIC MANIFO LDS
303
a function on the points of V lying in some e-neighborhood of the origin (the neighborhood depending on the function). Hence we call the factor ring D the ring of analytic functio ns on V. Defini tion. The local manifold V is called irreducible if the ring of functions DjW. v on V has no divisors of zero. Otherwise it is called reducib le. The investigation of local analytic manifolds is based on three simple facts, one from algebra and two dealing with the properties of power series. We state them without proof, giving references.
Lemma 1. Let gl(t), ... ,gm(t) be polynomials of k[t] with leading coefficient 1. There is a system hI' ... , hr of polynomials in several variabl es, one for each coefficient of the gj' and with integer coefficients, such that if the coefficients of gl (t), ... , gm(t) are substituted for the corresponding variable s in hI' .. , , h" then hI = ... = hr = 0 if and only if gt> ... ,gm have a commo n root in some extension of k. If m = 2, then r = I and hI is the resultant of the polynomials gl and g2 . The general case is easily reduced to this case. The proof is given in " Modern Algebr a" by B. L. van der Waerden, Vol. II, Section 77, Ungur, New York, 1950. Lemma 2. Suppose that the power series f(x , .. , ,x ) is such that all l n terms of degree
= x/ + CPl(X l ,
k l ... , Xn_l)x n -
+ ... + CPk(X l ,
.. , ,
xn-
l ),
where CPt> ... , CPk are power series in Xl' ... 'X - with zero constan t n l term. This corollary to the Weierstrass prepara tion theorem is proved by O. Zariski and P. Samuel, in "Comm utative Algebra," Vol. II, p. 145, Princet on University Press, Princeton, N.J., 1960. Note that the condition that the coefficient of x/ be nonzero always can be obtained after a linear change of variables. Further , it is easily checked that if .we have a finite setfl' ... ,fm of power series, then a linear change of variables can be found so that they all satisfy this condition simultaneously. Lemma 3. Any ideal W. of the ring D has a finite set of generators; that is there exist power series hI' ... , hs in W. such that any hEW. can be represe nted in the form
for some gt> .. , ,gs of D.
i !
304
LOCAL METHODS
[Chap. 4
The proof of this lemma can also be found in "Commutative Algebra," Vol. II, p. 148. We need Lemma 3 for the proof of the following theon:lil. Theorem 1. Every local manifold is a finite union of irreducible manifolds. Proof. Let the manifold V be determined by (7.1). If V is reducible, then
there exist power seriesfand 9 in .0, which do not vanish on the points of V in any neighborhood of the origin, but such thatfg is identically zero on V in some neighborhood of the origin. Let VI and VI' be the manifolds obtained by adjoining to the system (7.1) the equations f(X) = and g( X) = 0, respectively. It is clear that VI and VI' are submanifolds of V and that
l
t
°
If the manifolds VI and VI' are irreducible, then the theorem is proved. If one of them is reducible, then we can, in the same way, represent it as the union of two proper submanifolds. Continuing this process, we either represent V as a finite union of irreducible submanifolds, or we obtain an infinite sequence of manifolds V
= Vo
# VI # V # ....
(7.2)
~Vi
be the ideal of the variety
2
We show that the second case is impossible. Let Vi' From (7.2) it follows that ~v
# ~lvl # ~V2 # ....
(7.3)
Denote by ~ the union of the ideals ~Vi' By Lemma 3 the ideal ~ is generated by a finite system of series hI' ... , h s • Since each series of ~{ is contained in some ideal ~Vi' then there is an integer k such that all the series hi> .. , , hs are contained in ~Vk' But then ~ c ~Vk and hence ~Vk = ~{Vk+ 1 = "', which contradicts (7.3). Theorem I is proved. We now describe a general method for studying local manifolds, based on a reduction to manifolds in spaces of lowest possible dimension. Let the manifold V in the space f(n be defined by the equations (7.1). Assume that V is different from kn, so that the series fI' .. , ,fm (m ~ 1) are not identically zero. Also assume that we have made a linear change of variables so that the polynomialsfi all satisfy the conditions of Lemma 2. Then, by this lemma, we can find power series eI(X), ... , em(X) in .0 with nonzero constant term, such that (7.4)
where qJij = qJij(XI' .•. ,xn - I ) are power series in n - 1 variables with zero constant term. Since ei(X) #- in some e-neighborhood of the origin, then the
°
J
Sec. 7]
LOCAL ANALYTIC MANIFOLDS
305
local manifold V is also given by the system of equations 91(X) = 0, ... ,9m(J{) = 0,
(7.5)
where each 9j is a polynomial in X n with leading coefficient 1. We now apply Lemma I to these polynomials. The corresponding polynomials hI' '" , h, in the coefficients of the polynomials 91' ... ,9m will be power series in Xl' ... , Xn- l without constant term, and since all the qJij converge in some e-neighborhood of the origin, then the series hI' ... ,h, will converge in the same neighborhood. Consider now the local manifold W in the space [(n-l defined by the equations hl(Xl , ... ,xn- l )
= 0, ... ,h,(xl , ... ,xn- l ) = 0.
It is clear that a point (al> , an-I) E [(n-l belongs to W if and only if there , an-I' an) E V. Thus W is a projection of the exists an an such that (at. manifold V into the hyperplane X n = 0. Here each point (aI' ... , an-I) E W is the projection of a finite set of points (aI' ... ,an-I' an) E V, since an is a common root of the polynomials 9i(a l , ... ,an-l,xn). The passage from the manifold V to its projection W gives us a method for investigating local manifolds. ,
Definition. By a curve in the space [(n we mean a system of n integral formal power series wl(t), .. , , wn(t) which have zero constant term and have coefficients in k or in some finite extension of k (and are not all identically zero).
For our purposes it is not necessary to assume that the series Wi(t) = + a i ,t 2 + ... converge, and it is simpler not to do so. Thus a curve does not consist of a set of points, but only of a collection of series Wi(t).
a i ,!
Definition. We shall say that the curve wl(t), ... , wn(t) lies on the manifold V if for any series I(x l , ... , x n) of the ideal ~v the power series I(wl (t), ... , wn(t)) is identically zero.
Our basic result on local analytic manifolds is the following. Theorem 2. Any local manifold either coincides with the origin or contains some curve. The proof will be given by induction on the dimension n. By Lemma 3 the ideal ~v has a finite set of generators. Hence we may assume that the system (7.1), which defines the variety V, consists of a set of generators for the ideal ~v . For n = I the variety V consists only of the origin if at least one of the series Ii is not identically zero, and coincides with P
If
306
LOCAL METHODS
[Chap. 4
if allfi are identically zero. In the second case any series w(t) satisfies the system (7.1). Now let n > 1. The assertion of the theorem is clear if allfi are identically zero (or if m = 0). Therefore we assume that none of the series fl' '" ,fm (m > 0) are equal to zero. Further, assume that these series are in the form given by Lemma 2, so that instead of the equations (7.1) we have V given by (7.5), where 9i is determined by (7.4). Let W be the projection of V in k·- I . By induction, we assume the theorem valid for W. If W coincides with the origin, then the local manifold V will be defined by the equations 9;(0, .. , ,0, x.) =
°
(1
~
~
i ~ m),
~
i
p
F
(7.6)
whose coefficients lie in the field k l {t} of formal power series in t over k l . Further, these polynomials have a common root X. = ~ in some finite extension n of the field kl{t}. By Theorem 6 of Section 1 the field n is contained in the field of formal power series k'{u}, where u e = t for some natural number e, and k' is a finite extension of k l • Hence the element ~ can be represented as a power series ~ = w(u) with coefficients in k'. Since ~ is a root of the polynomials (7.6), which have leading coefficient 1 and integral coefficients in the field k l {t}, then the series w(u) is an integral element of the field k' {u}, that is, it does not contain any term with negative exponent. In the representation (7.4) all the series qJij have zero constant term. Substituting the series wl(u e ), •.• , w._I(u e ) for XI' ... , X._I in (7.4), and substituting w(u) for X., we see that the series w(u) has zero constant term and that (1
c(
o
w
i ~ m),
and it will also coincide with the origin. If W is different from the origin, then W contains a curve wl(t), ... , w.-I(t). Let k l denote a finite extension of the field k which contains all coefficients of the power series WI' ... , w._ I . From the definition of W it follows that if we substitute the series WI (t), ... , W. -I (t) in the series 91' ... ,9m for XI' ... , X._ I then we obtain m polynomials in X., (1
S
~
"
T L
cc
al
si s~
h T1 ,( ,(
Sl m).
Since the series WI' .. , , W. -I are not all zero, then the set of series WI (u e ), .•. , w._I(u e), w(u) is a curve in k·. By assumption the seriesfl' ... ,1m' and thus also the series 91' ... ,9m generate the ideal ~[y. Hence for any seriesf(xl , •.• , x.) of my we have
a
f
a and this means that the curve wl(u V. Theorem 2 is proved.
e
), '"
,
w._I(u e ), w(u) lies on the manifold
Sec. 7]
LOCAL ANALYTIC MANIFOLDS
307
Theorem 3. If V and V' are two local manifolds in j(n, where V is not contained in V', then there is a curve in i(n which lies on V and does not lie on V'. Proof. We can assume that the manifold V is irreducible, since otherwise we may replace V by one of its irreducible components. Let the manifold V' be defined by the equations
where F j is a series of the ring.o. Since V¢ V', then at least one of the series F j does not vanish on the points of V (in any neighborhood of the origin). We denote this series by F(X) and will show that there is a curve wl(t), '" , wn(t) on V for which F(w l (t), ... , wn(t)) ¥- 0.
The proof will proceed by induction on n. We can clearly assume that the series F(X) satisfies the conditions of Lemma 2, so that there exists a series e(X) = e(xl , ... , x n) ED with nOnzero constant term so that e(X)F(X)
= G(Xl>"" x n) = X/ + !/JIX/- I + ... + !/Jk'
(7.7)
where !/JI, ... , !/Jk are series in XI' •. , ,xn - I • In the case V = j(n (in particular, if n = 1) Theorem 3 is proved, for example, by taking wl(t) = '" = wn-I(t) = 0, wn(t) = t. If V¥- j(n, then we consider the projection We j(n-I of the manifold V (here we assume that the series II' ... ,fm, as well as F( X), satisfy the conditions of Lemma 2; as we have seen, this can be achieved by a linear change of variables). The manifold W is also irreducible, since the ring of functions on it, that is, the factor ring .on-I/'lI w = .on-I, is a subring of the ring of functions .o/'lI v =.6 on V (as .on -I c .0 and 'lI w c 'ltv). For each series lED we denote by J the corresponding function of .6. It follows from (7.4) that
x:' + iPiIX/,-1 -* ... + iPik, = 0, and this means that the function xn of the ring .on -I' It follows that the functions G
.6 is integral over the subring
.7: - k - I .7: = X-nk + '!'IX + ... + '!'k n
also are integral over .6n-I' We take an equation
G + LI G S
I
S -
+ '" + L s =
°
(7.8)
LOCAL METHODS
308
[Chap. 4
with smallest possible s. It is clear that Is ¥- 0, since then we could obtain an equation for G with smaller s. Hence the series L s E D n - 1 does not vanish on the points of W (in any neighborhood of the origin). By the induction hypothesis there exists a curve w 1(t), ... ,Wn -1(t) in the space f(n-1 which lies on W and is such that L s(w 1(t), ... , Wn -1(t)) ¥- 0. In the proof of Theorem 2 we saw that there exist curves of the form W1(U'), ... , w n - 1(u'), w(u) which lie on the manifold V. We shall show that for such a curve (w(u) ¥- 0)
and hence that this curve does not lie on the manifold V'. For if the series on the left were identically zero, then by (7.8) we would have L s(Wl(U'), ... , w n - 1(u') = 0,
and after replacing u· by t, Llwl(t), ... , W n -l(t) = 0,
which is impossible by choice of the curve Wl(t), ... ,Wn -l(t). Theorem 3 is proved.
>.4
an on
)0-
W aw he
CHAPTER
5
Analytic Methods
Dn
3
In Chapter 3 we saw how important a role the number h of divisor classes of an algebraic number field played in the arithmetic of the field. Thus one would like to have an explicit formula for the number h, in terms of simpler values which depend on the field K. Although this has not been accomplished for arbitrary algebraic number fields, for certain fields of great interest (such as quadratic fields and cyclotomic fields) such formulas have been found. The number of divisor classes is a characteristic of the set of all divisors of the field K. Since all divisors are products of prime divisors and the number of prime divisors is infinite, then to compute the number h in a finite number of steps we must use some infinite processes. This is why, in the determination of h, we shall have to consider infinite products, series, and other analytic concepts. The apparatus of mathematical analysis can be applied to solve many problems of the theory of numbers. In this chapter we give an example of the application of this apparatus by using it to compute the number of divisor classes. 1. Analytic Formulas for the Number of Divisor Classes
1.1. The Dedekind Zeta Function The determination of the number h of divisor classes of the algebraic number field K is based on consideration of the Dedekind zeta function (K(S), defined by (1.1) 309
[Chap. 5
S
where a runs through all integral divisors of the field K, and N(a) denotes the norm of the divisor a. We shall show that the series on the left side of (1.1) converges for I < s < 00, and is a continuous function of the real variable s on this interval. Further, we shall obtain the formula
e:
ANALYTIC METHODS
310
= hK,
lim (s - IKK(s)
(1.2)
5-1+0
where K is a constant which depends on the field K in a simple manner, and which will be computed in the course of the proof. Formula (1.2) becomes valuable because the function (K(S) also has a representation as an infinite product (K(S)
=
I) 1 -
[ljN(p)'J ,
(1.3)
where a runs through all integral divisors of the given divisor class C, and the exterior summation is taken over all h classes C. To prove that the series (1.1) converges, it suffices to show that each of the series I
=
VI
VI
a
I
carried out over all prime divisors p of the field K, this representation being called Euler's identity. If for the field Kwe have a good knowledge of the prime divisors (that is, if we know how rational primes factor into prime divisors in K), then we can obtain an explicit expression for h from formulas (1.2) and (1.3). By this route we shall obtain formulas for h in later sections when K is a quadratic or a cyclotomic field. We break the series (1.2) into the sum of h series
fcCs)
d
L N( a )5
(1.4)
aEC
o s a r
~
t s
c s t
r ~
t t
E
converges for s > I. Further, if we show that for each class C the limit lim (s - l)fc(S) 5-1 +0
exists and has the same value K for each divisor class C, then we will have obtained formula (1.2). We now transform the series (1.4) into a series over certain integers of the field K. In the inverse divisor class C- I we choose an integral divisor a'. Then for any a E C the product aa' will be a principal divisor: aa'
= (c<),
1:
s
(c< E K).
It is clear that the mapping
, a
-+
(c<)
(a
E
C)
5
Sec. 1 ]
e )
establishes (for fixed a') a one-to-one correspondence between integral divisors a of the class C and principal divisors (a) divisible by a'. Using the equality
s
)
FORMULAS FOR NUMBER OF DIVISOR CLASSES
N(a)N(a')
311
= IN(a)l,
we obtain
ids) = N(a')S
" L... (a)
-1-
IN(a)IS'
(1.5)
a=O(modo')
where the summation is taken over all principal divisors of the field K which are divisible by a: Since two principal divisors (al) and (a2) are equal if and only if the numbers a1 and a2 are associate, then we may consider that the summation in (1.5) is taken over a complete set of nonzero pairwise-nonassociate numbers of the field K which are divisible by a'. To put the series (1.5) in a still more convenient form, we use the geometric representation of points of the field K by points in the n-dimensional space 9ln = £S,I and in the logarithmic space 9l s + 1 [here n = s + 2t is the degree of the field K; see Section (3.3), Chapter 2]. We shall determine a cone X in 9ln such that in each class of associate numbers of the field K there is one and only one whose geometric representation lies in X (by a cone we mean a subset of 9ln which, whenever it contains any nonzero point x, also contains the whole ray ~x, 0 < ~ < (0). In Section 3 of Chapter 2 (all notations of which we preserve), we defined a homomorphism x -+ I(x) of the multiplicative group of points x E 9l n with nonzero norm N(x) to the additive group of vectors of the logarithmic space 9l s + 1 by formula (3.13). If e.. ... ,e, is some system of fundamental units of the field K, then we showed that the vectors I(e.), ... , I(e,) formed a basis for the subspace of dimension r = s + t - I consisting of all points (A.. ... , As + 1) E 9l s +t, for which Al + .. , + AS +1 = O. Since the vector
= (1,
1*
... , 1; 2, ... , 2)
" - S- " " -1- "
does not lie in this subspace, then the set of vectors 1*, l(e 1), ... , I(e,)
is a basis for 9l s + l • Any vector I(x) sented in the form I(x)
=
~I*
E
9l s + 1 [x
(1.6) E
9ln, N(x) =I 0] can be repre-
+ ~ 11(e1) + ... + ~,I(e,),
(1.7)
where ~, ~1' ... , ~, are real numbers. Let m denote the order of the group of roots of I contained in the field K.
.I
I
312
II i'
ANALYTIC METHODS
[Chap. 5
Definition. A subset X of the space ill" is called a fundamental domain for the field K if it consists of all points x which satisfy the following conditions: (1) N(x) oF O. (2) In the representation (1.7) the coefficients i (i = 1, '" ,r) satisfy the inequality 0 ~ i < 1. (3) 0 ~ arg Xl < 2nlm, where Xl is the first component of the point x.
e
e
Note that for s ~ 1 the number m equals 2, so that condition (3) in this case simply means that Xl > O. In the next section we shall see that the fundamental domain X is a cone in ill", and we shall use this fact to prove the following theorem. Theorem 1. In every class of associate numbers (oF 0) of the field K there is one and only one number whose geometric representation in the space ill" lies in the fundamental domain X.
We turn to the series (1.5). If we denote by 9Jl the n-dimensional lattice in ill" which consists of all images x(a), where a is an integer of K divisible by a', then since IN(a) I = IN(x(a» I we can write (1.5) in the form
fees) = N(a')'
I
xE!lJlnX
1 IN( )IS' X
(I.8)
where the summation is taken over all points X = x(a) in the lattice 9Jl which are contained in X. In Section 1.4 we shall prove a general result on series, in which the summation is carried out over all points of a lattice which lie in some cone (Theorem 3). Applying this result to our case, we find that the series (1.8) converges for s > 1 and 1 v lim (s-I) I - - = (1.9) s"'1+0 xE!lJlnX IN(xW ~ , where ~ is the volume of a fundamental parallelepiped of the lattice 9Jl and v is the volume of the set T which consists of all points X of the fundamental domain X for which IN(x) I ~ 1. By Theorem 2 of Section 4, Chapter 2, and (6.3), Chapter 2, ~ is given by ~
J-
= f1 N(a') IDI,
(1.10)
where D is the discriminant of the field K. We shall compute the volume v of T in Section 1.3, where we will show that 2sn t R v=--,
m
(1.11)
.5
Sec. 1 ]
in
where R is the regulator of the field K. From (1.9), (1.10), and (1.11) it easily follows that 2s + t n t R lim (s - I)fc(s) = ~ ,
n-
FORMULAS FOR NUMBER OF DIVISOR CLASSES
;e
m....;
s-1 +0
~e
and since (K(S)
= Lcfc(s),
313
IDI
we have established the following basic result.
Theorem 2. If K is an algebraic number field of degree n =
S
+ 2t,
the
series
n
e
converges for all
S
> 1. Further, we have the formula
" where h, D, and R denote the number of divisor classes, the discriminant, and the regulator of the field K, and m is the number of roots of 1 contained in K. We now turn to the verification of those assertions used in the derivation of Theorem 2. 1.2. Fundamental Domains
If ~ is a positive real number, we shall compute
I(~x) E
£s.t, where x
E
91",
N(x) =F O. From (3.12) of Chapter 2 we have (l ~ k ~ s), i
, i I !
It follows that I(~x)
= In ~·I· + I(x),
and this means that the vectors I(x) and I(~x) will have the same coefficients for 1(6 1 ), ... ,/(6,) in terms of the basis (1.6). Since N(~x) = ~"N(x) =F 0 and arg(~x) = arg x, then if x lies in the fundamental domain X, so does ex; that is, the domain X is a cone in 91" [X is nonempty, since it contains the point x(l), the image of the number 1 E K]. Lemma 1. If Y
E
91" and N(y) =F 0, then y has a unique representation in
the form y
= xx(e),
(1.12)
where x is a point of the fundamental domain X and e is a unit of the field K.
ANALYTIC METHODS
314
[Chap. 5
Proof. We represent the vector ley) in terms of the basis (1.6):
and for j = 1, ... , r we set Yj=kj+~j'
where k j is a rational integer and 0 ~ ~j < 1. We set 1"/ = e/' sider the point z = yX(I"/-l). We have I(z)
... e/
r
and con-
= I(y) + /(1"/-1) = I(y) - k 1/(e 1) - ... - kr/(er)
= 'II· + ~1/(e1) + .. , + ~r/(er)' Now let arg Zl = qJ. For some integer k, 2nk
2n
m
m
O~qJ--<-.
Under the isomorphism a -+ (T1(a) (a E K), the mth roots of 1 in K are mapped to the mth roots of 1 in the field C of complex numbers. Denote by ( that mth root of 1 (which must be primitive) for which (T1(() = cos(2njm) + i sin(2njm). We shall show that the point x = zx(C k) belongs to the fundamental domain X. We have I(x)
where 0 Xl
~ ~j
= I(z) + I(C k) = I(z) = 'II· + ~ 1/(e1) + .,. + ~r/(er),
< 1, so that conditions (1) and (2) are fulfilled. Further,
= ZlX(Ck)l = Zl(T1(O-k, so that arg Xl = arg z 1
-
2n k-
m
2nk
= qJ - -
m
and hence 2n
o ~ arg Xl < - . I I
Thus X E X. Now note that x(a)-i = I
.i
I I
y
m 1 x(a- ), so that
= zX(I"/) = XX«(k)X(I"/) = xx(e),
where e = (kl"/. Hence we have represented y in the form (1.12). We now must show the uniqueness of this representation. Assume that also y = x'x(e'), where x' E X and e' is a unit in K. Since xx(e) = x'x(e'), then lex)
+ lee) = I(x') + lee').
The vectors lee) and lee') are integral linear combinations of the vectors
Sec. 1]
FORMULAS FOR NUMBER OF DIVISOR CLASSES
315
.•. ,/(e,). The coefficients of the vectors I(e i ) in the expansions in the basis (1.6) of I(x) and I(x') are nonnegative and less than 1 [by condition (2)]. Hence I(e') = I(e) and this means that e' = e(o, where (0 is an mth root of 1 (see Section 3.4 of Chapter 2). From the equation x(e') = x(e)x((o) it follows that x = x'x((o), and hence
l(e 1 ),
By condition (3) we have
21t
21t o ~ argx 1 , < , rn
o ~ argx 1 < -m,
and hence 0 ~ larg 0'1((0)1 < 21tlm, and since 0'1((0) is an rnth root of 1, this is possible only when arg 0'1((0) = 0, so that 0'1((0) = 1 and (0 = 1. Hence x' = x and e' = e. Lemma 1 is proved. Proof of Theorem 1. Let p be any nonzero number of K. By Lemma 1 we can write x(P) = xx(e), where x E X and e is a unit. The number a = pe -1 is associate with p, and its geometric image x(a) (coinciding with the point x) lies in the domain X. Since the decomposition (1.12) is unique, the number a which satisfies p = ae and x(a) E X is uniquely determined, and this proves Theorem 1. As an example we shall find the fundamental domains for quadratic fields. First, we take the case where K is a real quadratic field, so that n = s = 2, t = 0, r = s + t - 1 = 1. We shall assume that K is a subfield of the field C of complex numbers, and that the first isomoI:phism 0'1 : K ~ C is the inclusion mapping (see Section 3.1 of Chapter 2). If e is a fundamental unit of the field K, then -e, lie, -lie are also fundamental units, so we may assume that 2 e> 1. If x = (Xl' X2) E 91 , with N(x) = XIX2 =I 0, then I(x) = (lnlxll, Inlx21). The decomposition (1.7) then has the form I(x) = eo, 1)
+ e,(ln e,
-In e).
The fundamental domain X is determined by the conditions
It is easily seen that and hence
Ix,1 = The condition 0
~
IX2Ie2~1.
e, < 1 then leads to 1~ :r
~ > -2 Ix,1 e .
316
ANALYTIC METHODS
[Chap. 5
The fundamental domain X hence consists of the points indicated in Figure 7 (the boundary rays which lie closest to the positive x-axis are not included in X).
FIG. 7
, i FIG. 8
Now let K be an imaginary quadratic field. Since here s = 0, t = 1, then r = s + t - 1 = O. Hence the fundamental domain X consists of all points X = Y + iz for which N(x)
.= y2 + Z2 =1=
[see Figure 8 for the case K = R(
0, (0 ~ arg
.J- 3), with m =
x
< 2;)
6].
1.3. Computation of the Volume We now turn to the computation of the n-dimensional volume of the set T, which consists of all points of the fundamental domain X for which IN(x) I ~ 1.
~p.5
e7 ed
Sec.1]
FORMU LAS FOR NUMBE R OF DIVISOR CLASSES
317
It will be shown in the course of the compu tations that the volume exists
and is nonzero. (For quadra tic fields, the set Tis indicated in Figures 7 and 8.) We first show that the set T is bounde d. In every ray which is contain ed in the cone X, there is one and only one point x for which IN(x) I = I. Denote the set of all such points by S. It is clear that T consists of all points ~x (0 < ~ ~ I), where x runs through all points of S. Consid er the formula (1.7) for any point x E 9l" with nonzer o norm. We compu te the sum of the compo nents of this vector. By (3.15) of Chapte r 2 the sum on the left equals InIN(x)l. By (3.18) of Chapte r 2 the sum on the right is ~(s + 21) = n~. This means that ~ = (Ijn) InIN(x)l, and (1.7) can be written in the form (1.13)
Now if XES, then InIN(x)1 = 0, and hence the point lex) = (II (x), .. , , Is+rCx)) E 9l,+t is represented in the form lex) = ~Il(el) + ... + ~/(er), where ~ ~i < I. It follows that there is a constan t p such that lix) < p, and then IXkl < eP for I ~ k ~ sand IXs+jl < eP/ 2 for I ~j ~ 1 for all XES [see (3.13)
o
and (3.12) of Chapte r 2]. This shows that the set S, and hence also the set T, is bounde d. We shall replace the set T by anothe r set which is easily obtaine d from T, and which has the advanta ge that it is defined by a simpler set of conditi ons. We first note the following almost obviou s lemma. Lemm a 2. If e is a unit of the field K, then the linear transfo rmation of the space 9l" given by x -+ xx(e) is volume-preserving. Under any nonsing ular linear transfo rmation the volume of a set is multiplied by the absolut e value of the determ inant of the matrix of the transfo rmation [see (4.2) of Chapte r 2]. We showed in Section 3.1 of Chapte r 2 that the determ inant of the transfo rmation x -+ xx(e) equals N(x(e)), that is, equals N(e) = ±1. As before, let ( denote that mth root of I for which 0"1(0 = cos(2nj m) + i sin(2njm). Consid er the sets T k (k = 0, I, ... , m - I), obtaine d from Tby the linear transfo rmation x -+ XX((k) (To = T). By Lemma 2 we have v(T k) = veT) (provided the volume of one of the sets exists). Since IN(xx((k))1 l(xx((k) )
= IN(x)N((k)1 = IN(x)l,
= lex) + l((k) = lex),
arg (XX((k))1 = arg x I
2n k, m
+-
then (by the definition of the fundam ental domain X) the set T consist s of all k
ANALYTIC METHODS
318
points x
E
[Chap. 5
91" for which:
(1) 0 < IN(x) I ~ 1. (2) The coefficients in (1.13) satisfy 0 ~ ~ i < 1. (3) 2nkjm ~ argx 1 < (2njk)(k + 1). Thus To, T 1 , '" ,Tm - 1 are pairwise-nonintersecting and their UnIon Uk'';o' T k is defined by conditions (1) and (2) [without condition (3)]. Let f denote the set of all points x E Uk'=-Ol T k , for which Xl > 0, ... , X s > 0 [see (3.2) of Chapter 2]. We fix a set of 5 signs 15 1 , •• , , bs (15; = ± 1). Ifwe multiply all points in 91" by the point (15 1, ... , bs ; 1, ... , 1) E £s.t =91", we obtain a volume-preserving linear transformation of 91". If we apply a1l2 s such linear transformations to f, we obtain 2S pairwise-nonintersecting sets whose union coincides with Uk'':-Ol T k • If we can show that f has nonzero volume ii, then it will follow that T has a well-defined volume, which is given by v(T)
"
'.
I
2s
=-
m
(1.14)
ii.
(For real quadratic fields f is that part of T which is contained in the first quadrant, and for imaginary quadratic fields f coincides with the unit disc minus the origin, see Figures 7 and 8.) The vector equation (1.13) yields the following system:
(j
'
where ej = 1 if 1 by the formulas
~j ~ 5,
and ej = 2 if 5 (k
Yj
= 1, ... , s + t),
+ 1 ~j ~ 5 + t.
We change variables
= 1, ... , s),
= Ps+j C.OS C{Jj
}
(j
= 1, ... , t).
Zj=ps+jsmC{Jj (Here the real numbers Y j and Z j are given by X s + j = Y j + iz j ' 1 ~ j ~ t, see Section 3.1 of Chapter 2.) The Jacobian of this transformation is easily computed to be Ps+1·"Ps+t. Since l/x) = Inp/ i and N(x) = flj~t1 p/i (we assume that Xl> 0, ... , X s > 0), then in terms of the variables P1' ... , Ps +" C{J" '" , C{J" the set f is given by the conditions: (1) P1 > 0, ... ,Ps+t > 0, flj~\ p/ i ~ 1.
(2) In the equations
11
e. (s+t ) In p/ i = ~ In p/'
(j
= 1, .. , ,x + t)
the coefficients
~k
r
+ k~l ~klj(£k)
satisfy 0
~ ~k
< 1 (k = 1, ... ,r).
Sec.1]
FORMULAS FOR NUMBER OF DIVISOR CLASSES
319
Since these conditions do not impose any restrictions on the variables then they independently take on all values in [0,2n). We now replace PI' ... ,Ps+ t by the new variables~, ~l' ... ,.~, by the formulas qJI' .. , , qJt,
(j
= 1, '" , s + t).
(1.15)
Adding these equations and noting that s+t
s+t
Le
j
L liCk) = 0,
= n,
j~1
j~l
(1.16)
we obtain s+t
~
= [lp/J.
(1.17)
j~l
The set Tis determined by the conditions (k
It is now clear that the volume
v = veT) exists.
= 1, ... , r).
Since
the Jacobian of the transformation (1.17) equals
J= Ps+ t
n~
Ps+tl () Ps+tl () s+t Cl ... s+t C, es + t es + t
In the last determinant we add all rows to the first row. Considering (1.16) and (1.17) and recalling the definition of the regulator R of the field K (see Section 4.4 of Chapter 2), we obtain R
IJI=-t---2Ps+I"'Ps+t
'!
320
[Chap. 5
ANALYTIC METHODS
v:
It is now easy to compute the volume
v = r·-j dX 1 ... dxsdy1dz l '" dy,dz, (T)
= J ... J Ps+ 1 ... Ps+' dpi ... dps+, d({J1 ... d({J, (T)
= J:"d({J1 ... J:"d({J, roo J Ps+ 1
oo'
Ps+' dpi ... dps+,
= 2'n' r·-jIJIPs+I'" Ps+t d~
d~I'" d~r
_ 'Rfl d~Jld~I'" -n 0 0
n'R.
fld~r = 0
Substituting this value of v in (1.14) we finally obtain,
2s n'R
v(T)=- . m
1
::1.
,
104. Dirichlet's Principle II ;
We first consider the function (K(S) when K is the rational field R. Since integral divisors in R can be identified with natural numbers and N(n) = n, then (1.18) Hence the Dedekind (-function for R coincides with the Riemann (-function (s). We shall show that the series (1.18) converges for s > 1. Since the function IIx' is decreasing for X> 0, then
J
"+I
,
dx 1 .. < ---;< x n
J'
,-I
dx --;, x
where the inequality on the left holds for n ~ 1, and that on the right for n ~ 2. Hence for any natural number N > I we have
N+ 1 dx I1
"
i !
N1
-
S
,~l
n
IN dx
-.
S
xS
1
Since the integral J'j'(dxlx') converges for s> 1, the inequality on the right shows that the series (1.18) converges. Further, for s > 1, we have
J
ro -dx < (s)
1
X
S
< 1+
Jro -, dx 1
X
S
I I Sec. 1]
FORMULAS FOR NUMBER OF DIVISOR CLASSES
321
or
i
1 1 < ((S) < 1 + - . s-1 s-1
'i
I
Multiplying this inequality by s - 1 and letting s tend to 1, we obtain lim (s - 1)((s) = 1,
(1.19)
s-1 +0
which indicates the order of growth of the function ((s) as s -+ 1. We now turn to the proof of a general theorem of Dirichlet on series. Let X be a cone in the space illn , and let F(x) be a positive real-valued function on X. (We assume that the origin is not contained in X.) The function F on X is assumed to satisfy the following conditions: (I) For any x E X and any real number ~ > 0, the equation F(~x) = ~nF(x) holds. (2) The set T, which consists of all points of X for which F(x) ~ 1, is bounded and has nonzero n-dimensional volume v = v(T). The points of the cone at which F(x) = 1 form a surface which intersects each ray of the cone in precisely one point, and which separates from the rest of the cone a bounded subset with nonzero volume. It is clear that the giving of such a surface in X is equivalent to the definition of such a function F(x). Let Wl be an n-dimensional lattice in illn with the volume of a fundamental parallelepiped denoted by ~. Consider the series
_
((S)
=
,,1 L. xE!lJlnX
(1.20)
(s> 1),
-
F(X)5
taken over all points x of the lattice Wl which are contained in the cone X. This series depends on the cone X, the function F, and the lattice Wl. Theorem 3. Under the assumptions given above, the series (1.20) converges for all s > 1 and
lim (s - 1)C(s) 5-1 +0
= .!:.. .
(1.21)
~
Proof. For any real r > 0 we denote by Wl r the lattice which is obtained by contracting Wl by a factor of r. The volume of a fundamental parallelepiped of Wl is then given by ~/rn. If N(r) is the number of points of the lattice Wl r which are contained in the set T, then by the definition of volume we have . ~ . N(r) v = v(T) = hmN(r)--;; = ~ hm - n r-oo
r
r-oo
r
.
(1.22)
Consider the set rT, obtained by expanding T by a factor of r. It is clear that
,
I
322
[Chap. 5
ANALYTIC METHODS
N(r) also equals the number of points of the lattice Wl contained in rT, and that this is equal to the number of points x E Wl (\ X, for which F(x) ~ r". The points of Wl (\ X can be arranged in a sequence {xd so that 0< F(x l ) ~ F(x 2 )
~ ••• ~
F(x k) ~ ., ..
Set z!i(Xk) = rk ·
The points Xl' ... , Xk belong to the set rkT, so N(r k) ~ k. But for any e> 0, the point Xk does not belong to the set (r k - e)T, so N(r k - e) < k. Thus N(r k - e) < k
~
N(r k).
so that N(rk - e)(r k - e)" k - - < ---;; e) (rkrk rk
--:---"----~"
Taking the limit as k
-+ 00,
that is, as rk -+
~
N(r k) rk
--" .
00, and
considering (1.22), we obtain
k v lim--=-. k~oo F(Xk) L\
(1.23)
We compare the series (s) = Lk=l IjF(xk)S with the series (1.18). Since limk~oo [kSjF(xSl = (vjL\)' =I 0, then along with the series (1.18) the series (1.20) also converges (if, of course, s> 1). Let e be any positive number. By (1.23) we have _1 < (~ + e)~ (~L\ _e)~k < F(Xk) L\ k
for all k greater than some sufficiently large k o . Hence
V )S 00 1 00 1 (V )S 00 1 L -< LS L - - < -+e (--e L\ k=ko k k=ko F(xkY L\ k=ko k S for all s > 1. We multiply this inequality by s - 1 and let s ten l to 1 from the right. Since lims~l(s - I)Li,,=~/(IjkS) = 0, then by (1.19), kns~l+O(S - 1) D>')=ko (Ijk S) = 1. Since also Ijms~l(s - l)LZo=-ll[ljF(xk),l = 0, we obtain the inequality
~
. 1
l:
L\
- e
~ lim (s - 1)~(s) ~ lim (s - l)~(s) ~ ~ + e. S~o
s~l+O
L\
and since e was arbitrary, this proves Theorem 3.
,,;,I' ,
II Iii
'Ii
Remark. There is a certain similarity between (1.21) and (1.22). To make this similarity more precise, we assume that the volume L\ of a fundamental parallelepiped of the lattice Wl is equal to 1, and we write them in the form
Sec. 1 ]
FORMULAS FOR NUMBER OF DIVISOR CLASSES
lim (s - 1)~(s)
= V,
323
(1.21 ')
s-! +0
lim r-oo
, I
, I
~N(r) = v.
(1.22')
r
Both limits have the same value, the volume of the set T. The volume is determined in (1.22') in the following way. The lattice Wl is shrunk by a factor of r, and the number N(r) of points of Wl r which are contained in T is determined. Then the number N(r) is multiplied by the volume Ijr n of a fundamental parallelepiped of the lattice Wl n , and finally we pass to the limit as r -. 00. The same idea is involved in (1.21 '). Here the sum (s) plays the role of the number N(r), and the factor (s - 1) corresponds to the factor Ijr n • We take the limit as s -. 1 from the right instead of as r -. 00. Turning to the fundamental domain X of an algebraic number field K, we see that the function F(x) = IN(x) I satisfies conditions (1) and (2). Hence we may apply Theorem 3 to the series (1.8), and this means that it converges for s > 1 and that relation (1.9) holds. We have now proved all assertions which were used in the first paragraph, and hence have completed the proof of Theorem 2.
1.5. Euler's Identity To use the formula (1.2) to compute the number h, we must have some other method for determining the limit lims~l(s - lKK(s). In some cases this can be done by using the representation of (K(S) as an infinite product, known as Euler's identity. Theorem 4. For s > 1, the function (K(S) can be represented as a convergent infinite product 1 (K(S) = 1 - [ljN(p)'J ,
IJ
,
where P runs through all prime divisors of the field K. Proof. For every prime divisor P we have
1 1 1 - [ljN(p)'J = 1 + N(p)'
I'
1 + N(p)2s
+ ....
(1.24)
Let N be any natural number, and Pb '" , Pr all prime divisors with norms not exceeding N. Multiplying the absolutely convergent series (1.24) for P = PI' ... , Pr' we obtain
)-1 =
1 1 - --s ( N(p)';;N N(p)
fl
I kr~O N(PI 00
k,
k, .....
1
1 s = I'--s , ... Pr) Q N(a) kr
I !
ANALYTIC METHODS
324
[Chap. 5
I',
where in the sum a runs through all integral divisors of the field K which are not divisible by a prime divisor with norm exceeding N. Comparing the . series with the series CK(S) = IljN(a)', we obtain
I'
1-)-1 - CK(S) I < I 1 n (1 - -N(a)S N(p}' I since the series I' contains all terms corresponding to integral divisors with N(a»N
N(p)';;N
norm
~ N.
Since for
S
> I the series (1.1) converges, then 1
I --+0 N(a»N N(a)' as N -+ 00, and this proves the theorem. Theorem 4 will be valuable because, along with Theorem 2, it establishes a connection between the number h and prime divisors of the field K. As we remarked in Section 1.1, if we know all prime divisors of the field K, then using Theorem 4, the left side of (1.2) can be computed, and this allows us to obtain a formula for h. On the other hand, since Kh =F 0, Theorem 4 gives an important property of prime divisors in a field K. For example, we shall use it in Section 3 to obtain the celebrated theorem of Dirichlet on the distribution of rational prime numbers in arithmetic progressions.
PROBLEMS
1. Use the convergence of the series series
~:=l (l/n')
(s> 1) to show that when s> 1, the
where II runs through all prime divisors of the field K, also converges. 2. Use Problem 1 to prove the convergence of the product
I] 1 -
1 [1/N(Il)']
(s> 1).
Deduce that the series
converges. 3. Let at and bt (k;;. 1) be positive real numbers with lim t _., bt/at = c. Show that if the series ~:': 1 at' converges for s > 1 and lim'_l + 0 (s - 1):= 1 at' = A, then the series ~:=l b t ' also converges for s> 1 and
.,
lim (s - 1) &-+1+0
Lb.' = cA.
k=l
5
h
Sec. 2]
DIVISOR CLASSES OF CYCLOTOMIC FIELDS
325
4. Let C be any divisor class of the algebraic number field K. Denote by Z(t, C) the number of integral divisors a of the class C for which N(a) ~ Show that
t.
.
1~~
Z(t, C)
-t- =X=
2'+'1T'R
mvlDj'
5. Let .p(a) denote the number of integral divisors of the algebraic number field K with norm a. Show that
~K(S) = ~(s)
i
~
"= 1
n' '
where
[/L(a)
is the Mobius function].
2. The Number of Divisor Classes of Cyclotomic Fields
Let m be a natural number, and let ( be a primitive mth root of 1. Since the mth roots of 1 in the complex plane divide the unit circle into m equal parts, the field R«) is called the mth cyclotomic (circle-dividing) field. In this section we shall use Theorems 2 and 4 of Section 1 to find a formula for h, the number of divisor classes in a cyclotomic field. To do this we must determine the factorization of rational primes into prime divisors in such fields. We first determine the degree of the field R«).
2.1. Irreducibility of the Cyclotomic Polynomial The degree of the field R(O equals the degree of the minimum polynomial of the number (over the rational field R. In this section we shall show that the minimum polynomial of ( is the polynomial <1>m = <1>m(t) =
TI (t _ (k) (k, m)= 1
(the product is taken over the indicated residue classes modulo m), which has as roots all primitive mth roots of 1. Since the degree of<1>m equals the value of the Euler function cp(m), it will follow that (R(O : R) = cp(m). The polynomial <1>m(t) is called the mth cyclotomic polynomial. We first show that the coefficients of <1>m(t) are rational integers. For m = 1, this is clear (<1>1 = t - 1). We proceed by induction on m. Since every mth root of 1 is a primitive root of some degree dim, then t
m
-
1 = TI <1>d , d
where d runs through all divisors of the number m. By the induction assumption the polynomial F = TIum<1>d has rational integral coefficients and its
I·
326
ANALYTIC METHODS
[Chap.5
leading coefficient is I. Hence the coefficients of <1>", = (t'" - l)jF are also rational integers. As usual, Z denotes the ring of rational integers, Zp the field of residue classes modulo the prime number p, and for a E Z we denote the corresponding residue class in Zp by ii. If /(t) is a polynomial with rational integer coefficients, we denote by J(t) the polynomial obtained from / by replacing all coefficients by their residue classes modulo p. It is clear that the mapping / -+ Jis a homomorphism of the ring Z[t] onto the ring ZAt]. Since (J + gY = jP + iY, and aP = ii, then in the ring Zp[t] we have the formula
(J(t»P = J(tP).
(2.1)
Set h = t'" - I. If p does not divide m, then the polynomial Ii of ZAt] is relatively prime to its derivative and hence has distinct roots. Noting that ill", divides Ii, we have the following assertion.
I.
Lemma 1. If the prime number p does not divide m, then the polynomial
ill E Zp[t] has no multiple roots. If/(t) is the minimum polynomial of (, then <1>", = /G, where G and/both belong to the ring Z[t]. If p is any prime not dividing m, then (P is a primitive mth root of I; that is, ",«(P) = O. We shall show that (P is a root off Otherwise we would have G«(P) = O. Then consider the polynomial H(t) = G(tP). Since H(O = G«p) = 0, then H is divisible by f, that is, H = /Q, where Q E Z[t]. Passing to Zp we obtain H = JQ. But by (2.1), H(t) = G(t P ) = (G(t»p, so that
GP =JQ. Let l/i be any irreducible factor of J (in the ring ZAt]). It follows from the last equation that G is divisible by l/i. But then it follows from ill", = JG that ill", is divisible by l/i2, contradicting Lemma I. Thus (P cannot be a root of G(t), and hence is a root of/(t). If (' is any root of <1>"" then (' = (k, where k is relatively prime to m. Let k = PtP2 '" Ps' We have just shown that (PI is a root of /(t). Analogously, replacing ( by (PI, we find that (PIP2 is a root of /(t). Continuing this process, we find that (k is a root of /(t). We have shown that any root of <1>", is also a root off, and hence <1>", = f We formulate this result in the following theorem. Theorem 1. For any natural number m, the cyclotomic polynomial <1>", is irreducible over the field of rational numbers. Corollary. The degree of the mth cyclotomic field is qJ(m) [where qJ(m) is Euler's function] .
1
.1
:1. .1 i 1'1,\ I
Sec. 2]
DIVISOR CLASSES OF CYCLOTOMIC FIELDS
327 I , I
2.2. Decomposition of Primes in Cyclotomic Fields
Since the mth cyclotomic field R(O has degree qJ(m), then the numbers 1, (, ... , ('I'(m)-l
(2.2)
form a basis for R(O over R. Lemma 2. If the prime number p does not divide m, then it does not divide the discriminant D = D(I, (, .. , , ('I'(m) -1) of the basis (2.2). Proof. The discriminant D equals the discriminant D(m) of the cyclotomic
polynomial m' The residue class D(m) E Zp of the number D(m) clearly coincides with the discriminant D(m) =I 0, which means that D = D(m) is not divisible by p. Lemma 3. If the algebraic number field K contains a primitive mth root of I and P is any prime divisor of K which is relatively prime to m, then N(p)
== 1 (mod m).
Proof. Let .0 be the ring of integers of K, p the rational prime which is divisible by P, and ( a primitive mth root of I E D). In Section 2.1 we saw that the polynomial t m - 1 has no multiple roots in the extension field Djp of Zp (since p,r m). Hence the residue classes 1, (, .. , , (m-1 of Djp are pairwise-distinct. These classes form a group under multiplication, a subgroup of the multiplicative group of the field Djp. But the order of this group is N(p) - I, and since the order of a subgroup divides the order of the group, m divides N(p) - 1. The lemma is proved.
«(
Theorem 2. If p is a prime number not dividing m, let f be the smallest natural number such that pI == I (mod m), and set 9 = qJ(m)jf Then the prime p has the factorization
P=Pl"'Pg,
(2.3)
in the mth cyclotomic field, where the prime divisors PI' ... , Pg are distinct and N(p;) = pl. Proof. Since (p, m) = I, then by Lemma 2, p does not divide the discriminant of the basis (2.2). It now follows from Theorem 8 of Section 5, Chapter 3, that p has a decomposition of the type (2.3). We need only determine the degree of each prime divisor Pi' and show that there are qJ(m)1f of them. Let P be any of the prime divisors Pi and let s be its degree, so that N(p) = p'. By Lemma p' 3, == I (mod m), and hence s ~ f To prove the opposite
I'
ANALYTIC METHODS
328
[Chap. 5
inequality, we consider the residue class field Dip, where D is the ring of integers of the field R«). By the corollary of the lemma in Section 7.4 of Chapter 3 every residue class of Dip contains a representative of the form (2.4) where a j are rational integers. We raise (2.4) to the pIth power. Since pI == 1 (mod m), then (pI = (. But also (a + {3yl == a pl + W\mod p) for any a and {3 in D, and thus apl == a (mod p) for any rational integer a. Hence from (2.4) we obtain the congruence (pI == (mod p). Thus any residue class ~ E Dip is a root of the polynomial t PI - t. But in any field the number of roots of a polynomial does not exceed its degree, so pS ~ pI, and s ~f Hence we have s = f We have shown that all prime divisors p in (2.3) have the same degree f, which is equal to the order of the number p modulo m. Now applying Theorem 8 of Section 5 of Chapter 3, we see that the number of prime divisors Pi equals qJ(m)lf Theorem 2 is proved. 2.3. The Expression of h in Terms of L-Series
Let (K(S) be the (-function of the mth cyclotomic field K = R«), (m = 1. If we group together those terms in Euler's identity which involve the prime divisors p of each rational prime p, we obtain
=
(K(S)
IJ J;!1 -
1 [lIN(p),J
(2.5)
(the product being taken over all rational primes p). Only a finite number of terms correspond to prime divisors p which divide m. We denote the product of these terms by G(s)
=!! ( 1)-1
(2.6)
1 - N(p)'
If (P, m) = 1 and p is any prime divisor of p, then N(p) = pIp, where fp is the order of the number p modulo m. Since the number of distinct p dividing pis qJ(m)!fp (Theorem 2), then (K(S)
= G(s)
1 )
TI
(1- p (p.m)=1
Ips
-
(2.7)
Each factor in this product can be put in more convenient form. We use the expansion
(
1)/p_IP-1( ll) - TI 1 - -
1- - S P
k=O
PS
'
(2.8)
5
Sec. 2]
DIVISOR CLASSES OF CYCLOTOMIC FIELDS
where e = ep = cos(2n/fp)
+ i sin(2n/fp).
329
Then the product
/P-I( n 1-~e
k)-f/J(m)//p
P
k=O
1 d
n y o r,
1
has
n n( 1 _ XC))-l :
(K(S) = G(s)
(p,m)=l
X
P
(2.9)
(the second product being over all characters X of the group G). In the place of characters of the group Gm , we may consider numerical characters modulo m (see Section 5.3 of the Supplement). If Xis a numerical character modulo m, and p is a prime which divides m, then X(P) = 0, and hence (2.9) takes the form (K(S) = G(s)
nn(1 _X(:))-l P p
x
(here p runs through all prime numbers and X runs through all numerical characters modulo m). Reversing the order of multiplication, we arrive at the formula
n£(s, X),
(2.10)
1 1 _ [X(p)/pS]
(2.11)
(K(S) = G(s)
x
where £(s, X)
= I]
Note that all the products converge for s> 1, and hence all the operations made on infinite products are easily justified.
I'
ANALYTIC METHODS
330
[Chap. 5
Remark. In formula (2.10) the term G(s) can be dropped if we let Xrun through all primitive characters modulo d, for all d which divide m; see Problems 13 to 16.
The term L(s, Xo) in the product (2.10), which corresponds to the unit character Xo, differs only slightly from the Riemann (-function ((s). Since Xo(P) = 1 for (p, m) = 1 and Xo(p) = 0 for (p, m) > 1, then L(s, Xo)
= (p,m)=1 n
1 1_
(II p')
(s> 1).
On the other hand, applying Theorem 4 of Section 1 to the rational field R, we obtain ((s)
= I) 1 -
Thus L(s, Xo)
=
1 (lIp')
!ll _ )-1 1
(
(lip')
((s).
Substituting this expression in (2.10), we obtain the following formula for (K(S) : (K(S)
= F(s)((s)
n L(s, X)
(s> 1),
(2.12)
1.*XO
, i
where [see (2.6)] F(s)
=
n (1 -_1 ,)-1. n (1 _~). N(p) p
vim
plm
We now simplify the functions L(s, X). Since the series 'Lx(n)ln' converges absolutely for s > 1, we obtain, as in (1.24), the equation 1 1 - [x(p)lp']
(X(p))k k~O [7 00
=
By an almost verbatim repetition of the proof of Theorem 4 of Section 1 (using only the multiplicative property of the character X), we easily find that L(
) = ~ x(n) s, X L., n= 1 n
(s> 1).
(2.13)
The series on the right in (2.13) is called the L-series or the Dirichlet series for the numerical character X. Our first goal is now to show that the L-series of a nonunit character converges not only for s > 1, but even for s > 0 (however, convergence in the interval 0 < s ~ 1 will be nonabsolute), For this we prove the following lemma.
5
Sec. 2]
DIVISOR CLASSES OF CYCLOTOMIC FIELDS
331
n
1-
t
Lemma 4. Let the sequence of complex numbers {an} (n = 1,2, ... ) be such that the sums An = Ik= 1 ak are bounded; that is, IAn I ~ C for all n ~ 1. Then the series
e
f(s)
f
=
n= 1
a: n
converges for all real s > O. For any (J > 0, convergence is uniform in the interval [(J, (0), so that the sumf(s) is continuous in s. Proof. Fix (J> O. For any e> 0 we can pick no so that Iln a < e for all n > no. For all such n> no, we also have lin' < e, provided that s ~ (J. Let M > N > no. Then
I: a~= I:
k=N k
I:
A k -A k- 1
k=N
A k _ MIl Ak k=N k' k=N-l (k + 1)'
k'
(1
M~l
AN- 1
1)
AM
= - ~ + kf'N A k k' - (k + 1)' + M" so that Makl
C
1)
M-l(1
/ k~N ~ ~ N' + C k~N ks -
(k
C
+ I)' + M' =
2C N' < 2Ce
for all s in the interval [(J, (0). Lemma 4 is proved. Corollary. If X is a nonunit character, the series L(s, X) converges for s > 0 and represents a continuous function on the interval (0, (0). For if X i' Xo, then LX(k) = 0, where k runs through a complete set of residues modulo m. Representing the natural number n in the form n = mq + r, o ~ r < m, we find An = 1 x(k) = 1 x(k), SO that IAnl ~ r < m. Turning to the function CK(S), we multiply (2.12) by s - 1 and take the limit as s -+ 1 from above. By (1.19) we find that
Iz=
lim (s -
I;; =
lK K(s) = F(1)
,-1+0
where L(l, X)
=
f
TI L(I, X),
x(n)
n= 1
(2.14)
x*xo
n
(2.15)
Note that since the series (2.15) does not converge absolutely, we must keep in mind that its terms appear in order of increasing n. Comparing (2.14) with Theorem 2 of Section 1, we obtain the following formula:
wJfDJ
h = 2'+' tR F (1)
n
TI
1.*Xo
L(1, X)
(2.16)
332
ANALYTIC METHODS
[Chap. 5
(here w denotes the number of roots of 1 contained in K). The expression (2.16) for the number of divisor classes of a cyclotomic field is not definitive, since it still contains the infinite series £(l, X). The summation of these series will be carried out in the next section. 2.4. Summation of the Series £(1, X).
Assuming that X is a nonunit character modulo m, we turn to the series (2.13). Omitting those summands which are zero and noting that if n1 == n 2 (mod m), then x(n 1 ) = x(n 2 ), we arrive at the following form (valid for s> 1): £(s, X) = L: X(x) L: ~1 (x,m)=l
nEx(modm)n
The inner series can be written in the form
~~s '
L,
n=l
n
where for n == x (mod m),
= {I0 n
C
for n =1= x (mod m).
To find a convenient way of writing the coefficients Cn , we consider the following formula: for r == 0 (mod m), for r
=1=
0 (mod m),
where 2n
C= cos - + i m
. 2n SIn -
m
is a primitive mth root of 1. We stress that when considering the algebraic properties of a cyclotomic field, it does not matter which primitive mth root of 1 is denoted by " but for analytic computations we must fix a definite complex root. Hence we have 1 m-1 cn = - L: c(x-n)k.
I I
Ii
I
m k=O
Thus £(s, X)
L:
=
00
X(x)
(x.m)=l
1 m-1(
= -
m
L:
k=O
1 m-1
L: - L:
1
c(x-n)k
n
n=lmk=O
L:
(x,m)=l
X(x)CX
k)
s
L: 00
n=l
C- nk -s .
n
!
Sec. 2]
DIVISOR CLASSES OF CYCLOTOMIC FIELDS
333
We have already encountered the expression in parentheses in the case m = p in Section 2 of Chapter 1, where it was called a Gaussian sum. We now define Gaussian sums for arbitrary m. Definition. Let ( be a fixed primitive mth root of 1, and let Xbe a numerical character modulo m. The expression 'u(X)
=
L
X(x)(UX,
xmodm
where x runs through a full (or a reduced) system of residues modulo m, is called the Gaussian sum corresponding to the character Xand the rational integer a. The Gaussian sum 'u(X) depends not only on Xand the residue of a modulo m, but also on the choice of the root (. In'the future we shall always assume that ( is cos(2nlm) + i sin(2nlm). The Gaussian sum with this choice of ( is called normed. The sum 'l(X) will also be denoted by ,(X). If X is not the unit character, then 'o(X)
=
L
(x,m)=l
x(x) = O.
Hence our expression for L(s, X) can be written in the form 1
L(s, X)
=-
m-l
00
L 'k(X) L
mk=l
0=1
(-nk -S •
n
We can apply Lemma 4 to the series L:'=l(CO k ln S )(C k ;61 for k ;6 0, so L~~l cnk = 0). By this lemma our series converges for 0 < s < 00 and represents a continuous function of s. Hence we may set s = 1 in this last equation and obtain
To find the sum of the inner series, we turn to the power series L:'=l(zOln). It is well known that it converges for Izi < 1 and represents there that branch of the function -In(l - z), the imaginary part of which (that is, the coefficient of i) is contained in the interval (- n12, nI2). Since this series also converges at the point z = C k (on the unit circle), then by Abel's theorem 00 (-ok - = -In(l-C k ), 0=1 n and hence
L
(2.17)
i
'!
i
ANALYTIC METHODS
334
[Chap. 5
Hence we have obtained a finite expression for the series L(l, X). Substituting it in (2.16), we obtain a formula for the number of divisor classes of a cyclotomic field which does not contain any infinite series. The formula (2.17) can be further investigated and considerably simplified. In the next section we shall do this, but only for the case when X is a primitive character. In Section 5 we shall apply these results to the further study of the formula for h in the case of the lth cyclotomic field, with I a prime. In this case the formula has particularly important applications. 2.5. The Series L(l, X) for Primitive Characters
We shall show that if X is a primitive character modulo m and (a, m) = r> I, then
Set m = rd. It is clear that Ca is a primitive dth root of I, and therefore Ca = Ca , provided that z == I (mod d). We take as z a number for which (z, m) = I, z == I (mod d) and X(z) i' I (the existence of such a d is guaranteed by Theorem 4 of Section 5 of the Supplement). As x runs through a complete system of residues modulo m so does zx, so that %
Ta(X)
=
L:
X(zxW:
x
= X(z)
x mod m
L:
X(xW
X
= X(z)ra(X)·
x mod m
Since X(z) i' I, it follows that Ta(X) = O. Further, if (a, m) = I, then
=
Ta(X)
x(a)-IT(x)·
Indeed, as x runs through a complete system of residues modulo m, so does ax and thus x(a)Ta(X)
=
L
x mod
X
x(axW t= TI(X) m
= T(X)·
\
Hence if X is a primitive character we can write (2.17) in the form
'I
i
L(l, X) = - T(X) L: i(k)ln(l - C k ). m (k,m)= I
(2.18)
We turn to the study of the sum Sl.
=
L:
i(k)ln(l - C k )
(2.19)
(k,m)= I
(k running through a reduced system of residues modulo m). The study of the
sum Sl. leads to two essentially different types of behavior. To distinguish these cases we need the following definition.
l.5
109 10-
:d. ve he lis
Sec. 2]
DIVISOR CLASSES OF CYCLOT OMIC FIELDS
335
Defini tion. The numerical charact er X is called even if X( -1) = 1 [and hence X( -x) = X(x) for all integers x], and is called odd if X( -1) = -1 [and xC-x) = -X(x)].
Since (x( - 1))2 = X(( - 1)2) = x(I) = 1,
then X( -1) = ± 1, and thus every charact er X is either even or odd. The numbe r 1 - C k (for 0 < k < m) can be represented as 1- C where
nk) = 2 sin nk( ~ cos (n "2 - ~
k
-n12 < nl2 -nklm < n12; k
In(l - C Further , since 1 -
C k and 1 -
(k
. (n"2 - -;; nk)) ,
Sin
therefore
= Inl1
)
+ i
~).
- Ckl + inG -
are conjugate, then
(1 k)
In(l - ( k ) = Inl1 - ( k I - .m 2" - m . (Note that the last two formula s are valid only when k lies between 0 and m.) Now assume that the charact er X (and hence also i) is even. Interch anging k and - k in (2.19), we obtain Sl.
=
L
X(k)ln(I - (k),
(k,m)= 1
and with (2.19) this yields
L
2Sl.=
(k,m)= 1
iCk)[ln(I -Ck)+ ln(I _(k)]
L
=2
(k,m)= 1
iCk)lnl1 - (kl = 2
L
X(k)ln 2 sin
(k,m)= 1 o
nk . m
If the charact er X is odd, then when we interch ange k and - k in (2.19) we obtain Sl.
= -
L
iCk)ln(l - (k),
(k,m)=l
so that 2Sl.
=
L
(k,m)=l
= 2
iCk)[ln (l -
C k) - In(I _
L X(k)ni(~ - ~).
(k,m)= 1 2 m O
(k)]
,
r
, I
[Chap. 5
s
Since L(k,m= 1) x(k) = 0 (X is not the unit character), then we obtain the following result from (2.18).
r
336
ANALYTIC METHODS
Theorem 3. Let X be a primitive character with modulus m > 1. If X is even, then
L(1, X)
=
L
= - rex) m
(k,m)= 1 O
X(k)ln sin nk . m
(2.20)
If X is odd, then L(1, X) =
nir(x)
L
-2(k,m)=1 O
m
_ x(k)k.
(2.21)
PROBLEMS 1. If X is a primitive character modulo m, show that
!,.(x) [ = vi;;. 2. Let p be an odd prime and set p*
= (-1)(.-.)/2 p .
Show that the quadratic field
R(V p*) is contained in the pth cyclotomic field (use Problem 5 of Section 2, Chapter 1, with a = b = 1). 3. Show that every quadratic field is contained in some cyclotomic field. 4. Using the notation of Problem 6 of Section 5 of the Supplement, show that
[assume that the m1th root of 1 used to define the Gaussian sum ".(Xl) is ~"/"l, where ~ is the primitive mth root of 1 which is used to define the sum ,..(X)]. 5. Let p be a prime number which does not divide m, and let f be the smallest natural number such that pI "" 1 (mod m). Show that the polynomial W..(t) with coefficients in Z. (see Section 2.1) factors in Z.[t] as a product of
(~1)
=(_1)(.-.)/2
(this is the first supplement to the law of quadratic reciprocity). 7. Let p and q -# 2 be distinct primes, let K be the qth cyclotomic field, and let g be the
5
Sec. 2]
DIVISOR CLASSES OF CYCLOTOMIC FIELDS
337
number of distinct prime divisors of K which divide p. Using the Euler criterion == a(q-l)/2 (mod q), show that
(a/q)
(~) =
(-I)'.
a. Using the same notations, consider the quadratic subfield k = R(Yq*) of the field K, where q* = (_1)
(~) =(-1)'. Thus p factors in k if and only if g is even. [Hint: If q == 1 (mod 4), use Problem 7 and show that from (p/q) = (p*/q) = 1 it follows that (q/p) = (q*/p) = 1.) 9. Use the preceding two problems to prove the law of quadratic reciprocity:
(~) (~) =
(_1)[(,-1l/ 2 J!
10. Let q be a prime which factors as the product of two distinct prime divisors in the field R( y2), and let q == 1 (mod 4). Show that q == 1 (mod 8). [Consider the factorization of q in the field R(Y2, Y -1), the 8th cyclotomic field.) 11. Using the notations of Problems 7 and 8, show that the prime p = 2 factors into two distinct prime divisors in the field k if and only if g is eVen. 12. Comparing the result of the preceding problem with Theorem 1 of Section 8, Chapter 3, show that (2/q) = + 1 if and only if q* == 1 (mod 8); that is, show that
(this is the second supplement to the law of quadratic reciprocity). 13. Show that the prime number p has the factorization p = \.1',
g = (('(pO) = pO -1(p - 1),
N(\.1) = p,
in the pth cyclotomic field. 14. Let m = pOm', (p, m') = 1, and let f be the smallest natural number for which pI == 1 (mod m'). Show that the prime number p has the factorization p
= (\.11
... \.1,)',
N(P,)
= pI,
in the mth cyclotomic field, where e = ({'(pO), fg = ({'(m') «({' is Euler's function). 15. If G(s) is the function determined by (2.6), show that G(s) =
n n Plmxmodm'
(1 _X(~»)
-1 ,
P
where p runs through all prime divisors of m, and X(for given p) runs through all numerical characters modulo m', where m = pOm', p,r m'.
,I
338
ANALYTIC METHODS
[Chap. 5
16. Using Problem 9 of Section 5 of the Supplement, formula (2.10), and the preceding problem, show that the ~-function ~K(S) of the mth cyclotomic field has the representation
n n
~K(S) =
L(s, X),
dim Xmodd xprtm
where d runs through all divisors of m (including 1 and m), and X(for given d) runs through all primitive characters modulo d. Deduce that lim (s - l)~K(S)
n n
=
L(1, X).
dim Xmodd Xprim
d~l
3. Dirichlet's Theorem on Prime Numbers in Arithmetic Progressions
In Section 2 we used Theorems 2 and 4 of Section 1 to compute the number of divisor classes of a cyclotomic field. In this section we shall show that from the existence of formula (1.2), with a nonzero constant on the right, we can deduce important results on prime divisors of first degree and prime numbers in arithmetic progressions. 3.1. Prime Divisors of First Degree Theorem 1. Any algebraic number field K has an infinite number of prime divisors of first degree. Proof. By Theorem 4 of Section 1 the function (K(S) has the expansion
(K(S)
=
1)-1 D(1 - N(p)'
(3.1)
Since convergent infinite products are nonzero, (K(S) =I- 0 for s > 1. Taking logarithms in (3.1), we obtain 1
00
In(K(s)
= L: L: N( Pr' P m=1 m
(3.2)
We isolate the following summands: P( s)
= L: PI
1 N(
PI
)5'
(3.3)
the summation being taken over all prime divisors PI of K of first degree. If we denote the sum of all remaining terms by G(s), then (3.2) can be put in the form In(K(S)
II I I
= pes) + G(s).
(3.4)
5
g
n
Sec.3]
DIRICHLET'S THEOREM ON PRIME NUMBERS
339
Let f denote the degree of the prime divisor p, so that N(p) = pl. If f ~ 2, then
If f= 1, then 00 1 001 1 2 m!;z mN(p)ms < m!;z psm = pS(pS _ 1) < pZs'
For each rational prime p there are at most n = (K: R) prime divisors of the field K which divide p, so we have the following estimate for G(s): 00
2n
G(s)
<
1
LpP2S < 2n m=lm L --rs'
It follows that the function G(s) is bounded as s --> I + O. But since Kh i' 0 in (1.2), both (K(S) and In (K(S) must go to infinity as s --> I + O. We have seen that G(s) is bounded, and hence by (3.4) the sum (3.3) must contain an infinite number of terms. Theorem 1 is proved.
We note that this proof uses the idea of one of the proofs of the existence of infinitely many prime numbers (see Problem 1). 3.2. Dirichlet's Theorem Theorem 2 (Dirichlet's Theorem). Every residue class modulo m which consists of numbers relatively prime to m contains an infinite number of prime numbers. Proof. The proof of Section 3.1 was based on the nonvanishing of the limit (1.2). Analogously, the proof of Dirichlet's theorem uses the fact that L(I, X) i' 0 for any nonunit character X modulo m, which is an immediate consequence of (2.16). Consider the representation of L(s, X) as an infinite product,
L(s, X)
=
I)(1-X~)r
1
(3.5)
From the convergence of this infinite product it follows that L(s, X) is nonzero for all s > I. (Here X may be any numerical character modulo m, including the unit character Xo .) Therefore we can consider the complex function In L(s, X) on the interval (I, (0). We choose a fixed branch of the logarithm function as follows. In each factor of the infinite product (3.5) we choose the value of the logarithm so that
f
_ In(1 _ X(P») = X(p)n. pS n=1 npsn
(3.6)
340
ANALYTIC METHODS
[Chap. 5
Summing the series (3.6) over all p, we obtain
~-
In( 1 -
S
"E
X~)) = ~ x~r:) + R(s, X)
(
where 1 X(p)Z 1 X(P)3 R(s,X)=" ( - - + - - + "': 2 pZs 3 p3S
...
)
1
(it is clear that all series involved are absolutely convergent for s > 1). The value for In £(s, X) is now chosen so that X(p) In £(s, X) = " "': pS
+ R(s, X)
(3.7)
for all s> 1. Note that the values of In £(s, Xo) will be real for the unit character Xo' We estimate the function R(s, X): «)
IR(s,
1
1
x)1 < L:pn=Zp L: -;; < L:pPP(
1)
v f
1
«)
v c s
< n=1 L: nn ( + 1) = 1.
t
s
Thus IR(s, X)! < 1 for all s > 1. Along with the numerical character X we consider the corresponding character of the group Gm (which consists of all residue classes modulo m which consist of numbers relatively prime to m), which we also denote by X. Let C run through all classes of the group Gm • Since X(p) = X(C) for p E C, then
[recall that X(p) = 0 if p divides m]. Setting f(s, C) =
L: peC
1
~
P
,
we can put (3.7) in the form In £(s, X) =
L: x(C)f(s, C) + R(s, X).
(3.8)
C
Since there are
L: x(A x
-1
)In £(S, X) =
L: L: X( CA C
x
1 )f(s,
C)
+ Ris),
(3.9)
s
1
5
Ie
Sec. 3]
DIRICHLET'S THEOREM ON PRIME NUMBERS
ILx
341
where we have the estimate IRA(s)j = X(A -1 )R(s, x)1 < cp(m) for all s> 1. 1 By formula (5.6) of the Supplement the sum X(CA- ) equals cp(m) for C = A and equals zero for C i' A. Therefore (3.9) takes the form
In L(s, Xo)
+
L x(A x*xo
-1
Lx
)In L(s, X) = cp(m)f(s, A)
+ RA(s).
(3.10)
This gives the value of f(s, A) in terms of the system (3.8). Now we let s approach I from the right. If X i' Xo, then L(s, X) -+ L(I, X), with L(l, X) i' 0, as was noted at the beginning of the proof. Hence the sum on the left in (3.10) (over all nonunit characters) has a finite limit. Taking this sum to the right side and combining it with RA(s), we obtain
In L(s, Xo)
it
= cp(m)f(s, A) + Tis),
(3.11)
where T A remains bounded as s -+ 1 + 0. Now we assume that the number of primes in the class A is finite. Then the function f(s, A) = LPEA I Ips will have a finite limit as s -+ 1, and therefore the right side of (3.11) will remain bounded as s -+ 1 + 0. But this is impossible, since lim L(s, Xo) = 00, s-l +0
since L(s, Xo)
= (s)
TI (1 -~). P
plm
This contradiction proves Theorem 2. Dirichlet's theorem can be strengthened as follows. Set f(s)
= IJ(s,
A)
L -.1
=
A
(p.m)= 1 P
Dividing (3.11) by cp(m) and summing over all A
In L(s, Xo) = f(s)
E
G m , we obtain
+ T(s),
(3.12)
where T(s) is bounded as s -+ 1 + 0. Comparing the right sides of (3.11) and (3.12) and taking the limit as s -+ 1 + 0, we arrive at the formula
L-
L -
1/ 1) =1lim ( s pEAP (p.m)=lP' cp(m)'
s-1+0
This formula says that, in a certain sense, the prime numbers which are relatively prime to m are uniformly distributed in the residue classes of Gm •
342
ANALYTIC METHODS
[Chap. 5
PROBLEMS
1. Show that the difference between the functions In S(s) and g(s) = L. lip' (p running through all rational primes) remains bounded as s -'> I + O. 2. Let P(s) be the function determined by (3.3). Show that the difference I P(s)-ln-s- I
remains bounded as s -'> I + O. 3. The rational integer a is called an nth power residue modulo the prime p, if the congruence x· ;= a (mod p) is solvable. For any a and any n, show that there are infinitely many p such that a is an nth power residue. 4. Let the integers aI, .. , , a. be such that a 1 Xl ... a. X. is a square if and only if all X, are even. For any choice of e" ... ,e. (e, = ± I), show that there exist infinitely many primes p (not dividing a" ... ,a.) for which
Hint: Consider the sum
4. The Number of Divisor Classes of Quadratic Fields
4.1. A Formulafor the Number of Dipisor Classes Let K = R( Jd) be a quadratic field (d a square-free rational integer). By Theorem 2 of Section 8, Chapter 3, a rational prime p has the following factorization into prime divisors in K: (1) P = PI", I'
:' I
(2) P = 1', (3) p
= 1'2,
Pi' p',
N(p)
=
N(p')
=
N(p)
=
p2,
if X(p) = - 1;
N(p)
=
p,
if X(p) = 0;
p,
if X(p) = 1;
where X is the character of the quadratic field K (see the definition of Section 8.2 of Chapter 3). Hence in the product
.1
5
19
1-
Y
Sec. 4]
DIVISOR CLASSES OF QUADRATIC FIELDS
343
the factor corresponding to p will be one of the following: (1)
(2) (3)
( pS1)-I( pI)-I. S' 1)-1_( - 1--pS1)-I(1+-pS1)-1 (1-p2s 1--
1--
1
1--.. p
In all three cases this can be written
(1- ?f (1- X~)fl l
e
p
Since [lp(l - 1jp S)-1 representation
= (s) v
~K
()
S
(Theorem 4 of Section 1), then (K(S) has the
= ~V() s [l ( 1 -
x(p)) -S
P
P
I
.
(4.1)
The infinite product on the right is the L-series L(s, X) for the character X(with modulus ID I, where D is the discriminant of the field K), and since this character is not the unit character, then L(s, X) is a continuous function on the interval < s < 00 (corollary of Lemma 4 of Section 2). Multiplying (4.1) by s - 1 and taking the limit as s -+ 1 + 0, we obtain [by (1.19)]
°
lim (s - l)(K(s)
= L(1, X).
s-I +0
Now we use Theorem 2 of Section 1. For real quadratic fields s = 2, t = 0, m = 2, and R = In e (e> 1 a fundamental unit of the field); for imaginary quadratic fields s = 0, t = 1, and R = 1, and hence
JI5
for d > 0,
h = [2lne L(l, X)
rnJIDI 2rr
L(l
,x)
for d < 0.
[By Section 7.3 of Chapter 2 the number m of roots of 1 contained in K equals 4 for K = R( equals 6 for K = R( -3), and equals 2 for all other imaginary quadratic fields.] In the next section we shall show that the character of a quadratic field with discriminant D is a primitive character modulo ID I (see the definition of Section 5.3 of the Supplement), and also that it is even for real fields and odd for imaginary fields. Therefore we can use formulas (2.20) and (2.21) to find
J=1),
J
I'
344
ANALYTIC METHODS
[Chap. 5
L(l, X). To find a closed formula for h we must still find the value of the normed
Gaussian sum ,(X) = '1(X), In Section 4.3 we shall see that the sum ,(X) equals for real fields and equals i JI DI for imaginary fields. Noting also that for real fields X(D - x) = X(x), we can formulate the following theorem [to simplify the formulas we eliminate the fields R( J-=T) and R( J - 3), which have discriminants - 4 and - 3, and for which m equals 4 and 6; for these fields h = I].
JJ5
i
r.
Theorem 1. The number of divisor classes of a discriminant D is given by 1 h= - L: X(x) In sin In e (x.D)= 1 O<x
real quadratic field with nx
-, D
(4.2)
where e> I is a fundamental unit of the field; for an imaginary quadratic field with discriminant D < - 4 we have the formula h
=- -
1
IDI
L:
X(x)x.
(4.3)
(x,D)=1
O<x
In both cases Xdenotes the character of the given field, defined in Section 8.2 of Chapter 3 [formula (8.5)]. We note some number-theoretic consequences of Theorem 1. We begin with formula (4.2). Consider the number
"=
TI sin -nbj TI SIn. -na , DaD
(4.4)
b
where a and b run through all natural numbers in (0, D/2) which are relatively prime to D and satisfy x(a) = + I, X(b) = -1. Then formula (4.2) can be written in the form eh = ". Hence" is a unit of the quadratic field in question, with" > I (since e > I). Hence we have the following theorem. Theorem 2. Let K be a real quadratic field with discriminant D and character X. The number" given by (4.4) is a unit in K, and is related to the fundamental unit e > 1 by h
e = ", where h is the number of divisor classes of K.
In spite of its simple formulation, there has never been an elementary proof of Theorem 2. Further, it has not been proved by purely arithmetic methods even that" > I. From the inequality" > 1 we can deduce some consequences on the distribution of quadratic residues modulo a prime p == I (mod 4).
5
Sec.4]
:i
The quadratic field R( has discriminant p and its character X(x) coincides with the Legendre symbol (xlp). Therefore we have the inequality
)
DIVISOR CLASSES OF QUADRATIC FIELDS
345
Ji>
)
. 7T.b TI' 7T.Q TI SIO-p> aSIO-, p
1
b
where Q and b run through the quadratic residues and nonresidues, respectively, in the interval (0, pI2). Since the function sin x is monotone on the interval (0, 7T./2), it follows from this inequality that the values of the 7T.blp are "on the average" greater than the values 7T.Qlp, that is, that the quadratic residues modulo p "cluster" at the beginning of the interval (0, pI2), and the nonresidues at the end [when p == I (mod 4) precisely half of the numbers in the interval (0, p12) are quadratic residues]. For prime numbers p == 3 (mod 4) we can obtain information on the distribution of residues and nonresidues by considering formula (4.3) for the field R(J -p). First, we put formula (4.3) in simpler form in the general case. We denote IDI bym. First, assume that m is even. It is easily verified (Problem 9) that in this case X(x +mI2) = -X(x) and formula (4.3) gives us
hm
= -
= -
L
X(x)x -
L
X(x)x
L
X(x),
o <x<mj2
O<x<mj2
m
= -
+
+~)(x +~) 2 2
L
x(x
L
X(X)(X
L
X(x).
0 <x<mj2
O<x<mj2
+ ~) 2
20<x<mj2
so that 1
h=-
2 0 <x<mj2
Note that since m is even, X(2) = 0. Now let m be odd. Since the character X of an imaginary quadratic field is odd, that is, X( -1) = -1 (we have already noted that this will be proved in the following section in Theorem 6), then it follows from (4.3) that hm
= -
L
o <x<mj2
= - 2
X(x)x -
L
x(m - x)(m - x)
0 <x<mj2
L X(x)x + m0 <x<mj2 L X(x). 0< x<mj2
(4.5)
ANALYTIC METHODS
346
[Chap. 5
On the other hand,
hrn = -
L
X(x)x -
O<x<m x even
=
-4
L
x(rn - x)(rn - x)
O<x<m x even
L X(2x)x O<x<m/Z
+ rn
L X(2x), O<x<m/Z
so that
hrnx(2) = - 4
L X(x)x o <x<m/Z
+ rn 0
L X(x). <x<m/Z
(4.6)
Equating the sums LX(x)x in (4.5) and (4.6), we obtain the equation
h(2 - X(2») =
L X(x). 0< x< m/Z
Since this equation also holds for even m [when 21 m, then X(2) = 0], we have the following theorem. Theorem 3. For an imaginary quadratic field with discriminant D < - 4 and character X we have the following formula:
h=
1
L
2 - X(2) 0 <x< IDI/Z (x,D)= 1
X(x).
(4.7)
We now apply Theorem 3 to the case of the field R( J -p), where p is a prime of the form 4n + 3. Since -p == 1 (mod 4), in this case D = -p and the value of the character X(x) coincides with that of the Legendre symbol (x/p). The number of summands in Lo<x
:1
,
,I· !]
1 3
= -(V
- N)
for p == 3 (mod 8).
It clearly follows from Theorem 4 that V> N. Thus if p is a prime of the form 4n + 3, then the quadratic residues outnumber the nonresidues on the interval (0, p/2) [by a number divisible by 3 if p == 3 (mod 8) and p i' 3]. This assertion, despite its simplicity, lies among some very deep results of number theory. It was obtained by us as a simple corollary of the fact that the number h, and hence the expression on the right in (4.7) is positive. However,
DIVISOR CLASSES OF QUADRATIC FIELDS
347
1.5
Sec. 4]
6)
the sign of this expression depends on knowledge of the value of the Gaussian sum 'l(X), and we shall see in Section 4.3 that the determination of the sign 'l(X) is a very difficult problem. If D == 1 (mod 8), the formula for the number h for an imaginary quadratic field can be proved by purely arithmetic methods. This was done by B. A. Venkov. His proof was based on the theory oLthe representation of binary forms by sums of squares of linear forms and on some delicate properties of continued fractions [B. A. Venkov, On the number of classes of binary quadratic forms with negative determinant. I and II, Izv. Akad. Nauk SSSR Ser. VII, No. 4-5, 375-392 (1928); No. 6-7, 455-480 (1928)]. In the case D == 1 (mod 8), as in the case of real quadratic fields, a purely arithmetic derivation of the formula for h has never been obtained. Also there is no known elementary proof of the fact that for a prime p of the form 8n + 7 the interval (0, p12) contains more quadratic residues than nonresidues.
re
4
a :i
II
Remark. Letp be a prime of the form 8n + 7. It can be shown by elementary means (Problem 7) that the interval (0, p12) contains just as many odd quadratic residues as odd nonresidues. Hence the number h for the field R(J - p), p == 7 (mod 8), is also given by
h= V* - N*,
where V* and N* denote the number of quadratic residues and nonresidues among the even integers in the interval (0, pI2).
I,
I
4.2. The Character of a Quadratic Field
We shall prove those assertions about the character of a quadratic field which were used in Section 4.1. Theorem 5. The character X of a quadratic field with discriminant D is primitive (with modulus ID I). Proof. By Theorem 4 of Section 5 of the Supplement it suffices to show that for any prime number p which divides D there is an x such that (x, D) = 1, x == 1 (mod ID lip) and X(x) = -1. First, consider the case p i' 2. Choose any quadratic nonresidue s modulo p and pick x as a solution to the system of congruences
x == s (mod p),
348
ANALYTIC METHODS
[Chap. 5
Using formula (8.5) of Chapter 3 it is easily checked that X(x) = (xlp) = = -1. Now let p = 2. If d == 3 (mod 4), D = 4d, then, solving the congruences
(sip)
x == 3 (mod 4), x == I (mod 2Id!),
we shall also have X(x) = (_I)(;c-l)/2 = -1. If d = 2d', D = 4d = 8d', then for the number x, given by x == 5 (mod 8), x == 1 (mod 4Id'!),
we shall have X(x) = = -1. We have shown that Xis primitive. (_1)(;(2- 1 )/8
Theorem 6. The characters of real quadratic fields are even and the characters of imaginary quadratic fields are odd. Proof. Let X be the character of the quadratic field R( .Jd). We compute X( -I), using (8.5) of Chapter 3. If d == 1 (mod 4), then
X( -
I)
= (~/) = (_1)(ld l -l)/2 = (-
1)l(d-l)/2 J +[(ld l -llf2].
If d == 3 (mod 4), then X( - I)
= _ (~II) = - (-
l)
= (_1)[(d-l)/2 J +[(ld l -l)/2 J•
Finally, if d = 2d', then X( -
I) = (_I)(d'-1)/2
(I~'~) =
(_I)[(d'-1)/2 J +[(ld'I-1)f2].
But if a is odd, then for a> 0, for a < 0. Hence in all cases x( - I) = {
1 -I
Theorem 6 is proved.
for d > 0, for d < 0.
.5
~n
'-
Sec. 4]
DIVISOR CLASSES OF QUADRATIC FIELDS
349
4.3. Gaussian Sums for Quadratic Characters In deriving formulas for the number of divisor classes of a quadratic field, we used a formula for the value of the normed Gaussian sum r(x). Recall that the Gaussian sum ra(x) of the character X modulo m is called normed if ( = cos 2n/m + i sin 2n/m is taken as the primitive mth root of 1 in its definition (see Section 2.4). We now consider the computation of the value of r(x). By Theorem 5 the character X of the ql.ladratic field R( -}d) with discriminant D is a primitive numerical character modulo IDI. Also it satisfies the condition X2 = Xo , where Xo is the unit character. This simply means that the character X takes the values ± 1 (and, of course, zero). Definition. A nonunit numerical character X is called quadratic if X2
= Xo'
We shall show that every primitive quadratic character is the character of a quadratic field. By Problem 8, primitive quadratic characters occur only for moduli of the form rand 4r (one character for each) and 8r (two characters), where r is an odd square-free natural number. The set of these moduli ,hence coincides with the set of numbers of the form ID I, where D is the discriminant of a quadratic field. We note that when ID I = 8r there are two quadratic fields; R( -}2r) and R( -}'--' 2r), which have distinct characters, since one is even and the other odd. Hence each primitive quadratic character is the character of a quadratic field. The value of the Gaussian sum for primitive quadratic characters is determined by the following theorem. Theorem 7. Let X be a primitive quadratic character modulo m. Then the normed Gaussian sum rl(X) = r(x) satisfies
if X( - 1)
= 1,
if x( - 1)
=-
1,
Proof. We shall give the full proof of Theorem 7 only in the case of odd prime modulus p, since this case contains most of the essential difficulties. The transition to the general case is relatively easy. At the end of the proof we sketch this transition. Hence let p be an odd prime and set ( = cos 2n/p + i sin 2n/p. Since the nonunit quadratic character X modulo p coincides with the Legendre symbol (x/p) (Problem 4 of Section 2 of Chapter 1), then the normed Gaussian sum r(x) is given by
[Chap. 5
s
(the prime on the summation sign indicates that x runs through a reduced residue system modulo p). We find the complex conjugate r(x). Since ~ = C 1, then
I I 1
350
ANALYTIC METHODS
(4.8) On the other hand, by Theorem 4 of Section 2 of Chapter 1, r(x)r(x) = p.
(4.9)
From (4.8) and (4.9) it follows that r(x)2 =
(~
l)p = (_ 1)(P-l)/2p,
and hence
= 1 (mod 4), if p =3 (mod 4).
±Jp
r(x)
if p
= { 'J~
±I
P
(4.10)
To complete the proof of Theorem 7 (for m = p) we need only determine the signs of and i But it is precisely here that the principal difficulty of the proof lies. We represent the sum r(x) in another form. Let a run through all quadratic residues modulo p, and let b run through all nonresidues. Then
Jp
Jp.
But so that a
If x takes the values 0, 1, ... , p - 1, then, modulo p, x 2 takes the value and each quadratic residue twice. Hence we can write r(x) in the form
°
once
p-l
r(x) Now consider the matrix
=
L
x=o
C
2 •
(4.11 )
DIVISOR CLASSES OF QUADRATIC FIELDS
5
Sec. 4]
:d
By (4.11) the Gaussian sum T(X) coincides with the trace of the matrix A. Hence if we denote the characteristic roots of A by AI' ... , Ap , then we will have
l)
(4.12)
~)
351
The computation of T(X) hence reduces to the finding of the characteristic roots of the matrix A. We square the matrix A. Since p-l
L
Ct(IY
=
t=O
p-l
L
(t(x+ y )
=
{p 0
1=0
+ y == 0 (mod p), for x + y ;f:. 0 (mod p),
for x
then
)
A2 =
f
P
0
0
0
0
P
0
P
0
As is well known, the characteristic roots of the matrix A 2 coincide with the squares (4.13) of the characteristic roots of A. But the characteristic polynomial of A 2 is easily computed. It equals (t _ p)
+ pYP-l)/2.
Hence the sequence (4.13) contains (p + I)/2 numbers equal to p, and (p - 1)/2 numbers equal to -po Hence each Ak is one of the numbers ± ..)p, ± i..)p, and if a, b, c, and d denote the multiplicities of the characteristic roots ")p, - ..)p, i ..)p, and - i ..)p, then p+l
a+b=-2 '
p-l c +d=-2-'
(4.14)
The sum (4.12) can be represented in the form T(X) = (a - b + (c - d)i)
..)j).
(4.15)
Comparing with (4.10), we find that a - b a
= ± 1,
C
=d
= b, c - d = ± 1
for p == 1 (mod
4),}
for p == 3 (mod 4).
(4.16)
352
ANALYTIC METHODS
[Chap. 5
To determine the multiplicities a, b, C, and d, we must find another relation among them. We compute the determinant of the matrix A. Since det(A 2 ) = pP( -1)P(P-l)/2, then (4.17) The determinant det A is a Vandermonde determinant. Introducing the notation 1/ =- cos nip + j sin nip, we have
!
TI
det A =
1
((' -
(S)
= TI 1/r+s(1/r-s -
p-l~r>s~O
= TI 1/r+s TI r>s
=
r>s
(2i sin _(r_---.-:s)_n) P
'p(p-l)/22 P (p-l)/2
I
s)n TI S . ( I'i-n , p
r>s
since
J:,
p-l
p-I r - I
(I' + s)
= r~1 s~o (I'
1/-(r-s»)
r>s
1'(1'
P-I(
+ s)
= r~1
1'2
+
-1)) = 2p (P-2_
1)2
2
is divisible by 2p. We compare this expression for det A with (4.17). Since siner - s)rrlp > 0 for 0 ~ s < I' ~p - I, we must have the plus sign in (4.17). Thus det A = i P(P-I)/2 p P/2. On the other hand, we have det A =
P
TI Ak = (- I)bn -
i)d p P/2 = i 2b + C - dpp/2.
k=1
This yields the congruence
2b
p-I
+ C - d == P-2- (mod 4),
from which, in view of (4.14) and (4.16), we deduce a - b
= P + 1 - 2b 2
p+1
p-I
== - - - - - = 1 (mod 4) 2 2 C -
I
: I
I: '
for p == 1 (mod 4),
d == _ p - 1 + 2b 2
p-l
p+l
2
2
= - - - + - - = 1 (mod 4)
for p == 3 (mod 4).
i
Sec. 4]
DIVISOR CLASSES OF QUADRATIC FIELDS
353
These congruences show that the differences a - band c - d in (4.16) both equal + 1, and by (4.10) this finally yields
Jp iJP
rex) = {
for P
=1 (mod 4),
for P = 3 (mod 4).
This completes the proof of Theorem 7 for the case of a prime modulus m=p. To prove the theorem in the general case, use the result of Problem 4 of Section 2. If X is a primitive quadratic character modulo m, this problem shows that the normed Gaussian sum rex) can easily be expressed in terms of the normed Gaussian sums for the nonunit modulo 4, the two primitive characters modulo 8, and quadratic characters of odd primes p. Since we know all these Gaussian sums (see Problems 10 and 11 for the moduli 4 and 8), then the formula of Problem 4 of Section 2 allows us to give an explicit expression for rex). Suppose, for example, that we have the character X(x)
= (_1)[(X2-1J/8]+[(X-1J/2](~),
(x, 2r)
=1
modulo m = 8r, where r is an odd square-free natural number. If r = PI ... Ps' then X has the representation x(x)
= (_1)[(X'-lJ/8
+[(X-lJf2](;J ... (;J
Let (X be the number of primes among PI' ... ,Ps which are congruent to 3 modulo 4. Then
rex) = 2iJ"i iaJ;( _1)[(r2-1J/8]+[(r-1J/2] (~) TI (Pk) r
k*i Pi
= ja+ 1~( _1)[(r-1J/2]+C~2 = J;'ja+ 1 +2a+a(a-1J
if X( -1) = ( _ly-1 = 1, if X( - 1) = ( -
1)a + 1 = -
1.
The Gaussian sums for the other primitive quadratic characters are treated analogously. This proof of Theorem 7 (for prime moduli) is due to Shur. Another proof, found by Kronecker, is given in Problems 13 to 16. PROBLEMS
1. Knowing 1ha1 (I + V5)/2 = 2 cos 7T/5 is a fundamental unit for the field R(V5), use formula (4.2) to compute the number h for this field.
II ,
354
ANALYTIC METHODS
[Chap. 5
2. Compute the number h for the fields R( V - 5) and R( V- 23). 3. Show that a quadratic field with discriminant D is a subfield of the mth cyclotomic field, where m = IDI. 4. Let p be an odd prime, and let' be a primitive pth root of 1. Show that the cyclotomic field R(') contains one and only one quadratic subfield. This subfield is R(Vp) if p == 1 (mod 4), and R(V -p) if p == 3 (mod 4). (To solve this and succeeding problems, use the fundamental theorem of Galois theory.) 5. Let p == 1 (mod 4) be a prime, and define the number
. 7Tbj TIsm-, . 7Ta TIsmp
b
P
•
where a and b run through the quadratic residues and nonresidues modulo p in the interval (0, p/2). Without using Theorem 2 show that this number is a unit of the quadratic field R( Vp), and that its norm is -1. 6. Using ,the second assertion of Problem 5, show that the number of divisor classes in the field R( Vp), p a prime, p == 1 (mod 4), is odd and that the norm of a fundamental unit of this field is - 1. 7. Let p be a prime number of the form 8n + 7. Show that precisely half of the even numbers in the interval (0,p/2) are quadratic residues modulo p. 8. If there is a primitive quadratic character with modulus m, show that m is of the form r, 4r, or 8r, where r is an odd square-free natural number. Further, show that every primitive quadratic character is of the form ! ,
X(x) =
X(x)
U}
for the modulus r,
(x, r) = I
= (-l)(~ -il / 2(:;), (x, 2r) =
1
X(x) = (_l)(.>-ll/S(n,
for the modulus 4r,
} (x, 2r) = 1
X(x) =
(_li(~>-')/84[('-1l/2l(~),
for the modulus 8r.
9. If X is a primitive quadratic character with even modulus m (m show that X( x
+
i)= -
(_l)(~2-11/8
and
XH(x) =
4r or 8r with odd r),
X(x).
10. Let X be the character modulo 4 given by X(x) the normed Gaussian sum 'T,(X) equals 2i. 11. Consider the primitive characters x'(x) =
=
= (-l)(~ -Il / 2, (x,
(_l)[(~2-11/8l+[('-')/2J
Verify that the normed Gaussian sums are 'T,(X') = 2v2, and 'T'(X 12. Give the proof of Theorem 7 for arbitrary moduli.
2)
=
1. Show that
(21' x) modulo 8.
H )
=
2iV2.
Sec. 5]
DIVISOR CLASSES OF PRIME CYCLOTOMIC FIELDS
355
13. Let p be an odd prime and set' = cos 27r/p + isin 27r/p. Let <.-0/2
o= n
(,% _ , -X).
X=l
Show that
02 = (-1)<' -O/2p • Thus 0 2 coincides with the square r 2 of the Gaussian sum r 14. Using the same notations, show that for p for p
=
L~: ~ (x/p)'x.
== 1 (mod 4), == 3 (mod 4).
Further, setting A = 1 - " show that the congruence
(~2) 0 ==
(p ~ I)!
,\<. -0/2
(mod
,\<.+0/2).
n
holds in the order Z[ 15. Verify the congruence
in the ring Z[
n
Hint: Decompose the sum L~: ~ xc. -1)/2 (1 - A)% into powers of A, using the fact that • - 1
{O (mod p)
X=1
-1 (modp)
LX"'=:
for 0 < m < p - 1, for m =p-l.
16. Use the two preceding problems to show that for p for p
== 1 (mod 4), == 3 (mod 4).
5. The Number of Divisor Classes of Prime Cyclotomic Fields
5.1. The Decomposition of the Number h into Two Factors
The formulas (2.16) and (2.17) for the number of divisor classes of the mth cyclotomic field do not contain any infinite series or products. But they are somewhat unsatisfactory in that they express the number h of classes, which is of course a natural number, in terms of irrational and complex numbers. In this section we shall put these formulas for h in a more complete form, limiting ourselves to the case of prime cyclotomic fields. Hence let I = 2m + 1 be a prime number, and let K = K«() be the Ith cyclotomic field. For ease of computation we shall assume that K is a subfield
356
ANALYTIC METHODS
[Chap. 5
of the field of complex numbers, and that ( = cos 2n/1 + i sin 2n/1 (the value of ( needs to be precisely fixed for analytic computations). We compute for K the terms in the product in (2.16). Since the degree(K: R) is 1- 1(corollary of Theorem 1 of Section 2) and all isomorphisms of K into the complex field are complex (they are simply automorphisms of K), then s = 0 and t = (/- 1)/2 = m. By Lemma 3 of Section 1, Chapter 3, the number w, the number of roots in 1 in K, equals 21. The norm of the principal divisor I = (1 - () equals N(I) = N(1 - () = 1 [see (1.5) of Chapter 3], so that the divisor I is prime, and by Lemma 1 of Section 1, Chapter 3, the number 1 has the factorization 1= Hence the factor F(s) in (2.12) equals
1'-1.
(1 __ 1 )-1(1 _.!:.)I S1'
F(s) =
=
N(I)'
We turn to the computation of the discriminant of the field K. Theorem 1. The numbers
1, (, .. , , (/- 2 form a fundamental basis for the lth cyclotomic field K = R(C). Proof. If s 1= 0 (mod I) the characteristic polynomial of the number (S is X I - 1 + X I - 2 + .,. + Xl + 1, so that
Sp (' =
-1
if s 1= 0 (mod I),
{ 1-1
if s == 0 (mod I).
(5.1)
Let (a i
E
R)
be any integer of the field K. We must show that all the coefficients a l are rational integers. Since aC k - a( is an integer, then the trace 1-2
Sp(aC is a rational integer (0 the number fa = bo
~
k
k
aC)
-
~
=
lak -
1-2
L a; + ;;0 L a; = lak ;;0
1 - 2). We set la k = bk
+ b1' + .. , + b,_2('-2
= Co
,
1 - ( = A and consider
+ CIA + ." + C, _ 2"t'-2,
where the bk and Ck are all rational integers. We shall show that all the coefficients Ck are divisible by I. Suppose this has been established for C1' ... , Ck - 1 (0 ~ k < 1- 2). Consider the last equation as a congruence modulo Ak+ 1 (in the ring of integers of the field K). Since 1 == 0 (mod Ak+ 1) (Lemma 1 of Section 1, Chapter 3), then this congruence yields Ck Ak
== 0 (mod Ak + 1),
.5
Ie K of e
(2
:>f Is
Id
Sec. 5]
DIVISOR CLASSES OF PRIME CYCLOTOMIC FIELDS
357
so that Ck is divisible by A. and hence also divisible by / (Lemma 2 of Section 1, Chapter 3). But then all the coefficients bk must also be divisible by /, so that all ak are integers. Theorem 1 is proved. Corollary. The discriminant of the Ith cyclotomic field (I> 2) equals ( _1)(1-1)/2//-2.
Far by formula (5.1) the discriminant of K equals the determinant -1
-1
-1
-1
-1
1-1
-1
1-1
-1
(instead of the basis of Theorem 1, we take here the basis" (2, .. , , (1-1). Formula (2.16) for the case of the Ith cyclotomic field can now be written 11/ 2
h = 2m -
s
)
mR
I
1r
TI
L(l, X),
(5.2)
X*Xo
where R is the regulator of the field K, m = (/ - 1)/2 and X runs through all numerical characters modulo /, except the unit character Xo' Since all terms in the formula (5.2) outside the product sign are real and positive, the formula will remain valid if each term L(l, X) is replaced by
IL(l, x)l. For a prime modulus all nonunit numerical characters are primitive. Therefore we can apply Theorem 3 of Section 2. To do this we must separate the even and the odd characters. Let g be a fixed primitive root modulo / (that is, g generates the cyclic group Gl of residue classes modulo I), and let () be a primitive (1- 1)th root of 1. The group of numerical characters modulo / is cyclic and has order / - 1. If we denote by X that character modulo / for which
then its powers X, X2, ... , i-I = Xo will comprise the entire group ofcharacters modulo /. Since
x5( -
1)
= X(g(l-I)/2)5 = 0-(1-1)/25 = (-1)'
then each character of the form X2k - 1 will be odd, and each of the farm X2k will be even. Using formula (2.20) and Theorem 4 of Section 2 of Chapter 1, we obtain
358
ANALYTIC METHODS
for the even characters
lk [1
[Chap. 5
~ k ~ (l- 3)/2]:
IL(l, xZk)1 = IT(XZk)II'fiZk(g')lnll _ (g'll
t
,=0
I
= -I 8Zk ' I \'-Z
Jt
Setting r lation
= [(1-
1)/2]
+ S,
Inll - (g', .
,=0
where 0
~ S
< [(l- 1)/2] = m, and using the re(5.3)
we obtain and thus
I
2 \ m-I IL(l, xZk)1 = -I 8 Zk 'Inii _ (g', .
Jt
,=0
We can apply formula (2.21) to the odd character XZk - 1 in an analogous manner. Let U. denote the smallest positive residue of 9' modulo l. Then I-I
I
iZk-l(r)r
=
,=1
I-Z
I
XZk-l(gS)-lgs
s=o
=
I-Z
I
gs8(Zk-I)S
= F(8 Zk - I ),
.=0
where F denotes the polynomial F(X)
=
IL(I, XZk - I )!
=
I-Z
I
s=o
g.X'.
Hence
TC~tlF(82k-')I.
Substituting these values for IL(I, xZk)l, I ~ k ~ m- I, and IL(I, xZk-')I, I ~ k ~ m, in (5.2), we obtain (5.4)
where m 1m 1
ho = h*
2 -
R)J-
Im-1 Zk ,~o 8 ' Inll -
I
(g'l ,
= __ I -IF(8)F(03) ... F(8 - Z )I. ' (2l)m-'
(5.5)
(5.6)
In the following sections we shall show that both ho and h* are natural numbers. Hence formula (5.4) gives us a representation of the number h as a product of two natural numbers.
.5
Sec. 5]
DIVISOR CLASSES OF PRIME CYCLOTOMIC FIELDS
359
Remark 1. Sometimes h* is denoted by hI' and h o by h 2 , and they are called theftrst and second factors of h. Remark 2. The factor h o equals the number of divisor classes of the subfield R«( + C l ) of degree (l- 1)/2, which consists of all real numbers of the field R(O (see Problems 1 to 4).
e5.2. The Factor h o 3)
To shorten our formulas, we set ~
(r
0).
From (5.3) we see that am+r = ar . This means that the value of ar depends only the residue of r modulo m = (l- 1)/2. If we set
Yl(m:t (Jur ar ) , k=1 r=O l
A= IS
then (5.5) can be written in the form
2m -
l
hO=R 1A1 .
(5.7)
We shall show that the product (ao
+ a l + ... + a m -
l
)A
equals, up to sign, the determinant
!1
= det(ai+)O<;i,j<;m-l =
al
Consider the cyclic group G or order m, generated by (J2, which is a primitive mth root of 1. For 0 ~ k ~ m - 1, set Xk((J2r) = (J2rk. The function Xk is clearly a character of the group G. We also define a function f on G, by setting f«(J2r) = ar . By Problem 13 of Section 5 of the Supplement, our product takes the form
JXCt>2kr ar ) = JJ:Ct>k«(J2r)!«(J2r))
= det(f«(J2(i-il» = det(ai-j)O<;i,j<;m-l' Since the matrices (a i - ) and (a i +) differ only in the order of the columns, we have shown the desired result.
360
The sum ao
ANALYTIC METHODS
[Chap. 5
+ a1 + .. , + am -1 is nonzero, since ao + a 1
1((1 - I= lnJT
+ ... + a m - 1 = In I
(g')
(5.8)
by (1.5) of Chapter 3 and (5.3). Hence we can divide the determinant .1 by (5.8) to obtain a new expression for A. If we add every column in .1 to some fixed column, we obtain a column in which every entry is equal to (5.8). Hence, up to sign, A equals the determinant .1', obtained from.1 by replacing every element in one column by 1. If we now subtract the first row from all other rows, we see that IAI equals the absolute value of any minor of order m - 1 of the matrix (5.9) Consider the number rj
= _ ({/ + 1 )/2 = cos ~ + j sin ~ I
I
which is a primitive root of degree 21 of 1. Since 1_(k k_1rjk_rj-k k_1 --=rj =rj 1- ( rj - rj 1
rj2
= " then
sin (kn/l) sin(n/l) .
For k 1= 0 (mod I) the number on the left is a unit of the field K (see the proof of Lemma 1 of Section 1, Chapter 3); hence the numbers
sin(k7r/l)
= -:--:-7.':"' . sin(Tr/ I)
(5.10)
()k
are also units of the field K for all k 1= 0 (mod I). These units clearly are real and positive. There are m = (l- 1)(2 pairs of conjugate isomorphism of the field K into the field of complex numbers. Since the numbers" (g, ... , (gm - I are all nonconjugate, then the isomorphisms (j
are pairwise-nonconjugate (each
Uj
= 0,1,
... , m - 1)
is conjugate to the isomorphism (-+ C
gi
=(gm+ i ).
Let r denote the absolute value of the smallest residue of g' in absolute value, modulo I. Then 1 _ 19' _~
_"'_ = +
1-(
-rj
g'-18_
"
Applying the automorphism u j to this identity, we obtain 1 _ (gr+J 1 _ (gi = ± (u j rj)9'-l U /O j ),
Sec. 5]
DIVISOR CLASSES OF PRIME CYCLOTOMIC FIELDS
361
and after taking the logarithm of the absolute value we find that ar + j
-
aj
= InluiOr)l.
(5.11)
We shal1 show that when r takes the values 1, ... ,m - 1, then r runs through 2, ... , m. For if gl == ±gj (mod I), with 1 ~ i ~j ~m - 1, then gi - i == ± 1 (mod I) and 0 ~ j - i ~ (l- 3)/2, which is possible only for j - i = O. Hence the values of r are pairwise-distinct, and since they satisfy 2 ~ r ~ m = (l- 1)/2 and there are m - 1 of them, then each of the numbers 2, '" ,m is some r. It fol1ows from (5.11) that the matrix (5.9) differs from the matrix (5.12) only in the order of the rows, and hence the absolute value IAI equals the absolute value of any (m - I)th minor of the matrix (5.12). We now turn to a system of fundamental units of the field K. By Lemma 4 of Section 1, Chapter 3, any unit of the field K is the product of a power of ( with a real unit. Hence the fundamental units 8 1 , ... ,8m - I can be chosen real and positive. Then any positive real unit can be represented in the form 81 C 1 . . . 8~~11 with the C1 rational integers. In this case the functions l/ct.), defined in Section 3.3 of Chapter 2, have the form lirx) = In luirxW = 21n luirx)l, 0 ~j ~ m - 1. With the fundamental units 81' •.• ,8m -I we form the matrix (5.13) Since the matrix (4.6) of Chapter 2 is obtained from (5.13) by multiplying al1 rows by 2, it fol1ows from the definition of the regulator R that the absolute value of any (m -l)th minor of the matrix (5.13) equals R/2 m - l . The units Ok of the form (5.10) for k = 2, ... ,m are real and positive, and they can be expressed as m-I
Ok =
n8/
kl
(k
= 2, ... , m),
1= 1
with
Cki
rational integers. Since m-I
InluiOk)! =
L ck1 lnlui8 JI
i= 1
the matrix (5.12) is the product of the matrix (ckJ and the matrix (5.13). It fol1ows that each (m - I)th minor of the matrix (5.12) equals the product of det(cki) with the corresponding minor of the matrix (5.13), and this means that
IAI
R = Idet(ck)1 2m -
I '
362
ANALYTIC METHODS
[Chap. 5
Comparing with (5.7) we obtain h o = Idet(ck)i.
Since all Cki are rational integers and h o #- 0, we have shown that h o is a natural number. Further, by Lemma 1 of Section 6 of Chapter 2, we have the following result. Theorem 2. The factor h o in the number of divisor classes of the lth
cyclotomic field equals the index (E: Eo) of th'e group Eo, generated by the units ()k
=
sin(kn/I) --:--;-""7:":"""
sin(n/I)
of the field K, in the group E of all positive real units of the field K. In view of Remark 2 at the end of Section 5.1 it is interesting to compare this result with Theorem 2 of Section 4. 5.3. The Factor h*
We shall show that the number h*, defined by (5.6), also is a natural number. The product B
= F«())F«()3) ... F«()l-Z)
is an integer of the algebraic number field R«()), where () is a primitive (1- l)th root of 1. Since the complex conjugate of ()k is ()l-l - k, then B is left fixed when () is replaced bye, and hence is a real number. Finally, we note that IBI = (hlh o)(2/)m-l is a rational number [see (5.4) and (5.6)]. It follows from these three facts that B is a rational integer. We now show that B is divisible by 2m - 1 and r- 1 (here 1 #- 2). As in Section 5.1 we let g. denote the least positive residue of g' modulo I, where 9 is a fixed primitive root modulo I. Since gm+.
°
+ g. == gm+. + g' = g'(g(l-l)/Z + 1) == (mod 1),
then gm+.+g.=l.
It follows that gm+. and g. differ by an even number. We shall now consider
congruences modulo 2 in the ring of integers of the algebraic number field R«()). Since ()m = -1, we have for odd k, m-l
F«()k) =
L (g.()k. + gm+.()k(m+.))
.=0
m-l
=
L (g. -
.=0
m-l
gm+.)Ok. ==
L ()k. (mod 2),
.=0
Sec. 5]
DIVISOR CLASSES OF PRIME CYCLOTOMIC FIELDS
363
so that F«(l) (I - Ok) == 0 (mod 2). This shows that the product B(I - 0) (I - ( 3 )
'"
(I - 01 -
2
I
)
is divisible by 2m • On the other hand, since 0 and 02 are primitive roots of degrees l - I and (l - I)/2, then
I - 1=
I-I m-1 = 11 (l - 02S),
1-2
11 (l
- Ok),
2
k=1
s=1
so that
(1 - 0)(1 - 0 3 )
•• ,
(I - 01-
2
)
= 2.
This shows that B is divisible by 2m -1. 1 , we first find the decomposition of the To show that B is divisible by number l into prime divisors in the field R(O). Since l is relatively prime to l - I and l == I (mod l - I), then by Theorem 2 of Section 2, the number l is the product of qJ(l- 1) distinct prime divisors, where the norm of each prime divisor equals l. Let q be one of these prime divisors. The numbers 0, I, 0, '" ,0' - 2 are pairwise-noncongruent modulo q (see the proof of Lemma 3 of Section 2), so they form a complete set of residues modulo q. Since
r-
1-2
I - gl-1 =
11 (l
- Ok g )
== O(mod I)
(5.14)
k=O
then q must divide one of the differences I - Okg. If I - Okg == 0 (mod q) and I - OSg == 0 (mod q), then Ok == Os (mod q), and this means that Ok = Os. Thus q divides one and only one of the differences I - Okg in (5.14). We shall show that k is relatively prime to l - 1. If (k, l - I) = d, then by raising the congruence 1 == Okg (mod q) to the power (l- 1)/d, we see that g(I-I)/d - 1 is divisible by q, and hence also divisible by l. But this is possible only if d= 1. Since there are qJ(l- I) prime divisors q of l, and qJ(l- I) numbers 1 - Okg in (5.14) with (k, l - I) = I, each of the numbers 1 - Okg is divisible by one and only one q. Denoting this prime divisor of l by qk' we have (5.15) and we note that if s is not relatively prime to l - I, then I - OSg is not divisible by any qk' Hence l can be represented as
1=
11
qk>
(k,l-l)=1
where k runs through a reduced system of residues modulo l - I.
364
ANALYTIC METHODS
[Chap. 5
We shall now show that B is divisible by l'" -1. In the ring of integers of the field R(O), we have the congruence 1-2
F(ok)(l - gOk)
= L (gOk)'(l -
gOk)
.=0
= l_(gOky-l = l_gl-1 =O(modl), so FW)(l - gOk) is divisible by I. From the above results we see that F(Ok) isdivisibleby/if(k,I...,-I» l,andby/qk-1if(k,I-I)= I.If(k,I-I» 1, let qk denote the unit divisor. Then F(Ok) is divisible by Iqk -1 for all k. Hence the product B = F(0)F(03) ... F(01-2) is divisible by 1m
fl
qk- 1 = 1m
k=1.3 •...•1-2
fl
(k.l-l)=1
q; 1 = 1m -1 ,
which completes the proof that h* is an integer. 5.4. The Relative Primality of h* and I In Section 7.3 of Chapter 3 we saw how important it is to have criteria for determining whether h and I are relatively prime, that is, for determining whether the prime I is regular. Since h = hoh m , then I will be a regular prime if and only if neither of the factors h o , h* is divisible by I. In this section we shall find a condition which is necessary and sufficient for I not to divide h*. In the next section we shall show that if I does not divide h*, then it also does not divide h o , so our condition will turn out to be a criterion for the regularity of I. Preserving the notations of Section 5.3, consider the expression B
Im -
1
fl
k=I.3 .....1-2
(5.16)
[here we identify the principal divisor «(X) with the number (Xl In view of (5.6) the number h* is divisible by I if and only if the rational integer (5.16) is divisible by all prime divisors q.. with (s, 1- 1) = 1. In particular, the number (5.16) will be divisible by ql _ 2 = q-1' so that at least one of the integral divisors FW)qkl-1 (k = 1,3, ... ,1- 2) is divisible by q-l' For this to happen it is necessary and sufficient that the divisor F«(Jk)qk be divisible by q:l' We shall show that this cannot happen for k = 1- 2 = - 1 (mod 1- 1). By (5.15), 0-l g = 1 (mod q-l), so that 1-2
F(O-I) = L(0-l g )'=1-1= -1 (mod q-l), r=O
that is, F(O-I) is not divisible by q-l' and this means that F(O-I)q_l is not
p.5
Sec. 5]
the
divisible by q:l' Thus for h* to be divisible by I it is necessary and sufficient that FW) be divisible by q:l for some k = 1,3, ... ,I -4. Up to this time we have not imposed any restrictions on the choice of the primitive root g modulo l. Now we assume that g satisfies the congruence
365
DIVISOR CLASSES OF PRIME CYCLOTOMIC FIELDS
i-I == 1 (mod 12) (if g does not satisfy this congruence, replace g by g + xl with suitable x). Since the congruence (5.14) will now hold modulo fl, then 1 - (ig will be divisible by q/ for any k relatively prime to 1- 1. In particular,
() == g (mod
q:l)'
With this choice of g the condition for q:l to divide F«()k) can easily be found. Indeed, since 1-2
FW)
1- 2
= L gs()'k == L gsgsk (mod
q:
1), s=o s=o then the number F«()k) is divisible by q:l if and only if 1-2
L gsgSk == 0 (mod 12 ). s=o
'or tlg
In order to put (5.17) in more convenient form, consider the congruence.
ar
gs == gS
[n
.I
*,
:-
[)
+ las (mod 12 )
(0 ~ s ~ 1- 2),
(5.18)
where as is an integer. If we raise both sides of (5.18) to the power k (k = 1, 3, '" , 1- 4), then we find that
+ (k + I)gsk[a s == gs(k+l) + (k + l)gs\gs _
+1
g/+1 == gs(k+l)
that is, g/+1 == (k
i)
i) s e
(5.17)
+ l)gsgSk -
gS) (mod 12 );
kg s(k+l) (mod [2).
Summing (5.19) for s = 0, 1, ... , 1- 2 and noting that if + 1 k + 1 ~ 1- 3 and gl-1 == 1 (mod 12 ), we obtain 1- 2 g (1- 1 )(k + 1) I L gS(k+ 1) = - == 0 (mod [2) s=o l+I_1 and hence 1-2
y
1-2
L g/+ 1 == (k s=o
But k
+ 1 '" 0
+ 1) L
gsgsk (mod [2).
s=o
(mod I), and therefore (5.17) is equivalent to 1-2
1-1
Sk+l= Lgsk+1 = Ln k+1 ==0(mod[2). s=o n= 1
Hence we have proved the following theorem.
(5.19) '"
1 (mod I) for
ANALYTIC METHODS
366
[Chap. 5
Theorem 3. In order that the number h* not be divisible by I, it is necessary and sufficient that none of the numbers
(k = 2, 4, ... , 1- 3) be divisible by
(5.20)
F.
Note that each of the numbers Sk [k '" 0 (mod 1- 1)] is divisible by I [see (8.10)]. We reformulate Theorem 3' in terms of Bernoulli numbers (Bernoulli numbers will be defined and studied in Section 8). Since the numbers 2, 4, .. , , I - 3 are not divisible by I - 1, then by the theorem of von Staudt (Theorem 4 of Section 8) the Bernoulli numbers B 2 , B 4 , '" , Bt - 3 are I-integers (I does not appear in their denominators). Further, we have the congruence
(k = 2, 4, .. , , I - 3)
(5.21)
[in the ring of I-integral numbers; see (8.11)]. Hence the following theorem is valid. Theorem 4. In order that h* not be divisible by I, it is necessary and sufficient that the numerators of the Bernoullt numbers B 2 , B 4 , ••• , B t - 3 not be divisible by I.
For example, since the numerators of the numbers B 2 , B 4 , B 6 , B g , B lO
,
B12 , B14 are not divisible by 17, then 1= 17 is regular. Remark. To determine whether h* and I are relatively prime, it is not necessary to find the precise value of the Bernoulli numbers. It suffices to consider the recurrence relation (8.2) as a congruence modulo I and to use these congruences to compute the sequence B 2 , B 4 , ... , B t - 3' The number h* will be relatively prime to I if and only if none of these numbers is divisible by I.
PROBLEMS
1. Let K o be the subfieId of the Ith cyclotomic field R(~) which consists of all real numbers in R('). Show that K o = R(' + '-1) and is of degree (1- 1)/2. Further, show that the field K o has discriminant 1(1-3)/2, and that its regulator R o is related to the regulator R of the field R(') by R = 2(1-3)/2 R o • 2. Let p be a prime different from I, and let I be the smallest natural number for which p' == 1 (mod I). Show that the number p factors in the field Ko as the product of (1- 1)/21 prime divisors of degree I when I is odd, and as the product of (1- 1)/1 prime divisors of degree 1/2 when I is even.
Sec. 6]
A CRITERION FOR REGULARITY
367
3. Show that the '-function '''o(s) of the field K o satisfies lim (s - 1)'"o(s) =
n
L(l, X),
X*Xo
s-+1+0
X(-l)= 1
where X runs through all even numerical characters modulo /, except the unit character Xo . 4. Show that the real subfield R(' + ,-I) of the Ith cyclotomic field has h o divisor classes, where ho is the factor of the number of divisor classes of the field R('). 5. Show that h* is given by h*
=
(2/;m_Ildet(gm+I+) - gl+)O", I, Hm-,l,
where g, is the smallest positive residue of the number g' modulo / = 2m + I (g is a primitive root modulo I). 6. Compute the factor h* for / = 7. 7. Show that the prime number 37 is irregular.
6. A Criterion for Regularity
Our goal in this section is to show that when the factor h* of the number of divisor classes of the Ith cyclotomic field is not divisible by I, then the factor ho is also not divisible by I, and hence the prime I is regular. En route we shall also show that when I is regular, every unit of the field K = R(O which is congruent modulo I to a rational integer is an lth power. On this assertion, known as Kummer's lemma, is based the proof of the second case of Fermat's theorem for regular primes. Both the regularity criterion and Kummer's lemma will be found as simple corollaries of the following result. If I ,r h* and K. is the I-adic completion of the field K = R(O where 1 = (1 - 0, then the numbers log 0k l - 1 [k = 2, 3, ... , (1- 1)/2] form a basis for the set of all .. real" l-adic integers with zero trace [the units Ok are defined by (5.10)]. 6.1. The Field of I-adie Numbers
We know that the cyclotomic field K = R«), (= cos 2n/1 + i sin 2n/l, 3 a prime, has degree 1- 1 and that the number I has the factorization 1= 11- 1 , where 1 = (1 - 0 is a prime divisor of first degree. We consider the I-adic completion K[ of the field K. The elements of this completion are called l-adic numbers. The complete field K 1 contains a subfield which is canonically isomorphic to the field R, of I-adic numbers (this subfield coincides with the completion of the field R in K.). Using this canonical isomorphism we shall assume that R 1 c K[ . Since 1 is the only prime divisor which divides I, then by Theorem 1 of Section 2, Chapter 4, the degree of the extension Kd R 1 equals I - 1 = (K: R). Hence [see (2.6) of Chapter 4] for any r:x E K we have I
~
NK/R(r:x)
= NK[/RI(r:x)·
(6.1)
368
ANALYTIC METHODS
[Chap. 5
Lemma 1. There is an element A in the ring of r-adic integers such that:
+ 1 = 0, A == ( - 1 (mod
(1) A'-1
(2)
A2 ).
The element A is uniquely determined by (1) and (2). In view of (1.5) of Chapter 3 we have (1 _
1
0'
1
, 2 = (1 + ()(l + ( + ( 2 ) .,. (l + ( + ... + ().
We now consider congruences modulo the prime element 1 - ( of the field K 1 [recall that vI (1 - 0 = 1]. Since (== 1 (mod 1 - 0 and (1- I)! + 1 == o (mod I) (Wilson's theorem), then (l _
1
0
1
'
== 2·3 ... (1 -
1)
== - 1 (mod 1 - O.
We shall show that the r-adie unit
-1 a = .,..-,(1------:::-0"' I
i'
, I
'
which is congruent to 1 modulo 1 - (, can be represented in the form a = y'-t. Consider the polynomial F(X) = a. Since F(l) == 0 (mod 1 - 0 and P(1) ;;j:. 0 (mod 1 - 0, then there is a unit y in K for which F(y) = 0 (see 1)y, Section 1.2 of Chapter 4). Hence a = y'-t, as was claimed. Setting A = we obtain a prime element A with the desired properties. Any other number At, satisfying the first condition of the lemma, has the form A.8, where () is a (1- 1)th root of 1. From A.8 == A(mod A2) it follows that () == 1 (mod A). If the root () were different from 1, then 1- 1 would be divisible by A, which is impossible. Hence () = 1 and At = A. Lemma 1 is proved. From now on A will denote that prime element of the field K which is uniquely determined by the conditions of Lemma 1. For each k which is relatively prime to 1 the correspondence ( ~ (k determines an automorphism Uk of the extension K/R. If u is any of these automorphisms, then the function v'(a) = vI(u(a)), a E K, is a valuation of the field K, and is an extension of the I-adic valuation V, of the field R. But there is only one extension of V, to the field K, namely VI • Hence v' = VI' and this means that vI(u(a)) = vI(a) for all a E K. It follows from this that the automorphism u takes any Cauchy sequence of elements of K (relative to the metric which corresponds to the prime divisor I) to another Cauchy sequence in K. This allows us to extend the automorphism u = Uk to the field K I • Namely, if = limn .... ooa n (an E K), then we can set
x,-t -
«( -
e
uW = lim u(an)
1
Sec. 6]
A CRITERION FOR REGULARITY
369
[it is easily verified that u(e) does not depend on the choice of the sequence {an}, and also that the mapping -+ u(e) is an automorphism of the extension KdRd· Since the extension KdR, has degree of inertia 1 and ramification index l- 1, then by Theorem 4 of Section 1, Chapter 4, all I-adic integers can be uniquely represented in the form
e
ao + alA
+ ... + a l _ 2 A' - 2
(6.2)
where the a j are l-adic integers. The subfield of real numbers of the field K consists of all a E K which are left fixed by the automorphism u -1 : , -+ ,-1. We shall determine which I-adic numbers are invariant under the automorphism u -1 . Since A' - 1 = -l, then also .(U_1(A))'-1 = -l, and this means that U_1(A) = AO, where 0 is an (l- l)th root of 1. By Problem 4 of Section 3, Chapter 1, the root 0 is contained in R so that "
U_1 2(A) = U-1(U-1(A)) = U-1(OA) = OU_1(A) = 02 A,
and since also u _1 2 (A) = A, then 0 = ± 1. If 0 = 1, then any I-adic number which can be represented in the form (6.2) with l-adic coefficients ai, would be left fixed by the automorphism u -1' and this is not the case. Hence 0 = -1, and u -1 (A) = - A. Hence when the automorphism u -1 acts on the field K[, the I-adic numbers which are left fixed are those of the form m-I
I: biA
2i
i=O
( biE R "
m
1) .
1= -2-
(6.3)
The set of all such numbers is a subfield of K( of degree m = (/ - 1)/2 over R, . It will be convenient to call them" real" I-adic numbers. We compute the trace of the I-adic number (6.2) (relative to the extension KdR,). For any i = 1, ... , l - 2, the matrix of the linear transformation 2 -+ Aie E K() with respect to the basis 1, A, ... ,A - will have zeros on the ' main diagonal (since l'-l = -l), and therefore SPKIIR,(A i ) = 0 (for i = 1, ... , l - 2). It follows that the trace of the number (6.2) equals a o(/ - 1). The I-adic numbers with trace (over R,) equal to zero are thus characterized by having the coefficient a o equal to zero in (6.2). We shall be interested in the set 9Jl of all "real" I-adic integers with zero trace. It is clear from the above remarks that 9Jl coincides with the set of all linear combinations of the form
e
(e
m-1
I: b A
2i
i
(6.4)
i= 1
where the b i are l-adic integers. We consider the functions log e and exp a over the field K(, which are
370
ANALYTIC METHODS
[Chap. 5
defined by power series (see Section 5.2 of Chapter 4). Since the ramification index e of the extension KdR, equals 1 - 1, then the number [el(l- 1)] + 1 equals 2, and this means that the series exp IX converges for all integers IX E K[, divisible by ).2. As we know, the function log e is defined for all principal units of the field K I • If e is a principal unit of the field K, that is, e == 1 (mod ).), then for any automorphism Uk we again have uk(e) == 1 (mod ).), and this means that log uk(e) is defined. But then (Corollary 1 of Theorem 11, Section 2 of the Supplement), I-I SPKI/R, log e = Uk(lOg e) = log (uk(e)
L
L
k= 1
k
= log(I} Uk(e)) = log(N KI/R,e). Now assume that e is a unit of the field K. It is clear that e is also a unit in the field K, but log e is not necessarily defined, since e will not, in general, be a principal unit. But for some rational integer 0 which is not divisible by 1 we shall have e == 0 (mod ).). From 0 1- 1 == 1 (mod ).) it follows that el - 1 == 1 (mod ).); that is, el - 1 is a principal unit in K[. The logarithm log el - 1 is thus defined, and by formula (6.1) SPK(/R,(logel -
1
)
= log(NK[/R,e ' - 1) =
log(N K/Rel - 1)
= 0;
that is, the l-adic integer log /:;'-1 has zero trace. If e is a real unit of the field K, then it is clear that log el - 1 will be "real." Hence for any real unit e of the field K the l-adic number log el - 1 belongs to the set 9Jl; that is, it can be represented in the form (6.4). In particular, this holds for the units. Ok (k = 2, 3, ... , m = (1- 1)/2) defined by (5.10). Thus we have log 0k l -
1
=
m-l
Lb
;=1
ki
).2i
(2
~
k
~
m)
(6.5)
where the coefficients bki are I-adic integers. Our problem is now to show that when 1 does not divide h* (the factor of the number of divisor classes of the field K), then the l-adic numbers log 0k l - 1 form a basis for 9Jl over the ring of I-adic integers in the sense that any ~ E 9Jl has a unique representation as a linear combination of the log 0k'-1 with I-adic integer coefficients. To do this it clearly suffices to show that det(bkJ is an I-adic unit, that is, that det(bkJ 1= 0 (mod I).
6.2. Some Congruences The series for exp x in the field K( converges only for those integers x
Sec. 6]
A CRITERION FOR REGULARITY
371
which are divisible by A2 • We also consider the polynomial
obtained from the series for exp x by deleting all terms with degrees ~ I. Since the coefficients l/k! for k ~ 1- 1 are I-adic integers, then E(x) will be a principal unit of the field K, for all integral x == 0 (mod A). We know that the formal product of the series exp x and exp y equals the series exp(x + y). It follows that E(x)E(y)
= E(x + y) + F(x, y),
(6.6)
where F(x, y) is a polynomial with I-adic integral coefficients in which all terms have degree ~ I. Lemma 2. The congruence
E(A)' == 1 (mod A21 -
1
)
holds in the ring of I-adic integers. Set E(x)
where g(x) = 1 + x/2! + integer coefficients. Then E(x)'
= 1 + xg(x),
... + x ' - 2 /(/_
= 1 + Ct' xg(x) = 1 + lh(x)
1)! is a polynomial with I-adic
+ ... + C,'-I(xg(x»'-1 + xlg(X)'
+ xlg(X)I,
where hex) is also polynomial with I-adic integer coefficients. On the other hand, by (6.6) we see that E(X)I
= E(lx) + x'M(x),
and this means that lh(x)
Ix
=-
I!
(Ix) 2
+-
2!
(lX)'-1
1
+ ... + - - + x H(x) (I-1)!
'
(6.7) i I
where H(x) = M(x) - g(X)I. Looking at the coefficients for the various powers of x in this equation, we see that all coefficients of H(x) are I-adic integers divisible by I. Dividing (6.7) by I, we arrive at hex)
=
IX 2
x
+ 2! + ... +
11- 2 X 1-' (1- I)!
1
+ x G(x),
I;
372
ANALYTIC METHODS
[Chap. 5
where G(x) has l-adic integer coefficients. If we set x = A, we obtain the equation h(A) == A(mod AI), and this means that Lh(A) == LA (mod A21 - 1). (6.8) Further, since g(A) == 1 (mod A), then g(A)1 == 1 (mod AI) so that AIg(A)1 == AI(mod A21 ).
(6.9)
From (6.8) and (6.9) we obtain
(since lA
E(A)' = 1 + Lh(A)
+ Alg(A)' == 1 + LA + AI =
+ AI = 0), which
proves the lemma.
1 (mod A2I - 1)
, I
Lemma 3. The following congruence holds for any natural number k: E(kA) == Ck (mod AI).
It follows from formula (5.6) that E(kA) == E(A)k (mod AI), so it suffices to prove the lemma for the case k = 1. By the definition of the prime element A we have C== 1 + A (mod A2 ). On the other hand, E(A) == 1 + A(mod A2), and therefore
C 1 E(A) == 1 (mod A2 ). Set where y is an I-adic integer. Raising this equation to the Ith power and using Lemma 2, we obtain the congruence
Y(IA2 + L(l
~ 1) YA4 + ... + yl-1 A21) == o(mod A21-1).
The expression in parentheses is divisible by AI+ 1 (and by no higher power of A), so y == 0 (mod AI- 2), and
C 1E(A) == 1 (mod AI), which proves the lemma. We also consider the polynomial x2
L(l
+ x) = x -"2 + .,. + (-
obtained from the series log(l
+ x)
XI- 1
1)'-2
L_
(6.9')
1'
by deleting terms of degree
~
1.
Sec. 6]
A CRITERION FOR REGULARITY
373
Lemma 4. If the I-adic integer rx is divisible by A2 , then
L(1
Indeed, for n
~
+ rx) == log(1 + rx) (mod A').
1 we have
~
I
+ (n
- l)
l)n ( In I In n ) -- - In I 1- 1 n - 1
+ (l -
~
I
(see Section 5.2 of Chapter 4). Lemma 5. Let e1 and e2 be principal I-adic units. Then L(e 1 e2 )
Since the series log(l and log(l + y), then L(I
== L(e l ) + L(e 2 ) (mod A').
+ x + Y + xy) equals the sum of the series log(l + x)
+ x + Y + xy) = L(I + x) + L(I + y) + G(x, y),
where the polynomial G(x, y) contains only terms of degree ~ 1 and has I-adic integer coefficients. The assertion of Lemma 5 follows from the fact that G(x, y) == 0 (mod A') if x and yare divisible by A. Lemma 6. The congruence
L(O == A(mod A.' ) holds in the ring of I-adic integers. To prove this we use the formal equality log exp x easily that L(E(x))
=
x
= x. From this it follows
+ H(x),
where H(x) is a polynomial in which all terms have degree ~ I, and which has I-adic integer coefficients. Setting x = A and using Lemma 3 for k = 1 we obtain the desired congruence. Remark. Let U be the multiplicative group of cosets modulo A' in the group of all principal I-adic units, and let X be the additive group of cosets modulo A' in the group of all I-adic integers which are divisible by A. It is now easily seen that the mapping e -+ L(e) induces an isomorphism of the group U onto the group X. The inverse isomorphism X -+ U is induced by the mapping rx -+ E(rx) [rx == 0 (mod A)].
374
ANALYTIC METHODS
[Chap. 5
S.
6.3. A Basisfor the Real I-Adic Integers in the Case (h*, I) = 1
al
We return to the question which was raised at the end of Section 6.1. To determine whether the determinant det(bkJ is divisible by I, it is only necessary to consider the coefficients bki modulo I. It is clear that two I-adic integers of the form (6.2) are congruent modulo I if and only if their corresponding coefficients in the expansion (6.2) are congruent modulo I (in the ring of I-adic integers). Hence to find the bki modulo Iwe may replace the numbers log 0k l - I by any I-adic integers congruent to them modulo I (that is, modulo
w
AI-I).
We use the notations of Section 5.2. The principal unit 0/- 1 is real, hence congruent to 1 modulo A2 , so that, by Lemma 4, log 0k l We now compute L(Ok ' -
I
).
I
=L(Ok
l
-
l
)
(mod AI).
B
(6.10) Si
Since
Zl
1 o -~1'1 I-k , vk
k-v ~
-
then
A
0k l = (1 + (+ ... + (k-I)I( _1)I-k.
But (
=1 (mod A), so that
=k (mod A), (1 + (+ ... + (k-I)I =k (mod AI). ' 1+
and hence Since k l
( + ... + ,k-I
=k (mod AI-I), then also
(1 + (+ ... + (k-I)/
(t
Ie tl
=k (mod A'-I).
Thus
al
or
T
By Lemma 5 we have
L(Ok
1- I
- 1) ) =L -A-1) - L ('k ~ + (k (' -
1
B
I + L(~)(mod A1- I ). 1) -2-
But by Lemma 3, (k - 1 = E(kA) - 1 ( d "-I) kA kA mo A. ,
y
5
Sec. 6]
375
A CRITERION FOR REGULARITY
and, therefore, using Lemma 6, we obtain
I.
y
c
LUh'-I) ==
1) _~ _L(E(k~~ - 1) + k2.1. (mod ..1.'-1).
L(E(..1.~ -
We now show that
1) _: = mf X +X' k=I(2k)!2k 2k
x
s
)
B 2k
L(E(X) -
g
1R(x)
(6.11)
'
2
where the polynomial R(x) has l-adic integer coefficients and B 2k are the Bernoulli numbers (see Section 8). We use the identity
I
_x_= B nxn. ex-I n=on! Since B 1 = -1, and all remaining Bernoulli numbers with odd index equal zero, then our identity can be written in the form
~_!_!= 2
eX - l
x
I
B 2k x 2k k=I(2k)!
I•
After integrating we obtain eX - 1 x ;. B 2k 2k I n - - - - = L.. x x 2 k=1 (2k)!2k
(6.12)
(the constant term of the series equals zero, since for x = 0 the function on the left vanishes). The formula (6.11) now follows from (6.12). If we substitute the value k..1. for x in (6.11), we find that L
(
E(k,1,) k..1.
1)
k..1.
m-I
-"
-"2 = if-I
B .k 2i . 1. 2i
(d..1.' - 1) mo ,
21
(2i)!2i
and hence L(e~-I) ==
m-I
L
B (1 2i
k 2i )..1. 2i
~,.
(21).21
i= 1
(mod . 1.' -
1
(6.12')
).
This shows that the coefficients b kl in (6.5) satisfy bki ==
B2i (l - k (2i)! 2i
2i
)
(
(mod I)
2~k ~m
I- 1
= -2-' 1 ~ i ~
But then det(b ki ) is congruent modulo I to the determinant 2 2 - 1 2 4 -1 m-I
i[1
(-l)m- I B 21
32
-
1
34
-
1
2' - 3 _1 3'
3 -
1
(2i)! 2i
m 2 -1
m 4 -1
m' - 3
-
1
)
m - 1 .
376
ANALYTIC METHODS
[Chap. 5
We easily evaluate this determinant by reducing it to the Vandermonde determinant. It equals the product
Il
(r 2
S2)
-
l~s~,~m
= Il (r + s)(r -
s),
s
in which no factor is divisible by I. If h* 1= 0 (mod I), then the numerators of the Bernoulli numbers B 2 , ••• , B1 _ 3 are not divisible by I, and we find that ~•
I
det(b ki )
1= 0 (mod I).
We have proved the following theorem. Theorem 1. If h* 1= 0 (mod I); then the" real" I-adic integers with zero trace are uniquely represented as linear combinations m
L ak log Okl - 1
k=2
(6.13)
with I-adic integer coefficients.
6.4. A Criterion/or Regularity and Kummer's Lemma Theorem 1 allows us to prove easily the following theorem. Theorem 2. If the factor h* of the number of divisor classes of the lth cyclotomic field R(C) is not divisible by I, then the factor ho is also not divisible by I.
Proof. Assuming that ho = (E : Eo) is divisible by I (see the notations of Theorem 2 of Section 5), we can find a positive real unit e E E, which is not contained in Eo, but for which el E E. Then
(6.14) where the rational integers Ck are not all divisible by I (otherwise e would belong to Eo). Raising (6.14) to the power 1- 1 and taking the logarithm (in the field K.), we obtain m
flog el -
1
=
L Ck log Okl - 1 • k=2
(6.15)
Since the number log e l - 1 belongs to 9Jl, it has a representation in the form (6.13). Comparing this representation with (6.15), we conclude that all the expressions ckll are I-adic integers. But this is impossible, since not all Ck are divisible by I. This contradiction proves Theorem 2. i
I
II j
J
Sec. 6]
377
A CRITERION FOR REGULARITY
Corollary. The prime number I ~ 3 is regular if and only if the numerators of the Bernoulli numbers B 2 , B 4 , .•. , B, _ 3 are not divisible by I. Theorem 3 (Kummer's Lemma). Let I be a regular prime number. If a unit of the Ith cyclotomic field R(O is congruent modulo I to a rational integer, then the unit is the lth power of another unit. Proof. Let e == a (mod I). We first show that e is a real unit. If e = ,ke1 , with e1 a real unit, then e1 == b (mod ).2) with b a rational integer, and ,k == 1 + k)' (mod ).2). From a == b(1 + k).)(mod ).2) it follows that k == 0 (mod I), which proves our assertion. Since -1 = (-1)', then we can assume that e > 0; that is, e E E. From the congruence: e' - 1 == a ' - 1 == 1 (mod I) it follows that log e' - 1 == 0 (mod I), and therefore by Theorem 1, m
log e' - 1 = LIck log (h'-1,
(6.16)
k=2
with I-adic integral Ck • On the other hand, since the subgroup Eo is of finite index in E, then e E Eo for some natural number a, and hence Q
(6.17) with rational integers dk • We can assume that the set of numbers a, d2 , ••• , dm has greatest common divisor 1 (since the group E has no elements of finite order). Raising (6.17) to the power 1- 1 and taking the logarithm (in the field K 1), we obtain
=
a log e'- 1
m
Ld
k
log (h'- 1 •
k=2
Comparing with (6.16), we arrive at
dk = lack
(k
= 2, ... , m).
Since the numbers aCk are I-adic integers, then it follows that all dk are divisible by I, and this means that e is an lth power: e =e 1', where e1 E Eo. Since (a, d2 , .,. , dm ) = 1, then a is relatively prime to I, and by picking rational integers u and v such that 1 = au + lv, we find Q
Q
e = (e
Q )"(
eV )'
= (e 1"e v )',
which proves the theorem. PROBLEMS
1. Let p be a prime number of the form 4n + 1, ~ m = (p - 1)/2. Set
,= n
p-I
t= 1
Ol -CliP),
=
cos 27T/p
+ i sin 27T/p, .\ =
~ -
1,
378 where O.
ANALYTIC METHODS =
[Chap. 5
sin (kTr/p) [sin (7T/p»)-t, 1 .,; k.,; p - 1. Show that the congruence
holds in the pth cyclotomic field. Here L denotes the function defined by (6.9') and B m the Bernoulli number. [Use (6.12') and Problem 14, Section 4.) 2. Let E = T + uviJ> 1 be a fundamental unit in the quadratic field R( VP), where 1 (mod 4), and let h be the number of divisor classes of this field. Using the preceding problem and Theorem 2 of Section 4, show that
pc-=;
, ! !
hU oo=TBm (mod p)
!
(
1)
_ p m --2
(in the ring of p-integral rational numbers).
7. The Second Case of Fermat's Theorem for Regular Exponents
7.1. Fermat's Theorem Theorem 1. If the prime number I X'
~
3 is regular, then the equation
+ y' =
(7.1)
Zl
has no solution in nonzero rational numbers
X,
y, z.
Proof. Assume that
!
(I
I
j
I't '
I ll"
"
( ;:
~ I, I
I, j
j I
,I , I! 't !
1
X, y, z are relatively prime (nonzero) integers which satisfy (7.1). Since the first case of Fermat's theorem has already been treated in Section 7.3 of Chapter 3, we may assume that one (and only one) of these numbers is divisible by I. We shall let I divide z [if, for example, y is divisible by I, then we can write (7.1) in the form x' + (-Z)' = (- y)']. Let z = Ikz o , where (zo , I) = I, k ~ I. In the Ith cyclotomic field R(O, the number I has the factorization I = (I - 0 ' - 1 e, where e is a unit in R(O (Lemma I of Section I, Chapter 3). Hence we can put (7.1) in the form
x' ,
+ y' = e(l
- o'mzo',
(7.2)
where m = k(/ - 1) > O. To prove the theorem it suffices to show that an equation of the form (7.2) is impossible. We shall actually show somewhat more. Not only will we show that an equation of the form (7.2) is impossible in rational integers x, y, and Zo relatively prime to I, but even that it is impossible in integers of the field R(O which are relatively prime to I - C. Assuming the converse, we take that solution of (7.2) in which the exponent m ~ I is smallest. To avoid introducing new notation, we shall assume this solution to be given by (7.2). Hence x, y, and Zo denote integers of R(O which are relatively prime to I - C, and e is some unit of the field R(O.
1.5
Sec. 7]
379
FERMAT'S THEOREM FOR REGULAR EXPONENTS
Rm.
8m =re
ng
As in Section 6, I denotes the prime divisor (1 - 0 of the field We factor the left side of (7.2) into linear terms and then pass to the corresponding equation in divisors. We obtain 1-1 (x + y ) = lima', (7.3) k=a
n
,k
where the divisor a = (za) is relatively prime to I. Since 1m ~ I> 0, it follows from (7.3) that at least one of the terms on the left is divisible by I. But since
x
+ ,iy = X + 'k y
then all the numbers x
+ ,k y
are divisible by I. If for some x
+
°
°
~
'k y ==
_ (k(1 _ ,i-k)y,
(0 ~ k ~ 1 - 1)
(7.4)
k < i ~ 1- 1,
+
X
,iy (mod 1
2
),
2
then also 'ky(l - ,i-k) == (mod 1 ), and this is impossible, since ,ky is relatively prime to I, and 1 - ,i-k is associate with 1 - ,. Hence the numbers (7.4) are pairwise-noncongruent modulo t2, so the expressions x
1 -,
l)
h d
:e
e
"
e
)
+ ,k y
(k
= 0, I, ... ,1 - 1)
are pairwise-noncongruent modulo I. Since N(I) = I, these expressions form a complete set of residues modulo I, and hence one of them is divisible by I. It follows that one (and only one) of the numbers (7.4) is divisible by 12 • Since we may replace y by any of the numbers y in (7.2), we may assume that x + y is divisible by 12 , and that all the other numbers x + y are divisible by I but not divisible by 12 • Then the left side of (7.3) is divisible at least by 1'-11 2 = 1'+1, so that m > 1. Now let m denote the greatest common divisor of the divisors (x) and (y). Since x and yare not divisible by I, then m is not divisible by I. Then it is clear that (x + y ) is divisible by 1m, and (x + y) is divisible by II(m-1)+l m . We set
'k
'k
,k
(x
+ y) =
II(m-1)+l mCa , (k = 1, ." , 1- 1),
11
and we shall show that the divisors Ca, C1' '" ,CI-1 are pairwise relatively prime. Indeed, if Ci and Ck (0 ~ i < k ~ 1- 1) had common divisor p, then x + ,iy and x + ,ky would be divisible by Imp, so that ,iy(l - ,k-i) and x(1 - ,k-i) would also be divisible by Imp. But this would imply that x and y were divisible by mp, contradicting the choice of m. Writing (7.3) in the form m'I'mcaC1 ...
C'-l
= Ilmal,
,I ~
II I,
I r
I,
380
ANALYTIC METHODS
we deduce (since the
Ck
[Chap. 5
are pairwise relatively prime), that (0:::;; k :::;; 1- 1),
and this means that (7.5) (1 :::;; k :::;; 1 - 1).
(7.6)
Solving (7.5) for m and substituting in (7.6), we obtain (x
+ ,ky )I ' (m-1) =
(x
+ Y)(OkOo -1 y,
(7.7)
from which it follows that the divisors [(OkOO -1)/ are principal (since I = (1 - 0]. Now we use the regularity of I. Since the number of classes of divisors of the field R(O is not divisible by I, then by the corollary to Theorem 3 of Section 7, Chapter 3, the divisors 0kOO -1 are also principal; that is, °kOO -1
=
G:)
(1 :::;; k :::;; I-I),
(7.8)
where (Xk and Ih are integers of the field R(O. The divisors Ok (1 :::;; k :::;; 1- 1) and 00 are relatively prime to I, so we may assume that (Xk and 13k are not divisible by I. Principal divisors are equal if and only if the corresponding numbers differ only by a unit factor. Therefore by (7.7) and (7.8) we have (x
+ 'k
y)(1-
ol(m-1)
= (x +
y)(;:)'ek
(1 :::;; k :::;; 1- 1),
(7.9)
where ek is a unit of the field R(O. Now consider the following obvious equation: (x
+
Ifwe multiply it by (1 (x
+
'y)(1 + 0 - (x + '2 y)= '(x + y). o'(m-l)
and use (7.9) with k = 1 and k = 2, we obtain
y)(;:)'e (1 + 0 - (x + y)(;:)'e 1
2
= (x +
yK(1 -
o'(m-1),
so that
Hence we have shown that (7.10)
where (x, 13, and yare integers of R(O, not divisible by I, and eo and e' are units of the field RC'). We shall transform (7.10) to the form (7.2).
,I:
Sec. 7]
FERMAT'S THEOREM FOR REGULAR EXPONENTS
We have seen that m> 1, so that m - 1 > 0 and I(m - 1) means that a l + eo{31 == 0 (mod 11).
381 ~
I, and this
Since {3 is relatively prime to 1, there is a number {3' such that {3{3' == 1 (mod 11). Multiplying the last congruence by {3", we obtain eo == WI (mod II),
where w = -a{3' is an integer of the field R(O. Since N(I) = I, then any integer of R(O is congruent modulo I to a rational integer. If w == a (mod I), then l Wi == a (mod II), and this means that the unit eo is congruent modulo II to a rational integer. By Kummer's lemma (Theorem 3 of Section 6; here we again use the fact that the prime I is regular) the unit eo is an lth power in R(O, that is, eo = rJ l , where rJ is another unit of the field R(O. The equation (7.10) then takes the form l a + (rJ{3)' = e'(1 - Ol(m-l)yl. We have obtained an equation of the same type as (7.2), but the exponent m has here been replaced by m - 1. But this is impossible, since we chose m to be as smal1 as possible. This contradiction shows that the equation (7.1) has no solution in nonzero rational integers x, y, and z, one of which is divisible by I, that is, that the second case of Fermat's theorem holds for regular primes. Theorem 1 is proved.
7.2. The Infinitude of the Number of Irregular Primes In all existing tables the irregular primes are outnumbered by the regular ones. However, it is not known whether this is true for all intervals (1, N). Further, the infinitude of the set of regular primes is stil1 an open question. Hence the following theorem is of considerable interest. Theorem 2. There are infinitely many irregular prime numbers.
The proof of Theorem 2 is based on some properties of Bernoul1i numbers. These properties are formulated and proved in the following section. Let Pi> '" ,Ps be any finite set of irregular prime numbers. Theorem 2 will be proved if we can find an irregular prime number P, different frompl' ... ,Ps' Set n = r(Pl - 1) .. · (Ps - 1). Since the Bernoul1i numbers B lk have the property B 2k -
I2k l
-+00
as k -+
00
"
382
ANALYTIC METHODS
[Chap. 5
(see the end of Section 8), then we can choose r large enough so that Bnln has absolute value greater than 1. Let p be any prime number which divides the numerator of this number (in lowest terms). If (p - 1) I n, then by von Staudt's theorem (Theorem 4 of Section 8), p would divide the denominator of Bn , and this is not the case by choice of p. Hence (p - 1) I n, and p is different from Pl, ... , Ps (and different from 2). Let n = m + a(p - 1), with 2:::;; m :::;; p - 3 (note that m is even). We now use Kummer's congruence (Theorem 5 of Section 8), and obtain the congruence Bm
Bn
m
n
- == - (mod p)
in the ring of p-integral rational numbers. But Bnln == 0 (mod p), so Bmlm == '" ,p - 3, it follows from the corollary of Theorem 2 of Section 6 that the number p is irregular. Theorem 2 is proved.
o(mod p) and Bm == 0 (mod p). Since m is one of the numbers 2, 4,
PROBLEMS
1. Show that the equation x 3 + y3 = 5z 3 has no solution in rational integers with z#O. 2. Show that there are infinitely many irregular primes of the form 4n + 3 (use p- -l,lems 9 and 10 of Section 8).
8. Bernoulli Numbers
In this section we shall prove those properties of Bernoulli numbers which have been used in the preceding sections. All the power series to be considered converge in some neighborhood of the origin, but their radii of convergence could easily be computed. But we shall not worry about questions of convergence, since for our purposes it suffices to consider all series formally (except in the proof of Theorem 6). Definition. The rational numbers B m (m t
--= 1 +
e' - l
co
L
~
1), defined by
B
~tm
m=lm!'
(8.1)
are called Bernoulli numbers. We use the following notations. Iff(x) = a + ax + ... + ax is a polynomial, then by f(B) we mean the number a o + alB l + ... + anBn . Analogously, if (x, t) is a power series of the form L~=ofn(x)tn, where fn(x) is a polynomial,
3S3
5
Sec.S]
7
then by f(H, t) we mean the series r..';';ofiH)t". Using this notation, the expansion (8.1), which defines the Bernoulli numbers, can be written in the form
S
BERNOUlli NUMBERS
1
t
r
-t - = e
e
Bt
1
-
.
It is easily seen that for any number a
(to prove this it suffices to multiply the series on the left). Theorem 1. The Bernoulli numbers satisfy the recurrence relation for m ~ 2, (I + B)m - B m = 0
(8.2)
which in expanded form becomes 1+
m-I
r.. CmkBk = 0
(m ~ 2)
k;1
(the Cm k are the binomial coefficients). To prove this theorem we write (8.1) in the form t
=
e( 1+ B)t _
eBt.
Comparing the coefficients of the terms t m1m! (m ~ 2), we obtain the relation (8.2). For m = 2 formula (8.2) gives us I + 2HI = 0, and this means that
HI
= -1·
Theorem 2. All Bernoulli numbers with odd index, except HI' equal zero: H2m + 1
=0
for
m
~ 1.
(8.3)
The equality (8.3) is clearly equivalent to the fact that the function t
t
-t--
e-l
+- = 1+ 2
B m ~t
r.. 00
m;2m!
is even, and this is easily verified. We give the values of the first 12 Bernoulli numbers with even index: I' B 6' 4
B =2
1 30'
= --
691 B I2 = - 2730'
6
6'
---no'
B 22 =
1 30'
B = --
7
B I4 =
174611
B 20 = -
1 42'
B =-
B I6 = -
854513 138 '
8
3617 510' B 24 = -
B
10
5 66'
=-
43867 B I8 =
798'
236364091 2730
384
ANALYTIC METHODS
[Chap. 5
Bernoulli numbers are connected with the sums of series of natural numbers. Set
Theorem 3. The sums Sk(n) satisfy the formula (m
- B m+1,
+ l)Sm(n) = (n + B)m+l
m ~ 1,
(8.4)
or in expanded form m
(m
+ l)Sm(n) = L C~+ IBkn m+1-\
m ~ 1
k~O
(B o = 1).
(8.5)
In fact, the expression on the right in (8.4) equals the coefficient of tm + 1/ + I)! in the series e("+B)t - e Bt • On the other hand,
(m
= nt +
("-IL r L m~1 r~1 00
m
m 1 )t + - , = nt m.
+ L 00
m~1
(m
+ l)Sm(n)tm+1
,
(m+1).
which proves the formula (8.4). Note that for n = 1 the formula (8.4) coincides with (8.2). Theorem 4 (von Staudt's Theorem). Let p be a prime and m and even integer. If (p - l),r m, then B m is p-integral (that is, p does not appear in the denominator of B m ). If (p - 1) I m, then pBm is p-integral, and (modp).
We prove Theorem 4 by induction on m, using the relation (m
+ I)Sm(P) =
m-I
(m
+ 1)BmP + L
C~+IBkpm+I-\
k~O
which is obtained from (8.5) by substituting p for n. We write this in the form (8.6) and we shall show that all terms under the summation sign are p-integers which are divisible by p (in the ring of p-integral numbers). The term pBk for k < m is a p-integer by the induction assumption. We consider the terms 1 k m-k - - 1 Cm + 1 p .
m+
(8.7)
385
BERNOULLI NUMBERS
Sec. 8]
If p = 2, then since m + I is odd, this number is a 2-integer and is divisible by 2 (since k < m). If p =1= 2, we write (8.7) in the form _1_ C m+ i - k m-k = m(m - 1) .. · (k + 1) m-k m+l m+i P (m-k+l)! P . The number p occurs in (m - k
[~] +
+ I)! = r!
[;2] + ... < ~ + ;2 + ... =
with exponent
P
~1~~~r -
1 = m - k,
and hence [I/(m - k + I)!]pm-k is a p-integer and is divisible by p. Hence we have shown that pBm is p-integral and that (8.8)
pBm == Sm(P) (mod p) in the ring of p-integral numbers. On the other hand, we have the congruences Sm(P) == -l(mod p)
if (p - 1) I m,
(8.9)
Sm(p)==O(modp)
if(p-l),rm.
(8.10)
Indeed, if(p - I) I m, then X" == I (mod p), for I ~ x ~ p - I, and hence p- 1 p-i Sm(P) = L X m == L 1 = P - 1 == -1 (mod p). x;i
x;i
If (p - I),r m, then, taking 9 to be a primitive root modulo p, we shall have p-i p-2 g(p-i)m_l Sm(P) = L x m == L gmr = m == 0 (mod p), x; 1 r;O g - 1 since gr i == I (mod p) and gm ¢. I (mod p). Comparing (8.8) and (8.10), we see that if (p - I),r m, then pBm == 0 (mod 0), and this means that B m is p-integral. The second assertion of Theorem 4 follows from (8.8) and (8.9). In the case m ~ P - I the number p - I does not divide any number k < m, so that all Bk for k < mare p-integral. Hence every term on the right in (8.6) is divisible by p2, and we have the following assertion. Corollary. If p
=1=
2 and m ~ p - I (m even), then
pBm == Sm(P) (mod p2).
(8.11)
Theorem 5 (Kummer's Congruence). If p is prime and (p - I),r m (m positive and even), then the number Bm/m is a p-integer, and Bm + p -
i
m+p-l
_B- ( mod) = p. m
m
(8.12)
386
ANALYTIC METHODS
[Chap. 5
In other words, the expression Bm/m (for (p - l),r m) has period p - I modulo p. Proof. Consider the function
(8.13) where 9 is a primitive root modulo p, I < 9 < p. We set e t F(t) =
-
I
= u. Then
gt t - - = tG(u) (1+u)9-1 u '
where
g G(u) - (-1-+-u-)-9---1
1 u
g gu
+ '" + u 9
U
It is clear that the numbers Ck are p-integral. We shall show that in the expansion of the function G(u) in powers of t: (8.14) all the coefficients Am are p-integral, and that they have period p - I modulo p (for m > 0). It is clear that if this latter property holds for some collection of series, then it also holds for any linear combination of these series with p-integral coefficients. Hence it suffices to verify it for the functions (e t - l)k. But these functions are in turn linear combinations of the functions err with r ~ 0, and
so by the small Fermat theorem r"+ p-l
==
r" (mod p)
(n > 0).
Hence the functions ert have the desired property, and our assertion about the coefficients Am is proved. Comparing the coefficients in (8.13) and (8.14), we see that
Bm(gm - 1) m!
Am - 1 (m-l)!'
so that
Since gm - 1 1= 0 (mod p) (because (p - l),r m), then the sequence of numbers gm - 1 also has period p - 1 modulo p, by the small Fermat theorem.
Sec.S]
3S7
BERNOULLI NUMBERS
It now follows from what we have proved about the numbers Am that the numbers Bm(m, when (p - 1) %m, are p-integral and have period p - 1 modulo p. Theorem 5 is proved. Theorem 6. The Bernoulli number B 2m is given by the formula B 2m = ( -1)
m_1 2(2m)! (2n)2m (2m),
(8.15)
where (2m) is the value of the Riemann (-function (s) for s = 2m. To prove this we use the expansion of the function l(e t - 1) into partial fractions 1
1
1
+00
-t = - - + L e -1 2 n=-oot-2nin 1
1
2t
00
= - 2" + t + n~l t 2 + (2nn)2 .
(8.16)
This expansion can be derived from the familiar expansion for the cotangent 1 cot z = - + Z
00
L1 Z 2 -
n=
2z
(nn
)2 '
by using the fact that
eiz + e- iz 2i cotz=i. e'z _ e-'z. =i+ e2''z - 1 • It follows from (8.16) that t t 00 t2 -t - - 1 - - + 2 " 2 e - 12 n~l t + (2nn)2 ' and since
L 00
m=l
(_l)m-l ( - t ) 2m , 2nn
then
Comparing this equation with (8.1) and equating the coefficients of the various powers of t, we obtain (8.15).
388
ANALYTIC METHODS
[Chap. 5
From the formula (8.15) we obtain an estimate for the growth of the numbers IE2m l with increasing m. Since '(2m) > 1 and (2m)! > (2m/e)2m (by Stirling's formula), then m)2m IB 2m l > 2 ( ne . In particular, we find that
I~~I-+oo
-+
as m
00.
PROBLEMS
1. Show that (x
+ B)m =
m ~ 1.
(x - 1 - B)m,
2. Show that
(~2 +B)m
=
(_1 -l)Bm. 2m -
1
3. Let p be an odd prime number. Show that
x.f
(p-1)/2 x(p -1)/2
l
== 2
((2)) P2
B(P+1)/2
(mod p).
4. Let p > 3 be a prime number of the form 4k + 3. If h denotes the number of divisor classes of the imaginary quadratic field R(V -p), show that h satisfies the congruence
== -2B(p+1)/2 (modp).
h
5. If p
> 3 is a prime number, show that 1 1 1 +- +- + 2 3
... + p-1 -1- == 0 (mod p 2 ).
6. Prove the formula (kx + B)m = k m-1
t-1( L
'=0
x
+ -S + B)m k
(k and m are natural numbers).
7. The function tan x has the expansion X2n-1
<Xl
tanx=
LT (2n-I)! n
n=l
where
To
n
=
2 2n (2 2n _ 1) IB2n l 2n .
Show that all the coefficients Tn are natural numbers.
,
Sec. 8]
BERNOULLI NUMBERS
389
8. If m > 1, show that 2B 2m
"'"
1 (mod 4).
9. Let q be a prime number such that 2q + 1 is composite [for instance, q "'" 1 (mod 3)]. Show that the numerator of the Bernoulli number B 2 • is divisible by a prime of the form 4n+ 3. 10. Let p" ... , P, be prime numbers greater than 3, and let q be a natural number such that q =< 1 (mod M), where M = (PI - 1) ... (P, - 1). Show that none of the prime numbers p" ... ,P, divide the numerator of the fraction B 2 ./"U[.
Algebraic Supplement
1. Quadratic Forms over Arbitrary Fields of Characteristic #- 2 In this section we describe some of the general properties of quadratic forms over arbitrary fields. We shall state some well-known results without proof. Throughout, K will denote an arbitrary field whose characteristic is not 2. For any matrix A, we shall denote the transpose by A'.
1.1. Equivalence of Quadratic Forms
By a quadratic form over the field K we mean a homogeneous polynomial of degree 2 with coefficients in K. Any quadratic form f can be written n
f=
L
aijXiX j ,
i,j= 1
where aij = a ji • The symmetric matrix A = (aij)
is called the matrix of the quadratic form! If the matrix is given, the quadratic form is completely determined (except for the names of the variables). The determinant d = detA is called the determinant of the quadratic form! If d = 0, the form f is called singular, and otherwise it is called nonsingular. If we let X denote the column vector of the variables XI' X2 , •. , , X n , then the quadratic form can be written f= X'AX. 390
Sec.1]
QUADRATIC FORMS OVER ARBITRARY FIELDS
Suppose we replace the variables according to the formula
XI' .•. , X n by
391
the new variables Yl' ... ,Yn
n
Xi
=
L CijYj
(1 ,;;; i ,;;; n,
Cij E
K).
j=1
In matrix form this linear substitution becomes
X=CY, where Y is the column vector of the variables YI' ... , Yn' and C is the matrix XI' .. , , X n infby the corresponding expressions in YI' ... ,Yn' then (after carrying out the indicated operations) we shall obtain a quadratic form 9 (also over the field K) in the variables YI' ... , Yn' The matrix Al of the quadratic form 9 equals
(cij)' Ifwe replace the variables
(1.1) Two quadratic forms f and 9 are called equivalent, and we write f '" g, if there is a nonsingular change of variables which takes one form to the other. From formula (1.1) we obtain Theorem 1. If two quadratic forms are equivalent, then their determinants differ by a nonzero factor which is a square in K.
Let 'I be an element of K. If there exist elements f(ct lo
.. , ,
ct n )
ct l , ... , ctn
in K for which
= 'I,
then we say that the form f represents y. In other words, a number is represented by a quadratic form if it is the value of the form for some values of the variables. It is easily seen that equivalent quadratic forms represent the same elements of the field K. We shall further say that the formf represents zero in the field K if there exist values ct; E K, not all zero, such that f(ct 1 , ... , ct n ) = 0. The property of representing zero is clearly preserved if we pass to an equivalent form.
ct
Theorem 2. If a quadratic form f in n variables represents an element '# 0, then it is equivalent to a form of the type ctXI
2
+ g(X2'
... ,
x n ),
where 9 is a quadratic form in n - I variables. Regarding the proof of this theorem we note only the following. If f(ct 1 , .. , , ct n) = ct, then not all ct; are equal to zero, so we can find a nonsingular matrix C, whose first row is ct l , .,. , ct n • If we apply to f the linear substitution whose matrix is C, we obtain a form in which the coefficient of the square of the first variable is ct. The rest of the proof is carried out as usual.
392
ALGEBRAIC SUPPLEMENT
If the matrix of a quadratic form is diagonal (that is, if the coefficient of every product of distinct variables equals zero), then we say that the form is diagonal. Theorem 2 now implies Theorem 3. Any quadratic form over K can be put in diagonal form by some nonsingular linear substitution. In other words, every form is equivalent to a diagonal form. In terms of matrices, Theorem 3 shows that for any symmetric matrix A there exists a nonsingular matrix C such that the matrix C'A C is diagonal. 1.2. The Direct Sum of Quadratic Forms Since the names of the variables are not of any significance, we can assume that two given quadratic forms f and 9 have different variables. In this case the form f + 9 is called the direct sum of f and g, and is denoted by f 9 (this must not be confused with the usual addition of quadratic forms when the forms have the same variables). It is clear that if 9 ~ h, thenf 9 ~ f h. We shall now show that the following converse holds.
+
+ +
Theorem 4 (Witt's Theorem). Letf, g, and h be nonsingular quadratic forms over the field K. If the forms f 9 and f h are equivalent, then the forms 9 and h are also equivalent.
+
+
Proof. Let fo be a diagonal form equivalent to f Then, as noted above, f 9 ~ fo 9 and f h ~ fo h, so that fo 9 ~ fo h. Hence we may assume thatfis a diagonal form. It is now easily seen that to prove the theorem it suffices to consider the case f = ax 2 , a -# 0. Let A and B denote the matrices of 9 and h. Since the forms ax 2 9 and ax 2 h are equivalent, there exists a matrix
+
+
+
+
+
+
+
+
such that
y (S'
T')
Q'
(aO A 0) (yT QS) = (a° B' 0)
(Here S is a row matrix and T is a column matrix.) From this equation we obtain (1.2) yaS S'aS
+ T'AQ = 0,
(1.3)
+
(1.4)
Q'AQ = B.
Sec.1]
f
QUADRATIC FORMS OVER ARBITRARY FIELDS
393
We must show that there exists a nonsingular matrix Co such that Co' A Co = B. The matrix Co will be found in the form
CO =Q where the element
~
+ ~TS,
must be suitably chosen. By (I.2) and (1.3) we have
+ ~S'T')A(Q + ~TS) Q'AQ + ~S'T'AQ + ~Q'ATS + eS'T'ATS Q'AQ + a[(1 - y2)e - 2y~]S'S.
Co'ACo = (Q' = =
e e 'I 11
In view of (I.4) the last expression will equal the matrix B, provided that (I - y2)e - 2y~ = I. This equation, which can also be written in the form (y~ + 1)2 = 0, always has a solution ~o E K for any y E K (recall that the characteristic of K is not 2). Hence we have found a matrix Co = Q +~oTS, for which Co' A Co = B. Since the matrix B is nonsingular, then Co is also nonsingular. Theorem 4 is proved.
e-
!.
1.3. Representation of Field Elements e
Theorem 5.. If a nonsingular quadratic form represents zero in the field K, then it also represents all elements of K.
..,
Proof. Since equivalent forms represent the same field elements, it suffices
y
n
:s
:s
Ie
2)
3)
4)
to prove the theorem for a diagonal form f = al Xl 2 + ... + anx/. Let 2 2 al et 1 + .. , + anet n = be a representation of zero, and let y be any element of K. We can assume that etl ;6 0. We express the variables Xl> ... , X n in terms of a new variable t:
°
= et l (1 + t), X k = etk(l - t) Substituting in the form f we obtain f* = f*(t) = 2a l et 1 2 t - 2a 2 et/t - ...
(k
Xl
If we now set t = y/4a l et 1
2
,
= 2,
- 2a net n2 t
... , n).
= 4a l et/t.
we obtain/* = y.
Theorem 6. A nonsingular quadratic formf represents the element y ;6 in K if and only if the form - YXo 2 f represents zero.
+
°
Proof. The necessity of the condition is clear. Assume that
-yet o2
+ f(et l , ... , et n) =
0,
where not all eti equal zero. If et o ;6 0, then y = f(etl/eto,"" etn/et o). If et o = 0, then the form f represents zero, and hence by Theorem 5 it represents alI elements of the field K.
394
ALGEBRAIC SUPPLEMENT
Remark. From the proof of Theorem 6 it is clear that if we determine all representations of zero by the form -yx/ ! (only those in which Xo #are relevant), then we have also determined all representations of y by the form! Hence the question of the representability of an element of the field K by a nonsingular form can be reduced to the question of the representability of zero by a nonsingular form in one more variable.
°
+
Theorem 7. If a nonsingular form! represents zero, then it is equivalent to a form of the following type:
YlY2
+ g(Y3'
, Yn)'
, ct n such thatf(ct l , ... , ct n) = 1. By Theorem 2 we can now put f in the form x 1 2 + fl (X 2, ... ,xn). Since the form x/ fl represents zero, we can find P2' ... , Pn such thatfl(p2' ... Pn) = -1. Again applyin§ Theorem 2, we can put!l in the form -x/ + g(Y3' ... , Yn). Setting Xl - X2 = Yl' and Xl + X2 = Y2' we obtain the desired result.
Proof. Using Theorem 5, we first find ct l ,
+
Remark. If we know some representation of zero by the formj, then all the operations described in the proof of Theorem 7 can be carried out explicitly, and the form g(Y3"'" Yn) can be determined. Now assume that for any quadratic form which represents zero over the field K, an actual representation of zero can be found. Then any nonsingular form can be transformed to a form of the type
YlY2
+ '" + Y2s-lY2s + h(Y2s+1'
... , Yn),
(1.5)
where the form h does not represent zero. In any representation of zero by the form (1.5), at least one of the variables Yl' Y2' ... , Y2s+l' Y2s must be nonzero. To determine all representations of zero in which, say, Yl = ct l #- 0, we note that we can give Y3' .. , ,Yn arbitrary values ct3 , ... , ctn and then determine Y2 by the condition ct l Y2
+ ct3ct4 + ... + g(ct2s+l'
... , ctn) = 0.
This gives us an effective method for finding all representations of zero by a nonsingular quadratic form over the field K, provided that we have a method for determining whether or not a given form represents zero, and, in case it does, an algorithm for finding some specific representation of zero. Theorem 8. Let the field K contain more than five elements. If the diagonal form
represents zero in the field K, then there is a representation of zero in which all the variables take nonzero values.
Sec. 1 ]
395
QUADRATIC FORMS OVER ARBITRARY FIELDS
Proof We first show that if
°
ae
= A #- 0, then for any b #there exist nonzero elements rx and p such that arx 2 + bp2 = A. To prove this fact we consider the identity (t - 1)2 4t ----=+ = 1. (t + 1)2 (t + 1)2
Multiplying this identity by a(
ae = A, we obtain
t 1) 2 c-+at (-2~- )2 =A. t+1 t+1
(1.6)
Choose a nonzero y in K so that the value of t = to = by2/a is not ± 1. This can be done because each of the equations bx 2 - a = and bx 2 + a = has at most two solutions for x in K, and the field K has more than five elements. Setting t = to in (1.6), we obtain
°
°
to - 1)2 (2~y)2 +b - - =..1., to + 1 to + 1
a ( ~--
and our assertion is proved. We can now easily complete the proof of the theorem. If the representation a 1 ~12 + .,. + a.~. 2 = is such that ~1 #- 0, ... , ~r #- 0, ~r+1 = ... = ~. = 0, where r~ 2, then we have shown that we can find rx #- and p#-O such that ar~/ = arrx 2 + ar+ 1P2, and this yields a representation of zero in which the number of nonzero variables is increased by one. Repeating this process, we arrive at a representation in which all the variables have nonzero value.
°
°
1.4. Binary Quadratic Forms
A quadratic form in two variables is called a binary quadratic form. Theorem 9. All nonsingular binary quadratic forms which represent zero in K are equivalent.
Indeed, by Theorem 7, any such form is equivalent to the form Y1Y2 . Theorem 10. In order that the binary quadratic form f with determinant d#-O represents zero in K, it is necessary and sufficient that the element -d be a square in K (that is, -d = rx 2 , rx E K). Proof. The necessity of the condition follows from Theorems 1 and 7. Con-
versely, if f
=
ax 2 + by 2 and -d = -ab
=
rx 2 , thenf(rx, a)
=
arx 2 + ba 2 = 0.
Theorem 11. Letf and 9 be two nonsingular binary quadratic forms over the field K. In order that f and 9 be equivalent, it is necessary and sufficient
396
ALGEBRAIC SUPPLEMENT
that their determinants differ by a factor which is a square in K, and that there exist some nonzero element of K which is represented by both f and g. Proof. Both conditions are clearly necessary. To prove sufficiency, let rx #- 0 be an element of K which is represented by both f and g. By Theorem 2 fand 9 are equivalent to the formsf1 = rxx 2 + fly 2 and gl = rxx 2 + fl'y2. Since rxfl and rxfl' differ by a square factor, then fl' = fJ'/, y E K, and this means that
f1-g1 andf-g·
PROBLEMS
1. Show that a singular quadratic form always represents zero. 2. Show that Theorem 5 does not hold for singular quadratic forms. 3. If the binary form x 2 - cxy2 represents the elements y, and y2 of K, show that it also represents their product. 4. Show that Theorem 8 is not valid for fields with not more than five elements. 5. We shall decompose the set of all nonsingular quadratic forms over Kin n = 0, 1,2, ... variables into the so-called Witt classes. (We treat the zero form as a nonsingular form on the empty set of variables, and consider it to represent zero.) Two forms I, and 12 belong to the same Witt class [Id = [/2], if the corresponding forms h in (1.5) have the same number of variables and are equivalent. We add Witt classes by the formula [Id + [/2] = [I, +/2]' Show that these definitions make sense and that under 1his operation the set of Witt classes becomes a group. 6. Determine the group of Witt classes for the real and complex fields. 7. Show that a quadratic form over a finite field (characteristic oF 2) in three or more variables represents zero.
2. Algebraic Extensions
Many theorems of this section are given without proof. For the proofs the reader may consult, for example, "Modern Algebra," by B. L. van der Waerden, Vol. 1, Chap. 5, Ungar, New York, 1950. 2.1. Finite Extensions
If the field n contains the field k as a subfield, then we say that n is an extension of the field k. To denote that n is being considered as an extension field of k, we write njk. If K is a subfield of n which contains k, that is, k eKe n, then K is called an intermediate field for the extension njk. For any extension njk, we may consider n as a vector space over k. Definition. The extension Kjk is called finite if K, considered as a vector space over k, is finite-dimensional. This dimension is called the degree of the
Sec. 2]
ALGEBRAIC EXTENSIONS
397
extension and is denoted by (K: k). Any basis for K as a vector space over k is called a basis for the extension Klk. If the extension Klk is finite, then for any intermediate field K o , the extensions Kolk and KIKo are clearly both finite. The following converse also holds. Theorem 1. Let K o be an intermediate field for the extension Klk. If the extensions KIKo and Ko/k are finite, then Klk is also finite, and its degree equals the products of the degrees of the extensions KIKo and Ko/k:
(K: k) = (K: Ko)(Ko : k). Proof. Let 01 , ... Om, be a basis for KIKo , and Wi' ... , W n be a basis for Kolk. Then every element of K can be represented as a linear combination (over k) of the products wiO j , so the extension Klk is finite. Further, it is easily checked that these products are linearly independent over k, so (K:k) = mn. For any field k we denote the ring of polynomials in the variable t with coefficients in k by k[t]. Let nlk be an extension of the field k. An element rx En is called algebraic over k; it is the root of some nonzero polynomialf(t) of k[t]. Among all such polynomials we take that polynomial cp(t) "# 0 which is of lowest degree and has leading coefficient 1. Since all polynomialsf(t) which have rx as a root are divisible by cp(t) (otherwise the remainder after division offby cp would be a polynomial of lower degree with rx as a root), then the polynomial cp(t) is uniquely determined. It is called the minimum polynomial of the algebraic element rx over the field k. The minimum polynomial cp E k[t] is always irreducible, since from cp = gh it follows that rx is either a root of g(t) or of h(t). Any element a E k is algebraic over k, and its minimum polynomial is t - a. An element ~ E n which is not algebraic over k is called transcendental over k. The extension nlk is called algebraic if every rx E n is algebraic over k.
:, j :' :1
Theorem 2. Any finite extension is algebraic. i
Theorem 3. Let the element rx of the extension n/k be algebraic over k, and let its minimum polynomial cp(t) E k[t] have degree m. Then the elements 1, rx, ... , rx m - 1 are linearly independent over k and the set of all linear combinations (2.1) with coefficients ai in k is an intermediate field, denoted by k(rx). The extension k(rx)lk is finite of degree m. To add two elements of the field k(rx), written in the form (2.1), we simply add the corresponding coefficients. To put the product of the elements
398
ALGEBRAIC SUPPLEMENT
~ = g(rx) and '1 = h(rx) (g and h are polynomials in k[t] of degree the form (2.1), we must divide gh by cp with remainder:
g(t)h(t) = cp(t)q(t)
~
m - 1) in
+ r(t),
where the degree of r(t) does not exceed m - 1; since cp(rx) = 0, then ~'1 = r(rx). Hence the operation of multiplication in the field k(rx) is determined by the minimum polynomial cp(t) of the element rx. Let rx1' .,. , rxs be a finite set of elements of n, which are algebraic over the field k, and let m 1 , ••• , m s be the degrees of their minimum polynomials over k. The set of all linear combinations of the elements
with coefficients in k is an intermediate field. It is denoted by k(rx1' ... ,rxs ) and is called the field generated by the elements rx1, .,. , rx s ' Its degree over k does not exceed the product m1 ... ms • Any finite extension Kjk, contained in n, can be represented in the form k( rxl> ... , rx.) for some rx1' ... , rx s ' Definition. A finite extension Kjk is called simple if there is an element () such that K = k«()). Any element () E K, for which K = k«()), is called a primitive element of the extension Kjk.
The primitive elements of Kover k are those elements whose minimum polynomial has degree equal to the degree of Kjk. Theorem 4. Let njk and n'jk be two extensions of the field k, and let nand ()' E n' be algebraic elements over k with the same minimum polynomial cp(t). Then there is a unique isomorphism of the field k«()) onto the field k«()') for which () -+ ()' and a -+ a for all a E k. () E
Let m be the degree of the polynomial cp(l). The isomorphism k«()) of Theorem 4 coincides with the mapping ao + a1 ()
+ ... + am + 1 ()m - 1 -+ ao + a1 ()' + ... + am + 1 ()'m - 1
-+
k«()') (2.2)
(a1' ... , am -1 are arbitrary elements of the field k). So far we have considered finite extensions Kjk which are contained in a given extension njk. We now turn to the question of the construction of finite extensions over a fixed field k. Theorem 5. Let k be a field, and let cp(t) be an irreducible polynomial of k[t] of degree n. Then there exists a finite extension Kjk of degree n in which the polynomial cp has a root. The extension Kjk is unique (up to an isomorphism which is the identity map on k). If cp«()) = 0, () E K, then K = k«()).
Sec. 2]
ALGEBRAIC EXTENSIONS
399
The field K (in the case n > 1) is constructed in the following manner. We choose some new object e and consider the set K of all formal linear combinations (2.3) with coefficients in k. If we denote the polynomial ao + alt + ... + an_lt n- 1 by get), then the expression (2.3) can be written gee). Let ~ = gee) and '1 = h(e) be two linear combinations of the type (2.3) (g and h are polynomials in k[t] of degree ~ n - 1). Let s(t) denote the sum g(t) + h(t) and let r(t) denote the remainder after division of the product g(t)h(t) by
+ '1 = see), ~'1 = r(e).
~
It is easily verified that under these operations K is a field with the desired
properties. Corollary. For any polynomial j(t) E k[t] there is a finite extension Kjk in which jet) factors into linear factors.
If k is a field such that the only algebraic extension of k is k itself, then k is called algebraically closed. It is clear that k is algebraically closed if and only if every polynomial in k[t] factors into linear factors. 2.2. Norm and Trace
Let Kjk be a finite extension of degree n. For any (X E K the mapping ~ -> (X~ K) is a linear transformation of K (considered as a vector space over k). The characteristic polynomial j~(t) of this transformation is also called the characteristic polynomial of the element (X E K, relative to the extension Kjk. If W l , .. , , W n is a basis for the extension Kjk and (~E
n
(XW j
=
L aijw
j
(aj j
E
k),
(2.4)
j=l
then faCt)
= det(tE -
(aij»'
where E is the n by n identity matrix. Theorem 6. The characteristic polynomial j~(t) of an element (X E K relative to the extension Kjk is a power of its minimum polynomial
400
ALGEBRAIC SUPPLEMENT
By Theorem 3 the powers, 1, ct, '" , ct m -1 form a basis for the extension k(ct)/k. If 1 , '" , s is a basis for K/k(ct), then we can take for a basis of K/k the products
e
e
The matrix of the linear transformation ~ -> ct~ in this basis will clearly be a block-diagonal matrix, with s blocks down the main diagonal, each block being equal to
o
0
1
o o
o
0
0
o
010
0
1
0
The characteristic polynomial of each block is easily computed to be t m + Cltm-1 + ... + cm = cp~(t). Hencef~ = cp~s, and Theorem 6 is proved. Since when we pass from one basis to another the matrix of a linear transformation is replaced by a similar matrix, then the determinant and trace of the matrix (aij)' defined by (2.4), do not depend on the choice of'the basis Wl' .,. , W n •
Definition. The determinant det(aij) of the matrix (aij) of (2.4) is called the norm, and its trace Sp(ai) = L~= 1 au is called the trace of the element ct E K relative to the extension K/k. The norm and trace are denoted by NK/ict) and SPK/k(ct), or, more briefly, by N(ct) and Sp(ct).
If a E k the matrix of the linear transformation ~ -> a~ (~ E K) will be the diagonal matrix aE. Therefore for every element of k we have N K/k(a) = an, SPK/k(a) = na.
When linear transformations are added or composed, their matrices are added or multiplied (for a fixed basis), and hence for any elements ct and [J of K we have the formulas N K/k(ct[J) = N K/k(ct)NK/k([J), (2.5) SPK/k(ct
+ [J) =
SPK/l ct ) + SPK/k([J).
(2.6)
The matrix of the linear transformation ~ -> act~ (a E k, ~ E K) is obtained from the matrix of the transformation ~ -> ct~ by multiplying all entries by a. Hence we also have the formula (a
E
k, ct
E
K).
(2.7)
Sec. 2]
ALGEBRAIC EXTENSIONS
401
If ct #- 0, then the transformation ~ -> ct~ is nonsingular, and hence the norm NK/k(ct) is also nonzero. Hence we see by (2.5) that the mapping ct -> NK/k(ct) is a homomorphism of the multiplicative group K* of the field K to the multiplicative group k* of the field k. As for the mapping ct -> SPK/k(ct), by ().6) and 11.7) it is a linear function on K with values in the field k. Theorem 7. Let ct E K have characteristic polynomiallr.(t) relative to the extension K/k, and let n/k be an extension in which .t:(t) factors into linear factors:
Then
Proof. If
then a l = -Sp(aij),
an = (-It det(aij)'
On the other hand, it is easily checked that
and this proves the theorem.
Theorem 8. We keep the notations of Theorem 7. If y = g(ct) E K (g(t) E [tJ), then the characteristic polynomial f/t) has the following factorization in
(2.8) Proof. We first note that the coefficients of the polynomial (2.8), being
symmetric expressions in ct l , ... , ct n , belong to the field k. Let cp/t) be the minimum polynomial of y over k. If we apply the isomorphism k(ct) -> k(cti) (in which ct -> cti and a -> a for a E k) to the equation cp(g(ct)) = 0, we obtain cp(g(ct;)) = 0. Hence every root of the polynomial (2.8) is a root of the polynomial cp/t), which is irreducible over k. This is possible only if the polynomial (2.8) is a power of cp/t). Thus, we can see that the t~eorem now follows from Theorem 6. Let k eKe L be a tower of finite extensions. We choose bases wl , ... , w n
ALGEBRAIC SUPPLEMENT
402
and
e
1
, ••. ,
em for the extensions Klk and LIK. For any "I E L set m
ye j
L <XjseS
=
(<X jS E
K),
s=l
<X jsw i
•
L a jsirWr
=
(a jsir E
k).
r=l
Since ywie j
= L a jsirWr e s , S,r
then SPL/K(Y) =
Li,j aiijj'
On the other hand, we also have
SPK/k(SPL/K(Y)) = Hence for any "I
E
SPK/k(~
<X jj )
J
=
~
ajjii •
I,)
L,
SPL/b)
= SPK/k(SPL/K(Y)).
(2.9)
An analogous formula holds for the norm (Problem 2).
2.3. Separable Extensions Definition. A finite extension Klk is called separable if the linear mapping ~ -> SPK/k( ~), ~ E K, is not identically zero.
Since SPK/k(l) = n = (K: k), every finite extension over a field of characteristic zero is separable. The same holds for every extension over a field of characteristic p for which the degree of the extension is not divisible by p. For a finite separable extension Klk we choose a basis W 1 , ... , W. and consider the matrix (2.10) If the determinant of this matrix were zero, then we could find elements C , ... , c. in k, not all zero, for which 1
•
L Cj Sp(wiw j ) = ° j=l Setting "I =
C1W 1
+ .,. + c.w.,
(i
= 1, ... , n).
we can write this last equation (i = 1, ... , n).
(2.11 )
Let ~ be any element of K. Since "I "# 0, then ~ can be represented in the form ~ = a 1 w 1 y + ... + a.w.y, ai E k, and by (2.6), (2.7), and (2.11) we have Sp ~ = O. But this would contradict the separability of Klk. Thus for separable extensions the matrix (2.10) is always nonsingular.
Sec. 2]
ALGEBRAIC EXTENSIONS
403
Definition. The determinant det(Sp(wiw)) is called the discriminant of the basis w!> ... , Wn of the finite separable extension Klk and is denoted by D(w!> ... , w n).
We have shown that the discriminant of any basis of a finite separable extension is a nonzero element of the ground field. Let wt', ... , wn' be any other basis of the extension Klk, and let n
W;' =
L cijw j=l
j
(i = 1, '" , n).
Since the matrix (Sp(w;'w/)) equals the product (cij)(Sp(WiW))(Ci)', then
D(w l ', ... , w n') = (det(ci))2 D(wl' ... , w n).
(2.12)
Thus the discriminants of two different bases differ by a factor which is a square in the ground field. We fix a basis W l , ... ,Wn for the extension Klk. Then for any elements Cl' ... , Cn of k there exists a unique element (X E K such that
Sp(Wi(X) = Ci
(i = 1, ... , n).
(2.13)
(indeed, representing (X in the form (X = XlW l + ... + XnWl (Xi E k) and substituting in (2.13), we obtain a system of n linear equations in the n unknowns X i with nonzero determinant) In particular, we can find elements W *, '" , w/ l in the field K such that for i =j, for i #j.
(2.14)
These n elements are linearly independent over k, since if ClW l * + ... + CnWn* = 0 (c i E k), then, multiplying by Wi and taking the trace we obtain c j =0. Definition. The basis W l *, ... , W n * of the separable extension Klk, which is determined by (2.14), is called the dual basis to the basis W l , ... , Wn .
The dual basis allows us to express the coefficients in
explicitly in terms of (x. Indeed, taking the trace of the product (Xw j *, we obtain
ai = Sp«(Xw i*)
(i= 1, ... ,n).
Assume that the minimum polynomial ({J(t) of the element (X of the separable extension Klk factors completely into linear factors in the extension Qlk:
({J(t) = (t - (Xl) ... (t - (Xm)'
ALGEBRAIC SUPPLEMENT
404
It follows easily from (2.9) that the extension k(l1.)jk is also separable. Since the minimum polynomial cp is also the characteristic polynomial for 11. relative to the extension k(l1.)jk, then by Theorems 7 and 8 k SPk(a)/kl1. =
m
L 11./, s= 1
and hence we have the following expression for the discriminant D(l, 11., '" , = D of the basis 1,11., ... , I1.m - 1 of the extension k(l1.)jk:
I1. m - 1 )
D
= det (
L I1.si + j m
5=1
) O~i,j~m-l
= det(l1. si). det(l1.j) =
Since D #- 0, l1.i #-
I1.j'
n
O~i<j~m-l
(l1. i - l1.y.
and we have proved the following fact.
Theorem 9. The minimum polynomial of any element of a separable extension has no multiple roots (in that field in which it factors into linear factors). Theorem 10. Any finite separable extension Kjk is simple; that is, there exists an element 11. such that K = k(I1.). Theorem 11. Let Kjk be a finite separable extension of degree n. There is an extension njk such that there are precisely n isomorphisms of K into n which are the identity map on k. Denote these isomorphisms by O't> ... , O'n' If 11. is any element of K, then the characteristic polynomial fa(t) has the factorization
in the field
n.
The elements 0'1(11.), ... , O'n(l1.) (which lie in the field n) are called the conjugates of the element 11. E K. The images O'l(K), ... , O'n(K) of the field K under the isomorphisms O'i are called the conjugate fields of the field K. If () is a primitive element of the field Kover k, then it is clear that O'i(K) = k(O'i«())' Corollary 1. Using the above notations we have
N K/k(l1.) = 0'1(11.)0'2(11.)'" O'n(I1.), SPK/k(l1.) = 0'1(11.)
+ 0'2(11.) + ... + O'n(I1.)· ~
Corollary 2. There are precisely n isomorphisms of any finite extension of the rational numbers of degree n into the field of complex numbers.
(
s
Sec. 3]
FINITE FIELDS
405
Let WI' '" , w n be a basis for K/k. Since Sp(wiw) = L~= 1 Us(Wi)Us(W), then the matrix (Sp(wiw)) is the product of the matrices (u;(w))' and (u;(w)). Hence we have the following formula for the discriminant of the basis Wi: (2.15)
PROBLEMS
1. Let n = k(x) be the field of rational functions in x with coefficients from k. Show that any element of n which does not lie in k is transcendental over k. 2. Let k eKe L be a tower of finite extensions. If 8 is any element of L, prove the formula N K/1 (NL/ K(8» = NL/K(G).
[First assume that L for K/k.l
=
K(8), and consider the basis w/8J for L/k, where w/ is a basis
3. Find a primitive element for the extension R(V2, V3) of the field R of rational numbers, and express it in terms of and 4. Show that a finite extension K/k is simple if and only if there are only a finite number of intermediate fields. 5. Let k be any field of characteristic p 'f O. Show that the polynomial 1(/) = IP - 1 - a (a E k) is either irreducible or factors completely into linear factors in k[/]. Further, in the former case show that the extension k(8)/k, where 1(8) = 0, is separable. 6. Let k o be a field of characteristic p 'f 0, and let k = ko(x) be the field of rational functions in x with coefficients from k o . Show that the polynomial 1(/) = IP - x is irreducible in k[/]. Further, show that the extension k(8)/k, where f(8) = 0, is inseparable. 7. Let K/k be a finite extension of degree n. If there exists some extension n/k for which there are n isomorphisms of K imo n which leave every element of k fixed, show that the extension K/k is separable. 8. Let k be a field of characteristic 'f p which contains a primitive pth root of I (that is, an element f: with f:P = I and f:> 'f I for 0 < k < p). If the element Cl: E k is not the pth power of some element of k, show that (k({/~) : k) = p. 9. Let K/k be a finite separable extension and let g; be a linear mapping from K (considered as a vector space over k) to k. Show that there is a unique element Cl: in the field K such that
vi vi
(~E K).
3. Finite Fields A field 1: is called finite if it has only a finite number of elements. The field Zp of residue classes in th~ ring Z of integers modulo a prime number p is an example of a finite field. Every finite field is of finite characteristic and, if the characteristic of the finite field 1: equals p, then this field contains a prime subfield (a subfield not containing any proper subfie]d) which is isomorphic
406
ALGEBRAIC SUPPLEMENT
to Zp- Hence we may assume that Zp C 1:. The extension 1:/Zp is clearly finite. Ifit is of degree m and if WI' ... , W m is a basis for 1:/Zp,then every element ~ E 1: has a unique representation in the form ~ = CIW I + ... + CmW m , where Ci E Zp. Since there are pm such linear combinations, we have proved that the number of elements of any finite field is a power of its characteristic. The multiplicative group 1:* of a finite field 1: is a finite Abelian group. We consider its structure. Lemma 1. A finite subgroup G of the multiplicative group K* of any field K is al'Nays cyclic. Proof. We first show that if an Abelian group G contains elements of
orders m and n, then it contains an element whose order equals the least common multiple r of m and n. Let the elements x and y of G have orders m and n, respectively. If(m, n) = I, then it is easily seen that the product xy has order r = mn. In general, by considering the prir,le factorizations of the numbers m and n, we can find factorizations so that (m o , no) = I and r = mono. The elements x m , and y"' have orders m o and no, and their product xm'y"1 has order r = mono. Now let G be a finite subgroup of order 9 of the multiplicative group of the field K. If m is the maximum of the orders of the elements of G, then clearly m ~ g. On the other hand, it follows from what we have just shown that the order of every element divides m; that is, every element of the group G is a root of the polynomial t m - I. Since a polynomial of degree m can have at most m roots, 9 ~ m. Hence 9 = m, and this means that G is cyclic. Applying this lemma to the case of finite fields, we obtain the following fact. Theorem 1. The multiplicative group of a finite field which has pm elements is a cyclic group of order pm - I.
e t s tl a n
a;
oj
t .
fie fo dl
Corollary. Any finite extension of a finite field is simple.
Indeed, if 8 is a generating element of the group 1:*, then it is clear that Zi8) = 1:. Hence for any intermediate field 1: 0 we have 1: 0 (8) = 1:. It also follows from Theorem I that all elements of 1: are roots of the polynomialJP~ - t, and since the degree of this polynomial equals the number of elements in 1:, then in the ring 1:[t] we have the factorization t P'"
-
/
= Il (I ~eI:
(~
runs through all elements of the field 1:).
- ~)
fin
be ax ele an~
fiel
Sec. 3]
rly :nt
~re
FINITE FIELDS
407
Theorem 2. For any prime number p and any natural number m there exists one and only one (up to isomorphism) finite field with pm elements.
:he
Proof. By the corollary to Theorem 5 of Section 2 there is an extension Q/Z in which the polynomial t pm - t factors into linear factors. Let 1: denote the :et of all roots of this polynomial (in Q). Since in any field of characteristic
LIp.
p the formula
K
of st m as he
10
)f Iy
Ie
a t
holds, then the sum and difference of any two elements of 1: also belongs to L It is clear that the set 1: is closed under the operations of multiplication and division (except division by zero). Hence 1: is a subfield of the field Q. The pm polynomialt pm - thasnomultipleroots(sinceitsderivativepm t - 1 - I = -I never vanishes) and hence 1: consists of pm elements. The existence of a finite field with pm elements is proved. Let 1: and 1:' be two extensions of Zp of degree m. Choose a primitive element 0 in 1: (corollary of Theorem I) and denote its minimum polynomial by cp(t). Since cp(t) divides the polynomial t pm - t, and the latter polynomial splits into linear factors over L', then ({J(t) has a root (J' E L'. The degree of the extension Zp(()')/Zp equals the degree of the polynomial cp(t), that is, m, and therefore Zp(()') = 1:'. The existence of an isomorphism of 1: onto 1:' now follows from Theorem 4 of Section 2. The finite field with pm elements is often denoted by GF(pm) (and finite fields are often called Galois fields).
Corollary. For every natural number n there is an irreducible polynomial of degree n over the finite field Lo = GF(p'). Indeed, p' - I divides pn, - I, so that the set of all roots of the polynomial t - t in the field 1: = GF(p,n) forms a subfield which is isomorphic to the field 1: 0 , Hence we can assume that 1: 0 c 1:. If () E 1: is a primitive element for the extension 1:/1: 0 , then the minimum polynomial of () will be an irre-
ducible polynomial in 1: o[t] of degree n, since . ~ ) _ (1: : Zp) _ rn _ ( 1: . ~o - - - n. (1: 0 : Zp) r
f
In conclusion we note that in order to show that a given finite commutative ring is a field, it suffices to show that it has no divisors of zero. Indeed, let 0 be a finite ring without zero divisors, and let a be a nonzero element of O. If aXl = aX2' then a(x 1 - X2) = 0, and Xl = X2' Thus as X runs through all elements of the (finite) ring 0, ax also takes on every value in O. Then for any b in 0 the equation ax = b is solvable in 0, and this means that 0 is a field.
408
ALGEBRAIC SUPPLEMENT
PROBLEMS
1. Let r(m) denote the number of distinct irreducible polynomials of degree m with leading coefficient 1 in the ring Zp[t]. Show that r(m) =
.!. L JJ.(~)pd
m
dl'"
d
JJ. denotes the Mobius function). 2. Find all irreducible polynomials of degree 2 over the field Z, = GF(5). 3. Show that the field GF(p'") is contained in the field GF(P") (up to isomorphism) if and only if min. (d runs through all divisors of m, and
4. What is the degree over Zp of the splitting field of the polynomial t" - 1? 5. Let ~ = GF(p'"). Show that each mapping a, : g--'" g-', g E ~ (i = 0, I, ... , m - I) is an automorphism of the field ~ and show that every automorphism of ~ coincides with one and only one of the a,. 6. Let ~o = GF(p') and let ~ be a finite extension of ~o of degree n. Show that each mapping g --",gP" (i = 0, I, ... , n - I) is an automorphism of the field ~ which leaves every element of ~o fixed. Further, show that these n automorphisms are distinct and that every automorphism of ~ which is the identity on ~o coincides with one of them. Let f(t) be the characteristic polynomial of the element g E ~ relative to the extension ~/~o. Show that we have the factorization h(t) = (t - g)(t - g.) .. ; (t - en-I),
in the field ~, where q
=
p' (use Theorem 8 of Section 2). Deduce that
7. Show that a finite extension of a finite field is always separable. 8. Using the above notations, show that every element of the field ~o is the norm of Sl'me element of ~. 9. Let ~ = GF(q'), where p'" = q, and let IX E~. Show that the equation is solvable in ~ if and only if IX + IX· + ... + IX.' - I = O.
e- g
=
IX
'0. Let e be a primitive pth root of 1 (over the field of rational numbers). Let ~o = GF(p) and ~ = GF(p'"). Since the elements of the field ~o can be considered as residue classes of integers modulo the prime p, then the expression eSP!Y) makes sense for any y E ~ (the trace is taken relative to the extension ~/~o). Show that for
IX
'f 0,
for
IX
=
O.
11. Let X be a character of the multiplicative group of the field ~ = GF(p'"), and set p'" = q (for the definition of a character, see Section 5). Extend X to the Whole field ~ by setting X(O) = O. The expression (IX E
~),
Sec. 3]
FINITE FIELDS
409
which is a complex number, is called the Gaussian sum of the finite field 1:. Assuming that the character X is not the unit character, prove the formulas 0:=1-0,
!T'<X)!
=yi
2 T,<X) =
0:=1-0,
O.
• ;<0
12. Let p=l- 2. Then the set of all squares in the multiplicative group 1:. of the field GF(pm) is a subgroup of index 2. If we set if1(0:) = + 1 if 0: =I- 0 is a square, and if1(0:) = -1 otherwise, we obtain a character of the group 1:•. Show that if 0:f3=1- 0, then
1: =
13. Show that for 0: =I- 0 ") if1(g>- 0:)=-1.
~~
14. Let f(xlo ... , x.) be a nonsingular quadratic form with determinant 8 and with coefficients in 1: = GF(pm), where p =I- 2 and we set pm = q. Let 0: be any element of 1: and let N denote the number of solutions in 1: of the equation
f(Xlo ... , x.) = 0:.
Show that N satisfies the formulas ifn=2r+l,
N=q2"+q"if1«-IY0:8) N = q2" -1
+ wq" -1if1«
-ly8
if n = 2r,
where w = -1 if 0: =I- 0 and w = q - 1 if 0: = O. 15. Let p and q be distinct odd primes. If x is an integer, we shall also use the letter x to denote the corresponding residue classes in the fields GF(p) and GF(q). Let 11 be an extension of GF(q) in which the polynomial t P - 1 splits into linear factors, and let E denote a primitive pth root of 1 contained in 11. The Legendre symbol (x/p) clearly coincides with the character if1(x) of the field GF(p) which was defined in Problem 12. Since it takes the values ± 1, we may assume that (x/p) E 11. Show that the" Gaussian sum"
T=2
"'oGI'(P)
(x)
-ExEI1
p
of the field GF(p) satisfies the equations (1) (2) 16. Use the representation of the Legendre symbol (p/q) = p(O-1l/2 in the field GF(q) and formulas (1) and (2) to prove the Gaussian reciprocity law:
(_I)-[(P-1)/211(O-1)/21(~) =
(;).
410
ALGEBR AIC SUPPLE MENT
4. Some Result s on Comm utativ e Rings
Throug hout this section the word ring will mean a commu tative ring with unit elemen t I and withou t divisor s of zero (that is, an integra l domain ).
4.1. Divisibility in Rings Let .0 be a ring, and let rx. and P¥-O be two elemen ts of .0. If there exists an elemen t ¢ E .0 such that P¢ = rx., then we say that rx. is divisible by P (or that P divides rx.), and we write Plrx.. Since .0 contain s no divisors of zero, there is at most one elemen t ¢ such that rx. = p¢. The concep t of divisibility in an arbitra ry ring clearly possesses all the usual propert ies of divisibi lity in the ring of rationa l integers. For exampl e, if riP and Plrx., then rlrx.. An elemen t e E .0 which divides the unit elemen t I, is called a unit of the ring .0 (or an invertible element). Theor em 1. The units of the ring .0 form a group under multipl ication . Proof. Let E be the set of all units of the ring .0. If e E E and I] E E, then ee' = 1 and el] = I for some e' and 1]' of .0. But then el](e'I]') = 1, and this means that el] E E. Since 1 E E, and if e E E, then e' E E, where ee' = I, we have verified that E i"s a group under multipl ication, and this proves the theorem .
Elemen ts rx. ¥- 0 and p¥-O of the ring .0 are called associate if they divide each other. From rx. = P¢ and P = rx.1] (¢ E .0, I] E .0) it follows that rx. = rx.¢1] and hence I = ¢I]. Thus two nonzer o elemen ts of .0 are associa te if one is a unit multipl e of the other. Let J1 be a nonzer o elemen t of the ring .0 which is not a unit. We shall say that the elemen ts rx. and Pof .0 are congru ent modulo J1 and write rx. == P(mod J1), if the difference rx. - Pis divisible by J1. All the usual propert ies of congru ences in the ring of integer s also hold for congru ences in the ring .0. For any rx. E .0 we denote by iX the set of all elemen ts of .0 which are congru ent to rx. modulo J1. The set iX is called a residue class modulo J1. We clearly have iX = P if and only if rx. == P (mod J1). We can define the sum and produc t of two residue classes modulo J1 by setting iX
+ P = rx. + P,
iXP
= rx. p.
It is easily checked that these definitions do not depend on the choice of the represe ntatives (residues) rx. and p. It is also easily verified that under these operati ons the set of all residue classes modulo J1 become s a commu tative ring with unit elemen t I (but possibly with divisor s of zero). It is called the ring of residue classes modulo J1.
vith
an tfl at ry of
Ie
Sec .4]
SOME RESULTS ON COM MUT ATIV E RINGS
411
If in each residue class mod ulo J1 we choose a representative, then the set S of all representatives is called a complete syst em of residues mod ulo J1. A complete system S of residues is clearly characterized by the pro pert y that every element of.o is con grue nt mod ulo J1 to one and only one element of S. 4.2. Ideals
A subset A of the ring .0 is called an ideal if it is a sub grou p of the add itive grou p of .0, and if for any a E A and any ~ E .0 the pro duc t ~alies in A. The subset consisting only of zero and the entire ring are trivially exam ples of ideals. The first of these ideals is call ed the zero ideal, the second the unit ideal. Let aI' ... , am be any elements of the ring .0. It is clear that the set A of all linear com bina tion s ~lal + ... + ~mam of these elements with coef ficie nts ~i in .0 is an ideal in .0. It is called the ideal gen erat ed by the elem ents al' .,. , am' and is den oted by A = (a , ... , am)' The elements al' ... , am are called gen l erat ors of the ideal A. In general, not ever y ideal has a finite system of generat ors. An ideal A is called prin cipa l if it has a system of generators consisting of a sing le element, that is, ifit has the form A = (a). A nonzero principal ideal (a) consists of all elements of A which are divi sible by a. The zero and unit ideals are both principal. The zero ideal is generat ed by the zero element, and the unit ideal is generated by any unit e of the ring .0. Two principal ideals (a) and (fl) are equal if and only if the elements a and fl are associate. Let A and B be two ideals of the ring .o. The set of all elements ~ E .0 whi ch can be represented in the form ~
= alfll + .,. + asfl.,
where ai E A and fli E fl (s ~ 1), is also an ideal in .0. This ideal is call ed the prod uct of the ideals A and B, and is denoted by AB. Since multiplicati on of ideals is com mut ativ e and associat ive, then the set of all ideals of the ring .0 is a com mut ativ e semigroup und er the ope rati on of multiplication. Two elements a and fl of .0 are said to be congruent mod ulo the ideal A, and we write a == fl (mo d A), if thei r difference a - fllie s in A, that is, if a and fl belong to the same coset of the additive grou p A. If Yden otes the cose t of A which contains y, then we hav e ii = ~ if and only if a == fl (mod A). For the principal ideal (}l), the concep t of congruence mod ulo (J1) coincide s with that mod ulo the element J1. Con side r the factor gro up .0 /A of the add itive grou p of the ring .0. When the sub grou p A is an ideal, then we can define multiplication in .o/A. Namely, for ii and ~ in .o/A we set ii~
= afl.
If ii = ii l and ~ = ~l' then since alfll - afl = a l (fll - fl) + fl(a - a), we have l afl == alfll (mo d A). This means that the pro duc t ii~ does not dep end on the
f
I f,
412
ALGEBRAIC SUPPLEMENT
choice of the representatives IX and P (it is essential here that A is an ideal). It is easily verified that under this definition the factor group DIA becomes a ring. The ring DIA is called the factor ring of the ring D by the ideal A. For a principal ideal (p) the factor ring D/(p) coincides with the ring of residue classes modulo p. 4.3. Integral Elements
Any ring 0 (commutative and without zero divisors) can be embedded in a field. To show this we consider the set of all formal fractions alb, where a and b are elements of 0 and b =1= O. Two fractions alb and eld are called equal if and only if ad = be. Addition and multiplication are defined by the formulas a
e
ad
b+ d= a e b d
+ be bd
ae bd
It is easily verified that these operations are compatible with the notion of equality, and that with these operations the set of all fractions alb becomes a field. We denote this field by k o · Ifwe identify each fraction all = aele (e =1= 0) with the element a E 0, then 0 will be a subring of the field k o . Hence every element of k o is the quotient of two elements of o. Now let n be any field which contains 0 as a subring. Let k be the set of all quotients alb, where a and b lie in 0 (b =1= 0). Clearly, k is a subfield of the field n. This subfield is called the quotient field of o. It is easily checked that the field k is isomorphic to the field k o constructed above, and hence that it is uniquely determined by the ring 0 (up to isomorphism). Definition. Let the ring 0 be contained in the field n. An element IX E n is called integral over 0, if it is the root of a polynomial with coefficients in 0 and with leading coefficient 1.
Since any element a E 0 is the root of the polynomial t - a, then every element of 0 is integral over o. Let WI' '" , W m be arbitrary element of n. The set M of all linear combinations a 1WI + ... + amWm with coefficients a l E 0 is called a finitely generated o-module in n, and the elements WI' ••. , W m are called generators of the o-module M. Since 1 EO, then all the WI are contained in M. Lemma 1. If the finitely generatedo-module M is a ring, then all its elements are integral over o.
Sec. 4]
SOME RESULTS ON COMMUTATIVE RINGS
413
Proof. We can of course assume that not all w j are zero. Let ex be any element of M. Since for any i the product exw j belongs to M, then m
exw j =
L aijW j=l
j '
a lj
(i
E 0
= 1, ... , m).
= 0 (E is the unit matrix). Hence the element ex is a root of the polynomial f(t) = det(tE - (ill) which has all coefficients in 0 and has leading coefficient 1, and this proves the lemma.
It follows that det(exE - (a j )
Theorem 2. The set of all elements of
n
which are integral over
0
is a
ring. Proof. We must verify that the sum, difference, and product of two integral
elements of the field polynomials
where
aj
n are again integral over o. If ex and p are the roots of the
and bj are elements of 0, then
am
= al + a2ex + ... + amexm- l ,
p'
= bl = b 2P + ... + b.p·-l.
It easily follows that the o-module which consists of all linear combinations of
the products (0 ~ i< m, 0 ~j< n)
(4.1)
with coefficients in 0, is a ring (since any product with k ~ 0 and I ~ 0 can be expressed as a linear combination of the elements exkpl with coefficients in 0). By Lemma 1 all elements of this ring are integral over 0; in particular, this will hold for ex ± Pand exp. Theorem 2 is proved. Definition. Let 0 be a subring of the field n, and let .0 be the set (which is a ring by Theorem 2) of all elements of n which are integral over o. The ring .0 is called the integral closure of the ring 0 in the field n. Definition. A subring Do of a field K is called integrally closed in K if its integral closure in K coincides with Do. Definition. A ring .0 is called integrally closed if it is integrally closed in its quotient field k. Theorem 3. Let 0 be a subring of the field n, and let .0 be the integral closure of 0 in n. Then the ring .0 is integrally closed in n.
414
ALGEBRAIC SUPPLEMENT
Proof. Let 0 be any element of n which is integral over .0, so that on
=
1X1
+ 1X20 + ... + IXnOn-I,
(4.2)
where all IX; lie in .0. We must show that 0 E .0. For each i = 1, '" , n there is an integer m for which m,
cx., = "L. aip; j - l mj
(au EO)
(4.3)
j=l
(since IX; is integral over 0). Consider the o-module M which is generated by the products IX/''''IX/"Ok
(O~k;<m;,O~k
(4.4)
It easily follows from (4.2) and (4.3) that any product lXii, ••• IXn'nO I with nonnegative exponents can be expressed as a linear combination of the elements (4.4) with coefficients in 0, and this means that the module M is a ring. By Lemma 1 every element of M is integral over o. In particular, 0 is integral over 0 and this proves the theorem. Lemma 2. Let 0 be an integrally closed ring with quotient field k, and let f(t) E o[t] be a polynomial with leading coefficient 1. If the polynomial cp(t) E o[t] dividesf(t) and has leading coefficient 1, then cp(t) E O[t]. Proof. Let njk be an extension in which the polynomial f(t) factors into
linear factors. If.o is the integral closure of 0 in n, then every root of f(t) clearly lies in.o, and this will also be true for all roots of cp(t). From the identity cp(t) = (t - Yl) ... (t - Ys), it follows that all coefficients of cp(t) also lie in .0, and since .0 n k = 0 (as 0 is integrally closed), then these coefficients lie in 0, which proves the lemma. The following fact is an obvious consequence of Lemma 2. Theorem 4. Let 0 be an integrally closed ring with quotient field k, and let njk be an algebraic extension of the field k. In order that the element IX E n be integral over 0, it is necessary and sufficient that all coefficients of its minimum polynomial lie in o.
PROBLEMS
1. An ideal A of the ring ,0 is called maximal if A #- ,0 and the only ideal of ,0 which properly contains A is the unit ideal D. Show that the ideal A is maximal if and only if the factor ring ,oj A is a field. 2. If 0 is an integrally closed ring, show that the polynomial ring 0[11 is also integrally closed.
Sec. 5]
CHARACTERS
415
5. Characters (4.2)
ere is
In this section we describe some properties of characters of finite Abelian groups and numerical characters.
(4.3)
5.1. The Structure of Finite Abelian Groups
:d by
The structure of finite Abelian groups is determined by the following theorem (see, for instance, M. Hall, "The Theory of Groups," Macmillan, New York, (I 959)).
(4.4)
nonnents g. By tegral
nd let omial
·s into )f f(t) lentity in .0, e in 0,
k, and t
IX E
n
of its
Theorem 1. Every finite Abelian group can be represented as the direct product of cyclic subgroups. By Problems I and 2 a finite cyclic group cannot be represented as the direct product of proper subgroups if and only if its order is a power of a prime. Therefore a finite Abelian group G can be represented as a direct product G = Al X ... x As of cyclic subgroups Ai of prime power order. This representation is not, in general, unique. But the orders of the cyclic subgroups of prime power order are uniquely determined by G. These orders (which are powers of prime numbers) are called the inrariants of the finite Abelian group G. The product of all invariants of G clearly equals the order of G.
5.2. Characters of Finite Abelian Groups Definition. A homomorphism of the finite Abelian group G into the multiplicative group of the field of complex numbers is called a character of the group G. In other words, a character of G is a nonzero complex-valued function X on G for which x(xy)
= X(x)X(y)
(5.1)
for any x and y of G. Since any homomorphism of groups takes the unit onto the unit, then x(I) = I. If the element x E G has order k, then
(X(X))k = X(x k) = x(I) = I;
o which j
only if tegrally
(5.2)
that is, X(x) is a kth root of I. If m is the maximum of the orders of the elements of G, then by Problem 3 the order of every element of G divides m. Hence X(x) is an mth root of I for all x E g, and this means that a character could also be defined as a homomorphism from G to the group of all mth roots of I.
416
ALGEBRAIC SUPPLEMENT
We represent G as a direct product of cyclic groups: G
Since every element x
E
= {ad x ... x {as}.
G can be represented in the form x
= a/I
... a/',
(5.3)
then by (5.1), x(x)
= x(al)k, ... x(as)k.,
so the character X is completely determined by the values x(al), ... , x(as)' If = 1, ... ,s let 8/ be any mjth root of 1, and for any x E G of the form (5.3) set
aj has order m, then by (5.2)x(a j) is an mjth root of 1. Conversely, for i
(5.4) It is easily seen that the value of (5.4) is independent of the choice of the exponents k j (which are only defined modulo m;), and that this function is a character of the group G. Each root 8 t can be chosen in m j ways, so there are m l .• , m s distinct functions of the type (5.4). Hence we have the following theorem. Theorem 2. The number of characters of a finite Abelian group equals the order of the group.
We shall define a multiplication for characters. If X and X' are characters of the group G, set (xx')(x)
= X(x)x'(x)
(x
E
G).
It is clear that the function X also is a character of the group G. The character Xo, for which Xo(x) = I for all x E G, is called the unit character. It is clear that XXo = X for any character X. If for a character X of the group G we set i(x) = x(x)
(x E G),
where X(x) is the complex conjugate of the number X(x), then the function i also will be a character of the group G, and xi = Xo. Since multiplication of characters is clearly associative, then the set of all characters of a finite Abelian group is a group under the operation of multiplication. Let G = {a} be a cyclic group of order m and let 8 be a fixed primitive mth root of 1. Let X be that character of the group G for which x(a) = 8 (and hence k x(ak) = e ). Since lea) = er , then the characters Xo = Xm , X, X2 , •.• , Xm - l are pairwise-distinct, and hence exhaust the characters of the group G. Hence we have shown that the character group of a cyclic group is cyclic. In the general
Sec. 5]
CHARACTERS
417
case it is easy to prove the following theorem: Any finite Abelian group is isomorphic to its character group. Let G be an Abelian group of order n and let H be a subgroup of order m. Ifwe restrict a character of G to H, we clearly obtain a character ofthe group H. We denote this character by X. It is clear that the mapping X --+ ~ is a homomorphism from the character group X of G to the character group Y of H. Let A be the kernel of this map. The characters of A are characterized by having X(z) = 1 for all z E H. If X E A and x and x' lie in the same coset of H in G, then X(x) = X(x'). Setting i(x) = X(x), where X E A and x is the coset of H in G which contains x, we obtain a function i on the factor group GfH, which is a character of this group. Conversely, if l/J is any character of the factor group GfH, then by setting X(x) = l/J(x)
(x E G),
we obtain a character X E A, for which ~ = l/J. Since under the mapping X --+ i distinct characters in A go to distinct characters of the group GfH, we have shown that the number of characters contained in A equals the number of characters of the factor group GfH, that is, equals nfm (Theorem 2). Hence the image of the group X under the homomorphism X --+ ~ must have order nf(nfm) = m, and since by Theorem 2 the group Yhas order m, then the image coincides with Y. This means that every character of the group H is of the form ~ for some character X of the group G. It is clear that the number of characters X E X which induce a given character of H equals nfm = (G: H). We have proved the following theorem. Theorem 3. If G is a finite Abelian group and H is a subgroup, then any character of the group H can be extended to a character of the group G, and the number of such extensions equals the index (G : H). Corollary 1. If x is any nonunit element of the group G, then there exists a character X of the group G for which X(x) =1= 1. Indeed, consider the cyclic group {x} = H. Since Hhas order greater than 1, there is a nonunit character X' of H, and X'(x) =1= 1. Extending X' to a character of the group G, we obtain the desired character X. Corollary 2. If the element x of the group G is not contained in the subgroup H, then there is a character X of the group G such that X(x) =1= 1 and X(z) = 1 for all z E H. Indeed, the unit character of the group H can be extended to a nonunit character of the subgroup {x, H}, which in turn can be extended to a character of the group G.
; i
418
ALGEBRAIC SUPPLEMENT
We now consider some relations between the values of characters. If Xo is e G Xo(X) = n, the unit character, then Xo(x) = I for all x E G, and hence where n is the order of the group G. Assume that the character X is not the unit character, so that X(z) ¥- I for some z E G. If x runs through all elements of the group G, then zx also runs through all elements of G. Setting s = e G x(x), we have S = L x(zx) = x(z)S.
Lx
Lx
xeG
Since X(z) ¥- I, we must have S = O. Thus we have the formula
L l.(x) = {n xeG
0
if X = Xo,
(5.5)
if X¥- Xo·
The value of any character X on the unit element of the group equals I, hence x(1) = n (here X runs through all characters of the group G). We set T= x(x). By the first corollary to Theorem 3 there is a character X' for which X'(x) ¥- I (if x ¥- I). As X runs through the character group of G, so does Xx'. Therefore,
Lx Lx
T
= L (X'X)(x) = L X'(x)X(X) = X'(x)T, x
x
and since x'(x) ¥- I, then T
= O.
We have proved the formula
L X(x) = {n x
0
if x = 1, if x¥- 1.
(5.6)
5.3. Numerical Characters For any natural number m let Gm denote the group under multiplication of all residue classes modulo m which consists of all residue classes of numbers relatively prime to m. The residue class modulo m which contains a will be denoted by ii. Every character X of the group Gm can be associated to a function X* on rational integers relatively prime to m by setting
x*(a) = x(a). We extend this function to all rational integers by setting x*(a) = 0 if a and m are not relatively prime. Such a function X* (defined on all rational integers) is called a numerical character modulo m. In the future we shall denote X* by the same symbol, X, as used for the corresponding character of the group G m • It is clear that distinct characters of the group Gm correspond to distinct numerical characters modulo m, and that the number of numerical characters modulo m equal cp(m).
CHARACTERS
Sec. 5]
419
The following properties of numerical characters easily follow from the definition. (1) For any rational integer a the value of x(a) is a complex number which is zero if and only if a and m are not relatively prime. (2) If a == a' (mod m), then x(a) = x(a'). (3) For any rational integers a and b we have x(ab) = x(a)x(b). We shall show that numerical characters are completely characterized by these three properties. Let '1 be a function which satisfies (1) to (3). For any class ii E Gm , (a, m) = 1, we set X(ii) = '1(0). By (2) the value of X(ii) does not depend on the choice of a, and by (I) it is nonzero. Also, if (a, m) = 1 and (b, m) = 1, then by condition (3) X(ii 5)
=
x(ab)
= '1(ab) = '1(a)'1(b) =
x(ii)x(li).
Hence X is a character of the group Gm , and the corresponding numerical character coincides with the function '1. Let m' be a natural number which is divisible by m. From any character X modulo m we can form a character x' modulo m'. Namely, if a is relatively prime to m' (and hence also to m), set x'(a) = x(a); if (a, m') > 1, setx'(a) = O. The function X' satisfies conditions (I) to (3), and hence is a numerical character modulo m'. We shall say that X' is induced by the character X. Definition. Let Xbe a numerical character modulo m. If there is a proper divisor d of the number m and a character Xl modulo d such that Xl induces X, then the character is called nonprimitive; otherwise it is called primitive. Theorem 4. In order that the character X modulo m be primitive, it is necessary and sufficient that for any proper divisor d of the number m, there be a number x which is congruent to 1 modulo d and relatively prime to m, such that X(x) ¥- 1. Proof. If the character Xis nonprimitive, then it is induced by some character Xl modulo d, where d is a proper divisor of m. This means that for any x
which is relatively prime to m we have X(x) = XI(X). If x == 1 (mod d), then = Xl(X) = XI(l) = 1. Conversely, assume that for some proper divisor d of the number m we have X(x) = 1 for any x which is relatively prime to m and congruent to 1 modulo d. For any a which is relatively prime to d, we can find a number a', for which (a', m) = 1 and a' == a (mod d). Set X(x)
Xl (a)
=
x(a').
We claim that the value of Xl (a) does not depend on the choice of a'. Indeed,
i
420
ALGEBRAIC SUPPLEMENT
if a' == a" (mod d), where a" is also relatively prime to m, then a" == xa' (mod m) for some x relatively prime to m (since a' and a" are both relatively prime to m). Since x == I (mod d), by the conditions of the theorem we have X(x) = I, and then x(a") = x(x)x(a') = x(a'). By setting Xl(a) = 0 when (a, d) =1= I, we obtain a function Xl which is easily seen to be a numerical character modulo d. Since x(a) = Xl(a) for (a, m) = I, then X is induced by the character Xl' The theorem is proved.
PROBLEMS
1. If a finite cyclic group has prime power order, show that it is not the direct product of proper subgroups. 2. Let G be a finite cyclic group whose order is the product of the relatively prime numbers k and I. Show that G is the direct product of two subgroups of orders k and I. 3. Let a be an element of maximum order in the finite Abelian group G. Show that the cyclic subgroup {a} is a direct factor of G. 4. Let k be a natural number. Show that the element x of the finite Abelian group G is a kth power in G if and only if X(x) = 1 for all characters X of the group G for which
X' =
Xo.
5. Let G be a finite Abelian group of order n. Write in any order the elements Xl, and the characters Xl> ... , X•• Show that the matrix
••• ,
x.
is unitary. 6. Let m10 •• , , ml be pairwise relatively prime natural numbers and let m = ml ... m l • Show that any numerical character modulo m has a unique representation as a product of characters XI modulo ml (i = 1, ... , k), that is, that
for any rational integer a. [For any i the character XI is defined by XI(a) = x(a'), where a' is determined by the congruences a' == a (mod m,), a' == 1 (mod mimI)'] 7. If the character X of Problem 6 is primitive, show that the characters X10 ... , Xl are also primitive. 8. Let d l and d 2 be divisors of the natural number m and letd = (d 1o d 2 ). If the character X modulo m is induced by some character modulo d and also induced by some character modulo d, show that it is induced by some character modulo d. 9. Show that every character modulo m is induced by a unique primitive character. The modulus f of this primitive character is called the fundamental modulus of the character X. 10. Show that the number of primitive characters modulo m equals
Sec. 5]
CHARACTERS
421
(d runs through all divisors of the number m; p. is the Mobius function; f{! is Euler's
function). 11. Show that there exist primitive characters modulo m if and only if m is either odd or is divisible by 4. 12. Let ff be the vector space over the complex numbers which consists of all complexvalued functions on the finite Abelian group G. For each element w EG let T", denote the shift operator, defined by the formula (T",f)(a) = f(wa). Show that every character of the group G is an eigenvector of the operator T",. What are the corresponding eigenvalues? 13. We keep the notations of the preceding problem and consider, for some fixed fE ff, the square matrix where a and '1' run through all elements of the group G, arranged in some order. Show that the determinant of this matrix is
(a runs through all elements and X through all characters of the group G).
[Hint: The matrix A is the matrix of the operator T= consists of the functions la' where
L'" f(w)T", in the basis which
for a=T, for a i' T. Find the eigenvalues of the operator T.] 14. Prove the assertion of Problem 13 by considering the determinant of the product of the matrices (X(a»~.a and A.
Tables TABLE 1 h,
TIlE NUMBER OF DIVISOR CLASSES, AND
THE REAL QUADRATIC FIELDS THE NORM WHEN
R(Vd),
E, TIlE FUNDAMENTAL UNIT GREATER THAN I,FOR
d
WHERE
IS A SQUARE-FREE INTEGER,
2,,;; d,,;; 101.
N(E) IS ALSO GIVEN. HERE w=(l + yd)/2 WHEN d=1 (MOD 4), AND w=yd
d==22, 3 (MOD 4) d
h
2 3 5 6 7 10 11 13 14 15 17 19 21 22 23 26 29 30 31 33 34 35 37 38 39 41 42 43 46 47 51
1 1 1 1 1 2 1 1 1 2 1 1 1 1 1 2 1 2 1 1 2 2 1 1 2 1 2 1 1 1 2
E
l+w 2+w w 5 +2w 8 + 3w 3+w 10+ 3w l+w 15 +4w 4+w
3 + 2w 170+ 39w 2+w
197 + 42w 24+ 5w 5+w 2+w 11 +2w 1520 + 273w 19 + 8w 35+6w 6+w 5 +2w 37+6w 25 +4w
27 + lOw 13 + 2w 3482+531w
24335 + 3588w 48 +7w 50+7w
N(E)
d
-1 +1 -1 +1 +1 -1 +1 -1 +1 +1 -1 +1 +1 +1 +1 -1 -1 +1 +1 +1 +1 +1 -1 +1 +1 -1 +1 +1 +1 +1 +1
53 55 57 58 59 61 62 65 66 67 69 70 71 73 74 77 78 79 82 83 85 86 87 89 91 93 94 95 97 101
422
h
E
1 3+w 2 89 + 12w 1 131 + 40w 2 99 + 13w 1 530 + 69w 1 17 + 5w 1 63 + 8w 2 7 +2w 2 65 +8w 1 48842 + 5967w 1 11 + 3w 2 251 + 30w 1 3480 + 413w 1 943 + 250w 2 43 + 5w 1 4+w 2 53 +6w 3 80+9w 4 9+w 1 82 + 9w 2 4+w 1 10405 + 1122w 2 28 +3w 1 447 + l06w 2 1574 + 165w 1 13 + 3w 1 2143295 + 221 064w 2 39+4w 1 5035 + 1138w 1 9+2w
N(E)
-1 +1 +1 -1 +1 -1 +1 -1 +1 +1 +1 +1 +1 -1 -1 +1 +1 +1 -1 +1 -1 +1 +1 -1 +1 +1 +1 +1 -1 -1
TABLES
423
TABLE 2 h,
TIlE NUMBER OF DIVISOR CLASSES, AND
TIlE REAL QUADRATIC FIELDS
R(yd),
N(E), THE NORM OF A FUNDAMENTAL UNIT, FOR d IS A SQUARE-FREE INTEGER, 101,,;; d < 500
WHERE
d
h
N(E)
d
h
N(E)
d
h
N(E)
d
h
N(E)
101 102 103 105 106 107 109 110 111 113 114 115 118 119 122 123 127 129 130 131 133 134 137 138 139 141 142 143 145 146 149 151 154 155 157 158 159 161 163 165
1 2 1 2 2 1 1 2 2 1 2 2 1 2 2 2 1 1 4 1 1 1 1 2 1 1 3 2 4 2 1 1 2 2 1 1 2 1 1 2
-1 +1 +1 +1 -1 +1 -1 +1 +1 -1 +1 +1 +1 +1 -1 +1 +1 +1 -1 +1 +1 +1 -1 +1 +1 +1 +1 +1 -1 +1 -1 +1 +1 +1 -1 +1 +1 +1 +1 +1
166 167 170 173 174 177 178 179 181 182 183 185 186 187 190 191 193 194 195 197 199 201 202 203 205 206 209 210 211 213 214 215 217 218 219 221 222 223 226 227
1 1 4 1 2 1 2 1 1 2 2 2 2 2 2 1 1 2 4 1 1 1 2 2 2 1 1 4 1 1 1 2 1 2 4 2 2 3 8 1
+1 +1 -1 -1 +1 +1 +1 +1 -1 +1 +1 -1 +1 +1 +1 +1 -1 +1 +1 -1 +1 +1 -1 +1 +1 +1 +1 +1 +1 +1 +1 +1 +1 -1 +1 +1 +1 +1 -1 +1
229 230 231 233 235 237 238 239 241 246 247 249 251 253 254 255 257 258 259 262 263 265 266 267 269 271 273 274 277 278 281 282 283 285 286 287 290 291 293 295
3 2 4 1 6 1 2 1 1 2 2 1 1 1 3 4 3 2 2 1 1 2 2 2 1 1 2 4 1 1 1 2 1 2 2 2 4 4 1 2
-1 +1 +1 -1 +1 +1 +1 +1 -1 +1 +1 +1 +1 +1 +1 +1 -1 +1 +1 +1 +1 -1 +1 +1 -1 +1 +1 -1 -1 +1 -1 +1 +1 +1 +1 +1 -1 +1 -1 +1
298 299 301 302 303 305 .307 309 310 311 313 314 317 318 319 321 322 323 326 327 329 330 331 334 335 337 339 341 345 346 347 349 353 354 355 357 358 359 362 365
2 2 1 1 2 2 1 1 2 1 1 2 1 2 2 3 4 4 3 2 1 4 1 1 2 1 2 1 2 6 1 1 1 2 2 2 1 3 2 2
-1 +1 +1 +1 +1 +1 +1 +1 +1 +1 -1 -1 -1 . +1 +1 +1 +1 +1 +1 +1 +1 +1 +1 +1 +1 -1 +1 +1 +1 -1 +1 -1 -1 +1 +1 +1 +1 +1 -1 -1
424
TABLES
TABLE 3 h, THE NUMBER OF DIVISOR CLASSES, FOR THE REAL QUADRATIC FIELD IS A PRIME AND P <2000"·b
d
h
N(E)
d
h
N(E)
366 367 370 371 373 374 377 379 381 382 383 385 386 389 390 391 393 394 395 397 398 399
2 1 4 2 1 2 2 1 1 1 1 2 2 1 4 2 1 2 2 1 1 8
+1 +1 -1 +1 -1 +1 +1 +1 +1 +1 +1 +1 +1 -1 +1 +1 +1 -1 +1 -1 +1 +1
401 402 403 406 407 409 410 411 413 415 417 418 419 421 422 426 427 429 430 431 433 434
5 2 2 2 2 1 4 2 1 2 1 2 1 1 1 2 6 2 2 1 1 4
-1 +1 +1 +1 +1 -1 +1 +1 +1 +1 +1 +1 +1 -1 +1 +1 +1 +1 +1 +1 -1 +1
I
I
R(VP5,
WHERE p
d
h
N(E)
d
h
N(E)
435 437 438 439 442 443 445 446 447 449 451 453 454 455 457 458 461 462 463 465 466 467
4 1 4 5 8 3 4 1 2 1 2 1 1 4 1 2 1 4 1 2 2 1
+1 +1 +1 +1 -1 +1 -1 +1 +1 -1 +1 +1 +1 +1 -1 -1 -1 +1 +1 +1 +1 +1
469 470 471 473 474 478 479 481 482 483 485 487 489 491 493 494 497 498 499
3 2 2 3 2 1 1 2 2 4 2 1 1 1 2 2 1 2 5
+1 +1 +1 +1 +1 +1 +1 -1 +1 +1 -1 +1 +1 +1 -1 +1 +1 +1 +1
• E. L. Ince, Cycles of reduced ideals in quadratic fields, in "Mathematical Tables," Vol. IV, British Association for the Advancement of Science, London 1934. b There are 303 prime numbers p less than 2000 (including p = 2). For 26 of these primes h=3 for R(VP). They are: p =79,223,229, 257, 359,443,659,
733,761,839,1091,1171,1223,1229,1367,1373,1489,1523, 1567, 1627, 1787, 1811, 1847, 1901, 1907, 1987. For 7 primes h =5. They are: p=401, 439, 499, 727, 1093, 1327, 1429. For 4 primes h=7. They are: p=577, 1009, 1087, 1601. For p=1129 we have h=9 (and the group of divisor classes is cyclic), and for p = 1297 we have h = 11. For the remaining 264 prime numbers p <2000, the field R(vP) has only one divisor class (that is, every divisor is principal).
TABLES
425
TABLE 4 h,
THE NUMBER OF DIVISOR CLASSES, FOR 1HE IMAGINARY QUADRATIC FIELDS
WHERE
a IS
SQUARE-FREE AND
1~a
R(V -a),
< 500
a
h
a
h
a
h
a
h
a
h
1 2 3 5 6 7 10 11 13 14 15 17 19 21 22 23 26 29 30 31 33 34 35 37 38 39 41 42 43 46 47 51 53 55 57 58 59 61 62 65 66
1 1 1 2 2 1 2 1 2 4 2 4 1 4 2 3 6 6 4 3 4 4 2 2 6 4 8 4 1 4 5 2 6 4 4 2 3 6 8 8 8
67 69 70 71 73 74 77 78 79 82 83 85 86 87 89 91 93 94 95 97 101 102 103 105 106 107 109 110 111 113 114 115 118 119 122 123 127 129 130
1 8 4 7 4 10 8 4 5 4 3 4 10 6 12 2 4 8 8 4 14 4 5 8 6 3 6 12 8 8 8 2 6 10
134 137 138 139 141 142 143 145 146 149 151 154 155 157 158 159 161 163 165 166 167 170 173 174 177 178 179 181 182 183 185 186 187 190 191 193 194 195 197 199 201
14 8 8 3 8 4 10 8 16 14 7 8 4 6 8 10 16 1 8 10 11 12 14 12 4 8 5 10 12 8 16 12 2 4 13 4 20 4 10 9 12
202 203 205 206 209 210 211 213 214 215 217 218 219 221 222 223 226 227 229 230 231 233 235 237 238 239 241 246 247 249 251 253 254 255 257 258 259 262 263 265 266
6 4 8 20 20 8 3 8 6 14 8 10 4 16 12 7 8 5 10 20 12 12 2 12 8 15 12 12 6 12 7 4 16 12 16 8 4 6 13 8 20
267 269 271 273 274 277 278 281 282 283 285 286 287 290 291 293 295 298 299 301 302 303 305 307 309 310 311 313 314 317 318 319 321 322 323 326 327 329 330 331 334
2 22 11 8 12 6 14 20 8 3 16 12 14 20 4 18 8 6 8 8 12 10 16 3 12 8 19 8 26 10 12 10 20 8 4 22 12 24 8 3 12
131
133
10
2 5 12 4 5 4
426
TABLES
TABLE 5 THE DISCRIMINANTS OF ALL KNOWN ORDERS OF IMAGINARY QUADRATIC FIELDS FOR WHICH EVERY GENUS OF MODULES BELONGING TO THE ORDER CONSISTS OF A SINGLE CLASS·· b
a
h
a
h
a
335 337 339 341 345 346 347 349 353 354 355 357 358 359 362 365 366 367 370 371 373
18 8 6 28 8 10 5 14 16 16 4 8 6 19 18 20 12 9 12 8 10
374 377 379 381 382 383 385 386 389 390 391 393 394 395 397 398 399 401 402 403
28 16 3 20 8 17 8 20 22 16 14 12 10 8 6 20 16 20 16 2
406 407 409 410 411 413 415 417 418 419 421 422 426 427 429 430 431 433 434 435
h
16 16 16 16 6 20 10 12 8 9 10 10 24 2 16 12 21 12 24 4
a
437 438 439 442 443 445 446 447 449 451 453 454 455 457 458 461 462 463 465 466
h
20 8 15 8 5 8 32 14 20 6 12 14 20 8 26 30 8 7 16 8
a
h
467 469 470 471 473 474 478 479 481 482 483 485 487 489 491 493 494 497 498 499
7 16 20 16 12 20 8 25 16 20 4 16 7 20 9 12 28 24 8 3
I
• L. E. Dickson, "Introduction to the Theory of Numbers," Dover, New York, 1929.
bI. The discriminants of maximal orders (65 values):
3 4 7 8 -11 -15 -19 -20 -24 -35 -40
43 51 52 67 84 88 91 -115 -120 -123 -132
-148 -163 -168 -187 -195 -228 -232 -235 -267 -280 -312
-340 -372 -403 -408 -420 -427 -435 -483 -520 -532 -555
- 595 627 660 708 715 760 795 840 -1012 -1092 -1155
-1320 -1380 -1428 -1435 -1540 -1848 -1995 -3003 -3315 -5460
CH
427
TABLES
2_ The discriminants of nonmaximal orders (36 values): -3-2 2 -3-3 2 -3-4 2 -3-5 2 -3.7 2 -3-8 2
-4-2 2 -4-3 2 -4.4 2 -4.5 2 -7-2 2 -7-4 2
-7.8 2 -8.2 2 -8-3 2 -8.6 2 -11.3 2 -15-2 2
-15-4 2 -15-8 2 -20-3 2 -24-2 2 -35.3 2 -40-2 2
-88.2 2 -120.2 2 -168-2 2 -232.2 2 -280.2 2 -312-2 2
-408.2 2 -520.2 2 -760-2 2 -840-2 2 -1320-2 2 -1848.2 2
The suitable numbers of Euler: 1,2,3,4,5,6,7,8,9,10,12,13,15,16,18,21,22,24, 25, 28, 30, 33, 37, 40, 42, 45, 48, 57, 58,60, 70, 72, 78, 85, 88, 93, 102, 105, ll2, 120, 130, 133, 165, 168, 177, 190,210,232,240,253,273,280,312,330,345,357,385,408, 462,520, 760,840, 1320, 1365, 1848_
TABLE 6 h, THE NUMBER OF DIVISOR CLASSES, FOR CERTAIN CUBIC FIELDS R(V 3m)
n
m
n
m
n
1
22 23 26 28 29 30 31 33 34 35 37 38 39 41 42
3 I
43 44 45 46 47 63 65 91 124 126 182 215 217 342 422
12 1 1 1 2 6 18 9 9 9 27 21 27 27 21
m
2 3 5 6 7 10 II 12 13 14 15 17 19 20 21
I I
1 3 I
2 I
3 3 2 I
3 3 3
3 3 1 3 3 I
3 3 3 3 6 1 3
I
TABLES
428
TABLE 7 h,
THE NUMBER OF DIVISOR CLASSES, FOR ALL TOTALLY REAL CUBIC
FIELDS WITH DISCRIMINANT
< 20,000·,b
Bounds for the discriminants
Number of fields
Bounds for the discriminants
Number of fields
1- 1,000 1,001- 2,000 2,001- 3,000 3,001- 4,000 4,001- 5,000 5,001- 6,000 6,001- 7,000 7,001- 8,000 8,001- 9,000 9,001-10,000 10,001-11,000
22 32 35 39 34 41 37 47 40 39 42
11,001-12,000 12,001-13,000 13,001-14,000 14,001-15,000 15,001-16,000 16,001-17,000 17,001-18,000 18,001-19,000 19,001-20,000
52 37 43 42 46 52 39 39 48
Total
806
• H. J. Godwin and P. A. Samet, J. London Math. Soc. 34,108-110 (1959); H. J. Godwin, Proc. Cambridge Phil. Soc. 57, 728-730 (1961). b The cubic field R(O) is called totally real if s =3, t =0, that is, if all its isomorphisms into the complex numbers are real. If the minimum polynomial of 0 factors into linear factors in R(O), then the field R(O) is called cyclic. Cyclic cubic fields are characterized by the fact that their discriminant is the square of a rational integer. There are 830 totally real cubic fields with discriminant < 20,000. Of these 24 are cyclic. For 16 cyclic cubic fields h=l. These fields have discriminants 72, 9 2,13 2,19 2,31',37 2, 432,61',672,732,792,972, 103 2, 109 2, 127 2, 139 2. For each of the discriminants 63 2, 91',117 2,133 2, there are precisely two cyclic cubic fields, and for all of these fields, h= 3. The noncyclic totally real cubic fields with discriminant < 20,000 are distributed as follows (for each value of the discriminant there is only one field: Among these fields there are 748 with h = 1. There are 29 fields with h= 2. Their discriminants are 1,957, 2,777, 3,981, 6,809, 7,053, 7,537, 8,468, 8,789,9,301, 10,273, 10,889, 11,197, 1,324,11,348,12,197,13,676,13,768,14,013,14,197, 15,188, 15,529, 16,609, 16,997, 17,417, 17,428, 17,609, 17,989, 18,097, 19,429. Fields with h =3 (there are 26 of them), have discriminants 2,597, 4,212, 4,312, 5,684, 6,885, 7,220, 8,829, 9,653, 9,800, 9,996, 10,309, 11,417, 13,916, 13,932, 14,661, 14,945, 15,141, 15,884, 16,660, 16,905, 18,228, 18,252, 18,792, 19,220, 19,604, 19,764. The three fields with discriminants 8,069, 16,357, 19,821 have h = 4. There are no fields with h';:l:. 5 (among the totally real cubic fields with discriminant < 20,(00). Remark: Within the limits of the table there is one and only one noncyclic totally real cubic field for each value of the discriminant. However, this is not always true. Thus, for example, there are at least three fields with discriminant 22,356 (Problem 21 of Section 2, Chapter 2).
TABLES
429
TABLE 8 THE
FACTOR
FOR THE
lth
h· = h·(l) OF lHE
NUMBER OF DIVISOR CLASSES
CYCLOTOMIC FIELD FOR PluME NUMBERS
I
h·
I
3 5 7 11 13 17 19 23 29 31 37 41
1 1 1 1 1 1 1 3 23 9 37 1]2
43 47 53 59 61 67 71 73 79 83 89 97
1<100
h· 211 5·139 4,889 3'59·233 41'1,861 67'12,739 7 2 '79,241 89'134,353 5'53'377,911 3· 279,405,653 113'118,401,449 577· 3,457 .206,209
430
TABLES
TABLE 9 ALL IRREGULAR PRIME NUMBERS ~ NUMBERS
2a
4001.
ALONG WITH THE PRIME
DIVISIBLE BY
I.
THERE ARE
219
ARE LISTED THOSE
I
2a
32 44 58 68 24
B 2a
IRREGULAR PRIME NUMBERS ~
< 4000 WHICH DO NOT APPEAR IN THE TABLE ARE
37 59 67 101 103
I
(2~2a ~1-3) FOR WHICH THE NUMERATOR OF THE BERNOULLI NUMBER
2a
I
IS
4001. ALL ODD PRIMES REGULAR (THERE ARE 331 OF THEM)ab I
2a
647 653 659 673 677
236,242,554 48 224 408,502 628
1307 1319 1327 1367 1381
382,852 304 466 234 266
131 149 157 233 257
22 130 62,110 84 164
683 691 727 751 757
32 12,200 378 290 514
1409 1429 1439 1483 1499
358 996 574 224 94
263 271 283 293 307
100 84 20 156 88
761 773 797 809 811
260 732 220 330,628 544
1523 1559 1597 1609 1613
1310 862 842 1356 172
311 347 353 379 389
292 280 186,300 100,174 200
821 827 839 877 881
744 102 66 868 162
1619 1621 1637 1663 1669
401 409 421 433 461
382 126 240 366 196
887 929 953 971 1061
418 520,820 156 166 474
1721 1733 1753 1759 1777
30 810,942 712 1520 1192
463 467 491 523 541
130 94, 194 292,336,338 400 86
1091 1117 1129 1151 1153
888 794 348 534,784,968 802
1787 1789 1811 1831 1847
1606 848, 1442 550,698, 1520 1274 954, 1016, 1558
547 557 577 587 593
270,486 222 52 90,92 22
1193 1201 1217 1229 1237
262 676 784,866,1118 784 874
1871 1877 1879 1889 1901
1794 1026 1260 242 1722
607 613 617 619 631
592 522 20,174,338 428 80,226
1279 1283 1291 1297 1301
518 510 206, 824 202, 220 176
1933 1951 1979 1987 1993
1058, 1320 1656 148 510 912
560 980 718 270, 1508 388, 1086
431
TABLES
2a
l
l
2a
l
2a
1997 2003 2017 2039 2053
772, 1888 60,600 1204 1300 1932
2663 2671 2689 2753 2767
1244 404, 2394 926 482 2528
3491 3511 3517 3529 3533
2544 1416,1724 1836, 2586 3490 2314,3136
2087 2099 2111 2137 2143
376, 1298 1230 1038 1624 1916
2777 2789 2791 2833 2857
1600 1984, 2154 2554 1832 98
2153 2213 2239 2267 2273
1832 154 1826 2234 876, 2166
2861 2909 2927 2939 2957
3539 3559 3581 3583 3593 3607 3613 3617 3631 3637
2082,2130 344, 1592 1466 1922 360,642 1976 2082 16, 2856 1104 2526, 3202
2293 2309 2357 2371 2377
2040 1660, 1772 2204 242,2274 1226
2999 3011 3023 3049 3061
776 1496 2020 700 2522
3671 3677 3697 3779 3797
1580 2238 1884 2362 1256
2381 2383 2389 2411 2423
2060 842,2278 776 2126 290,884
3083 3089 3119 3181 3203
1450 1706 1704 3142 2368
3821 3833 3851 3853 3881
3296 1840,1998,3286 216,404 748 1686,2138
2441 2503 2543 2557 2579
366,1750 1044 2374 1464 1730
3221 3229 3257 3313 3323
98 1634 922 2222 3292
3917 3967 3989 4001
1490 106 1936 534
2591 2621 2633 2647 2657
854, 2574 1772 1416 1172 710
3329 3391 3407 3433 3469
1378 2232,2534 2076, 2558 1300 1174
352 400,950 242 332,1102,2748 138, 788
a D. H. Lehmer, Emma Lehmer, H. S. Vandiver, 1. L. Selfridge, and C. A. Nicol, Proc. Natl. A cad. Sci. U.S. 40, No.1, 25-33. (1954); No.8, 732-735 (1954); 41, No. II. 970-973 (1955). b Table 9 was corrected in the second printing to include four additional pairs as noted in Wells Johnson, Math. Compo 27 (1973), 387-396 and in V.V. Kobelev, Soviet Math. Dokl. II (1970), 188-189. See also 1.L. Selfridge and R. Pollack, Notices Amer. Math. Soc. II (1964), 97.
Index A
Convex set, 110 Cyclotomic field, 325 Cyclotomic polynomial, 325
Absolute index of ramification of divisor, 217 Absolute norm of divisor, 216 degree of inertia, 217 Absolutely irreducible polynomial, 10 Algebraic element extension, 396 number, 78 Algebraic integer, 92 Algebraic number field, 78 Analytic curve, 305 function, 284 Associate numbers of module, 89
D Decomposable form, 78 Dedekind ring, 207 Degree of field extension, 396, 397 Degree of inertia of extension of field with valuation, 259 of prime divisor, 199 Determinant of quadratic form, 390 Diagonal quadratic form, 392 Direct sum of quadratic forms, 392 Dirichlet series, 330 Discrete set of points, 99, 100 Discriminant, of algebraic number field, 92 of basis, 403 of binary quadratic form, 139 of full module, 92 Division with remainder, 166 Divisor, 170, 212 Dual basis, 403
B Basis, of field extension, 397 of lattice, 99 of module, 83 Bernoulli number, 382 Binary quadratic form, 395 Bounded p-adic sequence, 28 Bounded set of points, 100
C Centrally symmetric set, 110 Character of Abelian group, 415 Character of quadratic field, 238 Characteristic polynomial. 399 Class of divisors, 220 Coefficient ring, 87 Complete field under valuation, 255 Complete metric field, 35 Completion of field, under metric, 35 under valuation, 253 Congruence, of elements of ring modulo a divisor, 207 of polynomials, 3 Conjugate fields, 404 elements, 404
E Equivalence of divisors, 220, 221 of integral polynomials modulo a prime, 3 of quadratic forms, 391 Euclidean ring, 166 Even numerical character, 335 Extension field, 396, 397 Extension of valuation, 185
F Finite extension of field, 397 Finite prime divisor, 280 Fractional divisor, 212 Full decomposable form, 83 433
434
INDEX
Full lattice, 99 Full module, 83 Fundamental basis of finite extension of field with valuation, 260 of integral closure, 200 Fundamental domain, 312 Fundamental modulus of numerical character, 420 Fundamental parallelepiped, 101 Fundamental sequence, 34 Fundamental units 'of algebraic number field, 114
G Gaussian sum, 14, 333 Genus, of divisor, 245 of form, 241
H Hilbert symbol, 55
M
Metric, 33 Metric field, 32 Minimum polynomial, 397 Modular equivalence, 149 Module in algebraic field, 81 N
Norm of divisor, 198 of element, 400 of module, 124 of point, 98 Normed Gaussian sum, 349 metric, 281 N umerical character, 418
o Odd numerical character, 335 Order in algebraic number field, 88
I
Ideal of field relative to Dedekind ring, 214 Index of finite extension of field with valuation,259 of ramification of prime divisor, 196 of valuation, 186 Inertia field of finite extension, 263 Infinite prime divisor, 280 Integral closure, 413 Integral divisor, 212, 213 Integral element of field with valuation, 255 over ring, 412 over valuation, 181 Integral equivalence of forms, 77 Integrally closed, 413 Invariants of finite Abelian group, 415 Irregular prime number, 224 L
Lattice, 99 Local analytic manifold, 302 Local method, 251 Logarithmic representation of algebraic numbers, 104
p
p-Adic completion, 253 p-Adic field, 25, 35 p-Adic field, 280, 281
field of formal power series, 40,263 integer, 20 metric, 27 metric, 277 valuation, 23, 178 p-Integral rational number, 22 Prime divisor, 170, 171 Prime element of ring, 165 Primitive character, 419 Primitive element, of algebraic number field, 79 of finite extension, 398 Primitive form, 140 Primitive polynomial, 275 Principal divisor, 170, 171 Proper equivalence of binary quadratic forms, 140
p-Adic p-Adic p-Adic p-Adic
Q
Quadratic field, 130 Quadratic numerical character, 17, 237, 349
INDEX
R Reduced number basis of planar lattice, 145 of imaginary quadratic field, 148 module of real quadratic field, 153 of imaginary quadratic field, 148 ofreal quadratic field, 153 Regular prime number, 224 Regulator of algebraic number field, 115 of order, 115 Representation of number by quadratic form, 391 of zero by quadratic form, 391 Residue class field of valuation, 181 of field with valuation, 255 Ring of residue classes modulo a divisor 208 ' of valuation, 187
S Separable extension, 402 Similar modules, 82 Simple finite extension, 185, 186 Strict equivalence of divisors of quad-
435
ratic field, 239 Strictly similar modules in quadratic field, 140 Suitable numbers of Euler, 427
T Topological isomorphism, 35 Totally ramified extension of field with valuation, 262 Trace of element, 400 Transcendental element, 397 U
Unique factorization, 166 Unit of algebraic number field, 93 of order, 93 p-adic, 21 Unramified extension of field with valuation, 262
v Valuation, 175
z Zeta-function of Dedekind, 309 of Riemann, 320