Lectures on Algebra - PDF Free Download

S S Abhyankar Lectures on Algebra Volume I Lectures on Algebra Volume I This page is intentionally left blank S S...

Author: Shreeram S. Abhyankar

127 downloads 2211 Views 35MB Size Report

This content was uploaded by our users and we assume good faith they have the permission to share this book. If you own the copyright to this book and it is wrongfully on our website, we offer a simple DMCA procedure to remove your content from our site. Start by pressing the button below!

Report copyright / DMCA form

DOWNLOAD PDF

S S Abhyankar

Lectures on Algebra Volume I

Lectures on Algebra Volume I

This page is intentionally left blank

S S Abhyankar Purdue University, USA

;.*;,.'=siatj3

s i ^ i a i ^ s v : ••}••. ru-aaiTi1.'. •'•

"

iSfa^^K

Lectures on Algebra Volume I

Y | ^ World Scientific NEW JERSEY • LONDON • SINGAPORE • BEIJING • SHANGHAI • HONG KONG • TAIPEI • CHENNAI

Published by World Scientific Publishing Co. Pte. Ltd. 5 Toh Tuck Link, Singapore 596224 USA office: 27 Warren Street, Suite 401-402, Hackensack, NJ 07601 UK office: 57 Shelton Street, Covent Garden, London WC2H 9HE

British Library Cataloguing-in-Publication Data A catalogue record for this book is available from the British Library.

LECTURES ON ALGEBRA Volume I Copyright © 2006 by World Scientific Publishing Co. Pte. Ltd. All rights reserved. This book, or parts thereof, may not be reproduced in any form or by any means, electronic or mechanical, including photocopying, recording or any information storage and retrieval system now known or to be invented, without written permission from the Publisher.

For photocopying of material in this volume, please pay a copying fee through the Copyright Clearance Center, Inc., 222 Rosewood Drive, Danvers, MA 01923, USA. In this case permission to photocopy is not required from the publisher.

ISBN 981-256-826-3

Printed in Singapore by World Scientific Printers (S) Pte Ltd

PREFACE

In universities and colleges it has become customary to give two algebra courses, the first being called abstract algebra and the second linear algebra. The present volume, Lectures on Algebra I, is meant as a text-book for an abstract algebra course, while the forthcoming sequel, Lectures on Algebra II, should serve as a text-book for a linear algebra course. The author's fondness for algebraic geometry shows up in both volumes, and his recent preoccupation with the application of group theory to the calculation of Galois groups is evident in the second volume which contains more local rings and more algebraic geometry. Both volumes are based on the author's lectures at Purdue University during the last several years. An attempt has been made to make these volumes self-contained. The reader may prefer to start with the sixth lecture which gives a rapid summary of the first five lectures. He may also find it helpful to look at the detailed contents printed at the end of the volume just before the index; this is particularly significant for the enormous (about 300 pages) Section §5 of Lecture L5. When in a certain lecture we are referring to an item from another lecture, the citation of the other lecture precedes the citation of the item. Thus, for instance, in the proof of Theorem (Q4)(T13) in Lecture L5, the reference L4§5(011) is to Observation (Oil) of §5 of Lecture L4, whereas the reference ( T i l ) is to Theorem ( T i l ) of Lecture L5. Frequently, assertions made in one place are proved or expanded later on. For this purpose, forward reference is indicated by [cf.]. For instance, on page 4, at the end of the sentence "This is unique up to isomorphism, i.e., between two copies of it there is a one-to-one onto map preserving sums and products;" the phrase "[cf. L5§5(Q32)(T138.2)]" means that the proof of the preceding claim that there is a unique field GF(q) of q elements, will be given in Theorem (T138.2) of Quest (Q32) of Section §5 of Lecture L5. Like a Russian Petrushka doll, there are many books within this book. For instance, the first three lectures, LI, L2, L3 constitute a booklet on a basic abstract algebra course. The sixth lecture L6 by itself constitutes another such course. These two alternatives togther make up a larger such booklet. The fourth lecture L4 is a booklet on commutative algebra. It is continued in the fifth lecture L5 which may be V

PREFACE

VI

viewed as a treatise on commutative algebra. Within it, Quest (Q31) is a pamphlet on Suslin's work on projective modules and special linear groups over multivariable polynomial rings. Finally, Sections §§2-5 of L3, §§8-9 of L4, and Quests (Q33)-(Q35) of L5§5 form a short course on algebraic geometry. Thanks are due to Sudhir Ghorpade, Nan Gu, Nick Inglis, Valeria Grant Perez, Avinash Sathaye, David Shannon, Balwant Singh, Umud Yalcin, and Ikkwon Yie for much help. Thanks are also due to NSF Grant DMS 99-88166 and NSA Grant MSP H98230-05-1-0040 for financial support. Shreeram S. Abhyankar, Mathematics Department, Purdue University, West Lafayette, IN 47907, USA, e-mail: [email protected]

CONTENTS

Lecture LI: QUADRATIC EQUATIONS §1: Word Problems §2: Sets and Maps §3: Groups and Fields §4: Rings and Ideals §5: Modules and Vector Spaces §6: Polynomials and Rational Functions §7: Euclidean Domains and Principal Ideal Domains §8: Root Fields and Splitting Fields §9: Advice to the Reader §10: Definitions and Remarks §11: Examples and Exercises §12: Notes §13: Concluding Note

1 1 3 3 6 8 9 13 14 16 16 23 27 29

Lecture L2: CURVES AND SURFACES §1: Multivariable Word Problems §2: Power Series and Meromorphic Series §3: Valuations §4: Advice to the Reader §5: Zorn's Lemma and Well Ordering §6: Utilitarian Summary §7: Definitions and Exercises §8: Notes §9: Concluding Note

30 30 34 39 43 44 52 52 59 60

Lecture L3: TANGENTS AND POLARS §1: Simple Groups §2: Quadrics §3: Hypersurfaces §4: Homogeneous Coordinates

61 61 63 64 66

viii

CONTENTS

§5: Singularities §6: Hensel's Lemma and Newton's Theorem §7: Integral Dependence §8: Unique Factorization Domains §9: Remarks §10: Advice to the Reader §11: Hensel and Weierstrass §12: Definitions and Exercises §13: Notes §14: Concluding Note

70 72 77 81 82 83 83 90 98 98

Lecture L4: VARIETIES AND MODELS §1: Resultants and Discriminants §2: Varieties §3: Noetherian Rings §4: Advice to the Reader §5: Ideals and Modules §6: Primary Decomposition §6.1: Primary Decomposition for Modules §7: Localization §7.1: Localization at a Prime Ideal §8: Afnne Varieties §8.1: Spectral Afnne Space §8.2: Modelic Spec and Modelic Affine Space §8.3: Simple Points and Regular Local Rings §9: Models §9.1: Modelic Proj and Modelic Projective Space §9.2: Modelic Blowup §9.3: Blowup of Singularities §10: Examples and Exercises §11: Problems §12: Remarks §13: Definitions and Exercises §14: Notes §15: Concluding Note

100 100 104 105 107 108 134 136 137 144 146 152 152 153 154 157 159 160 161 171 172 195 200 201

Lecture L5: PROJECTIVE VARIETIES §1: Direct Sums of Modules §2: Grades Rings and Homogeneous Ideals §3: Ideal Theory in Graded Rings §4: Advice to the Reader §5: More about Ideals and Modules

202 202 206 209 216 216

CONTENTS

(Ql) Nilpotents and Zerodivisors in Noetherian Rings (Q2) Faithful Modules and Noetherian Conditions (Q3) Jacobson Radical, Zariski Ring, and Nakayama Lemma (Q4) Krull Intersection Theorem and Artin-Rees Lemma (Q5) Nagata's Principle of Idealization (Q6) Cohen's and Eakin's Noetherian Theorems (Q7) Principal Ideal Theorems (Q8) Relative Independence and Analytic Independence (Q9) Going Up and Going Down Theorems (Q10) Normalization Theorem and Regular Polynomials (Qll) Nilradical, Jacobson Spectrum, and Jacobson Ring (Q12) Catenarian Rings and Dimension Formula (Q13) Associated Graded Rings and Leading Ideals (Q14) Completely Normal Domains (Q15) Regular Sequences and Cohen-Macaulay Rings (Q16) Complete Intersections and Gorenstein Rings (Q17) Projective Resolutions of Finite Modules (Q18) Direct Sums of Algebras, Reduced Rings, and PIRs (Q18.1) Orthogonal Idempotents and Ideals in a Direct Sum (Q18.2) Localizations of Direct Sums (Q18.3) Comaximal Ideals and Ideal Theoretic Direct Sums (Q18.4) SPIRs = Special Principal Ideal Rings (Q19) Invertible Ideals, Conditions for Normality, and DVRs (Q20) Dedekind Domains and Chinese Remainder Theorem (Q21) Real Ranks of Valuations and Segment Completions (Q22) Specializations and Compositions of Valuations (Q23) UFD Property of Regular Local Domains (Q24) Graded Modules and Hilbert Polynomials (Q25) Hilbert Polynomial of a Hypersurfaces (Q26) Homogeneous Submodules of Graded Modules (Q27) Homogeneous Normalization (Q28) Alternating Sum of Lengths (Q29) Linear Disjointness and Intersection of Varieties (Q30) Syzygies and Homogeneous Resolutions (Q31) Projective Modules Over Polynomial Rings (Q32) Separable Extensions and Primitive Elements (Q33) Restricted Domains and Projective Normalization (Q34) Basic Projective Algebraic Geometry (Q34.1) Projective Spectrum (Q34.2) Homogeneous Localization (Q34.3) Varieties in Projective Space (Q34.4) Projective Decomposition of Ideals and Varieties

ix

216 218 219 220 225 229 230 236 241 247 261 268 272 277 280 300 311 340 341 344 345 348 354 364 372 381 385 393 397 399 401 408 414 433 441 514 529 534 534 536 541 545

x

CONTENTS

(Q34.5) Modelic and Spectral Projective Spaces (Q34.6) Relation between AfRne and Projective Varieties (Q35) Simplifying Singularities by Blowups (Q35.1) Hypersurface Singularities (Q35.2) Blowing-up Primary Ideals (Q35.3) Residual Properties and Coefficient Sets (Q35.4) Geometrically Blowing-up Simple Centers (Q35.5) Algebraically Blowing-up Simple Centers (Q35.6) Dominating Modelic Blowup (Q35.7) Normal Crossings, Equimultiple Locus, and Simple Points (Q35.8) Quadratic and Monoidal Transformations (Q35.9) Regular Local Rings §6: Definitions and Exercises §7: Notes §8: Concluding Note

547 548 552 552 553 555 555 559 566 567 569 577 578 596 597

Lecture L6: PAUSE AND REFRESH §1: Summary of Lecture LI on Quadratic Equations §2: Summary of Lecture L2 on Curves and Surfaces §3: Summary of Lecture L3 on Tangents and Polars §4: Summary of Lecture L4 on Varieties and Models §5: Summary of Lecture L5 on Projective Varieties §6: Definitions and Exercises

598 598 603 606 608 611 634

BIBLIOGRAPHY

689

DETAILED CONTENT

691

NOTATION-SYMBOLS

713

NOTATION-WORDS

717

INDEX

725

Lecture LI: Quadratic Equations §1: W O R D PROBLEMS Consider the following word problem. One morning I went to the garden and plucked some roses. Seeing that there were not enough I went to as many flower shops as I had roses and from each of them I purchased as many roses as I had plucked. Thus armed with sufficient flowers I went to the Ganesh Temple, Ganesh being the God of Learning and especially Mathematics. I put at his feet four times as many roses as I had plucked. Then I went to the Shiva Temple and deposited ten roses. The remaining eight roses I took to my spouse. How many roses did I pluck? Now in algebra what we do not know we call x. So having plucked x roses, from the various shops I bought x x x = x2 roses. Thus armed with x2 + x roses I offered to Ganesh Ax roses and then having deposited ten to Shiva the eight for my spouse makes x2 + x = Ax + 10 + 8. Bringing everything to one side gives us the quadratic equation x2 - 3x - 18 = 0. We can solve this quadratic equation by completing the square. In other words we want to add a quantity to x2 — 3x to make it a complete square. By the binomial theorem we have (x + y)2 = x2 + 2xy + y2. So we want 2y = —3, i.e., y = =£ and hence y2 = | . Thus transferring 18 to the right hand side and then adding | to both sides we get x2 - 2.x + | = f + 18, i.e., (x - | ) 2 = f + 18 = ^p- = f = ( | ) 2 . Therefore x - § = ± § . Thus x = | + | = 6 o r a ; = | - | = - 3 . Discarding the negative solution —3 as not applicable in this case we conclude that I plucked 6 roses. This completing the square method of solving a quadratic equation was conceived around 500 A.D. by the Indian mathematician Shreedharacharya. It was put in verse form in 1150 A.D. by another Indian mathematician Bhaskaracharya in his book [Bha] on Algebra called Beejganit. Thus, writing Y for the unknown, we have the quadratic equation aY2 + bY + c where a, b, c are its coefficients with a ^ 0. To make it monic, i.e., to arrange the coefficient of the highest degree term to be 1, we divide by a to get Y2 + BY + C with B = | and C = ^. Now we complete the square by writing R\2

/

Y

2

+ BY + C=lY+-j 1

R2

+C-—.

2

LECTURE LI: QUADRATIC EQUATIONS

Putting back the values of B and C, we get the solutions _ — b ± y/b2 — 4ac ~ 2o ' In the above word problem, first I thought of bringing 5 flowers to my spouse. That would make a = 1, b = —3, c = - 1 5 and putting this in the above formula we would get the solutions to be 3±%^. Since 69 has no integer square root I would be faced with the dilemma of picking an irrational number of flowers. That is why I decided to bring 8 flowers to my spouse since I knew this would lead to a constant term in my equation which could be factored into two factors whose negative sum equals the middle term. What I am referring to is the identity (Y - a){Y -/3) = Y2-(a

+ 0)Y + aP = Y2 + BY + C

which tells us that C — a0 and B = —(a + 0), i.e., the constant term is the product of the roots and the coefficient of Y is the negative sum of the roots. Around 1500 A.D., similar but much more complicated formulas for solving cubic and quartic equations by radicals were given by Cardano and Ferrari in Italy. Here by radicals we mean successively extracting square roots, cube roots, and so on. Then for the next three hundred years people tried to solve general quintic equations, where by solving they meant solving by radicals. This was proved to be impossible, first by the Norwegian mathematician Abel in 1820 and then by the young French mathematician Galois in 1830 [Gal]. But Galois went much further and gave a criterion of solvability for any polynomial equation of any degree. What he did was to associate to the equation a certain finite permutation group, which is now called the Galois group, and to prove that the equation can be solved by radicals if and only if its Galois group is solvable in a technical sense which he introduced. To explain this let f(Y)=a0Yn

+ aiYn~1+---

+ an

with

a0 ± 0

be a polynomial of any degree n. The coefficients ao, a i , . . . , an could be rational numbers, or real numbers, or complex numbers. All these are examples of fields, i.e., collections of objects in which we can carry out the operations of addition, subtraction, multiplication and division; in the last case the quantity we are dividing by is required to be nonzero. Letting the coefficients ao, a\,. • •, an of / belong to any field K, we find its roots a i , . . . , an in some bigger field giving us f(Y) =

a0(Y-a1)...(Y-an).

We assume the polynomial / to be separable by which we mean that the roots ai,...,an are all distinct. The Galois group of / is going to be a certain group of permutations of the roots ai,... ,an. A permutation on a set S is a bijection, i.e., a one-to-one onto map, of S onto itself. Under composition the set of all bijections of

§2: SETS AND MAPS

3

S forms a group which we denote by Sym(S'). In case the set S is finite and its size | 5 | (= the number of elements in it) is n, it is customary to write Sn for Sym(S'). For the size of an infinite set S we put \S\ — oo. Let us now formally introduce the terms set, map, composition, bijection, group, and field. §2: SETS A N D M A P S A map <j> '• S —> T from a set 5 (= a collection of objects called its elements) to a set T is a rule which to every x € S, i.e., element x of S, associates 4>{x) £ T, which is called the image of x under <j>\ this is sometimes indicated by writing x — i > {x) which is read as x maps to (x). The map

(x) ^ (y). The map is surjective (= onto) if for every y £T there is some x £ S with y = cj>(x). The map <j> is bijective if it is injective as well as surjective. A bijection (resp: an injection, a surjection) is a bijective (resp: injective, surjective) map. The composition of maps <j> : S —> T and ip : T —> £/ is the map ^ 0 : S —» C/ given by (tp / : 5 —> T the inverse -1 : T —> 5 is defined by <j)~1{y) = x <=> <j)(x) = y. The logical symbols A => B, A <= B, and A <=> B stand for A implies B, A is implied by B, and A is true if and only if B is true respectively. §3: G R O U P S A N D FIELDS A group is a set G in which: any two elements x, y can be multiplied to produce a third element xy; this multiplication is associative, i.e., for all x, y, z in G we have (xy)z = x(yz), and so we may omit the parenthesis; there is an identity element 1 in G with lx = xl — x for all i ? G ; and every x £ G has an inverse a: -1 £ G with xx"1 = x~xx = 1. Clearly Sym(S') is a group under composition, with the identity map x — i > x as 1. In particular Sn is a group with \Sn\ = n\, where n! is read as n factorial and stands for the product 1 x 2 x • • • x n. It S is infinite then clearly Sym(S') is infinite. The group G is commutative or abelian if xy = yx for all x, y in G. If G is abelian we may write x + y and 0 and —a: in place of xy and 1 and a; -1 respectively, and when we do so, we may call G an additive abelian group. In the general case when we are writing xy, we may call G a multiplicative group; this time G may or may not be commutative. A field K is an additive abelian group with an associative commutative multiplication defined on it such that the nonzero elements form a commutative multiplicative group and the two operations are connected by the distributive laws saying that for all a;, y, z in K we have x(y + z) = xy + xz and (y + z)x = yx + zx. The rational numbers, the real numbers, and the complex numbers clearly form infinite fields; they are usually denoted by Q, R, and C respectively. To get a finite field, in the eventful year 1830, the 19 year old Galois

4

LECTURE LI: QUADRATIC EQUATIONS

[Gal] noticed that for any prime p, the integers 0 , 1 , . . . ,p — 1 form a field when they are added and multiplied modulo p, i.e., after adding or multiplying the answer is to be divided by p and replaced by the remainder. This gives the Galois Field GF(p). At the Chicago World's Fair of 1893, E. H. Moore [Mol; Mo2] extended Galois' thought by showing that for every power q of p, there is a field GF(q) of q elements. This is unique up to isomorphism, i.e., between any two copies of it there is a one-to-one onto map preserving sums and products; [cf. L5§5(Q32)(T138.2)]. If 1 + 1 + . . . is never zero in a field K then K is said to be of characteristic 0. Otherwise 1 + H h 1 = 0 when 1 is repeated a certain number of times, and the smallest such number is easily seen to be a prime number, which is then called the characteristic of K. For instance the real and complex fields are of characteristic zero, and the Galois Field GF(g) is of characteristic p. Just as a set S is a collection of objects, a subcollection R is called a subset and denoted by R c S; we may also write S D R and call S an overset of R; this is characterized by the implication: x G R =>• x G S. For R\ c S and i?2 C S, their intersection Ri fl R2 is denned to be the set of all x £ S such that x G R\ and x G i?2, and their union Rx U R2 is denned to be the set of all x G S such that x € i?i or x G R%. Similarly for the intersection and union of more than two subsets. For R\ C S and R2 C S we define Ri \ R% to be the set of all x G 5 such that x G R\ but x S- R2; we may call this the complement of R2 in R\. Note that, similar to x ^ y, the symbols x £ R2, Ri <jL R2, and so on, denote the negations of a; G R2,Ri C i?2, and so on. The symbol 0 denotes the empty set. Given any set-theoretic map <j> : S —> T, i.e., given any map 0 of a set S into a set T, for any R C S we put 4>{R) = {4>{x) : x G R}, i.e., <j){R) = the set of all elements of T which can be expressed in the form (x) with x G R; we call <j>{R) the image of R under \ we also put im(<£) = ; finally, for any C / c T w e put <£ -1 ([/) = {x G S : <j>(x) G U). The set consisting of elements x\,...,xe (which may or may not be distinct) is denoted by {x\,..., xe}. However, the group whose only element is 1 may be denoted by 1 instead of the more cumbersome {1}. If H C G are groups under the same operations (and the same identity elements) then we say that H is a subgroup of G or G is an overgroup of H, and we denote this by H < G or G > H; also we write H < G or G > H to mean H < G and H ^ G. If H < G is such that for all h G H and g G G we have ghg~x G H then we say that H is a normal subgroup of G and indicate this by writing H
§3: GROUPS AND FIELDS

5

elements of G which can be expressed in the form xy with y £ H, and we call xH a left coset of H in G; similarly Hx = {yx : y £ H} is called a right coset of H in G; the left cosets of H in G form a partition of G, and so do the right cosets of H in G. Note that a partition of a set 5 is a collection of nonempty subsets of S such that S is their union and any two of them have an empty intersection. For H < G, by G/H we denote the set of all left cosets of H in G; if H < G then G/H becomes a group by defining (xH)(yH) = (xy)H\ we now call G/H the factor group of G by ff. For H < G we put [G : H] = \G/H\ = the number of left cosets of H in G and we call this the index of H in G; this notation is used especially when the index is finite; if G is finite then the index is clearly finite; if G is infinite then the index may be finite or infinite. A finite group G is said to be solvable if there is a chain 1 = Go 5. To introduce A„, we note that any a e Sn, i.e., any permutation on n letters, can be written as a product of transpositions, where by a transposition we mean a permutation which permutes only two letters while leaving all the other letters unchanged. Moreover, the parity, i.e., the evenness or oddness, of the number of transpositions occurring in such a product depends only on a. The permutation a is called even or odd according as the parity of the product is even or odd. Now An is defined to be the group of all even permutations. Clearly \An\ = \Sn\/2 = n!/2 and hence the index [Sn : An] = 2. Therefore An<SnSince An is simple for n > 5, it follows that Sn is unsolvable for n > 5. It only remains to note that the Galois group of the "general" quintic is 55. Therefore the general quintic cannot be solved by radicals. This is Galois' proof. It applies to the general equation of degree n for any n > 5.

6

LECTURE LI: QUADRATIC EQUATIONS

Thus we have the hierarchy of groups in increasing order of complexity: cyclic, abelian, solvable, simple, unsolvable. §4: RINGS A N D IDEALS Generalizing slightly, if in the definition of a group we do not require the existence of the inverse x~l then what we get is called a monoid, and what we get by further dropping the existence of identity is called a semigroup. The terms commutative or abelian semigroup, additive abelian semigroup, multiplicative semigroup, subsemigroup, and oversemigroup are obvious generalizations of the corresponding terms for groups; similarly for a monoid; note that the identity element of a submonoid is required to coincide with the identity element of the monoid. A semigroup H is cancellative means for any x,y, z in it we have the implications which say that: xy = xz or yx = zx =>• y = z. If H is a subsemigroup of a group G then clearly H is cancellative. As an example of an additive abelian group we have the set of all integers, usually denoted by Z, as an example of a submonoid of Z we have the set of all nonnegative integers, usually denoted by N, and as an example of a subsemigroup of Z we have the set of all positive integers, usually denoted by N+. Turning to fields, if K C L are fields under the same operations (and the same zero and identity elements) then we say that K is a subfield of L or L is an overfield of K. As examples, the rational numbers are a subfield of the real numbers which themselves are a subfield of the complex numbers. Again generalizing slightly, a ring R is an additive abelian group which is also a commutative multiplicative monoid such that the two operations are connected by the distributive laws saying that for all x, y, z in R we have x(y + z) = xy + xz and (y + z)x = yx + zx; we call R a null ring if in it we have 1 = 0, i.e., equivalently if \R\ = 1. The concepts of a subring and an overring are defined in an obvious manner; in particular they have the same 0 and 1. A domain is a nonnull ring whose nonzero elements form a cancellative multiplicative monoid. If R is a subring of a field K then R is clearly a domain. In particular the ring of integers, being a subring of the field of rational numbers, is a domain which, as said above, is usually denoted by Z. Dropping the commutativity of multiplication in a field (resp: ring, domain) we get the notion of a skew-field (resp: skew-ring, skew-domain). In the definitions of fields and rings we have spelled out two distributive laws, although they follow from each other, exactly because in case of skew-fields and skew-rings they do not. Moreover, in the definition of a skew-field, by requiring the left-distributive law x(y + z) = xy + xz, but not requiring the right-distributive law (y + z)x = yx + zx, we get the notion of a near-field. Finite near-fields were exhaustively dealt with by L. E. Dickson in 1905. Dickson was the first Ph.D. student of E. H. Moore in the newly opened University of Chicago which had become a vigorous center of mathematical research

U: RINGS AND IDEALS

7

in the westward expansion of the New World. At that time another disciple of E. H. Moore was the Scottish mathematician J. H. M. Wedderburn. Having classified finite fields in 1893, Moore asked his two young followers to study near-fields and skew-fields respectively. Dickson gave a complete list of finite near-fields which was later shown to be exhaustive by the German mathematician H. Zassenhaus as part of his 1935 Hamburg Ph.D. thesis written under the guidance of another famous German mathematician E. Artin. On his part Wedderburn showed that a finite skew-field is necessarily a field. As an additive abelian group, every subgroup / of a ring R is a normal subgroup and so we can form the factor group R/I; note that a typical member of R/I is a residue class (= additive coset) a + I with a £ R. By an ideal in a ring R we mean an additive subgroup I of R such that for all a £ R and x £ I we have ax £ / , and when that is so we make R/I into a ring by defining (a + I)(b + I) = (ab) + I for all a, b in R. We call R/I the residue class ring of R modulo I. Also we call / a maximal ideal or a prime ideal in R according as R/I is a field or a domain. Likewise, we call / a nonzero ideal or a nonunit ideal in R according as I ^ {0} or I ^ R; note that every ideal contains the zero ideal / = {0} and is contained in the unit ideal I = R; moreover, the zero ideal is prime O- R is a domain; likewise, / is the unit ideal <^ R/I is the null ring. For any a £ R we put aR = the set of all multiples az of a with z £ R, and for any I ^ C f l w e put WR — the set of all finite linear combinations a\Z\ + • • • + aeze with a i , . . . ,ae in W and z\,..., ze in R, and we note that these are ideals in R (by convention an empty sum is zero and hence 0 £ WR); they are called ideals generated by a and W respectively; an ideal of the form aR is called a principal ideal in R and a is called a generator of it; in case W = {wi,u>2,... }, we may denote WR by (wi,u>2, • • • )R and call wi,u>2,... its generators. It is easy to see that every ideal in Z is of the form pL with p £ N; moreover: pL is a maximal ideal in Z <=> pL is a nonzero prime ideal in Z «=> p is a prime number. If p is a prime number then we may identify Z/pZ with the Galois field GF(p). The concepts of normal subgroup and ideal can be motivated in terms of homomorphisms thus. For groups G and J, a homomorphism : G —> J is a map such that (p(l) = 1 and (xy) = (f>(x)(f>(y) for all x, y in G; the kernel of , denoted by ker(^), is the set of all x £ G with <j)(x) = 1; note that is injective iff (= if and only if) ker($) = 1 and when that is so we call <j> a monomorphism; if is surjective, i.e., if im() = J, then we call an epimorphism; if is bijective then we call it an isomorphism; if J = G and <j> is an isomorphism then we call an automorphism of G; the set of all automorphisms of G is clearly a subgroup of Sym(G) and we denote it by Aut(G). If G —> J is a group homomorphism then clearly im(0) < J and ker() < G. Conversely, for any H < G we get a group epimorphism G —> G/H with ker(c/>) = H by taking (x) = xH for all x £ G; we call

: G —> J becomes
8

LECTURE LI: QUADRATIC EQUATIONS

4>(x + y) = <j)(x) + 4>(y) for all x, y in G. For rings S and T, a (ring) homomorphism 0 : S —» T is a homomorphism of additive abelian groups such that 4>(1) = 1 and 4>{xy) = cj)(x)4>{y) for all x, y in S; now im(S') is a subring of T and ker(^) is an ideal in S, and the definitions of monomorphism, epimorphism, isomorphism, and automorphism carry over from the group case. Conversely, given any ideal / in a ring 5, for the canonical ring epimorphism <j> : S —> S/I we have ker(0) = / . The set of all ring automorphisms of a ring S is clearly a subgroup of Sym(5') and we denote it by Aut(S'); momentarily letting this Aut be written as Ring-Aut(5) and writing Group-Aut(S) for the automorphism group of the additive abelian group 5 we clearly have Ring-Aut(S) < Group-Aut(5) < Sym(5); it is usually clear from the context which automorphism group is being considered and so it is not necessary to use such a cumbersome notation. Finally for a subring R of a ring S, by an i?-automorphism of S we mean an automorphism
§6: POLYNOMIALS AND RATIONAL

FUNCTIONS

9

generated by U\ U • • • U Ue is denoted by U\ + • • • + Ue. When in the above definitions we want to stress the reference to the ring R, we may say .R-module, .R-submodule, .R-linearly independent, .R-basis, and so on. For modules V and V* over a ring R, by an i?-homomorphism (or .R-linear map) (f>: V —> V* we mean a homomorphism (f> of the underlying additive abelian groups such that for all r £ R and v £V we have ) and im() are submodules of V and V* respectively. Now the meaning of JJ-monomorphism, R-isomorphism, and so on, is obvious. For a skew-ring R, the above description defines a left .R-module V, and an obvious change defines a right .R-module V with a scalar multiplication which to every r £ R and v g V associates vr 6 V. The rest of the above definitions now apply with obvious modifications. By a vector space over a field K we mean a K-module V, and by a subspace of V we mean a .ft'-submodule of V, and so on. Elements of a vector space V, are frequently called vectors. It can be shown that a vector space V always has a basis, and there is a bijection between any two bases. The number of elements in a basis of V is denoted by dim#V, and we call this the dimension of V over K or the /^-dimension of V. Note that then either dim^V € N or dimpcV = oo. It can be shown that any set of generators of V contains a basis of V and hence if V is generated by a finite number of vectors v\,... ,ve then d i m ^ F < e. As said above, any overring of a ring R is an .R-module in an obvious sense. In particular R is an .R-module, and an ideal in R is exactly an .R-submodule of R. In particular, any overring L of a field K is a ff-vector space, and we put [L : K] = dim^L. In particular this applies to an overfield L of K, and then we call [L : K] the field degree of L/K, i.e., of L over K. For a ring or skew-ring R, by R+ we denote the underlying additive abelian group of R, and by Rx we denote the set of all nonzero elements of R; note that K x is a multiplicative group in case of a field or skew-field K. We extend the last notation to any additive abelian monoid (such as an additive abelian group, or a module, or a vector space) V and denote the set of all nonzero elements in it by V x . For an overfield L of a field K, [L : K] usually denotes the field degree rather than the index of K+ in L+. For a homomorphism ) = 1 (resp: im((j)) = 0), where we note that the submonoid {1} (resp: {0}) is denoted by 1 (resp: 0). §6: POLYNOMIALS A N D RATIONAL F U N C T I O N S Given any ring R, we can consider the polynomial ring R[X] in an "indeterminate" X over R, and regard this as an overring of R. A typical polynomial in X

10

LECTURE

LI: QUADRATIC

EQUATIONS

over R, i.e., a typical member of il[X], has a unique (finite) expression A(X) = Y2 A-iX1

with

Ai e R

and i varying over a finite set of nonnegative integers. The degree of A is the largest i with Ai =£ 0; if A = 0, i.e., if A, = 0 for all i, then the degree is taken to be —oo; A is nonconstant means it does not belong to R, i.e., its degree is a positive integer; A is monic means A ^ 0 and Ae = 1 where e is the degree of A. For any other polynomial B(X) = ^2 Bix*

with

Bi&R

the sum C(X) = A{X) + B{X) = J2 CiX1 is given by componentwise addition

and the product = ] T DiXi

D(X) = A(X)B(X) is given by the "Cauchy multiplication rule"

Dk= Y.

AiB

i-

i+j=k

The "formal" X-derivative AX{X) = ^ r Ax(X)

1

=

of i4(X) is denned by putting Y,iAiXi~1.

Clearly A — i > >lx gives an i?-derivation of -R[X], i.e., {A + B)x = Ax + Bx

and

{AB)X = AXB + ABX

with Ax = 0 for all A E R. If i? is a domain then the degree of the product of two nonzero polynomials equals the sum of their degrees, and hence in particular the product is nonzero. It follows that if R is a domain then so is the polynomial ring i?[X]. For an indeterminate X over a field K, by K(X) we denote the field of rational functions in X over K. Members of K{X) are "quotients" of polynomials A(X)/B(X) with nonzero B. The above .ff-derivation of K[X] can be extended to a -^-derivation of K(X) by the "quotient rule" (u/v)x

= (uxv —

uvx)/v2.

Similarly, given any ring R, we can consider the polynomial ring R[X\,...,Xm} in a finite number of "indeterminates" X\,..., Xm over R, and regard this as an

§6: POLYNOMIALS

AND RATIONAL

11

FUNCTIONS

overring of R. A typical polynomial in Xi,..., Xm over R, i.e., a typical member of R[Xi,..., Xm] has a unique (finite) expression = YJAil...imX?...Xi™

A{X1,...,Xm)

with

Ah...im€R

and ( i i , . . . ,im) varying over a finite set of m-tuples of nonnegative integers. The degree of A is the maximum of i\ + • • • + im with Ai...i m ^ 0; if A = 0, i.e., if Ai1.„im = 0 for all i i , . . . ,im, then the degree is taken to be —oo; A is nonconstant means it does not belong to R, i.e., its degree is a positive integer. For any other polynomial B(XU.

..,Xm)

= Y,Bn-^x

---Xt

with

Bix...im

E R

the sum C(X! ,...,Xm)

= A(Xi ,...,Xm)

= J2 di-imXi1

+ B(XU ...,Xm)

•••Xt

is given by componentwise addition ^ii...im

=

-Aii...i m +

Dix...im

and the product D(X1,...,Xm)

= A(X1,...,Xrn)B(X1,...,Xm)

=

Y/Dil...irnXi>...Xl™

is given by the "Cauchy multiplication rule" Uki...km

=

£_J i\+ji=k\,...,im+jm=km

The "formal" partial derivative AXj {Xx,..., is defined by putting AXj{X1,...,Xm)

•™-i1...imBj1...jm.

Xm) =

QA

^->X^)

0f

A{XX

,...,Xm)

^ijAil...imX?...X)'s11Xi'-1X$1\..X%

=

and noting that A >—> AXj gives an R[Xi,...,X,_i,X,+i,..., X m ]-derivation of R[X\,..., Xm]. If J? is a domain then the degree of the product of two nonzero polynomials equals the sum of their degrees, and hence in particular the product is nonzero. It follows that if ii is a domain then so is the polynomial ring R[Xi,..., Xm}. For a finite number of indeterminates X\,..., Xm over a field K, by K(Xi,... ,Xm) we denote the field of rational functions in X\,... ,Xm over K. Note that members of K{X\,... ,Xm) are "quotients" of polynomials A(X\,..., Xm)/B{X\,..., Xm) with nonzero B. Again by the "quotient rule" we can extend the above K[Xi,...,

X,_!, Xj+i,...,

X m ]-derivation of K[Xi,...,

Xm\

to a K(Xi,...,Xj-i,Xj+i,...,Xm)-derivation

of

K(X\,...,Xm).

12

LECTURE

LI: QUADRATIC

EQUATIONS

Given any overring S of the ring R, any finite number of elements x\,..., S can be "substituted" for the indeterminates X\,..., Xm in any A{X\,..., R[Xi,...,Xm] to get A(x\,

. . . , Xm) = y ^ &i-v...imx\

• • • xm

xm in Xm) G

^ &

and we define R[xu...,xm]

= {A(xi,...,xm)

: A{Xi,...,Xm)

G R[Xi,...

,Xm}}

and we note that then R[xi,..., xm] is a subring of S. We call R[x\,..., xm] the subring generated by xi,...,xm over R. The i?-epimorphism R\Xi,..., Xm] —* R\xi,..., xm] and the i?-homomorphism R[Xi,..., Xm] —> S which send every polynomial A(X\,..., Xm) to A{x\,..., xm) are called natural maps or substitution maps. An element x G S is algebraic over R if it satisfies a polynomial equation A(x) = 0 with 0 7^ j4(-f) G -R[X]; otherwise it is transcendental over R; we may express this by saying that x/R is algebraic or transcendental respectively. For W C S, we say that W is algebraic over R if every a; G W is algebraic over R; again we may express this by saying that W/R is algebraic. Finite number of elements (i.e., a finite sequence) x\,..., xm are algebraically dependent over R if they satisfy a polynomial equation A(xi ,...,xm) = 0 with 0 ^ A(Xi,..., Xm) G R[Xi,..., Xm]; otherwise they are algebraically independent over R; we may again express this by saying that (xi,..., xm)/R are algebraically dependent or algebraically independent respectively. Finite number of elements x\,..., xm in 5 form a transcendence basis of S/R (= S over R) if they are algebraically independent over R, and S is algebraic over R[xi,...,xm). For W C S we let R\W] = L1R[XJ1 ,... ,Xje] with union over all finite sequences Xj1,..., Xje in W, and note that this is also a subring of S which we call the subring generated by W over R; note that R[
§7: EUCLIDEAN DOMAINS AND PRINCIPAL IDEAL DOMAINS

13

§7: E U C L I D E A N D O M A I N S A N D PRINCIPAL IDEAL D O M A I N S For any field K, it can be shown that the one variable (= indeterminate) polynomial ring K[X] as well as the many variable polynomial ring K[X\,..., Xm] are UFDs in the following sense. A unit in a ring R is an element u in R such that uu' = 1 for some u' € R; note that then u' is unique and, in accordance with the field case, we may denote it by u - 1 , and for any v £ R we may write v/u for v u _ 1 ; the set of all units in R forms a multiplicative group which we denote by U(R); note that for a field K we have U(K) — Kx. We extend the notation U(R) to denote the multiplicative group of all (two-sided) units in a skew-ring R; similarly, the (two-sided) inverse of a unit u in a skew-ring R may be denoted by u _ 1 ; again note that for a skew-field K we have U(K) = Kx. Elements z, z' in a ring R are associates of each other (in R) if the ideals zR and z'R coincide; note that in case of a domain R, this is equivalent to saying that z' = uz for some unit u in R. An irreducible element in a ring R is a nonzero nonunit element z in R such that z cannot be expressed as z = z'z" where z' and z" are nonzero nonunit elements in R. A ring R is a UFD (= unique factorization domain) if R is a domain and every nonzero element z in R can uniquely be expressed as z = uz\ ...zT where u is a unit in R and irreducible elements in R; here uniqueness means that if z = u'z[... z'r, is any other such factorization then r' — r and there is a permutation . . . , a(r) of 1 , . . . , r such that z',^ is an associate of Zi for 1 < i < r. In case of K[X] or K[X\,..., Xm], the nonzero elements of K are exactly all the units in K[X] or K[X\,... ,Xm] respectively. In case of X p f ] , every nonzero nonunit in K[X] is the associate of a unique nonconstant monic polynomial. Analogous to Z, every nonzero nonunit ideal in K\X] is of the form A(X)K[X] where A(X) is a nonconstant monic polynomial; moreover this ideal is prime <=> it is maximal & A(X) is irreducible. In particular the rings K[X] and Z are PIDs (= Principal Ideal Domains), where by a PID we mean a domain R in which every ideal is principal, i.e., it is of the form xR for some x S R. Moreover both these rings are EDs (= Euclidean Domains), where by an ED we mean a domain R together with a Euclidean Function, i.e., a map <j> : Rx —> N such that for all x,y in Rx we have: y S xR => <j>(x) < 4>{y), and y £ xR => y = qx + r for some q £ R and r G Rx with (r) < 4>{x). If this r (and hence also q) is unique (depending on x and y) then we may call R a special ED, and if there is a subset R of R such that r becomes unique when required to belong to R then we may call R a quasispecial ED relative to the quasispecial subset R. In case of K[X] we take <j>(x) = the X-degree of x, and we note that then K[X] becomes a special ED. In case of Z we take (j)(x) = \x\, i.e., 0 or x < 0, and we note that then Z becomes a quasispecial ED relative to the quasispecial subset N. It can be shown that: ED => PID =• UFD, and: R is a UFD => R[X] is a UFD. Now by induction on m we see that K[X\,..., Xm] is a UFD.
14

LECTURE

LI: QUADRATIC

EQUATIONS

§8: ROOT FIELDS A N D SPLITTING FIELDS In view of what we have said above, in case of the univariate (= one variable) polynomial ring K[Y] over a field K, for any irreducible g(Y) G K[Y] the residue class ring K[Y]/g(Y)K[Y] is a field L' which we may regard as an overfield of K, and calling the image of Y under the canonical epimorphism K[Y] —* V to be a.\ we have V = K\a{\ = K(cc\) with g(a\) = 0. We call V a root field of g over K. Given any nonconstant f(Y) £ K\Y] of some degree n, by taking g(Y) to be an irreducible factor of f(Y) in K[Y] we get /(c*i) = 0 and 0 ^ fx{Y) = f{Y)/{Y-al) is of degree n — 1. If n — 1 > 0 then repeating this process we find an element 0:2 in an overfield of K{ax) such that / i ( a 2 ) = 0 and 0 ^ / 2 ( F ) = fi(Y)/(Y - a2) is of degree n — 2. Continuing this process n times we find elements ai,...,an in an overfield of K such that /00=a0(r-ai)...(y-an) where ao is the coefficient of Yn in / ( F ) . The overfield L* = i f [ a i , . . . , a n ] = iiT(ai,..., a n ) of K is called a splitting field of / over K and as notation we put SF(f,K)

= L* =

K(a1,...,an).

For any overfield L of L*, we call L* the splitting field of / over K in L; it can be shown that then for any if-isomorphism <j>: L* —> L** where L** is any overfield of K in L we must have L** = L*, i.e., <j> must be a if-automorphism of L*. To define the concept of a AT-isomorphism, given any overrings S and T of a ring R, a homomorphism (resp: monomorphism, epimorphism, isomorphism) <j> : 5 —> T is called an .R-homomorphism (resp: i?-monomorphism, iJ-epimorphism, il-isomorphism) if (j>(x) = x for all x S R; note that an R-automorphism of S is an .R-isomorphism S —> S. With this definition at hand, it can be shown that any two splitting fields of / over K are -ff-isomorphic, i.e., there is a X-isomorphism of one onto the other. Thus the notation SF(/, K) is unique up to isomorphism. Upon letting f(Y) = a0Yn + axYn-1 + • • • + an with ao,a\,...

,an in K, for the Y—derivative fy(Y) of f(Y) we have fY{Y)

= naoY"-1 + (n - l)aiYn~2

+ •••+ o„_i.

It is easily seen that / is separable (i.e., the roots ai,...,an are distinct) iff f(Y) and / y ( V ) have no nonconstant common factor in ^ [ V ] , By a Galois extension of K we mean an overfield of K which is the splitting field of some nonconstant separable univariate polynomial over K. Assuming / to be separable, by the Galois group Gal(L*,JK') of L* over K we mean the group Aut/f(L*). For any a € Gal(L*,K) we have f(Y) = ao(Y — cr(a\))... (Y — a(an)) and hence ( o i , . . . , a«) i-> (a(ai),..., a(an)) gives a permutation a* of { a i , . . . , an}. Clearly

§S: ROOT FIELDS AND SPLITTING FIELDS

15

a >-> a* gives a monomorphism Gal(L*,if) —> Sn = the permutation group on { a i , . . . , a „ } . We define the image of this monomorphism to be the Galois group of / over K and denote it by Gal(/, K). Thus we have a natural isomorphism Gal(L*,if) —* Gal(/, K) and hence in particular Gal(L*,if) is a finite group. Now Galois himself [cf. L6§6(E6)] characterized Gal(/, K) as the set of all relations preserving permutations of the roots, i.e., the set of all T G Sn such that for every P(Xi,... ,Xn) G K\Xi,...Xn) with P{ai,... ,a„) = 0 we have P ( r ( a i ) , . . . , r(a„)) = 0. For any intermediate field L, i.e., a field L with K C L C L*, it is clear that L* = SF(/,L) and Gal(L*,L) < Gal(L*,if). Likewise, for any H < Gal(L*,if), upon letting fixL.(ff) = {z £ L* : ft(z) = z for all ft G # } it is clear that this is an intermediate field; we call this the fixed field of H. Eventually, following Galois, we shall prove the following [cf. L6§6(E1)]: FUNDAMENTAL THEOREM OF GALOIS THEORY (Tl). The mapping L — i * Gal(L* ,L) = H gives an inclusion reversing bijection of the set A of all fields L between K and L* onto the set T of all subgroups H of Ga\(L*,K), and the inverse bijection is given by H — i > fix/,. (77) = L, where inclusion reversing means: L\ C Li <=> Gal(L*,L x ) D GaI(L*,L 2 ). Moreover iT< Gal(L*,^) <=> L is a Galois extension of K, and when that is so Ga\(L*,K)/H is naturally isomorphic to Gal(L,K); in greater detail, by restricting a € Ga^L*,/^) to L we get <7 G Gal(L,if), and a ^> a gives an epimorphism Gal(L*,if) —> Gal(L, if) with kernel H. Finally for every L e A we have |Gal(L*,L)| = [L* : L}. By an r-cyclic extension of K, where r is a positive integer, we mean a Galois extension L of K such that Gal(L, K) is a cyclic group of order r. The following theorem dates back to 1775 [cf. L6§6(E3)]: LAGRANGE RESOLVENT THEOREM (T2). Assume that K has r distinct rth roots of 1. Then any r-cyclic extension of K is obtained by adjoining an r-th root of some nonzero element a of K, i.e., it is of the form K(a) with 0 ^ ar = a £ K. Conversely, for any 0 ^ a G K, the splitting field of Yr — a over K is an s-cyclic extension of K for some divisor s of r. The above two theorems have the following consequence [cf. L6§6(E3)]: SOLVABILITY THEOREM (T3). Assume that K has n! distinct n!-th roots of 1. Then Gal(/, K) is solvable if and only if the equation / = 0 can be solved by radicals, i.e., if and only if there is a chain of fields K = KQ C ifi C • • • C Kt = K(ai,... ,an) such that for 1 < i < t wehaveifj = Ki-i(ui'r') with 0 ^ Ui e ifj-i and rj G N+. The following theorem dates back to 1665 [cf. L6§6(E4)]:

16

LECTURE

LI: QUADRATIC

EQUATIONS

NEWTON'S SYMMETRIC FUNCTION THEOREM (T4). For the generic n-th degree polynomial F(Y)=Yn

+ X1Yn~1

over K = k(X\,..., Xn), where X\,..., haveGal(F, J ft:) = 5'„.

+ --- + Xn

Xn are indeterminates over a field k, we

We have already noted the following theorem dating back to 1830 [cf. L6§6(E5)]: GALOIS' SYMMETRIC GROUP THEOREM (T5). The group Sn is unsolvable for n > 5. Finally, as a consequence of (T3), (T4), (T5) we get the [cf. L6§6(E5)]: UNSOLVABILITY THEOREM (T6). Assume that the field A; contains n\ distinct n!-th roots of 1. Then, for n > 5, the generic n-th degree polynomial F(Y) over K = k(X\,..., Xn) cannot be solved by radicals. §9: A D V I C E TO T H E R E A D E R We shall now fill in some details of argument which were left out in the above discourse so as not to break the smooth flow of thought. This will be done in a series of Definitions (Dl), (D2), ..., Remarks (Rl), (R2), ..., Examples (XI), (X2), ..., Exercises (El), (E2), ..., and Notes (Nl), (N2), ... . We shall end up with a Concluding Note in the style of Aesop's Fables. The same pattern may be followed in the succeeding lectures. In any lecture, assertions such as Theorems, Corollaries, Claims, and Propositions, will be labelled (Tl), (T2), ..., and formulas and other items will be labelled (1), (2), ..., (Al), (A2), ..., (Bl), (B2), ..., and so on. Sometimes there may be Observations (01), (02), ... . In a lecture, say in Lecture L3, items from another lecture, say Lecture L2, will be cited as L2(D1), L2(T4), and so on. Some details left out in a lecture may very well be covered in a later lecture. The reader may need a certain amount of patience to follow this method of concentric circles, whereby the ideas expounded at a time are explained and expanded in an outer circle. The patience will be rewarded by entertainment and ease of absorption. Also the discerning reader should have no difficulty in separating out the heuristic geometric discourse from the official logical dogma. §10: DEFINITIONS A N D R E M A R K S DEFINITION (Dl). [Divisibility and Prime Ideals]. Let R be a ring. Given elements x, y in R, we say that x divides y (in R) or y is divisible by x (in R) to

§10: DEFINITIONS AND REMARKS

17

mean that y G xR, i.e., y = xz for some z G R; we may express this by writing x\y. Recall that z G R is irreducible (in 7£) means z is a nonzero nonunit in R which is not the product of any two nonzero nonunits in R. Similarly z G R is said to be prime (in R) if z is a nonunit and for all x,y in R we have: 2|(xy) => z|a: or z|j/. Recall that an ideal 7 in R is a prime ideal or a maximal ideal according as R/I is a domain or a field. This is easily seen to be equivalent to the more standard definition, i.e.,

{

an ideal 7 in a ring R is maximal <3> I ^ R and there is no ideal strictly between 7 and 7?

an ideal 7 in a ring R is prime O 7 ^ 7? and for all x, y in R \ I we have xy g' I.

(Ao)

J an element z i n a ring R is prime < I <=$ the principal ideal zR is prime.

Note that a maximal ideal is always prime but not conversely; for instance for a field K, the zero ideal in the univariate polynomial ring K[Y] is prime but not maximal; more interestingly, in the bivariate (two variable) polynomial ring K[X, Y], the ideal generated by X is prime but not maximal since the residue class ring is obviously isomorphic to the univariate polynomial ring 7C[y]. REMARK (Rl). [Division Algorithm]. Given any A = A(X) in the univariate polynomial ring R[X] over a ring R, we call A monic or submonic (in X) if A / 0 and, upon letting a to be the coefficient of Xd in A where d is the degree of A, we have a = 1 or a G U(R) respectively. Assuming A(X) to be submonic, the division algorithm in R[X] says that given any B(X) G R[X] there exist unique q(X) and r(X) in R[X] such that B(X) = q(X)A(X) + r{X) and the degree of r(X) is < d. It follows that the univariate polynomial ring K[X] over a field K is a special ED. Note that the division algorithm in Z says that given any x,y in Z with x 7^ 0 there exist unique q, r in Z such that y = qx + r and 0 < r < \x\. This makes Z a quasispecial ED with N as the quasispecial subset. REMARK (R2). [Long Division or Euclidean Algorithm]. In the ancient Indian literature the following procedure for finding a gcd (= greatest common divisor) of two nonzero elements x\, x to be the euclidean function, let us inductively define qi+2,xi+2 for i = 1, 2 , . . . by saying that if xt $ xi+\R then qi+2 G R and xi+2 G Rx

18

LECTURE LI: QUADRATIC EQUATIONS

are such that Xi = qi+2Xi+i +xi+2 and 4>{xi+2) < (xi+i). This stops after t steps, with (x2) > t £N, when xt+i £ xt+2R- Note that if R is a special ED then the elements qi+2, %i+2 are unique, and if R is a quasispecial ED then they can be chosen uniquely. At any rate, without assuming special or quasispecial, this algorithm enables us to express xt+2 as a linear combination of the elements x 1,0:2, and to express the elements 0:1,0:2 as multiples of 2^+2, i-e., to write o:t+2 = ao:i + bx2, 0:1 = co;t+2, 0:2 = dxt+2, with a, b, c, d in R. Thus the element o: t+2 is a generator of the ideal in R generated by 0:1,0:2, and hence xt+2 is a gcd of 0:1,0:2 in the following sense. For a subset 5 of a ring R (which need not be an ED), by a cd (= common divisor) of S (in R) we mean x £ R such that x\y for all y £ S, and by a gcd of S (in R) we mean a cd x of S such that x is divisible by every cd of S; any gcd of S may be denoted by gcd(S); note that if the ideal in R generated by S is principal then a gcd of S is nothing but a generator of that ideal; moreover, without assuming the ideal SR to be principal, if x £ R is a gcd of S then: y £ R is a gcd of S <=)• x and y are associates of each other. By a cd of a finite or infinite sequence of elements y 1, j/2> • • • in a ring R we mean a cd of S = {2/1,2/2, • • • }, and by a gcd of 2/1,2/2,... we mean a gcd of S = {2/1,2/2, •• • }; again any gcd of 2/1,2/2, • • • may be denoted by gcd(2/1,2/2, • • • )• Although a gcd is determined only up to associates, for euphony we may sometimes call it the gcd. Again assuming R to be an ED, we extend the above procedure of finding a gcd to a finite sequence 2/1,.. •, ys in Rx, with s £ N, by putting ZQ = 0, z\ = 2/1, ^2 = gcd(zi,2/2),23 = gcd(z 2 ,2/3),...,2 s = gcd(z s _i,2/ s ), where the gcd is as found in the above two element procedure, and noting that now clearly zs is a generator of the ideal in R generated by 2/1, • • •, 2/s, and hence zs is a gcd of 2/1, • • •, 2/s • For a finite sequence z/i,..., ys in R we apply this procedure to the sequence obtained by deleting all the zeros from the sequence j / i , . . . , ys, and thereby obtain a generator of the ideal in R generated by j / i , . . . , j/«, which is hence a gcd of j/i,. • •, 2/sREMARK (R3). [ED =» P I D ] . By extending the above method to infinite sequences we can show that ED =>• PID. But this can be shown more easily, although nonconstructively, thus. Given any nonzero ideal 7 in a euclidean domain R with euclidean function (j>, take 0 ^ x £ I having the smallest Rvalue; if there were y £ I \ xR then we could find q £ R and r £ Rx such that y — qx + r and <j>{r) < 4>(x), which would be a contradiction because 0 ^ r £ I. In case R is the univariate polynomial ring K[X] over a field K, for any S C R we put GCD(5) = 0 if S C {0} and otherwise we put GCD(S) = the unique monic generator of the ideal in R generated by S, and for any finite or infinite sequence 2/1,2/2, • • • in R we put GCD(2/i, 2/2, • • •) = GCD({2/i, 2/2, •••})• I n case R is the ring of integers Z, for any S C R we put GCD(S) = the unique nonnegative generator of the ideal in R generated by S, and for any finite or infinite sequence 2/1,2/2, • • • in i? we put GCD(j/i, 2/2, • • •) = GCD({2/i,2/2, •••})• Here GCD stands for the Greatest Common

%10: DEFINITIONS

AND

REMARKS

19

Divisor (in R). Note that in the above two situations, the GCD is a gcd. To make the reference to the ring R explicit we may write gcd^ and GCD^ instead of gcd and GCD respectively. REMARK (R4). [ED => U F D ] . Later on we shall prove PID => UFD [cf. L5§5(Q18.4)]. Here we shall give a direct proof of ED =>• UFD. First let us establish some generalities about UFDs. So let R be a domain. Recall that R is a UFD means (Fl) every nonzero element in R is a product of a unit and irreducible elements and (F2)

the irreducible factors in (Fl) are unique up to order and associates.

It can be shown that if R is a UFD then (F3)

every irreducible element in R is prime.

It can also be shown that if (Fl) and (F3) hold then so does (F2). In other words (Bl)

in a domain R with (Fl) we have: (F2) & (F3).

It is clear that any subset S of a UFD R, and hence any sequence j/i, 2/2, • • • in R, has a gcd; namely if S C {0} then 0 is the only gcd of S, and if S has a nonzero element z then we can write z = uz\x... z|' where u is a unit in R, e i , . . . , e.t are positive integers, and z\,..., zt are irreducibles in R such that no two of them are associates of each other; now upon letting d, to be the largest integer such that zi * divides every element of S, we see that x £ R is a gcd of S iff x = vz^ ... zt* for some unit v. Elements 2/1,2/2, •• • in a UFD are coprime (or relatively prime) means 1 is a gcd of them. To establish some generalities about EDs, recall that R is an ED means it has a euclidean function, i.e., a map (f>: Rx —• N such that for all x,y in Rx we have: (Gl)

y£xR=>

{x) < <j>{y)

and (G2)

y $ xR =>• y = qx + r for some q € R and r € Rx with (j)(r) < <j>(x).

It is easily seen that given any euclidean function <j> : Rx —> N, for all x, y in Rx we have (G3)

x and y are associates => <j>{x) = 4>(y)

and (G4)

y € xR and (j)(x) = (j>(y) => x and y are associates.

20

LECTURE LI: QUADRATIC EQUATIONS

With 0 as above, for all z € Rx by (Gl) we know that 0(z) > 0(1); by induction on (j){z) let us show that z can be expressed as a product of a unit and irreducible elements; for <j>{z) = 0(1) this follows from (G4); so let (z) > 0(1) and assume for all values of <j>(z) smaller than the given one; if z is irreducible then we have nothing to show; otherwise z = z'z" where z' and z" are nonzero nonunits, and by (G4) we get (z') <
every ED satisfies (Fl).

Again with 0 as above, let x, y, z in Rx be such that z is irreducible with xy £ zR and x $ zR; then 1 is a gcd of x, z and hence by (R2) we can write 1 = ax + bz with a, b in R; multiplying this equation by y we get y = axy + byz and hence y G zR. Thus (B3)

every ED satisfies (F3).

By (Bl), (B2), (B3) we see that ED => UFD. REMARK (R5). [Factorization of Integers]. Applying the above Remarks (Rl) to (R4) to the case R = Z we see that: R is an ED, PID, and UFD, with U(R) = {±1}. Every nonzero nonunit element in R is an associate of a unique integer > 1. Every irreducible element in R is an associate of a unique positive irreducible element, i.e., a prime number. Every nonzero element z in the quotient field Q of R has a unique factorization z — ±p\: ... p\ * where e\,..., et are nonzero integers and pi,...,pt are pairwise distinct prime numbers (by convention an empty product is 1); moreover z € R <=> e% > 0 for 1 < i < t. All the nonzero prime ideals in R are maximal, and they are of the form pR where p is a prime number. For any positive integer n the residue class ring S = R/(nR) is clearly a ring of size n; moreover S is a field -^ n is a prime p, and then we call S the Galois Field of p elements and denote it by GF(p). REMARK (R6). [Factorization of Univariate Polynomials]. Applying the above Remarks (Rl) to (R4) to the univariate polynomial ring R = K[Y] over a field K we see that: R is an ED, PID, and UFD, with U(R) = KX. Every nonzero nonunit element in R is an associate of a unique nonconstant monic polynomial. Every irreducible element in R is an associate of a unique nonconstant monic irreducible polynomial. Every nonzero element h = h(Y) in the quotient field K(Y) of R has a unique factorization h = ag*1 ... g\% where a G Kx, e i , . . . , et are nonzero integers, and g\ = gi(Y),... ,gt = gt(Y) are pairwise distinct nonconstant monic irreducible polynomials; moreover h G R •& et > 0 for 1 < i < t. All the nonzero prime ideals in R are maximal, and they are of the form gR where g = g(Y) is a nonconstant irreducible polynomial, which may be chosen to be monic. For any

§i0: DEFINITIONS AND REMARKS

21

nonconstant / = f(Y) G R of some degree n > 0, clearly Kr\(fR) = {0} and hence we may identify K with a subring of the residue class ring L = R/(fR), and then upon letting y to be the image of Y under the residue class map R—>Lwe see that 1) y, • • • > yn~l is vector space basis of L over K and hence [L : K] = n; moreover L is a field <=> / is irreducible. DEFINITION (D2). [Characteristic]. For any ring R there is unique homomorphism Z —•> R. The unique nonnegative generator of its kernel is called the characteristic of R and is denoted by ch(fl). Clearly if R is a domain then ch(R) is either 0 or a prime number. The image of the homomorphism Z —> R is called the prime subring of R. Clearly ch(R) = 0 <=> the prime subring of R is isomorphic to Z; in this case we may identify the prime subring with Z. Also clearly ch(i?) is a prime number p <=> the prime subring is a field; in this case the prime subring is obviously isomorphic to the Galois field GF(p), and we identify the former with the latter. Needless to say that ch(R) = 1 <$ the prime subring is the null ring o R is the null ring. DEFINITION (D3). [Minimal Polynomial]. Let L be an overfield of a field K. Then y £ L is transcendental or algebraic over K according as the kernel of the iiT-homomorphism K[Y] —> L which sends Y to y is zero or nonzero, i.e., according as \K[y] : K) = oo or [K[y] : K] = a positive integer n. In the algebraic case n is the field degree of the field K[y] = K{y) over K, and the unique monic generator of the said kernel is a monic irreducible polynomial of degree n in Y over K; we call this polynomial the minimal polynomial of y/K, i.e., of y over K. Note that for any y G L we always have K[y] C K(y) and hence: y is algebraic over K •& K[y] = K(y) <& [K[y] : K) < oo <^ [K(y) : K) < oo.

DEFINITION (D4). [Least Common Multiple]. In grade school when we learn of cd and gcd we also learn of cm (= common multiple) and 1cm (= least common multiple). To introduce these let it be a ring and let y\,..., ya be a finite sequence of elements in R with s > 0. By a cm oiy\,...,ya (in R) we mean x € R such that x is divisible by j/j for 1 < i < s. By an 1cm of y i , . . . ,ya (in R) we mean a cm of j / i , . . . , ys which divides every cm of j / i , . . . , ys, and we denote it by lcm(j/i,... ,ya). Clearly if \cm(yi,... ,ya) exists then it is unique up to associates. For euphony we may sometimes speak of the 1cm instead of an 1cm. Now assume that it is a UFD. Then we can find a finite number of irreducible elements z\,..., zt in R, no two of which are associates of each other, such that for 1 < i < s we have ?/* = Uiz\iX ... z\H where Ui is a unit in R and en,..., eu are nonnegative integers. For 1 < j < t let lj = max{ejj : 1 < i < s} (with lj = 0 if &ij = 0 for all i), and let y = z± ... ztl. Then clearly y is an 1cm of y\,..., ys. In case R is the univariate polynomial ring K[Y] over a field K (resp: the ring

22

LECTURE LI: QUADRATIC EQUATIONS

of integers Z) we define the LCM (= Least Common Multiple) of y\,... ,ys (in R) to be the unique monic (resp: positive) 1cm of j / i , . . . ,ys, and we denote it by LCM (1/1,..., ys); note that by taking z\,... ,zt to be monic (resp: positive) we get LCM(?/i,..., y3) = y. To make the reference to the ring R explicit we may write lcmij and LCMR instead of 1cm and LCM respectively. Continuing with the assumption that R is a UFD, any element £ in the quotient field of R can be expressed as ( = £/r? with £ G R and r\ G Rx such that the elements £ and 77 are coprime, and we call £/?? a reduced form of £. In case R is the univariate polynomial ring K[Y] over a field K (resp: the ring of integers Z) this becomes unique by requiring 77 to be monic (resp: positive), and so we may call £/?7 the reduced form of (,. REMARK (R7). [Relative Algebraic Closure]. To continue the discussion of (R6) and (D3), let L be an overfield of a field K, and let K' and K" be subfields of L with K c K' c K". It can be shown that then and K / \K" : ^ < 0 ° ° \K" :K'}<0° \ ' -K)
(Hint: if (zi)i
respectively then

J if [K" : .fiT] < 00 then every y G .fiT" is algebraic over K and [the degree of the minimal polynomial of y over K divides \K" : K}. It also follows that (Co)

J if K" is algebraic over K', and .fiT' is algebraic over K, < I then K" is algebraic over .fiT.

Moreover it follows that if K" = K(W) where W C K" is such that (C4)

{ every j / G W is algebraic over .fiT then K" = K\W] and K" is algebraic over if.

Finally it follows that . (Co)

I the set K of all elements of L which are algebraic over K < [is a subfield of L;

we call K the (relative) algebraic closure of K in L; the adjective relative is sometimes used to distinguish this from an (absolute) algebraic closure of K by which we mean an overfield K of K such that K is algebraic over K and every nonconstant univariate polynomial over K has a root in K.

§11: EXAMPLES AND EXERCISES

23

REMARK (R8). [Uniqueness of Splitting Field]. Let L be an overfield of a field K, and let f(Y) be a nonconstant polynomial of some degree n > 0 in Y over K such that / splits completely in L, i.e., f(Y) = ao(Y — a\)... (Y — an) with 0 ^ CLQ S K and c*i,..., an in L; we call L* = K(a\,..., a n ) the splitting field of / over K in L. Let L' be an overfield of a field K' such that there is an isomorphism : K —> K' and / ' splits completely in L' where f'(Y) is obtained by applying (j> to the coefficients of f(Y). Now upon letting a'0 = (ao) we clearly have f'(Y) = a'0(Y - a[) ...(Y — a'n) with a[, •'•• ,a'n in £/. We want to show that there is an isomorphism <j>* : L* —> V* = K'(a[,... ,a'n) such that <j>*(x) = <j){x) for all x £ K. We do this by induction on n. For n = 1 the assertion is obvious because then L* = K and L'* = K'. So let n > 1 and assume for n — 1. Let g(Y) be the minimal polynomial of Qi over K, and let ff'(F) be obtained by applying to the coefficients of g(Y). Then clearly g'(Y) is the minimal polynomial of o^ over K' for some i, and upon relabelling a[,... ,a'n suitably we may assume that i = 1. Now Kx = K{al) and K[ = K'(a[) are root fields of g(Y) and g'(Y) over K and K' respectively, and they are naturally isomorphic to K[Y]/(g(Y)K[Y}) and K'[Y]/(g'(Y)K'[Y]) respectively, and therefore there is an isomorphism (f>\ : K\ —> K[ such that (/>i(ai) = a[ and (x) for all x £ K. Let / i ( y ) = ( y - a 2 ) . • • (Y-an) and /{(y) = (Y-a'2)... (Y-a'n). Then h(Y) is a polynomial of degree n — 1 over /fi, and /{(y) is obtained by applying i to its coefficients. Also clearly L* and L'* are the splitting fields of / i and f[ over i^x and K[ in L and V respectively. Therefore by the induction hypothesis there exists an isomorphism (j>* : L* -> V* such that 0*(a;) = <j>x{x) for all i G i f i . §11: EXAMPLES A N D EXERCISES EXAMPLE (XI). [Special Permutations]. A permutation, say of the integers {1,2,. . . , 6 } , is a rearrangement like (\ 2 3 4 5 6 \ \ 2 4 6 1 5 37 where we send 1 to 2, 2 to 4, 3 to 6, 4 to 1, 5 to 5 and 6 to 3. Let us call the above permutation a. Let us consider another permutation, say r, given by A 2 3 4 5 6\ \ 4 3 6 1 2 5J ' To multiply r by a, we first apply a and then follow it by r. Thus T<7

_ (\ 2 3 4 5 6\ / l 2 3 4 5 6 \ _ A 2 3 4 5 6\ ~ V4 3 6 1 2 5 / V2 4 6 1 5 3y \ 3 1 5 4 2 67

because: er sends 1 to 2 and r sends 2 to 3, a sends 2 to 4 and r sends 4 to 1, a

24

LECTURE LI: QUADRATIC EQUATIONS

sends 3 to 6 and r sends 6 to 5, a sends 4 to 1 and r sends 1 to 4, a sends 5 to 5 and r sends 5 to 2,CTsends 6 to 3 and r sends 3 to 6. Doing it the other way round we get <7T

_ / l 2 3 4 5 6\ / l 2 3 4 5 6^ _ / l 2 3 4 5 6\ ~ V2 4 6 1 5 3 / \ 4 3 6 1 2 5 / ~ \l 6 3 2 4 5 /

which illustrates that in general multiplication is not commutative. The above permutation a can be written as a product of disjoint cycles

where the cyclic permutation (124) is the permutation which sends 1 to 2, 2 to 4, and 4 to 1 (and leaves all the other numbers unchanged) , i.e., in "long form"

and similarly, the cyclic permutation (36) sends 3 to 6 and 6 to 3, and, finally, the cyclic permutation (5) sends 5 to 5. Since (5) leaves all the other numbers unchanged, so indeed (5) is simply the identity permutation (I 2 3 4 5 6\ ~ \1 2 3 4 5 6j •

[b)

The cyclic permutation (124) is a cycle of length 3. A cycle of length 2, such as (36), is called a transposition. The cycles (124), (36) and (5) are disjoint means no two of them have any number in common. EXAMPLE (X2). [General Permutations]. A permutation a of a finite set, say the set of integers { 1 , 2 , . . . , n}, i.e., a £ Sn, is written as / 1

2

•••

n

\

V ( l ) <x(2) • • • a(n)J or more generally as (

%\

\a(ii)

h

•••

in

\

a(i2) ••• cr(in)J

where i\, i2, • • •, in is any labelling of 1,2,..., n. Thus, by nipping the representation of a we get 1=(a(l)a(2).-.a(n)\

\

1

2

•••

n J '

We can chooseCT(1)in n different ways, and then we can choose
%11: EXAMPLES AND EXERCISES

25

\Sn\ = n\, i.e., the symmetric group of degree n has size n\. A permutation on n letters is called a permutation of degree n, to remind us of the fact that an element of the Galois group of a polynomial of degree n is a permutation of the n roots. For a positive integer e < n, an e-cycle or a cycle of length e in Sn is a permutation r for which there is a rearrangement i\,... ,in of 1 , . . . ,n such that i"(ii) = i2,T~(i2) = h,... ,r{ie-i) = ie,r(ie) = i i , r ( i e + i ) = i e + 1 , . . . , r ( i n ) = i n ; we denote this by (iii2 . . . ie), or for clarity by (ii,i2, • • •, *e)- As in the above example, any permutation a of { 1 , 2 , . . . , n} can be written as a product of disjoint cycles, say of lengths e\,.,., e^; note that we include cycles of length 1; also note that every i € { 1 , . . . , n} occurs in exactly one of the ft cycles, and we have e\ + • • • + e^ = n. Now, a cycle of length e can be expressed as a product of e— 1 transpositions, where by a transposition we mean a cycle of length 2; for example (12345) = (12)(23)(34)(45). Therefore, a can be written as a product of e\ + e?. -\ Yeh — h transpositions; the permutation a is called even or odd according as the integer e\ + e^ + • • • + e^ — ft is even or odd. To justify this terminology we have to show that if the integer ei + e2 H 1- eh — ft is even (resp: odd) then any way of writing a as a product of transposition must contain an even (resp: odd) number of them. One way of proving this is described in the following Exercise (El). From the definition of evenness, it follows that the set of all even permutations of 1,2,..., n forms a subgroup of Sn. This subgroup of Sn is called the alternating group of degree n and is denoted by An. It can easily be seen that An < S„ and |5 n /i4 n | = 2 or 1 according as n > 1 or n = 1. EXERCISE (El). Let Sn be the group of all permutations of { 1 , . . . , n} where n is a positive integer. Show that every a G Sn can be expressed as a product of disjoint cycles, and investigate how far this expression is unique; in particular show that the lengths of the cycles are determined by a. Show that a cycle of length e is a product of e — 1 transpositions. Consider the polynomial of degree n(n — 1) in n variables over Z given by

D(X1,...,Xn)=

J]

(Xi-Xj)

i<j in {l,...,n}

and for any a e Sn put D(Xi,...,

Xn)a =

J|

(X^i) -

Xa{j)).

i<j in {l,...,n}

Show that for all a, r in Sn we have

D(x1,...,xn)^

=

(D(xl,...,xnyy

26

LECTURE LI: QUADRATIC EQUATIONS

and for all transpositions 0 we have D(X1,...,Xn)e

=

-D(X1,...,Xn)

and from this deduce that the parity (i.e., the evenness or oddness) of the number of transpositions whose product is a depends only on a and not on the particular product representation. Moreover by defining the signature sgn(o-) to be 1 or —1 according as the said number of transpositions is even or odd we have D{XU .. .,Xny

= sgn(a)D(Xu..

.,Xn).

As we shall discuss later [cf. L5§5(Q32)], the square of the above mentioned polynomial D(Xi,..., Xn) is the modified ^-discriminant of the n-th degree monic polynomial in Y whose roots are Xi,..., Xn, i.e., of the polynomial r i i 1. Deduce that An < Sn- Generalize by showing that a subgroup of index 2 is always normal. EXERCISE (E3). Show that the symmetric group Sn is generated by the transpositions (12), (13),..., (In); hint: if l,r,s are distinct then (rs) = ( l r ) ( l s ) ( l r ) . Show that An is generated by 3-cycles (123), (124),..., (12n); hint: if 1, r, s are distinct then (ls)(lr) = (Irs), and if l,2,r, s are distinct then (12s)(12s)(12r)(12s) = (Irs). EXERCISE (E4). Show that if H for all x,y m G we have xyx~ly~x £ H. The element xyx~ly~l is called the commutator of x, y and is denoted by [x, y}. The subgroup of a group G generated by the commutators of various pairs of elements of G is called the commutator subgroup of G or the (first) derived subgroup of G, and is denoted by [G,G] or G'. We may rephrase this exercise by saying that for any subgroup H of a group G we have: G' < H <$• H 5 then An is simple and Sn is unsolvable. Hint: for any distinct integers a, b, c, d, e upon letting x = (ode) and y = (cbe) we have xyx~1y~1 = (abc), and hence if H < G < Sn with G/H abelian are groups such that G contains all 3-cycles then so does H; therefore Sn is unsolvable; for simplicity of An [cf. L6§6(E43)]. EXERCISE (E6). For 1 < n < 4, find all subgroups of Sn and show that they are all solvable. Simplify the verification by showing that if H < G are finite groups with G solvable then H is solvable. EXERCISE (E7). Prove the existence and uniqueness of the division algorithm

§12: NOTES

27

equations y = qx + r and B{X) = q(X)A(X) + r(X) asserted in (Rl). Hint: division is repeated subtraction, or equivalently, subtraction is slow division. Thus for the first equation, if y > x > 0 then for r = y — x we have 0 < r < y. Similarly for the second equation, if degxB(X) = e > d = degxA(X) > 0 and e d the coefficients of X and X in B(X) and A(X) are b and a respectively then for r(X) = B{X) - (b/a)Xe-dA(X) we have degxr{X) < e. EXERCISE (E8). Give details of the equations xt+2 = ax\ + bx2, x\ — cxt+2, x2 = dxt+2 asserted in the Euclidean Algorithm described in (R2). Hint: the first step x\ = q$X2 + X3 of the algorithm gives us £3 = x\ — q^X2 and combining this with the second step X2 = QAX^ + x± gives us Xi= X2— 94^3 = -QAXI + (1 + 9394)^2, and so on. EXERCISE (E9). Prove claim (Bl) concerning a UFD made in (R4). Prove that an ED has properties (G3) and (G4) stated in (R4), and prove claims (B2) and (B3) concerning an ED made in (R4). EXERCISE (E10). Concerning algebraic field extensions, fill in the details of argument in (R6) and (D3), and prove assertions (CI) to (C5) of (R7). Hint: recall the "rationalization of surds" method from school math according to which 3 + y / 3 _ (3 + ^ ( 2 - 7 3 ) ^ 3 - V 3 2 +A/3 (2 + \ / 3 ) ( 2 - V3) 1

= 3

^

Generalizing this, given an algebraic element y over a field K with \K(y) : K] = n, we let G(Y) be the minimal polynomial of y over K, and then for any A(Y) G K[Y\ and B{Y) e K[Y] \ (G(Y)K[Y\), by division algorithm and long division we find C(Y),H(Y) in K[Y) with degyC(y) < n such that A(Y) = B{Y)C{Y) + G{Y)H{Y), and this gives

proving that 1,«/,..., yn~1 is a if-vector-space basis of K(y) = K[y\. §12: N O T E S NOTE (Nl). [Derivations]. In calculus derivatives are defined in terms of limits. In algebra we define derivatives in terms of formal rules of manipulation. In greater detail, by a derivation of a ring S with values in an 5-module V we mean a homomorphism D : S —> V of the underlying additive groups such that for all x, y in S we have D(xy) = xD(y) + yD(x). For any s e S w e put (sD)(x) = sD(x) for all x € S, and for any other derivation of D' of S with values in V we put (D + D'){x) = D(x) + D'{x) for all x £ S. This makes the set of all derivations of S

28

LECTURE LI: QUADRATIC EQUATIONS

with values in V into an S-module which we denote by Der(5, V). If D(x) = 0 for all a: in a subring R of S then we say that D is an .R-derivation of S or a derivation of S/R. The submodule of all derivations of S/R with values in V is denoted by Derfl(5, V). By a derivation of S we mean a member of Der(S,S), and by an Rderivation of S we mean a member of Der/{(5, S). Now, referring to the section on Polynomials and Rational Functions, for any ring R the formal derivative A i-> Ax is an i?-derivation of iipf], and if R is a field K then this induces a ^-derivation of K(X); likewise, for 1 < j < m, the formal partial derivative A ^ Axd is an R[X\,..., Xj-i, Xj+i,..., X m ]-derivation of R[Xi,..., Xm], and if R is a field K then this induces a K(Xi,..., Xj-i,Xj+i,..., X m )-derivation of K(X1,..., Xm). Note that in the said section, in the defining equation Ax(X) = ^2iAiXi^1 for i = 0 we take iAiXl~l = 0, and in the defining equation of Ax{X\,..., Xm) for ij — 0 we take the corresponding term to be zero. This avoids negative powers. EXERCISE ( E l l ) . Let D be a derivation of a ring S. Show that for all x in the prime subring of S we have D{x) = 0. Show that if S is a field then for any u, v in S with D ^ O w e have (the quotient rule): D(u/v) = (vD(u) - uD(v))/v2. Deduce that if 5 is a domain then D has a unique extension to the quotient field K of S, i.e., there is a unique derivation D' of K such that D'(z) = D(z) for all z € S. EXERCISE (E12). Let A(XU ...,Xm)e R[XU. ..,Xm] where Xu...,Xmbe indeterminates over a ring R, and let x\,..., xm be elements in an overring S of R. Show that for any D in Der^(5, S) we have (the chain rule): D(A(xi,...,xm))=

Yl

AXj(xi,...,Xm)D(xj).

l<j<m

More generally show that for any D in Der(S', S) we have (the more general chain rule): D(A(xi,...,xm))=AD(xi,...,xm)+

^2

AXd(xi,...,xm)D(xj)

l<j<m

where AD(XI, . . . , Xm) 6 R[X\,..., cients of A(Xi,..., Xm).

Xm] is obtained by applying D to the coeffi-

NOTE (N2). [Very Long Proofs]. As said before, CT = the Classification Theorem of finite simple groups has a proof of about ten thousand pages. Actually, the 1980 proof had some gaps which were repaired (?) only in 2004. Now a proof which is that long is not very digestible. In doing mathematics it is good to make clear what your assumptions are, and to base your arguments on results whose proofs you have checked. But results like CT can be useful guides in the search for truth. For applications of CT see Kleidman-Liebeck's book [KLi].

§ 13: CONCL UDING NO TE

29

§13: C O N C L U D I N G N O T E Algebra means what you do not know, call it x. The experimental data will produce an equation in x. Solve it. The value of x so obtained will tell you what you wanted to know.

Lecture L2: Curves and Surfaces §1: MULTIVARIABLE W O R D PROBLEMS In the last lecture I switched from calling the unknown quantity x to calling it Y. I did this to make room for bivariate (= two variable) equations because a word problem may simultaneously have two unknown quantities. For instance, let us again go to the garden and pluck some roses and some gardenias. Then let us go to two more gardens and pluck the same number of roses as well as gardenias as we did from the first garden. From yet one more garden, we pluck only roses, as many as from the first garden, but do not pluck any gardenias. Counting up all the flowers we see that we have collected a total of fifty which is a nice number to make a bouquet. Needing also a garland, we go to four more gardens and from each pluck the same number of roses as well as gardenias as from the first garden, and make a garland of the sixty flowers so collected. How many roses and how many gardenias did we pluck from the first garden? Calling X the number of roses we plucked from the first garden and Y the number of gardenias which were plucked there, we get the two simultaneous equations: AX + 3Y - 50 = 0

and

AX + AY - 60 = 0.

Subtracting the first equation from the second we get Y = 10 and substituting this in the first or the second equation we obtain X = 5. Now the word problem may be more complicated, such as going to as many shops as the number of roses first picked and from each of them buy the same number of roses, and so on. This could lead to equations such as: Y2 + 7XY + 3Y + 2X - 9 = 0

and

AY3 + 2XY2 + 2Y - 11 = 0

or even three variable equations such as Y3 + XZ-5

= 0

and

Y2 + AX2 - 4 = 0.

Of course, we need not always pick flowers. We could shoot arrows instead or count the number of elementary particles as they collide in a cyclotron. Or we may compare distances between stars, or what have you. At any rate, such word problems frequently lead to several polynomial equations in several variables. We solve them, or try to solve them, and interpret the found values of the variables as measurements of the physical quantities we started with. To solve these equations we could take recourse to the geometric method initiated by Descartes around 1630. For instance the first two linear equations represent two lines in the (X, F)-plane, and they intersect in the point (5,10). Turning to the second set of equations and plotting them as curves in the (X, Y)-plane, the first equation, which is of degree 2, is a hyperbola and the second equation, which 30

%1: MULT1VARIABLE WORD PROBLEMS

31

is of degree 3, is a cubic curve. By Bezout's theorem they intersect in 2 x 3 = 6 points, giving us six solutions of the two simultaneous equations. The third set of trivariate (= three variable) equations represent surfaces in the (X, Y, 2T)-space. Their degrees being 3 and 2, again by Bezout's theorem they intersect in a space curve of degree 3 x 2 = 6. Note that the second equation is a vertical elliptical cylinder. So what is this theorem of Bezout? Well, Descartes' analytic geometry eventually evolved into algebraic geometry and Bezout's theorem, proved around 1770, was the first substantial theorem of that new discipline. It says that in the plane, a curve of degree m and a curve of degree n always meet in mn points. Likewise, in three-space, a surface of degree m and a surface of degree n always meet in a curve of degree mn. Moreover, in three-space 3 surfaces of degrees di,d2,d^ will meet in ^1(^2^3 points. All this, as we shall see, provided things are counted properly! But learning to count properly may take a while. Quite generally, our word problem may lead to several polynomial equations in several variables. Generalizing the idea of a curve in the plane and a surface in 3-space, for any positive integer N, the points in A^-space satisfying an A'-variate (= N variable) polynomial equation F(Xi,... ,XN) = 0 are said to form a hypersurface, and the degree of F is called the degree of that hypersurface. Again as a curve in the plane is 1-dimensional and a surface in 3-space is 2-dimensional, so a hypersurface in N-space is (N — l)-dimensional, the intersection of two of them is (N — 2)-dimensional, the intersection of three of them is (N — 3)-dimensional, and so on; all this only "in general." If our word problem leads to N hypersurfaces Fi(X\,..., XN) = 0 for 1 < i; < N, then Bezout says that they meet in d\.. .dry points where di is the degree of Fi. If for some M < N we have only the first M hypersurfaces F\ = 0 , . . . , FM = 0 then Bezout says that they meet in an (N — M)-dimensional variety of degree d\... &M- Again provided everything is counted properly. We also have to ensure that the hypersurfaces are "mutually independent"; for instance the equation F% = 0 should not be a consequence of the equations F\ = 0 and F2 = 0, as it would be say if F3 = Fi + F2. Also we must attach precise meanings to a variety, its dimension, and its degree. Very briefly, a variety is the geometric locus consisting of the common solutions of a finite number of polynomial equations in a finite number of variables, its dimension is the "degree of freedom" of a point moving on it and is definable in terms of suitable transcendence degrees, its degree is the number of points in which it intersects a complementary dimensional linear subspace and is definable in terms of suitable field degrees, and so on. To get ready for this proper counting, let us start by considering an algebraic curve C in the (X, F)-plane over a field k. Now C is given by a bivariate polynomial equation f(X, Y) — 0 where

f(X,Y) = a0(X)Yn+ J2 di(X)Yn~i l
32

LECTURE L2: CURVES AND SURFACES

with Oi(X) G fe[X] and 0 ^ a 0 (X) G k[X]. To discuss points in the (X, Y)-plane let us introduce cartesian products. The cartesian product 5 x T of sets S and T consists of all pairs (a, /3) with a £ S and (3 G T. Similarly the cartesian product S\ x • • • x Sm of sets Si,... ,Sm consists of all m-tuples ( 7 1 , . . . , 7 m ) with 74 G Si. If 5i = • • • = Sm — S then S\ x • • • x Sm is denoted by Sm. Now for a typical point (a,/?) in the (X, Y)-plane A;2 we have a £ k and /? G k; this point is on C if / ( a , /3) = 0. Assuming / to be irreducible in k[X,Y], and projecting vertically onto the X-axis, above each point X = a, in general there lie n points ( a , / 3 j ) , . . . , (a,@n) on the curve C. The degree of C, i.e., the (X, F)-degree of / , could be greater than the covering degree n, i.e., the Y- degree of / . To ensure that in general / ( a , Y) has n roots 0i,..., /3n in k, we need to assume that the field k is algebraically closed, which means every nonconstant univariate polynomial over k has a root in k. For instance the complex field C is algebraically closed. This used to be called the fundamental theorem of algebra, though it may be argued that it is neither fundamental nor a theorem of algebra. At any rate, most of its proofs involve analysis and topology. Many of them were given by Gauss between 1800 and 1840. Then around 1880, Kronecker constructed an algebraic closure of any field in a purely algebraic manner. Here by an algebraic closure of a field K we mean an algebraically closed overfield K of K which is algebraic over K. Sometimes we may call K an absolute algebraic closure of K to distinguish it from the (relative) algebraic closure of K in an overfield L of K by which we mean the subfield of L consisting of all elements of L which are algebraic over K. Indeed, over any field K the method of first constructing a root field of a univariate irreducible polynomial and then using it to construct a splitting field SF(F, K) of any nonconstant univariate F(Y) G K\Y] described in Lecture LI is due to Kronecker. In turn this may be used to construct an algebraic closure K of K. Namely, let us label all the distinct nonconstant monic polynomials in K[Y] as Fi(Y),F2(Y),..., and then after letting Ki+i - SF (FuKi) with Kx = K, let us putK = l)?L1Ki. This is ok if K is countable, i.e., if K is either finite or there is a bijection of N+ onto K, because in both these cases the nonconstant monic polynomials in K\Y] can be labelled as .Fi, F2, But if K is uncountable then what to do? Well, let us try making "longer" sequences. At the end of all positive integers let us put a symbol u>, thus getting the "sequence" 1,2,..., w, where u > i for all i G N+. Then start again with LJ + 1,W + 2,.. .,LJ2+OJ,... ,LJ2 + 2LJ, .. . , 2 w 2 , . . . ,3w 2 ,... and then 3 4 w u u w w , . . . , w , . . . ,w ,w ' + l,w + 2,... ,u) +CJ, ... ,w w ^, and so on to kingdom come! But will the kingdom ever come? Nobody knows! In other words, it is not known if in this "constructive" manner we will ever get beyond a countable set. So we take recourse to the axiom of choice or the well ordering principle or Zorn's Lemma. Let us proceed to describe these three postulates and show their mutual equivalence. The axiom of choice says that given any set of nonempty subsets of a set, it is

§J: MULT1VARIABLE WORD PROBLEMS

33

possible to pick one element from each subset. To reformulate this in terms of maps, for any set T let V{T) = the power set of T, i.e., V(T) = the set of all subsets of T, and let Vx (T) = the restricted power set of T, i.e., V* (T) = V(T) \ {0}. Then the axiom of choice says that given any map 4> : S —> VX(T) from a set S to the restricted power set Vx (T) of a set T, there exists a map ip : S —> T such that for all a; € 5 we have VK^) € x < z, and for all x, y in S we have x < y and y < x •& x = y; we write x < y to mean x < y and x ^ y. A poset (= partially ordered set) is a set with a partial order on it. A lo (= linear order = order) on a set S1 is a partial order on S such that for all x, y in S we have either x < y or y < x. A loset (= linearly ordered set) is a set with a linear order on it. A chain in a poset S is a nonempty subset of 5* which is a loset under the induced partial order. A maximal element of a poset S is an element y of S such that for all x € S we have: y < x => y = x. An upper bound of a nonempty subset R of a poset S is an element y of S such that x < y for all x £ R, and a smallest element of R is an element z of R such that z < x for all x € R. A wo (= well order) on a set S is a po on S such that every nonempty subset R of S has a smallest element; the said smallest element is clearly unique, i.e., it is determined by R; we may denote it by min(fl). A woset (= well ordered set) is a set with a well order on it. For any x =fi y in a woset 5, we have x < y or y < x according as the smallest element of {x, y} is x or y. Therefore every woset is a loset. The well ordering principle says that every set has a well order. A poset has the Zorn property if every chain in it has an upper bound. Zorn's Lemma says that if a nonempty poset has the Zorn property then it has a maximal element. As said above, out of the three postulates, the axiom of choice is self-evident. Out of the remaining two, the well ordering principle is surprising but Zorn's Lemma is more versatile. At any rate, as we shall see in a moment, the three are equivalent. After establishing the equivalence, we shall complete the above proof of the existence of an algebraic closure first by using the well ordering principle and then by Zorn's Lemma. We shall also show that any two algebraic closures K and if of a field K are if-isomorphic, i.e., there is a if-isomorphism K —> K . Consequently we may sometimes speak of the algebraic closure of K, rather than an algebraic closure of K. Now bijections define cardinals or cardinality. Two sets have the same cardinality if there is a bijection between them. For instance, all triads, i.e., sets of size three, define the cardinal 3, or as they used to say in school "new math," the threeness of three. The cardinality of a set S is denoted by |£|. It can be shown that cardinals are linearly ordered by defining l^l < \T\ to mean that there is an injection from 5 to T. The assertion that cardinals are partially ordered under this

34

LECTURE L2: CURVES AND SURFACES

binary relation, is known as the Schroeder-Bernstein Theorem; we can reformulate it by saying that if there are injections S —> T and T —> S then there is a bijection S —> T. To see that this partial order is a linear order, what we need to prove is that given any sets S and T, either there exists an injection from S to T, or there exists an injection from T to S. As bijections between (unordered) sets define cardinals, order preserving maps between well ordered sets define ordinals in the following manner. A bijection

x e R. Let us denote the ordinal of a woset S by ||5||, and for any wosets S and T let us define | | 5 | | < ||T|| to mean that there is an order preserving bijection from S onto a lower segment of T. By using Zorn's Lemma it can be shown that under this relationship, any set of ordinals is well ordered. An ordinal is a limiting or nonlimiting ordinal according as it does not or does have an immediate predecessor, i.e., according as the set of all smaller ordinals does not or does have a maximal element. For instance w, 2u>,..., u)w are limiting ordinals, whereas 1,2,..., w + 1, u> + 2 , . . . , ww + 1 are nonlimiting ordinals. The idea of polynomials in UJ is relevant for constructing the algebraic closure of the meromorphic series field k((X)) in characteristic p ^ 0. So let us proceed to introduce meromorphic series. §2: P O W E R SERIES A N D M E R O M O R P H I C SERIES Writing a polynomial in an indeterminate X over any ring R in ascending powers of X and letting them go to infinity on the right hand side we get a power series in X, and letting them have a finite number of terms on the left, i.e., terms with negative exponents, we get a meromorphic series. We denote the power series ring by -R[[X]], and we denote the meromorphic series ring by R{(X)). We regard R[X] as a subring of i?[[X]], and we regard i?[[-X"]] as a subring of R((X)). More precisely, a meromorphic series A(X) in X over R, i.e., a member of R((X)), has a unique expression

A(X) = YlAiXi

with

AieR

such that Supp(A) is a well ordered subset of Z where the support of A = or the X-support of A = A(X), is defined by putting Supp(A) = Supp x A = {i e Z : At ^ 0}

A(X),

§2: POWER SERIES AND MEROMORPHIC SERIES

35

and we note that: Supp(A) = 0 •& A = 0. We define the order of A, or the X-order of A, by putting ord(,4) = ordxA = min Supp(A) i.e., ,, ,. [ the smallest element of Supp(A) oral A) = < \oo

if A ^ 0 ifA = 0.

As in the case of polynomials, for any other meromorphic series B(X) = J2 BiX*

with

Bi e R

C{X) = A(X) + B(X) =

J2°ixi

the sum

is given by componentwise addition Ci = Ai + Bi and the product

D(X) = A(X)B{X) = ] £ A** iGZ

is given by the "Cauchy multiplication rule"

Dk = 5 3 AiBj. i+j=k

It is this rule which makes us require Supp(A) to be well ordered, because otherwise there may be infinitely many pairs (i,j) with Ai ^ 0 ^ Bj and i + j = k. At any rate, in an infinite summation, by convention we disregard all the zero terms, and so the summation makes sense provided there are only finitely many nonzero terms. The power series ring is defined by putting R[[X}} = {A = A{X) 6 R((X))

: ord(A) > 0}.

As in the polynomial case, the "formal" X-derivative Ax{X) = R((X)) is defined by putting AX{X)

=

dx

' of A{X) in

Y,iAiXi~1

and again this gives an /?-derivation of R((X)). If R is a domain then the order of the product of two nonzero meromorphic series equals the sum of their orders, and hence in particular the product is nonzero. It follows that if R is a domain then so is the meromorphic series ring R((X)) as well as the power series ring i?[[X]].

LECTURE LS: CURVES AND SURFACES

36

For finitely many variables (= indeterminates) Xi,... ,Xm over a ring R, we consider only the power series ring H[[Jfi,..., Xm]], which we regard as an overring of R. A power series in Xi,..., Xm over R, i.e., a member of R[[Xi,..., Xm]}, has a unique expression

A(Xu...,Xm)=

A

h-imX[1...Xi^

E

with

Ah...imeR.

(ii,...,im)eN m

The order of A, or the (Xi,..., X m )-order of A, is the minimum of %i + • • • + im with j4jj.,.jm ^ 0; if A = 0, i.e., if Aiu^im = 0 for all z i , . . . , i m , then the order is taken to be oo; we denote the order of A by ord(A) or ord^x1,...,xm)AFor a n y other power series B(Xu...,Xm)=

Yl

Bil...imX?...X%

with

Bh...imGR

«m)6Nm

(»i

the sum C ( X i , . . . , Xm) = A ( X i , . . . , X m ) + B ( X i , . . . , X m )

=

cv.. im xf i ...x£

E (n,...,i m )eN m

is given by componentwise addition <~'ii...im

— - ^ i i ...im. >

£>ii...iTn

and the product D(X\,...

,Xm)

= A(X\,...

=

,Xm)B(Xi,...

E

,Xm) i

Dh.,.imX ^...Xi-

(«l,...,»m)€N"'

is given by the "Cauchy multiplication rule" i'fei...fc tn =

2^i

•Ai1...imBj1,..jm.

»l+jl=fcl>...,*m+jm = fcm

The "formal" partial derivative AXj {X1,...,Xm)= in R[[X\,..., X m ]] is defined by putting AXj (Xi,...,Xm)=

(Xu^..,xm)

h A i i • • -im X%\ • • • X j ' - 1

22 (»i,...,«m)eN

aA

X%

of

j

^ ^ ^ _^

^

X

j+~1 • • • X%m

m

with the understanding that for ij = 0 the corresponding term in the above summation is zero, and we note that A — i > Ax, gives an R[[Xi,..., X , _ i , X , - + 1 , . . . , Xm]}derivation of R[[Xi,..., Xm]]. If R is a domain then the order of the product of two nonzero power series equals the sum of their orders, and hence in particular the product is nonzero. It follows that if R is a domain then so is the power series ring i?[[-Xi,..., Xm}\. For a finite number of variables Xi,..., Xm over a field K, by

37

%2: POWER SERIES AND MEROMORPHIC SERIES

K((Xi,..., Xm)) we denote the field of meromorphic functions in X\,..., Xm over K. Members of the meromorphic function field K((Xi,... ,Xm)) are "quotients" of power series A{X\,..., Xm)/B{X\,..., Xm) with nonzero B. By the "quotient rule" we extend the above X[[-Xi,... ,Xj~i,Xj+i,...,Xm]]-derivation of the domain K[[Xi,..., Xm}} to a K((Xi,..., Xj-i, Xj+i,..., X m ))-derivation of the field

K((Xu...,Xm)). Formally speaking, for any ring R, we may regard R((X)) as the set of all functions (= maps) from Z to R with well ordered support. At any rate, R((X)) c i? z , where for any sets S and T, by ST we denote the set of all functions :T —> 5 . In case S has a special element e, like the zero element in a ring, by the support of (j> we mean the set of alH € T such that 4>(t) ^ e, and we denote it by Supp((/>). By (5'T)finite let us denote the set of all 4> £ ST such that Supp(0) is finite, and in case T is a poset, by (5 r ) we iiord let us denote the set of all € ST such that Supp(i/>) is well ordered. Note that then R{{X)) = (#z)weiiord, R[[X]] = Rn, and R[X] = (i?N)flniteIn case of a finite number of variables Xi,..., Xm we have -R[[X 1 , . . . , X m ]] = -ft" and R[X\,... ,Xm] = (RN )flnite- In case of R((X)), the function which sends 1 £ Z to 1 S R and every other element of Z to 0 £ R corresponds to X. In case of R[[Xi,..., Xm]}, the function which sends ( 0 , . . . , 0 , 1 , 0 . . . , 0) e N m with 1 in the i-ih spot to 1 € R and every other element of N m to 0 6 R corresponds to Xj. Similarly for R[[X}} and R[X]. To speak formally about quotient fields, we first introduce the concept of an equivalence relation. An equivalence relation on a set S is a binary relation x ~ y which holds between certain pairs (x, y) 6 S2 and which is reflexive: for all x € S we have x ~ x, symmetric: for all x, y in S we have x ~ y => y ~ x, and transitive; for all x, y, z in S we have x ~ y and y ~ z =^ £ ~ z. By putting all the elements which are equivalent to each other in one box, the set S gets divided into a disjoint partition, i.e., a family (= set) of nonempty subsets any two of which have an empty intersection. This family of boxes = equivalence classes is denoted by S/ ~ and is called the quotient of S by ~; the map which sends every x G S to the box containing it is called the natural surjection S —> S/ ~ . Now the quotient field F of a domain E is defined to be the set of all equivalence classes of pairs (u, v) £ E2 with v ^ 0 under the equivalence relation: (u, v) ~ (u',v') <$• uv' = vu'. The equivalence class containing (u, v) is denoted by u/v or ^, and we add and multiply the equivalence classes by the rules ~L + ^ = "i"a+u2t'i and i£ x *a = ajsa. Also we "identify" any u e £ with f e F . This m a k e s F a field, and £ a subring of F . We may write QF(F) for F. Given two finite sets of variables X\,..., Xm and Zi,...,Zd over a ring i?, and given any power series P^(Zi,..., Zj) "devoid" of constant term, i.e.,

pW(Zll...,Zd)=

J; 0'i,.-.,jd)eN<»

^

^

-

^

with

P^.,d£R

38

LECTURE L2: CURVES AND SURFACES

such that

PQ...O

=

®> ^n a n y P o w e r series

A(X1,...,Xm)

=

Ail...imXi^...Xi-

J2

with

Au...imeR

im)eN m

(tl

we can "substitute" P ( i ) for Xt for 1 < I < m to get A(p\Z1,...,Zd)

A(pW(Z1,...,Zd),...,p(m\Z1,...,Zd))

= (JI

jd)eNd

This makes sense because it does not involve infinite sums of coefficients. Indeed, for any i = (ii,...,im) € N m , upon writing

p^(z1,...,zdr...p^mHz1,...,zdy-= with P-j

Yl

p

it.iA'---zdd

• G R, clearly we have P

h-jd

=

°

whenever

jx +

\- j d < ix -\

+ im.

Thus A i-> A ( P ) gives the P-homomorphism 0 : R[[Xi,..., Xm]] —> P [ [ Z i , . . . , Zd]], which we call the substitution homomorphism. Note that by taking d = 0 we get R[[Zi,..., Zd]] = R and by taking pW = 0 for 1 -> A{0,..., 0) = 4>... 0 . T n e P ° w e r series P ^ ^ Z i , . . . , Zd),..., P ^ m ' ( Z i , . . . , Zd) (which are assumed devoid of constant terms) are said to be analytically independent or analytically dependent over P according as ker(0) = 0 or ker() ^ 0. If P ( 1 ) ( Z i , . . . , Zd),..., P ( m ) ( Z i , . . . , Zd) are power series (devoid of constant terms) over a field K such that the substitution map <j> : K[[Xi,.,. ,Xm}] —> K[[Zi,... ,Zd}] is injective, then <j> uniquely extends to an injective K-homomorphism K((X\,..., Xm)) —> K((Zi,..., Zd)) which we again call the substitution map. Considering the geometric series identity (i)

(1-X)(1

+ X + X2 + ...) = 1

in the univariate power series ring P[[X]] over a ring P , and taking any power series P ( Z i , . . . , Zd) in a finite number of variables Z\,..., Zd over P with P ( 0 , . . . , 0) = 0 and substituting it for X we get (l-P(Z1)...,Zd))P'(Z1)...,Zd) = l where

P'(Z1,...,Zd)=

J2 0
P{Zl,...,Zd)i£R[[Zu...,Zd)}.

§3: VALUATIONS

39

If the element Q{Z\,..., Zd) £ R[[Z\, • • •, Zd]] is such that Q(0, • •., 0) is a unit in R then letting P(ZU ...,Zd) = l - Q ( 0 , . . . , 0 ) - 1 Q ( Z 1 , . . . , Zd) we get P ( 0 , . . . , 0) = 0 and Q(Zi,...,Zd) = Q(0,... ,0)(1 - P(ZU... ,Zd)) and therefore by letting Q'(ZU..., Zd) = Q ( 0 , . . . , O ) " 1 ^ ^ ! , . . . , Zd) we get Q(Zlt..., Z d ) Q ' ( Z i , . . . , Zd) = 1. Thus we have shown that if Q ( 0 , . . . , 0) is a unit in R then Q(Z\,..., Zd) is a unit in R[[Zi,..., Zd]]. Conversely if Q(Zi,..., Zd) is a unit in R[[Zi,..., Zd}} then for some Q*(Z1:. ..,Zd) in R[[Z1:..., Zd}\ we have Q(ZU ..., Zd)Q*{Zu. ..,Zd) = 1, and putting Zx = • • • = Zd = 0 we get Q ( 0 , . . . ,0)Q*(0,... ,0) = 1, and hence Q ( 0 , . . . , 0) is a unit in R. Recalling that U(R) is the group of all units in R, we conclude that (u(R[[Z1,...,Zd}])=

{Q(Z1,...,Zd)eR[[Z1,...,Zd}}:

\

Q(0,...,0)£U(R)}.

Applying the above consideration to the power series ring ivT[[Xi,... ,Xm}] in a finite number of variables X\,... ,Xm over a field K, we see that the units in . ^ [ [ X i , . . . , Xm]} are exactly those power series whose order is zero. It follows that in the univariate case, the meromorphic series field K((X)) is a concrete representation of the quotient field of X[[X]], i.e., there is no discrepancy in the two meanings of

K((X)). §3: VALUATIONS Again considering the power series ring if [[Xi,... ,Xm]} in a finite number of variables over a field K, we can uniquely extend the order function from the domain K\[Xi,..., Xm]] to its quotient field K((X\,... ,Xm)) by defining the order, or the (Xi,..., X m )-order, of u/v for any u, v in K[[X\,..., Xm]\ with v ^ 0 by putting oid(u/v) = ovd(x1,...,xm)u/v

= ord(u) — ord(v)

with the convention that ord(0) = oo. Now for all x,y in K((Xi,..., (VI)

Xm)) we have

ord(xy) = ord(x) + ord(j/)

and (V2)

ord(o; + y) > min(ord(x),ord(2/))

with obvious conventions about oo. In particular, for all x in K((X\,..., have (V3)

Xm)) we

ord(:r) = o o « i = 0.

The second inequality also holds for all x,y in i?[[-X"i,... , X m ] | for any ring R. To make this transparent it is best to express every power series as a sum of

40

LECTURE

L2: CURVES AND

SURFACES

homogeneous polynomials by "collecting terms of like degree." In greater detail, a polynomial H(X1,...,Xm) = ^fr<1...imX*l...X^

with

Hh...im£R

is homogeneous of degree j if for all ( i i , . . . , im) £ Supp(H) we have i\ -\ \-im = j ; note that the zero polynomial is regarded as homogeneous of every degree. Now a power series

A(X1,...,Xm)=

A

Y,

n-imX^...X^

with

4..,mefi

(ti,...,tm)€Nm

can uniquely be written as A(X\,.. .,Xm)

=

2_^

Aj(Xi,...,Xm)

0<j
where the polynomial

Aj(X1,...,Xm)=

J2

Ail...imX?...X%

is homogeneous of degree j . Given any other power series B(X\,...

,Xm)

=

2_^ Bj(X\,...

,Xm)

0<j
where Bj (X\,..., Xm) e R[Xi,..., Xm] is homogeneous of degree j , assume A ± 0 ^ B and let ord(A) = a and ord(B) = b. Then A(Xi,...,

Xm) — Aa(Xi,...,

Xm) + terms of degree > a

B(Xi,...,

Xm) = Bb(Xi,...,

Xm) + terms of degree > b

and

with Aa{Xu.

..,Xm)^0^

Bb(Xu...,

Xm).

It follows that A(Xi,.

..,Xm)

+ B(Xi,..

.,Xm)

f

Aa(Xi,..., Xm) + terms of degree > a if a b if a > 6 = < Bb(Xi,..., Aa(Xi,.. .,Xm) + Ba(Xi,..., Xm) + terms of degree > a if a = b

§3: VALUATIONS

41

and hence ord(A + B) > min(a, b) with equality in case of a ^ b. Also note that if R is a domain then A(Xi,...,

Xm)B(Xi,...,

= Aa (Xi,...,

Xm)

Xm)Bb(Xi,...,

Xm) + terms of degree > a + b

and 0 ^ Aa(X\,... ,Xm)Bb(Xi,... ,Xm) G R[X\,... ,Xm] is homogeneous of degree a + b, and hence ord(AB) = a + b. To generalize the idea of meromorphic series, let G be an ordered abelian group, i.e., G is an additive abelian group which is also an ordered set such that for all x, y, x', y' in G we have: x < y and x' < y' => x + x' < y + y'. For instance G = Z or Q or R. Or G could be the set M'd' of lexicographically ordered d-tuples of real numbers r = (n,. • • ,rd.),s = (s\,..., Sd), • • •, where lexicographic order means: r < s •*=> either n — Si for 1 < i < d, or for some j with 1 < j < d we have T\ = Si for 1 < i < j and rj < Sj. Or G could be a subgroup of any of these. Inspired by properties (VI) to (V3), we define a valuation of a field K to be a map v : K —* G\J {oo} such that for all x,y m K we have (VI) to (V3) with ord replaced by v, with the conventions about oo that: for all g G G we have g < oo and g + oo = oo, and also 00 + 00 = 00. We call G the assigned value group of v, and by the value group of v we mean the subgroup of G given by :xGKx}.

Gv = {v(x) By the valuation ring of v we mean the ring Rv = {xGK

: v(x) > 0}

and we note that this is clearly a subdomain (= subring which is a domain) of K which is its quotient field. We say that v is trivial over a subfield k of K, or that v is a valuation of K/k, to mean that x G kx =>• v(x) — 0 and we note that this is equivalent to saying that k C Rv, i.e., k is a subfield of Rv. Now the ord function is obviously a valuation v of K{(X\,... ,Xm))/K with GV = G = Z

and

K[[XU . . . ,Xm]] C Rv.

Note that in case of m = 1 the above displayed inclusion is an equality. To construct a valuation having a given ordered abelian group G as its value group, and at the same time to generalize the idea of the univariate meromorphic series field K((X)) over any given field K, let K((X))G

= {A&KG

: Supp(A) is well ordered}

where the support of A G KG, denoted by Supp(j4), is defined by putting Su P P (A) = {g G G : A(g) ± 0}.

42

LECTURE L2: CURVES AND SURFACES

For A, B in

the sum is componentwise, i.e., for any g G G we have

K((X))Q,

(A + B)(g) = A(g) + B(g) and the product is by the "Cauchy rule" which says that for all g G G we have

(AB)(g) = J2

A

0)B(j)

i+j=g

and the well ordering enters in showing that (V4)

\{(h3)£G2

:i+j

= g with i eSupp(A) and j GSupp(B)}

1 is a finite set. We also have (V5)

Supp(yl) and Supp(B) are well ordered => Supp(AB) is well ordered.

Note that A = 0 means A(g) = 0 for all g G G. Also A = 1 means A(0) = 1 and A(g) = 0 for all other g G G. For any A G i f ( ( X ) ) G we define ,. ,, fmin SuppfA) ord(A) = < \oo

ifA^O ifA = 0

and we note that then for all x,y in K((X))G we have (VI) to (V3). Mimicking the proof of (ii) using (i) in §2, we can show that (V6)

0 ^ €

K((X))G

=* AA' = 1 for some A' €

K{{X))G.

Thus K((X))G is a field, and ord is a valuation of K{{X))a/K with value group G. Concretely speaking, a typical member of K((X))G can be written as

A{X) = Y,AiXi where we are writing Ai for the previous A(i). Assuming G ^ 0 and designating a suitable nonzero element of G as 1, the function which sends I £ G to 1 £ K and sends every other element of G to 0 S R now corresponds to X. For instance, if G = ZM or QM or R ^ , then designate ( 1 , 0 , . . . , 0) to be 1 G G. We also put K[[X]]G = {A(X) G A"((X)) G : oid(A) > 0} and we note that iC[[X]]c is the valuation ring of the ord valuation of K((X))GTaking Q to be the exponent group, i.e., taking Q for G, for any n G A^+ we put Qn = {j G Q : nj G Z} and for any prime p we put Qn

= {j G Q : njp r G Z for some r G N + }

§4: ADVICE TO THE READER

43

where r may depend on j , and we let ^(PO)newt = U ne N + {A(X) G K((X))Q

: Supp(A) C Q„}

and in case K is of characteristic p we let K((X))gnewt

= UneN+{A(X)

These are clearly subfields of K((X))q, of the domains

K[[X]] newt

G X((X)) Q : Supp(A) C Q n , p }. and they are the respective quotient fields

—

{A(X) G tf((A-))„ewt : ord(A) > 0} and K[[X}]enewt

= {A(X) G K((X))gnewt

: ord(A) > 0}.

We shall give several proofs of the following theorem which dates back to 1660. NEWTON'S THEOREM (Tl). Let k be an algebraically closed field of characteristic 0. Then k((X))newt is an algebraic closure of k((X)). We shall also prove the following generalization of Newton's Theorem. GENERALIZED NEWTON'S THEOREM (T2). Let k be an algebraically closed field. Then k((X))q is an algebraically closed overfield of k((X)). If k is of characteristic p > 0, then k((X))gnev^ is an algebraically closed overfield of k((X)), which is strictly bigger than the algebraic closure of k((X)); more precisely, k((X))gnewt contains elements whose supports are such that their ordinals are not polynomials in u, but the ordinal of the support of any element which is algebraic over k((X)) is a polynomial in w. §4: A D V I C E TO T H E R E A D E R The next section on Zorn's Lemma and Well Ordering, spread over seven pages, is rather heavy logical stuff. This inspires me to verbatim reproduce the following lines of advice from the preface of the Second Part of the great and famous Algebra Text-Book of George Chrystal written in 1889 [Chr]: "I may here give a word of advice to young students reading my second volume. The matter is arranged to facilitate reference and to secure brevity and logical sequence; but it by no means follows that the volume should be read straight through at a first reading. Such an attempt would probably sicken the reader both of the author and the subject. Every mathematical book that is worth anything must be read "backwards and forwards," if I may use the expression. I would modify the advice of a great French mathematician and say, "Go on, but often return to strengthen your faith." When you come on a hard or dreary passage, pass it over;

44

LECTURE L2: CURVES AND SURFACES

and come back to it after you have seen its importance or found the need for it further on." So, in a first reading, without loss of understanding, the reader may prefer to immediately proceed to the section after next where I shall give a Utilitarian Summary of the logical stuff. §5: ZORN'S LEMMA A N D WELL O R D E R I N G We want to show the equivalence of the three postulates: Axiom of Choice, Well Ordering Principle, and Zorn's Lemma. Let us abbreviate these as (AC), (WO), and (ZL), and restate them. Let us also include an Alternate Version of Zorn's Lemma which we abbreviate as (ZL*). (AC). Given any map (f> : S —• Vx (T) from a set S to the restricted power set P (T) of a set T, there exists a map ip : S —* T such that for all x £ S we have ip(x) £ 4>{x). (WO). Every set has a well order. (ZL). If a nonempty poset has the Zorn property then it has a maximal element. Recall that a poset has the Zorn property means every chain in it has an upper bound. (ZL*). If a nonempty poset has the weak Zorn property then it has a maximal element, where a poset has the weak Zorn property means every woset in it (= well ordered subset of it) has an upper bound. x

First let us prove the following: LEMMA (T3). Let P be a set of subsets of a poset S. Assume that every member of P is well ordered (in the partial order induced from S). Also assume that P is a loset (in the "lower segment" partial order on P according to which A < B in P means A is a lower segment of B). Let Q = 1)R€PR. Then Q is well ordered (in the partial order induced from S). PROOF. Given any nonempty subset Q' of Q, we can take R € P such that R n Q' is nonempty. Let x = min(i? n Q'). We claim that then x = min(Q'). So let there be given any y £ Q''. If y £ R then obviously x < y (in the partial order of S). If y £ R then y € Ri for some i?i £ P; since y £ Ri \ R, Ri cannot be a lower segment of R; since P is a loset, R must be a lower segment of the woset R\; therefore, since y £ R\\R and x £ R, we must have x < y. Now we are ready to prove the: EQUIVALENCE THEOREM (T4). (AC) =*> (ZL*) => (ZL) =• (WO) =>(AC).

§5: ZORN'S LEMMA AND WELL ORDERING

45

PROOF OF (AC) =>• (ZL*). Assume (AC), and let 5 be any nonempty poset in which every woset has an upper bound. Suppose if possible that S has no maximal element. By taking <j> to be the identity map VX(S) —> PX(S) in (AC), we get a map V : VX(S) - • S such that for all 0 ^ R C S we have i>{R) £ R. Let W(S) be the set of all nonempty well ordered subsets of 5, and for every R £ W(S) let R* be the set of all upper bounds of R which do not belong to R; by assumption R has an upper bound and S has no maximal element, and hence we can take an element in 5 which is strictly bigger than that upper bound thereby getting an upper bound not in R; thus R* is nonempty and so tp(R*) £ R*. For every y £ S let L(y) = {x £ S :x (S). Let V{S) be the set of all R £ W(S) such that: (i) min(i?) = z and (ii) for all z ^= y £ Rv/e have il>{L{y,R)*) = y. Clearly {z} € V(S) and hence V(S) ^ 0. We claim that V(S) is a loset (in the "lower segment" partial order). To see this, given any Ri and R2 in V(S) let Ro = {y £ Ri D R2 : L(y,Rx) = L(y,R2)}. Then z £ R0 and hence RQ T^ 0. Clearly RQ is a lower segment of Ri as well as R2. Suppose if possible that Ro ^ Ri for 1 < i < 2; then upon letting yt = min(i?j \ Ro) we get i?o = L(yt, Ri) for 1 < i < 2; consequently j/i = V(i(2/i,i?i)*) = ^((-Ro)*) = ip{L{y2,R2)*) = 2/2, and hence yi = y2 £ Ro which is a contradiction. Therefore either Ro = -Ri or Ro = R2, and hence either JJi is lower segment of R2 or i?2 is lower segment of R\. This proves the claim. Now, upon letting T = L)R^V(S)^-: by the above lemma we get T £ W(S) and from this it follows that T £ V(S). Consequently, upon letting f = T U {ip(T*)}, we see that f £ V(S) with f <£ T which is a contradiction. Therefore S1 must have a maximal element. Thus (AC) => (ZL*). PROOF OF (ZL*) =* (ZL). Obvious. PROOF OF (ZL) => (WO). Assume (ZL), and given any set S let W(S) be the set of all well ordered subsets of S. Note that W(S) contains a subset of S as many times as the ways it can be well ordered. Put the "lower segment" partial order on W(S), i.e., Ri < R2 in S •$=> R\ is order isomorphic to a lower segment of R2. Given any chain P in W(S), upon letting T = Dn^pR and upon putting the partial order on T induced by the partial orders of the various R £ P, we see that T £ W(S) and R < T for all R £ T, i.e., T is an upper bound of P. Therefore by (ZL), the set W(S) has a maximal element T. If T ^ S then by taking y £ S\T and letting T = T U {y} and putting the same partial order on T as T with the understanding that x < y for all £ £ T, we get T £ W(S) with T < T which is a contradiction. Therefore f = S. Thus (ZL) => (WO). PROOF OF (WO) =^(AC). Assume (WO), and put some well order on T; now given any 4> : S —> V*(T) as in the statement of (AC), for every x £ S let il)(x) = min(0(i)). Thus (WO) =*• (AC).

46

LECTURE LZ: CURVES AND SURFACES

REMARK (Rl). The above theorem indirectly tells us that (ZL) «• (ZL*). Of this the implication (ZL*) =>• (ZL) is obvious. Here is a direct proof of the implication (ZL) => (ZL*). Given a poset S, by a weakly cofinal subset S we mean a subset R such that there is no y € S for which we have x < y for all x € R. Let W(S') be the set of all subsets of 5 which are well ordered (in the partial order induced from S). In the "lower segment" partial order, clearly W(S) has the Zorn property because the union of any chain is clearly an upper bound for it. Therefore, assuming S to be nonempty and noting that any singleton subset {y} with y £ S obviously belongs to W(S), by (ZL) we get a maximal element T in W(S). If T is not weakly cofinal in S then by taking y € S\T such that x < y for all s e T and letting f = T U {y} we get f € W(S) with T < f which is a contradiction. Therefore T is weakly cofinal in S. It follows that if S has no maximal element then T has no upper bound in 5. Thus (ZL) =» (ZL*). REMARK (R2). In (Rl) we have shown that: (ZL) => every poset has a weakly cofinal well ordered subset. REMARK (R3). Given a poset 5, by a cofinal subset of S we mean a subset R such that for every x E S we have x < y for some y £ R. As a variation of (Rl) and (R2), we can show that: (ZL) => every loset has a cofinal well ordered subset, and we can use this implication to prove the implication (ZL) => (ZL*). REMARK (R4). [Existence of Algebraic Closure by Well Ordering]. The usual mathematical induction is based on the fact that the positive integers are well ordered. Similarly transfinite induction is based on well ordered indexing sets, and so in effect on the well ordering principle (WO). For instance we can prove the existence of an algebraic closure of a field K, which need not be countable, thus. Let the set S of all nonconstant monic polynomials in K[Y] be indexed by a well ordered indexing set I and write them as Fi(Y) with i £ J; for instance we could take I = S and well order it by the well ordering principle. Let K\ = K with 1 = min(i'). For any 1 ^ j e J, having denned Ki for all i < j in 7, put Kj = SF(Fj,UiSi with i<jKi). Now K = Ujg/Xj is an algebraic closure of K. If this sketch of a proof is not very convincing, because of the problems such as where to take the two unions, the reader is referred to the original source of this proof which is the 1910 Crelle Journal paper of Steinitz reprinted as a book [Ste] by Chelsea. In Remark (R6) we shall give a (perhaps more convincing) proof using Zorn's Lemma. But first in Remark (R5) we shall use Zorn's Lemma to show the existence of prime ideals and maximal ideals. REMARK (R5). [Existence of Maximal Ideals by Zorn's Lemma]. Let I be an ideal in a ring R. Recall that I is a maximal ideal if the residue class ring R/I

§5: ZORN'S LEMMA AND WELL ORDERING

47

is a field, or equivalently if I ^ R and there is no ideal between I and R other than these two. Assuming I ^ R, let 5 be the set of all ideals in R which contain I but are different from R; in the partial order given by inclusion, the union of any chain is an upper bound for it, and hence by Zorn's Lemma S has a maximal element M. Obviously M is a maximal ideal in R. REMARK (R6). [Existence of Algebraic Closure by Zorn's Lemma]. Alternatively, we can use Zorn's Lemma (ZL) to prove the existence of an algebraic closure of a field K thus. Let I be the set of all nonconstant monic polynomials in K[Y}. For any F = F(Y) G I, let D(F) be the Y-degree of F, let aF>1,..., aF KF given by (j)F(XF]j) = aFj for 1 < j < D(F); since the image of F is a field, its kernel MF is a maximal ideal in PF. Now consider the polynomial ring Qj in the infinitely many variables {XFj : F £ I and 1 < j < D(F)} over K. Note that the polynomial ring R[{Xi : I G L}] in an infinite number of variables {Xi : I G L) over a ring R is constructed in an obvious manner, and it equals the union of the polynomial rings U-R^X/j : h G H}} taken over all finite subsets H of L; thus each polynomial involves only a finite number of variables; any two polynomials together involve only a finite number of variables and so they can be added and multiplied; finally, if W = {xi G S : I G L} is any subset of any overring S of R then Xi H-> xi gives us an i?-homomorphism R[{Xi : I G L}] —> S (called the substitution map) whose image is the subring R[W] of S. In our case, for every F G / , the ring PF is a subring of the ring Qj. Moreover, we have the partition {(F,j) : F G / and 1 < j < D(F)} = UFei{(F>J) • 1 < J < D(F)}. The following Lemma can be proved easily: LEMMA (T5). If L = U f f e f i if is a partition of a nonempty set L (where ft is a set of pairwise disjoint nonempty subsets of L), if RL = R[{Xi : I G L}] is the polynomial ring in indeterminates {Xi : I G L} over a field R, if for each H G ft we are given a nonunit ideal Jfj in the polynomial ring RH = R[{Xh : h G H}], and if JL is the ideal in RL generated by all the ideals JH as H varies over O, then JL is a nonunit ideal in RL; [cf. L6§6(E44)]. In our situation, by (T5) we see that the ideal in Qj generated by all the ideals MF as F varies over I is a nonunit ideal in Qi, and hence by (R5) there exists a maximal ideal Mi in Qi such that for every F G I we have Mp C M/. Clearly Mi C\K = 0 and hence we may identify K with a subfield of the field K = QI/MI. For every F e / w e must have Pp D Mi = Mp and hence we may identify Kp with a subfield of K. Now K contains a splitting field of F over K for every F G I. Also K = K[{apj : F G I and 1 < j < D(F)}} and hence K is algebraic over K.

48

LECTURE LS: CURVES AND SURFACES

Therefore K is an algebraic closure of K. REMARK (R7). [Uniqueness of Algebraic Closure by Zorn's Lemma]. Let L be an overfield of a field K, and let 7 be a set of nonconstant polynomials in Y with coefficients in K such that every / = f(Y) £ I splits completely in L, i.e., upon letting n(f) to be the F-degree of / and F = F(Y) to be the monic associate of / , we have (Wl)

F{Y) =

(Y-aftl)...(Y-aiMf))

with a / , i , . . . , <Xf,n(f) m J->. We call (W2)

L* = K({af,j

: / G / and 1 < j < n(/)})

the splitting field of I over K in L; note that if 7 = 0 then L* = K; also note that if I is the set of all nonconstant polynomials in Y with coefficients in K then L* is an (absolute) algebraic closure of K as well as the (relative) algebraic closure of K in L. Let V be an overfield of a field K' such that there is an isomorphism : K -> K' and every / ' = f'(Y) £ V splits completely in V where / ' c K'[Y] is obtained by applying (j> to the coefficients of the various members of / . Let V* be the splitting field of V over K' in L'. We want to show that there is an isomorphism cj>* : L* —> V* such that 4>*{x) = (j>(x) for all x G K. For any J c I let J' C / ' be obtained by applying (j) to the coefficients of the various members of J, and let Lj and L'j, be the splitting fields of J and J' over K and K' in L and L' respectively. Let A be the set of all pairs (J,<j>j) where J d I and cf>j : Lj —> L'j, is an isomorphism such that 4>j(x) = j0{x) =
J s = [J J

(W3)

(./,<MeB

it can be shown that

{

there is a unique isomorphism jB : LjB —> L ^ such that for all (J,(j)j) £ B and x £ Lj we have (j>jB{x) = j{x)

and moreover (W5)

{JB^JB)

is a n upper bound of-B in A

Therefore by Zorn's Lemma, A has maximal elements. Finally by L1(R8) we see that (W6)

(Ji, 4>Ji) is maximal in A => J\ = I.

§5: ZORN'S LEMMA AND WELL ORDERING

49

REMARK (R8). [Existence of Vector Space Bases and Transcendence Bases by Zorn's Lemma], We shall now show that any vector space has a basis and any two of its bases have the same cardinality (which is called the dimension of the vector space). At the same time we shall show that any overneld has a transcendence basis and any two of its transcendence bases have the same cardinality (which is called the transcendence degree of the overneld). So let if be a field and let L be either a vector space over K or an overneld of K. Let H ^ b e a subset of L. Just for a few minutes, in the vector space case (resp: overfield case): let us call W independent if every finite subset of W is linearly (resp: algebraically) independent over K, and let us call W generating if L coincides with KW (resp: if L is algebraic over K{W)); in both cases call W a basis if it is independent and generating; given any other subset U of L, let us say that U is dependent on W if every u e U belongs to KW (resp: is algebraic over K(W)). It is easy to show that I W is a basis < ^

(W7)

<=> it is a minimal generating set •£> it is a maximal independent set

where the minimality means that W is a generating set but there is no generating set W with W' C W and W' ^ W, and the maximality means that W is an independent set but there is no independent set W' with W C W' and W ^ W'. It is also easy to see that (W8)

W is generating =>• there exists a basis W' with W' C W.

Obviously L is generating and hence by taking W = L in (W8) we get the existence of a basis. To prove equicardinality, assuming IF to be a basis and given any other basis U, let A be the set of all triples {U',W',^') where U' C U, W' c W, and bijection <j>' :U' —> W', are such that (W \ W') U U' is a basis. Now A becomes a poset by declaring (U',W',(j)') < (U",W",(j)") to mean that U' C U", W' c W", (j)"(U') = W, and ^"(u) = 4>'(u) for all u € U'. By taking U' = 9 = W and ' to be the obvious map, we see that A is nonempty. Moreover, given any chain B in A, by taking

UB=

(J (U',w,4>')£B

U' and

WB =

(J

W

(U',w,4>')eB

it can be shown that ,TTT„. (W9)

I there is a unique bijection &B • UB —> WB such that < \for all ({/', W, ') G B and u G V we have B{u) = '(u)

and moreover (W10)

(UB, WB, 4>B) is an upper bound of B in A.

50

LECTURE

L2: CURVES AND

SURFACES

Therefore by Zorn's Lemma, A has maximal elements. Finally it can be shown that (Wll)

([/*, W*,*) is maximal in A => U* = U and W* = W.

This completes the proof. REMARK (R9). [Linear Order on Cardinals by Zorn's Lemma]. For the cardinals | 5 | and \T\ of sets S and T, we recall that | 5 | = \T\ means there is a bijection S —> T, and | 5 | < \T\ means there is an injection S —> T. To prove that this is a partial order, i.e., to prove the Schroeder-Bernstein Theorem, given any injections (/>: S —> T and ip :T —> S we have to find a bijection S —> T. Let f = ip(p and R = ip(T). Then clearly: (*) / : S —> S is an injection and f(S) C i? C S; we claim that (*) implies the existence of a bijection g : S —> R. Obviously ip induces a bijection h : ii —> T, and hence the claim yields the bijection hg : S —* T. To prove the claim, for any x G 5 let /°(a;) = x and / n + 1 (a;) = /(/"(a;)) for all n € N; also let /°°(a;) = {/"(a;) : n G N}, and for any V C S let /°°(V) = U x e v/°°(a;). Let R' = f°°(R \ f(S)), and note that then R' c R. Since / ( 5 ) C R and i?' C R, we get a map g : 5 —> R by putting g(a;) = a; or /(a;) according as a; G R' or a: G 5 \ R'. It can be shown that then (W12)

g is a bijection.

To prove that < is a linear order, given any sets U and W, we want to show that either there exists a bijection of U onto a subset of W or there exists a bijection of W onto a subset of U. To do this let A be the set of all triples (U',W',4>') where U' C U, W C W, and 0' : U' —> W is a bijection. Now A becomes a poset by declaring (U',W',(p') < (U",W","(U') = W, and "{u) = '(u) for all u G C/'. By taking U' = 0 = W and 0' to be the obvious map, we see that A is nonempty. Moreover, given any chain B in A, by taking

UB=

(J (U\W',')eB

tf'

and

WB =

[J

W

(W ,W\4>')€B

it can be shown that (W13)

I there is a unique bijection cf>B '• UB —» W^B such that < [for all ([/', W ,
and moreover (W14)

(C/B, V^BI ^ B ) is an upper bound of B in A.

Therefore by Zorn's Lemma, A has maximal elements. Finally it can be shown that (W15) (U*,W*,*) is maximal in A => either U* = U or W* = W. This completes the proof.

§5: ZORN'S

LEMMA

AND WELL

ORDERING

51

REMARK (RIO). [Well Order on Ordinals by Zorn's Lemma]. For the ordinals | | 5 | | and ||T|| of well ordered sets S and T, we recall that | | 5 | | = ||T|| means there is an order preserving bijection S —> T, and | | 5 | | < ||T|| means there is an order preserving bijection of S onto a lower segment of T. To prove that this is a partial order, given any order preserving bijections <j>: S —• T" and ip : T —> S', where T" and S' are lower segments of T and S respectively, we have to find an order preserving bijection S —* T. Let R = ip(T') and for every x € S let f(x) = ip( R is an order preserving bijection; we claim that (*) implies R= S and / : S —> S is the identity map of S. Since <j> and ip are injections, the claim yields the conclusion that T' = T and hence (p : S —> T is an order preserving bijection. The above claim is clearly subsumed under the following stronger claim which can be proved easily: if / ' : S —> T" and / " : S —> T" are order preserving bijections (W16)

< of a well ordered set S onto lower segments T" and T" of a well ordered set T then we must have T" = 7" and / " = / ' .

To prove that < is a linear order, given any well ordered sets U and W, we want to show that either there exists an order preserving bijection of U onto a lower segment of W or there exists an order preserving bijection of W onto a lower segment of U. To do this let A be the set of all triples ({/', W, (/>') where U' is a lower segment of U, W is a lower segment of W, and <j>' : U' —» W is an order preserving bijection. Now A becomes a poset by declaring (U',W',') < (U",W",(j)") to mean that U' is a lower segment of U", W is a lower segment of W", "{u) = <j>'{u) for all u €U'. By taking U' = 0 = W and
(J

B=

{U',W',')€B

U

'

and

WB=

U

W

'

(U',W',')€B

it can be shown that [ there is a unique order preserving bijection B • UB -* WB (W17) < [such that for all ([/', W, <j>') £ B and u G U' we have 0B(u) = cj>'{u) and moreover (W18)

(f/s, WB, 4>B) is an upper bound of B in A

Therefore by Zorn's Lemma, A has maximal elements. Finally it can be shown that (W19)

(U*, W*, (/>*) is maximal in A =>• either [/* = U or M^* == W.

This completes the proof that < is a linear order. Let us now set up an order preserving bijection between a well ordered set and its lower segments. Namely,

52

LECTURE L2: CURVES AND SURFACES

given any element y in a well ordered set C, clearly {x G C : x < y} is a lower segment of C different from C; conversely given any lower segment L of C different from C, by taking y to be the smallest element of C\L, we get L = {x € C : x < y}; it follows that under the above defined partial order < on well ordered sets, y i—• {x e C : x < y} gives an order preserving bijection (W20)

of any well ordered set C onto the set of all lower segments of C other than C.

Finally let us show that under the above defined partial order < on well ordered sets, any set D of well ordered sets is well ordered. Namely, given any nonempty subset E of D, take C e E, and let E' = {C e E : C < C}. Then by (W20) we see that E' has a smallest element C", and clearly C" is a smallest element of E. §6: UTILITARIAN S U M M A R Y In the material of the previous section through (R3) we have shown the equivalence of the axiom of choice, Zorn's Lemma, and the well ordering principle. In (R4) we have shown that the well ordering principle implies the existence of algebraic closure. In (R5) we have shown that Zorn's Lemma implies that, given any ideal / of any ring R with I ^= R, there exists a maximal ideal M of R with I c M. In (R6), (R7), and (R8) we have shown that Zorn's Lemma implies the existence and (up to isomorphism) uniqueness of algebraic closure, where uniqueness means: given any isomorphism : K —> K' of fields and given any algebraic closures L* and L'* of K and K' respectively, can be extended to an isomorphism of L* onto L'*, i.e., there exists an isomorphism cf>* : L* —• V* such that <j)*{x) = cf)(x) for all x £ K; note that if J ' is the set of all members of if'[y] obtained by applying <j) to the coefficients of the members of a set J of nonconstant members of AT[y], and if Lj and L'j, are the splitting fields of J and J' over K and K' in L* and L'* respectively, then we automatically have (j)*(Lj) = L'j,\ in particular this shows that <j> can be extended to an isomorphism of Lj onto L'j,. In (R8) we have shown that Zorn's lemma implies the existence of vector spaces bases and transcendence bases. In (R9) we have shown that Zorn's Lemma implies that any set of cardinals is linearly ordered. Finally in (RIO) we have shown that Zorn's Lemma implies that any set of ordinals is well ordered. Without loss of understanding, all this may be taken for granted. As an exception, the material of (R8) through display (W8), i.e., the first 21 lines in the discussion of the existence of vector space bases and transcendence bases, should be read. §7: DEFINITIONS A N D EXERCISES DEFINITION (Dl). [Real and Complex Numbers]. Real numbers may be

§7: DEFINITIONS AND EXERCISES

53

defined as equivalence classes of Cauchy sequences of rational numbers, and the complex number field C may then be defined as the splitting field of the quadratic polynomial Y2 + 1 over the real number field R. By analogy with N+, by Q+ we denote the set of all positive rationals; moreover, by Q - > a n d Qo- we denote the set of all nonnegative rationals, negative rationals, and nonpositive rationals respectively; as usual, the absolute value of any r G Q is denoted by \r\, i.e., \r\ = r or — r according as r G Qo+ or r G Q_; by context, this will not be confused with the size | 5 | of a set 5. A sequence x = (xi)i Ne and j > Nc we have \xi —Xj\ < e. This is equivalent to the Cauchy sequence x' = (a^)i in symbols x ~ x', if for every e G Q+ there exists Me G N + such that for all i > Me we have lij — a^| < e. Now R may be defined to be the quotient CQ/ ~ of the set CQ of all Cauchy sequences in Q by the equivalence relation ~ . We "identify" Q with a subset of R by sending any q £ Q to the equivalence class of the Cauchy sequence {Qi)i 0}, G 0 + = G+ U {0}, G_ = {g G G : g < 0}, and Go- = G _ U { 0 } ; also we define the absolute value of any g G G by putting \g\ = g or —g according as g G Go+ or g G G_. In particular this defines the sets R+, Ro+, R~, Ro~, and defines the absolute value \r\ of any r G R. Note that now Z + = N+ and Zo + = N. An ordered abelian group G is archimedean means for all x,y in G + we have nx > y for some n G N+. Clearly R is an archimedean ordered field, i.e., an ordered field whose underlying additive ordered abelian group is archimedean. A sequence x = (zi)i Ne and j > Ne we have |#,- — Xj\ < e; the sequence x is convergent means it converges to a limit £ & G, i.e., for every e G G+ there exists M £ G iV+ such that for all i > M€ we have |£ — Xi\ < e; we indicate this by some standard notation such as xt —• £ as i —• oo or limj-Kjoajj = £. An ordered abelian group is complete means in it every Cauchy sequence is convergent. Clearly R is a complete field, i.e., an ordered field whose

54

LECTURE

L2: CURVES AND

SURFACES

underlying additive group is complete as an ordered abelian group. In the next several Definitions and Exercises we extend the above construction of the reals to embed any ordered abelian group in a complete ordered abelian group. DEFINITION (D3). [Torsion Subgroups and Divisible groups]. By the subgroup of a group G generated by elements x\,X2,--. in G we mean the smallest subgroup of G which contains these elements. The order of an element in a group is the order of the subgroup generated by it. The subgroup of an additive abelian group G generated by all of its elements of finite order is called the torsion subgroup of G; if this is zero then G is said to be torsion free. An additive abelian group G is divisible means for every g e G and n € Z x there is h e G with nh = g. EXERCISE (El). Show that for any elements a, 6 in a torsion free additive abelian group and any nonzero integer n we have: na = nb =>• a = b. Show that any ordered abelian group is torsion free, and hence any ordered field is of characteristic zero. Show that for all x,y in an ordered field we have \xy\ = \x\\y\. Show that the usual order on Q is the only order on it which makes it an ordered field. EXERCISE (E2). Show that for all x, y in an ordered abelian group G we have \x + y\ < \x\ + \y\, and from this deduce that any convergent sequence in G is Cauchy and has a unique limit. HINT: Divide the argument into two cases according as there does or does not exist a sequence of positive elements (ei)i<j £ and Xi —> - f in G => (xi) is Cauchy and £' = £. Upon letting y\ = £ — Xi and r] = £ — £', it suffices to show that: j/j —> 0 and yi —> r\ in G =3> (y$) is Cauchy and r\ = 0. In the first case, we claim that for any e > 0 in G we have e; + e/ < e for some I G N+; namely, since e» —> 0, for some m £ N + we must have e m < e; again since ej —> 0, for some n S N + we must have en < e - e m ; it suffices to take I = m o r n according as e m < en or em > e„. Now y, —> 0 =>• there exists N £ N+ such that for allz > N we have \yt\ < £/, and hence for &\\ i > N and j > n we have \yi — j/j| < e, which shows that (j/j) is Cauchy. Since y, -+ r], by choosing N large enough we can also arrange that for all i > N we have \r] — yi\ < e/, and this yields \v\ < \Vi\ + \v — Vi\ < e; + e; < e, which shows that 77 = 0. In the second case, y* —> 0 =>• there exists J V £ N + such that for all i > N we have y, = 0, because otherwise we can find an infinite sequence of integers 0 < j i < J2 < • • • such that y^ ^ 0 for all i and upon letting e* = (y^ | this would produce a sequence of positive elements {ei)i
§7: DEFINITIONS AND EXERCISES

55

free additive abelian group G, let G* — G x Z x / ~ where the equivalence relation ~ is given by: (g,n) ~ (g',n') •£> n'g = ng', and embed G in G* by identifying every g £ G with the equivalence class containing (g, 1). Define addition in G* by taking equivalence classes in the proposed equation (g, n) + (h, m) = (mg + nh, nm). Note that then G* is a divisible additive abelian group such that for every g~ £ G* we have rig £ G for some n £ Z x . We call G* the rational completion of G. Note that if G is divisible then G* = G. Now assume that G is an ordered abelian group. Note that then the order on G uniquely extends to an order on G* so that G* becomes an ordered abelian group. G is said to be orderwise complete if every Cauchy sequence in G is convergent and has a limit in G. Moreover, G is said to be an orderwise completion of a subgroup H if G is orderwise complete and every element of G is the limit of a sequence all of whose members belong to H. Let G = CQ/ ~ where the equivalence relation ~ on the set CG of all Cauchy sequences in G is defined as in (Dl) with G replacing Q. As in (Dl) and (D2), G is made into an ordered abelian group and G is embedded into it; see (E4) below. Clearly G is an orderwise completion of G, and we call it the ordered completion of G. Also we call G* the real completion of G. Note that G* is archimedean o G is archimedean <=> G is archimedean. Also note that if G is complete then G = G. Finally note that if G is divisible and complete then &=G. In particular if K is an ordered field then its underlying additive group is divisible and hence K* = K; as in (Dl) and (D2) we make this into an ordered field; it is a complete ordered field, i.e., an ordered field whose underlying additive group is orderwise complete. EXERCISE (E3). In (D2) show that the induced relation < on the equivalence classes of Cauchy sequences in Q is a linear order. EXERCISE (E4). In (D4) show that the induced relation < on the equivalence classes of Cauchy sequences in an ordered abelian group is a linear order. EXERCISE (E5). Let G be any nonzero archimedean ordered abelian group. Show that given any g > 0 in G and x > 0 in R, there exists a unique order monomorphism (i.e., a group monomorphism which is order preserving) 0 : G —> M such that 4>(g) = x, and there exists a unique order isomorphism (i.e., a group isomorphism which is order preserving) ijj : G* —> M. such that ijj(g) — x. Moreover, for these maps we always have (h) = tp(h) for all h £ G. EXERCISE (E6). Show that x K+ X2 gives a surjection M -> M 0+ . HINT. The usual method of finding the decimal expansion 1.14142 . . . of \/2 can be explained in terms of decimal expansions of integers by saying that l 2 < 2 < 2 2 ,

LECTURE L2: CURVES AND SURFACES

56

142 < 2 x 102 < 15 2 , 141 2 < 2 x 104 < 1422, 14142 < 2 x 106 < 14152, 141422 < 2 x 108 < 141432, and so on. More generally let n > 1 and d > 1 be any integers, and let y > 0 and i > 0 be any integers. Then clearly there is a unique integer Xi such that xf < yndx < (xi + l)d; Xi is nothing but the n-adic expansion of the largest integer < y 1 / , d n\ Obviously the sequence (xi/nl) is Cauchy and for its limit x in R+ we have xd = y. Any positive rational can be written in the form y/zd where y and z are positive integers, and then we get x/z G R + with (x/z)d = y/zd. Finally, any rj £ R+ can be written as the limit of a sequence (r]j) in Q + and then taking £,• G R + with £d = rjj we get a Cauchy sequence (£,) for whose limit £ € R+ we have £d = 77. EXERCISE (E7). Show that the identity map is the only field automorphism of R. Hint: by (E6) any field automorphism of R is order preserving, and since it must send 1 to 1, by (E5) it must be the identity map. EXERCISE (E8). In contrast with (E7), assuming the fact that C is an algebraic closure of R, show that C has uncountably many field automorphisms. DEFINITION (D5). [Rational and Real Ranks]. In view of the first sentence of (El), any torsion free divisible additive abelian group may clearly be regarded as a Q-vector-space. The Q-vector-space dimension of the rational completion of a torsion free additive abelian group G is called the rational rank of G and is denoted by r(G). Alternatively, r(G) may be characterized as the cardinal of a maximal (= nonenlargeable) Z-linearly independent subset H of G, where independent means that for any finite number of distinct elements x\,...,Xd in H and any integers n\,...,

rid we have: n\Xi + • • • + n^Xd = 0 => n\ = • • • = rid = 0.

Now let G be an ordered abelian group. By a segment of G we mean a nonempty subset H of G such that: g e G and h e H with \g\ < \h\ => g € H. By an isolated subgroup of G we mean a subgroup which is a segment. The next Exercise says that the set S(G) of all nonzero isolated subgroups of G is linearly ordered by inclusion. The order-type of S(G) is called the real rank of G and is denoted by p{G). Two linearly ordered sets are said to have the same order-type if there is an order preserving bijection between them. An ordinal is the order-type of a well ordered set. Examples of nonordinal order-types are the order-types of Z or Q or Q + . We write p{G) € N or p(G) = 00 according as S(G) is finite or infinite. As a comparison, recall that the dimension dim/cV of a vector space V over a field K is the common cardinality of a vector space basis of V/K, and we write dimj<-V 6 N or d i m ^ y = 00 according as the said basis is finite or infinite. Similarly the transcendence degree trdeg/c-^ of an overfield L of a field K is the common cardinality of a transcendence basis of L/K, and we write trdeg/fL € N or trdeg^L = 00 according as the said basis is finite or infinite. In the section on Valuations, for any nonnegative integer d, we introduced the

§7: DEFINITIONS AND EXERCISES

57

ordered abelian group R' d ' as the lexicographically ordered set of all d-tuples of real numbers. Upon replacing K by any ordered abelian group G we generalize this to G' d '. As a further generalization, given any well ordered set T we introduce the ordered abelian group GlT! as the set GT of all functions / : T -> G with componentwise addition and the order defined by setting: / < g •$=> / ^ g and f(i) < g(i) where i is the smallest j £ T with f(j) ^ g(j). EXERCISE (E9). Show that for any isolated subgroups H\,H2 of an ordered abelian group G we have either Hi C H2 or H2 C Hi. EXERCISE (E10). Let G be a nonzero archimedean ordered abelian group; by (E5) this is equivalent to saying that, up to order isomorphism, G is a nonzero subgroup of R; for instance G = K. For any nonempty well ordered set T, describe the nonzero isolated subgroups of C?'T' and show that they are in an order reversing bijective correspondence with T; hint: « * - » { / £ GT : f(j) = 0 for all j < i) is the desired bijection from T to the nonzero isolated subgroups. In particular, for any positive integer d, the ordered abelian group G ^ has exactly d nonzero isolated subgroups. EXERCISE ( E l l ) . Let G be any ordered abelian group. Show that r{G) = 0 <=> p(G) = 0 o G = 0. Show that G is archimedean <=> p(G) < 1. Show that r(G) < 00 =4> p(G) < r(G). Show that r(G) = p(G) = d e N+ => there exists an order monomorphism G —> Q' d ' and an order isomorphism G —» Q ^ . See the Hint in (E12) below. EXERCISE (E12). Show that, given any ordered abelian group G with p(G) = d 6 N+, there exists an order monomorphism G —* R^ and there exists an order isomorphism G* —» M.W. Hint: make induction on d starting with (E5), and use the fact that if H is any nonzero isolated subgroup of G then in a natural manner G/H can be made into an ordered abelian group with p(H) + p(G/H) = d. DEFINITION (D6). [Dedekind Cuts]. Instead of using Cauchy sequences to prove (E5), we can use Dedekind Cuts. So let G be a nonzero divisible archimedean ordered abelian group. A Dedekind cut of G is a pair (L, U) of nonempty subsets of G with U = G\L such that for a l l / e I and u £ U we have I < u and there is no u' e U with U = {u e G : u' < u}. For any t G G we get a Dedekind cut (Lt,Ut) with Lt = {I £ G : I < i) and Ut = {u S G : t < u}. Let DG be the set of all Dedekind cuts of G. It can be shown that G is complete O « H (L t , Rt) gives a surjection G —> DG. Indeed, Do may be defined to be the completion of G. At any rate, for proving (E5), given any h £ G let 9(h) be the real number which corresponds to the Dedekind cut (L, U) of Q where L = {m/n £ Q with m £ Z and n £ N+ : nh < mg} and U = Q\L, and take <j>(h) = x6(h).

58

LECTURE L2: CURVES AND SURFACES

EXERCISE (E13). Prove the geometric series identity (i) in the univariate power series ring i?[[X]] over any ring R, and from it deduce description (ii) of units in the multivariate power series ring i?[[i?i,... ,Zd]]- Prove claims (V4) to (V6) and thereby show that, for any field K and any ordered abelian group G, K((X))Q is a field with a valuation whose value group is G. [cf. L6§6(E45)]. EXERCISE (E14). Complete the existence proof for vector space bases and transcendence bases by establishing claims (W7) to (Wll) made in (R8). Complete the proof that cardinals are linearly ordered by establishing claims (W12) to (W15) made in (R9). Complete the proof that ordinals are well ordered by establishing claims (W16) to (W20) made in (RIO). DEFINITION (D7). [Approximate Roots]. In the Hint to (E6) we showed how to use n-adic expansions of positive integers to find successive approximations to the d-th root of a positive integer. Mixing a generalization of this with a generalization of the completing the square method of solving quadratic equations leads us to the concept of approximate roots of polynomials. So consider a monic polynomial

F = F{Y) = YN + J2

A YN l

i ~

l
of degree N > 0 in Y with coefficients Ai in a ring R. It N is a unit in R then we can generalize the completing the square idea to completing the iV-th power by writing

F(Y) = (Y - Ai/N)N

+ ]T

AHiiY-Ai/N)"-'

2
with A\ e R, i.e., by killing the coefficient of YN~X. To generalize this further let D > 0 be an integer which divides N. Instead of assuming N to be a unit in R, assume D to be a unit in R; note that in case of a field R this is equivalent to assuming that the characteristic of R does not divide D and so characteristic zero is always ok. Now we look for a monic polynomial

G = G{Y) = YN'D +

Yl

BiY^I0^

l
of degree N/D in Y with coefficients Bi in R such that GD is as close to being equal to F as possible. As (E15) below shows, if we interpret this as requiring degy(F — GD) < N—(N/D) then a unique G exists, and we call it the approximate -D-th root of F (relative to Y) and denote it by App£>(F) or App£>iy(-F). Recall that for any m,n in N with n > 1, the n-adic expansion of m consists of writing m = Y^i>o rninl where integers 0 < rrii < n are the digits of the expansion. Likewise for any / , g in R[Y] with g monic of positive Y-degree, the p-adic expansion of /

§«: NOTES

59

consists of writing / = ^2i>0 fig1 where fi € R[Y] with degy/j < deg y g are the digits of the expansion. By (E16) below these expansions exist and are unique. Moreover, if / is monic of Y-degree N > 0 and the Y-degree of g is N/D £ N+ where D £ N + is a unit in R, then fD = 1 and fi = 0 for all i > D; we try completing the -D-th power by putting Tf(g) = Tf:y(g) = g + (fo-i/D) and calling it the /-Tschirnhausen of g (relative to Y); see Krull's charming little book [Krl] for the reference to the 1683 work of Tschirnhausen who was a friend of Leibnitz. By (El 7) below, starting with any monic g of degree N/D and applying 77 to it N/D times will produce the approximate D-th root of / . EXERCISE (E15). Let F be a monic polynomial of degree N > 0 in Y over a ring R. Let D > 0 be an integer such that D divides TV and D is a unit in R. Show that there exists a unique monic polynomial G of degree N/D in Y over R such that degy(F - GD) < N - (N/D). Hint: With display as in (D7), the last condition gives the equations Ai — DBi + Pi(Bi,..., Bi_i) for 1 < i < N/D where the coefficient of YN~l in GD equals DBi + Pi(Bi,..., -Bj-i) with Pi a polynomial over Z; since D is a unit in R, these can be solved successively (in a unique manner). EXERCISE (E16). Given integers m > 0 and n > 1, show the unique existence of the n-adic expansion of m. Given univariate polynomials / , g over a ring R with g monic of positive degree, show the unique existence of the g-adic expansion / = Yli>o fi9% °f /• Show that if / is monic of degree N > 0 and the degree of g is N/D where D is a positive integer factor of N, then fo = 1 and / , = 0 for all i > D, and moreover: App£>(/) = g •*=> fn-i — 0. EXERCISE (E17). Let / , g be univariate monic polynomials of positive degrees N and N/D over a ring R where D is a positive integer which divides N, and is a unit in R. Let 17(3) = g, and let / = J2o
60

LECTURE L2: CURVES AND SURFACES

integers; all the rest is the work of man. §9: C O N C L U D I N G N O T E When there are several things which we do not know, we call them X, Y, Z and so on. This leads to one or more polynomial equations in these variables. One equation in two variables gives rise to a plane curve. One equation in three variables produces a surface in three space. Two equations in two variables give rise to the finite number of points in which the two curves intersect. Two equations in three variables produce one or more space curves in which the two surfaces intersect. Bezout makes a proper counting of the intersections.

Lecture L3: Tangents and Polars §1: SIMPLE G R O U P S After proving the simplicity of the alternating group An for n > 5, Galois enlarged the list of finite simple groups by showing that the projective special linear group PSL(2,p), which we shall define in a moment, is simple for any prime p > 5. Jordan extended this by proving the simplicity of PSL(n, p) for all n > 3; his proof may be found in his 1870 book [Jor] which was the first expansion of Galois' ideas on equation solving. Then Moore proved the simplicity of PSL(2, q) for any power q of p with q> p; indeed this was his main aim in the paper [Mol] which he presented at the Chicago World's Fair of 1893 which was the birthplace of the Galois field G F ( Q ) . Finally, in his influential 1901 book [Die], Dickson established the simplicity of PSL(n, q) for all n > 3 and q > p. Eventually we shall prove all these simplicity statements, [cf. L6§6(E46)]. To define PSL we start by introducing determinants and matrices. Given any ring R, for any integers m > 0 and n > 0, by an m x n matrix over R we mean a "rectangular array" A = (Aij) with Aij G R for 1 < i < m and 1 < j < n. By the (i, j)-th entry of A we mean the element A^. The 1 x n matrix whose (1, j)-th entry is A^ is called the i-th row of A, and the raxl matrix whose (i, l)-th entry is A^ is called the j - t h column of A. The set of all m x n matrices over R is denoted by MT(m xn,R). For any B = (By) G MT(m xn,R), we define the sum A + B = {{A + B)ij) G MT(m x n, R) by putting {A + B)^ = Atj + Bij. Clearly MT(m x n, R) becomes an additive abelian group whose 0 is the matrix with all entries reduced to the 0 of R. Next, for any integer r > 0 and any B = (Btj) G MT(n x r,R), we define the product AB = ((AB)„) G MT(m x r,R) by putting (AB)ij = ^2i 0 and any C G MT(r x s,R) we have (Ml)

(AB)C =

A{BC).

Now assume m = n. Then MT(n x n, R) becomes a skew-ring whose 1 is the matrix In = ((ln)ij) where ( l n ) y = the Kronecker 6ij which equals 1 or 0 according as i = j or i ^ j . The determinant of A is defined by putting

(M2)

det(A) = ^2 sgn(^) I I
A

™«)

\
where Sn is the permutation group (= the symmetric group) on ( 1 , 2 , . . . ,n), and sgn(
det(AB) = det(^)det(B). 61

62

LECTURE

L3: TANGENTS

AND

POLARS

Recalling that U(R) is the group of all units in R, it is easily seen that (M4)

f/(MT(n x n, R)) = {A e MT(n xn,R):

det(A) e U(R)}.

The group U(MT(n x n,R)) is called the n-dimensional general linear group over R and it is denoted by GL(n, R). The kernel of the determinant map GL(n,R) —> U(R), i.e., the homomorphism given by A \—> det(yl), is called the n-dimensional special linear group over R and is denoted by SL(n, i?). In other words SL(n,R) = {A e GL(n, .R) : det(A) = 1}. Being the kernel of a homomorphism, SL(n, R) < GL(n, R), i.e., SL(n, i?) is a normal subgroup of GL(n, R). Note that the determinant of a 0 x 0 matrix is 1; namely |5o| = 1 because |0 0 | = 1, and rii 0, we have an injective homomorphism fc* —> GL(n, k) which sends every a £ kx to the matrix whose (i,i)-th entry is a for all i and whose (i, j)-th entry is 0 for all i ^ j . The image of this homomorphism is called the n-dimensional homothety group over k and is denoted by HL(n,k). In other words, HL(n, k) is the group of all nonzero scalar matrices. As we shall eventually see, an n x n matrix induces a linear transformation of the n dimensional vector space kn over k. The transformation induced by a nonzero scalar matrix sends a geometric configuration to a configuration which is similar to it, like two similar triangles. For the use of the word homothety for such a transformation, see the last chapter starting on page 267 of the classic Pure Geometry book [Ask] by Askwith published in 1921 (first edition 1903) by the Cambridge University Press. We can easily see that (M5)

HL(n, jfe) < GL(n, jfc)

and we define PGL(n, k) = GL(n, fc)/HL(n, k) and PSL(n, k) = SL(n, k)/(SL(n, k)n HL(n, k)), and call these the n-dimensional projective general linear group over k and the n-dimensional projective special linear group over k respectively. In case k is the Galois field GF(q) where q is a power of a prime p, we may write GL(n, q), SL(n,q), PGL(n,g) and PSI^n,*?) in place of GL(n,k), SL(n, fc), PGL(n, k) and PSL(n, k) respectively. As said above, eventually we shall prove the simplicity of PSL(n, q) for n > 2 with (n,q) ^ (2,2), (2,3). Now the list of finite simple groups can be enlarged by considering finite orthogonal groups, i.e., the finite analogs of groups of distance preserving transformations of the ordinary euclidean space. The said orthogonal groups, as well as the related symplectic groups and unitary groups, are based on the geometry of quadrics; later on we shall give a detailed treatment of these groups. Collectively, the finite linear groups GL(n, q), SL(n,q), PGL(n,q) and PSL(n, q), together with the finite orthogonal, symplectic and unitary groups, are called finite classical groups. If we do not insist the field k to be finite then they are simply

§2: QUADRICS

63

called classical groups. This terminology goes back to Hermann Weyl's book [Wey]. Dickson [Die] called all of them linear groups. §2: QUADRICS To motivate the geometry of a quadric, let me start with the following: HOMEWORK PROBLEM (HI). Draw the two tangent lines to a circle D from a point A, and let n be the line joining the two points of contact. Call n the polar of A, and A the pole of n . Show that if the polar of A passes through a point B then the polar of B passes through A. If done pedantically, by analytic geometry, this may take five pages. The pure geometry solution given on page 14 of Askwith's book [Ask] cited above takes one page. Soon I shall show you a half-line proof. That proof will be based on homogeneous coordinates which are the mainstay of analytic projective geometry. Pure geometry may be regarded as synthetic projective geometry. Affine, i.e., ordinary, space is converted into projective space by adding points at infinity to it. In the above problem, the circle may be replaced by any conic such as an ellipse or parabola or hyperbola, or even by a quadric or a hyperquadric. A quadric is a surface in 3-space given by a quadratic equation, such as a sphere or an ellipsoid or a paraboloid or a hyperboloid. In case of a quadric, the polar of a point A is a plane which intersects the quadric in the points of contact of various tangent lines from A to the quadric. More generally a hyperquadric in AT-space is given by a quadratic equation in AT variables, i.e., it is a hypersurface of degree 2; similarly, a hyperplane is a hypersurface of degree 1. Now the polar of a point A is the hyperplane which intersects the hyperquadric in the points of contact of the tangent lines from A to the hyperquadric. This gives rise to the following: GENERALIZED HOMEWORK PROBLEM (H2). In case of a hyperquadric, show that if the polar hyperplane of a point A passes through a point B then the polar hyperplane of B passes through A. The half-line solution I spoke of actually works in the general hyperquadric case! So what are tangent lines, tangent cones, singular points, singular curves, and so on? Seeing that analytic geometry, like its sister discipline of projective geometry, has mostly been eliminated from college courses, let me first review these concepts from the viewpoint of calculus and then from the viewpoint of analytic geometry. In calculus we consider a plane curve C given by an explicit equation Y = $(X), and introduce the concept of the tangent T to C at a point P = (U, V) of C with V = $(t/) as the limit of chords (i.e., secant lines) of C through P. Then we

64

LECTURE

L3: TANGENTS

AND

show that the slope of T is the value of $'(X) = Y - V = $'({/) (X — U) as the equation of T. If C F(X, Y) = 0 then we use implicit differentiation to subscripts indicate partial derivatives. This yields

POLARS

^ at X = U. This gives us is given by an implicit equation get FxdX + FydY = 0 where ^ = =pL. If Py(£/, V) ^ 0

then the slope of T is "ffiffff giving Y - V = ~Fy(u,v) (X ~ U) a s t h e equation of T, and now by clearing the denominator and bringing things to one side we get FX(V, V)(X -U) + FY(U, V)(Y -V)

=0

as the equation for T. Likewise, if FX(U,V) ^ 0 then for the "antislope of T" (= the tan of the angle T makes with the Y-axis) we have %y\(u,v) = ~FYWV) giving X — U = ~F^m V\ (Y — V) as the equation of T, and again by clearing the denominator and bringing things to one side we get the same equation for T as displayed above. Thus the calculus method works if either Fx or Fy is nonzero at P . But if Fx and Fy are both zero at P then it breaks down and the books simply declare that P is then a singular point of C. Note that F(U, V) = 0 ^ FY(U, V) is exactly the condition under which, according to the Implicit Function Theorem, F(X,Y) = 0 can be solved near P giving Y = $(X) with V = $(£/). Similarly, if F(U,V) = 0 ^ FX{U,V) then we solve F{X,Y) = 0 to get X = * ( F ) with U = ^(V). The point P is singular exactly means that the Implicit Function Theorem does not work at P . §3: H Y P E R S U R F A C E S As I recently became aware, the modern calculus treatment of tangents to a surface S : F(X, Y,Z) = 0 in 3-space is even more troublesome. What is done is to introduce the gradient to S at a point P of S as the vector (Fx, Fy, Fz) evaluated at P . The gradient is also called the normal to S at P , and the plane perpendicular to it is called the tangent plane to S at P . More generally, given a hypersurface S : F(X\,..., XN) = 0 in TV-space, the gradient to S at a point P = (Ui,..., UN) of S is the vector of partials (Pxi, • • •, FxN) evaluated at P; thinking of P as a variable point, the gradient of S at P is the vector (Fu1,..., FuN) where Fut is the partial derivative of F(U\,..., UN) with respect to Ui. Again, the gradient is also called the normal to S at P , and the hyperplane Tp perpendicular to it is called the tangent hyperplane to S at P . Clearly Fux (Xi —Ui)-\ hFu N (X^ — UN) = 0 may be taken as the equation of Tp. Alternatively, Tp is defined to be the hyperplane spanned by the tangent lines to the various curves on S through P . Both these definitions break down if all the N partials are zero at P , i.e., if P is a singular point of S. The alternative definition has the further difficulty of having to define tangents to space or hyperspace curves, and of proving that sufficiently many of them can be drawn on the surface or hypersurface S. All these difficulties vanish if, instead of calculus, we follow the method of analytic geometry.

§3: HYPERSURFACES

65

To explain the method of analytic geometry, let us intersect the hypersurface S:F(X1,...,XN)=0 in iV-space by a line L through a point P =

(UU...,UN)

of S. Let the line L be given parametrically by the equations L : Xi = Ui + Rtt for 1 < i < N where t is a parameter and the components of the vector R = ( P i , . . . ,RN) are proportional to the "direction cosines" of the line; for the idea of direction cosines see the 1903 book [Bel] of R. J. T. Bell on Three Dimensional Analytic Geometry; in effect, they are the cosines of the angles a line makes with the various coordinate axes. To find the points of intersection, we substitute the equations of L in the equation of S. This gives F(Ui + Rrf, ...,UN + RNt) = h{t) = t"/i*(t) with h*(0) ^ 0 where fi = ordth(t). Now t = 0 is a root of h(t) of multiplicity fi, and so we visualize that /x points of S have coalesced at P . We call fj, the intersection multiplicity of S and L at P and we denote it by intp(5, L). To indicate the dependence of fi on P and R, we may write HP,R for fi. Expanding F around P we have

F{XU ... ,xN) = ^Gh...iN(x1

- u,r • • • (XN - uNy».

Let v = ordpF i.e., let v be the smallest value of i\ + • • • + i^ with G^...^ ^ 0. We call v the multiplicity of S at P and we denote it by multp.S. Geometrically v can be characterized by noting that, with P fixed, it is the minimum of fip,R taken over all R, i.e., taken over all lines through P . To indicate the dependence of v on P , we may write up for v. We call P a simple or double or triple or ... point of S according as vp = 1 or 2 or 3 or ... ; by a singular point of S we mean a point of S which is not simple. The line L is said to be tangent to 5 at P if its intersection multiplicity with S at P is greater than the multiplicity of S at P , i.e., if fip,R > up. Let

GV{XU...,XN)=

Yl

Gil...iN(X1-U1r...(XN-UN)i».

Now a cone with vertex P is a hypersurface passing through P such that, for every point P' ^ P on it, the entire line PP' is on it. Clearly G„ = 0 is a cone with vertex P , and a line L through P is on it iff L is tangent to S at P . We call Gv = 0 the tangent cone of 5 at P . If P is a simple point of S, i.e., if v = 1, then for

66

LECTURE L3: TANGENTS AND POLARS

1 < i < N we have F ^ = G0...010...0 with 1 in the i-th place, and hence the said cone is reduced to the hyperplane FVl(Xi

-Ui)

+ --- + FUN(XN

-UN)

=0

which we call the tangent hyperplane of S at P. Note that so far we did not assume F to be a polynomial. So at least when P is the origin ( 0 , . . . ,0), the above discussion applies to an analytic hypersurface, i.e., to the case when F is a (formal) power series. Now suppose that F(X\,..., Xjy) is a polynomial of degree d. Then d is also called the degree of the hypersurface S; if JV = 2 then S is a curve of degree d, and if N — 3 then S is a surface of degree d. To characterize d geometrically, we note that now h(t) is a polynomial of degree d, and hence it has d roots giving d points of intersection of S with L. This is so for most values of Ui and R4. For special values of Ui and Ri, the equation h(t) = 0 may have less than d distinct roots because some of them may be multiple roots. Thus d may be characterized as the maximum number of points in which a line meets 5; here and in what follows, the line is assumed not to lie on S. Moreover, if there are c distinct roots and their multiplicities are e\,..., ec, then for the sum e = e\ + • • • + ec we have e = d, and for the corresponding points P i , . . . , Pc in which S meets L we have intp^S, L) = e;. Thus the sum e of the intersection multiplicities of S and L at their points of intersection equals the degree of S. This is the first case of Bezout's Theorem, which is the oldest Theorem of algebraic geometry and was enunciated by Bezout in his 1770 book [Bez]. Here is an example which shows that we need to be careful in this. Namely, with TV > 1 and d > 1, let F(XU...,XN)

= XtlX2

+ X3 + • • • + XN - 1

a n d P = (1,1,0, . . . , 0 ) . Then h(t) = (1 + J R 1 i) d " 1 (l + Rat) + R3t + --- + RNt - 1 = Rf~1R2td

+ (terms of degree

Therefore if either Ri = 0 or R2 = 0 then the sum e is smaller than the degree d of S, because some points of intersection of S and L "have gone to infinity." To take care of this we must go over to projective spaces and homogeneous coordinates. For an elementary treatment of all these things, you may like to browse in my 1990 AMS book [A04]. §4: H O M O G E N E O U S COORDINATES Again let F(Xi,...,

XN) be a polynomial of degree d and consider the hyper-

67

§4: HOMOGENEOUS COORDINATES

surface 5 in JV-space given by F(X\,...,

Xjv) = 0. Expanding F we have

F(Xi, . . . ,Xpf) — 2_j^h—iN^l

• • -^N •

Collecting terms of like degree we have d

F(Xu...,XN)

=

J2Fj(Xi,...,XN)

where Fj(Xi,...,XN)

=

2_^

Fil...iNXll1 ..

.X$

is a homogeneous polynomial of degree j , i.e., every term in it is of degree j . For some j it may happen that Fj has no term in it, i.e., Fj may be reduced to zero. But Fd is not reduced to zero because F is of degree d. Now for very large values of the variables, the values of the terms of degree d are much bigger than the values of the terms of degree < d, i.e., Fd dominates the rest of F. Therefore, heuristically speaking, we may say that Fd = 0 gives the points at infinity of S. Algebraically, we "homogenize" F by multiplying terms of degree j by the (d—j)th power of a new variable XN+I to get a homogeneous polynomial f(Xi,..., X;v+i) of degree d. By "dehomogenizing" / , i.e., by substituting XJV+I = 1 in it, we get back F. Thus d

f(Xi,...

,XN+i)

= f^X^jFjjXi,.. j=o

.,XN)

and F(Xi,...

,XN) = f(Xi,...

,XN,1).

Alternatively, we substitute Xi = Xi/xM+i

for

1--->X")3=0

By substituting xjv+\ = 0 in f(xi,..., XJV+I) we get Fd(x\,... ,XN), and so we may say that Fd = 0 is the intersection of / = 0 with the "hyperplane at infinity" given by xN+i = 0.

LECTURE L3: TANGENTS AND POLARS

68

Geometrically, let A^ = A f = kN = {P = (Ui,...,

UN) : Ui G k for 1 < i < TV}

be the affine TV-space over the field k. (If you do not want to be bothered by "fields" and such, you may simply take k be the set of all real numbers R, and then the affine TV-space is your familiar M.N). We obtain the projective TV-space pN _ pjV o v e r fc w n e n w e a U g m e n t A^ by the "hyperplane at infinity" in the following manner. First, instead of representing a point P of KN by a single Ntuple (Ui,..., UN), let us represent it by all the (TV + l)-tuples u = (ui,..., UJV+I) such that UN+I ^ 0 and Ui = Ui/u^+i f° r 1 < » < N; these are called homogeneous coordinates of P. Now the remaining (TV + l)-tuples, i.e., the (TV + l)-tuples u = (u\,..., UAT+I) with UJV+I = 0, represent points of the hyperplane at infinity, again with the understanding that different (./V+l)-tuples represent the same point iff they are proportional. In other words, let us say that two (N + l)-tuples ( u i , . . . , UJV + I) and (u[,..., U'JV+I) a r e equivalent iff they are proportional, i.e., iff (u[,..., u'N+1) = (cu\,..., CUJV+I) for some nonzero constant c. Now P ^ may be identified with the set of all equivalence classes of (iV + 1)-tuples, with the understanding that the zero (N + l)-tuple ( 0 , . . . , 0) is excluded from consideration; in other words Pw = P^ = ( ^ + 1 \ { ( 0 , . . . , 0 ) } ) / ~ where ~ is the said equivalence relation. A projective hypersurface S of degree d, i.e., a hypersurface in the projective space P " , is given by an equation f(xi,..., XJV+I) = 0 where f(xi,..., XN+I) is a nonzero homogeneous polynomial of degree d; this is unambiguous because for any projective point P we have f(u\,..., u^+i) = 0 for one homogeneous coordinate tuple ( u i , . . . , UN+I) of P iff f(u\,..., u'N+1) = 0 for every homogeneous coordinate tuple ( u i , . . . ,u'N+1) of P; when this is so we say that P G S; otherwise P $. S. In particular, a projective hyperplane is given by a homogeneous linear equation a\X\-\ hfliv+i^iv+i = 0 where the coefficients a\,... ,ajv+i are determined up to proportionality, and at least one of them is nonzero; if a, ^ 0 for some i < N then this corresponds to the hyperplane a\X\ H h CLNXN + ajv+i = 0 in AN; if a< = 0 for alH < N then we get the hyperplane at infinity given by XN+I — 0. The portion of S "at finite distance," i.e., S (~l AN, is the affine hypersurface of degree d given by the equation F(Xi,... ,XN) = 0 where F(Xi,... ,XN) = f(X\,... ,Xff,l); the remaining portion of S consists of the points of S lying on the hyperplane at infinity. Any hyperplane H in PN is clearly a copy of P w _ 1 , and its complement PN \ H is a copy of A^; in particular, this is so for the hyperplane Hi : xt = 0, i.e., P w \ Hi is a copy A ^ of Af; moreover, P ^ = U ^ j ^ P ^ \ Hi), i.e., the projective TV-space can be covered by TV + 1 copies of the affine TV-space; in other words

Pf = uf = + 1 (P JV \^) = uf = + 1 A^ Therefore the definitions of simple point, singular point, multiplicity, intersection

69

§4: HOMOGENEOUS COORDINATES

multiplicity, tangent cone, and tangent hyperplane can be extended to projective hypersurfaces. Let u = ( u i , . . . ,UN+I) be homogeneous coordinates of a point P of S. Let x = (xi,...,xN+i) and fx = (fXl,... ,fXN+1) and fu = (fUl,... ,fUN+1), and as usual let us put X • fx = XifXl H

h XN+lfxN

+1

and X • Ju

=

X\JUl

+ •• •+

XM+1JUN+I

and so on. By Euler's Theorem on Homogeneous Polynomials we have x-fx = df(x), and hence (because P G S) we get u • fu = 0. For a moment assume that u^+i ^ 0 and let Ut = Ui/uN+i for 1 < i < N. Then by the equation f(x\,... ,XM+I) = xfj+lF(xi/xN+i,... ,XN/XN+I), we see that fUi = uN~^lFui for 1 < i < N; since P is singular for S iff F\ji = 0 for 1 < i < N, in view of the equation u • fu = 0 we conclude that: P is singular for S iff fUi = 0 for 1 < i < N + 1. In case P is a simple point of S, we know that the tangent hyperplane Tp of S at P has the afHne equation Fj/i(^i — Ui) + • • • + FUN(XN — UN) = 0; since u • fu = 0 and fm — u%~+iFui for 1 < i < N, we conclude that Tp is given by the homogeneous equation x • fu = 0. By symmetry it follows that both these conclusions remain valid without assuming UN+I ^ 0. Thus, for any point P of S with homogeneous coordinates u = (m,..., UN+I), we have that: (11) P is a singular point of 5 iff fUi = 0 for 1 < i < N + 1, and (12) if P is a simple point of S then the tangent hyperplane Tp of S at P is given by the homogeneous linear equation x • fu = 0. For any point A of PN with homogeneous coordinates y = (yi,... ,yjv+i), let us define the polar 11^ of A (relative to S) to be the hypersurface of degree d — 1 defined by the homogeneous equation y • fx = 0. By (II) we see that: (13) if P is any singular point of S then P e 11^. By (12) we see that: (14) if P is any simple point of S then: P G ILA <£> A £ Tp. By (14) we see that: (15) if P is any simple point of S with P ^ A then: P G n ^ <^> the line AP is tangent to 5 at P . It follows that if S is a nonsingular (= having no singular point) hyperquadric then the present definition of polar coincides with the definition given when we assigned Homework Problems (HI) and (H2). To do the Homework Problems (HI) and (H2), assume that S is a nonsingular hyperquadric. Let A and B be any points in PN with homogeneous coordinates y = (j/i,..., j/jv+i) and z = (z\,..., ZN+I) respectively. Then the polars HA and n ^ are given by the equations y • fx = 0 and z • fx = 0 respectively. We can write f(Xi, . . . , XN+l) =

2J l
aiX

^+

H2 l<j<j'
bjj'XjXf

70

LECTURE L3: TANGENTS AND POLARS

with constants
yfz=

]T

2aiyizi+

b

3j'(yjzj' +Vj'zj) =z-

Y^

l
fv

l<j<j'
Therefore

B eUA^y-

fz = 0^ z-

fy=0^AeUB.

§5: S I N G U L A R I T I E S Reverting to affine equations, let S : F(X\,..., XN) = 0 be a hypersurface in the affine iV-space over a field k. Given a point P of S, by a translation of the coordinate system, we may assume it to be the origin ( 0 , . . . , 0). This amounts to making the /^-automorphism of the polynomial ring k[X\,..., Xn] given by Xi — i > Xi + Ui where ( f / i , . . . , UN) are the original coordinates of P. Let multpS = v. Then F(Xi,...,

XN) = F„(Xi,...,

XN) + (terms of degree > v)

where F„(Xi,..., XN) is a nonzero homogeneous polynomial of degree v. First let us think of S as an analytic hypersurface at P, i.e., let us assume F to be a power series. Let fc[[Xi,..., XN]] be the ring of all formal power series in X\,..., XN with coefficients in k; here formal means we are disregarding questions of convergence, and we are going to do only algebraic manipulations; for instance, we permit the power series ^2,ilXl whose radius of convergence is zero. We shall discuss the notion of radius of convergence later on. Later on we shall also show that power series rings over a field are UFDs. Using this fact we can write F = E0E[' ... E? where r\,... ,rt are positive integers, E\,..., Et are pairwise coprime irreducible power series with £ i ( 0 , . . . ,0) = • • • = Et(Q,.. • ,0) = 0, and EQ is a power series with E0(0,...,0) ^ 0. We call Si : Ex = 0 , . . . , St : Et = 0 the analytic branches of S (at P). Let multpSj = Ui for 1 u^

where EiVi {X\,..., XN) is a nonzero homogeneous polynomial of degree Vi. For the "initial form" Fu of F we have the factorization FV{XU ...,XN)

= EQ(0, • • •, 0) Yl

EiVi{Xu...,

XN)ri

with £ 0 ( 0 , . . . , 0) ^ 0.

l
We say that S has a t-fold NC (= Normal Crossing) at P if all the analytic branches have a simple point at P and their tangent hyperplanes are linearly independent,

§5:

71

SINGULARITIES

i.e., if v\ = • • • = vt = 1 and upon letting Ea =

Yl

E

'ijXJ

with E

'ij

e

k

l<j
we have that the rank of the t x N matrix {E'^) is t. Note that the rank of an m x n matrix (a^) over k is the largest nonnegative integer r for which there exist sequences 1 < p\ < • • • < pr < m and 1 < q\ < • • • < qr < n such that the determinant of the r x r matrix (aPiqj) is nonzero. Also note that if S has a t-iold NC at P then obviously t < N. We may say that, in the sense of analytic geometry, S has a simple point at P iff it has a 1-fold NC at P. Now let us think of S as an algebraic hypersurface, i.e., let us assume F to be a polynomial of some degree d. Since the polynomial ring k[X\,..., X^] is a UFD, we can write

F = r 0 rf... r£where p\,..., pT are positive integers, T i , . . . , TT are pairwise coprime irreducible polynomials with T i ( 0 , . . . , 0) = • • • = r r ( 0 , . . . , 0) = 0, and To is a polynomial with r 0 ( 0 , . . . , 0) ^ 0. We call Ei : Ti = 0 , . . . , E T : TT = 0 the algebraic branches of S at P. Concerning these two factorizations, you may do the following Homework Problem, where we recall that a partition of a set I f is a collection of nonempty subsets of W whose union is W and which are pairwise disjoint, i.e., have pairwise empty intersection; we may express this by writing W = Llig/ W» where Wi are the subsets indexed by an indexing set / . HOMEWORK PROBLEM (H3). Show that the analytic factorization is a refinement of the algebraic factorization, i.e., there is a partition { 1 , . . . , i] = ]Ji
LECTURE L3: TANGENTS AND POLARS

72

a power series of even order with constant term 1 has a power series square root. In case of characteristic zero, this follows from Newton's Binomial Theorem for fractional exponents thus. By the said Theorem we have the power series identity

(Ti)

(i + xy = i + Y.

r(r

~ 1 ) --., ( r ~ i

+ 1)

x*

where r is any rational number [cf. §12(E16)]. For X we may substitute any power series in X of positive order. Therefore by taking r = 1/n we see that any power series A(X) whose constant term is 1 has an n-th root for every positive integer n. If the order of the power series B(X) is en for some integer e > 0 and the coefficient of Xen in B(X) is 1 then we can write it as B(X) = (Xe)nA(X) with A(0) = 1. Therefore B(X) has an n-th root. It follows that Y2 - X2 - X3 = [Y - X + (terms of degree > 1)] x [Y + X + (terms of degree > 1)] and Y2 - X* - X5 = [Y - X2 + (terms of degree > 2)] x [Y + X2 + (terms of degree > 2)]. Consequently, at a node we have a 2-fold NC, but at a tacnode we do not have an NC. At a cusp we of course do not have an NC. §6: HENSEL'S LEMMA A N D N E W T O N ' S T H E O R E M The fact that by L2§3(T1) we can find an n-th root of any A(X) € k[[X}] with A(0) = 1, and for the said n-th root a(X) € fc[[X]] we have Q(0) = 1, can be restated by saying that the polynomial Yn — A{X) is divisible by Y — a(X) in /c[[X]][y]. Moreover by carrying out the division we get the factorization Yn - A{X) = [Y - a(X)} x [Y"" 1 + (31(X)Yn~2 + ••• + /3„_i(X)] where 0i(X) e K[[X}] with A(0) = 1 for 1 < i < n - 1. Around 1900 this was generalized by Hensel into his celebrated lemma which has many incarnations. Let us now state and prove the following simplest version of it, where we note that two univariate polynomials over a field are coprime means they have no nonconstant common factor. BASIC HENSEL'S LEMMA (T2). Let F(X, Y) = Yn + Ai (X)Yn~1

+ --- +

An(X)

§6: HENSEL'S LEMMA AND NEWTON'S THEOREM

73

be a monic polynomial of degree n > 0 in Y with coefficients Ai{X) in fc[[X]] for 1 < i < n where k is any field. Assume that F(0,Y)

=

G(Y)H(Y)

where G(Y) and H(Y) are coprime monic polynomials of positive degrees r and s in Y with coefficients in k. Then there exist unique monic polynomials G(X,Y) and H(X, Y) of degrees r and s in Y with coefficients in A;[[X]] such that F(X,Y)

=

G(X,Y)H(X,Y)

and G(0, F ) = G(Y)

and

#(0, y) =

H(Y).

PROOF. Rewriting F, G, H as power series in X with coefficients polynomials in Y we have 'F(X, Y) = FQ(Y) + F1(Y)X < G(X,Y)

= G0(Y) + Gx(Y)X

H(X, Y) = H0(Y) + H1(Y)X

+ • • • + Fi(Y)Xi i

+ ••• + Gi{Y)X

i

+ ••• + Hi{Y)X

+ ... + ... + ...

where Fo(Y) = F(0, Y) = a monic polynomial of degree n over k GQ(Y) = G(Y) = a monic polynomial of degree r over k HQ(Y) = H(Y) = a monic polynomial of degree s over k Fi(Y) G k[Y] of degree < n for alii > 0 Gi(Y) e fc[F] of degree < r to be found for alH > 0 KHi(Y)

e /s[y] of degree < s to be found for alii > 0

and where by the coprimeness we can find P(Y) and Q(Y) in k[Y] such that (1)

P(Y)Go(Y)

+ Q(Y)H0(Y)

= l.

The requirement F = GH is equivalent to saying that for every / > 0 we want to satisfy the equation (2)

£

GiWHjiY)

= Ft{Y).

For I = 0 this is given. Now let / > 0 and suppose we have found d and Hj for all i < I and j < I such that the above equation is satisfied for every value of I smaller than the given value. By sending the known terms to the right we see that (2) is equivalent to requiring that (2')

Hl(Y)G0(Y)

+ Gi(Y)H0(Y)

= F[{Y)

74

LECTURE L3: TANGENTS AND POLARS

where F{(Y) = Ft(Y) -

Yl

Gi(Y)Hj(Y)

e k[Y] is of degree < n.

i-\-j=l with i < / and j
Upon letting ff/(y) = P(Y)F[(Y) (!')

and GJ(Y) = Q(Y)F{{Y),

by (1) we get

^ / ( ^ ) G o ( r ) + GKy)ffo(^) = F/(Y).

Dividing fl/(y) by flo(y) we find G'{{Y) and JJj(y) in k[Y] such that #/(Y) = G'l'(Y)H0(Y) + Hi(Y) and the degree of Ht{Y) is < than the degree of H0(Y) which is 5. Now upon letting Gt{Y) = G't(Y) + G'/(Y)G0(Y), by (1') we get (1")

Hl(Y)Go(Y)

+ G«(y)JJoOO =

Fl(Y).

In the above equation, the degrees of the first and the last terms are < n and so is the degree of the middle term GI{Y)HQ{Y), but in it the degree of HQ{Y) is s = n — r and hence the degree of the other factor Gi(Y) must be < r. Thus the degree conditions are satisfied and hence (1") yields (2'), which completes the induction on I. The uniqueness can also be proved by induction thus. Given I > 0, suppose G i ( y ) , . . . , G / _ i ( y ) and H!(Y),...,Hi-i(Y) are unique. Let Gf(Y) and flf(y) be any polynomials of degrees < r and < s respectively such that (2*)

H?(Y)G0(Y)

+ GUY)H0(Y)

= F/(Y).

Subtracting (2*) from (2') we get (2**)

Hr(Y)G0(Y)

=

-Gr(Y)H0(Y)

where G?(Y) = Gt(Y) - Gj(Y) G k\Y] is of degree < r and H?*(Y) = Ht(Y) H*(Y) £ k[Y] is of degree < s. Since Go(Y) and HQ{Y) are coprime and by the above equation GQ(Y) divides Gi*(Y)Ho(Y), it must divide G**(Y); but the degree of Gf*(Y) is smaller than the degree of Go(Y) and hence we must have Gi*(Y) = 0, i.e., G[(Y) = Gt(Y). Similarly Hfiy) = Hi(Y). This proves uniqueness. COMPLETING THE POWER. Now we shall use Hensel's Lemma to prove Newton's Theorem stated in L2(T1), i.e., in (Tl) of Lecture L2. An important ingredient of the proof will be Shreedharacharya's completing the square method in its incarnation of "completing the n-th power." By this we mean, given an n-th degree monic polynomial F(Y) = Yn + AxYn~l + • • • + An with coefficients A\,..., An in a ring R in which n is a unit (for instance R could be a field whose characteristic does not divide n), "by completing the n-th power"

§6: HENSEL'S LEMMA AND NEWTON'S

THEOREM

75

we can write

with A2, • •., An in R, and this gives us F(Y) = F(Y - (Ai/n))

= Yn + A2Yn~2 + ••• + ! „ .

The advantage of "killing" the coefficient of y n _ 1 is to detect multiple roots. So for a moment suppose that the coefficients A\,... ,An belong to an algebraically closed field K of characteristic 0, and let a\,... ,an be the roots of F in K, i.e., F(Y) =

(Y-ai)...(Y-an).

Assume that A\ = 0 but Aj ^ 0 for some j with 2 < j
ax)n = Yn - naiYn-1

+ ...

which would give us the contradiction A\ = —na.\ ^ 0. Therefore upon relabelling a\,..., an we can arrange matters so that for some positive integer r < n we have a\ = • • • = aT 7^ ar+j for 1 < j < s = n — r. Now upon letting G(Y) = (Y-ai)...(Y-ar)

and

H(Y) = (Y - ar+1)...

(Y -

ar+s)

we see that F(Y) = G(Y)H(Y) where G(Y) and H(Y) are coprime monic polynomials of positive degrees in Y with coefficients in K. Thus we have proved the following: LEMMA (T3). If F{Y) = Yn + £ i 0 with coefficients A^ in an algebraically closed field K of characteristic 0, and if Ax = 0 but Aj ^ 0 for some j with 2 < j < n, then F ( F ) = G(Y)H(Y) where G(F) and i? (10 a r e coprime monic polynomials of positive degrees in Y with coefficients vaK. PROCESSES ON ROOTS. To put the completing the power method in proper perspective, again consider a monic polynomial

F(Y) = Yn+ J2 AiY"-* l
of degree n > 0 in Y with coefficients Ai in a field K, and let Q 1 ; . . . , an be the roots of F in an overfield L of K, i.e.,

F(X)= n (Y-ai). l
76

LECTURE L3: TANGENTS AND POLARS

To "increase the roots" by B G L, by putting = Yn+

F*(Y) = F(Y-B)

]T

AtY"-*

l
with A* E L we get f~(Y)=

J ] ( ^ - ( a i + B)). l
Likewise to "multiply the roots" by 0 ^ C G L, by putting F**(Y) = C m F ( y C - 1 ) = Y " +

A**Y n _ i

^ l
with A** G L we get F**(Y) =

J]

(Y-Ca,).

l
As the third "standard process" on roots, to "reciprocate the roots" we assume An 7^ 0 and then by putting F***(Y) = YnF(Y~1)

= 1 + AiY + • • • +

AnYn

i.e., by "reading F backwards" we get F***(Y) = An

n

(Y-a/a,)).

l
With this preparation let us now prove a slight variation of Newton's Theorem which we call: BASIC NEWTON'S THEOREM (T4). Let there be given any monic polynomial F(X, Y) = Yn + Ai (X)Yn~1

+ --- +

An(X)

of degree n > 0 in Y with coefficients A\(X),... ,An(X) in k((X)) where k is an algebraically closed field of characteristic 0. Then for some integer m > 0 we have a factorization F(Tm, Y)=

[ J (Y -

Vi{T))

with

Vi{T)

£ fc((T)).

l
Moreover, if At{X)

G k[[X\] for 1 1 and assume true

§7-- fJVTJEGR-AL,

r}EFJ,&r*fD&WC7&

TT

for all values of n smaller than the given value. By completing the n-th power we have F*(X,Y)

= Yn+

= F(X,Y-(A1(X)/n))

A*{X)Yn~i

£ \
where A*(X) G k((X)) with Af (X) = 0. If yl*(X) = 0 for 2 < j < n then we have nothing to show. So assume that A*(X) ^ 0 for some j with 2 < j < n. Let u = min 1 0. Note that if Ai(X) £ fc[[X]] for 1 0. Let l
where A**(Z) = A*(Zw)Z~iv

6 k[[Z]} for 1 0 and n" > 0 in Y with coefficients infc[[£]]. By the induction hypothesis we can find integers m! > 0 and m" > 0 such that

F'{Tm',Y)=

J ] (Y-y'AT))

and F"(T m ",y)= JJ {Y-y'*(T))

l
l
with j/i(T) and y"(T) in &[[T]]. Upon letting rn = wm'm" positive integer and F*(Tm,Y)=

ft

(Y-y*(T))

with

we see that m is a

y*(T) e k((T))

l
and if w > 0 then y*(T) € fc[[T]] for 1 < i < n. It follows that F(Tm, Y)=

I ] (Y - yi(T))

with

y < (T)

G fc((T))

l
and if A^X)

€ fc[[X]] for 1 < i < n then yt(T) Gfc[[T]]for 1 < i < n. §7: I N T E G R A L D E P E N D E N C E

The last sentence of Theorem (T4) is a bit informal. It does not make completely clear whether the roots yi(T) asserted in the factorization display are automatically in k[\T]] when the coefficients Ai(X) are assumed to be in fc[[X]], or whether they

78

LECTURE L3: TANGENTS AND POLARS

can be so chosen. Of course the uniqueness of roots does show the two statements to be equivalent. To prove the apparently stronger statement, i.e., the implication At{X) G k[[X]] for all i => j/i(T) G k[[T]] for all i, let us introduce the ideas of integral dependence and integral closure. To go back in history, what is y/2 and why is it not rational? At any rate, the need to know y/2 arose since, by the Pythagoras Theorem, it is the length of the hypotenuse of an isosceles right angle triangle whose two equal sides have length one. Now l 2 = 1, 22 = 4 , . . . , and hence there is no integer whose square is two. Suppose there is a rational number t whose square is two, i.e., t satisfies the equation y2 — 2 = 0. We can write t = u/v where u and v are integers with v ^ 0; we may assume that this is a reduced form of t, i.e., u and v have no common prime factor; we may also assume that v > 0. Since (u/v)2 — 2 = 0 we get u2 = 2v2 which shows that every prime factor of v divides u. Therefore we must have v = 1. Thus t must be an integer which we have shown to be impossible. The belief that although y/2 is not rational, it is some sort of a number, makes it desirable to "complete" the field Q to get the field R; we can then accommodate y/2 — 1.4142..., n — 3.1415..., and so on. The above argument can be generalized to show that Z is a normal domain, i.e., any element t in its quotient field Q which is integral over it belongs to it. An element t in an overring 5 of a ring R is said to be integral over R if it satisfies a monic polynomial equation over R, i.e., if F(t) = 0 for some monic polynomial F(Y) = Yn + AiYn~l -\ h An of some degree n > 0 with coefficients Ai,...,An in R. We may say that t/R is integral to mean that t is integral over R. A subset T of S is integral over R means every t G T is integral over R; again we may indicate this by saying that T/R is integral. By the integral closure of R in S we mean the set of all elements in S which are integral over R. It can be shown that sums and products of integral elements are integral; moreover, any t € R is integral over R since it satisfies the equation Y — t = 0; consequently [cf. §10(E2)]: J the integral closure of a ring R in an overring S is a subring of S, 1 and R is a subring of the said integral closure. It can also be shown that [cf. §10(E2)] J if an overring 5 of a ring R is integral over R, and (^ an overring T of S is integral over S, then T is integral over R. A ring R is integrally closed in an overring S means R coincides with its integral closure in S. Thus a domain is normal means it is integrally closed in its quotient field. Since Z is a UFD, its normality will follow by showing that (J3)

every UFD is normal.

§7: INTEGRAL

DEPENDENCE

79

To put this in proper perspective, let us first show that J for any valuation v : K —> G U {00} of any field K, ) the valuation ring Rv = {x G K : v(x) > 0} of v is normal. To prove this let us extend v to a valuation w of K{Y) thus. For any

A = A{Y) = J2 AiYi G K[Y]

with

^ G /if

let w{A) = min

U(J4,)

taken over all i

and note that if A = 0 then u>(A) = oo and if A ^ 0 then IU(J4) £ G , = the value group of v. For verifying that w satisfies the valuation axioms, we observe that the valuation axioms imply that (J5)

v(l) = 0

and for any finite number of elements x\,..., (J6)

v(xi H

xm in K we have

1- xm) > min(u(a;i),..., v(xm))

and (J7)

if v(xi) < v(xi) for 2 min(ty(i4), iu(B)). Moreover, if A ^ 0 ^ £ then, upon letting i' = min{i : w(A) = v(Ai)} and f = min{j : w(B) = v(Bj)}, and upon letting C = AB = ^ C i ^ 1 with C; G AT, for all I we have

i+j=l

and hence v(Ci) > w(A) 4- w(B), and we also have CV+j/ =Ai>Bj> +

]T i+j=i'+j'

and hence v(d'+ji) (J8)

A B

iJ

and either i < i' or j < j '

= w(A) + w(B). Thus we have

w(AB) = w(A) + w{B) for all A ^ 0 ^ £ in AT[y].

80

LECTURE L3: TANGENTS AND POLARS

Therefore we may unambiguously put w(A/B) that

= w{A) — w(B), and then it follows

w : K(Y) -> G U {00} is a valuation of K(Y) (J9)

such that w(x) = v(x) for all x G K, and the value group of w coincides with the value group of v.

Given a domain R, let us say that R is overnormal to mean that if A(Y), B(Y), C(Y) are monic polynomials in V with coefficients (*)

in the quotient field K of R such that A(Y)B(Y)

= C(Y) e R[Y],

then A(Y) € R[Y] and B(Y) G R[Y}. Using (J8) let us show that J ^ or a n y v a m a t i ° n v : K ^> GU {00} of any field K, [the valuation ring Rv = {x € K : v(x) > 0} of v is overnormal. Namely, since v(l) = 0, we see that if A = A(Y) G K[Y] and B = B(Y) G K[Y] are monic in Y then w(A) < 0 and w(B) < 0, and if C = C(Y) G RV[Y] is monic in Y then w(C) = 0, and hence if also AB = C then by (J8) we get w(A) = 0 = w(B), and therefore A(Y) G RV[Y] and B{Y) G i? v [y]. Obviously (Jll)

every overnormal domain is normal.

In greater detail, let y in the quotient field K of a domain R satisfy C(y) = 0 with nonconstant monic C{Y) G R[Y]; then C(Y) = A(Y)B(Y) with monic A(Y) = Y -y G K[Y] and monic B(Y) = C(Y)/A(Y) G ^ [ F ] , and hence assuming R overnormal we get A(Y) G R[Y], i.e., y £ R. Now (J4) follows from (J10) and ( J l l ) . Given a ring R, let us say that R is a valuation ring to mean that R is a domain which is the valuation ring of some valuation of its quotient field. Given a domain R, let us say that R is an intval domain to mean that R is the intersection of a nonempty family of valuation rings of valuations of its quotient field. Now obviously if a domain R with quotient field K is the intersection of a nonempty (J12)

family of normal (resp: overnormal) domains with quotient field K, then R is normal (resp: overnormal).

By (J10) and the overnormal part of (J12) we see that (J13)

every intval domain is overnormal.

Likewise by (J4) and the normal part of (J12) we see that (J14)

every intval domain is normal.

§8: UNIQUE FACTORIZATION

DOMAINS

81

§8: U N I Q U E FACTORIZATION D O M A I N S Let R be a UFD with quotient field K. Recall that U(R) is the set of all units in R. Let W be a set of irreducible elements in R such that no two elements of W are associates of each other and every irreducible element in R is an associate of some element of W. Then every x £ Kx can be uniquely expressed as x = ux J | zVz{x) zew

with

ux e U{R)

and

vz{x) e Z

such that vz(x) = 0 for all except a finite number of z, with the understanding that in the product the terms which are reduced to 1, i.e., those for which vz(x) — 0 should be disregarded; also put vz(0) = oo. Then vz : K y-> ZLI {oo} is a valuation of K with value group Z, and clearly we have (J15)

R=

f]

RVz

zew

and hence (J16)

every UFD is an intval domain

and now (J3) follows from (J14). As above, we get a valuation wz : K(Y) \-> ZU{oo} when for every A = A(Y) = ^2 Aiyi

G K[Y]

with

Ai€K

we put wz(A) = min vz(Ai) taken over all i and so on. In particular, as in (J8), we have wz(AB)

= wz(A) + wz{B) for all A ^ 0 ^ B in R[Y].

This being so for all z e W, we get the GAUSS LEMMA which says that (J17)

c(AB) = c(A)c(B) for all A + 0 ^ B in R[Y]

where we define the content c(A) of 0 ^ A — A(Y) e R[Y] by putting c(A) = gcd(A 0 , Ai,...) with gcd as in L1(R4). Note that this defines c(A) up to associates in R, and (J17) means that for any values of c(A) and c(B), c(A)c(B) is a value of c(AB). As a consequence of (J17) it can be shown that (J18)

R is a UFD ^> R[Y] is a UFD.

For any positive integer r, by induction and (J18) we see that (J19)

R is a UFD => R[XU...,

Xr] is a UFD

82

LECTURE

L3: TANGENTS

AND

POLARS

and hence in particular (J20)

if is a field =» K[XX,...,

Xr] is a UFD.

As a consequence of (Jl) and (J20) it can be shown that: for any field K and positive integers r,mi,... the if-homomorphism a : K[Xi,...,

,mr,

Xr] -> K[Ti,...,

Tr]

with Xi >-> JT™' is injective, and by identifying K(Xi,..., (J21)

Xr)

{ with its image under the monomorphism /? : K (Xu.

. . , * r ) -> K(TU ...,Tm)

induced by a,

we have that [K(T\,... ,TT) : K(X\,...

,Xr)\ = n\\.. .mr,

and

K[Tu...,Tr) = the integral closure of K[X\,...,

Xr] in K{T\,...,

Tr).

§9: R E M A R K S REMARK (Rl). [Factorization of Univariate Power Series]. Recall that an ED is a domain R with a euclidean function, i.e., a function : Rx —» N such that for all x,y in Rx we have: (Gl) y € xR =^ (a;) < <j>(y), and (G2) 2/ ^ xR => y = qx + r for some q £ R and r £ Rx with ^>(r) < (a;); moreover, if the r (and hence also the q) is unique then we call R a special ED. We have noted that the univariate polynomial ring over a field is a special ED. Now we observe that the univariate power series ring R = K[[X]} over a field K is even a simpler example of a special ED. To see this, it suffices to take (j>(A) = ordx^l for all A = A(X)£RX. It follows that: R is a PID and UFD with U(R) = the set of all power series of zero order, and XlR with i varying in N are exactly all the distinct nonzero ideals in R. Every nonzero nonunit element in R is an associate of a unique power X1 of X with integer i > 0. Every irreducible element in R is an associate of X. Every nonzero element h = h(X) in the quotient field K((X)) of -R has a unique factorization h = aXe where a G U{R) and e is an integer; moreover h e R-& e > 0. The only nonzero prime ideal in R is XR and it is maximal. For any nonzero / = f(X) G R of some order n > 0, clearly K n (fR) = {0} and hence we may identify K with a subring of the residue class ring L = R/(fR), and then upon letting x to be the image of X under the residue class map R —> L we see that l,x,..., xn~l is vector space basis of L over K and hence [L : K} = n; moreover L is a field <=> n = 1. For any 5 C R we put 0 0 0 ( 5 ) = 0 if S C {0} and otherwise we put GCD(S) = the unique power X1 of X which is a generator of the ideal in R generated by S, and for any finite or infinite sequence 3/1,2/2> • • • in i? we put GCD(yi,y2, • ••) =

§10: ADVICE TO THE READER

83

GCD({j/i, 2/2, • • •}); n ° t e that then the GCD is obviously a gcd. Likewise, for any finite sequence yi,.--,ya of elements in Rx with s > 0, by L C M ^ , . . . ,y„) we denote the unique 1cm of j/i, • • •, ys which is a power X% of X. To make the reference to the ring R explicit we may write GCD# and LCM^ instead of GCD and LCM respectively. Finally we note any 0 ^ ( e K((X)) has a unique reduced form C = £/Xl with £ € R and i £ N, and we call this the reduced form of £. It follows that (J22)

K is a field =» K[[X]\ is a UFD.

To justify the last sentence of (T4), as a consequence of (Jl) and (J22) it can be shown that: for any field K and positive integer m, the i^-homomorphism a : K[[X\] -> K[[T\] with X \-> Tm is injective and, by identifying K((X)) 1/3: K((X))

with its image under the monomorphism

-> if((T)) induced by a,

we have that \K((T)) : K{(X))] = m ,and K[[T\] = the integral closure of K[[X)} in A"((T)). REMARK (R2). [Kronecker's Theorem]. Later on we shall prove the converse of (J14) saying that [cf. L4§12(R7)] (J24)

every normal domain is intval.

By (J13) and (J24) we get KRONECKER'S THEOREM which says that (J25)

every normal domain is overnormal.

Note that by (J13) and (J16) we already get the special case of (J25) saying that (J26)

every UFD is overnormal.

§10: A D V I C E TO T H E R E A D E R As I have said before, an elementary treatment of many of the things said above, can be found in my 1990 book [A04] entitled Algebraic Geometry for Scientists and Engineers. Although this Engineering Book is not a logical part of the present book, I recommend it as collateral reading. §11: HENSEL A N D W E I E R S T R A S S We now turn to Hensel's Lemma for more variables, and at the same time we shall deal with the very versatile WPT = Weierstrass Preparation Theorem

84

LECTURE L3: TANGENTS AND POLARS

discovered by Weierstrass around 1870. As a companion to W P T we shall also prove the equally versatile WDT= Weierstrass Division Theorem. Before stating these we note that, following the treatment given in my Local Analytic Geometry book [A06], here our scheme of proof will be WDT => W P T =>• Hensel, all for any number of variables. For a comparison between Hensel's Lemma and WPT, and for a proof of bivariate W P T similar to the proof of Basic Hensel's Lemma (T2) given above, see pages 119-124 of my Engineering Book [A04]. To get ready for stating WDT and WPT, let Xu ...,Xm,7bem+l variables over a field k where m is any nonnegative integer, and consider the power series ring S = k[[Xu...,Xm,Y}}

= R{[Y}}

with

R=k[[Xi,...,Xm]].

As we have seen before, S = S(Xi,..., Xm, Y) G S is a unit in S iff 6(0,..., 0) ^ 0. Likewise e = e(X\,..., Xm) G R is a unit in R iff e ( 0 , . . . , 0) ^ 0. Equivalently h = h(Xi,.. .,Xm) e f i i s a nonunit in R iff h(0,... ,0) = 0, i.e., iff h e M{R) where M(R) is the ideal in R generated by X\,... ,Xm. Note that M(R) is the kernel of the epimorphism R —» k given by h H-> h(0,..., 0) and hence M(R) is a maximal ideal in R. Since M(R) is the ideal of all nonunits, it is the unique maximal ideal in R; see observation (1*) below. We define the Weierstrass degree of any / = f{X\,..., Xm, Y) G S relative to (Y, R) by putting wideg(/) = widegy ifl / = o r d y / ( 0 , . . . , 0, Y). If / * £ S is of the form

/ • = Yd + J2 fjYd~J

with

/ / G M(R)

for

1<3 < dG N

i<j
then we say that / * is distinguished or, in greater detail, we say that / * is a distinguished polynomial (of degree d) in Y over R. Note that then d = d e g y / * = wideg y>fi /* with the understanding that for any ring R and any power series A = ^2ieN AtYl £ R[[Y}} with Ai G R we put degy .A = max SuppyA where by convention the said max equals - c o iff the Supp is empty, i.e., iff A = 0, and similarly the said max equals co iff the Supp is an infinite set, i.e., iff A 0 R\Y). Recall that elements / , g in a ring R are said to be associates in R if for some elements / ' , g' in R we have / = gg' and g = / / ' . Now let us state WDT and W P T together with some supplements.

§Ji: HENSEL AND

WEIERSTRASS

85

SUPPLEMENTED WEIERSTRASS PREPARATION THEOREM (T5). Let X\, . • •, Xm, V b e m + 1 variables over a field k where m is any nonnegative integer, and let S = k[[Xi,. ..,Xm, Y}] = R[[Y}\ with R = k[[Xi,..., Xm}}. Then we have the following. WDT (T5.1). Given any g e S and any / e S with wideg(/) = d G N we can uniquely write g = qf + r where q £ S and r £ R[Y] with degyr < d. W P T (T5.2). Given any / e S with wideg(/) = d £ N we can uniquely write / = Sf* where 6 is a unit in S and / * is a distinguished polynomial of degree d in Y over R. SUPPLEMENT (T5.3). If g e R[Y] and / € S is distinguished then the equation g = qf + r in (T5.1) is the division identity in R[Y], i.e., q S R[Y}. Moreover, if g is also distinguished and r = 0 then 9 is distinguished. SUPPLEMENT (T5.4). If / , g in S are distinguished then: / and g are associates in S & f = g. SUPPLEMENT (T5.5). If / e S is distinguished and / = Fx...Fe with Fi,...,Fe in S then, for 1 < i < e, Fi is an associate in S of a distinguished F* £S , and we have / = Ff . . . Fe*. We shall deduce (T5) from a more general version of it. To start this off, we observe that the above rings S and R are complete Hausdorff quasilocal rings in the following sense. A quasilocal ring is a ring R with a unique maximal ideal, which we denote by M(R). It can easily be seen that J a ring is quasilocal iff all the nonunits in it form an ideal; lathis ideal is then the unique maximal ideal in that ring. As a consequence of (1*) and the valuation axioms L2(V1) to L2(V3) it can be seen that J the valuation ring Rv of any valuation v of a field K [is a quasilocal ring with M{RV) = {x £ K : v(x) > 0}. For ideals B, C in any ring R, the product BC is the ideal in R defined by BC = the set of all finite sums ^ bet with bi e B and c, € C. Similarly for the product B\... Bn of a finite number of ideals B i , . . . , Bn in R. If B\ = • • • = Bn then we write Bn for Bi.. .Bn. In particular B° = R. Note that products and sums of ideals are connected by the distributive law: (Bi -\ (- Bn)C = B\C -\ (- BnC. Let R be a quasilocal ring. R is Hausdorff means f l i ^ ^ ^ ) * = 0. For any h € R we define the .R-order of /i by putting ordfl/i = max{i e N : / i £ M ^ ) * }

LECTURE

86

L3: TANGENTS

AND

POLARS

and we note that then R is Hausdorff means for any h G R we have: ord^h = 0 0 0 h = 0. A sequence x = (xi)i NE and j > NE we have xt - Xj G M(R)E. The sequence x is convergent means it converges to a limit £ G R, i.e., for every E eN there exists NE eN such that for all i > NE we have £ - Xj G M(R)E; we indicate this by some standard notation such as Xi —> £ as i —> 00 or limj_»0Oa;i = £• If i? is Hausdorff then a limit when it exists is unique. R is complete means every Cauchy sequence in it has a limit in it. If R is a complete Hausdorff quasilocal ring and Dj G M(R)u(ri with u(j) —> 00 as j —» 00 then a sequence like ]Cj'<,i> VjNote that if R is a quasilocal ring then the univariate power series ring S = R[[Y]] is a quasilocal ring with M(S) = M(R)S + YS, and if R is complete and Hausdorff then so is 5 . Clearly the rings S and R of (T5) are complete Hausdorff quasilocal rings. In particular R is Hausdorff because for any h G R we have ordn^ = ord(x1,...,J\:m)^The above power series rings are actually local rings in the following sense. If all ideals in a ring are finitely generated then we call it a noetherian ring. We shall eventually give a proof of the famous Hilbert Basis Theorem which says that a polynomial ring over a noetherian ring is noetherian, and then from this, by using WPT, we shall deduce that a power series ring is also noetherian. By a local ring we mean a noetherian quasilocal ring. Eventually we shall prove the Krull Intersection Theorem which says that every local ring is Hausdorff. For S = R[[Y]] with R quasilocal, the above concepts of wideg and distinguished apply except that now for any / = J2i>o ^% e & w ^ t n /i € i? we have wideg(/) = w i d e g y i j / = o r d y ^ ( / ) where i\) : S -> {R/M{R))[[Y]\ is the natural map given by i/>(/) = E,>o
/ is distinguished <=> deg(/) = wideg(/) < 00 and / is monic in Y.

Also note that for any / , / ' in S we have (4«)

deg(//') = deg(/) + deg(/')

and (5«)

wideg(//') = wideg(/) + wideg(/').

Finally note that 5 = Y^i>o < ^ *

G

& w i t ^ ^i G -R is a unit in S iff £0 is a unit in R.

Now let us state the said generalization of (T5).

§ i i : HENSEL

AND

WEIERSTRASS

87

ABSTRACT WEIERSTRASS PREPARATION THEOREM (T6). Let R be a complete Hausdorff quasilocal ring and consider the univariate power series ring S = R[[Y]]. Then we have the following. (T6.1). Given any g £ S and any / £ S with wideg(/) = d £ N we can uniquely write g = qf + r where q £ S and r G R\Y] with degyr < d. (T6.2). Given any / G S with wideg(/) = d £ N we can uniquely write / = Sf* with 5 a unit in S and /* a distinguished polynomial of degree d in Y over i?. (T6.3). If g £ i?[y] and f £ S is distinguished then the equation g = qf + r in (T6.1) is the division identity in R[Y], i.e., q £ R[Y). Moreover, if g is also distinguished and r = 0 then q is distinguished. (T6.4). If / , g in S are distinguished then: / and g are associates in S <=> / = g. (T6.5). If / G 5 is distinguished and / = F i . . . Fe with F j , . . . , F e in 5 then, for 1 < i < e, Fi is an associate in 5 of a distinguished F* £ S , and we have / = F 1 *...F*. Items (T6.1) to (T6.5) are identical with items (T5.1) to (T5.5) respectively, and hence (T5) follows from (T6). To prove (T6.1) we write

/ = E^ yi

with

i>0

^SJR

and upon letting

f = J2f
and

/= E /'yi

i>d

0
we see that / is a unit in S and hence upon letting G =g

and

F = -///

and

Q=

9

/

that

qf = (Yd - F)Q and

(•)

G = J2i>0 and

G yi

i

with

Gi£R

for all i

F = ]T\> 0 FiY1 with F* G M(R) for all i and therefore (T6.1) is equivalent to the following: LEMMA (T7). Let R be a complete Hausdorff quasilocal ring and consider the univariate power series ring S = R[[Y]]. Let G, F in S be as in (•), and let d be any nonnegative integer. Then degy[G — (Yd — F)Q] < d for a unique Q £ S.

LECTURE L3: TANGENTS AND POLARS

Moreover for this Q we have Q = £ u > 0 Q(") where Q G M(R)U[[Y}] are given by the recursive formulas Qe (••)

for all e > 0;

— Gd+e

^(u+1) _ y , „ n(u) r Ve — 2s0<j
Q

e

=

Ee>oQ^y

for all u > 0 and e > 0; for all u > 0.

PROOF OF (T7). To prove uniqueness it suffices to show that d e g y ( r d - F)Q Q

= 0.

If for some u > 0 we have Q G M{R)U[[Y}}, i.e., upon letting Q = £ \ > 0 QjY* with Qi G -R we have Qi G M(i?) u for all i, then for any e > 0, by the above degree condition, we get 0 = coeff of Yd+e in (Yd - F)Q = Qe + terms in and hence Qe G M(R)U+1. e > 0 we have Qe G M{R)U To prove existence we E„>o Q ( u ) w e h a v e d e Sv[G degv

G

M(R)U+1

Thus by induction on u we can conclude that for all for all u > 0 and hence Qe = 0. Therefore Q = 0. try to find Q{u) G M(.R)"[[y]] such that for Q = - ( F d - F)Q] < d, i.e., l u) F Y,Q{u)\Yd+\Y,Q - \ iu>0

lu>0

Solving degy[G - Q(°>yd] < d w e get Q<°> = £ e > 0 Q i ' y e w i t h Qe = G d+e for all e > 0. It remains to find Q G M(i?)"[[y]] for all u > 1 such that degy

£QM i«>i

£Q(U)

0

Since F G M(i?)[[y]], we can successively find Q{u) by "solving modulo M(R)U." Upon doing this by recursively letting Qi" + 1 ) = J2o<j 0 and e > 0, we see that the above degree inequality is satisfied by taking Q^ =

£e>o£eU)yeforallu^0PROOF OF (T6). As said above, (T6.1) is equivalent to (T7). The first assertion in (T6.3) follows from the uniqueness part of (T6.1) and the second assertion follows from (3*) to (5*). To prove (T6.4) let / , g in S be distinguished, and assume that they are associates in S. Upon relabelling / , g we may suppose that wideg(g) = e < d = wideg(/). Let ip : S —» (i?/M(i?))[[y]] be the natural map whose restriction to R is the residue class epimorphism R —> R/M(R). Since / and g are associates we have g = ff for some / ' G S. Applying tp to the equation g = ff we get the equation

%11: HENSEL AND

WEIERSTRASS

89

Ye = Ydip(f) in (R/M{R))[[Y]} and hence e = d. Now g - f = / ( / ' - 1) with degy(
in (T6).

Next we come to: ABSTRACT HENSEL'S LEMMA (T8). Let R = k[[Xi,.. .,Xm}} where m is any nonnegative integer and k is an algebraically closed field, or more generally let R be any complete Hausdorff quasilocal ring whose residue field R/M(R) is algebraically closed. Let : R —» R/M(R) be the residue class epimorphism and let ~4> : R[Y] -* (R){Y} be denned by putting ^ ( / ) = J2i>o (fi)yi for a11 / = E i > 0 fiY* G R[Y] with fi G R. Let F(Y) = Yn + Y^

AjYn-j

with

n G N+

and

A, G R.

l<j
Then we have the following. (T8.1) Let a i , . . . , ah be the distinct roots of (j)(F(Y)) in
HF(Y))=

n (Y-air. \
Then there exist unique monic polynomials F\{Y),..., Fh(Y) of degrees e i , . . . , e„ in y over i? such that F(Y) = Fi(Y).. -Fh(Y) and ~fi(Fi(Y)) = {Y - a t ) e i for l(R), then there exist unique monic polynomials G(Y) and ff(F) of degrees r and s in Y with coefficients in R such that F(K) = G(Y)H(Y) with 0(G(K)) = G(F) and 4(H(Y)) = F ( F ) . PROOF. By (T6.2) and (T6.3) we see that (T8.3)

(T8.2) is true when H(Y) = Ys.

Now it is easy to see that (T8.3) =$• (T8.1) => (T8.2).

90

LECTURE L3: TANGENTS AND POLARS

§12: DEFINITIONS A N D EXERCISES DEFINITION (Dl). [Minors and Cofactors]. Consider an m x n matrix

(An

A=

/ill

\A„

Aln\

... Aij ...

.. .

-**-ij , . .

•

Jly

-^mrt/

over a ring R. By the r x s submatrix of A located at the ( « i , . . . , ur; v\,..., vs)th spots, where r > 0 and s > 0 with 1 < u\ < • • • < ur < m and 1 < v\ < • • • < vs < n are integers, we mean the r x s matrix B = {Bpq) over R given by Bpq = AUpVq; if r = s then we may call B a square submatrix of A of order r. By the rank of A, denoted by rk(j4), we mean the largest order of its square submatrix whose determinant is a unit in R. By the row rank of A, denoted by rrk(j4), we mean max r such that A has an r x n submatrix of rank r. By the column rank of A, denoted by crk(A), we mean max s such that A has an m x s submatrix of rank s. If A is a square matrix, i.e., if m = n, then for 1 < i < n and 1 5: J < n, by the (i,j)-th square submatrix of A we mean the (n — l ) x ( n — 1) square submatrix of A obtained by deleting the i-th row and j-th. column, i.e., located at the ( 1 , . . . , i - l , i + l , . . .,n; 1 , . . . ,j-1, j + 1,.. .,n)-th spots, and (—l) t+J times its determinant is called the (i,j)-th cofactor of A. The term cofactor is motivated by (El) below. The word minor is used as a synonym for the determinant of a square submatrix. Thus, by a minor of A of order r we mean the determinant of a square submatrix of A of order r. Likewise, if m = n then the (i,j)-th cofactor of A is (_iy+J times the (i,j)-th minor of A. EXERCISE (El). Show that the determinant of an n x n matrix A = {Aij) over a ring R can be calculated by expanding in terms of any row or any column, i.e., show that for 1 < i < n and 1 < j < n we have det(A) 2^,\<j
%12: DEFINITIONS

AND

EXERCISES

91

of the matrix of cofactors of A, i.e., if we denote the adjoint of A by A* = (A*^) and the (i,j)-th cofactor of A by Cy- then A*j = Cji. For elements ai,..., an in R, by d i a g „ ( a i , . . . , a „ ) or diag(ai,... ,a„) we denote the nxn diagonal matrix whose (i,j)-th entry is a, or 0 according as i = j or i ^ j . By the nxn identity matrix we mean the matrix d i a g n ( l , . . . , 1); we may sometimes denote this by l n . For the mxn matrix A = (A^) and any r € R, by r A we denote the mxn matrix whose (i,j)-th entry is rAij. This makes the set MT(m x n, R) of all m x n matrices over R into an .R-module. For a field k, it converts MT(m x n, A;) into an mn dimensional vector space over k. EXERCISE (E3). Show that units in the group o f n x n matrices over a ring R are those matrices whose determinants are units in R, i.e., prove claim (M4). Hint: for an n x n matrix A = (Aij) over R, show that AA* = A*A — det(vl)l„ where A* is the adjoint of A and l n is the nxn identity matrix. EXERCISE (E4). Show that for n x n matrices over a field k with positive integer n, the nonzero scalar matrices form a normal subgroup of the group of all matrices of nonzero determinant, i.e., prove claim (M5). EXERCISE (E5). Prove Euler's Theorem on Homogeneous Polynomials, i.e., show that for any homogeneous polynomial H = H(Z\,..., ZN) of degree d in a finite number of variables Z\,..., ZN over a field k we have ]CiA: Rn ^ Rm given by cf>A(y) = Ay for all y € Rn = MT(n x 1, R). Now A i-> cj>A gives a bijection of MT(m x n,R) onto the set of all -R-linear maps Rn —> Rm. Moreover, for any B £ MT(n x r, -R) we have 4>AAB- In c a s e R is a field k, we have rrk(^4) = dimfeim((/>/i) = crk(^4) = n — dimfcker(0^). In particular if -R is a

LECTURE LS: TANGENTS AND POLARS

92

field k and m = n then: A is a linear fc-automorphism of Rn «=> det(A) ^ 0. DEFINITION (D4). [Linear and Polynomial Automorphisms]. Consider the rings P = k[Xi,.. .,Xn] C k[[Xi,... ,Xn}] = S where Xi,...,Xn are a finite number of variables over a field k. Let A = (Ay) be the m x n matrix depicted in (Dl) where we take R = k and m = n. Let N = n2 and let ( P such that c ^ X , ) = YLi<j 0}. Hence by (E8) below, cr^ uniquely extends to an automorphism T,I : S —> 5. We call T^ a fc-linear automorphism of S. If .A can be chosen so that for a given nonzero F*(Z\,..., ZJV) £ T we have F * ( a i , . . . , ajv) ^ 0 then we say that TA is a generic &:-linear automorphism of S. Given any 0 ^ F £ S, upon letting ordgF = v, we have that v is a nonnegative integer and we can uniquely write F = F(XU ...,Xn)=

FV(XU ...,Xn)

+ (terms of degree > v)

where Fv = FV(X\,..., Xn) s P i s a nonzero homogeneous polynomial of degree v. We call Fv the initial form of F relative to S or relative to (Xi,..., Xn), and denote it by infos-F or info^,...^,!)-^ w e a l s o P u t infosO = 0. Geometrically speaking Fv = 0 is the tangent cone of the analytic hypersurface F = 0 at ( 0 , . . . , 0). Now suppose n > 0. If for F"(Zi,..., ZN) = Fv(Zln,..., Znn) we have F"(au..., aN) ^ 0 then clearly TA(F)(XI,

..., Xn) = aA(Fv)(Xi,...,

Xn) + (terms of degree > v)

where OA(FV)(XI, . . . , Xn) £ P is a nonzero homogeneous polynomial of degree u with <JA(FV)(0, . . . , 0,1) ^ 0, and hence OV&STA(F) = v with inhsTA(F) = GA(FV) and upon writing ( X i , . . . , Xm, Y) for ( X i , . . . , X„) we get wideg(r J 4(F)) = v. If fc is infinite then by taking G = F*F'F" and applying (E6) we will have found a generic fc-linear automorphism TA '• S —> S such that wideg(r/i(F)) = v. Without assuming k to be infinite, we claim that we can always find a polynomial k-automorphism r : S —> 5 such that wideg(r(F)) < oo, where an automorphism r : S —> S1 is called a polynomial /c-automorphism if it is a /c-automorphism with T(P) = P; note that then we must have T(P+) = P+. TO see this, for any E = ( e i , . . . , e n ) € N™ with e n = 1 let S. Writing F = F(X1,...,Xn)= with FiGk

^FiX*

and X* = X[l ... Xln", we put Supp(F) = {i € N n : F4 ^ 0} and call

§12: DEFINITIONS

AND EXERCISES

93

this the support of F. Note that Supp(F) is nonempty because we assumed F ^ 0; also note that if F e P then Supp(F) is finite. Let us lexicographically order N n by saying that for i ^ j in N™, upon letting t to be the smallest subscript with it ^ jt we have: i < j •& it < jt- In this ordering any nonempty set has a smallest element and any nonempty finite set has a largest element. So let r be the smallest element of Supp(F), and in case of F £ P let s be the largest element of Supp(F). Upon letting deg and degx n be the {X\,..., X n )-degree and the X„-degree respectively, by (E9) below we see that if

{

^n—vrn—v

&n—u -> 2-/0
for 1 

degXn(aE(Xr'))

Supp {F)

= degXn(aE{Xr))

max

<

= e\r\ + ••• + enrn < oo, and in case

s ' e S u p p ( F ) 2-^0
e

™—v\sn-v ~

s

n—v)

for 1 

degXn(aE(X°'))

and hence deg(aE(F)) = degXn(crE{F)) = degXn(aE(Xs)) = eisi + • • • + ensn. Thus, in addition to finding E so that wideg(rE(F)) < oo, in case of F £ P we have also found E so that deg(cr£(F)) = degXn(aE(F)), i.e., aE(F) is regular in Xn in the following sense. Given any 0 ^ F £ P, upon letting deg(F) = fx, we have that fj, is a nonnegative integer and we can uniquely write F = F(XU ...,Xn)

= Fli(X1,...,Xn)

+ (terms of degree < /x)

where FM = F^(Xi,... ,Xn) £ P i s a nonzero homogeneous polynomial of degree (j,. We call Fu the degree form of F relative to P or relative to (Xi,..., Xn), and denote it by defopF or defo(x1,...,xn)-P1i w e a l s o P u t defopO = 0. F is regular in Xn means deg(F) = deg x (F), i.e., equivalently Fu(0,... ,0,1) ^ 0. Geometrically speaking F^ = 0 is the infinite portion of the hypersurface F = 0, and F is regular in Xn means the X n -axis, i.e., the line X\ = • • • = X n _ i = 0 does not meet this portion. If k is infinite then by taking Fu for F" in the above argument, we can arrange that VA{F) is regular in Xn and has the same degree as F . All this can be applied to any finite family of polynomials or power series by letting F be the product of that family.

94

LECTURE L3: TANGENTS AND POLARS

EXERCISE (E8). In (D4) show that any k-linear automorphism a : P -> P with c(-P+) = P+ can be uniquely extended to an automorphism r : S —> S. EXERCISE (E9). In (D4) show that (*) => (**), and show that (') => (")• EXERCISE (E10). Let Yi,...,Yn be a finite number of indeterminates over a ring R, and let I be the ideal in S = R[[Y\,..., Yn]] generated by Y\,..., Yn. Show that then S is /-complete and /-Hausdorff according to (D5) below. Also show that if R is quasilocal then S is quasilocal with M(S) = M{R)S + I. Finally show that if R is complete Hausdorff quasilocal then so is S. EXERCISE ( E l l ) . Give details for the proof of Abstract Hensel's Lemma (T8). DEFINITION (D5). [Hausdorff Modules and Bilinear Maps]. Let us now generalize the notions of Hausdorff and complete from the case of quasilocal rings to more general situations. So let / be an ideal in a ring R and let V be an ZJ-module. V is /-Hausdorff, or Hausdorff relative to / , means (li^PV = 0, where for any ideal J in R, by JV we denote the submodule of V given by JV = {J2jivi '• Ji G J and Vi G V (finite sums)}. For any h G V we define the (/?,/)-order of h by putting ord( fl /)/i = max{j G N : h G PV} and we note that then V is /-Hausdorff means for any h £V we have: oid^nj-jh = oo «=> h = 0. A sequence x = (xi)i NE and j > NE we have xt - Xj G IEV. The sequence x is /-convergent, or convergent relative to / , means it converges to an /-limit £ G V, i.e., for every E G N there exists NE G N such that for all i > NE we have £ — Xi G IEV; we indicate this by some standard notation such as Xj —> £ as i —> oo or limj^ooXj = £. If V is /-Hausdorff then a limit when it exists is unique. V is /-complete means every /-Cauchy sequence in it has a limit in it. If V is /-complete and /-Hausdorff and yj G /"(•?') V with u(j) —> oo as j —-> oo then a sequence like Yli'<ji' Vj- ^ R is quasilocal with I = M(R) and V = i? then, according to the previous definitions of the above notions, the adjective "relative to /" may be dropped. Let U, V, W be modules over R. A map / : V x W —> U is /Z-bilinear means for every » e 7 the map W —> L/ given bywi-» /(u, w) is .R-linear and for every w £W the map V —* U given by t> t-> f(v,w) is .R-linear. In a natural manner we can regard U/(IU),V/(IV),W/{IW) as modules over the ring R/(IR), and then in a natural manner / induces a (/?///?)-bilinear map (V/(IV)) x (W/(IW)) -> U/(IU). The following three Exercises deal with the above set-up. In particular, in (E14) we remove the hypothesis of R/M(R) being algebraically closed from (T8.2). EXERCISE (E12). [Completeness Lemma]. Let / be an ideal in a ring R,

%1S: DEFINITIONS AND EXERCISES

95

let U, V, W be any /^-modules, and let n be any positive integer. (E12.1) Show that if V is /-Hausdorff then Vn is /-Hausdorff. (E12.2) Show that if V is /-complete then Vn is /-complete. (E12.3) Show that if (xt) is an /-Cauchy sequence in V such that some subsequence (xr(j)) has an /-limit x in V, then x is an /-limit of the sequence (a;,). (E12.4) Show that for any it-linear map : W -> V we have is surjective then we have {InW) = InV. (E12.5) Show that for any ^-bilinear map 9 : V x W - • £/ we have 0 ( / n K x P W ) C /"?/. (E12.6) Show that if W is /-complete and there exists a surjective .R-linear map W —> V then V is /-complete. (E12.6*) Show that (E12.6) is not true with Hausdorff replacing complete. (E12.7) Show that if V is a finite /^-module and R is /-complete then V is /-complete. HINT. (E12.1) to (E12.5) are straightforward. For (E12.6*) take W = R = k[X] where X is an indeterminate over a field k, let / = XR, and let

V = R/(X - 1)R. (E12.7) follows from (E12.2) and (E12.6). To prove (E12.6), let (xi) be any Cauchy sequence in V. Then we can find positive integers r(l) < r(2) < . . . such that for all j , m, n in N+ with m > r(j) and n > r(j) we have xm - xn G P'V. Pick any y\ G 0 _ 1 ( x i ) . For all j > 1, by (E12.4) there exists 2/j G (rj). Then by (E12.4), £ is an /-limit of (xr(j)) in V, and hence by (E12.3), £ is an /-limit of (xi). EXERCISE (E13). [Hensel's Sublemma]. Let / be an ideal in a ring R such that R is /-complete. Let U, V, W be finite /^-modules such that U is /-Hausdorff. Let : R -> R = R/I, fa : U - • !7 = E//(J^), 0y : V -» V = V/(/V), <j>w :W^>W — W/(IW) be the canonical epimorphisms. Let 9 : V x W —> [/ be an /^-bilinear map and let 9 : V x W —> U be the /i-bilinear map induced by #. Let there be given F GU with G G 7 and # G W such that (i) fa(F) =j?(G,/7) and (ii)J/_= ^(G, W)_+ 0(7,H\_ where as usual 0(5", IF) = {0(G, W) : W G F } and 9(V,ll) = {9{V,H) : V G F } . Show that then there exist G G (<M - 1 ({G}) and H G ( ? w ) - 1 ( W ) s u c h that F = 9(G,H). HINT. Let G0 = H0= 0. By induction on I > 1 we find (i*) G/ G (0 V ) _1 ({G"}) and Hi G ( ^ ^ ( { t f } ) such that: (ii*) G1-G1-1 G J'- X V and /// - / / ; _ i G J ' " ^ with: (hi*) F - 9(Gi,Hi) G IlU. By (i) we can do it for 1 = 1. So let Z > 1 and assume we have done it for I — 1. Then F — 0(G;_i,//;_i) G Il~1U and hence F — 0(Gi-i,Hi-i) = J2£jXj with £j G / i - 1 and Xj G Z7 where the sum is finite. By (ii) we find yj G V and Zj G W such that Xj — 9(G,Zj) — 9(yj,H) G IU. Now it suffices to take G; = Gi~\ + Y^^jVj a n d Hi = H1-1 + £ £ j z j - I n v i e w °f (ii*)) by

96

LECTURE

L3: TANGENTS

AND

POLARS

(E12.7), (Gi) and (Hi) have /-limits G and H in V and W respectively. In view of (i*), by (E12.4) we see that G £ ( ^ v ) - 1 ({ : R —> R — R/M(R) be the residue class epimorphism and let (f>: R[Y] —> R[Y] be the obvious extension of cf>. Let F(Y) = Yn+

J2

AjYn-j

with

n £ N+

and

A, £ R.

l<j
Assume that v : V -» V = V/{IV), ~<$>w : W —> W = Wy(iW) be the epimorphisms induced by <£. Let ^ : V x W —> U be the i?-bilinear map given by multiplication, i.e., given by (V, W) H-> VW. Let 0 :V xW -* U be the i?-bilinear map induced by 6. In view of (E12.1), by (E13) we find G(Y), H(Y) in R[Y] of degree < r and < s respectively such that $(G{Y)) = G{Y) and ~$(H{Y)) = H{Y) with F{Y) = G(Y)H(Y). It follows that the degree of G(Y) is r and therefore for the coefficient G' of Yr in it we have {H') = 1. Therefore G',ff' are units in R with G'/T = 1. Now it suffices it to take G{Y) = G{Y)/G' and H{Y) = H(Y)/H'. EXERCISE (E15). Let D be a derivation of a field K. Let y, z be nonzero elements of K such that zr = y where r is a rational number. Show that then D(y) = r z r - 1 D ( « ) . OBSERVATION (01). I am not saying that given y and r this defines z uniquely. The matter is made precise by saying that for some way of writing r = m/n with m € Z and n € N + , and for some x £ K with y = xm, we have z = xn, and then interpreting the equation D{y) = rzr~1D{z) as the equation zD(y) — ryD(z). EXERCISE (E16). Prove Binomial Theorem (Tl) for fractional exponents. In other words, for any rational number r = m/n with m £ Z and n £ N+, upon letting

y(X) = i + £

r(r 1)

- -.,(r-*

+ 1)

X* e MM

%12: DEFINITIONS AND EXERCISES

97

where X is an indeterminate over a field k of characteristic zero, show that y{X) is the unique element in A;[[X]] with y(0) = 1 such that (1 + X)r = y(X)

HINT. By taking F(X,Y) there exists a unique

(1 + X)m =

or equivalents =Yn-{l

+ X)m

y{X)n.

in Basic Hensel (T2) we see that

y[X) = 1 + 5 ] Vix* e Ml*]]

with

Vi €

k

i>0

such that (l + X)m = y(X)n i.e.,

{i + xy = i + YlViXiek[[x}}i>0

For any i > 0, in view of (E15) and (01), differentiating both sides i times with respect to X and then putting X = 0 we get r(r - 1 ) . . . (r - i + 1) = i!j/j. EXERCISE (E17) Prove Binomial Theorem for integral exponents. In other words, show that for any n £ N and any elements x, y in any ring R we have xlyn-1 0
where (™) is the number of ways of choosing i things out of n things, i.e., the size of the set of all i size subsets of a set of size n. Also show that /n\ \iy

n! i\(n — i)\

n(n — 1 ) . . . (n — i + 1) i\

HINT. In addition to the usual high-school proof, give a second proof using (E16). OBSERVATION (02). Newton first proved (E17), then he generalized it to (E16), and finally he generalized it to (T4). EXERCISE (E18) Show that a univariate polynomial ring over a UFD is a UFD, i.e., complete the proof of (J18). Also complete the proofs of claims (J21) and (J23) about power series rings.

98

LECTURE

L3: TANGENTS

AND

POLARS

OBSERVATION (03). From this lecture, it only remains to do Homework (H3) and to complete the proofs of claims (Jl), (J2), and (J24). §13: NOTES NOTE (Nl). [Preparation for W P T ] . To take care of the finiteness of wideg in WPT, consider the power series ring S = fc[[-X"i,... ,Xn]] in a finite number of variables X\,..., Xn over a field k. Given 0 ^ F = F(X\,..., Xn) € S, we want to find an automorphism r : S —> S such that after taking (Xi,...,Xm,Y) for (Xi,... ,Xn) we would have wideg(r(F)) < oo. If k is infinite then by (D4) and (E8) we can achieve this by means of a generic fc-linear automorphism r = TA of S; geometrically speaking what we do is to rotate coordinates so that a generic line not on the tangent cone of F = 0 becomes the X n -axis, i.e., becomes the line Xi = • • • = X n _ ! = 0. Without assuming k to be infinite, by (D4) and (E9) we can always achieve it by means of a polynomial fc-automorphism T = TE of S; what we do is that instead of varying the coefficients of the equations of the automorphism over the field, we suitably vary their exponents over N+ which is an infinite set. NOTE (N2). [Abhyankar's Proof of Newton's Theorem]. What we called Shreedharacharya's Proof of Newton's Theorem, is also known as Abhyankar's Proof of Newton's Theorem; see pages 575 and 659 of volume II of Jacobson's Basic Algebra Book [Jac]. §14: CONCLUDING N O T E Analytic geometry was initiated by Descartes in 1637. Descartes' work was pushed further, first in France (1760-1790) by Bezout, then in England (1840-1890) by Cayley, Sylvester, and Salmon, then in Germany (1860-1910) by Plucker, Klein, and Noether, and simultaneously in Italy (1860-1920) by Cremona, Castelnuovo, Enriques, and Severi, and finally in the United States (1930 onwards) by Zariski, Weil, Chevalley, and others. Somewhere along the line, in this long march, analytic geometry changed its name to algebraic geometry. By stressing synthetic arguments and augmenting ordinary space by inserting points at infinity, analytic geometry is metamorphosed into projective geometry. As said above, analytic geometry and projective geometry have mostly been eliminated from the college curriculum in the last forty years or so. But I was lucky. In my time I had four years of analytic geometry. Also I had two years of projective geometry. In the one year course in Bombay College, my text for projective geometry was the classic Pure Geometry book [Ask] of Askwith. This was followed up by the one year projective geometry course I had at Harvard with Zariski where our text was the equally famous book [VYo] of Veblen and Young. Out of all these illustrious names, whom shall we elect as the father of algebraic geometry? By reading the relevant material in Klein's

%14: CONCLUDING NOTE

99

famous Entwicklung der Mathematik book [Kle] of 1910 which is reconfirmed in Karen Parshall's recent book [PaR] on Sylvester's Correspondence, I am convinced that George Salmon of Dublin, Ireland, is the true father of algebraic geometry and his 1852 book [Sal] on Higher Plane Curves is the cradle of that subject. In any case, on page 17 of Askwith's book, you will find Salmon's generalization of the pole-polar property of a circle.

Lecture L4: Varieties and Models §1: RESULTANTS A N D D I S C R I M I N A N T S One way of proving Bezout's Theorem, and of doing Homework L3(H3), is by using resultants and discriminants. Assuming n, m to be nonnegative integers, the y-Resultant of two polynomials f(Y)=a0Yn

+ a1Yn-1+---

m

g(Y)=b0Y

m 1

+ b1Y -

+ an

+ --- + bm

is the determinant R-esy(/,ff) = det(Resmaty (/,#)) of the n + m by n + m matrix / ao a\ • 0 ao a\

Resmaty(/,#) =

0 0 • b0 h • 0 b0 h

an 0 • • an 0

ao oi • • bm 0 •

b0 h

\ 0 0

brr,

0

0 0

bm J

where the first m rows consist of the coefficients of / and the last n rows consist of the coefficients of g. More precisely, the first row starts with the coefficients of / , these are shifted one step to the right to get the second row, shifted two steps to the right to get the third row, and so on for the first m rows, then the (m + 1)st row starts with the coefficients of g, these are shifted one step to the right to get the (m -f 2)-nd row, and so on for the next n rows. The matrix is completed by stuffing zeroes elsewhere. The determinant Resy(/, g) is sometimes called the Sylvester resultant of / and g because it was introduced by Sylvester in his 1840 paper [Syl] where he enunciated the following: BASIC FACT (Tl). If the coefficients aj, bj belong to a domain R then we have: Resy(/, g) = 0 ^ n + m ^ 0 and either a 0 = 0 = bo or / and g have a common root in some overfield of R.

100

%1: RESULTANTS AND DISCRIMINANTS

101

In case n > 0, the F-Discriminant of / is defined to be the F-Resultant of / and fY, i.e., DisCy(/)=ReSy(/,/y). where we view fy to be the polynomial fY(Y)

= na0Yn-1

+ (n - l ^ F " " 2 + • • • + a„_i

i.e., we let the discriminant to be the determinant of the appropriate 2n — 1 by 2n — 1 matrix without considering whether nao equals zero or not. From the Basic Fact (Tl) we deduce the following: COROLLARY (T2). If n > 1 and the coefficients a* belong to a domain R then: Discy (/) = 0 •*=> either ao = 0 or / has a multiple root in some overfield of R. OBSERVATION (01). [Resultant and Projection]. If XU...,XN are indeterminates over a field k with N £ N + and R is either the polynomial ring k[X\,... ,XN] or the power series ring k[[Xi,... ,XN}}, then Hesy(f,g) equals a polynomial or power series $ = &(Xi,..., XN). If ao and bo are in kx with nm ^ 0 and k is algebraically closed then, in the polynomial case, by the Basic Fact it follows that the hypersurface $ = 0 in the N-space of (X\,..., XN) is the projection of the intersection of the hypersurfaces / = 0 and g = 0 in the (TV + l)-space of (Xi,..., XN, Y). Moreover, without assuming k to be algebraically closed but assuming that ao and &o are nonzero constants, in the polynomial case as well as the power series case, by the Basic Fact it follows that: $ is identically zero (i.e., $ is the zero element of R) •*=> / and g have a nonconstant common factor in R[Y]. OBSERVATION (02). [Discriminant and Projection]. Again ifXx,...,XN are indeterminates over a field k with N e N+ and R is either the polynomial ring k[X\,..., XN] or the power series ring fc[[Xi,..., XN]], then Discy (/) equals a polynomial or power series A = A ( X i , . . . , XN)- NOW if ao is in fcx with n > 1 and k is algebraically closed then, in the polynomial case, for all values (Ui,... ,UN) of (Xi,..., XN) in k, the equation / = 0 has n roots which may or may not be distinct, and by the Corollary it follows that these roots are not distinct iff A ( f / i , . . . , UN) = 0. In other words, when we project the hypersurface / = 0 in (N + l)-space onto the TV-space, above most points there lie n points, and A = 0 is the locus of those points above which there lie less than n points. Moreover, without assuming k to be algebraically closed but assuming that ao is a nonzero constant, in the polynomial case as well as the power series case, by the Corollary it follows that: A is identically zero -S4- / has a nonconstant multiple factor in .ft[y]. OBSERVATION (03). [Isobaric Property]. View the coefficients a^bj as indeterminates over Z. Give weight i to a*, and j to bj. Then 0 ^ Resy(/,g) €

LECTURE L4: VARIETIES AND MODELS

102

Z[ao) • • •, o-n, bo, •.., bm] is isobaric of weight mn, i.e., for the weight of any monomial r aQ°... ajj" &o° ...b&r occurring in R e s y ( / , g) we have ( E o < r < n v ) + (Eo< s<m S39> ~~ mn. In particular, the principal diagonal affb^ has weight mn, and it does not cancel out because there is no other term of 6TO-degree n in the resultant; the principal diagonal of an N x N matrix (Ay-) is the term A11A22 • • -ANN; [cf. §12(R6)(14) and its proof following §12(R6)(15)]. The resultant being isobaric of weight mn is the fundamental fact behind various cases of Bezout's Theorem. The following two Observations, where we use the set-up of Observation (01), illustrate this for plane curves and general hypersurfaces respectively. OBSERVATION (04). [Plane Bezout]. Let N = 1 with X = Xx, and assume that ao, bo are nonzero elements in k, and / , g are polynomials of total (X, Y)degrees n and m respectively. By the isobaric property we see that then always d e g x $ < mn and "in general" d e g x $ = mn. Hence the n-degree plane curve / = 0 meets the m-degree plane curve g = 0 in mn points "counted properly." The possibility of d e g x $ < ran is explained by saying that some intersections have "gone to infinity." OBSERVATION (05). [Hyperspatial Bezout]. Let TV be general and assume that cio,bo are nonzero elements in k, and / , g are polynomials of total {X\,... ,X;v,y)-degrees n and m respectively. By the isobaric property we see that then always deg( Xl xN)$ < rnn and "in general" deg(x 1 ,...,x JV )^ = mnHence, in the (N + l)-dimensional space, the n-degree hypersurface / = 0 and the m-degree hypersurface g = 0 meet along a "secundum" (= a subvariety of dimension two less than dimension of the ambient space) which projects onto the (mn)-degree hypersurface $ = 0 in A^-dimensional space. Again the possibility of deg(x 1 ,...,x JV )^ < mn s a y s t n a t s o m e intersections have "gone to infinity." EXAMPLE (XI). [Resultant and Discriminant in terms of Roots]. If the coefficients a*, bj belong to a domain R and ao ^ 0 =fi bo then, upon writing

f(Y) = ao J ] (y-ai)

an

d

g(X) = b0 ]\

l
with ot\,...,an

l<j
and / 3 i , . . . , Pm in an overfield of R, we have

Resy(/,fl)=<« n n («<-&) l<j
l<j<m

\
= (-irn&o n Mi) l<j
(Y - 0j)

§1: RESULTANTS AND DISCRIMINANTS

103

and

Discy(/) = (-l)"/2aS

I]

(^-«i)2-

l
EXAMPLE (X2). [Quadratic Resultant]. Considering the quadratic polynomials / ( F ) = aY2 + bY + c

and

ff(F)

= a ' F 2 4- b'Y + c'

and calculating the 4 x 4 determinant (a 0 a' \0

b c 0\ a bc b' d 0 a'6'c7

we get Resy (/, g) = (a2c'2 + a'2c2) + (b2a'c' + b'2ac) - {abb'c' + a'b'bc) - 2aca'c'. EXAMPLE (X3). [Quadratic Discriminant]. Considering the quadratic f(Y) = aY2 + bY + c and calculating the 3 x 3 determinant a b c^ 2a b 0 ,0 2a bj we get Discy(/) = — a(b2 — Aac). EXAMPLE (X4). [Cubic Discriminant]. Considering the cubic f(Y) = a0Y3 + axY2 + a2Y + a 3 and calculating an appropriate 5 x 5 determinant we get Discy(/) = —ao(a\a\ — Aa^a\ — ka\az — 27a^a\ + 18ao«ia2«3)EXAMPLE (X5). [Special Quartic Discriminant]. Considering the quartic f(Y)=Y4+pY2

+ qY + r

and calculating an appropriate 7 x 7 determinant we get Discy(/) = 16p4r - 4 p V - 128pV + lUpq2r - 27q4 + 256r 3 .

104

LECTURE L4: VARIETIES AND MODELS EXAMPLE (X6). [General Quartic Discriminant]. Considering the quartic f{Y) = aQY4 + aiY3 + a2Y2 + a3Y + o 4

and calculating an appropriate 7 x 7 determinant we get Discy(/) = a0(256a0ia4i - 192a^aia 3 a| - 128aga|a| + 144aga2a§a4 - 27ala\ <

+lAAaoa\a,2a\ — 6000^0304 — 800001030304 + 1800010203 + 16aoa|o4 —4000^03 - 27a\a\ + 1801020304 — 4afa 3 - 40^0^04 +

ajalal).

OBSERVATION (06). [Future Plans]. Further details of the concrete explicit approach to equation solving expounded in this section, including proofs of the various claims, will be given by and by. First let us push forward the abstract approach via ideals and varieties. §2: VARIETIES The concepts of curves, surfaces, and hypersurfaces give rise to the idea of a variety. A variety consists of the common solutions of a finite number of polynomial equations in a finite number of variables. To study these in any detail we need to build up some facts about ideals in a ring. For instance, why we do not get anything extra by allowing an infinite number of equations is explained by proving that every ideal in a multivariate (finitely many variables) polynomial ring over a field is finitely generated. This is called the Hilbert Basis Theorem. It is one of the numerous versatile theorems of commutative algebra obtained by Hilbert [Hil] in the period 1890-1920. As said before, a ring in which every ideal is finitely generated is called a noetherian ring. This is in honor of Emmy Noether who, in 1925-1935, stressed the importance of that property. In a moment we shall prove the Hilbert Basis Theorem, and at the same time we shall prove that the multivariate power series ring over a field is also noetherian. A variety can be decomposed into its (finitely many) "irreducible components." The ideal theoretic counterpart of this will be the "primary decomposition" of ideals in a noetherian ring which we shall take up next. This will also set up an inclusion reversing correspondence between irreducible varieties and prime ideals. Modding out the polynomial ring by the prime ideal corresponding to an irreducible variety will produce the "affine coordinate ring" of the variety; its quotient field is the "function field" of the variety. Global properties of a variety, such as its dimension and degree, are reflected in the affine coordinate ring and the function field. Local properties of a variety "near a point," such as whether the point is simple or singular, are reflected in the properties of the "local ring" of the point on the variety. The construction of the local ring generalizes the formation of the quotient field of a domain. By taking the local ring of an irreducible subvariety of an irreducible variety we can detect whether most points of the subvariety are simple

§3: NOETHERIAN RINGS

105

points of the original variety; in technical terms, this will be so iff the said local ring is "regular." After talking about "localization" we shall resume the discussion of varieties, first in affine space, and then in projective space. The idea of localization will also enable us to reinterpret varieties as "models" which are suitable collections of local rings. In turn, the language of models will facilitate the discussion of various operations on varieties, such as the blowing-up of subvarieties of a variety for resolving its singularities. §3: N O E T H E R I A N R I N G S Let us now prove the: HILBERT BASIS THEOREM (T3). For any noetherian ring R and any positive integer m, the multivariate polynomial ring R[Xi,... ,Xm] is noetherian. PROOF. Clearly R[XU..., Xm] = R[X1:..., Xm-i][Xm} and hence by induction the general case follows from the case of m = 1. So assume m = 1 and let X = X\. Given any ideal J in S = R[X] we want to show that it is finitely generated. Let d denote the X-degree. For every i G N let J, = {r G R : d(s — rXl) d((r's) - (r'r)Jf*) < i with r's G J; and (iii') d(s - rX*) < i =^ d(sX - rXi+1)
(')

a finite number of generators (fij)i<j<e(i) °f J% < together with elements (sij)i<j<e(») m J such that d(sij — rijX1) < i for 1 < j < e(i).

Let Joo = UjeN«/j- Then JQQ is an ideal in R. Again since R is noetherian, we can find a finite number of generators (rj)i<j<e of J ^ together with l(j) G N and (sooj) G J such that d{sooj - TjX1^) < l(j) for 1 < j < e. Take / G N such that I > l(j) for 1 < j < e and let Sj = SoojX1"1^. Then for 1 < j < e we have Sj G J and d(sj - rjX1) < I. Now J for every i > I in N we may take e(i) = e j^and for 1 < j < e we may take r^ = Tj and s^ = SjXl~l. For every i G N, by (') we see that J given any s G J with d(s) = i there exists Oj G R such that \ d ( s - Ei<j< e (i) airijxi)

and hence d s

( ~ Ei<j<e(«)

a

^ ) < *'

106

LECTURE

L4: VARIETIES

AND

MODELS

In view of (1') and ("), by induction we see that (2')

{

given any s G J with d(s) > I there exists aJC G R such that d s

c l - E i < j < e AjSij) l<j<e(i)

Vijsij-

By first applying (2') and then (3'), we see that the elements (sy)i|* i for some s G J } . For all r, r' in R and s, s' in J we have: (i*) d(s - rX*) > i and d(s' - r'X*) > i => d((s + s') - (r + r')**) > i with s + s' G J; (ii*) d(s - rX*) > i => d({r's) - (r'r)X{) > i with r's G J; and (iii*) d(s - rX*) > i => d(sX - r X i + 1 ) > i + 1 with sX G J. Therefore Jj is an ideal in R with Jj C Ji+i- Since i? is noetherian, we can find a finite number of generators {fij)\<j<e{i) °f ^ (*)

together with elements (sy)i<j< e (j)

m

^

such that d(sy — r^ X*) > i for 1 < j < e(i). Let JQO = UjeN^i- Then J ^ is an ideal in R. Again since R is noetherian, we can find a finite number of generators (r.j)i<j<e of Joo together with l(j) G N and (sooj) £ J such that d(sooj - TjX1^) > l{j) for 1 < j < e. Take I G N such that I > l(j) for 1 < j < e and let Sj = SoojX1-1^. Then for 1 < j < e we have Sj G J and d(sj - rjX1) > I. Now for every i > I in N we may take e(i) = e and for 1 < j < e we may take r^ = rj and s^- = SjX1

§4: ADVICE TO THE READER

107

For every i £ N, by (*) we see that

{

given any s £ J with d(s) = i there exists a,j £ R such that <*(* - £ i < j < e ( 0 a3rUxi)

>{and

hence d

( s _ Ei<,'< e «) aJsij) > *•

In view of (1*) and (**), by induction we see that J given any s £ J with d(s) > I there exists a,jC £ R such that \

s

= Y,l<j<eAiSU

w h e r e

A

J = T,l
In view of (1*), by induction we also see that J given any s £ J with d(s) < I there exists 6y € i? such that ["V s

—

2^0 I-

By first applying (3*) and then (2*), we see that the elements (sij)i<j< e (i) constitute a finite number of generators of J. §4: A D V I C E TO T H E R E A D E R The next long section on Ideals and Modules is rather heavy abstract dry stuff. You may prefer to skip it in a first reading and return to it as necessary. At any rate, to soften the blow here is a brief summary. The totality of common solutions of a bunch of polynomial equations / ; = fi{Xi,...,Xff) = 0, with I varying over an indexing set L, in a finite number of variables X\,... ,XN over a field k, is unchanged by replacing the polynomial family (/j)iet by another family which generates the same ideal I in the polynomial ring R = k[Xi,..., X^}. By Hilbert's Basis Theorem (T3) we can choose the other family to be finite. Thus there is no loss of generality to let the variety V of common solutions be denned by a finite number of equations. The variety V may be decomposed into its irreducible components; for instance the variety in the {X,Y) plane given by the equation (X2 + Y2 - 1)(Y2 ~ X2) = 0 has the circle X2 + Y2 = 1 and the pair of lines Y = ±X as its irreducible components. Algebraically this corresponds to making a primary decomposition of / in R which generalizes the fact that an integer is a product of powers of prime numbers. The said decomposition amounts to writing / = Q\ D • • • fl Qn with "primary" ideals Qi,- • -,Qn- The "radical" of Qi is a prime ideal Pi. The pair prime-primary ideal generalizes the pair prime number and its power. The primary decomposition has certain uniqueness properties. The operations of sums, products, and quotients of integers are generalized to sums, intersections (as well as products), and colons (or quotients) of ideals. Most of this further generalizes from ideals to modules. At once dealing with modules has the extra advantage that in various arguments the roles of the first and the second elements in a product rs are separately stressed as r belonging to the ring and s belonging to the module.

LECTURE H: VARIETIES AND MODELS

108

Two circles meet at a point of tangency with "intersection multiplicity" two, a tangent meets a cubic curve at a point of inflexion with intersection multiplicity three, and so on. To algebracize intersection multiplicity we introduce the concept of the length of a module, which itself generalizes the idea of the dimension of a vector space. To formalize the idea of the dimension of a variety we introduce the ideas of the height and the depth of an ideal as well as the idea of the dimension of a ring. §5: IDEALS A N D MODULES To continue the general discussion of ideals and modules started in LI §4 and LI §5, let V be a module over a ring R. The sums and intersections of any (indexed) family of submodules (UI)IEL of V are again clearly submodules of V, where the latter is set-theoretic while the former is defined to be the union U(f//1 -\ h Uin) taken over all finite sequences l\,..., ln in L. Note that the members of an "indexed family" need not be distinct, i.e., we may have Ui — Ui* for I ^ /* in the "indexing set" L. For any B c R and any submodule C of V we put BC = < Y J 6jCj : bi G B and c* G C (finite sums) > and we note that this is a submodule of C. For any elements a, ai, 0 2 , . . . in R and any submodule C of V we put aC = {a}C

and

(ai, 0,2,...)C = {0,1,0,2, ••-}C

and we note that these are again submodules of C. For any submodule U of V we put radyC/ = {r e R: reV C U for some e G N+} and we call this the radical of U in V, and we note that it is an ideal in R. By taking V = R, all the above applies to ideals in R. In particular, sums and intersections of any families of ideals in R are again ideals in R, and the radical radfl/ = {r £ R : r e G I for some e G N + } of any ideal J in R is an ideal in R. The product of any finite number of ideals in R was already introduced in the paragraph following item (2*) of L3§11, and in case of two ideals B and C it agrees with the above definition. For any B C R or a G R, and any C C V, we put (C:B)v

= {vGV:

B{RV) C C}

and

(C : a)v = {v €V : a(Rv) C C}

and we call these the colons (or quotients) of C by B and C by a in V respectively, and we note that if C is a submodule of V then these are submodules of V which

§5: IDEALS

AND

MODULES

109

contain C. For any B c V or a £ V, and any C C V, we put ( C : B ) f t = { r € / J : r( J RS) C C}

and

(C : a) a = {r 6 R : r(ifa) C C}

and we call these the colons (or quotients) of C by B and C by a in i? respectively, and we note that if C is a submodule of V then these are ideals in R. Note that for V = R the two meanings of colon coincide. An element x in R is a zerodivisor in R means xy = 0 for some ?/ € Rx = i?\{0}. Now nonP means the negation of a property P, and so nonzerodivisor means an element which is not a zerodivisor. Note that a domain is a nonnull ring having no nonzero zerodivisor. An element x in R is a nilpotent means xe = 0 for some positive integer e. Just as in the ring of integers, a prime number generates a prime ideal, a positive power of a prime number generates an ideal which is primary as well as irreducible in the following sense. An ideal / in the ring R is said to be primary if I ^ R and for any elements r, s in R with rs £ I and s $ I we have re £ I for some positive integer e. Just as the primeness (resp: maximalness) of I is equivalent to the residue class ring R/I being a domain (resp: field), so the primaryness of / is clearly equivalent to R/I being a nonnull ring in which every zerodivisor is nilpotent. An ideal / in the ring R is said to be irreducible if I ^ R and / cannot be expressed in the form I = J\ C\ • • • fl Je where e is a positive integer and J\,..., J e ideals in R with I r £ P. (iii*) Q is primary and P is prime with radfl<5 = P. By a cyclical proof (i*) =>• (ii*) => (iii*) => (i*) we see that these three conditions are equivalent. When they are satisfied, we say that Q is primary for P, or that Q is P-primary. OBSERVATION (08). [Colons and Intersections of Primary Ideals]. As easy consequences of (07) we see that for ideals P, Q in a ring R we have the following:

110

LECTURE H: VARIETIES AND MODELS

(I*) Q is P-primary and B <£_ Q for ideal B in R =$• (Q : B)R is P-primary. (2*) Q = Qi n • • • n Qh where Q, is P-primary for 1 Q is P-primary. (3*) P is a maximal ideal in R with P c rad^Q ^ R =*> Q is P-primary. (4*) P is a maximal ideal in R and Pe Q is P-primary. OBSERVATION (09). [Primaries a n d Powers of Primes]. In connection with the above claim (4*), let us construct some relevant examples. EXAMPLE (X7). Here is an example of (•') a maximal ideal P and a P-primary Q which is not a power of P . Let e > 1 and m > 1 be integers, and let R = k[[Xi,..., Xm]] where X\,..., Xm are indeterminates over a field k. Let P = M(R) = {X\,..., Xm)R, let W be the set of all monomials of degree e in X\,..., Xm, i.e., set of all expressions X\* ... X%? with nonnegative integers ii,... ,im for which i\-\ \-im = e. Let W be a subset of W which contains X{,..., X^ but is different from W, and let Q be the ideal in R generated by W. EXAMPLE (X8). Also here is an example of (•") a nonmaximal prime P with P which is not P-primary but r a d ^ P = P . In the notation of the Example (X7), let e', e" be positive integers with e' + e" = e, let (f>: R —> R = R/(Xf — X\ Xf )R be the residue class map, and let P be the ideal in R generated by (Xi) and (f>(X2). OBSERVATION (010). [Factor M o d u l e s a n d I s o m o r p h i s m Theorems]. For a module V over ring R, and a submodule U of V, the additive abelian factor group V/U is made into an .R-module in a unique manner so that the residue class map (of additive abelian groups) V —> V/U becomes an P-epimorphism; the resulting P-module V/U is called the factor module; note that U is the kernel of the residue class epimorphism V —> V/U. In Ll§4 we have already dealt with the special case of this when V = R and U = an ideal / in R, and then we made R/I into a ring; now R/I also becomes an P-module. For any P-epimorphisms <\> : V —> V and V* where V, V',V* are P-modules, if ker(4>*) C ker(0) then clearly there exists a unique P-epimorphism <j>' : V* —* V such that (j> = 4>'(j>*; conversely if there exists an P-epimorphism 4>' : V* —> V such that ^ = 4>'(p* then clearly we must have ker(*) C ker(^). (1*') For any P-epimorphism (j>: V —> V (of P-modules) and any P-submodule T of V, let ker(<£) = U with 0(T) = T', and let a : T -> V be the map induced by (f>, i.e., given by a(x) = <^>(:r) for all x £ T; then clearly a is an P-epimorphism with

§5: IDEALS AND MODULES

111

0 _ 1 ( T ' ) = U + T and ker(a) = UC\T, and hence induces a unique .ft-isomorphism a' : T/(U f l T j ^ r such that a = a1 a* where a* :T -> T/(U n T) is the residue class epimorphism; also clearly <j)(U + T) = T" and for the map P : U + T —• T' induced by 0 we have that /3 is an i?-epimorphism with ker(/3) = U and hence 0 induces a unique ^-isomorphism ft' : (U + T)/U —* T" such that P = /?'/?* where /?* : C + T —> ([/ + T)/{7 is the residue class epimorphism; thus 4> induces the ^-isomorphism (a')" 1 / 3 ' : {U + T)/U -+ T/(UnT). Note that in all this, by taking 0 to be the residue class epimorphism V —> V/U we can let U be any preassigned .R-submodule of V. (2**) For any ^-epimorphism

V (of i?-modules) with ker(<£) = [/, clearly W H-+ 0(W) gives a bijection of the set of all i?-submodules of V containing U onto the set of all .R-submodules of V ; this bijection is inclusion preserving, i.e., for any submodules W C W* of V we have <^(W) C (W*). Moreover, for any Rsubmodule W of V with U cW, upon letting 0(W) = W, we have 0 _ 1 ( W ) = W, and ! induces an i?-epimorphism W —> W' with kernel [/, and it also induces ^-isomorphisms W/U -> W and K/W -> V'/W. Assertions (1**) and (2**) are sometimes called the First and the Second Isomorphism Theorems. We may use them tacitly, i.e., without explicit mention. OBSERVATION (Oil). [Abelian Groups and Isomorphism Theorems]. Any additive abelian group G may be viewed as a Z-module by declaring ng = 9 + 9 + • • • + 9 taken n times; this for all n £ N and g e G; the definition is completed by putting {—n)g = —{ng) for all n € N + and g € G. This makes all considerations of i?-modules over a ring R, for instance the above two theorems of (O10), applicable to additive abelian groups. In particular an ideal 7 in a ring R and a subring H of R are both additive subgroups of R and hence their sum I + H is defined as an additive subgroup of R; actually I + H is a subring of R and I is an ideal in I + H; likewise I fl H is an ideal in H. The said two theorems may be supplemented by the following Third, Fourth, and Fifth Isomorphism Theorems which may also be used tacitly. (3**) Let 4> '• R —* R' be a ring epimorphism with ker() = I. Then for any subring H of R and the additive group epimorphism a : H —* H' = we have that H' is a subring of R' with cj)~l(H') = I + H and ker(a) = In H, and hence induces a unique ring isomorphism a' : H/(I D H) —> H' such that a = a'a* where a* : H —» H/{I fl H) is the residue class epimorphism; also clearly 4>{I + H) = H' and for the map /? : I + H —> H' induced by <j> we have that j3 is a ring epimorphism with ker(/3) = I and hence induces a unique ring isomorphism P' :{I+H)/I^> H' such that P = P'p* where /?* : I + H -> {I+H)/I is the residue class epimorphism; thus induces the ring isomorphism (a')~1P> : (I + H)/I —>

LECTURE H: VARIETIES AND MODELS

112

H/(I fl H). Also J — i > { J) gives an inclusion preserving bijection of the set of all ideals in R containing / onto the set of all ideals in R'. Moreover, for any ideal J in R with I c J, upon letting <j>{J) = J', we have ~l{J') = J, and induces an additive group epimorphism J —> J' with kernel / , and it also induces an additive group isomorphism J/I —> J' and a ring isomorphism R/J —• R'/J'. (4**) Let : R —> R' be a ring epimorphism. Then for any ideal J' in R' upon letting J = 0 _ 1 ( J ' ) we have that J is primary (resp: prime, maximal) iff J' is primary (resp: prime, maximal), and for any ideal J in R' upon letting J = ~l{J ) we have that rad« J = J iff rad^/J' = J , and for any family of ideals ( J[)I^L in R' upon letting J; = 0 _ 1 (J ( ') we have that J = X)jg£ ^ ^ J' = SigL ^/> anc ^ w e n a v e that J = n ( e L J ( iff J ' = n J e L J/. (5**) Let <j> : V —> V be an i?-epimorphism of .R-modules where i? is a ring. Then for any i?-submodule W of V upon letting W = (j>~1(W) we have that W is primary iff W' is primary, and for any family of ii-submodules (W()ieL of V upon letting Wi = (j)-l{Wl) we have that W = £ / e L Wj iff W = £ ; e z , W/, and we have

that w = nl£Lwt iff w = nl€Lw;. OBSERVATION (012). [Annihilators a n d Colons]. For a module V over a ring R, colons in R can be expressed in terms of annihilators thus. For any B dV or a € V, we put a n n f l 5 = {r &R: r(RB) = 0}

and

ann R o = { r 6 f l : r(Ra) = 0}

and we call these the annihilators of B and a in R respectively, and we note that they are clearly ideals in R. Obviously ann#.B = (0 : B)R

and

ann^a = (0 : CL)R

and more generally for any submodule C of V we have ann f l ((C + RB)/C)

= (C : B)R

and

ann f l ((C + Ra)/C) = [C : a)R.

If / is any ideal in R with I C ann^V then the .R-module structure of V induces a unique (i2/7)-module structure on V such that, upon letting (j> : R —> R/I be the residue class epimorphism, for all r £ R and v G V we have
§5.- IDEALS

AND

MODUL.ES

113

OBSERVATION (013). [Primary Submodules]. To generalize the material of (07) and (08) to modules, let P be an ideal in a ring R and let Q be an Rsubmodule of an .R-module V. Again by a cyclical proof we see that the following three conditions are mutually equivalent, and when they are satisfied, we say that Q is primary for P, or that Q is P-primary; [cf. §13(E11)]. (i") Q is primary with radv<5 = P. ( i f ) P C ra.dvQ ^ R and: r G P and s<=V\Q with rs G Q => r G P. (in") Q is primary, annj^V/Q) is primary (as ideal), and P is prime with radyQ = PAs easy consequences of the above equivalence we have the following [cf. §13(E11)]: (1") Q is P-primary and B (Q : B)R is P-primary. (2") Q = Qi n • • • n Qh where Q, is P-primary for 1 Q is P-primary. (3") P is a maximal ideal in R with P C iadvQ ^ R => Q is P-primary. (4'*) P is a maximal ideal in R and PeV C Q ^ V" (for instance Q = P e 7 ) with e 6 N+ => Q is P-primary. OBSERVATION (014). [Noetherian Modules with ACC a n d MXC]. Generalizing the noetherian condition on a ring R, an jR-module V is said to be noetherian if every submodule U of it is finitely generated, i.e., U = RU for some finite subset U of V; let us call this the NNC = noetherian condition on V. In the proofs of the Hilbert Basis Theorems (T3) and (T4), we implicitly used the fact that the NNC is equivalent to the ACC = ascending chain condition which says that for any ascending chain Ui C U2 C . . . of submodules of V we have Ue = Ue+\ = Ue+2 = ... for some e G N + . Yet another equivalent condition is the MXC = maximal condition which says that every nonempty family (UI)I^L of submodules of V has a maximal element, i.e., there exists I' G L for which there is no I" G L with Ui> C Uv and Ui> ^ Ui". To prove the equivalence of the three conditions we go in the following cyclical manner: (5*) NNC =* ACC =*• MXC => NNC. To see that NNC => ACC, given any ascending chain U\ C U2 C . . . of submodules of V let U = UJ6N + Ui\ then U is a submodule of V and hence by NNC we can find a finite set of generators u i , . . . ,u<j of U; now Uj G C/e(j) for some e(J) G N + , and it suffices to take e G N + with e > e(j) for 1 < j < d. To see that ACC => MXC, let (Ui)i£L be any nonempty family of submodules of V; take any 1(1) G L; if U^i) is not maximal then for some 1(2) G L we have Ui(i) C Z7/(2) with U^i) ^ ^(2)! if £/((2) is not maximal then for some Z(3) G L we have t/;(2) C £/;(3) with £/((2) 7^ ^(3)i

114

LECTURE H: VARIETIES AND MODELS

and so on; by ACC this must stop and we get V G L with Uv maximal. To see that MXC => NNC, let U be any submodule of V; the family (= the set )U of all finitely generated submodules of U is nonempty since it contains the zero module; by MXC this family contains a maximal member U'; for any x GU\U', the module U" = U' + Rx is clearly in the family and for it we have U' C U"\ therefore by the maximality of U' we must have U' = U", i.e., x G U'; thus U = U' and hence U is finitely generated. OBSERVATION (015). [Irreducible and Primary Submodules]. Let us show that for a module V over a ring R we have the following: (6*) V satisfies MXC =*> every submodule U of V can be expressed as an intersection U = r\\ 1, let e > 0 be an integer, let P = M(R) = (X\,... ,Xm)R, let W be the set of all monomials of degree e in X\,..., Xm, let W\,..., Wn be a finite number of nonempty subsets of W such that W\C\---C\ Wn = 0. By (4*) we know

§5: IDEALS

AND

MODULES

115

that P e + 1 is P-primary and it can easily be seen that Pe+1 ^ ( P e + 1 + WiR) for 1 < i < n but P e + 1 = n 1 < i < n ( P e + 1 + WiP) and hence P e + 1 is not irreducible. OBSERVATION (016). [Spectrum and Irredundant Decomposition]. For a ring R we put spec(P) = the set of all prime ideals in R and we call this the spectrum of R, and we put mspec(P) = the set of all maximal ideals in R and we call this the maximal spectrum of R. For any J c R we put vspec fi J = {P G spec(P) : J C P } and we call this the spectral variety of J in R, and we put mvspec fi J = mspec(P) n vspec fl J and we call this the maximal spectral variety of J in R, and we put nvspec^ J = the set of all minimal members of vspec K J i.e., the set of all P in vspec# J such that for all P ' ^ P in vspec^ J we have P' <£_ P , and we call this the minimal spectral variety of J in R. We put svt(P) = the set of all spectral varieties in R i.e., the set of all subsets of spec(P) of the form vspec^J with J varying over the set of all subsets of R. For any W C spec(P) we put ispec^W =

r\P€wP

and we call this the spectral ideal of W in R; note that by convention: ispec^.W = R & W = 0. We put rd(P) = the set of all radical ideals in R i.e., the set of all ideals J in R such that r a d ^ J = J. Later on we shall show that, when R is a multivariate polynomial ring over a field, ispec^ gives a bijection svt(P) —> rd(P) whose inverse is given by vspec#. First we want to set up the machinery for proving partial uniqueness of the primary decomposition hinted at in (015). For any R-module V we put &SSRV

= {P G spec(P) : ann/ja is P-primary for some a £ V}

LECTURE L4: VARIETIES AND MODELS

116

and we call this the associator of V in R; note that we automatically have a ^ 0 because the annihilator of 0 is R. We also put nassftV = the set of all minimal members of ass^V i.e., the set of all P in ass^1!^ such that for all P' ^ P in ass^V we have P' <£ P, and we call this the minimal associator of V in R. Note that for any P-submodule U of V we clearly have assR(V/U)

C vspec R (ann/{(V/t/)).

For any P-submodule U of V, by an associated (resp: associated minimal, associated embedded) prime of U in V we mean a member of assfl(V/l7) (resp: a member of nassR(V/U), a member of &ssR(V/U) \ nassR(V/U)). These terms are motivated by an assertion about an irredundant primary decomposition of an P-submodule U of V, i.e., an expression of the form:

' U = ^i
= {Pi,.. .,Pn}

and nassR(V/U)

= nvspec fl (ann fl (V A /t/)).

Clearly: (8.1*) In (•) we have vspecR(imnR(V/U))

= Ui
Therefore, assertion (8*) is equivalent to the following assertion: (8.2') In (•) we have {P € spec(i?) : (U : a)R is P-primary for some a &V} = { P i , . . . , P„}. As a supplement to (8.2*) we shall also prove: (8.3*) If R is noetherian then in (•) we have {P € spec(P) :(U:a)R

= P for some a €V}

= {Pu...,

Pn).

§5: IDEALS AND MODULES

117

Before proving (8.2*) and (8.3*), let us prove the following claim: (8.4*) In (•), for any i £ { l , . . . , n } and a £ f~lje{i n}\{i}Qj with a g Qi we have that (U : CL)R is Pj-primary. Moreover, if (U : a)# 7^ Pj then for some x £ R with xa $? Qj we have (£/ : a)# ^ (£/ : XO)R. Finally, if R is noetherian then for some z £ R with za ^ Qi we have ([/ : ZCL)R = Pi. Namely, in view of (ii'") we see that a n n ^ V / Q j ) C (U : a)# C Pi and hence Pi C radfl(C/ : a)R^ R. Moreover: r £ R\Pi

and s £ i? with rs £ (U : a)R

=> r(sa) <=U cQi ' =3- sa € Qi [by (ii'*) because r £ Pi] =$• sa £ U [because U = Di s G (*/ :a)R and hence by (ii*) we see that (U : O)R is Pj-primary. Moreover, if (U : a),R ^ Pj then for some x,y in R\(U : O)R we have yx £ (U : a)n; now: yx £ (U : CI)R =*> y(a;a) £ U => y £ (U : XCL)R => ([/ : a)# ^ (£/ : xa)^; also: x g (U : CL)R and a £ <~}j£{i,...,n}\{i}Qj => xa $ Qi. If (U : X\O)R ^ Pi with xi = x then replacing a by x\a we find X2 £ R such that a^aria 0 Qj and (U : XICL)R ^ (U : a^xia)^. And so on. Now clearly (U : CI)R C (U : XIO)R C (U : X2X\O)R C It follows that if R satisfies ACC then this must stop and we find z = xexe-\... x\ £ R such that za £ Qi and (U : ZO)R = Pj. To prove (8.2 s ) and (8.3*), given i £ { 1 , . . . , n}, by the irredundancy we can find a as in (8.4*) and then by that claim we see that (U : CL)R is Pj-primary, and if R is noetherian then after replacing a by za with suitable z £ P we have (JJ : O)R = Pi. Conversely let P £ spec(P) be such that (U : O,)R is P-primary for some a £ V. Now P = rad fi (l7 : O)R = r\i
LECTURE

118

L4: VARIETIES

AND

MODELS

and tass coincide. (9*) If R is noetherian then in (•) we have tass#(V/[/) =

assrt(V/U).

This follows from (8.2*) and (8.3'). OBSERVATION (018). [Tacit Use of PIC and P M C ] . In the proof of (8.4*) in (016) we cited (ii*) and (ii") when we should have really said by the equivalence of conditions (i*) to (in') asserted in (07) and by the equivalence of conditions (i /# ) to (iii") asserted in (013). These are conditions for an ideal to be P-primary and for a module to be P-primary. We may call these PIC (= Primary Ideal Conditions) and PMC (= Primary Module Conditions) respectively and we may use them tacitly. OBSERVATION (019). [Multiplicative Sets and Isolated Components]. By a multiplicative set in a ring R we mean a subset S of R with 1 € S such that: x, y in S =>• xy € S. For a submodule U of a module V over R we put [U : S]v = Uses(U : s)v and we call this the isolated 5-component of U in V; note that [U : S]y is a submodule of V with U C [U : S]V- To distinguish this from (U : B)v for B c R we observe that (U : B)v = DbeB(U : b)v For a prime ideal P in R we put [U:P}v

=

[U:R\P]v

and we call this the isolated P-component of U in V; there should be no confusion between these notations because always l € S and 1 ^ P . More generally, for any prime ideals P i , . . . , P„ in R with n £ N+ we put [U:(Pi,...,Pn)]v

=

[U:ni
and we call this the isolated ( P i , . . . , P n )-component of U in V. In (016), we used the parenthetical colon operation (:) to prove the uniqueness of the primes Pj occurring in (•), and now we shall use the bracketed colon operation [:] to prove a partial uniqueness of the primaries Qi. The proof applies to a primary decomposition which need not be irredundant, i.e., when for a submodule U of V we have: U = ni
(CONVENTION: ni
where n G N and Qi is a primary submodule of V for 1 < i < n and rad/jQi = Pi for 1 < i < n.

§5: IDEALS AND MODULES

119

Now the uniqueness assertion is the following: (10*) In (••) let 1 < i(l) < i(2) < ••• < i(h) < n be any sequence of integers with h £ N+ such that: j £ { 1 , . . . , n} with Pj C P,(/) for some I £ { 1 , . . . , h} =>• j £ {i(l),...,i(h)}. Then we have [[/ : (P»(i),... ,Pi(h))]v = ni SnPm ^ 0. Then U = Q'nQ". We want to show that [U : S]v = Q'• Now

{

v£[U:S}v

=>• =*•

xv £ U for some x e S C r\i
To prove the other implication, for 1 < / < n — h we can first take yi £ SflPj^ then we can find e(l) £ N+ such that y f ( ' V c Qj(i). Now y = Y\l
=•

and £ S

yv £ Q' n Q" = U

=> v £ [U : S]y. OBSERVATION (020). [Zerodivisors and Quasiprimary Decomposition]. To generalize the contents of (016) to (019), PIC and PMC suggest the following weakenings of primaries in a ring R and in an i?-module V. Given ideals P, Q in R we say that Q is P-quasiprimary if P is prime with Q ^ R and: r £ R and s £ R\Q with rs £ Q =>• r £ P. More generally, given an ideal P in R and a submodule Q of V we say that Q is P-quasiprimary if P is prime with Q ^ V and: r £ R and s e F \ Q with rs e Q =>• r € P . By fixing s £ R \ Q in the above implication about ideals we see that r £ Q^>rs£Q=>r£P and hence: Q is a P-quasiprimary ideal in R =>• Q C P . Similarly, by fixing s e V \ Q in the above implication about modules we see that r £ annft(VyQ) ^>rs£Q=$>r£P and hence: Q is a P-quasiprimary submodule of V =*> &nnji(V/Q) C P . Moreover in the ideal case rad^Q = R iff Q = R, and in the module case T&djiQ = R iff Q = V. Therefore in (07) we see that (ii*) =^ Q C .P, and in (013) we see that (ii /# ) =£> annfl(Vy<2) c P . This remark was implicitly

LECTURE H: VARIETIES AND MODELS

120

used in the cyclical proofs asserted in (07) and (013). By a zerodivisor of V we mean r £ R such that rv = 0 for some 0 ^ v € V, and we put ZR(V) = the set of all zerodivisors of V. We also put SR(V) =

R\ZR(V)

and we note that, by the nonP convention, a nonzerodivisor of V means an element of SR(V); we also note that SR(V) is a multiplicative set in R. For a submodule U of V, members of ZR(V/U) or SR(V/U) may respectively be called zerodivisors or nonzerodivisors mod U. In particular, for an ideal I in R, members of ZR(R/I) or SR(R/I) may respectively be called zerodivisors or nonzerodivisors mod I. [The following six terms: loass, lmass, lpass, zoass, zmass, and zpass, are used only in this Observation (020), with the single exception of lmass to be used in the proof of (024)(20*). So the readers NEED NOT MEMORIZE them]. For any ideal I in R and multiplicative set S in R with J fl S = 0, we put loass^(7, S) — the set of all ideals J in R with I
\oassR(I,SR(R/I))

and we call this the zerodivisor overideal associator of J in R, and we put zmass/{(7) = lmassfi(7,

SR(R/I))

and we call this the zerodivisor maximal associator of I in R, and we put zpass i j(J) = lpass

R(I,SR(R/I))

and we call this the zerodivisor prime associator of I in R. Note that an ideal J in R consists only of zerodivisors mod I iff JDSR(R/I) = 0; in this case we may say that J is a zerodivisor ideal mod I. Similarly, a member of zoass#(/) (resp: zmassfl(J),

§5: IDEALS AND MODULES

121

zpassfl(/)) may be called a zerodivisor overideal (resp: zerodivisor maximal ideal, zerodivisor prime ideal) mod 7. In (12*) below we shall show that lmassfl(7, S) C lpass fl (7,5) and hence in particular zmass^(7) C zpass fl (7). In (13*) below we shall give a quasiprimary decomposition for any nonunit ideal 7 in R, and (14*) below we shall generalize it to a quasiprimary decomposition for any submodule U of V with U ^ V. To achieve the said generalization, we note that for any submodule U of V with JJ ^ V we clearly have &imR(V/U) D SR{V/U) = 0 and we put zoassv(C/) = loassR(<mnR(V/U),

SR(V/U))

and we call this the zerodivisor overideal associator of U in V, and we put zmassv(CZ) = lmassR(annR(V/U),

SR(V/U))

and we call this the zerodivisor maximal associator of U in V, and we put zpassv(U)

=

lpassR{annR{V/U),SR(V/U))

and we call this the zerodivisor prime associator of U in V. Note that an ideal J in R consists only of zerodivisors mod U iff J n SR(V/U) — 0; in this case we may say that J is a zerodivisor ideal mod U. Similarly, a member of zoassy(£7) (resp: zmassy(CZ), zpassy(t/)) may be called a zerodivisor overideal (resp: zerodivisor maximal ideal, zerodivisor prime ideal) mod U. By a principal component of a nonunit ideal 7 in R we mean an ideal in R of the form [7 : P]R for some P £ zmassft(7); in (13*) we shall show that [7 : P]R is P-quasiprimary. Likewise, by a principal component of a submodule U of V with U 7^ V we mean a submodule of V of the form [U : P)v for some P G zmass\/(C/); in (14*) we shall show that \U : P]v is P-quasiprimary. Note that since 1 belongs to every multiplicative set S in 7?, for any ideal 7 in R we clearly have 7 c [I: S]R and hence in particular for every prime ideal P in R we have 7 C [7 : P]R. Similarly, for any submodule U of V we have U C [U : S]v and U C [U : P]v(12*) For any ideal 7 in R and multiplicative set S in R with 7 n S = 0 we have 0 ^ lmassii(7,5) C lpass R (7, S) C vspec fi (7) C spec(i?) and for any ideal 7i in R with 7 C h and 7i n S1 = 0 we have lmassfi(7i,5) C

lmassR(I,S).

In particular, for any nonunit ideal 7 in R we have 0 y£ zmassfi(7) c zpass fi (7) c vspec fl (7) c spec(i?) and for any ideal 77 in R we have 77 n SR(R/I)

= 0 => i7 c P for some P e zmass fi (7).

122

LECTURE L4: VARIETIES AND MODELS

More generally, for any submodule U of V with U / V we have 0 ^ zmassv(U)

C zpass v (C) c vspecR(annfi(V/C/')) c spec(i?)

and for any ideal H in R we have if n SR(V/U)

= 0 =4- if C P for some P € zmass v (t/).

(13*) Given any nonunit ideal I in i?, let W = zmassR(I) with W = vspecR(I), and for every P G spec(P) let I[p] = [I : P]R. Then for every P € spec(i?) \ W we have 7[PJ = R, for every P G W the ideal I[p] is P-quasiprimary, and we have the quasiprimary decompositions I = P\p^wI[P] = <^PewI[p] = ^Pespec(R)I[P] where the first equation may be paraphrased by saying that I is the intersection of its principal components. Moreover, for every P G nvspec fl (7) the ideal 7[pj is P-primary. (14*) Given any submodule U of V with U =£ V, let W = zmassy([/) with W = {P G spec(P) : [/[p] ^ V} where for every P G spec(P) we are putting U[p] = [U : P]v- Then for every P G W the submodule £/[pj is P-quasiprimary, and we have the quasiprimary decompositions U = r\p&wU[p] = ^PeW'U[p] = ^Pespec(R)U[p] where the first equation may be paraphrased by saying that U is the intersection of its principal components. Also W C vspec fl (annij(V/[/)), and if V is a finitely generated jR-module then W = vspec i j(annii(V/[/)). Moreover, for every P G nvspecij(ann/{(V/C/)) the submodule U[p] is P-primary. PROOF OF (12*). To prove the first display, let us start by noting that lmassft(7, S) is nonempty because of Zorn's Lemma. To show that any P in lmassp_(/, 5) is prime, given any Xi G R \ P for 1 < i < 2, by the maximality of P we can find yi G (xiR + P) C\S, and then t/j = nxi +pi with ri G R and pi G P, and now 2/12/2 G S c R\P and 2/12/2 -r\riX\X2 = rixip2 + ^2^2Pi +PiP2 G P , and hence x\Xi £ P. The rest of the first display is straightforward and so is the second display. The third display follows by taking SR(R/I) for S in the first display, and the fourth display follows from the second display by also taking H +1 for I\. The fifth display follows by taking &niiR(V/U) for I and SR(V/U) for S in the first display, and the sixth display follows from the second display by also taking H + I for/i. PROOF OF (13'). Follows by taking (U,V) = (I,R) in (14*). PROOF OF (14*). Let us put I = a.nnR(V/U). Since U C U[P] for every P in spec(P), we have U C r\pew>U[pj = nPespec(R)U[p] C Dp^\yU[p]. Hence to prove the "quasiprimary decompositions" it suffices to show that u G C\pewU[P] =*> u £U.

§5: IDEALS AND MODULES

123

Now given any u £ Hp&wU[p], upon letting H = (U : U)R, we see that: P £W =>u£ U[P] ' => for some r G R \ P we have ru £ U and hence r £ H => H

£P.

Consequently by the last display in (12*) we conclude that H fl hence u £ U. Let there be given any prime ideal P in R. Now r £ R\P

SR(V/U)

^ 0 and

and s € V with rs £ U[P]

=>• a;rs G [/ for some x £ =4> (a;r)s G [/ with xr £

R\P R\P

^S£U[P].

This shows that U[p] is P-quasiprimary for every P € W'. In view of PMC, it only remains to establish the following three implications. (a) I £ P => U[P] = V. (b) U[p] - V and V = Rv\-\ l-Rvh with h in N + and vx,..., vh in V => J • P c radyC/jpj. To prove (a) note that:

'/£P ^ • s V c t / for some s £

R\P

< =>• V C (f/ : s)v for some s £

R\P

=>Vc[U:P]v ^U[P]=V. To prove (b) note that: U\p\ =V = Rvi + • • • + Rvh for some v\,...,

Vh in V with h G N +

=> XjUj G U with Xj G R \ P for 1 • a;V C t/ where x = x\...Xh => x £ I with x £

£

R\P

R\P

=>I<£P. To prove (c) assume that P G nvspeCp(J). Given any y £ P let S be the smallest multiplicative set in R containing y and R\P. If J were disjoint from S (i.e., if / n S = 0) then, by the first display in (12 0 ), there exists P' £ spec(i?) with I c P' and P'nS = 0. But P' D S = 0 implies that P ' D ( # \ P ) = 0, i.e., P' C P which in view of the minimality of P implies that P' = P and hence y £ P' which is a contradiction. Therefore / is not disjoint from S, i.e., j / n z £ I for some

124

n G N and z£ R\P. y G rady[/[ P ].

LECTURE H: VARIETIES AND MODELS

It follows that ynzV

c U and hence ynV C U[P]. Therefore

OBSERVATION (021). [Length of a Module]. The dimension of a vector space V over a field R can be characterized as the maximum length of strictly increasing finite chains of subspaces VQ C V\ C • • • C Vn. Generalizing this we define the length IR{V) of a module V over a ring R to be the maximum length n G N of a finite sequence (a)

VoCViC-cV„

of submodules of V with

0 = Vo^Vi^---^Vn-!^Vn

= V.

We call such a sequence a normal series of V of length n. When there does not exist a bound for the lengths of such sequences then we put £R(V) = oo. Note that iR(V) = 0 o V = 0. By a simple module we mean a module of length 1, i.e., a nonzero module which has no nonzero submodule different from the whole module. By a composition series of V we mean a normal series (a) such that the factor module Vi/Vi-i is simple for 1 < i < n. Two normal series (a) and

(<*')

Vo'cv/c-.-cK'

of V are equivalent means n = n' and there is a permutation a of { 1 , . . . , n} such that the factor modules V^/V^-i and Vi'/Vi'_1 are isomorphic for 1 n and there is a subsequence 0 < j i < • • • < j n - \ < n' such that VJ. = V% for 0 < i < n. To slightly generalize the above concepts, let W be any submodule of V. By a normal series between W and V of length n G N we mean a sequence (a) of submodules of V with

W = V0jtV1^---^Vn.1^Vn

= V.

This is called a composition series between W and V if the factor module Vi/Vi-i is simple for 1 < i < n. Two normal series (a) and (a') between W and V are equivalent means n = n' and there is a permutation cr of { 1 , . . . , n} such that the factor modules Va{%)IVa(i)-\ and Vi'/Vi'_1 are isomorphic for 1 n and there is a subsequence 0 < j \ < • • • < jn-i < n' such that Vj. =VitorO
§5: IDEALS AND MODULES

125

if (a) is a normal series of V of length n, then (a) can be refined to a composition series of V of length at most n'. Namely, if (a) is not a composition series then Vi/Vi-i is not simple for some i £ { 1 , . . . , n} and we can insert a submodule between Vi-i and Vi which is different from both. Now we must have n + 1 < n', and so on. (15*) If the module V has a composition series then any two composition series of V are equivalent (and hence in particular have the same length which equals £R(V)), and every normal series of V can be refined into a composition series of V. PROOF. By induction on the nonnegative integer n let us show that if V has a composition series (a) of length n then for the length n' of any normal series (a1) of V we have n' < n. This being obvious for n < 1, let n > 1 and assume true for all smaller values of n. If n' < 1 then we have nothing to show. So assume n' > 1. If V^,_j = V n _i then VQ C V/ C • • • C VrT(/_1 is a normal series of V n _i of length n' — 1 and Vb C Vi C • • • C Vn-\ is a composition series of Vn-\ of length n — 1, and hence by the induction hypothesis n' — 1 < n — 1 and therefore n' < n. If K ' - i ^ K - i with Vr^,_1 C Vn-i then V0' C V{ C • • • C V^,_x C V n _i is a normal series of V„_i of length n' and Vb C V\ C • • • C V„_i is a composition series of Vn-i of length n— 1, and hence again by the induction hypothesis n' < n— 1 and therefore n' < n. Finally suppose that VT^,_1 ^ V^_i. Then, since there is no submodule between Vn-i and V other than these two, we must have Vn-i + V^,_1 — V, and hence by the First Isomorphism Theorem (1**), the factor modules V/Vn-i and Vn'-i/(Vn-i n V ^ j ) are isomorphic, and therefore Vr^,_1/(Vrn_i n VrT(,_1) is simple. Since Vb C Vi c • • • C Vn-\ is a composition series of V n _i of length n — \ and V^-i n Vrf-i C V n _i with Vn-i n V^/_! ^ V^-i, by the induction hypothesis it follows that the length of any normal series of Vn-i H V^,_1 is at most n — 2, and hence V n -i fl V^/_1 has a composition series of length at most n — 2; since V^/_ 1 /(K l _i Pi V^/_1) is simple, it follows that V^,_1 has a composition series of length at most n — 1. Since VQ C F / C • • • C \rll,_1 is a normal series of V^,_1 of length n' — 1, by the induction hypothesis we get n ' — 1 < n — 1 and hence n' < n. This completes the induction. It follows that every composition series of V has length n, and any normal series of V can be refined into a composition series of V. It only remains to show that, assuming (a) and (a') to be composition series of V of lengths n and n' with n = n', there is a permutation a of { 1 , . . . ,n} such that the factor modules VCT(j)/Vra(j)_i and V{/V(_l are isomorphic for 1 1 and assume true for all values of n smaller than the given one. If V^_x = Vn-\ then deleting the last terms from (a) and (a') we get two composition series of V n _i of length n — 1, which are equivalent by the induction hypothesis, and hence (a) and (a') are equivalent. Now assume that V^_t ^ Vn-\. Then V^_1 + Vn-\ = V and hence, upon letting V*. = V^l_1 fl V„_i, by the First Isomorphism Theorem (1**), the two modules Vn-\/V*. and V/V^_x are isomorphic, and so are V^-i/V*, and

126

LECTURE

V/Vn-i.

L4: VARIETIES

AND

MODELS

Therefore by taking a fixed composition series

v0* c v; c • • • c v:,

(P)

of V*. we get the two equivalent composition series

(7)

V0* c V,* c • • • c v:. c K-i c V

and

(7')

V0* c V{ c • • • c V£ c K_! c v

of V. By deleting the last Vn-i which are equivalent equivalent; clearly we must Therefore (a) and (a') are

terms in (a) and (7) we get two composition series of by the induction hypothesis, and hence (a) and (7) are have n* = n - 2. Similarly (a') and (7') are equivalent. equivalent.

(16*) If W is a submodule of V such that there is a composition series between W and V then any two composition series between W and V are equivalent (and hence in particular have the same length which equals £R(V/W)), and every normal series between W and V can be refined into a composition series between W and V. If W is a submodule of V with residue class epimorphism <j>: V —» V/W, then, for any normal series (a) between W and V, the series ^(Vo) C 4>{Vi) C • • • C 0(V„) is a normal series of VyW and for 1 < £ < n the factor modules (j){Vi)/(p(Vi-\) and Vi/Vi_i are isomorphic; in particular: (a) is a composition series between W and V & 4>{VQ) C (Vi) C • • • C <j>(Vn) is a composition series of V/W. PROOF. The first sentence follows by taking V/W for V in (15*). The second sentence follows by the Second Isomorphism Theorem (2**). (17*) For any submodule W of V we have £R(V) = £R(W) + £R(V/W), where by convention 00 = 00 + 00 = 00 + integer. In greater detail, if (a) is a composition series between W and V, and Wo CW\C---C Wm is a composition series of W, then Wo C W\ C • • • C Wm C Vi C • • • C Vn is a composition series of V. PROOF. Follows from (16*). (18*) For any submodules U and W of the module V we have £R(C/) + lR(W) = £R(U + W) + £R(U n W^), again with the understanding that 00 + 00 = 00 + integer. PROOF. Taking (U + W, U) for (V, W) in (17*) we get £R(U + W)= £R(U) + £R((U +

W)/U).

The factor modules ([/ + W)/U and W/{U D W) are isomorphic by the First Isomorphism Theorem (!*'), and hence by taking (W, U D W) for (V, W) in (17*) we

§5/ IDEALS

AND

MODULES

127

get

IR(W) = eR(u nw) + eR((u + w)/u). We are done by adding the two equations, and taking care of infinities. OBSERVATION (022). [Heights and Depths of Ideals, and Dimensions of Rings]. Replacing chains of submodules by chains of prime ideals we get the definitions of heights and depths of ideals and the definition of the dimension of a ring. By the dimension dim(P) of a ring R we mean the maximum length n £ N of a finite sequence (6)

Po C Px C • • • C P„

of prime ideals in R with P0 + P\ + • • • ± Pn-l ± PnWe call such a sequence a prime sequence in R of length n. When there does not exist a bound for such sequences then we put dim(P) = oo provided R is not the null ring; if R is the null ring then we put dim(P) = —oo. By the height htRP (resp: depth d p t ^ P ) of a prime ideal P in R we mean the maximum length of a prime chain (<5) in R with P = Pn (resp: P = PQ), again with the understanding that when there does not exist a bound for such sequences then we put htflP = oo (resp: dptRP = oo). We define the height htft J and the depth dpt# J of a nonunit ideal J in R by putting htRJ = min{ht R P : P £ vspec f l J} £ N U {oo} and dptRJ = max{dpt f i P : P £ vspec f l J} € N U {oo} and we note that for J £ spec(P) these reduce to the previous definitions. Also note that for a nonunit ideal J in R, dpti? J is the maximum length of a prime chain (6) in R with J C Po. Finally note that for an ideal J in R, vspecRJ is empty iff J = R, and let us complete the definitions of height and depth by putting htRR = —oo = dptRR and let us observe that now d p t ^ J = dim(i?/J). EXAMPLE (X10). Taking R to be the n-variable polynomial ring k[Xi,..., Xn] over a field k and Pt to be the ideal in R generated by Xi,...,Xi, and putting X\ = • • • = Xi = 0 we get an epimorphism R —> k[Xi+i,..., Xn] with kernel Pj for 0 n

128

LECTURE H: VARIETIES AND MODELS

with htflPi > i and dptflPj > n - i for 0 < i < n. Later on we shall show that these three inequalities are actually equalities, [cf. L5§5(Q10)(T47)]. OBSERVATION (023). [Modular Law]. The following Modular Law gives a useful identity concerning a module V over a ring R: (19*) For any submodules T, U, W of V with U C T we have

TD{U + W) = U + (Tr\W).

To see this note that the RHS is obviously contained in the LHS. Conversely, if v is any element in the LHS then, because v G U + W, we get v = u 4- w with u £U and w &W. Now w = v - u € T because v eT and u eU CT. Thus i n e T n i y and hence w £ U + (T n W). OBSERVATION (024). [Generalities on Prime Ideals]. In the following assertions (20*) to (23*) we give some useful properties of prime ideals. (20*) In any ring R, the set of all nilpotent elements, i.e., the radical of the zero ideal, equals the intersection of all the prime ideals, with the understanding that if there are no prime ideals (i.e., if the ring is the null ring) then the said intersection is the unit ideal. [Note the analogy with L2(R5) about the existence of maximal ideals]. PROOF. Since any prime ideal contains 0, it must contain every nilpotent element. Conversely, given any nonnilpotent element v in R we want to find a prime ideal P not containing it. We can do this by letting 1 = 0 and S = {vn : n G N} in (12*), and taking P G lmass f l (/,S). (21*) Let P be a prime ideal in a ring R, and let J\,..., Jn be ideals in R with n G N+ such that Ji n • • • f~l Jn C P. Then for some i € { 1 , . . . , n) we have Ji C radft Ji C P. Moreover, if J\ n • • • O J„ = P then for some i G { 1 , . . . ,n} we have Ji = P. PROOF. If Ji <]L P for 1 < i < n then by taking Xi G Ji\P we get x\ ... xn G Jifl • • -fl J„ C P which contradicts the primeness of P. Therefore for some i G { 1 , . . . , n} we must have Ji C P and by taking radicals we get rad# Ji C P. The rest is obvious. (22*) Let J be an ideal in a ring R, and let P i , . . . , Pn be prime ideals in R with n G N + such that J C Pi U • • • U P„. Then for some i G { 1 , . . . , n} we have J c PiPROOF. By discarding some of the primes P i , . . . , P „ we may assume that

§5: IDEALS AND MODULES

129

Pj ^ Ui». We claim that: (24*)

DCC o MNC.

Namely, by reversing inclusions and changing maximal to minimal in the proof of ACC =^ MXC given in (014) we get DCC => MNC. Conversely, if V does not satisfy DCC then we can find an infinite sequence U\ D U2 D • • • of submodules of V with Ui 7^ U2 ^ •. • and clearly the family (C/();eN+ has no minimal element. We say that V is an artinian it-module to mean that V satisfies the DCC and hence the MNC. We say that R is an artinian ring to mean that R is an artinian .R-module. In assertions (25*) to (33*) below, we shall prove some general properties of artinian and noetherian modules. (25*) Let W be any submodule of V. Then V is an artinian module <=» W and V/W are artinian modules. Similarly V is a noetherian module <=> W and V/W are noetherian modules. PROOF. If DCC holds in V then it obviously holds in W and by the Second

130

LECTURE L4: VARIETIES AND MODELS

Isomorphism Theorem (2**) it also holds in V/W. Conversely, assume DCC holds in W and V/W, and let U\ D U2 D ... be any decreasing sequence of submodules of V. Then Ui n W D U2 n W D •.. and {Ux + W)/W D (U2 + W)/W D ... are decreasing sequences of submodules of W and V/W respectively, and hence for some e e N+ we have Ue n W = Ue+i n W = Ue+2 r\W = ... and Ue + W = Ue+\ + W = Ue+2 + W = ... . For every integer d > e we get V d = Ud n (J7d + W) <

because C/d c C/d + W

= UdD (Ud+1 + W)

because Ud + W = Ud+i + W

= Ud+i + (Ud n W)

by Modular Law (19*)

= Ud+i + (Ud+1 n W)

because f/d n W = C/d+1 n W

= Ud+\

because Ud+i C\W C Ud+i.

Therefore Ue = Ue+i = Ue+2 = ••• • For the noetherian case, in the undisplayed material change DCC to ACC and D to C, and in the displayed material change (d,d+l) to (d+l,d). (26") Assuming V = V\ + • • • + Vs, where V\,...,VS are a finite number of submodules of V, we have the following. If the modules V\,..., Vs are artinian then so is V. Similarly, if the modules V i , . . . , Va are noetherian then so is V. PROOF. Since, Vi + • • • + Va = (Vi -f • • • + V s _i) + Vs, by induction the general case follows from the s = 2 case. This case follows from (25*) by noting that, in view of the First Isomorphism Theorem (1*'), the modules (Vi + V"2)/Vi and V2/(Vi n V2) are isomorphic. (27*) If R is an artinian ring and V is a finitely generated i?-module then V is an artinian module. Similarly, if R is a noetherian ring and V is a finitely generated .R-module then V is a noetherian module. PROOF. In view of (26') it is sufficient to show that, assuming V = Rx with x G V, if DCC (resp: ACC) holds for ideals in R then DCC (resp: ACC) holds for submodules of V. So assume DCC (resp: ACC) for ideals in R, and given any descending (resp: ascending) sequence ({7j)ieN+ of submodules of V = xR let Ji = {a £ R : ax £ Ui}. Then (Jt)ieN + is a descending (resp: ascending) sequence of ideals in R, and for all i e N+ we have Ui = {ax : a€ J , } . By DCC (resp: ACC) for ideals we can find e G N+ such that J e = Je+i = Je+2 = • • • - I n both the cases it follows that Ue = Ue+i = Ue+2 = ... . (28*) (R(V) < 00 «4- the module V is artinian as well as noetherian. PROOF. If ^ H ( V ) = n < 00 then every descending or ascending sequence of

§5: IDEALS AND MODULES

131

submodules of V has at most n distinct members, and hence V is artinian as well as noetherian. Conversely suppose that V is artinian as well as noetherian. Let UQ — V. If UQ = 0 then £R(V) = 0. If UQ ^ 0 then, because V is noetherian, we can take a maximal member U\ in the family of submodules of Uo different from U0. If U\ = 0 then £R(V) = 1. If U\ ^ 0 then, again because V is noetherian, we can take a maximal member Ui in the family of submodules of U\ different from U\. And so on. Since V is artinian, this must stop. Thus we get n G N together with a decreasing sequence Uo D UI D • • • D Un of submodules of V with V = Uo ^ U\ 7^ • • • 7^ Un-\ ^ Un = 0 such that Ui/Ui+i is simple for 0 d i m ^ ^ < 00 <=$• V is a noetherian module. PROOF. Obvious. (30*) If there are maximal ideals Pi,...,Pn in R with n € N + such that P i . . . Pn = 0 then: R is an artinian ring o R is a noetherian ring. PROOF. Let Vi = P i . . . P n _ t . Then 0 = V0 C Vi C • • • C V„ = R is an increasing sequence of ideals in R. For 1 < i < n we have P n _j + i c ann/j(Vi/Vi_i) and hence the i?-module Vj/Vj_i may be regarded as a vector space over the field R/P„-i+i. Therefore our assertion follows from (29*). (31*) If R is any ring and J is an ideal in R with IRJ — 1 then a n n # J is a prime ideal in i?. PROOF. Now J 7^ 0 and hence ann# J ^ .R. Given any a,b in R\ ann« J, let A = aR and £? = bR. Then A J and B J are nonzero ideals contained in J. Since IRJ = 1, we get AJ = J — BJ. Consequently ABJ = J and hence a& belongs to R \ aniiR J. (32#) If R is a domain then: R is an artinian ring •& R is a, field. If R is an artinian ring then every prime ideal in R is maximal. PROOF. Assume R to be an artinian domain, and let 0 ^ x e R. Then xR D x2R D ... is a decreasing sequence of ideals in R, and hence for some e G N + and y G R we must have xe = y:r e+1 which gives xe(l — yx) = 0. Since xe =£ 0 we get ya; = 1. Thus R is a field. The rest is now obvious. (33*) R is an artinian ring iff it is either the null ring or a zero-dimensional noetherian ring.

132

LECTURE L4: VARIETIES AND MODELS

PROOF. The case of a null ring being obvious let us suppose that R is nonnull. First assume R to be a zero-dimensional noetherian ring. Then by (23#) we can find a positive integer n and maximal ideals P i , . . . , Pn in R such that P i . . . Pn c 0 and hence P i . . . Pn = 0. By (30*) we see that R is artinian. Conversely assume R to be artinian. Then we can take a minimal element 7 in the set of all ideals in R which can be expressed as nonempty finite products of prime ideals. We shall prove that 7 = 0 and, in view of (30*) and (32*), this will show that R is a zero-dimensional noetherian ring. Suppose if possible that 7 ^ 0 . Let A = ann#I. Then A is a nonunit ideal in R, and hence we can take a minimal element B in the nonempty set of all ideals in R which contain A but are different from A. Let P = (A : B)R and let : R —> R/A be the residue class epimorphism. Now <j){R) is an artinian ring and ^ , ( R ) 0 ( B ) = 1. Therefore by (31*) we see that ann^,(fl)0(B) is a prime ideal in (a) where 4>: 72 —• 7?/A and ^> : R —> # / B are the residue class epimorphisms (for => take a, b as in the above paragraph; for <$= write (1 — b) + b = 1). Clearly A, B are comaximal <=> no maximal ideal in R contains both A and B; hence if Ai,..., An, B\,..., P m are ideals in R with n,m in N+ such that Ai, Bj are comaximal for each i,j then A i . . . A„, B i . . . Bm are comaximal and so are Ai D • • • n An, B\ n • • • n B m . The above equality of the product and intersection can be generalized by showing that for any finite number of pairwise comaximal ideals A\,..., An we have J T 1 < i < n A j = rii 1 and assume for all smaller values of n. Then by the induction hypothesis

ni
and

rii
§5: IDEALS AND MODULES

133

and by assumption A\ + A2 = R and hence 'r\i
={Ax+A2){C\i = Ax (ni
cA1{jlmAi)+A2(u^2A) =

lll<j
and obviously rii rad^A and rad^i? are comaximal «• no maximal ideals in R contains both A and B •£> there are elements o, b in R such that : R-^ R/A and V : R -> # / # are the residue class epimorphisms. If A i , . . . , An, B\,..., Bm are ideals in R with n , m in N + such that Ai and Bj are comaximal for each i,j then A\.. .An and £?i... B m are comaximal and so are A\C\ • •• C\An and B\ n • • • D B m . (35*) If A i , . . . , 7 a n arc a finite number of pairwise comaximal ideals in a ring -R then rii
134

LECTURE H: VARIETIES AND MODELS

§6: P R I M A R Y D E C O M P O S I T I O N The fact that every positive integer can be written as a product of powers of prime numbers can be reformulated by saying that every nonzero ideal in the ring of integers can be expressed as a finite product, and hence finite intersection, of ideals generated by powers of prime numbers. In (015) to (020) we have generalized this to obtain primary and quasiprimary decompositions of ideals in a ring R. Let us restate these generalizations by first recalling some relevant definitions. The radical of an ideal 7 in R is denned by r a d ^ I = {r G R : re G 7 for some e G N + } . By spec(J?) and mspec(P) we denote the set of all prime and maximal ideals in R respectively. We put vspec/j/ = { P e spec(P) : 7 c P } , and nvspec#7 = set of all minimal members of vspecfl/. 7 is primary means I ^ R and: r £ R and s G R\ I with rs G 7 => re G I for some e G N+. If 7 is primary then radij/ is prime and we say that 7 is (radfl7)-primary. We say that 7 is P-quasiprimary if P G spec(P) with 7 ^ R and: r £ R and s G R\I with rs G J =$• r G P; then 7 is P-primary iff 7 is P-quasiprimary and P C rad#7; see (07). For any a G R we put (7 : a) f l = {r G 7? : ra G 7}. We put a,ssR(R/I) = {P G spec(7?) : (7 : a)/j is P-primary for some a G P } and tassn(R/I) = {P G spec(P) : (7 : O)R = P for some a G P } ; we also put nass#(P/7) = set of all minimal members of &SSR(R/I); members of assij(P/7) and nass/i(P/7) are called associated primes and associated minimal primes of 7 in R respectively. By a multiplicative set in R we mean a subset S of R with 1 G S such that: x,y in S1 =>• xy G S1. For any multiplicative set 5 in P we put [7 : S]R — U a e s ( 7 : O)R = the isolated S1component of 7 in P . For any P G spec(P) we put [I : P]R = [I : R\ P]R = the isolated P-component of 7 in R. More generally for any P i , . . . , P„ in spec(P) with n G N + we put [7 : ( P i , . . . ,P„)]ij = [7 : r i i < i < n ( P \ P)]fl = the isolated ( P i , . . . , P„)-component of I in R. An irredundant primary decomposition of 7 is an expression of the form 7 = ni
(CONVENTION: ni
where n G N and Qi is a primary ideal in R for 1 < i < n

(t)

such that n j e { i n}\{i}Qj *t- Qi f° r a n i G { 1 , . . . , n } , and upon letting radijQi = Pi we have Pj ^ Pj for alH ^ j in { 1 , . . . , n } .

A (not necessarily irredundant) primary decomposition of 7 is an expression of the form I = ni
(tt)

(CONVENTION: f)i
where n G N and Qi is a primary ideal in R for 1 < i < n and radftQj = Pj for 1 < % < n.

Given (ft), first by (2*) of (08) we arrange the fourth line of (f) to hold, and

§6: PRIMARY

DECOMPOSITION

135

then by discarding some of the Qi we arrange the third line of (f) to hold. Thus from a primary decomposition we extract an irredundant primary decomposition. Therefore by (6*) and (7#) of (015) we get the first sentence of the following Decomposition Theorem (T5) concerning the existence of (|); by (12*) and (13*) of (020) and (35*) of (026) we get the second sentence of (T5) and also Decomposition Theorem (T8). For primes occurring in (f), by (8.1*) to (8.3*) of (016) we get the following Uniqueness Theorem (T6), and for the primaries occurring in (f), by (10*) and (11*) of (019) we get the following Partial Uniqueness Theorem (T7). PRIMARY DECOMPOSITION THEOREM FOR IDEALS (T5). If R is noetherian then every ideal I in R has an irredundant primary decomposition. Without assuming R to be noetherian, if for an ideal I in R we have that (f f f)

vspecRl is a finite subset of mspec(P)

then I has an irredundant primary decomposition (f) for which Qin---nQn

=

Qi...Qn-

PRIME UNIQUENESS THEOREM FOR IDEALS (T6). In decomposition (f), nvspecft/ = na.ssp(R/I) and &SSR(R/I) = { P i , . . . , P n } . If R is noetherian then in decomposition (f), a.ssR(R/I) = tassR(R/I). If a nonunit ideal I in R satisfies (f f f) then in its decomposition (f), nvspecRI = nass#(P/7) = assR(R/I). PRIMARY UNIQUENESS THEOREM FOR IDEALS (T7). In (ft) (and hence in (f)) let 1 < i(l) < i(2) < • • • < i(h) < n be any sequence of integers with h G N+ such that: j G { l , . . . , n } with Pj C P ^ ) for some I G {l,...,h} =$• j G {i(l),...,i(h)}; then we have [I : ( P i ( i ) , . . . ,Pi(h))]R = ^i S 0 Pm = 0; then we have [/ : S]R = C\i
136

LECTURE

L4: VARIETIES

AND

MODELS

§6.1: P R I M A R Y D E C O M P O S I T I O N FOR MODULES Turning to a submodule U of an P-module V, the radical of U in V is defined by radyt/ = {r £ R: reV C U for some e 6 N + } . C/ is primary means U ^ V and: r € P and s £ V \U with rs € £/ => r £ r&dyU. If U is primary then radyt/ G spec(P) and we say that (7 is (rady[/)-primary. We say that U is P-quasiprimary if P G spec(P) with U ^V and: r € P and s£V\U with rs G U => r G P ; then U is P-primary iff U is P-quasiprimary and P C radyC/; see (013). For any a G P we put ([/ : a)v = {v G V : at> G U}. For any a G V we put ([/ : a)R = {r G R : ra G C/}. We put &ssR(V/U) = {P G spec(P) : ([/ : O)R is P-primary for some a £ V} and tassfi(V^/C/) = { ? £ spec(P) : (U : O,)R = P for some a G V}; we also put n a s s ^ V / t / ) = set of all minimal members of &SSR(V/U); members of &ssn(V/U) and nassfl(V/{7) are called associated primes and associated minimal primes of U in V respectively. Note that ann^(V r /f/) = {r £ R : rV C U}. For any multiplicative set S in R we put [U : S]v = Uaes{U : a)v = the isolated 5-component of U in V. For any P G spec(P) we put [U : P]y = [U : P \ P]v = the isolated P-component of U in V. More generally for any P i , . . . , P„ in spec(P) with n £ N+ we put [U : ( P i , . . . , P„)]v = [U : C\i
(to

(CONVENTION: ni
where n £ N and Qi is a primary submodule of V for 1 < i < n such that r\j£{x

n}\{i}Qj

€• Qi f° r aH « £ {1, • • • , n } , and

upon letting x&dyQi — Pi we have Pj ^ P, for alH ^ j in { 1 , . . . , n}. A (not necessarily irredundant) primary decomposition of U is an expression of the form U = D^KnQi

(ttO

(CONVENTION: n ^ o Qi = V)

where n £ N and Qi is a primary submodule of V for 1 first by (2'*) of (013) we arrange the fourth line of ( f ) to hold, and then by discarding some of the Qi we arrange the third line of ( f ) to hold. Thus from a primary decomposition we extract an irredundant primary decomposition. Therefore by (6*) and (7*) of (015) we get the first sentence of the following Decomposition Theorem (T5') concerning the existence of {]'); by (12*) and (14*) of (020) we get its second sentence and also the Decomposition Theorem (T8'). For primes occurring in ( f ) , by (8.1*) to (8.3*) of (016) we get the following Uniqueness Theorem (T6'), and for the primaries occurring in (f), by (10*) and (11*) of (019) we get the following Partial Uniqueness Theorem (T7')-

§7:

LOCALIZATION

137

PRIMARY DECOMPOSITION THEOREM FOR MODULES (T5')- If V is noetherian (i.e., every submodule of V is finitely generated) then every submodule U of V has an irredundant primary decomposition. Without assuming V to be noetherian, if for a submodule U of V we have that (f f f)

vspecR(annR(V/U))

is a finite subset of mspec(i?)

then U has an irredundant primary decomposition ( f ) . PRIME UNIQUENESS THEOREM FOR MODULES (T6 7 ). In decomposition (f), nvspecR{annR{V/U)) = nassR(V/U) and assR{V/U) = { P i , . . . , P n } . If R is noetherian then in decomposition ( f ) , assR(V/U) — tassR(V/U). If a submodule U of V with U ^ V satisfies (f f f ) then in its decomposition ( f ) , nvspecR(annR(V/U)) = nassR(V/U) = assR(V/U). PRIMARY UNIQUENESS THEOREM FOR MODULES (T7'). In (ft') (and hence in (f)) let 1 < i(l) < i(2) < • • • < i(h) < n be any sequence of integers with fceN+ such that: j £ { 1 , . . . ,n} with Pj C P^i) for some I G { 1 , . . . , h} => j £ {i(l),...,i(h)}; then we have [U : ( P i ( 1 ) , . . . ,Pi(h))]v = C)i 5 n Pm = 0; then we have \U : 5]v = rii
LECTURE H: VARIETIES AND MODELS

138

recall that a multiplicative set in R is a subset S of R with 1 £ S such that: s £ S and s' £ S => ss' G 5. Still more generally we may let any multiplicative set 5 in R play the role of denominators and thereby obtain the localization Rs of R at S thus. We define i?g to be the set of all equivalence classes of pairs (u,v) £ Rx S under the equivalence relation: (u, v) ~ (u\ v') 4=> v"{uv'-u'v) = 0 for some v" £ S. The equivalence class containing (u,v) is denoted by u/v or ^, and we add and multiply the equivalence classes by the rules ^ + &• = »i«a+"a«i and ^ x ^ = H i a . Also we "send" any •*

V\

V2

V1V2

^1

V2

VlV-2

J

u £ R to u/\ G i?s- This makes i?s into a ring and u i - m / i gives the "canonical" ring homomorphism (j): R -> AsClearly 0(5) C ?7(fls)

and

ker(<£) = [0 : S]R

where we recall that for any ideal / in R we have defined

{

[/ : S]R

= {r £ R : rs G / for some s £ S} = the isolated 5-component of / in R.

The localization of R at SR(R) is called the total quotient ring of R and denoted by QR(i£). Clearly [0 : SR(R)]R = 0 and hence the canonical map of R into QR(i?) is injective. By "identifying" every u £ R with u / 1 G QR(ii), we may and we shall regard QR(.R) to be an overring of R. Note that then SR(R) C U(QR(R)) and every element of QR(i?) is of the form u/v with u £ R and v G SR(R); also note that QR(i?) is its own total quotient ring. Conversely, if T is an overring of R with SR(R) C U{T) then clearly there is a unique i?-injection of QR(i?) into T whose image is the set of all u/v with u £ R and v £ SR(R); we may call this image the total quotient ring of R in T and again denote it by QR(i?). Note that if R is a domain then QF(f?) = QR(.R), and in this case the said image may be called the quotient field of R in T. In particular we may talk of the quotient field of a domain in an overfield. Let us revert to the general multiplicative set S in R and note that: ker() = 0 <=> S C SR(R). If S C SR(R) then clearly there is a unique i?-injection of Rs into QR(i?); we call its image the localization of R at S in QR(i?) and continue to denote it by Rs; let us call this the GOOD case; thus in the good case Rs is an overring of R. In the general case the passage from R to Rs can be achieved in two steps. First we take the residue class map a : R —» R/[0 : S]R. Now a(S) is a multiplicative set in a(R) with a(S) C 5 Q ( fl )(a(i?)) and we take a(R) c Q = a{R)a(s) a s m the good case, and we consider the map 7 = /3a : R —> Q where 0 : a(R) —» Q is the natural injection. Now we may identify Rs with Q via the unique isomorphism ip : Rs —> Q such that ip4> = 7.

§7: LOCALIZATION

139

The isomorphism ip is generalized in the following Characterization Theorem (T9) for the localization Rs- In Theorems (T10) to (T12) we give some basic properties of localization. CHARACTERIZATION THEOREM FOR LOCALIZATION (T9). Let 7 : R —> Q be a ring homomorphism. Then: there is an isomorphism ip : Rs —> Q with ipcfr = 7 <4> ker(7) = [0 : S]R with j(S) C U(Q) and every element in Q can be written as 7(u)/7(v) with u £ R and v £ S. Moreover, if 7(5) C U(Q) then there is a unique homomorphism ip : Rs —• Q with %l>4> = 7. PROOF. Straightforward. TRANSITIVITY OF LOCALIZATION (T10). Let 7 : R -» Rs> and <j>' : Rs ^ (Rs)t(S') D e the canonical maps where 5 ' is a multiplicative set in R with 5 C S". Then there is a unique isomorphism ip : (Rs)ct>(S') ~* Rs1 with V>' is an isomorphism. PROOF. Follows from (T9). PERMUTABILITY OF LOCALIZATION AND SURJECTION ( T i l ) . Let S : R —> R' be a ring epimorphism with ker(<$) D S = 0, and consider the map 7 = 'S : -R —+ ^ ( ^ i f s ) where $' : 5(R) —> S(R)s(s) 1S the canonical homomorphism. Then there exists a unique epimorphism ip : .Rs —> S(R)s^s) with ^i<^ = 7. PROOF. Follows from (T9). IDEAL CORRESPONDENCE FOR LOCALIZATION (T12). For any ideal I in R let 7s = (s) : (t,s) £ I x S}. Moreover:

is^Rs^ ms = ®. (b) For any ideal I in R we have I[s] = ~1(Is)(c) An ideal I in i? is a contracted ideal (i.e., for some ideal J in Rs we have I = 0 - i (j)) i f f / e C . (d) Every ideal J in Rs is an extended ideal (i.e., for some ideal I in R we have J = 0(J)fl s ). (e) I *—> Is gives a bijection of C onto E, and its inverse is given by J 1—> 0 _ 1 ( J ) . The sets C and £" are closed with respect to the ideal theoretic operations of radicals, intersections, and quotients, i.e., for instance: I £ C => TSARI £ C; recall that quotients = parenthetical colons, where for ideals I,H in R we have

LECTURE L4: VARIETIES AND MODELS

140

(I : H)R = {r £ R : rh £ / for all h £ H}. Moreover, the said bijection commutes with these operations, i.e., for instance: I G C =$• r a d # s i s = (radR-T)s. (f) If P and Q are ideals in R such that P is prime and Q is P-primary with Q n S = 0 then: P D S = 0, P and Q belong to C and both contain ker(^), Ps is prime, and Qg is (Pg)-primary. (g) A primary ideal, and hence in particular a prime ideal, in R is a contracted ideal iff it belongs to C iff it is disjoint from S. Moreover, P i-> Ps gives a bijection of C n s p e c ( P ) onto spec(P,s). Likewise, given any P € Cnspec(ii), Q H-> QS gives a bijection of all P-primaries in R onto the set of all (Ps)-primaries in Rs, and this bijection preserves intersections and quotients. (h) Given an irredundant primary decomposition / = Di S fl P m = 0. Then Is = <^i<j) = the intersection of all primary ideals in R which are disjoint from S, and also ker(^) — the intersection of those primary ideals which occur in an irredundant primary decomposition of 0 in R and which are disjoint from S. PROOF OF (a). By the definition of Rs, or by (T9), every element of Rs can be written as (j)(r)/<j>(s) with (r,s) G fix S. Therefore, since Is = (j>(I)Rs, every element of Is can be written as a finite sum Ylict>{ti)(i>{ri)l{si)- Now this sum equals (t)/(j)(s) :(t,s) £ I x S}. It follows that

1G Is » 1 = 4>{t)I'4>(s) for some (t,s) £ I x S < •£> (t - s) — 0 for some (t,s) £ I x S <=> s'(t - s) = 0 for some (t, s,s') £ I x S x S

<^/nS^0. V

Consequently: Is ^ Rs & I n 5 = 0.

because ker() = [0 : S]R

§7:

141

LOCALIZATION

PROOF OF (b). For any r G R we have

•& 4>(r) G i s <=> (r) = (t)/(s) for some (t,s) £ I x S

by (a)

O (rs - t) = 0 for some (t, s) e I x S <=> s'(rs - t) = 0 for some (£, s,s')e . ^ G PROOF

I xS xS

by (T9)

HI-

OF (c). If I G C then, upon letting J = 7 S , by (b) we see that I is

contracted. Conversely suppose I = (j)~l{J) for an ideal J in Rs- Then Is C J and hence 0 _ 1 ( / s ) C <j>~l{J) = 7. Therefore by (b) we get /[$] c / and hence J[s] = P PROOF OF (d). Given an ideal J in Rs let 7" = _1(J). By (T9), any u G J can be expressed as u = (f)(r) / (f>(s) with (r, s) £ Rx S, and multiplying this by 0(s) we get (^(r) G J. Consequently r £ / and hence u G i s - Thus J C Is and obviously i s C J. Therefore J = Is. PROOF OF (e). In view of (c) and (d), this follows from the following Lemma (T13.3). PROOF OF (f). Let P and Q be ideals in R such that P is prime and Q is P-primary with Q n S = 0. Then s G P n S = * > s e G Q n S f o r some e G N + which contradicts the assumption that Q D 5 = 0. Consequently P n S = 0, and hence P and Q belong to C, and therefore by (c) they are contracted ideals and hence they contain ker((f>). Now by (a) and (e) we see that Ps C r&dRsQs ^ Rs and hence, in view of PIC = (07), for proving that Ps is prime and Qs is (Ps)—primary, it suffices to show that: x G Rs and x' G Rs\ Qs w r t n xx' G Qs =$• x G P s - Given a;, a;' as postulated, by (a) we can write x = {s),x' = (s'),xx' = <j>(r")/4>(s") with r, r', r" in R and s, s', s" in S such that r' $ Q and r" G Q- Equating two values of xx' we obtain (j>(rr')/(j>(ss') = (r)/<j>(s) G P s . PROOF OF (g). In view of (c), (e) and (f), this follows from the following Lemmas (T13.1) and (T13.2) together with the additional observation that, say by (08), if Q, Q are P-primaries then so are QnQ and (Q : Q)R, except that: if Q C Q then (Q :Q)R = R and Qs C Qs with (Qs : QS)RS = Rs = RePROOF OF (h). Let 7 = ^i<j
142

LECTURE L4: VARIETIES AND MODELS

the second equation is an irredundant primary decomposition in Rs and we have 1 = Is- Also clearly: I = 7 <^> S n Pi = 0 for 1 (s) with (t, s) e 7 x S. We can take s' = Ilie{i n}\{*(i) i(h)} si w i t h s'i e s ' n Q/ s't G J and hence by (a) we get x = <j>(ts')/(j>(ss') G Is-

and

then

PROOF OF (i). Follows from (d). PROOF OF (j). Assuming R to be noetherian, by (T9) we know that ker(^) = Os and, in view of (T5), by taking I — 0 in (h) we see that ker(0) = the intersection of those primary ideals which occur in an irredundant primary decomposition of 0 in R and which are disjoint from S, and hence by (g) we see that ker(^) = the intersection of all primary ideals in R which are disjoint from S. LEMMA ON CONTRACTED AND EXTENDED IDEALS (T13). Let 0 : P -> T be a ring homomorphism. For any ideal I in R let Ie — <j>(I)T, and for any ideal J in T let J c = 0 _ 1 ( J ) ; note that if 9 is the natural injection of R into an overring T then Ie = IT and Jc = J n R. Let C be the set of all contracted ideals in R (relative to 9), i.e., ideals which can be expressed as J c with ideal J in T. Let E be the set of all extended ideals in T (relative to 9), i.e., ideals which can be expressed as Ie with ideal I in R. Then we have the following. (T13.1) For any ideals I C 7 in R we have Ie cT. For any ideals J C J in T we have Jc C J°. For any ideal / in R we have / c (Ie)c and Ie = ((/ e ) c ) e . For any ideal J in T we have ( J c ) e C J and J c = ((J c ) e ) c . (T13.2) If P, Q are ideals in T such that P is prime and Q is P-primary, then Pc is prime and Qc is (P c )-primary. If J = C\i Ie gives a bijection of C onto E, and its inverse is given by J >—> J c . The set C is closed under the ideal theoretic operations of radicals, intersections, and quotients. Moreover, if the set E is also closed under one of these three operations then the said bijection commutes with it. Finally the set E is always closed under the ideal theoretic operations of sums and finite products. PROOF. The proofs of (T13.1) and (T13.2) being straightforward, we shall prove (T13.3). Clearly I € C => I = {Ie)c- Also clearly J € E => J = ( J c ) e . Therefore / — i » Ie gives a bijection C —> E whose inverse is given by J — i > Jc. For any ideals / and J in R and T respectively, we clearly have (T13.4)

(rad f l /) e C rad T (7 e )

and

(rad T J) c = rad f l (J c ).

From the second part of the above display it follows that C is closed under the op-

143

§7: LOCALIZATION

eration of radicals, and if E is also closed under it then the said bijection commutes with it. For any families of ideals (Ii)ieL and (Ji)ieL in R and T respectively, we clearly have

(T13.5)

( r w , - ) ' c nj€LJf

(n,eLJj)c = nleLjf

and

and (T13.6)

(ZieLlir

= TieLIf

and

£ i e L J? C ( £ ; € L J ; ) C

and if the indexing set L is finite then we also have (T13.7)

O W 0

e

= r W f

and

UieL ^ ^ (UieL Ji)°

From the second part of (T13.5) it follows that C is closed under the operation of intersections, and if E is also closed under it then the said bijection commutes with it. From the first parts of (T13.6) and (T13.7) it follows that E is closed under the operation of sums and finte products respectively. To turn to quotients let I, I be any ideals in R and let J, J be any ideals in T. Then clearly (J : 7)% C (Ie : T)T

(T13.8)

and

C (Jc :

(J : lfT

T)R.

We claim that J£E^-(J:J)CT

(T13.9)

= (JC:

T)R.

Namely, assuming J e E, we have (Jc : T)R1

= (Jc : T)R{Ty

= ((J c : T)R{T)f

C (Jc)e

(where the first equation is because of the assumption J € E and the second equation is because of the first part of (T13.7)) and hence we get

(Jc:jyRC(J:J)T and therefore we have ((J c : T)R)C

C ((J :

J)T)C

and hence, because obviously ( J c : J )« C ((J c : J )eR)c, we get ( J c : J C ) f l C ((J : J)T)C and therefore, because of the second part of (T13.8), we conclude that ( J : J)CT = (Jc :

T)R.

144

LECTURE L4: VARIETIES AND MODELS

This proves (T13.9). By taking J = Ie and J = f in (T13.9) we see that (T13.10)

( / : 7

/eCand7GC^{

2f

G C a n d

_

By (T13.10) we conclude that C is closed under quotients, and if E is also closed under it then the said bijection commutes with it. §7.1: LOCALIZATION AT A P R I M E IDEAL Given any prime ideal P in R we get a particularly interesting case of localization by taking S = R\P. We put Rp =

RR\P

and call this the localization of R at P. Let $ : R^

RP

be the canonical map and note that now $(R \P)C

U(RP)

and

ker($) = [0 : P]R

where, for any ideal I in R, by definition [I:P]R

= [I:R\P]R = {r € R : rs € I for some s £ R\P} = the isolated P-component of I in R.

Note that if ZR(R) C P, where ZR(R) is the set of all zerodivisors in R, then we are in the good case, and Rp may be regarded as a subring of QR(-R). The following Theorems (T14) to (T17) follow by taking S = R\P in the above Theorems (T9) to (T12) respectively. CHARACTERIZATION THEOREM FOR PRIME LOCALIZATION (T14). Let 7 : R —> Q be a ring homomorphism. Then: there is an isomorphism ip : RP -> Q with ip$ = 7 <S> ker(7) = [0 : P}R with >y(R \ P) C U(Q) and every element in Q can be written as 7(u)/7(i>) with u G R and v £ R\P. Moreover, if j(R \ P) C C/(Q) then there is a unique homomorphism ip : Rp —> Q with •0$ = 7. TRANSITIVITY OF PRIME LOCALIZATION (T15). Let 7 : R -> i? P / and $ ' : JRP —> (Rp)^(p') be the canonical maps where P ' is a prime ideal in R with P ' C P . Then there is a unique isomorphism tp : (Rp)<s>{p>) -* Rp> with ^>$'$ = 7.

§7.1: LOCALIZATION AT A PRIME IDEAL

145

PERMUTABILITY OF PRIME LOCALIZATION AND SURJECTION (T16). Let 5 : R —• R' be a ring epimorphism with ker(5) c P , and consider the map 7 = $'<5 : Pi —> 5(P)<5(P) where $ ' : <5(P) —> (5(P)<s(p) is the canonical homomorphism. Then there exists a unique epimorphism tji : Rp —> £(P),5(p) with ?/>$ = 7. IDEAL CORRESPONDENCE FOR PRIME LOCALIZATION (T17). For any ideal 7 in R let Ip = 3>(I)Rp, and note that the two meanings of Rp do coincide. For any ideal 7 in P let 7jp] = [I : P]p, and let C be the set of all ideals 7 in R for which 7[pj = 7. Finally let E be the set of all ideals in Rp. Then we have the following. (a) For any ideal 7 in R we have IP = ($(*)/$(s) : (t,s) € I x (R\ P)}. Moreover: IP ^ RP <=> 7 C P . (b) For any ideal 7 in R we have 7[pj = $ _ 1 ( 7 p ) . (c) An ideal 7 in R is a contracted ideal (i.e., for some ideal J in Rp we have 7 = $ - i ( J ) ) iff J e C . (d) Every ideal J in P p is an extended ideal (i.e., for some ideal 7 in R we have J = $(/)PP). (e) 7 (—• Zp gives a bijection of C onto 75, and its inverse is given by J H-» $ _ 1 ( J ) . The sets C and £ are closed with respect to the ideal theoretic operations of radicals, intersections, and quotients, i.e., for instance: I € C => radp,/ S C; recall that quotients = parenthetical colons, where for ideals I, H in R we have (I : H)R = {r 6 R : r/i € 7 for all h G 77}. Moreover, the said bijection commutes with these operations, i.e., for instance: 7 G C =>• radp p 7p = (rad#7)p. (f) If P and Q are ideals in R such that P is prime and Q is P-primary with Q C P then: P C P , P and <5 belong to C and both contain ker($), P F is prime, and QP is (Pp)-primary. (g) A primary ideal, and hence in particular a prime ideal, in R is a contracted ideal iff it belongs to C iff it is contained in P . Moreover, P 1—> Pp gives a bijection of Cflspec(P) onto spec(Pp). Likewise, given any P e Cflspec(P), Q y-> QP gives a bijection of all P-primaries in R onto the set of all (Pp)-primaries in P p , and this bijection preserves intersections and quotients. (h) Given an irredundant primary decomposition 7 = Di P m C P . Then 7p = r\i<j
146

LECTURE H: VARIETIES AND MODELS

Some obvious consequences of the above Theorem (T17) are worth repeating as: LOCAL RING CONSTRUCTION THEOREM (T18). RP is a quasilocal ring, i.e., a ring with a unique maximal ideal M(Rp). We have M(RP)

= (P)RP

with

^(MiRp))

= P.

The mapping P i-> $(P)RP gives a bijection of {P G spec(i?) : P C P} onto spec(i?p) whose inverse is given by $ _ 1 , and hence in particular, referring to (022) for the definitions of dimension and height, we have dim(i?p) = h t ^ P . If R is noetherian then Rp is a local ring, i.e., a noetherian quasilocal ring. If ZR(0) C P (resp: if R is a domain) then regarding RP to be a subring of QR(i?) (resp: of QF(i?)), for any ideals I and J in R and Rp we have Q(I)RP

= IRP

and

$-*( J) = J n R.

§8: A F F I N E VARIETIES Postponing the discussion of varieties in projective space to a later opportunity, we shall now discuss varieties in affine space. Let k c

K

CA

be fields and let iV be a nonnegative integer. Consider the affine iV-space A^ = KN = {a = (ai,...,

aN) : cci € K for 1 N,k = k[Xi,.

over k. For any J c

BJV,A

..,XN]

let

V K (J) = { Q £ A f : f(a) = 0 for all / G J } where for / = f(X\,..., Xfj) G BN,\ and a = (a\,..., ajv) G A ^ we are writing / ( a ) = f(oti,..., ajv). We call this the affine variety defined by J. For polynomials / , g,... in BN<\ we may write YK(f, g,...) in place of V K ({/, g,...}) and we may informally call this the affine variety / = g = • • • = 0. By a variety in A ^ defined over k we mean a subset of A ^ which can be expressed in the form VK(J) for some J C R>N,k- We put avtfc(A^) = the set of all varieties in A ^ defined over k. Members of avtfc(A^) may be called varieties in Aj^. For any U C A ^ we put MU) = {/ G BN:k : f(a) = 0 for all

aeU}

§8: AFFINE VARIETIES

147

and we note that this is an ideal in Bjv,fc; we call it the ideal of U in B^tk- For points a,/3,... in A ^ we may write lk(a,P,...) in place of lk{{a,(3,...}). Given U,U' in avtfe(A^), U' is a subvariety of U means U' C U, and U' is a proper subvariety of U means U' C U with U' ^ U. Given U in avtfc(A^), [/ is reducible means ?/ is the union of two proper subvarieties; U is irreducible means it is nonempty and nonreducible. We put iavtfc(A^) = the set of all irreducible members of avtfc(A^). Referring to (022) for the definition of the dimension dim of a ring, for any U in avtjfc(A^) we define the dimension of U by putting dim(U) =

dim(BN,k/lk(U)).

Given U £ avtfc(A^), for any / e BN / ( a ) and, letting a vary only over U, this gives a polynomial map U —> /t. Since two such maps on {/ coincide iff the difference of the corresponding members of B;v,fc belongs to the ideal Ifc(i7), the ring of all these K-valued polynomial maps on U may be identified with the residue class ring Bjvjfc/Ijt(C/). Indeed this is the basic motivation behind the concept of residue class rings. We denote -Bjv,fc/2fc(f/) by k[U]* and call it the affine coordinate ring of U. In case lk{U) is not the unit ideal in i?jv,/t, we may and we do identify k with a subfield of k[U]* and we note that then, according to (027), k[U]* becomes an affine ring over k, and hence in particular it is a noetherian ring. In case U € avtfc(A^) is such that !&([/) is a prime ideal in i?;v,fc, we denote the quotient field of k[U}* by k{U)*, and we call k(U)* the function field of U (over k). We denote the localization of BN,W at lk(U) by Rk(U)*, and call it the local ring of U (over A;); note that by (T18) this is indeed a local ring. We denote the residue field Rk(U)*/M(Rk(U)*) of Rk(U)* by k{U)\ and we call this field fc(t/)» the alternative function field of U (over fc); again we may and we do identify k with a subfield of k(U)K Let $[/ : BNtk ~> k[U)* and *c/ : RkiP)* -> k{U)* be the residue class epimorphisms. Now clearly there is a unique homomorphism ifru '• Rk{U)* —» k(U)* such that for all / G BN,k and g € BNtk\h(U) we have ipu(f/g) = $u(f)/$u(g)Obviously im(ipu) = k(U)* and kev(ipu) = M(Rk(U)*) and hence there is a unique isomorphism (f>u : k{U)^ —> k(U)* for which we have 4>u^u = i>u- We call ipu and <j>u the natural epimorphism and the natural isomorphism respectively. Note that A^ C A^, and the ideal Ifc(a) of any point a in A ^ is the maximal ideal in B^,k generated by Xi - a\,..., XN — OLN\ also VK(Ifc(a)) = {a}, and writing a in place of {a} we get k = k[a\* = k(a)* C Rk(ct)* = the local ring of a. The above notions are particularly relevant when k or K is algebraically closed, or more precisely, when K contains an algebraic closure of k. To take care of fields which may not be algebraically closed, we recall that analogously, for any subset J of any ring A, in (016) we have defined the spectral variety, the maximal spectral

LECTURE L4: VARIETIES AND MODELS

148

variety, and the minimal spectral variety of J in A by putting vspec A J = { P e spec(i4) : J C P) and mvspec^ J = mspec(A) D vspecAJ and nvspec^ J = the set of all minimal members of vspec^ J where spec(A) and mspec(J4) are the sets of all prime and maximal ideals in A respectively. Moreover, for any U C spec(A) we have defined the spectral ideal of U in A by putting ispec^f/ = C\pei/P and we have put rd(.A) = the set of all radical ideals in A i.e., ideals which are their own radicals. We have also put svt(j4) = the set of all spectral varieties in A i.e., the set of all subsets of spec(yl) of the form vspecAJ with J varying over the set of all subsets of A; members of svt(A) may be called varieties in spec(A). Now we put msvt(j4) = the set of all maximal spectral varieties in A i.e., the set of all subsets of mspec(yl) of the form mvspec^*/ with J varying over the set of all subsets of A; members of msvt(A) may be called varieties in mspec(j4). Given U, U' in msvt(A) or svt(A), U' is a subvariety of U means U' C U, and U' is a proper subvariety of U means U' C U with V ^ U. Given U in msvt(A) or svt(A), U is reducible means U is the union of two proper subvarieties; U is irreducible means it is nonempty and nonreducible. We put imsvt(A) = the set of all irreducible members of msvt(A) and isvt(A) = the set of all irreducible members of svt(A). For any U in msvt(A) or svt(A) we define the dimension of U by putting dim((7) = dim(A/ispec A [/). If A is an affine ring over k then for any U in msvt(A) or svt(A) we denote A/ispecA(U) by k[U]* and call it the affine coordinate ring of U. In case ispec^C/

§S: AFFINE

VARIETIES

149

is not the unit ideal in A, we may identify k with a subfleld of k[U]* and we note that then k[U]* becomes an affine ring over k, and hence it is a noetherian ring. In case ispec^C/ is a prime ideal in A, we denote the quotient field of k[U]* by k(U)*, and we call k(U)* the function field of U (over k). We denote the localization of A at ispec^J/ by Rk(U)* and call it the local ring of U (over k); we note that by (T18) this is indeed a local ring. We denote the residue field Rk(U)*/M(Rk(U)*) by &(£/)* and we call it the alternative function field of U (over k); we may and we do identify k with a subfleld of k{U)K Let $ y : A -> k[U\* and *c/ : Rk{U)* -> fc(C/)" be the residue class epimorphisms. Clearly there is a unique homomorphism V>u : Rk{U)* —> &(£/•)* such that for all / S A and 5 € A \ ispec^t/ we have < M / / 3 ) = $ u ( / ) / $ u ( 0 ) . Obviously im(V>c/) = fc(E/)* and ker(V^) = M(Rk(U)*) and hence it follows that there is a unique isomorphism [/ : &([/)* —> k(U)* for which we have (pu^u = i>u- We call I/JU and <j>u the natural epimorphism and the natural isomorphism respectively. As GEOMETRIC MOTIVATION for the definitions, consider the variety / = 0 in A ^ where / is a nonconstant polynomial in Bx,k- For N = 1 this is a point-set (= a finite set of points) on the line, for N = 2 it is a plane curve, for N = 3 it is a surface, for N = 4 it is a solid, and in general it is a hypersurface, i.e., something defined by a single equation. For instance, by taking / = Xf + • • • + Xfj — 1 we get a point-pair, a circle, a sphere, a solid ball, and so on. Intuitively, a point-set is zero-dimensional, a curve is one-dimensional, a surface is two-dimensional, a solid is three-dimensional, ..., a hypersurface is (N — l)-dimensional. The hypersurface is irreducible if the polynomial / is irreducible. This gives rise to the INTUITIVE DEFINITION of an r-dimensional irreducible algebraic variety U in the affine AT-space A ^ as a geometric object which can be parametrized by an irreducible hypersurface in (r + l)-space. Here by a geometric object we mean something defined by a bunch of polynomial equations. Let P be the ideal generated by these polynomials in Bjv.fc- The irreducibility of U suggests that we require P to be a prime ideal. The dimensionality r of U suggests that we require it to equal trdegfcfc(t/)* where trdegfc denotes transcendence degree over k and k{U)* is the quotient field QF(k[U}*) of k[U}* = BNik/P. Referring to (022) for the definitions of the depth dpt and the height ht of an ideal, eventually we shall prove the following [cf. L5§5(Q10)(T47)]: DIMENSION THEOREM (T19). For any prime ideals P C P' in B = BN: B —• B/P be the residue class epimorphism, we have trdeg f c QF(5/F) = dim(B/P)

= dptBP

= N-

htBP

and trdeg fc QF(B/P) = htHB)4>{P') + dptHB)
150

LECTURE L4: VARIETIES AND MODELS

Eventually we shall also prove the following [cf. L5§5(Q32)(T144)]: PRIMITIVE ELEMENT THEOREM (T20). If k is algebraically closed then for any prime ideal P in B = B^,k: upon letting : B —> B/P be the residue class epimorphism, and after changing the variables by a linear transformation, it can be arranged that (Xi),... ,<j>(Xr) is a transcendence basis of QF(B/P) over k and either N = r or <j>(Xr+\) is a primitive element of QF(B/P) over QF((t>(Br(Brtk))((Xr+i)). [In the latter case we can find an irreducible polynomial F(Xi,... ,Xr+i) in P n 2?r+i,fc and then the irreducible variety U in A^ defined by P is "parametrized" by the irreducible hypersurface F = 0 in A£ +1 ; in the former case U is "parametrized" by A£]. Now (T20) provides the rationale behind calling the transcendence degree as dimension. In turn (T19) connects it to the ring-theoretic definition of dimension which we have chosen for U in avt or msvt or svt. At any rate, as suggested by the above GEOMETRIC MOTIVATION, in avt or msvt or svt, we call U a point-set or a curve or a surface or a solid according as dim(f/) = 0 or 1 or 2 or 3 respectively. Another rationale for choosing the ring-theoretic definition of dimension can be found in the following Theorems (T21) and (T22) according to which: a curve has points on it, a surface has curves on it, a solid has surfaces on it, and so on. Most parts of the following Inclusion Relations Theorem (T21) for the operators V, I, vspec, and ispec are straightforward. We shall soon complete the proof by establishing the remaining parts [see the cf. between (T21) and (T22)]. INCLUSION RELATIONS THEOREM (T21). Recall that A is any ring. Let '(R,S,V,V',V",I)

stand for

^ (5 iV;fc ,A^,V K ,avt fc (A^),iavt fc (A^),I fc ) or (A, mspec(j4), mvspec A , msvt (A), imsvt(^l), ispec^) or (A, spec(A), vspec A , svt(A), isvt(A), ispec^) and let I' be the set of all ideals J in R such that J = I(U) for some U C S. Then we have the following. (T21.1) For any subsets J and J' of R we have: J C J' =4> V(J') C V(J), and we have: rad f i (Ji?) = radfl(J'H) =5> V(J) = V(J'). (T21.2) For any subsets U and U' of S we have: U C U' => I(U') C I(U). (T21.3) For any family of ideals (Ji)ieL in R we have V(J2leL J{) = n/ e z,V(Ji), and if the family is finite then we have V(r\i£L Ji) = V(ILeL Ji) = ^ieLV(Ji). (T21.4) For any family of subsets (Ut)ieL of S we have I(UleLUi) = DleLI(Ui). (T21.5) For any J c R we have J C I(V(J)) and: J = I(V(J)) •& J 6 / ' => J = radR(JR).

§S; AFFINE

VARIETIES

151

(T21.6) For any U C S we have U C V(I{U)) and: U = V(I(U)) &UeV. (T21.7) V" = {UeV : I(U) G spec(ii)}. (T21.8) Assuming R is noetherian (this is certainly so in case R = BN,H), every U € V can be expressed as a finite union U = Ui ispecBiV U gives a bijection svt(S;v,fc) —* rd(j9jv,fc) whose inverse is given by J t—> vspec Bjvfc J. Now this assertion is the eighth amongst the following eight equivalent versions (T22.1) to (T22.8) of the famous Hilbert Nullstellensatz which we shall soon prove [cf. (T49) and (T50) of L5§5(Q11), and the material between (T47) and (C15) of L5§5(Q10]. HILBERT NULLSTELLENSATZ (T22). (T22.1) A field which is an affine ring over k is algebraic over k. (T22.2) For any maximal ideal J in BAT,*:, the field B^^/J is algebraic over k, i.e., over the image of k under the residue class map B^,k —> B^^/J. (T22.3) For any maximal ideal J in B^,k there is a unique set of ./V generators (fi(Xi, • • •, Xi) G Bi,k)i
X?i+gi(X1,...,Xi)

=

where n, is a positive integer and gi{X\,..., fe&Xj9i{Xi,.

-.,Xi)<

Xi) G Bj,fe with n-j

for

1 < j < i.

(T22.4) If k is algebraically closed then the mapping a H-» Ij.(a) gives a bijection A-^ —> mspec^A^ifc). (T22.5) If K contains an algebraic closure of k then for any ideal J in Bjv.fc we have that Ifc(VK(J)) = ia,dBNkJ. (T22.6) If K contains an algebraic closure of k then the mapping U <-^> fk(U) gives inclusion reversing bijections avtfc(A^) -> rd(Bjv,fc)

and

iavt fc (A^) -> spec(BN>k)

whose inverses are given by J — i » VK(J). (T22.7) The mapping U t—»ispecBjv kU gives inclusion reversing bijections msvt(i?jv,fc) —> rd(Sjv.fc)

and

imsvt(BN,k)

—> spec(BNtk)

whose inverses are given by J — i > mvspec Bw J. (T22.8) The mapping U H-> ispec Bjv kU gives inclusion reversing bijections svt(BN,k)

-> rd(BN>k)

and

isvt(Bjv,fc) -» spec(BN>k)

152

LECTURE H: VARIETIES AND MODELS

whose inverses are given by J — i > vspec Bjv J. §8.1: SPECTRAL A F F I N E SPACE Henceforth we shall consider points and varieties in A% rather than in A^. As we have already noted, the ideal Ifc(oj) of any point a in A% is the maximal ideal in Bjv.fe generated by X\ - a\,..., XN — a AT. In other words, a is the intersection of the TV hyperplanes Xi - a i = 0 , . . . , XN - a^ = 0. Also clearly a ^ a' in A^ => Ifc(a) ^ lk{a'). So via Ifc we may ENLARGE A ^ into spec(BAr,fc) which we may denote by (Aj^)CT and call it the spectral affine TV-space over k. The mapping a t-> Ifc(a) gives an injection A^ —• (&%)". The image of Aj^ under this injection may be denoted by (AN)P
where ss stands for isomorphism; here it is set-theoretic isomorphism, i.e., bijection. Now (T22.3) says that a point of (Aj^)*"7 is the intersection of TV hypersurfaces; moreover, these hypersurfaces are hyperplanes, i.e., they are of degree one, iff it is a point of (A%y. §8.2: MODELIC SPEC A N D MODELIC A F F I N E SPACE To describe another way of enlarging A^, for any domain A we put %J(A) = the set of all localizations Ap with P varying over spec(A) and we call this the modelic spec of A. By (T18) we see that Ap is a quasilocal ring for which M(AP) = PAP and M(AP) n A = P with dim(AP) = h t ^ P , and if A is noetherian then Ap is a local ring R. Thus P i-> Ap gives an inclusion reversing bijection of spec(^l) onto %3(A), and the reverse bijection is given by R *-* M(R)C\A. We denote VB{BNik) by (A%)s and call it the modelic affine TV -space over k. Now P \—» (T3;v,fc)p gives an inclusion reversing bijection (A^Y —> (AN)6 whose inverse is given by R i-> M(R) f~l BNtk. The images of (Af)"" and (A%y under this map may be denoted by (A%)pS and {ANYS and called the rational modelic affine TV-space over k and the minimal modelic affine TV-space over k respectively; note that because of the inclusion reversing property, (A^)MlS is the set of all minimal members of (A^)s. Now the isomorphisms (A?)" « (A?)"

and

( A ? ) " ' « (A?)"*

and

give rise to the enlargements A ? « (A»y* c (A?)"* C (A%)s.

(A»y

« (A?)'

§S.3: SIMPLE POINTS AND REGULAR LOCAL RINGS

153

Given any U £ (A^)CT, by (T22) we see that k(U)* = k &U £ (A^)pa whereas k(U)*/k is algebraic <^> U £ (Af )^CT; consequently, points in (A^)p(7 may be called rational and points in (A^)'" 7 may be called algebraic. Similarly, for any R £ (A%)s, after identifying k with a subfield of R/M(R), we have that R/M(R) = k & R £ (A%)p6 whereas (R/M(R))/k is algebraic &• R £ (A^)""5; consequently, points p6 in (A%) may be called rational and points in (Aj^)Ml5 may be called algebraic. By (T22) we also see that if K contains an algebraic closure of k then "points" of (A^)CT are the prime ideals of "irreducible varieties in A^ defined over k," and the corresponding "points" of (Af?)6 are their local rings. §8.3: SIMPLE P O I N T S A N D R E G U L A R LOCAL RINGS The embedding dimension of a local ring R is defined by putting emdim(-R) = the smallest number of elements which generate M{R). We shall soon prove the [cf. L5§5(Q7)(T29)]: DIM-EMDIM THEOREM (T23). For any local ring R we have emdim(i?) > dim(i?) and hence in particular (since R noetherian => M(P) is finitely generated) emdim(i?) and dim(i?) are nonnegative integers.

A local ring R is said to be regular if emdim(i?) = dim(.R). Concerning regular local rings we shall soon prove the following theorem, where we recall that a domain is normal means it is integrally closed in its quotient field (see L3§7), and for any element x in any quasilocal ring R we have put OIARX = max{i G N : x £ M(RY} where the max is taken to be oo if the set of i is unbounded (see L3§11). [cf. L5§5(Q4)(T7), L5§5(Q14)(T64), L5§5(Q15)(T71), L5§5(Q17)(T81.7)]. ORD VALUATION THEOREM (T24). Let R be any local ring. Then: (T24.1) For all 0 ^ x £ R we have ord R :r £ N, i.e., n i e N M ( i ? ) i = 0. (T24.2) If R is regular then for all nonzero elements x and y in R we have ordR(xy) = ordure + ord^y; consequently R is a normal domain and we get a valuation ord# : QF(R) —> Z by putting ordR(x/y) = ordfii - ordRy. [We shall continue to use this extended meaning of ord#]. (T24.3) For any x £ M(R) \ ZR(R) we have that: (i) R/{xR) is a local ring whose dimension is one less than the dimension of R, (ii) if R/(xR) is regular then ordfla; = 1, and (iii) if ord/ja: = 1 and R is regular then R/(xR) is regular.

154

LECTURE H: VARIETIES AND MODELS

[Note that if R is regular, or more generally if R is a domain, then the condition x e M(R) \ ZR(R) is equivalent to the condition 0 ^ x e M(R)]. (T24.4) If R is regular then its localization Rp at any prime ideal P in it is regular. Theorems (T19) and (T22.3) tell us that, the localization of B^,k at any U € (A^)MCT is an iV-dimensional regular local domain, i.e., equivalently, every R S (AjV)M<5 -1S a n jV-dimensional regular local domain. Geometrically this says that every point of (Aj^)^ is simple, and so is every point of (A^)M<S. By (T24.4) it follows that the localization of Bjv.fe at any U € (A^)CT is a regular local domain, i.e., equivalently, every R £ (A^)s is a regular local domain. EXAMPLE (Xll). Consider a hypersurface S : F = 0 in A ^ with F = F(Xi,... ,XN) £ Bjv,fc\fc and let a = ( a i , . . . , a ^ ) be a point of S, i.e., o n , . . . ,ajv are elements in k with F{a\,..., OJ AT) — 0. Expanding F around a we get FiX,,. ..,XN)

= J2 Gn-iN (*i - "I)* 1 •••{XN-

aN)iN

with Gi1...iN € /c. Let i' be the smallest value of i\ -\ HTV for which Gix,.,iN ^ 0. In L3§3 we have put ord Q F = u, called ^ the multiplicity mult Q S' of S at a, called a a i/-fold point of S, and said that a is a simple or singular point of S according as v = 1 or v > 1. In terms of (formal) partial derivatives we have dF v > 1 (i.e., a is a singular point of 5) <=> — -

= 0 for 1 < i < N.

In the present notation ovdR(ayF = v, and by (T24) the local ring is regular iff v — 1, i.e., iff a is a simple point of S.

R(a)*/(FR(a)*)

§9: MODELS As described in L3§4, we can construct the projective A^-space P ^ over a field k by patching up N +1 copies of the afflne A'-space A ^ over k. Similarly, by patching up several modelic specs we can construct a model. More precisely, we introduce the following definitions. Let A be a subring of a field K. Referring to L2§3 for the definition of a valuation v of K and the definition of the valuation ring Rv of v, recall that, according to L3§7 and L3§11, Rv is a quasilocal domain with quotient field K, and by a valuation ring we mean the valuation ring of some valuation of its quotient field. By a valuation of K/A (K over ^4) we mean a valuation v of K such that A C Rv. By a valuation ring of K we mean a valuation ring with quotient field K, and by a valuation ring of K/A we mean a valuation ring of K which contains A. We put JK(A') = the set of all valuation rings of K

§9: MODELS

155

and %\(K/A) = the set of all valuation rings of K/A and we call these the Riemann-Zariski space of K and the Riemann-Zariski space of K/A respectively. We also put 9\'(K) — the set of all quasilocal domains with quotient field K and OK'(K/A) = the set of all members of D\'(K) which contain A and we call these the quasitotal Riemann-Zariski space of K and the quasitotal Riemann-Zariski space of K/A respectively. Finally we put 9\"(K) = the set of all quasilocal domains which are subrings of K and m"(K/A)

= the set of all members of
and we call these the total Riemann-Zariski space of K and the total RiemannZariski space of K/A respectively. A quasilocal ring R is dominated by a quasilocal ring S means R is a subring of S and M(R) c M(S) or equivalently M(R) = RC\ M(S); alternatively we may say that S dominates R; we may indicate this by writing R <S

or

S > R.

This converts Vt(K),iR(K/A),W(K),W(K/A),W'(K),W'(K/A) into posets (= partially ordered sets). A valuation v dominates a quasilocal ring R means Rv dominates R. We shall soon prove the following Theorems (T25) to (T28) about valuations; [cf. §12(R7)]. VALUATION MAXIMALITY THEOREM (T25). V\(K) (resp: V\(K/A)) is the set of all maximal members of W(K) (resp: W(K/A)), i.e., R in W(K) (resp:
156

LECTURE L4: VARIETIES AND MODELS

VALUATION CHARACTERIZATION THEOREM (T28). A domain R is a valuation ring iff for all x, y in R we have either x £ yR or y £ xR. Equivalently, a domain R with quotient field K is a valuation ring of K iff for every x £ Kx we have either x £ R or 1/x £ R. To proceed with the definition of models, by a premodel of K (resp: K/A) we mean a nonempty subset E of W(K) (resp: ^ ' ( i f / A ) ) . The premodel E is irredundant means any member of 91{K) (resp: Di(K/A)) dominates at most one member of E. Note that a quasilocal domain S can dominate at most one member R of an irredundant premodel E, and if R exists then we call it the center of S on E. To see the uniqueness of R, let S dominate another member R' of E; identifying K with a subfield of the quotient field L of S, by (T26) we can find a valuation ring W of L dominating 5 and then by (T27) W n K is a valuation ring of K which dominates R as well as R' and hence R = R'. By a semimodel (resp: model) of K/A we mean an irredundant premodel E of K/A which can be expressed as a union E = UI E). A set E' of quasilocal rings dominates a set E of quasilocal rings (or E is dominated by E') means every member of E' dominates E\ we may indicate this by writing E < E' (or E' > E). A set E' of quasilocal rings properly dominates a set E of quasilocal rings (or E is properly dominated by E') means E' dominates E and every member of E is dominated by some member of E'. A semimodel or model of K/A is complete means it is dominated by 9t(K/A). Note that if K is the quotient field of A then V(A) is a complete model of K/A. Also note that if E is a semimodel (resp: complete semimodel) of K/A then E dominates (resp: properly dominates) 5J(A). A model E (of K/A) is said to be normal (resp: noetherian) if every R £ E is normal (resp: noetherian). A model E is said to be nonsingular if every R £ E is a regular local ring. Note that by (T24), every nonsingular model is a normal noetherian model. By the dimension dim(S) of a model E we mean max{dim(i?) : R £ E}; note that then

dim(E) <ENU{oo}.

§£U: MODELIC PRO J AND MODELIC PROJECTIVE SPACE

157

§9.1: MODELIC PROJ A N D MODELIC P R O J E C T I V E SPACE As we have seen in §8.1, for the iV-variable polynomial ring B;v,fc over a field k, the modelic spec ^J(Bj^tk) is bijective with spec(B/ftk) (i.e., there is a bijection between the two), and hence if k is algebraically closed then its portion consisting of AT-dimensional members is bijective with the affine space Aj^. We proceed to show that similarly a certain portion of a certain model is bijective with the projective space Pj£\ So for any family (xi)i^A of elements in K, with Xj ^ 0 for some j G A, we put W(A; (xt)leA)

=

(J

V(A[(xi/xj)i^])

jgA with xj#0

and we call this the modelic proj of (xi)ieA over A. Here A[(xi/xj)ieA] denotes the smallest subring of K which contains A and which contains XI/XJ for all / G A. If A is a finite set, say A = { 1 , . . . , n}, then we may write %D(A; x\,..., xn) instead of W(A; (XI)IZA) and call it the modelic proj of (xi,... ,xn) over A. Clearly for all i,i' in A with Xi ^ 0 ^ xv we have QF(A[(xi/xi)ieA\) = Q,F(A[(xi/xi')iGA\) and, upon letting K' be this common quotient field, W(A; (xi)i£A) is obviously a premodel of K'/A. Moreover, for any i ' e A with Xi< ^ 0, upon letting yi = xi/x^, we see that (j//);eA is a family of elements in K' such that yj ^ 0 for some j G A and W(A; {xi)iGA) = W(A; {yi)ieh)- According to Theorem (T29), which is stated below and which we shall soon prove, W(A; {XI)I&A) is actually a semimodel of K'/A and if A is finite then it is in fact a complete model of K'/A; [cf. §12(R8)]. By a projective model of K/A we mean a premodel E of K/A such that E — W{A; xi,..., xn) for some finite number of elements x\,..., xn in an overfield of K with Xj ^ 0 for some j G { 1 , . . . , n } ; by what we have just said, E is then indeed a complete model of K/A and we have E = W(A; y\,..., yn) for a finite number of elements yi,. • • ,yn in K at least one of which is nonzero. MODELIC PROJ THEOREM (T29). Given any family (xi)ieA of elements in K, with Xj ^ 0 for some j G A, let E = W(A; (xi)ieA) and let K' = Q,F(A[(x[/xi)[e\]) where i G A with Zj ^ 0; as noted above the QF is independent of the choice of i. Also let yi = xi/x^ for any fixed i' G A with x? ^ 0. Then we have the following. (T29.1) E is a premodel of K'/A and E = W{A; (xi/x)i5A) for all 0 ^ x G K. In particular {yi)i&A is a family of elements in K' such that yj ± 0 for some j G A, and we have E — W(A; {yi)i£A). (T29.2) Given any R G E and any subring S of K such that 5 is a quasilocal ring dominating R, there exists j G A with Xj ^ 0 such that xi/xj G S for all / G A. Moreover, for any such j we have R = BQ where B = A[(XI/XJ)I£A] and Q = BnM{S).

LECTURE L4: VARIETIES AND MODELS

158

(T29.3) E is a semimodel of K'/A, and if A is finite then E is in fact a complete model of K'/A. To justify the terms modelic proj and projective model, we proceed to show that there is a natural injection of the projective space P ^ into a projective model. Referring to L3§4 for details, recall that

Pf = (fc W + 1 \{(0,...,0)})/~ where for u = (u\,... ,UJV+I) a n d u ' = (u[,... ,u'N+1) mkN+1\{(0,.. .,0)} we have: u ~ v! o ( u ' j , . . . , Mjv+i) = (CMI, • • •, CUJV+I) for some 0 ^ c € k. Thus Pj^ is the set of all equivalence classes under this equivalence relation. Let Hi be the hyperplane in Pj^ consisting of all equivalence classes /3 £ Pj^ of those u e kN+1 for which Ui = 0, and let A j ^ be the copy of Aj^ consisting of those v = (vi,..., VN+I) € kN+1 for which Vi = 1. Now f3 H-> (ui/m,..., WJV+I/MJ) gives a bijection 0* : P ^ \Hi -* Aj^, and identifying Pj^ \ i?j with Aj^ via this bijection we have

i f = u£L|1(if \H i ) = u£t1A&Now Bjv+i,fc =fc[-X^i,• • •, -X^AT+I] and we take its quotient field k(X\,..., .Xyy+i) to be .ftf; also we take k to be A. For 1 < z < N + 1, upon letting Yu = Xi/Xi (with 5^j = 1), we get a copy -Bjv,fc,i = fe[Yi,,..., Y}v+i,i] of 5jv,fc whose modelic spec V3(BN (A^)" 5 be the natural injection whose image is (Aj^)'*5. For all i , j in {1,...,JV + 1} we clearly have QF(.Bjv,fc,i) = QF(BNtk,j) a n d we let K' stand for this common subfield of K. Then 2U(/c; Xi,..., XAT+I) is a projective model of K'/k. We denote 2B(fc; X\,..., Xj\r+i) by ( P ^ ) 5 and call it the modelic projective JV-space over k. By the definition of 2B we get

(Pf)*=

U

(A&)'.

l
We put

(Pf)^=

|J l
(A^)' 5

and ( P ^ =

|J

(A»tirS

l
and we respectively call these the rational and minimal modelic projective iV-spaces over k. By §8.2 and §8.3 it follows that (P^)* is a nonsingular iV-dimensional complete model, and for any R e (P^) 5 , after identifying A; with a subfield of R/M(R), we have that: R € (P^) p<5 <£> R/M{R) = k, and we have that: R € (^Nys ^ dim(il) = N & R is a minimal member of (PN)6 & R/M{R) is algebraic over k. We claim that there exists a unique injection 9 : P ^ —> (P^) 15 , such that for 1 < i < N + 1 we have 6(a) = 0j(a) for all a £ A ^ ; note that then the image of 9

%9.2: MODELIC BLOWUP

must be (F^)pS.

159

After proving this claim we will have

p^ «(v[*y6 c (p%rs c (P£V extending the enlargements of §8.2 from the affine case to the projective case. The uniqueness in the claim follows from the existence. To prove the existence, let (3 be the equivalence class of u — (u\,..., UJV+I) € kN+1 \ { ( 0 , . . . , 0 ) } . For any i £ { 1 , . . .,N+l} withuj ^ 0, let R = 0i(v) where v = (UI/UJ, . . . ,wjv+i/ui) € A ^ , and define 9{(3) = R. To show that 9 is well-defined, for any i' G { 1 , . . . , TV + 1} with Ui> ^ 0, let R' = 9i'(v') where v' = (u\/ui', . . . , ujv+i/ u i') G ^fcV- What we must prove is that R = R'. Since R and R' both belong to the irredundant premodel (P^)' 5 , it suffices to find a local domain 5 which dominates both R and R'; take 5 to be the localization of BN+\^ at the maximal ideal generated by X\ — u i , . . . ,XM+I - WJV+I- This defines 9 as a map from Pj^ to (P^)' 5 . To see that it is injective, let (3* be the equivalence class of u* — {u\,... ,u*N+l) € kN+1 \ { ( 0 , . . . , 0 ) } , such that 9{(3) = 9(f3*). We have to show that (3 = (3*. This is obvious if u* ^ 0, because 6i : Aj^ —> (Aj^)"5 is injective. Moreover, ut ^ 0 with 0(0) = 9i{v) = R =$• Xj/Xi e R for all j but Xj/Xi e M(fl) for exactly those j for which itj = 0; therefore, since 8(/3*) = 9(/3) = R, we must have u* ^ 0. §9.2: M O D E L I C B L O W U P The following Theorem (T30), which we shall soon prove, says that the modelic blowup, which we shall now introduce, is nothing but the coordinate free incarnation of the modelic proj; [cf. §12(R9)]. By and by we shall see that it is also the main key to the desingularization of curves and surfaces in particular and varieties in general. For any nonzero A-submodule P of K we put

W(A,P) = | J WiAlPx-1}) O^xEP

and we call this the modelic blowup of A at P. Here ^ P r r - 1 ] denotes the smallest subring of K which contains A and which contains y/x for all y £ P. MODELIC BLOWUP THEOREM (T30). Recalling that A is a subring of a field K, and P is a nonzero j4-submodule of K, we have the following. (T30.1) For any 0 ^ x 6 P we have (A[Pa;- 1 j)P = (A[Px~l])x and hence RP — Rx for every R e 5J(A[Pa; _1 ]). In particular, if P is a nonzero ideal in A then for every R e V(A[Px~1}) we have that PR is a nonzero principal ideal in R. (T30.2) Given any family (x{)i£A of generators of P, for all 0 ^ x € P we have AlPx'1} = A[(xi/x)leA], and we have W(A\ (xi)leA) C W(A,P). (T30.3) For any family (xi)ieA of generators of P we have W(A,P)

=

W(A;(xl)l&A).

160

LECTURE L4: VARIETIES AND MODELS

(T30.4) For all x ^ 0 ^ y in P we have QF(A[Pa;- 1 ]) = Q F ^ P y " 1 ] ) and letting K' denote this common QF we have that W(A, P) is a semimodel of K'/A, and if P is a finitely generated ^-module then 30(A, P) is a projective model of K'/A. In particular, if P is a finitely generated ideal in A then 2B(A, P) is a projective model of QF(A)/A. UP = Ax for some 0 ^ x E K then 2U(A,P) = 3D(A;x) = 33(A). (T30.5) If P is an ideal in A then {R £ 33(A) :PR = R} = {Re W(A, P):PR

= R}.

(T30.6) If A is quasilocal and P is an ideal in A then: P is a principal ideal in A <=> 2»(A, P ) = 33(A) « 4 e 2U(A, P ) .

§9.3: B L O W U P OF SINGULARITIES We shall now make some EXAMPLES to indicate how modelic blowup helps out in desingularizing plane curves and surfaces in 3-space. EXAMPLE (XI2). [Nodal a n d Cuspidal Cubics]. Consider the nodal cubic / = 0 with / = Y2 - X2 - X3 and the cuspidal cubic g = 0 with g = Y2 - X3 discussed in L3§5. For A take the bivariate polynomial ring k[X, Y] over a field k. For P take the ideal of the origin in the (X, y)-plane, i.e., the maximal ideal in A generated by (X,Y). Now W(A,P) = 33(A') U 33(A") where A' and A" are the bivariate polynomial rings k[X', Y'\ and k[X", Y"\ where (X' = X, Y' = Y/X) and (X" = X/Y,Y" = Y) respectively. Substituting the first set of equations in / we get / = X'2(Y'2-l-X') and hence, in the (X',y')-plane, the "total transform" of the nodal cubic consists of the "exceptional line" X' = 0 together with the "proper transform" Y'2 — 1 — X' = 0 which is a parabola meeting the exceptional line in two distinct points. Likewise, substituting the first set of equations in g we get g = X'2(Y'2 - X') and hence, in the (X', Y')-P l a n e . the "total transform" of the cuspidal cubic consists of the "exceptional line" X' = 0 together with the "proper transform" Y'2 — X' = 0 which is a parabola meeting the exceptional line at a single point where it is tangent. At any rate, in both the single blowup has resolved the singularity. You may reconfirm this by taking a similar look in the (X",y")-plane. We shall talk more about exceptional lines and proper transforms later. In the meantime you may like to glance at the relevant pictures on pages 2, 35, 131-134, and 154-155 of my Engineering Book [A04]. EXAMPLE (X13). [Higher Cusps]. Consider the plane curve ga = 0 with gs = Y2 — X2s+l where s > 1 is an integer; at the origin this has a double point which is a higher cusp, say a cusp of height s (pictured on pages 155-156 of [A04]). With notation as in the above Example (X12), substituting the first set of equations in gs we get g3 = X'2(Y'2 - x /2 <»- 1 > +1 ) and hence, in the (X', y')-pfcuie, the total

§m- EXAMPLES AND EXERCISES

161

transform of our curve consists of the exceptional line X' = 0 together with the proper transform Y'2 — ^ / 2 ( s _ 1 ) + 1 = 0 having a cusp of height s — 1. Repeating the process we get a cusp of height s — 2, a cusp of height s — 3, ..., a cusp of height 1 (like Y2 — X3 = 0), and finally a simple point. In Max Noether's picturesque terminology, we say that the plane curve gs = 0, in addition to having a double point at the origin, has one double point in its first neighborhood, one double in the second neighborhood, ..., and one double point in the (s — l)-th neighborhood, thus making a total of s — 1 double points infinitely near to the origin. So (including the origin) we have a totality of s double points. By and by we shall give precision to the idea of infinitely near points. In the meantime you may enjoy reading about them, from a somewhat heuristic viewpoint, in Lecture 19 on Infinitely Near Singularities (pages 145-158) of the Engineering Book [A04]. EXAMPLE (X14). [Circular Cones and Fermat Cones]. Consider the horizontal circular cone X2 — Y2 — Z2 = 0 (description on pages 197-198 of [A04]), or more generally the Fermat Cone / = 0 with / = Xn±Yn±Zn, where n > 1 is an integer which is nondivisible by the characteristic (discussed on page 202 of [A04]). For A take the trivariate polynomial ring k[X, Y, Z] over a field k. For P take the ideal of the origin in the (X, Y, Z)-space, i.e., the maximal ideal in A generated by (X, Y, Z). Now W(A, P) = *0(A') U 9J(J4") U <0(A'") where A', A", and A'" are the trivariate polynomial rings k[X',Y',Z'], k[X",Y",Z"}, and k[X'",Y'",Z'"] where {X' = X,Y' = Y/X,Z' = Z/X), (X" = X/Y,Y" = Y,Z" = Z/Y), and (X'" = X/Z, Y'" = Y/Z, Z'" = Z) respectively. Substituting the first set of equations in / we get / = X'n(l ± Y'n ± Z'n) and hence, in the (X',y',Z')-space, the total transform of the Fermat Cone consists of the exceptional plane X' = 0 together with the proper transform 1 ± Y'n ± Z'n = 0 which is a nonsingular "cylinder." §10: EXAMPLES A N D EXERCISES EXERCISE (El). [Algebraic Closure]. In L2(R6), while establishing the existence of an algebraic closure K of any given field K, we found an overfield K of K such that: (b) K is algebraic over K and contains a splitting field of every nonconstant monic polynomial F over K. The implication (b) => (bb), where (bb) says that K is algebraically closed, follows from the fact that an algebraic extension of an algebraic extension is an algebraic extension, which was proved in L1(R7)(C3). To have another transparent proof of the implication (b) =3- (bb), show that: given any algebraic field extension K/K and given any nonconstant monic G in i^[y], there exists a nonconstant monic F in K\Y] such that G divides F in /f[V]. EXERCISE (E2). [Integral Closure]. Recall that: an element t in an overring S of a ring R is said to be integral over R if it satisfies a monic polynomial equation

LECTURE U: VARIETIES AND MODELS

162

over R, i.e., if F(t) = 0 for some monic polynomial F(Y)=Yn

(»)

+ A1Yn-1

+ --- + An

of some degree n > 0 with coefficients A\,..., An in R; this is indicated by saying that t/R is integral; a subset T of S is integral over R means every t G T is integral over i?; this is indicated by saying that T/R is integral; finally, by the integral closure of R in S we mean the set of all elements in S which are integral over R. In L3§7 we asserted without proof that (Jl)

the integral closure of a ring R in an overring S is a subring of S, and R is a subring of the said integral closure

and (J2)

if an overring S of a ring R is integral over R, and an overring T of S is integral over S, then T is integral over R.

Note that assertions (Jl) and (J2) are the integral extension analogues of the algebraic extension assertions L1(R7)(C5) and Ll(R7)(C3) out of which the last was cited in (El) above. Also note that to prove (Jl) it suffices to show that iiti,...,tm (JO)

are any finite number of elements in a ring S

^ which are integral over a subring R of S then the ring R[ti,...,

t m ] is a finitely generated .R-module and is integral over R.

As an exercise, prove (JO) and (J2) in the following five STEPS (E2.1) to (E2.5), where S is an overring of a ring R. STEP (E2.1). For t G S let V be an fl[t]-submodule of S with ann f l [ t ] 7 = 0 such that V is finitely generated as an i?-module, and let J be an ideal in R such that tV C JV. Then F(t) = 0 for some F(Y) with Ai £ J f for 1 < i < n. HINT. Let Xu...,Xn be ^-generators of V with n e N+. Then for 1 < i < n we have tXi = Y^\
§i0: EXAMPLES AND EXERCISES

163

STEP (E2.2). For any t e S the following four conditions are equivalent. (1) t/R is integral. (2) The ring R[t] is finitely generated as an i?-module. (3) R[t] C T for some subring T of S such that T is finitely generated as an .R-module. (4) There exists an i?[t]-submodule V of S with a,nnR[t]V = 0 such that V is finitely generated as an .R-module HINT. (1) =» (2): from F(t) = 0, by induction we get tn+i € R+Rt+- • • + Rtn~1 for all i G N, and hence ( l , t , . . . , t n _ 1 ) are fl-generators of R[t\. (2) => (3): take T = R[t]. (3) =*> (4): take V = T and note that then 1 e V and hence for any r in ann fi(t] V we have r = r l = 0. (4) => (1): take J = Rin STEP (E2.1). STEP (E2.3). If S is a finitely generated ii-module, and an overring T of S is a finitely generated S-module, then T is a finitely generated il-module. HINT. If (2Jj)i<j (1) of (E2.2) it suffices to show that if elements * i , . . . , t m of S are integral over R then R[ti,..., tm] is a finitely generated .R-module. Make induction on m. For m — l this follows from (1) =$> (2) of (E2.2). Also m - 1 =>• m follows from the m = 1 case and (E2.3). STEP (E2.5). (J2). HINT. Assume S/R and r / 5 are integral. Then for any t € T we have F(f) = 0 for some F{Y) as in (|t) above with Ai,..., A„ in 5. Now by (E2.2) and (E2.4) we see that R[A\,..., An] is a finitely generated .R-module and R[t, A\,..., An] is a finitely generated R\A\,..., A n ]-module. Therefore R[t, A\,..., An) is a finitely generated .R-module by (E2.3), and hence t/R is integral by (E2.2). EXERCISE (E3). [Conditions of Integral Dependence]. (i) Note that in (E2.1) as well as (E2.2)(4), if either V ^ 0 and S is a domain, or if 1 e V (for instance if V is an overring of R), then the condition ann fl [ t ]V = 0 is automatically satisfied. (ii) Show that if R is noetherian then (2) of (E2.2) is equivalent to saying that: (2') R[t] is contained in a finitely generated i?-submodule of S. (iii) By an example show that if R is nonnoetherian then (2') 7^ (2).

164

LECTURE H: VARIETIES AND MODELS

EXAMPLE (X15). [Nonarchimedean Valuations]. Let v be a valuation of a field K such that the value group Gv of v is nonarchimedian, let a, (3 be elements in Gv such that a > n/3 > 0 for all n £ N+, let x, y be elements in the valuation ring R = Rv of v with v(x) = a and v(y) = /?; see L2§7(D2), L2§3, L3§7. Let t = l/y and V = Ru with u = 1/x. Then t/R is nonintegral but R[t] is contained in the finitely generated R-module V. EXERCISE (E4). [Cramer's Rule for Linear Equations]. Establish the Rule cited in (E2.1) above, i.e., given a finite number of simultaneous linear equations ^

AijXj = Yi

for

1< i < n

l<j
where the entries of the nxn matrix A = (Aij) as well as the elements Xj, Yj belong to a ring R, show that (E4.1) where the nxn

det(A)Xj=det(B^')) matrix B^

for

l<j
= {B\l ) is given by BU)=Uu

il

iil^j

\H=j

\Yi

i.e., B^ is obtained by replacing the j - t h column of A by the column vector (Y\,... ,Yn). Hence in particular, if det(^4) is a nonzerodivisor in R then, in the total quotient ring QR(i?), we have (E4.2)

Xj = det(B^)/det(A)

for

1 < j < n.

HINT. Let A^ be the matrix obtained by multiplying the j - t h column of A by Xj. Then B^ is obtained by adding to the j-th column of A^ linear combinations of the remaining columns. Hence by L3(E1) we get det(A ( j ) ) = det(A)Xj

and

det(B^)

= det(^>).

EXERCISE (E5). [Solutions of Homogeneous Linear Equations]. Consider the system of homogeneous linear equations Y^

aijXj = 0

for

1< i < M

l<j
where a = (a^) is an M x N matrix over a field k with M, N in N+. Let S be the solution space of the system, i.e., the set of all X = (X\,... ,XJV) S Aj^ satisfying the system; clearly S is a fc-linear subspace of A ^ . Let n be the rank of a, and

§10: EXAMPLES AND EXERCISES

165

note that then 0 < n < min(M, N). Show that if M = N then: 5 = { ( 0 , . . . , 0)} <^ n — N, i.e., <*=> det(a) ^ 0. More generally, without any assumption, show that dimfcS = N — n by finding a fc-linear injection cj) : A ^ ~ " —> Aj^ with im() = 5. Moreover, do this by establishing the explicit formulas which are described below. By relabelling ay we may assume that det(A) ^ 0 where A = (Aij) is the n x n matrix with A^ = a^ for 1 < i < n and 1 < j < n, i.e., A is the left upper corner submatrix of a. Let Y

i = -

Y,

aixXx

f r

°

1 i M

<< -

n<X
For 1 < j < n let B^ be the nxn matrix obtained by replacing the j - t h column of A by the column vector (Yi,.. .,Yn). Now for every Z = (Z\,..., ZN-U) 6 A j ^ _ n we put <j>(Z) = x = (xi..., XJV) € Aj^ where

Xj

_ fdet(B^)/det(A)

iil<j
~~ [Zj-n

iin<j
HINT. By the above displayed definition of Xj, and by looking at the expansion of d e t ( I ? ^ ) by its j-th column, we see that cp is an injective fc-linear map. So it only remains to show that its image is S. Now the given homogeneous system is clearly equivalent to the nonhomogeneous system y ^ AijXj =Yi

for

1< i < n

l<j
together with the extra equations ^21<j
Yi-

^2

aydet(B^)/det(A) = 0.

l<j
So given any such i, let C be the (n + l)x(n + l) matrix obtained by bordering A by the last column ( Y i , . . . , Yn,Y) and the last row (an,..., ain, Yi), and for n
^ l<j
and this completes the proof.

a«det(B W ) )

166

LECTURE H: VARIETIES AND MODELS

EXERCISE (E6). [Conditions for a Common Factor]. Let n and m be positive integers, and let / and g be polynomials of degrees < n and < m in Y with coefficients in a field k, respectively. Show that e

f + 9^ — 0 f° r some e and h in k[Y]

with degye < m — 1 and degyh < n — 1 < such that at least one of e and h is nonzero <*=> either degy/ < n and degyg < m or / and g have a common root in some overfield of k. HINT. k[Y] is a UFD. Moreover, if f,g have a nonconstant common irreducible factor then in its splitting field we get a common root for them. EXERCISE (E7). [Sylvester Resultant]. For polynomials / = f{Y) = a0Yn + a i F " - 1 + • • • + an g = g(Y) = bQYm + hY™'1

+--- + bm

with coefficients in a domain R, where n, m are nonnegative integers, establish the Basic Fact (Tl), i.e., show that: Resy(/, g) = 0 <=> n+m ^ 0 and either oo = 0 = &o or / and g have a common root in some overfield of R. HINT. The case when mn = 0 being straightforward, assume mn ^ 0. Let A = (Aij) be the (m + n) x (m + n) matrix over k = QF(i?) whose transpose is Resmaty(/, g). In view of (E6) it suffices to show that det(j4) = 0 iff ef + gh = 0 for some e and h in k\Y] with degye < m — 1 and degyh < n — 1 such that at least one of e and h is nonzero. Now e = e (Y) = XxYm~x

+ X2Ym~2

n

h = h(Y) = Xm+\Y ~

+ • • • + Xm

+ Xm+2Y

+ • • • + Xm+n

where elements X i , . . . , Xm+n, at least one of which is nonzero, are to be found in k, such that ef + gh = 0. Upon letting

ef + gh=

Yl

Ai"" +n-i

l
we see that for 1 < i < m + n we have l<j<m+n

Hence by (E5), det(A) = 0 iff there exists ( X x , . . . , Xm+n) € A ^ + n \ {(0,...,0)} for which D\ = ••• = Dm+n = 0. This completes the proof.

§m- EXAMPLES AND EXERCISES

167

EXERCISE (E8). [Elementary Properties of Determinants]. Prove the following properties of det(A) where A = (Ay) is an n x n matrix over a ring R which were used in (E4) to (E7) above. [(E8.1) to (E8.3) are vacuous for n = 0]. (E8.1) If a row (or a column) of A is multiplied by an element of R then det(j4) gets multiplied by that element. (E8.2) If to a row (or column) of A we add an i?-linear combination of the remaining rows (or columns) of A then det(A) is unchanged. (E8.3) If the rows (or columns) of A are permuted according to a permutation a then det(^4) gets multiplied by sgn(cr). In particular if we interchange two rows (or columns) then det(A) gets multiplied by —1. (E8.4) If A is replaced by its transpose then det(A) is unchanged. EXERCISE (E9). [Blowup of a Point in the Plane and Three Space]. In Examples (X12) and (X13) of §9.3 check what happens in QJ(A"), and in Example (X14) of §9.3 check what happens in 5J(A") and V{A'"). EXAMPLE (X16). [Blowup of a Line in Three Space]. In the Examples cited above we blew up a point and studied the effect on singularities of curves and surfaces. Now let us blowup the line L:X

= Y = 0

in the (X, Y, Z)-space over a field k, and study what effect it has on a surface singularity at the origin Q : X = Y = Z = 0. So consider the surface S:f

=

Y*-X2Zn~2=0

of degree n > 5 with double line L through the triple point Q, and the nonconical cubic surface T:g = Y2-

X2Z = 0

with double line L through the double point Q, discussed on pages 204-205 of my Engineering Book [A04]. Let P be the ideal generated by (X, Y) in A = k[X, Y, Z\. Then W{A,P)=V3(A')UV3{A") where A' and A" are the trivariate polynomial rings k\X', Y', Z] and k[X", Y", Z] where (X',Y') = (X,Y/X) and (X",Y") = (X/Y,Y) respectively. Substituting the first set of equations in / we get / = X'2(X'Y'3

-

Zn~2)

and hence, in the (X', Y', Z)-space, the total transform of S consists of the exceptional plane X' = 0 together with the proper transform X'Y'3 — Zn~1 = 0 which

168

LECTURE L4: VARIETIES AND MODELS

has a quadruple point at the origin. Likewise, substituting the first set of equations in g we get g = X'2(Y'2

- Z)

and hence, in the (X', Y', Z)-space, the total transform of T consists of the exceptional plane X' = 0 together with the proper transform Y'2 — Z = 0 which is a nonsingular parabolic cylinder. The reason why the singularity of T was resolved but the singularity of S was worsened (a triple point becoming a quadruple point) is that for T we were in the equimultiple case because the multiplicity of Q = 2 = the multiplicity of L, whereas for S we were in the nonequimultiple case because the multiplicity of Q = 3 > 2 = the multiplicity of L. EXAMPLE (X17). [Tight Associators and Monomial Ideals]. In §5(017) we showed that for a submodule U of a module V over a noetherian ring R, the associator assji(V/U) coincides with the tight associator tassR(V/U), i.e., if P is a prime ideal in R such that (U : a)# is P-primary for some a G V then (U : b)R = P for some b £ V. Here is an example that in case R is nonnoetherian, this need not be so even when V is R and U is an ideal Q in R. So let R be the polynomial ring k[Xi,X2,X3,...] in an infinite number of indeterminates (Xi)i^n+ over a field k, and let P and Q be the ideals in R generated by Xi, X2, X3,..., and X?, X$,X$... respectively, i.e., by (Xi)ieN+ and (Xf)ieN+ respectively. In a moment we shall show that P belongs to assR(R/Q) but not to tassn(R/Q). Actually we shall do this in a somewhat more general set-up. We start by proving some claims about monomial ideals. So let (Xj), e / be any family of indeterminates over the field k and consider [cf. L6§6(D10)] the polynomial ring Rj = k[(Xi)i£i]. Let W be the set of all maps w : I —+ N whose support supp(w) = {i G I: w(i) ^ 0} is finite, and for any such w write Xw = Yli€l X™ '. Note that then the set of all monomials M = {Xw : w € W} is a (free) vector space basis of Ri over k, and for any / G Rj, after writing / = Ylwew awXw with aw G k we have that its support supp(/) = {Xw : w € W with aw ^ 0} is finite. For any u and t in W we write u > t to mean that u(i) > t(i) for all i £ I, and we observe that this is so iff Xu = XfXw = Xt+W for some w G W, where we define t + w £ W by putting (t + w)(i) = t(i) + w(i) for all i £ I. For any T C W, let L(T) =

{Xt:teT}

be the corresponding set of monomials, let f = {u G W : u > t for some * G T} be the corresponding "saturated" subset of W, and let N(T) = {/ G P 7 : supp(/) C L(T)}.

§10: EXAMPLES AND EXERCISES

169

Note that then N(T) is the fc-vector-subspace of Rj generated by L(T), and L(T) is a fc-vector-space-basis of N(T). Let us consider a monomial ideal in Rj, i.e., an ideal generated by L(T) for some T C W; let us denote this ideal by J(T), i.e., J(T) =

L(T)Rj.

We claim that for any T C W we have (1)

J(T)=N(f)

and

J(T)nM

= L(f).

To prove this, note that obviously N(T) c J ( T ) . Conversely, any / £ J(T) can be written as / = X^ter' ^ ' / t where T' is a finite subset of T and ft £ Ri- Now we can find a finite subset W of W such that for all i £ T" we have ft = 23™ew" atwXw with atli) £ /c. Now T* = {t + w : (t, w) £ T" x W'} is a finite subset of T and we have / = J2uer* buXu where bu = T,{(t,w)eT<xW:t+w=u}atw G ^ for all u £ T*. Therefore supp(/) c T. This proves the first part of (1), and the second part follows from it. Consider the ideal in Rj generated by {Xi)i£P where I' is any subset of J, i.e., the ideal J(c(I')) where c(I') =

{c(j):j£l'}

and where c(j) is the characteristic function of j £ 7, i.e., c(j) e W is defined by putting

c(j)(i) = ll

\0

[ii=j

iii^j.

For any T C\W consider the condition: (D)

for all j in some infinite subset / ' of I we have t(j) ^ 1 for all t eT.

We claim that then: (2)

(tl) =» (J(T) : J(c(I)))Rl

= J(T).

To prove this, assuming (fl), given any / e f l / \ J(T) we want to find g £ J{c(I)) with fg £ J{T). Now supp(/) is a nonempty finite set and hence we may label its elements as t i , . . . , tn where n £ N + and ti,...,tn are pairwise distinct members of W. Since / 0 J(T), by (1) we know that some U does not belong to T and hence by relabelling we may assume that t\ & T. Since I' is an infinite set, we can find j £ I' such that ti(j) = 0; then (ti +c(j))(j) = 1; since tx £ f, by (fl) it follows that *i + c 0 ) ^ T. Let g = Xj. Then ti + c(j'),..., tn + c(j) are exactly all the distinct elements of supp(/#), and hence by (1) we see that fg & J(T). Also obviously g e J(c(l)). For any T c W consider the conditions: (»')

for every t e T we have t(i) ^ 0 for some i s /

170

LECTURE L4: VARIETIES AND MODELS

and (((")

for every j £ I there exists dj £ T such that: dj(i) =/= 0 iff j = i.

By (1) we see that: (3)

(«) + (f) => J(T) C J(c(/)) ^ J(T).

Now J(c(7)) is the kernel of the fc-epimorphism Ri —> k with Xi H-» 0 for all i £ 7, and hence J(c(I)) is a maximal ideal in /?/ and therefore by §5(08)(3*) we see that:

(4)

m + (n =• lJ{T)

is J c 7

( ( ))-p rimar y with

\vspec f l / J(T) = mvspec*, J(T) = {J(c(/))}. In view of Theorems (T5) and (T6) of §6, by (2) to (4) we conclude that: (5)

(tt) + (if) + (II") => ass fl/ (Ri/J(T)) = {J(c(I))} but tass*,(Ri/J(T)) = 0. As a special case of (5), assuming I to be infinite, upon letting P to be the ideal in Rj generated by (Xj)j e j and Q to be ideal in Ri generated by ( X " ' ) ^ / where n* € N + with n, > 1 for infinitely many i, we get assn,(Ri/Q) = {P} but tassfl 7 (i?//Q) = 0. In particular we have the beginning illustration when I = N+ with P and Q generated by (X,)i € N + and (Xf)i&^+ respectively. EXAMPLE (X18). [Isolated Components of a Module]. As we noted in §5(O20)(14*), given any submodule U of a module V over a ring R with U ^= V, if V is finitely generated then: P £ vspec(annfi(T^/J7)) =4- [[/ : P]v ^ V. We shall now show by an example that this is not true without finite generation. Let V be the bivariate polynomial ring k[X, Y] over a field k. Then U = k[X,XY] is a subring of V, and R = k[X] is a subring of U. So we may regard V as a module over R, and U a submodule. Note that by putting primes on X and Y, the pair U C V may be identified with the pair A C A' of Examples (X12) and (X13) of §9.3. For any / = Y^aijXW £ V with atj £ fc let supp(/) = {(i,j) £ N x N : atj ^ 0}. Clearly U = {/ £ V : i > j for all (t, j ) £ supp(/)}. Any 0 7^ g £ i? can be written as a finite sum g = J2 biX% where bi £ k with 6e ^ 0 for some e £ N; obviously (e,e + 1) belongs to supp(#y e + 1 ) and hence gYe+1 £ U with Ye+l £ V; therefore ann/^V/lO = 0. Thus for the zero ideal P in R we have P £ vspec(ann^(F/t/)). For any / £ V we can find a nonnegative integer d which exceeds the degree of / and then by the above criterion we get Xdf £ U with XdeR\P. Therefore [U:P]v = V. EXAMPLE (X19). [Annihilator of a Primary Module]. As we noted in §5(013), given any submodule U of a module V over a ring R: U is primary => a,niiii(V/U) is primary. Here is an example showing that the converse is not true. Let R be the bivariate polynomial ring k[X, Y] over a field k, and let U and V be the ideals in R generated by (X2,XY) and (X,Y) respectively. Then clearly

%11: PROBLEMS

171

annfi(V7{7) is the ideal in R generated by X which is prime and hence primary. But Y eR\iadR(a,nnR(V/U)) = R\ (XR) and X £ V\ U with YX e U. Therefore U is not primary. §11: PROBLEMS To initiate a new column, occasionally I shall list Problems (PI), (P2), . . . . A Problem is an Exercise which has not been completely worked out. By solving one of these, sometimes the student may get a mild satisfaction, sometimes a Ph.D. thesis, and sometimes fame. If there is some imprecision in the statement of a problem, a part of the exercise is to make it precise. PROBLEM (PI). [Explicit Equations of Integral Dependence]. Redo items ( J l ) , (J2) mentioned in (E2) together with the analogous items (C3), (C5) of L1(R7) cited there, by finding explicit formulas. In other words, let Fi(Y) = Yni + AnY"*-1 + ••• + Aini with Aij in a ring R be the equations of integral dependence satisfied by a finite number of elements U in an overring S of R for 1 < i <m. Find explicit formulas for B\,..., Bni and C\,..., Cn" so that t\-\ \-tm and t\... tm satisfy the equations G{Y) = Yn' + BtY71'-1 + ... + Bn, and

H(Y) = Yn" + dr""- 1 +... + c»« of integral dependence over R. Also let s € S satisfy F*(Y) = Ym+ UY™-1 + --- + tm and find formulas for Di,...,

Dn* so that s satisfies the equation

E(Y) = Yn' + D j F " * - 1 + . . . + £>„. of integral dependence over R. [SATISFACTION]. HINT. Think of the Aij as indeterminates over Z. Let G{Y), H(Y), and E(Y) be the minimal polynomials of t\ H h t m , t\.. .tm, and s over Q((^4ij)i, belong to Z[(Ay)i
172

LECTURE

L4: VARIETIES

AND

MODELS

PROBLEM (P3). [Resolution]. Let K/A be as in the above (P2). Show that there exists a nonsingular projective model of K/A. [FAME]. §12: R E M A R K S REMARK (Rl). [Laplace Development]. To describe the Laplace development of a determinant cited in L3§12(E2), which generalizes the development according to a row or column described in L3§12(E1), let A = (Aij) be an n x n matrix over a ring R, say as depicted in L3§12(D1) with m = n. Let r = ( r i , . . . , rt) be a sequence of positive integers with n + • • • + r^ = n. Recall that Sn is the set of all bijections a of {l,...,n}. Let Sn(r) be the set of all a € Sn such that
det(A)= Yl sgnO"-1) II d 6 * ^ ) TeSn(r)

l
and for any r in Sn(r) we have det(A)=

^
sgn^r-1)

J]

det(4;)).

l
These may respectively be called the Laplace development of det(A) according to the row partition (r\,...,rb) relative to the permutation a, and the Laplace development of det(A) according to the column partition ( r i , . . . , 77,) relative to the permutation (T,S); when a or T is the identity permutation we may drop the reference to it. REMARK (R2). [Block Matrices]. Let A = (Aij) be an m x n matrix over a ring R, say as depicted in L3§12(D1). Let (qi,..., qa) and ( r i , . . . , r&) be positive integers with 51 + • • • + qa = m and n + • •• + rb = n. The definition of a matrix over a ring introduced in L3§1 can be generalized in an obvious manner to define a matrix over (having entries in) any set. In particular we get an a x b matrix A = (Auv) where Auv = ((Auv)ij) is the qu x rv matrix over R given by putting (Auv)ij

= Aqi-i

\-qu~i+i,ri-\

\-rv-i+j

i.e., the (i, j)-th entry of Auv equals the (q\-\ \-qu-i+hi"i-\ \-rv-i+j)-ih entry of A. We call A a block matrix over R, or in greater detail a (qi,..., qa) x ( n , . . . , n)

173

§12: REMARKS

block matrix over R, and in an obvious sense we have the equation A = A which, for making clear that the LHS is an ordinary matrix while the RHS is a block matrix, may be written as -block

A.

Note that the "blocks" Auv are submatrices of A. We define sums and products of block matrices by an obvious generalization of the definitions for the sums and products of matrices over a ring given in L3§1. For instance, if B =uock B where B = (B^) is an m x n matrix over R and B = (Buv) is an (qi,...,qa) x ( l , • • • ,r&) block matrix over R then A + B = ((A + B)uv) is the (qi,... ,qa) x ( r i , . . . ,rb) block matrix over R defined by putting {A + B)uv = Auv + Buv, and we clearly have A + B =biock A + B. Likewise, if B —block B where B = {B^) is an n x o matrix over R and B = (Buv) is an ( r i , . . . , n,) x ( s i , . . . , sc) block matrix over R with si + • • • + sc = o then AB = ((AB)UV) is the {q\,..., qa) x ( s i , . . . , s&) block matrix over R defined by putting (AB)UV = Yli<w
where A' = (A'^), A* = (A*j), A** = (A*j), are m' xn,m*x over R with m' + m* = m, n* + n** = n, and

Ajj

—<

n*, m* x n** matrices

A'i;j

if 1 < i < m' and 1 < j < n

A-i-m'j A*lm,j_n,

if m'
and 1 < j < n* and n* < j
or equations of the type A'

I A"

where A' = (A'^), A" = ( 4 £ ) , A* = (A*^), A** = {A*j), are m' x n', m' x n", m* x n*, m* x n** matrices over R with m' + m* — m, n' + n" = n, n* + n** = n,

174

LECTURE LJ,: VARIETIES AND MODELS

and A1

if 1 < i < m! and 1 < j < n'

A"

if 1 < i < m' and n' < j
A*i—m',j

if ml < i < m and 1 < j < n*

A*. •m'

J—n*

if m! < i <m and n* < j < n.

For the sake of clarity, in cases as above, we may write A =qbiock (—) in place of

A={-). REMARK (R3). [Products of Determinants]. Referring to L3§12(E2), here is the proof of the product formula for determinants using the Laplace development. Given nxn matrices A,B over a ring R, let C = (Cy) be the (2n) x (2n) matrix over R which is given in terms of an (n,n) x (n,n) block matrix as C = (-i B) where 1 and 0 are the nxn identity and zero matrices. For n < j < 2n, we add to the j-th column of C the linear combination Xalock form as D = (^\ A 0 B ) and hence by (E8.2) we get det(C) = det(D). By applying the Laplace row development according to the row partition (n, n) we get det(C) = det(A)det(.B) and det(D) = det(AB). Therefore det(AB) = det(A)det(B). REMARK (R4). [Solutions of Nonhomogeneous Linear Equations]. To generalize (E5), let y]

ciijXj =bi

for

1< i < M

l<j
be a system of nonhomogeneous linear equations where a = ( a y ) is an M x N matrix over a field k with M, N in N, and a = (a%j) is the M x (N + 1) matrix over k obtained by augmenting a by the last column (&i,... , 6 M ) , i-e., for 1 < i < M we have a^ = a^ or bi according as 1 < j < N or j = N + 1. Let S be the solution space of the system, i.e., the set of all X = (Xi,..., XN) € A^ satisfying the system. Note that if S is the solution space of the corresponding homogeneous system

^2

a

ijxi = 0

for

1< « < M

l<j<JV

and if S is nonempty then it is clearly an additive coset of S in Aj^, i.e., for any X in S we have S = {X + X : X £ S}. Let n and n be the ranks of a and a respectively. Then clearly n
175

%12: REMARKS

1 < i < n and 1 < j < n, i.e., A is the left upper corner submatrix of a. Let Yi = bi-

Y^

a

^A

for

l
n<\
For 1 < j < n let B be the nxn matrix obtained by replacing the j-th column of A by the column vector (Y\,..., Yn). Now for every Z = (Z\,..., Zjv-n) £ A^~™ we put (Z) = x = (xi... ,X~N) G A^ where f det(B{i))/det{A)

if 1 < j < n

[Zj-n

iin<j
We CLAIM that S is nonempty iff n = n, and when that is so, <j> is an injective map whose image is S. PROOF OF THE CLAIM. By the above displayed definition of Xj, and by looking at the expansion of det(B ) by its j-th column, we see that 4> is an injective map. So it only remains to show that S is nonempty iff n = n, and when that is so, S is the image of
=Yi

for

1< i < n

l<j
together with the extra equations Y1KJ
Y^

aijdet(Bb))/det{A)=Ofoin
l<j
So given any such i, let C be the (n + 1) x (n + 1) matrix obtained by bordering A by the last column (Yi,..., Yn, Yi) and the last row (an,...,a;„, Yi), and for n < I < N + 1 let C be the (n + 1) x (n + 1) matrix obtained by bordering A by the last column ( a n , . . . , a n i , a u ) and the last row (an,...,ain,au). Expanding by the last column we see that det(C) = det(C ) - J2n
^ \<j
and this completes the proof.

a^de^B•t/k

176

LECTURE L4: VARIETIES AND MODELS

REMARK (R5). [The How and W h y of Cramer and Sylvester]. To solve the simultaneous linear equations

£

AijXj =Yi

for

1< i
l<j
we subtract An times the first equation from An times the i-th equation for 2 < i < n. This "eliminates" Xi and produces n — 1 equations in the n — 1 variables X2, •.., Xn. Iterating this procedure n times, the last step produces a value of Xn. Substituting this in the first n — 1 equations gives us n — 1 equations in the n — 1 variables X\,... ,Xn-i. Iterating this entire procedure produces a value of Xn-i, and so on down the line until we get values of all the variables X\,...,Xn. By condensing the whole hullabaloo in one giant step we end up with Cramer's Rule. This "explains" (E4). Similar "explanations" apply to the systems of linear equations dealt with in (E5) and (R4). Turning to the higher degree simultaneous equations f(Y) = a0Yn + a1Yn-1 + --- + an g(Y)=b0Ym

+ b1Ym-1 +

--.+bm

dealt with in (E7), since there is only one variable Y, how shall we eliminate it by itself? Well, we do it by letting the degree play the role of a variable. To wit, assuming n > m, we multiply the first equation by 60 and the second by aoYn~m, and then subtract the second from the first. This replaces the equations of degrees n, m by equations of degrees n — 1, m thereby decreasing the sum n + m. Iterating this procedure several times we make the sum of the degrees zero, i.e., we eliminate Y and get hold of an expression involving only the coefficients ao, • • •, an> bo, • . . , bm, which, if everything is nice and dandy, ought to produce something resembling the Sylvester Resultant. In (E7) we proved the Basic Fact (Tl) about the Sylvester Resultant. Here is an easier proof of that part of (Tl) which says that if n,m are positive integers and / , g have a common root in some overfield of R then Resy(/, g) = 0. Namely, call that root Y. Multiplying / by Ym~l ,Ym~'2,... ,Y,l and g by yn-i Yn~2,... ,Y, 1 we get m + n homogeneous linear equations in the m + n "variables" Ym+n~1,Ym+n'2,... ,Y, 1. This "solution vector" is a nonzero vector because it ends with 1. Therefore by (E5) the determinant of these equations, which is nothing but Resy(/,g), is zero. Q.E.D. In greater detail, the matrix T = (7V,)i|^™+^ of the said equations is nothing but Resmaty(/, g), and the equations are

V-

T. Ym+n-i

i<^+„ ^

=

{Ym-if{Y)=* \Y"-ig(Y)=0

ifl<»<m ifl
Note that T is the transpose of the matrix A of the HINT to (E7). Letting Y revert to its role of an indeterminate, and without assuming / , g to have a common root,

§12: REMARKS

177

but continuing to assume n,m to be positive integers and disregarding the = 0 (twice) of the above display, the said display says that by adding to the last column of T certain il[Y]-linear combinations of the remaining columns we get the column vector ( y ™ - 7 ( y ) , . . . , Yf(Y), f(Y), Yn~lg{Y),..., Yg{Y), g(Y)). Expanding the determinant of this new matrix by its last column we get an identity of the form f Resy(/,
+ v(Y)g(Y)

with u(Y), v(Y) in R[Y]

[of F-degrees < m - 1 and < n — 1 respectively. By letting Y be a common root of / , g, this gives yet another proof of the above mentioned part of (Tl). Having proved the Basic Fact (Tl), let us briefly comment on the remaining items of §1. Corollary (T2) follows by noting that a nonconstant univariate polynomial over a field has a multiple root iff the polynomial and its derivative have a common root. Observations (01) and (02) follow by noting that the operations of forming the resultant or the discriminant commute with the operation of giving values to the auxiliary variables X\,..., X^. The Isobaric Property (03) and the Product Formula (XI) will be proved in the next Remark (R6), where we shall also prove a product formula for the resultant matrix. Examples (X2) to (X6) are straightforward computations. Bezout Observations (04) and (05) will be dealt with later on. This takes care of the Future Plan Observation (06). REMARK (R6). [Product Formula for the Resultant Matrix]. To formulate the last identity (5.1) of the above Remark (R5) more precisely, and to prove the Product Formula (XI) of §1 together with its generalizations, for nonnegative integers n,m, in addition to the polynomials f(Y)=a0Yn

+ a1Yn-1 + --- + an

g(Y)=b0Ym

+ b1Ym-1 +

---+bm

with coefficients o o , . . . , an, bo,..., bm in a ring R, which is assumed to be nonnull, let us consider polynomials F(Y) = A0Yn + AxYn-1 m

G(Y) = B0Y

whose coefficients Ao,---,An,Bo,...,Bm

+ --- + An 1

+ BtY™-

+ ... + Bm

are indeterminates over R. Let

R = R[AQ, ... ,An,B0,

. •. ,Bm].

Since Resmaty-(/, g) and Resy(/, g) depend not only on / , g but also on n, m, strictly speaking we should write Resmaty (/, g) and Res y l ' m ) (/,g) for them and call them the Y-Resultant Matrix and the Y-Resultant of / , g relative to n, m respectively, with the understanding that the reference to n, m may be dropped when it

178

LECTURE

H:

VARIETIES

AND

MODELS

is clear from the context. As a more precise version of (5.1) we have the identity: ' R e s ^ ' m ) ( F , G ) = U(Y)F(Y) (6.1)

{

U{Y) = E i < , < m ^

m

+ V(Y)G{Y)

with

J

Y.m+i
- ^ and V{Y) =

in R[Y] of y-degrees < m — 1 and < n — 1 respectively, where Ctj S ~R is the (i, j)-th cofactor of Resmat^' m ) (F, G)

In the rest of this Remark, as an abbreviation, we denote Resmaty the (m + n) x (TO + n) matrix T{n>m\f,g)

(/, g) by

(T^m\f,g)ij).

=

and we note that then T(n'm) (/, g)ij € fl and

T^ n ' m ) (F, G)« € P .

We also put P(">m> (/, 5 ) = R e s ^ m ) (/,
= p(".™)(F, G)

and we note that then P{n'm\f,g)€R

and

Q(A0,...

,An,B0,...

,Bm) e R.

Finally, we let „ a

J a,i if i e { 0 , . . . , n} '~\0

if

ieZ\{0,...,n}

and, for any nonnegative integers u,v, we let S^n'u'v^(f) u x u matrix obtained by putting

= (S^n,u'v^(f)ij)

be the

and we call S^n'u'v^(f) the (n, u, u)-spread of / . The resultant matrix can be expressed in terms of the spreads by the block matrix identity:

(

c(n,m,m+n)r

f\\

g(m,n,m+n)

f„\ ) •

As direct consequences of the spread notation we have the following two relations. Relation between Polynomial Addition and Matrix Addition: (1)

if n = m then

S{n'u'v){f

+ g) = S{n'u'v)(f)

+

S{m'u'v)(g).

§12: REMARKS

179

Relation between Polynomial Multiplication and Matrix Multiplication: (2)

m+n u v

if integer q > n + u then

S^

' ' \fg)

=

S^'^ifiS^^ig).

Before using (0) to (2) for obtaining a product rule for T, let us observe some easy properties of P . Since a 0 x 0 determinant is 1, we see that: (3)

if n = 0 = m then

P ( n ' m ) (/, g) = 1.

The determinant of a triangular matrix, i.e., a matrix having zeroes above or under the principal diagonal, equals the product of the diagonal entries. Also the zeroth power of anything (including zero) equals 1. Hence: if n = 0 then

P ( n ' m ) (/, g) = a%

ifm = 0then

P{n'm)(f,g)

(4) whereas (5)

= b%.

By respectively expanding according to the first or the last row and then calculating with triangular determinants, i.e., determinants of triangular matrices, we get: p(n,m)(f

( 6 )

Q) =

\Eo
if m = 1

lEo^j-ir^ar* if n=i.

By (E8.1) we see that: (7)

for all c, d in R we have

p("' m > (c/, dg) = cmdnP{n'm)

(f,g).

Expanding by a row and invoking (E8.2) we see that the determinant of an N x N matrix (with positive integer N) is zero if either a row consists only of zeroes or if two rows are identical, and hence: (8)

if either mn^0

= f-goim^0

= fom^0

= g then p( n - m )(/, g) = 0.

If mn = 0 then by (3) to (5), and if mn ^ 0 then by (E8.3), we get the Permuting Rule for Resultant: P(m'n)(gJ)

(9)

= ( - i r n P ( " ' m ) (/,
Also we clearly have the Universality Property of Resultant: (10)

P{n'm)U,g)

= Q(a0, ...,an,b0,...,

bm).

LECTURE L4: VARIETIES AND MODELS

180

Next we record the following five properties (11) to (15) of the polynomial Q. As notation, coeS(U, V) denotes the coefficient of a monomial U in a polynomial V = V(Xi,... ,Xr,Yi,... ,YS) in indeterminates Xi,...,Xr,Yi,...,Ya over the ring R. Recall that V is homogeneous of degree t in (X%,... ,Xr) means that for every monomial U = X^ .. .XpYJ1 .. .Yj° with coeS(U,V) ^ 0 we have ii + ••• + ir — t. More generally, V is isobaric of weight r in (Xi,.. .,Xr) when we give weight Uj to Xi for 1 < i < r means that for every monomial U = XI1 ...XJrYJ1 ...Yj- with coeff^.V) ^ 0 we have «i»i + ••• + urir = T; clearly this is so iff as a polynomial in an extra indeterminate Z we have V(Z^XU .. .,Z^Xr,Yu. ..,Ya) = ZTV(XU.. .,Xr,Ylt.. .,YS). Diagonal Property of Resultant: (11)

coeffWB£,Q) = l

and

coe«(A™BZ,Q) = ( - l ) m n .

First Homogeneity Property of Resultant: (12)

Q is homogeneous of degree m in ( A o , . . . , A n ).

Second Homogeneity Property of Resultant: (13)

Q is homogeneous of degree n in (Bo,...,

Bm).

Weight Property of Resultant:

{

Q is isobaric of weight mn in ( A o , . . . , A„, Bo,.. •, Bm) when Ai and Bj are given weights i and j for 0 < i < n and 0 < j < m respectively.

Special Property of Resultant: if coeS(U, Q) ^ 0 for a monomial nK,

(,15)

\U = A«... A^Bl0 ...Bk<jL {Ao"B^, A-5 0 "} < then in < m and j m < n and in + j m <max(m, n) and if also m^n

then in + j

m

<max(m, n).

To prove these, by looking at the resultant matrix depicted at the beginning of §1 we easily get (11) to (13), whereas (15) follows from (12) to (14). So it only remains

%12:

181

REMARKS

to establish (14). To do this, first we note that . . . , ZnAn,

(Q(Z°AQ,

ZmBm)

Z°B0,...,

\= det (Tn>m{Z°AQYn

+ ••• + ZnAn,

Z°B0Ym

ZmBm))

+ ••• +

and now for 1 < i < m we multiply the i-th row of this matrix by Zl and for I < j < n we multiply its (m + j)-th row by Zi and then upon denoting the resulting matrix by T we get det(f) = ZSQ(Z°A0,...,

ZnAn,

ZmBm)

Z°B0,...,

where 5 = $ Z 1 < i < m * + Yli<j however, T is the same as the matrix we obtain when for 1 < I < m + n we multiply the Z-th column of Tn'm(F, G) by Zl and hence det(f) = ZCQ(A0, ...,An,B0,...,

Bm)

where e = ^Zi ^ v summing the three arithmetic series we conclude that £

_ g

=

(m+n)Tm+n+l) _ [mCm+1)

+

nfcH^j

=

^

^

h e n c e

b y

^

a b o y e

t w Q

displayed equations for det(T) we get (14). To start towards the product rule, for a polynomial h(Y) = c0Ym+n + c 1 r + n - 1 + • • • + cm+n with Co, ci,...,

cm+n in R, we record the following two sum rules.

First Sum Rule for Resultant: P{n'm+n)(f,h

(16)

P{n'm+n\f,h).

+ fg) =

Second Sum Rule for Resultant: (17)

p(m+n,m)(h

+

jgg^

=

p(m+n,m)^

gy

To prove these, by (0) we have T(m+n,m)(h

*•

„\ - I ^

W\

Vl>y) — I g ( m , m + n , 2 m + n ) / „ \ J

and by (1) and (2) we have {m+n,m)(hn T 1

, f„

\ -rJyi9)—\

^ - (S^m+n^m+n\h)

+ g(m+n,m+n,2m+n)i\

mm

S^

' +n\f)S^m+^m+"){g)\ J

and hence by adding certain linear combinations of the last m+n rows of the matrix T^m+n'm\h,g) to its first m rows we obtain the matrix T^m+n'm\h + fg,g) and

182

LECTURE

L4: VARIETIES

AND

MODELS

from this we conclude that these two matrices have the same determinant which establishes (17); now by (9) and (17) we also get (16). Recalling that u, v are any nonnegative integers, let also w be any nonnegative integer, and consider the following two shift rules. First Shift Rule for Resultant: P(n+W'm){f,g)

(18)

= {-l)wmh%

P{n'm){f,g).

Second Shift Rule for Resultant: (19)

p(n,m+W)ifig)

= ag >p(n."O(/ t 0).

/

,

\

/

ci(ni"'i1Tl+7l + 'w)/<,\

\

The proof of (19) follows by noting that T^m+W\f,g) = { — ^ f ^ 4 b ) where 0UtV denotes the uxv matrix with all entries zero; moreover, by (9) and (19) we get (18). Recall that lu is the u x u identity matrix, and note that for 1 £ R[Y] we have degyl = 0 < w. About these special matrices we have the obvious Relations: ®u,v + J = J + 0u,v for every uxv

matrix J ,

Ou,vJ — Ou,w for every v x w matrix J, . , (20)

{

J0U v = 0W „ for every w x u matrix J , \UJ = J for every uxv

matrix J ,

Jlu = J for every v x u matrix J, S(w,u,u+v+w)^ = (0uw^ i u ) o U i V ) as block matrix. The ENLARGED RESULTANT MATRIX is the (u + m + n + v) x (u + m + n + v) matrix defined as a block matrix by putting 0U ,m+n o„ 0m+n,uT^m\f,g)0m+n,v Vv,u

Vv,m-{-n

*-v

and we note that clearly (say by Laplace development) (20)

det (r^n'm^

(f,gj)=

P ( n ' m ) (/, g)

and (21)

det(r
=ctf.

%12:

183

REMARKS

By analogy with the equation

(

q(n,m+w,m+n+w)

I f\

o(m+w,n,m+n+w)

I „\

we introduce the SKEWSHIFTED RESULTANT MATRIX by putting

(

q(n,m,m+n+w)

( f\ ^

7

V'

a(m,n+w,m+n+w)

l„\

and we note that these two (m + n + w) x (m + n + w) matrices are clearly related to the (m + n) x (m + n) matrix T{-n'm\f,g) by the formulas c(n,w,m+n+w)

( f\

\

( 0 w ,lT.-4lp)) and fOQ/\ [

'

rp[n,w,m}( f

\ _ ( *•

U,9)

~

'

\0w,n\

l / i 9) \ Vm+n,w

\

S(^,m+w){g)J-

As an immediate consequence of (22') we have the Skewshift Rule for Resultant: (19')

det ( T ^ m ] ( / , 2 ) ) =

bZP{n'm\f,9).

Finally we introduce the ENLARGED SKEWSHIFTED RESULTANT MATRIX which is the (u + m + n + w + v) x (u + m + n + w + v) matrix defined as a block matrix by putting ,v

0m+n+w,uT^^(f,g)0m+n+w,v Vv,u

Uv,m+n+w

*-v

and we note that by (19') we have (20')

det (T^u'n'w'm'v\f,

g)) = 6™ p( n - m )(/, g)

and (21')

det M u '°'"' m '"] (1, g)) = bnm.

Now upon letting n',n*,m',m* n = n' + n*

be any nonnegative integers such that and

m = m' + m*

184

LECTURE

L4: VARIETIES

AND

MODELS

and upon considering any polynomials f'(Y)=a'0Yn'

+a'1Yn'-1

+

---+a'n,

g'(Y) = b'0Ym' + fciY™'-1 + • • • +

C

and = a*0Yn' + alY"*'1 + ••• + < .

f*(Y)

g*(Y) = b*0Ym' + b{Ym''x with coefficients a'0,...,a'n,,b'0,...

+ ... + b*m.

,b'm,,a^,...,a*n,,bl,..

/ = /'/*

and

.,b*m. in R such that

g = g'g*

we can state the following two product rules. Product Rule for Resultant matrix: (23)

T{m'n''n'fi\f\l)T{n'm\f,g)

T^'n''m'n'){f',g)T{n''m+n'){f\g).

=

Skew Product Rule for Resultant matrix: (23')

y[0,O,m',m*,n]/^ q*)T^n'm\f

n) = y ( m * > " > ' n ' ' 0 ) ( f q'\ J<[n,m',m"] / t

*\

To prove (23), we have the following equations where the first and the last equations are deduced by items (0) and (Oi) whereas the middle two equations are deduced by block multiplication followed by items (2) and (2o).

(

lm

0m,n

\

/q{n,m,m+n)(

0n;m 5("*."'-")(D

*

f\\

m+n)y\)

c(n,7n,7n+n) / f\ 0(771+71*,n',m+n) ( f*sj\ o{m-j-n' ,n* ,m+n) o(n',m,ro+n')/f/\ *-> U / c(m ) n',m+n')/' / , , \ *-> U/J Un* ,771+71'

/„\ n \ , N u m,n* \ / o f ^ m - t - r ^ m - r - n W i - * ^ n , * I I , ^ ' u n ' , n * I I g(m+Ti',n*,m+n)/'„} J-n* /

^

RHS of (23).

To prove (23'), we have the following equations where the first and the last equations are deduced by items (0') and (0[) whereas the middle two equations are deduced

185

%12: REMARKS

by block multiplication followed by items (2) and (2o).

f

£

'

(1) 0 m - i n \

s qfam^m+n),

*\\

o(n,m" ,m+n) I f\ c(m* +n,m',m+n) I f a*\ g(m,n,m+n)/g\ J-TTl*

fl vm',m*

Vm* ^m''-\-n c(«,"X* ,m+n)( f\ c(n,TB',m'+n)/f\ I I ^' " "'w; I I , o U) l \ u q(m',m'+n,m+n)(„*\ n,m'+n)fqi\l \

= RHSof(23').

Taking determinants of all the matrices in (23), by (19), (20) and (21) we get (24)

afP^m\f,g)

= a*0n'>("'•"*)(/''

,g)p(n">m\f\g)

and taking determinants of all the matrices in (23'), by (19'), (20') and (21') we also get (24')

b^P^m\f,g)

= b^'P^m'\f,g')P^m'\f,g*).

By the "principal of universal identities" let us show that the factors a™ and b^i can be cancelled out from the above two equations. So consider the polynomials F'(Y) = A'0Yn' + A'^Y71'-1 + ••• + < , G'{Y) = B'0Ym' + B j y " 1 ' - 1 + ... + B'm, and F*(Y) = A*QYn" + A^Y71'-1 + • • • + A*n. G*(Y) = B*QYm' + BIY™'-1

+ --- + B*m.

whose coefficients A' = (A'0,..., A'n,), B' = {B'Q,..., B'm,), A* = (A%, ...,A*n.) and B* = (BQ , . . . , B^,) are indeterminates over R which are assumed to be "independent" of the previous indeterminates A = (Ao,..., An) and B = ( S o , . . . , Bm). Then F'(Y)F*{Y)=

Y^

Fi{A',A\B)Yn-i

0
G'(Y)G*(Y)= J2 Gj(A,B',B*)Ym-j 0<j<m

LECTURE L4: VARIETIES AND MODELS

186

where Fi and Gj are polynomials over R in the exhibited indeterminates. Let Q{A',A*,B)

P(n''m){F',G)P{n"'
=

Q'(A, B', B*) = p("- m ')( j p, G')P{n'm'\F,

G*)

and Q(A',A*,B)

=

Q(F0(A',A*,B),...,Fn(A',A*,B),B0,...,Bm)

Q'(A, B', B*) = Q(A0, ...,An,

G0(A, B', B*),...,

Gm(A, B', B'))

where Q, Q ,Q, Q' are polynomials over R in the exhibited indeterminates. Now clearly Q(A' ,A*,B) = P< n ' m ) {F'F*, G) Q'(A,B',B*)

= P ( "' m ) (F,G'G*)

and AQ71 and B^ are nonzerodivisors in R[A', A*,B] and R[A, B', B*} respectively, and hence by (24) and (24') we get (25)

Q(A',A*,B)=Q(A',A*,B)

and (25')

Q'{A,B',B*)

=

Q'{A,B',B*).

Upon letting a = (a0,... ,an), b = (b0,... ,bm), a' = (a'0,... ,a'n,),b' = (b'0,... a* = ( o 5 , . . . , a ; . ) , b* = (bl,...,b*m.), we clearly have

,b'm,),

P ( " ' m ) (/, 9) = Q(a', a*,b) = Q'(a,b',b*) p(n\m)

( / / ) g)pin>

,m)

p(n,m')

( / ) g,)p{n,m*)

( /

. ^ g)

=

( / ? ff,} =

g ( f l / > fl* ( ft) g'^

^

^

and therefore upon substituting small letters for the corresponding big indeterminates in the polynomial identities (25) and (25') we get (26)

P{n'm)(f,g)

=

P{n''m)(f',9)Pin*'m)(r,9)

P ( " ' m ) (/,
P{n'm,)(f,9')P{n'mn(f,9*)-

and (26')

By repeated applications of (26) and (26') we get the

%12:

REMARKS

187

Product Rule for Resultant: if u, v, n(i),m(j)

are in N

with n = E i ^ u ^ W and m = E i < j < « m ( i ) (27)

{ and / ( i ) (y), s ( i ) (y) are in R[Y] with / = Ui
f(i) ^ d 5 = n K i < . 9U)

then P("- m )(/, f f ) =

Ili&KuYlimvPW'^Hf®,^)-

By (3), (4), (5), (6) and (27) we get the following three special product rules. First Special Product Rule for Resultant: (28)

if f(Y) = a0 Ui
R

Second Special Product Rule for Resultant: (29)

'if g(Y) = 60 UiKjKmV ~ Pi) w i t h & i n R mn 6&Ill<,-<m/(^)t n e n p(«.m)(/, fl ) = ( - l )

Third Special Product Rule for Resultant:

(30)

'if f(Y) = a0 Ui
- «i) and g(Y) = b0 Yl^^Y

= a^UiKiKnUi^Km^i

- Pi)

- Pi)-

By (8) and (27) we get the Tiny Resultant theorem: if q, n', ml are in N with q > 0 and n = q + n' and m = q + m' (31)

and / = hf and g = hg' where h, / ' ,
By (8) and (31) we get the Small Resultant theorem: if R is a domain with m + n ^ 0 and / and g have a (32)

nonconstant (i.e., of positive y-degree) common factor in R[Y] thenP("' m >(/,0) = O.

188

LECTURE L4: VARIETIES AND MODELS

By (32) we get the Little Resultant theorem: if R is a domain with m + n ^ 0 and f(-y) = 0 = 3(7) (33)

for some 7 in an overfield of R thenP("> m )(/,#) = 0.

In (E7) of §10 we proved the Basic Fact (Tl) stated in §1. Now, in view of (3) to (5), by (28) or (29) we get another proof of (Tl) which we restate as (35). The version (34) is clearly equivalent to (35). First Version of Resultant theorem: (34)

if R is a field then: p(n,m) = o<=>m + n ^ O a n d either a0 = 0 = b0 or / and g have a nonconstant common factor in R[Y}.

Second Version of Resultant theorem: if R is an algebraically closed field then: (35)

p(n,m) = 0 < ^ m + n ^ 0 a n d either aQ = 0 = b0 or / and g have a common root in R.

REMARK (R7). [Intval Domains, Divisibility Rings, Maximality Rings, and Equivalent Valuations]. From L2§3, L3§7, and L3§8, recall various terms related to valuations, and note that out of assertions (Jl) to (J26) about valuations made in L3§§7-9, we still have to prove that: (J24)

every normal domain is intval

where a domain is intval means it is an intersection of a nonempty family of valuation rings of its quotient field. We shall now prove this as well as the Valuation Theorems (T25) to (T28) stated in §9. First let us introduce some more concepts. By a divisibility ring we mean a domain in which, out of any two nonzero elements, at least one divides the other. In other words, a domain R with quotient field K is a divisibility ring means it satisfies one and hence all of the following five equivalent conditions: (1) (2) (3) (4) (5)

For any For any For any For any The set

nonzero element x in K we have either x € R or 1/x G R. nonzero elements x,y m K we have either x/y £ R or y/x € R. nonzero elements x, y in R we have either x/y £ R or y/x € R. principal ideals I,J'mR we have either I c J or J C I. of all principal ideals in R is linearly ordered by inclusion.

%12:

REMARKS

189

To see the equivalence of (1), (2), (3), in (1) change x to z and then write z = x/y. The equivalence of (3), (4), (5) is obvious. Given any divisibility ring R with quotient field K, we introduce an ordered abelian group T(R) together with a valuation JR : K —> T(.R)u{oo} in the following manner. We call these the divisibility group of R and the divisibility valuation of K induced by R respectively. It will turn out that the value group of 7# is T(R), and the valuation ring of 7^ is R. As a group T(R) equals the factor group U(K)/U(R) written additively, where we recall that U(R) denotes the group of all units in R. The restriction of j R to U(K) = Kx is the residue class epimorphism and we stipulate that JR(0) = 00. For any x,y in Kx we define: 1R{X) < JR(V) «• y/x e R. Since R is a divisibility ring, it follows that the relation < on T(R) is well-defined (i.e., is independent of the representatives in Kx) and converts it into an ordered abelian group. Using the assumption that R is a divisibility ring, we also see that JR is a valuation of K with value group T(R) and valuation ring R. By a divisibility ring of a field K we mean a divisibility ring with quotient field K. We may restate the above construction by saying that: if R is a divisibility ring of a field K then (7.1)

the divisibility group T(R) is an ordered abelian group and the divisibility valuation 7^ is a valuation of K with value group T(R) and valuation ring R.

Note that for any valuation v and any nonzero x, y in Rv we have either v(x) < v(y) or v(y) < v(x) and hence either y/x G Rv or x/y e Rv, and therefore Rv is a divisibility ring. This may be taken as a motivation for the above definition of a divisibility ring. Also note that for any valuation v of a field K, the restriction of v to Kx gives an epimorphism Kx —> Gv with kernel U(RV) and for any x, y in Kx we have: v(x) < v(y) <£=> y/x € Rv. This motivates the above definitions of the divisibility group and the divisibility valuation of a divisibility ring. Thus we have shown that: (7.2)

I a domain is a valuation ring { J^iff it is a divisibility ring.

Valuations v and w of a field K are equivalent means there is an order (preserving) isomorphism 8 of the value group Gv onto the value group Gw such that for all x S Kx we have 6(v(x)) = w(x). By what we have said above, it follows that: J every valuation is equivalent to the I divisibility valuation of its valuation ring

190

LECTURE L4: VARIETIES AND MODELS

and (7-4)

I valuations of a field are equivalent < [ iff their valuation rings coincide.

We claim that for any valuation v and for any ideals / and J in Rv we have: (*) either I c J or J C I. To see this, first note that: (**) x G J and y G Rv with v(x) < v(y) => y £ I (because y = x{y/x) with y/x G Rv). Now suppose J £ I and take y £ J\I. Then for every x £ I, by (**) we get u(x) > v(y) and hence by applying (**) to J we get x £ J. This proves (*). Now in view of the equivalence of (1) to (5), by (7.2) and (*) we see that: (7.5)

I a domain is a valuation ring { I iff the set of all ideals in it is linearly ordered.

By a maximality ring of a field K we mean a subring R of K such that R is a quasilocal domain which is not dominated by any quasilocal domain S which is a subring of K with S ^ R. We claim that: J a subring R of a field K is a valuation ring of K 1 iff it is a maximality ring of K. Namely, if R = Rv for some valuation v of K and S is a quasilocal domain dominating R such that S is a subring of K with S ^ R then we can find x £ S\R; clearly x G X \ -R =>• w(x) < 0 => (l/x) G M(i? t) ) =» (l/x) G M(S) (because S dominates R) => x ^ S* which is a contradiction. Conversely assume that R is a quasilocal domain which is not dominated by any quasilocal domain S which is a subring of K with S ^ R. Given any u £ K \ R let (j> be the unique i?-epimorphism of the univariate polynomial ring R[X] onto R[u] which sends X to u. Let J be the ideal in R[X] generated by ker() and M{R). If J ^ -R[^] then by Zorn's Lemma we can find a maximal ideal N in -R[X] with J C N, and this would give us a maximal ideal (./V) in R[u] whose intersection with R equals M(R); localizing R[u] at (j>(N) gives us a quasilocal domain R[u]^N) which dominates R and is a subring of K different from R; this is a contradiction. Therefore J = R[X], and hence we can find a positive integer n, elements a0,-.. ,an in R, and elements po ... ,pn in M(R) such that do + a\u + • • • + anun = 0 and (a 0 + aiX H 1- anXn) + (po+PiX -\ \-pnXn) = 1. The last equation implies that a0 = 1 -p0 £ R\M(R) and a, = —pi £ M{R) for 1 < i < n. Consequently a0 is a unit in R, and upon letting bi = ai/ao we get bi £ M(R) for 1 < i < n with

(!/«)"+ £ fc(l/«)n-< = 0. Ki
By minimizing n we may assume that 1/u does not satisfy any equation of this type of degree smaller than n. Suppose if possible that 1/u g- R. Then in a similar

§i2: REMARKS

191

manner we can find a positive integer m together with elements c; e M(R) for 1 n. Multiplying the above displayed equation for 1/u by cmun and then subtracting it from the above displayed equation for u we get d0um+

Yl

dium-i

=0

l
with d0 € R\M(R) unit do we obtain

and dt € M(R) for 1 1. Now dividing by u we get the equation u" 1 - 1 +

^

e^u" 1 - 1 -' = 0

l
which contradicts the minimality of m. This shows that R is a divisibility ring with quotient field K and hence we are done by (7.2). Given any chain (under domination) of quasilocal domains which are subrings of a field K, their union is clearly a quasilocal domain (whose maximal ideal is the union of the maximal ideals of the members of the chain), which is a subring of K and which dominates every member of the chain. Thus the said union is an upper bound of the chain. Therefore, in view of (7.6), by Zorn's Lemma (see L2§5) we conclude that: J if R is a subring of a field K such that R is quasilocal I then R is dominated by some valuation ring of K. We claim that J if P is a prime ideal in a subring A of a field K then \A is contained in a valuation ring T of K with P = M(T) n A. This follows by taking R = Ap in (7.7). Next we claim that: J if y is a nonunit in a subring A of a field K then [A is contained in a valuation ring T of K with y £

M(T).

This follows from (7.8) by noting that by Zorn's Lemma there exists a prime ideal (in fact a maximal ideal) P in R with y e P.

LECTURE L4: VARIETIES AND MODELS

192

If w is a valuation of a field L and K is a subfield of L then we get a valuation v of K by putting v(x) — w(x) for all x G K, and clearly Rv = Rw !~\ K and iJ w dominates R„. Thus we have shown that: .

J if S is a valuation ring of a field L and K is a subfield of L y then 5 n K is a valuation ring of K dominated by S.

If R is a valuation ring of a field if and L is an overfield of K then by (7.7) R is dominated by some valuation ring S of L and by (7.6) and (7.10) we must have R — SC\K. Thus we have shown that: if R is a valuation ring of a field K and L is an overfield of K (7.H)

then R is dominated by some valuation ring S of L and for any such pair we have R = 5 PI K.

This completes the proof of Theorems (T25) to (T28) of §9. If a subring R of a field K is integrally closed in K then for any x G K\Rwe must have x $ R[l/x] (because otherwise we can write x = qo + q\(\/x) -\ h qn(l/x)n n with n in N and qo,...,qn in R, and multiplying by x this would give an equation xn+1 — qoXn — qix11"1 qn = 0 for x over R), and therefore by taking A = R[l/x] and y = 1/x in (7.9) we can find a valuation ring T of K with R[l/x] C T such that 1/x G M(T) and hence such that x £T. Since every valuation ring is normal, it follows that: J if a subring R of a field K is integrally closed in K then I i? is the intersection of all valuation rings of K which contain it. This completes the proof of (J24). REMARK (R8). [Modelic Proj]. Let A b e a subring of a field K. Given any family (X;); 6 A of elements in K, with Xj ^ 0 for some j G A, let E = W(A; (XI)I&A) and let K' = QF(j4[(i|/ij)igA]) where t £ A with xt ^ 0, and note that K' is clearly independent of the choice of i. Also let yi = xi/xi> for any fixed i ' s A with Xi> ^ 0. Theorem (T29) of §9.1 says that then we have the following. (T29.1) E is a premodel of K'/A and E = W{A; (xt/x)leA) for all 0 / x G K. In particular (yi)ieA is a family of elements in K' such that yj ^ 0 for some j G A, and we have E = 221(^4; (yi)ie\)(T29.2) Given any R G E and any subring S oi K such that S1 is a quasilocal ring dominating R, there exists j G A with Zj 7^ 0 such that £;/:rj G S for all / G A. Moreover, for any such j we have R = BQ where B = A[(XI/XJ)I&A] and Q = BnM(5). (T29.3) E1 is a semimodel of K'/A, and if A is finite then E is in fact a complete model of K'/A.

%1S: REMARKS

^

193

PROOF OF (T29.1). It suffices to note that if xt ^ 0 then for all/ G A we have = a;,/^ and hence A[(^)ieA} = A[(xi/xi)leA].

PROOF OF (T29.2). Since R G W(A; {xi)ieA), there exists f G A with xr ^ 0 such that R G 33(B') where B ' = A[(xi/xj')i&A], and then we have xijxy G P for all I G A, and P = B'Q, where Q' = B' n M(R). Since 5 dominates P , we get a;//a;j' G 5 for all / G A, and Q' = B' n M(S). Therefore the first assertion follows by taking j = j ' . To prove the second assertion, given any j G A such that Xj =fi 0 and if/xj G S for all Z G A, let B = A[(xi/xj)i^A] and Q = B n M(5). Then 1 XJ/XJ' and xy /XJ are both in S and hence they are units in 5. Since Xj/xy G P , 5 dominates P , and Xj/xy is a unit in 5, we get that Xj/xy is a unit in R and hence XJ>/XJ G P ; consequently XI/XJ = {xi/xy){xy/XJ) G R for all / G A and hence B C R; since 5 dominates R and <2 = B(~\M(S), we get that <3 = Bf)M(R) and hence B Q C R. Again since XJ>/XJ G BQ, S dominates BQ, and xy/xj is a unit in S, we get that xy /XJ is a unit in -BQ and hence Xj/xy G -BQ; consequently xi/xy = (XI/XJ)(XJ/XJ>) G B Q for alU G A and hence B' C B Q ; since 5 dominates BQ and Q' = B ' n M(5), we get that Q' = B' n M(B Q ) and hence B Q , C B Q . Since -R = B Q , , we conclude that R = BQ. PROOF OF (T29.3). The first assertion follows from (T29.1) and (T29.2). To prove the second assertion assume that A is a finite set. In view of the first assertion it suffices to show that if V is any valuation ring of K' containing A then V dominates %0(A; (xi)ie\). Since A is a finite set, there exists j G A with Xj jL 0 such that XI/XJ G V for all / G A. Let B = A[(xi/xj)iex\ and Q = BC\M{V). Then V dominates BQ and BQ G W(A; (xi)ieA). REMARK (R9). [Modelic Blowup]. Let A be a subring of a field K, and let P be a nonzero A-submodule of K. Theorem (T30) of §9.2 says that then we have the following. (T30.1) For any 0 ^ x G P we have (A[Pa;- 1 ])P = {A[Px~l))x and hence RP = Rx for every R G V{A[Px~x}). In particular, if P is a nonzero ideal in A then for every R G 2J(A[.Pa:-1]) we have that PR is a nonzero principal ideal in R. (T30.2) Given any family {xi)ieA of generators of P, for all 0 ^ x G P we have A[Px~l] = A[{xi/x)l€A], and we have W(A; (ZJ)J€A) C W(A, P). (T30.3) For any family (:E/); 6 A of generators of P we have W(A,P)

=

W(A;(xl)l€A).

(T30.4) For all x ^ 0 + y in P we have Q F ^ P a ; - 1 ] ) = QF(A[Py- x ]) and letting K' denote this common QF we have that W(A, P) is a semimodel of K'/A, and if P is a finitely generated ^4-moduIe then %B(A, P) is a projective model of K'/A. In

194

LECTURE H: VARIETIES AND MODELS

particular, if P is a finitely generated ideal in A then W(A, P) is a projective model of QF(A)/A. UP = Ax for some 0 ^ x G K then W(A, P) = W(A; x) =
= R}.

(T30.6) If ^4 is quasilocal and P is an ideal in A then: P is a principal ideal in A «• 2J(A, P) = 9J(A) o ^ G 2»(A, P ) .

PROOF OF (T30.1). Obvious. PROOF OF (T30.2). The first assertion is obvious. The second follows from it. PROOF OF (T30.3). In view of (T30.2) it suffices to show that W(A,P) C W(A; (xi)ieA). So let any R G W(A, P) be given. Then there exists 0 / i e P such that R = BQ where B = AlPx'1] = A[{xi/x)i€K} C R and Q = B n Af (P). Since 0 ^ i £ P , there exists a nonempty finite subset A' of A such that x = J2i£\> rixi with ri G A. Then 1 = J2I&A> ri{xi/x) a n d n G R and (zj/a:) G P for all I G A', and hence there exists j G A' such that Xj/x $. M(R). Consequently Xj ^ 0, and Xj/x and X/XJ are units in R. In particular XI/XJ = (XI/X)(X/XJ) G R for alH G A and hence B' C R where B ' = A[(a;j/a;j)jeA]. Upon letting Q' = B ' n M ( P ) we get that B'Q, G W(A; (xi)ie\) and P dominates BQ,. Since z/z.,- G P Q , , R dominates BQ,, and X/XJ is a unit in R, we get that X/XJ is a unit in B'Q, and hence Xj/x G P Q , . Consequently xi/x = (XI/XJ)(XJ/X) G P Q , for all I £ A and hence P C B'Q,; since P dominates B'Q, and Q = P fl M(R), we get Q = B D M(B'Q,) and hence S Q C B'QI, i.e., RCB'Q,. Therefore R = B'Q,, and hence P G 2ZJ(A; {xi)t€A). Thus 22J(AP)c21J(^;(^);eA). PROOF OF (T30.4). Follows from (T30.3) and (T29). PROOF OF (T30.5). Assume P is an ideal in A. First let P G <8(A) such that PR = P; then P ^ M(R) and hence there exists 0 ^ x G P such that x g M(R); now A[Pa: _1 ] c P and hence, upon letting Q = A[Px~l\ n M ( P ) , we get P = (AIPX-^Q G 23J(AP)- Conversely let R G 22J(^,P) such that PR = P; now W(A,P) dominates 9J(A) and hence there exists R' G 5J(A) such that P dominates P'; since R dominates R' and PR = R, it follows that PR' = R'\ therefore R' G W(A, P) by what we have already proved; since P dominates P as well as R', and by (T30.4) W(A, P) is an irredundant premodel of the quotient field of A, we must have R = R'\ consequently R G %3(A). PROOF OF (T30.6). Assume A is quasilocal and P is an ideal in A. Now: P is a principal ideal in A => P = xA for some 0 ^ x € A => 2U(A, P) = 2U(A; a;) = 2J(v4).

§i3: DEFINITIONS AND EXERCISES

195

Also clearly: W(A,P) = A £ W(A,P). Finally: A £ W(A,P) =• A £ WiAlPx-1}) for some 0 ^ x £ P => AlPx'1} C A ^ AWx'1} = A => PA = P(A[P:r _ 1 ]) = x{A[Px~l\) = xA =$> P is a, principal ideal in A. §13: DEFINITIONS A N D EXERCISES EXERCISE (E10). [Hilbert Basis Theorem]. Show that an alternative proof of the polynomial case of the Hilbert Basis Theorem, i.e., Theorem (T3) of §3, can be obtained by slightly modifying the proof of the power series case, i.e., the proof of Theorem (T4) of §3. HINT. In the proof of (T4), everywhere change [[]] to [], and in the second summation of display (2*) change < oo to < deg(s). EXERCISE ( E l l ) . [Primary Modules]. Let P be an ideal in a ring R and let Q be an Pi-submodule of an Pi-module V. Recall that Q is primary if Q ^ V and for any r £ R and s £ V with rs £ Q and s $. Q we have r £ radyQ where radyQ = {r £ R : reV C Q for some e £ N+}. In §5(013) (which obviously generalizes §5(07) and §5(08) from ideals to modules) we said that the following three conditions are equivalent and when they are satisfied Q is called P-primary. (i") Q is primary with radyQ = P. (ii") P C radyQ ^ R and: r£R and s£V\Q with rs £ Q => r £ P. (iii/—) Q is primary, &niiR(V/Q) is primary (as ideal), and P is prime with radyQ = PWe also stated the following four consequences of the said equivalence: (1") Q is P-primary and B <£. Q for submodule B of V =3- (Q : B)R is P-primary. (2") Q = Qi D • • • n Qh where Qt is P-primary for 1 Q is P-primary. (3") P is a maximal ideal in R with P C radyQ ^ R =>• Q is P-primary. (4") P is a maximal ideal in R and PeV cQ^V (for instance Q = PeV) with e £ N+ => Q is P-primary. The exercise is to give the details of the proof. HINT FOR THE EQUIVALENCE. The definition of primary can be paraphrased in terms of zerodivisors by saying that

(11.1')

Q is primary <=*> Q ^ V and

ZR(V/Q)

C radyQ

196

LECTURE L4: VARIETIES AND MODELS

and similarly for an ideal J in R (11.2')

J is primary <=> J ^ R and ZR(R/J)

C rad^J.

Likewise (11.3')

(ii") » P c radyQ ^ i? and ^ ( V / Q ) C P.

Also clearly (11.4')

Q ± V & r&dvQ + R

and (11.5')

radvQ = T&dR&imR(V/Q)

and (11-6')

ZR{R/onnR(V/Q))

C

ZR(V/Q).

Putting annj^V/Q) = J, by (11.1') to (11.6') we see that (i") => (ii") =}• Q is primary and a.nnR(V/Q) is primary, and obviously (iii") =>• (i"). To complete the equivalence proof, we need to show that (ii") =*• P is prime with radyQ = P. For any r in vadyQ, by (11.5') we have re £ ann^(V r /Q) for some positive integer e; letting e to be the smallest such, and assuming radvQ ^ R, by (11.4') we get r 6 " 1 £ ann jR (V/Q); it follows that r e V C <2 and re~lV <£ Q; since re~lV £ Q, we can find t £V with r e - 1 £ ^ Q, and since r e V C Q, we get r e £ S Q; thus: 11.7'

f for some e € N + and £ £ P we have r e rad^Q ^ ii =>• I e , \r t £ Q with re~H £ Q.

Assuming (ii"), in the above set-up, by taking s = re~1t we have s £ V \ Q with rs £ Q and hence we get r £ P. Thus (11.8')

(ii") =• P = radvQ.

Again assuming (ii"), given any r\ £ R\P and r-i £ R with r £ P where r = r?,r\, in view of (11.8'), by taking s\ = {r2T\)e~1t in (11.7') we get si £ V \ Q with r2TiSi £ Q; since r\ £ R\P and si £ V \ Q, by (ii") we get S2 £ V \ Q where S2 = T\SI; since r2 £ R and s2 £V\Q with T2S2 £ Q, by (ii") we get t2 £ P . Thus (ii") => P is prime. HINT FOR THE FOUR CONSEQUENCES. To prove (1"), assume that Q is P-primary and B <£_ Q for submodule B of V. We want to prove that (Q : B)R is P-primary. By the definition of (Q : B)R we get (Q : B)RB C Q; in view of (ii") and because Q is P-primary, the last inclusion and the noninclusion B • p e V C Q for some e £ N+ => p e P C Q (because B c V) => pe £ (Q : B)R; thus:

§i3: DEFINITIONS AND EXERCISES

197

radv<2 C radfl(<3 : B)R; since Q is P-primary, we also have P C r&dyQ, and hence: (b) P C rad fi (Q : B)R. By (a) and (b) we get: (c) radR(Q : B)R = P. Now: rGR and s G R\(Q : B)R with rs G (Q : £ ) # =>• r G i? and s G R such that for some v G B we have sv £ Q but rsu e Q ^ r g rad„<3 =>• r G P by (c); thus: (d) r G R and s G R\(Q : B)R with rs G (Q : B)R => r G P. in view of (ii"), by (c) and (d) we see that (Q : B)R is P-primary. To prove (2"), assume that Q = Q\ n • • • f~l Qh where Qi is P-primary for 1 r G R and for some i G { 1 , . . . , h} we have s G y \ Q i with rs G Qi =>• r £ radyQ» = P by (ii"). Therefore again by (ii") we conclude that Q is P-primary. To prove (3"), assume P is a maximal ideal in R with P C iadvQ ^ R- Let r G R\ P and s G V be such that rs G Q. In view of (ii"), it suffices to show that then s G Q. Since P is maximal, we must have P + Rr = R and hence we can write 1 = c + dr with c G P and d G R. Since c G P C rad^Q, we can find m G N + with c m V C Q. Raising the equation 1 = c + rfr to the m-th power we get 1 — lm = cm + d'r with d' G R and multiplying throughout by s we obtain s = cms + d'rs G Q. To prove (4"), in view of (3"), it suffices to note that P C radvPeV. DEFINITION (Dl). [Support of a Module]. The support of a module V over a ring R is defined by putting s u p p f i ^ ={PG

spec(P)

:[0:P}v^V}.

EXERCISE (E12). [Support a n d Annihilator]. Show that for the support of a module V over a ring R we have the following. (E12.1) If V = RB with B c V then s u p p l y = \Jb&B vspec fi (ann fl &). (E12.2) If V is finitely generated then s u p p l y = vspec i j(ann/jK). HINT FOR (E12.1). Assuming V = RB, for any P e spec(P) we have P G supply &[0:P}v^V <=» there exists b G B such that b g [0 : P]y < <=> there exists b G B such that b g (0 : s)v for all s G R \ P «=> there exists b G B such that bs ^ 0 for all s G R \ P O there exists b G B such that &xmRb C P .^

p

£ Ubes vspec^ann^fc).

198

LECTURE L4: VARIETIES AND MODELS

HINT FOR (E12.2). Assuming V = RB with finite set B = { 6 1 , . . . , &„} where n G N+, for any P G spec(P) we have P e supply < «=> P G vspec H (ann fl 6j) for some i

[by (E12.1)]

<^> ann/{6j C P for some i and obviously P G vspecfl(ann/{V) <=$> arniflV' C P and hence it suffices to show that ann#6j c P for some z <=> ann/jV C P. The =>• part of the above is obvious because for all i we have ann^V C ann#&i. Moreover: the negation of the LHS => for all i we have ann#6j • for each i there is S, G P \ P with Sj&j = 0 = * > S I . . . S „ G P \ P with si... snV = 0 =>• the negation of the RHS. EXERCISE (E13). [Support and Quasiprimary Decomposition]. Let U be a submodule of a module V over a ring R. Show that: (E13.1) If U ^ V and V is finitely generated then SuppR(V/U)

= {P£ spec(P) : [U : P]v * V}

and suppR(V/U)

= {P G spec(P) : [f : P]v is P-quasiprimary}

(E13.2) If P is a minimal element of s\xppR{V/U)

then [U : P]v is P-primary.

HINT. Use the above Exercise (E12) and Theorem (T8') of §6.1. DEFINITION (D2). [Homogenization and Dehomogenization]. Given a polynomial / ( Y i , . . . , Yjv) in indeterminates Y i , . . . , Yjv over a field fc, taking a nonnegative integer d > deg(f), we can "homogenize" / to get a homogenous polynomial F — F{X\,..., XJV+I ) of degree d in indeterminates X\,..., XN+I over k by the substitution Yj = Xi/X^+i, i.e., by putting F(Xi,...

,XN+I)

= XN+1/(XI/XN+I,-

••

,XN/XN+I)-

§J3: DEFINITIONS AND EXERCISES

Conversely, given any homogeneous F = F{X\,... "dehomogenize" it by putting f(Yu.

..,YN)

= F(Y1, ...,YN,1)

199

,Xjv+i) of degree d we can

= XJid+1F(XiXN+i,...

,YNXN+1,XN+1).

EXERCISE (E14). [Embedding Projective Space Into Projective Model]. In §9.1 we gave a natural injection 6 from the projective iV-space Pj^ over a field k into the modelic projective TV-space (P^)*5 = W(k;X\,... ,XN+\). There we used the irredundancy of 2B. Without using the said irredundancy, give a more elementary construction of the said injection on the following lines by using homogenization-dehomogenization. Given u = ( u i , . . . , ujv+i) G A ^ + 1 \ { ( 0 , . . . , 0)} we want to find its image in W(k;X\,...

,XN+I)

= Ui
with BN,k,i = k\Xii' ••••> YN+i,i]

where

Yu =

X\/Xi.

For i, i' in { 1 , . . . , N + 1} with m ^= 0 ^ Ui> consider the local rings Rk(y)* and Rk(y')* where v = ( u i / w i , . . . , UN+i/ui) G Aj^ and v' = (u 1 /u i /,...,u;v+i/ u «') G A ^ , . If we show that these local rings coincide then we can send the equivalence class of u in P ^ to that common local ring. So the exercise is to show that Rk{v)* =

Rk(vTHINT. The case of i = i' being trivial, suppose i ^ i'. Relabelling suitably we can arrange that i = 1 and i' = 2. Disregarding the coordinate Yu which identically equals 1, we have -Bjv.fc = BN,k,i — k[Yi,...,

Yff]

where (YUY2, ...,YN)

= (X2/X1,X3/X1,

...,

XN+1/X!)

and V = ( u 2 / u i , U 3 / W l , - - - . UN+l/ui)

G Aj^.

Likewise, disregarding the coordinate Yu> which identically equals 1, we have •"JV.fc

=

B{q,k,i'

=

k\Yi, . . . , XJVJ

where (^l'> Y2> •••' YN)

=

{Xl/X<2, X3/X2,

...,

XN+1/X2)

200

LECTURE L4: VARIETIES AND MODELS

and v' = (ui/u2,u3/u2,...

,uN+i/u2)

eAy.

Now let us note that the local ring Rk(u)* is the set of all quotients of polynomials F(Xi,... ,XN+I)/G(XI, ... ,XN+I) with G(u\,... ,UJV+I) / 0, the local ring Rk(v)* is the set of all quotients of polynomials f(Y%,..., Y^)/g(Yi,..., YN) with g(v\,..., vpf) ^ 0, and the local ring Rk(v')* is the set of all quotients of polynomials f'(Y{,..., Y[<)/g'{Y{,..., y/r) with g'(v[,..., v'N) ^ 0. For f/g in Rk(v)*, homogenizing, i.e., multiplying / and g by Xf for large enough positive integer d we get f/g = F(X\,... ,XM+I)/G(X\, . . . , X J V + I ) where F,G are homogeneous of degree d with G(ui,... ,UN+I) ^ 0, and hence F/G £ Rk(u)* • Dehomogenizing, i.e., dividing F and G by X2d we get F/G = f'{Y{,... ,Y^)/g'{Y{,... ,Y^) with 5'(ui,...,«Ar) + 0, and hence f/g' G Rk(v')*. Thus Rk(v)* C Rk(v')*, and by symmetry Rk(v')* C Rk(v)*. Therefore Rk{v)* = Rk{v')*. §14: N O T E S NOTE (Nl). [Princeton Book]. My Princeton Book [AOl] provides a rapid introduction to parts of this and some of the following Lectures. In particular, Observation §5(O20)(13*) and Theorem §6(T8) stem from Chapter I Section 2 of that book, and Observation §5(O20)(14*) and Theorem §6.1(T8') are their modulifications. In [AOl] these items were motivated by the desire of giving a unified treatment of ramification of local rings which are always noetherian and valuation rings which are not (unless the value group is a subgroup of Z). NOTE (N2). [Resolution Book]. The language of models was introduced at the end of my Princeton Book [AOl] and was further developed in my Resolution Book [A05]. In particular the material of §8 and §9 is mostly taken from [A05]. More material from [A05] will be included in later Lectures. As a motivation for the idea of an irredundant premodel introduced in §9, the proof of the existence of a natural injection of a projective space into a projective model, which is given in §9.1 and which uses the irredundancy, should be compared with the longer but more elementary proof of the same which is given in §13(E14) and which uses the technique of homogenization-dehomogenization. NOTE (N3). [Kyoto P a p e r ] . The treatment of resultants given in §12(R6) is taken from my Kyoto Paper [A03]. The same is true of the material on Approximate Roots given in (D7) and (E15) to (E17) of L2§7. Later Lectures will contain further material from that paper. A significant feature of the treatment of resultants given in §12(R6) is the establishment of product rules for the resultant matrix and the ensuing deduction of the product formula for the resultant itself. This makes the whole theory applicable to polynomials over a ring with zerodivisors. The methods

§15: CONCLUDING NOTE

201

used are longer but more elementary since they depend only on the grade-school operations of addition and multiplication and do not involve root-extractions. NOTE (N4). [Valuations and L'Hospital's Rule]. The L'Hospital Rule from calculus says that if x(t),y(t) are infinitely differentiable functions of a real variable t which are not identically zero then the limit of x(t)/y(t) as t tends to zero always exists although it may be infinite. Motivated by this, what we have called a divisibility ring in §12(R7) could have been called a L'Hospital ring, and then assertion (7.2) of §12(R7) would have said that a L'Hospital ring is nothing but a valuation ring. The situation for two or more variable functions is quite different, as shown by the quotient X/Y which takes all possible values as X and V both tend to zero. This fundamental indeterminacy is mitigated by the introduction of valuations which may be viewed as an algebraization of the idea of limits. As we shall see in later lectures, the said indeterminacies can be "removed" by means of the modelic blowups introduced in §9.2 NOTE (N5). [What Remains To Be Done]. Referring to §1(07) and the last para of §12(R5), the only items from §1 which remain to be dealt with are Bezout Observations (04) and (05). The only other assertions from the present Lecture which remain to be proved are Theorems (T19) to (T24) of §8; Theorems (T25) to (T30) of §9 were proved in (R7) to (R9) of §12. Finally, referring to L3§12(03), we proved (Jl) and (J2) in §10(E2) and we proved (J24) in §12(R7), and so the only item from Lecture L3 which remains to be dealt with is Homework (H3). §15: CONCLUDING N O T E It is good to keep a balance between algebra and geometry. Let logical formality and narrative discourse follow each other.

Lecture L5: Projective Varieties §1: D I R E C T SUMS OF MODULES Before studying projective varieties which are defined in terms of homogeneous polynomials, we prepare some algebraic background. Given any polynomial in a finite number of variables X\,..., Xm over a ring R, by collecting terms of like degree in it we can uniquely write it as a finite sum of homogeneous polynomials. This is formalized by saying that, as an .R-module, the polynomial ring S = R[Xi, • •., Xn] is a direct sum

S = £

St

0
where Si is the set of all homogeneous polynomials of degree i, including the zero polynomial. Moreover, for all fi G Si and fj G Sj we have fofj G Si+j. This is expressed by saying that S is a graded ring over R with gradation ( S i ) ^ . Quite generally, a module V over a ring R is a direct sum of a family (Vj);g/ of submodules if it is the sum

and the sum is direct, i.e., for every finite subset I' of the indexing set / we have: ]T\ e I , Vi = 0 with Vi G Vi for all i G / ' =» vt = 0 for all i € V. Note that V = Y^isi ^ means every v G V can be written as v = ^2ieIi Vi for some finite subset I' of I and some u< G Vi for all i G I'. The fact that the sum is direct is expressed by writing something like V — V^ Vi

(direct sum)

iei

or something like

In case / is a finite set, say I = { 1 , 2 , . . . , n}, we may write something like V = Vi 9 V 2 ® •••®Vn. These are sometimes called internal direct sums to distinguish them from the external direct sums which we shall now introduce and which we shall again denote by ®. To stress the fact of being internal, the above displays may be written as V = y~] Vi

(internal direct sum)

202

§/: DIRECT SUMS OF MODULES

203

or V =0

Vi

(internal)

iei

or V = Vi © V2 © • • • © Vn

(internal).

Recall that W = U\ x • • • x Un is the cartesian product (see L2§1) of a finite number of sets Ui,...,Un and it consists of all n-tuples u = (ui,...,un) with Ui G Ui. Assuming U\,... ,Un to be i?-modules, we convert W into an i?-module by componentwise addition and scalar multiplication, i.e., by denning u + u' = (ui+Ui,... ,un + u'n) for all it' = (u[,... ,u'n), and ru = {rui,... ,run) for all r £ R. With this module structure we call W the (external) direct sum of U\,..., Un and indicate this by writing W = U\ © U2 © • • • © Un

(external)

where we shall usually omit the adjective external. If U\ = • • • = Un — U then, as in the case of cartesian products, we put W = Un and call it the (module theoretic) direct sum of n copies of U. More generally, the cartesian product of a family (Ui)i£i of sets Ui, where / is any indexing set (which need not be finite), is denned by putting IT Ui = the set of all maps <j>: I —» M Ui such that cf>(i) G Ui for all i & I. iei

i€l

In case the Ui are i?-modules, this is converted into an f?-module again by defining componentwise addition and scalar multiplication, i.e., for any tj> € Yiiei Ui w e P u ^ ((i) for all i £ I; this module is called the (external) direct product of the family, where the adjective external is usually omitted. The set of all <j) G Elie/ ^» whose support supp(0) = {i £ I : (i) ^ 0} is finite is called the (external) direct sum of the family and is denoted by ^H Ui

(external)

where the adjective external will usually be omitted. Note that this is a submodule of the direct product Y\isI Ui and is hence an .R-module. For any j G / we get an injective map (called the natural injection) \ij : Uj —> Ilie/ U*tys a v m g that for every x G Uj we have fij(x)(i) = x or 0 according as i = j or i ^ j , and we get a surjective map (called the natural projection) Vj : Y\i&1 Ui —> Uj by saying that for every y € YlieI Ui we have Vj(y) = y{j)- Clearly Vi\ii is the identity map of Ui. All this works for sets Ui. In the module case these maps are Rhomomorphisms, and they give rise to an obvious .R-monomorphism (again called the natural injection) JJ,J : Uj —• ®i^iUi and an obvious i?-epimorphism (again

204

LECTURE L5: PROJECTIVE

VARIETIES

called the natural projection) Vj : © i e / A —> Uj. Note that upon letting Ui to be the image of /x, we have ^ A i€I

(external) = ^

A

(internal).

iel

If the set Ui = a set U for alH e 7 then the cartesian product YlieI Ui is denoted by U1 and is called the set-theoretic 7-th power of U, and if U is an 7?-module then the module U1 is called the module theoretic J-th power of U. Finally, if the module Ui = a module U for alM € 7 then the module © i £ / A is denoted by U^ and is called the module theoretic restricted 7-th power of U. If {ai)iei is a family of 7?-homomorphisms on : Ui —• A where (Di)iGi is a family of .R-modules, then we get an 7?-homomorphism Y\ieI Ui —> YlieI A or ©,e/A —> © i e / A which sends any 0 in Y[iei A or © i e / A to i/> in flie/ A or © i e / A given by ?/)(i) = cti((j>(i)) for all i £ 7; we call this the direct product or the direct sum of the family (aj)jg/ and denote it by I l i e / a ' o r ®i£iai respectively. Note that the kernels of these maps are f ] i e / k e r ( a , ) and ©j e /ker(aj), and their images are Y[ieI im(ai) and ©i € /im(ai), respectively. Also note that for the definition of the ma P Yliel ai : Yliel A —* Hie/ A , it suffices that the a, : Ui —> Di be set-theoretic maps; in case Ui — U for all i G 7, the diagonal map [/ —» C/7 sends any a; e £/ to <j> £ U1 with (i) = x for all i € I, and multiplying it from the left by I L e / ai w e get a map U —> flie/ A which we call the diagonal product of the family {ai)i^i\ in case of modules, the diagonal map is an 7?-monomorphism and the diagonal product is an 7?-homomorphism. Note that if the indexing set 7 is finite then the direct product coincides with the direct sum. In particular, if 7 is finite then the diagonal product U —> Y\ieI Di = ©i € /7)j may be called the diagonal direct sum of the family (aj)j e /. If Ti is a submodule of A for all i e l , then clearly Yli€l Ti and ©ie/Tj are submodules of l i t e / A and © i S / A respectively, and upon letting /3j : Ut —> Uj/T, be the residue class epimorphism we see that Yliei Pi '• l l i e / A —* I l i e / ( A / 7 i ) and ©ig//?, : ©jg/A —> ®iei(Ui/Ti) are 7Z-epimorphisms with kernels ] l i e / ^ a n d ©ie/Ti respectively. For finite cartesian products Ui x •• • x Un, and for the corresponding external direct sums, we used the tuple notation. In case of indexed families (C/j)jG/, for cartesian products Y\ieI Ui and the corresponding external direct sums, we used the functional notation. The two are related by saying that the n-tuple u = ( u i , . . . , un) corresponds to the function (= map) u : { 1 , . . . ,n} —> Ujg/A with u(i) written as Ui. The concepts of the above several paragraphs carry over to the tuple notation. For instance the natural projection U\(B- • -®Un —> Uj picks out the j - t h component, and the natural injection Uj —» U\ ® • • • © Un puts zeros around it. Given maps oti : Ui —» A , their product a\ x • • • x an : U\ x • • • x Un —* D\ x • • • x Dn as

well as their direct sum a\ ® • • • ® an : U\ © • • • © Un —> £>i © • • • © Dn is given by ( u i , . . . ,u n ) H-> (ai(ui),... ,an(un)). Moreover, in case of U\ = • • • = Un = U,

%1: DIRECT SUMS OF MODULES

205

the diagonal maps U —> U\ x • • • x Un and U —> C/i © • • • © f/n are given by u t-> (M, . . . , u), and the diagonal product U —> £>i x • • • x D„ and the diagonal direct sum {/ —> £>i © • • • © Dn are given by u i-» ( a i ( u ) , . . . , a„(u)). We use the notation o?i © • • • © a n for the last map, i.e., we write a\ © • • • © an : U —> -Di © • • • © Dn

(diagonal direct sum)

where, as said above, a i © • • • © an(u) = (cti(u),... ,an(u)); we may drop the parenthetical phrase "diagonal direct sum" when clear from the context; note that now the maps on : U —> Di are from a common set U, and hence the notation a.\ © • • • © an : U —> D± © • • • ® Dn will not be confused with the similar notation a i © • • • © Q„ :U\®---®Un—>D\®---®Dn where the maps a» : U, —> Di are from differently labelled sets Ui. For the above diagonal direct sum we clearly have (1)

ker(ai © • • • © an) = ker(ai) n • • • D ker(a n ).

Most of the discussion of the above several paragraphs applies with internal direct sums replacing external direct sums. For instance the inclusion map Vj —> ®ieiVi is the natural injection, and the natural projection ©jg/Vi —> Vj is the obvious epimorphism whose kernel is ®i^i\{j}Vi. It is usually clear from the context whether we are dealing with internal or external direct sums. For instance when the constituents Vi are submodules of a module V then it must be internal, and when the modules Ui are not given to be submodules of some module then it must be external. For the lengths of finite (internal or external) direct sums of .ft-modules U\,..., Un, by induction applied to L4§5(021)(18 # ) we see that (2)

IR(UI

© • • • © [ / „ ) = W i ) + • • • + tR(Un)

with the usual convention about sums involving oo. All the above matter applies to additive abelian groups by viewing them as modules over Z. EXAMPLE (XI). [Annihilators and Direct Sums]. In L4§10(X18) we gave an example of a submodule T of a non-finitely-generated module U over a ring R together with a prime P in R such that a,miR(U/T) c P but [T : P]u = U, and in L4§10(X19) we gave an example of a nonprimary submodule W of a module V over a ring R such that aniiR(V/W) is a primary ideal in R. Now using (external) direct sums we give other such examples. First let x b e a nonzero element in a domain R such that r\ien{^lR) — 0; for instance R = the polynomial ring in one or more indeterminates X, Y,... over a field k, and x = X. Let U = ffiieN^ and T = ©ieN^i with Ui = R and I* = x'R. Let P be the zero ideal in R. Then P is a prime ideal in R. Given any O ^ r e f i we have r g xnR for some n £ N; we can take £ U with (f>(i) = 1 or 0 according as i = n or i ^ n, and then r € U with (r)(n) £ Tn and hence rcj) $. T\ consequently

206

LECTURE L5: PROJECTIVE

VARIETIES

r G" annn(C//T). Thus annfl([//T) = P. Given any ip G U we can find m G N such that i/)(i) = 0 for all i > m; now a:mi/> G T with x m G R\ P and hence V G [T : Pjy. Thus [T:P]u = U. Next let P be any domain having a nonzero prime ideal P ; in other words, let R be any domain which is not a field. Let OB and I s be the images of 0 G R and 1 G R in E = R/P. Let V = R © E and let W be the zero submodule of V, i.e., W = 0 © 0j5- Then the zero element in V is ( 0 , 0 B ) and for any r G R we have: r ( l , 0 £ ) = ( 0 , 0 B ) =>• r = 0; therefore ann#(V7W) is the zero ideal in R which is prime and hence primary. But we can take 0 ^ t £ P and then (0, 1B) G V \ W with i(0, 1 E ) = ( 0 , 0 E ) G W but no power of t belongs to a,rmR(V/W). Therefore W is not a primary submodule of V. §2: G R A D E D R I N G S A N D H O M O G E N E O U S IDEALS We are now ready to formalize the concept of a graded ring. Recall (see LI §4) that an additive abelian monoid is a nonempty set I together with an element 0 and a binary operation + which to every pair of elements i, j in I associates a third element i + j such that for all i, j , k in I we have: i + 0 = i, i + j = j + i, and (i + j) + k = i + (j + k). For instance, Z and N are additive abelian monoids, and for any positive integer d, so are Z d and Nd. By a graded ring S over a ring R we mean an overring S of R together with a family (5j)j g / of i?-submodules of S, where I is an additive abelian monoid (see Ll§4), such that S = Y^ Si

(internal direct sum of .R-modules)

with So = R and such that for all fi G Si and /_,- G 5,- with i, j in I we have fifj G S»+j. The family (5i)j g / is called the gradation (or grading) of S, and I is called the type of S. Elements of Si are called homogeneous elements of degree i. Collectively, elements of Ujg/Si are called homogeneous elements. The direct sum assumption tells us that every s G S has a unique expression

s = J2si with Si £ Si; the element SJ is called the homogeneous component of s of degree i, and collectively the elements st are called the homogeneous components of s. Note that Si — 0 for almost all (i.e., all except a finite number of) i. Also note that we have Cauchy Multiplication, i.e., if tj are the homogeneous components of t G S,

§2: GRADED RINGS AND HOMOGENEOUS IDEALS

207

and (st)k are the homogeneous components of st, then

(st)k = Y,

Sit

i-

i+j=k

An ideal J in S is said to be a homogeneous ideal if it satisfies one and hence all of the following three mutually equivalent conditions: (i) J = H,iei(J l~l Si) (as additive groups). (ii) The homogeneous components of every element of J belong to J. (iii) J is generated (as an ideal in S) by homogeneous elements. The unique decomposition s = Y2iei si yields (i) <=> (ii). Obviously (ii) =>• (iii). To prove (iii) => (ii), let (xj)jgi, be a family of generators of J with xi G S^i)- Any 1/6 J can be expressed as y = Y^ieL zixi where zi G S with zi = 0 for almost all L Writing y = Y,iei Vi where y, G Si with yt = 0 for almost all i, and zj = J2j&i zh where zij G Sj and z^ = 0 for almost all j , we get

and hence y, =

V,

z x

lj l

e

^

f ° ra u

i G /.

j+d(0=i

Thus we have shown that (1)

conditions (i), (ii), (iii) are mutually equivalent.

As in the case of an element, we call Si the homogeneous component of S of degree i, and more generally, for any homogeneous ideal J in S, we call J n Si the homogeneous component of J of degree i. We use i-th homogeneous component as a synonym for homogeneous component of degree i. Also we may say component instead of homogeneous component. By a graded subring of S we mean a subring T of S such that T is a graded ring of type I over T (1 R with gradation (T n Si)ie.rGiven any other graded ring S' of type / over a ring R' with gradation (5,-)^/, by a graded ring homomorphism : S —> S' we mean a ring homomorphism such that {Si) C S{ for all i G J. Clearly: if

1 onto the field fc/v-i and the kernel of the induced epimorphism A —> /CAT-I is J. Consequently J is a maximal ideal in A. Therefore by the induction hypothesis / has a unique set of N — 1 generators (fi(Xi,..., Xi) £ .Bj,fc)i
=

X^+9i(X1,...,Xi)

where rij is a positive integer and gi(X\,... de

g x , 9i{Xi,...,

,Xi) £ Bi^ with

Xi) < rij

for

1 < j < i.

Moreover, for 1 < i < N — 1 we have that: the polynomial / , ( x \ , . . . , Xi-\, Y) is the minimal polynomial of x^ over fcj_i (and hence the field degree [ki : /CJ_I] equals rij), the polynomial fi(X\,..., Xi) is irreducible in Bj,fc, and g{Xi,...,

Xi) i-> g{x±,... ,Xi)

gives a bijection

7, —> fcj.

Upon letting n^ = [fc/v : fc^-i] we see that njv is a positive integer and we can find /JV = JN(XI, ... ,Xpf) £ B such that / w ( x i , . . . ,xn-i,Y) is the minimal polynomial of xn over kN-i and JN(XI,.

where gN(X\,...,

.. ,XN)

= X^N

+ gN(Xi,...

,XN)

X^) £ B with deg Xj gN(Xi,...,XN)

< rij

for

1 < j < N.

Clearly /N(XI, ... ,XN) is irreducible in B and belongs to J. Given any h £ B, by the division algorithm we can uniquely write h = qfjy + r with q, r in B such that degxNr < njv. From this it follows that g(Xi,..., XN) *-* g(%i, • • •, XN) gives a bijection 7 ^ —> k^. Moreover h £ J =$• r £ I =$• r £ (fi,..., JN-I)A =i>fce (/1, • • • i IN)B, and hence ( / 1 , . . . , /W)S =
= X*

+ Gi(Xu.

where Ni is a positive integer and Gi(X\,..., de g j f , Gi(Xlt...,

Xi) < Nj

..,Xi)

Xi) £ Bitk with for

1 < j < i.

(Q10) NORMALIZATION

THEOREM AND REGULAR POLYNOMIALS

255

Now (IB) n A = I and hence there is a unique epimorphism 6 : B —> C = k^-i [Y] such that 0{XN) = Y and 9(z) = <j>(z) for all z € A. Let f(Y) = fN(x1,...,xn.1,Y)

and

F(Y) = FN(Xl,...

,xn-i,Y).

Then f(Y)C = 0(J) = F(Y)C and hence f(Y) = F(Y). Therefore NN = nN, and given any h e B, by the division algorithm we can uniquely write h = QF^ + R with Q,Rm B such that degx N i? < njv. From all this we see that ( F i , . . . , FJV-I)^4 = / . Hence by the uniqueness part of the induction hypothesis F, = /, with Ni = rij for 1 mspec(Bjv,fc), where we recall that for every a = ( a i , . . .,aN) € A f = kN we have put I fc (a) = {/ e Bjv,fc : / ( a ) = 0}. Note that for f = f(Xi,...,XN)

G BNtk

and

a = (ai,...,aw)eAf = K"

where k C /t are any fields we write /(a) = /(ai,...,aiv). Clearly (T22.4) follows from (T22.3) or even from (T22.2). To see the latter, assuming k algebraically closed, let there be given any J in mspec^^fc). By (T22.2) the residue class epimorphism <j> : BN^ —* BN,IC/J maps k onto the entire image. Hence there exists a £ A^ such that <j>(Xi) = 4>(ai) for 1 < i < N. Now clearly we havelfc(a) = J = (Xi - Q i , . . . ,XN - aN)BNtkTo prove the rest of (T22) and some relevant portions of L4§8(T21), henceforth let k C K be fields, and let R = BNtk

= k[Xu.

..,XN]

and

S = k% = KN

where N is a nonnegative integer. For any U C S let I{U) = lk(U) = {f eR:

f(a) = 0 for all a € U}

and for any J c R let V(J) = VK(J) = {a£S:

f(a) = 0 for all / e J } .

Let I' = {I(U)

:UCS}

and rd(i?) = set of all ideals in R which are their own radicals. let V = avt fc (A^) = {V(J)

:JdR}

256

L5: PROJECTIVE VARIETIES §5: MORE ABOUT IDEALS AND MODULES

and V" — set of all irreducible members of V' where U in V is irreducible means U is nonempty and cannot be expressed as union of two members of V which are subsets of U different from U. Now clearly

{

(1)

for any subsets J and J' of R we have: J C J' => V(J') C V(J) and we have: radR(JR)

= mdR(J'R)

=» V(J) = V(J')

and (2)

for any subsets U and U' of S we have: U C U' => /([/') C J(f/). We claim that for any family of ideals v(EieLJi)

(3)

=

(JI)IZL

in R we have

nieLv(jl),

and if the family is finite then we have

{v(nleL Ji) = v(UleL Ji) = yJieLVW and (4)

{

for any family of subsets {U{)ieL of S we have I(VI€LUI)

= n ;eL 7(C/()-

Concerning the first equation of (3), by the first implication of (1) we see that LHS C RHS; conversely: a € RHS => a € V(UieL-/() => a e LHS where the first implication is obvious and the second follows from the second implication of (1). Concerning the last line of (3), by the first implication of (1) we get the inclusions UieLV(Ji) C V(nj £ £,Jj) C VXPI/gL ^ ) i moreover, for any a € S we have that: a ^ UjgLV'(Jj) => for each I 6 L there exists /; G J; with ft(a) ^ 0 =^ / ( a ) 7^ 0 with / = X\leLfi G Y\l£LJi, and hence ^(flieL Jl) c U; e /,y(J;). Concerning equation (4), by (2) we get LHS C RHS; conversely: / G RHS => / ( a ) = 0 for every a G Ul€LUi =>fe LHS. We also claim that J for any J c f i w e have J C \ a n d : J = I(V(J))

I(V(J))

& J G V => J = r a d R J

and J for any U C S we have 17 C V{I{U)) [and: [/ = V{I(U))

**UeV.

The first line of (5) follows by first applying the first line of (1) and after that applying (2). The first line of (6) follows by first applying (2) and after that applying

(Q10) NORMALIZATION

THEOREM AND REGULAR POLYNOMIALS

257

the first line of (1). Concerning the second line of (5), obviously J = I{V{J)) =>• J £ I1; moreover 'J el' =>J = I(U) for some

UcS

=*. V(J) = V(I(U)) by taking V of both sides ' =>UC V(J) by first line of (6) => I(V(J))

C J by (2) because I{U) = J

=> J = J(V(J)) by the first line of (5); also: / G radfi J with J = I(U) for some U c S =4- / n e J for some n G N + < => / " ( a ) = 0 for all a G £/ => / ( a ) = 0 for all a G U

and hence J = r a d ^ J . Concerning the second line of (6), obviously U = V(I(U)) => U G V; moreover 'UeV => U = V(J) f o r s o m e ^ C i? =» /([/) = J(F( J)) by taking J of both sides U j C /(£/) by first line of (5) =*• V{I(U)) cUby (1) because V{ J) = U => U = V(I(U)) by the first line of (6). Next we claim that (7)

V" = {U G V : I(U) G spec(fl)}.

To prove (7) let U G V. Assuming U to be irreducible, we have U ^ 0 and hence I(U) ^ R; moreover, for 1 < i < 2, given any fc G R\I(U), upon letting Ui = {a G U; fi{a) = 0}, we have Ui G V with Ui C U and f7j ^ [/; since U is irreducible, we must have t/i U J72 ^ U, and hence /1/2 ^ 7(£/); thus I(U) is prime. Conversely assuming I(U) to be prime, we have I(U) ^ R and hence U =/= 0; moreover, given any C/i,C/2 in V" with J7iUt/ 2 = *7, by (4) we have I{Ui)(M{U2) = I{U) and clearly I{Ui)I{U2) C I(Ui) n 7(£/2) and hence I(Ui)I(U2) C 7(t/); now assuming f/j ^ J7, by (2) and the second line of (6) we get 1(17) C 7(17!) with I(U) ^ I{U\), and hence the primeness of I(U) tells us that 7(L/2) C I(U) and therefore by the first line of (1) we get U2 = U; thus U is irreducible.

258

L5: PROJECTIVE VARIETIES §5: MORE ABOUT IDEALS AND MODULES

Finally we claim that every U £ V can be expressed as a finite union /OA (8)

I <

U

= u^
and this decomposition is unique up to order if it is irredundant, i.e., if no Ui can be deleted from it, (where the irredundancy can be achieved by deleting some of the Ui, and after it is achieved, the remaining Ui are called the IRREDUCIBLE COMPONENTS of U). Before proving (8) we note that the noetherianness of R tells us that it has no strictly increasing infinite sequence of ideals, i.e., there does not exist any infinite sequence ideals J 0 C J\ C • • • C Jn C . . . in R with J0 ^ J\ ^ • •• ^ Jn^ ..., and hence by (2) and the second line of (6) we see that V has no strictly decreasing infinite sequence, i.e., there does not exist any infinite sequence ' W0 DWXD---D

WnD...

in V

^with W0 + Wl ± • • • ± Wn +

....

To prove the existence part of (8), suppose if possible that some U £ V has no irreducible decomposition, i.e., cannot be expressed as a finite union of members of V". Then U itself cannot be irreducible and we can write U = U\ U U^ with U\,U% in V such that U\ ^ U ^ U^\ note that the empty variety has the empty irreducible decomposition (with h = 0), and hence U is nonempty, and therefore U\,U2 are also nonempty. Clearly either U\ or U% has no irreducible decomposition (because if both did then putting them together would produce an irreducible decomposition of U). Say U\ does not have an irreducible decomposition. Let WQ = U and W\ = U\. In this manner we get an infinite sequence WQ D W\ D • • • D Wn D ... of members of V with Wo 7^ W\ 7^ • • • 7^ Wn T^ . . . such that none of the Wn has an irreducible decomposition. This contradicts (9) and so proves the existence part. To prove the uniqueness part of (8), in addition to assuming the decomposition displayed in (8) to be irredundant, let U = Ui<j
(Q10) NORMALIZATION

THEOREM

AND REGULAR

POLYNOMIALS

259

BN+i,k' - k'[Xx,.. .,XN+1] and W = JT + (1 - XN+1f)T'. Then clearly there is no a' = K , . . . ,a'N+l) G A ^ + 1 such that f'(a') = 0 for all / ' G H'. Suppose if possible that H' ^ T". Then if' is contained in some maximal ideal H* in T", and by (T22.4) we have # * = (Xi - a[,.. .,XN+i - a'N+1)T'. But this gives the contradiction that f'(a') = 0 for all / ' G H'. Therefore we must have H' = T". Hence, upon letting T = BN+hk = k[Xu..., XN+i] and H = JT + (1 XN+1f)T, by the following Comment (C15) we get H = T. Consequently we can express 1 as a finite sum 1 = (1 - XN+if)g

+ Y^

with

fi9i

fit

and

J

9,9i'mT.

l
Substituting l/X^+\ for X^+i in the above identity and then multiplying both sides by X^+1 for a large enough n e N+ we get

J2 fiGi

X%+1 = (XN+i-f)G+

with

fi£j

and G,Gi in T.

l
Now substituting / for Xw+i in the above identity we obtain

/" = H fihi

with

/» € J

and h e R

i

\
and hence / " € J. The sixth part (T22.6) of (T22) says that if K contains an algebraic closure of k then the mapping U t-> I{U) gives inclusion reversing bijections V —> rd(i?) and V" —> spec(-B) whose inverses are given by J i—• V^(J). In view of (1), (2), (5), (6) and (7), this follows from (T22.5). COMMENT (C15). [Ideals Under Field Extensions]. In the above proof of (T22.5) we used the fact that: if k c k' are any fields and T = k[X\,..., XM] and T" = k'[Xi,..., XN] are finite variable polynomial rings, then for any ideal H in T we have (HT') C\T = H. To see this, we can take a vector space basis (zj)ie.j of k' over k with Zj = 1 for some j £ I. By expressing the coefficients of any / € T" in terms of this basis we can uniquely write / as a finite sum f = ^2zifi

with

fiET

where J is a finite subset of I with j £ J. Note that then /

T <=* / , = 0 for all i € J \ {j}.

e

Assuming / € ifT" we can write it as a finite sum / = Y,

F

i9i

with

Fi G T1

and

ffi

G H.

260

L5: PROJECTIVE

VARIETIES §5: MORE ABOUT IDEALS AND MODULES

By taking J large enough, for all I £ L we have Fi = ^2 ZiFu

with

Fu G T.

ieJ

Now the uniqueness says that for all i £ J we have

fi = ^2FugieH. leL

Hence if / e (HT') n T then / = fiGH.lt

follows that ( F T ' ) n l = ff.

COMMENT (C16). [Supplement to Hilbert Nullstellensatz]. In the context of L4§8(T22.3), let cj> : B^,k -* B^^/J be the residue class epimorphism, let k0 be the subfield cj>(k) of the field {Xi) together with li = {9{Xi, ...,Xi)£

Bi,k : d e g x . 5 ( X i , ...,Xi)<

rij for 1 < j < i}.

Then, for 1 < i < N, as shown above: kt = /c,_j(a;j) is a finite algebraic field extension of the field fc,_i (i.e., kt is an overfield of the field fcj_i for which we have [hi : fcj_i] < oo and ki = ki-i(xi)) with [fcj : fcj_i] = m, the polynomial fi{x\,..., Xi-i, Y) is the minimal polynomial of Xi over fcj_i, the polynomial fi(Xi,..., Xj) is irreducible in Bitk, and g(X\,..., Xi) H-> 5(0:1,..., ij) gives a bijection 7* —> fcj. COMMENT (C17). [Decomposition of Ideals and Varieties]. In (1) to (8) above we have proved the first out of the three cases of the Inclusion Relations Theorem, i.e., L4§8(T21). The other two cases will be dealt with in the next Quest. In view of the portions of the Inclusion Relations Theorem and the Hilbert Nullstellensatz proved above, the decomposition of affine algebraic varieties into their irreducible components can be found thus. Let k C /t be fields such that K contains an algebraic closure of k. For any ideal J in the finite variable polynomial ring B = k[X\,... ,XN] take a primary decomposition J = ni<j< e Qi where Qi is primary for the prime ideal Pi in B. Then V K (P,) = VK(Qi) = an irreducible variety in A^ for 1 < i < e and V re (J) = Ui<j< e V K (Pj) is a decomposition of V K (J) into irreducible subvarieties. Moreover, if the primary decomposition of J is irredundant and is so labelled that Pi is a minimal or embedded prime of J according as 1 < i < d or d + 1 < i < e, then VK(J) = Ui
(Qll)

NILRADICAL,

JACOBSON

SPECTRUM,

AND JACOBSON

RING

261

QUEST ( Q l l ) Nilradical, Jacobson Spectrum, and Jacobson Ring At the end of this Quest we are going to give Two Supplements to the Dimension Theorem (T47) which slightly strengthen some parts of it. First we shall complete the proofs of the Inclusion Relations Theorem and the Hilbert Nullstellensatz, i.e., L4§8(T21) and L4§8(T22). To put the matter in proper perspective, we start by defining Nilradical, Jacobson Spectrum, and Jacobson Ring. Recall that in (Q9) and (Q10) we avoided using (Ql) to (Q8). In the current Quest we shall avoid using (Q2) to (Q8), except that we may use the incidental comment (Cll) of (Q7) but we shall not use Comment (C8) of (Ql). Also note that (Q9) to (Qll) are independent of §1 to §4 of this Lecture L5. After making some definitions, we shall prove the Nilradical Theorem (T48). Then we shall prove the Spectral Relations Theorem (T49) which includes the remaining two cases of L4§8(T21). After that we shall prove the Spectral Nullstellensatz (T50) which generalizes the remaining two parts (T22.7) and (T22.8) of the Hilbert Nullstellensatz of L4§8(T22), and adds yet another incarnation (T50.3) of it. Still after that we shall prove the Minimal Primes Theorem (T51) which says that (Cll)(2) remains valid without the noetherian hypothesis. Finally we shall prove the Supplementary Dimension Theorems (T52) and (T53). Recall that in (Q3) we defined the Jacobson radical of a ring R by putting

jrad(tf) =

f|

P.

P(Emspec(.R)

Now we define the nilradical of R by putting

nrad(tf) =

f]

P.

FGspec(R)

We also define the Jacobson Spectrum of R by putting jspec(-R) = < P £ spec(.R) : P = {

f]

Q\

Q£mvspec R P

J

and we note that then mspec(i?) c jspec(ii) C spec(R). The ring R is said to be a Jacobson Ring if jspec(i?) = spec(R). In analogy with the definitions of vspec^ J, mvspec#J, svt(-R), msvt(R), and imsvt(iZ) made in L4§8, for any J c f i w e put jvspec R 7 = {P € jspec(i?) : J C P) and call this the Jacobson spectral variety of J in R, and we put jsvt(ii) = the set of Jacobson spectral varieties in R

isvt(R),

262

L5: PROJECTIVE

VARIETIES §5: MORE ABOUT IDEALS AND MODULES

i.e., subsets of jspec(i?) of the form jvspec/{J for some J C R (we may call these varieties in jspec(-R)), and we put ijsvt(i?) = the set of all irreducible members of jsvt(i?) where U is irreducible means U is nonempty and U is not the union two proper subvarieties, i.e., subsets which belong to jsvt(i?) and are different from U. By an IRREDUCIBLE COMPONENT of U e jsvt(i?) we mean an irreducible subvariety U' of U such that U has no irreducible subvariety U" ^ U' with U' C U"; similar definitions hold for IRREDUCIBLE COMPONENTS of varieties in msvt(R) or svt(f?). Recall that rd(i?) = the set of all radical ideals in R and for any U C spec(iZ) we defined the spectral ideal of U by putting ispecfi£/ = npgt/P. NILRADICAL THEOREM (T48). For any ring R we have nrad(i?) = rad^O. More generally, for any ideal J in any ring R we have rad/jj = ispec R (vspec R J), i.e., radflJ is the intersection of all the prime ideals in R which contain J. PROOF. The first assertion is L4§5(O24)(20") which itself was a repetition of the first line of L4§5(O20)(12*). The second assertion follows by applying the first assertion to the residue class ring R/J. SPECTRAL RELATIONS THEOREM (T49). Given any ring R, let ' ( S , ^ ' , ! / " , / ) stand for (mspec(-R), mvspec R , msvt(i?), imsvt(i?), ispec R ) or (jspec(i?),jvspec fi ,jsvt(R), ijsvt(i?), ispec^) or (spec(-R), vspec^, svt(-R), isvt(ii), ispec fl ). In all the cases let I' stand for the set of all ideals J in R such that J = I{U) for some U C S. Then we have the following. (T49.1) For any subsets J and J' of R we have: J C J' =*> V(J') C V(J), and we have: radR(JR) = ia,dR(J'R) => V(J) = V(J'). (T49.2) For any subsets U and U' of S we have U C U' => I(U') C I(U). (T49.3) For any family of ideals (J;)/eL in R we have V(%2leL Ji) = r\i€LV(Ji), and if the family is finite then we have V(r\i£L Ji) = ^(ILeL «^) = ^ieLV(Ji). (T49.4) For any family of subsets {Ui)i€L of S we have I{Ui€LUi) = nieLI{Ui). (T49.5) For any J C R we have J C I(V(J)) and: J = I(V(J)) J £ / ' =*• J = radfl J. (T49.6) For any U C S we have U C V(I{U)) and: U = V(I(U)) &• U G V.

(Qll) NILRADICAL, JACOBSON SPECTRUM, AND JACOBSON RING

263

(T49.7) V" = {U £ V : I(U) G spec(P)}. (T49.8) Assuming R is noetherian, every U £ V can be expressed as a finite union U = L)i
W

and

I(V(T))

= T.

Moreover: (ii) if U £ V" is such that I(U) £ S then upon letting P = I(U) we have that P is the unique member of S with U = V(P). (T49.12) Without assuming R to be noetherian, in the last two cases, i.e., when S is either jspec(i?) or spec(i?), for any U £ V" we have that I{U) £ S and upon letting P = I(U) we have that P is the unique member of S with U = V(P). PROOF. The proofs of (T49.1) and (T49.2) are clear. Concerning the first equation of (49.3), by the first implication of (49.1) we see that LHS C RHS; conversely a £ RHS =» a £ V(UieLJi)

=>a£ LHS

where the first implication is obvious and the second follows from the second implication of (49.1). Concerning the last line of (49.3), by the first implication of (49.1) we get the inclusions UJ6LV(7,)

C V(nieL Ji) C V(Y[

Jt).

Moreover, for any a £ S we have that < =>• for each / £ L there exists // e J; with fi £ a ^ f £ a with / = n , 6 L fi G ILeL Ji and hence VillJOcUizLViJt).

264

L5: PROJECTIVE

VARIETIES §5: MORE ABOUT IDEALS AND MODULES

Concerning equation (49.4), by (49.2) we get LHS C RHS; conversely / G RHS =*• / G a for every a G UteLUi =>• / G LHS. The proofs of (T49.5) and (T49.6) are verbatim the same as the proofs of (5) and (6) given in the previous Quest (Q10), except that the references to (1), (2), (5), (6) should be changed to references to (49.1), (49.2), (49.5), (49.6) respectively, and the second braced display starting with "also" should be replaced by saying that, also: / G radfl J with J = I(U) for some U C S =$• fn £ J for some n G N+ < =» / " G a for all a G U => f G a for all a G U =>/€ J and hence J = rad# J. To prove (49.7) let U G V. Assuming U to be irreducible, we have U ^ 0 and hence /({/) 7^ R; moreover, for 1 < i < 2, given any fo e R\ I(U), upon letting [/< = {a G C/;/i G a } , we have Ui G V with Ui C U and C/j ^ [/; since 17 is irreducible, we must have U1UU2 ^ U, and hence /1/2 ^ -^(C^); thus /([/) is prime. Conversely assuming I(U) to be prime, we have I{U) ^ R and hence [/ ^ 0; moreover, given any Ui,U2 in V' with [/1 U t/2 = £/, by (49.4) we have I(Ui) n /(f/ 2 ) = /(I/) and clearly I(Ux)I(U2) C /(C/i) n /(C/2) and hence I(U\)I{U2) C / ( t / ) ; now assuming f/i 7^ C/, by (49.2) and the implication portion of (49.6) we get I(U) C I{U{) with I(U) ^ /(f/i), and hence the primeness of I(U) tells us that I{Ui) C /({/) and therefore by the first implication of (49.1) we get [/2 = U\ thus U is irreducible. The proofs of (T49.8) and (T49.9) are verbatim the same as the proofs of (8) and (9) given in the previous Quest (Q10), except that the references to (2), (6), (8), (9) should be changed to references to (49.2), (49.6), (49.8), (49.9) respectively. To prove (T49.10), by Zorn's Lemma, given any U G V" we have U C U* for some maximal U* G V", and clearly any such U* is an irreducible component of S. Given any P G U G V, replacing (U, S) by (V(P), U) in the previous sentence, we see that P G V(P) C U* for some irreducible component U* of U, and hence every U in V is the union of its irreducible components, [cf. §6(E13)]. The proof of part (i) of (T49.ll) is straightforward. To prove part (ii), let U G V" be such that upon letting P = I(U) we have P G S; then by (T49.6) we get U = V(P); conversely, if T G S is such that U — V{T) then by the last equation in part (i) we get I{U) = T and hence T = P. To prove (T49.12), given any U G V", upon letting P = I(U), by (T49.7) we see that P is a prime ideal in R. It follows that if S is speci? then P G S. On the other hand, if S is jspec(i?) then every Q G S is an intersection of members of

(Qll) NILRADICAL, JACOBSON SPECTRUM, AND JACOBSON RING

265

mspec(P), and hence by the equation P = I(U) = DQ£UQ with U C S we see that P is an intersection of members of mspec(-R); thus again P € S. The rest follows from part (ii) of (T49.ll). SPECTRAL NULLSTELLENSATZ (T50). Given any ring R, let '(S.V.V'.VVJstandfor (mspec(-R), mvspec^, msvt(P), imsvt(P), ispec fl ) or (jspec(P),jvspec fi ,jsvt(P),ijsvt(P), ispec fl ) or (spec(P), vspec^,svt(P),isvt(fl), ispec^). Then we have the following. (T50.1) If R is Jacobson or we are in the case of S = spec(.R), then for any ideal J in R we have I{V(J)) — rad#J. (T50.2) If R is Jacobson or we are in the case of S = spec(P), then the mapping U (-»I(U) gives inclusion reversing bijections V —> rd(i?) and V" —> spec(P) whose inverses are given by J H-> V(J). (T50.3) If R is an afflne ring over a field, then Pi is a Jacobson ring, i.e., every prime ideal in it is an intersection of maximal ideals. PROOF. The first part (T50.1) follows from (T48) and, in view of (T49.1), (T49.2), (T49.5), (T49.6) and (T49.7), the second part (T50.2) follows from the first part (T50.1). [cf. §6(E14)]. To prove the third part (T50.3), given any afflne ring R over a field k and any prime ideal H in R, we want to show that H = C\p£mvspecRHP' We can take a fc-epimorphism : B —> R where B = k[Xi,... ,XN] is a polynomial ring over k in a finite number of variables, and we can take an algebraic closure K of k. Let G = 4>~1{H) and J = DpemvspeCBaP. Then G C J with G £ spec(B) and J G rd(P). What we want to show is that G = J. Suppose if possible that G ^ J. Then by the sixth part (T22.6) of the Hilbert Nullstellensatz L4§8(T22), which we proved at the end of the last Quest, there exist a = (a\,..., Q;v) € nN such that a € VK(G)\VK(J), i.e., upon letting Q be the maximal ideal in C = n[X\,..., Xpf] generated by X\ — a\,..., X^ — &N we have G C Q with J <£ Q. Now C is integral over B and hence upon letting P = Q n B, by (T38.2) we see that P is a maximal ideal in B. Clearly G C P but J (Z! P which is a contradiction. MINIMAL PRIMES THEOREM (T51). Given any ring R, every prime ideal in R contains some minimal prime ideal in R, and we have rad^O = np e n s p e c (ft)P. More generally, given any ideal J in any ring R, every member of vspec^J contains some member of nvspec^J, and we have rad^J — C\p&mspecRjPPROOF. The second assertion follows by applying the first assertion to

R/J.

266

L5: PROJECTIVE

VARIETIES

§5: MORE ABOUT

IDEALS

AND

MODULES

To prove the first assertion, by (T49.10) we see that for any H £ spec(P) there exists an irreducible component UH of spec(P) with H G UH and, upon letting PH = ispec^C/ff, by (T49.12) we see that PH G nspec(P)

with

PH C H.

It follows that nrad(#) = n p e n s p e c ( i l ) P and hence by (T48) we get radfiO = n PenS pec(fl)P.

FIRST SUPPLEMENTARY DIMENSION THEOREM (T52). Given any affine ring A over a field k, let (Ji)i
= dim(A).

Moreover, for any prime ideal P in A, the length of every saturated prime ideal chain P 0 C • • • C Ph in A with ht^Po = 0 and Ph = P equals h t ^ P , and the length of every saturated prime ideal chain Po C • • • C Pd in A with Po = P and dpt^Pd = 0 equals d p t ^ P . PROOF. Let dim(A) = n. For 1 A/Jt be the residue class epimorphism, by (T47.1) we get dim(i(J4)) = n. Let P be any prime ideal in A. We can take saturated prime ideal chains Po C • • • C Ph and Po C • • • C Pd in A with ht^Po = 0, Ph = P, htAP = h, P 0 = P , dpt^P^ = 0, and d p t ^ P = d; clearly we must have Po = Ji for some i and applying (T47.3) to <j>i(A) we get h + d — n, i.e.: (•) h t ^ P + d p t ^ P = n. In particular, given any nonunit ideal H in A we have h t ^ P + d p t ^ P = n for every P in vspecAH and hence htAH + dptAH = dim(A). Given any saturated prime ideal chain Po C • • • C Ph in A with ht^Po = 0 and Ph = P, we can take a saturated prime ideal chain Po C • • • C Pd in A with Po = P , dpt^Pd = 0, and d p t ^ P = d; clearly we must have Po = Ji for some i and applying (T47.3) and (T47.5) to (f>i(A) we get h + d = n with h t ^ ^ j ^ P ) = h and d p t ^ ^ c ^ P ) = d; obviously d p t ^ . ^ ^ P ) = d p t ^ P , and hence by (•) we get htAP = h. Given any saturated prime ideal chain Po C • • • C Pd in A with Po = P and dpt/iPd = 0, we can take a saturated prime ideal chain Po C • • • C Ph in A with ht^Po = 0, Ph = P, and h t ^ P = h; clearly we must have P 0 = Ji for some i and applying (T47.3) and (T47.5) to
(Qll) NILRADICAL, JACOBSON SPECTRUM, AND JACOBSON RING

and dpt^^^)i(P) = d; obviously dpt^(A)i(P) d p t A P = d.

267

= d p t A P , and hence by (•) we get

SECOND SUPPLEMENTARY DIMENSION THEOREM (T53). Let A be any affine ring over a field k. Then dim(A) is the maximum number of elements of A which are algebraically independent over k. Moreover if A* is any subring of A such that A* is also an afRne ring over k, then dim(A*) < dim(/l). PROOF. The second assertion follows from the first. To prove the first assertion, let dim(P) = n, let (Ji)i<j< m be all the distinct minimal primes of 0 in A, and let 4>i : A —> A/Ji be the residue class epimorphism. By (T47.1) we know that t r d e g ^ ^ ^ A ) < n for 1 < i < m with equality for some j = i. We can take elements Z\,..., Zn in A such that the elements j(Zi),..., (pj(Zn) are algebraically independent over i(Yr) in i(k), and hence r : B —* R where B = k[X\,... ,X^} is a finite variable polynomial ring over k, and ker(<^>) is the ideal Ik{U) of some U G avtfc(A^) for some overfield K of k containing an algebraic closure of k; we may then regard R as the coordinate ring of U.

268

L5: PROJECTIVE VARIETIES §5: MORE ABOUT IDEALS AND MODULES

QUEST (Q12) Catenarian Rings and Dimension Formula A ring A is said to be catenarian if every pair of prime ideals P c Q in it satisfies condition (*) of (T47.2) which says that: there is no infinite chain of prime ideals between P and Q, and any two saturated chains of prime ideals between P and Q have the same length. Thus the first part of (T47.2) says that an afHne ring over a field is catenarian. A ring A is said to be universally catenarian if it is noetherian and the polynomial ring over it in any finite number of variables is catenarian. Given an overdomain A' of a domain A, we say that the dimension formula (resp: the dimension inequality) holds for A relative to A' if for every prime ideal P' in A', considering the localizations R' = A'P, and R = Ap with P = P' n A, we have dim(i?') + restrdeg f i # = dim(i?) + trdeg^i?' (resp: dim(fZ') + restrdeg^i?' < dim(i?) + trdeg R i?'). We say that the dimension formula (resp: the dimension inequality) holds in a domain A if it holds for A relative to every afRne domain over A. Thus (T47.6) says that the dimension formula holds in any affine domain over a field. As we shall now see, the above concepts enable us to view parts of the Dimension Lemma (T30) and the Dimension Theorem (T47) from a somewhat more general perspective. Let us start off by giving two sharper versions of (C13). In Lemma (T54) we consider the behavior of prime ideals in a simple ring extension, i.e., in a ring extension A[x] obtained by adjoining a single element a; to a ring A, and in Lemma (T55) we shall generalize it to a ring extension obtained by adjoining a finite number of elements. SIMPLE RING EXTENSION LEMMA (T54). Let A and A' be noetherian domains such that A' = A[x) for some x £ A'. Let P' be any prime ideal in A' lying above any prime ideal P in A, and consider the localizations R' = A'P, and R = AP. Then restrdeg R .R' < 1

and

trdeg^i?' < 1

and dim(R') + TestrdegRR' < dim(R) +

trdegRR'.

Moreover, if trdeg^-R' = 1 then dim(ii') + restrdeg fl i?' = dim(it) + trdeg H .R'.

(Q12) CATENARIAN

RINGS AND DIMENSION FORMULA

269

PROOF. The first display is obvious, and the third follows from (C13). So assume x is algebraic over A. Let B = A[X] be the univariate polynomial ring over A, let (/>: B —> A' be the unique A-epimorphism which sends X to x, let H = ker(<^>), and let Q = <j>~l(P'). Now H is a nonzero prime ideal in B lying above 0 in A, and hence by (C13) we have hte-ff = 1; also clearly hts-H" + htA>P' < h t s Q ; consequently 1 + dim(-R') <

dim(BQ).

Obviously restrdeg^Bg = restrdeg^-R' and Q is a prime ideal in B lying above P in .A, and hence by (C13) we get dim(B Q ) + restrdeg^i?' = 1 + dim(il). The above two displays imply dim(-R') + restrdeg fi i?' < dim(i?), and this completes the proof. MULTIPLE RING EXTENSION LEMMA (T55). Let A and A' be noetherian domains such that A1 = A[x\,..., xm] for some m G N and xi,...,xm in A'. Let P' be any prime ideal in A' lying above any prime ideal P in A, and consider the localizations R' = A'P, and R = Ap. Then restrdegfi.R' < m

and

trdeg fi i?' < m

and dim(i?') + restrdeg R i?' < dim(i?) + trdeg fi i?'. Moreover, if either trdeg fl i?' = m or the polynomial ring B = A[X\,..., variables is catenarian, then

Xm] in m

dim(i?') + restrdeg fi i?' = dim(R) + trdeg^i?'. PROOF. The first display is obvious. The second display and the first case of the third display are trivial for m = 0 and for m > 0 their implication m — 1 => m follows from (T54), and so they are done by induction on m. So now assume that B is catenarian. Let : B —> A1 be the vinique yl-epimorphism which sends Xi to xt for 1 ^1(P'). Now H and Q are prime ideals in B lying above 0 and P in A respectively, and hence by the first case of the third display, taking (A',P',A,P) = (B,H,A,0) and (A',P',A,P) = (B,Q,A,P), we respectively get htsH + trdeg^j/l' = m and hteQ + restrdeg^i?' = m + dim(i?).

270

LS: PROJECTIVE

VARIETIES §5: MORE ABOUT IDEALS AND MODULES

Since B is catenarian, by Lemma (T56.1) below we have hts<5 - n t ^ i ? = dim(it'), and hence by the above two displays we get dim(P') + restrdegfi.R' = dim(i?) + trdeg f i P' which completes the proof. CATENARIAN CONDITION LEMMA (T56). For any ring A we have the following. (T56.1) If A is a noetherian domain then: A is catenarian o <

for every pair of prime ideals P c Q in A we have htAQ = htAP + ht ( j 4 / p) (Q/P)

<^ for every pair of prime ideals P C Q in A with

^{A/P){Q/P)

= 1

we have ht^Q = 1 + h t ^ P . (T56.2) If A is catenarian then every homomorphic image of A is catenarian and so is the localization of A with respect to any multiplicative set in A. (T56.3) If A is universally catenarian then every homomorphic image of A is universally catenarian and so is the localization of A with respect to any multiplicative set in A. (T56.4) If A is universally catenarian then every affine ring over A is universally catenarian. PROOF. In (T56.1), to see that the first condition implies the second, we can take saturated prime ideal chains Po C • • • C Ph and Po C • • • C Pd in A with Po = 0, Ph = P, htAP = h, P 0 = P , Pd = Q, and ht{A/P)(Q/P) = d; clearly Po C • • • C Ph C Pi C • • • C Pd is a saturated chain of length h + d in A between 0 and Q and hence by the catenarianness of A we get htAQ = h + d. The second condition obviously implies the third. To see that the third condition implies the first, let P C Q be any prime ideals in A together with a saturated prime ideal chain Po C • • • C P„ in A between P and Q; then by the third condition we have ht^P, = 14- ht^Pj_i for 1 < i < n and hence ht^Q — htAP = n. The ideal correspondence between a ring and its homomorphic image given in the Third and the Fourth Isomorphism Theorems of L4§5(011), and the ideal correspondence between a ring and its localization given in L4§7(T12), yield (T56.2). A ring epimorphism <j>: A —» C uniquely induces an epimorphism of polynomial rings V •' A\Xi,..., Xm] —> C[X\,..., Xm] with ip(Xi) = Xt for 1 < i < m and ip(z) = {z) for all z £ A, and hence the first part of (T56.2) implies the first part of (T56.3). Similarly, for any multiplicative set S in any ring A, the canonical homomorphisms ( J 4 [ X I , . . . , xm])s uniquely induce an isomorphism ip : {A[Xi,..., Xm])s —> As[Xi,..., Xm] with tp(6(Xi)) =

(Q12) CATENARIAN

RINGS AND DIMENSION

FORMULA

271

Xi for 1 C" = B/P be the residue class epimorphism, and let Q' = 4>{Q). Now Q' is a prime ideal in the noetherian domain C" and it suffices to show that the length of every saturated prime ideal chain in C between 0 and Q' equals htcQ''• For showing this it is enough to prove that C" is catenarian, and hence in view of (T56.1) it is enough to prove that, for any pair of prime ideals I' C J' in C", upon letting R' = C'j, and S' = C'j,, and upon letting V = D'L, where ft : C" -> D' = C'/V is the residue class epimorphism and V = '( we have dim(S") = dim(iJ') + dim(T'). Let H — P n A. Then H is a prime ideal in A, the domain C is an affine domain over the subdomain C = '(C) and T = DL where L = L ' f l C . Then the domain D is a homomorphic image of A, and the domain D' is an affine domain over D, and hence again

272

L5: PROJECTIVE VARIETIES §5: MORE ABOUT IDEALS AND MODULES

by the dimension formula assumption we have dim(T') + restrdeg T T' = dim(T) 4- trdeg T T'. Now C is catenarian by (T56.2), and hence by (T56.1) we get dim(S) = dim(i?) + dim(T). Also clearly restrdeg T T' = restrdeg s S" and trdeg T T" = restrdeg fi i?'. By the above six displays we get dim(S") = dim(fi') + dim(T'), and this completes the proof. QUEST (Q13) Associated Graded Rings and Leading Ideals Let / be an ideal in a ring R. Recall that for any h e R we have put ord(yji/)/i = max{n e N : h € / " } , and the ring R is /-Hausdorff means: o r d ^ j j / i = oo <=> h = 0; see L3§12(D5). Also recall that in case R is quasilocal with I = M(R), o r d ^ j ) and /-Hausdorff are abbreviated to ord^ and Hausdorff respectively; see the first display after (2*) in L3§11. We define the associated graded ring of (R, I) to be the naturally graded ring grad(.R, I) whose underlying additive abelian group is obtained by putting grad(.R, I) = (${In/In+1)

(external direct sum of Z-modules).

neN

This is made into a ring by requiring that for all fm e Im and / „ £ In with m, n in N we have grad^i7)(/m) • grad^7)(/„) = g r a d ^ ( / m / „ ) where the central dot indicates multiplication and where grad^i7):/"^grad(fl,/) is the composition of A„ : / " -> (In/In+1)

and

/i n : ( / " / / " + 1 ) -> grad(fl, / )

where An is the natural surjection and /in is the natural injection. We also put grad(i?,J) n = g r a d ^ i 7 ) ( n

(Q13) ASSOCIATED GRADED RINGS AND LEADING IDEALS

273

and we observe that then ker (gradfoj)) =

In+1

and im (grad" R 7 ) J = grad(i?,7) n and moreover grad(ii,I) = Y J grad(i?,/)„

(internal direct sum).

We may call grad/ l f l / ) and grad(i?,/)„ the n-th graded map and the n-th graded component of (R, I) respectively, and for any h £ In we may call grad^fl t-. (h) the n-th graded image of h relative to (R, I). Note that the ring grad(il, I) is nonnull iff / ^ R. For any h G i? we put w iu\ (grad(R,i)W iford(RJ)/i = n e N v lefo ( / v ) (/i) = I ' 10 it oid^nj-fh = oo and we call this the leading form of h relative to (R,I). For any ideal J in R we put I the homogeneous ideal in grad(-R, I) ledi( / i /)(J) = < [generated by (Mo{RJ)(h))h£j i.e., equivalently f the homogeneous ideal in grad(i?, I) ledi( R / )(J) = < [ generated by U „ € N S r a a w ) (J

and we call this the leading ideal of J relative to (R,I). generators x = (XI)I£L of I and letting yi =

n

7

")

Taking any family of

gi&d\RJ){xi)

we see that grad(it, I) is a semihomogeneous (homogeneous if L is finite) ring over the ring grad(i?, I)0 with grad(JJ, J) = grad( J R,/) 0 [grad(ii,/) 1 ] = gia,d(R, I)0[(yi)ieL] and moreover considering the polynomial ring k\(Xi)iei\ with k = R/I as a semihomogeneous (homogeneous if L is finite) ring over the ring k (see (C6) of §3), we get a unique graded ring epimorphism ©?«,/)

:

M(*I)«6L]

- grad(ii, / )

with

k = R/I

274

L5: PROJECTIVE

VARIETIES §5: MORE ABOUT IDEALS AND MODULES

such that QfRj){z)

= Ho{z) for all z e k

and

9fRI)(Xi)

= yi for all I e L.

The homomorphism 0? f l 7N is said to be induced by (R, I) and x. Given any ring homomorphism : R —* R' and any ideal V in R! with (f)(1) C / ' , we get a unique graded ring homomorphism grad(R,/)(

giad(R',I')

such that, for every n £ N, the homomorphism gradfojjfa) : grad(fl,/)„ -> grad(i?',/')n induced by grad(flj)() are said to be induced by 4>. Note that upon letting J = kei((f>) and

H = ledi( fl) /)(J)

for the homogeneous ideal l e d i ^ / ^ J ) in grad(.R, /) we have that [cf. §6(E15)] (1)

\ed\{RJ)(J)

c ker(grad(Hi/)(4>))

and if is surjective with (f)(1) = / ' then grad(fli/)(
(2)

with ledi{RJ)(J)

= ker(grad (i j 7) (^))

and if (p is surjective with (f)(1) = I' and J is a principal ideal in R generated by an element h whose leading form le£o(R j)(h) is a nonzerodivisor in grad(.R, / ) (3)

then grad(/j)/)() is surjective and ker(grad( f l n ((/>)) is the principal ideal in gra.d(R,I) generated by

leio(Rj)(h).

In case R is a quasilocal ring with / = M(R), in the above three paragraphs we write R in place of (R,I), except that we write grad(i?) and grad(i?)„ in place of grad(ii,/) and grad(i?,/)„ respectively. A valuation function on a ring R is a map v : R —> Z U {oo} such that for all x, y, z in R we have: (i) v(xy) =v(x) +v(y),

(Q13) ASSOCIATED GRADED RINGS AND LEADING IDEALS

275

(ii) v(x + y) > mm(v(x),v(y)), and (iii) v(z) = oo & z = 0, with the usual understanding about oo. Thus in the notation of L3§11, a valuation function is like a valuation with R and Z playing the roles of the field K and the ordered abelian group G respectively. If v is a valuation function on a nonnull ring R then clearly R is a domain and we get a unique valuation v : QF(7?) —> Z U {00} such that for all x, y in Rx we have v(x/y) = v(x) — v(y); we say that the valuation v is induced by v; usually we denote v also by v. Considering the weaker property: (i*) v(xy) > v(x) + v(y), we see that ord(^j) satisfies (i*) and (ii); it satisfies (iii) iff R is 7-Hausdorff. We shall investigate this matter further in (T60) below and then again in the next Quest. Note that in case R is a local ring with 7 = M(R), (iii) is the Krull Intersection Theorem. LEMMA (T59). Let A = £ „ e N An be a naturally graded nonnull ring. Then A is a domain iff the product of any nonzero homogeneous elements in A is nonzero, i.e., iff: 0 ^ fm G Am and 0 ^ gn G An with m, n in N =$• fmgn ^ 0. PROOF. Given 0 ^ / = £ m e N / m e A and 0 ^ g = £ n e N S n G 4 with fm G j4 m and g„ G A„, let m and n be minimal with fm ^ 0 ^ gn- Then and / # = /mffn -I- Em+rKqeN ^9 w i t n ^ 9 e A> a n d hence

THEOREM (T60). Let 7 be an ideal in a ring ii. Then we have the following. (T60.1) 7 ^ R and o r d ^ / ) is a valuation function on R iff R is 7-Hausdorff and grad(7?, 7) is a domain. (T60.2) Upon letting R* = R/J* and 7* = 7/J* with J* = n n € N 7 n , we have that R* is 7*-Hausdorff, and if grad(7i, 7) is a domain then R* is a domain and ord( f l . ] / .) is a valuation function on R*. (T60.3) Let : R —> R' be a ring epimorphism, with J = ker(<j>), such that the leading ideal grad( f l / )(J) is a (homogeneous) prime ideal in grad(7Z,7). Then n„GN(7 + 7") is a prime ideal in R. PROOF. (T60.1) follows from (T59). To prove (T60.2) first note that 7?* is obviously 7*-Hausdorff, and then let * : R —* R* be the canonical epimorphism and assume that grad(i?, 7) is a domain. Now clearly ker(0*) = J* and 7* = (j>*{I) with grad( Ri/ ) (*) (J*) = 0, and hence grad(7i*, 7*) is a domain, because grad(iu)(*)(ker(*)) is the kernel of the induced epimorphism of grad(7?, 7) onto grad(7?*,7*). Thus we are reduced to (T60.1). To prove (T60.3), upon letting 7' = 4>(I), we see that grad(7?',7') is a domain, because grad(#j)(J) is the kernel of the induced epimorphism of grad(i?, 7) onto grad(i?',7'). Therefore by (T60.2) we see that R'/ n n eN 7 / n is a domain, and therefore nn€N7'™ is a prime ideal in R'. Clearly Dn&n(J + In) = ^^(rWN-f'"),

276

L5: PROJECTIVE VARIETIES §5: MORE ABOUT IDEALS AND MODULES

and hence C\ne^{J + In) is a prime ideal in R. THEOREM (T61). Let R be a local ring with dim(R) = d and emdim(i?) = e, let x = (xi,... ,xm) be a finite number of generators of M(R), let X = (X\,... ,Xm) be indeterminates over the field k = R/M(R), and recall that 9fj : k[X] —> grad(i?) is a graded ring epimorphism where k[X] = ^2n€nH^}n is the naturally graded polynomial ring with k[X]n = the set of all homogeneous polynomials of degree n (including the zero polynomial). Then we have the following. (T61.1) k e r ( 9 | ) n k{X}x = 0 &• m = e. (T61.2) k e r ( 9 | ) = 0 <S> m = d. (T61.3) If m = e then: R is regular «• k e r ( 9 | ) = 0. (T61.4) R is regular O there is a permutation a of { 1 , . . . ,m) together with elements a^ in k such that the ideal ker(9fj) in k[X] is generated by the m — e elements Xa^ - J2i<j<e aijXo{j) for e + 1 (r/In+1) be the natural surjection, and n n+1 let fin : (I /I ) —> grad(ii,/) be the natural injection. Also let yi = /ii(Ai(a:;)) for 1 all the a; are zero <=> all the bi belong to / . Now QR(Z) = H\(Xi(b\Xi + • • • + bmxm)). Since Hi is injective, we see that: GxR(z) = 0 o Ai(6in + • • • + bmxm) = 0. By the Nakayama Lemma (T3) we know that: m > e <^> Xi(b\Xi -\ 1-bmxm) = 0 for some elements b\,..., bm in R at least one of which does not belong to / . This completes the proof of (T61.1). Now clearly m = d o h t ^ / = m, and hence (T61.2) follows from the Relative Independence theorem (T34). By definition R is regular iff e = d, and hence (T61.2) implies (T61.3). So it only remains to prove (T61.4). First assume that R is regular. Then e = d, and by the Nakayama Lemma (T3) we can find a permutation a of { 1 , . . . , m) together with elements bij in R such that x a(i) = Xa<j< e bijx k[X'} be the unique fc-epimorphism of graded rings which sends Xa^ to Xa(i) or J2i<j<e aijXc,(j) according as 1 < i < e or e + 1 < i < m. Then clearly 0% = 9 | A, and hence k e r ( 9 | ) = ker(A) = H. Conversely assume that there exists a permutation a of { 1 , . . . , m} together with elements a»j in k such that the ideal ker(9^) in k[X] is generated by the m - e

(QU) COMPLETELY NORMAL DOMAINS

277

elements Xa{i) -

^2

cnjXa{j)

for

e + 1 < i < m.

l<j<e

It follows that x' = (a;CT(i),... ,xa(e)) are generators of / modulo I2 and hence by Nakayama they are generators of / . Now with k[X'],H, A as above we have ©ft — ©fiA w i t n ker(A) = H. Consequently ker(0^') = 0 and hence R is regular by (T61.3). COROLLARY (T62). If it is a ci-dimensional regular local ring then grad(-R) is isomorphic t o a d variable polynomial ring over the field R/M(R), and hence in particular grad(il) is a noetherian normal domain. PROOF. Follows from (T61.3). COMMENT (C19). [Graded Rings of Polynomial Rings]. Let X = {Xt)ieL be a family of variables over a field k, consider the polynomial ring R = k[X], and let / be the maximal ideal in R generated by X. Then we may "identify" k with R/I as well as with grad(i?, J) 0 . Now upon letting Y = (Yi)ieL

with

y , = g r a d ^ i 7 ) ( X , ) = lefo (fl>/) (X,)

we see that grad(i?, / ) = the polynomial ring k[Y] and

©ffl,/) = HX] ^ k[Y] is the /c-isomorphism which sends Xi to Y; for all / s L. QUEST (Q14) Completely Normal Domains Having proved L4§8.3(T24.1) as the Krull Intersection Theorem (T7), in (T64) we shall prove L4§8.3(T24.2) which says that the order function of a regular local ring is a valuation and hence the said ring is a normal domain. For dealing with the normality part of this we introduce the concept of complete normality and prove a lemma about it in (T63). A domain R is said to be completely normal if: x e QF(.R) with dxn £ R for some d e Rx and all n € N => x G R. COMPLETE NORMALITY LEMMA (T63). For any given ring R we have the following.

278

L5: PROJECTIVE VARIETIES §5: MORE ABOUT IDEALS AND MODULES

(T63.1) If R is a completely normal domain then R is a normal domain. If R is a noetherian normal domain then R is a completely normal domain. (T63.2) If I is an ideal in R such that for all z G R we have n „ e N ( z i ? + / n ) = zR, and grad(i?, I) is a completely normal domain, then R is a completely normal domain. (T63.3) For any ideal I in R we have that: I ^ R iff grad(i?, / ) is nonnull. (T63.4) If R is noetherian and I is any ideal in R then grad(.R, I) is noetherian. (T63.5) If R is noetherian and has an ideal I with 7 c jrad(i?) such that grad(.R, I) is a normal domain, then R is a completely normal domain. PROOF. To prove (T63.1), first assume that R is a completely normal domain, let x € QF(i?) be integral over R, and let

xm+ Yl aixm-i = 0 l
with m G N + and en G R be an equation of integral dependence for x. Write x = y/z

with

y GR

and

z £ Rx.

Let d=2m. Then clearly (ieflx

with

dxn € R

for

0 < n < TO.

By induction on n we shall show that for all m < n G N we have dxn G R. For n = mwe have already said this. So let n > m and assume for all smaller values of n. Multiplying the equation of integral dependence by dxn~m we get the equation

dxn = -

J2 o-idx71'1 \
whose RHS belongs to R by the induction hypothesis and hence so does the LHS. Therefore by complete normality we get x & R. Thus R is normal. Now assume that R is a normal noetherian domain, and let x G QF(i?) be such that dxn G R

for some

d £ Rx and all

n G N.

Then R[x] is contained in the finitely (by a single element) generated ^-module Rd_1, and hence (because of the noetherianness of R) by L4§5(025)(27*) we conclude that R[x] is a finitely generated i?-module. Consequently by L4§10(E2.2) we

(Q14) COMPLETELY

NORMAL

DOMAINS

279

see that x is integral over R, and hence (by the normality of R) we get x € R. Thus R is a completely normal domain. To prove (T63.2), let I be an ideal in R such that for every element z in R we have DneN(zR + In) — zR, and grad(f?, I) is a completely normal domain. By taking z = 0 we see that f]ne^In = 0, and hence by (T60.2) we see that R is a domain and o r d ^ j ) is a valuation function on R. To show that R is completely normal, let there be given any x G QF(R) such that dxn G R for some d G R* and all n G N. Write x = y/z

with

yeR

and

z e

Rx.

We want to show that x £ R, i.e., ?/ G zi?. By hypothesis nmen{zR

+ Im) = zR

and hence we are reduced to showing that y£zR

+ Im

for all

m G N.

We do this by induction. Since it is trivial for m = 0, let m > 0 and assume for m - 1 . Then y = az + 6

with

a G i?

and

6e/m_1.

Since cten G .R for all n G N, we get d(x - a)n G R for all n G N. Consequently, since x = y/z = a + (b/z), we get dbn G znR

for all

n G N.

Thus for all n G N we can write dbn = cn2™

with

c„ G i?.

Since o r d ^ j ) is a valuation function on R, taking leading forms relative to (R, I) preserves products, and hence for all n G N we have lefo(fl,/)(d) • (lefo (Ri/) (b))" = lefo (fli/) (c„) • (Mo{RJ){z))n Since grad(i?, I) is completely normal, this implies that lefo(fl i /)(6)/lefo (fli/) (z) G gr&d(R,I) and hence lefb(fl,/)(6) =lefo(fl,/)(c) • lefo (fli /)(z)

with

c G .R

.

280

L5: PROJECTIVE

VARIETIES §5: MORE ABOUT IDEALS AND MODULES

and therefore (because b e 7 m _ 1 ) by the definition of multiplication in grad(i?, J) we get

b-cz£lm. Thus y-{a

+ c)ze

Im

and hence y£zR

+ Im.

Part (T63.3) is obvious. Part (T63.4) follows from L4§5(027)(36*). In view of (T20) and (T60), by (T63.1) to (T63.4) we get (T63.5). ORD VALUATION LEMMA (T64). Let R be a regular local ring. Then for all nonzero elements x and y in R we have ovdR(xy) — ordRX + ordRy. Consequently R is a normal domain and we get a valuation oidR : QF(R) —> Z of QF(R) by putting ordR(x/y) = ordfl:c - ordRy. [We shall continue to use this extended meaning of ord#]. PROOF. Follows from (T62) and (T63). QUEST (Q15) Regular Sequences and Cohen-Macaulay Rings Let R be a ring. Let x = (a;i,..., xn) be a sequence of elements in R of length n S N, and for 0 < m < n let Im be the ideal in R generated by ( x i , . . . , xm). Let V be a module over R. Recall that SR(V) = R \ ZR(V) where ZR(V) is the set of all zerodivisors of V, i.e., elements r £ R with rv = 0 for some 0 ^ v G V. We say that £ is a F-regular sequence, if InV ^ V and for 1 < m < n we have xm £ SR(V/(Im-iV)). Moreover if J is an ideal in R with In c J then we say that £ is a ^-regular sequence in J. Furthermore if for every z € J ("1 ^ ( ^ / ( J n V ) ) we have J n y + zV = V then we say that £ is a maximal ^-regular sequence in J. By taking V — R we get the notions of i?-regular and maximal i?-regular sequences; note that now the condition xm £ SR(R/(Im-iR)) is equivalent to the condition m(xm) G S^m{R){(j)m{R)) where m : R —> R/Im-\ is the residue class epimorphism. We define the (R, J)-regularity of V by putting veg/R n V — the maximum length of a ^-regular sequence in J and we note that: this is in N U {—oo,oo}; it is —oo means V = 0; it is 0 means V ^ O and for every z G J n SR(V) we have zV = V; and it is oo means V =£ 0 and there is no bound on the said length. If R is local with J = M(R) then we call this

(Q15) REGULAR SEQUENCES AND COHEN-MACAULAY RINGS

281

the ^-regularity of V and denote it by reg#V, i.e., reg fl V = the maximum length of a V-regular sequence in M(R) and in this case we write reg(i?) in place of regfl.R, i.e., reg(i?) = the maximum length of an ii-regular sequence in M(R) and we call it the regularity of R; by (T27) this is a nonnegative integer < dim(i?). By parts (T66.4) and (T66.5) of Lemma (T66) below it follows that if R is noetherian, V finitely generated, and J is an ideal in R with JV ^ V, then there exist maximal ^-regular sequences in J and they all have the same length which equals reg(#,j)V. In particular if R is local then there exist maximal R-regular sequences in M(R) and the integer reg(-R) equals the length of every such sequence. The Generalized Principal Ideal Theorem (T27) inspires the following definitions of generating numbers, complete intersections, and CM (= Cohen-Macaulay) modules and rings. We define the generating number of V in R by putting gnb fl V = the smallest number of generators of V and we note that this is oo iff V is not finitely generated, and it is 0 iff V = 0. In particular this defines the generating number gnb#J of any ideal / in R by putting gnb fi 7 = the smallest number of generators of / . An ideal / in R is said to be an ideal-theoretic complete intersection in R if / ^ R and h t # / = gnbRI. A finitely generated -R-module is free means it is isomorphic to Rd for some d £ N , i.e., equivalently, it has a basis yi,. • • ,yd- In case R is local and V is finitely generated, V is said to be a CM module if reg^V = dim(i?/(0 : V)R). A local ring R is said to be a CM ring if it is a CM module over itself, i.e., if reg(i?) = dim(i?). Note that by (T27), for any local ring R we have the inequality reg(iZ) < dim(i?), and so the CM property says that equality holds; this may be compared with the fact that for any local ring R we have emdim(i?) > dim(i?) and R is regular means equality holds. A noetherian ring R is said to be a CM ring if, for every maximal ideal P in R, the localization Rp is a CM ring. An ideal / in a noetherian ring R is said to be unmixed if for every prime P in &SSR(R/I) we have h t ^ P = ht/j/, i.e., equivalently, if / has no embedded primes and all its minimal primes have the same height. The unmixedness theorem is said to hold in a noetherian ring R if every ideal in R, which can be generated by as many elements as its height, is unmixed; note that by (T27) this is equivalent to saying that if I is any ideal in R, which can be generated by e elements where the nonnegative integer e equals htjil, then / has no embedded primes. Amongst the various results about the above concepts which we shall now prove, in (T71) we shall prove the third part L4§8(T24.3) of the Ord Valuation Theorem.

282

L5: PROJECTIVE

VARIETIES

§5: MORE ABOUT

IDEALS

AND

MODULES

Before that, in (T67) and (T68) we shall relate regularity with the independence of (Q8) and the graded rings of (Q13). We start off with Lemmas (T65) and (T66). Out of this, the very versatile Lemma (T65) is of general interest. LOCALIZATION LEMMA (T65). For any ring R we have the following. (T65.1) If R is a domain then for any ideal I in R we have

n

(IRP)=i

PGmspec(H)

and in particular we have

f)

RP = R.

Pemspec(fl)

In both the above equations the intersection is taken in QF(i?). (T65.2) Without assuming R to be a domain, for any ideal / in R we have {r GR: cj>p{r) G (j>P{I)RP for all P G mspec(JR)} = / where Rp is the canonical homomorphism, i.e., equivalently we have

n

v-p]*=i

PGmspec(-R)

where the intersection is taken in R. PROOF. In the second equation of (T65.1), clearly RHS C LHS; to prove the other inclusion, given any x G LHS, upon letting

J = {z e Rx : xz e R] u {0} we see that J is an ideal in R with J $£ P for all P G mspec(R), and hence J = R by L2§5(R5), and therefore 1 € J, i.e., x G R. This proves the second equation of (T65.1), and hence the first equation of (T65.1) is subsumed under the first equation of (T65.2). In view of L4§7.1(T17)(b), the two equations of (T65.2) are equivalent to each other. In the second equation of (T65.2), clearly RHS C LHS; to prove the other inclusion, given any x G LHS, upon letting J = {z € Rx : xz G / } U {0} we see that J is an ideal in R with J <jL P for all P G mspec(i?), and hence J = R by L2§5(R5), and therefore 1 G J, i.e., x G I. REGULAR SEQUENCE LEMMA (T66). Let V be a module over a ring R, let x = (x\,... ,xn) be a ^-regular sequence in R of length n G N, let Im be the ideal generated by x\,... ,xm for 0 < m < n, and let I = In. Then we have the following.

(Q15) REGULAR SEQUENCES AND COHEN-MACAULAYRINGS

283

(T66.1) If j / i , . . . , yn are any elements in V with x\y\ + • • • + xnyn = 0 then yi e IV for 1 0 and xn € Sfi(Vy(J m V)) for 0 < m < n then the sequence (xn,xi,...,xn-i) is V-regular. (T66.4) If R is noetherian and J is an ideal in R with J V ^ V such that I C J, then there exists an integer n' > n together with elements xn+\, • • • ,xn' in J such that (x\,..., xni) is a maximal V-regular sequence in J. (T66.5) If R is noetherian, V is finitely generated, J is an ideal in R with JV =£ V such that I C J, and there exists a maximal F-regular sequence in J of length n, then £ is a maximal V-regular sequence in J. [If R is noetherian and J is an ideal in R with J V ^ V, then by taking n = 0 in (T66.4) we see the existence of maximal V-regular sequences in J, and moreover if V is finitely generated then by (T66.5) it follows that all maximal V-regular sequences in J have the same length and this common integer equals r e g ^ j j V ] . (T66.6) If R is noetherian, V is finitely generated, and J is an ideal in R with J V ^ V such that I C J, then reg (fl ,j) V = n + reg ( i J i J ) (V/(/V)). (T66.7) If i? is noetherian, V is finitely generated, and J is an ideal in R with J V 7^ V such that I C J, then upon letting n' = reg/ fl nV and v = gnb f l J we have v > n' >n and J has generators £ i , . . . , £„ such that (£i > • • • i fn') is a maximal V-regular sequence in J. (T66.8) Without assuming the sequence x to be V-regular, but assuming R to be noetherian and V to be finitely generated, if positive integer m < n is such that J m _i C ZR(V) and J m $! ZR(V), then for some elements r\,..., r m _ i in .R we have r i z i + • • • + r m _i:r m _i + xm e SR{V). (T66.9) Without assigning a V-regular sequence x in R, if R is noetherian, J is an ideal in R generated by v generators with i / £ N , and V is finitely generated with J V ^ V and reg( fi j)V = /x, then i/ > \x and J has generators f i , . . . ,&, such that ( £ i , . . . ,£M) is a V-regular sequence. PROOF. To prove (T66.1) we make induction on n. For n = 0 there is nothing to show. So let n > 0 and assume for n— 1. Let yi,..., yn in V with £i?/iH \-xnyn = 0. Since z n is a nonzerodivisor for V/(a;i,..., z„_i)V, we get yn = £ i < i < „ _ i xtZi with Zi e V. This gives X)i 0, and we may also assume d\ = d € N + with di = • • • = dn = 1 because then the sequence (x2,. -. ,xn) will be (V/(:rfV))-regular and we can repeat the argument. Now we

284

L5: PROJECTIVE VARIETIES §5: MORE ABOUT IDEALS AND MODULES

make induction on d. For d = 1 we have nothing to show. So let d > 1 and assume that the sequence (x(~1,x2,... ,xn) is V-regular. Clearly xf is V-regular. So let 1 < i < n and assume that (xf,X2, • • •, aJi-i) is V-regular. Let there be given any elements s,ti,...,U-i in V such that XiS = Xit\+x2t2-\ \-Xi-\U-\. Then by the induction hypothesis we have s = xf _ 1 ui 4- x2u2 H h Xi-iUi-i with u\,..., u*_i in V. Therefore X*~l{x\t\

-XiUi)

+ X2{t2 -XiU2)

-\

\-Xi-i(ti-i

-XiUi-i)

= 0

and hence by (T66.1) we get Xiti

-XiUi

G

(xf~1,X2,...,Xi-i)V.

It follows that XiU\ G (a;i,..., Xi-\)V and hence u\ G (xi,..., Xi-i)V and therefore s G (xf, x2,..., a:j_i)V. This completes the induction on i and proves (T66.2). To prove (T66.3) suppose n > 0 and xn G 5 f l ( y / ( / m y ) ) for 0 < m < n. We make induction on n. The case of n = 1 being obvious, let n > 1 and assume for n — 1. Applying the induction hypothesis to the V/(/i ^-regular sequence (x2,..., xn) we see that the sequence (x\,xn, x2,..., xn-i) is ^-regular. Suppose if possible that x\ € Z]i(V/(xnV)). However 'x! G

ZR(V/(xnV))

=>• x\y = xnz for some y,z in V with y ^ xnV => z = x\w for some w &V (because xn G Si{(Vy(:riV))) => xi(y - xnw) = 0 =>• y — xnw = 0 (because xi G S/{(V)) => contradiction to y g xnV. Therefore x\ G SR.(V/(xnV)), and hence (xn,xi,... ,xn-i) is V-regular. To prove (T66.4) and (T66.5) assume that R is noetherian and let J be an ideal in R with JV 7^ V such that I c J. If £ n +i, xn+2,... are any elements in J such that the sequence (x\,... ,xn+i) is ^-regular for i = 1,2,... then, upon letting In+i = (^l, • • •, xn+i)R we have that In C In+i C / n +2 C . . . are ideals in R with In 7^ -^n+i 7^ ^n+2 7^ • • • and hence by the noetherianness of R this must stop at some integer n' > n. This proves (T66.4). To prove (T66.5) also assume that V is finitely generated and there exists a maximal V-regular sequence x' = (x[,... ,x'n) in J of length n. By induction on n we shall show that then £ is a maximal V-regular sequence in J. This being obvious for n = 0, for a moment suppose that n = 1. Then J C ZR(V/(X'1V)) and hence, by (Q1)(T1), J C UP where the union is taken over the finite nonempty subset tassRiV/ix^V)) of spec(P) and therefore, by L4§5(024)(22'), J c P for some P G tassR(V/(x'1V)). Consequently, there exists t G V \ (x[V) such that for every j G J we have jt = x[j* for some j * G V. In particular xi* = x[u for some « e 7 ; since t g x[V and zi G SR(V), we must have u ^ i i 7 . Multiplying the

(Q15) REGULAR SEQUENCES AND COHEN-MACAULAY RINGS

285

equation x\t = x[u by j we get x\ jt = x[ju, and multiplying the equation jt = x[j* by x\ we get xijt = x'xx\j*\ equating the two RHSs we obtain x[ju = x[xij* and hence x[(ju — x\j*) = 0; since x\ € SR(V), this yields ju = x\j* and therefore j £ ZR(V/(XIV)). Thus J C ZR(V/(XIV)), and hence x = (x\) is a maximal V-regular sequence in J. Now let n > 1 and assume for n — 1. For 0 < m < n, upon letting I^ = (x'1,..., x'JR, we have J (jL ZR(V/(ImV)) and J <jL ZR{V/(I'mV)). Hence by (Q1)(T1) and L4§5(024)(22') we can find < £ J such that for 0 < m < n we have < £ Z f l (V/(I m V)) and x*n # ZR(V/(I'mV)). It follows that (xu... ,xn^,x*n) are and (x^,... ,x'n_i,Xn) V-regular sequences in J. By (T66.3) we see that (x„,xi,... ,xn-i) and ( a ; * , ^ , . . . , a ^ - i ) a r e ^-regular sequences in J; the second is maximal by applying the n = 1 case to the (y/(/^_ 1 V))-regular sequences x'n and x*n. Consequently (x\,..., xn-i) and (x[,..., x'n_1) are {V/{x*ny))-regular sequences in J; the maximality of the second follows by the maximality of the Irregular sequence (a;* ,x[,... ,x'n_x) and hence the maximality of the first follows by the induction hypothesis. This in turn yields the maximality of the V-regular sequence {x\,..., xn-i, x*n). From this, again by the n = 1 case, as applied to the (y/(/ n _iy))-regular sequences xn and x*n, we get the maximality of the V-regular sequence x. This completes the proof of (T66.5). In view of the bracketed remark following (T66.5), the proofs of (T66.6) and the inequality n' > n of (T66.7) follow from (T66.4). The rest of (T66.7) follows from (T66.9) which itself will be deduced from (T66.8). To prove (T66.8), without assuming the sequence x to be V-regular, but assuming R to be noetherian, V to be finitely generated, and m to be a positive integer with m < n such that Im-\ C ZR{V) and Im <jt ZR(V), we want to show that for some elements r\,..., rm-\ in R we have r\X\ + • • • + rm-ixm-i + xm £ SR(V). Now for some s\,..., sm in R we have sixi + • • • + smxm € SR(V). By (Q1)(T1) we know that ZR(V) = Pi U • • • U Pc where P\,...,PC are prime ideals in R with c £ N+ such that for all i =^ j we have Pi <£_ Pj (namely, these are the maximal members of ass^(V)). Since Im-l C ZR(V) and sixi -\

1- smxm

£ SR(V)

we must have xm & Pi for some i. Therefore by suitably labelling Pi,..., Pc we can arrange that xm g" Pt for 1 < i < b and xm £ Pj for b + 1 < j < c where b is an integer with 1 < b < c. By L4§5(024)(22'), for 1 tsixi -\

1- tem_!:rm_i + xm g Pi

286

L5: PROJECTIVE

VARIETIES §5: MORE ABOUT IDEALS AND MODULES

whereas it b + 1 s i i i H => tsia;i H _ => tsixi H

g ZR(V) = Pi U • • • U P c with xm G P,

h s m _ia; m _i 0 P* h is m _ia; m _i £ Pj h is m _i:r m _i + xm g Pi.

Thus isi^i H h t s m _ i x m _ i + x m ^ P* for 1 < i < c, and hence it suffices to take 77 = tsi for 1 < Z < m — 1. To prove (T66.9), without assigning a ^-regular sequence x in P, assume that R is noetherian, J is an ideal in R generated by v generators 771,..., j]v with v G N, and V is finitely generated with JV ^ V and reg( fl ^V = /x. We want to show that then v > fi and J has generators £ 1 , . . . , £„ such that ( £ 1 , . . . , £M) is a V-regular sequence. We make induction oni/. If v = 0 then /j = 0 and we have nothing to show. So let ^ > 0 and assume for v — 1. Let e be the largest nonnegative integer < 1/ such that (TJI, . . . ,r]e)R c ZR(V). If e = U then ju = 0 and again we have nothing to show. So assume that e < v. Then by (T66.8) we can find elements r\,..., re in R such that for £i = 7-1771 + • • • + rer]e + 7?e+i we have £1 e SR(V). Clearly J is generated by £1,772,.-., rje, r}e+2, • • •, f]v Let

^

Yl

VQ,...,VJ_\.

If

TiWlf* + "i-iPY

for

a11

r0, • • •, TJ-I in R

0
then it suffices to take Uj = Vj-i- So now assume that (2*)

(f>0(a + Vj-if3y =

]P

r (

i i'^(a + Vj-iPY

for

some r

o , • • •, J-j-1 in R.

0
We claim that [if i?-homomorphisms <j>i,...,i from Rn to R are linearly independent I where I S N, then I < n and dim^ fli 0 and assume for l — 1; clearly l < n and we can choose a basis x\,... ,xn of Rn so that xi,... ,xn-i+i is a basis of riii{xj) = 0 for 1 < j < n — I + 1; then (j>i(xj) = 0 for 1 i ). Now by taking (M, N) = ( Z - l , / ) i n L4§10(E5) we can find ( 0 , . . . ,0) ^ ( p i , . . . ,pi) £ Rl such that J2ii{xj) = 0 for 1 < j < n; this contradicts the linear independence of i, and hence kei(4>i) ^) rii<jP(a + Pj-iP)l)o
are linearly independent and P « Rn,

dimflPi=n-j>0

(Q31) PROJECTIVE

MODULES

OVER POLYNOMIAL

RINGS

455

and (5*)

dim f l P 2 = n - j + l > 0 .

By (1*) and (4*) we also see that (6*)

Pi # 0.

We claim that

(7*)

(a + ^ K P i l c A .

To see this, given any p E Pi, we want to show that (a + Vj-i(3){p) G Pi; now ' ( a + i/j-i/JXpjGPi < ^ 4>/3(a + j/ j _i/?) i ((a + */,-_!/?)(?)) = 0 for 0 < i < j - 1 by definition of Pi <5 P(a + vj-iP)i(p)

= 0 for 1 < i < j

obviously;

for 1 < i < j — 1 the last equation holds because p € Pi and hence it holds also for i=jby(2*). Next we claim that (8*)

/3(Pi)^0.

To see this, note that (a + Vj-if3, /3) is admissible by (T117.1) and hence by taking a + Vj-i0 for a in (3) we get Pi C P i where Pi = Eo P such that (9*)

v'(Q)cP2

and

v'(/3(p)) $ Pi.

Now considering the P-homomorphism Uj = i/j-i + 1/ : Q —» P

and considering P-homomorphism $i = [i{a + UjPY :P^>R

for

0/3(a + Vj-iPy

for

0< i <j - 1

456

L5: PROJECTIVE VARIETIES §5: MORE ABOUT IDEALS AND MODULES

and then we shall show that (11*)

the elements ($i)o<j<j are linearly independent

and this will complete the proof. To prove (10*) by induction on i, the assertion being obvious for i = 0, let 0 0(a + Vj-i(5)l-x{a

+ Vjff)

= 0(a + i/j-iPY^ia

by induction hypothesis

+ Vj-ip + v'0) i 1

=
by definition of Vj by expanding

and hence it suffices to show that <j)0(a + Vj^i0)%~lv'0 = 0; but this follows by noting that by (9*) we have im(i/) c Pi and by the definition of P 2 we have i ^ C k e r ^ f a + ^-i/?)*-1). To prove (11*) assume the contrary; then, because of (10*) and because (as said in the beginning of the proof) the elements {(j>0(a + i/j_i/3)*) 0 < i <j_ 1 are linearly independent, we can find (si,..., Sj) G Rj with Sj ^ 0 such that ^ 0 j = 0(a + Vj-iPy-^a j l

= <j>0{a + Vj-if3) - {a j

+ Vj0)

by (10*)

+ Vj-10 + v'0)

by definition of Vj

j l

= 4>0{a + i>j-i0) + 4>0{a + vj-i0) - v'0

by expanding

and hence, because $j(p) = 0, by evaluating the last line at p we get 4>0{a + Vj-i0)j(p)

+ <j>0(a + i/,-_i/3)'-V/?(p) = 0;

since p G Pi, by (2*) and the definition of Pi we have 0(a + Vj-i0)j(p) hence by the above display we get 0(a + vj-il3)i-1i/p(p)

= 0, and

= O;

now by (9*) we have v'0{p) G Pi \ Pi, and hence by the definitions of Pi and Pi we get 0(a + Vj-i0y~1v'0(p) ^ 0, contradicting the above display. Thus (11*) is established and the proof of (T117.4) is completed. PROOF OF (T117.5). Let us start by proving our assertion in the case of constant gnb, i.e., when there exists n E N such that for every J G mspec(P) we have gnbij^Pj = n. If P = 0 then our assertion is obvious. So assume that P ^ 0. Then by item (12') at the end of the proof of (T116.3) we must have n G N+. Let * be the set of all P-homomorphisms i/> : Rn —> P, and let $ be the set of all P-homomorphisms cfi : Q —» R. Also let N be the set of all P-homomorphisms

(Q31) PROJECTIVE MODULES OVER POLYNOMIAL RINGS

457

v : Q —> P, for every u e TV and <> / € $ consider the diagonal direct sum

and let I be the ideal in R obtained by putting

7=

Y^

det{0„,4,il))R.

By (T117.1) to (T117.3) we see that 7 c IR(a,(3). In a moment we shall show that I = R which will imply that IR((X, /3) = R and hence by taking s = 0 in (2) we will get our desired assertion. Suppose if possible that I ^ R. Then, because I is an ideal in R, by Zorn's Lemma I C J for some J £ mspec(i?). Let h : R —> R = R/J be the residue class epimorphism. Then P = P/(JP) and Q = Q/(JQ) are finite dimensional vector spaces over the field R, and the vector space dimension of P is n. Moreover, given any .R-homomorphism F : U —> V between any i?-modules U and V, it "induces" an i?-homomorphism F : U —> V between the Pi-modules U = U/(JU) and V = V/(JV). In this context we identify Rn with R ; then i\),<\),v as above induce .R-homomorphisms tp : R —> P , cf> ; Q —* R, V : Q —> P respectively. Also a and /? induce -R-homomorphisms a: P —> P and /?: P —> Q respectively, and we have the diagonal direct sum a©/3:P^P©Q. Clearly the admissibility of the pair (a, /3) implies the admissibility of the pair (a, (3). Also clearly (12*)

/i(det(0„,^)) = det(0 F i ? ^)

for the diagonal direct sum

We claim that: if U is a projective P-module (13*)

then for every i?-homomorphism F : U —> V we have F = F for some fl-homomorphism F : (7 —> V.

To see this, in the description of the "lifting property" at the beginning of this Quest (Q31), take a to be the residue class epimorphism V —» V, and take (3 to be the composition U —* U —> V where the first arrow is the residue class epimorphism and the second arrow is the given map F ; now it suffices to let F be the lifting homomorphism 7. We also claim that: (14*)

for some <j> e $ we have <j> 0 ^ 0.

458

L5: PROJECTIVE VARIETIES §5: MORE ABOUT IDEALS AND MODULES

To see this, note that n > 0 = ^ P ^ 0 and hence by the admissibility of (a,/3), in view of (3) we get (3 ^ 0. Therefore, since Q is a vector space over the field R, we can find an i?-homomorphism <j> : Q —> R with <j) P ¥" 0. Since Q is a projective .R-module, by (13*) we have (j) = (f> iov some (/!>€$. Now clearly

R = A/(XZ, YZ)A be the residue class epimorphism, and let / = (cj)(Y),(j)(X — Z))R. Clearly 4>{Y)(f>(Z) = 0 with <j)(Z) -£ 0; consequently (Y), 4>(X - Z) is not fl-regular. Let J=(Y,X-Z, XZ, YZ)A. Then J = -1{1). Also Z2 = XZ - (X - Z)Z G J and hence Z G rad^ J, and therefore M(A) = (X, Y, Z)A C TSAAJ. Consequently rad^J = M(R) and hence rad/?/ = M(R). Therefore ntRI = 2. Let G = G(X, Y,Z) G A be such that for g = 4>(G) we have htR(gR) = 1. By (E4), 4>(ZA) and ((X, Y)A) are exactly all the height zero primes in R and hence, in view of Krull's Principal Ideal Theorem §5(Q7)(T24), the equation htR(gR) = 1 is equivalent to the condition G $ (ZA) U (X, Y)A. Clearly G $ M(A) and hence upon letting S = G(X,Y,0) G B = k[[X,Y\] and T = G(0,0,Z) G C = k[[Z)\ we get G = S + T + UXZ + VYZ with S G M(B), T G M(C), U G A, V G A, Let H = (G,XZ,YZ)A. Then <j>~l(gR) = H. We shall show that: (1) TM(A) c H and (2) T £ H. From this it will follow that (gR : 4>(T))R = M(R) and hence M(R) G assR(R/xR) and therefore gR is not unmixed; consequently R does not

590

LECTURE

L5: PROJECTIVE

VARIETIES

satisfy condition (Sn) and hence it does not satisfy condition {Sn). To prove (1) it suffices to note that TX,TY belong to H because T £ M(C) and ZX, ZY belong to H, and moreover TZ belongs to H because S £ M(B) and (•)

T = G-S-

UXZ -

VYZ.

To prove (2), suppose if possible that T £ H. Then by (•) we get S £ H and hence (••)

G{X, Y, 0) = U(X, Y, Z)XZ + V(X, Y, Z)YZ + W(X, Y, Z)G(X, Y, Z)

with U(X, Y, Z), V(X, Y, Z)jW{X, Y, Z) in A. By putting Z = 0 in (••) and remembering that G <£ ZA we get W(X, Y, 0) = 1. Therefore W(X, Y, Z) is a unit in A and hence, remembering that G(X, Y, 0) £ M(B), by (••) we get G(X, Y, Z) £ (X, Y)A which is a contradiction. Remark: By §5(Q1) and §5(Q7)(T24), SR{R) DM(R) = {g £ R : htR(gR) = 1} and for any such g we have that: the jR-regular sequence g cannot be extended to a longer .R-regular sequence iff M(R) £ assR(R/gR). By §5(Q15)(T66) we also know that any two maximal .R-regular sequences have the same length. Therefore if g is nonunmixed for some g £ R with htR(gR) = 1 then g is nonunmixed for every g £ R with htR(gR) = 1. EXERCISE (E24). [Reduced Normality Quasicondition]. In §5(Q19)(T87) we showed that Condition (T) implies "Quasicondition" (T*) where these conditions on a noetherian ring R say the following: (T). R is a reduced normal ring. (T*). For every P £ nspec(i?) the ring R/P is a normal domain. By an example show that (T*) does not imply (T). Hint: Let R be the two dimensional reduced local ring as above. Upon letting Pi = ZA and P2 = (X,Y)A we have nspec(ii) = {4>{Pl),4>{P2)} and R^Pi) w APi for 1 < i < 2. Now APi is a regular local domain by §5(Q7)(T30.4) and §5(Q17)(81.7), and hence it is normal by §5(Q14)(T64). Thus R satisfies (T*). Note that the reference to §5(Q7)(T30.4) and §5(Q17)(81.7) for the regularity of APi may be replaced by a reference to §5(Q17)(C32) and §5(Q19)(C33). To show that R does not satisfy (T), in view of §5(Q19)(T86) and §5(Q19(T88), it suffices to show that R does not satisfy condition (S'2) of §5(Q19)(T86) which in our case says that: for every g £ R with htR(gR) = 1, the ideal (gR) is unmixed. So we are done by (E23). EXERCISE (E25). [Serre Quasicondition]. In §5(Q19)(T86) we showed that Serre Condition (5„) implies "Serre Quasicondition" (£*), where for a noetherian ring R and nonnegative integer n these conditions say the following: (S„) P £ spec(.R) with ieg(RP) < n => RP is CM. (5*). P £ spec{R) with htRP < n => RP is CM.

§6: DEFINITIONS AND EXERCISES

591

By an example show that (5*) does not imply (Sn). Hint: Modify the example of (E23) and (E24) by taking A to be a four variable power series ring, i.e., let 0 : A = k[[W,X,Y,Z]j -* R = A/(WZ,XZ,YZ)A be the residue class epimorphism, and let P\ = ZA and Po, = (W, X, Y)A. Then as in (E23) and (E24), R is a three dimensional reduced local ring with nspec(-R) = {(fi(Pi),
592

LECTURE L5: PROJECTIVE

VARIETIES

around display (1) of the preamble of §5(Q31)(C49), prove the following criterion. For any split i?-monomorphisms a : P —> Q and a' : P —> Q ' where P and Q are any modules over any ring R, we have that: a ~ a' <=> there exists an i?-isomorphism Q/a(P) —>

Q/a'(P).

EXERCISE (E28). [Limitations on Normalization Theorem]. Referring to the preamble of §5(Q33) for the definitions of restricted and strongly restricted domains, let us note the Finite Module Theorem §5(Q33)(T145) which says that every field is strongly restricted. A part of the Normalization Theorem §5(Q10)(T46) says that: given an affine domain A = R[x\,... ,xn] over an infinite domain R, with n G N + and trdegRA = d, assuming (1) R to be a field, (2) there exist d elements j / i , . . . , yd in A such that A is integral over R[yi,. • • ,yd], and moreover (3) j / i , . . . , J/<J can be chosen to be .R-linear combinations of x\,... ,xn. In the "Generalized Normalization Theorem" on page 124 of volume II of Zariski-Samuel's book [ZaS] it is "claimed" that (2) and (3) are true without assuming (1), and on the next two pages 125-126 this is used for "proving" a stronger version of §5(Q33)(T145) which says that every restricted domain is strongly restricted. The exercise is to show that without assuming (1), even (2) is untrue. Hint: For any nonnegative integer d let X\,..., Xd be indeterminates over R — Z and let A = R[x\,..., xn] where n = d+1 with Xi = Xi for 1 < i < d and xn G Q\Z (for instance xn = 1/2). Let if possible j / i , . . . , yd be elements of A such that A is integral over B = R\y\,...,yd]Then j / i , . . . , j/a must be algebraically independent over Q, and hence B is a normal domain. Therefore by (E7) the minimal polynomial F(Y) of xn over QF(5) belongs to B[Y]. But clearly F(Y) = Y - xn. This is a contradiction. EXERCISE (E29). [Auxiliary Theorem]. Prove Theorem §5(Q33)(T148). Hint: To prove assertion (T148.3) about monomials, use Comment §5(Q15)(C20) about binomial coefficients. To prove assertion (T148.1) and (T148.2), let A be a subdomain of a field K, let K be a finite algebraic field extension of K, let E be a semimodel of K/A given by E = U / £ A 5 J ( B ( ) where ( B ; ) ; S A is a nonempty family of subdomains of K with quotient field K such that A c B | , and let E = \JieA%}(Bi) where Bi = the integral closure of Bi in a finite algebraic field extension K of K. For any / G A and P G spec(£;), upon letting S = Bt\P, by_(T36) and (T37) of §5(Q9) we see that (B;)s is the integral closure of ( S J ) P in K, and hence by localization theory L4§7 we get 9J((Sj)s) = Vl((Bi)p,K). Consequently for any / G A we have
§6: DEFINITIONS AND EXERCISES

593

The proof of assertion (T148.4) is straightforward. EXERCISE (E30). [Homogeneous Function Field]. In the second paragraph before the statement of Theorem (T149) of §5(Q34.3) which starts with something like "In case iproj^f/ is the homogeneous prime ideal P in the homogeneous ring A over k," we discussed the homogeneous function field k(U)* and the alternative homogeneous function field k(U)^ of a projective variety U. This was the projective version of the third paragraph before the statement of Theorem (T19) of L4§8 which starts with something like "In case ispec^f/ is the prime ideal P in the affine ring A over k" and where we discussed the function field k(U)* and the alternative function field &(£/)* of an affine variety U. In both situations we had an epimorphism tpu : Rk(U)* —» k(U)* of the local ring Rk(U)* of U over k which induced an isomorphism : k(U)^ —> k{U)*. Recall that in the affine case k(U)* is the quotient field of A/P and Rk(U)* is the localization Ap, whereas in the projective case k(U)* is the homogeneous quotient field of A/P and Rk(U)* is the homogeneous localization A[py, in both cases &({/)* = Rk(U)*/M(Rk{U)*). Show that in the affine case, for the composition 9 : A —> Rk(U)* —> /c(f/)" (of the natural maps) we have ker(0) = P and QF(im(#)) = &([/)", which gives a second definition of (f>; also show that this does not work in the projective case. Hint: See L4§7.1(T16). EXERCISE (E31). [Projective Theorems]. Prove Theorems (T149) to (T152) stated in §5(Q34.3). Hint: To prove (T149) and (T150), recall that P is a relevant homogeneous prime ideal in the N + 1 variable polynomial ring B = k[X\,... ,XM+I] over a field k, let D = B/P be the graded residue class epimorphism, and take x S D*. In view of (Q34.2)(4) and the paragraph before (T149), we have £(£>) = QF(D/x(D)) = QF(D/{x - 1)D). In view of §5(Q7)(T24) and the fact that both D and D/(x — 1)D are domains, we see that (a: — 1)D is a prime ideal of height 1. Consequently (T149) and (T150) follow from L4§8(T19) and L4§8(T20) respectively. Similarly (T151) and (T152) follow from L4§8(T21) and L4§8(T22) respectively. DEFINITION (D5). [Homogeneous Localization]. To put Theorems (T153) and (T154) of §5(Q34.6) in proper perspective, in the Hint to the next Exercise we shall generalize them by replacing the finite variable polynomial ring B over a field by any integrally graded ring A = X^ez^*- First let us generalize the concept of homogeneous localization discussed in §5(Q34.2) to the homogeneous localization A\s] of any such A at any homogeneous multiplicative set S in A, i.e., at any multiplicative set S in A with S C A^ = \Jie%Ai. We convert the localization As

594

LECTURE

L5: PROJECTIVE

into an integrally graded ring As = J2i£z(As)i (As)i = [ J { a / s : a G Ai+j jez

VARIETIES

by letting and s e S n A , } .

Note that the localization map 4> : A —* As is now a graded ring homomorphism. We put A[s\ = (4s)o- For P G proj(A) we may write A[P] = instead of A^Aoo\P] and call it the homogeneous localization of A at P. Recall that in the good case, i.e, when S C SA(A), we may regard As, and hence also A[s], to be a subring of QR(A). Thus in the situation of §5(Q34.2), by taking At = Di or 0 according as i > 0 or i < 0, the notation D^Pj agrees with the present notation A[P]. EXERCISE (E32). [Ideals And Homogeneous Ideals]. Prove the relations between ideals and homogeneous ideals stated in §5(Q34.6)(T153) and the resulting relations between affine and projective varieties stated in §5(Q34.6)(T154). Hint: To prove a generalization of §5(Q34.6)(T153) valid in the above situation of (D5), let T(A) = the set of all homogeneous ideals in A and T(A$) = the set of all homogeneous ideals in AsAlso let T^(A) = {J G T(A) : [J : S}A = J) and T{A\s\) = the set of all ideals in A\s\Now let a : T(A) -> T(AS) be the map given by J H-> ( J)AS. By L4§7(T12) we see that a is an inclusion preserving surjective map which commutes with the ideal theoretic operations of radicals, intersections, sums, finite products, and quotients = parenthetical colons. Moreover, if J = Di T(As) be the map given by J H-> J = a ( J ) . Then clearly a is a bijection whose inverse is given by J H-> J = -1(J). Moreover a sends prime (resp: primary, radical) ideals to prime (resp: primary, radical) ideals. Let P : T(As) —> T(A[s]) be the map given by J i-> J n A[s\- It is easy to check that P is an inclusion preserving surjective map which also commutes with

§6: DEFINITIONS AND EXERCISES

595

the above mentioned ideal theoretic operations. /3 also send prime (resp: primary, radical) ideals to prime (resp: primary, radical) ideals. Furthermore, if A\ n S ^ 0, or more generally if (As)i C\ U(As) ^ 0, then /? is injective; namely, for any u in [As)i f~l U(As) and any homogeneous ideal J in As we have J f l {As)i = P{J)ul for alH e Z and hence J = 5Z»ez P(J)ul ls uniquely determined by (3{J). In general 0 need not be injective. In the situation of (Q34.6) take Ai = Bi or 0 according as i > 0 or i < 0. We get a (nongraded) ring isomorphism \i : B —» A by sending every finite sum x H » 6 N i £ B with n £ i?i to the finite sum £ ] i 6 N i* e -<4; this induces a bijection /x* : T(B) -> T(i4) given by J M M ( J ) . Taking 5 = { 1 , X J V + 1 , X ^ + 1 , . . . } we see that /3 is a bijection. We also get a bijection /x* : T'(B) —> T^.(^4) given by J — i > /u(J). We have a unique fc-isomorphism of rings i/ : B —> A[5] which sends Xj to Xi/X^+i for 1 T(J4[5]) given by J n ^(-O- Turning to the objects 7 and <5 defined in (Q34.6), we get a map 7* : T{B) -> f (B) given by J H 7 ( J ) , and a map 5* : f(B) -> T'(B) given by J H 5(J). Clearly /3a/i* = i/*7*

and

S " 1 / ? - 1 ^ = /I*<5*

and hence §5(Q34.6)(T153) follows from the above three paragraphs. Moreover, §5(Q34.6)(T154) follows from §5(Q34.3)(T151) and §5(Q34.6)(T153). To see that /3 need not be injective, with A as in the above paragraph, take S = kx; then As may be identified with A, and OAs and XN+IAS are distinct homogeneous ideals in As whose 0 images are the same, namely the zero ideal. To see that the condition AiflS ^ 0 implies the condition (As)inU(As) ^ 0, note that u 6 Ai<~\S =>• u/l € (As)if\U(As)To see that the converse is not true, in the above paragraph, taking S = {l,Xjj+vX%+1,... } we get XN+1/1 £ (As)i f~l U(AS). EXERCISE (E33). [Blowup of a Line in Three Space]. Verify Theorem §5(Q35.6)(T159) when A is the trivariate power series ring k[[X, Y, Z]\ over a field k, and P is the prime ideal in it generated by X and Y. EXERCISE (E34). [Regular Local Rings]. Prove properties (1+) and (2*) of regular local rings stated in §5(Q35.9). Hint: Let R be an n-dimensional regular local domain with n € N. If M(R) = (xi,... ,xn) then upon letting P = (x\,... ,xm)R with 0 < m < n, by §5(Q15)(T69.4) and §5(Q15)(T69.7) we get dim(R/P) = n - m and hence by (Q7)(T27) and (Q14)(T64) we see that R/P is a regular local domain, P is a prime ideal in R of height m, and Rp is an m-dimensional regular local domain. Conversely, let P be a prime ideal in R such that R/P is an (n - m)-dimensional regular local domain for some integer m with 0 < m < n. Let (j> : R —> R' = R/P be the residue class epimorphism. Take a sequence x = (x\,... ,xn) of elements in R such that (xi,..., xn)R = M(R), let X = (X\,..., X„) be indeterminates

596

LECTURE L5: PROJECTIVE

VARIETIES

over the field k = R/M(R), and let 6 | : k[X] —> grad(i?) be the graded ring isomorphism described in §5(Q13)(T61). Let <j>(x) = (<j>(xi),... ,(xn)), let us "identify" 4>{R)/M(4>(R)) with k, and let e j j g : k[X] -> gra,d{)0^. In view of §5(Q13)(T61.4), upon replacing (xi,. •., xn) by suitable (homogeneous) Rlinear combinations of them (whose determinant is a unit in R) we can arrange matters so that the kernel of Q^fX is generated by Xx,... ,Xm. By §5(Q13)(2) we know that grad#() : grad(ii) —> gra,d((f>(R)) is a graded ring epimorphism whose kernel is ledifi(P). It follows that the leading ideal ledi#(P) is generated by grad^(a;i), • • • ,grad^(a; TO ). Therefore, again upon replacing (xi,... ,xn) by suitable (homogeneous) i?-linear combinations of them (whose determinant is a unit in R) we can arrange matters so that Q C P where Q = {x\,..., xm)R. By what we have proved above, Q is a prime ideal in R with dim(.R/Q) = n — m. Therefore we must have P — Q. This proves (1*). The proof of (2*) follows from the references given there in a straight forward manner. §7: NOTES NOTE (Nl). [Noncancellative Quasiordered Monoids]. As suggested in §2(C2), an ordered abelian group may be defined as an (additive) abelian group G with a linear order < on it such that: (i) i < i' and j < j ' in G =>• i + j < i' +j', or equivalently such that: (ii) i\ = i\,... ,ia = i'a,ji < j[, • • • ,jb < jf, in G with a £ N a n d 6 e N + =^ iiH \-ia+ji~\ hjb < i[-\ Ma+iiH ^fb- In defining an (additive) abelian monoid to be ordered we chose condition (ii), because it implies cancellativeness; what we get by replacing (ii) by (i) may be called a quasiordered abelian monoid. So we may ask whether every quasiordered abelian monoid is an ordered abelian monoid, i.e., whether the implication (i) =*> (ii), and hence the implication (i) => cancellative, is true also for an abelian monoid. That this is not so comes out of a general construction found in Bhaskara's ancient algebra book Beejganit [Bha] composed around Anno Domini 1150 in Ujjain which is the city in India where I was born. What Bhaskara is doing is to describe the properties of oo which he calls KHAHARA (in Sanskrit) and which is the entity you obtain when you divide an ordinary quantity by zero. So let there be given any abelian monoid G with a linear order < satisfying (i); for instance G could be an ordered abelian group such as Z or Q or R or even the null group consisting of the single element 0. Let us adjoin an extra element oo to G to get a quasiordered abelian monoid I = G U {oo} by declaring oo + oo = x + oo and x < oo for all x G G. The said declaration also shows that / is neither cancellative nor ordered. NOTE (N2). [Embedding Monoids Into Groups]. In the above Note (Nl), it is easily seen that for any abelian monoid G we have: (i) + cancellative «=> (ii),

§S: CONCLUDING NOTE

597

i.e., a quasiordered abelian monoid is cancellative iff it is ordered. By mimicking the construction of the quotient field of a domain, it can also be seen that an abelian monoid is cancellative iff it can be embedded in an abelian group as a submonoid. This may be added to an appropriate list of Exercises. §8: CONCLUDING N O T E In my Historical Ramblings paper [A02] and in my Engineering Book [A04], I have divided Algebra into the High School Algebra of Polynomials and Power Series, the College Algebra of Rings and Ideals, and the University Algebra of Categories and Functors. There (see page 424 of [A02]) I declared Krull to be the college algebraist par excellence, listed Chevalley and Cohen as the follow-up leaders in that field, called Nagata [Nag] "a fitting successor to Krull," and on the previous page (page 423) I listed Dedekind and Emmy Noether as the predecessors of Krull. Zariski is the only person (see page 1 of [A02]) I included in the intersection of High School Algebra and College Algebra.

Lecture L6: Pause and Refresh By reading the following summaries of the first five lectures, the rest of the book may become intelligible without studying the details of those lectures. In the first lecture we have introduced the basic structures of algebra such as groups, rings, fields, vector spaces, ideals, modules, polynomials, rational functions, euclidean domains, principal ideal domains, and unique factorization domains. In the second lecture, after introducing power series, meromorphic series, and valuations, we show the equivalence of well-ordering and Zorn's Lemma and use them to establish the existence of vector space basis, transcendence basis, algebraic closure, and maximal ideals. The third lecture deals with the power series theorems of Newton, Hensel, and Weierstrass. The fourth and fifth lectures deal with ideals, modules, varieties, and models which are the avatars of varieties full of local rings. §1: S U M M A R Y OF LECTURE LI O N Q U A D R A T I C EQUATIONS For sets (= collections of objects) S and T, a map <j>: S —> T is an assignment which to every x £ S, i.e., to every element (= object) x of S, assigns 4>{x) £ T; this may be written x i—• (fi(x); the element <j>{x) is called the image of x under ; we put dom(g!>) = S and ran() = T and call these the domain and range of respectively. The composition of maps : S —> T and ip : T —* U is the map ip : S —> U given by (il>)(x) = ip({x) = cj>(y) =>• x = y. A subset of S is a set R whose objects are amongst the objects of S; we write R C S; we may also write S D R and call S an overset of R. We put {x) : x £ R} = the set of all . We put im() = ; the map is surjective, or is a surjection, if : S —> T we have the inverse map <j>~1 : T —> S given by 4>~l{y) — x •*=> (f>(x) = y. Without assuming the map : S —> T to be bijective, for any y £ T we put (f>~1(y) = {x £ S : 4>{x) = y}, and for any U C T we put (j)~l(U) = {x £ S : <j>(x) £ U}\ moreover, for any Ac S and B CT with <j>(A) C B, by <M(A,B) w e denote the map A —> B which sends every x £ A to (f>(x) in B, and we call this the restriction of (f> to (A, B)\ if B = (A) then we may denote this by \A and call it the restriction of <> / to A. The empty set is denoted by 0. For subsets R\ and -R2 of a set S, the complement of i?2 in Ri is denoted by Ri\R2, i.e., Ri\R2 = {x £ Ri : x g R2}; needless to say that x $ R2 means x is not an element of R2, just as x ^ y means x is not equal to y, and so on. For subsets Ri and R2 of a set S, their intersection is denoted by i?i fli?2 and their union is denoted by R1UR2, i.e., R\ fli?2 = {x £ S : x £ R\ and x £ R2} and RiL) R2 = {x £ S : x £ Ri ox x £ R2}. Similarly for more than two subsets Ri,..., Rm of a set S we have r\i
§i: SUMMARY OF LECTURE LI ON QUADRATIC EQUATIONS

599

Ui r is a prime number. A homomorphism of a group G into a group J is a map : G —> J such that (xy) = (f>(x)<j)(y) for all x, y in G; the kernel of <j> is defined by ker(^) = ^ _ 1 ( 1 ) , and we have im() < J and ker() < G; note that (j> is injective iff (= if and only if) ker(^) = 1 and when that is so we call is surjective, i.e., if im(0) = J, then we call <j> an epimorphism; if is bijective then we call it an isomorphism; if J = G and is an isomorphism then we call (f> an automorphism of G. For H < G we put xH = {xy : y £ H} and call this a left coset of H in G (similar definition of a right coset), and by G/H we denote the set of all left cosets

600

LECTURE L6: PAUSE AND REFRESH

of H in G, and note that this is a partition of G; also we put [G : H] = \G/H\ and call this the index of H in G. If H < G then G/H becomes a group by defining (xH)(yH) = (xy)H, and we call G/H the factor group of G by H; now x H-» xH gives an epimorphism G —> G / # with kernel i/; we call this the canonical epimorphism of G onto G/i?. A finite group G is solvable if there is a chain 1 = Go < G\ < • • • < Gr = G such that d/Gi-i is cyclic of prime order for 1 < i < r. The set of all bijections of a set S onto itself forms a group under composition which we call the symmetric group on S and denote it by Sym(S'). If |S| = n € N then Sym(S') = Sn- Any permutation a on n letters, i.e., a £ Sn, can be written as a product of a certain number v of transpositions, and the parity of u, i.e., its evenness or oddness, depends only on 1. For n > 5, An is simple and Sn is unsolvable. See (XI), (X2), and (El) to (E6) of L1§11. A group G is commutative or abelian if xy = yx for all x, y in G. An additive abelian group is a commutative group in which the operation is written as a sum x + y instead of a product, 0 is written for the identity, and — £ is written for the inverse of x. To contrast with this, a usual group, with the operation written as a product or multiplication, may be called multiplicative. Deleting (iii) from the the definition of a group we get the definition of a monoid, and by deleting (ii) and (iii) we get the definition of a semigroup. The terms commutative or abelian semigroup, additive abelian semigroup, multiplicative semigroup, subsemigroup, and oversemigroup are obvious generalizations of the corresponding terms for groups; similarly for a monoid; note that the identity element of a submonoid is required to coincide with the identity group of the monoid. A semigroup H is cancellative means for any x, y, z in it we have the implication: xy = xz and yx = zx =$• y = z. If H is a subsemigroup of a group G then clearly H is cancellative. As examples, Z is an additive abelian group, N is a submonoid of Z, and N+ is a subsemigroup of Z. A ring R is an additive abelian group which is also a commutative multiplicative monoid such that the two operations are connected by the distributive laws saying that for all x, y, z in R we have x(y + z) = xy + xz and (y + z)x = yx + zx; we call R a null ring if in it we have 1 = 0, i.e., equivalently if \R\ = 1. A domain is a nonnull ring whose nonzero elements form a cancellative multiplicative monoid. A field is a nonnull ring whose nonzero elements form a multiplicative group. If R C S are rings under the same operations (and have the same zero and identity elements) then R is a subring of S or S is an overring of R. Similarly for subfields

§i: SUMMARY OF LECTURE LI ON QUADRATIC EQUATIONS

601

and overfields, and subdomains and overdomains. As an additive abelian group, every subgroup 7 of a ring R is a normal subgroup and so we can form the factor group R/I; a typical member of R/I is a residue class (= additive coset) a + I with a £ R. By an ideal in a ring R we mean an additive subgroup I of R such that for all a £ R and x £ I we have ax £ I, and when that is so we make R/I into a ring by denning (a + I)(b + I) = {ah) + I for all a, b in R. We call R/I the residue class ring of R modulo I, and we call I a maximal ideal or a prime ideal in R according as R/I is a field or a domain; for more standard definitions of prime ideal and maximal ideal see (Dl). Likewise, we call I a nonzero ideal or a nonunit ideal according as I ^ {0} or I =£ R; note that every ideal contains the zero ideal I = {0} and is contained in the unit ideal I = R; moreover, the zero ideal is prime <=> R is a domain; likewise, I is the unit ideal <=>• R/I is the null ring. For any a £ Rwe put aR = the set of all multiples az of a with z £ R, and for any W C R we put WR = the set of all finite linear combinations a\Z\ + • • • + aeze with a\,..., ae in W and z\,...,ze in R, and we note that these are ideals in R (by convention an empty sum is zero and hence 0 £ WR); they are called ideals generated by a and W respectively; an ideal of the form aR is called a principal ideal in R and a called a generator of it; in case W = {w\,W2, • • •}, we may denote WR by (wi,W2, • • -)R and call wi,W2,- • • its generators. The zero ideal, and more generally the (additive abelian) zero group may be denoted by 0 instead of {0}. It is easy to see that every ideal in Z is of the form pZ with p £ N; moreover pZ is a maximal ideal in Z iff p is a prime number, and then Z/pZ is the Galois field GF(p). The set of all automorphisms of a group G is clearly a subgroup of Sym(G) and we denote it by Aut(G). For rings S and T, a (ring) homomorphism : S —* T is a homomorphism of additive abelian groups such that (xy) = (j>(x)(j>(y) for all x, y in S; now ker(0) is an ideal in S, and the definitions of monomorphism, epimorphism, isomorphism, and automorphism carry over from the group case. Conversely, if : S —» S/I is the canonical ring epimorphism where / is an ideal in a ring S then ker(0) = I. The set of all ring automorphisms of a ring S is clearly a subgroup of Sym(S') and we denote it by Aut(S); momentarily letting this Aut be written as Ring-Aut(S') and writing Group-Aut(S') for the automorphism group of the additive abelian group S we clearly have Ring-Aut(5) < Group-Aut(5) < Sym(S'); it is usually clear from the context which automorphism group is being considered. Finally for a subring R of a ring S, by an i?-automorphism of S we mean an automorphism of S which leaves R elementwise fixed, i.e., (x) = x for all x G R; these form a subgroup of Aut(S') and we denote it by AutR(S). Dropping the commutativity of multiplication in a field (resp: ring, domain) we get the notion of a skew-field (resp: skew-ring, skew-domain). In the definitions of fields and rings we have spelled out two distributive laws, although they follow from each other, exactly because in case of skew-fields and skew-rings they do not. Moreover, in the definition of a field, by requiring the left-distributive law

602

LECTURE

L6: PAUSE AND

REFRESH

x

(y+z) = xy+xz, but not requiring the right-distributive law (y+z)x = yx+zx, we get the notion of a near-field. The concepts of subskew-field, overskew-field, and so on, are defined in an obvious manner. Also the above definitions of homomorphism, monomorphism, and so on, have obvious generalizations to semigroups, monoids, and so on. See section Ll§5 on Modules and Vector Spaces for the definitions of: a module V over a ring R, scalar multiplication in V, linearly independent and linearly dependent elements of V, submodule of V, submodule generated by a set of elements or a subset, generators of a submodule, finitely generated submodule, basis of a submodule, it-homomorphism (or it-linear map) K - » y * where V* is another it-module (= module over it), it-monomorphism, R-isomorphism, left and right modules over a skew-ring, vector space over a field K (= module over K), subspace (= submodule of a vector space), vectors (= elements of a vector space), the dimension of a vector space V over a field K denoted by d i m ^ ^ , and so on. An overring of a ring R is an it-module in an obvious sense; thus it is an itmodule, and an ideal in R is exactly an it-submodule of R. In particular, an overring L of a field if is a if-vector space, and we put [L : K] — dim^L. This applies to an overfield L of K, and then we call [L : K] the field degree of L/K, i.e., of L over K. For a ring or skew-ring R, by R+ we denote the underlying additive abelian group of R, and by i t x we denote the set of all nonzero elements of R; note that Kx is a multiplicative group in case of a field or skew-field K. Extending this notation to a vector space V we denote the set of all nonzero elements in it by Vx. For an overfield L of a field K, the symbol [L : K] usually denotes the field degree rather than the index of K+ in L+. See section Ll§6 on Polynomials and Rational Functions for the definitions of: indeterminate (= variable) over a ring, univariate polynomial ring R[X] over a ring it, degree of a polynomial, monic polynomial, nonconstant polynomial, Cauchy multiplication rule, univariate rational function field K{X) over a field K, derivative, quotient rule, multivariate polynomial ring R[Xi,... ,Xm], degree and partial derivatives of a multivariate polynomial, multivariate rational function field K{X\,..., Xm), substitution epimorphism R[X\,...,Xm] —+ R\x\,..., xm] = the ring generated over R by elements x\,..., xm in an overring of R, algebraic or transcendental element over R, algebraic set of elements over R, algebraically dependent or algebraically independent sets of elements over R, field K(x\,..., xm) generated over a field K by elements x\,..., xm in an overfield of K, transcendence basis of an overfield L of K, and the number of elements in it called the transcendence degree of L/K (= L over K) denoted by trdeg^L. See section Ll§7 on Euclidean Domains and Principal Ideal Domains for the definition of: the group U{R) of all units in a ring R (or two-sided units in a skew-ring R), associates and irreducible elements in a ring, UFD = unique factorization domain, PID = principal ideal domain, euclidean function, ED = euclidean domain, special ED, quasispecial subset, and quasispecial ED.

p.- SUMMARY OF LECTURE L2 ON CURVES AND SURFACES

603

See section LI§8 on Root Fields and Splitting Fields for the definitions of: ii-homomorphism (resp: il-monomorphism, i?-epimorphism, /^-isomorphism) between overrings of a ring R, root field of a univariate irreducible polynomial over a field K, splitting field of a nonconstant univariate polynomial / over K denoted by SF(/, K), separable polynomial / , the Galois group Gal(L*,K) of the splitting field L* = SF(/, K) of a nonconstant univariate separable polynomial / over K, the natural isomorphism Gal(L*,.£Q —» Gal(/, K) = the Galois group of / over K which is the group of all relation preserving permutations of the roots of / , and for any H < GaA(L*,K) the fixed field fix7>(i/) of H which is a field L between K and L*. In (Tl) we state the theorem which includes the assertion that H — i>L = &x.L'(H) gives an inclusion reversing bijection of the set of all subgroups H of Gal(L*, K) onto the set of all fields L between K and L and the inverse bijection is given by L —» Gal(L*,L). After stating other relevant theorems in (T2) to (T5), in (T6) we deduce Galois' Unsolvability Theorem which says that if a field k contains n! distinct n!-th roots of 1 then, for n > 5, the generic n-th degree polynomial over K = k(X\,..., Xn) cannot be solved by radicals. In (Dl) we define divisibility x\y and prime elements in a ring. In (R2) to (R4) we define: gcd and GCD (= greatest common divisor), and coprime and relatively prime elements. In (D2) to (D4) we define: characteristic of a ring R denoted by ch(R), minimal polynomial of an algebraic element over a field, 1cm and LCM (= least common multiple), and reduced form. In (R7) we define the (relative) algebraic closure of a field in an overfield. In (X2) we define the decomposition of a permutation into disjoint cycles. In (El) we define the modified discriminant. In (Nl) we define derivations and in (Ell) and (E12) we describe their properties. Proper Containment. By a proper subset A of a set B we mean A C B with A ^ B. We indicate this by writing A C B or B ^ A, and by saying that A is properly contained in B or B properly contains A. This agrees with the above definition of proper subgroup, and with the definitions in L4§8 and L5§5(Q16). §2: S U M M A R Y OF LECTURE L2 ON CURVES A N D SURFACES In L2§1 we introduce the concepts of cartesian product, partial order, linear order, well order, cardinals, and ordinals. The cartesian product S\ x • • • x Sm of sets S i , . . . ,Sm is the set of all m-tuples ( 7 1 , . . . ,7m) with ji £ Si. If Si = • • • = Sm = S then we write Sm for Si x • • • x Sm. Also we define concepts of algebraically closed field, an (absolute) algebraic closure of a field, and the (relative) algebraic closure of a field in an overfield. To prove the existence of algebraic closures, we introduce the Axiom of Choice, Zorn's Lemma, and Well Ordering. These are based on the concepts of posets (= partially ordered sets), losets (= ordered sets = linearly ordered sets), and wosets (= well ordered sets). Cardinals are equivalence classes of sets under bijections, and ordinals are equivalence classes of well ordered

604

LECTURE L6: PAUSE AND REFRESH

sets under order preserving bijections. The power set V(T) is the set of all subsets of a set T, and the restricted power set VX(T) = V{T) \ {0}. In L2§2 on Power Series and Meromorphic Series we introduce the notation QF(.E) for the quotient field of a domain E. The quotient field of the power series ring K[[Zi,. ..,Zd\] over a field K is the meromorphic series field K([Zi,...,Zd)). We define the orders and derivatives of meromorphic series. We use the geometric series identity (1 - X)(l + X + X2 + . . . ) = 1 to show that the group U(S) of all units in the ring of power series S = R[[Z\,..., Zd]] over a ring R are exactly those power series Q(Zi,..., Zd) whose constant term Q ( 0 , . . . , 0 ) belongs to U(R). In L2§3 on Valuations we generalize the concept of the order of a meromorphic series by defining a valuation of a field AT to be a map v : K —> G U {00} where G is an ordered abelian group G such that for all x, y in K we have: (1) v(xy) = v(x) + v(y); (2) v(x + y)> min(v(x),v(y)); (3) v(x) = 00 <=> x = 0.

We call G the assigned value group of v, and by the value group of v we mean the subgroup of G given by Gv = {v(x) : x £ Kx}. By the valuation ring of v we mean the ring Rv — {x £ K : v(x) > 0} and we note that this is clearly a subdomain (= subring which is a domain) of K which is its quotient field. We say that v is trivial over a subfield k of K, or that v is a valuation of K/k, to mean that x £ kx => v(x) = 0. For instance ord gives a valuation of the meromorphic series field K((Zi,..., Zd)) over any field K which is trivial over K and whose value group is Z. Here by an ordered abelian group we mean an additive abelian group G which is also an ordered set such that for all x, y, x', y' in G we have: x < y and x' < y' => x+x' < y+y'. For instance G could be the set R ^ of lexicographically ordered c!-tuples of real numbers r = ( r i , . . . , 7^), s = ( s i , . . . , Sd), • • •, where lexicographic order means: r < s o either r^ = s, for 1 < i < d, or for some j with 1 < j < d we have ri = Si fox 1 < i < j and TV, < Sj. Or G could be a subgroup of R^' such as G = Z or or M. The field K((X)) is generalized by taking any field K and any ordered abelian group G and considering the field K((X))G = {A £ KG : Supp(^4) is well ordered} G where K is the set of all maps G —» K and where the support of A 6 KG, denoted by Supp(A), is defined by putting Supp(A) = {g £ G : A(g) ^ 0}. For any A £ K((X))G we define ord(A) = min Supp(^) if A ^ 0 and ord(A) = 00 if A = 0, and we note that now ord is a valuation of K((X))o/K with value group G. In (Tl) we state Newton's 1660 Theorem on the algebraic closure of k((X)) when k is an algebraically closed field with ch(fc) = 0, and in (T2) we formulate its generalization for ch(fc) = p > 0. In L2§5 on Zorn's Lemma and Well Ordering we prove that these two are equivalent and they are equivalent to the Axiom of Choice. Then by using Zorn's Lemma we prove the: (R5) existence of maximal ideals, (R6) existence of algebraic closure, (R7) uniqueness of algebraic closure, existence of vector space bases as well

§2: SUMMARY OF LECTURE L2 ON CURVES AND SURFACES

605

as transcendence bases, (R9) linear order on cardinals, (RIO) well order on ordinals. Let us state the contents of Remarks (R5) to (R8) in greater detail thus. (R5) Any nonunit ideal in a ring R is contained in some maximal ideal in R. (R6) Any field K has an algebraic closure, i.e., an algebraically closed field K which is algebraic over K. (R7) The K of (R6) is unique in the sense that given any field isomorphism <j> : K —> L and any algebraic closure L of L there exists an isomorphism <j> : K —> L such that 4>{x) = 4>{x) for all x G K. (R8) Let K he a, field and let L be either a vector space over K or an overfield of K. Let W be a subset of L. In the vector space case (resp: overfield case): let us call W independent if every finite subset of W is linearly (resp: algebraically) independent over K, and let us call W generating if L coincides with KW (resp: if L is algebraic over K{W))\ in both cases call W a basis if it is independent and generating; given any other subset U of L, let us say that U is dependent on W if every u G U belongs to KW (resp: is algebraic over K(W)). Then we prove the facts (W7) and (W8) stated below. W is a basis

•$=> it is a minimal generating set <=> it is a maximal independent set

(W7)

where the minimality means that W is a generating set but there is no generating set W with W C W and W ^= W, and the maximality means that W is an independent set but there is no independent set W with W C W and W ^ W. W is generating => there exists a basis W' with W' C W.

(W8)

In L2§7 on Definitions and Exercises we define: real and complex numbers, ordered fields, torsion subgroups and divisible groups, rational and real completions, Cauchy sequences and Dedekind cuts, rational and real ranks. In (D7) we define approximate roots of polynomials and in (E15) to (E17) list their basic properties. More About Fields. In L3§12(E6) we show that if G(Zi,..., ZN) is any given polynomial over an infinite field k such that G(a,\,..., ajv) = 0 for all a\,..., a^ in k then G{Z\,..., ZN) = 0. In L4§10(E1) we elucidate the above Remark (R6). In L5§5(Q27)(C41) we give a criterion for the minimal polynomial of an algebraic element to be separable. In L5§5(Q29)(T109) we discuss the linear disjointness of fields defined in the preamble of (Q29); we discuss this further in L5§6(E22). In L5§5(Q29)(C48) we give a dimension formula for subspaces of a vector spaces. In L5§5(Q32) we discuss separable and inseparable polynomials, separable and inseparable elements, purely inseparable elements, separable and inseparable field extensions, purely inseparable field extensions, finite fields, perfect fields, and the existence of primitive elements for finite separable algebraic field extensions. For any power q of any prime number p, in L5§5(Q32) we show that up to isomorphism there is a unique field of q elements; this is called the Galois field GF(q).

606

LECTURE L6: PAUSE AND REFRESH

§3: S U M M A R Y OF LECTURE L3 ON T A N G E N T S A N D POLARS In L3§1 we introduce the concepts of rectangular matrices with entries in a ring, and groups of square matrices whose determinants are units in the ring of entries. In L3§2 we study the geometry of quadrics and their pole-polar properties. In L3§3 to L3§5 this is expanded into the geometry of hypersurfaces, homogeneous coordinates, projective spaces, tangents, and singularities. In L3§6 we prove Basic Hensel's Lemma which says that if a monic polynomial in Y whose coefficients are power series in X factors into coprime factors after putting X = 0 then it factors before putting X = 0. Using this and using the completing the square method of solving quadratic equations conceived by the fifth century Indian mathematician Shreedharacharya, in L3§7 we establish Newton's Theorem on fractional power series expansion which was stated in L2§3(T1) as a theorem on the algebraic closure of k((X)). Newton's Theorem motivates the idea of integral dependence which we take up in L3§7 to L3§9. An element t in an overring 5 of a ring R is said to be integral over R if t satisfies a monic polynomial equation over R; a univariate polynomial over R is monic means its leading coefficient, i.e., the coefficient of the highest degree term in it, is 1. The set of all elements in S which are integral over R is a subring R' of S and it is called the integral closure of R in S; if R' = R then we say that R is integrally closed in S; if R is integrally closed in its total quotient ring QR(i?), as defined in L4§7, then R is said to be normal. In particular a domain R is normal means it is integrally closed in its quotient field K. A domain R is overnormal means for every pair of monic polynomials in an indeterminate Y over K we have: A(Y)B(Y) G R[Y] =>• A{Y) G R[Y\ and B(Y) G R[Y}. A domain R is intval means it is the intersection of a nonempty family of valuation rings of K, i.e., valuation rings of valuations of K. It turns out that for any domain we have: UFD =>• intval «=> overnormal o normal. The GCD of all the coefficients of a nonzero polynomial over a UFD is its content. Gauss Lemma says that the content of a product is the product of their contents. This yields the fact that a finite variable polynomial ring over a UFD is a UFD. The proofs of some of the above assertions may be found in L4§10(E2), L4§10(E3), and L4§12(R7). In L3§11 we deduce the multivariable version of Hensel's Lemma as a consequence of the Abstract WPT = Weierstrass Preparation Theorem and its companion the Abstract WDT = Weierstrass Division Theorem. These deal with the univariate power series ring S — i?[[V]] over a complete Hausdorff quasilocal ring R. By a quasilocal ring we mean a ring R with a unique maximal ideal M(R); note that a ring R is quasilocal iff nonunits in it form an ideal (which then equals M{R))\ also note that the valuation ring Rv of any valuation v of a field K is a quasilocal ring with M(RV) = {x G K : v(x) > 0}. The Weierstrass degree of any / G S is defined by wideg(/) = ord(F) where F G (R/M(R))[[Y}} is obtained by applying the residue class map R —> R/M(R) to the coefficients of / . Now W P T

§3: SUMMARY OF LECTURE L3 ON TANGENTS AND POLARS

607

says that every f £ S with wideg(/) = d G N can uniquely be written as / = Sf* where 5 is a unit in S and / * is a distinguished polynomial of degree d. i.e., /* = Yd + f{Yd-1 + ••• + ft with f{,..., fl in M(R), and WDT says that any g G S can be uniquely written as g = qf + r where q G S and r G i?[V] with deg(r) < d. The above abstract versions of WPT and WDT are converted into their concrete versions by taking R to be the m-variable power series ring /c[[Xi,... ,Xm}] over a field k. To apply these to the n-variable power series ring S = fc[[Xj,... ,Xn}] over a field k we need to ensure that wideg of a nonzero element f of S relative to Xn is finite so that we can take V = Xn and m = n — 1. In case the field k is infinite, we do this by rotating coordinates, i.e., by making a fc-linear automorphism as explained in L3§12(D4). Alternatively, without assuming k to be infinite, again as explained in L3§12(D4), we achieve the same thing by making a more general polynomial automorphism. About the polynomial ring R[X\,... ,Xm} and the power series ring R[[Xi,... ,Xm]] in indeterminates Xi,... ,Xm over a ring R with positive integer m let us note that, according the famous Hilbert Basis Theorem as proved in (T3) and (T4) of L4§3, if the ring R is noetherian then the rings R[X\,..., JsTm] and i?[[Xi,..., X m ]] are also noetherian, where we note that in honor of Emmy Noether a ring R in which every ideal is finitely generated is said to be noetherian. It follows that the power series ring A;[[Xi,..., Xm]] over a field k is a local ring, by which we mean a noetherian quasilocal ring. In L5§5(Q4) we prove the famous Krull Intersection Theorem which says that every local ring R is Hausdorff, i.e., in it we have nn€^M(R)n = 0. Thus in a local ring R, for every x G Rx = the set of all nonzero elements of R we have ord^rr G N, where for any x in any quasilocal ring R we have put ord/ja; = max{n G N : a; G M(R)n} with the understanding that if x G (~]n€nM(R)n then ordfla; = oo. Note that if R is the power series ring /c[[Xi,..., Xm}] over a field k then for any f = f(X1,...,Xm)=

J2

fi1...imXi*...X%

with

fn...imek

we have ordfl/ = ord(/) = m i n l ^ + •••+ im: (ii,...,im)

€ Supp(/)}

where Supp(/) = {{iu ... ,im) G Nm : fh..Am ^ 0}. In (Dl) to (D3) and (El) to (E4) of L3§12 we discuss the theory of matrices including: ranks, minors, cofactors, transposes, adjoints, and linear maps. In (D5) and (E12) to (E14) of L3§12 we generalize Hensel's Lemma by extending the notions of ord, completeness, and Hausdorffhess from quasilocal rings to more general situations in the following manner. Let / be an ideal in a ring R and let V be an i?-module. V is 7-HausdorfF, or

608

LECTURE L6: PAUSE AND REFRESH

Hausdorff relative to 7, means Hi^^PV = 0, where for any ideal J in R, by JV we denote the submodule of V given by JV = {$2J%Vi : ji G J and Uj £ V}. For any h e V w e define the (R, 7)-order of h by putting ord( fl| /)/i = max{i g N : / i £ Z'V} and we note that then V is 7-Hausdorff means for any h G V we have: ord^/jft = co <=> /i = 0. A sequence x = (xi)i NE and j > NE we have xi — Xj G IEV. The sequence x is /-convergent, or convergent relative to 7, means it converges to an 7-limit £ € V, i.e., for every E £ N there exists NE £ N such that for all i > NE we have ( - ^ e 7 B V; we indicate this by some standard notation such as xt —> £ as i —> oo or lim^ocX, = £. If V is 7-Hausdorff then a limit when it exists is unique. V is /-complete means every 7-Cauchy sequence in it has a limit in it. If R is quasilocal with 7 = M(R) and V = R then the adjective "relative to 7" may be dropped from the above definitions. §4: S U M M A R Y OF LECTURE L4 ON VARIETIES A N D MODELS In L4§1 we open up the lecture with Sylvester's 1840 theory of the resultant of two univariate polynomials / , g of degrees n, m which by eliminating the variable Y produces an n+m by n+m matrix Resmaty (/, g) in their coefficients, the vanishing of whose determinant Resy(/, g) gives a condition for them to have a common solution. In turn the vanishing of the discriminant Discy(/) = Resy(/, / y ) gives a condition for / to have a multiple root. Having already cited the Hilbert Basis Theorem proved in L4§3 which follows the general chit-chat about curves, surfaces, and varieties in L4§2, let us now describe L4§5 on ideals and modules consisting of Observations (07) to (027). Before coming to (07), in the preamble of L4§5 we talk about intersections and sums of submodules of a module V over a ring, and make the definitions of the colon (or quotient) modules (C : B)v and (C : a)y, where C is a submodule of V with B C R and a e R, by putting (C : a)v = {v € V : av e C} and (C : B)v = nb€B(C : a)v. In case of ideals we also consider their products, and we define the colon (or quotient) ideals (C : B)R and (C : O)R, where C is a submodule of V with B C V and a £ V, by putting (C : a)R = {r 6 R : rv € C} and (C : B)R = n b e B ( C : a)R. Moreover, for a submodule U of V and an ideal 7 in R, we define their radicals by putting r&dyU = {r £ R : reV C U for some e £ N+} and rad/j7 = {r £ R : re £ I for some e G N+}. Also we define zerodivisors and nilpotents. Finally we define primary and irreducible ideals as well as submodules. In (07), (08), (09), (012), (013), and (018) we discuss primary ideals and submodules as well as annihilators and colons and relate them to radicals. In (O10) and (Oil) we discuss basic isomorphism theorems. In (014) we define noetherian modules in terms of the three equivalent conditions of NNC (= noetherian condition saying that every submodule is finitely generated) and ACC (= ascending chain condition which says that every ascending

§4: SUMMARY OF LECTURE H ON VARIETIES AND MODELS

609

chain of submodules stops) and MXC (= maximal condition which says that every nonempty family of submodules has a maximal element). In (015) we show that every submodule U of a noetherian module V over a ring R has an i r r e d u n d a n t p r i m a r y decomposition U = U\ n • • • n Uh where U\,...,Uh are a finite number of primary submodules of V; here irredundant means none of the Z74 can be deleted and the radicals P\,... ,Ph of U\,..., Uh are distinct prime ideals in R. In (016) we introduce the s p e c t r u m spec(R) of a ring R as the set of all prime ideals in R. The set of all maximal ideals in R is denoted by mspec(-R). The set of all members of spec(.R) which contain an ideal J of R is called the s p e c t r a l variety of J and is denoted by v s p e c ^ J . The sets of all m a x i m a l and m i n i m a l members of vspecjiJ are denoted by rnvspec^J and n v s p e c ^ J respectively. In L5§5(Q7)(C11) we also put nspec(il) = nvspec^O. The set of all s p e c t r a l varieties in R is denoted by svt(i?). The spectral ideal of any subset W of spec(R) is defined by putting ispecRW = C\p rd(.R) whose inverse is given by vspec/j; this is part (T22.8) of the Hilbert Nullstellensatz L4§8(T22) proved in L5§5(Q11). In (016) and (017) we define the associator ass^V and the tight associator tass/jF of an i?-module V as certain subsets of spec(R), and show that for the above irredundant primary decomposition U = U\ n • • • n Uh of the submodule U of the noetherian .ft-module V we have SSSR{V/U) = tassR(V/U) = { P i , . . . ,Ph}Members of a,ssn(V/U) are called associated p r i m e s of U in V. In (019) we use bracketed colon to prove partial uniqueness of the primary components Ui in the above decomposition. A multiplicative set in a ring R is a subset S of R with l e S such that S contains the product of every pair of elements in it. For any submodule U of an il-module V we put [U : S]v = ^a^s(U : s)v and call this the isolated S'-component of U in V; the uniqueness follows by showing that [U : S]v = np i n s=0^i- For any prime ideals Q\,...,Qn in R with n e N+ we put [U : (Qi, • • • ,Qn)]v = [U : S]v where S = rii<j< n (.R \ Qj) and call this the isolated ( Q i , . . . ,(5„)-component of U in V, and we note that now {1 < i < h : Pi n S = 0} = {1 < i < h : Pi C Qj for 1 < j < n}. In (020) we generalize the idea of primary decomposition to q u a s i p r i m a r y decomposition without assuming the noetherian condition. We also introduce the notation ZR(V) = t h e set of all zerodivisors of a module V over a ring R, and SR(V) = t h e multiplicative set R \ ZR(V) in R. In (021) we study the properties of the l e n g t h (.R{V) of a module V over a ring R which we define to be the maximum length n £ N of a finite sequence 0 = Vb § Vi C • • • ^ Vn — V of submodules of V which we call a n o r m a l series of V of length n. When there does not exist a bound for the lengths of such sequences then we put £R(V) = OO. By a simple m o d u l e we mean a module of length 1.

610

LECTURE

L6: PAUSE AND

REFRESH

In (022) we define heights and depths of ideals and dimensions of rings. By the dimension dim(P) of a ring R we mean the maximum length n G N of a finite sequence (5) PQ C p1 C . . . C pn 0 f prime ideals in R. We call such a sequence a prime sequence in R of length n. When there does not exist a bound for such sequences then we put dim(P) = oo provided R is not the null ring; if R is the null ring then we put dim(P) = —oo. By the height h t # P (resp: depth dpt^P) of a prime ideal P in R we mean the maximum length of a prime chain (5) in R with P — Pn (resp: P = P0); again the maximum can be oo. We define the height ht# J and the depth dpt^J of a nonunit ideal J in R by putting ht# J = min{htijP : P G vspec fl J } G N U {oo} and dptRJ = max{dpt f l P : P G vspec fl J } G N U {oo}. Let us complete the definitions of height and depth by putting h t ^ P = —oo = d p t f i P . In (025), analogous to noetherian modules, we define artinian modules in terms of the two equivalent conditions of DCC (= descending chain condition which says that every descending chain of submodules stops) and MNC (= minimal condition which says that every nonempty family of submodules has a minimal element). A ring is artinian if it is artinian as a module over itself. We show that a module has finite length iff it is artinian as well as noetherian. We show that a ring is artinian iff it is either the null ring or a zero-dimensional noetherian ring. We show that a finitely generated module over a noetherian (resp: artinian) ring is noetherian (resp: artinian). In (026) we define ideals A,B in a, ring R to be comaximal to mean that A + B = R. We show that if A i , . . . , An are a finite number of pairwise comaximal ideals in R then rii : R —> Rs thus. We define Rs to be the set of all equivalence classes of pairs (u,v) G R x S under the equivalence relation given by: (u,v) ~ (u',v') o v"(uv' — u'v) = 0 for some v" G S. The equivalence class containing (u, v) is denoted by u/v or ^, and we add and multiply the equivalence classes by the rules ^ + ^ = "i^+"a"i and ^ x ^ = w*. Also we "send" any J

V\

V2

V\V2

Vl

V2

Vlf2

u G R to u / 1 G Rs- This makes Rs into a ring and u \—> w/1 gives the canonical ring homomorphism 0 : R -+ Rs- Clearly <j)(S) C U(Rs) and ker((/>) = [0 : S]R. The localization of R at SR(R) is the total quotient ring QR(i?) of R; clearly [0 : SR(R)] = 0 and hence we may and we do regard QR(P) to be an overring of R. Now: ker() = 0 « 5 c SR(R). If S C SR(R) then there is a unique P-injection of Rs into QR(P); we call its image the localization of R at S in QR(P) and continue to denote it by Rs; let us call this the GOOD case; thus in the good case Rs is an overring of R. For any prime ideal P in R we let Rp stand for RR\P and we call it

§5: SUMMARY OF LECTURE L5 ON PROJECTIVE VARIETIES

611

the localization of R at P. Some basic properties of localization are proved in Theorems (T9) to (T17) of L4§7 including L4§7.1. In particular, in (T12) we prove the salient fact which says that: I n-> J = (p(I)Rs gives a bijection of the set of all ideals I in R for which [I : S}s = I onto the set of of all ideals in Rs and its inverse is given by J — i > _1( J); this bijection commutes with the ideal theoretic operations of radicals, intersections and quotients. The geometry started in §§2-5 of L3 is picked up again in §§8-9 of L4. Common solutions of a finite number of polynomial equations in a finite number of variables with coefficients in a field form an afBne algebraic variety. Some properties of these varieties are stated in Theorems (T19) to (T22) of L4§8 and proved in (Q10), (Qll), and (Q32) of L5§5. In L4§8.1 we relate these varieties to the spectral varieties discussed in the above item (016) and then in L4§8.2, via localization, we relate them to modelic specs which are collections of local rings. Now a simple point is reincarnated as a regular local ring which means a local R whose embedding dimension emdim(-R) equals its dimension, where emdim(-R) is defined to be the smallest number of generators of M(R). In L4§8.3, L4§9, L4§9.1, and L4§9.2 we formulate some properties of modelic specs as Theorems (T23) to (T30) which are proved in (R7) to (R9) of L4§12 together with (Q7), (Q14), and (Q17) of L5§5. In L4§9.2 we show how modelic specs give rise to modelic blowups and in L4§9.3 we use them for simplifying singularities. In L4§11 we start a new column called Problems by doing which the student may get "mild satisfaction or Ph.D. thesis or fame." In (Rl) to (R5) of L4§12 we deal with Laplace development of determinants, block matrices, and Cramer's rule for solving linear equations. In (R6) of L4§12 we concoct a resultant theory which makes it valid over rings with zerodivisors. This is done by converting the product rule for polynomials into matrix multiplication and expanding this into a product formula for the resultant matrix. §5: S U M M A R Y OF L E C T U R E L5 O N P R O J E C T I V E VARIETIES In L5§1 we define the direct sums of modules and related objects. A module V over a ring R is an internal direct sum of a family (Vj); e j of submodules if it is the sum V = ^ i e / V a n d the sum is direct, i.e., for every finite subset I' of the indexing set / we have: X) i e / , Vi = 0 with vt e Vi for a l i i € V => Vi = 0 for all i € / ' . The fact that the sum is direct is expressed by writing V = X)ie/ ^» (internal direct sum) or V = © i e / V (internal). In case J is a finite set, say I = { 1 , 2 , . . . , n}, we may write V = V\ © V2 © • • • © Vn (internal). The cartesian product W = Ui x • • • x Un of a finite number of .R-module U\,..., Un is converted into an .R-module by componentwise addition and scalar multiplication, and then we call W the direct sum of U\,..., Un and indicate this

612

LECTURE

L6: PAUSE AND

REFRESH

by writing W = U\ © U2 © • • • © Un; to distinguish this from internal direct sum we may call it external direct sum. If U\ = • •• = Un = U then, as in the case of cartesian products, we put W = Un and call it the (module theoretic) direct sum of n copies of U. More generally, the cartesian product of a family (C/,)j £ / of sets Ui, where I is any indexing set (which need not be finite), is defined by putting Yli€l Ui = the set of all maps : I —> | J i e / Ut such that (f>{i) £ Ui for all i £ I. In case the Ui are i?-modules, this becomes an ii-module, called the direct product, by defining componentwise addition and scalar multiplication. The set of all (f> £ YlieI Ui whose support supp() = {i £ I : 4>(i) ^ 0} is finite is called the direct sum of the family and is denoted by 0 i e / Ui\ note that this is a submodule of the direct product Yliei U% a n d *s hence an i?-module; to distinguish this from the internal direct sum we may again call it the external direct sum. For any j £ I we get an obvious injective map \ij : Uj —> Yliei ^t called the natural injection, and a surjective map Vj : Yliei Ui —> Uj called the natural projection. Clearly i/j/Xj is the identity map of Ui. All this works for sets UiIn the module case these maps are i?-homomorphisms, and they give rise to an obvious i?-monomorphism (again called the natural injection) fij : Uj —> ®ieiUi and an obvious i?-epimorphism (again called the natural projection) Vj : (Bi^iUi —> Uj. Letting Ui to be the image of \ii we have © i e / Ui (external) = 0 i e / Ui (internal) . If Ui = a set U for all i £ I then the cartesian product Yliei ^% is denoted by U1 and is called the set-theoretic I-th power of U, and if U is an i?-module then the module U1 is called the module theoretic 7-th power of U. Finally, if the module Ui = a module U for a l i i £ I then the module ®i^iUi is denoted by U$ and is called the module theoretic restricted 7-th power of U. Other related definitions such as diagonal direct sum may be found in L5§1. In L5§2 we define a graded ring S over a ring R to be an overring of R such that S = Y^iei & a s a n internal direct sum of a family (5j)ig/ of i?-modules with So = R where the type I is any additive abelian monoid, and for all fi £ Si and fj £ Sj with i, j in I we have fifj £ Si+j. An ideal in S is homogeneous if it is generated by homogeneous elements, i.e., elements of Ui^iSi. A ring homomorphism (j> : S —> S" of graded rings of type / is graded if (fi(Si) C S'i for all i. If I = N then we call S a naturally graded ring. In L5§3 we define a semihomogeneous ring to be a naturally graded ring S such that S = S'o[<S'i]; if Si is a finitely generated module over R = So then we call S a homogeneous ring. Also we study the ideal theory of graded rings. The very long L5§5 on More About Ideals And Modules is divided into Quests (Ql) ro (Q35) which we proceed to summarize. Referring to (014) to (017) of the above Summary of L4, let U = U\ n • • • n Uh be an irredundant primary decomposition of a submodule U of a finitely generated module V over a noetherian ring R, and let P i , . . . , Ph be the prime ideals in R which are the respective radicals of U\,. •. ,Uh- In the Zerodivisor Theorem (Q1)(T1) we show that then ZR(V/U) = Pi U • • • U Ph.

§5.- SUMMARY OF LECTURE L5 ON PROJECTIVE VARIETIES

613

In (Q2) we define a faithful module over a ring R to be an i?-module D such that ann#D = 0. Note that ann f l D = {r e R : rd = 0 for all d G D). In (Q3) we define the Jacobson radical jrad(fi) of a ring R to be the intersection of all its maximal ideals, and we define a Zariski ring to be pair (R, J) where J is an ideal in a ring R which is contained in the Jacobson radical of R. In the very versatile Nakayama Lemma (Q3)(T3) we show that: if U is a submodule of a finitely generated module V over a ring R such that V = U + JV for an ideal J in R with J C jrad(i?), then V = U. The particularly noteworthy case of this when R is a local ring with J = M(R) was proved by the master algebraist Krull. In (Q4) we prove: the Krull Intersection Theorem (T7) saying that in any local ring R we have r\n^nM(R)n = 0, its Domainized Version (T10) saying that for any nonunit ideal J in any domain R we have r\n^Jn = 0, and its variation which is called the Artin Rees Lemma (T4) and which says that, given any ideals V, W, J in a noetherian ring R, there exists a G N such that for all c > b > a in N we have JCW D V = Jc-b{JbW D V). In (Q5) we explain Nagata's idealization principle which converts module situations to ideal situations, and use it prove the module incarnations of the above results of (Q4). We also prove a Characterization Theorem (T20) for Zariski Rings. In (Q6) we prove Cohen's Theorem saying that a ring is noetherian if all the prime ideals in it are finitely generated, and Eakin's Theorem saying that a ring is noetherian if some noetherian overring is a finitely generated module over it. In (Q7) we prove Krull's Generalized Principal Ideal Theorem (T27) saying that: if an ideal J in a noetherian ring R is generated by n elements x\,... ,xn with n G N then htjj/ < n, and if Xi is a nonzerodivisor mod x\,..., Xi-\ for 1 dim(iZ) = the smallest number of elements in R which generate an M(iZ)-primary ideal. We also deduce the Dimension Lemma (T30) about the polynomial ring S = R[X\,... ,Xm] as well as the power series ring S = i?[[Xi,..., Xm]} in a finite number of variables X\,..., Xm over a noetherian ring R, saying that: (1) S is noetherian, and for any nonunit ideal I in R and j G { 0 , . . . , m } , upon letting Ij = IS + (X\,... ,Xj)S we have Ij n R = I with htslj = j + h t ^ / , and if / G spec(f?) then we also have Ij G spec(S); (2) dim(S') = m + dim(R); (3) if R local and we are in the power series case then S is local with M(S) = M(R)S + (Xi,..., Xm)S and emdim(S) = m + emdim(i?), and hence: R is regular <=> S is regular; (4) If the ring R is a field then R[X\,..., Xm] is an m-dimensional noetherian ring and -R[[-X"i,..., Xm]] is an m-dimensional regular local ring; (5) in the polynomial case with m = 1, for any I = J D R with J G spec(S') we have J G spec(i?) and: (i) if J = IS then htg J = h t ^ / , whereas (ii) if J ^ IS then hts J = 1 + h t ^ / and there exists F(Xi) G J of degree E > 0 such that the coefficient of X^ in F belongs to R \ I. In the Relative Independence Theorem (Q8)(T34) we show that n elements

614

LECTURE

L6: PAUSE AND

REFRESH

in a noetherian ring generate an ideal of height n iff they are independent over that ring in the following sense. Considering the n variable polynomial ring R[Yi,..., Yn] in n variables Yi,...,Yn over a ring R with n G N, for any A c R and i G N we let A[Y\,..., Yn]i = the set of all homogeneous members of P [ Y i , . . . , Yn] of degree i all whose coefficients belong to A (including the zero polynomial in case 0 e A) and we put A[Yi,... , Yn]oo = UteN^I^i' • • • >^n]i- Let J be an ideal in R. For elements x\,..., xn in it let I = (rci,..., xn)R. The elements xi,...,xn are said to be independent over (P, J ) , or independent relative to (P, J ) , if 7 C J and: (f) F(YU... , Yn) G P[Yi,. •. ,y„]oo with F(Xl, . . . , ! „ ) = 0 ^ ^ . . . . . I ^ G

J[Y1,...,Y„]00.

The elements a?i,..., x n are said to be independent over R, or independent relative to R, if I ^ R and the elements are independent over (R, radfj/). Considering the P-epimorphism a : P [ Y i , . . . , Yn] —> R which sends Y, to :r, for 1 < i < n, we show that (tt) ker(a) = (Yi — x\,..., Yn — xn)R[Yi,..., Yn] and condition (f) is equivalent to the condition: (f) ker(a) n R[Yi,..., yn]oo C J [ y i , . . . , yn]oo- Assuming I C J, we show that condition (f) is equivalent to the condition: (}) F(YU... ,y„) G R[YU.. .,Yn]i with i G N and F(xu. ..,xn)€ Ii+1 ^F(Y1,...,Yn)sJ[Y1,...,Yn}i. In case R is the power series ring X [ [ X i , . . . , Xm]] in a finite number of variables over a field K, elements xi,...,xn in M(R) are said to be analytically independent over K if there is no nonzero / ( Y i , . . . , Yn) G if [[yi, •. •, Yn]] with f(xi,..., xn) = 0. In the Analytic Independence Theorem (Q8)(T35) we show that: (1) if the elements independent over (R, M(R)) then they are analytically independent over K, and (2) if m = n and the ideal I is M(P)-primary then the elements xi,..., xn are analytically independent over K. In (Q9) we show that the localization of any normal ring at a multiplicative set in it not containing any zerodivisor is again normal. A prime ideal Q in an overring S of a ring R lies above a prime ideal P in R means P = Q n R; we may also say that P lies below Q in P . In (Q9) we mostly deal with an integral extension of a ring, i.e., an overring S of a ring R such that S is integral over P . We show that for any prime ideal P in P lying below a prime ideal Q in S we have: P is maximal «=> <2 maximal. In the Lying Above Theorem (T40) we show that for any prime ideal P in P , there exists a prime ideal Q in S lying above P . In the Going U p Theorem (T41) we show that given any prime ideals P* C P in R, and given any prime ideal Q* in S lying above P*, there exists a prime ideal Q in S lying above P for which we have Q* C Q. In the Going Down Theorem (T44) we show that if R is a normal domain and S is a domain, then given any prime ideals P* C P in R, and any prime ideal Q in S lying above P , there exists a prime ideal Q* in S lying above P* with Q* c Q. In the Dimension Corollary (T45) we show that: (1) above any ascending finite sequence of prime ideals in R there lies an ascending finite sequence of prime ideals in S; (2) any strictly ascending finite sequence of prime

§5: SUMMARY OF LECTURE L5 ON PROJECTIVE VARIETIES

615

ideals in S lies above a strictly ascending finite sequence of prime ideals in R; (3) dim(S') = dim(P); (4) for any prime ideal Q in S lying above any prime ideal P in R we have dptsQ = d p t R P and htsQ < h t # P ; (5) if S is a domain and R is a normal domain, then for any prime ideal Q in S lying above any prime ideal P in R we have htsQ = h t ^ P . In (Q10) we prove the Noether Normalization Theorem (T46) which says that any affine ring A over a field k has a normalization basis over k, by which we mean a finite number of elements Y\,..., Yr in A which are algebraically independent over k and which are such that the ring A is integral over the subring k[Y\,..., Yr]. By a saturated chain of prime ideals in a ring R we mean a finite sequence Po ^ Pi ^ • • • ^ P n of prime ideals in R such that there is no prime ideal P* in R with Pi Cj P* C pi+1 for some i £ { 0 , . . . ,n - 1}; we call n the length of the chain; if P C Q are prime ideals in R with P = PQ and Q = Pn then we say that PQ ^ Pi ^ • • • ^ P n is a saturated prime ideal chain between P and Q; if there is no prime P in R with P C p0 a n c i there is no prime Q in P with Pn ^ Q then we say that Po ^ Pi ^ • • ^ P n is an absolutely saturated prime ideal chain in R. By an infinite chain of prime ideals in a ring R we mean a sequence of prime ideals (Pj)ieN in R such that for all i € N we have P, C P i + 1 ; if P c Q are prime ideals in R such that P C Pi C Q for all j e N then we say that this is an infinite prime ideal chain between P and Q. For a subdomain R of a domain R', by trdeg^P' we denote the transcendence degree of QF(P') over QF(P). More generally, for a prime ideal P' in a ring R' lying above a prime ideal P in a subring R, after identifying R/P with a subdomain of R'/P', by trdegR/pR'/P' we denote the transcendence degree of QF(R'/P') over Q F ( P / P ) ; in case R is a subfield fc of P ' , we write tvdegkR'/P' in place of trdegfc/oP'/^"> in case P' and P are the respective maximal ideals in quasilocal rings R' and R, we may write restrdeg^P' in place of t r d e g ^ / p P ' / P ' and call this the residual transcendence degree of R' over R. Using (T46) we establish the Extended Dimension Theorem (T47) which subsumes L4§8(T19) and which says that for any affine ring A over a field k we have the following: (T47.1) For any nonunit ideal J in A, letting J\,..., Jm be the distinct minimal primes of J in R we have dim(A/J) = dpt A J = maxi< i < m trdeg fc ( J 4/Jj) e N, and for any prime ideal P in A we have dim(A/P) — d p t A P = trdeg fe (A/P) e N. Hence in particular, if A is a domain then dim(A) = txdegkA £ N. (T47.2) Every pair of prime ideals P C Q in A has the property which says that: (*) there is no infinite chain of prime ideals between P and Q, and any two saturated chains of prime ideals between P and Q have the same length. Moreover: (**) this common length equals dim(A/P) - dim.(A/Q). (T47.3) If A is a domain then the length of every absolutely saturated chain of prime ideals in R equals the dimension of R. (T47.4) If A and H is a nonunit ideal H in A then htAH + dptAH = dim(A).

616

LECTURE

L6: PAUSE AND

REFRESH

(T47.5) If A is a domain then, for any prime ideal P in A, the length of every saturated prime ideal chain in A between 0 and P equals h t ^ P , and the length of every saturated prime ideal chain in A between P and any member of mvspec^P equals d p t ^ P . (T47.6) If A is a domain and A' is an affine domain over A then for any prime ideal P' in A' lying above any prime ideal P in A, upon considering the localizations R' = A'P, and R = Ap, we have dim(P') 4- restrdeg fl P' = dim(Pt) + trdegRR'. (T47.7) in the situation of (T46) we have dim(A) = r. In (Q10) and (Qll) we prove the Inclusion Relations Theorem L4§8(T21) and the Hilbert Nullstellensatz L4§8(T22). In (Qll), the Jacobson Spectrum And Nilradical of a ring R are defined by the equation jspec(ft) = { P e spec(R) : P = n<2emvSpecfipQ} and the equation nrad(fl) = (~\pesPec(R)P- The ring R is said to be a Jacobson ring if jspec(P) = spec(P). Using jspec we supplement L4§8(T21) in (Q11)(T50). Using nrad we prove the Minimal Primes Theorem(Qll)(T51) which says that, given any ideal J in any ring R, every member of vspec/j J contains some member of nvspecn J, and we have rad f i J = nPenvspeCRjP. A ring A is catenarian means every pair of prime ideals P c Q in it satisfies the above condition (*) of (T47.2). A ring A is universally catenarian means it is noetherian and the polynomial ring over it in any finite number of variables is catenarian. The dimension formula (resp: the dimension inequality) holds for a domain A relative to an overdomain A' means for every prime ideal P' in A', considering the localizations R' = A'P, and R = Ap with P = P' n A, we have dim(iZ') + restrdeg fi P' = dim(i?) + tidegRR' (resp: dim(R') + restrdeg fl i?' < dim(.R) + trdeg f i P'). The dimension formula (resp: the dimension inequality) holds in a domain A means it holds for A relative to every affine domain over A. Thus (T47.6) says that the dimension formula holds in any affine domain over a field. In (Q12) we sharpen Comment (Q7)(C13) on Univariate Ideal Extensions, which in the present summary corresponds to part (5) of the Dimension Lemma (Q7)(T30), by proving the Multiple Ring Extension Lemma (T55) which says that: the dimension inequality holds in any noetherian domains A relative to any affine domain A' over A, and i{A' = A[x\,..., xm] for some m £ N and X\ , . . . , Xyyi i n A1 then for any prime ideal P' in A' lying above any prime ideal P in A, considering the localizations R' = A'P, and R= AP we have restrdegijP' < m with trdeg^i?' < m, and if either trdeg R i?' — m or the polynomial ring B = A[X\,.. .,Xm] in m variables is catenarian, then dim(i?') + restrdegfl.R' = dim(P) + trdeg f i P'. The m = 1 case of (T55) is designated as the Simple Ring Extension Lemma (T54). In the Catenarian Ring Theorem (T58) we show that a ring A is universally catenarian iff A is a noetherian catenarian ring and the dimension formula holds in A/H for every prime ideal H in A. In (Q13) on Associated Graded Rings And Leading Ideals, given any ideal / in a ring R, we define the associated graded ring of (R, I) to be the naturally

§5: SUMMARY OF LECTURE L5 ON PROJECTIVE VARIETIES

617

graded ring grad(.R, /) = ©„ e N(-^"/^ n + 1 ) w i t ^ external direct sum of Z-modules. The multiplication is defined by g r a d ^ j 7 ) ( / m ) •grad" f i / ) (/ n ) = grad^""^/™/™) for all fm G Im and / „ G / " with m, n in N. Here grad" fl]/) : / " -> grad(#, /) is the composition of the natural surjection A„ : In —> (In/In+1) and the natural injection ^n : (In/In+1) -> grad(i?,J). We put gi&d(R,I)n = gmd^RJ)(In). For any h G R we put lefo(_Ri/)(/i) = gradjyj^/i) or 0 according as ovd(Rj)h = n G N or ord( fli /)/i = oo; we call this the leading form of h relative to (R,I). For any ideal J in R we put ledi( fi] j)(J) = the homogeneous ideal in grad(i?,/) generated by (lefo(flj)(/i))h 6 j and we call this the leading ideal of J relative to (R, I). Any family of generators x = (XI)I^L of I induces a unique graded ring epimorphism e ^ / } : k[(Xi)i€L] -> grad(i?,7) with k = R/I such that Qx{RJ){z) = /J.0{Z) for all z G k and ®xRI){Xi) = yi for all I G L where (X;)/ e £ are indeterminates over k and yi = gradj fl n(#/); we say that QXR 7> is induced by (R, I) and x. Note that grad(.R, /) is a semihomogeneous (homogeneous if L is finite) ring over the ring g r a d ( # , / ) 0 with grad{R,I) = grad(il,/) 0 [grad(ii,/)i] = grad(R,I)0[(yi)ieL\. In case R is a quasilocal ring with / = M(R), in the above paragraph we write R in place of (R,I), except that we write grad(i?) and grad(i?)„ in place of grad(i?,/) and grad(i?,/)„ respectively. A valuation function on a ring R is a map t ) : f i - » Z U {°°} such that for all x, y, z in R we have: (i) v(xy) =v(x) +v(y), (ii) v(x + y) > mm(v(x),v(y)), and (Hi) v(z) = oo <$=> z = 0, with the usual understanding about oo. Thus in the notation of L3§11, a valuation function is like a valuation with R and Z playing the roles of the field K and the ordered abelian group G respectively. If v is a valuation function on a nonnull ring R then clearly R is a domain and we get a unique valuation v : QF(i?) - > Z U {oo} such that for all x, y in Rx we have v{x/y) = v(x) — v(y); we say that the valuation v is induced by v; usually we denote v also by v. Theorem (T60) says that for any given ideal / in a ring R we have the following: (1) / T^ R and o r d ^ j ) is a valuation function on R iff R is /-Hausdorff and grad(i?, /) is a domain. (2) Upon letting R* = R/J* and I* = I/J* with J* = nn&nln, we have that R* is 7*-Hausdorff, and if grad(i?,/) is a domain then R* is a domain and o r d ^ . j . ) is a valuation function on R*. (3) Let : R —> R' be a ring epimorphism, with J = ker(), such that the leading ideal grad( fl) j)(J) is a (homogeneous) prime ideal in grad(i£, / ) ; then n nS N(«/ + In) is a prime ideal in R. Given any local ring R with dim(-R) = d and emdim(ii) = e, let x = (x\,... ,xm) be a finite number of generators of M(R), let X = (Xi,..., Xm) be indeterminates over the field k = R/M(R), and recall that QR : k[X] —> grad(ii) is a graded ring epimorphism where k[X] = ^ „ 6 N &[X]„ i s t n e naturally graded polynomial ring with fc[X]n = the set of all homogeneous polynomials of degree n (including 0). Theorem (T61) says that then we have: (1) ker(O^) n k[X]i = 0 •& m = e.

618

LECTURE L6: PAUSE AND REFRESH

(2) ker(e^) = 0 «• TO = d. (3) If m = e then: R is regular <£> k e r ^ ) = 0. (4) R is regular «• there is a permutation
a(i) - E l < j < e aiJX
In (Q14) we define a Completely Normal Domain to be a domain i? such that: x G QF(i?) with dxn G # for some d G i? x and all n G N => x G fl. Complete Normality Lemma (T63) says that for any given ring R we have the following: (1) If R is a completely normal domain then ii is a normal domain. If R is a noetherian normal domain then R is a completely normal domain. (2) If / is an ideal in R such that for all z G R we have C\nen{zR-\-In) = zR, and grad(-R, / ) is a completely normal domain, then R is a completely normal domain. (3) For any ideal / in R we have that: / ^ R iff grad(i?, /) is nonnull. (4) If R is noetherian and / is any ideal in R then grad(.R, I) is noetherian. (5) If R is noetherian and has an ideal / with / c jrad(i?) such that grad(-R, / ) is a normal domain, then R is a normal domain. We proved L4§8(T24.1) as the Krull Intersection Theorem (Q4)(T7), we shall prove L4§8(T24.3) as Another Ord Valuation Lemma (Q15)(T71), and we prove L4§8(T24.2) as the Ord Valuation Lemma (Q14)(T64) which says that given any regular local ring R, for all nonzero elements x and y in R we have ovd.R(xy) = ordflX + ordfly, and hence R is a normal domain and we get a valuation ordfl : QF(iZ) —> Z of QF(R) by putting ord#(3;/y) = OKIRX - ord^j/. To summarize (Q15) on Regular Sequences And CM Rings, let R be a ring, let x = (x\,... ,xn) be a sequence of elements in R of length n G N, for 0 < TO < n let Im be the ideal (x\,..., xm)R, and let V be a module over R. We say that a: is a F-regular sequence, if InV ^ V and for 1 < m < n we have xm G 5ij(V r /(/ m _iV)). Moreover if J is an ideal in R with In c J then we say that a: is a V-regular sequence in J, and if for every z G J fl SR(V/(InV)) we have InV + zV = V then we say that a; is a maximal V-regular sequence in J. We define the (R, J)-regularity of V by putting reg( fi JJ V = the maximum length of a V-regular sequence in J. If R is local with J = M(R) then we call this the i?-regularity of V and denote it by reg^V; in this case we write reg(R) in place of regfli? and we call it the regularity of R. We define the generating number of V in R by putting gnb^V = the smallest number of generators of V. An ideal / in R is said to be an ideal-theoretic complete intersection in R if / ^ R and htft/ = gnb^Z. A finitely generated i?-module is free means it is isomorphic to Rd for some d G N. In case R is local and V is finitely generated, V is said to be a CM ( = Cohen-Macaulay) module if r e g ^ ^ = dim(i?/(0 : V)R). A local ring R is said to be a CM ring if it is a CM module over itself, i.e., if reg(-R) = dim(ii). A noetherian ring R is said to be a CM ring if, for every maximal ideal P in R, the localization Rp is a CM ring. An ideal / in a noetherian ring R is said to be unmixed if for every prime P in assR(R/I) we have h t ^ P = lit/?/. The unmixedness theorem is said to hold in a noetherian ring R if every ideal in R,

§5: SUMMARY OF LECTURE L5 ON PROJECTIVE

VARIETIES

619

which can be generated by as many elements as its height, is unmixed. In Cohen-Macaulay Theorem (T70) we show that for any noetherian ring R we have the following: (1) If R is CM and x = (x\,..., xn) is any i?-regular sequence in R with n G N, then upon letting In = xR we have htfl/„ = n and for every P G ass^(i?//„) we have \ARP — n, and hence in particular /„ is an ideal-theoretic complete intersection in R, and /„ is unmixed. (2) If R is CM then for any nonunit ideal I in R we have iegtRI^R = htpl. (3) If R is CM and / is a nonunit ideal I in R which is an ideal-theoretic complete intersection in R then / is generated by an irregular sequence. (4) If R is CM then its localization Rs at any multiplicative set S in it is CM, and hence in particular its localization Rp at any prime ideal P in it is CM. (5) If R is CM then R is catenarian. (6) If R is CM and local then the length of any absolutely saturated prime ideal chain in R equals dim(-R) and for any nonunit ideal I in R we have htRI = dim(R) dim(R/I). (7) R is CM o- the unmixedness theorem holds in R. (8) R is a finite variable polynomial ring over a field => R is CM. (9) R is a finite variable power series ring over a field =4> R is CM. (10) R is CM =>• every finite variable polynomial ring over R is CM. (11) R is CM => every finite variable power series ring over R is CM. (12) R is CM =$• every homomorphic image of R is universally catenarian. In Another Ord Valuation Lemma (T71) we show that given a local ring R, for any x € M(R) \ZR{R) we have that: (i) R/(xR) is a local ring whose dimension is one less than the dimension of R, (ii) if R/(xR) is regular then ordftcc = 1, and (iii) if ordflx = 1 and R is regular then R/(xR) is regular. In (Q16) on Complete Intersections And GST Rings we start by making the following definitions. A local ring R is said to be a complete intersection if there exists a regular local ring T and an epimorphism

Lectures on linear algebra

Read more

Lectures on Linear Algebra

Read more

Lectures on Linear Algebra

Read more

Lectures on Algebra Volume 1

Read more

Six lectures on commutative algebra

Read more

Lectures on Infinite Dimensional Lie Algebra

Read more

Lectures on infinite-dimensional Lie algebra

Read more

Lectures on Infinite Dimensional Lie Algebra

Read more

Lectures in Abstract Algebra: 002

Read more

Linear Algebra in 25 Lectures

Read more

Six Lectures on Commutative Algebra (Progress in Mathematics)

Read more

Lectures on computational fluid dynamics, mathematical physics, and linear algebra

Read more

Lectures on Duflo Isomorphisms in Lie Algebra and Complex Geometry

Read more

Lectures on Antitrust Economics (Cairoli Lectures)

Read more

Lectures on entire functions

Read more

Lectures on Macroeconomics

Read more

Lectures on Macroeconomics

Read more

Lectures on Lie Groups

Read more

Eight Lectures on Yoga

Read more

Lectures on quantum mechanics

Read more

Lectures on Quark Matter

Read more

Lectures on invariant theory

Read more

Lectures on differential geometry

Read more

Lectures on Quantum Mechanics

Read more

Lectures on algebraic cycles

Read more

Lectures on Don Quixote

Read more

Lectures on QCD. Applications

Read more

Lectures on amenability

Read more

Lectures on vanishing theorems

Read more

Lectures on hyperbolic geometry

Read more

Recommend Documents

Lectures on linear algebra

Lectures on Linear Algebra

Lectures on Linear Algebra

Lectures on Algebra Volume 1

Six lectures on commutative algebra

Modern Birkhäuser Classics Many of the original research and survey monographs in pure and applied mathematics publishe...

Lectures on Infinite Dimensional Lie Algebra

Lectures on infinite-dimensional Lie algebra

Lectures on Infinite Dimensional Lie Algebra

lectures en Infinite-Dimensional Lie Algebra Minoru Wakimoto World Scientific Lectures on Infinite-Dimensional Lie...

Lectures in Abstract Algebra: 002

Linear Algebra in 25 Lectures

Linear Algebra in Twenty Five Lectures Tom Denton and Andrew Waldron March 27, 2011 Edited by Rohit Thomas 1 Content...