This content was uploaded by our users and we assume good faith they have the permission to share this book. If you own the copyright to this book and it is wrongfully on our website, we offer a simple DMCA procedure to remove your content from our site. Start by pressing the button below!
, there exists h; E k[Yll'" ,Ym] such that X2-S+ h~, and henee there exists g2 E G sueh that lp(g2) divides lp(x2 - h~) = X2. Sinee the only terms strieUy smaller than X2 are terms involving Xl and the y variables only, and since G is the reduced Gr6bner basis and any term involving Xl could be reduced using 91 = Xl - hl, we must have 92 = X2 - h 2 for sorne h 2 E k[y!, ... ,Yml We proceed in a similar fashion for the remaining Xi'S.
84
CHAPTER 2. APPLICATIONS OF GROBNER BASES For the converse we first note that 4> is onto if and only if
Xi
E im( 4» for
1 ::; i ::; n. Since Xi - hi E C, we have Xi -S+ hi_ Since hi is a polynomial in the y variables alone, we have that Xi is in the image of 4> by Theorem 2.4.4, and hence 4> is onto. Again by Theorem 2.4.4, we see that Xi = hi(f"", , fm). D The above result gives an algorithm for determining whether the map 4> is onto or not. We first compute the reduced Gr6bner basis G for the ideal K, and, by inspection, we check whether there exists Yi = Xi - hi E G for each i = 1, ... ,n, with hi E k[Yl,'" ,Yml. EXAMPLE 2.4.8. We have seen in Example 2.4.6 that the map 4> was not onto, since x5 was not in the image of 4>. AlBo, the Gr6bner basis G did not have a polynomial of the form X ~ h(u, v). Now consider
4>' :
Q[u, v, wl
--->
Q[xl
u v w
~
x 4 +x
We Wf.mt to determine whether <jJ* is anto. We first compute a Gr6bner basis G* for the ideal
=
x3 )W
-
with respect to the lex term ordering with
X
K*
~U2V
(u - x 4
-
X, V -
x5 )
ç
Q[u, v, w, x],
> u > v > w to get
+ v 2 w + 2vw + w, u 3 ~ v 4 ~ 3v 3 ~ 3v 2 ~ v}
Since we have x - uv 2 + uv - u + w 2 E· G*, the map 1>* is anto. In fact we have X = 4>'(uv 2 ~ uv + u ~ w 2 ) = (X4 + X)(X3)2 ~ (x4 + x)x3 + x4 + X ~ (x5)2 We now extend the preceding resnIts to quotient rings of polynomial rings.
DEFINITION 2.4.9. An k-algebra is called an affine k-algebra if it is isomorphic as a k-algebra to k[Xl, ... , xnl/ l for sorne ideal l of k[Xl, ... , xnl. Clearly the polynomial ring k[Xl,'" , xnl is an affine k-algebra. Moreover, if fI, ... , fm E k[Xl, ... , Xn ], then, as we saw at the beginning of this section, the image, k[fI, ... , fm], of the map
4>: k[Yl,'" ,Yml--->k[xl,'" ,xnl which sends Yi to fi is isomorphic to k[Yl, .. . , Yml/ ker( 4» and hence is an affine k-algebra. We now want to study k-algebra homomorphisms between affine k-algebras. Let J be an ideal of k[Yl,." , Yml and let l be an ideal of k[Xl'''. , xnl. Consider a k-alge brahomomorphism
4>: k[Yl,". ,Yml/J ---> k[Xl, ... ,xnl/I.
2.4. POLYNOMIAL MAPS
85
Let us assume that fi+ J · We note that the rnap
= (91, ... ,9'), then, for i = l, ... , t,
9i(fI,··· ,fm) E J.
This condition can he checked easily using a Gr6bner basis for J and Theorem 1.6.2. Generalizing Theorem 2.4.2 we have THEOREM 2.4.10. LetK be the ideal ofk[YI, ... ,Ym,XI,.·· ,xnl whose generators are those of J together with the polynomials Yi - fi, 1 SiS m, that is, K = (J, YI - fI, ... ,Ym - fm). Then ker(
PROOF. Let f' E Knk[YI, ... ,Ym]. Then we can write m
f'(YI, ... ,Ym) = I)Yi-f;(XI, ... ,xn))hi(YI, ... ,Ym,XI,··· ,xn ) i=l
where W(YIJ··. ,Ym,Xl,'" ,Xn) = l:::Uv {YIJ ... ,Ym,Xl,'"
,Xn}Vv(Xl, ... ,Xn)
v
w{JI, ... ,lm, XI,
... ,Xn ) =
L
Uv(!I, ...
,fm, XI,
... ,Xn}Vv(Xll'"
,Xn )
El,
v
sînce each Vj.I El. Conversely, let f' E k[YI, ... ,Ym] with
f'(YI, ... ,Ym) = (f'(YI, ... ,Ym) - f'(fI,··· ,fm)) + f'(fI, ... ,fm) =
L
Cv (Yr'
... y:;,= -
g' ... f:;'=) + f'(fI,···
,fm).
v
By Lemrna 2.4.1, v
is in the ideal (YI - fI, . . . ,Ym -
f m) and hence
f'(YI, ... ,Ym) E (J,YI- fI,··· ,Ym - fm) = K,
CHAPTER 2. APPLICATIONS OF GROBNER BASES
86
since J'(ft, ... ,!m) El. Therefore J'(Y" ... ,Ym) E Knk[y" ... ,Ym].
D
We now prove the analog of Theorem 2.4.4. THEOREM 2.4.11. Let K = (1, y,- ft, ... ,Ym- fm) be the ideal as in Theorem 2.4.10, and let G be a Griibner basis for K with respect ta an elimination arder with the x variables larger than the y variables. Then f +1 E k[x" ... ,xnl/1 is
in the image of q, if and only if there exists h E k[y" ... , Ym] such that f In this case f + 1 = q,(h+ J) = h(f" ... ,fm) + I.
-S + h.
PROOF. Let f +1 be in the image of q,. Then there exists g E k[y" ... ,Ym] sueh that f - g(f" ... ,fm) El. We consider the polynomial f(x" ... ,xn )g(y" ... , Ym) E k[y, , ... ,Ym, x" ... ,xn ]. Since f(x" ... ,xn ) - g(y" .. . ,Ym) = g(f" ... ,fm) - g(y" ... ,Ym) + (f(x" ... ,xn ) - g(f" ... ,fm)), we have, using Lemma 2.4.1, that f(x" ... ,xn ) -g(y" ... ,Ym) is in K. The argnment proceeds as in the proof of Theorem 2.4.4 (Exercise 2.4.8). Conversely, let f E k[x" ... ,x n ] be such that Then.] - h E K, and hence
f -S + h with h
E k[y" ... ,Ym].
m
Lgi(Yb'" ,Ym, Xl,
...
,Xn)(Yi - fi(Xll ... ,xn ))
+ W(Yll'"
,Yml Xl,··· ,Xn ),
i=l
where W(Yll'" ,Ym,Xl,··· ,Xn) = LUV(Yll'" ,Ym,Xl,··· ,Xn)Vv (Xl,'"
,Xn)
v
with V v Eland where 9i, Uv E k[Yll'" ,Ym, Xl, ... ,xn ]. If we substitute fi for Yi, we see that f - h(f" ... ,fm) E l, and henee f + 1 = q,(h + J). D As before we have COROLLARY 2.4.12. Continuing the notation of Theorem 2.4.11, we have that
f
+I
E
k[x" . .. , xnl/ 1 is in the image of q, if and only if Nc(f) E k[y" ... ,Ym].
We finally determine whether the map q, is onto, again in a fashion similar to Theorem 2.4.7. THEOREM 2.4.13. Let K be the ideal as in Theorem 2.4.10, and let G be the reduced Grobner basis for K with respect ta an elimination arder with the x variables larger than the Y variables. Then q, is onto if and only if for each i = 1, ... ,n there exists a polynomial 9i = Xi - hi E G , where hi E k[Yl,' .. ,Ym]. PROOF. The proof is similar to the one of Theorem 2.4.7 and is left to the reader (Exercise 2.4.9). D
87
2.4. POLYNOMIAL MAPS EXAMPLE
2.4.14. Consider the following
1>:
---->
---->
x~y+I
---->
2xy+I,
---->
w, u 2 ~ v 2 ~ w 2 ~ 2w) ç
where J
= (uv
~
(x+y)(x~y) ~2xy+I
x 2 ~ y2 ~ 2xy + I
= 0,
and
1>(u 2 ~ v 2 ~ w 2 ~ 2w + J)
+ y)2 ~ (x ~ y)2 ~ 4x 2y2 ~ 4xy + I ~4x2y2 + I = O. (x
We now compute the kemel of 1> as in Theorem 2.4.10. 80 let K = (x 2 ~ 2xy,y2,u ~ (x
+ y), v ~ (x ~ y),w ~ 2xy) ç
We compute the reduced Grobner basis G for K with respect to the lex order with x > y > u > v > w to get G=
{u 2-
2w v
"
2Y -
1 + -v 1 x - -u 1 - -v 1 uv - w vw uw w -u 2 2' 2 2' '"
2} .
By Theorem 2.4.10, we have
(u 2 ~ 2w + J, v 2 + J, uv ~ w + J, vw + J, uw + J, w 2 + J) (u 2 ~ 2w+ J,v 2 + J,vw + J,uw+ J,w 2 + J) = L/J,
ker( 1»
2w, v 2 , VW, UW, w 2 , uv - W , u 2 - v 2 - w 2 - 2w) (recall that uv ~ w E J). Also, by Theorem 2.4.13, we see that the map 1> is onto, sinee x - ~u - ~v and y + ~v are in G. In fact we have where L = (u 2
-
!u
x
+I =
1
1
21>(u + v + J) and y +I = 21>(u ~ v + J).
To conclude, note that, by the First Isomorphism Theorem, we have
(
88
CHAPTER 2. APPLICATIONS OF GROBNER BASES
Exercises 2.4.1. Without using a Computer Algebra System, use the method of Example 2.4.3 to find ker(1)), where 1>: lQI[u, v] ---> lQI[x] is defined by 1>: u r--+ x 2 and 1>: v ---> x"' 2.4.2. Compute generators for ker(1)), where 1>: lQI[u,v,w] ---> lQI[x,y] is defined by 1>: u r--+ x 2 + y, 1>: v r--+ X + Y and 1>: w r--+ X - y2 Is 1> onto? 2.4.3. Determine whether the map 1> is onto, where 1>: lQI[r, u, v, w] ---> lQI[x, y] is defined by 1>: r r--+ X2 + y, 1>: u r--+ X + y, 1>: v r--+ X - y2, and 1>: w r--+ x2 + y2 2.4.4. (Shannon and Sweedler [ShSw]) This exercise assumes that the ,eade, is familiar with the definition of the minimal polynomial of an algebraic element over a field. Let f, ft, ... ,fm E k[XI,'" , Xn]. Consider the ideal K = (y - f, YI - ft,··· , Ym - fm) in k[y, YI,·.· , Ym,Xl,··· ,xn ]. Let G be the reduced Gr6bner basis for K with respect to the lex ordering with Xl > ... > Xn > Y > YI > ... > Ym' Let Go be the set of all polynomials in G involving only the variables y, YI, ... ,Ym and in which Y actually appears. a. Show that f is algebraic over k(fl, ... , f m) if and only if Go # 0. [Hint: In the case where we assume that fis algebraic over k(fl, ... , fm), we can find h= Lvhv(Yl"" ,Ym)Yv such that h(f,ft, ... ,fm) =0 and
ho (fI, ... ,fm) # O. We have h -S+ O. Analyze this reduction.] b. In the case that Go # 0 let go E Go be such that lp(go) is least. Show that go (y, fI,' .. ,!m) is a minimal polynomial for f over k(fl, . .. ,fm). 2.4.5. A polynomial f E k[Xl, ... , x n ] is called symmetric if
for all permutations (Y of {l, ... , fi}. In Exercise 1.4.18 we saw that the set of symmetric functions is a k-algebra generated by the following n functions: 0"1 0"2
+ X2 + ... + X n XIX2 + XIX3 + ... + XIXn + X2X3 + ... + Xn_lX n Xl
Use Theorem 2.4.4 to give a method for deciding whether a given function E k[.Tl, ... ,xn ] is a symmetric function. Use this method to check your answer in Exercise 1.4.18, part d. 2.4.6. (Shannon and Sweedler [ShSw]) In this exercise we extend the results of this section tomaps fromsubrings of k(Yl,'" ,Ym) to k(Xl, ... ,Xn), where k(Yl, ... , Ym) and k(xl,' .. ,xn ) are the fields of fractions of k[Yl, ... ,Ym] and k[Xl, . .. , x n ] respectively.
f
2.4. POLYNOMIAL MAPS
89
a. We consider the following map
q,:
k[YI,' .. , Ym]
---->
Yi
r----+
k[XI"'" Xn] fi.
Let P = ker( q,). Prove that q, can be extended ta a k-algebra homomorphisffi
7/J: k[YI,'" ,Ym]P ---->k(XI"" ,Xn),
HI
2.4.7.
2.4.8. 2.4.9. 2.4.10.
where k[YI,'" ,Ym]P = f,g E k[YI, ... ,Ym] andg rt P}. Note that k[YI, ... , Ym]P is a subring of k(YI,." , Ym). It is called the localization of k[YI,' .. , Ym] at P. b. Give a method for coruputing generators of ker( 7/J). c. Prove the following analog of Theorem 2.4.7. Let K be the ideal of k[YIJ'" ,Yml Xl,··· ,X n ] generated by Yi - fil i = 1, ... ,m. Let G be a Grübner basis for K with respect to the lex term arder with Xl > x2 > ... > x n > YI > ... > Ym' Then 7/J is onto if and only if for each i = 1, ... ,n there exists Yi E G such -that 9i = (XiIi - f3il where Qi rtp, "i E k[YI,'" ,Ym] andfli E k[Xi+I"" ,Xn,YI,··· ,Ym]. then Xi = ~.u, 8;(f"".,!m) [Hint: If Xi is in the image of 0/', 'P .... .!m) for some Di, "Ii E k[YI, ... , Ym] such that "Ii rt P. Choose "Ii such that lp(?;) is the smallest possible. Consider ti = ?i(YI, ... ,Ym)Xi -Oi(YI, ... ,Ym)' Prove that t i E K. Prove that there erists a polyuomial gi E G such that lp(gi) = yr' ... y:;'mXi, and sa conclude that gi is of the required farm. For the converse, first prove that X n is in the image, then Xn-l, etc.] d. Consider the map q,: k[u, v] ----> k[x] defined by u ----> X4 + x and v r----+ x 3 . Compute generators of P and determine whether 'if; is onto. e. Consider the map q,: Q>[u, v, w] ----> Q>[x, y] defined by u ----> x 2 + y, and v 1-----+ x + y and W J-----+ x - y2. Compute generators of P and determine whether 7/J is onto. Letq,: Q>[u,v]---->Q>[x] bedefined byq,: u ----> x 4 +x 2 +xandq,: v ----> x 3 -x. a. Using Theorem 2.4.7 show that q, is not onto. b. Show that x3 is not in the image of q,. c. Show that the map 7/J, corresponding ta the one given in Exercise 2.4.6, is onto. Complete the proof of Theorem 2.4.11. Prove Theorem 2.4.13. Let q,: Q>[u, v, wJ! J ----> Q>[x, yJ! J be defined by q,: u + J ----> x 2 + Y + J, q,: v+J ----> x+y+J, and q,: w+J ----> X 3 +xy2 +J and where J = (uv-w) and J = (xy+y). a. Prove that q, is well-defined. b. Find the kernel of q,. c. Show that q, is onto.
CHAPTER 2. APPLICATIONS OF GROBNER BASES
90
2.4.11. For those who have the appropriate algebra skills, generalize ExerciEe 2.4.6 to the case of
k[Yj,··· ,Ym]/J Yi+J
----> ---->
k[xj, ... ,xn]/I fi+I,
where we assume that J, lare ideals in the appropriate rings and I is a prime ideal. 2.5. Sorne Applications to Algebraic Geometry. In this section we will apply the results of the previous sections to study maps between varieties. Throughout tills section, for 1 an ideal in k[xl, ... ,X n ], we will consider the variety V,,( I) c::: k n as we did in Section 2.2. We will abbreviate ·this variety more simply by V(I). We begin by considering projection maps
km+n (2.5.1)
(al l
...
,am,bll
···
,bn )
If we apply this map to a variety V, we may not obtain a variety. For example, the variety V(xy -1) projects onto the x-axis minus the origin, and this is not a variety. We are interested in finding the smallest variety containing 7f(V). Before we do this we give the following general proposition. PROPOSITION 2.5.1. If Sc
k n , then V(I(S)) is the smallest variety contain-
ing S. That iB, ifW iB any variety containing S, then V(I(S)) called the Zariski closure of S.
c:: W
This set iB
W = V(J) c:: k n be a variety containing S, where J is an ideal in k[xj, ... ,Xn]. Then I(W) c I(S) and V(I(S)) C V(I(W)). But V(I(W)) = V( v0) = V(J) = W, by Theorem 2.2.5. Therefore V(I(S)) c:: W 0 PROOF. Let
As a simple example of the above proposition, consider tWQ varieties V and W contained in k n. Then V ~ W need not be a variety, and its Zariski closure is V(I(V - W)). We note that I(V - W) = I(V): I(W) (Exercise 2.5.2). Recall that we showed how to compute the ideal quotient in Lemmas 2.3.10 and 2.3.11. EXAMPLE 2.5.2. Consider the varieties V = V(x(y - z),y(x - z)) and W = V(y-z) in IC"' Then V consists of the four lines y = z = 0, x = z = 0, x = y = 0, and x = y = z. Moreover W is the plane y = z which contains just two of these lines, namely y = z = and x = y = z. Thus V - W consists of the union of the two lines x = z = and x = y = excluding the origin. We use the above method to compute the smallest variety containing V - W, namely V(I(V - W)) (although it is geometrically obvious that this variety is the union of the two lines including the origin). By the above we have I(V - W) = I(V): I(W). AIso, it is easy to see that I(V) = (x(y - z), y(x - z)) and I(W) = (y - z),
° °
°
2.5. 80ME APPLICATIONS TO ALGEBRAIC GEOMETRY sinee y'(x(y - z),y(x - z)) = (x(y - z),y(x - z)) and y'(y - z) compute I(V): I(W) using Lemma 2.3.11 to get
(X (y - z), y(x - z)): (y - z)
=
91
= (y - z). We
(x, yz).
Therefore the smallest variety containing V - W is, as we observed above, the union of the two lines x = z = and x = y = O. We now return to the projection map (2.5.1).
°
THEOREM 2.5.3. Let I be an ideal in k[YI, ... ,V=,X" ... ,xn ]. The Zariski closure o/7r(V(I)) is V(I n k[YI, ... ,Y=]). PROOF. Let V = V(I) and Iy = Ink[YI, ... ,Ym]. Let us first prove that 7r(V) c:: V(Iy). Let (a" ... , am, b ... ,bn ) E V, so that (a" ... ,am) E 7r(V). If " ) = 0, since 1 E I, and thus I(a" ... ,am) = 1 E Iy, then I(a" ... ,a=, b ... ,b n 0, sinee 1 contains only y" variables. Therefore 7r(V) c:: V(Iy). In view of Proposition 2.5.1, to complete the proof of the theorem we need to show that c:: V(I(7r(V))). Wc first show that I(7r(V)) c:: ..jI;. Let 1 E I(7r(V)), so that I(a" ... ,am) = for ail (a" ... ,a=) E 7r(V). If we view f as an element of k[Yll'" ,Yrn,Xl, ... ,xnL then f(al, ... ,am,bl,". ,bn ) = 0 for all (a" ... ,am, bl , ... ,bn ) E V. By Theorem 2.2.5, there exists an e such that E I. But, sinee f involves only YI,··· ,y=, E Iy, and hence 1 E ..jI;. Now we have V(Iy) = V(..jI;) c:: V(I(7r(V))). This complete the proof of the theorem. D
V(Iy)
°
r
r
We now turn our attention to an application of Theorems 2.3.4 and 2.5.3. We consider a map r.p: lin ----+ k'l7\ given by
h,··· ,lm: YI = h(xI, ...
,X n )
Y2 = h(XI, ... ,X n )
Ym
=
I=(XI, ...
These equations define a variety in
,X n ).
k m+n namely 1
V= V(YI- h,··· ,y= - lm). (We note that V is, in fact, the graph of
92
CHAPTER 2. APPLICATIONS OF GROBNER BASES
Consider the projection
Tim+n
rr:
Then we can see that rr(V) is the image of 'P, and so we apply Theorem 2.5.3. That is we let l = (Yl -
ft,··· ,Ym - Jm) ç k[Yl, ... ,Ym, Xl,···
,X
n].
Then the Zariski closure of rr(V) is V(I n k[Yl, ... ,Ym]). Note from Theorem 2.4.2 that the Zariski closure of the image of 'P is defined precisely by the ideal of relations of the polynornials ft,··· ,Jm. Therefore, ta find a set of defining equations for that variety, one computes a Grübner basis G for l with respect ta an elimination order with the X variables larger than the y variables. The polynomials in G which are in the y variables only are the desired polynomials. EXAMPLE 2.5.4. Consider the map 'P: 80
1[;2
1[;4
(x, y)
(X4,X3 y , xy3, y4)
that the parametrization is given by:
r U
(2.5.2)
v W
x4 x3 y xy3 y"
The Grübner basis for the ideal l = (r - x4, u - x 3 y, V - xy" w - y4) with respect to the lex term ordering with x > y > r > u > v > w is G = {uw 2 - v 3 ,rwuv, -rv 2 + u 2 w, -r 2 v + u 3 , _y4 + w, -xw + yv, -xv 2 + yuw, -xu + yr, XTV yu 2 , _ xy 3 + V, -x 2 v + y 2 u, _x 2 y 2 r + u 2 , _x3 y + 1t, _x4 + r}. The polynomials that do not involve x or y are-those that determine the smallest variety containing the solutions of the parametric equations (2.5.2):
More generally, we consider maps between two varieties V given by polynomials; Le.)
v
--->
(a" ... ,an)
>--->
a:
ç k n and W ç km
W (ft(a" ... ,an), ... ,Jm(a" ... ,an)),
where ft, ... ,fm E k[Xl' ... ,Xn]. Sueh a map Q' gives rise to a k-algebra homomorphism cr'" between the affine k-algebras k[YL.·. ,Yml/I(W) and k[Xl, ... ,xnl/ I(V) as follows: a':
k[Yl, . .. ,Ym]/ I(W) Yi + I(W)
---> e---->
k[xl, ... ' xn]/I(V) Ji + I(V).
2.5. SOME APPLICATIONS TO ALGEBRAIC GEOMETRY
93
To see that a' is well-defined, we need to show that for ail 9 E I(W), we have ,f=) E I(V). But for ail (a"", ,an) E V, wehave a(a"", ,an) E W, and hence
gU,,···
as desired. We have then that a' is a k-algebra homomorphism. Note also that if the map a is the identity map of the variety V onto itself, then the corresponding map Œ' is the identity of the affine algebra k[XI,'" ,xn ]/ I(V) onto itself (this follows since the field k is infinite). Thus the study of the map a between varieties might be done by studying the corresponding map Œ' between the corresponding affine k-algebras. We will give two examples illustrating this idea: deterrnining the image of a variety and determining whether a given map is a variety isomorphism. Suppose that we have a variety V in k n and a map a into k = given by polynomials h, ... ,f= E k[XI,'" ,xn ]: V
------+
li rn
(a" ... ,an)
>--+
(h(a" ... ,an), ... ,f=(a" ... ,an)).
Ct: :
We would like to deterrnine the Zariski closure of the image of the map Œ. In the case when V = k n we did this at the begining of this section. We can find I(im(Œ)) by considering the corresponding map Œ':
PROPOSITION 2.5.5.
k[YI, ... ,Y=] Yi
-+
>--+
k[XI,,,., xnl/ I(V) fi + I(V).
A polynomial 9 E k[YI, ... ,Y=] is in I(im(Œ)) if and
only if 9 E ker(a'). I(im(a)). Then for any (a"", ,an) E V, g(a(a"" . , an)) = 0, and hence g(a(xI'''' ,Xn)) E I(V), so that a'(g) = 0 and 9 E ker(a'). The PROOF. Let 9 E
argument is clearly reversible.
D
This proposition together with Theorem 2.4.10 gives us an algorithm for computing the ideal I(im(Œ)), and hence for determining the smallest variety containing im(Œ). EXAMPLE 2.5.6. Let V be the variety in C 2 defined by X2 + y2 -1 (a circle in the x, Y plane). Consider the map Œ given by the polynomials h = X2, h = y2, and h = xy; i.e. Œ:
V
(x, y)
CHAPTER 2. APPLICATIONS OF GROBNER BASES
94
The corresponding map
0;*
is
qx, y]/ (x 2 + y2 - 1) ---> x 2 + (x 2 + y2 - 1) y2 + (x2 + y2 _ 1) ---> ---> xy + (x 2 + y2 - 1). we consider the ideal K = (x 2 + y2 -1, u - x 2, V -
0;*:
qu,v,w] u v w
--->
Ta find I(im(a)), y2, W - XV) and find a Gr6bner basis G for K with respect ta the lex term ordering with x> y > u > v > w ta get G = {X2 + V - 1,xy - W,xv - Yw,xw + yv _ y,y2_ v, U + v-l, v 2 - V + w 2}. Therefore I(im(a)) = ker(a*) = K n
qu, v, w] =
(u + v-l, v 2 -
V
+ w2 ).
Geometrically, the equation u + v - 1 = 0 is the equation of a plane parallel ta the w axis. The equation v 2 - v + w 2 = 0 is the equation of a cylinder whose axis is parallel to the u axis. The intersection of these two surfaces is an ellipse. The second example illustrates how a* can be used for determining whether two varieties are isomorphic. DEFINITION 2.5.7. Two varieties V ç Tin and W ç Tim are said to be isomorphic over k if there are maps a: V ---> W and {J: W ---> V given by polynomials with coefficients in k such that a 0 {J = idw and (J 0 a = idv , where idv and idw are the identity maps of V and W respectively. THEOREM 2.5.8. The varieties V ç Ti n and W ç Ti mare isomorphic over k if and only if there exists a k-algebra isomorphism of the affine k-algebras
k[y" ... ,Ym]/I(W) and k[x" ... ,xnl/I(V).
PROOF.
a:
where
First let a and {J be inverse polynomial maps. Suppose that
V (a" ... ,an)
---> --->
W (J,(a" ... ,an),." ,fm(a" ... ,an)),
ft, ... ,fm E k[x" .. ' ,xn ], and suppose that {J:
W
--->
(b" ... ,bm )
--->
V (g,(b" ... ,bm ), ... ,gn(b" ... ,bm )),
where g" ... ,gn E k[y" ... ,Ym]' Note that the map ({J Xi >--+ gi(J" ... , fm) while the map a* 0 {J* is defined by
0
a)* is defined by
and sa, ({Joa)* =a*o{J*. Now, since {Joa =idv , ((Joa)* =idk[x" ... ,xnlII(V) and hence a* 0 (J* is also the identity of k[x" . .. , xnl/ I(V) onto itself. Similarly, we have that (J* 0 a* is the identity of k[y" ... ,Ym]/ I(W) onto itself. Therefore 0:* i8 a k-algebra isomorphism
2.5. SOME APPLICATIONS TO ALGEBRAIC GEOMETRY
95
For the converse, we assume that we have a k-algebra isomorphism
We will see that e = a* for a map a between the varieties V and W which is given by polynomials and such that a- l exists and is also given by polynomials. Let e(Yi + I(W)) = li + I(V), where fi E k[Xl"" ,xn ] for i = l, ... ,m, and let e-l(xj+I(V)) =gj+I(W), wheregj E k[Yl,'" ,Ym] for j = l, ... ,no Consider the maps a:
V
W (f,(a" ... ,an), ... ,fm(a"", ,an)),
--> t---+
,an)
(ail'"
and
13:
--> V W t---+ (b ... ,bm ) (g,(b ... ,bm ), ... ,gn(b ... ,bm )). " " " It is readily seen that a maps V into W, 13 maps W into V; and that a and are inverse maps. D
13
Therefore, to determine variety isomorphism, we need ta check whether a* is a k-algebra isomorphism. We have seen in Theorem 2.4.10 how to compute the kernel of Q* and in Theorem 2.4.13 how ta determine whether a* is anto. EXAMPLE 2.5.9. Consider the vitriety V ç: IC 2 defined by the equation x' yx + 1 in the x, y plane. Also, consider the variety W ç: IC' defined by the equation u 4 + u 3 + 2u 2 v + v 2 + uv + 1 in the u, v plane. Finally consider the map
V (x,y)
a:
--> t---+
W (y,_y2 -x).
We will show that this gives an isomorphism of the varieties V and W. First we show that a maps V into W. So let (x, y) E V Then if we replace u and v by y and _y' - x respectively in the equation defining W we get
y4
+ y3 + 2y' (-y'
_ x)
+ (-y'
_ x)'
+ y( _y' -
x)
+ 1 = x' -
xy + 1 = 0,
since (x, y) EV Now consider the corresponding map
a':
qu, v]/ J f +J
--> t---+
qx, y]/ l f(y, _y' - x)
+ l,
where J = (u 4 +u 3+2u 2v+v'+uv+ 1) and l = (x' -yx+l) (see Exercise 2.3.16 part a to see why 1= I(V) and J = J(W)). Let K = (x' -yx+ l, u-y, V+y2+X) be the ideal in qu, v, x, y] as in Theorem 2.4.10. We compute a Grübner basis for K with respect to the lex term ordering with x > y > u > v to get
G = {x+u' +v,y - u,u 4 +u3 + 2u2v Thus K
n k[u, v] =
(u 4 + u 3
by Theorem 2.4.10 and
80
+ uv +v' + l}.
+ 2u'v + uv + v' + 1) = J, and hence ker(a') = 0 a* is one ta one. Also, since x + u 2 + v and y - u
96
CHAPTER 2. APPLICATIONS OF GROBNER BASES
are in G, the map a* is onto by Theorem 2.4.13. Therefore a* is a IC-algebra isomorphism and, 80 by Theorem 2.5.8, Q gives an isomorphism of the varieties V and W. Note that the inverse map is given by a- 1 (u,v) = (_u 2 - v,u), for (u,v) E W If the reader is interested in studying further the ideas presented in this section we strongly recommend the book of Cox, Little, and ü'Shea [CLOS].
Exercises 2.5.1. Show that in IC 3, if the plane defined by x = 1 is removed frorn the variety V = V(xy2+XZ 2 _xy_y2 _z2 +y,x 2 +xy-2x-y+1),the Zariski closure of what remains is an ellipse. Conclude that V is the union of this ellipse and the given plane. 2.5.2. Let V and W be varieties in k n. Prove that I(V - W) = I(V): I(W). 2.5.3. a. Find the equation in IC 2 for the curve parametrized by x = t 3 , y = t 2 + 1. b. Find the equation in IC 2 for the curve parametrized by x = t 3 + l, Y =
t2 •
2.5.4.
2.5.5.
2.5.6.
2.5.7.
c. Find the ideal for the intersection of these two varieties and then determine aU points on this intersection. d. Do part c by solving the equations directly. Show by the rnethod of this section that the variety io IC 3 parametrized by x = u + uv + w, y = u + v 2 + w 2, Z = u 2 + V is all of IC"' [Hint: If you try to compute this example using lex, your computer may llot be able to complete the computation. However, if you use deglex on the u, v, w variables and also on the X, y, z variables with an elimination order between thern, you should encounter no difliculties.] Consider the variety V parametrized by x = t 3 , y = t 4 , z = t 5 in IC 3. a. ShowthatI(V) = (y5_z4,_y2+xz,xy3_z3,x2y_z2,x3_yZ). [Hiot: Use lex with x > y > z.] b. Verify that also I(V) = (xz - y2,x 3 - yz,x2y - Z2). c. Show that the tangent variety of V is parametrized by x = t 3 +3t2 u, y = t 4 + 4t3 u, Z = t 5 + 5t4 u. [Hint: The tangent variety is defined to be the union of ail the tangent lines of V. Sa this exercise is done using elementary multivariable calculus.] d. Compute generators for the ideal of the tangent variety of V. [Answer: (15x 4y2 - 48y5 - 16x 5z + 80xy3z - 30x 2yz2 - Z4).] Let V be the variety in IC 3 defined by x 2 + y2 - z2 = 0 and x 3 + y = O. Define a: V ---+ IC 4 by (a, b, c) f---' (a 2 ,a+b,c2 +a,c). Find the ideal of the image of a. In qx,y,z] let J = (_2y_y2+2z+Z2, 2x-yz- Z2) and in qu,v] let l = (uv+v).Definethemapa: V(I) ---+ V(J) by (a,b) f---' (a 2 +b,a+b,a-b). Prove that a defines an isomorphism between V(I) and V(J) (you rnay
2.6. MINIMAL POLYNOMIALS OF ELEMENTS IN FIELD EXTENSIONS
97
assume that l (V (J)) = J). 2.6. Minimal Polynomials of Elements in Field Extensions. In this section we will use the results of Section 2.4 to find the minimal polynomial of an element algebraic over a field k. We will assume that the reader is familiar with the most elementary facts about field extensions [Go, He]. The results of this section will not be used in the remainder of the book and may be skipped. Let k ç K be a field extension. Recall that if a E K is algebraic over k, then the minimal polynomial of " over k is defined ta be the monic polynomial p in one variable, with coefficients in k, of smallest degree such p( a) = O. Altematively, considering the k-algebra homomorphism k(a) defined by x r--4 a, we have that ker(
(2.6.1)
under the map defined by x + (p) r--4 a. We first consider the case where K = k(,,), with a algebraic over k, and onr goal is to compute the minimal polynomial of any (3 E K. We note that in arder to compute in k(a) it suffices, by Equation (2.6.1), to compute in the affine k-algebra k[xl/!;P). We assume that we know the minimal polynomial p of Œ. THEOREM 2.6.1. Let k ç K be a field extension, and let a E K be algebraie over k. Let p E k[x] be the minimal polynomial of Œ over k. Let 0 i' (3 E k(a), say ao + alQ + ... + ana n (3= , bo+b,a+···+bma m where ai, bj E k,O <: i <: n,O <: j <: m. Let f(x) = aD + a,x + ... + anx n and g(x) = bo+b,x+···+bmxm be the eorresponding polynomials in k[x]. Consider the ideal J = !;p,gy - f) of k[x, y]. Then the minimal polynomial of (3 over k is the monie polynomial that generates the ideal J n k[y].
Note that J n k[y] is generated by a single polynomial, sinee this is true for every ideal in k[y] (i.e. k[y] is a principal ideal domain). PROOF. Note that sinee k[x]/!;P) is a field, and g(a) i' 0 (it is the denominator of (3), there is a polynomial i E k[x] such that gi '" 1 (mod !;P)), that is gf - 1 E (p). Let h = jf. Note that h(a) = (3. Now consider
k[y] y
----> >-->
k[xl/ (P) h + !;P)
~ >-->
k(a) (3.
Note that q is in the kernel of
98
CHAPTER 2. APPLICATIONS OP GROBNER BASES
and hence y - h E (p,gy - J) so that (p,y - h) ç (p,gy - J). Conversely, gy - f == g(y - fR) = g(y - h) (mod (p)). Therefore gy - f E (p, Y - hl, and (p,gy-J) ç (p,y-h). 0 The above result gives an algorithm for finding the minimal polynomial of an element (3 in k( a): given a and (3 as in the theorem, we compute the reduced Grübner basis G for the ideal (p, gy - J) of k[x, y] with respect to the lex term ordering with x > y. The polynomial in G which is in y alone is the minimal polynomial of (3. EXAMPLE 2.6.2. Consider the extension field «.]I(a) of «.]l, where a is a root of 3 the irreducible polynomial x 5 - x - 2. Now consider the element f3 = 1_0:,::-20: E «.]I(a). We wish to find the minimal polynomial of (3. We consider the ideal J = (x 5 - X - 2, xy + 2x 3 + x -1) in «.]I[x, y]. We compute the reduced Grübner basis G for J with respect to the lex term ordering with x > y to obtain
y5
+ 211 y4 + 4 y 3 -
5y2
+ 95y + 259 } .
'i
Therefore the minimal polynomial of (3 is y5 + y4 + 4 y 3 - 5y2 + 95y + 259. This technique can be extended to a more general setting of field extensions of the form K = k(a" ... ,an). For this we need the following notation which we use for the remainder of this section. For i = 2, ... ,n and p E k( a" ... ,ai- Il [Xi], we let P be any polynomial in k[Xl' ... ,Xi] such that p(Ql, ... ,Qi-Il Xi) = p. We note that p is not uniquely defined, but every ap~ plication we will make of P will not depend on the particular choice we have made. We now determine the minimal polynomial of any element (3 of k(O!l' ... ,an) using the following result, which is similar to Theorem 2.6.1. THEOREM 2.6.3. Let K = k(O!l, . .. ,O!n) be an algebraic extension of k. For i = l, ... ln, let Pi E k(al1'" ,Lti-d[xil be the minimal polynomial oiai over k(O!I' ... ,O!i-I). Let (3 E k(O!l' ... ,O!n), say (3 - f(O!I' ... ,O!n) - g(CXl1'" ,an)' where f,g E k[Xl' ... ,Xn]. Consider the ideal J = (PI'··· ,Pn,gy- J) contained in k[Xl, . .. ,X n , y]. Then the minimal polynomial of (3 over k is the monie polynomial that generates the ideal J n k[y].
PROOF. We first show that
2.6. MINIMAL POLYNOMIALS OF ELEMENTS IN FIELD EXTENSIONS
99
using the map
We note that 4>n is anto, since Œl, ... ,Œn are algebraic over k. It remains to show that ker(
h(xn)
=
I(Œl, ... , Œn-l, x n ) E k(a" ... , Œn_l)[Xn],
and note that h(Œn) = O. Therefore Pn divides h, by definition of Pn. Say h = PnCn, forsomeC n E k(a"." , Œn-,)[xn]. Consider I-PnRn E k[Xl"" ,xn] and write
f-
pnfn
=
:L:gv(Xl1'" ,xn_dX~. v
Then, since
(f - PnRn)(Œl,'" , Œn-l, x n) = h - PnCn = 0, we see that for an v, gv(Œl 1 '" ,û:n-l) kernelof
=
O. Therefore gv(Xl, ... ,Xn-l) is in the
and hence
by induction. Thus 1 - PnRn E (Pl"" ,Pn-l)' and 1 E (Pl"" ,Pn)' Thus ker(
J= !J;"p2'Y- (Xl +X2)) = (xl -2,x~-5,y- (Xl +X2)) c::::Q[Xl,X2,Y]
with respect to the lex term ordering wih G = {1187xl - 48y 5 1187x2
-
Xl
>
X2
> Y ta obtain
45y4 + 320y3 + 780y2 - 735y + 1820,
+ 48y 5 + 45y4 -
320y3 - 780y2 - 452y - 1820,
y6 _ 6y4 _ lO y 3 + 12 y 2 - 60y + 17}.
Therefore the minimal polynomial of v'2 + ,y5 over Q is y6 - 6y4 - lO y3 + 12y2 60y+17. We also see that v'2 + ,y5 has degree 6 over Q and hence Q( v'2 + ,y5) = Q( v'2, ,y5).
CHAPTER 2. APPLICATIONS OF GROBNER BASES
100
EXAMPLE 2.6.5. The minimal polynomial of ij2 over Q iSPI = Xf-2 E Q[x,J. AU the roots of this polynomial generate the extension Q( ij2, il, where i the i complex number such that i 2 = - L We consider the element f3 = 1 + ij2. We
wish ta find the minimal polynomial of f3 over Q. First, the minimal polynomial of i over Q( ij2) is P2 = x§ + L Then P, = Pl, and P2 = P2. We consider the ideal J = (Xf - 2, x§ + l, X,y - (Xl + X2)) of Q[XI' X2, y]. The reduced Grübner basis for J with respect ta the lex term ordering with Xl > X2 > Y is computed ta be
G = {Xl - 2y3X2 +6y2X2 - 6yX2
+ 2X2,X~ + 1,y4 _4y 3 + 6y2 -4Y+~}.
Therefore the minimal polynomial of f3 is y4 - 4y 3 + 6y2 - 4y + ~. Alternatively, Q( ij2, i) = Q( ij2, iij2). The minimal polynomial of iij2 over Q( ij2) is P2 = x§ + v'2. Thus P2 = x§ + xi. We again want to compute the i minimum polynomial of f3 = 1 + ij2' We consider the ideal J = (xi - 2, x§ +
xi,xiy - (xi + X2)). The reduced Grübner basis for J with respect ta the lex term ordering with Xl > X2 > Y is computed ta be G = { xi
+ 2y2 -
4y
+ 2, X2 + 2y 3 -
6y2
+ 6y -
2, y4 - 4 y3 + 6y2 - 4y
+~} .
Sa we obtain the same result as before. Since the degree of f3 over Q is 4, and since the degree of Q( ij2, i) over Q is 8, we see that Q(f3) is a subfield of Q( ij2, i), but is not equal ta it. In the preceding two examples, we used a degree argument for deciding whether k(f3) is equal ta k(al, ... ,an)' We will give another method for determining this which has the added advantage of expressing the a;'s in terms of f3. This algorithm is a consequence of the foUowing theorem. THEOREM 2.6.6. Let al, ... 1 an and 1511'" ,Pn be as in Theorem 2.6.3. Let f(e", ... ,Œn). [ ] f3 E k ( Œn ) , say f3 = ( ) , w,th f, g E k X" ... ,X n . Let J be as
e", ... ,
9 Ql, . . . ,Qn in Theorem 2.6.3, and let G be the reduced Grobner basis for J with respect ta an elimination arder with the X variables larger than y. Then k(a"", , Œn) = k(f3) if and only if, for each i = l, ... , n, there is a polynomial gi E G such that gi = Xi - hi, for some hi E k[y]. Moreover, in"this case, Œi = hi(f3). PROOF. Let 1 = (Pl"" ,Pn ), and leU E k[XI, ... ,xn ] besuch that gR-l E 1. Set h = ff, and note that h(ŒI"" ,Œn) = f3. Consider
k[y] y
k[XI'''' ,xn]/I h+I
k(ŒI,'" ,Œn)
f3.
Then k(a"", , an) = k(f3) if and only if 1> is onto. We conclude by using Theorem 2.4.13 and the fact that J = (1, y - hl. 0
2.6. MINIMAL POLYNOMIALS OF ELEMENTS IN FIELD EXTENSIONS
101
EXAMPLE 2.6.7. In Example 2.6.4, the Gr6bner basis G contained the following polynomials 1187x, - 48y 5
-
45 y 4 + 320y3 + 780y2 - 735y + 1820,
1187x2 + 48y 5 + 45 y 4
-
320y3 - 780y2 - 452y - 1820.
This gives another proof that Q( 0 + ?'5) = Q( 0, ?'5). Moreover, we have (2.6.2)
0= 11~7
(48( 0 + ?'5)5 + 45( 0 +
V5)4 -
320( 0 + ?'5)3
-780( 0 + ?'5)2 + 735( 0 + ?'5) - 1820) , and (2.6.3)
?'5 =
11~7 ( -48( 0
+ ?'5)5 - 45( 0 + ?'5)4 + 320( 0 + ?'5)3
+ 780( 0 + ?'5)' + 452( 0 + ?'5) + 1820) . However, in Example 2.6.5, no polynomial of the form Xl - hl exists in G with h, E k[y], and thls gives another proof that Q(,6) oF Q( ij2, il. Finally, we also note that in Example 2.6.2 we have Q = Q(a) and
('-aa-a' )
1438
2183
4
3
10599
8465
2
101499
a = 45887,6 + 45887,6 - 45887,6 + 45887,6 + 45887'
where ,6 =
l-a-a? a .
ExercÏses
2.6.1. Compute the minimal polynomial of the following over Q. a. ij2+ «"4+5. b 0+7 . ij2+1' 2 +a h 3 - 0 - 1 =0. c. aa+3,werea 2.6.2. Compute the minimal polynomial of the following over Q. a. 0 + {12 + 5. b. {12 + {15 + 5. c. 0 + v'3 + y'5. 2.6.3. Show that Q(a!,a2) = Q(,6) and express a, and a2 in terms of,6, where 3
2
a, - a, - 1 = 0, a 2 = 5 and ,6 =
ŒI0!"2+1
.
01 +02
2.6.4. In Theorem 2.6.3 we required that Pi+' E k(a" ... , "'i)[xH,] be the minimal polynomial of "'H' over k(a" ... ,Qi). The minimal polynomial of QH' over k would not do as the following example shows. Let K =
CHAPTER 2. APPLICATIONS OF GROBNER BASES
102
IQI( VIz, i VIz) and (J =
VIz + i VIz. We wish to find the minimal polynomial of (J. a. Use Theorem 2.6.3 ta compute the minimal polynomial of (J over k. b. Instead of the minimal polynomial of i VIz over IQI( VIz), use the minimal polynomial of i VIz over IQI in the method described in Theorem 2.6.3, and see that, nQW, the method does not give the correct minimal polynomial of (J over IQI. 2.6.5. Find, explicitly, the quadratic subfield of IQI(() where ( is a primitive 7th root of unity. [Hint: ,Using the method of the text, note that the minimal polynomial of ( is ~..::-,' and that this quadratic subfield is the fixed field of the automorphism defined by ( >----> (2.] 2.6.6. Complete the proof of Theorem 2.6.3. 2.7. The 3-Color Problem. In this section we want to illustrate how one can apply the technique of Grübner bases to solve a well-known problem in graph theory: determining whether a given graph can be 3-colored. (The same technique would work for any coloring.) This material is based on a portion of D. B"yer's thesis [Ba]. The material in this section will not be used elsewhere in the text and may be skipped. Let us first state the problem precisely. We are given a graph g with n vertices with at most one edge between any two vertices. We want ta color the vertices in Bueh a way that only 3 colors are used, and no two vertices connected by an edge are colored the same way. If g can be colored in this fashian, then g is called 3-colorable. This can be seen to be the same as the 3-color problem for a map: the vertices represent the regions ta he colored, and two vertices are connected by an edge if the two corresponding regions are adjacent. First, we let ç = e';' E C be a cube root of unit y (i.e. = 1). We represent the 3-colors by 1, ç, ç2, the 3 distinct cube foots of unity. Now, we let Xl,' .. ,X n be variables representing the distinct vertices of the graph Çj. Each vertex is to be assigned one of the 3 colors 1, ç, ç". This can be represented by the following n equations
e
(2.7.1) AIso, if the vertices Xi and Xj are connected by an edge, they need to have a different color. Since = x;, we have (Xi - Xj)(xy + XiXj + xJ) = o. Therefore Xi and Xj will have different colors if and only if
xy
(2.7.2) Let l be the ideal of QXl,'" ,xn ] generated by the polynomials in Equation (2.7.1) and for each pair of vertices Xi, Xj which are connected by an edge by the polynomials in Equation (2.7.2). We will consider the variety V(I) contained in The following theorem is now immediate.
en.
THEOREM
2.7.1. The graph
gis 3-colorable if and only ifV(I) cl 0.
2.7. THE 3-COLOR PROBLEM
103
We have seen in Section 2.2 that we can use Gr6bner bases ta determine if V(J) = 0. We first compute a reduced Griibner basis for J. If 1 E G, then V(J) = 0 and otherwise V(I) of 0 (see Theorem 2.2.6). EXAMPLE 2.7.2. Consider the graph ç of Figure 2.1. Xs
~X~3~________________~~X2
'"" ______~X5
FIGURE 2.1. The graph The polynomials corresponcling to
xf -
ç
ç are:
1, for i
=
1, ... ,8
and
Xr + XiXj + x;,
for the pairs (i,j) E {(l, 2), (l, 5), (l, 6), (2,3), (2,4),
(2,8),(3,4),(3,8),(4,5),(4, 7), (5,6), (5,7), (6,7), (7,8)}. We compute a Griibner basis G for the ideal J corresponding to the above polynomials. Keeping in mind Corollary 2.2.11, we use the lex term ordering with Xl > X2 > ... > xs. We obtain
X6 -
Xs,
x~
+ X7XS + xâ, x~ -
1}.
Since 1 't G, we have that V(I) of 0, and hence, by Theorem 2.7.1, ç is 3colorable. We can use the Gr6bner basis G to give an explicit coloring, sinee the system of equations represented by G turns out to be easy to solve. Let us assume that the 3 colors we are using are bIue, red, and green. We must first choose a color for xs, say red, since the only polynomial in one variable in G is x~ - 1. We then must choose a different color for X7, say bIue, because of the polynomial x~ + X7X8 + x~ E G. Then we have that Xl and X3 must be blue because of the polynomials Xl - X7, x3 - X7 E G, and X4, X6 must be red because
CHAPTER 2. APPLICATIONS OF GROBNER BASES
104
of the polynomials X4 - Xg, X6 - Tg E G. Finally X2 and X5 have the same coloT, which is a different coloT from the colors assigned ta T7 and TS 1 80 X2 and Xs are green; this is because the polynomials X2 + T7 + xs, and X5 + T7 + Tg are in G. We note that it is evident from the Gri:ibner basis in Example 2.7.2 that there is only one way to color the graph g, up to permuting the colors, and 80 it should not be too surprising that solving the equations determined by the Gri:ibner basis is easy. However 1 if there is more than one possible coloring, the Gr6bner basis may look more complicated. This is illustrated in the following example. EXAMPLE 2.7.3. Consider the graph g' in Figure 2.2.
Xs
Xg
FIGURE 2.2. The graph The polynomials corresponding ta 3
Xi -
g'
1, for
g'
are: . ~
= 1, ' .. ,9
and
x; +
XiX;
+ xJ,
for the pairs (i,j) E {(1, 2), (1,4), (1,5), (1, 6), (2,3), (2, 5),
(2,7), (3,4), (3, 6), (3,9), (5,6), (6,8), (7, 8), (8, 9)}. We compute a Gri:ibner basis G' for the ideal l' corresponding to the above polynomials using the Lex term ordering with Xl > X2 > ... > T8 > Tg ta obtain Q' = {x~
- 1, x; + X8X9 + x~, (X7 Xij
-
x9){X7 + Xs + Tg),
+ X7 + Xs, (-X5 + X7)( -X5 + xs),
+ X4XS + X7XS + X4X9 + T7X9 + XSX9), (X4 - X5)(X4 + T7 + TS), X3X7 + X3xs - X7XS + X3X9 + X~, 2 X3X4 + X4X[j - X4X7 + X[jX7 - X4X8 + X[jX8 - X3XU - x g, X~ + X3X9 + X~, X2 + X7 + X8l Xl + X[j - X7 - X8}. (X5 -
X9)(X4X7
2.8. INTEGER PROGRAMMING
105
Since 1 cf' G', we have that V(I') of 0, and hence, by Theorem 2.7.1, Q' is 3-colorable. This Gr6bner basis looks much more complicated than the one in Example 2.7.2. This reflects the fact that there are many possible colorings of this graph. In fact, it is easy to 3-color the graph Q' by trial and error. ExercÏses We note that we have tried to keep these problems small, without making thern too trivial looking, but your Computer Aigebra System may still have trouble doing the computations.
2.7.1. Show that if we add an edge between X2 and Xs in the graph Q of Figure 2.1, the graph is no longer 3-colorable. (This can be done either by computing a Gr6bner basis for the new ideal, or by observing what was done in Example 2.7.2.) 2.7.2. Show that if we add one edge between vertices Xl and X3 in the graph Q' of Figure 2.2, then Q' is still 3-colorable. Show that now the 3-coloring is unique except for the permutation of the coloTs. 2.7.3. Generalize the method given in this section to the case of determining whether graphs are 4-colorable. 2.7.4. Use the method of Exercise 2.7.3 to show that in the trivial example where there are 4 vertices, each pair of which is connected by an edge, the graph is 4-colorable and show that the equations imply that all 4 vertices must be colored a different color. 2.7.5. Show that in principle (by this we mean that the computations would probably be too lengthy to make the scheme practical) the method in this section could be generalized ta giving a method ta determine whether a graph is rn-colorable for any positive integer m. 2.8. Integer Prograrnming. The material in this section is taken from P. Conti and C. Traverso [CoTr]. No use of this section will be made elsewhere in the book. The integer programming problem has the following form: let aij E Z, bi E Z, and Cj E R, i = 1, ... ,n, j = 1, ... ,mi we wish to find a solution (0"11 l72, . .. ,am) in l'lm of the system
(2.8.1)
a210"1
+ +
anlO"l
+
{"'""'
a220"2
+ +
+ +
an 20'"2
+
+
a120"2
a2m Œm
bl b2
anm(J"m
bn ,
almeTm
which minimizes the "cast function" m
(2.8.2)
c(al 1 0"2,··· ,(Tm)
=
L j=1
Cjaj_
106
CHAPTER 2. APPLICATIONS OF GROBNER BASES
This problem occurs often in scientific and engineering applications. There are many books on the subject which the reader may consult, see, for example, [Schri]. Our purpose here is to apply the results of Section 2.4 to indicate a solution method to this problem. Our strategy is to: (i) Translate the integer progranuning problem into a problem about polynomials; (ii) Use the Grübner bases techniques developed so far to solve the polynomial problem; (iii) Translate the solution of the polynomial problem back into a solution of the integer programming problem. In order to motivate the general technique presented below, we will first start with the special case when ail aij's and bi's are non-negative integers. We will alsa concentrate first on solving System (2.8.1) without taking into account the cost function condition (Equation (2.8.2)). We introduce a variable for each linear equation in (2.8.1), say Xl, X2,.·. 'X n1 and a variable for each unknown (Jj, say YI, Y2, ... ,Ym' We then represent the equations in (2.8.1) as
for i = 1, ... , n. Then System (2.8.1) can be written as a single equation of power products
or equivalently,
(2.8.3) We note that the left-hand side power product in Equation (2.8.3) can be viewed a..c;; the image of the power product Y2 2 . . . y~m under the polynomial map
Yrl
k[Yl,""Y=] Yj
The following lemma is then clear. LEMMA 2.8.1. We use the notation set above, and we assume that all aij '8 and bi '8 are non-negative. Then there exists a solution (ŒI, Œ2,'" ,Œm) E Mm of System (2.8.1) if and only if the power product x~' x~' ... x~n is the image under 1> of a power product in k[yl, ... ,Ym]' Moreover if X~l x~ ... x~n =
We have presented in Section 2.4 an algorithmic method for determining whether an element of k[Xl, ... , x n ] is in the image of a polynomial map such as 1> (see Theorem 2.4.4). However the above lemma requires that the power product X~l X~2 •.. x~n be the image of a power product, not a polynomiaL But,
2.8. INTEGER PROGRAMMING
107
because the map
'"
1
Xn], we
LEMMA 2.8.2. We use the notation above, and we assume that aU aij '8 and bi '8 are non-negative. If X~l xg 2 • • • x~n is in the image of
PROOF. LetK = (Yj_X~ljx~2j ... x~nj 1 j = 1, ... ,m} be theideal considered in Theorem 2.4.4. Let G be a Griibner basis for K with respect to an elimination order with the x variables larger than the y variables. Then, by Theorem 2.4.4 n xlb, X2b, ... X b n
("') · E lm 'f'
Moreover, if X~lX~2 ···x~n
b, X b, ... X b -{==:? Xl' 2 n
--S+
n
h with h E
G h --++ k[Yll'"
Wl·th h E k[ YI,···
,Ym] then X~lX~2
1
Ym ] .
...
x~n =
<jJ(h). We first note that the polynomials that generate K are all differences of two power products. Therefore, during Buchberger's Algorithm to compute G (see Algorithm (1.7.1)), only polynomials which are differences oftwo power products will be generated. lndeed, the S-polynomial of two polynomials which are both differences of two power products is itself a difference of two power products, and the one step reduction of a polynomial which is a difference of two power products by another polynomial of the same fOrIll produces a polynomial which is itself a difference of two power products. Therefore the polynomials in G are all differences of two power products. Now if 1 2 ••. x~n is in the image of cjJ, then it reduces to a polynomial h E k[Yl, ... ,Ym]. But the one step reduction of a power product by a polynomial which is a difference of two power products produces a power product. Therefore h is a power product and we are done. 0
xt xt
The proof of Lemma 2.8.2 gives us a method for determining whether System (2.8.1) has a solution, and for finding a solution: (i) Compute a Griibner basis G for K = (Yj - x~,j x~,j ... x~nj 1 j = 1, ... ,m) with respect to an elimination order with the x variables larger than the y variables; (ii) Find the remainder h of the division of the power product xî' x~' ... x~n by G; (iii) If h rt k[Yl, ... ,Ym], then System (2.8.1) does not have non-negative integer solutions. If h = Yrly~2 ... y~rn, then ((Ji, (J2, ... ,(Jm) is a solution of System (2.8.1). To illustrate the ideas presented so far, we consider a simple example. EXAMPLE 2.8.3. Consider the system (2.8.4)
+ +
(J3 (J3
10 5.
We have two x variables, Xl, X2, one for each equation. We also have four y variables, Yi, Y2, Y3, Y4, one for each unknoWIl. The corresponding polynomial
108
CHAPTER 2. APPLICATIONS OF GROBNER BASES
map is Q[YJ, Y2, Y3, Y4] YI Y2 Y3 Y4
'"
--;
c--+
Q[XJ, X2] XrX~
c--+
XI X2
c--+
XI X 2
c--+
Xl,
80 K = (YI - XrX~, Y2 - XIX21 Y3 - XIX2, Y4 - Xl) ç Ql[yl, Y2, Y3, Y4, Xl, X2]' The Grübner basis for K with respect to the lex order with XI > X2 > YI > Y2 > Y3 > Y4 is G = {JI, 12, h, 14, h}' where JI = Xl - Y4, 12 = X2Y4 - Y3, h = X2Y~ - YI, 14 = Y2 - Y3Y4, 15 = YIY4 - yj. Then
and
and h = Yh~ is reduced with respect to G. Using the exponent, of h we get that (0,0,5,5) is a solution of System (2.8.4). Now we tUTn our attention to the more general case where the aij's and bi's in (2.8.1) are any integers, not necessarily non-negative. We still focus our attention on determining whether System (2.8.1) has solutions and on finding solutions, that is, we still ignore the cost function condition (Equation (2.8.2)). We will proceed as before, except that we now have negative exponents on the x variables. Of course, this cannot be done in the polynomial ring k[XI,'" ,xn ]. Instead, we introduce a new variable w and we work in the affine ring k[Xl1'" ,xn,wl/I, where l = (XIX2" 'XnW -l). We may choose non~negative integers a~j and (Xj, for each j = 1, ... ,m and i = 1, ... n sueh that for each j = 1, ... ,m we have 1
(alj,a2j,'"
,anj) = (a~j,a~j, ... ,a~j) +0:)(-1,-1, ... ,-1).
For example, (-3,2,-5) = (2,7,0)+5(-1,-1,-1). Then in the a1fine ring k[xl, ... ,X n , wlj l we can give meaning to the coset x~lj x~2j . . • x~nj + l by defining
Similarly, (b l , b2 , .•• ,bn ) = (b~, b;, ... ,b~) + tJ( -1, -1, ... , -1), where b; and are non-negative integers for i = 1, ... ,n, and define
tJ
Therefore we have the following equation which corresponds to Equation (2.8.3)
(2.8.5)
We therefore proceed as before, and we note that the left-hand side of Equation (2.8.5) can be viewed as the image of the power product yf'y;' ... y':,;" under
2.8. INTEGER PROGRAMMING
109
the algebra homomorphism
As before we have LEMMA (0-1,0-2,'"
2.8.4. We use the notation set above. Then there exists a solution ,O-m) E Hm of System (2.8.1) if and only ifx:;x~; ... x~~w(3+I is the b'
b'
b'
image underq, of a power product in k[Yl,'" ,Ym]' Moreover ifx ,'x 2' ••. Xnnw~+ 1 = q,(Yr'Y~' ... y~=), then (0-1,0-2, ... ,O-m) is a solution of System (2.8.1). We have presented in Section 2.4 an algorithmic method for determining whether an element of k[Xl"" ,xn,wl/I is in the image of an affine algebra homomorphism sueh as q, (see Theorem 2.4.13). As in the first case we considb'
b'
b'
ered, the above lemma requires that X l l x2 2 ... Xnnwf3 + l be the image of a power product, not a polynomial. As before we have b'
b'
b'
2.8.5. Wc use the notation set above. Ifx ,'x2'" ·xnnw~+I is in the image of q" then it is the image of a power product Yr' y~' ... y~= E k[Yl, ... ,Ym]' LEMMA
PROOF. As in Theorem 2.4.13, let K
ç k[Yl,'" ,Ym, X"
...
. {Yj-XIa; ]X2] a; "'XnnJwCl:J a' . . IdealgeneratedbyxIX2'··xnw-land
,Xn.
w] be the
1 .
J = 1, ... ,m } .
Let G be a Grübner basis for K with respect to an elimination order with the x and w variables larger than the y variables. Then, by Theorem 2.4.13,
As in Lemma 2.8.2, the polynomials that generate K are ail differences of two power products, therefore, the argument used in the praof of Lemma 2.8.2 ean again be applied. That is, the polynomials in G are aU differences of two power products, and the reduction by G of a power product produces a power product. D EXAMPLE
(2.8.6)
2.8.6. We consider the following system -1 5.
We have two x variables, Xl, X2, one for each equation. We also have four y variables, YI, Y2, Y3, Y4, one for each unknown. We consider the ideal l = (XIX2W -1)
CHAPTER 2. APPLICATIONS OF GROBNER BASES
110
of Q[XI' X2, w] and the algebra homomorphism
"
Q[YI, Y2, Y3, Y4] YI Y2 Y3 Y4
--> >---> >---> >--->
>-->
Q[XI' X2, wlj l xi + l x~w2 + l xIw+I X2W +1.
xi
Thus K = (YI - xixi, Y2 - x~w2, Y3 - xiw, Y4 - X2W, XIX2W - 1). The Gr6bner basis for K with respect to the lex order with XI > X2 > w > YI > Y2 > Y3 > Y4 is G = {fI, h, ho J4, J5, J6,!7,!8,!9}, where fI = XI -YIYtyl, h = X2 -YIYh~, 13 = w - Y3Y'l, J4 = YIytyI - l, J5 = YIYh~ - Y2, J6 = YIY~Y';jh = YIY3yl° - y~, Js YIyl l - yi, fg Y2Y3 - Y4· We now reduce the power product x~w by G (note that xllx~ + l = x~w + I)
=
yi,
=
x~w
{j,Jsl -->
J,
-4
YhPY~s Vfy§5 y
ll
J,
yty§ly~4
J,
yfyJyF
-4 -4
J,
o
-4
YÎYhl
~
YIY2Y~,
and YIY2yl is reduced with respect to G. Observing the exponents of the different power products obtained during the reduction, we have the following solutions of System (2.8.6) (6,0,19,38), (5,0,15,31), (4,0,11,24), (3,0,7,17), (2,0,3,10), (l, l, 0, 2). We return to the original problem. That is, we want to find solutions of System (2.8.1) that minimize the cast function C(O"I' 0"2,··· ,0"=) = 2:;1'=1 CjO"j (Equation (2.8.2)). As we mentioned before, the only requirement on the term order in the method for obtaining solutions of System (2.8.1) described above, is that we have an elimination order between the x, w and the y variables with the x and w variables larger. Our strategy for minimizing the cost function is to use the cj's to define such a term order. DEFINITION 2.8.7. A term order
0-'
al
and C(o-ll'"
1
O"m) <
C(Œl, ...
,(Tm)
2.8. INTEGER PROGRAMMING
111
Term orders on the y variables which are compatible with c and 10 are exactly those orders which will give rise to solutions of System (2.8.1) with minimum cast as the next proposition shows. PROPOSITION 2.8.8. We use the notation set above. Let G be a Grobner basis for K with respect ta an elimination arder with the x and w variables larger than the y variables, and an arder
yr
U b~j3G 0" U X 2 ..• X n W ---1+ YI(Tl .•. ymm., Wl'th YI ' ... YntrY> Te d lice d'th Wl P ROOF. L et Xlb~b~ respect to G. Then (0"1,'" , O"m) is a solution of System (2.8.1) by Lemma 2.8.4. Now assume to the contrary that there exists a solution (Œ~, ... ,a:n) of System (2.8.1) such that I:?~1 ejO"j < I:?~1 CjO"j. Consider the correspond-
ing power product y~; ... y:;,.'rr.. Note that 1o(y~' ... y:;"=) ~ 1o(y~; ... y:;,.'rr.) =
yr
V b' b ' XI1X22 .. ' Xnn w f3 J. Therefore l
+
2.~.1O, k~r(1o)
l ..
'y~rn
l
-- y~l ... y':n= E ker(cp). By Theorem a'
a'
K, so that y~'" ,y:;"= - Y, ''-' 'Ym~ E K. Hence y~'" .y;?,,= y~l ... y':nm -S+ O. Since l • . . y~m >c y~l ... y~m by assumption, we have
ç
yr It{yfl ... y':n= - y~~ ... y':n'.n) yr y~Tn. But yr y~m. is reduced with respect to C, and therefore yr y~rn - y~~ ... y~'.n cannot reduce to 0 by G. 0 =
l
l
•.•
l
•.•
...
We note that a different minimal solution may be obtained if we use a different order, as long as we have an elimination order with the x and w variables larger than the y variables, and as long as we use an order on the y variables compatible with the cost function e and the map 10. For sorne cases, the term order
We use the lex order on w and the x variables with Xl > X2 > w. The power products in y are first ordered using the cast fUI1ctia~ a~d ~ies are broken by lex with YI > Y2 > Y3 > Y4. That is Yrly~2y~3y~4 < y~ly~2y~3y:4 if and only if 10000"1 +0"2+0"3+1000"4 < 10000"; +0"2+0"~+1000"~ or, 1O?00",1 +?"2+0"3 +1000"4 = lOOOO"~ + O"~ + 0"3 + lOOO"~ and Yrly~2y~3y~4
112
CHAPTER 2. APPLICATIONS OF GROBNER BASES
an elimination order with w and the x variables larger than the y variables. The reduced Grübner basis for K is 2 3 G = {w - Y2Y4J Y4 - Y2Y3,
Xl -
6 10 6 9 7 11 YIY2Y3 , X2 - YIY2Y3J YIY2Y3 - 1} .
We have x~w -S+ Y1Y~Y§ which gives the solution (l, 3, 2, 0). This solution is of minimal cast, Exercises 2.8.1. Use the method of Lenuna 2.8.1 to solve the following system of equations for non-negative integers.
+ +
0"3
10
0"3
12.
2.8.2. Use the method of Lemma 2.8.1 to solve the following system of equations for non-negative integers.
+ +
0"3
10
0"3
4.
2.8.3. Use the method of Lemma 2.8.1 to solve the following system of equations for non-negative integers. {
30"1 40"1 20"1
+ + +
20"2 30"2 40"2
+ + +
0"3
+
20"4
10
0"4
12 25.
0"3 20"3
+
2.8.4. Use the method of Lemma 2.8.1 to solve the following system of equations for non-negative integers. {
30"1 40"1 20"1
+ + +
20"2 30"2 40"2
+ + +
0"3
+
20"4
+
0"4
0"3 20"3
10 11 10.
2.8.5. Prove Lemma 2.8.4. 2.8.6. Use the method of Lemma 2.8.4 to solve the following system of equations for non-negative integers. 4 -3. 2.8.7. In Example 2.8.9 replace the cost fnnction by C(O"I, 0"2, 0"3, 0"4)
= 10000"1
+ 0"2 + 0"3 + 0"4·
2.8.8. In Example 2.8.9 replace the cost fnnction by C(O"l' 0"2, 0"3, 0"4)
= 0"1
+ 10000"2 + 0"3 + 0"4·
Chapter 3. Modules and Grobner Bases
Let k be a field. In this chapter we consider submodules of k[Xl' ... ,xnJ= and their quotient modules. In Section 3.1 we briefly review the concepts from the theory of modules that we require (see [Hun]). Then in Section 3.2 we define the module of solutions of a homogeneous linear equation with polynomial coefficients (called the syzygy module) and use it to give yet another equivalenl condition for a set to be a Grübner basis for an ideal. In Section 3.3 we follow Buchberger [Bu79J and show how to use this new condition to significantly improve the computations of Grübner bases. In Section 3.4 we show how 10 compute an explicit generating set for the syzygy module of a vector of polynomials. We then generalize the definition of Grübner bases and the results of the previous two chapters to modules (Section 3.5) and show that the same type of applications we had for ideals are possible in the more general context of submodules of free modules (Section 3.6). In Section 3.7 we generalize the results of Section 3.4 to systems of linear equations of polynomials. In Section 3.8 we show how this theory can be applied to give more efficient methods for the computations of Chapter 2 that required elimination. In the next to last section we explicitly compute Hom. That is, we compute a presentation of Hom(M, N) given Iwo explicitly presented modcles M and N. Finally, in the last section we apply the previous material to give results on free resolutions and outline the computation of Ext(M,N). 3.1. Modules. In this section we let A be any commutative ring. The ring that will be of interest to us in later sections is A = k[Xl'''' ,XnJ. We will consider the cartesian product
That is, Am consists of all column vectors with coordinates in A of length m. Although we will always consider the elements of A= as column vectors (which we will enclose in square brac.kets), in the interest of saving space in the book 113
114
CHAPTER 3. MODULES AND GROBNER BASES
we will usually write the elements of Am. as In other words, we will often use
rQW
vectors enclosed in parentheses.
(al, ... , am) instead of [
am' ] a
The use of column vectors is necessitated by the desire ta have function composition match the lisuai notation used in matrix multiplication. The set Am is called airee A-module. A set Mis called an A-module provided that M is an additive abelian group with a multiplication by elements of A (scalar multiplication) satisfying: (i) for ail a E A and for ail m E M, am E M; (ii) for ail a E A and for al! m,m' E M, a(m + m') = am + am'; (iii) for al! a, a' E A and for ail m E M, (a + a')m = am + a'm; (iv) for al! a, a' E A and for ail m E M, a(a'm) = (aa')m; (v) for al! m E M, lm = m. Scalar multiplication in Am. is done componentwis€; that is,
or using our space saving notation, a(al 1 ' " ,am) = (aal)." ,aam ), for a E A and (aIl'" ,am) E Am, The module Am. is called fTee because it has a basis, that is a generating set of linearly independent vectors. For example,
e, = (l,a, ...
,0),e2
= (0,1,0, ... ,0), ... ,em = (0,0, ... ,0,1)
is a basis which we will calI the standard basis for Am. In other words, every element a = (aIl'" 1am) of Am can be written in a unique fashion m
a =
L aiei, where ai E A. i=l
Note that an ideal of A is an A-module, and in fact a submodule of the Amodule A = Al A submodule of an A-module M is a subset of M which is an A-module in its own right. For example, if al, ... ,as are vectors in Am, then
M = {b , al
+ ... + bsa s 1 bi
E
A, i = l, ... ,s} ç: Am
is a submodule of Am. We denote this submodule by (a" ... ,as) ç: Am, and we calI {a" . .. , a,} a generating set of M. The concept of an A-module is similar to the one of a vector space, except that the set of scalars in the module case is the ring A which is not necessarily a field. Submodules of Am are used for linear algebra in Am in the same way subspaces of km are used for linear algebrain km. Forexample, let M= (al, ... ,as! ç Am.
3.1. MODULES
115
If we think of the vectors ai '8 as the colurrms of an m x s matrix S, then M is the "column space" of the matrix S, that is,
So, given a E Am, the system of m linear equations (with unknown the coordinates of b E A') determined by the matrix equation
(3.1.1)
Sb=a
can be translated into the question of whether the vector a is in M or not. In the case where m = 1, that is, M is an ideal of A, and System (3.1.1) has only one equation, then this problem is the ideal membership problem diseussed in Section 2.1. Other linear algebra problems in Am can also be translated this way into a module theoretic question. In this chapter we will develop algorithmic tools ta answer these questions in the case where A = k[Xl1'" ,Xn]. We first go back to the general theory of modules. Since we are mainly interested in the ring klxl,." , x n ] which is Noetherian by the Hilbert Basis Theorem (Theorem 1.1.1), we will assume that the ring Awe are eonsidering is Noetherian. We have the following THEOREM
3.1.1. Every submodule M of Am has a finite generating set.
PROOF. Let M be a submodule of Am. We use induction on m. If m = 1, then M is an ideal of A, and the result follows from the Hilbert Basis Theorem (Theorem 1.1.1). For m > 1, let
1 = {a E A 1 a is the first coordinate of an element of M}. Then 1 is an ideal of A, and henee, using the Hilbert Basis Theorem, 1 is finitely generated. Let
I=(a" ... ,a,). Let ml, ... 1 mt E M be sueh that the first coordinate of mi is ai- Now consider
M' = {(b 2 , ... , bm
)
1
(0, b2 , ... , bm ) E M}.
Note that M' is a submodule of Am.-l and so, by induction, is finitely generated, say by n~ l ' _. ,n~ E MI ç Am-l. For i = 1, ... 1 C, let ni be the element of Am with 0 in the first coordinate and the coordinates of n~ in the remaining m ~ 1 coordinates. Note that ni E M. To conclude, we show that
CHAPTER 3. MODULES AND GROBNER BASES
116
Let m E M. Then the first coordinate m, of m can be written as m, = L:~=1 diai· Now consider m' = m - 2:!=1 ~mi. Then m' E M and its first coordinate is zero. Therefore m' = 2:;=1 ct,ni. So, t
m = m'
+ 2:diffii = i=l
as desired.
,
t
LCini + Ldiffiil i=l
i=l
D
DEFINITION 3.1.2. An A-module M is called Noetherian if and only if every submodule of M is finitely generated. Thus Theorem 3.1.1 states that if A is a Noetherian ring then Am is a Noetherian module for all m ?: 1 and, in fact, any submodule of Am is a Noetherian module. Definition 3.1.2 is equivalent to saying that if
is an ascending chain of submodules of M, then there exists no such that Mn = Mno for ail n 2:: no. That these tWQ statements are equivalent is proved in exactly the same way as Theorem 1.1.2 was proved. Now for N any submodule of the A-module M, we define
MIN = {m+N
1= E M}.
MINis the quotient abelian group with the usual addition of cosets. We make MIN into an A-module by defining a·(m+N) = am + N, for aU a E A,m E M. It is an easy exercise to see that this multiplication is weU-defined and gives
MIN the structure of an A-module. We caU MIN the quotient module of M by N.
For M and M' two A-modules, we caU a function q,: M ---> M' an A-module homomorphism provided that it is an abelian group homomorphism, that is,
q,(m+m') = q,(m) + q,(m') for all m,m' E M, which satisfies
aq,(m) = q,(am), for aU a E A, mE M. The homomorphism q, is called an isomorphism provided that q, is one to one and onto. In this case we write M "" M'. Let N = ker(q,) = {m E M 1 q,(m) = O}. Then, it is easy to see that N is a submodule of M. Also, we note that q,( M) is a submodule of M'. We know from the theory of abelian groups that
MIN"" q,(M)
3.1. MODULES
117
as abelian groups under the map MIN m+N
---->
---->
q,(M) q,(m).
This rnap is, in fact, an A-module homomorphism as is easily seen and thus is au A-module isomorphism. This fact is referred to as the First Isomorphism Theorem for modules. As in the theory of abeliau groups, the submodules of MIN are ail of the form LIN, where L is a submodule of M containing N. Now, let M be an A-module, and let 'Inl, ... ,ms E M. Consider the map q,: AS ----> M defined as follows: s
q,(a" ... , as) =
L: aimi' i=l
It is easy to prove that q, is au A-module homomorphism. Moreover, the image of q; is the submodule of M generated by ml) . . ' ,ms' Hence if ml, ... ,ms generate M, then q, is onto. Letting el, . . . ,es denote the standard basis elements in AS 1 we note that cp is uniquely defined by specifying the image of each ei E AS, namely by specifying q,(ei) = mi. We will often define homomorphisms q, from AS by simply specifying q,(ei), for i = 1, ... ,s. Now given generators ml, ... ,ms of the A-module M, we define cp as above. Since 'Ins generates M we see that q, is onto. Let N be the kernel of q,. Then, by the First Isomorphism Theorem for modules, we have
m" ... ,
M~AsIN.
So we conclude LEMMA 3.1.3. Every finitely generated A-module M is isomorphic to AS IN for sorne positive integer s and sorne submodule N of AS.
Our purpose in this chapter is ta do explicit computations in finitely generated modules Qver A. 80 the first question we have to answer is what do we mean when we say that we have an explicitly given finitely generated A-module M? The first way is to be given N = (aIl'" ,am) for explict ab ... ,aTn E AS sueh that M ~ AS IN for sorne explicit isomorphism. Lemma 3.1.3 ensures the existence of such an sand N. DEFINITION
3.1.4. If M
~
A' IN, then we cali A' IN a presentation of M.
The second way ta have an explicitly given module M, provided that M is a submodule of AS, is ta have explicit ml, ... ,fit E AS such that M = ('lnl,." ,m,). Or, more generally, if we have an explicitly given submodule N of AS, the submodule M = ('lnl + N, ... ,m, + N) of AS IN is explicitly given.
CHAPTER 3. MODULES AND GROBNER BASES
118
It is also useful sometimes to have a presentation of M in the last we will show how we can obtaîn this in Section 3.8. COROLLARY
tWQ
cases, and
3.1.5. Every finitely generated A-module M is Noetherian.
By Lemma 3.1.3, M ~ AS IN, for sorne s and sorne submodule N of AS. The submodules of AS IN are of the form LIN, where L is a submodule of AS containing N. Since AS is Noetherian (Theorem 3.1.1), we have that every submodule of A' is finitely generated, and hence every submodule of A' IN is finitely generated. Therefore A' IN and hence M is Noetherian. 0 PROOF.
3.2. Grobner Bases and Syzygies. In this section we let A be the Noetherian ring k[XI"" ,xn ]. Let l = (fI, ... ,J,) be an ideal of A. We consider the A-module homomorphism 10 defined in Section 3.1,
10: A'
--->
l
given by S
(h"", , hs) ,........, L
hdi'
i=l
As we have seen in Section 3.1) l
(3.2.1)
~
AS1ker(lo), as A-modules.
3.2.1. The kernel oJ the map 10 is called the syzygy module oJ the 1 x s matrix [ fI Js J. It is denoted SYZ(f" ... ,Js). An clement (h" ... ,hs) oJ SYZ(f" ... ,!s) is called a syzygy oJ [ fI Js J and satisfies DEFINITION
hIf,
+ ... + hsJ, =
O.
Another way to say this is that Syz(fI, ... ,Js) is the set of aU solutions of the single linear equation with polynomial coefficients (the Ns) (3.2.2)
fIXI
+ ... + JsXs
= 0,
where the solutions Xi are also ta be polynomials in A. We note that the map 10 can also be viewed as matrix multiplication: s
lo(h ... ,hs) = [ fI "
That is, if F is the 1 x s matrix [ fI
= Lhdi' i=l
Js
J, and h
=
[
~:
]
E AS, then
lo(h ... ,hs) = Fh and SYZ(f" ... , Js) is the set of all solutions h of the linear " equation Fh = O.
3.2. GROBNER BASES AND SYZYGIES
119
EXAMPLE 3.2.2. Let A = lQI[x, y, z, w], and
1= (x 2 -yw,xy-wz,y2 -xz). The map
q,
is
q,: A 3
----.
l
given by
(h" h 2, h 3) >---> h (X 2 - yw) + h 2(xy - wz) + h3(y2 - xz). ' Then (y, -x, w) and (-z, y, -x) are both syzygies of [ x2
-
yw
xy -
WZ
y2 -
xz ] ,
since
y(x 2 - yw) - x(xy - wz)
+ W(y2 -
xz) = 0
and
-z(x 2 - yw) + y(xy - wz) - X(y2 - xz) = O. We will show later (see Example 3.4.2) that in fact these two syzygies generate Syz(x2 - yw, xy - wz, y2 - xz), that is, Syz(x 2
-
yw, xy - wz, y2 - xz) = ((y, -x, w), (-z, y, -x)) ç AS.
Because of the isomorphlsm given in Equation (3.2.1), the ideal l can be described as a quotient of a free A-module and SYZ(f" ... ,j,). Mareover, we can view SYZ(f" ... ,j,) as the set of alilinear relations among fI, ... ,j,. Also, homogeneous systems sueh as Equation (3.2.2) or, mare generally, such systems in Am play a central role in the theory of rings and modules, similar to the role they play in the usuallinear algebra over fields. Finally SYZ(f" ... ,j,) will play a critical role in the theory of Grobner bases; in particular, its use will lead to improvements of Buchberger's Algorithm (see Section 3.3). For these reasons and others, Syz(fI, ... ,j,) is a very important abject in commutative algebra. We note that SYZ(f" ... , j,) is finitely generated, since it is a submodule of A' (Theorem 3.1.1). One of our goals is to compute generators for SYZ(f" ... ,j,). The next lernma shows how to compute these generators in a special case (the general case is presented in Section 3.4). PROPOSITION 3.2.3. LetcI, ... ,c, E k-{O} and letX X 2, ... ,X, bepower " products in A. For i # jE {l, ... ,s}, we define X ij = Icm(Xi,X j ). Then the module Syz(c,X ... ,c,X,) is generated by " ij Xij - -X- e j E A' Il <. _ ~ < J. < _ S} , { --ei CiXi CjXj
where el, ... ,es form the standard basis for AS.
CHAPTER 3. MODULES AND GROBNER BASES
120
PROOF. First note that for all i
C,X,
i
ij
j we have X ei CjXi
-
X
ij
CjXj
ej is a syzygy of
Xij Xij ... ,0,---,0, ... ,0) = 1(0, ... ,0,--,0, CjX CjX
--i
O.
j
~
ith coord
jth coord
Therefore
Il:; i < j :;
s) ç
SYZ(C,X ... , csX,). " To prove the converse, let (h" ... ,h,) be a syzygy of [ C,X, that is, h,c,X , + ... + h,c,X, = O. Let X be any power product in 1l'n. Then the coefficient of X in h,c,X, + ... + hscsXs must be zero. Thus it suffices to consider the case for which = i = 1.1 " , ,s, and where c~ = 0 or XiX~ = X for a fixed power product X. Let <1' ... ,C~t' with il < i 2 < ... < i t , be the non-zero cj '8. Then we have ci Cl C2C2 + ... + C~Cs = C~l Cil + ... + C~t Cit = O. Therefore, using the same technique as in Lemma 1.7.5, we have
hi <X:,
+-
+(C~ICil + ... +c~tCiJc,i., ,
...
'
't
't
~O
1 Recall
that (al, ... ,as) stands for the column vector
bs as in the lisual matrix multiplication.
1
We have adopted this awkward looking convention
instead of the lisuai "dot product" because it will be consistent with the notation of computing syzygies of column vectors of matrices later.
3.2. GROBNER BASES AND SYZYGIES
as desired.
121
D
We observe that if C1 X 11 ... ,csXs are the leading terms of the polynomiais f" ... ,f" and if (h" ... ,h,) E Syz(e,X ... ,e,X,), then I::~lhdi has a " leading term strictly smaller than
x·
x· J X't
In particular the syzygy ----.:!:Lei CiXi
Cj
ej of [
C1 X 1
CsXs
j
J gives rise to
the S-polynomial of fi and fJ, sinee
[fI ls ] (0, ...
,0,
-
Xij CiXi'
Xi} CjX ' j
0, ... ,0, -
ith
~ jth
coord
coord
Xij
Xi}
Cii
Cjj
0, ... 10)
=
-X fi - -X fJ = S(j"fJ)· This last observation and Proposition 3.2.3 seem to indicate that the syzygy module of [ It(f,) ItU,) 1 might be relevant to the computation of a Grübner basis for (J" ... ,f,). In order to implement this idea we need the following definition. DEFINITION 3.2.4. Let X l, ... ,Xs be power products and CI, ...
1
Cs
be in
k - {O}. Then, for a power produet X, we caU a syzygy h = (h" ... ,h,) E Syz(e,X ... ,e,X,) homogeneous of degree X provided that eaeh hi is a term (that is, "It(hi ) = hi for aU i) and Xi Ip(hi ) = X for aU i sueh that hi i' O. We say that h E Syz(e,X ... ,e,X,) is homogeneous if it is homogeneous of degree " X for some power product x. Note that the generating set given in Proposition 3.2.3 consists of a finite set of homogeneous syzygies. The next theorem presents another equivalent condition for a set ta be a Gr6bner basis. THEOREM 3.2.5. Let G = {g" ... ,g,} be a set of non-zero polynomials in A. Let B be a homogeneous generating set of SYZ(lt(g,), ... ,It(g,)). Then G is a Grobner basis for the ideal (g" ... ,9,) if and only if for aU (h" ... ,h,) E B, we
have
h,g ,
+ ... + h,9'
G ----> +
O.
CHAPTER 3. MODULES AND GROBNER BASES
122
PROOF. If G is a Grübner basis, then by Theorem 1.6.2,
h,g ,
+ ... + htgt
G
--7+ 0
for all h" ... ,ht. Conversely, let 9 E (g" ... , gt), say
+ ... +Utgt.
9 = u,g,
(3.2.3)
Choose a representation as in Equation (3.2.3) with
x
= max (lp(ui)lp(gi)) l::;i::;t
least. Sinee, by Theorem 1.6.2, we need to show that 9 ean be written as in Equation (3.2.3) with X = lp(g), we may assume that lp(g) < X and show that then we ean obtain an expression sueh as (3.2.3) for 9 with a smaller value for X. Let S= {i E {l, ... ,t} IIp(ui)lp(g,) =X}. Then
Llt(ui) lt(gi) =
o.
iES
Let h = LiES lt(ui)ei (where e" ... ,et is the standard basis for At). Then hE Syz(lt(g,), ... ,lt(gt)) and h is homogeneous. Now let B = {h" ... ,he}, where for each j = 1, ... 1 P, hj = (hlj) ... l htj ). 80 h = L~=l ajhj, where, since h is a homogeneous syzygy, we may asssume that the aj '8 are terms Bueh that lp(a]) lp(hij ) lp(gi) = X for all i,j sueh that ajhij # O. By hypothesis, for eaeh j, L:~l hijgi ..2..,+ O. Thus by Theorem 1.5.9 we have for eaeh j = l, ... ,f t
L
t
h ij9i =
i=l
L
V ij9il
i=l
sueh that t
max lp(vi·) lp(gi) = lp('" hi ·gi)
l
-
L i=l
J
J
< l
-
The latter strict inequality is beeause L:~l hij lt(gi) = Thus,
9
o.
+ ... + Utgt Llt(ui)gi + L(Ui -lt(ui))gd LUigi
U,g, iES
iES
terms lower than X
e
t
LLa h j
ij 9i
+
terms lower than X
ij9i
+
terms lower than X .
j=li=l
e
t
L L aj v j=l i=l
3.2. GROBNER BASES AND SYZYGIES
123
We have ma;xlp(aj) lp(vij) lp(gi) t,J
< ma;xlp(aj) lp(hij ) lp(gi) = ~,J
x.
We have obtained a representation of 9 as a linear combination of the gi '8 such that the maximum leading power product of any summand is less than X. Thus the theorem is proved. 0 In Exercise 3.2.1 we will give an example which shows that the hypothesis that the generating set of syzygies be homogeneous is necessary. We observe that the above proof uses an argument similar to the one used in the proof of Theorem 1.7.4. In fact, as a corollary to the above result, we can recover Theorem 1. 7.4. COROLLARY 3.2.6. Let G = {g" ... ,gt} be a set of non-zero polynomials in
A. Then G is a Grobner basis if and only iffor aU i,j = 1, ... ,t, 8(gi, gj) ~+ O. PROOF. Let G be a Grobner basis. Then, since 8(gi, gj) is an element of the ideal (g" ... ,gt), we have 8(gi,gj) ~+ O. For the converse, we first use Proposition 3.2.3 ta see that the set Xij Xi, .. . . B= { lt(gi)ei-lt(gj)ejl'<J,t,J=I, ...
,t
}
ÇA
t
1.
is a homogeneous generating set of the syzygy module of [ lt(g,) lt(gt) As we have noted after Proposition 3.2.3, each element of B gives rise to an Spolynomial, which reduces to zero by hypothesis. Therefore, by Theorem 3.2.5, G is a Grobner basis. 0
Exercises 3.2.1. In Theorem 3.2.5 the generating set B of SYZ(lt(g,), ... ,lt(gt)) was required to be homogeneous. In thls exercise we show that this hypothesis is necessary. Consider the set G = {g" g2}, where g, = x + y, g2 = X + 1 E y. a. Prove that G is not a Grobner basis for the ideal it generates. b. Prove that the set {(x + 1, -x - 1), (x, -xl) is a generating set for Syz(lt(g,), lt(g2))' G
G
c. Prove that (x + l)g, + (-x - l)g2 --++ 0 and that xg, - Xg2 --++ O. 3.2.2. Give another example that shows that the hypothesis that B be homogeneous in Theorem 3.2.5 is necessary. 3.2.3. Let G = {g" ... ,gt} be a set of non-zero polynomials in A. Let B be a homogeneous generating set of Syz(lt(g,), ... ,lt(gt)). Prove that G is a Grobner basis for the ideal (g" ... , gt) if and only if for all (h"", ,h,) E B, we have h,g, + ... + htgt = V,g, + ... + v,g"
CHAPTER 3. MODULES AND GROBNER BASES
124
where
lp(h,g ,
+ ... + h,g,) =
ma.x(lp( vIl lp(g,), ... , lp( V,) lp(g,)).
[Hint: See the proof of Theorem 3.2.5.] 3.2.4. At this point we do not know yet how to compute generators for the syzygy module of [ fI J, We will see in Section 3.4 how to do this. However, in certain instances we can easily find sorne elements of SYZ(f" ... , J,). Let l be the ideal of k[x, y] generated by fI, h h the 2 x 2 minors of the 2 x 3 matrix S, where
1.
S=[x1
y
x
xl. y
a. Give a method for fimling elements of Syz(fI, h hl. [Hint: Think of 3 x 3 matrices whose determinant must be zero, and obtain 2 syzygies this way.] b. Generalize this idea to the case of m x (m + 1) matrices with entries in k[Xl"" , Xn]. 3.3. Improvernents on Buchberger's Algorithm. So far we have not discussed the computational aspects of Buchberger's Algorithm presented in Section 1.7 (Algorithm 1.7.1). A carefullook at this issue is outside the scope of this book (see, for example, [BaSt88, Bu83, GMNRT, Huy, MaMe, Laz91]). However the computational complexity of Buchberger's Algorithm often makes it difficult to actually compute a Grôbner basis for even small problems and .this limits, in practice, the scope of the applications of the theory. So we will now discuss improvements in the algorithm for computing Grobner bases. We note that the results of this section are rather technical and will only be used occasionally in sorne of the exercises and examples in the remainder of the book. Sorne algebraic results, such as the results presented below, can be used in COlljunction with sorne simple heuristic observations ta improve significantly Buchberger's Algorithm (see [Bu'T9]). It can also be shown (but we shall not do it here) that certain algebraic (respectively geometric) properties of the ideal l
(respectively of the variety V (I)) have a direct influence on the difficulty of the computation of a Grôbner basis for l (see the references above). Recall that the algorithm has two steps: the computation of S-polynomials and their reduction. A problem that arises is the potentially very large number of S-polynomials that have ta be computed. Indeed, as the computation progresses, the number of polynomials in the basis gets larger, and therefore, each time a new
polynomial is added to the basis, the number of S-polynomials to compute also increases. Since the algorithm does terminate, the proportion of S-polynomials which reduce ta zero eventually increases as we get far in the algorithm. A huge amount of computation IDight be performed for very little gain, since few new polynomials are added to the basis. In fact, at some point before the algorithm terminates, the desired Grobner basis is obtained but we do not "know" it.
3.3. IMPROVEMENTS ON BUCHBERGER'S ALGORITHM
125
At that point, the computation of S-polynomials and their reductions are ail together useless except for the fact that they verify that we do have a Grübner basis. One way to improve this situation is to ((predict" that sorne S-polynomials reduce to zero without actually computing these S-polynomials or reducing them. The following results are the basis for two criteria for a priori zero reduction of S-polynomials. LEMMA 3.3.1. Let f,g E A =
klx" ... ,xn ],
bath non-zero, and let d
gcd(j,g). The following statements are equivalent: (i) lp(~) andlp(~) are relatively prime;
(ii) S(j, g) ~ + O. In parlicular, {J, g} is a Grobner basis if and only iflp( ~) and lp(
a) are relatively
prime. PROOF. (i) =? (ii). Assume first that d = gcd(j, g) = 1. We write f = aX + 1', 9 = bY + g', where lt(j) = aX, lt(g) = bY, a, b E k, and X, Y are power . products. Then X = ~(j - J'l, and Y = l;(g - 9'). CASE 1. l' = g' = O. Then f and 9 are both terms and S(j,g) = O. CASE 2. l' = 0 and g' # O. Then, since gcd(lp(j),lp(g)) = l, we have S(j,g) = ~Yf -l;Xg = ';'b(g - g')i"- ';'bfg = - ';'bg'j. We see that (Exercise
1.5.4) S(j, g) ..!...,+ O. CASE 3. l' # 0 and CASE 4. l' # 0 and
S(j,g) =
9' = O. 9' # O.
This is the same as Case 2. Then, since gcd(lp(j), lp(g)) = l, we have
~Yf - ~Xg = .I.(g - g')f _.I.(j - J'lg = .I.(j'g - g'J). a
b
ab
ab
ab
If lp(j'g) = lp(g'J), then lp(j')lp(g) = lp(j'g) = lp(g'J) = lp(g')lp(j), and since gcd(lp(j),lp(g)) = l, we have lp(j) divides lp(j') and lp(g) divides lp(g'). This is a contradiction, since lp(j') < lp(j) and lp(g') < lp(g). Therefore lp(j' g) # lp(g' J), and the leading term of ';'b (j' 9 - 9' f) appears in l'9 or g' f and hence is a multiple oflp(j) or lp(g). If lp(j' g) > lp(g' J), then
S(j, g)
--"--> :b ((j' -lt(j'))g - g' f)
If lp(j' g) < lp(g' J), then
S(j,g)"!"" :b(j'g- (g' -lt(g'))J). Using an argument similar to the one above, we see that the leading term of ~ ((j' - lt(j'))g - g' J) or ~ (j' 9 - (g' - lt(g')) J) is a multiple of lp(j) or lp(g). Therefore this reduction process continues using only f Or g. At each stage of the reduction the remainder has a leading term which is a multiple of lp(J) or lp(g). We see that we can continue this process until we obtain 0, that is,
S(j,g) ~+ O.
126
CHAPTER 3. MODULES AND GROBNER BASES
Now assume that d = gcd(J,g) of 1. Then ged(~,:1) = 1. By assumption We have gcd(lp(~),lp(~)) = 1, and hence, by the case above, H,~} is a Griibner basis. Thus {d~, d~} is also a Grobner basis (Exercise 1.6.13). By Theorem {J,g}
1.7.4, S ( 1,g ) ---->+ O. (ii) =? (i). Let us first assume that gcd(J, g) = 1. We need to show that lp(J) and lp(g) are relatively prime. Let lp(J) = DX and lp(g) = DY, where D, X, and Y are power products in A, ged(X, Y) = 1. Then
Y
X
S(J,g) = lc(J/ - lc(g)g· By assumption we have S(J, g) such that
~+ 0, and hence there exist u, v E k[Xl"" ,xnl Y
(3.3.1)
X
S(J, g) = lc(J/ - lc(g)g =
u1 + vg,
where lp(uf) S; lp(S(J,g)) and lp(vg) S; lp(S(J,g)). From Equation (3.3.1), we obtain. . (lc%) Therefore 1 divides (let) relatively prime. Also,
lp(u)DX
= lp(uf)
+v)
+ v),
9 = Ceyf)
-u) f.
and 9 divides (lerf) - u), sinee
S; lp(S(J,g))
1
and 9 are
< Xlp(g) = Ylp(J) = DXY.
Thus, lp(u) < Y, and hence lp(lerf) - u) = Y. But 9 divides Cerf) - u), so lp(g) = DY divides lp( lerf) - u) = Y, and hence D = 1. Therefore Ip(J) and lp(g) are relatively prime. Now assume that d = gcd(J,g) of 1. Then gcd(~,;I) = 1. It is easy to prove that if S(J,g) ~+ 0, then 2S(J,9) Ill} + 0 (see Exercise 3.3.3). It is also easy to prove that âS(J, g) = S(~, ;1), since d is monic (see Exercise 3.3.3). Therefore, by the above, we have lp(~) and Ip(;I) are relatively prime as desired. The last statement of the lemma is an immediate consequence of Theorem 1.7.4. 0 Lemma 3.3.1 gives a criterion for a priori zero reduction: during Buchberger's Algorithm, whenever 1 and 9 are such that lp(~) and lp(:'l) are relatively prime, then it is not necessary to compute S(J, g), since S(J, g) will reduee to zero using 1 and 9 alone, and hence S(J,g) will not create a new polynomial in the basis. We note that if Ip(J) and lp(g) are relatively prime, then d = 1 and so
S(J,g) ~+ O. This is the form in which we will use the criterion below (see critl). Now we tUTn OUT attention to another criterion that turns out to be remarkably effective in improving the performance of Buehberger's Algorithm.
3.3. IMPROVEMENTS ON BUCHBERGER'S ALGORITHM
127
LEMMA 3.3.2. Let X X 2 , ... ,X, be power products in A = k[Xl,'" ,xn ] and let Cl, ... ,Cs E k - "{O}. For i,j = l, ... ,s, define X ij = lcm(Xi,Xj ), and let ij X ij s Tij = --ei - -X- e j E Syz (C1Xl, ... ,cs X s ) ÇA, CiXi
CjXj
where el, ... es is the standard basisfor AS. For eachi,j,f = 1, ... ,8 letXijR = lcm(Xi , Xj, Xi). Then we have 1
Xi)1!
--Ti)
Xiji + -Xiji -Tjf + --iei =
Xij
XjE
Moreover, if Xi divides Xij, then and TR.i.
Tij
O.
Xii
is in the submodule of AS generated by
Tjf
PROOF. We have Xijl!
Xiji + -Xiji -TjE + --7fi =
--Tij
Xii
Xij
X ij-, ( --ei X.ij. - -X-ije j ) Xij
CiXt
=
CjXj
Xiji
Xiji
qXi
CjXj
XCi
X ij-' ( -Xji +-ej XjR.
--ei - - - e j
CjXj
+ -Xiji -ej CjXj
p -X- e , ) CeXi
X ij-' ('-Xli +-e, XP.i
CeXp.
Xiii!
Xiji
Xije
ClXi
CR.XR.
CiXi
- - e , + - - e , - --ei =
Xli) --ei CiXi
0
.
Now if Xe divides Xi), then Xiji = Xij, and we have Tij
Hence
Tij
Xij
Xij
Xji
Xii
+ --TjR + - T b
is in the submodule of
AS
= O.
generated by
Tjf
and
TRi'
0
COROLLARY 3.3.3. We continue ta use the notation oJ Lemma 3.3.2. Let B ç {Tij 1 1::; i < j ::; s} be a generating set Jar SYZ(C,X ... ,csX,), Sup, " E B, and such pose we have three distinct indices i,j, f such that 'Til, 'Tjl, Tij that X, divides X ij = lcm(Xi , X j ). Then B - {Tij} is also a generating set for SYZ(C,X ... , c,Xs ). " We will use Corollary 3.3.3 to improve Buchberger's Algorithm in the following way. Let {f" ... ,Js} be a set ofgenerators for an ideal l in k[Xl, ... ,xn]. Let CiXi = lt(fi) and use the notation above. We begin by letting B = {Tij 1 1 ::; i < j ::; s}. Of course, B generates Syz(lt(h), ... ,lt(f,)). We apply Corollary 3.3.3 to eliminate as many of the Tij E B as possible obtaining a possibly smaller set of generators for Syz(lt(h), ... , lt(fs))' We then compute the S-polynomial, S(Ii, Ii), corresponding to one of the Tij remaining in Band reduce it as far as possible; we add the reduction to the set {h, ... , Js} if it does not reduce to zero, calling it Js+l. We enlarge B by the set {Ti,s+1 Il ::; i ::; s} to obtain a new set B which now generates Syz(lt(h), ... , lt(fs), lt(fs+l)). We again apply Corollary 3.3.3 ta eliminate as many of the Tij E B as possible obtaining a possibly smaller set of generators for Syz(lt(h), ... , lt(f,), It(fs+1))' We again compute an Spolynomial corresponding to a Tij remaining in B. We continue this process until
128
CHAPTER 3. MODULES AND GRbBNER BASES
all of the S-polynomials eorresponding to elements in B have been computed and redueed to zero, always maintaining B as a basis of the current syzygy module. In order ta keep track, in the algorithm, of those Tij E B whose corresponding S-polynomial has been computed and reduced we break up the basis B into two parts. We also use just the indices. So we will use N C for the set of ail indices {i, j} of Tij EBat any given time for which the S-polynomial has not been computed and use C for the set of all indices {i, j} of Tij EBat any given time for which the S-polynomial has been computed. We note that at any time in the algorithm after N Chas been initialized, {Tij 1 {i, j} E N C u C} is a generating set of the syzygy module of the current set of leading tenns (see the proof of Proposition 3.3.4). So we continue the algorithm until NC = 0. We now give an improved version of Buchberger's Aigorithm as Aigorithm 3.3.1. We note that the purpose of the first WHILE loop is to initialize NC. The commands used in the algorithm are defined as follows. The command, critI (i, j), returns "TRUE" if and only if IpU;) and IpUj) are relatively prime. If critI(i,j) = TRUE, then, by Lemma 3.3.1, we know without eomputing it, that SUi, Jj) reduces to zero. Nevertheless Tij must be added to
C. The command crit2(NC,C, s) is given as Aigorithm 3.3.2. Aigorithm 3.3.2 is an implementation of the ideas in Corollary 3.3.3 and the ensuing discussion. We nQW make a few technical points about this algorithm. The basic idea is to find triples of indices ",p"p sueh that {v,p,},{",p},{p"p} are in NC U C and Xv divides lcm(X~, X p ). Because of the way we cali this procedure in the main algorithm (Algorithm 3.3.1), we need only consider the cases where one of v, J-L, p is s. This is because the cases of aH triples with v, M, p ail less than s were checked before. Sinee p" p are interchangeable, it is enough to consider the cases p = sand " = s. These two cases are the two main WHILE loops in Aigorithm 3.3.2. Mareover, we note that a pair of the form {i, s} cannot lie in C (this explains why checking mernbership in NC U C was often just done by checking membership in N C). Finally, we only check, in the second main WHILE loop, whether {i, j} is in N C sinee we are only interested in eliminating it fromNC. PROPOSITION 3.3.4. Given a set of non-zero polynomials F = {fI, ... ,J,}, the Improved Buchberger's Algarithm (Algarithm 3.3.1) will produce a Grôbner basis Jar the ideal l = (F). PROOF. Let G = {f" ... ,Jt} (t 2': s) be the output of Aigorithm 3.3.1. We first note that the S-polynamials carresponding to every pair in C reduce to zero. It then suffices ta show that {Tij I{ i, j} E C} is a generating set for Syz(lt(f,), ... ,lt(f,)), and for this it suffices ta show that at any stage of the algorithm {Tij 1{i, j} E N C U C} is a generating set for the syzygy module of cuuent leading terms. We see this as follows. At each stage of the algorithm there is one of two possibilities. Either the S-polynomial reduces ta zero, and
3.3. IMPROVEMENTS ON BUCHBERGER'S ALGORlTHM
INPUT: F = {fi, ... ,fs}
ç
k[Xi, ... , XnJ with
fi T' 0 (1
129
~ i ~ s)
OUTPUT: A Grübner basis G for (fi, ... ,Is) INITIALIZATION: G:= F
:=0 Ne:= {{l, 2}} i:= 2 C
WHILE i < s DO
Ne :=Neu{{j,i+ 1}
Il ~j ~ i}
Ne := crit2(Ne, C, i + 1) i:= i
+1
WHILE Ne T'
0 DO
Choose {i,j} E Ne Ne:=Ne~{{i,j}}
C :=CU{{i,j}}
IF crit1(i,j) = FALSE THEN
S(fi, 1;)
-'"->+ h, where h is reduced with respect to G
IF hT' 0 THEN
Is+1
:=
h
G:= Gu {fs+i}
s:= s + 1 Ne:=Neu{{i,s}11~i~s~1}
Ne:= crit2(Ne,C, s) ALGORITHM 3.3.1. Improved Buchberger's Algorithm
Neuc does not change, or a polynomial is added to G, and the relevant pairs are added to Ne u C and so the set of Tij'S corresponding to this updated Ne u C is a generating set for the syzygy module of the new set of leading terms. Then we apply crit2 to the new Ne which does not alter this last statement by Corollary 3.3.3. The algorithm stops for the same reason Algorithm 1.7.1 stopped. D We note that in Algorithm 3.3.1 we do not give a rule for choosing the pair
CHAPTER 3. MODULES AND GROBNER BASES
130
INPUT: NC,C,s from Algorithm 3.3.1 OUTPUT: NC with pairs delete" using Corollary 3.3.3 INITIALIZATION: f
:=
1
WHILE f < s DO IF {f,s} E NC THEN
i:= 1 WHILEi < s DO IF {i,e} ENCuC AND {i,s} ENC THEN IF X, divides lcm(Xi , X,) THEN
NC:=NC-{{i,s}} i:= i + 1 f:= f
+1
i:= 1 WHILE i< s DO IF {i,s} E NC THEN j := i
+1
WHILEj <s DO IF {j,s} ENC AND {i,j} ENC THEN IF X, divides lcm(Xi,X j ) THEN
NC:=NC-{{i,j}} j := j
i:= i
+1
+1 ALGORITHM 3.3.2.
crit2(NC,C, s)
{i,j} E NC far which we compute the corresponding S-polynomial. Sorne studies have shawn that this choiee is of vital importance. Often the S-polynamials are computed in such a way that S(fi, fJ) is computed first if lcm(lp(fi)' lp(fj)) is least (with respect ta the current term order) among the lcm(lp(fv),lp(f~)). This procedure is called the normal selection strategy. Experimental results show that this warks very well for degree compatible term orders. It is not sa
3.3. IMPROVEMENTS ON BUCHBERGER'S ALGORlTHM
131
good for the lex term ordering. This can be explained as follows: if at sorne point the basis contains polynomials fI, ... ,Je in which the largest variable (say x n ) does not appear, then the normal selection strategy will first consider only fI) ... ft disregarding the Orres in which x n appears; in effect, a Grobner basis for (fI, ... , Je) will be computed. Experimental and theoretical results show that this Grübner basis subcomputation can be worse than the original Grübner basis computation. There are techniques to get around this problem but they are beyond the scope of this book. There is another technique that is used ta improve the performance of Buchberger's Algorithm. Namely, the palynomials Ns are inter-reduced at the beginning, and the basis is kept inter-reduced as the algorithm progresses. Even though this may require a lot of computation, it helps to avoid an unmanageable growth in the number of polynomials in the basis, and in the number of divisions necessary throughout the computation. This method is discussed in Buchberger IBu85J. We conclude by noting that the use of critI and crit2 help to reduce the number of S-polynomials that have to be computed. In fact, empirical evidence shows that if N is the number of S-polynomials that would be computed without these criteria, the use of these two criteria reduces that number to about VJii (see, for example, lez]). Ta illustrate Algorithm 3.3.1 we consider the following example. Since the polynomials generated by the algorithm are scattered throughout the text of the example, we have put boxes around these polynomials for easier reference. 1
1
EXAMPLE 3.3.5. Consider polynomials 1fI = x2y2 - z2, Il
12
= xy2 Z
-
xyz, 1
113
and = xyz3 - xz21 in Qlx, y, zJ. We use the deglex term order with x < y < z. We follow Algorithm 3.3.1. However, we will not trace the algorithm crit2 except for one significant example at the end. Wc start with G = {J" h, h}, C = 0, and Ne = {{l, 2}, {l, 3}, {2, 3}}. We see that we do Ilot have to consider {l, 3}, that is, we have Ne =crit2(Ne,C,3) = {{l, 2}, {2, 3}}. We choose the pair {1,2}, changing Ne ta {{2,3}} and C to {{l, 2}}, and compute SU" 12) = x'yz - z3 which is reduced with respect ta G. Thus we add 1J4 = x2yz - z31 ta G. We update Ne to include the pairs with 4 in them and then we use crit2 to compute the new Ne = {{2,3},{1,4},{3,4}}. We now choose the pair {1,4}, changing Ne ta Ne = {{2, 3}, {3, 4}} and C to C = {{l, 2}, {l, 4}}, and compute SU" J4) = yz3 - z3 which is reduced with respect to G. Thus we add J5 = yz3 - z3 ta G. After applying crit2 again we obtain Ne = {{2, 3}, {3, 4}, 3,5 . We choose the pair {3, 5}, sa that now Ne = {{2, 3}, {3, 4}} and C = {{l, 2}, {l, 4}, {3, 5}}, and wc compute S(h, J5) = xz;{ - xz 2 which is reduced with respect to G. Now we add
116 = xz 3 -
xz 2 ta G. Applying crit2 again we get Ne = {{2, 3}, {3, 4}, {3, 6}}. Next we choose the pair {3,6}, update Ne and C = {{l, 2}, {l, 4}, {3, 5}, {3, 6}}, and we compute S(h, JG) = xyz2 - xz 2 which is reduced with respect ta G. Thus we add
1
132
1
h =
CHAPTER 3. MODULES AND GROBNER BASES
XYZ2 - XZ21 to G. Applying crit2 again we get NC = {{2, 7},{3, 7},{4, 7}}.
Now we choose the pair {4, 7} and compute S(f4, h) =
_Z4
+ X2 z2
which is
reduced with respect to G. Thus we add J8 = _Z4 + X2z2 to G. After applying crit2 we have NC = {{2, 7}, {3,7}, 2,8, 4,8, 5, 8},{6, 8}}. Now we choose the pair {2,7} and observe that S(12, h) = O. We choose next the pair {3,7} and see that S(h, h) -S+ O. Then we choose the pair {6, 8}, we update NC = {{2, 8}, {4, 8},{5, 8}} and C = {{l, 2}, {1,4}, {3,5}, {3,6}, {4, 7}, {2,7}, {3,7}, {6, 8}}, and compute S(f6, J8) = x3 Z2 - xz 3 which is reduced with re-
Jo
= X 3 z 2 - xz 3 to G. After applying crit2 we have NC={{2,8},{4,8},{5,8, 4,9, 6,9 .TheS-polynomialscorrespondingto the remaining pairs in NC aU reduce to zero. Thus G = {ft, 12 ,h, J4, Js, Jo, h, J8, Jg} is a Grübner basis for 1= (J" h, hl· We nQw give one example of crit2. We consider the situation right after adding J6 to G. There we started with NC = {{2, 3}, {3, 4}, {l, 6}, {2, 6}, {3, 6}, {4,6}, {5,6}} andC= {{1,2},{1,4},{3,5}}. For 1 we first note that {1,6} is in NC and we must consider the pairs {2,6} and {4, 6} (we do not consider the pair {3,6} because the pair {1,3} rte NC u C, and we do not consider the pair {5,6} because {l, 5} rte NC U Cl. For {2,6} we see that Ip(ft) = x 2 y2 does not divide lcm(lp(12),lp(f6)) = xy2 z3, and for {5,6} we see that Ip(ft) = x 2y2 does not divide lcm(lp(fs), Ip(f6)) = xyz"- So we eliminate no pairs from NC. For e = 2 we note that {2,6} E NC and we only consider {1,6} and {3,6}. For {1,6} we note that Ip(12) = xy2z divides lcrn(lp(ft),lp(f6)) = x 2y2 z 3 and so we eliminate the pair {1,6} from NC. We may not eliminate {3,6}. Now NC = {{2,3},{3,4},{2,6},{3,6},{4,6},{5,6}}. For e = 3 we consider {2,6}, {4,6} and {5,6} and all three of thern are eliminated giving us NC = {{2, 3}, {3,4}, {3,6}}. We do not need to consider e = 4,5, since {4,6},{5,6} rte NC. We do not need to consider e = 6, since there is only one pair with 6 in it left in NC. So we finaUy arrive at NC = {{2, 3}, {3,4}, {3,6}}. Although it might appear in this example that no real saving in computation time has been gained in using crit2, we recall that, in practice, the most time consuming part of Buchberger's Algorithm is the reduction of S-polynomials and we have avoided most of these reductions by using the CUITent algorithm. Indeed, we computed a total of 13 S-polynomials, 6 of which generated elements of the Grübner basis. Had we not used crit2 we would have had to compute and reduce 9 8 = 36 S-polynomials. Note that the computations in crit2 are always trivial. 2 To conclude this section, we mention two other difficulties that arise during the computation of Grübner bases, namely, the possible rapid growth of the degrees and coefficients of the S-polynomials. Even though the degree and/or size of the coefficients of the original polynomials and the Grübner basis may be of modest size, the intermediary polynomials generated by the S-polynornial computations and reductions can become quite large. This can dramatically slow down the computation. Doing computations with large coefficients can be very spect to G. Thus we add
e=
3.3. IMPROVEMENTS ON BUCHBERGER'S ALGORlTHM
133
costly because of the large amount of arithmetic that becomes necessary. As an example of this situation, the reduced Grübner basis for the ideal (4x'y2 + 3x, y3 + 2xy, 7x 3 + 6y) with respect to the lex order with x > y is {x, y}, while the coefficients during the computation grow as large as 10 8 . This can be seen by expressing x as a linear combination of the three original polynomials fI = 4x'y2 + 3x, 12 = y3 + 2xy, and Jo = 7x 3 + 6y: 401408 10 1835008 9 9604 8 43904 7 7 2 5 x = ( 54 x y - 56428623 Y - 56428623 Y - 18809541 Y - 18809541 Y 200704 6 18809541 Y
1)
(7 3 6 27 x y
+ 3 fI + -
38416
2 7
+ 18809541 x
y
175616
+ 18809541 x
4802
7
1605632
+ 56428623 x
2 6
401408
y
+ 18809541 xy
21952
6
100352
2 9
y
7340032
+ 56428623 x
2
7
-
5
917504
2 8
y
3xy + 18809541 Y 1
3)
+ 6269847 Y + 6269847 Y + 6269847 Y + 3Y
8
12
1128~7246y5(917504y5 + 14406y 4 + 65856 y 3 + 301056 y 2 + 6269847)fs. There are techniques to try to get around this problem, but again they are beyond the scope of this book. To illustrate the growth in total degree, consider the ideal l = (x 7 + xy + y, y5 +yz+z, z2 +z+ 1) ç lQI[x, y, z]. The generators ofthis ideal have maximum total degree 7. However the reduced Grübner basis for l with respect to lex with z > y > x contains a polynomial of degree 70. The problem with polynomials with large total degree is the fact that they can have a very large number of terms. For example, the reduced Grübner ba.~is for the ideal l above has 3 polynomials with 58, 70, and 35 terms respectively. Again, a large number of terms make any computation very costly. Exercises 3.3.1. Compute a Grübner bases for the following ideal using Algorithm 3.3.1 without using a Computer Algebra System. a. (x 2y-y+x,xy2-x) ç; lQI[x,y] usingdeglexwithx < y. Comparewith Example 2.1.1. b. (x'y + z, xz + y) ç lQI[x, y, z] using deglex with x > y > z. Compare with Exercise 1.7.3. c. (x'y + z, xz + y, y' z + 1) ç lQI[x, y, z] using lex with x > y > z. 3.3.2. The following exercise may tax your Computer Algebra System. If so, try to make up your own more modest example that illustrates the same point. Let 1= (x 7 + xy + y, y5 + yz + Z, Z2 + z + 1) ç lQI[x, y, z]. a. Find the reduced Grübner basis for l with respect to lex with x > y> z. (No computation is needed!) b. Compute the reduced Grübner basis for l with respect to lex with z > y > x. Compare the two bases.
134
CHAPTER 3. MODULES AND GROBNER BASES
c. Can you give a reason for the difference between the bases in a and b? d. Use c to give a method for generating polynomials h, ... ,J, in n variables whose total degree is small, but such that the reduced Grübner basis for (h, ... ,J,) with respect to a certain lex order has polynomiaIs of very high degree. [Hint: In the given example, note that there are 2 solutions for z, and for each such solution, there are 5 solutions for y, etc.] e. Give examples of polynomials which satisfy d. f. Experiment by changing the degree of the x, y, z terms in the polynomials above, but keeping lex with z > y > x. How does it affect the computing time? g. Experiment by changing the term order and the order on the variables for the examples in this exercise. How does it affect the computation and the computing time? 3.3.3. Let J, g, d E k[XI, ... ,xn ] such that d divides both J and g. a. Show that S(~,~) = ,"S(f,g).
b. Show that if S(f,g)
1.01+ 0, then
,"S(f,g)
{U,l + O.
3.4. Computation of the Syzygy Module. In this section we show how ta compute SYZ(f" ... ,J,) for h,··· ,J, E A = k[XI, ... ,xn ]· This is done in two steps. We first compute a Grübner basis G = {g" ... ,g,} for (h,··· ,J,) and compute SYZ(gl, ... ,Yt). For convenience, we will assume that 91, ... ,9t are monic. We then show how to obtain SYZ(f" ... ,J,) from SYZ(g" ... ,g,). We will assume that we have a fixed term order on A. Let {YI, ... ,9t} be a Gr6bner basis, where we assume that the Yi '8 are monie. For i E {l, ... ,t}, we let Ip(gi) = Xi and for i of j E {l, ... ,t}, we let X ij = Icm(Xi , X,). Then the S-polynomial of gi and gj is given by
By Theorem 1.6.2, we have
, S(gi,gj) = "L,hijvgv , v=l
for sorne hijv E A, such that max (Ip(h ijv ) Ip(gv)) = Ip(S(gi,g,)).
1 "::;v$.t
(The polynomials h ijv are obtained using the Division Algorithm.) We now define for i, j = l, ... ,t, i i- j,
3.4. COMPUTATION OF THE SYZYGY MODULE
135
We note that Sij E SYZ(g", .. ,g,), sinee gt ]
Sij
,
Lhi;v9v
8(gi,9j) -
=
O.
v=l
With the notation above, the collection {Si; Il S i < j S t} is a generatin9 set for SYZ(g1,'" ,9')· THEOREM 3.4.1.
PROOF. Suppose to the contrary that there exists (U1,'" ,u,) such that
(U1,'" ,Ut)
E SYZ(91, ... ,9,) - (Si;
Il Si < j st).
Then we can choose such a (U1,' .. ,Ut) with X = max1';i,;,(lp( U,) Ip(9i)) least. Let 8
= {i E {l, ... ,t} IIp(ui) Ip(9i) = X}.
Now for each i E {l, ... , t} we deline
, {U Ui
=
i
Ui
u; as follows:
ifij"8 -lt(UiJ ifi E 8.
Also, for i E 8, let It( Ui) = CiX:' where Ci E k and X; is a power product. Since (U1,'" ,Ut) E SYZ(91, ... ,9t), we see that
LC;X;Xi
= 0,
iES
and
80
L ciX;ei
E
Syz(Xi 1 i
E
8).
iES
Thus, by Proposition 3.2.3 we have
for sorne ai; E A. Since each eoordinate of the vector in the left-hand side of the equation above is homogeneous, and sinee X:Xi = X, we cau choose aij to be a
CHAPTER 3. MODULES AND GROBNER BASES
136
.
X
constant multiple of X .. . Then we have 'J
,uD
LCiX:ei+(U~) ... iES
ij ij '~ " aij (X X ) + (Ull'" ' Tei - Tej ,u~) ~
i<j
J
i,jES
L We deline (V1,'"
aijSij
L
+ (u~, ... ,u~) +
i<j
i<j
i,jES
i,jES
,v,) = (u~, ... ,u;)+ L
'<j
aij(hij11
··.
,hijt).
aij(hij1 , ... ,hij')' We note that
i,jES
(V1, ... , v,) E SYZ(91, ... ,9,) - (Sij 1 1 :<; i < j :<; t), since (U1, ... , u,), Sij E SYZ(91, ... ,9,) and (U1, ... ,Ut) fic (Sij Il:<; i < j:<; t). We will obtain the desired contradiction by proving that max'<:v",(lP(vv) Ip(9v))< X. For each v E {l, ... , t} we have
lp(u~
Ip(vv)lp(9v)
+
L
aijhijv)Xv
i<j
i,jES
max(lp(u~), max(lp(aij) Ip(hijv)))Xv '
<
i<j
i,jES
But, by definition of
u~,
we have
Ip(u~)Xv
< X.
AIso, as mentioned above, ai]
X
is a constant multiple of X .. ' and hence for all i, j E S, i < j, we have 'J
Therefore lp(vv) Ip(9v) < X for each v E {l, ... , t} violating the condition that = max1<:v<:,(lp(uv)lp(9v)) is least. 0
X
EXAMPLE 3.4.2. We return to Example 3.2.2. Let 91 = x 2 -wy, 92 = xy-wz, and 93 = y2 - XZ. These form a reduced Grübner basis with respect ta the degrevlex ordering with x > y > z > w. Using the notation of the abave result, we have:
Now S(91,92) Therefore
= _wy2 + xwz = X
X
-W93, sa hm
=
h '22
=
0 and h '23
'2 '2 812 = -X e, - --e2 - (hm, h ,2 2, h ,23 ) = (y,-x,w). 1 X2
-w.
3.4. COMPUTATION OF THE SYZYGY MODULE
137
AIso, S(g" g3) = x 3z - y3w = xzg, - ywg3, so that h l3l = xz, h '32 = 0 and h 133 = -yw. Therefore S13
X '3 X '3 e3 - (h = -e, - -X ,3" h ,32 , h ,33 ) =
X,
3
). (y 2 - xz, 0, -x 2 + yw
Finally, S(g2,g3) = x 2z - yzw = Zg" so that h 23l = Therefore
Z,
h 232
= 0 and
h233
= O.
X 23 X 23 S23 = X e2 - X e3 - (h23l,h232,h233) = (-z,y, -x). 2 3 By Theorern 3.4.1, SYZ(g"g2, g3) = ((y, -x, w), (y2 - xz, 0, _x 2 + yw), (-z, y, -x)). We note that
S13
= yS12
+ XS231
80
we have, in fact, that
SYZ(g"g2,g3) = ((y,-x,w),(-z,y,-x)). We now tUTn our attention to computing SYZ(f" ... , J,), for a collection
{J" ... ,J,} of non-zero polynomials in A which may not form a Grübner basis. We first compute a Grübner basis {g"." , g,} for (f" ... , Js)' We again assume that g" ... ,g, are monie. Set F = [ ft Js land 0 = [ g, g, l· As we saw in Section 2.1, there is a t x s matrix S and an s x t matrix T with entries in A such that F = OS and 0 = FT. (Recall that S is obtained using the Division Aigorithm, and T is obtained by keei>ing track of the reductions during Buehberger's Algorithrn.) Now using Theorem 3.4.1, we can compute a generating set {SI, ... , Sr} for Syz(O) (the Si'S are column vectors in A'). Therefore for each i = 1, ... 1 r 0= OSi
= (FT)Si = F(Ts i ),
and hence
(Ts i 1 i = 1, ... , r) ç Syz(F). Moreover 1 if we let 15 be the s x s identity matrix, we have
F(Is - TS) = F - FTS = F - OS = F - F = 0, and hence the columns T" ... ,T s of ls -TS are also in Syz(F). THEOREM 3.4.3. With the notation above we have
... ,TSr,Tl, ... ,T s ) ç AS. " PROOF. Let 8 = (a), ... , as) E SYZ(f"", , J,). Then 0 = Fs = OSs, and hence Ss E SYZ(g"", , g,). By the definition of S), ... , Sr, we have S8 = L~~) hi8i for sorne hi E A. Thus we have TS8 = L~~) h i (T8,). Finally, Syz(ft,··· ,Js) = (Ts
8
=
S -
TSs + TS8 = (ls - TS)s +
r
s
r
i=l
i=l
i=l
L hi(Tsi ) = L aiTi + L hi(Tsi ).
CHAPTER 3. MODULES AND GROBNER BASES
138
Therefore SYZ(fll'" ,is) ç (Ts l has already been noted. 0
) ... ,TST)Tl)'"
=
,T
s). The reverse inclusion
=
=
EXAMPLE 3.4.4. Let h x 2y2 - x 3y, 12 xy3 - x 2y2, and h y4 - x 3. Let F = [h 12 We first compute a Gr6bner basis G with respect to the lex term ordering with x < y for (h, 12, hl· We find 2 G = [91 92 93 94 95 where gl = y4 ~ X3 , 92 = xy3 - x 3y, 93 = x 2y2 - x 3y, g4 = x 4y - x 4 J and 5 4 g5 = x - x . Moreover we have
hl·
[ 91
92
l,
93
[h 12 hl
94
95
l=
[~
1 1
1
0 0 0
-(x + y) -y x
-(x h) ] -y(y + 1) and 2 -x +xy+x
v
T
s To compute generators for SYZ(91,92,93,94,95) we need to reduce ail 8(9i,9j)'S for 1 <:: i < j <:: 5 using 91,92,93,94,95 (Theorem 3.4.1). In view of Coroilary 3.3.3 and Exercise 3.4.4, it is enough to reduce 8(91,92),8(92,93),8(93,94), and 8(94,95) which give the following respective syzygies: 81 = (X,
S3
-y, -X, -1, 0),
= (0,0,x 2, -y, y),
S4
82 =
=
(0, X, -x - y, 0, 0),
(0,0, O,x -1, -y + 1).
Ey Theorem 3.4.1, we have
Then
2We will often use the same letter for a finite set and a row vector corresponding to it, when the context is cIear.
3.4. COMPUTATION OF THE SYZYGY MODULE
T [
~2
=
-y
139
_X2 + y' ] _Xy+y3 . [ x 2y _ xy'
Y We note that (_x 2 +y2, _xy+y3, x 2y_xy2) is in the submodule of A3 generated by (-y,x,O) and (x 2,_y3,_x2y+xy'). Finally, it is easy to verify that
Therefore Syz(fI, h, h) = (( -y,x, 0), (x 2 , _y3, _x 2y + xy2)). Using the notation of Theorem 3.4.3, we note that in Example 3.4.4 the rows of !, - TS did not give any syzygies that were not already in the submodule (TB" ... , Ts,) of A'. This is not always the case (see Exercise 3.4.2). However it is easy to see (Exercise 3.4.3) that if the polynomials fI, ... ,t, appear in the list g" . .. ,g" then Syz(f" ... ,t,) = (TB" ... ,TB,). If the Gr6bner basis for (fI, ... ,t,) is computed using Buchberger's Algorithm (Algorithm 1.7.1) or by the improved Buchberger's Algorithm (Algorithm 3.3.1), then the polynomials h, ... ,18 do appear in the list gb'" ,9t- However, if a reduced Grobner basis is computed, as is done in all Computer Algebra Systems, then the polynomials ft, ... ,18 may not appear in the list gb ... ,gt· ExercÎses 3.4.1. Compute generators for Syz(fI, ... ,t,) in the following examples: a. fI = x 2 y + z, h = xz + Y E Ql[x, y, z]. b. fI = x 2 y - Y + x, h = xy2 - xE Ql[x, y]. c. fI =x2y+z, h =xz+y, h =y2z+1 EQl[x,y,z]. 3.4.2. In Theorem 3.4.3 we had to include the columns of the matrix l, - TS in the set of generators for Syz(f" ... ,t,). In this exercise, we show that these vectors are necessary. Consider the polynomials fI = xy + 1, h = xz + 1, and h = yz + 1 E Ql[x, y, z]. a. Verify that the reduced Gr6bner basis for! = (fI'!2, h) with respect to lex with x> y > z is G = {g"g2,g3}, where g, = y - Z, g2 = X - z, and g3 = Z2 + 1. Also, show that
[g,
g2
g3]=[fI
h
-z 0 x v
T
-i z2
z
]
.
140
CHAPTER 3. MODULES AND GROBNER BASES
1
3.4.3.
3.4.4.
3.4.5. 3.4.6.
1
b. Compute the matrix S such that [il 12 h = [g, g, g3 S. c. Compute the 3 generators, say 81, 82, 83, for Syz(gl, 92, g3). d. Compute I3 - TB. This matrix has 2 non-zero colurons, say Tl, T2. e. Verify that (Ts Ts"Ts 3) '" (Ts Ts 2,Ts3,r"r2)' " ,J,} ç {g" ... ",g,} and {g" ... ,g,} is a Grübner Assume that {f" ... basis for l = (f" ... ,J,). Prave that, with the notation of Theorem 3.4.3, Syz(f" ... ,J,) = (Ts ... ,Tsr ), that is, the columns of the matrix l, TB are not necessary. " Genera!ize Theorem 3.4.1 as follows: Let {g" ... ,gt} be a Grübner basis and let Ti, be as in Section 3.3. Let B ç {Tij 1 1 <:: i < j <:: t} be a generating set for Syz(lt(g,), ... ,It(g,)). Prave that {Si' 1 Tij E B} is a generating set for SYZ(g" ... ,g,). Apply Exercise 3.4.4 and Corallary 3.3.3 to the computations in Exercise 3.4.l. Let il, ... ,J" 9 E klxl" .. ,xnJ and consider the !inea!" equation
hd,
+ h,J2 + ... + h,J,
= g,
with unknowns hl, ... ,hs E k[Xl' ... ) xn]. Let S be the set of solutions; I.e. S = {(h" ... ,h,) E A' 1 hd, + h 212 + ... + h,J, = g}. a. Praye that S is not empty if and only if 9 E (fr, ... ,J,). b. Praye that if S '" 0 then S = h + SYZ(f" ... ,J,) = {h + sis E Syz(f" ... ,J,)}, where h is a particular solution. Give a method for computing h. c. Use the above ta find the solution set for the equation
h , (X 2y' _ X3 y ) + h 2 (Xy 3 _ x2y')
+ h3(y4 _
x3) = y7 _ y6
3.5. Grübner Bases for Modules. As before, let A = k[Xl' ... ,xnJ for a field k. We have seen in the previous sections of this chapter that certain submodules of Am are important. In this section, we continue to study such submodules, but now from the point of view adopted for ideals in the earlier parts of this book. Namely, we will genera!ize the theory of Grübner bases to submodules of Am. As a result we will be able ta compute with submodules of Amin a way similar to the way we computed with ideals previously. The idea is to mimic the constructions we used in Chapter 1 as much as possible. Let us recall the ingredients for the methods wc used before for ideals. First 1 in Section 1.4 we defined t,he concept of a power product and then defilled a term order on these power prodncts, that iS a total order with special properties with respect to divisibility (wc willneed ta define a concept of divisibility). Using these ideas, in Section 1.5 wc defined the concept of reduction whieh in turn led to the Division Aigorithm. Theil in Section 1.6 we dcfined the concept of a Grobuer basis, givillg the cquivalent conditions for Ct Gr6bller basis in Theorem 1.6.2 (and further in Theorcm 1.9.1). The llext issu<" discuss('d in Section 1.7, 1
3.5. GROBNER BASES FOR MODULES
141
was how to compute Grübner bases, and for that we developed S-polynomials and Buchberger's Algorithm. We will follow this development very closely. Indeed many of the proofs in this and the next two sections are very similar to the corresponding ones for ideals and will be left to the exercises. We again need the standard basis el
= {l,0, ... ,0),e2 = (0,1,0, ... ,0), ... ,em = (0,0, ... ,0,1)
of Am. Then by a monomial 3 in Am we mean a vector of the type X ei (1 SiS m), where X is a power product in A. That is, a monomial is a column vector with ail coordinates equal ta zero except for one which is a power product of A. Monomials in Am will replace the notion of power products in the ring A. Sa, for example, (0,XîX3, 0) and (0,0,X2) are monomials in A3, but (O,XI +X2,0) and (0,X2,XI) are not. If X = Xei and Y = Yej are monomials in Am, we say that X divides Y provided that i = j and X divides Y Thus in A 3 we see that (0, XîX3, 0) divides (0, xix~, 0), but does not divide (0, XIX3, 0) or (xîx~, 0, 0). We note that in case X divides Y there is a power product Z in the ring A such that Y = Z X. In this case we define<
Y=Y=Z X X . So, for example
(0, xix~, 0) xîx~ 2 = - 2 - =X3· (0,X I X3,0) Xl x3 Similarly, bya term, we mean a vector of the type eX, where e E k-{O} and X is a monomial. Thus (0, 5xIx1, 0, 0) = 5X, where X = (0, xIx1, 0, 0) = xix1e2' is a term of A 4 but not a monomial. Also, if X = eX ei and Y = dYej are terms of Am, we say that X divides Y provided that i = j and X divides Y We write
y X
dY cX
Sa, for exarnple,
(0, 5xlx~, 0) (0, 2xIx3, 0)
5xîx~
5
= 2xi x 3 = -:lx 3.
We now can define a term order on the monomials of Am.
3.5.1. By a term order on the monomials of Am we mean a total arder, <, on these monomials satisfying the following two conditions: (i) X < ZX, for every monomial X of Am and power produet Z of 1 of A; DEFINITION
3In the case that m = 1 we have now called a monomial what we referred to before as a power product. From now on in the book we will use monomial and power product interchangeably for such elements in the ring A. 4Be careful about what we are doing here. We are "dividing" two vectors in Am to obtain an element in A; but we are only doing this in the very special case where each of the vectors has only one non-zero coordinate in the same spot which is a power product and one power product divides the other. The "quotient" is defined to be the quotient of those two power products.
142
CHAPTER 3. MODULES AND GROBNER BASES
(ii) If X < Y, then ZX < ZY for aU monomials X, Y
E
Am and every
power product Z E A. Looking at Definition 1.4.1 we see that Conditions (i) and (ii) there correspond to Conditions (i) and (ii) in Definition 3.5.1. Condition (i), specialized to the CaBe of m = l, simply says that Z > l, since the monomial X can be cancelled. This is Condition (i) of Definition 1.4.1. The corresponding second conditions are exactly parallel. If we are given a term order on A there are two natural ways of obtaining a term order on Am. These are given in the following two definitions. DEFINITION
3.5.2. For monomials X = Xei and Y = Yej of Am, we say
that
X
=
2, using deglex on the
(x,O) < (0, x) < (y,O) < (xy, 0). DEFINITION
3.5.3. For monomials X = Xe, and Y = Yej of Am, we say
that
X
~;j i = j and X
< Y.
We call this arder POT for' "position over term", since it places more importance on the position in the veetoT than on the term arder on A. 80 in this case we have, again for the case of two variables and m deglex on the power products of A with x < y,
=
2, using
(x,O) < (y,O) < (xy,O) < (D,x). It is easily verified that these two orders satisfy the two conditions of Definition 3.5.1 (Exercise 3.5.1). Of course, each of these two orders could just as weil have been defined with a different ordering on the subscripts {l, ... , m}. In order to indicate which order we are using we will write, for example, el < e2 < ... < e m . There are many other examples of orders and we will use an order different frOID either one of the above in the next section. We note that we are using the symbol "<" in two different ways, bath for a term order on the power products of A and for a term order On the monomials of Am. The meaning will always be clear from the context. In analogy to Theorem 1.4.6 we have the following
3.5. GROBNER BASES FOR MODULES
143
LEMMA 3.5.4. Every term arder on the monomials of A= is a well-ardering.
PROOF. The proof of this lemma is exactly the same as the proof of Theorem 1.4.6 except that the Hilbert Basis Theorem (Theorem 1.1.1) used there must be replaced by Theorem 3.1.1 (Exercise 3.5.2). 0
We now adopt sorne notation. We first fix a term order < on the monomials of A=. Then for ail f E A=, with f '" 0, we may write
f
=
a,X, +a2X2
+ ... + a·rXc,
where, for 1 :::; i :S r , 0 #- ai E k and Xi is a monomial in ATn satisfying X, > X, > ... > Xr. We define • ImU) = X" the leading monomial of f; • leU) = a" the leading coefficient of f; • ItU) =a,X" the leading termof f. We define It(O) 0, Im(O) = 0, and le(O) = O. Note that, consistent with using monomials in Am instead of the power prod~ ucts in A, we now use leading "monomials" instead of leading "power products" , and use the symbol "lm" instead of "lp" . EXAMPLE 3.5.5. Let f = (2x 3y _ y3 +5x,3xy2 +4x+y') E A 2, with the lex ordering on A = lQI{x, yi with x < y. Then in the TOP ordering with e, < e2 of Definition 3.5.2 above, we have
=
f
= _y3 e ,
+ 3xy'e2 + y 2e2 + 2x 3ye, + 4xe2 + 5xe"
and so ImU) = y3 e" le(!) = -1, and ItU) = _y3 e ,. On the other hand, in the POT ordering with e, < e, of Definition 3.5.3 above, we have
f
= 3xy2e2
+ y'e, + 4xe2 -
y3 e ,
+ 2x 3ye, + 5xe"
and sa ImU) = xy 2e2' leU) = 3 and ItU) = 3xy 2e2 . We note that lm, le and It are multiplicative, in the following sense: ImUg) = lp(f) lm(g), leUg) = le(f) le(g) , and lt(fg) = lt(f) It(g), for ail f E A and 9 E A= (Exercise 3.5.6). We now maye on ta the second ingredient in our construction of Grobner bases for modules, namely, reduction and the Division Algorithm. It should be emphasized that now that we have defined monomials (in place of power products), divisibility and quotients of monomials, and term orders, the definitions can he lifted word for word from Section 1.5. The basic idea behind the algorithm is the same as for polynomials: when dividing f hy fi"" ,f8' we want to cancel monomials of f using the leading terms of the fi'S, and continue this process until it cannot he done anymore. DEFINITION 3.5.6. Given f,g,h in A=, 9 '" 0, we say that f reduces ta h modulo g in one step, wntten f --"-+ h,
144
CHAPTER 3. MODULES AND GROBNER BASES
if and only iflt(g) divides a term X that appears in f and h = f -
I,-(;,)g·
We can think of h in the definition as the remainder of a one step division of by 9 similar to the one seen in Section 1.5. Observe that the terms introduced by subtracting I,-(;,)g from f are smaller than the term X subtracted out of f. We can continue this process and subtract off ail possible terms in f divisible by lt(g). EXAMPLE 3.5.7. Let f = (-y3+2x 3 y,3xy2+y2+4x) andg = (x+l,y2+x) be in A 2 We use the lex ordering on A = Q[x, y], with x < y, and TOP with e, < e2 in A 2 Then, lt(g) = (0, y2) = y 2e2' and
f
f
~ ~
(-y3+2x3y-3x2-3x,y2-3x2+4x) (_y3 + 2x 3 y _ 3x 2 - 4x - 1, -3x 2 + 3x).
Let f, h, and f" ... ,f, be veetors in Am, with f" ... , f, non-zero and let F = {fl>··· ,fs}. We say that f reduces to h modulo F, denoted DEFINITION 3.5.8.
f
F
--->+ h,
if and only if there exisls a sequence of indices i" i2, ... , it E {l, ... , s} and vectors hl,'" ) h t - 1 E Am such that
EXAMPLE 3.5.9. Let f, = (xy - y,X 2),f2 = (X,y2 - x) E A 2 We use the lex ordering on A = Q[x, y], with x < y, and TOP with e, < e2 in A 2 Let F = {J"h} and f = (y2 + 2x2y,y2). Then
f ~+ (y2+2y-x,-2x 3 -2x 2 +x), since
(y2
+ 2xy _
+ x) J.2., (y2 + 2y - x, _2x3 - 2X2 + x). vector h = (y2 + 2y - x, -2x 3 - 2X2 + x) cannot
X, -2x3
Notice that thls last be reduced further by f, or f2' This is because lm(f,) = (xy,O) and no power product in the first coordinate of h is divisible by xy and lm(f 2) = (0, y2) and no power product in the second coordinate of h is divisible by y2. DEFINITION 3.5.10. A veetor r in Am is ealled reduced with respect ta a set F = {fl>'" , fs} of non-zero veetors in Am if T = 0 or no monomial that appears in T is divisible by any one of the lm(fi)' i = 1, ... , s. If f ~+ rand r is redueed with respect ta F, then we call r a rernainder for f with respect to F.
3.5. GROBNER BASES FOR MODULES
'45
The reduction process allows us ta give a Division Algorithm that mimies the Division Algorithm for polynomials. Given J, J" ... , J, E Am, with fI' ... ,fs oF 0, this algorithm returns quotients al) . - . ,as E A = k[Xl1 ... ,xnl, and a remainder r E A"t, which is reduced with respect to P, such that
J=ad, +···+asJ,+r. This algorithm is given as Algorithm 3.5.1. INPUT: J,I" ... ,l, E Am with Ji
of 0 (1 <:: i <:: s)
+ ... + asJ, + r {J" ... , J s} and
OUTPUT: a" ... , a, E A, r E Am with J = ad, and r is redueed with respect to
max(lp(a,) lm(f ,), ... , !p(a s ) hn(f ,), lm(r)) = lm(f) INITIALIZATION: a, := 0, a2 := 0, ... , as := 0, r := 0, 9 := WHILE 9
J
of 0 DO
IF there exists i sueh that lm(fi) divides lm(g) THEN Choose i least sueh that lm(fi) divides lm(g) lt(g)
ai
:=
ai
+ lt(fi) lt(g)
9
:=
9 - lt(fi) Ji
:=
r
ELSE
r
+ lt(g)
g:=g-lt(g)
ALGORlTHM 3.5.1. Division Aigorithm in Am Note that the step r:= r+lt(g) in the ELSE part of the algorithm is used to put in the remainder the terms that are not divisible by any lt(fi) and the step 9 := 9 -lt(g) is used to continue in the algorithm to attempt to divide into the next lower term of g. EXAMPLE 3.5.11. We will redo Example 3.5.9 going step by step through Algorithm 3.5.1. INITIALIZATION: a, := 0, a2 := 0, r := 0, 9 := (y2 + 2x2y, y2) First pass through the WHILE loop: (xy,O) = lm(f ,) does not divides lm(g) = (0, y2) (0,y2) = hn(f2) divides lm(g) = (0,y2)
146
CHAPTER 3. MODULES AND CROBNER BASES
a a 2.·- 2
+ It(f2)It(g) - 1
It(g) f = (y2 g .'= g _ It(f2) 2
+ 2x2y
y2) _ l
(O,y')
(O,y2)
(x y2 - x) 1
= (y2+2x2y-x,X) Second pass through the WHILE loop: Neithèr Im(f,) nor Im(f2) divides Im(g) = (y2,0) r := r + It(g) = (y2, 0) g := g -It(g) = (2x2y - X, x) Third pass through the WHILE loop: (xy,O) = Im(fl) divides Im(g) = (x 2y, 0) a .- a + It(g) - 2x 1·-
1
It(fl)-
g := (2x2y - x, x) - (~~::o~) (xy - y, x 2) = (2xy - x, -2x 3 + x) Fourth pass through the WHILE loop: (xy,O) = Im(fl) divides Im(g) = (xy,O) al := al + I~~~:) = 2x + 2 g := (2xy - X, -2x 3 + x) - ?::,~) (xy - y, x 2 ) = (2y - x, -2x 3 - 2x2 + x) Fifth pass through the WHILE loop: Neither Im(fl) nor Im(f2) divide Im(g) = (y,O) r := r + It(g) = (y2 + 2y, 0) g := g -It(g) = (-x, -2x 3 - 2x2 + x). The remaining four passes through the WHILE loop, one for each of the four remaining terms in g, will be similar ta the last one, since neither Im(!l) nor Im(f2) divides any of the remaining terms of g. Sa we finally get, as we did in Example 3.5.9, that F
2
'3
f~+(y +2y-x,-2x' -2x 2 +:z:)=r
and, mQreovcr
f=(2x+2)fl +f 2 +r.
THEO REM 3.5.12. Given a set F = {fl, ... ,fJ of non-zero vectors and 3.5.1) will pradnce polynomials al, ... ,a,5,E A and a vectorr E Am Buch that
f in A=, the Division Algorithm (Algorithm
f=adl + .. ·+a,f.,+r, with r reduced with respect to F, and Im(f) = max(lp(aJJlm(f 1)'''' ,Ip(a.,) Im(fJ, Im( r)J.
3.5. GROBNER BASES FOR MODULES
147
PROOF. The proof is exactly the same as the proof of Theorem 1.5.9 except that we use Lemma 3.5.4 instead of Theorem 1.4.6. 0 We now have what we need to define Grübner bases in modules. So let M be a submodule of Am. DEFINITION 3.5.13. A set of non-zero vectors G = {g" ... ,gt} contained in the submodule M is called a Grübner basis for M if and only if for all f E M, there exists i E {l, ... ,t} such that lm(gi) divides lm(j). We say that the set G is a Grübner basis provided G is a Grobner basis for the submodule, (G), it generates.
We nQW give the characterizations of a Grobner basis analogous to those in Theorems 1.6.2 and Theorem 1.6.7. The proof is basically the same as the one for the ideal case and is left to the exercises (Exercise 3.5.9). We first define, for a subset W of Am, the leading term module of W to be the submodule of Am,
Lt(W) = (lt(w)
1
W E
W)
ç Am.
THEOREM 3.5.14. The following statements are equivalent for a submodule M ç Am and G = {g" ... ,gt} ç M with gi oJ 0 (1 :'Ô i:'Ô t). (i) For all f E M, there exists i such that lm(gi) divides Im(j) (that is, G is a Grobner basis for M).
(ii) f E M if and only if f -S+ o. (iii) For aU f E M, there exists h" ... ,ht E A such that f = h,g, + .. +htg t and Im(j) = max'<;i<;t(lp(htllm(gi)). (iv) Lt(G) = Lt(M). (v) For aU f E Am, if f -S+ Tl, f -S+ T2, and Tl, r2 are reduced with respect to G, then
Tl = T2.
For completeness sake, we note the Corollaries of this result which correspond to the Corollaries of Theorem 1.6.2 (Exercises 3.5.10 and 3.5.11).
COROLLARY 3.5.15. If G = {g" ... ,gt} is a Grobner basis for the submodule M of Am, then M = (g" ... ,gt). COROLLARY 3.5.16. Every non-zero submodule M of Am has a Grobner basis. We will now introduce the analogue of S-polynomials. We continue to emphasize that we need only copy what was done in the polynomial case, except that it is not yet clear what the least common multiple of two monomials should be. 80 let X = X ei and Y = Yej be two monomials in Am. Then by the least common multiple of X and Y (denoted lcm( X, Y)), we mean .0,ifioJj; • Lei where L = lcm(X, Y), if i = j. For example, lcm((x 2 yz, 0), (xy3, 0)) = (x 2 y3 z , 0) and lcm((x 2 y, 0), (0, xy3)) = (0,0).
148
CHAPTER 3. MODULES AND GROBNER BASES
DEFINITION 3.5.17. Let 0 vector
io 1,g
E Am. Let L = lcm(lrn(f),lm(g)). The
L S(f,g)
=
L
lt(f/ - lt(g)g
is caUed the S-polynomials of 1 and g. The motivation for tills definition is the same as the one used for the definition of S-polynomials in Definition 1.7.l. Namely, we are interested in linear combinations of 1 and 9 in which the leading monomials cancel out. This cannot happen if lm(f) and lm(g) have their respective non-zero entries in different coordinates and 80, in this case, we define their least cornmon mult~ple ta be the zero vector making their S-polynomial the zero vector. On the other hand, if lm(f) and lm(g) have their respective non-zero entries in the same coordinate, then S(f,g) is set up to cancel out these leading monomials in the most efficient way.
EXAMPLE 3.5.18. We consider A =
x
< y and A2 with the TOP ordering with e, < e2. Then, S((x 2
+ 1, 5xy3 + x), (x 2 y, 3x3y + y)) =
(0 x 3y 3) (0 x3y3) 2 , (x +1,5xy3+ x )- ' 3 (x 2 y,3x 3 y+y) = (0,5 xy3) (0,3x y) x2 2 3 y2 2 3 1 4 1 2 5(x +1,5xy +x)-3(x y ,3x Y + Y)=(5 X +5 X
1
2 3
-3 XY
1 3 '5 x
1
3)
-3 Y
·
Note that the leading monomial of each of the summands is (0, x 3 y 3) wlllle the leading monomial of the S-polynomial is (x 2 y 3, 0) < (0, x 3 y 3). THEOREM 3.5.19. Let G = {g"", ,g,} be a set of non-zero vectors in Am. Then G is a Griibner basis for the submodule M = (g"", ,g,) of Am if and only if for aU i io j,
We willleave the proof of tills Theorem to the exercises (Exercise 3.5.13); it follows the proof of Theorem 1.7.4 exactly. This last Theorem allows us to give the analog of Buchberger's Algorithm, Algorithm 1.7.1, for computing Gr6bner bases. It is given as Algorithm 3.5.2. Although this algorithm is exactly the same as Algorithm 1.7.1, we will restate it here for the convenience of the reader. The proof of the correctness of the algorithm is left to the exercises (Exercise 3.5.14). 5We use the term "S-polynomial" even though the result is clearly a vector with polynomial coordinates.
3.s. GROBNER BASES FOR MODULES
INPUT: F= {J" ... ,!J
149
ç; Am with!i,fO (1 Si S s)
OUTPUT: G= {g" ... ,gt}, a Grübnerbasis for (f" ... ,fs) INITIALIZATION: G := F,
g := {{Ji' !j} 1 !i ,f !j
E G}
WHILEg,f0DO Choose any {J,g} E g
g:= g - {{J,g}} S(f,g)
-S+ h,
where h is reduced with respect to G
IFh,fOTHEN
g := g U {{ u, h}
1
for ail u E G}
G:=GU{h}
ALGORITHM EXAMPLE
!,
3.5.2. Buchberger's Algorithm for Modules
3.5.20. Consider the following vectors of (
= (O,y,x),
!2
= (O,x,xy-x), h
= (X,y2,0), !4 = (y,O,x).
We use the deglex term order on
S(lI, !2)
=
y!, - J,
=
(0,y2 - x,x)
-S+ (-x, -x - y,O).
This vector is reduced with respect to G, so we set !s = (-x, -x - y, 0), and we add it to G. Next, S(f,'!4) =!, -!4 = (-y,y,O). Again, this vector is reduced with respect to G, so we set!6 = (-y,y,O), and we add it to G. One readily sees that S(f2'!4) -S+ o. The new vectors!5 and ! 6 generate only one S-polynomial that needs to be considered,
S(f5, !6) = yfs - x!6
=
(0, -2xy - y2, 0)
-S+
(0, -2xy - x - y, 0).
This vector is reduced with respect to G, so we set! 7 = (0, -2xy - x - y, 0), and we add it to G. This new vector generates only one S-polynomial that needs to be considered,
150
CHAPTER 3. MODULES AND GROBNER BASES
We set fs = (0, _2X2 + ~x + ~y, 0) and add it to G. This new vector generates two S- polynomials we need to consider, 1 1 2 fs = (3 S(f3,!S ) = 2x 2 f3 +y 2x '2 xy2 + 2 Y3)G ,0 --->+ 0 and S(f7,f s )
2
3
1
2
= Xf7 - yfs = (0, -x - 2 xy - 2 Y ,0)
G
--->+ O.
Therefore G = {f1,!2, f 3, f4' J5,!S,!7,!8} is a Grübner basis for M. Finally, we conclude this section by noting that the results in Chapter 1 concerning reduced Gr6bner bases hold in this context as weil. We will not prove the results here as, again, they exactly parallel the ones in Chapter 1, but we will state the main Definition and Theorem here for completeness. The proof will be left for the exercises (Exercise 3.5.17). DEFINITION 3.5.21. A Grobner basis G = {gl' ... ,gi} ç:; Am is a reduced Grobner basis if, for ail i, gi is reduced with respect ta G - {gJ and lc(g,) = 1 for aU i = 1, ... ,t. Thus for all i, no non-zero term in Yi is divisible by any lm(gj) for any j '" i. THEOREM 3.5.22. Fix a term arder. Then every non-zero submodule M of Am has a unique reduced Grobner basis with respect ta this term arder. This Grobner basis is effectively computable once M has been given as generated by a finite set of veetors in Am. EXAMPLE 3.5.23. We go back to Example 3.5.20. In that example we had that G = {fI' f2' h, f 4 , f5' fs, f 7 , fs} is a Grübner basis for M. We first observe that since lm(fl) divides lm(f2) and lm(f4) we may eliminate f2 and f4 from G and still have a Grübner basis. Moreover, f3 --->+ (0,y2 - X -y,O). Therefore the reduced Grübner basis for M is 2. 211) { (0, y, x), (0, y - x - y, 0), (0, x - 4x - 4Y' 0 ,
(0, xy +1 2x + 2 Y' 0), 1 (y, -y, 0), (x,} x + y, 0) . Exercises 3.5.1. Prove that the POT and TOP orders of Definitions 3.5.3 and 3.5.2 are term orders on Am. 3.5.2. Complete the proof of Lemma 3.5.4. 3.5.3. Prove the analog of Proposition 1.4.5: Let < be a term order on Am. For X, y monomials in Am, if X divides Y, then X:::; Y. 3.5.4. Prove the analog of Exercise l.4.6: Let < be a total order on the monomials of Am satisfying Condition (ii) of Definition 3.5.1, and assume that < is also a well-ordering. Prove that.X < ZX for every monomial X of Am and power product Z '" 1 of A.
3.5. GROBNER BASES FOR MODULES
151
3.5.5. Write the following vectors as the sum of terms in decreasing order according to the indicated term orders: a. f = (X 2y_xy2, x 3+1, y3_1), with the deglex term order on A = lQl[x, y] with x > y, and the TOP ordering on A3 with e, > e2 > e3. Then change the order to deglex with y > x and the POT ordering with e, < e2 < e3. Finally, change the order to lex with y > x and the TOP ordering with e, < e2 < e3. b. f = (x 2 + xy, y2 + yz, z2 + xz), with the degrevlex term order on A = lQl[x, y, z] with x > y > z, and the TOP ordering on A3 with e, < e2 < e3. Then change the order to degrevlex with z > y > x and the POT ordering with e, > e2 > e3. Finally, change the order to lex with z > y > x and the TOP ordering with e, < e2 < e3. 3.5.6. Prove that lt, lm, and le are multiplicative. Namely, prove that for ail f E A and 9 E A= we have lt(fg) = lt(f) lt(g), lm(fg) = lp(f) lm(g), and !c(fg) = !c(f) !c(g). 3.5.7. Complete the proof of Theorem 3.5.12. 3.5.8. As in Example 3.5.11, follow Algorithm 3.5.1 to find the quotients and remainders of the following divisions: a. Divide f = (x 2y+y,xy+y2,xy2+x 2 ), by F = {t,,!2,h, f4}' where f, = (x 2,xy,y2), f2 = (y,O,x), h = (O,x,y), and f4 = (y, 1,0). Use lex with x > y on A = lQl[x, y] with the TOP ordering on A3 with el> e2 > e3· b. Divide f = (x 2y+y,xy+y2,xy2+X 2 ), by F = {t,,!2,h.f4}' where f, = (xy,O,x), f2 = (y,x,O), f3 = (x+y,O,O), and f4 = (x,y,O). Use deglex with x > y on A = lQl[x, y] with the POT ordering on A 3 with
3.5.9. 3.5.10. 3.5.11. 3.5.12.
el
>
€2
>
€3.
Prove Theorem 3.5.14. Prove Corollary 3.5.15. Prove Corollary 3.5.16. Prove the analog of Exercise 1.6.13: Let {g"", ,g,} be non-zero vectors in A= and 0 of h E A. Prove that {g"", ,g,} is a Grübner ba3is if and only if {hg ... , hg,} is a Grübner ba3is. " 3.5.13. Prove Theorem 3.5.19. 3.5.14. Prove that Algorithm 3.5.2 produces a Grübner basis for (f"", '!sl. 3.5.15. Compute the Grübner bases for the following modules with respect to the indicated term orders. You should do the computations without a Computer Algebra System. a. M = (f,,!2,f3,!4) ç A3, where f, = (x 2 ,xy,y2), f 2 = (y,O,x), f3 = (O,x,y), and f4 = (y, 1,0). Use lex on A = lQl[x,y] with x> y and the TOP ordering on A3 with e, > e2 > e3. b. M = (f , ,!2,!3,f4 ) ç A3, where f, = (x-y,x,x), f 2 = (yx,y,y), f3 = (y, x, x), and f4 = (y, x, 0). Use deglex on A = lQl[x, y] with y> x and the TOP ordering on A3 with e, > e2 > e3.
152
CHAPTER 3. MODULES AND GROBNER BASES
3.5.16. Give an example that shows that the analog of Lemma 3.3.1 is false in the modnle case; that is, critl of Section 3.3 cannot be used in Algorithm 3.5.2. 3.5.17. Prove Theorem 3.5.22. 3.5.18. Find the reduced Grübner bases for the examples in Exercise 3.5.15. 3.5.19. In this exercise, we will take a different view of the module Am which allows us to implement Grübner basis theory in Computer Algebra Systems that don't have a built-in module facility. Let el, e2, ... ,em be new variables and consider the polynomial ring k[Xl 1 ' " ,Xn , el,··· ,e'TrJ We identify Am with the A-submodule of k[XI"" ,Xn,el, ... ,e m ] generated by el, e2,··· , e m simply by sending (fI,··· , J,) to fIel + ... + Jmem E k[Xl, ... 1 Xn' el, . '. ,em ]. Fix a term order on k[Xl' ... ) In]. Consider any order on the variables eb e2,· .. 1 em sueh that el < e2 < ... < em . Prave the following: a. Consider an elimination order between the x and e variables. Then, in the above correspondence, if the ;1; variables are larger than the e variables we have the TOP ordering in Amand if the e variables are larger than the x variables we have the POT ordering in Am. b. Note that division and quotient of monomials in Am just mean the lisuai division and quotient in k[Xl1'" ,X n ,el" .. lem]' Then note that Definitions 3.5.6, 3.5.8, and 3.5.10 become the usual ones in k[xj,- .. , X n , el, ... , e m ]. Show that the Division Algorithm for modules (Algorithm 3.5.1) corresponds directly to the polynomial Division Algorithm (Algorithm 1.5.1). c. Show that the usual Buchberger's Algorithm (Algorithm 1.7.1) performed in k[Xl1'" 1 In) el, ... 1 e m ] can be used to compute Grobner bases in Am with the following modification: In the definition of the least common multiple, we must set lcm(X ei, Yej) = 0 for power products X and Y in the x variables when i # j. d. Redo the computations in Exercise 3.5.15 using the above method. 3.6. Elementary Applications of Grobner Bases for Modules. Let M = (f 1'" . , f ,) be a submodule of Am. As in the case of ideals in A (see Section 2.1), we show in the present section how to perform effectively the following tasks: (i) Given f E Am, determine whether f is in M (this is the module membership problem), and if so, find hl, ... , h, E A such that f =
hd 1 +···+h,f,; (ii) Given M', another submodule of Am, determine whether M' is contained in M, and if M' ç M, whether M' = M; (iii) Find coset representatives for the elements of Am lM; (iv) Find a basis for the k-vector space Am lM. Moreover, we show how the theory of elimination we introduced in Section 2.3 is carried over ta the module setting.
3.6. ELEMENTARY APPLICATIONS OF GROBNER BASES FOR MODULES 153 Let F = {J" ... ,J,} be a set of non-zero vectors in Am and let M = (f" ... ,!,). Let G = {9" ... ,9,} be a Grübner basis for M with respect to sorne term order. We start with Task (i). Let 3.5.14 that
J
JE M
E Am. We have already noted in Theorem
=J
G
--->+ O.
So we can determine algorithmically whether J E Mor not. Moreover, if J E M, we apply the Division Algorithm for modules presented in Section 3.5 (Algorithm 3.5.1) to get
J = h'9, + ... + h,9t·
(3.6.1)
As in the ideal case, we can find an s x t matrix T with polynomial entrÎes such that, if we view F and G as matrices whose columns are the f /8 and the 9/S respectively, we have G = FT. (T is obtained, as in the ideal case, by keeping track of the reductions during Buchberger's Algorithm for modules.) Therefore Equation (3.6.1) can be transformed to express the vector J as a linear combination of the vectors fI' ... 1 f 8' EXAMPLE 3.6.1. Let A =
J,
(xy,y,x), J2
=
J3
=
= (-y,x,y),
(x 2 +x,y+x 2,y), J4 = (x 2,x,y).
We use the lex term ordering on A with x < y and TOP with e, > e2 A 3 . To indicate the leading term of a vector we will underline it; e.g.
> e3
on
J,=(xy,y,x), J,=(x 2 +x,)t+x',y), J3 = (-y, X, y), J4
=
(x',x,)t).
The reduced Grübner basis for M is G = {9,,92,93,94,95,96}, where
9,
= (x 3 + x,x 2 93
X,
-x), 92
= (x,)t + x' - x,O),
= ()t+x',O,O), 94 = (x 2,x,)t),
95 = (x 2, x 3, _x 3), 96 = (x 2
-
2x, -x' + 2x, x 5
-
x4
-
3x 3 + x 2 + 2x).
Consider the vector J = (- 2x, x - 1, xy + x). To determine whether J is in M, we perform the Division Algorithm presented in Section 3.5 (Algorithm 3.5.1)6:
J
~(-x3-2x,-x2+x-1,x) -',g, (-X,- 1 1 0) • ~
Since xe" and e2 cannot be divided by any Im(9i), the vector (-x,-l,O) is reduced with respect to G, and hence J is not in M. 6We remind the reader that when we write
f ~ h we mean that h = f - X g.
CHAPTER 3. MODULES AND GROBNER BASES
154
Now consider the vector 9 = (yx 5 we perform the Division Algorithm:
, ,
_
yx + x, yx 4 + y + 2X2 - x, x 5 + yx). Again
x~ (-yx _ x 7 +x,yx4 +y + 2X2 - x,yx+x 5 )
9
-
x~ (-yx _ x -~3 (_x7 _
+ X, Y - x 6 + x 5 + 2x2 x5 + x3 + X, Y _ x 6 + x 5 + 2x2 7
_
x
5
~(_x7_X5+X,y_x6+x5+x2_x)x5)
~
(_x 7 _
-x 4 ,g1
-->
x5 ,
+ x 5) X, xy + x 5 )
X,
xy
-
_x 6 -+x 5 ,x 5 )
(0,0,0).
Therefore 9 is in M. Moreover 1 we get
X5 g3 + X4 g2 - xg 3 + xg 4 + g2 - X 4 g, -x4g, + (x 4 + 1)g2 + (x 5 - x)g3 + xg4·
9
We nQW want ta express 9 as a linear combination of the original consider the matrix T that transforms G into F. We have
f /8.
Sa we
(3.6.2)
= [
J, J,
J3
J4
[" 1
-x x-l
0 1 0 -1
0 0 -1 1
0 0 0 1
.
_x 2 _y
h,
x+y x(l - y) xy-y-x
h2 h3 h4
T
l
where
h, h2 h3 h4
(3.6.3)
y2 + 2x2y _ xy - 2y + x 4 - x 3 - 3x 2 + X + 2 _y2 - x 2y + 2y + 2x2 + X - 2 xy2 + x 3y - x 2y - 3xy - 2X 3 + x 2 + 4x _xy2 + y2 - x 3y + 2x2y + 2xy - 2y - 2x2 - 3x + 2.
= =
= =
Therefore we have 9
-x4g, + (x 4 + 1)g2 + (x 5 - x)g3 + xg 4 4 4 -X (-J, + J2 - xfs + (x -1)J4) + (x + 1)(f2 - J 4 ) +(x 5 - x)(-J3 + J 4) +XJ4 4 X J, +J2+ X J3-J 4·
Now we turn our attention to Task (ii). Let M' be another submodule of Am, say M' = (f;, .. . ,J;). Then M' c:: M if and only if J; E M for i = l, ... ,R. This can be verified algorithmically using the method described above. Moreover, if M' c:: M, then M' = M if and only if M c:: M', and this, again, can be verified algorithmically using the above method. Alternatively, we can compute reduced
3.6. ELEMENTARY APPLICATIONS OF GROBNER BASES FOR MODULES 155 Grübner bases for M and M'and use the fact that reduced Grübner baBes for subrnodules of Am are unique (Theorem 3.5.22) to determine whether M = M'. EXAMPLE 3.6.2. We go back to Example 3.6.1. Let M' be the submodule of A3 generated by {f 2,f3,f4,g}. M' is a submodule of M, since we showed in Example 3.6.1 that g E M. However, using the same order we did in Example 3.6.1, the reduced Grobner basis for M' is given by {g~,g2,g~,g~,g~,g~}, where
g'3
= (x 6 " x 7 _x 7 ) g'4 = (x 7 + x 5 , x 6 _ 1
g; = (x',x,y), g~ = (x 6
-
2x S , _x 6
+ 2x S ,x9 _
S S X , _X ) ,
x8
_
3x 7
+ x 6 + 2x S ).
Since the reduced Grübner basis for M' is not equal to the one for M, the modules M and M'are not equal. Now ifwe consider the submodule Mn of A3 generated by {I 1,12' 13,g}, then again MI! is a submodule of M. Moreover, we can compute that the reduced Grobner basis for Mil is the same as the reduced Grôbner basis for M, and therefore Mn = M. We next turn our attention to Task (Hi), that is, we find coset representatives for the quotient module Am / M. Let G be a Grübner basis for M and let 1 E Am. We know from Theorems 3.5.12 and 3.5.14 that there exists a unique vector r E A ~, reduced with respect to G, such that
e f ---->+ r. As in the ideal case, we call this vector r the normal form of G, and we denote it by N e (!).
1 with
respect to
PROPOSITION 3.6.3. Let 1 and g be vectors in Am. Then
1+M
=
g + M in Am /M
<==?
Nc(!) = Ne(g).
Therefore {Ne(f) IlE Am} is a set of coset representatives for the quotient module Am/M. Moreover the map Ne:
Am
---->
1
>---+
Am N c(!)
is k-linear. The proof of this result is similar to the one for the ideal case (PropoSition 2.1.4) and is left to the reader as an exercise (Exercise 3.6.4). As in the ideal case (Proposition 2.1.6), we have the following Proposition whose proof we also leave to the exercises (Exercise 3.6.5). It solves Task (iv). PROPOSITION 3.6.4. A ba.sis for the k-vector space Am /M consists of aU the cosets of monomials X E Am such that no lm(g,) divides X.
156
CHAPTER 3. MODULES AND GROBNER BASES
EXAMPLE 3.6.5. We again go back to Example 3.6.1. The leading terms of the reduced Grobner basis are:
Therefore a basis for the Q- vector space A 3 !M is
To conclude this section, we consider the theory of elimination presented in Section 2.3, but now in the module setting. Again the proofs are very similar to the ones for the ideal case and, except for the proof of Theorem 3.6.6, are left to the reader. Let YI, ... ) YR. be new variables, and consider a non-zero module
M ç: (A[YI' ... ,y,])m = (k[XI' ...
,X n ,
YI, ... ,y,])m.
As in the ideal case (Section 2.3), we wish to "eliminate" sorne of the variables. For example, we wish to compute generators (and a Grübner basis) for the module MnATn, that is, we wish to eliminate the variables YI,'" ,YR.- First, we choose an elimination order on the power products of A[YI, ... ,y,] with the Y variables larger than the x variables (see Section 2.3). The next result is the analog of Theorem 2.3.4 in the module context (in fact Theorem 2.3.4 is the special case m = 1 in Theorem 3.6.6).
THEOREM 3.6.6. With the notation as above, let G be a Grobner basis for M with respect ta the TOP monomial ordering on (A[YI, ... ,y,])m. Then G n Am is a Grobner basis for M n Am. PROOF. Clearly (GnAm) ç: MnAm. So let 0 of f E MnAm. Then there is agE G such that Im(g) divides Im(f). Since the coordinates of f involve only the x variables, we see that Im(g) can only involve the x variables as weil. Then sinee we are using an elimination order with the y variables larger than the x variables we see that the polynomial in the coordinate of 9 giving rise to Im(g) cau contain only x variables. Finally, since the order is TOP on Am we see that the polynomials in ail of the coordinates of 9 must contain only x variables. D We will give an example in the exercises where the above result is false if the TOP ordering is replaced by the POT ordering on Am (see Exercise 3.6.8). EXAMPLE 3.6.7. In Example 3.6.1 we saw that the reduced Grübner basis for M has three vectors in x alone, namely gl,g51 and g6' Therefore by Theorem 3.6.6, Mn (Ql[x])3 is generated by {g"g5,go}. We can use this result to compute intersection of submodules of A= and ideal quotients of two submodules of Am. First, as in the ideal case (Proposition 2.3.5) we have
3.6. ELEMENTARY APPLICATIONS OF CROBNER BASES FOR MODULES 157 PROPOSITION 3.6.8. Let M = (J" ... ,J,) and N = (91'··· ,9,) be submod· ules of A= and let w be a new variable. Consider the module
L
=
(wfr, ... ,wf" (1 - w)9l'··· ,(1 - W)9t) ç (A[w])m.
ThenMnN=LnA=.
We note that in Proposition 3.6.8, neither {f" ... ,f,} nor {9l'··· ,9t} need be a Grübner basis. As a consequence of this result we obtain a method for computing generators for the module Mn N ç Am: we first compute a Grübner basis G for L with respect to the TOP ordering on monomials of (A[w])m, using an elimination order on the power products in A[w] with w larger than the x variables; a Grübner basis for Mn N is then given by G n A=. EXAMPLE 3.6.9. Let M be the submodule of A3 of Example 3.6.1, and let N be the submodule of A3 generated by the vector 91 = (y, x, xy). The reduced Grübner basis for (Wf" wJz, wh, wJ 4' (1 - w)9,) ç (A[W])3 with respect to the TOP term ordering with e, > e2 > e3 using the lex ordering in A[w] with w > y > x has 8 vectors, two of which are in A3 : h,
h2
+ 2yx 5 + 25yx4 + 7yx 3 - 9yx 2 - 9yx, 9yx _7x 7 + 2x 6 + 25x 5 + 7x 4 - 9x 3 - 9x 2, 9y 2x - 7yx 7 + 2yx 6 + 25yx S + 7yx 4 - 9yx 3 - 9yx 2) (yx 7 - yx 6 - 3yx 5 + yx 4 + 2yx3 ,x8 - x 7 - 3x6 + x 5 + 2x4 , yx S _ yx 7 _ 3yx 6 + yx S + 2yx4 ). (9y2 - 7yx 6
Therefore Mn N = (h" h 2 )
ç A"-
DEFINITION 3.6.10. Let M and N be two submodules of Am. The ideal quotient N: M is defined to be N: M = {f E A
1
f M ç N} ç A.
Note that N: M is an ideal in A. As in the ideal case (Lemmas 2.3.10 and 2.3.11) we have LEMMA 3.6.11. Let M = (f" ... ule of Am. Then
,Js ) ç
N: M =
Am and let N be any other submod·
,
n
N : (f,).
i=l
Since we have a method for computing intersection of ideals (Proposition 2.3.5 or equivalently Proposition 3.6.8 with m = 1), we only need to show how to compute N: (f) for a single vector f E Am.
CHAPTER 3. MODULES AND GROBNER BASES
158
LEMMA 3.6.12. Let N be a submodule of Am and let f be a vector in Am. Then N: (f) = {a E A 1 y = af E N n (f)}.
Thus we may compute N: (f) by first computing a set of generators for N n (f) and dividing these generators by f using the Division Algorithm. The quotients obtained are then a set of generators for N: (f). EXAMPLE 3.6.13. In Exarnple 3.6.9 we saw that
Mn (Y,) = (h" h2). Note that (9y - 7x 6 + 2x 5 + 25x4 + 7x 3 - 9x 2 - 9X)YI (x 7 - x 6 - 3x 5 + x 4 + 2x 3)YI·
h, h2
Therfore, by Lemma 3.6.12, M: (YI) = (9y-7x6+2x5+25x4+7x3_9x2-9x,X7 -x 6 -3x 5 +x4 +2x 3) çA. Now let Y2 = (y + x 2, y, x 2) E Ao. We wish ta compute M: (Y" Y2). By Lemma 3.6.11, we first need ta compute M: (Y2). We proceed as in Example 3.6.9 using the same term arder, and we find
where
(x 7 - x 6 - 3x 5 + x 4 + 2x 3)Y2 (9y+ 2X6 -7x 5 - 2x4 + 16x 3 + 9x2 - 9X)Y2·
h3 h4
Therefore, by Lemma 3.6.12, M: (Y2) = (x 7 _x6_3x5+x4+2x3,9y+2x6_7x5_2x4+16x3+9x2_9x) ÇA. Now, by Lemma 3.6.11,
Ta compute this intersection, we find the Grübner basis for the ideal (w(9y - 7x 6 + 2x 5 + 25x 4 + 7x 3 - 9x2 - 9x), w(x 7 - x 6 _ 3x 5 + x 4 + 2x3), (1-w)(x 7 _x6 _3x 5 +x4 +2x 3), (1-w)(9y+2x 6 -7x 5 -2x 4 + 16x3 +9x 2 -9x)) of A[w] with respect ta the lex term ordering with w > y > x. The Grübner basis has five polynomials , three of which do not contain the variable w, Ul
x7
U2
9yx -
U3
3y2 -
-
+ x 4 + 2x 3 5x 6 + 4x 5 + 14x4 - 5x 3 - 9x 2 x 6 + 2X5 + x 4 - 2x 3 - 3x 2.
x6
-
3x 5
Therefore M: (Y"Y2) = (UI,U2,U3) ÇA.
3.6. ELEMENTARY APPLICATIONS OF GROBNER BASES FOR MODULES 159
Sorne of the computations we have performed in thîs section can also be done more efliciently using the syzygy module of the matrix [ J, J, J ' where fi E Am. In the next section, we introduce these syzygies, and then, in Section 3.8, we give these applications. ExercÎses 3.6.1. Consider the following 3 vectors in (
J,
= (xy - x,y), J2 = (x, y), J 3 = (y,xy2).
You should do the following without the use of a Computer Algebra System.
a. Let J = (_X,_x 3y2 + 2xy - y2 + y). Show that J E (J"J 2,J3). Moreover 1 express f as a linear combination of fI' f 2, and f 3' b. Let 9 = (-x, x 3y2 + 2xy - y2 + y). Show that 9 ct (J" fz, J3)' c. Clearly we have (J 2' g) ç (J" J 2' g). Prove that this inclusion is strict. 3.6.2. Consider the following vectors in (
J,
= (O,y,x), J2 = (O,x,xy-x), ,,= (y,x,O), J4 = (y2,y,0).
You should do the following without the use of a Computer Algebra System. a. Compute a Grübner basis for M = (J" J2, J 3, J4) with respect to the TOP ordering on (
M = ((x 2 - y, y, xz - y), (xz + x, yx + y, yz + z), (x, 0, x)) and M' = ((-y,y,zx - x 2 _ y), (y2 +y, yx' _ y' +2xy _ y, yx 2 _x 3 +y2 +y),
(x, 0, x), (0, xy + y, -xz - x
+ yz + z)).
Determine whether any of the following holds:
MÇM', M'ÇM, M=M'. 3.6.4. Prove Proposition 3.6.3. 3.6.5. Prove Proposition 3.6.4. 3.6.6. Consider the module M of Example 3.6.1. Determine whether J + M = 9 + M for the following examples: a. J = (2x,y2 X + y2 +yx+2y - 3x,-y+x), 9 = (_y2 +x _y,y3 +
2y2,y2_X). b. J = (y3 + y'
+x -
y, y2
+ y, Y + x), 9 = (x, y3 + 2y2 -
yx - x, 0).
CHAPTER 3. MODULES AND GROBNER BASES
160
3.6.7. Find a Q-basis for the vector spoce A 3 lM for eoch module M in Exercise 3.5.15. 3.6.8. In Theorem 3.6.6 we required that the order be TOP. In this exercise we show that this is necessary. Consider the vectors fI = (y,xy), h = (0, x+1) E (Q[x, y])2. We use the POT ordering on (Q[x, yJ)" with el > e2 and lex on Q[x, yi with x > y. a. Prove that G = {fI' f 2} is a Gr6bner basis and note G n (Q[y])2 = 0. b. Prove that Mn (Q[yJ)" contains a non-zero vectar. 3.6.9. Consider the submodule M of (Q[x, y, z])3: M = ((x, xz, z2), (x, x 2, X
+ y), (y, O,x), (x, 0, z)).
Compute generators for the following modules: a. Mn (Q[x, yI)". b. Mn (Q[y, z])3. c. Mn (Q[x, z])'3.6.10. Prove Proposition 3.6.8. 3.6.11. Compute generators for the intersection of the following two submodules of (Q[x, y])3 : M = ((x,x 2, X + y), (y, 0, x)) and N = ((x 2, xv, y2), (x - y, x, x)).
3.6.12. State and prove the analog of Exercise 2.3.8 for the computation of the intersection of more than two submodules of Am. Use this to compute generatars for the intersection of the following three submodules of (Q[x, y])3 : MI = ((x,x,-y), (x,y,-x)), M 2 = ((x,y,y), (x,x,x)), M3 = ((y,y,y),
(y, x, x)). 3.6.13. Prove Lemma 3.6.1l. 3.6.14. Prove Lemma 3.6.12. 3.6.15. Compute generators for M: N, where M = ((0, y, x), (0, x, xV-x), (y, x, 0), (y2,y,0)) andN= ((y,x,x), (x,y,y)) ç (Q[X,y])3. 3.6.16. Consider module homomorphisms q,: A' ----+ A' and T Am ----> A', such that imb) ç im(q,). A theorem of Module Theory states that there exists a "lifting", that is, a homomorphism ,p: Am ----+ AS such that q, o,p = 'Y; i.e. the following diagram commutes:
a. Show how to compute ,p. That is, show how to compute ,p(ei), i = 1, ... ,m. [Hint: Let {YI"" ,Yt} be a Grobner basis for im(q,) and
3.7. SYZYGIES FOR MODULES
161
apply the method used to solve Task (i) at the beginning of the section.] b. Compute 'IjJ in the following example. Let A = lQ[x, y]. We define 'Y' A ----> A 3 bYi'(l) = (x2y2+x2,y3+xy+y2,xy2_1), and wedefine q,: A3 ---->A3 byq,(ed = (x 2 +xy,y2,xy_1), q,(e2) = (x 2 y_x,y2+ x, xy - x), and q,(e3) = (y2 + X + y, x 2 , X - y). You should first verify that im(!') ç im(q,). 3.7. Syzygies for Modules. We now turn our attention to computing the syzygy module of a matrix [ f, f, of column vectors in Am. This computation is very similar to the computation of the syzygy module of the 1 x s matrix [ fI fs of polynomials in A (see Section 3.4). Recall that in Section 3.1 we defined the map q,: AS ----> Am by q,(h" ... ,hs) = I::~, hifi · As in the case where m = 1 (see Section 3.2), the kernel of this map is called the syzygy module of [ f, f s land is a submodule of AS. More formally we have
1
1
DEFINITION 3.7.1.
f,
fs
Let
J" ...
1is a vector (h"
,fs E Am. A syzygy of the m x s matrix F = ... ,h,) E AS such that s
Lhifi=O. i=l
The set of aU such syzygies is caUed the syzygy module of F and is denoted by Syz(f" ... ,f,) orbySyz(F). In other words Syz(F) = Syz(f l' ... ,f,) can be viewed as the set of all polynomial solutions h E AS of the system of homogeneous linear equations Fh = 0 with polynomial coefficients. That is, if f, = (J11"" '!m'), ... , fs = (J,s>", ,fm,), then Syz(J, , ... ,f,) is the set of all simultaneous polynomial solutions Xl, ... ,Xs of the system
f11X' f21X,
1
fm'X,
+ ... + fIsXs + ... + h,xs
o o
+ ... + fm,Xs
o.
1
As in the case of the syzygy module of a 1 x s matrix [ fI fs of polynomials in A, the computation of Syz(f, , ... ,f,) is done in two steps. We first compute a Grobner basis {g" ... ,g,} for (f" ... ,f,) ç Am and compute Syz(g" ... ,g,) ÇA'. We then obtain Syz(f" ... ,f,) ç AS from Syz(g,, ... ,g,). So let us first start with G = [ g, g, l. As we did in the polynomial case, we !'Ssume that !c(gi) = 1. We follow c10sely the construction we used for the ideal case (see Section 3.4). Let i of j E {l, ... ,t}. Let Im(gi) = Xi and
162
CHAPTER 3. MODULES AND GROBNER BASES
X ij = lcm(X i , Xj). Then the S-polynomial of gi and gj is given by Xij X ij ( ,gj ) = X· gi - X·gj· 8gi ,
By Theorems 3.5.12 and 3.5.19, we have
J
,
8(gi,gj) = Lhijvg v , v=l
for sorne hijv E A, such that
(3.7.1)
We easily see that Sij E Syz(g" ... ,g,). We first state the analog of Proposition 3.2.3. PROPOSITION 3.7.2. Syz(X Xij { Xi
€i -
... ,X,) is generated by " XX ij ej E A , 1 ~,J .. { }} E 1, ... ,8 '
.
j
We now give the analog of Theorem 3.4.1. The proof is identical except that instead of Proposition 3.2.3 we use Proposition 3.7.2. We leave the proofs of both the Proposition and the Theorem as exercises (Exercises 3.7.4 and 3.7.5). THEOREM 3.7.3. With the notation above, the collection {Sij Il :'Ô i < j :'Ô t} is a generating set for Syz(G) = Syz(g" ... ,g,). EXAMPLE 3.7.4. We go back to Example 3.5.23. Recall that the set {g"g2, g3, g4, g5, g6} is a Gr6bner basis with respect to the deglex term order on lQJ[x, y] with x > y and the TOP order on (lQJ[x, yJ)" with e, > e2 > e3, where g,
=
g3
(O,y,x), g2
=
(x, X
1 g5 = (0, xy + 2x
=
+ y, 0), 1
+ 2Y' 0),
(0,y2 -x-y,O), g4
=
(y, -y, 0),
g6 = (0, X
2 -
1 1 =lx - =lY' 0).
To obtain the generators for Syz(g"g2,g3,g4,g5,g6)' we follow Theorem 3.7.3 and 80 we compute and reduce aH S-polynomials. The S-polynomials wc need to compute are 8(g2,g5)' 8(g2,g6)' 8(g5,g6) and 8(g3,g4). For example, sinee
3.7. SYZYGIES FOR MODULES
163
S(93' 94) = Y93 -X94 = 92 + 295' we have 834 = (0, -1, y, -x, -2,0). The other syzygies are computed in a similar way (Exercise 3.7.2) to obtain
(0, x
1
3
+ 2,0,0, -y + 2,1)
1 3 1 (0'-4'0,0,x- 4'-Y- 2)· Note that we have not included 826 because it is in (834,825,856) (See Exercise 3.7.7). EXAMPLE 3.7.5. We consider the submodule M of A3 = ( e2 > e3 and with lex in A with y > x, where 91
= (x 3 + x, x 2 - x, -x),
92
= (x, 1{ + x 2
2
-
x, 0),
-
3x 3
2
93 = (1{+x ,0,0), 94 = (x ,x,1{), 2 x 3 _x 3 ) ' 6 9· = (x 2 9 5 = (x ,~)
-
2x , _x 2
+ 2x
x5
x4
-
1-
+ x 2 + 2x).
The syzygy module SYZ(91,92,93,94,95,96) can be computed using Theorem 3.7.3. For example 3
S(91,93)
Y91 - X 93
(yx - x 5 ,yx2 (yx - x 5
-
yx, -yx)
x 3 , -yx - x 4 + x 3 , -yx)
-
(_x 5
-
2x 3 , -yx - x 4
(_x 5
_
2x 3
+ x 2 , _x 4 + 2X 3 -
X2,
-yx)
+ x 2 , _x4 + 2x3 ,0)
x3
(_X S -
+ x 3 , -yx)
X3 _x 3 ) (X2 ,-,
(0,0,0).
813
= (y
+ x 2 , _x 2 + X, ~x3
- x,x, -1,0).
The other generators of the syzygy module are obtained similarly (Exercise 3.7.3): 825
846
( _x
(x
4
_x
2
5
+ X + 2, _x3 x
3
-
_x 2 , x 3 , y
4x + 2x + 4, x 2
+ x + 3x 4
l
3
-
x
2
-
2
-
2x, x
2
+ x2 -
X -
2, 1)
2
2x, _x + 2x, -
X -
2, y
+ 2).
These are the only generators needed, since the remaining S-polynomials are aU zero.
164
CHAPTER 3. MODULES AND GROBNER BASES
We now consider the computation of SYZ(fl,'" ,fJ, for a non-zero matrix = [ f, f, of column vectors in Am. We first compute a Grübner basis {y" ... ,y,} for (F) and set G = [ y, y, As in the ideal case, there is a t x s matrix S and an s x t matrix T with entries in A sueh that F = GS and G = FT (S is obtained using the Division Algorithm and T is obtained by keeping track of the reductions during Buchberger's Algorithm for modules.) As in the ideal case we compute generators of Syz(y" ... ,y,), say SI)'" ,Sr) and we let Tl,,·. ,T s be the columns of the matrix Is -TB, where l, is the s x s identity matrix. The proof of the following result is simitar to the one for the ideal case (Theorem 3.4.3) and we leave it to the reader (Exercise 3.7.9).
1
F
1.
THEOREM 3.7.6. With the notation above we have
Syz(J, , ...
,f,) = (Ts" ... ,Ts" T"
. ..
,r,) ÇA'.
EXAMPLE 3.7.7. We go back to Example 3.7.5. Recall from Example 3.6.1 that _the original vectors are f, = (xy,y,x), f2 = (x 2 +X,y+X 2,y), 2
fa
= (-y,x,y), f4 = (x ,x,y).
The Grübner basis G for (J',!2,!3,!4) given in Example 3.7.5 consists of the six vectors gi' i = 1, ... ,6. Also, in that example we saw that a basis for SYZ(Y"Y2'Y3'Y4'Y5'Y6) is S13, S25, and S46· Recall also that the matrix T which gives G in tenns of F is given in Example 3.6.1 in Equations (3.6.2) and (3.6.3). Ta compute Syz(J" f2' fa,!4) we use Theorem 3.7.6 and compute TS'3
(0,0,0,0)
TS 25
(0,0,0,0)
TS 46
(y3
+ 2y2x2 _
y2x
+ yx4 _
yx3, _y3 _ y 2x 2 + yx 2 + x4,
+ y 2 x 3 _ y 2 x 2 _ y 2 x _ yx 3 _ x 5 + x 4 + x 3 , _y3 x + y3 _ y2x3 + 2y 2x 2 _ yx3 _ x 4 _ x3).
y3 x
Now we need the matrix S that expresses F in terms of G. We have
[ f,
f2
f3
f
4
1=
[ y,
Y2
Y3
Y4
ys
Y6
1
-1 1 X 0 0 0
0 1 0 1 0 0
0 0 -1 1 0 0 S
0 0 0 1 0 0
3.7. SYZYGIES FOR MODULES
165
Then
[~
14 -TB =
0 0 0 0
0 0 0 0
:o 1. 0
Therefore
In the last example, the rows of the matrix 1, - TB did not contribute to the syzygy module 8YZ(f,,12' J 3, J 4). It is not true in general that the TS i in Theorem 3.7.6 give a complete set of generators for 8yz(F) as the next example shows. EXAMPLE 3.7.8. Let A =Q[x,y] and let J, = (y+2x2+x,y), J 2 = (-y+ x, y), and iJ = (x 2 + X, y) be vectors in A2 Then the reduced Grübner basis G for (J" J2, J 3) with respect to the TOP term ordering on A 2 with e, > e2 and the lex ordering on A with y > X is {9" 92}, where 91
(y+x 2,0)
92
(x 2 + x, y).
Since Im(9,) = ye, and lm(92) = ye2, we see that 8yz(91,92) = ((0,0)). Also, we have [91
92
1= [J, J2 J3 1[
~1 ~
l
'---v----' T
and
[J, fz
Jol
=
[91
92
1[~ ~1
n·
~
s
We have
Therefore
8yz(f" fz,
J3)
=
((l, l, -2)) ç A 3.
We conclude this section by showing that the generators for 8yz(9,,··· ,9,) computed in Theorem 3.7.3 (or in the polynomial case Theorem 3.4.1) form a Grübner basis for 8yz(9" ... , 9,) with respect to a certain order which we deline next (see 8chreyer [Schre]). This result is technical and will only be used in 8ection 3.10.
CHAPTER 3. MODULES AND GROBNER BASES
166
LEMMA 3.7.9. Let 911 ... ,Yt be non-zero vectors in Am and let < be a term arder in Am. We define an order < on the monomials of At as follows:
X e· < Y e· ,
{::=?
J
Im(Xgi) < Im(YgJ) or { Im(Xg ) = Im(Ygj) and j < i. i
Then < is a term oredering on A rn . The reader should note that when Im(Xg i ) = hn(Ygj) in the lemma, we have X ei < Yej when j < i, that is, the j and i are reversed. Note that the hypotheses of the lemma do not require that {g"", ,g,} be a Grübner basis. DEFINITION 3.7.10. The term order defined in Lemma 3.7.9 is called the arder on At induced by [ g, g, (and of course, implicitly, by the term order < on Am).
1
PROOF OF LEMMA 3.7.9. We first show that < is a totaJ arder. Let X, Y be power products in A. First, if i of j E {l, ... ,t}, then either Im(Xg i ) = hn(Ygj) and one of i < j or j < i holds, or one of Im(Xg i ) < Im(Ygj) or Im(Yg j ) < Im(Xg i ) holds. In any case, one of Xei < Yej or Yej < Xei holds. If i = j E {l, ... ,t}, and X of Y, then one of hn(Xgi ) < Im(Ygi) or Im(Ygi) < Im(Xg i ) holds, for otherwise, if Im(Xg i ) = Im(Ygi)' then
=
X lm(gi)
and hence X Yei
<
= Y,
since Yi
Im(Xg i )
i-
= Im(Ygi) = Y lm(gi)'
O. Therefore, we have one of X ei < Yei or
Xei·
We now verify that < is a term arder as defined in Definition 3.5.1. Let X, Z be power products in A such that Z of 1. Let i E {l, ... , t}. Then Im(Xg i ) < Zlm(Xg i ) = Im(ZXg i ), and hence Xei < ZXej. Finally, let X, Y, Z be power products in A, and let i,j E {l, ... ,t}. Assume that Xei < YeJ' If Im(Xg i ) < Im(Ygj)' then Im(ZXg i ) = Zlm(Xg i )
and hence ZXei
< ZYeJ . Iflm(Xg i )
Im(ZXg i )
=
Zlm(Xg i )
<
Zlm(Yg j ) = Im(ZYg j ),
= Im(Yg j ) and j
=
Zlm(Yg j )
<'i, then
= Im(ZYg j )
and j < i, sa ZXei < ZYej. 0 EXAMPLE 3.7.11. We first consider the case where m = 1 and polynomials 91 = x 2y2 _x 3y, 92 = xy3 _x 2y2, and 93 = y4_X 3 . We use the lex term ordering in lQI[x, y] with x < y. In the notation of Lemma 3.7.9, we have
since
lp(xg,)
= lp(yg,) = x'y3 < xy4 = lp(xg3)'
Note that < is neither TOP nor POT as defined in Section 3.5.
3.7. SYZYGIES FOR MODULES
167
EXAMPLE 3.7.12. Let us use the six vectors gi' i = l, ... ,6 of Example 3.7.5. These vectors deterrnine a term ordering on A6 as described in Lemma 3.7.9. In order ta distinguish the basis vectors ei in A3 and A6, we will use e3i and e6i for the ith basis vector in A 3 and A 6 respectively. In A 3 we are using lex with y > x and TOP with e33 < e32 < e3'· Using the order induced by [ g, g6 ] we have xe66
<
e62
<
xe64
<
ye61,
since lm(xg6) = x6e33, lm(g2) = ye32, lm(xg4) = xye33, and lm(yg,) = yx3e3" and
THEOREM 3.7.13. Let G = {g" ... ,g,} be a Griibner basis. With the notation of Theorem 3.7.3, the collection {Sij Il Si < j S t} is a Griibner basis for Syz(g" ... ,gt) with respect ta the term arder < on monomials of At induced by X [ gl gt ]. Moreover, lm(sij) = ; : ei for each 1 Si < j S t. PROOF. We first prove that for 1 S i < j S t, we have 1m( Sij) Note that we have lm
(X) = (X.) = x~: gi
lm
x~ Yj
Xij.
Therefore
since i < j. Now let Xe, be a monomial that appears in (h ij " ... ,hij ,). Then
lm(Xg,) S lm(S(gi' gj)), by Equation (3.7.1). But 1m(S(gi,gj)) < lm
(
X':
X gi ) ,
X Xi We now show that {Sij Il Si < j S t} is a Grübner basis for Syz(g" ... ,g,) with respect ta < . Let S E Syz(g" ... ,gt). By Defintion 3.5.13 we need ta show that there exist i,j such that 1 S i < j Stand lm(sij) divides lm(s). First we write S = I::~, ale"~ where a, E A. Let Ye = lp(a,) and Cp = lc(a,). Note that we have lm(s) = Y,ei for sorne i E {l, ... ,t}. For this i define
therefore X e~
<
----.!.1..ei.
S
= {R E {l, ...
,t} Ilm(Yeg,)
= lm(Y,gi)}·
We observe that if RES, then R :2: i, by the definition of < . Define a new vector
Since
S
is a syzygy of [ g,
gt ] ,we have
L c,Ye lt(g,) 'ES
=
o.
CHAPTER 3. MODULES AND GROBNER BASES
168
Therefore
S'
is a syzygy of [ It(g,)
It(gt)
1. Noting that the indices of the
non-zero coordinates of s'are in S, we have, by Proposition 3.7.2, that s' is in the
eme E Ale, Xt submodule of A t generated by B = {Xem --ee - mES, C < m } . m Xe X m Thus we have , '\' Xem) , S = L atm (Xem X-ei- Xem l,mES
l
m
e<m for sorne aem E A. Now Im(s') = Im(s) = liei and thus, since j > i for all j oF i E S, we see that
Ciliei
=
It(s') = Llt(aijriJ ei,
where the sum is over all j E S, j immediately that for sorne j E S, desired. 0
,
oF
-?:'
li
Ip(aij)-i7. It follows ei = Im( Si)) divides Im( S') = hn( s) as i such that
=
We might hope that a result similar to Theorem 3.7.13 holds for finding a Grëbner basis for Syz(f" ... ,J,), where the corresponding order on monomials of AS would be the one induced by [ J, J, 1 where {JI"" ,J,} is nat necessarily a Gr6bner basis. We saw in Theorem 3.7.6 how to compute generators for SYZ(f1l'" ,/s)' These generators do not farm, in general, a Grëbner basis with respect ta the arder induced by [ JI J, (see Exercise 3.7.15). So, ta obtaln a Grëbner basis for Syz(fl,'" , J,), we would use the algorithm presented in Section 3.5 (Algorithm 3.5.2) starting with the generators of Syz(f" ... , f,) given in Theorem 3.7.6.
1
Exercises 3.7.1. Let J,fI,··· ,Js E k[XI,'" ,xn]. Show that J E (fI, ... ,J,) if and only if in the Grëbner basis, G, of Syz(f, fI, ... , J,) with respect to the POT ordering with the first coordinate largest, there is a vector (u, Ul, ... ,us) E G with u oF 0 and u E k. In this case we obtain J = -~ I:;~I Udi' 3.7.2. Complete the computations in Example 3.7.4. 3.7.3. Complete the computations in Example 3.7.5. 3.7.4. Prove Proposition 3.7.2. 3.7.5. Prove Theorem 3.7.3. 3.7.6. State and prove the analog of Theorem 3.2.5 in the module case; that is, give a definition of Gr6bner bases for modules in terms of the syzygy module of the leading terms. 3.7.7. State and prove the analog of Corollary 3.3.3, and use it to describe how crit2 can be implemented in Aigorithm 3.5.2. Note that we have already seen in Exercise 3.5.16 that critl cannot be used in the module case. 3.7.8. Use Exercise 3.7.7 to state and prove the analog of Exercise 3.4.4 in the module case (generalizing Theorem 3.7.3).
3.7. SYZYGIES FOR MODULES
169
3.7.9. Prove Theorem 3.7.6. 3.7.10. Compute generators for Syz(f" f2, f3, f 4) in thefollowingexamples. You should do this without the use of a Computer Algebra System. a. f, = (xy+y,x), f2 = (-y+x,y), f3 = (x,y+x), f4 = (-x,y) E (Q[X,y])2 b. f, = (xy - x,y), f2 = (x 2 - y,x), f3 = (x 3 - x,y + x 2), f4 (_X+y2,y) E (Q[X,y])2 3.7.11. State and prove the analog of Exercise 3.4.3 in the module case. 3.7.12. Consider the following analog of Exercise 3.4.6. Let f"", ,1,,9 E Am. We consider the linear equation hd, +h2!2
+ ... +h,f, =
9,
with unknowns h" ... , hs E A. Let S ç A' be the set of all solutions (h" . .. ,hs). a. Prove that Sis not empty if and only if 9 E (f"", , f,). b. Prove that if S # 0 then S = h+Syz(f"", ,f,) = {h+s 1 s E Syz(f"", ,fs)}, where h is a particular solution. Give a method for computing h. c. Use the above to find the solution set for the equation h , (x+y, y, x) +h 2(x, y, x) +h3( -x, -x+y, x) +h4(X, X, y) = (y, 0, x 3 ).
3.7.13. Consider 9, = (x 2,y), 92 = (xy + y,y3), and 93 = (x 2 ,_x+y3) E (Qlx, y])2 We use the lex order on Q[x, y] with x > y and the POT ordering on (Q[X,y])2 with e, > e2. Consider the vector f = (xy2 + y3,x 2y + yx 2,y3 + x 2y) E (Q[x,y])3. Write f as the sum of terms in descending order according to the order on (Q[x, y])3 induced by [91 92 93 1. Repeat the exercise using deglex with x > y on Q[x, y]. 3.7.14. Verify that the following vectors form a Grübner basis with respect to the indicated term arder, compute generators for Syz(gl' ... ,gt), and verify Theorem 3.7.13. a. 91 = (0,x 2), 92 = (y,x), 93 = (2x, x), 94 = (0,2y+x). Use the deglex order on Q[x, y] with y > x and the POT ordering on (Q[x, y])2 with el
>
e2.
= (0, x 2),
(y2 - y, x), 93 = (0, xy), 94 = (x - y, x - y), 95 = (0, y2). Use the deglex order on Q[x, y] with x > y and the POT ordering on (Q[x, y])2 with e, > e2. 3.7.15. We mentioned at the end of the section that a result similar to Theorem 3.7.13 does not hold for the generators of Syz(f "" . ,1sl obtained in Theorem 3.7.6. Namely, the generators for SYZ(f"", ,l,) obtained in Theorem 3.7.6 do not, in general, form a Grübner basis for Syz(f " ... ,f,) with respect to the order induced by [ f"", ,f, Consider the vectors b. 91
92
=
1.
f, = (x+y,y,x), f 2 =(x-y,x,y), f 3 =(x+y,x,y),
170
CHAPTER 3. MODULES AND GROBNER BASES
J4
= (-X+y,y,X), J 5 = (X,X,X)
E (lQIlx,y])3.
a. Verify that the reduced Grübner basis for (fI' J 2, J3' J4' h) with respect ta the TOP ordering with e1 > e2 > e3 and lex on IQIlx, y] with y > x is given by the vectors 91
= (O,y,x), 92 = (O,O,y - x), 93 = (O,x,x), 94
= (y, 0, 0), 95 = (x, 0, 0).
b. Verifythat SYZ(91,92' 93,94,95) = ((-x,-x,y,O,O),(O,O,O,-x,y)). c. Verify that Theorem 3.7.6 gives Syz(fp J 2,!3, J4' h) = (( -x, -x, -x, -x, 2y+2x), (y,x, -x, -y, 0)).
d. Verify that the two vectors given in c do not form a Grübner basis with respect to the order induced by [ J 1 J 2 J 3 J 4 3.7.16. Let M = (f1"" ,J,) be a submodule of A=. Assume that we have a generating set for Syz(f1"" ,J,), say Syz(f1"" ,J,) = (S1,'" ,s,) ç AS. Now consider vectors gi = 2..:;=1 aijfjl for sorne aij E A, for i =
hl·
"1, ... ,t. a. Use Theorem 3.5.22 to give a method to decide if M = (91"" ,9,). If so, give a method ta find bij E A such that Ji = L~~1 bij 9j' Use
the praof of Theorem 3.7.6 to find a generating set for 8YZ(91 , ... ,9,). (Note that the proof of Theorem 3.7.6 has nothing to do with the theory of Grübner bases.) b. Apply the methods given in a ta the following example. Consider J 1 = (xy,y,x), J 2 = (x,y+x,y), fa = (-y,x,y), and J4 = (x,x,y) E (lQIlx, yJ)3. It is easy ta verify that 8yz(f 1, J 2,! 3' J 4) = ((y3 + y2x, _y3 _y2x+yx2 +x3, y3 x _ yx2, _y3 x + y3 _yx 2 _x 3)). Now consider the polynomials 91 = (0, y, 0) = J2 - J 4,92 = (-y-x, 0, 0) = fa - J4' 93 = (x, x, y) = J 4, 94 = (_x 2, 0, x) = J1 - J 2 + XJ3 + (-x + I)J 4' 95 = (0, x 2, _x 2 +x) = (-x-y+ I)J 1 + (x+y-l)J 2 + (-:yx+x)fa + (yx-y-x+l)J 4, and 96 = (_x 2, _X 2,X3) = (x 2 _y2+y)J1 +(-2x 2+ y2 _ y)J2 + (yx 2 - xy2 +xy)fa + (_x 2y + xy2 + 2X2 - y2 - X +y)J4. You should first verify that M= (f1,J2,!3,!4) = (9u92,93,94,95,96)'
3.7.17. (Müller IMa90]) In this exercise, we give a more efficient way ta solve the prablem raised in Exercise 3.7.16. We have the same hypotheses as in Exercise 3.7.16: we are given a generating set {J 1"" ,J'} for a submodule M of A= (not necessarily a Grübner basis), a generating set {S1'''' ,s,} ç A' for 8yz(f1'''' ,!,), and a collection of vectors {91"" ,9,} such that 9i = L;~1 aijJ j , for some aij E A, for i = 1, ... ,t. As in Exercise 3.7.16, we wish to determine whether M = (91"" ,9,) and if 80 to find generators for SYZ(gl' ... ,9t). Let esi, i = 1, ... ,S and
3.8. APPLICATIONS OF SYZYGIES
171
etj,j = 1, ... ,t be the standard basis for AS and At respectively. Finally, let ai = (ail, ... ,ais) E AS, for i = l, ... lt. a. Let W = {(u,v) E AS+' 1 u = (u" ... ,us) E A',v = (V" ... ,Vt) E At,~:~, ud i = ~;~, v,g). Prove that
{(a" etl),· .. ,(at, ett), (s" 0), ... ,(se, O)} generates W b. Prove that M = (g" ... ,gt) if and only if there are b" ... ,b, E At such that (esi, bi ) E W, for i = 1, ... ,s. c. Fix an order <s and an order
Prove that M = (g" ... ,gt) if and only the reduced Grobner basis for W with respect ta < is the union of the two sets {( esi, bd 1 i = 1, ... ,s} and {(O,ti) 1 i = 1, ... ,C}. Prove that the set {(O,t.,) 1 i = 1, ... ,C} i8 a generating set for Syz(g" ... ,gt). d. Redo the computation in Exercise 3.7.16 b using this method. One can think of the vectors a, as the rows of a matrix A which defines a linear transformation T: M ----} M. This exercise answers the question of whether T is anto and whether there is a linear transformation T': M ---). M defined by a matrix B sueh that (T' 0 T) (M) = M (B is an "inverse" of Ain the sense that BA-ls ç Syz(J" ... ,f,), where ls is the sxs identity matrix). The method presented here is a generalization of a linear algebra method ta compute the inverse of the matrix of a linear transformation. In !inear aigebra over fields, ta compute the inverse of an s x s matrix A, one reduces the matrix [ Ails J ta the row reduced echelon form [ Is 1 B ] , where ls is the s x s identity matrix. The matrix B is then the inverse of A. In the case of a module M, we use the Grobner Basis Aigorithm ta transform the matrix
iuto the matrix
[~ Syz(g,~ .. ,g.,) ] . 3.8. Applications of Syzygies. The purpose of this section is to reconsider and solve more efficiently various problems, such as computing the intersection of submodules of Am (where, again, A = k[x" ... ,xnJ for a field k). These problems were solved previously using elimination. As we have noted before,
172
CHAPTER 3. MODULES AND GROBNER BASES
the lex term ordering (or more generally, any elimination ordering) is very inefficient from a computational point of view. Indeed, mueh effort in the theory of Grobner bases has been expended in order ta avoid the use of elimination in computations involving Grobner bases. Here, we will show how the computation of intersections, ideal quotients and kernels of homomorphism.s can aIl be done using syzygies, where the latter may be computed using any term order whatsoever. Even in A (when m = 1) these computations using syzygies tUTU out to be more efficient than using eliroination orders in A. We begin by considering the simplest case, the intersection of two ideals in A. This case, nevertheless, cantains aIl of the essential ingredients of the rnost general situation. Let land J be ideals in A. Assume that I = (fI, ... , J,) and J = (YI, ... ,Yt) (we will not assume that either {J" ... , J,} or {YI, ... ,Yt} is a Grübner basis). Then a polynomial h is in I n J if and only if
h=
ad, + a2h + ... + a,J, and h = b'Yl + b2Y2 + ... + bty"
for sorne polynomials al, ... ,as, bl , ... ,bt E A. This is the same as the two conditions that (-h, a" ... , a,) is a syzygy of the matrix [1 fI J, ] and (-h, b ... , b,) is a syzygy of the matrix [1 91 Yt ]. It is then easy to put"these two conditions into a single condition for syzygies of vectors in A 2 Let i
= [ ~ ] , f, = [ ~ ] , ... , f s = [
Then it is easily seen that h is in I
~ ] ,g, = [ ;, ] , ... ,gt = [ ~ ].
nJ
if and only if there are polynomials
al,· .. ,as, bl, ... ,bt E A such that
(381) is a syzygy of
~ o fI
[i f,
gt ] = [
J,
o
0 YI
o ].
Yt
Thus we have essentially shown PROPOSITION 3.8.1. Usiny the notation above,
In J
=
{h
1
there exist polynomials al, ... 1 as, bl , ... ,ht such that fs
YI
gt ]}.
Moreover, ifh l1 ··· ,hr is a generatingsetforSyz(i,I I , ... ,IS,g!1'" ,gt) and the first coordinate oJ hi is hi Jor 1 :0: i:O: r, then {h" ... , h r } is a generatiny set for I n J.
3.8. APPLICATIONS OF SYZYGIES
173
PROOF. For the first statement we simply note that h E l
nJ
if and only if
-h E In J, and this accounts for the difference between the statement of the proposition and Expression (3.8.1). The second statement follows from the first statement, since if a vector is a linear combination of other vectors, then the first coordinate of the vector i8 a linear combination of the first coordinates of the other vectors. 0 EXAMPLE 3.8.2. We will consider the following ideals in
1= (xy - x - y - l, x 2 + 1) and J = (_x 2 + xy,x 2y + y, y3 + x). We wish to compute l let
n J.
We use the deglex ordering with x > y in A. So we
.=[1] f =[Xy-x-y-l] 1'1 0
't
f =[X2+l] 0
12
'
[y3~X]'
YI = [ -x20+XY] 'Y2 = [ X 2yO+y] 'Y3 =
Then we can compute that the module of syzygies with respect to TOP and e, > e2 is generated by (x 2y + y, 0, -y, 0,-1, 0), (xy3 + xy2 + 3y3 - x 2 + xy + y2 + X + 3y, 2xy 2 + y2 + 2y + l, _2 y 3 + 2y 2 + 1, 2xy 2 - y2 - 3y - l, 2xy _ 2y 2 Y - 3, -1), (x 3 + 2y3 + 2x2 - 2y2 +x + 2y, xy2 - 2xy+y2 +x -1, _y3 +3y2 - x3y-l,xy2 -2xy+x-y+2,xy_ y 2 -2x+2y-2, -1), and (-4xy2 -lOy3 -4xlOy, x 2y2 _x 2y_5xy2 +3xy-4 y 2 - 2x-4y-2, _ xy3 +2xy2 +6y3 -xy-8y2 +4y2, x 2y2 -x 2y-6xy2 +xy+2 y 2 +9y-l, x 2y_xy2 _x 2 -5xy+6y2+x+ 10, -x+4). So the first coordinates of these vectors farm a generating set for In J, that i8,
In J = (x 2y + y, xy3 + xy2 + 3y3 - x 2 + xy + y2 + X + 3y, x 3 + 2y3 + 2x 2 - 2y2 +
X
+ 2y, -4xy2 - lOy 3
-
4x - lOy).
These polynomials do not form a Griibner basis for InJ with respect to the given term order. Computing a Grobner basi8 for this ideal from the given generators, we get the much simpleT generating set
We note that in Proposition 3.8.1, as we just saw, we only obtained a generating set for In J. Of course, we could obtain a Griibner basis for In J by applying Buchberger's Aigorithm to this set of generators. Alternatively we have THEOREM 3.8.3. We use the same notation as above. Assume we have a fixed term order < on A and consider the corresponding POT order on Al+s+t with el targest. Let {h" ... , he} be a Grobner basis for Syz( i, f"" . ,f" YI' ... ,y,). Assume that the first coordinate of hi is hi for 1 SiS r. Then {h" .. , ,he} is a Grobner basis for l n J with respect ta < .
CHAPTER 3. MODULES AND GROBNER BASES
174
PROOF. We need ta show that (lt(h , ), ... ,lt(hr )) = Lt(I n J). One containment is clear, sa let h E InJ, where we may assume that h '" o. From Proposition 3.8.1 we cau choose
h = (h, a" ... ,a" b ... ,b,) "
E
Syz(i,J" ... , 1,,91'··· ,9,).
Since hl, . .. ,hr is a Gr6bner basis for this module, we can find polynomials
C,,··· , Cr such that h = I:~~1 Cihi and Im(h) = max'<:i<:r(IP(Ci) lm(h i )). Then, since the arder is POT, h '" 0, and the fust coordinate is largest, we see that (lt(h), 0, ... ,0) = It(h) = I:lt(G) lt(h i ), where the sum is over al! i such that Im(h) = Ip(ci) Im(h i ). It is then clear that we have lt(h) = I: It(G) lt(h i ) where i ranges ovef the same i's as before. This gives the desired result. D We note that in Theorem 3.8.3, as should be evident from the proof of the theorem, we made essential use of the POT ordering. In particular, the TOP ordering would not work, as an example in Exercise 3.8.3 shows. EXAMPLE 3.8.4. We redo the computation of the previous example. For simplicity we use the Grübner bases for the ideals I and J with respect ta deglex with x > y. We compute that I = (x + y, y2 + 1) and J = (y3 + y, X - y). Sa using
i = [
~
]
,1, =
[ x; y ]
,12 =
[
y2; 1 ],91 = [ y3
~y
] ,92 = [ X
~y
] ,
we compute a Grübner basis for SYZ(i,J,,/2,9,,92) with respect ta POT with e, > e2 > e3 > e4 > es, and obtain (0,0,0, x-y, _y3_ y), (0, y2+1, -x-y, 0, 0), (y3 +y, 0, -y, -1, 0), (xy2 +x, 0, -x, -1, _yZ -1), (x 2 _y2, -x+y, 0, 0, -x - y). Reading off the first coordinates of these vectors, we obtain the same Grübner basis we did in the previous example. At this point it is convenient ta compactify the notation. Let H, be an SI x t , matrix and H 2 be an S2 x t 2 matrix. We define the (SI + 82) X (t, + t2) matrix H, H 2 (called the direct sum of H, and H 2 ) ta be the matrix whose upper left-hand corner matrix is the 81 x tl matrix Hl, whose lower right"':hand corner matrix is the 82 x t 2 matrix H2 and the Test of whose entries consist entirely of zeros. Thus, for example,
EIl
[fIh
h]EIl[::] 14 93
[hho 14h 0 0 0
0 0
1
J.]
92 93
EIl
Similarly, let H" ... , Hr be Si x t i matrices for <:: i <:: T. We define Hl H 2 EIJ Hr ta be the (SI + ... + Sr) X (t , + ... + tr) matrix with the matrices H l1 ... ,Hr clown the diagonal and zeros elsewhere. FUrther, let Hl,'" 1 Hr be matrices where, this time, each Hi is an s x t i matrix for 1 <:: i <:: T. We define [ H,IH21·· ·IHr J as the S x (t , + ... + q
... EIl
3.8. APPLICATIONS OF SYZYGIES
175
matrix whose first t l columns are the columns of Hl, whose next t 2 columns are thecolumns of H2, etc. For example, using the notation above in the discussion of l n J, let F be the 1 x s matrix [ fI J, J and let G be the 1 x t matrix [ g, g, J. If we let
H = [ ilF EIl G
J=
[~ ~
Js
o
0 g,
then the set of first coordinates of Syz(H) is l n J. As a second illustration of this notation we consider the intersection of more than two ideals. So let h, h, ... ,Ir be ideals in A, Ij = (fj1 ,ij2, ... ,ij")' and define the 1 x t j matrix Fj = [fJ, fJ2 fJ" for 1 " j Let i be the r x 1 matrix (colunm vector) ail of whose entries are 1. Set H = [ ilF, EIl F 2 EIl ... EIl Fr J . Then, in exactly the same way as above, we see that the set of first coordinates of a generating set for Syz (H) is a generating set for ft n h n ... n Ir, and if the POT order is used on A l+tl +t2+··+tr ta compute a Grübner basis for Syz(H), then the set of first coordinates is, in fact, a Grübner basis for h n h n· .. n Ir. EXAMPLE 3.8.5. Let A =
"r.
J,
H=
1 'x-y 1 0 [ 1 0
y-z 0 0
z-x 0 0
0 x-l 0
0 y
°
0 0 y+l
0 0 x+l
° ].
z-l
We then compute, using the TOP ordering on A 3 with el > e2 > €3, that Syz(H) = ((0,1,1,1,0,0,0,0,0), (xy - y2, 0, y, y, -y, y-l, y, -y, 0), (x 2 y2 _ X + y, 0, y + z - 1, x + z - 1, -X, Y - 1, Y + z - 3, -x - z + 3, x - y), (y2 Z - yz2 - y2 + yz, 0, -yz + y, 0, 0, -yz + z2 + Y - z, -yz + y + 2z - 2, 0, yz-y-2), (xz+y2-2yz+y-z, 0, -y+z-l, z, -z, -y+2z-l, -y+z+l, -z, y + 1), (0,0, x - z, y - z, -y, x-l, x - z + 2, -y + z - 2, -x + y)), from which we read off that
h nh nh
=
(xy_y2,x 2 _y2 _x+y,y2z_yz2 _y2 +yz,xz+y2 -2yz+y-z).
We note that this is not a Grübner basis for h n h n h. We next consider the case of the intersection of tWQ submodules M, N of Am. Assume that M = (f" ... ,fs) and N = (91"" , 9,)· Then a vector h E A= is in M n N if and only if
h = ad , + a2f 2 + ... + ad, and h = b 9, + b2 92 + ... + b,9" ' for sorne polynomials al, ... ,as, b1 , ... ,bt E A. This is the same as the two conditions that (-h,a" ... ,as) is a syzygy of the matrix [ I=lf ... ,f, J and " (-h,b 1 , ... ,b,)isasyzygyofthematrix [1=191'''' ,9, J (here,I=denotesthe m x m identity matrix, and (-h, al, ... ,as) is the vector in Am+s whose first
176
CHAPTER 3. MODULES AND GROBNER BASES
m coordinates are those of -hl. It is then easy to put these two conditions into a single condition for syzygies of vectors in Am.+s+t. Let
(sothat7 J= t [Imllm ]),andletFbethemxsrnatrixF= [J, J,l and G be the m x t matrix G = [ 91 9, 1. Set H = [ JIF EIl G 1. Then, as in Proposition 3.8.1, we see that the set of vectors h which are the first m coordinates of vectors in Syz(H) is Mn N. Moreover, the set of vectors which consist of the first m coordinates of each of the vectors of a set of generators of Syz (H) is a generating set for M n N. EXAMPLE 3.8.6. We again let A = lQI[x, y, z] and consider the terrn order deglex with x> y > z. Let M = ((x-y, z), (x, y)), and N = ((x+1, y), (x-l, z)). 80 we set 0 1 0 x-y x H=
0 1 [ 1 0 o 1
z
y
0 0
0 0
0 x+1 y
Then, using the TOP ordering on A 4 with el > e2 > e3 > €4, we obtain Syz(H) = ((xy-2x+y, y2_y_z, l, -y+1, -y+1, 1), (x 2z-2x 2+xz+4x-2y, xyz - xy - xz - y2 +yz + 2y+ 2z, x - 2, -xz +x+y - z - 2, -xz +x +y - 2, x - y - 2)), from which we read off that
MnN = ((xy - 2x +y,yZ -y- z), (x 2z - 2x2
+ xz + 4x -
2y, xyz - xy - xz - yZ
In general, if we have r submodules matrices Fi (1 ~ i ~ T) and
H
= [ , [
Mi
Imllml·· ·IIm
+ yz + 2y + 2z)).
generated by the columns of the m x t i
llFl EIl Fz EIl··· EIl F, l,
r copies
where again lm denotes the m x m identity matrix, then the set of vectors h which are the first m coordinates of vectors in Syz(H) is the intersection of the Mi's. Moreover 1 as in the case of the intersection of two modules, the set of vectors which consists of the first m coordinates of each of the vectors of a set of generators for Syz( H) is a generating set for the intersection of the r modules MiWe now turn to the ideal quotient. This can be computed as a special case of the computation of intersection of ideals as we did in Section 2.3. On the other hand it is easy to recognize it directly as a syzygy computation. So let l = (f" ... ,f,) and J = (91, ... ,9,) be ideals in A. Recall that I: J = {h E 7For a matrix S, we denote its transpose by tS.
3.8. APPLICATIONS OF SYZYGIES
177
AI hJ ç; I} = {h E A 1 hgi , i = 1, ... ,t, is a linear combination of h,··· ,J,}. Let 9 be the column vector 9 = (g"", ,g,) and let F be the row vector F = [h J, 1. In this case, we set H = [ glF 8l F 8l ... 8l F t
1.
copies
Then 1: J is the set of ail first coordinates of Syz(H). EXAMPLE 3.8.7. We let A = Ql[x, y] and consider the term order lex with x> y. Let l = (x(x + y)2, y) and J = (x 2, X + y). Now consider
H
= [x
2
x+y
x(x + y)2
° ° Y
0 X(X+y)2
°1
Y
.
Then using the TOP ordering on A2 with e, > e2, we compute Syz(H) = (( ~y, 0, x 2 , 0, x+y), (~xy~2y2, y, ~xy2, 0, x 2 +3xy+2y2), (X2+xy~y2, ~x+y, ~ xy2, ~ 1, xy + y2)), from which we read off that
Computing a Grübner basis for this ideal (or by simply looking at it), we obtain I: J = (y, x 2 ), in agreement with the computation of the same ideal in Example 2.3.12. We willleave the computation of the ideal quotient of two submodules of A= (Definition 3.6.10) to the exercises (Exercise 3.8.5). We now consider modules which are given by a presentation as defined in Section 3.1. Let J"", ,J, be in Am, let N = (f"", ,J,) and consider M = A m IN. We wish to show how to compute the intersection of two (or more) submodules of M, find the ideal quotient of two submodules of M and find the annihilator of M. We begin with the latter as it is a special case of what we have already done. We define the annihilator of M to be the ideal
ann(M)
= {h
E A 1 hM
= {O}}.
Since M = Am IN we see that ann(M) = N: Am. Although we have relegated the computation of the ideal quotient of two modules to the exercises, in this special case it is easy to write clown the correct matrix whose syzygies allow us to computeann(M), and we will do so now. We simply observe that A= is generated by the usual standard basis e" ... ,em and so h E ann(M) if and only if hei EN for all i = 1, ... ,m. Thus letting F = [ J, J, (an m x s matrix), and H = [ , [ 'e,l" ·l'em llF EB F 8l ... 8l F
1
l,
m copies
CHAPTER 3. MODULES AND GROBNER BASES
178
we see that ann(M) is the set of ail fust coordinates of Syz(H). (We note that the matrix t [ tell-" 1 te m ] is the m 2 x 1 matrix of the vectors ei stacked up on top of each other.) EXAMPLE 3.8.8. We consider N = ((x 2 + y, xz - y), (xy - yz, z - x)) C A 2 and let M = A2 IN. SO in this case
0
x2+Y xz - y 0
xy - yz z- x 0
1
0
0
1
o H=
[
o o
~
x 2 +y xz-y
xy yz z-x
J.
Then using TOP with el > e2 > e3 > e4 and degrevlex with x > y > Z, we see that Syz(H) = ((x 2yz - xyz2 + x 3 - xy2 - x 2z + y2z + xy - yz, -x + z, -xz + y, -xv + yz, x 2 + y)), from which we conc1ude that
ann(M) = (x 2yz - xyz2 + x 3 - xy2 - x 2z + y2z + xy - yz). We now consider two submodules M M 2 of M = AmiN and we wish to " compute their intersection. There are submodules K tl K 2 of Am. sueh that Ml = K,fN and M 2 = K,fN. We note that, K, = {h E Am 1 h + N E Md, or alternatively if M, = (g, + N, . .. ,g, + N) then K,
= (g"", ,g,) + N = (g"", ,g" f"", ,t,).
We wish to compute M, n M2. Clearly M, n M 2 = (K, n K 2 )IN. Since we know how ta compute KI n K 2 we can compute Ml n M 2 . Alternatively, continuing to use the notation above, and setting M 2 = (h, + N, ... , h r + N), we see that a coset p+N E M, nM2 if and only ifthere are a" ... ,a, E A and b ... , br E A " such that r
t
p+N= Lai(gi+N) andp+N= Lbi(hi+N). i=l
i=l
This last statement is eQuivalent to t
s
r
$
p = Laigi + Lajf j and p = Lbihd Lbjf j i=l
j=1
,
j=1
i=l
for sorne al, ... ,at)a~, ... ,a~,bl, ... ,br,b~, ... b~ E A. Thus, setting
F = [
f,
f, ] , G =
[
g,
g, ] ,H = [ h,
and lm the m x m identity matrix, we see that the set of aIl sueh p's is just the set of first m coordinates of Syz( 5), where 5 = [ ' [ Iml1m
]1 [ GIF ] El)
[
HIF ] ].
The same sort idea will allow one to construct M,: M 2 (Exercise 3.8.8). We now go on to consider the following Question. Let N C M be submodules of Am. We would like to determine a presentation of MIN (see Definition 3.1.4).
3.8. APPLICATIONS OF SYZYGIES
179
Sa assume that M = (f" ... ,J,) and N = (g" ... ,g,), where we do not assume that either generating set is a Grübner basis. We define an·A-module homomorphism
q,:
A'
---.
MIN
€i
>----+
Ji + N
(for 1 ::; i ::; s). It is clear that q, maps A' onto MIN sinee M = (f" ... ,J,). Let K = ker(q,). Then the desired presentation is A'1K ~ MIN. SA we need ta compute an explicit set of generators for K. We note that h = (h" ... ,h,) is in K if and on1y if hrf 1 + ... + h,J, is in N, which, in turn, is true if and only if there are polynomials al, ... ,at E A Buch that
Let H = [
J,
J,
g,
g,
J.
We have proved THEOREM 3.8.9. With the notation abave, let Plo ... ,PT E A,+t be a generating set for Syz(H), and let hi E klxl, ... ,xn ]' denote the vector whose coordinates are the first s coordinates of Pi' 1 :S i ::; r. Then
K = (h" ... ,hT ). 3.8.10. Let A = Qlx,y], M N = (( _x , y), (0, Y + x 2 )). Then, sinee EXAMPLE
= ((xy,y), (-y,x), (x 2 ,x))
C A 2 and
3
we see that N
(_x 3 ,y)
=
(xy,y)+x(-y,x)-x(x 2 ,x)and
(0, y + x 2 )
=
(xy, y)
+ x( -y, x),
c M. Sa ta determine K C A 3 such that MIN _y
x2
_x 3
X
X
y
y:x 2
~
A3IK, we set
]
and compute
Syz(H) = ((0,0, x, 1, -1), (1, x, 0, 0, -1), (-x, 0, y, 0, 0)) (this last computation was done using TOP with e, Thus
> e2 > and lex with y > x).
K = ((0,0, x), (1, x, 0), (-x, 0, y)). Therefore MIN ~ A3IK.
CHAPTER 3. MODULES AND GROBNER BASES
180
We close this section by generalizing this last result to the case of an affine algebra. In this case we let A = k[x" ... , xnlf1, for an ideal 1 = (d" ... , de) c A. We again consider Ne M ta be submodules of ATn, but now, of course, the coordinates of the elements of Am are cosets of 1. 80, for a vector b = (b" ... , bm ) E k[x" ... ,xn]m, we set b= (b, +1, ... ,bm+I) E Am. 80 we may assume that N = (y" ... ,y,) for 9,,·.· ,9t E k[x" ... ,xn ]'" and M = (1" ... ,lsl for !" ... ,!, E k[x" ... , xn]m. We deline an A-module homomorphism >:
A'
--.
MIN
ei
>--->
li + N
(for 1 <; i <; s). It is clear that > maps A' onto MIN sinee M = (1" ... ,t,). Let K = ker(». Then the desired presentation is A' 1K '" MIN. 80 we need ta compute an explicit set of generators of K. We note that li = (h, +1, ... , h,+1) is in K if and only if (h, + I)I, + ... + (h, + I)1, is in N, which is true if and only if there are polynomials a" ... , at E k[x" ... , x n ] such that
(h,
+ I)I, + ... + (h, + I)1,
= (a,
+ 1)?h + ... + (ad I)Yt·
This last statement is readily seen to be equivalent to the statement that every coordinate of
hrf,
+ ... + h,!, -
(a'9,
+ ... + at9t)
is an element of 1. Thus in this case, we let F = [ 9, 9t and D = [ d, de and set
1'
1
H = [
[!,
!,
l,
G
FIGID EIl DEll· .. EIl Dl· m copies
We have proved THEOREM 3.8.11. With the notation above, let Pl""
,Pr E k[ Xl,···
,X n
]'+t+m'
be a generating set of 8yz(H), and let hi E k[x" ... , Xn]' denote the veetor whose coordinates are the first s coordinates of Pi (1 <; i <; r J. Then
We note that ail the computations that we have done in this section could be formulated using Theorem 3.8.11 (see Exercise 3.8.10). EXAMPLE 3.8.12. We will redo Example 3.8.10, exeept that we will now assumle that the ring is A = Q[x,ylf1 where 1 = (x 3 +y2,x 2 ). 80 now M = ((xy,y),(-y,x), (x2 ,x)) C A 2 and N = ((_x3 ,y), (Ü,y +X2)). We have that Ne M as before. 80 to determine K C A3 such that MIN", A 3 IK, we set _y
x2
X
X
3.8. APPLICATIONS OF SYZYGIES
181
and compute Syz(H) to be generated by (0, 0, x, 1, -1, 0, 0, 0, 0), (1, x, 0, 0, -1,0,0,0,0), (0,0,0,1, -1,0, x, 0, 1), (-x, 0, 0, 0, x, 0, y, 0, -x), (0, y, 0, 1, -x - 1, 1,0,0, x + 1), (-x, 0, y, 0, 0, 0, 0, 0, 0), and (0, 0, 0, 0, y - x 2, 0, 0, -1, x 2 +x) (tbis last computation using TOP with e, > e2 and lex with y> x). Thus K = ((0,0, x), (1, x, 0), (-x, 0, 0), (0, y, 0), (-x, 0, y, )). ExercÎses 3.8.1. Redo the computation of the intersection of the ideaJs given in Example 2.3.6, Exercises 2.3.6 and 2.3.7 using the technique of syzygies presented in this section. 3.8.2. Redo the computation of the intersection of the modules given in Example 3.6.9, Exercises 3.6.11 and 3.6.12 using the technique of syzygies presented in this section. 3.8.3. In Theorem 3.8.3 we required that the order be POT. In tbis exercise, we show that the TOP ordering would not give rise to a Grübner basis for the intersection of two ideals. Consider the ideals l = (x 2, yx + x, xy2 +y) and J = (y2, X - xy, x 2 - y) of Q[x, y]. Compute generators for In J by computing generators for the syzygy module of
2 x xy+x 10
H=[l
°
° 2°-y 1 ° y°2 x-xyx
xy2+y
using the TOP ordering on (Q[x, y])2 with e, > e2 and deglex on Q[x, y[ with y > x. Show that these generators do not form a Grübner basis for InJ with respect to deglex with y> x. One might think that the problem comes from the fact that the generators for l and J above did not form a Grübner basis with respect to deglex with y > x to start with. This is not the case as is easily seen by repeating the exercise with the ideals 1= (x + y, x 2 + 1) and J = (y2, x 2 - y) using the same orders as above. 3.8.4. Redo the computation of the ideal quotient given in Example 2.3.12 and Exercise 2.3.11 using the technique of syzygies presented in this section. 3.8.5. Give the analog of the matrix H UBed to compute generators for 1: J, where land J are ideals, in the module case. That is, find H such that M: N can be obtained from Syz( H), where M and N are modules. Redo the computation of the ideal quotient of the modules given in Example 3.6.13 and Exercise 3.6.15 using this technique. 3.8.6. Consider the submodule N of (Q[x, y])3, N = ((xy - Y,x,Y +x), (xy - x,x,xy +x), (x
+ y,y,xy + y)).
Compute generators for ann((Q[x, y]) 3 / N). 3.8.7. Consider the following two submodules of (Q[x, y])3, K, = ((xy,x,y), (-y,x,x), (x 2,x,y)) and
182
CHAPTER 3. MODULES AND GROBNER BASES
K 2 = ((0, y2+y, y2+y), (0, O,x 2-2xY-X+y2+ y ), (0, x-y, x-y), (y, y, y)).
3.8.8.
3.8.9. 3.8.10.
3.8.11.
3.8.12.
a. Prove that N = ((y2 +y,O,x 2 - 2xy+y2 -x+y), (0,xY+X,y2 -xy+ x 2 + y)) is a submodule of K, and K 2 . b. Compute generators for the module (K,jN) n (K 2 /N) ç (iQl[x, y]) 3 /N. Give the analog of the matrix H used to compute generators for (K,jN) n (Kz/N) which can be used to compute generators for (K,jN): (K2/N). Compute generators for the ideal quotient (K,jN): (K2/N), where K" K 2 and N are the modules of Exercise 3.8.7. Find the presentation of Kz/ N, where K 2 and N are as in Exercise 3.8.7. Follow the construction of Theorem 3.8.11 to generalize all the computations performed in this section to the case where A i8 an affine ring. Namely, for A = k[XI"" ,xn]/I, where l = (d" ... ,de), give the analog of the matrix H that i8 used ta compute generators for the intersection of two ideals in A, the intersection of two submodules of Am, the ideal quotient of two submodules of Am, the annihilator of M = Am/N, the intersection of submodules of Am/N, and the ideal quotient of two submodules of Am/N. Let l = (fI,··· , J,) ç k[XI,'" , xnl be an idea!. Show that J + l has an inverse in k[XI"" , x n ]/ J if and only if there is an element in the reduced Gr6bner basis for Syz(1, fI, ... , J" f), with respect to any POT order with the first coordinate largest, of the form (1, h"", , h" hl. Shaw that, in this case, h + J is the inverse of J + J. Reda the computations in Example 2.1.10 and Exercise 2.1.6 using this technique. Ali the computations done in this section can be done using only Theorem 3.8.9. That i8, all the computations done in this section can be viewed as the computations of kernels of certain A-module homomorphisms, where A = k[Xl 1 '" ,x n ]. We illustrate this in the present exercise. Consider the following diagram of free modules,
We define the pullback of this diagram to be the submodule PB of A'œAm defincd as follows
PB = {(g,h)
E
We get a commutative diagram
A' œA m 1 cj;(g) = -y(h)}.
3.9. COMPUTATION OF HOM
183
A' ~~
3.9. Computation of Hom. As before we let A = k[x" ... , x n ]. Also, we consider two A-modules M and N. We are interested in the study of the set of ail A-module homomorphisms between M and N. We deline Hom(M,N) = {
----+
N
1
We deline the usual addition of two elements
(
=
+ ,p(m) for ail m
E M.
It is easily verilied that Hom(M, N) is an abelian group under this addition. We
can also deline multiplication of elements in Hom(M, N) by elements of A: for
a E A and
184
CHAPTER 3. MODULES AND GROBNER BASES
Then, as in linear algebra, cp is given by matrix multiplication by the matrix whose columns are the cp(ei)'s. We will denote this matrix by cp also. Note that cp is a t x s matrix. We further identify the matrix cp in Hom(A', A') with the column vector cp' in A" formed by concatenating the columns of cp in order. This identification gives an explicit isomorphism Hom(AS, At) ~ ASt, sinee it is readily verified that it gives a one to one and onto A-module homomorphism
Hom(M,N)
-->
AM
EXAMPLE 3.9.1. As an example of the above identification, we identify the
matrix
[j: j: j: 1
with the column vectors
(h, 12, 13, J4, J5, J6).
Or, as an-
other example, suppose we have
given bycp(f"h) = ((2x+1)h -x 2 j"xh +j,,(x 2 -x)h). Then it is readily verified that cp is alQ[x]-module homomorphism. Moreover, cp(l, 0) = (2x+1, x, 0) and cp(O, 1) = (_x 2 , 1,x2 - x). Hence the matrix associated with cp is
and so the vector in lQ[x]6 associated with cp is (2x + 1, x, 0, _x 2 , 1, x 2 - x). Now, in order to describe Hom(M, N) explicitly, we need to assume that we are given M and N explicitly. That is, we assume that we are given presentations of M and N, say
M"'A'IL
and
N"'A'IK.
Then the idea is to compute Hom(A' 1L, At1K) by adapting the ideas in the computation of Hom(A', At) above. To do this, suppose that we have a presentation of the A-module M, M "" A'1L. Then we have a sequence of A-module homomorphisms (3.9.1) where the map i: L ~ AS is the inclusion map, and 7r: AS --+ M is the map which sends the standard basis of A' onto the generating set of M corresponding to the standard basis in the isomorphism M "" AS IL. Sequence (3.9.1) is called a short exact sequence. In general a sequence of A-modules and A-module homomorphisms
BRecall our space saving notEttion for column vectors!
3.9. COMPUTATION OF HOM
185
is called exact if im(Œi+l) = ker(Œi) for each i. It is easy to check that Sequence (3.9.1) is exact, which, in this case, means that i is one to one, ker(1f) = im(i), and 1r is anto. Now we lind a presentation of L, say L~AS,/L"
and thus we have another short exact sequence 0----4 Ll ~ ASl ~ L
-------7
O.
This leads ta an exact sequence
where r = i a 1f1 (the exactness is easily checked). We will only be interested in the exact sequence ASl -S AS ~ M -------7 O. We use this last sequence, and a similar one for N to compute Hom(M, N). In order to do this we review the elementary homological properties of Hom. What we need is summarized in Lemma 3.9.2. We refer the reader to [Hun]. We assume we are given A-modules Ml, M2, and N and an A-module homomorphism
Hom(M2 ,N)
,p
~ Hom(M"N) and Hom(N,M,) ,p 0
Hom(N,M2 )
>--->
It is easy to verify that 0
0
(3.9.2)
Let P be anothec A-module. Then the two sequences (3.9.3)
and (3.9.4)
Hom(P, M,) ..!2., Hom(P, M 2 ) ..!2., Hom(P, M 3 )
---->
0
are exact. In parlicular,
(3.9.5)
and
(3.9.6)
Hom(P, M 3 )
~
Hom(P, M 2 )/ker(,po) = Hom(P, M 2 )/ im(<po).
CHAPTER 3. MODULES AND GROBNER BASES
186
We now return to the computation of Hom(M, N) for two A-modules M and N. From now on, ta ease notation, we will identify M and N with their respective presentations, that is, we will assume that M = A' j Land N = At jK. (However, if M and N are not given initially by presentations, we will have to take into
account the isomorphisms between M (resp. N) and A'jL (resp. AtjK).) So, as noted above, we have the two exact sequences (3.9.7)
(3.9.8) As before, the map r (resp. Ll) is given by a matrix which we will also denote r (resp. Ll). The colmnns of r (resp. Ll) are easily seen to be the vectors which generate L (resp. K), since the image of r (resp. Ll) is L (resp. K). We have that r is an s x 81 matrix and D.. is a t x tl matrix. Now in Sequence (3.9.3) we let P = N and replace Sequence (3.9.2) with Sequence (3.9.7) to obtain the exact sequence
where
--"-+ Hom(A', N)
0...--, Hom(M, N)
(3.9.9)
Œ =01f
and '1
..2.., Hom(A", N),
=or. Thus from Equation (3.9.5) we see that Hom(M, N)
(3.9.10)
3;'
kerh).
Thus to get a presentation of Hom(M, N) it suffices ta obtain a presentation of kerh). For this we first compute presentations of Hom(A', N) and Hom(A", N), then compute the map ')'* corresponding ta 'Y between the two presentations, and then we can compute ker('f*) using Theorem 3.8.9. Now in Sequence (3.9.4) we let P = AS and P = A" respectively, replacing Sequence (3.9.2) with Sequence (3.9.8) to obtain the exact sequences (3.9.11)
Hom(AS,A") ~Hom(A',At) --"-+Hom(A',N) ---+0,
where D = D.. o and p,
(3.9.12) where 8' (3.9.13)
= 7r~
Hom(A S" At,)
for P 8'
---+
= AS
and
Hom(A S" At)
J.!-' ---+
Hom(A", N)
---+
0,
= Llo and p.' = 1f~ for P = A". These give, using (3.9.6) Hom(A',N) Hom(A",N)
~ ~
Hom(A',A t )jim(8) Hom(A", At)j im(8').
and
Since, Hom(AS, At) 3;' Ast, it now suffices to describe im(8) as a submodule of Ast in arder ta obtain a presentation of Hom(A',N). Of course, a similar
3.9. COMPUTATION OF HOM
187
statement halds far Ham(A",N). The map 6: Ham(AS, At,) --> Ham(AS,At) corresponds to a map 8* which is given by an st x sh matrix S. That is 8'
-->
Ast
e---;
8/.
In arder ta facilitate the statements of the following results, we will adopt the notation for making new matrices out of old Olles presented fight after Example 3.8.2. LEMMA 3.9.3.
The matrix 8 above is the matrix given by
~ EIl ... EIl ~. '-----v------' scopies
PROOF. A basis for Hom(AS,At,) is the set oft, x s matrices ,pij whose entries are aU zero except for the ij entry which is equal to one. These matrices correspond to the colunm vectors 'l/Jij E Astt obtained, as before, by concatenating the columIlB of the matrix ,pij. The vectors ,plj form the standard basis of As t , ~ Hom(AS, At,). The columIlB of the matrix 8 are then the images of the vectors 1f;ij under 8*. That is, the columns of S are the vectors obtained from the matrices 8(,pij) E Hom(A', At) by concatenating the columns of 8( ,pij). We note that by concatenating the colurrllls the way we do, we have, in effect ordered the ,pI/s (or equivalently the ,pi;'S) as follows:
Now for each i and j, 8(,pij) = ~ 0 ,pij. Because of the identification of the homomorphisms .6. and Wij with the corresponding matrices ~ and tPij respectively, we have 8(,pi,) = ~,pij, where the right-hand side expression is a matrix multiplication. Now the t x s matrix .6.'lj;ij is the matrix whose j-th column is the i-th calumn of ~ with all other entries equal to zero. Therefore, aiter concatenation of columns, the matrices f1Wu) f1W2b ... ) f1WtIl correspond to the ftrst t, columns of ~ EIl··· EIl ~. Similarly the matrices ~,p12, ~,p22, ... ,~,pt,2 '-----v------'
scopies
correspond to calumns t,
+ 1 through 2t,
of
~ EIl ... EIl ~. '-----v------'
This analysis can be
scopies
continued and we see that 8 =
~ EIl ... EIl ~ . '-----v------' 8
0
copies
Now the image of 8 is the column space of 8; that is, the submodule of Ast generated by the COIUIDIlB of 8 (in general, we denote the module generated by the COIUIDIlB of a matrix T by (T)). Thus we have (3.9.14)
Ham(AS, N) ~ A st /(~ EIl . ~. EIl ~). scopies
CHAPTER 3. MODULES AND GROBNER BASES
188
Similarly
Hom(A", N) ~ A"t 1 (~EIl .:. EIl Do).
(3.9.15)
81
copies
Therefore we have a presentation of Hom(A', N) and ofHom(A", N), as desired. We now return ta the map ~
Hom(A',N) ;
Hom(A",N) ; 0 r.
f--+
1t corresponds ta a map '"'1*
A,t1(Do EIl .:. EIl Do)
A"t I(Do EIl':' EIl Do).
scopies
LEMMA
'Y' (;'
SI
copies
3.9.4. There is an S1 t x st matrix T which defines 'Y' in the sense that T;' + (Do EIl . :. EIl Do). Moreover, T is the transpose of
+ (~ EIl .: . EIl Do)) = s copies
SI
copies
the tensor praduct r ® It. The matrix r 0 It is defined ta be the st x Slt matrix obtained by replacing each entry 'Yij of r by the square matrix 'YijI" where It is the t x t identity matrix. PROOF. We lirst deline the map
Hom(A', At) ;
-2..
Hom(A", At) ; 0 r.
f--+
Using the isomorphisms A,t ~ Hom(A',At) and A"t ~ Hom(A",A t ), we see that '1 corresponds to a map 1": As t --.. AStt. Let 'lj;ij and 7./Jij be defÎned as in Lemma 3.9.3. Then 7* is given by a matrix T whose columns are the vectors "j*('ljJij ) E AStt. These vectors are obtained by concatenating the columns of "I(-I/Jij ). Now from Equation (3.9.13) we have Hom(A', N) ~ Hom(A', At)1 im(6) and Hom(A",N) ~ Hom(A", At)1 im(6'i. We next show that the map
T Hom(A',N) ---->Hom(A",N) is induced by the map 7; that is, 'Y(; + im(6)) = 7(;) + im(6'). To do this, it suffices to show that "I(im(6)) ç im(6'). So let 1/; E Hom(A',A"); then
7(6(1/;)) = "I(Do 0 1/;) = (Do 0 1/;) 0 r = 6'(1/; 0 r) E im(6'). Thus 'Y' is induced by "l'and
,*
scopies
S1
copies
S1
copies
Hence is given by the matrix T. We now wish ta describe T. As mentioned above, the columns ofT are the vectors 7'(1/;7,) E A"t obtained from the matrices "I( 1/;ij) = 1/;ij 0 r by concatenating columns. Again, because of
3.9. COMPUTATION OF HOM
189
the identification of the maps ,pij and r with the matrices ,pij and r respectively, wehave 'Y(,pij) = ,pijr, where the right~hand side expression is a matrix multi~ plication. For each i and j, ,pijr is the t x SI matrix whose i~th row is the j~th row of r. 80 if we order the ,pi/S as before, the matrices ,p11r, ,p2Ir, ... ,,pt1r have their respective first, second, etc. row equal ta the first row of r. The corresponding vectors in ASlt obtained by concatenation of columns are then the columns of the transpose of the matrix [ 1'l1IthI2Itl" 'l1'b l t
1'
where [1'11 1'12 1'15 1is the first row of rand It is the t x t identity matrix. Similarly, the matrices 't/J12r, '1,b22f, ... ,Wt2f correspond ta vectors which form the columns of [ 1'2IIth22Itl" '11'2s I t
1'
where [')'21 122 l'2s ] is the second row of r. We can continue this analysis and we see that T is obtained as indicated in the statement of the lemma. 0
Recal! that Hom(M, N) is isomorphic to ker{')') and hence to ker{')"). 80 we now compute the kernel of 1'*. We first compute the kernel of the hOlilomorphism
AS' -->A"t/(~E9':'E9~) 81
copies
given by
1>'
>---->
T1>' + (~E9 ... E9 ~).
Let U be the matrix whose colunms generate this kernel. Thns the columns of U are given by the first st coordinates of the generators of the syzygy module of the columns of T and those of ~ E9 ... E9 ~ (see Theorem 3.8.9). Therefore (3.9.16)
Hom(M, N) "" (U)/(~ E9':' E9 ~). scopies
A presentation of Hom(M, N) can then be computed again using the method given in Theorem 3.8.9. To summarize we now state THEOREM 3.9.5. Let M and N be A~modules given by explicit presentations M"" A' / Land N "" At / K. We compute Hom(M, N) as follows. (i) Use the generators of Land K as columns to define the matrices rand ~ respectively; (ii) Let T = '(r@It); (iii) Compute the matrix U defined by the kernel of the composite map A st ~
ASIt ------+ ASl
t
/(~ EB':' EB il), SI
copies
190
CHAPTER 3. MODULES AND GROBNER BASES
using Theorem 3.8.9. This gives Equation (3.9.16); (iv) Compute a presentation of Hom(M, N) using Theorem 3.8.9.
Ta illustrate the Theorem, we now give an example. EXAMPLE 3.9.6. We let A =
and let N = (gl,g2,g3) ÇA', where gl
= [
~~
~: ],
], g, = [
Z g3 = [ x / : yz' ] .
We first need to compute presentations of M and N. To do this we use Theorem 3.8.9 .. Let L = SYZ(fI' f" f3' f 4 ) and compute L to be
L= ((-l,x+z,O,-l),(-z,z',-l,x-z)) ÇA 4 . We also let K = Syz(gl,g"g3) and compute K to be
K= ((yx,-yx+yz-z',-y+z)) ÇA"So we have the presentations
M'l'!A 4 jL
N'l'!A 3 jK.
and
We let
r=
[ -,
x;z -1
z' -1 x-z
"1
and
Ll
= [ -yx :~z -
z' ] .
-y+z
Thus the matrix T is
-1
0
0 0
-1
-z 0 0
0 0
-z 0
0 0
-1 0 0 -z
x+z
0
0 0
x+z
z' 0 0
0 0
0 0
x+z
z'
0 0
0
z'
0 0 0
-1 0 0
0 0 0 0
-1
0 0 0 0 0
0
-1
-1
0
0 0
-1
x-z 0 0
0 0
0 0
-1
x-z
0 0
0
x-z
To compute the kernel of the map Al' ----> AB j(Ll EIl Ll) given by T composed with the projection A6 ----> AB j (Ll EIl Ll), we compute the syzygy module of the
3.9. COMPUTATION OF HOM
191
columns of T and of
~yx
yx +yz
o o o
~ z2
~y+z
o o o
yx ~yx+yz ~ z2 ~y+z
The first 12 entries of the generators of that syzygy module are the columns of the matrix U, where
u=
~1
0
0 0 0 0 0 x 0 0 1 0 0
~1
0 0 0 0 0 x 0 0 1 0
0 0 ~1
0 0 0 0 0 X
0 0 1
x 0 0 1 0 0 0 0 0 z 0 0
0 x 0 0 1 0 0 0 0 0 z 0
0 0 x 0 0 1 0 0 0 0 0 z
y
0 yz -
Z2
~y+z ~y
Y 0 0 0 0
~y
0 0 0 0 0 yz - z2 ~y+z
~yz
~y
yz 0
y 0
From Equation (3.9.16) we have
Hom(M,N) '-" Hom(A 4 /L,A 3 /K) '-" (U)/(il. EIl il. EIl il. EIl il.). Finally to compute a presentation of (U) / (il.EIlil.EIlil.EIlil.) we compute the syzygy module of the eight columns of U and the four columns of il. EIl il. EIl il. EIl il. =
yx ~yx
+ yz ~ Z2
~y+z
0 0 0 0 0 0 0 0 0
0 0 0 yx ~yx + yz ~
z2
~y+z
0 0 0 0 0 0
~yx
0 0 0 0 0 0 yx + yz
~ Z2
~y+z
0 0 0
0 0 0 0 0 0 0 0 0 yx ~yx+yz ~ z2 ~y+z
CHAPTER 3. MODULES AND GROBNER BASES
192
There are 4 generators for this syzygy module and we need the first 8 coordinates of each of them, which give
y -y
0 0 0
Y -y
tl =
, t2 =
0 1 0
0 0 0 0 0 1
0
0 0 0 0
, t3 =
yz - z2
-y+z
-x
0 0 0 0
0
-x
yz - z2
, t4 =
-y+z
Therefore
Hom(M,N) "" Hom(A4/L,A3/K) "" A8/(tl,t2,t3,t4)' We now show how to generate A-module homomorphisms from A 4/ L to A3 / K, and hence from M to N. For a = (al,'" ,as) E AB, the product Ua is a vector in A 12 . Using this vector we construct a 3 x 4 matrix whose colmnns are the 4 cOIlBecutive 3-tuples of the vector Ua. This matrix defines a homomorphism from A 4/ L to A3 / K. The fact that this homomorphism is well-defined follows from the construction of U. Let ei,i '" 1, ... ,8 be the standard basis of A8 Then Hom(A4/L,A3/K) is generated by the matrices obtained froID U ei, i = 1, ... ,8, that is, by the matrices obtained from the columns of U. Therefore Hom(A4 / L, A3 / K) is generated by the cosets of the eight matrices
[
~1
[~
Xl] o ,
0 0 0
0 0
1 0 0 0 0 0
~
0
~ ],
[
~1
0 0 0
~
o 1
o
o 0] xl,
o 0 0 0
0
n,
[
~1 ~
-y 0 0 0 -yz ] [ y yz z2 0 0 Y y; ,
0 0
o
0 0
x
0 0
0 0 1 0
n
n
-y] Y . 0
It can easily be verified that, for i = 1, ... ,8, we have
I: ai
where ai E A. We recall that we identify the homomorphism
3.9. COMPUTATION OF HOM
193
To obtain a description of Hom(M, N) from Hom(A4/L, A3 / K) we make use of the isomorphisms ~
M f , f2 f3 f4
~
>---+ >---+ >---+ c-->
A 4/L e, +L e2 +L e3 +L e4+ L
A3/K e~ +K
~
~
N
>---+
g,
e~+K
>---+
e;+K
c-->
g2 g3
where the vectors el, e2, e3, and e4 are the standard basis vectors for A 4 , and the vectors el' €21 and e~ are the standard basis vectors for A 3 . Let tP be any homomorphism in Hom(A 4/ L, A3 / K), say 1> = I:~~l ai1>i. Then we deline a homomorphism 1>' in Hom(M, N) as follows. First, it is enough to determine 1>'(fi) for i = 1,2,3,4. But 1>'(fi) is just the image of 1>(ei + L) under the isomorphism A 3 / K '" N. In particular, 1>' (f 1) is the image under the isollorphism A 3 / K '" N of
1>(el + L) i=l
a, [
~1
]
+ a2
[
~1
~
]
+ a6
[
~
+a5 [
[
]
]
+ a3 + a7
[
[
~J + ~
~;+z;
:a:':(:~"-+):î~7 - yas ]
-a2 + -a3
]
a4 [
]
+ as [
~y
]
+K
+K
+ xa6 + (-y + z)a7
(-al +xa4+yas)(e; +K)
+( -a2 + xas + (yz - z2)a7 - ya8)(e~ + K) +(-a3 +Xa6 + (-y + z)a7)(e~ + K). That is
(-a, + xa4 + yaS)g, + (-a2 + xas + (yz +( -a3 + xa6 + (-y + z)a7)g3·
z2)a7 - yaS)g2
The images of f2' Is, and f4 can be computed in a similar way. In particular Hom(M, N) is generated by the homomorphisms 1>; correspond-
194
CHAPTER 3. MODULES AND GROBNER BASES
ing to the homomorphisms 1>i which generate Hom(A 4IL, A3IK):
1>'l '.
1>'4'.
M f, f,
--+
h
--+
f4
--+
M f, f, f3 f4
1>'7'.
--+
--+
--+
--+ --+ --+
M f, f, f3 f4
Exercises 3.9.1. Let A
N -g, 0 xg , g,
--+
N xg, g, 0 zg,
--+
--+
--+ --+
--+
1>',..
1>'5'.
M f, f2 f3 f4 M f, f, f3 f4
N (yz - z2)g, +( -y + z)g3 -yg, + yg, 0 -yzgj + yzg,
--+
--+ --+ --+ --+
--+
--+ --+ --+ --+
1>'8'.
N -g, 0 xg, g,
1>'3 ..
N xg, g, 0 zg,
1>'6'.
M fI f2 f3
--+
f4
--+
--+
--+ --+
M f, f,
--+
h
--+
f4
--+
M f, f,
--+
h
--+
f4
--+
--+
--+
--+
--+
N -g3 0 xg3 g3 N xg 3 g3 0 zg3
N yg, - yg, 0 (yz - z2)g2 +(-y + z)g3 -yg, + yg,.
= Qlx, y, z], M = A 31L, and N = A 41K, where L= ((x,x+y,z),(x,x',x-z),(-xz,x-z,y-z)) and
K = ((x+y,x', xy+yz, y-z), (xy, x-z, y-z, xz 2), (xy, y-z, x-z, yz2)). Compute generators for, and a presentation of Hom(M, N). 3.9.2. Consider the following modules
M= ((O,y,x),(y,O,y),(y,y,x),(x,-y,xy),(O,x,x)) ç (Qlx,y])3 and N = ((xy,y,y,y), (x',x,x,x)) ç (Qlx,y])4 Compute generators for, and a presentation of Hom(M, N). 3.10. Free Resolutions. In this section we first show how to compute an explicit free resolution of a finitely generated A-module, where A = klxl, ... , Xn]. We will then use this free resolution to prove a theorem of commutative algebra concerning A. Finally, the free resolution together with the results from the last section will be used to give an outline of the method for the computation of the Ext functor. Let M be a finitely generated A-module. We saw in Section 3.1 that M has a presentation, that is, M ==: A SO / Mo for sorne So and sorne submodule Mo of A SO • We also have the following short exact sequence
(3.10.1)
3.10. FREE RESOLUTIONS
195
ta indicate that A" IMo is a presentation of M. The map io: Mo --+ A" is the inclusion map, and KO: A 050 ---7 M is the map which sends the standard basis of A" onto the generating set of M corresponding to the standard basis in the isomorphism M ~ A 'olMo. Now we find a presentation of Mo, say Mo~AS'IM"
and thus we have another short exact sequence
O ---7 M 1
i, -----+
ASl
1f1 -----+
M.0
---7
0.
This leads ta an exact sequence
O -----+ M 1
i, -----+
ASl -----+ .pl ASO
!/Jo
---7
M
0
---7,
where
o
0
\/ M,
/\
A"
\/ /\
'M
, 0
\/
M2
0
• ASQ
A"
Mo
0
FIGURE
/\
0
0
3.3. Presentations of M, Mo, and M,
Thus we have obtained an exact sequence
DEFINITION 3.10.1. The exact sequence obtained in (3.10.2) is called a free resolution of the module M.
196
CHAPTER 3. MODULES AND GROBNER BASES
In module theory, free modules are the analog of vector spaces. A free resolution of a module M is used ta "measure" how far away from a free module M is and ta give sorne useful numerical invariants for the module M and the ring A. If in Sequence (3.10.2) it turns out that there exists an Csuch that A'j = 0 for all j 2: C, wesay that the free resolution has finite length and that its length is <::: C. We will show below that finitely generated A-modules have free resolutions of finite length. In fact, for each A-module M, there exists a finite resolution of length at most n, the number of variables in A. We say that A has global dimension less than or equal ta n. We will now nse the theory we have developed sa far ta prove this result (Theorem 3.10.4). First we have a technicallemma. Let {gl' ... ,gt} c;: Am be a Grübner basis for M = (JI'··· ,J,) with respect ta sorne term arder on the monomials of Am. Let {Sij Il <::: i < j <::: t} ç At be as in Theorem 3.7.3. Recall that it is a Grübner basis for Syz(gl'··· ,gt) with respect ta the term arder < on monomials of At induced by [ gl gt by Theorem 3.7.13. We note that we are using two different term orders: one in Am which is used ta compute the Grübner basis {gl' ... ,gt} for M, and the other is defined in At with respect ta whlch the set {Sij Il <::: i < j <::: t} is a Grübner basis·for SYZ(gl' ... ,gt). We will use the same notation for bath orders, that is, we will nse < and we will write lm(gi) and !m( Sij). The context should indicate which module we are working in and which order we are using. In fact, we need ta be careful which arder we put on At Notice that the arder on At changes if we reorder the vectors Y1' ... ,gt' AIso, Binee we will be using the standard bases for bath At and Am, we will denote by el, ... ,et the standard basis for At, and by e~, ... ,e~ the standard basis for Am. We will assume that they are ordered as follows: the gi'S are arranged in such a way that if v < Ji. and lm(gJ = Xvej, Im(g,J = X".ej for sorne power products Xv, X". E A and sorne j E {l, ... ,m} (that is, lm(gv) and lm(g".) have the same non-zero coordinate) then Xv > X". with respect ta the lex ordering with Xl > X2 > ... > Xn (regardless of the arder that was used ta compute the Grübner basis for M).
1'
LEMMA 3.10.2. Leti E {l, ... ,n-l}. Ifthevariablesxl, ... ,Xi do notappear in lm(gv), for some v E {l, ... ,t}, then the variables Xl, ... ,Xi, Xi+l do not appear in lm(sv".), for every Ji. such that v < Ji. <::: t. Sa, if Xl, ... ,Xi do not appear in any Im(9v) for v = 1, ... ,t, then Xll'" ,Xi, Xi+l do not appear in any lm(sv".) for 1 <::: v < p <::: t. In particular, Xl does not appear in any lm(sj".), for 1 <::: j < Ji. <::: t. PROOF. Let v, Ji. E {l, ... ,t}, v < Ji.. If lm(gv) and lm(g".) do not involve the same coordinate, then X vp. = 0, and hence svj.t = 0, so no variables appear in sv". at all. Otherwise, let Xv = Im(gvJ = Xvej and X". = !m(g) = X".ej for sorne j E {l, ... ,m}. Then, by Theorem 3.7.13,
3.10. FREE RESOLUTIONS
197
where XV{J- i8, as usual, lcm(Xp,l Xv). First, let us assume that Xl, ... ,Xi do not appear in Xv. Since Xv > X~ with respect to the lex term ordering with Xl > X2 > ... > Xn, then Xl, ... ,Xi do not appear in Xp, and the power of xH1 in Xv i8 at least as large as the power of Xi+l in X w Therefore the power of Xi+l in XVIl- i8 the same as the power of Xi+l in XV) and hence the variables
X
do nat appear in ---..!::..!!:.. and 80 do not appear in Im(svjt). Thus Xv ifx1, ... ,Xi do nat appear in anylm(gv)' v= 1, ... ,t, thenxl, ... ,Xi,Xi+l do not appear in any lm(sv~), for 1 :S v < '" :S t. The last statement is proved in exactly the same way. 0
Xl,· .. ,Xi, Xi+l
3.10.3. As in Example 3.7.5 we let M be the submodule of A 3 = (
Grôbner basis with respect to the TOP term ordering in A3 with e, and with lex in A and y > x, where y, = (x 3 + X, x 2 - X, -x), Y2 = (x, y
Y5
= (x 2, x 3, _x 3),
X, 0),
2x, _x 2 + 2x, x 5 - x 4 - 3x 3 + x 2 + 2x).
= (x 2 -
Y6
+ x2 -
> e2 > e3
We re-order the y's as required in Lenmm 3.10.2 with y > x, and we get
y,
Y5
= (y+x 2,0,0),
= (x 2,x,y),
Y6
=
Y2
= (x 3 +x,x 2 -x,-x),
(x 2 - 2x,-x 2 +2x,x 5 _x 4 - 3x 3 +x2
+ 2x).
The syzygies are now 812
834 856
(x 3
+ X, -y -
x2 , x2
-
X,
1, -X, 0)
+ 2, _x 3 , -1) (x 2 - 2x, _x 4 + x 3 + 4x 2 - 2x - 4, _x 2 + 2x, _x 2 + X + 2, x 5 - x 4 - 3x3 + x 2 + 2x, -y - 2). 2
(X ,X
2
-
X -
2,x , -y - x 2 + X 3
Using the order induced by the y's we have Im(s'2) Im(s34) Im(s56) THEO REM 3.10.4. Every finitely generated A -module has a free resolution of length less than or equal ta n, where n is the number of variables in A.
198
CHAPTER 3. MODULES AND GROBNER BASES
PROOF. Let M be any finitely generated A-module. Then M has a presentation M ~ AS, IMo, where Mo = (g" ... ,g,) <:: AS,. If Mo = {O}, then we are done; otherwise we may assume that G = {gI,'" ,gt} fOrIns a Gr6bner basis for Mo with respect to sorne term order in A 80. Assume also that the g/8 are arranged as in Lemma 3.10.2. Let Xl, ... ,Xi be the variables that do not appear in any of the lm(gv)'s (i = 0 is possible). We will prove that M has a free resolution of length less than or equal to n - i. CASE 1. n = i. Then none of the variables Xl, ... ,X n appear in any of the lm(gj)'s. We need ta show that M ~ AS, IMo is isomorphic to a free module. Since the leading monomials of the gi '8 are of the farm aej for sorne a E k, and j E {l, ... , so}, we see that the module Lt(g" ... ,g,) is the free submodule of AS, generated by the ej's that appear in sorne lm(gi). Let M' be the free submodule of AS, generated by the other ej's. We will show that M ~ M'. Consider the map 1f:
M' f
--->
AsoIMo~M
e--;
f+Mo·
It is easy to see that 1f is an A-module homomorphism. Also, if f E M'and f E Mo, then, by Theorem 3.5.14, lm(f) is divisible by some lm(gi) for some i E {l, ... , t}. But this is impossible unless f = 0, since M' is generated by those ej's which do not appear as leading monomials of any gi- Therefore 7f is one ta one. Finally, by Proposition 3.6.3, we see that for ail f E AS" f + Mo = Nc(f) + Mo, where Nc(f) is the remainder of f under division by G. Moreover Nc(f) E M', sinee the monomials of Nc(f) are exactly those monomials which are not divisible by any e J which appear in some lm(gi). Therefore 1f is onto, and hence 'if iB an isomorphism. CASE 2. n - i > O. We construct a free resolution
recursively as follows. At the jth step, we choose a monomial order on the free module A'j and find a Grübner basis G for ker(1)j). We arrange the elements of G according to Lemma 3.10.2. We then choose A'H' to be a free module whose basis maps onto Gand we let 1>j+1 be the projection map. Note that ker( 1>0) = Mo = (g" ... ,g,) and ker(1),) = 8yz(g" ... ,g,). If X" ... ,Xi do not appear in the leading monomials of the g~,'S, then, by Lemma 3.10.2, Xll'" ) Xi) Xi+l do not appear in the leading monomials of the elements of the Grübner basis for ker(1),). If Xl appears in lm(gj) for some j, then, by Lemma 3.10.2, Xl does not appear in the leading monomials of the elements of the Grübner basis for ker(1),). 80 ifwe apply Lemma 3.10.2 recursively, we see that no variables appear in the leading monomials of the elements of the Grübner basis for ker(1)n_i). By the case n - i = O~ we see that
3.10. FREE RESOLUTIONS
is free. Therefore if we replace
resolution.
A'n~'
by
199
AS"~'/ker(4)n_i),
we get the desired
D
EXAMPLE 3.10.5. We continue Exemple 3.10.3. Since the leading term of and 856 involve different basis vectors, we see that SYZ(S121 834, 856) = (0,0,0). Therefore A3 "" (S12, S34, S56) and so wc have the following free resolution for M = (91,92,93,94,9s,96) = (f"!2'!3'!4) ç; A3: 812, 834,
where
The kernel ofthis map is SYZ(91,92,93,94,9s,96)' Also,
4>,:
A3
--->
A6
C2 ,C3 )
--->
C,S12+f2S34+C3SS6'
(C "
As we saw above, the kernel of 4>, is (0,0,0). To conclude this section, we use the techniques developed 80 far ta outline the computation of Ext n (M, N). We will assume that the reader is familiar with this concept and we will only give an indication on how ta go about the computation. We begin with a free resolution for ao A-module M • • • fi+2 -----.--t
ASi+l
fi+l ---------+
ASi ............... ri ASi-l
r ---+ . . . ---+ 2
ASl
fI ---+
AS'
----Jo
M'
---+
° 1
which wecompute as above. For the A-module N we form the usual complex which at the ith position looks like ... --->
Hom(As,+" N) ~ Hom(A", N) ..:!'-., Hom(AS,~" N)
---> ....
As in Lemma 3.9.2 aod Lemma 3.9.3 we cao compute presentations of these Hom modules aod using Lemma 3.9.4 we cao compute the maps between thern yielding another complex which at the ith spot looks like
We can use Theorem 3.8.9 to compute ker(Ti ). Also, im(1i+1) is obtained using the columns of the matrix that determines T i + 1. Thus we can compute
Exti(M, N) "" ker(Ti )/ im(Ti+1), again using Theorem 3.8.9. ExercÏses 3.10.1. Compute a free resolution for the module M = ((x, y, z), (y, x, z), (y, z, x),
(x, z, y), (y, x - z, z), (y, z, x - z)) ç; (lQI[x, y, z])3.
Chapter 4. Grobner Bases over Rings
In the previous three chapters we considered the theory of Grobner bases in the ring A = k[Xl,'" ,X n ], where k is a field. We are now going to generalize the theory to the case where A = R[Xl1'" ,xn ] for a Noetherian commutative ring R. Sometimes we will need to be more specifie and require R ta be an Integral domain, a unique factorization domain (UPD), or a principal ideal domain (PID). We will give many of the same type of applications we gave in the previous chapters in this more general context. Moreover, the theory of Grobner bases over rings will allow us ta use inductive techniques on the number of variables; for example, for a field k, k[x, y] can be viewed as a polynomial ring in one variable y over the ring k[x] (i.e. k[x, y] = (k[x])[y]). We give an example of this technique in Section 4.4 where it lS used ta test whether Ideals are prime. The theory of Grobner bases over rings has complications that did not appear in the theory over fields. Indeed, many of the results will not hold in this generality. Moreover, many of the basic techniques will become more complicated because we now have ta deal with Ideals of coefficients in the ring R. In Section 4.1 we give the basic definitions and lay the foundations for the theory of Gr6bner bases over rings. An algorithm for constructing Grobner bases will be presented in Section 4.2. We will have to assume certain computability conditions on R in order for this algorithm to be effective. We use a method presented by Moller [MoSS] to compute the appropriate syzygies needed for this algorithm. We then give examples of computing Grobner bases over the rings Z, Z20 and Z[ Al. In Section 4.3 we give the usual applications including elimination, computing syzygy modules, and a result of Zacharias [Za] which gives a method for computing a complete set of coset representatives of A modulo an ideal. We then go on, in Section 4.4, to discuss questions related to rings of quotients and use this material to give an algorithm to determine whether an ideal in A is a prime ideal. Next, in Section 4.5 we specialize the ring R to be a PID and show that in this case we may again use the notion of S-polynomials to compute Gr6bner bases. We conclude that section by giving a structure theorem of Lazard [LazS5] for Grobner bases in polynomial rings in one variable over a PID. In the last section, we use Lazard's result to compute the primary
201
202
CHAPTER 4. GROBNER BASES OVER RINGS
decomposition of ideals in Bueh rings. 4.1. Basic Definitions. In this section we develop the theory of Grübner bases for polynomials with coefficients in a Noetherian Ting R. As we did for modules in Section 3.5, we will mimic the constructions of Chapter 1 as much as we cano We will assume that we have a term order < on the power products in the variables Xl) ... ,X n - With respect ta this term order, we have the lisuai concepts of leading power product, leading term, and leading coefficient of a polynomial in A = R[xl, ... ,xn ]. Next, w€ need the concept of reduction, see Section 1.5 and 3.5. In those sections we required the notion of divisibility of leading terms. Actually, there we were nat concerned about whether we were dividing leading terms or dividing leading power products since one was a non-zero element of the field k times the other and this had no effect on the divisibility. When the coefficients are not elements of a field, this becomes a very important issue. It turns out that in order to have a reasonable theory of reduction and Grobner bases using the same ideas of reduction as in the field case, we need R to be a PID. We will explore this in Section 4.5. Since we want to define reduction and Gr6bner bases in the context of rings more general than PID's, we must modify our previous concept of reduction. The correct way to do this is to work with syzygies in the ring R. After we define this new concept of reduction, we will be able to pattern our results again on what we did in Chapter 1. There is one major exception and that is in the definition of Grübner basis itself. In the case of ideals of polynomials with coefficients in a field, and also in the case of modules, the definition again involved the concept of dividing one leading term by another. So we need to change our very definition of Grübner basis. It turns out that many, but not al!, of the equivalent conditions for a Gr6bner basis over a field given in Theorem 1.9.1 will work for us in our new situation. So to reiterate the setup, we assume that we are given a Noetherian commutative ring R and we let A = Rlxl, ... ,xn ]. We then have from the Hilbert Basis Theorem 1.1.3, that A is also a Noetherian ring. We assume that we have a term order < on the power products, 1rn , in the variables Xl, •.. ,Xn . Then frOID Theorem 1.4.6, we know that < is a well-ordering on 1rn (the point here is that this is a property of the power products 1r n , not of the polynomial ring klxl, ... ,xn ], for a field k, even though the proof of Theorem 1.4.6 used the Hilbert Basis Theorem in klxl, ... ,xn ]). For J E A, Ji 0, we write J = alXI + ... + a,X" where al, ... ,as E R, ai # 0 and Xl, ... 1 X s are power products in Xl, ... ,Xn with Xl > X 2 > ... > X,. We define as before, IpU) = Xl' leU) = al and ItU) = alXI (called the leading power product, leading coefficient and leading term of J, respectively). We now turn our attention to the concept of reduction. Recall, that in the case where R = k is a field, given three polynomials J, g, h in k[XI' . '. ,X n ], with g i 0, we say that J ~ h, if and only if lp(g) divides a term X that appears
4.1. BASIC DEFINITIONS
203
in 1 and h = 1 - ltt) g. The case where R is not a field differs in two ways. The first difference is that in the case of rings, it is convenient ta ouly reduce the leading term of 1. It is readily seen that all the results on Grübner bases in Chapter 1, excluding the oues making explicit use of reduced polynomials, are valid with this restricted concept of reduction. In the case of rings, however) the results involving reduced polynomials are no longer valid and 80 reducing tenus that are not leading terms is unnecessary (see Exercise 4.1.6). With this in mind, we could fephrase our definition of reduction over a field k by saying that 1 ~ h, providedthat lp(g) divides lp(j) and h = 1 - ::gjg (note that lt(j) = ltn:i;jg) and sa tbe leading term of 1 has been canceled). We see that this notion requires that we divide by lt(g) = lc(g) lp(g). The problem with this is that over a ring R we may not be able ta divide by the ring element lc(g). We could build this into our definition just as we must require that lp(g) divide lp(j), but this turns out ta be tao restrictive when we are attempting ta divide by more than one polynomial, as, of course, the theory requires. The key idea in resolving this diffieulty is to use a linear eombination of the the leading terms of the divisors, whose leading power products divide lp(j), ta eliminate lt(j). Sa we nQW assume that we are considering polynomials f and fI fs in A = R[XI, ... ,Xn ] with fJ, ... ,1, # 0, and we want ta divide 1 by fJ, ... ,j,. That is, we want to caueel the leading term of 1 using the leading terms of fJ, ... ,Is· We should be allowed ta use any li which has the property that lp(j;) divides lp(j) aud sa what we desire is that lt(j) be a linear combination of these lt(j;). We thus arrive at 1 •••
1
DEFINITION 4.1.1. Given two polynomials 1 and h and a set of non-zero polynomials F = {/J, ... , l,} in A, we say that 1 reduces ta h modulo F in one step, denoted F 1 ----> h,
il and only il h=
1 - (CIXt!1 + ... + csXsIs )
lor CI, ... , c, E R and power products XI,··· , X, where lp(f) = Xi lp(/t) lor aU i Buch that ct # 0 and lt(f) = CjXIlt(/J) + ... + esX, lt(f,). EXAMPLE 4.1.2. Let R = Z and let 1 = xy -1, /J = 7x + 3, h = llx3 2y 2 + 1 and h = 3y - 5. We will use the deglex ordering with x < y. Sa with F = {/J,hh}, we see that 1 ~ h, where h = -3y -lOx - 1 since h = 1 - (y/J - 2xh) and xy = lt(f) = Y lt(f,) - 2x lt(h)· (Here C2 = 0, as it must, sinee lp(f) = xy is not divisible by lp(h) = x8) So we have done what we said we wanted ta do, namely, we have cauceled out the leading term of 1 using the polynomials in the set F. Also, 1 could not have been reduced using only one of the polynomials /J, 12, h·
204
CHAPTER 4. GROBNER BASES OVER RlNGS
We draw attention ta the condition Ip(f) = Xi Ip(fi) for aIl i such that Ci # O. Its purpose is to ensure that each CiXi It(fi) in the difference h = f - (clXdl + ... + c,X,f,) with Ci # 0 is actually used to help cancellt(f). Because of the possibility of zero divisoIs in the ring R, we must be careful about this. For example, if we only required Ip(f) = Ip(CiXdi) for all i such that Ci # 0, we could end up with a term c Ip(f) remaining in h. To see this consider the case where R = ZIO with deglex and x > y and let f = 3y, ft = 5x 2 + y, and 12 = y. Then It(f) = 2lt(fd + 3It(h) and Ip(f) = Ip(2ft) = Ip(3h) = y whereas h = f - (2ft + 312) = -2y. Having been careful with the definition we have the following cruciallemma whose easy proof we leave ta the exercises (Exercise 4.1.13). LEMMA 4.1.3. With the notation of Definition 4.1.1 we have Ip(h) < Ip(f). Let f E A and let F = {ft, ... ,f,} be a set of non-zero polynomials in A. We now examine how we would determine whether f is reducible modulo F. We first find the set J = {j IIp(!,) divides Ip(f),l <:: j <:: s}, which is readily done. We are réstricted to such J by the requirement that Ip(f) = Xi Ip(f;) in Definition 4.1.1.Then we must solve the equation (4.1.1)
Ie(f)
=
I), !c(!,) JEJ
for b/s in R. This equation can be solved if and only if le(f) E (Ie(!,) 1 j E J) R, where for a subset C ç: R we denote by (C) R the ideal in the ring R generated by the elements of C. Once we have the b/s then we can reduce f:
EXAMPLE 4.1.4. We put Exanlple 4.1.2 in this context. We have Ip(f) = xy and so J = {l,3}. Thus we need ta solve !c(f) = 1 = bl 1e(fJ) + b3 Ie(h) = 7b l + 3b3· We choose the solution bl = 1, b3 = -2 and thus we reduce f as h = f - (blyft + b3x/3) = f - (yft - 2xh) = -3y - lOx - 1. It is thus clear that we must be able ta solve linear equations in the ring R in order ta be able ta reduce in A. This condition is one of two conditions R must satisfy in order ta compute the objects of interest ta us in this chapter. We Hst these two conditions in the following DEFINITION 4.1.5. We will say that linear equations are solvable in R provided that (i) Given a, al, . . . ,am E R, there is an algarithm ta determine whether a E (al, ... , a=) R and if it is, ta compute bl ,.·· , bm E R such that a = al bl + ... + ambm ;
4.1. BASIC DEFINITIONS
205
a" ... ,am E R, there is an algorithm that computes a set of generators for the R-module
(ii) Given
Examples of such rings include Z, Zm, Qlxl, ... ,xn ], Zli] where i 2 = -l, and
ZIR]. The first condition in Definition 4.1.5 is the one discussed above which we found necessary in order to make the reduction process computable. The second condition is needed to guarantee that the algorithm (Algorithm 4.2.1) to be presented in the next section for computing Grübner bases in A is actually implementable. We will always assume that !inear equations are solvable in R when an algorithm is presented. However, as has been the case throughout this book, we will otherwise be informaI about our assumptiolls on the ring. In particular, without the assumption that linear equatiolls are solvable in R, what we present in this chapter is valid, and should be viewed as mathematical existence statements. As in the case of ideals and modules where the coefficients lie in a field, we need to iterate our reduction process. DEFINITION 4.1.6. Let f, h, and JI, ... ,f, be polynomials in A, with JI, ... ,f, of 0, and let F = {JI, ... ,fs}. We say that f reduces ta h modulo F, denoted F
f ----->+ h, if and only if there exist polynomials h" ... ,ht-l
E
A such that
We note that if f ~+ h, then f - h E (f" ... ,J,). EXAMPLE 4.1.7. We continue Example 4.1.2, where R = Z and f = xyl,JI = 7x+3, h = llx3 _2 y 2 + 1,13 = 3y- 5 and F = {f" fz, h}. We see that
f ~+ -lOx - 6, since f ~ -3y -lOx - 1 ~ -lOx - 6. The first reduction is the one noted in the previous example and the second is obtained by simply adding h to -3y - lOx - 1. We note that this reduction could not have been done in one step. DEFINITION 4.1.8. A polynomial r is caUed minimal l with respect to a set of non-zero polynomial;; F = {f" ... ,J,} if r cannat be reduced. l In the case of ideals and modules where the coefficients were in a field we required that every term in r could not be reduced. Here we are only requiring that lt(r) cannot be reduced modulo F. In the literature this latter concept is sometimes refered to as "reduced", but we adopt the word "minimal" so that we are consistent with the similar concepts in the rest of this book.
206
CHAPTER 4. GROBNER BASES OVER RINGS
Wc recall the notation that for a subset W is denoted
Lt(W) = ({lt(w) 1 w
ç A, the leading term ideal of W E
W}).
We have the following eaBy LEMMA 4.1.9. A polynomial r E A, with r
#
0, is minimal with respect ta a set oJ non-zero polynomials F = {h,· .. ,J,} ÇA iJ and only iflt(r) 't Lt(F). PROOF. Ifr is not minimal, then r can be reduced and so from Definition 4.1.1 we have that lt(r) = c,X,lt(h) + .. -+c,X, lt(j,) for Ci E R and power products Xi. It immediately follows that lt(r) E Lt(F). Conversely, if lt(r) E Lt(F), then lt(r) = h,lt(j,) + ... + h, lt(j,) for sorne polynomiaIs hi E A. If we expand this equation out into its individual terms, we see that the only power product that cau DCCUI with a non-zero coefficient is Ip(r); that is, we may assume that each hi is a term CiXi. It is then clear that r - (c,X,j, + ... + c,X,!,) is a reduction ofr. 0 We note that in Example 4.1. 7, the polynomial r = -lOx - 6 is minimal with respect ta F = {J" h, Jd since only h has the property that lp(ji) divides lp(r) = x and kIr) = -10 't (k(h))z = (7)z. We have
# 0, and set F = A, minimal with respect ta F, such that
THEOREM 4.1.10. Let J, h, ... ,J, E A with h, ... ,J,
{J" ... ,J,}. Then there is an r
E
F f ---------++ r. Moreover, there are hI, ... ,hs
J
=
h,j,
E A such that
+ ... + h,!, + r
with lp(j) =max((max lp(hi)lp(ji)),lp(r)). l:S;tS;s
If linear equations are solvable in R, then hlJ ... ,hS1 rare computable. PROOF. Either J is minimal with respect to F or J ...!:... r,. Similarly, either rI is minimal with respect ta F or rI ~ T2' Continuing in this way we get
where we have, by Lemma 4.1.3, lp(j) > lp(r,) > Ip(r2) > .... This process must end since the order on the power products is a well-ordering, and 80 we obtain the desired polynomial r. We now have
By the definition of reduction,
4.1. BASIC DEFINITIONS
207
for sorne CIl,··· ,Cl s ER and sorne power products X I1 , ... ,X1S1 where It(f) = CllXlllt(fI) + ... + C"X It(f,) and Ip(f) = X,i Ip(!;), for ail i ·such that Cli # o. This gives a representation of the desired type for f - r,. Similarly, Tl - T2 = C21X21h + ... + C2sX2sfs for sorne C21, ... ,C2s E R and sorne power produets X 2l ,··. , X 2" where It(r,) = C2lX2llt(fI) + ... + C2,X2s It(f,) and Ip(r,) = X 2i Ip(fi), for all i sueh that C2i # O. Since Ip(f) > Ip(r,) > Ip(rz), we get a representation of the desired type for f - r2, namely
,s
Continuing in this way we eventually obtain a representation for desired type. The last statement is clear. D
f - r
of the
The method for eomputing r is given as Aigorithm 4.1.1. We note that in obtaining It(r) = C,X, + ... + c,Xs in Aigorithm 4.1.1, we are assuming that Ci = 0 for ail i sueh that Ip(fi) does not divide Ip(r). Assuming that !inear equations are solvable in R we can determine whether the Ci '8 exist -and compute them if they do. INPUT: f,fI, ... ,f, E A with!;
#
0 (1 SiS s)
OUTPUT: h"., . , h" r, where f = hd,
+ ... + hsf, + r with
Ip(f) = max(maxl<:i<:s (Ip(h i ) Ip(fi)), Ip(r))
and r is minimal with respect to {fI, . .. , f,} INITIALIZATION: h, := 0, ... , h, := 0, r :=
f
WHILE there is an i sueh that Ip(fi) divides Ip(r) and there are
Cll ... ,Cs E
R and power products Xl, ... ,Xs
sueh that It(r) = C,X,lt(fI)
+ ... + c,Xs It(fs) and
Ip(r) = Xi Ip(!;) for ail i sueh that r := r - (c,Xd,
Ci
# 0 DO
+ ... + esXsf,)
FOR i = 1 to s DO
ALGORITHM 4.1.1. Division Algorithm over Rings EXAMPLE 4.1.11. We reconsider Examples 4.1.2 and 4.1.7 using Aigorithm 4.1.1. The first pass through the WHlLE loop was done in Example 4.1.2 and gaveusr= f-(yfI + (-2x)h) = -3y-lOx-1, h, =y, h 2 =0, andh3 = -2x. The second pass through the WHlLE was done in Example 4.1. 7 and gave us
208
CHAPTER 4. GROBNER BASES OVER RINGS
r= (-3y-lOx-1)-(-h) = -lOx-6, hl =y, h 2 =0, andh3 = -2x-1. The WHILE loop stops, sinee only lp(ft) divides lp(r) but there is no Cl sueh that -lOx = lt(r) = ellt(ft) = cl(7x). Thus, f = yft + (-2x -l)h + (-lOx - 6). We may nQW give the first characteristic properties of Grobner bases over A = R[Xl, ... , xnl, for a ring R. THEOREM 4.1.12. Let I be an ideal of A and let G = {91, ... ,9,} be a set of non-zero polynomials in I. Then the followin9 are equivalent. (i) Lt(G) = Lt(!). (ii) For any polynomial f E Awe have
f
E
I if and only if f
-S+ O.
(iii) For all f E l, f = h l 91 + ... + ht9t for some polynomialB hl, ... , ht E A such that lp(f) = maxl<:i<:t(lp(hi) Ip(9i)). PROOF. (i)==? (ii). We know that if f -S+ 0, then f E I. Converselyassume that f E I. Then we know from Theorem 4.1.10, that f -S+ r with r minimal. If r oF 0 then, by Lemma 4.1.9, lt(r) cl Lt(G). But f Eland f - rEl imply that rEl and so lt(r) E Lt(!) = Lt(G), whieh is a contradiction. (ii)==? (iii). This is the special case of r = 0 of Theorem 4.1.10. (iii)==?(i). For f E I we need to show that lt(f) E Lt(G). We have that f = h l 91 + ... + ht9t such that lp(f) = maxl
DEFINITION 4.1.13. A set G of non-zero polynomials contained in an ideal I i8 called a Grübner basis for I provided that G satisfies any one of the three equivalent conditions of Theorem 4.1.12. A set G of non-zero polynomials contained in A is called a Grübner basis provided that G is a Grobner basis for (G). EXAMPLE 4.1.14. Let R = Z and A = Z[x, yi with the aeglex ordering with x < y. Let ft = 4x+1, h = 6y+1 and I = (ft,h). Then 3yft -2xh = 3y-2x E I while It(3y - 2x) = 3y cl (lt(ft), lt(h)) = (4x,6y), and thus {ft, h} is not a Grobner basis for I. On the other hand, as another example, let 91 = 2x+ l, 92 = 3y+ 1 and set!' = (91,92). Then by simply looking at alllinear combinations of 91 and 92, it is easily seen that Lt(!') = (2x,3y,xy) = (2x,3y) = (lt(91),lt(92)), and sa {91, 92} is a Grübner basis for!'. COROLLARY 4.1.15. IfG is a Grobner basis for the ideal I in A, then I = (G).
4.1. BASIC DEFINITIONS
PROOF. This is immediate from part (iii) of Theorem 4.1.12.
209
0
We note that the remainder r obtained in Theorem 4.1.10 is not necessarily unique, even in the case where F is a Grobner basis (see Exercise 4.1.6). We have, however, in Theorem 4.1.12 that for G a Grübner basis and f E (G), the only possible remainder for f is 0 with respect ta G. That is, COROLLARY 4.1.16. IfG is a GTobner basis and f E (G) and f r is minimal, then r = O.
-S+ T,
wheTe
PROOF. Since f E (G) we have that r E (G) and sa, since G is a Grübner basis, we see that if r cf 0 then T cannat be minimal by Lemma 4.1.9. 0 We further note that the Noetherian property of the ring R, and hence of the ring R[XI, ... ,X n ], yields the following Corollary, whose proof we leave ta the exercises (Exercise 4.1.5). COROLLARY 4.1.17. Let 1 finite Grobner basis.
ç R[XI,." ,Xn] be a non-zero ideal. Then l has a
We will develop a method for computing Grobner bases over rings in the next section. However, in the special case where R = k[y], for a field k and a single variable y, we see froID the next theorem that we can compute a Grobner basis over R using the theory presented in Chapter 1. This result will not be used until Section 4.5.
THEOREM 4.1.18. If G = {g" ... ,gt} is a Grobner basis in k[y, X" ... ,xn ] with respect ta an elimination arder with the x variables larger than y, then G is a Grobner basis in (k[yJ) [Xl, ... ,xn ]. PROOF. Let f E 1 = (g" ... ,gt). We will denote by Lt(l) , lt(f), lp(f), and le(f) the leading term ideal of l, the leading term, leading power product, and leading coefficient of f with respect ta the elimination arder <, and by Ltx(l), lt x (f),lpx (f) , and lcx(f) the leading term ideal of l, tbe leading term, leading power product, and leading coefficient of f in (k[yJ)[XI"" ,xn ] (i.e., here the arder is the one on the x variables alone which we will denote by <x). Note that lex(f) E k[y]. We will denote the leading term of a polynomial a E k[y] by
lty(a). We need ta show that Ltx(l) = (ltx(g,), ... ,ltx(gt)). One inclusion is trivial. For the reverse inclusion, let f E J. We write gi = aiXi+ lower x terms, where Xi is a power product in the X variables alone and ai E k[y]; sa ltx(gi) = aiXi. Since Gis a Grübner basis in k[y, Xl,' .. ,Xn ] we can apply the Division Algorithm (see Theorem 1.5.9 and its proof) to write (4.1.2)
CHAPTER 4. GROBNER BASES OVER RlNGS
210
where T j is a power product in the x variables alone, smee Ip(a,yVjTjgij ) = yV'Tj Ipy(ai,)Xi"
o.j
E k, for 1 S j S N and,
(4.1.3) yV'T, lpy(ai,)Xi ,
> YV'T2 Ipy(ai,)Xi , > ". > yVNTN Ipy(aiN )XiN"
Since we are using an elimination order with y smallest, we must have
Choose io least such that TjoXijo > Tjo+lXijo+l. Then T1Xi1 = TjXij for 1 ::; j S; jo, and T , X i , > X for al! other x power products, X, appearing in the right side of Equation (4.1.2). Thus jo
ltx(f) = (~= ajyVj aij )T, X ip
(4.1.4)
j=l
provided that h = L;';", ajyVjaij oF O. But from Equation (4.1.3), looking at the first io terms and canceling TjXij = T1Xi1 we get yV, lpy(ai,)
and so lpy(h) = yV, lpy(ai,) jo
> y"' lpy(ai,) > ... > yVjo lpy(aijo)'
oF O.
Finally, from Equation (4.1.4) we see jo
ltx(f) = I:ajTjlpy(aij)Xij = I:ajTJltx(gij)' j=l
as desired.
j=l
0
The converse of Theorem 4.1.18 is not true as the following example shows. EXAMPLE 4.1.19. Consider the ideal l = (y (y + l)x, y 2x) C;; Rix]' where R = klyJ. Then {y (y + l)x, y2X} is a Gr6bner basis in Rix]' since, in that ring, the polynomials y(y + l)x and y 2x are both terms. However, {y(y + l)x, y2X} is not a Gr6bner basis in klx, yJ with respect to the lex order with x > y (or any other order for that matter), since the polynomial yx is in l but its leading term (itself) is divisible by neither lt(y(y + l)x) = y 2x nor It(y 2x) = y 2x. Exercises 4.1.1. In 21x, y], let J, = 6x 2 + Y + 1, h = lOxy - y - x, h = 15y 2 + x, and F = {f" h h}. Consider deglex with x > y. a. Show that 2x 3y ~ _x 3 - x 2y - 2xy2 - 2xy. b. Show that x 3y2 ~ x 4 + x 3y + x 2y2 _ xy3 _ xy2. c. Show that x 3y2+5x 4 +X 3y ~+ _xy3_x2y_2xy2_y3_x2 _2xy_y2 In each case note that the remainder polynomial is minimal with respect to F. There are many ways one can reduce the above polynomials; the reductions above are just one possibility.
4.1. BASIC DEFINITIONS
211
4.1.2. In Z[x, y, z], let h = x'yz+yz+1, 12 = 5yz'-xy+z, h = 7yz-z-7y-2, and F = {f" 12, h}. Consider deglex with x < y < z. a. Use the Division Algorithm (Algorithm 4.1.1) on f = -3xy + 2xy2 2yz + 2z2 + 3x 2y'z' ta write f = hd, + h 2h + h3 h + r where r is minimal with respect to F and
lp(f) = max(lp(h , ) lp(j,) , Ip(h 2 ) lp(h), Ip(h 3 ) lp(h), lp(r )).
4.1.3.
4.1.4.
4.1.5. 4.1.6.
4.1.7.
(There are two obvious ways to begin this exercise; try them both.) b. Show that the set F is not a Gr6bner basis. In ZlO[X, y], let h = 3x 2y+3x, 12 = 7xy' +y, and F = {f" h}. Consider lex with x > y. a. Use the Division Algorithm (Algorithm 4.1.1) on f = x 3 y 3 + 5x 2y2 + x 2 y + 1 to write f = h,h + h 2 h + r, where r is minimal with respect to F and lp(f) = max(lp(h , ) lp(j,) , Ip(h 2 ) Ip(f2), lp(r)). [One answer is r = -x + 1.] b. Show that the set F is not a Gr6bner basis. Show that the subset F = {3x 2 y - x, 2Z2 - x} ç Z[x, y, z] is a Gr6bner basis with respect ta deglex where x > y > z and is not a Gr6bner basis with respect ta lex where x > y > z. Prove Corollary 4.1.17. If you attempted ta mimic the definition of reduced Gr6bner basis of Chapter 1, what would be the significance of the equality of ideals in Z[x, y], (2x', 3y2 + x') = (2x 2 , 3y2 + 3x 2)? Alternatively think about this example for the idea of uniqueness of reduction. [Hint: Either set is a Gr6bner basis with respect ta deglex with y > x. Also, try reducing 6x'y2 - x 4 ] Show that for any polynomials f,g E R[XI, ... ,xn], for any finite set of non-zero polynomials F in R[Xl1'" ) x n ], and for any power product X E R[Xl, ... ,xn ], we have F
a. If f E F, then fg --->+ O. F
F
b. If f --->+ g, then Xf --->+ Xg. 4.1.8. Let {g" ... ,g,} ç R[XI, ... ,xn ] and 0 op h E R[XI, ... ,X n ], where R is an integral domain. Prove that {gl, ... ,g,} is a Gr6bner basis if and only if {hg ... , hgt} is a Gr6bner basis. 4.1.9. Let 1 "be a non-zero ideal in R[XI' .. . , x n ] and let G be a Gr6bner basis for 1. We say that G is a minimal Gr6bner basis provided that for ail 9 E G, 9 is minimal with respect to the set G - {g}. Prove that every Gr6bner basis for l contains a minimal Grôbner basis for J. 4.1.10. Let 1 be a non-zero ideal in R[Xl,.'. ,xn ] and let G be a Gr6bner basis for 1. a. Prove that G n R generates 1 n R. b. Cali a generating set F for an ideal J ç R a minimal generating set provided that for ail r E F we have (F - {r}) op J. Show that if G
212
4.1.11.
4.1.12.
4.1.13. 4.1.14.
CHAPTER 4. GROBNER BASES OVER RlNGS
is a minimal Grübner basis for I (see Exercise 4.1.9), then G n R is a minimal generating set for I n R. Let I be an ideal in R[XI,'" , xnJ and let 71" denote the quotient map R[XI, ... , xnJ - - 4 (R/(I n R)[XI, ... , x n ]. Let G c:: I be given. a. Show that if G is a Grübner basis for I then 71"( G) is a Grübner basis for 7I"(I). [Hint: For all f E l, f ct In R, write f = fo + ft where 71"(10) = 0 and 7I"(lc(j,)) # O.J b. Show that G is a minimal Grübner basis for I (see Exercise 4.1.9) if and only if GnR is a minimal generating set for InR (see Exercise 4.1.10), 7I"(G-GnR) is a minimal Grübner basis for7l"(I), andlc(g) ct InR forallgEG-GnR. Let G be a Grübner basis for a non-zero ideal I of R[XI"" , Xn ], where R is an integral domain. Let K be the quotient field of R. Let J be the ideal of K[XI"" , xnJ generated by I. Prove that Gis also a Grübner basis for J with respect to the same order. Prove Lemma 4.1.3. Use the ideas in this section and those of Section 3.5 to obtain a theory of Grübner bases for R[xl, ... , xnJ-submodules of R[XI, ... , xnJm. In particular state and prove the analog of Theorem 4.1.12 for R[XI,." ,xn]modules.
4.2. Computing Grübner Bases over Rings. In this section we will give another characterization of Grübner bases (Theorem4.2.3) which is similar to the S-polynomial criterion in Theorem 1. 7.4, and is the direct analogue of Theorem 3.2.5. Of course, now our syzygy modules are submodules of (R[XI, ... ,xnW. We will then give the analogue of Buchberger's Aigorithm (Algorithm 1.7.1) for the case of rings (Algorithm 4.2.1). We will conclude this section by giving an iterative algorithm for computing the syzygy module needed for this generalized Buchberger Algorithm (see Aigorithm 4.2.2). We note, by Theorem 1.1.3, that R Noetherian implies R[XI, ... , xnJ is Noetherian. Moreover, by Theorem 3.1.1, (R[XI, ... , xn])' is Noetherian, and hence any submodule of it is Noetherian and finitely generated.
Given power products Xl,'" ,Xs and non-zero elements Cl, ••. ,Cs in R set L = [ C 1 X 1 csXs ]. Then, for a power product X, we cali a syzygy h = (h" ... ,h,) E Syz(L) c:: (R[XI, ... ,xn ])' homogeneous of degree X provided that each hi is a term (i.e. lt(h i ) = hi for aIl i) and Xi Ip(h i ) = X for aIl i such that hi # O. DEFINITION 4.2.1.
We have the following easy lemma. LEMMA 4.2.2. With the notation above, Syz(L) has a finite generating set of homogeneous syzygies.
213
4.2. COMPUTING GRbBNER BASES OVER RlNGS
PROOF. As noted above Syz(L) is finitely generated. Thus it suffices to show that given any syzygy h = (h" ... ,hs) E Syz(L) we may write h as a SUll of homogeneous syzygies. Now we know that h,c,X, + ... + h,csX, = O. Then expanding the polynomials hi into their individual terms, we see that for any power product X we may collect together ail the terms in this last SUffi whose power product is X and these must also add to zero, sinee aU the
This immediately gives the desired representation.
CiXi
are terms.
D
We can now give the following theorem whose proof parallels exactly the proof of the similar Theorem 3.2.5 (Exercise 4.2.1). THEOREM 4.2.3. Let G = {g" ... , gt} be a set of non-zero polynomials in A. Let B be a homogeneous generating set for SYZ(lt(g,), ... ,lt(gt)). Then G is a Grobner basis for the ideal (g" ... , gt) if and only if for all (h" ... , ht) E B, we have
h,g ,
+ ... + htgt
G
----;+ O.
One can view the expression h,g , + ... + h,gt above as a generalized Spolynomial, since in that expression the leading terms cancel (this is the basic idea that was used in Section 3.3). Thns we see how we will go about generalizing Buchberger's Algorithm. We first must find a hornogeneous generating set for the syzygy module of the leading terms. We then fonn the generalized S-polynomials and reduce each one of thern using the reduction presented in the previous section, If o~e of these does not reduce to zero, we add the reduction to our set and repeat the procedure. The next question we must answer is how do we go about constructing a homogeneons generating set for SYZ(lt(g,), ... , lt(gt)). Or in general, given power products Xl,'" ,XI'; and non-zero elements CI, ...
,Cs
ER how do we construct a
homogeneous generating set for SyZ(C,X ... , c,X,). In view of Equation (4.1.1) and the surrounding discussion, we make" the following DEFINITION 4.2.4. For any subset J ç {J, ... , s}, set X J = lcm(Xj 1 j E J). We say that J is saturated with respect to X" ... , X s provided that for aU j E {l, ... , s}, if X j divides XJ, then j E J. For any subset J ç {l, ... ,s} we cali the saturation of J the set J' consisting of all j E J such that X j divides XJ. (Note that X J = XJ'.)
EXAMPLE 4.2.5. Let X, = xy,X2 = X 2 ,X3 = y, and X 4 = x" Then if J = {1,2} we see that XJ = x 2 y and J is not saturated since 3 ct J, while X 3 = y divides XJ = x 2 y. On the other hand, if J = {l, 2, 3} we see that XJ = x 2 y and J is saturated since 4 is the only element of {l, 2, 3, 4} not in J and X 4 = x 4 does not divide X J = x 2 y. Clearly {l, 2, 3} is the saturation of
{1,2}.
214
CHAPTER 4. GROBNER BASES OVER RINGS
We recall the notation for the standard basis vectors
e, = (1,0, ... ,0), e2 = (0,1,0, ... ,0), ... , e, = (0,0, ... ,0,1) for AS. Given the above notation we are now prepared to state THEOREM 4.2.6. For each set J ç {1, ... , s}, which is saturated with respect ta X" ... , X" let BJ = {bu, . .. , bvû } be a generating set for the R-module of syzygies SyzR(Cj 1 j E J). (Note that each of the vectors bvJ is in the R-module RIJI, where IJI denotes the cardinality of J). For each such bvJ , denote its jth coordinate, for jE J, by bjvJ . Set
(Note that each of the vectors SvJ is in AS.) Then the set of vectors SvJ, for J ranging over aU such saturated subsets of {l, ... ,s} and 1 :s: v :s: VJ, forms a homogeneous generating set for the syzygy module SyZ(C,X ... , csXs ). " PROOF. It is first of all clear that each of the vectors SvJ is homogeneous of degree XJ. Moreover, SvJ is a syzygy of [ C1 X 1 csXs J, since
c,X, J I>jvJ ~J ej jEJ
J
L bjvJ ~J CjXj = XJ L bjvJcj = 0, JEJ
J
jEJ
by the definition of bvJ . Now, let h = (h" ... , hs) E SyZ(C,X, ,··· , c,X,). Since, by Lemma 4.2.2, SYZ(C,X ... , c,X,) is generated by homogeneous syzygies, it " suffices to write h as a linear corobination of the svJ's in the case that h is a homogeneous syzygy, say of degree Y We write h = (d, Y" ... , d,Y,) for d" ... , d, E R and power products Yi, ... , Y,. Set J = {j 1 dj of O} and let J' be the saturation of J. We have that YjX j = Y for ail j E J, since h is homogeneous of degree Y Then, since h is a syzygy of [ C1 X 1 csXs ] , we have L,jEJdjYjCjXj = YL,jOdjcj = O. Thus (dj 1 jE J') is a syzygy of [ Cj 1 j E J' J and so by hypothesis vJ
(dj
li E J') =
,
L rubuJ" 1/=1
that is, for each j E J', vJ ,
dj =
L 1.'=1
TvbjvJIJ
4.2. COMPUTING GROBNER BASES OVER RINGS
215
for sorne rv E R. Now YjXj = y for all jE J implies that X J = XJ' clivides Y. It now follows that
VJI
= ~)L rvbjvJ' )Yjej
=
jEJ'
jEJ' v=l
as desired.
L
S
djYjej
=L
djYjej
= h,
j=l
D
We note that we have reduced the problem of computing a generating set for SyZ(CIX ... , c,X,) to computing the subsets of {l, ... , s} which are saturated " with respect to X" ... , X, (an easy task) and cornputing syzygies in R (see Definition 4.l.5). EXAMPLE 4.2.7. We consider R = Z and let C,X, = 2xyz, C2X2 = 5xy2, C3X3 = 85y2, and C4 X 4 = 6x 2z. We will assume that the reader can solve the elernentary linear diophantine equations that occur in this example (i.e. that the reader can "solve linear equations in R = Z"). One readily checks that the saturated subsets of {1,2,3,4} are {l}, {3}, {4}, {1,4}, {2,3}, {1,2,3}, and {l, 2, 3, 4}. Since R =Z is an integral domain, the singletons {l}, {3}, {4} do not give rise to any non-zero syzygy. For J = {1,4} we need to solve in R = Z the equation 2b , +6b4 = 0 and one finds a generating set for the solutions of this equation to be {(-3,1)}. Since X J = x 2 yz, the corresponding syzygy is -3X;::e, +
,
{2,3}, we need to solve 5b2 + 85b3 = 0 , , which gives us {(-17,1)} and the syzygy -17~e2 + 'f}re3 = (O,-17,x,O). The set J = {l, 2, 3} gives the cliophantine equation 2b , + 5b2 + 85b3 = O. This may be solved to yield two generators for the solutions, namely, (-40, -l, 1) , and (-5,2,0). Then, with XJ = xy2z, we obtain the syzygies -40"xYy:e, -
xx'": e4
= (-3x, 0, O,.y). Now for J =
xx~2ze2 + x~~ze3 = (-40y,-z,xz,O) and -5xx~:el + 2 x: :2z e2
= (-5y,2z,O,O). Finally for J = {l, 2, 3, 4}, we get the generators (-40, -l, 1,0), (-5,2,0,0) and (-3,0,0,1). These will give syzygies that have already been obtained, as is readily checked, and so are not needed in our generating set. So we obtain that
Syz(2xyz, 5xy2, 85y2, 6x 2z) = (( -3x, 0, 0, y), (0, -17, x, 0), (-40y,
-2,
xz, 0), (-5y, 22, 0, 0)).
Now ·that we cau compute, by Theorem 4.2.6, a homogeneous generating set for Syz(lt(ft), ... ,lt(f,)), for any set of polynomials {ft,· .. , J,}, we can give Algorithm 4.2.1, the algorithrn for computing Grübner bases for ideals in A = R[XI, ... , Xn].
4.2.8. IJ R is a Noetherian ring and linear equations are solvable in R then Algorithm 4.2.1 produces a Grobner basis Jor the ideal (f"", , J,). THEOREM
216
CHAPTER 4. GRbBNER BASES OVER RINGS
INPUT: F
=
{!J, ... , j,} ç A
with ji
7' 0
=
R[Xl' ...
,xnl
(1 <:: i <:: s)
OUTPUT: G={g" ... ,gt}, aGrübnerbasisfor (J" ... ,j,) INITIALIZATION: G WHILE G'
:=
0, G' := F
7' G DO
G:=G' Let the elements of G be g" . .. , gt Compute B, a homogeneous generating set
for SYZ(lt(g,), ... ,lt(gt))
FOR each h = (h" ... , h,) E B DO Reduce h,g,
G' + ... + htgt ---->+ r,
with r minimal with respect ta C'
IF r
7' 0 THEN G':= G'U{r}
ALGORITHM 4.2.l. Grobner Basis Algorithm over Rings. PROOF. If the algorithm stops, it clearly, by Theorem 4.2.3, gives a Grübner basis for the ideal (J" ... , j,). As the algorithm progresses we add ta a set of polynomials G a polynomial or, minimal with respect ta C, ta obtain a new set G'. By Lemma 4.l.9, lt(r) rf Lt(G) and sa Lt(G) ç Lt(G'). Thus, since the ring A is Noetherian, the algorithm stops. D We will give examples of computing Grübner bases shortly, after we have giveu a more efficient method for computing the relevant syzygies. We will incorporate this method for computiug syzygies iuto our Grübner basis Algorithm and so we will actually use Algorithm 4.2.2 ta compute these examples (see Examples 4.2.11, 4.2.12, and 4.2.13) Müller [MoSS], has given a method for computing the syzygies that arise in Algorithm 4.2.l. In particular, this method gives an efficient way ta avoid duplication in the computation of syzygies (see Algorithm 4.2.2). At each stage of Algorithm 4.2.1 we add one or more polynomials ta the generating set for the ideal (!J, ... , j,). Sa we have ta compute a generating set for the syzygy module of this increased set of leading terms. The idea of Müller is ta use the computations of syzygies already done in arder ta compute the new syzygies.
217
4.2. COMPUTING GROBNER BASES OVER RINGS
Note that any syzygy of a set of leading terms is automatically a syzygy of a larger set of leading terms (using zeros for the extra leading terms). Again consider power products Xl) ... X s and non-zero elements Cl, ... ,Cs E R. Let Sa = SyZ(C,X ... 'caXa ) for 1 -<: cr -<: s. We will compute a homoge~ " Ss = SYZ(C1X11 • • • 1 csXs ) by inductively constructing neous generating set of generating sets for the Sa. We note that a homogeneous generating set of S, consists of a generating set of the ideal {r E R 1 rCl = O} of R (called the anni~ hilatorof Cl and denoted by ann(cl)) viewed as a subset of A. Also, ifwe take a homogeneous syzygy (b,Y ... ,baYa) in Sa, there are two possibilities. Either " a-,) is a homogeneous syzygy in Sa-l; or ba # 0 ba = 0 and (b,Y ... , ba-lY 2 " and ba E (Cl, ... ,Ca-l)R: (Ca)R. So we proceed as follows. We consider all subsets J of {l, ... , cr}, saturated with respect ta Xl)'" ,Xa1 sueh that (j E J. For each sueh J let b1J ,··· ,bp,JJ denote a generating set for the ideal in R, (Cj 1 j E J, j # cr) R: (ca) R. N ow for each b~J there are bj E R such that 1
L
bjcj
+ bp,Jca
=
0,
JE] j#-o-
and we define the homogeneous syzygy in Sa by ~
XJ x.
JE]
J
L-- bj
Sj.tl =
ej
XJ
+ bp,J"X-ea' a
j#-<7
(The bj's may not be unique, but any choice will do.) We also consider a homogeneous generating set al, ... 1 am for 50'-1, which we assume, by induction, has been computed. Define for 1 -<: i -<: m, (ai, 0) to be the homogeneous syzygy in Sa with the coordinates of ai in the first cr - 1 coordinates and 0 in the last coordinate. We nQW can state
(al,O), ... , (a=,O) together with the syzygies for J c:: {l, . .. , (5} saturated with respect to X" ... , X a with cr E J, form a homogeneous generating set for the syzygy module Sa. THEOREM 4.2.9. The syzygies
s~J
PROOF. Let d = (d,Yl , ... ,daYa ), for d" ... ,da ER and power products Y l , ... 1 Yu, be a homogeneous syzygy in Sa of degree Y. If da = 0 it is clear that d is a linear combination of (a" 0), . . . , (a=, 0). SA assume that da # o. Let J' be the set of all j such that dj # 0 and let J be the saturation of J' inside {l, ... ,cr}. NotethatXJ =XJ' dividesY. Thenda E (Cj Ij E J,j #cr)R: (Ca)R and so da = L;~, ribiJ. Then it is easily checked that ~J
d- LT/-LSj.tJ f.L=1
2Recall that for two.ideals I,J
RlrJ ç I}.
ç
Y XJ
R the ideal quotient, 1: J, is defined ta be 1: J
=
{r E
CHAPTER
218
4.
GROBNER BASES OVER RlNGS
is a homogeneous syzygy with a zero in its oth coordinate and combination of (a" 0), ... , (am, 0), giving the desired result. D
80
is a linear
EXAMPLE 4.2.10. We will redo Example 4.2.7 using this method. We start with S, = Syz(2xyz) = (0). We now consider S2 = Syz(2xyz,5xy 2). The only saturated subset of {1, 2} that need be considered is {1,2} itself (actually {2} is saturated and contains 2 but will only yield the 0 syzygy). We note that (2)z: (5)z = (2)" giving the syzygy (-5,2) of [2 5]. Thus S2 = (( -5y, 2z)). We now tum to S3 and note that the saturated subsets of {1, 2, 3} containing 3 are {2,3} and {1,2,3}. Working with the first one, we note that (5),,: (85)z = (l)z giving the syzygy (-17,1) of [5 85] and so the syzygy (0,-17,x) in S3. For the second set, {1, 2, 3}, we note that (2,5),,: (85)" = (1)" and we may use the same syzygy as before. Therefore S3 = ((-5y,2z,0),(0,-17,x)). We finally tum our attention to S4 and note that the saturated subsets of {1, 2, 3, 4} containing 4 are {l,4} and {1,2,3,4}. For {1,4} we compute (2),,: (6)z = (l)z giving the syzygy (-3,1) of [2 6] and so the syzygy (-3x, 0, 0, y) in S4. Finally for {1, 2, 3, 4} we need to compute (2,5, 85)z: (6)" = (l)z and we may use the same syzygy as we did for {1,4}. So we obtain
S4
=
Syz(2xyz, 5xy2, 85y2, 6x 2z) = (( -5y, 2z, 0, 0), (0, -17, x, 0), (-3x, 0, 0, y)).
We note that the syzygy (-40y, -z, xz, 0) is not in this list, but was included in Example 4.2.7. It is not needed, since (-40y,-z,xz,0) = z(0,-17,x,0) + 8( -5y, 2z, 0, 0). We give the improvement of Algoritbm 4.2.1 which makes use of Theorem 4.2.9 as Algorithm 4.2.2. We leave the easy proof that it is correct to the exercises (Exercise 4.2.9). We close this section by giving sorne examples of Algorithm 4.2.2. Sinee the polynomials generated by the algoritbm are scattered throughout the text of the example, we have put boxes around them for easier reference. EXAMPLE 4.2.11. We continue with R = Z and use lex with x > y in Z[x, yi. Let l = (f" 12), where 1 j, = 3x 2y + 7y 1 and 12 = 4xy2 - 5x. The case CT = 1 in the algorithm does not generate new polynomials, sinee 0 z: (3)z = {O} (we note that, sinee R = Z is an integral domain, we never need ta consider singleton saturated sets). Now consider the case CT = 2. The only non-singleton saturated subset containing 2 is {1,2}. We compute (3)z: (4)" = (3)z which gives the syzygy (4y,-3x) in Syz(3x 2 y,4xy2). The corresponding S-polynomial, 4yj, - 3xh = 15x2 + 28y 2, is minimal with respect to Gand so we add it to G as 1Jo = 15x 2 + 28y2.1 To compute new syzygies, we need only consider the saturated subsets J ç {1, 2, 3} containing 3 and they are {1,3} and {l, 2, 3}. For {1, 3}, we compute (3),,: (15)z = (l)z giving the syzygy (5,0, -y) in Syz(3x 2y, 4xy2, 15x 2). The corresponding S-polynomiaI5j, -yJo = -28y 3 +35y is mininlal with respect to G and so we add it to G as f4 = -28y 3 + 35y. We also compute the syzygies corresponding to {l, 2, 3} by computing 3,4 z: (15)" =
4.2. COMPUTING GROBNER BASES OVER RINGS
INPUT: F = {f" ... , J,} ç Rlxl, ... , xnl with Ji
7' 0
219
(1 ::; i ::; s)
OUTPUT: Ga Gr6bner basisfor (f" ... , Js) INITIALIZATION: G WHILE
(J ::;
:= F, (J :=
1, m := s
m DO
Compute S = {subsets of {l, ... , (J}, saturated with respect to lp(j,), ... ,lp(fa), which contain
(J}
FOR each J E S DO
XJ:= lem(lp(fi)lj
J)
E
Compute generators biJ , i = 1, ... ,Ml
for (lc(fi)lj E J,j
FOR i
=
7' (J)R: (le(fa))R
1, ... , i'J DO
Compute bj E R, j E J, j
L
such that
7' (J
bj le(fj)
+ biJ leUa) =
0
jEJ,j:f=a
R.educe
" XJ L... bj 1 (J') fi jEJ,jj::.a
P
J
XJ
G
+ biJ~(f) Ja ---;+ r, P
0'
where r is minimal with respect to G
IF
r7' 0 THEN
m:=m+l
(J
:=
(J + 1
ALGORITHM 4.2.2. Grobner
Basis Algorithm in Rlxl, . .. , xnl using MOlier's Technique
(l)z and we may use the same syzygy (5,0, -y) as before. The saturated subsets of {l, 2, 3, 4} containing 4 are {2,4} and {l, 2, 3, 4}. For the first we compute (4)z: (-28)z = (l)z which gives the syzygy (0, 7y,0,x). The corresponding Spolynomial is 7Yh + XJ4 = -35xy + 35xy = O. As before the set {1,2,3,4} gives no new polynomial. We see now that {f" h, fa, J4} is a Gr6bner basis for (f" hl· This is a minimal Gr6bner basis (see Exercise 4.1.9 for the definition of
220
CHAPTER 4. GROBNER BASES OVER RlNGS
minimal Grübner basis). EXAMPLE 4.2.12. We consider the ideal J in Z20[X, y] (where Z20 is the ring Z/20Z) generated by the polynomials 1 j, = 4xy + x, 1 and [ 12 = 3x 2 + y. [ We use the lex term order with x > y. We again follow Algorithm 4.2.2 to compute a Grübner basis for J. We first consider 0' = 1. The only saturated subset of {1} is {1} itself. We compute (0): (4) = (5) (throughout this example we will use (... ) for ( .. ·)z,,). This gives rise to the polynomial 5j, = 5x. This polynomial cannot be reduced so we let h = 5x, and we add it to G. We next compute the saturated subsets of {1, 2 which contain 2. These are {2}, and {1, 2}. Sinee 3 is a unit in Z20, {2} does not give rise ta a new polynomial, i.e. (0): (3) = (0). For the set {1,2} we compute (4): (3) = (4) which gives rise ta the polynomial -3xj, + 4yh = -3x 2 + 4y2 ~ 4y2 + y. This polynomial cannot be reduced so we let 14 = 4y2 + y, and we add it to G. We now compute the saturated subsets of 1,2,3 which contain 3. These are {3}, {1, 3}, {2, 3}, {l, 2, 3}. For the set {3} we compute (0): (5) = (4) which gives rise ta the polynomial 4h = O. For the set {1,3} we compute (4): (5) = (4) which gives rise ta the polynomial 5j, -4yh = 5x ~ O. For the set {2, 3} we compute (3): (5) = (1) which gives rise to the polynomial 1512 - xh = 15y which cannot be redueed. Therefore we let 1 ~s )= I~Y) 1 and add it ta G. For the set {1, 2, 3} we see that (4,3): (5) = (3): 5 = 1 and this gives rise to the same polynomial as the set {2,3}. We next compute the saturated subsets of {1, 2, 3, 4} which contaln 4. These are {4},{1,3,4}, and {1,2,3,4}. For the set {4} we compute (0): (4) = (5) which gives rise to the polynomial 5/4 = 5y ~ O. For the set {l, 3, 4} we compute (4,5): (4) = (1) which gives rise to the polynomial yj, - Xl4 = O. The set {1, 2, 3, 4} does not generate a new polynomial, since (4,3,5): (4) = (4,5): (4). We compute the saturated subsets of {1, 2, 3, 4, 5} which contain 5. These are {5},{1,3,5}, {1,2,3,5}, {4,5}, {1,3,4,5} and {1,2,3,4,5}. For the set {5} we compute (0): (15) = (4) which gives rise ta the polynomial 4/5 = O. For the set {1, 3, 5} wc compute (4,5): (15) = (1) which gives rise ta the polynomial 3yh -xfs = O. Note then that {l, 2, 3, 5}, {l, 3, 4, 5} and {l, 2, 3, 4, 5} do not give rise ta any new polynomials. Finally, for the set {4, 5} wc compute (4): (15) = (0): (15) = (4) which does not give rise ta a new polynomial. Thus the algorithm stops and a Grübner basis for J is {j" 12, h, 14, Is}. Note that we have also shawn that Syz(lt(j,), lt(h), It(h), !t(f4), It(fs)) = ((5,0,0,0, 0), (-3x, 4y, 0, 0, 0), (0, 0, 4, 0, 0), (5,0, -4y, 0, 0), (0, 15, -x, 0, 0),
(0,0,0,5,0), (y, 0, 0, -x, 0), (0,0,0,0,4), (0,0, 3y, 0, -x)). This will be used later. EXAMPLE 4.2.13. We will now give one more example where the coefficient ring does not have the property that every ideal is principal. Wc let R =
4.2. COMPUTING GROBNER BASES OVER RINGS
221
Z[A] = {u + vAlu, vEZ}. It is well-known (see, for example, [AdGo]) that Z[AI is not a UFD and so not a PID. Indeed it is not hard to show that 2,3,1 + A, and 1 - A are not units and cannot be factored in Z[AI, but that 2·3 = (1 + A)(1 - A). According to Theorem 4.2.3, in order to compute a Grabner basis in such a setting we will need to be able to compute a homogeneous generating set for the syzygies of the leading terms of polynomials. We will follow Algorithm 4.2.2. To do this we need to compute ideals of the form (Cl, ... ,cg): (c) for ct, ... , C" cE Z[A] (in this example, of course, (Cl, ... ,Cf) means (Cl, ... ,c,lz[v'=5]). One can do this either by using sorne elementary theory of quadratic fields (see [Ad Go]) to find generators for these ideals, or one may proceed as follows. We need to find all '" E z[AI such that 00 = CIal + ... + Cial for sorne al, ... ,Œ,e E Z[.J=5]. Each of the c's and o's are of the farm u + vyC5 for integers u, v. Sa in the desired equatioll, one can simply equate the real and imaginary parts and obtain a pair of linear Diophantine Equations. These are discussed in elementary number theory courses (see Niven, Zuckerman, and Montgomery [NZM]). Many computer algebra systems have the facility to solve such equations. Of conrse, this method will give a generating set for (Cl, ... ,cg): (c) as a Z-module. This generating set is then also a generating set for (Cl, ... ,Cf): (c) as an Z[A]-module, but sorne of the generators might be redundant. In the computations below we will write down only the non-redundant generators but it is possible to verify that these generators are correct by the above procedure. We illustrate this more specifically for the case f = 1. 80 in this case we need ta solve CIal = co: for ŒI and Œ. We write Cl =U, +vlA andc=u+vA and "'1 =fJ, + ')'1 A and a = fJ+')'A, where Ul, VI, U, V, {311 /1, {3, 'Y are all integers. Then taking the real and imaginary parts of the equation Cl al - ca = 0, we need ta solve the pair of equations U,fJ, - 5V1Î'1 - ufJ + 5v')' = 0 and V,fJ, + U1Î'l - vfJ - u')' = 0 for the integers {31, 'YI, {3, 'Y. For an alternative method to do these computations, which is in the spirit of this book, see Exercise 4.3.l. Let us consider the ideal l in (Z[A])[x, y] generated by the two polynomials = (1 + A)x2 - xy. We use the lex ordering with .2.2 to compute a rabner basis for I. Sinee Z[A] is an integral domain we do not need to eonsider saturated sets which are singletons. 80 the first saturated set we must consider is {1,2}. We compute (2): (1 + A ) = (1- A,2). Since 3·2 - (1- A)(1 + A ) = 0, the first generator, 1- A , gives rise to the polynomial 3xfr - (1- A)Yh = (1 - A)xy2 + 3Axy, and this cannot be reduced any further using fr, h·
fr = 2xy + A y and h x > y. We follow 19orit
We let 1 h = (1 - A)xy2 + 3Axy·1 Also, the second generator,2, gives rise to the polynomiaJ
(l+A)xfr -2yh We let
1
f4 =
=
2xy2+A(l+A)xy ~ A(l+A)xy-Ay2 2
A(1 + A)xy - A y 1
222
CHAPTER 4. GROBNER BASES OVER RINGS
We now consider the saturated subsets of {1, 2, 3} which contain 3. They are
{1, 3} aud {1, 2, 3}. For the set {1, 3} we compute (2): (1- A) = (1 + A, 2). The first generator gives rise ta the syzygy (-3y, 0, (1 + A)) and the second ta the syzygy ((1- A)y, 0, -2). These give rise ta the following polynomials 3A(1 + A)xy - 3Ay2 J!.., 0, -6Axy + A ( l - A)y2 J'.., (5 + A)y2 - 15y.
-3yj, + (1 + A)h (1 - A)yj, - 2h
We let 110 = (5 + R)y2 -:- 15y.1 For the saturated set {1, 2, 3} we compute
(2,1 + A): (1 - A )
=
(1). This gives rise ta the polynomial
aud we let 16 = -3Rx2y + xy3 + Axy2. We now consider the saturated subsets of 1,2,3, 4} which contain .4. They are {1, 4},{1, 2,4}, {1, 3, 4} aud {1, 2, 3, 4}. For {1, 4} we compute (2): (-5+A) = (-1 + A, 2). These give rise ta the polynomials
3Aj, + (-1 + A)/4 (5 - A)j, + 2/4
= =
(5 + A)y2 - 15y Jo.., 0, -2Ay2 + 5(1 + A)y.
We let 1Ir = -2Ry2 + 5(1 + A)y.1 For the saturated set {1, 2, 4} we compute (2,1 + A): (-5 + A ) = (1). This gives rise ta the polynomial
3xj, - yfz
+ Xl4
=
(1 - A)xy2
+ 3Axy Jo.., O.
For the saturated set {1, 3, 4} we compute (2,1- A): (-5+ A ) = (1). This gives rise ta the polynomial
2yj, + h + yl4
=
3Axy - A
y3 + 2Ay2,
y3 + 2Ay2 ·1 We note that 18 cannat be reduced because 3 A rte (2, -5 + A). It is easy ta see that for the saturated set {1,2,3,4} we cau use the same syzygy as we did for {1,3,4}. aud we let 118 = 3Rxy - R
We now consider the saturated subsets of {1, 2, 3, 4, 5} which contain 5. They are {1,3,4,5} aud {1,2,3,4,5}. For {1,3,4,5} we compute (2,1- A,-5+ A): (5 + A ) = (1). This gives rise ta the polynomial
3yj, - h - Xl5
=
3(5 - A)xy + 3Ay2 J!.., O.
It is easy ta see that for the saturated set {1, 2, 3, 4, 5} we cau use the same syzygy as we did for {1, 3, 4, 5}. We now consider the saturated subsets of {1, 2, 3, 4, 5, 6} which contain 6. They are {1, 2, 4, 6} and {1, 2, 3, 4,5, 6}. For {1, 2,4, 6} we compute (2,1 + A, -5 +
4.2. COMPUTING GROBNER BASES OVER RlNGS
A): (3A) = (2,1 + A): (3A) rise to the polynomials
= (2,1 + A).
These generators gives
2xy 3 + 2Axy2 - l5xy 2Axy2 - l5xy - A y3 -15xy - A y3 + 5y2
J.'., J'.., 3Ayh + (1
223
-(5 + A)y3 + l5y2 0, (1 + A )xy 3
+ A)!6 y' ~fs
~
-
(5 + 2A)xy2
(-5 + A)xy2 _ A
y3
O.
For {l, 2, 3, 4, 5, 6} we compute (2, H A , l - A , -5+A, 5+A) :(3A)
= (2, 1 + A): (3A) = (2, 1 + A) and so we can use the same syzygies as in the previous case. We now consider the saturated subsets of {l, 2, 3, 4, 5, 6, 7} which contain 7. They are {5,7}, {l, 3, 4, 5, 7} and {l, 2, 3, 4, 5, 6, 7}. For {5,7} we compute (5 +
A): (-2A) = A ( - u A ) : A(-2) = (-uA): (2) = (3,1A). These generators give rise to the polynomials (1 + A)!5 +3h = 0 and 2!5 + (1- A)h = O. For {l, 3, 4, 5, 7} we compute (2,1- A, -5 + A, 5 + A): (-2A) = (1). We obtain the polynomial AyiJ
+ xh 15~f,
~
5(1 + A)xy - 5y2 -5(1 - A)y2 - l5Ay
o.
It is easy to see that for the saturated set {l, 2, 3, 4, 5, 6, 7} we can use the same syzygy as we did for {l, 3, 4, 5, 7}. We now consider the saturated subsets of {l, 2, 3, 4, 5, 6, 7, 8} which contain 8. They are {l, 4, 8}, {l, 3, 4, 5, 7, 8}, {l, 2, 4, 6, 8} and {l, 2, 3, 4, 5, 6, 7, 8}. For {l, 4, 8} we compute (2, -5 + A): (3A) = (2, -5 + A). We obtain the polynomials
2 Ay3 - 4Ay2 - l5y (5 + A)y2 - l5y 0,
-5(1 -Y!!!2jY/7
+ A)y3 + 5(5 + 2A)y2
O.
For {l, 3, 4, 5, 7, 8} we compute (2, l - A , -5+A, 5+A, -2A) :(3A) = (2, -5 + A), and so we may use the same syzygy as in the previous case. For {l, 2, 4, 6, 8} we compute (2,1 + A, -5 + A, -3A): (3A) = (1)
224
CHAPTER 4. GROBNER BASES OVER RINGS
We obtain the polynomial
16 + xls
(1 - A )xy3
+ 3Axy2 A o.
It is easy to see that for the saturated set {1, 2, 3, 4, 5, 6, 7, 8} we can use the same syzygy as we did for {1, 3, 4, 5, 7, 8}. We now have that the polynomials h, 12, 13'/4'/5, 16, h, 18 form a Grübner basis for I. We note that the leading coefficients of h, 14 and fs form an ideal which is equal to (1) = 2[A], so we can add to the Grübner basis a polynomial whose leading power product is xy and whose leading coefficient is 1. Indeed, 8·2 + 3(-5 + A ) + (-1)3A = 1, and so tbis polynomiàl is 8h +
18 - xy + A y 3 - 5Ay2 + SAy = 19. It can then {h'/5, h, 19 is also a Grübner basis for l Exercise 4.2.10).
3/4 -
be shown that
ExercÎses 4.2.1. Prove Theorem 4.2.3. 4.2.2. Prove the following: Let G = {g" . .. ,g,} be a set of non-zero polynomials in A. Let B be a homogeneous generating set of Syz(lt(g,), ... ,It(g,)). Then G is a Grübner basis for the ideal (g" ... ,g,) if and only if for all (h" ... ,h,) E B, we have h,g, + ... + h,g, = v,g, + ... + v,g, where Ip( h,g, + ... + h,g,) = max(lp( v,) Ip(gd, ... ,Ip( v,) Ip(g,)). [Hint: See the proof of Theorem 3.2.5.J 4.2.3. Show that in Müller's method of computing syzygies of terms (i.e. in Theorem 4.2.9 and Algorithm 4.2.2), if J ç J' ç {1, ... ,o-} with 0- E J, such that
and the set J has been used, then the set J' may be ignored. 4.2.4. Show that in Algorithm 4.2.2, if we consider the case where R = k is a field then we may improve the algorithm as follows: For each 0- in the main WHILE loop find the minimal number of distinct j" ... ,je, with 1 ~ jv < 0- such that for each J E S there is a jv E J. Then in the remainder of the WHILE loop we need only compute the reduetions of SUj" , lu) for 1 ~ v < r. Use this method to redo Example 3.3.5. Compare this result to the use of crit2 in Example 3.3.5. 4.2.5. Compute generators for the following syzygy modules. a. For R = 2 compute Syz(3x'y, 5x 2z, 9yz2, 7xy2z). [An answer: {( -5z, 3y, 0, 0), (-3z 2, 0, x 2, 0), (-4yz, y2, 0, x), (0,0, -7xy, 9z)}.J b. For R = 2'5 compute Syz(2x 2y, 5x 2 z, 9yz', 7xy2Z). [An answer: {(0,3, 0,0),(5z,y,0,0),(0,0,5,0),(3z 2,0,x2,0),(0, 0, 7xy, 9z), (-yz, _y2, 0, x)}.J
4.3. APPLICATIONS OF GROBNER BASES OVER illNGS
4.2.6.
4.2.7.
4.2.8.
4.2.9. 4.2.10. 4.2.11.
225
c. For R = Z[i] (where i 2 = -1) compute Syz(3ix 2y, (2 + i)x 2z,5yz2, 7xy2z). [An answer: {( -(2 + i)z, 3iy, 0, 0), (0, -(2 - i)yz, x 2, 0), (4iyz, (2 - i)y2, 0, x), (0, 0, -7xy, 5yz)).] For the ring R = Z compute a Grübner basis for the ideals generated by the given polynomials with respect to the given term order. a. fI = 2xy - x, fz = 3y - x 2 and lex with x < y. b. fI = 3x2y - 3yz + y, fz = 5x 2Z - 8z 2 and deglex with x > Y > z. c. fI = 6x 2 + y2, fz = lOx 2y + 2xy and lex with x > y. For the ring R = Z15 compute a Grübner basis for the ideal generated by the polynomials fI = 2x2y + 3z and fz = 5x 2Z + y with respect to deglex with x > y > z. For the ring R = Z[i] (where i 2 = -1) compute a Grübner basis for the ideal generated by the polY!1omials fI = 3ix 2y + (1 + i)z and fz = (2 + i)x 2 z + y with respect ta deglex with x > y > z. Praye that Algorithm 4.2.2 terminates and has the desired output. Show that {fz, 15, h, fg} forms a Grübner basis as asserted at the end of Example 4.2.13. Generalize the results in this section to the computation of Grübner bases for R[XI, ... , xn]~submodules of (R[XI, . .. , xn])m (see Exercise 4.1.14).
4.3. Applications of Grobner Bases over Rings. We are interested in applications similar to the aues in the previous tWQ chapters. We have basically seen in Section 4.1 how to solve the ideal membership problem and we will give an example of this. We will then show how to compute a complete set of coset representatives modulo an ideal. This requires more effort than it did in the case of fields because now one has to take into acconnt the ideals in the ring R. We will then explore the use of elimination in this context to compute ideal intersection, ideal quotients and ideals of relations. These applications are very mueh the same as before and do not require any sedous modifications in their statements or proofs. We will close this section by showing how ta compute a generating set for the syzygy module of an arbitrary set of polynomials. Sa let R be a Noetherian ring in which linear equations are solvable (see Definition 4.1.5) and let A = R[XI, ... , xn]. Let J be a non~zero ideal of A. Suppose that J = (fI, ... ,1,), set F = {f" ... , l,}, and let G = {gI, ... , g,} be a Grübner basis for J. Then by Theorem 4.1.12 we know that 1 E J if and only if 1 -S+ O. Thus we can determine whether or not 1 E J (ideal membership problem). Moreover as we saw in Algorithm 4.1.1, if 1 E J, we can obtain 1 = h,g , + ... + h,g, explicitly. Also, using Algorithm 4.2.2, we can find a matrix T such that (g" ... ,g,) = TU"", ,j,). Wc can then substitute in for the g;'s to obtain 1 as a linear combination of the l/s, provided 1 E J. EXAMPLE 4.3.1. We go back to Example 4.2.11. Recall that a Grübner basis for the ideal generated by fI = 3x'y + 7y and h = 4xy' - 5x in Z[x, y] with respect to the lex ordering with x > y is {fI, 12, h, 14} where fa = 15x2 + 28y2
CHAPTER 4. GROBNER BASES OVER RlNGS
226
and f4 = -28y 3 + 35y. We let f = 2x2y2 - 3x 2y + 5x 2 - 4xy 3 15x + 14y2 - 7y. First we verify that f E (ft, 12). We have
f
~
f-(-7xh+2y 2h) -3x 2y - 30x 2 - 4xy3
-:2:1,'
-30x 2 - 4xy 3
-
-
-
12xy2 + 5xy + 15x - 56y4
12xy2 + 5xy + 15x - 56y4
12xy 2
+ 5xy +
+ 14y 2 -
7y
+ 14y 2
-:!c!,' -4xy3 - 12 xy2 + 5xy + 15x - 56y4 + 70y2 -:'0,' -12 xy2 + 15x - 56y4 + 70y2 -:!c!,' -56y4 + 70y2 ~ o. Thus we have
f
(-7xh+ 2y 2h)-ft-2h-Yh-3h+2yf4 - ft - (7x
+ y + 3)12 + (2 y2 -
2)h + 2yf<.
In Example 4.2.11 we saw that
4y [ hj~f4 ]-- [~ _4y2 + 5
.
~] [ l·
-3x· 3xy
ff 1 2
T
Thus f = (2y - 1)ft + (-x - y - 3)h We nQW cOllsider the problem of determining a complete set of coset representatives for Aj I, where I is a non-zero ideal of A. We adapt the method given in Zacharias IZa]. In arder ta do this we must assume that given any ideal 'S of R, we can determine a complete set C of coset representatives of RICZS and that we have a procedure ta find, for ail a E R, an element c E C such that a =' c (mod 'S). If this is the case we say that R has effective coset representatives. Such rings include the integers Z, the finite rings ZjnZ, the polynomial rings
over fields, ZIyC5I, and ZIA]. We consider a Grübner basis G = {gl, ... ,g,} for the ideal I. With respect to the set {lt(gl), ... ,lt(g,)} of leading tenns of G, we consider the saturated subsets J ç {1, ... ,t} as defined in Definition 4.2.4. However here we also consider J = 0 to be saturated. Then for each saturated subset J ç {1, ... ,t}, we let I J denote the ideal of R generated by {ic(gi) 1 i E J} (if J = 0, then IJ = {O}). Let C J denote a complete set of coset representatives for RjIJ. We assume that 0 E C J . Also, for each power product X, let J x = {i IIp(gi) divides X}. It is clear that J x is saturated for ail power products X (in the proof of Theorem 4.3.3 we will need CJx for aU power products X, and of course J x could be equal to 0 for sorne X and that is why we had to add 0 in our list of saturated subsets).
4.3. APPLICATIONS OF GROBNER BASES OVER RINGS
227
DEFINITION 4.3.2. We call a polynomial r E A totally reduced provided that for every power product X, if eX is the corresponding term of r (here, of course, c E R may very well be 0), then c E C Jx' For a given polynomial f E A, we call a polynomial r E A a normal form for f provided that f r (mod I) and r is totalZy reduced.
=
It should be emphasized that the definition of normal form depends not only on the set of polynomials G, but also on the choices of the sets of coset repre-sentatives CJ for the set of saturated subsets J. The complete set of coset representatives is given by the following THEOREM 4.3.3. Let G be a Grobner basis for the non-zero ideal l of A. Assume that for each saturated subset J c:: {l, ... ,t}, we have chosen, as above, a complete sets of coset representatives CJ for the idealIJ . Then every f E A has a unique normal form. The normal form can be computed effectively provided linear equations are solvable in Rand R has effective coset representatives. PROOF. We will first show the existence of a normal form for f. The proof will be constructive. (We note that this part of the proof does not depend on the fact that G is a Gr6bner basis for I.) The proof will be similar in nature to the reduction algorithm. We will use induction on Ip(f) to obtain a totally reduced polynomial r with f = r (mod I) satisfying Ip( r) S Ip(f). If f = 0 then the result is cleaL Thus we assume the result for ail polynomials whose leading power product is less than Ip(f), where we assume that f oF O. To simplify the notation we set J = J lpU )' Wc may choose c E CJ such that le(j) c (mod I J ). (Note that c = 0 if and only if f is reducible.) Then we may write !e(f) - c = I:iEJ Ci !e(gi) (this can be done effectively by our assumption that linear equations are solvable in R). We consider
=
fI
=
f -
LC;~P(f) gi· iEJ
P(gi)
Write fI = clp(f) + ff· Then we have that Ip(f{) < Ip(f). Thus, by induction, there is a polynomial ri which is totally reduced, satisfies Ip(r,) S Ip(j{) and has the pro perty that ff ri (mod I). We now have that
=
, f = fi - ri
" Ip(f) + 'L., Ci~( .)g,+ (clp(f) + r,) iEJ
=
clp(f)
P 9t
+ ri
(mod I).
Let r = clp(f) + ri. Noting that Ip(r,) S Ip(f{) < Ip(f), we sec that Ip(r) S Ip(f). Finally, Iph) < Ip(f) implies that r = clp(f) + ri is totally reduced and 80 r is a normal farm for i, and the induction is complete. We next show the uniqueness of normal forms. So suppose that Tl, T2 are totally reduced, f = ri (mod I) and f = r2 (mod I). Then we have ri - r2 E l
and, since G is a Gr6bner basis for l, we have rI - r2 ~+ 0; in particular,
CHAPTER 4. GROBNER BASES OVER RINGS
228
if r, i' r2, we have that r, - rz is reducible. Thus if X = lp(r, - rz) and c = lc(rl - r2) we have that c E h x . Let d" d 2 be, respectively, the coefficients of X in TIl T2. Then since Tl and r2 are totally reduced, dl, d2 E C Jx ' Thus sinee c = d, - dz i' 0 we have that d, and dz represent distinct cosets mod IJx and so their difference cannot represent the 0 cosetj Le. c f/- l J x' This Îs a contr.adiction. Thus c = 0 and r, = r2, as desired. D We note that in the case where R = k is a field, Theorem 4.3.3 asserts that a k-basis for k[XI, ... ,xn]/I consists of all power products X such that J x = 0. This is precisely the statement of Proposition 2.1.6. EXAMPLE 4.3.4. We go back to Example 4.2.12. Recal! that a Grabner basis for the ideal I = (4xy + X, 3x 2 + y) in 2 zo [x, y] with respect to the lex ordering with x > y was computed to be {fI, 12, h 14, 15}, where fI = 4xy + x, 12 = 3x 2 + y, h = 5x, 14 = 4y2 + y, 15 = 15y. We wish to describe a complete set of caset representatives for 2 zo[x, y]/ I. We fol!ow Theorem 4.3.3 and we use the notation given there. First note that if X is any power product not 1, x, or y, then I Jx = 2 zo . We also have h, = {O}, and I Jx = h y = (5). We now choose a complete set GJx of caset representatives for Z2o/IJx for each power product X. Clearly for those ideals I Jx equal to 2 zo , CJx = {O}. We choose C Jx = CJy = {O, 1,2,3, 4}, and C J, = 2 20 . Therefore a complete set of coset representatives for 2 20 [x, y]/ Iis the set {a+bx+cy 1 a E 2 20 , b, cE {O, 1,2,3, 4}}. Thus, for example if 1 = 3x 2y + 2xy + 13y - 5, we have
1
~ 1-(3xyh-3xfI)=3x z +2xy+13y-5
J'-, 2xy + 12y - 5 ~
12y-2x-5 2y + 3x + 15 (mod 1).
As we did in Example 2.1.8 we can also construct a multiplication table for = X, x 2 J'-, x 2 -7h = -7y ==
220[X,yJ/I. Namely, we have xy ~ xy-yh + fI 3y (mod 1), and y2
i:!.dJ! y2 + 14 + yl5 = x 1 x y
y. Thus we obtain
1 x 1 x x 3y y x
Y
y x y
So, for example, (2 + 4x + 2y)(3 + 4x + 4y) = 6 + 14y + 16x 2 + 8y z + 4xy == 6 + 14y + 8y + 8y + 4x == 6 + 4x + lOy == 6 + 4x (mod 1). EXAMPLE 4.3.5. We go back to Example 4.2.13. Recall that a Griibner basis for the ideal I = (2xy + Ay, (1 + A)x2 - XV) in (2[yC5]) [x, y] with respect to the lex ordering with x > y was computed to be {J2, 15, h, fg}, where 12 = (1 + A)x2 -xv, 15 = (5+A)y2 -15y, h = - 2 Ay2+(5+5A)y, and 19 = xy + A y3 - 5AyZ + 8Ay. We wish to describe a complete set of
4.3. APPLICATIONS OF GROBNER BASES OVER RINGS
229
coset representatives for (Z[Al)[x, yI!J. We follow Theorem 4.3.3 and we use the notation given there. Note tirst that for every power product X not equal to l,xv, or Y~, v,l-'::> l, we have JJx = ZIA]. We also have JJ, = h, = JJ, = {D}, hv = (1 + A ) for v > l, and JJ,v = (5 + A , -2A) = (10,5+ A) for 1-' > 1. We now choose a complete set C Jx of coset representatives for Z[Al/JJx' for all power products X. Clearly, for those hx's equal to Z[Al we have CJx = {D}. We choose C J ," = {D, 1,2,3,4, 5} for v ::> 2 (note that 6 E (1 + A ) and that aj + a2A ~ aj - a2 (mod 1 + A)). We choose CJ,v = {D,l, 2, 3, 4, 5, 6, 7, 8, 9} for 1-' ::> 2 (as above we have aj + a2A ~ aj - 5a2 (mod 5 + A ) , and 10 E JJ,v)' Therefore a complete set of coset representatives for (Z[ADJx,YI!J is n
m
v
{a + bx + cy + Ldvx + L 1/=2
d v E {D, l, 2, 3,4, 5}, and
e~y~
1
a,b,c E Z[H],
J1=2
e~ E {D,l, 2, 3,4, 5, 6, 7, 8, 9} }.
We now turn to applications of the method of elimination (see Sections 2.3 and 3.6). The proofs are very similar to the ones in those two sections, 80 most of them will be omitted and left to the exercises. Let YI, ... ,Ym be new variables, and consider a non-zero ideal
J ç A[Yj,·.· ,Ym] = R[xj, ... ,Xn,Yj,··· ,Ym]. We wish ta "eliminatel.1 the variables YI, ... 1 Yrn, Le. we wish to compute generators (and a Grübner basis) for the ideal J n A. First, we choose an elimination order on the power products of A[Yj, ... ,Ym] with the y variables larger than the x variables (see Section 2.3). The next result is the analog of Theorems 2.3A and 3.6.6 and a generalization of Exercise 4.1.10 a. THEOREM 4.3.6. With the notation set above, let G be a Grobner basis for l with respect to an elimination order on A[Yj, ... ,Ym] with the y variables larger than the x variables. Then G nAis a Grobner basis for InA. PROOF. We will use the notation LtA and LtA[y[ for the leading tenn ideals in A and A[Yj, ... ,Ym] respectively. Then we need to show that LtA (G n A) = LtA(I nA). So if D of f E InA then lt(J) E LtA[y](I) = LtA[y](G) and so lt(J) is a linear combination of elements lt(g) such that 9 E G with coefficients in A[Yj, ... ,Ym]. Since lt(J) is a term involving only the x variables, we may assume each summand is a term and involves only the x variables. Hence for cach 9 E G appcaring in the snm for lt(J), lt(g) involves only the x variables and thus, since we are using an elimination order with the y variables larger than the x variables, each term in such a 9 involves only the x variables, i.e. 9 E G n A. Thus lt(j) E LtA(GnA), as desired. 0
CHAPTER 4. GROBNER BASES OVER RINGS
230
EXAMPLE 4.3.7. We go back to Example 4.2.12. Recall that a Gr6bner basis for the ideal l = (4xy + X, 3x' + y) in Z20[X, y] with respect to the lex ordering with x> y was computed to be {4xy+x, 3x 2+y, 5x, 4y2+ y , 15y}. Using Theorem 4.3.6 and Exercise 4.1.10 a, we see that In Z20[y] = (4y2 + y, 15y), and that InZ20 = {O}. EXAMPLE 4.3.8. We go back to Example 4.2.13. Recall that a Gr6bner basis for the ideal l = (2xy+,/=5y, (1+,/=5)x 2 -XV) in (Z[,/=5]) [x, y] with respect to the lex ordering with :r > y was computed to be
{2Xy
+ Ny, (1 + N)x 2 -
N ( l + N)xy - N -2Ny2
xv, (1 - N)xy2
+ 3Nxy,
y2, (5 + N)y2 - 15y, -3Nx2y + xy3
+ 5(1 + N)y, 3Nxy -
N
y3
+ Nxy2,
+ 2 Ny2}.
Using Theorem 4.3.6 and Exercise 4.1.10 a, we see that
In (Z[N]) [y] = ((5 + N)y2 - 15y, -2Ny2
+ 5(1 + N)y)
and In Z[N] = {O}. Our first application will be to compute the intersection of two ideals of A and the ideal quotient of two ideals of A. First, as in the ideal case over fields, Proposition 2.3.5, and as in the module case, Proposition 3.6.8, we have PROPOSITION 4.3.9. Let l = (J" ... ,j,) and J and let w be a new variable. Consider the ideal L = (wj" ... ,wj" (1- W)g" ... ,(1 - w)g,)
Then In J = L
= (g" ... ,g,)
ç A[w]
be ideals oj A
= R[XI, ... ,Xn, W].
n A.
As a consequence of this result we obtain a method for computing generators for the ideal l n J ç A: we first compute a Gr6bner basis G for the ideal L in Proposition 4.3.9 with respect ta an elimination order on the power products in A[w] with w larger than the x variables; a Gr6bner basis for In J is then given by GnA. EXAMPLE 4.3.10. In Z[x, y], we wish to compute (3x - 2, 5y - 3) n (xy - 6). So following Proposition 4.3.9 and Theorem 4.3.6 we consider the polynomials j, = 3xw - 2w, h = 5yw - 3w and 13 = xyw - 6w - xy + 6 and compute (J" h, 13) n Z[x, y]. We will outline the computation. We consider the deglex ordering with x > y on the variables x and y and an elimination order between w and x, y with w larger. We follow Algorithm 4.2.2 (observe that mueh use of Exercise 4.2.3 is made in this computation). The first saturated set to consider is {l, 2} which gives rise to 5yj, - 3xh ~+ O. The only saturated set containing 3 is {l, 2, 3} and this gives rise to 13 - 2yj, + xh ~+ 4wy - 8w - xy + 6 = j4. For the saturated sets containing 4 we need to consider {2,4} which gives rise to -5j4 +4h = 28w+5xy-30 = j5 and {J, 2, 3, 4} which gives rise to -Xj4 +413 = 8wx-24w+x 2y-4xy-6x+24 = j6. For the saturated sets containing 5 we need
4.3. APPLICATIONS OF GROBNER BASES OVER RINGS
231
to consider {1, 5} which gives rise to 3xJs -28h --++ 15x 2y-lOxy-90x+60 = h, {2, 4, 5} which gives rise to yJs - 7J4 --++ 5xy2 - 3xy - 30y + 18 = Is, and {1, 2, 3, 4, 5} which gives the salhe result as the last set (Exercise 4.2.3). For the saturated sets containing 6 we need to consider {1, 5, 6} which gives rise to J6 - 12h + xJs = 6x 2 y - 4xy - 36x + 24 = J9 and {1, 2, 3, 4, 5, 6} which gives the same result as the previous set. For the saturated sets containing 7 we need only consider {1, 2, 3, 4, 5, 6, 7} which gives rise to wh - 5xyh --++ O. The remaining cases (only 4 need to be computed) ail go to zero and we see that {f"h,h,J4,JS,J6,h,Js,fg} is a Grübner basis for (J"h, 13)· Thus
(3x - 2, 5y - 3)
n (xy -
6) = (5xy2 - 3xy - 30y + 18, 15x 2y - 10xy - 90x + 60,
6x 2 y - 4xy - 36x + 24). We also note that ho = h - 2J9 = 3x2 y - 2xy - 18x + 12 is in (J" h, 13) and we can replace h, J9 by ho so that {h,!2, 13, J5, J6, Js, JlO} is also a Grübner basis for (J" h, 13)· We thus obtain the simpler generating set
(3x - 2, 5y - 3) n (xy - 6) = (5xy2 - 3xy - 30y + 18, 3x 2y - 2xy - 18x + 12). The computation of ideal quotients is almost the same as in the previous cases except that one must be careful of possible zero divisors in Rand hence in A. PROPOSITION 4.3.11. Let l = (J" ... , Js) and J = (YI, ... , gt) be ideals oJ A. Then t
I: J={fEAIJJç:I}= nI: (Yj)· j=l
M oreover, if
In (g)
=
(h,g, ... , h,g)
and 9 is not a zero divisor in A, then we have
I: (g) = (h" ... ,hl). PROOF. The proof is exactly the same as in the ideal case over a field, see Lemmas 2.3.10 and 2.3.11, until we must verify that In (g) = (h,g, ... , h,g) implies that I: (g) = (h" ... , h,) which requires that 9 must be canceled and this requîres that 9 nat be a zero divisar. D As a consequence of this result we obtain a method for computing generators for the ideal 1: J ç A, provided that none of gl, ... ,9t are zero divisors. EXAMPLE 4.3.12. In Z[x, yi, we wish to compute (3x - 2,5y - 3): (xy - 6). From Proposition 4.3.11 we need ta compute (3x - 2,5y - 3) n (xy - 6) and divide the generators of this ideal by xy - 6. This intersection was computed in Example 4.3.10 and so dividing the polynomials obtained there by xy - 6 we obtain immediately that
(3x - 2, 5y - 3): (xy - 6) = (5y - 3, 3x - 2).
232
CHAPTER 4. GROBNER BASES OVER RlNGS
In Section 2.4 we used elimination to compute the kernel of an algebra homomorphism. Sometimes Rlxl, ... , x n ] is called an R-algebra, in order to emphasize the special role played by the ring R. In a similar vein, a ring homomorphism between tWQ polynomial rings QVeT R
q,: RIYI, . .. , Ym]
--->
Rlxl, ... , x n ]
is called an R-algebra homomorphism provided that q,(r) = r for aU r E R. It is then dear that q, is uniquely determined by
q,: Yi
f---->
1;,
for h, ... , Jm E RIXI, ... , Xn]. That is, if we let h E RIYI,." , Ym], say h = Lv cvyr 1 • • • y~tn, where Cv E R, v = (VI, ... , V m ) E Nm) and only finitely many cv's are non-zero, then we have
v
Given this setup we have the analogue of Theorem 2.4.2. THEOREM 4.3.13. With the notation above, let J = (YI - h,··· , Ym - Jm) ç; RIYI, ... ,Ym,XI,··· ,xn ]. Then ker(q,) = JnRIYI,'" ,Ym].
Another way to view Theorem 4.3.13 is that J n RIYI, ... , Ym] gives the ideal of relations among the polynomials h, ... , J,. EXAMPLE 4.3.14. Consider the Z-algebra homomorphism q,: Zlu, v, w] ---> Zlx,y] defined by q,: u f----> 3x-2, q,: v f----> 5y-3 and q,: w f----> xy-6. We wish to find the kernel of q,. Following Theorem 4.3.13 we let h = 3x - 2 - u, h = 5y - 3 - v, and 10 = xY - 6 - w and compute (h, h, 10) n Zlu, v, w]. We consider the term orders deglex with u > v > 'UJ on u, v, w and deglex with x > y on X, y and an elimination orcler between X, y and u, v, w with X, y larger. We follow Algorithm 4.2.2. The first saturated set to consider is {1,2} which gives rise to 5yh - 3xh --->+ O. The only saturated set containing 3 is {l, 2, 3} and this gives rise to -10 + 2yh - xh = vx + 3x - 2uy - 4y + w + 6 = J4. For the saturated sets containing 4 we need to consider {l,4} which gives rise to 3J4 - vh --->+ -6uy - I2y + uv + 3u + 2v + 3w + 24 = J5 and {l, 2, 3, 4} which gives rise to yJ4 - vh --->+ -2y 2u - 4y2 + yu + yw + 8y + vw + 6v = J6' There are three saturated sets containing 5 but we need only consider (Exercise 4.2.3) {2,5} which gives rise to 5J5 + 6uh = -uv - 3u - 2v + 15w + 84 = h and {l, 2, 3, 5} which gives rise to XJ5 + 6uh --->+ O. Ali remaining polynomials arising from saturated sets containing 6 or 7 recluce to zero. Thus we see that ker(q,) = (uv + 3u + 2v -15w - 84). We conclude this section by giving a method for computing a set of generators for the syzygy module, Syz(/J, ... , J,), for a set of non-zero polynomials {h, ... , J,} in A. Wedo this byfirst computing SYZ(YI,'" ,y,) fora Grobner basis {YI, ... ,g,} for the ideal (h, ... , J,). The theorem describing Syz(gj, ... ,y,)
4.3. APPLICATIONS OF GROBNER BASES OVER RINGS
233
is the analogue of Theorem 3.4.1. In our case, we will have to replace Theorem 1. 7.4 with Theorem 4.2.3. Let {g" ... , g,} be a Grübner basis in A. We let lt(gi) = CiXi for Ci ER and power products Xi. Let B = {h" ... , hg} be a homogeneous generating set of the syzygy module SYZ(lt(g,), ... ,lt(g,)) = SyZ(CIX ... , c,X,). Assume that " for dji E R and power for 1 S j S C we have that hj = (djlY,I, ... , Cj'Y,,) products Yji, where for each j we assume that hj is homogeneous of degree Zj; i.e. for all i,j such tbat dji of 0, we have XiY,i = Zj. (we assume that Y,i = 0 if dji = O.) Then by Theorem 4.2.3, for each j, 1 S j S C the generalized S-polynomial L:~l djiY,igi has the representation
,
,
Ldji0i 9i =
Lajvgl/l
i=l
1/=1
,
where
max lp(ajv) lp(gv) = lp('" dji}jigi) L-t i=l
1
< max Y,i lp(gi). 1
We now deline for 1 S j S C,
Sj = hj - (ajl, ... , aj') E At We note that Sj E SYZ(g" ... ,g,).
With the notation above, the collection {Sj Il S j S C} is a generating set for SYZ(g" ... , g,). THEOREM 4.3.15.
PROOF.
Suppose to the contrary that there exists (UI, ... ,u,) such that (UI, ... ,u,) E SYZ(g" ... ,g,) - (Sj Il S j SC).
Then we can choose such a (UI, ... , u,) with X = max'<:i",,(lp(u;) lp(gi)) least. Let
S = {i E {l, ... ,t} IIp(ui)lp(gi) = X}. Now for each i E {l, ... , t} we deline ,
Ui =
u; as follows:
{UiifirjeS
Ui - lt( Ui) if i E S.
Of course we have that for all i, lp( u;) lp(gi) < x. Now, for i E S, let lt( Ui) = C;X;. So for all i E S we have XiX; = X. Since (UI, ... ,u,) E SYZ(g" ... ,g,), we see that
L C~CiX:Xi
= 0,
iES
and
80
L <X;ei E SyZ(C1X ll ··. ,CtXt). iES
234
CHAPTER 4. GROBNER BASES OVER RINGS
,
Thus, we may write
Lc~X:ei
LVjhj)
=
iES
j=l
A. We note that, we may assume that either Vj = 0 or that Vj = bj ~ , bj E R. We can see this sinee, for each i E S, we have <XI = L~=l VjdjiYji
for sorne
Vj
E
from which we obtain, aiter multiplying through by Xi, C;X = ~~~, vjdjiZj ; and thus the power products involved are independent of i. Then we have
(Ul, ... ,Ut)
=
L<X:ei+(u~"",U~)
,
iES
L bj :
hj
+ (u;, ...
, u;)
Lbj ZBj
+ (u;, ...
,u;) + Lbj Z(aj" ... , aj').
j=l
, j=l
J
X
'
j=l
J
We define
X
J
, (w" ... ,w,) = (u;, ... ,u;) + Lbj :(aj" ... ,aj'). j=l
J
We note that (w" ... ,w,) is in SYZ(9', ... ,9,) - (Sj 1 1 :s; j (u" ... ,u,) and each Sj are in SYZ(9',··· ,9,) and (u" ... ,u,) j :s; i). We will obtain the desired contradiction by proving that
:s; i),
rt
sinee
(s,Il :s;
For ead v E {l, ... , t} we have
lp(u~
,
X
+ Lbj ZŒjv)Xv j=l
J
:s; max(lp(u~), But, by definition of u~, we have
Ip(u~)Xv
max (ZX Ip(ajv)))Xv .
l::;J~l
j
< X. Also,
Therefore lp(w v ) lp(gv) < X for each v E {l, ... , t} violating the condition that X = max'
1
1.
4.3. APPLICATIONS OF GROBNER BASES OVER RINGS
235
that S is obtained using Algorithm 4.1.1 and T is obtained by keeping track of the reductions in Algorithm 4.2.1 or Algorithm 4.2.2.) Now using Theorem 4.3.15, we can compute a generating set {SI, ... ,se} for Syz(G). Then exactly as we did in Theorem 3.4.3 we have (Exercise 4.3.12) THEOREM 4.3.16. With the notation above we have Syz(h,· .. ,l,) = (Ts I where
Tl) ...
, ...
,r 8 are the columns of 18
~
,Tsf, Tl,··.
,T s )
ç; A',
T S.
EXAMPLE 4.3.17. We go back ta Example 4.2.12. R.ecall that a Grübner basis for the ideal l = (h,h) in Z20[X, y], where h = 4xy + x and 12 = 3x 2 + y, with respect ta the lex ordering with x > y, was computed to be h, 12, 13 = 5x, 14 = 4y2 + y, is = 15y. We wish to compute a generating set for Syz(h, h h!4, Is) and Syz(h, 12). As in Theorem 4.3.15, we first compute a homogeneous generating set for Syz(lt(h), lt(h), lt(h), It(4), ItUs)) = Syz( 4xy, 3x 2, 5x, 4y2, 15y). In Example 4.2.12 we found that the syzygy module is generated by (5,0,0,0,0), (-3x,4y,0,0,0), (0,0,4,0,0), (5,0,-4y,0,0), (0, 15,-x,0,0), (0,0,0,5,0), (y,O, 0, -x, 0), (0,0,0,0,4), (0,0, 3y, 0, -x). These syzygies give rise to the following polynomials
5h
-3xh +4yh 413 5h -4yh
1512 -xh
5x=h -3x 2 + 4y2 ~ 4y2
+ Y = 14
°5x=h
514
15y = Is 5y = 31s
yh -X14 41s 3yh - xls
° °O.
Therefore
((5,0, -1, 0, 0), (-3x,4y + 1,0, -l, 0), (0, 0, 4, 0, 0), (5, 0, -4y -1, 0, 0), (0,15, -x, 0, -1), (0,0,0,5, -3), (y, 0, 0, -x, 0), (0,0,0,0,4), (0,0, 3y, 0, -x)). To compute Syz(h, 12), we first compute the matrices Sand T such that
[h 12 13 14
Is] =
[h 12] T,
[h h]=[h 12 13 h
and
is]S
CHAPTER 4. GROBNER BASES OVER RlNGS
236
It is easy to see that
We have
T(5, 0, -1, 0, 0) T( -3x, 4y + 1,0, -1, 0) T(O, 0, 4, 0, 0) T(5, 0, -4y - 1,0,0) T(O, 15,-x,0,-1) T(O, 0, 0, 5, -3) T(y, 0, 0, -x, 0) T(O, 0, 0, 0, 4) T(O, 0, 3y, 0, -x)
(0,0) (0,0) (0,0) (0,0) (0,0) (0,0) (y + 3x 2, -4xy - x) (0,0) (15y+5x 2,5x).
Finally we have J2
-
TS
=
[~ ~].
Therefore Syz(fI ,12) = ((y + 3x 2 , -4xy - x), (15y + 5x2, 5x)). Exercises 4.3.1. In this exercise we give another method for computing the ideals in Exam~ pIe 4.2.13. Wejirst observe that Z[HJ '" Z[xJ/(x 2 + 5) under the map H c--> x+ (x 2 + 5). Thus to find, for example, (2): (1 + H), we need to find the syzygies of the matrix [ 1 + x 2 x 2 + 5 J and read off the coefficients of 1 + x. We note that {1 + x, 2, x 2 + 5} is a Grabner basis and we can compute a generating set Syz(l+x, 2, x 2 +5) = {( -2, l+x, 0), (1x,-3,1),(0,x 2 +5,-2)} which yields (2): (1+ H) = (-2,1- H). Verify the statements made so far and then go on to use this method to compute ail of the ideal quotients in Example 4.2.13. 4.3.2. Generalize Exercise 4.3.1 to the case where Z[xJ/(x 2 + 5) is replaced by R[Xl1'" ,xnlj I, where R is a commutative ring and 1 is an ideal in R[Xl, ... ,x n ], that is, give a method for computing generators for ideal quotients in R[x" . .. ,xnJ/J. 4.3.3. As in Example 4.2.12, consider the ideal J ç; Z20[X, yJ generated by fI = 4xy + x and fz = 3x 2 + y. Show that f = 12x 3 y 2 - x 3 y - IOx 3 + 4x 2y2 4x 2y - 4xy3 - 4xy2 + Uxy - 6x Eland write f as a linear combination of fI and h
4.4. A PRlMALITY TEST
237
4.3.4. In Example 4.2.11 compute a complete set of coset representatives for Z[x, yI! J. Find the totally reduced form of 5x3 y 3, 3x 3 y 3 and 4x3 y 3. 4.3.5. Prove Proposition 4.3.9. 4.3.6. In Z[x,y] compute (3x-2,xy-6)n(5y-3). [Answer: (15xy-9x-lOy+ 6,5xy2 - 48xy + 27x, 10y2 - 96y + 54).] 4.3.7. Prove Proposition 4.3.11. 4.3.8. In Z[x, y] compute (3x-2, xy-6): ·(5y-3). [Hint: Look at Exercise 4.3.6.] 4.3.9. Prove Theorem 4.3.13. 4.3.10. Consider the Z-algebra homomorphism q,: Z[u, v] ---. Z[x, y] defined by q,: u c--> 3x - 2 and q,: v ---. xy - 5. Compute generators for ker( q,). 4.3.11. Consider the Z-algebra homomorphism q,: Z[u, v, w] ---. Z[x, y] defined by q,: u ---. 3x - 2, q,: v ---. 2y - 5 and q,: w c--> x - 5y. Compute generators for ker( q,). 4.3.12. Prove Theorem 4.3.16. 4.3.13. Compute generators for the syzygy module of the Grübner bases constructed in Exercise 4.2.6. Use this to construct the syzygy module for the original polynomials (of course, by unique factorization, this latter problem is trivial). Repeat this exercise for Exercises 4.2.7 and 4.2.8. 4.3.14. Compute SYZ(h,J5, 17, J9) for the Grübner basis in Example 4.2.13 (although this problem is quite doable, it is a long, messy computation). 4.3.15. The following exercise depends on Exercises 4.1.14 and 4.2.11. Generalize the following to the case of R[x1' ... , xn]-submodules of R[x1, ... , xn]m. a. Tasks (i) and (ii) at the beginning of Section 3.6. b. Task (iii) at the beginning of Section 3.6. c. Theorem 3.6.6. d. Theorems 3.7.3 and 3.7.6 (of course use the ideas in Theorems 4.3.15 and 4.3.16). e. As much of Section 3.8 as energy permits. 4.4. A Primality Test. In this section we will give an algorithm that will determine whether a given ideal l in A = R[Xl 1 • • • l Xn ] is prime or not. Recall that an ideal P is called prime if and only if given any polynomials J, 9 E R[x1, ... , x n ] with J9 E P, we have J E P or 9 E P. This is easily seen to be equivalent to the statement that the ring R[x1, ... , x n ]/ Pis an integral domain. Prime ideals are basic building blocks like prime integers are building blocks in Z. Geometrically, prime ideals in k[Xl1 ... ,xnl, where k is a field, correspond ta irreducible varieties (varieties which are not the union of two proper subvarieties) (see [CLOS]). Algebraically, any radical ideal is the intersection of prime ideals. The test for primality that we present in this section is taken from the paper of P. Gianni, B. Trager and G. Zacharias [GTZ]. Before we give the primality test, we must consider rings of fractions of the ring A. Although this is not strictly necessary, we will assume when discussing rings of fractions that R is an integral domain, since that is an that is needed in
238
CHAPTER 4. GROBNER BASES OVER ruNGS
order to obtain the primality test (even in the case where R is not an integral domain). Then A is also an integral domain. We have the field of fractions
K =
{~
1
f, 9 E A, 9 '" 0 }
of A. We consider certain subrings of K. Let S c A be a multiplicative set; that is, 1 E S, 0 ri' S, and if f, 9 E S then f 9 E S. Examples of multiplicative sets are S = {g E A 1 9 '" O}, S = {g E A 1 9 ri' P}, where P is a prime ideal in A, and S = {g" 1 vEN} for a fixed 9 E A. Given a multiplicative set Sc A we define the subring S-l A of K by
S-l A =
{~ E Kif E A and 9 ES} .
It is easy to see that S-l A is a subring of K, whlch we call the ring of fractions of A with respect to S. In the three examples above, if S = {g E A 1 9 '" A}, then S-lA = K; if S = {g E A 1 9 ri' P}, then S-lA is called the local ring at P and is denoted by A p ; and finally if S = {g" 1 VEN} then S-l A = 1 f E A and VEN} is denoted by Ag. We note that A is a subring of S-l A, since for ail f E A, we have f = and 1 E S, by assumption. Our main concern is the saturation of a non-zero ideal l of A with respect to S, whlch is defined to be S-l J n A,
{/v
f
{f
where S-l J = 1 f E J and 9 ES}. It is readily seen that S-l J is an ideal in S-l A and is, in fact, the ideal in S-l A generated by the set J. It is often also written as J(S-l A), since every element of S-l J is the product of an element of J and an element of S-l A. We show that for two multiplicative sets S we can easily compute generators for the saturation of an ideal l with respect to S, namely for the cases where S = {g"lv = 0,1, ... } (Proposition 4.4.1) and S = {r E Rlr '" O} (Proposition 4.4.4). PROPOSITION 4.4.1. Let R be an integral domain. Let 9 E A, 9 '" O. Let w be
a new variable. Consider the ideal (J, wg - 1) of A[w]. Then J Ag nA = (J, wg - 1) nA. PROOF. Let f E J Ag n A. Then there is a non-negative integer v such that g"f E J. Then w"g"f E JA[w] ç (J,wg-l) and so
f = w" g" f as desired. Conversely, let
+ (1 -
w" g")f E (J, wg - 1),
f E (J, wg - 1) nA. Then, m
f=f(X1"" ,xn )= I;gM(X1"" ,xn )hM(X1,'" ,xn,w) (h=1
+(Wg(X1"" ,xn ) -1)ho(X1,'" ,xmw) E A = R[X1,'" ,xn ],
4.4. A PRlMALITY TEST
239
where g~ Eland h~ E A[w] for all IL. Sinee the variable w does not appear in f = f(X1,'" ,xn ) we rnay substitute w = }, which shows that f E lAg. 0 Proposition 4.4.1 should be eompared with Theorem 2.2.13. Thus, in order to compute generators for the saturation of an ideal l ç A = R[X1, ... ,xn ] with respect to 8 = {g" Iv E N} we eonsider an elimination order on the variables Xl, ... 'X n1 W with w bigger than the x variables. We compute a Griibner basis, G, for the ideal (1, wg - 1) in A[w] = R[X1"" ,X n , w]. Then G nAis a Griibner basis for this saturation (Theorem 4.3.6). We will give an example which illustrates Proposition 4.4.1 in Example 4.4.6. Now let 8 = R- {a}, and let k denote the quotient field of R (i.e. k = 8- 1 R). In order to compute generators for the saturation of l with respect to S, we need two preliminary results. For an integral domain R, let 8 be any multiplicative subset of R. We note that (8- 1 R) [Xl, ... ,X n ] = 8- 1 (R[X1, ... ,X n ]) (this is an easy exercise). In this situation we see that Griibner bases are well-behaved. PROPOSITION 4.4.2. Let R be an integral domain. Let 8 c R be a multiplicative set and let 1 A be a non-zero ideal of A. Let G be a Griibner basis for 1 with respect ta some term order. Then G is a Griibner basis for the ideal 8- 11 in 8- 1 A.
c:
PROOF. This follows easily from the third eharacterization of Griibner basis in Theorem 4.l.12. Namely, let f E 8- 1 1. Then there is a s E 8 c R such that s f E 1. Suppose that G = {g" ... ,gt}. Then s f = h , g , + ... + htg, such that Ip(sf) = max'';i,;t(lp(hi) Ip(gi))' Thus f = (~h,)g, + ... + (~h,)gt. Since Ris an integral domain, we have Ip(sf) = Ip(J) and Ip(~hi) = Ip(h i ) (1:; i:; t). Thus we have a representation of f in terms of 91, ... ,gt of the desired type showing that G = {g" ... ,g,} is a Gr6bner basis for 8- 11. 0 We note that in the following lemma we do not need that R be an integral domain. LEMMA 4.4.3. Let J J=l.
c: 1 be ideals in A
and assume that Lt(I)
c: Lt(J).
Then
c:
PROOF. Of course, since J 1, we have that Lt(J) = Lt(l). We observe that the proof of Theorem 4.l.12 did not require that the set G be a finite set. Then we see that J is a (infinite) Gr6bner basis for 1, and hence by Corollary 4.l.15, J generates 1. But J is an ideal and so generates itself, that is, J = 1. 0 Reeall that for s E R we denote by R, = {:" 1 r E R, VEN}. We now give the result which, when combined with Proposition 4.4.1, allows us to compute generators for the saturation of an ideal 1 with respect to 8 = R - {a}.
240
CHAPTER 4. GROBNER BASES OVER RlNGS
PROPOSITION 4.4.4. Let R be an integral domain with kits quotient field. Let IcA = R[x], ... ,xn ] be a non-zero ideal and let G = {g" ... ,g,} be a Grobner basis for l with respect ta sorne term ordering. Set s = lc(g,) lc(g2)" ·lc(g,). Then
(4.4.1) Ik[x" ... ,xn]nR[x" ... ,Xn] =IR,[x" ... ,xn]nR[x" ... ,Xn]. PROOF. We will need leading term ideals in R[x" ... ,xn ], whieh we denote by Lt, and in Rs [x" ... ,x n ], whieh we denote by Lt s . We note that it suffiees ta show that
(4.4.2)
Lts(Ik[x" ... ,xn]nR,[x" ... ,Xn]) ÇLts(IR,[x" ... ,Xn]).
lndeed, if Equation (4.4.2) holds, we have from Lemma 4.4.3, applied ta the ring Rs[x" ... ,xn ], that Ik[x, , ... ,xn ] nR,[x" ... ,xn ] = IR,[x" ... ,xn ]. The desired result (4.4.1) follows by interseeting this last equation with R[x], ... ,xn ]. CLA]M: Lt(I)k[x" ... ,xn]nR[x" ... ,xn ] = Lt(I)Rs[x" ... ,xn]nR[x], ... , xn]. Assuming the Claim, we prove (4.4.2) as follows. We let f E Ik[x" ... ,xn ] n Rs [x" ... ,xn ]. We need ta show that It(f) is in Lt s (IR, [X" ... ,xn ]). Since, by Proposition 4.4.2, G is a Gr6bner basis for Ik[xl, ... 1 xnJ, we may write, by Theorem 4.1.12, f = h,g, + ... + h,g, where hi E k[x" ... ,xn ] and Ip(f) = max'99(lp(hi )lp(gi))' Let V = {i [ Ip(f) = Ip(hi)lp(gi)}' Then It(f) = I:iEV It(h;) It(gi)' Since f E R,[x" ... ,xn ] there is an non-negative integer l' sueh that s~ f E R[x" ... ,xn ]. We see that It(s~ f) = 2)t(s~hi) It(gi) E Lt(I)k[x], ... ,Xn]
n R[x" ... ,Xn],
iEV
and sa from the Claim It(s~ f) E Lt(I)Rs[x" ... ,xn ] n R[x" ... ,xn ].
Thus, reealling that Lt(I) = Lt( G), we may write
It(s~f)
,
=" ~Xilt(gi), ~S!/i i=l
for ai E R, non-negative integers Vi, and power produets Xi, sueh that Ip(f) = Xi Ip(gi) for ail i such that ai # O. Henee
,
It(f) =
ai L -.+-Xilt(9i) sv, p,
E Lt,(IRs[x" ... ,Xn]),
i=l
since It(gi) E Lts(IR,[x" ... ,xn ]), as desired. It remains ta prove the Claim. We will show that bath sides are equal ta the ideal, (lp(gi) [1 ::; i ::; t) in R[x" ... ,xn ]. We first note that Rs ç k and sa we have immediatelythat Lt(I)R,[x" ... ,xn]nR[x" ... ,xn ] ç Lt(I)k[x" ... ,xn]n R[x" ... ,xn ]. We write It(gi) = CiXi for Ci E R and a power product Xi'
4.4.
A PRlMALITY
TEST
241
Then, since s = aiCi for sorne ai E R, by the definition of Xi = ~(C;Xi) E Lt(I)Rs[x" ... ,xn ] and so we see that
(Xi
Il::; i
::; t)
8,
we have that
ç Lt(I)Rs[x" ... , x n ] n R[x" ... ,xn ].
It remains to show that Lt(I)k[x" .. . ,xn ] n R[x" ... ,xn ] ç (Xi 1 1 ::; i ::; t). Since Lt(I) = Lt(G) we can express each element J of Lt(I)k[x" ... ,Xn] n R[x" . .. ,Xn ] as a linear combination ofthe Xi with coefficients in k[x" . .. ,xn ]. Let eX be a term in J, where c E R and X is a power product. Then in order for c to be non-zero we must have an Xi dividing X and then we see that Lt(I)k[x" ... ,xn ] n R[x" ... ,xn ] as desired.
ç (Xi Il::; i
::; t),
0
We note that if R is a UFD, then the element s in Proposition 4.4.4 can be replaced by s = lcm(lc(gi)11 ::; i ::; t) (Exercise 4.4.1). COROLLARY 4.4.5. Let R be an integral domain in which linear equations are solvable and let k be its quotient field. Let IcA = R[x" ... ,xn ] be an ideal. Then we can compute. generators for the ideal Ik[x" ... ,xn ] n R[x" ... ,xn ]. PROOF. This follows from Proposition 4.4.4 and Proposition 4.4.1.
0
EXAMPLE 4.4.6. We consider Example 4.2.11 and compute the saturation of I = (3x 2 y + 7y,4xy2 - 5x) with respect to Q[x, y]. That is, we compute IQ[x, y] n Z[x, y]. In that example we computed a Gr6bner basis for I, with respect to lex with x> y, to be fI = 3x 2y+7y,h = 4xy2_5x, h = 15x 2 +28y2, and J4 = 28y3 - 35y. Then, following Proposition 4.4.4 and Exercise 4.4.1, we need to compute IZ 420 [X, y]nZ[x, y], since lcm(3, 4,15,28) = 420 (here, of course, we are adopting the notation of this section and 1,420 does not ruean the integers mod 420). Then, from Proposition 4.4.1, we need to compute (J" 12, 420w 1) n Z[x, y], which we do using Theorem 4.3.6. We, of course, begin with the polynomials fI, 12, h, J4 together with J5 = 420w - 1 and observe that, sinee fI,h,J3,J4 is a Gr6bner basis, we may start with (J = 5 in Algorithm 4.2.2, provided we use the lex term ordering with w > j; > y. We will not go through the computations, but will note that {f"h, h,J4, J5,525wx - xy2, 784wy2 + x 2, 525wy_ y3, 980wx+x 3 , 980wy+x 2y, 4y3 -5y, 3x3 +28xy2_28x} is a Gr6bner basis for (fI, ho 420w -1), and hence {f" 12, h,J4, 4y3 - 5y, 3x 3 + 28xy2 - 28x} is a Gr6bner basis for IQ[x, y] n Z[x, y]. Finally we are ready to discuss the issue of primality in rings. We are no longer assuming that R is an integral domain. We need two lemmas from commutative algebra which give criteria for when an ideal in R[Xl,'" ,xnl is prime.
CHAPTER 4. GROBNER BASES OVER RINGS
242
LEMMA 4.4.7. Let 1 ç R[x!, ... ,Xn] be an ideal. Then 1 is a prime ideal if and only if the image of 1 in (R/I n R)[x!, ... ,xn ] is prime. Moreover, in this case 1 n R is prime.
For r E R we let r = r+lnR be the image of r in R/lnR. For f = I::~! aiXi E R[x!, ... ,Xn ], with ai E R and power products Xi, we write 7 = I:i~! aiXi, its image in (R/I n R)[x!, ... ,xn ]. We define a ring homomorphism PROOF.
q,:
(R/lnR)[x!, ... ,xn]->R[x!, ... ,xn]/I
by
, q,:
,
LaiXi>--+ LaiXi+ I . i=l
i=l
That the map is well-defined follows since if I:;~! aiXi = 0, then ai E 1 n R for all i and so clearly I:;~! aiXi E 1. It is then easy to see that q, is onto and ker( q,) is the image of 1 in (R/lnR)[x!, ... ,Xn]. It is now immediate that 1 is a prime ideal if and only if the image of 1 in (R/ 1 n R) [X!, ... ,xn ] is prime. Finally, it is trivially seen that if 1 is prime then so is 1 n R. D LEM MA 4.4.8. Let R be an integral domain with quotient field k and let 1 ç R[x!, ... ,xn ] be an ideal such that 1 n R = {O}. Then 1 is prime if and only if Ik[x!, ... ,xn ] is prime in k[x!, ... ,xn ] and 1 = Ik[x!, ... ,xn ] n R[x!, ... ,xn ]. PROOF. We first assume that 1 is a prime ideal in R[x!, ... ,xn ]. Let f, g E k[x!, ... ,Xn] and assume that fg E Ik[x!,- .. ,Xn]. Then fg = ~ sueh that h Eland r E R. Let d, e E R be sueh that df, eg E R[x!, ... ,xn ]. Then we have r(df)(eg) = deh Eland so r(df) E 1 or eg E l, since 1 is prime in R[x!, ... ,xn ]. We then have immediately that f E Ik[x!, ... ,xn ] or g E Ik[XI ,xn ], and 80 Ik[XI ,Xn] is prime, as desired. Moreover, let ~ E Ik[x!, ... ,xn ] n R[x!, ... ,xn ] with h Eland r E R, r of O. Then r( ~) Eland so rEl or ~ El (reeall that ~ E R[x!, ... ,xn ]). By the hypothesis, InR = {O}, r fi- land 80 ~ E I. The reverse inclusion is trivial. We now assume that the ideal Ik[x!, ... ,Xn] is prime in k[x!, ... ,xn] and 1 = Ik[x!, ... ,Xn] nR[x!, ... ,Xn]. Let f,g E R[x!, ... ,xn] and assume that fg E 1. Then fg E Ik[x!, ... ,xn ] and so by hypothesis, f E Ik[x!, ... ,xn ] or g E !k[x!, ... ,Xn]. We assume that f E Ik[x!, ... ,Xn]. Then f E Ik[X!, ... ,Xn] n R[x!, . .. ,Xn ] = 1 and so wc see that 1 is a prime ideal. D l '"
1 '"
COROLLARY 4.4.9.
Let 1 ç R[x] be an ideal. Then 1 is prime if and only if
(i) 1 n R is a prime ideal in Rand (ii) if we let R' = R/lnR, k' be the quotient field of R', and l' be the image of 1 in R'[x], then l' R'[x] is a prime ideal of R'[x], and l' = l'k'[x] nR'[x]. PROOF.
Noting that l'
4.4.7 and 4.4.8.
D
n R'
= {O}, this follows immediately from Lemmas
4.4. A PRIMALITY TEST
243
Sinee R[XI' ... ,Xn] = (R[XI,'" ,Xn-I])[Xn], Corollary 4.4.9 gives us an inductive technique of determining whether a given ideal in R[Xl) ... ,xn ] is prime. The idea is summarized in the next paragraph. For a single variable x and an ideal J ç R[x] we want to decide if J is a prime ideal. By Corollary 4.4.9, we first need to determine whether J n R is prime. We may compute a generating set for J n R from Theorem 4.3.6 and then, assuming we can determine whether ideals in R are prime, we can determine whether J n R is prime. In this case we will consider the integral domain R' = RjJ n R, its quotient field k' , and the ideal l', the image of Jin R' [x]. Again by Corollary 4.4.9, we then need to determine whether l'k'[x] is a prime ideal for one variable x. Since x is a single variable, k'[x] is a PID and, using the Euclidean Algorithm (Algorithm 1.3.2) on the known generators of l', we can determine a single generator f of l'k'[x]. Now, assuming that we can determine whether f is irreducible or not in k'[x], we can determine whether l'k'[x] is a prime ideal or not. Finally, by Corollary 4.4.5 we can determine whether or not
l'
=
l'k'[x]
n R[x].
Thus we see that in order to have an algorithm determining the primality of ideals in R[Xl1'" ,xn ], we must assume that we can do similar things in ~. Specifically, we must assume that we can determine when an ideal in R is prime. Moreover, we must assume that we can determine, given a prime ideal P of R[XI" .. ,X n], whether polynomials in one variable with coefficients in the quotient field of R[XI, ... ,xn]jPare irreducible or not. For example Z and Q have these properties. The algorithm for determining the primality of an ideal is given as Algorithm 4.4.1. EXAMPLE 4.4.10. We now give an example of Algorithm 4.4.1. Let J c Q[x, y, z] be the ideal generated by fI = xz - y2, h = X3 - yz and h = x 2 y - Z2 As we saw in Exercise 2.5.5, this is the ideal of relations among the tillee polynomials t3,t4,t5 and hence must be a prime ideal (sinee then Q[x, y, z]j J is isomorphic to a subring of the integral domain Q[t]). In this example we will show that J is prime using Algorithm 4.4.1. We use the notation established in that algorithm. We first compute a Grübner basis G for J using the lex ordering with x > y > z and get G = {fI, h h f<, fs}, where f4 = xy3 - Z3 and fs = y5 - z4 We have n = 3, J4 = h = (0), h = (y5 - z4), and J, = J. We first note that the case i = 4 in the algorithm is trivial. The next case to consider is i = 3. We have h = J n
CHAPTER 4. GROBNER BASES OVER ruNGS
244
INPUT: J, an ideal of R[Xl"" ,xn ] OUTPUT: TRUE if J is a prime ideal, FALSE otherwise Set Ri := R[Xi'" . Compute Ji := J
,xn ] for i = l, ... ,n, and
n FI.; for i
= l, ... , n
Rn+1
:= R
+1
IF J n +1 is not a prime ideal of R THEN
result:=FALSE ELSE result:=TRUE i:=
n+ 1
> 1 AND result:=TRUE DO
WHILE i
R' .·-R·/J· ~
J'
:=
~
Ji-1R'[Xi-l]
k' := quotient field of R' Compute the polynomial IF
f
f
such that J'k'[Xi_l] = (I)
is not zero or irreducible over k' THEN result:=FALSE
ELSE Compute J'k'[Xi_l]
n R'[Xi-l]
IF J'k'[Xi_l] n R'[Xi_l] oF J' THEN result:=FALSE ELSE i:= i - 1
RETURN result ALGORITHM 4.4.1.
Primality Test in R[Xl, ... ,xn ]
unique factorization in Q[z]. Moreover, it cannot be the product of a cubic and a quadratic in Q(z) [y]. One way to see this is as follows. If f = y5 _z4 were the product of a cubic and a quadratic in Q(z)[y], then we would have a system of 5 polynomial equations in 5 unknowns which would have to have a solution in Q(z).
4.4. A PRlMALITY TEST
245
If one computes a Grübner basis for tbese polynomials one finds that there would . have to be a rational function e(z) E lQI(z) satisfying eS = Z8 (use lex with e > z being the smallest variables where e is the constant term of the quadratic) and again this is impossible. It is then trivial to check that (y5_ Z4)IQI(Z) [y] nlQl[y, z] = (y5 _ z4)IQI[y, z] (if( y 5 - Z4) ~~~o C>v(z)yv E (y5 - z4)IQI[y, z] with C>v(z) E lQI(z), then a simple induction shows that C>v(z) E lQI[z]). Thus we have completed the WHILE loop for i = 3 with "result=TRUE". We now consider the case i = 2. In this case J, = J. Wehave R' = lQI[y,z]f(y5Z4), J' = J((IQI[y, zl/ (y5 - z4) )Ix]) and k' = quotient field of R'. For polynomials f E lQI[x,y,z] we denote by 7 the polynomial in R'[x] = (lQI[y,z]f(y5 - z4))[x] obtained by reducing the coefficients of the powers of x (these being polynomials in lQI[y, z]) modulo (y5 - Z4); i.e. we have a homomorphism
lQI[x, y, z]
-.
(lQI[y, z]f (y5 - z4)) [x]
f
e--->
f.
We also denote the image of an ideal K C lQI[x, y, z] in R'[x] by K. We first must find a generator for J'k'[x], where J' is the ideal generated by 7,,72,73,74 E R'[x]. It is, in fact, easy to see that 72,73 ,74 are multiples of 7, in k'[x]. For example, ignoring the "bar" notation for the moment, viewing the following equation as being in k'[x], and noting that z, z2, and z3 are non-zero in R', we see that x 3 - yz = (zx - y2)( ~x2 + z'o-y 2 x + z'o-y4); that is, 72 is a multiple of 7,. Thus f = 11 and it is irreducible over k' , sinee it is of degree 1 in x. It remains to show that J'k'[x] n R'[x] = J'. We will apply Proposition 4.4.4 and Proposition 4.4.1. So we first need to compute a Grübner basis for the ideal J' = (f" 72' 73 , 74) C R'[x]. We will use Algorithm 4.2.2. We make the following general observation which is easily proved: if K is an ideal of lQI[x, y, z] containing y5 - Z4 and g E lQI[x, y, z], then K: (g) = K: (g). We consider the saturated subset {1,2} for which we need to compute (z): \il = (z) and the corresponding syzygy is
(x 2, -z) which gives the S-polynomial x 27, - Z72 = -ïlx 2 + yz2
l.!..,
O. Now the saturated subsets of {l, 2, 3} containing 3 are {l, 3} and {l, 2, 3}. For {l, 3} we compute (z): (y) = (z, y5 - Z4): (y) = (z, y5 - Z4): (y) = (2, y4) (note that this latter is just a computation in lQI[y, z] and so can be done using Lemma 2.3.11). The two syzygies are (yx, -z) and (Z3 x , -y4). The first gives the S-
+ Z3 L., O. The second gives the S-polynomial y2 z 3 x + y4z2 1>.., O. The saturated set {1,2,3}
polynomial yx7, - Z73 = _y3 x
7, - y473 = (z4 and the saturated subsets of {l, 2, 3, 4} containing 4 can be handled in the same way. After this computation, we see that a Grübner basis for the ideal J' is just the original generating set {f" 72' 73 , 74}' Then, as in Proposition 4.4.4, we set s = lea,) le(2) le(3) le(74 ) = ZÏyy3 = y4z, and we need to show that J' R;[x]nR'[x] = J'. We do this using Proposition
z3 x
y5)x 2 -
246
CHAPTER 4. GROBNER BASES OVER RlNGS
4.4.l. Thus we need ta compute
(J', wzy4 - 1) n R'[x] = (J" h, h, J., y5 - z4, wzy4 - 1) n Q[x, y, z]. This latter computation can be done in Q[x, y, z, w]. Using lex with w > x > y > z, we compute that a Griibner basis for the ideal (J" 12, 13,14, y5 - Z4, wzy4 - 1) consists of the 6 polynomials listed above together with wy3z3_X2) wyz4-x, and wzS_y. Thus (J"h,h,14,yS-z4, wzy4-1)nQ[x,y,z] = (J"h,h,f4,yS-z4) and sa (J', wzy4-1)nR'[x] = (J"h,h,J., ys-z4, wzy4-1)nQ[x,y,z] (J" 12, 13,14) = J'and the algorithm terminates with "result=TRUE." Exercises 4.4.l. Prove that if R is a UFD then in Proposition 4.4.4 we can let s = !cm(lc(g,), !c(g2),'" , !c(g,)). 4.4.2. Compute the saturation of the following ideals in Z[x, y] with respect ta Q[x, y] using Proposition 4.4.4. a. (6x 2 + y2, lOx 2y + 2xy) . . b. (3x 2 y - 3yz + y, 5x 2z - 8z 2 ). 4.4.3. Using Algorithm 4.4.1 and lex with z > y > x, show that the ideal (xzy3, yz - X2) ç Q[x, y, z] is not prime. 4.4.4. Using Algorithm 4.4.1, show that the ideal (y4 - Z3, y2 - XZ, xy2 _ Z2, x 2 _ z) ç Q[x, y, z] is prime. 4.5. Grobner Bases over Principal Ideal Domains. In this section we specialize the results of the previous sections ta the case where the coefficient ring Ris a Principal Ideal Domain (PID). Recall that an integral domain is a PID if every ideal of R is principal, that is, if every ideal of Rean be generated by a single element. We note that sueh rings are aIso Unique Factorization Domains (UFD) (see [Go, He, Hun]). Wc will make extensive use of this fact in this section and in Section 4.6. Examples of such rings include Z, Z[\I'2], Z[i], where i 2 = -1, and k[y], where k is a field and y is a single variable. Of course the theory that we have developed sa far in this chapter applies to these rings. But, because of the special properties of PID's, we will show that we may construct Griibner bases using S-polynomials as we did in the case of fields (Algorithm 4.5.1). We will then define strong Griibner bases which are similar ta Gr6bner bases when the ring R is a field, and we will show how ta compute them in Theorem 4.5.9. We will also describe the structure of strong Griibner bases in the case of a polynomial ring in one variable over a PID. This will give us a lot of information about the given ideal as we will see in Section 4.6. We will also specialize the ring R to k[y], where k is a field and y is a single variable, and we will describe the relationship among the different notions of Gr6bner bases in this case. Recal! that in order to compute a Gr6bner bases for (I!, ... , l,) in A = R[xj, ... , x n ], we need to compute a homogeneous generating set for the syzygy
4.5. GROBNER BASES OVER PRlNCIPAL IDEAL DOMAINS
247
. module, Syz(lt(j,), ... ,lt(j,)) (Theorem 4.2.3). When R is a field, we saw that a generating set B for Syz(lt(j,), ... ,lt(j,)) exists such that every element of B has exactly two non-zero coordinates (these syzygies correspond ta S-polynomials; see Proposition 3.2.3). DEFINITION 4.5.1. A generating set B oJ Syz( (lt (j,) , ... ,lt(j,)) is caUcd an S-basis if every element of B is homogeneous and has exactly two non-zero coordinates.
In general) when the coefficient ring is not a field, there is no S-basis for Syz((lt(j,), ... ,lt(j,)) (see Example 4.5.4). However, when R is a PID such a generating set exists as the next proposition shows. We assume that s > 1 in this entire section) because the case s = 1 is trivial, since any single polynomial is automatically a Grübner basis (note that this is not the case if R has zero divisors; e.g. {2x + 1} is not a Grübner basis in Z6[X]). We first prove the following identity4. LEMMA 4.5.2. Let R be a PID and let a, a" ...
(a"
,a,
be in R - {O}. Then
, ... ,a')R: (a)R = L.: ((ai)R: (a)R). i=l
PROOF. Since R is a PID, we have (see Proposition 1.3.8 for the case where
R = k[x])
(a" ... ,a,)R=(gcd(a" ... ,a'))R and (see Lemma 2.3.7 for the case of polynomial rings)
(ai)R n (a)R
=
(lcm(ai, a))R for ail i
= l, ...
,R.
Moreover, it is easy ta show that in any UFD we have lcm(gcd(a" ... ,a,), a) = gcd(lcm(a,, a), ... ,lcm(a" a)). Thus
(a" ... ,a')Rn(a)R = (gcd(a" ... ,a'))Rn(a)R
=
(lcm(gcd(a" ... ,a,), a))R
=
(gcd(lcm(a,, a), ... ,lcm(a" a)))R
,
, = (lcm(a" a), ... ,lcm(a" a)) R = L.: (lcm(ai, a)) R = L.: (a,) Rn (a)R. i=l
i=l
The result now follows easily as in Lemma 2.3.11 and Proposition 4.3.11.
0
PROPOSITION 4.5.3. Let R be a PID and let J" ... ,J, be non-zero polynomiais in R[x" ... ,xn ], with s > 1. Then Syz((lt(f,), ... ,lt(f,)) has an S-basis. 4For ideals h, ... ,le in a ring R we define the ideal L:=1 Ii to be the ideal in R generated
by the ideals Il, ... ,Il- That is,
2::=1 Ii =
f
=
E :2::=1 Ii can be written as
f
(Il, ... , le). It
/1 + ... + li
where
i8 easy to see that every element
li E Ii for each i = 1, ... ,e.
248
CHAPTER 4. GROBNER BASES OVER RlNGS
PROOF. For i = 1, ... ,s let lt(Ji) = CiXi, where Ci. E Rand lp(Ji) = Xi. For any subset J of {l, ... ,s} we deline, as before, XJ = lcm(Xj 1 j E J). We use the notation and technique presented in Theorem 4.2.9 to construct the desired generating set. Let Su = SYZ(CIXl, ... ,cuXa ) for 1 :": (J :": s. We will show by induction on (J that every Sa has an S-basis. Our induction starts at (J = 2. Then it is easy to see that S2 is generated by the syzygy ( -'XX , - -'- XX ), CIl C22 where c = lem(cl,c2)' and X = lcm(Xl ,X2 ). Now let (J > 2 and assume by induction that we have computed an S-basis 80--1 for Bd - l . \Ve now construct an S-basis Bu for Sa. Recall from Theorem 4.2.9 that Bu consists of two groups of elements. First, to each element a of Ba-l corresponds the element (a,O) in Ba. Clearly (a,O) has only two non-zero coordinates by the choice of Ba-l. The other elements in Ba_l are obtained from the ideal (Cj 1 jE J,j op (J)R: (Ca)R, where J is a saturated subset of {l, ... ,(J} containing (J. From Lemma 4.5.2 we have
(Cj 1 jE J,j
op (J)R: (Cu)R
= L(Cj)R: (Ca)R. JEJ ji-u
Nowfor jE J, j
op CT,
let dj be a generator of the ideal (Cj)R: (Ca)R. Therefore
Moreover, as in Theorem 4.2.9, associated ta each dj we have the element S"J J
= dje u X ju e· _ d X Ja e Cj
Xj
J
J
Xa
a,
since djc a E (Cj). Clearly the element sp has only two non-zero coordinates. By Theorem 4.2.9 the vectors SjJ, where jE J,j op (J, and J ranges over all saturated subsets of {l, ... ,(J} which contain (J, together with the vectors (a, 0), where a ranges over aIl elements of a generating set for 8 u- l , forrus a generating set for Sa. Therefore Sa has an S-basis. 0
We now give an example which shows that S-bases do not exist for rings which are fiot PID's. EXAMPLE 4.5.4. Consider the ring R = Z[zl. Ris not a PID, sinee, for exampIe, (2, z) is not principal. We note that Ris a UFD. Now consider the following polynomials in (Z[z]) [x, yi fI
= 2xy2 + y, 12 = zx2y + x, 13 = (2 + z)xy + 1.
We use the lex order with x > y. Then we have lt(JIl
= 2xy2, lt(j,) = zx 2y, lt(h) = (2 + z)xy.
The homogeneous elements of Syz(lt(fI), lt(J2) , lt(h)) with exactly two non-zero coordinates are multiples of SI = (-zx, 2y, 0), S2 = (-(2 + z), 0, 2y), or S3 =
4.5. GROBNER BASES OVER PRINCIPAL IDEAL DOMAINS
249
(0, -(2 + z), ZX). However, the vector (-x, -y, xy) is in Syz(lt(ft),lt(h), It(J3)) , and sa, if SI, S2, S3 generated Syz(lt(ft), lt(h), lt(h)), we would have
(-x, -y, xy)
=
h l (-zx, 2y, 0)
+ h 2( -(2 + z), 0, 2y) + h3(0, -(2 + z), zx),
for sorne hl, h 2, h3 E (Z[z]) [x, y]. Then xy = 2yh 2 + ZXh3' Note that h 2 = a2X and h3 = a3Y, for sorne a2, a3 E Z[x, y, z]. But then 1 = 2a2 + za3, and this is impossible. So Syz(lt(ft), lt(h), It(J3)) does not have an S-basis. In fact, it can be shown that if R is a UFD and if Syz(lt(ft), ... ,lt(J,)) has an S-basis for every ideall = (ft, ... ,/,) in R[Xl,'" ,Xn], then R is a PID (see Exercise 4.5.14). However, there are rings which are not UFD's for which Syz(lt(h), ... ,lt(J,)) has an S-basis for every ideal l = (fI,'" ,/s) in R[Xl"" ,Xn] (see Exercise 4.5.15). We note that the syzygies obtained in Proposition 4.5.3 are the analog of the syzygies used to deline the S-polynomials in the case where the coefficients are in a field. Indeed, iflt(Ji) = ",Xi,lt(Ii) = CjXj , where Ci,Cj ER, then the syzygy with exactly two non-zero coordinates corresponding ta these two polynomials is
where c = lcm(ci,Cj), and X and!j as (4.5.1)
= Icm(Xi,Xj ). We define the S-polynomial of fi
cX cX S(j" Ii) = -;;; X/' - Cj X j
k
Proposition 4.5.3 can then be used to modify Algorithm 4.2.1 to obtain an algorithm for computing Grübner bases in R[Xl"" ,xn ], where R is a PID. This algorithm is presented as Algorithm 4.5.1. We note that Algorithm 4.5.1 is similar to the algorithm given in the case of R = k, a field (see Algorithm 1.7.1). We now give an example of how Algorithm 4.5.1 is applied. EXAMPLE 4.5.5. We go back to Example 4.2.11 and use Algorithm 4.5.1 to recompute a Grübner basis with respect ta the lex term ordering with x > y for
1 = (fI, 12), where ft = 3x2y + 7y, 12 = 4xy2 - 5x and R = Z. We initialize G = {JI, h} and 9 = ft,h . Since 9 '" ,we choose {JI, h} E g, so that now 9 = 0. We compute C = lcm(3, 4) = 12, X = lcm(x2y, xy2) = x 2y2, so that the corresponding S-polynomial is S(Jl, 12) = 4yft - 3xh = 15x 2 + 28y 2 This polynomial cannot be reduced and we add h = 15x2 + 28y 2 to the basis, so that now G = {Jl,hh} and 9 = {{ft,h , h h . Since 9 '" 0 we choose {JI, h}, so that 9 = {{h h}}. In this case we have c = 15 and X = x2y, sa that the corresponding S-polynomial is S(Jl, h) = 5ft - yh = -28y3 + 35y. This polynomial cannot be reduced and we add !4 = - 28y 3 + 35y to the basis, so that G = {ft,hh!4} and 9 = {{hh}, ft,!4 , h!4 , h!4}}' Since
CHAPTER 4. GROBNER BASES OVER RINGS
250
INPUT: F = {f" ...
,l,} ç
R[Xl,'" ,Xn] with
li i 0 (1
OUTPUT: G = {g" ... ,g,}, a Grobner basis for (fI,··· INITIALIZATION: G ;= F, 9 := {{f, g}
Iii 9
~; ~
s)
,1,)
E G}
WHILEgi0DO Choose any {f,g} E g. Let It(f) = cf X f ,1t(g) = CgXg
g:= 9 - {{j,g}} Compute e = !cm(cf' Cg) and X = lcm(Xf' X g ) cX --1 cf Xf
eX
G
- - g ---->+ h, where h
Cg X g
.... lS mmunal wlth
respect ta G
IFhiOTHEN
9 := 9 U {{ u, h}
1
for all u E G}
G:=GU{h}
ALGORITHM 4.5.1. Grobner Bas;s Algorithm over a PJD
9 i 0 we choose {h h}, 80 that 9 = {{f" 14}, {h 14}, {ho h}}· In this case we have C = 60 and X = x2y2, sa that the corresponding S-polynomial is
Since 9 i 0, we choose {f" 14}, sa that 9 = {{h 14}, {ho 14}}. In this case, C = 84, and X = X2 y 3, sa that the corresponding S-polynomial is S(f" 14)
= 28 y2fI + 3x2/4 = 105x 2y + 196y 3 A
196 y 3
-
245y~ O.
It is easy ta see that the other two elements of 9 also give rise ta polynomials which reduce ta zero and sa do not contribute new polynomials ta the basis. Therefore we get (as we did in Example 4.2.11) that {fI, h ho 14} is a Grobner basis for J. We note that this computation required more steps than the computation of this Grobner basis did in Example 4.2.11 nsing Algorithm 4.2.2. In arder ta improve the efficiency of Algoritbm 4.5.1, Mimer gave an analogue of crit2 whicheliminates many S-polynomial computations. The interested reader should consult [Mo88]. Recal! that we defined Grobner bases in R[Xl"" ,xn] the way we did in Definition 4.1.13 because of the problem of dividing by elements of the coefficient Ting R. In the case when the coefficient ring is a PID, there is a stronger version
4.5. GROBNER BASES OVER PRINCIPAL IDEAL DOMAINS
251
of Gr6bner bases which is similar ta the one we gave when the coefficient ring was a field.
DEFINITION 4.5.6. Let G = {gI, ... ,g,} be a set of non-zero polynomials in R[XI, ... ,xn ]. Then we say that Gis a strong Grobner basis for J = (gl, ... ,g,) if for each f E l, there exisls an i E {l, ... , t} such that lt(gi) divides lt(f). We say that G is a minimal strong Grobner basis if no lt(gi) divides lt(gj) for i # j. Note that this definition does not in itself require that R be a PID. However strong Grobner bases exist onIy in the case when R is a PID (Exercise 4.5.16). EXAMPLE 4.5.7. Let R = k[y, z], where k is a field. Consider the ideal J = (x, y, z) in R[x]. This ideal does not have a (finite) strong Grobner basis. This is because there are an infinite number of non-associate irreducible polynomials in the two variables y, z in J. If a strong Grobner basis G existed, then each of these irreducible polynomials would have leading term (which is the irreducible polyuomial itself) divisible by the leading term of an element of G. This would force an infinite number of elements in G, which violates the definition of a strong Grobner basis. The following result is immeruate and its proof is left to the reader (Exercise 4.5.6). LEMMA 4.5.8. Jf G = {gl, ... ,g,} then it is a Grobner basis.
ç R[XI, ... , x n ] is a strong Grobner basis,
We now show how ta construct a strong Gr6bner basis frOID a given Grobner basis. 80 let {ft, ... , f,} be a Grobner basis for an ideal l in R[XI, . .. , x n ]. Let lt(fi) = CiXi, where Ci E Rand lp(/i) = Xi' For each saturated subset J of {l, ... , s}, let C] = gcd(cj 1 j E J) and write c] = LjE] ajcj (any such representation will do). Also, let X] = lem(Xj 1 jE J). Consider the polynomial
THEOREM 4.5.9. Let R be a PJD, and J be an ideal of R[XI, ... ,xn ]. Assume that {ft, ... , f,} is a Grobner basis for J. Then the set
{!J 1 J is a saturated subset of {l, ... ,s}} is a strong Grobner basis for 1. In particular, every non-zero ideal in R[Xl1 ... ,xnl has a strong Grobner basis. PROOF. Let 0 # f E J. Then lt(f) E Lt(ft,··. , fs) = (lt(ft), ... , lt(f,)). Let J = {i E {l, ... ,s} 1 Xi divides lp(f)}. It is clear that J is saturated and that X] divides lp(f). We also have lt(f) E (lt(/j) 1 j E J) and so le(f) = LjE] djej for sorne dj E R, from which we conclude that C] divides le(f). Therefore lt(!J) divides lt(f). D
252
CHAPTER 4. GROBNER BASES OVER RlNGS
We leave it to the exercises to show that every ideal has a minimal strong Grübner basis (Exercise 4.5.9). Together with the concept of strong Grübner bases, we could also define the concept of strong reduction. Reduction of J modulo a set F = {fJ, ... , J,} is performed if lt(f) E (lt(f,), ... ,lt(f,)) (see Definition 4.1.1). Strong reduction would require instead that an lt(fi) divides lt(f). One can show that for a set G of non-zero polynomials in R[XI,'" ,xn ], we have that every element in (G) strongly reduces to zero if and only if G is a strong Grübner basis (Exercise 4.5.1). Theorem 4.5.9 gives us a method for computing strong Grübner bases. We give an illustration in the next example. EXAMPLE 4.5.10. We go back to Example 4.5.5. We saw that a Grübner basis for the ideal l = (3x2y + 7y, 4xy2 - 5x) ç Z[x, yi with respect to the lex term ordering with x > y is {fJ, 12, h, J4}, where fJ = 3x2y + 7y, 12 = 4xy2 - 5x, h = 15x2 + 28y2, and J4 = -28y3 + 35y. FoIJowing the proof of Theorem 4.5.9 we first compute all the saturated subsets of {l, 2, 3, 4}: J, = {2}, J 2 = {3}, J3 = {4}, J4 = {l,3}, J5 = {2,4}, J6 = {1,2,3}, J 7 = {l,2,3,4}. The sets J" J 2, and J, give rise to the original polynomials h, = 12, h, = h, ha = J4. Associated with J4 we have cJ, = gcd(3, 15) = 3, and X J , = x2y, so h, = fJ. Associated with J 5 we have cJ, = gcd(4, -28) = 4, and X J , = xy3. Therefore, h, = y12· Associated with the set J 6 we have cJ, = gcd(3, 4,15) = 1, and X J , = x 2y2 Therefore h, = x12 - yfJ = x 2y2 - 5x 2 - 7y2 Associated with the set J 7 we have Ch = gcd(3,4,15,-28) = 1 and X h = x 2 y 3. Therefore h, = xy12 - y2 fJ = yh,· We see we do not need h, and h, and therefore, a strong Gr6bner basis for l is {4xy2 - 5x, 15x2 + 28y2, - 28y3 + 35y, 3x 2y + 7y,x 2y2 - 5x 2 _ 7y2}. To conclude this example we give an exarnple of determining ideal membership using strong reduction. Consider the polynomial J = 7x 2y2 - 15x 2 y - 35x 2 28xy3 + 35xy - 56y4 - 28y3 21y2. Then we have
+
-15x 2y - 28xy 3
J -5,il ~
-7y,h
---->
+ 35xy -
- 28xy3 + 35xy - 56y _56y4 - 28y3 - 28 y 3
4
-
56y4 - 28y3 + 70y2 28y3 + 70y2
+ 35y
+ 70y2 + 35y
+ 35y -:y., O.
Thus J = 7h, - 5fJ - 7y12 + (2y - I)J4 and since h, = x12 - yfJ we have J = (-7y - 5)fJ + (7x - 7y)12 + (2y - 1)f<. We note that the first reduction above could not have been done with a single polynomial from fJ, 12, h, J4' We now consider the case where R = k[y], with k a field and y a single variable. In this setting we have three concepts of Grübner bases: Grübner bases in k[y,XI,'" ,xn ] (as defined in Section 1.6), Grübner bases in (k[Y])[XI,'" ,xnl (as defined in Section 4.1), and strong Grübner bases in (k[y]) [XI, ... ,Xn] (as
4.5. GROBNER BASES OVER PRINCIPAL IDEAL DOMAINS
253
defined in Definition 4.5.6). Clearly the ring k[y,XI,'" ,xn ] is the SaIDe as the ring (k[Y])[XI"" ,Xn], but, in the latter ring, polynomials are viewed as polynomials in the variables x" ... ,Xn with coefficients in k[y]. We have seen in Lemma4.5.8 that if {g" ... ,gt} is astrong Gr6bner basis in (k[Y])[XI"" ,xn] then it is a Gr6bner basis in (k[Y])[XI'''' ,xn ]. Moreover, we saw in Theorem 4.1.18 that if {g"." , g,} is a Gr6bner basis with respect to an elimination order with the X variables larger than the y variable in k[y,XI, ... ,xn ], then it is a Gr6bner basis in (k[y])[x", .. ,x n ]. The next theorem strengthens this result considerably. We will first need a lemma. In both results we assume we have an elimination order with the x variables larger than y. We need a notation to distinguish between leading terms in k[y, x" ... , x n ] and (k[Y])[XI"" ,xn ]. We use It, Ip, le for their usual meanings in k[y, Xl,'" 1 X n ], and we will use It x1 1px'lcx for corresponding abjects in (k[Y])[XI'''' ,xn ]. 80, for example, if f E k[y, X" ... ,X n ], then lex(f) is a polynomial in y. Finally for a E k[y] we use Ity(a) for the leading term of a. Note that because of the chosen order, for f E k[y, X" ... ,xn ] we have It(f) = Ity(lcx(f)) IPx(f)· LEMMA 4.5.11. Let R = k[y]. Then G = {g" ... ,g,} ç R[XI,". ,xn ] is a strong Grobner basis if and only if (i) G is a Grobner basis in Rlxl, ... ,x n ] and (ii) for every J ç {l, ... ,t}, satumted with respect to {IPx(g,)"" ,IPx(gt)}, there exists i E J such that lcx(gi) divides lcx(gj) for ail jE J. PROOF. Let J = (g" ... ,gt). We first prove that (i) and (ii) imply that Gis a strong Gr6bner basis in R[xl, ... , x n ] (this proof is very similar to the proof of Theorem 4.5.9). Let f E J. Sinee G is a Gr6bner basis in R[xl,'" ,Xn ], we have Itx(f) E (Itx(g,)"" ,Itx(gt)). Let J = {j E {l, ... ,t} IIPx(gj) divides IPx(f)}. Then J is saturated. Moreover we have
It x (f) =
L: aj Itx (gj), JEJ
for sorne aj E R[Xl1 ... 1 xnl, which we may assume are terms with respect ta the variables Xl, ... ,Xn with Ipx (f) = Ipx (aj) Ipx (gj) for ail j E J such that aj # O. Then
lcx(f)
=
L: lcx(aj) lex(gj). JE]
Now choosing i E J as in (ii) we get lcx(gi) divides lcx(f) and sa, by definition of J, we get ltx(gi) divides \tx(f) as desired. Conversely, let G be a strong Gr6bner basis in Rix!, ... ,xn ]. Then, by Lemma 4.5.8, G is a Gr6bner basis in R[x!, ... ,xn ]. Let J be a saturated subset of {l, ... ,t}. Let c = gcd(lcx(gj) 1 j E J) and write c = 2: jE J dj lcx(gj), for sorne
254
CHAPTER 4. GROBNER BASES OVER RlNGS
dj E R = k[y]. AIso, let X = lcm(lpx(gj) 1 j E J). Now consider the polynomial X f= Ldj~( .)gj. jEJ Px gJ Note that Itx(f) = eX and f E J. Since G is a strong Grübner basis there is a gi E G such that Itx(gi) divides Itx(f). Therefore i E J since IPx(gi) divides X. Moreover, lcx(g;) divides e and we are done. 0 THEOREM 4.5.12. Let R = k[y]. Then G = {g" ... ,g,} is a Grobner basis in k[y, Xl, ... ,xnl with respect- ta an eliminaiion order with the x variables larger than y if and only if G is a strong Grobner basis in (k[y]) [Xl, ... ,xn ]. PROOF. Let us first assume that G = {gl,'" ,g,} is a Grübner basis in k[y,xl,'" ,Xn] with respect to an elimination order with the X variables larger than y. By Theorem 4.1.18 we see that Gis a Grübner basis in (k[Y])[Xl"" ,xn ]. Thus by Lemma 4.5.11 it suffices to show that given a subset J of {l, ... ,t}, saturated with respect to {IPx(g,), ... ,IPx(gt)}, there exists i E J such that lcx(g.,)·divides lcx(gj) for all j E J. So let J be such a saturated set. Let e = ged(lcx(gj) 1 j E J) and X = lcm(lpx(gj) 1 j E J). We can write e = 2..: jE s bj lcx(gj), for sorne bj E k[y]. Now consider the polynomial X
f =.Lbj~( .)gj JES
Px gJ
E (gl,'"
,gi)'
Note that Itx(f) = eX. Moreover, It(f) = lty(e)X, since we have an elimination order with the X variables larger than the y variable. Since G is a Grübner basis in k[y, Xl, ... ,X n], there exists gi E G such that It(gi) divides It(f) = Ity(e)X. But then i E J since IPx(gi) divides IPx(f) = X. AIso, Ity(lcx(gi)) divides Ity(lcx(f)) = Ity(e). Therefore, deg(lcx(gi)) S deg(e) and so the fact that c divides lcx(gi) implies that e is a non-zero constant in k times lcx(gi) , which immediately gives the desired result. Conversely, let us assume that G is a strong Grübner basis in (k[y]) [Xl, ... ,xn ]. Let f E J. Then there exists gi E G such that Itx(gi) = lex(g;) IPx(gi) divides Itx(f) = lcx(f) IPx(f)· So IPx(gi) divides IPx(f) and lcx(gi) divides lcx(f). But then It(gi) = Ity (lc x (gi)) IPx(gi) divides It(f) = Ity(lcx(f)) IPx(f). Therefore G is a Grübner basis in k[y, X" ... ,xn ]. 0 We now go back to the case of an arbitrary PID Rand conclude this section by giving a eharacterization of strong Grübner bases in R[x], where X is a single variable. This will be used in Section 4.6 to give the primary decomposition of ideals in R[x]. From Lazard [Laz85] (see also Szekeres [Sz]), we have THEOREM 4.5.13. Let R be a PJD and let X be a single variable. Assume that G = {g" ... ,gi} is a minimal strong Grobner basis for an ideal J of R[x] and
4.5. GROBNER BASES OVER PRlNCIPAL IDEAL DOMAINS
255
let lp(gi) = xv'. We further assume that we have ordered the Yi 's in such a way that VI :'0 V2 :'0 ... :'0 Vt· Let Y = gCd(Y1,'" ,Yt). Then at9 ath29
YI Y2
a3'"
Yi
ai+l ... at hi9
Yt-1 Yt
at h t-19
a2a3'"
ht9,
for i = 2, ... t. 1
PROOF. From Exercise 4,5,11, we have that
basis if and only if {
~ ,'" ,~ }
{YI" ' , ,Yt} is a strong Gr6bner
is a strong Gr6bner basis, Thus we may assume
that gCd(Y1"" ,Yt) = L Let lt(gi) = Ci XV , , where Ci E R (1:'0 i:'O t), CLAIM L VI < V2 < ' ,. < Vt. PROOF. Assume to the contrary that we have an i, 1 :'0 i < t, such that Vi = Vi+l. Let c = gcd(Ci, Ci+l) and write c = biG + bi + 1Ci+l, for sorne bi1 bi + 1 E R. Consider the polynomial h = biYi + bi+1Yi+1' Then h E J and lt( h) = ex v, . Sinee Gis a strong Gr6bner basis, there exists a j E {l, ... ,t} sueh that lt(Yj) = CjX Vj divides lt(h) = ex"'. Then Vj :'0 Vi' But then lt(Yj) divides lt(Yi), and so, byour assumption that G is minimal, we have that i = j. Henee c = Ci, and lt(Yi) divides lt(Yi+1)' This contradiets our assumption that G is minimal. CLAIM 2. Ci+l divides Ci, for i = 1, ... l t - 1. PROOF. Let i E {l, ... ,t -1}. As in Claim 1, let e = ged(ci,C;+1) and write c = biCi + bi + 1 Ci+1, for sorne bi, bi + 1 E R. Now consider the polynomial h = bixv,+,-v'Yi + bi+1Yi+1 E J. Note that lt(h) = ex v"" Since G is a strong Gr6bner basis, there exists aj E {l, .. , ,t}, such that lt(Yj) = CjX Vj divides lt(h). Thus Vj :'0 Vi+1, and so j :'0 i + 1, and ej divides c, and so lt(YJ) divides lt(Yi+1)' Since G is minimal, we have that j = i + 1, and so Ci+l = C = gcd(Ci, ci+d. Therefore C;+1 divides c;. C·
CLAIM 3.
-'-Yi+1 E (YI, ... ,Yi), for i = 1, ... ,t - L Ci+l
PROOF,
e·
Let h = -'-Yi+1 - xVH,-v'Yi, Note that h E J and that lp(h) < CHI
lp(Yi+1)' Sinee lp(Yi+1), '" ,lp(Yt) are larger than lp(h), and since h ~+ 0, the only polynomials that cau be used to recluce h to zero are 9b ... 19i. Therefore hE (gl)'" ,9i)' and hence CLAIM 4,
YI E R.
80
is
~9i+l' Ci+l
CHAPTER 4. GROBNER BASES OVER RINGS
256
PROOF. Let C(91) E R be the greatest cornmon divisor of the coefficients of the powers of x that appear in 91. Then 91 = C(91)P(91), where p(91) E R[x] and the greatest cornmon divisor of the coefficients of P(Y1) is 1. We show by induction on i that P(91) divides Yi. The case i = 1 is clear. Now assume that
P(Y1) divides YI, 92, ... ,Yi· Since ~9i+1 E (91, ... ,Yi) by Claim 3, we see that Ci+1 P(Y1) divides
~YH1. Ci+l
But gcd
(~,P(Y1))
= l, since any cornmon factor of
Ci+l
~ and p(91) would have to be in Rand would have to be a factor of the Ci+l
coefficients of powers of x appearing in P(Y1)' Therefore P(Y1) divides Yi+1' Now, since P(Y1) divides every Yi, P(91) must be 1 because of the assumption that the y;'s have no factor in cornmon. Therefore YI = c(y,) E R. CLAIM 5. Ci divides Yi for ail i = l, . . . ,t and ct = 1. PROOF. We use induction on i. The case i = 1 is clear, sinee 91 = CI. Assume that Cl divides 91, C2 divides 92, ... l Ci divides 9i. Then, by Claim 2, we have that Ci divides Y1,Y2,'" ,9i' Since ~9H1 E (YI, ... ,Yi) by Claim 3, we see Ci+1 that Ci divides ~gi+l' Therefore Ci+l divides 9i+l. Now, sinee Ci divides gi Ci+1 for each i, and sinee Ct divides Ci for each i, we see that Ct divides 9i for each i. Therefore Ct = 1 because of the assumption that the Yi '8 are relatively prime. C· y. Now set ai+l = - ' - and hi = ---=-. Then we have Ci
CHI
y,
cth, = h,
Y'-l
ct-l h t-l =
C'_l h ct-t-l
Y'-2
ct-2 h t-2 =
cC'_2 t - -C'-l - - h t-2
ct
Ct-l
ct
h
= at t-l
h
= at_lat t-2
9i Y2 YI
a3'" at_lath2
a2a3' .. at·
6. hi+l E (hil aihi-l,··· ,a3'" aih21 a2'" ai}. PROOF. By Claim 3 we have ~Yi+1 E (91,'" ,9i)' that is, CLAIM
Ci+l
The result follows after dividing by aH1 ... a,. The Theorern is now completely proved. 0 We note that the converse is also true. That is, any set of polynomials YI, ... ,y, in R[x] which satisfies the conditions in Theorem 4.5.13 is a strong Grübner basis and is minimal if no ai is a unit (Exercise 4.5.12).
4.5. GROBNER BASES OVER PRINCIPAL IDEAL DOMAINS
257
COROLLARY 4.5.14. Let k be a field and let y, x be variables and set A = k[y, x]. We assume that we have the lex term ordering with y < x. Let G = {g" ... ,gt} be a minimal Grobner basis for an ideal lof k[y, x] and let IPx(gi) = We assume that we have ordered the 9i 's in Buch a way that Vt· Let 9 = ged(g" ... ,g,). Then XVi.
for i
=
E
a2a3'"
9i
ai+l ... athi9
9t-l
a,ht - 1 9 h'9,
k[y], lt(hi ) = xv;, VI <
:S
V2 :::; ... ::::;
at9
91 92
a3'" ath29
9t
where ai
VI
1/2
< ... < V" and
2, ... ,t.
PROOF. We have from Theorem 4.5.12, that G is a strong Grübner basis in (k[y])[x] and so the result follows immediately from Theorem 4.5.13. 0 EXAMPLE 4.5.15. We consider the ring R = lQI[y] and the ideal l = ((x + y)(y2 + 1), x 2 - X Y + 1) in R[x]. We compute a strong Grübner basis for l by computing a Grübner basis for l viewed in lQI[x, y] witb respect to the lexicographie arder with x > y (Theorem 4.5.12). We find
+
91 92
93
= = =
y4 + 2y 3 + 2y2 + 2y + 1 = (y + 1)2 (y2 + 1) = a2 a3 xy2+x+y3+y=(y2+1)(x+y)=a3h2 x 2 - X + Y + 1 = h3·
EXAMPLE 4.5.16. Consider the three polynomials in Z[x]
fI 12 h
= = = =
630x - 630 = 9 . 5 . 14(x - 1) = a2a39 70x 2 + 70x - 140 = 5(x + 2) . 14(x - 1) = a3h29 14x4 + 70x 3 + 196x 2 - 70x -210 (x 3 + 6x 2 + 20x + 15) . 14(x - 1) = h39.
It can be easily verified nsing Lemma 4.5.1l that {J" 12, h} is a strong Grübner basis. We also note that the fi's have the form required in Theorem 4.5.13, since h3 = x 3 + 6x 2 + 20x + 15 = (x + l)(x + 2)(x + 3) + 9(x + 1) E (9, x + 2) and so it also follows from the COnverse of Theorem 4.5.13 (Exercise 4.5.12) that {J" h h} is a strong Grübner basis.
258
CHAPTER 4. GROBNER BASES OVER RlNGS
Exercises 4.5.1. We deline strong reduction in Rix" ... , xn], for R a PID, with respect to a set F = {J" ... , J,} of non-zero polynomials in Rix" ... , x n ] as
follows. For J, 9 E Rix" ... , x n ] we write J --"--" 9 provided that for sorne i,1 ::; i ::; s we have lt(fi) divides lt(f) and 9 = J - l~tU;)) k We write f ~+,s g, as lisuaI, when we iterate the preceding. Show that the following are equivalent for a set G = {g" ... , g,} of non-zero polynomials in Rix" ... , x n ] where we set J = (G). a. G is a strong Gr6bner basis for J.
e
b. For ail J E J we have J -->+" O. 4.5.2. Show that in Exercise 4.5.1 the following statement:
4.5.3,
4.5.4.
4.5.5.
4.5.6. 4.5.7.
4.5.8. 4.5.9. 4.5.10.
4.5.11.
(see Equation (4.5.1)) does not imply that G is a strong Gr6bner basis. IHint: Look at the polynomials Ir = 2x + 1 and h = 3y + x in Zlx, y].] Prove the converse of Theorem 4.5.9. That is, prove that if the set of !J delined there is a strong Gr6bner basis for J then {Ir, ... , J,} is a Gr6bner basis for J. Let J,g E Rix" ... ,xn], with J,g oF 0, and let d = gcd(f,g). Prove that {J,g} is a Gr6bner basis if and only if gcd(it(~),lt(~)) = 1. (This is the analog of crit1.) IHint: Follow the proof of Lemma 3.3.1.] For the ring R = Z use Algorithm 4.5.1 to compute a Gr6bner basis for the ideals generated by the given polynomials with respect to the given term order. a. Ir = 2xy - x, h = 3y - x 2 and lex with x < y. b. Ir = 3x 2y - 3yz + y, h = 5x 2z - 8z 2 and deglex with x > y > z. c. Ir = 6x 2 + y2, h = lOx 2y + 2xy and lex with x > y. Prove Lemma 4.5.8. For the ring R = Z use Theorem 4.5.9 to compute a strong Gr6bner basis for the ideals generated by the given polynomials with respect to the given term order in the exerCÎses in Exercise 4.5.5. Compute Ji strong Gr6bner basis for the ideal in Zlillx, y, z] in Exercise 4.2.5. Prove that every non-zero ideal of Rix" ... , Xn], where R is a PID, has a minimal strong Gr6bner basis. Show that for the strong Gr6bner basis constructed in Exercise 4.5.7 (for Exercise 4.5.5 part cl, -30x 3 y2 + 6x 3 - 5x 2y 3 - 6x 2 y2 + 16x2y + 5xy3 + xy2 + 2xy - 5y 5 + y" strongly reduces to zero. Prove that if {g" ... , g,} is a set of non-zero polynomials in Rix" ... , x n ] where R is a PID and 9 = gCd(gl,'" , g,), then {gl, ... , g,} is a strong Gr6bner basis if and only if {~, ... , is a strong Gr6bner basis.
a:}
4.6. PRIMARY DECOMPOSITION IN Rix] FOR R A PID
259
4.5.12. Prove the converse of Theorem 4.5.13 as stated immediately aIter the proof of Theorem 4.5.13. 4.5.13. Verify that the Grobner bases for (_x 2 - xy + x 2 y2 + xy3, -5y + 5xy3xy2 + 3x 2 y2) ç: Q[x, y] with respect ta the lex ordering with x > y has the form stated in Corollary 4.5.14. 4.5.14. Prove that if R is a UFD and iffor every {!J, ... , f,} ç: R[x" ... ,xn ] we have that Syz(lt(!J), ... ,lt(f,)) has an S-basis, then R is a PID. [Hint: If R is not a PID, then there exist a, bER such that gcd(a, b) = 1 and 1 rt (a, b). Note that (l, l, -1) E Syz(a, b, a + b).] 4.5.15. Show that if R is a Dedekind Domain, then for every {J" ... , fs} ç: R[x" . .. , x n ] we have that Syz(lt(fIl, ... ,lt(f,)) has an S-basis. [Hint: Prove the identity in Lemma 4.5.2 and follow the proof of Proposition 4.5.3 (note that more than one dj may be needed).] 4.5.16. Prove that if R is a UFD and iffor every non-zero ideal l ç: R[x" ... , x n ], l has a strong Grobner basis, then R is a PID. [Hint: Use the idea in Example 4.5.7. Assume the facts that if every prime ideal in a UFD is principal then it is a PID, and that every non-principal prime ideal in a UFD cantains an infinite number of non-associate irreducibles.] 4.6. Primary Decomposition in R[x] for R a PID. In this section we follow Lazard [Laz85] and use the results of Section 4.5 to "decompose" ideals in R[x], where R is a PID and x is a single variable. The decomposition we have in mind is one similar ta the decomposition of natural numbers into products of powers of prime numbers. In our setting, the analog of a product is an ideal intersection, and the analog of a prime integer is a prime ideal. Recall that an ideal P in a commutative ring A is prime if f g E P implies that either f E P or g E P, or equivalently, if the set S = A - P is a multiplicative set (see Section
4.4). We will need the following elementary fact about Noetherian rings. 4.6.l. Let A be a Noetherian ring and let S be any non-empty set of ideals of A. Then S contains a maximal element, i. e. there is an ideal lES such that there is no ideal J E S such that l ç J. LEMMA
We first consider the decomposition of the radical Recall from Definition 2.2.4 that
.Ji = LEMMA
{f E
.Ji for
an ideal l in A.
Air El for sorne vEN }.
4.6.2. Let l be an ideal in a Noetherian ring A. We have
.Ji=
n
P.
lep p
prime ideal
260
CHAPTER 4. GROBNER BASES OVER RINGS
PROOF. The inclusion
Vi ç
nP
follows from the fact that if a prime
ICP P prime ideal ideal P contains I, it also cantains Vi. We now consider the reverse inclusion. Sa let us assume that there is an element f in P - Vi. Consider the set S = {Iv 1 v = 0,1, ... }. Note that
n
[CP
prime ideal M E I, Sn Vi = 0, for otherwise, if E Vi for sorne v E N, then (JV)M = for sorne i" E N, and this implies that f E VI, which is a contradiction. Now consider the collection S of all ideals of A which contain VI and have empty intersection with S. Clearly S is non-empty, since VI E S. By Lemma 4.6.1, there exists an ideal MES which is maximal in S. In particular, VI ç M and M n S = 0. We now prove that M is a prime ideal. Let gh E M and assume that neither 9 nor h is in M. By maximality of M we have P
r
(g, M)
r
n S of 0 and
(h, M)
n S of 0.
Therefore there exist v, VI E N, m, m' E M, and a, a' E A Bueh that
ag +m But then
-
= rand a'h+m' = r'.
-'. n
r+ v ' = (ag +m)(a'h + m') = (aa') (gh) + (ag + m)m' + (a'h)m E Mn S, ES
EM
EM
'
which is a contradiction, and 80 M is prime. Now we have a prime ideal M which contains VI and henee I, sa f E P ç M byassumption. But Mn S = 0, ICP
prime ideal and we 0 btain a contradiction. D P
In view of the above lemma, one might think that any ideal in Rix] ean be decomposed as the intersection of powers of prime ideals. This is not the case as the following example shows. EXAMPLE 4.6.3. ConsidertheidealQ = (4,X2) inZlx]. Anyprimeideal which contains Q must contain both 2 and x, and hence must be equal to M = (2, x), sinee M is a maximal ideal of Zlx] (sinee Zlx]/(2, x) "" Z2). Therefore, if Q were the intersection of powers of prime ideals, it would be a power of (2, x). But M3 ç: Q ç: M 2 80 Q cannot be a power of M. The correct analog to powers of primes is the following. DEFINITION 4.6.4. An ideal Q of A is called primary if Q then either f is in Q or some power of 9 is in Q.
of A
and if fg E Q
It is easy ta see that prime ideals are primary. However 1 powers of prime ideals need not be primary (see IAtMD]) although, using Lemma 4.6.13, we see that powers of maximal ideals are primary.
4.6. PRIMARY DECOMPOSITION IN Rlxl FOR R A pm
261
EXAMPLE 4.6.5. The ideol Q given in Example 4.6.3 is primary. To see this, let fg E Q = (4, X2), and let f cf' Q. It is easy to see that we can write f = ht+rf, and 9 = hg + r g , where ht, hg E Q, and rt = atx + bt , rg = agx + bg, where at, ag,bt , bg = 0, 1,2, or 3 (notethat rf andrg are the totally reduced remainders of f and 9 as defined in Definition 4.3.2). Moreover, since f cf' Q, we have that either af or bj is not equol to O. Since fg E Q, 4 divides atbg + agbf and bjbg. Note that if bg = 0, then g2 E Q, and we would be done. So we may assume that bg # O. If bf = then 4 divides afb g, and 80 since af and bg are non-zero, we have af = bg = 2 giving g2 E Q. Otherwise, bf # and so bj = bg = 2, so again we have g2 E Q.
°
°
LEMMA 4.6.6. If Q is a primary ideal in a Noetherian ring A, then yfQ is a prime ideal. M oreover yfQ is the smallest prime ideal containing Q. PROOF. Let fg E yfQ. Then (fg)" = rg" E Q and so r E Q or (g")!' = 9"!' E Q. Therefore f E yfQ or 9 E yfQ and yfQ is a prime ideol. The second statement follows from the fact that any prime ideal containing Q also contains yfQ. 0 DEFINITION 4.6.7. IfQ is primary and yfQ = P, we say that Q is P-primary. We cau now define what we ruean by decomposition.
DEFINITION 4.6.8. Let 1 = n~~l Qi, where Qi is Pi-primary for each i. We cali n~~l Qi a primary decomposition of 1. If, in addition, the Pi are ail distinct and for aU i, 1 :S i :S we have n#i Qj 't Qi, we caU the primary decomposition irredundant. In this latter case, the ideal Qi is said to be the primary component of 1 which belongs to Pi, and Pi is said to be a prime component of 1.
r,
It is easy to prove that if Q" ... ,Qr are ail P-prmary then n~~l Qi is also P-primary. Given this, and the obvious statement that we can remove superfluous Qi, we see that any ideal that has a primary decomposition olso has an irredundant primary decomposition. We note that in Lemma 4.6.2 we gave a primary decomposition of VI. Also, in Example 4.6.5, the ideal Q was primary, and thus was its own primary decomposition. In general, we have
THEOREM 4.6.9. Every ideal in a Noetherian ring A has a primary decomposition. PROOF. The key to the proof is the concept of irreducible ideals. An ideal 1 is irreducible if 1 = Ir n 12 implies that 1 = Ir or 1 = I,. The proof of the theorem is done in two steps. First we prove that every ideal in A is a finite intersection of irreducible ideals, and then we show that every irreducible ideol is primary.
262
CHAPTER 4. GROBNER BASES OVER RINGS
Let S be the collection of ail ideals in A which axe not the intersection of a finite number of irreducible ideals. Assume ta the contrary that S is not empty. By Lernma 4.6.1 there exists a maximal element M in S. Now, sinee M is in S, M is not irreducible. Therefore there exist ideals M, and M 2 such that M # M" Mol M 2 and M = M, n M 2 . Thus M ç: M" M 2 . By the maximality of M, we have that M, and M 2 are not in S. Therefore they are both a finite intersection of irreducible ideals. But then M is also a finite intersection of irreducible ideals. This is a contradiction, and therefore S = 0. Now let 1 be an irreducible ideal. We will show that 1 is primary. Sa let f 9 E 1 and let f rte 1. Consider the ascending chain
1: (g) ç 1: (g2) ç ... ç 1: (g') ç .. ' . Since A is Noetherian we have that for sorne e, 1: (g') = 1: (g'+'). CLAIM. (I + (g')) n (1 + (f)) = 1. PROOF. Clearly 1 ç (1 + (l)) n (1 + (f)). For the reverse inclusion, let '" + rg' = fJ + sf E (I + (g')) n (1 + (f)), where "',fJ E 1, r,s E A. Then rg'+' = '-v------' -g'" + gfJ + _sfg E 1. Therefore r E 1: (g'+') = 1: (g'). But then El
El
'" + g'r E 1, and the Claim is proved. Now by the assumption that 1 is irreducible, either 1 = 1 + (f) or 1 = 1 + (l). Since f rte 1, we have 1 = 1 + (g'), and hence g' E 1. Therefore 1 is primary. 0 We note that the preceding Theorem is purely existential, that is, it gives no indication how ta go about computing the primary decomposition of a given ideal in same specifie ring. There has been mueh work done on this problem, see [GTZ, EHV]. The main purpose of this section is ta use Theorem 4.5.13 ta show how ta do this explicitly in the ring R[x], where R is a PID in which certain computability assumptîons must be made. Namely, we assume that linear equations are solvable in R, that we can factor in R, and that given any prime element uER we can factor in (RI (u) )[x]. Examples of such rings include Q[y], Z and (ZlpZ) [y] for a prime integer p. Let 1 # R[x],{O} be an ideal in R[x] and let {g" ... ,gt} be a minimal strong Grübner basis for 1. For simplicity we will assume that 9 = gcd(g" ... ,g,) = l. In this case we say that the ideal l is zero-dimensionaI 5 . It follows from Theorem 2.2.7 and Corollary 4.5.14 that this definition coincides with the one given before for the special case of lQi[x, yi. We will be using Theorem 4.5.13 and the notation set there.
5For certain PID's, R, this definition is more restrictive than the one usually found in the literature.
4.6. PIDMARY DECOMPOSITION IN Rix] FOR R A pm
263
That is, we have 91 92
a2a3 ... at a3' .. ath2
9i
aHl ... athi
gt~l
atht-l
9t
hl,
for i = 2, ....' t. We first show how to compute ail of the prille ideals containing 1. THEOREM 4.6.10. Let l, P be ideals in Rix] with 1 zero-dimensional and P prime. Then, with the notation above, (i) 1 c;: P if and only if there exists i ::0: 2 such that (ai, hi) c;: P; (ii) Let i E {2, ... ,t}. If (ai, hi) c;: P then P = (u, v), where u is an irreducible factor of ai and v is an irredueible factor of hi modulo u; (iii) If 1 c;: P, then P is maximal. PROOF. To prove (i), let 1 c;: P. Then 91 = a2a3'" at E P, so that there exists i ::0: 2 such that ai E P, and we choose i largest with that property. Now gi = ai+l ... athi E P, but ai+l ... at '1:. P by the choice of i, and 80 hi E P. Therefore (ai, hi) c;: P. For the converse, let i ::0: 2 and assume that ai, hi E P. First note that for all j = 1, ... ,i we have 9j E (ai, hi)' Now we show that for j = i+ 1, . '. t we have gj E (ail hi). Since hj E (h j - l , aj_ 1h j - 21 •. . ,a2' .. aj-l), it is an easy induction on j ta show that for j = i + 1, ... ,t we have J
(hj_l' aj_ I hj _ 2 , ... ,a2'" aj_l)
c;: (ai, hi)'
Thus, since 9j is a multiple of hj we have 9j E (ai, hi) for j = i + 1, ... , t. We now see that 1 c;: (ai, hi)' So, since (ai, hi) c;: P, we have 1 c;: P. We now prove (ii). Let (ai, hi) c;: P. Since ai E P, an irreducible factor of ai, say u, is in P. Note that the ideal (u) is now a maximal ideal of R, and hence R = R/ (u) is a field and Rix] is a PID. Let P be the image of P in Rix]. Then, since hi E P, P # {O} and so P is a maximal ideal of Rix]. Since Rix] is a PID, we see that P is generated by an irreducible polynomial il E Rix], that is, P = (il). The image hi of hi is in P, and so il is an irreducible factor of hi. Let v E Rix] be a pre-image of il. Since il E P, and sinee u E P, we see that v E P. Therefore (u, v) c;: P. We also have
RlxJl(u,v) '" Rlx]/P = Rlx]/(il).
CHAPTER 4. GROBNER BASES OVER RlNGS
264
Since R[xl/P is a field, we have that R[x]/(u,v) is a field and so (u,v) is a maximal ideal. Therefore P = (u, v). Statement (iii) is now immediate. 0
,
LEMMA 4.6.1l. Let J be an ideal in a Noetherian ring A Buch that every prime
ideal containing J iB maximal. Let J =
n
Qj be a primary decomposition of J,
j=l
where Qj is primary and VQ; = Mj is maximal. Then for any maximal ideal M Buch that J ç M, there exisls j E {l, ... , e} such that M = MJ' PROOF. We have
Since l
ç
M, we have
,
,
IlMJçnMjçM. j=l
j=l
Therefore there exists j E {l, ... ,e} sueh that Mj M=Mj. 0
ç M.
But Mj is maximal, so
Therefore, to compute the primary decomposition of the zero-dimensional ideal J ç R[x], we first have to determine al! the maximal ideals containing J. To do this we find all maximal ideals which contain (ai, hi)' for eaeh i = 2, ... ,t. Theorem 4.6.10 gives an explicit method for finding al! such maximal ideals: given {gl,'" , g,} a strong Grobner basis for J as in Theorem 4.5.13, for each i = 1, ...
,t
(i) compute the irreducible factors of ai; (ii) for each u computed in (i), compute the irreducible factors of hi modulo u;
(iii) each u, v computed in (i) and (ii) respectively gives rise to a maximal ideal containing J, namely M = (u, v). Note that the method presented above gives, in fact, a way to compute the primary decomposition of .,fI for a zero-dimensional ideal J (combining Theorem 4.6.10 and Lemma 4.6.2). We give an example to show how this method is used. EXAMPLE 4.6.12. We go back to Example 4.5.15 where R = Q[y] and J = ((X+y)(y2 + 1),x 2 -x+y+1). We computed a strong Grobner basis for J to be
gl g2 g3
= =
y4 + 2y3 + 2y2 + 2y + 1 = (y + 1)2(y2 + 1) xy2+x+y3+y=(y2+1)(x+y)=a3h2 x2-x+y+l=h3'
= 02a3
Note that the greatest common divisor of gl, 92 and 93 is l. We find al! maximal ideals containing J. 50 we find those maximal ideals which contain (02, h 2 ) ((y + 1)2, X + y) and those which contain (a3' h 3) = (y2 + 1, x 2 - X + Y + 1).
4.6. PRlMARY DECOMPOSITION lN Rix] FOR R A PID
265
Maximal idealB which contain ((y + 1)2, X + y): The only irreducible factor of (y + 1)2 is u = Y + 1. Also, x + y == x - 1 (mod y + 1) is irreducible. We let v = x - 1. Therefore the only maximal ideal which contains ((y + 1)2 ,x + y) is M , =(y+1,x-1). Maximal ideals which contain (y2 + l, x 2 - X + Y + 1): Clearly u = y2 + 1 is irreducible in Q[y]. Now we find the irreducible factors of x 2 - x + y + 1 modulo u, or equivalently, we find the irreducible factors of the image of x 2 - x + y + 1 in the ring R[x], where
R= R/(u) = Q[y]/(y2 + 1) = Q[i], where i = Fr. The image of x 2 - x + y + 1 in (Q[i]) [x] is the polynomial x 2 - x + i + 1. It is easy to see that this last polynomial can be factored as
x2 - x
+i + 1 =
(x - i)(x + i - 1),
and each ofthe factors in the right-hand side polynomial is irreducible in (lQI[i])[x]. We find pre-images of these factors in R[x] and we have
x2 -
X
+ Y + 1 ==
(x - y)(x + y - 1)
(mod y2
+ 1).
Therefore we have two maximal ideals in R[x] containing (y2 + l, x 2 -
M 2 = (y2
+ l, x
- y), and M3 = (y2
+ l, x + Y -
X
+ Y + 1):
1).
We now have the primary decomposition of ..fi:
..fi = M, nM2 nM3 = (y+ 1,x-1)
n (y2 + l,x-y) n (y2 +1,x+y-1).
Since we nQW can compute aU the maximal ideals containing I, we need to find the primary ideals which correspond to each maximal ideal in order to compute the primary decomposition of J. We do this in two steps. We first give a criterion to determine whether a given ideal Q is M-primary. We then give a criterion to determine which of the M-primary ideals belong to the primary decomposition of J. LEMMA 4.6.13. Let A be a Noetherian ring. Let M be a maximal ideal of A and let Q ç: M be an ideal of A. FurtheT assume that faT each m E M there exists l/ E N such that m" E Q. Then Q is M -primary.
PROOF. We first prove that ,fi:J = M. Since M is prime and Q ç: M, we have ,fi:J ç: M. Moreover, if m E M then m E ,fi:J, sinee a power of m is in Q, and thus M ç: ,fi:J as weIl. It remains to show that Q is primary. Let fg E Q and f ct Q. We show that 9 E M. Suppose to the contrary that 9 ct M. Then there exist h E A and m E M such that hg + m = l, since M is maximal. Let l/ E N such that m" E Q. Then
1 = 1" = (hg
+ m)" =h'g+m",
CHAPTER 4. GROBNER BASES OVER RlNGS
266
for sorne h' E A. Then f = h' gf gEM. 0
+ m" f
E Q, a contradiction.
Therefore
,
LEMMA 4.6.14. Let A be a Noetherian ring and let l be an ideal of A. Let
1=
n
Qi be a primary decomposition of l such that Qi is Mi·primary with Mi
i=l
maximal. Then for j = 1, ... ,
e
Qj={!EAII: (j)çtMj }. PROOF. Let jE {1, ... ,Cl. We denote {! E A 1 J: (j) çt M j } by Qj. Let f E Qj. Then there exists 9 E A such that 9 ~ Mj = VQ; and fg E J ç: Qj. Since Qj is primary, either f E Qj or a power of 9 is in Qj. But since 9 ~ VQ;, we must have f E Q j. Therefore Qj ç: Q j. For the reverse inclusion let f E Qj. For each i E {l, ... , Cl, i '" j, there exists Si E Mi - Mj, since Mi and Mj are distinct maximal ideals. Since Mi = VlJi, there exists Vi E N such that sr; E Qi. We deline s=
,
,
i=l i-:pj
i=l
II sr; E II Qi· i-:pj
Note that s ~ Mj by construction. Then f s E S
E J: (j) and s ~ M j , sa that
f
E Qj.
,
,
i=l
i=l
II Qi ç: nQi = J. Therefore
0
We now return ta the case where A = R[x] where R is a PID. We will use the above ta give a method for computing the primary decornposition of J. Sa let M = (u, v) be a maximal ideal which eontains (ai, hi), as obtained in Theorem 4.6.10. Then u is an irredueible factor of ai and so u divides g, = a2··· a,. Let m be the largest integer such that u m divides g, (sa m ;0. 1). Now, we know that the image il of v in the ring (RI (u) )[x] is an irredueible factor of the image hi of hi. Therefore il divides the image ?f, of g" sinee g, = h, E (ai, hi) and u divides ai. Let n be the largest integer such that il n divides ?f, (note that n ;0. 1). Then we have g, '= vnw (mod u), for sorne w E R[x], sueh that the image {jj of w is not divisible by il. THEOREM 4.6.15. We use the notation above. Let V, W E R[x] be such that
g, '= VW
(mod u m
),
V '= vn
(rnod u), and W '= w
(mod u).
Then Q = (U m ,g2, ... ,g'-l, V) is M-primary and is the M-primary component of J. PROOF. We first note that l ç: Q. Indeed, we can write g, = VW + hum, for sorne h E R[x], and sa g, E Q. Moreover g, E Q, since u m divides g,. Therefore J
ç: Q.
4.6. PRIMARY DECOMPOSITION IN Rix] FOR R A PID
267
We now show that Q is M-primary using Lemma 4.6.13. Clearly Q ç M, sinee u m , V, and 92, ... ,9t-l are in M (reeaU that we noted in the proof of Theorem 4.6.10 that g2, ... , g,_, E (ai, hi) ç M). To conclude, it is sufficient to show that sorne power of u and v are in Q. Clearly u m E Q. Now we can write V = v n + h' u, for sorne h' E R[x]. Therefore v n = V - h' u, and hence v nm = (V - h' ulm E Q. We now show that Q is the M-primary component of J using Lemma 4.6.14. We deline Q' = {J E R[x] 1 J: (1) rt M}. Let f E Q' and let 9 E J: (1) - M. Then fg E J ç Q. Since Q is M-primary, either f E Q or gV E Q, for sorne vEN. But if gV E Q, then 9 E ,jQ = M, a contradiction. Therefore f E Q, and Q'ÇQ. We now prove the reverse inclusion. First note that ~ is relatively prime to m
u
u because of the choice of m. Therefore ..!!2... fj. M, for otherwise, sinee U E M, . um we would have 1 E M (recall that g, and u are in Rand that R is a PID). Also, um = E J, and so u m E Q'. Now let j be such that 2 <: j <: t - 1. Then u 1 ct M, and 19j E J, so that gj E Q'. It remains to show that V E Q'. To see this we note that W ct M, for otherwise, W = hu + h' v for sorne h, h' E R[x], and so w '" W '" h' v (mod u). But this means that divides and this is a m contradiction. ThenW ~ ct M and VW = g, - h, u for sorne h, E R[x] imply m
g:
g,
v
w,
u
g, VWm u
= (g,
- h,u
m) -g, = g,g, m m - h,g, u
u
E J.
Therefore V E Q'. .D The last thing that remains to be done, in order to compute the primary decomposition of J, is to compute the polynomials V and W required in the last theorem. This is always possible and it follows from a result known as Herisel's Lemrna. We will state the result and show how it is used in examples, but we will not provide a proof of it. The interested reader should consult [Cohl. THEOREM 4.6.16 (HENSEL'S LEMMA). Let u be an irreducible element of the PJD R and let f E R[x]. Let g('), hl') E R[x] be two polynomials such that their images in (R/(u))[xl are relatively prime and such that
Then for any m there exist polynomials lm) and Mm) in R[x] such that f '" g(m)h(m)
(mod u m ), g(m) '" g(1)
(mod u), h(m) '" h(!)
(mod u).
Jf linear equations are solvable in R then these polynomials are computable. We wil~ now illustrate the technique presented in this section in three exampIes. These exarnples will also illustrate how the Hensellifting technique works.
CHAPTER 4. GROBNER BASES OVER RINGS
268
EXAMPLE 4.6.17. We go back to Example 4.6.12. We found that J = ((x + y)(y2 + 1), x 2 - X + Y + 1) C;; (
y4 + 2y3 + 2y2 + 2y + 1 = (y + 1)2(y2 + 1) = a2a3 xy2+x+y3+ y =(y2+1)(x+y)=a3 h2 x2 - X +Y + 1 = h3,
91 92
=
93
=
and that the maximal ideals which contain lare Ml
= (y + 1, x - 1), M 2 = (y2 + 1, x - y), M3 = (y2 + 1, x + y - 1).
We now find the primary components QI, Q2, and Q3 corresponding to Ml, M2' and M3 which belong to J. We use Theorem 4.6.15 and the notation set there to find QI, Q2, and Q3. Primary component which corresponds ta Ml: In this case we have u = y + 1 and v = x -1. The largest integer m such that u m divides 91 = (y + 1)2(y2 + 1) is m = 2. Now we need to factor 93 modulo u. We have
2
2
93 = x - X + Y + 1 "" x - X "" (x - l)x
(mod y + 1).
Therefore the largest n such that v n divides 93 modulo u is n = 1. We also have w = x. Now we need to find V and W such that
VW "" x 2
-
X
+Y+1
(mod (y
and W "" x
+ 1)2),
V""
(mod y
X -
1
(mod y
+ 1),
+ 1).
To do this we find polynomials h and h' in
V = (x - 1) + (y + l)h and W = x
+ (y + 1) h',
and which satisfy the congruence
93
x 2 -x+y+l xix - 1) + (y + 1)
+ 1)2) xix - 1) + x(y + l)h + (x - 1)(y + 1) h' +(y + 1)2hh' (mod (y + 1)2) xix - 1) + x(y + l)h + (x - l)(y + 1) h' (mod (y + 1)2). Therefore y + 1 "" x(y + l)h + (x - l)(y + 1) h' (mod (y + 1)2), or equivalently 1"" xh + (x - 1) h' (mod y + 1). One obvious solution to this equation is h = 1 VW (mod (y
and h' = -1. Therefore we have
V = (x - 1) + (y + 1) = x
+ y,
and W
= x - (y + 1) = x - y - 1.
By Theorem 4.6.15 the primary component of J which corresponds to the maximal ideal Ml is
4.6. PRlMARY DECOMPOSITION IN Rix] FOR R A
pm
269
Primary component which corresponds ta M 2 : In this case we have u = y2 + 1 and v = x-y. The largest integer m such that u m divides 91 = (y + 1)2(y2 + 1) is m = 1. Now we need to factor 93 modulo u. We have 93 =x 2 -x+y+1
= (x-y)(x+y-1)
(mody2+1).
Therefore the largest n such that v n divides 93 modulo u is n = 1. We also have w = x + y - 1. Since m = 1 we may let V = v and W = w. Therefore the primary ideal corresponding to M 2 and which belongs to J is
Primary component which corresponds ta M 3 : This computation is the same as the previous one and we get tbat the primary ideal corresponding to M3 and which belongs to J is Q3 = (y2 + l, (y2 + l)(x + y), x + y - 1) = M3' Therefore we have ((x + y)(y2 + 1), x 2 -
J
((y + 1)2,x + y)
X
n (y2 +
+ Y + 1) = Q,
l,x - y)
n Q2 n Q3
n (y2 +
l,x+y-1).
The next example illustrates how the Hensel lifting technique is applied repeatedly. EXAMPLE 4.6.18. In this example we again consider the ring R = Q[y]. Let J be the ideal ((x + y)2(y - 1), x 2 + X + y) of R[x]. Again, to compute the strong Gr6bner basis for J we compute the Gr6bner basis for J viewed as an ideal in Q[x, y] with respect to the lex term ordering with x > y. We get 91 92 93
= =
yS _y4 = y4(y_1) = a2 a3 xy-x+2y4_y3_y=(y-l)(x+2y3+y2+y)=a3h2 x 2 +x+y=h 3.
Note that the greatest common divisor of 91,92, and 93 is 1. As in Example 4.6.12 we first compute the maximal ideals which contain J. We find those maximal ideals which contain (a2, h 2) = (y4, x + 2y3 + y2 + y) and those which contain (a3, h 3 ) = (y - 1, x 2 + X + y). Maximal ideals which contain (y4,x + 2y3 + y2 + y): The only irreducible factor of a2 = y4 is u = y. The only irreducible factor of x + 2y3 + y2 + Y modulo y is v = x. Therefore the maximal ideal which contains (y4, X + 2y3 + y2 + y) is M, = (y, x).
Maximal ideals which contain (y - l, x 2 + X + y): It is easy to see that the only maximal ideal that contains (y - l, x 2 + X + y) is
CHAPTER 4. GROBNER BASES QVER RINGS
270
That is, we have u = y - 1 and v = x 2 + X + 1, sinee x 2 + x + 1 is irreducible in the ring R[x], where R = R/(u) = Ql[yl/(y - 1) "" Ql. 80 the original ideal (y - 1, x 2 + X + y) = M 2 is itself maximal. Thus, the primary deeomposition of v7 is
v7 =
Ml nM2 = (y,x) n (y -1,x 2 +x + 1).
We now find the primary eomponents that belong to l and whieh correspond to Ml and M 2 • We use Theorem 4.6.15 and the notation set there to find QI and Q2' Primary component which corresponds ta Ml: In this case we have u = y and v = x. The largest integer m sueh that u m divides gl = y4(y -1) is 17' = 4. Now we need to factor g3 modulo u. We have (mody).
g3 =x2+x+y",x2+x",x(x+l)
Therefore the largest n sueh that v n divides g3 modulo u is n = 1. We also have w = x + 1. Thus we need to find V and W sueh that
To do this we use the Hensellifting technique three times to find V(i), W(i) such that at each stage we have V(i)W(i) '" x 2 + X
+Y
(mod yi), V(i) '" x
(mod y), W(i) '" x
+1
(mod y),
for i = 2,3,4. This is done inductively by constructing V(i+l) and W(i+1) in terms of V(i) and W(i) respectively. We start with'lifting modulo y2 To do this we must find polynomials h(2) and h,(2) in Ql[x, y] such that
= x + yh(2)
V(2)
and W(2)
= x + 1 + yh,(2),
and whieh satisfy the following congruence g3
X2+X+Y V(2)W(2) (mod y2)
x2+ X
+ (x + l)yh(2) + xyh,(2)
(mod y2).
Therefore we have y '" (x + l)yh(2) + xy h,(2) (mod y2), or equivalently 1 (x + l)h(2) + x h,(2) (mod y). An obvious solution to this congruence is h(2) = 1 and h,(2) = -1. Therefore we have V(2) =
X
+ Y and W(2)
= x - y
+ 1.
To lift modulo y3, we find h(3) and h,(3) sueh that V(3) = V(2) W(3)
+ y 2h(3)
= X
+ Y + y 2h(3),
= W(2) + y 2h,(3) = X _ Y + 1 + y 2h,(3)
4.6. PRlMARY DECOMPOSITION IN Rix] FOR R A PID
271
and which satisfy the congruence 2 x +x+y V(3)W(3) (mod y3)
93
x2 +X
+Y -
y2
+ (x + y)y2 h,(3) + (x -
y
+ 1)y2h(3)
(mod y3).
Canceling y2 we have (x
+ y)h,(3) + (x - y + 1)h(3)
A solution to this congruence is h(3) V(3)
=
= x + y2 + Y and
'" 1
1 and h,(3) W(3)
=
(mod y).
= -1.
Therefore we have
x - y2 - Y + 1.
Finally we lift modulo y4. We find h(4) and h,(4) such that V W
= V(4) = V(3) + y 3h(4) = X + y2 + Y + y 3h(4),
= W(4) = W(3) + y 3h,(4) = X
_
y2 _ Y + 1 + y 3h,(4)
and which satisfy the congruence 93
2 x +x+y V(4)W(4) (mod y4) x 2 +x +y _ 2 y 3 + (x +y2 +y)y3h,(4) +(x - y2 _ Y + 1)y3h(4) (mod y4).
Canceling y3 we have (x
+ y2 + y)h'(4) + (x _
A solution to this congruence is h(4) V
y2 _ Y + 1)h(4) '" 2
= 2 and
h,(4)
(mod y).
= -2. Therefore we have
= V(4) = X + 2 y 3 + y2 + Y and W = W(4) = X _
2 y 3 _ y2 - Y
+ 1.
Now that we have V and W we can find the primary ideal QI corresponding to MI and which belongs ta l,
QI
=
(U m ,g2,V)
=
(y4,(y_1)(x+2y3+y2+y),x+2y3+y2+y) (y<,x
+ 2y3 + y2 + y).
Primary component which corresponds to M 2 : In this case we have u = y - 1 and v = x 2 + X + 1. The largest integer m such that u m divides 91 = y4(y - 1) is m = 1. We thus have that Q2 = (y - 1, x 2 + X + 1) = M 2 . Therefore the primary decomposition of 1 is 1= QI
n M 2 = (y4,x+2 y 3 +y2 +y) n (y -1,x 2 +x + 1).
We canclude this section with an example in Z[x].
CHAPTER 4. GROBNER BASES OVER RlNGS
272
EXAMPLE 4.6.19. The polynomials
91 92 93
45 = 9 . 5 = a2a3 5x + 10 = 5(x + 2) = a3h2 x 3 + 6x 2 + 20x + 15 = h3'
form a minimal strong Grabner basis in Z[x]. We will compute the primary decomposition of l = (91,92,93), We first compute the maximal ideals wbich contain (a2, h 2) and (a3, h 3) respectively. Maximal ideals which contain (9, x + 2): The only irreducible factor of 9 is u = 3. Also, v = x + 2 is irreducible modulo 3. Therefore there is only one maximal ideal which contains (9, x + 2), and it is M, = (3,x
+ 2).
Maximal ideals which contain (5, x 3 + 6x 2 + 20x + 15): The only irreducible factor of 5 is u = 5. Also,
x 3 + 6x 2 + 20x + 15 == x 3 + x 2 == x 2 (x + 1)
(mod 5).
Therefore there are two irreducible factors modulo 5, namely x and x + 1. Thus there are two maximal ideals which contain (5, x 3 + 6x 2 + 20x + 15), M 2 = (5,x) and M3 = (5,x + 1). The primary decomposition of
v7 =
v7 is
M, n M 2 n M3 = (3, x
+ 2) n (5, x) n (5, x + 1).
Now we find the prirnary component for each of these maximal ideals. Primary component which corresponds ta M,: In this case we have u = 3 and v = x + 2. The largest integer m sueh that u m divides 91 = 45 is m = 2. Now we need to factor 93 modulo u. We have 93 = x 3 + 6x 2 + 20x + 15 == x 3 + 2x == x(x2 + 2) == x(x + 1)(x + 2)
(mod 3).
Therefore the largest n such that v n divides 93 modulo u is n = 1. We also have w = x(x + 1). Now we need to find V and W such that
VW == x 3 + 6x 2 + 20x + 15
(mod 9), V == x + 2
and W == x(x + 1)
(mod 3),
(mod 3).
As before we use the Hensellifting technique. We find polynomials h and h' in
Z[x] such that V=(x+2)+3hand W=x(x+l)+3h',
4.6. PRlMARY DECOMPOSITION IN Rix] FOR R A PID
273
and which satisfy the following congruence 93
+ 6x 2 + 20x + 15
=
x3
_
VW (mod 9)
_
x3
+ 3x 2 + 2x + 3x(x + l)h + 3(x + 2) h'
(mod 9).
Therefore we have 3x 2 + 6"" 3x(x + l)h + 3(x + 2) h' (mod 9), or equivalently, x 2 + 2 "" x(x + l)h + (x + 2) h' (mod 3). A solution to this congruence is h = 0 and h' = x + 1. Thus we have
v = x + 2 and W
= x(x + 1)
+ 3(x + 1) =
x2
+ 4x + 3.
We now can find the primary ideal Q, corresponding to M, and whlch belongs to J, Primary component which corresponds to M 2 : In this case we have u = 5 and v = x. The largest integer m such that u m divides 9' = 45 is m = 1. Now we factor 93 modulo u. We have 93 = x 3
+ 6x 2 + 20x + 15 "" x 3 + x 2
""
2
x (x
+ 1)
(mod 5).
Therefore the largest n such that v n divides 93 modulo u is n = 2. We also have w = x + 1. Since m = 1 we may let V = v and W = w. The primary ideal Q2 corresponding to M 2 and which belongs to J is
Primary component which corresponds to M 3 : In this case we have u = 5 and v = x + 1. As above the largest integer m such that u m divides 9, = 45 is m = 1. Also, as above 93"" x 2 (x + 1) (mod 5), and so the largest n such that v n divides 93 modulo u is n = 1. We also have w = x 2 • As above we can choose V and W equal to v and w respectively, and so the primary ideal Q3 corresponding to Ma and whlch belongs to J is Q3 = (U m ,92,v) = (5,5(x
+ 2),:1: + 1) =
(5,x
+ 1).
Therefore we have the following primary decomposition of J J
= Q, n Q2 n Q3 = (9, x + 2) n (5, x 2) n (5, x + 1).
Exercises 4.6.1. Consider the following three polynomials in QI[x, yi
92
(y2 - Y + 1)2(y - 1) = a2a3 (y - 1)(x2 + y) = a3 h 2
93
xS
9'
+ 2x 6 + 3x 4 + x 2 -
Y + 1 = h3.
274
CHAPTER 4. GROBNER BASES OVER RlNGS
Let J = (9,,92,93) ç (
9' 92 93
4·7 =
+ 1) = a3h2 + 4x + 3 = h3'
7(x
x
4
a2a3
2
Let J = (9,,92,93) ç Z[x]. Verify that {9,,92,93} is a strong Grübner basis for J and find the primary decomposition of J.
Appendix A. Computations and Algorithms
Computations. There are many Computer Algebra Systems which have a Grübner basis package, for example AXIOM, MAPLE, and MATHEMATICA. There are other packages which are entirely devoted to computing in polynomial rings and which have an extensive list of commands ta perform sorne of the computations presented in this book. In particular, we mention CoCoA and MACAULAY. Most of the computations in the examples and exercises in this book (except in Chapter 4) were performed using CoCoA. Many of these computations could not have been done with the other systems listed above. MAPLE and MATHEMATICA do not allow computations in modules and have only a limited choice of orders. These systems allow the user to program and the algorithms presented in this book are, in principle, programmable. However, any practical implementation of these algorithms requires a lot of material not included in this book and many hours of work. MACAULAY computes only with homogeneous polynomials and foelises on computations applied ta algebraic geometry. Most of our examples are non-homogeneous. Sorne of the computations could still be done using MACAULAY, but with care (see Exercises 1.4.9 and 1.6.18). None of the above systems have an implementation for the computation of Grobner bases over rings which are nat fields. Of course the algorithms Qver rings could also, in principle, be implemented using sorne of these systems. CoCoA provides commands for the algebraic manipulation of polynomials in k[Xl, ... , Xn], ideals of k[Xl, ... ,X n], and submodules of the free modules (k[Xl, ... ,xn])m, where k is a field. Moreover, the user can choose among predefined term orders or define a custom ordering. The algebraic procedures Înclude the computation of the intersection of ideals and modules, Ideal quotients, normal forms, syzygy modules, elimination Ideals and modules, free resolutions, and, of course, Gr6bner bases for ideals and modules. When starting a computation with CoCoA the user specifies the characteristic of the field. Even when computing ovef Q, CoCoA performs arithmetic modulo a large prime number, 80 errars may occur and the user must keep this in mind. CoCoA was developed at the University of Genova, Italy, by Antonio Capani, Alessandro Giovini, Gianfranco Niesi, and Lorenzo Robbiano. Ta obtain a copy 275
276
APPENDIX A. COMPUTATIONS AND ALGORlTHMS
of CoCoA send a message to the developers at [email protected].
Aigorithms. The algorithms in this book are presented in pseudo-code, and the format is as follows. We always start our algoritluns by specifying the input (whlch follows the word INPUT) and the output (whlch follows the word OUTPUT). We then always specify how we initialize the variables involved in the algorithm (thls follows the word INITIALIZATION). When we need to assign an expression to a variable we use the instruction variable ;= expression For example, if the CUITent value of the variable C is 4 and the CUITent value of the variable G is {l, 2, 3}, and if the instructions
G:=GU{C} R:= C+ 1 are executed, then the operation {l,2,3}U{4} = {l,2,3,4} is performed, and G takes on its new value {l, 2, 3, 4}, and then the operation 4 + 1 = 5 is performed, and R-takes on its new value 5. We use the following conditional structure: IF condition THEN action 1
ELSE action 2 This means that if condition i8 true, then action 1 i8 performed, and if condition is false then action 2 is performed. Note that the truth of condition depends on the CUITent value of the variables in the algorithm. Sometimes we omit the ELSE statement which always means that action 2 is simply "do nothlng." The indentation always indicates what action 1 and action 2 are and when the conditional structure terminates. We also use two loop structures: WHILE condition DO action and FOR each item in a set S DO action In the WHILE loop, action is repeated as long as condition holds and is not performed if condition does not hold. In the FOR loop, action is performed once for each item in the set S. Note that, in many instances, an order on the set S is prescribed, and the FOR loop must be executed in that order. No other instruction is executed until the entire WHILE or FOR loop is completed. The indentation always indicates what the action is and when the loops terminate.
Appendix B. Well-ordering and Induction
In this book we use a form of "proof by induction" which may be unfamiliar ta the reader. Il is the purpose of this appendix ta briefly describe this process. We consider a non-empty set T which we assume has a total order ":S;" on it. That is, we assume we have a relation ":S" on T satisfying the following properties: • the relation :S is reflexive: for an t E T, t ::; tj • the relation <: is transitive: for all s, t, u E T, if s <: t and t <: u then s S u; • the relation <: is antisymmetric: for ail s, t E T, if s <: t and t <: s then s = t; • the relation <: is total: for all s, t E T, either s <: tort <: s. We say that a total arder <: is a well-ordering provided that we have the additional property • Let S ç T such that S # 0. Then S contains a smallest element. That is, there is an sES such that for all tES we have s <: t. We will use two examples of well-ordered sets. There is, of course, the one familiar ta most people, the set of natural numbers N = {D, 1,2, ... }. The one we will use most in this book is the set '[n of power products in the variables Xl, ... , X n on which we will put various orders (see Section 1.4); ail of them will be well-orderings. Let T be a well-ordered set. Since T is a subset of itself, it has a smallest element which we will denote by 1 (of course, in N this element is 0). We assume that we are given a set {Pt 1 t E T} of statements to be proved. We go about this in One of two ways. • We argue by contradiction. If one of the statements is false, we can let S be the set of all t E T such that Pt is false. Since S # 0 we may choose sES least. Another way ta say this is that if sorne of the Pt 's are false then we may choose a smallest sET for which P s is false. The idea then is ta find t < s for which Pt is also false and sa arrive at a contradiction and conclude that aU Pt '8 are true. • The second way to proceed is in the more traditional "induction argu277
278
APPENDIX B. WELL-ORDERING AND INDUCTION
menf' style (commonly referred to as strong induction). We assume that (i) P, is true; (ii) For al! t E T the truth of P, for ail s < t implies the truth of Pt. We conclude from this that ail of the PtS are true.
The validity of the first method of reasoning is obvious. Moreover the proof that these two forms of induction are equiva!ent proceeds exactly as it does in the case T = N.
References [AdGo] W.W. Adams and L.J. Goldstein, Introduction to Number Theory, Prentice Hall, Englewood Cliffs, NJ, 1976. [AtMD] M.F. Atiyah and LG. MacDonald, Introduction to Commutative Algebra, Addison-Wesley, Reading, MA, 1969. [Ba] D. Bayer, The Division Algorithm and the Hilbert Scheme, Ph.D. Thesis, Harvard University, Cambridge, MA, 1982. [BaSt88] D.A. Bayer and M. Stilhnan, On the complexity of camputing syzygies, J. Symb. Comp. 6 (1988), 135-147. [BaSt90] D.A. Bayer and M. Stillman, Macaulay: A system for computation in algebraic geometry and commutative algebra. User's manual. [BeWe] T. Becker and V. Weispfenning, Grobner Bases: A Camputational Approach ta Commutative Algebra, Springer Verlag, Berlin and New York, 1993. [Bu65] B. Buchberger, Ein Algorithmus zum Auffinden der Basiselemente des Restklassenringes nach einem nulldimensionalen Polynomideal, Ph.D. Thesis, Inst. University of Innsbruck, Innsbruck, Austria, 1965. [Bu79] B. Buchberger, A criterion for detecting unnecessary reauctions in the construction of Grobner bases, in EUROSAM'79, An International Symposium on Symbolic and Algebraic Manipulation (E.W. Ng, ed.), Lecture Notes in Comput. Sei., vol. 72, Springer Verlag, Berlin and New York, 1979, 3~21. [Bu83] B. Bucbberger, A note on the complexity of constructing Grobner bases, in EUROCAL'83, European Computer Algebra Conference (J.A. van Hulzen ed.), Lecture Notes in Comput. Sei., vol. 162, Springer Verlag, Berlin and New York, 1983, 137~145. [Bu85] B. Bucliberger, Grobner bases: An algorithmic method in polynomial ideal theory, in Multidimensional Systems Theory (N.K. Bose ed.), Reidel, Dordrecht, 1985, 184~232. [CGGLMW] B. Char, K. Geddes, G. Gonnet, B. Leang, M. Monogan, and S.M. Watt, Maple V Library Reference Manual, Springer Verlag, Berlin and New York,1991. [Cohl H. Cohen, A Course in Computational Algebraic Number Theory, Springer Verlag, Berlin and New York, 1993. [CoTr] P. Conti and C. Traversa, Buchberger algarithm and integer programming,
279
280
REFERENCES
in Applied Algebra, Algebraic Algorithms, and Error-Correcting Codes AAECC'9 (H.F. Mattson, T. Mora, and T.R.N Rao eds.), Lecture Notes in Comput. Sci., vol. 539, Springer verlag, Berlin and New York, 1991, 130139. [CLOS] D. Cox, J. Little, and D. O'Shea, Ideals, Varieties, and Aigorithms: An Introduction to Computational Aigebraic Geometry and Commutative Aigebra, Springer Verlag, Berlin and New York, 1992. [Cz] S.R. Czapor, Grobner basis methods for solving algebraic equations, Ph.D. Thesis, University of Waterloo, Canada, 1988. [EHV] D. Eisenbud, C. Huneke, and W. Vasconcelos, Direct methods for primary decomposition, Invent. Math. 110 (1992), 207-235. [FGLM] J.C. Faugère, P. Gianni, D. Lazard, and T. Mora, Efficient computation of zero-dimensional Grobner bases by change of ordering, J. Symb. Compl. 16 (1993), 329-344. [Ga] D. Gale, The Theory of Linear Economie ModeZs, McGraw-Hill, New York, 1960. [GTZ] P. Gianni, B. Trager, and G. Zacharias, Grobner bases and primary decomposition of polynomial ideals, J. Symb. Comp. 6 (1988), 149--167. [GMNRT] A. Giovini, T. Mora, G. Niesi, L. Robbiano, and C. Traverso, "One sugar cube, please" or selection strategies in the Buchberger algorithm, in Proc. International Symposium on Symbolic and Algebraic Computation ISSAC'91 (S.M. Watt ed.), ACM Press, New York, 1991, 49--54. [GiNi] A. Giovini and G. Niesi, CoCoA user manual, 1990. [Go] L. J. Goldstein, Abstract Aigebra, a First Course, Prentice Hall, Englewood Cliffs, NJ, 1973. [He] LN. Herstein, Topics in Aigebra, Blaisdell, New York, 1964. [Hi] H. Hironaka, Resolution of singularities of an algebraic variety over a field of characleristic zero, Ann. Math. 79 (1964), 109--326. [Hun] T.W. Hungerford, Aigebra, Springer Verlag, Berlin and New York, 1974. [Huy] D.T. Huynh, A superexponential lower bound for Grobner bases and Church-Rosser commutative Thue systems, Inf. Contr. 68 (1986), 196-206. [JeSu] R.D. Jenks and R.S. Sutor, AXIOM: The Scientific Computation System, Springer Verlag, Berlin and New York, 1992. [Lak] YN. Lakshman, On the complexity of computing Grobner bases for zerodimensional polynomials ideaZs, Ph.D. Thesis, Rensselaer Polytechnic Institute, Troy, NY, 1990. [Lan] S. Lang, Aigebra, Addison-Wesley, Reading, MA, 1970. [Laz85] D. Lazard, Ideal bases and primary decomposition: Case of two variables, J. Symb. Comp. 1 (1985), 261-270. [Laz91] D. Lazard, Systems of algebraic equations (algorithms and complexity), Computational Algebraic Geometry and Commutative Algebra (D. Eisenbud and L. Robbiano, eds.), Cambridge University Press, London, 1993.
REFERENCES
281
[Mac] F.S. Mru::aulay, Some properlies of enumeration in the theory of modular systems, Proc. London Math. Soc. 26 (1927), 531-555. [MaMe] E.W. Mayr and A.R Meyer, The complexity of the ward problems for commutative semigroups and polynomial ideals, Adv. Math 46 (1982),305329. [Mi] B. Mishra, Algorithmic Aigebra, Springer Verlag, Berlin and New York, 1993. [Mo88] H.M. Moller, On the construction of Griibner bases using syzygies, J. Symb. Comp. 6 (1988), 345-359. [Mo90] H.M. Moller, Computing syzygies a la Gauss Jordan, in Effective Methods in Algebraic Geometry (T. Mora and C. Traverso, eds.), Progress in Mathematics 94, Birkhiiuser Verlag, Basel, 1990, 335-345. [MoMo] H.M. Moller and T. Mora, New corutructive methods in classical ideal theory, J. Algebra 100 (1986), 138-178. [MoRo] T. Mora and L. Robbiano, The Griibner fan of an ideal, J. Symb. Comp. 6 (1988), 183-208. [NZM] 1. Niven, HB. Zuckerman, and H.L. Montgomery, An Introduction ta the Theory of Numbers, 5th ed., Wiley, New York, 1991. [RDSw] L. Robbiano and M. Sweedler, Subalgebra bases, in Proc. Commutative Algebra Salvador (W. Burns and A. Simis eds.), Lecture Notes in Math., vol. 1430, Springer Verlag, Berlin and New York, 1988, 61-87. [Sebre] F.O. Sehreyer, Die Bereehnung von Syzygien mit dem verallgemeinerlen Weierstrasschen Divisioruatz, Diplomarbeit, Hamburg, 1980. [Schri] A. Sehrijver, Theory of Linear and Integer Programming, Wiley, Chichester, NY, 1986. [Se] A. Seidenberg, Constructions in algebra, Trans. Amer. Math. Soc. 197 (1974), 272-313. [ShSw] D. Shannon and M. Sweedler, Using Griibner bases to determirœ algebra membership, split surjective algebra homomorphisms determine bira~ tional equivalence, J. Symb. Comp. 6 (1988), 267-273. [Sz] G. Szekeres, A canonical basis for the ideals of a polynomial domain, Am. Math. Mon. 59 (1952), 379-386. [Wo] S. Wolfram, Mathematica: A System for Doing Mathematics by Computer, Addison-Wesley, Reading, MA, 1991. [Za] G. Zacharias, Generalized Griibner bases in commutative polynomial rings, Bachelor's Thesis, MIT, Cambridge, MA, 1978.
List of Symbols
li
natural numbers ring of integers ring of integers modulo n field of rational numbers IQI field of real numbers IR field of complex numbers C 'lrn set of power products in the variables Xl, ... ,X n a field k affine space kn ring of polynomials in the variables Xl, ... ,X n with k[x" ... , xnl coefficients in the field k ideal (submodule) generated by fI,··· , J, (f" ... ,J,) J == g (mod 1) J congruent ta g modulo l k[x" ... ,xnl/I quotient ring of k[x" ... , xnl by the ideal l coset of J modulo l J+I V(8) variety in k n defined by the set of polynomials 8 variety in k n defined by the polynomials fI,·· . , J, V(f" ... , J,) I(V) ideal in k[x" ... , xnl defined by V ç k n ideal generated by the polynomials in the set 8 (8) greatest common divisor gcd least common multiple lem dimension of the k· vector space L dim.(L) lt(f) leading term of J leading power product of J lp(f) leading coefficient of J le(f)
Z Zn
xa J~h G J --4+ h
X~l ... x~n
J reduces to h J reduces to h
modulo G in one step modulo G
283
284
LIST OF SYMBOLS
/~h Lt(S)
S(f,g) Na(f) VK(S)
k
-Ji InJ I:J im( <1» ker( <1» k[fI, ... ,/,]
kiF] k(Xl, ... ,Xn)
k(a) k(a" ... ,an ) Am M!N el,··· lem
/, 1 [ J, J, 1 SYZ(f" ... , J,) [ fI
Syz(F)
Im(J)
FœG [FIG]
ts
ann(M)
F0G Hom(M,N)
(U) S-lA Ap Ag
/ reduces to h and h = / - X 9 leading term ideal (submodule) defined by S S-polynomial of / and 9 normal form of / with respect to G variety in Kn defined by S ç k[Xl' ... , x n ] algebraic closure of the field k radical of the ideal l intersection of land J ideal quotient of l by J image of the map <1> kernel of the map <1> k-algebra generated by the polynomials fI, ... , /, k-algebra generated by the polynomials in the set F rational function field field extension of k obtained by adjoining a field extension of k obtained by adjoining QI, ... ,Qn set of column vectors with entries in the ring A quotient module of M by N isomorphic ta standard basis for the free module Am 1 X s matrix whose entries are polynomials h, ... ,fs m x s matrix whose columns are vectors fI' ... ,fs syzygy module of the matrix [ fI /, syzygy module of the matrix F leading monomial of the vector J direct sum of the matrices F and G concatenation of the matrices F and G transpose of the matrix S annihilator of the module M tensor product of the matrices F and G set of all A-module homomorphisms between M and N module generated by the columns of the matrix U localization of the ring A at the multiplicative set S localization of the ring A at the prime ideal P localization of the ring A at the set {l, g, g2, g", ... }
1
Index
algorithm for modules, 149 improved algorithm, 129 Theorem, 40, 48
affine algebra, 84, 180 space, 2 algebra, 232
affine, 84, 180 First Isomorphism Theorem, 80 homomorphism, 79, 92, 232 membership problem, 39 algebraic dosme, 62 element, 97 geometry, 90 algebraically closed field, 62 algorithm, 275 crit2, 130 division module case, 145 rnultivariable case, 28 one variable case, 12 ring case, 207 Euclidean, 14 Grohner basis
CoCoA,275 column vector, 113 commutative ring, 1 computation, 275 Computer Algebra System, vii, 275 confluence relation, 37 congruence, 5 Conti, 105 coset, 5 field case multivariable case, 57 one variable case, 15 module case, 155 ring case, 227 cost function, 105 critl, 128, 152, 258 crit2, 128, 168, 224
Buchberger's, field case, 43 Buchberger's, module case, 149 improved Buchberger's, 129 PID case, 250 ring case, 216 ring case using Mûller's technique,
deglex, 20 degrevlex, 20 determine, 5, 53 Dickson's Lemma, 23 direct sum of matrices, 174 division for vectors, 141 multivariable case, 25, 26, 27 one variable case, 11 division algorithm field case multivariable case, 28 one variable case, 12 module case, 145 ring case, 207
219 primality test, 244 annihilator, 177, 217 ascending chain condition, 6 basis forA Tn jM,155 for k[Xl, ... ,xnl/I, 58
in Am., 114 standard in Am, 114 Bayer, 102 Buchberger, viii, 39, 113, 124, 131 algorithm, ix, 43
effective, 53 effective coset representatives, 226 285
INDEX
286
elimination
Gauss-Jordan, -2 ideal
field case, 70 ring case, 229 module, 156 order) 69, 229 Euclidean Algorithm, 14 exact sequence, 185 short, 184 explicitly given module, 117 Ext functor) 194, 199 Faugère, 68 field algebraic dosme, 62 algebraically closed, 62 First Isomorphism Theorem algebra ease, 80 module case, 117 free module, 114 free re~olution, 195 Gauss-Jordan elimination, 2 generating set, 3, 114 finite, 6 Gianni, 68, 237 global dimension, 196 graph, 102 graph of a map, 91 greatest cornmon diviso!, 13, 15, 71 Grobner basis algorithm Buchberger's, field case, 43 Buchberger's, module case, 148 improved Buchberger's, 129 PID case, 250 ring case, 216 ring case using M611er's technique,
219 for ideal, field case, 32, 34 minimal,47 reduced,48 universal, 50 using syzygies, 121
for ideal, ring case, 201, 208 minimal, 211 minimal strong, 251, 254, 262 strong, 251 for module, 147 reduced, 150 Hensel's Lemma, 267 Hilbert Basis Theorem, 5, 6 Nullstellensatz, 62 Hom, 183 homogeneous ideal,49 component, 22 polynomial, 22, 38 syzygy, 121, 212 homogenization, 38 homological algebra, 185 homomorpmsm algebra, 79, 92, 232 image of, 80 kernel of, 80, 232 module, 116 ideal, 3 field case elimination, 70 Grobner basis for, 32, 34 homogeneous, 49 intersection, 70, 172, 175 leading term, 32 membership problem, 5, 53 monomial, 23 of relations, 80 quotient, 72 quotient for modules, 157, 177, 182 irreducible, 261 leading term, 32, 206 monomial, 23 P-primary, 261 primary, 260 prime, 61, 237, 259 principal, 13, 246 radical of, 62, 259
287
INDEX ring case elimination, 229 Grobner basis for, 201, 208 intersection, 230 leading term, 206 membership problem, 225 of relations, 232 quotient, 217, 231 saturation of, 238 zero-dimensional, 64, 262 image of a homomorphism, 80 implicitization, 91 induction, 277 integer progranuning, 105 inter-reduced, 131 intersection of ideals field case, 70, 172, 175 :ring case, 230 of modules, 157,175,176,178 inverse in k[Xl, ... ,xnl/I, 59, 182 irreducible ideal, 261 isomorphic varieties, 94 isomorphism, module, 116 kernel of a homomorphism, 80, 232 Lakshman, 77 Lazard, 68, 201, 254, 259 leading coefficient multi-variable case, 21, 202 one variable case, 10 vector case, 143 leading monomial, 143 leading power product, 21, 202 leading term ideal, 32, 206 module, 147 multivariable case, 21, 202 one variable case, 10 vector case, 143 least cornmon multiple of polynomials, 71 of vectors, 147 lex, 19
linear equations are solvable, 204, 262 local ring, 238 localization, 89 long division, 10 membership problem algebra,39 ideal over field, 5, 53 ideal over ring, 225 module, 153 minimal Grobner basis, field case, 47 Grobner basis, ring case, 211 strong Grobner basis, 251 with respect to F, 205 minimal polynomial, 97 module, U3 elimination, 156 explicitly given, 117 First Isomorphism Theorem, 117 free,114 Grobner basis for, 147 homomorphism, 116 ideal quotient, 157, 177, 182 intersection, 157, 175, 176, 178 isomorphism, 116 leading term module, 147 membership problem, 153 Noetherian, 116 normal forro, 155 quotient, 116 reduction in, 143 S-polynomial, 148 syzygy, U8 Moller, 170, 201, 216, 250 monic polynomial, 13 monoid,37 monomial ideal, 23 in modules, 141 leading, 143 Mora, 23, 68 multiplicative set, 238 Noetherian
288
INDEX
module, 116 ring, 6, 259 normal form polynomial case over a field, 57 over a ring, 227 vector case, 155 normal selection strategy, 130 Nullstellensatz, 62
quotient module, 116 ring, 5
order, 18 degree lexicographie (deglex), 19 degree reverse lexicographie (degrevlex), 20 elirnination, 69 induced by a matrix, 166 lexicographie (lex), 19 position over term (POT), 142 term, 18, 140 term over position (TOP), 142 total, 18, 277 weJl-ordering, 18, 277
radical of an ideal, 62, 258 reduced Grobner basis ideal case over fields, 48 module case, 150 polynomial over a field, 27 totally, 227 vector, 144 reduction linear case, 8 module case, 143 multivariable case over a field, 25, 26,
P-primary ideal, 261 parametrized variety, 91 polynomial, 1 elementary symmetric, 25
multivariable case over a ring, 203, 205 one variable case, 11 strong, 252, 258 remainder module case, 144 muItivariable case over a field, 27 one variable case over a field, Il ring commutative, 1 local, 238 localization, 89 Noetherian, 6, 259 of fractions, 238 quotient, 5 Robbiano, 23, 39 row echelon form, 2
ideal, 61, 237, 259 principal ideal, 13, 246 Principal Ideal Domain (PID), 13,246 projection map, 90
pullback, 182
27
homogeneous, 22, 38 minimal, 97 monie,13 normal form, 57, 227 reduced, 27 square free, 75 symmetric, 25, 88 position over terrn (POT), 142 power product, 1, 18 leading, 21, 202 presentation, 117, 119, 195 primality test, 244 primary component, 261 decomposition, 261 ideal, 260 prime component, 261
S-basis, 247 S-polynomial module case, 148 multivariable case over a field, 40, 121,
124
INDEX PID case, 249 ring case, 213 SAGBI basis, 39 saturated set, 213, 226 saturation of a set, 213 saturation of an ideal, 238 Schreyer, 165 Seidenberg, 78 Shannon, 82, 88 square free polynomial, 75 standard basis, 32 strong Grobner basis, 251 minimal Gr6bner basis, 251, 254, 262 reduction, 252, 258 subalgebra, 39 submodule, 114 Sweedler, 39, 82, 88 symmetric polynomial, 25, 88 syzygy and Grobner bases, 121 applications of, 171 homogeneous, 121, 212 of a matrix, 161 module of, 161 of vectors over a field, 118 module of, 118, 134 of vectors over a ring, 212 module of, 232, 246 Szekeres, 254 tensor product of matrices, 189 term module case, 140 order module case, 140 polynomial case, 18 polynomial case, l, 18 term over position (TOP), 142 three color problem, 102 total order, 18, 277 totally reduced, 227 Trager,237 transpose matrix, 176 Traverso, 105
289 universal Gr6bner basis, 50 variety, 2, 61, 90 isomorphic, 94 parametrized, 91 vector space, 1 well-ordering, 18, 277 Zacharias, 201, 226, 237 Zariski closure, 90 zero divisor, 61 zero-dimensional ideal, 64, 262