Introduction to Geometric Probability This is the first modern introduction to geometric probability, also known as integral geometry. The subject is presented at an elementary level, requiring little more than first year graduate mathematics. The theory of intrinsic volumes due to Hadwiger, McMullen, Santal6 and others is presented, along with a complete and elementary proof of Hadwiger's characterization theorem of invariant measures in Euclidean n-space. The theory of the Euler characteristic is developed from an integralgeometric point of view. The authors then prove the fundamental theorem of integral geometry, namely the kinematic formula. Finally the analogies between invariant measures on polyconvex sets and measures on order ideals of finite partially ordered sets are investigated. The relationship between convex geometry and enumerative combinatorics motivates much of the presentation. Every chapter concludes with a list of unsolved problems. Geometers and combinatorialists will find this a stimulating and fruitful tale.
Daniel A. Klain is Assistant Professor of Mathematics at Georgia Institute of Technology. Gian-Carlo Rota is Professor of Applied Mathematics and Philosophy, Massachusetts Institute of Technology.
Lezioni Lincee Sponsored by Foundazione IBM Italia Editor: Luigi A. Radicati di Bmzolo, Scuola Normale Superiore, Pisa
This series of books arises from lectures given under the auspices of the Accademia Nazionale dei Lincei and is sponsored by Foundazione IBM Italia. The lectures, given by international authorities, will range on scientific topics from mathematics and physics through to biology and economics. The books are intended for a broad audience of graduate students and faculty members, and are meant to provide a 'mise au point' for the subject with which they deal. The symbol of the Accademia, the lynx, is noted for its sharp-sightedness; the volumes in this series will be penetrating studies of scientific topics of contemporary interest.
Already published Chaotic Evolution and Strange Attractors: D. Ruelle Introduction to Polymer Dynamics: P. de Gennes The Geometry and Physics of Knots: M. Atiyah Attractors for Semigroups and Evolution Equations: O. Ladyzhenskaya Asymptotic Behaviour of Solutions of Evolutionary Equations: M. 1. Vishik Half a Century of Free Radical Chemistry: D. N. R. Barton in collaboration with S. 1. Parekh Bound Carbohydrates in Nature: L. Warren Neural Activity and the Growth of the Brain: D. Purves Perspectives in Astrophysical Cosmology: M. Rees Molecular Mechanisms in Striated Muscle: S. V. Perry Some Asymptotic Problems in the Theory of Partial Differential Equations: O.Oleinik
PUBLISHED BY THE PRESS SYNDICATE OF THE UNIVERSITY OF CAMBRIDGE
The Pitt Building, Trumpington Street, Cambridge CB2 1RP, United Kingdom CAMBRIDGE UNIVERSITY PRESS
The Edinburgh Building, Cambridge CB2 2RU, UK http://www.cup.cam.ac.uk 40 West 20th Street, New York, NY 10011-4211, USA http://www.cup.org 10 Stamford Road, Oakleigh, Melbourne 3166, Australia
©
Cambridge University Press 1997
This book is in copyright. Subject to statutory exception and to the provisions of relevant collective licensing agreements, no reproduction of any part may take place without the written permission of Cambridge University Press. First published 1997 Printed in the United Kingdom at the University Press, Cambridge Typeset in Computer Modern 10/13pt
A catalogue record for this book is available from the British Libmry ISBN 0 521 59362 X hardback ISBN 0 521 59654 8 paperback
Contents
Preface Using this book 1
1.1 1.2 1.3 2
2.1 2.2 2.3 3
3.1 3.2 3.3 3.4 4
4.1 4.2 4.3 5
5.1 5.2 5.3 5.4 5.5 5.6
page xi xiv
The Buffon needle problem The classical problem The space of lines Notes
1 1 3 5
Valuation and integral Valuations Groemer's integral theorem Notes
6 6 8 11
A discrete lattice Subsets of a finite set Valuations on a simplicial complex A discrete analogue of Helly's theorem Notes
13 13 21 28 29
The intrinsic volumes for parallelotopes The lattice of parallelotopes Invariant valuations on parallelotopes Notes
30 30 35 41
The lattice of polyconvex sets Polyconvex sets The Euler characteristic Helly's theorem Lutwak's containment theorem Cauchy's surface area formula Notes
42 42 46 50 54 55 58
viii 6
6.1 6.2 6.3 6.4 6.5 6.6 6.7 7
7.1 7.2 7.3 7.4 7.5 8
8.1 8.2 8.3 8.4 8.5 8.6 8.7 9
9.1 9.2 9.3 9.4 9.5 9.6 9.7 9.8 9.9 10
10.1 10.2 10.3 10.4
Contents Invariant measures on Grassmannians The lattice of subspaces Computing the flag coefficients Properties of the flag coefficients A continuous analogue of Sperner's theorem A continuous analogue of Meshalkin's theorem Helly's theorem for subspaces Notes
60 60 63 70 73 77 81 83
The intrinsic volumes for polyconvex sets The affine Grassmannian The intrinsic volumes and Hadwiger's formula An Euler relation for the intrinsic volumes The mean projection formula Notes
86 86 87 93 94 95
A characterization theorem for volume Simple valuations on polyconvex sets Even and odd valuations The volume theorem The normalization of the intrinsic volumes Lattice points and volume Remarks on Hilbert's third problem Notes
98 98 106 109 111 112 115 117
Hadwiger's characterization theorem A proof of Hadwiger's characterization theorem The intrinsic volumes of the unit ball Crofton's formula The mean projection formula revisited Mean cross-sectional volume The Buffon needle problem revisited Intrinsic volumes on products Computing the intrinsic volumes Notes
118 118 120 123 125 128 129 130 135 140
Kinematic formulas for polyconvex sets The principal kinematic formula Hadwiger's containment theorem Higher kinematic formulas Notes
146 146 150 152 153
Contents Polyconvex sets in the sphere 11.1 Convexity in the sphere 11.2 A characterization for spherical area 11.3 Invariant valuations on spherical polytopes 11.4 Spherical kinematic formulas 11.5 Remarks on higher dimensional spheres 11.6 Notes Bibliography Index of symbols Index 11
ix
154 154 156 159 162 164 166 168 174 176
Preface
If we were allowed to rename the field of geometric probability - sometimes already renamed integral geometry - then we would be tempted to choose the oxymoron 'continuous combinatorics.' On more than one occasion the two fields, geometric probability and enumerative combinatorics, are brought together by mathematical analogy, that most effective breaker of barriers. Like combinatorial enumeration, where sequences of objects bearing a common feature are unified by the idea of a generating function, geometric probability studies sets of geometric objects bearing a common feature, which are unified by the idea of an invariant measure. The basic idea is extremely simple. When considering straight lines, pairs of points, or triangles in space, one determines the invariant measure on the variety of straight lines, of pairs of points, of triangles. This idea is strangely reminiscent of the underlying idea of enumerative geometry, with one major difference: whereas enumerative geometry is bound to the counting of finite sets, geometric probability is given greater freedom, by extending the concept of enumeration to allow the assigning of invariant measures. Invariant measures are far easier to compute and, we dare add, more useful than the curiously large integers that are computed in enumerative geometry. This basic idea goes back to Crofton's article in the ninth edition of the Encyclopaedia Britannica, an article that created the subject from scratch and that is still worth reading today. The one other brilliant contribution to geometric probability in the past century was Barbier's solution of the Buffon needle problem, which remains to this day the basic trick of the subject, still being secretly exploited in ever unsuspected ways. Geometric probability has suffered in this century the fate of other fields that would have enjoyed a healthy autonomous development, had
xii
Preface
it not been for the overpowering development of representation theory. One can reduce integral geometry to the study of actions of Lie groups, to symmetric spaces, to the Radon transform; in so doing, however, the authentic problematic of the subject is lost. Geometric probability is a customer of representation theory, in the same sense that mechanics is a customer of the calculus. The purpose of this book is to present the three basic ideas of geometric probability, stripped of all reliance on group-theoretic techniques. First, we investigate measures on polyconvex sets (i.e., finite unions of compact convex sets) in Euclidean spaces of arbitrary dimension that are invariant under the group of Euclidean motions. A great many mathematicians are still basking in the illusion that there is only one such measure, namely, the volume. We merrily destroy this illusion by proving what is at present the fundamental result of the field (due to Hadwiger), stating that the space of such invariant measures is of dimension n + 1 in a Euclidean space of dimension n. The proof of this fundamental result given in the text is new, due to the first author. It becomes clear, on reading the applications of the fundamental theorem, that the basic invariant measure to be singled out from such a bounty is not the volume, but the Euler characteristic (as Steve Schanuel was first to realize). Here again we meet with wide ignorance on the part of the mathematical public: the fundamental fact that the Euler characteristic is an invariant measure (in fact, it is the only integer-valued invariant measure) is not as well known as it should be. It leads to one-line proofs of most of the fundamental theorems on convex sets. We develop the theory of the Euler characteristic from scratch, in a way that makes it look like an ordinary integral. Second, we prove the fundamental formula of integral geometry, viz., the kinematic formula. Here we displace the common device of Minkowski sums from its typically central role, not merely as a display of mathematical machismo, but with an ulterior motive. Third, we try to bring out from the beginning the striking analogy between the computation of invariant measures and certain combinatorial properties of finite partially ordered sets. The second author pointed out in 1967 that the notion of Euler characteristic could be extended to such partially ordered sets by means of the Mobius function. We now go one step further and show that an analogue of the theory of invariant measures in Euclidean space can be worked out in partially ordered sets, including finite analogues of the kinematic formula and even of Helly's theorem. This analogy brings out in stark contrast the
Preface
xiii
unexplored terrain of classical geometric probability, namely, a thorough understanding of the integral geometric structure of the lattice of subspaces of Euclidean space under the action of the orthogonal group. It also brings us closer to the current outer limits of mathematics, to the theory of Hecke algebras, to Schubert varieties and to the quantum world. We hope that the reading of this introduction to the field of geometric probability will encourage further development of these analogies. The text is based on the 'Lezioni Lincee' given by the second author in 1986 at the Scuola Normale Superiore in Pisa. The authors wish to thank Ennio De Giorgi, Edoardo Vesentini, and Luigi Radicati for providing an interested audience for the original lectures. Thanks are also due to Stefano Mortola for his careful reading of the initial draft, and to Beifang Chen, Steve Fisk, Joseph Fu, Steven Holt, Erwin Lutwak, and three anonymous referees for their valuable comments and suggestions.
Using this book
Although parts of this book assume a knowledge of basic point-set topology, measure theory, and elementary probability theory, the greater part of the text should be accessible to advanced undergraduates. Proofs are either given in full or else stretched to the point from which the reader will be able to reconstruct them without effort. Only on certain technical measure-theoretic points have we felt the need to omit details that, although indispensable in a detailed treatment, are of questionable relevance in an exposition that is meant to stress geometric insight and combinatorial analogy. Some notions that appear vague in the early sections will be revisited later on, after language has been developed for a treatment in clear and rigorous terms. References and open problems are deferred to the notes at the end of each chapter.
1 The Buffon needle problem
We begin with what is probably the best-known problem of geometric probability, the Buffon needle problem. This solution of the needle problem via the characterization of an additive set functional serves to motivate the study of valuations on lattices, the topic of Chapter 2. Variations and generalizations of the Buffon needle problem are presented in Chapters 8 and 9.
1.1 The classical problem
Parallel straight lines are drawn on the plane R2, at a distance d from each other. A needle of length L is dropped at random on the plane. What is the probability that the needle shall meet at least one of the lines? This problem can be solved by computations with conditional probability (Feller, for example, solved it in this way in his well known treatise [23, p. 61]). It is, however, more instructive to solve it by another method, one that minimizes the amount of computation and maximizes the role of probabilistic reasoning. Let Xl be the number of intersections of a randomly dropped needle of length Ll with any of the parallel straight lines. If the needle is long enough, the random variable Xl can take several integer values, whereas if the needle is short, it can take only the values 0 or 1. If Pn is the probability that the needle meets exactly n of the straight lines, and if E(Xd denotes the expectation of the random variable Xl, then we have
E(X l ) =
L npn· n2:0
2
1 The Buffon needle problem
Thus, if L1 < d, then
and P1 is the probability we seek. Therefore, it is sufficient to compute the expectation E(X1)' Suppose that another needle of length L2 is dropped at random. The number of intersections of this second needle with any of the parallel straight lines drawn on R 2 is another random variable, say X 2 . The random variables Xl and X 2 are independent, unless the needles are welded together. Suppose that the needles are rigidly bound at one of their endpoints. They may form a straight line, or they may be at an angle. In either case, if the two rigidly bound needles are simultaneously dropped on R 2 , their total number of intersections will still be Xl + X 2 . The random variables Xl and X 2 will no longer be independent, but their expectation will remain additive: (1.1) The same reasoning applies to the random variable Xl + X 2 + ... + X k , for the case in which k needles are welded together to form a polygonal line of arbitrary shape. Since E(X 1) clearly depends on the length L 1, we can write E(Xd = I(L 1 ), where 1 is a function to be determined. By welding together two needles so that they form one straight line we find that E(X1 + X 2) = I(L1 + L 2), and we infer from (1.1) I(L 1 + L 2)
=
I(L 1) + I(L2).
It then follows that 1 is linear when restricted to rational values of L. Since 1 is clearly a monotonically increasing function with respect to L, we infer that I(L) = rL for all L E R, where the constant r is to be determined. If C is a rigid wire of length L, dropped randomly on R2, and if Y is the number of intersections of C with any of the straight lines, then C can be approximated by polygonal wires, so that Y is approximately equal to Xl + X 2 + ... + Xk. Passing to the limit, we find that
E(Y)
= rL.
(1.2)
This allows us to determine the value of the constant r, by choosing a wire of suitable shape. Let C be a circular wire of diameter d. Obviously E(Y) = 2, and L = 7rd. It then follows from (1.2) that 2
= r7rd,
1.2 The space of lines
3
whence r = 2j(7rd). Thus, for a short needle, we have
E(X 1 ) =
2L
= 7rd·
P1
This result has been used (rather inefficiently) to compute the value of 7r. Instead, we shall use it as the theorem leading into the heart of geometric probability, following the ideas of Crofton and Sylvester.
1.2 The space of lines
Let Graff(2,1) denote the set of all straight lines in R2 (the reason for this notation shall be made clearer in Chapter 7). It is well known that this set enjoys some notable properties. To this end, denote by Z1 the number of intersections of a straight line taken at random with a straight line segment of length L 1 , and let Ai denote the invariant measure on Graff(2,1). The integral
r
Z1
JGraff(2,1)
dAi
depends only on L 1. Since Z1 takes only the values 0 or 1, this integral is equal to the measure of the set of all straight lines that meet the given straight line segment. Since the value of the integral depends only on the length L1 of the straight line segment, denote this value by f(L 1 ). We can now repeat the argument we used for the Buffon needle problem: given a polygonal line consisting of segments of length L 1 , L 2 , ••. , the number of intersections of a randomly chosen straight line with the polygonal line is
r
(Z1+Z2+ ... )dAr=f(L1+L2+ ... ).
JGraff(2,1)
Since integrals are linear, this becomes
r
JGraff(2,1)
Z1 dAr
+
r
JGraff(2,1)
Z2 dAr
+ ... =
f(L 1) + f(L 2) + ... ,
and we again conclude that f(L) = rL. We shall not normalize the measure Ai by setting r = 1; rather, we shall decide later what the 'right' normalization should be. Again we may pass to the limit. Recall that a subset K of the plane is convex if any two points x and y in K are the endpoints of a line segment lying inside K. A curve C in the plane is called convex if C encloses a convex subset. Let C be a convex curve in the plane of length L, and let
4
1 The Buffon needle problem
Zc be the number of intersections of C with a randomly chosen straight line. Then ( zcdAi = rL. lGraff(2,1) In particular, let K1 and K2 be compact convex sets in the plane with non-empty interiors, and with boundaries C 1 = oK1 and C 2 = oK2 of length L1 and L 2. For each i, we have (
ZCi
lGraff(2,1)
dAi
= rLi'
On the other hand, since Ki is convex, a straight line meets Ki either twice or not at all (excluding the limiting cases of tangents, which can be shown to have measure zero). Thus, the function ZC i takes either the value 2 or the value O. If we denote by Di the set of all straight lines in R2 that meet K i , then we have (
ZCi
dAi = 2Ai(Di ).
lGraff(2,1)
To re-state these results in terms of probability, assume that K1 ~ K 2. The conditional probability that a straight line shall meet the compact convex set K 1 , given that it meets K 2 , is the ratio
AI(D 1 ) AI(D2 ) • The computation above shows that this ratio is equal to
L1 length(oK1) = :----"'--=,--7:::-::-::'C:L2 length(oK2)" Note that the value of the normalization constant r is irrelevant to the computation of this conditional probability. The results above (sometimes designated Sylvester's theorem) can be compared to the analogous result for points: if K1 ~ K 2 , the conditional probability that a point taken at random shall belong to K 1 , given that it belongs to K 2 , is area(K1 ) area(K2)' Thus, we see a striking analogy: replacing every occurrence of the word 'point' by the word 'line' corresponds to replacing the word 'area' by the word 'perimeter.' This analogy suggests that a generalization of Sylvester's theorem to arbitrary dimension may prove worthwhile.
1.3 Notes
5
1.3 Notes The solution to Buffon's needle problem presented here is due to Barbier [5], and was later generalized still further by Crofton in [14, 15, 16]. Crofton's main paper, which set geometric probability on its modern footing, is the Encyclopaedia Britannica article [17]. It is still an excellent reference. In [95] Sylvester considered a variation of the Buffon needle problem in which the needle is replaced by a finite rigid collection of compact convex (and possibly disjoint) sets K 1 , .•. , Km tossed randomly into a plane tiled by evenly spaced lines. Sylvester then considered the cases in which a line meets one, some, or all of the sets K i . In the previous section we measured the set of all lines meeting a compact convex set K in the plane. When dealing with multiple convex sets Sylvester was led to consider also the measure of the set of lines that separate two disjoint compact convex regions of the plane. This theme has also been pursued extensively in the work of Ambartzumian [1, 2]. Buffon's result gives a very inefficient means of approximating the number 11"; for a history of this technique, see [30]. For additional modern treatments of geometric probability in the plane, see also [1, 2, 49, 82, 90].
2 Valuation and integral
In Chapter 1 we expressed the Buffon needle problem in terms of a set functional (1.1) on a certain collection of sets in the plane satisfying a certain kind of additivity. We then solved the problem by characterizing this additive functional in (1.2), using in this case the fact that the functional was monotonically increasing and invariant with respect to certain motions of sets in the plane. In this chapter we make more precise the notion of 'additive set functional', or valuation, on a lattice of sets. The abstract notions developed in this chapter will then be specialized to several different specific lattices in the chapters following, leading in turn to similarly elegant solutions to generalizations and analogues of Buffon's original problem. Section 2.2 is devoted to Groemer's integral theorem, which is needed to prove Groemer's extension theorems in Sections 4.1, 5.1, and 11.1.
2.1 Valuations We now introduce a class of set functions that comprise the most basic and important tools of geometric probability, namely valuations. We begin with partially ordered sets and lattices. A partial ordering S; on a set L is a relation satisfying the following conditions for all x, y, z E L.
(i) x S; x. (ii) If x S; y and y S; x then x = y. (iii) If x S; y and y S; z then x S; z. The partially ordered set L is called a lattice if, for all x, y E L, there exist a greatest lower bound (or meet) x 1\ y ELand a least upper bound (or join) x V y E L. A lattice L is said to be distributive if, for all x, y, z E L, we have the following.
2.1 Valuations
(i) x V (y 1\ z) = (x (ii) x 1\ (y V z) = (x
y) 1\ y) V
1\ V
7
(x V z). (x 1\ z).
Let S be a set, and let L be a family of subsets of S closed under finite unions and finite intersections. Such a family is clearly a distributive lattice, in which the partial ordering is given by subset inclusion, while the meet and join are given by intersection and union of sets, respectively. A valuation on a lattice L of sets is a function JL defined on L that takes real values, and that satisfies the following conditions: JL(A U B) = JL(A)
+ JL(B)
- JL(A
n B),
0 is the empty set.
JL(0) = 0, where
(2.1) (2.2)
By iterating the identity (2.1) we obtain the inclusion-exclusion principle for a valuation JL on a lattice L, namely JL(A 1 U A2 U ... U An) = L i
JL(Ai) - L JL(Ai i<j
n Aj) + L
JL(Ai
n Aj n A k) + ... (2.3)
i<j
for each positive integer n. If A is any subset of S, the indicator function (or simply the indicator) of A, denoted by lA, is the function on S given by • lA(S) = 1; sEA, • lA(S) = 0; s tJ. A.
A finite linear combination k
f = L(XilAi'
(2.4)
i=l
where (Xi E R, and Ai E L, is said to be an L-simple function, or a simple function for short. The set of all L-simple functions forms a ring under the usual operations on functions. Indicator functions satisfy the following properties:
By iteration of the identities (2.5) and (2.6) we obtain the inclusionexclusion formula for indicators,
2 Valuation and integral
8
IA 1UA 2 U... UA n = 1 - (1 - IAJ(1 - IA 2 ) · · · (1 - IAJ
+
= L1Ai - LIAinAj
i<j
L IAinAjnA k i<j
+....
(2.7)
A subset G of L that is closed under finite intersections is said to be a generating set of L when every element of L is a finite union of elements of G. Using the inclusion-exclusion formula for indicators, it can be shown that every L-simple function can be written as a finite linear combination r
f
= L{3i I B i
(2.8)
,
i=l
where Bi E G. A real-valued function v on G is called a valuation on G provided that v satisfies identities (2.1) and (2.2) for all sets A, BEG such that AUB E G as well. Note that, since G need not be closed under unions, identity (2.1) does not make sense for all pairs of sets A, BEG. Hence, there is no reason to assume that the identities (2.3) should hold for v if n > 2. Since every element BEL can be expressed as a union B = Bl U ... U Bn with B l , ... , Bn E G, we can attempt to extend v to a valuation p, on all of L by setting
p,(B)
=
Lv(Bi) - L V(Bi n Bj) i<j
+ ... ,
(2.9)
as is suggested by (2.3). There remains to check that p,(B) is well defined, in the case that B could be expressed as a union of elements of G in more than one way. Given a valuation p, on G, define the integral with respect to p, as follows. For an L-simple function f = alIA 1 + ... + akIA1, with Ai E G for 1 :S i :S k, define
Jf
k
dp,
=
L aiP,(A).
(2.10)
i=l
In general, a simple function f has infinitely many expressions of the form (2.4), for Ai in G. Consequently we must check that the integral in (2.10) is well defined.
2.2 Groemer's integral theorem
The existence of the extension (2.9) and the integral (2.10) turn out to be equivalent properties of p" a nontrivial fact stated formally as follows.
2.2 Groemer's integral theorem
9
Theorem 2.2.1 (Groemer's integral theorem) Let G be a generating set for a lattice L, and let f.L be a valuation on G. The following statements are equivalent.
(i) f.L extends uniquely to a valuation on L. (ii) f.L satisfies the inclusion-exclusion identities f.L(B 1UB 2 U···UB n ) = Lf.L(Bi )- Lf.L(BinBj )+ ... , (2.11) i<j
whenever Bi E G and B1 U B2 U··· U Bn E G, and for all n :2: 2. (iii) f.L defines an integral on the vector space of linear combinations of indicator functions of sets in L. Proof We prove the implications (i) ::::} (ii) ::::} (iii) ::::} (i). If f.L extends uniquely to a valuation on all of L, then (ii) follows from an iteration of identity (2.1). Therefore, (i) implies (ii). To show that (ii) implies (iii), suppose there exist non-empty distinct K 1, ... , Km E G and nonzero real numbers a1, ... , am such that m
LaiIKi =0,
(2.12)
i=l
while m
L aiJt(Ki)
=I 0.
(2.13)
i=l
Let L1 = K 1,···, Lm = K m, Lm+1 = K 1nK2, Lm+2 = K 1nK3 , and so on, to define a list L 1, L 2, ... , L p , comprising all possible intersections of the sets K i . Since G is closed under intersections, Li E G for all i. Note also that the collection {Li} is closed under intersections. Suppose that p
LaiILi =0,
(2.14)
i=q
while p
L
aif.L(Li ) =I 0,
(2.15)
i=q
where a q =I 0. Choose an instance of these equations such that q is maximal. It follows from (2.12) and (2.13) that q :2: 1, while the conditions (2.14) and (2.15) imply that q < p.
10
2 Valuation and integral Suppose that x E Lq -
U;=q+l L j •
Then (2.14) implies that
P
O'.q = LO'.ihi(x) = 0, i=q
contradicting our assumption. It follows that
so that
For i > q, note that Lq n Li = L j , where j > q. Using the principle of inclusion-exclusion (ii) we obtain
so that p
L
(3ij.l(L i )
-I- 0
(2.16)
i=q+l
by (2.15), where each (3i is obtained by collecting the terms containing j.l(Li). Meanwhile, application of the same inclusion-exclusion procedure to the indicator functions yields
so that p
L
f3ih i =0
(2.17)
i=q+l
by (2.14). Together (2.16) and (2.17) contradict the maximality of q. This completes the proof that (ii) implies (iii). To show that (iii) implies (i), suppose that the function j.l defines an integral on the space of L-simple functions. For A E L define j.l(A)
=
J
fA
dj.l.
The linearity of the integral together with the identity (2.6) implies that this extension of j.l is a valuation on L. D
2.3 Notes
11
A linear functional T on the vector space of simple functions determines a valuation JL by setting
for every A E L. It is easily verified that, for a simple function
T(f)
=
Jf
f,
dJL.
Thus, insofar as simple functions are concerned, there is a bijective correspondence between linear functionals and valuations. Let B (L) be the relative Boolean algebra generated by the distributive lattice L; that is, the smallest family of subsets of S containing L that is closed under finite unions, finite intersections, and relative complements. Note that for A, BEL (2.18) Let I( L) denote the algebra of simple functions generated by finite sums, products, and differences of indicator functions of sets in L. If follows from (2.5), (2.6), and (2.18) that Ie E I(L) for all C E B(L). Corollary 2.2.2 A valuation JL defined on a distributive lattice L has a uniquely defined extension to the Boolean algebra B (L ).
Proof By Theorem 2.2.1, JL defines an integral on the space of indicator functions I(L). For C E B(L) define JL(C)
=
J
Ie dJL.
The linearity of the integral together with identity (2.7) implies that this extension of JL is a valuation on B(L). D
2.3 Notes
The study of valuations, while natural enough on its own as a finitely additive precursor to the measure theory of modern probability, was invigorated especially by interest in dissection problems on polytopes, and Hilbert's third problem in particular [8, 71, 72, 81] (see also Section 8.6). In the sections that follow we shall see that most of the interesting functionals of geometric probability satisfy the valuation property in some respect.
12
2 Valuation and integral
The integral and extension theorems of Groemer may be found in [32). McMullen and Schneider gave a thorough survey of the modern theory of valuations on convex bodies in [72), later updated by McMullen in
[71).
3 A discrete lattice
In this chapter we focus on combinatorial properties of the lattice of subsets of a finite set, properties which carryover in analogous forms to the lattice of parallelotopes in Chapter 4, of subspaces in Chapter 6, of polyconvex sets in Chapters 5, 7-10, and of spherical polyconvex sets in Chapter 11. An especially important result of this chapter is the characterization of valuations invariant under the permutation group. The idea of characterizing valuations invariant with respect to a group action or a set of symmetries is central to our treatment of geometric probability; this theme will recur frequently in the chapters following.
3.1 Subsets of a finite set Let S be a non-empty set with n elements, and denote by P(S) the set of all subsets of S, partially ordered by subset inclusion. The set P( S) is a (finite) Boolean algebra of subsets. Recall that the union and intersection of sets coincide with the least upper bound and greatest lower bound in the partially ordered set P(S). We denote the elements of P(S) by lower case letters x, y, etc. A segment of P(S), denoted by [x, y], where x ::::: y, consists of all elements Z E P(S) such that x::::: Z ::::: y. Every segment [x, y) is naturally isomorphic to the Boolean algebra P(y - x). A chain in P(S) is a linearly ordered subset; that is, a subset in which, for every pair x, y, either x ::::: y or y ::::: x. An antichain is a subset A <:;; P(S) such that, if x, YEA, then neither x < y nor y < x. A flag F in P(S) is a maximal chain; that is, a chain such that if G ;;;? F and G is a chain, then G = F. These notions apply to all partially ordered sets.
14
3 A discrete lattice
For x E P(S), the rank r(x) is the number of elements of the set x. The antichain consisting of all elements of P(S) of rank k shall be denoted by Pk(S). The size, or number of elements, of Pk(S) is the binomial coefficient
A flag in P(S) is naturally identified with a linear order (Sl' S2,"" sn) on the one-element subsets Si of S. Hence, there are n! flags in P(S). A flag contains an element x E Pk (S) whenever x = Sl U S2 U ... U Sk. Thus, there are k!(n - k)! flags containing a particular x E Pk(S). This elementary argument gives the classical expression for the binomial coefficient:
n! ( n) k - k!(n - k)!' Actually, the same argument can be made to yield a much stronger result. Denote by IAI the size of a finite set A. Theorem 3.1.1 (Sperner's theorem) Let A be an antichain in P(S).
Then
IAIS
((n~2))'
Here the expression (n/2) denotes the greatest integer smaller than or equal to n/2. Evidently equality is attained in Theorem 3.1.1 when A = P(n/2) (S). Proof The proof of this theorem depends on a more precise result, known as the Lubell-Yamamoto-Meshalkin (L.Y.M.) inequality. Let A be an antichain in P(S), and let Ak consist of all elements of A of rank k. Then (3.1) To prove (3.1), notice that every flag meets A in at most one element of P(S). Therefore, the number p of flags meeting A is n
p
=
L k!(n k=O
k)!/Akl·
15
3.1 Subsets of a finite set
Since there are n! flags in P( S), we have p :::; nL Dividing by n! proves the L.Y.M. inequality (3.1). To complete the proof of Sperner's theorem, recall that
for all 0 :::; k :::; n. The L.Y.M. inequality (3.1) now gives
t~
k=O (~) -
((n/2)) -
,
that is,
D
Suppose that 1 :::; r :::; n + 1 is a integer. A subset F ~ P(S) is called an r-family if chains in F contain no more than r elements. For example, an antichain is a I-family. Given an r-family F in P(S) let Fk = FnPk(S). Since every flag in P(S) meets F in at most r elements, we have n
Lk!(n - k)!/Fk/ :::; n!· r. k=O
We then obtain the following generalization of (3.1): n
/Fk /
k=O
k
(3.2)
L-(n) :::;r.
This inequality leads in turn to a generalization of Sperner's theorem to r-families: Theorem 3.1.2 Let F be an r-family in P(S). Then
In order to prove Theorem 3.1.2 we make use of the following lemma.
Lemma 3.1.3 Suppose that 0:::; i :::; n, and if Xo
Co
2:
+ Xl + ... + Xn
CI
2:
2: ... 2:
Co
Cn
>
o.
If Ci 2:
+ CI + ... + Cr-l,
Xi
2: 0 for (3.3)
16
3 A discrete lattice
then (3.4) If Co > ... > Cn > 0, then equality holds in (3·4) if and only if Xi for 0 :::; i :::; r - 1 and Xi = 0 for r :::; i :::; n. Proof Suppose that Xo, . .. ,X n
:::::
= Ci
0 minimize the sum (3.5)
subject to the condition (3.3). If this minimum value of (3.5) is less than r then Xi < Ci for some i :::; r - 1. Let i be the smallest index such that Xi < Ci, so that Xk/Ck = 1 for all 0:::; k < i (unless i = 0). The inequality (3.3) then implies that there is an index j ::::: r such that Xj > O. Let j be the largest such index, so that Xk = 0 for k > j (unless j = n). In particular, note that j > i. Suppose Ci = Cj. Since Ci ::::: CHI··· ::::: Cj, we have Ci = CHI = ... = Cj. It then follows that Co
+ ... + Ci-I + Xi + ... + Xj
+ ... +xn > Co + ... + Cr-I Co + ... + Ci-I + (r Xo
i)Ci,
since r - 1 < j. Therefore, Xi
It follows that
+ ... + Xj
:::::
(r -
i)Ci.
(3.6)
3.1 Subsets of a finite set
17
by (3.6). In other words, n
L
Xk
2:
(3.7)
r.
k=O Ck
This contradicts the assumption that Xo, .•. , Xn Illllllmizes (3.5) at a value less than r. Therefore, inequality (3.4) follows. Now suppose instead that Ci > Cj. If Xi + Xj :s; Ci set Yi = Xi + Xj and Yj = O. Otherwise, set Yi = Ci and Yj = Xj - (Ci - Xi). In either case set all other Yk = Xk· Since Ci > Cj it follows in both cases that Yi
+ Yj <
Xi
Ci
Cj
Ci
+ Xj, Cj
so that n
L
n Yk
k=O Ck
Xk
,
k=O Ck
contradicting the assumption that Xo, • .• ,X n minimizes (3.5). Inequality (3.4) now follows again for this case. If Co > ... > Cn, then Ci > Cj is guaranteed, and so it follows that (3.5) is minimized only if Xi = Ci for 0 :s; i :s; r - 1 and Xi = 0 for r :s; i :s; n. D
Proof of Theorem 3.1.2 Relabel the binomial coefficients Co, CI,···, Cn in descending order, so that Co 2: CI 2: ... 2: en > 0; then relabel the numerators /Fk / in (3.2) by xo, Xl, •.. , X n , so that each Xk is the numerator of that term of (3.2) having Ck as denominator. The inequality (3.2) now becomes
It then follows from Lemma 3.1.3 that Xo
+ Xl + ... + Xn :s; Co + CI + ... + Cr-l·
In other words,
D
The theory of binomial coefficients and antichains generalizes nicely to a theory of multinomial coefficients and special collections of ordered
3 A discrete lattice
18
partitions, known as s-systems. Sperner's result can also be generalized from a bound on the size of an antichain to a bound on the size of an s-system. A map 15 : {I, ... ,r} --+ P(8) is called an r-decomposition of 8 if (i) 8(i) n 8(j) = 0 for i i= j, and (ii) 15(1) U··· U 8(r) = 8. Denote by Dec(8, r) the set of all r-decompositions of 8. Note that for each 15 E Dec(8, r) 115(1)1
+ ... + 18(r)1 =
n.
Given non-negative integers aI, a2, ... , ar such that al + ... + ar = n we denote by Pal , ... ,a r (8) the set of all r-decompositions 15 such that 18(i)1 = ai for i = 1, ... ,r. In other words, Pal , ... ,ar (8) is the set of all (ordered) partitions of 8 into disjoint unions of subsets having sizes aI, ... , a r . Evidently the set Dec( 8, r) can be expressed as the finite disjoint union Dec(8,r) The size of
Pal, ... ,ar
=
(8) is given by the multinomial coefficient
An s-system of order r (or an s-system in Dec(8, r)) is a subset (J Dec( 8, r) such that the set {15 (i) : 15 E (J}
~
(3.8)
is an antichain in P(8) for each 1 ::::; i ::::; r. An obvious example of an s-system of order r is Pal , ... ,a r (8) for some admissible selection of aI, ... , a r . If 15, ( E Pal, ... ,a r (8) then 15 (i) and ((i) both have size ai, so that either 8(i) = ((i) or the two sets are incomparable in the subset partial ordering on P(8). This holds for i = 1, ... , r, and so the antichain condition on (3.8) is satisfied. Other disguised examples with which we have already worked are the s-systems of order 2. Let A be an antichain in P(8). For each x E A we can express 8 as the disjoint union x l±J 8 - x, so that the pair (x, 8 - x) is a 2-decomposition in Dec(8, 2). Moreover, the set
{8-x:xEA}
19
3.1 Subsets of a finite set
is also an antichain in P(S), so that the set IT
= {(x, S - x) : x E A}
is an s-system of order 2. Thus the notion of s-system is a generalization ofthe notion of an antichain. Similarly, the collection Pk(S) can also be viewed as Pk,n-k(S) through the bijection x f-+ (x, S - x). For 15 E Pa1, ... ,ar(S) we say that a flag (XO,XI,""X n ) in P(S) is compatible with 15 if (i) x a1 = 15(1), and (ii) x a1 +'+ ai - xal+.+ai_l = 8(i), for i :2: 2. Here the difference X a1 +.+ai - X a1 +.+ai-l denotes the complement of the set xal+.+ai_l inside the larger set x a1 +.+ ai . For A <:;;: Pa1, ... ,a r , let Flag(A) be the set of all flags (xo, Xl,.'" Xn) compatible with some 15 E A. Note that for each 15 E Pa1, ... ,ar(S), there are exactly al!a2!'" ar! flags compatible with 15; that is, to choose a flag compatible with 15 one must choose a permutation of the al elements of 15(1), of which there are all, and then a permutation of the a2 elements of 15(2), of which there are a2!, and so on up through 8(r). Since each of the n! flags is compatible with one (and only one) r-decomposition in Pa1, ... ,ar(S), it follows that
Just as Sperner's Theorem 3.1.1 gives the maximum possible measure for an antichain A in P(S), a generalization of this theorem gives the maximum possible size for an s-system in Dec( S, r). En route to such a generalization we prove a multinomial version of the L.Y.M. inequality.
Theorem 3.1.4 (The multinomial L.Y.M. inequality) Let Dec(S, r) be an s-system. For
so that IT=
al
+ ... + ar =
IT
C
n let
u
is a disjoint union. Then
(3.9)
20
3 A discrete lattice
Proof For al + .. ·+ar is given by
=
n the number of flags compatible with aal, ... ,a r
IFlag(aal, ... ,ar )1
=
laal, ... ,ar lal"" ar·, ·
Suppose that a flag (XO,Xl, ... ,Xn ) is compatible with both ,,(,8 E a. Then "(1) = x a1 and 8(1) = Xbll where al = h(l)1 and bl = 18(1)1. Since (XO, Xl,"" Xn) is a flag, we have Xa1 ~ Xbl or vice versa. However, a is an s-system, so that either "(1) = 8(1) or the two sets are incomparable. Therefore "(1) = 8(1) and al = bl . Continuing, we have "(2) = x a1 +a2 - Xa1 and 8(2) = Xbl+b2 - x a1 (since al = bI). A similar argument then implies that "(2) = 8(2) and a2 = b2 . Continuing in this manner we conclude that "(i) = 8(i) for each 1 ~ i ~ r, so that "( = 8. In other words, every flag in P(S) is compatible with at most one r-decomposition 8 E a. It follows that
ar', la al, ... ,a r lall.·· .
L
IFlag(aal, ... ,aJI
=
IFlag(a)I ~ n!
al+···+ar=n so that
D
We are now ready to prove a multinomial generalization of Sperner's Theorem 3.1.1. Theorem 3.1.5 (Meshalkin's theorem) Let a be an s-system in Dec(S, r). Then
lal
~ ((n/r) , ... , (n/r), (n7r) + 1"", (n/r) + 1) , , v
r-b
",
v
(3.10)
",
b
where n == b mod r. Here (n/r) denotes the largest positive integer less than or equal to n/r.
Proof It is not difficult to show that, for all compositions al + .. ·+ar = n of a positive integer n, we have
3.2 Valuations on a simplicial complex
21
For a sketch of a proof, see the analogous Proposition 6.5.2. Let aal, ... ,a r = an Pa1 ,... ,ar (8). It now follows from (3.9) that
L
al
(
L
/aal, ... ,arl ) < /za1,;;.,a)/ ::; 1, +·+ar=n (n/r), .. ,(n/r),(n/r)+l, .. ,(n/r)+l al +·+ar=n al,···,ar
so that
/a/
al+~r=laal, ... ,ar/ ::; Cn/r) , .'.. , (n/r) , (n7r) + 1"", (n/r) + 1)' D
Several other results of extremal set theory follow from the L.Y.M. inequality or its variants, but we reluctantly move on to the next topic.
3.2 Valuations on a simplicial complex Define a simplicial complex to be a subset A of P( 8) such that if x E A and y ::; x then YEA. A simplicial complex is a partially ordered set in the order induced by P(8). The set of maximal elements of a simplicial complex is an antichain. A simplicial complex having exactly one maximal element is called a simplex. A simplex whose unique maximal element is a set of size k is called a k-simplex. The (set-theoretic) union and intersection of any number of simplicial complexes is again a simplicial complex. Thus, the set L(8) of all simplicial complexes in P(8) is a distributive lattice, and we can study valuations on L(8). For x E P(8), denote by x the simplex whose maximal element is X; that is, the set of all y E P(8) such that y ::; x. It follows from Groemer's integral Theorem 2.2.1 and Corollary 2.2.2 that every valuation fJ on L(8) extends uniquely to a valuation, again denoted by fJ, on the Boolean algebra P(P(8)) of all subsets of P(8), which is generated by L(8). Such a valuation is evidently determined by its value on the one-element subsets of P( 8); that is, by arbitrarily assigning a value fJ( {x}) for each X E P(8). ,Let x be of rank k, and let A 1, A 2 , ••. ,Ak be the maximal simplices Ai E x such that Ai i=- x. (These simplices are sometimes called the facets of x.) Then
3 A discrete lattice
22
The right-hand side can be computed in terms of simplices of lower rank, by the inclusion-exclusion principle. Thus, by induction on the rank, we have the following theorem.
Theorem 3.2.1 Every valuation J-l on the distributive lattice L(8) of all simplicial complexes is uniquely determined by the values J-l(X) , x E P(8). The values J-l(x) may be arbitrarily assigned. D A valuation J-l on L(8) is called invariant if it is invariant under the group of permutations of the set 8; that is, if J-l(A) = J-l(gA) for every simplicial complex A and for every permutation 9 of the set 8 (which induces a permutation on L(8), also denoted by g). We next establish the existence of the Euler characteristic. The following is an immediate consequence of Theorem 3.2.1.
Theorem 3.2.2 (The existence of the Euler characteristic) There exists a unique invariant valuation J-l on L(8), called the Euler characteristic, such that J-lo(x) = 1 for every simplex x with rex) > 0, and such D that J-lo(0) = 0. Next, we derive the classical alternating formula for the Euler characteristic. Define a valuation on P(P(8)), denoted J-l~, by setting
J-l~(0) = 0, and J-l~({x})
= (-I)k-l,
if rex) = k. Then
J-l~(X)
=
L
J-l~( {y})
y<;'x
L
J-l~({Y}) +
y<;,x,r(y)=l
Since the simplex
L
J-l~({Y})
+ ... + J-l~({x}).
y<;,x,r(y)=2
x contains
elements of rank j, the right-hand side simplifies to
G) -G) + G) -... +
(_I)k+l
G)
=
1,
3.2 Valuations on a simplicial complex
23
so that JL~(x) = JLo(x), for all simplices x. It now follows from Theorem 3.2.1 that JL~ = JLo, and that the following formula holds.
Theorem 3.2.3 (The discrete Euler formula) Let A be a simplicial complex, and let fk be the number of elements {or 'faces'} of rank k. Then
JLo(A) =
h - h + h - ....
(3.11) D
For i > 0, set
and extend JLi to all of L(8) by Theorem 3.2.1. Clearly, for every simplicial complex A,
The discrete Euler formula can now be rewritten
JLo(A)
=
JL1(A) -1L2(A)
+ JL3(A) -
... ,
(3.12)
for any simplicial complex A. The valuations JLk can also be expressed in terms of symmetric functions. Let P 1(8) = {a1' a2, ... , an}. Given the symmetric function
let ti(X) = 1 if ai E x and ti(X) = 0 if ai fj. x. Evaluated at x, for x f 0, we have til ti2 ... tik = 1 if {ail' ... ,aik} E x, and til ti2 ... tik = 0 otherwise. It follows that ek(t1, t2, ... , tn) = JLk(X) for every simplex x other than {0}. Note also that if x E P(8) and r(x) = j then JLi({X}) = 1 if i = j, while JLi({X}) = 0 ifi fj.
Theorem 3.2.4 (The discrete basis theorem) The invariant valuations JLo, JL1, ... ,JLn span the vector space of all invariant valuations JL on L(8) such that JL( {0}) = O. The only linear relation among them is formula {3.12}.
Proof Suppose JL is an invariant valuation on L(8) such that JL( {0}) = o. Extend JL to all of P(P(8)). Note that the extended valuation, which is still denoted JL, is again invariant. If x and y have the same rank in P(8), say r(x) = r(y) = i, then there exists a permutation g of 8 such
3 A discrete lattice
24
that gx = y. Therefore, JL( {x}) Thus, the valuation
= JL( {y}) = Ci, for some constant Ci'
vanishes on all singleton sets {x}, for all x E P(8), and therefore vanishes on all of P(P(8)). D As an application of the discrete basis theorem, we shall derive a discrete analogue of the kinematic formula (whose classical geometric version appears in Chapter 10). One way to construct invariant valuations on L(8) is the following. Start with any valuation JL on L(8) such that JL( {0}) = 0, and let B be any simplicial complex. For any simplicial complex A, set
JL(A; B)
1
= , L JL(A n gB), n.
9
where 9 ranges over all permutations of the set 8 of size n. For fixed A, the set function JL(A; B) is a valuation in the variable B; in fact, it is an invariant valuation. It can therefore be expressed as a linear combination of the valuations JLi, with coefficients ci(A) depending on A: n
JL(A; B)
L
=
ci(A)JLi(B).
(3.13)
i=l
Meanwhile, for fixed B, the set function JL(A; B) is a valuation in the variable A. From this it follows that each of the coefficients ci(A) is a valuation in the variable A. The coefficient ci(A) can be given an explicit expression in terms of JL. This follows from the fact that if x E P(8) and r ( x) = j then JLi ({x}) = 1 if i = j and is zero if i =f- j. Consider the case in which JL is an invariant valuation. If so, then
JL(A;B)
1
,LJL(AngB) n. 9
1
, LJL(gA n B) n.
=, 1
n.
LJL(g-lAnB) 9
= JL(B; A).
9
Moreover, the coefficients ci(A) are now invariant valuations in the variable A. Therefore, Theorem 3.2.4 implies that n
JL(A; B)
=
L i,j=l
CijJLi(A)JLj(B).
25
3.2 Valuations on a simplicial complex
Since JL(A; B) = JL(B; A), it is evident that Cij = Cji. It turns out that most of the constants Cij are equal to zero. In order to compute the constants Cij explicitly, extend the valuation JL to the Boolean algebra P(P(8)) generated by L(8), and let (Vi denote the value of JL on a singleton set in P(P(8)) whose element is a subset of 8 of size i.
Theorem 3.2.5 (The discrete kinematic formula) Suppose that JL is an invariant valuation on L( 8). For all A, B E L( 8),
Proof Suppose that Xi, Yj C 8 with size i and j respectively. Let A = {Xi} and B = {Yj}. For any permutation 9 of 8, the set An gB = 0 if i =f- j. If i = j then An gB = 0 if Xi =f- gYj. Since there are i!(n - i)! permutations 9 of 8 such that Xi = gYj (if i = j), we have
JL(A;B)
1 i'(n - i)' =, LJL(AngB) =. , 'JL(A) =
n.
n.
9
(n)-l
.
(Vi·
2
Meanwhile, JLk(A) = 1 if k = i and is equal to zero otherwise. Similarly, JLk(B) = 1 if k = j and is equal to zero otherwise. Hence, n
JL(A; B) = L CijJLi(A)JLj(B) = Cij' i,j=l Therefore,
if i
=j
and is equal to zero otherwise.
D
We shall be particularly concerned with the case JL = JLo. The discrete Euler formula (3.11) implies that JLo({xd) = (-I)i+I, so that
~! ~ JLo(A n gB) =
t(
_1)i+l
(~)
-1 JLi(A)JLi(B),
for all A, BE L(8). If x and 'fJ are simplices, then either x n 'fJ is a smaller (non-empty) simplex, in which case JLo(x n 'fJ) = 1, or x n 'fJ = 0, in which case
26
3 A discrete lattice
J-lo (x
n y) = O. The probability that a randomly chosen k-simplex
Xk
shall meet a fixed l-simplex Yl can now be computed as follows:
so that 1
n!
~J-lO(YI n gXk) =
t;( _1),+1 (n) (l) (k) n.
-1
iii .
(3.14)
The equation (3.14) leads to a notable example of how the discrete kinematic formula can be used to generate identities for the binomial coefficients. To generate such an identity, we use elementary probabilistic reasoning to compute instead the probability that Yl n gXk = 0, for a random permutation g. Label the elements of S by {Sl, .. . , Sn} so that Xk = {Sl, ... ,Sk}. In order for Yl ngxk = 0 to hold, we require gSl E S-Yl, of which there are n-l choices. There then remain n-(l+I) possible values for gS2, etc., so that there are (n -l)(n -l-I)··· (n - l - k
+ 1)
choices of values for gSl, ... , gSk. Having chosen these values, there are n - k possible choices remaining for gSk+1, then n - k -1 possible values for gSk+2, and so on, up to one possible value remaining for gSn. It follows that there are (n -l)··· (n -l - k
+ 1)(n _
k) ... 1 = (n -l)!(n - k)! (n - k -l)!
permutations g of S such that Yl n gXk = 0. Hence the probability that Yl n gXk = 0 for a random permutation g is given by
~ (n -l)!(n - k)! n!
(n-k-l)!
=
k!(n - k)!(n -l)! = (n)-l (n -l) n!k!(n-k-l)! k k'
It now follows from (3.14) that
(3.15) By adding the term corresponding to i = 0 to both sides of (3.15) and multiplying by -1 we obtain the following identity:
3.2 Valuations on a simplicial complex
27
Theorem 3.2.6
for all positive integers 0 :::; k, l :::; n. Note that, if k
0
+ l > n, then
In the preceding argument this corresponds to the case in which the two sets Yl and gXk have non-empty overlap for any permutation g, i.e. the case in which Yl n gXk = 0 with probability zero. We conclude this discussion of subsets and simplices with an application of the results of Section 3.1 to a question posed by Spemer. Let A E £(8), and suppose that all ofthe maximal elements of A have rank kj i.e. suppose that every face of A is contained in a k-face of A. Let [All denote the collection of all l-faces of A, for 0 :::; l :::; k. Is there a lower bound for the number of l-faces I[Altl, given the number of k-faces (maximal faces) I[Alkl? The L.Y.M. inequality gives one answer to this question. Theorem 3.2.7 Suppose A E £(S) such that every maximal element of A has rank k. For 0 :::; l :::; k,
I[Alti ~
k!(n - k)! l!(n -l)! I[Alkl·
Proof Let Bl = PI(S) - [A]t. If Y E Bl then Y cannot be contained in any x E A. In other words, the set [Alk UBI is an antichain. It then follows from (3.1) that
Meanwhile, IBtI
so that
=
IPI(S) - [Altl
=
(7) -1[Altl,
28
3 A discrete lattice
It follows that
I[All! ~
(n) -1 k!(n - k)! ( n) l k I[Alkl = l!(n -l)! I[Alkl·
o For example, in the case l
=k-
1, we have
3.3 A discrete analogue of Helly's theorem We turn next to a special case and discrete analogue of Helly's theorem, whose classical geometric version is given in Chapter 5.
Theorem 3.3.1 (The discrete Helly theorem) Let S be a finite set of size lSI = n, and let F be a family of subsets of s. Suppose that, for any subset G <:;;: F such that IGI ::; n (that is, every subfamily of cardinality at most n of F),
n
A#0.
n
A#0.
AEG
Then
AEF
In other words, if every n elements of F have non-empty intersection, then the entire family F of subsets has non-empty intersection. Proof If IFI ::; n the result is trivial. Suppose that Theorem 3.3.1 holds for IFI = m, for some m ~ n. We show that the theorem also holds for IFI = m + l. Write S = {Sl, ... , sn} and F = {AI, ... ,Am+ 1 }, where Ai <:;;: S. For each 1 ::; j ::; m + 1 denote by L j the intersection
Our induction assumption for the case IFI = m implies that each L j is non-empty. Therefore, there exists Sij E L j for each j. However, since m + 1 > n there must be s E S such that s E Ljl n Liz for some j1 # j2. Since s E Ljp we have s E Ai for all i # j1. Similarly, s E Ai for all
3.4 Notes
29
i =f- j2. Hence, m+l
sE
n A= n A.
i=l
AEF
o 3.4 Notes For a general reference to the theory of lattices and partially ordered sets, see [92] (also [3, 27,42, 80]). For a graph-theoretic viewpoint, see [7]. In [84], Schanuel developed the Euler characteristic from a categorytheoretic perspective. In [12], Chen extended the Euler characteristic to linear combinations of indicator functions of unbounded closed convex sets and unbounded relatively open convex sets. A thorough treatment of the combinatorial theory of the Euler characteristic appeared in [79, 80]. The face enumerators J-li playa role analogous to that of the intrinsic volumes on parallelotopes and polyconvex sets (see Sections 4.2 and 7.2). The discrete basis theorem and kinematic formulas can be extended to the more general setting of finite vector spaces; see [53]. Sperner's Theorem 3.1.1 first appeared in [91], as did Theorem 3.2.7. The proof presented in this section is due to Lubell [63]. Meshalkin proved Theorem 3.1.5 in [73]. In this section we follow a more simplified approach due to Hochberg and Hirsch [45]. The generalization of Sperner's theorem to r-families (Theorem 3.1.2) was originally due to Erdos [22], but the proof given in this section is due to Harper and Rota [42]. While Theorem 3.2.7 gives a lower bound to the number of l-faces of a simplicial complex A, all of whose maximal elements have dimension k, Katona [48] and Kruskal [57] independently obtained a much stronger result, giving the exact minimum size of [A]l, a minimum that is independent of the size of the ambient set S. Katona's original paper may also be found in [27]. For a shorter proof of the Katona-Kruskal theorem, see [3, 7]. A survey of results in extremal set theory, including generalizations of Sperner's theorem and the L.Y.M. inequality, was presented in [29]. The discrete Helly Theorem 3.3.1 is a special case of a more general theorem of convex geometry. This geometric result is treated in greater detail in Chapter 5 (see also [85, pp. 3-5]). Helly's original theorem motivated considerable developments in the field of combinatorial geometry, and 'Helly-type' theorems are now common in many branches of mathematics [7, 18, 21].
4 The intrinsic volumes for parallelotopes
We develop next a theory of invariant valuations for the lattice of finite unions of orthogonal parallelotopes, having edges parallel to a fixed frame. Many of the central results in geometric probability can be stated and proven easily in the context of parallelotopes. For this reason the lattice of parallelotopes will serve as a model for the more difficult task of developing a lattice theory for finite unions of compact convex sets in Rn. The Euler characteristic, intrinsic volumes, and valuation characterization theorems for the lattice of parallelotopes will serve as prototypes in Chapters 5-9 for analogous constructions and characterization theorems in the more general context of polyconvex sets.
4.1 The lattice of parallelotopes Choose a Cartesian coordinate system in Rn, which shall remain fixed throughout this chapter, and let Par(n) denote the family of sets that are obtained by taking finite unions and intersections of orthogonal parallelotopes (i.e. rectilinear boxes), with sides parallel to the coordinate axes. If P E Par(n), we shall say that P is of dimension n (or has full dimension) if P is not contained in a finite union of hyperplanes of Rn; that is, if P has a non-empty interior. Otherwise, we shall say that P is of lower dimension. (Recall that a hyperplane in R n is a plane of dimension n - 1, not necessarily through the origin.) In general, a set P E Par( n) has dimension k if P is contained in a finite union of k-planes in R n, but is not contained in any finite union of k - 1 planes. Note that Par(n) is closed under finite unions and intersections. This follows from the basic fact that the intersection of two parallelotopes is a parallelotope. In other words, Par(n) is a distributive lattice.
31
4.1 The lattice of parallelotopes
Denote by Tn the group generated by translations and permutations of coordinates in Rn. For A ~ Rn and 9 E Tn, write
gA = g(A) = {g(a) : a E A}. A valuation f..L defined on Par( n) is said to be invariant when
f..L(gP)
=
(4.1)
f..L(P)
for all 9 E Tn and all P E Par(n). If f..L(gP) = f..L(P) is known to hold only for translations 9 of R n, then we shall say that f..L is translation invariant. The object of this section is to determine all invariant valuations defined on Par(n). To avoid pathological cases, we shall impose a continuity condition on the valuations to be considered. For A ~ Rn and x E R n , the distance d(x, A) from the point x to the set A is given by
d(x, A)
inf d(x, a),
=
aEA
where d(x, a) = Ix - al is the usual distance between points in Rn. Note that d(x, A) = 0 if x E A or if x is a limit point of A. For K,L ~ Rn, the Hausdorff distance 8(K,L) is defined by
(4.2)
8(K, L) = max (sup d(a, L), supd(b, K)) . aEK
bEL
If K and L are compact, then 8(K,L) = 0 if and only if K = L. A sequence of compact subsets Kn of Rn converges to a set K, or Kn --+ K, if 8(Kn' K) --+ 0 as n --t 00. Let Bn denote the unit ball in Rn. For K ~ Rn compact and f > 0, define
K
+ fB n =
{x
+ EU
:
x E K and u E Bn}.
The following lemma gives an easier and more practical way to think about the distance 8. Lemma 4.1.1 Let K, L ~ Rn be compact sets. Then 8(K, L) ::::; only if K ~ L + fB n and L ~ K + fB n .
f
if and
Proof Suppose that K ~ L + fBn. For x E K there exist y ELand u E Bn such that x = y + EU. In other words, Ix - yl : : ; f, so that d(x, L) ::::; f. Similarly, if L ~ K + fBn, then d(y, K) ::::; f for all y E L. Therefore, 8(K, L) ::::; f.
32
4 The intrinsic volumes for parallelotopes
Meanwhile, if there exists x E K such that x 1:. L + fBn (or vice versa), then Ix - yl > f for all y E L, so that 8(K, L) ;::: d(x, L) > f. 0 In view of Lemma 4.1.1, we see that a sequence of compact subsets Kn -----t K, if for f > 0 there exists N > 0 such that K c:;:: Ki + fBn and Ki c:;:: K + fBn whenever i > N. Theorem 4.1.2 The distance 8 defines a metric on the set of all compact subsets ofRn. Proof The distance 8 is clearly symmetric and non-negative. To verif:Y the triangle inequality, suppose that K, L, M c:;:: Rn are compact. Let fl = 8(K, M) and f2 = 8(L, M). By Lemma 4.1.1, K c:;:: M + f1B n and M c:;:: L + f2Bn' so that K c:;:: L + f2Bn + f1B n
= L + (f2 +
fdB n .
,
Similarly, L c:;:: K +(f2+fl)Bn. Lemma 4.1.1 then implies that 8(K, L) :s;
fl + f2.
0
We now focus once again on the lattice Par( n) of finite unions of parallelotopes. A valuation f..L is said to be continuous on Par(n), provided that
whenever Pi, P are parallelotopes (and not just finite unions) and Pi -----t P. Another condition that will prove useful is monotonicity. A valuation f..L is said to be increasing on Par(n), provided that f..L(P) :s; f..L(Q), whenever P, Q E Par(n) and P c:;:: Q. Similarly one defines decreasing valuations. A valuation f..L is said to be monotone on Par(n), if f..L is either an increasing valuation or a decreasing valuation. When studying valuations on Par(n) we may restrict our attention to the generating set of parallelotopes in R n with edges parallel to the coordinate axes. Spe~ifically, we have the following extension theorem. Theorem 4.1.3 (Groemer's extension theorem for Par(n)) A valuation f..L defined on parallelotopes with edges parallel to the coordinate axes admits a unique extension to a valuation on the lattice Par( n). Proof In view of Groemer's integral Theorem 2.2.1, it is sufficient to show that f..L defines an integral on the space of indicator functions of parallelotopes.
4.1 The lattice of parallelotopes
33
The proposition is trivial in dimension zero. Assume that the proposition holds in dimension n - 1. Suppose that there exist distinct parallelotopes PI, ... , Pm such that m
(4.3)
LaJPi =0 i=l
while m
L aiJ.l(Pi ) =
(4.4)
1.
i=l
Let k be the number of parallelotopes Pi in the expressions above of full dimension n, and suppose that k is minimal over all possible such expressions. If k = 0 then PI, ... , Pm are each contained inside a hyperplane. Let l denote the (finite) number of hyperplanes containing the parallelotopes PI"'" Pm, and suppose that l is minimal over all such expressions. If l = 1 then PI,'" , Pm are all contained in a single hyperplane. It then follows from the induction assumption (on the dimension of the ambient Euclidean space) that we have a contradiction. Therefore l > 1, and there exist hyperplanes HI"'" HI, orthogonal to the coordinate axes, such that Pi ~ HI U ... U Hz for i = 1, ... ,m. Suppose, without loss of generality, that PI ~ HI. Since IpinHl = IpJHll it follows from (4.3) that m
(4.5)
L aJpinHl = O. i=l
Meanwhile, m
L aiJ.l(Pi n Hd =
0,
(4.6)
i=l
by the induction assumption on dimension, since each Pi n HI ~ HI, a hyperplane. Subtracting equations (4.5) and (4.6) from (4.3) and (4.4) respectively, we have m
L ai(Ipi - IpinHJ = 0,
(4.7)
i=l
and m
L ai(J.l(Pi ) ,-- J.l(Pi n HI)) = 1. i=l
(4.8)
34
4 The intrinsic volumes for parallelotopes
Since PI n HI = PI, equations (4.7) and (4.8) take the form of (4.3) and (4.4), where the nonzero terms involve parallelotopes Pi in at most l - 1 hyperplanes, contradicting the minimality of l. It follows that k 2: 1. Suppose then that PI has dimension n. Choose a hyperplane H, with associated closed half-spaces H+ and H- such that PI n H is a facet of PI, oriented so that PI c H+. Since I Pi nH + = Ip,!H+, it follows from (4.3) that m
= O.
L <Xi1pinH+ i=l
Similarly, m
m
L<XilpinH
=
0 and
i=l
L<xJpi nH -
=
o.
i=l
Meanwhile, since JL is a valuation, m
m
m
m
i=l
i=l
i=l
i=l
Since the sets Pi n H lie inside a space of dimension n - 1, the sum
2::1 <XiJL(Pi n H) = 0 by the induction assumption. Because PI n Hhas dimension n -1, the sum 2::1 <XiJL(Pi nH-) = 0 by the minimality of k. From (4.4) we have m
m
i=l
i=l
There are n hyperplanes HI"'" Hn such that n
By iterating the preceding argument, we have m
L <XiJL(Pi n Hi n··· n H:}:) i=l
so that m
L <XiJL(Pi n Pd
=
i=l
while a similar argument using (4.3) gives m
L<XiIPinPl i=l
= O.
1,
=
1,
4.2 Invariant valuations on parallelotopes
35
After repeating this argument with parallelotopes P2 , ••. , Pm we have
~aiJ-l(PI n .. · nPm) = (~ai) J-l(PI n .. · nPm) =
°
This implies that al + .. ·+a m =f- and that PIn·· ·npm =fa similar argument using (4.3) gives
f
aJp1n ... nPm
i=l
so that either al either case.
=
(f
a i ) Ipln ... nPm
1.
0. Meanwhile
= 0,
i=l
+ ... + am =
°or PI n··· n Pm = 0, a contradiction in 0
4.2 Invariant valuations on parallelotopes To begin the classification of invariant valuations on Par(n), consider the problem in RI. An element of Par(l) is a finite union of closed intervals. Set
J-l6(A) = number of connected components of A, J-l~(A) = length of A.
One easily verifies that J-l6 and J-l~ are both continuous invariant valuations on Par(l). We shall prove that every continuous invariant valuation on Par(l) is a linear combination of J-l6 and J-ll. Suppose that J-l is a continuous invariant valuation on Par(l). Let c = J-l(A) , where A is a set consisting of a single point in Rl, and let J-l' = J-l - CJ-l6· Note that the invariant valuation J-l' vanishes on points. Define a continuous function f : [0, +00) ----t [0, +00) by the equation
f(x)
=
J-l'([0, x]).
If A is a closed interval of length x, then the invariance of J-l' implies that J-l'(A) = f(x). If A and B are closed intervals of length x and y, such that An B is a point, then
f(x + y) = J-l'(A U B) = J-l'(A) + J-l'(B) - J-l'(A n B) = f(x) + f(y), so that f(x) = rx for some constant r. Hence, J-l' = rJ-lL and our assertion is proved. We now turn to Rn. There is one well known continuous invariant valuation defined on Par(n), namely, the volume. Denote by J-ln(P) the volume of a finite union P of parallelotopes of dimension n.
4 The intrinsic volumes for parallelotopes
36
Recall that the elementary symmetric functions of the polynomials
eo
=
Xl,
X2, ... ,Xn are
1,
n
ek(XI, X2,··· ,Xn ) =
L
Xi1Xi2 .. , Xik' 1 :S k :S n.
l:'Oh < .. -
We shall prove the following theorem. Theorem 4.2.1 For 0 :S k :S n, there exists a unique continuous valuation fJk on Par(n), invariant under translations and permutations of coordinates, such that
(4.9) whenever P is a parallelotope with sides of length
Xl,
X2, ... ,Xn-
For example, fJn(P) is the volume of P, if P has dimension n. Proof Let fJ6 and fJ~ be the valuations previously defined on R I, and let fJf = fJ6 + tfJ~, where t is a variable. Consider the product valuation on Rn, given by the n-fold product (4.10) Because a parallelotope in Par(n) is a Cartesian product of line segments, fJf defines a valuation on parallelotopes. By Groemer's extension Theorem 4.1.3, fJf then extends to a valuation on all of Par(n). Let us compute the value of fJf on the parallelotope P with edges of length Xl, X2, ... , Xn . Since P = h X 12 x··· x In, where Ij is an interval of length Xj, the definition (4.10) implies that (4.11) Meanwhile
fJ}(Ij) = fJ6(Ij)
+ tfJ~(Ij) = 1 + tXj.
By expanding the right-hand side of (4.11), we obtain fJ~(P)
(1 + txd(l
+ tX2) ... (1 + tXn) 1 + el(xI, ... xn)t + e2(xI, ... xn)t 2 + ... + en(xI, ... Xn)t n .
Now let Q E Par(n). The set Q can be expressed as a union Q = PI U P 2 U ... U Pn , where the Pi are parallelotopes. (Recall that the
4.2 Invariant valuations on parallelotopes
37
intersection of a collection of parallelotopes is a parallelotope.) By the incl usion-excl usion principle,
JL~(Q)
=
LJL~(Pi) - LJL~(Pi i
n Pj ) + ... - ....
i<j
Collecting terms in each power of t, we conclude that there exist valuations JLo, JLI, ... ,JLn such that
where the valuations JLi are uniquely determined by the identities (4.9).
o Note that, if the parallelotope P has dimension k < n, then the valuation JLi(P) has an ambiguous sense. It may indicate the value of JLi(P) in Rn as defined in the previous theorem, but it may also denote the value of JLi(P) when computed within a lower dimensional plane containing P (and isomorphic to R m , where k :s; m < n). It is a notable consequence of the preceding theorem that the two valuations coincide. In other words, JL7'(P) = JL7(P).
Therefore, there will be no need to indicate the dependence of JLi(P) on the space R n in which the parallelotope P is embedded. We summarize this remarkable fact as follows. Theorem 4.2.2 The valuations JLi on Par(n) are normalized independently of the dimension n. 0
In other words, the value JLk(P) is 'intrinsic' to the set P, and independent of the dimension of the ambient space. For this reason the valuation JLk is called the kth intrinsic volume. The valuation JLo is called the Euler characteristic. As a corollary of Theorem 4.2.1, we see that the Euler characteristic is the only valuation on Par(n) that takes the value 1 on all (non-empty) parallelotopes: Let HI and H2 be complementary orthogonal subspaces ofRn spanned by subsets of the given coordinate system and having dimensions hand n - h, respectively. Let Pi be a parallelotope in Hi and let P = PI X P2 . The intrinsic volumes satisfy the following property with regard to orthogonal Cartesian products.
4 The intrinsic volumes for parallelotopes
38
Proposition 4.2.3 f.li(PI x P 2 ) =
L
(4.12)
f.lr (Pdf.ls (P2 ).
r+s=i
The identity (4.12) is therefore valid when PI and P2 are finite unions of parallelotopes. Proof Suppose that PI has sides of length Xl, ... ,Xh and P2 has sides of length YI, ... , Yn-h. Then we have r+s=i
Let jr+l = kl Yn-h. Then
+ h, ... ,ji = jr+s = ks + h, and let Xh+1 = YI,···, Xn
=
r+s=i
X·Jl ···X·Jr X·Jr+l ···X·Ji
°
o
A valuation f.l on Par(n) is said to be simple if f.l(P) = for all P of dimension less than n. The restriction of the volume f.ln to the lattice Par( n) is characterized by the following theorem.
Theorem 4.2.4 (The volume theorem for Par(n» Let f.l be a translation invariant simple valuation defined on Par(n), and suppose that f.l is either continuous or monotone. Then there exists C E R such that f.l(P) = Cf.ln(P) for all P E Par(n); that is, f.l is equal to the volume, up to a constant factor.
Proof Let [0,1]n denote the unit cube in Rn, and let C = f.l([0,1]n). Recall that f.l is translation invariant and vanishes on lower dimensions. Since f.l([0, 1]n) = c, a simple cut-and-paste argument shows that
39
4.2 Invariant valuations on parallelotopes
fL([O, 1/k]n) = c/k n for all integers k > O. Therefore, fL(C) = CfLn(C) for every box C of rational dimensions, with sides parallel to the coordinate axes. This follows from the fact that such a box can be built up by stacking cubes of the form [0,1/k]n for some k > O. Since fL is either continuous or monotone, it follows that fL( C) = CfLn (C) for every box C of positive real dimensions, with sides parallel to the coordinate axes. It then follows from the inclusion--exclusion principle that fL(P) = cfLn(P) for all P E Par(n). 0 The condition of either continuity or monotonicity is necessary to the characterization given by Theorem 4.2.4. If we omit these conditions then counterexamples to Theorem 4.2.4 can be found even in the case of Par(1)! To see this, recall that R is a vector space of infinite dimension over the field Q of rational numbers. Denote this vector space RQ. Since the dual space is also of infinite dimension, there exists a nontrivial map f E RQ; i.e., a linear map f : RQ ---., Q such that
RQ
f(1)
= 1 and f(x) E Q for all x E R. A parallelotope P E Par(1) is just a closed bounded interval of the form [a, b], having length b - a. Define a valuation TJ on parallelotopes (intervals) in Par(1) by the formula
TJ([a, b]) = f(b - a). Evidently TJ is invariant, depending only on the length of the closed interval. Moreover, if a ~ C ~ b ~ d, then
TJ([a, b]
U
[c, dj)
+ TJ([a, b] n [c, dj)
= TJ([a, d]) + TJ([c, b]) = f(d - a) + f(b - c)
+ f(b -
c) =f(b-a)+f(d-c)
= (f(c - a) = TJ([a, b])
+ f(d -
b))
+ f(b -
c)
+ TJ([c, dj),
by the linearity of f. It now follows from Groemer's extension Theorem 4.1.3 that TJ extends to an invariant valuation on Par(1) that vanishes on lower dimensions. However, TJ is not equal to length (onedimensional volume), since TJ takes only rational values. In other words, invariance alone is insufficient to characterize the volume - either continuity or monotonicity is also required. The reasoning that underlies this counterexample to Theorem 4.2.4 is easily extended to provide counterexamples for Par( n), where n 2: 1.
40
4 The intrinsic volumes for parallelotopes
We are now able to determine all continuous valuations on Par(n) that are invariant under translation and permutations of coordinates. We shall not yet prove that they are also rotation invariant. Theorem 4.2.5 The valuations /-Lo, /-Ll, .•• ,/-Ln form a basis for the vector space of all continuous invariant valuations defined on Par(n).
Proof Let /-L be a continuous invariant valuation on Par(n). Denote by Xl, X2, ... ,Xn the standard orthonormal basis for R n, and let H j denote the (n -I)-hyperplane in Rn spanned by the coordinate vectors Xl, ... , X j -1 , X j +1, ... , Xn- The restriction of /-L to H j is an invariant valuation on parallelotopes in H j . Proceeding by induction, we may assume that n-l
= L Ci/-Li(A),
/-L(A)
i=O
for all A E Par( n) such that A ~ H j . Moreover, the coefficients <;i are the same for each choice of H j , since the valuations /-L; /-Lo, ... , /-Ln-l are invariant under permutation of the coordinates Xl, •.• , X n . Thus, the valuation n-l
/-L- LCi/-Li i=O
vanishes on all lower dimensional parallelotopes in Par(n), since any such parallelotope is contained in a hyperplane parallel to one of the hyperplanes H j . By Theorem 4.2.4, n-l
/-L - L
Ci/-Li
=
Cn/-Ln,
i=O
where /-Ln is the volume on Rn, and where Cn is a real constant. In other words, n
/-L = LCi/-Li. i=O
o A valuation /-L on Par(n) is said to be homogeneous of degree k > 0 if
/-L(aP) for all P E Par(n) and all a 2:
o.
=
ak/-L(P)
4.3 Notes
41
Corollary 4.2.6 Let f-l be a continuous invariant valuation defined on
Par(n) that is homogeneous of degree k, for some 0::; k ::; n. Then there exists C E R such that f-l(P) = Cf-lk(P) for all P E Par(n). Proof By Theorem 4.2.5 there exist
Cl,""
Cn
E
R such that
n
f-l = LCif-li. i=O
If P
=
[0, l]n then, for a > 0,
f-l(aP) =
~Cif-li(ap) = ~Ciaif-li(p) = ~ (7)Ciai
Meanwhile,
f-l(aP) = akf-l(P) = a k Therefore,
Ci
~Cif-li(P) = ~ (7)Ciak
°
= if i =f. k, and f-l = Ckf-lk·
o
4.3 Notes
For a more complete discussion of the Hausdorff topology on the space of compact subsets of Rn and the subspace of compact convex sets, see [85, pp. 47-61]. Theorem 4.1.3 is a special case of a more general extension theorem of Groemer, in which the lattice Par(n) is replaced with the lattice of polytopes in Rn (see [32]). Theorem 4.2.5 can be generalized to a classification of all continuous translation invariant valuations on the lattice Par(n) , omitting the requirement that valuations be invariant under permutation of coordinates. The result is a 2n -dimensional space of valuations, with a basis indexed by the collection of all coordinate subspaces of Rn with respect to the fixed basis for parallelotope edges in Par(n); that is, by the set of all 2n subsets of that n-element basis. For a detailed treatment, see [54].
5 The lattice of polyconvex sets
We turn now to the lattice of polyconvex sets, which is a natural setting for the study of classical geometric probability. In Section 5.2 we define the Euler characteristic on polyconvex sets, which is an important tool for the extension in Chapter 7 of the intrinsic volumes of Section 4.2 to polyconvex sets. The Euler characteristic will also reappear in Chapter 10, in which we generalize the discrete kinematic formula of Chapter 3 to polyconvex sets. Section 5.5, while interesting in its own right, points to the correct normalization of the rotation invariant measures on Grassmannians in Section 6.1.
5.1 Polyconvex sets A subset K of Rn is said to be convex if any two points x and y in K are the endpoints of a line segment lying inside K. Denote by Kn the collection of all compact convex subsets of R n. A finite union of compact convex sets will be called a polyconvex set (a term suggested by E. De Giorgi). If A is a polyconvex set in Rn, we shall say that A is of dimension n if A is not contained in a finite union of hyperplanes of R n; that is, if A has a non-empty interior. Otherwise, we shall say that A is of lower dimension. The union and intersection of polyconvex sets are polyconvex. This follows from the basic fact that the intersection oftwo convex sets is convex. In other words, the family of polyconvex sets in Rn is a distributive lattice. We denote this lattice Polycon(n). Note that Par( n) is a sub lattice of Polycon( n). The lattice Polycon( n) is also sometimes called the convex ring. A non-empty compact convex set K E Kn is determined uniquely by its support function hK : sn-l --+ R, defined by hK(U) =
5.1 Poly convex sets
43
maxXEK{ x . u}, where· denotes the standard inner product on Rn. For example, if vERn and v denotes the line segment with endpoints v and -v, then hv(u) = lu· vi, for all u E sn-l. More generally, suppose that h: sn-l ----+ R, and consider the radial extension h : Rn ----+ R given by h(au) = ah(u), for all u E sn-l and a 2': O. The original function h is a support fUEction of a compact convex set in Rn if and only if the radial extension h is sublinear; that is,
h(x + y)
~
h(x)
+ h(y),
for all x,y ERn. Note that, for u E sn-l, a compact convex set K lies entirely on one side (the '-u' side) of the hyperplane H(K, u) determined by the equation X· u = hK(U). The hyperplane H(K,u) is called the support plane of K in the direction u. If H(K, u)- denotes the closed half-space x· u ~ hK(U) bounded by H(K, u), then we have
K =
n H(K,u)-. uES n
For compact convex sets K and L the Minkowski sum K by
K
+L
=
{x
+y : x
E
K and y E L}.
+ L is defined (5.1)
It is not difficult to show that hK + L = hK + h L . Recall from Lemma 4.1.1 that for compact sets K and L in Rn, the Hausdorff metric satisfies d(K, L) ~ E if and only if K ~ L + EB and L ~ K + EB. It follows easily from (5.1) that
8(K, L)
=
IhK(U) - hL{u)l.
sup uES n
(5.2)
- 1
In other words, the Hausdorff topology on Kn is also given by the uniform metric topology on the set of support functions of compact convex sets. Denote by En the Euclidean group on Rn; that is, the group generated by translations and (proper or improper) rotations. If A C Rn and 9 E En, write gA
= g(A) =
{g(a) : a E A}.
The subgroup of translations (relative to a fixed Cartesian coordinate system) shall be denoted by Tn. A valuation J-l defined on polyconvex sets in Rn is said to be rigid motion invariant (or simply invariant, when no confusion is possible) if J-l(A)
= J-l(gA)
(5.3)
44
5 The lattice of polyconvex sets
for all 9 E En and all A E Polycon(n). If the equality (5.3) holds only when 9 E Tn, we say that J.L is translation invariant. Our objective is to determine all invariant valuations defined on polyconvex sets in Rn. As with Par(n), we impose a continuity condition on the valuations to be considered. A valuation J.L is said to be convexcontinuous (or simply continuous, when no confusion is possible) provided that
whenever An, A are compact convex sets and An --+ A with respect to the metric (5.2). Examples of continuous invariant valuations on Polycon(n) include volume and surface area. The following proposition shows that we can restrict our attention to convex-continuous valuations defined on the generating set Kn. Theorem 5.1.1 (Groemer's extension theorem for Polycon(n» A convex-continuous valuation J.L on Kn admits a unique extension to a valuation on the lattice Polycon( n).
Proof Suppose that J.L is a continuous valuation. In view of Groemer's integral Theorem 2.2.1, it is sufficient to show that J.L defines an integral on the space of indicator functions. The theor,em is trivial in dimension ~ero. Assume that the theorem holds in dimension n-1. Suppose that there exist distinct K 1 , ••. ,Km E Kn such that LaJKi =0
(5.4)
i=l
while Tn
L aiJ.L(Ki) =
1.
(5.5)
i=l
Let m be the least positive integer for which such expressions (5.4) and (5.5) exist. Choose a hyperplane H, with associated closed half-spaces H+ and H- such that Kl C lnt H+. Since I KinH + = IKJH+, it follows from (5.4) that Tn
LaiIKinH+ i=l
= O.
5.1 Polyconvex sets
45
Similarly, m
m
L cxiIK,nH = 0 and
LCXJK,nH- = O.
i=l
i=l
Meanwhile, since J-l is a valuation, m
m
m
m
i=l
i=l
i=l
i=l
Since the sets Ki n H lie inside a space of dimension n - 1, the sum cxiJ-l(KinH) = 0 by the induction assumption. Because KlnH- = 0, the sum L~l cxiJ-l(Ki n H-) = 0 by the minimality of m. From (5.5) we have L~l
m
m
i=l
i=l
Choose a sequence of hyperplanes HI, H 2 , ... such that Kl C lnt Hi and
By iterating the preceding argument, we have m
LCXiJ-l(Ki n Hi n··· n H:) = 1 i=l
for all q 2: 1. Since J-l is continuous, the limit as q
~
(Xl
gives
m
L cxiJ-l(Ki n K l ) = 1, i=l
while a similar argument using (5.4) gives m
L cxiIK,nK1
=
O.
i=l
After repeating this argument with the bodies K 2 , . .• ,Km we have
tCXiJ-l(Kl n··· n Km) = (tCXi) J-l(Kl n··· n Km) = l. This implies that CXl + .,. + CXm =I- 0 and that Kl n .. , n Km =I- 0. Meanwhile a similar argument using (5.4) gives
f
i=l
CXJK1n ... nKm =
(f ,=1
CX i )
IKln ... nKm =
0,
46
5 The lattice of polyconvex sets
so that either a1 in either case.
+ ... + am = 0 or K1 n ... n Km = 0, a contradiction 0
5.2 The Euler characteristic Next, we shall extend the valuation J-lo to the entire distributive lattice Polycon(n). We have seen that J-lo is a well-defined functional on sets that are finite unions and intersections of parallelotopes, and that J-lo(P) = 1 if P is a non-empty parallelotope. These results motivate the following theorem.
Theorem 5.2.1 (The existence of the Euler characteristic) There exists a unique convex-continuous invariant valuation J-lo defined on the family Polycon(n) of all polyconvex sets in Rn, such that J-lo(K) = 1 whenever K is a non-empty compact convex set. The valuation J-lo is again called the Euler characteristic. Proof We proceed by induction on the dimension n, the case n = 1 having been established previously. By Theorems 2.2.1 and 5.1.1, it will suffice to establish the existence of a linear functional Ln defined on Kn-simple functions, such that Ln(IK) = 1 whenever K is a non-empty compact convex set. For n = 1, set L 1 (1) = L(I(x)-f(x+O)), xER
where f(x + 0) = lima--+o+ f(x + a). The sum on the right-hand side is finite, and, for f = I K , where K is an interval [a, bj, we have
Thus, L 1 (IK
)
= J-l6(K),
so that L 1 (1) =
J
f dJ-l6·
For arbitrary n, choose an orthogonal coordinate system Xl, X2,"" x n . Given the first coordinate x, let Hx be the hyperplane parallel to the coordinates X2, ... , Xn and passing through the point (x, 0, ... ,0). Let f = f(X1, X2,"" x n ) be a simple function. The function fx(X2,' .. ,xn ) = f(x, X2,' .. ,xn ) is a simple function in H x , and we
5.2 The Euler characteristic
47
assume that L n - l (fx) has been defined in H x , since Hx is isomorphic to Rn-l. Set F(x) = Ln-l(fx), and set
Note that the function F is simple, so that the right-hand side is well defined. If f = I K , where K is a compact convex set, then fx is the indicator function of the slice of K by the hyperplane at Xl = X, and F is the indicator function of the projection of K onto the xl-coordinate axis. It follows that Ll (F) = 1. Since Ln(f) =
Jf
d'/a,
for some valuation f.lo, it follows that f.lo is the desired valuation.
0
Note that the Euler characteristic f.lo is normalized. In other words, if K is a polyconvex set of dimension k in R n, and if V is a plane of dimension j containing K, then f.l~(K) computed within V is equal to f.lo(K) computed in Rn. This follows from the fact that f.lo(K) can be computed via the inclusion-exclusion principle after K has been expressed as a finite union of compact convex sets, whereas f.lo(K) = 1 for all non-empty compact convex sets K in spaces of any (finite) dimension. For this reason we write f.lo in place of f.lo. The argument in the preceding proof can also be used to compute the Euler characteristic of a polytope. By Corollary 2.2.2, the valuation f.lo extends uniquely to a valuation defined on the relative Boolean algebra generated by Polycon(n), a valuation that is again denoted by f.lo. Consider the (smaller) distributive sublattice ofPolycon(n) generated by compact convex polytopes. Recall that a convex polytope is the intersection of a finite collection of closed half-spaces. A polytope is a finite union of convex polytopes. Given a polytope P, the boundary 8P is also a polytope (which is not the case for an arbitrary compact convex set). Therefore, f.lo(8P) is defined. Theorem 5.2.2 If P is a compact convex polytope of dimension n > 0, then
Proof Using again the notation of Theorem 5.2.1, we note that H x n8P = 8( Hx n P) if X is not a boundary point of 1f P, the orthogonal projection
48
5 The lattice of polyconvex sets
of P onto the line H;:. Let F(x) = J-lo(o(Hx np)), where J-lo is taken in the space Hx. For the case n = 1 we have J-lo(oP) = 2 = 1- (-1), since OP consists of two distinct points. For n > 1 it follows from the induction hypothesis that
when x E nP is not a boundary point of nP. Meanwhile, if x E o(nP), we have
J-lo(Hx n oP) = 1, since Hx n P is a face of P (though possibly a single point). Finally,
J-lo(Hx n oP)
=
0,
when Hx n oP = 0. We can now compute
L1(F(x)) = ~)F(x) - F(x + 0)), x
a sum that vanishes except at the two points, call them a and b (with a < b), where Hx touches the boundary of P. The right-hand side then reduces to
F(a) - F(a + 0)
+ F(b) -
F(b + 0).
We compute
F(b+ 0) F(b)
=
=
0,
1,
F(a) = 1, F(a+O)
= 1-
(_I)n-l.
On adding, we find that
Ll(F(x)) = 1-1 + (_I)n-l as desired
+ 1 = 1 + (_I)n-l = 1- (_I)n, o
If P is a compact convex polytope of dimension k in R n, let V be the k-dimensional plane containing P. We denote by relint(P) the interior of P relative to the topology of V; that is, the relative interior of P.
49
5.2 The Euler characteristic
Theorem 5.2.3 Let P be a compact convex polytope of dimension k in Rn. Then 1L0(relint(P))
= (_l)k.
Proof Since 1L0 is normalized independently of the ambient space, we compute within the k-dimensional plane in Rn containing P. From Theorem 5.2.2 we have
= 1L0(P) -1L0(8P) = (_l)k.
1L0(relint(P»
o We can now generalize Euler's formula to arbitrary (nonconvex) polytopes. To this end, we define the notion of a system of faces F of a polytope P. This will be a family with the following properties: • Every element of F is a convex polytope. UQEF relint(Q) = P . • If Q, Q' E F and Q -=I- Q', then relint(Q) n relint(Q') =
•
0
We can now prove the following result.
Theorem 5.2.4 (The Euler-Schliifli-Poincare formula) Let F be a system of faces of a polytope P, and let fi be the number of elements of F of dimension i. Then
1L0
fo --;-
=
II + h - ...
First proof Place a linear ordering on the elements of F, or 'faces', such that, if Q < Q' then flim(Q) ::; dim(Q'). Evidently, Ip
=
L
(IQ - lQ1)'
QEF
where Q1 = Q n (UQI
l
L
dlLo
J(IQ - lQ1) dlLo +
QEFo
+
L QEF2
L QEF1
J (IQ - l Q1 ) dlLo
+ ...
J (IQ - lQ1) dlLo
5 The lattice of polyconvex sets
50
= 0,
For Q E F o, we have IQl
so that
For Q E F i , where i > 0, we have
UQ
Ql = Q n (
1
)
= 8Q,
Q'
and hence,
f.Lo(Q) - f.Lo(8Q) = 1 - (1 - (_l)i) l-l+(-l)i=(-lt Therefore,
L
J
(fQ - IQ,) df.Lo = (_l)i Ii,
QEFi
so that
f.Lo(P)
=
J
Ip df.Lo
=
fo - h
+ h - .... o
Second proof We use the relative interior theorem. This requires extending the valuation f.Lo to the relative Boolean algebra generated by polyconvex sets. If F is any system of faces, then
f.Lo(P)
= f.Lo (
U relint(Q)) = L QEF
L
f.Lo(relint(Q))
QEF
(_l)dimQ
QEF
by Theorem 5.2.3, since each Q E F is a convex polytope. We collect terms of the same dimension, and the theorem then follows. 0
5.3 Helly's theorem The following is a remarkable application of the Euler characteristic.
5.3 H elly's theorem
51
Theorem 5.3.1 (Klee's theorem) Let F be a finite family of compact convex sets such that
is convex. Let i < IFI, and suppose that, for any subset that IGI = i (that is, every subset of cardinality i of F),
G~
F such
n K=I 0. KEG
Then there exists a subset H of F with cardinality i
+ 1,
such that
n K=I 0. KEH
Proof Let n > 1 be a positive integer. Recall that 1-
(7) + (;) - (~) + ... +(-l)j(;) =1O,
(5.6)
for all positive integers j < n. To see why (5.6) holds, suppose that j ~ (n/2). In this case the left-hand side of (5.6) is an alternating sum of strictly increasing terms, and is therefore unequal to zero. Since
and
it also follows that (5.6) holds if we replace j ~ (n/2) by j ::::: (n/2). Now let n = IFI. From Theorem 5.2.1 and the inclusion-exclusion formula we have l=f-lO
(U K) KEF
L KEF
f-lo (K) -
L
f-lo (K
n L) + ...
K#-LEF
whenever nKEG K = 0 for all G ~ F such that IGI this is impossible, by virtue of the inequality (5.6).
= i + 1.
However, 0
Given a set A ~ Rn, the convex hull of A is the smallest convex set in R n that contains A; that is, the intersection of all convex sets containing
52
5 The lattice of polyconvex sets
A. The following is a simple and yet fundamental property of convex hulls in Rn. Theorem 5.3.2 (Caratheodory's theorem) Let T be the convex hull of a family of compact convex sets K 1 , K 2 , ..• ,Km in Rn. For each x E T, there exists a subfamily Kjl' ... ' K jp ' with convex hull T x , such that p ~ n + 1 and x E T x. Proof If n = 0 the result is trivial. Assuming that the theorem holds for dimension n - 1, we prove the theorem for dimension n. Let x E T. If x lies on the boundary of the compact convex set T, let H denote a support plane of T at x. Because H supports the convex set T, all of T lies inside one of the closed half-spaces bounded by H. Since x E T n H, it follows that x lies in the convex hull of the convex sets K j n H. By the induction assumption on dimension, there exists a subfamily Kl1 n H, ... ,Kjp n H with convex hull T* such that p ~ n and x E T*. It follows that x lies in the convex hull of the subfamily K j1 , ... ,Kjp. If x lies in the interior of T, suppose that x rj. K j for any j (otherwise the proof is finished). Let £ denote a line through x that also meets K m , and let x' be the point of intersection of £ with the boundary of T. Since £ meets the boundary of T at two points, choose x' so that x lies between x' and £ n Km. It follows from the argument in the previous paragraph that x' lies in the convex hull of a subfamily Kjl , ... ,Kjp of F, where p ~ n. It then follows that x must lie the convex hull of the sets K j1 , ... ,Kjp ' K m , a collection of at most n + 1 sets in F. 0 Combining CaratModory's Theorem 5.3.2 with Klee's Theorem 5.3.1 we obtain the following celebrated theorem of Helly. Theorem 5.3.3 (Helly's theorem) Let F be a finite family of compact convex sets in Rn. Suppose that, for any subset G ~ F such that IGI ~ n + 1 (that is, every subset of cardinality at most n + 1 of F),
n K#0. KEG
Then
n K#0. KEF
5.3 Helly's theorem
53
In other words, if every n + 1 elements of F have non-empty intersection, then the entire family F of convex sets has non-empty intersection. Proof If IFI ~ n + 1 the result is trivial. Suppose that Theorem 5.3.3 holds for IFI = m, for some m 2': n + 1. We show that the theorem also holds for IFI = m + 1. Let F = {K 1 , ... , K mH }. For each 1 ~ j ~ m + 1 denote by L j the intersection
Our induction assumption for the case IFI = m implies that each L j is non-empty. Let M denote the convex hull of the union L1 u··· u L m +1. If x E M then Caratheodory's Theorem 5.3.2 implies that x lies in the convex hull of the union Lil U Li2 U ... U Lin+l for some 1 ~ i1 ~ ... ~ in+ 1 ~ m + 1. For each i ¢:. {iI, ... , in+ I},
so that x E K i . In other words, m+1
UK
M S;;
i·
i=l
Let Mi = Ki
n M for 1 ~ i
n
Mi
=
i-f-j
Since M1 U ... U Mm+1 rem 5.3.1 that
~
m
+ 1.
n
Ki n M
Note that, for each j,
= Lj n M = Lj
-=I-
0.
i-f-j
= M
is convex, it follows from Klee's Theo-
Hence,
n
m+1
i=l
n
m+1
Ki;2
Mi -=I- 0.
i=l
o Theorem 5.3.3 is in fact a generalization of Theorem 3.3.1. To see this, suppose that S = {Sl' ... ,sn} is a finite set, and associate to each Si a distinct point Xi E Rn-l, chosen so that the collection {Xl, ... ,x n } is affinely independent. Let ~ denote the geometric simplex in R n - 1 having vertices {Xl, ... , x n }. Subsets of S now correspond to faces ofthe
5 The lattice of polyconvex sets
54
simplex ~, and Theorem 3.3.1 is now a specialization of Theorem 5.3.3 to families of faces of the simplex ~.
5.4 Lutwak's containment theorem We now turn to a beautiful application of Helly's theorem to the question of containment of convex bodies. Given compact convex sets K and L with non-empty interiors, is there a simple condition that guarantees that some translate of K is a subset of L? It turns out that the answer to this question is determined by the relationship of K to the simplices in R n that contain L. For K E Kn and vERn, denote by K + v the set
K In other words, K
+v =
{x + v : x E K}.
+ v denotes the translation of the set K
by the vector
v. Theorem 5.4.1 (Lutwak's containment theorem) Let K, L E Kn with non-empty interiors. The following are equivalent.
(i) For every simplex
~ such that L ~ ~, there exists vERn such that K +v ~~. (ii) There exists Va E R n such that K + Va ~ L.
In other words, if every simplex containing L also contains a translate of K, then L itself contains a translate of K. Proof The implication (ii) '* (i) is obvious. We show that (i) (ii). To begin, suppose first that L is a polytope, with facets L 1 , L 2 , .•. ,Lrn and corresponding facet (outward) unit normal vectors U1, U2,.'" Urn' Assume also that every selection of n distinct unit normals Uj is a linearly independent set. (Were this not the case, a small perturbation of L would make it so.) For each facet L i , let Hi denote the (n - I)-dimensional hyperplane in Rn containing L i , and let Ht denote the closed half-space bounded by Hi and containing the polytope L. Finally, denote by Ti the set of vectors vERn such that K + v c Ht. Since each Ht is a (convex) closed half-space and K is compact, it is clear that each Ti is a nonempty closed convex set. The independence condition on the unit normals {Ui} implies that, for each distinct selection Uil' Ui2' ... ,Uin+l of n + 1 unit normals, either
'*
5.5 Cauchy's surface area formula
55
the corresponding intersection n+1
H-~1,~2""'~n+l . .
-nH+ is
-
(5.7)
8=1
contains a simplex .6. il ,i2, ... ,in +l such that L ~ .6.i l,i2, ... ,i n +1l or this intersection contains a translate of the ball O'.B of radius 0'., for all 0'. > O. (This is the case in which the intersection (5.7) is unbounded.) In the first case, the hypothesis of the theorem implies the existence of a vector vERn such that K + v ~ .6.il ,i2, ... ,i n +l' In the second case there also exists v such that K + v lies in the intersection (5.7). In either case, there exists v E Til n· .. n T in + l . In other words, each collection of n + 1 sets Ti has a non-empty intersection. ReIly's Theorem 5.3.3 then implies the existence of a vector v such that m
In other words, K + v ~ Ht for i = 1, ... ,m. Since L = Hi n· .. n H:!, it follows that K + v ~ L. Now suppose that L is an arbitrary compact convex set. Let {Pi}~l be a decreasing collection of polytopes such that Pi ~ L as i ~ 00, and such that each n of the facet normals to Pi are linearly independent. If .6. is a simplex containing Pi, then L ~ Pi ~ .6., so there exists a vector w such that K + w ~.6.. Since the theorem holds for the polytopes Pi, it then follows that there exists a vector Vi for each i, such that K + Vi ~ Pi' Since the Pi are decreasing (with respect to the relation of subset containment), the sequence {Vi} is bounded and must contain a convergent subsequence. Assume then without loss of generality that Vi ~ v. Since Pi ~ L, it follows that K + v ~ L. 0
5.5 Cauchy's surface area formula We conclude this chapter with the following interpretation of the surface area S(K) of a convex body in Rn, which will be of use to us in the sequel. If K E Kn and V is an (n - I)-dimensional subspace of R n , denote by KIV the orthogonal projection of K onto V. Let W n -1 denote the (n - I)-dimensional volume of the unit ball B n - 1 in Rn-1.
Lemma 5.5.1 For v E sn-l,
f
lSn-l
lu, vi du
= 2Wn -1.
56
5 The lattice of polyconvex sets
Proof Recall from elementary calculus that
where sn-l is partitioned into many small regions Ai, having area S(Ai), and where Ui E Ai for each i. Let Ai denote the orthogonal projection of Ai onto the tangent hyperplane to sn-l at the point Ui. Denote by AilvJ.. the orthogonal projection of Ai onto the hyperplane vJ..; that is, into the disk B n- l in vJ... Because Ai is a flat region lying inside a hyperplane with unit normal Ui, we have S(AilvJ..) = Iu . vIS(Ai)' Meanwhile, for Ai sufficiently small, we have S(Ai) ~ S(Ai) and S(AilvJ..) ~ S(AilvJ..). Therefore,
kn-l
Iu· vi du
~L
S(AilvJ..) .
•
Since the collection of sets {Ai IvJ.. } covers the disk Bi twice, projecting from both of the directions v and -v (that is, from both hemispheres of sn-l), we have { lu, vi du }Sn-l
~ 2Wn - l ,
where the similarities converge to equalities in the limit, as the mesh of the partition {Ai} goes to zero. 0
Theorem 5.5.2 (Cauchy's surface area formula) For all K E Kn,
S(K)
= - I Wn-l
1
Sn-l
!-In-I(Klu J.. ) duo
(5.8)
Here sn-l denotes the (n-I)-dimensional sphere; i.e. the set of all unit vectors in Rn. Note that !-In_I(KluJ..) is just the (n - I)-dimensional volume of the projection of K onto the subspace uJ... Proof Let P be a compact convex polytope with facet unit normal vectors VI, ... ,Vm and corresponding facet surface areas al, ... , am. If a facet Pi of P has unit normal vector Vi and surface area (that is, (n - I)volume) ai, then the projection PiluJ.. has (n - I)-volume
!-In_I(PiluJ..) = ail u , vii· For u E sn-l, the (n-I)-volume !-In_I(PluJ..) is computed by summing the (n-I)-volumes of the projections of the facets of P. This sum gives
5.5 Cauchy's surface area formula
57
twice the desired value, since each point of PluJ.. is hit twice (from above and below) as we sum over projections of all facets of P. Hence, we have
r
JSn-l
/-In-I (PluJ..) du
=
where the third equality follows from Lemma 5.5.1. Since (5.8) then holds for any convex polytope P, the equation holds for any convex body K by continuity. 0 Recall that, for parallelotopes P E Par(n), /-In-I(P) = ~S(P).
We are therefore motivated to define /-In-I(K) = ~S(K),
for all K E
}Cn.
The equation (5.8) now becomes
1
/-In-I(K) = -1/-In-I(Klu J.. ) duo 2Wn-1 Sn-l
(5.9)
Since every line f through the origin in R n meets sn-I at exactly two points, we can rewrite this result as an integral over the projective space Gr(n,l) (Le. the set of all lines f through the origin in Rn) instead of integrating over the sphere. We then renormalize the resultant measure on Gr(n, 1) so that the space Gr(n, 1) has total measure 1. This measure is called the Haar probability measure on Gr(n,l). Integrating with respect to the Haar probability measure on Gr(n, 1) we obtain /-In-I(K)
=a
r
/-In_I(KlfJ..) df,
JGr(n,l)
where a is a constant independent of K. To compute a, set K the unit ball in R n , to obtain nw 2n
= /-In-I(Bn) = a
r
JGr(n,l)
/-In_I(BnlfJ..) df
= aWn-I,
= Bn ,
58
5 The lattice of polyconvex sets
since the total measure of the space Gr( n, 1) is 1. It follows that
nwn 2Wn - l
a=--.
Denote by Tn the rotation invariant measure on Gr(n, 1) normalized so that Tn(Gr(n, 1)) = 2nWn . Wn-l
(5.10)
The equation (5.9) now becomes !-In-l(K)
=
r
JGr(n,l)
!-In_l(KlfJ..) dTn.
(5.11)
5.6 Notes For a general reference to the theory of convex bodies, see the book by Schneider [85] and the survey on valuations by McMullen and Schneider [72]. The normalization for intrinsic volumes is due to McMullen [70]. Groemer's extension Theorem 5.1.1 originally appeared in [32], which contains a comprehensive presentation of extension theorems for valuations on various classes of compact convex sets. For a treatment of the combinatorial theory of the Euler characteristic, see [79, 80]. See also [75] for the connection between the Euler characteristic and algebraic topology. Hadwiger presented numerous geometric applications of the Euler characteristic in [37, 39]. For a modern treatment of the Euler characteristic as a valuation on convex sets, see [71, 72, 85]. Helly's Theorem 5.3.3 first appeared in [43]; the proof given in this section is due to Hadwiger [37]. Theorem 5.3.2 is due to Caratheodory [85]. The proof of Caratheodory's theorem presented above may also be found in [37], where it appeared as a special case of a more general theorem of Steinitz [93]. Klee's Theorem 5.3.1 was noted by Klee in [56], and has many generalizations (see [6, 28, 37, 62]; also [18, pp. 123-128]). A non-numerical form (in terms of cone functions) was given by Chen [11]. A general survey on Helly's theorem, its proofs, variations and applications, may be found in [18] and in [21]. It is conceivable that the generalization of the Euler-SchHifli-Poincare formula (5.2.4) to intrinsic volumes should lead to interesting quantitative generalizations of Klee's Theorem 5.3.1 and thereby of Helly's Theorem 5.3.3. Lutwak's containment Theorem 5.4.1 appeared in [68].
5.6 Notes
59
Cauchy published his surface area formula (Theorem 5.5.2) for dimensions 2 and 3 in [9] and [10]. Generalizations of Cauchy's formula are treated in Sections 7.4 and 9.4 (see also [82, p. 218] and [85, p. 295]).
6 Invariant measures on Grassmannians
Before we extend the notions introduced in Chapter 4 to the lattice of polyconvex sets we shall require a deeper understanding of the lattice of subspaces of Rn. In Section 6.1 we introduce the the flag coefficients, which rely on a crucial choice of normalization for the rotation invariant measures on real Grassmannians. This construction leads in turn to continuous analogues of the extremal combinatorial results of Chapter 3, here in the context of subspaces. The flag coefficients will reappear in Chapters 8-10 as we investigate connections between the integral geometry of Grassmannians and the intrinsic volumes on polyconvex sets (see especially Section 9.4).
6.1 The lattice of subspaces Let Mod(n) denote the set of all linear subspaces of Rn; that is, the set of all linear varieties passing through the origin (having fixed an origin once and for all). The set Mod(n) is a partially ordered set under the relation of inclusion of linear subspaces. Moreover, it is a lattice, whereby the join xVy and the meet x Ay oftwo elements x, y E Mod(n) are defined respectively as the linear subspaces spanned by x and y and as the intersection of x and y. The lattice Mod(n) may be viewed as a continuous analogue of the lattice P(S) of subsets of a set S with n elements. Note, however, that this analogy is only a partial one, since the distributive law governing unions and intersections of subsets of S does not hold in the lattice Mod(n). Nonetheless, this analogy shall carry us as far as we need to go. In the lattice Mod( n) one defines the notions of chain, flag, and rank as for the lattice of subsets of R n. In particular, an element x of Mod( n) has rank k (that is, r(x) = k), whenever x is a linear subspace of dimension
6.1 The lattice of subspaces
61
k. The subspace {O} is the minimal element ofthe lattice Mod(n). Much as the group of permutations of the set S acts naturally on P( S), the orthogonal group O(n) (that is, the group of rotations about the origin and reflections across hyperplanes through the origin) acts naturally on the lattice Mod(n). The set of all elements of Mod(n) of dimension (rank) k, denoted Gr(n, k), is called the Grassmannian. Our objective is to describe the invariant (Haar) measure that acts on Gr(n, k). It is known that this measure is unique up to a common factor (but we will not prove this). If our analogy is not misleading, the total measure of Gr( n, k) should be an analogue of the binomial coefficient. To begin, consider the invariant measure Tn on Gr(n,l); that is, on the set of all straight lines through the origin. Denote [nJ = T n (Gr(n,l)) = 2nWn ,
Wn-l
(6.1)
where Wn denotes the volume of the unit ball En in Rn. The real numbers [nJ will playa role analogous to that of the positive integers n in the discrete case. The measure Tn may be viewed constructively in the following way. Let the invariant measure on the unit sphere sn-l (that is, on the surface of the unit ball) be denoted by an-I. For any measurable subset A of Gr(n,l), let A' be the subset of the unit sphere sn-l defined by A'
U xnsn-l.
=
xEA
It then follows from (5.10) that
Tn(A)
=
an-l (A') . 2Wn -1
To check this, note that the surface area of the unit sphere is an_I(Sn-l)
= nWn
so that T n (Gr(n,l)) =
nWn .
2W n -1
Again it is clear that the measure Tn is invariant under rotations. Let Flag(n) be the set of flags in Mod(n). For x E Mod(n), denote by Flag(x) the set of all flags that contain x; that is, the set of all sequences (XO, Xl, ... , Xn) of Xi E Mod(n), where dim(xi) = i, such that
6 Invariant measures on Grassmannians
62
Xo Xo
~ Xl ~ '"
~
Xn , and such that one of the Xi equals x. Note that = {O} and Xn = Rn. For fixed Xk, the set of all sequences (Xk, Xk+l, ... , Xn) with Xi E Mod(n), such that dim(xi) = i and Xi ~ Xi+l, is isomorphic to Flag(nk). Similarly, the set of sequences (xo, Xl>' .. , Xk) is isomorphic to Flag(k). Denote by
... , xn) is a simple function on the set Flag(n), then let
J
f d
=
JJ
f(xo, Xl> ... , xn)
d
. .. , Xn-l) dTn,
so that the measure ... , Xn) is a flag in Mod(n), let
The sequence (YI, Y2, ... , Yn) is a sequence of orthogonal straight lines, a frame. Conversely, given a frame (Yl> Y2,· .. , Yn), we obtain a flag by setting
Xo = {O},
Xl
= YI, X2 = YI VY2, X3 = YI V Y2 V Y3, ...
This defines a one-to-one correspondence between flags and frames. If f(xo, Xl> ... , Xn) is a real-valued (measurable) function on flags, let /(Yl> Y2, ... , Yn) be the corresponding function on frames. Then
This result can be read in the simpler language of combinatorics. Once the line YI has been chosen, which can be done in Tn ways, the subspace X2 is determined by the choice of a straight line Y2 in the space orthogonal to Xl (through the origin), which can be done in Tn-l ways, etc. The measure of Flag( n) turns out to be
[n]! =
= 1. Note that
n!WnWn-I" 'WI 2nWn-IWn_2'" Wo
(6.2)
6.2 Computing the flag coefficients
63
We now define an invariant measure on Gr(n, k). For A ~ Gr(n, k), let Flag(A) be the set of all flags (xo, XI, ... ,xn ) such that Xk EA. Set (6.3)
To justify this normalization combinatorially, note that, for each Xk E A, there are exactly [kJ![n - kJ! flags containing Xk; that is, to choose a flag containing Xk one must choose a frame for the vector space Xk, of which there are [kJ! choices, and a frame for the (n - k)-dimensional complementary space of which there are [n - kJ! choices. The measure Ilk is evidently invariant under rotations, and we have
xt,
Ilk (Gr(n, k))
=
[kJ![~~ kJ! = [~].
These values, called flag coefficients, are continuous analogues of the binomial coefficients of the discrete lattice in Chapter 3. From (6.2) we obtain
n! Wn [n] k - k!(n - k)! WkWn-k
-
(n) k
Wn WkWn-k·
(6.4)
6.2 Computing the flag coefficients It is not difficult to compute the numerical values of the numbers [nJ and their associated flag coefficients. We begin by deriving the following well known formula for the volume Wn of the unit ball in Rn.
Proposition 6.2.1 For n
~
Wn
1
= r((n/2) + 1)·
Here r(t) denotes the Euler gamma function, given by
Proof We first compute the integral
64
6 Invariant measures on Grassmannians
We can think of the square of this integral as the product of integrals in two independent variables, x and y.
(1:
e- x2 dX) 2
=
1:
e- x2 dx
1:
e- y2 dy
=
1:
2 e-(x +y2)
dxdy
On switching to polar coordinates in the xy-plane, we obtain
1: 121r 2n
e-(x 2+ y2 )
1
00
1
00
dxdy
e _r2 r dr dO
e- r2 r dr
n,
so that
(6.5) Following a similar argument in n-variables,
since the surface area of the sphere sn-l is equal to nwn . On substituting y = r2, we obtain
6.2 Computing the flag coefficients
65
Hence, we have 1fn/2 Wn
r(~
=
+ 1)" o
Proposition 6.2.1 can be given a more computationally pliable form, provided that we examine separately the cases in which the dimension n of the unit ball is even or odd. Proposition 6.2.2 Let k be a non-negative integer. Then and
W2k+l
=
22k+11f k k! (2k + I)!
Proof It immediately follows from Proposition 6.2.1 that W2k
=
1fk f (k + 1)
1fk
= If·
To compute W2k+l first note that
1
00
f(I/2) = On substituting x
f(I/2)
e-xx- i dx.
= y2 and dx = 2y dy we obtain =
1
00
o
1 e- y2 -2y dy = Y
1
00
e- y2 dy = y1f,
-00
where the last equality follows from (6.5). It now follows from Proposition 6.2.1 that 2k+" 1f2-
2k+l 2k-l ... .! IJr 2 2 2Y "
2k+ 11fk (2k
+ 1)(2k -
1)··· (3)(1)
6 Invariant measures on Grassmannians
66
2k+11f k2kk! (2k + 1)! 22k+11f k k! (2k + 1)!
o Proposition 6.2.2 leads to easy computations of [n]. Once again there are two formulas, depending on the parity of n. Proposition 6.2.3 Let k be a positive integer. Then 21f(2k - 1)!
[2k]
21fk (2k - 1) k '
= 4k(k _ 1)!(k - 1)! = 4k
and [2k+1]=
4k(k!? k (2k)-1 (2k)! =4 k
Proof Computing directly from the definition (6.1) and Proposition 6.2.2 we get [2k]
2kw2k 2W2k-1 (2k - 1)! 1 2k ~ 2 - 1fk-1(k - 1)! k1fk
21fk (2k 4k k
1)
.
Similarly, [2k+ 1]
(2k + 1)W2k+1 2W2k (2k + 1)2 2k+11f kk! k! (2k + 1)! 21fk 4kk!k! (2k)! .
o The first few values of W n , [n], and [n]! run as follows:
6.2 Computing the flag coefficients !l
71"2
r(~+1)
~ 2W n _l
n!w n 2n
n
Wn
[n]
[n]!
0
1
1
1
1
2
n
2
7r
""3
7r
71"2
371"
371"2
""2
5
871"2
""6
7
1671"3 105 71"4 24 3271"4 945 71"5 120
10
4""
8
27r 2
1571"
1571"3 -8-
16
67r 3
"3
71"3
(6.6)
4"
15
6
9
"2
2
4
8
1 71"
"2
471"
3
1 71"
67
16 "5
10571"4
3571"
16
32
35
128
247r4
31571" 256
94571"5
32
Proposition 6.2.3 leads to a number of interesting relations among the numbers [n]. For example, one can easily show that, for k > 0, [2k][2k
+ 1] = k7r,
and [2k _ 1][2k]
=
(2k - 1)7r, 2
from which it follows that [n+ 2]
~
n+ 1 n
for all n > O. Using either of Propositions 6.2.2 and 6.2.3 one may easily compute the values of the flag coefficients. Once again the formulas vary according to the parity of the parameters.
Proposition 6.2.4 Let m and k be positive integers. Then [2m] 2k [ 2m] 2k+1
=
=
(2m) (m) -1 2k k
7r (2m)! 4 m k!m!(m-k-1)!
an
2m7r (
=
4m
d 2m - 1
)
k,m,m-k-1 '
68
6 Invariant measures on Grassmannians
while [2rr;:1] =4 k (7)C:)-1
and
[~7: 11] = 4m- k (mr: k) C~~ kk)) -1 Proof Combining (6.4) with the formulas of Proposition 6.2.2, we obtain
[~7] = (~7) W2k~::-2k = (~7):~ k~~::_~! = (~7) (7)-1 With a little more effort, we have
(2~: 1) W2k+1~::-2k-1
(2m - 2k - I)! ( 2m ) 1fm (2k + I)! 2k + 1 m! 22k+11f kk! 22m-2k-11fm-k-1(m - k - I)! 2m ) _1f_ (2k + 1)!(2m - 2k - I)! 2k + 1 m!22m k!(m - k - I)! 1f (2m)! 4m k!m!(m - k - I)!' (
Computing once again,
1) 1)
W2m+ 1 ( 2m + 2k W2kW2m-2kH 2m + 22m+11fmm! k! (2m - 2k + I)! ( 2k (2m + I)! 1fk 22m - 2k+11fm-k(m - k)! k! 4km!k! (2k)!(m - k)! k! 4k(7) Ckk)-1 The final formula now follows easily from the dual symmetry of the flag coefficients:
1] =
[2m + 2k+l
[2m+l] =4m- k ( m ) (2(m-k))-1, 2(m-k) m-k m-k
by the previous computation.
o
6.2 Computing the flag coefficients
69
By means of Proposition 6.2.4 one may easily generate the following Pascal triangle for the flag coefficients [~], as n = 1, ... ,8. 1 1 8
1 1571"
1 16
1 1
3571"
32
5"
16
7
2 371"
1 "3
""4
"2
1 1
2 371"
3
4
""4
4 1571"
5
6
1 71"
1
5
!l
8
8
1 8
"3
6
10571"
35
10571"
32
3
32
1 1571"
16
7
1 16
5"
1 3571"
32
1
(6.7) Certain patterns are immediately noticeable. For example, the table (6.7) suggests that
[;]=n-1 for all n ::::: 2. This indeed follows easily from Proposition 6.2.4. It is also not difficult to verify that
[n~l ]= 2-
n-l 2- ,
for all odd positive integers n. If n is even, the value of the middle flag coefficient is given by one of two formulas, depending on the congruence class of n modulo 4. Computations similar to those preceding show that if n == 0 mod 4 then
If n
== 2 mod 4 then (n) ( ~ ) (n + 2)71' ( n ) (n + 2)71' [n] ~ = ~ n4 2 2n+2 = ~,n42, n!2 2n+2'
Recall that, for fixed n, the classical binomial coefficient (~) is maximized at k = n/2 if n is even, or k = (n - 1)/2 if n is odd. In the next section we will show that flag coefficients are also maximized 'in the middle,' as the table (6.7) suggests.
6 Invariant measures on Grassmannians
70
6.3 Properties of the flag coefficients
The flag coefficients satisfy a number of properties analogous to those of the binomial coefficients. For example,
In analogy to Pascal's triangle relations we have the following. Proposition 6.3.1 For 1 ~ k Wk-lWn-k Wn-l
[n - 1] + k -
~
n -1,
WkWn-k-l
1
[n - 1] =
Wn-l
WkWn-k
[n]
Wn
k
k
(6.8)
Proof We prove (6.8) by direct computation: Wk-lWn-k Wn-l
[n - 1] + k -
1
WkWn-k-l
[n - 1]
Wn-l
k
D
In the discrete case, Pascal's triangle for binomial coefficients is much cleaner:
(n-1) + (n -1) (n) =
k-1
k
k
.
A similar expression exists for the flag coefficients, with a relation of
inequality. The inequality will follow from a more elementary additive inequality for the numbers [nJ. Recall from elementary calculus that the volume of the unit n-ball may be computed by integrating over its (n - 1)-dimensional slices; that is,
(I
Wn
The substitutions x
J
2
= 2 o wn - l ( 1 - X )
n-l
2
dx.
= sin () and dx = cos () d() yield
6.3 Properties of the flag coefficients
71
In other words,
[n] = nWn = n {:!f;; cos n 0 dO. 2Wn - l Jo This equation leads in turn to the following proposition.
(6.9)
Proposition 6.3.2 For all positive integers m and n,
[n]
+ [m] > [n+m].
Proof If 0::; e::; 1r/2, then 0::; cose ::; 1. From (6.9) we then obtain n
[n]+[m]
l:!f;; cosn e dO + m l:!f;; cosm 0 dO
> n l:!f;; cos n +m e dO + m l:!f;; cosn +m e de (n
+ m) l:!f;; cosn +m 0 de =
[n + m]. D
Proposition 6.3.2 and Theorem 6.3.3 illustrate a fundamental difference between the combinatorics of finite sets and the measure structure of this vector space analogue. Theorem 6.3.3 (The Pascal inequality for flag coefficients) For 1 ::; k ::; n - 1,
1] + [n -k 1] > [n]k .
[nk-l
Proof For 1 ::; k ::; n - 1, we have
[n - I]! [k - 1]![n - k]!
+
[n - I]! [k]![n - k -I]!
[n - 1]!([k] + [n - k]) [k]![n-k]! By Proposition 6.3.2, [k]
1]
[n k -1
+
+ [n -
[n k
1]
k] > [k + (n - k)] = [n], so that [n - 1]![n]
[n]!
> [k]![n - k]! = [k]![n - k]!· D
72
6 Invariant measures on Grassmannians
Recall that the real numbers [nJ playa role analogous to that of the positive integers n in the discrete case. The following proposition demonstrates in part the expedience of our choice of normalization for the measure Tn; that is, for the value of [nJ. Proposition 6.3.4 The map n 1---+ [nJ is an increasing function.
Proof From Proposition 6.2.1 we have
= nJ1iT(~) = nJ1iT(~) =vnr(~). 2 r (nt ) 2 ~r (~) r (~)
[nJ= nWn 2W n-1 Define a function
f on the positive real numbers by f(t) = vn
r (t + ~) r(t)
Since fG) = [nJ, it is sufficient to show that of t. Recall that [4, p. 15J
r(t) = lim k->oo
.
f is an increasing function
ktk!
t(t + 1)··· (t + k)
This implies that
f(t)
( vnlim k->oo
( r
kt+(1/2) k' 1
1
r= l' 1m k-+oo
) 1
(t + "2)(t +"2 + 1)··· (t + "2 + k)
ktk'.
k~~ t(t + 1)··· (t + k)
y7r
.
X
)-1
t(t + 1)··· (t + k)y'k 1 1 1 (t + "2) (t + "2 + 1) ... (t + "2 + k)
(6.10)
Since the function t 1 --=1--t+~ 2t+l
is increasing with respect to t > 0, so is the product on the right-hand side of (6.10). It follows that f is increasing for t > 0, and we conclude that [nJ is an increasing function of the positive integers. D From Proposition 6.3.4 we deduce the following useful property for the generalized factorial [nJ!: Proposition 6.3.5 For 1 :::; k :::; I:::; n/2,
[kJ![n - kJ! 2:: [IJ![n - IJ!
6.4 A continuous analogue of Sperner's theorem
73
Proof To begin, note that
n o <- k <- l <2 -
l< - n - k -< n.
It follows from Proposition 6.3.4 that
[n - k]··. [n -l + 1] 2: [l]··· [k + 1]. (Note that there are l - k factors on each side of this inequality.) Multiplying on both sides, we obtain
[n - k]![k]! 2: [n - l]![l]! D
The flag coefficients satisfy the following property, in evident analogy to the classical binomial coefficients. Proposition 6.3.6 For 1 :s; k
:s; n,
Once again (n/2) denotes the greatest integer less than or equal to n/2. Proof Since
[~] = [k]![~~ k]! = [n: k]' it is sufficient to consider the case in which k < (n/2). On applying Proposition 6.3.5 to the case l = (n/2), we find that
[n - k]![k]! 2: [(n/2)]![n - (n/2)]!, so that
[n]! [n]! [n - k]![k]! :s; [(n/2)]![n - (n/2)]!' D
6.4 A continuous analogue of Sperner's theorem Define a measure Vn on Mod(n) by taking the direct sum of the measures vI:. That is, for any measurable subset A ~ Mod(n), define n
vn(A) =
L v'k(A n Gr(n, k)). k=O
The measure Vn satisfies the following analogue of the classical L.Y.M. inequality (3.1).
74
6 Invariant measures on Grassmannians
Theorem 6.4.1 (The continuous L.Y.M. inequality) Let A Mod(n) be an antichain. For 0::; k ::; n let
Ak
~
= An Gr(n, k),
so that
is a disjoint union. Then (6.11)
Proof For each 0 ::; k ::; n, the measure of flags meeting Ak is given by
by the definition (6.3) of most one point, we have
vI:.
Since every flag in Flag(n) meets A in at
n
n
k=O
k=O
L v'k(Ak) [k]![n - k]! = L 4>n(Flag(Ak)) = 4>n(Flag(A)) ::::; [n]!
It follows that
o We now have the necessary tools to prove a continuous analogue of Theorem 3.1.1.
Theorem 6.4.2 (The continuous Sperner theorem) Let A be an antichain in Mod( n). Then
Evidently equality is attained in Theorem 6.4.2 when A = Gr(n, (n/2)). Proof We reason in analogy to the proof of Theorem 3.1.1. For 0 ::; k ::; n let Ak = AnGr(n, k). Combining (6.11) and Proposition 6.3.6 we obtain
t
t
vl:(A k) < v'k(A k) < 1 k=O [(n/2J - k=O [~J -,
6.4 A continuous analogue of Sperner's theorem
75
so that
o In analogy to the discrete lattice P(S), a subset F ~ Mod(n) is called an r-family if chains in F contain no more than r elements. Given an r-family F in Mod(n) let Fk = FnGr(n, k). Since every flag in Mod(n) meets F in at most r elements, we have n
L
From (6.3) we then obtain the following generalization of (6.11):
~ VJ::(Fk)
~---
_.
(6.12)
This inequality leads in turn to a continuous analogue of Sperner's theorem for r-families (Theorem 3.1.2). Theorem 6.4.3 Let F be an r-family in Mod(n). Then
Proof As in the proof of the discrete case (Theorem 3.1.2), relabel the flag coefficients Co, Cl, ..• ,Cn in descending order, so that Co :2:: Cl :2:: ... :2:: en; then relabel the numerators VJ::(Fk) in (6.12) by xo, Xl,···, Xn so that each Xk is the numerator of that term of (6.12) having Ck as denominator. The inequality (6.12) now becomes
It then follows from Lemma 3.1.3 that
Xo
+ Xl + ... + Xn
::;
Co
+ Cl + ... + Cr-l·
In other words,
o
6 Invariant measures on Grassmannians
76
We now consider a continuous analogue of a question first posed by Sperner (see also Theorem 3.2.7). Given A <:;:; Gr(n, k), let [A]z denote the collection
[A]z = {W
E
Gr(n, l) : W
<:;:;
V for some V
E
A}.
Here we assume that 0 ::; l ::; k ::; n. Sperner's question, recast in the language of subspaces, asks for a lower bound on the measure of [A]z, given only the measure IIn (A). The continuous L.Y.M. inequality gives us such a bound. Theorem 6.4.4 For A <:;:; Gr(n, k), IIn
[k]![n- k]! ([A]z);::: [l]![n -l]! lin (A).
Proof Let Bz = Gr(n, l) - [A]z. If WE Bz then W cannot be contained in any V E A. In other words, the set A U Bz is an antichain. It then follows from (6.11) that
Meanwhile,
so that lin (A)
[~J +
1_
IIn
([A]z) < 1
[7J
_.
It follows that IIn
n] [n] -1 [k]![n - k]! ([A]z);::: [ l k IIn (A) = [l]![n -l]! lin (A).
o For example, in the case l IIn
=
k - 1, we have
[k] ([A]k-1) ;::: [ k ] lin (A). n- +1
Theorem 6.4.4 can also be expressed in the language of simplicial complexes. As for the discrete lattice P(S), define A <:;:; Mod(n) to be a simplicial complex, if whenever V E A and W <:;:; V we have W E A. (Such a set A is also called an order ideal of Mod(n).) In other words,
6.5 A continuous analogue of Meshalkin's theorem
77
if V E A, then all subspaces of V are also elements of A. Once again the set of maximal elements of a simplicial complex is an antichain, and a simplicial complex having exactly one maximal element is called a simplex (or principal order ideal). The elements of dimension k in a simplicial complex A are called the k-faces of A. Theorem 6.4.4 can now be thought of as giving a lower bound for the measure of the set of I-faces of A, given the measure of the set of k-faces of A.
6.5 A continuous analogue of Meshalkin's theorem We now turn to the continuous (vector space) analogues of the r-decompositions and s-systems on the discrete lattice P(S); see also Section 3.1. A map 8: {I, ... ,r} ---+ Mod(n) is called an r-decomposition ofRn if (i) 8( i) ~ 8(j) for i =I- j, and (ii) 8(1) EEl··· EEl 8(r) = Rn Denote by Dec(n,r) the set of all r-decompositions of Rn. Note that, for each 8 E Dec(n, r), dim 8(1)
+ ... + dim 8(r) = n.
Given positive integers aI, a2, ... , a r such that al + ... + a r = n, we shall denote by M ult (n; aI, ... , a r ) the set of all r-decompositions 8 such that dim8(i) = ai for i = 1, ... ,r. In other words, Mult(n;al, ... ,ar ) is the set of all (ordered) decompositions of Rn into direct sums of subspaces having dimensions al, ... , a r . Evidently the set Dec(n, r) can be expressed as the finite disjoint union
Dec(n,r)
=
An s-system of order r is a subset a
~
Dec(n, r) such that the set
{8(i):8Ea}
(6.13)
is an antichain in Mod(n), for each 1 :::; i :::; r. An obvious example of an s-system of order r is Mult( n; aI, ... , a r ) for some admissible selection of al, ... , a r . If 8, ( E Mult(n; al, ... , a r ) then 8(i) and ((i) both have dimension ai, so that either 8(i) = ((i) or the two subspaces are incomparable in the subset partial ordering on Mod(n). This holds for i = 1, ... , r, and so the antichain condition on (6.13) is satisfied.
6 Invariant measures on Grassmannians
78
Other disguised examples with which we have already worked are the s-systems of order 2. Let A be an antichain in Mod(n). For each V E A we can express R n as the direct sum V EEl V J.., so that the pair (V, V J.. ) is a 2-decomposition in Dec(n, 2). Moreover, the set
{VJ..: V E A} is also an antichain in Mod(n), so that the set (T
= {(V, V J..) : V
E A}
is an s-system of order 2. Thus the notion of s-system is again a generalization of the notion of an antichain. Similarly, the Grassmannian Gr(n, k) can be viewed as Mu1t(n; k, n - k) through the bijection V 1-+ (V, VJ..). In analogy to the construction of the measure v'k on Gr(n, k), define invariant measures on the sets Mult( n; aI, ... , a r ) as follows. For 8 E Mu1t(n;al,.'" ar) define a flag (xo, Xl,'" ,Xn) E Flag(n) to be compatible with 8 if (i)
(ii)
X al
= 8(1), and
Xal+.+a)Xal+.+ai_l
= 8(i),
for i 2: 2.
Here the quotient x al +.+ai / x al +.+ai-l denotes the orthogonal complement of the vector space x al +.+ai-l inside the larger space x al +.+ai' Let Flag(A) be the set of all flags (XO, Xl, ... , Xn) compatible with some 8 E A for A ~ Mult( n; aI, ... , a r ). Define
(6.14) To justify this normalization combinatorially, note that, for each 8 E A, there are exactly [al]![a2]!'" far]! flags compatible with 8; that is, to choose a flag compatible with 8 one must choose a frame for each of the vector spaces 8i , of which there are [ail! choices for each i. The measure v;;l, ... ,a r is evidently invariant under rotations, and we have
These values, called multiflag coefficients, are continuous analogues of the multinomial coefficients of the discrete lattice in Chapter 3. Define a measure vn;r on Dec( n, r) by taking the direct sum of the
6.5 A continuous analogue of Meshalkin's theorem measures v::l, ... ,a r define
79
That is, for any measurable subset A ~ Dec(n, r), (6.15)
Just as the continuous Spemer Theorem 6.4.2 gives the maximum possible measure for an antichain A in Mod(n), a generalization of this theorem gives the maximum possible measure for an s-system in Dec( n, r). En route to such a generalization we prove a multinomial version of the continuous L.Y.M. inequality. (Compare with Theorem 3.1.4.)
Theorem 6.5.1 (The continuous multinomial L.Y.M. inequality) Let u ~ Dec( n, r) be an s-system. For al + ... + a r = n denote
so that U=
u
(J'al, ... ,ar'
is a disjoint union. Then (6.16)
Proof For al + .. ·+ar = n the measure of flags compatible with ual, ... ,a r is given by A. 'f'n
(Flag(ual,···,ar ))
n = v al, ... ,a r (u al, ... ,ar )[al]"" •
[a r· ]'
by the definition (6.14) of v::l, ... ,a r ' Suppose that a flag (Xo, Xl, ... , Xn) is compatible with both ,,(,8 E u. Then "((1) = x al and 8(1) = Xbll where al = dim"((l) and bl = dim8(lt Since (Xo, Xl,"" Xn) is a flag, we have x al ~ Xb l or vice versa. However, u is an s-system, so that either "((1) = 8(1) or the two spaces are incomparable. Therefore "((1) = 8(1) and al = bl . Continuing, we have "((2) = Xal+a2/Xal and 8(2) = Xbl+b2/Xal (since al = bI). A similar argument then implies that "((2) = 8(2) and a2 = b2 • Continuing in this manner we conclude that "((i) = 8(i) for each 1 ::; i ::; r, so that "( = 8. In other words, every flag in Flag(n) is compatible with at most one r-decomposition 8 E u. It follows that
80
6 Invariant measures on Grassmannians
:::; [n]!, so that
D
The multifiag coefficients also satisfy the following property, in analogy to the classical multinomial coefficients. Proposition 6.5.2 Let r :::; n be positive integers, and suppose that n = rq + b, where q :::: 0 is an integer and 0 :::; b :::; r - I is the integer remainder. For al + ... + a r = n,
Proof Let al, ... ,ar be positive integers such that al + ... + a r = n. Without loss of generality, suppose that al < (nlr). Then ai > (nlr) for some i > 1. Again without loss of generality, suppose that
Then
a2 -
al :::: 2, so that
It then follows from Proposition 6.3.5 that
Replace al with al + I and a2 with a2 - 1. Note that the identity al + ... + a r = n is preserved. This process is repeated until ai :::: (nlr) for alII:::; i :::; r; that is, until ai = (nlr) + I for I :::; i :::; band ai = (nlr) for b + I :::; i :::; r, where b is the integer remainder upon division of n by
r. Since each iteration of this procedure decreases the value of the product [al]!··· [ar]!, it follows that
[al]!··· [ar]! :::: ([(nlr)]!r- b ([(nlr)
+ I]!)b,
81
6.6 Helly's theorem for subspaces for all al
+ ... + ar
=
n. Therefore,
[n]! [n]! [aI]!···[a r ]!::; ([(nlr)]!r- b ([(nlr)+l]!)b' for all al
+ ... + ar
= n.
D
We are now able to prove a continuous analogue to Meshalkin's Theorem 3.1.5, a multinomial generalization of the continuous Sperner Theorem 6.4.2. Theorem 6.5.3 (The continuous Meshalkin theorem) Let 17 be an s-system in Dec(n,r). Then
Vn -r(I7) < [ ,
n
],
,(nlr),···, (nlr) , ,(nlr) + 1"", (nlr) + 1"
-
",
where n
'V'
V
r-b
b
== b mod r.
Proof We reason in analogy to the proof of Meshalkin's Theorem 3.1.5. For al + ... +ar = n let l7a1 , ... ,a r = 17 nMult(n; aI, ... ,ar ). By combining (6.16) and Proposition 6.5.2 we obtain
L al
n v al,···,ar (17al,···,ar )
+·+ar=n [(n/r), .. ,(n/r),(n/r)+l, .. ,(n/r)+l]
so that n val, ... ,a r (17al,···,a r )
al+ .. ·+ar=n
<
[(nlr), ... , (nlr) , (nlr)
+
1"", (nlr) + 1]' D
6.6 Helly's theorem for subspaces Using the language of simplicial complexes in Mod(n) introduced in Section 6.4, it may be possible to apply the Euler characteristic techniques of Section 5.3 to prove the following Helly-type theorem for subspaces of R n. However, this theorem also follows easily from elementary linear algebra.
82
6 Invariant measures on Grassmannians
Theorem 6.6.1 (Helly's theorem for Mod(n)) Let F be a family of nonzero subspaces ofRn. Suppose that, for any subset G ~ F such that IGI :S n (that is, every subset of cardinality at most n of F), dim
(n x)
> O.
(n x)
> O.
xEG
Then dim
xEF
In other words, if every n elements of F contain a common line through the origin, then there is at least one line £, contained in all o( the subspaces in F. Proof To begin, suppose that F is a finite family of subspaces. For this case, the proof is by induction on the size IFI of the family F. If IFI :S n then the theorem holds trivially. Suppose that the theorem holds for the case IF I = m for some m 2: n. We then consider the case of IF I = m + l. Write F = {XI,X2,'" ,Xm+I}, and denote (6.17) Since the theorem is true for families of size m, each Yi has positive dimension. That is, for each i E {1, ... , m + 1} there exists a nonzero vector Vi E Yi. Since m 2: n, the collection {VI, V2, ... , V m + d must be linearly dependent. Without loss of generality, we may then assume that Vm+1
= CIVI
+ ... + CmVm '
where not all the coefficients Ci are zero. However, (6.17) implies that Vi E Xm+1 for all i E {1, ... ,m}. It follows that Vm+1 E Xm+1 as well. Since Vm+1 E Ym+l, we have m+1 Vm+1
E
n
Xj'
j=1
This completes the induction step and the proof of the finite case. To prove the general case, consider the set x of all lines through the origin contained in a given (nonzero) subspace x; that is
x = {£, E Gr(n, 1) : £, ~ x}.
83
6.7 Notes For all nonzero x E Mod(n), the set space Gr(n, 1). Define
F = {x:
x is a closed subset of the compact x
E F},
and define G similarly for subfamilies G of F. Suppose G is any finite subfamily of subspaces in F. Since F satisfies the intersection condition of Theorem 6.6.1, so does the subfamily G. Since G is finite, it follows from the previous argument that dim
(n x)
>
o.
xEG
oi
In other words, the family G clo~d sets has a non-empty intersection, for every finite subfamily G of F. It follows from the compactness of Gr(n, 1) that the family F of closed sets has a non-empty intersection. Equivalently, the intersection of all subspaces in F has positive D dimension.
6.7 Notes In this chapter we have assumed the existence and uniqueness of Haar (orthogonal invariant) measures on the spaces Gr(n, i) and Flag(n), which are homogeneous spaces of the orthogonal group O(n). For a complete discussion of Haar measures on Lie groups and their homogeneous spaces, see [76]. See also [97, p. 151]. The measures Tn, ¢n, V n , and vn;r may also be defined from the perspective of the action of 0 (n) on the domains of these measures. Choosing a normalization for Tn determines a normalization for the O(n)invariant measure on the group O(n) itself. The actions of O(n) on the spaces Flag(n), Mod(n), and Dec(n, r) then induce precisely the invariant measures ¢n, V n , and vn;r on these spaces. We give a brief sketch of this approach, leaving the details to the reader (see also [76]). To begin, recall that the columns of an orthogonal n x n matrix A determine a unique orthogonal frame for Rn. Conversely, given an orthogonal frame in Rn, we can construct an orthogonal matrix by choosing a unit vector from each line in the frame. Since each line in an orthogonal frame contains two unit vectors, there is a 2n-to-1 correspondence between the group 0 (n) and the set offrames for R n, and consequently between O(n) and Flag(n). Once again setting [n] = Tn (Gr(n, 1)), we can construct an O(n)-invariant measure ¢n on the group O(n) itself,
84
6 Invariant measures on Grassmannians
such that
¢n(O(n))
=
2[n]¢n-l(O(n - 1)),
so that
Recall that the group O(n) acts transitively on Gr(n, k), for 0 .::; k .::; n. Moreover, the stabilizer Stab(V) of a subspace V E Gr(n, k) under this action is isomorphic to the product group O(k) x O(n - k). This follows from an argument similar to the 'combinatorial' argument on frames given in Section 6.1. It follows from elementary Lie group theory that there exists a diffeomorphism between the space Gr( n, k) and the quotient space (no longer a group)
O(n) O(k) x O(n - k)" This diffeomorphism induces an invariant measure that
vi:
on Gr(n, k), such
[n]! [k]![n- k]!' as in our original approach. A similar construction works for the measures vn;al, ... ,ar on the spaces Mult(n;al, ... ,ar ), and again this construction via group actions agrees with that of Section 6.5. Theorems 6.4.2, 6.4.3 and 6.5.3, which give continuous analogues of Sperner's and Meshalkin's theorems, are due to Klain and Rota [55]. Many results in the lattice theory of Mod(n) hold independently of the normalization of the measure Tn; that is, the value of [n]. For example, in [24], Fisk developed a similar construction, in which the total measure of Gr(n, 1) is taken to be nwn/2, the measure suggested by the usual twoto-one quotient map from the unit sphere in Rn. However, this normalization does not permit an extension of the lattice analogy sufficient to admit analogues of Sperner's and Meshalkin's theorems, since the value of nwn /2 is not an increasing function of n. Moreover, our choice (6.1) of normalization [n] in Section 6.1 is compatible with Cauchy's surface area formula in the sense described by the equation (5.11). In subsequent sections we shall see that (6.1) is in fact the unique choice of normalization that agrees with a fundamental normalization for the rigid motion invariant measures on R n and its subspaces, namely, the intrinsic volumes (see also [55]).
6.7 Notes
85
The bound given in Theorem 6.4.4 is probably not the best. In the discrete case, Katona [48] and Kruskal [57] independently obtained a much stronger result (see Section 3.4), which unlike Theorem 6.4.4 did not depend on the size of the ambient set (or, in the language of subspaces, the dimension n of the ambient vector space). Perhaps a continuous analogue to the Katona-Kruskal theorem is waiting in the wings.
7 The intrinsic volumes for polyconvex sets
In this chapter we shift our attention from the Grassmannian (k-planes passing through the origin in Rn) to the affine Grassmannian (all kplanes in R n ), in order to extend the intrinsic volumes of Section 4.2 from the lattice of parallelotopes to the larger lattice of polyconvex sets. A crucial tool in this generalization will be the Euler characteristic of Section 5.2, which will serve as a device for testing whether a compact convex set intersects a given k-plane. Section 7.4 concludes the chapter with a preliminary version of the mean projection formula, which provides a fundamental connection between the intrinsic volumes and the lattice of subspaces. An improved version of the mean projection formula will appear in Section 9.4.
7.1 The affine Grassmannian We next consider the partially ordered set Aff(n) of all linear varieties in Rn, whether through the origin or not. The partially ordered set Aff(n) is not a well-behaved lattice. The group En of Euclidean motions acts naturally on Aff(n). Denote by Graff(n, k) the subset of Aff(n) consisting of all elements of rank k; that is, all linear varieties of dimension k. The minimal element of the partially ordered set Aff(n) is the empty set 0 (unlike in Mod(n), where the minimal element was the zero subspace
{o}). We shall prove the existence of a measure oAk on Graff( n, k) that is invariant under the Euclidean group En. To this end, we parametrize Graff(n, k) as follows. Given V E Graff(n, k), let VJ.. be the maximal linear subspace of R n orthogonal to V and containing the origin. There is a unique maximal linear subspace or(V) orthogonal to V J.. and containing the origin. The subspace or(V) is of dimension k; we shall say
7.2 The intrinsic volumes and Hadwiger's formula
87
that V and or(V) are parallel. The set V n V J.. is a point in V which we denote by p(V). Thus, to every V E Graff( n, k) there corresponds a pair (or(V), p), where or(V) E Gr(n, k) and p E or(V)J.. ~ Rn. (Note that or(V)J.. = VJ...) Conversely, given any pair (Vo,p), where Vo E Gr(n,k), and p E VoJ.., there is a unique linear variety V E Graff( n, k) such that or(V) = Vo and V n VoJ.. = p. For V E Graff( n, k) and pERn, denote by V + p the translation of the linear variety V by the vector p. If f is a real-valued measurable function on Graff(n, k), let 1: Gr(n, k) x Rn ~ R be given by the equation
/(Vo,p) = f(Vo + p), and define
J
f dAI: =
1 1
Gr(n,k) vo-L
f(Vo,p) dp dvk'(vo)'
where dp denotes the ordinary Lebesgue measure on vl ~ Rn-k. We thereby define a measure AI: on Graff( n, k) that is invariant under the group En. To see this, suppose that gv E En corresponds to translation by the vector v. Then the composition of functions
f
0
gv(Vo,p) = f
0
gv(Vo
+ p) =
f(Vo
+ P + v) =
f(Vo
+ P + vIV-L), o
where vlv-L denotes the orthogonal projection of the vector v onto the o subspace vl. We then have
1 1
Gr(n,k) Vo-L
r
r
JGr(n,k) JVo-L
f(Vo
+ P + vlv-L) dp dvk'(vo) 0
f(Vo
+ p) dp dvk'(Vo)
J
fdAI:,
by the translation invariance of the measure dp. A similar argument shows that AI: is invariant under rotations (and reflections), and consequently is rigid motion invariant.
7.2 The intrinsic volumes and Hadwiger's formula We now consider the relationship between the intrinsic volumes, as defined in Chapter 4, and the invariant measure AI: defined on Graff( n, k). Once again, fix an orthogonal coordinate system in Rn, and consider
88
7 The intrinsic volumes for poly convex sets
the lattice Par(n) of finite unions of parallelotopes with sides parallel to this fixed coordinate system. For A eRn, denote by Graff( A; k) the set of all V E Graff( n, k) such that A n V =1= 0. The relationship between Ak and the intrinsic volumes is given by the following theorem. Theorem 7.2.1 For all parallelotopes P in Par(n), P,n-k(P)
= C kAk(Graff(P; k))
(7.1)
where the constants C k depend only on nand k. Note that the equation (7.1) is asserted only for the case in which P is a parallelotope and not for arbitrary elements of Par( n) (i.e. not for all finite unions of parallelotopes). Theorem 7.2.1 immediately suggests the following question: what are the values of the constants C~? In Section 9.3 we will prove the amazing fact that C~ = 1, for all n 20 and all 0 :s; k :s; n. However, much can be accomplished in the meantime. In the course of proving Theorem 7.2.1, we shall need the following lemma. Lemma 7.2.2 Let A and B be compact convex sets in Rn such that A U B is convex. Let V be a linear variety of positive dimension such that V n A =1= 0 and V n B =1= 0. Then V nAn B =1= 0.
Proof The set V n (A U B) = (V n A) U (V n B) is convex, since it is the intersection of two convex sets. If V nAn B is empty, then choose points a E (V n A) - (V n B) and bE (V n B) - (V n A). Let 1 denote the straight line segment with endpoints at a and b. Clearly 1 ~ V. Since Au B is convex, we have 1 ~ Au B, so that 1 = (1 n A) U (1 n B). Since 1 is connected and both 1 n A and 1 n B are closed, it follows that 1 n (A n B)
=
(1 n A) n (1 n B) =1=
contradicting the assumption that V nAn B
0,
= 0.
o
Lemma 7.2.2 can be re-stated as follows. If A and B are compact convex sets such that A U B is convex, then for every k > 0 we have Graff(A n B; k)
= Graff(A; k) n
Graff(B; k),
so that Ak(Graff(A U B; k))
= Ak(Graff(A; k)) + Ak(Graff(B; k)) -
Ak(Graff(A n B; k)). (7.2)
7.2 The intrinsic volumes and Hadwiger's formula
89
We now turn to the proof of the theorem. Proof of Theorem 7.2.1 To begin, define a function rJ on parallelotopes by
rJ(P) = Ak(Graff(P; k)).
(7.3)
It follows from (7.2) and Theorem 4.1.3 that rJ has a unique extension to a valuation on all of Par(n) (although the equation (7.3) will not hold for arbitrary finite unions of parallelotopes). Since Ak is invariant, so is the valuation rJ. Now suppose that P is a parallelotope, and let fp be the indicator function of Graff(P; k) in Graff(n, k). In the language of the preceding section we have
Ak(Graff(P; k))
=
J
fp dAk
=
JJ
!p(Vo,p) dpdvk(Vo),
(7.4)
where p ranges over Vl. For a fixed Vo E Gr(n, k), we have
!p(Vo,p) = fp(Vo + p) = 1 if and only if (Vo + p) n P -I- 0 (otherwise the function takes the value zero); that is, if and only if p E PlVl, where PIVoJ.. denotes the orthogonal projection of P onto the subspace Vl. In other words, the function !p(Vo,p) is the indicator function of PIVoJ... For a > 0 we now have
Ak(Graff(aP; k))
so that
Ak(Graff(aP; k)) = ( voln-k(aPlVl) dvk'(vo), JGr(n,k)
(7.5)
where voln-k denotes the (n - k)-dimensional volume in the (n - k)dimensional space VoJ... Since (n-k)-dimensional volume is homogeneous of degree n - k, we can continue:
rJ(aP)
Ak(Graff(aP; k)) a n- k ( voln-k(PlVl) dvk'(vo) JGr(n,k)
l
an-kj IplV-LdPdvk'(vo) Gr(n,k) Vo-L 0
90
7 The intrinsic volumes lor polyconvex sets a n- k Xk(Graff(P; k)) an-krJ(P).
It follows that rJ is homogeneous of degree n - k on parallelotopes, and consequently on all of Par(n). Since the (n - k)-volume is continuous on compact convex sets in vl, it also follows from (7.5) that rJ is a continuous valuation. Therefore, by Corollary 4.2.6, there exists a constant 'Y'k E R such that
rJ(P) = 'YkJ-ln-k(P)
for all P E Par(n). It is clear from (7.5) that rJ(P) > 0 if P has a non-empty interior in Rn. It follows that 'Y'k =1= O. Setting Ck = Ih'k, we have J-ln-k(P) = C'k Ak(Graff(P; k))
for all parallelotopes P E Par( n).
D
Theorem 7.2.1 relates the intrinsic volumes, as defined in Chapter 4 in terms of product valuations, to the invariant measure Ak on Graff(n, k). So far our main result asserts that the intrinsic volume J-ln-k, which for parallelotopes is evaluated as a symmetric function of the side lengths, also evaluates the 'measure' of the set of k-dimensional planes meeting a given parallelotope. This interpretation extends to convex sets. The obvious idea for the further extension of J-ln-k to all polyconvex sets is to use Groemer's extension Theorem 5.1.1; that is, to define a continuous set function J-l~-k on a compact convex set K by J-l~_k(K)
= CkAk(Graff(K;k)).
(7.6)
It then follows from (7.2) and (7.5) that J-l~-k is a continuous valuation on compact convex sets. Theorem 5.1.1 then asserts that J-l~-k has a unique extension to all polyconvex sets in Rn. Note, however, that the equation (7.6) is valid only if K is convex. Alternatively, we can give a more constructive definition for J-l~-k on Polycon( n). It turns out that we can let the Euler characteristic do most of the work. To motivate this method, recall the definition of J-ln-k on Par(n) as a product valuation. Let I be any simple function on Par(n). We have shown that
J
I dJ-ln-k
=L 0-
JJ... J
I(Xl, X2,··., xn) dJ-l&(Xo-(l)) dJ-l&(xo-(2))
7.2 The intrinsic volumes and Hadwiger's formula
where the sum ranges over all permutations The integral
(J'
91
of the set {I, 2, ... ,n}.
JJ... J
f(xI, X2,···, xn) dfL6(XI)'" dfL6(x k)
(7.7)
can be rewritten as follows. Let V be the linear variety of dimension k parallel to the linear subspace spanned by Xl, X2, ... , Xk, and passing through the point 0, ... ,0, Xk+l, ... , Xn. Let A E Polycon(n) , and let f = h· Then the integral (7.7) simplifies to
JJ... J
hfv dfL6(XI)'" dfL6(x k)
= fLo(A n V).
To indicate the dependence of V on the coordinates Xk+l, Xk+2,···, Xn , let us write V = V(Xk+l,'" ,xn ). We then have
J
fA dfLn-k
=L
JJ... J
fLo (A n V(Xo-(k+l) , ... ,Xo-(n)))
0-
This formula suggests replacing the summation over the symmetric group of permutations by integration over the space Gr(n, k) relative to the invariant measure v'k. Specifically, let V (Vo; p) be the linear variety of dimension k parallel to the linear subspace Vo, and intersecting the linear subspace vl at the point p E Rn. We then obtain
JJ
fLo (A n V(Vo;p))
dfL~=~(P) dv'k(vo).
This can be simplified further, since the integration is practically carried out over the invariant measure on Graff(n, k). For all A E Polycon(n), define
ji~_k(A) = C'k
J
fLo (A n V) dAk(V)·
(7.8)
This formula is known as Hadwiger's formula. The integral ranges over all V E Graff(n, k). It is easily verified that fL~-k' as extended by (7.8), is a convex-continuous invariant valuation on Polycon(n). Note that fLo(K n V) = fGraff(K;k) (V) if K is a compact convex set. Hadwiger's formula (7.8) then implies that ji~_k(K)
= C'kAk(Graff(K;k)).
Thus, the valuation ji~-k defined by (7.8) agrees with the alternative extension (7.6) as well as with Theorem 7.2.1. However, the equation (7.8)
92
7 The intrinsic volumes for polyconvex sets
defines intrinsic volumes on every polyconvex set A, giving the desired extension explicitly. Recall that the intrinsic volume /tk was called 'intrinsic,' because /tk(P) remains the same for a parallelotope P, even if P is rigidly embedded within a higher dimensional space. In other words, the valuations /tk on Par(n) are normalized independently of n (see Theorem 4.2.2). We shall see later that this property carries over to the extension of /tk given by (7.8), so that our use of the term 'intrinsic' remains justified (see Theorem 8.4.1). One way to see this is to let K E Kn, embed Rn into Rn+l, and compare /t!:;(K) with /t~+l(K) as given by (7.8). The value of /t~+l is given as the measure of all (n - k + 1)-dimensional planes in Rn+l that meet K. Since a generic (n - k + l)-plane in Rn+l meets Rn in a plane of dimension n - k, the measure of all (n - k + l)-planes meeting K in Rn+l is proportional to the number of (n - k)-planes meeting K in Rn, where the proportionality constant is independent of K, and is given by the measure (in the appropriate sense) of (n - k + 1)planes in Rn+l containing a given (fixed) (n - k)-plane. If we call this proportionality constant a~,n+l, we have
for all K E Polycon(n). Meanwhile, Theorem 4.2.2 asserts /t~+l(P) = /t!:;(P) for all P E Par(n). It follows that a~,n+l = 1, and the intrinsic volumes are universally normalized. The problem in this argument is the claim that the measure of all (n - k + l)-planes meeting K in Rn+l is proportional to the number of (n-k)-planes meeting K in Rn. This statement, although true, requires additional measure-theoretic detail in order to be made precise. Instead, we defer a rigorous proof of the universal normalization to Section 8.4. For the interim, and in order to avoid ambiguity, we will continue to use the notation /t!:; for the kth intrinsic volume on Polycon( n) until we are able to prove the normalization property rigorously in Theorem 8.4.1. We will make an exception for /to, which is clearly normalized independently of the ambient space Rn, since /to(K) = 1 for all non-empty compact convex sets K. We are now in a position to generalize the results of Section 1.2. Let K and L be compact convex sets in Rn. Suppose that L has dimension n, and that K ~ L. Then Graff(K; k) ~ Graff(L; k), and the conditional probability that a linear variety of dimension k shall meet K, given that
7.3 An Euler relation for the intrinsic volumes
93
it meets L, is given by the ratio Ak(Graff(K; k)) Ak(Graff(L; k)) .
From (7.6) and (7.8) we now deduce the following.
Theorem 7.2.3 (Sylvester's theorem) Let K ~ L be compact convex sets. Suppose that L is of dimension n. The conditional probability that a linear variety of dimension k shall meet K, given that it meets L, is given by J.l~_k(K) J.l~_k(L)
. D
Note that this probability can be computed relative to any invariant measure on Graff(n, k). The normalizing constants Ck are irrelevant, since they cancel in the ratio above.
7.3 An Euler relation for the intrinsic volumes The Euler characteristic formula of Theorem 5.2.4 generalizes to intrinsic volumes. To this end, we recall from Theorem 5.2.3 that the Euler characteristic for the interior of a convex polytope P of dimension n is given by J.lo(int P)
=
(_l)n.
Let P be a compact convex polytope. For almost all V E Graff( n, n k) (with respect to the measure A~_k)' we have relint(P) n V =I- 0 whenever P n V =I- 0. Hadwiger's formula (7.8) then implies that
J.lk (relint(P)) =
Ck
= Ck
r
J.lo(relint(P) n V)
r
(_l)dim(relint(p)nv) J.lo(P n V)
}vEGraff(n,n-k)
dA~_k(V) dA~_k(V).
}vEGraff(n,n-k)
Note that if relint(P) n V =Idim(relint(P) n V)
0, then
= dim(P n V) = dimP + dim V -
dim(P U V),
where dim(PUV) denotes the dimension of the smallest plane containing
94
7 The intrinsic volumes for poly convex sets
PUV. If J1k(relint(P)) =I- 0, then dimP 2': k. Therefore, dim(PUV) = n for almost all V E Graff( n, n - k). It follows that dim(relint(P) n V)
= dimP + (n - k) - n = dimP - k,
and we obtain the formula
J1k(relint(P))
=
(-l)dimP-k J1k (P)
(7.9)
for all convex polytopes P. From Hadwiger's formula (7.8) we now derive Euler relations for the intrinsic volumes of an arbitrary polytope P. As in Section 5.2, we define a system of faces of P to be a family F of compact convex polytopes such that if Q,Q' E F and Q =I- Q' then (relint(Q)) n (relint(Q')) = 0, and such that
U(relint(Q))
=
P.
QEF
Under these conditions, the formula (7.9) yields at once
J1k(P)
=
2: (_l)dim
Q -k J1k(Q)·
QEF
7.4 The mean projection formula We conclude this section with an alternative interpretation of Hadwiger's formula (7.8) for compact convex sets.
Theorem 7.4.1 (The mean projection formula) For all K E Kn,
J1k(K)
= C;;:-k
r
JGr(n,k)
J1Z(KlVo) dvk'(vo).
Recall that J1Z denotes the k-dimensional volume on each k-dimensional subspace of Rn. Theorem 7.4.1 states that the kth intrinsic volume J1k(K) is proportional to the mean of the k-volumes of the orthogonal projections of K onto all k-dimensional subspaces of Rn. Proof For K E Kn, we have C;;:_k.A~_k(Graff(K;
C;;:-k
r
r
JGr(n,n-k) JVl
n - k))
fK(Vo,P) dp dV;;:_k(Vo)
7.5 Notes
95
Recall that lK(Vo,p) = 1 if Kn(Vo+p) -I- 0 and is zero otherwise. Since K n (Vo + p) -I- 0 if and only if p lies inside the projection KlVl, we have lK(Vo,p) = IKlVl(p). Therefore, f.1~(K)
C~_kA~_k(Graff(K;
r C~_k r C~_k r C~_k
JGr(n,n-k)
JGr(n,n-k)
JGr(n,k)
1
n - k))
vl
IKlVo(p) dp
dV~_k(Vo)
f.1~(KlVl) dV~_k(Vo)
f.1~(KlVo) dvk'(Vo),
where the last equality follows from the fact that the orthogonal duality between Gr(n, k) and Gr(n, n - k) preserves measure. D Note that the preceding argument also yields
A~_k(Graff(K; n -
k))
=
r
JGr(n,k)
f.1~(KlVo) dvk'(Vo)
for all polyconvex sets K. However, since (7.6) defines f.1~ for convex sets only, Theorem 7.4.1 need not hold for all K E Polycon(n).
7.5 Notes A brief survey of Euler relations for valuations on Polycon(n) appeared in [72, p. 218-220]. See also [37, 39, 79, 80]. For a general reference on the theory of valuations on convex bodies, see [71, 72] and [85]. For additional related integral geometric formulas and applications, see also [82] and [99]. The theory of intrinsic volumes can be extended to partially ordered sets. Let P be a finite partially ordered set, with a unique minimal element o. An order ideal of P is a subset A of P such that, if x E A and y ::; x, then YEA. The set of all order ideals of P is denoted by L(P). It is closed under unions and intersections; that is, it is a distributive lattice. (It can be shown that every finite distributive lattice can be represented in the form L(P) for some partially ordered set P). We shall consider valuations on L(P). If x E P, denote by x the order ideal consisting of all yEP such that y ::; x. A valuation f.1 on P will be called invariant if f.1(A) = f.1(B) whenever the order ideals A and Bare isomorphic as partially ordered sets. It can be shown that every valuation f.1 on L(P) extends uniquely to the Boolean algebra of all subsets of P, and that a valuation is uniquely
96
7 The intrinsic volumes for polyconvex sets
determined by assigning the value f.1(x) for each x E P. We typically consider only valuations such that f.1(a) = O. There are always at least two invariant valuations on P, namely, the size;
a(A)
= IAI- 1, A =1= {a},
and the Euler characteristic, which is uniquely determined by setting
f.10(x) = 1 if x
=1=
{a}.
It can be shown (see [78, 79, 80]) that the Euler characteristic is closely related to the Mobius function of P; that is, the integer-valued function on P uniquely defined by the conditions
f.1(a,8) Lf.1(8,q)
=
= 1
0, p > 8.
q~p
One finds that
f.1o(A)
= -
L
f.1(8,p).
pEA,p#8
Thus, one obtains a generalization of the expression for the Euler characteristic in terms of the 'number of faces'. Several special cases have already been studied in detail. Other invariant valuations are defined as follows. Let ex be an isomorphism class of 'simplices' X. For every A E £(P), let f.1a be the number of x of class ex contained in A. We conjecture that the f.1a and f.1o span the vector space of all invariant valuations. The identities holding among the f.1a are particularly interesting to determine. Actually, the theory of invariant valuations on partially ordered sets should proceed along general lines. There is a more general notion of invariant measure that deals with arbitrary segments [x, y] of P, rather than just segments x = [0, x]. Such valuations can be multiplied in a way that resembles multiplication in the incidence algebra of P (see [78, 79, 92]). There is more than one gap in the analogy between the discrete and the continuous case. On the one hand, the ease with which the theory of invariant valuations can be carried over to arbitrary partially ordered sets suggests the possibility of generalization of the intrinsic volumes to a more general space than R n. On the other hand, the analogy between £(8) (or P(8)) and Mod(n) is deficient, since we do not consider order
7.5 Notes
97
ideals in Mod(n) (or Aff(n)) as our basic building blocks, but rather the polyconvex sets. The difficulty here is that of singling out a sufficiently ample class of order ideals in Mod(n) that can be taken as simplices (so as to replace convex sets). Convexity on Grassmannians is at present too unwieldy a notion, and somehow the Schubert cell structure of the Grassmannian must be brought to bear on the problem (see, for example,
[74]).
8 A characterization theorem for volume
In this chapter we state and prove a fundamental theorem of geometric probability, namely the characterization of volume on polyconvex sets as a continuous rigid motion invariant simple valuation Polycon( n). The characterization of volume will lead in turn to a straightforward characterization for all of the intrinsic volumes (see Section 9.1). In Section 8.4 we verify the universal normalization of the intrinsic volumes. In Section 8.5 we investigate a connection between volume and random motions, leading to an analogue of the Buffon needle problem, in which evenly spaced lines are replaced by a discrete additive subgroup of Rn. The results of Section 8.5 will be used in Section 9.6 to solve the Buffon needle problem in a still more general form.
8.1 Simple valuations on polyconvex sets In this section we state and prove a characterization theorem for volume, leading to a generalization of Theorem 4.2.4 to Polycon(n). Recall that Theorem 4.2.4 characterized volume in two ways, one involving continuity and the other involving monotonicity. It turns out to be much easier to generalize Theorem 4.2.4 in the monotonic case than it is in the continuous case. Theorem 8.1.1 Suppose that J-L is a monotone translation invariant simple valuation on Polycon( n). Then there exists C E R such that J-L(K) = CJ-Ln(K), for all K E Polycon(n).
Proof We shall assume without loss of generality that J-L is an increasing valuation, for, if J-L is decreasing, then the valuation -J-L is increasing.
8.1 Simple valuations on polyconvex sets
99
Consider first the restriction of f.-l to Par(n). By Theorem 4.2.4, there exists c E R such that f.-l(P) = Cf.-ln(P), for all P E Par(n). Next, consider K E Kn. Recall from the definition of volume in elementary calculus that f.-ln(K) is given by the supremum of f.-ln(P) over all P E Par( n) such that P ~ K. Since f.-l is increasing, we have f.-l(P) ~ f.-l(K) for all P ~ K. In other words, Cf.-ln(P) ~ f.-l(K) for all P ~ K. It follows that cf.-ln(K) ~ f.-l(K). Recall again that f.-ln (K) is also given by the infimum of f.-ln (P) over all P E Par(n) such that K ~ P. It then similarly follows that f.-l(K) ~ cf.-ln(K). Hence, f.-l(K) = cf.-ln(K), for all K E Kn, and therefore also for 0 all K E Polycon(n). A characterization for volume as a continuous valuation, free of monotonicity conditions, requires considerably more work. We begin with some preliminary definitions. Recall from Section 5.1 that a non-empty compact convex set K E Kn is determined uniquely by its support function hK : sn-l - > R, defined by h K ( u) = maxxEK{ X· u}, where· denotes the standard inner product on Rn. Recall also that, if vERn and v denotes the line segment with endpoints v and -v, then hv(u) = lu, vi, for all u E sn-l. For K E Kn, denote by -K the set {x : -x E K}; that is, the reflection of K through the origin. If K = - K we say that K is centered or symmetric about the origin. A set K is centered or symmetric if some translate of K is centered about the origin. Denote by K~ the set of all centered compact convex sets in Rn. Recall that for compact convex sets K and L the Minkowski sum K + L is defined by
K
+L =
{x
+y :x
E
K and y
E
L},
and that hK +L = hK + h L . A zonotope is a finite Minkowski sum of straight line segments. A convex body Y is called a zonoid if Y can be approximated in Kn by a convergent sequence of zonotopes [85, p. 183J. We shall need the following useful fact concerning zonoids and smooth convex bodies. A complete discussion of this result and its proof may be found in [26, 34, 85J. Proposition 8.1.2 Let K E K~, and suppose that the support function hK is COO. Then there exist zonoids Y 1 , Y 2 such that
8 A characterization theorem for volume
100
Proof For 9 E coo(sn-l), the cosine transform of g, denoted Cg, is given by the equation Cg(u)
=
r
}Sn-l
Iu· vlg(v) dv.
The transform C is a bijective linear operator on the space of all even Coo functions on sn-l. This fact is a consequence of Schur's lemma (or the Funke-Hecke theorem) for spherical harmonics [85, pp. 182-189] (see also [26, 34]). Since the function hK : sn-l -----t R is Coo and even, there exists an even Coo function 9 : sn-l -----t R such that hK = Cg; that is,
hK(U) Let g+(v)
=
r
}Sn-l
Iu· vlg(v) dv.
= max{g(v),O}, and let g-(v) = max{-g(v),O}. Then
hK(U)+
r
}Sn-l
Iu,vlg-(v)dv=
r
}Sn-l
It is easy to check that the functions hY1
Iu,vlg+(v)dv.
(8.1)
= Cg+ and hY2 = Cg- each
satisfy the properties of a support function of a centered convex body, which we denote by Yl and Y2 respectively. The equation (8.1) is then equivalent to the statement that K + Y 2 = Y 1 . Moreover, since the Riemann sums converging to the integrals in (8.1) are linear combinations of support functions of line segments (Le. support functions of zonotopes), it follows that Y 1 and Y 2 are zonoids. 0 Let SO(n) denote the special orthogonal group; that is, the set of all rotations of R n. Let B = {el' ... , en} denote the standard basis for R n , and denote by SO(n, B) the set of all rotations in SO(n) that fix at least n - 2 elements of the basis B. Proposition 8.1.3 Suppose that 1> E SO(n). Then there exists a finite collection 1>1,1>2, ... , 1>m E SO(n, B) such that 1> = 1>11>2'" 1>m-
Proof The proposition holds trivially in dimension n = 2, since SO(2, B) = SO(2). Suppose that n 2: 3 and that the proposition holds for dimension n - 1. Let 1> E SO(n), and suppose that 1> tj SO(n, B). Let v = 1>en , and assume without loss of generality that v =I=- en. Let v' denote the unit normalization of the orthogonal projection of v onto Span{ el, ... , en-I} = Rn-l. There exists 'l/J E SO(n) such that 'l/Je n = en and 'l/Jv' = en-I. As v lies within Span{v', en}, it follows that 'l/Jv lies within Span{ en-I, en}.
8.1 Simple valuations on polyconvex sets
101
Let ( be the rotation that fixes e1, ... ,en-2 and rotates 'lj;v to en. Then ( E SO(n, B), and ('lj;¢e n = ('lj;v = en· Let 7] = ('lj;¢. Since 'lj; and 7] both fix en, it follows from the induction assumption on SO(n - 1) that there exist 'lj;1,' .. 'lj;i, 7]1, ... 7]j E SO(n, B) such that 'lj; = 'lj;1 ... 'lj;i and 7] = 7]1 ... 7]j. Thus,
¢ = 'lj;-lC l 7] = 'lj;:;1 ... 'lj;llC l 7]l ... 7]j. D
A valuation J-L on Kn is said to be simple if J-L vanishes on sets of dimension less than n. Theorem 8.1.4 Suppose that J-L is a continuous translation invariant
simple valuation on Kn. Suppose also that J-L([O,l]n) = 0, and that J-L(K) = J-L(-K), for all K E Kn. Then J-L(K) = 0, for all K E Kn. Here [O,l]n denotes the n-fold Cartesian product of the closed unit interval [0, 1] with itself; that is, a unit n-cube. Proof If n = 1 then the result follows readily, since a compact convex subset of R is merely a closed line segment. Since J-L is simple and vanishes on the closed line segment [0,1]' it must vanish on all closed line segments of rational length. It then follows from continuity that J-L vanishes on all closed line segments. For n > 1, assume that Theorem 8.1.4 holds for valuations on Kn-1. Since J-L is translation invariant and simple, the fact that J-L( [0, l]n) = implies that J-L([O, l/k]n) = for all integers k > 0. Therefore, J-L(C) = for every box C of rational dimensions, with sides parallel to the coordinate axes. This follows from the fact that such a box can be built up out of cubes of the form [0, l/k]n for some k > 0. The continuity of J-L then implies that J-L( C) = for every box C of positive real dimensions, with sides parallel to the coordinate axes. Next, suppose that D is a box with sides parallel to a different set of orthogonal axes. If n = 2 then it is easy to see that D can be cut into a finite number of pieces, translations of which can be pasted to form a box C with sides parallel to the original coordinate axes (see Figure 8.1). Since J-L is simple and translation invariant, it follows that J-L(D) = J-L(C) = 0. If n > 2, then for all rotations (E SO(n,B), a box with sides parallel to the basis (B can be cut, translated, and re-pasted into a box parallel to B, using precisely the operations followed in the case n = 2. This works because the rotation ( fixes at least n - 2 of the original coordinate axes. More generally, for 'lj; E SO(n), Proposition 8.1.3 states that 'lj; is a finite product of elements of SO(n, B).
°
°
°
°
102
8 A characterization theorem for volume
________ 1
Fig. 8.1. Re-orient a frame without use of rotations.
Therefore, a box with sides parallel to the basis 'If;E can be cut, translated, and re-pasted into a box parallel to E, using a finite iteration of operations of the type used in the case n = 2. It follows that, if D is a box with sides parallel to any orthogonal frame in Rn, then D can be transformed into a box C with sides parallel to the original coordinate axes, by means of cutting, pasting, and translations. Therefore, we have p,(D) = p,(C) = O. Next, define a valuation 7 on Kn-l as follows. Given a compact convex subset K of R n - l , set
7(K) = p,(K x [0,1]). Note that 7([0, l]n-l) = p,([o,l]n) = O. Notice also that 7 satisfies the hypotheses of Theorem 8.1.4 in dimension n - 1. The induction hypothesis then implies that 7 = O. Since p, is simple, it follows that p,(K x [a, b]) = 0, for any convex body K ~ R n - l and any rational numbers a and b, with a :::; b. The continuity of p, then implies that p,(K x [a, b]) = 0 for all a, bE R. Said differently, p, is zero on any right cylinder with a convex base. Let Xl, ... ,Xn be the coordinates on R n. We can represent R n-l by the hyperplane Xn = O. The right cylinders for which we have shown p, to be zero have tops and bottoms that are congruent and that lie directly above and below each other. In other words, the edges connecting the top face to the bottom face are orthogonal to the hyperplane Xn = O. This process can be applied to right cylinders with base in any (n-1)dimensional subspace of Rn. Since p, = 0 on boxes in every orientation, it follows (from the preceding argument) that p, = 0 on right cylinders of every orientation. Suppose that M is a prism, or slanting cylinder, for which the top and bottom faces are congruent and both parallel to the hyperplane Xn =
8.1 Simple valuations on polyconvex sets
103
Fig. 8.2. Turn a prism into a right cylinder.
0, but whose cylindrical boundary is no longer orthogonal to Xn = 0, meeting it instead at some constant angle. See Figure 8.2. Cut M into two pieces, MI and M 2 , separated by a hyperplane that is orthogonal to the cylindrical boundary of the prism. Rearrange the pieces Mi and re-paste them together along the original (and congruent) top and bottom faces. We are then left with a right cylinder C whose surrounding boundary is orthogonal to the new top and bottom faces. Since f.1 remains constant under this operation, it follows that
(Actually, such a cutting and rearrangement is possible only if the diameter of the top/bottom of M is sufficiently small compared with the height and angle of the cylindrical boundary; i.e. provided that M is not too 'fat'. If the base of M is too large, however, we can subdivide M into 'skinny' prisms by subdividing the top/bottom of M into convex bodies of sufficiently small diameter and considering separately each prism formed by taking the convex hull of the (disjoint) union of a piece of the bottom of M with its corresponding congruent piece of the top of
M.) Now let P be a convex polytope having facets PI, ... , Pm, and corresponding outward unit normal vectors UI, ... , U m . Let v ERn, and let v denote the straight line segment connecting the point v to the origin o. Without loss of generality, let us assume that PI, ... ,Pj are exactly those facets of P such that Ui . v > 0, for each 1 ~ i ~ j. In this case,
104
8 A characterization theorem for volume
the Minkowski sum P
+ v can be expressed in the form
where each term of the above union is either disjoint from the others, or intersects another in a convex body of dimension at most n - 1. It follows that
Notice, however, that each term of the form Pi f.1(Pi + v) = 0. Hence, f.1(P
+ v) =
+ v is a
prism, so that (8.2)
f.1(P),
for all convex polytopes P and all line segments V. By induction over finite Minkowski sums of line segments, it immediately follows from (8.2) that, for all convex polytopes P and all zonotopes Z,
f.1(Z)
= 0,
and f.1(P
+ Z) = f.1(P).
The continuity of f.1 then implies that f.1(Y) = 0, and f.1(K
+ Y)
=
for all K E K n and all zonoids Y. Next, suppose that K E K~ has a Coo support function h K Proposition 8.1.2, there exist zonoids Y1 and Y2 such that K + Y2 In this case, (8.3) implies that f.1(K) = f.1(K
+ 1'2) =
(8.3)
f.1(K),
.
=
By Y1 .
f.1(Y1 ) = 0.
Since any centered convex body K can be approximated by a sequence Ki of Coo centered convex bodies, it follows (by continuity) that f.1 is zero on all of K~. Now let .6. be an n-dimensional simplex, with one vertex at the origin. Let Ul, ... ,Un denote the other vertices of .6., and let P be the parallelotope spanned by the vectors Ul,"" Un. Let v = Ul + ... + Un. Let 6 be the hyperplane passing through the points Ul, ... , Un, and let 6 be the hyperplane passing through the points v - Ul, .•. ,v - Un. Finally, denote by P* the set of all points of P lying between the hyperplanes 6 and 6. We can now write P
= .6. U P* U (-.6. + v),
105
8.1 Simple valuations on polyconvex sets
where each term of the union intersects another in dimension at most n - 1. Since P and P* are centered, we have
0= p,(P)
=
p,(I:::.)
+ p,(P*) + p,( -I:::. + v) =
p,(I:::.)
+ p,( -1:::.).
In other words, p,(I:::.) = -p,( -1:::.). Meanwhile, we are given that p,(I:::.) = p,( -1:::.). Therefore p,(I:::.) = 0, for any simplex 1:::.. Let P be a convex polytope in Rn. The polytope P can be expressed as a finite union of simplices
such that the intersection I:::. i n I:::. j has dimension less than n, for all i =I- j. It follows that m
p,(P)
=
"Lp,(l:::. i )
=
O.
i=l
Since the set of all convex polytopes is dense in Kn, the continuity of p, then implies that p,(K) = 0 for all K E Kn. D Theorem 8.1.4 is equivalent to the following theorem. Theorem 8.1.5 (The volume characterization theorem) Suppose that p, is a continuous translation invariant simple valuation on Kn. Then there exists c E R such that p,(K) + p,(-K) = cp,n(K), for all KEKn. Note that Theorem 8.1.5 implies that p,(K) = (c/2)P,n(K) for all centered convex bodies K E K~. Proof of equivalence Suppose that p, is a continuous translation invariant simple valuation on Kn. For K E Kn, define
v(K)
= p,(K) + p,( -K) - 2p,([0, l]n)P,n(K).
Then v satisfies the hypotheses of Theorem 8.1.4, so that v(K) all K E Kn. Therefore,
p,(K)
+ p,( -
K)
=
= 0 for
cp,n(K),
where c = 2p,([0,1]n). Hence, Theorem 8.1.4 implies Theorem 8.1.5. The reverse implication is obvious. D
106
8 A characterization theorem for volume
8.2 Even and odd valuations A valuation f.1 on Kn is said to be even if f.1(-K) = f.1(K)
for all K E Kn. If f.1(-K) = -f.1(K)
for all K E Kn then f.1 is said to be odd. Every valuation f.1 on Kn has a decomposition f.1 =
where
f.1even
f.1odd
+ f.1odd,
is the even valuation defined by
=
f.1even(K)
and
f.1even
~(f.1(K)
+ f.1( -K))
is the odd valuation defined by f.1odd(K)
=
~(f.1(K) - f.1(-K)).
Theorem 8.1.5 implies that the volume f.1n is the only continuous, translation invariant, even, simple valuation on Kn, up to a constant factor. More generally, for any continuous, translation invariant, simple valuation f.1, there exists C E R such that f.1(K)
= cf.1n(K) + f.1odd(K)
(8.4)
for all K. A natural question at this point is that of whether the property of evenness is necessary to characterize volume. Do there exist any nontrivial continuous, translation invariant, odd, simple valuations? The answer turns out to be yes, even in the case of dimension 2. For example, let .6. denote the equilateral triangle in R 2 of unit side length, centered at the origin and with a side parallel to the x-axis. Define a valuation 'T/ on K2 by the equation (8.5) It is clear that 'T/ is a translation invariant continuous function of K. To see that 'T/ is a valuation, we require the following proposition.
Proposition 8.2.1 Suppose K, L, M E Kn such that K U L is convex. Then
(K U L)
+M
=
(K + M) U (L + M),
(8.6)
8.2 Even and odd valuations
107
and
(KnL) +M = (K +M) n (L+M).
(8.7)
Proof The equation (8.6) is obvious, even when K U L is not convex. To prove (8.7), suppose that x E (K n L) + M. Then x = y + m, where y E KnL and m EM. Evidently x E K +M and x E L+M, so that
(K n L) + M
~
(K + M) n (L + M).
Next, suppose that x E (K +M)n(L+M). Then x = a+ml = b+m2, where a E K, bEL, and ml,m2 E M. Since K U L is convex, with K and L each compact, the line segment with endpoints at a and b must contain a point y E K n L, which we can express in the form y = (1 - t)a + tb for some 0 :s:; t :s:; 1. Let m = (1 - t)ml + tm2. Since M is convex, mE M, and y + mE (K n L) + M. Meanwhile,
=
y+ m
(1 - t)a + tb + (1 - t)ml + tm2
=
(1 - t)x + tx
= x,
so that x E (K n L) + M, and
(K n L) + M
~
(K + M) n (L + M). o
This completes the proof.
M
Now suppose that K, L E K} such that K U L is convex. If we set = .6. then Proposition 8.2.1 implies that
J-l2((K U L) +.6.) + J-l2((K n L) +.6.) =
J-l2((K +.6.) U (L + .6.)) + J-l2((K +.6.) n (L + .6.))
=
J-l2(K +.6.) + J-l2(L + .6.),
and similarly if.6. is replaced with -.6.. Consequently, we have
77(K U L)
+ 77(K n L) = 77(K) + 77(L).
Moreover, since J-l2 is even,
77(-K)
= J-l2(-K +.6.) - J-l2(-K + -J-l2(K +.6.) -77(K),
so that 77 is an odd valuation on K2.
(-.6.))
= J-l2(K +
(-.6.))
8 A characterization theorem for volume
108
Fig. 8.3. The Minkowski sum 6
+ (-6).
It immediately follows that 'f} is simple! To see this, suppose that K is a symmetric convex body; i.e., K = -K. Then we have
'f}(K) = 'f}( -K) = -'f}(K), so that 'f}(K) = O. However, all points and line segments are symmetric convex bodies, so that 'f} must vanish in dimensions 0 and 1. Finally, we check that 'f} -I- O. The fact that 'f}(f:l.) -I- 0 follows easily from the Brunn-Minkowski inequality in the plane [85, p. 309]. However, this fact can also be seen by a simple and direct computation. It can be seen in Figure 8.3 that the Minkowski sum f:l. + (-f:l.) is a regular hexagon of unit edge length, having area
whereas
Hence,
'f}(f:l.) = /12(f:l. + f:l.) - /12(f:l.
/3 -I- o. + (-f:l.)) = -2/12(f:l.) = -2"
Shortly after the discovery of Theorem 8.1.5, which characterizes the continuous translation invariant even simple valuations, Schneider used an analogous approach to characterize the continuous translation invariant odd simple valuations. This led to the following theorem.
Theorem 8.2.2 (Schneider's characterization theorem) Suppose that /1 is a continuous translation invariant odd simple valuation on Kn. Then there exists a continuous odd function 9 : sn-l - > R and a measure S K on sn-l such that
/1(K) for all K E Kn.
=
r
lSn-l
g(u) dSK
o
8.3 The volume theorem
109
Here sn-l denotes the unit sphere in Rn. The measure SK is sometimes called the Aleksandrov-Fenchel-Jessen measure associated with K. Since any continuous translation invariant simple valuation f..t can be expressed as a sum f..t = f..teven + f..todd of even and odd valuations, Theorems 8.1.5 and 8.2.2 combine to improve the characterization (8.4). In other words, a continuous translation invariant simple valuation f..t must have the form f..t(K)
where c E Rand 9 :
= cf..tn(K) +
sn-l ----t
r
}Sn-l
g(u) dSK
,
R is a continuous odd function.
8.3 The volume theorem In order to generalize Theorem 4.2.4 to the lattice Polycon(n), we require the following proposition relating rotation invariance to invariance under reflections. Proposition 8.3.1 (Sah) Let .6. be an n-dimensional simplex. There exist polytopes PI, ... , Pm such that
where each term of this union intersects another in dimension at most n -1, and where each of the polytopes Pi is symmetric under a reflection across a hyperplane. Proof Let xo, ... , Xn be the vertices of .6., and let .6. i be the facet of .6. opposite to Xi. Let Z be the center of the inscribed sphere of .6., and let Zi be the foot of the perpendicular from Z to the facet .6. i . For all i < j, let Ai,j denote the convex hull of Z, Zi, Zj, and the face .6. i n .6. j . Then
.6. =
U
Ai,j,
O~i<j~n
where the distinct terms Ai,j of this union intersect in at most dimension n-1. It is also evident that each Ai,j is symmetric under reflection across the n - 1 hyperplane determined by the point Z and the face .6. i n .6. j • Now relabel the polytopes Ai,j by a linear ordering PI' ... ' Pm, where m = ~n(n + 1). This gives
.6.
= PI U··· U Pm,
where the polytopes Pi satisfy the desired conditions.
o
110
8 A characterization theorem for volume
We now generalize Theorem 4.2.4 to the lattice Polycon(n).
Theorem 8.3.2 (The volume theorem for Polycon( n)) Suppose that f1 is a continuous rigid motion invariant simple valuation on Kn (or Polycon(n)). Then there exists c E R such that f1(K) = Cf1n(K), for all K E Kn (or Polycon(n)). Recall from Groemer's extension Theorem 5.1.1 that a convex-continuous valuation is well defined on Polycon( n) if and only if it is well defined on the generating set Kn. Proof Since f1 is translation invariant (as well as rotation invariant) and simple, Theorem 8.1.5 implies the existence of a E R such that f1(K) + f1( -K) = af1n(K), for all K E Kn. Let .6. be a simplex in R n. Then we have (8.8) If the dimension n of the ambient Euclidean space is even, then .6. differs from -.6. by a rotation, so that
Meanwhile, if n is odd, from Proposition 8.3.1 there exist polytopes PI,"" Pm such that
where each term of this union intersects another in dimension at most n -1, and where each of the polytopes Pi is symmetric under a reflection across a hyperplane. It follows that each Pi differs from - Pi by a proper rigid motion (i.e. by a rotation followed by a translation), so that f1( -Pi) = f1(Pi). Therefore, m
m
i=l
i=l
(8.9) Taken together, (8.8) and (8.9) imply that f1(.6.) = (a/2)f1n(.6.) for any simplex .6.. Let c = a/2, and suppose that P is a convex polytope in Rn. The polytope P can be expressed as a finite union of simplices
8.4 The normalization of the intrinsic volumes such that the intersection i =1= j. It follows that
M(P)
~i
n
~j
111
has dimension less than n, for all
+ ... + M(~7n) CMn(~l) + ... + CMn(~7n)
M(~l)
CMn(P). Since the set of all convex polytopes is dense in Kn, the continuity of M then implies that M(K) = CMn(K) for all K E Kn. This concludes the proof of Theorem 8.3.2. 0
8.4 The normalization of the intrinsic volumes Recall from Theorem 4.2.2 that if P is a parallelotope in Rl, and if we consider Rl ~ Rn for some n > l, then Ml(p) = M'k(P), for all k ~ o. We will now show that this universal normalization holds when P is replaced by any polyconvex set. Theorem 8.4.1 (The universal normalization theorem) The valuations Mi on Polycon(n) are normalized independently of the dimension n.
Proof Theorem 8.4.1 clearly holds for i = 0, since Mo(K) = Mo(K) = 1 for all non-empty compact convex subsets of R n for all dimensions n. Moreover, it is clear that M~ restricts to Ml for all l < k, since both valuations are identically zero on polyconvex sets of dimension l < k. Let n > k, and suppose that M~-l restricts to Ml for all k ~ l ~ n - 1.
(8.10)
We need to show that the condition (8.10) holds for M'k as well. Since M'k vanishes in dimension less than k, the restriction of M'k to a k-dimensional plane in Rn is a continuous, invariant, simple valuation on Polycon(k). By the Volume Theorem 8.3.2 there exists C E R such that M'k(K) = CM~(K) for all K E Polycon(k). Since M'k(P) = M~(P) for all parallelotopes P E Par(k) (by Theorem 4.2.2), it follows that C = 1 and M'k = M~ on Polycon(k). If k = n - 1 then we have achieved our goal. If k < n - 1, suppose that
M'k restricts to Ml for some k ~ l < n - 1.
(8.11)
8 A characterization theorem for volume
112
To complete the induction step and prove the theorem, we need to show that the condition (8.11) holds for Ikk and 11~+1 as well. Denote by v the restriction of 11k to Polycon( l + 1). Then v restricts to 11~ on Polycon(l) by the condition (8.11), whereas 11~+1 restricts to 11~ on Polycon(l) by the condition (8.10). It follows that v - 11~+1 vanishes on Polycon(l), so that v - 11~+1 is a continuous invariant simple valuation on Polycon(l + 1). By Theorem 8.3.2 there exists c E R such that v - 11~+1 = on Polycon(l + 1). Since v - 11~+1 vanishes on all of Par(l + 1) by Theorem 4.2.2, we have c = 0 and v = 11~+ I. The theorem now follows by double induction, first on l and then on n. 0
cl1lti
Henceforth we are justified in omitting reference to ambient spaces when discussing the intrinsic volumes, and so we shall simplify the notation 11k to 11k. The induction technique used in the proof of Theorem 8.4.1 actually leads to a much more powerful result, with which we shall begin Chapter 9.
8.5 Lattice points and volume Let B = {VI, ... , v n } be a basis for R n, and let £ denote the collection of points in R n consisting of all vectors having integer coordinates with respect to B; that is,
£={alvl+···+anvn
al, ... ,anEZ}.
Here Z denotes the set of integers. The set £ forms a discrete subgroup of R n with respect to vector addition. Discrete subgroups of R n are traditionally called lattices, and the elements of a lattice £ are called lattice points. Lattices in R n are not to be confused with the distributive lattices of sets treated in the preceding chapters ~ although the two notions are related, this relation will not be relevant in the present context. For the remainder of Chapter 8, the term lattice will refer to discrete additive subgroups of R n. In this section we ask the following question. If a convex body K is moved in R n by a random Euclidean motion, what is the expected number of lattice points to be found in K? In other words, what is the expectation of the random variable IgK n £1, where 9 is a random Euclidean motion? Using Theorem 8.1.1 we will show that this expected number is proportional to the volume of K. Let C denote the parallelotope C
= {alvl + ... +anvn
0:::; al,.' .,an :::; I}.
8.5 Lattice points and volume
113
The parallelotope G is called a fundamental domain for the lattice 'c. If A is a matrix whose ith column is given by the coordinates of the vector Vi (with respect to the standard orthonormal basis for R n), then the volume of G is given by J-tn(G) = Idet AI. Proposition 8.5.1 For any x E R n and any positive integer k, the translate kG + x of kG contains at least k n points of ,C and at most (k + l)n points of ,c. Proof For x = XlVI + ... + XnVn the set (kG + x) n,C consists of all vectors V = YI VI + ... + Yn Vn such that Xi ::; Yi ::; k + Xi. There are either k or k + 1 possible integer values for Yi within this interval, for each i E {I, ... , n}. Therefore kG + x contains between k n and (k + l)n points of ,c. 0
We consider next a simplified version of our original question. Suppose that a polyconvex set K is moved in Rn by a random translation x, resulting in a new body K + x. What is the expected number of lattice points in K + x? Theorem 8.5.2 Let K E Polycon(n), and let XK denote the number of points in the set (K + x) n,C, for a random translation x. Then the expectation E(XK) is given by E(X ) K
=
J-tn(K) J-tn(G) .
Proof To begin, note that for x E Rn the number of lattice points in K + x is given by J-to (( K + x) n £), where J-to is the Euler characteristic. In order to compute the expectation E(XK) we must average over all x E R n the function J-to(gK n ,c). This makes no sense, however, since such an integral would diverge. Nonetheless, because ,C is symmetric under the set of translations by points of 'c, it is sufficient to average over the set of translations by vectors x E G. Thus we have
so that E(XK) is a translation invariant monotonically increasing valuation in the parameter K. Evidently E(XK ) = 0 if K has dimension less than n, so the valuation E(XK) is also simple. It then follows from
114
8 A characterization theorem for volume
Theorem 8.1.1 that there exists
Ct
E
R such that (8.12)
for all K E Polycon(n). To compute the constant Ct, consider K k, Proposition 8.5.1 asserts that
= C.
For all positive integers
It then follows that
From (8.12) we then obtain
for all positive integers k. It follows that
Ct
=
1/J-tn (C).
o
We are now able to answer the question of how many lattice points are expected inside gK, for a random Euclidean motion g. Theorem 8.5.3 Let K E Polycon(n), and let X K denote the number of points in the set gK n £, for a random Euclidean motion g. Then the expectation E(XK) is given by E(X ) = J-tn(K) K J-tn(C)·
Proof To begin, note that, for 9 E En, the number of lattice points in gK is given by J-to(gK n C), where J-to is the Euler characteristic. In order to compute the expectation E(XK) we must average the function J-to(gKn£) over all motions g. Again this appears to make no sense, since the required integral would diverge. However, because £ is symmetric under the set of translations by points of £, it is sufficient to average over the set of motions En consisting of any rotation or reflection ¢ E O(n) followed by a translation by a vector x E C. Hence, we have E(XK)
=
k
J-to(gKn£)dg
En
r r J-to((¢K+x)n£)dxd¢,
JO(n) Jc
8.6 Remarks on Hilbert's third problem
115
where the integrals are taken with respect to the unique invariant probability measures on C and O(n). Since
1
cJto((¢K +x) nC)dx
=
Jtn(¢K) Jtn(C)
=
Jtn(K) Jtn(C)
by Theorem 8.5.2, it follows that
o
as well.
8.6 Remarks on Hilbert's third problem Let P be a polytope in R n. A dissection of P is an expression of P as a union
of polytopes PI, ... , Pm such that each intersection Pi nPj has dimension less than n for all i f j. Two polytopes P and Q in R n are said to be scissors congruent if there exist dissections of P and Q into a finite set of polytopes PI, P2 , ... , Pm and Ql , Q2, ... , Q m respectively, such that each Pi is congruent to Q i by some rigid motion of Rn; that is, such that for each i we have ~ = giQi for some rigid motion gi of Rn. It is clear that, if P and Q are scissors congruent, then they have the same volume: Jtn(P) = Jtn(Q). At the Paris International Congress of Mathematicians in 1900, David Hilbert posed the converse question (see [44]): if two polytopes P and Q in Rn have the same volume, are they necessarily scissors congruent? This question, along with subsequent variations, has come to be known as Hilbert's third problem. It can easily be shown that the answer is 'yes' for polytopes in the plane; that is, if n = 2 (see [81, p. 5]). For dimensions n ~ 3, however, a negative answer was given by Dehn in [19] only a year after Hilbert had posed the problem (see also [8, 81]). Dehn discovered a functional 1jJ on the set of polytopes in R n (for n ~ 3), known as the Dehn invariant, such that 1jJ(P) = 1jJ(Q) whenever P and Q are scissors congruent. He then exhibited two polytopes P and Q having the same volume such that 1jJ(P) f 1jJ(Q). In modern parlance the term 'Dehn invariant' actually refers to a family of scissors congruence invariant functionals that distinguish between
116
8 A characterization theorem for volume
some polytopes ofthe same volume. An example in R3 is constructed as follows. Let a = (arccos(I/3))/1f, and note that a is irrational (see, for example, [8, p. 102]). Recall again that R is a vector space of infinite dimension over the field Q of rational numbers. Let f : R -----t R be a Q-linear functional on R such that f(l) = 0 and f(a) = 1. (Such a function is possible because a is irrational.) For a compact convex polyhedron P in R3, let M I , ... , Mm denote the edges of P, and let (h, ... ,()m denote the outer dihedral angles between the facets adjacent to each corresponding edge. Define
With some effort one verifies that 'IjJ is invariant under scissors congruence. Let .6. denote the regular tetrahedron in R3 having unit volume. Since the dihedral angle between any two adjacent facets of .6. is given by arccos(I/3), we have 'IjJ(.6.) > O. Meanwhile, the unit cube C in R3 has outer dihedral angles 31f/2, so that 'IjJ( C) = O. It follows that C and .6. are not scissors congruent, in spite of having the same volume. A more general form of Dehn invariant is given in terms of tensor products (see also [81, p. 2]). Suppose n 2: 3, and let P be a compact convex polytope in Rn having (n - 2)-dimensional faces M I , ... , Mm. Let ()I,"" ()m denote the angles between the facets adjacent to each corresponding (n - 2)-face. Denote by R/Z the normalized circle group, and let R @Z R/Z denote the tensor product of the Abelian groups R and R/Z. Define a functional W : pn -----t R @Z R/Z by
The functional W is a Dehn invariant for all n 2: 3. Hilbert's original question has many variations. Let G be any group of rigid motions of Rn, for example, the group of translations, or some finite group of symmetries. We say that two polytopes P and Q are scissors congruent with respect to G if there exist dissections of P and Q into a finite set of polytopes PI, P2, ... , Pm and QI, Q2, ... , Qm respectively, so that each Pi is congruent to Qi by some motion in G; that is, so that for each i we have Pi = giQi for some gi E G. Once again it is clear that, if P and Q are scissors congruent with respect to G, then J-tn(P) = J-tn(Q). Once again one may pose the converse question. The answer remains in the negative for n 2: 3, but, for the case n = 2 (the Euclidean plane),
8.7 Notes
117
the answer is more interesting. In particular, suppose that G = T 2 , the set of all translations of the plane. In this case Hilbert's question has once again a negative answer, even in the plane! This follows from our construction (8.5) of the valuation TJ in Section 8.2. Recall that TJ was defined by
where K E K2 and ~ is the equilateral triangle of unit side length, centered at the origin. The valuation TJ is simple and translation invariant. It follows that, if P and Q are polygons in R2 that are scissors congruent with respect to T2 , then TJ(P) = TJ(Q). However, recall also that TJ(K) = 0 whenever K is a centered convex body (such as a rectangle), while TJ(~) -I- O. Let P = ~, and let Q be a square with the same area as P. Since TJ(P) -I- TJ(Q), the polygons P and Q cannot be scissors congruent with respect to the translation group T 2 . In view of Theorem 8.1.5, one might suspect that the answer to Hilbert's third problem is once again 'Yes' in dimension 2, provided that we allow the group G to contain both the translations of the plane and the reflection through the origin (corresponding to scalar multiplication by -1). Indeed, this was shown to be true by Hadwiger and Glur in [40]. A detailed treatment of the results of Hadwiger and Glur may also be found in [8, pp. 69-92] and [81]. The remarks in this section give only a hint of the beauty and complexity to be found in the study of polytopes, dissections, and congruences. For a more thorough treatment of this theory, see [8, 71, 81].
8.7 Notes Theorem 8.3.2 is due to Hadwiger, who gave a long and difficult proof in [39]. The more general Theorem 8.1.5 is due to Klain [50]. Schneider's characterization theorem appeared in [86]. Proposition 8.3.1 is due to Sah [81, pp. 16-17]. For a discussion of spherical harmonics and other background for Proposition 8.1.2, see [85, p. 184] and [33,34]. Theorem 8.1.5 leads in turn to a connection between continuous even valuations on compact convex sets and continuous functions on Grassmannians. This connection can be described in part by using generating distributions for symmetric compact convex sets. For a detailed discussion, see [54]. The question of how to characterize volume as a valuation on defined only on polytopes remains open (see [72]).
9 Hadwiger's characterization theorem
In Section 9.1 we use the volume Theorem 8.3.2 to complete our characterization of invariant valuations on polyconvex sets with one of the most beautiful and important theorems in geometric probability, Hadwiger's characterization theorem. We then use Hadwiger's theorem to derive simple proofs of numerous results in integral geometry and geometric probability. In Sections 9.3 and 9.4 we give cleaner re-statements of the intrinsic volume formulas of Chapter 6. Sections 9.2, 9.7, and 9.8 deal with the computation of intrinsic volumes in special cases. In Section 9.6 the mean projection formula of Section 9.4 is combined with the results of Section 8.5 to yield a generalization of the Buffon needle problem to spaces and planes of arbitrary finite dimension.
9.1 A proof of Hadwiger's characterization theorem The following result generalizes Theorem 4.2.5 to the lattice Polycon(n). As in the case for parallelotopes, this theorem is, in fact, equivalent to the associated volume Theorem 8.3.2.
Theorem 9.1.1 (Hadwiger's characterization theorem) The valuations Jto, JtI, ... , Jtn form a basis for the vector space of all convexcontinuous rigid motion invariant valuations defined on polyconvex sets in Rn.
Proof Let Jt be a convex-continuous invariant valuation on Polycon(n). Let H be a hyperplane in R n; that is, a linear variety of dimension n -1. The restriction of Jt to H is an invariant valuation on H. Proceeding by
9.1 A proof of Hadwiger's characterization theorem
119
induction, we may assume that n-l
jt(A)
=
L
Cijti(A),
i=O
for every polyconvex A
H. Thus, the valuation
<:;;;
n-l
jt -
L
Cijti
i=O
vanishes on all lower dimensional polyconvex sets of R n. This follows from the invariance of the valuations jti and from the fact that any lower dimensional polyconvex set is contained in the image of some rigid motion of the hyperplane H. By the volume Theorem 8.3.2, n-l
jt -
L
Cijti = Cnjtn,
i=O
where jtn is the volume on words,
Rn,
and where
Cn
is a real constant. In other
n
jt =
L
Cijti·
i=O
D
A valuation jt on Polycon(n) is said to be homogeneous of degree k > 0 if
for all K E Polycon(n) and all a
~
O.
Corollary 9.1.2 Let
jt be a convex-continuous rigid motion invariant valuation defined on Polycon(n) that is also homogeneous of degree k, for some 0 :s: k :s: n. Then there exists C E R such that jt(K) = cjtk(K) for all K E Polycon(n).
Proof By Theorem 9.1.1 there exist
Cl,""
n
jt
=L i=O
If P
=
[0, 1]n then, for a > 0,
Cijti·
Cn
E
R such that
120
9 Hadwiger's characterization theorem
Meanwhile,
Therefore,
Ci
= 0 if i =I-
k, and /-l
o
= Ck/-lk.
9.2 The intrinsic volumes of the unit ball As an application of Theorem 9.1.1, we compute the intrinsic volumes /-li(Bn) of the unit ball in Rn. We require the following facts about Minkowski sums. Let K, L be compact convex sets, and let a > O. Recall that the Minkowski sum K + aL is defined by
K
+ aL = {x + ay : x
E
K and y
E
L}.
Proposition 9.2.1 For K E Kn and any unit vector u ERn,
Here u denotes the straight line segment connecting the point u to the origin o. Proof Let L = K + EU. The volume /-In of L can be computed by integrating over the hyperplane u~ the length of each linear slice of L; that is,
where ex denotes the straight line parallel to u through the point x E u~. Since /-ll(L n ex) = /-ll(K n ex) + E for all x E Klu~, while L n ex = 0 if x ~ Klu~, we have
11-
/-ll (L
r
iKlu1-
/-In(K)
n ex) dx
/-ll(Knex)+Edx
+ E/-ln-l(Klu~). o
9.2 The intrinsic volumes of the unit ball
Let that
for
en denote the n-dimensional unit cube.
°: ;
121
Recall from Theorem 4.2.1
i ::; n.
Proposition 9.2.2 For
E ~
0,
Proof Let Ul, U2, ... , Un denote the standard orthonormal basis for Rn. Let Ui denote the line segment with endpoints at the origin 0 and the point Ui. From Proposition 9.2.1 we have
for all n ~ l. Suppose that the equation (9.1) holds for lower dimensions, and that
for some 1 ::; k < n. Then
+ EUI + ... + EUk+l) = J-ln(Bn + EUI + ... + EUk) + EJ-ln-l(Bn + EUI + ... + EUklut+l)
J-ln(Bn
=
L k
(k) i
.+
Wn-i E'
EJ-ln-l (Bn - l
+ EUI + ... + EUk)
,=0 =
t,
G)Wn-i Ei
+
t,
G)Wn-l-iEi+ l
9 Hadwiger's characterization theorem
122 Since
the equation (9.1) follows by induction to the step k
+ 1 = n.
Theorem 9.2.3 (Steiner's formula) For K E Kn and
E
0
2: 0,
n
J-ln(K
+ EEn) = LJ-li(K)wn_i En - i .
(9.2)
i=O
Proof Let 'fJ(K) = J-ln(K + En), for K E Kn. It follows from Proposition 8.2.1 that 'fJ is a continuous invariant valuation. Theorem 9.1.1 then implies the existence of constants co, ... ,Cn E R, such that n
'fJ(K) = L CiJ-li(K), i=O
for all K E Kn. Therefore, for
E
EnJ-ln
> 0, we have
(~K + En) = ~CiJ-li(K)~ En
n
L CiJ-li(K)E n- i .
(9.3)
i=O
Setting K
=
en and comparing equations (9.1) and
(9.3) we find that
o We are now able to compute the intrinsic volumes J-li(En).
Theorem 9.2.4 (The intrinsic volumes of the unit ball) For 0 S is n,
Proof Setting K
= En
in (9.2), we obtain
n
L J-li(En)wn_iEn-i i=O
L n
(
n) i
n
i
WnE - ,
.=0
for all E > O. The proposition then follows from a comparison of the coefficients of each En-i. 0
123
9.3 Crofton's formula
9.3 Crofton's formula We are now ready to compute the constants C k from Theorem 7.2.1. Specifically, we have the following remarkable result. Theorem 9.3.1 For 0:::; k:::; nand K E Kn,
J-ln-k(K)
=
Ak(Graff(K; k)).
(9.4)
In other words, C k = 1 for all n, k 2: O. We shall see that our judicious choice of normalization (6.1) for [n] leads in turn to this expression of the equation (9.4), free from normalizing constant factors. Once again, note that the equation (9.4) is asserted only for the case in which K is convex and not for arbitrary polyconvex sets. Proof Recall from Section 7.2 that
Ck Ak(Graff(K; k))
=
J-ln-k(K)
for all compact convex sets K. For the case K
=
CkAk(Graff(Bn;k)) = J-ln-k(Bn) = (n: k)
B n , we have
~:
= Wn-k [~l
Meanwhile,
r
r
r
rI
JGr(n,k) JVl JGr(n,k) JVl
J-lo(Bn n (Vo Bn _ k
+ p)) dpdvk(Vo)
dpdvk(Vo)
r
Wn-k dVk(Vo) JGr(n,k) wn-kvk(Gr(n, k)) Wn-k
[~l
Hence, we have
o
so that Ck = 1.
Theorem 9.3.1 enables us in turn to rewrite Hadwiger's formula (7.8):
J-ln-k(K)
=
r
JGraff(n,k)
J-lo(K n V) dAk(V)'
(9.5)
124
9 Hadwiger's characterization theorem
Unlike (9.4), the formula (9.5) is valid for all polyconvex sets K, although it reduces to (9.4) in the event that K is actually convex. In a similar vein, we obtain the following relation between intrinsic volumes of different degree, thereby generalizing Theorem 9.3.1.
Theorem 9.3.2 (Crofton's formula) For 0 Polycon(n),
!
/1j(K n V)
:s:
i,j
:s:
nand K
[i; j] /1i+j(K).
d).~_JV) =
E
(9.6)
Graff( n, n-i)
Note that, if j
= 0 and K
E
Kn, then the identity (9.6) reduces to (9.4).
Proof If i + j > n then both sides of (9.6) are zero. Suppose that + j :s: n. For K E Kn, define
i
ry( K)
!
=
/1j (K n V)
d).~-i (V).
Graff( n, n-i)
On applying (9.5) we then obtain
ry(K) =
!
!
/10 (K n V n W) d).~=i_ j (W) d).~_i (V),
Graff(n,n-i) Graff(V,n-i-j)
where Graff(V, n - i - j) denotes the space of linear varieties W ~ V having dimension n - i - j. Since each W ~ V, we can rewrite the previous integral as
r
r
/1o(K n W)
JGraff(n,n-i) JGraff(V,n-i-j)
= [oEGr(n,n-i)
[l
/1o(K n (Wo
!woEGr(VO,n-i-j)
d).~=Lj(W) d).~_i(V)
!Wlnvo
+ p + q)) dq dl/;:-=:_j (Wo) dpdl/~_i(VO)
= kr(n,n-i) kr(VO,n-i-j) [l !Wlnvo /1o(K n (Wo
+ p + q)) dqdpdl/;:-=Lj(Wo) dl/~_i(VO)
= kr(n,n-i) kr(VO,n-i-j) [ltB(Wlnvo) /1o(K n (Wo + v)) dvdv;:-=:_j(Wo) dl/~_i(VO)
9.4 The mean projection formula revisited =
krCn,n-i) krcvO,n-i-j) JwoJ.. J-lo(K n (wo
=
125
{
+ v)) dv dll~=Lj(Wo) dll~_i(VO)
{
(
}GrCn,n-i) }Grcvo,n-i-j) }wt
= {
(
}GrCn,n-i) }Grcvo,n-i-j)
IK1wJ..
dvdll~=:_j(WO) dll~_i(VO)
0
J-li+j(KIWl)
dll~=Lj(Wo) dll~_i(VO),
since dim(Wl) = i + j. Because J-li+j is a continuous valuation, homogeneous of degree i + j, the linearity of the integrals above implies that 'f/ is a continuous valuation on Kn, homogeneous of degree i + j. It then follows from the invariance of J-li+j and of the measures lI~=Lj and lI~_i that 'f/ is invariant. By Corollary 9.1.2, there exists c E R such that 'f/ = cJ-li+j' To compute the constant c, set K = En, the unit ball in Rn, and note that
J J J-li+j (En Iwl ) dll~=L j (Wo) dll~_i (Vo) GrCn,n-i) GrCvo,n-i-j) n .J [ n ~ i .J Wi+j , [n-z n-z-] while
cJ-li+j (En) =
C
[i : j JWi+j,
by Theorem 9.2.4. Hence,
o 9.4 The mean projection formula revisited The conclusion of Theorem 9.3.1 that C k = 1 for all 0 :::; k :::; n also allows us to re-state Theorem 7.4.1, thereby generalizing Cauchy's formula 5.5.2. Theorem 9.4.1 (The mean projection formula) For 0 :::; k < n and K E Kn,
J-lk(K) = ( J-lk(KlVo) dllk(Vo). }GrCn,k)
(9.7)
o
126
9 Hadwiger's characterization theorem
In other words, the kth intrinsic volume /-lk(K) of a compact convex subset K of any dimension in R n is equal to the integral of the k-volumes of the projections of K onto all k-dimensional subspaces of Rn. Recall that the intrinsic volumes J-tk are normalized; that is, /-lk(L) of an l-dimensional convex body L is the same regardless of the dimension n 2: l of the ambient space R n. The absence of any additional normalizing factor in (9.7) demonstrates once again the importance of making the correct choice of normalization for the Grassmannian measures v'k. The mean projection formula can also be expressed in probabilistic terms, using random variables. For a compact convex set K in R n let Xk(K) denote the k-volume of a projection of K onto a randomly chosen k-dimensional subspace V E Gr(n, k). The expectation E(Xk(K)) is computed by averaging over all projections; that is, by integrating over all subspaces with respect to the Haar probability measure on Gr(n, k), to wit, E(Xk(K))
=
r
/-lk(KIV) dV,
lGr(n,k)
where
r
dV
= 1.
lGr(n,k)
Integrating instead with respect to the measure v'k, we have E(Xk(K)) = [n k ] -1
r
/-lk(KIV) dv'k.
lGr(n,k)
Thus, we obtain the following version of Theorem 9.4.1. Corollary 9.4.2 For 0
:s: k :s: nand K
E JCn,
D
Hadwiger's Theorem 9.1.1 yields a simple proof of a yet more general form of Theorem 9.4.1. Theorem 9.4.3 (Kubota's theorem) For 0 :s: k
:s: l :s: nand K
E JCn,
9.4 The mean projection formula revisited
127
Proof Define a valuation 'f/ on Kn by
r
'f/(K) =
/1k(KIV) dvi(V).
lGr(n,l)
Evidently 'f/ is continuous, invariant, and homogeneous of degree k. It then follows from Corollary 9.1.2 that there exists c E R such that 'f/ = C/1k· To compute the constant c we consider the case K = En:
so that c
[l]! [k]![n - k]! [n]! [k]![l - k]! [n]! [l]![n -l]! [n-k]! [n-k] [n - l]![l - k]! - 1 - k .
o Kubota's theorem can also be viewed combinatorially. For 0 :::; k :::; 1 and K E KI, the mean projection formula (9.7) states that
J
/1k(K) =
/1k(KIW) dvk(W).
Gr(l,k)
Consequently,
J
/1k(KIV) dvi(V)
Gr(n,l)
J J
=
Gr(n,l) Gr(V,k)
Since each W S;; V, we have (KIV)IW
J Gr(n,l)
/1k((KIV)IW) dvk(W) dvi(V)·
/1k(KIV) dvi(V) =
= KIW,
J J
so that
/1k(KIW) dVk(W) dvi(v)·
Gr(n,l) Gr(V,k)
(9.8) How does this last integral in (9.8) compare with the ordinary mean projection formula (9.7)? The difference is that in (9.8) we are 'counting' a k-subspace W E Gr(n, k) once for each l-subspace V E Gr(n, l)
128
9 Hadwiger's characterization theorem
containing W. Ideally, our combinatorial intuition would suggest that counting Z-subspaces that contain W E Gr(n, k) is the same as counting the (Z- k)-subspaces of the quotient space RnjW ~ Rn-k, of which there are
k] [nZ- k . From this observation it would then follow that
r
r
JGr(n,l) JGr(V,k)
= [nz ~kk]
!1k(KIW) dVk(W) dvz(V)
r
JGr(n,k)
!1k(KIW) dvk'(W),
which is precisely what Kubota's theorem 9.4.3 asserts.
9.5 Mean cross-sectional volume
A k-dimensional cross-section of a compact convex set K is a subset of the form KnV, where V E Graff(K; k). Let Yk(K) denote the k-volume of a randomly chosen k-dimensional cross-section of K. The mean kcross-sectional volume of K is given by the expectation E(Yk(K)), which is computed by averaging over all V E Graff(K; k). In other words, E(Yk(K))
=
An(G k ra
~(K-, k)) JGraff(n,k) r !1k(K n V) dAk(V)'
By Theorem 9.3.1 we have Ak(Graff(K; k)) = !1n-k(K), so that
r r
JGraff(n,k)
!1k(K n V) dAk(V)
f
JGr(n,k) Jv.L
!1k(K n (V
+ x)) dx dvk'(v).
Recall from elementary calculus that
r
Jv.L
!1k(K
n (V + x)) dx
= !1n(K),
for each V E Gr(n, k). It follows that
r
JGr(n,k) !1n(K)
!1n(K) dvk'(v)
[~l
129
9.6 The Buffon needle problem revisited
The mean k-cross-sectional volume of K is then given by
E(Yk(K))
=
Mn(K) [n]. Mn-k(K) k
A one-dimensional cross-section is sometimes called a chord. Recall that Mn-l(K) = (1j2)S(K), where S(K) is the surface area of K. The expected length of a random chord in K is now given by
E(Yl(K)) = Mn(K) [n] = 2[nl Mn (K). Mn-l(K) 1 S(K) In particular, the expected length of a random chord in the unit n-ball B is
9.6 The Buffon needle problem revisited We are now ready to generalize the Buffon needle problem to n-dimensional spaces. Suppose that V E Gr( n, k) and that {Vl, ... ,vn-d is a basis for V J... Let V denote the collection V = {V
+ alVl + ... + an-kVn-k : al, ... , an-k
E
Z}
of k- planes in R n. If a compact convex set K is moved randomly in R n , what is the expected number of intersections of K with the collection V? In other words, if we denote by X K the number of connected components of gK n V for a random Euclidean motion g, what is E(XK)? Let 12 denote the lattice in V J.. given by
and let C denote a fundamental domain of C. Once again the concept of random motion is meaningful, since the symmetry of the collection V allows that we consider only motions involving translations by vectors in the fundamental domain C. Therefore, the collection of motions under consideration forms a compact Lie group, on which there exists a unique Haar probability measure. Consider the set ¢K + x, where ¢ is a rotation and x is a vector. For a fixed ¢ E O( n), the number of intersections of ¢K + x with V is equal to the number of elements of (¢K + x) IV J.. n C. Hence, the expected number of intersections of ¢K + x with V is equal to
Mn-k((¢K + x)IVJ..) Mn-k(C)
Mn-k( (¢K) IV J..) Mn-k(C)
(9.9)
130
9 Hadwiger's characterization theorem
by Theorem 8.5.2. Therefore, the expected number of intersections of >K +x with V over all x and all > is equal to the expected value of (9.9) over all orthogonal transformations >. We now apply Corollary 9.4.2 to obtain E(XK)
= [ n ] -1 !1n-k(K). n- k
!1n-k(C)
As an example, consider the case in which K is a needle of length L in R 2 and V is a collection of lines evenly spaced by a distance d. In this case £ is a one-dimensional lattice of points on the line V J.., evenly spaced by a distance d. Since
[2]
-1
=
1
(2)
-1 WIW1
1
W2
= ~, 7f
it follows that E(X )
= ~!11(needle) = 2L
K
d
7f
7fd'
which agrees with Buffon's original solution.
9.7 Intrinsic volumes on products Next we use Hadwiger's characterization Theorem 9.1.1 to examine how the intrinsic volumes !1i evaluate on orthogonal Cartesian products. We have the following generalization of Proposition 4.2.3.
Theorem 9.7.1 Let 0 ::; k ::; n, and suppose that K ~ Rk and L ~ Rn-k are polyconvex sets. Then !1i(K x L)
=
L
!1r(K)!1s(L).
(9.10)
r+s=i
Proof Evidently the set function !1i(K x L) is a continuous valuation in each of the variables K and L when the other is held fixed. Note that each motion > E Ek of Rk is the restriction of a motion E En that restricts to the identity on the complementary space Rn-k. Therefore,
In other words, the set function !1i(K x L) is a continuous invariant valuation in each variable. By an iteration of Theorem 9.1.1 in each
131
9.7 Intrinsic volumes on products
variable, there exist constants Crs E R such that k n-k
Jti(K x L) =
LL
crsJtr(K)Jts(L)
r=Os=O
for all K E Kk and L E Kn-k. Let C m denote the unit m-dimensional cube. For a, f3 2': 0, k n-k
L L crsJtr(Ck)Jts(Cn-k)ar f3s r=O
s=O
Meanwhile,
by Proposition 4.2.3. Therefore, for 0 ~ r ~ k and 0 ~ s have Crs = 1 if r + s = i and Crs = 0 otherwise. Hence, k n-k
Jti(K x L)
=L
L
r=Os=O
crsJtr(K)Jts(L)
=
L
~
n - k, we
Jtr(K)Jts(L).
r+s=i D
Corollary 9.7.2 Suppose that Jt is a convex-continuous invariant valuation on Polycon( n) such that
(9.11) for all K ~ Rk and L ~ Rn-k, where 0 ~ k ~ n. Then either Jt there exists C E R such that
=0
or
(9.12) Conversely, if Jt is a valuation of the form (9.12) then Jt also satisfies the multiplicative rule (9.11). Proof By Hadwiger's Theorem 9.1.1, there exist constants that Jt = coJto
+ clJtl + c2Jt2 + ... + cnJtn·
Ci
E R such
132
9 Hadwiger's characterization theorem
For k E {O, 1, ... ,n} denote by C k the unit k-cube in a, /3 2: 0, the condition (9.11) implies that
Rk.
Then, for all
k n-k L L crCsJ-tr(Ck)J-ts(Cn-k)ar/3s. r=Os=O Meanwhile, by Theorem 9.7.1, n
LCiJ-ti(aCk x /3Cn- k) i=O n
L J-tr(Ck)J-ts(Cn_k)a r/3s r+s=i k n-k L L Cr+sJ-tr (Ck)J-ts (Cn_k)a r /3s. r=Os=O L
Ci
i=O
Therefore, cr+s = CrC s for all 0 ::; r, s ::; n. In particular, Co = Co+o = c6, so that Co = 0 or Co = 1. If Co = 0 then Cr = cr+o = CrCo = 0, so that J-t = O. If Co = 1 then relabel C = Cl. For r > 0, we then have Cr = cHH .. +l = cl = c r , from which (9.12) follows. The converse follows from Theorem 9.7.1 by means of a similar argument. 0 We conclude this chapter with another example of a convex-continuous invariant valuation. Choose a real-valued non-negative continuous function f(t) of a non-negative variable t, such that f(t) is decreasing sufficiently fast to 0 as t -+ 00. For any n, and for any non-empty compact convex set K in R n, set
J-t(f; K)
=
r
JRn
f(d(p, K)) dp
where d(p, K) is the distance from the point p to the set K, and where dp denotes the ordinary n-dimensional Lebesgue measure. We show that J-t satisfies the inclusion--exclusion principle; that is,
J-t(f; Kl U ... U Km)
=
L J-t(f; K i ) - L J-t(f; Ki n K j ) i<j
+ ...
(9.13)
9.7 Intrinsic volumes on products
133
whenever K 1 ,K2 , ... , Km and Kl U··· UKm are convex. We begin with a direct proof of the case m = 2. Let p ERn. Since Kl and K2 are compact, there exist unique points ql E Kl and q2 E K2 such that d(p, qi) = d(p, Ki). We assume that Kl U K2 is convex, so that Kl n K2 of- 0. Evidently, d(p, Kl U K 2) = min{d(p,K1 ),d(p,K2)}, and d(p,K 1 n K 2) 2: max{d(p,K1 ),d(p,K2)}. Since Kl U K2 is convex, the line segment I connecting ql and q2 must intersect Kl nK2 at some point q'. Either d(p, q') :::; d(p, qd or d(p, q') :::; d(p, q2). Therefore,
n K 2) :::; d(p, q') :::; max{ d(p, K 1 ), d(p, K 2)}
d(p, Kl
as well, so that d(p, K 1 nK2)
=
max{d(p, K 1 ), d(p, K 2)}. We then obtain
+ f(d(p, Kl n K 2)) = f(min{ d(p, K 1 ), d(p, K 2 )}) + f(max{ d(p, Kd, d(p, K 2)}) = f(d(p,Kd) + f(d(p,K2 )),
f(d(p, Kl U K 2))
for all p ERn. Hence,
Since f is a continuous function, it follows easily from the definition of the Hausdorff topology that /-l is a continuous function of K. The general case of (9.13) then follows from Theorems 5.1.1 and 2.2.1. Thus we can extend /-lU; K) to an invariant valuation defined on all polyconvex sets K. The expression of /-lU; K) as a linear combination of the intrinsic volumes (given by Theorem 9.1.1) exhibits the moments
mjU) of the function
=
1=
x j f(x) dx
f and has the form n
/-lU; K) = f(O)/-ln(K)
+L
Cimi-l (f)/-ln-i(K),
i=l
where the constants Ci are independent of the function f. To understand this, and to compute the Ci, consider the case of the n-dimensional ball aB of radius a, centered at the origin. From the definition, we have /-lU; aB)
=
(
JRn
f(d(p, aB)) dp
9 Hadwiger's characterization theorem
134
1B f(O) dp + JRn-aB f(lpl - a) dp f(O)P,n(aB)
+
1 1
00
uESn-l
f(r - a)r n- 1 drdu
'"
f(O)P,n(aB)
+ nWn
f(O)P,n(aB)
+ nwn
1 1
f(O)P,n(aB)
+ nWn
I: ~ 1) 1
00
f(r - a)r n- 1 dr
00
f(y)(y
00
(n
j=O
f(O)P,n(aB)
+ a)n-l dy
J
f(y)yja n- 1 - j dy
0
.
~(n-1) + nWn ~ i _ 1 an-'mi_l(J)
1)
nWnmi-l (J) () () ( ) ~ (n f 0 P,n aB + ~ i _ 1 P,n-i(B) P,n-i aB . Once again Wn denotes the volume of the unit ball B, so that the surface area of the unit sphere sn-l is given by nWn- It follows that Ci =
(n
-1) _n_w-:-:::n::-;P,n-i(B)
=
i-1
(n -l)nwn ( n )-l_Wi i-1 n-i Wn
=
iWi,
so that n
p,(J; K)
=
f(O)P,n(K)
+ 2: iWimi-l(J)P,n-i(K).
(9.14)
i=l
A notable special case is obtained by setting f(x) = e-7rX 2 • The resulting valuation is called the Wills functional and is denoted by W(K).
Theorem 9.7.3 If K ~ V and L ~
vJ.. for some linear variety V,
W(K x L) = W(K)W(L).
Proof The moment
mi-l
of f(x)
=
e- 7rX2 is given by
then
9.8 Computing the intrinsic volumes
On substituting u
=
mi-I
= 7rX 2 , we
1
00
1
27ri/2
0
135
have
i - I -u U 2 e
du
=
1
27ri/2
(i)
r "2 =
1 iWi·
On substituting into (9.14), we have n
fLU; K) = fLn(K)
+L
n
L fLn-i(K).
iWimi-1 U)fLn-i(K) =
i=1
(9.15)
i=O
It then follows from Corollary 9.7.2 that W(K x L) = W(K)W(L).
o From Theorem 9.7.3 we obtain the following characterization for the Wills functional W. Once again let Cn denote the unit n-cube in Rn.
Corollary 9.1.4 Suppose that fL is a convex-continuous invariant valuation on Polycon( n) such that fL(K x L)
= fL(K)fL(L),
for all K ~ Rk and L ~ Rn-k, where 0 :::; k :::; n. If fL(C I ) = 2 then fL(K) = W(K) for all K E Polycon(n). Proof Since fL(Cd
i-
0, Corollary 9.7.2 implies the existence of some
c E R such that n
fL
= fLo + CfLI + C2fL2 + ... + CnfLn =
L CifLi. i=O
Hence, n
2
= fL(CI ) = L
cifLi(Cd
= fLo(Cd + CfLI(CI ) = 1 + c,
i=O
so that
C
= 1.
It then follows from (9.15) that fL
= W.
o
9.8 Computing the intrinsic volumes The actual computation of the intrinsic volumes of a polyconvex set is difficult in general. It is difficult even for (ordinary) volume, and also for the surface area, which is given by 2fLn-l. It can be difficult even in low dimensions: consider the perimeter of an ellipse, for example. However, we are already able to compute the intrinsic volumes of a large class of compact convex sets. Theorem 4.2.1 gives a formula for the kth intrinsic volume of an orthogonal parallelotope in Rn, and
136
9 Hadwiger's characterization theorem
Theorem 9.2.4 gives the intrinsic volumes of the unit ball. Theorem 9.7.1 gives a formula for computing the intrinsic volume of an orthogonal Cartesian product of two polyconvex sets whose intrinsic volumes are already known. In this section we work with two additional classes of polyconvex sets. First, we compute the intrinsic volumes of an arbitrary (possibly nonorthogonal) parallelotope. We then comment on the intrinsic volumes of an arbitrary convex polytope and give an explicit formula for the first intrinsic volume f.-LI of a convex polytope in R3 in terms of the lengths of its edges and the angles between the outer normals to its adjacent facets. In Section 4.2 we proved Theorem 4.2.4, which characterized volume on Par(n). Recall that Par(n) denoted the lattice of finite unions of orthogonal parallelotopes having edges parallel to the coordinate axes of Rn. A similar theorem holds for more general parallelotopes. Let VI, ... , Vn be a basis for Rn, and let Par( VI, ... , v n ) denote the lattice of finite unions of parallelotopes with edges parallel to the vectors Vi. The characterization theorem for volume in Par(n), Theorem 4.2.4, generalizes immediately to the lattice Par (VI, ... , v n ). Theorem 9.8.1 (The volume theorem for Par(VI, ... , v n ) Let f.-L be a translation invariant simple valuation defined on Par( VI, ... , vn ), and suppose that f.-L is either continuous or monotone. Then there exists C E R such that f.-L(P) = Cf.-Ln(P) for all P E Par(vI, ... , v n ); that is, f.-L is equal to the volume, up to a constant factor.
Proof For each i, let Vi denote the line segment having endpoints at Vi and the origin o. The proof of Theorem 9.S.1 is the same as the proof of Theorem 4.2.4, provided that we replace the unit cube [0,1]n with the 'unit parallelotope' C =VI
+ ... +vn
of Par(vI, ... , v n ). It then follows that f.-L
= Cf.-Ln, where C = f.-L(C)/ f.-Ln(C),
o Theorem 9.S.1leads in turn to a formula for the kth intrinsic volume of an arbitrary parallelotope. Note that if P E Par(vI, ... , v n ) is a parallelotope, then P is a translate of a parallelotope of the form
for some al, ... , an ;::: O.
9.8 Computing the intrinsic volumes Theorem 9.8.2 For
all
1 :::; k :::;
n and all al,"" an
137
2: 0,
The value of each term /-tk (ail ViI + ... + aik Vik) is easily computed using elementary linear algebra. If A is a k x n matrix whose jth row is given by the coordinates of the vector aij Vij' for j = 1, ... , k, then /-tk(ai l ViI
+ ... + aik Vik)
=
Jdet (AAT).
See, for example, [94, p. 234]. Proof of Theorem 9.8.2 Define
and extend Vk to a valuation on all ofPar(vl, ... , v n ) with Theorem 4.1.3. Let'TJ = /-tk-Vk. We will prove that 'TJ(P) = 0, for all P E Par(vI, ... , vn ). Since both /-tk and Vk vanish in dimensions less than k, it follows that 'TJ vanishes in dimensions less than k as well. If we restrict 'TJ to the k-plane Vil, ... ,ik spanned by the (independent) vectors Vip •.• ,Vik' then 'TJ becomes a continuous translation invariant simple valuation on Par(vi l , ... , Vik)' It follows from Theorem 9.8.1 (in dimension k) that there exists c E R such that 'TJ( P) = C/-tk (P) for all P E Par( ViI' ... , Vik)' However,
by the definitions of 'TJ and Vk. Therefore C = 0, and 'TJ vanishes in dimension k. In other words, if we restrict 'TJ to a k + I-plane Vil, ... ,ik+l spanned by the vectors ViI' ... , Vik+l' then we have once again a continuous translation invariant simple valuation, this time on a lattice of parallelotopes in a space of dimension k + 1. Theorem 9.8.1 again applies, and there exists C E R such that 'TJ( P) = C/-tk+ I (P) for all P E Par( ViI' ... , Vik+J. However, 'TJ is homogeneous of degree k, whereas /-tk+l is homogeneous of degree k + 1. Hence, we have c = 0, and 'TJ vanishes in dimension k + 1. Continuing this argument in each higher dimension we conclude that 'TJ(P) = 0 for P of any dimension in Par(vI, ... , v n ). 0
138
9 Hadwiger's characterization theorem
Computing the intrinsic volumes of arbitrary polytopes is more difficult. Recall from Steiner's formula 9.2.3 that, for all r ~ 0, n
/-tn(P
+ rB)
=
L /-ti(P)wn_irn-i,
(9.16)
i=O
where B denotes the unit ball in R n. Let us consider the convex set P + r B more carefully. If x E P + r B, then there exists a unique point Xp E P such that
Ix-yl,
Ix-xpl S;
for all YEP. This follows from the fact that P is compact and convex. If x E P, then evidently x = x p. If x ~ P, then x p lies on the boundary OP of P. Moreover, if x ~ P and y E OP, then y = Xp if and only if x - y.l H, where H is a support plane of P and yEP n H. Let Pi(r) denote the set of all x E P + rB such that Xp lies in the relative interior of an i-face of P. So, for example, Pn(r) = int(P). Evidently n
P+rB= UPi(r),
(9.17)
i=O
a disjoint union. Denote by Fi(P) the set of all i-dimensional faces of P. For each proper face Q of P denote by M (Q, r) the set M(Q,r) = {y+O'v : 0::; 8::; r},
where y E relint( Q) and v is any outward unit normal to oP at the point y. See Figure 9.l. It turns out that, for 0 ::; i < n, Pi(r)
=
U
M(Q, r).
QEFi(P)
To see this, suppose that x E Pi(r). Then Xp E relint(Q) for some i-face Q of P. If x E relint(Q), then x E M(Q, r). If x ~ relint(Q), then x of- Xp. Let v = (x - xp)/Ix - xpl. Then v .1 H for some support plane H of P at x p, and x = x p + O'v for 0 < 8 = Ix - x pi::; r. In other words, x E M(Q, r). Conversely, suppose that x = y + 8v E M (Q, r) for some i-face Q. If 8 = 0 then x E relint(Q). Otherwise, we have x - y .1 H for some support plane H at the point y E relint(Q). It follows that y = Xp and x E Pi(r).
9.8 Computing the intrinsic volumes
139
M({x},r)
Fig. 9.1. The Decomposition of P
+ rB.
Given a subset A of Rn, the affine hull of A is the intersection of all planes in Rn (not necessarily through the origin) containing A. Let A~ denote the set of vectors in Rn orthogonal to the affine hull of A. If Q is an i-face then Q~ has dimension n - i, so that f-tn(M(Q, r)) = rn-if-tn(M( Q, 1)) It follows that f-tn(Pi(r)) = r n- i f-tn(Pi (l)). From (9.17) we then obtain n
f-tn(P + rB) = Lf-tn(Pi(l))r n- i ,
(9.18)
i=O
for all r
> O. By comparing (9.16) and (9.18) we find that f-ti(P)
=
f-t n(Pi (l)). Wn-i
(9.19)
For example, consider a convex polyhedron P in R3 having edges Each edge Zi is formed by the meeting of two facets Qil and Qi2 of P, having outward unit normals Uil and Ui2 respectively. Therefore, the volume of M(Zi' 1) is given by
z'l, ... ,ZTn'
f-t3 (
M(-. 1)) _ Z"
-
f-tl
(_.)arccos(uil 'Ui,) _ f-tl(Zi)(}i 2
Z,
-
2
'
where (}i is the angle between the outer normals to the facets Qil and 7f, it follows that
Qi 2 • Since W3-1 = W2 =
1 Tn
f-tl(P) =
'2 Lf-t1(Zi)(}i. 7f i=l
In other words, the first intrinsic volume of a polyhedron P in R3 is given
140
9 Hadwiger's characterization theorem
by 1/(21f) multiplied by the sum over all edges of P of the product ofthe length of each edge and the corresponding angle between the normals to the facets adjacent to that edge. Suppose that P is an orthogonal parallelotope in R3 having dimensions al x a2 x a3. In this case P has 12 edges, four of length ai for each i = 1,2,3, while the angles (h are all equal to 1f /2. We then obtain 1
/-tl(P)
=
21f
3
L 4ai~ = al + a2 + a3, ,=1
as asserted by Theorem 4.2.1. Similarly, it is not difficult to show that if Ts is a regular tetrahedron in R 3 of edge length s, then
formulas for /-tl (under a slightly different normalization) evaluated on a variety of common solids in R3 are described by Hadwiger in [38, pp. 36-37J. 9.9 Notes
For the original proof of Hadwiger's characterization Theorem 9.1.1, see [39J. In addition to providing straightforward verification of many results in integral geometry, Theorem 9.1.1 also provides a connection between rigid motion invariant set functions and symmetric polynomials [13J. The normalization Ak of the invariant measures on Graff(n, k) which we have used in the text is natural for Roo; it leads to a good definition of the intrinsic volumes in Hilbert space. The invariant measure under O(n) in Gr(n, k), corresponding to the measure Ak on Graff(n, k) under En, is the Gauss-Wiener measure (as can be easily verified). The article by Schanuel [83J provides an interesting and accessible perspective on Steiner's formula and the first intrinsic volume, or generalized length, /-tl, for compact polyhedra in R3. See also [38, pp. 36-37J for extensive computations of /-tl in R3. Steiner's formula (Theorem 9.2.3) is a special case of the polynomial formula for mixed volumes. For K l , K 2, ... , Km E Kn, and real numbers AI, A2' ... ' Am > 0, the Minkowski linear combination K = AlKl + A2K2 + ... + AmKm is the convex body consisting of the set
The basis of the theory of mixed volumes is the polylinearization of
141
9.9 Notes
volume with respect to Minkowski linear combinations. If Kl' ... ' Km E IC n and AI, ... , Am > 0, then the volume /-tn is a homogeneous polynomial in the positive variables AI, ... , Am; that is, m
/-tn(AlKl
+ ... + AmKm) =
L
V(Ki1 , ... , KiJAil ... Ai n , (9.20)
where each symmetric coefficient V(Kil' .. . , KiJ depends only on the bodies K i1 , ... , Kin. Given Kl, ... ,Kn E IC n , the coefficient V(Kl, ... ,Kn) is called the mixed volume of the convex bodies Kl' ... ' Kn. It is well known, but not trivial, that the mixed volume V(Kl' ... ' Kn) is a non-negative continuous function in n variables on the set IC n , symmetric in the variables K i , and monotonic with respect to the subset partial ordering on IC n . Moreover, the mixed volume V(Kl' ... ' Kn) is a valuation in each of its parameters K i , provided that the other parameters are held fixed. The mixed volumes also satisfy several useful inequalities, such as the Brunn-Minkowski and Aleksandrov-Fenchel inequalities. The intrinsic volumes /-ti studied in the present work are normalizations of the mixed volume of a convex body K with the unit ball Bn. Specifically,
/-ti(K)
=
(~) _l_V(K, ... , K'fJn'.~.' B n), Z Wn-i '---v----' i
n-i
for all K E IC n . For a thorough treatment of the theory of mixed volumes, see [85]. Theorem 9.3.2 is originally due to Crofton [14, 15, 16] (see also [85, p. 235]). Crofton's formula generalizes the ideas of Barbier presented in Chapter 1. Theorem 9.4.3 was proved by Kubota in [58]; see also [85, p. 295]. Numerous variations of the formulas of Crofton and Kubota can also be found in [1, 2, 82, 99]. Wills first defined the functional W(K) in the context of lattice point enumeration [101]. For a discussion of the Wills functional and its applications, see also [85, p. 302] and [96]. There exist many important functionals on IC n that are invariant with respect to rigid motions, but that elude Hadwiger's characterization. One striking example, pointed out by Hadwiger in [38, p. 44], is the affine surface area, denoted n. Although originally defined only for polytopes and smooth convex bodies [85, p. 419], the affine surface area was later extended to an affine invariant functional on all of IC n (see, for example, [20,59,60,61,66, 89, 100]). Extended affine surface area was
142
9 Hadwiger's characterization theorem
shown to be a valuation by Schlitt [88]. It turns out, however, that affine surface area cannot be expressed as a linear combination of intrinsic volumes, the reason being that it is not continuous, but only upper semicontinuous, on Kn. Instead, the affine surface area satisfies a very strong invariance property: If K E Kn, then n(¢K) = n(K), for all affine transformations ¢; that is, for all translations and linear transformations having unit determinant. The affine surface area also satisfies a number of useful inequalities [67]. The question of how to characterize the affine surface area as a valuation possessing certain properties (in the style of Hadwiger's characterization theorem) remains open. Another interesting open problem of convex set theory is the problem of classification of invariant set functions that are convex-continuous and of polynomial type. For example, a bivaluation J.l(K, L), defined for polyconvex sets K and L, is a real-valued set function that is a valuation in either variable, provided that the other variable is held fixed. In a similar vein one defines a trivaluation and more generally an n-valuation. If J.l is an n-valuation, set (for K convex)
v(K) = J.l(K, K, ... ,K). The set function v is called a homogeneous polynomial set function of degree n. A finite sum of homogeneous polynomial set functions is called a set function of polynomial type. It would be interesting to give an 'intrinsic' definition of set functions of polynomial type. It is tempting to conjecture that the classification of invariant set functions of polynomial type is related to symmetric functions. For example, if f(XI,"" xn) is any homogeneous symmetric polynomial, and if one sets v(P) = f(XI,'" ,x n ) for any parallelotope with side lengths Xl, ... , X n , one can extend v to all polyconvex sets by using the expression of f in terms of elementary symmetric functions and then replacing each occurrence of an elementary symmetric function by the corresponding intrinsic volume. Does one get all invariant set functions of polynomial type in this way? (See [13].) Hadwiger's characterization Theorem 9.1.1 and Theorem 9.3.1 together suggest yet another striking interpretation in the context of simplicial complexes. Recall that elements of the distributive lattice L(8) of Chapter 3 consist of finite unions of simplices, a simplex being a collection of subsets of a finite set 8 which is closed under the taking of subsets; i.e. an order ideal with respect to subset inclusion. The distributive lattice Polycon(n) admits a partial analogy to this notion. Loosely speaking, we can think of Polycon(n) as a collection of
9.9 Notes
143
simplicial complexes, such that the simplices are the compact convex sets in R n, and the faces of such a simplex (a convex set) are its intersections with lower dimensional planes; i.e. the k-faces of an n-dimensional body K would be the sets K n V for V E Graff(n, n - k). Note, however, the codimensional duality. More precisely, define a filter F of a partially ordered set P to be a subset of P such that if x E F and y 2: x then y E F. In other words, a filter of P is an order ideal of the dual P* of P (in which all of the order relations in P are reversed). In dual analogy to order ideals, the set of minimal elements of a filter F is an antichain in P, and we can view the filters of P as simplicial complexes, such that a simplex is now defined to be a filter with a unique minimal element. As with order ideals, the collection of all filters in P forms a distributive lattice. Now consider the case P = Aff(n), ordered by subset inclusion. For V E Aff( n), denote by V the filter consisting of all linear varieties in Rn containing V; that is, V = {W E Aff(n) : V ~ W}. One may view V as a simplex in the distributive lattice of filters in Aff(n). Note that, although this lattice is also graded from 0 to n, this grading is dual to that of Aff(n). In other words, the k-faces of V are the (n - k)dimensional linear varieties in V. If K E Polycon(n) and VnK =I=- 0, then WnK =I=- 0 for all W E V. This motivates us to define k = {V E Aff(n) : V n K =I=- 0}. Evidently, k is a filter in Aff(n). Consequently one may view k as a simplicial complex in the lattice of filters of Aff( n), and it is then natural to ask how the lattice Polycon(n) relates to the lattice of filters of Aff(n). (Is it a quotient lattice?) Viewed in this light, the intrinsic volumes J.tk become the polyconvex analogues of the face enumerators (also denoted J.tk) of Section 3.2. This analogy is buttressed by the formula (9.4), which states that the intrinsic volume J.tk is, in some sense, an enumerator (measure) of the 'k-faces' of a 'simplex' (convex set) Kin Polycon(n). In a similar spirit, one may view the discrete basis Theorem 3.2.4 and Hadwiger's characterization Theorem 9.1.1 as entirely analogous classifications of invariant valuations on the discrete and polyconvex lattices, respectively. The analogy is pressed further by the existence of Helly-type theorems (Theorem 3.3.1 and Theorem 5.3.3) in both contexts. Such remarkable similarities suggest the possibility of additional ties between the simplicial and order structures of these two lattices. For example, do there exist polyconvex analogues of the binomial coeffi.-
144
9 Hadwiger's characterization theorem
cients, and if so, how are these analogues related to the flag coefficients of Chapter 6? Similar observations may be made concerning the lattice Mod(n). Although the discrete basis Theorem 3.2.4, Theorem 4.2.5 and Hadwiger's characterization Theorem 9.1.1 characterize all suitably invariant valuations on the lattices L(8), Par(n), and Polycon(n) respectively, there remains the question of how to characterize invariant valuations on simplices in Mod(n), as defined in Section 6.4. Indeed the success of the lattice analogy for Mod(n) in Chapter 6 suggests the possibility of a comprehensive theory of simplicial complexes in Mod(n) that has yet to be developed. The extension of the intrinsic volumes from elementary symmetric functions on parallelotope edge lengths (see Theorem 4.2.1) to invariant valuations on compact convex sets motivates the following question regarding total positivity. Let A be an n x n matrix of real numbers, and denote by Aij the entry of A in the ith row and jth column. The matrix A is said to be totally positive if det A : : : 0 for all minors A of A. See, for example, [47]. For K E Kn, denote by A(K) the matrix with entries Aij(K) = J.lj-i(K) if j - i ::::: 0; otherwise let Aij(K) = O. An interesting open question is that of whether the matrix A(K) is totally positive. For example, if we let K denote an orthogonal parallelotope with dimensions al x ... x an, then the total positivity condition on the 2 x 2 minors of A(K) is equivalent to Newton's inequality (see [69, p. 106] or [41, pp. 51-55]) for the elementary symmetric functions el, ... , en of the variables ai:
For general compact convex sets K, it may be necessary to replace the intrinsic volumes by a renormalization of the J.li in order to obtain total positivity for the matrix A(K). Among related open questions, the problem of syzygies seems particularly difficult. Let AI,"" Aj be compact convex sets, and let l~~) be the indicator functions of the set Graff( A; k) on Graff( n, k). The problem is to find a basis for all linear relations
for all k ::::: 0 (so that l~ = fA). Even for k = 0, where l~k) = lA, the problem is not entirely trivial (though it has been solved fully). The
9.9 Notes
145
restricted inclusion-exclusion principle for convex sets yields some such identities, but there are probably some more special identities for each k. No doubt there exist additional lattices, for which these questions may be asked and answered. For example, many results known for the lattice Polycon(n) (such as Hadwiger's characterization theorem) remain open questions for the ostensibly simpler lattice pn of polytopes in Rn, as well as for the lattice of polyconvex subsets of the sphere sn (see also Chapter 11). For a survey of open questions in this area, see [71, 72]. Another lattice sharing many qualities with Polycon(n) is the lattice of star-shaped sets in R n . A set A ~ R n is said to be star-shaped, if o E A, and if for each line £ passing through the origin in R n, the set A n £ is a closed interval. Advances have recently been made in the development of a comprehensive, though far from complete, theory of star-shaped sets, a theory dual to the theory of convex sets developed in the present text. In the dual theory, convex bodies are replaced by star-shaped sets, and projections onto subspaces are replaced by intersections with subspaces. In [64] Lutwak introduced dual mixed volumes, in analogy to the classical mixed volumes of Minkowski (see also [65]). Dual (star-shaped set) analogues also exist for the intrinsic volumes, the mean projection formula (replaced in the dual theory by a mean intersection formula; see [65]), kinematic formulas [103], and Hadwiger's characterization theorem [51, 52]. A comprehensive introduction to the theory of star-shaped sets and geometric tomography has been presented in Gardner's book [26]. Finally, one can ask whether an analogous lattice theory is possible for the collection of finite unions of compact convex subsets of the Grassmannians Gr(n, k), for which the notion of 'convex set' is suitably defined. Indeed, one may go even further, to the study of polyconvex subsets of compact Lie groups, about which almost nothing is known at present. A characterization theorem in the spirit of Hadwiger's characterization theorem for invariant valuations on compact Lie groups would have profound consequences throughout mathematics.
10 Kinematic formulas for polyconvex sets
In this chapter we use Hadwiger's characterization Theorem 9.1.1 to generalize the kinematic formulas of Chapter 3 to the lattice of polyconvex sets. These formulas will have application to additional questions regarding random motions of polyconvex sets. In Section 10.2 we use the principal kinematic formula of Section 10.1 to give a condition under which one compact convex set in R2 must be contained in a translate of another.
10.1 The principal kinematic formula Before pursuing additional applications of Theorem 9.1.1, we digress briefly to discuss the Haar measure on the group of Euclidean motions (including reflections across hyperplanes). It is easy to derive an explicit formula for such a measure by the arguments we have already employed for Grassmannians. The Haar measure on the orthogonal group O(n) is closely related to the invariant measure on the set of frames (and, therefore, the set of flags of subspaces) in R n, since orthogonal transformations are parametrized by Cartesian coordinate systems. In other words, after fixing a Cartesian coordinate system Xl, X2, ... ,Xn in R n, an arbitrary orthogonal transformation is uniquely determined by the choice of another Cartesian coordinate system UI, U2, ... ,Un with the same origin. (See also Section 6.7.) Therefore, an invariant measure on O(n) is obtained from the invariant measure on the set of frames in Mod( n) after multiplying it by 2n. However, for the purposes of this section we shall normalize the invariant measure on O(n) so that the total measure of O(n) equals one; i.e. we use the Haar probability measure on O(n). Recall that En denotes the set of all Euclidean motions of R n . Since every Euclidean motion is
147
10.1 The principal kinematic formula
a (possibly improper) rotation followed by a translation, an invariant measure is induced on En by taking the product of the Haar probability measure on O(n) with the n-dimensional Lebesgue measure. We denote this Haar measure on En by dg, where 9 E En. It turns out that Sylvester's Theorem 7.2.3 can be generalized to the case in which, in place of a linear variety, we substitute any compact convex set. Let A and K be compact convex sets. For 9 E En, denote by gK the set {g(b) : b E K}, and consider the integral /-to (A,
K) = ( /-to (A n gK) dg.
(10.1)
lEn
This integral has an evident geometric interpretation as the measure of the set of all 9 E En such that AngK =I=- 0. Alternatively one may think of (10.1) as the 'measure' of all convex sets gK in Rn congruent to K that meet A. If A and C are both compact convex sets of dimension n and if C ;2 A, then the quotient
f /-to (A n gK) dg f /-to(C n gK) dg
(10.2)
gives the probability that a 'rigid' convex set K, dropped at random on Rn so as to meet C, shall also meet A. Note that this probability is independent of the choice of normalizing constant for the Haar measure dg. The computation of the probability (10.2) can be carried out using Hadwiger's Theorem 9.1.1. Indeed, the set function /-to (A, K) is a continuous valuation in each of its variables K and A when the other is held fixed. Moreover,
/-to(g'A,K)
J J
/-to(g'AngK) dg /-to (A
=
J
/-to(Ang'-lgK) dg
n gK) dg
by the invariance of the Euler characteristic and of Haar measure. One verifies similarly that /-to (A, g' K) = /-to (A, K) for all g' E En. Finally, note that
J
/-to (A
n gK) dg
J J
/-to (g-l An
K) dg
=
J
/-to(gA n K) dg- 1
/-to(gA n K) dg,
since the Haar measure on En is invariant under the inversion map 9
1--+
148
10 Kinematic formulas for polyconvex sets
g-l. Thus, f-to(A, K) = f-to(K, A). From two applications of Hadwiger's
Theorem 9.1.1 we obtain the expansion f-to(A, K) =
n
n
i=O
j=O
LL
Cijf-ti(A)f-tj(K),
where the coefficients Cij = Cji depend only on n, and where f-ti and f-tj denote the intrinsic volumes. It turns out that most of the constants Cij are equal to zero. In fact, they can even be computed explicitly.
Theorem 10.1.1 (The principal kinematic formula) For all A, K E Polycon(n),
Proof From the argument above we know that f-to(A, K)
=
1
f-to(A
n gK)
dg
En
=
t
Cijf-ti(A)f-tj(K),
i,j=O
where the Cij are constants depending only on i, j, and n, and where Cij
= Cji'
Let En denote the unit ball in Rn, centered at the origin. For a, b 2: 0, denote by aEn and bEn the balls of radii a and b in R n, centered at the origin. Then
1
f-tO(aEn
En
r r r r
JO(n) JRn JO(n) JRn
n gbEn) dg
f-to(aE n
n (¢bEn + V)) dv d¢
f-to(aE n
n (bEn + v)) dv d¢,
since ¢En = En for any orthogonal transformation ¢. Since the total measure of O(n) is equal to one, we continue:
10.1 The principal kinematic formula
149
iRn I(a+b)Bn dv
(a + b)nwn Wn
t (~)aibn-i
.
• =0
In other words, J.lO(aBn, bBn) is a homogeneous polynomial in the nonnegative variables a and b. Meanwhile, n
L
CijJ.li(aBn)J.lj(bBn)
i,j=O n
L
Cijaibl J.li(Bn)J.lj(Bn).
i,j=O Therefore, Cij
=
0 if i
+j
=I=-
n, and
by Theorem 9.2.4.
o
In view of Theorem 10.1.1, the expression (10.2) becomes
J J.lo(A n gK) dg J J.lo(C n gK) dg giving once again the probability that a 'rigid' convex set K, dropped at random on Rn so as to meet C, shall also meet A. The principal kinematic formula can also be expressed in more probabilistic terms, using random variables. For a compact convex set K in Rn let Xi(K) again denote the i-volume of a projection of K onto a randomly chosen i-dimensional subspace V E Gr(n, i). Combining Corollary 9.4.2 with the principal kinematic formula (Theorem 10.1.1) we obtain
a formula mysteriously reminiscent of the classical binomial theorem.
150
10 Kinematic formulas for polyconvex sets
10.2 Hadwiger's containment theorem As an application of the principal kinematic formula 10.1.1, we derive Hadwiger's condition for the containment of one planar convex region by another. Theorem 10.1.1 takes the following form for polyconvex subsets of R2:
for all K, L E K2. The formula (6.4) for flag coefficients then yields
JrE2 J-Lo(K n gL) dg =
J-LO(K)J-L2(L)
+ ~J-Ll(K)J-Ll(L) + J-L2 (K)J-LO (L). K
(10.3) Now suppose that K and L are convex polygons in R2 with non-empty interiors. Denote by 8K and 8L the boundaries of K and L respectively, and let 9 be a Euclidean motion of the plane. If K n gL = 0 then 8Kng8L = 0 as well. However, if KngL -I- 0 there are two possibilities. The first possibility is that 8K n g8L -I- 0 as well, in which case 8K n g8L consists of an even number of distinct points in R2. (There is a possibility that 8K and g8L will intersect in some other way, but the set of such motions 9 is easily seen to be of measure zero in E2.) The second possibility is that 8K n g8L is still empty, which implies that either K S;;; int gL or gL S;;; int K. In other words, the motion 9 moves one of the regions K or L into the interior of the other. Suppose that neither K nor L can be moved into the interior of the other by any rigid motion of the plane. This eliminates the second possibility. In terms of the Euler characteristic, this non-containment assumption implies that (i) If J-Lo(K n gL) = 0 then J-Lo(8K n 9 8L) = 0 as well, and (ii) If J-Lo(K n gL) = 1 then J-Lo(8K n g8L) = 2k for some positive integer k, for all 9 E E2 (except a set of measure zero). In other words, if neither K nor L can be moved into the interior of the other by any rigid motion of the plane, then we have
J-Lo(8K n 9 8L) ?: 2J-Lo(K n gL).
10.2 Hadwiger's containment theorem
151
It follows that
( /-to(8K n g8L) dg
iE2
~
(
iE2
2/-to(K n gL) dg.
The principal kinematic formula (10.3) for R2 then implies that /-to (8K)/-t2(8L)
+
2
-/-tl (8K)/-tl (8L) 1f
+ /-t2(8K)/-to(8L) 4
> 2/-to(K)/-t2(L) + -/-tl (K)/-tl (L) + 2/-t2(K)/-to(L). 1f
Because K and L are convex, we have /-to(K) = /-to(L) = 1. Since K and L are polygons, 8K and 8L consist of finite unions of line segments. Therefore, /-t2(8K) = /-t2(8L) = 0, and we have 2
-/-tl(8K)/-tl(8L) 1f
~
2/-t2(L)
4
+ -/-tl (K)/-tl (L) + 2/-t2(K). 1f
(10.4)
If we denote by A(K) and P(K) the area and perimeter of a compact convex subset K of R2 (with a non-empty interior), the inequality (10.4) becomes
~P(K)P(L) ~ 2A(L) + ~ P~) P~L) + 2A(K), so that
21f(A(K) + A(L)) - P(K)P(L) :S
o.
(10.5)
Given K, L E K2 with non-empty interiors, define fl(K, L) by
fl(K, L)
=
21f(A(K)
+ A(L)) -
P(K)P(L).
(10.6)
We have shown that, if K and L are convex polygons with non-empty interiors, and if neither K nor L can be moved into the interior of the other by any rigid motion of the plane, then fl(K,L) :S O. Since the set of convex polygons is dense in the set of all compact convex subsets of R2, a simple continuity argument implies that fl(K, L) :S 0 if K, L E K2 (not necessarily polygons) such that neither contains a translate of the other in its interior. Thus, we have the following theorem.
Theorem 10.2.1 (Hadwiger's containment theorem) Let K, L E K2 with non-empty interiors. If fl(K, L) > 0 then there exists a Euclidean motion g E E2 such that either K ~ int gL or L ~ int gK.
o Which way will the containment go? If fl(K, L) > 0 then Theorem 10.2.1 implies that A(K) =I=- A(L). Therefore, the convex body of larger area will contain a rigid motion of the other.
152
10 Kinematic formulas for polyconvex sets
10.3 Higher kinematic formulas In addition to the principal kinematic formula 10.1.1 one can derive analogous kinematic formulas for the remaining intrinsic volumes J-L1, ... , J-Ln. Define a function (k on pairs of polyconvex sets by the formula (k(A, K) =
r
J-Lk(A n gK) dg
JEn
For convex A and K, one may interpret the expression
fEn J-Lk(A n gK) dg fEn J-Lo(A n gK) dg as the mean value, or the expected value, of the kth intrinsic volume of An gK, taken over all gK in Rn congruent to K that meet A. If A and C are compact convex sets of dimension n and if C ;2 A, then the quotient
f J-Lk(A n gK)dg f J-Lo(C n gK)dg
(10.7)
gives the expected value of J-Lk(A n gK) given that K meets C. For A, K convex, Theorem 9.3.1 asserts that J-Lk(A n gK) = '\~_k(Graff(A n gK; n - k))
Therefore, we have (k(A,K)
=
hn 1 '\~_k(Graff(A
J-Lk(AngK) dg
En
n gK; n - k)) dg
r r J-Lo«AngK)nV)d'\~_k(V)dg r r J-Lo«A n V) n gK) dgd'\~_k(V) JGraff(n,n-k) J En J En JGraff(n,n-k)
r
JGraff(n,n-k)
~ [7] i=O
n-k
~
t [~] i=O
-1 J-Li(A
n V)J-Ln-i(K)
d'\~_k(V)
2
-1 J-Ln-i(K)
r
JGraff(n,n-k)
J-Li(A n V)
[i +k k] [n]-l i J-Ln-i (K)J-LHk (A),
d'\~-k(V)
153
10.4 Notes
where the last equality follows from Theorem 9.3.2. Hence, we have the following generalization of Theorem 10.1.1 for the intrinsic volumes. Theorem 10.3.1 (The general kinematic formula) For 0::; k ::; n,
]-1
r
[. + k] [ lEn Jtk(A n gK) dg = n-k ~ 't k ~ for all A, K
E
Jtk+i(A)Jtn-i(K).
o
Polycon( n).
Theorem 10.3.1 and Hadwiger's characterization Theorem 9.1.1 together provide a kinematic formula for the integral
r
lEn
Jt(A n gK) dg,
for any continuous invariant valuation Jt on Polycon(n).
10.4 Notes The principal kinematic formula 10.1.1 has variously been attributed to Blaschke, Chern, Hadwiger, and Santal6 (see [82, p. 262] and [85, p. 253]). In [46], Howard gave a generalization of the kinematic formula to Riemannian homogeneous spaces. By a different approach Fu proved a generalized principal kinematic formula for invariant differential forms on homogeneous spaces [25]. Zhang [103] recently developed dual kinematic formulas for the lattice of star-shaped sets in Rn. For numerous variations and applications of kinematic formulas, see also [82, 85, 87, 98, 106]. Hadwiger's containment Theorem 10.2.1 first appeared in [35, 36]; see also [82, pp. 121-123]. A spherical version of Hadwiger's containment theorem is treated in Section 11.4; see also [82, p. 324]. The question of how to generalize Theorem 10.2.1 to higher dimensions remains open in many cases; some new ideas have recently been put forward by Zhou [77, 104, 105, 106, 107] for convex bodies in R3 and in R2n, and by Zhang [102] for Rn in general. For a generalization of Theorem 10.2.1 to the projective and hyperbolic planes, see [31]. Once again we emphasize the analogical nature of these results. In particular, the kinematic formula 3.2.5 of Chapter 3 is a discrete analogue of the general kinematic formula 10.3.1 for Polycon(n). This analogy suggests the possibility of additional kinematic formulas for the lattice Mod(n), with applications to appropriate probabilistic questions in this context.
11 Polyconvex sets in the sphere
The successful characterization of many classes of valuations on polytopes and compact convex sets in Euclidean space motivates similar questions about valuations on polytopes and convex sets in non-Euclidean spaces. Unfortunately little is yet known about valuations in a non-Euclidean context. In this chapter we consider valuations on the lattice of polyconvex sets in the unit sphere sn, with emphasis on the elementary example of continuous O(3)-invariant valuations on the unit sphere S2 in R3.
11.1 Convexity in the sphere Recall that sn denotes the set of unit vectors in Rn+l. The intersection En sn of a two-dimensional subspace E of Rn+l with the sphere is called a great circle. More generally, the set E n sn is called a great k-subsphere of sn if E is a subspace of Rn+l of dimension k + 1. Two points x, y, E sn-l are said to be antipodal if y = -x. The sphere sn inherits a Riemannian structure from the ambient space R n+ 1 , in which the shortest path (or geodesic) between two nonantipodal points x -I=- y E sn is given by the shorter arc of the unique great circle in sn passing through x and y. A great (n -l)-subsphere (J separates sn into two hemispheres, each the antipode of the other. The intersection of n + 1 distinct hemispheres having linearly independent normals is called a spherical n-simplex. The intersection of at most n hemispheres is called a lune. For u E sn denote by u-L the n-dimensional subspace of Rn+l orthogonal to u. If ~ is a simplex inside the great (n - 1)-subsphere u-L n sn, the lune through ~, denoted L( ~), consists of the union of all half circles with endpoints at u and -u and bisected by a point of ~.
11.1 Convexity in the sphere
155
A set P ~ sn is a convex spherical polytope if P can be expressed as a finite intersection of hemispheres. Denote by p(sn) the set of all convex spherical polytopes in sn. A spherical polytope is a finite union of convex spherical polytopes. More generally, a set K ~ sn will be called convex if K is contained in some hemisphere of sn and if any two points of K can be connected by a geodesic (i.e. an arc of a great circle) inside K. Alternatively, a set K lying inside a hemisphere of sn is convex if the cone 0 * K in R n+1 defined by 0*
K
=
{Au : u
E
K and
°: ; ). : ;
1}
is convex in R n+ 1 . Denote by K(sn) the set of all compact convex sets in sn. The set K(sn) is endowed with the topology induced by the Hausdorff metric on compact sets in Rn+l (see Section 4.1). In analogy to the Euclidean case, we call a set K ~ sn poly convex if K can be expressed as a finite union of compact convex sets in sn. Evidently the collection of polyconvex sets in sn forms a distributive lattice under union and intersection of sets. A function cp : p(sn) ---t R is called a valuation on p(sn) if cp(0) = 0, where 0 is the empty set, and if cp(P U Q) = cp(P)
+ cp(Q) -
cp(P n Q),
(11.1)
for all P,Q E p(sn) such that PUQ E p(sn) as well. A valuation cp on p(sn) is simple if it vanishes on spherical polytopes of dimension less than n. A valuation cp on p(sn) will be called invariant if cp(gP) = cp(P) for all orthogonal transformations (rotations and reflections) g of sn. Similarly, a function cp : K(sn) ---t R is said to be a valuation if cp(0) = and if (11.1) is satisfied for all compact convex sets K, L ~ sn such that K U L E K(sn) as well. The following is an adaptation of Groemer's extension Theorem 5.1.1 for valuations on the sphere sn.
°
Theorem 11.1.1 (Groemer's extension theorem for sn)
(i) A valuation cp defined on convex polytopes in sn admits a unique extension to a valuation on the lattice of all polytopes in sn. (ii) A continuous valuation cp on compact convex sets in sn admits a unique extension to a valuation on the lattice of polyconvex sets in sn. In each case the extension of cp to finite unions is given by iteration of the inclusion-exclusion identity (11.1). 0
156
11 Polyconvex sets in the sphere
The proof of part (i) of Theorem 11.1.1 follows the same lines as the proof of Theorem 4.1.3. Similarly, the proof of Theorem 5.1.1 (for the Euclidean case) goes through for the sphere sn without essential change, since the original proof is based not on the geometry of Rn, but rather on the algebra of indicator functions, and on the fact that a polytope is the intersection of half spaces in R n, a property which carries over analogously to spherical polytopes and hemispheres. (See Section 5.1.) In the arguments that follow, the unique extension of a valuation 'P given by Theorem 11.1.1 will allow us to consider the value of'P on all finite unions of convex spherical polytopes (or compact spherical convex sets in sn), whether or not such unions are actually convex.
11.2 A characterization for spherical area
We now turn our attention to the two-dimensional sphere S2. Important examples of continuous invariant valuations on P(S2) (and K(S2)) include spherical area, denoted 'P2, spherical length 'PI, and the Euler characteristic 'Po. The spherical length 'PI (K) of a spherical convex region K with a non-empty interior in S2 is given by one half of the perimeter of K; that is, one half of the length of the curve in S2 that forms the boundary of K. It is easy to verify that 'PI is an extension of geodesic length in S2 to a continuous invariant valuation on K(S2). To see this, note that
for all K E K(S2). The Euler characteristic 'Po (K) of a spherical convex region K is defined to be 1 if K i=- 0, while 'Po(0) = o. Theorem 11.1.1 insures that 'Po has a unique continuous and invariant extension to all finite unions of spherical convex sets in K. As in the Euclidean case, this unique extension of 'Po coincides with the Euler characteristic of algebraic topology. It can be shown that in fact every continuous invariant valuation on p(S2) (or K(S2)) is a linear combination of the valuations 'PO,'PI,'P2. To this end, we prove a characterization theorem for spherical area 'P2. Theorem 11.2.1 (The spherical area theorem) Suppose that 'P is a continuous invariant simple valuation on p(S2). Then there exists c E R such that 'P(P) = C'P2(P), for all P E P(S2). Remark: Theorem 11.2.1 actually holds under the weaker assumption
11.2 A characterization for spherical area
157
that cp is invariant under the group SO(3) of rotations of S2. This follows from an argument similar to that in the proof of Proposition 8.3.1. Before proving Theorem 11.2.1, we consider two preliminary cases. Proposition 11.2.2 Suppose that cp is a continuous invariant simple valuation on closed arcs I of the circle SI. Then there exists c E R such that cp(I) = CCPl (I), for all closed arcs I ~ SI.
Proof Let c = cp(SI) / (27r), and define 1/(1) = cp(I) - CCPl (I), for all closed arcs I ~ SI. Note that 1/ is also continuous, invariant, and simple. In addition, I/(SI) = o. It now suffices to show that 1/(1) = 0 for all I. Suppose that In is a closed arc of length 27r / n, where n is a positive integer. Since the circle SI can be tiled with n rotations of In, the invariance and simplicity of 1/ imply that nl/(In) = I/(SI) = 0, so that I/(In) = O. Since any closed arc I of length 27rm/n can be tiled with rotations of In, for any positive integers m < n, it follows that 1/ vanishes on arcs of length rational in proportion to the circle. It then follows from the continuity of 1/ that 1/(1) = 0 for all closed arcs I. 0 Proposition 11.2.3 Suppose that cp is a continuous invariant simple valuation on p(S2) such that cp(S2) = O. Then cp(~) = 0, for all spherical simplices ~ C S2.
Proof Let 0" denote a great circle in S2. For any closed arc I contained in a half-circle in 0", denote by £(1) the lune through I, and define 1/(1) = cp(£(I)). Evidently the valuation 1/ satisfies the conditions of Proposition 11.2.2. Therefore, there exists c E R such that 1/(1) = ccpl(I) for all I ~ 0". However, 1/(0") = cp(S2) = 0, so that c = O. It follows that cp vanishes on alllunes through closed arcs of 0". Since cp is invariant, it follows in turn that cp vanishes on alllunes in S2. Now suppose that ~ is a spherical 2-simplex in S2; that is, a spherical triangle; given by the intersection of hemispheres ~ = HI nH2 nH3 . See Figure 11.1. For A ~ S2 denote by AC the closure of the complement S2 - A. Note that
where - ~ denotes the orthogonal reflection of R3. Since cp is simple and invariant, we obtain
cp(Hl U H2 U H 3 ) = cp(S2) - CP((HI UH2 U H 3
~
through the origin in
n = 0-
cp( -~) = -cp(~). (11.2)
11 Polyconvex sets in the sphere
158
Fig. 11.1. An intersection of hemispheres in 8 2 •
Meanwhile, iteration of the inclusion-exclusion identity (11.1) gives 3
cp(Hl UH2UH3) =
L cp(Hi) - L cp(HinHj)+cp(HlnH2nH3). i=l
(11.3)
i<j
Note that the intersections Hi n Hj are lunes in 8 2 for i =1= j. Since cp vanishes on hemispheres and lunes, the identity (11.3) implies that (11.4) It then follows from (11.2) and (11.4) that cp(~) = O.
cp(~) =
-cp(~);
that is, 0
Remark: A spherical triangle ~ in 8 2 has angle measures ct, /3, 'Y E [0,11'] if the inner Euclidean (dihedral) angles between planar sides of the cone o * ~ in R3 are given by ct, /3, 'Y. It is well known that the spherical area of ~ is given by the spherical excess:
(11.5) One can prove (11.5) by the same inclusion-exclusion technique that was used in the proof of Proposition 11.2.3, using the fact that the lune through a great circular arc of length 8 has spherical area 28. We are now ready to prove Theorem 11.2.1. Proof of Theorem 11.2.1 Suppose that cp is a continuous invariant simple valuation on P(8 2). Let c = cp(8 2)j(411') and define v(P) = cp(p) Ccp2(P) for all P E P(8 2). Since the valuation v satisfies the conditions of Proposition 11.2.3, it follows that v(~) = 0 for all spherical simplices ~ <;;; 8 2 . For P E P(8 2 ) express P as a union of spherical simplices
P= ~l U··· U~m'
11.3 Invariant valuations on spherical polytopes where
dim(~i
n ~j) < 2 for all i
=1= j.
159
Since v is simple, it follows that
m
v(P)
= L V(~i) = 0, i=1
for all P E p(S2).
o
Since any continuous invariant simple valuation
for all K
E
K(S2).
0
11.3 Invariant valuations on spherical polytopes
Theorem 11.2.1 leads to the following characterization theorem for continuous invariant valuations. Theorem 11.3.1 Suppose that
p(S2). Then there exist Co, Cl, C2 E R such that, for all P E p(S2),
Theorem 11.3.1 can be thought of as a spherical analogue of Hadwiger's characterization Theorem 9.1.1. Since the set p(S2) is dense in K(S2), Theorem 11.3.1 also holds if p(S2) is replaced with the larger collection K(S2). Proof Suppose that
160
11 Polyconvex sets in the sphere
vanishes on all P E P(S2) of dimension less than 2; that is, v is a continuous invariant simple valuation on p(S2). Theorem 11.2.1 then implies the existence of C2 E R such that v(P) = C2!.p2(P) for all P E
p(S2).
0
Theorem 11.3.1 implies that any continuous invariant valuation on
p(S2) is determined completely by its values on a singleton {x}, a great circle 0", and the entire sphere S2. Indeed, the coefficients Ci in Theorem 11.3.1 can be computed using the following table of values:
{x}
0"
S2
1 0 0
0 21f 0
2 0 41f
!.po !.pI
!.p2
(11.6)
Note well that !.pI (S2) = O. This follows from the fact that !.pI (H) = 1f, for any closed hemisphere H, while !.p1(0") = 21f if 0" is a great circle. Denote by -H the antipode of H. We then obtain
since H n -His a great circle. One of the advantages of Hadwiger's characterization Theorem 9.1.1 for invariant valuations on Euclidean convex sets is the ease with which that theorem allows one to prove a variety of classical results in integral geometry. We now demonstrate similar advantages of the spherical area Theorem 11.2.1 and the resultant characterization Theorem 11.3.1. We begin with a spherical analogue of the mean projection formula (9.7) of Section 9.4. For K E K(S2) and u E S2, define the projection Ku of K onto the great circle u.l n S2 by
Ku
= {x
E u.l
n S2 : lux] n K -:f. 0},
where lux] denotes the (unique) half great circle through x with endpoints at u and -u. The mean spherical width sw(K) of a compact convex set Kin S2 is defined by (11. 7) The functional sw(K) is continuous on K(S2). To see this, note that, for a fixed u E S2 and K E K(S2), the interval Ku varies continuously in a neighborhood of (K, u) provided that either u E int(K) or u 1:. K. Thus the functional !.pI (Ku) is continuous at (K, u) unless u E oK. Since oK
11.3 Invariant valuations on spherical polytopes
161
has measure zero in S2, the functional ifJl (Ku) is integrable with respect to spherical Lebesgue measure, and the integral (11. 7) varies continuously with respect to K. Although we do not apply the formula (11.7) to non-convex sets, it is possible to extend the functional sw to a continuous valuation on all polyconvex sets in S2 as follows. Suppose that K c S2 is compact and convex. If u E S2 and x E u-L n ~2, then ifJo(Kn lux]) = 1 if and only if x E K u , otherwise ifJo(Kn lux]) = 0. (An exception may occur if K contains a great half-circle, in which case this exception will occur with measure zero.) Therefore, the equation (11.7) can be rewritten sw(K) =
r
iS2
ifJl(Ku ) du =
r
1,
iS2 u-Ln S2
ifJo(K n lux]) dx duo
(11.8)
Since ifJo is an invariant valuation, it follows from (11.8) that the mean spherical width sw(K) is a continuous invariant valuation of K. Moreover, the integral (11.8) is defined for all polyconvex sets in S2, giving the desired extension of sw. Proposition 11.3.2 For K E K(S2), sw(K)
= 41rifJl(K) + 21rifJ2(K).
Proof Since sw is a continuous invariant valuation on K(S2), Theorem 11.3.1 implies that
for some constants Ci E R. Since the formula (11.8) is valid (as a valuation) for all finite unions of compact convex sets, we can evaluate (11.8) to compute sw on great circles and on the full sphere S2. To compute the coefficients Ci, note that sw vanishes on singleton sets, so that Co = 0. If 0" is a great circle, then 0" n lux] consists of a single point almost always; that is, provided that u rf- 0". Thus, ifJo(O" n lux]) = 1 almost always, and we obtain
Cl ifJl (0")
= 21rCl =
r 1,u-LnS2 iS2
dx du
= 81r2,
or Cl = 41r. Finally, if K = S2 then S2 n lux] = lux]' a half-circle, so that ifJO(S2 n lux]) = 1. Since ifJl(S2) = 0, we obtain
C2ifJ2(S2) or C2
= 21r.
= 41rC2 =
r 1,
iS2 u-Ln S2
dx du
= 81r2, D
11 Polyconvex sets in the sphere
162
In a similar manner Theorem 11.3.1 can be used to obtain an easy proof of the spherical Crofton formula, which expresses the spherical perimeter (or length) of a spherical polyconvex set K as the measure of the great circles meeting K; that is,
r 4?o(K nul..) du
JS2
=
44?1(K),
(11.9)
for all polyconvex sets K in S2.
11.4 Spherical kinematic formulas In analogy to the methods of Sections 10.1 and 10.3, the characterization Theorem 11.3.1 yields an easy proof of a spherical kinematic formula for polyconvex sets in S2, which generalizes the spherical Crofton formula (11.9).
Theorem 11.4.1 (The principal kinematic formula for S2) all poly convex sets K, L ~ S2,
1
4?o(K n gL) dg
=
1
41f 4?0 (K)4?2 (L)
0(3)
For
1
+ 21f24?1(K)4?1(L)
1
+ 41f4?2(K)4?0(L) -
1
81f 2 4?2(K)4?2(L).
Here the integral is taken with respect to the invariant Haar probability measure on the orthogonal group 0(3). Proof To begin, define
4?o(K, L)
=
1
4?o(K n gL) dg.
0(3)
For fixed K, the set function 4?o(K, L) is a valuation in the variable L; in fact, it is an invariant valuation, since
4?o(K,goL)
1 1
4?o(K n ggoL) dg
0(3)
4?o(K n gL) dg,
0(3)
for each go E 0(3). By Theorem 11.3.1, the functional 4?o(K, L) can be expressed as a linear combination of the valuations 4?i' with coefficients
11.4 Spherical kinematic formulas
163
ci(K) depending on K: 2
'Po(K, L)
=
I>i(K)'Pi(L). i=O
Meanwhile, for fixed L, the set function 'Po(K, L) is a valuation in the variable K. From this it follows that each of the coefficients ci(K) is a valuation in the variable K. One way to check this fact is to insert special values for L and use the table (11.6). Since 'Po is invariant, we ~liave
r r r
'Po(K, L)
'Po(K n gL) dg
JO(3)
'PO(g-1 K n L) dg
JO(3)
'Po(gK n L) dg
= 'Po(L, K).
JO(3)
Therefore, the coefficients ci(K) are invariant valuations in the parameter K, so that 2
'Po(K, L)
=
L
Cij'Pi(K)'Pj(L),
(11.10)
i,j=O again by Theorem 11.3.1. Since 'Po(K,L) = 'Po(L,K), it is evident that Cij = Cji' One can verify this, as well as compute the values of the coefficients Cij, by evaluating 'Po(K, L) for the cases in which each of K and L is either a point (singleton) {x}, a great circle 0", or the whole sphere 8 2 , using the values of 'Pi given in the table (11.6). For example, if K = L = {x}, then (11.10) takes the form: 0= Coo. We then let K
o
= 0"
and L
= {x},
COl 'Po (0" )'PI ( {x})
so that (11.10) becomes
+ ClO'PI (0" )'Po ({x}) + Cll 'PI (0" )'PI ({ X } )
0+ 27rClO + 0 so that ClO = COl = O. In a similar manner one evaluates both sides of (11.10) for each case of K, L E {{x}, 0", 8 2 } (using the table (11.6)) to compute the values of Cij, thereby completing the proof of Theorem 11.4.1. 0
164
11 Polyconvex sets in the sphere
Using similar methods one can easily show that, for polyconvex sets K,L<:::; S2,
and
r
JO(3)
ifJ2(K n gL) dg = 41 ifJ2(K)ifJ2(L), n
thereby classifying all kinematic formulas for continuous invariant valuations on S2 (by Theorem 11.3.1). Theorem 11.4.1 and its higher dimensional generalizations have numerous applications to questions in spherical geometric probability. For example, Theorem 11.4.1 (as stated for S2) leads in turn to a spherical analogue of Hadwiger's containment Theorem 10.2.1. Theorem 11.4.2 Let K, L E K(S2) with non-empty interiors, and suppose that (11.11) Then there exists 9 E 0(3) such that either K <:::; int gL or L <:::; int gK. D
To determine the direction of containment in Theorem 11.4.2, one compares the values of ifJ2(K) and ifJ2(L). Evidently the set of larger spherical area will contain a rigid motion of the other. If we define A(K) = ifJ2(K) and P(K) = 2ifJl(K) (spherical area and perimeter respectively), then the inequality condition (11.11) becomes: 2n(A(K)
+ A(L)) -
A(K)A(L) - P(K)P(L)
> O.
Compare this with (10.5) in Section 10.2. In analogy to the proof of Theorem 10.2.1, the proof of Theorem 11.4.2 makes use ofthe spherical principal kinematic formula (Theorem 11.4.1), along with the fact that two polygonal convex curves in the sphere will almost always intersect at an even number of points. For details, see Section 10.2.
11.5 Remarks on higher dimensional spheres A natural question at this point (or even earlier) is that of how spherical volume on polytopes or convex sets in higher dimensional spheres is
11.5 Remarks on higher dimensional spheres
165
to be characterized. In order to characterize spherical area (on S2) we proved the following sequence of assertions for a given continuous invariant simple valuation cp:
cp is simple
t-t
cp
=
area on lunes (by induction on dimension) t-t cp = spherical area,
where the second implication follows from an inclusion--exclusion argument (see (11.3)). Unfortunately, this inclusion-exclusion argument fails in S3 and for s2n+1 in general. Without a characterization theorem for spherical volume in S3, the induction step for the first implication is not possible, so that the proof of Theorem 11.2.1 also fails to generalize to S4 and so on. However, we do have the following partial result. Denote by S the (invariant) spherical volume on s2n. Recall that a lune in s2n is a subset of s2n consisting of the intersection of at most 2n hemispheres. Theorem 11.5.1 Suppose that cp is a continuous invariant simple valuation on p(S2n). Ifcp(L) = S(L) for alllunes L ~ S2n, then cp(P) =
S(P) for all P E p(S2n). Proof Without loss of generality we may assume that cp(S2n) = 0, so that cp(L) = 0 for alllunes L ~ s2n. (Just substitute cp - S for cp.) It remains to show that cp(P) = 0 for all P E p(S2n). Suppose that b. is a spherical simplex in s2n given by the intersection of hemispheres b. = HI n ... n H 2n + l . Again denote by AC the closure of the complement S2n - A, for any A ~ s2n. Note that
Since cp is simple and invariant, we obtain
= 0 - cp( -b.) = -cp(b.). (11.12) Meanwhile, the inclusion-exclusion principle gives
cp (
2n+1
U Hi )
2n+1
L
CP(Hi) -
L
CP(Hil n H i2 ) + ...
i=1
(11.13)
166
11 Polyconvex sets in the sphere
Since cp vanishes on hemispheres and lunes, all of the terms on the righthand side of (11.13) vanish except for the last term: (-1)2ncp(Hl n· .. n H 2nH ) = cp(~). It then follows from (11.12) that
cp(~) = cp
ild Hi
2n+l (
)
= -cp(~),
so that cp(~) = O. Since every polytope P E p(S2n) can be expressed as a union of spherical simplices intersecting in dimension less that 2n, it follows that cp(P) = 0 for all P E p(S2n). 0 This proof of Theorem 11.5.1 fails for s2nH because the sign of the last term on the right-hand side of (11.13) is negative in this case, so that, on combining it with (11.12), we would obtain the tautological observation that
cp(~) = -cp
ild Hi
2n+l (
)
= cp(~).
For this reason a proof of such a spherical volume characterization for S2nH, if possible, will require some new ingredient besides the decomposition of simplices by inclusion-€xclusion. Indeed the question of whether spherical volume is the only continuous invariant simple valuation on p(sn) (or K(sn)) remains open for dimension n 2: 3.
11.6 Notes
Theorems 11.2.1 and 11.5.1 are both originally due to McMullen. In [70] McMullen developed the polytope algebra, which places many dissection and inclusion-exclusion techniques in an algebraic framework (see also [72]). The proofs of Theorems 11.2.1 and 11.5.1 given in Section 11.2 give a more simplified approach to these specific questions. Groemer made observations regarding Theorem 11.1.1 at the end of his paper on extensions of additive set functionals [32]. Theorem 11.4.2 and other non-Euclidean analogues of Hadwiger's containment theorem can be found in [82, p. 324]. For additional references see Section 10.4. Theorem 11.4.1 generalizes to a principal kinematic formula for spheres of arbitrary dimension; see [82, p. 321]. Similarly, the spherical Crofton formula (11.9) generalizes to higher dimensions [82, p. 316]. However, it is not known whether the characterization Theorem 11.3.1 generalizes to spheres of dimension n 2: 3.
11.6 Notes
167
Theorems 11.2.1 and 11.2.4 vastly simplify valuation characterization theorems for valuations on star-shaped sets in R3. In [51, 52] characterizations are given for continuous homogeneous valuations and continuous rotation invariant valuations on a class of star-shaped sets called Ln-stars; that is, star-shaped sets in R n with n-integrable radial nmctions (see also [26]). For the general case of star-shaped sets in R n these characterizations require that the valuations in question be defined on the entire class of Ln-stars. However, by using Theorem 11.2.4 one can prove similar characterizations for valuations defined merely on finite unions of convex cones in R 3 (of finite height) having a common apex at the origin. This is a much smaller and more manageable class of objects, which is nonetheless dense in the original and larger class of L3- stars. Clearly a higher dimensional version of Theorem 11.2.1 or Theorem 11.2.4 would have similar and important consequences in the theory of valuations on star-shaped sets in Rn.
Bibliography
[1]
[2] [3] [4] [5] [6] [7] [8] [9]
[10]
[11]
[12] [13] [14]
R. V. Ambartzumian. Combinatorial Integral Geometry. (New York: John Wiley and Sons, 1982). R. V. Ambartzumian. Factorization Calculus and Geometric Probability. (New York: Cambridge University Press, 1990). I. Anderson. Combinatorics of Finite Sets. (New York: Oxford University Press, 1987). E. Artin. The Gamma Function. (New York: Holt, Rinehart, and Winston, Inc., 1964). E. Barbier. Note sur Ie probleme de l'aiguille et Ie jeu du joint couvert, J. Mathematiques Pures et Appliquees (2), 5 (1860), 273-286. C. Berge. Sur une propriete combinatoire des ensembles convexes, C. R. Acad. Sci. Paris, 248 (1959), 2698-2699. C. Berge. Hypergraphs. (New York: Elsevier Science Publishers, 1989). V. Boltianskii. Hilbert's Third Problem. (New York: John Wiley and Sons, 1978). A. Cauchy. Note sur divers theorems relatifs a la rectification des courbes et a la quadrature des surfaces, C. R. Acad. Sci., Paris, 13 (1841), 1060-1065. Also: Oeuvres Completes (1), 6 (Paris: 1888), pp. 369-375. A. Cauchy. Memoire sur la rectification des courbes et la quadrature des surfaces courbes, Mem. Acad. Sci. Paris, 22 (1850), 3. Also: Oeuvres Completes (1),2 (Paris: 1908), pp. 167-177. B. Chen, The Gram-Sommerville and Gauss-Bonnet theorems and geometric measures for noncompact polyhedra, Adv. Math., 91 (1992), 269-29l. B. Chen, On the Euler characteristic of finite unions of convex sets, Disc. Comput. Geom., 10 (1993), 79-93. B. Chen and G.-C. Rota. Totally invariant set functions of polynomial type, Comm. Pure Appl. Math., 47 (1994), 187-197. M. W. Crofton. On the theory of local probability, applied to straight lines drawn at random in a plane; the methods used being also extended
Bibliography
[15] [16] [17] [18]
[19] [20] [21]
[22] [23] [24] [25] [26]
[27] [28] [29]
[30] [31]
[32] [33]
[34]
169
to the proof of certain new theorems in the integral calculus, Philos. Trans. Roy. Soc. London, 158 (1868), 181-199. M. W. Crofton. Sur quelques theorems de calcul integral, C. R. Acad. Sci., Paris, 68 (1869), 1469-1470. M. W. Crofton. Geometrical theorems relating to mean values, Proc. London Math. Soc., 8 (1877), 304-309. M. W. Crofton. Probability. Encyclopaedia Britannica, 9th Edition, 19 (1885), 768-788. L. Danzer, B. Griinbaum and V. Klee. Helly's theorem and its relatives. Convexity, Proceedings of Symposia in Pure Mathematics, Volume VII. (Providence, RI: American Mathematical Society, 1963), 101-177. M. Dehn. Uber den Rauminhalt, Math. Ann., 55 (1901), 465--478. G. Dolzmann and D. Hug. Equality of two representations of extended affine surface area, Arch. Math., 65 (1995), 352-356. J. Eckhoff. Helly, Radon, and Carathe6dory type theorems. Handbook of Convex Geometry, Peter M. Gruber and Jorg M. Wills, Eds. (Amsterdam: North-Holland, 1993), 389-448. P. Erdos. On a lemma of Littlewood and Offord, Bull. Amer. Math. Soc., 51 (1945), 898-902. W. Feller. An Introduction to Probability Theory and Its Applications, Volume II, 2nd Edition. (New York: Wiley Publishers, 1957). S. Fisk. Whitney numbers of projective space over R, C, H, and the p-adics, J. Combinatorial Theory, Series A, 70 (1995), 165-169. J. H. G. Fu. Kinematic formulas in integral geometry. Indiana Univ. Math. J .. 39 (1990), 1115-1154. R. J. Gardner. Geometric Tomography (New York: Cambridge University Press, 1995). Classic Papers in Combinatorics, Ira Gessel and G.-C. Rota, Eds. (Boston: Birkhiiuser Verlag, 1987). A. Ghouila-Houri. Sur l'etude combinatoire des familles de convexes, C. R. Acad. Sci. Paris, 252 (1961), 494--496. C. Greene and D. Kleitman. Proof techniques in the theory of finite sets. Studies in Combinatorics, G.-C. Rota, Ed. (Mathematical Association of America, 1978), 22-79. N. T. Gridgeman. Geometric probability and the number 1[', Scripta Math., 25 (1960), 183-195. E. Grinberg, D. Ren and J. Zhou. The symmetric isoperimetric deficit and the containment problem in a plane of constant curvature. Preprint. H. Groemer. On the extension of additive functionals on classes of convex sets, Pacific J. Math., 75 (1978), 397-410. H. Groemer. Fourier series and spherical harmonics in convexity. Handbook of Convex Geometry, Peter M. Gruber and Jorg M. Wills, Eds. (Amsterdam: North-Holland, 1993), 1259-1296. H. Groemer. Geometric Applications of Fourier Series and Spherical Harmonics. (New York: Cambridge University Press, 1996).
170 [35] [36]
[37] [38] [39] [40] [41] [42] [43] [44]
[45]
[46] [47] [48] [49] [50] [51] [52] [53] [54] [55] [56]
Bibliography H. Hadwiger. Uberdeckung ebener Bereiche durch Kreise und Quadrate, Comment. Math. Helv., 13 (1941), 195-200. H. Hadwiger. Gegenseitige Bedeckbarkeit zweier Eibereiche und Isoperimetrie. Vierteljschr. Naturforsch. Gesellsch. Zurich, 86 (1941), 152-156. H. Hadwiger. Eulers Charakteristik und kombinatorische Geometrie, J. reine angew. Math., 194 (1955), 101-110. H. Hadwiger. Altes und Neues uber Konvexe Korper. (Basel: Birkhiiuser Verlag, 1955). H. Hadwiger. Vorlesungen uber Inhalt, Oberftiiche und Isoperimetrie. (Berlin: Springer Verlag, 1957). H. Hadwiger and P. Glur. Zerlegungsgleichheit ebener Polygone, Elem. Math.,6 (1951),97-106. G. H. Hardy, J. E. Littlewood, and G. P6lya. Inequalities, 2nd Edition. (New York: Cambridge University Press, 1988). L. H. Harper and G.-C. Rota. Matching theory, an introduction, Adv. Probability,1 (1971), 171-215. E. Helly. Uber Mengen konvexer Karper mit gemeinschaftlichen Punkten, Jber. Deutsch. Math. Verein., 32 (1923), 175-176. D. Hilbert. Mathematical Problems. Lecture delivered before the International Congress of Mathematicians in Paris, 1900. Translated by M. W. Newson, Bull. Amer. Math. Soc., 8 (1902), 437-479. M. Hochberg and W. M. Hirsch. Sperner families, s-systems, and a theorem of Meshalkin, Annals New York Acad. Sci., 175 (1970), 224237. R. Howard. The kinematic formula in Riemannian homogeneous spaces, Memoirs Amer. Math. Soc., 509 (1993). S. Karlin. Total Positivity. (Stanford, CA: Stanford University Press, 1968). G. Katona. A theorem of finite sets, Theory of Graphs, (Akademia Kiad6, Budapest: 1968), 187-207. M. Kendall and P. Moran. Geometrical Probability. (New York: Hafner Publishing Co., 1963). D. Klain. A short proof of Hadwiger's characterization theorem, Mathematika, 42 (1995), 329-339. D. Klain. Star valuations and dual mixed volumes, Adv. Math., 121 (1996), 80-101. D. Klain. Invariant valuations on star-shaped sets, Adv. Math., 125 (1997),95-113. D. Klain. Kinematic formulas for finite vector spaces, to appear in Discrete Math. D. Klain. Even valuations on convex bodies. Preprint. D. Klain and G.-C. Rota. A continuous analogue of Sperner's theorem, Comm. Pure Appl. Math., 50 (1997),205-223. V. Klee. On certain intersection properties of convex sets, Canadian J. Math., 3 (1951), 272-275.
Bibliography [57]
[58] [59] [60] [61] [62] [63] [64] [65] [66] [67]
[68] [69] [70] [71]
[72]
[73] [74] [75] [76] [77] [78]
171
J. B. Kruskal. The number of simplices in a complex. Mathematical Optimization Techniques. (University of California Press, 1963), 251278. T. Kubota. tIber konvex-geschlossene Mannigfaltigkeiten im ndimensionalen Raume, Sci. Rep. T8hoku Univ., 14 (1925), 85-99. K. Leichtweiss. Zur Affinoberfiache konvexer Korper, Manuscripta Math., 56 (1986), 429-464. K. Leichtweiss. tIber einige Eigenschaften der Affinoberfiache beliebiger konvexer Korper, Resultate Math., 13 (1988), 255-282. K. Leichtweiss. Bemerkungen zur Definition einer erweiterten Affinoberfiache von E. Lutwak, Manuscripta Math., 65 (1989), 181-197. F. W. Levi. Eine Erganzung zum Hellyschen Satz, Arch. Math., 4 (1953), 222-224. D. Lubell. A short proof of Sperner's theorem, J. Combinatorial Theory, 1 (1966), 299. E. Lutwak. Dual mixed volumes, Pacific J. Math., 58 (1975), 531-538. E. Lutwak. Intersection bodies and dual mixed volumes, Adv. Math., 71 (1988), 232-261. E. Lutwak. Extended affine surface area, Adv. Math., 85 (1991), 39-68. E. Lutwak. Selected affine isoperimetric inequalities, Handbook of Convex Geometry, Peter M. Gruber and Jorg M. Wills, Eds. (Amsterdam: North-Holland, 1993), 151-176. E. Lutwak. Containment and circumscribing simplices, to appear in Disc. Comput. Geom. M. Marcus and H. Mine. A Survey of Matrix Theory and Matrix Inequalities. (New York: Dover Publications, 1992). P. McMullen. Non-linear angle-sum relations for polyhedral cones and polytopes, Math. Proc. Camb. Phil. Soc., 78 (1975), 247-261. P. McMullen. Valuations and dissections, Handbook of Convex Geometry, Peter M. Gruber and Jorg M. Wills, Eds. (Amsterdam: NorthHolland, 1993), 933-988. P. McMullen and R. Schneider. Valuations on convex bodies, Convexity and Its Applications, Peter M. Gruber and Jorg M. Wills, Eds. (Boston: Birkhauser Verlag, 1983), 170-247. L. D. Meshalkin. A generalization of Sperner's theorem on the number of subsets of a finite set, Theor. Probability Appl., 8 (1963), 203-204. J. Milnor and J. Stasheff. Characteristic Classes (Princeton, NJ: Princeton University Press, 1974). J. Munkres. Elements of Algebraic Topology. (Menlo Park, CA: Benjamin/Cummings, 1984). L. Nachbin. The Haar Integral. (New York: D. Van Nostrand Company, 1965). D. Ren. Topics in Integral Geometry. (Singapore: World Scientific International Publisher, 1992). G.-C. Rota. On the foundations of combinatorial theory, I: Theory of Mobius functions, Z. Wahrscheinlichkeit. 2, 368 (1964), 340-368.
172 [79]
[80] [81] [82] [83]
[84]
[85] [86] [87]
[88] [89] [90] [91] [92] [93] [94] [95] [96] [97] [98]
[99]
Bibliography G.-C. Rota. On the combinatorics of the Euler characteristic. Studies in Pure Mathematics (Papers Presented to Richard Rado) (London: Academic Press, 1971), 221-233. Gian-Carlo Rota on Combinatorics: Introductory Papers and Commentaries, Joseph P. S. Kung, Ed. (Boston: Birkhauser Verlag, 1995). C.-H. Sah. Hilbert's Third Problem: Scissors Congruence. (San Francisco: Fearon Pitman Publishers, 1979). L. A. Santalo. Integral Geometry and Geometric Probability. (Reading, MA: Addison-Wesley, 1976). S. Schanuel. What is the length of a potato? An introduction to geometric measure theory. Categories in Continuum Physics, 1982, Springer Lecture Notes in Math., 1174 (1986),118-126. S. Schanuel. Negative sets have Euler characteristic and dimension. Category Theory, 1990, Springer Lecture Notes in Math., 1488 (1991), 379-385. R. Schneider. Convex Bodies: The Brunn-Minkowski Theory. (New York: Cambridge University Press, 1993). R. Schneider. Simple valuations on convex bodies, Mathematika, 43 (1996), 32-39. R. Schneider and J.A. Wieacker. Integral geometry, Handbook of Convex Geometry, Peter M. Gruber and Jorg M. Wills, Eds. (Amsterdam: North-Holland, 1993), 1349-1390. C. Schutt. On the affine surface area, Proc. Amer. Math. Soc., 118 (1993), 1213-1218. C. Schutt and E. Werner. The convex floating body, Math. Scand., 66 (1990), 275-290. H. Solomon. Geometric Probability. (Philadelphia: Society for Industrial and Applied Mathematics, 1978). E. Sperner. Ein Satz uber Untermengen einer endlichen Menge, Math. Z., 27 (1928), 544-548. R. Stanley. Enumerative Combinatorics. (Monterey, CA: Wadsworth & Brooks/Cole Advanced Books and Software, 1986). E. Steinitz. Bedingt konvergente Reihen und konvexe Systeme, J. reine angew. Math., 143 (1913), 128-175. G. Strang. Linear Algebra and its Applications, 3rd Edition. (New York: Harcourt Brace Jovanovich, 1988). J. J. Sylvester. On a funicular solution of Buffon's 'problem of the needle' in its most general form, Acta Math., 14 (1890), 185-205. R. Vitale. The Wills functional and Gaussian processes, Ann. Probab., 24 (1996), 2172-2178. F. W. Warner. Foundations of Differentiable Manifolds and Lie Groups. (New York: Springer Verlag, 1983). W. Weil. Kinematic integral formulas for convex bodies, Contributions to Geometry, Proc. Geometry Symp., Siegen, 1978, J. Tolke and Jorg M. Wills, Eds. (Boston: Birkhauser Verlag, 1978), 60-76. W. Weil. Stereology: A survey for geometers, Convexity and its Applications, Peter M. Gruber and Jorg M. Wills, Eds. (Boston: Birkhauser
Bibliography [100] [101] [102] [103] [104] [105] [106]
[107]
173
Verlag, 1983), 360-412. E. Werner. Illumination bodies and affine surface area, Studia Math., 110 (1994), 257-269 J. M. Wills. Zur Gitterpunktanzahl konvexer Mengen, Elemente Math., 28 (1973), 57-63. G. Zhang. Geometric inequalities and inclusion measures of convex bodies, Mathematika, 41 (1994),95-116. G. Zhang. Dual kinematic formulas. Preprint. J. Zhou. A kinematic formula and analogues of Hadwiger's theorem in space, Contemporary Math., 140 (1992), 159-167. J. Zhou. The sufficient condition for a convex body to fit another in R 4 , Proc. Amer. Math. Soc., 121 (1994),907-913. J. Zhou. Kinematic formulas for mean curvature powers of hypersurfaces and Hadwiger's theorem in R 2n , Trans. Amer. Math. Soc., 345 (1994), 243-262. J. Zhou. When can one domain enclose another in R 3 ?, J. Austral. Math. Soc. (Series A), 59 (1995), 266-272.
Index of symbols
B n ,31 CI:,123 C~, 88 En, 86 Fi(P), 138 H+,34 H-,34 lA, 7, 144 K +EBn, 31 K +L, 43, 99 KIV,55 L(B), 21 L(/l), 154 M(Q, r), 138 O(n),60 P(P(B)), 21 P(B), 13 Pier), 138 Pk(B), 13 Pal , ... ,a r (B), 18 B(K),55 BO(n), 100 BO(n,B),100 BK,109 T2,117 Tn, 43 W(K),134 Zc,4 [0, l]n, 38, 101 [n]!,62 [n], 61, 66 [x, y], 13 /l(K,L),151 ret), 63 0,141 Q,39 R2 1
Rn',30 R Q ,39
Sn-l, 56 Z,112
k,143 143 b(K, L), 31 ix, 120
V,
0, 7
x, 95 Ai, 3 Ak, 86
(n/2), 14 (nlr),20 f.L(J; K), 132 f.LQ, 22, 35, 37, 46 f.Lk, 23, 36, 91 f.Leven, 106 f.Lodd, 106 vI:, 63 vn,73 1/{}:1, ... ,a r l78 v n ;r(A),79
55, 63, 66 I8>Z,116 'it, 120 Wn,
V, 103
x,
21 8K,4
19
61
58
'ei , 156 Tn, 30 36
Iir,
~k'
90
pn, 83
62 (k, 152
Tn,
d(x, A), 31
Index of symbols ek(tl,t2, ... ,tn ),23 hK, 42, 99 o*K,155 rex), 13 u-L,
56
x V y, 6, 60 x 1\ y, 6, 60
K(Sn),155 Kn,42 K'd, 99 £,112
p(Sn), 154 V,129
Ll,~aJ, [~l ' 63
m,
78
14,69
(al ,n,aJ, 18, 19 IAI,14
sw(K), 160
175
Index
Aff(n), 86 affine Grassmannian, 86 affine hull, 138 affine surface area, 141 Ambartzumian, R. V., 5 antichain, 13 Barbier, E., xi, 5, 141 binomial, 14 bivaluation, 142 Boolean algebra, 11, 13 box, 30 Brunn-Minkowski inequality, 108 Buffon needle problem, xi, 1, 129 Caratheodory's theorem, 52 Cauchy's surface area formula, 56 centrally symmetric, 99 chain, 13 Chen, B., 29, 58 chord, 129 combinatorics, xi, 62, 63, 71, 78, 84, 127 compatible discrete flag, 19 flag of subspaces, 78 cone, 155 continuity, 31, 44 convex hull, 51 convex ring, 42 convex set centered, 99 in Rn, 42 in the plane, 3 random, 147 spherical, 155 symmetric, 107 convex-continuity, 44 cosine transform, 100 Crofton's formula, 124 Crofton, M. W., xi, 3, 5, 141
cross-section, 128 De Giorgi, E., 42
Dec(n,r),77 Dec(S,r),18 Dehn invariant, 115 discrete basis theorem, 23, 144 dissection, 115 Erdos, P., 29 Euclidean motion, 43, 146 Euler characteristic, 29, 58 discrete, 22 on parallelotopes, 37 on polyconvex sets, 46 on spherical polytopes, 156 Euler's formula discrete, 23 for intrinsic volumes, 93 for polytopes, 49 expectation, 1, 112, 126, 128, 129, 149, 152 face enumerator, 23, 29, 143 facet, 21 filter, 143 Fisk, S., 84 flag, 13 flag coefficient, 63 formulas for, 67 maximal, 73 Pascal's triangle, 69 Flag(n),61 frame, 62 Fu, J., 153 fundamental domain, 112 gamma function, 63, 72 Gardner, R., 145 generating set, 8
Index geometric probability, xi, 1, 5, 25, 92, 147
join, 6, 60
Gr(n, k), 61
Katona-Kruskal theorem, 29 kinematic formula discrete, 25 general, 153 principal, 148, 153 spherical, 162 Klain, D., 84, 117 Klee's theorem, 50 Kubota's theorem, 126, 141
Graff(2, 1), 3 Graff(n, k), 86 Graff(A; k), 88 Grassmannian,61 affine, 86 great circle, 154 Groemer's extension theorem, 44 for parallelotopes, 32 in the sphere, 155 Groemer's integral theorem, 8 Groemer, H., 12, 58 group action, 60, 83, 86 Haar measure on Grassmannians, 61 on projective space, 57 on the Euclidean motions, 146 on the orthogonal group, 83 probability, 57, 126 Hadwiger's characterization theorem, 118 for parallelotopes, 40 spherical analogue, 159 Hadwiger's containment theorem, 151 spherical analogue, 164 Hadwiger's formula, 91, 123 Hadwiger, H., xii, 58, 117 Harper, L., 29 Hausdorff distance, 31 Helly's theorem, 52 discrete, 28 for subspaces, 81 hemisphere, 154 Hilbert's third problem, 11, 115 Hirsch, W. M., 29 Hochberg, M., 29 homogeneous space, 83 Howard, R., 153 hyperplane, 30 inclusion--exclusion principle, 7 indicator function, 7, 144 integral, 8 intrinsic volume, 58, 88, 94, 130 computation, 135 discrete, 23 normalization, 111 of a convex polytope, 137 of a convex set, 90 of a parallelotope, 37, 136 of a product, 37, 130 of the unit ball, 122 invariance, 22, 31, 43, 95
L.Y.M. inequality continuous, 73, 79 discrete, 14, 19 lattice, 6, 29 distributive, 6, 21 in Rn, 112 of parallelotopes, 30 of polyconvex sets, 42 of subsets, 13 of subspaces, 60 lattice point, 112 Lie group, 83, 145 linear functional, 10, 46 Lubell, D., 29 lune, 154, 165 Lutwak's containment theorem, 54 Lutwak, E., 145 Mobius function, 96 McMullen, P., 11, 58, 166 mean chord length, 129 mean cross-sectional volume, 128 mean projection formula, 94, 125 mean spherical width, 160 meet, 6, 60 Meshalkin's theorem continuous, 81 discrete, 20 Minkowski linear combination, 140 Minkowski sum, 43, 99 mixed volumes, 140, 141 Mod(n), 60 moment, 133 monotone, 32
Mult(n;al, ... ,ar ),77 multiflag coefficient, 78 multinomial coefficient, 18 needle, xi, 1, 129 normalization theorem, 37, 111
or(V),87 order ideal, 76, 95 orthogonal group, 60, 83, 146 special, 100
177
178
Index
Par(n),30 Par(Vl, ... , v n ), 136 parallelotopes, 30 partially ordered set, 6, 21, 60, 86, 95, 143 Pascal inequality, 71 Pascal's triangle for flag coefficients, 69, 70 for binomial coefficients, 70 permutation group, 22, 24 permutation of coordinates, 30, 36, 40 Polycon( n), 42 polyconvex set, 42 polygon, 2, 117, 150 polynomial type, 142 polytope, 47 spherical, 154 quotient space, 128 r-decomposition discrete, 18 in Mod(n), 77 r-family discrete, 15 in Mod(n), 75 random variable, 1, 3, 25, 112, 126, 128, 129, 149 rank, 13 relative interior, 48 relint(P), 48 Rota, G.-C., 29, 84 s-system, 18, 77 Sah, C.-H., 109, 117 Santal6, L., 153 Schutt, C., 141 Schanuel, S., xii, 29, 140 Schneider's characterization theorem, 108, 117 Schneider, R., 11, 58 scissors congruent, 115 segment, 96 of a Boolean algebra, 13 simple function, 7 simple valuation, 101 simplex, 21, 54, 77, 142 dissection of, 109 random, 25 simplicial complex, 21, 77, 142 special orthogonal group, 100 Sperner's theorem continuous, 74 discrete, 14, 29 for r-families, 15, 75 spherical area theorem, 156 spherical convex set, 155
spherical excess, 158 spherical geometry, 154 spherical length, 156 spherical polytope, 154 stabilizer, 84 star-shaped set, 145, 167 Steiner's formula, 122, 140 support function, 42, 99 support plane, 43 surface area, 56 Sylvester's theorem, 4, 93, 147 Sylvester J.J., 3, 5 symmetric convex set, 99 symmetric function, 23, 35 syzygy ,144 tensor product 116 totally positive 144 translation, 30, 43, 101, 116 translation invariance, 36, 44 valuation, 6, 22, 30, 32, 42, 44 even, 106 homogeneous, 40 increasing, 32 monotone, 32 odd, 106 on the sphere, 155 simple, 101 volume characterization theorem, 105 volume of unit ball, 63 volume theorem for parallelotopes, 38, 136 for polyconvex sets, 110 spherical, 164 Wills functional, 134, 141 wire, 2 Zhang, G., 153 Zhou, J., 153 zonoid,99 zonotope, 99