REGULARITY PROPERTIES OF FUNCTIONAL EQUATIONS IN SEVERAL VARIABLES
Advances in Mathematics VOLUME 8 Series Editor: J. Szep, Budapest University of Economics, Hungary
Advisory Board: S.-N. Chow, Georgia Institute of Technology, U.S.A. G. Erjaee, Shiraz University, Iran W. Fouche, University of South Africa, South Africa
P. Grillet, Tulane University, U.S.A.
H.J. Hoehnke, Institute of Pure Mathematics of the Academy of Sciences, Germany F. Szidarovszky, University of Airzona, U.S.A. P.G. Trotter, University of Tasmania, Australia
P. Zecca, Universitb di Firenze, Italy
REGULARITY PROPERTIES OF FUNCTIONAL EQUATIONS IN SEVERAL VARIABLES
ANTAL JARAI Eotvos Lorand University, Budapest, Hungary
- Springer
Library of Congress Cataloging-in-Publication Data A C.I.P. record for this book is available from the Library of Congress.
ISBN 0-387-24413-1
e-ISBN 0-387-24414-X
Printed on acid-free paper.
O 2005 Springer Science+Business Media, Inc. All rights reserved. This work may not be translated or copied in whole or in part without the written permission of the publisher (Springer Science+Business Media, Inc., 233 Spring Street, New York, NY 10013, USA), except for brief excerpts in connection with reviews or scholarly analysis. Use in connection with any form of information storage and retrieval, electronic adaptation, computer software, or by similar or dissimilar methodology now know or hereafter developed is forbidden. The use in this publication of trade names, trademarks, service marks and similar terms, even if the are not identified as such, is not to be taken as an expression of opinion as to whether or not they are subject to proprietary rights. Printed in the United States of America. 9 8 7 6 5 4 3 2 1
SPIN 11377986
This book is dedicated to JBnos Aczkl, my "mathematical grandfather", the teacher of several of us in the field of functional equations, and to my teacher ZoltAn Dar6czy who introduced me to functional equations.
TABLE OF CONTENTS . . . . . . . . . . . . . . . . . . . . . . . . . . . . ix . . . . . . . . . . . . . . . . . . 1 Chapter I . PRELIMINARIES $1. Introduction . . . . . . . . . . . . . . . . . . . . . . . . 1 $ 2. Notation and terminology . . . . . . . . . . . . . . . . . . 40
Preface
Chapter I1. STEINHAUS TYPE
. . . . . . . . . $ 3. Generalizations of a theorem of Steinhaus . . . . . . . . 5 4. Generalizations of a theorem of Piccard . . . . . . . . . . Chapter I11. BOUNDEDNESS AND CONTINUITY OF SOLUTIONS . . . . . . . . . . . . . $ 5. Measurability and boundedness . . . . . . . $ 6 . Continuity of bounded measurable solutions 5 7. On a problem of Mazur . . . . . . . . . . . . . . . . 5 8. Continuity of measurable solutions . . . . . . . . . . . $ 9. Continuity of solutions having Baire property . . . . . . $10. Almost solutions . . . . . . . . . . . . . . . . . . .
. . . . . . . . . . . . . .
. 73 . 73 . 76 . 81 . 86 . 94
Chapter IV . DIFFERENTIABILITY AND ANALYTICITY . 5 11. Local Lipschitz property of continuous solutions 5 12. Holder continuity of solutions . . . . . . . . $ 13. Solutions of bounded variation . . . . . . . . $ 14. Differentiability . . . . . . . . . . . . . . 5 15. Higher order differentiability . . . . . . . . . 5 16. Analyticity . . . . . . . . . . . . . . . .
. . . . . . .
109 109 128 132 137 141 144
THEOREMS
. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
. . . 53 . . . 53 . . . 66
. . . . . . .
100
Chapter V . REGULARITY THEOREMS O N MANIFOLDS . . . . . . 157 5 17. Local and global results on manifolds . . . . . . . . . . . . 157 Chapter VI . REGULARITY RESULTS
W I T H FEWER VARIABLES
5 18. ~ w i a t a k ' smethod . . . . . . . . . . . § 19. Between measurability and continuity . . . 5 20 . Between Baire property and continuity . . $ 21. Between continuity and differentiability . .
. . . .
. . . .
. . . .
. . . .
Chapter VII . APPLICATIONS . . . . . . . . . . . 5 22 . Simple applications . . . . . . . . . . . . . . . $23. Characterization of the Dirichlet distribution . . . . $24. Characterization of Weierstrass's sigma function . .
. . . .
. . . .
. . . . . . . .
. . . . . . . . . .
. . . . .
169 170 174 204 218
. . . .
. . . .
231 231 275 285
. . . .
viii
Table of contents
References . . . . . . . . . . . . . . . . . . . . . . . . . . Index . . . . . . . . . . . . . . . . . . . . . . . . . . . .
335 347
PREFACE This book is about regularity properties of functional equations. It contains, in a unified fashion, most of the modern results about regularity of non-composite functional equations with several variables. It also contains several applications including very recent ones. I hope that this book makes these results more accessible and easier to use for everyone working with functional equations. This book could not have been written without the stimulating atmosphere of the International Symposium on Functional Equations conference series and thus I am grateful to all colleagues working in this field. This series of conferences was created by JBnos Aczitl. I am especially grateful to him for inviting me to the University of Waterloo, Canada, which provided a peaceful working environment. I started this book in September 1998 during my stay in Waterloo. I thank Mikl6s Laczkovich, the referee of my C.Sc. dissertation very much for his remarks and suggestions. Between 1974 and 1997 I worked at Kossuth Lajos University, Debrecen, Hungary. Naturally, I am grateful to all members of the Debrecen School of functional equations. Finally, I would like to thank those of my colleagues, joint pieces of work with whom in one way or another are contained in this book: JAnos Aczkl, ZoltBn Darbczy, Roman Ger, Gyula Maksa, Zsolt PAles, Wolfgang Sander, and LAsz16 Szitkelyhidi. I thank my sons, Antal JArai Jr. and ZoltAn JBrai for correcting several errors of my "Hunglish"; all the remaining errors are mine. Finally, I wish to express my gratitude to Kluwer Academic Publishers, in particular, Professor Jen6 Sz6p and John C. Martindale for the first rate and patient technical help. Financial support for this book was provided mainly by Szitchenyi Scholarship for Professors, Hungary and partly by OTKA TO31995 grant. Budapest, May 25, 2003. Antal JBrai Department of Computer Algebra Eotvos LorAnd University PBzmAny Pitter sittAny 1/C H-1117 Budapest, Hungary e-mail:
[email protected] http://compalg.inf.elte. hu/-ajarai
Chapter I.
PRELIMINARIES
In this chapter, starting with simple examples, we describe the problems with which we will deal in this book. We also present simple examples of our methods. First we formulate the fundamental problem, then analyse its conditions and explore its applicability. We then formulate theorems that follow from our results as corollaries to that fundamental problem. Then we survey possibilities for generalization. We close this chapter by summarizing our notation and terminology, including the formulation of theorems not readily available in the literature or usually formulated in a different way.
1. INTRODUCTION 1.1. G e n e r a l considerations a n d simple examples. As a first, illustrative example let us consider the best-known functional equation, Cauchy's equation
with unknown function f . In a wider sense differential equations, integral equations, variational problems, etc. are also functional equations, but here we will use this expression in a more restrictive sense for functional equations
2
Chapter I. Preliminaries
without infinitesimal operations such as integration and differentiation. For a more formal definition, see Aczd [3], 0.1. To formulate a functional equation exactly we have to give the set of functions in which we look for solutions. We also have to give the domain of the functional equation. In the above example this is the set of the pairs (x, y) of the variables x and y for which equality has to be satisfied. For example, we may look for all measurable functions f : R -+ R such that (1) is satisfied for all (x, y) E [0, m[ x R. Conditions such as measurability, Baire property, continuity everywhere or in a point, boundedness, differentiability, analyticity, etc. are called regularity conditions. If this kind of conditions are imposed on the solution, then we say that we look for regular solutions. Otherwise, if we look for solutions among all maps from a given set into another given set, then we say that we look for the general solution of the functional equation. Usually, the domain of the functional equation is the set of all tuples of the variables for which both sides are defined. For example, if we say that f : R -+ R is a solution of Cauchy's functional equation, then it is implicitly understood that (1) is satisfied for all (x, y) E R x R. If the domain of the equation is not the largest possible for which both sides are defined, then we speak about an equation with restricted domain; the term conditional equation is also used, especially if the domain of the equation also depends on the solution or solutions. Cauchy's equation is a functional equation with two variables; the variables denoted by x and y in (1). Equations like f (x) = f (-x), f (x) = -f (-x), f (22) = f ( x ) ~or , difference equations are called functional equations in a single variable. The "single variable" may also be a vector variable; it is understood that there are no more variables in the equation than the number of places in the unknown function or the minimal number of places in the unknown functions - if there is more than one. Otherwise we speak about a functional equation in several variables. This distinction is very useful in practice. There is a large difference between functional equations with a single variable and several variables: the methods used in the two cases are quite different. In this book we deal with functional equations in several variables. About equations in a single variable see the books Kuczma [I261 and Kuczma, Choczewski, Ger [128]. The distinction between functional equations in a single variable and in several variables and what we have said about variables, domain, regular and general solutions also apply to systems of functional equations. Further simple examples of functional equations are Cauchy's exponen-
§ 1. Introduction
3
tial equation
Cauchy's power equation
and Cauchy's logarithmic equation
Observe, that solutions f of (2) mapping 10, GO[ into the normed algebra of all bounded linear operators on a Banach space gives operator semigroups. The usefulness of semigroups in the study of evolution equations such as the heat equation or Schrodinger's equation is well known, see for example Hille and Phillips [72]. The overall importance of equations (1)-(4) is due to the fact that they describe homomorphisms. We move toward a general theory of functional equations, and we do not intend to study specific functional equations, except as examples, even if they are very important. It is a well-known phenomenon that one functional equation can determine several unknown functions. This is the situation, for example, for the analogue of Cauchy's functional equation with several unknown functions which is called Pexider's equation:
Indeed, if f 1, f 2 ,f 3 : R -+ R, by putting y = 0 and x = 0 in (5) we may express f 2 and f 3 by f l , respectively. By putting x = 0 and y = 0 simultaneously in (5) and using the resulting relation we obtain that f = f l - f1(0) satisfies Cauchy's equation (1). Hence (5) can be reduced to (1). Similar phenomena occur often when different occurrences of the unknown function f are replaced by f l , f 2 , etc., a process sometimes called "Pexiderization". Jensen 's equation
can be considered as a special case of Pexider's equation, and we obtain that an f : R -+ R function is a solution of (6) if and only if the function f - f (0) satisfies (1).
4
Chapter I. Preliminaries
It is also possible to consider functional inequalities. Functional inequality
related to Cauchy's equation describes subadditive functions and functions satisfying
are the so-called Jensen convex functions. We will use the above simple functional equations as illustrative examples. Their detailed study can be found in the book of Acz6l [3] or in the book of Acz6l and Dhombres [20]. 1.2. Simple examples: smooth solutions. Let us suppose that a solution f : IR + R of Cauchy's equation f (x y) = f (x) f (y)is analytic. Substituting y = x we obtain the equation f (22)= 2f (x)in a single variable x E R. Analyticity is such a strong regularity condition that even this single variable equation has not too many analytic solutions. For the solution we obtain f (x)= co clx
+
+
+
+
in a neighborhood of the origin, and hence that the solution can only be f (x)= cx with an arbitrary constant c = cl. Substitution shows that this is indeed a solution of Cauchy's equation. The case of Cauchy's exponential equation f (x y) = f (x)f (y)is much more interesting. As above, we obtain the single variable equation f (22)= f ( x ) ~and, , if f : R + R is analytic, f (x)= co clx , then
+ + +
3
.
.
Hence co = ci. There are two possibilities. The first is that co = 0, which implies that c, = 0 for each n, and hence f = 0. The second is that co = 1. In this case cl could be arbitrary, and from the equation
$1. Introduction
5
using the notation c = cl, we obtain by induction that cn = cn/n!. Hence all analytic solutions are given by f (x) = C r = o cnxn/n! = exp(cx). The same method gives complex analytic solutions f : C + C,too. Let us observe that this is a nice way to introduce exponential functions (and hence the related functions sin, cos, sinh, and cosh) using only the most important property of exponentiation. Note that this was Cauchy's original motivation to investigate functional equations 1.1.(1)-1.1. (4): he wanted to avoid "circulus vitiosus" by studying power functions; see the historical remarks in the book of Acz6l and Dhombres [20], pp. 365-371. Now let us only suppose that the solution f : R + R is twice differentiable. In the case of Cauchy's equation, f (x y) = f (x) f (y), let us differentiate both sides with respect to y. This "kills" the first term on the right-hand side, and we obtain that f l ( x y) = f '(y) for every x, y E R. Differentiating again, but with respect to x we can "kill" the other term on the right-hand side, too, and we obtain f"(x y) = 0. Substituting y = 0 we have f "(x) = 0, a differential equation. All solutions of this equation have the form f (x) = co + cx, co, c E R. Substituting this into the original functional equation we see that co = 0, and we obtain that twice differentiable solutions are exactly the functions f (x) = cx. This simple example illustrates a general method to get "smooth enough" solutions. The general tactic is to "kill" some terms by applying appropriate differential operators, and to obtain differential equations by appropriate substitutions. Usually, appropriate substitutions or use of certain symmetries of the equation results in a differential equation with lower degree. For example, substituting y = 0 in the equation f l ( x y) = fl(y) we obtain that f l ( x ) = c with c = fl(0), a first order equation. Cauchy's exponential equation, f (x y) = f (x)f (y), similarly yields f '(x y) = f (x)f '(y), and after substituting y = 0 we obtain f '(x) = cf (x), where c = f '(0). Let us observe that in both cases, the general once differentiable solution f : R + R is the same as the general analytic solution.
+
+
+
+
+
+ +
1.3. Simple examples: regularity properties. How to obtain solutions of the above examples, Cauchy's equation and Cauchy's exponential equation under much weaker regularity assumptions? A general way is to prove that weak regularity conditions, say continuity or measurability of solutions implies much stronger regularity conditions, their differentiability or even analyticity. For example, let us observe that in both of the above cases the differential equation obtained for the solutions in the previous point implies directly that the solutions are analytic (see Dieudonnh [49], 10.5.3). If we have a continuous solution f : R + R, then integrating Cauchy's
6
Chapter I. Preliminaries
equation over an interval [a,b] of positive length yields
Substituting a new variable u = x
+ y we obtain
The right-hand side is differentiable, so we obtain that f is differentiable. If we want to deduce that f is twice differentiable, we can apply the same obtained by differentiation with reasoning to the equation f '(x y) = f '(x) respect to x from the original. Higher order differentiability can be obtained analogously. In the case of Cauchy's exponential equation f 0 is one of the continuous solutions f : R + R. If f (yo) # 0, then we can choose a neighborhood [a,b] of yo such that f (y)/f (yo) 2 112 for each y E [a,b]. Integrating we obtain
+
-
and hence that
This implies that f is differentiable. Here, again, applying the same method for the equation fl(x y) = fl(x)f (y) obtained from the original equation by differentiation with respect to x gives that the solutions are twice differentiable, etc. Now, let us consider a measurable solution f : R -+ R of Cauchy's equation. Let [a, b] be an interval with positive length 7. Let xo E R be arbitrary. By Lusin's theorem there exists a compact set C1contained in [xo a , xo b] and having Lebesgue measure greater than 3714 such that f lC1is continuous. If /x- x o /< 7718, then the set C1- x is contained in C = [a - 718, b 7/81. Since the Lebesgue measure of C \ (C1- x) and C \ (C1 - xo)are less than 712, they cannot cover C.Hence the intersection (C1- x) n (C1- xo) is nonvoid. Now, let E > 0 be arbitrary. Since f lCl is uniformly continuous, there exists a S > 0 such that if u,u' E C1then If (u)- f ( .')I < E. Hence, if Ix - xol < min{71/8, S) then for any y E (C1- x) n (CI- xo) we obtain
+
+
+
+
7
'$1.Introduction
i. e., f is continuous at xo. Since x0 was arbitrary, f is continuous everywhere. The same method can be applied to Cauchy's exponential equation after introducing the new variable t = x y instead of x, i. e., to the equation
+
f (4= f (t - df (Y).
+
Note that Cauchy's logarithmic equation f (xy) = f (x) f (y) has no other solution f : R + R than f = 0;this follows by substituting y = 0. In the case of Cauchy's power equation f (xy) = f (x)f (y) there are solutions f : R + R which are measurable but non-continuous, continuous but non-differentiable, etc. Indeed, the functions x e /x1 and x ci\ 1x1 sgn x are solutions for any c E R if OC is understood as 0. 1.4. Hilbert's fifth problem. In his celebrated address to the 1900 International Congress of Mathematicians, in his fifth problem Hilbert ([70] p. 304) asked1
". . . how far Lie's concept of continuous groups of transformations is approachable in our investigations without the assumption of differentiability of the functions" More precisely2:
". . . hence there arises the question whether, through the introduction of suitable new variables and parameters, the group can always be transformed into one whose defining functions are differentiable . . . >,.
Explaining that the group property is connected to a system of functional equations, in the second part of his fifth problem Hilbert goes on as follows3:
". . . inwieweit der Liesche Begriff der kontinuierlichen Transformationsgruppe auch ohne Annahme der Differenzierbarkeit der Funktionen unserer Untersuchung zuganglich st." ". . . es ensteht mithin die Rage, ob nicht etwa durch Einfuhrung geeigneter neuer Verandertlicher und Parameter die Gruppe stets in eine solche iibergefiihrt werden kann, fiir welche die definierenden Funktionen differenzierbar sind, . . . " . " ~ b e r h a uwerden ~t wir auf das weite und nicht uninteressante Feld der Funktionalgleichungen gefuhrt, die bisher meist nur unter der Voraussetzung der Differenzierbarkeit der auftretenden Funktionen untersucht worden sind. Insbesondere die von ABEL (Werke, Bd. 1, S. 1,61, 389) mit so vielem Scharfsinn behandelten Funktionalgleichungen, die Differenzengleichungen und andere in der Literatur vorkommende Gleichungen weisen an sich nichts auf, was zur Forderung der Differenzierbarkeit der auftretenden Funktionen zwingt, und bei gewissen Existenzbeweisen in der Variationsrechnung fie1 mir direkt die Aufgabe zu, aus dem Bestehen einer Differenzengleichung die Differenzierbarkeit der betrachteten Funktionen beweisen zu mussen. In allen diesen Fallen erhebt sich daher die Frage, inwieweit etwa die Aussagen, die wir im Falle der Annahme differenzierbarer Funktionen machen konnen, unter geeigneten Modifikationen ohne diese Voraussetzung giiltig sind."
8
Chapter I. Preliminaries
"Moreover, we are thus led to the wide and interesting field of functional equations which have been heretofore investigated usually only under the assumption of the differentiability of the functions involved. In particular the functional equations treated by Abel (Oeuvres, vol. 1, pp. 1, 61, 389) with so much ingenuity, the difference equations, and other equations occurring in the literature of mathematics, do not directly involve anything which necessitates the requirements of the differentiability of the accompanying functions. In the search for certain existence proofs in the calculus of variations I came directly upon the problem: To prove the differentiability of the function under consideration from the existence of a difference equation. In all these cases, then, the problem arises: In how far are the assertions which we can make in the case of differentiable functions true under proper modifications without this assumption ?" (Hilbert's emphases.) After this Hilbert quotes a result of Minkowski which states that under certain conditions the solutions of the functional inequality
are partially differentiable, and remarks that certain functional equations, for example the system of functional equations
where a, p are given real numbers, may have solutions f which are continuous but non-differentiable, even if the given function g is analytic. In our present-day language, it is customary to formulate the fifth problem of Hilbert as the question whether a locally Euclidean topological group is a Lie group. However, in the second part of his fifth problem, Hilbert draws attention to more general problems which today are called regularity problems. They require to prove that differentiability assumptions for functional equations, differential equations, and other equations can be replaced by much weaker assumptions (possibly with appropriate modifications of the problem). This idea returns in problems nineteen and twenty of Hilbert concerning calculus of variation and partial differential equations. See the book of Zeidler [209],II/A, pp. 86-93. As a general reference about Hilbert's problems, see the book [26] edited by Alexandrov.
§ 1. Introduction
9
Concerning Abel's work on functional equations mentioned by Hilbert see the short note [12] and the paper [14] of Aczkl. These initiated intense research to solve Abel's equations under as weak conditions as possible. Two of these equations will be used here as illustration to our methods among applications (see 22.10 and 24); concerning a third one see the papers of Sablik 11741, 11751, [176]. 1.5. A general scheme t o solve functional equations. The simple examples above, as well as the second part of Hilbert's fifth problem, suggest a general scheme to obtain regular solutions of functional equations: (1) Prove - using regularity theorems - that each solution of the functional equation satisfying some "weak" regularity property - say measurability, the Baire property, etc. - satisfies "strong" regularity conditions for example, local integrability, continuity, differentiability or analyticity.
(2) Obtain the "strongly" regular solutions by deriving another kind of equation from the functional equation (differential equation, integral equation, an equation system for the coefficients of the power series expansion, etc.) or by using special methods of the theory of functional equations. Hundreds of papers illustrate this time-honored practice. In step (2) we use special properties of the functional equations and ad hoc considerations: special substitutions, symmetries of the equations, integral transforms, "killing" some terms by applying appropriate differential operators, etc. This is practiced in several books and papers about functional equations. The reader can find some examples among the applications in the last chapter of this book. It is impossible to refer here to all the enormous literature. We refer the reader to books, survey papers, and research papers. The following books contain introductory material about functional equations in several variables: Aczd [2], [5]; Hille [71]; Kuczma [127]; Saaty [173], section 3. The following are monographs about functional equations in several variables: Acz4l [I], and the revised English edition [3]; Aczd-Dhombres [20]. Special topics are dealt with in Acz6l [lo] (social and behavioral sciences); Aczkl-Dar6czy 1191 (measures of information); Aczkl-Golqb [22] (geometry); Dhombres [45] (conditional Cauchy equations); Eichhorn [51] (economics); Szekelyhidi [I931 (convolution-type functional equations). See moreover the survey papers [8], [14], [16], [120], the survey papers in the volume [15], the volumes of Aequationes Mathematicae, and the bibliography of papers on functional equations in the volumes of Aequationes Mathematicae.
10
Chapter I. Preliminaries
The main subject of this book is to deal with step (1). We will obtain general regularity results for a large class of functional equations, called non-composite functional equations with several variables. Our results give answers to most of the questions of the second part of Hilbert's fifth problem for this class. Here it is also impossible to refer to all the papers dealing with the regularity problem of special functional equations. We refer the reader to the book and survey papers cited above and to the volumes of Aequationes Mathematicae and the bibliography therein. Part of the regularity results dealing with general types of functional equations in several variables will be cited in the introduction of the appropriate sections of this book. Before starting to deal with regularity questions, we give a short overview of and some references about general methods to get "smooth" solutions of functional equations. 1.6. General methods of determining "smooth" solutions of functional equations. In 1.2 we have seen two methods. One of them was to find analytic solutions by obtaining a system of equations for the coefficients from the functional equation. The other one was to reduce the functional equation to a differential equation. This second method, which is used extensively, usually gives the most comfortable way to determine "smooth" solutions. Examples can be found in the book of Aczd [3], see especially 4.2.1, 4.2.4. The deduction of the differential equation usually uses the tactic of "killing some terms" by differentiation, and using special substitutions and special symmetries of the equation. Actually this method goes back to Abel. He wrote (quote from Aczkl [9]):4 "Actually, as many equations can be found by repeated differentiations with respect to the two independent variables as are necessary to eliminate arbitrary functions. In this manner, an equation is obtained which contains only one of these functions and which will generally be a differential equation of some order. Thus it is generally possible to find all the functions by means of a single equation. "In der Tath lassen sich durch wiederholte Differentiationen nach den beiden verandertlichen Grossen, so vie1 Gleichungen finden, als notig sind, um belibige Functionen zu eliminieren, so dass man zu einer Gleichung gelangt, welche nur noch eine dieser Functionen enthalt und welche im Allgemeinen eine Differential-Gleichung von irgend einer Ordnung sein wird. Man kann also im Allgemeinen alle die Functionen vermittelst einer einzigen Gleichung finden. Daraus folgt, dass eine solche Gleihung nur selten moglich sein wird. Denn d a die Form einer beliebigen Function die in der gegebenen Bedingungs-Gleichung vorkommt, vermoge der Gleichung selbst, von der Formen der andern abhangig sein soll, so ist offenbar, dass man im Allgemeinen keine dieser Functionen als gegeben annehmen kann."
5 1. Introduction
11
From this it follows that such an equation can exist only rarely. Indeed, since the form of an arbitrary function appearing in the given conditional equation, by virtue of the equation itself, is to be dependent on the forms of others, it is obvious that, in general, one cannot assume any of these functions to be given." For functional equations with several variables and several unknown functions PAles [163] developed this method into a general algorithm. We will treat his algorithm in detail in 5 16 and we will illustrate it in 22.11. Presently this is the most general method to find smooth solutions. Another important method is to use an integral transform to get a simpler functional equation. This method is illustrated in the book of Aczkl [3], 4.1.1. Szkkelyhidi developed this idea into a very general method. His books [191], [I941 contain several examples, mostly functional equations on commutative groups, where mean periodic functions, Fourier transform and some refinements and spectral synthesis play an important r d e ; see also our joint survey paper [120]. With Szkkelyhidi's methods various special cases of the general functional equation
can be treated, where functions f j , gi, hi are unknown and functions cpj, y ! ~ ~ are given. In general, cpj, $j are homomorphisms, or in some sense structurepreserving mappings of the common domain of the unknown functions. G is a commutative topological group or semigroup. For simplicity we restrict ourselves here to the case where the range of the unknown functions is in C, although more general ranges can be considered as well. Equation (1) can be roughly characterized as a "linear equation with linear arguments", although it is linear only in the sense that linear combinations of solution vectors (fl , . . . , f,) also satisfy a similar equation with another n. Although the methods and results depend heavily on the commutative structure of the underlying groups or semigroups, in some cases they can be applied also in the noncommutative situation. An important special case is n = 0, where regular solutions are continuous (generalized) polynomials, i. e., they can be represented as a finite sum A,+(x,x, . . . ,x) where A,+ : G " C is a k-additive function, i. e., x ++ additive in each variable. The exact characterization of the solutions depends on some algebraic conditions. Other important special cases are the Levi-CivitA type equation (case m = 1, cpl = $1 = id) and the D'Alembert-type equation (case m = 2,
Ck
Chapter I. Preliminaries
12
~1 = $1 = ( ~ = 2 -g2 = id). In these cases regular solutions f l and f l , f2, respectively, are continuous exponential polynomials, i. e., can be represented pkmk, where each p,+ : G -+ C is a (generalized) polynomial as a finite sum and each mk : G -+ C is a multiplicative function. All these special cases are related to systems of convolution-type functional equations f *,u = 0, where p runs in a given set of compactly supported complex measures. The essence of the method for solving such convolutiontype systems is to prove that the continuous solutions form a translation invariant closed linear subspace, and there are "sufficiently many" exponential polynomials in this subspace. The continuous solutions can be constructed from exponential polynomial solutions. All these methods work for general solutions, too, if we consider G with the discrete topology.
The elementary method of SGto is based on the following observation: Let X, Y be sets, F; : X + IF, gi : Y + IF (i = 1 , 2 , .. . ,n ) be mappings into a field. Suppose, that the functional equation
x n
0=
(2)
fi (x)gi(y)
for all x E X , y E Y
i=l
is satisfied. I f f l , f 2 , , . . . , f, are linearly independent and f j = El='=, ci,jfi for j = r 1,.. . ,n with some ci,j E IF, then gi = - Cy=r+l ci,jgj for i = 1 , 2 , . . . ,r . Especially,
+
Indeed,
and since f l , . . . ,f, are linearly independent, their coefficients have to be zero for any y E Y.
-
We remark that the above statement is a generalization of the frequently used trivial observations that if f (x)g(y) 0, then f (a) G 0 or g(y) 0 and that if f (x) = g(y) for all x E X, y E Y, then there exists a constant c such
-
3 1. Introduction
13
that f (x) c and g(y) = c. These correspond to the case n = 1 and n = 2, respectively. Of course, the above statement is a special case of more general theorems. We mention it here explicitly only because several nonlinear or even composite functional equations can be solved by Suto's method, reducing them - by differentiation or by other tricks - to equation (2). See for some recent applications of S ~ t o ' smethod the papers [144-1501 of Lundberg. We will give a simple example of how to apply Suto's method in 22.11. Certain functional equations can be reduced to integral equations. We do not illustrate this method here; see the book of Aczd [3], 4.1.2. Another method, which needs only weak regularity conditions, the socalled "distribution method" was initiated by Fenyo [54]. The main idea is that continuous (or locally integrable) solutions are considered as Schwartz distributions and the equation is viewed as an equation for distributions. Differential equations (in the distribution sense) are obtained using similar tricks as for smooth solutions, but usually with more technical problems. Finally, distribution solutions turn out to be regular distributions belonging to (smooth) functions. See the papers of Baker [31], [32], [33], [34] about this method. The applicability of the distribution method is restricted, because no multiplication among Schwartz distributions is defined. By Schwartz's impossibility theorem, this cannot be done in a satisfying way. It is even more hopeless to substitute distributions into general Cm functions with several variables. The distribution method has to be restricted to functional equations that are not very far from being linear. Sometimes it is possible to find the solution of a functional equation directly, on a dense set. In such cases continuous solutions are determined by solutions on this dense set. For example, for Cauchy's equation, by substitution x = y = 0 we obtain f (0) = 0 and by induction f (nx) = n f (x) for n E N. Substitution y = -x shows that f (-x) = -f (x), hence the above formula holds for all n E Z.Now substituting x l n for x shows that f ( x l n ) = f (%)In,whenever 0 # n E Z.Hence f (rx) = r f (x), whenever r E Q. This shows that f (1) determines the values of f at rational points and by approximating real numbers with rational numbers we obtain that the only continuous solutions are f (x) = cx where c = f (1). For further examples see the book of Aczdl [3], section 2.
A further method is to reduce the functional equation to another functional equation which is already solved. We have seen an example for this in 1.1. In this more or less algebraic way we can usually find the general
Chapter I. Preliminaries
14
solutions of the equation. Usually, regular solutions of the original equation are obtained from regular solutions of the other equation, hence we may determine the regular solutions, too. The reader can find several examples for this kind of reduction of a functional equation to another functional equation in Aczkl [3] and in Aczkl-Dhombres [20]. We note that merely the knowledge of the general solution does not always make it possible to deduce the regular solutions easily. For example, the general solution of Cauchy's equation is the following: let us fix an arbitrary base B of R over Q,a Hamel base. Then every real number x can be uniquely written as finite linear combinaribi of base elements hi with rational coefficients ri. Any function tions f : B -+ R can be uniquely extended to a solution of Cauchy's equation by f (x) = Ci ri f ( h i ) Let us observe that even in this very simple case it is not clear - without regularity results - which solutions are measurable.
xi
We have to mention here the method of Vincze. It is based on the following observation: Suppose that functions filj, i = 1 , 2 , . . . ,m, j = 1 , 2 , . . . ,n m a p an arbitrary set X into a field IF and they satisfy the functional equation
Then for any function f o : X
+ IF
the functional equation
is satisfied, too, where fi,0 = f o for all i = 1 , 2 , . . . , m . The statement can be easily verified by expanding all determinants in (4) with respect to f O ( x k ) k, = 0, I , . . . ,n. By choosing an appropriate f o some terms may become zero and we get a simpler functional equation. For further information on how to apply this method and how to reduce certain functional equations to the form (3) see the paper of Vincze [205]. We conclude this section with some general remarks. For general purposes usually the simplest way is to use general regularity results to prove that the solutions are a few times differentiable, and then to obtain a differential equation for the solutions. We usually cannot completely avoid using regularity results with the other methods either. Methods that use
5 1. Introduction
15
less regularity are usually more complicated and cannot be applied to the most general cases. Despite general methods, we are very far from having algorithmic methods to solve a large part of the functional equations that are of recent interest.
1.7. Composite and non-composite functional equations with several variables. The main topic of the present book is to prove general regularity results. For several functional equations regularity theorems do not hold. Besides functional equations in a single variable - like those mentioned by Hilbert, see 1.4 - there are also several composite functional equations (in which unknown functions are substituted into each other) that have continuous, but non-differentiable solutions. For example, the Acz& Benz functional equation
i(n:*
first studied in [17], has the solution x H 1x1). All continuous solutions of this equation were given by Dar6czy [41]. The functional equation system describing group axioms in the first part of Hilbertls fifth problem also needs special treatment (re-coordinating). Although, for example, in the paper of Acz6l [4] under very general conditions it is proved that the real function solutions f of the functional equation
are monotonic, it seems to us that there is no general regularity theory yet for such equations. (We will summarize some recent results about composite equations in 1.16.) However, for those functional equations with several variables where the unknown functions are not substituted into themselves or into each other, general regularity theorems can be proven. Such theorems constitute the main body of this book. Our purpose is to investigate the "most general" non-composite functional equation with several variables having the form
for all (X, Y) from a set E. Here f , fo, fl l . . . , fn are unknown functions, all other functions are given. All functions are vector-vector functions. Introducing the new variable x = G(X, Y) and solving the above equation for
16
Chapter I. Preliminaries
f (x) we usually get a somewhat simpler form, which much better suits our purposes. In several cases introducing another new variable y = Go(X,Y ) , we can get the form
where in one of the unknown functions only the variable y appears. Since for our purposes this "explicit" equation is the most convenient one, we will use this form in general and we do not deal with the conditions under which the above transformations can be (at least locally) performed. In this equation it seems to be unnecessary to include fa - and hence also to introduce the new variable y = Go(X,Y) -, but f o has a distinguished r61e among the unknown functions on the right-hand side. As a problem of Sander [I821 shows (see 22.2), it is often enough to suppose much weaker conditions (or none at all) for f than for the other unknown functions. For the given functions we mainly have to prescribe smoothness conditions: simple examples show that these conditions cannot be omitted. Of course, we may suppose, that the right-hand side of the equation really depends on y, and so the equation is not an equation in a single variable. This is certainly agi the case if we suppose that the rank of the matrices - is maximal. We 89 usually suppose that the set D is an open set. A general problem of regularity is to investigate if Lebesgue measurable solutions or solutions having the Baire property are analytic. Most of our regularity theorems state that, if the given functions are "sufficiently nice" and the unknown functions f l , f P ,. . . , fn have a regularity property, then the function f has a regularity property which is "one better". To obtain an affirmative answer to the regularity problem, the following steps can be used: (I)
measurability implies continuity;
(11)
solutions having the Baire property are continuous;
(111) continuous solutions are almost everywhere differentiable; (IV) almost everywhere differentiable solutions are continuously differentiable;
+ 1 times con-
(V)
all p times continuously differentiable solutions are p tinuously differentiable;
(VI)
infinitely many times differentiable solutions are analytic.
5 1. Introduction
17
If f = f o = f l = . . . = f n r then we obtain step by step that the measurable solutions, or solutions having the Baire property, are analytic. If this is not the case, then expressing other unknown functions from the equation might help to draw a similar conclusion. Let us remark that the regularity problem of composite and non-composite functional equations overlap. Indeed, calling functions "unknown" and "known" in regularity theorems is done only to distinguish those functions for which very weak regularity assumptions (say measurability) are supposed from those for which much stronger assumptions (say continuous differentiability) are supposed. Hence weakening regularity assumptions on "known" functions yields results much more useful for composite equations. To better understand why the above steps (I)-(VI) are chosen, what the main difficulties are, and what kind of conditions we have to impose on the given functions, we first show some basic ideas. 1.8. The classical method. The following general method goes back to Andrade (1900) and Kac (1937) and is well known among functional equationists. It is described (in a somewhat less general form) in the book of Aczdl [3], 4.2.2, 4.2.3. It can be applied to functional equations of the form
c n
h(x, ~ )(x) f = ~ o ( xY),
+
hi (x1Y, fi ( ~ i ( xY,) ) ) ( 2 1 Y) t D; i=l where f , f l , . . . , fn are the unknown functions defined on some open sets X, X I , . . . ,Xn c R and taking values in some open sets Z, Z1,. . . , Zn c R. Let us suppose that Y C R is an open set, D C X x Y is also open, h, ho : a h aho ahi ahi D 4 R and hi : D x Zi + R are continuous, and --, - - - fi, a x a x ' dx ' du ' i = 1 , 2 , . . . ,n are also continuous. Let us suppose that the functions are twice continuously differentiable. Let xo E X and suppose that there exists agi a yo E Y such that (x0,yo) E D , ~ ( X Oyo) , # 0, and -(xo,yo) # 0 for ay i = 1 , 2 , . . . ,n. Then f is continuously differentiable in a neighborhood of 20. To prove this, let us observe that for some open bounded interval Xo and for an interval [a,b] c Y we have that Xo x [a, b] C D, (xO,yo) E Xo x ]a, b[, and J~~ h(x, y) dy # 0 whenever x E Xo. Integrating both sides of equation (1) with respect to y over [a, b] we obtain
(1)
ii
18
Chapter I. Preliminaries
Using the notation gi,,(y) = gi(x, y) let us introduce the new variable u = i = 1 , 2 , . . . ,n separately in each of the integrals on the right-hand side. For this we substitute Xo with a smaller neighborhood of xo and [a,b] with a smaller neighborhood of yo, if necessary. Then we obtain gi,%(y),
From this equation it follows that f is continuously differentiable. Indeed, a parametric integral of the type
is continuously differentiable if 2,Y c R are bounded open sets, 3 : x x [a, b] + ? and ?2. : x x ? + R are continuous functions, and the partial derivatives and and let us denote
2
are continuous, too. To see this, let yl E 8 ( x , y) =
? be fixed,
l:
L(x, u) du.
Then H is continuously differentiable and hence H (x, g(x, a ) ) is also continuously differentiable.
f (x)
= H (x, g(x, b)) -
ah
Let us observe that we only need the continuity of and z. Therefore, h(x, y) can have the form h* (x, y, fo(y)) with continuously differentiable h* but merely continuous (maybe unknown) f o which can even be vector valued. hi (x, y) f j (y) with m < 0 and with In particular, we may take h(x, y) = (possibly unknown) continuous functions f j , m 5 j < 0 and continuously differentiable functions hi. Moreover, let us observe that equations like
may be reduced to (1) introducing new variables x = G(Z, Q ) , y = Go(2,Q ) .
§ 1. Introduction
19
The generalization of this method to complex or even vector-valued unknown functions is immediate, but the generalization to vector-vector unknown functions is non-trivial. We can integrate over a simplex S with nonvoid interior, and we obtain the parametric integral
Of course, the differentiability of such parametric integrals is classic, if 3 and ?L are smooth, but we only know that 3 is continuously differentiable and h together with its partial derivative with respect to x is continuous. We investigate these questions and the generalization of the above classic method to vector-vector functions in 5 11. 1.9. Higher order differentiability. The above classical method can be used to prove higher order differentiability, if we use the fact that a parametric integral
I"(x) =
+
with smooth 3 is p 1 times differentiable if h is p times continuously differentiable and its p t h derivative is continuously partially differentiable with respect to x. An even better trick, which works also for the more general nonlinear functional equation 1.7.(1) is to take the p t h derivative of both sides of the equation. What we get is of type (1) from the previous point, and so can be treated by the above mentioned generalization of the classical method. This trick will be applied in 5 15.
1.10. Continuity of solutions. The classical method (and its generalization too) can also be applied to derive continuity of solutions from their local integrability. Hence, if we can obtain local boundedness of solutions from their measurability, then we can obtain continuity, too. This was the usual way to obtain continuity from measurability. For example, by Cauchy's equation f (x y) = f (x) f (y), if a solution f : R + R is measurable on a set K having positive Lebesgue measure, then it is automatically bounded on a smaller set C still having positive measure, because the measure of the sets K, = {x E K : (x)l 5 n ) goes to the measure of K as n + m. Now by a classical theorem of Steinhaus, the set C - C contains a neighborhood of the origin. Function f is bounded on this neighborhood because if x E C - C , then x can be represented as x = z - y , z , y E C, i. e., there exists a y E C such that x y E C , too. But hence
+
+
If
+
20
Chapter I. Preliminaries
i. e., f is bounded on a neighborhood of the origin. Now by integrating over [-E, E] we obtain
This implies that f is continuous near the origin, and by applying (1) again we get that f is differentiable near the origin. But by Cauchy's equation, fixing any y we see that f around y is merely a constant plus a translate of f around the origin. Hence if a solution is continuous, differentiable, analytic, etc. near the origin, then the same holds everywhere. This way to get all measurable solutions f : R 4 R also works for Cauchy's exponential equation. Today this method is much less important as was earlier, and we use it only if the measures in question are not Radon measures. Some application of this method for invariant expansions of the Haar measure and for covariant expansions of the Lebesgue measure will be given in $5 5, 6. There is a much more appropriate method, which works for Radon measures, specially for the Lebesgue measure. It is based on using Lusin's theorem: an idea which seems to have first appeared in McKiernan [I591 and in Baker [29]. We have seen an example of this method in 1.3. The main idea works also for the general case
i. e., for equation 1.7.(2): Let us choose some compact set C with positive measure. Choosing a "large" compact subset Kiof gi(xo,C) on which fi is continuous, if we are able to prove that the set
is nonvoid whenever x is close enough to xo, then by substituting this y into the equation we obtain that f (x) is close to f ( x 0 ) We will use this method to obtain continuity directly from measurability: see § 8. An important observation here is that it is enough if the functional equation is satisfied almost everywhere (see $ 10). The importance of this observation lies in that if the solutions are differentiable almost everywhere, then the derivative (which is automatically a Bore1 function, see 14.1) satisfies also a functional equation, and utilizing the above observation we obtain that the derivative is continuous. This idea, namely that "differentiability
§ 1. Introduction
21
almost everywhere implies continuous differentiability" is treated in 5 14. Let us observe that this means that from having locally bounded variation (for real variable solutions) or from having local Lipschitz continuity (for vector variable solutions) we can conclude continuous differentiability. An interesting, but out-of-the-main-stream question is treated in 5 7. Problem 24 of Mazur in the Scottish Book [I571 asks whether additive functional~on a Banach space that are Lebesgue measurable along any curve are continuous. This is some kind of "measurability implies continuity" problem. We will briefly discuss it, proving that being Lebesgue measurable along any curve is equivalent to being universally measurable. 1.11. Steinhaus type theorems. As we have seen in the previous point, the theorem of Steinhaus [I891 is a useful tool to treat measurable solutions of functional equations. Another version asserts that, for any Lebesgue measurable sets Al, A2 C IR with positive Lebesgue measure, Al A2 has an interior point. This theorem allows various generalizations and modifications; IR may be replaced by other topological measure spaces and the addition may be replaced by a two variable function. A large part of these generalizations are based on Weil's idea [206] that the convolution
+
is continuous. From this, by setting fi to the characteristic function of Ai the theorem of Steinhaus is easily obtained. Let us observe that the p measure of set 1.10.(1) in the previous point is
where fi is the characteristic function of Ki.This shows that what we need to prove "measurability implies continuity" type theorems is some kind of a generalized Steinhaus theorem, and the two topics are strongly connected. In 5 3 we first generalize Weil's theorem proving the continuity of a
Using this result, we generalize the theorem of Steinhaus for the case of continuously differentiable functions of more than two variables. Several previously known results follow as special cases. This generalization will be used implicitly in $5 5, 8, 10, and 19 and explicitly in 5 6 and in the application given in 5 23.
22
Chapter I. Preliminaries
1.12. Baire category. There is a strong analogy between measure and Baire category: the book of Oxtoby [I611 is a nice introduction. Based on this, we may expect that most of the measure theoretical results have the category analogue. The category analogue of Steinhaus' theorem is the theorem of Piccard, stating that if C C R is of second category and has the Baire property, then C - C contains a neighborhood of the origin. This has similar generalizations as the theorem of Steinhaus: see $ 4. "Baire property implies continuity" type results are analogous to ''measurability implies continuity" type results; they are treated in $ 9.
1. l 3 . Differentiability of continuous solutions. No doubt, the hardest step is to prove differentiability of solutions from continuity. As we have seen in 1.10, it is enough to derive differentiability almost everywhere; this is implied if we prove local Lipschitz property. The generalization of the classical method 1.8 for vector-vector functions is useful but far from being satisfying. We will investigate some properties between continuity and local Lipschitz property: Holder continuity (5 12) and essential bounded variation (5 13). Recently with some additional compactness conditions the author obtained a theorem, which is strong enough to use in most of the practical cases: see 11.6. The first application of this new result was the solution of the equation of "the duplication of the cube" 22.15. But I think the best illustration of the power of this result is that the recent results of the "Habilitationsschrift" of M. Bonk about the characterization of Weierstrass's sigma function by functional equation (a problem which goes back to Abel) can be obtained, even in a more general setting; see $24. 1.14. Analyticity. The final step could be to prove that infinitely many times differentiable solutions are analytic. In this step there are only weak results; see $ 16. We will consider there a result from JBrai [95]. Often it is possible to derive &om the functional equation a differential equation. In such cases, regularity theory of differential equations can be used. See Dieudonni! [49], 10.5.3 for ordinary differential equations and the references in Zeidler [209], II/A, pp. 86-93 and the books [73], [74] of Hormander for partial differential equations. Such type of method is used in the paper of Lawruk and ~ w i a t a k[I381 to prove analyticity of the solutions of a generalized mean value type equation. PBles [I631 also used differential equations to deduce analyticity on a "large" set; see the details in $ 16. 1.15. Other regularity properties. As we have seen above, steps (I)(VI) from 1.7 give a natural "stairway" to climb up from measurability or Baire property to analyticity. Of course, several other regularity properties could be imagined. Let us briefly mention some of them.
§ 1. Introduction
23
Often used for simpler cases, but not so useful for more general equations is to suppose that the solution of the functional equation satisfies some property at a point or on a neighborhood of a point. The most popular condition is that there exists a point, where the solution is continuous. As we mentioned, by Cauchy's equation f (x y) = f (x) f (y) fixing a y we see that the solution around y is merely a constant plus a translate of the solution around the origin. Hence by Cauchy's equation local continuity implies continuity everywhere. (The same is true for several other properties.) Recently Matkowski [I541 generalized this consideration to the Cauchy-type equation f (&,Y)) = h ( x , y , f ( 4 , f(Y))
+
+
with unknown function f . But in the case of other equations it seems to be not so useful to suppose continuity at a point. As an illustrative example, we call the attention of the reader to the fact that those solutions f : ]0,1[ + IR of the fundamental equation of information
that are continuous at a point, are not so easy to obtain. See the papers of Diderrich [46], [47], [48], and Maksa [152]. Compare this with 22.8 to see how easily measurable solutions are obtained by applying our general results. (The detailed study of this equation can be found in the book of Aczd and Dar6czy [19]. A very good survey paper on newer results about functional equations and information measures is Aczkl [Ill.) Another popular condition is that the solution is bounded, bounded from above or from below, or locally bounded. For example, if f : R + R is a solution of Cauchy's functional equation which is bounded from, say, above on a neighborhood of a point xo, then f (x) = x f (1)for all x E R. Indeed, let us observe that because of f ( r x sy) = r f (x) sf (y) whenever x, y E R and r, s E 0, the graph of f is a linear subspace of IR2 over Q.If f (x) # x f (1) for some x E R, then (1,f (1)) and (x, f (x)) are (over R) linearly independent elements of the graph, and hence the graph of f is dense in R2. Even much less than local boundedness is enough to prove that for solutions f : IR -+ R of Cauchy's equation we have f (x) = xf (1) everywhere. If we suppose only that f is bounded (say, from above by K ) on a measurable set C with positive Lebesgue measure (a condition equivalent to the existence of a measurable majorant function of f on C), then
+
+
Chapter I. Preliminaries
24
+
whenever x, y E C , i. e., f is bounded from above by 2K on the set C C. But by Steinhaus' theorem (1.10) this set contains a nonvoid open set. Again, such boundedness or local boundedness conditions work well as regularity assumptions for simpler equations but are not so useful for more complicated equations. As an example let us mention again the above fundamental equation (1) of information. Locally bounded solutions (and hence solutions continuous at some point) were obtained by Diderrich, [46], [47]. See also the papers of Diderrich [48] and of Maksa [152]. There was a longstanding open question whether all nonnegative solutions of this equation are continuous. Finally, Dar6czy and Maksa [44] gave a counterexample. But as a much simpler example let us observe that among f : R -+ C solutions of Cauchy's exponential equation there are bounded but non-continuous functions: any function f : R -+ C defined by f (x) = e i f l ( " ) where f l : R i R is a solution of Cauchy's functional equation, is a solution of Cauchy's exponential equation, and has absolute value 1 everywhere. Furthermore we remark that even for very nice functional equations a complex solution is non-differentiable as a complex function, although it is even analytic as a function of two real variables. A trivial example is Cauchy's functional equation. The most general continuous solution f : C i C is f (2) = cz dz with complex constants c, d. It is not differentiable as a complex function if d # 0. (However, there are some exceptions, see Hille [71], pp. 20-22, and Bonk's regularity theorem, 24.2.) The above examples support our view that it is better to omit studying continuous at a point, locally bounded or globally bounded solutions from our general considerations, and not try to obtain results stating complex differentiability.
+
1.16. Regularity of composite equations. Of course, several composite equations have been studied under weak regularity conditions. A triumph in this field has already been mentioned in 1.4: the solution by M. Sablik [174], [175], [I761 of Abel's functional equation
with three unknown real functions under continuity assumptions. Of course, it is not possible to list here all results concerning solutions of composite equations under mild conditions. As we explained in 1.7, presently it is not clear for which general type of composite equations we may expect general regularity results. Nevertheless, for real function solutions, besides the general result of Acz6l mentioned in 1.7, recently Pdes [I651 suggested the following scheme to find solutions:
§ 1. Introduction
25
(I) Use monotonicity and the functional equation to prove Jensen convexity of the internal functions; (11) Berstein-Doetsch theorem shows that Jensen convexity implies convexity, and hence differentiability of the internal functions at all but countably many points; (1II)Lebesgue theorem on almost everywhere differentiability of monotone functions yields differentiability of the external functions everywhere, and this can be used to prove continuous differentiability of the internal functions; (IV) Differentiating the equation, the composite parts can be eliminated and a non-composite functional equation for the derivatives of the internal functions can be obtained. In his survey papers [165], [166] PBles gives some recent examples of composite equations solved by this scheme. He also formulated there several open problems. Here we restrict ourselves to cite briefly some examples - only to give some impression which kind of composite equations can be solved presently. For the rest of this point, all functions are unknown. The equations
and
are related. Continuous and strictly monotone decreasing solutions of (1) were found by Aczkl, Maksa, and PBles [23] and all strictly monotonic solutions of (2) by PBles [164]. The equation
was solved without any assumptions by Aczkl, Ger, and JBrai in [21]. See 22.18 for further details. The strictly monotonic solutions of the more general equation
26
Chapter I. Preliminaries
and the related equation
were found by Aczd, Maksa, Ng, and PBles [25]. A further related equation is
Strictly monotonic solutions were found by Aczd, Maksa, and PBles in [24]. In the paper of Maksa, Marley, and PBles [I531 some strictly monotonic solutions of the functional equation
and the related equation
are found. A further related equation is
All strictly monotonic solutions of this equation are determined in PBles[164]. For the more general equation
it is proved by Gilanyi and PBles [59] that if the functions are strictly monotonic (all pi, Gi, and gi in the same way), then the functions pi and gi are differentiable, and other functions are strictly concave or strictly convex, moreover, f is continuously differentiable.
5 1. Introduction
27
Several problems concerning functional means leads to interesting composite equations. See the summary of some recent results about such type of questions in PBles [165]. Here we consider as an example only the MatkowskiSfito equation
All C2 solutions were found by Matkowski [155]. Quite recently Dar6czy and PBles found all strictly monotonic solutions of (11).
1.17. A general problem. The methods treated above suggest that equation 1.7.(2), the "most general non-composite explicit equation'' may be the central object of our investigations. If we suppose that f = f o = f l = f 2 = . . . = f,, then we can use steps (I)-(VI) of the "bootstrap" method of 1.7. Here there is no need to include the term fo(y) = f (y); if we omit it we do not restrict generality. The condition that the right-hand side really depends on y will be ensured by supposing that the rank of the matrices agi - is maximal. In this way we obtain the following fundamental regularity dy problem of non-composite functional equations: 1.18. P r o b l e m . Let X, Y, and Z be open subsets of R', RS, and Rt, respectively, and let D be an open subset of X x Y and let W be an open subset of D x Zn. Let f : X + Z, gi : D + X (i = 1 , 2 , . . . ,n ) , and h : W + Z be functions. Suppose that
(2) h is analytic;
(3) gi is analytic and for each x E X there exists a y for which (x, y) E D 8s. y) has rank r (i = 1 , 2 , .. . , n). and -(x,
dy Is it true that every f , which is measurable or has the Baire property is analytic?
1.19. R e m a r k : variants of t h e problem. It is possible to substitute the word "analytic" by 'Y!03"in the above problem. Clearly, this problem is even more important because most often regular solutions of functional
Chapter I. Preliminaries
28
equations are obtained by reduction to differential equations. More generally, we can investigate the "CP-problem", where 1 5 p 5 cc or p = w, where CWmeans the class of analytic functions (w > oo). Since it is not a strong restriction to suppose that the given functions are smoother than we expect the solution to be, it can be useful to investigate the "CP-Cqproblem" : We suppose that the given functions are in CP, and ask whether the solution f is in Cq? Here 0 5 q 5 w, 1 5 p 5 w, and q 5 p. For example, in some theorems we will assume that the functions gi are twice continuously differentiable to obtain that f is continuously differentiable, or to prove that f is continuous we will assume that the functions gi are continuously differentiable. Of course, more refined C P ? ~ - C Q > Pquestions can be also considered, where CP@ means the class of those CP-functions whose p-th derivative is locally Holder continuous with exponent a. 1.20. Remarks. (1) The Coo-version of the above general regularity problem 1.18 was formulated by the author (JArai [MI) in a somewhat weaker form, for the functional equation
The analytic version of the problem was included by J. Aczd in his survey talk on the XXI. International Symposium on Functional Equations among the most important open problems on functional equations (see Aczd [8], p. 256). The practice of the last fifteen years shows that the restriction X = Y introduced implicitly by the term f (y) does not help. Hence we will investigate here the above stronger form 1.18. (2) Earlier versions of this problem were formulated under the supposition that h is defined on the set D x Zn. Moreover, several theorems were proved for such type of h. Recent methods make it possible to treat the above stronger version, and in some applications the stronger version proved to be useful. Note that for those "local" theorems where all the functions are known to be continuous, the weaker version can be applied locally to obtain the stronger one, hence they are equivalent. (3) If instead of supposing that for each x E X there exists a y E Y such dgi that (x, y) E D and that -(x, y) has rank r for 1 5 i 5 n, we suppose that dy dgi the rank of -, 1 5 i 5 n is r for everywhere on D , and for each x E X
8~
there exists a y E Y such that (x, y) E D, we get an equivalent problem. Indeed, D could be replaced by the open subset of D l where the rank of each 8gi - is r .
8~
§ 1. Introduction
29
1.21. The above problem is well formed. We will prove by simple counterexamples that none of the conditions of the problem above can be omitted without introducing new conditions. This supports our calling this problem "the fundamental regularity problem of non-composite functional equations with several variables". On the other hand, these counterexamples show us the limits of possible changes in the conditions. The implicit equation
shows that no regularity is satisfied in general for implicit equations; every function f : R + R is a solution for which f (x) = 0 if x @ ]1,2[. Indeed, if x, y E ] 1,2[ then x y E ]2,4[, hence the product is zero. Some of these functions are measurable, but non-continuous, continuous but non-differentiable, etc. Another example, connected to the equation characterizing Weierstrass's sigma function can be found in Remark 24.3. Cauchy's power equation
+
shows that if the equation is implicit in the sense that substituting new variables we cannot convert it to the form f (x) = . . . , then also no regularity is satisfied in general. Indeed, here all functions f : R -+ R, f (x) = /xiCand f (x) = lxjCsgnx are solutions; c E R and OC understood to be 0. Some of these functions are measurable but non-continuous, etc. That the domain D of the functional equation in the above problem has to be open seems to be a technicality: otherwise it would be harder to speak about the differentiability of the inner functions gi. That it is not possible to omit this condition, even if the functions gi are very nice, is shown by the example of Cauchy's exponential equation. Substituting a new variable we obtain f b ) = f ( x - y ) f (Y). On any domain D containing the origin (0,O) and the open set D o = { (x, y) : 0 < y < x ) but contained in the closure of D o , the function f (0) = 1, f (x) = 0 for x > 0 is a measurable but non-continuous solution. If the outer function h in the above problem is not in CP, then the solution need not be in CP, 0 5 p 5 w . This is proved by the following simple example: Let H : R + R be a one-to-one Bore1 function, f : R -+ R,and let us consider Cauchy's functional equation for H o f , i. e.,
30
Chapter I. Preliminaries
This equation is equivalent to the equation
where h is defined by h(zl, z2) = H-' (H(z2) - H ( z l ) ) whenever zl, z2 E R Hence f is a measurable [continuous, etc.] solution of (1) if and only if f is a measurable [continuous, etc.] solution of (2). But i f f is measurable, then H 0 f is measurable too, hence with some constant c we have H ( f (x)) = cx, i. e., f (x) = H-'(cx) for all x E R. If H-I is measurable but non-continuous [continuous but non-differentiable, etc.] then we see that (2) has a measurable [continuous, etc.] solution, which is non-continuous [non-differentiable, etc.]. A very similar example shows that if at least one of the inner functions gi in the problem above is not in CP, 0 5 p 5 W , then the solution f need not be in CP. Let G : R -+ R be a one-to-one Bore1 function with the property that if A C R is a set having Lebesgue measure zero, then G-I (A) also has Lebesgue measure zero. Let f : R + R be a function and let us consider Cauchy's functional equation for f o G, i. e.,
Substituting x = G ( X ) , y = G(Y), we obtain that this equation is equivalent to the functional equation
+
where g is defined by g(x, y) = G (G-'(2) G-l(y)) whenever x, y E R. Hence f is a measurable [continuous, etc.] solution of (3) if and only if f is a measurable [continuous, etc.] solution of (4). But if f is measurable, then f o G is measurable too, hence with some constant c we have f ( G ( X ) ) = cX, i. e., f (x) = cG-'(x) for all x E R. If G-I is measurable but non-continuous [continuous but non-differentiable, etc.] then we see that (4) has a measurable [continuous, etc.] solution, which is non-continuous [non-differentiable, etc.]. Let us observe that in the special case G ( X ) = x3 we obtain that f (x) = cx1I3. In this case in equation (4) we have g(x, y) = (x1I3 y1/3)3. The function g is continuous on Rx R and infinitely many times differentiable at each point (x, y), x # 0 # y. Even the function y ++ g(0, y) = y is differentiable with a non-vanishing derivative. But for x = 0 there is no such y E R for which g is differentiable at (x, y). This shows that we cannot omit the differentiability of the inner function g. This example also shows that we cannot omit the condition that for each x there is a y such that (x, y) E D.
+
§ 1. Introduction
31
8% has to be equal to r , is shown by the dY simple example obtained from Cauchy's power equation by introducing new variables: f (4= f (y)f ( x ~ Y ) , x E R, 0 # Y E R. Finally, that the rank of
-
&l Here, for x = 0 there is no y, for which (0, y)
# 0,
where g(x, y) =
xly. Among real-valued solutions are f (x) = /xiCand f (x) = IxlCsgn x, c E R,where OC is understood to be 0. Some of these are measurable but non-continuous, continuous but non-differentiable, etc. The same equation yields a counterexample in higher dimensions, too, if x, y E RT, none of the coordinates of y is zero, and the division x l y is understood coordinatewise. Here among the solutions are the functions f (x) = fi(xi) where each function fi has the form fi(xi) = lxilCi or fi(xi) = Ixiici sgn xi. dgi has to be equal to r is Sincov's Another example that the rank of --89
nl=l
The general solution f : R x R -+ R of this equation is f (xl, x2) = g(x2)g ( x l ) , where g : R + R is any function. Indeed, any such function f is a solution. To see that an arbitrary solution f can be represented in this form, let us define g(x) = -f (x, c), where c E IR is an arbitrary constant. 1.22. Transfer principle: a simple example. There are several tricks that make it possible to use our results in cases where seemingly they cannot be used directly: see the chapter containing applications. There is one very important "trick" we have to mention here explicitly. Seemingly, equation 1.18.(1) is rather special in the following respect: it contains only one unknown function. But this restriction is not so strong as it seems to be. If we have a general non-composite functional equation with several variables and several unknown functions, and we can express each unknown function from it, then (after writing different variables in each equation) we may consider a vector-valued function having the different unknown functions as coordinates and use results concerning equations with only one, but vector-valued unknown function. This clearly shows how important it is to discuss vector-valued unknown functions with vector variables. Before we formulate this "transfer principle" in full generality, we illustrate it on the simple example of Pexider's equation
32
Chapter I. Preliminaries
From this equation we obtain the following three:
where with the notation u = ( u l ,u2, u 3 ) , v = ( v l ,~ 2 , 2 1 3 )the function h : R6 + R3 is defined by h ( u ,v ) = (u2 vs, vl - us, ul - v 2 ) . Let us observe that equation ( 1 ) satisfies all the conditions of problem 1.18. This example seems to be rather special, and it is not clear how to ~ defined by generalize it. So instead we use the function h : ( R ~-+ )R3
+
h ( ~ u27 l , u3,u4, u5r ~ 6 =) (u1,2+ u2,3,u3,l - u4,3,U 5 , l - u6,2) where ui = ( u ~U Q,, ui,3) E R3 for i = 1,2,3,4,5,6 and obtain the functional equation
where g is an arbitrary continuously differentiable function mapping R into R and having a non-vanishing derivative everywhere. Equation ( 2 ) also satisfies all the conditions of problem 1 .l8. We remark that the condition that all unknown functions have to be expressible from the original equation cannot be omitted. The following simple examples show this. By equation the functions f , f o , where f ( x ) = cx and f o is any function which is zero outside ]1,2[gives a solution. By the equation
f (4= f ( x + Y ) - f ( Y ) + f O ( e x ~ 3-) f o ( e X y 2 ) f 0 ( ~ ) , f , f o : R + the functions f , f o , where f ( x )= cx and f o ( x ) = 1x1C0 for some co E $ where O C O understood to be zero, yields a solution. Below we formulate the general transfer principle, which shows that the regularity problem of the general non-composite functional equation 1.7.(1), if it implies an explicit functional equation having the type 1.7.(2) for all the unknown functions, can be reduced to problem 1.18.
§ 1. Introduction
33
1.23. General transfer principle. Let Xi c R r i , Yi C PSSi , Zi C R t i , and Di C Xi x Y , be open sets, let ni be positive integers, f i : Xi -+ Zi be functions (i = 1,2, . . . , m ) . Suppose, that 1 5 ki,j 5 m is an integer and gi,j : Di -+ X k i I jis a function for 1 5 i 5 m and for 1 5 j 5 ni. Let moreover W i be an open subset o f Di x Zki,, x . . x ZkiIniand let hi : W i -+ Zi be a function for 1 5 i 5 m . Suppose, that for 1 5 i 5 m we have
and
whenever ( x i ,y i ) E Di. (In these functional equations the functions fi are considered unknown, other functions are considered known.) Then for the function defined by f (x) = ( f l ( x l ) ., . . , f m ( x m ) ) where x = ( X I ,... , x m ) and mapping the open set X = XI x X p x . . x X , C IWT , r = rl + - + rm into the open set Z = Z1 x Z2 x x Z , there exist integers n and s and there exist open sets Y c Rs, D C X x Y , W C D x Z n and functions g j : D -+ X ( 1 5 j 5 n ) , h : W -+ Z such that
and the functional equation
is satisfied whenever ( x ,y ) E D. The functions h and g j may be defined in a way that the following statements also hold true (Cw understood to be the class of analytic functions) : (1) i f each hi, i = 1 , .. . ,m , is in C P;
CP
for some 0 5 p 5 w, then h is also in
(2) i f each g i , j , i = 1, . . . ,m, j = 1, . . . ,ni is in C P for some 0 5 p 5 W , then g j , j = 1,2,. . . ,m are also in CP. (3) i f the functions gi,j are in C 1 and for each 1 5 i 5 m and for each xi E Xi dg, there exists a yi such that ( x i ,y i ) E Di and rank A ( x i ,yi) = rki,j for 15 j
5 ni, then for each x agj
and rank - ( x , y ) dy
E
= r for 1
X there exists a y
5 j 5 n.
a ~ i E Y such
that ( x ,y ) E D
Chapter I. Preliminaries
34
(4) if there are compact sets Ci C Xi given with nonvoid interior and yi can be chosen so that gi,j(xi,yi)E Cki,j for 1 5 i < m and 1 5 j 5 ni, then y can be chosen so that gj(x, y) E C for 1 < j 5 n , where C = C1 x C2 x . . . x C,.
<
Proof. First of all, let us choose numbers s:, 1 i 5 m such that si s{ 2 r j is satisfied for each 1 j m. Let Y,' = RS;. For each 1 5 i , k 5 m , let us choose an arbitrary analytic submersion g:,k of Yi x Y,' into the interior of Ck. (Remember that a differentiable map from an open set of some Euclidean space into another Euclidean space is called submersion, if the rank of its derivative is equal to the dimension of the range space everywhere.) For each 1 5 i 5 m and 1 5 j ni let us choose a permutation Pi,j mapping {1,2, . . . ,m ) onto itself, such that pi,j (ki,j) = i . Let us define gi,j,k by gi,j,k(xi,~i,y:)= gi,j(xi,yi) if (xi,yi,y;) E Di x Y,' and = If k f ki,j1 then let gi,j,k(x,i,j(k),~ p i . j ( k ) c = g~i,j(k),k(y~i,j(a);y~i,j(k)) whenever (xpi,,( k ) , YPi,j ( k ) E Dpi,j( k ) x Yii,,(k). With this definition, gi,j,k maps Dpi,j( k ) into X k . Finally, let us define gj, , where j' = n l nz . . ni-1 j so that gjt (x, y) is defined on the set D of those pairs (x, y) for which (xk, yk, y i ) E Dk x YL for k = 1 , 2 , .. . , m where x = (xl, x2,. . . , x,), y = (fh,U2, . . . ,Urn), and yk = (yk,yi) for k = 1 , 2 , . . . , m; the function gjt (x, y) is defined so that it has the coordinates gi,i,k(xpi,,( k ) , yPi,,(k), k = 1 , 2 , . . . ,m. Let us observe that by this definition,
+
< <
<
YL~,~(~))
+
+ +
+
Finally, with this notation, let W be the set of those points (x, y, zl , . . . ,z, ) for which (xi, yi, z ; , ~., . . ,x;,,~) E Wi for 1 i 5 m where is equal to the kilj-th coordinate of ~ ~ ~ + , . , + ,E~Z1 - ~x + j x Zm for 1 j ni; let the function h be defined such that its coordinates at the point (x,y, zl, . . . ,2,) are equal to hi(xi, pi, zll1, . . . ,ziIni),I 5 i 5 m.
<
< <
1.24. Summary of results about the main problem. The complete answer to problem 1.18 above is unknown. We may divide the problem into parts corresponding to steps (I)-(VI) in 1.7. The problems corresponding to steps (I), (11), (IV), and (V) are solved, and step (111) is solved under a weak additional compactness condition, which is usually satisfied by applications. Concerning step (VI) we have results only under rather strong conditions. We will treat steps (I), (11), (111), (IV), (V), and (VI) in $5 8, 9, 11, 14, 15, and 16, respectively. Other related regularity results for problem 1.18 can be found in 55 5, 6, 7, 10, 12, 13. Here we sum up some of the consequences of the more general results of these sections for problem 1.18.
§ 1. Introduction
35
1.25. Theorem. With the notation ofproblem 1.18, i f h is continuous and the functions gi are continuously differentiable then every solution f , which is Lebesgue measurable or has Baire property, is continuous.
Proof. This is a consequence o f Corollary 8.7 and Corollary 9.5 if we apply them locally. 1.26. Theorem. With the notation ofproblem 1.18, i f the functions h and gi are p times continuously differentiable, then every almost everywhere differentiablesolution f is p times continuously differentiable ( 1 5 p 5 m ) .
Proof. Follows from Theorems 14.2 and 15.2.
1.27. Theorem. With the notation of Problem 1.18, i f the functions h and gi arep times continuously differentiable,then every solution f which has locally bounded variation is p times continuously differentiable ( 1 5 p 5 CQ). Proof. From Theorem 1.25 it follows that f is continuous. Theorem 13.2 implies that f is locally Lipschitz. From Theorem 1.26 it follows that f is p times differentiable. 1.28. Theorem. With the notation of Problem 1.18, i f the functions h and gi are max(2,p) times continuously differentiable and there exists a compact subset C of X such that for each x E X there exists a y E Y satisfying gi ( x ,y) E C besides other conditions in 1.1 8. (3), then every solution f , which is Lebesgue measurable or has the Baire property, is p times continuously differentiable ( 1 5 p 5 m ) . Let us observe that the additional compactness condition in this theorem is retained by the transfer principle.
Proof. Follows from Theorems 1.25, 1.26, and 11.6.
1.29. Theorem. W i t h the notation o f Problem 1.18, i f t = 1, n = 2, g1 ( x ,y) = y, and the function h is p times and the function gz is max(2,p) times continuously differentiable, then every solution f , which is Lebesgue measurable or has the Baire property, is p times continuously differentiable (1 5 P 5 CQ). Proof. The statement is a consequence of Theorems 1.25, 11.4, and 1.26. 1.30. Theorem. With the notation o f Problem 1.18, i f equation 1.18.(1) has the following special form:
36
Chapter I. Preliminaries
where the functions hi : D x Z -+ Rt are p times, the functions gi are max(2,p) times continuously differentiable, then every solution f , which is Lebesgue measurable or has the Baire property, is p times continuously differentiable ( I 5 p 5 m).
Proof. This theorem summarizes Theorems 1.25, 11.3, and 1.26. 1.31. Theorem.
t
With the notation of Problem 1.18, suppose that
= 1, D = ]a, b[x]a, b[, the equation has the special form
with ci E R (i = 1 , 2 , . . . ,n ) , the functions
are given, and the following conditions are satisfied: (2) gi(x, y) is between x and y whenever x, y €]a, b[; (3) gi is analytic and with some constant 0 < A < 1 we have
whenever x, y €]a, b[ and p = 1 , 2 , . . . ;
-
(4) for all x €]a, b[ and i = 1,2, . . . ,n the mapping y t+ gi (x, y) of ]a, b[ into ]a, b[ is strictly monotonic, and the function gi, for which the mapgi(x, y), is twice ping z ++ gi(x, z) is the inverse of the mapping y continuously differentiable on its domain. Then every solution f which is Lebesgue measurable or has the property of Baire is analytic.
Proof. This theorem summarizes Theorems 1.30 and 16.2. 1.32. Regularity on manifolds. For functional equations arising from geometry or from physics sometimes the natural domain of definition of the functional equation is a manifold. Most of the results of 55 5-15 are general enough to apply to strictly nonlinear equations, vector-vector solutions, and have a local flavor. Hence we may expect that most of the results can be generalized to manifolds. For local results this does not cause any problem but global results need a new proof. In § 17 we will treat the generalization to manifolds. It is useful to reformulate the main regularity problem of noncomposite functional equations with several variables for manifolds:
§ 1. Introduction
37
1.33. Problem. Let X , Y, and Z be analytic manifolds, let D be an open subset of X x Y and let W be an open subset of D x Zn.Let f : X + Z , gi : D + X (i = 1 , 2 , .. . ,n), and h : W -+ Z be functions. Suppose that
(2) h is analytic;
(3) gi is analytic and for each xo E X there exists a yo for which (xo,yo) E D and y c, gi (xo,y) is a submersion at yo for i = 1 , 2 , . . . ,n. Is it true that every f , which is measurable or has the Baire property is analytic? We remark that here as in 1.19 it is possible to formulate the CP problem or the CP-Cq problem, etc., which include that the manifolds X , Y, and Z , too, are supposed only to be CP manifolds. 1.34. Summary of results on manifolds. The complete answer to the problem above is unknown. Just as problem 1.18, this problem can also be divided into parts (I)-(VI). The complete solution of the problems corresponding to (I), (11), (IV), and (V) easily follows from results of 55 8, 9, 14, and 15. We solve (111) for compact X ; in the noncompact case we will obtain results under some additional conditions. Our results can be summarized in the following theorems: 1.35. Theorem.
With the notation of Problem 1.33 if X , Y, and
Z are Cm manifolds, h is continuous and the functions gi are continuously differentiable, then every solution f which is measurable or has the Baire property is continuous.
Proof. It follows from Theorems 17.1 and 17.2 if we apply them locally. 1.36. Theorem. With the notation of Problem 1.33, if X , Y, and Z are Cm manifolds and the functions h and gi are p times continuously differentiable, then every locally Lipschitz solution f is p times continuously differentiable (1 5 p 5 m).
Proof. It follows from Theorems 17.6 and 17.7
38
Chapter I. Preliminaries
1.37. T h e o r e m . With the notation of Problem 1.33, if X , Y, and Z are C" manifolds, the functions h and gi are max(2,p) times continuously differentiable and there exists a compact subset C of X such that for each xo E X there exists a yo E Y, for which in addition to condition (3) of the problem above gi(xo,yo)E C (i = 1 , . . . , n ) then every solution f which is measurable or has the Baire property, is p times continuously differentiable (1 l p l 00).
Proof. It follows from Theorems 1.35, 1.36, and 17.3. 1.38. T h e o r e m . With the notation of the Problem 1.33, if X, Y, and
Z are Cw manifolds, X is compact, moreover h and gi are Cm functions, then every solution f which is measurable or has the Baire property is in C". Proof. It follows from the previous theorem. 1.39. T h e o r e m . With the notation of Problem 1.33, if gl (x, y) = y, n = 2, X , Y, and Z are C" manifolds and dim(Z) = 1, the function h is p, and the function g2 is max(2,p) times continuously differentiable, then every solution f which is Lebesgue measurable or has the Baire property, is p times continuously differentiable (1 p 5 m ) .
<
Proof. This is a consequence of Theorems 1.35, 17.5, and 1.36. 1.40. T h e o r e m . With the notation of Problem 1.33, if X and Y are C" manifolds, Z is an open subset of Rt, and the functional equation has the following special form:
where the functions hi : D x Z + Rt are p times continuously differentiable and the functions gi are max(2, p) times continuously differentiable, then every solution f which is Lebesgue measurable or has the Baire property, is p times continuously differentiable (1 5 p 5 m ) . Proof. This theorem summarizes Theorems 1.35, 17.4, and 1.36. 1.41. Regularity results w i t h fewer variables. Roughly speaking, all the results of § § 5-17 prove regularity of an r place function f which is the solution of a functional equation only if there are at least 2r variables in the functional equation. The reason for this, using notation of Problem d g i is equal to r, the dimension 1.18, is that the condition that the rank of -
8~
§ 1. Introduction
39
of X , implies that s, the dimension of Y cannot be less than the dimension of X . The last two examples of 1.21 show that the condition about the dgi rank of - cannot be simply omitted. Nevertheless, that, at least in some dy cases, the number of variables can be reduced down to r 1 is shown by some regularity results of ~ w i a t a k . (The case of r variables in a functional equation for an r place function would already mean a functional equation in a "single variable" .) ~ w i a t a khas proved general regularity results having the type "continuous solutions are C"". She obtained her results using distributions. The essence of her method is to prove that solutions in the distribution sense satisfy a linear partial differential equation having only infinitely many times differentiable solutions. This idea was used by ~ w i a t a kto prove general regularity results for the functional equation
+
where f is the only unknown function. Roughly speaking, she applies a partial differential operator d; in y to the equation in the distribution sense. Of course, the nonlinear term on the right-hand side disappears. If there exists a yo such that gi(x, yo) x for 1 5 i 5 n (a very strong assumption), then after substituting this fixed yo we obtain a linear partial differential equation P ( d , x) f = H. If the partial differential operator P ( d ,x) is of constant strength and hypoelliptic at an x = xo, then by the regularity theory of partial differential equations all distribution solutions correspond to a Cm function. We will briefly discuss the results of ~ w i a t a kin 5 18. Generalizing our methods we may hope to obtain regularity results for general nonlinear functional equations. This seems to be impossible using the method of ~ w i a t a kbased on Schwartz distributions. We can avoid the very strong assumption that there is a yo such that gi(x, yo) = x for 1 5 i 5 n. The somewhat artificial condition of hypoellipticity also has to disappear. We prove "measurability implies continuity" type results for nonlinear functional equations with "few" variables in § 19. Well-known analogies between measurability and Baire property (see Oxtoby [161]) suggest trying to prove analogous results for Baire property. Differences, such as the lack of the %-technique", the lack of convergence in measure and theorems connected with it (for example Riesz' theorem), the lack of Hausdorff measure, etc., justify a separate discussion. "Baire property implies continuity" type results for general nonlinear functional equations with "few" variables will be
=
40
Chapter I. Preliminaries
proved in 5 20. Finally, in § 21, we will prove "continuity implies continuous differentiability" and "p times differentiability implies p 1 times differentiability" type results for functional equations with "few" variables. In the latter case, our theorems are applicable to general nonlinear equations. In the former case, they presently are applicable only to linear equations, but without several other strong assumptions. We will also give some examples and investigate the connection with the results of ~ w i a t a k .
+
2. NOTATION A N D TERMINOLOGY 2.1. Sets and functions. The cardinal number of a set A will be denoted by card(A). If card(A) 5 No, we will call A countable. The cardinal number continuum, i. e., 2'0 will be denoted by c. We will denote by N,23, Q, R, C, and T the set of nonnegative integers, integers, rational, real, complex, and unit length complex numbers, respectively. I f f is a function, dmn f and rng f denote the domain and the range of f , respectively. The multiplicity function Nf is defined by Nf(y) equal to the number of elements of the set f -'(y) (possibly m). We will also use the notation { f = y) = f -l(y) and analogously {f > y), {f E A), {f = g), etc. All normed spaces are supposed to be real; the norm is denoted by I 1. The sign I/ (1 used only for the operator norm of a linear operator. I f f : D + Y is a function mapping an open subset of a normed space into a normed space, then f ' will denote the derivative of f . If D c X I x Xg x x X,, we will use the partial sets
The partial functions fxi : DXi + Y are defined by
whenever ( x l , . . . ,x,) E D. Also Dxil,... and fxil,... larly. Now, if Xi and Y are normed spaces and
are defined simi-
is an open subset of Xi, we define the partial derivative denoted by
5 2.
Notation and terminology
as the derivative of f,, ,,,,,,i-l,,i+ dex, let
41
(if it exists). If a E W is a multiin-
,,, n
If D C Rn and Y = Rm, then f ' may be identified with the matrix
where f = (fj)y=l and x = If y = f ( x ) , 1 5 i l , . . . ,i, 1 5 jl , . . . , j, 5 m, then we denote the determinant of the matrix
5 n,
a ( ~ j , r . . i. ~ j , ) d(xil1. . - xi,) ' Other notions of calculus are used in the usual way. In case of doubt the reader is referred to Rudin [172]. 2.2. Topology. We will use the terminology and notation of Bourbaki [38] concerning topology and topological groups. Hence, every regular, completely regular, normal, compact or locally compact space is supposed to be Hausdorff. For any class E of subsets of a set X , let E, and Isdenote the class of all countable unions and countable intersections of members of El respectively; the empty union is the empty set and the empty intersection is X. The topology of a topological space, i. e., the class of all open sets, will be denoted by 5;, and the class of closed sets will be denoted by F. So the classes G6, F,,, GsU = (6s)alFU6 = etc. are defined. A path in X is a continuous function mapping some closed interval of R into the topological space X. For a topological space X the weight of X is the least cardinal number of an open basis. The character of a topological space X is the least cardinal number n for which there is an open basis at x with cardinal number not greater than n whenever x E X. If x, y are points of a metric space and a > 0, we say that x and y are a-near if their distance dist(x, y) is less than a. Similarly, if x and y are points of a uniform space and a is a relation from
Chapter I. Preliminaries
42
the uniformity we say that x and y are a-near if (x, y) E a. In a metric space the closed ball having radius r 2 0 and center x will be denoted by I& ( x ) . Concerning topological dimensions dim, ind, and Ind we refer the reader to the classic book of Hurewicz and Wallman [76]. All results concerning topological groups used here can be found in the monograph [66], [67] of Hewitt and Ross.
2.3. Continuity modulus, Holder a n d Lipschitz functions. For a function f mapping a subset of a metric space into a metric space we define the continuity modulus wf of f by
<
whenever 0 5 r m. If the domain of f is not empty then always wf ( 0 ) = 0. It may happen that (even for every positive r ) we have w f ( r )= m. Clearly wf is continuous (from the right) at 0 if and only i f f is uniformly continuous. If for some exponent 0 < a 5 1 and constant 0 5 Ha < m we have w f ( r )5 H , . r f f whenever 0 5 r < oo, then we say that f is Holder continuous with exponent a and Holder constant Ha. If a = 1, the Holder continuous functions and Holder constants are called Lipschitz functions and Lipschitz constants. We will use the local notion of Holder continuity, Holder constants, Lipschitz function, and Lipschitz constant. In connection with Lipschitz and locally Lipschitz functions see the book of Federer [53]. 2.4. M e a s u r e theory. Concerning measure theory, we follow the terminology of Federer [53]. Hence a measure means a countably subadditive extended real-valued nonnegative function defined on all subsets of a set; this is called outer measure in other terminology. The a-algebra of measurable sets is defined by the Carath6odory condition: a set A C X is called p measurable if p(T n A ) ,u(T \ A ) = p ( T ) for every T C X . The measure p is called finite if p ( X ) < m. If p ( X ) = 1, then we say that p is a probability measure. If a set A C X can be represented as a countable union of measurable sets having finite measure, then it is called a-finite. If X itself is a-finite, then the measure p is called a-finite. A measure is called diffuse if any set of only one element has measure 0. If Y C X, the restriction of p to 2Y is a measure on Y denoted by py. Let p [Y defined by ( p[ Y )( A ) = p(Y n A) for a11 A C X . Every function f : X -+ Y induces a map f# of measures over X into the class of measures over Y defined by ( f # p )( B ) = p (f - ' ( B ) ) for B C Y. If a nonnegative set function v is defined on an arbitrary class 3-1 of subsets of a set X , then
+
p ( T ) = inf{CiEI v ( H i ) : I is countable, Hi E 3-1 if i E I}
5 2.
Notation and terminology
43
for each T c X defines a measure p on X . (Empty sum is understood to be zero.) The set function p is an extension of the set function v if and only if v is countably subadditive. It is not hard to prove that a set A C X becomes p measurable if and only if
whenever H E 31. Specially, if 31 is a a-algebra and v is countably additive on 31 then p is an extension of v and all H E 31 is p measurable. If p is a measure on X we say that the measurable set B c X is a p hull ofAifACBCXand
p (A n T) = p ( B n T)
for every p measurable set T.
A measure p over X is called regular if for each set A C X there exists a p measurable set B such that A c B and p (A) = p ( B ) . Let p be a measure on X . A base of the measure p is a class B of p measurable sets such that for any p measurable set A and for any E > 0 there exists a B E B such that p(A A B ) < E. The weight of the measure p is the least cardinal number of a basis of p. The character of p is the least cardinal number n for which p y has a base with cardinal number not greater than n whenever Y is p measurable with finite p measure. We remark that if p is a finite measure, then the weight of p and the Hilbert space dimension of the corresponding L2 space are equal, whenever one of them is infinite. (See Hewitt and Ross [66], (16.12).) If p and v are measures over X we say that the measure p is an expansion of the measure v if any v measurable set A is also p measurable and p(A) = v(A). If p is a measure on a group X , we say that p is left [right] invariant, if for each z E X and A c X we have p(xA) = p(A) [p(Az) = ,u(A)]. An will denote the Lebesgue measure on Rn, and X h i l l denote the kdimensional Hausdorff measure on a metric space. The Lebesgue density of a set A c Rn at a point x E Rn is defined as lim,40 An (I$.(x) n A) / A n (I++ (x)) if the limit exists. Let Y, be a measure on X and let Y be a topological space. A function is called p measurable on a set A c X if its domain contains almost all of A, its range is in a topological space Y, and if A f? f -'(v)is p measurable whenever V is an open subset of Y. Especially, A has to be measurable. We refer the reader to Federer [53] concerning measure theoretical results used here.
Chapter I. Preliminaries
44
2.5. Topological measures. Our terminology concerning topological measures is somewhat different from that of Federer's book [53]. We call a measure p over a topological space X topological measure if all open subsets of X are p measurable. Of course, if p is a topological measure on X, then Borel sets of X - elements of the smallest a-algebra containing all open sets - are measurable. The support of the topological measure p is defined by sptp = x
\ U{V
: V
cX
open, p(V) = 0 ) .
A topological measure is called Borel regular if for each A C X there exists a Borel set B such that A C B C X and p(A) = p ( B ) . By a Radon measure we mean a topological measure p defined on a Hausdorff space X, with the following properties: (1) The p measure of any compact subset K of X is finite;
(2) for every open subset V,
(3) if A is any subset of X , then p (A) = inf { p (v): A
c V,
V open).
Clearly, a Radon measure is uniquely defined by its values on compact sets. Let p be a Radon measure over X and A C X . The set A is p measurable if and only if P(V n A) \ A) 5 P(V)
+
for any open set V. Indeed, if this condition is satisfied, then for any T C X we have
whenever T c V and V is an open set. Taking infimum for all V containing T , we obtain that A is measurable. Similarly, A is p measurable if and only if P ( K n A) + P ( K \ A) i p ( K ) for any compact set K . Indeed, let V be an open set for which p(V) < cc, and let K, be compact subsets of V for which p ( V \ K,) < l l n . Then p ((vn A) \ (K, r l A)) p(V \ K,) < l l n , and hence p(K, r l A) 4 p(V rl A)
<
5 2. as n -+ m. Similarly, p(K, is satisfied, then
Notation and terminology
\ A) -+ p(V \ A).
45
Hence, if the above condition
p(V) = lim p(K,) = lim p(K,nA)+ lim p ( K n \ A ) = p ( V n A ) + p ( V \ A ) . n+m
n+m
n-+m
Finally, A is p measurable if and only if A n K is p measurable for all compact subsets K . Indeed, this condition implies that
for any compact set. It is not hard to prove that if p is a Radon measure, then p(A) = s u p { p ( ~ :) K
c A, K compact)
whenever A is a p measurable set with finite p measure. 2.6. Lusin measurable functions. Let p be a Radon measure on a Hausdorff space X and let Y be a topological space. Let f be a function mapping almost all of a subset E of X into Y. The function f is called a Lusin p measurable function on E, if for each measurable subset A of E having finite measure and for each E > 0 there is a compact subset C of A such that p ( A \ C ) < E and f lC is continuous. In this setting Lusin's theorem reads as follows (see moreover Federer 1531, 2.3.6 for further weakening of the conditions) :
2.7. Lusin's Theorem. Let p be a Radon measure on a Hausdorff space X , let E be a p measurable subset of X and let Y be a topological space. A function f mapping a subset of X into Y which is Lusin p measurable on E , is p measurable on E. Conversely, i f f is p measurable on E and the values o f f are for almost all of E in a subspace of Y whose topology has a countable base, then f is Lusin p measurable on E .
Proof. We have to prove that if V is an open subset of Y, then E n f -'(v) is measurable. It is enough to prove that ~ f lf -'(v) ~ n is measurable whenever K is a compact subset of X . The set K n E can be written as the union of a null set N and a sequence of compact sets C, such that f lC, is continuous. Now Cn n f -'(v) is relatively open in C,, hence a Bore1 set. This implies the statement. In the proof of the converse we follow the proof of Oxtoby [161], 8.2. Let V, be a countable base of the given subspace of Y, let A be a measurable subset of E and E > 0. By approximating A from the inside with a compact
46
Chapter I. Preliminaries
set, we may suppose that A is compact. Let us choose for each n an open set U, and a compact set C, such that Cn C A n f (v,) c Un and p(Un \ C,) < ~ 1 2 , . If C = A \ Up?l(Un \ Cn), then C is compact, p(A \ C ) < E and f-'(v,) n C = U, ri C is relatively open in C . This implies that f lC is continuous.
-'
2.8. Haar measure. On a locally compact group G, there exists a left [right] invariant Radon measure X for which the measure o f any nonvoid open set is nonzero. A n y such measure is called a left [right] Haar measure. For any two such measures A, A' there is a constant 0 < c < cc such that A' = cX. Moreover, there exists a unique continuous homomorphism A o f G into the multiplicative group o f the positive real numbers (the so-called modular function o f G) such that i f X is a left Haar measure, x E G and L is X measurable, then Lx and L-' are also X measurable
and
X(L-') =
A(x-')dX(z).
On a locally compact group left Haar measure is usually denoted by A.
2.9. Baire category. The most important facts concerning Baire category can be found in Bourbaki [38];see chapter IX, § 5, and the corresponding exercises, but here we will use the different (and more usual) terminology of Oxtoby [161]. For clarity we summarize the notions and facts we will use. We will say that a subset A of a topological space X is of first category, if A can be represented as a countable union of nowhere dense sets (i.e., sets having closure with empty interior), otherwise A is of second category. It is easy to see that if X is a topological space, Y is a subset of X , A is a subset of Y, and A is nowhere dense [is of first category] as a subset of the subspace Y, then A is nowhere dense [is of first category] as a subset of X , too. Conversely, if Y is an open subset, and A is nowhere dense [is of first category] as a subset of X, then A is nowhere dense [is of first category] as a subset of Y, too. A topological space X is called Baire space if the intersection of any countable class of dense open sets is dense. The theorem of Baire states that locally compact spaces and complete metric spaces are Baire spaces. We note that the topology of a subspace of a complete metric space can be derived from a metric of the subspace for which it is complete if and only if the subspace is a Gs subset of the space.
5 2.
Notation and terminology
47
Let E be a subset of the topological space X. We say that E is of second category at a point x E X if E n V is of second category for every neighborhood V of x. Let D ( E ) denote the set of all points x E X for which E is of second category at x. Then D ( E ) = 0 if and only if E has first category. Moreover D ( E ) is closed and the set E \ D ( E ) is of first category. We will say that E C X has the Baire property if there exists an open set V such that the symmetric difference E A V is of first category. All subsets of X having Baire property form a a-algebra. Of course this a-algebra contains Borel sets, the members of the smallest a-algebra containing all open sets. With the notation of the previous paragraph, a set E C X has the Baire property if and only if E \ D ( E ) is of first category. The set E has Baire property if and only if each point x of X has an open neighborhood U such that U Ti E has the Baire property in X. Let X and Y be topological spaces and suppose that the topology of one of them has a countable base. A subset E x F of X x Y has the Baire property if and only if one of the sets E, F is of first category or both of them have Baire property. Combining these facts with the proof in Oxtoby [161], chapter 15, we get the following form of a well-known theorem of Kuratowsky and Ulam: 2.10. Kuratowsky-Ulam T h e o r e m . Let X and Y be topological spaces, and suppose that Y has a countable base. Let E be a subset of X x Y having the Baire property. Then except for a set of points x of X which is of first category, the set Ex has the Baire property. Moreover E is of first category if and only if the set Ex is of first category in Y with the exception of a set of x 's of first category.
2.11. B a i r e p r o p e r t y of functions. The function f has the Baire property on E if the domain of f contains E except for a set of first category, the range of f is in a topological space Y and E n f -'(v)has the Baire property in X for every open subset V of Y. We simply say that f has the Baire property, if it has the Baire property on X. This definition is very similar to the definition of a Borel function. A function f mapping some subset of a topological space X into another topological space Y is called a Borel function, if for each open subset V of Y the set f - ' ( ~ ) is a Borel subset of X. The properties of functions having the Baire property are very similar to the properties of measurable functions. We will use the following statements.
If f is any function defined on a subset of X having the Baire property, then all subsets B of Y for which f - ' ( B ) has the Baire property form a
48
Chapter I. Preliminaries
a-algebra. Hence i f f has values in the topological space Y and has the Baire property on a subset of the topological space X , moreover i f g is a Borel function on a subset o f the topological space Y , then g o f has the Baire property on its domain.
ni
Suppose that Y = Y, is a countable product o f spaces each having a countable base o f topology. A function f mapping a subset o f a topological space X into Y has the Baire property i f and only i f all functions pi o f have the Baire property where pi is the natural projection o f Y onto Y i . Suppose that X is a topological space, Y a metric space, and the functions f , f n ( n = 1 , 2 , . . . ) are defined on X except for a set o f first category (depending on the function), have values in Y , and have the Baire property. Then the set E = { X : fn(x) + f ( x ) ) has t h e Baire property. Indeed, the sets
have the Baire property and E differs from the set
only in a set of first category.
Suppose that X is a topological space, Y is a metric space, the functions f , f n ( n = 1 , 2 , . . . ) are mappings from X into Y , the functions f n have the Baire property for all n, and f n ( x ) + f ( x ) for all x E X except for a set o f first category. Then f has the Baire property, too. Indeed, if V is an open subset of Y and V, is the open subset of Y containing those points of Y having distance larger than l / i from Y \ V , then f -'(v)differs from the set
only in a set of first category. We say that a function mapping some subset E of a topological space X into another topological space Y has the Lusin-Baire property on E , if there
5 2.
Notation and terminology
49
exists a set F of first category such that f I E \ F is continuous. The following theorem is analogous to Lusin's theorem: 2.12. Theorem. Suppose that X and Y are topological spaces and E is a subset o f X which has the Baire property. I f a function f mapping some subset o f X into Y has the Lusin-Baire property on E , then f has the Baire property on E. Conversely, i f f has the Baire property on E and the values o f f for all points o f E except a set o f first category are in a subspace o f Y whose topology has a countable base, then f has the Lusin-Baire property on E .
Proof. Suppose that F is a set of first category and let V be any open subset of Y. Since f 1 E \ F is continuous, (E \ F) n f (v) has the form U n E \ F for some open subset U of X, hence it has the Baire property. Since F n f-'(V) is of first category, it also has the Baire property. Hence E Ti f -l(V) also has the Baire property. Conversely, let V, be a countable base of the topology of the given subspace of Y. Since E n f -'(Vn) has the Baire property, it can be written A are of first category. Now let in the form U, \ F, U FA, where F, and F F = Up?=, (F, U FA). Then ( E \ F) f l f -'(Vn) = (E \ F) n U,, hence it is relatively open in E \ F . Since V, is a basis, this means that f lE \ F is continuous. From the Kuratowski-Ulam theorem the following analogue of Fubini's theorem follows directly: 2.13. Theorem. I f X , Y, and Z are topological spaces, Y and Z have countable bases, f : X x Y + Z has the Baire property, then except for a set o f points x which is o f first category, the function f , has the Baire property. 2.14. Bounded variation. Besides the elementary notion of bounded variation function of a real variable we will use the notion of bounded variation for functions having vector variables. Following the book of Giusti [60], if X c Rn is an open set, we will say that the function f : X + R has bounded variation if it is integrable and
f divg:g
E
J C ' ( X ; R ~and ) lgl 5 1
is finite; here K1(X;R n ) denotes the class of all continuously differentiable functions having compact support mapping X into Rn, and / / is the norm on R n . If f is continuously differentiable, then the expression above is equal to
Chapter I. Preliminaries
50
Sx
I f ' l . We will use the same symbol to denote the above supremum for arbitrary functions having bounded variation. An equivalent condition to having bounded variation would be that the gradient of f in the distributional sense is representable by integration with respect to a vector-valued finite Radon measure. Jx 1 f ' 1 is the total variation of this measure. Strictly speaking, this notion is not a generalization of the usual notion of a one variable function having bounded variation on an interval. But rather it is a generalization of the notion of a function having bounded essential variation. This connection and the basic properties of functions having bounded variation can be found in Giusti [60]. We will use results from this book only. Further results can be found in Federer [53], 4.5.9, 4.5.10. We say that a vector-valued function with values in a Euclidean space has bounded variation if all its coordinates have bounded variation. We will say that f has locally bounded variation on X if each point of X has a neighborhood such that the restriction of f to this neighborhood has bounded variation. 2.15. Manifolds. In general we will use the terminology of Dieudonnk [49] about (Cm or analytic) manifolds. Hence, a manifold is always understood to be finite dimensional, separable, and metrizable. We use the notion of a Banach manifold as defined by Zeidler [209], book IV, chapter 73. Concerning Lebesgue measures, Lebesgue measurability, null sets on manifolds we follow the terminology of Dieudonnk [49],see 16.22.2. A Lebesgue measure p on a pure n-dimensional Cm manifold X is characterized by the property that if cp is a chart, then a set A C dmn(p) is p measurable if and only if cp(A) is Lebesgue measurable in Rn; moreover for the chart cp there exists a positive real valued Cm function f , defined on rng(cp) such that p (A) =
J
f , (x) dXn (x),
whenever A C dmn(p) is measurable.
4.4)
Although the notion of a Lipschitz function cannot be transferred to manifolds, this is possible for the notion of a locally Lipschitz function: a function f : X 4 Y between manifolds is called a locally Lipschitz function if for each x E X there is a chart cp on a neighborhood of x and there is a chart $ on a neighborhood of f (x) such that $ o f o p-I is a Lipschitz function on a neighborhood of cp(x). Then for any other charts @ and defined on a neighborhood of x and f (x), respectively, it is also true that of o is a Lipschitz function on a neighborhood of @(x).It is not hard to prove that if X and Y are connected, then choosing arbitrary Riemannian structures on X and Y compatible with the manifold structure, the function f is a locally Lipschitz function by the definition above if and only if it is
4,
4
§ 2. Notation and terminology
51
a locally Lipschitz function with respect to the distance resulting from the Riemannian structures. 2.16. Distributions. We will use the standard notation of the book of Hormander [74] about distributions. The class of test functions and distributions will be denoted by D and Dl (or more precisely D(Rr) and D1(Rr), etc., if necessary), respectively. All results we will use about distributions can be found in Hormander's book [74], chapters I-VI. We remind the reader that if f 6 D'(Rr) and h : Rr + C is a Cm function, then h f 6 Dl is defined by ( hf ) ( p ) = f (hp) whenever p E D. Moreover, if g : Rr + IWT is a diffeomorphism, then the distribution f o g is defined by , ~ ( g - l )denotes the absolute value (f 0 g) ( p ) = f ( ( p o g-I) ~ ( g - I ) ) where of the Jacobian of g-l.
Chapter 11.
STEINHAUS TYPE THEOREMS
The generalizations of Steinhaus's theorem and its Baire category analogy, Piccard's theorem, play a key role in several later sections as well as certain applications. This chapter deals with these generalizations.
3. GENERALIZATIONS OF A THEOREM OF
STEINHAUS A famous theorem of Steinhaus [I891 from 1920 asserts that, for any measurable set A c PS with positive Lebesgue measure the set A - A contains an interval. More generally, if A, B C Rn are measurable sets with positive Lebesgue measure, A B has an interior point; see for example the paper of Kemperman [123]. The proof can be based on Weil's idea [206] that the convolution of the characteristic functions XA and XB (in case A and B has finite measure) is a continuous function, hence the function
+
Chapter 11. Steinhaus type theorems
54
is continuous and as follows from Fubini's theorem, not everywhere zero. This means that A B contains a nonvoid open set. This proof works directly when X is a Haar measure on a locally compact group. This theorem allows various generalizations and modifications. In the generalizations the following problem is treated: if we replace the addition by a binary operation F ( x , y), under what conditions on F can we prove that F ( A , B) contains a nonvoid open set? The first step was done by Erdos and Oxtoby [52] proving in the case x, y E R that, if F is a continuously differentiable function with nonvanishing partial derivatives, then F (A, B) contains a nonvoid open set. Further generalizations detail the case when x and y are from different topological measure spaces and F satisfies certain solvability conditions in x and y. See in this direction Kuczma [I301 and Sander [178]. Sander has pointed out that one of the sets A, B may be nonmeasurable. These results apply to the case when x, y E Rn and F is a continuously differentiable function of which the partial derivatives are nonsingular. In this 5 we will treat a generalization for function F with more then two variables. Of course, if F maps IR x R x R into R we obtain a problem already solved by the theorem of Erdos and Oxtoby. To obtain a really interesting new problem, we have to consider a function with values in R2. The condition about the nonvanishing partial derivatives will be substituted with the condition that the null space of the derivative (as linear mapping) is in general position. Theorem 3.11 and Remark 3.13 are such generalizations of Steinhaus' theorem. As a corollary we obtain the well-known special case F : R n xRn + W . Our proof depends on a very general version of the theorem about the continuity of the convolution formulated in Theorem 3.4. As corollary we obtain a result about the continuity of a mapping
+
where the functions gi,t do not map sets with positive measure into zero sets and depend smoothly on the parameter t . The investigation of this mapping occurs in an implicit way in JArai [go] and in an explicit way in Krausz [125]. As we explained in the introduction, the lower semicontinuity of mapping (1) and several other variants of Steinhaus' theorem have applications in the theory of functional equations. The results of this 5 are used in $5 5, 6, 8, 10, and when discussing applications in 5 23. More detailed references about earlier results may be found in Kuczma [130], Kuczma and Kuczma [131], Sander [178], and Grosse-Erdmann 1611.
5 3.
55
Generalizations of a theorem of Steinhaus
Another kind of refinement of Steinhaus' theorem using one-sided lower densities in R is given by Raikoff [I681 and by Matkowski and ~ w i ~ t k o w s [156]. ki The results of this § are published in the paper JBrai [99]. We start with the investigation of an important condition. 3.1. Condition. In this 5 and in $5 8, 10, and 19 a measure theoretical condition will play an important r61e. In general form this may be formulated as follows.
Let X and Y be sets with measures p and v, respectively, let T be a set, x Y, and g : D + X a function. Our condition is the following: (1) For each E > 0 there exists a 6 > 0 such that if B C Dt, v(B) E, t E T , then p (B)) 2 6.
D
cT
>
Concerning this condition we summarize some simple properties. We treat this condition in 3.10 in the most important case when X and Y are open subsets of Euclidean spaces. Further results can be found in J&rai [80]. 3.2. Remarks. With the notation of the previous point,
if Dl C D2 C T x Y, g2 : D2 3.1.(I), then gl too;
-+ X , g1 = g2/ D l , and g2 satisfies condition
if D = D l U D2, g1 = 9101, 92 = g10 condition 3.1.(1), then g too;
2 ,
moreover gl and g2 satisfies
if for all E > 0 there exists a decomposition D = D l U D2 such that 9102 satisfies condition 3.1.(1) and p (gt ( ~ 1 , ~<) E) for all t E T, then g also satisfies condition 3.1. (1); condition 3.1. (1) is equivalent with the following one: for all E > 0 there exists a 6 > 0, such that if A c X , p(A) < 6, then v ( g , l ( ~ ) ) < E for all t E T; if p is a regular measure, then it is enough if (4) is satisfied for all measurable A; if X and Y are Hausdorff spaces, p and v are Radon measures, all gt is continuous on the Borel set Dt and the condition 3.1. (1) is satisfied, then gt-l(A) is measurable for all a-finite measurable set A C X whenever t E T; under conditions of (6) if B c A and A is a ,LL hull of B , then g,l(A) is a v hull of g , l ( ~ ) whenever t E T;
Proof. (1)-(5) are trivial. To prove (6) let us represent A as the union of a Borel set B and a set C having measure zero. Then g;l (A) = g,l ( B ) U
56
Chapter 11. Steinhaus type theorems
( C ) where g,l(B) is a Bore1 set and 9,' ( C ) has measure zero. To prove (7) suppose indirectly that for some measurable set E C Y we have
Then by the measurability of g r l ( A ) and since v is a Radon measure there exists a compact set C c E n g,l(A \ B ) such that v ( C ) > 0; but by gt ( C )c A \ B we should have p (gt ( C ) )= 0. 3.3. Lemma. Let T be a topological space, X a uniform space, C a compact uniform space, D C T x C , to E T , { t o ) x C C D , and let g : D + X be a continuous function. Then for any relation a from the uniformity of X there exists a relation ,B from the uniformity of C and a neighborhood V of to such that i f t , t' E V, the points y , y' are ,&near in C and ( t ,y ) , (t',y') E D , then the points g ( t , y ) and g(t', y') are a-near in X .
Proof. Let 6 be a symmetric relation from the uniformity of X for which 6 o 6 c a. For all y E C there exists a neighborhood V, of to and a symmetric relation y, such that if t E V, and the point y' is 7,-near to y , then g ( t , y') and g(to, y ) are 6-near in X . Let us choose for each y, a symmetric /, o ,By C y,. The open kernels of the sets ,By ( y ) give relation ,By for which 3 ,By,, an open cover of C . Let us choose a finite subcover and let ,B = V = V,,. If the points y and y' are p-near in C , then there exists a yi such that y and yi, moreover y' and yi are yyi-near, hence g ( t , y ) and g ( t o ,yi) moreover g ( t ' , y') and g ( t o ,y i ) are &near. This means that g ( t , y ) and g ( t l , y') are a-near in X .
nkl
3.4. Theorem. Let T be a topological space, Y a Hausdorffspace, X i (i = 1 , 2 , . . . ,n ) be completely regular spaces, and let Z , Zi(i = 1 , 2 , . . . ,n ) be Banach spaces. Let v and pi be finite Radon measures over Y and X i , respectively. Consider the functions fi : X i + Zi, gi : T x Y Xi, h : Z1 x . x Zn + 2.Suppose that the following conditions hold: (1) h maps bounded subsets into bounded subsets and is continuous; (i = 1 , 2 , . . . ,n ) ; (2) fi is Lusin pi measurable and is in L m ( p i ) (3) gi is continuous and for each E > 0 there exists a 6 > 0 such that p i ( g i , t ( B ) ) 2 6 whenever B c Y , v ( B ) E , t E T , and 1 5 i 5 n. Then the function
>
is continuous on T .
§ 3. Generalizations of a theorem of Steinhaus
57
Proof. First we prove that the integral exists. We may replace each fi by a bounded Borel function defined on all of Xi, which is almost equal to f i , and is separable valued. This switch does not change the integral, because, by (3), the set of points y for which the value of fi (gi (t,y)) are changed, has measure 0. Hence we may assume that the functions fi are separable valued bounded Borel functions. By (1) the function
is a separable valued Borel function whenever t E T is fixed, and its image is a bounded subset of 2.Hence the integral exists whenever t E T . Now let E > 0 and to E T. Let us choose a real number M > 0, for which the image of (4) is contained in the closed ball with center 0 and radius M . By (3), there exists a 6 > 0 such that B C Y, u(B) 2 E' = ~ / ( 1 6 M n ) t, E T , and 1 5 i 5 n implies pi ( g i , t ( ~ )2 ) 6. Let us choose a compact set C c Y for which u(Y \ C ) < ~ l ( 8 M )There . exists a compact set Ci in Xi for which pi(Xi \ Ci) < S and filCi is continuous. Let us choose uniformities on the spaces C , X I , . . . , X, compatible with their topology. By (1) there exists an a! > 0 such that
z E fi(Ci) and lzi - zQ) < a. Because of the uniform continuity whenever zi, Q of fi ICi there exists a reflexive symmetric relation pi in the uniformity of Xi such that Ifi(xi) - fi(x1) < a
1
whenever xi, xi E Xi and xi and xi are pi-near, that is, (xi, xi) E pi. By Lemma 3.3 there exists an open neighborhood V of to in T and a reflexive symmetric relation y in the uniformity of C such that gi ( t o y) , and gi (t, y') are &-near in Xi whenever t E V and y and y' are y-near in C. Now let t be an element of V and let
Then
58
Chapter 11. Steinhaus type theorems
and hence (using (3) and that pi(Xi \ Ci) < S),
Using this, we have with the notation
that
By (5) the first term on the right side is not greater than 2Me/(4M) = ~ / 2 . By the choice of K, a , P I , . . . ,,On, y, and V, the second term on the right ) ~/2. side is not greater than u ( Y ) E / ( ~ u ( Y )=
3.5. Corollary. Let T, Y, Xi, u, pi, and gi be the same as in the previous theorem. Suppose that condition (3) of the previous theorem is satisfied, and let Ai be a subset of Xi. Suppose that Ai is pi measurable if 2 5 i 5 n. Then the function
is continuous on T.
Proof. Condition (3) of the previous theorem by 3.2.(7) implies that the set 9;: (B1)is a u hull of 9;: (A1) whenever B1 is a p1 hull of A1. Hence
where X A ~is the characteristic function of Ai and function of B1.
XB,
is the characteristic
5 3.
Generalizations of a theorem of Steinhaus
59
3.6. Theorem. Let T be a topological space, X and Y be Hausdorfl' spaces with Radon measures p and v, respectively. Let D be an open subset of T x Y, g : D -+ X, to E T , and let K c X be a compact set. Suppose, that (1) the mappings g and (t, x) t, g,l(x) are continuous and gt is a homeomorphism of Dt onto X if t E T; (2) for each E > 0 there exists a 6 > 0 such that p(gt (B)) > 6 whenever B c Y, v(B) E, and t E T. Then . V ( ~ ; ~ ( K ) A ~ ; ~ (- +KO ) ) if t - + t o .
>
Proof. Let E > 0 and let us choose a 6 > 0 for ~ / 2 .Let C = g;l(K), and let W be an open subset of Dt, containing C for which v(W \ C) < ~ / 2 . Similarly, let U be an open subset of X containing K for which p(U \ K ) < 6. Let us choose an open neighborhood V of to such that if t E V, y E C , then gt(y) is defined and gt(y) E U ; moreover if t E V, x E K , then g,l(z) E W. Then by g;l(K) \gtnl(K) C V \ C we have v ( y ; l ( ~ ) \ g , o ' ( ~ ) < ) ~ / 2 Moreover . v (K) \ 9': ( K ) ) < &/2because the mapping gt maps the set gtil(K) \ g ; l ( ~ ) into U \ K , for which p ( U \ K ) < 6. Summarizing, v (,g;'(K) A g , ' ( ~ ) ) < E if t E V. 3.7. Corollary. Let G be a locally compact group and let X be a left Haar measure on G. Let Ai, i = 1,2, . . . ,n be subsets of G with finite measure. Suppose that Ai is X measurable if 2 I i 5 n. Then the mapping
of Gn into R is continuous.
Proof. Since the replacement of Al by a X hull does not change this function, we may suppose that Al is measurable too. Let E > 0 and T = Gn. Let us choose the compact sets Ki such that Ki C Ai and X(Ai \ K i ) < E are satisfied. Let Y = X1 = X2 = . = X, = G and gi(t,y) = t;ly
if
( t l , . . . , t n ) E T and y E Y.
Since for any measurable subsets B, C with finite measure
Ix(B) and
-
X(C)I I X(B A
c)
Chapter 11. Steinhaus type theorems
60
by the previous theorem we obtain
I x ( ~ ~ K n~ t 2 K 2n - n t , ~ , )
-
x ( t ! ~n, t ; ~ n, . . . n t ; ~ , ) (
n
X(tiKi A tf K i ) + 0 whenever ti
5
-+
ti0 ,
i=l that is
t
x ( ~ , Kn , t , ~ n, . - .n t , ~ , )
t)
is continuous on T . But
5
C X(tiAi \ t i K i )< ne. i=l
Hence
( t l , .. . ,t,) e x ( ~ n~. . An t~, ~ , ) is the uniform limit of continuous functions, and so itself is continuous. In the following lemma which will be needed for the proof of our main result, we give sufficient conditions for the validity of condition (3) in Theorem 3.4. 3.8. Lemma. Let Y be an open subset of IRk, let T be a topological space, yo E Y, and to E T. Let g : T x Y -+ RT be a continuous function and
8s is continuous and suppose that 8~
rank
($( t o ,
y o ) ) = r.
Then there exist open neighborhoods Y* and T* of yo and to, respectively, and there exists a constant 0 < C < cc such that Y* C Y, T * C T, and
whenever B C Y* and t E T*. (Here diamB denotes the diameter of the set B -1 Proof. Let q = k-r and let us divide the coordinates of y = ( y l , . . . ,y k ) into two groups y' = ( y i , . . . ,yb) and y" = (y:, . . . ,y f ) so that the condition
5 3.
Generalizations of a theorem of Steinhaus
61
is satisfied. Let us introduce the notation
Using the proof of the inverse function theorem (see Rudin [172], theorem 9.24) we obtain that, if Y'' is an open ball with center y: in Rr , t E T , (y', y") E Y, and
whenever y" E Y", then gt,,~ is a homeomorphic mapping of Y" onto an open subset U(t, y') of Rr . Now let
and
Using the continuity of the expressions in (2) and (3) we can choose an open ball Y" with center y t and open sets Y' and T * such that to E T*,yb E Y', Y* = Y' x Y" C Y, moreover t E T*, y' E Y', y" E Y" implies that
Let a ( q ) denote the Aq measure of the q dimensional unit ball ( ~ ( 0 = ) 1). We are going to prove that a ( q ) (diam B)"' X ~ BF )A' (gt (B))Y
62
Chapter 11. Steinhaus type theorems
whenever B C Y * and t E T*. Let R = diam B. Then there exists a closed ball V with radius R in IWQ such that B c (V n Y ' ) x Y".Suppose to the contrary that there exists a t E T* for which
where C = a(q)/y.Then we can choose an open set U for which gt(B)c U and X ~ B )> Xr(U)CRq. Let B* = g,l Then B
(u)n ( ( vn Y ' ) x Y " ).
c B*, B* is a Bore1 set and gt(B*)C U , that is
We are going to prove that this is impossible. Let
B,*,= { y"
: ( y ' , y") E
B * } if y' E V n Y ' .
Using the theorem concerning transformation of integrals we have that
whenever y' E V n Y'. By Fubini's theorem
which is a contradiction. Hence the proof is complete. 3.9. Lemma. Under the conditions of the previous lemma, if a subset D of WT has density 1 in the point g ( t o , y o ) , then g ~ l ( Dn) Y* has density 1 in the point yo.
89 Proof. By the continuity of -(to, y ) the function gt, satisfies the Lipdy schitz condition on a neighborhood of yo. Hence there exist a y > 0 and an 0 < M < m such that y E Y * and ( g ( t o , y ) - g ( t o , y o )Mly-yol I whenever
<
§ 3. Generalizations of a theorem of Steinhaus
63
ly - yo/ < y. Let a ( k ) and a(r) denote the X h n d AT measures of the Ic and r dimensional unit balls, respectively. Let E > 0, and let
where C is the constant from the previous lemma. Let us choose a ,B > 0 such that, whenever V is a closed ball with center g(to,yo) and radius less than ,f3, then AT (V n D ) 2 (1 - S)Xr (V). We prove that if W is a closed ball in Y* with center yo and radius less than y and P / M , then xk(wn & ( ~ ) ) 2 (I - E ) P ( w ) . Suppose to the contrary that for such a W with radius R,
Then there exists a compact subset B EXYW). Hence by the previous lemma,
c W \ g;'(~)
for which X ~ B>)
But gto(B) is a compact subset of V \ D , where V is the closed ball in RT with center g(to,yo) and radius M R < P. Since XT(V\ D) < SXr(V) we get
which contradicts the choice of S. 3.10. Lemma. Let Y be an open subset of Rk , T a topological space, D an open subset of T x Y, and (to,yo) E D . Suppose that the function g : D + RT is continuous and continuously differentiable with respect to y.
8s yo) is r, then there exist such neighborhoods If the rank of the matrix -(to, 8~
T* and Y* of to and yo, respectively for which (1) for each E > 0 there exists S > 0 such that XT(gt(B)) 2 S whenever t E T*, B c Y* Ak(B)2 E; (2) ifA is a A' measurable subset ofRr , then g,' (A)n Y * is a X b e a s u r a b l e subset of Y for each t E T*.
Proof. Let us use Lemma 3.8. Shrinking the so obtained set Y* we may suppose that it is bounded. Now from the statement of Lemma 3.8 and from 3.2.(4) we obtain (1) and from 3.2.(7) we obtain (2).
Chapter 11. Steinhaus type theorems
64
3.11. Theorem. Let X be an r-dimensional Euclidean space, and let X I , . . . ,X, be orthogonal subspaces of X with dimensions 9-1,.. . ,r,. 1 (1 5 i 5 n ) and CYylri = r. Let U be an open Suppose that ri subset of X and F : U -+Rm be a continuously differentiable function. For each x E U let N, denote the null space of F1(x). Let Ai be a subset of Xi (i = I , . . . , n ) and suppose that Ai is XTi measurable for 2 5 i 5 n. Let a E U, dim N, = r - m. Let pi denote the orthogonal projection of X onto Xi. Suppose that pi(Na) = Xi and Ai has density 1 in the point pi(a) whenever I 5 i 5 n. Then F (A1 x . x A,) is a neighborhood of F ( a ) .
>
Proof. Let k = r - m. Since x ++ rank F' (x) is lower semicontinuous, and r a n k ( ~ I ( a ) )= m, we may suppose that r a n k ( ~ I ( x ) )= m whenever x E U. Similarly, choosing a smaller U if necessary, we may suppose that pi(N,) = Xi whenever x E U and 1 5 i 5 n; to prove this, suppose to the contrary that there exists an i and for each natural number j there exists an 31) E U and there exist orthonormal vectors e p ) , . . . , e,-,,+, (j) in N,, such that x j -+ a and pi(ep)) = 0 whenever j = 1 , 2 , .. . and 1
< s 5 k - ri + 1.
Using the compactness of the unit sphere we can pass to a subsequence and suppose that e!$ + e, if j -+ m. But this proves that the vectors e, are orthonormal in Na and pi(e,) = 0 whenever 1 5 s 5 k - ri 1, which is a contradiction. Now, choosing a smaller U if necessary and using the rank theorem (see Dieudonni! [49], 10.3.1), we have that there exist mappings u, p, and v and an open neighborhood V of b = F ( a ) in Rm with the following properties: u maps U onto the open cube IT,where I = 1-1, 1[, u is invertible and moreover u and u-I are continuously differentiable; v maps Imonto V, v is invertible and moreover v and v-I are continuously differentiable; p is the projection ( X I , . . .1 2 , ) P : ( X I , . . .7 2 , )
+
of ITonto I m ; and finally F = v o p o u. We may write ITas IT= T x Y where T = Imand Y = I k . Let u ( a ) = (to,yo)E T x Y. Now let us use some facts from differential geometry (see Dieudonni! [49], mainly 16.8.8). U ~i F - I (v(t)) is a closed submanifold of U whenever t E T. The tangent space of this submanifold in a point x E U n F - l (v(t)) is equal to the subspace N, of X . Clearly u-I is a diffeomorphism of the closed submanifold {t} x Y
5 3.
Generalizations of a theorem of Steinhaus
65
of T x Y onto U f l F - I ( v ( t ) .) Let gi = pi o u-I if 1 5 i 5 n. By the choice of U , pi is a submersion of U n F-I ( v(t)) into Xi. Hence the mapping gilt : Y + Xi is a submersion too, that is, its derivative has rank ri whenever Y and t E T . Now, by Lemma 3.8, there exist open sets T* and Y * and there exists a 0 < K < cc such that to E T* c T , yo E Y * C Y , and
whenever B C Y * ,t E T * . Let X,* = X i , A: = Ai, and gg the restriction of gi onto T * x Y * . Applying Corollary 3.5 to the sets and functions marked by stars we have that the function
is continuous on T * . By Lemma 3.9, g;;:(~i) has density 1 in the point yo. Since g:,;: ( A i )n Y * is measurable by Lemma 3.10 if 2 5 i 5 n, we have that
has density 1 in the point yo. Hence f ( t o )> 0 and we have that there exists a neighborhood V of to for which f (t) > 0 if t E V . Clearly v ( V ) is a neighborhood of b in Rm. If z E v ( V ) , then t := v-' ( z ) E V and hence the set n
is nonvoid. If y is an element of this set, then ~ ( u - ' ( t y, ) ) = v ( p ( t ,y ) ) = v ( t ) = z and xi = pi (u-'(t, y)) = g:,t ( y ) E Ai if 1 5 i 5 n. This means that F ( x l ,. . . , x n ) = Z . 3.12. Corollary. Let U be an open subset o f W x RT and F : ( x ,y) ++ F ( x ,y) a continuously differentiable mapping of U into RT. Let A , B C RT and suppose that B is AT measurable. If ( a ,b) E U ,
dF det -(a, dx
b) # 0 ,
dF det -(a, dy
b) # 0,
A has density 1 in the point a and B has density 1 in the point b, then F ( A ,B ) is a neighborhood of F ( a , b).
66
Chapter 11. Steinhaus type theorems
Proof. By Theorem 3.11 we have to prove only that pl (Na,b) = RT and p2(Natb)= Rr where Nalbis the null space of F1(a,b). Let (x, y) E Na,b. If pl (x, y) = 0, then x = 0. Hence dF 0 = F 1 ( a ,b) (x, y) = -(a, dy
b) (y).
aF But det -(a, b) # 0, hence y = 0. This proves that pl : Na,b + Rr is a dy = Rr. one-to-one mapping, that is, pl (Na,b)= Rr . Similarly p2 3.13. Remark. The previous theorem may be stated in the following global form: IfN, is in general position, i.e., dim N, = r-m and pi (N,) = Xi for all x E U (i = 1,2, . . . ,n ) , moreover A1 x A2 x . x A, C U, Xri(Ai) > 0 (i = 1,2,. . . ,n ) , and the set Ai is Xri measurable for 2 5 i 5 n , then F(A1 x . . . x A,) contains a nonvoid open set.
Proof. Let us choose a point a E U for which Ai has density 1 at pi(a) whenever 1 5 i 5 n , and let us apply the previous theorem.
4. GENERALIZATIONS OF A THEOREM OF
PICCARD Piccard's result [I671 analogous to the theorem of Steinhaus states that the sum of two Baire sets having second Baire category has an inner point. Very strong generalizations exist; in this case also addition can be replaced by a two variable function with weak solvability conditions. These results are useful in the proof of "Baire property implies continuity" and "Baire property implies boundedness" type regularity theorems for functional equations. We refer the reader to the papers Sander [179], [181], [183], Kominek [124], J h a i [86], Grosse-Erdmann [61], Lindberg and Szymanski [139], and the references cited in them. The purpose of this section is to give a generalization of Piccard's theorem analogous to the results of the previous section. These results were published in J&-ai [103]. The following theorem is an abstract version of our generalization of the theorem of Piccard.
§ 4. Generalizations of a theorem of Piccard
67
4.1. Theorem. Let T , Y , and Xi be topological spaces, gi : T x Y + Xi continuous functions, and suppose that giSt(B)has second Baire category whenever B c Y is a subset of Y with second Baire category. Suppose that Ai C Xiand Ai is a Baire set whenever 1 5 i 5 n. Then the set V ofpoints t E T for which n
is of second category is an open subset of T Proof. The sets Ai can be written in the form Ai = Ei A Mi, where Ei is open and Mi is of first category. Suppose that to E V, and let K = n>,g; ( A i ) ,K t = K n (n:=,g;,fi ( E , ) ) . Since 9;; (Mi) is of first category in Y whenever i = 1 , 2 , .. . ,n , we have that
is of first category, hence K t is of second category in Y . Let yo be a point of K ' for which W f l K t is of second category in Y for each open neighborhood W of yo. Clearly gi(to,y o ) E Ei if i = 1 , 2 , . . . ,n. Since the sets Ei are open and the functions gi are continuous, the sets g Z - l ( ~ , )are open and contain the point (to,y o ) , hence there exist open sets V' and W', such that to E V', yo E W', and V' x W' c n~=lgz:l(~i). We will prove that
is of second category for each t E V'. Were this not true, the sets
would cover - except for a set of first category - the set W'. If we prove that these sets are of first category, then we have a contradiction. But this follows from the inclusion
which is a consequence of W' C 9,:
(E,).
Chapter 11. Steinhaus type theorems
68
4.2. Remark. I f we suppose that Y is a complete separable metric space and X1 is metrizable, then we may omit the condition that the set A1 has the Baire property.
To prove this, let C1 denote the set of all points xl E X1 such that for each neighborhood U1 of xl the set Ul n A1 is of second Baire category. It is known that C1 is a closed set and A1 \ C1 is of first category. Let B1 denote the set of inner points of C1. Then B1 is open and Al \ B1 is also of first category. As in the previous proof we obtain that W' \ g<;(B1) and W' \ g,;,'(Ai), 2 i 5 n are of first category. It is enough to prove that W' f l (A1) is of second category, because then it follows that
gl,i
<
cannot be of first category. Suppose that W' \ gl,; (A1) is of first category. Then, using that W' n g,(B1) i is an open set of second category, we obtain that
is a Baire set of second category. Let G be a Gs subset of second category of the set above. Then gl,t(G) is of second category as a subset of X I . By Bourbaki [38], IX, 56, Exercise 10, gl,t(G) is a Baire set in X I . Clearly ~ I , ~ (C G B1 ) \ Al. Writing gl,t(G) = U A F where U is open and F is of first category, we see that U n B1 is a nonvoid open set for which the intersection with A1 is of first category. This contradicts the definition of B1. In Laczkovich [135], 11.9.9 it is proved that the continuous image of a Polish space in a Hausdorff space is a Baire set. This shows that it is enough to suppose that X1 is Hausdorff. The following lemma allows us to use derivatives to verify that the conditions on the functions gi in the previous theorem are satisfied. 4.3. Lemma. Let Y be an open subset of R\ T a topological space, D an open subset of T x Y, and (to,yo) E D. Suppose that the function g : D -+ RT is continuous and has continuous partial derivative with respect &I to y. If the rank of -(to, yo) is r , then there exist open neighborhoods T*
8~
and Y* of t o and yo, respectively, such that
§ 4. Generalizations of a theorem of Piccard
69
(1) if B has second category in Y *, then gt (B) has second category in RT for each t E T *; (2) if A is a Baire set in RT, then g,l (A) n Y * is a Baire set in Y for each t E T*. Proof. We have proved in the proof of Lemma 3.8, that there exist open sets T* and Y' and an open ball Y" centered at y{ such that t o E T*, yb E Y', T* x Y' x Y" C D, and gt,,, is a homeomorphism of Y" onto an open subset U(t, y') of RT whenever t E T * and y' E Y'. Suppose tha,t there exists a subset B of Y* = Y' x Y" of second category, and a t E T*, such that gt(B) is of first category in RT. Let us choose a Borel set U of first category in RT, for which gt(B) C U C gt(Y*) and let B * = g , l ( ~ ) n Y*. This set B* is a Baire set and is of second category in Y*, but gt(B*) is of first category in IWT. By the Kuratowski-Ulam theorem the set of all points y' E Y' for which B;, is of second category is a set of second category. On the other hand, by the same theorem, the set of all points y' E Y' for which B,", is not a Baire set, is of first category. From this it follows that there exists y' E Y' for which B;, is a Baire set of second category in Y". Since gt,,/ is a homeomorphism of Y" onto U(t, y'), the set gt,,, (B,",)is of second category in RT. This is a contradiction, because gt,,~(By*,)C gt (B*). Hence (1) is proved. To prove (2) suppose that A is a Baire set in RT, and let us choose a Borel set B for which A C B and B \ A is of first category. Then
Using that g,l(B) is a Borel set and g c l ( B \ A) n Y* has first category by ( I ) , we have that g,' (A) f l Y * is a Baire set. Now we are prepared to prove the local version of our generalization of the theorem of Piccard for a function from an open subset of RT into IWm. The condition of the following theorem means, roughly speaking, that the null space of the derivative is large enough and is in general position. 4.4. Theorem. Let X be the r-dimensional Euclidean space, and let X I , . . . ,X , be orthogonal subspaces of X with dimensions r l , . . . ,r,, respectively. Suppose that ri 2 1 whenever 1 5 i 5 n and Cy=lri = r . Let U be an open subset of X and F : U + Rm a continuously differentiable function. For each x E U let N, denote the null space of F1(x). Let Ai be a Baire subset of Xi ( i = 1 , 2 , . . . ,n), and suppose that a E U and
70
Chapter 11. Steinhaus type theorems
dim Nu = r - m. Let pi denote the orthogonal projection of X onto X i . Suppose, that pi ( N u )= Xi and pi ( a ) has a neighborhood Uisuch that Ui\ Ai is offirst category if 1 5 i 5 n. Then F ( A 1 x x A n ) is a neighborhood of F(4.
Proof. Let us define t o ,yo, T , Y , and gi as in the proof of 3.11. By the above lemma there exist sets T* and Y * such that t o E T* c T , yo E Y * c Y , and gi,t(B) is of second category whenever B C Y * has second category and t E T * . Let X t = X i , A f = A i , and g: be the restriction of gi to T * x Y * . Applying the theorem above to the sets marked by a star we have that the set V * of points t for which n
is of second category is open in T * . Since gljtomaps Y * onto an open neighborhood of pi(a) and g* ( t o ,yo) = pi ( a ) , there exists an open neighborhood W of yo in Y * such that W \ g;;: (A,) is of first category if 1 5 i 5 n. This proves that t o E V * . Clearly v ( V * ) is an open neighborhood of b in Rm . If x E v ( V * ) ,then v - ' ( z ) E V * , and hence the set
is nonvoid. If y is an element of this set, then u - l ( t , y ) E F - ' ( x ) and xi = pi (u- ( t ,y ) ) E Aiwhenever 1 5 i 5 n. This implies F ( x l , . . . ,x n ) = z , which is enough since v ( V * )is an open neighborhood of b = F ( a ) .
4.5. Corollary. Let W be an open subset of Rr x IF,and let F : ( x ,y) H F ( x ,y ) be a continuously differentiable mapping of W into IF. Suppose, that A , B C Rr and A, B are Baire sets. If (a, b) E W,
dF det -(a, ax
b) # 0 ,
dF det -(a, 8~
b) # 0,
and there exist neighborhoods U and V of a and b respectively such that U \ A and V \ B is of first category, then F ( A ,B ) contains a neighborhood of F ( a , b).
Proof. Similar to Corollary 3.12 of the previous section. The above theorem can be formulated in the following global form:
§ 4. Generalizations of a theorem of Piccard
71
4.6. Theorem. Let X be the r-dimensional Euclidean space, and let X I , . . . ,X , be orthogonal subspaces of X with dimensions r l , . . . ,r,, respectively. Let pi denote the orthogonal projection of X onto Xi. Suppose that ri 2 1 whenever 1 5 i 5 n and Cy=l ri = r . Let U be an open subset of X and F : U + Rm a continuously differentiable function. For each x E U let N , denote the null space of F ' ( x ) . If dim N x = r - m and p i ( N x ) = Xi whenever x E U and i = 1 , 2 , . . . , n, moreover A1 x . x A , c U and Ai is a Baire set having second category i f 1 i n, then F ( A 1 x x A,) contains a nonvoid open set.
< <
a
Proof. A Baire set Ai can be written in the form Ui A Fi where Ui is a nonvoid open set and Fi is o f first category. For any a E U1 x U2 x . x U, we may apply the previous theorem. 4.7. Remark. Using the previous Remark, we may omit the condition that Al is a Baire set.
Chapter 111.
BOUNDEDNESS AND CONTINUITY OF SOLUTIONS
In this chapter we discuss how we can prove boundedness, local boundedness, or continuity of solutions of functional equations from their measurability or Baire property.
5. MEASURABILITY AND BOUNDEDNESS The classical method treated in the introduction to prove continuity of solutions works only for locally integrable solutions. Hence for measurable solutions we have to prove that they are bounded and hence locally integrable. Therefore we need "measurability implies boundedness" type theorems. For certain special equations such type of results can be found for example in the papers of VajzoviE [204], Dar6czy [40], and Sander [179]. First we prove a general theorem about a functional inequality. This theorem was first published in the paper JBrai 1801. As we have already mentioned in the introduction the results of 5 8 make it possible to directly conclude continuity of solutions from measurability if the measures are Radon measures. Hence "measurability implies boundedness" type results are used today only in cases where there are non-Radon measures involved.
5.1. Theorem.
Let T, Y, and Xi (i = 1 , 2 , ,. . ,n) be sets, u and Xi (i = 1 , 2 , . . . ,n), respectively. Further
pi be finite measures on Y and
Chapter 111. Boundedness and continuity of solutions
74
IetDcTxYandf :T+R, fi:Xi+R, fo:Y+R,gi:D+Xi ( i = 1,2, . . . ,n ) , h : D x IRnf R be functions. Assume that the following conditions hold: ( I ) for every (t,y) E D
> 0 there is a K > 0 such that xi I k ( i = 0,1, . . . , n ) and ( t ,y) E D implies h ( t ,y, xo,2 1 , .. . , xn) I K ;
(2) for every k
(3) the functions f i , i = 1,2,. . . ,n are measurable; (4) for each E > 0 there is a S > 0 such that p i ( g i , t ( ~ )2) S whenever 1 5 i 5 n, t E T , B c Dt, and u ( B ) 2 E ;
(5) there exists an E > 0 such that u ( D t ) 2 E for every t E T . Then f is bounded ii-om above on T . Proof. Since the sequence of sets
is increasing and it's union is Y , for the that with the notation
E
> 0 of (5) there exists
a ko such
. we have u ( C o )2 u ( Y ) - ~ / 2 Hence
By (4) there exists a 6 > 0 such that pi ( g i , t ( ~ ) > ) S whenever 1 2 i I n , t E T , B C Dt, and u ( B ) _> ~ l ( 2 n ) Since . for 1 I i 5 n the increasing sequence
{ x :x
€ x if,i ( x ) 5 k ) ,
k = 1,2,...
of measurable sets has union X i , there exist ki's such that with the notation
we have pi(Xi
\ C i ) < S.
We will prove that the set
5 5.
75
Measurability and boundedness
is nonvoid for all t E T . Indeed, if this set would be void, then the set Dt n Co would be covered by the sets g$ (Xi \ Ci). Since u(Co n D t ) 2 r/2, for some 1 5 i 5 n we would have u ( g $ ( ~ i \ C,)) rl(2n). This would imply that pi(Xi \ Ci) 2 S, contradicting the choice of Ci. Now let k be the maximum of the numbers ko,kl, . . . , k, and let y be any element of set (6). Then gi (t, y) E Ci and hence by (1) and (2) we obtain that
>
f(t)
< h (t, Y
fo (I/), fi (91(t, 9))
1
, f n (9n (tr Y))) 5 K.
5.2. Remark. Applications of the previous theorem are not restricted to real-valued functions. For example if functions f , fi have values in a semimetric space then we may take the distance of the function value from a fixed point of the space and apply the previous theorem to these real-valued functions. Even the condition that the measures v and pi are supposed to be finite is not a strong restriction: choosing appropriate smaller sets T , Y, D , and Xi finiteness can be obtained. We will use these observations in the proof of the following statement.
5.3. Statement. Let G be a locally compact group, H a metric group with left invariant metric d, and let f : G -+ H be a homomorphism. I f f is measurable with respect to some invariant expansion p of left Haar measure, then f is bounded on compact sets.
Proof. Let T be a compact subset of G and let Y be a compact subset of G having positive measure. Let D = T x Y and X = TY. Then
Introducing the notation and
f (x) = d(e, f (x)) the
function
If is p measurable
-
f (t) I f ( t y ) + f ( y ) if t , y E D .
Applying the previous theorem we obtain that f is bounded on T .
76
Chapter 111. Boundedness and continuity of solutions
6. CONTINUITY OF BOUNDED MEASURABLE
SOLUTIONS As we mentioned in the introduction, "bounded measurable solutions are continuous" type results are not so important today as were before because of the new "measurability implies continuity" type results of 5 8. Hence we discuss only two - unpublished - results, 6.6 and 6.8. Our interest in these theorems is that they can be applied to covariant expansions of the Eebesgue measure and to invariant expansions of Haar measure. Note that Theorem 3.4, treated among Steinhaus type theorems is also connected with this topic and can be used to prove continuity of bounded measurable solutions. First we have to survey some results about invariant expansions of the Haar measure and covariant expansions of the Lebesgue measure. The full discussion can be found in JBrai [82]. In 1950, Kakutani and Oxtoby [I211 proved that Haar measure on the circle group T can be expanded to an invariant, countably additive measure for which the corresponding L2-space has Hilbert space dimension 2', where c = card(T) is the cardinal number of the continuum. The same result holds for any compact metric group (see Hewitt and Ross [66], 5 16). On the other hand, using the result of Kakutani and Kodaira [122], in 1959 Hulanicki [75] proved that 2' discontinuous characters of the circle group can be made simultaneously measurable under an invariant expansion of Haar measure. This result was extended by Itzkowitz [77] for connected compact infinite Abelian groups and by Hewitt and Ross [68] for arbitrary locally compact Abelian groups by showing that 2card(G)characters of any infinite locally compact Abelian group G can be made simultaneously measurable under an invariant expansion of Haar measure. In JBrai [82] the author generalized the theorem of Kakutani and Oxtoby for arbitrary locally compact groups. This result is based on an abstract expansion theorem for a given finite measure to a very much larger measure so that the expanded measure has the same transformation law for a certain class of transformations as the original measure. To explain these results we introduce two notions. 6.1. Definition. A measure p is called J covariant measure if and only if a set T of mappings r is given such that each r E T maps a p measurable set into X, and for every r E T an extended real-valued nonnegative p measurable function J ( T ) is defined satisfying the following property:
5 6.
Continuity of bounded measurable solutions
77
(1) the multiplicity function N (r/A,y) is p measurable and
whenever A is a ,u measurable set and T E T = dmn J . If T is one-to-one and J(r)is p almost equal to the characteristic function of r E T, then the measure p is called T invariant. d m n whenever ~ We remark that if every element T of T is one-to-one, then condition (1) is equivalent to (2) r (A) is p measurable and p (T(A)) = JA J(r)(x) dp (x) whenever A is p measurable and r E T . Hence if every element r of T is one-to-one, then p is T invariant if and only if (3) r ( A ) is p measurable and p (T(A)) = p(A) whenever A is p measurable, 7 E T , and A c d m n ~ . 6.2. Examples. (1) Let X = Rn, let p be the Lebesgue measure on Rn, and let T be the group of all translations of Rn. Then p is T invariant. = 1 for every x E Rn whenever r E T . In this case, J(T)(x) (2) Let X = R n , let p be the Lebesgue measure on Rn, and let T be the set of all linear mappings of Rn into itself. If J(r)(x) = 1 det r / whenever x E Rn and T E T , then p is J covariant. (3) Let X = Rn,let p be the Lebesgue measure on Rn, and let T be the set of all continuously differentiable functions r , each mapping an open subset of Rn into Rn. Let J ( r ) ( x ) be equal to the absolute value of the Jacobian of r at x if x E d m n r and 0 if x $ d m n r . Then p is J covariant.
6.3. Remark. If p is a J covariant measure, then we have for every nonnegative p integrable function f the following formula concerning the transformation of integrals: (1) for every r E dmn J the function y ++ C,ET-liy) f (x) is a p measurable function and
This statement is obvious in the case when f is the characteristic function of a p measurable subset of X. Hence, using approximation by linear combination of characteristic functions we obtain the statement for nonnegative integrable functions. Our abstract expansion theorem is as follows:
78
Chapter 111. Boundedness and continuity of solutions
6.4. Theorem. Let p be a J covariant measure on X for which 0 < p ( X ) < oo, and suppose that every element of T = dmn J is one-to-one. Suppose that for some infinite cardinal number n and a subset .F of 2 X the following properties hold: (1) card(X) = 2"; (2) card(T) 5 2";
(3) card(.F) 5 2" and for every p measurable A for which p(A) > 0 there exists an F E .F for which card(F) = 2" and F C A. Then there exists a J covariant expansion v of p with weight 22". The proof is based on the method of Kakutani and Oxtoby. The essential difference is in the elimination of the topological feature of the original version. This makes it possible to use only purely set-theoretic and measuretheoretic devices. The proof uses the axiom of choice, but does not use the continuum hypothesis. Our expansion theorem for Haar measure reads as follows:
6.5. Theorem. Let G be a locally compact nondiscrete group and X a left Haar measure on G. Let A denote the modular function of G, and let n denote the character of the topological space G. Then the measure X (which has character n) can be expanded to a measure K such that K: has character 22n and if z E G and K is K measurable then (1) zK is K measurable and K ( z K )= K ( K ) ; (2) K z is K measurable and K ( K z )= A ( z ) K ( K ) ; (3) K-I is K measurable and K ( K - ' ) = l / A ( z )~ K ( z ) .
&
The proof is based on the abstract expansion theorem above. As another application of the abstract expansion theorem we proved that the Lebesgue measure on the n-dimensional Euclidean space can be expanded to a measure for which the corresponding L2 space has Hilbert space dimension 2' and is transformed by differentiable coordinate transformations as the Lebesgue measure, namely, J covariant with respect to J from example 6.2.(3). This is only a special case of a more general result about covariant expansions of the Lebesgue measure. A similar theorem is true for manifolds: see my thesis [98], kj 5.
6.6. Theorem. Let n , k, m l , . . . ,m,, and s be positive integers. Let Xi be an open subset of IW" let T and Y be open subsets of IWS, let Zi be let D be an open subset of T x Y, and let Z be a an open subset of Rmi,
$ 6. Continuity of bounded measurable solutions
79
Euclidean space. Let us consider the functions f : T -+ 2,gi : D -+ Xi, fi : X i + Z i , h i : D x Z i + Z ( i = 1 , 2 , . . . , n ) , a n d h o : D + Z . Let T J(T)be the mapping given in example 6.2.(3). Let p be a J covariant expansion of the Lebesgue measure on Rk.Suppose that (to,yo) E D and (1) for each (t, y) E D we have
(2) hi is continuous (i = 0 , 1 , . . . , n ) ; (3) the function fi is p measurable and bounded on Xi; (4) gi is twice continuously differentiable and
Then f is continuous on some neighborhood of to. Proof. Using coordinates, we may suppose that Z = R Let us choose a compact set K which is a neighborhood of yo and an open neighborhood V of to having compact closure such that V x K C D. If t E V, then integrating over K we obtain
Since p ( K ) # 0, it is enough to prove that the integrals on the right-hand side are continuous functions of t. This is clear for the first integral because ho is uniformly continuous on V x K . Leaving out indices, we have to prove
is continuous on V if V and K are small enough. By substituting y = g c l ( x ) and using that the measure p is J-covariant, we obtain that
whenever t E 7. Since g is continuous on the compact set 7 x K , the set C = g(V x K) is compact, too. Since g t 1 ( x ) and (~9;') (x) are continuous on
80
Chapter 111. Boundedness and continuity of solutions
the compact set 7x C, they are bounded and uniformly continuous there. Since the range of f is contained in a compact set L and h is uniformly continuous on the set V x K x L, we obtain that for each E > 0 there exists a S > 0 such that with the notation
whenever t , t l E V, It - tll exists an M > 0 such that
IH(~,x)I~M
<
S, and x E g t ( K ) n gt/(K). Moreover, there
if
t t V
and
x€gt(K).
Hence
By 3.6 the multiplier of M goes to zero if t
-+
t', hence the statement follows.
6.7. Remark. In the previous theorem, the sum of functions h l , . . . , hn cannot be replaced by a function having n variables. The addition cannot be replaced even with complex multiplication. Indeed, by the expansion theorem I ' of Hewitt and Ross 1681 a lot of non-continuous homomorphisms f : G 4 ' of a locally compact nondiscrete Abelian group G (for example, G = R) can be made measurable by an appropriate invariant expansion of Haar measure. This is not true for homomorphisms having values in Rn, as shown by the following statement. 6.8. Statement. Let G be a locally compact group and let f : G + Rn be a homomorphism which is measurable for a left invariant expansion p o f left Haar measure. Then f is continuous.
Proof. By the results of the previous section f is locally bounded. Similarly, as in the proof of the previous theorem, if we integrate both sides of the
§ 7. On a problem of Mazur
81
equation f (t) = f (ty) - f (y) over a compact set K having positive measure, we obtain K
The mapping
r
is continuous, because
where X is Haar measure. The expression on the right-hand side continuously depends on t and is zero if t = to, hence f is continuous.
7. ON A PROBLEM OF MAZUR In about 1935 S. Mazur asked (Problem 24 of "The Scottish Book" [157]) the following question: An additive functional f in a (real) Banach space X is given with the property
(P)
for each path g in X , the function f
o
g is Lebesgue measurable.
Is it true that f is continuous? This long-standing question was answered in 1984: Labuda and Mauldin [I341 proved, that the answer is yes, more generally, i f f is an additive operator mapping X into a Hausdorff topological vector space, and f has property (P), then it is continuous. The result was extended by Lipecki [I401 to the case when X and Y are Abelian Hausdorff topological groups with X metrizable, connected, locally arcwise connected, and complete. More general functional equations have also been considered. R. Ger [58] has obtained similar results concerning property (P) for Jensen convex functions, and L. Szkkelyhidi [I911 for exponential polynomials. The aim of this section is to prove that, on certain topological spaces, property (P) is equivalent to a more usual measurability condition, namely,
Chapter 111. Boundedness and continuity of solutions
82
universal measurability (J8rai [89]). This result makes it possible to use "measurability implies continuity" type results, that are known for more general functional equations, to get "property (P) implies continuity" type results. It is clear that several questions are open in this field, but the results below show that the two questions are strongly connected and may be starting points for further research. In particular, using a result from the classic book of Schwartz [187], we obtain the result of Labuda and Mauldin (see 7.5).
7.1. Definitions. For a measure p let M, denote the a-algebra of all p measurable sets. For a topological space X let Ux = n{M, : p is a Borel regular measure on X ) , the set of universally measurable subsets of X . Similarly, if X is a Hausdorff space, let Rx = n{M, : p is a Radon measure on X). In these definitions the expression "Borel regular measure" and "Radon measure" may be replaced with the expression "diffuse probability Borel regular measure" and "diffuse probability Radon measure", respectively. In the case when X is a complete separable metric space, Rx = Ux. More general results can be found in Schwartz [187], pp. 117-130. See also Federer [53], 2.2.16. Let
Px
=
{A : A C X , g-l(A) is Lebesgue measurable for each path g in x).
Trivially, Px is a a-algebra, and if f : X -+ Y is a function mapping X into a topological space Y, then f -l(V) E Px is satisfied for any open subset V of Y if and only if for each path g : [a, b] + X , the function f o g is Xra,b1 measurable. So we have to investigate only the connection of Px with other a-algebras. Clearly, Bx c Px, where Bx denotes the class of all Borel subsets of X. More generally, we have 7.2. Theorem. With the notation of the definition above, if X is a Hausdorff topological space, then Rx c Px.
4
Proof. Suppose that A is a subset of X for which A P x . Then there exists a path g : [a, b] + X for which g-'(A) is not Lebesgue measurable. Let p = g#(X[,,b1). Using results 2.2.17, 2.1.5(4) from Federer [53], we get that p is a Radon measure, and A cannot be p measurable.
5 7 . On a problem of Mazur
83
The converse is not true in all Hausdorff spaces. Clearly, if X is totally disconnected, than Px = 2 X . Hence some connectedness properties are needed. To prove the converse for a wide class of spaces, we need a lemma, which is more or less equivalent to a theorem of Marczewski published in Polish* (see [140]).
7.3. Lemma. Let p be a diffuse probability Radon measure on a complete metric space X. Then for each E > 0 there exists a compact subset C of [ O , l ] and a homeomorphism g : C + g(C) c X such that g is measure preserving between C and g(C), that is, g#(Xc) = pLg(C), moreover 1 > p(g(C)) > 1 - E . Proof. Let G(x, r) denote the open ball, B ( x , r) denote the closed ball with center x and radius r, respectively. Let t, be a strictly decreasing sequence tending to 1. Let us choose a compact set K C X for which 1 > p ( K ) > I - ~ 1 2 .We may suppose that spt(p1K) = K . For a fixed x E K , the function r H p ( K n B(x, r ) ) is positive for each positive r, monotone increasing, and tends to 0, as r tends to 0. Hence, we can choose a number 0 < r, < 112, for which the function is continuous in the point TX and has values less than i p ( ~ ) Choosing . a finite subcovering from the open covering G(x, r,), x E K , we get points xi and radii ri = r,,, for which
Moreover, we can choose radii r,! > ri, i = 1 , 2 , .. . , n , for which
Let K1 = K f l B ( x l , r l ) , and by induction let
If one of the sets Ki has measure zero, let us drop it, decreasing n. Finally, let n
Clearly, the sets Ki are disjoint compact sets with diameter less than 1, for each i the inequality 0 < p(Ki) < 112 holds, K(') C K , and p ( K \ ~ ( ' 1 ) < *Recently collected works of Marczewski have been published in English.
84
Chapter 111. Boundedness and continuity of solutions
~ / 2 Let ~ .us select disjoint closed intervals Ci, i = 1,2,.. . , n in [O,1],such that p(Ki) < X(Ci) < t l p ( K i ) ,whenever i = 1 , 2 , . . . , n and let c(')denote their union. In the second step, let us repeat this process for each Ki instead of K . We can get the disjoint compact subsets
of Ki with diameter less than 1/2, with measure positive but less than 1/4, such that with the notation
K ( ~C) K(') and p(K(')\ ~ ( ~ <1 ~ ) / 2 We ~ . choose the corresponding closed subintervals
Ci,l,Ci,2,.. . , Ci,ni
is satisfied. Let
n
nil
Continuing this process, by induction we get the systems
and the sets K(" and c @ ) . Let
To each point x E C there corresponds a unique sequence
Similarly, to each y E K , there corresponds a unique such sequence. Let g(x) = y if the corresponding sequences are equal.
85
§ 7 . On a problem of Mazur
To prove that g is continuous, let q > 0. If 1/2"' < 7 , and 6 is the minimal distance between sets Ci, ,i, ,.. ,ik, then 1x - x'l < 6 implies that g(x) and g(xl) are in the same set Kil,i2 ,,,. ,ik, hence have distance less than q. If q > 0 and tk/2k < q, let S be the minimal distance between sets Kil,i2,,., ,ik. If the distance between g(x) and g(xl) less than 6, then x and x' are in the same set Cil, i 2 , . .. , i k , hence lx - x11 < q and g-' is continuous. To prove that g# (Ac) = p [g(C), it is enough to prove that ,
because every open subset of C and g(C) is a disjoint countable union of open subsets of C and g(C) of this form, respectively, and the measures are Radon measures. But
7.4. Theorem. Let X be a complete, connected, and locally arcwise connected metric space. Using the notation of the previous definition, Px c
Ex. Proof. Suppose, that A is a subset of X for which A f! Rx. We will prove that A f! Px.Let us choose a diffuse probability Radon measure, for which A f! M,. Let A' denote the complement of A. Then p(A) p(A1) > p ( X ) = 1. Let us choose an E > 0 for which p(A) p(A') > 1 2 ~ .By the previous lemma, there exist a compact subset C of [O,1] with X measure greater than 1-E and a measure preserving homeomorphism g between C and K = g(C) C X. By theorem 5, $50 in Kuratowski, [132],g can be extended to a path h mapping [ O , 1 ] into X. Clearly h#(XLC) = g#(Xc) = p [ K . By the measurability of K, p(A) = p ( A n K ) + p ( A \ K ) , hence p ( A n K ) > p(A) -E. Similarly, p(A' n K ) > p(A1)-e. This proves, that A is not p LK measurable. By 2.1.2 of Federer [53], h-'(A) is X measurable if and only if A is h#(XLB) measurable for each subset B of I, which is not the case if B = C. Hence
+
A$
Px.
+ +
86
Chapter 111. Boundedness and continuity of solutions
7.5. Corollary. The answer to the problem of Mazur is affirmative, i. e., if for an additive functional f of a (real) Banach space X the function f 0 g is Lebesgue measurable for all continuous functions g : [a,b] + X, then f is continuous. Proof. Using the notation of the previous definition, we obtain from the previous theorem that f is universally Radon measurable. Since for all x E X the mapping t H f (tx) is Lebesgue measurable and satisfies Cauchy's equation, it is continuous. Hence we obtain that f is homogeneous. By a theorem from the book of Schwartz [I871 (p. 157), a linear and universally Radon measurable functional on a (real) Banach space is continuous. It follows that f is continuous. 7.6. Remark. The previous corollary can be extended to additive operators mapping a (real) ultrabornological topological vector space into a (real) locally convex Hausdorff topological vector space, if we use the ideas of the proof on p. 159 in the book of Schwartz [187].
8. CONTINUITY OF MEASURABLE SOLUTIONS Several papers dealt with the following question. Does, for a given type of functional equations, measurability of a solution imply its continuity? Among several papers dealing with measurable solutions of a special functional equation, the following papers treat general types of functional equations: Baker, [29], McKiernan [I581, Itzkowitz [78], Paganoni [162], and Sander [180]. Szkkelyhidi's results concerning regularity properties of (generalized) polynomials and exponential polynomials on locally compact Abelian groups can also be considered as general "measurability implies continuity" type results. See his book [193]. In this section first we will treat results first published in the papers JBrai [80], [all, [86]. The presentation here is somewhat more general than there, and several earlier results can be obtained as special cases. These theorems yield several new results; some of these will be treated in the last chapter among applications. Then we will discuss a result of K.-G. Grosse-Erdmann [61] and a new result of ours connected to it. We will refer to a quite recent "measurability implies continuity" type result of Guzik among applications, in 22.4. The reader can find a large number of further references in the papers mentioned above.
§ 8. Continuity of measurable solutions
87
8.1. Theorem. Let Z and Zi ( i = 1,2,. . . , n ) be completely regular spaces and Zo a a-compact completely regular space. Let Xi ( i = 1,2, . . . ,n ) and Y be locally compact spaces and T be arbitrary topological space. Let v be a Radon measure on Y and let pi ( i = 1,2,. . . , n ) be Radon measures on X i . Suppose that v ( Y ) < m, p,(Xi) < cx (i = 1,2,. . . , n ) , and let D be subset of T x Y . Consider the functions f : T -+ Z , f o : Y 4 Zo, f i : X i + Z i , h : D x Z o x Z 1 x . . . x Z n + Z , g i : D + X i ( i = 1 , 2, . . . , n ) . Let to be a fixed element of T, and suppose that the following conditions hold: (1) for each ( t ,y) E D,
h is continuous; (3) the function f i is Lusin pi measurable on Xi (i = 1,2,. . . ,n ) ; (4) gi iscontinuouson D (i = 1,2, . . . , n ) ; (5) Dt is measurable and there exists an 77 > 0 such that v(Dt n Dto) 2 7 for each t E T; B)) (6) for each E > 0 there exists a S > 0 for which P ~ ( ~ ~ ,2~S (whenever B c Dt, v ( B ) E , t E T , and I < i 5 n . Then f is continuous at the point to. (2)
>
>
Proof. By ( 5 ) , there exists an 17 > 0 for which v ( D t n D t o ) 77 whenever t E T. Let C be a compact subset of Dto for which v ( C ) > v ( D t o )- 7714. Now let E = 77/(8n). Using (6), we get a 6 > 0 for which 1 5 i n, t E T , B C Dt, and v ( B ) 2 17/(8n) imply p i ( g i , t ( ~ )2) 6. There exists a compact subset Ci of Xi for which pi(Xi \ C i ) < S and filCi is continuous. Hence the v measure of the open sets g ; ; , ' ( ~ i \ Gi) is less than 7 / ( 8 n ) whenever 1 i n and t E T. Consider now the set
<
< <
This is a measurable subset of Y, and it has a measure greater than 712. Let B 1 ,B q ,B 3 , . . . be an increasing sequence of compact sets in the space Zol for which U E , Bi = Zo. Then lim v ( f o 1 ( ~ i = ) )v ( Y ) < m,
i+m
88
Chapter 111. Boundedness and continuity of solutions
and, for a suitable i , the compact set Co = Bi satisfies the condition
Comparing this with what we have established before, we see that the set
is not empty for any t E T. Consider on the spaces Z , Zo, Zi, Xi, and Y uniformities compatible with the topology. Let a be a relation from the uniformity of Z and consider the compact set c x Co x fl(C1) x . . . x fn(Cn). By (2) and Lemma 3.3 there exists a neighborhood V' of to and there exist relations ,&, . . . ,,Bn from the uniformities of Z1,. . . , Zn, respectively, such that the points
are a-near whenever t E V', (t, y) E D , y E C, fo(y) E Co, zi, 2: E fi(Ci), and the points zi, zl are ,&-near if i = 1 , 2 , . . . ,n. Now, fi is uniformly continuous on Ci because Ci is compact. Hence there exists a relation yi from the uniformity of Xi such that the points fi(xi) and fi(x',) are &-near whenever xi,xi E Ci are yi-near. By (4) and Lemma 3.3 there exists a neighborhood V C V' of to for which the points gi(t, y) and gi(t0,y) are yi-near whenever t E V, (t, y) E D and y E C. Let t E V. Let us choose an arbitrary element y from the set (7). Then we have that (t,y) E D , (to,y) E D, fo(y) E Co, gi(t,y) E Ci, and gi(to,y) E Ci (i = 1,2, . . . , n ) . Hence the points fi(gi(t,y)) and fi(gi(to,y)) are ,&near, and so
are a-near. This shows that f (t) and f (to) are a-near whenever t E V, that is, f is continuous at to.
§ 8. Continuity of measurable solutions
89
8.2. Theorem. Let Z, Zi (i = 1 , 2 , . . . , n ) be completely regular spaces, Zo a a-compact completely regular space, Y and Xi (i = 1 , 2 , . . . , n ) locally compact spaces, T an arbitrary topological space. Suppose that D c T x Y . Let f : T -+ 2,f o : Y -+ Z o , g i : D - + X i (i = 1 , 2, . . . , n ) , fi : X i -+ Zi (i = 1,2, . . . , n ) , a n d h : D x Z o x Z l x . . . x Z n -+ Z befunctions, v a Radon measure on Y and pi a Radon measure on Xi (i = 1,2, . . . , n ) . Suppose that to E T and the following conditions are satisfied:
(1) for each (t,y) E D ,
(2) h is continuous;
(3) fi is Lusin pi measurable on the subset Ai of Xi ( i = 1,2, . . . ,n ) ; (4) gi is continuous on D (i = 1 , 2 , . . . , n ) ;
(5) there exist sets V and K such that V is open, K is compact, V x K to E V, v ( K ) > 0, and K c n ~ = ~ ~ ~ ; ~ ~ ~ ( ~ i ) ~
c D,
(6) for each E > 0 there exists a S > 0 such that B c K and v(B) implies pi ( g i , t ( ~ )2) 6 whenever 1 5 i 5 n and t E V.
2
E
Then f is continuous on a neighborhood of to.
Proof. We will reduce this theorem to the previous theorem by replacing D by a suitable smaller set D*. Let us choose a S > 0 for v ( K )l(4n) and let Xf = gi,t, ( K ) and let Wi be an open set with compact closure containing Xf for which pi(Wi \ X f ) < S. By Lemma 3.3 we may choose an open neighborhood T * c V of to for which gi,t(K) C Wi whenever i = 1 , 2 , . . . ,n and t E T*. Let Y* = K , v* = v(K)/2, and t t be an arbitrary element of T*, D * = {(t*,y*) : t* E T * and y* E n~=lgz;$ (x:) n Y*}, Zz*= Zi
( = O , .
n),
Z* = Z,
f * = flT*,
fo* = folY*,
and let v* and p: be the restrictions of v and pi to Y* and Xf (i = i , 2 , . . . ,n ) , respectively. If we prove that the conditions of the previous theorem are satisfied for the sets, points, functions, and measures marked by stars, then the proof is complete. Clearly, the conditions 8.1. ( I ) , 8.1.(2), and 8.1. (4) are satisfied, and 8.1. (3) follows by the choice of X,?;. 8.1. (6) also follows from the definition
90
Chapter 111. Boundedness and continuity of solutions
of T * and Y * . Hence we only have to prove that 8.1.(5) is satisfied. It is enough to prove that
u ( g ~ ( ~ : ) n Y2*2 7) ' - E
if
l
and
~ E T * ,
because from this
((n n
v ( D ~n * D,:) = u
g;;*
n(n n
(xt)n Y*)
i=l
gL;:
(x:)n Y * ) )
i=l
Suppose to the contrary that there exists an i (1 5 i 5 n ) and t E T * for which V (Y*n g,;t(x:)) < 24' - E l
On the other hand, by the choice of T * if y E Y * \ g , ~ , ' ( X r )then , gi,t(y) E Wi but g i , t ( ~ $) X:, i.e.1
This is a contradiction because pi(Wi \ X t )
<S
8.3. Theorem. Let Z , Zi (i = 1,2,. . . ,n ) be completely regular spaces, Zo a a-compact completely regular space, and T an arbitrary topological space. Let Y be an open subset of IW'C , Xi an open subset of F i ( i = 1,2,. . . ,n ) , and D an open subset of T x Y . Consider the functions f : T + 2, f o : Y + Z O , g i :D + X i , fi : X i + Zi (i = 1 , 2 , . . . , n ) , and h : D x Zo x Z1 x . . . x Zn 4 2 . Suppose that to E T and the following conditions are satisfied: (1) for each ( t ,y) E D ,
(2) h is continuous; (3) fi is Lusin XTi measurable on the subset Ai of Xi ( i = 1,2,. . . , n ) ;
5 8.
Continuity of measurable solutions
91
(4) gi is continuous on D (i = 1 , 2 , . . . , n ) ; g;;o~~i)) > 0; agi (6) the partial derivative - is continuous and has rank ri for each (t, y) E
(51 X~
8~
D (i = 1 , 2, . . . , n ) . Then f is continuous on a neighborhood of to. Proof. We will reduce this theorem to the previous theorem by replacing D by a suitable smaller set D*. Let yo be a density point of the set
By Lemma 3.10 there exist open neighborhoods V and W of to and yo, respectively, with the following properties:
(7) for each E > 0 there exists a 6 > 0 such that Xri (gilt(B)) > S whenever 1 Ii I n , t E V , B C W, and Xk(B) > E ; (Ai) n W is a X h e a s u r a b l e subset of ~ h h e n e v e r1 5 i I n and (8) t E V; (9) V and W are compact sets, V x W c D and
Let D* = V x W, h* = hlD* x Zo x Z1 x x Zn, gd = gilD* (i = 1 , 2 , .. . ,n), and let us apply the previous theorem on the set D* instead of D.
8.4. Remark. Some conditions of the theorems above may be weakened at the expense of other conditions. In this way we obtain other versions of the above theorems. On such versions see the papers Jhrai [81] and [86]. In his paper [61] K.-G. Grosse-Erdmann proved an interesting "measurability implies continuity" type theorem. Although his result can be applied only for Cauchy-type equations
(where f and f l are the unknown functions), his conditions are remarkably weak and generalize several earlier results about Cauchy-type equations.
92
Chapter 111. Boundedness and continuity of solutions
Roughly speaking, he removes absolutely all conditions about the dependence of h on variable y. As a special case, h may depend on fo(y),where f o and its range are arbitrary. His conditions on the inner function G are also somewhat weaker than ours. (To compare the two conditions, substitute t = G(x, y) locally.) Here we formulate one of his results. The proof will not be given: this is Theorem 5.A4 in the paper [61] of K.-G. Grosse-Erdmann. 8.5. Theorem [K.-G. Grosse-Erdmann]. Assume that X and Y are Hausdorff topological spaces with Radon measures p and v, respectively. Let T be a first countable topological space, D an open subset of X x Y, and G : D + T a continuous mapping. Let Z and Z1 be topological spaces, let Z1, iand h : Y x Z1 + Z bemappings and suppose f : T i Z, f l : XI that: (1) f ( G ( x , y ) ) = h ( y , f 1 ( 4 ) for all (x,Y)E D ; (2) for each fixed y E Y the mapping h is continuous in the other variable; (3) f l is Lusin p measurable on a measurable subset Al of Xl of positive measure; (4) for every to E T thereisan open subset U x V o f D with0 < p ( A l n U ) < cc and v(V) < cc such that, for every compact subset K of Al n U with p ( K ) > 0, there is a S > 0 and a neighborhood W of t o such that for each t E W the set { y : (z,y) E
D and G(x,y) = t forsomez E K )
contains a v measurable set having measure not less than 6. Then f is continuous. Unfortunately, the proof of K.-G. Grosse-Erdmann does not work if there are several unknown functions fi on the right-hand side. But if we combine his technique with the strong results of 5 19, infra, then under the usual condition 3.1 on the inner functions we can remove any condition concerning the dependence of h on y also in the more general case with several unknown functions. 8.6. Theorem. Let T , Z , and Zi (i = 1 , 2 , .. . , n ) be topological spaces, Y and Xi (i = 1,2, . . . , n ) Hausdorff spaces. Suppose that D c T x Y a n d W C D x Z 1 x . . . x Z n. Let f : T + Z , g i : D + X i ( i = 1 , 2 , . . . , n ) , fi : Xi -+ Zi (i = 1 , 2,... , n ) , and h : W + Z befunctions, v a Radon measure on Y and pi a Radon measure on Xi (i = 1 , 2 , .. . ,n). Suppose that to E T has a countable base of neighborhoods and the following conditions are satisfied:
5 8.
Continuity of measurable solutions
93
(2) for each fixed y in Y , the function h is continuous in the other variables; (3) f i is Lusin pi measurable on the measurable subset Ai of Xi (i = 1,2, ... , n ) ; (4) gi is continuous on D (i = 1 , 2 , . . . ,n); (5) there exist sets V and K such that V is open, K is compact, V x K c D , to E V , u ( K )> 0, and K c ~ Y = ~ ~ ; ( A ~ ) ; (6) for each E > 0 there exists a S > 0 such that B c K and u ( B ) implies pi (gi,t ( B ) )2 6 whenever 1 5 i 5 n and t E V . Then f is continuous at to.
>
E
Proof. Let t , be a sequence in V converging to to. Applying Theorem 19.5 infra for cp defined by cp(y, t ) = gl ( t ,y ) whenever ( t ,y ) E V x K and for the restriction of the measure u to the subsets of K we obtain ~ y ) ) for almost all a subsequence t,, for which f 1 ( g l (tm,, y ) ) + f l ( g (to, y E K. By applying the theorem again for fi, g z , and the subsequence t,,, we obtain a sub-subsequence, etc. Finally, we obtain a subsequence t,, of the original sequence t , such that for almost all y E K we have f i (gi (t,, , y ) ) --+ fi (gi (to,y ) ) for 1 5 i 5 n. Fixing any such y E K, by the functional equation and the properties of h this means that each sequence t , -+ to in T has a subsequence t,, for which f (t,,) + f ( t o ) .Suppose, that f is not continuous at to. Then there exists a neighborhood Wo of f ( t o )for is not a neighborhood of to. If Urn,m = 1 , 2 , . . . is a countable which f neighborhood base of t o ,then let us choose a sequence t, E Urn \ f -'(Wo). No subsequence of the sequence f (t,) can converge to f ( t o ) .
(wo)
8.7. Corollary. Let T , 2, and Zi (i = 1 , 2 , .. . ,n ) be topological spaces. Let Y be an open subset o f R k , Xi an open subset of Rri (i = 1 , 2 , .. . ,n ) , D an open subset of T x Y , and W c D x Zl x x 2,. Let f : T + 2 , g i : D -+ Xi (i = 1,2,... , n ) , fi : X i + Zi (i = 1,2, . . . , n ) , and h : W -+ Z be functions. Suppose that to E T has a countable base of neighborhoods and the following conditions are satisfied:
(2) for each fixed y in Y , the function h is continuous in the other variables;
94
Chapter 111. Boundedness and continuity of solutions
(3) fi is Lusin X T i measurable on the subset Ai of Xi (i = 1 , 2 , . . . ,n); (4) gi is continuous on D (i = 1,2, . . . , n ) ;
(5) X k ( f l ~ = l ~ ~ t l o ( > A i0;) ) (6) the partial derivative
D (i = 1 , 2,... ,n).
dgi dy
- is
continuous and has rank ri for each (t,y) E
Then f is continuous at to.
Proof. This corollary follows similarly from Theorem 8.6, as Theorem 8.3 follows from Theorem 8.2.
9. CONTINUITY OF SOLUTIONS HAVING
BAIRE PROPERTY Results similar to the theorems in 5 8 can be proved for solutions having the property of Baire. Results of this kind can be found in the papers of Haupt [64], Sander [179], and Grosse-Erdmann [61]. Here, again, we start with the investigation of the most general problem, and specializing it we will obtain a less general but easy-to-use result. Theorems 9.1, 9.2 were published in the paper J&rai [81]. Finally, generalizations of the results of Grosse-Erdmann will be proved (unpublished). Further references can be found in the papers cited above. 9.1. Theorem. Let T , Y, Xi, Zi (i = 1 , 2 , . . . , n ) be topological spaces, Zo a a-compact space, and Z a completely regular space. Let D c T x Y, Ai C Xi (i = 1,2, . . . , n ) , to E T, and consider the functions f : T -+ Z , f o : Y + Z o , g i : D -+ Xi, f i : Xi + Zi (i = 1 , 2, . . . ,n ) , h : D x Zo x Z1 x . . . x Z, -+ Z. Suppose that the following conditions hold:
(1) for each (t, y) E D we have
(2) h is continuous; (3) fi has the Lusin-Baire property on the subset Ai of Xi (i = 1,2, . . . , n);
(4) gi is continuous (i = 1 , 2 , . . . , n ) ;
§ 9. Continuity of solutions having Baire property
95
(5) t h e r e e x i s t s e t s V a n d K s u c h t h a t V x K C D , V i s o p e n , t o ~ V , K i s of second category and has the Baire property, and
(6) if B is of second category subset of Y and B c K , then gi,t(B) is of second category subset of Xi whenever 1 5 i 5 n and t E V. Then f is continuous on a neighborhood of to. Proof. Since Ai has the Baire property, there exist sets Ei,M,!, M!', for which Ei is open in Xi, M!', c Ei, the subsets Ml, M,!' of Xi are of first category in Xi, M,! n Ei = 8 , and Ai = (Ei \ Mi') U M,!. First we prove that there exist sets V' and K' for which V' is open in T, K ' is of the second category in Y and has the Baire property, to E V' C V, V' x K ' c D , and K ' C K , and further, n
K' i
n
g$
(Ei)
whenever t E V'.
i=l
Let K" = K n (fl,",lgz~~(Ei)).Since, by condition (6), K n gt;i0(M,!) is of first category in Y whenever 1 i n, we have by condition ( 5 ) that
< <
is of first category in Y. Hence Kt' has the Baire property and is of second category in Y, and n
Let yo be a point of K " such that for every open set W containing yo the set W n K " is of the second category in Y. Since the functions gi are continuous on D , the sets gz:l(~i) are open, and contain the point (to,yo), there exist open sets V' and W' for which to E V' C V, yo E W' c Y , and
It is clear that the set K ' = Kt' n W' satisfies the required conditions.
96
Chapter 111. Boundedness and continuity of solutions
The proof will be complete if we prove that f is continuous on V'. Let t E V' and let U be a neighborhood of f (t) in Z. Since the function fi has the Lusin-Baire property on the subset Ai, there exists a subset Mi" C Ai which is of first category in Xi such that the restriction of fi to Ai \ M,!" is continuous. Let Mi = Mi' U M y . Then the restriction of fi to Ai \ Mi" is continuous (i = 1,2, . . . ,n). Since by condition (6) the sets K ' n :9; (Mi) are of first category in Y, the set
has the Baire property and is of second category in Y. Let B1,B 2 , .. . be an increasing sequence of compact sets of Zo for which U,OO,,Bi = Zo. Since
there exists a compact subset Co = B j of Zo for which f c l ( C o ) f l K"' is of the second category in Y. Let y be a point of this set with the property that W n f;l(Co) n K"' is of second category in Y whenever W is a neighborhood of y. With the notation x; = gi (t, y) and zi = fi(x;), i = 1 , 2 , .. . ,n we have that xi E E i \ M i , fo(y) E CO,and
Let us choose a uniformity on Z for which the uniform topology is the original topology of the space. We can find a reflexive symmetric relation a from the uniformity of Z with the property that z E U whenever z and f (t) are a-near in Z. By Lemma 3.3 there exist neighborhoods V", W", and Ui of t, y, and xi, respectively, such that h(t, y', zo,x i , . . . ,z;) and h(tl, y', zo,zy, . . . ,z:) are a-near for all zo E Co whenever t' E V", y E W", and zj, zl' E Ui. Using the fact that the restriction of fi to Ei \ Mi is continuous at xi we can choose an open neighborhood V , of xi in Ei for which fi (xi)E Ui (i = 1,2, . . . ,n ) , whenever xa E V, \ Mi. Since gi is continuous at (t,y), there exist open sets V"' and W'" for which t E V'" c V" n V', y E W"' c W" n W', and gi(V1" x W"') C V (i = 1,2, . . . ,n). This proves that
n n
w"' n f c 1 ( c o )n K"' c
9%;
i=l
(v,),
5 9.
Continuity of solutions having Baire property
whenever t' E V"'. By condition (6), the sets K"' fl g,;! category in Y if t' E V"', and hence the set
97
( M i ) are of first
is nonvoid whenever t' E V"'. Let y' be an arbitrary element of this set. Then ( t ,y') E D , (t',y') E D, ~ o ( Y ' = ) zo E CO,and gi ( t ,Y ' ) E V , \ Mi, gi (t',Y ' ) E V , \ Mi for i = 1 , 2 , . . . ,n. Hence fi(gi(t,y'))= Z( E Ui, fi(gi(t',y'))= z,!' E Ui for i = 1,2,.. . ,n , and by the choice of V"', W'", Co, and Ui (i = 1,2,.. . , n ) we have that
are a-near in 2, that is, f (t') E U. This proves that f is continuous on V'.
9.2. Theorem. Let T and Zi ( i = 1 , 2 , . . . ,n ) be topological spaces, Y an open subset o f R k ,Xi open subset o f F i (i = 1,2,. . . , n ) , Z a completely regular space, and let Zo a a-compact space. Let D be an open subset of T x Y . Takethefunctionsf : T + Z , f o : Y -+Zo,gi : D - + X i , fi : X i + Z i ( i = 1 , 2 ,. . . , n ) , a n d h : D x Z o x Z 1 x . . - x Z , + Z . Suppose, t h a t t o E T and that the following conditions hold: ( I ) for each (t ,y ) E D ,
(2) h is continuous;
(3)
fi
has the Lusin-Bairepropertyon thesubset Ai o f X i (i = 1,2,.. . , n ) ;
(4) gi is continuous (i = 1,2,.. . , n ) ; (5) the set
g2;:0
( A i )is of second category in Y ;
agi (6) the partial derivative - is continuous and has rank ri whenever ( t ,y ) E 8~ D ( i = l , 2,... , n ) .
98
Chapter 111. Boundedness and continuity of solutions
Then f is continuous on a neighborhood of to.
Proof. We will reduce this theorem to the previous theorem using Lemma 4.3. It is enough to prove that conditions (5) and (6) of the previous theorem are satisfied if K and V are suitably chosen. Because of condition (5) of the present theorem, the set
is of second category in Y, so there exists a yo for which (to,yo) E D and for each open neighborhood W of yo the set
is of second category. By Lemma 4.3 we can choose open sets V and W such that the following conditions hold:
(7) gi,t(B) is of second category as a subset of IFi whenever i = 1 , 2 , . . . , n, t E V, B C W, and B is of second category; 9 , ~ ;(Ai)) ~ has the Baire property and is of second category (8) W n as a subset of Y;
(9) V x W Now let
c D, t o E V, and yo
E W.
Then the conditions of the previous theorem are satisfied and our proof is complete.
9.3. Remark. Some conditions of the theorems above can be weakened if other conditions are strengthened. See the papers J b a i [81] and [86] in connection with such variations. In his paper [61] K.-G. Grosse-Erdmann proved a "Baire property implies continuity" type theorem for the equation f (G(x,y)) = h (y, f 1 (x)), where f and f 1 are the unknown functions. (Theorem 5.B1 in [61]). The virtue of his result is that he completely removes any condition about the dependence of h on the variable y. Apart from this, as the substitution t = G(x, y) shows, his conditions are not very different from ours. Therefore, we do not repeat his results here, but prove a version with several unknown functions on the right-hand side. The proof is based on a result of 5 20 infra.
§ 9. Continuity of solutions having Baire property
99
9.4. Theorem. Let T , Z , Y , Zi, and Xi ( i = 1,2,. . . , n ) be topological spaces. Let D c T x Y , W C D x Z1 x . . x Z,, and consider the functions f : T - + Z , g i : D + X i , f i : X i + Z i ( i = 1 , 2 , . . . ,n ) , h : W + Z . Suppose that t o E T has a countable base of neighborhoods and the following conditions hold:
(2) for each fixed y in Y , the function h is continuous in the other variables;
(3) fi has the Lusin-Baire property on the subset Ai of Xi ( i = 1,2, . . . , n ) ; (4) gi is continuous on D ( i = 1,2,. . . , n ) ;
(5) there exist sets V and K such that V x K C D , V is open, to E V , K is of second category, has the Baire property, and
(6) i f B isasecond categorysubset ofY and B c K , thengi,t(B) isasecond category subset of Xi whenever 1 5 i 5 n and t E V. Then f is continuous at t o .
Proof. Let t, be a sequence in V convergent to t o . Applying Theorem 20.4 infra to cp defined by cp(y,t ) = gl ( t ,y) whenever (t,y ) E V x K, we have f 1 ( g l (t,, y ) ) + f l ( g l ( t o y , ) ) for all y E K except for a set of first category. By applying the theorem again for f z , 92, etc., we have for all y E K except for a set of first category, fi (gi (t, , y )) + f i (gi ( t o ,y)), whenever 1 5 i 5 n . Fixing any such y E K , the functional equation and the properties of h imply that for each sequence t , + to in T we have f (t,) -+ f ( t o ) Since to has a countable base of neighborhoods, this means that f is continuous at t o . 9.5. Corollary. Let T , Z , and Zi ( i = 1,2,. . . , n ) be topological spaces. Let Y be an open subset of Rk,Xi an open subset of F i( i = 1 , 2 , . . . ,n ) , D an open subset of T x Y, and W C D x Z1 x . . . x Z,, and consider the functions f : T + Z , gi : D i X i , f i : Xi -+ Zi ( i = 1,2,. . . , n ) , h : W -+ Z . Suppose that to E T has a countable base of neighborhoods and the following conditions hold:
Chapter 111. Boundedness and continuity of solutions
100
(2) for each fixed y in Y, the function h is continuous in the other variables;
(3) fi has the Lusin-Baire property on the subset Ai of Xi(i = 1 , 2 , . . . ,n); (4) gi is continuous on D (i = 1 , 2 , .. . ,n ) ;
(5) the set
n:=, g;jo (Ai) is of second category in Y;
agi (6) the partial derivative - is continuous and has rank ri whenever (b, y) E
D (i = 1 , 2,... , n ) .
aY
Then f is continuous at t o . Proof. This corollary follows similarly from Theorem 9.4, as Theorem 9.2 follows from Theorem 9.1.
10. ALMOST SOLUTIONS In 1960 P. Erdos raised the following problem: Suppose that a function
f : IR + IR satisfies the relation
for almost all (x, y) E R2 in the sense of the planar Lebesgue measure. Does there exist an additive function g : R R such that f (x) = g(x) almost everywhere in the sense of the linear Lebesgue measure? A positive answer was given to this question by N. G. de Bruijn, and independently, by W. B. Jurkat. For further references, see the book [127] of Kuczma, pp. 443-467. Such "almost solutions" , i. e., functional equations satisfied almost everywhere, arise naturally if we differentiate a functional equation having monotonic or locally Lipschitz solutions. In this case the solutions are known to be measurable. Another case where measurable almost solutions arise naturally are characterization problems of probability distributions. A prototype of such results is the well-known theorem stating that if J and q are independent random variables and J q and J - 17 are independent, too, then all have normal
+
§ 10. Almost solutions
101
+
distributions. Supposing that (, q, ( q, and ( - q have density functions f t , fi7,fCfl),and ft-,, respectively, we obtain that
= 2/B
+
f6,i7(~
U,U
-
u) dudu,
for any Bore1 subset B of R2, where
for almost all (u, v ) E R2. This is a functional equation which is a special case of the equation treated in § 24. A much more complicated characterization problem of probability distributions will be considered in § 23. A further example from harmonic analysis where measurable almost solutions arise naturally is shown in Aczd [7]. If we apply the distribution method to a locally integrable solution of a functional equation (see 5 18) we obtain only that the solution is almost equal to a smooth function. If we can conclude that the solution is continuous, then it is equal to this smooth function. Hence we arrive again at a regularity problem of measurable almost solutions. As the above examples show, the following question concerning measurable almost solutions of functional equations is of great importance in practice: Suppose that the general functional equation considered in 1.7
with unknown measurable functions f , f l , f 2 , . . . , f n is satisfied for all (x, y) from a subset E' of some open subset E of RT x IRk, for which E \ E' has Lebesgue measure zero in RT x Rk. Here all functions are supposed to be defined on some open subset of some Euclidean space, and taking values in some Euclidean space. The known functions H and G, G I , . . . , G, are supposed to be smooth. Is it possible to find functions such that f" = f = fi almost everywhere for i = I , . . . , n and such that replacing and f , f l , . . . , f n with f", f;,. . . , fn, respectively, the functional equation (1) is satisfied everywhere on E?
f"i
102
Chapter 111. Boundedness and continuity of solutions
We will prove that under reasonable conditions the answer is yes and the functions f", i = 1 , . . . , n are continuous. Of course, this question is related to results of 5 8. The tools there seem to be strong enough to answer this problem too, but technically it will be simpler to use the tools of $19, infra. Most of the results of this section was published in JBrai [105]. First, we discuss the connection between almost solutions of explicit and implicit equations. Then we prove two theorems: the first one is a general version, and the second one considers the special case of several real variables. We will also deal with the analogous Baire category question. Some related results of ~ w i a t a kare referred to in l8.5.(2).
f"i,
10.1. Connection between almost solutions of explicit a n d implicit equations. Under certain condition equation
just as in 1.7 - can be reduced to a simpler explicit equation. Namely, suppose, that the term f (G(x,y)) can be expressed from equation ( I ) , and it is possible to introduce a new variable t = G(x, y) instead of x to obtain the functional equation
-
Here D' is the image of E' by the mapping (x, y) (G(x,y), y) . We suppose that this mapping is a diffeomorphism mapping the open set E onto the open set D , hence this mapping and its inverse carry sets having Lebesgue measure zero into sets having Lebesgue measure zero. Now, if we are able to prove that there is a subset T' of the domain T of f such that T \ T' has measure zero and f IT' can be (uniquely) extended to a continuous function f" defined on T , then the functional equation (2) is satisfied with f" instead of f almost everywhere on D, namely, on the set Dh = Di n (Ti x Ktk). This means that if we replace f by f", then equation (1) is still satisfied, at least on some subset Eh of Eo for which E \ Eh has measure zero. Now, repeating this process with f l , we may obtain a subset Ei of Eh and a function f1continuous and equal almost everywhere to f l such that (1) is satisfied i f f and f l are replaced by f" and f;,respectively. Finally, we obtain a subset E" = EL such that E \ Eil has measure zero and (1) is still satisfied if we replace f , f l , . . . , f, by the continuous functions f", f;,. . . , fn equal almost everywhere to them. Since the left-hand side of (1) is continuous on E and equal to zero on a dense
5 10. Almost
solutions
103
subset E" C E, we obtain that equation (1) is satisfied on E if f , f l , . . . ,f, are replaced by f", f;,. . . , f,, respectively. Summarizing, our problem concerning equation (1) can be reduced to the following problem: Suppose that equation (2) is satisfied on a subset D' of the open set D for which D \ Dl has measure zero. Give reasonable conditions under which there is an appropriate subset T' of the domain T of f such that T \ T' has measure zero and the function f IT' has a (unique) continuous extension f" to T . We will prove that the set T' can be chosen to be the set of all points t E T for which the set {y : (t, y) E D \ D') has measure zero. By Fubini's theorem, T \ T' has measure zero. More generally, f" is determined by any subset of this T' which is still dense in T . This will be proved in a much more general setting in the next theorem. The proof uses Theorem 19.5. infra. Note that we use the terminology of Bourbaki concerning topology. The limit of a function at a point is defined by Bourbaki without excluding the point. 10.2. Theorem. Let T and Zi (i = 1,2, . . . , n ) be topological spaces, Z , Y, and Xi (i = 1,2, . . . ,n ) Hausdorff spaces. Suppose that D C T x Y , T' c T , X,' C Xi, W c D x Z1 x . . . x 2,. Let f : T' -+Z, gi : D + X i (i = 1 , 2 , . . . ,n ) , fi : X,! + Zi (i = 1 , 2 , .. . ,n ) , and h : W + Z be functions, v a Radon measure on Y, pi a Radon measure on Xi, and pi (Xi \ X,!) = 0 (i = 1 , 2 , .. . ,n ) . Suppose that to E T has a countable base of neighborhoods and the following conditions are satisfied:
(1) for each fixed y in Y, the function h is continuous in the other variables; (2)
is Lusin 1,2, . . . , n ) ;
fi
measurable on the measurable subset Ai of Xi (i =
(3) gi is continuous on D (i = 1 , 2 , .. . , n ) ; (4) there exist sets V and K such that V is open, K is compact, V x K to E V , v ( K ) > 0, and K c n:,lgz$J~i);
c D,
(5) for each E > 0 there exists a 6 > 0 such that B C K and v(B) implies pi ( g i , t ( ~ )2) S whenever 1 5 i 5 n and t E V ;
2
E
(6) there exists a subset V' of TI n V such that to is contained in the closure o f V 1and for each t E V' for almost ally E K we have
104
Chapter 111. Boundedness and continuity of solutions
Then limtEvl,t-ttof (t) exists.
Proof. Let t, be a sequence in V' convergent to to. Applying Theorem 19.5 for cp defined by p(y, t) = gl (t, y) whenever (t, y) E V x K and for the measure VK we obtain a subsequence t,, for which f l (gl(tmk,y)) + f l (gl(to,Y)) for almost all y E K. Now repeating this with f n , g2, and the subsequence t,,, we obtain a sub-subsequence, etc. Finally, we obtain a subsequence t,, of the original sequence t , such that for almost all y E K we have fi (gi (trn,,Y)) -+ fi (gi (to, Y)) for 1 I i I n and
Fixing any such y E K , by the functional equation and the properties of h this means that each sequence t, -+ to in T has a subsequence t,, for which f (t,,) is convergent. Of course, its limit zo E Z does not depend on y. Hence, for all y from a set K ' C K for which u ( K \ K t ) = 0 we have zo = h (to, 3, h (a(to, V ) ) , . . .
, f n (gn(to, y )) )
Suppose that it is not true that limtEv~,t+t,f (t) = xo. Then there exists a neighborhood Wo of zo such that there is no neighborhood U of to for which the set U n V' is mapped by f into Wo. If Urn, m = 1,2,. . . is a countable neighborhood base of to, then let us choose a sequence t& E U,nV' for which f (t,) @ Wo. Repeating the above process with t k instead of t,, we obtain a subsequence t L j such that for almost all y E K we have ) 1 I i I n and fi(gi(tL,,y)) -+ f i ( g i ( t 0 7 ~ )for
Now, if we choose a y to be in K', then using the functional equation and (7) we obtain a contradiction. 10.3. Theorem. Let Z be a regular space, Zi (i = 1 , 2 , .. . ,n ) topological spaces, and T a first countable topological space. Let Y be an open subset of EXk, Xi an open subset of Rri (i = 1 , 2 , . . . , n ) , D an open subset o f T x Y, and W C D x Z1 x x 2,. Let T' c T be a dense subset, and h : W + 2 . Suppose that the function fi is f : T' + 2, gi : D + Xi, almost everywhere defined on Xi with values in Zi (i = 1,2, . . . ,n ) and the following conditions are satisfied:
105
§ 10. Almost solutions
(I) for all t E T' for almost all y E Dt we have
(2) for each fixed y in Y, the function h is continuous in the other variables; (3) fi is Lusin Xr2 measurable on its domain (i = 1 , 2 , . . . , n); dgi (4) gi and the partial derivative - is continuous on D (i = 1 , 2 , .. . , n); dy (5) for each t E T there exists a y such that (t, y) E D and the partial dgi derivative - has rank ri at (t, y) E D (i = 1 , 2 , .. . ,n). dy Then f has a unique continuous extension to T
Proof. We will reduce this theorem to the previous theorem. We will prove that the limit limtET,,t,t, f (t) exists for each to E T. If this is proved, then defining the extension of f to T by this limit, the theorem is proved. Let us choose for a given to a yo such that (to,yo) E D and the partial dgi derivative - has rank ri at (to,yo) (i = 1 , 2 , . . . ,n). To prove that the
8~
limit exists, we will replace D by a suitable smaller set D*. By Lemma 3.10 there exist open neighborhoods V and Wo of to and yo, respectively, with the following properties: (6) for each E > 0 there exists a S > 0 such that Xri (gi,t(B)) S whenever 15 i < n , t E V, B c Wo, and XYB) 2 E ;
>
(7) g,2(dmn f,) r l Wo is a X"easurab1e subset of Ti@ whenever 1 < i and t E V; (8) Wo is a compact set, V x Wo c D and
Let D * = V x Wo, h* = hlW n (D* x Zl x . . . x Zn), g: = gijD* (i = 1,2, . . . , n ) , and let us apply the previous theorem on the set D* instead of D . Now we consider the problem of Baire property solutions of a functional equation satisfied except for a first category set. The arguments of 10.1 can
106
Chapter 111. Boundedness and continuity of solutions
be repeated in this situation, and therefore here again it is enough to consider the explicit equation 10.1.(2). The following theorem goes back to the work of Haupt 1641.
10.4. Theorem. Let T , Y, Zi, and Xi (i = 1 , 2 , .. . , n ) be topological spaces. Let Z be a Hausdorff space. Let D c T x Y, T' c T , X,/ c Xi, W C D x Z1 x x Z,, and consider the functions f : T' i Z , gi : D i Xi, fi : XI i Zi (i = 1 , 2 , .. . ,n),h : W + 2. Suppose, that Xi \ X,' is of first category and that to E T has a countable base of neighborhoods and the following conditions are satisfied: (1) (2) (3) (4)
for each fixed y in Y, the function h is continuous in the other variables; fi has the Lusin-Baire property on the subset Ai of Xi (i = 1 , 2 , . . . ,n ) ; gi is continuous on D (i = 1,2, . . . , n ) ; there exist sets V and K such that V x K c D, V is open, to E V, K is of second category and has the Baire property, and
ng;t'o(~i); n
KC
i=l
(5) i f B isasecond categorysubset ofY and B C K , thengi,t(B) isasecond category subset of Xi whenever 1 5 i 5 n and t E V; ( 6 there exists a subset V' of T' n V such that to is contained in the closure of V' and for each t E V' except a set y E K of first category of we have
Then limtEv~,t+to f (t) exists.
Proof. Let t, be any sequence in V' convergent to t o . Applying Theorem 20.4 infra for p defined by p(y, t) = gl (t, y) whenever (t, y) E V x K, we have f 1 (gl (t,, y)) i f 1 (gl (to,y)) for all y E K except a set of first category. By applying the theorem again for f 2 , g2? etc., for all y E K except a set of first category we have fi (gi (t,, y)) i fi (gi (to,y)) for 1 5 i 5 n and
Fixing any such y E K , by the functional equation and the properties of h we get that for each sequence t, i to in V' the sequence f (t,) is convergent. Of course, its limit zo E Z does not depend on y. Since to has a countable f (t) = ZO. base of neighborhoods, we have that limtEV/,t+to
§ 10. Almost solutions
107
10.5. Remarks. (1) If the above theorem can be applied for each to E T , and Z is a regular space, then we obtain that f has a unique continuous extension to T . For example, using the method of the proof of Theorem 9.2, we may obtain a result analogous to 10.3. (2) The above theorem can be proved using the simpler methods of 5 9 without referring to Theorem 20.4, although the proof is somewhat longer. (3) As we have already mentioned, the above theorem goes back to the paper [64] of Haupt. He formulated his theorem for the functional equation
which is not a significant difference, but makes the conditions somewhat more complicated. The main difference between his and the above theorem (and the results of § 9) is, roughly speaking, that he assumed that W = D x Z1 x . . . x Zn, h is continuous on W, V = T , and K = Y. (4) In his paper Haupt considers the case when there are countable infinite functions f l , f 2 , . . . on the right-hand side. Theorems 10.4, 10.2, 9.4, 9.5, 8.6, and 8.7 remain true for this more general case. (5) In his paper Haupt also considers more general a-ideals than the a-ideal of the sets of first category. Roughly speaking, he supposes only that the complement of any set from the a-ideal is dense, and the restrictions of the functions fi is continuous on the complement of an appropriate set from the a-ideal. Hence his results can be applied for sets of first category and functions having the property of Baire, but cannot be applied in the case of measure zero sets and measurable functions. (6) In his paper Haupt also proves that even if we do not suppose any continuity or Lusin-Baire property about the functions h, f l , f 2 , . . . , the extension of f (if any) is unique. We can formulate this as the following statement. 10.6. Statement. Let T , Y, Z, Zi, and Xi (i = 1 , 2 , . . . , n ) be sets and y and Xi be proper a-ideals of Y and Xi, respectively. Let D c T x Y, W c D x Z1 x - - .x Zn, and consider the functions gi : D -+ Xi, f i : Xi -+ Zi (i = 1 , 2 , .. . ,n ) , h : W -+ Z . Suppose that the following conditions are satisfied: (1) for each t E T we have Dt $ y; (2) for each t E T if B C Dt, B $ y, then gi,t(B) $ Xi whenever 1 5 i
5 n.
Then there exists a t most one function f" : T -+ Z for which there exist functions fi : Xi + Zi such that fi = fi on Xi except a set in Xi such that
108
Chapter 111. Boundedness and continuity of solutions
for each (t,y ) E D we have
Proof. We have to prove that for any t E T, f(t) is uniquely defined. Let us observe that if the map
is constant on the set Dtl then this constant is uniquely determined even by the functions f i , because the map
takes this value everywhere except a set in
Y
Chapter IV.
DIFFERENTIABILITY AND ANALYTICITY
This chapter deals with the problem of a chain of deductions about solutions of functional equations. Namely, starting from continuity, deducing local Lipschitz property and hence almost everywhere differentiability, then using that for deducing continuous differentiability, then from there multiple differentiability and finally concluding with analyticity. We also examine Holder continuous solutions and solutions having locally bounded variation.
11. LOCAL LIPSCHITZ PROPERTY OF
CONTINUOUS SOLUTIONS The results of this section - and also several results of the subsequent sections - crucially depend on a theorem on differentiability of the parametric integral
F(t)=
Of course, the differentiability of such parametric integrals for h and g smooth enough is classic. For example it plays a r61e in the derivation of the equations of hydrodynamics, see Zeidler [209], IV., pp. 441-445. For us the most
110
Chapter IV. Differentiability and analyticity
important fact is that it is not necessary to suppose that h is smooth. A theorem of this kind was proved in JBrai [86] under the condition that h and dh are continuous and g is twice continuously differentiable. Mikl6s Laczkovich suggested a formula for the derivative of the above parametric integral in which no second partials of g appear. This version was proved with the use of smoothing in the paper JBrai [91]. Although using the homotopy formula of currents and differential forms (see Federer [53], p. 363) it would be possible to give a shorter proof, the essence of the argument, however, would be the same. Therefore we will present here the elementary proof from [91]. For most of the results in this section we make restrictions on the form of the functional equation. First we will prove that for a "sum form" functional equation, continuity implies continuous differentiability. This result will be very useful in subsequent sections. We remark that the same method can be applied to somewhat more general equations; we refer the reader to the applications in chapter VII. Then we treat the special "Cauchy-type" case where the unknown functions appear only three times in the functional equation. These results were published in JBrai [86]. (Formulation of Theorem 5.3 therein contains a misprint.) Finally, we will prove a theorem for functional equations of general type mentioned in Problem 1.18, using an additional compactness condition. The first version of this theorem has been proved for real-valued solutions with real variables (in this case the estimates are simpler) and was published in the paper JBrai [96]. The more general variant is from JBrai [94]. Although this theorem does not solve Problem 1.18 completely, it can be applied to several concrete equations: see for example the "equation of the duplication of the cube" 22.15 and the characterization of Weierstrass's sigma function in section 24. We remark that very recently A. Lundberg found a way to derive differentiability from continuity using sequential derivatives. His method works for real-valued solutions defined on open intervals of the functional equation
See his forthcoming paper [150]. 11.1. Theorem. Let S be a simplex with nonvoid interior in R k , and let U , V, and W be open subsets of R k , RS, and R k , respectively. Let g : (t,y) t, x be a function mapping V x W into U , and let h : (t, x) t, x be a function mapping V x U into R. Suppose, that S c W and
§ 11. Local Lipschitz property of continuous solutions
111
(1) g is continuously differentiable, ( J g t ) y )# 0 if t E V and y E W, and gt is invertible for all t E V; a h are continuous. (2) h and at Then the function r
F(t)=
h(t, x ) dXk ( x ) whenever t E V
is continuously differentiable on V and its partial derivatives are bounded by
where H is the bound of h and H' is the bound of the partial derivatives iJg - on the sets g t ( S ) , t E V ; moreover GI is the bound of the absolute value
at;
ofthe Jacobian Jgt and G is the bound of the norm of the gradient of g on
V x S. Proof. We may suppose without loss of generality that V is connected, and W is convex, so the Jacobian Jgt does not change sign if y runs in W and t runs in V. Let i be an index between 1 and s. In the first step we will a F exists and we will give a formula for it supposing that g and prove that ati h are smooth enough. In the second step using this formula we prove that aF is continuous. In the third step we start to investigate the general case with the help of smoothing of g and h. In the fourth step we prove that the formula we had in the smooth case is valid in the general case as well. In the fifth step the proof will be finished. I. First suppose that h is continuously differentiable and g is twice continuously differentiable. With a simple substitution we get that
and hence denoting the coordinates of g by upper indices,
112
Chapter IV. Differentiability and analyticity
Changing back the integrals to integrals over gt (S), let us investigate the second and the third integral. If H ( t , x) denotes the vector-valued function having h(t, x)
a ' (t,g;' $!ti
(a))as its j-th coordinate, then
hence, from the second and third integral using the Gauss-Green theorem and denoting by n the outer normal we have
where
We will prove that G is zero. Let g, and [ e l , . . . , ek] denote the inverse of gt and the determinant of the vectors e l , . . . , ek, respectively. Moreover, let the upper indices stand for the coordinates. Then
then we have
§ 11. Local Lipschitz property of continuous solutions
113
On the other hand,
that is G(t, x) is equal to zero. 11. As a second step we observe that, under the conditions of the first aF K may be written as the sum of step> d F is continuous on V. Indeed, 7 integrals
Let E o ,El, . . . ,El, be the sides of S . The last integral can be written as
Let us investigate one of the terms of this sum. For simplicity let us omit the index of E. Let us choose an orthonormal base
such that e k is orthogonal to E. Omitting a sign which does not depend on t , we have
where .Jt(x) is the square root of the Gram determinant of the directional
8
&
( t ,g;l(x)), . . . , (t, CJ;'(X)). derivatives ek-1 integral above, we get the integral
By substitution, from the
114
Chapter IV. Differentiability and analyticity
So we have to prove that the integrals (4) and (5) depend continuously on the parameter t. This easily follows from the fact that the functions are continuous, hence uniformly continuous on compact sets. 111. We want to use smoothing to prove that the same representation is valid for dF if we only suppose that g is continuously differentiable and that dh exists and is continuous. Let t' be an arbitrary but fixed point of V. Let us choose an open ball V* with center t' which has a compact closure contained in V and a convex open set W* containing S with compact closure - contained in W. The image of the compact set V* x W* under the mapping g is compact as well. Let us choose an open set U* which contains this compact image and which has a compact closure contained in U. We will investigate the function F on the set V*. The values of F on this set depend only on the values of g on V* x W* and on the values of h on V* x U*; hence multiplying these functions with an infinitely many times differentiable function with value 1 on V* x W* and V* x U*, respectively, F does not change, so we may suppose that g and h have compact supports. Let us extend g and h to the complement of their domain as zero, and for simplicity let us denote the extended functions also by g and h. Let w be an infinitely many times differentiable function with support contained in the unit ball of IWS x Rk w dXS x Xk = 1. Let wE(u) = W ( U / E ) / E ~ + ~ and with the property if E > 0 and let g' = g * w' and h' = h * w' be defined by convolution over RS x IRk. It is well known that g' and h' are infinitely many times a h * we, moreover differentiable, = * w', = * we, ah' -
haXIWk
$
a ,&at-"-+ -+ 3
g, "
ge -+ g, h' i h, and + if&$O, and the convergences are uniform on compact sets. We want to use the results of I and I1 for the smoothed functions on the sets marked by *. To do this we have to prove that gf maps W* into U*, it is invertible on W*, and has a Jacobian different from zero if E is small enough. If K is a compact neighborhood of - gt and Jg; -+ Jgt uniformly on K , hence only V* x W* in V x W , then gf the invertability of g: is nontrivial. Let us observe that the matrix of the inverse of gi may be expressed with the determinant and subdeterminants of the matrix of g:, hence there exits an EO > 0 such that if (t, y) E K and 0 < E < EO, then the inverse of gf'(y) is defined and g:'(y)-l i gi(y)-l uniformly if EJO. Using the proof of the inverse function theorem we get that if Y is an open ball in Rk with center y' E W*,(t: y) E K , and
5 11. Local Lipschitz
property of continuous solutions
115
for each y E Y, then gZ is a one-to-one function on Y. We prove that the radius of the ball Y may be chosen to be greater than a number r > 0 which does not depend on t and y'. Indeed, choosing a positive constant smaller than the infimum of the numbers
and using that
7J
is uniformly continuous on K , we may choose the number
r so that if ly - y'l
< r, then
I ~ ( Y ) - s:(d)/I< c. Now, decreasing
EO
if necessary, we have
on K if 0 < E < E O . So we have proved that g,E cannot map points with distance less than r into the same point. For points having distance not less than r this is impossible as well. To prove it observe that the infimum of the continuous ( 3 )- gt (9') is positive on the compact set function
lgt
/
hence choosing a positive number d less than this infimum, decreasing E O if necessary, we have lgf ( y ) - (y') 2 d on this set if 0 < E < E O . IV. We will prove that in the general case F is partially differentiable and its partial derivative has the same representation as a linear combination of integrals of type (4) and (5) as in the case of smooth g and h. Let
1
if t E V* and E is sufficiently small. Representing F as the last integral on the right side and using the fact that the functions are uniformly continuous on compact sets, and the convergences are uniform on compact sets if ~ $ 0 it, follows that F c ( t )-+ F ( t ) if t 6 V*. Let f l , . . . , f, denote the usual base of RS.Clearly there exists a 6 > 0 for which t' 7f, E V* if I T / 6. Let
+
<
Chapter IV. Differentiability and analyticity
116
By what we have already proved the function p E ( r )can be represented as the linear combination of integrals of type (4) and (5) if we replace t by t' r f i , h by h E , and g by g'. Let us consider the same integrals for g and h and let p ( r ) denote the same linear combination of them. In the same way as in , on the interval the case of functions F E we get that p' -+ p if ~ $ 0 uniformly [-S, 61. Hence the functions
+
converge uniformly to the function
On the other hand,
1-
p E ( e )de = F E(t'
+ r f i )- F E(t'
-
6
6fi)4 F(tl
+ r f i )- F ( t l
-
6f i )
if ~ $ 0 Hence . we have
By step I1 the functions pE are continuous, hence p is continuous on the a F exists and equals p(O), that is the interval [-S, S]. This implies that -
at;
partial derivative
x is the linear combination of integrals of type (4) and
(5). V. To finish the proof we observe that in step I1 we have used only the continuity of functions in the integrals (4) and ( 5 ) . Hence repeating a F is continuous. Since the partial the argument used in step I1 we get that ati derivatives of F are continuous, we have that F is continuously differentiable. Finally, using the representations (4) and ( 5 ) we obtain the required estimate. 11.2. Lemma. Let T , Y , and U be open subsets of 1WS , IW" and RT, respectively. Let D be an open subset o f T x Y . Let h : D x U + R be a ah ah continuous function for which - and - are continuous and g : D 4 U be at av a twice continuously differentiable function, ( t o , y o )E D , and suppose that
§ 11. Local Lipschitz property of continuous solutions
117
@ ( t o ,yo) has rank r . Let S be a simplex in the unit ball o f Rk with nonvoid 8~ interior. Then there exists a convex neighborhood Vo of t o , an Ro > 0 real number, and a constant C such that for each R satisfying 0 < R < Ro we have R S yo C Dt and the function
+
is continuously differentiableon Vo and its gradient is bounded by C R ~ - '
Proof. The proof depends on the previous theorem about differentiation of parametric integrals. . , x r ) . Without loss of generality we may Let g t ( y ) = x = ( x 1 , x 2 ,.. suppose that the Jacobian
is positive at the point ( t o ,y o ) Let us choose open balls Vo and W o with centers t o and yo, respectively, such that Vo x W o c D is satisfied and the Jacobian above is positive if t E Vo, y E W o . Let Ro be the radius of W o . Let x* = ( x ; ,. . . , x,*,x,*+~, . . . , x i ) be defined in the following way: xd = xi, if 1 5 i r and x: = yi if r < i 5 k. We will denote the mapping y F-+ x* by gf . Let p(x*) = x . It is clear that the Jacobian
<
is equal to the Jacobian above and hence nonzero if t E Vo and y E W o . Let L ( t ) = g;'(yo). Suppose that
if y E W o Then by the proof of the inverse function theorem (see Rudin [172],Theorem 9.24), g; is a homeomorphism of W onto an open subset of Rk.Let 1
Taking a smaller Ro and V if necessary we may suppose that
Chapter IV. Differentiability and analyticity
118
whenever t E V and y E W . Now let Uo be an open subset whose compact closure is contained in g,"o ( W ) . By taking a smaller V , we may suppose that Uo c gF ( W ) for all t E V. Now shrinking V and decreasing Ro we may suppose that g,* ( W ) C U for each t E V. Taking x* = g,*(y) we have that the parametric integral
for all t E V satisfies all conditions of the previous theorem. Applying the previous theorem we have that F is continuously differentiable over V and for some constant C . its gradient is bounded by CR"' 11.3. Theorem. Let n, k , r l , . . . ,r,, m l , . . . ,m,, and s be positive integers. Let Xi be an open subset of RTi ( i = 1,2,. . . , n ) , T be an open subset of R S , Y be an open subset o f IRk, Zi be an open subset of Rmi ( i = 1,2, . . . , n ) , D an open subset o f T x Y , and let Z be a Euclidean space. We consider the functions f : T + 2,gi : D + X i , f i : Xi + Zi, , E D and hi : D x Zi -+ Z , ( i = 1 , 2 , . . . ,n ) , and ho : D + Z . Let ( t o yo) suppose that for each ( t ,y) E D ,
ho and
ah0 ahi ahi are continuous and hi, and are continuous (1 5 at dt &I
-
i 5 n); f i is continuous on X i ;
agi gi is twice continuously differentiable and -(to, y o ) has rank ri, 1 5 8~ i 5 n. Then f is continuously differentiable on a neighborhood of to.
Proof. Without loss of generality we may suppose that Z = R. Let us choose a simplex S small enough and an open set V for which S is a neighborhood of yo, V is a neighborhood of to, and V x S c D. Let us integrate equation (1) over S with respect to y. We get that
§ 11. Local Lipschitz property of continuous solutions
119
Since Js dy is a nonzero constant, we must prove that the integrals on the right side of (5) are continuously differentiable over V with respect to t whenever S and V are sufficiently small. This follows from the previous lemma. 11.4. Theorem. Let T , Y, X I , Zo, and Z1 be open subsets of Euclidean spaces of dimensions s, k , r l , mo, and 1, respectively. Let D be an open subset of T x Y. We consider the functions gl : D + X I , f : T + P, f o : Y + Zo, f l : X1 + Z1, and h : D x Z o x Z 1 + R m . Supposethat to E T and the following conditions hold: (1) for each (t, y) E D ,
(2) h is locally Lipschitz;
(3) gl is twice continuously differentiable; (4) f o and f l are continuous;
(5) there exists a yo, for which (to,yo) E D and
&Jl
- has
dy
rank r l .
Then f is Lipschitz on a neighborhood of to.
Proof. We must prove that there exists an open neighborhood V of to and a number M for which
Let zo = fo(yo),zl = f l (gl(to,yo)). Let us choose balls V c T , W C Y , Uo C Zo, and Ul C R with centers to, yo, zo, and zl, respectively, such that fo(y) E Uo, f1(91(t,y)) E Ul, and
whenever t , t' E V and y E W. Now, if we prove that (for a smaller V if necessary) there exists a number M3 such that, for every t , t' E V there exists a y E W such that
then the proof is complete.
120
Chapter IV. Differentiability and analyticity
Let S be a simplex which is a neighborhood of yo and let
By the previous lemma F is continuously differentiable with bounded derivatives if S and V are sufficiently small, that is, there exists a number M4 for which IF(t) - F ( t 1 ) [5 M 4 / t- t'l if t, t' E V. On the other hand, since f l is a real-valued continuous function and
there exists a y E S for which, denoting by XIC the Lebesgue measure on R k , F ( t ) - W') = AYS) . (fl (g1(t, Y))
-
fl(91 (it,Y ) ) )
With this y and with the notation M3 = M~/X'(S)we have
Hence the proof is complete. Under certain additional "compactness" conditions on the given functions gi, we are able to prove that the continuous solutions of a functional equation containing one unknown function are locally Lipschitz functions. The case when the unknown function is real-valued with one real variable was first examined in JBrai [96]. The proof of the general case of a vectorvalued function with a vector variable depends on the differentiability of parametric integrals with respect to the parameter and on the explicit estimate of the derivative which we will use in the form of the previous lemma. This more general theorem - in a somewhat different form - was published in the paper JBrai [91]. A part of the proof will be contained in the following lemma.
11.5. Lemma. Let Z be an open subset o f a Euclidean space, let Z1, Z z , .. . , Zn be convex open subsets of Euclidean spaces. Let T and Y be open subsets o f Rs and IRk, respectively and let Xi be an open subset o f IFi ( i = 1 , 2 , . . . ,n). Let D an open subset o f T x Y , (to,yo) E D. Consider the functions f : T i Z , fi : Xi + Zi, gi : D iXi(i = 1,.. . ,n ) , h : D x Z1 x Z2 x . . . x Zn + Z . Suppose, that
§ 11. Local Lipschitz property of continuous solutions
121
(1) for each ( t ,y ) E D ,
z
(2) h is continuously differentiable,the functions ah are Lipschitz continu-
ous with Lipschitz constant Li, the functions by BI, and Bi, respectively (i = 1 , 2 , . . . ,n ) ;
1 $$/ 1 I and
are bounded
(3) gi is a twice continuously differentiableLipschitz function with Lipschitz constant Li and % ( t o , y o ) has rank ri for i = 1 , . . . ,n; dY (4) the functions fi are continuous with continuity modulus wi Then there exist constants Ro > 0 and co > 0 and an open ball V centered at to such that i f t , t' E V , 0 < R < Ro, then
Proof. Let us fix a simplex S contained in the unit ball of IRk and containing the origin in its interior. Applying Lemma 11.2 for the coordinates of fi we obtain 6 > 0 , Ro > 0 , and C such that for the open ball V centered at t o ,having radius 6 and the closed ball W centered at yo, having radius Ro the set V x W is contained in D and the mappings
are continuously differentiable on V and the norm of the gradient of each of these mappings is bounded by CRk-' whenever 0 < R < Ro. Let t , t l E V . Let R be an arbitrary real number for which 0 < R < Ro. Let us integrate both sides of the functional equation over the simplex RS yo with respect to y . We have
+
Chapter IV. Differentiability and analyticity
122
where IS1 > 0 is the measure of the simplex S. Hence
To get a good upper estimate for the left-hand side we need an upper estimate for the difference
We may apply Taylor's theorem for the function h with points
z = ( t ,y , z l , . . . , z n ) and
2'
=
(t',y, x i , . . . ,z k ) ,
where t', t E V , y E W , zi = fi ( g i ( t ,y ) ) , and 2,' = fi (9i(t1,y ) ) if i = 1 , . . . ,n. The points z and z', and hence the segment with these endpoints are contained in V x W x Z l x - x Z n . Hence
+
" i=l
1 dr, ldh
(rz
+ ( 1 - r ) z i )(zi - zZ))
dr.
Using this and omitting the variables we have
+
By the triangle inequality we obtain n 1 terms on the right-hand side. The trivial upper bound of the first term is 1 SI~ " 6 It' - tl, where B&is an upper bound of Let us use the notation zp = fi (gi ( t , (i = 1 , 2 , . . . ,n ) ,
/ I.
§ 11. Local Lipschitz property of continuous solutions
123
and z0 = (t, yo, z!, . . . ,z:). If hi denotes the value of the partial derivative at zO,then the other terms can be written as the norm of (rz
+ (1 - r ) z i )
-
hi)
First we give an estimate for the norm of the first term of this sum. lzi - zi I has an upper estimate wi ( ~ i J-ttll), because Li is a Lipschitz constant for gi on V x W. Hence
We need an estimate of the difference
+
This is not greater than Li times the norm of r z (1- r ) z l - zO,i.e., Li times the maximal distance between the vectors z1 and zO = (t,yo, zy, . . . , z:), where L: is a Lipschitz constant for The maximal distance between z' and z0 can be estimated by
z.
Hence we obtain the upper bound
for the first term. To obtain an upper bound for the second term, we need an upper bound for the absolute value of
124
Chapter IV. Differentiability and analyticity
because h : is trivially bounded by the upper bound Bi of
1g1.From the
previous lemma we obtain the upper bound It - t'lCRk-I for this integral. Summing up all these estimates, we get the estimate
which gives the statement of our lemma with the notation co = C/lSI. 11.6. Theorem. Let Z be an open subset of a Euclidean space. Let T and Y be open subsets of RS and R k , respectively. Let D be an open subset of T x Y, C a compact subset of T , and W C D x Z n . Consider the functions f : T -+ Z , gi : D + T (i = 1 , .. . ,n ) , h : W + 2.Suppose, that (1) for each ( t ,y) E D we have
(2) h is twice continuously differentiable; (3) gi is twice continuously differentiable on D and for each t E T there d exists a y such that (t, y) t D, gi(t,y) E C, and $(t, y) has rank s for Y z = 1, . . . ,n; (4) the function f is continuous. Then f is a locally Lipschitz function on T .
Proof. The key idea in the proof is to show that the continuity modulus of f satisfies a functional inequality. Since the continuity modulus is not subadditive in general, first we will define a modification of it. For an E > 0 let C, = {x : dist(x, C ) f- E ) denote the (closed) E neighborhood of C. If E f- 0 let C, = C. Let us fix an E > 0 such that C, c T . For each 0 5 r 5 E let
§ 11. Local Lipschitz property of continuous solutions
125
Clearly w is increasing, w(0) = 0, w is continuous at 0 because f is uniformly continuous on the compact set C,, and w (rl r 2 ) 5 w (rl) w (r2) whenever 0 5 rl, r 2 ,rl + r 2 5 E. TOprove the last assertion, suppose that this inequality does not hold. Then there exist x, y such that Ix - yl 5 rl 7-2, x E C,, and y t Ce-lx-vl, but (x) - f (y)l > w ( r 1 ) w ( r 2 ) Suppose that 7-2 5 r l . The point y is contained in the E - lx - yl neighborhood of some point of C , hence x is contained in the E neighborhood of the same point. Choosing x on the segment connecting x and y such that lx - zl 5 TI and Ix - yl 5 r2, we have z E C,-I,-,I, hence (2) - f (z)I 5 w ( ~ I )and (2) - f 5 ~ ( r 2 ) which , is a contradiction. For an arbitrary to E C,, let us choose a yo (depending on t o ) according = f (x:) if i = 1 , 2 , .. . , n. to (3). Let xo = f (to), x: = gi(to,yo), and Let Z* denote the open ball centered at zo, having radius St,. Similarly, let T*, Y*, and Z; denote open balls centered at to, yo, and z t , respectively. Decreasing the radii, if necessary, we may suppose that
+ +
+
If
+
If
If
(Y)I
and that the range of h* = hlT* x Y* x 2; x x Z,* is a subset of Z*. Decreasing further the radii, if necessary, we may suppose that the partial derivatives of h* are bounded Lipschitz functions. Let us choose for the points xf E T open balls X; with radii at most ~ / 3 .Decreasing the radii, if necessary, we may suppose that the range of f: = filX; is contained in 2;. Finally, decreasing the radii of the balls T * and Y* around the points to and yo, respectively, we may suppose that the range of g: = gi IT* x Y* is contained in X;. Let f * = f IT*. Let w," denote the (usual) continuity modulus of f:. We will prove that w,"(r) 5 W ( T ) for a11 r 2 0. If xi, xi E X;, then Ixi - xpl < ~ / 3 and lxi - xsi < ~ / 3 ,hence using that x; E C , we obtain that xi E C, and Ixi - xi I < 2 ~ 1 3 ,which implies that xi E CE-lxi-,:l. Suppose that
Taking the supremum on the left-hand side we obtain the inequality w," (r) 5 ~ ( 4 . Let us apply Lemma 11.5 with the sets and functions marked by *. We get that there exists an open ball KO centered at t o , having radius > 0, and there are constants Rto > 0 and cto such that if t and t' are in this open
Chapter IV. Differentiability and analyticity
126
ball and 0
< R < Rto, then
Of course here the constants Bh, BQ, Li, and Li depend on to, which is not expressed in the notation. Using the inequality between w: and w, from the inequality above we get
For varying to the open balls KOcentered at to and having radii give an open covering of the compact set C,. Hence there exists a finite set To c C, such that the open balls corresponding to the points to E To give an open covering of C,. Let L denote a positive integer, which is greater than or equal to all of the constants Li (i = 1 , 2 , . . . ,n ) corresponding to all points to E To. Similarly, let B', L', and co denote a constant which is greater than Q (i = 0 , 1 , . . . ,n ) , Li (i = 1 , 2 , . . . ,n ) , and cto or equal to all constants B corresponding to all to E To,respectively. We have that if t and t' are in the same ball Vto and 0 < R < Rt,, then
Let S > 0 denote a Lebesgue number corresponding to the compact set C, and open covering K O ,to E To,i.e., let us choose a S such that if t; t' E C,, It - t'l < S, then there exists a to E To such that t , t l E K O . Let 0 < Ro 5 inf{Rto : to E To}. Decreasing Ro and 6, if necessary, we may suppose that L(S Ro) 5 E . Let t be an arbitrary element of C, and let t' be an element of C,-jt-t,i for which It - t'l < S. There exists a to E To for which t , t' E K O . Using the previous estimate and the fact that L is an integer, moreover the
+
5 11. Local Lipschitz
property of continuous solutions
127
subadditivity of w, we obtain that for an arbitrary real number R for which O
I f (t) - f (t')l5 B'lt
If It
- t'l
-
t'l
+ nB'lt - t'l-coR
I R , this estimate can be written
in the form
where cl, c2, c3, and c4 do not depend on t, ti, and R. Taking supremum first on the right-hand and then on the left-hand side for all t E C, and t' E C,-lt-t,l for which It - t'l 5 r , we obtain the inequality
whenever 0 5 r I 6 I R < Ro. If we choose an R which satisfies the condition c2R c3w(R) _< 112 - which can always be done by decreasing 6, if necessary, - we have
+
whenever 0 5 r 5 6. This proves that f is a locally Lipschitz function on C. For an arbitrary t E T, without loss of generality we may suppose that t is an interior point of C , since we may replace C with the union of C and a compact neighborhood of t. Hence the proof is complete.
11.7. Example. Usually the above theorem can be used together with some additional arguments. We illustrate this with a simple example. More complicated cases can be treated similarly. (See the "equation of the duplication of the cube" in 22.15 and the characterization of Weierstrass's sigma function in 3 24.) Suppose, that f satisfies the functional equation
Here f : Rm + is the unknown function, h is a given C" function, Ai, Bi are matrices with m rows and m columns, Bi is nonsingular, and llAil\ < 1.
128
Chapter IV. Differentiability and analyticity
Then every continuous solution is locally Lipschitz (hence all measurable solutions are Cw ) . To prove this, let us fix a natural number N > 0, let T be the open ball with radius N centered at the origin, let us choose an 0 < r < 1 such that liAill +rllBill < 1 (i = 1 , 2 , . . . , n ) , and let D = T x Y , where Y is the open ball with radius r N centered at the origin. Using the previous theorem, we have that f is locally Lipschitz on T , hence it is locally Lipschitz everywhere, because N was arbitrary.
12. HOLDER CONTINUITY OF SOLUTIONS In this section we only prove one theorem. This may turn out to be an important step in the general proof that "continuity implies local Lipschitz property". In any case, it proves that, under the conditions of Problem 1.18 if a solution is locally Holder continuous with some exponent 0 < a < 1, then it is locally Holder continuous with all exponents 0 < a < 1. The proof of the first result of this kind for real-valued functions in the paper JBrai [93] uses the fundamental lemma of the theory of Campanato spaces, which is a generalization of the famous classical Morrey lemma from the regularity theory of partial differential equations. (For further references about this lemma see the book of Zeidler [209], II/A, pp. 90-93.) Later a simpler proof and a generalization for several variables have been found. (JBrai 1921.) This simpler proof is the basis of the proof of Theorem 11.6. 12.1. Theorem. Let 0 < a < 1, and let Z , Zi be open subsets of Euclidean spaces (i = 1 , 2 , . . . ,n). Let T, Y, and Xi be open subsets of JP, Rk,and IFi, respectively. Let D be an open subset of T x Y. Consider the functions f : T + Z, fi : X i + Zi (i = I , . . . , n ) , g i : D -+Xi (i = I , . . . ,n), h : D x Z1 x Z2 x . . . x Z, -+ Z. Suppose, that
(1) for each ( t ,y) E D ,
(2) h is twice continuously differentiable; (3) gi is twice continuously differentiable on D and for each t E T there exists a y such that (t, y) E D and h ( t , y) has rank ri for i = 1 , . . . , n;
8~
(4) the functions fi,i = 1 , . . . ,n are locally Holder continuous with exponent a.
5 12. Holder
continuity of solutions
Then f is locally Holder continuous with exponent 2 a / ( a
129
+ 1).
Proof. We have to prove that for each point to E T the function f is Holder continuous on a neighborhood of to with exponent 2a/(l a ) . Replacing fi with the coordinates we may suppose without restricting generality that Zi c R for i = I , . . . , n. Let us choose yo by (3) for to. By Lemma 11.2 there exist a convex open neighborhood Vo of t o , a simplex with nonvoid interior contained in the unit ball, and constants Ro > 0 and C such that for the closed ball Wo with center yo and radius Ro we have Vo x Wo C D, and the mappings
+
t - 1
fi(gi(t,~))dg RSSyo
are continuously differentiable on Vo with gradient bounded by CR"'. Decreasing Vo and Ro if necessary we may suppose that Ro 5 1 and gi is a Lipschitz function with Lipschitz constant L on Vo x Wo. Similarly, we may suppose that fi is Holder continuous with exponent a and Holder constant H and 1 fi 1 bounded by B on gi (Vo x Wo); moreover on a convex closed set containing
z
the functions ah are Lipschitz continuous with Lipschitz constant L:, and
1% 1 I I
the functions and are bounded by Bb and Bl, respectively (i = 1 , 2 , .. . ,n). Let us fix Ro, Vo, Wo, and yo. We will prove that f is locally Holder continuous on Vo with exponent 2 a / ( l + a ) . Let t, t' denote arbitrary elements of Vo and let 0 < R < Ro. Let us integrate the two sides of the functional equation over the simplex R S yo. We have
+
where c
> 0 is the measure of the simplex S. Hence
130
Chapter IV. Differentiability and analyticity
To get a good upper estimate for the left-hand side we need an upper estimate for the difference h ( t , 3, f l ( g l ( t ,3 ) )7 . . .
, f n (gn ( t ,9 ) ) )
We may apply Taylor's theorem for the function h with points
z = (t,y , z l , . . . , x n ) and
Z'
= (t',y, x i , . . . , z;)
where t ' , t E V , y E W , zi = f i ( g i ( t y, ) ) , and I , . . . ,n. We have
Z:
= fi (gi(t',y ) ) for
i
=
Using this and omitting variables we have
+
Using the triangle inequality, we get n 1 terms on the right-hand side. ' Bb For the first term we get the trivial upper bound c ~ ~ ~ b- ltl,t where is an upper bound of . Let z! = fi ( g i ( t ,y o ) ) (i = 1,2,. . . ,n ) , and let dh z0 = ( t ,yo, Z! , . . . ,z:). If hl denotes the value of the partial derivative dzi at the point xO,than the other terms can be rewritten in the form
1%1
S
(zi-~,!)drdy+h;
(zi
-XI)
dy.
RS+yo
First we give an upper estimate for the absolute value of the first term of - t ' ~ ) where ~ , H is a this sum. An upper estimate of z i - Z : is H ( L I ~ Holder-constant for f i and L is a Lipschitz-constant for gi. Hence
dr dy.
131
fj 12. Holder continuity of solutions
Furthermore we need to estimate the difference
+
This is not greater than LQmultiplied by the norm of r z (1 - r)z' - x O , that is, Li times the maximal distance between the vectors z' and z0 = ( t ,yo , zy , . . . , z:) , where LQis a Lipschitz-constant for ah The maximal
2g.
distance between x' and zOcan be estimated by
Hence we get the upper bound
for the first term. To get an upper bound for the second term, we need an upper bound for the absolute value of
because ihil is trivially bounded by the upper bound Bl of
1
221
. From the
lemma we get the upper bound It - t'ICRbl for this integral. Summing up all these estimates, we get
If It
-
t'l 5 R this can be rewritten as If(t) - f(t')I 5 Colt -t'l
+ Cllt-
It t'l"Rff + Cz-
-
R
t'I
'
where Co, C1, and C2 do not depend on t , t', and R. If we choose R such " ) , we have that it satisfies the condition R = It - t l ~ ( l - " ) / ( l f then
1
+ + C 2 ) ( t t'(201(1Cn)
I f ( t )- f (t') 5 (Co C1
-
and t , t' E Vo. This proves that f is locally whenever It - t' 1 < R0(1+")1(1-") Holder continuous on Vo which implies the theorem.
132
Chapter IV. Differentiability and analyticity
12.2. Corollary. Under the conditions of Problem 1.18, if we only suppose that the functions h and gi are twice continuously differentiable, then every solution f which is locally Holder continuous with some exponent 0 < a < 1, is locally Holder continuous with any exponent 0 < a < 1.
Proof. If f is locally Holder continuous with some exponent a 0 = a , 0 < a 0 < 1, then by the previous theorem it is locally Holder continuous with exponents a, = 2an-l/(l a,-l), n = 1 , 2 , . . . , as well. Since an -f 1, we obtain the statement.
+
13. SOLUTIONS OF BOUNDED VARIATION As we mentioned in 2.14 we follow the terminology of the book of Giusti [60] regarding functions of bounded variation with several variables. Hence, strictly speaking, we are working with a generalization of the notion of bounded essential variation for functions of a real variable (not the generalization of the usual notion of bounded variation). We will prove only one theorem, which says that solutions of locally bounded variation satisfy the local Lipschitz condition. This result appeared in [108]. The following lemma is the key to our main result. 13.1. Lemma. Let V, W, and U be open subsets of RS, R k , and Rr , respectively, f : U -+ R a function of bounded variation, g : V x W + U a continuously differentiable function, to E V, yo E W, and suppose that
%J yo) has rank r. Then there exists a ball Vo c V centered in t o , a ball -(to, 8~
B
C
W centered in yo, and a constant C, such that
whenever t', t E Vo.
Proof. Taking smaller U, V, W if necessary, without loss of generality we may suppose that U, V, and W are bounded convex open sets, W = W' x W" where W' c Rr and W" C lRk-', and g is a Lipschitz function on V x W with Lipschitz constant M, moreover, that the determinant of
5 13. Solutions of bounded
133
variation
is not zero at the point yo, where y' E W ' , y" E W". Let p denote the projection ( y ' , y") +-+ 3' mapping IRk onto Rr. Let g,* denote the mapping ( y ' , y") +-+ (gt ( y ' , y"), Y") of W = W' x W" into U x W". The linear operator L o = gt*,'(yo) has determinant equal to the determinant of
hence it differs from zero. Let
Choose an open convex neighborhood X of Lo such that each operator L in X is invertible and satisfies
Taking an appropriate ball Vo C V centered at to and a smaller W if necessary, we may suppose that g f l ( y o ) E X and
whenever t E Vo and y E W . The last two inequalities imply that
whenever t E Vo and y E W. By the proof of the inverse function theorem (see Rudin [172],Theorem 9.24), from this it follows that gf is a homeomorphism of W onto an open subset of U x W". We need a somewhat stronger result. Let t , t' E Vo and 0 5 T 5 1. Let Gt,tl,T(y) = r g ; ( y ) ( 1 - r ) g ; ( y ) . We need that this function is a homeomorphism. Since
+
and X is convex, this linear operator is in X. On the other hand,
Chapter IV. Differentiability and analyticity
134
and hence
This proves that we may apply the proof of the inverse function theorem for Gt,t~,,and we get that GtItt,, is a homeomorphism of W into U x W". Moreover, from
we obtain the estimate
11
which is valid whenever [IT- SII < l / l I ~ - l / .This implies Ilgt'(y)-l 5 l l c and 5 l / c whenever t , t l E Vo, 0 5 r 5 1, and y E W. Hence the absolute values of the Jacobians of gf-I and G E ~ ,are , bounded by l/ck on g,* (W) and Gt,t~,,(W), respectively. Now we are ready to prove the lemma - first for a continuously differentiable function f . Let B be an arbitrary closed ball in W with center y o Using Taylor's theorem and the substitution x* = Gt,tl,t(y),we have the following estimate:
G:,~.,,(~)-'I
$13. Solutions of bounded variation
135
where ( W UisJ the Lebesgue measure of W" and J denotes the absolute value of the Jacobian. Finally, to get the statement, we use theorem 1.17 from the book of Giusti 1601. By this theorem, there exists a sequence of C" functions on U such that r
if j
+ m, whenever t, t'
E
V. Hence the lemma is proved.
13.2. Theorem. Let T , Y, X I , . . . ,X, be open subsets of PS", FS'", and R r l , . . . , Rrn,respectively, and Z , Zo,21,. . . , Z, be open subsets o f Euclidean spaces. Let D be an open subset of T x Y and W be an open subset of D x Zo x Z1 x x 2,. Consider the functions f : T + Z , f o : Y + Zo, fi : X i + Z i (i = I , . . . , n ) , g i : D + X i ( i = I , . . . ,n ) , h : W -+ 2.Suppose that (1) for each (t,y) 6 D we have
(2) h is continuously differentiable;
136
Chapter IV. Differentiability and analyticity
(3) gi is continuously differentiable on D and for each t E T there exists a y such that (t, y) E D and the rank of (t, y) is ri for i = 1 , . . . , n;
%
(4) f o is continuous and the functions fi, i = I , . . . , n are continuous and have locally bounded variation.
Then f is a locally Lipschitz function.
Proof. Replacing fi (i = 1,2, . . . ,n ) by its coordinates we may suppose that each fi is real valued. Let to E T and let us choose yo by (3). Let us choose open neighborhoods Ui of gi (to,yo) on which fi has bounded variation and neighborhoods V, Wo of to,yo, respectively, such that gi maps V x Wo into Ui. If V and Wo are small enough, we may apply Taylor's theorem for the function h with points z = (t, y, zo, z l , . . . , zn) and z' = (t', y, zo, zi, . . . , z n ) where t t , t E V, y E W, 20 = fo(y), zi = f z ( g i ( t , y ) ) and , = fi(gi(t'7y)) for i = 1 , . . . , n . We have f (t')- f (t) = h(z')
-
h(z) =
[:
-
(rz'
+ (1- r ) z ) (t'
-
t) d r
If V and Wo are small enough, then the partial derivatives of h are bounded by a constant K. Using the previous lemma, we can choose Vo C V and B c Wo such that the statement of the lemma is true for i = 1 , 2 , . . . ,n. Integrating both sides over B we have
where IBI is the Lebesgue measure of B. Using the previous lemma it follows that f satisfies a Lipschitz condition on Vo.
§ 14. Differentiability
137
14. DIFFERENTIABILITY In section 11 we have proved for special "sum form" functional equations that continuity of solutions implies their continuous differentiability. In two other cases we have shown continuity implies the local Lipschitz property. By the well-known theorem of Rademacher (see Federer [53], 3.1.6) a Lipschitz function f : Rm + Rn is Am almost everywhere differentiable. A similar result holds for any locally Lipschitz function mapping some open subset of a Euclidean space into some other Euclidean space. (See the more general theorem of Stepanoff in the book of Federer [53], 3.1.9.) In this section we will prove that differentiability almost everywhere implies continuous differentiability. We will use the results about continuity of measurable solutions. Combining "almost everywhere differentiability implies continuous differentiability" type theorems with the above mentioned classical results, we see that "local Lipschitz property implies continuous differentiability". By Lebesgue's theorem, monotonic functions of a real variable, or vector-valued functions of a real variable that are of bounded variation are almost everywhere differentiable. This result also can be combined with the theorems of this section to prove that "real variable solutions of bounded variation are continuously differentiable". (A theorem of this kind was proved first directly by another method in the paper [94].) In this section we need the fact that the derivative of an almost everywhere differentiable function is Lebesgue measurable. We will prove much more. We will prove that the derivative of any function mapping an open subset of some Euclidean space into a separable Banach space is defined on a Borel set and is a Borel function. This result was published in the paper J&rai [84]and generalizes the analogous result for continuous functions from the book of Federer ([53], 3.1.1.). I believe this result is interesting in itself. 14.1. Theorem. Let U be an open subset of the finite dimensional normed space X , let Y be a separable Banach space, and let f : U + Y be an for any closed subset arbitrary function. Then (f')-'(F) is in the class .Tos F of the normed space L(X, Y) of continuous linear operators mapping X into Y.
Proof. Without loss of generality we may suppose that U = X . It is well-known (see Hewitt and Stromberg [69], p. 78 for instance), that the set C of all continuity points of f is a Gs set of X . It is clear that L ( X , Y) is a separable Banach space. Let us choose a dense subset L1, L 2 ,. . . of F . Whenever i, j, and k are positive integers, let
138
Chapter IV. Differentiability and analyticity
Every Ci,j,k is a relative closed subset of C. To prove this, let x E C \ Cilj,k. Then there exists a n h E X for which lhl < l / j but
+
Let h' be defined by h' = h x - x' whenever x' E X. We observe that h' = x + h, so f (x' h') = f ( x h ) . B y the continuity of f at x, there exists a 6 > 0, such that Ih'l < l / j and
x'
+
+
+
whenever / x - X I / < 6. Now we will prove that
Suppose that f 1 ( x ) = L E F . Then x E C , and for each i there exists an Lk(i)and a n j ( i ) for which
whenever 1 hl
< j ( i ) . This proves
that
whenever lhl < l / j ( i ) ,i. e., x E Cz,j(i),k(i). On the other hand, suppose that x E C and for each i there exists a j(i) and a k ( i ) for which
whenever Ihl < l / j ( i ) . Let il and iq be two positive integers, and let us estimate the distance IILkcil)- Lkci2)11.If Ihi < l / j ( i l ) and jhl < l / j ( i z ) , then we get
5 14. Differentiability
139
This proves that Lk(i),i = 1,2, . . . is a Cauchy sequence in C(X, Y). Let L denote the limit of this sequence. Clearly, L E F. Choosing an arbitrary i, we can choose an i* 2 i for which. IILJ-(~*) - LII < l l i . Hence
whenever I hl < llj (i*), and this proves that L is the differential of f at x. Having proved ( I ) , and from the fact that C is a Gs set, it follows that there exist closed subsets F,,t of X such that
and
i. e., (f')-I (F)E FO6. 14.2. Theorem. Let Z be a normed linear space, Zi open subsets of separable Banach spaces (i = 1,.. . ,n),Y an open subset of R\ T an open subset of IW" , and the Xiopen subsets of Rri . Let D be an open subset of T x Y. Consider the functions f : T -+ Z , fi : Xi -+ Zi, gi : D + Xi (i = 1 , 2 , .. . ,n),h : D x Z1x - .. x Z, -+ Z. Suppose that to E T and the following conditions hold:
(1) for each (t, y) E D ,
dh dh (2) for any fixed y E Y the partial derivatives - and - (i = 1 , 2 , . . . ,n) at dzi are continuous in the other variables;
Chapter IV. Differentiability and analyticity
140
(3) fi is X r i almost everywhere differentiable (i = 1 , 2 , . . . , n ) ;
(4) gi has a continuous derivative, and there exists a yo for which (to,y o ) E D dgi and the matrix -(to, y o ) has rank ri (i = 1,2,.. . , n ) .
d~
Then f has a continuous derivative on a neighborhood o f t o .
Proof. Let us choose open sets T' and Y' with compact closure for which ( t oy, o ) E T' x Y' c T'x Y' c Dl and for every E > 0 there exists a 6 > 0 such that X r i ( g i , t ( ~ ) ) S whenever t E F , B C F. This is possible by Lemma 3.10. Let now D' be the set of all pairs (t',y') for which t' E T ' , y' E Y ' , and f i is differentiable at the point gi(tl,y') for 1 5 i 5 n. Let us observe that D' is a Bore1 set and
>
{y' : y' E Y ' , f,l does not exist at the point gi (t',y ' ) )
has measure 0, whenever t' E TI, because its image by gi,t/ has measure 0 in - Xi. This proves that T' x Y'\ D' has A% X k measure 0. Let X( = gi (T' x Y ' ) (i = 1 , 2 , . . . ,n). Suppose, that (t',y') E Dl and fix y'. We study the function
This function is defined on a neighborhood of t' and has a derivative at t'. Hence f is differentiable at t' and
"
+
C i=l
dh
agi
(t',Y ' , f l
st)), . . . , f. (s. (t',9 ' ) ) )I,'(%(t',9 ' ) ) at (t',y').
(s1(t',
This means that f ' , f l , f i , . . . ,f,, f ; satisfy an equation of the form
for all (t',y') E Dl, i. e., almost everywhere on T' x Y'. We apply Theorem 10.2 to this equation to prove that f' is continuous on TI. Let tb be an arbitrary element of TI. The measurability of the functions fi easily follows from their differentiability almost everywhere. By Lusin's theorem we see that all conditions of Theorem 10.2 are satisfied at tb.
5 15.
141
Higher order differentiability
15. HIGHER ORDER DIFFERENTIABILITY In this section we study the following question: If a solution is p times differentiable, is it also p + 1 times differentiable? Two theorems will be proved. The only difference between them is in their conditions about functions fi and h.
15.1. Theorem. Let Z be a normed linear space, let Zi be open subsets of some separable Banach spaces (i = 1,. . . ,n), let Y be an open subset of some Euclidean space, and let T and Xi, i = 1 , 2 , .. . ,n be open subsets of RS and IWTi, respectively. Let D be an open subset of T x Y. Consider the functions f : T + Z , gi : D + Xi (i = 1 , 2 , . . . ,n ) , fi : Xi Zi (i = 1,2, . . . ,n ) , h : D x Z1 x . . x Z, i Z . Suppose that p > 0 is an integer, to E T, and the following conditions hold: ( I ) for each ( t ,y) E D ,
(2) for any fixed y E Y all partial derivatives 8;' 8; in the other variables whenever 0 5 Icy 5 p;
. . .a,","
h are continuous
(3) the function fi is XT2 almost everywherep times differentiable (1 5 i 5 4 ; (4) gi is p times continuously differentiable and there exists a yo for which &?i (to,yo) E D and -(to, yo) has rank ri (i = 1,2,. . . ,n ) . ay Then f is p times continuously differentiable on a neighborhood of to.
Proof. For p = 1 we obtain the last theorem of the previous section. The proof will be similar to the proof of that theorem. Let us choose open sets T' and Y' with compact closure for which (to,yo) E T' x Y' C T'x Y' C D , and for every E > 0 there exists a 6 > 0 such that XTi (gilt(B)) 2 S whenever B C Y'. This is possible by Lemma 3.10. t E T', Let now D' be the set of all pairs (tt,y') for which t' E TI, y' E Y', exists at the point gi(tr,y') for 1 5 j 5 p and 1 5 i n. Let and X,' = gi(T' x Y') (i = 1 , 2 , . . . ,n). As in the proof of the previous theorem, we obtain that f ', f l , f i , . . . , f,, f; satisfy an equation of type
fp)
<
142
Chapter IV. Differentiability and analyticity
whenever (t',y') E D'. Here hl satisfies the same condition as h but for p - 1 instead of p. Suppose, that (t',9') E D' and let us fix y'. The function
is defined on a neighborhood of t' and is differentiable at t'. Hence f' is differentiable at t'. By differentiating we obtain that
f",fl,f:,f:l,... ;fn,fL,fl satisfy an equation
for all (t',y') E Dl. By induction we obtain that
satisfy an equation
for all (t',y') E D' where h, is continuous. Now as in the previous proof, Theorem 10.2 can be applied for this equation and we obtain that f ( p ) is continuous on TI. 15.2. Theorem. Let Z be a Euclidean space, Y and Zi open subsets o f some Euclidean spaces (1 i 5 n ) , and let T and Xi (i = 1,2,. . . , n ) be open subsets of RS and R T i , respectively. Let D be an open subset of T x Y . Consider the functions f : T + Z , gi : D + Xi ( i = 1,2,. . . ,n), fi : X i :+ Zi (i = 1,2,... , n ) , h : D x Z1x . . . x Zn 4 Z . Suppose that p > 0 is an integer, t o E T , and the following conditions hold:
<
( I ) for each ( t ,y ) E D ,
(2) all partial derivatives 8;' 8F11 .. ever 0 < 1ct.l < p;
h are continuously differentiablewhen-
(3) the functions f i , i = 1 , 2 , ... , n arep times continuously differentiable;
§ 15. Higher order differentiability
(4) gi is p
143
+ 1 times continuously differentiable and
there exists a yo for agi which (to,yo) E D and --(to, yo) has rank ri (i = 1 , 2 , . . . ,n). ay Then f is p 1 times continuously differentiableon a neighborhood of to.
+
Proof. Let 1 5 q 5 s and differentiate equation (1) partially with respect to t,. We have, omitting the variables,
fi = ( f i , j ) , and gi = (gi,k). This equation shows Here xi = (xilj),X i = that, whenever a E Ns , Ial = I , 8" f satisfies the functional equation
for ( t ,y) E D. Here, if the q-th coordinate of a equals 1, all others are 0,
fa,p = -
for some i , j , k ,
8Xi.k
and
for some i, j , k. It is clear that h,,p : D + Z is continuous and its p - 1-th partial derivative with respect to t is continuously differentiable, moreover f,,p : Xi + R for some 1 5 i 5 n and is p - 1 times continuously differentiable if 0 5 p 5 n,. Repeating this process, we have, by induction with respect to la1 that if a E Ns and 1 5 la1 5 p, then 6'" f satisfies the functional equation
144
Chapter IV. Differentiability and analyticity
whenever ( t ,y ) E D. Here : D + Z is continuous and all of its at most p - /al-th partial derivatives with respect to t are continuously differentiable; moreover f,,p : Xi + R for some 1 5 i 5 n is p - Ial times continuously differentiable, and, finally, g,,p = gi for the same i for which dmn f,,p = Xi. Now let a E NS,la/= p, and use Theorem 11.3. We have that every p 1-st partial derivative of f is continuous on a neighborhood of to and hence f has a continuous p 1-st derivative on a neighborhood of t o .
+
+
16. ANALYTICITY In this section a theorem will be proved which can be considered as a first step towards proving analyticity of solutions and was published in the paper JBrai [95]. Another result concerning "Cm implies analyticity" have been found by PBles [163]. He has a general method which is applicable to reduce linear functional equations with two real variables to differential equations. He uses this method to prove that Cm solutions are analytic except for isolated singular points. PQles's method will be discussed, too. We will use some basic properties of analytic functions. All these results can be found in Federer [53]. The following lemma can also be found there (p. 239), but formulated differently. 16.1. Lemma. I f the real functions f and g are p times differentiable at the point t and x = g ( t ) , respectively, then
where the sum is taken for the set S ( p ) o f all sequences a havingp nonnegative integer terms such that P
moreover the coefficients e, are positive integers depending only on a. I f A and B are nonnegative real numbers, then
§ 16. Analyticity
145
Proof. ( 1 ) can be proved easily by induction with respect to p. To prove ( 2 ) let us consider the functions
and
Then
and the derivatives are easy to calculate. Applying (1) for these functions in point t = 0 we get the statement. 16.2. Theorem. Suppose, that the unknown function f : ] a ,b[ + R satisfies the functional equation
where the ci E R (i = 1 , 2 , . . . ,n ) , the functions
are given, and the following conditions are satisfied: (2) g i ( t , y ) is between t and y whenever t , y E ] a ,b [ ;
whenever t , y E ] a ,b[ and p = 1 , 2 , . . . , with some constant 0
< A < 1;
(4) for all t E ] a ,b[ and i = 1 , 2 , . . . ,n the mapping y e gi(t,y ) o f ] a ,b[ into ] a ,b[ is strictly monotonic, and the function gi, for which the mapping x ++ g(t,x ) is the inverse o f the mapping y ++ gi (t,y) , is twice continuously differentiable on its domain. Then every infinitely many times differentiable solution f o f (1) is analytic on ] a ,b[. Proof. We will prove that if [c,d] is a compact subinterval of ] a ,b [ , then there exists a real constant 0 < B < oo,such that
146
Chapter IV. Differentiability and analyticity
for all t E [c,dl. Hence f is analytic on ]c,d[, and the same is true for ] a ,b[. To get the estimation (5) we will use induction with respect to p. For df - the existence of such a B is trivial, because it is continuous. Let us dt dPf (t) is on the left differentiate p times both sides of equation (1). Then dtP side, while the right side is the linear combination of partial derivatives of terms f (gi(t, y)) with respect to t. Now we are going to estimate such a term, for simplicity omitting the subscript i. By the preceding lemma such a partial derivative has the form
where S ( p ) is the set of all sequences a having p nonnegative integer terms such that P
and the coefficients e, are positive integers. Let us integrate both sides of the equation (6) with respect to y from c to d. Then the left side has the form
and the right side is a linear combination of terms of type
In each of these terms let us introduce a new variable with the substitution x = g(t, y). Then we get
By the differentiability of integrals with respect to the limits, and by the theorem concerning differentiability of parametric integrals with respect to the
§ 16. Analyticity
147
parameter, expression (8) is differentiable with respect to t , and its derivative is
We will get an upper bound to the absolute value of this expression in the following way: Since gi is twice continuously differentiable on its domain, there exists a constant 1 < E < m, such that on the compact set
we have
Hence we get the following upper estimate for the second and the third terms in (9):
The estimation of the integral is more difficult. If we take the derivative of the last term in the parenthesis with respect to t, then the upper estimate is
Chapter IV. Differentiability and analyticity
148
and if we take the derivative of the j-th term, then the upper estimate is
Hence the total upper estimate for the integral is
4 (d - c
(A's!)~".
) l ~ ~ p B (Ca)! ' ~ s=l
Summing up for a E S(p), and using the lemma above, the upper estimate for all terms originated from a term f (gi(t, y)) is
Finally, summing up for i, we have that
for all t E [c,
4,if
B>
(
n
2E 4 E 2 +d-- )c ~ / c i ~
and
i=l
Hence it is proved that f is analytic on ] c , d [ .
A 1-A'
B>-
5 16. Analyticity
149
16.3. Pgles's method. Up to the end of this 5 we will treat PAles's general method to find C" solutions of functional equations and to prove their analyticity. Although our treatment will be different from that of Pdes, everything here has to be attributed to him. A large part of the material is in his paper [163]; other parts he explained during our conversations. Let us see first the basic idea. Consider the functional equation
with unknown real functions f o ,f l , . . . , f n and with two real variables x, y. Applying differential operators
we obtain from (1) a system of fiinct ional-differential equations
If we consider all the differential operators having order 5 m, then we obtain (m I ) (m 2)/2 equations. Let us consider the (m 1 ) n unknown quantities f j(0, 1 - 0 , . . . , m , j = 1,.. . , n. If m is large enough we have more equations than unknowns. Solving some of the equations for some of the unknowns and substituting the result back into the other equations we may hope to arrive at an equation having the form
+
+
+
Now let us fix any yo and let us introduce the variable t = go(x,yo). Solving this equation we may express x as a function of t , say x = ljo(t,yo). Hence we obtain the differential equation
Solving this equation, we obtain fo. Similar procedures work for further unknown functions. Let us observe that we may get more than one equation similar to (2). This may be used to eliminate some terms, for example some higher order terms from (3). Another possibility is to apply the procedure to (3) and to use the resulting system to eliminate some higher order terms.
150
Chapter IV. Differentiability and analyticity
More generally, we may consider functional equations in several variables. Let D c RN and D j c RNj , j = 0 , 1 , . . . , n be open subsets, where Nj < N for j = 0 , 1 , . . . , n. Let gj : D -+ D j be given, "sufficiently smooth" functions and h also a "sufficiently smooth" function mapping an open subset of some Euclidean space into R. Let us suppose that the unknown, but "sufficiently smooth" functions f j : D j -+ R satisfy the functional equation
Applying all differential operators id, dl, 6'2,. . . , d ~d:, , d l & , . . . , d;, . . . , d F up to order m we arrive at a system of functional-differential equations
where the symbol (dmf j ) (gj (x)) has the meaning
Now we may eliminate from this system all terms except (dmf o )(go(x)) to obtain a partial differential equation for fo. To include the vector-valued case, too, even more generally, we may start with a system of functional-differential equations such as (6) with k > 0 and m 2 0. For a fixed M 2 m let us apply all partial differential operators d", a E NN, jal 5 M - m to all equations of system (6). We arrive at another system
Here for the number of equations
5 16.Analyticity
151
and for the number of terms coming from (dMf j ) ( g j (x)), j = 1 , 2 , . . . ,n (which we would like to eliminate) we have
To see that for sufficiently large M we have more equations than unknowns we observe that
because the numerator and denominator are polynomials in M having degree m - 1 and m, respectively. Let us observe that this general method works even for composite equations, if we know that the solutions are "sufficiently smooth". After eliminating the outer unknown functions fi, i = 0 , 1 , . . . ,n, we arrive at a functionaldifferential equation for the unknown functions gi. If the equation is still composite we may continue this process. 16.4. PAles's m e t h o d for linear equations. We do not have good tools to solve general nonlinear equation systems. This suggest that we restrict ourselves to the linear case. Let us suppose, as above, that D c IRN and D j C I R N 3 , j = 0, 1 , . . . , n are open sets, where N j < N for j = O , l , . . . , n. For simplicity, let us suppose, that the given functions g j : D + Dj and h, h j : D + R, j = 0,1, . . . ,n are analytic, and D is connected. Let us suppose that the unknown functions f j : Dj + R satisfy the linear equation n
(1)
Chj(~)fj(~j= ( ~h(x) ) ) whenever x E D. = jO
Chapter IV. Differentiability and analyticity
152
If the unknown functions are at least m times continuously differentiable, we may obtain from this equation a system of linear equations
where the functions hi, hi,, : D -+ IF2 are analytic. More generally, starting with any system (2), supposing that the functions f j , j = 0,1, . . . ,n are M times continuously differentiable, and applying any set of linear differential operators
with analytic coefficients a, : D + R, we arrive at a similar system as (2) but with some K instead of k and M instead of m. The advantage of supposing that the functions hi, hh,j are analytic is, that we may consider the quotient field JF of the ring of all real-valued analytic functions over D (or the quotient field of any subring of this ring, if all of the functions hi, hi,, are in this subring). The notions of ordinary linear algebra, such as linear dependence and independence, rank, determinant, etc. can be applied to investigate the solvability of system (2). We will use two matrices H and H*. In both the elements are hj,,, in both the line index i runs from 1 to k. In H the column index runs on all pairs (j,a),1 5 j 5 n , la1 5 m. n , la/ 5 m. In H* the column index runs on all pairs (j,a),0 5 j
<
In this setting, the main result is: 16.5. PBles's t h e o r e m . With the above conditions and notation, suppose that rank(H) < k. Let us suppose that for some
c
{
2.
k}
and
J c { ( a , j ) : la1 5 m , l 5 j l n )
with card(1) = card(J) = rank(H) the determinant of the corresponding submatrix of H is nonzero. Then system 16.4.(2), i E I can be uniquely solved for the unknowns (8" f j ) o g j , (a,j) E J . Putting these solutions back into equations 16.4.(2), the resulting equations do not contain any terms (dof j ) o g j , 1 5 j 5 n, la/ 5 m. If, moreover, rank(H*) > rank(H), then at least one of the resulting equations depends on the term (dmf o ) 0 go. In this setting, the statement is obvious. We remark, that PBles formulated the statement differently, namely, that starting with equation 16.4.(1)
§ 16. Analyticity
153
there exists a non-trivial linear differential operator "killing" all the terms f j (gj (x)), j = 1 , 2 , .. . ,n , and if rank(H) < rank(H*), then this operator does not "kill" the term f o (go(x)). Moreover, he does not suppose that the functions gj, h j are analytic. See his paper [I631 for details.
16.6. PBles's algorithm. The above theorem gives us an algorithm to solve linear functional equations or even linear functional-differential equation systems. Starting with such a system, and getting newer and newer equations by applying differential operators, we may always check the rank of the corresponding H and H * . This can be done most efficiently using Gaussian elimination. It is also possible to add the new equations in blocks, say, for example, all equations corresponding to differential operators with a given order. This procedure will not result in loss of information: If rank(H) < rank(H*) for a given system of equations, then this inequality will stay true if we add new equations. If the equations are not homogeneous, then it is better to calculate the rank of a new matrix, too, which we get by adding a new column hZ, i = 1 , 2 , .. . , k to H*. If this increases the rank, a nontrivial condition has been found which should have been satisfied by functions hZ. e @ to carry We remark that recently A. HAzy [65] wrote a ~ a ~ l program out these calculations automatically. We will give examples in 22.11, how to use this algorithm. Computations will be given as transcripts of computer interaction to avoid programming technicalities that make the essence of the procedure hard to grasp. Another related idea proposed by Baker is to solve the linear functional equation system 16.4.(2) in the distribution sense. This is based on the possibility to define the pullback of a distribution: if D and E are open subsets of Euclidean spaces and g : D + E is a submersion, then there exists a unique continuous linear map g# : D1(E) + V1(D) such that g # ( f ) = f o g when f : E -+ C is a continuous function having compact support. We call g# (f ) the pullback of f . See the details and proofs in Hormander [74], chapter VI. Baker suggested using the pullback of distributions and regularity results for distribution solutions of differential equations to find distribution solutions of functional equations. Two examples are given in his paper [34]. Using the conditions of 16.4, 16.7. PBles's regularity theorem. suppose that N = 2, Nj = 1 for j = 0 , 1 , . . . , n , rank(H) < rank(H*), and that go maps D onto Do. Then f o is analytic on an open subset of Do whose relative complement to Do is discrete. @ ~ a ~is la eregistered trademark of Waterloo Maple Inc
154
Chapter IV. Differentiability and analyticity
Proof. By the above theorem we obtain a differential equation having the form
where a , a j are analytic functions on D , r 5 m, and a, is not identically zero on D . Now we introduce a concept to be used in this proof only. A point t on Do is called a singular point with respect to a real analytic function h defined on D if go (x, y) = t always implies h(x, y) = 0. We show that the set of singular points is a countable set without limit points in D , provided that h is not identically zero. Assume, on the contrary, that there exists a non-trivial convergent set, = to E Do. Without quence t, of singular points such that limn,, loss of generality, we can also assume that this sequence is strictly monotonic, say increasing. Choose a point (xi,yi) € g,l(ti) arbitrarily. Let (x*,y*) E D be fixed. Since D is connected, there exists a continuous function (u, v ) : [O, 11 -+ D such that
Using the Weierstrass approximation theorem and Lagrange interpolation, one can find polynomials arbitrarily close to u and v satisfying (2), too. Thus we may assume that u and v are polynomials themselves. Hence the functions cp, $ defined by
are analytic on [ O , l ] , further we have $(O) = t o , $(I) = tl. By Bolzano's theorem, for each n there exists 7, E [ O , l ] such that $(r,) = t,. Since t, is a singular value this yields cp(r,) = 0. Being analytic, cp must be identically ) h(x*,y*) = 0, and this shows that h is identically zero zero. Then ~ ( 1 1 2 = on D . The contradiction obtained proves our statement concerning singular values. The derivative of go cannot be identically zero on D since then go would be constant, but go(D) = Do is open, so this is impossible. Thus we may assume that &go is not identically zero on D. Using the above statement, we see that the set of singular points with respect to aTdlgois countable and has
155
$16. Analyticity
no limit points in Do. Let to E Do be a regular (i.e. non-singular) point. We show that f o is analytic at to. The non-singularity of to implies the existence of a point (xo,yo) E D such that a, (xo,yo) # 0 and &go(xo,yo) # 0. The latter property says that the function x H go(x, yo) is strictly monotonic in a neighborhood of xo. Thus there exists a strictly monotonic analytic function p such that go (p(t),yo) = t in this neighborhood and p(to) = xo. In a (possibly smaller) neighborhood U of to we also have a, (p(t),yo) # 0. Substituting now x = p(t) (t E U) and y = yo into ( I ) , we arrive at the ordinary linear differential equation
fi') +
c
a; (t)f f ) = a*(t),
t t U7
where a; (t) = a j (,u(t),yo) /a, (p(t),yo) , j = 0, 1,. . . ,r - 1, and a* (t) = a ( p ( t ) ,yo) / a , (p(t), Since all coefficients of (3) are analytic functions, hence f o must be analytic on U.
Chapter V.
REGULARITY THEOREMS ON MANIFOLDS
In this chapter there is only one section. Here we generalize the most important results to manifolds.
17. LOCAL AND GLOBAL RESULTS ON
MANIFOLDS As mentioned in the introduction, most of the regularity results can be generalized to manifolds. Most of the theorems are of a local nature, and thus generalization causes no problem in their case. Theorem 11.6 is of a global nature hence we have to give a new proof for manifolds, which differs in several technical details from the original one. In what follows we only formulate the generalization of the most important theorem for manifolds. In the case of "local" theorems the more general version can be easily obtained from the special one. In the case of the only "global" theorem we give an independent proof. These results appeared in [log].
17.1. Theorem: Measurability implies continuity. Let T and Z be topological spaces and let Zi be topological spaces with countable base. Let Y and Xibe differentiable manifolds (i = 1 , 2 , .. . , n), D an open subset o f T x Y , and W c D X Z1x . - . x 2,. Considerthefunctions f : T + 2,
158
Chapter V. Regularity theorems on manifolds
gi : D + Xi, fi : Xi + Zi (i = 1 , 2 , . . . , n ) , and h : W + Z. Suppose, that to E T has a countable base of neighborhoods and the following conditions are fulfilled: (1) for each (t, y) E D we have
(2) for each fixed y in Y, the function h is continuous in the other variables;
(3) the function fi is Lebesgue measurable on the subset Ai of Xi (i = 1 , 2 , .. . ,n);
(4) n:=,
g,2(Ai) is not a null set;
(5) for each t E T, gi,t is a C1 submersion and considering the open subset of T times the tangent bundle of Y of all triples (t, y, v), where (t,y) E D and v is in the tangent space of Y at y, the mapping
is continuous into the tangent bundle of Xi (i = 1 , 2 , .. . ,n). Then the function f is continuous at to. Proof. The theorem will be reduced to Corollary 8.7 using local coordinates. Since Y can be covered by the domains of countably many charts, there exists a chart cp such that the intersection of its domain Wo with the set n:=,g;;io (Ai) is not a null set. Consider the image of this set by ip and let yo* be a density point of it. On sufficiently small neighborhoods of the points gi(to,yo) there are charts ti defined. Let us choose a small enough neighborhood V of to and let us decrease Wo, if necessary, so that V x Wo c D and gi (V x Wo) c dmnti (i = 1,2,. . . ,n) are satisfied. Let W," = rng(cp), D* = V x W;, X; = rng((i), A: = &(Ai), gZ(t,y*) =
if (t, y*) E D * and (t, ip-'(y*), zl, . . . ,zn) E W. Corollary 8.7 can be applied to the sets and functions denoted by * and hence the proof is complete.
5 17. Local and global results
on manifolds
159
17.2. Theorem: Baire property implies continuity. Let T , Z , and Zi ( i = 1,2,. . . ,n ) be topological spaces. Let Y and Xi be differentiable manifolds ( i = 1,2,. . . , n ) ,D an open subset of T x Y and W C D x Z1 x x 2,. Consider the functions f : T -+ 2, gi : D -+ X i , fi : Xi -+ Zi (i = 1,2,. . . , n ) , and h : W + Z . Suppose that to E T has a countable base o f neighborhoods and the following conditions are fulfilled: (1) for each (t,y) E D we have
(2) for each fixed y in Y , the function h is continuous in the other variables; (3) the function fi has the Lusin-Baire property on the subset Ai o f Xi ( i = 1,2,.. . , n ) ;
(4)
nEl
( A i ) is of second category as a subset of Y ;
g Z ~ ~ O
(5) for each t E T , gi,t i s a C 1 submersion and considering the open subset of T times the tangent bundle of Y of all triples ( t , y , v ) ,where ( t , y ) E D and v is in the tangent space o f Y at y, the mapping
is continuous into the tangent bundle of Xi ( i = 1 , 2 , . . . ,n) Then the function f is continuous at t o . Proof. The theorem will be reduced ordinates. Since f l ~ = l g (zA~i~) is of second for which ( t o ,yo) E D and for each open is of second category -I ( A ) ) WOf l (n?=1gi,to this result reduces to Corollary 9.5.
to Corollary 9.5 using local cocategory in Y , there exists a yo neighborhood W o of yo the set in Y. Now in local coordinates
The next theorem is the most important result of this section; it states that continuous solutions are locally Lipschitz. The proof depends on Lemma 11.5.
17.3. Theorem: Local Lipschitz property of continuous solutions. Let 2, T , and Y be manifolds, D an open subset o f T x Y , C a compact subset of T , and W an open subset o f D x Zn. Consider the functions f : T -+ 2, gi : D i T ( i = 1,. . . , n ) ,h : W 4 Z . Suppose, that
160
Chapter V. Regularity theorems on manifolds
(1) for each (t, y) E D we have
(2) h is twice continuously differentiable; (3) gi is twice continuously differentiable on D and for each to E T there exists a yo such that (to,yo) E D , gi(tO,yo) E C, and y H gi(to,y) is a submersion at yo whenever i = 1, . . . ,n; (4) f is continuous. Then f is a locally Lipschitz function on T.
Proof. Let us choose and fix a Riemannian structure on 2,Y , and T (see Dieudonnk [49], 20.7.13). We will use a distance on all three manifolds resulting from the fixed Riemannian structure, and for simplicity we will denote this with Q in all three cases. This distance is defined in the following way: on all components of the given manifold there exists a natural distance Q' ([49], 20.16.3). Let ~ ( xy), = min{l, ~ ' ( x , y ) )if x and y are in the same component of the manifold, otherwise let Q ( X , y) = 1. With this definition, Q is a distance generating the topology of the manifold, and if ~ ( xy), < 1 then x and y are in the same component, ~ ' ( xy) , is also defined, and equal to d x , Y). The proof uses the continuity modulus of f . Since this is not a subadditive function in general, we will use a modification. For this modification, we will prove that it satisfies a functional inequality, which will imply that f is a locally Lipschitz function. For an E > 0 let C, = {x : Q(X, C ) 2 E ) denote the (closed) E neighborhood of C . Let C, = C if E 0. Since T is locally compact, there exists an open neighborhood of C with compact closure. The distance of the complement of this neighborhood from C is positive. If E is less than this distance, then C, is a compact set. Let us choose for each point t of C an open neighborhood & in the same component, and a number 0 < ~t < 112 such that each open ball with center in & and radius ~t is strictly geodesically convex (see [49], 20.17.5). Let us choose from the open covering &, t E C a finite covering of C , and let E > 0 be less than the corresponding values ~t so that C, is compact. Let
<
§ 17. Local and global results on manifolds
161
Clearly w is increasing, w(0) = 0, and w is continuous at zero, because f is uniformly continuous on the compact set C,. We will prove that
We may suppose that 7-2 I r l . Suppose that w(r1 +r2) > w(r1) +w(rn). Then there exist x and y such that e(x, y) I r l r2, x E C,, and y E CE-e(z,y), but ~ ( (x), f f (y)) > w ( r l ) ~ ( r p )For . some c E C we have
+
+
hence ~ ( xc), 5 E. This implies that for some ~t > E the open ball centered at c and having radius Q is strictly geodesically convex, hence there exists exactly one geodetical arc connecting x with y and having length e(x,y). , Choosing the For an arbitrary point z on this arc, ~ ( xy), = ~ ( xz), ~ ( zy). , ~ ( xz), I r l , and e(z, y) I 7-2, point z in such a way that ~ ( yz), I ~ ( xz), we obtain
+
<
hence z E C,-,(,,,) C C,_,(,,,j. This implies ~ ((x), f f ( 2 ) ) w(r1) and Q (f (z), f (y)) Iw (r2),a contradiction. Let us fix an integer 7 > 1. For an arbitrary to E C, let us choose a yo by (3) (depending on to). Let zo = f (to),20 = gi(to,yo), and z: = f (x;) if i = 1 , 2 , . . . , n. Applying 20.17.1, 20.17.5 from [49] to the component of the manifold Z containing 20, we have that there exists a 0 < St, < 112, such that for all 0 < r I St, the open ball centered at zo and having radius r is strictly geodesically convex. Decreasing dt,, if necessary, we may suppose that the normal coordinates II, are defined on the open ball centered at zo and having radius St,, moreover, that if z and z' are in this ball, then
1 -
7
( z )-z
Ie
,I7
-
II,(z1)I,
i.e., II,and its inverse are Lipschitz functions between the open ball-neighborhood of zo with radius St, and its image in the coordinate space (which is also an open ball having radius St,, centered at z i = II,(zO)).(See 1491, 20.16.4, 20.16.5.) Let Z* denote the open ball centered at zz and having radius St, . Similarly, we may choose strictly geodesically convex open balls to the points to, yo, and z: of the manifolds T, Y, and Z , respectively, such that and their inverses are defined and are the normal coordinates 7 , cp, and
Chapter V. Regularity theorems on manifolds
162
Lipschitz functions with Lipschitz constant q. Let T*, Y*, and Z$ denote the image of these balls in the corresponding coordinate spaces. Decreasing the radii, if necessary, we may suppose that
and that the function
x Z; into Z*. This mapping will be denoted maps the set T* x Y* x Zf x by h*. Decreasing further the radii, if necessary, we may suppose that the partial derivatives of h* are bounded Lipschitz functions. Continuing similarly, let us choose for the points x: of the manifold T strictly geodesically 3 which the normal coordinates convex open balls Xi with radii at most ~ / on ti are defined and are Lipschitz functions together with their inverses with Lipschitz constant at most 17. Let X$ denote the image of these balls in the corresponding coordinate space. Decreasing the radii, if necessary, we may suppose that the mapping
fc.
Finally, decreasing maps X,* into 2;.We will denote this mapping by the radii of the balls T* and Y* around the points t; = r ( t o ) and y; = $(yo), respectively, we may suppose that the mapping
is a Lipschitz map of T * x Y* into X; ; it will be denoted by 9:
. The mapping
t* i $(f (rv'(t*))) of T * into Z* will be denoted by f * . Let w$ denote the (usual) continuity modulus of We will prove that w:(r) 5 qw(qr) for all r 2 0. If xi, xi E Xi, then Q(X~,X:) < &/3 , using that 2; E C , we obtain that xi E C, and and Q(X{,X;) < ~ / 3 hence @(xi,xi)< 2 ~ / 3which implies that xi E C,-,(,,,,;). Suppose that Iti(xi) ti(xi) 5 r . Then @(xi,xi) q r , which implies that ~ ( (xi), f f (xi)) w(qr). Hence
ft.
I
<
<
§ 17. Local and global results on manifolds
163
Taking the supremum on the left-hand side we obtain the inequality wd (r) I nw(rlr). Let us apply Lemma 11.5 with the sets and functions marked by *. We have that there exists a strictly geodesically convex open ball centered at to and having radius 0 < < 112, on which the normal coordinates r are defined, and there are constants Rt, > 0 and ct, such that if t, t' are in the open ball mentioned, and 0 < R < Rt,, then
5 B;ne(t, t')
+
n
B;ne(t, t') i=l
2
Of course here the constants Bh, Bl, Li, and Li depend on to, which is not expressed in the notation. Using the inequality between wd and w, from the inequality above we have
+ C rlw (Liq2@(t,t')) L: ( v Q ( ~t'), + R + i=l
c n
i j w ( n ~ (oe(t, j t')
+~ 1 ) ) .
j=1
give an For varying to the open balls Vt, centered at to and having radii open covering of the compact set C,. Hence there exists a finite set To c C, such that the open balls corresponding to the points to E To give an open covering of C,. Let L denote a positive integer, which is greater than or equal to all of the constants Li (i = 1,2, . . . ,n ) corresponding to all points to E To. Similarly, let B', L', and co denote a constant which is greater than or equal to all constants Bl (i = 0 , 1 , . . . ,n ) , Li (i = 1,2,. . . ,n ) , and ct,
164
Chapter V. Regularity theorems on manifolds
corresponding to all to E To, respectively. We have that if t and t' are in the same ball KOand 0 < R < Rto, then
Let S > 0 denote the Lebesgue number corresponding to the compact set C, and open covering Vto, to E To, i.e., let us choose a S such that if t, t' E C,, e ( t , t l ) < 6, then there exists a to E To such that t,t' E K O . Let 0 < Ro 5 inf(Rt, : to E To}. Decreasing Ro and 6, if necessary, we may suppose that qL(776+ R o ) 5 E. Let t be an arbitrary element of C, and let t' be an element of C,-,(t,t,) for which ~ ( tt'), < 6. There exists a to E To for which t , t' E KO. Using the previous estimate and the fact that L and rj are integers, moreover the subadditivity of w, we obtain that for an arbitrary real number R for which 0 < R < Ro,
If e(t, t') 5 R, this estimate can be written in the form
where cl, c2, cg, and c4 do not depend on t, t', and R. Taking supremum first on the right-hand, and then on the left-hand side for all t E C, and t' E CE-,(t,t,) for which e(t, t') 5 r , we obtain the inequality
5 17. Local and global results on manifolds
165
< < < + <
whenever 0 r S R < Ro. If we choose an R which satisfies the condition c z R c3w(R) 112 - which can always be done by decreasing 6, if necessary, - we have
< <
whenever 0 r S. This proves that f is a locally Lipschitz function on C. For an arbitrary t E T , without loss of generality we may suppose that t is an interior point of C, because we may replace C with the union of C and a compact neighborhood o f t . Hence the proof is complete. We remark that this theorem, as Theorem 11.6, can be better used with some additional argument. (See the example after Theorem 11.6.) For equations having a more special form we can state more. 17.4. Theorem. Let T , Y, Z,, and Xi be manifolds (i = 1 , 2 , .. . , n ) . Let D be an open subset of T x Y and let Z be a Euclidean space. Consider the functions f : T + Z, gi : D + Xi, fi : Xi + Zi, hi : D x Zi -+ Z , (i = 1 , 2,... , n ) , and ho : D + Z. Let (to,yo) E D and suppose that
(1) for each ( t , y ) E D ,
(2) hi is continuously differentiable (0
< i 5 n);
(3) fi is continuous on Xi (i = 1,2, ... , n ) ; gi(to,y) is a submersion (4) gi is twice continuously differentiable and y at yo for 1 5 i < n. Then f is continuously differentiableon a neighborhood of to. Proof. The theorem can be reduced to Theorem 11.3 using local coordinates. If there are only three unknown functions and one of them is real-valued, then also we can say more: 17.5. Theorem. Let Z , T , Y , X I , Zo, and Z1 manifolds, Z1 with dimension 1. Let D be an open subset of T x Y . Consider the functions gl : D + X 1 , f : T + Z , f o : Y + Zo, f l : X I + Z1, a n d h : D x Zo x Z1 -+ Z. Suppose that to E T and the following conditions are satisfied:
166
Chapter V. Regularity theorems on manifolds
(1) for each ( t ,y) E D ,
(2) h is a locally Lipschitz function; (3) gl is twice continuously differentiable; (4) f o and f l are continuous;
(5) there exists a yo such that ( t o ,yo) E D and y t-+ ggl ( t o ,y) is a submersion at Yo. Then f is a locally Lipschitz function on a neighborhood o f t o . Proof. Using local coordinates we may reduce the statement t o Theorem 11.4. 17.6. Theorem: almost everywhere differentiable solutions are continuously differentiable. Let Z be a Banach manifold, Zi ( i = 1, . . . , n ) be separable Banach manifolds, Y , T , and Xi ( i = 1,2, . . . ,n ) manifolds. Let D be an open subset of T x Y . Consider the functions f : T + Z , f i : Xi -+ Z i , g i : D - + Xi ( i = 1,2, . . . , n ) , h : D x Z 1 x . - . x Z n -+ 2 . Suppose, that to E T and the following conditions are satisfied:
(1) for each ( t ,y) E D ,
dh dh (2) for any fixed y E Y the partial derivatives - and - ( i = 1,2,. . . ,n ) at azi are continuous in other variables; (3) fi is almost everywhere differentiable ( i = 1,2, . . . ,n ); (4) gi is continuously differentiableand there exists a yo, such that ( t o ,yo) E D and the mapping y t-+ gi(to,y) is a submersion at yo ( i = 1 , 2 , . . . ,n ) . Then f is continuously differentiable on a neighborhood of t o . Proof. Using local coordinates the statement may be reduced to Theorem 14.2.
Let Y , Z , T , Xi ( i = 17.7. Theorem: C P solutions are in C p + ' . 1 , 2 , . . . , n ) , and Zi (0 5 i 5 n ) be manifolds. Let D c T x Y be an open set. Consider the functions f : T -+ 2, gi : D + Xi ( i = 1,2,. . . , n ) , f i : X i : - + Z i ( i = 1 , 2,... , n ) , f o : Y - + Z o , h : D x Z o x Z 1 x ~ ~ ~ x Z n + Z . Suppose that p > 0, t o E T , and the following conditions hold:
§ 17. Local and global results on manifolds
167
(1) for each ( t ,y ) E D,
(2) the function h is p
+ 1 times continuously differentiable;
(3) fo is continuously differentiable and the functions f i , i = 1,2, . . . ,n are p times continuously differentiable;
+
(4) the functions gi are p 1 times continuously differentiable, and there exists a yo such that ( t o ,y o ) E D and y c, gi(to,y ) is a submersion at yo whenever i = 1,2,. . . ,n. Then f is p 1 times continuously differentiableon a neighborhood of t o .
+
Proof. Using local coordinates we may reduce this theorem to Theorem 15.2.
Chapter VI.
REGULARITY RESULTS WITH FEWER VARIABLES
As explained in the introduction (see 1.41), roughly speaking, all the results of 55 5-17 prove regularity of an r place function f that is the solution of a functional equation only if there are at least 2r variables in the functional equation. The last two examples of 1.21 show that the number of variables cannot be reduced without introducing some additional conditions. The first general regularity results which overcome this difficulty were given by ~ w i a t a k .She applied her distribution method to generalizations of the mean value equation
(hi E R,gi E Rr are fixed) with unknown function f and proved that continuous solutions are in CCO. First she investigated the generalized mean value equations
Chapter VI. Regularity results with fewer variables
170
(see [195], [196], [138], [197], [198]) and finally the equation (1) n
(see [199], [201]). Equation (1) is "almost linear", so, formally, it is much less general than equation (1) in problem 1.IS. However ~wiatak'stheorems can be applied even if the rank of
dgi is much less than the dimension of d~
-
the domain of the unknown function f. Roughly speaking, the method of ~ w i a t a kcan be applied even if f is an r place function and there are only 1 variables; this is the minimal number of variables if the equation is r not a functional equation in a "single variable". So the results of Swiatak suggest that the rank condition in the problem 1.18 is too strong, and the results concerning problem 1.18 can be extended for other cases, at least if we add some further conditions. In this chapter we discuss regularity results for functional equations with "few" variables, i. e. with r place unknown function but with less than 2r (but more than r) variables. First we briefly overview the main results of ~ w i a t a k in 5 18. Then new 'Lmeasurabilityimplies continuity", "Baire property implies continuity", and "continuity implies Cw" type results will be treated in 5s 19-21, respectively. Most of the new results can be applied to the nonlinear functional equation (1) of Problem 1.18.
+
Here we briefly overview ~wiatak'sdistribution method. Most of the material here is presented as in her paper [201], but her papers [196], [197], [198], [199], [200], and [202] and her joint paper [I381 with Lawruk were also used.
18.1. Parametric families of distributions. The essence of ~wiatak's method is to consider parametric families of distributions. Let T be an open subset of RS, and let us consider a Cw function h : T x IWr -+ R and a function g : T x Rr + IWr . If f E D'(Rr ), we consider the parametric family h(t, .) f 0 g(t, .) = h(t, .) (f o g(t, .)) , t E T of distributions and its differentiability with respect to t. To this end, let 0 < p j m and let us suppose
5 18. ~ w i a t a k ' smethod
171
that g and (t,s) e g;l(s) are in Cp(T x Rr ), for each t E T the function gt is a Cw diffeomorphism, and that for each multiindex a E N8, la1 5 p the function y e (drg)(t,y) is in Cw(Rr), where a? = dff1dy2- - - d:~, i. e. a partial differential operator acting only in the variable t. The next Lemma shows that calculating formally
we obtain a distribution, which in some sense acts as the derivative corresponding to of the family. To simplify our notation let us denote by d,ol (h(t, .) f o g(t, .)) the distribution obtained by calculating (1) formally. Of course, this expression depends an t. Let
a,ol(h(t,.) f o g(t, .)) 1 t=to denote the
distribution obtained by substituting t = to after the formal differentiation.
18.2. Lemma [Qwiatak]. Under the conditions of the above definition, for any test function cp E V ( F ) we have
whenever t E T .
Proof. This is Lemma 4.1 in [201]. Although it is very important, we do not repeat here the proof, which is of technical nature. 18.3. Theorem [Qwiatak]. We use the notation of the previous definition. Let T be an open subset of PSS, let to E T , and let 0 < p 5 ca. Let f : IlBT + R, gi : T x Rr -+ Rr (i = 1 , 2, . . . , n ) , gi : Rr + Rr (i = n + l , . . . , m ) , hi : T xPST + R (i = 0,1, . . . , n ) , h : I F xRm-n be functions, and suppose that the following conditions are satisfied: (1) whenever t E T and y E Rr we have
(2) h is continuous and hi is in Cw whenever 0 5 i 5 n; (3) if n < i 5 m then the function gi is continuous; (4) if1 5 i 5 n then for each t E T themappinggi,t i s a C w diffeomorphism of F onto itself, the mappings gi and (t, s) ri g,? (s) are in CP ( T x IWT ) , and afgi,t is in Cw(PST) whenever /3 E N8, 1/31 5 1);
Chapter VI. Regularity results with fewer variables
172
(5) gi(to,y)
-- y whenever I I i I n;
(6) there exists an ct. E Ns for which 0 < jal 5 p such that each distribution solution f of the partial differential equation
is represented by a function in C". Then every continuous solution f of (1) is in C". Note that there are well-known conditions for partial differential equations which imply that distribution solutions are in Cm. For example, for any Cm function on the right hand side in (6) if the differential operator on then the left hand side is of constant strength, hypoelliptic at an yo E F, any distribution solution of (6) is in Cm; see Hormander [73], p. 176.
Proof. Let f be a continuous function. For any fixed t E T both sides of (1) can be identified with a distribution. From (1) we obtain that for any test function cp E D ( R T )we have
Using the previous Lemma we obtain that f as a distribution satisfies
whenever t E T. Substituting t = t o and using (5) we obtain that f as a distribution satisfies the partial differential equation in (6). Hence by (6) we obtain that f is almost everywhere equal to a CW function. Since f is continuous, it is equal to this Cm function. The following corollary (Theorem 6.4 in ~ w i a t a k[201]) is less general but easy to apply.
5 18. Swiatak's
method
173
18.4. Corollary [~wiatak]. Suppose that the conditions of the previous theorem are satisfied with s = 1 and p = 2, but instead of (6) we have a (to,y) span the space PST and hi (to,y) > (6') for every y E Rr the vectors (-gi) at 0 for 15 i 5 n. Then every continuous solution f is in C" .
Proof. We have to prove that (6) of the previous theorem is satisfied. The principal part of the equation
can be written as
where gi = (gi,. . . , gr ). The corresponding quadratic form for
< = (J1, . . . , <,)
To prove Q(E) > 0 for ( # 0 notice that Q ( < )= 0 if and agi y), i = 1 , . . . , n . But this is impossible only if is orthogonal to all -(to, at because these vectors span Rr. Therefore Q([) > 0 for [ # 0, i. e. (7) is an elliptic differential equation. Using regularity theorems of the theory of partial differential equations (see Hormander 1731, pp. 176-177) we obtain from the previous theorem that f E Cm. because hi(to,y)
<
> 0.
18.5. Remarks. (1) From the previous general theorem, which is sometimes difficult to apply, further much more special, but easier to apply results can be deduced. Applications of the previous general theorem to generalized mean value type equations are Theorem 6.2 and Theorem 6.3 in ~ w i a t a k [201]. In these cases differentiation twice and four times with respect to the (scalar) variable t is used, respectively. It is remarkable, that in the second case the proof that the resulting linear partial differential equation with constant coefficients is hypoelliptic is far from being trivial (10 pages).
174
Chapter VI. Regularity results with fewer variables
(2) The method of Swiatak is capable of treating locally integrable solutions, too, but in this case we only obtain that each solution is almost everywhere equal to a Cm function. Adding not very restrictive assumptions one can prove that all the locally integrable solutions are equal almost everywhere to another solution which is in Cm (see Swiatak [197], [199], [202]). The fact that every locally integrable solution of 18.3.(1) is a Cm solution was proved by ~ w i a t a kunder a rather strong assumption implying s 2 r, i. e. that there are at least 2r variables, if the unknown function f is an r place function (see [197], [199], [202]). (3) With the method of ~ w i a t a kwe may deduce that continuous solutions are analytic (see Lawruk and ~ w i a t a k[138]). For example, if the linear partial differential operator on the left hand side of 18.3.(6) is elliptic with analytic coefficients and the right hand side is analytic, too, then Theorem 7.5.1 in Hormander [73], p. 178 implies, that all continuous solutions f are analytic. (4) The method can be easily extended to treat functional-differential equations; see ~ w i a t a k[200]. (5) Among applications we compare ~wiatak'smethod with ours on some examples, see 22.16, 22.17.
19. BETWEEN MEASURABILITY AND
CONTINUITY What seems to be most important for equations with "few" variables is to prove "measurability implies continuity" type results, because by the method of Swiatak we may only start with continuous solutions - a consequence of the distribution method. To the best knowledge of the author such "measurability implies continuity" type results without the strong rank condition in 1.18.(3) or some abstract version of it were known earlier only for very special equations such as for example the mean value equation
(hiE R, gi E Rm are fixed) in the paper of McKiernan [158]. The proof there is based on algebraic properties of the solutions. In this 5 we will prove a "measurability implies continuity" type result for the general explicit nonlinear functional equation (1) from Problem 1.18
§ 19. Between measurability and continuity
175
without the strong rank condition 1.18.(3) on the inner functions. In the spirit of the "bootstrap" method in 1.7, we introduce a sequence of propinterpolate between measurability and erties, which - roughly speaking continuity. This sequence of properties gives a stairway to climb up from measurability to continuity. First we will investigate the basic properties of the new notions. Then the regularity theorem will be proved. An example is given how to apply the theorem in non-trivial cases. A refinement of the theorem is also proved. Finally, further properties of the new notions are investigated. These results have been published in [110]. -
19.1. Definition. Let X be a set, Y a metric space, and f : X + Y be a function. Let U be a Hausdorff space with the Radon (outer) measure p , and P a topological space, the "parameter space" with a given point po E P. Let p be a function from U x P into X. We will think of cp as a surface p, : u e p ( u , p ) for each p, depending on the parameter p. Lusin's theorem (2.7) and generalizations of Steinhaus' theorem in 53 suggest that the following condition is connected with measurability:
(L) For each E > 0, each a > 0, and for each compact subset C exists a neighborhood Po of po such that if p E Po,then
cU
there
The condition above can be reformulated in the following sequential way: (S) For each a Pm -$ PO,
> 0, for each compact subset C C U , and for each sequence
In this form the condition strongly resembles convergence in measure. Riesz' theorem suggests the following condition: (R) For each sequence pm -+ po there exists a subsequence pmi such that for almost all u E U we have
Chapter VI. Regularity results with fewer variables
176
This condition resembles the following condition treated by Trautner in a special case for characteristic functions of measurable sets (see remark below): (T) For each sequence p, + po and for almost all u E U there exists a subsequence pmZsuch that
To investigate the connection between these conditions we need some kind of measurability condition: (M) u
Hf
( p ( u , p o ) )is p measurable.
It is clear that conditions (L) and (S) have meaning also if the values of f are in a uniform space Y; simply a has to be replaced by a reflexive symmetric relation from the uniformity of Y and we have to consider the set of those points u for which the two values o f f are not a-near. Condition (R) has the advantage that it has meaning even if Y is only a topological space. The same is true for (T) and (M). It seems that (T) has no advantage over (R). We will often check condition (L) [(S), (R), (T), (M)] locally. If for each uo E U there is a neighborhood Uo of uo and Po of po such that plUo x Po satisfies (L) [(S)],then p also satisfies (L) [(S)].To see this, we will choose a finite covering of C by open sets having finite measure and we will apply (L) [(S)]to a sufficiently good inner approximation of these open sets by compact sets: Let us choose for each x E C a neighborhood Ux of x and a neighborhood P, of po such that p(Ux x P, satisfies (L). Shrinking U, if necessary we may suppose that U, is open and has finite p measure. Let U,, , . . . , U,, be a finite subcovering of C, let E, a > 0 and let us choose compact sets Ci c Uxi for which p(Uxi \ Ci) < ~ l ( 2 r )Choosing . a neighborhood Po of po for which Poc n;='=Pxi , such that the sets
have p measure less than ~ / ( 2 r for ) each p E Po,we obtain that
because this set is covered by u:=~&(p)u US='=, (Uxi\ Ci ) . Similarly, if p, + p and (S) is satisfied by pjUx x P,, then for given E , a > 0 for i = 1,.. . ,r we
5 19.
Between measurability and continuity
177
obtain an Mi such that for m 2 Miwe have p, E PZiand Ri (p,) has p measure less than E / ( 2 r ) for each m Mi. Hence for m M = maxl
>
>
Similarly, if for each uo E U there is a neighborhood Uo of uo and Poof po such that pJUox Po satisfies (R) [(T), (M)], then supposing that U is a Lindelof space we have that cp satisfies (R) [(T),(M)]. For (R) this follows using the diagonal process. Countably many of the sets Uo cover U. Let us enumerate these open sets, and let us consider repeatedly sub-sub-. . . sequences of the sequence p,. The diagonal process gives a subsequence, for which the convergence is satisfied almost everywhere. In the case of (T) and (M) the statement is trivial. Let X be an open subset of Rn and 0 k 5 n. The class of all functions f for which the condition (L) [(S), (R), (T), (M)] is satisfied whenever U is an open subset of R\ p = AX", P is an open subset of some Euclidean space, po E P, and p : U x P + X is a C1 function for which p, is an immersion of U into X for each p E P, will be denoted by & ( X , Y) or shortly by Lk [Sk, R k ,G,Mk].(Recall, that a C1 mapping of U into X is an immersion if and only if its derivative is an injective linear mapping for each point of U. For k = 0, take R0 = (0) and An ((0)) = 1, i. e., An is the counting measure on RO. A function p : (0) x P += X is a C1 function if and only if p H p(0,p) is a C1 function. Any function mapping a subset of R n , i. e., 0 or (0) into X is considered an immersion.) In the first two cases we suppose that the values of f are in a uniform space, in the other three that they are in a topological space. It is clear that f E MI, if and only if the condition
<
(M') f 0
+ is p measurable
is satisfied for p = Ahhenever
)I
is an immersion of some open subset U of
RX"into X. 19.2. Remarks. (1) For our purposes, the function class R k ( X ,Y) will be the most convenient one, because we want to avoid supposing that Y is a uniform space. It is even more important, that using R k ( X ,Y) we can avoid supposing uniform continuity for the given functions in our regularity theorems and it is enough to suppose continuity. The classes Mk and C k will also play a role. Our main results will show that, roughly speaking, solutions f from are also in Rk. We will prove that R o is the class of continuous functions, and that all An measurable functions f : X + Y
Chapter VI. Regularity results with fewer variables
178
from the open subset X C Rn into some second countable space Y are in R,. Hence, step-by-step, measurability of solutions implies their continuity. (2) In his paper [203] Trautner proved that for a Lebesgue measurable subset M of [a, b] c R with positive Lebesgue measure and for a sequence p, E [a,b] there exists a u E R and a subsequence p,, such that pms +u E M . This follows from the fact that a Lebesgue measurable function is in 5. Indeed, let us replace p, with a subsequence converging to a point po E [a, b]. Let f = be the characteristic function of M , and let cp : R x R -+ R be E 5 it follows that for almost all u E M + p o there cp(u,p) = u - p . From exists a subsequence pms such that
tM
tM
+
This means that u p,, E M for large enough s. Trautner used his theorem - among others - to give a new proof of the well-known result of Steinhaus, that measurable additive mappings of R into itself are continuous. Trautner's method was generalized to locally compact groups and to an even more general setting by Grosse-Erdmann [61]. We cited his result in 58.
z,
(3) The class Lk [Sk,R k , Mk]remains the same if we suppose only that (L) [(S), (R), (T), (M)] is satisfied whenever U is an open subset of R k , p = A', P is an open subset of some Euclidean space, po E P, and cp : U x P -+ X is a C1 function for which cp, is an embedding (i. e., an immersion which is a homeomorphism of its domain onto its range) of U into X for each p E P. This easily follows from the locality principle mentioned in the definition. Similarly, supposing only that cp,, is an immersion, the resulting class LI, [ S k , R k , M k ] remains the same.
z,
(4) In condition (L) [(S)]the words "for each compact subset C of U" can be equivalently replaced by "for each a-finite measurable subset C of U" . This easily follows using inner approximation by compact sets. We start with the investigation of the simplest connections between the classes Ck, Sk,Rk, 5 ,and M k . 19.3. Theorem. With the notation of the definition above, condition
(L) implies condition (S). If the point po has a countable base of neighborhoods, then (L) follows &om (S). If the uniformity of Y has a countable base and p is a-finite, then (S) implies (R). (R) always implies (TI. If Y is a
§ 19. Between measurability and continuity
179
uniform space with a countable base of topology, (R) is satisfied, and ( M ) is satisfied for all po E P , then (S)is satisfied, too. Hence, if Y is a separable metricspace, then& = S k C R k C G a n d & n M k = S k f l M k = R k n M k .
Proof. It is easy to see that (L) implies (S) and if the point po has a countable base of neighborhoods then (L) follows from (S). Condition (R) implies (T) trivially. The proof that if Y is a metric space and p is 0-finite then (S) implies (R), mimics the proof of the classical Riesz' theorem: Let C be an arbitrary compact subset of U and let us choose a sequence ai 4 0. We may choose a subsequence pmi such that the set
has p (outer) measure less than 2-'. Let Ai denote a p-hull of this set. Now if u is not in the zero set fl,00=, U& Ai, then
for all i 2 j for some j . Let us choose a countable almost cover of U by compact sets C1, C2,. . . . Let us consider repeatedly sub-sub-. . . -sequences of the sequence p,. The diagonal process gives a subsequence, for which the convergence is satisfied almost everywhere. The same proof works in the case of a uniform space having a countable base of uniformity. Now suppose that Y is a separable metric space. If f satisfies (M) for every po E P, then we obtain that u u cp(u,p) is p measurable for all p E P. Using that Y is separable, we obtain that for any pair p,pl E P the mapping u u (f (y(u,p)),f ( c p ( ~ , ~ ' ) ) ) of U into Y x Y is measurable too. This implies that for each pair p,pl E P the mapping
is measurable. Now suppose that (S) is not satisfied by cp with po E P. This means that there is a sequence p, -+ po, a > 0, E > 0, and a compact set C c U such that the measure of the measurable sets
180
Chapter VI. Regularity results with fewer variables
is greater than E for infinitely many m. Let us choose a subsequence pmi for which each of the measurable sets
has measure 2 E. Then for an arbitrary subsequence p,. for any u from the %3 measurable set flp=l UzkAij having measure 2 E we have
This is contradictory to f satisfying (R). Hence we have
The same proof works for second countable uniform spaces. 19.4. Theorem. We will use the notation of 19.1. Let Y be a topological space and X an open subset of Rn. Then M o ( X , Y ) = yX and R o ( X , Y ) = % ( X , Y ) = C(X,Y), the class of continuous functions from X into Y. If Y is a uniform space, then also L o ( X ,Y) = So(X,Y) = C(X, Y).
Proof. It is trivial that Mo contains all functions from X into Y. Now let us prove that any continuous function f : X + Y is in Ro, hence also in To. There are only two cases, U = 0 or U = (0). In the first case, there is nothing to prove; in the second case we may choose p,, = pk. The converse is proved indirectly: if f E To,but not continuous, then there exists an xo E X , a sequence x, -+ xo, and a neighborhood W of f (xo) such that f (x,) $ W. Let U = {0), P = X , po = X O , cp(0,p) = p for p E P. Choosing a subsequence of the sequence p, = x, for which
we obtain a contradiction. If Y is a uniform space, f is continuous, and C = {0), then every po E P has a neighborhood Po such that if p E Po,then f ( I P ( o , ~ ) and ) f (cp(~,po)) are close enough, whence f E Lo c So. Supposing f is discontinuous at an xo E X, and choosing U = C = {O), P = X , po = xo, p(0,p) = p for p E P, we obtain a sequence p, + po such ~ )f)( p ( O , p ~ ) ) are not close, which shows that f is not in that f ( ' ~ ( 0 , ~and so.
5 19.
Between measurability and continuity
181
We will prove that Lebesgue measurable functions over an open subset
X of Rn are in R,. To make the connection with results of §§ 3, 5, 6, 8, 10 clear, we do the main part of the proof in the following abstract setting:
19.5. Theorem. W e will use the notation o f 19.1. Let P be a topological space, U and X Hausdorff spaces with Radon measures p and v , respectively. Suppose, that p is a-finite. Suppose that cp : U x P -+ X is a continuous function with the following property:
(1) For each E > 0 there exists a S > 0 such that i f p E P , B C U , p ( B ) 2 E , then v(cp, ( B ) ) 6.
>
Suppose, moreover, that po E P and f is a Lusin v measurable function with values in a topological space Y on some measurable set D containing cppo ( U ) . Then for U , P , po, cp, and f the conditions (M), ( R ) , and ( T ) are satisfied. If, moreover, Y is a uniform space, then (L) and (S) are also satisfied. Proof. Let us first prove that ( M ) is satisfied. Let us represent U as the union of countable many compact sets and a set having measure zero. Let Di, i = 1 , 2 , .. . be the image of those compact sets under cp,,. Let D' = UZ1Di. Then v ( D i ) < cc and p ( D \ D ' ) ) = 0. Let us choose a sequence of compact sets Ki,j, j = 1 , 2 , .. . in Di such that v(Di \ K i I j )4 0 as j + cc and f lK;,j is continuous. Let V be any open subset of Y. Since ( f J K ; , ~ ) - ' ( is v )relatively open in Ki,j, it is a Borel subset of X. With the notation K = UZl Uj00,' Ki,j we see that B = ( f ~ K ) - ' ( V )is a Borel subset of X. The set E = D' \ K has v measure zero, hence the set N = ( f IE) -' ( v ) is also a zero set. Now let us observe that
(92
On the left-hand side, cp;(B) is a Borel set and by condition ( I ) , the set cp,-d- (N) has measure zero, and cp,-d- ( D \ Dl) also has measure zero. This means that ( M ) is satisfied. Now we suppose that Y is a uniform space and we will show that (L) is satisfied. Let C be a compact subset of U , and let K = cp,, ( C ) . Let E > 0 and let us choose a S > 0 corresponding to &/2 by (1). Let us choose an open subset V containing K such that v ( V \ K ) < 612. Since f is a Lusin measurable function, there exists a compact subset KO of K such that v ( K \ K O ) < 612 and f lKo is continuous. Let us choose a uniformity on the compact Hausdorff space KO compatible with the topology. Since f lKo is also uniformly continuous, for each reflexive symmetric relation a from
182
Chapter VI. Regularity results with fewer variables
the uniformity of Y there exists a reflexive symmetric relation ,B from the uniformity of K O such that f (x) and f (x') are a-near in Y whenever x and x' are @-nearin Ka Let us choose a reflexive symmetric relation y from the uniformity of K Ofor which y o y c P. For each u E C there exists an open neighborhood U, c U of u and an open neighborhood P, of po such that U, x P, is mapped by cp into V and each point of cp(U, x P,) which is in K O ,is y-near to cp(u,po). Choosing a finite subcover U,, , U,, , . . . , U,, of C , we obtain that for each p E Po the mapping cpp maps C for Po = nZ1PUt into V and for any u E C , if p(u, p) is in K O then it is @-near to cp(u,po). Let p E Po and let us consider the set C n cp;' ( K O )n cp,-d- ( K O ) .This set is mapped into K O by cp, and by cp,, too, and for any u from it, p(u, p) and p(u, p o ) are ,B-near in K O ,hence f (cp(u,p)) and f (cp(u,p o ) ) are a-near in Y. If we prove that the complement of this set has measure less than E, then we are done. Since the complement of this set with respect to C is covered by the union of C \ cpg1( K O )and C \ :;pc ( K O ) ,it is enough to estimate the measure of these sets. The first set is mapped by cp, into V \ K O ,hence it . second set is mapped cannot have measure greater than or equal to ~ / 2 The by cp,, also into V \ K O ,hence, similarly, it has measure less than ~ / 2 . In the remaining part of the proof we use the observation that whenever K ' is a compact subset of X and C' = cp2(K1) has finite p measure, then for each E > 0 there exists a neighborhood Poof po such that for each p E Po we have p (C' \ p i 1 ( K t ) ) < E. To prove this, let us choose a compact subset C" of the Bore1 set C' for which p(C' \ C") < ~ / and 2 let K" = cp,, (C"). Let us choose an open set V containing K" such that v(V \ K") < 6, where S corresponds to ~ / by 2 (1). For each u E C" there exist open neighborhoods U, and P, of u and po, respectively, such that p(U, x P,) c V. Let us choose a finite subcovering U,, , . . . , UUn of the covering U,, u E Cl1, and ~. for p E Po the set C1' \ cp;l(K") is mapped by cp, let Po = f ~ y = ~ P ,Then into V \ K", hence has p measure less than ~ / 2 .Now since K" c K ' and C1\cp;'(~') c (c'\c")u(c"\~~;~(K")) we obtain that p(C1\cp;l (K')) < E. Now let us suppose only that Y is a topological space. We will prove that (R) is satisfied, which implies (T). Let again C be a compact subset of U and K = cp,, ( C ) , moreover let p, + po be a sequence in P . Let E i = 2-' and let Si > 0 be the corresponding sequence of numbers 6 by (1). Let us choose a compact subset K1 c K such that f lKl is continuous and v ( K \ K1) < S1 and let C1 = cpz(K1). Then p ( C \ C1) < ~ 1 By . induction, using the statement of the previous paragraph, we may find a sequence of such that p(C1 \ c p , ' ( ~ ~ ) ) < E i + l whenever j mi. indices m l < m2 < ) ~ 1 Now . let K 2 be a compact This implies that p(Cl \ fl,"=,cp;2v ( ~ 1 ) <
>
5 19. Between
183
measurability and continuity
subset of K such that f lK2 is also continuous and u ( K \ K2) < S2. Let C2 = cp; (K2), then p ( C \ C2) < 62. Let us apply induction again, but using the new subsequence instead of the original sequence. Then we obtain a . this process subsequence such that p(C2 \flg1cp;2 ( ~ 2 ) )< ~ 2 Continuing r.9 and taking the diagonal sequence, we arrive at a subsequence p,, of p, such that the measure of the set
is less than 2 ~ Now ~ . let E = np=lUZkEi. Clearly, p ( E ) = 0. If u E C \ E then there exists a k such that u @ Ei for i 2 k. This means on the one hand that u $ C \ Ci for i 2 k, that is, u E Ci for i 2 k. This implies that cppo(u) E Ki for i k, in particular cp,,(u) E K k . On the other hand, if i t k, then for each t 2 i we have u @ Ci \ c p ~ ; (Ki). ~ We will apply this only for i = k to obtain that pPmt(u) E K k whenever t k. Since f lKk is continuous, we obtain that f (cp,, (u)) + f (cpPO (u)) . The general case can be obtained by representing U as the union of a a-compact set and a set of measure zero and using the diagonal process.
>
>
19.6. Theorem. W e will use the notation of 19.1. Let X be an open subset o f Rn. I f Y is a topological space, then every Lusin An measurable function f : X -+ Y is contained in R n ( X ,Y), K ( X , Y ) , and M n ( X ,Y). If moreover Y is a uniform space, then f is contained in Ln (X, Y) and Sn(X, Y).
Proof. Let U c Rn be open, P an open subset of some Euclidean space, po E P, cp : U x P + X a C1 function for which each cp,, p E P is an embedding. We will apply the previous theorem for cp locally. Let uo E U and let us choose a c > 0 such that det cpbo ( ~ 0 ) ) > C. Choosing a neighborhood Uo of uo having compact closure and Po of po having compact closure such that cp, is one-to-one on Uo for each p E Po and det cp' (u) 2 c whenever ( p u E Uo and p E Po,by the transformation formula of integrals we have for any measurable subset B c Uo that
I (
I
I
>
)I
The inequality An (cpp (B)) cAn (B) is also satisfied for nonmeasurable sets B, because otherwise we can find a Borel hull A > cpp(B)for which An (A) < cAn (cppl(A)) for some p E Po. This is a contradiction, because cppl (A) is a Borel set, hence measurable.
184
Chapter VI. Regularity results with fewer variables
Now, the previous theorem can be applied for cplUo x Po. As it was mentioned at the definition of (L), etc., this is enough to prove that (L) [(S), (R), (T), (M)] is satisfied for U, P, PO,cp, An.
19.7. Theorem. We will use the notation of 19.1. Let Z , Zi (i = 1 , 2 , . . . , n ) be topological spaces. Let Xi (i = 1,2, . . . , n ) and X be open subsets of Euclidean spaces and let Y C R' be open. Let D be an open subset of X x Y and W c D x Z1 x x 2,. Consider the functions f : X -+ 2, f, : X i + Z i , h : W + Z , g i : D + X i (i = 1 , 2 ,... , n ) . Let U c R k be open, P be an open subset of some Euclidean space, po E P, cp : U x P + X a C1-function, for which cp, is an immersion of U into X for all p E P, and suppose that the following conditions hold: (1) For each (x, y) E D we have
-
(2) (3) (4) (5)
for each fixed y E Y, h is continuous in the other variables; the function fi is in Rk+' on Xi (i = 1,2, . . . ,n ) ; gi is C1 on D (i = 1 , 2 , . . . , n); for each uo E U there exists an yo such that (cp(uo,po), E D and the rank of the derivative of (u,~) ~~(cP(~,Po),Y)
at (uo,yo)is k + I for each 15 i 5 n. Then condition (R) is satisfied for f , U, P, PO,cp, Ak.
Proof. Suppose that p, + po. Let us choose an open neighborhood Uo of uo, Po of po, and Yo of yo such that (cp(u,p) , y) is in D whenever u E Uo, p E Po,y E Yo; moreover, the rank of the derivative of the mapping (u, y) e gi (cp(u,p) ,y) is equal to k I for all u E Uo, p E Po,y E Yo, and for 1 5 i 5 n. This is possible, because D is open, gi and cp are C1-functions, the rank is lower semicontinuous, and U x Y has dimension k + I, hence the rank cannot increase above k + I . Since the function f l is in Rk+l,there is a subsequence pmTof p, such that except for pairs (u, y) E Uo x Yo from a set El having Ak+' measure zero we have
+
5 19. Between measurability and continuity
185
Now using for the subsequencep,, that f2 is in Rk+lwe obtain a subsequence pmrgfor which, except for pairs (u, y ) E Uox Yo from a set E2having Xk+' measure zero we have
etc. Finally, we obtain a subsequence p,,
of pm such that except for a set
E = Uy=lEiof pairs (u,y) E Uox Yohaving Xk+' measure zero we have
for i = 1 , 2 , . . . ,n. By Fubini's theorem, for almost all y E Yo we have for almost all u E Uo that (u, y) 4 E. Fixing any such y, from the functional equation and from the continuity of h for fixed y we obtain that
which is condition (R) with the function (PI Uo x Po. Hence we have proved that for each uo E U there is an open neighborhood Uoof uo such that for a subsequence p,, of pm and for almost all u E Uo we have f(~(u,~m+ , ) )f ( ~ ( U , P O ) ) . Since U is a Lindelof space, by the remark in the definition we obtain that (R) is satisfied.
19.8. Example. We will use the notation of 19.1. Let us consider the following functional equation:
whenever x E Rm, y E R. Suppose that the functions ho : PS" x R + R, hi : Rm x R + R \ (0) are continuous and the functions gi : R + Rm are in C1 (i = 1,.. . ,n). Introducing the variable xj = x gj(y) instead of x, we obtain
+
Chapter VI. Regularity results with fewer variables
186
To see that condition (5) is satisfied we have to check the rank of the matrix
where c p g ) and are the coordinate functions of p, and gi, respectively. If this is k 1, then we may apply our theorem with 1 = 1. This means, geometrically, that the vector gi(y) - g$(y) is not contained in the range of the linear operator ,cp; (u) (which is known to be k-dimensional). This range can be any k-dimensional linear subspace in Rm. It may happen that for each k-dimensional linear subspace, there exists a y E R such that none of the vectors gi (y) - gi (y), i # j is contained in the linear subspace. Then our theorem can be applied directly and proves that f E Rk+limplies f E R k . If this is the case for k = m - 1,m - 2 , . . . , 0 , then we obtain that every measurable solution is continuous. But there are situations when this is not the case. If, for example, the derivative of the functions gi is constant, i. e., if gi(y) = yai bi, then for any fixed j, equation (1) cannot be applied to get f E Rk from f E R k + l , because for some functions cp the range of ybo(u) will contain some of the vectors g((y) - gi(y) = ai - a j . But we have the possibility to use any of the equations (1). Using that to be in Rk is a local property, it is enough to prove that for any k-dimensional linear subspace of Rm there exists a j such that none of the vectors ai - a j , i # j is contained in the given subspace. For example this is the situation if n 2 m and the vectors ao, . . . ,a, are in general position. If this condition is not satisfied, then it is still possible that our theorem can be applied. A similar (but somewhat simpler) situation was studied in the paper [13], in the proof of Theorem 2.3.
+
+
19.9. Remark. Although, as the example above shows, Theorem 19.7 can be applied in several cases, it is not satisfying because condition (5) is too strong. If we want to apply Theorem 19.7 to prove that f E Rk,then cp can be arbitrary. Hence condition (5) implicitly means that the rank of ax agi has to be large, even if - has a large rank. This in practice means that
aY
the gi have to depend on all coordinates of z,which is not comfortable. We want to relax this condition. Instead of supposing that
5 19. Between measurability and continuity
187
+
has maximal possible rank k 1 at (uo,yo) we will only suppose that it has a constant rank ki (depending on i) on a neighborhood of (uo,po,y o ) But in this case we have to work with functions from Rk n M k , and, roughly speaking, our theorem says that solutions in R k + l fl M k + l are also in Rk fl Mk. First we deal only with the measurability condition (M). We will use the following lemma to prove that condition (M) for the unknown functions fi implies condition (M) for f . 19.10. Lemma. We will use the notation of 19.1. Let X be an open n, and f E M k ( X ,Y). If $I is subset of Rn, Y a topological space, 0 5 k I a C1 mapping of the open subset U of Rm into X for which the rank of the derivative is k everywhere, then f o $ is Am measurable.
Proof. The lemma directly follows from the rank theorem. Indeed, the rank theorem implies, that for each uo E U there exists an open neighborhood Uo such that $IUo can be written as (2: o p o /3. Here, with the notation I = (-1, I ) , the mapping ,B is a diffeomorphism of Uo onto Imsuch that /3(uo) = 0, the projection p of Iminto In has the form p(xl, x2, . . . , x,) = (xl, 22,. . . , xk, 0, . . . ,O), and a is a diffeomorphism of Inonto an open set Xo mapping 0 into s o = $I(uo).Identifying the set I% {O) C Inwith ~ % ehave that a Ikis an immersion, hence (f o ( aIk)) (V) is Aheasurable for each open subset V of Y. Since p-l(A) is Am measurable for each Ak measurable subset A of I< and ,E1(B)is Am measurable for each Am measurable subset B of Im,we obtain that f o ($lUo) is Am measurable. Now using that U is a Lindelof space, we get the general case. 19.11. Theorem. We will use the notation of 19.1. Let Z be a topological space and let Zi (i = 1,2, . . . ,n ) be separable metric spaces. Let Xi (i = 1 , 2 , . . . , n ) and X be open subsets ofEuclidean spaces and let Y C RL be open. Let D be an open subset of X x Y and W c D x Z1 x . x 2,. Consider the functions f : X + Z , fi : Xi + Zi, h : W + Z , gi : D + Xi (i = 1 , 2 , ... , n ) . Let u c JRk beopen, $ : U + X b e a C 1 immersion o f U into X, and suppose that the following conditions hold: (1) For each (x, y) E D we have
188
(2) (3) (4) (5)
Chapter VI. Regularity results with fewer variables
for each fixed y E Y, h is continuous in the other variables; the function fi is in M k i on Xi (i = 1 , 2 , .. . , n ) ; gi is C1 on D (i = 1 , 2 , . . . , n ) ; for each uo E U there exists a yo such that ($(uo),yo) E D and the rank of the derivative of (u, Y ) ti gi ($(u) 1 Y)
is ki on a neighborhood of (uo,yo) for each 1 5 i 5 n. Then u ti f ($(u)) is measurable.
Proof. Let us choose an open neighborhood Uo of uo and Yo of yo such that ($(u), y) is in D whenever u E Uo, y E Yo, moreover, the rank of the derivative of the mapping (u, y) ti g, ($(u), y) is equal to ki for all u E Uo, y E Yo, and for 1 5 i 5 n. This is possible by condition (5). By the previous lemma we obtain that the mapping (u, y) H fi (gi ($(u), y)) is An+' measurable. By F'ubini's theorem except for a set Ei of points y from Yo with A' measure zero the mapping u ti fi (gi ($(u), y)) is A* measurable on Ua. Hence, except for the set E = Uy=lEi, for all y E Yo the mapping
of Uo into Wy is measurable. Since for any fixed y the function h is continuous in other variables, we obtain that for any fixed y E Yo \ E the mapping
is measurable. This means that u ti f ($(u)) is measurable on Uo. Since U is a Lindelof space, the statement follows. The following theorem is the key to the generalization 19.13 of Theorem 19.7. 19.12. Theorem. We will use the notation and P be open subsets of Euclidean spaces, po E space, cp : U x P + X a C1 function, for which u E U, p E P . If f E M (X,Y) n Lk (X, Y), then for f , U, P, po, (P, and Am.
of 19.1. Let U C Rm , X P, Y a separable metric rank p;(u) = k for each condition (L) is satisfied
§ 19. Between measurability and continuity
189
Proof. Let uo E U. Since the rank of pLo(uo)is equal to k, we may write u as u = (ul, u2) E R* x IRm-* such that the determinant of
is not equal to 0. Hence there exists a neighborhood Ul x U2 of uo and a of U1 is compact, E x U2 C U, neighborhood Po of po such that the closure and the mapping U l +-+ cp(u1,u2, P ) is an immersion of Ul for each u2 E U2, p E Po. We may suppose that X ~ U ~ ) and Am-"u2) are finite. Since f E C k , for each E, a > 0 and for each us E U2 < 6, then u/2 E U2 and there exists a S > 0 such that if Iu/2 - u 4 < S, ( p- P OI
Applying this for p = po, too, and combining the two inequalities, we obtain that (1) E
for each u', for which lul, - u2 I < S and for each p for which Ip - po I < S. For a fixed E, a > 0, let S,, be the S corresponding to u2 E U2. Let C be an arbitrary compact subset of U1 x U2 and let C2 = {u2 : (ul , u2) E C ) be the projection of C. The closed balls with center u2 E C2 and radius less than S,, gives a Vitali covering of C2, and hence it is possible i = 1 , 2 . . . of them which almost covers to find a disjoint sequence Bi, c2. Since f E M *, by the previous lemma the mappings u +-+ f (p(u,p ) ) are Am measurable for each p E Po. Hence the mapping
is measurable, too, i. e.,the sets
190
Chapter VI. Regularity results with fewer variables
are Am measurable too. Using ( 1 ) and Fubini's theorem we obtain that the Am measure of the set ( 2 ) is at most X ~ - " B ~ ) E / A ~ - " U ~ Since ) ) . the sets Biare a disjoint almost cover of C 2 , we have that
Hence we have proved that each uo E U has a neighborhood Uo = U1 x U2 such that ( L ) is satisfied on this. By the remark in the definition of (L) the statement follows. 19.13. Theorem. We will use the notation of 19.1. Let Z be a topological space and let Zi ( i = 1 , 2 , . . . ,n ) be separable metric spaces. Let Xi(i = 1,2, . . . ,n ) and X be open subsets of Euclidean spaces and Y C R1 be open. Let D be an open subset of X x Y and W C D x Z1 x . . . x 2,. Consider the functions f : X -+ Z , fi : X i -+ Zi, h : W -+ Z , gi : D -+ Xi ( i = 1 , 2 , . . . ,n ) . Let U c R'" be open, P an open subset of some Euclidean space, po E P , cp : U x P -+ X a C1-function, for which each p,, p E P is an immersion of U into X, and suppose that the following conditions hold: (1) For each ( x ,y ) E D we have
(2) for each fixed y E Y , h is continuous in the other variables; (3) the function fi is in Rkif l M k i , (i = 1 , 2 , . . . ,n ) ; (4) gi is C1 on D (i = 1 , 2 , . . . , n ) ; (5) for each uo E U there exists a yo such that ( p ( u opo), , yo) E D and the rank of the derivative of
is ki on a neighborhood of the point ( u o , p o yo) , for each 1 5 i 5 n. Then the conditions (R) and (M) are satisfied for f , U , P , P O , p, A k .
§ 19. Between measurability and continuity
191
Proof. From Theorem 19.11 it follows that condition (M) is satisfied by f , U, P, po, p , A" Let us fix a .uo E U and let us choose a yo for uo by (5). Let us choose open neighborhoods Uo, Po,and Yo of uo , po , and yo such that ( p ( u , p ) , y ) E D whenever u E Uo, p E Po,and y E Yo, moreover the rank of the derivative of
is ki on Uo x Po x Yo for each 1 5 i 5 n. Now the proof that condition (R) is also satisfied is exactly the same as in Theorem 19.7, but we have to use the previous theorem instead of the definition. 19.14. Conditions. In what follows we will use the notation of 19.1, but we will only investigate the situation where X is a nonvoid open subset of Rn and f maps X into a separable metric space Y, because we want to avoid any difficulties arising only from the poor topology of the range Y. 19.15. Remark. We will use the conditions of 19.14. There is a kind of locality other than the one treated after Definition 19.1. We have f E C k ( X ,Y) if and only if each xo E X has an open neighborhood Xo c X such that f lXo E Ck(Xo,Y). The "only if" part is trivial. To prove the "if" part we will use the notation of Definition 19.1. Let us note that for each point uo E U there exist open neighborhoods Uo and Po of uo and po, respectively, such that for xo = p ( u o , p o )the set p(Uo,Po)is contained in Xo. This means that (L) is satisfied for y/Uo x Po Now from the locality principle in the definition we have that f E & ( X , Y). The same locality is true (and the same proof works) for Sk,R k ,Z, and M k . 19.16. The class Mk. We will use the conditions of 19.14. Let
where {O,1) is taken as discrete space. It is easy to see that Ak is a a-algebra, and a function f : X + Y is in M k(X, Y) if and only if f (V) is in Ak for each open subset V of Y. Hence the investigation of M k(X,Y) is reduced to the investigation of the a-algebra Ak. It is easy to see that A, is the class of all An measurable subsets of X and A. is the class of all subsets of X . We will prove that A E Ak if and only if for each open set U c Ktk and for each immersion $ : U + X the set A n rng $ is X h e a s u r a b l e . For each u E U, there exists a compact neighborhood C of u such that the restriction of $ to C is one-to-one. By the transformation formula of
Chapter VI. Regularity results with fewer variables
192
integrals, if $-'(A) n C is Lebesgue measurable, then $(C) n A is Hausdorff measurable. In the other direction, if $(C) n A is Hausdorff measurable, then, using that the Hausdorff measure of $(C) is finite, there exist Borel sets B, N c $(C) such that B c A, (A n $(C)) \ B c N , and X ~ N=) 0. The sets ($IC)-' (B) and (+IC)-'(N) are Borel sets, and the latter may only have measure 0. This means that the X h e a s u r e of ($IC)-l(A \ B) is zero, too, and hence ($IC)-'(A) is Xheasurable. Now for each u E U choosing a compact neighborhood C as above, countably many of them covers U . If A n rng+ is X h e a s u r a b l e , then the (A) is X%easurable. sets ( $ 1 Ci)-l (A) are all X%easurable, and hence In the other direction, if (A) is X%easurable, then the sets (A) n Ci are measurable, too, and hence Anrng$ = (ui$(Ci)) n A is a X%easurable set. What we have proved until now implies that every Xk measurable set is in A k , because rng $ is always X%easurable. A countably ( ~k)4rectifiable set is in Ak if and only if it is X h e a s u r a b l e . We have only to prove that if A E Ak is countably (xk,k) rectifiable, i. e., if A is a X%lmost subset of a countable union of Lipschitz images of bounded subsets of IW" then A is X h e a s u r a b l e . By Theorem 3.2.29 from [53], A C N U (U,OO=lSi), where X " ~ ) = 0 and each Si is a k-dimensional C1 submanifold of X. Dividing Si into smaller parts, if necessary, we may suppose that each Siis the image of (A) is X%easurable, some open subset of JRk by a C1 immersion +i. Since $il the set A n rng$i = A n Si is Xk measurable for each i. Hence
is
measurable. There are Xk nonmeasurable sets in Ak. Any non Xk measurable subset ([53],2.2.4) of a purely unrectifiable compact subset with finite X%easure is an example. For such a set A, the set (A) has measure 0 for each immersion $ from an open subset of ~ " n t o X. Example of a purely unrectifiable set can be found in [53], 3.3.20. See moreover [160], 3.17. Xk
x.
19.17. Connections between M k , Ck, SkyRk, and We will use the conditions of 19.14. One of the simplest questions is, whether f E M implies f E Ck, Sk,R k or Tk. We know that this is true for k = n. If k < n then the characteristic function of the intersection of X and an appropriate k-dimensional plane is in MI, but contained in none of the classes Cb, Sk, Rk,
'G.
In the other direction, suppose, that f E Ck = Sk C Rk C Tk. The question is, whether f E M k is satisfied. This is trivial for k = 0. We will
§ 19. Between measurability and continuity
193
show that this cannot be proved in ZFC for 0 < k 2 n. Namely, we will give an example f under the continuum hypothesis for which f E Lk but f @ M k . By the famous results of Godel and Cohen, the continuum hypothesis is independent from the axioms of ZFC. This means that MI, c Lk cannot be proved in ZFC. Another question is whether Sk= Rlc This is trivial for k = 0. We will show by a counterexample under the continuum hypothesis that for 0 < k < n this is not a theorem in ZFC. I do not know anything about the case k = n. Similarly, we may ask whether R k = '& or at least MI,n R k = Mkn '&. This is also true for k = 0. For 0 < k < n we will prove that MI, n Rk M k n '& hence Rk '&. For k = n we know that M, C R, c 7 , hence of course M, Ti R, = M, n 7,.I do not know whether R, = 7,.
5
5
19.18. Hierarchy of function classes belonging t o different dimensions. We will use the conditions of 19.14. Let us fix dimensions 0 2 Ic < 1 5 n and let us investigate the connection between the classes Mk, L h , etc. and classes MI, Ll, etc. We may hope that decreasing the dimension, conditions (L), (S), etc. become stronger. One of the only two positive results in this direction is that this is true for the conditions (L), (S), and (R) under measurability:
The proof of this statement is very similar to the proof of Theorem 3.6, therefore we do not repeat the argument. We will show by a counterexample under the continuum hypothesis that for k > 0, ZFCY Mk flLk c M1Ux. (k indicates that the right-hand side is a theorem in the system on the left.) Similarly we will show by a counterexample under the continuum hypothesis that ZFCF M k n L k n L 1C MI
except for the trivial case k = 0. It is much easier to see that inclusions in the other direction do not hold in general. Although
Mz c Mo is satisfied trivially, in general
194
Chapter VI. Regularity results with fewer variables
This is shown by the characteristic function of a non Xk measurable subset of the intersection of X and an appropriate k dimensional plane. The same example shows that M I ~ LpMkU'7i. z If we take the characteristic function of the intersection of X and an appropriate k-dimensional plane, then we see that
We will show that
MZnRk c Mk. I do not know whether Rk may be replaced here by Tk except for the trivial case k = 0. Let us see the proofs. 19.19. Theorem. Under the conditions of 19.14 and for 0 5 k we have MI n Rk c M k .
Proof. This is trivial for k = 0. Otherwise, let $ be an immersion of an open subset U c IRk into X. Let uo E U and let V be an 1 - k dimensional subspace of Rn orthogonal to rng$'(uo). Let T : @ "-I. + V be a linear isometry, and let us define p by p ( u ,p) = $(u) ~ ( p. )Then for po = 0 we have p,, = $. Let us choose open neighborhoods Uo and Po of uo and po, respectively, such that p(Uo,Po)c X and p is an immersion of Uo x Po into X. Since f E MI, the mapping (u,p) ++f (cp(u,p)) is XZ measurable. Hence ) measurable. Let for XZ-"almost all p E Pothe mapping u e f ( p ( ~ , ~is)Xk us choose a sequence p, + po such that each u c~ f ( p ( ~ , ~ , )is) measurable. By f E R k it is possible to choose a subsequence pms such that
+
for Xk almost all u E Uo. Hence u e f ($(u)) is measurable over Uo, i. e., locally. This implies that f E M k . 19.20. Counterexample. Under the conditions of 19.14 we will show by a counterexample that for 0 < k < n we have M k n Rk Mk n
5
z.
§ 19. Between measurability and continuity
195
Proof. For simplicity, we will work with a nonvoid k-dimensional plane in X having the form V = X n W where
for some fixed x;,,, . . . ,x:. Without loss of generality we may suppose that xi+, = . . . = x: = 0. Our function f will depend only on X I , . . . , xk and from the subspace W. Let f (x) = 0 xk+, whenever r = 0. Let g(y) be 0 or 1 on IEk depending on whether the sum of the integer parts of the coordinates of y E R%s even or odd, respectively. We will use a smoothing h of this "chessboard" function g to define f . The continuous function h is obtained taking the mean of g for a box around y, namely, on the set of all z E JRk for which the difference zi - yi of all coordinates is between -114 and 114. Now for any nonnegative integer m if (1 - cu)2-,-l for some 0 < cu 5 1 then let us define xn = on the distance r =
+
Since f is continuous on the two parts V and X function, hence it is in M, for any 0 5 m 5 n. First we will prove that f $! R k . Let
T
\
V of X, it is a Bore1
be the embedding
of Rk into IP.Let us choose a K E N and a vector yo from 2 - K ~ ksuch that if U is the set of all points y for which all coordinates of y - yo are greater than zero and less than 2-K, then the closure of T(U) is in V. For ) where en is the unit vector (0,. . . , 0 , 1 ) E Rn. p E R let cp(u,p) = ~ ( u+pen For an appropriate M we have cp(u,p) E X whenever u E U and p E P = {p : Ipl < 2-M). Let po = 0 and p, = 2-rn whenever m > M. For any subsequence p m 3of prn it holds that, if for a given u E U for infinitely many , 1) = 1, then f ( ( ~ ( 2 ~ P7 , ~)) fS f ( ( ~ ( uPO)) , = 0. Hence we have f ( ~ ( uprns with the notation Urn = {u E U : f (cp(u,~~,))= I} convergence can occur only if there exists an S such that for each s 2 S we have u +f Urn,, i. e., if u $! n$?=,UEs Urn,. Hence convergence almost everywhere may happen only if k m 03 ( n s = , us=, Urn,) = 0.
Chapter VI. Regularity results with fewer variables
196
This means that for convergence almost everywhere Xk(Ums)+ 0 is necessary. But this does not hold because X'(U,, ) = X " ~ ) / 2 b h e n e v e r m, > K. It is much harder to prove that f E %. Let U be an open subset of Rk, let P be an open subset of some Euclidean space, po E P, and p : U x P i X a C1 function for which each p,, p E P is an immersion. Let pm 4 po be a convergent sequence in P. Since the function f is continuous on X \ V, if p(u, po) $ V, then f (p(u,p m ) ) i f (p(u, Hence we have to deal only with the set Z = {u E U : p(u, po) E v). Let us introduce the notation
We have to prove that for almost all u E Z there exists a subsequence pms = 0. This means that for each of p, for which f (p(u,pm,)) i f (p(u, E > 0 and for each M there exists an m 2 M such that u $ UA, i. e., that u 6 U,>o UG=, nE=MU&. Hence we have to prove that this set has A" measure zero. Since decreasing E the set UE=, nE=M U& increases, if we take a sequence E , > 0 tending to 0 and restrict the union for only these numbers E,, the union does not change. Hence it is enough to prove that for each E > 0 the set U%=, n z = M U& has measure zero, or, equivalently, that for each E > 0 and for each M the set flE=MU& has ~"easure zero. If this is not the case, then there exists an E > 0 and an M for which there exists a density point uo of this set. Suppose for contradiction that this is the case and let us fix E , M, and uo. Moreover, we may suppose that uo E n,"=MU&. Let us write p = (91,9 2 ) where p l ( u , p ) is the first k coordinates of p ( u ,p) and 9 2 (u,p) is the last n-k ones. Since uo is a density point of 2, too, (uo) # 0. Using the proof of the inverse we have pi,po(uo) = 0 and det function theorem, it is possible to find a c > 0, an open ball Uo with center uo and a neighborhood Poof po such that whenever Bs (uo)is contained in Uo and p E Po,then BCs(pl,,(uo)) is contained in p l , (Bb( u o ) ) . Furthermore we may suppose that pi,, (u) I c/(16&) whenever (u,p) E Uo x Po. Shrinking Uo and Po,if necessary, we may also suppose that for some positive constant C we have J(pl,,) (u) 5 C whenever (u,p) E Uo x Po,where J is the absolute value of the Jacobian. Let a ( k ) denote the X h e a s u r e of balls having radius 1 in Rk. Then, of course, the X h e a s u r e of any ball having radius 6 is a ( k ) S k Since uo is a density point, there exists a So > 0 such that for the closed ball Bb(uo) we have
11
11
§ 19. Between measurability and continuity
197
whenever 0 < S I So. For this So let us choose an so > 1 for which 2-'0+1 I cSo/&. Let us choose an Mo such that for m 2 Mo we have p, E Po and the distance of y ( u o ,p,) from W is less than 2-30-2. Let us fix an m 2 max{M, Mo). Since uo E U&, the distance of y ( u o , p m ) from W is greater than 0 but less than 2-'0-~. Let us choose an s such that this so. distance is not less than 3 2-3-3 but less than 3 2-'-2. Clearly s Let &2-'/c < S I &2-'+'/c. Then we have 0 < 6 5 So. Let S denote the set of all those y E Rk for which all coordinates of 2'y have the same integer part as the corresponding coordinates of 2'yo where yo = cpl ( u o ,p,). The set S is the Cartesian product of intervals having length 2-'. Hence the diameter of S is &2-' and because yo E S, the set S is contained in p ~ , (~ ~, (uo)) 6 . Using the estimate of Y ; , , ~ ( u ) valid for all u E IB6(uo) we obtain the estimate
>
11
11
This implies that the distance of y(u,p,) from W is between 2-3-2 and 2-'. Let Sodenote those points y of S for which all of the three functions h(2'y), h ( Y + l y ) , and h(2'+2y) take the value zero. A y E S is in So if and only if the fractional part of all the coordinates of 2 Q y ,2'+ly, and zS+2 y is between 114 and 314. This means that the fractional part of all the coordinates of 2'y is in [5/16,6/16] U [10/16,11/16]. Hence the X b e a s u r e of Sois 2-3"3k If u E (uo) and Y = Y l ( u , ~ r n )E SO,then u $ \U&. But J ( Y I , ~ , ) ( U ) I C , hence by the transformation formula of integrals we have
This contradicts the choice of So. This contradiction proves that f E Tk. For the following counterexamples we need a lemma. The counterexamples are related to the existence of the so-called almost invariant sets. These sets were used by Kakutani and Oxtoby [I211 to prove that the Lebesgue measure on the complex unit circle can be expanded to an invariant measure such that the Hilbert space dimension of the corresponding L 2 space becomes 2', where c is the cardinal number continuum. The construction below is a refinement of the construction from the paper [82] of the author, where the result of Kakutani and Oxtoby was extended - among others - to arbitrary locally compact groups. The ideas there are combined with the well-known ideas of Sierpiriski to construct under the continuum hypothesis a subset of
198
Chapter VI. Regularity results with fewer variables
the unit square with (outer) measure 1 and containing at most two points on each line. To understand the typical application of this abstract set theoretic lemma, we may think of the case when X is the plane, T is the class of all diffeomorphisms mapping some open subset of the plane onto some other open subset of the plane, F is the class of all compact plane sets having positive Lebesgue measure, 6 is the class of all one-dimensional C1 submanifolds of the plane, and n = c = N1.
19.21. Lemma. Let X be a set and T a class of one-to-one transformations each mapping a subset of X into X and let F, 6 be classes of subsets of X . Suppose that there exists a cardinal number n > No with the following properties: (1) card(X) = n; (2) card(T) 5 n; (3) card(F) 5 n and for every F E F we have card(F) = n;
(4) card(6) 5 n and for every F E F and Go c 6 for which card(&) < n we have card(F \ UBo) = n; (5) The classG i s T invariant, i. e., i f G E 6, 7 E T , then r ( G ) E 6 and +(G) E 6 . Then there exists a family {Xy)yEr of subsets X, of X with the following properties: (6) card(r) = n;
(7) the sets X,, y E r are pairwise disjoint; (8) for each y E r and G E 6 we have card(X, fl G) < n; (9) card(F fl X,) = n whenever y E I? and F E F ; (10) for every subset roof and for every 7 E T
Proof. Let R be the smallest ordinal having cardinality n. We may suppose that F is nonvoid, because otherwise we may replace it with {X). Let Y be an arbitrary set with cardinality n. Since card(Y x F) = n, there (y,, Fa)of the set of ordinals { a : 0 5 a < exists a one-to-one mapping a R) onto Y x F . The transfinite sequence Fo,. . . ,Fa,.. . , 0 5 a < R contains every element F of F exactly n times. Similarly, we may suppose that 6 is nonvoid, because otherwise we may replace it with (01,and we may choose a transfinite sequence G o , .. . , G,, . . . , 0 5 a < R containing all elements of
§ 19. Between measurability and continuity
199
G. Let us choose a mapping a t, r, of the set {a : 0 5 a < R) onto the set {Ix)U T for which r 0 = lx where lx is the identical mapping of X onto itself. For each x E X and each ordinal a < R let C,(x) denote the set of all points of X that can be written as
where n = 1 , 2 , .. . , k = 1 , 2 , . . . ,n,0 5 ,i?k 5 a, and ~k is 1 or -1. Here r1 means the mapping r and r - I means the inverse of 7 . Clearly, we have x E C,(x) and for x E X and 0 5 p 5 a < R we have Cp(x) C C,(x) and TP (c&) = r p ( X ) n C, (x). We also have card(C, (x)) 5 max{card(a), H O )
< n.
If A c X , then we will use the notation C,(A) for U Z E ~ C c U (We x ) . will show that there exists a transfinite double sequence
of elements of X such that:
the sets
{C,(x;)
:0
5 ,B 5 a < R )
are pairwise disjoint;
C, ( x t ) is disjoint from any C, (G,), y 5 a . If we agree that (y, 6) < (a,,B)whenever y < a or y = a and 6 < ,B (lexicographic ordering), then {(a,P) : 0 5 P 5 a < R ) is a well ordered set. We will define the sequence {x; : 0 5 P 5 a < R) by transfinite induction. Let x! be an arbitrary point of Fo\ Go. Suppose that 0 5 ,B 5 a < R and that x i have already been defined for all pairs (y, 6) < (a,P), 0 5 6 5 y. Consider the union D ( a , P) of the sets C, (x:) as (y, 6) runs over all pairs (776) < (a,P). Then card (D( a ,0)) 5 (card(a)) max{card(a) , No )
< n.
Let E(a) be the union of all sets C, (G,), y 5 a . By ( 5 ) , E(a) is the union of some G, C G with card(6,) < n. By (4) the cardinal number of F, \ E(a) is n, hence (F, \ ~ ( a )\ D ) ( a , p) is nonvoid. Let x; be an arbitrary point of (F, \ E(Q))\ D ( a , P). Then C, (x;) is disjoint from every Cc(x) where
Chapter VI. Regularity results with fewer variables
200
x = xi for some (y, S) would have 7;: o
< (a, p) ..
or x E Gs for some
o 7;: (x;) = 7 : ' o
4
< 5 a. Otherwise we
. o::7 (x),
where ,& 5 a,Sj 5 a,~k is 1 or -1, and v j is 1 or -1, k = 1,2,..., n, j = 1,2,. . . , m. Hence
and this contradicts the choice of x;. Now let I? = {C : is an ordinal and 0
<
< < < 9);
Properties (6) and (7) are obvious. Since x? E F and xy E C,(xy) c XC a < R and Fa = F , we have that F f l XC has at least n whenever C elements. Hence (9) is satisfied. To prove (8) let us observe that
<
whenever a
> y. Hence, if G = G,,
then
and the right-hand side has cardinality less than n. To prove (10)let I'o c I' and 7 E T. Suppose that 0 5 y r, = r. Using that
we have that
the right-hand side has cardinality less than n. Hence (10)is proved.
and
5 19.
Between measurability and continuity
201
19.22. Counterexample. Using the conditions of 19.14, under the continuum hypothesis for 0 < k 5 n we have Ck $Z M k .
Proof. We will give a function f E Ck for which f $ M k . We want to apply the previous lemma. We will use only that the functions cp in the definition of Ck are continuous and that by Remark 19.2.(3) we may suppose that the functions cp, are one-to-one. Let T denote the class of all one-towhere U is one functions 7 which can be represented in the form cp, o cp;', an open subset of IF@, P is an open subset of some Euclidean space, and cp : U x P -+ X is a continuous function for which all cp,, p E P is one-toone. Since the cardinality of all pairs U, P is continuum and any continuous function cp is uniquely determined by the values on a countable dense subset, the cardinality of the class T is continuum. Let F denote the class of all compact k rectifiable subsets of X having positive X h e a s u r e . Since each compact set is uniquely determined by its complement, and the open complement is determined by its subsets from a fixed countable base, it follows that the class F has c elements, and all elements have cardinality c. Applying the previous lemma with 6 = 0 we obtain a class of subsets X,, y E R of X . Our counterexample will be the characteristic function f of Xo i.e., X, for y = 0. Let U be a bounded open subset of ~ h n $Id : U -+ X be an immersion for which the rectifiable and X h e a s u r a b l e set M = $ ( U ) has positive but finite X%easure. Let us observe that if X o n M were of Xk measure zero, then M \ X o would contain some F E F,which is impossible because F n X o # 0. If Xo n M were X h e a s u r a b l e with positive Xk measure, then it would contain some F E F. But this is impossible because F n X, # 0 and X, fl Xo = 0 for any y # 0. Hence Xo n M is non-Xk measurable. By 4.3 this implies that f $Mk. We will prove that f E Ck. Let C be a compact subset of U. The set
is equal to the set
For the mapping r = cp,,
o
cp;'
this set is a subset of the set
If we suppose that the continuum hypothesis holds, then this set is countable.
Chapter VI. Regularity results with fewer variables
202
19.23. Counterexample. Using the conditions of 19.14, for 0 n under the continuum hypothesis Sk R k .
5
Proof. We apply the construction of the previous lemma, choosing for T, F,and G the same classes as above to obtain the sets X,, y E R. If m E N and m 2 2, let f, (x) = g, (x)h, (x), where g, (x) is the characteristic function of the set X,, and if dist (x, D )
A; ) if & < d i s t ( x , D ) < A; if dist(x, D )
h,(x) =
m(m
+ 1) (dist(x, D ) -
< A; 2
where D is a given nonvoid k-dimensional closed disk contained in the intersection of X with a k dimensional plane. Let f = Cz=,f,. (As in the previous counterexample we can prove that f 6 M k . ) AS in the previous counterexample it follows that each g, is in Ck = Sk,hence in R k , too. The same is trivial for the continuous function h,. From this it follows for that it is also in R k . Since everywhere on the open set the product g,h, X \ D the function f is locally the finite sum of such products, we have that flX\D€Rk. Letp:UxP-+Xandlet
Clearly F is a closed set. Let C be a compact subset of F. For p, + po, let R,,j denote the set of all points u for which p(u, p,) E X j but p(u, po) 6 X j or p(u, p,) 6 X j but p(u, po) E Xj. Under the continuum hypothesis, the sets R,,j and their union R = U2,j=:l&,3 are countable and hence have X" measure zero. Let us observe that for each i there exists an mi such that if m > mi, then for each u E C we have
4
then p(u,p,) Xj Hence, if u E C but u 6 R , and u 6 ~joo,~y,-d-(Xj), whenever j 2 i. Hence gj (cp(u,p,)) = 0 for j 2 i. On the other hand, hence hj(y(u,p,,)) = 0 whenever j 5 i. So we dist(p(u,p,),~) < obtain that f (p(u,p,)) = 0 whenever u 6 R, u 6 ~j00,~pz (x,),and m >
&,
§ 19. Between measurability and continuity
203
mi. Since the sets X j are disjoint, f (cp(u,p,)) + f ( c p ( ~ , ~ ~for ) ) m + oo whenever u $ R, i. e., almost everywhere. Taking union for countably many sets C we obtain that f E R k . On the other hand, if e # 0 is orthogonal to D and cp(u,p) = $(u) pe, where $ is an isometric immersion mapping some nonvoid open subset of IRk into D , p o = 0 , then, forp, = l / m we have that
+
if m is large enough, except for a countable set. The set on the left-hand side has the same X G e a s u r e as C. This shows that f $ S k . 19.24. Counterexample. Using the conditions o f 19.14, under the continuum hypothesis for 0 < k < 1 5 n we have M I , n Lk n Ll M E .
Proof. We will give an example of a function f E M I , n Lk ri L1 but f 4 M 1 . We want to apply Lemma 19.21. We will use that by Remark 19.2.(3) we may suppose that the functions cp, in the definition of LI are one-to-one immersions. Let T denote the class of all one-to-one functions T which can be represented in the form o cpgl, where U is an open subset of Rz, P is an open subset of some Euclidean space, and cp : U x P + X is a C1 function for which all cp,, p E P are one-to-one. Let F denote the class of all compact 1 rectifiable subsets of X having positive X 1 measure. Let 6 be the class of all k-rectifiable Bore1 subsets of X . It is not hard to prove that the class G is T invariant. Moreover all G E G has X 1 measure zero, hence the same is true for the union of countably many G E G. This means measure, hence cardinality c for any countable that F \ UGo has positive subfamily Go c G and for any F E F . Other conditions of Lemma 19.21 have already been checked in 19.22. Applying Lemma 19.21 we obtain a class X,, y E R where each X, contains only countably many points from each G E G, but X, n F # 0 for each F E F, hence X, n F is not X 1 measurable for any F E F . Let f be the characteristic function of Xo. Along the same lines as in 19.22 we get that f E Ll but f $ M I . Since for any C1 embedding $ of an open subset of LRk into X the function f o $ is zero except for a countable set, we get that f E M I , and f E L k , too. Hence the statement is proved.
,
19.25. Counterexample. Using the conditions o f 19.14, under the continuum hypothesis for 0 < k < I L n we have M I , n Lk P M I U 7i.
204
Chapter VI. Regularity results with fewer variables
Proof. Let us apply Lemma 19.21 for the same T, F, and B as in the previous counterexample. We obtain a class X,, y E R where each X, contains only countably many points from each G E B, but X, n F # 0 for each F E F, hence X, n F is not measurable for any F E F . Let Z be an I dimensional plane which has a nonvoid intersection with X and let f be the characteristic function of the set Z n Xo. Then f E Mk n Sk = Mk n &, but f $ 5 and f $ M I .
20. BETWEEN BAIRE PROPERTY AND
CONTINUITY In this 5 we will prove "Baire property implies continuity" type results for the general explicit nonlinear functional equation (1) from Problem 1.18 without the strong rank condition in 1.18.(3) to the inner functions. All earlier "Baire property implies continuity" type results that I know of use the strong rank condition in 1.18.(3) or some abstract version of it. In the spirit of the "bootstrap" method described in 1.7 we introduce a sequence of roughly speaking - interpolate between Baire property properties, which and continuity. This sequence of properties gives a stairway to climb up from Baire property to continuity. First we will investigate the basic properties of the new notions. Then the regularity theorem will be proved. A refinement of the theorem is also proved. Finally, further properties of the new notions are investigated. These results have been published in JBrai [ I l l ] . -
20.1. Definition. Let X be a set, Y a metric space, and f : X + Y be a function. Let U be a topological space, and P a topological space, the "parameter space'' with a given point po E P. Let p be a function from U x P into X . We will think of p as a surface p, : u p ( u , p ) for each p, depending on the parameter p. The analogue of Lusin's Theorem 2.12 and generalizations of the theorem of Piccard (see 5 4) suggest that the following condition is connected with the Baire property: (S) For each sequence p, + po we have f (p(u,p,)) for a set of first category of points u E U .
-+
f (p(u,PO)) except
For our investigations we need the following property: (B) u e f ( p ( ~ , ~has ~ )the ) Baire property. We will often check conditions (S) and (B) locally. If for each uo E U there is a neighborhood Uo of uo and Po of po such that plUo x Po satisfies
5 20.
Between Baire property and continuity
205
(S), then cp satisfies (S). This easily follows from the locality of first category mentioned in 2.9. Similarly, if for each uo E U there is a neighborhood Uo of uo and Po of po such that cplUo x Po satisfies (B), then cp satisfies (B). Let X be an open subset of Rn and 0 5 k n. The class of all functions f for which the condition (S) [(B)]is satisfied whenever U is an open subset of Rk , P is an open subset of some Euclidean space, po E P, and cp : U x P -i/ X is a C1-function for which cp, is an immersion of U into X for each p E P, will be denoted by S k ( X , Y ) [Bk(X,Y)]or shortly by Sk [&I. (We take l@ = (0)). It is clear that f E Bk if and only if the condition (B') f
o
$ has the Baire property
is satisfied whenever $ is an immersion of some open subset U of Rk into X . 20.2. Remarks. (1) Our main results will show that, roughly speaking, solutions f of a functional equation from are also in S k . We will prove that Sois the class of continuous functions, and that all functions f : X -i/ Y from the open subset X c Rn into some second countable space Y and having the Baire property are in Sn. Hence, step-by-step, Baire property of solutions implies their continuity. (2) The analogy with the measure theoretical case is remarkable but not complete. About the history of the analogous measure theoretical notions see some references in the previous 5.
(3) Solutions of functional equations having the Baire property were studied by several authors. See the references in 5 9. (4) The class Sk [ak] remains the same if we suppose only that (S) [(B)] is satisfied whenever U is an open subset of R k , P is an open subset of some Euclidean space, po E P, and cp : U x P + X is a C1-function for which cp, is an embedding (i. e., an immersion that is a homeomorphism of its domain onto its range) of U into X for each p E P. This easily follows from the locality principle mentioned in the definition. Similarly, supposing only that cp,, is an immersion, the resulting class Sk [ak] remains the same. 20.3. Theorem. Let Y be a topological space and X an open subset o f Rn. Then using the notation o f 20.1, Bo(X,Y ) = YX and S o ( X ,Y) = C(X,Y) , t h e class of continuous functions from X into Y.
Proof. It is trivial that Bo contains all functions from X into Y.
206
Chapter VI. Regularity results with fewer variables
Now let us prove that any continuous function f : X -+ Y is in S o . Since t-, f ( p ( ~ , is ~ continuous )) for each
U = 0 or U = {0), clearly the function p u E U . This implies f E S o .
The converse is proved by contradiction: if f E S o , but not continuous, then there exists an xo E X , a sequence x, -+ xo, and a neighborhood W of f ( x o )such that f (x,) $! W . Let U = { 0 ) , P = X , po = xo, p(0,p) = p for p E P . Choosing the sequence p, = x, we have
hence we obtain a contradiction. We will prove that functions having the Baire property over an open subset X of Rn are in Sn. To make the connection with earlier results in 55 4, 9 clear, we do the main part of the proof in the following abstract setting: 20.4. Theorem. Let P, U ,and X be topological spaces. Suppose that p : U x P -+X is a continuous function with the following property: (1) If p E P and A category.
cU
is of second category, then pp ( A ) is also of second
Suppose, moreover, that po E P, the function f has values in a topological space Y and has the Lusin-Baire property on some set D which has the Baire property and contains cpp, (U). Then for U , P, po, p, and f the conditions (S) and (B) from 20.1 are satisfied.
Proof. Let us first prove that (B) is satisfied. Let F be a set of first category for which f ID \ F is continuous. We may suppose that F is a Borel set. Let us write D in the form ( V \ F l ) U F2 where V is an open subset of X and F l , F2 are of first category. Here also - enlarging F2 if necessary - we may choose Fl to be a Borel set. Now D = C U N , where C = V \ ( F U F l ) c D \ F is a Borel set and N = D n ( F U F2) is of first category. Let W be any open subset of Y. Since the set A = ( f lC)-' ( W )is relatively open in C , it is a Borel subset of X . The set N is of first category hence the set M = ( f I N ) - ' ( W ) is also of first category. Now let us observe that ( f O P P ) - ~ ( w ) = p p 1 ( A )U ~ p l w ) . On the left-hand side, p z ( A ) is a Borel set and by condition (I), the set cp,-d- ( M ) is of first category. This means that (B) is satisfied.
§ 20. Between Baire property and continuity
207
Now we will show that (S) is satisfied. Let uo be an arbitrary element Then there are neighborhoods Uo of uo and Po of po such of U' = cp;:(V). that Uo C U' and p(Uo x Po)c V. With the sets F and Fl above we have :; (F)and cp; (Fl)are of first category for m = 0 , 1 , 2 , . . . . Let Eo that 9 be the union of all these sets. If u E Uo \ Eo,then p(u,p,) and cp(u,po) are ) v ( ~ , P o )Hence . we have f ( ~ ( u , ~ m-+ ) )f ( Y ( ~ , P O ) ) . in C and ~ ( u , p m --+ Now let us consider the set
What we have proved until now is that E f l U' is of first category (as a subset of U) at each uo E U'. Since U' is open, this implies that E n U' is of first category at each point of U' if we consider U' as a subspace. Hence by 2.9 the set E n U' is of first category as a subset of U', hence as a subset of U, too. The set E \ U' is a subset of p,-d-(N), hence is of first category. This proves that E is of first category, which implies (S).
20.5. Theorem. We will use the notation of 20.1. Let X be an open P. If Y is a topological space, then every function f : X --+ Y subset of I having the Lusin-Baire property is contained in S, (X,Y) and 23, (X, Y) .
Proof. There is a subset F of first category of X such that f lX \ F is continuous. Let U c IWn be open, P an open subset of some Euclidean X a C1 function for which each p,, p E P space, po E P, p : U x P is an embedding. We will apply the previous theorem for p locally. Let uo E U. Choosing a neighborhood Uo of uo and Po of po such that cp, is a homeomorphism of Uo onto an open subset of X for each p E Po,we obtain that for any subset A of Uo which is of second category, the image p,(A) is also of second category. Now, the previous theorem can be applied to cplUo x Po. As it was mentioned at the definition this is enough to prove that (S) and (B) are satisfied for f , U, P, PO,p. 20.6. Theorem. We will use the notation of 20.1. Let Z, Zi (i = 1 , 2 , .. . , n ) be topological spaces. Let Xi (i = 1 , 2 , . . . ,n) and X be open subsets of Euclidean spaces and let Y C R1 be open. Let D be an open subset x 2,. Consider the functions f : X -+ Z , of X x Y and let W C D x Z1 x fi : X i + Zi, h : W -+ Z , g i : D + X i (i = 1 , 2, . . . , n ) . Let U c R' be open, P be an open subset of some Euclidean space, po E P, p : U x P -+ X a C1-function, for which p, is an immersion of U into X for all p E P, and suppose that the following conditions hold:
208
Chapter VI. Regularity results with fewer variables
(1) For each (x, y) E D we have
(2) (3) (4) (5)
for each fixed y E Y, h is continuous in the other variables; the function fi is in Skil on Xi(i = 1 , 2 , .. . ,n ) ; gi is C1 on D (i = 1 , 2 , . . . , n ) ; for each uo E U there exists a yo such that (cP(uo,po), yo) E D and the rank of the derivative of
at (uo,yo)i s k + l for each 1 5 i 5 n. Then condition (S) is satisfied for f , U, P, po, 9 .
Proof. Suppose that p, + pa. Let us choose open neighborhoods Uo, Po,Yo of uo, po, yo such that ( p ( u , p ) ,y) is in D whenever u E Uo, p E Po,y E Yo, moreover, the rank of the derivative of the mapping (u, y) ti gi(cP(u,P),y)is equal to k+l for allu E Uo,p E Po,y E Yo, and for 1 5 i 5 n. This is possible, because D is open, gi and cp are C1-functions, the rank is lower semicontinuous, and U x Y has dimension k $1, hence the rank cannot increase above k 1. Since the function f l is in Sk+1,we have that, except for pairs (u, y) E Uo x Yo from a set El of first category,
+
Now using that f 2 is in Sk+1we obtain that, except for pairs (u, y) E Uo x Yo from a set E2 of first category
etc. Finally, we obtain that except for a set E = Uy=lEi of pairs ( u , y ) E Uo x Yo of first category we have
§ 20. Between Baire property and continuity
209
for i = 1 , 2 , . . . ,n. By the theorem of Kuratowski and Ulam, except for a set of first category of y's from Yo, we have that the set of all u E Uo for which (u, y) E E is of first category. Fixing any such y, from the functional equation and from the continuity of h for fixed y we obtain that
except for a set of u's which is of first category. This is condition (S) with the function cpl Uo x Po. By the remark in the definition we obtain that (S) is satisfied. Example 19.8 from the previous 5 can be treated similarly here too. Just as there, we may conclude that the rank condition of the above theorem is not completely satisfying. For the generalization first we prove a theorem about condition (B). For the proof we need a lemma.
20.7. Lemma. We will use the notation o f 20.1. Let X be an open subset of Rn, Y a topological space, 0 5 k 5 n , and f E Bk(X,Y). I f $ is a C1 mapping of the open subset U o f Rm into X with rank k of the derivative everywhere, then f o $ has the Baire property. Proof. The lemma directly follows from the rank theorem. Indeed, the rank theorem implies, that for each uo E U there exists an open neighborhood Uo such that $lUo can be written as a o p o p. Here, with the notation I = 1- 1,1[, the mapping P is a diffeomorphism of Uo onto Imsuch that P(uo) = 0, the projection p of I" into In has the form p(xl,x2,. . . , x m ) = (xl, x2,. . . ,xk, 0 , . . . , O), and a is a diffeomorphism of In onto an open set Xo mapping 0 into xo = $(uo). Identifying the set I" {O} C In with I" we have that allk is an immersion, hence (f o ( aIk)) (V) has the Baire property for each open subset V of Y. Since p-l(A) has the Baire property for each subset A of Ikwhich has the Baire property, and ,W1(B) has the Baire property for each subset B of Imwhich has the Baire property, we obtain that f o ($lUo) has the Baire property. Now using locality mentioned at the definition of (B), we get the general case.
-'
20.8. Theorem. We will use the notation o f 20.1. Let Z be a topological space and let Zi (i = 1,2, . . . ,n) be topological spaces having countable bases. Let Xi (i = 1 , 2 , . . . ,n ) and X be open subsets of Euclidean spaces and let Y C R1 be open. Let D be an open subset o f X x Y and W C D x Z1 x 2,. Consider the functions f : X -+ Z , fi : Xi -+ Zi,
210
Chapter VI. Regularity results with fewer variables
h : W + Z , g i : D + X i ( i = 1 , 2 ,... , n ) . LetUCIFtk b e o p e n , $ : U + X be a C1 immersion of U into X , and suppose that the following conditions hold: (1) For each (x, y) E D we have
(2) (3) (4) (5)
for each fixed y E Y, h is continuous in the other variables; the function fi is in Bkt on Xi (i = 1 , 2 , . . . , n); gi is C1 on D (i = 1 , 2 , . . . , n ) ; for each uo E U there exists a yo such that ($(uo), Yo) E D and the rank of the derivative of
(.,Y)
++s ~ ( $ ( u ) , Y )
is ki on a neighborhood of (uo,yo) for each 1 5 i 5 n. Then u f ($(u)) has the Baire property.
Proof. Let us choose an open neighborhood Uo of uo and Yo of yo such that ($(u), y) is in D whenever u E Uo, y E Yo, moreover, the rank of the derivative of the mapping (u, y) H gi ($(u), y) is equal to ki for all u E Uo, y E Yo, 1 5 i 5 n. This is possible by condition (5). By the previous lemma we obtain that the mapping (u, y) H fi (gi ($(u), y)) has the Baire property. By the analogue of F'ubini's Theorem 2.13, except for a set Ei of y's from Yo which is of first category, the mapping u ri fi (gi ($(u), y)) has the Baire property on Uo. Hence, except for the set E = for all y E Yo the mapping
of Uo into W, has the Baire property. Since for any fixed y the function h is continuous in other variables, we obtain that for any fixed y E Yo \ E the mapping
has the Baire property. This means that u H f ($(u)) has the Baire property on Uo. Now by the locality principle mentioned at the definition of (B) the statement follows.
5 20.
Between Baire property and continuity
211
The following theorem is the key to the generalization 20.10 of Theorem 20.6.
20.9. Theorem. We will use the notation of 20.1. Let U C W , X and P be open subsets of Euclidean spaces, po E P, Y a metric space, cp : U x P -+ X a C1 function, for which rank cpb (u) = k for each u E U, p E P . I f f E B k ( X , Y )n S k ( X , Y ) ,then the condition (S)is satisfied for f , U, P, Po, and cp. Proof. Let uo E U, and let p j -+ po be a sequence. Since the rank of x R ~ such - ~ that cpLo (uo) is equal to k, we may write u as u = (ul, u2) E the determinant of
is not equal to 0. Hence there exists a neighborhood U1 x U2 C U of uo and a neighborhood Po of po such that the mapping
is an immersion of Ul for each u2 E U2,p E Po. Since f E S k , for each u2 E U2 there exists a subset F,, of Ul of first category such that if ul E Ul \ Fu,, then we have f (cp(u1,u2,pj)) -+ f (cp(u1,u2,po)). By the previous lemma, u ++ f ( ~ ( u , ~has ) ) the Baire property. Hence the set
has the Baire property (see 2.11). By the Kuratowski-Ulam Theorem 2.10 we obtain that its complement is of first category.
20.10. Theorem. We will use the notation of 20.1. Let 2 be a topological space and let Zi (i = 1,2, . . . ,n) be separable metric spaces. Let Xi (i = 1 , 2 , . . . , n) and X be open subsets of Euclidean spaces and Y c lX1 be open. Let D be an open subset of X x Y and let W c D x Z1 x . . x 2,. Consider the functions f : X -+ Z , f i : Xi -+ Zi, h : W -+ 2 , g i : D -+ Xi (i = 1 , 2 , .. . ,n).Let U c Rk be open, P an open subset of some Euclidean space, po E P, cp : U x P -+ X a C1-function for which each cp,, p E P is an immersion of U into X , and suppose that the following conditions hold: (1) For each (x,y) E D we have
212
(2) (3) (4) (5)
Chapter VI. Regularity results with fewer variables
for each fixed y E Y, h is continuous in the other variables; the function fi is in Skif l Bk, (i = 1 , 2 , .. . ,n); gi is C1 on D (i = 1 , 2 , . . . , n ) ; for each uo E U there exists a yo such that (p(uo,po),yo)E D and the rank of the derivative of
is ki on a neighborhood of the point (uo,po,yo) for each 1 5 i Then the conditions (S) and (B) are satisfied for f , U ?P , PO,p .
5 n.
Proof. From Theorem 20.8 it follows that condition (B) is satisfied by f , U, P, pol p. Let us fix an uo E U and let us choose a yo for uo by (5). Let us choose open neighborhoods Uo, Po, and Yo of uo, po, and yo such that (p(u,p ) , y) E D whenever u E Uo, p E Po,and y E Yo, moreover the rank of the derivative of (u1 Y) gi ( ~ ( uP), Y)
*
is ki on Uo x Pox Yo for each 1 5 i 5 n. Now the proof that condition (S) is also satisfied is exactly the same as in Theorem 20.6, but we have to use the previous theorem instead of the definition. 20.11. Conditions. In what follows we will use the notation of 20.1, but we will only investigate the situation, where X is a nonvoid open subset of IFS" and f maps X into a separable metric space Y, because we want to avoid any difficulties arising only from the poor topology of the range Y. 20.12. Remark. We will use the notation of 20.11. There is another kind of locality than the one treated after Definition 20.1. We have f E & ( X , Y) if and only if each xo E X has an open neighborhood Xo C X such that f \Xo E S k ( X o Y). , The "only if" part is trivial. To prove the "if" part we will use the notation of Definition 20.1. Let us note that for each point uo E U there exist open neighborhoods Uo and Po of uo and pol respectively, such that for xo = v ( u o, p o ) the set p(Uo,Po)is contained in Xo. This means that (S) is satisfied for pjUo x Po. Now from the locality principle in the definition we have that f E S k ( X , Y ) . The same locality is true (and the same proof works) for &.
5 20.
Between Baire property and continuity
213
20.13. The class Bk. We will use the notation of 20.11. Let
where { O , 1 ) is taken as a discrete space. It is easy to see that Ak is a a algebra, and a function f : X -+ Y is in Bk(X,Y) if and only if f - l ( V ) is in Ak for each open subset V of Y. Hence the investigation of Bk(X,Y) is reduced to the investigation of the a-algebra Ak. It is easy to see that A, is the class of all subsets of X having the Baire property and dois the class of all subsets of X . Each Ak contains the Bore1 subsets of X . We will prove that A E Ak if and only if Anrng $ has the Baire property in the subspace rng$ for any embedding $ of some open subset of PS'C into X. Indeed, if A E Ak then $-'(A) has the Baire property in dmn $ and since $ is a homeomorphism of dmn $ onto rng $, the set $ ($-I (A)) = A nrng $ has the Baire property in rng 4. Similarly, if A n rng q!J has the Baire property in (A) rng +, then $-I (A) has the Baire property in dmn $. By 20.2. (4) if has Baire property for any embedding $ of some open subset of into X then A E A k . Similarly, A E Ak if and only if A n rng$ has the Baire property in the subspace rng$ for any immersion $ of some open subset of Itk into X. Let us represent U = dmn$ as a countable union of open subsets Ui of U such that $IUi is an embedding of Ui into X. If A E Ak then $-I (A) has the Baire property in U, hence Ui n $-'(A) also has the Baire property in Ui. From this Ai = ($IUi) ($-'(A)) has the Baire property in $(Ui), i. e., Ai A V , c Fi for some relatively open subset V , of $(Ui) and some Fi which is of first category in $(Ui). Since Fi is of first category in rng$ too, UiFi is of first category in rng$. Since
we have that the symmetric difference of Anrng $ = UiAi and the a-compact set UiV, is of first category. This proves that Aflrng $ has the Baire property in rng $. In the other direction, if A fl $(U;) has the Baire property, then ($IUi)-'(A) has the Baire property in Ui, hence in U, too. Since this is true for any immersion $ of some open subset of !Rk into X , we obtain that A E A,+. Finally, A E Ak if and only if A f7 M has the Baire property in the subspace M for each pure Ic dimensional submanifold M of X . Indeed, if this is true, then in particular A n rng $ has the Baire property in rng $ for each immersion $ of some open subset of PS'C into X , hence A E Ak.
2 14
Chapter VI. Regularity results with fewer variables
On the other hand, each pure k dimensional submanifold M of X can be represented as the range of some immersion 4 of some open subset of EXk. Hence A n M = A n rng has the Baire property in M = rng 4. 20.14. Connections between Bk and S k . We will use the notation of 20.11. One of the simplest questions is, whether f E Bk implies f E S k . We know that this is true for k = n. If k < n then the characteristic function of the intersection of X and an appropriate k-dimensional plane is in Bk but not contained in S k . In the other direction, suppose, that f E S k . The question is, whether f E Bk is satisfied. This is trivial for k = 0. We will show that this cannot be proved in ZFC for 0 < k 5 n. Namely, we will give an example f under the continuum hypothesis for which f E S k but f $ Bk. By the famous results of Godel and Cohen, the continuum hypothesis is independent from the axioms of ZFC. This means that Bk C S k cannot be proved in ZFC.
20.15. Hierarchy of function classes belonging to different dimensions. We will use the notation of 20.11. Let us fix dimensions 0 5 k < I 5 n and let us investigate the connection between the classes Bk and S k and classes B1 and S l . We may hope that decreasing the dimension condition (S) becomes stronger. One of the only two positive results in this direction is that this is true for condition (S) under condition (B):
The proof of this statement is very similar to the proof of Theorem 20.9, therefore we do not repeat the argument. We will show by a counterexample under the continuum hypothesis that for k > 0,
ZFCF Bk
C
Bl U s l .
Similarly we will show by a counterexample under the continuum hypothesis that
Z F C F B k f l s k n ~Cl B 1 except for the trivial case k = 0. It is much easier to see that inclusions in the other direction do not hold in general. Although
B1 C Bo
§ 20. Between Baire property and continuity
215
is satisfied trivially, in general
This is shown by the characteristic function of a subset in the intersection of X and an appropriate k dimensional plane which does not have the Baire property in the given plane. The same example shows that
If we take the characteristic function of the intersection of X and an appropriate k dimensional plane, then we see that
We will show that
a, n skc a,.
Let us see the proofs.
20.16. Theorem. Under the conditions of 20.11 for 0 have 231 n Skc Bk.
< k < 1 5 n we
Proof. This is trivial for k = 0. Otherwise, let $I be an immersion of an into X. Let uo E U and let V be an 1 - k dimensional open subset U c -+ V be a linear subspace of Rn orthogonal to r n g g l ( u o ) . Let .ir : I!%-'" . for po = 0 we isometry, and let us define cp by cp(u,p) = $(u) $ ~ ( p ) Then = $. Let us choose open neighborhoods Uo and Po of uo and po, have ,cp, respectively, such that cp(Uo,Po)C X and cp is an immersion of Uo x Po into X. Since f E B1, the mapping (u,p) t, f (cp(u,p)) has the Baire property. Hence, by the analogue of Fubini's theorem (see 2.13), except for a set of first category, for all p E Po the mapping u ++ f (cp(u,p)) has the Baire property. Let us choose a sequence p, -+ po such that each u f (cp(u,pm))has the Baire property. By f E Bk we have that
*
has the for all u E Uo except for a set of first category. Hence u t, f ($(u)) Baire property on Uo, i. e., locally. This implies that f E Bk.
216
Chapter VI. Regularity results with fewer variables
For the following counterexamples we need Lemma 19.21. To understand the typical application of this abstract set theoretic lemma here, we may think of the case when X is the plane, T is the class of all diffeomorphisms mapping some open subset of the plane onto some other open subset of the plane, F is the class of all Borel subsets of the plane of second category, 6 is the class of all one dimensional C1 submanifolds of the plane, and n = c = N1.
20.17. Counterexample. Using the conditions o f 20.1 1, under the continuum hypothesis for 0 < lc 5 n we have SkP Bk.
Proof. We will give a function f E Skfor which f $ Bk. We want to apply Lemma 19.21. We will use only that the functions cp in the definition of Skare continuous and that by Remark 20.2.(4) we may suppose that the functions p, are one-to-one. Let T denote the class of all one-to-one functions r which can be represented in the form cp, o p ~ ' where , U is an open subset of IW" P is an open subset of some Euclidean space, and cp : U x P + X is a continuous function for which all p,, p E P is one-to-one. Since the cardinality of all pairs U, P is continuum and any continuous function cp is uniquely determined by the values on a countable dense subset, the cardinality of the class T is continuum. Let F denote the class of all subsets of X representable in the form $(G) where 0 # U C E%%s open, $ : U + X is an embedding, and G c U is a Borel subset of U of second category in U. Each element of F is a Borel subset of X , hence the cardinality of F is at most c (continuum). Moreover, by the theorem of Piccard, G - G contains a neighborhood of the origin, hence each elements of .F has cardinality c. Applying Lemma 19.21 with 6 = 0 we obtain a class of subsets X,, y E R of X . Our counterexample will be the characteristic function f of X o , i.e. X, for y = 0. If f were in Bk then, by the results of 20.13, for any embedding $ of (Xo) would be a Baire some nonvoid open subset U of @ t h e set A. = set. A. cannot be of first category, because then for a Borel set G c U \ A. of second category $(G) would not intersect with Xo. Similarly, if A. is of second category, then choosing a Borel set B c A. of second category we obtain $ ( G ) C Xo contradicting $(G) n X, f 0 and X, n Xo = 0 for each
r # 0. We will prove that f E S k . Let U be an open subset of Ikk, P be an open subset of some Euclidean space, po E P, and cp : U x P + X a C1 function
§ 20. Between Baire property and continuity
217
for which all cp, is an embedding. The set
{u E U : f
(cpPO(4)
#f(~~(4))
is equal to the set
For the mapping
T
= pp0 o
cppl this set is a subset of the set
If we suppose the continuum hypothesis, then this set is countable. 20.18. Counterexample. Using the conditions of 20.11, under the n we have Bk r\ Sk fl Sl G 01. continuum hypothesis for 0 < k < 1 I Proof. We will give an example of a function f E BknSknSLbut f !$ Bl. We want to apply Lemma 19.21. We will use that by Remark 20.2. (4) we may suppose that the functions cpp in the definition of Sl are embeddings. Let T denote the class of all one-to-one functions T which can be represented in the where U is an open subset of EXz,P is an open subset of some form cpp o Euclidean space, and cp : U x P + X is a function for which all p,, p E P is an embedding. Let F be the same as in the previous counterexample. Let 6 be the class of all Borel subsets of X which are contained in a union of countably many k dimensional submanifolds of X. We have to prove that G is T invariant. The domain of any r E T is an 1 dimensional submanifold of X. If a set G E G is contained in UjOO,lMj,where each M j is a k dimensional submanifold of X, then for each x E G n Mj n dmn r it is possible to find an E > 0 and a k dimensional submanifold Mj of d m n r such that for each y for which ly - x /< E we have y E X , moreover, that y E G f l M j i l d m n r if and only if y E G f l MI n d m n r . This proves that the Borel set G f l d m n r can be covered by countably many k dimensional submanifolds of dmn r . Hence the Borel set T(G) can also be covered by countably many k dimensional submanifolds. Since the topological dimension dim = ind = Ind of any subset G of a k dimensional submanifold is 5 k, the intersection of G with an 1 dimensional submanifold L of X is of first category in L. The same is true for any G E G, and, moreover, for the union G of any countable subfamily Go c 6 . This
cpgl,
218
Chapter VI. Regularity results with fewer variables
proves that for any F E F the set F \ G has cardinality c. Other conditions of Lemma 19.21 have already been checked at the previous counterexample. Applying Lemma 19.21 we obtain a class X,, y E R where each X , contains only countably many points from each G E 6 , but X , f l F # 0 for each F E F . Let f be the characteristic function of X o . Along the same lines as in the previous counterexample we get that f E Sl but f 4 BI. Since for any C1 embedding $I of an open subset of PS" into X the function f o q!~is zero except for a countable set, we get that f E Bk and f E S k , too. Hence the statement is proved. 20.19. Counterexample. Using the conditions of 20.11, under the continuum hypothesis for 0 < k < I 5 n we have Bk n S k P Bl U S l .
Proof. Let us apply Lemma 19.21 for the same T, F , and 6 as in the previous counterexample. We obtain a class X,, y E R where each X , contains only countably many points from each G E 6, but X , n F # 0 for each F E F . Let Z be an 1 dimensional plane which has a nonvoid intersection with X and let f be the characteristic function of the set Z n X o . Then f E Bk n S k , but f $ Bl and f $ S l .
21. BETWEEN CONTINUITY AND
DIFFERENTIABILITY In this 5 we prove general "continuity implies C"" type results for functional equations with "few" variables. The "C1 implies Cm" part holds for the general explicit nonlinear functional equation
with unknown function f without the strong rank condition in 1.18.(3) on the inner functions. This is equation (1) from Problem 1.18. The "continuity implies C1" part holds only for linear equations of the type
§ 21. Between continuity and differentiability
219
with unknown function f . In the spirit of the "bootstrap" method in 1.7 we introduce a sequence of properties, which - roughly speaking - interpolate between continuity and continuous differentiability. This sequence of properties provides us with a stairway to climb up from continuity to continuous differentiability. First we investigate the basic properties of the new notions. Then a "continuity implies C1" type theorem will be proved. A refinement of the theorem is also proved. Finally, a "C1 implies Cw" type theorem is proved. These results appeared in [112]. The main advantages of our method compared to ~wiatak'swhich can be applied to the equation
considered in § 18 are the following: We do not need the very strong condition that there is a yo such that gi(x, yo) x for 1 i n. This condition does not hold for most of the import ant functional equations. The somewhat artificial condition of hypoellipticity is also avoided. Our conditions - besides smoothness conditions for the given functions are only linear algebraic in nature. Seemingly, equation (3) of ~ w i a t a kis nonlinear, and our equation (2) is linear. However, substituting y = yo in equation (3) the conditions gi(x, yo) G x for 1 5 i 5 n yield
--
< <
) from this equa( ( tion, and substituting back into (3), after division by Cy=l hi(x, yo) we Expressing the term h x, f gn+l(z)) , . . . , f (gm (x))
obtain an equation of the type (2). Hence our methods can be applied to prove "continuity implies C1" for ~wiatak'sequation. We prove "C1 implies Cw" for the most general nonlinear functional equation ( 1 ) . Finally, generalizing our methods we may hope to obtain "continuity implies C1" type results for the most general nonlinear functional equation (1). This seems to be impossible using the method of ~ w i a t a kbased on Schwartz distributions. The applicability of the distribution method is restricted, because no multiplication among Schwartz distributions is
220
Chapter VI. Regularity results with fewer variables
defined. By Schwartz's impossibility theorem, this cannot be done in a satisfying way. It is even more hopeless to substitute distributions into general Cm functions with several variables. The distribution method has to be restricted to functional equations that are not very far from being linear. 21.1. Definition. The basic idea is to consider parametric integrals of the form
where f : X i Y is a function mapping the set X into the Banach space Y. Such a parametric integral is given by a parametric integration quintuple (P,U, w, cp, p ) , where P is the parameter space, p is a measure on U, the function w is a weight function w : U x P -+ $ and the function cp : U x P --+ X is considered representing a parametric family cp,, p E P of surfaces in X. We may consider a set P of parametric integration quintuples and to denote by F ( X , Y, P; (7) the class of all such functions f : X i Y for which for all quintuples from P the parametric integral (1) is in the class of functions 8. For our purposes a somewhat simpler setting will be sufficient. For simplicity, we may suppose that f is a continuous function mapping an open subset X of Rn into a Banach space Y. We will consider the set Pk of all integration quintuples (U, P, w, cp, p) for which U is an open subset of R k , the "parameter space" P is some open subset of some Euclidean space, p is the restriction of Lebesgue measure X~ to subsets of U, and w : U x P -+ R and cp : U x P + X are arbitrary functions satisfying some smoothness conditions. We use weight functions that are at least continuous and have compact support. The functions cp are supposed to be at least C1. Under such conditions the parametric integral
exists for each p E P and is continuous. Here integration is with respect to k dimensional Lebesgue measure. Theorem 11.1shows that if w E C1 and iff is merely continuous, but cp is twice continuously differentiable, this parametric integral will be continuously differentiable for k = n. If k = 0 then the continuous differentiability of such integrals becomes equivalent to the continuous differentiability of f . Roughly speaking, the continuous differentiability of such parametric integrals is a
5 21.
Between continuity and differentiability
221
stronger condition on f , the smaller k is, and provides us with a stairway to climb up from continuity to continuous differentiability. For convenience we introduce the following notation. Suppose that X is k n an integer, W is a an open subset of Rn, Y is a Banach space, 0 class of functions w mapping some product U x P (depending on w) into $ where U is an open subset of and P is an open subset of some Euclidean space, is a class of functions cp mapping some product U x P into X where, again U is an open subset of Rk and P is an open subset of some Euclidean space, and 6 is a class of functions mapping some open subset P of some Euclidean space into Y. Let
< <
denote the class of all continuous functions f : X -+ Y for which whenever w E W and cp E have the same domain U x P, the parametric integral (2) is defined for all p E P and is in the class 6. The function classes W, a, and 6 will be defined via smoothness conm and let Cm denote the class of all functions which ditions. Let 0 m are defined on some open subset of some Euclidean space, take values in a Banach space, and are m times continuously differentiable. Let Km be the subclass of Cm consisting of functions that have compact support. Let Zm denote the class of functions cp E Cm which map some Cartesian product U x P of open subsets of Euclidean spaces into a Euclidean space so that u t-+ cp(u,p) is an immersion for each p E P . Similarly, let Em denote the class of those functions p E Cm which map some Cartesian product U x P of open subsets of Euclidean spaces into a Euclidean space so that u e cp(u,p) is an embedding for each p E P . The most important classes for us will be &(X, Y, K1,Z2;C1), 0 5 k 5 n. We will often check the condition f E Fk(X, Y, K1,Z2;C1) locally. If for a given function cp : U x P -+ X for each uo E U and po E P there is a neighborhood Uo of uo and Po of po such that the parametric integral (2) is in C1 whenever the support of w is contained in Uo x Po,then for any w : U x P -+ R the parametric integral (2) is in C1. This easily follows using a partition of unity.
< <
21.2. Remarks. (1)Suppose that X is an open subset of Rn and let the Our main results will show that, roughly speaking, solutions range be P. f from F ~ + ~ ( X , I W ~ , K ~ are , Z ~also ; C in ~ )F ~ ( x , I I B " , K ~ , z ~ ; c ~We ) . will prove that the class To(X, Rm , Z2;C1) is the class of C1 functions, and that all continuous functions f : X -+ Rm are in Fn(X, Rm ,K1, Z2;C1). Hence, step-by-step, continuity of solutions implies that they are in C1.
xl,
222
Chapter VI. Regularity results with fewer variables
(2) This case is somewhat analogous with the measure theoretical and the Baire category case. About the history of the analogous measure theoretical notions see 5 19.
(3) As we mentioned in the introduction, it is a classic technique to use parametric integrals to prove regularity theorems for functional equations with unknown functions having one real variable. See the book [3] of Aczkl, 4.2.2, 4.2.3 for its history. The generalization for functional equations with unknown functions of several variables has been considered in § 11.
(4) If X is an open subset of R n , Y is a Banach space, and 0 5 5 5 n, then the class F k ( X , Y ,K1,Z2;C1)is equal to the class F k ( X , Y ,K1,E2;C1). This easily follows from the locality principle mentioned in the definition. 21.3. Theorem. Let X be an open subset of Rn . Then
Proof. In this case the parametric integral simply becomes the mapping ) f We may take P = X, p + cp(0,p) to be the identity p + ~ ( 0 , ~(cp(0,p)). mapping, and p + w(0,p) to be equal to 1 in a neighborhood of a given point xo = po E X . Then it follows that f is a Cvunction in a neighborhood of xo. Conversely, if f E C" then the mapping p t, w (0,p)f (cp(0,p)) is also in ck. Next we prove that continuous functions mapping an open subset X of Rn into Rm are in Fn(X, Rm ,K1, z2;C1). The proof depends on Theorem 11.1. 21.4. Theorem. Let X be an open subset of Rn, and let f : X -+ B P be a continuous function. Then with the notation of 21.1 we have f E ,T,(x,FS", K1,E2;C1).
Proof. Arguing coordinate-wise, we may suppose, that f is real-valued, i. e., that m = 1. Let P be an open subset of R3 and let U be an open subset of Rn. Let cp : U x P t, X be a function from z2.By the locality principle from the definition, it is enough to prove that any po E P and uo E U have neighborhoods Po C P and Uo C U, respectively, such that whenever
§ 21. Between continuity and differentiability
223
w : U x P + R is a C1 function having support contained in Uo x Po, the parametric integral
is in C1.If Uoand Poare small enough, then we can substitute x = cp(u,p) for each p E Po. We may choose a simplex S containing Uo. Then the integral above becomes F
for all po E Po.Hence, by Theorem 11.1 the integral is continuous. 21.5. Theorem. Let X and Xi, 1 5 i 5 n be open subsets of Euclidean spaces and let Y be an open subset of R1. Let D be an open subset of X x Y. Consider the functions f : X + Rm,fi : Xi + Rm, hi : D + R, gi : D - + X i(i = 1,2,... ,n).Let U C Rk be open, P be an open subset of some Euclidean space, po E P,cp : U x P + X a z2function, and suppose that (with the notation of 21.1) the following conditions hold: (1) For each (x,y) E D,
(2) hi is continuously differentiable for i = 1,.. . ,n; (3) the function fi is in F ~ + ~ ( X ~iC1,E2;C1), , I R ~ , ( i = 1,2,.. . ,n); (4) gi is C2 on D (i = 1,2,.. . ,n); (5) for each uo E U there exists a yo such that (cp(uo,po), yo) E D and the rank of the derivative of
(u, Y) ++ gi ( ~ (PO), u ,Y)
5 n. Then for any function w : U x P -+ X that belongs to iC1 the parametr at (uo,yo) is k + I for each 1 5 i
integral
is continuously differentiable on a neighborhood of po.
Chapter VI. Regularity results with fewer variables
224
Proof. Let us choose open neighborhoods Uo, Po, Yo of uo, PO,yo such that (cp(u,p), y) is in D whenever u E Uo, p E Po, y f Yo. We can also ensure that the rank of the derivative of the mapping (u, y) gi (cp(u,p),y) is equal to k 1 for all u E Uo, p E Po,y E Yo, and for 1 5 i 5 n. This is possible, because D is open, gi and cp are C2 functions, the rank is lower semicontinuous, and U x Y has dimension k+l, hence the rank cannot increase above k I. Now let w : Uo x Po + R be a K 1 function. Let us choose a K 1 function wo : Yo + R for which JYo wo(y)dy # 0. By (1) we obtain
+
*
+
Integrating both sides over Uo x Yo we obtain
Now the right-hand side is in C1. This proves that
is continuously differentiable on Po. For an arbitrary K' function w : U x P + R the statement follows using a partition of unity subordinate to an appropriate finite covering of the support of u e w(u,po). Example 19.8 from 5 19 can be treated similarly here, too. Just as there, we may conclude that the rank condition of the above theorem is not completely satisfying. For the generalization we need a lemma. 21.6. Lemma. Using the notation of 21.1, let X be an open subset of Rn, l e t 0 5 k < n , andlet
Let U and P be open subsets of Euclidean spaces and suppose that for the C2 mapping cp : U x P + X the derivative of the partial mapping
§ 21. Between continuity and differentiability
225
u t+ y ( u ,p) has rank not less than k at the point ( u o , p o )E U x P. Then there exists a neighborhood Uo o f uo and Po of po such that for any continuously differentiable function w : U x P + R having support contained in Uo x Po the parametric integral
is continuously differentiable on P . Proof. Denoting by u l the first k coordinates of u , and by uz the rest of the coordinates of u , we may assume that the mapping ul e y ( u l , u 2 , p ) has nonzero Jacobian at ( u o P , O ) .Let us choose a neighborhood Uo = Ul x U2 of uo and Po of PO such that this Jacobian is nonzero on Uo x Po. Then the mapping
is in C1. Integrating with respect to u2, we obtain the statement of the lemma. 21.7. Corollary. Using the notation of 21.1, i f 0 5 k 5 1 5 n then
The next theorem is our generalization of Theorem 21.5.
<
21.8. Theorem. Let X , Y , and X i , 1 i 5 n be open subsets of Euclidean spaces. Let D be an open subset o f X x Y . Consider the functions f : X + R m , fi : X i + R m , h i : D i R , g i : D + X i ( i = 1 , 2 , . . . , n ) . Let U C Rk be open, P be an open subset of some Euclidean space, po E P , cp : U x P + X an z2function, and suppose that (with the notation of 21.1) the following conditions hold: (1) For each ( x ,y) E D ,
(2) hi is continuously differentiablefor i = 1 , . . . ,n ; (3) the function f i is in .Fki(xi, IRm, K1,f 2 ; C 1 ) , ( i = 1 , 2 , . . . , n ) ; (4) gi is C 2 on D (i = 1,2,. . . , n ) ;
226
Chapter VI. Regularity results with fewer variables
(5) for each uo E U there exists a yo such that (cp(uo,Po),Yo) E D and the rank of the derivative of
at (uo,yo) is at least ki for each 1 5 i 5 n. Then for any function w : U x P -+ X that belongs to K1, the parametric integral P
"
W(",P)f (CP(U,P))du
is continuously differentiable on a neighborhood of po.
Proof. Let us choose open neighborhoods Uo, Po, Yo of uo, PO, yo, respectively, such that (CP(U,p), y) is in D whenever u E Uo, p E Po,y E YO. Moreover, we can ensure that the rank of the derivative of the mapping y) is not less than ki for all u E Uo, p E Po,y E Yo, and (u, y) +-+ ygi (cp(u,p), for 1 5 i 5 n. This is possible, because D is open, ggi and cp are C2 functions, and the rank is lower semicontinuous. Now let w : Uo x Po-+ R be a K1 function. Let us choose a K1 function wo : Yo -+ R, for which Syo w0(y)dy # 0. From (1) we obtain
Integrating both sides over Uo x Yo we obtain that
Now the right-hand side is in C1 by the previous lemma. This proves that
is continuously differentiable on Po. For an arbitrary K1 function w : U x P + R the statement follows by using a partition of unity subordinate to an appropriate finite covering of the support of u t, w(u, po).
227
§ 21. Between continuity and differentiability
Our last theorem is about higher order differentiability. Here the functional equation is allowed t o be nonlinear. 21.9. Theorem. Let X be an open subset of PSS. Let Y , Z and X i , Zi, 1 i 5 n be open subsets of Euclidean spaces. Let D be an open subset of X x Y . Consider the functions f : X + Rm, fi : Xi + Zi, h : D x Z1 x . . . x Zi + Z , gi : D - - + X i (i = 1 , 2 , . . . , n ) . Let r 2 1 an integer. Let U C Rk be open, P be an open subset o f some Euclidean space, po E P , cp : U x P + X an z2function, and suppose that (with the notation of 21.1) the following conditions hold: (1) For each ( x ,y ) E D ,
<
dFll . . .a:; h are continuously differentiablewhen(2) all partial derivatives aai; ever 0 la1 r , where a = ( a oa, l , . . . ,a,) E W+l and la1 = (3) all partial derivatives o f order r o f the function fi are in
<
<
~k~
~~'o
(xi,lRm,K 1 , f 2 ;c'),
(i = l , 2 , . . . ,n ) ;
(4) gi is Cr+l on D (i = 1 , 2 , . . . ,n); (5) for each uo E U there exists a yo such that (cp(uo,p o ) , y o ) E D and the rank of the derivative o f
at ( u o , y o ) is at least ki for each 1 5 i 5 n. Then for any function w : U x P + X that belongs to K 1 and for any multiindex a E Ns for which /a1= r , the parametric integral
is continuously differentiable on a neighborhood of po.
228
Chapter VI. Regularity results with fewer variables
Proof. Let 1 5 q 5 s , and differentiate equation (1) partially with respect to xq. We have, omitting the variables,
Here zi = (zi,j), X i = (xi,k), fi = (fi,j), and gi = (gi,k). The last equation shows, that whenever a E N3, la1 = l, the function f satisfies the functional equation
a"
for (x, y) E D. Here, if the q-th coordinate of a equals 1, and all other coordinates are zero, then
and
for some i , j , k . It is clear that h,,p has a continuous p-th partial derivative with respect to x whenever 0 5 ,8 5 n,, moreover fa,p maps some Xi into R, and all p - 1-st partial derivatives of f,,p exist. Repeating this process, we have by induction on la\,that if a E Ns, 1 5 la1 5 p, then
+
whenever (x, y) E D. Here h,,p : D + Z is continuous, its p 1 - lal-th partial derivative with respect to x is continuous moreover fa,p : Xi --+ R for
§ 21. Between continuity and differentiability
229
some 1 5 i 5 n, and all of its p - lal-th partial derivatives are continuous. Finally, g,,p = gi for the same i for which dmn f a j p = Xi. Now we use Theorem 21.8. Then we have that for any function w : U XP + X that belongs to K1, and for any multiindex a E Ns , the parametric integral
is continuously differentiable on a neighborhood of po.
Chapter VII.
APPLICATIONS
It is clear that the theorems of this book admit several possible applications. We give here some examples to illustrate how our results can be used. Further applications can be found in the book [50] of Ebanks, Sahoo and Sander.
22. SIMPLE APPLICATIONS 22.1. Cauchy's equation and its generalizations. Let us start with the well-known Cauchy equation. For references on this equation see the book of Acz6l [3]. Let f : Rn -+ Rm be a function and suppose that f satisfies Cauchy's equation
With the substitution t = x
+ y we have
Hence, if f is measurable on a set of positive measure or has Baire property on a Baire set having second category, then f is continuous by Theorem 8.3 or 9.2, respectively. Now, applying Theorem 1.30, we have that f is Cm.
Chapter VII. Applications
232
Similarly, let f : Rn + R be an unknown function and h : R2n+2+ R a given Cw function. Suppose that f satisfies the generalized Cauchy equation (3)
f ( x + y ) = h ( x , y , f ( x ) , f ( y ) ) if x , Y E R n .
For more about this equation see Aczkl [3], Sander [179], [180]. With the substitution t = x y we have
+
From Theorem 8.3 and 9.2, respectively, we see that every solution f of (4) which is measurable on a measurable set of positive measure or has Baire property on a Baire set having second category is continuous. Now we obtain from Theorem 1.29 that f is Cw. We can apply the same method to the equation (5)
f ( G ( x , y ) ) = h ( x , y , f ( x ) , f ( y ) ) if
X,Y
where G : Rn x Rn + IlB" is a given Cm function and for each to E Rn there exist xo, yo E W for which G(xo,yo) = to,
det
dG
-(xo, yo)
ax
dG
# 0 and det (xo,yo) # 0. dy
Then, with the substitution t = G(x, y), we have locally
where G(g(t,y), y) = t , and we can apply Theorems 8.3, 9.2, and 1.29. 22.2. Pexider's equation and its generalizations. At the XVIth International Symposium on Functional Equations, Wolfgang Sander raised the following problems (see [l82]).
Let f , g, and h be real functions, H : R2 + R a continuous function and suppose that
Are the following statements true? (1) I f g and h have the Baire property, then f is continuous; (2) I f g and h are Lebesgue measurable on a set of positive measure, then f is continuous;
(3) I f g or h is Lebesgue measurable, then f is continuous. I f ( I ) , (2) or (3) is true, is it possible to change the addition to a more general two variable function?
5 22.
Simple applications
233
All three problems and the generalization are solved if we prove that i f f , g, and h are functions mapping Rn into Rm and H : Rnx Rnx Rm x Rm -+ E P is a continuous function and the functional equation
is satisfied, where G : Rn x Rn -+ Rn is a given continuously differentiable function such that for each to E IWn and xo E Rn there exists a point yo E Rn for which
then the Lebesgue measurability of g on a set with positive measure or the Baire property of g on a set with second category implies that f is continuous. This solution of the problem of Sander was published in JBrai [79], [81]. With the substitution t = G(x,y) locally we have
where G(G*(t,y ) , y) = t, and we can apply Theorem 8.3 and 9.2, respectively.
22.3. Operator groups and semigroups. Solutions of Cauchy's exponent ial equation
mapping R or 10, m[ into the Banach algebra B(X, X ) of bounded linear operators of some Banach space X are called operator groups or semigroups, respectively. They are very important and they are well known from the famous book of Hille and Phillips [72]. As an example here we will show how some theorems about operator groups and semigroups follow from theorems about general functional equations. From Theorem 8.6 follows Theorem 10.2.3 of [72], which states that if an operator semigroup is almost separable valued and measurable, then it is continuous. Indeed, let n = 1, t = u v , y = u, f = f l = F, h ( t , y , z l ) = F (y)zl and let us apply Theorem 8.6. Similarly we obtain Theorem 10.2.3 of [72] stating that if f is strongly measurable, then it is strongly continuous. This means that if for any x E X we have s e F ( s ) x is almost separable valued and measurable, then for each x E X we have that s H F ( s ) x is continuous. Indeed, let n = 1, t = u v, y = u, f = f l = F(.)x, and
+
+
234
Chapter VII. Applications
h ( t , y l z l ) = F(y)xl. Theorem 9.3.1 and Theorem 9.8.1 of [72] treating the functional equation F(U V ) = H(F(u), F(v))
+
follow similarly and their conditions can even be considerably weakened. "Pexiderizations" of these equations can also be treated. Moreover even a more general version of the "measurability implies continuity" type results of chapter XXV of [72] may also be obtained from Theorem 8.6; see in connection with the Lie group case the results of the next example about D'Alembertls functional equation. It is clear that the full power of Theorem 8.6 is not used here and Theorem 8.5 is also enough to draw the same conclusion. The situation is quite different if we consider differentiability questions. Most of the results of the present book cannot be applied if the range of the unknown functions has infinite dimension. The differentiability results of the present book strictly depend on Rademacher's theorem stating that Lipschitz maps between Euclidean spaces are almost everywhere differentiable. This does not remain true if the range has infinite dimension. For example, if f maps 0 5 t 5 1 to the characteristic function of [0,t] considered as an element of L'[O, I], then this mapping is a Lipschitz mapping but is nowhere differentiable. A further illustration of the difficulties is the operator semigroup example given in 1721 after Theorem 9.3.1. Although recently the extension problem of Rademacher's theorem for infinite dimensional spaces became a very active research field, it seems to me that the situation is presently not sufficiently cleared up to find a general class of functional equations for which it would be possible to prove differentiability in the case of infinite dimensional domain and/or range. Hence presently if we want to prove differentiability theorems for solutions having values in an infinite dimensional space - similar to Theorem 9.9.1 of [72] about the generalized Cauchy equation - then we are forced to use the special form of the equation. 22.4. Translation equation. In 1989 Zdun proved the following theorem concerning the translation equation (1) (see [208], Theorem 1; cf. also [207], Theorem 1.1):
Theorem [Zdun]. Let X be a compact metric space, f : 10, m[ x X X, and suppose that f satisfies the functional equation
-+
and f is measurable with respect to the first variable and continuous with respect to the second one. Then f is continuous.
5 22.
Simple applications
235
Quite recently Guzik extended this result proving a "measurability implies continuity" type result for the more general equation n
f ( ~ + v , x =) ~ h i ( a i ( v , ~ ) , f i ( ~ , b i ( ~ ur ,~v)~) ])O, , m [X, E X . i=l He considered function f : 10, ca[x X -+ Y as a mapping t e f t = f (t, .) from 10, CQ[ into a function subspace of YX. (Similarly, fi, ai, and bi also can be considered mapping 10, GO[ into function spaces.) It is well known (see for example Bourbaki, [38], chapter X, § 3, 3, Theorem I ) , that if X is a compact metric space and Y is a separable metric space, then C(X, Y) with the metric of uniform convergence is separable. The following lemma (Guzik [62], Lemma 1) shows that the condition in Zdun's theorem is equivalent to the measurability of t e ft. Lemma [Guzik]. Let p be a measure on M , let A C M , let X be a compact metric space, and let Y be a separable metric space. A function f : A x X -+ Y is measurable with respect to the first variable and continuous with respect t o the second one i f and only i f the mapping a e f a = f (a, .) is a measurable mapping from A to C(X, Y ) .
Proof. One of the directions is obvious, because for any fixed x E X , the mapping z e z(x) from C(X, Y) to Y is continuous. To prove that a e f a is measurable, let Xo be a dense subset of X . It is enough to prove that if zo E C(X, Y) and r > 0, then the inverse image of the open ball centered at zo and having radius r is measurable. But
= {a E A : sup d i s t ( f ( a , x ) , r o ( x ) )< r } xEXo
hence the lemma is proved. Now observe that substituting t = u f t = f30ft-,,
+ v and s = v, from (1) we obtain
O<s
hence using that (zl, z2) e z1 o z2 is a continuous mapping of C(X, X ) x C(X, X ) into C(X, X ) Theorem 1.25 (or 8.3, 8.5, 8.7, 10.3) implies that t t, f t is continuous and hence Zdun's theorem follows.
Chapter VII. Applications
236
Similarly, from Corollary 8.7, the following theorem which is somewhat more general than Theorem 1 in Guzik [62], can be derived:
Theorem [Guzik]. Let Al, A 2 , .. . ,An be sets, X , X I , . . . ,Xn be compact metric spaces, let Y be a normed space, and let Yl,. . . ,Yn be separable metric spaces. Let f : ]O,m[ x X + Y, fi : ]O,m[ x Xi + Y,, ai : ] O , m [ x X -+ Ai, bi : ] O , m [ x X -+ Xi, and hi : Ai x Y, -+ Y (i = 1 , 2 , . . . ,n ) be functions. Suppose, that (1)
(2) for each O < t < m the function f t = f ( t ,.) is continuous; (3) for each xi E Xi the function fi,,i is Lebesgue measurable and for each O < u < cc the function fi,, is continuous; (4) for each 0 < s such that
<m
and 1 5 i 5 n there exists a Lipschitz constant Li,,
for all yi,y,! E Y, and x E X . Then the mapping t H f t E C(X, Y) is continuous; in particular, the function f is continuous. Indeed, substituting t = u
+ v, s = v, from (1) we obtain
where D = { ( t , s ) : O < s < t < m}. Let Z = C ( X , Y ) and Zi = C(Xi,Y,) with the distance of uniform convergence. Let
and let h : W
-+ Z be defined by
5 22.
237
Simple applications
by the functional equation and (2) we have that h (t, s , zl ,z2, . . . ,zn ) is in Z . To apply Corollary 8.7 we only have to check that for any fixed 0 < s < cc the function h is continuous. But
whenever (t, s, zl, . . . , z n ) and (t', s, x i , . . . ,zh) are in W. 22.5. D'Alembert's functional equation. Let G be a locally compact group and let H be a Hausdorff topological ring, f : G i H. D'Alembert 's functional equation
(1)
f (xy)
+ f (xy-I) = 2f (x)f (y)
whenever
x, y E G
was studied by Kurepa [133] and Baker [29] in the case where G = R or G = Rn and Szkkelyhidi [I931 in the case G is a commutative group. From (1) with the substitution t = xy-I we obtain (2)
f (t) = 2f (ty)f (y) - f (ty2)
whenever
t , y E G.
Theorems 17.1 and 17.2 directly imply the following
Corollary. If G is a Lie group, then every solution f of (2) that is Lusin measurable or has Lusin-Baire property is continuous. Proof. Taking Y to be a symmetric open neighborhood of the origin on which y t, y2 is a submersion (see Dieudonnk [49], (19.8.5), (19.8.6)) and
T=X1=X2=X3=A1=A2=A3=G,D=TxY,Z1=Z2=Z3=H, h ( z l , ~ 2 , 2 3= ) 22122 - 2 3 , g l ( t , ~ = ) 3, g 2 @ , ~=) t ~ g ,3 ( t , ~ = ) t ~ 2 we , can apply Theorems 17.1 and 17.2 to conclude that every solution which is Lusin measurable or has Lusin-Baire property, is continuous. Of course, using 8.6 and 9.4, the corollary above can be generalized for the case
where h is continuous and fi is Lusin measurable or has Lusin-Baire property, gi : G -+ G is continuous, ki E N,and ki > 0 for i = 1 , 2 , . . . , n.
238
Chapter VII. Applications
We will consider here another direction of generalization (see JBrai [79], [102]) for an arbitrary locally compact group satisfying the following condition: (4) G has a compact subset K with positive left Haar measure such that for any compact subset C C K with positive left Haar measure the left Haar measure of the set {x2 : x E C ) is positive too. Condition (4) is fulfilled in many important locally compact groups, but not in all of them. For example, if G = {-1,l) with multiplication and discrete topology, and n is a cardinal number, then Gn satisfies (4) if and only if n is finite. If G1 and G2 satisfy (4) with subsets K1 and K2, then G = GI x G2 satisfies (4) with K = K1 x K2. Indeed, if C is a compact subset of K and X(C) > 0, then by F'ubini's theorem there exists a compact subset C1 of K1 such that X1 (C1) > 0 and X2{x2 : ( X I , x2) E C ) > 0 for every X I E C1. Hence X2{x$ : ( z l , x 2 ) E C ) > 0 for every X I E C1, thus with the notation B = {(x?,x;) : ( x l , x 2 ) E C ) for every element yl of the set B1 = {x? : xl E C1) with positive measure the set {y2 : (y1,y2) E B ) has a positive measure, i. e., B has a positive measure. Each Lie group satisfies this condition. Let G be an n-dimensional Lie group with unit element e. It is not hard to prove using some theorem on the left Haar measure of Lie groups (see Chevalley [39]) that there exist open subsets U, V, and a homeomorphism cp of U onto an open subset of Rn so that e E V C U, ~ ( e =) 0, the mapping (x, 9 ) u cp (v-'(x) . (v-' (y-'))-l) is an analytic mapping of p ( V ) x y ( V ) into p(U) and that for every compact subset C of V the left Haar measure of C and the Lebesgue measure of cp(C) vanish together. Since the Jacobian of the mapping
of v ( V ) into y(U) is equal to 2n at 0, we have that there exists an open neighborhood W of e in G so that W2 C V and the Jacobian over p ( W ) is not less than 1. Hence by the formula on transformation of integrals for any compact subset C of W the Lebesgue measure of cp({x2 : x E C}) is not less than the Lebesgue measure of cp(C). Hence G satisfies condition (3) with any compact subset K of W which has a positive left Haar measure. From Theorem 8.2 we can deduce the following Corollary. Suppose, that G is a locally compact group satisfying (4). Then every Lusin measurable solution of (2) is continuous.
§ 22. Simple applications
239
Proof. Let X denote a left Haar measure on G. Let to denote an arbitrary element of G. We prove, that f is continuous at to. Let T be a compact set containing a neighborhood of to, let K be the set from (3), D = T x K , ) -z3, and and let Y = X1 = X 2 = X 3 = G, h ( t , y , z o , z l , z 2 , ~ 3=2z1z2
We will use Theorem 8.2 with sets Al = A2 = A3 = G. The only non-trivial fact is that the last condition is satisfied by g3. Since if B c K then
for all t E T, it is enough to prove that for each E > 0 there exists a 6 > 0 such that B c K and X(B) 2 E implies X{y2 : y E B) 2 6. Suppose, that this is not the case. Then there exists an EO > 0 and for every natural number n an open subset U, of G for which X(U,) < 112, and
Since the mapping y e y2 is continuous, the sets {y : y2 E U,) are open, and so the sets {y : y2 E U,) n K are measurable. Let
X(A,) < 1/2"-l, and the sets {y : y2 E A,) Then A1 3 A2 > measurable with finite measures which are not less than EO.If .
then X(B)
.
a
,
are
2 ~o > 0, but
If C is a compact subset of B with positive left Haar measure we have a contradiction with (4). Hence the proof is complete. Of course, this corollary can be generalized also for equation (3), but then we have to suppose that
(5) G has a compact subset K with positive left Haar measure such that for any compact subset C C K with positive left Haar measure and for any exponent ki in (3) the left Haar measure of the set {xh : x E C ) is positive too.
240
Chapter VII. Applications
Similarly as above for (4),it follows that (5) is satisfied on Lie groups. In a rather general situation, we can even directly deduce from equation (2) that f E C". More generally, from Theorem 1.37 we can obtain the following
Corollary. Suppose that G is a connected Lie group, X is a C" manifold, and f : G + X satisfies the functional equation
where h : G2 x Xn + X is C" and E i E (0, I), E (-1, I), ki E N,kki > 0 for i = 1 , 2 , .. . ,n. Then i f f is measurable or has the Baire property, f is C".
Proof. Let g denote the Lie algebra of G and let K = maxi ki. There exists an open neighborhood Vo of 0 E g such that the exponential mapping is a diffeomorphism of this neighborhood onto some open neighborhood Uo of e in G (see Dieudonnd [49], (19.8.5), (19.8.6)). Let us choose a convex symmetric neighborhood V of 0 in g such that KV C Vo. Then with the notation U = exp(V) we have = exp(V) and the mapping y H y q s a submersion in each point of U whenever 0 < k 5 K . By Dieudonn6 [49], (19.9.11) for any m > 1 we have
for sufficiently small llvj I/, j = 1,2, . . . ,m. Hence by further shrinking V if necessary we may suppose that all of the mappings
j = 1 , 2 , .. . , K map K V into some compact subset C' of exp(KV). We want to apply Theorem 1.37. Let us fix a k 2 2K and let X = uk, Y = U. First let us observe that since the left shifts and the inversion i)'~ are diffeomorphisms, for any xo E X all the mappings y t-$ ( ~ Z ~ y ~are submersions at any point y E Y and for any i = 1 , 2 , .. . ,n.
5 22.
Simple applications
241
-k-K
Let C = U C' U c'-'B~-'. Clearly C is a symmetric compact set. Let us choose an open neighborhood W of e for which C'W C exp(KV). Then -k-K -k-K-K U C'W c exp(KV) C U U = 0' C 5.
u ~ - ~
-k-K
-k-K
But since U C'W C Uk, and hence C c X = C'W is open, we have U uk. Let zo E X and let us represent xo as xo = ~ 1 x - 2- . xk, where x j E U for j = 1 , 2 , .. . , k. For each j let uj be the unique element of V for which exp(uj) = xj. Let us define
Clearly, yo E Y and if ci = 0, then
If
E~
=
1, then by the choice of C' we have
Since C-I = C we obtain that (xEiyki)siE C for each i = 1 , 2 , .. . ,n. Let us take for each (xo,yo) an open neighborhood Wo C X x Y of this i X and let D be pair which is mapped by all maps (x, y) t+ ( x E i y h ) binto the union of all these Wols. With this setting, Theorem 1.37 can be applied and implies that f is infinitely many times differentiable on X = Uk. Since .G is connected, G = u -~ > ~ hence ~ u the ~ corollary , is proved.
22.6. Example. Let f : Rm -+ LQk be an unknown function and let Ai2
Bi (i = 1 , 2 , .. . , n ) be non-singular matrices with m rows and m columns, ci E
R. The variants of the functional equation
were studied by many authors, for references see Szdkelyhidi [191]. By Theorem 1.30 every solution of (1) that is measurable or has Baire property, is C". Much more general results can be obtained using the special form of the solutions; for details see the books [193], 11941 of Szdkelyhidi.
Chapter VII. Applications
242
22.7. A functional equation from the spectral theory of random fields. At the XXXIIth International Symposium on Functional Equations, in his talk Kboly Lajk6 [I371 asked whether the measurable solutions of the functional equation
are locally bounded. (He investigated the continuous and the locally bounded measurable solutions of this equation.) We show how it follows from our general results that the measurable solutions are continuous, hence locally bounded. (See [loo]). There exists a compact set A with positive measure such that B(2a) # -1 if a E A because otherwise for any fixed t the left-hand side would be 0 almost everywhere and the right-hand side would be 2 almost everywhere. Hence ~ ( t =)
1
( ~ (+t a ) + ~ +B(a) B(2a)
(- at ) ) ,
a€A,t ER
and Theorem 8.1 can be applied to prove that B is continuous everywhere. The same question was also proposed under the restriction t 2 a. In this case the existence of A can be used only to derive the continuity of B at t > sup A, hence we have to prove that there exists such a set A in any neighborhood of -m. If this were not the case, B would be -1 almost everywhere in a neighborhood of -m. With the substitution x = t - a we get from the original equation
Fixing any x 2 0, the terms B ( a ) , B (2a), B ( x + 2a) are - 1 almost everywhere in a neighborhood of -m, hence B ( x ) = 1. Now fixing an a for which B ( a ) = - 1 and B (2a) = -1, we get a contradiction for large x. 22.8. The fundamental equation of information. Let f : ]0,1[ -+ R be an unknown function and
The functional equation
5 22.
Simple applications
243
whenever (x, y) E D plays an important r61e in axiomatic information theory; see e. g., Acz6l and Dar6czy [19], Maksa [151], and Acz6l [6, 111. From Theorem 1.30 we get that any solution of (1) which is measurable or has the Baire property is CW. TO solve equation (1) let us differentiate both sides with respect to x and y; this will kill two of the terms:
Introducing the notation g(t) = tfl1(t) we have
Substituting u
= y / ( l - x)
and v
= x / ( l - y)
we obtain that
(1 - u)g(u) = (1 - v)g(v) whenever 0
< u, v < 1
This implies that both sides are constant, say c, i. e., g(t) = c / ( l - t). Hence C
f "(t) = ---
t(1 - t ) '
which implies f ( t ) = c(t1nt whenever 0 < t
+ (1 - t ) l n ( l
-t)) +clt +c2
< 1. This function satisfies equation (1)if and only if c2 = 0.
22.9. The generalized fundamental equation of information. Let
be unknown functions, a E R an arbitrary constant, and
The functional equation
Chapter VII. Applications
244
whenever (x,y) E D is the so-called generalized fundamental equation of information of degree a. This equation also plays an important rBle in axiomatic information theory; see for example Sander [I841 and Acz6l [ l l ] . From Theorems 8.3 and 9.2 we get that if f l and f 2 are Lebesgue measurable or have the Baire property, then f o and f 3 are continuous. Let us apply substitutions x = v(1 - u ) / ( l - uv) and y = u ( l - v ) / ( l - uv) to get equation
and we may conclude that f l and f 2 are continuous, too. By using Theorem 11.3 we similarly obtain that f o , f l , f2, and f3 are C1, and by applying Theorem 15.2, that they are in Cm. Exactly as for the fundamental equation of information we obtain
+
where gi(t) = tf,ll(t) (1 - a)f,l(t), i = 1,2. Hence similarly as for the fundamental equation of information we obtain that in the case a = 1 we have f i ( t ) = c(t1nt (1 - t) ln(1 - t ) ) ~ i , l t ~ , 2 i, = 1,2. Let hi(t) = fi(t) - c(t In t (1- t) In(1- t)) for i = 0,3. Using that the equation is linear and satisfied by f o ( t ) = f l (t) = f 2 ( t ) = f3(t) = c(t In t (1 - t ) ln(1- t ) ) , we obtain that hi(t) = c ~ , ~ ~ + i. e., c ~fi(t) , ~= , c(t ln t + ( l - t ) ln(1-t)) + ~ , ~ t + c ~ , ~ for i = 0,3, moreover that co,l = c1,2+~2,1,c3,1 = c1,l +c2,2, and c o , 2 + ~= ~,~ c2,2 ~ 3 , have 2 to be satisfied to obtain the most general solution in the case a = 1. ~ , l t " ~ i , 2for In the case a # 1 we obtain that fi(t) = c(1 - t)" i = 1,2. Substituting into the equation we obtain that f o ( t ) = -cl,2(l t)" c2,1ta C O , ~and f3(t) = -c2,2(l - t)" cl,lta c3,2 with c0,2 = c3,2 gives the most general solution in the case cu # 1. Note that equation (1) has importance also in the case, where x, y E Rn are vectors, 0 < x, y, x y < 1 (operations and relations are understood coordinatewise). For regular solutions in this case and for further references see JBrai and Sander [118].
+
+
+
+
+
+
+
+
+
+
+
+
+
22.10. The dilogarit hm equation. At the XXVIth International Symposium on Functional Equations in 1988, JAnos Aczd showed how we can use regularity theorems to obtain the locally Lebesgue integrable solutions of the
5 22.
245
Simple applications
functional equation
where 0 < u, v < 1 and f : ]0,1[ + R. Lebesgue integrable solutions (if u = 0 and v = 0 are also permitted) were originally determined by Dar6czy and Kiesewetter [43] and on the open domain ] O , 1 [ by Aczkl [13]. From Theorem 1.30 it follows that all solutions which are Lebesgue measurable or have Baire property are CCO (see [go]). To solve equation (1) we follow Dar6czy and Kiesewetter [43] and Aczd [14]. Let us differentiate with respect to u. We obtain
Introducing the notation g(t) = t(1 - t ) f ' ( t ) , and the new variables x = 1-u, y = 1 - (1 - u ) / ( l - uv) we obtain that the functional equation
holds whenever 0 < x, y, x + y < 1. Hence exactly the same way as for the fundamental equation of information we obtain that g(t) = c(t l n t
+ (1
-
t ) ln(1 - t ) )
+ clt + c2
has to be satisfied. But this function only satisfies equation (3) if c2 Hence we obtain f (t) = c(dilog(t) - dilog(1 - t ) )
t2 + cl In + co, 1-t
where
is the dilogarithm function. A simple calculation shows that
=
-2cl.
Chapter VII. Applications
246
satisfies equation (1). Let
A simple, but tedious computation shows that f a satisfies equation (2). This means, that
is a constant for any fixed 0 < v < 1 as a function of 0 < u < 1. Taking limit u -+ 0 and using that dilog(0) = 7r2/6, we obtain that this constant is 7r2 /2 5a, independent of v. Hence if a: = -7r2 110, then f, satisfies (1). Finally, because the only constant function satisfying (1) is zero, we obtain, that the (regular) solutions of (1) are the functions
+
with arbitrary constants c, cl. As Acz4 explains in [14], a manuscript of Abel contains the system of functional equations
(4)
f (5'") 1-xl-y
and f (x)
+f(1
-LC)
=
C - l n ( x ) l n ( l - x),
and Kiesewetter reduced them to the above equation (1) (with different f ) . Note that if 1 is in the domain, then the second equation follows from the first with y = 1 - x, C = f (1). It causes no problem to solve equation (4) alone and directly, even on the open domain 0 < x, y, x y < 1. From Theorem 1.30 it follows that all solutions which are Lebesgue measurable or have the Baire property are Cm. Let us differentiate with respect to x to obtain
+
§ 22. Simple applications
247
Differentiating again with respect to y and substituting x = v(1- u ) / ( l - uv) and y = u(1 - v ) / ( l - uv) and introducing the new functions g(t) = f t ( t ) tft1(t) and h(t) = (1 - t)2g(t)we arrive at
+
To solve this equation let us differentiate with respect to u and v to obtain h"(uv)uv + ht(uv) = -1. This means that hl'(t)t hl(t) = -1 is satisfied everywhere on ] O , l [ . Hence we obtain that h(t) = cln(t)-t+cl. Substituting into (6) we see that cl = 1. Now we obtain that
+
ln(1-t)
lnt
+t
-
and hence that f (t) = c(dilog(t) - dilog(1 - t ) )
+ dilog(1 - t ) + c' ln(t) + c".
Substitution shows that dilog satisfies (5). Hence substituting into (4), the difference of the two sides is constant for every fixed y. Taking limit x + 0 we obtain that this constant is zero for every y, hence dilog satisfies the inhomogeneous equation (4). Similarly we obtain that
satisfies the corresponding homogeneous equation. Since 1 and In do not satisfy the homogeneous equation, we conclude that the most general (regular) solution of (4) is f (t) = c(dilog(t) - dilog(1 - t ) -
-)6 + dilog(1 - t). IT2
Note that the "Pexiderization" of equations (1) and (4) can be treated similarly. 22.11. The equation of the (2,2)-additivity. Let f : ] O , 1 [ an unknown function. The functional equation
f (xy) (1)
+f
+ R be
(T)+ f (*)
( 5+) f
1-Y 1-Y = f(x)+ f(1-x)+ f(x)+ f(1-y) whenever
O<x,y
<1
Chapter VII. Applications
248
is also useful in information theory, see the survey paper of Aczkl [Ill. The measurable solutions of this equation were determined by Dar6czy and JBrai in [42]. An important step in this paper shows that every measurable solution of (1) is C". With the substitution t = xy we obtain from (1)
Hence, by Theorem 1.30, every solution of (1) which is measurable or has Baire property is C". To solve equation (1) let us differentiate with respect to x and y to "kill" all terms on the right-hand side. Introducing the notation g(t) = f '(t) +t f "(t) we obtain
Further calculations to get a differential equation are somewhat tedious, hence it is better to use a computer algebra system, for example Maple@. For detailed calculations see Maple worksheet 1. First let us apply the dif(1 - y) to "kill" the second term. This ferential operator (1 - x) d
;
4
results in an equation containmg (y - 1)g (x(1 - y)). To "kill" this term we d .After (1 - y) T divide by (y - 1) and apply the differential operator xzd Y multiplying by y - 1 and introducing the new variables u = xy, v = y(1 - x) ~ arrive at and multiplying by (u v ) we
+
+
0 =(v2 - u2)(gl(u) - g1(v)) (4) -
v((u
+ u ((u +
+ v)2 - v)g1I(v),
0
- u)gll(u)
< u, v, u + v < 1.
This functional-differential equation can be solved by a simple application of d d d2 the principles detailed by PBles's method: Applying id, - - and d u ' dv' du2 to (4),and solving the first three equations for gl(v), gU(u),and gl"(u) and putting the results into the fourth equation we obtain
@ ~ a ~is la eregistered trademark of Waterloo Maple Inc.
§ 22. Simple applications
249
> FE:=g(x*y) +g( (1-x) * (1-y) ) -g(x* (1-y) ) -g(y* (1-x) ) ; >
>
FE:=g(xy) + g«l-x)(l-y))-g(x(l-y))-g(y(l FE2:=simplify((1-x)*diff(FE,x)-(1-y)*diff(FE,y));
FE3:=simplify((x*diff(FE2/(y-1),x)+(1-y)*diff(FE2/(y-1)#y))); -(D{2))(g)(xy)x2-2D(g)(xy)x-2D(g)(-y(-l+x))x-2%l
%l:=(D(2))(g)(-y(-l+x)) > FE4:=simplify(subs(D(g)=gl,D(D(g))=D(gl),FE3)*(y-1)); g\(xy)-D(gl)(-y(-\+x))y-D(gl)(xy)x2-2gl(xy)x-2gl(-y(-l+x))x
+
-2D(gl)(-y (-1 +x))x + D(gl)(-y (-1 + x))x2
> FE5:=simplify(subs(x=u/y,y=v+u,FE4)*(u+v) A 2); FE5:=-D(gl)(v)u2v-D(gl)(u)u2-2D(gl)(v)uv2 3
+ D(gl)(u)uv2 + 2D(gl)(u)u2v
2
2
+ D(gl)(u) u + gl(v) v - gl(v) u +D(gl)(v) v + gl(M) v2 - gl(K) u2 -D(gl)(v) v3 [ > DEs:=convert([FE5,diff(FE5,u) ,diff(FE5,v) , d i f f ( F E 5 , u , u ) ] ,D) : [ > Es:=subs(D(gl)=g2 / D(D(gl))=g3,D(D(D(gl) ))=g4,DEs) : > sols:=solve({op(Es[l..3])},{gl(v),g2(v),g3(v)}); 1 g3(u) u - g3(ii) u v2 - 2 g2(M) v2 1 =- - ,gl(v) = 2 v 2 2g2(u)v2-g3(u)uv-2g2(u)v g3(v) = >
g3(u)u\ v
E:=simplify(subs(op(sols),Es[4]));
E := 6 g3(w) uv- g4(w) u - 3 g3(u) u + g4(w) w v2 + 3 g3(w) v2 + 2 g4(u) u v + g4(w) w3
> DE:=simplify(subs(g4=D(g3),E)/((u+v)A2-u)); DE:=D(g3)(u)u
Maple worksheet 1
(Note that for technical reasons in Maple calculations g', gn', etc. are replaced by 9i, ^2^ etc., respectively.) This means that (6)
g""(u)u + 3g"'(u) = 0
whenever 0 < u < 1.
Solving this differential equation we obtain that (7)
g(u) = clnu + C2U2 + c\u + CQ.
Chapter VII. Applications
250
Using that In and 1 are solutions of the linear equation (3) we easily obtain that cq = -c1 have to be satisfied. By the definition of g we obtain that
Using that t H t ln t and t M 1 satisfy the linear equation ( I ) , and putting back t M 4t3 - 9t2 cst c3 In t into (1) we obtain that the most general regular solution of (1) is
+
+
f (t) = ct l n t + b(4t3 - 9t2 + 5t) + a with arbitrary constants a , b, c. Another way to conclude (7) from (4) is to apply Suto's method (see 1.6). Let us rewrite (4) in the form
as sum of products of u-function times v-function. Let 0 < E < 1 and let us consider this equation for u E 10, 1- E[ and v E 10, E[. The rank of the system of u-functions plus the rank of the system of v-functions is at most 6. But 1, u, u2, and 1, v, v2, too, are linearly independent, hence both ranks are exactly 3. This means that the functions
are polynomials with degree at most 2. Hence
holds for every interval ]O,1 - E [, 0 < E < 1, and therefore on all of ] O,1[. We see that g(u) is a linear combination of 1, u, u2, lnu, u lnu. If the coefficient of the last term were not zero, then gl(u)+ugl'(u) could not be a polynomial. Hence we arrived at (7).
§ 22. Simple applications
251
> with(linalg):FE0:=g(x*y)+g((1-x)*(1-y))-g(x*(1-y))-g(y*(1-x)); FEO:=g(xy) + g((l-x)(l-y))-g(x(l-y))-g(y(\-x)) > s : = 4 : t : = 3 : D E s : =[(diff(FEO, [x$i =j + l . . k f y $ i = l. . j ] ) $ j = 0. ,k)$k=0.
.s]
[ > FEs:=subs(g=gO, ' (D@@i) (gO)=g.i'$i = l. .s,DEs) : [ > LEs:=subs(x*y=zO, (1-x)*(l-y)=z3,x*(l-y)=zl,y*(1-x)=z2,FEs) : > r:=nops(LEs);H:=matrix(r,(s+1)*t,['''coeff(LEs[k],g.i(z.j))'$i=0 ..s'$j=l. .t'$k=l. .r]) :Hs:=matrix(r, (s + 1)* (t + 1) , [ ' ''coeff(LEs[k] , g.i(z.j))'$i=0..s'$j=O..t'$k=l..r]): r:=15 > nops(LEs):(s+l)*t:(s+1)*(t+1):rank(H);rank(Hs); 13 14 > R:=[15=NULL]:RH:=convert(subsop(op(R),convert(H,listlist)),matri x):RHs:=convert(subsop(op(R),convert(Hs,listlist)),matrix): > rank(RH);rank(RHs); 13 14 > RR:=[14=NULL]:RRH:=convert(subsop(op(RR),convert(RH,listlist)),m atrix):RRHs:=convert(subsop(op(RR),convert(RHs,listlist)),matrix ) : \> rank(RRH);rank(RRHs); 13 13 > RLEs:=subsop(op(R),LEs):RRLEs:=subsop(op(RR),RLEs):unkns:={''g.i (z.j)'$i=0..s'$j=l..3}:sols:=solve({op(RRLEs)},unkns):simplify(s ubs(sols,RRLEs)); [0,0,0,0,0,0,0,0,0,0,0,0,0] > ind:=l:LE:=simplify(subs(sols,RLEs[op(1,RR[ind])])):FE:=subs(z0= x*y,LE):DE:=subs('g.i=(D@@i)(gO)'$i=l..s,g0=g,FE);
DE:=-
3(D(3))(g)(xy)x-(D('))(g)(xy)xy2-3(D(3))(g)(xy)y
y+
y2-2y+l A
DEl:=simplify(DE*(1-y) 2); DEl:=(D(4))(g)(xy)x2y + 3(D(3))(g)(xy)x-(D(*))(g)(xy)xy2-3(D(3))(g)(xy)y DE2:=subs(y=w/x,(D@@3)(g)=g3,(D@@4)(g)=D(g3) f DEI);
DE2:=D(g3)(w)xw + 3g3(w)x-
D(g3)(w)w2 X
-3
g3(w)w X
DE3:=simplify(DE2*x/(x A 2-w)); DE3 := D(g3)(w) w + 3 g3(w)
Maple worksheet 2
We can illustrate how Pales's method works by deriving differential equation (6) from the functional equation (3) by his method. Calculations are
252
Chapter VII. Applications
given on Maple worksheet 2. In these Maple calculations for technical reasons g, gf, etc. are replaced by go, #1, etc., respectively, and xy, (1 — x)(l — y), x(l — y), y(l — x) are replaced by z0, z\, z2, 23, respectively. Let us observe that the calculations are algorithmic and can be converted to a general program (see A. Hazy [65]). >
print(Hs);
[1,0,0,0,0,-1,0,0,0,0,-1,0,0,0,0,1,0,0,0,0] [0, 3 ; , 0 , 0 , 0 , 0 , - 1 + 3 ; , 0 , 0 , 0,0,.y, 0 , 0 , 0 , 0 , - l + y , [O,JC,0,0,0,0,*,0,0,0,0,-1+*,
0,0, 0,0,-1
0,0,0]
+*,0,0,0]
[0,0,/,0,0,0,0,-(l->;)2,0,0,0,0,-3;2,0,0,0,0,(-l+};)2,0,0] [ 0 , 1 , * 3;, 0 , 0 , 0 , 1 , x ( l - ; y ) , 0 , 0 , 0 , 1 , y ( l - x ) , 0 , 0 , 0 , 1 , ( - 1 + * ) ( - l + 3 > ) , 0 , 0] [ 0 , 0 , x , 0 , 0 , 0 , 0 , -x [0 , 0 , 0 , /
, 0 , 0 , 0 , 0 , - ( 1 -x)2
, 0 , 0 , 0 , 0 , ( - 1 + xf
3
, 0 , 0 , 0 , 0 , - ( 1 - >0 , 0 , 0 , 0 , 0 , /
, 0 , 0]
, 0 , 0 , 0 , 0 , ( - 1 +yf
, 0]
[0,0,2JC,X2>;,0,0,0,-2JC,-JC2(1-3;),0,0,0,2-2X,(1-X)2>',0,0,0,-
[ 0 , 0 , 0 , x3, 0 , 0 , 0 , 0 , x3, 0 , 0 , 0 , 0 , - ( 1 - x)3, [0 , 0 , 0 , 0 , /
0 , 0 , 0 , 0 , ( - 1 + x)3,
0]
4
, 0 , 0 , 0 , 0 , - ( 1 - 3O , 0 , 0 , 0 , 0 , - / , 0 , 0 , 0 , 0 , ( - 1 + y)*]
[ 0 , 0 , 0 , 3 / ,*/
, 0 , 0 , 0 , 3 ( 1 - 3O2 , JC ( 1 - y)3,0,0,
[ 0 , 0 , 2 , 4 xy , x2 y2 , 0 , 0 , -2 , - 4 x ( 1 - y),
[ 0 , 0 , 0 , 3 x2 , x3 y , 0 , 0 , 0 , 3 x2 , x3 ( 1 - y),
0,3 /
,(\-x)y
3
,0,0,0,
-x2 ( 1 - y)2 , 0 , 0 , -2 , - 4 j ( 1 - * ) ,
0 , 0 , 0 , 3 (1 - * ) 2 , (1 - * ) 3 y , 0 , 0 , 0 ,
[ 0 , 0 , 0 , 0 , x 4 , 0 , 0 , 0 , 0 , -x4 , 0 , 0 , 0 , 0 , - ( 1 - JC) 4 , 0 , 0 , 0 , 0 , ( - 1 + x)4]
Maple worksheet 3
After some trial and error we conclude that the 15 equations obtained psk
by applying ? k, k = 0,1,2,3,4, % + j = k to (3) will be sufficient to eliminate the last three terms of (3). The resulting matrix H* (see 16.4 for notation) is given on Maple worksheet 3. The ranks of H* and H are 14 and 13, respectively. Trial and error again shows that the 15th equation gives no
§ 22. Simple applications
253
new information and can be omitted. Solving the first 13 equations for the variables gi (zj), i = 0, . . . , 4 , j = 1 , 2 , 3 and putting back the solution into the 14th equation we arrive at a differential equation for g(zo). Indeed, we obtain ( g ~ ~ ~ ~ ( x y ) x y + 3 g ' 1 ' ( x y ) ) ( x - y ) = 0O, < x , y < l . This implies (6). In the case of the more general equation
= f4(x)+
f5(y) whenever
O < x , y < 1,
the method used for the generalized fundamental equation of information can be applied to prove that if f i , 0 5 i 5 3 are measurable or have the Baire property, then f i , 0 5 i 5 5 are in Cm. All methods of solution applied above for the special case work here, too. The somewhat "brute force" approach to apply PBlesls method to this equation also works, but takes considerable computer time (% 1 hour); see Maple worksheet 4 for details. Another related equation is
where f takes positive values, x, y E Rn, 0 < x, y < 1, and all operations and relations are taken coordinatewise. Taking g = In f we arrive at an equation similar to (3) and substituting t = xy instead of x we can conclude that all solutions which are measurable or have the Baire property are Cm. This observation (see JBrai [85]) made it possible for C. Sundberg and C. Wagner [I901 to find f in a much simpler way than was known earlier. Their procedure is similar to the first method above to solve (3). See their paper for details and further references. 22.12. Remark on a paper of A c z 4 and Chung. In their paper [I81 J. Aczkl and J. K. Chung have proved among other results that, if the functions fi (i = 1 , 2 , . . . ,n ) are locally Lebesgue integrable and the functions p k and q k (k = 1, 2,. . . ,m ) are L-independent, moreover the functional equation
254
Chapter VII. Applications
>
with(linalg):FE0:=f0(x*y)+f1((l-x)*(l-y))+f2(x*(l-y))+f3(y*(1-x) )-f4(x)-f5(y);
>
s:=6;t:=5;DEs:=convert([(diff(FEO,[x$i=j+l..kfy$i=l..j])$j=0..k) $k=0..s],D):
f:=5 > FEs:=subs(/f.j=f.j.0'$j=0..t,''(D@@i)(f.j.0)=f.j.i'$i=l..s'$j=O. .t,DEs): > LEs:=subs(x*y=zO,(1-x)*(l-y)=zl,x*(l-y)=z2,y*(1-x)=z3,x=z4,y=z5, FEs) : > r:=nops(LEs):H:=matrix(r, (s+1)*t,subs(z4=x,z5=y,['''coeff(LEs[k] ,f.j.i(z.j))'$i=0..s'$j=l..t'$k=l..r])):Hs:=matrix(r,(s+1)*(t+1) ,subs(z4=x,z5=y,['''coeff(LEs[k],f.j.i(z.]))'$i=0..s/$j=0..t'$k= 1.-r])): > nops(LEs):(s+1)*t:(s+1)*(t+1):rank(H);rank(Hs); 26 27 > R:=[27=NULL]:RH:=convert(subsop(op(R),convert(H,listlist)),matri x):RHs:=convert(subsop(op(R), convert(Hs,listlist)),matrix): > rank(RH);rank(RHs); 26 27 > RR:=[2 6=NULL]:RRH:=convert(subsop(op(RR),convert(RH,listlist)),m atrix):RRHs:=convert(subsop(op(RR),convert(RHs,listlist)),matrix ) : > rank(RRH);rank(RRHs); 26 26 > RLEs:=subsop(op(R),LEs):RRLEs:=subsop(op(RR),RLEs):unkns:={''f.j .i(z.j) '$i = 0 . .s'$j=l. .5}:sols:=solve({op(RRLEs)},unkns) :simplify (subs(sols,RRLEs)); [0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0] > ind:=l:LE:=simplify(subs(sols,RLEs[op(1,RR[ind])])):FE:=subs(z0= x*y,z4=x,z5=y,LE):DE:=subs('f0.i=(D@@i)(fO)'$i=l..s,f0=g,FE): > DEl:=simplify(DE*(l-y) A 2): > DE2:=subs(y=w/x,(D@@4)(g)=g4,(D@@5)(g)=D(g4),(D@@6)(g)=D(D(g4)), DEI) : > DE3:^simplify(DE2*x/(x A 2-w)); DE3 := w2 (D(2))(g4)(w) + 8 w D(g4)(w)+ 12 g4(w)
Maple worksheet 4
is satisfied, where 0 7^ Xi 7^ Xj for i =fi j , then the functions fi, Pk-> and qu are
§ 22. Simple applications
255
in Cm. L-independence means, e. g., for the qk7s,that Cckqi(y) = O
for almost all
y€]C,D[
implies that cl = c2 = . . . = C, = 0. We prove that L-independence and local Lebesgue integrability may be replaced by linear independence and by Lebesgue measurability. This note was published in JArai [83]. Observe, that if the functions pk are not linearly independent, then one of them can be expressed as the linear combination of the others and a similar equation is satisfied but with m - 1 instead of m. Hence we may suppose that the functions pk are linearly independent. Similarly, we may suppose that the functions qk are also linearly independent. This shows that linear independence is a more natural condition than Lindependence. First we observe (as in [18]) that
for a suitably chosen sequence C < y l linear independence of qk . Similarly,
< yn < . < y, < D
because of the
where A < X I < x2 < . . < B. Hence the pk and qk are Lebesgue measurable too. Now, with the substitution t = x Xiy we have that
+
+
+
whenever C < y < D and A Xiy < t < B Xiy. Hence, using Theorem 8.3 we have that fi is continuous. So, by (2) and (3), the functions pk and qk are continuous too. Similarly, as in [18], choosing C * between C and D and integrating, we obtain that
256
Chapter VII. Applications
If we introduce new variables u = x side individually, then we have that
+ Xiy in each integral on the left-hand
where
The functions Q k are linearly independent for otherwise there would exist constants ck not all zero such that
that is,
which is impossible because the q,+ are linearly independent and continuous. Hence the pk are linear combinations of the continuously differentiable functions
and so are continuously differentiable. The same holds for the qk. By (4) and Theorem 11.3 the functions fi are continuously differentiable too. Using now (4) and Theorem 15.2, we have that the fi are twice continuously differentiable and, by (2) and ( 3 ) , so are the pk and qk. By repeating this argument we get the result that fi,pk, and qk are Cm. We note that another possibility would be to apply the general transfer principle 1.23; see § 24 where a harder case is treated. 22.13. Sum form functional equations. L&szl6 Losonczi discussed the so-called "sum form" functional equation in several papers; see for example [141],[142],and [143]. This equation has the form
§ 22. Simple applications k
257
I
where x i = , Xi = 1, Cj=l yj = 1 and xi, yj 2 0 (closed domain) or xi, yj > 0 (open domain). We note that solutions on the closed domain are also solutions on the open domain. This supports considering equation (1) only on the open domain. Another reason is that here we want to consider regularity questions and on the closed domain equation (1) has solutions which are measurable but discontinuous, or which are continuous but not differentiable, etc. For example, if filj(x) = g ~ ( x=) hl,j(z) = zC,where 0 x 1, c E $ and OC is understood to be zero, moreover gi,t = ht,j = 0 for t > 1, then (1) is satisfied. In his papers Losonczi proved that solving of equation (1) in the general k, 1 2 2 case can be reduced to solving of the special case k = 1 = 2. We will consider here only the regularity question of this special case. After simplifying notation we obtain equation
< <
whenever x, y E 10, I[. Similar considerations as in the previous point show that if f o , f l , f 2 ,f 3 are measurable or have the Baire property, then they are
C". Using this regularity result Losonczi in [I431 deduces a structure theorem for equation (1): He proves that if f i , j are measurable, then they are linear ' explicitly given combinations of finitely many functions t H t X ~ ( l n t )with bounds for the nonnegative integer exponents s. To obtain this theorem he applies certain linear differential operators to "kill" terms f l , f 2 ,f 3 on the left-hand side of (2). The structure theorem makes it possible for him to solve some simpler cases completely, for example case (2) with n = 1. His considerations are much beyond the scope of this note here. 22.14. Another equation from information theory. In our joint paper with Wolfgang Sander [I181 we study the regularity of the functional equation
+
for all x, y E 10, l[nfor which x y E 10, l[nand present the regular solution in a special case. All operations are understood coordinatewise. (See [I181
258
Chapter VII. Applications
for further references.) Seemingly our general regularity theorems cannot be applied for this functional equation because there are several different unknown functions with the same inner functions. But if the ai's are different we may obtain a new functional equation from (1) for which our general results can be applied; the case where the ai's are not necessary different may be reduced to the case of different ai's. In the characterization of symmetric divergences and distance measures equation (1) arises in the special case m = 2, f = h, gl = kl, and g2 = k2. To keep calculations simple we will treat here only this special case. We also suppose that n = 1. The idea for the most general case is the same. This special case has been studied by Sander [ I N ] , where all f , gl, 92 satisfying (1) are determined under the assumption that they are three times continuously differentiable. Here we show how it follows from our general regularity theorems that the differentiability condition may be replaced by measurability. Our method can be applied to several other functional equations.
Theorem. Let a # ,B be fixed real numbers, f , gl, g2 : ] 0 , 1 [ -+ functions, and suppose that the functional equation
issatisfied whenever0 < x < 1, 0 < u < 1, a n d 0 < x + u functions f , gl, g2 are Lebesgue measurable, then they are Cm.
R
< 1. I f t h e
Proof. The functional equation can be rewritten as
This form will be used to prove the regularity of f . We want to derive further equations which will be useful to prove the regularity of gl and g2, respectively.
§ 22. Simple applications
259
Let us subtract f ( x ) from both sides of equation (2). With the substitution u = t ( l - x ) we obtain that
+ (1
-
t ( 1 - x ) )Pg2
(-x ) 1
t(1- x )
-
f (4
is satisfied whenever 0 < x < 1 and 0 < t < 1. To eliminate the term containing g2 on the left-hand side, let us substitute y instead of x in (4), multiply by ( 1 - x)P and subtract ( 1 - y)P times ( 4 ) . The difference is
The multiplier ( 1 - x ) P ( l - y ) a - ( 1 - ~ ) 0 (-1 x)" of g l on the left-hand side is not zero if x # y. Dividing both sides with it we obtain
(5)
Chapter VII. Applications
260
whenever 0 (6)
< x, y, t < 1 and x # y.
Similarly we get that
whenever 0 < x , y , t < 1 and x # y. Now we may apply our general regularity theorems. From Theorem 8.3 using equations (5) and (6) we obtain that if gl, g2, and f are Lebesgue measurable, then gl and g2 are continuous, respectively; we only have to check that for each 0 < t < 1 there exists 0 < x, y < 1, x # y such that at least one of the partial derivatives d/dx, dldy of the inner functions in the unknown functions on the right-hand side is not zero at (t, x, y). These inner functions are (t, X , Y) x> ( t 7 1' Y) yl (t, X , Y) t(l - ~ ) (1 t , x >Y) t(l - Y), ( t , x , y ) I+ x / ( l - t ( l - x ) ) , and ( t , x , y ) I-+ y / ( l - t(1 - y ) ) , respectively. It is easy to check that any pair x,y, x # y will do. The same theorem and equation (3) implies that f is continuous. Now using Theorem 11.3 we similarly obtain that gl, g2, and f are continuously differentiable. Finally, using Theorem 15.2 we get that if gl , g2, and f are p > 0 times continuously differentiable, then by (3) the function f , and by (5), (6) the functions gl and g2, respectively, are p 1 times continuously differentiable. Hence by induction our theorem is proved.
*
*
*
*
+
22.15. The equation of the duplication of the cube. Motivated by some classical results of de Saint-Vincent related to the "duplication of the cube" C. Alsina and J. L. Garcia-Roig studied the functional equation
for all x, y E R where 0 < p < 1 and p = 1 - p and found its continuous solutions f : R + 10, cm[ in the case p = 113. C. Alsina proposed the problem to find the continuous f : R + 10, cm[ solutions of (1) for 0 < p < 1, p # 113
261
$ 2 2 . Simple applications
(see [ 2 7 ] ) . Gyula Maksa and the author solved this problem and the result was announced without detailed proof during the XXXIth ISFE in Debrecen 1993 ( [ 1 1 5 ] ) .The result below gives all nonzero measurable solutions for any fixed 0 < p < 1. It was published in the joint paper [I 161.
Theorem. Let 0 < p < 1 be fixed, p = 1 - p, I C R be an open interval of positive length and f : I + R \ (0) be measurable on a subset of I with positive Lebesgue measure. Then f satisfies the functional equation (1) i f and only i f O
(2)
here cl , c2,
CJ
and
f(x)=cl
i f x ~ I or
and
f ( x ) = ~ ~ i ef ~x ~~ ~I or
and
f ( x ) = cl ( x
+
are arbitrary constants, cl # 0,
CJ)
-CJ
ifx E I ;
6I .
Proof. The first step is to prove that every solution which is measurable on a set A with positive Lebesgue measure is infinitely many times differentiable. We rewrite the equation in the form
Let us choose a q for which 1 > q > 1 - l / ( l / p + l / p ) , i. e., 0 < ( 1 - q ) ( l / p + lip) < 1. Using Lebesgue's density theorem we may find a point c E I and an r > 0 such that for C = [c - r , c r ] we have C c I and X(A n C ) > q X ( C ) , where X denotes the Lebesgue measure on R Let gl,,(y) = p x py and g2,2(y) = px p y whenever x , y E I . We would like to apply Theorem 8.3 to prove that f is continuous on a neighborhood of c. The only non-trivial condition to check is that the Lebesgue measure of the set g<: (A) n (A) is positive. The mappings g<: and g<: are central enlargements with center c and factors l / p and lip, respectively. Hence C \ ( A ) is contained in g;: ( C \ A) for i = 1 , 2 and has Lebesgue measure smaller than X(C)( 1 - q ) / p and X(C)(l - q ) / p , respectively. This shows that the Lebesgue measure of Cn n g < : ( ~ )is at least X(C)( 1 - ( 1 - q ) / p - ( 1 - q ) / $ > 0 . Let c E I and as above for an r > 0 let C denote the closed interval [ c - r , c + r ] C I . Let 1 < Q 5 min{(l - p / 2 ) / p , ( 1 - ~ / 2 ) / 1 7 ) . If ly - cl 5 r / 2 and ( x - cl < Q r then lpx + p y - c / < p Q r r p / 2 = r ( p Q + p / 2 ) 5 r and
+
+
+
g;i(~)
+
Chapter VII. Applications
262
+
similarly 1% p y - cl < r . Hence using equation (1) we have that if f is continuous on C , then f is continuous on ] c - Qr, c Qr[ f l I . Taking an appropriate increasing sequence of intervals we obtain that f is continuous on I. Now we prove that f is a locally Lipschitz function on I . Theorem 11.6 can be applied with the compact set C above, and we get that f is a locally Lipschitz function on ]c - Qr, c Qr[ n I. Taking an appropriate C if I is bounded or choosing an appropriate sequence of C's if I is not bounded, we get that f is a locally Lipschitz function on I. Now applying Theorem 1.26 to (3) implies that f is infinitely many times differentiable. The second step is to solve the functional equation (1). Differentiating with respect to x we have
+
+
for all x, y E I. Differentiating this equation with respect to y we obtain
for all x, y E I . With the substitution y
=x
this equation implies that
for all x E I. Define the function g on I by
Then it follows from (4) that
If g(xo) = 0 for some xo E I, then, by (6), gl(xo) = 0, too. On the other hand the zero function is a solution of (6) on I, thus because of uniqueness we get that g is identically zero on I . Hence (5) implies (2). If g(x) # 0 for all x E I,
5 22. then define h on I by h for all x E I, therefore
=
263
Simple applications
llg. It follows from (6) that hl(x) = (3p - l)/p
for some c E R and for all x E I. If p = 113, then by the definition of h we have that c # 0 and (5) implies that f '(x)- (llc)f (x) = 0 for all x E I. Thus we have (2). If p # 113, then -pc/(3p - 1) $ I and again by the definition of h equation (5) implies that
Therefore for some d E R \ (0) we have 3p - 1 j(x) = d ( p x
+
P/(~P-1)
C)
for all x E 1.
Let a! denote pl(3p - 1). Then the function f above is a solution of (1) if and only if the equation
is satisfied for all x ,y E I. With a linear substitution we get that this is true if and only if the equation
is satisfied for all x ,y E I
+ c. If this equation is satisfied, then the function
is 0 in a neighborhood of 1. We will show that this is possible only if a = 1 and hence p = 112. To prove this we calculate the derivatives of F at 1. It is I) P, (p)(3pclear that all of these derivatives have the form F ( ~ ) (= for
Chapter VII. Applications
2 64
some polynomial P, of p. Using a computer algebra system for calculations we get that the value of the function and its first three derivatives is 0 , but P4 ( p ) =
-48p8
+ 124p7 - 90p6 - 12p5 + 40p4 - 16p3 + 2p2
Since we supposed p f 113, 0 < p < 1 , the only remaining possibilities are p = 112 and p = -118 m / 8 . This later value is a root of P5, but not of
+
+
P6( p ) = - 1 0 0 8 0 ~ ~ ~4 5 1 9 2 ~ "
-
+
1 0 9 0 4 4 ~ ~165872p9 ~ - 141426p8
+ 43526p7 + 26452p6 - 30532p5 + 12166p4
-
2298p3
+ 172p2,
because
Hence we have cr = 1 and p = 112. In this case the function f in ( 8 ) clearly satisfies ( I ) , hence we obtain ( 2 ) .
22.16. Functional equations with few variables: some examples. In her paper [I961 Halina ~ w i a t a killustrates her regularity results (see 5 1 8 ) with the following examples:
( 5 ) f ( x + t ,y ) + t 2 f ( x ,y + t ) + ( t - t 2 - I )
(6)
f (x
f ( x ,y )
= t(x2+2x+t+t3+2yt+y2),
+ t , y ) + t2f ( x ,y + t ) - ( 1 + t 2 ) f( x ,y ) = t ( 2 x + t + t3 + 2 y t 2 ) ,
§ 22. Simple applications
265
(8) f ( x + t , y ) + f ( x - t , y ) + f ( x , ~ + t ~ ) + f ( x , ~ - t ~ ) = 4 f ( x , ~ ) , where the unknown function is f : R2 4 R Differentiation with respect to t and putting t = 0 yields
for equations (1)-(6),
for equation (7), and
for equation (8). It is easy to verify that equations (9), ( l o ) , (11) are hypoelliptic and therefore we may apply Theorem 18.3 to obtain that all the continuous solutions of equations (1)-(8) are in Cm. To find the most general solutions of these equations it is enough to solve some differential equations which we obtain by differentiating our equations with respect to x, y, and t. Simple computations show that the most general continuous solutions of these equations are f (x, y) = c, f (z,y) = ay +b, f (x, y) = x2+c, f (x, y) = x2, f (x, y) = x2 y 2 , f (x, y) = x2 y2 C, f (x, y) = axy bx cy d, and f (x, y) = axy bx cy d, respectively, where a, b, c, d E R are arbitrary constants. It is easy to see that the theory developed in $5 19-21 also can be applied in all these cases. Indeed, these equations belong to the class of generalized mean value type equations considered in Example 19.8. For example, by equation ( I ) , putting go(t) = (0,0), gl(t) = (t,O), g2(t) = (0,t2) for t 6 R we only have to check that for each proper linear subspace of R2 there exists a 0 5 j 5 2 and a t E R such that for all i # j, 0 5 i 2 the vectors g{(t)- gi (t) are all out of the given subspace. Putting first j = 0 we see that this is true if the subspace is different from S1 = {(u, 0) : u E R) and from S2 = {(O,u) : u E R). Now putting j = 1 we have g{(t) - gi(t) $i! S2 for each t E R and putting j = 2 we have g$(t)- gi(t) $i! S1 if t # 0. (In the case of some of the other equations (2)-(8) we have to avoid such values of t for which some of the coefficients disappear, but this does not cause any problem.) Hence results of the theory developed in $$ 19-21 imply that all solutions of the equations (1)-(8) which are measurable or have the property of Baire are in Cm.
+
+ + +
+ +
+ + +
<
Chapter VII. Applications
266
22.17. Functional equations w i t h few variables: a counterexample. Of course, it is very easy to show examples for which the theory developed in $3 19-21 can be applied, but the theory of ~ w i a t a kfrom § 18 cannot. We will give an example, for which ~wiatak'stheory can be applied, but ours cannot. For the equation (1) 0 = 4 f (x, Y) f (x 2t, Y) + f (x, Y + 2t) + f (x - 2t, Y) + f (x, Y - 2t)
+
+
where f : R2 -+ R is the unknown function, Corollary 18.4 of ~wiatak's theory can be applied, and implies that all continuous solutions are in Cm. To solve this equation, let us differentiate with respect to x and y, and put t
=O
to obtain
af ax
-=O
and
af au
- = 0, v
respectively, and to conclude that f
-
0
is the only solution. This equation also belongs to the class of generalized mean value type equations considered in Example 19.8. Putting g ~ ( t= ) (O,O), gl(t) = (2t, 0), g2(t) = (0,2t), g3(t) = (-2t,O), g4(t) = (0, -2t), g d t ) = (id),g d t ) = (4, t), g7(t) = (-t, -t), g8(t) = (t, -t) for t E R we have to check that for each proper linear subspace of R2 there exists a O 5 j 5 8 and a t E R such that for all i # j, O 5 i 5 8 the vectors gi(t) - gi(t) are all out of the given subspace. Putting first j = O we see that this is true if the subspace is different from S1 = {(u,o) : u E R ) , S 2 = ( ( 0 , ~ :) u E R ) , S+ = { ( u , u ) : u E R), and S- = {(u, -u) : u E R). NOWputting j = 1 we have gi(t) - g:(t) @ S2 for each t E R and putting j = 2 we have gi(t) - g$(t) $ S1 for each t E $ but putting any j we always have an i+ # j for which gi+ (t) - gi (t) E S+ and an i- # j for which gl- (t) - g$(t) E S-. Roughly speaking, the directions in S+ and S- are "critical directions", and make it impossible to apply the theory developed in § § 19-21. To understand the reason for this phenomenon, let us consider equation (2) 0 = 4 f (2,Y) + f (x + 24 Y) + f (x, Y + 2t) + f (x - 2t, Y ) + f (2,Y - 2t)
where f : IR2 -+ R is the unknown function. This equation only differs from equation (1)in some signs. Our general nonlinear theory is not capable of taking care of such minor differences: if it were possible to apply it to equation ( I ) , then if would be possible to apply it to equation (2). But
§ 22. Simple applications
267
+ +
any function f (x, y) = cp(x y) $(x - y) is a solution of equation ( 2 ) , so no regularity phenomenon holds for equation (2). Observe the role of the "critical directions". 22.18. A composite functional equation arising from utility that is both separable and additive. The problem of determining all utility measures over binary gambles that are both separable and additive leads to the functional equation
In our joint paper with J&nos Aczd and Roman Ger [21] we solved this functional equation for both f and Q without any supposition (except the boundedness of Q and f from below by zero, implicit in the statement of ranges) :
Theorem. Among the functions f : [0,k[ + [0, m[, k E 10, m ] , and Q : [O, 11 + [0, m [ , the functional equation (1) has the trivial solutions (2)
f =0,
Q arbitrary
(51i f k < m ) ,
and
For all other solutions there exist constants a
Ifk
> 0, P > 0 such
that
> 1 and f ( 1 ) = 1 then a = 1. Conversely, all pairs of functions of the form (2), (3), and (4) satisfy (1).
The first attempt at a proof used differentiability of the functions f and Q. Next we succeeded to reduce this assumption to continuity, while saving essential steps of the previous argument. Subsequently we derived continuity from strict monotonicity and finally we eliminated even that assumption. In the paper we looked briefly at weakening the equation itself to be satisfied only almost everywhere. Here we give the details, as an example.
268
Chapter VII. Applications
Theorem. Suppose that the functions f : [0, k[ + [O, m[, k E 10, m], and Q : [0,1] -+ [O, 11 if k < m , Q : [0,1] -+[0, m [ if k = m satisfy functional equation (1) for almost all pairs (x, Y) E [0, k[ x [ O , l ] with respect to the Lebesgue (outer) measure A2. Then there exist functions f" : [0, k[ 4 [0, m[, k E 10, m ] , and Q : [O, 11 4 [O, 11 if k < m , Q : [O, 11 -+ [0, m [ if k = m satisfying functional equation (1) for all (x, y) E [0,k [ x [O,1] such that f = f" and Q = Q almost everywhere with respect to the Lebesgue (outer) measure A. Proof. We will only use that (5) There exists an X c 10, k[ such that 10, k[ \ X has measure zero and for each x E X (1) is satisfied for this x E X and for all y E Yx, where Y, c ] O , l [ and ]0,1[\ Yx has measure zero.
(6) There exists a Y C ] O , 1 [ such that ] O , 1 [ \ Y has measure zero and for each y E Y (1) is satisfied for this y E Y and for all x E X,, where X, C 10, k[ and 10, k[ \ X, has measure zero. These conditions follow from Fubini's theorem because the functional equation is satisfied almost everywhere. The key step is that there exists a monotone function f" such that f = f" almost everywhere. We define f(0) = 0 and let f"(x) be the infimum (maybe m ) of all such r 0 such that f is less than or equal to r almost everywhere on 10, x[. Clearly, f" is nonnegative and increasing on [0, k[. First we will prove that it is finite everywhere. From (5) and (1) we get that for each x E X and for each y E Y, the inequality f (xy) 5 f (x) is satisfied. This means that for each x E X we have f (2) 2 f (x) for almost all 0 < z < x, hence f"(x) 5 f (x) for x E X . Since X is dense in 10, k[, we obtain that f" is finite on [0, k[. Moreover, f" 5 f almost everywhere. It remains only to prove that f 5 f" is also satisfied almost everywhere. If this would not be the case, then there would exist an E > 0 and a (possibly nonmeasurable) set A c 10, k[ such that f (x) > f"(x) E for all x E A. Let B be a A-hull of A. Let us choose a density point x of B such that f" is continuous at x. Then for a small enough S > 0 we have f"(z) f ( x ) - ~ / and 2 f (z) > f"(x) ~ / 2 for x - S < z < x, z E A. But this set of z's has positive outer measure if 6 is small enough, contradicting the definition of f"(x). This proves that f = f" almost everywhere. The first case is when Q(y) = 0 for some y E Y. In this case we obtain that f ( x ) = f ( x y ) + f ( 0 ) for all x E X , .
>
+
>
+
5 22.
Simple applications
269
But f = f" almost everywhere, hence
for this fixed y and almost all x . Taking limit x 4 0 we obtain f ( 0 ) = 0. But this with the monotonicity of f" means that f" is constant on 10, k[. Now if Q ( y ) > 0 for some y E Y, then
for this y and almost all x , which is a contradiction. So we proved that Q = 0 almost everywhere. There remain the cases where Q ( y ) > 0 for all y E Y. Then for any fixed y E Y we obtain that
for almost all x . Letting x
4 0 we obtain lim,40 f " ( z )= 0.
The second case is when Q ( y ) x > 0. In the former case we have
> 1 for some y E Y or f ( z ) = 0 for some
for almost all x , i. e., f " ( x y )= 0 , so we only have to treat the latter case. Let xl denote the largest number for which f ( x ) = 0 for x E [0,X I [ .We will show that xl < k is impossible. Indeed, taking any fixed y E Y, we have
for almost all x . Taking an x for which xl < x < x l / y we have f " ( z y ) = 0 and hence J ( X Q ( ~ ) ) = f ( x ) > 0. But this implies x Q ( y ) 2 xl and hence Q ( y ) x l / x . Letting x 4 xl we obtain Q ( y ) 1 for an arbitrary y E Y. But this implies that f"(xy) = 0 for any fixed y E Y for almost all x , i. e., f"=0. So there remains the third case 0 < Q ( y ) < 1 for y E Y, limXSof ( x ) = 0 but f ( z ) > 0 for all 0 < x < k . In this case f" is strictly monotone because for any fixed y E Y for almost all x we have
>
>
Chapter VII. Applications
270
Moreover, f" is continuous. Indeed, for any 0 < x < k, taking a sequence y, E Y, y, J, 0 and a sequence x, 4x such that x,Q(yn) < x and
the left-hand side converges to the right limit of f". The first term on the right-hand side converges to 0, hence the other term has to converge also to the right limit of at x. But J(X,Q(~,)) is always less than or equal to the left limit of f" at x. Hence the two limits are equal. Now fixing any y E Y and taking a sequence x, tending to x such that
fx
for any y E Y and for all x. Finally, fixing an arbitrary 0 < x
< k , we obtain
for all y E Y. This means that Q is equal to a continuous function on the dense set Y. Let us define Q as the unique continuous extension of Q to [O, 11. Then
J(x) = f^(x~) +
S(XQ(Y))
for all x and for all y E Y. Taking limits and using continuity of Q and f" we obtain that this equation is satisfied for all x and y. Sets of Lebesgue measure 0 can be replaced in this proof for example by sets of first Baire category. 22.19. Additive function with graph invariant under some rotations. At the Twenty-fifth International Symposium on Functional Equations C. Alsina and J.-L. Garcia-Roig [28] posed the following problem: given a rotation of the plane around the origin, does there exist a function F : R + R such that the graph of F is invariant under the given rotation? During the meeting I gave an affirmative answer [87]. In an informal discussion at the same meeting, B. Schweitzer raised the question whether there are also additive functions with this property and how dense the graph of such a function can be. These questions were answered at the next meeting by JBrai [88] and the first question (independently) by J. Ratz [170]. Since these examples show that solutions of some "nice" functional equations can behave very "badly", we give here the details.
5 22.
Simple applications
271
Theorem. Let there be given a set of rotations of the plane around the origin, which has cardinality less than continuum. Then there exists an additive function F : R + 3% whose graph is mapped onto itself by all of the rotations. The function F can be chosen so that its graph intersects with every plane set having positive Lebesgue measure. The proof depends on the axiom of choice, but does not depend on the continuum hypothesis.
Proof. I. It will be convenient to identify F with its graph, R x E with C, and the set of rotations with a subset R of the complex unit circle T. So every rotation is a multiplication by an element r of R, and we have to construct an F C @ additive function defined on R for which r F = F (r E R). We may suppose that R is a multiplicative group, because otherwise we may replace R with the subgroup of T generated by R , and this does not change the inequality card(R) < card@) = c. 11. Let us define
the subalgebra of C over
Q generated by R.
Clearly,
If A c @, let us define
and
A is a linear subspace of @ over Q and PA c A. If card(A) < c, then card(A) < c and card(2) < c. For any set A c cC let dmn(A) = {R(z) : z E A) be the domain of the relation A.
Chapter VII. Applications
272
5
111. If z and w are complex numbers, we will use the notation z w to express that z and w are on the same vertical line. To be more exact, x w means that x # w but %(z) = %(w). Clearly, if u E @. then z w implies z u w u and if 0 # u E R, then z w implies zu wu. With this notation a subset A of C is a real function if and only if there are no z, w E A such that z w. In the construction of F we will start with (0) and add to this set in each step a new point z. If A is the set of points already defined having the property that A is a function, and z E R is given, we have to choose a y E R for which (A U {x iy))- remains a function. There are two possibilities. If x E dmn(A), then let us choose the unique y for which z = x iy E A. With this choice, (A U {z})- = A. Indeed, for an arbitrary element of (A u {z))" we get
+ 5 +
5
5
1:
+
5
+
where p, q, q* E P, qq* # 0, zo, zt E A. In the case x 4 d m n ( ~ there ) are "dangerous" y's: Let
D (A, x) =
iy
:
(A U {x
+ iy})- is not a function
1.
But if card(A) < c, then card(D(A,x)) < c. To prove this, suppose, that y E D(A, x) and z = x iy. As (A U {z))- is not a function, there exist p', ql,p", q" E P, q'q" # 0, 26, z; E A such that
+
p'z
+ z6 4'
5
+ z;
p"x ,I
This is equivalent to
which we can reformulate in the form
The number p/q cannot be real. Indeed, if p/q = 0, then 0 A was not a function. If 0 # p/q E R, then there were
zo/q E
A, hence
273
§ 22. Simple applications
x = %(x) E dmn(A). Let us replace y with y
+ Ay, Ay # 0. Then
This proves that for given p',p'l, q', q" E P and zb, x i E most one y satisfying (1). Hence
2 there
exists at
c a r d ( ~ ( Ax)) , 5 c a r d ( ~ ) ~ c a r d (<~ c.) ' To make F intersect with every plane set having positive plane measure it is enough to make F intersect with every compact plane set having positive plane measure. By Fubini's Theorem, every such set C has the property
By Steinhaus' Theorem, if X(E) > 0, then E - E contains a neighborhood of the origin. Hence
This implies that there exist c many x E R for which card(Cx) = c.
IV. Let R denote the first ordinal number whose cardinality is c. There exists a transfinite sequence
of all real numbers, and there exists a transfinite sequence
of all compact plane sets having positive plane measure. To prove the existence of the second sequence we note that the topology of C has countable base, hence there exist at most c open subsets of the plane, and there exist at most c closed subsets of the plane.
Chapter VII. Applications
2 74
We will define by transfinite induction real numbers yp, x i , y&, ,B < R such that x& iy& E Cp and with the notation
+
F p = {x, +iy,,xL +iyL : a I p),
d p is a function. First, let yo = 0, Bo = {0), so Bo = Bo = Bo = (0). Let us choose an xb $ Bo for which card{y : xb iy E Co) = c
+
If ,f?< R and y,, x', ,yk, a < ,B are already defined so that F, is a function for each a < p, let Ap = U,
5
+
+
4
and an y& from the set
Then F p = B p U {xi
+ iy&>is a function.
V. As the last step, let F = U,
+
+
+
Similarly, F is invariant under all rotations from R, because if x and r E R, then r ( x iy) E F = F, and
+
x
+ iy = r -r1( x + iy) E r F- = r F .
+ iy E F
5 23.
Characterization of the Dirichlet distribution
275
23. CHARACTERIZATION OF THE
DIRICHLET DISTRIBUTION 23.1. Introduction. In their paper [57]Dan Geiger and David Heckerman provide a new characterization of the Dirichlet distribution. Their results show that in certain statistical problems we need not to suppose that the random variables have Dirichlet distribution, this is automatically implied by the more natural independence conditions. Their method consists of proving that probability considerations result in the functional equation
This equation should be satisfied whenever
and
( , . . . , ) A 1
forl<j
(here Am denotes the m-dimensional unit simplex
Am = { ( a in Rm ). Furthermore, xi, 1 5 i
and by
while wi,jis defined by
5 k is defined by
276
Chapter VII. Applications
where zk,j is defined by
With the notation
we have also
Note that ( X I , .. . , xn-1) E and (wi,l,.. . , wi,,-l) E An-1 for 1 2 i I k, moreover f,, 0 5 i 5 k and gj, 0 5 j 5 n are real-valued functions defined on An-, and Ak-l, respectively. As Geiger and Heckerman remark, the functional equation (1) may be considered in a symmetrical way where the x's and w's are the free variables, and (will,.. . ,wi,,-l) E i. e., (1) is satisfied for all ( x l , . . . , xk-1) E An-,, 1 5 j 5 n. In this case yj, 1 5 j 5 n is defined by
and by
moreover, zi,j is defined by
where wi,, is defined by
$ 23. Characterization of the Dirichlet distribution
277
With the notation
we have also
In short, there is a complete symmetry with respect to f -H g, x -H y, z H w, k-~n,it,j. Geiger and Heckerman solve functional equation (1) by reducing it to a differential equation. To do this they have to suppose that the unknown functions in (1) are smooth and positive everywhere but we only know that they are density functions of some random variables. This leads to the following problem of Dan Geiger: 23.2. Problem. Suppose that the unknown functions in the functional equation 23.1.(1) are density functions (i. e., nonnegative and Lebesgue integrable with integral 1). Does it follow that they are positive everywhere and C" ? Our results make it possible to give an affirmative answer to this question. (This result has been published in JBrai [104].) We will use our general theorems to prove that the positive solutions of 23.1.(1) are C". We need the following lemma. 23.3. Lemma. I f y, z E Rm, z # 0, y # 0, then the real eigenvalues and eigenspaces of the linear transformation u H ( u ,y)z are
Proof. Suppose that au = ( u ,y ) z . There are two possibilities. If a = 0 then ( u ,y) = 0. If a # 0, then u = cz for some c E R and hence acz = (cz,y ) z which implies a = ( z ,y). 23.4. Theorem. Suppose that the functions f i , 0 5 i 5 k and g j , 0 5 j 5 n are Lebesgue measurable and positive everywhere and 23.1.(1) is
Chapter VII. Applications
2 78
satisfied, where the inner functions are defined by 23.1.(2)-(5). 0 i k a n d g j , 0 5 j 5 n are in Cm.
< <
Then f i ,
Proof. Take the logarithm of both sides of the functional equation 23.1.(1). To prove that the functions In fi, 0 5 i 5 k and lngj, 0 j 5 n are Cm, we will use Theorems 8.3, 11.3, 15.2. First, we will show that Theorem 8.3 implies that ln f o is continuous. The only non-trivial condition to check is that for arbitrary ( y l , . . . , yn-1) there are z's, such that the rank of the derivative of all the functions substituted into g j , 0 5 j n and fi, 1 5 i < k as a function of the z's is maximal, i. e., k - 1 for the inner functions of the g's and n - 1 for the inner functions of the f 's. We will show that for any choice of the 2's this is the case. For the inner functions of the gj, 1 j 5 n this is trivial. For the inner function of go, for any 1 j 5 n the determinant of the matrix
<
<
<
<
is equal to g:-', and hence nonzero. For the inner function of the f i , where 1 5 i
< k, we have
where S is Kronecker's delta. Let Zi denote the linear transformation u (u, y)zi of E4F-l into itself, where
H
With this notation.
where 1 is the identical transformation. By the lemma above, the only real eigenvalues of Zi are 0 and (y,zi). Since xi # 0 and (y, zi) = zi,lyl . zi,,-l yn-1 # xi, the determinant on the right-hand side is nonzero. For the inner function of the function f k we have
+ +
5 23.
Characterization of the Dirichlet distribution
2 79
Let 21,denote the linear transformation u H (u, y)zk of Rn-I into itself, J , Z ~ , ~ - IWith ) . this notation, where y = (yl, . . . ,yn-1) and zk = ( Z ~ ,... for any 15 i < k, n-1
det (Zk - x k l ) . As above, the right-hand side is nonzero because xk # 0 and ( y , z k ) = Zk,lYl ' ' ' Zk,n-lyn-1 # xk. As a second step, we will show that Theorem 8.3 implies that lngj is continuous for I 5 j 5 n. Here, as above, the only non-trivial condition to check is whether for an arbitrary sequence (zl,j,. . . ,zk-l,j) there exist yi, 1 5 i < k and z , , ~ ,1 5 s < k, 15 t 5 n , t # j such that the rank of the derivative of all the inner functions in the gt's, 0 5 t 5 n, t # j and in the fi's, 0 5 i < k as a function of yi, zSltis maximal, i. e., k - 1 for the inner functions of the g's and n - 1 for the inner functions of the f's. We will show that for any choice of the variables yi, 1 5 i < k and z , , ~ ,1 5 s < k, 1 5 t 5 n, t # j this is the case. For the inner functions of the functions fo and gt, 1 5 t 5 n , t # j this is trivial. For the inner function of go, and for 1 5 t _< n, t # j the determinant of the matrix
+ +
and hence nonzero. is equal to For the inner function of f,, where 1 5 i
< k, we have
denote the linear transformation u H (u,y)z: of Rn-I into itself, Let Z,! where y = (yl,. . . ,y,-1) and z; = (zill - zi,,, . . . , zi,,-l - zi,,). Using this notation, det
((2) ) fly:; n-l
j,t=1
Since z{ # 0 and (y, z:) = xi side is nonzero.
- xi,,
=
zijj det ( x i l - 2;).
x:n-2
# xi, the determinant on the right-hand
Chapter VII. Applications
280
For the inner function of the function f k we have
Let Zi denote the linear transformation u H (u,y)zk of Rn-I into itself, where y = (yl,. . . ,yn-1) and Z; = ( Z ~ J- ~ k , ~. .,. , ~ k , ~ --zk+). l With this notation,
The right-hand side is nonzero because xk
# 0 and (y, zk) = xk - zk,, # xk.
The third step is to use the symmetry of the functional equation. Exchanging the rble of the y's and z's with that of the x's and W'S, we obtain that the functions Info and lngj, 1 j 5 n are also continuous. This implies that fi, 0 i k and gj, 0 j 5 n are continuous.
< <
<
<
Now we can apply Theorem 11.3. In the same way as above we obtain that all functions in the functional equation are in C1. Now, applying Theorem 15.2 in the same way as above, we obtain that all functions in the functional equation are in C2. Repeating this process shows that all these functions are in CW . To prove that all density function solutions are positive everywhere we will use the new generalization of Steinhaus' theorem from § 3. We need a lemma.
§ 23. Characterization of the Dirichlet distribution
281
23.5. Lemma. Let L : Rrl x . . . xRTn -+ IRY be a linear transformation having null space N . Let pi denote the projection
defined by
Let r = Cy=lri. The conditions dim(N) = r - m and p i ( N ) = RTi for 1 i 5 n are satisfied i f and only i f for all 1 5 i 5 n the transformation
<
has rank m. Proof. The condition dim(N) = r - m is equivalent to rank(L) = m. Suppose that rank(L) = m and pi(N) = Rrifor 1 5 i 5 n. Let y E Rm. We SO choose X I , . . . , xi-1, xi, xi+1, . . . , X, that
Since X i E pi ( N ) , there exist x/,, . . . ,xi-1,
. . ,x:,)
L ( x i , . . . , x i + xi, With the notation x(i' = x j
- xi
for j
. . . ,xk
such that
= 0.
# i we have
i. e., y is contained in the range of Li.Since y was arbitrary, rank(Li) = m. Now suppose that rank(Li) = m for 1 5 i m. This implies rank(L) = m because m 2 rank(L) 1 rank(Li). Let xi E Rri and
<
Now choose 21, . . . ,xi-1, xi+l, . . .
, X,
Then L(xl, . . . ,xi- 1, xi, xi+1, . . . ,x,) was arbitrary, pi ( N ) = Rr2.
such that
= 0,
i. e., pi ( N ) contains xi. Since xi
282
Chapter VII. Applications
23.6. Theorem. Let fi, 0 5 i 5 n and gj, 0 5 j 5 k be density functions satisfying 23.1. (11, where the inner functions are defined by 23.1. (2)-(5). Then f,, 0 5 i 5 k and gj, 0 5 j 5 n are positive everywhere and are in Cm.
Proof. Because of the previous theorem we only have to prove that the functions are everywhere positive. This will be done in five steps. We will use the notation { fo = 0) = E An-1 : f o(y) = 0) and the analogous notation {fi = o>, {fi # 01, {gj = 01, {9j # 0).
iy
I. First we will prove that the sets {fi # 0), 0 5 i 5 k and {gj # 01, 0 5 j 5 n contain inner points. Since fi, gj are nonnegative and have Lebesgue integral 1, they are positive on a Lebesgue measurable set with positive measure. We will use Remark 3.13. To prove that {go # 0) contains an inner point, we have to check that the (total) derivative of ( x l , . . . ,xlc-1) with respect to the variables y's and z's has rank k - 1 and has a null space in "general position" as described in Remark 3.13. By the previous lemma these conditions follow from
checked in the proof of the previous theorem. Similarly, to prove that {fi # O), 1 5 i < k contains an inner point we may use that
and det
((2) ) n-l
j,t=l
Finally, to prove that {fk # 0) contains an inner point we may use that
and det
(
()+
0.
j,t=l
Using the symmetry of the equation it follows that the sets {fo {gj # 0), 1 5 j 5 n contain inner points, too.
# 0) and
§ 23. Characterization of the Dirichlet distribution
283
11. Suppose that one of the vectors
is fixed, others vary in an open set. Then the vector valued functions
defined by equations 23.1.(2)-(5), map this open set onto an open set. Similarly, if one of the vectors
is fixed, others vary on an open set, then the vector-valued functions
defined by equations 23.1. (8)-(11) map this open set onto an open set. To prove this, first suppose that (yl , . . . ,yn-1) E A,-1 is fixed, and the z's vary on an open set. We prove that ( x l , . . . , xkWl) maps this open set onto an open set. For an arbitrary point of the range, let us choose a corresponding sequence of vectors (zl,j,. . . ,xk-l,j), 1 5 j 5 n mapped into this point. Fixing all the z's except z,,l, 1 5 s < k, and using that for any 1 5 j 5 n (by the calculations in the proof of the previous theorem) the determinant of the matrix
is nonzero, the inverse mapping theorem implies that the point chosen in the range is an inner point. Other cases follow similarly, using the calculations in the proof of the g, x +, y, z +, w, previous theorem, and the symmetry with respect to f k e n , i e j .
*
Chapter VII. Applications
2 84
<
<
111. We prove that the sets {gj # O), 0 j 5 n and {fi # 0), 0 5 i k are open. Suppose that fo(yl,. . . ,y,-1) # 0. For each 1 5 j 5 n we choose an inner point (zl,j,. . . , zk-l,j) of {gj # 0). Let xi, 1 i < k and wi,j, 1 i k, I 5 j < n denote the values calculated from the y's and x's by equations 23.1.(2)-(5). Fixing the y's and varying the x's in a small neighborhood, the left-hand side of the equation is nonzero, hence all terms on the right side, too. By step I1 this implies that (xl , . . . , ~ k - ~is) an inner point of {go # 0) and (will,.. . ,zui,,-1) is an inner point of {fi # 0) ( I i k). Fixing one of these points and varying the others, in the same way we obtain that (yl, . . . ,yn-1) is an inner point of { f o # 0). All other cases can be treated similarly.
<
< <
< <
IV. Now we prove that the sets {gj
0
< i 5 k are also open.
=
0), 0 5 j 5 n and { fi = O),
<
Suppose that f o ( y l , .. . ,vn-1) = 0. For each 1 5 j n let us choose a point ( x ~ ,. .~. ,, zk-l,j) of {gj # 0). Let xi, I i < k and wi,j, 1 5 i k, 1 5 j < n denote the values calculated from the y's and x's by equations 23.1.(2)-(5). Since the product on the left-hand side is zero, at least one of the terms on the right-hand side is zero, too. Suppose that g o ( x l , .. . , xk-1) = 0. We fix ( X I , . . . ,xk- 1). Varying the w's in a small neighborhood, the z's remain close because of continuity, hence the product gj remains nonzero. Since the right-hand side is zero, f o has to be zero, but the x's run in a neighborhood of the originally chosen point, hence this point is an inner point of {fo = 0). All other cases can be treated similarly.
<
<
njn,,
V. Now the statement of the theorem follows: The set {gj # 0) is a nonvoid open subset of Ak-1; the set {gj = 0) is an open subset of Ak-1. Since Ak-1 is connected, {gj = 0) has to be void. Similarly, { f i = 0) = 0. 23.7. Remark. Usually, if we consider characterization problems in probability theory we obtain a functional equation for the unknown density functions satisfied only almost everywhere. Therefore we can consider equation (1) in 23.1 almost everywhere on A,-l x Theorem 23.4 can be modified stating that if f,, gi are Lebesgue measurable and positive almost everywhere, then they are almost equal to C" functions. The proof of this version can be done by simply referring to Theorem 10.3 (and 10.1) instead of 8.3. Theorem 23.6 and its proof also can be modified in this spirit: for sets such as { f o # 0) replace statements like "equals A,-1", "is open", "contains an inner point", etc., by statements "is almost equal to An- , "is almost equal to an open set", "almost contains a neighborhood of a point", etc., respectively.
§ 24. Characterization of Weierstrass's sigma function
285
24. CHARACTERIZATION OF WEIERSTRASS'S
SIGMA FUNCTION In this section it will be proved that measurable and not almost everywhere zero functions f l , f 2 : IWn + C satisfying the functional equation
are infinitely often differentiable. In the case k 5 2 the equation is solved. This is essentially the material of our joint paper [I181 with Wolfgang Sander. Although most of the results of this section have been obtained earlier by others, the method here is new and may apply in some more general cases. 24.1. Introduction. At the Thirtieth International Symposium on Functional Equations (1992, Oberwolfach) W. Sander raised the following question [I851:
Problem. Let f , gl, gz, h l , h2 : Rn -+C. Suppose that f is not everywhere zero and that gl and g2 are linearly independent. If f , gl, g2, h l , h2 satisfy
prove or disprove that the measurability o f f implies the continuity o f f . This question is connected with the characterization of Weierstrass's sigma function by addition theorems. Assuming continuity a characterization of (1) was done by M. Bonk in his "Habilitationsschrift" and published in [36]. He was motivated by the survey paper "The state of the second part of Hilbert's fifth problem" [14] by J. Aczd. In his paper [14] J. Aczd gave a survey on results concerning functional equations treated by Abel in his work. The papers of Abel on functional equation are cited by Hilbert with the remark that it would be interesting to get Abel's results without differentiability assumptions. J. Aczkl gives details on results obtained about different equations treated by Abel. Concerning the equations
286
Chapter VII. Applications
he wrote that "Abel reduced this system to differential equations of up to fourth order, of course under supposition of differentiability (of fourth order), that is, since the functions are complex, of analyticity. Haruki 1631 has found, also by reduction to differential equations of at most fourth order, the general (in fourth order) differentiable solutions of (2) alone. The general continuous solution either of (2) alone or of the system (2), (3) for all u, v E C is not known (to me); even less are those under regularity conditions weaker than continuity or on subsets of c2." M. Bonk [35] investigated the functional equation
and the special case of it, the equation
The functions are supposed to be complex valued. He proves a regularity result stating that all continuous solutions fl,fa, gi, hi of (4) with linearly independent gi's and hi's can be (uniquely) extended to analytic functions on P so that those extensions also satisfy (4) on P. Using this result he presents all continuous solutions of (5) in the case k 5 2. Of course, as a special case, it is possible to get the continuous solutions of (2) or of (3) and of the system (2), (3). In his investigations M. Bonk uses Fourier transform and special estimates. Roughly speaking, his regularity theorem depends on the fact that a solution f of (5) can be "renormalized" so that it becomes a function vanishing exponentially at infinity, and that the Fourier transform of such an f satisfies the same type of equation. Then the Laplace transform gives the The case k 5 1 is elementary. In the case k 5 2 solutions extension to P. can be found using complex function theory, but this case is much more complicated than the case k 1. In 1992 R. Rochberg and L. A. Rube1 [I711 used different methods to find all analytic solutions of equation (4) in the case n = 1 and k 5 2. They presented two ways to solve (4). The first method reduces the functional equation to a differential equation (by using a computer). The second method uses special symmetries of the equation and is a more geometrical proof. (They missed some solutions in both cases.)
<
5 24.
Characterization of Weierstrass's sigma function
287
Finally Bonk obtained in 1997 [37] all continuous solutions of equation (4) in the case k 5 2. The aim of this section is to prove that every measurable solution (f l , f2) of (4) which is not almost everywhere zero is in Cm = Cm (Rn ,C). In particular, the problem of W. Sander will be solved. On the other hand, we show that the results of M. Bonk can also be obtained using reduction to differential equations. Hence we obtain generalizations of the results of M. Bonk and the results of R. Rochberg and L. A. Rube1 with a proof using only generally applicable methods. The proof of the regularity part depends on regularity theorems of this book, especially on Theorems 8.1, 11.6, and the general transfer principle 1.23. By our method it is possible to prove that the not almost everywhere zero solutions f l , fz of the more general equation k
(6)
fl ( ~ 1 ( uv,) ) f 2 ( ~ 2 ( u , v )=) z
g i (u)hi(v),
(w)E
C
R
X
Rnl
i=l
are also in Cm. Here the given Cm functions Fl,F 2 are supposed to satisfy some weak conditions, for example the system x = Fl (u, v), y = Fz(u, v) has to be solvable with respect to u, v. (The method of Bonk does not work in this generality.) Although by our method even more general equations can be treated, for simplicity we restrict ourselves only to equation (4). We obtain and solve differential equations like those in the paper of R. Rochberg and L. A. Rubel, but the difference is instead of finding analytic solutions of ordinary differential equations we have to find complex-valued C m solutions of a system of partial differential equations. We remark that in 1974 J. A. Baker 1301 found the measurable and not almost everywhere zero complex-valued solutions f 1, f2 of the equation
With m = 2 his result gives the solutions of (4) in the special case n = 1, k = 1. In this last case K. Lajk6 [I361 also presented the general solution. Before we start our investigations we formulate the regularity theorem of Bonk, although we will not use it in what follows. 24.2. Theorem. Suppose that f l , f 2 , g l , . . . ,gk, h l , . . . , hk : Rn are continuous functions satisfying the functional equations
+C
288
Chapter VII. Applications
Then there exist analytic functions f;, f"2, 31,. . . ,jk,L I , . . . , h k : P -+ C satisfying the same functional equation such that f; lRn = f 1 and lIRn = f2.
f"z
Proof. This is essentially Theorem 1.7 in Bonk 1351. 24.3. Remark. As already known, the answer to the question of W. Sander, as it stands, is negative, that is the measurability of f does not imply the continuity of f . This is shown by the following example: Let f = hl = hz = tQ,the characteristic function of the set of the rationals, let gl be arbitrary, and let g2 = tq - gl. Then (1) is satisfied since
Moreover, in the following example, not only gl and g2, but hl and h2 are also linearly independent: f (x) = x t q (x), gl (u) = u2tQ(u), g2 (u) = -tQ(u), hl (v) = t ~ ( h4 dv) = u 2 t ~ ( v ) . In these examples it is important to have that {x : f (x) # 0) is a subgroup of R and of Lebesgue measure zero. Similar examples can be constructed using other subgroups of the reals having measure zero. Counterexamples defined on Rn can be constructed for example by using projections and the counterexample in the one-dimensional case. These simple examples show that we can expect a positive answer by a modification of the above problem assuming that f is not almost everywhere zero. This modified question is answered by the following theorem. 24.4. Theorem. Suppose that the functions f l , f 2 , gi, hi : Rn satisfy the functional equation
+C
moreover, f 1, f 2 are measurable and not almost everywhere zero. Then f are continuous.
,f 2
Proof. We may suppose that g i , hi are linearly independent (if not, (1) is valid with kt, kt 5 k). By the linear independence of the functions gi and hi there exist regular (k, k)-matrices (ai,j) and (bitj) and fixed vectors u j , v j E R n , 1 5 j 5 k such that
5 24.
Characterization of Weierstrass's sigma function
289
and
Hence the measurability of f and f2 implies the measurability of hi and gi. Introducing the new variables x = u + v and y = u - v , we obtain from (1)
Let us choose a compact set Y with positive measure such that f2(y) whenever y E Y. Then we have
#0
For arbitrary xo E Rn choose a compact neighborhood X of xo. Letting D = X x Y and choosing the compact set ( X + Y)/2 for the functions gi and the compact set (X - Y)/2 for the functions hi as domain, we can apply Theorem 8.1 to show that f l is continuous at xo. Since xo was arbitrary, this means that f l is continuous everywhere. Similarly we obtain that f2 is continuous everywhere. Notice that we used only that f 2 is not almost everywhere zero and there is a point in which fl is not zero.
24.5. Remark. Similar regularity theorems do not remain true for the more general functional equation
Consider the functional equation
where f is an arbitrary measurable function which is zero outside ]1,2[. Now let a1 = bl = a2 = 1, bg = b3 = -1, a3 = 3. I f x = u + v ~ ] 1 , 2 [ a n d 2y E ]3,6[, hence the product y = u - v E ]1,2[, then a ~ u b3v = x is always zero. Thus f satisfies the more general functional equation, f is measurable, but f could be not continuous.
+
+
290
Chapter VII. Applications
24.6. Theorem. Suppose, that the functions f l , f 2 , gi, hi : IWn satisfy the functional equation
+C
moreover, f 1, f 2 are measurable and not almost everywhere zero. Then f 1, f 2 are Cm.
Proof. Theorem 24.4 above shows that f l and f 2 are continuous. As in the proof of Theorem 24.4, we may suppose that the functions gi and the functions hi are linearly independent. Using 24.4. (2) and 24.4. (3) we obtain that there exist complex constants ci,j and 2k fixed vectors ui, vj E W such that
Substituting x = u
+ v and y = u - v we obtain that
whenever x, y E Rn. Now if f l = f2 = f , then we can simply apply Theorem 1.28: choose a bounded open set Y on which f is nonzero and an upper bound K of the vectors ui, vi, and y E Y. Then conditions of Theorem 1.28 are satisfied for the functional equation
x E X , y E Y where X is the open ball around the origin having radius 5K and C is the closed ball around the origin having radius 4K. Hence, f is Cm on X, and because K may be arbitrarily large, everywhere on Rn.
5 24.
Characterization of Weierstrass's sigma function
291
It would be possible to reduce the general case to this special one using the results of Bonk [35] depending on the special form of the equation. Instead of this we will apply the general transfer principle 1.23 to reduce the regularity problem of the above functional equation with two unknown functions to the regularity problem of one functional equation with one unknown function. We will use this more general procedure. Let us choose bounded open sets Yl and Y2 such that f l is nonzero on Yl and f 2 is nonzero on Y2. Introducing new variables and letting XI = X2 be the open ball around the origin having radius 5K where K is an upper bound of the norm of all vectors ui, vi, y E Yl , and y E Y2, we obtain from equation (1) the following system of functional equations :
where21 E X I , y1 E Y2,x2 E X2, and y2 E Yl. Now let X = X1 x X 2 andlet f be defined by f (x) = (fl ( x l ) ,f2(x2))where x = ( X I , x2). By the transfer principle - with appropriate domain D and functions Gi, H - function f satisfies a functional equation
wherey = (yl,y2). Indeed,letY = Y2xY1, D = X x Y , Go(yl,y2) = ( Y ~ , Y I ) , and for 1 5 i 5 k let
Chapter VII. Applications
292
Moreover, let H = (HI,H2) where H1 and Hz are appropriate complexvalued functions (i. e., they do not depend on some variables). Now letting C1 = C2 be the closed ball around the origin and having radius 4K, and letting C = C1 x C2,we may apply Theorem 1.28 to get that f is in Cm on X. This implies that f l and f 2 are Cm on the open ball having radius 5K and centered at the origin. Since K can be arbitrary large, f l and f2 are Cm. We will prove that the measurable and not almost everywhere zero solutions of the functional equation 24.1(4) can be given in the case k = 2 with Weierstrass's sigma function. Here we only sum up the most important properties of Weierstrass's functions in the form we will use them. The proofs can be found in the book of Saks and Zygmund [177]. 24.7. Weierstrass's functions. Let w l , w 2 be complex numbers, different from zero, and such that wl/w2 $ R. Let R = wlZ+w2Z. Weierstrass's sigma function is defined by the absolutely convergent product o ( ~w,l , w2)
=2
n OfwER
(I
-
4 w ) exp (4 + (4) 2, w 2 w
a is an entire function of the variable z and its (simple) roots are the points of 0. The function
where R = wlZ, and the function
where S2 = (0) can be considered as limit cases. The pairs w l , w2 will be called lattice constants; i. e., w l = m and w2 = oo, or 0 # w l E C and w2 = m , or 0 # WI E C, 0 # w2 E C, and wl/w2 $ R. The derivative of Weierstrass's functions with respect to the first variable will be simply denoted by a prime. It is clear from the definition that for any complex number cu # 0 we have
§ 24. Characterization of Weierstrass's sigma function
293
Weierstrass's C function is defined as the logarithmic derivative of the sigma function:
<
is a meromorphic function in the first variable, its poles are the points of 0. In the above limit cases it is known that
erwise
.,
\
1
-
1
1
Z \
Here it is also clear from the definition that for any complex number a we have 1 < ( a ~ , W , a w n= ) -C(w4w2). a Weierstrass's P function is defined by
#0
In the special cases we get
and
otherwise
1
P(wJl,~2 =) 2 Z
C
+
O#w€R
(x-w)~
w2
The P function is periodic in its first variable, and R gives the set of all periods. By the definition for any complex number a # 0 we have P ( a z ,a w l , a w n )
=
1 ,P(z, a
w l , w2).
Chapter VII. Applications
294
We will use the differential equations of the P function. In the special cases we have that
otherwise
where 1
g2=g2(~1,~2)=60
w
and g3=g3(w1,w2)=140
O#wER
O#wER
1 w6
'
gz
It is not hard to prove that - 27g32 # 0. It can be proven (see Saks and Zygmund [177], Ch. VIII, 5 l3), that if g; - 27g32 # 0, then there exist lattice constants w l , w2 different from infinity such that the corresponding g2, g3 are the given ones. We introduce unified notation: if w l = w2 = m, then let gl = g2 = gs = 0; if w l # oo, but w2 = m , then let gl = 47r2/wf and g2 = g3 = 0; otherwise let gl = 0. With this notation we always have gl = 0 or g2 = g3 = 0, or both, but the differential equation of the P functions can be written in a unified way as
Let us note that the P function satisfies the second order differential equation
too. The numbers gl ,gz ,g3 are usually called invariants, because they only depend on R, but not on w l , w2. Since the above first and second order differential equations are autonomous, with any shift value e E C the function P ( z e, w l , w2) satisfies the same differential equation as the function P ( z ,w l ,wz). Using the P function we can get solutions of any (complex) differential 2 equation y' = 4y~+clY2+c2y+c3,namely in the form P ( z + e , w l , w2)+c with
+
5 24.
Characterization of Weierstrass's sigma function
295
appropriate lattice constants w l , w2 and complex constants e, c. Indeed, if all the zeros el, e2, e3 of the polynomial 433 cly2 c2y c3 are different, then let c = ( e l e2 e3)/3 = c1/12, and we introduce the new variable Q = y c. Then with appropriate complex constants g2, g3, for which g: - 27g32 # 0, we obtain
+
+ +
+
+
+
With the corresponding lattice constants w l , w2 different from infinity and with any shift e E C the function x H P ( z e, w l ,w2) c is a solution of the differential equation yt2 = 4y3 cl y 2 c2y cg. If not all of the zeros el, e2, es of the polynomial 4y3 cly2 c2y c3 are different, then the special cases will give the solutions. If all the three zeros coincide, then again let c = (el e2 e3)/3 = el = e2 = e3 = c1/12. Introducing the new variable Q = y c we have
+
+
+
+
+
+
+
+
+
+ +
If only two zeros, say el and ez coincide, then let c = (el and introducing the new variable fi = y c we have
+
+ e2)/2 = el = e2,
24.8. Lemma. With the above notation, let w l , w2 be lattice constants and let gl, g2, g3 be the corresponding invariants. Let u s consider the equation
For all solutions (x, y) E such that
C x C,y #
0 of this equation there exists a z E
C
P(z,W I , W Z ) = x and P t ( z ,wl, w2) = y. For any other solution z' E C we have z'
-x
E 0.
Proof. For the case w2 # cc the proof can be found in the book of Saks and Zygmund [177], Ch. VIII. 14.5. The case w2 = cc can be proved by direct computation.
Chapter VII. Applications
296
24.9. Lemma. Using the notation of 24.7, let us suppose that with some complex constants c, E, d # 0, e, E, and with some lattice constants w l , w2 and (;Il, G2, respectively, we have
for all x &om a nonvoid open interval of the real line. Then w l , w2 and GI, G2 generate the same lattice R, the functions
coincide and we have c = 2, and e - E E R.
Proof. Let R and fi denote the lattice generated by w l , w2 and (;I1,(;I2, respectively. The left-hand side of the equation (1) is analytic on the open set @ \ ((0- e)/d), and the right-hand side is analytic on the open set @\ ((fi - E)/d), and these sets are equal on an interval of the real line. Hence by the principle of analytic continuation (see Dieudonnh [49], 9.4.2) they are equal on the intersection of these open sets. But the set ( (R - e) U (fi - e") ) /d contains only isolated points, hence all points of this set are poles of both functions, i. e., (R - e)/d = (fi - E)/d. Hence R = fi and e - E E 0. The sum representation of the P function implies that z t+ P ( z , w l , w2) and z t-, P(z,(;I1, (;I2) are equal. Since e - e" is a period of this function, for any x for which dx e is not a pole we have that
+
hence c = 2.
24.10. Lemma. Using the notation of 24.7, suppose that w l , w2 and are lattice constants, and the corresponding invariants gi and Qi are the same, i. e., gl = gl, g2 = j2,and g3 = g3. Then
(;Il, (;I2
for all z E C.
Proof. If g2 = j 2 = 0 and g3 = g3 = 0, then the statement directly follows from the definition of P ( z ,w l , CQ). Otherwise let e be an arbitrary real number for which y = P1(e,w l , w2) # 0 is defined and let x = P ( e , w l , w2). Then 2 3 2 Y = 4 x -g1x -g2x-g3,
§ 24. Characterization of Weierstrass's sigma function
hence by Lemma 24.8 there exists E E
297
C,for which
P(E,G I , G2) = x and P'(E,Gl, G2) = y. If s is an appropriate "square root", i. e., the inverse of the restriction of z H z2 to an appropriate neighborhood of y, then t ~ P ( t + e , w ~ , w zand )
teP(t+E,Gl,G2)
also satisfy the differential equation u'
=
s(4u3
2
- glu - g2u - g3)
and the initial condition u(0) = x, hence these functions are equal for real numbers t from an appropriate neighborhood of the origin. Hence the previous lemma can be applied, and we obtain the statement. 24.11. Functional equations and sigma functions. We start with some simple remarks about the solutions f l , f 2 , g l , .. . ,gk, hl, . . . hk of the functional equation k
f l ( +~v ) f 2 ( ~
(1)
-2))
= Cgi(h)hi(v)-
i=l
We will consider only complex-valued solutions defined on Kn (K = R or K = C). Solutions defined on C? can be considered as solutions defined on Rzn. The restriction of any solution to any linear su bspace over IR is a solu tion over the given subspace. Especially, the restriction of solutions over are solutions over Rn. For any linear mapping L : KT + Kn the functions f l o L, f 2 o L, gi o L, hi 0 L are solutions over P. For any constants cl, c2 E Kn the shifted mappings x H f l (x cl), Y f i (Y c2), u t-+ ggi (U (CI ~ 2 ) / 2 )v, hi (v (cl - cz)12) are also solu tions. 41, hl are arbitrary solutions in the case k = 1, then f l f;,f2f2, If f;, giQl, and hihl (i = 1,. . . , k) are also solutions. If T is an arbitrary (complex) k x k matrix having an inverse and with transpose TI, then f l , f z , iji, and hi are solutions too, where 9! = (g1,... igk), h = ( h l , . . . , h k ) , 4 = (41,... ,&J, ?2. = ( L . . , h k ) , and g(u) = T-lg(u), h(v) = T1h(v).
"
+
f"z,
+ +
+
+
Chapter VII. Applications
298
Using the above transformations we may obtain several solutions fiom a given solution of equation (1). In the case k = 1 the complex analytic functions
are solutions. Using the above transformations we obtain from these the
Rn --+ @ type solutions
where a l , a2 E @ are constants, L1, L2 : Rn --+ C are linear forms, and Q : Rn --+C is a quadratic form. Additional solutions in the case k = 2 for lattice constant w l # m , w:! # m (see the previous point) are given by
(see Saks-Zygmund [177], Ch. VIII. 5 7); in the case are the lattice constants, the three solutions are
flk) = f 2 ( 4 (10)
LII(Z)
= +,w,
w2
m),
= hz(2) = a(z,w, w ) ~ ,
-92(2) = hl(z) = -al(z, w, C O ) ~ ,
and
(12)
= m , i.
e., if w, m
§ 24. Characterization of Weierstrass's sigma function
299
All these are C i C type solutions. Note that in all these cases the functions gl, g2 and the functions h l , h2 too, are linearly independent of any neighborhood of any point of the complex plane. From these solutions we obtain by transformations the solutions
the solutions
and the solutions
having the type Rn -+ C, where w l ,w2 and w, cc are lattice constants, a1, a2 E C, el, e2 6 Rn are constants, L, L1, L2 : Rn -+ C are linear forms, and Q : Rn -+ C is a quadratic form. Note that if L $ 0, then the corresponding functions gl ,g2 and hl , h2 are linearly independent on any neighborhood of any point of Rn. Our main result will state that from the complex-to-complex "basic solutions" given by (7) and (9)-(12), using the above transformations, we can obtain any solutions. To be more exact, in the case k = 2 the pairs f l , f 2 : Rn -+ C given by (8) and (13)-(15) are all solutions of the functional equation (1) for which f l and f s are measurable and none of them is almost everywhere zero. In (8) and in (13)-(15) we have not listed the functions gi, hi corresponding to a given pair f l , f2, but they can be obtained easily (cf. [37]). Moreover, as shown by the following lemma (for a proof see Theorem a and H. Gauchman and L. A. Rube1 2.3.1 in T. M. Rassias, J. ~ i r n ~[I691 [56]),it is enough to find for a given pair f l , f2 one system of solutions gi, hi i = 1,2, . . . ,k . The importance of this lemma is that it shows that by solving equation 24.1.(4) we may concentrate on finding the form of the functions fl, f2.
300
Chapter VII. Applications
24.12. Lemma. Let U , V be sets, let gi, iji : U + C , hi, hi : V + C be functions, and let us suppose that the functions gl, . . . , gk, the functions h l , . . . , hk are linearly independent, respectively, and moreover that
and
are two - representations o f the function F- : U x V- -+ C. Then i2 k . Moreover, k = k i f and only i f $1,. . . ,gl, and h l , . . . , hk are linearly independent. 1f i= k, then there exists a uniquely determined constant k x k matrix T having an inverse such that with the notation g = ( g l ,. . . ,g k ) ,3 = ($1,. . . ,& ) , h = ( h l , .. . , h k ) , and h = ( h l ,. . . , h k ) we have $ ( u ) = T - ' g ( u ) and h ( v ) = T 1 h ( v )for all u E U , v E V . Here T' is tho transpose of T . T h e following lemma reduces the problem once more. Roughly speaking, it tells us that it is enough t o find the solution locally i f you want t o obtain the global solution. 24.13. Lemma. Let f l , f2,gi, h i , J i ,?li : Rn -+ C (i = 1,.. . , k ) be functions, and let us suppose that the functions gi, hi are analytic and there exists a S > 0 such that
whenever lui, IvI < 6. Suppose that for any E > 0 the functions g,, i = 1 , 2 , . . . ,k and the functions hi, i = 1 , 2 , . . . , k are linearly independent, respectively, on the open neighborhood of the origin having radius E . Suppose moreover that
is satisfied for all u , v E Rn . Then (1) is satisfied everywhere, and f analytic.
1 , f:! are
Proof. I t is clear that there is a maximal S among those extended real numbers 0 < 6 <_ m for which ( 1 ) is satisfied whenever lul, Ivl < 6. W e have
5 24.
Characterization of Weierstrass's sigma function
30 1
to prove that this maximal value is m. Suppose to the contrary that the maximal 6 is finite. We will prove that f2 is not zero everywhere on any neighborhood of the origin. If f2 would be zero everywhere on a ball having center zero and radius k E I 26, then equation ( 1 ) would imply 0 = x i = , gi ( u )hi ( v ) for lul, Ivl < ~ / 2 , contradicting the linear independence of the functions gi, h i . Substituting new variables in ( 1 ) we obtain that k
f1
( x ) f 2( Y ) =
C gi (1)x + y
hi
(1)x - y
i=l
+
for all pairs x , y E Rn for which / x yl < 26 and lx - yi < 26. Let us fix a yo for which f g ( y o ) # 0. Dividing both sides of ( 3 ) by f 2 ( y o ) we obtain that
whenever 1x1 < 26 - Iyal Hence f l is analytic on the open ball of radius 26 - lyol around the origin. Since lyol can be arbitrarily small, we obtain that f l is analytic on the open ball of radius 26 around the origin. Similarly we obtain that f 2 is also analytic on this ball. Now we use equation ( 2 ) . Lemma 24.12 implies that & and hi are linearly independent on any open ball around the origin. Let E > 0 and let us choose k vectors u j , j = 1 , . . . , k such that ivj 1 < E and det ( ( h i ( u i ) )i,j= # 0. The equations k
f l ( ~ + v j ) f ~ ( u - q=)
C &u()hi ( u j) i=l
are satisfied for all u E Rn , j = 1, . . . , k. From these equations all Ljj ( u )can be expressed as linear combinations with constant coefficients of the terms f ( u v j )fg ( u - u j ) (cf. 24.4. ( 3 ) ) . Hence tji is analytic on the open ball of radius 26 - E around the origin. Since E can be arbitrarily small, each lji is analytic on the open ball of radius 26 around the origin. Now similarly as in the previous step, we obtain that f l and f 2 are analytic on the open ball of radius 46 around the origin. Hence the left and the right-hand side of equation ( 1 ) are both analytic on the open set
+
Chapter VII. Applications
302
and they are equal on the subset
of this set. By the principle of analytic continuation (see Dieudonnk [49], 9.4.2) equation (1) is satisfied whenever lui, lvl < 26. This contradicts the maximality of 6, i. e., we have proved that 6 = m. 24.14. D e t e r m i n a n t s . Let
where gl, . . . ,gk and h l , . . . , hk are linearly independent sets in the linear function spaces U and V, respectively. Moreover let A:, . . . ,A: be k 1 arbitrary linear functionals acting on functions of U and let A:, . . . , A t be k+ 1 arbitrary linear functionals acting on functions of V. Since the functionals are arbitrary, they can - for example - associate to a function the value of the function at a given point, the value of some partial derivative or some directional derivative, some linear combinations of these, etc. Now the k 1 functions v I-+ A$ F (u, v), j = 0,1, . . . , k are for each j linear combinations of the k functions hi, hence they cannot be linearly independent. Thus there exist constants cj which are not all zero such that C jk= , cjAgF(u,v) = 0. Applying the linear functionals Ah to this equation we obtain that
+
+
This means that
(1)
det (A;AJ,F(U, v))!a,j=O = 0.
We will use this equation to obtain a differential equation for the functions f l , f z in 24.1.(4). 24.15. Differential equations. We will use the notation of the previous point. Let us consider first the more simple one-dimensional case n = 1. To simplify the notation, let f,, f,, f,, , f,,, etc. denote the partial derivatives of a function f with respect to the variables u, v. Let us fix u and v.
§ 24. Characterization of Weierstrass's sigma function
303
Then choosing A: (g) = g(u), Ai(g) = gl(u), A: (g) = gl1(u), A t (h) = h(v), Ai(h) = hl(v), A:(h) = hl1(v),24.14.(1) goes over into
The only interesting case is the case when F ( u , v ) # 0. Let In denote the inverse of the restriction of the function exp to the set {x : IS(z)1 < T ) . Suppose, that F ( u o ,vo) # 0. Then - in a neighborhood of the origin - the o , is defined and function G(u, v) = l n ( ~ ( u uo, v v O ) / ~ ( u vo))
+
+
The determinant above can be easily expressed by G. Dividing by the common factors, and subtracting appropriate multiples of the first and second row and column from other rows and columns, respectively, we obtain that
+
+
+
+
2GuGuVv 2G,Guuv 2~:, G,,,,. Subtracting where * = 4GuG,G,, an appropriate multiple of the second row and column from the third row and column, we arrive at
i. e.,
24.16. Partial differential equations. We want to obtain a similar equation for all cases n 1. We will use the notation of the previous point. If the directions (i. e., arbitrary vectors not necessarily having length 1) U I , U Z , . . . are given, we denote by
>
304
Chapter VII. Applications
the directional derivatives of the function f . As above, fixing the points u and v and using directional derivatives with respect to arbitrary directions u l , u2, . . . and v1, v2, . . . , choosing A: ( g ) = g ( u ) ,A: ( g ) = gul ( u ) ,A: ( g ) = = hv2u3(v),24.144) goes g u 2 u 3 ( 4 , A 3 h ) = h ( v ) , q , ( h ) = hu, (4, over into
(1) 0 Gwl 0 GU, GU,,, + G,3G,lv2
0 G,,GU3u1 + GU3G,2,1 + GU,V,,, **
+ G,2,3vl
where
Expanding the determinant, we obtain a partial differential equation, but this is much more complicated than the equation obtained in the case n = 1. To simplify it, let us look for other connections. Choosing Az(g) = g(u), = = h ( 4 , A t ( h ) = hvl (4, At ( g ) = g u 1 ( 4 , m g ) = gu2 (4, h,, ( v ) ,similarly as above we obtain that
Using this, and clloosing A i ( g ) = g(u,),A t ( g ) = g,, ( u ) , A: ( g ) 12: ( h ) = h ( v ) ,At ( h ) = gvl ( v ) ,A: ( h ) = h,,,, ( v ) we obtain that
=
g,, ( u ) ,
and similarly, choosing A: ( g ) = g(u), A: ( g ) = g,, ( u ) , A%( g ) = g,,,, At ( h ) = h ( v ) ,A t ( h ) = h,, ( v ) ,A: ( h ) = hv2( v ) we obtain that
From (2)-(4) we also get the equations
(u),
§ 24. Characterization of Weierstrass's sigma function
305
Using these equations, the equation which can be obtained expanding the determinant of the matrix (1) can be considerable simplified and we obtain that
+
+
Now with substitutions x = u v, y = u - v, and notation H ( x ) = ln(fl (x ~ 0 ) l f (xo)), l K ( y ) = ln(f2(y + yo)/f2(yo)), and choosing U l = el, vl = e;, u2 = e2, v2 = ek, UJ = eg, vg = ei, where el, ei , e2, eh, e3, e', are arbitrary vectors from Rn, we obtain that for all x, y from some given neighborhood of the origin the following partial differential equations are satisfied: (note that we get G(u,v) = H ( x ) K ( y ) if F ( u , v ) = f l ( u v) f 2 ( u - v))
+
+
We will solve this system of equations, first locally, and then globally. Local solutions will be found in the following three lemmas, and global solutions in our main result (Theorem 24.20). In the lemmas we only treat some of the logically possible cases, all other cases will be covered in the proof of Theorem 24.20. The lemmas have completely local nature as well as the above considerations about the system of partial differential equations. Hence these considerations can be used by treating our functional equation 24.1.(4) or similar functional equations on subsets of Rn (or more general structures). As a first step we consider the solutions along a direction. We will use the notation of 24.16 for directional derivatives, and the notation of 24.7 concerning lattice constants and the P ' function.
306
Chapter VII. Applications
24.17. Lemma: Solution of a system of differential equations along a direction. Let f E Rn be a unit vector in Rn. Suppose, that for some E > 0 the functions H and K are complex-valued Cm functions defined on an E neighborhood of the origin of Rn and that they satisfy the equations (6)-(9) of 24.16 for each 1x1, iyl < E .
(1) I f H f f (4 - K f f ( 9 ) # 0, H f f f (4 # 0, and K f f f ( Y ) # 0 whenever x , y E R n, 1x1, Iyl < E , then there exist 0 < E' < E and 0 < 6 E - E', such that for some c E C, for some lattice constants wl, w2, and for some complex-valued continuous functions e, e" defined on an E'-neighborhood of the origin we have
<
and K f f (+~s f ) = - p ( s + E ( y ) , w l , w 2 ) + C whenever x , y E R n, 1x1,1y1 < E', t , s E R, and Itl, Is1 < 6 ; (2) I f H f f (4 - K f f ( 9 ) # 0, H f f f (4 # 0, and K f f f ( Y ) = 0, whenever x , y E Rn and 1x1, Iyl < E , then there exist 0 < E' < E and 0 < 6 5 E - E', such that for some c E C,for some lattice constants w ( x ) ,m and for some complex-valued function e ( x ) defined on an E'-neighborhood o f the origin we have
whenever x , y E Rn,1x1, lyl
< E', t , s
E R, Itl, Is1
< 6;
(3) I f H f f (4 - K f f ( 9 ) # 0, H f f f ( x ) = 0, and Kf f f ( 9 ) # 0, whenever x , y E Rn and ixI,lyl < E , then thereexist 0 < E' < E and0 < 6 5 E - E ' , such that for some c E C, some lattice constants w ( y ) ,m and for some continuous complex-valued function e ( y )defined on the &'-neighborhood of the origin we have
whenever x , y E R n, 1x1, Iyl
< E', t, s
E R, Itl, 1st
< 6;
(4) I f H f f f ( x )= 0 and K f f f( y ) = 0, whenever x , y E Rn and 1x1, Iyl < E , then for some c E @ and for all 0 < E' < E we have Hff(x
+t f )= c
w h e n e v e r x , ~E W , jxl,lyl
< E',
and
+
Kff(y sf)=c
and Itl,js/ < 6 = E - E ' .
§ 24. Characterization of Weierstrass's sigma function
307
Proof. Let 0 < E" < E be arbitrary. For fixed x and y let N ( t ) = H f f b + i f )and K ( s ) = K f (y s f ) . With this notation and choosing el = e2 = e3 = ei = e', = e$ = f we obtain from equation 24.16.(9)
+
whenever t, s E R, Itl, Is/ < E - E". First we will treat case (1). With the notation C ( t ) = X ( t ) - K(0) we have C ( 0 ) # 0 and C 1 ( t )# 0. From equation (5) we get
This means that function C satisfies the differential equation
Using well-known methods this differential equation can be reduced to the first order differential equation
where C is a constant. Instead of making this reduction step exact, we will prove that under the initial conditions
equations ( 6 ) and ( 7 ) are locally equivalent. To be more exact, any solution C of ( 7 ) , for which C' is nowhere zero, is a solution of ( 6 ) . In the other direction, for any solution C of ( 6 ) satisfying (8), too, there is a unique constant C satisfying
and there exists an open ball around the origin in R such that on this ball the solution C satisfies ( 7 ) . Let us consider first a solution of ( 7 ) with non-vanishing derivative. Differentiating equation ( 7 ) we obtain that
Dividing both sides by 2C' and multiplying by C we obtain that
Chapter VII. Applications
308
Expressing -2CL2 from (7) and substituting into this equation we obtain (6). The other direction is somewhat harder. Clearly there exists exactly one constant C satisfying (9). Hence under the given initial conditions C is uniquely determined. First we prove that solutions of (6) under the initial conditions (8) are locally unique in the sense that for any two solutions of (6) there exists a neighborhood of the origin such that these solutions are equal on this neighborhood. Both solutions satisfy on some neighborhood of zero the explicit equation
Under the initial conditions (8) there is one and only one complete solution of (10). Hence any two local solutions of (6) coincide with this solution of (10) on some neighborhood of the origin. Now let us consider an arbitrary non-vanishing solution L of (6). Choosing the unique constant C defined by (9), any solution of (7) with nonvanishing derivative and satisfying (8) (there exists such a solution, as we will see below), is equal to L on a neighborhood of zero. Hence L satisfies (7) on a neighborhood of zero. The next step is to solve (7) under the initial conditions (8). By the properties of the P function (see 24.7) there exists a constant c l and lattice constants w l and wz such that for any constant e E C the function
t
-
-P(t
+ e, w,, w,) + c,
satisfies equation (7), except in at most countably many isolated points t E R By Lemma 24.8 the complex constant e can be chosen so that the initial conditions (8) are satisfied. Note, that e depends only on the initial conditions, and although not unique, the difference of any two possible e's is in R. Since (6) and (7) are locally equivalent, we have proved that there exists a 6 > 0, such that with some constant c
whenever ltl < 6. We obtain similarly, that - for some 8 > 0 constants c" and E and lattice constants Gland & 2 , such that
- there
exist
5 24.
Characterization of Weierstrass's sigma function
309
whenever Is1 < 8. Here also E only depends upon the initial conditions and is uniquely determined up to an additive constant from fi. Our considerations above are valid for fixed x and y. Let us investigate how the result depends upon x and y. If, say, y = 0, then for all x for which 1x1 < 6, the above considerations can be applied, and we obtain that there exist lattice constants w l (x),w2 (x) and complex constants c(x) and e(x) and 6(x) > 0, such that
whenever ltl < S(x). Similarly, there exist lattice constants Gl (y),2 2 (y) and complex constants E(y) and E(y) and 8(y) > 0, such that
whenever Is1 < 8(y). We will investigate the dependence of the parameters upon x and y, respectively. Let gi (x) be defined by w1 (x),w2(x) and let fii (y) be defined by Gl (y) G2(y), respectively, where i = 1 , 2 , 3 (see 24.7). To simplify notation we will not indicate the dependence on x and y, respectively. Moreover we will use the handy notation F ( t ) = P ( t e , w1, w2) and P ( s ) = P ( s E, GI, ha). The functions P and P satisfy the equations
+
+
and
respectively. We substitute (11) and (12) into (5), use the last two equations and obtain (after cancellations)
Differentiating with respect to t we obtain that
310
Chapter VII. Applications
Differentiation of (14) with respect to s yields
whenever It 1 and Is1 are sufficiently small. This is possible only if
Substituting (15) into (14) we get
which is possible only if
Putting (15) into ( l 3 ) , a comparison of terms not containing P ( t ) and leads to
p(s)
We will prove that E = c , = gl, g2 = g2, and 3 3 = g3 are constant, hence we may choose Wl = w l and W2 = w2 also to be constant. First we prove that there exists no pair x , y such that Wz(y) = w, but wz(x) # co, or that w2(x) = m , but W2(y) # w . We will treat only the first case, the other can be treated similarly. If there would be such a pair, then, using that 3 2 = $3 = g1 = 0, (cf. 24.7) equation (16) would imply 12g2 = ij:, and (17) would give g: = 216g3. But then -31112 would be a zero of 4z3 - g2z - g3 and its derivative, and hence (at least) double root of the equation 4z3 - g2z - g3 = 0, contradicting that w2 # w . So there remain only two cases: w2 (x) = W2 (y) = m for all x, y, or both values are finite for any pair x, y. In the first case g2(x) = g3(x) = 0 = &(y) = 33(y) everywhere, and hence from (17) we obtain that gl(x) = &(y) everywhere, i. e., both are constant and they are equal. Then by Lemma 24.10 we may choose w l = W1 to be constant. In the second case gl (x) = 0 = (y) everywhere, and hence from (16) we get that g2(x) = g2(y), and from (17) we see that g3(x) = g3(y) everywhere. Hence g2 and g2, moreover g3 and 33 are constant and they are equal, respectively. Using again Lemma 24.10 we may choose w1 = GI and w2 = W2 to be constant. In both cases we obtain from (15) that c(x) and E(y) are constant and that they are equal.
§ 24. Characterization of Weierstrass's sigma function
311
Now let us investigate e(x) and e"(y). As we have mentioned, e(x) only depends upon the initial values H f f ( x ) and H f f f ( x ) and is unique modulo R. We know that Hff (x) = -P (e(x):w1, w2) c
+
Let X ( x ) = P ( e ( x ) , w 1 , ~ 2 )and
Y(x)=P'(e(x),wl,w2).
By the above equations X = c - Hff and Y = -Hff f , thus they are in Cm. The functions X and Y satisfy the equation y2 = 4 x 3 - g l x 2 - g 2 X - g3. Since Y is nonzero in the origin, Y is uniquely given by X around the origin. By the same reason the mapping z ++ P ( z , w l , w2) has an inverse on some neighborhood of e(0). Let I denote its inverse and let e*(x) = I (c - H~~(x)). On some neighborhood of the origin we have that
Since Y is uniquely given by X, we have on some neighborhood of the origin
But this means by Lemma 24.8 that
Hence we may suppose that e = e*, i. e., that e E C". Similarly we obtain that we may suppose that e" E Cm and thus E is continuous. Finally, we have to prove that 6(x) and 8(y) can be chosen to be independent from x and y and that they coincide. Let us choose S(x) to be maximal between those numbers 6' which are not greater than E" and for which H f f ( x t f ) = -P(t e(x),wl,wz) c
+
+
+
whenever It1 < 6'. We will prove that if 1x1 < E' = ~ " / 2 ,then S(x) 2 E'. Suppose that for some x this does not hold, for example that arbitrarily close to S(x) < E' there exists a t > S(x) such that
Chapter VII. Applications
312
(The case when this is true for t < -6(x) can be treated similarly.) Introducing the new variables t" = t - 6(x) and 2 = x +(x)f we obtain that H f (Z and - P ( f e(x) G(x),w l , w2) c coincide on some left-hand side neighborhood of the origin but they are not identically equal on any right-hand side neighborhood of the origin. Since on the other hand
+ t"f)
+
+
+
+
whenever lt"l < 6(2), by Lemma 24.9 we obtain e(2) - e(x) - 6(z) E R. But then we have H f f (2 f f ) = -P(t"+ e(x) S(z),wl, w2) c on a right-hand side neighborhood of the origin too, which is a contradiction.
-
+
+
+
In the next step, we will treat case (2). Let us fix some x and y. In this case Kt 0 and K" = 0, and hence as in case (1) we obtain that L = 3t-K(0) satisfies equation
on some neighborhood of the origin (cf. (7)), moreover L satisfies the initial conditions (8). By the properties of the P function (see 24.7) there exists a constant cl and there exist lattice constants w, m such that for any constant e the function t e -P(t e,w, m ) c1
+
+
satisfies equation (18) except in at most countably many isolated points t E R. Here also by Lemma 24.8 the complex constant e can be chosen so that the initial condition (8) is satisfied, moreover e depends only upon the initial conditions and although e is not uniquely determined, the difference of two suitable e's is in R. Hence we have proved, that there exists a 6 > 0 such that with some constant c we have
whenever It 1 < 6. Moreover clearly K(s) = E for some constant E on some neighborhood of the origin. Again we investigate the dependence of the result upon x and y. Here we also obtain that there exist lattice constants w(x),m and complex constants c(x) and e(x) such that 6(x) > 0 and
§ 24. Characterization of Weierstrass's sigma function
Moreover there exist complex constants Z(y) and
313
6"(y) > 0, such that
Let us now investigate the dependence of the parameters upon x and y. Let g1 (x) be defined by w ( x ) ,cc (see 24.7). To simplify the notation we will not indicate the dependence upon x and y, respectively. Moreover we will use the handy notation P ( t ) = P ( t e, w, co). The function P satisfies the differential equations
+
P ' ~= 4P3 - glP2 and PI1 = 6P2 - glP. Substituting N and iC into ( 5 ) and using the above equation we obtain that
Differentiating with respect to t we obtain that
This is possible only if
g l ( c - C") = 6(c - E ) ~ . Substituting this into (19) we obtain that c ( x ) = E(y),which is possible only if both are the same constant c. We have to prove that S ( x ) and 8 ( y ) may be chosen independent from x and y, respectively, and equal to each other. This can be done in the same way as in case ( 1 ) using Lemma 24.9. The proof of (3) is analogous to the proof of ( 2 ) . Finally, ( 4 ) trivially follows from ( 5 ) . 24.18. Lemma: Connections between different directional derivatives. Suppose that 6 > 0, and e and f are vectors ofRn. Suppose that H , K are complex-valued CM functions defined on an open 6 neighborhood of the origin of Rn satisfying equations (6)-(9) of 24.16.
(1) If H,,,(x), K,,,(y), and H e e ( x )- Ke,(y) are not zero for any x , y E Rn, 1x1, Iyi < 6, then there exist complex constants d, c', and c" such that
whenever x , y E Rn and 1x1, jyl < 6;
3 14
Chapter VII. Applications
(2) I f H e e e ( x a ) n d H e e ( x ) - K e e ( y )arenot zeroforanyx,y E Rn, 1x1, IyI < 6, but K e e e ( y )= 0, whenever lyl < 6 , then there exist complex constants d, cee,c e f , and c f f such that
whenever x , y E Rn and 1x1, Iyl
< 6;
(3) I f K e e e ( y and ) H e , ( x ) - K e e ( y )are not zero for a n y x , y E F, 1x1, Iyl < 6, but H e e e ( x )= 0, whenever 1x1 < 6, then there exist complex constants d , cee,c e f , and c f f such that
whenever x , y E Rn and 1x1, Iyl
< 6;
(4) I f He,,, H f f , Kee e,and K f are zero everywhere on an open 6-neighborhood of the origin, then there exist complex constants cee,c e f , and c f f such that Hee(x) = Cee = Kee(y), Hef ( x ) = C e f = Ke ( Y ) H f f (4= C f f whenever x , y E Rn and 1x1, Iyl < 6.
=K
f f (Y)
Proof. We first derive some additional equations from (7)-(9) in 24.16. Let us write down again equation 24.16.(9), but changing the roll of el and e i , of ez and eb moreover of e3 and e',, respectively. Let us subtract from the new equation the original one. Then we arrive at
Simplifying and changing again the role of el and ei we obtain that
§ 24. Characterization of Weierstrass's sigma function
315
Further useful equations can be obtained from equation (7)and (8)of 24.16. In equation 24.16.(8)let us substitute e3 with ei and let us replace el with ei and e2 with ei. If we add the resulting equation to 24.16.(7)or if we subtract it from 24.16.(7)then we obtain two simpler equations. After replacing ell with en these simpler equations are
and
Let us start with the proof of (1).Choosing el = e2 ei = f in (5) we obtain
= e3 = ell =
e&= e,
Since He,, and Keee are nowhere zero we conclude that
Now (6)implies
Dividing both sides by He,, we see that
which can be valid only if both sides are the same constant. Finally, from equation (6)of 24.16 we have that
By hypothesis we may divide both sides by He, (x)- K e e(y),and obtain that H f (x)- d2H,, (x)and K f f (y)- d 2 ~ ,(y) , are the same constant. The last two equations in part (1) follow from the representation of H f f and K f f , using (8).Thus part (1)of the lemma is proven.
316
Chapter VII. Applications
To prove (2), we choose ei = e; = ei
=e
in (5) and obtain that
Since the right-hand side is zero everywhere, we have that K,,,,,, is identically zero for arbitrary vectors el, e2, e3. Especially each partial derivative of Kee, Kef , and K f is equal to zero, and hence K e e l K e f , and K f are constants. Let us denote these constants by cee, cef, and c f f , respectively. Now we consider some directional derivative of the function Hef
(x)- Cef
Hee(x) - Cee with respect to some direction g. This is
By substituting el = e2 = e/z = e, ei = f , ei = g in (6) we see that the nominator is equal to zero. Hence each directional derivative is equal to zero, 1. e., Hef - Cef d H e e (x) - Cee
(XI
for some constant d. Using K f (y)
=
cff and Kee(y)= cee we get from (9)
The proof of (3) is analogous to the proof of (2) (using (7) instead of (6)). To prove part (4) of the lemma we substitute el = e2 = e3 = ei = e/z = e', = e into (9) of 24.16 and get
Hence He, and Kee are constant and they are equal. Similarly we obtain that Hf and K f are also constant and they are equal. Now, using equation 24.16.(6), we arrive at
which leads to He (x) = Kef (y)
= cef
(say).
§ 24. Characterization of Weierstrass's sigma function
317
24.19. Lemma: local solutions. Suppose that H and K are complexvalued Cm functions defined on a &neighborhood of the origin ofRn satisfying the equations (6)-(9) of 24.16. Moreover assume that
<
(I) for any index 1 5 j n, the functions d,3H and tf'K are either zero everywhere in the given neighborhood or are never zero in the given neighborhood; (2) for any such index j, for which a;H or d,3K is never zero in the given neighborhood, the difference~ ; H ( X-)a ; K ( y ) is also never zero for any x and y from the given neighborhood; (3) there exists an index j, such that t f H or the given neighborhood. Then, with the notation x notation of 24.7 we have
=
( x l , .. . , x n ) , y
q K or both are never zero on =
( y l , .. . , y n ) , and with the
qK
(4) i f for each index j, ?H and are never zero in the given neighborhood, or i f both of these functions are zero in the given neighborhood, then we have for some lattice constants wl, wz and for some complex numbers a , 6 , bi, &, c, F, ci,j = cj,i,di, e, d on some neighborhood of the origin
and
moreover di is zero i f and only i f 8; H and 19; K are zero everywhere;
(5) i f for each index j either t$ H is never zero and K is everywhere zero on the given neighborhood, or if both are zero everywhere on the given neighborhood, then we have for some lattice constants w, cx and for some complex numbers a , 6 , bi, bi, c, ci,j = cj,i,di, e, E on some neighborhood of the origin H ( x )= a
+
bizi i=l
+
ci,jxixj i,j=l
+ In(co(dlxl + . . + dnxn + e;w, m))
Chapter VII. Applications
318
and
n
n
moreover di is zero if and only if d?H is zero;
(6) if for every index j either 8: H is zero everywhere and K is never zero or if both are zero everywhere on the given neighborhood, then with some lattice constants w , oo and with some complex constants a , 6, bi, bi, c", ci,j = cj,i, di, e, Z on some neighborhood of the origin
and
moreover di is zero if and only if 8: K is zero;
(7) in all other cases possible under conditions (I), (2) (3) there is no solution on any neighborhood of the origin.
Proof. By Lemma 24.18 it is clear that if there exists an index j such that d,3H is nowhere zero, but is identically zero on the given neighborn on the given neighborhood, then d f K is identically zero for all 1 i hood. Similarly, if there exists an index j that ?K is never zero but q H is identically zero on the given neighborhood, then d?H is identically zero for all 1 5 i 5 n on the given neighborhood. Hence there is a solution on the given neighborhood only in the cases given by (4)-(6) and hence we have proved (7). For the proof of (4)-(6) we use induction with respect to the dimension. Without loss of generality we may suppose that those indices i for which d?H and d:K are identically zero are greater than those for which one or both are nonzero. Let us start with the proof of (4). In the case n = 1 the statement follows from Lemma 24.17. By induction we suppose that the statement is true for n - 1, i. e., there exist lattice constants w l ,w:! and complex constants a, 6, bi, bi, C, E, ci,j = cj,;, di, e, e", (i,j < n ) such that on a neighborhood of the
q~
< <
$24. Characterization of Weierstrass's sigma function
319
zero vector
and
whenever we are in the n - 1 dimensional subspace spanned by the first n - 1 standard base vectors fl,fi,. . . , f,-l; moreover for each i < n it is true that di = 0 if and only if d,"H - 0 8~, " ~ . To obtain handy notation let us introduce
First we will investigate the case when d : and ~ d;K are never zero. From Lemma 24.17 we have that
(10)
d ; ( X I~
+ x, f,) = -P
(xn + el (z') , hl,hz)+ cl
and
for sufficiently small x', y', x,, and y,, where x,, yn are real numbers and x', y' are vectors from Rn with last coordinate equal to zero. The main idea of the proof is now to show that there is a constant d, and there are continuous functions ei(xi,. . . ,x,-l) and Ei(yi,. . . ,y,-I), i = 2,.. . , n satisfying
and
Chapter VII. Applications
320
i
< n. Let us observe, that e, and e", are constants. From Lemma 24.18.(1) we obtain that for i < n with some constants Di
and c:, cy we have
did,H(x) (15)
+ c:
= Did,"H(x)
+
did,K(y) = Did,"K(y) ci, ~ ~ H ( x ) = D ~ ~ , " H ( xand ) + cd ~~ K ( y ) = D , " d ~ ~ ( y ) + c ~ , and
d ; H ( x ) = D : ~ : H ( X ) and d ; ~ ( y =) D ; d ; K ( y ) for all sufficiently small x , y. From the last line follows that Di # 0, i < n. Differentiating expression (10) with respect to x, and then substituting x, = 0 and using the last line of (15) for i = 1 and the induction hypothesis we get (using ( 8 ) )
On the other hand, using (10) and the second line of (15) and ( 8 ) we arrive at
for all sufficiently small X I . The implicit function theorem implies that for any fixed x2, x3,. . . ,x,-1 the function el is continuously differentiable with respect to X I . Differentiation of (17) with respect to xl yields
Hence using (16) we see that
i. e . e l ( x . . . , x ) = 1 . This means that for some continuous function e2( x 2 ,. . . , X n v l ) we have
5 24.
Characterization of Weierstrass's sigma function
321
that is (13) with i = 1. Defining d , = D l d l (note that d, D l # 0 and dl # 0 ) , using ( 1 7 ) and 24.7.(1) we obtain (19) ~ ( d l ~ l l d e, z ( x 2 , . . , x,-l), G I , G2) - C l = d : ~ ( d l x l . . . dn-lxn-l e, w l , ~ 2 -) 2 ~ ; c l l- CY
# 0
because
+
+ + + = 7'(dlxl/d, + + d , - l ~ , - l / d , + e / d n , wild,, wnld,)
-
2 ~ 1 D: 1
-
C:
Putting x2 = . . . = x,-1 = 0 into this equation Lemma 24.9 implies that C I = 2 ~ 1 1 D ; c'1/ and that P ( z , G l , G 2 ) = P ( z , u l / d n , w 2 / d n ) Defining c,, = ell Df cY/2 we conclude from ( 1 0 )
+
+
Now let x , = 0 in ( 2 0 ) and use ( 1 7 ) to get
In exactly the same manner we obtain (because of the analogies between H and K in ( 1 5 ) )
Let us remark that substituting y2 = . - - = y,-1 = 0 into this equation and substituting x2 = = x , - ~ = 0 into equation (21) moreover using Lemma 24.9 it follows that d, e2 ( 0 , . . . , 0 ) - e and d , e"2 ( 0 , . . . , 0 ) - e" are in the lattice R generated by w l , w2. In the first step we arrived at (21) and ( 2 2 ) using (15) for i = 1. Now we continue similarly, but now we apply ( 1 5 ) with i = 2. Substituting x , = 0 in ( 2 0 ) and using the second line of ( 1 5 ) for i = 2 and the induction hypothesis we obtain that
Chapter VII. Applications
322
Using (21) this equation implies that D2d2= d,. Differentiating (20)with respect to x, and then substituting x, = 0, and using the last line of (15) and the induction hypothesis we conclude that
i. e., because of D2d2 = d,,
By (21) for any fixed 23,. . . , x,-1 we get from the implicit function theorem that the function e2 is continuously differentiable with respect to 22. Differentiating (21) with respect to x2 we obtain that
Hence using (25) follows that
everywhere. This means that for some continuous function we have
. ,x,-I)
e 2 ( ~ 2 , ..
d2x2
e3(x3,. ..
, x,-l)
+ ~ ( 2 3 ,. ..,&-I),
=dn
which is (13) with i = 2. In the same manner it follows that for some ~, continuous function E3 (33,. . . ,yn- 1) we have
Continuing the induction in this way we finally obtain (12)-(14)and thus also
§ 24. Characterization of Weierstrass's sigma function
323
and
+
d i ~ ( ~yn' f n ) = 2cnn
-
di7'(dlyl
+ d2y2 + . . . + dnyn + dnEn,wl,w2).
Here both dnen - e = dne2(0,.. . , 0 ) - e and dnEn - E = dnE2(0,.. . , 0 ) - E are in the lattice R and since the points of R are periods, we have
~:H(x'
+
X,
f,) = 2cnn - di7'(dlzl
+ d 2 ~ 2+ . . + dnxn + e, w l , a
~
2
)
where the functions B1 and B1 are in Coo. Differentiating these equations with respect to xl and yl and then substituting x, = 0 and y, = 0, respectively, using the first line of (15), the equations (8) and (9), moreover that Didi = d, whenever i < n, we obtain that dlBl = dlBl is the same constant. Let us denote this constant by 2cl,,. Thus we have
Continuing by induction, we finally arrive at
where we have defined b,
= B,
and
b,
= B,, respectively.
324
Chapter VIL Applications
Finally, we integrate thèse équations. By the induction hypothesis d2x2 H
( \n(ca(d1y1
+ d2y2 +
+ dnxn + e,ui,u2))
and
h dnyn + ë,u1,u;2))
are defîned on a neighborhood of the origin. Hence by intégration of the previous équations we see that in a small open bail around the origin we hâve H[x' + xnfn)
= \n(ca(diXi
+ d2x2 + • • • + dnxn + e, UÛI^U)2)J
n-l n
/
,j
'
and 1
+ ynfn) = ln{c&(diyi + d2y2 H
h dnyn + ë,uji,u)2))
n-l
Substituting xn — 0 and yn — 0 and using initial conditions (8) and (9), respectively, we obtain that ^... ,x n _i) = a + 2^biXi + ^ J
c
ijxixj
Hence the statement follows. We still hâve to investigate the case when d^H and d^K are both identically zéro. In this case we hâve to prove that the statement holds with dn = 0. By Lemma 24.17.(4) there exists a constant c\ G C such that with c nn — ^i/2 for ail sufficiently small x\ y'\ xn, and yn we hâve dlH(x' + xnfn)
= 2cnn
and d2nK(y' + ynfn)
= 2cnn.
Let us observe that didnH{xf) and didnK(y') is the same constant for ail i < n: If ô^iJ and c^if are nonzero, then (15) holds true and from the last line of (15) we obtain D{ = 0, hence the statement follows from the first line
5 24.
Characterization of Weierstrass's sigma function
325
of (15). If df H and a! K are both identically zero, then the statement follows from case (4) of Lemma 24.18. Now let us integrate the equations above. We arrive at
The functions B1 and B1 are in Cm. Differentiating these equations with respect to xl and yl and then substituting xn = 0 and yn = 0, respectively, we obtain that dlBl = dlBl are constant. Denoting this constant by 2cl,, we have that
Continuing by induction, we finally obtain (denoting again Bn and B, by bn and b,, respectively) that
and
Integrating these equations gives
Substituting xn tively, leads to
=0
and yn = 0 and using the equations (8) and (9), respec-
326
Chapter VII. Applications
and
hence because dn = 0, the statement follows. Thus the statement ( 4 ) is completely proven. Now we prove (5). The case n = 1 follows from Lemma 24.17. Our induction hypothesis is that
and
if we are in the subspace spanned by the first n - 1 standard basis vectors f 1 , f 2, . . . , fn- 1 . Moreover di is equal to zero if and only if 8: H is identically zero. First we will investigate the case when 8: H is never zero. From Lemma 24.17 we obtain that
and that
whenever x', yl, x,, and yn are sufficiently small, where xn, yn are real numbers and x', y' are vectors from Rn with last coordinate equal to zero. ) Using this, we obtain The induction hypothesis implies 8 ; ~ ( ~=' 2cii. from Lemma 24.18 that with some complex constants Di, ci,, and cnn,
8: K ( ~=)2cii, &dnK ( y ) = 2cin, 8: ~ ( y=)2cnn, didnH(x)- 2cin = Di ( d ; H ( x ) - 2cii), 8; H ( x )- 2cnn = 0:(8: H ( x ) - 2cii) 8:H(x) = D ; ~ ; H ( x )
5 24.
Characterization of Weierstrass's sigma function
327
whenever x, y are sufficiently small. From the last line it follows that Di # 0 for all i < n. From (28) and the first line of (29) it follows that cl = 2cnn, moreover that
(8: ~ ( x ) )- 4 (8; H(X)
- 2c,,)
(~3w4 -2~,,)~
3
- 0; -
(8: H (x)) - 4 (8; H ( x ) - 2 ~ 1 1 ) ( ~ ? H ( x-) 2 ~ 1 1 ) ~
By the induction hypothesis, using the differential equation of the P functions we obtain that if w = m , then the right-hand side is zero. If ilr(xl,. . . ,x,-1) would be somewhere different from infinity, then the left-hand side would be 47r2/ilr(xl, . . . ,x , - I ) ~ , which is impossible, hence ilr(xl, . . . , x,-1) = m. If w # m , then the right-hand side is 4d?D;n2/w2 # 0, hence the left-hand side is nonzero, i. e., b ( x l , . . . ,x,-1) = m is nowhere valid. Thus the left, we obtain that ilr(xl, . . . ,x,- 1) = hand side is 47r2/ilr(xl, . . . ,2,- I ) ~ hence hw/(dl Dl). Since the sign is unimportant (the lattice remains the same) we may suppose that ilr(xl, . . . ,xn-1) = w/(dl D l ) everywhere. Using this together with the notation d, = Dldl and
(27) goes over into
If we substitute x, = 0 into (30) we obtain that
Now we first differentiate (30) with respect to x, and then we put x, = 0 to get 8;H(x1) = -d;p1(d,el ( z l , . . . ,2,-I), w , m ) .
the above equations show that the functions X and Y can be represented by means of 8: H ( x l ) and 8: H ( X I ) and hence they are in Cm . Suppose that w # m . The functions X and Y satisfy the equation Y = 4 x 3 - 47r2x2/w2.
328
Chapter VII. Applications
Since Y is nonzero at the origin, it is uniquely defined by X around the origin. For the same reason the mapping z I--+ P ( z , w,oo) has an inverse on some neighborhood of dnel(O,.. . ,O). Let I denote the inverse of this mapping and let eT (xl , . . . ,xn- I ) = I (- (8; H ( X I ) - 2cnn)Id:) Idn. In some neighborhood of the origin we have
Since Y is uniquely defined by X, on some neighborhood of the origin it is also true that
But by Lemma 24.8 this means that
Hence we may suppose that el = eT, i. e., that el E C". If w = cc we get el E Cm similarly. Let us investigate the function el. Substituting x, = 0 into (30) and using the third line of (29) and the induction hypothesis we obtain that
Using that d, = Dldl this equation implies
On the other hand, differentiating equation (30) with respect to x, and then substituting x, = 0, moreover using the last line of (29) and the induction hypothesis, we arrive at
§ 24. Characterization of Weierstrass's sigma function
329
i. e., because of Didi — dn, we get V1 (dne1(x1,,.. ,^ n _!),a;,oo) hdn-i^-i^oo). = V/(d1x1 + By (32) — for any fîxed x2,. •. , xn-\ — using the implicit function theorem we hâve that the function e\ is continuously differentiable with respect to x\. Differentiation of (32) with respect to x\ yields dnV' \
)
dV Now, (31) and (32) resuit in x) everywhere. But this means that for some continuous function we hâve
Putting this into équation (32) and choosing x2 = • • • — xn-i = 0 from Lemma 24.9 we obtain also that d n e2(0,... ,0) — e is in the (degenerated) lattice Q generated by LU. In the next step we show that (12) and (13) are valid with i = 2. Substituting xn = 0 into (30) and using the third line of (29) and the induction hypothesis we obtain that dlV{d1x1 +dne2(x2,... (36)
2
,z n _i),a; 3 oo) - 2cnn 2
= -d nH(x') - -D 2dlH(x') + 2D22c22 - 2cnn
= D^d\V(diXi H h dn-1xn-i + e,o;, oo) — 2c nn . Using (32) from this équation we get that D2d2 = dn. Differentiating équation (30) with respect to xn and then substituting xn = 0, moreover using the last line of (29) and the induction hypothesis, we obtain that dnV'(diXi +dne2(x2,...
(37)
,z n _i),a;,oo)
= -dlH{x') = -DldlH{x')
= Dld\Vk\diXi + [-dn-xXn-u U,OO), i. e., because of D2d2 — dn, it follows that V'{d\x1 +dne2(x2,... ,a;n-i),cc;,oo) \Oo)
V (d H h d n ) Equation (38) is exactly équation (25) with u)\ = UJ, UJ2 = oo. Following the argument after équation (25) we arrive (with obvious changes) at (5).
330
Chapter VII. Applications
We have to investigate also the case when d:H and d:K are both zero everywhere. In this case we have to prove that the statement holds true with dn = 0. By Lemma 24.17 we obtain that there exists a constant cl E C such that for all sufficiently small x', y', x,, and yn we have
hence with notation cnn = c1/2 we get d : ~ ( x ' + x , f ~ ) = 2cnn and
d:K(yl+
ynfn) = 2cnn.
Let us note that didnH ( x t ) and didnK(y1)are the same constant for all i < n. If d f H and d f are ~ nonzero, then equations (29) are true and from the last line of (29) we get D i= 0, hence the statement follows from the first and second line of (29). If d f H and 13fK are both zero, then the statement follows from case (4) of Lemma 24.18. Now we may follow the lines of the proof of (4). Finally, the proof of (6) is analogous to the proof of (5). 24.20. Theorem.
Rn
Suppose that the functions f l , f 2 , g l ,g2, h l , h2 :
+ C satisfy the functional equation
the functions f and f 2 are measurable and none of them is almost everywhere zero. Then there exist constants a , 6 , e, E E C,linear forms L, L1, L2 : Rn + C and a quadratic form Q : IWn + C, moreover lattice constants w l , w2, such that
f~(~)=ex~(a+~~(x)+Q(x))a(~(x)+ if e X , wE~ R , w~and ~) fz(y) = ~ x P ( ~ + L ~ ( Y ) + Q ( Y ) ) ~ ( L ( Yif) +Y ~€ ,R~n I, , ~ ~ )
+ Ll (x) + Q ( x ) ) a( ~ ( x+) el w l , m ) f2(y) = exp(6 + L ~ ( Y4-) &(Y)) if Y E Rn,
f (x) = exp (a
if x E Rn
+ L1 (x) + Q(x)) if x E Rn and Y E,w1,m) ) f2(y) = exp(6 + Lz(y) + Q ( Y ) ) ~ ( L (+
if Y E Rn.
f
(2) =
exp (a
and
§ 24. Characterization of Weierstrass's sigma function
331
The functions gl, g2, h l , h2 can be obtained &om the representation of f l , f 2 using 24.12 and 24.1 1.
Proof. Let us first suppose that there exists a point xo E Rn with f l (xo) # 0 and an index 1 5 i 5 n such that with the notation H ( x ) = In (f 1(x xo)/ f (xo)) the derivative 8: H is not identically zero on any neighborhood of the origin, or that there exists a point yo E Rn with f2(yo)# 0 and an index 1 5 i 5 n such that with the notation K ( y ) = ln(f 2 (y + y o ) /f 2 (yo)) the derivative @K is not identically zero on any neighborhood of the origin. We will treat only the first case since the second case can be treated in exactly the same way. Without loss of generality we may also suppose that i = 1. Thus there exists an xo E Rn such that d:H is not identically zero on any neighborhood of the origin. (Note that H depends on xo.) We will prove that xo, yo E Rn and 6 > 0 may be chosen so that
+
(a) d : is~ not identically zero on any neighborhood of the origin, (b) the equations (6)-(9) of 24.16 are also satisfied for all 1x1, Iyl
< 6, and
(c) moreover for all 1 5 j 5 n the following conditions are also satisfied: (1) d , 3 ~ ( xis ) either zero for all 1x1 < 6 or different from zero for all 1x1 < 6;
(2) d,3K (y) is either zero for all 1 y 1
< 6 or different from zero for all 1 y 1 < 6;
qH
# 0 or d,3K # 0, (or both) everywhere in a 6-neighborhood of the origin, then 87H (x) - d:K (y) is also different from zero for all 1x1 < 6, lyl < 6. To prove this statement let us start with an E > 0 and with points xl and yl with f l ( x l ) # 0 and f2(y1) # 0 for which the functions HI (x) = ln(f~(x + x ~ ) / f l ( x l ) and ) K l (y) = 1n(f2(y y l ) / f ~ ( y l ) )are defined on the open ball centered at the origin and having radius E , moreover that 8 ; ~ is not identically zero on any neighborhood of the origin. Decreasing E if necessary we may suppose that for all xo, yo satisfying 1x0-xl I < E , ]yo-yl 1 < E, fl(x0) # 0, and fz(y0) # 0 the functions H ( x ) = l n ( f l ( x + x o ) / f l ( x o ) )and K (y) = In(f2(y yo)/f2 (yo)) (also depending upon xo and yo, respectively) are defined on the open ball having radius 6 = E - max{/xo - xl 1, lyo - yl 1 ) and centered at the origin. Let us investigate the connection between H1 and H and between K1 and K , respectively. If E is small enough, then (3) if
+
+
~
Chapter VII. Applications
332
This means that the derivatives of H around the origin behave in the same way as the derivatives of H1 around xo - X I . Similarly we obtain that
i. e., that the derivatives of K around the origin behave in the same way as the derivatives of K 1 around yo - y1. It is now clear that decreasing E if necessary we may suppose that for any IxoI, / y o [< E the functions H and K satisfy the system of partial differential equations (6)-(9) of 24.16 on the open ball around the origin having radius S and that @ H is not identically zero on any neighborhood of the origin. Let us consider the open set
< <
This is a Baire space. For each 2 j n the set X j of all those points x E X which either have a neighborhood in X on which 8;H1 is nowhere zero or have a neighborhood in X on which d,3H1 is identically zero, is a dense open set in X (note that X j is the boundary of { x E X : d,3Hl(x) = 0)). Hence the set n7=,Xj is a dense open subset of X . Let x2 be an arbitrary point of this set. There is a neighborhood of X I x2 such that for all points xo from this neighborhood (1) is satisfied. We obtain the point y2 similarly: Let us consider the open set Y = { y E RP : ly/ < E ) . This is a Baire space. For each 1 5 j 5 n the set Yj of all those points y E Y which either have a neighborhood in Y such that d,3K1 is nowhere zero on this neighborhood or have a neighborhood in Y such that q K 1 is identically zero on this neighborhood, is a dense open set in Y. Hence the open set njn,,Yj is dense in Y . Let y2 be an arbitrary point of this set. There exists a neighborhood of yl y2 such that for all points yo from this neighborhood ( 2 ) is satisfied. Finally, we have to prove that xo and yo may be chosen so that ( 3 ) is also satisfied. This easily follows by induction. Let xo = X I x2 and yo = y1 92. If ( 3 ) is already satisfied for 1 j < k, ) 0 , then let us replace x2 by x2 tek, where and if d l Hl ( x 2 )= ~ : H ( o # ek is the k-th vector of the standard base and t is small enough so that the condition (c) remains valid, and if 8:H1(x2) = d i ~ ( 0=) 0, then, using that 6': K 1 ( y 2 )= 8: K ( 0 ) # 0 , let us replace y2 by y2 tek, where t is small enough so that the condition (c) remains valid. If with these new values x2, y2 we reconsider the points xo = xl x2 and yo = yl y2, then ( 3 ) can be satisfied for 1 5 j k. Now let us apply Lemma 24.19. We obtain that there exist constants a', 6', c, El el, e"' E C, linear maps L, L;, Ll, : Rn -+ C and a quadratic map
+
+
+
<
+
+
+
+ +
<
§ 24. Characterization of Weierstrass's sigma function
Q : Rn -+ have
333
C such that for a sufficiently small neighborhood of the origin we
H ( x ) = a'
+ L', ( x )+ Q ( x )+ l n a ( c ( L ( x )+ el, w l , w z ) )
and
H ( x ) = a ' + L ' , ( x ) + Q ( x ) + l n ( c a ( L ( x ) + e ' , w l , o o ) ) and
+ L', ( x )+ Q ( x ) and K ( y ) = 6' + Ll, ( y ) + Q ( y )+ l n ( ~ a ( L ( + y )e"', w l , m)) This means that the mappings x f l ( x + x o ) and y H ( x ) = a'
+
I-+ H fi(y yo) have the stated representations on a small neighborhood of the origin. But from 24.11 we know that these functions on a small neighborhood of the origin satisfy a functional equation having type ( 1 ) with everywhere analytic, and on every neighborhood of the origin linearly independent, functions & ,& , k l , kz on the right-hand side. Hence Lemma 24.13 can be applied and f l and f z have the representation stated in the theorem everywhere. Only missing is the investigation of the case that for each xo E Rn, for which f l ( x o ) # 0 each partial derivative 8 : of~ the function H ( x ) = In ( f 1 ( x $ 0 ) / f 1 ( x o ) )is identically zero on some neighborhood of the origin, and for each yo E Rn, for which f z ( y o ) # 0 each partial derivative @ K of the function K ( y ) = ln(fz( y y o ) / f 2 ( y o ) ) is identically zero on some neighborhood of the origin. We will prove that in this case the statement of the theorem is satisfied with L = 0, i. e., we will prove that there exist complex constants a, 6 , linear mappings L 1 ,L2 : Rn + C,and a quadratic P -+ C such that map Q : l
+
+
f l ( x ) = e x p ( a + L l ( x ) + Q ( x ) ) if x € R n and (4)
~ z ( Y= ) exp(6
+ L2(y) + Q ( Y ) )
if Y E Rn.
Here also we first prove that the above representation is satisfied locally. By the statement ( 4 ) of Lemma 24.18 for any xo and yo for which f l ( x o )# 0 and f z ( y o ) # 0, we have
334
Chapter VII. Applications
for appropriate complex constants a\â\ linear functions I>i, I/2 : M71 —> C and a quadratic function Q : Rn —>• C. Now we use /i(a;+a;o) = /i(^o) exp(H(x)) and / 2 (y + yo) = /2(yo) ex p(^(y))? introduce new variables and on a neighborhood of #o and yo arrive at fi(x) = exp(a + Zri(rc) + Q(^))
and
with appropriate complex constants a, a and appropriate linear maps Iq, £2 • Mn —> C. Let us fix a pair #o>yo a n d let us consider the maximal radius 0 < 5 < 00 for which j \ is nowhere zéro on the open bail centered at xo and having radius S. Since /1 is analytic on this bail, the above représentation is satisfied everywhere on this bail with a, Li, and Q belonging to the pair XQ, yo. If we assume that S < 00, then the mapping x »-> a + L\(x) + Q(x) would be bounded on the bail and the function fi could not be continuous because |/i| would hâve a positive lower bound. This means that S = 00, i. e., that the above représentation is satisfied on the whole W1 with a, L\ and Q belonging to the pair xo-,yo- Similarly, the function / 2 also can be represented in form (4) with a, L2, and Q belonging to xo,yo-
REFERENCES [ I ] JBnos AczC1, Vorlesungen iiber Funktionalgleichungen und ihre Anwendungen. Birkhauser, 1961. [2]JBnos AczC1, Ein Blick a u f Funktionalgleichungen und ihre Anwendungen. V E B Deutscher Verlag der Wissenschaften, 1962. [3]JBnos AczB1, Lectures on functional equations and their applications. Academic Press, 1966. [4]JBnos AczC1, O n strict monotonicity o f continuous solutions o f certain types o f functional equations. Canad. Math. Bull. 9(2) (1966), 229-232. [5] JBnos AczCl, O n Applications and Theory o f Functional Equations. Academic Press, 1969. [6] JBnos AczC1, Notes on generalized information functions. (1981), 97-107.
Aequationes Math. 22
[7] JBnos AczCl, Some good and bad characters I have known and where t h e y led. (Harmonic analysis and functional equations.) In: Canadian Mathematical Society Conference Proceedings, A M S , Vol. 1 (1981), 177-187. [8]JBnos AczC1, Some unsolved problems in the theory o f functional equations, II. Aequationes Math. 26 (1984), 255-260. [9] JBnos AczCl, O n history, applications and theory o f functional equations. In: J . AczCl ( E d . ) : Functional Equations: History, Applications and Theory, D. Reidel (1984))312. [ l o ] JBnos AczC1, A short course on functional equations. D . Reidel, 1987.
[11] JBnos Acz61, Characterizing information measures: Approaching the end o f an era. In: Lectures Notes in Computer Science 286, Springer (1987))359-383. [12] JBnos AczCl, Remarks and problems 7. In Report o f Meeting; T h e Twenty-fifth International Symposium on Functional Equations. Aequationes Math. 35 (1988)) 116-117. [13]JBnos Acz61, Remark 15. In Report o f Meeting; T h e Twenty-sixth International Symposium on Functional Equations. Aequationes Math. 37 (1989), 107. [14] JBnos AczC1, T h e state o f the second part o f Hilbert's fifth problem. Bull. Amer. Math. Soc. 20 (1989), 153-163. [15]JBnos AczBl(Ed.), Aggregating clones, colors, equations, iterates, numbers, and tiles. Birkhauser, 1995. (Also as Vol. 50 o f Aequationes Math.) [16]JBnos AczB1, W h a t t o do until (and when) the functional equationist arrives. Publ. Math. Debrecen, 52/3-4 (1998), 247-274.
336
References
[17]JBnos Acz61, Walter Benz, ~ b e das r harmonische Produkt und eine korrespondirende Funktionalgleichung. Abhandlungen aus d e m Math. Sem. der Univ. Hamburg. 43 (1975), 3-10. [18] JBnos Acz61, Jukang Chung, Integrable solutions o f functional equations o f a general type. Studia Sci. Math. Hungar. 1 7 (1982-84), 51-67. [19]Jdnos AczC1, ZoltBn Darbczy, O n measures o f informations and their characterizations. Academic Press, 1975. [20]JBnos AczCl, Jean Dhombres, Functional equations in several variables. Encyclopedia o f mathematics and its applications 31. Cambridge University Press, 1989. [21]JBnos AczCl, Roman Ger, Antal JBrai, Solutions o f an equation arising from utility that is both separable and additive. Proc. Amer. Math. Soc. 127 (1999), 2923-2929. [22] JBnos AczCl, Stanislaw Gol+b, Funktionalgleichungen der Theorie der Geometrischen Objekte. Paristwowe Wydawnictwo Naukowe, Warsawa, 1960. [23] Jdnos AczC1, Gyula Maksa, Zsolt PBles, Solutions t o a functional equation arising from different ways o f measuring utility. J. Math. Anal. Appl. 233 (1999), 740-748. [24] JBnos AczC1, Gyula Maksa, Zsolt PBles, Solutions t o a functional equation arising i n an axiomatization o f the utility o f binary gambles. Proc. Amer. Math. Soc. 129 (2001), 483-493. [25]JBnos AczC1, Gyula Maksa, Che Tat Ng, Zsolt Pdles, A functional equation arising from ranked and separable utility. Proc. Amer. Math. Soc. 129 (2001), 989-998. [26]Paul S. Alexandrov ( E d . ) , Die Hilbertschen Probleme. sellschaft Geest & Portig K.-G., 1971.
Akademische Verlagsge-
[27]Claudi Alsina, Problem 1. Aequationes Math. 47 (1994), 302. [28]Claudi Alsina, J.-L. Garcia-Roig, Problem 4. Aequationes Math. 35 (1988), 114. [29]J o h n A . Baker, Regularity properties o f functional equations. Aequationes Math. 6 (1971), 243-248. [30]John A . Baker, O n the functional equation f(x)g(y) = tiones Math. 11 (1974), 154-162.
nr=lh i ( a i x + biy). Aequa-
[31]John A . Baker, Functional equations, tempered distributions and Fourier transforms. Trans. Amer. Math. Soc. 315 (1989), 57-68. [32]John A . Baker, Functional equations, distributions and approximate identities. Can. J . Math. 17(4) (1990), 696-708. [33]J o h n A . Baker, Difference Operators, Distributions and Functional Equations. Periodica Math. Hungar. 23(3) (1991) 171-183. [34]John A . Baker, Distributional methods for functional equations. Aequationes Math. 62 (2001), 136-142. [35]Mario Bonk, T h e addition theorem o f WeierstraB's sigma function. Math. Ann. 298 (1994), 591-601.
References
337
[36] Mario Bonk, The Characterization of Theta Functions by Functional Equations. Abh. Math. Sem. Univ. Hamburg 65 (1995), 29-55. [37] Mario Bonk, The addition formula for theta functions. Aequationes Math. 53 (1997), 54-72. [38] Nicolas Bourbaki, Elements of mathematics. 1966.
General topology. Addison-Wesley,
[39] Claude Chevalley, Theory of Lie groups I. Princeton University Press, 1964. [40] Zolt&n Darbczy, On the measurable solutions of a functional equation. Acta Math. Sci. Hungar. 22 (1971-72), 11-14. [41] Zolt&n Darbczy, ober die stetigen Losungen der Acz6l-Benz'schen Funktionalgleichung. Abhandlungen aus dem Math. Sem. der Univ. Hamburg 50 (1980), 210-218. [42] Zolt&n Darbczy, Antal J&rai, On the measurable solutions of a functional equation arising in information theory. Acta Math. Acad. Sci. Hungar. 34 (1979), 105-116. [43] Zolt&n Darbczy, Helmut Kiesewetter, Eine Funktionalgleichung von Abel und die Grundgleichung der Information. Period. Math. Hungar. 4 (1973), 25-28. [44] Zolt&n Darbczy, Gyula Maksa, Nonnegative information functions. Collection: Analytic function methods in probability theory (Proc. Colloq. Methods of Complex Anal. in the Theory of Probab. and Statist., Kossuth L. Univ. Debrecen, Debrecen, 1977), 67-78. [45] Jean Dhombres, Some aspects of functional equations. Chulalongkorn University, Bangkok, Thailand, 1979. [46] Georg T. Diderrich, Local boundedness and the Shannon entropy. Information and Control 36 (1978), 292-308. [47] Georg T . Diderrich, Continued fractions and the fundamental equation of information. Aequationes Math. 19 (1979), 93-103. [48] Georg T. Diderrich, Boundedness on a set of positive measure and the fundamental equation of information. Publ. Math. Debrecen 33 (1986), 1-7. [49] Jean DieudonnB, Grundziige der modernen Analysis I-IX. VEB Deutscher Verlag der Wissenschaften, 1971-1988. [50] Bruce R. Ebanks, Prasanna K. Sahoo, Wolfgang Sander, Characterization of Information Measures. World Scientific Publishing Co., 1999. [51] Wolfgang Eichhorn, Functional Equations in Economics. Addison-Wesley, 1978. [52] Paul Erdos, John C. Oxtoby, Partitions of the plane into sets havingpositive measure in every no-null measurable product set. Trans. Amer. Math. Soc. 79 (1955), 91-102. [53] Herbert Federer, Geometric measure theory. Springer, 1969. [54] Istv&n Fenyo, Uber eine Losungsmethode gewisser Funktionalgleichungen. Math. Acad. Sci. Hung. 7 (1956), 383-396.
Acta
338
References
[55] Zbigniew Gajda, Christensen measurable solutions of generalized Cauchy functional equations. Aequationes Math. 31 (1986)) 147-158. [56] Hillel Gauchman, Lee A. Rubel, Sums of products of functions of x times functions of y. Lin. Alg. and its Appl. 125 (1989)) 19-63. 1571 Dan Geiger, David Heckerman, A characterization of the Dirichlet distribution through global and local parameter Independence. Ann. of Statist. 25 (1997), 1344-1369. [58] Roman Ger, Mazur's criterion for continuity of convex functionals. Bulletin of the Polish Academy of Sciences, Mathematics, 43 (1995), 263-268. [59] Attila Gilbnyi, Zsolt PBles, A regularity theorem for composite functional equations. Arch. Math. 77 (2001), 317-322. [60] Enrico Giusti, Minimal Surfaces and Functions of Bounded Variation. Birkhauser, 1984. [61] Karl-Goswin Grosse-Erdmann, Regularity properties of functional equations and inequalities. Aequationes Math. 37 (1989), 233-251. [62] Grzegorz Guzik, Continuity of measurable solutions of some functional equations. International Journal of Bifurcation and Chaos 14(7) (2003), 189551901. [63] Hiroshi Haruki, Studies on certain functional equations from the standpoint of analytic function theory. Sci. Rep. Osaka Univ. 14 (1965), 1-40. [64] Otto Haupt, Uber einen Eindeutigkeitssatz fiir gewisse Funktionalgleichungen. J Reine Angew. Math. 186 (1944), 58-64. [65] Attila HBzy, Fiigpe'nyegyenletek sz8mit6ge'pes megoldha. Kossuth Lajos Tudombnyegyetem Matematikai 6s Informatikai Intkzete, Debrecen, 1999. [66] Edwin Hewitt, Kenneth A. Ross, Abstract harmonic analysis I. Springer, 1963. [67] Edwin Hewitt, Kenneth A. Ross, Abstract harmonic analysis II. Springer, 1970. [68] Edwin Hewitt, Kenneth A. Ross, Extension of Haar measure and ofharmonic analysis for locally compact Abelian groups. Math. Ann. 160 (1965), 171-195. 1691 Edwin Hewitt, Karl Stromberg, Real and abstract analysis. Springer, 1965 [70] David Hilbert, Gesammelte Abhandlungen Band III. Springer, 1970. [71] Einar Hille, Topics in Classical Analysis. In Vol. I11 of the book Lectures on Modern Mathematics, Ed. Thomas L. Saaty. John Wiley & Sons, 1965. [72] Einar Hille, Ralph S. Phillips, Functional analysis and semi-groups. Amer. Math. Soc. Coll. Publ., Vol. 31, 1957. [73] Lars Hormander, Linear partial differential operators. Springer, 1964. [74] Lars Hormander, The analysis of linear partial differential operators I.-IV. Springer, 1983. [75] Andrzej Hulanicki, On subsets of full outer measure in products of measure spaces. Bull. Acad. Polon. Sci. 7 (1959), 331-335.
References
339
[76] Witold Hurewicz, Henry Wallman, Dimension Theory. Princeton University Press, 1948. [77] Gerald L. Itzkowitz, Extension of Haar measure for compact connected Abelian groups. Bull. Amer. Math. Soc. 71 (1965), 152-156. [78] Gerald L. Itzkowitz, Measurability and continuity for a functional equation on a topological group. Aequationes Math. 7 (1971), 194-198. [79] Antal JBrai, Remark 17. Solution of two problems of W. Sander. Aequationes Math. 19 (1979), 286-288. [80] Antal J k a i , On measurable solutions of functional equations. Publ. Math. Debrecen 26 (1979), 17-35. [81] Antal JBrai, Regularity properties of functional equations. Aequationes Math. 25 (1982)) 52-66. [82] Antal JBrai, Invariant extension of Haar measure. Diss. Math. 233 (1984), 1-26. [83] Antal JArai, A remark to a paper of J. Acz6land J. K. Chung. Studia Sci. Math. Hungar. 19 (1984))273-274. [84] Antal JArai, Derivatives are Borel functions. Aequationes Math. 29 (1985), 24-27. [85] Antal JBrai, Remark 12. In: Proceedings of the Twenty-third International Symposium on Functional Equations. Centre for Information Theory, University of Waterloo, 1985. [86] Antal J&rai, On regular solutions of functional equations. Aequationes Math. 30 (1986), 21-54. [87] Antal J&ai, Remark to the problem 4 of C. Alsina and J.-L. Garcia-Roig. Aequationes Math. 35 (1988), 120. [88] Antal JBrai, Remark 3. (Solution of a problem of C. Alsina.) Aequationes Math. 36 (1989) 98. [89] Antal JBrai, On a problem of S. Mazur. Publ. Math. Debrecen 59,'-2 193.
(2001), 187-
[90] Antal J k a i , Remark 22. (to a theorem of J . Acze'l). Aequationes Math. 37 (1989), 111. [91] Antal J&rai, Differentiation of parametric integrals and regularity of functional equations. Grazer Math. Ber. 315 (1991), 45-50. [92] Antal JBrai, Holder continuous solutions of functional equations. Compes Rendus Math. Rep. Acad. Sci. Canada 14 (1992), 213-218. [93] Antal JBrai, On Holder continuous solutions of functional equations. Publ. Math. Debrecen 43/3-4 (1993), 359-365. [94] Antal JBrai, On Lipschitz property of solutions of functional equations. Aequationes Math. 47 (1994), 69-78.
340
References
[95]Antal JBrai, O n the analytic solutions o f functional equations. Annales Univ. Sci. Budapest, Sect. C o m p . 14 (1994), 71-77. [96]Antal JBrai, O n continuous solutions o f functional equations. Publ. Math. Debrecen, 44/1-2 (1994), 115-122. [97]Antal JBrai, FiiggvCnyegyenletek regularithi tulajdonstigai. HabilitBci6s CrtekezCs. (Habilitation thesis.) Debrecen, 1994. 132 pages. [98]Antal JBrai, FiiggvCnyegyenletek regularithi tulajdonstigai. HabilitBci6s CrtekezCs tCzisei. ( S u m m a r y o f t h e habilitation thesis.) Debrecen, 1994. 34 pages. [99]Antal JArai, A Steinhaustype theorem. Publ. Math. Debrecen 47 (1995), 1-13. [loo]Antal J k a i , Remark 30. (Solution o f a problem o f K. Lajkd.) Aequationes Math. 49 (1995)) 196. [ l o l l Antal JBrai, Regularity properties o f functional equations. Leaflets in Mathematics.
Janus Pannonius University, PCcs, 1996, 1-77. [I021 Antal JBrai, Remark 23. (To the talk o f R . Badora.) Aequationes Math. 51 ( l 9 9 6 ) , 178. [I031 Antal JArai, A generalization o f a theorem o f Piccard. Publ. Math. Debrecen 52 (3-4)) 1998, 497-506. [I041 Antal JArai, Regularity Property o f the Functional Equation o f the Dirichlet Distribution. Aequationes Math. 56 (1998))37-46. [105]Antal JBrai, Measurable solutions o f functional equations satisfied almost everywhere. Math. Pannonica 10/1 (1999), 103-110. [106]Antal JBrai, T obbviiltozds figgvCnyegyenle tek regularithi t ulajdonstigai. AkadCmiai doktori disszert8cid. (D.Sc, thesis.) Budapest, Hungary, 1999, 250 pages. [107]Antal JBrai, TobbvBtozds fuggvCnyegyenletek regularit h i tulajdonsiigai. AkadCmiai doktori disszertticid tkzisei. (D.Sc. thesis summary.) Budapest, Hungary, 1999, 46 pages. [I081 Antal JBrai, Solutions o f functional equations having bounded variation. Aequationes Math. 61 (2001), 205-211. [I091 Antal JBrai, Regularity Property o f the Functional Equations on Manifolds. Aequationes Math. 64 (2002), 248-262. [I101 Antal J k a i , Measurability implies continuity for solutions o f functional equations even with few variables. Aequationes Math. 65 (2003), 236-266. [ I l l ] Antal JBrai, Baire property implies continuity for solutions o f functional equations - even with few variables. Acta Sci. Math. Szeged 66 (2000), 579-601. [112]Antal JBrai, Continuity implies differentiability for solutions o f functional equations - even with few variables. Acta Sci. Math. Szeged 67 (2001), 719-734.
References
341
[I131 Antal JBrai, ~j eredme'nyek a fiiggve'nyegyenletek regu1arit;iselmdlete'ben. (New results in the regularity theory of functional equations.) Kozgyulksi eMaddsok, 2000 May, Hungarian Academy of Sciences. [I141 Antal Jdrai, Gyula Maksa, Remark 19 Solution of the problem of C. Alsina. Proceedings of the Twenty-third International Symposium on Functional Equations. Centre for Information Theory, University of Waterloo, 1985. [I151 Antal J h a i , Gyula Maksa, Remark 2. (Solution of a problem of C. Alsinaand J.L. Garcia-Roig.) Aequationes Math. 47 (1994), 302. [I161 Antal Jdrai, Gyula Maksa, Regular solutions of a functional equation. C. R. Math. Rep. Acad. Sci. Canada, 17(1) (1995), 7-10. [I171 Antal JBrai, Zsolt Pdles, Remark 10. Solution of a problem of T. M. K. Davison. Aequationes Math. 53 (1997)) 190. [118] Antal Jdrai, Wolfgang Sander, A Regularity Theorem in Information Theory. Publ. Math. Debrecen 50 (1997), 339-357. [I191 Antal J k a i , Wolfgang Sander, On the characterization of Weierstrass's sigma function. In: Zoltdn Darbczy, Zsolt PAles (Eds.): Functional Equationes - Results and Advances. Kluwer, 2002, 29-79. [120] Antal JBrai, Lhzlb Szkkelyhidi, Regularization and General Methods in the Theory of Functional Equations. Survey paper. Aequationes Math. 52 (1996), 10-29. [121] Shizuo Kakutani, John C. Oxtoby, Construction of a non-separable invariant extension of the Lebesgue measure space. Ann. of Math. (N. S.) 52/2 (1950))580-590. [122] Shizuo Kakutani, Kunihiko Kodaira, A non-separable translation invariant extension of the Lebesgue measure space. Ann. of Math. (N. S.) 52/2 (1950), 574-579. 11231 Johannes H. B. Kemperman, A general functional equation. Trans. Amer. Math. SOC.86 (1957), 28-56. [I241 Zygfryd Kominek, Some generalization of the theorem of S. Piccard. Prace Naukowe Uniwersytetu las ski ego w Katowicach 37 (1973))31-33. 11251 Tam& Krausz, Steinhaus-tip us^ tdtelek. Ditikktjri dolgozat. KLTE Mat. Int., Debrecen, 1980. [I261 Marek Kuczma, Functional equations in a single variable. PWN - Polish Scientific Publisher, 1968. [I271 Marek Kuczma, An Introduction to the Theory of Functional Equations and Inequalities. Palistwowe Wydawnictwo Naukowe, Uniwersytet slqki, 1985. [I281 Marek Kuczma, Bogdan Choczewski, Roman Ger, Iterative Functional Equation. Encyclopedia of mathematics and its applications 32. Cambridge University Press, 1990. 11291 Marcin E. Kuczma, A generalization of Steinhaus' theorem to coordinatewise measure preserving binary transformations. Colloq. Math. 36 (1976), 241-248.
342
References
[I301 Marcin E. Kuczma, Differentation of implicit functions and Steinhaus' theorem in topological measure spaces. Colloq. Math. 39 (1978), 95-107. [I311 Marcin E. Kuczma, Marek Kuczma, An elementary proof and an extension of a theorem of Steinhaus. Glas. Mat. 6(26) (1971), 11-18. [I321 Casimir Kuratowski, Topology. Academic Press and Panstwowe Wydawwnictwo Naukowe, 1968. [I331 Svetozar Kurepa, A cosine functional equation in Hilbert space. Canad. J. Math. 12 (1957), 45-50. [I341 Iwo Labuda, Richard Daniel Mauldin, Problem 24 of "The Scottish Book" concerning additive functionals. Coll. Math. 48 (1984), 89-91. [135] Mikl6s Laczkovich, Val6s fiiggvbnytan. Egyetemi jegyzet. Eotvos Lorbnd University, 1995. [I361 Kbroly Lajk6, On the general solution of rectangle-type functional equations. Publ. Math. Debrecen 28 (1981), 137-143. [I371 KAroly Lajk6, Remarks to a functional equation arising in the spectral theory of random fields. (Talk on the 32. ISFE.) Aequationes Math. 49 (1995))p. 175. [I381 Bohdan Lawruk, Halina ~ w i a t a k On , Functions Satisfying a Generalized Mean Value Equation. Aequationes Math. 11 (1974), 1-10. [I391 William F.Lindberg, Andrzej Szymanski, Projectively solid sets and n-dimensional Piccard's theorem. Scrdica Math. I. 24 (1998), 37-48. [I401 Zbigniew Lipecki, On continuity of group homomorphisms. Coll. Math. 48 (1984), 93-94. [I411 Lhzl6 Losonczi, On a functional equation of sum form. Publ. Math. Debrecen 26 (1989), 167-177. [I421 Lhzl6 Losonczi, Measurable solutions of functional equations of sum form. Acta Math. Hung. 61 (1993), 165-182. [I431 Lhzl6 Losonczi, A structure theorem for sum form functional equations. Aequationes Math. 53 (1997)) 141-154.
+
[I441 Anders Lundberg, On the functional equation f ( ~ ( x ) g(y)) = p(x) Aequationes Math. 16 (1977), 21-30.
+ h(x + y).
[I451 Anders Lundberg, Generalized distributivity for real, continuous functions. I: Structure theorems and surjective solutions. Aequationes Math. 24 (1982)) 74-96. [I461 Anders Lundberg, Generalized distributivity for real, continuous functions. II: Local solutions in the continuous case. Aequationes Math. 28 (1985), 236-251. [I471 Anders Lundberg, Scto's method applied to a problem of Fiire and Mitchell. Aequationes Math. 43 (1992), 127-144. [I481 Anders Lundberg, Canonical forms of irreducible polynomials of second degree in each of two complex variables. Aequationes Math. 57 (1999), 241-253.
References
343
[I491 Anders Lundberg, A rational Snto equation. Aequationes Math. 57 (1999), 254-277. [I501 Anders Lundberg, Sequential derivatives and their application t o a SCto equation. Aequationes Math. 62 (2001), 48-59. [I511 Gyula Maksa, T h e general solution o f a functional equation related t o the mixed theory o f information. Aequationes Math. 22 (1981), 90-96. [I521 Gyula Maksa, Bounded symmetric information functions. C . R. Math. Rep. Acad. Sci. Canada Vol. I1 (1980),No. 5 , 247-252. [I531 Gyula Maksa, A . A . J . Marley, Zsolt PBles, O n a functional equation arising from joint-receipt utility models. Aequationes Math. 59 (2000), 273-286. [154]Janusz Matkowski, Continuous solutions o f a functional equation. Publ. Math. Debrecen, 52/3-4 (1998), 559-562. [I551 Janusz Matkowski, Invariant and complementary invariant quasiarithmetic means. Aequationes Math. 57 (1999), 87-107. [I561 Janusz Matkowski, Tadeusz ~ w i ~ t k o w s kAi ,refined Steinhaus theorem and subadditive functions. Manuscript. [157]Richard Daniel Mauldin ( E d . ) , T h e Scottish Book. Birkhauser, 1981. [I581 Michael A . McKiernan, Boundedness on a Set o f Positive Measure and the Mean Value Property Characterizes Polynomials a n a Space V n , Aequationes Math. 4 (1970), 31-36. [159] Michael A . McKiernan, Difference and mean value type functional equations. Centro Internationale Matematico Estivo. Corso tenuto a La Mendola (Trento) dal 20 a1 28 agosto 1970. [I601 Frank Morgan, Geometric measure theory. A beginner's guide. Academic Press, 1988 [I611 John C . Oxtoby, Mass und Kategorie. Springer, 1971. [I621 Luigi Paganoni, Sulla equivalenza fra misurabilita e continuitti per le soluzioni di una classe di equazioni fiinzionali. Riv. Mat. Univ. Parma (3)3 (1974), 175-188. [I631 Zsolt PBles, O n reduction o f linear two variable functional equations t o differential equations without substitutions. Aequationes Math. 43 (1992), 236-247. [I641 Zsolt P&les, ~ j a b bm6dszerek a fiiggve'nyegyenletek e's fiiggvCnyegyenliitlense'gekelm41ete'ben. (New methods i n the theory o f functional equations and functional inequalities.) D.Sc. Thesis, in Hungarian, Institute o f Mathematics and Computer Science, Kossuth Lajos University, Debrecen, Hungary, 1999. [I651 Zsolt P&les, Problems i n the regularity theory o f functional equations. Survey talk. Aequationes Math. 63 (2002), 1-17. [I661 Zsolt PBles, ~j m6dszerek a fiiggvhyegyenletek regularitsiselme'lete'ben. (New methods i n the regularity theory o f functional equations.) Kozgyul6si eloadhok, 2000 May, Hungarian Academie o f Sciences.
344
References
[167] Sophie Piccard, Sur les ensembles de distances des ensembles de points d'un espace euclidien. MCm. Univ. Neuchiitel, Vol. 13, SecrCtariat de 1'UniversitC NeuchBtel, 1939, 212 pages. [I681 D.Raikoff, On the addition of point-sets in the sense of Schnirelman (in Russian) Recueil Mathkmatique T 5(47), N. 2 (1939), 425-440. [I691 Themistocles M. Rassias, Jaromir S i m ~ aFinite , Sums Decompositions in Mathematical Analysis. John Wiley & Sons, Chichester, 1995. 11701 Jiirg Ratz, Remark 2. (Solution of a problem of C. Alsina.) Aequationes Math. 36 (1989) 98. [I711 Richard Rochberg, Lee A. Rubel, A functional equation. Indiana Univ. Math. J. 41 (1992), 363-376. [I721 Walter Rudin, Principles of mathematical analysis. Second edition. McGraw-Hill, 1964. [I731 Thomas L. Saaty, Modern Nonlinear Equations. Dover, 1981. [I741 Maciej Sablik, The continuous solutions of a functional equation of Abel, (Contribution to HilbertJs Fifth Problem). Aequationes Math. 39 (1990), 19-39. [I751 Maciej Sablik, A functional equation of Abel revisited. Hamburg 64 (1994), 203-210.
Abh. Math. Sem. Univ.
11761 Maciej Sablik, Final part of the answer to a Hilbert's question. In: ZoltAn Dar6czy, Zsolt PAles (Eds.): Functional Equationes - Results and Advances. Kluwer, 2002, 231-242. [I771 Stanislaw Saks, Antoni Zygmund, Analytic Functions. Warszawa-Wroclaw 1952. [178] Wolfgang Sander, Verallgemeinerungen eines Satzes von H. Steinhaus. Manuscripta Math. 18 (1976), 25-42. Wolfgang Sander, Errata hierzu. Manuscripta Math. 20 (1977), 101-103. [I791 Wolfgang Sander, Regularitatseigenschaften von Funktionalgleichungen. Glas. Math. Ser. I11 13(33) (1978), 237-247. [I801 Wolfgang Sander, Verallgemeinerte Cauchy-Functionalgleichungen. Aequationes Math. 18 (1978), 357-369. [IN]Wolfgang Sander, A generalization of a theorem of S. Piccard. Proc. Amer. Math. Soc. 73 (1979), 281-282. [I821 Wolfgang Sander, Problem P179. Aequationes Math. 19 (1979), 287. [I831 Wolfgang Sander, Ein Beitrag zur Baire-Kategorie-Theorie. Manuscripta Math. 34 (1981), 71-83. [I841 Wolfgang Sander, The fundamental equation of information and its generalizations. Aequationes Math. 33 (1987), 150-182. [I851 Wolfgang Sander, Problem 19. Aequationes Math. 46 (1993), 294.
References
345
[186]Wolfgang Sander, O n a generalized fundamental equation o f information. Results in Math. 26 (1994), 372-381. 11871 Laurent Schwartz, Radon measures on arbitrary topological spaces and cylindrical measures. Oxford University Press, 1973. [I881 Jaroslav Smital, O n functions and functional equations. A d a m Hilger, 1988. [I891 Hugo Steinhaus, Sur les distances des points dans les ensembles de mesure positive. Fund. Math. 1 (1920), 93-104. [I901 Carl Sundberg, Carl Wagner, A functional equation arising i n multiagent statistical decision theory. Aequationes Math. 32 (1987), 32-37. [I911 LBsz16 Szkkelyhidi, O n a class o f linear functional equations. Publ. Math. Debrecen 29 (1982), 19-28. 11921 LBszl6 Szkkelyhidi, O n a problem o f S. Mazur. Trans. Amer. Math. Soc. 316 (1989), 161-164. [I931 LBsz16 Szkkelyhidi, Convolution T y p e Functional Equations on Topological Abelian Groups. World Scientific, 1991. [194]LBsz16 Szkkelyhidi, Functional equations via spectral synthesis. Leaflets i n mathematics. Janus Pannonius University, Pkcs, 1995. [195]Halina ~ w i a t a k O , n the regularity o f the locally integrable solutions o f the functional equations a i ( x ,t )f ( x + p i ( t ) ) = 0. Publ. Math. Debrecen 15 (1968), 49-55.
xi
11961 Halina ~ w i a t a k O , n the regularity o f the distributional and continuous solutions o f the functional equations ~ f = = a ;,( x ,t )f ( x (oi(t))= b ( z ,t ) . Aequationes Math. 1 (1968), 6-19.
+
[I971 Halina ~ w i a t a k O , n the regularity o f the locally integrable solutions o f the functional equations a i ( x , t )f ( x p i ( t ) ) = b ( x ,t ) . Aequationes Math. 4 (1970), 291-296.
c:=~
+
, for the regularity o f continuous and locally integrable solu[I981 Halina ~ w i a t a k Criteria tions o f a class o f linear functional equations. Aequationes Math. 6 (1971), 170-187. , n certain regularity problems for solutions o f functional equations. [I991 Halina ~ w i a t a k O Bull. Acad. Pol. Sci., Ser. Math., Astr. et Phys. X I X (3) (1971), 209-212. , problems for the solutions o f functional differential equa[200]Halina ~ w i a t a kRegularity tions. Report o f meetings, Aequationes Math. 8 (1972), 146-147. [201]Halina ~ w i a t a k T, h e regularity o f the locally integrable and continuous solutions o f nonlinear functional equations. Trans. Amer. Math. Soc. 221(1) (1976), 97-118. , and regularity problems for nonlinear functional equations. 12021 Halina ~ w i a t a kExistence Acta Math. Acad. Sci. Hungar. 30 (1-2) (1977), 21-31. [203]Rolf Trautner, A covering principle i n real analysis. Quart. J . Math. Oxford 38(2) (1987), 127-130
346
References
[204] Fikret Vajzovit, On the solution of one functional equation. Mat. Vesnik 5 (1968), 25-28. [205] Endre Vincze, Eine allgemeiner Methode in der Theory der Funktionalgleichungen, I., II., I I . , I V . Publ. Math. Debrecen 9-10 (1962-63), 149-163, 314-323; 191-202, 283-318. [206] And& Weil, L'intkgration dans les groupes topologiques et ses applications. Paris, 1951. [207] Marek Cezary Zdun, Continuous and differentiable iteration semigroups. Naukowe Uniwersytetu ~ l a s k i e ~woKatowicach 308 (1979).
Prace
12081 Marek Cezary Zdun, On continuity of iteration semigroups on metric spaces. Comment. Math. 29 (1989), 113-116. 12091 Eberhard Zeidler, Nonlinear functional analysis I.-IV. Springer, 1990.
INDEX Page number of the most important reference (usually the definition) is given in italic.
JZk
191, 192, 213 Abel, Niels Henrik 7, 8, 9, 10, 22, 246, 285, 286, 337 Abel's functional equation 24 Aczkl, JBnos -5, 2, 4, 5, 9, 10, 11, 13, 14, 15, 17, 23, 24, 25, 26, 28, 101, 222, 231, 232, 243, 244, 245, 246, 248, 253, 267, 285, 335, 336, 337, 339 Aczkl-Benz equation 1 5 addition theorem 285 additional argument 127 additional compactness condition 35, 37, 110 additional condition 169 additive 81, 86, 178, 267 additive function 11, 100, 270, 271 additive function with graph invariant under some rotations 270 additive functional 21 Aequationes Mathematicae 9, 10 D' Alembert 's functional equation 234, 237 D' Alembert-type equation 11 N1 198, 216 Alexandrov, Paul S. 8, 336 algebraic properties 174 algorithm 11, 153 algorithmic method 15 almost all 100, 101, 104, 105, 175, 176, 268 almost contains 284
almost equal 101, 284 almost everywhere 100, 101, 102, 104, 140, 174, 268, 284, 285, 288, 290, 292, 299, 330 almost everywhere differentiability implies continuous differentiability 137 almost everywhere differentiable 16, 25, 35, 137, 140, 141, 166, 234 almost invariant set 197 almost separable valued 233 almost solution 100, 102 la1 4 1 a-near 41, 42 Alsina, Claudi 260, 270, 336, 339, 341, 344 analytic 2, 4, 5, 8, 10, 16, 17, 20, 22, 24, 27, 28, 33, 34, 36, 37, 144, 145, 151, 152, 153, 155, 174, 286, 288, 300 analytic coefficient 152 analytic manifold 37, 50 analytic solution 286, 287 analyticity 5, 9, 22, 144, 149, 286 Andrade, J. 17 application 34, 231 axiom of choice 78, 271
Index -
Baire category 22, 46, 102 Baire property 2, 9, 16, 17, 22, 27, 35, 36, 37, 38, 39, 47, 48, 49, 68, 94, 95, 99, 105, 106, 107, 204, 205, 206, 209, 210, 213, 231, 232, 233, 240, 241, 243, 244, 245, 246, 248, 253, 257, 265 Baire property implies boundedness 66 Baire property implies continuity 22, 39, 66, 98, 159, 170, 204 Baire property of function 4 7 Baire set 66, 67, 68, 69, 70, 71, 231, 232 Baire space 46 Baire's theorem 46 Baker, John A. 13, 20, 86, 153, 237, 287, 336 Banach algebra 233 Banach manifold 50, 166 Banach space 3, 56, 137, 139, 141, 220, 221, 222, 233 base of measure 43 basic solution 299 behavioral sciences 9 Benz, Walter 336, 337 Berstein-Doetsch theorem 25 binary gamble 267 binary operation 54 Bolzano's theorem 154 Bonk, Mario 22, 285, 286, 287, 288, 291, 336, 337 Bonk's regularity theorem 24 bootstrap 27, 175, 204, 219 Borel function 20, 29, 30, 47, 48, 57, 137 Borel regular measure 44, 82 Borel set 44, 47, 55, 137, 140, 181, 213, 216
Borel subset 82, 101 bound 111 bounded 2, 23, 24, 56, 73, 74, 75, 76, 79, 111, 117, 121, 129, 267 bounded essential variation 50, 132 bounded from above 23, 24 bounded linear operator 3, 233 bounded measurable solutions are continuous 76 bounded variation 49, 50, 132, 137 Bourbaki, Nicolas 41, 46, 68, 103, 235, 337 Bruijn, Nicolas Govert de 100 brute force 253
40 40 Cw 27, 28, 38, 39, 128, 144, 149, 170, 171, 172, 173, 174, 231, 232, 240, 241, 243, 245, 246, 248, 253, 257, 258, 265, 266, 277, 278, 282, 284, 285, 287, 290 Cm implies analyticity 144 Cw manifold 37, 38, 50 Cw solution 287 Cm 221, 227 C 235, 236 C" 28, 33 C1 33, 177, 178, 184, 185, 187, 188, 190, 198, 205, 207, 209, 210, 211, 212, 216, 220, 221 C1 implies Cw 218, 219 CP 29, 30, 33 CP?" 28 question 28 CP~"-CQ~P CP-Cq problem 28, 37 c
Index
CP manifold 37 CP-problem 28, 37 calculus of variation 8 Campanato space 128 Carathkodory condition 42 card 40 cardinal number 40, 198, 238 cardinality 271 Cauchy, Augustin 1, 5, 231 Cauchy's equation 1, 2, 3, 4, 5, 6, 13, 14, 19, 20, 23, 24, 29, 30, 86, 231 Cauchy's exponential equation 2, 4, 5, 6, 7, 20, 24, 29, 233 Cauchy's logarithmic equation 3, 7 Cauchy's power equation 3, 7, 29, 31 Cauchy-type equation 23, 91, 110 character 76, 78 character of a topological space 41 character of measure 43 characterization 22, 100, 101, 110, 127, 284, 285 chart 50, 158 Chevalley, Claude 238, 337 X k 43, 191, 192 Choczewski, Bogdan 2, 341 Chung, Jukang 253, 336, 339 circle group 76 circulus vitiosus 5 classical method 17, 19, 22, 73 closed 41, 137 closed ball 42 closed domain 257 closure 106 coefficient 144 Cohen, Paul J. 193, 214 column 153
349
commutative group 11, 237 commutative topological group 11 commutative topological semigroup 11 compact 20, 34, 35, 37, 38, 41, 44, 45, 46, 56, 59, 75, 81, 89, 103, 124, 126, 159, 175, 178, 181, 182, 198 compact metric group 76 compact metric space 234, 235, 236 compact support 49, 153, 220, 221 compactness condition 120 complete metric space 46, 83 complete separable metric space 68, 82 completely regular 41, 56 complex 40 complex analytic 5 complex differentiability 24 complex function theory 286 complex multiplication 80 complex solution 24 composite equation 13, 15, 17, 24, 25, 27, 151 composite functional equation arising from utility that is both separable and additive 267 computer 286 condition 55 conditional Cauchy equations 9 conditional equation 2 connected 50, 85, 240, 284 connected compact infinite Abelian group 76 constant strength 39, 172 continuity 5, 9, 16, 19, 21, 22, 23, 86, 107, 137, 175, 177, 204, 219, 267, 285, 288
350
continuity implies C°° 170, 218 continuity implies C1 40,110, 218, 219 continuity implies local Lipschitz property 128 continuity modulus 4-2, 121, 124, 125, 160, 162 continuity point 137 continuous 2, 7, 8, 13, 15, 16, 17, 18, 20, 21, 23, 24, 28, 29, 30, 31, 35, 37, 45, 46, 49, 53, 54, 55, 56, 58, 59, 60, 63, 67, 68, 73, 76, 79, 80, 81, 86, 87, 89, 90, 91, 92, 93, 94, 97, 99, 100, 101, 102, 103, 105, 106, 107, 110, 111, 116, 118, 119, 120, 121, 124, 128, 136, 137, 139, 141, 153, 158, 159, 160, 165, 166, 171, 172, 173, 174, 177, 178, 180, 181, 184, 185, 188, 190, 205, 206, 208, 210, 212, 220, 221, 222, 231, 232, 233, 234, 235, 236, 237, 238, 242, 244, 257, 260, 265, 266, 286, 287, 288, 289 continuous at a point 23, 24 continuous differentiability 21, 25, 137 continuous extension 103, 105, 107 continuous linear operator 137 continuous on a neighborhood 89, 91, 95, 98 continuous partial derivative 68 continuous solutions are C°° 39 continuous solutions are in C°° 169 continuous solutions are locally Lipschitz 159
Index
continuously differentiable 16, 17, 18, 26, 28, 35, 36, 37, 38, 49, 54, 63, 64, 65, 69, 70, 71, 77, 111, 117, 118, 121, 132, 134, 135, 136, 140, 141, 142, 143, 152, 165, 166, 167, 219, 220, 221, 223, 225, 226, 227, 233, 258 continuum 40, 76, 197, 271 continuum hypothesis 78, 193, 197, 201, 202, 203, 214, 216, 217, 218, 271 convergence in measure 39, 175 convexity 25 convolution 21, 53, 54 convolution-type functional équation 9, 12 coordinate transformation 78 coordinatewise 244, 253, 257 countable 40, 46 countable base 45, 47, 48, 49, 157, 178, 179, 209 countable base of neighborhoods 93, 99, 103, 106, 158, 159, 178 countable infinité fonction 107 countable product 48 countably additive 43, 76 countably subadditive 42, 43 counterexample 24, 29, 31, 266 counting measure 177 covariant expansion 20, 76, 78 critical direction 266, 267 current 110 V 51 V 51 DXi 40 Darôczy, Zoltân -5, 9, 15, 23, 24, 27, 73, 243, 245, 248, 336, 337
Index
denominator 151 dense 13, 46, 102, 103, 104, 270 dense open 46 density 62, 64, 65, 66 density function 101, 277, 280, 282, 284 densitypoint 91, 158 derivative 40, 137, 171, 187, 188, 190 determinant 14, 152, 302, 303, 304, 305 Dhombres, Jean 4, 5, 9, 14, 336, 337 diameter 60 Diderrich, Georg T. 23, 24, 337 Dieudonnk, Jean 5, 22, 50, 64, 160, 237, 240, 296, 302, 337 diffeomorphism 51, 102, 171, 198, 216 difference equation 8 differentiability 5, 8, 9, 22, 25, 29, 170, 234, 267, 286 differentiability almost everywhere 20, 22, 137 differentiability assumption 285 differentiability implies higher differentiability 40 differentiability theorem 234 differentiable 2, 5, 6, 8, 14, 20, 26, 30, 141, 144, 257 differentiable manifold 157, 159 differential equation 1, 5, 8, 9, 10, 14, 22, 28, 144, 149, 153, 154, 277, 286, 287, 294, 302 differential form 110 differential geometry 64 differential operator 5, 9, 149, 150, 153 differentiation 2, 10 diffuse 42
351
diffuse probability Bore1 regular measure 82 diffuse probability Radon measure 82, 83, 85 dilog 245, 247 dilogarithm equation 244 dilogarithm function 24 5 dim 42, 21 7 dimension 1 165 direction 303, 304, 305, 306 directional derivative 304, 305, 313 Dirichlet distribution 275 discontinuous 76, 257 discrete 153 discrete topology 12 dist 41 distance 41, 51, 160 distance measure 258 distribution 39, 51, 153, 170, 171 distribution method 13, 101, 169, 170, 174, 219 distribution solution 39, 153, 172 distribution solutions are in CCO 172 distributional sense 50 dmn 40 domain 2, 29, 40 duplication of the cube 22, 110, 127, 260
£6
41
Ern 221 f, 41 Ebanks, Bruce 337 economics 9 Eichhorn, Wolfgang 9, 337 eigenspace 277 eigenvalue 277
352
Index
elliptic differential équation 173 elliptic differential operator 174 embedding 178, 205, 213, 221 empty intersection 41 empty sum J^3 empty union 41 équation 8, 9, 295 équation of the duplication of the cube 260 équations of hydrodynamics 109 Erdôs, Pal 54, 100, 337 essential bounded variation 22 Euclidean space 101 évolution équation 3 expanded measure 76 expansion J^3 expansion theorem 76, 77, 78, 80 explicit 16, 32, 102, 204 exponent 42, 128, 129, 132 exponential function 5 exponential polynomial 12, 81, 86 exponentiation 5 extend 102, 286 extended real-valued 42 extension 43, 105, 107
{/ = } 40
{f = y} 40 U>y} 40 /# 42 {feA} 40 Tk
221, 222, 223, 224, 225, 227
/ ' 40 TaS Ta
41, 137 41
/*, 40 Fédérer, Herbert 42, 43, 44, 45, 50, 82, 85, 110, 137, 144, 337
Fenyô, Istvân 13, 337 few variables 38, 39, 40, 170, 174, 218 fîfth problem 8 finite dimensional 50, 137 finite measure 42, 45, 73, 76 first category 46, 47, 48, 49, 67, 70, 95, 99, 105, 106, 107, 204, 205 first countable 92, 104 formai differentiation 171 Fourier transform 11, 286 Fubini's theorem 49, 54, 62, 103, 185 function space 235 functional équation 1, 2, 5, 8, 9, 10, 11, 13, 14, 15, 17, 20, 22, 23, 24, 27, 28, 30, 33, 36, 38, 39, 40, 54, 66, 86, 110, 120, 127, 137, 145, 149, 150, 153, 169, 170, 185, 205, 218, 219, 222, 227, 233, 234, 240, 241, 242, 243, 245, 246, 247, 253, 256, 257, 258, 260, 261, 267, 268, 270, 275, 276, 277, 284, 285, 286, 287, 288, 289, 290, 292, 297, 299, 330 functional équation System 15, 153 functional équation with few variables 264, 266 functional inequality 4-> 8, 73, 124, 160 functional means 27 functional-differential équation 149, 150, 151, 174, 248 fundamental équation of information 23, 24, 242, 243, 244, 245 fundamental lemma 128
Index
fundamental regularity problem of non-composite functional equations 27, 29 G 41 g5 41, 46, 137 Qsa 41 9# 153 91,92,93 294 Gajda, Zbigniew 338 Garcia-Roig, Jamie L. 260, 270, 336, 341 Gauchman, Hillel 299, 338 Gauss-Green theorem 112 Gaussian elimination 153 Geiger, Dan 275, 276, 277, 338 general explicit nonlinear functional equation 174, 218 general method 10, 11, 15, 17, 144, 149, 151, 287 general non-composite functional equation 31, 32 general nonlinear functional equation 219 general position 54, 66, 69, 186, 282 general regularity result 14, 24, 39, 169 general regularity theorem 258, 260 general regularity theory 15 general result 23 general solution 2, 12, 13, 14, 31, 287 general theorem 173, 277 general theory of functional equations 3 general transfer principle 32, 33, 287, 291
353
generalized Cauchy equation 232, 234 generalized fundamental equation of information 243, 244 generalized mean value equation 169 generalized mean value type equation 22, 173, 265, 266 generalized Steinhaus theorem 21, 280 geodetical arc 161 geometrical proof 286 geometry 9, 36 Ger, Roman 2, 25, 81, 267, 336, 338, 341 Gilänyi, Attila 26, 338 Giusti, Enrico 49, 50, 132, 135, 338 global nature 157 global result 36 global solution 300, 305 globally 305 Gödel, Kurt 193, 214 Golab, Stanislaw 9, 336 gradient 50, 111, 117, 129 Gram determinant 113 graph 270, 271 Grosse-Erdmann, Karl-Goswin 54, 66, 86, 91, 92, 94, 98, 178, 338 group 7 group axioms 15 Guzik, Grzegorz 86, 235, 236, 338
354
Haar measure 20, 46, 54, 59, 75, 76, 78, 80, 238, 239 Hamel base 14 harmonic analysis 101 Haruki, Hiroshi 286, 338 Haupt, Otto 94, 106, 107, 338 Hausdorff measure 39, 4 3 Hausdorff space 41, 44, 45, 55, 56, 59, 68, 175, 181 Hdzy, Attila 153, 338 heat equation 3 Heckerman, David 275, 276, 277, 338 Hewitt, Edwin 42, 43, 76, 80, 137, 338 higher order differentiability 19, 141, 227 Hilbert, David 7, 8, 9, 15, 285, 335, 336, 338, 344 Hilbert space dimension 43, 76, 78, 197 Hilbert's fifth problem 7, 8, 9, 10, 15 Hille, Einar 3, 9, 24, 233, 338 Holder constant 42 Holder continuity 22, 42 homeomorphism 59, 83, 85, 133, 134, 1'18, 205 homogeneous 86, 153 homomorphism 3, 11, 46, 75, 80 homotopy formula 110 Hormander, Lars 22, 51, 153, 172, 173, 174, 338 Hulanicki, Andrzej 76, 338 hull 43, 55, 58, 59, 183 Hurewicz, Witold 42, 339 hypoelliptic 39, 172, 173, 219, 265
Index
Zrn 221 223, 225, 227 immersion 177, 178, 184, 187, 190, 191, 205, 207, 210, 211, 213, 221 implicit 29, 102 Ind 42, 217 ind 42, 217 independent 100, 275 infinite cardinal number 78 infinite dimension 234 infinitely many times differentiable 16, 22, 39, 145 infinitesimal operation 2 information measures 23 information theory 243, 244, 248, 257 inner point 66, 282, 284 integer 40 integrable 49, 77 integral equation 1,9, 13 integral transform 9, 11 integration 2 interior 110, 117 internal function 25, 151 interval 53 invariant 46, 76, 198, 270 invariant expansion 20, 75, 76, 80 invariant measure 43, 77, 197 invariant metric 75 invariants 294, 295, 296 inverse function theorem 61, 114, 117, 133, 134 invertible 111 Itzkowitz, Gerald L. 76, 86, 339
x2
Index -
J covariant expansion 78, 79 J covariant measure 76, 77, 78 Jacobian 51, 77, 111, 134, 135 JArai, Antal 22, 25, 28, 54, 55, 66, 73, 76, 82, 86, 91, 94, 98, 102, 110, 120, 128, 137, 144, 204, 233, 238, 244, 248, 253, 255, 270, 277, 336, 337, 339, 340, 341 Jensen, J. L. W. V. 4 Jensen convex 4, 25 Jensen's equation 3 Jurkat, Wolfgang B. 100
xm
221
K 1 49, 223, 226, 227 Kac, Mark 17 Kakutani, Shizuo 76, 78, 197, 341 Kemperman, Johannes H. B. 53, 341 Kiesewetter, Helmut 245, 246, 337 kill 5, 9, 10, 153, 243, 248, 257 Kodaira, Kunihiko 76, 341 Kominek, Zygfryd 66, 341 Krausz, Tam& 54, 341 Kuczma, Marcin E. 54, 341, 342 Kuczma, Marek 2, 9, 54, 100, 341, 342 Kuratowski, Casimir 85, 342 Kuratowski-Ulam theorem 4 7, 49, 69, 209, 211 Kurepa, Svetozar 237, 342
L2 43, 76, 78, 197 Labuda, Iwo 81, 82, 342 Laczkovich, Mikl6s 68, 110, 342 Lagrange interpolation 154 Lajk6, KAroly 242, 287, 340, 342 An 43 A 46, 82 A0 177 Laplace transform 286 lattice 296 lattice constants 292, 294, 295, 296, 298, 299, 305, 306, 317, 318, 330 Lawruk, Bohdan 22, 170, 174, 342 Lebesgue density 43 Lebesgue integrable 245, 277 Lebesgue measurable 21, 35, 36, 38, 50, 81, 82, 86, 137, 158, 178, 181, 232, 233, 236, 244, 245, 246, 255, 258, 277, 284 Lebesgue measure 6, 20, 21, 23, 30, 43, 50, 53, 76, 77, 78, 79, 100, 101, 102, 197, 198, 220, 268, 288 Lebesgue number 126, 164 Lebesgue's theorem 137 left Haar measure 46 Levi-Civith type equation 11 lexicographic ordering 199 Lie, Marius Sophus 7 Lie algebra 240 Lie group 8, 234, 237, 238, 240 lim 104, 106 limit 103, 105, 106 Lindberg, William F. 66, 342 Lindelof space 177 linear 86, 153 linear algebra 152, 219 linear dependence 152
Index
linear differential equation 155 linear differential operator 152, 153 linear equation 40, 151, 152, 218 linear equation with linear arguments 11 linear form 299, 330 linear functional 302 linear functional equation 144, 153 linear functional-differential equation system 153 linear independence 255 linear operator 40, 77 linear partial differential equation 39, 173 linear partial differential operator 174 linear transformation 277, 281 linearly independent 12, 255, 285, 286, 288, 290, 299, 300, 302 Lipecki, Zbigniew 81, 342 Lipschitz constant 42, 236 Lipschitz function 42, 50, 119, 121, 137, 234 local boundedness 19 local continuity 23 local coordinates 158, 159, 165, 166, 167 local Lipschitz property implies continuous differentiability 137 local Lipschitz property of continuous solutions 159 local nature 28, 36, 157 local notion 42 local solution 305, 317 locality principle 176, 178, 191, 204, 205, 212, 222
locally 35, 37, 176, 204, 232, 233, 300, 305 locally arcwise connected 85 locally bounded 23, 24, 80, 242 locally bounded variation 21, 35, 50, 132, 136 locally compact 4 1, 46 locally compact Abelian group 76, 80, 86 locally compact group 46, 54, 59, 75, 76, 78, 80, 178, 197, 237, 238 locally convex 86 locally Euclidean 8 locally Holder continuous 28, 128, 129, 132 locally integrable 9, 13, 19, 73, 101, 174 locally Lebesgue integrable 244, 253, 255 locally Lipschitz 21, 22, 35, 37, 42, 50, 51, 100, 119, 120, 124, 127, 128, 132, 136, 137, 160, 166 logarithmic derivative 293 Losonczi, L&szl6 256, 257, 342 lower semicontinuous 54 Lundberg, Sven Anders Filip 13, 110, 342, 343 Lusin measurable 45, 56, 87, 89, 90, 92, 93, 94, 103, 105, 181, 183, 237, 238 Lusin-Baire property 48, 49, 94, 97, 99, 100, 106, 107, 159, 206, 207, 237 Lusin's theorem 6, 20, 45, 49, 140, 175, 204
Index
Mk
177, 178, 179, 180, 183, 187, 188, 190, 191, 192, 193, 194, 201, 203 main problem 34 Maksa, Gyula 23, 24, 25, 26, 243, 261, 336, 337, 341, 343 manifold 36, 37, 38, 50, 78, 157, 159, 165, 166, 240 Maple 153, 248, 249, 251, 252, 253, 254 Maple calculation 249, 252 Marczewski, Eduard 83 Marley, Antony A. J. 26 Matkowski, Janusz 23, 27, 55, 343 Matkowski-Süto equation 27 matrix 152, 153, 186, 297, 305 Mauldin, Richard Daniel 81, 82, 342, 343 Mazur, Stanislaw 21, 81, 86, 345 McKiernan, Michael A. 20, 86, 174, 343 mean periodic function 11 mean value equation 169, 174 measurability 5, 9, 16, 19, 20, 22, 39, 86, 140, 175, 193, 235, 258, 288 measurability condition 176, 187 measurability implies boundedness 73 measurability implies continuity 21, 22, 39, 76, 82, 86, 91, 157, 170, 174, 1W, 234, 235, 285 measurable 2, 6, 7, 14, 16, 17, 20, 23, 27, 29, 30, 31, 37, 38, 42, 43, 44, 45, 50, 53, 55, 58, 59, 63, 64, 65, 66, 73, 74, 75, 76, 77, 78, 79, 80, 82, 85, 87, 100, 101, 128, 137, 176, 177, 178, 181, 183, 187, 188, 191, 231, 232, 233, 234, 235, 240, 241,
357
242, 243, 248, 253, 257, 261, 265, 285, 288, 289, 290, 292, 299, 330 measurable almost solution 100, 101 measurable function 2, 43, 47, 107 measurable set 4^-> 82 measurable solution 86, 287 measure 20, 22, 42, 43, 44, 50, 54, 55, 75, 76, 78, 82, 220, 235 measure preserving 83, 85 measure theoretical condition 55 measure theory 42 measure zero 102, 103, 107 measures of information 9 meromorphic 293 metric group 75 metric space 42, 43, 48, 175, 179, 187, 188, 190, 191, 204, 211, 212, 235, 236 metrizable 50, 68 Minkowski, Hermann 8 modular function ^6, 78 monotonic 15, 24, 100, 137 Morgan, Frank 343 Morrey lemma 128 42 ß[Y JJL measurable 4% multiindex 41 multiplication 13 multiplicity function ^0, 77 N 40 Nf 40 natural projection 48 neighborhood 47, 50, 56, 60, 63, 64, 65, 68, 70, 117, 118, 119, 140, 141, 143, 165, 166, 167,
358
175, 176, 177, 187, 188, 190, 204, 205, 210, 212, 227 Ng, Che Tat 26, 336 noncommutative 11 noncompact 37 non-composite explicit equation 27 non-composite functional equation 10, 15, 17, 25 non-continuous 7, 24, 29, 30, 31, 80 non-differentiable 7, 8, 15, 24, 29, 30, 31 nonlinear equation System 151 nonlinear functional equation 13, 19, 36, 39, 40, 170, 204, 227 nonlinear term 39 nonlinear theory 266 nonmeasurable 54 nonnegative 24, 77, 277 nonnegative integer J^0 non-Radon measure 73 nonsingular 127 norm J^O
I I 40 II II 40 normal j^l normal coordinates 161, 162 normal distribution 100 normed algebra 3 normed Space 40, 137, 139, 141, 236 nowhere dense ^ö nowhere differentiable 234 null set 50, 158 null space 54, 64, 69, 71, 281 numerator 151
Index
Ü 292 one-sided lower density 55 open 17, 27, 28, 29, 33, 37, 38, 41, 44, 46, 49, 54, 55, 59, 60, 63, 64, 65, 66, 67, 68, 70, 71, 78, 89, 101, 103, 106, 110, 116, 118, 119, 120, 121, 124, 128, 132, 135, 137, 139, 141, 142, 150, 151, 153, 157, 159, 165, 166, 170, 171, 177, 178, 181, 183, 184, 187, 188, 190, 191, 198, 205, 206, 207, 209, 210, 211, 212, 213, 216, 220, 221, 222, 223, 224, 225, 227, 283, 284, 296 open domain 257 open interval 110 open problem 25, 28 operator 86 operator group 233 operator norm ^0 operator semigroup 3, 233, 234 ordinal 198 orthogonal projection 64, 70, 71 orthogonal subspace 64, 69, 71 outer measure 42 outer normal 112 Oxtoby, John C. 22, 39, 45, 46, 47, 54, 76, 78, 197, 337, 341, 343
Vx 82, 85 Vk 220 V 293, 296, 305 Paganoni, Luigi 86, 343 Pales, Zsolt 11, 22, 24, 25, 26, 27, 144, 149, 152, 336, 338, 341, 343
359
Index
PBles's algorithm 153 PBles's method 144, 149, 151, 248, 251, 253 PBles's regularity theorem 153 PBles's theorem 152 parameter 175, 204 parameter space 175, 204, 220 parametric family 170, 220 parametric integral 18, 19, 109, 110, 117, 118, 120, 220, 221, 222, 223, 225, 226, 227 parametric integration quintuple 220
acYf
41
af axi
41
aif
41
partialderivative 40, 91, 94, 97, 100, 105, 139, 141, 142, 166, 227 partial differential equation 8, 150, 172, 173, 287, 303, 304, 305 partial differential operator 39, 150, 171
Piccard's theorem 22, 66, 69, 204 pole 293 Polish space 68 polynomial 11, 12, 86, 151, 154 positive almost everywhere 284 positive everywhere 277, 280, 282 positive Lebesgue measure 271 power function 5 power series expansion 9 principal part 173 principle of analytic continuation 296, 302 probability distribution 100, 101 probability measure 42 probability theory 284 problem 34, 37 programming 153 projection 281 pullback 153 pure manifold 50 purely unrectifiable 192
Q
partial function
40
dm 150 partial set 40 ax,f 41 a ( ? / g l ,...,?/j,) qxil
,,.. ,xi,)
41
partition of unity 221 path 41, 81, 82, 85 Pexider's equation 232 Pexiderization 3, 234, 247 Pexider's equation 3, 31 Phillips, Ralph S. 3, 233, 338 physics 36 Piccard, Sophie 66, 340, 341, 342, 344
40 quadratic form 173, 299, 330 quotient field 152 40
R x 82, 85 Rk 177, 178, 179, 180, 183, 184, 187, 190, 191, 192, 193, 194, 202 Rn 191 lRO 177 Rademacher's theorem 137, 234 Radon measure 20, 44, 45, 46, 55, 56, 59, 73, 82, 85, 87, 89, 92, 175, 181 Raikoff, D. 55, 344 random field 242
360
Index
random variable 100, 275, 277 range 40 rank 16, 27, 28, 31, 33, 34, 38, 63, 68, 91, 94, 97, 100, 105, 117, 118, 119, 121, 124, 128, 132, 136, 140, 141, 143, 152, 153, 170, 174, 184, 186, 187, 188, 190, 204, 208, 209, 210, 212, 218, 223, 224, 225, 226, 227, 278, 281 rank theorem 64, 187, 209 Rassias, Themistocles M. 299, 344 rational 40 Ratz, Jiirg 270, 344 real 40 real function 15, 144 real variable 110, 120, 132, 137 real variable solutions of bounded variation are continuously differentiable 137 real-valued function 75, 120, 128, 152 real-valued solution 110 rectifiable 192, 201 reduction to differential equation 286, 287 reflexive symmetric relation 176 regular 41, 107 regular distribution 13 regular measure 43, 55 regular solution 2, 9, 11, 12, 14, 27 regularity 29, 38 regularity condition 2, 4, 286 regularity phenomenon 267 regularity problem 8, 10, 16, 17, 28, 32, 291 regularity problem of non-composite functional equations 36
regularity property 5, 16, 22 regularity result 10, 14, 34, 39, 153, 157, 170, 257, 264, 286 regularity theorem 9, 15, 17, 66, 173, 175, 177, 204, 222, 244, 258, 286, 287, 289 regularity theory of differential equations 22, 39, 128 relation 41, 56 representable by integration 50 restricted domain 2 Riemannian structure 50, 51, 160 Riesz' theorem 39, 175, 179 ring 152 rng 40 Rochberg, Richard 286, 287, 344 Ross, Kenneth A. 42, 43, 76, 80, 338 rotation 270, 271 Rubel, Lee A. 286, 287, 299, 338, 344 Rudin, Walter 61, 117, 133, 344
Sk 177, 178, 179, 180, 183, 191, 192, 193, 194, 202, 205, 207, 208, 211, 212, 214, 215, 216, 217, 218 S ( P ) 144 Saaty, Thomas L. 9, 338, 344 Sablik, Maciej 9, 24, 344 Sahoo, Prasanna 337 Saint-Vincent le Pkre Gregoire 260 Saks, Stanislaw 292, 295, 298, 344 Sander, Wolfgang 16, 54, 66, 73, 86, 94, 232, 233, 244, 257, 258, 285, 287, 288, 337, 339, 341, 344, 345 satisfied almost everywhere 20, 267
Index
Schrodinger's equation 3 Schwartz, Laurent 82, 86, 345 Schwartz distribution 13, 39, 219 Schwartz's impossibility theorem 13, 220 Schweitzer, Bert 270 Scottish Book 21 second category 22, 46, 47, 66, 67, 69, 71, 95, 97, 99, 100, 106, 159, 206, 216, 231, 232 second category a t a point 47 second countable 178, 205 semigroup 233 semimetric space 75 separable 50, 57, 137, 139, 141, 166, 179, 187, 188, 190, 191, 211, 212, 235, 236, 267 sequence 93, 99, 104, 106, 175, 176, 178 sequential 175 sequential derivative 110 several unknown functions 3, 31 several variables 2, 15, 128, 132, 150, 222 Sierpiliski, Waclaw 197 a-algebra 42, 44, 47, 48 u-finite 42, 55, 178, 181 sigma function 297 a-ideal 107 a 292 simple example 31, 32, 127 simpler proof 128 simplex 19, 110, 117, 275 ~ i m ~Jaromir a, 299, 344 Sincov's equation 31 single variable 2, 4, 15, 16, 39, 170 singular point 144 Srnital, Jaroslav 345
361
smooth 4, 5, 10, 11, 19, 54, 101, 109, 115, 150, 151, 277 smoothing 110, 111, 114 smoothness condition 16, 221 social sciences 9 solvability condition 54, 66 special form 35, 36, 38, 165 spectral synthesis 11 spectral theory 242 spt 44 statistical problem 275 Steinhaus, Hugo Dyonizy 21, 178, 340, 341, 342, 343, 344, 345 Steinhaus' theorem 19, 22, 24, 53, 54, 55, 66, 175 Steinhaus type theorem 21, 76 Stepanoff 's theorem 137 strictly concave 26 strictly convex 26 strictly geodesically convex 160, 161, 162 strictly monotonic 25, 26, 27, 36, 145, 155, 267 Stromberg, Karl 137, 338 strong assumption 39, 40, 174 strong rank condition 175, 204, 218 strongly continuous 233 strongly measurable 233 structure theorem 257 subadditive 4, 124, 160 submanifold 64, 198, 213, 216 submersion 34, 37, 65, 153, 158, 159, 160, 165, 166, 167, 237 subring 152 subsequence 93, 104, 175, 176, 178 subspace 46 substitution 5, 9, 10 sum form 110, 137, 256
362
Sundberg, Carl 253, 345 support 44 surface 175, 204, 220 Slito, OnasaburG 12 Slito's method 13, 250 ~ w i a t a k Halina , 22, 39, 40, 102, 169, 170, 171, 172, 173, 174, 219, 264, 266, 342, 345 ~ w i ~ t k o w s kTadeusz i, 55, 343 symmetric difference 47 symmetric divergence 258 symmetric relation 56 symmetry 5, 9, 10, 277, 280, 283 system of functional equations 2, 7, 8 Szkkelyhidi, LBsz16 9, 11, 81, 86, 237, 241, 341, 345 Szymanski, Andrzej 66, 342
Index
totally disconnected 83 transfer principle 31, 35 transfinite 198, 199 transfinite induction 199, 274 transformation law 76 transformation of integrals 62, 77 translation 77 translation equation 234 translation invariant closed linear subspace 12 Trautner, Rolf 176, 178, 345 twice continuously differentiable 17, 28, 36, 79, 110, 111, 116, 118, 119, 121, 124, 128, 132,
two real variables 144, 149 (2,2)-additivity 247
T 40, 76 T invariant 77 177, 178, 179, 180, 183, 191, 192, 193, 194, 203 tangent bundle 158, 159 tangent space 158, 159 Taylor's theorem 122, 130, 134, 136 test function 51, 171 topological dimension 42, 217 topological group 8, 41, 42 topological measure 21, 44, 54 topological ring 237 topological space 41, 43, 44, 45, 46, 47, 48, 49, 56, 59, 60, 63, 67, 68, 157, 159, 175, 176, 177, 180, 181, 183, 184, 187, 190, 204, 205, 206, 207, 209, 211 topology 41, 103 total variation 50
I.lx
82 ultrabornological 86 uniform convergence 235 uniform limit 60 uniform space 41, 56, 176, 177, 179, 180, 181 uniformity 42, 176, 178 uniformly continuous 6, 42, 88, 125, 177 unique 102, 103, 105, 107, 108 unit circle 40, 197 universally measurable 21, 82 universally Radon measurable 86 unrectifiable 192 utility measure 267
363
Index
VajzoviE, Fikret 73, 346 variables 2 variational problem 1 vector variable 2, 49, 120 vector-valued finite Radon measure 50 vector-valued function 31, 50, 120, 137, 150 vector-vector function 15, 19, 22 Vincze, Endre 14, 346 Vitali covering 189 Wagner, Carl 253, 345 Wallman, Henry 42, 339 Waterloo Maple Inc. 248 weak additional compactness condition 34 weak regularity condition 5, 13, 16, 17, 24 Weierstrass approximation theorem 154 Weierstrass's function 292 Weierstrass's P function 293 Weierstrass's sigma function 22, 29, 110, 127, 285, 292 Weierstrass's function 292 weight 41, 43, 78 weight function 220 Weil, Andr6 21, 53, 346 well ordered 199 worksheet 248, 249, 251, 252, 253, 254
<
40 Zdun, Marek Cezary 234, 235, 346 Zdun's theorem 235 Zeidler, Eberhard 8, 22, 50, 109, 128, 346
<
293 ZFC 193, 214 Zygmund, Antoni 344
292, 295, 298,