H) = T. Let H₁ be any member of K such that HRH₁, where H = l(H'). Then either H = H₁, or HSⁿH₁ for some n > 0: in either case there exists H₁' ∈ K' such that H'R'H₁' and H₁ = l(H₁'). By assumption, either η(A, H₁') = F or η(B, H₁') = T, whence, by the induction hypothesis, either φ(A, H₁) = F or φ(B, H₁) = T. Since H₁ was arbitrary (subject to HRH₁), φ(A ⊃ B, H) = T. Conversely, suppose η(A ⊃ B, H') = F. Then for some H₁' such that H'R'H₁', η(A, H₁') = T and η(B, H₁') = F. By the induction hypothesis, φ(A, l(H₁')) = T and φ(B, l(H₁')) = F; since l(H')Rl(H₁'), φ(A ⊃ B, l(H')) = F, as desired. The case of ¬ is quite similar. Q.E.D.
Notice that the situation contrasts with that in S4, where it is often impossible to replace an arbitrary finite model by an equivalent finite tree model (cf. [2]). The third part of Theorem 1 extends the procedure for finding a tree model equivalent to an arbitrary model to quantificational models. Here we cannot use the same construction as a tree q. model and as a Beth q. model, as will be seen when we define the latter, in preparation for the fourth part of Theorem 1. THEOREM 1 (Third part): Let (G, K, R) be a q.m.s. with domain function ψ(H). (R need not be anti-symmetric.) Let S be any relation (not necessarily irreflexive) such that R = S*. Let φ be a quantificational model on (G, K, R). Let (G', K', R') be defined as in the second part of the theorem, and let Ψ(H') = ψ(l(H')). Let η(Pⁿ, H') = φ(Pⁿ, l(H')) for each predicate letter Pⁿ and each H' ∈ K'. Then η is a quantificational model on the q.m.s. (G', K', R') with domain function Ψ. Further, relative to a given assignment to the free variables of A, η(A, H') = φ(A, l(H')): in particular, η(A, G') = φ(A, G).
The proof is left to the reader. Notice that, since S is not required to be irreflexive, it may in particular be R itself: thus (G', K', R') may be as in the second part of Theorem 1, or may be identical with the Beth model (G', K', R') of the first part. As a quantificational model, however, η will not be a Beth quantificational model, to the definition of which we now turn. Unlike our own models, with their variable domains (a feature we have noted to be essential), the Beth quantificational models are based on a fixed domain D. We define a Beth q.m.s. to be a Beth m.s. (G, K, R),
together with a domain D with at least one element. A Beth q. model η is a binary function η(Pⁿ, H), whose value is T or F when n = 0, and is a subset of Dⁿ for n ≥ 1. We require, in addition to the conditions (b) and (c) above on η, the analogues for n ≥ 1: (bₙ) if HRH', η(Pⁿ, H) ⊆ η(Pⁿ, H'); (cₙ) if H is barred by B ⊆ K, then

⋂{η(Pⁿ, H') : H' ∈ B} ⊆ η(Pⁿ, H).
For an atomic formula Pⁿ(x₁, ..., xₙ), define η(Pⁿ(x₁, ..., xₙ), H) = T, relative to an assignment of a₁, ..., aₙ ∈ D to x₁, ..., xₙ, iff (a₁, ..., aₙ) ∈ η(Pⁿ, H); otherwise, = F. We then define the values for more complex formulae by induction. The inductive clauses for the propositional connectives are as above. Let the formula A(x₁, ..., xₙ, y) contain only the free variables listed. We define η((y)A(x₁, ..., xₙ, y), H) = T, relative to an assignment of aᵢ ∈ D to xᵢ (1 ≤ i ≤ n), iff η(A(x₁, ..., xₙ, y), H) = T relative to any assignment of an element b ∈ D to y and aᵢ to xᵢ; otherwise, = F. Again η((∃y)A(x₁, ..., xₙ, y), H) = T when aᵢ is assigned to xᵢ iff there is a B ⊆ K such that H is barred by B and for any H' ∈ B there is a b ∈ D such that η(A(x₁, ..., xₙ, y), H') = T when aᵢ is assigned to xᵢ and y is assigned b; otherwise, = F. Using the inductive clauses and the conditions on atomic formulae, we can prove the analogues of (b) and (c) for an arbitrary formula A, relative to a fixed assignment to its free variables in a Beth quantificational model η: If η(A, H) = T and HRH', η(A, H') = T. If H is barred by B and η(A, H') = T for any H' ∈ B, then η(A, H) = T. Suppose we are given a quantificational model φ on a m.s. (G, K, R) such that U = ⋃{ψ(H) : H ∈ K}
is countable. We will transform φ into a Beth quantificational model whose domain D is the set N of non-negative integers. Let (G', K', S') be as above, and R' = S'*. Notice that N is a countable union of disjoint countable sets; call these Nᵢ (i = 0, ...). We have a procedure which, for each H' ∈ K', generates certain elements of N at H'; the set of elements generated at H' will be identical with

N₀ ∪ ⋯ ∪ Nₙ
for some n. Further, ifP is any path in K', every pEN will be generated at some H' E P. Further, the procedure will satisfy the condition that if H' R'H", every element generated at H' is also generated at H". An element generated at H', but not at its predecessor (if any exists), is said to be introduced at H'. Further, any natural number n generated at H' is assigned a unique element of t/J(l(H')); this element is called v(n, H'). The v-function will satisfy the condition that if n is generated at H', and H'R'H", then v(n, H') = v(n, H"). We give an inductive definition on the tree (G', K', S') of a procedure with these properties; at any stage, satisfaction of these properties will be taken to be part of the inductive hypothesis. First, consider the origin G' of the tree. We generate exactly the elements of No at G', and we define v(n, G '), for n E No, in such a way that No is mapped onto t/J(G). (This is possible since t/J(G) is at most countable. All arbitrary choices can be made precise, if desired, using well-orderings of the denumerable sets Nand Ll.) Suppose we have defined the set of all integers generated at H' it is, say, m
M = N₀ ∪ ⋯ ∪ Nₘ
and have defined v(n, H') for each n E M. Let H'S'H". Then introduce all elements of N n + l' so that the set of elements generated at H" is M v N m + l' Define v(n, H") for n E M v N m + 1 by v(n, H") = v(n, H') for n E M, and such that v(n, H") maps N n + 1 onto t/J(1(H")). Then the inductive definition is complete. We now define a Beth quantificational model Yf whose domain is N on the Beth m.s. (G', K ' S') as follows: If P is a propositional letter, define Yf(P, H') = ¢(P, l(H')). For an n-adic predicate letter n define Yf(r, H') to be the set of n-tuples (ml' , m n ) of natural numbers such , m; are all generated at H" and that, for every H" E K' such that m l , H'R'H", (v(m l , H"), ... , v(mm H")) E ¢(r, l(H")). THEOREM I: (Fourth part): Yf is a Beth quantificational model on (G', K ', S') whose domain is N. For any H' E K' and formula A(x l , . . . , x n ) , whose free variables are exactly those listed, and natural numbers m l' . . . , m.; which have been generated at H', Yf(A(x1 , ••. , x n) , H') = T when Xl' ••• , X n are assigned m l , . . . , m i; respectively, if and only if ¢(A(x 1 , ... , x n) , l(H')) = T when Xl' ••. , X n are assigned v(m 1 , H'), ... , v(m n ,
H'), respectively. In particular (n = 0), ifA is a closedformula, rJ(A, H') = cf>(A, l(H'». PROOF. We show first that rJ is a Beth quantificational model. Conditions (b) and (b n ) are obvious. Condition (c) is proved as in the first part of the theorem. Condition (en) (n ~ 1) is proved as follows: Suppose H' E K' is barred by B £; K', and suppose {m I' ... , m n) is not in rJ(pn, H'). We show that there is an H" E B such that (m l , ... , m n ) is not in rJ(pn, H"). Since (m I' ... , m n) is not in I/(pn, H'), there is an H~ E K' such that H' R'H~, m I' . . . , m n are all generated at H~, and (v(m I' H~), ... , v(mn> H~» is not in cf>(pn, l(H~)). As in the first part of this theorem, let P be the path P(H~) through H~, with the property that, for H" on the path and H~R'H", l(H~) = l(H"). Then P intersects B in an element H". If H" R'H~, then since clearly (m l , ... , mn) is not in rJ(P", H~), by condition (b"), it is not in rJ(pn, H"). If H~R'H", then since l(H") = l(H~), and v(mi' H~) = v(mi' H"), we have «v(m l , H"), ... , v(m n, H"» tt ¢(pn, l(H"», so that (m l , . . . , mn) ¢ rJ(P", H"), the desired conclusion.
We now prove the assertion in the second sentence of the present Fourth part by induction; the third sentence is a special case. Let A(x l , . . . , nX) be atomic. If n = 0, see the proof of the first part of this theorem. If n > 0, write A(x l , . . • , xn) as P"(x l , . . . , xn) . Suppose m l , . . . , mn are all generated at H' E K '. Let H = l(H'), and a, = v(mi' H'). If c/>(P"(XI' ... , x n) , H) = T, when Xi is assigned a, (l ~ i ~ n), then (a I ' . . . , an) E cf>(P", H). If H' R'H~ (H~ E K'), let H o = l(H~). Then HRH o, hence a I' , an E ",(H o) . Also a, = v(mi' H') = v(mi , H~). This shows that , Inn) E rJ(pn, H'), hence rJ(pn(x l , . . . , x n) , H') = T, relative to the. (m l , assignment of m, to Xi' as desired. On the other hand, if cf>(P"(x I' •.. , x n ) , H) = F relative to this assignment, and hence (ai' ... , an) ¢ cf>(pn, H), we clearly have (m l , . . . , mn ) ¢ rJ(P", H'), again as desired. The inductive clauses for the propositional connectives are as in the first part of this theorem. Suppose the result proved for A(x l , . . . , X n , y). Again let m, be assigned to Xi' let H = l(H'), and let a, = v(mi' H') (i = 1, ... , n). Let cf>«3y)A(xl , ••• , X n , y), H) = T when Xi is assigned a.. Then there is e b e ",(H) such that cf>(A(x I' ... , X n , y), H) = T when in addition y is assigned b. v(p, H') maps the elements generated at H' onto ",(H), so let v(p, H') = b, where p is generated at H'. Then, by inductive hypothesis rJ(A(xl , • . . , X n , y), H') = T when Xi is assigned m, (i = 1, ... , n) and
y is assigned p; hence η((∃y)A(x₁, ..., xₙ, y), H') = T when xᵢ is assigned mᵢ. On the other hand, suppose
η(A(x₁, ..., xₙ, y), H₀') = F when xᵢ is assigned mᵢ and y is assigned p. Hence η((y)A(x₁, ..., xₙ, y), H₀') = F when xᵢ is assigned mᵢ; but since H'R'H₀', η((y)A(x₁, ..., xₙ, y), H') = F relative to this same assignment.
This concludes the proof of the theorem. Q.E.D.
The fourth part of Theorem 1 shows how any quantificational model φ can be transformed into a Beth quantificational model η. Essentially, if we have arrived at a certain position H' ∈ K' and if H = l(H'), the numbers introduced at H' are "identified" with certain elements of ψ(H) by v(n, H'). An example, following the spirit though not the letter of the proof of Theorem 1, fourth part, converts the countermodel of section 1.1, Figure 3, for (x)(P(x) ∨ Q). ⊃ . (x)P(x) ∨ Q into a corresponding Beth quantificational countermodel in the natural numbers. In Figure 3, there are two evidential situations, G and H; ψ(G) = {a}, ψ(H) = {a, b}. As natural numbers are generated, as long as we remain at the evidential situation G, we must "identify" each natural number with a (and therefore give it all properties assigned to a in Figure 3), but if we pass to H, we must "identify" some natural number with b. These considerations lead to the following figure:
Figure 5. (Along the horizontal branch P(0), P(1), P(2), P(3), ... are established in succession; at each stage one may instead pass to a side branch H₁, H₂, H₃, ..., at which Q is established.)
This is exactly Beth's counterrnodel to (x) (P(x) v Q). ~ . (x)P(x) v Q. As long as we remain on the horizontal branch but are uncertain that we will continue thereon, we have not established (x)P(x) v Q; but on the other hand, for each natural number x, either P(x) or Q is eventually established. We have not mechanically applied the proof of Theorem I to obtain this model, but instead have reproduced its spirit; in particular, we have introduced a simplification analogous to that required to obtain Figure 4b from Figure 4a. Notice that Figure 5 can be interpreted in terms of absolutely free choice sequences as follows: Let a be an absolutely free choice sequence on the binary spread. Let P(x) abbreviate a(x) = 0, and let Q be (3x) (a(x) = I). Then, if x ranges over the natural numbers, clearly (a ~ B) (x) (P(x) v Q), but ,(a ~ B) «x)P(x) v Q). And, analogously, as Kreisel
and Dyson (Kreisel [II] and Dyson & Kreisel [9]) have observed, countable Beth quantificational models can always be interpreted thus. So Theorem I gives a new intuitive interpretation of our models, in which all quantifiers range over the natural numbers. Since below we will obtain a completeness theorem for countable quantificational models, and since such models can always be transformed into Beth (q.) models, our completeness results include those of Beth. (Beth required his models to be finitary, but we will show in part II how to obtain finitary Beth models.)
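A minimal illustrative sketch: the countermodel of Figure 3 can be checked mechanically by evaluating the two formulas with the truth clauses of section 1.1. The particular extensions assigned to P and Q below are assumptions consistent with the description above (P holds of a throughout, P fails of b at H, and Q holds only at H).

```python
# Evaluating formulas in the finite quantificational model of Figure 3,
# following the truth clauses of section 1.1 (assumed extensions for P, Q).

WORLDS = ["G", "H"]
R = {("G", "G"), ("G", "H"), ("H", "H")}        # reflexive, G R H
DOMAIN = {"G": {"a"}, "H": {"a", "b"}}          # increasing domains
P_EXT = {"G": {"a"}, "H": {"a"}}                # phi(P, .)
Q_VAL = {"G": False, "H": True}                 # phi(Q, .)

def later(w):
    return [v for v in WORLDS if (w, v) in R]

def forall_P(w):
    # phi((x)P(x), w) = T iff for every accessible w' and every d in its domain,
    # d lies in the extension of P at w'.
    return all(d in P_EXT[v] for v in later(w) for d in DOMAIN[v])

def forall_P_or_Q(w):
    # phi((x)(P(x) v Q), w), by the same clause for the universal quantifier.
    return all(d in P_EXT[v] or Q_VAL[v] for v in later(w) for d in DOMAIN[v])

def implication_holds_at_G():
    # phi(A => B, G) = T iff at every w accessible from G, A is F or B is T.
    return all((not forall_P_or_Q(w)) or (forall_P(w) or Q_VAL[w])
               for w in later("G"))

print(forall_P_or_Q("G"))             # True:  (x)(P(x) v Q) holds at G
print(forall_P("G") or Q_VAL["G"])    # False: (x)P(x) v Q fails at G
print(implication_holds_at_G())       # False: the implication fails at G
```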
1.3. Other interpretations ofthe models Sections 1. I and 1.2 gave interpretations of our models which were intended to accord with the interpretations intuitionists customarily assign to their logical constants. In this section we will give two formal interpretations of the modelling which do not claim any direct intuitionistic content. (Both interpretations are actually direct special cases of the modelling; they simply consider a restricted class of models.) One interpretation is based on provability in formal systems; it was described briefly in [3]. The other is based on Paul Cohen's notion of forcing [5]. The two interpretations are intimately related to each other. This section may be omitted without loss of continuity.
I. Provability interpretation. Let Eo be a formal system, and let E be an arbitrary extension thereof. Let K be the set of all such E, and let ERE' iff E' is an extension of E. We define an atomic formula P to be a closed wff of Eo. (Note that P need not be an atomic formula of Ea.) We can then build non-atomic formulae out of the P's using the connectives A, ::J, " v. If we define l/>(P, E) = T iff P is provable in E and F otherwise, then l/>(P, E} is a model on the m.s. (Eo, K, R). Thus for any complex formula A which is a theorem of the intuitionist propositional calculus, l/>(A, Eo} = T. If Eo is elementary number theory Z, and P is Godel's undecidable formula, then l/>(P v ,P, Eo} = F; for P is not provable in Eo, but it is provable in certain extensions E. The larger problem, whether Heyting's propositional calculus is complete with respect to this particular choice of Eo, remains open. To interpret intuitionistic quantification theory in this manner, we must assume that the system Eo and its extensions have notions of free
variables and of constants, and that Eo contains at least one constant. For any E E K, let I/I(E) be the set of all constants of E. Then if ERE', I/I(E) s I/I(E'). For every n, define an n-adic atomic predicate P" to be a formula of Eo with n free variables, together with a I-I function from the integers I, ... , n to the free variables of P", The variable assigned by this function to m(1 ~ m ~ n) is called the mth free variable of P". Define, for n ~ 1, the set ¢(pn, E) s [I/I(E)]n as follows: An n-tuple (a l ' . . . , an) of constants in I/I(E) is in ¢(P", E) iff the result of the simultaneous substitution of aj(1 ~ i ~ n) for the ith free variable of P" is a theorem of E. Out of the atomic n-adic predicates (which play the role of the n-adic predicate letters above), we can build more complex formulae using the propositional connectives and the quantifiers. ¢(P", E) then becomes an intuitionistic quantificational model. It is clear that in the preceding K can be replaced by any subset K' thereof (e.g., the finitely axiomatizable extensions of Eo). Further, restrictions, such as recursive enumerability, on the notion of formal system, can be removed at will. There is also a more "model-theoretic" variant of the present interpretation of Heyting's predicate calculus, which eliminates the assumption that E must-contain constants. Further, the interpretations can be extended in other directions so as to yield new interpretations of larger parts of intuitionistic mathematics; in particular, we can give an interpretation of FC which leads to a proof that FC is an inessential extension of Heyting's arithmetic"). For more on provability interpretations of intuitionistic and modal logics, cf. [3]. 2. Cohen's notion of "forcing," Let D be an arbitrary countable infinite set. Let 9 = (9 0 , ( 1) be a pair of finite, disjoint subsets of D, and let K be the set of all such pairs. If 9 = (.0/'0' !!J I) and 9' = (9~, 9'1) are in K, theqdefine 9 R9' (or, f!}' is an extension of 9) iff 9"0 s 9~ and 9 1 S 9~. Further, let I/I(g» = 9 0 u:3' 1. Now consider a single monadic predicate letter P. For any g; E K, define ¢(P,9) = 9 0 . Let K' be the set of all 9 E K such that 1/1(.9) is non-empty. Then for any g; E K', (g>, K', R) is a q.m.s., with the associated domain function 1/1. (If we had modified Heyting's predicate calculus so as to admit the empty domain and thus permit I/I(Y') to be empty, the rather artificial use of K' in place 1) Kreisel has independently obtained this result using an elimination of free choice sequences by contextual definition.
of K could be dropped.) Then ¢ is a model on (g'J, K', R), and for any formula A built from P using propositional connectives and quantifiers, the inductive definitions we have given define a truth-value ¢(A, g'J'), for any g'J' E K', relative to a fixed assignment of elements of D to the free variables of A. If this value is T, we say that g'J' forces A relative to the assignment. (Notice that the value of ¢(A, g'J') is clearly independent of the choice of the "designated" element g'J of (g'J, K', R).) If D' is a subset of D, we say that g'J , agrees with D' iff g'J~ s D' and g'J~ s D - D'. We can say that D' forces A (relative to a given assignment to the free variables) iff there is a g'J' E K' which agrees with D' and forces A. Notice that if g'J' and g'J" agree with D', they have a common extension which agrees with D'; thence it easily follows that D' cannot force a statement together with its negation. Call D' generic iff for every A, and fixed assignment to the fret: variables thereof, D' forces either A or ..,A. Cohen proves that generic sets exist: Let {An} be an enumeration ofall the ordered couples Ai =
assertion is readily proved by induction on the complexity of A. Since, classically speaking, a (x) can always be replaced by ,(3xh the restriction that universal quantifiers be absent is not important. The definition we have given differs from Cohen's in inessential respects. (It may be closer to a definition given by Feferman, which we have not seen 1). It is clear that the notion can be extended. For example, we need not deal with a single predicate P(x); we can deal with several such, not all of which need be monadic. The modifications needed for this more general situation should be obvious. Further, we can replace the countable set D by a set of regular cardinality N,,; K will consist of disjoint pairs of sets of cardinality less than ~". Cohen's motivation was radically different from ours, but it is clear that his notion is intimately related to our model theory. The "deeper" reasons for this relation may yet be unknown. It should be noted that Dana Scott had already observed that Cohen's idea was similar to an interpretation conjectured by Kreisel [17]. And indeed, if Kreisel's conjectures prove correct, his interpretation of intuitionism will be closely related to ours.
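A minimal sketch of the combinatorial part of this interpretation, under the conventions just given (the function names and the choice of D are illustrative only): conditions are pairs of disjoint finite subsets of D, ordered by componentwise inclusion, and a condition forces an atomic formula P(n) just in case n already lies in its first component.

```python
# Cohen-style conditions for a single monadic predicate P over D = the
# natural numbers.  A condition is a pair (p0, p1) of disjoint finite
# subsets of D: p0 lists numbers already forced to satisfy P, p1 those
# forced not to.

def is_condition(p):
    p0, p1 = p
    return p0.isdisjoint(p1)

def extends(q, p):
    # q extends p iff p0 <= q0 and p1 <= q1 (componentwise inclusion).
    return p[0] <= q[0] and p[1] <= q[1]

def forces_atomic(p, n):
    # p forces P(n): true at p and, by monotonicity, at every extension of p.
    return n in p[0]

def forces_neg_atomic(p, n):
    # p forces ~P(n): no extension of p can ever force P(n).
    return n in p[1]

def agrees(p, d_prime):
    # p agrees with a subset D' of D iff p0 lies inside D' and p1 inside D - D'.
    return p[0] <= d_prime and p[1].isdisjoint(d_prime)

p = (frozenset({0, 2}), frozenset({1}))
q = (frozenset({0, 2, 5}), frozenset({1, 3}))
print(is_condition(p), extends(q, p))                                    # True True
print(forces_atomic(p, 0), forces_neg_atomic(p, 1), forces_atomic(p, 3)) # True True False
print(agrees(p, {0, 2, 4}))                                              # True
```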
2. Semantic tableaux In this section we develop Beth semantic tableaux for intuitionistic logic. The notion developed here is similar to those of [2], [11], which can be read as background if desired. We deal at each stage of the construction with a system of alternative sets of tableaux; each alternative set is ordered in the form of a tree, and the origin of the tree is called the main tableau of the set. We call the tree ordering relation on an alternative set "S"; the smallest reflexive and transitive relation containing "S" is called "R". We can assume, at a given stage of the construction, that each alternative set is diagrammed on a piece of paper; corresponding to the system of all the alternative sets of the stage, we have a leaflet of which the separate sheets of paper are pages. Given a formula A of Heyting's predicate calculus, to see whether it is valid we attempt to find a countermodel to A. If A has the form Al A •.. Am. :=> • B I V ... B n, then what we need is a model ¢, such that relative to some assignment to the free variables of A, ¢(A j, G) = T and 1) See note at end of paper.
¢(B j ' G) = F, 1 :0; i :0; m, 1 :0; j :0; n. We represent the situation by putting A l , ... , Am on the left, and B l , . . . , B; on the right of the main tableau of a construction. We continue the construction, which gives a systematic attempt to find a tree countermodel to A, by the following rules, which apply to any tableau of any alternative set of the construction:
Nl. If ¬A appears in the left column of a tableau, put A in the right column of that tableau.

Nr. If ¬A appears in the right column of a tableau t, start out a new tableau t₁, with tSt₁, by putting A on the left of t₁.

Al. If A ∧ B appears on the left of a tableau t, put A and B on the left of t.
Ar. If A ∧ B appears in the right column of a tableau t, there are two alternatives; extend the tableau t either by putting A in the right column or by putting B in the right column. If the tableau t is in an ordered set 𝒮, it is clear that at the next stage we have two alternative sets, depending on which extension of the tableau t is adopted. Informally speaking, if the original ordered set is diagrammed structurally on a sheet of paper, we copy over the entire diagram twice, in one case putting in addition A in the right column of the tableau and in the other case putting B; the two new sheets correspond to the two new alternative sets. The formal statement is rather messy: Given a tableau t in an alternative set 𝒮, if t has A ∧ B on the right, we replace 𝒮 by two alternative sets 𝒮₁ and 𝒮₂, where 𝒮₁ = 𝒮 − {t} ∪ {t₁} and 𝒮₂ = 𝒮 − {t} ∪ {t₂}, and t₁ [t₂] is like t except that in addition it contains A [B] on the right. The tree ordering S₁ of the new set 𝒮₁ is precisely the same as S, save that t₁ replaces t throughout; and similarly for the tree ordering S₂ of 𝒮₂. (Formally, S₁ agrees with S on 𝒮 − {t}, and, if t' is the predecessor [a successor] of t, then t'S₁t₁ [t₁S₁t'].) We say 𝒮 splits into 𝒮₁ and 𝒮₂. Similar remarks apply to the rules Vl and Pl below.

Vl. If A ∨ B appears on the left of t, put either A on the left of t or B on the left of t. (As in the case of Ar, this splits the set 𝒮 containing t into two alternative sets.)

Vr. If A ∨ B appears on the right of t, put A and B on the right of t.

Pl. If A ⊃ B appears on the left of t, either put A on the right of t
or put B on the left. (Thus again the set g> containing t is replaced by two alternative sets.) Pro If A ::> B appears on the right of t, start out a new tableau t l, with A on the left of t 1 and B on the right, such that tSt l • For a construction involving quantifiers, we associate, at a given stage of a construction, a set I/1(t) of variables with each tableau t. We start out the definition of I/1(t) by assuming that, at the initial stage of the construction, which starts out with a single tableau to, I/1(to) consists of a single variable x. At later stages I/1(t) is to be enlarged only as required by the rules Ilr and II below and the stipulation that tSt l is to imply that I/1(t) S I/1(t 1 ) . We are now in a position to state the rules for quantifiers as follows: Ill. If (x)A(x) appears on the left of t and y is any variable in I/1(t), put A(y) on the left of t. Ilr. If (x)A(x) appears on the right of t start out a new tableau t 1 with tSt l • If y is the alphabetically earliest variable which has not yet occurred in any tableau of any alternative set at this stage, put y E I/1(t 1) and put A(y) on the right of t': El. If (3x)A(x) appears on the left of a tableau t, and y is the alphabetically earliest variable which has not yet appeared in any tableau of any alternative set at this stage, put y E I/1(t) and put A(y) of the left of t. Et, If (3x)A(x) appears on the right of a tableau t, and y is a variable in I/1(t), put A(y) on the right of t.
In addition to the rules we have stated, the following stipulation holds throughout the construction: if t and t 1 are tableaux of some one alternative set, at any given stage, such that tSt! , and A appears on the left of t, then put A on the left of t 1. Notice that, since the stipulation is to be iterated an arbitrary number of times, it also applies when A is on the left of t and tRt l • The relation tSt l is to hold in a construction only as required by the rules listed above. The rules may be applied in any order, as long as the order stipulated is such that every applicable rule is eventually applied. A tableau t is called closed iff some formula occurs in it on both the left and the right. A set or tree of tableaux is closed iff some tableau in
the set is closed. A system of alternative sets is closed iff every set of the system is closed. A construction started out by putting A on the right of the main tableau of the construction is called the construction for A. We can place the following restrictions on constructions: A rule is not to be applied to a tableau of a closed set; nor is it to be applied if it is "superfluous" (e.g., Al is not to be applied if A and B already appear on the left of the tableau t in question). Let us call an alternative set at any stage of a construction terminal iff it is not replaced at any stage of the construction by another set or pair of sets; thus, in particular, every closed set is terminal. In any construction, let α be some fixed sequence 𝒮₁, 𝒮₂, ... of alternative sets such that 𝒮₁ is a set at the first stage of the construction and 𝒮ᵢ₊₁ is the set, or one of the two sets, which, at the (i + 1)-th stage, replaces 𝒮ᵢ; α terminates at 𝒮ₙ iff 𝒮ₙ is terminal. (If the construction does not terminate there is at least one infinite such sequence α.) Any tableau t in 𝒮₁, or in 𝒮ᵢ₊₁ which is not an immediate descendant of any tableau in 𝒮ᵢ, is called an initial tableau. Let K be the set of all sequences τ of tableaux t₁, t₂, ... such that t₁ is an initial tableau and tᵢ₊₁ is an immediate descendant of tᵢ, and τ terminates at tₙ iff tₙ belongs to a terminal set 𝒮ₘ. Let τ₀ be that member of K whose first term t₁ is in 𝒮₁. Let τρτ', for τ, τ' in K, iff for some 𝒮ᵢ in α there are terms t, t' of τ, τ' in 𝒮ᵢ such that tRt' (R the ancestral of the tree ordering S). Then, intuitively, (τ₀, K, ρ) forms a q.m.s. with domain function ψ(τ), where ψ(τ) is the union of the sets ψ(t) for t a term of τ.
If a quantificational model φ is defined so that, for any sentence letter P, φ(P, τ) = T iff P appears on the left of some t in τ, and, for any predicate letter Pⁿ, φ(Pⁿ, τ) is the set of n-tuples (x₁, ..., xₙ) of variables such that Pⁿ(x₁, ..., xₙ) appears on the left of some t in τ, then, for every formula B, if B appears on the left of some t in τ, φ(B, τ) = T (relative to the assignment of each free variable in B to itself). Further, the dual law that, for every B, if B appears on the right of some t in τ, then φ(B, τ) = F, holds iff α does not terminate in a closed set 𝒮ₙ. Hence, if the construction was a construction for A, this is just the condition under which α provides a countermodel for A.
THEOREM 2: The construction for A is closed if and only if A is valid.
The proof, which follows the lines sketched intuitively above, and in addition shows that the alternative sets of the construction for A exhaust the possibilities of finding a countermodel for it, is omitted because it is a routine variation on the proofs of the corresponding theorems of [2] and [16].¹)

3. Completeness theorem

3.1. Consistency property

THEOREM 3: If A is provable in Heyting's predicate calculus, then A is valid.
This theorem is almost trivial; we need only verify that, in a standard formalization of Heyting's predicate calculus, the axioms are all valid, and the rules preserve validity. Such a verification is left to the reader. It follows that if A is provable, the construction for A is closed.

3.2. Completeness property
We show that every valid formula A is provable by showing that if the construction for A is closed, then A is provable. As in [2] and [16], we do this using a notion of "characteristic formula." As in [2], define the rank of a tableau in a finite tree of tableaux (or, indeed, of a node in any finite tree), as follows: An endpoint of the tree has rank 0. If t is not an endpoint, let t₁, ..., tₙ be its successors; then Rank(t) = Max{Rank(tᵢ)} + 1. It is easy to verify that, for any finite tree of tableaux, a unique rank is defined for each tableau of the tree. 1) Define A to be tree valid iff φ(A, G) = T for every model φ on a tree q.m.s.
(G, K, R). Then what really is readily proved is that the construction is closed iff A is tree valid. But, by section 1.2 above, validity coincides with tree validity. Alternatively, we can argue as follows without use of section 1.2: Clearly validity implies tree validity, and provability implies validity. The completeness result below shows that tree validity implies provability, so the three notions coincide. We could have defined a tableau procedure, based on a relation R, which would have been more appropriate to models than to tree models; a reader familiar with [2] will know how this could be carried out. Notice that, as observed in analogous cases in [2] and [16], the countermodels for non-valid formulae obtained by Theorem 2 from tableaux are always on a countable tree q.m.s. (G, K, R) with a countable set U of individuals involved. This "Löwenheim-Skolem" result will be used in part II to show that the present completeness results include those of Beth [8].
Given any tableau t in a tree of tableaux, define the following sequence {til: to = t, t j + l = the predecessor of t j , if such a predecessor exists, and undefined otherwise. The sequence is clearly finite, and its last term is the origin of the tree We call it the "path from t back to the origin." The terms of the sequence other than t "come before t" on the tree. For any t on a tree, let X(t) be the set of all variables occurring free in t but not in any tableau coming before it. At any stage of a construction, the tableaux of an alternative set form a finite tree. We define the characteristic formula of a tableau t in the set at a given stage by induction on its rank in the set. Given a tableau t, let AI' ... , Am[Bl , . . . , B n] be the formulae occurring on the left [right] of t. Further, let Xl' ... , x q be the elements of X(t). (Possibly q = 0.) If Rank (t) = 0, then the characteristic formula of t is defined as (x.). .. (Xq ) (AI A .. . A m.::::> .B l V .. • B n ) ; or, if there are no formulae on the left [right] of t, as (Xl)' .. (Xq) (B I V ... B n ) [(Xl)' .. (xqHA 1 A Am)]' If Rank (t) > 0, let t l , . . . , t p be the successors of t, and let C l , , Cp be the corresponding characteristic formulae. Then the characteristic formula of t is (Xl)' .. (x q) (AI A •.. Am.::::> .B; V ... B; V C, V ... C p ) ; or, if there are no formulae on the left [right] of t, the characteristic formula is (Xl)" . (Xq ) (B 1 V .. . B; V C l V " .Cp ) [(Xl)" . (Xq ) (A 1 A .. • A m.::::> ,C 1 V ... C p ) ] . The characteristic formula of an alternative set (tree) of tableaux is defined as the characteristic formula of the main tableau of the set. The characteristic formula of the entire system of alternative sets at a given stage of a construction is defined as the conjunction of the characteristic formulae of the alternative sets of the system. In a natural sense, the present notion of characteristic formula is "dual" to that of [2] and [16]. It may facilitate the reader's comprehension of the notion of characteristic formula if he consults the corresponding treatment of characteristic formulae in [2], [16]. LEMMA: If A o is the characteristic formula of the initial stage of a construction, and B o is the characteristic formula of any stage of the construction, then I- B o ::::> A o.
PROOF. It suffices to show that the characteristic formula of any stage of the construction implies the characteristic formula of the preceding stage. But the characteristic formula of the mth stage has in general the form D₁ ∧ ... ∧ Dᵢ ∧ ... ∧ Dₙ, where the Dᵢ (1 ≤ i ≤ n) are the characteristic
formulae of the alternative sets of the stage. The rule which is applied and changes the mth stage into the m + lth affects only one alternative set, say with characteristic formula D i: If the rule is PI, Ar, or VI, it will change this set into two distinct alternative sets, with characteristic formulae D', and Dj; we wish to prove, then, J- D 1 A .. . D', A DjA ... D n . =:> • D 1 A ... D j A ... D w To do this, it suffices to prove D', A Dj. =:> • D i: Similarly, if the rule applied is other than PI, Ar, or VI, then D j is transformed into Dj; to prove that J- D 1 A ... D', A ... Dn' =:> • V 1 A ... D j A ... D n , it suffices to prove J- D', =:> D i: So, when a rule is applied transforming the mth stage of a construction into the m + 1th, we need only consider the characteristic formula of the set to which the rule is actually applied. Suppose, then, a rule (other than PI, or Ar, or VI) transforms a set Y with characteristic formula D j into one with characteristic formula Dj; we wish to prove J- Dj =:> D i: Let t be the tableau to which the rule is actually applied, and let C be its characteristic formula. Further, let C' be the characteristic formula of the tableau t' into which t is transformed by the given rule. (The rules Nr, Pr and IIr leave t unchanged, appending a new tableau t': In this case t' will be identical with t, but the new characteristic formula C' of t will not be identical with the old one C.) Suppose we can show J- C' =:> C. Then if t is the main tableau of the set Y, we have shown J- Dj =:> D j • Otherwise, let t 1 be the predecessor at stage m of t, let t~ be the predecessor at stage m+ Lof r', and let Cl[C~J be the characteristic formula oft 1 [ t a Then C 1 is a universal quantification (u.q.) of a formula of the form X. =:> • Yv C, and C~ is a u.q. of X. =:> • Yv C'. Since J- C' =:> C, clearly J- (X. =:> • Yv C') =:> (X. =:> • Y V C). Applying universal generalization to this last statement, and distributing universal quantifiers across the implication sign, we obtain J- C~ =:> C iIf t 1 is the main tableau of g, then C~ =:> C 1 is D', =:> D ; Otherwise, let t 2[t;J be the predecessor of tl[t~J, and apply the same reasoning as before. Eventually we will obtain D', =:> D j • Thus in the case ofany rule other than PI, VI, or Ar, we need only consider the tableau t to which the rule is actually applied, and prove the formula C' =:> C stated above. Notice that in general C, the characteristic formula of t, is a u.q. of a certain formula B, and C' is a u.q. of a certain formula B'. If we prove J- B' =:> B, then by universal generalization and distribution of the quantifiers across the implication sign, we can obtain C' =:> C.
Bearing these remarks in mind, we break down the proof into the following cases, depending on the rule applied to obtain the (m+1)-th stage from the mth. We can say a case is "justified" if we have shown, for the case, that ⊢ Dᵢ' ⊃ Dᵢ, which usually reduces to ⊢ B' ⊃ B. The reader is advised to consult the similar treatments in [2] and [16]. In considering a rule, we will in general assume that the tableau t to which it is applied contains formulae both on the left and the right, and that its characteristic formula is therefore an implication. The cases where the left or right side is empty will be left to the reader.

Case Nl. The characteristic formula of t is a u.q. of X ∧ ¬A. ⊃ . Y; after A has been put on the right, its characteristic formula becomes a u.q. of X ∧ ¬A. ⊃ . Y ∨ A. The case is justified by ⊢ X ∧ ¬A. ⊃ . Y ∨ A: ⊃ : X ∧ ¬A. ⊃ . Y.

Case Nr. The characteristic formula of t is a u.q. of X. ⊃ . ¬A ∨ Y. When we start out a new tableau t₁ with A on the left, and tSt₁, the characteristic formula of t₁ is ¬A (since X(t₁) is empty because any free variable of A already occurs in t), and that of t becomes a u.q. of X. ⊃ . ¬A ∨ Y ∨ ¬A. The case is justified by ⊢ X. ⊃ . ¬A ∨ Y ∨ ¬A: ⊃ : X. ⊃ . ¬A ∨ Y.

Case Al. Justified by ⊢ X ∧ (A ∧ B) ∧ A ∧ B. ⊃ . Y: ⊃ : X ∧ (A ∧ B). ⊃ . Y.
Case Ar. Let the characteristic formula of t, call it C, be a u.q. of X. ⊃ . Y ∨ (A ∧ B). The rule Ar "splits" t into two alternative tableaux, t' and t'', whose characteristic formulae C' and C'' are u.q.'s of X. ⊃ . Y ∨ (A ∧ B) ∨ A and X. ⊃ . Y ∨ (A ∧ B) ∨ B, respectively. Using ⊢ (X. ⊃ . Y ∨ (A ∧ B) ∨ A) ∧ (X. ⊃ . Y ∨ (A ∧ B) ∨ B): ⊃ : X. ⊃ . Y ∨ (A ∧ B), and generalizing, and distributing quantifiers, we obtain ⊢ C' ∧ C''. ⊃ . C. If t is the main tableau of the set, this is the desired result ⊢ Dᵢ' ∧ Dᵢ''. ⊃ . Dᵢ. Otherwise, let t₁ be the predecessor of t. The characteristic formula C₁ of t₁ is a u.q. of X₁. ⊃ . Y₁ ∨ C; it is transformed by Ar into two alternative characteristic formulae C₁' and C₁'', which are u.q.'s, respectively, of X₁. ⊃ . Y₁ ∨ C' and X₁. ⊃ . Y₁ ∨ C''. Using ⊢ C' ∧ C''. ⊃ . C, we easily obtain ⊢ C₁' ∧ C₁''. ⊃ . C₁. Continuing this process along the path from t back to the origin, in a finite number of steps we obtain ⊢ Dᵢ' ∧ Dᵢ''. ⊃ . Dᵢ.

Case Pl. Like Ar, using ⊢ (X ∧ (A ⊃ B). ⊃ . Y ∨ A) ∧ (X ∧ (A ⊃ B) ∧ B. ⊃ . Y): ⊃ : X ∧ (A ⊃ B). ⊃ . Y.
Case Pr. Let the characteristic formula of t be a u.q. of X. ⊃ . Y ∨ (A ⊃ B). Pr instructs us to start out a tableau t₁, with A on the left and B on the right, whose characteristic formula is thus A ⊃ B (X(t₁) being empty). Then the characteristic formula of t is transformed into a u.q. of X. ⊃ . Y ∨ (A ⊃ B) ∨ (A ⊃ B), and ⊢ X. ⊃ . Y ∨ (A ⊃ B) ∨ (A ⊃ B): ⊃ : X. ⊃ . Y ∨ (A ⊃ B) justifies the case.

Case Vl. Like Ar, using ⊢ (X ∧ (A ∨ B) ∧ A. ⊃ . Y) ∧ (X ∧ (A ∨ B) ∧ B. ⊃ . Y): ⊃ : (X ∧ (A ∨ B). ⊃ . Y).
Case Vr. Justified by ⊢ X. ⊃ . Y ∨ (A ∨ B) ∨ A ∨ B: ⊃ : X. ⊃ . Y ∨ (A ∨ B).

Case El. If t has as characteristic formula C, a u.q. of X ∧ (∃x)A(x). ⊃ . Y, after application of El, t is transformed into t₁, whose characteristic formula C' is a u.q. of X ∧ (∃x)A(x) ∧ A(a). ⊃ . Y. Since a is a new variable not previously introduced, a ∈ X(t₁). Thus, we can take C' to be a u.q. of (a)(X ∧ (∃x)A(x) ∧ A(a). ⊃ . Y). So ⊢ (a)(X ∧ (∃x)A(x) ∧ A(a). ⊃ . Y): ⊃ : X ∧ (∃x)A(x). ⊃ . Y justifies the case.

Case Er. Justified by ⊢ X. ⊃ . Y ∨ (∃x)A(x) ∨ A(a): ⊃ : X. ⊃ . Y ∨ (∃x)A(x).

Case Πl. Justified by ⊢ X ∧ (x)A(x) ∧ A(a). ⊃ . Y: ⊃ : X ∧ (x)A(x). ⊃ . Y.
Case IIr. The characteristic formula of t is a u.q. of X. ::::> • Y v (x)A(x). IIr instructs us to start out a new tableau t\ with tSt\ and with A(a) on the right, where a has not previously been used. Then X(t 1 ) = {a}, since a is the only free variable of t 1 which does not occur in t, Hence the characteristic formula of t 1 is (a)A(a), and the characteristic formula of t is transformed into a u.q. of X.::::>. Yv (x)A(x) v (a)A(a). So I- X. ::::> • Y v (x)A(x) v (a)A(a): ::::> :X. ::::> • Y v (x)A(x) justifies the case.
Finally, we must justify the rule stipulating that if a formula A appears on the left of a tableau t, and tSt₁, we must put A on the left of t₁. This is justified by ⊢ X ∧ A. ⊃ . Y ∨ ((X' ∧ A) ⊃ Y'): ⊃ : X ∧ A. ⊃ . Y ∨ (X' ⊃ Y'). The lemma is proved.

THEOREM 4: If A is valid, then A is provable in Heyting's predicate calculus.

PROOF. We can assume A has no free variables. Since A is valid, the
construction for A is closed. Then there is a stage at which each alternative set is closed; let the characteristic formula of that stage be D 1 /\ • • • D., where the D /s are the characteristic formulae of the alternative sets of the stage. By the lemma, D 1 A ..• D n • =:l.A (since A is the characteristic formula of the initial stage). So it suffices to show D j for each j. The alternative set whose characteristic formula is D j' being closed, contains a closed tableau t. Then t contains a formula B on both sides, so its characteristic formula C is a u.q. of X /\ B. =:l • Y v B. Clearly I- C. If t is the main tableau of the set, this is D i: Otherwise, let t 1 be the predecessor of t. Then the characteristic formula C 1 of t 1 is a u.q. of X'. =:l • Y ' V C. Clearly I- C 1 • Continuing in this manner, we are driven back along the path from t to the origin until we obtain I- D I: Q.E.D. REMARK. The theorem gives a finitary proof that if the construction for A is closed, I- A. We could have proved it alternatively by showing that the tableau procedure is equivalent to a standard Gentzen formulation of Heyting's system. Of course the theorem and proof apply to the propositional calculus, even though the proof was carried out for the predicate calculus.
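A minimal sketch, confined to the propositional case (so that the quantifier prefixes (x₁)...(x_q) are empty) and treating formulae simply as strings, of how ranks and characteristic formulae are computed over a finite tree of tableaux:

```python
# A tableau is a pair (left, right) of lists of formulas (as strings); the
# tree is given by a successor map.  The clauses mirror the definitions above:
# an implication when both sides are non-empty, a disjunction when the left is
# empty, a negated conjunction when the right side and the successors are empty.

def rank(t, succ):
    kids = succ.get(t, [])
    return 0 if not kids else max(rank(s, succ) for s in kids) + 1

def char_formula(t, tableaux, succ):
    left, right = tableaux[t]
    kids = succ.get(t, [])
    disjuncts = list(right) + [char_formula(s, tableaux, succ) for s in kids]
    if not left:
        return "(" + " v ".join(disjuncts) + ")"
    if not disjuncts:
        return "~(" + " & ".join(left) + ")"
    return "(" + " & ".join(left) + " -> " + " v ".join(disjuncts) + ")"

# Example: main tableau t0 with A = (P -> Q) on the right, and one successor t1
# produced by rule Pr, i.e. P on the left and Q on the right of t1.
tableaux = {"t0": ([], ["(P -> Q)"]), "t1": (["P"], ["Q"])}
succ = {"t0": ["t1"]}
print(rank("t0", succ))                    # 1
print(char_formula("t0", tableaux, succ))  # ((P -> Q) v (P -> Q))
```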
References [I] Saul A. Kripke, Semantical Analysis of Modal Logic (abstract). The Journal of Symbolic Logic 24 (1959) 323-324. [2] Saul A. Kripke, Semantical Analysis of Modal Logic I. Normal Modal Propositional Calculi. Zeitschrift fur Mathematische Logik und Grundlagen der Mathemathik 9 (1963) 67-96. [3] Saul A. Kripke, Semantical Considerations on Modal and Intuitionistic Logic. Acta Philosophica Fennica 16 (1963) 83-94. [4] Saul A. Kripke, The Undecidability of Monadic Modal Quantification Theory. Zeitschrift fur Mathematische Logik und Grundlagen der Mathematik 8 (1962) 113-116. [5] Paul J. Cohen, The Independence of the Continuum Hypothesis. Proceedings of the National Academy of Sciences, U.S.A. 50 (1963) 1143-1148. [6] E. W. Beth, Observations on an Independence Proof for Peirce's Law (abstract). The Journal of Symbolic Logic 25 (1960; published, 1962) 389. [7] M. A. E. Dummett and E. J. Lemmon, Modal Logics between S4 and S5. Zeitschrift fur Mathematische Logik und Grundlagen der Mathematik 4 (1958) 250-264. [8] E. W. Beth, Semantic Construction of Intuitionistic Logic. Mededelingen der Koninklijke Nederlandse Akademie van Wetenschappen, Afd, Letterkunde, Nieuwe Reeks, Deel 19, No. 11. [9] V. H. Dyson and G. Kreisel, Analysis of Beth's Semantic Construction of In-
tuitionistic Logic. Technical Report no. 3, Stanford University Applied Mathematics and Statistics Laboratories, Stanford, California. [l0] S. C. Kleene, Introduction to Metamathematics. (Van Nostrand, New York; North-Holland Publishing Co., Amsterdam and P. Noordhoff Ltd., Groningen). 1952. [ll] G. Kreisel, A Remark on Free Choice Sequences and the Topological Completeness Proofs. The Journal of Symbolic Logic 23 (1958) 369-388. [l2] A. Heyting, Intuitionism: An Introduction. (North-Holland Publishing Co., Amsterdam 1956). [l3] S. Kuroda, Intuitionistische Untersuchungen der formalistischen Logik. Nagoya Mathematical Journal 2 (1951) 35--47. Known only from references. [14] A. A. Markov, 0 nepreryvnosti konstruktivnyh funkcij (On the continuity of constructive functions). Uspehi Matern. Nauk 9 (1954) 226-230. Known only from references. [15] G. Kreisel, On Weak Completeness of Intuitionistic Predicate Logic. The Journal of Symbolic Logic 27 (1962) 139-158. [l6] Saul A. Kripke, A Completeness Theorem in Modal Logic. The Journal of Symbolic Logic 24 (1959) 1-14. [l7] G. Kreisel, Set Theoretic Problems suggested by the Notion of Potential Totality in: Infinitistic Methods (Warsaw 1961). Note (added in proof, August 9,1964). We have since seen Feferman's paper, and his version of forcing is indeed virtually identical with ours, although he, of course, does not base it on any model theory for or connection with intuitionistic logic. He credits his version to Dana Scott. Note (added in proof, October 28,1964). In connection with the "Remark" at the end of section 1.1, it should be pointed out that the example in part I of the Remark already refutes Markov's principle. For we observed there that, in FC, (a) (IX I B) , (x) (IX (x) = 0), but also (b) , (IX I B) (3x) (IX x) = 1). By (b), noting that since B is the binary spread, (IX I B) (x) (IX (x) =F 0 :::> IX (x) = I), we have (c) , (IX I B) (3x) (IX (x) =F 0). But (a) and (c) jointly contradict Markov's principle. The example in part 2 of the Remark is of interest in showing that a single counterexample can refute both Markov's principle and Kuroda's conjecture. It should be noted that Markov's principle would imply, for IX on the full binary spread, that , (x) (IX (x) = 0) :::> (3x) (IX (x) = 1). From this it is easy to derive, for a real number a, that a =F 0 implies a # 0 (similarly to part 1 of the Remark). Hence if Brouwer's disproof (using ips depending on the solving of problems) of the latter is accepted, Brouwer has already refuted Markov's principle. I wish to thank M. A. E. Dummett and John Crossley for their help in editing this paper, and in particular, M. A. E. Dummett for an important correction in section 1.2.
SET THEORY AND HIGHER-ORDER LOGIC¹)

RICHARD MONTAGUE

University of California, Los Angeles, Calif., USA
Several mutual applications of set theory and higher-order logic are developed. Second-order logic is used to discover the standard models of Zermelo-Fraenkel set theory, consideration of standard models leads to the introduction of new systems of set theory, one of these systems is applied in finding a definition of truth for higher-order sentences, and finally Zermelo-Fraenkel set theory with individuals is given a philosophical justification as logically true within higher-order logic.

1. Standard models
Let us consider three well-known first-order theories. The first, called Peano's arithmetic, has the non-logical constants 0, S, +, " and the following axioms"): ,0= Sx,
Sx = Sy → x = y,
x+O = x, x+Sy = S(x+y), l) I am indebted to the United States' National Science Foundation, which supported the preparation of most of this paper under grant number NSF GP 1603 (Montague). 2) It is convenient for the purposes of the present paper to regard a first-order theory as determined by a sequence of non-logical constants and a set of axioms. I use the logical constants " A, Y, -->-,"-', A, Y, =, which are the respective symbols of negation, conjunction, disjunction, implication, equivalence, universal quantification, existential quantification, and identity. (I use ",", "A", etc. as names of certain symbols of the object language, and I indicate concatenation by juxtaposition.)
x·0 = 0,
x·Sy = (x·y)+x,
P[0] ∧ Λx[P[x] → P[Sx]] → ΛxP[x].
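Thus, for example, substituting the formula x·S0 = x for P yields the axiom

0·S0 = 0 ∧ Λx[x·S0 = x → Sx·S0 = Sx] → Λx x·S0 = x.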
The last principle is regarded as a schema, called the Induction Schema; we take as axioms all formulas of Peano's arithmetic obtainable without clash of variables by substituting a formula for P in the schema. The second theory, called (at least in an alternative formulation¹)) the theory of real closed fields, has the non-logical constants 0, 1, +, ·, −, ⁻¹, ≤, and the following axioms:

x+(y+z) = (x+y)+z,
x+y = y+x,
x+0 = x,
x+(−x) = 0,
x·(y·z) = (x·y)·z,
x·y = y·x,
x·1 = x,
¬ x = 0 → x·x⁻¹ = 1,
x·(y+z) = (x·y)+(x·z),
¬ 0 = 1,
0⁻¹ = 0,
0 ≤ x ∨ 0 ≤ −x,
0 ≤ x ∧ 0 ≤ −x → x = 0,
0 ≤ x ∧ 0 ≤ y → 0 ≤ x+y ∧ 0 ≤ x·y,
x ≤ y ↔ 0 ≤ y+(−x),

Vx P[x] ∧ VyΛx[P[x] → x ≤ y] → Vy(Λx[P[x] → x ≤ y] ∧ Λz[Λx[P[x] → x ≤ z] → y ≤ z]).
The last principle is called the Continuity Schema and plays a role analogous to that of the Induction Schema; that is, we consider as axioms all formulas of the present theory obtainable (again without clash of vari1) The usual formulation involves fewer primitive symbols, and hence has somewhat more complicated axioms. It is clear that several of our symbols, for instance -1 could be defined in terms of the others.
ables) by substituting a formula for P in the schema. As a final example, consider Zermelo-Fraenkel set theory, whose only non-logical constant is ∈ and whose axioms are the following:

Λu[u ∈ a ↔ u ∈ b] → a = b,
Vu u ∈ a → Vu[u ∈ a ∧ ¬Vv(v ∈ u ∧ v ∈ a)],
VaΛu[u ∈ a ↔ u = x ∨ u = y],
VbΛu[u ∈ b ↔ Vv(u ∈ v ∧ v ∈ a)],
VbΛu[u ∈ b ↔ Λx(x ∈ u → x ∈ a)],
Va[Vu u ∈ a ∧ Λu(u ∈ a → Vv[u ∈ v ∧ v ∈ a])],
ΛxΛyΛzΛaΛbΛcΛqΛr[Λu(u ∈ a ↔ u = x) ∧ Λu(u ∈ b ↔ u = x ∨ u = y) ∧ Λu(u ∈ c ↔ u = x ∨ u = z) ∧ Λu(u ∈ q ↔ u = a ∨ u = b) ∧ Λu(u ∈ r ↔ u = a ∨ u = c) ∧ P[q] ∧ P[r] → y = z] → ΛsVtΛy[y ∈ t ↔ VxVaVbVq(x ∈ s ∧ Λu[u ∈ a ↔ u = x] ∧ Λu[u ∈ b ↔ u = x ∨ u = y] ∧ Λu[u ∈ q ↔ u = a ∨ u = b] ∧ P[q])].
AxAyAz[P[(x, y)]
A
P[(x, z)]
AsVtAy[y e t <--> Vx(x
£ S A
->-
y
= z] ->-
P[(x, y)])].
134
RICHARD MONTAGUE
quantifiers on those variables. 1) The concepts of possible model and model may of course be used also in connection with other theories. For instance, a possible model of the theory of real closed fields will have the form
°
1) For an exact definition of the notion of truth in a possible model, see Tarski and Vaught [12]. 2) For the general notion of isomorphism of structures, see, for instance, Tarski [11].
ventionally identifying 1/0 with 0), and R is the usual relation of magnitude among real numbers. For set theory we first introduce a transfinite hierarchy of types. Intuitively, if a is any ordinal, T(a) is to be the a~ type in a Russellian hierarchy which begins with an empty set of individuals; the recursive definition is the following. T(O) = the empty set A. T(a+ 1) = the set of all subsets of T(a).
If λ is a limit number, T(λ) is the union of all sets T(ξ) for ξ < λ.
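The finite levels of this hierarchy are small enough to compute directly; a minimal sketch in Python (the representation by frozensets is, of course, only illustrative):

```python
# Computing the finite levels T(0), T(1), ... of the cumulative type hierarchy.

from itertools import chain, combinations

def powerset(s):
    items = list(s)
    return {frozenset(c) for c in
            chain.from_iterable(combinations(items, r) for r in range(len(items) + 1))}

def T(n):
    cur = set()                    # T(0) = the empty set
    for _ in range(n):
        cur = powerset(cur)        # T(k+1) = the set of all subsets of T(k)
    return cur

for n in range(5):
    print(n, len(T(n)))            # sizes 0, 1, 2, 4, 16; each level includes the previous one
```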
(Notice that the types are according to this definition cumulative and hence not pairwise disjoint. For instance, the empty set is a member of both T(l) and T(2). This is in no way incompatible with Russell's theory. Though that theory requires distinct symbols for the empty set of type o and the empty set of type 1, nothing prevents the two symbols from designating the same object. The identity between the two symbols is not a sentence of the theory and can be neither asserted nor denied.) By a standard model of Zermelo-Fraenkel set theory is understood a structure isomorphic to
ΛP(P[0] ∧ Λx[P[x] → P[Sx]] → ΛxP[x]).
(Here P now serves as a genuine predicate variable.) Similar the secondorder theory of real numbers and second-order Zermelo-Fraenkel set 1) For several equivalent definitions of the notion of a strongly inaccessible ordinal, see Montague and Vaught [8], which contains investigations of certain properties of the structures (T(rx), E;(T(rx))).
theory are to be exactly like the theory of real closed fields and ZerrneloFraenkel set theory respectively, except that the Continuity Schema and the Replacement Schema are to be replaced by the second-order axioms obtainable from them by prefixing universal quantifiers on P. The possible models of any of these three theories coincide with the possible models of the corresponding first-order theory. (Thus as always the collection of possible models depends only on the sequence of nonlogical constants of the theory in question.) A possible model of a second order theory will be considered a model of that theory if all axioms of the theory are true under the interpretation supplied by the possible model; individual variables are regarded as ranging over the universe of the possible model (that is, the set which is the first constituent of the possible model), and (one-place) predicate variables as ranging over the set of all subsets of that universe. Now it is well known that the models of second-order Peano's arithmetic coincide exactly with those structures which were described earlier as standard models of (first-order) Peano's arithmetic, and it is rather easily shown that the models of the second-order theory of real numbers and those of second-order Zermelo-Fraenkel set theory coincide respectively with the standard models of the theory of real closed fields, and the standard models of (first-order) Zermelo-Fraenkel set theory. These facts suggest a unification of the divergent special notions: whenever we speak of the standard models of a first-order theory T we have in mind a related second-order theory U; the standard models of Tare then identified with the models of U. In any context in which standard models are considered the theory of basic interest seems always to be a second-order theory (or, in some cases which will not arise in this paper, a theory of higher than second order). We may, of course, consider as well various first-order subtheories of the second-order theory.') For some purposes, indeed, it is essential to do so in view of certain properties - compactness and the Lowenheim-Skolern property, for example - which are possessed by all firstorder theories but not by any interesting second-order theory. Suppose that we have selected a certain second-order theory as our 1) One theory is called a sub theory of another if all theorems of the first are theorems of the second. (A theorem of a first- or second-order theory T is a first- or second-order formula of T which is true in every model of T.)
basic object of attention. How are we to select a subtheory that might be called "the corresponding first-order theory" ? Let us consider a procedure applicable to those second-order theories T whose axioms, like those in our examples, all have the form
where Po, ... , P n-l are predicate variables and ¢ is a formula without second-order quantifiers. 1) By a first-order instance within T of the formula displayed above, we understand a first-order formula obtainable (without clash of bound variables) by substituting formulas of T for the predicate variables Po' ... , P n - 1 in ¢. The first-order theory corresponding to T might then be identified with that theory whose non-logical constants are those of T and whose axioms are the first-order instances within T of axioms of T. It is according to this notion that the three second-order theories considered above can be said to have Peano's arithmetic, the theory of real closed fields, and Zermelo-Fraenkel set theory as their corresponding first-order theories. This way of selecting a first-order theory is, however, unnatural: we can easily find two equivalent second-order theories") whose first-order counterparts in the present sense are not equivalent. A much more natural procedure, and the one we shall adopt, is to identify the firstorder counterpart of a second-order theory T with that theory whose nonlogical constants are those of T and whose axioms consist of those firstorder sentences which are theorems of T. In the light of this analysis two of the first-order theories considered above, Peano's arithmetic and Zermelo-Fraenkel set theory, lose interest. Neither is equivalent to what we now regard as the first-order counterpart of second-order Peano's arithmetic or second-order ZermeloFraenkel set theory. The first-order counterparts of these two theories are, to be sure, not equivalent to any recursively axiomatized theories (or even to theories with arithmetical axiom sets, in the sense of Kleene [3]), and we may for some purposes wish to consider recursively axiomatized first-order subtheories. But Peano's arithmetic and Zermelon
1) The first-order axioms of our examples can be regarded as having this form with
= O.
2) Two first- or second-order theories are said to be equivalent if they have the same models.
Fraenkel set theory, though they satisfy this description, seem in no tangible sense pre-eminent among a wide range of other, non-equivalent theories which also qualify. The situation changes when we consider the second-order theory of real numbers. It is a result of Tarski [10] that what we now call the first-order counterpart of this theory is equivalent to the theory of real closed fields. 2. Rank-free set theory There are interesting structures of the form
order to maintain harmony with conventional model theory, which does not countenance any model with an empty set of elements.
3) The fact that these axioms give a theory with the required property is not completely obvious. Some preliminary derivations from the axioms which make the remainder of the proof rather simple may be found in the forthcoming monograph Montague, Scott, and Tarski [7].
(1)  ∀u[u ∈ a ↔ u ∈ b] → a = b,
(2)  ∃b∀x[x ∈ b ↔ ∃y(y ∈ a ∧ ∀z[z ∈ x → z ∈ y])],
(3)  ∃k∀m∀b(∀x[x ∈ m → x ∈ k] ∧ ∀x[x ∈ b ↔ ∃y(y ∈ m ∧ ∀z[z ∈ x → z ∈ y])] → b ∈ k ∨ ∀x[x ∈ a → x ∈ b]),
(4)  ∀P∀a∃b∀x[x ∈ b ↔ P[x] ∧ x ∈ a].
(In the last axiom, the familiar Aussonderungsaxiom, P is of course a predicate variable.) We may call this theory second-order rank-free set theory. (The name comes from the fact that the theory is neutral with respect to the ranks, or types, of its models.) Perhaps more interesting, at least from the viewpoint of philosophy and empirical science, is that kind of set theory which allows for the possible existence of individuals or non-sets (objects which contain no elements but differ from the empty set). We therefore consider also a modified version of the last system, and understand by second-order rank-free set theory with individuals that theory whose non-logical constants are ∈ and Σ (the latter understood as the predicate of being a set) and whose axioms are the following:

∀x[x ∈ a ↔ x ∈ b] ∧ Σa ∧ Σb → a = b,
y ∈ x → Σx,
Σa → ∃b∀x[x ∈ b ↔ ¬Σx ∨ ∃y(y ∈ a ∧ ∀z[z ∈ x → z ∈ y])],
∀a∃k∀m∀b(∀x[x ∈ m → x ∈ k] ∧ ∀x[x ∈ b ↔ ¬Σx ∨ ∃y(y ∈ m ∧ ∀z[z ∈ x → z ∈ y])] → b ∈ k ∨ ∀x[x ∈ a → x ∈ b]),
∀P∀a[Σa → ∃b(Σb ∧ ∀x[x ∈ b ↔ P[x] ∧ x ∈ a])].
(The last axiom is another version of the Aussonderungsaxiom.) A possible model of this theory will have the form
extension (possibly definitional, possibly axiomatic) of Zermelo-Fraenkel set theory with individuals1). Thus in our metatheory we allow for the possible existence of individuals. We do not commit ourselves as to their number; it may indeed be zero. We do not even need to commit ourselves as to whether the individuals form a set, although the natural approach is to assume, as in the next-to-last axiom displayed above, that they do. Now given any set U we can construct a transfinite cumulative Russellian hierarchy based on U. If α is any ordinal, T_U(α) is to be the αth type in this hierarchy; the recursive definition is the following.

T_U(0) = U.
T_U(α+1) = the union of T_U(α) and the set of all subsets of T_U(α).
If λ is a limit number, then T_U(λ) is the union of the sets T_U(ξ) for ξ < λ.
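The finite stages of this hierarchy are easy to compute when U is finite. The following Python sketch is only an illustration of the recursion just stated (the function names `powerset` and `stage` are ours, not Montague's); it builds the first few types over a two-element set of individuals, representing sets by frozensets.

```python
from itertools import chain, combinations

def powerset(s):
    """All subsets of the finite set s, returned as frozensets."""
    items = list(s)
    return {frozenset(c) for c in chain.from_iterable(
        combinations(items, r) for r in range(len(items) + 1))}

def stage(U, alpha):
    """Finite approximation of T_U(alpha):
    T_U(0) = U; T_U(a+1) = the union of T_U(a) with the set of all subsets of T_U(a)."""
    t = set(U)                    # T_U(0) = U
    for _ in range(alpha):
        t = t | powerset(t)       # successor step
    return t

U = {'i1', 'i2'}                  # two individuals
for a in range(3):
    print(a, len(stage(U, a)))    # sizes 2, 6, 66
```

Only the limit clause, which requires a union over all earlier stages, has no finite analogue in such a sketch.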
(Notice that if U is the empty set and α any ordinal, then T_U(α) = T(α).) The possible models which most naturally come to mind in connection with set theory with individuals are structures of the form
ture, a model of second-order rank-free set theory with individuals. With the help of a few auxiliary notions we can state some facts about type structures which do not depend on the number of individuals. Let 𝔄 be a type structure, and let 𝔄 have the form

T_𝔄(0) = In_𝔄. T_𝔄(α+1) = the set of x in A such that, for all y, if y E x, then y is a member of T_𝔄(α). If λ is a limit number, then T_𝔄(λ) is the union of the sets T_𝔄(ξ) for ξ < λ.
It will turn out that A = T_𝔄(α) for some ordinal α; we call the least such ordinal the rank of 𝔄. If B is any set and α any ordinal greater than 0, then there is a type structure 𝔄 with rank α such that In_𝔄 = B. If B is a non-empty set, there is a type structure 𝔄 of rank 0 such that In_𝔄 = B. If 𝔄 and 𝔄' are type structures of the same rank, and In_𝔄 and In_𝔄' have the same cardinality, then 𝔄 is isomorphic to 𝔄'. If 𝔄 and 𝔄' are type structures, and f is a one-to-one correspondence between In_𝔄 and In_𝔄', then there is at most one isomorphism between 𝔄 and 𝔄' which is an extension of f. If 𝔄 is a type structure, 𝔄 = ⟨A, R, B⟩, and T_𝔄(α) is non-empty, then ⟨T_𝔄(α), R', B⟩ is a type structure of rank α, where R' is the restriction of R to T_𝔄(α). If in second-order rank-free set theory and second-order rank-free set theory with individuals we drop the initial quantifier of the Aussonderungsaxiom and treat the result as a schema, we shall obtain two first-order theories, which we may call first-order rank-free set theory and first-order rank-free set theory with individuals respectively. Like Peano's arithmetic, these theories have no theoretical pre-eminence among a number of recursively axiomatized first-order subtheories of the corresponding second-order theories, but they have some practical interest. Various useful systems of set theory, some well-known and others as yet unexploited, can be obtained from these two first-order theories in a uniform way, by the addition of "axioms of infinity", that is, principles imposing conditions on the ranks of models. For example, Zermelo-Fraenkel set theory with individuals may be
obtained by adding to first-order rank-free set theory with individuals the axiom ∀x∃y x ∈ y,
as well as all formulas of this theory obtainable without clash of variables by substituting a formula for the two-place predicate R in the schema

∀x∃y R[x, y] → ∃b[∀x(x ∈ a → x ∈ b) ∧ ∀x(x ∈ b → ∃y[y ∈ b ∧ R[x, y]])]
(which may, to borrow a term from Raymond Smullyan, be called the Principle of Disjunctive Closure). Additional examples are furnished by Zermelo-Fraenkel set theory and the set theory of Morse, formulations of which can be obtained similarly, starting from first-order rank-free set theory. Among the less familiar examples are a theory T1 which, roughly speaking, bears the same relation to the theory of Morse as that theory bears to Zermelo-Fraenkel set theory, a theory T2 which bears the same relation to T1, and so on; the latter theories seem relevant to foundational problems arising in connection with several branches of mathematics, for instance, abstract algebra, model theory, and algebraic topology.1)

3. Higher-order logic
The problem of defining truth (or more precisely, truth in a model) for sentences containing variables of transfinite type seems never to have been completely settled in the literature, but can be rather easily and naturally solved with the aid of considerations in the preceding section. As ingredients of higher-order formulas we assume the following disjoint categories of symbols to be available: (1) the logical constants listed in footnote 2, p. 131, (2) an additional logical constant η, regarded as indicating membership, (3) for each natural number n, a collection (whose cardinality need not be specified here) of n-place predicates, (4) for each natural number n, a collection (again of unspecified cardinality) of n-place operation symbols, and (5) for each ordinal α, a denumerable set of variables of type α.
1) For a discussion of a few of these problems see MacLane [4].
The quantifiers and = may apply to variables of any type; the predicates and operation symbols, on the other hand, may apply only to individual variables (that is, variables of type 0) or, somewhat more generally, to individual terms.
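The following Python sketch is our own bookkeeping illustration of this inventory (the class names and the helper `well_applied` are invented for it, only finite type ordinals are represented, and individual terms built with operation symbols are ignored for brevity): each variable carries the ordinal giving its type, and predicates are checked to apply only to variables of type 0, while quantifiers and = are unrestricted.

```python
from dataclasses import dataclass

@dataclass(frozen=True)
class Variable:
    name: str
    type_ordinal: int          # the type of the variable; 0 = individual variable

@dataclass(frozen=True)
class Predicate:
    name: str
    places: int                # an n-place predicate

def well_applied(pred, args):
    """Check the restriction stated above: an n-place predicate may be applied
    only to n individual variables (variables of type 0)."""
    return len(args) == pred.places and all(v.type_ordinal == 0 for v in args)

x = Variable('x', 0)           # an individual variable
P = Variable('P', 1)           # a one-place predicate variable (type 1)
R = Predicate('R', 2)          # a two-place predicate

print(well_applied(R, [x, x])) # True
print(well_applied(R, [x, P])) # False: predicates do not apply to higher-type variables
```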
We understand by an individual term an expression t which is a constituent of some finite sequence s such that each constituent of s is either an individual variable or, for some natural number n, the concatenation of an n-place operation symbol with n earlier constituents of s; by a term either an individual term or a variable of type greater than 0; and by a higher-order formula an expression φ which is a constituent of some finite sequence s such that each constituent of s is either (1) t = u or t η u, for some terms u and t, (2) the concatenation of an n-place predicate with n individual terms, for some natural number n, (3) the negation of an earlier constituent of s, (4) the conjunction, disjunction, implication, or equivalence formed from earlier constituents of s, or (5) ∀vφ or ∃vφ, where φ is an earlier constituent of s and v a variable of arbitrary type.

This characterization of higher-order formulas could have been more liberal in two ways. In the first place, the only higher-order variables we have admitted are one-place predicate variables. We could also have included predicate variables of various numbers of places (and of various types). Such an approach is appropriate in connection with a truncated higher-order logic, in which a finite upper bound is placed on the types considered. When no such bound is present, however, everything expressible by use of predicate variables of several places can also be expressed by use of one-place predicate variables; and it seems desirable to avoid the unpleasantly complicated type hierarchy that predicate variables of various numbers of places would introduce. A second possible extension of the present approach would consist in admitting predicates and operation symbols that meaningfully apply not to individual terms but to variables of higher type. Such an approach would require a more general notion of a model than the one usual in the literature, which is also the one defined below. The more general notion of a model would indeed have interest. It would permit a unified treatment of such structures as topological spaces, uniform spaces, and systems of classical particle mechanics of a given number of dimensions,1)
1) The last phrase is used in the sense of McKinsey, Sugar, and Suppes [5].
which cannot well be construed as first-order structures (that is, models in the usual sense). But a discussion of such matters is best deferred to another occasion. On the other hand, it is possible to imagine an approach of a more, rather than less, restrictive character than ours. We could have imposed some condition of stratification on the formulas t = u and t η v, where t, u, v are variables - for instance, that t and u have the same type, and that the type of v be greater by one. It seems preferable not to impose such restrictions, however; they would lead to no simplification in the problem of interpreting higher-order formulas but only to a considerable reduction in power of expression. In the examples considered earlier a model (or a possible model) was construed as a sequence, and position in that sequence determined the intended correspondence between interpretations and symbols to be interpreted, which were themselves given in a sequence. In the general situation it is more convenient to establish this correspondence directly, by means of a function. Thus we now understand by a model an ordered pair
Now let
F(n), (2) for each ordinal α, the variables of φ of type α are regarded as ranging over the set T_A(α), (3) = is interpreted as the identity relation, and (4) η is interpreted as membership.1) Thus the types are regarded as cumulative; a consideration of transfinite types would indicate rather clearly the desirability of this course. Furthermore, the choice between cumulative and non-cumulative types has no effect on the truth of second-order formulas satisfying the usual conditions of stratification. In particular, let φ be a higher-order sentence such that (1) whenever u = v is a subformula of φ, both u and v are individual terms, and (2) whenever u η v is a subformula of φ, u is an individual term and v is a variable of type 1. In addition, assume that
be regarded as true in
preted as the identity relation, and (iv) η is interpreted as the relation R, where 𝔅 has the form
References
[1] Kurt Gödel, The Consistency of the Continuum Hypothesis (Princeton 1940).
[2] John L. Kelley, General Topology (Princeton 1955).
[3] S. C. Kleene, Introduction to Metamathematics (Princeton 1952).
[4] Saunders MacLane, Locally Small Categories and the Foundations of Set Theory. Infinitistic Methods (Warsaw 1961).
[5] J. C. C. McKinsey, A. C. Sugar, and Patrick Suppes, Axiomatic Foundations of Classical Particle Mechanics. Journal of Rational Mechanics and Analysis 2 (1953) 273-289.
[6] Richard Montague, Reductions of Higher-order Logic. Proceedings of the International Symposium on the Theory of Models (Amsterdam, forthcoming).
[7] Richard Montague, D. S. Scott, and Alfred Tarski, An Axiomatic Approach to Set Theory (Amsterdam, forthcoming).
[8] Richard Montague and R. L. Vaught, Natural Models of Set Theories. Fundamenta Mathematicae 47 (1959) 219-242.
[9] Patrick Suppes, Axiomatic Set Theory (Princeton 1960).
[10] Alfred Tarski, A Decision Method for Elementary Algebra and Geometry, 2nd ed. (Berkeley and Los Angeles 1951).
[11] Alfred Tarski, Contributions to the Theory of Models I. Indagationes Mathematicae 16 (1954) 572-581.
[12] Alfred Tarski and R. L. Vaught, Arithmetical Extensions of Relational Systems. Compositio Mathematica 13 (1957) 81-102.
EXISTENCE IN LESNIEWSKI AND IN RUSSELL
A. N. PRIOR
Manchester University, Manchester, UK
Anyone who learns his logic in Manchester, Notre Dame or Chapel Hill, is bound to hear a good deal about Lesniewski's logic, and especially about the discipline that Lesniewski called "ontology"; but anywhere else in the world, even in Warsaw, the student is likely to find Lesniewski's name hardly mentioned. I suspect that one of the reasons for this is that Lesniewski's theories, and again I have in mind especially his ontology, have often been rather puzzlingly presented. It is often said by its advocates that ontology is an answer to an early prayer of Russell's. Principia Mathematica contains a theorem, namely *24.52, which asserts that the universal class is not empty, that is, that there is at least one individual. And this is a theorem which Russell found an embarrassment - in a footnote to his Introduction to Mathematical Philosophy (p. 203) he describes it as "a defect in logical purity". In Lesniewski's ontology this defect, if it is one, doesn't exist - ontology is compatible with an empty universe. What is puzzling is the explanation which is commonly given of this achievement. The lowest-type variables of ontology are described, like Russell's lowest-type variables, as standing for names; but it is said that whereas Russell's variables stand for singular names only, Lesniewski's stand equally for empty names, singular names and plural names. Existence is therefore something that can be significantly predicated with an ontological "name" as subject - "a exists" is a well-formed formula, and is in some cases but not in all cases true, and that it is true in some cases is, although true and statable, not a theorem of the system. Another peculiarity, connected with the preceding ones, is that the ontological symbol "ε", unlike the Russellian symbol "ε",
stands between expressions of the same logical type. "a ε a", for example, is well-formed. What are we to make of all this? I want to suggest that what we are to make of it is that ontology is just a broadly Russellian theory of classes deprived of any variables of Russell's lowest logical type. Ontology's so-called "names", in other words, are not individual names in the Russellian sense, but class names. This immediately explains the first two of the peculiarities I have mentioned. For while it makes nonsense to divide up individual names in this way, class-names are divisible into those which apply to no individuals, those which apply to exactly one, and those which apply to several. It makes sense also to say that some classes "exist", either in the sense of having at least one member or in the sense of having exactly one member, and some classes do "exist" in these senses and some do not. The disappearance of the theorem that there is a non-null class still requires explanation, and so does the type-homogeneity of the arguments of the functor "ε", but we shall consider these points shortly. Before getting on to that, I want to mention one feature of Lesniewski's so-called "names" which exponents of his theories don't generally make much of, but which seems to me to tell quite conclusively in favour of interpreting them as class-names, namely that they can be logically complex. Given any pair of Lesniewskian names we can for instance form their logical product and their logical sum, and we can construct a name which is logically empty, e.g. the compound name "a and not-a". Russellian variables of lowest type, on the other hand, are logically structureless - you can construct other things out of them, together of course with other symbols; for example, you can construct a one-place predicate (e.g. "- shaves Peter") out of a two-place predicate and a name; but there is nothing out of which you can construct Russellian individual names. (Definite descriptions, e.g. "the x such that x shaves Peter" are notoriously not "names" in Russell's view.) The formal development of a Russellian class theory without variables of the lowest type presents, however, some very taxing problems. For as Russell presents this theory, individual variables are not just an optional appendage which can be lopped off without damaging the rest of the system. On the contrary, Russell regards classes as logical constructions out of individuals and functions of individuals. He has so to speak a primary language and a secondary one. In the primary language there
are just individual names, functors forming sentences out of these, functors of higher type operating on the preceding functors, and so on. His class theory is merely a set of convenient alternative locutions by which talk about individuals can sometimes be replaced. The basic sentences of this class language are of the form "x ε α", asserting an individual's membership of a class, and where "α" is, say, the class of things that f, the form "x ε α", "x is an f-er", is simply a re-writing of "x f's", or more accurately of "For some g, such that if anything f's it g's and vice versa, x g's". And when we wish to define complex classes, for example the logical product of two classes, or the null class, we fall back again and again on this basic form - the logical product of the α's and the β's, for example, is the class of x's such that x is an α and x is a β - these x's, these individual names, are just not dispensable. Lesniewski meets this difficulty by introducing an undefined constant expressing a relation between classes - it can be, but it does not need to be, the functor "ε" previously mentioned. This functor, as I have also previously said, has arguments of the same logical type, so that what it expresses is not Russellian class-membership. It expresses rather the inclusion of a unit class in another class. This interpretation of the Lesniewskian "ε" was suggested some time ago by Jerzy Los, and although Lesniewski himself did not like it, no other interpretation of the symbol seems to me intelligible. It is tantamount to reading the form "a ε b" as "The a is a b", or "There is exactly one a and every a is a b". There are of course Russellian forms, though not the form "x ε α", that have this meaning. And the Lesniewskian form "a = b" does not express Russellian class identity - "The a's coincide with the b's" - but means rather "The a is the b", that is, "There is exactly one a and exactly one b and they are the same". But this is not quite Russellian individual identity either - "The a is the a" is false if there are no a's, or if there are several, but the Russellian "x = x" is a law of the system, and is in fact definable as the obvious truism "For any f, if x f's then x f's". So if we define individual existence as Lesniewskian self-identity, it amounts to a class's being a unit one, and is predicable of some classes but not of others, whereas if we define it as Russellian self-identity it is predicable of everything a Russellian name can stand for. Complex classes can also be defined by using the Lesniewskian "ε". Lesniewski's rules of definition are in fact a little complicated, more
complicated than those which are required in Russell's primary language, the language of individual names and functors operating on these; but they constitute an interesting solution of a genuine and interesting problem, and they are much tidier than the rules for translating Russell's secondary language, in which he talks about classes, into his primary one; they do not give rise, for example, to scope ambiguities. And while Lesniewski's procedure requires the introduction of a primitive constant that Russell's doesn't need, since Russell can get all the constants he wants in his primary language out of propositional calculus and quantification theory, this very fact gives Lesniewski greater freedom over what his class theory will or will not contain. For this constant "ε", in terms of which existence and non-existence, for example, are definable, must have special axioms laid down for it, and it is easy enough so to choose these axioms that the formula "For some a, the a exists", or "For some a, the a is the a" is not a theorem. No Lesniewskian denies that it is a truth, but it is not a provable truth in Lesniewski's system. From Russell's system, on the other hand, it is impossible to delete this theorem, for what Russell means by the non-emptiness of the universe is that for some x, x is x, that is, for some x, for every f, if x f's then x f's, and this is provable from propositional calculus and quantification theory alone. Or if we can eliminate it by attaching complicated provisos to the ordinary rules of quantification, the resulting system is very difficult to interpret. It is profitable at this stage to ask ourselves whether the appearance of this theorem in Principia Mathematica really does indicate a "defect in logical purity", and if so what is the source of this infection. Basically it comes from the interpretation of Russell's lowest-type variables as standing for individual names, that is to say symbols whose only contribution to what is actually said by the sentences in which they occur is the identification of the individual objects that the sentences are about. If such a symbol fails to identify any individual object, the sense of the sentence is incomplete and nothing is really said. Using the word "This" as a symbol of this sort, what is said by "This exists" is bound to be either a truth or nothing at all. Hence the assumption that there are complete statements of the form "fx", where "x" is a symbol of this kind, already involves the non-emptiness of the universe. One way of purging logic of this assumption would be to conceive quantification
theory as being concerned simply with the application of quantifiers to functors with their arguments, without regard to what parts of speech these functors and arguments are. The form of quantification theory would in fact be unchanged if we interpreted Russell's lowest type variables as standing for Lesniewskian "names", that is to say class names, and his predicates for functors forming sentences out of these. The form "For some x, for all f, if fx then fx", or as we might now prefer to write it "For some a, for all f, if fa then fa" would still be provable from propositional calculus and quantification theory alone, and indeed it is so provable in ontology, but it now carries no existential implications, since an a that would instantiate it could be an empty class. If, however, we may regard Russell's interpretation of his lowest-type variables as an extra-logical matter, we may equally so regard the undefined constant which is required in ontology. For as we have seen we can define existence in terms of this constant, and we can formulate the proposition that for some a, the a exists, even if we so choose our axioms that we cannot prove it. Such a choice of axioms seems, indeed, strangely arbitrary - the proposition concerned is, after all, formulated purely in terms of the constants and variables of the system, and is acknowledged to be true, so if those constants are regarded as purely logical, why is the truth not so regarded? I cannot really see much sense in this. It may seem from what I have said that ontology, on my interpretation of it, is committed to the existence of classes as nameable entities, though in fact Lesniewski was notoriously nominalistic. But this is a misunderstanding, arising from the use of the perhaps unfortunate term "class-name". What we have to do with here are common nouns, and these are not strictly speaking names of objects at all. When we read the Russellian "x ε α" as "x is a member of the class of α's", for example, "Russell is a member of the class of men", this looks as if we are asserting a relation between a concrete object and an abstract one, but the theory of types itself should warn us that this is not quite right, and we might do better to read the form simply as "x is an α", for example "Russell is a man". Here the form "is a" is not quite a proper verb, that is a functor which makes sentences out of individual names; rather it makes a sentence out of a name and a common noun. And the functors which join the a's and b's in ontology, and the α's and β's in class theory, are not, properly
speaking, predicates; they are functors like "Every - is a -", "The - is a -", "There is no such thing as a -". In fact, these functors which take arguments of Lesniewski's lowest type include ordinary numerical functors, like "There is exactly one -", "There are exactly forty-three -s", and so on. It is no doubt convenient to use forms like "The class of a's is an empty class", "The class of a's is a member of the class of pairs", and so on, and Lesniewski introduces a higher-order "ε" which is so defined that "f ε g" may be read as "The unit class-of-classes f is included in the class-of-classes g". But these are no more than convenient locutions; "The class of a's is an empty class", for example, means no more and no less than "There is no such thing as an a", from which the suggestion of naming an abstract object, the class of a's, has been removed. It is true that Lesniewski quantifies over variables of his lowest type, and indeed over variables of all types, and there is a doctrine current among some American logicians that any variable subject to quantification thereby counts as standing for a name, but this seems to me a quite eccentric criterion of namehood. What ontology in fact does is to combine the maxim that only individuals are real with the view that the only way we can linguistically get at individuals is by speaking of them as what certain common nouns apply to - maybe uniquely; and that their application is unique is of course something that can be said within the system, not by having Russellian individual names in it, but by having as it were an individuating functor, namely the Lesniewskian "ε" or "The - is a -". The phrase "The so-and-so" is not, as it is in Frege, itself an individual name; there are no individual names; but the phrase does occur as part of the larger functor, and so to speak individuates, or purports to individuate, as it makes the full statement. There are many contemporary philosophers, here in Oxford for example, who are not very happy about Russellian individual names, and would rather like to do without them, and ontology seems to me worth offering to these philosophers as a system in which their programme is really carried out. In fact it may be far less important as an answer to one of Russell's prayers than as an answer to one of the prayers of the anti-Russellians. Lesniewski's own system is, indeed, characterised by an extreme extensionalism which is not likely to appeal very much to the philosophers I have in mind, and for that
matter it doesn't appeal to me either; this extensionalism, moreover, is as thoroughly wrought into Lesniewski's methodology - underlying, for example, his rules of definition - as the use of individual names is wrought into Russell's theory of classes. However, I am sure that with a little trouble one can disentangle the more desirable features of ontology from this less desirable one, just as ontology itself disentangles the pure theory of common nouns from its Russellian name-and-predicate basis.
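The Łoś reading of the Lesniewskian "ε" discussed above can be made concrete in a small sketch (ours, not Prior's; the representation of "names" as Python sets of individuals, and the function names, are assumptions of the illustration): "a ε b" becomes "there is exactly one a and every a is a b", existence becomes self-ε, and "a ε a" then holds exactly for the unit classes.

```python
def epsilon(a, b):
    """Los reading of 'a epsilon b': there is exactly one a, and every a is a b."""
    return len(a) == 1 and a <= b          # <= is subset for Python sets

def exists(a):
    """'The a exists', defined as 'the a is an a'."""
    return epsilon(a, a)

def identical(a, b):
    """Lesniewskian 'a = b': exactly one a, exactly one b, and they coincide."""
    return len(a) == 1 and len(b) == 1 and a == b

men = {'russell', 'lesniewski', 'prior'}   # a plural name
russell = {'russell'}                      # a singular name
pegasus = set()                            # an empty name

print(epsilon(russell, men))   # True: "The Russell is a man"
print(exists(pegasus))         # False: empty names name nothing
print(epsilon(men, men))       # False: 'a epsilon a' fails for plural names
```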
FUNCTIONS AND ROGATORS 1)
A. SLOMAN
University of Sussex, Brighton, UK
Section A

1. The concept of a "function", though frequently used by logicians (e.g. in talking about truth-functions or propositional functions), has rarely been discussed systematically since the writings of Frege and Russell. The notion may be approached in several different ways, either syntactically, through the notion of a "function-sign", or semantically, through the notion of what corresponds to such signs. The syntactical approach may either deal with function-signs as "incomplete" or "unsaturated", following Frege, or it may deal with them as complete signs (e.g. signs containing variable-letters "x", "y" etc., or signs prefixed with Church's lambda operator). The latter approach is more common, the former more fundamental. (A similar distinction could be made at the semantic level: see end of par. 11, below.) The semantic approaches may also be subdivided into two sorts, depending on whether they are intensional or extensional. Once again, the latter is more common, the former more fundamental. It would be of considerable interest and importance for the philosophy of logic to analyse these various approaches and their interrelations, especially as most modern text-books are somewhat narrow, favouring one or other approach as the only acceptable one, others being, at best, mentioned with a few disparaging remarks.2)
1) I wish to thank Michael Dummett and members of the Philosophy department at Hull University, for helpful comment and criticism at various stages in the development of this paper.
2) See, for example, P. Suppes, Introduction to Logic (Princeton 1957) 229f. Similar remarks are made by A. Tarski on p. 72 of his Introduction to Logic (New York 1946).
2. In this paper only the semantic conceptions will be discussed. An attempt will be made to explain the difference between the extensional and the intensional approach, with the aim of showing that the latter is not just a confused version of the former, but is something quite different, and, in one sense, prior to the other. This is not a new suggestion. Some of what I have to say has been said before, e.g. by Russell (in Introduction to Mathematical Philosophy, p. 12 ff, p. 183 ff) and F. P. Ramsey (in The Foundations of Mathematics, p. 15). But I am not aware of the existence of any detailed discussion of the distinction or its applications. This first section will be devoted to a brief explanation of the distinction. The next (section B) will compare it with other distinctions likely to be confused with it. In the final section (C) some applications will be mentioned.
3. The following notion of a function-sign will be assumed to be familiar: a function-sign is obtained from a sentence or referring expression (for example), by replacing one or more words or phrases in it by so-called "variable-letters" such as "x" or "y". Examples are: "The mother of x", "The town in which x was born", "y is the father of x". The semantic approach involves regarding a function as something which, in some sense, corresponds to such a function-sign. It is said to take arguments and yield values correlated with the arguments. If a name or sign for an argument is substituted for each variable-letter in a function-sign, then the result is taken to be a name or sign for the value correlated by the corresponding function with that argument or set of arguments. The things which correspond to function-signs and which take arguments and yield values are normally described as "functions", but I shall use two words "function" and "rogator" to mark the difference between the extensional and the intensional concepts. (This is less cumbersome than talking about "extensional functions" and "intensional functions", and avoids confusion which might arise out of the fact that this latter terminology has been used to mark another distinction, to be mentioned below. I retain the word "function" for the extensional concept, since that seems to be its normal use at present, though it could be
argued that the normal meaning is somewhat indefinite in this respect. A mathematician recently said to me that he thought of a function as a sort of machine, which churned out numbers as numbers were fed into it. This could be taken as an intensional explanation.) Functions and rogators, then, are thought of as corresponding to function-signs, and as taking arguments and yielding values. This much they have in common.
4. In order to explain the difference between functions and rogators we need the notion of "extensional equivalence". Two functions, or two rogators, are said to be extensionally equivalent if (a) they each have values for the same arguments, and (b) they correlate the same values with the same arguments. That is, two functions, or rogators, "Fx" and "Gx" are extensionally equivalent if, and only if, (x) (y) [(y
= Fx) ==
(y
=
Gx)].
The difference can now be explained. Extensional equivalence is a necessary and sufficient condition for the identity of functions, but not for identity of rogators. Thus, the functions corresponding to "the mother of x" and "the woman first loved by x" may be extensionally equivalent, in which case they will be one and the same function. But this does not mean that there will be one and the same rogator corresponding to them, even though the two rogators are extensionally equivalent. To say that rogators are intensional entities, then, is simply to say that extensional equivalence does not guarantee identity of rogators: there are no further metaphysical or psychological implications. Of course, this account of the difference between rogators and functions is not a definition of either. We may regard it as a partial definition, or a criterion for adequacy of a definition of the notions. Let us now see if we can give adequate complete definitions. 5. If we are allowed to make use of the notion of a set, then we can define "function" in the familiar way as a set of ordered pairs satisfying the condition that no two pairs in the set have the same second element. Since sets satisfy extensional criteria for identity it follows that this definition of "function" fits the criterion of the previous paragraph. That is, if two functions contain exactly the same ordered pairs, then they are identical, since the sets of ordered pairs are identical. But containing
exactly the same ordered pairs means correlating the same values with the same arguments. So we have found something (the usual thing) that can be called a "function". 6. It is not so easy to give a full definition of "rogator". We want things which take arguments and correlate them with values, thereby generating sets of ordered pairs, but which do not satisfy the extensional criterion for identity. I claim that we are talking about such things whenever we talk about functions as pairing off elements according to a plan1), or mention the way in which a function yields or produces its value from its argument2) or the principle of classification3). For example, it is clear that to the two expressions (1) "The sum of the first x odd numbers" and (2) "The number which is equal to the product of x by itself" there correspond two different methods or principles of calculation, even though when applied to positive integers they always give the same result. So although the two functions (on the domain of positive integers) corresponding to (1) and (2) are identical, there are other things which are not. Hence these other things, namely the rules, methods or principles (etc.) do not satisfy extensional criteria for identity. Let us therefore say that in talking about rogators we are simply talking about these other things, in effect, and that the criteria for identity of rogators are simply the criteria which we normally use for identifying and distinguishing these other things. Then talking about an object as an argument for a rogator which correlates it with a value, is just a neater, and more general, way of talking about the object as something to which a rule or principle may be applied in order to yield a result or outcome of the application. I shall not attempt to give an explicit definition of "rogator" in terms of "rule" or "method" or "principle", etc., since (a) these terms are in some contexts subject to the extensional-intensional ambiguity themselves, (b) it is not clear that their use is sufficiently general and (c) it would be odd to describe them as having arguments and values. The connection between the concept of a "rogator" and these other concepts will simply have to
1) W. V. O. Quine, Mathematical Logic (Cambridge, U.S.A. 1955) 198.
2) A. Church, Introduction to Mathematical Logic (Princeton 1956) 16.
3) F. P. Ramsey, The Foundations of Mathematics (London 1931) 15.
be hinted at and illustrated by the remarks already made, and the examples which will now be discussed.
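As a computational illustration of this point (ours, not Sloman's; the function names and the choice of a finite domain are assumptions of the sketch), the two rules of par. 6 can be written as two distinct pieces of program text, yet the sets of ordered argument-value pairs they generate on an initial segment of the positive integers coincide, so in the extensional sense they determine one and the same function.

```python
def sum_of_first_odd(x):
    """Rule 1: add up the first x odd numbers."""
    return sum(2 * k + 1 for k in range(x))

def product_with_itself(x):
    """Rule 2: multiply x by itself."""
    return x * x

domain = range(1, 101)          # a finite fragment of the positive integers

graph1 = {(x, sum_of_first_odd(x)) for x in domain}
graph2 = {(x, product_with_itself(x)) for x in domain}

print(graph1 == graph2)                          # True: one and the same set of ordered pairs
print(sum_of_first_odd is product_with_itself)   # False: two distinct rules
```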
7. We have admitted that the functions "the woman first loved by x" and "the mother of x" may be identical. But even if they are, it is clear that the principle by which we pick out a woman as someone's mother is quite different from the principle by which we select a woman as the first person loved by that person, even if we end up with the same woman in each case. So here we are applying non-extensional criteria for identity of the principles involved, and these criteria enable us to distinguish two rogators, even if there is only one function. Again, if we consider the expression "the town in which x was born", then we may say that there is a function corresponding to it which correlates (some, but not all) persons with towns. Suppose that Aristotle's first pupil, whoever he was, was born in Athens. Then Athens is the value of the function for that man as argument. But that man might have been born elsewhere, for example if his mother had decided to go on holiday just before his birth. In that case a different town would have been the value for the same man as argument. But a value of what? A different town could not be the value of the same function, for then the set of ordered pairs would be different, and so, since a function just is a set of ordered pairs (or at any rate something satisfying extensional criteria for identity), it would be a different function. Hence, if, as seems quite natural, we wish to say that the same something might have had a different value for the same argument, then, if we are not to contradict ourselves, we must regard the "something" as not satisfying extensional criteria for identity. Clearly, it is the same rogator that is wanted: for one and the same principle or rule might have correlated the same man with a different town if he had been born not in Athens but elsewhere. That is, the rogator corresponding to that rule might have had a different value for the same argument. 8. It is important to note that the remarks made in the previous paragraph could not have been made, and the reader would not have understood them, if they had not employed the concept of a rogator or some other non-extensional concept. We can therefore take the fact that the remarks are intelligible as demonstrating that there are such things as rogators, or at least that the concept of a rogator is a coherent one, and
not unfamiliar. There is a further argument, used by Russell, to show that there must be rogators. The argument is simply that unless there were such things we should not be able to talk about individual functions such as "the square of x" or "the town in which x was born", whose domains are either infinite or unsurveyable on account of being scattered about in space and time. For how can I have this set of ordered pairs in mind rather than that, and how can I know that you and I are talking about the same set in these cases? The function "the square of x" contains infinitely many different pairs of numbers, and the other function includes pairs containing persons and towns that I have never seen or heard of (especially if it applies to persons who have lived in the past, or will live in the future). So in neither case can I say that I have in mind just this function because I have identified all the pairs in it. And I cannot say that I am sure you have the same function in mind on the basis of having checked through the set of pairs which you have in mind. Thus, if there is one function that I have in mind, and if you have the same function in mind, it can only be because we use some principle, or rule, i.e. a rogator, according to which we can tell whether an ordered pair does or does not belong to the function in question. It follows that since we can and do identify and talk about functions with infinite or unsurveyed domains, there are such things as rogators. I am not saying that extensional functions could not exist if there were no rogators, only that individual ones could not be talked about or even thought about without them. (Though as pointed out by Ramsey in The Foundations of Mathematics, pp. 15 and 22, it may be possible to make general assertions about them, not mentioning individual ones, without presupposing the existence of a rogator. Whether there are some functions - or sets - to which no rogators correspond, so that they cannot be talked about or thought about individually, is a question which I shall not discuss. One form of Platonism involves giving an affirmative answer to this question. This sort of view seems to have lain behind the axiom of reducibility, and Ramsey's claim that the axiom was unnecessary.)
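Russell's point can be put computationally (again our illustration, not anything in the text): the function "the square of x" has infinitely many pairs and so cannot be surveyed, but a rule decides of any proposed ordered pair whether it belongs to the function.

```python
def belongs_to_square_function(pair):
    """Decide whether an ordered pair (argument, value) belongs to the function
    'the square of x', without enumerating the infinite set of pairs."""
    x, y = pair
    return y == x * x

print(belongs_to_square_function((12, 144)))      # True
print(belongs_to_square_function((10 ** 50, 3)))  # False, far beyond anything we could list
```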
9. These considerations seem to establish that there are such things as rogators and that they are, in one sense at least (namely, epistemologically), prior to functions. Although this fact was acknowledged by Russell, he did not wish to pay much attention to it, since he was preoccupied
with giving mathematics a logical foundation, and apparently thought this could be done without introducing intensional considerations. (See Introduction to Mathematical Philosophy, p. 187.) This may also explain why Frege apparently was not very interested in an intensional approach to the concept of a function. It should be noted at this stage that, although I have indicated in a rough sort of way what sorts of things rogators are, I have not yet given a definition, for I have not yet stated a set of necessary and sufficient criteria for identity of rogators. Certainly if "Fx" and "Gx" are the same rogator (involve the same rule or principle) then they must correlate the same arguments with the same values, and if a certain argument (e.g. Aristotle's first pupil) would have been correlated with a different value by "Fx" (e.g. "the town in which x was born") if the world had been different, then in the same conditions "Gx" would have had the same value for that argument. So extensional equivalence in all possible states of affairs (i.e. necessary extensional equivalence) is a necessary condition for identity of rogators. But we do not wish to say that it is a sufficient condition, since we wish to say that the two rogators mentioned in par. 6, namely "the sum of the first x odd numbers" and "the number which is equal to the product of x by itself" (defined on the domain of positive integers), are different rogators, despite the fact that they necessarily, i.e. in all possible states of the world, have the same values for the same arguments. It might be thought that the only difference is that they correspond to different signs, that is that the criteria for identity of rogators are partly syntactical. But this is not so, for it is possible that in some strange language the expression "the sum of the first five odd numbers" means what we mean by "the number which is equal to the product of five by itself", in which case the rogator corresponding to their expression "the sum of the first x odd numbers" would be different from the rogator corresponding to ours, since it would correspond to a different principle of calculation, despite the syntactical and extensional equivalence. These remarks should suggest that it is not easy to give necessary and sufficient criteria for identity of rogators, i.e. to explain, in a clear and non-circular manner, how we identify and discriminate rules or principles or methods of calculation. Ultimately, we simply have to make use of something like the notions "same pattern" and "different pattern", i.e. the notions of identity and difference of properties or universals. All explanation of meanings must start with examples, and it
seems clear that we have here something which can be taught by means of examples, but which cannot be described, except in a circular manner. I shall therefore not attempt to formulate sufficient criteria for identity of rogators. 10. Normally, when we wish to talk about a function, we specify the one in question not by enumerating its arguments and values, but by indicating some principle according to which they can be picked out. And this is usually adequately achieved by the use of a function-sign as illustrated in par. 3, above, for if the function-sign is constructed out of parts which have unambiguous meanings, the method of construction, together with those meanings, uniquely determines a principle or rogator. This permits us to talk about the rogator corresponding to such a sign, just as we talk about the function corresponding to it. We could, of course, introduce a notation for talking about functions and rogators by enclosing the function-sign in different sorts of quotation marks or by using prefixes, such as Church's prefix "λx-" for functions, and perhaps "ρx-" for rogators. However, if we talk about "the function 'Fx'" or "the rogator 'Fx'" there should be no ambiguity. (In such locutions the letter "x" is, of course, a sort of bound variable.) To one function there generally correspond many different rogators, since one and the same set of ordered pairs may be picked out in many different ways, i.e. according to many different rules or principles. Since, for reasons mentioned, no complete definition of "rogator" has been given, it may be useful to compare and contrast the function/rogator distinction with several other distinctions with which some may be inclined to confuse it.
Section B

11. Near the end of section 71 of The Logical Syntax of Language Carnap implies that Frege's distinction between a function and its value-range (Wertverlauf) is a distinction between intensional and extensional entities. But this seems to be a misunderstanding, for this distinction of Frege's is a distinction between entities which are "complete" and entities which are "incomplete" or "unsaturated", and, as far as I can see, has nothing to do with different criteria for identity. Frege did not use "function" to mean "set of ordered pairs", since he
defined the notion of a set or class in terms of the notion of a function. Nevertheless, it seems likely that he thought of functions in an extensional way, since he thought of concepts as being functions of a certain sort, and he thought of them as extensional. For he wrote: "coincidence in extension is a necessary and sufficient criterion for the occurrence between concepts of the relation corresponding to identity between objects". (See Translations from the Philosophical Writings of Gottlob Frege, by Geach and Black, p. 80, and also "Class and Concept" by P. T. Geach, in Philosophical Review, October 1955.) Strictly speaking, Frege could not regard the relation of identity as applicable to functions, since, for him, they were "incomplete" or "unsaturated", and this was why he had to introduce value-ranges. (Loc. cit. pp. 26ff.) Frege's distinction between complete and incomplete entities seems to be based, in the first place, on a syntactical distinction between function-signs and argument-signs. (Loc. cit. pp. 12ff., 32, 113ff., 152.) He apparently thought that the analysis of (say) a sentence into argument-sign and function-sign could be parallelled by analysis of what was expressed into function and argument, or, in some cases, concept and object. But the important thing about his functions, or concepts, was not intensionality but incompleteness. So Frege's distinction was not the same as the function/rogator distinction. Indeed, a follower of Frege might argue that just as Frege distinguished between "incomplete" functions and "complete" value-ranges, so ought I to distinguish between "incomplete" and "complete" sorts of rogators. The incomplete ones would correspond to Frege's incomplete function-signs, such as "the mother of ...", whereas the complete ones would correspond to complete signs or names for rogators, such as "the rogator mentioned in the previous sentence". So Frege's distinction cuts across mine. 12. Next it may be thought that the notion of a rogator might be explained in terms of the notion of a function by saying that if R is the rogator corresponding to the function-sign "Fx", and F is the corresponding function, then R is just a function which takes different arguments and values from F, as follows. If any object is taken as an argument of F, then that argument must be picked out or identified in some way, and the method by which it is picked out will fix the sense of the argument-sign which refers to it. Similarly, any sign which picks out the value of
F for a given argument must have a sense, corresponding to the way in which the thing is picked out. The suggestion I am considering is that R is just a function from senses to senses: that is, instead of taking objects as arguments and values, it takes senses of argument-signs and correlates them with senses of signs for the corresponding values of F. So R is supposed to be a set of ordered pairs of senses of signs, or a set of ordered pairs of ways of identifying arguments and values. Now there is no reason at all why we should not talk about such functions from senses to senses, and it seems certain that they would mirror some of the properties of rogators, such as there being many different rogators corresponding to one function. But the argument of par. 8 shows clearly that such "second-level" functions will not do everything that rogators can do. In particular they cannot explain how we are able to think and talk about particular functions whose arguments and values we cannot enumerate. For, if there are too many arguments, then there will automatically be too many senses of possible argument-signs, or ways of identifying objects, since every one of the arguments may be referred to in many different ways. Hence, if F2 is a second-level function from senses to senses, corresponding to the "unsurveyable" function F, then F2 will be even more unsurveyable, and we shall need a rogator in order to talk about it! This argument is most important, for it can be used against any attempt to construe a rogator as a kind of function. For example, Professor Richard Montague, referring to work done by Tarski, suggested to me that we could avoid talking about rogators if we talked instead about functions with an additional argument-place, to be filled by a (sign for a) possible state of the world, which would certainly enable us to deal with the examples of par. 7. But if such functions were really extensional, that is, if they consisted of sets of ordered triples, one member of each triple being a (sign for a) possible state of the world, then, as before, every such function would be far more complicated than the set of ordered pairs corresponding to the actual state of the world. For example, since we cannot enumerate arguments and values for the function "the town in which x was born", we shall find it even more difficult, on account of the greater multiplicity of arguments, to enumerate arguments and values for the function "the town in which x was (or would have been) born in possible world y" (apart from any difficulties in identifying the same particulars in all possible states of the world). Hence, as before, if we are to think or talk
about such a function, we need something non-extensional, such as a principle of correlation, or a rogator, by means of which it can be identified. This sort of argument works equally well against the much cruder suggestion that a rogator is just a time-dependent function. I shall not elaborate on this suggestion, for it should be clear by now that a rogator is not a type of function at all. 13. The terminology of "intensional functions" and "extensional functions" has been used by Russell (Principia Mathematica, 2nd ed., p. 72ff.) and Kneale (The Development of Logic, p. 609), but for them these terms are not so much concerned with the distinction between rogators and functions as with a different distinction, noticed by Frege (See "On Sense and Reference", in Translations). Quine has illuminatingly described the distinction as being between "referentially opaque" and "referentially transparent" contexts. The distinction may be illustrated by the pair of function-signs: (1) "the day on which our chairman first thought about x" and (2) "the day on which our chairman was first seen by x".
These both look as if they correspond to functions in the usual way, but there is a difference: for if a sign which does not refer to anything is substituted for "x" in (2), then the resulting sign does not refer to anything, and if two signs referring to the same argument are substituted in (2), then the two resulting complex expressions cannot refer to different days. On the other hand, there is no person referred to by "Mr. Pickwick", yet if it is substituted in (1) the resulting expression will probably refer to a definite day, and if the two expressions "Bertrand Russell" and "The author of The Principles of Mathematics", which refer to the same person, are substituted in turn for "x" in (1), then it is very likely that the resulting expressions will pick out different days. In short, the value of (2) depends only on what, if anything, is taken as argument, whereas the value of (1) seems to depend on how the argument is identified, that is, on the sense of the sign for the argument. We may say that (1) corresponds to an "oblique" function, (2) to a "direct" function. But it should not be thought that this distinction is the same as the distinction between rogators and functions. For the rogator "the town in which x was born" takes a value only if a sign which refers to something is taken
as argument-sign: it cannot have a value for a non-existent argument. Moreover, its value depends only on which person or animal is the argument, not on how the argument is identified. It is in order to avoid this ambiguity that I refrained from describing rogators as "intensional functions": as already remarked, such a terminology might be confused with Russell's. 14. Finally, it is clear that the distinction between rogator and function is in some ways analogous to Frege's distinction between sense and reference (op. cit.). For the reference of a name (or definite description) is an object, and the sense is that in virtue of which this object is the one referred to by the name or other expression. Similarly, it should by now be clear that the rogator corresponding to a functional expression is that in virtue of which a particular function (set of ordered pairs) is the one corresponding to that expression. But, for Frege's purposes, it is important to distinguish between a complete expression referring to a function, e.g. the expression "the function described in paragraph 7", or "the function (corresponding to) 'the square of x'", and an incomplete function-sign, such as that which is common to "the square of six" and "the square of twenty-two". Strictly speaking, the latter is not a sign at all, but an aspect or pattern or structure common to different signs. We could say that rogators and functions serve respectively as senses and referents for such "incomplete" entities. The previous kind, being "complete" signs, already fall under Frege's discussion of sense and reference. Despite the analogies, to identify rogators with senses of signs involves some linguistic strain, since it is odd to say that a sense can take arguments and have values. (I do not know whether Frege himself made any attempt to extend his sense/reference distinction to what he called function-signs.) This completes the comparison of the rogator/function distinction with other distinctions, and now all that remains to be done is to describe some applications of the distinction.

Section C
15. The first and most obvious application is analogous to Frege's application of the sense/reference distinction to identity statements. For, just as identity statements, such as "The evening star is (identical
with) the morning star" would be either quite trivially true or self-contradictory if referring expressions were directly associated with objects without the mediation of a sense (or method of identification), so also would statements of extensional equivalence between functions, such as "For any argument x, the function 'the mother of x' has the same value as the function 'the first woman loved by x"', reduce either to mere triviality or to self-contradiction if the sign for a function were directly correlated with a set of ordered pairs without the mediation of a rogator (that is, a rule or principle). The significance of statements of identity depends on the fact that it may be a significant (e.g. contingent) question whether two senses pick out the same referent. Similarly, it is because the question whether two rogators pick out the same function, the same set of ordered pairs, may be a significant (e.g. empirical) question, that statements of extensional equivalence have any significance. 16. Secondly, once the distinction has been made, we can see that the notion of a function can be explained or analysed or "reduced" in terms of the notions of a rogator and extensional equivalence. But it is not possible to "reduce" the notion of a rogator to that of a function, or set. A third application may be mentioned briefly here, in connection with this. Since "function" can be defined in terms of "rogator", and since a rogator is something like a rule or principle, which can be identified independently of any enumeration of the objects which it correlates, it follows that there is something wrong with the statement in Principia Mathematica (2nd ed., p. 39) that a function is only well-defined if its values are already well-defined. So there is something wrong with one argument in favour of the vicious circle principle. I shall not enlarge on this, but it seems likely that further investigation might lead to a better understanding of some of the problems connected with the ramified theory of types and the axiom of reducibility. 17. The fourth application which I shall mention is one which seems to me to be particularly interesting and important for the philosophy of logic. If we look back at two of our examples of rogators, namely "the town in which x was born", and "the square of x", which may be
referred to as "Fx" and "Gx" respectively, we notice the following difference (cf. par. 7, above): suppose the value of "Fx" for Aristotle's first pupil as argument to be the town Athens. Then the same rogator might have had a different value for the same argument, since the man in question might have been born in some other town. However, if we take the number six as an argument for the second rogator, we see that its value is thirty six, and could not have been anything else in any circumstances. It looks as if we have a distinction between two sorts of rogators: one sort has a value which depends on how things happen to be in the world, whereas the other fully determines its value independently of contingent facts. As pointed out to me by Mr. Dummett, there is something odd about putting the distinction in this way, since if we take different argument-signs, the rogators in question seem to exchange their positions with regard to the distinction. For example, if we apply "Fx" to the argument identified as "the man whose mother was the first woman in 1930 to give birth in Rome to her only son", then it is clear that the value (if there is one at all) must be Rome. On the other hand if we apply the rogator "Gx" to the argument identified as "the number of hours between lunch and dinner according to the Colloquium time-table", then it seems that although the value is thirty six, it makes good sense to say that it might have been different, if the printers had made a mistake on the time-table, or if the eating arrangements had been different. Moreover, a problem arises if we apply several different rogators, such as "the mother of x", "the father of x", "the wife of x", "the day on which x was born" etc., to the person taken previously as argument for "Fx", namely Aristotle's first pupil. For even if it makes sense to say of each of these in turn that it might have had a different value for the same argument, it certainly does not make sense to say that all of them might simultaneously have had different values for the same argument. For how could one have had a different mother, a different father, a different wife, been born on a different day, and in a different town, etc., and still been the same person? 18. Such difficulties are avoided if we describe the contrast not as one between types of rogators, but as a contrast between cases of application of a rogator to an argument to yield a value. In general, the value of a rogator for a given argument is fully determined by three factors
(a) the rogator itself (i.e. the principle according to which arguments are correlated with values), (b) the method by which the argument is identified (or, in particular, the sense of the expression taken as argument-sign) and (c) contingent facts, or how things happen to be in the world. As the application of "Fx" to Aristotle's first pupil, and the application of "Gx" to the number of hours between lunch and dinner according to the Colloquium time-table show, it is not generally the case that two of the factors suffice to determine a value. On the other hand, some of the other examples show that in some cases the first two factors (a) and (b) do suffice. Thus, how things happen to be in the world cannot affect the outcome of applying the rogator "the square of x" to an argument identified as the number six. 19. We can now give a precise formulation of the distinction referred to two paragraphs ago. It is a distinction between cases where two (or one) of the factors (a), (b) and (c) suffice to determine the value of a rogator for an argument identified in a certain way, and cases where all three factors are required. In particular, when the third factor, how things happen to be in the world, is not relevant, i.e. where (a) and (b) suffice to determine the value, I shall say that the application of the rogator satisfies the NCD-condition (the non-contingent determination condition). In most mathematical contexts the NCD-condition is satisfied, since the standard methods of identifying numbers, or other mathematical objects (e.g. as things which satisfy certain axioms), are such that once they have been used to fix a number they automatically determine all its properties and relations to other numbers, and therefore also help to determine the values of mathematical rogators taking those numbers as arguments. On the other hand, the normal methods of identifying non-mathematical objects, such as persons, places, etc., do not automatically determine their properties and their relations to other objects of the same kind, in general: these depend on contingent facts. Since the NCD-condition is normally satisfied in mathematical contexts, philosophers primarily concerned with the foundations of mathematics have not felt any pressing need to take account of the distinction between cases in which it is satisfied and cases in which it is not. This is connected with the fact that the distinction cannot be made if the concept of a function is used instead of the concept of a rogator. For, since a function is
identified in terms of which objects it correlates with which (i.e. via a set of ordered pairs), it makes no sense to distinguish cases in which a function might have had a different value from cases in which it could not have had a different value (for the same argument). For, since functions are extensional, the value cannot be different unless the function is. 20. This shows that the concept of a rogator or some other non-extensional concept is essential for making the distinction between cases where the NCD-condition is satisfied and cases where it is not. It may be noted that there is something unsatisfactory about describing the distinction in terms of factors which determine the value of a rogator. For it might be said that in all cases the value is in some sense determined by the two factors (a) the rogator and (b) the sense of the argument-sign, the difference between the general case and cases where the NCD-condition is satisfied being that in the latter the value is determined in two different ways. (E.g. if "a" is an expression referring to a person, then the sign "the town in which a was born" usually identifies a town. On the other hand, if "a" is the expression "the man whose mother was the first woman in 1930 to give birth in Rome to her only son", then we have two different ways of referring to the value, the new one being by means of the word "Rome". These two expressions - or their senses - must, independently of contingent facts, pick out the same thing, and this, it might be said, is all that satisfaction of the NCD-condition comes to.) This way of looking at the distinction, though illuminating, makes no difference for our present purposes and will not be discussed any further. It should also be noted that I have not taken account of the fact that in some cases, even where the NCD-condition appears to be satisfied, the three factors (a), (b) and (c) may fail to determine a value at all, on account of the failure of some term to refer, or for some other reason. E.g. Aristotle might not have had one first pupil, if at the start he took his pupils in groups; or his first pupil, if there was one, may not have been born in any town at all. In either case, applying the rogator "the town in which x was born" to Aristotle's first pupil could yield no value. This may be allowed for by inserting the qualification "if it has a value" at various points in the discussion. It has been omitted in the interests of simplicity.
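The contrast between the three factors can be made concrete in a small sketch. The code below (Python, purely illustrative; the toy "world" dictionaries, the person named, and the function names are my own, not the author's) models a rogator as a rule applied in a state of the world, and a function as the bare set of argument/value pairs which that rule happens to determine there.

```python
# A toy contrast between a rogator (a rule applied in a world of contingent
# facts) and a function (a bare set of ordered pairs). All names and data
# here are illustrative, not drawn from the paper.

def birthplace_rogator(person, world):
    # Rule: "the town in which x was born" -- value depends on the world.
    return world["birthplace"][person]

def square_rogator(n, world):
    # Rule: "the square of x" -- the world is irrelevant (NCD-condition holds).
    return n * n

world_actual = {"birthplace": {"Aristotle's first pupil": "Athens"}}
world_other  = {"birthplace": {"Aristotle's first pupil": "Stagira"}}

# The extensional function is just the set of argument/value pairs the
# rogator happens to determine in a given world.
def extension(rogator, arguments, world):
    return frozenset((a, rogator(a, world)) for a in arguments)

print(extension(birthplace_rogator, ["Aristotle's first pupil"], world_actual))
print(extension(birthplace_rogator, ["Aristotle's first pupil"], world_other))
# Same rogator, different extensions: factor (c) matters.
print(extension(square_rogator, [6], world_actual) ==
      extension(square_rogator, [6], world_other))   # True: (c) is irrelevant
```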
21. We have seen how the notion of a rogator, unlike the notion of a function, can be used in a formulation of the distinction between satisfaction of the NCD-condition and non-satisfaction of the condition. This may now be illustrated and applied further. Any two-valued rogator can be used to define a propositional function. If R(x, y, z, ...) is the rogator, whose value for any set of arguments is always one or other of the two objects K and L, whatever they may be, then there corresponds to it a propositional function which is satisfied by the ordered set of objects (a, b, c, ...) if and only if R(x, y, z, ...) has the value K for these objects as arguments (and a similar propositional function may be defined in terms of L). Conversely, it is possible to think of any propositional function as if it were simply the "value-range" (in Frege's sense) of a rogator taking the words "true" and "false", or any other arbitrarily selected pair of objects, as values.¹) The normal methods of replacing non-logical words and phrases in a sentence by variables to yield a sentential matrix can be used to represent such a rogator: e.g. "x is A", "All A's except x and y are B's", "p or q", "p or not-p", etc., can all be thought of as representing what I call propositional rogators. In general, the logical form of a proposition can always be thought of as a rogator, sometimes a rogator whose arguments are of different types, as in "x is A". This shows that the familiar analysis of propositions in terms of functions and arguments can be replaced by an analysis in terms of rogators and arguments. The sense of a sentence expressing a proposition is then partly determined by the rogator corresponding to the logical words and constructions in the sentence. We can conclude that insofar as rogators are prior to functions (i.e. to sets of ordered pairs), the sense of a proposition is prior to the set of its truth-conditions.
1) This seems to be what is important in Frege's decision to regard sentences as names of truth-values. To object that this is an unacceptable use of the word "name" is to miss the important point. The main advantage of this move is that it yields a theory of meanings, propositions and truth which fully accounts for all the properties and relations of these concepts which are of interest to logicians, without depending on discussions of such notions as "thinking", "asserting", "communicating" or the presuppositions and implications of such activities as statement-making. In short, it clearly sorts out confusions between logic and the sociology or psychology of language. In his paper on "Truth" (Proc. Aristotelian Society 1958-59) Michael Dummett attempts to criticise such a Fregean theory, but I think it can be shown that his criticisms fail to take account of its full potentialities. Perhaps Frege was not aware of them either. (It is hoped that this will be developed in another paper.)
(This might be developed to support a claim that there is a sense in which meaning is prior to use.) 22. We have seen that in most non-mathematical contexts the NCD-condition is not satisfied by the application of rogators to arguments, and this applies equally to the propositional rogators corresponding to logical constants or logical forms. For example, the rogator "p or q" may be applied to the two propositions "the moon is shining" and "dawn is breaking", and its value will be (say) the word "true" or the word "false". But there is no way of finding out which it is, even if the time and place of utterance are known, except by empirical investigation of contingent facts, for the value is not fully determined by the rogator and the methods by which the arguments are identified: the NCD-condition is not satisfied. This fact, that in general the third factor (c), mentioned in par. 18, is relevant to the value of a propositional rogator is what justifies correspondence theories of truth. To say that truth is a matter of correspondence with facts, is to mention one instance of the generalisation that the value of a rogator depends on how things happen to be in the world. (This shows that falsity is also a matter of correspondence with the facts.) Similarly, to say that any proposition determines a set of possible states of the world in which it would be true, its "truth-conditions", is to draw attention to one application of the more general fact that if R(x, y, z, ...) is a rogator, (a, b, c, ...) arguments of R, and K a possible value of R, then the rogator, the argument-set and the value K together determine a set of possible states of the world, namely those in which R would take the value K for the arguments in the set (a, b, c, ...). By considering rogators which take more than two values we thus find a natural interpretation for systems of many-valued logic. The fact that the propositional rogator "p or q", and the methods for identification of its arguments, do not in general suffice without the third factor to determine the value of the rogator, is what makes it possible for such logical words and constructions to be used in sentences which express contingent propositions, i.e. say things about the way the world happens to be. So the rules according to which they are used must make allowance for this connection with contingent fact, and this is a point that is missed by those who say that logical constants are governed by purely syntactical rules, that their use can be fully characterised by means of formal systems,
and that logic can be reduced to syntax. Moreover, it can be argued that to speak of "truth", "proposition", "validity" etc., in connection with a formal system which in no way allows for the influence of contingent facts (how things happen to be in the world) on truth-values, is simply to generate confusion, since it obscures the fact that no such formal system could ever do what can be done by real languages, namely enable us to make statements about something non-linguistic.
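A similar toy sketch, again with invented propositions and worlds, makes the point about propositional rogators concrete: the value of "p or q" varies with how the world happens to be, while a rogator constructed as "p or not-p" takes the same value in every world.

```python
# A toy propositional rogator: it maps argument propositions, together with a
# state of the world, to one of two arbitrarily chosen value objects
# ("true"/"false"). The propositions and worlds below are illustrative only.

def holds(prop, world):
    return world[prop]

def or_rogator(p, q, world):
    return "true" if holds(p, world) or holds(q, world) else "false"

def or_not_rogator(p, world):
    # "p or not-p": built from the same material, yet its value is fixed
    # no matter how the world happens to be (NCD-condition satisfied).
    return "true" if holds(p, world) or not holds(p, world) else "false"

worlds = [{"the moon is shining": m, "dawn is breaking": d}
          for m in (True, False) for d in (True, False)]

# "p or q" needs factor (c): its value varies across worlds.
print({or_rogator("the moon is shining", "dawn is breaking", w) for w in worlds})
# "p or not-p" does not: one value in every world.
print({or_not_rogator("the moon is shining", w) for w in worlds})
```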
23. Once we have seen that it is essential to propositional rogators that their applications do not always satisfy the NCD-condition, we are in a position to be struck, in a new way, by the fact that they sometimes do. How can their values sometimes be determined independently of contingent facts even though they are constructed or defined in such a way that contingent facts are to be relevant to their values? Or again, how is it that, starting with rogators whose values normally depend on contingent facts (e.g. "p or q", "not-p") we can construct new ones (e.g. "p or not-p") whose values never depend on contingent facts, whose applications always satisfy the NCD-condition? What I am getting at is that the necessary truth of a proposition can often be construed as illustrating the more general notion of satisfaction of the NCD-condition by the application of a rogator. And if we develop a theory of rogators, which describes and compares the different ways in which values of rogators may come to be determined independently of how things happen to be in the world (e.g. sometimes relations between the ways in which arguments are identified, sometimes relations between the method of identifying an argument and the rule for the rogator, sometimes only the way the rogator is constructed out of others, will be relevant), we may find (as I have found) that it is quite natural to say that there are different sorts of necessary truth, some of which can be described as "logical", some as "analytic", some as "synthetic". (This would provide an interpretation for a system of modal logic with different modal operators of different "strengths".) It is even to be hoped that studying the various ways in which the NCD-condition may come to be satisfied, and noticing their differences, may rid people of the inclination to oversimplify by saying that all necessity is due simply to language, or to conventions, or to syntax. This may be illustrated by the following comparison. A rogator whose
application to an argument identified in a certain way satisfies the NCDcondition is none the less a rogator, and the value which it takes is the very same thing as it may take in other applications not satisfying the NCD-condition. In particular, if it is a propositional rogator, and its application occurs in the construction of a proposition, then the mere fact that the NCD-condition is satisfied, e.g. if the proposition turns out to be one which is logically true, is no more justification for saying that what we have is not a proposition but a convention or rule, or for saying that it is not true in the same sense as other propositions, than there is for saying that the rogator is no longer a rogator, or that it does not have a value in the usual sense. 24. This completes my account of the applications of the notion of a rogator. I hope these rather condensed remarks show that we can look at some old problems in a new and illuminating way if we make the distinction between a function and a rogator.
INFINITELY LONG TERMS OF TRANSFINITE TYPE
W. W. TAIT
Stanford University, Stanford, Calif., USA
1. Functionals of higher type were introduced into proof theory by K. Gödel in [2], where he gives an interpretation of first order number theory in terms of the impredicative primitive recursive (p.r.) functionals of finite type. The aim of this work was to show that for the consistency of number theory, Gentzen's use of induction up to ε₀ (with respect to p.r. properties) can be replaced by a quite different constructive - but like Gentzen's, non-finitist - principle, namely, the assumption of constructive functionals of finite type and of their closure under p.r. operations. However, another view of Gödel's result is possible: Instead of assuming functionals of higher type, we may regard the definitional schemata for the p.r. functionals simply as rules of computation, i.e. for transforming symbols. On this view, Gödel's result may be interpreted as a consistency proof relative to the quantifier-free theory of p.r. functionals (his system T), which in turn must be justified by a proof that all the constant numerical terms of the theory can be transformed by the rules of computation into unique numerals. This latter viewpoint is what I will discuss here. It has become especially interesting in virtue of Spector's [7] extension of Gödel's interpretation to classical analysis by adding the general principle of bar recursion to the schema for primitive recursion. For, while on any reasonable conception of computability at higher types, the computable functionals are constructively closed under the p.r. operations, there is no known constructively valid interpretation of these operations together with bar recursion.¹)
1) There is a constructively valid interpretation for bar recursion of lowest type (i.e. in Spector's notation, where c is a sequence of numbers or functions) using the continuous functionals of finite types. By the methods of this paper, however, we can expect to obtain something more, namely, a least ordinal α such that the functionals of finite type defined by recursion up to α are closed under bar recursion of lowest type.
It appears that the best hope for a constructive justification of bar recursion lies in an analysis of the computations of bar recursive functionals. However, here I will discuss only p.r. functionals, or rather a certain generalization of them. On another occasion I will apply the present ideas to the analysis of functionals involving bar recursion of lowest type. William Howard has recently extended Gödel's interpretation to ramified analysis, as formulated by Schütte [6] and Feferman [1], using p.r. functionals of transfinite type. In view of this, it is fitting that we formulate the results of this paper for the wider context of transfinite types, even though we will not discuss Howard's result here. Actually, Howard uses a more complex concept of transfinite types than is introduced here, but his result can be obtained using the present conception. Just as Lorenzen and Schütte greatly simplified the problem of cut-elimination for formal proofs involving induction principles by effectively representing such proofs by infinite well-founded proof trees,¹) so a similar device will serve us here in analyzing the computations of functionals involving definition by recursion. Consider, for example, the (not necessarily numerical valued) functional φ defined by the primitive recursion:
φ(0, a, b) = a,   φ(n+1, a, b) = b(φ(n, a, b), n).
The p.r. functionals of finite type can be generated from such φ by means of λ-abstraction and explicit definition (where the definiens may contain 0 and the successor operation, of course). Now, write φ₀ = λab. a and φₙ₊₁ = λab. b(φₙ(a, b), n). Then φ is represented by the "infinite term" (φ₀, φ₁, ...) which takes the value φₙ for the argument n. The close connection between this infinitary rule of term formation and the rule of infinite induction (used by Schütte in the construction of proof trees), i.e. A(0), A(1), ... ⊨ (x)A(x), is made clear if we consider the Gödel interpretation of the latter:
(ψ)B(0, φ₀, ψ), (ψ)B(1, φ₁, ψ), ... ⊨ (Eφ)(x)(ψ)B(x, φ(x), ψ).
1) Lorenzen was first to give a constructive formulation of cut-elimination for infinite proofs, using transfinite induction. Schütte showed how to determine exactly in some cases which ordinals are involved, thus restoring the precision of Gentzen's original results.
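A rough finite-type illustration of this recursion, in Python (the function names are mine and nothing here belongs to the paper's formal calculus): the first definition writes the primitive recursion for φ directly, while the second presents the same functional as the family (φ₀, φ₁, ...) indexed by the numerical argument, which is exactly what the infinite term packages up.

```python
# Illustrative sketch only: phi(n, a, b) by primitive recursion, and the same
# functional presented as a family of branches phi_n indexed by n.

def phi(n, a, b):
    return a if n == 0 else b(phi(n - 1, a, b), n - 1)

def phi_n(n):
    # The n-th "branch", a lambda-abstract  lam a b. b(phi_{n-1}(a, b), n-1).
    if n == 0:
        return lambda a, b: a
    return lambda a, b: b(phi_n(n - 1)(a, b), n - 1)

# The "infinite term" is represented here by the map n |-> phi_n.
add = lambda acc, i: acc + 1            # a sample step functional b
print(phi(5, 0, add), phi_n(5)(0, add))  # both print 5
```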
For, using our infinitary rule of term formation, the solution for φ is simply φ = (φ₀, φ₁, ...). In dealing with transfinite types, we make two further uses of Lorenzen's and Schütte's idea. First, transfinite types are themselves to be infinite objects. Namely, if σᵢ and τᵢ are types for all i < α, α ≤ ω, and the σᵢ are all distinct, then the set {(σᵢ, τᵢ)}ᵢ<α is a type. In fact, it is the type of functionals which are defined for objects of any type σᵢ, i < α, and whose value for an object of type σᵢ is of type τᵢ. A particular case is when α = 1, in which case the type is simply denoted by (σ₀, τ₀). Also, we will use another kind of infinite term to piece together functionals of type {(σᵢ, τᵢ)}ᵢ<α from functionals of types (σᵢ, τᵢ), i < α. Namely, if for each i < α, φᵢ is of type (σᵢ, τᵢ), then φ = (φ₀, φ₁, ...) is of type {(σᵢ, τᵢ)}ᵢ<α, and for each argument x of type σᵢ (i < α), φ(x) = φᵢ(x). We are dealing here only with functionals of a single argument. But it is well-known how to reduce functionals of several variables to functionals of one variable (but of higher type). In the next section we will set up a formal calculus of infinitely long terms, or rather, a "semi-formal" calculus in Schütte's sense, which will codify the constructions of functionals which we have been discussing. In section 3, we will prove by induction up to α that every term t can be computed, i.e. can be reduced by certain conversion principles to an irreducible term t′, where the ordinal α is given explicitly in terms of certain bounds on t. From the proof of computability, in fact, it is easy to give a definition of t′ = f(t) where the functional f is defined by predicative transfinite recursion up to α. "Predicative" here means that f is obtained by explicit definition and recursive definition up to α, where the latter is used only to define numerical valued functions. Thus the definition of f does not involve functionals of higher type (unlike the definitions of the p.r. functionals of finite type).
2. We will use the usual notations for countable ordinals and operations on ordinals, but these should be interpreted in terms of a suitable constructive system of ordinal notations. In fact, for the sake of definiteness, we will assume that all discussion of ordinals here refers to the p.r. well-ordering of the natural numbers defined in Schütte [4]. All the ordinal functions which we will use are represented in that ordering by p.r. functions.
The types and their ranks are inductively defined by
Tp 1. 0 is a type with rank R0 = 0.
Tp 2. If σᵢ and τᵢ are types with Rσᵢ, Rτᵢ < β for all i < α ≤ ω, and if σᵢ ≠ σⱼ for i ≠ j (i, j < α), then the set ρ = {(σᵢ, τᵢ), β}ᵢ<α is a type with Rρ = β.
When the rank of {(σᵢ, τᵢ), β}ᵢ<α is not relevant in a given context, we usually abbreviate this type by {(σᵢ, τᵢ)}ᵢ<α, and for α = 1, by (σ₀, τ₀). It would not be satisfactory in Tp 2 to take for ρ simply {(σᵢ, τᵢ)}ᵢ<α and define Rρ to be the supremum of the Rσᵢ and Rτᵢ, since this supremum is not computable from ρ. On the other hand, our present definition has the unpleasant feature that two types may be extensionally the same, and be distinct only in virtue of their ranks. However, we can easily avoid this difficulty by taking
{(πᵢ, ρᵢ), β}ᵢ<α = {(σᵢ, τᵢ), δ}ᵢ<α
to mean that the sets {(πᵢ, ρᵢ)}ᵢ<α and {(σᵢ, τᵢ)}ᵢ<α are the same. In particular, the condition σᵢ ≠ σⱼ in Tp 2 should be interpreted in this way.¹)
The τ-terms (i.e. terms of type τ) and their lengths are inductively defined as follows:
Tm 1. Each variable a^τ of type τ is a τ-term with length |a^τ| = 0.
Tm 2. 0 is a 0-term. |0| = 0.
Tm 3. If s is a 0-term, then so is Ss. |Ss| = |s| + 1.
Tm 4. If ρ = {(σᵢ, τᵢ), β}ᵢ<α and, for each i < α, sᵢ is a τᵢ-term with |sᵢ| < β, then {λa^σᵢ. sᵢ, β}ᵢ<α is a ρ-term, with |{λa^σᵢ. sᵢ, β}ᵢ<α| = β.
1) D. Scott has suggested, quite reasonably, that we call the objects defined by Tp 1 and 2 type notations and distinguish them from the corresponding extensional types. Thus, σᵢ ≠ σⱼ in Tp 2 would mean that the types corresponding to the notations σᵢ and σⱼ are distinct. Of course, in speaking about notations, we would have to restrict ourselves to sets {(σᵢ, τᵢ), β} which can be represented by numbers, e.g. recursively enumerable sets. The present formulation allows that our types may be free choice sequences of a certain spread (see the remarks below about the constructive meaning of infinite terms), but it is too early to see whether this will be of any particular use.
Tm 5. If s₀, s₁, ... are τ-terms, and |sₙ| < β for n ≥ 0, then (sᵢ; β)ᵢ<ω is a {(0, τ), Rτ+1}-term. |(sᵢ; β)| = β.
If r is a p-term, S is a a-term, and (a, r) e p, then (rs) is a r-term, I rs I = max (I r I, I s I) + 1.
Regarding the constructive meaning of this definition, there are two possible views. On the narrower view, infinite types and infinite terms are to be given by effective, and so finite, rules for their construction, and our theorems about terms are ultimately about these rules, and so really only concern finite objects. This is the view expressed by Schütte in connection with his work with infinite proof trees. E.g. see [5], p. 369. Another, more general, viewpoint is that infinite terms are essentially free choice sequences of a certain spread. The details of the construction of the spread are left to the reader; but it will be noted that the possibility of identifying the terms with the choice sequences of a spread depends essentially on the fact that each term has an associated length which exceeds the length of its subterms, and each type {(σᵢ, τᵢ), β} has a rank which exceeds the ranks of the σᵢ and τᵢ. It is easy to see that each term has a unique type. In particular, if r is a ρ-term, s a σ-term and rs is a τ- and a τ′-term, then (σ, τ) ∈ ρ and (σ, τ′) ∈ ρ, and so τ = τ′. r^τ, s^τ and t^τ will denote τ-terms. The rank Rt of a τ-term t is simply Rτ. When there is no need for greater explicitness, we will write {λa^σᵢ. sᵢ} and (sᵢ) for the terms given by Tm 4 and 5. Also, for α = 1, write λa^σ₀. s₀ for {λa^σᵢ. sᵢ}ᵢ<α. 0 is intended to denote 0, and S the successor operation, so that the numerals are 0, 1 = S0, 2 = S1, etc. An occurrence of a variable b in a term is called bound if it is in a context λb. s; otherwise, it is called free. Let t(a^σ) be an arbitrary term, and s a σ-term. Replace each variable which has free occurrences in s, in all of its bound occurrences in t(a^σ), by a distinct new variable, and replace each part {λa^σᵢ. sᵢ, β} and (sᵢ; β) in t(a^σ) by {λa^σᵢ. sᵢ, α} and (sᵢ; α), respectively, where α = |s| + β. Finally, replace each free occurrence of a^σ in the resulting expression by s. The result will be denoted by t(s). We can assume that the change of bound variables in t(a^σ) is done in a unique way, so that t(s) is unique.
LEMMA 1: If t(a^σ) is a τ-term and s a σ-term, then t(s) is a τ-term with |t(s)| ≤ |s| + |t(a)|.
The proof is by straightforward induction on |t(a)|. We will write t₁t₂...tₙ for (...(t₁t₂)...tₙ). Now we can formulate the rules of conversion.
I. {λa^σᵢ. sᵢ(a^σᵢ)} r^σₙ → sₙ(r^σₙ).
II. (sᵢ)n̄ → sₙ.
III. (rᵢ)st → (rᵢt)s.
The relation r =₁ s (r reduces to s) is inductively defined by
1°. If r → s then r =₁ s.
2°. If r =₁ s then t(r) =₁ t(s).
3°. If rᵢ =₁ sᵢ for each i < α, then {λa^σᵢ. rᵢ}ᵢ<α =₁ {λa^σᵢ. sᵢ}ᵢ<α.
4°.
5°.
In 2°, t(r) and t(s) are obtained from a term t(a) in the manner specified above. It is clear that if r^σ =₁ s, then s is a σ-term, and moreover, on the intended interpretation, r^σ and s denote the same functional. For i = I, II or III, if rs → t is an instance of rule i, then rs is said to be i-convertible (i-conv, or simply conv) into t with principal part r. If t contains no conv subterms, then it is called irreducible or is said to be in normal form.
3. We prove in this section that every term can be reduced to a term in normal form. The non-trivial part of the proof consists in showing that we can reduce t to a term without I- or III-conv subterms. Dt ≤ α will mean that Rr < α for every principal part r of a I- or III-conv subterm of t. We can read Dt ≤ α as "the degree of t is ≤ α", providing that we do not assume Dt to be an ordinal which is computable from t.
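As a rough illustration of what the conversion rules compute on the intended interpretation (not of the rewriting calculus itself), the following Python sketch treats numerals as integers, an ω-indexed term (s₀, s₁, ...) as a function of its index (the content of rule II), and a λ-branch as a closure (the content of rule I); all names are illustrative.

```python
# Illustration of the intended reading of rules I and II: numerals are ints,
# an omega-indexed term (s_0, s_1, ...) is a function of the index, and a
# lambda-branch is a closure. This evaluates rather than rewrites terms.

def omega_term(branches):
    # Rule II, read semantically: (s_i) applied to the numeral n yields s_n.
    return lambda n: branches(n)

# The functional phi of section 1, packaged as an omega-indexed term whose
# n-th branch is the closure  lam a. lam b. b(phi_{n-1}(a)(b))(n-1).
def phi(n):
    if n == 0:
        return lambda a: lambda b: a
    return lambda a: lambda b: b(phi(n - 1)(a)(b))(n - 1)

phi_term = omega_term(phi)

step = lambda acc: lambda i: acc + 1   # a sample step functional
print(phi_term(3)(10)(step))           # 13: the recursion unfolds three times
```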
LEMMA 2: If Dt(a^σ), Ds ≤ α and Rs < α, then Dt(s) ≤ α.
Let uv be a I- or III-conv subterm of t(s). We must show that Ru < α. If uv is a subterm of s, this follows from Ds ≤ α. If u = s, it follows from Rs < α. If uv is III-conv and u is of the form sw (so that s is of the form (sᵢ)ᵢ<ω), then Ru < Rs < α. But in every other case, uv is of the form
u′(s)v′(s) where, if uv is i-conv (i = I or III), then u′(a)v′(a) is an i-conv subterm of t(a). But since Dt(a) ≤ α, it follows that Ru = Ru′(a) < α.
LEMMA 3: If Dr, Ds ≤ α, rs is a term, and Rs < α, then there is a t with rs =₁ t, Dt ≤ α and |t| ≤ Max(|rs|, |s| + |r|).
Proof by induction on |r|. If rs is not I- or III-conv, then t = rs suffices, since rs is the only possible conv subterm of rs which is neither a subterm of r nor of s. If r = {λa^σᵢ. rᵢ(a^σᵢ)} and s is of type σₙ, then t = rₙ(s) suffices, by Lemmas 1 and 2. If r = (rᵢ; β)u, then since |rₙ| < |r|, there is a tₙ with rₙs =₁ tₙ, Dtₙ ≤ α and |tₙ| ≤ Max(|rₙs|, |s| + |rₙ|), for n ≥ 0. Hence, t = (tₙ; γ)u suffices, where γ = Max(Max(β, |s|) + 1, |s| + β).
LEMMA 4: If Drs ≤ α and Rr ≥ α, then r is of the form (uᵢ) or else a^τ r₁ ... rₙ (n ≥ 0).
In fact, r is of the form r₀r₁...rₙ, where r₀ is not of the form uv, and Rr₀ ≥ Rr ≥ α. Now in all cases other than r₀ = a^τ and r₀ = (uᵢ) with n = 0, r₀ is the principal part of a I- or III-conv subterm of rs. But this is impossible since Drs ≤ α.
Let χ_α^(0) = 2^α, and for γ ≠ 0, let χ_α^(γ) be the αth simultaneous solution β of χ_β^(γ′) = β, for all γ′ < γ. Then χ_α^(γ) is a normal function of α.
THEOREM 1: If Dt ≤ α + ω^γ, then there is a t′ such that t =₁ t′, Dt′ ≤ α and |t′| ≤ χ_{|t|}^(γ).
r or,
::S;a
The proof is by induction on )I, and within that, by induction on 1 t I. If I t I = 0, then Dt = 0, so that t' = t suffices. Using the fact that the length of a term exceeds the length of its subterms, and that x~Y) is normal in 13, we can set (Ss)' = Ss', paO"'. s;, f3}' = {AaO"'. s;, x~Y)} and (s;; 13)' = = (s~·l' X(Y») Let t • = rs Then there are r' and s' with Dr' , Ds' < , a r - r' p • s=jS', 1 r' I ::s; and I s'l ::s; Xm. If Rr ~ a+w Y, then r is of the form a'r ; ... r« or (uJ, by Lemma 5. Hence, r' is of the form atr~ ... r~ or (u;), and so r's' is not 1- or III-conY. Thus, t' = r's' satisfies the theorem. Assume now that Rr < a+w Y• We must consider three cases: Case 1. y = 0. I.e. Rr s; a+ 1, so that Rs < a. Then by Lemma 4 there is a t' with t=r's' t', Dt' ::s; a and 1 t' I ::s; 21sl+21rl < 2 1t l• Case 2. y = (j + 1. Then since Rr < a+w·· w, we have Rr < a+w· . k for sufficiently large k. Then Dr's' ::s; a + w· . k, and so by k iterated
Xrn
-I
,
183
INFINITELY LONG TERMS OF TRANSFINITE TYPE
applications of the inductive hypothesis for J, there is a t' with r's' Dt' ~ IX and (y)
< XI t
Case 3. 'Y
= lim 'Yn' Then since Rr < n
+ COYk.
IX
t' ,
I'
+ oi', there is a k with
Rr <
IX
Hence, Dr′s′ ≤ α + ω^(γ_k), and so by the inductive hypothesis for γ_k, there is a t′ with r′s′ =₁ t′, Dt′ ≤ α and
|t′| ≤ χ^(γ_k)_{max(χ_{|s|}^(γ), χ_{|r|}^(γ)) + 1} < χ_{|t|}^(γ).
This completes the proof.
THEOREM 2: If Dt ≤ ω^γ, then there is a t′ in normal form with t =₁ t′ and |t′| ≤ χ_{|t|}^(γ).
This follows from Theorem 1 and the following lemma.
LEMMA 5: If Dt = 0, then there is a t′ in normal form with t =₁ t′ and |t′| ≤ |t|.
The proof is by induction on |t|. If |t| = 0, then t is in normal form, so we can take t′ = t. Set (Ss)′ = Ss′, {λa^σᵢ. sᵢ, β}′ = {λa^σᵢ. sᵢ′, β}, and (sᵢ; β)′ = (sᵢ′; β). Let t = t₀t₁...tₙ where n > 0 and t₀ is not of the form uv. Since Dt = 0, it follows from Lemma 4 that t₀ is a variable, or else it is of the form (uᵢ) with n = 1. In the first case, t′ = t₀′t₁′...tₙ′ suffices; in the second case, if t₁′ is a numeral n̄ then t′ = uₙ′, and if t₁′ is not a numeral, then t′ = t₀′t₁′. This completes the proof that every term t can be reduced to a normal form t′. From the proof, it is clear that if we restrict ourselves to terms of length < β and degree < α, then t′ = f(t) can be defined by predicative recursion up to χ_β^(α) · α, since the double induction in the proof of Theorem 1, up to α, and within that, up to |t|, can be transformed into an induction up to χ_β^(α) · α. The essential unicity of the normal form for a given term follows from
THEOREM 3: If r =₁ s, r =₁ t and t is in normal form, then s =₁ t.
For, if s is also in normal form, then t can be obtained from s only by trivial uses of 2°, namely, where in going from u(v) to u(w) (where v =₁ w), v does not actually occur in u(v), so that u(w) differs from u(v) only by some changes of bound variables. (See the definition of the substitution u(v).) Hence, in this sense, t is a mere notational variant of s. In particular, if r contains no free variables and is a 0-term, then all its normal forms must be numerals, and hence, identical. The proof of Theorem 3 proceeds by induction on the "length" of a reduction of r to s, defined in a suitable way. I omit this proof, which is routine, long and unpleasant.
4. The finite types are obtained by restricting Tp 2 to the case α = 1, so that they are built up from 0 by means of the composition (σ, τ). Similarly, the terms of finite type are obtained by restricting Tm 4 to the case α = 1, so that the terms given by this clause are of the form λa^σ. s. The impredicative p.r. functionals of finite type are obtained from the terms of finite type (in our sense) by replacing Tm 5 by the schema for primitive recursion given in § 1. But we saw in § 1 how to replace each
°
.2(1)'2
(with k exponents). In particular, let t be a constant (0, 0)-term, i.e. without free variables. Then for every n, f(tn̄) is a numeral. Then
References
[1] S. Feferman, Systems of Predicative Analysis. To appear.
[2] K. Gödel, Über eine bisher noch nicht benutzte Erweiterung des finiten Standpunktes. Dialectica 12 (1958) 280-287.
[3] G. Kreisel, Inessential Extensions of Heyting's Arithmetic by Means of Functionals of Finite Type. Abstract. JSL 24 (1959) 284.
[4] K. Schütte, Kennzeichnung von Ordnungszahlen durch rekursiv erklärte Funktionen. Math. Ann. 127 (1954) 15-32.
[5] K. Schütte, Beweistheoretische Erfassung der unendlichen Induktion in der Zahlentheorie. Math. Ann. 122 (1951) 369-389.
[6] K. Schütte, Predicative Well-orderings. These Proceedings, p. 280.
[7] C. Spector, Provably Recursive Functionals of Analysis: A Consistency Proof of Analysis by an Extension of Principles Formulated in Current Intuitionistic Mathematics. Recursive Function Theory. Proc. of Symposia in Pure Mathematics, Vol. V, Am. Math. Soc. (1962) 1-27.
Added in proof. The work of Lorenzen to which we refer is "Algebraische und logistische Untersuchungen über freie Verbände", Journal of Symbolic Logic 16 (1951) 81-106. However, an earlier treatment of proof theory by means of a constructive theory of infinite proofs is given in P. Novikov, "On the consistency of certain logical calculus", Matematiceskij sbornik 12, no. 3 (1943) 353-369. In particular, a constructive consistency proof for arithmetic is given.
II. SYMPOSIUM ON RECURSIVE FUNCTIONS
CONSTRUCTIVE ORDER TYPES, I ¹)
JOHN N. CROSSLEY ²)
St. Catherine's College, Oxford, UK
Introduction 1. The theory of constructive order types constitutes a new approach to the problem of providing a constructive analogue of ordinal number theory. Recently, Dekker, Myhillet al. (e.g. [8]) have considered a generalization of the notion of cardinal number which may be regarded as a constructive analogue of Cantor's theory. Ordinal number theory may be approached in two ways. (1) Ordinals may be considered as being generated in a certain way (v. [1] § 3, p. 19; [2] p. 87). (2) Ordinals may be regarded as the equivalence classes of well-ordered sets under (arbitrary) one-one order-preserving maps (isotonisms). Church and Kleene ([3]) considered a constructive analogue of (1). In the present work we embark on a constructive analogue of (2). We define constructive order types as equivalence classes of (linear) orderings under effective one-one order-preserving maps (recursive isotonisms). (We are only concerned with denumerable orderings.) In particular, co-ordinals are the equivalence classes of well-orderings obtained under recursive isotonisms. Since there are only denumerably infinitely many recursive isotonisms, co-ordinals are, in general, proper sub-classes of the corresponding classical ordinals. In establishing our results we use classical set theory together with 1) The author is deeply indebted to Prof. G. Kreisel for his valuable suggestions and comments on the problems discussed here. 2) The work presented here was done whilst the author was a Junior Research Fellow at Merton College, Oxford.
recursive function theory. We define addition, multiplication and exponentiation in a way which agrees (with respect to the orderings) with the classical versions of these functions. Most of our basic results in the additive and multiplicative theory hold not only for co-ordinals but also for a collection of constructive order types we call quords. Quords are the constructive order types of those linear orderings which contain no effective infinite descending chains. As there were close similarities between some aspects of Tarski's Cardinal algebras [16] and Dekker and Myhill's Recursive equivalence types [8], it is not surprising that there should be analogous connections between Tarski's Ordinal algebras [17] and the present work. However, neither in the theory of recursive equivalence types nor in the theory of constructive order types is there a natural analogue of the infinite sum which Tarski introduced (v. [7], p. 197). This may be regarded as the main reason why constructive order types do not naturally form an ordinal algebra. 2. In section I we introduce various kinds of isotonisms, i.e. one-one, order-preserving maps, and show that two recursive linear orderings are recursively isomorphic if and only if they are recursively isotonic. Thus the theory of recursively isomorphic well-orderings is part of the theory of constructive order types. Addition is defined in § II. For this we require the notion of (r.e.) separable relations (cf. [15]) which is studied in § II. 1. The rest of § II is devoted to establishing elementary properties of addition and of an ordering by initial segments (:0:;). In particular, we prove the Separation Lemma (II. 5 . 1) and the Directed Refinement Theorem (II. 5 .2) which are fundamental for much of the later work. The Directed Refinement Theorem also shows that :0:; is a tree ordering, i.e. A :0:; C and B :0:; C imply A :0:; B or B :0:; A, as well as being a quasi-ordering of all constructive order types. We restrict our attention exclusively to linearly ordered sets and their constructive order types from § III. There we consider an analogue of the descending chain condition. We caIl linearly ordered sets which contain no infinite recursive descending chain quasi-well-orderings, and we call their constructive order types quords. For quords we also have the cancellation law A + B = A + C implies B = C. However, quords are not partially well-ordered by :0:;.
Co-ordinals, which we discuss in § IV, are the constructive order types of well-orderings. Although ≤ is a partial well-ordering¹) of the collection of all co-ordinals, it is not a well-ordering and for every (classical) limit number there exist uncountably many co-ordinals which are subclasses of that classical ordinal. It follows that these co-ordinals are incomparable. Because of this [an analogue of] a classical law for addition fails for co-ordinals. However, when we introduce the notion of a principal number for addition then the [analogue of the] law holds for predecessors of any given principal number. For such co-ordinals we also obtain a unique additive decomposition. A negative result is that a co-ordinal A may be such that there is a (classical) ordinal Γ, less than the (classical) ordinal of A, for which there is no co-ordinal C which is both < A and of classical ordinal Γ (example IV.5.1). But we do have the following Representation Theorem (IV.5.4): For every (classical) ordinal Γ, there is a co-ordinal C (of ordinal Γ) which is such that, for every ordinal Δ < Γ, there is a co-ordinal D of ordinal Δ such that D < C. If Γ is infinite there are uncountably many such co-ordinals (corollary IV.5.5). In § V we prove that a collection of quords (a fortiori of co-ordinals) has a least upper bound if, and only if, it has a maximum, i.e. there are no non-trivial least upper bounds. Multiplication is introduced in § VI and some of its basic properties derived. Most of the fundamental classical laws [have analogues which] go through. In order to show that AB = AC implies B = C (if A ≠ 0) we prove that the (classical) isotonism between representatives of B and C can be extended to a recursive isotonism. Analogously to the situation for addition, the law A < B implies AC ≤ BC does not hold in general; but we show that it does hold for predecessors of a given principal number for multiplication. Exponentiation is defined in § VII and the development is very similar to that of § VI. In § VIII we prove that the collections of principal numbers for addition and multiplication of ordinal less than ω^ω and ω^(ω^ω), respectively, lie in a single branch of the tree of co-ordinals. This result is best possible in the sense that there exist incomparable principal numbers for addition and multiplication of ordinal ω^ω and ω^(ω^ω), respectively.
192
JOHN N. CROSSLEY
Because predecessors of a principal number for (e.g.) addition obey all the classical laws for addition, the well-orderings belonging to co-ordinals lying in the branch of the tree just mentioned may be called natural well-orderings (with respect to addition). By increasing the number of functions considered and ensuring that the collections of principal numbers form a nested sequence, we can get characterizations of natural well-orderings up to larger segments of the Cantor second number class in terms of their co-ordinals. 1) Terminology and notation
The development of the theory of constructive order types will be informal but we shall use logical symbolism freely for brevity. We write "&", "v", "I", "---+", "+--).", "3", "V", "E!", "flx" for "and", "or", "not", "implies", "if and only if", "there exists", "for all", "there is a unique", "the least x such that", respectively, and we also use the Anotation (cf. [9], p. 34). We sometimes use dots for bracketing purposes in the usual way. A number means a natural number (0, I, 2, ... ) unless otherwise stated. A set is a collection of numbers and a class is a collection of sets. We denote the set of all natural numbers by.J' and the empty set by 0. We use lower case Greek letters for sets. {x : P(x)} is the set of all elements satisfying the predicateP. &('J.- 13 = {x : x s ('J. & x ¢ f3}. ii = .J' -('J.. e('J. £: 13 means x s ('J. -+ x e 13 and ('J. c: 13 means ('J. £: 13 & ('J. i= 13. <x, y) is the ordered pair of the numbers x, y. ('J. x 13 = {<x, y) : x e ('J. & y e f3}. ('J.2
=
('J.X('J..
A relation is a set of ordered pairs of natural numbers, i.e. a subset of
.J'2. We use upper case bold face letters (A, B,... ) for relations. A relation A is said to be reflexive if
(x, y) e A
-+
<x, x) e A &
The converse of a relation A is {
CONSTRUCTIVE ORDER TYPES, I
193
{x : (3y) «x, y) B A v
: <x, y)
B
A}.
We assume familiarity with classical ordinal number theory (as in e.g.
[1] or [14]). We also assume familiarity with the notions of recursive
and partial recursive functions and recursive and recursively enumerable (r.e.) sets. We sometimes use Turing machine methods for convenience (for details see e.g. [5] or [9]). We make heavy use in the sequel of the facts: (i) If a is a r.e, set and a ~ of, thenf(a) is r.e., (ii) If a is an infinite r.e. set, then a is recursive if, and only if, there is a recursive function which enumerates a in order of magnitude ([13], p. 291). We recall that a set containing no infinite r.e. subset is said to be immune and that there exist immune sets and r.e. non-recursive sets ([6], p. 89, [13], p. 291). We use the well-known (primitive) recursive functions defined by
= !(x+Y) (x+y+ l)+x, j(k(x), lex)) = x j(x, y)
(v. [5], p. 43). j maps .?2 one-one onto f
{j(x, n) : x
and (A ; n) for
B
We write j(a, n) for «}
{<j(x, n),j(y, n) : <x, y)
Unexplained notations may be found in [9], p. 538.
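For reference, the pairing function j and its inverses k, l can be written out directly. The formula is the one displayed above; the search used to invert it below is merely one convenient way of computing k and l, not necessarily the one intended in [5].

```python
# The pairing function j(x, y) = (x + y)(x + y + 1)/2 + x and inverses k, l
# with j(k(z), l(z)) = z. The inverse simply locates the diagonal x + y = s.

def j(x, y):
    return (x + y) * (x + y + 1) // 2 + x

def unpair(z):
    s = 0
    while (s + 1) * (s + 2) // 2 <= z:   # find the diagonal x + y = s
        s += 1
    x = z - s * (s + 1) // 2
    return x, s - x                       # (k(z), l(z))

def k(z): return unpair(z)[0]
def l(z): return unpair(z)[1]

assert all(j(k(z), l(z)) == z for z in range(200))
assert all(unpair(j(x, y)) == (x, y) for x in range(15) for y in range(15))
print(j(3, 4), k(j(3, 4)), l(j(3, 4)))    # 31 3 4
```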
I. Recursive isotonism
1.1. All relations are assumed to be reflexive unless otherwise stated.
A function f from the field of a relation A to the field of a relation B is said to be relation preserving (between A and B) if
⟨x, y⟩ ∈ A ↔ ⟨f(x), f(y)⟩ ∈ B.
In the above definition and in definitions I.1.2 and I.1.5, below, f is to be one-one on the whole of its domain (and not merely on the field of A). This condition ensures that in all three cases f⁻¹ is well-defined on ρf. (Under definition I.1.2 we may have ff⁻¹ ≠ 1.) Clearly isotonism is an equivalence relation. We write RT(A) = {B : B ~ A} and if A = RT(A), then A is said to be a relation type. DEFINITION I.1.2: Suppose A and B are relations. Then a map p(x) is said to be a recursive isotonism from A to B if (i) p is a partial recursive function, (ii) p is one-one, (iii) δp ⊇ C'A and p(C'A) = C'B, (iv) p is relation preserving between A and B. A is recursively isotonic to B if there is a map p which is a recursive isotonism from A to B. We write p : A ≃ B if p is a recursive isotonism from A to B and A ≃ B if there is a recursive isotonism from A to B.
We claim that recursive isotonism is an equivalence relation. The identity map is recursive, hence recursive isotonism is reflexive. If p is a one-one partial recursive function, then p - 1 (defined, of course, only on pp) is also partial recursive (see [11], p. 177). It follows that if p : A ~ B, then p-l : B ~ A. It is clear that recursive isotonism is a transitive relation. We can now introduce our next definition. 1.1.3: If A = {B : B ~ A}, then A is said to be a constructive relation type. We write A = CRT(A). DEFINITION
DEFINITION I.1.4: A function f is said to be a recursive permutation if f is recursive and maps 𝒩 one-one onto itself. DEFINITION I.1.5: A relation A is said to be totally recursively isotonic to a relation B if there is a recursive permutation f which is a recursive isotonism from A to B. We write A ≈ B if A is totally recursively isotonic to B.
Again, totally recursive isotonism, is an equivalence relation; we write TRRT(A) = {B : B ~ A} and if A = TRRT(A) for some relation A, then A is said to be a total recursive relation type.
1.2. From now on we use upper case Roman letters for constructive relation types (C.R.T.s). The collection of all C.R.T.s will be denoted by 𝓡. THEOREM I.2.1: (i) If A is totally recursively isotonic to B, then A is recursively isotonic to B; and if A is recursively isotonic to B, then A is isotonic to B. (ii) There exist relations A, B such that A is isotonic to B but not recursively isotonic to B. (iii) There exist relations C, D such that C is recursively isotonic to D but not totally recursively isotonic to D.
PROOF. (i) Clear from definitions I.1.1, 2 and 5.
(ii) Let α
non-r.e. complement ii. Let
A = {(a, a') : a, a' s (X & a
B = {(b, b') : b, b' s ii & b
~
a'},
~
b'}.
Then A '" B, since A, B both represent well-orderings of type t». But A ~ B implies ii = f«(X) for some partial recursive function! This implies ii is r.e., contradicting the choice of (x. (iii) Let C = {(c, c') : 0 ~ c ~ c'}, and D = {(d, d') : I s d s d'}.
*
Then if f(n) = n+ 1, f: C ~ D. But C D, for if C ~ D by g, then g-l(O) is undefined, which is in contradiction with g being a recursive permutation.
Corollary 1.2.2. (i) RT(A);2 CRT(A);2 TRRT(A),(ii) There is a relation C such that RT(C) ::::> CRT(C) ::::> TRRT(C).
196
JOHN N. CROSSLEY
PROOF. (ii) Let C, D be as above in the proof of theorem 1.2.1, and let E = {<x,y): x::;; y&x,yeO"} where 0" is a non-Leo set. Then D e CRT(C) - TRRT(C) and E s RT(C) - CRT(C). Corollary 1. 2.2 shows that constructive relation types give a finer classification of (denumerable) relations than do (classical) relation types.
1.3. We observe that A ~ B if, and only if, A* ~ B* (and similarly for '" and ~). DEFINITION 1. 3.1: A* is said to be the converse of (the C.R.T.). A if A = CRT(A) and A* = {B : B ~ A*}. THEOREM 1. 3 . 2: (i) A * = {B* : B (ii) {A* : A e.?Jl} = .?Jl, (iii) A** = A.
~
A}
where
A
= CRT(A),
THEOREM 1.3.3: Let A ~ B, (f. = C'A and f3 = C'B. Then (i) (f. is r.e. +-4 f3 is r.e., (ii) (f. is immune +-4 f3 is immune, (iii) There exist relations A', B' such that C'A' is recursive, C'B' is not recursive and A' ~ B'. PROOF. (i), (ii) Left to the reader. (iii) Let f3 be a r.e, non-recursive set enumerated without repetitions by the recursive function ben). Let
B'
=
{
:i
s
j} and A' = {
s
j}.
Then h : A' ~ B'.
1.4. DEFINITION 1.4.1: A relation A is said to be recursive (r.e.) if there is a recursive function f(a, b) (f(a, b, c) such that
(= {x : (3z)j(x, x, z) = O} if A is Le.). DEFINITION 1.4.3: A relation A is said to be recursively isomorphic to a relation B if there is a recursive predicate L(X, y) such that, for some
isotonism, f, between A and B, I(X, y) ~ f(x) = y. In this case we write I : A == B and if there is such an I we write A == B. If A = {B : B == A} then A is said to be a recursive isomorphism type and we write A = RIT(A). Recursive isomorphism is an equivalence relation. Since (i) I : A == A if I(X, y) ~ X = y, (ii) if I : A == B, then 1* : B == A where I*(X, y) ~ I(Y, x), (iii) if I : A == Band K : B == C, then A : A == C where
A(X, y)
~
(3z) (I(X, z) & K(Z, y))
~
(V'z) (I(X, z)
--t
K(Z, y)),
since A is recursive by [9], theorem VI (p. 284). THEOREM 1.4.4: If A, B are recursive relations, then A == B if; and only if, A ~ B. PROOF. Suppose A, B are recursive relations and I : A == B. Then I(x,y) &1 (x, z) --t y = zby definitionI.4.3, and hence that thefunctionf, defined by f(x) = flyl(X, y), is partial recursive. C1early,jis an isotonism, Thus f: A ~ B. Conversely, suppose f : A ~ B. Then by theorem 1.4.2 IX = C'A and f3 = C'B are recursive. If IX or f3 = 0, then the assertion is trivial. Hence we may assume there exist numbers a e IX and b s f3 such that b = j(a). Set
I(X, y)
~ f(a{l...:...
cix)} + xcix)) = (b+ 1) (1...:... cpCy)) + ycp(y)
where cix) = 1 if x belongs to the recursive set y, = 0 otherwise. It is easily verified that I : A == B. This completes the proof. This theorem allows us, when discussing recursive relations, to work with partial recursive functions rather than with predicates. THEOREM 1. 4.5: There exist r.e. relations which are not recursively isotonic to any recursive relation. PROOF. Let
IX
be a r.e. non-recursive set and let
A = {<x,y) : x = y .v. x e IX &y = x+1}. Since IX is (infinite) r.e. there is a recursive one-one function f such that = f(f) (cf. [5], p. 73).
IX
Hence
<x, y) e A
~
(3z) (I x- y 1{ If(z)-x 1+ 1y-(x+ 1) I} = 0),
and it follows that A is a r.e. relation. Suppose p : A ~ B for some recursive relation B = {<x, y) : g(x, y) = O}; then, since C'A = J, p is total and hence recursive. pp = C'B is recursive by theorem 1.4.2. Therefore x e IX +-+ g(p(x), p(x + 1» = 0 which implies ex is recursive, which is a contradiction. We conclude that A is not recursively isotonic to any recursive relation. For a certain class of relations, however, each r.e, relation (of the class) is recursively isotonic to a recursive relation (in that class). We recall that a (reflexive) relation A is said to be a partial ordering if (i) <x, y) e A &
= 0).
Let ex = C'A. If ex is finite there is nothing to prove since then A is finite and hence recursive. Otherwise ex is infinite r.e. and there is a one-one recursive function, g, such that ex = g(J) (cf. [5], p. 73). Since A is linear, (Vx) (Vy) (3z) (f(g(x), g(y), z)
= 0 v f(g(y), g(x), z) = 0).
This is equivalent to (Vx) (Vy) (3z) (j(g(x), g(y), z)· f(g(y), g(x), z) = 0).
(1)
Since A is anti-symmetric, f(g(x), g(y), zo)
= 0 &f(g(y), g(x), Z1) = 0
-+
g(x) = g(y),
and hence x = y and f(g(y) , g(x), zo) = O. Now let B be the relation defined by <x, y) e B
+-+
f(g(x), g(y), liz {[(g(x), g(y), z) . f(g(y), g(x), z) = O})
Then B is recursive, by (1), and clearly g : B
~
A.
=
O.
II. Addition
II.1. If A and B are arbitrary relations, then the ordinal sum of A and B is defined by
A + B = A ∪ B ∪ (C'A × C'B).
If the ordinal sum of two relation types is defined as the relation type of the ordinal sum of arbitrary representatives of the given relation types, then this definition is not, in general, unique. This is because the fields of the representative relations may have non-empty intersection in some cases, depending on the choice of representatives. But if we define the relation type of the sum in terms of representatives which do have disjoint fields, then the definition is unique (cf. [19], pp. 341, 345 *160.48). Two relations are said to be strictly disjoint if their fields are disjoint. We observe that if A, B are reflexive relations, then A ∩ B = ∅ ↔ C'A ∩ C'B = ∅.
~
C'A n C'B = 0.
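As a small illustration (an editorial sketch, not part of the original text), the ordinal sum of two finite relations, each given as a set of ordered pairs, can be computed directly from this definition; the helper names below are hypothetical.

```python
# A minimal sketch (not from the paper) of the ordinal sum of two relations,
# each represented as a finite set of ordered pairs.

def field(rel):
    return {x for pair in rel for x in pair}

def ordinal_sum(a, b):
    # A (+) B = A u B u (C'A x C'B): every element of A's field precedes
    # every element of B's field.
    return a | b | {(x, y) for x in field(a) for y in field(b)}

# two strictly disjoint reflexive linear orderings, of types 2 and 1
A = {(0, 0), (1, 1), (0, 1)}
B = {(5, 5)}
print(sorted(ordinal_sum(A, B)))   # a linear ordering of type 2 + 1 = 3 on {0, 1, 5}
```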
Now, in order to define a constructive version of ordinal sum we require "constructive disjointness", i.e. C'A and C'B must be contained in sets which are "effectively disjoint". If this is not the case, then the following situation arises: let α be a r.e. non-recursive set and let β be a r.e. set containing α. Then there is no (partial) recursive function, defined on J, which agrees with f(x) = x on α and g(x) = x + 1 on β. Hence there can be no (partial) recursive function defined on C'(A ⊹ B) where A = α² and B = β², although A and B are strictly disjoint.
DEFINITION II.1.1: A is r.e. separable from B if there are disjoint r.e. relations A₁, B₁ such that A ⊆ A₁ and B ⊆ B₁. If A is r.e. separable from B we write A )( B.
Note. In general we shall be concerned only with r.e. separability and shall omit the qualification "r.e.".
DEFINITION II.1.2: A is recursively separable from B if there are disjoint recursive relations A₁, B₁ such that A ⊆ A₁ and B ⊆ B₁. If A is recursively separable from B we write A >< B.

THEOREM II.1.3: (i) A )( B → A* )( B*, (ii) A >< B → A* >< B*,
(iii) A >< B → A )( B, (iv) there exist relations A, B such that A )( B but not A >< B.
PROOF. (i) ((ii)) follows from the fact that the converse of a r.e. (recursive) relation is r.e. (recursive). (iii) Every recursive relation is r.e. (iv) We call two sets α, β r.e. (recursively) separable if the relations α², β² are r.e. (recursively) separable. Let 𝒵 be a consistent incomplete formal system containing formal arithmetic and let

T = {⟨x, y⟩ : x is (a Gödel number of) a proof of the sentence (with Gödel number) y},
T₀ = {x : (∃y)(⟨y, x⟩ ∈ T)},
R = {⟨x, y⟩ : x is (a Gödel number of) a proof of the negation of the sentence (with Gödel number) y} and
R₀ = {x : (∃y)(⟨y, x⟩ ∈ R)}.
Then T, R are both (primitive) recursive relations (though not reflexive relations [9], p. 252–5) and T₀ and R₀ are r.e. sets. Let 𝐓₀ = T₀² and 𝐑₀ = R₀². Then 𝐓₀ and 𝐑₀ are r.e. and disjoint (since 𝒵 is consistent), i.e. 𝐓₀ )( 𝐑₀. By [15], theorem 22 (p. 59), T₀ and R₀ are not recursively separable. Hence 𝐓₀ and 𝐑₀ are not recursively separable. Let A = 𝐓₀ and B = 𝐑₀ and (iv) is established.

THEOREM II.1.4: (i) A is r.e. (recursively) separable from B if, and only if, there are r.e. (recursive) sets α₁, β₁ such that C'A ⊆ α₁, C'B ⊆ β₁ and α₁ ∩ β₁ = ∅. (ii) If C'A or C'B is finite, then A >< B.

PROOF. (i) If A is r.e. (recursively) separable from B, then there are r.e. (recursive) relations A₁, B₁ such that A ⊆ A₁, B ⊆ B₁ and A₁ ∩ B₁ = ∅. Let α₁ = C'A₁, β₁ = C'B₁; then α₁, β₁ are r.e. (recursive) by theorem I.4.2. Since A₁, B₁ are reflexive, x ∈ α₁ ↔ ⟨x, x⟩ ∈ A₁, and similarly for β₁ and B₁. Hence A₁ ∩ B₁ = ∅ ↔ α₁ ∩ β₁ = ∅. Conversely, suppose there are r.e. (recursive) sets α₁, β₁ such that C'A ⊆ α₁, C'B ⊆ β₁ and α₁ ∩ β₁ = ∅. Let A₁ = α₁² and B₁ = β₁². Then A₁, B₁ are r.e. (recursive) and reflexive and they clearly r.e. (recursively) separate A and B. (ii) This part of the theorem follows at once from (i)
and the fact that every finite set is recursive and so is the complement of a finite set. The second version of part (i) of this theorem is false if the relations are not assumed to be reflexive. For let T, R be the relations defined in the proof of theorem II.1.3.(iv); and suppose that there exist recursive sets τ, ρ such that C'T ⊆ τ and C'R ⊆ ρ where τ ∩ ρ = ∅. Then the sets τ' = {x : x ∈ C'T and x is (a Gödel number of) a single formula} and ρ' = {x : x ∈ C'R and x is (a Gödel number of) a single formula} are recursively separable, by τ and ρ. But τ' = T₀ and ρ' = R₀, which contradicts [15] theorem 22. The converse assertion, namely, that if there exist disjoint recursive sets containing the fields of A and B, then A and B are recursively separable, still holds, of course.

THEOREM II.1.5: Let α = C'A, β = C'B; then A )( B if, and only if, there is a partial recursive function, p, such that

δp ⊇ α ∪ β, ρp ⊆ {0, 1}    (S)

and x ∈ α ∪ β implies x ∈ α ↔ p(x) = 0 .&. x ∈ β ↔ p(x) = 1.

PROOF. If A )( B, then by the preceding theorem there are r.e. sets α₁ ⊇ α and β₁ ⊇ β such that α₁ ∩ β₁ = ∅. For an arbitrary r.e. set γ let c'_γ(x) be the partial recursive function defined only on γ such that c'_γ(x) = 1 for x ∈ γ. Then x ∈ α₁ implies 1 ∸ c'_{α₁}(x) = 0 and x ∈ β₁ implies c'_{β₁}(x) = 1. Let T_α be a Turing machine which calculates 1 ∸ c'_{α₁} and let T_β be a Turing machine which calculates c'_{β₁}. Further, let T(m, n) = the number (represented) on the tape of the Turing machine T at the m-th step¹) in the calculation for argument n. Now let a new machine T₀ be defined such that T₀(m, n) is as follows: (i) If T_α, T_β have not halted before the m-th step for argument n, then T₀(2m+1, n) = T_α(m+1, n), (ii) if T_α has not halted before the (m+1)-st step and T_β has not halted before the m-th step, then T₀(2m+2, n) = T_β(m+1, n), (iii) if T_α halts at the m-th step and T_β has not halted before the m-th step, then T₀ halts at the (2m+1)-st step,

1) "Step" does not mean here just one operation of the Turing machine, but a whole phase in the calculation. We assume m ≥ 1.
(iv) if T_β halts at the m-th step and T_α has not halted before the (m+1)-st step, then T₀ halts at the (2m+2)-nd step. Let p(x) be the function defined by the machine T₀. Then p is partial recursive and satisfies the conclusion of the theorem since, for an argument in α ∪ β, T_α halts if, and only if, T_β does not. Conversely, let α₁ = {x : p(x) = 0} and β₁ = {x : p(x) = 1}. Then α₁ and β₁ are r.e. and disjoint; the required result follows from the preceding theorem.
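The interleaving of the two machines T_α and T_β is the familiar dovetailing technique. A minimal Python sketch of the same idea follows, with generators standing in for Turing machines; all names are illustrative assumptions, not the paper's.

```python
# Illustrative sketch of the interleaving argument above, with Python generators
# standing in for the two Turing machines.  Names are hypothetical.

def semi(enumerator):
    # semi-decision procedure for an r.e. set given by an enumerating function:
    # yields None while "running", yields True once the argument has appeared.
    def run(x):
        n = 0
        while True:
            if enumerator(n) == x:
                yield True
                return
            yield None
            n += 1
    return run

def separator(alpha1_enum, beta1_enum):
    # the partial recursive p of theorem II.1.5: p(x) = 0 if x lies in alpha1,
    # p(x) = 1 if x lies in beta1; undefined otherwise (here truncated by a bound).
    run_a, run_b = semi(alpha1_enum), semi(beta1_enum)
    def p(x, max_steps=10_000):
        a, b = run_a(x), run_b(x)
        for _ in range(max_steps):          # dovetail one step of each machine
            if next(a) is True:
                return 0
            if next(b) is True:
                return 1
        raise RuntimeError("no answer yet")  # p is only partial recursive
    return p

p = separator(lambda n: 2 * n, lambda n: 2 * n + 1)   # evens vs. odds
assert p(8) == 0 and p(7) == 1
```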
THEOREM II.1.6: If A, B are r.e. (recursive) relations, then

A )( B ↔ A ∩ B = ∅   (A >< B ↔ A ∩ B = ∅).
THEOREM II.1.7: Any two C.R.T.s have recursively separable representatives.
PROOF (v. [8], theorem 9(a)). Let A ∈ A and B ∈ B and let

C = {⟨2x, 2y⟩ : ⟨x, y⟩ ∈ A},   D = {⟨2x+1, 2y+1⟩ : ⟨x, y⟩ ∈ B}.

Then C ≃ A, D ≃ B and C >< D.
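A small sketch of this doubling trick (illustrative only; the helper names are hypothetical):

```python
# Sketch of theorem II.1.7: move one representative onto the even numbers and
# the other onto the odds, so the images are recursively separable.

def double_even(rel):
    return {(2 * x, 2 * y) for (x, y) in rel}

def double_odd(rel):
    return {(2 * x + 1, 2 * y + 1) for (x, y) in rel}

A = {(0, 0), (0, 3), (3, 3)}          # some relation
B = {(0, 0), (0, 1), (1, 1)}          # another, with overlapping field
C, D = double_even(A), double_odd(B)

fields = lambda r: {z for pair in r for z in pair}
assert fields(C) <= {0, 2, 4, 6} and fields(D) <= {1, 3, 5, 7}
# parity is a recursive test, so C and D lie inside the disjoint recursive
# sets of evens and odds, i.e. C and D are recursively separable.
```

The maps x → 2x and x → 2x+1 are recursive isotonisms, which is why C ≃ A and D ≃ B.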
II.2. THEOREM II.2.1: Let A₁ ≃ A₂, B₁ ≃ B₂, A₁ )( B₁ and A₂ )( B₂; then A₁ ⊹ B₁ ≃ A₂ ⊹ B₂.
PROOF. Let αᵢ = C'Aᵢ, βᵢ = C'Bᵢ (i = 1, 2). By hypothesis there exist recursive isotonisms p, q such that p : A₁ ≃ A₂ and q : B₁ ≃ B₂. Further, there are r.e. sets αᵢ', βᵢ' (i = 1, 2) such that αᵢ ⊆ αᵢ', βᵢ ⊆ βᵢ' and αᵢ' ∩ βᵢ' = ∅. Let p₁ be the partial recursive function with domain δp ∩ α₁' which is equal to p on δp ∩ α₁', and let p₂ be the partial recursive function with range ρp₁ ∩ α₂' which is equal to p₁ on δp₁. Let q₁, q₂ be the partial recursive functions whose definitions are obtained by replacing p by q and α by β in the preceding sentence. Then δp₂ ∩ δq₂ = ∅ and ρp₂ ∩ ρq₂ = ∅. Hence r : A₁ ⊹ B₁ ≃ A₂ ⊹ B₂, where r is the partial recursive function which is equal to p₂ on its domain and equal to q₂ on its (disjoint) domain; r is one-one since ρp₂ ∩ ρq₂ = ∅ and p₂, q₂ are one-one. The other requirements are obviously satisfied. By virtue of this theorem we can now define addition of C.R.T.s uniquely as follows:
DEFINITION II.2.2: A + B = CRT(A ⊹ B) where A ∈ A, B ∈ B and A )( B.
Notation. 0 = CRT(∅). We write "A+B" for "A ⊹ B" when A )( B.

THEOREM II.2.3: (i) A + 0 = 0 + A = A, (ii) A + B = 0 ↔ A = 0 = B, (iii) (A+B)* = B* + A*.

PROOF of (ii). Let A ∈ A, B ∈ B where A )( B. Then A + B = 0 implies A ∪ B ∪ (C'A × C'B) = ∅. Hence A = B = 0.
THEOREM II.2.4: + is associative, viz. for all A, B, C ∈ ℛ, A + (B+C) = (A+B) + C.

PROOF. By definition II.2.2 there exist A ∈ A, B ∈ B and C ∈ C such that B )( C and A )( (B+C). Now the latter implies A )( B and A )( C, hence A+B is defined, (A+B) )( C and (A+B)+C is well-defined. We leave the reader to verify that A+(B+C) = (A+B)+C. As in the classical case addition is not commutative in general.

II.3. We can now introduce two relations on the collection ℛ of all C.R.T.s. These relations are reflexive and transitive, i.e. are quasi-orderings. Later (§§ III, IV) we shall show that the former of these two quasi-orderings is anti-symmetric on a sub-collection of ℛ and is a partial well-ordering of C.R.T.s of well-orderings.

DEFINITION II.3.1: A ≤ B if there is a C.R.T. C such that A + C = B. A < B if there is a C.R.T. C ≠ 0 such that A + C = B.

A < B is not, in general, equivalent to A ≤ B & A ≠ B. For let A = {⟨x, y⟩ : y ≤ x}, B = A ↾ (J − {0}), A = CRT(A) and B = CRT(B). Then A, B are both of classical order type ω* and clearly B + 1₁ = A, where 1₁ = CRT({⟨0, 0⟩}). But A ≃ B under the map x → x+1, hence A = B.

DEFINITION II.3.2: A ≤* B if there is a C.R.T. C such that C + A = B.

We shall refer to "≤" as "the ordering by initial segments" and "≤*" as "the ordering by final segments".
THEOREM II.3.3: (i) A ≤ A, (i)* A ≤* A, (ii) 0 ≤ A, (ii)* 0 ≤* A, (iii) A ≤ 0 → A = 0, (iii)* A ≤* 0 → A = 0, (iv) A ≤ B & B ≤ C → A ≤ C, (iv)* A ≤* B & B ≤* C → A ≤* C, (v) B ≤ C → A+B ≤ A+C, (v)* A ≤* B → A+C ≤* B+C.
PROOF. (i)–(iii)* follow from theorem II.2.3; (iv)–(v)* follow from theorem II.2.4.

Corollary II.3.4. ≤ and ≤* are quasi-orderings of ℛ.

THEOREM II.3.5: There exist C.R.T.s of well-orderings, A, B, say, such that A ≤ B but not A ≤* B.
PROOF (as in the classical case). Let A be the natural ordering (by magnitude) on J − {0} and let A = CRT(A). Let 1₁ be as in § II.3; then, setting B = 1₁, clearly A = B + A and B ≤ A. But we cannot have B ≤* A, since A has no last element and C + B has a last element for every (separable) C.
II.4. We introduce some notation.

∑_{i=0}^{0} Aᵢ = A₀,   ∑_{i=0}^{n+1} Aᵢ = (∑_{i=0}^{n} Aᵢ) + A_{n+1};

α.0 = ∅,  α.n = {j(a, m) : m < n & a ∈ α},  α.ω = {j(a, n) : n ∈ J & a ∈ α};

A.0 = ∅,  A.n = {⟨j(a, m), j(a', m')⟩ : m < m' < n & a, a' ∈ C'A .v. m = m' < n & ⟨a, a'⟩ ∈ A},

A.ω = {⟨j(a, m), j(a', m')⟩ : m < m' & a, a' ∈ C'A .v. m = m' & ⟨a, a'⟩ ∈ A};

A.0 = 0,  A.(n+1) = A.n + A,  A.ω = CRT(A.ω) for A ∈ A.
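A concrete, purely illustrative rendering of the A.n construction follows; the Cantor pairing function is used here as a stand-in for the paper's pairing function j, and the helper names are hypothetical.

```python
# Sketch of the A.n construction with a concrete pairing function standing in
# for j (any recursive pairing with inverses k, l would do).

def j(a, m):                     # Cantor pairing (an assumption, not the paper's j)
    return (a + m) * (a + m + 1) // 2 + m

def field(rel):
    return {x for pair in rel for x in pair}

def dot_n(A, n):
    # A.n: n consecutive copies of A, the m-th copy carried on the codes j(a, m);
    # elements of earlier copies precede all elements of later copies.
    C = field(A)
    earlier = {(j(a, m), j(b, m2)) for m in range(n) for m2 in range(m + 1, n)
               for a in C for b in C}
    copies = {(j(a, m), j(b, m)) for m in range(n) for (a, b) in A}
    return earlier | copies

A = {(0, 0), (0, 1), (1, 1)}          # a 2-element linear ordering
assert len(field(dot_n(A, 3))) == 6   # A.3 has a 6-element field (type 2.3)
```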
Part (ii) of the following theorem shows that it is immaterial which element A of A we use to define A. w.
THEOREM II.4.1: (i) A ≃ B → A.n ≃ B.n, (ii) A ≃ B → A.ω ≃ B.ω, (iii) A.n ∈ A.n, (iv) 0.n = 0, (v) A.(m+n) = A.m + A.n, (vi) A.(mn) = (A.m).n, (vii) A.ω = A + A.ω, (viii) (A.n).ω = A.ω, (ix) if n > 0, then A.n = 0 ↔ A.ω = 0 ↔ A = 0, (x) m ≤ n → A.m ≤ A.n, (xi) m ≤ n → ∑_{i=0}^{m} Aᵢ ≤ ∑_{i=0}^{n} Aᵢ.
PROOF. The proofs of the various parts of this theorem are elementary and we only prove parts (ii), (viii) as examples, leaving the other parts to the reader.
(ii) Suppose p : A ≃ B; then q : A.ω ≃ B.ω where q(z) = j(pk(z), l(z)).
(viii) Let A ∈ A; then A.ω belongs to A.ω by part (ii). Let q(x), r(x) be the (primitive) recursive functions such that

x = nq(x) + r(x) and 0 ≤ r(x) < n,

and let p(x) = j(j(k(x), r(l(x))), q(l(x))). Then p is one-one and (primitive) recursive. We assert that p is relation preserving between A.ω and (A.n).ω, for

(A.n).ω = {⟨j(j(a, s), u), j(j(b, t), v)⟩ : (u < v .v. u = v & s < t) & a, b ∈ C'A .v. u = v & s = t & ⟨a, b⟩ ∈ A},    (1)

and

⟨x, y⟩ ∈ A.ω ↔ x = j(a, nq+r) & y = j(a', nq'+r'), where 0 ≤ r, r' < n, and

nq + r < nq' + r' & a, a' ∈ C'A  or  nq + r = nq' + r' & ⟨a, a'⟩ ∈ A.    (2)

Condition (2) is equivalent to:

(q < q' .v. q = q' & r < r') & a, a' ∈ C'A .v. q = q' & r = r' & ⟨a, a'⟩ ∈ A.    (3)
Comparison of (1) and (3) immediately shows that p is relation preserving. This completes the proof.

THEOREM II.4.2: (i) A )( B → A.ω )( B.ω, (ii) (A+B).n + A = A + (B+A).n, (iii) (A+B).(n+1) = A + (B+A).n + B, (iv) (A+B).ω = A + (B+A).ω.

PROOF. (i) Let p be a partial recursive function satisfying the requirements (S) in theorem II.1.5 for A and B. Now x ∈ C'A.ω ↔ k(x) ∈ C'A, and similarly for B.ω. Hence if x ∈ C'A.ω ∪ C'B.ω, then x ∈ C'A.ω ↔ pk(x) = 0 .&. x ∈ C'B.ω ↔ pk(x) = 1. (ii), (iii) Proof by induction on n using the associativity of addition (theorem II.2.4). (iv) Let A ∈ A, B ∈ B, α = C'A and β = C'B where A )( B. It is easily verified that (A+B).ω = (A; 0) + C where

C = ∑_{m<ω} {(B; m) ∪ (A; m+1) ∪ [j(β, m) × j(α, m+1)] ∪ [j(α, m+1) × j(β, m+1)]}.

We construct a recursive isotonism p such that p : C ≃ (B+A).ω. Suppose α₁, β₁ are r.e. sets separating A and B (v. theorem II.1.4.(i)). Let p be the partial recursive function defined only on the r.e. set (α₁.ω ∪ β₁.ω) − j(α₁, 0) by

p(z) = z if z ∈ β₁.ω,
p(z) = j(k(z), l(z) ∸ 1) if z ∈ α₁.ω − j(α₁, 0).

By part (i) p is well-defined. It is then readily verified that p has the required properties.

II.5.
LEMMA II.5.1: (SEPARATION LEMMA.) If A = B + C and A ∈ A, then there are relations B ∈ B and C ∈ C such that B )( C and B + C = A.

PROOF. Suppose A ∈ A and A = B + C. Then there are relations B' ∈ B and C' ∈ C such that B' )( C' and, for some f, f : B' + C' ≃ A. Let B = f(B') and C = f(C'). Clearly, B ∈ B and C ∈ C. By theorem II.1.5, there is a partial recursive function, p, such that δp ⊇ C'B' ∪ C'C', ρp ⊆ {0, 1} and if x ∈ C'B' ∪ C'C' then x ∈ C'B' ↔ p(x) = 0 .&. x ∈ C'C' ↔ p(x) = 1. If y ∈ C'B ∪ C'C then, since f is one-one, there is a unique x ∈ C'B' ∪ C'C' such that y = f(x). Thus pf⁻¹ is partial recursive, δ(pf⁻¹) ⊇ C'B ∪ C'C, ρ(pf⁻¹) ⊆ {0, 1} and if x ∈ C'B ∪ C'C then x ∈ C'B ↔ pf⁻¹(x) = 0 .&. x ∈ C'C ↔ pf⁻¹(x) = 1. Hence by theorem II.1.5, B )( C. Clearly B + C = A, thus the proof is complete.
THEOREM II.5.2: (DIRECTED REFINEMENT THEOREM.) If A + C = B + D then there is an E such that either A = B + E and E + C = D, or A + E = B and C = E + D.

PROOF. If D = 0, let E = C. Otherwise we may assume D ≠ 0. Let A ∈ A and C ∈ C where A )( C. Then by the Separation Lemma (II.5.1) there are relations B ∈ B and D ∈ D such that B )( D and A + C = B + D. Let α = C'A, β = C'B, γ = C'C and δ = C'D. Then α ∪ γ = β ∪ δ.
Case 1. If δ ∩ α ≠ ∅, let η = α − β and E = D ↾ η. By construction, α = β ∪ η and η ⊆ δ. Therefore B )( E and E )( C. Now, B ⊆ A + C and E ⊆ A + C; further, x ∈ β and y ∈ η imply y ∈ δ and ⟨x, y⟩ ∈ β × δ ⊆ A + C. Thus B + E ⊆ (A + C) ↾ α = A. Conversely, if ⟨x, y⟩ ∈ A then either (i) x, y ∈ β, (ii) x, y ∈ η or (iii) x ∈ β & y ∈ η. (We cannot have x ∈ η & y ∈ β since (η × β) ∩ (B + E) = ∅ and η ⊆ δ.) In all these three cases ⟨x, y⟩ ∈ B + E. Hence A ⊆ B + E, and therefore A = B + E. Similarly, E + C = D. Set E = CRT(E) and the theorem follows in this case.
Case 2. If δ ∩ α = ∅, then β ∩ γ ≠ ∅ or B = 0. In the former case the existence of an E such that A + E = B and C = E + D is proved as in case 1 except that all occurrences of "A" are replaced by "B" and all those of "C" by "D" and vice versa (with corresponding changes in the associated Greek letters), and in the latter case put E = 0. This completes the proof.
THEOREM II.5.3: If B.ω = A + C and C ≠ 0, then there exist n, D, E such that

A = B.n + D, D + E = B and E + B.ω = C.

PROOF. We consider only the non-trivial case where A, B, C are all non-zero. Let B ∈ B; then by the Separation Lemma (II.5.1) there exist A ∈ A and C ∈ C such that A )( C and B.ω = A + C. By assumption C ≠ 0 ≠ A, hence there is a c ∈ C'C where c = j(b, n') for some b ∈ C'B and some n' ≥ 0. Therefore η = {l(a) : a ∈ C'A} is a set of natural numbers bounded by n'. Let n be the maximum number in η and let D = A ↾ j(β, n), E = C ↾ j(β, n). Then, as in the proof of the Directed Refinement Theorem, it is easily verified that D )( E and

A = B.n + D, D + E = B and E + B.ω ↾ {x : l(x) > n} = C.    (1)

We observe that B.ω ↾ {x : l(x) > n} ≃ B.ω under the map p : x → j(k(x), l(x) ∸ (n+1)) defined only on {x : l(x) > n}. Taking C.R.T.s of both sides of the equations in (1) completes the proof.
Note. There is no obvious link between theorems II.5.2 and II.5.3 for the following reason. We know that B.n + B.ω = B.ω for all n (by theorem II.4.1.(vii) and induction). Suppose B.ω = A + C; then by theorem II.5.2 it easily follows that for each n, either A ≤ B.n or B.n ≤ A. If A ≤ B.n for some n, then we are through, but A ≥ B.n for all n does not¹) imply A ≥ B.ω, nor is ≤ anti-symmetric on ℛ.
We sum up the properties of ≤ in the following theorem.

THEOREM II.5.4: The relation ≤ is a quasi-ordering on ℛ and satisfies the following tree condition:

A ≤ C and B ≤ C imply A ≤ B or B ≤ A.

PROOF. By definition II.3.1, if A ≤ C and B ≤ C then there exist D, E such that A + D = C and B + E = C. Hence by theorem II.5.2 there is an F such that either

A + F = B and D = F + E    (1)

or

A = B + F and F + D = E.    (2)
1) For it follows from theorem IV.4.5 that if B = 1 and A = V, then A > B.n for all n, but not A ≥ B.ω (= W).
In case (1), A ≤ B and in case (2), B ≤ A. ≤ is a quasi-ordering of ℛ by corollary II.3.4.

III. Quords

III.1. We now commence our study of proper subsets of ℛ.
DEFINITION III.1.1: A C.R.T. A is said to be a constructive order type (C.O.T.) if there is an A ∈ A which is a linear ordering. Since C.R.T.s are subsets of the corresponding (classical) relation types, if A is a constructive order type then every relation A ∈ A is a linear ordering.
DEFINITION III.1.2: A sequence {aᵢ}ᵢ₌₀^∞ in the field of a linear ordering relation A is said to be an infinite recursive descending chain if the function λi·aᵢ is recursive and for all i,
if, and only if, it contains no splinter.¹)

PROOF. Let A be a linear ordering and suppose that {gᵢ}ᵢ₌₀^∞ is an infinite recursive descending chain in A. Define f, a as follows: a = g₀, f(n) = g(1 + μy{g(y) = n})²) (v. [18], p. 33). Then {fⁱ(a)}ᵢ₌₀^∞ is a splinter in A.

1) This use of the word "splinter" is derived from that in [18].
2) By our convention (v. Terminology and notation) we write gᵢ for the value of g at i.
Conversely, suppose that {fⁱ(a)}ᵢ₌₀^∞ is a splinter in the linear ordering A. Let g be the function defined by

g(0) = a, g(n+1) = f(g(n)).

Then g is totally defined and computable, hence recursive, i.e. {gᵢ}ᵢ₌₀^∞ is an infinite recursive descending chain in A.

THEOREM III.1.6: If A is a quasi-well-ordering and f is a one-one partial recursive function such that f : A ≃ A, then f is the identity map on C'A.
PROOF. (This proof is essentially that in [14], p. 264.) If f ≠ 1 on C'A then there is an a ∈ C'A such that f(a) ≠ a. Since A is a linear ordering, either ⟨f(a), a⟩ ∈ A or
DEFINITION III.2.2: A C.O.T. A is said to be a quord if there is an A ∈ A which is a quasi-well-ordering. We write 𝒬 for the collection of all quords. It follows at once from example III.2.1 that there are some quords which are not C.R.T.s of well-orderings, though we shall show later (§ IV) that the cardinal numbers of quords and of C.O.T.s of well-orderings are the same (namely c, theorem IV.3.3). We shall see that quords possess many additive and multiplicative properties analogous to those of classical ordinals.
THEOREM III.2.3: A is a quord if, and only if, every A ∈ A is a quasi-well-ordering.
PROOF. By definition III.2.2 there is a B ∈ A which is a quasi-well-ordering. Let A be any other relation in A; then there is an f such that f : A ≃ B. Suppose that A is not a quasi-well-ordering; then there is an infinite recursive descending chain {aᵢ}ᵢ₌₀^∞ in A. But then {f(aᵢ)}ᵢ₌₀^∞ is an infinite recursive descending chain in B since λi·f(aᵢ) is totally defined. This contradicts our assumption and we conclude that A is a quasi-well-ordering. The converse is trivial.

THEOREM III.2.4: (i) 0 ∈ 𝒬, (ii) A = B + C implies (A ∈ 𝒬 ↔ B, C ∈ 𝒬), (iii) A ∈ 𝒬 ↔ (∃n)(n ≠ 0 & A.n ∈ 𝒬) ↔ (∀n)(A.n ∈ 𝒬) ↔ A.ω ∈ 𝒬.

PROOF. Left to the reader.
Let 𝒜 be a collection of C.R.T.s; then if we define A ≤_𝒜 B to mean (∃C)(C ∈ 𝒜 & A + C = B), then ≤ is absolute for quords in a certain sense, by the following corollary.

Corollary III.2.5. If A, B ∈ 𝒬, then A ≤ B ↔ A ≤_𝒬 B.
PROOF. Immediate from theorem III.2.4.(ii).
THEOREM III.2.6: If A is a quord, then A = B + A + C → C = 0.

PROOF. Suppose A is a quord and A = B + A + C where C ≠ 0. Then there exist quasi-well-orderings A ∈ A, B ∈ B and C ∈ C and a recursive isotonism f such that B + A + C is well-defined and f : B + A + C ≃ A. Since C ≠ 0, C ≠ ∅ and hence there is an element c ∈ C'C and for this element, f(c) ∈ C'A. Now A )( C, hence C'A ∩ C'C = ∅ and c ≠ f(c). But
Corollary III.2.7. If A is a quord, then A + B = A ↔ B = 0.
Corollary III.2.8. If A is a quord, then B < A ↔ B ≤ A & B ≠ A.
Corollary III.2.9. If A or B is a quord, then A ≤ B & B ≤ A → A = B.
PROOF. By hypothesis there exist C, D such that A + C = B and B + D = A. Hence A = (A+C)+D = A+(C+D) and B = B+(D+C), and if A is a quord, then C, D are quords by theorem III.2.4.(ii) and C = D = 0 by corollary III.2.7 and theorem II.2.3; similarly if B is a quord.
Corollary III.2.10. If A or B is a quord, then A ≤ B & B ≤* A → A = B.
PROOF. By hypothesis, there exist C, D such that A + C = B and D + B = A. If A is a quord, then A = D + A + C and by theorem III.2.6, C = 0; hence A = B. Similarly if B is a quord.
Corollary III.2.11. If A is a quord, then A + B = A + C ↔ B = C.
PROOF. Suppose A + B = A + C; then by the Directed Refinement Theorem (II.5.2) there is an E such that either A = A + E and E + B = C, or A + E = A and B = E + C. In either case, by corollary III.2.7, E = 0 and B = C. The converse is trivial.
Corollary III.2.9 establishes that z; is a partial ordering of !2. We shall show later (Theorem IV. 2.6) that :::;; is a partial well-ordering of the collection of all C.R.T.s of well-orderings. :::;; is not a partial wellordering of !2 as is shown by the example below. Example III. 2. 13. Let A be as in example III.2.1 and suppose A = {
n.
Corollary III. 2. 10 may be regarded as a constructive analogue of the following theorem attributed to Lindenbaum (given in [14], p. 248): "If an order type A is an initial segment of an order type B and the order type B is a final segment of the order type A, then A = B." By corollaries III. 2.9 and 10, A = B is equivalent (in !2) both to A ::;; B & B :::;; A and to A :::;; B & B :::;; * A. But it follows from the existence of quords incomparable with respect to :::;; (see below § IV. 4)
that ≤* is not anti-symmetric on 𝒬.¹) For let A, B be two incomparable quords and let C = (A+B).ω and D = B + (A+B).ω. Then clearly C ≤* D and D ≤* C. If C = D, then by theorems II.4.1.(vii) and II.5.4 it easily follows that A and B are comparable, contradicting our assumption.
IV. Co-ordinals IV.l. In this section we establish some properties of co-ordinals which are the C. R. T.s of well-orderings. We regard classical ordinals as relation types (v. § 1.1) of (denumerable) well-orderings. Hence co-ordinals are sub-classes of the corresponding (classical) ordinals. We use upper case Greek letters for classical ordinals (and variables over the denumerable classical ordinals). Notation. 10 = 0, 11 = CRT {
I:, 0
°
0
PROOF. We prove only part (viii), leaving the other parts to the reader. (viii) A + 1ₙ = B + 1ₙ → (A + 1ₙ)* = (B + 1ₙ)* → 1ₙ* + A* = 1ₙ* + B* (by theorem II.2.3) → 1ₙ + A* = 1ₙ + B* (by part (ii)) → A* = B* (by (vi)) → A = B.
DEFINITION IV. I .2: A C. O.T. is said to be finite (or a finite co-ordinal) if it is In for some n. We remark that this definition corresponds to the classical definition of finite sets as sets which are inductive. A search for an analogue of Dekker and Myhill's Isols (v. [8]) proved abortive. 1) This example is based on that in [17] p. 25.
THEOREM IV. 1.3: Any two linear orderings with fields of the same finite cardinal are totally recursively isotonic. PROOF. Since any two finite sets of the same cardinal can be mapped onto each other in a one-one manner by a recursive permutation (any permutation of the natural numbers which interchanges only a finite number of numbers is recursive), and since every finite linearly ordered set is well-ordered and so is its converse, it follows that any two linearly ordered sets of the same finite cardinal are totally recursively isotonic. IV.2. DEFINITION IV. 2.1: If A is the C.R.T. of a well-ordering, then A is said to be a co-ordinal. We let 'fJ denote the collection of all co-ordinals. If E is a classical ordinal such that E :2 A, then E is said to be the (classical) ordinal of A and we write E = 1 A I. Corollary IV.2.2. If A is a co-ordinal then As; then A = I A I.
1
A
I
and if A is finite
THEOREM IV. 2.3: (i) 0 e -e, (ii) A = B+ C implies A e'fJ ...... B, C s 'fJ, (iii) A s 'fJ ...... (3n) (n "# 0 & A.n s 'fJ) +-+ (Vn) (A.n s 'fJ) ...... A.w s 'fJ. Corollary IV.2. 4. If A, Be 'fJ, then A s B ...... A s 'lB. Thus ~ is absolute for co-ordinals as well as for quords (cf. corollary III.2.5).
LEMMA IV.2.5: |A + B| = |A| + |B|.
THEOREM IV. 2.6: ~ is a partial well-ordering of'fJ and satisfies the tree condition (v. theorem II.5.4). PROOF. As we remarked earlier (§ 111.2) every co-ordinal is a quord, hence ~ is a partial order of'fJ by corollary 111.2.9. Further, ~ satisfies the tree condition by theorem 11.5.4 and corollary IV. 2.4. Now suppose {AJ?= 0 is an infinite descending chain of co-ordinals and let E j = I A j I for each i. Then, by lemma IV. 2.5, {E j }?= 0 is an infinite descending chain of (classical) ordinals under the natural ordering of ordinals. This is impossible, hence ~ is a partial well-ordering of 'fJ.
Corollary IV.2. 7. If A is a co-ordinal, B:::; A and C:::; A and B = C.
1B I = I C I, then
PROOF. Since :::; is a tree ordering, either B :::; C or C :::; B. Suppose B < C, then there is an E #= 0 such that B + E = C. Therefore, by lemma IV. 2.5, I B I + I E I = I C I and I E I #= 0; thus I B I < I C I where < denotes the classical ordering of ordinals. Similarly we cannot have I C I < I B I· Corollary IV.2.8. If A is a co-ordinal, then &(A)
= {B : B < A} and &+(A) = {B : B :::; A}
are well-ordered by :::;. IV. 3. DEFINITION IV. 3 . 1: A co-ordinal A is said to be infinite if, for some A e A, C'A is infinite. It follows at once that a co-ordinal is infinite if, and only if, it is not finite. THEOREM IV. 3 .2: A co-ordinal A is infinite ff, and only if, In < A for all n. PROOF. Let A e A and suppose that In < A for all n. Then by the Separation Lemma (II. 5 . I), for each n there exist Bn such that An + Bn = A, where
An = {
:
i :::; j
< n} and A = {<ar, a,j) : T :::; Ll < I A I}.
It follows at once that C'A 2 {a i
: i s $} and that C'A is infinite. Conversely, if A is infinite, then using theorem II. I .4. (ii) one easily shows that In < A for all n. We leave the details to the reader.
Notation. By virtue of theorems IV. 1. I and IV. 3.2 we now write "n" for "In" and "$" for "{In: n is a natural number}" where there is no danger of confusion.
THEOREM IV.3.3: (i)$ c Cfl c fl c!3£, (ii) The cardinalities ofCfl, fl,!3£ are all c (the cardinal of the continuum). PROOF. (i) Every finite linearly ordered set is well-ordered, hence
𝒩 ⊆ 𝒞. There exist infinite well-ordered sets, hence 𝒩 ≠ 𝒞. By example III.2.1 there exist quasi-well-orderings which are not well-orderings but every well-ordering is a quasi-well-ordering, hence 𝒞 ⊂ 𝒬. Finally, let W* be the converse of the natural ordering of the natural numbers and let W* = CRT(W*). Clearly, W* is not a quord. Hence 𝒬 ⊂ ℛ. (ii) The cardinality of ℛ is ≤ 2^ℵ₀ since any C.R.T. is an equivalence class of subsets of J². But 2^ℵ₀ = c; thus in order to prove (ii) it suffices to prove that the cardinality of 𝒞 is ≥ c. Now every equivalence class of well-orderings contains at most ℵ₀ well-orderings since there are only ℵ₀ recursive isotonisms. Further, there are at least c distinct well-orderings of subsets of J. Hence, if x is the number of elements of 𝒞, then ℵ₀·x ≥ c and it follows, using the axiom of choice, that x ≥ c. This completes the proof.
As in the classical case subtraction does not playa major role, but we introduce the notion now for notational convenience. By corollary III. 2. 11, if A = B + C then C is uniquely determined by A and B; it follows by theorem IV. 3 . 3 that the same is true for co-ordinals. Hence the following definition gives a unique value for A - B (which by Theorem IV. 2. 3(ii) is a co-ordinal if B, A are co-ordinals). DEFINITION IV. 3 .4: If A ~ B and (A and) Bare quords, then A - B is the unique C such that A = B + C. THEOREM IV. 3 . 5: If A, B, Care quords, then (i) A-A = 0,
(ii) (A+B)-A = B, (iii) if B :0;: A, then B+(A-B) = A, (iv) if A+B :0;: C, then C-(A+B) = (C-A)-B. PROOF OF (iv). A + B
:0;:
C
-+
(ElD) (C = A + B + D), hence
C-(A+B) Also, C-A
= B+D and
=
D.
therefore
(C-A)-B = D. THEOREM IV. 3 .6: If I A I is a successor number A + m, where A is a limit number, then for each n there is a unique BII which is comparable with A and of classical ordinal A+n; further, B; = A ± I m-n I (where
I m - n I is the modulus of m - n and the as En < A or En ;:::: A).
+ or
I
- sign is taken according
This theorem follows at once from the fact that if A e A then A has a final segment of type m which is finite, and hence, by theorem II. 1 .4. (ii), separable from its complement in A. We leave the details to the reader. It follows from this theorem that every co-ordinal has a unique successor. We shall show later (§ V) that limits of strictly increasing sequences of co-ordinals are never uniquely determined by such sequences without other conditions. IVA. THEOREM IV .4.1: For each limit number A there exist ceo-ordinals
of ordinal A.
PROOF. There are c distinct infinite subsets of .Y. Let each of these be well-ordered with ordinal A (this is possible since A is denumerable). Then these c subsets are spread among, say, x equivalence classes containing at most No members each since there are only No recursive isotonisms. Hence No.X = c and therefore (using the axiom of choice)
x = c.
Corollary IVA. 2. There are c co-ordinals
V~
such that
I V~ I = w.
Notation. W = {⟨x, y⟩ : x, y ∈ J & x ≤ y} and W = CRT(W); V = {⟨x, y⟩ : x, y ∈ ρ & x ≤ y} and V = CRT(V).
DEFINITION IV. 4.3: W is said to be the standard well-ordering of type w, W is the standard w-co-ordinal; V is called the generic counterexample.i) THEOREM IV .4.4: (i) 1 + W = W, (ii) 1 + V"# V. PROOF. (i) It is easily verified that W = I. w, hence by theorem 1I.4.1.(vii) with A = 1, 1+ W = W. (ii) Suppose 1 + V = V. Since p is r.e. non-recursive, p is non-empty, 1) Since most of our counterexamples are based on V.
say ao s p. Therefore there is a recursive isotonism
f such that
»:m
f: {(ao, ao)}+V ~ V. Hence V = {(r(ao),f"(a o
~
n}
and g : W ≃ V where g(n) = fⁿ(a₀). But then g enumerates ρ in order of magnitude, and it follows by Post's lemma ([13], p. 291) that ρ is recursive, contradicting our choice of ρ.
Corollary IV.4.5. V and W are incomparable.
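The step from g to the recursiveness of ρ is Post's lemma: a set enumerated in strictly increasing order by a recursive function is recursive. A minimal illustrative sketch follows (the example set is hypothetical, not the paper's ρ).

```python
# Illustrative sketch (not the paper's construction): if a recursive g lists a
# set rho in strictly increasing order, then rho is recursive.  This is the
# point behind Post's lemma as used above.

def g(n):                      # increasing recursive enumeration of some set rho
    return n * n + 3           # rho = {3, 4, 7, 12, 19, ...}

def in_rho(x):
    # decide membership: enumerate in increasing order until x is reached or passed
    n = 0
    while g(n) < x:
        n += 1
    return g(n) == x

assert in_rho(7) and not in_rho(8)
```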
PROOF. By the theorem V '" W. But V < Wor W < V implies Vor W (respectively) is finite. This corollary shows that there exist incomparable co-ordinals, and hence, that there are incomparable quords. It therefore completes the demonstration (end of § 111.2) that ~ * is not anti-symmetric (even on ..2). THEOREM IV.4.6: There exist co-ordinals A, B, C such that A < B but A+C $ B+C. PROOF. Let A = I, B = V and C = W. Then by theorem IV.3.2, A < B, and by theorem IV.4.4.(i)A+C = C. If A+C ~ B+C, then C ~ B + C and B ~ B + C. Hence by theorem IV. 2. 6, Band Care comparable which contradicts corollary IV.4. 5. We now consider important classes of co-ordinals for which the law (+) A < B - A + C ~ B+ C does hold. In fact, if A, B, C are predecessors of the same principal number for addition then (+) holds. Classically, a principal number for addition, otherwise called a y-number ([I], p. 67) or a prime component ([14], p. 279), may be defined as an ordinal II '" 0 satisfying one of the three (equivalent) conditions (l C)-(3C) below. r
r,
+ L1 = II L1
-+
L1
=0
or L1
< II - r + L1 < II
= II
We consider constructive analogues of these conditions, viz.: B < A - B+A = A B + C = A -+ C = 0 or C B, C < A _ B + C < A
=
(I),
A
(2), (3).
THEOREM IV. 4.7: If A is a co-ordinal #- 0, then (i) (1) +-+ (2), (ii) (1) ~ (3), (2) ~ (3), (iii) (3) -+-+ (1), (3) -+-+ (2). PROOF. (i) Suppose B+C = A and (1) holds. Then if C #- 0, B < A, and by (1), B + A = A. Hence by corollary III . 2 . 11, A = C. Conversely suppose (2) holds and B < A. Then there is a C#-O such that B+ C = A and by (2) we have C = A, hence B + A = A. (ii) Suppose (1) holds and B, C < A. Then B+A = A and C+A = A, hence (B+C)+A = B+(C+A) = B+A = A and since A #- 0, B+C< A. That (2) ~ (3) follows from (i). (iii) It suffices to prove (3) +} (1). Let A = V, then B < A implies B is finite, hence (3) holds for V. Since 1 < V, if (1) held we would have 1 + V = V contradicting theorem IV. 4.4. (ii). This completes the proof.
°
DEFINITION IV. 4.8: A co-ordinal A is said to be a principal number for addition if A =1= and B < A ~ B+A = A. If A = 1, then A is called an improper principal number for addition and if A#-1 then A is called a proper principal number for addition. We write£'( +) for the collection of all principal numbers for addition.
THEOREM IV. 4 .9: Every proper principal number is a co-ordinal whose classical ordinal is a limit number. PROOF. Clearly, no finite co-ordinal is a proper principal number. Suppose A is a proper principal number for addition and I A I = A + m, where A is a limit number and m is finite. Then by theorem IV. 3 .6 there is a co-ordinal u; < A such that I Bo I = A. Hence I B o + A I = A.2+m> I A I and consequently Bo+A #- A. THEOREM IV.4.lO: If P e£'( +), then P.w e£'( +) and P < P.w. PROOF. Suppose P is a principal number for addition and A < P. w; then, by theorem 11.5.3, there is an n and a D such that A = P.n+D, where D < P. Hence A+P.w = A+(P+P.w) [by theorem 11.4.1. (vii)] = (A+P)+P.w = (P.n+D+P)+P.w = P.(n+l)+P.w [sinceP
is a principal number for addition and D < P] = P. co [by (n + I) applications of theorem II.4.I.(vii)]. Hence P.we£(+). Clearly, P < P.w since P "* O. THEOREM IV 04.11: (i) If P is a principal number for addition and A, B, C < P, then A < B--+ A+C s B+C. (ii) Similarly under the hypothesis that A, B, C
s
P.
PROOF. (i) By theorem IV.4.7.(ii), A < B--+ A+C < P&B+C < P. Therefore, by theorem II. 5 . 4, A + C and B + C are comparable. But, classically, ep < 'JI --+ ep + r ~ 'JI+ r, hence by lemma IV. 2.5, A + C s B + C. (ii) follows at once from (i) using P. co instead of P and theorem IVA.IO. Corollary IVA .12. If A, B s P and P is a principal number for addition, then B s A + B. Now we prove that any (non-zero) predecessor of a principal number for addition is uniquely expressible as a finite sum of non-increasing principal numbers for addition.')
THEOREM IV. 4. 13: If 0 < A < P e£ ( +), then there exist principal numbers for addition C l' . . . , Cn such that P > Cn 2: Cn _ 1 •.. 2: C 1 and A = Cn + ... +Cl • Further, if A = Cn + ... +C1 and A = D m + ... +D l are two decompositions such that P 2: Cn 2: Cn - 1 . . . 2: C 1 and P > D m 2: D m _ 1 . . . 2: D l and all the C, and D, are principal numbers for addition, then n = m and for all r S n, Cr = Dr' Conversely, if A is expressible as Cn + ... +C l where C; 2: Cn _ 1 2: ... C l and all the Cr are principal numbers for addition, then there is a principal number, namely,
c..«
2: A.
PROOF by transfinite induction with respect to the partial well-ordering S. We assume 0 < A < P e£( +) and take as induction hypothesis: If 0 < B < A, then B is uniquely expressible as a finite sum of principal numbers < P. If A is a principal number for addition, then there is nothing to prove. 1)
This theorem was conjectured by A. L. Tritter.
Now suppose A is not a principal number, then there exist B, C such that B+ C = A, where C "# 0, A (and hence B "# 0).
(4)
By corollary IV.4.l2, C < A. Let C 1 be the least C satisfying (4) (i.e. under the ordering by initial segments). We now show that C 1 is a principal number for addition. Suppose C 1 = D + E, then by corollary IV.4 .12, E < P and hence by theorem 11.5.4, C 1 and E are comparable. But 1E 1 ~I C 1 I, hence E s C 1 and by the minimality of C b E = C 1 • Thus C 1 is a principal number by theorem lV.4.7.(i). Now let B 1 be the least B such that B+C 1 = A. Then if B 1 = 0 we only have to prove uniqueness, and otherwise by the hypothesis of the induction, B 1 has a (unique) decomposition B = Cn + + ... +C z where P> C, ~ ... ~ C z and all the Cr(r = 2, ... , n) are principal numbers. Hence A = Cn+ ... +C 1 and, since C 1 < P, all the C, (r = 1, ... , n) are comparable. Suppose C z < C 1 , then by the definition of a principal number for addition, Cz + C 1 = C l' hence A = (C n+ ... +C Z)+C 1
= (C n+··· +C 3)+C 1 •
Now C z "# 0 -+ Cn+ ... +C 3 < Cn+ ... +C z = B 1 • But B 1 was chosen as the least B such that B+C 1 = A. We therefore cannot have C z < C 1 and must have Cz ~ C L: Thus A = C, + ... + C 1 is a decomposition of the required type. As regards uniqueness, letA = Cn+ ... + C 1 and A = Dm+ ... +D 1 be two decompositions of A as a sum of non-increasing principal numbers. By theorem 11.5.4, C, and D m are comparable. Suppose Cn > D m , then D m + C n = C, since C n is a principal number for addition. Therefore A = D m + C n + ... + C 1 and by substituting D m + C, for C, m times more we obtain A = Dm.(m+ 1)+A which implies Dm.(m+ 1) < A. Now if i ~ m, then D;+D m = Dm or Dm.2 according as Di < Dm or D i = Dm. Therefore Dm.(m+l)+A = A < A+Dm.m ~ Dm.2m and hence, by corollary JIL2.11, A ~ Dm.m. This contradicts Dm.(m+l) < A and we therefore cannot have D m < Cn" Similarly, Cn -{: D m and we conclude C; = Dm . Now by corollary 111.2.11 it follows that
Repeating this argument the minimum of m and n times and letting s be this minimum, we obtain C, r = Dm _, (r = 0, ... , s) and hence either Ct+ ... +C t = OorDt+ ... +D t = 0 where t = 1m - Ill. By theorem 11.2.3. (i i) it follows that t = 0 and hence that n = m and C, = D, for every r. Conversely, if A = Cn+ ... +C t , then as for Dm above, A+Cn.n :::;; :::;; Cn . 2n and hence by theorems IV.4 .10 and 11.4.1. (vii) it easily follows that A < Cn • w which is a principal number for addition. This completes the proof. -r
This theorem is not an immediate corollary of theorem 2, p. 280 in [14] for the following reasons: (i) it may be the case that I P I is a classical principal number while P is not a principal number, e.g. V, (ii) P may be a principal number but I P I may not be a classical principal number (see § VIII .1) and (iii) comparability conditions have to be established.
IV.5. By theorem IV .4. 1 above there are c co-ordinals corresponding to each limit number (and these co-ordinals are therefore incomparable with each other) but there are some limit number co-ordinals which have no predecessors of some smaller ordinal. More formally: Let A, B, C range over co-ordinals, over classical ordinals, then
e
(3A) (3B) (3 e)
(/
A
& (V C)
I = r & IB I = A & A < B & r < e < A (I C 1"1= e v C 1: B v A 1: C».
This is shown by example IV. 5.1 below. If, however, we restrict ourselves to recursive co-ordinals then this situation does not arise. We hope to present the results for recursive co-ordinals in [4]. Example IV. 5 .J. P is as given in § IV.4. Let T be the well-ordering of type w. 2 defined by <x, y) e T +--> x e p & yep v xc y e p &x:::;; y v x, yep & x :::;; y.
Let T = CRT(T). Suppose T = V + V' where I V I = I V' I = w, then by the Separation Lemma (II. 5 .1) there exist relations U, U' such that U )( U' and T = U + U'. Hence C'U and C'U' are contained in disjoint
CONSTRUCTIVE ORDER TYPES, r.e. sets (x, p. But this implies trary to the choice of p.
(X
223
I
= p & P = P and that p is recursive, con-
e
We observe that if the condition on above is satisfied for some successor number l ' then by theorem IV. 3 . 6 it is satisfied for some limit number 2 • On the other hand we do have c co-ordinals which have predecessors representing all ordinals less than that of the given co-ordinal. This is the content of the representation theorem below. We shall use the following classical theorems in proving the representation theorem.
e
e
THEOREM IV.5.2: ([14], p. 379, theorem 1.) Every denumerable ordinal which is a limit number is the limit ofa strictly increasing sequence, of type ill, of ordinals less than the given number. i
THEOREM IV. 5.3: ([14], p. 264, corol/ary 3.) If A and B are isotonic weI/orderings then there is an isotonism f such that every isotonism between A and B is an extension off. THEOREM IV. 5.4: (REPRESENTATION THEOREM.) Let F, ..1 range over (denumerable) classical ordinals, C, D over co-ordinals, then
('IT) (3C)
(I C 1=
r &('1..1) (..1
->
(E!D)
(I
D
1=
L1 & D < C))).
PROOF BY TRANSFINITE INDUCTION. The assertion is trivial if T = O. We assume the assertion holds for all ordinals less than T, If r = e + 1, then by the hypothesis of the induction there is a co-ordinal T such that
I T I = e & ('1..1) (..1 < e
->
(ElD)(1 D
I = ..1 & D <
T)).
Then by theorem IV.3.6, T < T+ I and D < T+ 1 -> D s T. Let C = T + 1, then by corollary IV. 2.7, it easily follows that C has the required properties. If r is a limit number, then by theorem IV. 5.2, F is the limit of a strictly increasing sequence {4>;} j < w of ordinals. We may assume 4>0 #- O. Put II 0 = 4>0' Il, + 1 = 4>j + 1- 4>j (by [14], p. 275, Il, is well-defined). Then
By the hypothesis of the induction, for each i there is a P, such that
I Pi I = IIi & ('v'A)(A < IIi
(E! D)(I D I = A & D < Pi»'
-+
(5)
Using the axiom of choice, choose a fixed Pi in Pi (such that 0 a CP i for each i 1 Now define
».
C
= {<j(p, m),j(q, n»: p e CPm & q a CP n & m
< n
.v. m = n &
and C = CRT(C). Clearly,
L
IC I = Now suppose A
<
i
r, then for
IIi =
r.
some n,
A <
n
L IIi i; 0
and we may assume that n is minimal. Therefore A
where e and T <
n- 1
= L u.s o i; 0
< II n- From (5) it follows that there is a T such that I T I =
r; Let D =
n - 1
e
n
L Pi + T, then I D I = A and D::;; L0 i; 0 i;
Pi'
Since ::;; is a tree-ordering, in order to complete the proof it suffices to prove that, for all n, n L r;« C. i; 0
Let Pen) = C[{x: lex) ::;; n}, then it is easily verified that
Pen) a
n
L Pi' i; 0
Further, let pen) = C[{x: lex) > n}. Then p(n»( pen) since if x a CP(n) U cp(n) (= CC), then xaCP(n)+-->l(x)
s:
U
n .&. xaCp(n)+-->l(x) > n.
Hence, if p is the partial recursive function sg (l(x)-=- n), then p satisfies 1) We shall use this auxiliary condition in the proof of corollary IV. 5.5.
225
CONSTRUCTIVE ORDER TYPES, I
the conditions in theorem II .1. 5. Hence p(n) + p
i
L= 0 Pi <
C
and the proof is complete. Corollary IV.5 .5. There are ceo-ordinals C A for each ordinal T such that I C A I = rand
('ILl) (Ll < PROOF.
r -+ (E! D)( I D I =
Ll & D < C A ) ) .
~
w
(6)
Case 1. If F is a limit number.
Let VA be a co-ordinal such that I v~ I = W & VA i= W, by corollary IV.4.2 there are c such co-ordinals. Let YA e VA and suppose YA = {
C' = {G(p,
» : P e eP
vm ) , j(q, vn
m
& q s eP n & m <
. v. m = n &
/I
(7)
and C' = CRT(C'). As before, I C'I = rand (6) holds with C' replacing CA' Clearly, C '" C' under the map f: x -+ j(k(x), v/(x» and hence, by theorem IV. 5.3, every isotonism between C and C' is an extension off. Therefore, if C = C', then g : C ~ C' for some partial recursive extension of f. In particular, gj(O, m) = j(O, vm ) for every m and hence the map m -+ Vm is partial recursive. This contradicts our choice of VA' and we conclude C i= C'. Similarly, if C" is obtained from Vn then C i= C" and C' i= C" since the former implies that the map m -+ U m is partial recursive and the latter that the map Vm -+ u.; is partial recursive, where Yn = {
Suppose T = e + n where e is a limit number. Then by case 1, there exist c co-ordinals LA such that I LA I = e and (6) holds with LA re-
placing CA' Let C A = LA follows for this case also.
+ n, then
by theorem IV. 3 . 6 the conclusion
We observe that the limit of a sequence of recursive co-ordinals (coordinals containing a recursive well-ordering) is not uniquely defined either, since by theorem 1. 4. 6 the generic counterexample is a recursive co-ordinal and using this Vas the VA of the corollary proof we obtain a C' "# C. In the case of recursive co-ordinals, however, there are only ~o distinct "limits".') V. Bounds
V.I. Since the ordering by initial segments is a partial order on f2 and on l(/ we can define upper and lower bounds in the usual way. We use the techniques developed in the previous section in order to show that there are no non-trivial upper bounds for collections of quords or co-ordinals. DEFINITION V .1.1: A quord B is said to be a lower (upper) bound for a collection of quords, d, if Qed implies B s Q (B ~ Q).
V. I .2: A quord B is said to be a greatest lower bound (least upper bound) for a collection of quords, d, if B is a lower bound (upper bound) for d and every lower bound (upper bound) for d is ~ B (is ~ B). DEFINITION
By the anti-symmetry of ~, least upper bounds and greatest lower bounds are unique if they exist at all. (The proof of the following lemma is based on the idea in the proof of theorem 4lb in [8].) LEMMA
decessors.
V. I .3: A quord has at most denumerably infinitely many pre-
Let A be a quord and A s A. For fixed A we show that every r.e. set determines at most one predecessor of A and that every predecessor of A determines at least one r.e. set. The lemma follows at once from these results. Let 13 be a r.e. set, then 13 determines at most one predecessor of A as follows: Let B = A [13, then B = CRT(B) ~ A only if there is a r.e. set PROOF.
1) Since there are only ~o recursive co-ordinals.
Y such that B)( A [y and B+A [y = A (using the Separation Lemma II. 5 .1). On the other hand, if B ~ A, then there is a B such that B e Band B is contained in some r.e. set p separating B from A [ (C'A - C'B). This completes the proof of the lemma. LEMMA V.I. 4: A denumerable collection of quords has an upper bound if, and only if, every two members of the collection are comparable. PROOF. Let d = {Ai: i e J'} be a collection of quords. If there is an upper bound, U, for d, then A; ~ U for all i and by theorem II. 5.4, for all i, j, either A; ~ A j or A j ~ Ai' Conversely, suppose A; and A j are comparable for all pairs i, j. We
may assume that there is no maximum A; since the assertion is trivial in that case. We now set Bo = A" B; + 1 = A /(;), where r = Jls{ As =F O}, t(i) = Jls{A s > B i } · Clearly, i < j --+ B, < B j • Hence the C i , defined by Co = B o, C i + 1 B; + 1 -B;, are all non-zero. For each i, let C; be a fixed representative of Ci such that 0 e C'C;. Now let
U
=
{(j(e, m),j(d, n»: e s C'C m & d e C'Cn & m < n . v. m = n & (c, d) e Cm}.
Further, let U = CRT(U), Yn = {x: lex) ~ n}, Urn) = U [Y(n) and urn) = U [~n' We shall prove:
~n
=
{x: lex) > n};
1) urn) =F 0, 2) Urn) )( urn),
3) for all n, Urn) s e; 1) urn) =F 0 since m > n --+ (j(0, m), j(O, m» e urn) by construction and the choice of the C; 2) For each n, x s C'U implies x e C'U(n) ~ sg (l(x) -=-n) = 0 & x e C'u(n) ~ sg (l(x) -=-n) = 1. Hence by theorem II .1.5, 2) holds. 3) U(o) = (Co; 0) ~ Co s Co = B o. Now we assume Urn) e B; and prove Urn + 1) e B n + i -
228
JOHN N. CROSSLEY
By construction, (C n+ 1 ; n+1) <:; Urn»)( Urn)' Hence Urn)+ ( C n+ 1 ; n+1) = T, say, is well-defined and T s Bn+Cn = B n + r- But T = Urn +1) by construction of U and 1'n + r- Hence, for all n, Urn) e Bn" This proves 3)1). Using 1) it follows at once that B; < V for all n and it only remains to prove that V is a quord. Suppose U(Xi' ninr'= 0 is an infinite recursive decending chain in U. Then by the definition of U, {ni: i <; J} has a maximum; let this be n. Then the given chain is also an infinite recursive descending chain in U(n) which is impossible, since U(n) s B; and B; is a quord. Thus the proof is complete.
v.2. An analysis of the proof of corollary IV. 5. 5 shows that there exist c incomparable limits to certain increasing sequences of co-ordinals. We now prove the stronger result that any increasing sequence of coordinals without a maximum has c incomparable co-ordinals whose classical ordinals are all equal to the limit of the classical ordinals of the given sequence. This will be a corollary of the next theorem. THEOREM V. 2. l: A collection of quords has a least upper bound if, and only if, it has a maximum.
The "if" part is trivial. Now suppose that sf is a collection of quords without a maximum, but with a least upper bound L. By lemma V. 1.3, L has at most denumerably infinitely many predecessors, hence sf is at most denumerably infinite. And sf is not finite since sf has no maximum. In order to prove the theorem we construct two incomparable upper bounds V and U' which are, in a certain sense, minimal upper bounds and obtain a contradiction. Since sf is denumerable we construct U and V exactly as in the proof of lemma V. 1.4. Let V be a well-ordering in the generic counterexample, say, V = {
1) We are here using the fact that, for any finite set of one-one partial recursive functions with mutually disjoint domains and ranges, there is a one-one partial recursive function which agrees with each member of the given set on its respective domain.
V.l.4 easily yields that V' is also an upper bound for d (see especially footnote.') p. 224), the details of this verification we leave to the reader. We now prove A < V
-+
(3n) (A < En)'
(I)
Suppose A < V, then there exist relations A, D, such that A)( D, A + D = U and D :1= 0 (by the Separation Lemma 11.5. 1 and corollary 111.2.8). Hence there is a number j(x, m) e CD for some m. Clearly, CA ~ {x : l(x) ~ m}, hence, using the same notation as in the proof of lemma V.l.4,
U = A+D [{x: l(x)
s
m}+um.
Taking C.R.T.s we obtain A s Em and if n = m+ 1, then A < En since c, :1= 0. This proves (1). Similarly one proves (I) with U' replacing V. It follows at once that L ~ V and L ~ U', Thus in order to complete the proof we only need to establish V:I= V But V = U' implies there is a recursive isotonism g such that g : U ~ U'. But U' = l(U) and hence U = g[(U). Hence by theorem III .1. 6, gf = 1 on CU. But this implies the map h : j(O, n) -+ j(O, vn) defined only on numbers of the formj(O, n) has a partial recursive inverse, namely g, which implies that h is partial recursive contrary to our choice of V. Thus the proof is complete. f
•
Corollary V. 2 .2. A collection of co-ordinals has a least upper bound if, and only if, it has a maximum. Corollary V.2. 3. Let d be a collection of co-ordinals with no maximum, but such that all its members are comparable. Further, let lim
A.JII
IA I =
A,
then there exist c incomparable upper bounds V4> such that PROOF.
I V4> I =
A.
Clearly,
IV I=
lim
A.JII
IA I
by the construction of V. We leave the reader to verify that c upper bounds
U~ can be constructed from the c incomparable co-ordinals corollary IV. 4.2 (cf. proof of corollary IV. 5.5).
V~
given by
V.3. Since 0 ~ A for every quord A, every collection of quords has a lower bound. There exist collections of quords with a greatest lower bound but no minimum as the following example shows. Example V. 3 . 1. Let U be a co-ordinal such that I U I = wand there is a U e U such that C'U is immune. Then, clearly U =F W. U - n is well defined for all n and if m =F n, then U- m =F U- n since otherwise U = r+ U for some r. We shall show later (Lemma VIII.l.4) that this last equation implies U =F W. However, U'
PROOF.
Immediate from theorem 11.5.4.
The converse of this theorem is false. For example, let ir be the collection of all co-ordinals of classical ordinal ro, then every finite co-ordinal is a lower bound for ir but there is no greatest lower bound for the ordinal of any greatest lower bound would be w. VI. Multiplication VI. 1. From now on we shall be principally concerned with co-ordinals and unless otherwise stated all C.R.T.s mentioned will be assumed to be co-ordinals. We give a natural definition of multiplication of C.O.T.s in this section and show that most of the [analogues of the] basic classical laws hold for co-ordinals. There is one striking breakdown, namely in the case of the law (1) A < B -+ AC s BC
which we shall show fails for some co-ordinals. If, however, A, B, C are all predecessors of a principal number for multiplication, then (1) does hold.
Notation. A.B = {⟨j(a, b), j(a', b')⟩ : a, a' ∈ C'A & b, b' ∈ C'B & (⟨b, b'⟩ ∈ B & b ≠ b' .v. b = b' & ⟨a, a'⟩ ∈ A)}.
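A purely illustrative sketch of this product on finite relations follows, again with Cantor pairing standing in for the paper's j; all names are hypothetical.

```python
# Sketch of the product A.B just defined ("B copies of A"), with Cantor pairing
# as a stand-in for j.

def j(a, b):
    return (a + b) * (a + b + 1) // 2 + b

def field(rel):
    return {x for pair in rel for x in pair}

def product(A, B):
    # <j(a,b), j(a',b')> is in A.B iff b strictly precedes b' in B, or b = b'
    # and a precedes a' in A.
    CA, CB = field(A), field(B)
    return {(j(a, b), j(a2, b2))
            for a in CA for a2 in CA for b in CB for b2 in CB
            if ((b, b2) in B and b != b2) or (b == b2 and (a, a2) in A)}

two   = {(0, 0), (0, 1), (1, 1)}                                  # type 2
three = {(x, y) for x in range(3) for y in range(3) if x <= y}    # type 3
assert len(field(product(two, three))) == 6   # 2.3 has a 6-element field
```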
THEOREM VI. 1. 1: If A, B are reflexive relations (linear orderings, quasi-well-orderings, well-orderings) then A. B is a reflexive relation (linear ordering, quasi-well-ordering, well-ordering). PROOF. All except the case of quasi-well-orderings follow at once from the classical definition of multiplication of relations (cf. [14J, p.229). Suppose A and Bare quasi-well-orderings and that A. B is not. Then there is an infinite recursive descending chain in A. B. Since every element ofthe field of A. B is of the formj(a, b), this chain must be ofthe form {jean> bnn:,= 0 where an e C' A and b; s COB. Let IX = {an} and f3 = {b n } , then there are four cases to consider: and f3 are both finite, infinite and f3 is finite, (iii) IX is finite and f3 is infinite, (iv) IX and f3 are both infinite. (i) (i i)
IX
IX is
(i) is impossible since then {jean> bn)} would only contain a finite number of elements. (ii) Since f3 is finite, there is at least one number be f3 for which {jean, b): an e IX} is infinite. Let the distinct an in this set be a(n;) (i = 0,1, ... ) where i < j ~ n, < nj • Then {j(a(n), b)}?= ° is an infinite recursive descending chain in A. B since
a(no) ain,
+
= a o,
1) = a(Jls{r < s
--+
a, # as})'
It follows at once that {a(ni)}~= 0 is an infinite recursive descending
chain in A which is a contradiction. (iii) This case is dealt with a manner very similar to (ii). We omit the details.
(iv) Let {b(n)};x;"o be the set of distinct b., where i <} ~ n, < nj' Then every ben) occurs in {b n };:'= 0 at most finitely many times for the following two reasons: 1. If b, = ben) for some fixed i and all} greater than some }o, then there are only finitely many distinct bn> namely, those occurring in b o, ... , b(jo)· 2. If i <} and
This is impossible since B is a quasi-well-ordering. This completes the proof of the theorem. THEOREM VI. 1.2: If Al ~ A2 and B1 ~ B2 , then Ai' B1 ~ A2 · B2 • PROOF. Suppose p : A 1 ~ A2 and q : B1 ~ B2 , then r: Ai' Bi ~ A2 . B2 where rex) = j(pk(x), ql(x». THEOREM VI. 1.3: (A 1.A2).A3
~
A1.(A2.A 3 ) .
PROOF. <x, y) s (Ai' A2 ) . A3 ~ X = j(j(a 1, a 2), a3) & y =
j(j(a~,
a;), a;)
& ai' a; e C' Ai (i = 1, 2, 3)
.&:
x = j(a l,j(a2, a 3» & y = j(a~,j(a;, a;» & a.; e C' Ai (i = 1,2, 3)
a;
.&:
I
(2)
I
(3)
Since j is one-one, conditions (2) and (3) are equivalent and the proof is completed by using the recursive isotonism x
-+
DEFINITION VI. 1. 4: A. B
j(kk(x), j(kl(x), lex))).
=
CRT(A. B) where A s A and Be B.
Theorem VI. 1. 1 guarantees the uniqueness of this definition. (We often write "AB" for "A .B".) Corollary VI.I.5. Multiplication is associative, i.e. (A.B).C A.(B.C).
=
By virtue of this corollary we may omit brackets in a product of several C.O.T.s. THEOREM VI. 1.6: A. B = 0 +-+ A = 0 v B = O. PROOF. j(a, /3)
=0
+-+ a
=0
v f3 =
0.
THEOREM VI. 1. 7: A(B + C) = AB + A C. PROOF. It is sufficient to establish that separability conditions are satisfied, since the proof that the order type is the same on both sides of the equation is proved exactly as in the classical case. Let A e A, B e Band C e C, then x s C'A. B +-+ k(x) s C'A & lex) 8 C'B
and
x e C'A.C
+-+ k(x)
e C'A & lex) e c-c.
Hence by theorem II .1.4.(i) A.B)( A.C
+-+ A =
0
v B)( C.
THEOREM VI.I.8: (i) For all n, A.In = A.n, (ii) A. W
= A.
PROOF. (i) If n = 0, then A.In = 0 = A.n. If n = 1, then A.I = A (by definition § 11.4) and A.I1 = CRT {(j(x, O),j(y, 0) : x, y e C'A & 0
= 0 & (x, y) s A}
where CRT(A) = A. If n > 1, then A.In = A.{Il.n) = (A.I1).n = A.n by the first part of the proof and corollary VI. 1.5. Hence, for all n, A . In = A. n. (ii) A. W = A.{Il'OJ) = (A.I1).OJ = A.OJ.
VI. 2. By analogy to principal numbers for addition, we now introduce principal numbers for multiplication (v. [1], p. 66). DEFINITION VI. 2. 1: A co-ordinal A is said to be a principal number for multiplication if A i= 0, 1 and
°<
B < A
-+
BA
= A.
If A = 2, then A is called an improper principal number for multiplication, and if A i= 2, then A is called a proper principal number for multiplication. We write £(.) for the collection of all principal numbers for multiplication. THEOREM VI. 2.2: Every proper principal number for multiplication is a co-ordinal whose classical ordinal is a limit number. PROOF. Left to the reader (cf. theorem IV.4.9). As in the classical case, B < A -+ BA = A is a stronger condition than BC = A -+ B = A v C = A. But also, the former condition is stronger than B, C < A -+ BC < A for co-ordinals. For the generic counterexample V satisfies this last condition but is not a principal number for multiplication since 2. V i= Vas we shall show later (lemma VIII.2.4); in fact we show that if n.A = A for any n, then A = W. It follows at once that W is a principal number for multiplication. (Alternatively, that W is a principal number for multiplication follows immediately from theorem II.4.I.(viii).) We now establish analogues of classical laws for multiplication (of co-ordinals) and show that these all go through if the co-ordinals concerned are all predecessors of the same principal number for multiplication. THEOREM VI. 2.3: (i) If B i= 0, then A ~ AB, (it) If B > 1, then A < AB whenever A i= 0. PROOF. We prove only (it) leaving (i) to the reader. (it) B > 1 -+ (E!C) (B = I+C & C i= 0). Hence AB = A (I+C) = A+AC where AC i= 0 if A i= O. Thus A < AB. THEOREM VI.2.4: If A i= 0 and A, B, C are co-ordinals, then AB = AC -+ B = C.
CONSTRUCTIVE ORDER TYPES, I
235
PROOF. Let A e A, Be Band C e C and suppose p : AB ~ AC. Then AB ,.., AC and since AB and AC are well-orderings, it follows that p is an extension of the unique minimal isotonism, Pe, between AB and AC (theorem IV. 5.3). Now, classically, :F 0& = eJ -+ T = J. Therefore there is an isotonism qe (not necessarily partial recursive) such that qe : B ,.., C. Now the map Te :j(a, b) -+ j(a, qe(b» defined only on C'AB is an isotonism between AB and AC. Hence by theorem IV. 5 . 3 p is an extension of r.: Since A :F 0, there is an element, say a o, in C'A. Let p' be the map p with domain and range restricted to {j(ao, n) : n e J}, then p' is partial recursive. Further, if p'(j(ao, x) is defined then its value is j(ao, y) for some y. Now let q' be the map x -+ l(p'(j(ao, x»), then clearly q' is partial recursive and q' agrees with qe on C'B (again by theorem IV. 5.3). q' is one-one, since
e
= q'(y)
q'(x)
er
-+ l(p'(j(ao, x») -+ p'(j(ao,
x)
= l(p' (j(a o, y»)
= j(ao, c) & p'(j(ao, y» = j(ao, c)
(since pp' £ {j(ao, n) : n s J} by construction) -+ j(ao, x) = j(ao, y) -+
x
= y.
(since p is one-one)
Thus q' is partial recursive, agrees with qe on C'B and is one-one and order-preserving, i.e. q' : B ~ C, from which the theorem follows. LEMMA
VI. 2.5: If M is a principal number for multiplication, and
~C:F~~fflOC<M-B<M&C<M
Suppose BC < M, then B < M or C = I by theorem VI.2.3. (ii). In the former case BM = M and in the latter trivially, C < M. Now BC < M -+ BCM = M, and therefore, by theorem VI. 2.4, CM = M. Using theorem VI.2.3.(ii) it follows that C < M. Conversely, C < M -+ CM = M and B < M -+ BM = M. Hence (BC)M = B(CM) = BM = M and by theorem VI.2.3.(ii), BC < M. PROOF.
VI. 3
VI.3.1: (i) If A :F 0, then B < C C -+ AB s AC.
THEOREM
(ii) B
s
-+
AB < AC,
236
JOHN N. CROSSLEY
PROOF. (i) B < C -+ (E tD) (B + D = C & D "# 0). By theorem VI. I. 7, AC= A(B+D) = AB+AD. AD"#O by theorem VI. I. 6, hence AB < Ae. (ii) follows at once from (i). THEOREM VI. 3 .2: There exist co-ordinals A, B, C ("# 0) such that A < B but AC $ Be. PROOF. Let A = 1, B = Vand C = W, then AC = Wand BC = Vw. By theorem VI. 2 .3 . (i), V:s; Vw. Hence if W:s; VW, Wand V are comparable by theorem II. 5 .4 which contradicts corollary IV. 4.5. THEOREM VI. 3.3: If there is a principal number for multiplication such that B, C < M (or equivalently BC < M) then A < B -+ AC :s; Be. PROOF. If B or C = 0 there is nothing to prove. Similarly if A = O. Otherwise, by lemma VI. 2.5, A C < M and BC < M. Hence, by theorem II. 5.4, AC and BC are comparable. Now, classically, 4'> < lJI -+ 4'>r ..s lJIr, hence AC :s; BC. THEOREM VI. 3.4: If A, B, C are co-ordinals, then A C < BC
-+
A < B.
PROOF. If C = 0, then the assertion is trivial. If C "# 0, then by theorem VI. 2.3. (i), A :s; A C and B :s; Be. Hence by the transitivity of :s; and theorem II. 5 . 4, A and B are comparable. By the classical theorem 4'>r < lJIr -+ 4'> < lJI, we have I A I < I B I and hence A < B. THEOREM VI. 3.5: There exist co-ordinals A, B, C such thatA C :s; BC but A :$ B. PROOF. (As in the classical case.) Let A
=
2, B
=
1, C
=
THEOREM VI . 3 .6: If B, C are comparable, then AB < A C
W.
-+
B <
e.
PROOF. Immediate from theorem VI. 3 . 1. (i). THEOREM VI. 3.7: If there is a principal number for multiplication, M such that (AB <) AC < M, then AB < AC -+ B < e. PROOF. By lemma VI.2.5, AB < AC < M -+ B < M & C < M. Hence by theorem II. 5.4, Band C are comparable. The theorem now follows at once from theorem VI. 3.6. We leave the question of prime numbers and unique factorization of certain co-ordinals to a later paper [4].
CONSTRUCTIVE ORDER TYPES, I
237
VII. Exponentiation VII.t. In this section we define exponentiation but restrict our attention to co-ordinals, since although the collection of all C.O.T.s is closed under exponentiation, the collection of quords is not. This result is implicit in [12]. Since some properties of exponentiation depend on multiplicative properties (e.g. (ABf = ABC) it is to be expected that we should not be able to prove analogues of all the classical laws concerning monotonicity. However, if we consider predecessors of principal numbers for exponentiation we get results analogous to those in the preceding section for multiplication. Since we have defined C.O.T.s in terms of classes of sets of ordered pairs of natural numbers and since a classical method of defining exponentiation depends on consideration of finite (descending) sequences in a given ordering, we now define a (primitive) recursive function e which assigns a natural number to each finite sequence of elements in a representative of a C.O.T. which is indexed by a sequence in a representative of another C.O.T. DEFINITION
VII. 1. 1: A symbol of the form
b O •.• b n) ( a o '" an
where n > -1 and the a.; b, (i = 0, ... , n) are natural numbers is said to be a bracket symbol. If n = -1, the symbol is simply 0 which we call the empty bracket symbol and denote by O. We use upper case bold face letters (A, B, C, etc.) for bracket symbols. DEFINITION
VII. 1.2: e(O)
if n ?: 0 and A
= 0; =
(b o '" b
then e(A) =
n
ao '"
n
)
an
p{(ai,b i)
i= 0
where Pi denotes the i-th prime (Po
=
2).
+
1
JOHN N. CROSSLEY
238
THEOREM VII. 1.3: e is a one-one primitive recursive function from the set of all bracket symbols into J. Further, pe is recursive. PROOF. Left to the reader.
VII.2. We define exponentiation of C.O.T.s in this sub-section using the function e. DEFINITION VII. 2.1: If A is a linear ordering and a e C'A, then a is said to be the minimum element of A if b s C'A --+ eA. Clearly, if a linear ordering has a minimum element then it is unique. Notation. We write a = min(A) if a is the minimum element in the ordering A. If {bJ7 = 0 is a sequence of elements in C'B such that
then we write
o ::;;
i
< n
--+
B & b, + 1 i= bi'
--+ b2 --+ •.. --+
bn>s B.
DEFINITION VII. 2 .2: If A, B are linear orderings then E (A, B) is the set of all bracket symbols K such that K
=
bo ... bn) ( ao ... an where ("1m) (m < n --+ am e C'A & am i= min(A))
>
and
=
{<e(K), e(K'): K
0 and B i= 0, then AB = 0; otherwise
= (b o .. , bm ) ao ... am
&K,K'eE(A, B) .&:K
& K'
ao
=o.V.
K i= 0 & [em ::;; n & ("Ir) (r ::;; m --+ a, = a; &
v (3r) ("Is) {(s < r
--+ as
=
a~
& «»; b;) e B & br i= b;.v.b r
b:)
= (b?
an
b,
=
b;n
& b, = b~)
= b; &
Since E(A, B) with the ordering induced by definition VII. 2.3 is equivalent to the classical definition of A raised to the power B (cf. [14J, p. 306 et seq.) and A B is a linear ordering of a subset of J, A B is a relation in our sense.
CONSTRUCTIVE ORDER TYPES, I THEOREM VII.2.4: If Ai ~ Bi (i
=
239
1,2), then A~l ~ A~2.
PROOF. The only non-trivial case is where Ai (or equivalently, A 2 ) is non-empty. Suppose Ai #- 0 and p: Ai ~ A 2 and q: B1 ~ B2 • Let r be the map defined only on pe by
reO) = 0, r(n) = e
q(bo) ... q(b m) ) ( p(ao) ... p(a m)
if n 13 pe and n
= e ( bo ... bm ) . ao ... am
r is partial recursive, since pe is recursive, and is one-one and onto since p, q are one-one and onto and e is one-one. The order-preserving property
follows from the classical case. DEFINITIONVII.2.5:AB = CRT(AB) where AsA and BsB'.J.We sometimes write "A exp B" for "A H" and "A exp B" for "A B" . By theorem VII. 2.4, A B is uniquely defined. THEOREM VII.2.6: (i) If A, B are co-ordinals, then A B is a co-ordinal. (ii) If A, Bare e.O.T.s, then A B is a e.O.T. (iii) (Parikh [12]) IfB is a well-ordering and A is a quasi-well-ordering, then A B is a quasi-well-ordering. (iv) (Parikh [12]) There is a quasi-well-ordering A such that T A is not a quasi-well-ordering where T 13 2. (v) If A B is a quord, and A #- 0, 1 and B #- then A and Bare quords.
°
PROOF OF (v). Suppose A 13 A and B I: B but A is not a quasi-wellordering. Then there is an infinite recursive descending chain, {aJ7'= 0, in A. Since B #- there is an element b in C' B and hence
°
is an infinite recursive descending chain in A B which is impossible. Suppose then that B is not a quasi-well-ordering, then there is an infinite recursive descending chain {bJ7'= 0 in B. Now since A #- 0, 1 there is an element a #- min(A) in C' A. Therefore
JOHN N. CROSSLEY
240
fe(b i ) }
t
00
i~O
a
is an infinite recursive descending chain in A B. This too is impossible and (v) is established. VII. 3. THEOREM VII.3.l: A exp (B+C) = AB.A C • PROOF. Let A GA, B GBand C GC where B)( C. Then B+ C is well-defined and by theorem II. 1. 4. (i) there exist r.e. disjoint sets /3, y such that C'B £ /3 and C'C £ y. C'(A exp (B + C)) = {e(K): KG E (A, B+ C)} and
KG E(A, B+ C)
+-+
K =
(eo
er ) & ('Vi) (a, i= min(A))
ao
a;
& <eo -+ e l -+ ... -+ er ) GB+C. This last clause is equivalent to
<eo -+ el
v
-+ .. , -+
<eo -+ el -+
... -+
v (3 s) (s < r & C'(A B and Kl
<e
.
e
<eo -+
r)
B G C G
... -+
e
es )
G
C
& s + 1 -+ .,. -+ r ) G B). AC) = {j(e(K l ) , e(Kz)) : K l GE(A, B) & K z GE(A, C)}
G E(A,
B) +-+
es + 1 Kl = ( as + 1 and
er )
& <es + 1
-+
es + z
•.. •..
-+ ... -+
er )
er) & ('Vi) (ai i= min(A)) ar G
B
K z G E(A, C) +-+
Kz =
eo ... es) & ('Vi) (a l i= min(A)) ( a o ... as
CONSTRUCTIVE ORDER TYPES,
241
I
Recalling that if r = s, then K 1 = 0 and e(K 1 ) = 0, and if s = -1, K 2 = 0 and e(K2 ) = 0, it follows that the map p defined by p(x) = l(x)* k(x) is order-preserving between A B • A C and A exp (B + C). Now let
.
.
15 = }(e(K 1 ) , e(K2 ) ) : K 1 =
t
(b o ... b
m)
ao
&K = 2
am
(co
a~
Cn)
a~
& (Vi) (ai' a; =f. min(A) &
b,
B
f3 & c, B Y)}
and let q be the map p with domain restricted to 15. Then q is partial recursive since 15 is r.e. Further q is one-one. For suppose q(x) = q(y) = z, say. Then z = pj(e(K 1 ) , e(K 2 )) = e(K) for some bracket symbols K 1 , K 2 , K. But K
=
(eo
ao
e
r)
a;
where a, =f. min (A) and e, B f3 U y, and there is precisely one number s such that -1 ~ i ~ s -+ e, B f3 & s < i ~ r -+ e, B y by the definition of 15. Therefore K 1 and K 2 are uniquely determined by K and our assertion is proved. This we have proved that q is a one-one, partial recursive, order-preserving map between A B • A C and A exp (B + C), i.e. is a recursive isotonism. The theorem follows at once from this. Notation. A O = 1; A n+ 1 = An.A. Corollary VII. 3 .2. If A is a co-ordinal, then AI" = An.
PROOF. If A = 0, the assertion is trivial. If A =f. 0, the reader will easily verify that N
242
JOHN N. CROSSLEY
in all other cases for A, B, C = 0 or 1, both sides are 1. We therefore assume A, B, C i= 0, 1. Let A e A, B e B, C e C, then
Cn) eE(A B, C) qn
(ABf = {<e(D), e(E): D = (co qo &E =
(C~
C~') s E( A B, C) &
qo
qn'
(Vr) (qr = e(Qr) & q; = e(Q;) & Q" Q; e E(A, B)
0 . v. D i= 0 & [en s n' & (Vr) (r s n --+ c, = c; & qr = q;))
:&: D
=
v (3r) ("Is) {(s < r
&
--+ Cs
« c., c;) s C & c, i=
= c; & qs = q;)
c; . v .
c, = c; & e AB +-+ (r,
s
t; & ("Is) (s S
v (3u) ("Iv) {(v < u
t, --+
--+
br• = b;s & a., = a;s))
b.;
= b;v &
a.;
= a;v)
& [«bru' b;u> e B & »; i= b;u) v (b ru = b;u & <aru' a;u> s A)]}.
Now
ABC = {<e(D), e(E): D = (j(b o, co) .. , j(b m Cn)) s E(A, BC) ao an & E
= (j(b~: c~) ao
: &: D
j(b~,: C~,)) e E(A, BC) an'
= 0 . v. D i= 0 & [en s n'
(Vr) (r
s
n
--+ j(b"
&
c.) = j(b;, c;) & a, = a;))
v (3r) ("Is) {(s < r --+ j(b., cs) = j(b~, c~) & as = a~) & «j(b" cr),j(b;, c;» s BC &j(b" c.) i= j(b;, c;) . v. j(b" c.) = j(b;, c;) & e
An]}.
CONSTRUCTIVE ORDER TYPES, I
But
(j(b r , c.), j(b;, c;» s BC +-+ (c r , c;) & C & c, #- c; . V.
and
c, = c; & (b" b;) & B
Now let p be the partial recursive function defined only on
{e(X) :X= (idoo
s. =
in) & (Vi) (d s pe)}, i
d;
(where we recall that pe is recursive) by (p(O) = 0 and) Co
.•.
Cn
))
p ( e ( e(Qo) ... e(Qn) = e (j(b oo, co)
aoo
where
j(b omo, cO)j(b 1 •0 , c1 ) a omo
•••
a1 , o , "
j(b 1,m" c1 )
•••
a1,m,
... j(bno, cn) ... j(bnmn, Cn») ... ano . .. a nmn
Using the definitions of (ABf and ABC given above the reader will readily verify that p is order-preserving, one-one and onto, from which it follows that p: (AB)c ~ ABC. Taking C.R.T.s completes the proof. As in the classical case, we do not have in general, ACBc = (ABf.
VII. 4. We now introduce principal numbers for exponentiation and show that predecessors of principal numbers for exponentiation satisfy [the analogues of] the classical laws for exponentiation. DEFINITION VII .4.1: A co-ordinal A > 1, is said to be a principal number for exponentiation if
1~ B < A We write £ nentiation.
--+
BA
=
A.
(exp) for the collection of all principal numbers for expo-
244
JOHN N. CROSSLEY
THEOREM VII. 4.2: All principal numbers for exponentiation are infinite co-ordinals whose classical ordinals are limit numbers. PROOF. Left to the reader (cf. theorem IV. 4.9). The condition in definition VII. 4. 1 is stronger than the condition: 1 ~ B, C --+ Be < A. This will be shown later in a manner analogous to that referred to in § VI. 2 by proving that if 2A = A, then W divides A. THEOREM VII. 4 . 3: W is a principal number for exponentiation. PROOF. It suffices to prove that, if N I; In> then W ~ N W . Let N = {(x, y): 0 ~ x ~ y < n}, then clearly N I; In. If S I; of, then s is expressible in the form
where for all i, 0 defined by
~
a, < n. Let f be the (partial) recursive function
°)
f(s) = e r r - 1 ... ( a r a r - 1 '" a o where columns with bottom entry
°
have been omitted.
E·g·f(n 2 .3+n.0+2) Then, if u, v I; of and u a; and b, may be zero,
(2 0)
=e 3 2 .
= nrar+ ... +a o and v = n'br+ ... +b o' where
and
(1)
(We remark that the fact that a" a; _ l' . . . and b" b, _ l' . . . may be zero does not affect the ordering.) But the ordering ~ given by (1) is precisely the ordering in N W of the bracket symbols
(a,r
0 ) and ao
(r .. , 0 ) b; ... b o
where columns with bottom row zero have been omitted. Clearly, one-one. Hence f: W ~ N W and the theorem is proved.
f
is
245
CONSTRUCTIVE ORDER TYPES, I
Corollary VIl.4.4. 2w = W. THEOREM
VIIA.5: If A > 1, then A B = A C
-+
B =
c.
Let A G A, B G Band C G C, and suppose p: A B ~ A': Then A ..... AC and since A B and AC are well-orderings, it follows that p is an extension of the unique minimal isotonism, Pc, between AB and A C• Now, classically, e > 1 & e r = e.1 -+ r = ..1. Therefore there is an isotonism qe (not necessarily partial recursive) such that qe: B ..... C. Now the map PROOF. I) B
defined only on E( A, B) is an isotonism between A B and Ac. Hence by theorem IV. 5.3, p is an extension of r: Since A > 1, there is a non-minimum element, say a O, in C' A. Let p' be the map p with domain and range restricted to
then p' is partial recursive. Further, if p'
(e (:0))
is defined then its value is e
(~o)
for some y. Now let q' be the map
then clearly q' is partial recursive") and agrees with qe on theorem IV. 5.3). q' is one-one, since
c-s (again by
I) We are here using a similar extension procedure to that used in the proof of theorem VI. 2.4. 2) (x)o = exponent of (po =) 2 in the prime factorization of x.
246
JOHN N. CROSS
(e (:0)) = p (e (~O)) 2
2j ( u, q'(x» + d. 3X 1 . . . . . P:" &
--. p'
=
j
( u' , q'(y»
+ -: 3Y1 •
. ..
.
p~m
for some u, u', d, d', n, m, XI' .•. , X n' YI, ... , Ym where d, d' = 0 or 1. O But by the definition of p', any image of p' is of the form 2 j ( a , b) + I and hence d = d' = 1, n = m = 0, u = u' = a O and p'
and p'
(e (:0)) = 2
(e (~o)) =
j
( aO, q'(x»
2j (a
O
, q' ( Y»
+
1
+ 1.
Therefore
from which it follows, since p' and e are one-one, that X = y. Thus we have shown that q' is a recursive isotonism between Band C. This completes the proof. THEOREM
but A :F B.
VII.4. 6: There exist co-ordinals A, B, C such that AC = BC
PROOF (as in the classical case). Let A = 2, B = 3, C = W. Then by theorem VII.4.3, 2w = s". THEOREM PROOF. A
VII.4. 7: C > 1 & A < B --. C" < CB.
< B -. (ElD) (D :F 0 & A+D
= B). Hence by theorem
VII.3.1, C = C" + D = CA.. CD. Now CD:F 0 since C:F 0, hence (3E) (CD = 1+E). Hence CB = C\1+E) = CA+CA.E by theorem VI. 1.6 and C A ~ CB. But B
CA. = C B
-.
CA. E = 0 -. E = 0 -. CD = 1 -+ D = 0
which is a contradiction. This completes the proof.
CONSTRUCTIVE ORDER TYPES, I
247
VIIA.8: (i) If A, C> 1, then A < A C • (ii) If C > 0, then A s A C • LEMMA
PROOF. (i) Since C > 1, there is a D # 0 such that 1 +D = C. Therefore A C = A 1+ D = A.A D by theorem VII.3.!. Now IADI > 1, ,by classical arguments, hence there is an E # 0 such that AD = 1 + E. Hence A C = A(l+E) = A+AE where AE # 0, i.e. A < A C • (ii) follows at once. THEOREM
VII 04.9: There exist co-ordinals A, B, C such that A < B
but A C $ B C •
Let A = 2, B = V and C = W, then by theorem VIIA.3, Wand by lemma VIIA.8, V < V W = B C • Now if A C ::s; B C , then by theorem 11.5.4 and the transitivity of ::S;, Vand Ware comparable, which contradicts the construction of these co-ordinals. PROOF.
AC
=
Thus we see that the analogue of one of the classical laws for exponentiation breaks down in a very similar way to one of the multiplicative laws (theorem VI.3.2). We have, however, theorem VIlA. 11 which is analogous to theorem VI. 3. 3. VII.4. 10: If E is a principal number for exponentiation, then A, B < E -+ A B < E and conversely if A, B > 1. LEMMA
The assertion is trivial if A, B ::s; 1. Otherwise, if E is a principal number for exponentiation, then A < E -+ A E = E and similarly for B. Hence A IBE) = E. Now B < E and therefore there is a C # 0 such that B+C = E. Therefore E = A IBE) = A(B+C) = AB.A c . But A C > 1, since C # 0; hence A C = 1 +D for some D # O. It follows that E = A B(l+D) = AB+ABD where ABD # 0, i.e. A B < E. Conversely, suppose A, B > 1 and A B < E. Then by lemma VII.4. 8 .(i), A < E. Since E is a principal number for exponentiation, E = A E = (ABl = ABE. By theorem VII 04.5 it follows that BE = E and hence by theorem VI. 2.3. (ii) B < E. PROOF.
THEOREM VII.4. ll : If there is a principal number for exponentiation, E, such that B, C < E (or equivalently B C < E or B, C::s; l) then A < B-+ AC::S;~.
PROOF.
By the transitivity of ::s; and lemma VII. 4. 10, A C < E and
248
JOHN N. CROSSLEY
BC < E. Hence by theorem 11.504, A C and BC are comparable. Now, classically, F < Ll -+ t" ~ Lltl>, hence AC < BC -+ A < B. THEOREM
VII 04.12: If A, B, C are co-ordinals, A C < B C
-+
A < B.
If C = 0 then there is nothing to prove. Otherwise, by lemma VII.4.8, A s A C and B s B C and therefore, by theorem 11.5.4 and the transitivity of ~, A and B are comparable. Hence by the ciassical theorem cpr < tpr -+ cp < tp, we have I A I < I B I and hence A < B. PROOF.
THEOREM
VIlA. 13: There exist co-ordinals A, B, C such that I < A C ~ B C but A $ B.
(as in the classical case). Let A = 3, B = 2 and C = W, then by theorem VII 04.3 (proof), A C = BC = W. PROOF
THEOREM
VII A. 14:
If B, C are
comparable and A > I, then
A B < AC PROOF.
-+
B < C.
By theorem VII 04.7.
THEOREM VIlA. 15: If there is a principal number for exponentiation, E, such that A C < E, then
I < A B < AC
-+
B < C.
PROOF. 1 < A < A implies A, B, C are all ~ 1. By lemma VII 04.10, if A C < E, then A, C < E and BC < E -+ B, C < E. Hence by theorem 11.504 and the transitivity of ~, Band C are comparable. Hence by theorem VII .4.14, B < C. B
C
VIII. Natural well-orderings up to
w(J)w
VIII.t. We showed in § IV that the finite co-ordinals are unique but that for each infinite classical ordinal F there exist c mutually incomparable co-ordinals of classical ordinal F. We now go on to give criteria for collections of co-ordinals which contain precisely one representative for each member of a given collection of classical ordinals. Using these we can give simple criteria for recursive well-orderings to be natural well-orderings, in the sense that if two recursive well-orderings are of the same classical ordinal, then they are recursively isomorphic provided
CONSTRUCTIVE ORDER TYPES, I
249
they are of not too large an ordinal and they are both natural wellorderings. By theorem 1.4.4 it is sufficient to describe co-ordinals which contain such natural well-orderings. In this section and the next we work in a slightly more general context: we do not assume that all our wellorderings are recursive, though it will turn out that they are. In [4] we shall extend our results much further as announced in [21]. DEFINITION VIII. 1. 1: I) If d' is a collection of co-ordinals, then d' is said to be T -unique if
IA I = IB I
A, Bed' &
-+
A = B.
d' is said to be strictly Fvunique if d' is T-unique but not A-unique for any A > T. By theorem IV. 3.6 it follows that d' is strictly T-unique if d' is Tunique but not (T + I)-unique. Corollary VIII, 1.2.
f(? is
strictly co-unique.
PROOF. Immediate from corollaries IV. 2.2 and IV.4. 2. We now give two proofs of the following theorem. The first proof does not use multiplication except in the form A. w. 2) The first three lemmata are common to both proofs. THEOREM VIII. 1.3: The collection £"( +) of all principal numbers for addition is strictly wW-unique. LEMMA VIII, 1 .4:
If A is a quord, then B+A
=
A
+-+
B.w
s
A. 3 )
PRooF.4 ) Suppose B. w ::; A, then there is a co-ordinal C such that B,w+C = A. ThereforeB+A = B+(B.w+C) = (B + B.w)+C = B.w+ C (by theorem II.4.1.(vii» = A. Now suppose B + A = A. If B = 0, then the assertion is trivial. If A = 0, then B = 0, hence we may assume A t= 0 t= B. By hypothesis I)
This definition is adapted from [10],
2) Since we may define W by recursion, thus W = Lr», W"+ I = W" .co, 3) Bu» :S A may also be written (3C) (B, W C = A) which brings out the
+
similarity with theorem VIII. 2.2. 4) This Iheorem can also be proved for co-ordinals using a technique similar to thai in the proof of theorem VIII.2,2.
250
JOHN N. CROSSLEY
there exist quasi-well-orderings A, B and a recursive isotonism f such that f: B+ A ~ A where B)( A. Let (X = CA, proof only.
P = CB.
We introduce the following notation for this 00
Poo
=
Boo
=
(xo
= {x: (Vn)f-" (x)
Ao
=
u
"=0
j" + 1 (P),
A [Poo, s
(X)},
A [(Xo'
We shall prove: 1)
(xo
("\
2)
(xo
u
Poo = 0, Poo = (x,
3) Boo e s.»,
4) x e (xo -+ f(x) = x, 5) x e Poo -+ f(x) # x, 6) Boo)( Ao, 7) Boo+Ao = A.
1) If x e Poo, then x = j"(y) for some n > 0, some yep. Hence f-n(x) is defined and t (X; so x t (xo. 2) Since f maps P u (X onto (x, x s (X implies either ('
or (3n) [F"(x) s P].
I.e. x e (X -+ x s (xo v x e Poo. Conversely, x s (xo -+ X = fO(x) s (X and x s Poo -+ x = j"(y) for some yep, some n > 0, i.e. x e (x. 3) Since B)( A there is a partial recursive function p such that if x e p u (X then xs
(X +-+
p(x) = 0 & x s
p +-+
p(x) = 1
(by theorem II .1. 5). We now use p to calculate a function g such that x s Poo
-+
g(x) = j( r-"(x), n -1)
CONSTRUCTIVE ORDER TYPES, I
251
where n = 11,{r'(x) s P & (\'s) (s < r
Step A. Calculate j-I(X). If a value (say) P(XI)'
-+
j-S(x) e oe}.
XI
is obtained, calculate
Three cases arise: 1. No value is obtained for XI or XI is defined but no value is obtained for p(x I ) ; 2. XI is defined and p(x l ) = 0; 3. XI is defined and p(x I ) = 1. We proceed according to cases. Case 1. g(x) is undefined. Case 2. Repeat step A with
Xl
replacing x,
Case 3. g(x) = j(xl, n) where n is the number of times case 2 has arisen in the computation and X I is the value most recently obtained in performing step A.
g is clearly partial recursive. Suppose g(x) = g(y), then g(x) = j(xl,n) = g(y) for some Xl = j-"-I(X) = j-"-l(y). But is one-one, therefore X = Y and g is one-one. We now show g maps Pw onto p. 00. By the definition ofg, g(pw) 5;; p. 00. If j(x,n)ep.oo then f"+l(x)epw and g(f"+I(X» =j(x,n); hence p.oo 5;; Pw. Next we show that g is order-preserving between Bw and B. ca. It suffices to show that if (xo, Yo) s Bw and Xo = rex) and y = f"(y) where x, yep and 0 < r < m -+ rex) e oe and 0 < S < n -+j'(y) e oe, then I ~ m < nor 1 ~ m = n & (x, y) s B. If m > n, then since j is one-one and order-preserving, (jm - "(x), y) s B+ A. But yep and r - "(x) e oe which contradicts B) (A. Hence In ~ n. If m = n, then (x, y) s B+ A where x, yep. We conclude (x, y) e B. This completes the proof of 3).
r:
4) Since A is a quasi-well-ordering and A o 5;; A, A o is a quasi-wellordering. Now j maps oeo = C' Ao onto oe o since X e oeo -+ j-I(X) e oe o & j(x) e oeo which implies oe o 5;; j(oeo) 5;; oe o' But j is order-preserving, hence by theorem III. 1.6, j = 1 on oe o ' 5) x e Pw -+ x = f"(y) for some n > 0, some yep. Since j is one-one, x = j(x) impliesr"(x) = j-" + lex). Butj-"(x) e p andj"?' + I(X) e oe and p n oe = 0 since B )( A. Therefore j(x) ¥ x. 6) Since j is partial recursive, bj is r.e. If x e Pw, then by 6) j(x) ¥ x.
252
JOHN N. CROSSLEY
If XC Ci o, then by 5) f(x) = x. Hence Cio, {J(O are contained in the disjoint r.e. sets {x: x C fJf&f(x) ¥- x} and {x: x s fJf&f(x) = x}. Hence by theorem 1I.1.4.(i) B(O)( A o . 7) By 6), B(O + Ao is well-defined. By 2), C(B(O + Ao) = Ci. By definition B(O ~ A and A o ~ A. It therefore suffices to prove that {JwXCio ~ A and A ~ Bw + A o . If x e {Jw and y s Cio then(3n) (f-n(x) c {J) but ("In) (f-n(y) c «). Hence <J-n(x), rn(y) c {J x rx ~ B + A, for some n, and since f is orderpreserving, <x, y) e A. If (x, y) e A then either (i) x, y e {Jw or (ii) x e {Jw, y s Cio or (iii) x, y c Cio or (iv) x s Cio, Y e {Jw by 2). Hence in order to complete the proof of 7) we only need to show (iv) is impossible. If (iv) holds, then there is an n such that f-n(x) e Cio and f-n(y) c f3 which is impossible since f is order-preserving and (Ci x {J) n (B + A) = 0. We now complete the proof of the lemma. By 3) Bw e B .w. Let C = CRT(A o), then by 7), B.w+C = A and hence B.w ::;; A. LEMMA VIII. 1.5: A co-ordinal A is a principal number for addition if,
and only
if; B <
PROOF.
A
~
B. co ::;; A.
Immediate from definition IV. 4.8 and lemma VIII. 1. 4.
LEMMA VIII. 1.6: If A cYt'( +), then A = wn < A.
wn for some 11, or
i
for all n,
PROOF. If A = 1, the assertion is trivial. If A > 1, then by lemma VIII. 1.5, 1. t» = W::;; A. If A ¥- W, then W < A. Now suppose W n < A (where n > 0). Since A is a principal number for addition, by lemma VIII .1. 5 W n • w = W n + 1 ::;; A. Hence either A = W n for some n or for all 11, W n < A.
LEMMA VIII. 1.7: If P is a principal number for addition, then P. w is a
principal number for addition and there is no principal number Q such that P < Q < P.w.
The first part is a restatement of theorem IV. 4 .10. Suppose Q e.Yf'(+) and P < Q, then by lemma VIII. 1. 5, P. w ::;; Q; hence PROOF.
Q 1:: P.w.
LEMMA VIII. 1.8: W n is a principal number for addition for every n,
CONSTRUCTIVE ORDER TYPES, I
253
PROOF. If n = 0 or 1, then the assertion is trivial. Suppose n > 0 and W n is a principal number for addition, then by lemma VIIL1. 7, W n + 1 = W n • w is a principal number for addition. Hence the lemma is proved by induction. PROOF OF THEOREM VIn. 1 .3 (FIRST VERSION). By lemmata VIII. 1.6 and VIII .1. 8 a co-ordinal A of classical ordinal < W W is a principal number for addition if, and only if, it is of the form Wn • Hence £( +) is wW-unique. Now let V, V' be two incomparable upper bounds for {W n : n e Y} constructed as in corollary V. 2.3. Then 1V I = I V' I = co", Now A < V --. A < W n < V for some n, and similarly for V'. But A < W n--. A+W n = W n and therefore A+V = V and A+V' = V', i.e. V and V' are principal numbers for addition. Thus £( +) is strictly wW-unique. LEMMA VIII.l.9: (i) W m < W n if m < n, (ii) Ifn ~ 1,1+ W n = W n ,
(iii)
If m <
n, W m+ W n = W n.
PROOF. (i) If m < n, then n = m+(n-m). Hence by theorem VII.3.1, W n = Wm+(n-m) = WmW n- m = W m(I+E) [for some coordinal E] = W m+ WmE. Now I W m I < I W n I, hence W m < W n. (i i) By (i), if n ~ 1 then W:s; W n and hence by lemma VIII. 1 .4, 1+ W n = W n • (iii) W m+ W n = Wm(l + W n - m) = wmW n - m = W n if m < n. DEFINITION VIII .1.10: A co-ordinal C (an ordinal T) is said to be a polynomial in W (polynomial in co) if C (T) can be expressed in the form C = W n .a n+ ... +ao = p(W) (r = w n .a n+ ... +a o = p(w)) where the a, are natural numbers and an ¥= O. The degree of p(8p) is n and the rank of p (rk(p)) is the number of non-zero ai' We observe that I p(W)
1
= p(w).
LEMMA VIII. 1 . 11: If p( W) is a polynomial in W of degree < n, then p(W)+ wn = W n • PROOF by induction on the rank of p. If rk(p)
=
1, then p(W) = Wma m
JOHN N. CROSSLEY
254
for some m
~
0, some am #
p(W)+ W n = W n if op
< n,
o.
Applying lemma VIII .1. 9. (iii) am times,
Now assume the lemma holds for rk(p) = m -1 > o. Then peW) = = i!r{a r # O}. Then rk(q) = rk(p)-l. By am applications of lemma VIII. 1. 9. (iii), peW) + wn = q(W) + W n and by the induction hypothesis, q(W)+ W n = W n. q(W)+ Wm.a m where m
LEMMA VIII. 1. 12: If n > 0, then A < nomial in W of degree < n.
wn
if, and only if, A is a poly-
PROOF. By lemma VIII .1.11, peW) < wn if op < n. Now if A < W n, i A I = p(w) for some polynomial in ca. Hence by corollary IV. 2 .7, A
=
peW).
LEMMA VIII .1.13: WWand W V are principal numbers for addition. PROOF. Since n < V, there is a U such that V = n + U. Then W n+ W V = W n+ W n + U = W n(1+ W u) = WnW U since 1 < U and U W< (using lemma VIII. 1.4). Hence Wn+W V = WnW U = n W + U = W V , and W n < W V for every n. Similarly W n < W W for
w
every n. Now every ordinal < W is represented by a polynomial in wand hence by corollary IV. 2.7 and lemma VIII .1.12 we also have, conversely, A < W V --+ A < W n for some n, and similarly for W w . Therefore if A < WV W
A+ W
V
=
A+(W
n+
W
v)
=
(A+ W
n)+
W
V
=
W
n+
W
V
=
W
V
for large enough n (and similarly for W w). Thus W V and W W are principal numbers for addition. PROOF OF THEOREM VIII .1.3 (SECOND VERSION). By lemmata VIII .1. 6, VIII .1.11 and VIII .1.12, every co-ordinal of the form W n is a principal number for addition and there are no other co-ordinals which are principal numbers and have ordinal < co". Hence £( +) is wW-unique. By lemma VIII. 1.13, W W and W V are principal numbers for addition. But W W = W V --+ W = V by theorem VII.4. 5, which contradicts the definitions of W, V. Hence £( +) is strictly wW-unique. It follows at once from theorem VIII. 1.3 that the collection of predecessors of principal numbers of ordinal < W W contains precisely one
CONSTRUCTIVE ORDER TYPES,
I
255
co-ordinal for each ordinal < W W and is closed under addition by theorem IV.4.11. We close this section with an example of a principal number for addition whose classical ordinal is not a (classical) principal number for addition (v. § IV.4). Example VIII .1. 14. Let p, V be as given in § IV. 4. Let IJ( = C'W v and let U = {(x, y) : x, yea & x S y}. Then IJ( is r.e., clearly, but is
not recursive. For
IJ(
recursive implies
{x: (3y) (y = e(~)) &yelJ(}
= p
is recursive, which contradicts the choice of p. U is ofclassical order type t» and W V and U are strictly disjoint (§ 11.1) but clearly not (even r.e.) separable. Hence W V +- U is well-defined, but does not belong to CR T(W v ) + CR T( U), and is of ordinal W W + co. Let P = CR T
(Wv+-U).
Now if p = {v;}:"= 0 where i < j --+ Vi < vj and Vn = V [ {Vi: i < n}, then Vn s nand C'WVn is recursive. Now CRT(WVn) = W n and by theorem II .1. 6 it follows that W n < P for every n. However, W V {: P since W V + B = P implies that W V + U s P which is a contradiction. By theorem IV. 3.6 we similarly have W V + n {: P for all n. Since A < P --+ I A I < W W + w it follows that A < P --+ A < W n for some n. Therefore A+P = A+(Wn+Q) [for some Q since W n < P] = (A+ Wn)+Q = Wn+Q [by lemma VII1.l.8] = P. Hence Pis a principal number for addition. I P I is not a classical principal number for addition since W W < wW+w but wW+(ww+w) > co", VIII. 2. In this section we prove a multiplicative analogue of theorem VIII.l.3. LEMMA VIII.2.1: LI =f. 0 & T > LIT'
--+
r > I",
PROOF. Immediate from theorem 2, p. 292 in [14]. THEOREM VII1.2.2: If A is a co-ordinal, then BA = A ~ B W divides A, i.e. ~ (3C) (A = BWe). PROOF. B WC=A-+BA=B 1 + WC=B wC=A by theorem IVA.4.(i).
256
JOHN N. CROSSLEY
Conversely, suppose BA = A. We may assume that A > 1, since otherwise there is nothing to prove. By hypothesis there exist well-orderings A, B and a recursive isotonism f such that A e A, B e Band f: A
Let
IX
= C' A,
/3
~
BA.
= C' B. We also write
"a
"I a I" for "I CRT(A [{x: x
8 IX,
f(a) = j(b, at) where b 8
/3 and
fear) = j(b, a, + 1) for some b 8
at 8
IX
/3.
Since f is order-preserving,
la 1 = I B I . I a 1 1+ A for some A < I B
I·
Hence lal~IBI.lall and by lemma VIII.2.1, lal~lall. Similarly, 1a, I ~ I a, + 1 I· It follows that, since A is linear,
n(x) = /lr(xr = x; + 1) [= Pr{(lfY(x) = (If)' + l(X)}] is always defined if x 8 IX. If a 8 IX and n(a) = n, then
I an I = I B I· I an I + A where A < I B I·
But I B
I . I an I z I an I since B "# 0 and therefore A
= 0 and j'(c.) = j(min(B), an)'
257
CONSTRUCTIVE ORDER TYPES, I
We observe that, for any x, if n > n(x) then (lft(x) = (If)n(x)(x).
Let C = A[{x: Lf(x) = x} and let D = O(Bw ) where g is the partial recursive function, defined only on pe, which maps only bracket symbol images ofthe form e (no n l bno bn ,
...
•••
ns) where n i , bn , I> of and no > n l > n2 > ... > ns bn,
~0
onto
n l+ln l nl-I min(B) bn , min(B)
no no-lno-2 ( e bno min(B) min(B)
n, ns-I bn, min(B)
0 ) min(B) .
I.e. g(x) inserts the missing positive integers in the top row of e -I(X) and in the columns where an integer was missing inserts min(B) in the bottom row and takes the image under e of the resulting bracket symbol. It is clear that g is one-one, so D is well-defined. We shall now show that A ~ D. C from which it follows at once that A = B W C where C = CRT(C). Let
.((n(X)-1
hex) = J e kf(lf)"(X) -
...
1 ••.
i
0)
., .
kf(lfi(x) ... kf(x) , (if)
n(x») (x)
.
Clearly, h is partial recursive. Suppose hex) = hey), then kh(x) = kh(y) and 111(x) = Lh(y). Hence (If)"(X)(x)
Now, since e is one-one we have and hence, for 0 :c::; r < n(x),
n(x)
= (Lf)"(Y).
=
(I)
n(y)
kf(lf)r(x) = kf(lf)'(y).
(2)
Putting r = n(x)-I in (2) and using (I) we have f(lf)"(X) - I(X)
But f is one-one, hence (If)n(x) - I(X)
= f(lf)n(x) = (If)"(X) -
I(y).
I(y).
258
JOHN N. CROSSLEY
Now assume where s < n(x) = n(y). Then by (2) with r = s-l
(If)S(x)
(kf) (If)' - I(X)
and using (3)
= (If)S(y)
(3)
= (kf) (If)' -
I(y)
f(lf)S - I(X) = f(lf)' - I(y).
By the one-one property off, (If)s - I(X) = (If)s - I(y)
and by induction it follows that x = y. I.e. h is one-one. It is clear that h maps C' A onto C' D. C and it only remains to prove that h is order-preserving. Suppose a
a,
+-+
f(a j )
+-+a j + 1
or
a, + 1 = a; + 1 &
where b, + Hence
1
b, + 1 < B b; + 1
= kf(aJ
a
+-+
an
an = a~ an
=
& b; < B b~ or
a~ & an -
I
=
..... or +-+
an
=
an
an
a~ & ... &
al
a~ -
=
I
& b; -
a~ &
I
b~ -
I
or
b, < B b',
a~ or
= an &
b; < B b~ or (4)
..... or since f(aJ
= j(a
an = a~ & bn = b'; & ... & b 2 = b; & b l
+ I'
b, + I)'
259
CONSTRUCTIVE ORDER TYPES, I
Now h(a)
hea') +-+ an an
=
or
a~ & (3r) (1 :::;
s < r ...... b.
=
(5) b~ & b,
< B b~).
(4) and (5) are equivalent since, as we observed above, if n > n(a), then an = an(a) and b; = min(B). Finally, h maps C' A onto C' D. C for suppose
x
=j
(
e
II (
bn
•.. •••
0) ) bo
,c e C'D. C,
where c e C'C (and b, e C'B).
=
Let ao
c, a, + 1
= r:
(j(b r, ar»'
Then ao e C' A and if a, e C' A, then a, + 1 e C' A. In particular, an e C' A and an = h - '(x), We have therefore proved h: A ~ D. C and the theorem is established. LEMMA VIII.2.3: A co-ordinal A is a principal number for multiplication if, and only if,
o<
B < A +-+ B W divides A.
PROOF. Immediate from theorem VIII. 2.2 and definition VI. 2.1. 1 We observe that Wwo = W 1 = W, (Wwn)w = W wn. w = W wn+ by theorems VII. 3.3 and VII. 3.1. LEMMA
either A
VIII. 2.4: If A is a principal number for multiplication, then n n n ww for some n, or for all n, Ww divides A and Ww < A.
=
PROOF. By theorem VI. 2.2, if A is a principal number for multiplication then 2 < A. Hence 2A = A and by theorem VIII.2.2, 2w divides A. But by corollary VII .4.4, 2 w = W. Therefore, if I A I = w, A = W. Otherwise Wdivides A and W < A by theorem VI.2.3.(ii). wm Now suppose W < A for m < n (where II > 1). Then, by lemma m 1 VIII.2.3, if A is principal (Wwm)w = Ww + divides A. By theorem n wm 1 wn VI.2.3.(i), W + :::; A and hence, if I A 1= ww , A = W or, for all r, w w W • divides A and W • < A.
Corollary VIII.2 .5. If A, B are principal numbers for multiplication
260
JOHN N. CROSSLEY
and B < A then A = B < A.
r:
LEMMA
VIII. 2 . 6:
PROOF. p'(W) a
wn
for some n, or for all n, B
=
WOP'.a+q(W)
where
q
p(W)+p'(W)
is a polynomial in Wand =
p'(W).
VIII.2.7: If Dp < op', then WP(W). WP'(W)
=
W p'(W).
By theorem VII.3.3, WP(W). WP'(W) = lemma then follows from the previous one. PROOF.
LEMMA
VIII.2.8:
WP(W)+P'(W).
The
If ap < op', then WP(W)+ WP'(W)
PROOF.
divides A and
Ifp( W), p' ( W) are polynomials in Wand ap < ap', then p(W)+p'(W) = p'(W).
#- O. By lemma VIII. 1. 11, LEMMA
wn
=
Wp'(W).
By lemma VIIL2. 7,
WP(W)+WP'(W)
=
WP(W)+WP(W).WP'(W) =
WP(W) {l+W P'(W)}.
By lemma VIIA.8.(ii), W::::;: WP'(W), hence by lemma VIII. 1.4, = WP'(W). Therefore WP(W)+ WP'(W) = WP'(W).
1+ WP'(W)
LEMMA
VIII. 2.9: If a co-ordinal A is of the form A
=
WPI(W).at
+ ... + WPe(W).a e +
q(W)
(6)
where Pt, ... , Pe and q are polynomials in W such that Pt(W) > P2(W) > ... > peCW) at #- 0 and Pt(W) > W, then
A < Wwn and A+ Wwn = wwn if apt < n. Conversely, if A < wwn for some n, then A is expressible in the form (6) where apt < n. PROOF. We prove the two parts simultaneously. Suppose 0 < A < W then I A I has Cantor normal form (cf. e.g. [14], p. 320)
all. at + ... + roT•. a e + q(ro).
where T i > T 2 > ... > T c-
w:
(7)
CONSTRUCTIVE ORDER TYPES,
261 wO' For each i, T, is a polynomial in ro, since otherwise roT; ~ ro which contradicts A < Wwn. Now to every ordinal of the form (7) there corresponds naturally and in a bi-unique way a co-ordinal of the form (6) (i.e. under the mapping p(ro) -+ p(W)). In order to prove the lemma it therefore suffices by virtue of corollary IV. 2 .7 to prove that A + W W" = W w" where n > 0Pl = degree of the polynomial I', (in co), Suppose oq = m - 1, then A+ Wwn = WPl(W).al +
=
WPl(W).al +
I
+ WPe(W).ae+q(W)+ Wwn +q(W)+(Wm+ Wwn)
by lemma VIII. 2 . 8 = WPl(W).al + ...
= Now by
e
i
L ~
1
WPl(W).al + ...
+ WPe(W).a e+ W m+ Wwn + WPe(W).a e+ Wwn =
by lemma VIII. 2 .8 C, say.
a, applications oflemma VIII.2.8 we have C = Wwn.
LEMMA VIII. 2. 10: (i) (3n) (A < WW'') +4 A < Www.
+4
A < WWv,
(ii) (3n) (A < WW)
PROOF. (i) Let V= n+U, then I U 1= co, By lemma VII.4.8, W ~ W U , hence by lemma VIII. 1. 4, 1 + W U = W U • Now Wwn. WWV = W exp (W n+ W v ) = W exp (W n+ W n +u) = W exp (W n • {1 + W u } ) = W exp (W n . W u) = W exp (W n + U ) = WWV. Hence by lemma VIIA.S, Wwn < Wwv. wn Conversely, suppose A < Wwv, then I A 1< ro for some n. But by lemma VIII. 2.9 there is a co-ordinal A' of the form (6) such that I A' I = I A I and A' < Wwn. Hence by corollary IV.2.7, A = A' and A < WW". (ii) follows at once by substituting 'w' for 'V'. (In this case, U = W.) THEOREM VIII. 2. 11; The collection £(.) of all principal numbers for wO' multiplication is strictly ro -unique. PROOF. By lemmata VIII. 2.4 and VIII. 2.9 every principal number for wn wO' multiplication of classical ordinal < ro is of the form W and con-
262
JOHN N. CROSSLEY n
versely, alI the co-ordinals Ww are principal numbers for multiplication. Hence£{.) is co",W-unique. w V Ww and WW are principal numbers for multiplication, since by the w w v V n. n. proof of lemma VIII. 2.10, Ww Ww = Ww and Ww Ww = WW • ww wv wn Further, A < W or W implies A < W for some n; hence, since w n ww all the Ww are principal numbers for multiplication, A. Ww = W v WV and A. Ww = W • wv But WW,W = W implies, by theorem VII.4. 5 (twice), W = V which is a contradiction. Therefore£"(.) is strictly co"'w -unique, THEOREM VIII .2.12: £' (exp) c £' (.) c £' (+). PROOF. By theorem VII.4. 2 every principal number for exponentiation is infinite. Suppose Pe£(exp), then by lemma VII.4.10, A < P-+ AA < P and hence (AAy = P = A P• Hence if A > 1, then by theorem VII. 4.5, AP = P and hence P is a principal number for multiplication. Now suppose P E £(.), then W :==::; P by lemma VII. 2.4, hence by lemma VIII. 1.4, I+P = P. Therefore if 0 < A < P, P = AP = A(1+P) = A+AP = A+P. I.e. Pe£(+). W W E £(.) - £(exp) since for every F < co'" there is a co-ordinal C < W W but to" is not a (classical) principal number for exponentiation. W 2 E £( +) - £(.) by similar argument. Hence alI the inclusions are strict. THEOREM VIII. 2. 12 indicates how we might extend our classes of co-ordinals to get uniqueness up to higher ordinals. We shall present results obtained by this approach in [4] and [21]. Appendix At. In many theorems concerning (classical) ordinals use is made of the theorem
If a well-ordered set ex is similar to a subset of a well-ordered set then ex is similar to an initial segment of p.
p,
The proof of this theorem requires the axiom of choice. Accordingly, it is not surprising that its analogue fails for C.O.T.s and co-ordinals.
CONSTRUCTIVE ORDER TYPES, I
263
In fact, we have made use of this fact in giving counterexamples to analogues of classical laws like A < B --+ AC :s Be. DEFINITION AI.I: A::s 8 if there is a recursive isotonism from A onto a (linearly ordered) sub-relation of B, i.e. if A ~ A' S; 8. Clearly, if Al ~ A 2 , 8 1 ~ 8 2 and Al ::S 8 1 , then A 2 ::S 8 2 , DEFINITION A I .2: A ::S B if there exist A e A and B e B such that A ::S 8. We write A <. B if A ::S B and A ;f. B. THEOREM AI. 3: (i) A ::S A,
(ii) A ::S B & B S C --+ A ::S C, (iii) A < B --+ A -< B, (iv) there exist co-ordinals A, B such that (a) A -< B but A -cI: B, (b) A ::S B & B S A but A ;f. B.
PROOF (i) - (iii) Left to the reader.
(iva) Let A = V B = (ivb) Let A = V, B = contains an infinite Clearly, V [u ~
W, then clearly V ::S Wand V;f. W. W. Now by Post's lemma ([13], p. 291) p = C'V (naturally ordered) recursive proper subset a, W. Hence W -< V.
DEFINITION AI.4: A e.O.T. A is said to be quasi-finite if A and A* are quords. We write :F for the collection of all quasi-finite C.O.T.s. THEOREM AI.5::F is partially ordered by:::;. PROOF. Suppose A ::S Band B::s A where A, B e:F. If A or B = 0 then A = B = O. We may therefore assume A;f. 0 ;f. B. Suppose g : A ~ 8 1 s; Band h: 8 ~ Al s; A, then f: A ~ Al s; A where f = hg, and Ai ;f. 0. Since A, Al are linear orderings, for every x e C'A either <x,f(x» s A or
on:F.
264
JOHN N. CROSSLEY
References [I] H. Bachmann, Transfinite Zahlen (Berlin 1955). [2] P. Bernays and A. A. Fraenkel, Axiomatic Set Theory (Amsterdam 1958). [3] A. Church and S. C. Kleene, Formal Definition in the Theory of Ordinal Numbers. Fund. Math. 28 (1936) 11-21. [4] J. N. Crossley, Constructive Order Types, II. (To appear). [5] M. Davis, Computability and Unsolvability (New York 1958). [6] J. C. E. Dekker, The Constructivity of Maximal Dual Ideal in Certain Boolean Algebras. Pacific J. Math. 3 (1953) 73-101. [7] , An Expository Account of Isols, Summaries of talks (Summer Institute of Symbolic Logic, Cornell 1957) pp. 189-199. and J. Myhill, Recursive Equivalence Types, University of California [8] Publications in Mathematics, n.s, 3, no. 3, 67-214. [9] S. C. Kleene, Introduction to Metamathematics (Amsterdam 1952). [10] G. Kreisel, Non-uniqueness Results for Transfinite Progressions. Bull. Acad. Polon. Sci 8 (1960) 287-290. [II] J. McCarthy, The Inversion of Functions defined by Turing Machines, Automata Studies. Annals of Maths. Studies, no. 34 (Princeton 1956) 177-181. [12] R. J. Parikh, Some Generalizations of the Notion of Well-ordering (Abstract). Notices Amer, Math. Soc. 9 (1962) 412. [13] E. L. Post, Recursively Enumerable Sets of Positive Integers and their Decision Problems. Bull. Amer. Math. Soc. 50 (1944) 284-316. [14] W. Sierpinski, Cardinal and Ordinal Numbers (Warsaw 1958). [15] R. Smullyan, Theory of Formal Systems. Annals of Maths. Studies, no. 47 (Princeton 1961). [16] A. Tarski, Cardinal Algebras (New York 1949). [17] , Ordinal Algebras (Amsterdam 1956). [18] J. S. UIlian, Splinters of Recursive Functions, JSL 25 (1960) 33-38. [19] A. N. Whitehead and B. Russell, Principia Mathematica, Vol. II 2nd ed. (Cambridge 1927). [20] K. Schutte, Predicative Well-orderings, these Proceedings. p. 280. [21] J. N. Crossley and R. J. Parikh, On Isomorphisms of Recursive Well-orderings (Abstract). JSL 28 (1963) 308.
MULTIPLE SUCCESSOR ARITHMETICS R. L. GOODSTEIN Leicester University, UK
I am going to talk about some recent developments in logic-free formalisations of arithmetic. Primitive recursive arithmetic may be formalised as a simple equation calculus, with substitution and uniqueness rules, and primitive recursive and explicit definitions as the only axioms. For instance we may take as the inference rules A=B A=C
A=B j(A) = j(B)
B=C
j(O) U
=
j(x) = g(x) j(A)
= g(A)
g(O)
j(Sx) = H(x,J(x)) g(Sx) = H(x, g(x)) j(x) = g(x)
where A, B are recursive terms. Rule U is in effect a rule asserting the uniqueness of a function defined by the primitive recursion j(Sx)
=
H(x,J(x)).
In U,jmay contain parameters which may also appear in H. These rules may be sharpened in various ways. We may for instance eliminate H from U, replacing U by 4 special cases. We may also replace the infinity of recursive and explicit definitions by a single axiom of recursion and a single axiom of composition formulated with function variables. However it is not my purpose now to go into these developments. In 1959 V. Vuckovic introduced a very interesting generalisation of the
266
R. L. GOODSTEIN
equation calculus, with a multiplicity of successor functions. Thus in place of the numerals 0, SO, SSO, we have elements 0, S to, S zO, ... , SnO, StSzO, StS30, ... , StSZS30, which are formed by prefixing one of St, Sz, ... .S; to any element of the set, starting with zero. We shall assume that the successors Si are commutative, that is
for any i, j. There is another theory in which we dispense with commutativity, but again I shall not be speaking about this. In place of a pair of defining equations for recursive functions we have now a set of equations 1) 2)
F(x, 0)
= a(x)
F(x, SiY) = b;(x, Y, F(x, y))
i = 1,2, ... , n,
where the b, are subject to the restriction
to ensure that the values of F(x, SiSjY), F(x, SjSiY) obtained from equations (2) are the same. The rules of inference are the same as in the calculus with a single successor except that U now contains a line for each successor. Thus U becomes j(O) = g(O) j(SiX) g(SiX) j(x)
= H;(x,j(x))
=
H;(x, g(x))
i = 1,2, ... , n.
= g(x).
Vuckovic introduced n linear functions x a, Y where X
a, 0=
O=:;;i=:;;n-l
X
X (1iSjY = Si + i (x (1; y)
l=:;;j=:;;n
(where i +j is replaced by its excess over n if in fact i+ j exceeds n). The first of these, (1o, is called addition and denoted by +. Thus x+O = x,
x+Siy
= S;(x+y).
267
MUL TIPLE SUCCESSOR ARITHMETICS
All the familiar properties of + may now be proved exactly as in the one successor system. Thus to prove O+x = x, write L(x) = O+x, R(x) = x then L(O)
=
0+0
= 0,
L(Six)
=
SiLx, RO
yielding
=
Lx
= 0,
RStx
=
SiRx,
Rx.
Addition is commutative and it is associative with all linear operations i.e, (x+ y)er i
Z
=
x+ yeri
Z.
Cross multiplication is defined by xxO = 0 x
X
i
SiY = (x x y)erix
=
1,2, ... n,
and is commutative, and distributive over all linear operations. The predecessor functions PiX are PiO = 0,
PjSix
= x, = SiPjX,
j
=
i
j 1= i.
Finally, x...:... y is defined by x...:...O
=
x,
x...:...Siy
=
P;(x...:...y);
cross multiplication is not distributive over difference. To prove the key equation a+(b...:...a) = b+(a...:...b)
Vuckovic was obliged to apply the uniqueness rule to definition by double recursion in the form
Ft», 0) = a(x) F(x, S,Y) = b;(x, y, F(P,x, y)).
The same situation arose thirty years ago in my first account of the equation calculus, where an application of the uniqueness rule to a double recursion was made to prove the same equation a+(b-=-a)
=
b+(a-=-b)
268
R. L. GOODSTEIN
but I was subsequently able to prove this equation without introducing definition by double recursion. Now many of the results and techniques of the single successor system transferred quite readily to the Vuckovic system, but the rather complicated proof of the key equation did not yield to attempts to make it work in the multiple successor system. While working on this problem my student Mr. M. T. Partis noticed a remarkable similarity between Vuckovic's system and a system whose elements are ordered sets of natural numbers. Thus with a Vuckovic number x we associate natural numbers Xl' X2' .•. , x; in the following way:
and if
o=
(0, 0, ... , 0)
then SiX
=
(Xl' X2' .•. , Xi - l' SiXi' Xi + l' . . . ,
x n) ·
It follows that with S2" ,S2
S3" ,S3
----------------k k 2
3
we associate
We call xl> Xl> ... , x, the components of x. Accordingly with a Vuckovic-function F(x) we associate n functions h(xI' X2' .•. , x n) , i = 1, 2, ., ., n, the components of F. What can we say about these functions j, the components of F? It turns out that if F is recursive in Vuckovic's system, i.e. if F is defined from initial functions SiX, Zx = 0, Ix = X by substitution and recursion then likewise the component functionsh(xl' . . x n) are each primitive recursive. What is more surprising is that one can show that any ordered set of primitive recursive functions h(x l . .. x n) , i = 1,2, ... n, are the components of a function F recursive in Vuckovic's system. One does this by listing the generation by iteration and substitution of each f, from initial functions 0, x+l, x 2 , ••• x-'-y, x+y, thus:
MULTIPLE SUCCESSOR ARITHMETICS
269
°cPu
where in each column cPu is an initial function or formed from previous functions in the column by substitution and iteration; it is readily seen that we may suppose only one column changes at a time. One then shows that each step from one row to the next may be imitated in the Vuckovic system (the initial row is itself the components of the element 0 of the Vuckovic-system). The essential construction is that of a Vuckovicfunction whose ith component is respectively the ith component of the ith function of some set of Vuckovic-functions, To this end we define the component-function Ci(x) thus:") so that Then if we have where
j'l= i,
C;(x) = (0, 0, ... , Xi' 0, ... ,0). F;(x)
= ULf~, ... ,f~)
rC;F;(x)
= Ut,
n, ...,f:)
rg;(x) = gl (x) + gz(x) + g3(X)+ ...
+ gk(X)+ ... + gn(x).
Not only is there this (1, 1) correspondence between Vuckovic-functions and sets of primitive recursive functions but we can imitate in the Vuckovic-system any proof carried out in terms of components and we obtain a Vuckovic-proof. This is because a Vuckovic-proof is a series of steps each of which is effectively a Vuckovic-definition, or a substitution or an appeal to uniqueness, and a proof in components uses the same operations. In particular from the proof of x+(y...:...x) = y+(x...:...y) 1) Variables and numerals in the Vuckovic system are here printed in bold type.
270
R. L. GOODSTEIN
in recursive arithmetic it now follows that x+(y-'-x) = y+(X-'-y) is provable in Vuckovic's system without appeal to double recursion. In primitive recursive arithmetic x+(y-'-x) and x..:.(x-'-y) are respectively the greater and smaller of x, y. In the Vuckovic system two elements are not necessarily comparable but Partis has shown that x+(y-'-x), x-'-(x-'-y) are respectively the least upper and greatest lower bounds of x, y so that the Vuckovic system is a lattice in which x + (y -'-x) is the union, and x -'-(x -'-y) the intersection of x, y. I shall conclude by showing a little of the purely arithmetical resources of the system. Let x· Y = (Xl' Yl' , X n ' Yn)' so that X· = 0, X· SiY = Y = x· y+Cix, and let x = (xi', , x~n) so that
°
x O = 1 = (1, 1, ... ,1), xS,y = xY. (Uix) where UiX j i' i.
We define
a ::; b +--> a -'- b
=
0,
so that a ::; b +--> a, ::; hi, for all i. We define further a Ib
+-->
(3c) (c ::; b & b = ac) (3c;) [(c i
::;
+-->
(Vi) (l ::; i ::; n
--+
b;) & (b i = aic;)]).
If f I a and fib, f is called a common factor of a, b. If h I a and hi b and if k I a, k I b ~ k] h then h is the h.c.f. of a, b, it is easily shown that hi is the h.c.f. of a.; hi' If 1 is the h.c.f. of a, b then a, b are said to be relatively prime. It follows that if a, b are relatively prime then all ai' b, are relatively prime and conversely. If ¢(x) is Euler's function which counts the number of numbers (including 1) which are less than and prime to x, then ¢(x) is primitive recursive and so there is a ([> such that
MUL TIPLE SUCCESSOR ARITHMETICS
271
Since ¢(b j ) = 1 +mjb i when a.; bi are relatively prime therefore a
= l+mb
a
= 1 (mod b)
i.e,
when a, b are relatively prime.
References R. L. Goodstein, Recursive Number Theory (North-Holland Publishing Co. Amsterdam 1957). M. T. Partis, Commutative partially ordered recursive arithmetics. Mathematica Scandinavica 13 (1963) 199-216. V. Vuckovic, Partially ordered recursive arithmetics. Mathematica Scandinavica 7 (1959) 306-320.
UNSOLV ABLE PROBLEMS IN THE THEORY OF COMPUTABLE NUMBERS B. R. MAYOR University of Oslo, Blindern, Norway
With each total, general, recursive singulary functionf on the natural numbers (hereafter "recursive function") one can associate the real number '/' = rx = ± a o . ala2a3 . . . that satisfies:
o: :2: 0 if f(O) is even, rx ao = [f(0)/2] ,
s
0 if f(O) is odd,
(Ll) for all i :2: 1, a, = the remainder on dividing f(i) by 10.
A real number rx is said to be computable if there is a recursive function associated with a, Let R denote the class of real numbers, and C the class of computable numbers. 1: If function f: Rk - R and open interval Q c R k are such that f restricted to Q is continuous, monotone in each argument, and can be effectively calculated for any k-tuple offinite decimals in Q, then the value of f is a computable number for every k-tuple ofcomputable numbers in Q as argument. THEOREM
PROOF: Let (Xl' X2' ... , Xk) be any k-tuple of computable numbers in Q. As all finite decimals are computable - since any function whose value is 0 for all but a finite number of arguments is recursive - it suffices to consider the case whenj'(x., Xl> ••• , x k ) = rx is an infinite decimal. Let fl' f2' .. ·,fk be the recursive functions by which Xl' X 2, ... , X k are presented. For any positive integer m, letflm,f2m, ... ,fkm be the recursive functions given by:
fimO) = f;(j)
o
if)::;; m if) > m
THE THEORY OF COMPUTABLE NUMBERS
273
and d lm, d 2m, ... , d km be the finite decimals associated with 11m,12m, ... , Ikm' As Q is open, there is a positive integer I such that m ~ I implies that k Q includes the 2 arguments
Corollary Ia. C is closed under the rational operations, so it is a field. Corollary lb. C is closed under the elementary functions; such functions as x - t exp x, x - t log x, X - t sin x, x - t n", x -+ "x, have computable values for computable arguments. In particular e = exp 1 is computable [10, p. 256]. THEOREM 2: III : R - t R is a continuous function whose sign can be computed effectively at any finite decimal that is not a root, then all simple roots 011 are computable. PROOF. Again one need only consider the case of a simple root is not a finite decimal. As IX is isolated, there is a finite decimal
such that
IX is
IX that
the only root oif in the closed interval
By Weierstrass' theorem tion: g(i)
IX
= 2d o
is presented by the following recursive func-
2d o+ 1
d,
if (j > 0 and i = 0 if (j < 0 and i :F 0 if 1 :::;; i :::;; k
the least z such that fid-s-z : lO-i) and I(d+(z+ 1)' lO- i ) have opposite signs, otherwise.
274
B. H. MAYOH
Corollary 2a. All the roots of a polynomial with computable coefficients are computable [2, 8]. In particular all algebraic numbers are computable [10, p. 254]. Corollary 2b. x -+ sin x satisfies the requirements, so [10, p. 256].
1t
is computable
Corollary 2c. C is closed under the inverse circular and hyperbolic functions. However C can be closed under a functionfwithout f being effective.
THEOREM 3: There is no effective procedure which, given two recursive functions f1 and f2, will stop and present a recursive function f3 such that
PROOF. One can effectively find the following recursive function for any Turing machine M: g 1 (i)
=
°
if i < 2 or M stops and presents 1 within i steps when started on its own Godel number 9 otherwise,
gii) = 2 if i < 2 or M stops and presents
°within
i steps when started
° + If Let be the digit in the first decimal place of presents ° when run on its own Godel number, then on its own Godel number otherwise.
M stops and dis 1 (0). The usual diagonalisation argument shows that one cannot find d effectively. d
'gt'
'g2'.
(1)
Similar proofs show that subtraction, division, multiplication, extraction of roots, exponentiation and the taking of logarithms are also non-effective. Moreover for any computable number a one can show that x -+ x + a, x = x]« when a #- 0, and x -+ x-a are effective if and only if a is a finite decimal. Thus doubling but not trebling is effective. If we had chosen to work with ternary instead of decimal expansions, the opposite would have been true. Such dependence on the number base can occur as conversion from base p to base q is only effective when q divides a power of p (cf. [5] theorems 3 and 5). It is curious that the existence of a procedure for any of the above non-effective operations does not seem to imply the solvability of the Halting problem [10], though each is reducible
to the Halting problem. However the Halting problem is equivalent to the problem of finding the computable number that is the limit of a given recursively convergent recursive sequence of rationals.

There is a profound analogy between the way in which a computable number is associated with each recursive function and that in which a semigroup is presented by each Thue system. Just as finite semigroups can be given by a multiplication table whilst infinite semigroups require a set of defining relations, so finite decimals can be written down directly whilst infinite decimals must be given by a rule. Just as not all semigroups can be presented by Thue systems, so not all real numbers are computable. Most interesting properties of semigroups are "Markov properties" [3, 7] in the following sense:
i) There is a Thue system Γ_1 that presents a semigroup enjoying P.
ii) There is a Thue system that presents an inhibiting semigroup S*, i.e. if S* can be embedded in the finitely presented semigroup S, then S does not enjoy P.
iii) P is preserved under isomorphisms.
The natural analogue of this definition is: A property P of real numbers is said to be pseudo-Markov if
i) There is a recursive function f_1 associated with a number enjoying P.
ii) There is a recursive function f_2 associated with an inhibiting number α*, i.e. if a recursive function g is associated with a number α that differs from α* at only finitely many places, then α does not enjoy P.
It is known that (1) for every Markov property P of semigroups, the problem of determining whether or not a given Thue system presents a semigroup enjoying P is unsolvable [3, 7], and that (2) for each recursively enumerable degree of unsolvability D, there is a class of Thue systems A such that the above problem, restricted to A, has D as its degree of unsolvability [1]. N. Shapiro has proved the analogue of (1) [9, theorem 2.2]; the following theorem is the analogue of (2).

THEOREM 4: For any pseudo-Markov property P of real numbers and any recursively enumerable degree of unsolvability D, there is a class A of recursive functions such that D is the degree of unsolvability of the problem
of determining whether or not the number associated with a given recursive function in A enjoys P.

PROOF. Let S_D be an infinite recursively enumerable set of positive integers whose decision problem has D as its degree of unsolvability. For each positive integer j one can effectively find the recursive function:

g_j(i) = f_1(i)          if i = 0,
g_j(i) = f_2(i) + 10·j   if i ≠ 0 and j is enumerated amongst the first i elements of S_D,
g_j(i) = f_1(i) + 10·j   otherwise.

The associated computable number enjoys P if and only if j ∉ S_D, so {g_1, g_2, ...} will serve for A.

THEOREM 5: If a property P of real numbers is enjoyed by at least one computable number, if any number agreeing with a number enjoying P at all but a finite number of places also enjoys P, and if one can recursively enumerate a set of recursive functions such that a computable number enjoys P if and only if it is associated with at least one recursive function in the set, then P is a pseudo-Markov property and the problem of determining whether or not the number associated with a given recursive function enjoys P is of degree 0'', i.e. has the same degree of unsolvability as the problem of determining whether or not a given partial recursive function is total.
PROOF. Consider the recursive function:

f(i) = 6  if 5 is the remainder of the value of the (i+1)-st listed function for argument i on division by 10,
f(i) = 5  otherwise.

Its associated number does not enjoy P, and no computable number that agrees with it on all but a finite number of places can enjoy P. Thus P is a pseudo-Markov property. For any partial function p, one can effectively find the recursive functions:
g(0) = 1,
g(i+1) = g(i)      if p(g(i) − 1) cannot be computed within i + 1 steps,
g(i+1) = g(i) + 1  otherwise;

f_p(0) = 0,
f_p(i+1) = h(i)(i+1)  if g(i+1) = g(i),
f_p(i+1) = 6          if g(i+1) ≠ g(i) and the remainder on dividing h(i)(i+1) by 10 is 5,
f_p(i+1) = 5          otherwise.
The number associated with f_p enjoys P if and only if p is not total. Moreover for each recursive function r one can effectively find the partial recursive function:

q(i) = undefined  if r and the i-th listed function always agree modulo 10 and they agree exactly for argument 0,
q(i) = 0          otherwise.

q is total if and only if the number associated with r does not enjoy P.
For properties of the form "Being algebraic of degree in S", where S is a recursive set of positive integers, and in particular for "Being algebraic" and "Being rational", this has been proved in another way [9, theorem 11.10]. The theorem also applies to the apparently simpler property "Being expressible as a finite decimal". It is known that (3) for every recursively enumerable degree of unsolvability D there is a class A of Thue systems such that the isomorphism problem restricted to A has D as its degree of unsolvability [1], and that (4) the isomorphism problem for semigroups is unsolvable [4, 6]. The next two theorems give the analogous results in the theory of computable numbers.

THEOREM 6: For every recursively enumerable degree of unsolvability D, there is a class of computable numbers A such that the problem of determining whether or not two recursive functions in A have the same associated number has D as its degree of unsolvability.

PROOF. Let S_D be as in the proof of theorem 4. For any positive integer j one can effectively find the recursive function:
f_j(i) = 0         if i = 0,
f_j(i) = 1 + 10·j  if i ≠ 0 and j is enumerated among the first i elements of S_D,
f_j(i) = 10·j      otherwise.
Let f_0 be any recursive function whose value is always 0. Then {f_0, f_1, f_2, ...} will serve as A, since "j ∉ S_D" reduces to "Do f_0 and f_j have the same associated number?", and one can effectively find the natural number g* such that f_{g*} = g for any recursive function g in A, so "Do the functions g and h in A have the same associated number?" reduces to "(g* ∉ S_D and h* ∉ S_D) or g* = h*".
This proof may not be available for other conventions that associate real numbers with recursive functions; it is then necessary to distinguish between identical recursive functions that are defined differently in order to reduce the decision problems in theorems 4 and 6 to that of S_D.

THEOREM 7: The problem of determining whether or not an arbitrary pair of recursive functions have the same associated numbers has the same degree of unsolvability as the Halting problem, viz. 0'.

PROOF. Let f_0 be as in the last proof. For any Turing machine M, one can effectively find the recursive function
f_M(i) = 0  if M runs for at least i steps when started on blank tape,
f_M(i) = 1  otherwise.
M stops if and only if f_0 and f_M have different associated numbers. Furthermore for any recursive functions f and g one can effectively find a machine M* that compares f(i) with g(i) for i = 0, 1, 2, ... until 'f' and 'g' diverge. M* stops if and only if f and g have different associated numbers. Similarly the following five decision problems are also of this degree: To determine of any pair of recursive functions whether or not the number associated with the first is greater than (not less than, different from, not greater than, less than) the number associated with the second. However one can show - by modifying M* in a suitable fashion - that all six decision problems are solvable when restricted to pairs of functions that have different associated numbers.
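The comparison machine M* is easy to picture. The following Python stand-in (using the same hypothetical digit convention as before) runs two digit functions side by side and halts exactly at the first place where the digit sequences differ; on functions whose digit sequences agree everywhere it runs forever, which is the semi-decision behaviour the proof exploits. The caveat that distinct digit sequences can denote the same real is ignored here, as the paper itself takes up such convention issues below.

```python
from itertools import count

def m_star(f, g):
    """Compare the digit functions f and g; halt at the first place where their
    digit sequences differ (runs forever if they agree everywhere)."""
    for i in count():
        if f(i) % 10 != g(i) % 10:        # digits taken mod 10 (assumed convention)
            return i, f(i) % 10, g(i) % 10

if __name__ == "__main__":
    f = lambda i: 3                       # presents 0.333...
    h = lambda i: 3 if i < 7 else 4       # differs from f at place 7
    print(m_star(f, h))                   # -> (7, 3, 4); m_star(f, f) would not return
```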
N. Shapiro has proved: If a property P of real numbers is enjoyed by only finitely many computable numbers (and by one at least), then it is a pseudo-Markov property and the problem of determining whether or not the number associated with a given recursive function enjoys P has the same degree of unsolvability as the Halting problem [9, theorem 2.16]. In particular this applies for "= a" where a is any computable number. The decision problems for the properties "> a", "≥ a", "≠ a", "≤ a", "< a" are also of this degree. However judicious use of M* enables us to solve these six decision problems when restricted to recursive functions with associated numbers different from a.

If it seems unnatural to associate a real number with every recursive function, as we have done - e.g. one might prefer to replace clause (1) in the definition in the first paragraph by "a_i = f(i)", or to disregard those decimal expansions that end in a string of nines - then one is free to change the definition; our results will still hold if "recursive function" is replaced throughout by "recursive function that is associated with some number".
References
[1] W. W. Boone, Partial Results regarding Word Problems and Recursively Enumerable Degrees of Unsolvability. Bull. Amer. Math. Soc. 68 (1962) 616-623.
[2] A. Grzegorczyk, Computable Functionals. Fund. Math. 44 (1957) 61-71.
[3] A. A. Markov, Impossibility of Algorithms for Recognising Some Properties of Associative Systems (in Russian). Dokl. Akad. Nauk SSSR 77 (1951) 953-956.
[4] A. A. Markov, Impossibility of Certain Algorithms in the Theory of Associative Systems (in Russian). Dokl. Akad. Nauk SSSR 77 (1951) 19-20.
[5] A. Mostowski, On Computable Sequences. Fund. Math. 44 (1957) 37-51.
[6] A. Mostowski, Review of [4]. J. Symb. Logic 16 (1951) 215.
[7] A. Mostowski, Review of [3]. J. Symb. Logic 17 (1952) 151.
[8] H. G. Rice, Recursive Real Numbers. Proc. Amer. Math. Soc. 5 (1954) 784-795.
[9] N. Shapiro, Degrees of Computability. Ph.D. thesis, Princeton University (1955).
[10] A. M. Turing, On Computable Numbers. Proc. London Math. Soc. (2) 42 (1936) 230-265.
PREDICATIVE WELL-ORDERINGS

KURT SCHÜTTE
Kiel University, Germany
A hierarchy of critical numbers of the second number class can be defined in the following way:
(1) The 1-critical numbers are the ε-numbers.
(2) An ordinal α is ν-critical (ν > 1) if f_μ(α) = α for every μ < ν, where f_μ is the ordering function of the μ-critical numbers.
These ordering functions f_μ are normal functions. For μ < ν the set of ν-critical numbers is a proper subset of the set of μ-critical numbers. If α is μ-critical then μ ≤ α. We say that an ordinal κ is strongly critical if it is κ-critical, i.e. if f_κ(0) = κ. It turns out that the smallest strongly critical number κ_0 is a least upper bound for predicative reasoning. We define in section 1 an ordering relation ≺ of equivalence classes of natural numbers representing a sufficiently large segment of the second number class in a constructive way. With respect to this representation of ordinals, we prove in section 3 transfinite induction up to ordinals smaller than κ_0 by using a formal system of ramified type theory which is defined in section 2. In this way well-ordering up to any ordinal α < κ_0 is provable by predicative methods (if the ordinals are defined in a sufficiently constructive way as in section 1). In another paper (Schütte [7]) there is a proof that well-ordering up to ordinals ≥ κ_0 cannot be proved by predicative methods. The same result was found independently by S. Feferman [1]-[3]. Our well-ordering ≺ is based on normal functions according to a construction of Veblen [8]. The well-ordering ≺ of this paper corresponds to a proper segment of a well-ordering in Schütte [5]. It is very closely related to the
well-ordering ≺ of section 11 in Schütte [6]. Both well-orderings represent the same segment of ordinals.

1. A constructive system of ordinals

We use small Latin letters as syntactical variables for natural numbers (including 0) and define a binary relation ≼ on natural numbers. We denote by <, = and ≤ the usual relations on natural numbers. p_0 denotes the prime number 2. p_n for n ≠ 0 denotes the n-th odd prime number.
If a ≠ 0 then a_i denotes the exponent of p_i in the prime factorization of a.

Inductive definition of the relation ≼ (a ⋠ b denotes the negation of a ≼ b): a ≼ b if and only if at least one of the following four conditions is fulfilled:
[1] a = 0 and b = 0.
[2] b ≠ 0 and a ≼ b_i for at least one i.
[3] a ≠ 0, b ≠ 0, and a_i ≼ b_i for all i.
[4] a ≠ 0, b ≠ 0, and there are numbers m ≤ n such that
    a_i = 0 for all i < m (if m ≠ 0),
    a_m ≼ b,
    b ⋠ a_j for all j with m < j < n (if m+1 < n),
    b_n ⋠ a_n,
    a_k ≼ b_n for all k > n.
Obviously, for any natural numbers a, b it is decidable whether a ≼ b or a ⋠ b. It is easy to prove by mathematical induction (according to the natural ordering of natural numbers) that ≼ is a reflexive total ordering relation, i.e.

a ≼ b or b ≼ a (totality),
If a ≼ b and b ≼ c then a ≼ c (transitivity).

We define:
a ≡ b if and only if a ≼ b and b ≼ a,
a ≺ b if and only if b ⋠ a.
a ⊀ b denotes the negation of a ≺ b.

Since ≼ is a reflexive total ordering relation it follows that ≡ is an equivalence relation, and ≺ is an irreflexive total ordering relation with respect to the relation ≡, i.e.

a ≡ a (reflexivity of ≡)
If a ≡ b then b ≡ a (symmetry of ≡)
If a ≡ b and b ≡ c, then a ≡ c (transitivity of ≡)
a ⊀ a (irreflexivity of ≺)
If a ≺ b and b ≺ c then a ≺ c (transitivity of ≺)
a ≺ b or b ≺ a or a ≡ b (trichotomy)
If a ≡ c, b ≡ d, and a ≺ b, then c ≺ d (compatibility of ≡ and ≺)
From the definitions of ≼, ≡ we can derive the following criteria.

Criterion for equivalence. a ≡ b if and only if at least one of the following four conditions is fulfilled:
[E1] a = 0 and b = 0.
[E2] a ≠ 0, b ≠ 0, and a_i ≡ b_i for all i.
[E3] a ≠ 0, b ≠ 0, and there are numbers m < n such that
    a_i = 0 for all i < m (if m ≠ 0),
    a_m ≡ b_n,
    a_j ≺ b_n for all j with m < j < n (if m+1 < n),
    a_k ≺ b_n for all k > n.
[E4] The condition corresponding to [E3] by exchanging a and b.

Criterion for the irreflexive ordering relation. a ≺ b if and only if at least one of the following four conditions is fulfilled:
[O1] a = 0 and b = 1.
[O2] b ≠ 0 and a ≺ b_i for at least one i.
[O3] a ≠ 0, b ≠ 0, and there are numbers m < n such that b_m ≠ 0 and a ≡ b_n.
[O4] a ≠ 0, b ≠ 0, and there is a number n such that
    a_i ≺ b for all i < n (if n ≠ 0),
    a_n ≺ b_n,
    a_k ≼ b_n for all k > n.
These criteria for ≡ and ≺ together are primitive recursive, as is the definition [1]-[4] of ≼.

A natural number a is called a finite or a transfinite ordinal (with respect to the relation ≺) according as a ≺ 3 or 3 ≼ a. It is easy to check: If a is a finite ordinal then 2a is the successor of a. If a is a transfinite ordinal then 2a ≡ a.
3 is the smallest transfinite ordinal (it represents the ordinal ω). To define addition of ordinals we use auxiliary functions α and β which are defined in the following way:

α(a) = a_0 if a = 2^{a_0} · 3^{a_1}, and α(a) = 0 otherwise.
β(a) = a_1 if a = 2^{a_0} · 3^{a_1}, and β(a) = a otherwise.

For addition ⊕ of ordinals we have the definition

a ⊕ b = b                         if a = 0,
a ⊕ b = 2^{α(a) ⊕ b} · 3^{β(a)}   if a ≠ 0.

Since α(a) < a for a ≠ 0, the definition is recursive. Therefore a ⊕ b is computable for any natural numbers a, b. One can prove by mathematical induction:

If a ≡ c and b ≡ d then a ⊕ b ≡ c ⊕ d (compatibility of ⊕ and ≡),
If a ≼ b then a ⊕ c ≼ b ⊕ c (weak monotony on the left),
If b ≺ c then a ⊕ b ≺ a ⊕ c (strict monotony on the right),
a ⊕ 0 ≡ a and 0 ⊕ b ≡ b,
a ⊕ (b ⊕ c) ≡ (a ⊕ b) ⊕ c (associative law).
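Since the recursion bottoms out, it can be run directly. The Python sketch below implements the exponent extraction, α, β and ⊕ exactly as reconstructed above; it is only an illustration of how concretely computable the notation system is, not part of the paper.

```python
# Sketch: the auxiliary functions and the addition ⊕ of the notation system.

def exponent(a, p):
    """Exponent of the prime p in the factorization of a (for a != 0)."""
    e = 0
    while a % p == 0:
        a //= p
        e += 1
    return e

def split(a):
    """Return (a0, a1) if a = 2^a0 * 3^a1 exactly, otherwise None."""
    a0, a1 = exponent(a, 2), exponent(a, 3)
    return (a0, a1) if a == 2 ** a0 * 3 ** a1 else None

def alpha(a):
    s = split(a)
    return s[0] if s else 0

def beta(a):
    s = split(a)
    return s[1] if s else a

def oplus(a, b):
    """a ⊕ b as defined above: b if a = 0, else 2^(alpha(a) ⊕ b) * 3^beta(a)."""
    if a == 0:
        return b
    return 2 ** oplus(alpha(a), b) * 3 ** beta(a)

if __name__ == "__main__":
    print(oplus(0, 7), oplus(3, 3), oplus(2, 3))   # 0 is a left identity; 3 represents ω
```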
The difference δ is defined in the following way:

δ(a, b) = b                if a = 0 or b = 0,
δ(a, b) = δ(α(a), b)       if a ≠ 0, b ≠ 0, and β(a) ≺ β(b),
δ(a, b) = δ(α(a), α(b))    if a ≠ 0, b ≠ 0, and β(a) ≡ β(b),
δ(a, b) = δ(a, α(b))       if a ≠ 0, b ≠ 0, and β(b) ≺ β(a).

By mathematical induction it follows that a ⊕ δ(a, b) ≡ b if a ≼ b.
We say that a function f (on natural numbers) is an ordering function (with respect to ≺) of a set S (of natural numbers) if it satisfies the following three conditions:
(1) f(a) ∈ S for any natural number a,
(2) If b ∈ S then there is a natural number a such that f(a) ≡ b,
(3) If a ≺ b then f(a) ≺ f(b).
Remark. The uniqueness of ordering functions can be proved only by using the well-ordering of ≺. Therefore we cannot speak of the ordering function of a set S before we have a proof of well-ordering for the relation ≺.

A number a is called a main ordinal (with respect to addition) if a ≠ 0 and x ⊕ a ≡ a for all x ≺ a. (Obviously, if a ≡ b, then a is a main ordinal if and only if b is a main ordinal.) Using the properties of ⊕ we find the following criterion for main ordinals:
(1) A number a = 2^{a_0} · 3^{a_1} with a_0 ≠ 0 is a main ordinal if and only if a ≡ a_0 and a_0 is a main ordinal.
(2) Every number a = 3^{a_1} is a main ordinal.
(3) If a ≠ 0 and there is an n > 1 such that a_n ≠ 0, then a is a main ordinal.
We define a function λ in the following recursive way:

λ(a) = λ(a_0)  if a = 2^{a_0} · 3^{a_1} and a ≡ a_0,
λ(a) = β(a)    otherwise.

Then a ≡ 3^{λ(a)} if a is a main ordinal. It follows that 3^a as a function of a is an ordering function of the set of main ordinals. (That means that 3^a represents the ordinal ω^{|a|} if a represents the ordinal |a|.) By the properties of addition and of main ordinals it follows that: For any number a ≠ 0 there are numbers c_1, ..., c_m (m ≥ 1) such that

a ≡ 3^{c_1} ⊕ ... ⊕ 3^{c_m} and c_m ≼ ... ≼ c_1 (Cantor's normal form).
The numbers c_1, ..., c_m are uniquely determined up to equivalence, and they are computable for any given a ≠ 0. Therefore multiplication × of ordinals can be defined recursively in the following way:
(1) a × b = 0 if a = 0 or b = 0.
(2) a × 1 = a (if a ≠ 0).
(3) If c_m ≼ ... ≼ c_1 (m ≥ 1) and c ≠ 0, then (3^{c_1} ⊕ ... ⊕ 3^{c_m}) × 3^{c} ≡ 3^{c_1 ⊕ c}.
(4) If (a ≠ 0 and) d_n ≼ ... ≼ d_1 (n > 1), then a × (3^{d_1} ⊕ ... ⊕ 3^{d_n}) ≡ (a × 3^{d_1}) ⊕ ... ⊕ (a × 3^{d_n}).
(5) If a ≡ c and b ≡ d then a × b ≡ c × d (compatibility of × and ≡).
The following properties of this multiplication are provable by mathematical induction:

a × (b ⊕ c) ≡ (a × b) ⊕ (a × c) (distributive law)
If a ≠ 0 and b ≺ c then a × b ≺ a × c (strict monotony on the right)
If a ≼ b then a × c ≼ b × c (weak monotony on the left)
a × 1 ≡ a and 1 × b ≡ b
(a × b) × c ≡ a × (b × c) (associative law)
Let n be a natural number ≥ 1. We say that a number a is (c_1, ..., c_n)-critical in the following cases:
(1) c_1 = ... = c_n = 0 and a ≠ 0.
(2) There is an m such that 1 ≤ m ≤ n, c_i = 0 for all i = 1, ..., m−1 (if m > 1), c_m ≠ 0, and for all x ≺ c_m
According to this definition we have the following facts:
If a ≡ b then a is (c_1, ..., c_n)-critical if and only if b is (c_1, ..., c_n)-critical.
If n < N and c_{n+1} = ... = c_N = 0 then the (c_1, ..., c_n)-critical numbers are the same as the (c_1, ..., c_N)-critical numbers.
Furthermore, we get the following criterion for critical numbers. A number a is (c_1, ..., c_n)-critical if and only if it satisfies at least one of the following two conditions:
[C1] a ≠ 0 and a ≡ p_0^b · p_1^{c_1} ... p_n^{c_n} where b is one of the numbers 0, a_0, or a.
[C2] a ≠ 0, and there is an i such that a ≡ a_i and a_i is (c_1, ..., c_n)-critical.
=
2a • 3b • 5 of the set of (b, I)-critical numbers,
and Ka
= 2a
•
52 of the set of strongly critical numbers.
The relation -< can be proved by impredicative methods to be a wellordering using a proof similar to that for the related -<-relation in § 12 of Schutte [6].
PREDICATIVE WELL-ORDERINGS
287
2. A formal system of ramified analysis
2.1. Primitive symbols 1. Numerals (symbols representing the natural numbers), 2. free number variables, 3. free predicate variables, 4. bound variables, 5. symbols for computable functions on natural numbers, 6. the relation symbols =, =1=, <, ::;;, =, ¥=, -<, :S, 7. the connectives " A, V, -+, ~, and the universal quantifier A, 8. comma and parentheses.
2.2. Inductive definition of terms 1. Every numeral is a term, 2. Every number variable is a term, 3. If fis a symbolfor an n-ary computable function (n ~ 1) and t l , are terms, then t«; ... , tn) is a term.
••• ,
tn
2.3. Computable functions Besides other symbols for computable functions we use "max" and
"r" as symbols for the binary computable functions which are defined
in the following way: max
(Zl' Z2)
r(zl' Z2)
= {
Z2
ifz 2 . 1f z1
o
if
Zl
= {Zl if Zl Z2
:s Zl -< Z2 -< Z2
:S Zl
2.4. Abbreviations We omit parentheses if they are superfluous. t 2 ) , Ee (tl' t 2 ) we write t 1 + t 2 , t 1 ED t 2 · Instead of + In the same way we write terms in other cases in the usual way. Instead of r(t 1, t 2 ) we write only (t l' t2 ) ·
«:
2.5. Numerical terms A term is called numerical if it does not contain a free number variable. According to the interpretation of function symbols (as symbols for
288
KURT SCHUTTE
computable functions) every numerical term t has a computable value I t I which is a natural number. 2.6. Elementary prime formulas
An elementary prime formula is a formula where t 1 , t 2 are terms and >- is a relation symbol of our formal system. A numerical prime formula is an elementary prime formula which does not contain a free number variable. According to our interpretation of function symbols and relation symbols every numerical prime formula has a computable truth-value "true" or "false". A numerical prime formula (tl "" t 2) has the value "true" if and only if I t 1 I ~ I t 2 I holds. An elementary prime formula t 2 ) is called verifiable if there is a constructive proof of the following property: Whenever ti, ti are the results of substituting arbitrary numerals for the free number variables in t 1 , t 2 the numerical prime formula (ti "" ti) has the value "true".
«. ""
2.7. Nominal forms
A nominal form is a finite sequence containing no symbols other than primitive symbols of our formal system and a nominal symbol (which is not a primitive symbol of our system). If ~ denotes a nominal form and X denotes a primitive symbol or a finite sequence of primitive symbols then ~(X) denotes the results of substituting X for the nominal symbol in ~. 2.8. The supremum
Let a be a free number variable occurring neither in the nominal form s nor in the term s, and let sea) be a term. We say that the relation sup s == s is verifiable if there is a constructive proof of the following properties: (1) The elementary prime formula sea) :S s is verifiable. (2) Whenever s*, s* are the results of substituting arbitrary numerals for the free number variables in s, s, then for every natural number
289
PREDICATIVE WELL-ORDERINGS
n
<:: I s* I there is a computable numeral
holds.
z such that n
<:: I 5* (z) I
We shall use this relation sup 5 == s only in a metamathematical sense. We shall not define it as a formula of our formal system. 2.9. Inductive definition offormulas and predicates
We define only monadic predicates. To any formula and to any predicate we associate a level s which is a term of our formal system. I. Every elementary prime formula is a formula of level O. 2. If p is a free predicate variable and s is a term then pS is a predicate of level s. 3. If q is a predicate of level sand t is a term then q(t) is a formula of level s. 4. If F is a formula of level s then' F is also a formula of level s. 5. If F I , F 2 are formulas of levels Sl' (F I
A
S2
then
F 2 ) , (F I v F 2 ) , (F I -+ F 2 ) and (F I
are formulas of level max
.-.
F2 )
(Sl' S2)'
6. Suppose (l) ~(a) is a formula of level s(a) where the free number variable a does not occur in the nominal forms ~ and s. (2) Either a does not occur in the superscript of a variable in ~, or ~ does not contain a free predicate variable. (3) The relation sup 5 == s is verifiable where s is a term containing no free number variables other than those which are contained in s. (4) x is a bound variable which does not occur in ~. Then (x~(x» is a predicate of level sand Axlll(x) is a formula oflevel s. 7. Suppose (1) ~(pt) is a formula of level s. (2) The free predicate variable p does not occur in the nominal form Ill. (3) The bound variable x does not occur in 'li. Then Axt~(x') is a formula of level s.
290
KURT SCHUTTE
Remark. According to this definition a formula or a predicate can belong to different levels St, Sz but only if the elementary prime formula St == Sz is verifiable. 2.10. Syntactical variables
a, b, c, d for free number variables, s, t, u, v for terms, z for numerals, x, y for bound variables, p for free predicate variables, q for predicates, Latin capitals for formulas, Gothic letters for nominal forms. We shall also use these letters with subscripts. 2.11. Elementary formulas
An elementary formula is a formula which does not contain a free predicate variable or a bound variable. It is composed only of elementary prime formulas and connectives. A numerical formula is an elementary formula which does not contain a free number variable. According to the truth-values of numerical prime formulas and the interpretation of connectives every numerical formula has a computable truth-value "true" or "false". An elementary formula F is called verifiable if there is a constructive proof that every numerical formula which results by substituting arbitrary numerals for the free number variables in F has the value "true". 2.12. Notations Aform of level s is a formula of level s or a predicate of level s. If a form F contains an expression pU or XU with a free predicate variable p or a bound variable x then we call u a level indicator of F. A level indicator is either a term or the result of substituting bound variables for some free number variables in a term. THEOREM 2. I : The level s of a form F contains no free number variables other than those which are contained in level indicators of F.
291
PREDICATlVE WELL-ORDERINGS So
THEOREM 2.2: If a form F of level s contains a form Fo of level So then S s is verifiable. THEOREM 2.3: (term substitution) Suppose
~(a) is a form of level sea) where a is a free number variable which does not occur in the nominal forms ~, s, (2) t is a term. Then ~(t) is aform of a level s such that set) == s is verifiable. (~(t) is a formula if and only if~(a) is aformula.)
(1)
THEOREM 2.4: (predicate substitution). Suppose (1) ~(i) is a form oflevel u~(i) where t is a term and p is a free predicate variable which does not occur in the nominal form ~, (2) q is a predicate of level aq. Then ~(q) is a form of a level u~(q) such that uq
S
t -+ u~(q)
s u~(i)
is verifiable. m(q) is a formula if and only if ~(pt) is a formula.) These theorems 2. 1-2.4 are immediately provable by induction on the formation rules for formulas and predicates. THEOREM 2.5: IfAxt~(xt) is aformula of level s then t
:S s is verifiable.
PROOF. According to our definition, the formula Axt~(xt) has the same level s as the formula ~(pt) containing a predicate pt of level t. Therefore t S s is verifiable according to Theorem 2.2.
2.13. Axioms
(AI) Every formula which is derivable in intuitionistic logic. (A2) Every elementary formula which is verifiable. (A3) The abstraction axioms: where t is a term and
(x~(x»
(t)
(x~(x»
is a predicate.
~ ~(t)
(A4) The axioms for trivial quantification: t
where t is a term and
=0
-+ Axt~(xt)
Ax~(xt) is
a formula.
292
KURT SCHUTTE
(AS) The axioms for predicate quantification:
Axl'll(XI )
A
oq -< t
-+
'll(q)
where q is a predicate of level oq and Axl'll(x') is a formula. 2.14. Inference rules
(BI) Every inference rule of intuitionistic logic. (B2) The inference rule of mathematical induction: FA Ax[x < a
-+
'll(x)]
-+
'll(a)
=>
F -+ 'll(t)
where a is a free number variable which does not occur in the formula F or in the nominal form'll, 'll(a) is a formula, and t is a term. (B3) The inference rule for predicate quantification: F -+ 'll(p(a, I)
=>
F
-+
Axl'll(x')
where p is a free predicate variable which does not occur in the formula F or in the nominal form'll, and a is a free number variable which does not occur in F, 'll or the term t. Remark. A formula Axl'll (x') has the interpretation: "'ll(q) for all predicates q of level -< r: 2 . 15. Definition
A formula of level s is regularly derivable if it is derivable only from formulas of levels s, such that Sj ::5 S is verifiable. 3. Proof of transfinite induction in the formal system of ramified analysis We define the foIlowing formulas: Pr (q) ( <, -progressivity
+-+
der
Ay [Ax(x
<
y -+ q(x» -+ q(y)]
of the predicate q),
l(q, t) ~ [Pr(q) -+ Ay(y
<
t -+ q(y))]
(transfinite -<-induction up to the ordinal t applied to the predicate q), J(s, t) ~ Ax'l(x', t)
PREDICATlVE WELL-ORDERINGS
293
(transfinite -<-induction for predicates of levels -< s up to the ordinal t), The formulas Pr(q) and I(q, t) have the same level as the predicate q. The formula J(s, t) has the level s. We want to derive J(s, t) for ordinals t smaller than the first strongly critical number "0' In the following derivation we use the abbreviations: "log. der." for "derivable in intuitionistic logic", "by prec." for "it follows from the preceding formula", "by n prec." for "it follows from the last n preceding formulas". To prove transfinite induction up to the first a-number in Gentzen's way [4] we use two binary functions y and t/J which are defined in the following way: y(a,O)
=
t/J(a, b)
o { = A(8(b, a»
a, yea, b+ I)
=
yea, b) 6' a,
if a -< b, if b :S a.
The following formulas are verifiable: (1.1) a =1= 0 -. 3~(a):s a (1.2) a
-< y(3~(a),
a)
(1.3) a <; b 6' y(3",(a,bl, 8(b, a» (1.4) c =1= 0 A a <, b Ell 3' -. t/J(a, b)
-< c.
The verifiability of (I. I) and (1.2) is provable by mathematical induction on a. PROOF OF (1.3). Suppose Then a == b Ell 8(b, a).
By (1.2) 8(b, a)
hence a
-<
b:s a.
-< y(3~(d(b,a»,
(Otherwise the statement is trivial.)
b(b, a»
= y(3 "'(a, b), 8(b, a»,
b Ell 'l'(3l/!(a,b), b(b, a».
PROOF OF (1.4). Suppose c =1= 0 and a -< b Ell 3'. If a -< b then t/J(a, b) = 0 -< c. If a == b then t/J(a, b) = A(c>(b, a» = leO) = 0 -< c. If b <, a then b Ell b(b, a) == a -< b Ell 3' implies 8(b, a) -< 3'. By (1.1) it follows that 3~(d(b,a» :S b(b, a) -< 3', hence t/J(a, b) = A(b(b, a» -< c.
294
KURT SCHUTTE
We associate with a predicate q a predicate q(t) ~ Ay[Ax(x
q such that
-< y --. q(x)) -+ Ax(x -< y
EB 31
-+
q(x))].
(According to section 2, q can be defined as a predicate (xlll(x)) of the same level as the predicate q.) Then the following formulas are regularly derivable: (1.5) Ax(x
-< b -+ q(x)) A q(t) --. Ax[x -< b EB
(1.6) I(q, c) --. I(q, 3
C
')'(31, u)
-+
q(x)]
)
(1.7) J(s, A(a)) -+ J(s, a).
The formula (1.5) is derivable by using the inference rule (B2) of mathematical induction with respect to the term u. Derivation of (1.6). c =l= 0 A a Ax(x
a
-<
c
b EB 3 -+ tjJ(a, b)
-< c -+ q(x)) A tjJ(a, b) <
-< b EB ')'(31/1(0, "', b(b, a))
Ax(x
-<
-< c
is the verifiable formula (1.4)
c -+ q(ljJ(a, b))
is log. der.
b -+ q(x)) A q(tjJ(a, b)) -+ [a
c =l= 0 A Ax(x
-< c --. q(x)) A Ax(x -<
c =l= 0
<
c
A
Ax(x
= 0 A Pr(q)
-< b
is the verifiable formula (1.3)
EB
')'(31/1(0, bl,
by (1. 5)
b --. q(x)) --. (a
-<
b EB 3c
--.
by 4 prec.
q(a))
by prec.
c --. q(x)) -+ q(c)
is trivially derivable
--. q(c)
by 2 prec.
Pr(q) -+ Pr(q) Pr(q) A Ax(x
b(b, a)) --. q(a)J
<
I(q, c) -+ I(q, 3
C
c
-+
q(x)) -+ Ax(x
<
c
3
--.
q(x))
is trivially derivable by 2 prec.
)
Derivation of (1.7). is trivially derivable
J(s, A(a)) -+ J(s, A(a) EB 1) J(s, A(a) EB 1) A (b, s)
s =l= 0
-+
(b, s)
-< s
-< s -+ I(p(b, s), lea)
EB 1)
is an axiom (A5) is verifiable
295
PREDICA TIVE WELL-ORDERINGS
I(lb. sl, A(a) EI7 1) I(p(b. sl, 3A(al Ell 1)
~
~
I(lb. sl, 3A(al
by (1.6)
Ell 1)
because a -< 3 A(a l Ell l is verifiable
I(p(b. sl, a)
s =1= 0 A J(s, ,l.(a)) ~ I(p(b. sl, a)
by 5 prec.
s =1= 0 A J(s, ,l.(a)) ~ J(s, a)
by an inference (B3)
s = 0
is an axiom (A4)
J(s, a)
~
by 2 prec.
J(s, -lea)) ~ J(s, a)
LEMMA 1: The formula Jis, 5) is regularly derivable. (The number 5 represents the first a-number with respect to the -<. -ordering.)
PROOF.
Its, 0)
is trivially derivable
Jts, ,l.(a)) ~ Jts, a)
by (1.7)
a =1= 0 a
A
a
-<. 5 ~
-<. 5 ~
-lea)
< a A -lea) -<. 5 is verifiable
Jts, a)
from 3 prec. by mathematical induction by prec.
/(s,5)
For the proof of the next lemma we need binary functions J1 and v which are defined recursively in the following way: J1(a, b)
0
a=O
v(a, b)
0
{ a o =1= 0 ao = 0
J1(a o, b)
v(ao, b)
p(a 1 , b)
Veal> b)
{a 1 S b b < a1
ao
a1
a
b
{a S b b-
0
a
a
b
{a S b n > 2 with an =1= 0 b-
0
a
a
b
a = 2ao • 3a, a
= 2ao • 3a , • 5
a =
2ao • 3a, . 5az
1
-< a z
There is an
296
KURT SCHUTTE
One can prove by induction on a that the following formulas are verifiable:
(2.1) v(a, b)::s b
-<
(2.2) a oF 0 A v(a, b)
b
--+
Ilea, b) < a
(2.3) a = 0 v ..1.(a) < a v a == 2/1(0, b) • 3'(0, b) • 5 LEMMA
2: The formula
PROOF.
We use the following abbreviations:
Ax Ay(y -< t 1 A J(s, x) --+ J(s, 2x • 3Y • 5)) A J(s ffi 1, to) --+ J(s, 2to • 3t 1 • 5) is regularly derivable for arbitrary terms to and t i -
m:(s, t 1 ) ~(s,
~
Ax Ay[y <, t 1
A
J(s, x)
--+
J(s, 2X • 3Y • 5)J
b, t 1) +-+ Ax(x <; b --+ J(s, 2x • 3t l • 5)) def
il(s, a, b, t 1 )
+-+ Ax[x def
< a
--+
(x
-< 2 b • 3t l • 5 --+ J(s, x))J
Derivation of the asserted formula m:(s, td
A
J(S ffi 1, to)
--+
J(s, 2t o • 3t l • 5).
v(a , tl) -: t 1 A a == 2/1(0, tl). 3v(a, ttl. 5 A m:(s, t 1) ( a <, 2b • 3t l • 5 --+ J(s, a)
v(a,
==
t1)
t
( --+ J(s, a)
v(a, t 1 )
1
A
a == 2/1(0, ttl. 3v(a, ttl . 5 A
0
--+
A
il(s, a, b, t 1 )
J(s, a)
A
is derivable by using (2.2) with t 1 instead of b t
1)
-< t 1
-'c(a) < a
=
b,
il(s, a, b, t 1)
A
a
< 2b • 3t 1 • 5
is derivable by the definition of ~ is verifiable by (2. 1)
a == 2/1(0, til. 3v ( a , til. 5 A m:(s, t 1 ) {A a -< 2 b • 3t l • 5 --+ J(s, a)
a
~(s,
A
A
a
-< 2
b
A ~(s,
b, t 1 )
A
il(s, a, b, t 1 ) by 3 prec.
•
t1
3
•
5
--+
J(s, a)
is derivable by using (1.7) is an axiom (A4)
297
PREDICATIVE WELL-ORDERINGS
a = 0 v A(a) < a va == l2I(s,
t I) A
!B(s, b,
t I) A
21'(o,'tl • 3,(o,'tl •
i!(s, a, b,
t I) A
a
5
-< 2b . 3
11
is the verifiable formula (2.3) •
5
J(s, a) by 4 prec.
-+
by mathematical induction
l2I(s, t 1 ) A !B(s, b, t 1 ) l2I(s, t 1 )
-+
-+
J(s, 2b • 3'1. 5)
by prec.
Pr«x J(s, 2"" . 3" . 5»)
by prec.
Prtt» J(s, 2"" . 3" . 5») A J(s EB 1, to)
-+
J(s, 2'0 . 31 1 • 5) because the pre-
dicate (xJ(s, 2"" . 3" . 5» has the level s <, s EB 1
l2I(s,
tl)
A J(s EB 1, to)
-+
J(s, 2'0 . 3'1 . 5)
by 2 prec.
To prove the next lemma we need a binary function 0 which is defined recursively in the following way:
I {36 ( b, P(o» EB O( cx( a), h)
o( a, b) --
if a = 0 or f3(a) if a 4= 0 and b
The following formulas are verifiable: (3.1) a (3.2) a
-< 3
bxO(a,
<
LEMMA
b
3
h)
'" 1 X V -+
8(a. h)
-< 3 xv.
3: The formula
Ay [J(3 U x (y, 3 3 ' ) EB 1, t l ) A
u
J(3 fI; 1 x(h, 3
3
-+
'), tl) -+
J(3 U x (y, 3 3 ' ) , t 2)] J(3
ue
I
x (h, 3
is regularly derivable for arbitrary terms s, 11' 12• u. PROOF. We use the abbreviations
A ~ Ay[J(3 U x (y, 3 3 ' ) EB 1,11)
s(h)';;r 3u fI; 1 x(h, 3 3 ') o(a, h) ';;r O«a, s(b», u).
-+
J(3 U x (y, 3 3 ' ) , t 2 ) ]
3
A
s 4= 0
' ) , t 2)
-<
b,
:S f3(a).
298
KURT SCHUTTE
The formula A depends on 5,1 1,12 , u. The nominal forms sand n depend on 5 and u. The asserted formula of our lemma is A A 5 =!= 0 A J(s(b), 11) -+ J(s(b), ( 2 ) ,
Derivation of this formula: (a, s(b»
-< 3"
s(b) =!= 0
-+
(3.3) s(b) =!= 0 u(a, b)
Ell
1
x (b, 3 3 ' )
s(b) =!= 0
8«a, s(b», u)
3
' ) --+
3" x (u(a, b), 3
A A J(s(b), ( 1) A 3" x (u(a, b), 3 { -+ J(3" x (u(a, b), 3 3 ' ) , ( 2)
(3.4) s(b) =!= 0 A A
(3.5)
5
=!= 0
A
s(b) =!=
A
3
<3
-< s(b) ' ) EEl 1 -< s(b)
by 2 prec. is trivially derivable
x (b, 33 ' ) -+ u(a, b)
=!= 0 --+ o(a, b)
OA 5
-< 3" x u(a,
{ -+ (a, s(b»
<
s(b) =!= 0 A
=!= 0
-< 33
5
(a, s(b»
A
3
{ --+ I(p(a, db), ( ) 2
s =!=
s(b) =!= 0 A A
b) A o(a, b)
3" x (u(a, b), 3
J (3" x (u(a, b), 3
OA
EEl 1 -< s(b) is verifiable
J(s(b), t 1) -+ J(3" x (u(a, b), 3 3 ' ) ,
(a, s(b» <. 3" x o(a, b)
(3.6) s(b) =!=
')
( 2)
by 2 prec.
u(a, b)
(a , s(b»
by 2 prec.
3
3" x (u(a, b), 33 ' ) EEl 1
-+
by (3.2)
is verifiable
-< 3 x (b, 33 ' )
-+ u(a, b)
-< 3 x (b, 33 ' )
-< s(b)
(a, s(b»
-< 3 x (b, 3
-+
OA
A 5
' ) , ( 2) A
3
by (3.3) and (3.5)
<
A
by (3.1)
-< 33 '
is verifiable
')
-< 3" x (u(a, b), 33 ' )
(a, s(b»
by 3 prec.
-< 3" x (u(a, b), 33 ' )
J(3" x (o(a, b), 3 3 ' ) ,
=!= 0
-< 33 ' is verifiable
is an axiom (A5)
( 2) --+
I(p(a'5(b», ( 2 ) by 2 prec.
J(s(b), ( 1) --+ I(p(a, ,(b», (2 )
by (3.4) and (3.6)
s(b)
*0
s(b)
== 0
A
PREDICA TlVE WELL-ORDERINGS A
-+
*0
AS
A
A
A S
*0
A
J(s(b), t l )
J(s(b), t 2 )
-+
by an inference (B3) is an axiom (A4)
J(s(b), ( 2 ) J(s(b),
299
by 2 prec.
J(s(b), t 2 )
II) -+
We define a predicate Qs such that
.
Qs(t) +-+
* 0 then
If S
-+ J(3 3 ("
-<
(t, s)
3
{AY Ax [J(3 3 (" s )
-< 33 '
On the other hand if u
-<
(y, 3 3 ' )
33 ) ,
(y,
sand
3 (" ')\91 X
(y, 33 ' ) , x)
')$ I X
3 3 ("
then u
::5
2:< . 3(" .) . 5)]
')$llil3'
3 3 ("
')lill
== 3 3 ' . X
(u,3 3 ') .
Therefore the formula (4.1)
is verifiable. (Qs is a predicate of level 33 ' if LEMMA
4: The formula S
*0
-+
S
* 0.)
Pr(Qs)
is regularly derivable for any term s. PROOf.
Let t be the term 33 6 ( b ,
(c, 3 3 )
X
(a • • ))
A X(X < a -+ Q.(x» A b < (a, s) A J(3 3 ( b , {-+ J(3 3 ( b , ')$1 X (t, 3 3 ') , 2d • 3(b, s). 5) b
-< (a, s) -+ (r, 3
{ A (b, s)
3 ')
==
1
A 3 3 ( b , ')$1 X
Ax Ay[y -+
X
(t, 3 3 ' ) , d)
is trivially derivable
==
t
$ I
33 (b, . ) X
== b
A X(X -< a -+ Qs(X» A b -< (a, s) {-+ )(3 3 ( a , . ) X (c, 3 3 ' ) , 2d • 3b • 5)
A
s)
J(33<"·
J(3 3 ( a •
-< (a, s) s)
• )
X
X
(c,
A J(3 3 ( a ,
33 ),
2:< .
(c, 33 ' ) EB 1, d)
is verifiable A
X (c,
.)
3Y •
(c, 33 ' )
J(3
3
(a• • )
X (c,
3
3 ),.d)
by 2 prec. 3 3 '),
x)
5)]
-+
J(3 3 ( Q•
• )
X
(c, 33 ), 2d • 3(a. s) • 5)
by Lemma 2
300
KURT SCHUTTE
A X(X -< a -+ Q.(x» /\ J(3 3 ( a . , ) X (c, 33 ' ) Ell 1, d) {-+ J(3 3( a . ,) X (c, 3 3 ' ) , 2d • 3(0,.) . 5) Ay[J(3 3 ( a . /\ S (
-+
X
, )
~ 0/\ J(3
J(3 3 ( a .
')(1)
(y, 33 ' ) Ell 1, d)
3(a,
1
, ) (I)
1 X
-+
J(3 3 ' a .
, )
X
x(c, 33 ' ) , 2d • 3(0,.). 5)
by Lemma 3 1 X
(a.
s
~
0
-< a --+ --+
(y, 33 ' ) , 2d • 3(0,.) . 5)J
(c, 33 ' ) , d)
A X(X <. a --+ Q.(x» /\ s ~ 0/\ J(3 3
Q.(x» /\ s
~
0
--+
(c, 3 3 ') , d) by 2 prec. by prec.
Q.(a)
by prec.
Pr(Q.)
LEMMA
by 2 prec.
5: The formula J(3 3 ' Ell 1, t) /\ t
-< S -+ J(33', 3' . 5)
is regularly derivable for arbitrary terms s, t. PROOF.
s ~ 0
-+
aQ.
J(3 3 ' Ell 1, t) Q.(t)
--+
J(3
3
-< A
<"
33 ' Ell 1/\ Pr(Q.)
-<
aQ. s ) (I)
1X
by (4.1) and Lemma 4
3 3 ' Ell 1/\ P"(Q.) (v, 3
3
'),
-+
Q.(t) is trivially derivable
3("') . 5)
is trivially derivable
-<
s -+ s ~ 0 A (t, s) == t is verifiable (5.1) J(3 3' Ell 1, 1) A t -< S --+ J(3 3' tI! 1 X (v, 33 ' ) , 3' . 5) by 4 prec. t
Let v be the term (a, 3 3 ' ) Ell 1. Then the formula (a, 3 3 ' )
-<
3 3 ' (1) 1
X
(v, 3 3 ' ) is verifiable. Therefore 3
(5.2) J(3 3 ' EIl l x (v, 3 3 ' ) , 3'· 5) -+ l(p(D. 3 ' ), 3" 5) 3 J(3 3' Ell I, t) /\ t -< S -+ l(p(D, 3 \ 3'· 5)
J(3 3 ' Ell I,
1) A t
-<
S --+
J(3 3 ' , 3' . 5)
is derivable by (5.1) and (5.2) by an inference (B3)
We define a monadic function tt in the following recursive way: 1) If b
-<
5 then n:(b)
= O.
PREDICATIVE WELL-ORDERINGS
301
2) If b == 3b • 5 then neb) = b.
-< 3 b • 5 there are
3) If 5 ::5 b 3.1) b = 2
bo
•
b,
3
•
only the following two cases:
Then neb) = max (n(b o), n(b i)
3.2) b = 2bo • 3b , • 5. Then neb) = max (n(b o), bi)' For this function n the following three formulas are verifiable: (6.1) b
-<
3
K
·5
(b) $ 1
(6.2) 5 ::5 b --. 3
K
(b) .
(6.3) b::5 3c • 5 A
5 ::5 b
C =!=
0 --. neb) EEl I ::5 c.
The formulas (6.1) and (6.2) are provable by induction on b. (6.3) is trivial for b -< 5. It follows from (6.2) for 5::5 b. LEMMA
6: The formula
J(s, t)
t ::5 s
A
A
35 ==
S --.
J(s, 3' . 5)
is regularly derivable for arbitrary terms sand t. PROOF. U
~f
max
We use the abbrevations:
«a, s) EEl 1, neb) EEl 2).
v d7r neb) EEl 1 v <, u is verifiable, therefore J(3 3 " EEl 1, v) --. J(3 3 " , 3v
b
-<
•
3' . 5 A t =!= 0 --. v ::5 t
v ::5 t
s t ::5
J(5, t)
A
3
3
"
SA
3 == 5
EEl 1 <
S A V
t =!= 0 A J(s, t) A t ::5 (6.4) { --. J(3 3 " , J" . 5) b-<3
v'5
(a, s) <, 3
S --.
S A
is derivable by Lemma 5
5)
3
3
"
by (6.3)
EEl 1 -< s
::5 t --. J(3 35 ==
S A
3
b
"
<
is verifiable EEl 1, v) is trivially derivable 3' . 5
by 4 prec. by (6.1)
3
"
is verifiable
302
KURT SCHUTTE
(6.5) J(3 3 " , 3"· 5) -+ l(p(a.
s),
t =1= 0 A J(s, t) A t
:5 s A 3 ==
SA
t =1= 0 A J(S, t) A t
:5 s A 3s ==
s
t
=0
J(s, t)
A
s
t
:5 SA 3 == s
((0) = 0, ((n + 1) =
t
t
J(S, 3
-+
•
5
•
5)
-+
s), b) by (6.4) and (6.5)
l(p(a,
by prec. by Lemma 1
s
t
J(S, 3
-+
We define monadic functions ( and
L(a)
b -< 3
J(s, 3t • 5)
-+
by 2 prec.
b)
3~(n)
L(7t(a)) + 1
={ 1
•
if 0
•
L
5)
by 2 prec.
in the following recursive way:
5.
-< a -:
52
otherwise.
Then the following formulas are verifiable: (7.1) Sen) (7.2) a
-< 52
-< 52 -+ a -< S(L(a)).
LEMMA 7: For any natural numbers m :::; n the formula J(s(n), s(m+l)) is regularly derivable. PROOF by induction on m. (1) m = O. Then S(m+ 1) = 5. By Lemma 1 the formula J(s(n), 5) is regularly derivable. (2) 0 < m :::; n. Then sen) is an s-number, i.e. 3~(n) == Sen). Furthermore sCm)
:5 Sen). By the induction assumption
J(S(n), sCm)) is regularly derivable. By Lemma 6 it follows that J(C,(n), sCm + 1)) is regularly derivable.
THEOREM: For any number provable in a predicative way. PROOF. If z
Z
-< 52
transfinite induction up to z is
-< 52 then z -< ((L(Z)) by (7.2).
Therefore it is sufficient to prove: For any natural number n transfinite induction is provable predicatively up to numbers :5 ((n). We prove this statement by induction on n. (1) n
= O. Then ((n) = 0, and the statement is trivial.
PREDICA TlVE WELL-ORDERINGS
303
(2) Suppose the statement is true for n. Then we have a predicative proof that the restriction of our formal system to levels ::S ((n) is a predicative formal system because these levels are well-ordered. According to Lemma 7 we have a proof of J(((n), ((n+ 1» in this formal system. Therefore transfinite induction up to numbers ::S ((n+ 1) is provable predicative1y. References [1] S. Feferman, Constructively provable well-orderings. Notices Amer. Math. Soc. 8 (1961) 495.
[2] S. Feferman, Provable Well-orderings of and Relations between Predicative and Ramified Analysis. Notices Amer. Math. Soc. 9 (1962) 323.
[3] S. Feferman, Systems of Predicative Analysis. Text of an invited address delivered to a meeting of the Association for Symbolic Logic at Berkeley, January 1963.
[4] G. Gentzen, Beweisbarkeit und Unbeweisbarkeit von Anfangsfällen der transfiniten Induktion in der reinen Zahlentheorie. Math. Annalen 119 (1943) 140-161.
[5] K. Schütte, Kennzeichnung von Ordnungszahlen durch rekursiv erklärte Funktionen. Math. Annalen 127 (1954) 15-32.
[6] K. Schütte, Beweistheorie (Berlin-Göttingen-Heidelberg 1960).
[7] K. Schütte, Eine Grenze für die Beweisbarkeit der transfiniten Induktion in der verzweigten Typenlogik. To appear in Archiv f. math. Logik und Grundlagenforschung.
[8] O. Veblen, Continuous Increasing Functions of Finite and Transfinite Ordinals. Transactions Amer. Math. Soc. 9 (1908) 280-292.
REMARKS ON MACHINES, SETS, AND THE DECISION PROBLEM 1)

HAO WANG
Harvard University, Cambridge, Mass., USA
1. Machines and production systems

1.1. The basic distinction between monogenic and polygenic systems corresponds to the contrast of calculations with proofs, functions with relations, and machines with production systems. In calculations, we generally have a fixed procedure such that the answer is completely determined by the question. In looking for a proof of a given statement in a given formal system, we have in general an unbounded number of choices at each stage since, for example, there are infinitely many p's such that p → q together with p would yield q. If there is a fixed number n such that at each node there are only n or fewer choices, then clearly we can get a monogenic system in the search for proofs. A monogenic proof procedure, such as the Herbrand expansion procedure for the predicate calculus, need not give a decision procedure. On the other hand, a monotone system, such that by some criterion the conclusion is always longer or more complex than the premisses, is always decidable when there are finitely many rules only. Thus, given a statement p, the total number of statements which can enter in a proof of p is finite since every rule has a fixed number of premisses. Hence, it is of interest to inquire when a polygenic system is equivalent to a monogenic one, and when either is equivalent to a monotone one.

1.2. A machine which halts on every finite input corresponds to a function from the input to the output. If, on the other hand, we allow,

1) Work for this paper was supported in part by NSF grant GP-228 and in part by Bell Telephone Laboratories, Inc., Murray Hill, New Jersey.
e.g., that the machine can do either of two things at each moment, then for each input we can get many outputs, and we get, in general, a relation Rxy such that y is an output of the machine for the input x. It seems somewhat unnatural to speak of a polygenic machine, but with a Post production system, the distinction between monogenic and polygenic is perfectly natural. In Turing machines, we are usually interested in tapes which are blank for all but a finite number of squares. The consecutive minimum portion containing all marked squares and the square presently under scan could be taken as the string of symbols in a production system. In that case, a machine corresponds to a monogenic production system except for the fact that the former has a scanned square at each moment and has different states.

DEFINITION 1: A labeled rewriting system is a finite set of rules P_i → Q_i such that in each P_i and Q_i exactly one symbol has an arrow above it (the label indicating the square under scan).

THEOREM 1: There is an effective method by which, given any Turing machine, we get a corresponding monogenic labeled rewriting system in which each P_i (also each Q_i) contains exactly two symbols, one of which is labeled.

To prove this, we use a Turing machine formulation such that in each state, a machine prints, shifts, and changes state according to the symbol newly under scan. In other words, if there are m states q_1, ..., q_m, n symbols S_1, ..., S_n, a machine is given by q_a S_i ±1 S_j q_b (a = 1, ..., m; i, j = 1, ..., n), so that if the machine is in state q_a scanning symbol S_i, it shifts right (+1) or left (−1) and then scans the next square, ending up in a state q_b determined by the newly scanned symbol S_j. It is not hard to verify that this formulation is equivalent to the usual one in the sense that they can simulate each other. With this formulation, we can always use an alphabet with (m + 1)n symbols and one state only. Thus, instead of the given state q_a and the symbol S_i, we have the symbol (a, i). This is changed to (0, i). After the shift, the scanned symbol is (0, j) which is now changed into (b, j). In other words, for c = 1, ..., m and d = 1, ..., n, (c, d) is a symbol indicating state c and symbol d, when the square is under scan; a symbol d
in other squares is represented by (0, d). This makes it easy to give a 1-state universal machine and yields a measure of the complexity of Turing machines solely by the size of the alphabet (using always 1 state only). This also gives Theorem 1 immediately, since the rules are simply of the forms (writing ↑ after the labeled symbol)

(a, i)↑ (0, j) → (0, i) (b, j)↑ for right shift,
(0, j) (a, i)↑ → (b, j)↑ (0, i) for left shift.
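A small generator may make the translation concrete. The encoding of the machine table below is our own assumption (a map from (state, scanned symbol) to a shift and a next-state function of the newly scanned symbol); the rule shapes follow the two displayed forms above.

```python
# Sketch: generate the labeled rewriting rules of Theorem 1 from a machine table.
# Our encoding: table[(a, i)] = (shift, next_state), shift in {+1, -1},
# next_state a function of the newly scanned symbol j.

def rewriting_rules(table, symbols):
    """Rules as (lhs, rhs) pairs; a labeled symbol is ('*', state, symbol),
    an unlabeled one is (0, symbol)."""
    rules = []
    for (a, i), (shift, next_state) in table.items():
        for j in symbols:
            b = next_state(j)
            if shift == +1:   # (a,i)* (0,j) -> (0,i) (b,j)*
                rules.append(((('*', a, i), (0, j)), ((0, i), ('*', b, j))))
            else:             # (0,j) (a,i)* -> (b,j)* (0,i)
                rules.append((((0, j), ('*', a, i)), (('*', b, j), (0, i))))
    return rules

if __name__ == "__main__":
    # Toy machine: in state 1 on symbol 1, move right and enter state 2
    # whatever symbol is seen next.
    table = {(1, 1): (+1, lambda j: 2)}
    for lhs, rhs in rewriting_rules(table, symbols=[1, 2]):
        print(lhs, "->", rhs)
```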
1.3. Multiple tapes naturally make it possible to simulate each m x nk one-tape machine by an (m, n, k) (m states, n symbols, k tapes) machine; but the full force is not used in the simulation and it is desirable to find more accurate measures than these. Recently, P. K. Hooper [7] proved: THEOREM 2: There is a (2,3,2) universal Turing machine; there is a (1,2,4) UTM, having afixed loop/or one a/its/our tapes.
In the realm of "real time computation," Michael Rabin has recently proved that there are calculations which can be performed by two tapes but not by one tape. The whole area of efficient calculations (as against theoretical computability) is wide open and promises much interesting work. Although there are various elegant formulations of Turing machines, they are still radically different from existing computers. To approach the latter, we should use fixed word lengths, random access addresses, accumulator, and permit internal modification of the programs. Alternatively, we could, for example, modify computers to allow more flexibility in word lengths. Too much energy has been spent on oversimplified models so that a theory of machines and a theory of computation which have extensive practical applications have not been born yet. 1 .4. There are a number of conceptually neat results on the theoretical side. We mention a few recent ones at random. The most elegant formulation of Turing machines is perhaps the SSmachines of Shepherdson and Sturgis [16]. An SS-machine is a finite sequence of instructions, each of which is of the following two types.
MACHINES, SETS, AND THE DECISION PROBLEM
Po, Pi: print
°
307
(or 1) at the right end of the string S and go to the next instruction.
SD(k): scan and delete the leftmost symbol of S; if it is 0, go to the next instruction, otherwise, go to instruction k; if S is null, halt.
They have proved: 3: Every Turing machine (in particular, a UTM) can be simulated by an SS-machine. THEOREM
It is particularly easy to simulate these machines by Post production
systems (see [22]).
1.5. A combinatorial system in the most general sense would be any finite set of rules, each of which effectively produces a finite set of conclusions from a finite set of premisses. The most intensively studied case is the one in which each rule has a single premiss and a single conclusion. Such a system is called monogenic if the rules are such that for any string at most one rule is applicable. From this broad class of monogenic systems, Post chooses to consider the tag systems. A tag system is determined by a finite set of rules

s_i → E_i, i = 1, ..., p,

such that if the first symbol of a string is s_i, then the first P symbols are removed and the string E_i is appended at the end. Since the system is monogenic, s_i ≠ s_j when i ≠ j. If the alphabet contains σ symbols, then p = σ. Another natural class is, for want of a better name, the lag systems. A lag system is a set of σ^P rules

s_{i_1} ... s_{i_P} → E_j,

such that if the first P symbols of a string are s_{i_1} ... s_{i_P}, the first symbol, viz. s_{i_1}, is deleted and E_j is appended at the end of the string. In either case, E_i may be the null string. If S_i is the length of E_i and S is the maximum among the S_i, then each system has a prefix number P and a suffix number S. In [11] and [12], Minsky has proved the following remarkable result:
THEOREM 4: There is a tag system with prefix number P = 2 and suffix number S = 4, whose halting problem is unsolvable.
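To make the mechanism vivid, here is a minimal Python simulation of a tag system; the code and the particular toy rules below are our own illustration, not Minsky's construction.

```python
# Sketch: simulating a tag system with prefix number P.
# rules maps a leading symbol to the string E_i appended at the end.

def run_tag(rules, P, word, max_steps=50):
    """Apply tag rules until the word is shorter than P or no rule applies."""
    history = [word]
    for _ in range(max_steps):
        if len(word) < P or word[0] not in rules:
            break                      # halting: too short, or no applicable rule
        word = word[P:] + rules[word[0]]
        history.append(word)
    return history

if __name__ == "__main__":
    # A toy 2-tag system on the alphabet {a, b, c}.
    rules = {"a": "bc", "b": "a", "c": "aaa"}
    for w in run_tag(rules, 2, "aaa", max_steps=10):
        print(w)
```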
This is improved slightly in [22] to get the suffix number down to S = 3, and then the result is shown to be best possible because every tag system with P = 1 or P ≥ S is always decidable (i.e., both its halting problem and its derivability problem). More recently, Cocke and Minsky gave an improved proof of Theorem 4, from which the simplification to S = 3 follows directly. In these considerations, attempts to use the SS-machines have not been possible. A similar result for lag systems is proved in [22] by using SS-machines:

THEOREM 5: There is a lag system with P = S = 2, whose halting problem is unsolvable; moreover, when P = 1 or S ≤ 1, every lag system is decidable.

The tag systems are a subset of Post's monogenic normal systems, each of which has rules of the form B_i Q → Q E_i, such that a given string B_i Q becomes Q E_i by the rule. It is quite easy to use SS-machines to get a normal system with P = S = 2 (P the maximum of the lengths of the B_i) whose halting problem is unsolvable (see [22]). A specially interesting subcase of the normal systems is the 1-normal systems in which B_i is always a single symbol. The 1-normal systems include all tag and lag systems with P = 1. It is obvious from [22] that the halting problem for every 1-normal system is decidable. S. Cook and S. Greibach have strengthened the result, with two radically different proofs, to get also:

THEOREM 6: The derivability problem (i.e., whether one string is deducible from another) of every 1-normal system is decidable.
MACHINES, SETS, AND THE DECISION PROBLEM
309
THEOREM 7: If T ranges over nonerasing T.M., W ranges over words in their history, I ranges over (finite) inputs, then the relationP(W, T,I) (i.e., W belongs to the history of T with input J) is recursive; on the other hand, for a fixed initial (finite) input, we can find a T.M. with erasing permitted such that the set of words in its history is not recursive.
2. The decision problem and its reduction problem
2. 1. In this part, we consider recent results on the decision and reduction problems of the (restricted) predicate calculus. Since all mathematical theories can be formulated within the framework of the predicate calculus (quantification theory, elementary logic), Hilbert spoke of the decision problem when he was referring to the problem of finding a general algorithm to decide, for each given formula of the predicate calculus, whether it is satisfiable in some nonempty domain (or, has a model). He called this the main problem of mathematicallogic. It is familiar today that this problem in its general form is unsolvable in a technical sense which is widely accepted as implying unsolvability according to the intuitive meaning. An interesting problem is to investigate the limits of decidable subdomains and the underlying reasons for the phenomenon of undecidability. Recently, the general problem has been reduced to the formally simple case of formulas of the form AxEx'AyMxx'y, where Misquantifier-free and contains neither the equality sign nor function symbols. In fact, one can further restrict the class to those AEA formulas in which all predicates are dyadic, and each dyadic predicate G j occurs only in some of the nine possible forms Gsxx, Gjxx', Gjx'x, Gjx'x', Gjyy, Gjxy, Gjyx, Gjx'y, Gsyx', The following is proved in [9]. THEOREM 8: Any AEA class including all formulas which contain only atomic formulas in three of the four forms (xy, yx, x'y, yx') is undecidable; the class of all AEAformulas of the form WXXA U(xy,x'y) A V(yx,Yx'), that ofthe form U (xy, x'y) A V(xy, yx), that ofthe form Uiyx.yx') A V(xy, yx), are all undecidable, where W, U, Vare truth-functional expressions. Moreover, all these classes are reduction classes.
This completely settles the question of decidable and undecidable prefix subclasses of the predicate calculus. This is true even if we allow
310
HAO WANG
formulas in the extended prenex forms, i.e., formulas which are conjunctions of formulas in the prenex normal form. (Compare [9] and [21]). THEOREM 9: An extended prefix form class is a reduction type (and undecidable) if and only if either the prefix of at least one conjunct contains AEA or AAAE as an (order-preserving but not necessarily consecutive) substring, or there are two conjuncts of which the prefixes contain AAA and AE respectively. Moreover, it is decidable if and only if it contains no axioms of infinity. i.e., formulas which have only infinite models. 2.2. In [21], a simpler alternative proof of Theorem 8 is given which has two additional properties: (a) only a small fixed finite number of dyadic predicates are needed, together with arbitrarily many monadic predicates; (b) finite models are preserved in the reduction procedure so that a formula has a finite model if and only if its corresponding AEA formula has a finite model. DEFINITION 2: Consider classes of formulas of the predicate calculus. For any class X, let N(X), I(X), F(X) be the subclasses ofXwhichcontain all formulas in X which have respectively no model, only infinite models, finite models. If R is a reduction procedure which reduces a given class Y to y* and every subclass Z of Y to Z*, then R is said to be a conservative reduction procedure for Y, if (F(Y»* = F(Y*). The following two theorems are proved in [21]: THEOREM 10: If K is the class of all formulas of the predicate calculus and R is a conservative reduction procedure for K, then no two of the three classes N(K*), I(K*), F(K*) are recursively separable. THEOREM 11: If Z is the class of AEA formulas (or some suitable subclass of this, such as A 1 given below), then no two of the three classes N(Z), I(Z), F(Z) are recursively separable. In another direction, Kahr (see [8]) extends Theorem 8 to the following: THEOREM 12: A reduction class for the predicate calculus is the set A 1 offormulas with prefix AEA such that each formula of the set contains only monadic predicates and a single dyadic predicate.
This proof can be modified as in [21] to get Theorem 11 for A1 and to give a corresponding result for the prefix AAA ∧ AE, and therewith an alternative proof of Suranyi's similar result [18] for the more complex prefix AAA ∧ AAE.

2.3. In studying the AEA case, "dominoes" were first introduced in [20], and they have been found useful for the study. They are also of some independent interest and are reviewed here mainly for the remaining open problems. We assume there are infinitely many square plates (the domino types) of the same size (say, all of unit area) with edges colored, one color on each edge, though different edges may have the same color. The type of a domino is determined by the colors on its edges, and we are not permitted to rotate or reflect any domino. There are infinitely many pieces of every type. The game is simply to take a finite set of types and try to cover the whole first quadrant of the infinite plane with dominoes of these types so that all corners fall on the lattice points and any two adjoining edges have the same color.

DEFINITION 3: A (finite) set of domino types is said to be solvable if and only if there is some way of covering the whole first quadrant by dominoes of these types.

It is natural to use ordinary Cartesian coordinates and identify each unit square with the point at its lower left-hand corner. Then we can speak of the origin (0, 0), the main diagonal x = y, etc. The following general questions on these games have been considered:
DEFINITION 4: The (unrestricted) domino problem. To find an algorithm to decide, for any given (finite) set of domino types, whether it is solvable. The origin- (diagonal-, row-, column-) constrained domino problem. To decide, for any given set P of domino types and a subset Q thereof, whether P has a solution with the origin (the main diagonal, the first row, the first column) occupied by dominoes of types in Q.

THEOREM 13: All the constrained domino problems are unsolvable (see [9] and [21]).
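The game is concrete enough to experiment with. The sketch below is only an illustration (the representation and the fixed grid size are our own devices, not taken from [20] or [21]): it codes a domino type by its four edge colors and searches by backtracking for a tiling of an n-by-n square, optionally requiring the origin to carry a type from a distinguished subset Q, as in the constrained problems just defined. By the usual compactness (König's lemma) argument, a set is solvable in the sense of Definition 3 exactly when every such finite square can be tiled, which is what makes finite searches of this kind relevant.

```python
from typing import NamedTuple

class Tile(NamedTuple):
    # Edge colors of a unit square; rotation and reflection are not allowed.
    top: str
    right: str
    bottom: str
    left: str

def tile_square(types, n, origin_types=None):
    """Try to tile an n-by-n square with the given domino types so that
    adjoining edges carry the same color; optionally constrain the type
    placed at the origin (0, 0).  Returns {(x, y): Tile} or None."""
    grid = {}

    def candidates(x, y):
        allowed = origin_types if (x, y) == (0, 0) and origin_types else types
        for t in allowed:
            if x > 0 and grid[(x - 1, y)].right != t.left:
                continue            # must match the tile to the left
            if y > 0 and grid[(x, y - 1)].top != t.bottom:
                continue            # must match the tile below
            yield t

    def fill(k):                    # fill cells row by row, backtracking
        if k == n * n:
            return True
        x, y = k % n, k // n
        for t in candidates(x, y):
            grid[(x, y)] = t
            if fill(k + 1):
                return True
            del grid[(x, y)]
        return False

    return dict(grid) if fill(0) else None

# Example: a single type with all edges the same color tiles any square.
plain = Tile("r", "r", "r", "r")
assert tile_square([plain], 4) is not None
```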
The unrestricted domino problem remains open. In fact, as discussed in [20], there are two related open questions.1) Problem 1. Is the unrestricted domino problem solvable? Problem 2. Does every solvable domino set have a periodic solution? A positive solution of the second problem would yield also a positive solution of the first problem, but not conversely.

1) Recently (May 1964) Robert Berger has settled both questions in the negative.

The unrestricted domino problem is related to a special subclass of the AEA formulas with dyadic predicates only, viz., those of the form
U(G1xy, ..., GKxy; G1x'y, ..., GKx'y) ∧ V(G1yx, ..., GKyx; G1yx', ..., GKyx'),

or briefly,

(1)  U(xy, x'y) ∧ V(yx, yx'),
where U and V are truth-functional combinations of the components.

THEOREM 14: Given a domino set P, we can find a formula F_P of the form (1) such that P has a solution if and only if F_P has a model; conversely, given a formula F of the form (1), we can find a domino set P_F such that F has a model if and only if P_F has a solution. Hence, the unrestricted domino problem is undecidable if and only if the decision problem of the class of all formulas of the form (1) is unsolvable. (See [21].)
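For one direction of Theorem 14, the essential bookkeeping is that, once Gi(x, y) is read as "the square at (x, y) carries domino type i", the matrix of (1) only has to say which types may sit next to each other. The sketch below (our own naming, not the construction of [21]) computes those two relations; U(xy, x'y) can then assert that exactly one Gi holds at each point and that the pair of types at horizontally adjacent points lies in horiz, while V(yx, yx'), with its arguments swapped, does the same for vertical neighbours and vert.

```python
from collections import namedtuple

# Same representation as in the tiling sketch above: a type is given by its
# four edge colors, and rotation or reflection is not allowed.
Tile = namedtuple("Tile", "top right bottom left")

def compatibility(types):
    """Return (horiz, vert): the pairs of type indices (i, j) such that a
    j-domino may stand immediately to the right of (respectively, above)
    an i-domino.  These are exactly the data a formula of form (1) needs."""
    horiz = {(i, j) for i, s in enumerate(types) for j, t in enumerate(types)
             if s.right == t.left}
    vert = {(i, j) for i, s in enumerate(types) for j, t in enumerate(types)
            if s.top == t.bottom}
    return horiz, vert

# Example: with one self-matching type, every pair is compatible.
plain = Tile("r", "r", "r", "r")
assert compatibility([plain]) == ({(0, 0)}, {(0, 0)})
```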
2.4. Results on the degree of complexity of AEA formulas are announced in the preliminary report [10]. The whole paper has not been completed because of the unwieldy construction of the simulation. An outline, with proofs of the less combinatorial part, is reproduced here. The method of simulating Turing machines by domino sets with diagonal constraints, as developed in [9], can be extended to obtain a simulation of each Turing machine X, with all its numerical inputs, by a single domino set P_X such that, when X is viewed as a function from inputs to outputs, every diagonal-constrained solution of P_X satisfies the condition: if X(n) = 1, then K occurs at the point (α(n), α(n)); and if X(n) = 0, then K does not occur at (α(n), α(n)), where K is a domino type and α is a fixed monotone increasing recursive function. Expressing the solvability condition for P_X by an AEA formula, we can establish the following:
LEMMA 1: For every Turing machine X, there is an AEA formula F_X ≡ (x)(Eu)(y)Jxuy which contains a monadic predicate M, such that every model of F_X in the domain of natural numbers has the property that the model M* of M separates the sets {n : X(n) = 0} and {n : X(n) = 1}, and any such set can be used as M*, with models of other predicates being recursive.
More specifically, identify M*(α(n)) with K*(α(n), α(n)), and, for all k such that α(n) = k for no n, let M*(k) be true. In this way, we shall be able to choose an M* which is recursive in {n : X(n) = 0}. We shall leave the proof of the lemma out and discuss what consequences we can derive from it. In the intended model, all other predicates of F_X are recursive. We use the fact that if F has a model, then Jxx'y, x' being short for x+1, has a model in the domain of natural numbers. It can be shown that the formula has no finite models. A nonstandard model must also contain all the natural numbers. This seems sufficient for showing that any RE (recursively enumerable) predicate A is recursive in every model of M, when {x : X(x) = 0} and {x : X(x) = 1} are suitably chosen (see below). "Recursive in" is defined for natural numbers, but if M* also includes other objects, we seem to require a generalization of the concept, which can be done in the natural manner. In any case, it is true that A is recursive in M*, because A is recursive in the standard part of M* already. Further, the restriction to RE models also requires a definition for M* to be RE; one possibility is that its standard part is RE. It may be pointed out, incidentally, that if we require that F_X have a unique model relative to the domain of natural numbers and the successor function, then all the predicates must have recursive models, by the infinity lemma. Alternatively, we may also wish to relativize the definition of the given RE predicate A. Then we have to define A by a quantificational schema. Hence, we have to begin with all possible models. Another way of proceeding is to confine our attention to models in the domain of natural numbers, since otherwise recursive and RE are not defined. This last alternative seems the most natural way. In other words, we are only concerned with models of F_X in which the domain is the set of natural numbers and the existential quantifier
is replaced by the successor function. This is not regarded as a weakened condition, because otherwise we cannot talk about recursive and RE models. This is indeed the practice followed by earlier authors.

LEMMA 2: If A is RE, then there are disjoint RE sets B, C, which are ≤_T A, i.e., recursive in A, such that if an RE set D separates B and C, i.e., B ⊆ D, C ⊆ D̄, then A ≤_T D, i.e., A is recursive in D; in particular, A ≤_T B, and, hence, A ≡_T B.
This follows from the proof (though not the statement) of Theorem 1 in [17]. Observe that, unlike the case of recursive separability, we cannot infer from the existence of an RE set D separating B and C that there is an RE set E separating C and B, i.e., C ⊆ E and B ⊆ Ē, since D̄ is not RE unless D is recursive. The condition that D is an RE set is essential. Thus, if A is of degree 0', then D must be of degree 0' too. In an unpublished work, Dana Scott shows that there is a degree d <_T 0' such that any two disjoint RE sets are separable by a set (not necessarily RE) of degree ≤_T d.
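The sets B and C of Lemma 2 are easy to experiment with once Kleene's T-predicate is replaced by a finite stand-in. The sketch below is only an illustration of the construction in the proof of Lemma 2 given below, not part of it: toy "machines" are finite tables assigning to some inputs a unique halting step, (x)0 and (x)1 are read off as the exponents of 2 and 3 in x, and the two assertions check, over a finite range, the two facts the proof uses, namely that B and C are disjoint and that x lies in B ∪ C exactly when (x)0 lies in A.

```python
# Kleene's T is replaced by a finite table: STEPS[e] maps an input to the
# unique halting "time" of toy machine e; inputs absent from the table diverge.
STEPS = {
    0: {1: 3, 2: 5, 4: 2},          # machine 0: its domain plays the role of A
    1: {n: 1 for n in range(600)},  # a machine that halts quickly on everything
    2: {},                          # a machine that never halts
}
F = 0                               # the index f with: x in A iff (Ey)T(f, x, y)
BOUND = 50                          # exceeds every halting time in the table

def T(e, x, y):
    """Stand-in for Kleene's T: toy machine e on input x halts at step y."""
    return STEPS.get(e, {}).get(x) == y

def expo(x, p):
    """(x)_i read off as the exponent of the prime p in x."""
    k = 0
    while x % p == 0:
        x //= p
        k += 1
    return k

def in_A(w):
    return any(T(F, w, y) for y in range(BOUND))

def in_B(x):
    return any(T(F, expo(x, 2), y) and
               not any(T(expo(x, 3), x, z) for z in range(y + 1))
               for y in range(BOUND))

def in_C(x):
    return any(T(F, expo(x, 2), y) and
               any(T(expo(x, 3), x, z) for z in range(y + 1))
               for y in range(BOUND))

# The two facts used in the proof: B and C are disjoint, and
# x is in B or in C exactly when (x)_0 is in A.
for x in range(1, 500):
    assert not (in_B(x) and in_C(x))
    assert (in_B(x) or in_C(x)) == in_A(expo(x, 2))
```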
It appears likely that if we do not want the stronger result with the restriction to AEA formulas, we can combine Lemma 2 with familiar considerations to get a weaker form of Theorem 15. Thus, for example, we can write a more complex formula characterizing the machine X in Lemma 1.

PROOF of Lemma 2. By definition of RE, there exists g:

x ∈ A ≡ (Eu)(Ey)[x = U(y) ∧ T(g, u, y)] ≡ (Eu)(Ey)R(x, u, y).
Hence, there is f: x ∈ A ≡ (Ey)T(f, x, y). Let

x ∈ B ≡ (Ey)[T(f, (x)0, y) ∧ ¬(Ez)z≤y T((x)1, x, z)],
x ∈ C ≡ (Ey)[T(f, (x)0, y) ∧ (Ez)z≤y T((x)1, x, z)].
Clearly B and C are disjoint. Since

[(Ey)(Fy ∧ ¬Gy) ∨ (Ey)(Fy ∧ Gy)] ≡ (Ey)Fy,

(x ∈ B ∨ x ∈ C) ≡ (x)0 ∈ A. It is easy to see that B and C are recursive in A. Thus, if (x)0 ∉ A, then x ∉ B, x ∉ C. If (x)0 ∈ A, we can determine the unique y such that T(f, (x)0, y). Hence,

x ∈ B ≡ ¬(Ez)z≤y T((x)1, x, z),
x ∈ C ≡ (Ez)z≤y T((x)1, x, z).
Suppose now B ⊆ D, C ⊆ D̄, and D is RE. Choose e so that D = {x : (Ey)T(e, x, y)}. To determine whether w ∈ A, we ask just whether x = 2^w·3^e belongs to D and, if it does, whether (Ey)[y < μz T(e, x, z) ∧ T(f, w, y)].

Case 1. x ∉ D. We have then ¬(Ey)T(e, x, y). Hence, w ∉ A, because otherwise, if w ∈ A, then x ∈ B. But x ∈ B implies x ∈ D, contrary to hypothesis.

Case 2. x ∈ D. We can then find the unique z such that T((x)1, x, z). Since x ∈ D implies x ∉ C,

w ∈ A ≡ x ∈ B ∪ C ≡ x ∈ B.

But then w ∈ A ≡ (Ey)[y < z ∧ T(f, w, y)], because otherwise, i.e., if z ≤ y, then the second half of the condition for x ∈ B cannot be satisfied. Since B can serve as a D, A ≡_T B.

2.5. Since the class U of AEA formulas with dyadic predicates only is unsolvable and a reduction type, it is of interest to consider what
subclasses are solvable. The following is proved in [5] and a more "geometrical" alternative proof is given in [21]. Consider the four forms xy, yx, x'y, yx'. First take any three of them. From Theorem 8 above we know that any subclass of U which includes all formulas whose atomic formulas are in just these three forms is a reduction class and hence is undecidable. Now take any two of the four forms. Combining them with the other five forms yields a subclass of U. In this way we obtain six subclasses of U which divide into three pairs:
J = {xy, x'y},   J* = {yx, yx'},
L = {xy, yx},    L* = {x'y, yx'},
Q = {xy, yx'},   Q* = {yx, x'y}.
THEOREM 16: With the exception of subsets of Q and those of Q*, a class, determined by the forms of atomic formulas occurring, is decidable if and only if it contains at most two of the four forms xy, yx, x'y, yx'; it contains an axiom of infinity if and only if it contains three forms including either xy and x'y, or yx and yx'.
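Read as an algorithm on the letter forms occurring, Theorem 16 is a finite case check. The function below is our own direct transcription of its two criteria; it leaves the subsets of Q and Q* excepted by the theorem (and taken up in Problem 3 below) undetermined.

```python
FOUR = {"xy", "yx", "x'y", "yx'"}
Q, Q_STAR = {"xy", "yx'"}, {"yx", "x'y"}

def classify(forms):
    """Apply the criteria of Theorem 16 to the set of forms (among the four
    above) in which the dyadic predicates occur; further forms such as xx
    or yy do not enter the criteria.  Returns the pair
    (decidable?, contains an axiom of infinity?),
    or None for the subsets of Q and Q* excepted by the theorem."""
    f = set(forms) & FOUR
    if f <= Q or f <= Q_STAR:
        return None
    decidable = len(f) <= 2
    infinity = len(f) >= 3 and ({"xy", "x'y"} <= f or {"yx", "yx'"} <= f)
    return decidable, infinity

# Per the theorem, the three forms xy, x'y, yx give an undecidable class
# containing an axiom of infinity.
assert classify({"xy", "x'y", "yx"}) == (False, True)
```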
Problem 3. Is the class of AEA formulas with dyadic predicates only which occur only in contexts with xy, yx', xx, xx', x'x, x'x', yy decidable? Is the class decidable when dyadic predicates occur only in the contexts with xy, yx', xx?

Problem 4. Does either case contain an axiom of infinity, i.e., a formula that has only infinite models? If the answer to the latter question is no for either class, then, by familiar arguments, that class is solvable.

2.6. Suranyi has applied his reduction classes with the prefix AAA ∧ AAE to obtain reduction classes with more complex prefixes but fewer predicates. Denton has undertaken to study similar consequences of the AEA reduction:

THEOREM 17: The following classes are reduction classes: (a) Ey1 ... Eyn AxEyAz M (n = 1, 2, ...) with only one predicate, which is dyadic; (b) AxEy(Pxy ∧ (Pxx ≢ Pyy)) ∧ Az1 ... Azn M (and therewith AxEyAz1 ... Azn M) with only the predicate P.
Part (a) follows from Theorem 12 in exactly the same way as Suranyi's Theorem IV follows from his reduction class with prefix AAA ∧ AAE. Part (b) was announced in [3] and afterwards also proved by another argument using only Theorem 8. In [6], Gegalkine claims that the class of formulas AxEyFxy ∧ Az1 ... Azn M, with M containing any number of monadic and dyadic predicates, is decidable for finite satisfiability. Denton shows in [4] that this would contradict Theorem 11 for A1, and singles out the mistake in Gegalkine's paper. Further, he proves, by extending Ackermann's work [1], the following:

THEOREM 18: The class AxEyPxy ∧ Az1 ... Azn M, where M contains only the dyadic P and monadic predicates, is decidable for finite satisfiability.
Problem 5. Is the class AxEy1 ... Eyn Az M, with only a single predicate (dyadic), a reduction class?

Problem 6. Is the class in Theorem 18 (or even without the monadic predicates) a reduction class?
3. Sets

3.1. The basic axioms of the system ZF of set theory are extensionality, infinity (unconditional existence), and four axioms of conditional existence: (a) pairs, (b) sum set, (c) power set, (d) replacement (a schema). It is noted in [23] that these axioms (a)-(d) are equivalent to a single axiom (schema): if A is a one-many correlation, x is a set, and A″t = {u : (Ev)(v ∈ t ∧ Auv)}, then there is a set y, y = ΣA″πx (Σ is sum set, π is power set). Thus, if Auv is v = {u}, then Σx = ΣA″πx. If Auv is u = {v}, then πx = ΣA″πx. If Auv is (Ez)(Ew)(Gzw ∧ u = {z} ∧ v = {w}), then G″x = ΣA″πx. If 0 is (u = v ∧ v ≠ v)″x, and Guv is (u = a ∧ v = 0) ∨ (u = b ∧ v = {0}), then {a, b} = G″ππ0. The part about {a, b} is familiar from the literature. Previously, Bernays ([2], p. 65) had employed a similar schema y = ΣA″x to get (b) and (d). Ono ([13]) had introduced a schema which would yield (a), (b), (c), (d) if we add another axiom: (x)(Ey)(y = {x}). As it turns out, in a less explicit way, the same axiom is also needed for the result stated in [23], although Ono's schema is different.
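Because the instances of the schema above involve only Σ, π, and images, they can be checked mechanically on hereditarily finite sets. The sketch below is an illustration only (the schema itself concerns arbitrary sets, and the finite "universe" supplied to the image operation is our own device); it verifies the three identities Σx = ΣA″πx, πx = ΣA″πx, and {a, b} = G″ππ0 on a small example.

```python
from itertools import combinations

def sumset(x):                        # Σx
    return frozenset(u for v in x for u in v)

def powerset(x):                      # πx
    xs = list(x)
    return frozenset(frozenset(c) for r in range(len(xs) + 1)
                     for c in combinations(xs, r))

def image(A, t, universe):            # A″t = {u : (Ev)(v ∈ t ∧ Auv)}
    return frozenset(u for u in universe for v in t if A(u, v))

O = frozenset()                                         # 0
one, two = frozenset({O}), frozenset({O, frozenset({O})})
x = frozenset({one, two})                               # a small sample set

# If Auv is v = {u}, then ΣA″πx = Σx.  (Candidate u's are drawn from x ∪ Σx.)
A1 = lambda u, v: v == frozenset({u})
assert sumset(image(A1, powerset(x), x | sumset(x))) == sumset(x)

# If Auv is u = {v}, then ΣA″πx = πx.  (Candidate u's are the singletons {v}.)
A2 = lambda u, v: u == frozenset({v})
singletons = frozenset(frozenset({v}) for v in powerset(x))
assert sumset(image(A2, powerset(x), singletons)) == powerset(x)

# If Guv is (u = a ∧ v = 0) ∨ (u = b ∧ v = {0}), then G″ππ0 = {a, b}.
a, b = one, two
G = lambda u, v: (u == a and v == O) or (u == b and v == frozenset({O}))
assert image(G, powerset(powerset(O)), {a, b}) == frozenset({a, b})
```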
3.2. A different procedure is followed in [24] to get a schema which would yield all the axioms, including infinity and extensionality. A partial hull is a transitive set closed with respect to power sets, i.e., PH(x) if (1) Σx ⊆ x, (2) y ∈ x → πy ∈ x. A natural hull is closed also with respect to sum sets, i.e., NH(x) if PH(x) and (3) y ∈ x → Σy ∈ x. The natural hull ηa of a set a (or its partial hull) is the intersection of all natural (partial) hulls x such that (4) a ⊆ x.

THEOREM 19: In ZF, ηa can be shown to exist for each a, and to satisfy the conditions (1)-(4), as well as that of being the minimum; similarly for the partial hull of a with the conditions (1), (2), (4).
Let SE be obtained from the usual axiom of replacement by substituting ηx, or the partial hull of x, for the given set x, viz., the schema: If Huv is many-one, then (Ey)(y = H″ηx) (or the same with the partial hull of x in place of ηx). Let VE be obtained from SE by adding uniqueness, i.e., by substituting (E!y) for (Ey). Both SE and VE can be expressed in the primitive notation of ZF.

THEOREM 20: In the predicate calculus with equality, SE yields all existence axioms of ZF; VE is equivalent to all these axioms plus extensionality.
For some purposes, it is also useful to define other closures; e.g., ca, the transitive closure of a, would be the smallest x such that a ⊆ x and Σx ⊆ x.

3.3. The usual definitions of the class On of the Zermelo-Neumann ordinals do not reveal the intuitive picture of how the ordinals are obtained successively. It is possible to use a "genetic" definition roughly in the tradition of Frege and Dedekind. This approach has the advantage that we can also use different successor functions, e.g., x' = πx rather than x' = x ∪ {x}. Such different successor functions are useful, e.g., in studying the natural models of von Neumann and Bernays. Two definitions of the Zermelo-Neumann ordinals are found to be adequate:
D1. On1(x) when x belongs to every set u such that (1) v' ∈ u, if v ∈ u and v' ∈ x'; and (2) Σw ∈ u, if w ⊆ u and Σw ∈ x'.

D1*. On2(x) when, for every set u, there is w with w ∈ u and w = 0 (that is, 0 ∈ u), provided that x ∈ u, that for all v, v ∈ u if v' ∈ u, and that u ∩ w ≠ 0 whenever Σw ∈ u and Σw ⊆ x.
In fact, a general theorem on these ordinals is proved:

THEOREM 21: If for a predicate On(x) we can prove that, for every F, Fy if (1) On(y), (2) (v)(Fv → Fv'), and (3) (w)(w ⊆ F → F(Σw)), then every x which satisfies On(x) is a genuine ordinal. Hence, if On(x) is a property known to hold for all ordinals, then the definition is adequate.

When specialized to finite ordinals, we get:

DF. Nn1(x) when x belongs to every set u such that (1) 0 ∈ u if 0 ∈ x'; (2) v' ∈ u, if v ∈ u and v' ∈ x'.

DF*. Nn2(x) when for every u, 0 ∈ u if (1) x ∈ u, (2) v ∈ u if v' ∈ u.

These are also adequate definitions. In fact, DF* is one which had previously been studied by W. V. Quine and K. R. Brown. In all these developments, weak axioms are enough, viz., extensionality, Aussonderung, and self-adjunction (x)(Ey)(y = x ∪ {x}). To get also recursive definitions or transfinite recursions, some strengthening of the axioms in the standard manner is necessary.

References

[1] W. Ackermann, Beiträge zum Entscheidungsproblem der mathematischen Logik. Math. Annalen 112 (1936) 418-432.
[2] P. Bernays and A. Fraenkel, Axiomatic Set Theory (Amsterdam 1958).
[3] J. S. Denton, A Reduction Class with a Single Dyadic Predicate. Notices AMS 10 (1963) 124-125.
[4] J. S. Denton, A False Decision Procedure for the Halting Problem. Notices AMS 10 (1963) 125.
[5] B. Dreben, A. S. Kahr, and Hao Wang, Classification of AEA Formulas by Letter Atoms. Bulletin AMS 68 (1962) 528-532.
[6] I. Gegalkine, Problema razresimosti na konecnyh klassah. Ucenye Zapiski Mosk. Gos. Univ. 100 (1946) 155-212.
[7] P. K. Hooper, Some Small Multi-tape Universal Turing Machines. Notices AMS 10 (1963) 584.
[8] A. S. Kahr, Improved Reductions of the Entscheidungsproblem to Subclasses of AEA Formulas. Symposium on the Mathematical Theory of Machines, Brooklyn Polytechnic Institute (April 1962); Proceedings (New York 1963) 57-70.
[9] A. S. Kahr, Edward F. Moore, and Hao Wang, Entscheidungsproblem Reduced to the AEA Case. Proc. Nat. Acad. Sci. U.S.A. 48 (1962) 365-377.
[10] A. S. Kahr and Hao Wang, Degrees of RE Models of AEA Formulas. Notices AMS 10 (1963) 192-193.
[11] M. L. Minsky, Recursive Unsolvability of Post's Problem of Tag. Annals of Mathematics 74 (1961) 437-455.
[12] M. L. Minsky, Universality of (p = 2) Tag Systems. A.I. Memo No. 33 (Cambridge, Mass. 1962).
[13] K. Ono, A Set Theory Founded on Unique Generating Principle. Nagoya Mathematical Journal 12 (1957) 151-159.
[14] E. L. Post, Formal Reductions of the General Combinatorial Decision Problem. American Journal of Mathematics 65 (1943) 197-215.
[15] M. O. Rabin and Hao Wang, Words in the History of a Turing Machine with a Fixed Input. Journal ACM 10 (1963) 526-527.
[16] J. C. Shepherdson and H. E. Sturgis, Computability of Recursive Functions. Journal ACM 10 (1963) 217-255.
[17] J. R. Shoenfield, Degrees of Formal Systems. Journal of Symbolic Logic 23 (1958) 389-392.
[18] J. Suranyi, Reduktionstheorie des Entscheidungsproblems (Budapest 1959).
[19] Hao Wang, A Variant to Turing's Theory of Computing Machines. Journal ACM 4 (1957) 63-92.
[20] Hao Wang, Proving Theorems by Pattern Recognition, II. Bell System Technical Journal 40 (1961) 1-41.
[21] Hao Wang, Dominoes and the AEA Case of the Decision Problem. Symposium on the Mathematical Theory of Machines, Brooklyn Polytechnic Institute (April 1962); Proceedings (New York 1963) 23-55.
[22] Hao Wang, Tag Systems and Lag Systems. Math. Annalen (1963) 65-74.
[23] Hao Wang, A Universal Axiom of Conditional Set Existence. Notices AMS 10 (1963) 588.
[24] Hao Wang, Natural Hulls and Set Existence. Notices AMS 10 (1963) 594.