Refinement between tuple types and set types agrees with the usual subtyping constraints. As regards classes, a class refines a primitive one only if an ISA clause is explicitly stated, while, for defined classes, refinement depends on both refinement between their tuple types and inclusion between their primitive superclasses. The following theorem, based on refinement, characterizes subsumption syntactically.
Theorem 11. Given classes C1 and C2, C2 subsumes C1 (SUBS(C2,C1), or C1 ⊑ C2) if and only if C1 refines C2 (i.e., C1 ≤ C2).
Proof. The proof that the theorem agrees with the subsumption definition given above (i.e., that it is sound and complete) is in Appendix B. Thanks to the formalism introduced so far (i.e., syntax + semantics + subsumption), we obtain an object-oriented data model based on classification, allowing the self-organization of the classes into the subsumption hierarchy. In other words, all the subsumption relationships are computed by the system, independently of user-given ISA links. With reference to the schema shown in Appendix A, a classification algorithm finds the following subsumption relationships:
RschProject ⊑ Project, PrjLeader ⊑ Employee, Secretary ⊑ Employee,
PrjManager ⊑ Employee, Attending-Student ⊑ Student
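The classification step can be sketched as follows. This is an illustrative reconstruction, not the paper's algorithm: `subsumes(c2, c1)` is assumed to be the syntactic test of Theorem 11, supplied from outside, and the function merely derives the direct links of the hierarchy from the pairwise relation.

```python
# Hypothetical sketch: derive the subsumption hierarchy from a pairwise test.
# `subsumes(c2, c1)` is assumed to implement the syntactic check of Theorem 11.

def classify(classes, subsumes):
    """Return the direct (transitively reduced) subsumption links."""
    # All subsumption pairs: (sub, super) with sub != super.
    pairs = {(c1, c2) for c1 in classes for c2 in classes
             if c1 != c2 and subsumes(c2, c1)}
    # Keep only direct links: drop (a, c) if some b gives (a, b) and (b, c).
    direct = {(a, c) for (a, c) in pairs
              if not any((a, b) in pairs and (b, c) in pairs
                         for b in classes if b not in (a, c))}
    return direct

# Toy run with a known subsumption relation encoded as a set of pairs.
known = {("RschProject", "Project"), ("PrjLeader", "Employee"),
         ("Secretary", "Employee"), ("PrjManager", "Employee"),
         ("Attending-Student", "Student"), ("Employee", "Person"),
         ("PrjLeader", "Person")}
classes = {c for pair in known for c in pair}
links = classify(classes, lambda c2, c1: (c1, c2) in known)
```

Note how the indirect link PrjLeader ⊑ Person is dropped from the direct hierarchy, since it is implied by PrjLeader ⊑ Employee and Employee ⊑ Person.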
6 Syntactic Constraints for Coherence
In this section we characterize the notion of coherent type from a syntactic point of view by defining correct type descriptions. Furthermore, as regards multiple inheritance, some syntactic constraints are added to a class description using the ISA clause in order for the type of the subclass elements to be compatible with the superclass type [3]. The following definition gives rise to an algorithm that looks for correct type descriptions, which can be shown to be consistent with respect to the language semantics (see sect. 3).

Definition 12 Correct type description.
1. t = EmptySet or t = ⊤: the description of t is correct.
2. t = Bi, with Bi ∈ B: the description of t is correct.
3. t = [l1:t1,...,ln:tn]: the description of t is correct if li ≠ lj ∀i,j = 1,...,n with i ≠ j, and each ti is a correct type description.
4. t = {t'}min,max: the description of t is correct if t' is a correct type description, min ≤ max and max > 0.
5. type T = t': the description of T is correct if t' is a correct set or tuple type description.
6. class C <prim-def> ISA C1...Cn <tuple-type>: the description of class C is correct if <tuple-type> is a correct type description. Furthermore, for labels appearing both in some Ci belonging to the ISA clause and in <tuple-type>, the type associated with such labels in <tuple-type> must be a refinement of the type associated with the homologous labels in Ci. Moreover, if the same label appears in more than one Ci with different types, then it must explicitly appear in <tuple-type> with a type that is a refinement of all the previous ones.
The following theorem shows the soundness of the above definition with respect to the definition of incoherent type (see sect. 5).
Theorem 13 Soundness of correct type description. Given a schema S, let T be the set of type descriptors and t a correct type description in T; then there exist a domain Σ and an interpretation function I defined over V such that I(t) ≠ ∅.
For example, the definition of RschLeader is correct because the prj label appears in both PrjLeader and the tuple type associated with RschLeader, and {RschProject}1,2 ≤ {Project}1,3. The definition of Supervisor shows an example of multiple inheritance: the budget label is present in Supervisor with a type that refines both types appearing in PrjLeader and PrjManager, respectively. We can now characterize syntactically the TYP function, with respect to both the definition of semantics (see sect. 3) and the above definition of correct type description. Given a coherent schema S, TYP associates with each symbol S ∈ C ∪ T the type descriptor defined by the expansion of the nonterminal symbols appearing in the corresponding declaration, with the following exception: if a class is described by means of an ISA clause, then TYP(C) is a tuple type with all the labels that appear in the superclass descriptions and in the tuple type that describes the class. In the case that the same label appears both in the tuple type that describes the class and in one or more superclass descriptions, TYP(C) presents such a label as it is defined in the tuple type describing the class. Using ISA in a class description makes it possible to exploit inheritance, i.e., the mechanism that allows a class to inherit properties of its superclasses. From the above definitions, it is clear that multiple inheritance is only allowed in a framework of "re-definition", and re-definition is subject to some type refinement constraints; a similar approach also appears in O2 [8]. We show some results obtained by evaluating TYP:
TYP(Date) = [day:int month:int year:int]
TYP(RschLeader) = [name:string birthdate:Date emp-code:string salary:int works-at:Company prj:{RschProject}1,2 budget:[prj-amount:int]]
TYP(Supervisor) = [name:string birthdate:Date emp-code:string salary:int works-at:Company prj:{Project}1,3 has-secretary:Secretary budget:[prj-amount:int external-amount:int budget-plan:int]]
TYP(Person) = [name:string birthdate:Date]
TYP(Woman) = [name:string birthdate:Date].
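The label-merging behaviour of TYP under ISA can be sketched as follows. This is a simplified illustration with an assumed schema encoding (superclass list plus own label dictionary); it shows only the override rule, not the refinement constraints that Definition 12 imposes on the overriding types.

```python
# Illustrative sketch of the TYP expansion for classes with an ISA clause:
# the class's own tuple overrides homonymous labels inherited from superclasses.

def typ(name, schema):
    """schema maps a class name to (list of superclasses, own label dict)."""
    supers, own = schema[name]
    merged = {}
    for s in supers:          # inherited labels first
        merged.update(typ(s, schema))
    merged.update(own)        # own labels override ("re-definition")
    return merged

# Assumed toy schema mirroring the Person / PrjLeader / RschLeader example.
schema = {
    "Person": ([], {"name": "string", "birthdate": "Date"}),
    "PrjLeader": (["Person"], {"emp-code": "string", "prj": "{Project}1,3"}),
    "RschLeader": (["PrjLeader"], {"prj": "{RschProject}1,2"}),
}
```

Evaluating `typ("RschLeader", schema)` yields all inherited labels, with prj carrying the refined set type declared in RschLeader itself.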
7 Computational complexity
In the environment of KL-ONE-like languages, particular attention is given to studying the relationship between the expressive power of the model definition language and the complexity of the related subsumption problem [7][9][12], in order to obtain tractable systems. As far as the complexity of SUBS(C1,C2) is concerned, we can repeat the same considerations made in [10] about the intractability of terminological reasoning. As a matter of fact, in the worst case, SUBS(C1,C2) performs subsequent expansions of both classes until it is possible to calculate SUBS(C1,C2) with C1 and C2 being completely expanded classes; C is a completely expanded class if TYP(C) does not contain any type names or class names: these names are replaced by their descriptions. The transformation of C into its completely expanded form is possible in a finite number of steps because our language does not have any recursive definitions, but the trouble is that it can induce expressions of size exponential in the size of the unexpanded class C [10][1], as the following example shows. Let B0 be a base type and:

class C1 = [l:B0 l':B0]
class C2 = [l:C1 l':C1]
...
class Cn = [l:Cn-1 l':Cn-1]

In this case the size of the expansion of Cn is O(2^n). Now we define two measures accounting for the name expansion process inside every type, and we show that the subsumption algorithm is polynomial with respect to these measures.

Definition 14 Depth of a Type. We indicate by d(t) the depth of a generic type:
- If t = B is a basic type, d(t) = 0
- If t = EmptySet, d(t) = 0
- If t = ⊤, d(t) = 0
- If t = [l1:t1,...,lm:tm], d(t) = 1 + max{d(ti), i = 1,...,m}
- If t = {t1}min,max, d(t) = 1 + d(t1)
- If t = C is a class name, d(t) = d(TYP(C))
- If t = T is a type name, d(t) = d(TYP(T)).
Definition 15 Expanded Size of a Type. Let t be a generic type; we indicate by |t| the expanded size of t:
- If t = B is a basic type, |t| = 1
- If t = EmptySet, |EmptySet| = 1
- If t = ⊤, |⊤| = 1
- If t = [l1:t1,...,lm:tm], |t| = Σ_{i=1}^m (1 + |ti|)
- If t = {t1}min,max, |t| = 1 + |t1|
- If t = C is a class name, |C| = ||Φ(C)|| + |TYP(C)|
- If t = T is a type name, |T| = |TYP(T)|.
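The two measures can be sketched directly from the definitions. This is an illustrative rendering over the assumed tuple-based encoding used earlier; `typ_of` plays the role of TYP and `prim_of` the role of Φ. Run on the C1, C2, ... example, it shows |Cn| doubling at each level, i.e., the expanded size is exponential in the declaration size, which is exactly why the algorithm's bound is stated in terms of |C|.

```python
def depth(t, typ_of):
    """d(t) of Definition 14; typ_of maps class/type names to descriptors."""
    if isinstance(t, str):
        if t in typ_of:                  # class or type name
            return depth(typ_of[t], typ_of)
        return 0                         # base type, EmptySet, TOP
    if t[0] == "tuple":
        return 1 + max((depth(ti, typ_of) for _, ti in t[1]), default=0)
    return 1 + depth(t[1], typ_of)       # set type

def size(t, typ_of, prim_of=lambda c: set()):
    """|t| of Definition 15; prim_of(C) plays the role of Phi(C)."""
    if isinstance(t, str):
        if t in typ_of:                  # name: ||Phi(C)|| + |TYP(C)|
            return len(prim_of(t)) + size(typ_of[t], typ_of, prim_of)
        return 1
    if t[0] == "tuple":
        return sum(1 + size(ti, typ_of, prim_of) for _, ti in t[1])
    return 1 + size(t[1], typ_of, prim_of)  # set type

# The exponential-expansion example: Cn roughly doubles in size at each level.
typ_of = {"C1": ("tuple", [("l", "B0"), ("l2", "B0")]),
          "C2": ("tuple", [("l", "C1"), ("l2", "C1")]),
          "C3": ("tuple", [("l", "C2"), ("l2", "C2")])}
```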
The following theorem can be proven by induction on type depth (we denote by REF(t,t') the algorithm that calculates whether t ≤ t').

Theorem 16 Subsumption Algorithm is Polynomial. Given two classes C1 and C2, SUBS(C1,C2) runs in O(|C1| × |C2|) time, i.e., SUBS(C1,C2) is polynomial in the expanded size of a class.

Proof. Since SUBS(C1,C2)=TRUE if and only if REF(C2,C1)=TRUE, we prove that REF(C2,C1) runs in O(|C1| × |C2|) time; further, since the worst case happens only in the presence of defined classes, we suppose that in every computation of REF(Ci,Cj), Cj is a defined class. REF(C1,C2) is true if and only if:
- Φ(C2) ⊆ Φ(C1); this is done in O(||Φ(C1)|| × ||Φ(C2)||) steps
- REF(TYP(C1),TYP(C2))=TRUE, with TYP(C1) = [l1:t1,...,ln:tn] and TYP(C2) = [l'1:t'1,...,l'm:t'm].
Given that d(C2) = d(TYP(C2)), the proof proceeds by induction on the depth of TYP(C2).
1. If d(TYP(C2))=1, then d(t'i)=0 ∀t'i in TYP(C2) (d(C2)=1 and |C2| = m + ||Φ(C2)||). Thus, for each label l'i, the REF(TYP(C1),TYP(C2)) procedure must scan all the lj appearing in TYP(C1) (at most |TYP(C1)| steps), looking for a label lj = l'i with tj = t'i, or tj ∈ C if t'i = ⊤; this is done in O(|TYP(C1)| × |TYP(C2)|) steps, so REF(C1,C2) runs in O(|C1| × |C2|) time.
2. If d(TYP(C2))=2 (max{d(t'i)}=1), then, for each label l'i in TYP(C2), we must find the corresponding lj in TYP(C1) (at most |TYP(C1)| steps) and then call REF(tj,t'i) with d(t'i) ≤ 1; the following cases hold:
(a) d(t'i)=0; then REF(tj,t'i) takes O(|tj| × |t'i|) steps.
(b) d(t'i)=1 with t'i a tuple type. Let tj = [lj1:tj1 ... ljn:tjn] and t'i = [l'i1:t'i1 ... l'im:t'im]. In this case, given that d(t'ik)=0 ∀k = 1,...,m, the analysis is similar to that of point 1; then REF(tj,t'i) takes O(|tj| × |t'i|) steps.
(c) d(t'i)=1 with t'i a set type. Let tj = {tj1}m,n or tj = EmptySet, and t'i = {t'i1}p,q with d(t'i1)=0. There must be: a) m ≥ p, n ≤ q and REF(tj1,t'i1)=TRUE; or b) p=0. In both cases REF(tj,t'i) takes O(|tj| × |t'i|) steps.
(d) d(t'i)=1 with t'i a class name. There must be: a) Φ(t'i) ⊆ Φ(tj); this is done in O(||Φ(t'i)|| × ||Φ(tj)||) steps; b) REF(TYP(tj),TYP(t'i))=TRUE, where TYP(t'i) is a tuple type with d(TYP(t'i)) = 1. Then, as shown in point 1, REF(TYP(tj),TYP(t'i)) takes O(|TYP(tj)| × |TYP(t'i)|) steps. Given that, for a class name, |C| = ||Φ(C)|| + |TYP(C)|, the total effort for computing REF(tj,t'i) is O(|tj| × |t'i|).
In all the cases we have examined, REF(tj,t'i) runs in O(|tj| × |t'i|) steps; then the total effort for the m labels is:

Σ_{i=1}^m (|TYP(C1)| + |tj| × |t'i|) ≤ |TYP(C1)| × Σ_{i=1}^m (1 + |t'i|) = |TYP(C1)| × |TYP(C2)|

and REF(C1,C2) runs in O(|C1| × |C2|) steps.
3. Let us assume that for d(TYP(C2))=k (d(C2)=k), REF(TYP(C1),TYP(C2)) runs in O(|TYP(C1)| × |TYP(C2)|) steps. This implies that:
(a) REF(C1,C2) with d(C2)=k runs in O(|C1| × |C2|) time;
(b) given that TYP(C2) is a generic tuple type, REF(t1,t2) with t1, t2 tuple types and d(t2)=k runs in O(|t1| × |t2|) steps;
(c) if t'i is a set type with d(t'i)=k−1, then REF(tj,t'i) runs in O(|tj| × |t'i|) steps.
4. Suppose that d(TYP(C2))=k+1; then max{d(t'i) | t'i is in TYP(C2)}=k (d(C2)=k+1). For each label l'i in TYP(C2) we must find the corresponding lj in TYP(C1) (at most |TYP(C1)| steps) and then recursively call REF(tj,t'i) with d(t'i) ≤ k; the following cases hold:
(a) t'i is a class name or a tuple type; then, by induction, REF(tj,t'i) with d(t'i) ≤ k runs in O(|tj| × |t'i|) steps;
(b) t'i is a set type with d(t'i)=k. Let tj = {tj1}m,n or tj = EmptySet, and t'i = {t'i1}p,q with d(t'i1)=k−1. There must be: a) m ≥ p, n ≤ q and REF(tj1,t'i1)=TRUE with d(t'i1)=k−1; or b) p=0. In both cases, by induction, REF(tj,t'i) takes O(|tj| × |t'i|) steps.
As shown in the conclusion of point 2, REF(TYP(C1),TYP(C2)) runs in O(|TYP(C1)| × |TYP(C2)|) steps; then REF(C1,C2) runs in O(|C1| × |C2|) steps.
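The structure of REF traced in the proof can be sketched as follows. This is a rough reconstruction, an assumption about the algorithm's shape rather than the authors' code: it covers base types, tuple types and set types only, omitting class/type name expansion and the Φ-inclusion check.

```python
# Rough sketch of the REF(t1, t2) refinement test (t1 <= t2) over the
# tuple-based encoding used earlier; names and EmptySet are omitted.

def ref(t1, t2):
    """True if t1 refines t2."""
    if t2 == "TOP":
        return True                       # everything refines the top type
    if isinstance(t1, str) or isinstance(t2, str):
        return t1 == t2                   # base types must coincide
    if t1[0] == t2[0] == "tuple":
        d1 = dict(t1[1])
        # every label of t2 must appear in t1 with a refining type
        return all(l in d1 and ref(d1[l], u) for l, u in t2[1])
    if t1[0] == t2[0] == "set":
        _, u1, m, n = t1
        _, u2, p, q = t2
        # element refinement plus the cardinality conditions m >= p, n <= q
        return ref(u1, u2) and m >= p and n <= q
    return False
```

For instance, `ref` accepts the paper's {RschProject}1,2 ≤ {Project}1,3 cardinality pattern (1 ≥ 1 and 2 ≤ 3), provided the element test succeeds.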
8 Conclusions
Our study aims at endowing an object-oriented database model with the inference capabilities related to subsumption. Subsumption and its applications have been thoroughly investigated in the field of Knowledge Representation systems. While database models usually refer to different deductive paradigms, such as logic, recent studies have pointed out the usefulness of taxonomic reasoning for dealing with data management at both an intensional and an extensional level. The main result of our paper is to show how this capability can be added to an object-oriented data model proposed in a database environment, thus extending its features while maintaining the original ones; as a matter of fact, we develop a formal framework for treating values and objects, and types and classes, in a well-founded and uniform way that agrees with taxonomic reasoning. Our future work will deal with managing extensional knowledge, investigating the features provided at this level by taxonomic reasoning; in particular, we will investigate the instance recognition problem. To date, Knowledge Representation systems have emphasized the intensional level of the Knowledge Base, while it is our opinion that providing the system with inferential techniques acting at both the intensional and the extensional level will improve the system's efficiency and usability.
References

1. S. Abiteboul and R. Hull. IFO: A Formal Semantic Database Model. In ACM TODS, vol. 12, n. 4, 1987.
2. A. Artale, D. Beneventano, S. Bergamaschi, F. Cesarini, C. Sartori and G. Soda. Taxonomic Reasoning in LOGIDATA+. In this volume, LOGIDATA+: Deductive Databases with Complex Objects, P. Atzeni ed., Lecture Notes in Computer Science, Springer-Verlag.
3. P. Atzeni, F. Cacace, S. Ceri, L. Tanca. The LOGIDATA+ Model. In this volume, LOGIDATA+: Deductive Databases with Complex Objects, P. Atzeni ed., Lecture Notes in Computer Science, Springer-Verlag.
4. R.J. Brachman, D.L. McGuinness, P.F. Patel-Schneider and L.A. Resnick. Living with CLASSIC: When and How to Use a KL-ONE-Like Language. In Principles of Semantic Networks, J. Sowa ed., Morgan Kaufmann Publishers, Inc., 1990.
5. L. Cardelli. A Semantics of Multiple Inheritance. In Semantics of Data Types, Springer-Verlag, pp. 51-67, 1984.
6. L.M.L. Delcambre and K.C. Davis. Automatic Validation of Object-Oriented Database Structures. In Proceedings of Int. Conf. on Data Engineering, 1989.
7. F.M. Donini, M. Lenzerini, D. Nardi, W. Nutt. The Complexity of Concept Languages. In Proceedings of the 2nd Int. Conf. on Principles of Knowledge Representation and Reasoning, KR-91, Morgan Kaufmann, 1991.
8. C. Lecluse and P. Richard. Modelling Complex Structures in Object-Oriented Databases. In ACM PODS, pp. 360-367, 1989.
9. H.J. Levesque and R.J. Brachman. Expressiveness and Tractability in Knowledge Representation and Reasoning. In Computational Intelligence, 3:78-93, 1987.
10. B. Nebel. Terminological Reasoning is Inherently Intractable. IWBS Report 82, September 1989.
11. B. Nebel. Reasoning and Revision in Hybrid Representation Systems. Lecture Notes in Artificial Intelligence, n. 422, Springer-Verlag, 1990.
12. P.F. Patel-Schneider. Undecidability of Subsumption in NIKL. In Artificial Intelligence, 39:263-272, 1989.
A Appendix
In this Appendix we present an example of schema specification, using the LOGIDATA* syntax, along with an example of schema interpretation. Class and type names are indicated with a capital letter.

Schema specification

type Date = [day:int month:int year:int]
class Organization ≤ [name:string residence:string]
class Consortium ≤ [members:{Organization}1,∞]
class Project = [prj-code:string description:string]
class RschProject = [prj-code:string description:string has-units:Consortium]
class College = [name:string address:string courses:{string}1,∞]
class University = ISA Organization [has-colleges:{College}1,∞]
class Company = ISA Organization [emp-num:int turnover:int]
class Person ≤ [name:string birthdate:Date]
class Woman ≤ ISA Person
class Student = ISA Person [regist-num:int enrolled:College enrolled-course:{string}1,10]
class Attending-Student = ISA Person [regist-num:int enrolled:College enrolled-course:{string}2,10]
class Employee = ISA Person [emp-code:string salary:int works-at:Company]
class Unemployed = ISA Person [works-at:EmptySet]
class Secretary ≤ ISA Woman [emp-code:string salary:int works-at:Company]
class PrjLeader = ISA Person [emp-code:string salary:int works-at:Company prj:{Project}1,3 budget:[prj-amount:int]]
class RschLeader = ISA PrjLeader [prj:{RschProject}1,2]
class PrjManager = ISA Person [emp-code:string salary:int works-at:Company has-secretary:Secretary budget:[prj-amount:int external-amount:int]]
class Supervisor = ISA PrjLeader PrjManager [budget:[prj-amount:int external-amount:int budget-plan:int]]
Schema interpretation

Given the following domain:

O = {#Eng, #Med, #Art, #FI-univ, #Ote, #c1, #LogDB, #OODB, #Anne, #Al, #Roland, #Robert, #Max}

δ(#Eng) = [name:Engineering address:Via,S.Marta,3,FI courses:nil]
δ(#Med) = [name:Medicine address:Viale,Morgagni,1,FI courses:nil]
δ(#Art) = [name:Liberal-Arts address:Piazza,Brunelleschi,FI courses:nil]
δ(#FI-univ) = [name:Universita'-Firenze residence:Firenze has-colleges:{#Eng,#Med,#Art}]
δ(#Ote) = [name:Ote-Biomedica residence:Firenze emp-num:1500 turnover:1000]
δ(#c1) = [members:{#FI-univ,#Ote}]
δ(#LogDB) = [prj-code:DB14 description:deductive-database]
δ(#OODB) = [prj-code:DB15 description:O-O-database has-units:#c1]
δ(#Anne) = [name:Anne birthdate:[day:22 month:11 year:1964] emp-code:I76009 salary:1200 works-at:#Ote]
δ(#Al) = [name:Alexander birthdate:[day:25 month:7 year:1964] emp-code:I76010 salary:2000 works-at:#Ote prj:{#LogDB} budget:[prj-amount:1500]]
δ(#Roland) = [name:Roland birthdate:[day:2 month:6 year:1947] emp-code:I302 salary:4000 works-at:#Ote has-secretary:#Anne prj:{#LogDB} budget:[prj-amount:1500 external-amount:500 budget-plan:3000]]
δ(#Robert) = [name:Robert birthdate:[day:7 month:5 year:1954] emp-code:I205 salary:2000 works-at:#Ote prj:{#OODB} budget:[prj-amount:1000]]
δ(#Max) = [name:Max birthdate:[day:nil month:7 year:1969] regist-num:176487 enrolled:#Eng enrolled-course:{mathematics,physics,chemistry}]

Having fixed the interpretation for the primitive classes:

I(Organization) = {#FI-univ,#Ote}
I(Consortium) = {#c1}
I(Person) = {#Anne,#Al,#Roland,#Robert,#Max}
I(Woman) = {#Anne}
I(Secretary) = {#Anne}

the interpretation of the defined classes is unambiguously determined:

I(University) = {#FI-univ}
I(Company) = {#Ote}
I(College) = {#Eng,#Med,#Art}
I(Project) = {#LogDB,#OODB}
I(RschProject) = {#OODB}
I(Employee) = {#Anne,#Al,#Roland,#Robert}
I(Unemployed) = {}
I(PrjLeader) = {#Al,#Roland}
I(PrjManager) = {#Roland}
I(Supervisor) = {#Roland}
I(RschLeader) = {#Robert}
I(Student) = {#Max}
I(Attending-Student) = {#Max}.

B Appendix
We prove that the SUBS algorithm calculates subsumption between classes. First, we must prove that if SUBS(C1,C2)=TRUE then I(C2) ⊆ I(C1) (soundness); then, we must prove that if I(C2) ⊆ I(C1) then SUBS(C1,C2)=TRUE (completeness). Note that the system does not have cyclic references; therefore, it is possible to use the induction principle in the proofs.
Theorem 17 Soundness of the SUBS Algorithm. Let C1 and C2 be two classes; if SUBS(C1,C2)=TRUE, then for each domain Σ, for each set of values V and for each interpretation function I over V: I(C2) ⊆ I(C1).
Proof. Since SUBS(C1,C2)=TRUE if and only if REF(C2,C1)=TRUE, we prove that, given two types t and t', if t ≤ t' then for every V and every I over V: I(t) ⊆ I(t'). Assuming that t ≤ t', one of the following cases holds (see sect. 5):

R1, R3a-b: Trivial.

R4: t = [l1:t1,...,lm:tm] and t' = [l'1:t'1,...,l'n:t'n], with t defined at least on all the labels appearing in t' and such that ∀l'i, i = 1,...,n, ∃lj for which l'i = lj and REF(tj,t'i)=TRUE. Therefore, by induction, I(tj) ⊆ I(t'i) and, because of point 2 of the interpretation definition, I(t) ⊆ I(t').

R5: t = {t1}m,n and t' = {t'1}p,q, with REF(t1,t'1)=TRUE, m ≥ p and n ≤ q; by induction I(t1) ⊆ I(t'1), and therefore I(t) ⊆ I(t'). In the case t = EmptySet, it must be p = 0; therefore I(EmptySet) ⊆ I({t'1}0,q).

R3c: t and t' ∈ C with t' ∈ CD and:
- Φ(t') ⊆ Φ(t)
- REF(TYP(t),TYP(t'))=TRUE.
Therefore, by induction, I(TYP(t)) ⊆ I(TYP(t')); then, due to point 5.5 of the interpretation definition, I(t) ⊆ I(t').
The completeness of the SUBS algorithm can be proven on the basis of the following two lemmas. The first lemma allows us to construct an interpretation function in such a way that a particular value v1 is in the interpretation of each tuple type, except for one particular case.

Lemma 18. Let L be a set of labels, l° ∈ L; t a generic type; V a set of values. Moreover, let I be an interpretation function over V, and v1 and v* ∈ V such that:
- v* ∈ I(t)
- v1 is a tuple value
- v1(li) = nil ∀li ∈ L with li ≠ l°
- v1(l°) = v*
Then, for every tuple type x = [l1:t1,...,lk:tk], if x does not contain l°:u with u such that REF(t,u)=FALSE, then v1 ∈ I(x).

Proof. There are two cases.
1. The l° label is not present in x. Due to our construction, v1 is surely defined over all the labels appearing in x and v1(li) ∈ I(ti) ∪ {nil} ∀i = 1,...,k. Then, by point 2 of the interpretation definition, v1 ∈ I(x).
2. The l° label is present in x: x = [l1:t1 ... l°:s ... lk:tk] with s a generic type such that t ≤ s. It is enough to prove that v1(l°) ∈ I(s). Due to our construction, v1(l°) = v* ∈ I(t); but I(t) ⊆ I(s) because t ≤ s, so v1(l°) ∈ I(s). Therefore, v1 ∈ I(x).

Lemma 19. For each t ∈ T(L,B,C,T) it is possible to find n distinct values v1,...,vn ∈ V and an interpretation function I defined over V such that v1,...,vn ∈ I(t).

Proof. The proof follows from the consideration that, for each basic type Bi, I(Bi) is a countable set. Therefore, it is easy to show by induction that the lemma holds.

Now we can prove the completeness of the subsumption algorithm.

Theorem 20 Completeness of the SUBS Algorithm. Given two classes C1 and C2, if I(C1) ⊆ I(C2) for every set of values V and for every interpretation function I over V, then SUBS(C2,C1)=TRUE.
Proof. Given that SUBS(C2,C1) is true if and only if REF(C1,C2) is true, we show that, given two generic types x and y, if REF(x,y)=FALSE, it is always possible to find a set V and an interpretation function I over V such that I(x) ⊄ I(y).
1. If x, y are of different kinds (for example, x is a tuple type and y a set type), then I(x) ∩ I(y) = ∅ because the sets D, O, VT, VS are pairwise disjoint; therefore, I(x) ⊄ I(y).
2. If x, y ∈ B (they are base types), REF(x,y)=FALSE ⟺ x ≠ y. Then I(x) ⊄ I(y), because the base types are pairwise disjoint.
3. If x, y are tuple types, we have two different cases.
(a) y contains l°:u, while x does not contain the l° label. We build a tuple value v1 defined exclusively on L − {l°} and such that v1(li) = nil ∀li ∈ L − {l°}. Let v1 ∈ V and I be an interpretation function over V; then v1 ∈ I(x) but v1 ∉ I(y).
(b) y contains l°:u, x contains l°:t, but t is not a refinement of u. Since REF(t,u)=FALSE, by induction there are I*, V* and v* ∈ V* such that v* ∈ I*(t) but v* ∉ I*(u). Now we build a tuple value v1 such that v1(li) = nil ∀li ∈ L − {l°} and v1(l°) = v*. Let V = V* ∪ {v1} (with v1 ≠ v*); let I be an interpretation function over V such that, for every type s, I(s) ⊆ I*(s) ∪ {v1}, and in particular v* ∈ I(t). Then v1(l°) = v* ∉ I(u); therefore, v1 ∉ I(y). On the other hand, the conditions of Lemma 18 (case 2) are satisfied, so v1 ∈ I(x). Then I(x) ⊄ I(y).
4. If x, y are set types with x = {t}n,m and y = {u}p,q and m, p > 0, we have two different cases:
(a) REF(t,u)=FALSE. By induction, there are I*, V* and v* ∈ V* such that v* ∈ I*(t) but v* ∉ I*(u). Let v0 be a set value such that v0 = {v*,v1,...,vn−1}; V = V* ∪ {v0,v1,...,vn−1}; let I be an interpretation function over V such that vi ∈ I(t) ∀i = 1,...,n−1 (this follows from Lemma 19) and, for every type s, I(s) ⊆ I*(s) ∪ {v0,v1,...,vn−1}. Then v* ∈ I(t), while v* ∉ I(u). Therefore, since v0 = {v*,v1,...,vn−1} ⊆ I(t) and ||v0|| = n, then, due to the interpretation of set types, v0 ∈ I(x) but v0 ∉ I(y). Then I(x) ⊄ I(y).
(b) n < p or m > q. Let n < p: by Lemma 19 we can build a set value of cardinality n contained in I(t), which belongs to I(x) but not to I(y); the case m > q is similar.
5. x = EmptySet and y = {t}n,m with n ≥ 1. I(EmptySet) = {∅} ⊄ {vs ∈ VS | vs ⊆ I(t) and n ≤ ||vs|| ≤ m} = I(y) (since ||∅|| = 0); therefore I(x) ⊄ I(y).
6. If x, y are two classes, we have the following cases:
(a) y ∈ CP and there is no explicit ISA such that x ISA y. Given the semantics of a primitive class, it is always possible to build I, V and o ∈ O ⊆ V such that o ∈ I(x) but o ∉ I(y).
(b) y ∈ CD and ∃Cp ∈ CP such that Cp ∈ Φ(y) but Cp ∉ Φ(x). Let I, V and o ∈ O ⊆ V be such that o ∈ I(x) but o ∉ I(Cp); then, given that I(y) ⊆ I(Cp), o ∉ I(y).
(c) REF(TYP(x),TYP(y))=FALSE. Given that x is a coherent class, it must be TYP(x) ≤ TYP(Cp) ∀Cp ∈ Φ(x). Now, by induction, there are I*, V*, v* such that v* ∈ I*(TYP(x)) but v* ∉ I*(TYP(y)). Let o0 be an object identifier such that δ(o0) = v*; V = V* ∪ {o0}; let I be an interpretation function over V such that: a) for every type s, I(s) ⊆ I*(s) ∪ {o0}; b) ∀Cp ∈ Φ(x), o0 ∈ I(Cp). Then, from a), v* ∈ I(TYP(x)), while assumption b) is valid given that δ(o0) = v* ∈ I(TYP(x)) ⊆ I(TYP(Cp)) ∀Cp ∈ Φ(x). On the other hand, v* ∉ I(TYP(y)); therefore, due to the definition of class interpretation, o0 ∈ I(x) but o0 ∉ I(y).
Taxonomic Reasoning with Cycles in LOGIDATA+

Domenico Beneventano, Sonia Bergamaschi and Claudio Sartori

CIOC - CNR, Dipartimento di Elettronica, Informatica e Sistemistica
Viale Risorgimento 2, 40136 Bologna, Italy
e-mail: {domenico,sonia,claudio}@deis64.cineca.it
Abstract. This paper shows the subsumption computation techniques for a LOGIDATA+ schema allowing cyclic definitions for classes. The formal framework LOGIDATA*_cyc, which extends LOGIDATA* to perform taxonomic reasoning in the presence of cyclic class definitions, is introduced. It includes the notions of possible instance of a schema; legal instance of a schema, defined as the greatest fixed-point of possible instances; and subsumption relation. On the basis of this framework, the definitions of coherent type and consistent class are introduced, and the necessary algorithms to detect incoherence and compute subsumption in a LOGIDATA+ schema are given. Some examples of subsumption computation show its feasibility for schema design and validation.
1 Introduction
This paper shows the subsumption computation techniques for a LOGIDATA+ schema allowing cyclic definitions for classes. The reader is referred to Taxonomic Reasoning in LOGIDATA+, by A. Artale, F. Cesarini, G. Soda, D. Beneventano, S. Bergamaschi, C. Sartori, in this volume, for the motivation of taxonomic reasoning in this environment. When class definitions are allowed to contain cycles, the models of the schema must be found by means of some fixed-point operation; hence it becomes necessary to precisely define the starting points and the fixed-points which are to be allowed. In section 2 the formal framework, the LOGIDATA* model, is given. In section 3 a running example and an intuitive introduction to the paper's topic are given. In section 4 a formal definition of a LOGIDATA*_cyc schema is given, the notion of possible instance is introduced and its problems are discussed. In section 5 the legal instance of a schema is formally defined as the greatest fixed-point of possible instances and a computation algorithm is provided; this is done by introducing a modified schema where the primitive classes are transformed in order to find a valid starting point and compute the fixed-point. The main results on fixed-points are also recalled. In section 6 the subsumption relation is formally defined. In section 7 the definitions of coherent type and consistent class are introduced, as a support for

* This work was partially supported by the Italian project Sistemi Informatici e Calcolo Parallelo, subproject 5, objective LOGIDATA+ of the National Research Council (CNR).
schema validation. Then the validated schema is simplified in order to effectively compute the subsumption relation. Finally, in section 8 some examples of subsumption computation are given, and its feasibility for schema design, validation and object recognition is shown.
2 The LOGIDATA* Model
In order to fit taxonomic reasoning into LOGIDATA+ some modifications are necessary. The LOGIDATA* model has been identified as a formal framework which is common with the work done in Taxonomic Reasoning in LOGIDATA+, in this volume. In section 4 some additions will also be introduced in order to deal with cycles. The main structure of LOGIDATA* is the class, which denotes sets of objects, each of which is identified by an object identifier (oid). Objects in classes can have complex structures obtained by repeatedly using the tuple and set constructors². Furthermore, type names are provided to simplify user declarations. Further, set types with cardinality constraints are introduced, integrating into the object description a kind of integrity constraint in the database environment, and classes are distinguished as primitive and defined.

2.1 Database Schema
We consider as base types B the two countable, non-finite sets of strings and integers, which are disjoint. Let CP be a countable set of primitive class names, CD a countable set of defined class names, C = CP ∪ CD, T a countable set of type names, L a countable set of labels; the sets B, CP, CD and T are pairwise disjoint, CD including the universal class, which will be denoted by the symbol ⊤. We indicate by T(L,B,C,T) (often abbreviated simply as T) the whole set of type descriptors over L, B, C, T, defined as follows:
- each base type name B ∈ B is a type;
- each class name C ∈ C is a type;
- each type name T ∈ T is a type;
- if t is a type, then {t}(min,max) is also a type, called set type, where min is a non-negative integer, max is a positive integer or ∞, and max ≥ min; emptyset is the empty set type;
- if t1,...,tk, with k ≥ 0, are types and l1,...,lk are distinct labels, then [l1:t1,...,lk:tk] is also a type, called tuple type.

Let ISA be a partial order over C: C ISA C' states that each object belonging to class C also belongs to class C', i.e., class C is a subclass (or a specialization) of C'. Let TYP be a function from T ∪ C to the set of type descriptors in T(L,B,C,T). TYP is such that:
- for each T ∈ T, TYP(T) is a tuple or set type descriptor of T;

² The sequence type can easily be added.
- for each C ∈ C, TYP(C) is a tuple type descriptor of T; TYP(⊤) is the empty tuple, i.e., the tuple type with no labels.

Definition 1 (LOGIDATA* Schema). A LOGIDATA* schema is a five-tuple S = (CP, CD, T, TYP, ISA) where T, CP, CD, ISA and TYP are defined as above.

2.2 Database Values
Let Di be the set of values associated with the base type name Bi ∈ B, and let D = ∪i Di. Each element v ∈ D is a basic value. We assume that the sets Di are pairwise disjoint. Let O be a countable set of symbols called object identifiers (o is a generic object identifier), disjoint from D.

Definition 2 (Values). The set V of values is V = D ∪ O ∪ VT ∪ VS where
- VT is the set of tuple values: VT = {vt | vt is a mapping, vt : L → V}. We denote by [l1:v1,...,lk:vk] the total mapping defined on {l1,...,lk} such that vt(li) = vi ∈ V, i = 1,...,k.
- VS is the set of set values: VS = {vs | vs ⊆ V}. A set value is denoted by {v1,...,vk}, with vi ∈ V, i = 1,...,k.

We note that the object identifiers are values, too. Since objects are usually considered to be pairs of identifiers and values, we assume the existence of a function that assigns values to object identifiers.

Definition 3 (Value Assignment and Domain). A value assignment is a total mapping, denoted by δ, that associates a tuple value with each object identifier: δ : O → VT. The domain Σ is the couple Σ = (O, δ).

2.3 Interpretation and Database Instance
The LOGIDATA* syntax is associated with a semantics which regulates the relationship between the intensional and extensional levels of the model.

Definition 4 (Interpretation). Given a LOGIDATA* schema S, let T be the set of type descriptors in S. Given a domain Σ, the interpretation function I of S over Σ is a function from T to 2^V such that:

I(C) ⊆ O for each C ∈ C
I(T) ⊆ V − O for each T ∈ T

The interpretation is extended to types in T as follows:

I([l1 : t1, ..., lk : tk]) = {v_t ∈ V_T | v_t is defined at least on {l1, ..., lk} and v_t(l_i) ∈ I(t_i), i = 1, ..., k}
I(emptyset) = {∅}
I({t}(n,m)) = {v_s ∈ V_S | n ≤ |v_s| ≤ m and v_s ⊆ I(t)}
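The interpretation rules above can be exercised with a small membership check. The following sketch uses our own encoding of type descriptors (tagged tuples) and of interpretations (plain Python sets); none of these names come from the paper. It shows the two distinctive points of Definition 4: a tuple value may be defined on more labels than its tuple type lists, and a set value must respect the (min, max) cardinality bounds.

```python
# A hypothetical encoding of type descriptors:
#   ("name", n)              base type or class name n
#   ("set", elem, lo, hi)    set type {elem}(lo,hi)
#   ("tuple", {label: type}) tuple type

def satisfies(value, t, interp):
    """Check value ∈ I(t), with `interp` giving I for named types."""
    kind = t[0]
    if kind == "name":
        return value in interp[t[1]]
    if kind == "set":
        _, elem, lo, hi = t
        return (isinstance(value, frozenset)
                and lo <= len(value) <= hi
                and all(satisfies(v, elem, interp) for v in value))
    if kind == "tuple":
        # defined *at least* on the listed labels: extra labels are allowed
        return (isinstance(value, dict)
                and all(l in value and satisfies(value[l], ty, interp)
                        for l, ty in t[1].items()))
    raise ValueError(kind)

interp = {"integer": set(range(1000)), "string": {"a", "b"}}
person = ("tuple", {"age": ("name", "integer")})
print(satisfies({"age": 30, "name": "a"}, person, interp))                      # True
print(satisfies(frozenset({1, 2}), ("set", ("name", "integer"), 1, 3), interp)) # True
```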
This interpretation is obviously not satisfactory as an instance of a database, as we want the objects of a class to agree with the type description associated to the class. Thus, in the following sections we will introduce the notion of legal instance of a database for acyclic and cyclic schema declarations.
3 Defined Classes, Cycles and Subsumption
The example shown in table 1 will be used throughout the paper, in order to give an intuitive account of the formal framework developed and of the subsumption computation.
type StringSet          = {string}(1,3)
type Date               = [day: integer, month: integer, year: integer]
prim-class Person       = [name: string, birth-date: Date]
prim-class Organization = [name: string]
class Structure         = isa Organization [collaborates: {Structure}(0,3)]
prim-class Division     = isa Organization [employs: {Employee}(1,20), turnover: integer]
prim-class Employee     = isa Person [salary: integer, work-in: Division, emp-code: integer]
class Clerk             = isa Employee [work-in: Department]
class Department        = isa Division [head: Clerk, employs: {Clerk}(1,5)]
class Secretary         = isa Employee [level: integer]
class Office            = isa Division [head: Typist, employs: {Typist}(1,5)]
class Typist            = isa Employee [work-in: Office, level: integer, qualification: StringSet]
prim-class Section      = isa Division, Structure [name: string, head: Clerk]

Table 1. Example of LOGIDATA*_cyc schema declaration
The example syntax is quite intuitive: declarations prefixed with the keyword type introduce type names in order to simplify declarations, as is usual in many programming languages. In our case two type names are introduced: one whose values are sets of strings, and a tuple type. Brackets [] denote tuples, {} denote sets; string and integer are the base types; isa denotes an explicit ISA relationship. A class is described by a list of superclasses and a tuple of differentia properties. Notice that the ISA relationship is part of a class description and that superclasses can be ancestors (not necessarily parents) of the described classes. The main extension, with respect
to well-known complex object data models, is the distinction between the declaration of primitive classes (Person, Organization, Division, Employee, Section) and of defined classes (Structure, Clerk, Department, Secretary, Office, Typist). From an extensional point of view we can show the differences introduced by this distinction with an example. Consider the objects John, Mary and Store:
John is an Employee, has 2 for level.
Mary is a Person, has 1600 for salary, has 65782 for emp-code, works in Store.
Store is a Division.
The object John can be automatically recognized as a Secretary, since Secretary is a defined class and its extension can be computed. On the contrary, the object Mary cannot be recognized as an Employee, even if it is qualified with all the properties of that class, since that class is primitive. For cycles we need further considerations: Typist is a cyclic defined class (Typist mentions Office in its declaration and vice versa). It is non-obvious to define the semantics of cyclic definitions. For example:
John is an Employee, has 2 for level, has some qualifications, works in Research-office.
Research-office is a Division, has John as head, has John as employee.
John is recognized as a Typist if Research-office is an Office, and vice versa. This case points out the need for a fixed-point semantics in order to formalize the meaning of cyclic definitions. Furthermore, notice that Division and Employee also give rise to circular descriptions but, as they are primitive, we do not need any fixed-point semantics for them. Let us now comment on subsumption at the intensional level. From the definition of Secretary, it is quite intuitive to conclude that all elements of Typist are also elements of Secretary; consequently, Typist is subsumed by Secretary. The subsumption computation will enrich the ISA hierarchies provided by the user with those computed by the taxonomic reasoning system. In Figure 1 we represent the schema description after the subsumption computation (only the property labels which give rise to cyclic descriptions have been drawn). The computed ISA links are denoted in the figure with dashed lines. The dashed lines marked with a * denote the computed ISA links which are related to cycles and thus depend on the choice of the fixed-point semantics. This point is one of the main results of this contribution and is dealt with by the techniques developed in the AI knowledge representation area [Neb91].

4 Database Schema and Instance
This section introduces the formal definition of a LOGIDATA*_cyc schema and extends LOGIDATA* to support cyclic definitions. Cycles are recognized by
[Figure 1 omitted: the class hierarchy of table 1, with user-defined ISA links drawn solid and computed ISA links drawn dashed, over person, organization, structure, division, employee, section, secretary, clerk, office and typist, together with the cyclic property labels works and employs.]

Fig. 1. Schema with user-defined and computed hierarchies
means of the notion of dependence.

Definition 5 (Depends on). N depends on N', where N, N' ∈ T ∪ C, if N' is contained in the expression TYP(N) defining N, or if N depends on N'' and N'' depends on N'.

Definition 6 (LOGIDATA*_cyc Schema). Given a set of base types B and a finite set of attributes L, a LOGIDATA*_cyc schema is a five-tuple S = (C_P, C_D, T, TYP, ISA_U), where:

- C_P, C_D and T are defined as in section 2;
- TYP is a function on C ∪ T such that:
  - for each T ∈ T, TYP(T) is a set type or tuple type of T; the relation depends on, restricted to type names, is a partial, non-reflexive order;
  - for each C ∈ C, TYP(C) is a tuple type of T³; the relation depends on, restricted to class names, can contain cycles (i.e., it is not required to be irreflexive); we say that a class C is cyclic when C depends on C.
- ISA_U is a partial, non-reflexive order over C, completed with C ISA_U ⊤ for any C ∈ C. The ISA_U relationship is the ISA provided by the user. In the following, the transitive closure ISA_E of the user-supplied ISA will also be considered.

Each schema S = (C_P, C_D, T, TYP, ISA_U) is associated with a set of type descriptors T. The partial order on type names prevents recursive type descriptions, which would generate infinite structures. As seen above, a class C is specified by means of the structure of its instances (TYP(C)) and the supersets of its instances (the ISA_U). Thus, the type specification must agree with the type specifications of the superclasses. In section 7 this agreement will be formally specified by means of the notion of consistent schema.

Multiple inheritance is allowed: a class can inherit from two or more classes, but in case of ambiguity (i.e., when the same label comes from two or more parent classes) the attribute must be redefined. For this reason, the attribute name in the class Section of table 1 has been redefined. The schema S = (C_P, C_D, T, TYP, ISA_U) corresponding to the example of table 1 is shown in table 2.
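Definition 5's "depends on" is simply the transitive closure of the direct mentions of names inside type descriptors, so cyclic classes can be detected mechanically. The following is a hypothetical sketch; the `mentions` table is our own encoding of the direct references occurring in the declarations of table 1.

```python
def depends_on(mentions):
    """mentions: name -> set of names directly occurring in TYP(name).
    Returns the transitive closure, name -> set of names it depends on."""
    closure = {n: set(direct) for n, direct in mentions.items()}
    changed = True
    while changed:                      # naive closure fixpoint
        changed = False
        for deps in closure.values():
            new = set().union(*(closure.get(d, set()) for d in deps)) - deps
            if new:
                deps |= new
                changed = True
    return closure

# direct mentions among the mutually recursive classes of table 1
mentions = {
    "Clerk": {"Department"}, "Department": {"Clerk"},
    "Typist": {"Office"}, "Office": {"Typist"},
    "Structure": {"Structure"}, "Person": set(),
}
closure = depends_on(mentions)
cyclic = {n for n, deps in closure.items() if n in deps}
print(sorted(cyclic))   # Person is acyclic; the other five depend on themselves
```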
4.1 Instance of a LOGIDATA*_cyc Schema
An arbitrary instance of a schema S = (C_P, C_D, T, TYP, ISA_U) is the range of an arbitrary interpretation I for a given domain Σ. In the following, for simplicity, we will indicate with the same symbol, I, an interpretation and the instance it produces. Of course, the only interpretations of interest are those giving an instance which is in accordance with the schema descriptions. In order to formally define these interpretations, let us consider, for a given interpretation I, the sets of object identifiers associated to a class name C which are compatible with both the type definitions and the user-defined hierarchy ISA_U.

Definition 7 (Possible Instance). Given a domain Σ, an interpretation I is a possible instance of a schema S if and only if, for every C_P ∈ C_P, C_D ∈ C_D and T ∈ T, we have:
I(C_P) ⊆ {o ∈ O | δ(o) ∈ I(TYP(C_P))} ∩ ( ∩_{C_P ISA_U C'} I(C') )    (1)

I(C_D) = {o ∈ O | δ(o) ∈ I(TYP(C_D))} ∩ ( ∩_{C_D ISA_U C'} I(C') )    (2)

I(T) = I(TYP(T))    (3)
³ A class type is constrained to be a tuple, in accordance with the LOGIDATA+ model; the subsumption algorithm can easily be extended to any class type, as shown in [BN91].
C = C_P ∪ C_D = {Person, Organization, Division, Employee, Section} ∪ {Structure, Clerk, Department, Secretary, Office, Typist}
T = {StringSet, Date}
Structure ISA_U Organization, Division ISA_U Organization, Employee ISA_U Person, Clerk ISA_U Employee, Department ISA_U Division, Secretary ISA_U Employee, Office ISA_U Division, Typist ISA_U Employee, Section ISA_U Division, Section ISA_U Structure.

The TYP function is the following:

TYP(StringSet)    = {string}(1,3)
TYP(Date)         = [day: integer, month: integer, year: integer]
TYP(Person)       = [name: string, birth-date: Date]
TYP(Organization) = [name: string]
TYP(Structure)    = [name: string, collaborates: {Structure}(0,3)]
TYP(Division)     = [name: string, employs: {Employee}(1,20), turnover: integer]
TYP(Employee)     = [name: string, birth-date: Date, salary: integer, work-in: Division, emp-code: integer]
TYP(Clerk)        = [name: string, birth-date: Date, salary: integer, work-in: Department, emp-code: integer]
TYP(Department)   = [name: string, employs: {Clerk}(1,5), turnover: integer, head: Clerk]
TYP(Secretary)    = [name: string, birth-date: Date, salary: integer, work-in: Division, emp-code: integer, level: integer]
TYP(Office)       = [name: string, employs: {Typist}(1,5), turnover: integer, head: Typist]
TYP(Typist)       = [name: string, birth-date: Date, salary: integer, emp-code: integer, qualification: StringSet, work-in: Office, level: integer]
TYP(Section)      = [name: string, collaborates: {Structure}(0,3), turnover: integer, employs: {Employee}(1,20), head: Clerk]

Table 2. Example of LOGIDATA*_cyc schema
Notice that I(⊤) = O; in fact, TYP(⊤) = [] and I([]) = V_T, so the set {o ∈ O | δ(o) ∈ I(TYP(⊤))} is equal to O. Further, remember that C ISA_U ⊤ for every C; thus the last intersection in expressions 1 and 2 has at least one component which coincides with O. As an example of possible instance, let us consider a subset of the schema introduced above: C_P = {Person, Organization, Division}, C_D = {Employee, Clerk, Department, Secretary, Structure}. Let us consider the domain shown in table 3. In general, for a given schema S and domain Σ = (O, δ), there are many
O = {o1, o2, o3, o4, o5, o6, o7, o8, o9, o10, o11, o12}

δ(o1)  = [name: "A", collaborates: {o2}]
δ(o2)  = [name: "B", collaborates: {o3}]
δ(o3)  = [name: "C", collaborates: {o1}]
δ(o4)  = [name: "D", collaborates: {o5, o6}]
δ(o5)  = [name: "E", collaborates: {o7}]
δ(o6)  = [name: "F", collaborates: {}]
δ(o7)  = [name: "G", collaborates: {}]
δ(o8)  = [name: "Research", employs: {o10, o11}, head: o10, turnover: 13]
δ(o9)  = [name: "Development", employs: {o12}, head: o12, turnover: 16]
δ(o10) = [name: "Rupert", birth-date: [day: 18, month: 9, year: 1951], salary: 10000, work-in: o8, emp-code: 250]
δ(o11) = [name: "Robert", birth-date: [day: 28, month: 6, year: 1957], salary: 9000, work-in: o8, emp-code: 132, qualification: {administrative}]
δ(o12) = [name: "Mark", birth-date: [day: 5, month: 4, year: 1967], salary: 10000, work-in: o9, emp-code: 278, qualification: {engineer}, level: 1]

Table 3. Example of domain
possible instances (depending on the primitive assignment). For a defined class C_D, the possible instance is a set of object identifiers influenced by both the type definitions and the user-defined hierarchy ISA_U. The computation of this set is related to the depends on relation introduced in definition 5. If a class does not depend on itself, its possible instance can be univocally determined, given a domain Σ and the interpretations of the base types and the primitive classes. For cyclic defined classes the possible instance, given the domain and the primitive classes, may not be unique, and additional choices are necessary. This problem has been recognized also in [KV84], when dealing with queries. In our example, Employee, Clerk and Department (among others) have recursive types. Let us consider the following interpretations of the primitive classes:
I(Person) = {o10, o11, o12}
I(Organization) = {o1, o2, o3, o4, o5, o6, o7, o8, o9}
I(Division) = {o1, o2, o3, o4, o5, o6, o7, o8, o9}
The above interpretation admits, among others, the following interpretations of the defined classes Employee, Clerk, Department and Structure:
I1(Structure)  = {o4, o5, o6, o7}              I1(Employee) = {o10, o11, o12}
I1(Clerk)      = {}                            I1(Department) = {}

I2(Structure)  = {o4, o5, o6, o7}              I2(Employee) = {o10, o11, o12}
I2(Clerk)      = {o10, o11}                    I2(Department) = {o8}

I3(Structure)  = {o1, o2, o3, o4, o5, o6, o7}  I3(Employee) = {o10, o11, o12}
I3(Clerk)      = {o10, o11, o12}               I3(Department) = {o8, o9}
It can be observed that the interpretation of a cyclic defined class (see Employee) can be univocally computed when its description contains at least one reference to a primitive class (Division). But the interpretation of a cyclic defined class cannot be uniquely computed if its description contains classes which refer directly or indirectly to itself and are all defined (Clerk, Department, and Structure). The interpretations of Clerk, Department, and Structure vary between a minimum and a maximum. Since the definition of a possible instance is recursive, we can say that every possible instance is a fixed-point. The schema interpretation is then to be specified by choosing the possible instances which are legal instances, that is, which are accepted as instances of a given schema. A possible choice is to adopt the descriptive semantics, which accepts any possible instance as a legal instance. The descriptive semantics has the drawback of allowing many different instances for the same domain. Another choice is the least fixed-point semantics, which corresponds to a bottom-up computation and leads to the legal instance I1. In particular, the least fixed-point solution does not allow cycles in the data³. From a semantic point of view this happens to be satisfactory when the cycle involves only one relationship, as in Structure, while it is less acceptable when more relationships are involved, as in Clerk and Department. The greatest fixed-point semantics corresponds to a top-down evaluation and leads to the legal instance I3. In this case, the oids {o10, o11, o12} and {o8, o9} are inserted in I(Clerk) and I(Department) respectively. On the other hand, the cyclic objects o1, o2, o3 are now inserted in I(Structure).
In [Neb91] an extensive survey on terminological knowledge representation formalisms with cycles is given, and the computational problems are also considered, whereas in [BeBe92] the adoption of one semantics or another, with respect

³ Cycles in the data exist when an element is connected to itself via one or more relationships.
to complex object data models, is discussed. Our choice, as in [BN91], is to adopt the greatest fixed-point, since it is more adequate to our intuition when dealing with cyclic class definitions, such as Clerk and Department. It is worth remembering that a greatest fixed-point solution is also adopted in [LR89], but it is used there to compute the greatest possible extension of classes, while the actual extensions are strictly user-defined; in our view, this corresponds to a schema with primitive classes only.
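The coexistence of several fixed points, and the lfp/gfp alternative discussed above, can be reproduced on a toy rendering of the Clerk/Department cycle. This is our own encoding (not the paper's formalism): Employee's extension is taken as fixed, an object is a Clerk if it works in a Department, and a Department if its head and employees are Clerks.

```python
# table-3 data, restricted to the attributes relevant to the cycle
works_in = {"o10": "o8", "o11": "o8", "o12": "o9"}
head     = {"o8": "o10", "o9": "o12"}
employs  = {"o8": {"o10", "o11"}, "o9": {"o12"}}
employees = {"o10", "o11", "o12"}
divisions = {"o8", "o9"}

def step(clerk, dept):
    """One application of the recursive class definitions."""
    new_clerk = {o for o in employees if works_in[o] in dept}
    new_dept = {d for d in divisions
                if head[d] in clerk and employs[d] <= clerk}
    return new_clerk, new_dept

def fixpoint(clerk, dept):
    while True:
        nxt = step(clerk, dept)
        if nxt == (clerk, dept):
            return nxt
        clerk, dept = nxt

print(fixpoint(set(), set()))            # bottom-up: both classes stay empty (lfp)
print(fixpoint(employees, divisions))    # top-down: everyone is accepted (gfp)
```

Starting from the bottom mirrors I1 (Clerk and Department empty), starting from the top mirrors I3 ({o10, o11, o12} and {o8, o9}).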
5 Legal Instance of a LOGIDATA*_cyc Schema

Definition 8 (Legal Instance). The legal instance of a LOGIDATA*_cyc schema is the greatest fixed-point of all the possible instances which have identical interpretation of the atomic primitive classes.

This section aims to provide an operational definition of the legal instance, hence the corresponding algorithm. In particular, the schema is transformed in order to introduce the atomic primitive classes and find a valid starting point from which the greatest fixed-point can be computed.

5.1 Canonical Schema Generation
Let S = (C_P, C_D, T, TYP, ISA) be a schema, associated with a set of type descriptors T(L, B, C, T), and Σ a domain. In the previous section, the interpretation of the primitive classes has been used as a starting point for the interpretation of the defined classes. In general, this operation is subject to some constraints, since the description of a primitive class contains a set of necessary conditions to be satisfied by its instances. The constraints are due both to the structures and to the interpretations of other classes. Let us consider, for instance, with reference to the example of the previous section, the object o13 with the following value:

[name: "Store", employs: {o10}, head: o11, turnover: 10, collaborates: {}]

The arbitrary assignment o13 ∈ I(Section) may not be allowed or, in other words, this interpretation is not a good starting point, since it depends on the assignment o11 ∈ I(Clerk), and I(Clerk) must be computed, as Clerk is a defined class. In order to obtain a correct initial assignment, we define a new schema S̄, called canonical schema, which is obtained from S by substituting each primitive class C_P with a new primitive class C̄_P, called atomic primitive class, without any structure, plus a defined class with the same structure as C_P. The transformation is shown in figure 2: the portion of schema of part a) is transformed into the portion of part b). Formally, we associate to a schema S over T(L, B, C, T) a schema S̄ = (C̄_P, C̄_D, T̄, TYP̄, ISĀ_U) with the following properties:
[name: "Store", employs : {o10}, head: o11, turnover: 10, collaborates: {}] The arbitrary assignment o13 E 77(Section), may not be allowed or, in other words, this interpretation is not a good starting point, since it depends on the assignment oll E 2: (Clerk) and 27(Clerk) must be computed, as it is a defined class. In order to obtain a correct initial assignment, we define the new schema S, called canonical schema, which is obtained from S by substituting each primitive class Cp with a new primitive class Cp called atomic primitive class without any structure, and a defined class with the same structures as Cp. The transformation is shown in figure 2: the portion of schema of part a) is transformed into the portion of part b). Formally, we associate to a schema S over T(L, B, C, T) a schema S = ( Cp, CD, T, TYP, ISAo) over T(L, B, C, T) with the following properties:
116
1. C = C p [.J C D such that CD = Cp U CD, and C p is an isomorphic copy of Cp. 2. T = 0. In order to simplify the formalization it is convenient to have a schema S not containing the type names, by replacing the type name T by TYP(T). This is recursively produced by the function: 7 :T(L,B,C,T)--*T(L,B,C,T)
7 (t) = TYP(t) if t 9 T 7(t) = t i f t
9 BUC
7 ({t}(min,rnax)) -- {7 (t)}(min,max) 7 (emptyset) = emptyset
7([11 :tl,...,lk :tk])
=
[ll :7(tl),...,lk :7(tk)]
Since in set T there is a strict partial order and T is finite, there exists a natural number n such that 7 n = 7 n+l and we set ~ = {Tn(t) It e 7-}. 3. TYP(C) = TYP(C) V C 9 CD TYP(Cp) = TYP(T) VCp 9 Cp 4. ISAu is the same as ISAu with the following additional relations: Cp ISAo Cp
VCp E Cp
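Step 2 above, the elimination of type names via τ, can be sketched as follows. The tagged-tuple encoding of descriptors is our own; termination relies exactly on the acyclicity of depends on over type names required by definition 6.

```python
def tau(t, TYP, type_names):
    """Expand every type name in descriptor t using TYP; leave base
    types and class names untouched. Assumes type names are acyclic."""
    kind = t[0]
    if kind == "name":
        n = t[1]
        return tau(TYP[n], TYP, type_names) if n in type_names else t
    if kind == "set":                          # ("set", elem, min, max)
        return ("set", tau(t[1], TYP, type_names), t[2], t[3])
    if kind == "tuple":                        # ("tuple", {label: type})
        return ("tuple", {l: tau(ty, TYP, type_names)
                          for l, ty in t[1].items()})
    return t                                   # emptyset

# table-2 fragment: StringSet is a type name, string is a base type
TYP = {"StringSet": ("set", ("name", "string"), 1, 3)}
t = ("tuple", {"qualification": ("name", "StringSet")})
print(tau(t, TYP, {"StringSet"}))
```

The paper iterates τ until τ^n = τ^(n+1); the recursive expansion above is equivalent under the same acyclicity assumption.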
Ī will indicate a possible instance of the schema S̄.

Proposition 9. For any possible instance I of S there exists a possible instance Ī of S̄, and vice versa, such that

Ī(C) = I(C)  ∀C ∈ C̄_D

Proof Sketch. Let I be a possible instance of S; it follows immediately that Ī with Ī(C) = I(C) ∀C ∈ C̄_D is a possible instance of S̄. In the opposite direction, it is easy to find an interpretation of the atomic primitive classes such that the result follows. □

Having proved the equivalence between the possible instances Ī of S̄ and I of S, in the following we will use I for simplicity. We are now able to formalize the intuitive result that the possible instance of an acyclic defined class can be uniquely constructed from the initial partial interpretations that assign oids to the atomic primitive classes. Such initial partial interpretations will be denoted by Î.

Proposition 10. Let S̄ be a schema without cyclic descriptions. For any domain, any initial partial interpretation Î can be uniquely extended to a possible instance of S̄, and then of S.
Proof Sketch. If there are no cyclic descriptions then the class names can be ordered so that each class name C ∈ C̄_D depends only on preceding class names, and there are class names which do not depend on any other class name (only on ⊤). As the interpretations of the base types are predefined, and those of the atomic primitive classes are given, the interpretations of the class names can be constructed in the obvious way, starting from the class names which do not depend on other class names. □

The property of uniqueness will be maintained also in the presence of classes with cycles. This result will be obtained by means of the fixed-point semantics, as introduced in the previous section.
[Figure 2 omitted: part a) shows the primitive classes person and employee with the defined class clerk; part b) shows each primitive class replaced by a structureless atomic primitive copy plus a defined class carrying the original structure.]

Fig. 2. Transformation of the schema S into the schema S̄
5.2 Computation of the Legal Instance
In the present section we introduce a complete lattice of interpretations Ψ and a mapping Γ : Ψ → Ψ. A fixed-point of Γ will be a possible instance of a
LOGIDATA*_cyc schema. This allows the application, to the computation of the legal instance, of some results concerning fixed-point techniques, drawn from [Sch86, Llo87, Baa90]. Let T(L, B, C, T) be a set of types and Σ a domain. We denote with Ψ the set of all interpretations over Σ with the same initial partial interpretation Î⁴, i.e., all interpretations that have identical interpretation of the atomic primitive classes. The relation ⊑ is defined on Ψ as follows:

I' ⊑ I  iff  I'(C) ⊆ I(C)  ∀C ∈ C̄_D

i.e., Ψ is ordered componentwise by the inclusion relation over the interpretations of the defined classes⁵. Ψ ordered by ⊑ is a complete lattice; least upper bounds (lub) are obtained by componentwise set union and greatest lower bounds (glb) by componentwise set intersection. The least element of Ψ is I_⊥ (I_⊥(C) = ∅, ∀C) and the greatest is I_⊤ (I_⊤(C) = O, ∀C). Let S̄ be a schema over T(L, B, C, T) and Γ a function mapping interpretations to interpretations as follows:
Γ(I)(C) = {o ∈ O | δ(o) ∈ I(TYP(C))} ∩ ( ∩_{C ISA_U C'} I(C') )
By comparing the above equation with the possible instance of the defined classes, we can see that an interpretation I ∈ Ψ is a possible instance of S̄ over Σ if and only if I is a fixed-point of Γ. The function Γ is monotonic on Ψ (this follows immediately from the fact that Γ is downward ω-continuous, as proved in the next section); consequently, Γ has a fixed-point or, equivalently, any Î can be extended to a possible instance of S̄. More precisely, Γ has a greatest fixed-point, a least fixed-point and, possibly, other fixed-points in between. By proposition 10, in a schema S̄ without class names having circular descriptions, the least fixed-point and the greatest fixed-point are identical, i.e., Γ has a unique fixed-point. In the following we show how the greatest legal instance can be constructed from a given initial partial interpretation.

Proposition 11. Let Ψ be the set of all interpretations over Σ and Γ the corresponding mapping. Then

gfp(Γ) = ∩_{i ≥ 0} Γ^i(I_⊤)

⁴ The dependence of Ψ on Σ will be left unexpressed.
⁵ We note that, for all types t, if I' ⊑ I then I'(t) ⊆ I(t) ∀Σ, i.e., the relation ⊑ can be extended to all types.
Proof. First of all, we show that the function Γ is upward ω-continuous, i.e., for any increasing chain I_1 ⊑ I_2 ⊑ ... we have Γ(lub({I_i; i ≥ 0})) = lub({Γ(I_i); i ≥ 0}). In fact, let X = {I_1, ..., I_n} be a subset of Ψ, and let I be the interpretation defined by:

I(C) = ∩_{k=1}^{n} I_k(C),  C ∈ C̄_D

Then, for all t ∈ T: I(t) = ∩_{k=1}^{n} I_k(t). Furthermore, from the definition of Γ it is easy to demonstrate that

Γ(I) = ∩_{k=1}^{n} Γ(I_k)

Hence the mapping Γ is downward ω-continuous, and thus the following result holds: gfp(Γ) = glb({Γ^n(I_⊤); n ≥ 0}). □
We are now able to give the algorithm for the computation of the greatest legal instance. First of all, we recall that Γ^0(I_⊤) = I_⊤ and Γ^i(I_⊤) = Γ(Γ^(i-1)(I_⊤)); Γ is monotonic, hence by induction Γ^(i-1)(I_⊤) ⊒ Γ^i(I_⊤). Consequently, from the expression of proposition 11 it follows that:

gfp(Γ) = Γ^n̄(I_⊤)

where n̄ is the least natural number such that Γ^(n̄+1)(I_⊤) = Γ^n̄(I_⊤). In order to simplify the algorithm it is convenient to find another expression for the greatest fixed-point of Γ. The notion of atomic primitive generalizations is introduced as follows:

G_P(C) = {C̄_P | C ISA_E C̄_P}

In other words, G_P(C) is the set of atomic primitive classes reached through the ISA_E relationship. Hence a possible instance can be expressed, according to the equation of the possible instance of the defined classes, as follows:
I(C) = ∩_{C̄_Pj ∈ G_P(C)} Î(C̄_Pj) ∩ {o ∈ O | δ(o) ∈ I(TYP(C))} ∩ ( ∩_{C ISA_E C_i} {o ∈ O | δ(o) ∈ I(TYP(C_i))} )    (4)

where Î is the initial partial interpretation introduced above. By observing the above equation it is natural to choose, as the initial interpretation, the intersection of the atomic primitive ancestors, that is:

I*(C) = ∩_{C̄_Pj ∈ G_P(C)} Î(C̄_Pj)
Obviously, for any fixed-point I of Γ we have I ⊑ I*; hence gfp(Γ) is the greatest fixed-point of Γ which is less than or equal to I*. Then gfp(Γ) can be expressed as glb({Γ^n(I*); n ≥ 0}); thus, by the above considerations, we have gfp(Γ) = Γ^n̄(I*), where n̄ is the least natural number such that Γ^(n̄+1)(I*) = Γ^n̄(I*). Finally, in table 4 we show the algorithm for the computation of the greatest legal instance. As an example of computation, let us consider the following Î:
• Initialization and iteration
  - class types:
    if i = 0 then I_0(C) = ∩_{C̄_Pj ∈ G_P(C)} Î(C̄_Pj);
    if i > 0 then I_i(C) = I_0(C) ∩ {o ∈ O | δ(o) ∈ I_{i-1}(TYP(C))} ∩ ( ∩_{C ISA_E C'} {o ∈ O | δ(o) ∈ I_{i-1}(TYP(C'))} );
  - base types: I_i(B_k) = D_k;
  - set types: I_i({t}(n,m)) = {v_s ∈ V_S | n ≤ |v_s| ≤ m and v_s ⊆ I_i(t)};
  - empty set: I_i(emptyset) = {∅};
  - tuple types: I_i([l1 : t1, ..., lk : tk]) = {v_t ∈ V_T | v_t is defined at least on {l1, ..., lk} and v_t(l_i) ∈ I_i(t_i), i = 1, ..., k}.
• Stop
  - I_i = I_{i+1}

Table 4. Algorithm 1 (computation of the legal instance)
Î(Person) = Î(Employee) = {o10, o11, o12}
Î(Organization) = Î(Division) = {o1, o2, o3, o4, o5, o6, o7, o8, o9}

Table 5 illustrates the activity of algorithm 1 for the computation of the gfp-instance, given the above initial partial interpretation.
Class      | I_0           | I_1           | I_2           | I_3 = I
Structure  | o1, ..., o9   | o1, ..., o7   | o1, ..., o7   | o1, ..., o7
Clerk      | o10, o11, o12 | o10, o11, o12 | o10, o11, o12 | o10, o11, o12
Department | o1, ..., o9   | o8, o9        | o8, o9        | o8, o9
Typist     | o10, o11, o12 | o12           | o12           | o12
Office     | o1, ..., o9   | o8, o9        | o9            | o9

Table 5. Example of computation of a gfp-instance
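The convergence shown in table 5 can be reproduced with a short sketch of algorithm 1, restricted to the Typist/Office fragment and using only the I_0-intersection form of the class-type step (no superclass-type terms). The descriptor/value encoding and helper names are our own, and the qualification attribute is omitted for brevity.

```python
# δ for the relevant oids (only the attributes the two types inspect)
delta = {
    "o8":  {"head": "o10", "employs": frozenset({"o10", "o11"})},
    "o9":  {"head": "o12", "employs": frozenset({"o12"})},
    "o10": {"work-in": "o8"},
    "o11": {"work-in": "o8"},
    "o12": {"work-in": "o9", "level": 1},
}

TYP = {
    "Typist": ("tuple", {"work-in": ("class", "Office"),
                         "level": ("base", "integer")}),
    "Office": ("tuple", {"head": ("class", "Typist"),
                         "employs": ("set", ("class", "Typist"), 1, 5)}),
}

def sat(v, t, I):
    """v ∈ I_i(t), where class references are looked up in I (= I_{i-1})."""
    kind = t[0]
    if kind == "base":
        return isinstance(v, int)            # integer is the only base here
    if kind == "class":
        return v in I[t[1]]
    if kind == "set":
        _, elem, lo, hi = t
        return lo <= len(v) <= hi and all(sat(x, elem, I) for x in v)
    return all(l in v and sat(v[l], ty, I) for l, ty in t[1].items())

I0 = {"Typist": {"o10", "o11", "o12"}, "Office": {"o8", "o9"}}
I = dict(I0)
while True:                                   # iterate Γ until I_i = I_{i+1}
    new = {C: {o for o in I0[C] if sat(delta[o], TYP[C], I)} for C in I0}
    if new == I:
        break
    I = new
print(I)   # the gfp of table 5: Typist = {o12}, Office = {o9}
```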
6 Subsumption

The subsumption relationship can be introduced with reference to the legal instance.

Definition 12 (Subsumption). Given two types t and t' of a LOGIDATA*_cyc schema S, we say that t' subsumes t, written t ≤ t', if and only if:

∀Σ, ∀I over Σ: I(t) ⊆ I(t')
It immediately follows that ≤ is transitive, reflexive and not symmetric (i.e., it is a pre-order) and induces an equivalence relation ≃ on types:

t ≃ t'  iff  t ≤ t' ∧ t' ≤ t
A class which explicitly inherits properties from another class is, of course, subsumed by that class.

Proposition 13. Given a LOGIDATA*_cyc schema S and two classes C, C' ∈ C: C ISA_U C' ⇒ C ≤ C'.

Proof Sketch. The proof follows immediately from the definition of I(C). □

The converse does not generally hold, as we will show by computing that Typist ≤ Secretary, Typist ≤ Clerk and Office ≤ Department even if not explicitly stated.
7 Coherent and Consistent Schemata

One of the main objectives of taxonomic reasoning in a database environment is the validation of a schema. In a LOGIDATA*_cyc schema there are two possible sources of validity problems: type definitions and ISA relationships. For this reason we will provide two definitions related to validity: coherence, with respect to types, and consistency, with respect to ISA and inheritance. Coherence and consistency can be violated independently. Nevertheless, we are interested in schemata which are both coherent and consistent, and it is possible to give a unique syntactical characterization for the two properties.
Definition 14 (Coherent types). Given a schema S = (C_P, C_D, T, TYP, ISA_U), a type t in S is coherent if and only if:

∃Σ, ∃I over Σ: I(t) ≠ ∅
A schema S is coherent if and only if it contains only coherent types. As mentioned in section 4, the type specification of a class C must agree with the type specifications of the superclasses of C. We are now able to formally specify this agreement by means of the notion of consistent schema.

Definition 15 (Consistency and inheritance). Given a schema S = (C_P, C_D, T, TYP, ISA_U), a class C ∈ C is consistent with respect to inheritance if and only if:

TYP(C) ≤ TYP(C')  ∀C' : C ISA_U C'

A schema S is consistent if and only if it contains only consistent classes. In a consistent schema we do not allow two related classes (C ISA_U C') to have arbitrary associated types; the type associated to a class describes the internal structure of the instances of the class. An instance of a class (say Employee) being also an instance of its superclass Person, we want the instances to share common structures. It is worth noting that the concepts of consistency and coherence are independent. For example, let us consider the classes Wide-Division, Recorder and Middle-Division with the following descriptions:

class Wide-Division   = isa Division [employs: {Employee}(30,40)]
class Recorder        = isa Employee [work-in: Wide-Division]
class Middle-Division = isa Division [employs: {Employee}(10,30)]

and with the following types:

TYP(Wide-Division)   = [name: string, employs: {Employee}(30,40), turnover: integer]
TYP(Recorder)        = [name: string, birth-date: Date, salary: integer, work-in: Wide-Division, emp-code: integer]
TYP(Middle-Division) = [name: string, employs: {Employee}(10,30), turnover: integer]

In LOGIDATA*_cyc incoherence can derive from the redefinition of common labels. The Wide-Division class is not coherent, as the employs label is redefined on a type obviously disjoint from the corresponding type of Division, and is not consistent (TYP(Wide-Division) ≰ TYP(Division)). The Middle-Division class is not consistent (TYP(Middle-Division) ≰ TYP(Division)) but is coherent (in fact, it admits the objects of the Division class with n employees, where 10 ≤ n ≤ 20). Finally, as TYP(Recorder) is not coherent and is thus subsumed by any other type, the Recorder class is not coherent but is consistent.
7.1 Computation of the Legal Instance

In this section we will show how the computation of the legal instance can be simplified when the schema is coherent and consistent. Intuitively, by comparing equation 4 and definition 15, it can be seen that the following set:

∩_{C ISA_E C_i} {o ∈ O | δ(o) ∈ I(TYP(C_i))}

does not influence any legal instance I(C) if the schema is coherent and consistent. The effect of coherence is not obvious, due to the presence of cyclic defined classes, as shown in the following example. Let us consider the defined classes C1, C2 and C3 with C2 ISA_U C1, TYP(C1) = [a : integer], TYP(C2) = [a : C3], TYP(C3) = [b : C2]. The class C2 is consistent but not coherent, as I(TYP(C2)) is disjoint from I(TYP(C1)); we cannot then simplify I(C2) by eliminating the set {o ∈ O | δ(o) ∈ I(TYP(C1))}. For this reason, we will consider a simplified schema, say S̃, derived from S̄, where the ISA_E assertions between defined classes are ignored, since they are implicitly contained in the function TYP. The portion of schema of part b) of figure 2 is thus transformed as shown in figure 3. We will then prove the equivalence of the computation of the legal instance in S̄ and in S̃. Let S = (C_P, C_D, T, TYP, ISA_U) be a coherent and consistent schema and S̄ the associated canonical schema defined as in section 5. It has already been proved that the computation of a legal instance of S is equivalent to the computation of a legal instance of S̄. In the simplified schema S̃ = (C̄_P, C̄_D, T̄, TYP̄, ISÃ_U) the ISÃ_U relationship is defined as follows:

C ISÃ_U C̄_P  iff  C ISA_E C̄_P,  ∀C ∈ C̄_D, ∀C̄_P ∈ C̄_P
Proposition 16. Let S be a coherent and consistent schema, I a primitive assignment, I the corresponding legal instance of S and Ĩ the corresponding legal instance of S̃. Then

    I(C) = Ĩ(C),  ∀C ∈ C_D
Proof. The proof is in [Ben91].

The new formulation of the legal instance allows algorithm 1 to be simplified by rewriting the computation for class types as follows:

- class types:
  - If i = 0 then I_0(C) = ∩_{Cp_j ∈ GP(C)} I(Cp_j);
  - If i > 0 then I_i(C) = I_0(C) ∩ {o ∈ O | δ(o) ∈ I_{i-1}(TYP(C))}
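As an illustration only, the simplified iteration can be mimicked on a toy database. The class names, objects and type test below are invented; the loop mirrors the two cases above (initialization from the primitive superclasses, then repeated intersection with I_{i-1}(TYP(C))).

```python
# Toy illustration of the simplified legal-instance iteration for a
# defined class. Hypothetical data: defined class Adult collects the
# Persons whose tuple satisfies TYP(Adult) = [age: (18..150)].

objects = {                      # oid -> associated tuple (the delta function)
    "#p1": {"name": "ann", "age": 30},
    "#p2": {"name": "bob", "age": 12},
    "#p3": {"name": "carl", "age": 45},
}
person = set(objects)            # primitive assignment for class Person

def satisfies_typ_adult(tup):
    return isinstance(tup.get("age"), int) and 18 <= tup["age"] <= 150

# i = 0: start from the intersection of the primitive superclasses
adult = set(person)
# i > 0: keep only objects whose tuple lies in I(TYP(Adult)); iterate
# until the (greatest) fixpoint is reached
while True:
    new = {o for o in adult if satisfies_typ_adult(objects[o])}
    if new == adult:
        break
    adult = new

print(sorted(adult))  # ['#p1', '#p3']
```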
[Figure: ISA links among the classes person*, person, employee, clerk]
Fig. 3. Transformation of the schema S̄ into schema S̃
7.2 Subsumption Computation
The purpose of this section is to give a syntactic characterization of the subsumption relationship and of the coherence and consistency checks. In the following we will show how the computation of subsumption for a consistent and coherent schema can be performed by the boolean function subs on the basis of a syntactic comparison of type descriptions⁶. The same function subs can be used to check whether a syntactically correct schema S is coherent and consistent. Notice that we cannot adopt an algorithm like the one presented in "Introducing taxonomic reasoning in LOGIDATA*", as the presence of cyclic defined classes could lead to endless loops. As an example, consider the case of the classes Office and Department: the computation of Office ≤ Department leads to a new computation of some comparison Office ≤ Department. The structure of the algorithm is strictly related to that of algorithm 1. In particular, the initialization step for classes takes, as a starting point, the subsumptions which immediately follow from the user-provided ISA hierarchy. The implications which are not compatible with the type descriptions are then discarded.

⁶ The algorithm considers only types in canonical form; the extension to types with names is straightforward.
subs is a function subs: T × T → {true, false} defined as:

* Initialization and iteration
  - class types:
    - If i = 0 then subs_i(C',C) = true iff GP(C') ⊆ GP(C);
    - If i > 0 then subs_i(C',C) = true iff (subs_0(C',C) = true) ∧ (subs_{i-1}(TYP(C'),TYP(C)) = true);
  - base types:
    - subs_i(B',B) = true iff B = B';
  - set types:
    - subs_i({t'}(n,m), {t}(p,q)) = true iff (subs_i(t',t) = true ∧ n ≤ p ∧ m ≥ q);
    - subs_i({t'}(n,m), emptyset) = true iff n = 0;
    - subs_i(emptyset, emptyset) = true;
  - tuple types:
    - subs_i([..., l_k: t_k, ...], [..., l'_j: t'_j, ...]) = true iff ∀k ∃j: (l_k = l'_j) ∧ (subs_i(t_k, t'_j) = true);
* Stop
  - subs_{i+1} = subs_i □

Table 6. Algorithm 2 [Computation of subsumption]
The algorithm stops after a finite number of steps: the initialization sets a finite number of relationships and the subsequent steps can only exclude relationships (if subs_i = false then subs_j = false for j > i); hence there exists a natural number i such that subs_i = subs_{i+1}, and we set subs = subs_i.

Theorem 17. Given two types t and t' of a coherent and consistent schema S:

    t ≤ t'  ⇔  subs(t',t) = true.
Proof. T h e proof is in [Ben91].
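To make the fixpoint computation of algorithm 2 concrete, here is a small Python sketch on an invented three-class schema (the class names, GP and TYP below are ours, not the paper's). It follows the initialization from primitive superclasses and the iteration through TYP, with one simplification: the first conjunct uses subs_{i-1} instead of subs_0, which is equivalent at the fixpoint since entries can only turn false.

```python
# Hypothetical mini-implementation of Algorithm 2 (function subs).
# Types: ("class", name) | ("base", name) | ("set", t, n, m) | ("tuple", {label: type})

GP = {                      # primitive superclasses of each class (illustrative)
    "Person":   {"Person"},
    "Employee": {"Person", "Employee"},
    "Clerk":    {"Person", "Employee", "Clerk"},
}
TYP = {                     # canonical tuple type of each class (illustrative)
    "Person":   ("tuple", {"name": ("base", "string")}),
    "Employee": ("tuple", {"name": ("base", "string"),
                           "salary": ("base", "integer")}),
    "Clerk":    ("tuple", {"name": ("base", "string"),
                           "salary": ("base", "integer"),
                           "desk": ("base", "integer")}),
}

def eval_subs(table, sup, sub):
    """subs_i(sup, sub): structural comparison, consulting `table` on class pairs."""
    if sup[0] == "class" and sub[0] == "class":
        return table[(sup[1], sub[1])]
    if sup[0] == "base" and sub[0] == "base":
        return sup[1] == sub[1]
    if sup[0] == "set" and sub[0] == "set":
        _, t_sup, n, m = sup
        _, t_sub, p, q = sub
        return eval_subs(table, t_sup, t_sub) and n <= p and m >= q
    if sup[0] == "tuple" and sub[0] == "tuple":
        # every label of the supertype must appear, suitably refined, in the subtype
        return all(lbl in sub[1] and eval_subs(table, t, sub[1][lbl])
                   for lbl, t in sup[1].items())
    return False

classes = list(GP)
# i = 0: initialization from the primitive superclasses
table = {(c1, c2): GP[c1] <= GP[c2] for c1 in classes for c2 in classes}
# i > 0: iterate until the fixpoint; steps can only turn entries false
while True:
    new = {(c1, c2): table[(c1, c2)] and eval_subs(table, TYP[c1], TYP[c2])
           for c1 in classes for c2 in classes}
    if new == table:
        break
    table = new

print(table[("Employee", "Clerk")])  # True: Clerk <= Employee
print(table[("Clerk", "Employee")])  # False
```

Because the table is finite and entries only move from true to false, the loop terminates, mirroring the stopping argument above.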
An immediate consequence of this result is a partial inverse of proposition 13, valid only for primitive classes.

Corollary 1. Given a coherent and consistent schema S and two classes Cp ∈ C_P and C ∈ C:

    C ≤ Cp  ⇔  C ISA_u Cp.
The following theorem shows how checking the coherence and checking the consistency of a schema can be performed by algorithm 2.

Theorem 18. A schema S is coherent and consistent iff ∀C, ∀Ci such that C ISA_D Ci:

    subs(TYP(Ci), TYP(C)) = true. □

Proof. The proof is in [Ben91].
8 Examples of Taxonomic Reasoning
Table 7 shows an example of computation of the function subs for the schema introduced in section 1. The table includes only the most significant results.
subs(C, C')              subs0  subs1  subs2  subs3  subs4  subs
(Clerk, Employee)        true   true   false  false  false  false
(Clerk, Typist)          true   true   true   true   true   true
(Typist, Clerk)          true   false  false  false  false  false
(Clerk, Secretary)       true   true   false  false  false  false
(Secretary, Clerk)       true   false  false  false  false  false
(Typist, Employee)       true   false  false  false  false  false
(Typist, Secretary)      true   false  false  false  false  false
(Secretary, Typist)      true   true   true   true   true   true
(Department, Division)   true   false  false  false  false  false
(Department, Office)     true   true   true   true   true   true
(Office, Department)     true   true   false  false  false  false
(Department, Section)    true   true   true   false  false  false
(Office, Division)       true   false  false  false  false  false
(Office, Section)        true   true   false  false  false  false

Table 7. Selected values for function subs
It is easy to extend the results of table 7 in order to apply theorem 18 and to verify that the given schema is coherent and consistent. Consequently, we can say that subs computes the subsumption for the given schema. Besides the most intuitive subsumptions, subs computes the following relations: Typist ≤ Secretary, Typist ≤ Clerk and Office ≤ Department. As an example of the effectiveness of subsumption computation for schema consistency checks, we add the following class definition:

class Adm-Department = isa Department [employs: {Typist}(1,5)]
If the consistency check were performed only through explicit isa relationships, Adm-Department would be detected as inconsistent with respect to strict inheritance, as the redefined property employs is typed with Typist, which has not been explicitly declared as a specialization of Clerk. By subsumption, Typist is computed as a specialization of Clerk; therefore the class Adm-Department is recognized as consistent with respect to the schema and automatically classified as a specialization of the Department class. Furthermore, by subsumption computation, we can detect equivalence between classes with different explicit descriptions, thus detecting synonymous classes and avoiding redundancies. Suppose the following classes X and Y are given:

class X = isa Employee, Secretary [work-in: Y]
class Y = isa Department [head: X, employs: {X}(1,5)]

The following equivalences are detected: X ≃ Typist, Y ≃ Office. Therefore, X and Y can simply be stored as synonyms. It is worth noting that all three semantics (greatest, descriptive and least) mentioned in section 4 produce legal instances which can be considered valid (i.e., useful in some respect). Our choice of the greatest fixed-point pushes objects down in the hierarchy (i.e., towards more specific classes), and this allows a set of inferences which are, in our opinion, more interesting. Algorithm 2 can be used to restructure a schema, and the result is a schema which is equivalent to the original one only with respect to the greatest fixed-point. Roughly speaking, this leads to the computation of a greater set of ISA relationships and, possibly, to considering equivalent classes which, under other, more conservative semantics, could be different.
References

[Baa90] Baader, F. Terminological cycles in KL-ONE-based KR-languages. In Proceedings of the 8th National Conference of the American Association for Artificial Intelligence, Boston, Mass., 1990.
[Ben91] Beneventano, D. Computation of Subsumption with Cycles in LOGIDATA+. Technical Report 80, CIOC-CNR, Bologna, October 1991.
[BeBe92] Beneventano, D., and Bergamaschi, S. Subsumption for Complex Object Data Models. In Proceedings of the International Conference on Database Theory, Berlin, 1992. Springer Verlag.
[BN91] Bergamaschi, S., and Nebel, B. Theoretical foundations of complex object data models. Technical Report 5/91, CNR, Progetto Finalizzato Sistemi Informatici e Calcolo Parallelo, Sottoprogetto 5, January 1992.
[KV84] Kuper, G.M., and Vardi, M.Y. A new approach to database logic. In PODS '84, pages 86-96. SIGACT-SIGMOD-SIGART, ACM Press, 1984.
[LR89] Lecluse, C., and Richard, P. The O2 data model. In Int. Conf. on Very Large Data Bases, 1989.
[Llo87] Lloyd, J.W. Foundations of Logic Programming. Springer Verlag, Berlin, 1987.
[Neb91] Nebel, B. Terminological cycles: semantics and computational properties. In J. Sowa, editor, Principles of Semantic Networks. Morgan Kaufmann, 1991.
[Sch86] Schmidt, D.A. Denotational Semantics: A Methodology for Language Development. Allyn and Bacon, Boston, 1986.
Modeling Semantic Integrity Constraints in Object-Oriented Database Schemas

Anna Formica, Michele Missikoff
IASI CNR - Viale Manzoni 30, I-00185 Rome, Italy
Abstract. Recent years have witnessed a continuous evolution of database models towards richer and more expressive paradigms. Along the line of enriching the modeling capabilities, Object-Oriented databases (OODBs) have been introduced. In this paper, we propose a further enhancement to OODB models aiming at enriching the database schema by explicitly declaring semantic integrity constraints therein. In the paper, we present an Object-Oriented data definition language, referred to as TQL. It allows the construction of an OODB schema using well-known data structuring mechanisms, such as NF2 attributes, complex types, and multiple inheritance in class (type) hierarchies. In addition, TQL allows the further enrichment of the schema by expressing explicit integrity constraints. The proposal includes the definition of the formal semantics of TQL, according to a denotational approach, and the notion of correctness of schemas, with particular emphasis on the legality of ISA hierarchies. The proposed language is the basis of the prototype MOSAICO, an environment for the design and rapid prototyping of OODB applications developed at IASI.
1 Introduction

Object-Oriented database (OODB) systems appear to be the candidate successors of relational database systems. With respect to the latter, the former are characterized by richer functionalities and more expressive data languages [8], [2], [15], [30]. In particular, an OODB system is both a DBMS and an Object-Oriented system [6]. Being a DBMS, it is characterized by persistence and advanced data management functions, such as secondary storage management, concurrency, and ad hoc query facilities. Being an Object-Oriented system, it is endowed with the following features: object identity, complex objects (possibly recursively built), inheritance and hierarchical organization of types and classes, encapsulated methods, overriding combined with late binding, extensibility and computational completeness. The field of OODB systems appears, at present, characterized by a strong experimental activity which contrasts with the lack of a unifying view of the data
This research has been partially supported by "Progetto Finalizzato Sistemi Informatici e Calcolo Parallelo" of CNR, Subproject 5, Group Logidata+, and Subproject 6, Group Infokit.
model and formal foundations. It appears difficult to reach large consensus on a clear and formal specification of the model, on a declarative data manipulation language (something similar to SQL for relational databases), and on techniques for enhancing performance in data retrieval with query optimization. Furthermore, most current Object-Oriented database systems lack direct support for formulating complex semantic integrity constraints. Nevertheless, the existence of facilities to handle such constraints is an essential premise to improve the development and management of complex applications. In a data model, constraints are required for semantic and integrity reasons. In terms of semantics, they make it possible to enhance the expressiveness of the model by defining schemas which capture real world situations in a more detailed way. In terms of integrity, they prevent certain types of inconsistencies, caused by misunderstanding or inaccuracy, by restricting the possible database states to those that satisfy such constraints.
1.1 Related Work
In contrast with the great demand for data integrity in database applications, only a few OODB systems explicitly provide this type of capability and, in particular, a declarative constraint language. This issue has been mainly investigated for traditional database systems [33], in particular in the relational and deductive database areas. The latter, in particular, has been influenced by logic programming and, more recently, by Constraint Logic Programming [24]. Let us briefly survey the literature in these areas. In relational models, this problem has been widely studied. For example, in [11], [14], integrity constraints have been formalized by providing a general purpose Constraint Specification Language which is a natural extension of the data definition language. In [31], a model of integrity control is presented that addresses both validity and completeness of a relational database, also dealing with the integrity of the answers produced by queries. Integrity constraints have also been studied in the logic programming area. Constraint Logic Programming (CLP) [24], [13] is an extension of logic programming where the mechanism of unification is replaced by the more general notion of solvability of a set of constraints. Deductive databases have been developed taking advantage of the two above mentioned research areas: relational databases and logic programming [19]. The aim of this approach is to use logical languages to reason and query about database content. In this area, integrity constraints are closed first order formulas that the database is required to satisfy [22], [3], [29]. As said before, the problem of expressing declarative constraints in an Object-Oriented data model has been analyzed by only a few authors. We recall, for example, the ALICE language [36], [34], which is used to express complex constraints by means of Horn logic expressions.
Another proposal is related to the PIROL project [20], which is devoted to supporting complex objects and explicit semantic integrity constraints. Similarly, the ODE [21] system allows the definition of constraints, focusing on
actions to be taken in the presence of violations. These proposals are very rich. However, their complexity precludes the possibility of investigating theoretical aspects of database schemas and, in particular, of defining a clear declarative semantics, needed to support the design of correct database schemas. In this paper, we propose a formal model for Object-Oriented databases which allows the definition of a schema in terms of types and constraints. The goal is to supply a uniform frame for the database designer, to define at the intensional level not only the structural component (essentially the types) but also to enrich it with semantic integrity constraints. The intensional component of the model is presented by means of the language TQL (Typing as Query Language). TQL represents the synthesis of the work carried out both at a theoretical [18], [16] and an experimental [25] level, within the project MOSAICO at IASI-CNR, and the national research activity LOGIDATA+ [5], which aims at extending the functionalities of Object-Oriented database systems in the direction of deductive databases. The guidelines of TQL have been influenced by existing OODB systems, such as O2 [26], [27], [28], ORION [7], ODE [21], and by theoretical research in the field [9], [12], [4]. The main features of the TQL model are: (i) clear separation between types and classes (and, therefore, between intensional and extensional components of the database), (ii) formal definition of class as a collection of objects, extension of a type in the schema, (iii) possibility of expressing explicit integrity constraints within the database schema, (iv) complex types modeled as sets of necessary and sufficient conditions for the corresponding objects. These characteristics are formalized using a pure denotational semantics and represent new issues with respect to existing proposals in the field [27], [21], [7].
Existing systems present characteristics such as: (a) use of class names to define types, (b) semantics informally defined or, possibly, defined following an operational approach, (c) a notion of class derived from programming languages, i.e. as a structure encapsulating data and behaviour, rather than from databases, i.e. as the extension of a type declared in the schema, (d) absence of mechanisms associating integrity constraints to types in the schema, with integrity enforcement performed by methods, (e) complex types considered, essentially, for structuring purposes rather than for semantic integrity. The rest of the paper is organized as follows. Section 2 gives an overview of the main static characteristics of Object-Oriented data models, focusing on specific TQL features. In Section 3, the innovative features of the TQL data model are introduced: the uniform view of type structure with ISA hierarchy and integrity constraints. Section 4 presents the formal syntax of TQL and the notion of TQL schema. Furthermore, the definition of legal ISA hierarchy is given, and the normal form of types is introduced. In Section 5, the formal semantics of TQL types and integrity constraints is shown, followed by the definitions of interpretation and model of a TQL schema. Finally, conclusions are given.
2 Main Features of Object-Oriented Data Models

In this section, the main features of the static component of an Object-Oriented database model are presented [6]. It is agreed that a desirable feature of a database model is a clear separation between the intensional (i.e. schema) and extensional (i.e. data) components of the database [35]. This goal is not easily achieved in an OODB model, due to its origin rooted in programming languages and to the complexity of the model itself. For example, in the O2 data model, types are defined in terms of classes and their semantics are tightly interwoven. A second point is the co-existence of objects and values: what are their respective roles, and when to use one or the other. We believe that avoiding their mixing will substantially increase the simplicity of the model, without loss of expressiveness. Let us briefly analyze both these issues. In the TQL data model, the intensional component (i.e. the schema) is represented by type-definitions. Type-definitions are TQL formulas that can be composed without referring to database classes, objects, values, or any other extensional element. Classes and objects are not expressible in TQL: they belong to the realm of data. With this approach, we reach a complete separation between the intensional and extensional components of the database. In this respect, the approach is closely related to logical theories, where types correspond to logical formulas and classes to their interpretations. About the dichotomy between objects and values, we agree with [9] and, in particular, with the notion of abstract object. Abstract objects represent any conceivable conceptualization of real (and sometimes unreal) world entities, from a simple integer or string, such as 27 or "ferrari", to complex entities, such as persons and cars. Objects are denoted by a special class of names, called identifiers, or oids for short.
There is a special class of "built-in" objects, on which the system is able to perform well-defined manipulations, such as integers or strings. For those "fully axiomatized" objects, referred to as values, the state corresponds to the oid. In this perspective, values are reduced to a special case of oids, i.e. identifiers of fully axiomatized objects. Complex values do not exist; we have complex objects instead. Here, the Object-Oriented features of the TQL data model are briefly presented. They are illustrated following the points exposed by [6], with a few modifications due to the specificity of our proposal.

Objects and Values - Objects are the basic entities of an OODB, which is essentially seen as a collection of them. All objects are atomic and, unlike other proposals, they are not endowed with an "internal" state. Nevertheless, we distinguish elementary and complex objects, essentially on the basis of their relationships with other atomic objects. In this view, we can use the term state, for a complex object, to indicate the set of relationships with other objects. Such a state is associated to a complex object, and is represented by a tuple of labeled oids. Values are just special objects: they are elementary objects known to the system, and they do not need to be defined by an associated tuple.

Object identity - This is a central notion for OODB models. Every object has an identity, given to it when it is created. Such an identity is unique within the database
and immutable throughout its lifetime. Oids give the possibility to specify the relationships that complex objects have with other objects. Object sharing is a desirable characteristic that originates from the use of complex objects within tuples.

Complex objects - Complex objects are represented by those oids that have an associated tuple. Tuple constructors can be nested to any (finite) depth. Since we restrict the components to be only oids, we obtain complex objects inherently different from complex values [1], [23]. For example, elementary objects (values) are: 34, ted, fiat, john, 21, math, 150. Complex objects are: #per12, #car3, #stud1, and their corresponding tuples are: [name:ted, age:34, vehicle:#car3], [maker:fiat, speed:150], [name:john, age:21, vehicle:#car3, dept:math]. Note that, to increase readability, oids of complex objects start with the character # (pound).

Recursive objects - This is a feature common to expressive Object-Oriented data models. The use of oids of complex objects in a tuple allows the reference to another tuple which, in turn, can reference the former. There are no limitations in the use of this mechanism, allowing also oids referencing their own structures (i.e. self-recursive objects) or participating in a circular chain of arbitrary length. Below, two recursive objects are given.

mgr: (#empl2, [name:john, salary:50K$, head_of: #d3])
dept: (#d3, [name:toys, floor:4, manager: #empl2])

Type - A type is an abstract representation of the characteristics common to a set of objects. Given an object, it is always possible to decide whether it satisfies a type or not. Type-definitions are sentences of TQL having a label, a definition expressed as a tuple of typed properties and, possibly, one or more integrity constraints associated to them. A database schema consists of a collection of type-definitions which, therefore, represents the first information to be given when creating a database.
Objects can then be entered according to the types in the schema (unlike other systems, such as ORION [7], untyped objects are not allowed in TQL databases). The presence of type labels plays a primary role, allowing the definition of recursive types and, a very desirable feature, avoiding the use of class names in type definitions. In the following, two type-definitions are illustrated.

mgr := [name:string, salary:string, head_of: dept]
dept := [name:string, floor:integer, manager: mgr]
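The mutually recursive mgr/dept structures above can be sketched at the object level as labeled tuples of oids. The `db` dict and `follow` helper below are our illustration, not part of TQL:

```python
# Oid-based recursive objects, using the mgr/dept example from the text.
db = {
    "#empl2": {"name": "john", "salary": "50K$", "head_of": "#d3"},
    "#d3":    {"name": "toys", "floor": 4, "manager": "#empl2"},
}

def follow(oid, path):
    """Navigate labeled oids along a dotted path, returning the final value."""
    cur = oid
    for label in path.split("."):
        cur = db[cur][label]
    return cur

print(follow("#d3", "manager.head_of"))  # '#d3' -- the circular chain closes
```

Because components are only oids, the cycle between #empl2 and #d3 costs nothing to represent: each tuple just names the other object's identifier.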
Nested Tuples - In a tuple, a property can be typed using a type label or another tuple. The former allows object sharability at the extensional level; the latter allows the definition of nested tuple types whose instances will be represented by nested tuples of objects not sharable by other objects. For example:

person := [name:string, address: [street:string, city:string]]

Recursive Types - TQL allows a rich mechanism for recursion. We have a self-recursive type when at least one property is typed using the same label as the type-definition. In general, recursion is achieved when a type has a property typed with the label of a type-definition that, in turn, has a property typed with the former. The following is an example of a recursive type-definition (note the use of curly brackets to indicate multi-valued properties):

person := [name:string, age:integer, child:{person}]

Subtyping - Types are organized according to a generalization hierarchy, using the very powerful relationship of subsumption [10], also referred to as refinement [5]. The semantics of TQL subsumption includes the three kinds of structured inheritance - inclusion, constraint, and specialization inheritance [6] - introduced in existing OODB systems. Essentially, subtyping implies the inclusion between the corresponding extensions of types, i.e. between the sets of objects that satisfy such types (classes). Therefore, subtyping is a partial order relationship over the set of types.

Class - Classes form the extensional component of the database. There is one class for each type-definition in the schema. Classes are the repositories for objects; therefore, they represent an extensional notion. The notions of type and class are disjoint: they correspond, respectively, to the notions of formula and interpretation in a logical theory. In this view, the schema of a database is seen as a theory and the database as an interpretation. A database update represents a transition from one interpretation to another.
Extensibility - The set of types is extensible, in the sense that, given a database, new types can be defined, and there is no distinction between existing and newly defined types [6]. In the next section, we introduce the two main features of the TQL data model: the ISA hierarchy, based on strict refinement, and semantic integrity constraints. Their rigorous formulation and formal foundation are not common in existing Object-Oriented databases [6].
3 Enriching OODB Schemas with Constraints

The TQL data model allows the definition of implicit and explicit constraints. Implicit constraints are expressed inside the tuple of the type definitions as, for example, cardinality constraints or domain constraints. Explicit constraints are associated to
tuples. They enrich the expressiveness of the model, adding declarative knowledge to the schema and thus allowing the reduction of the procedural component of the database. We also consider inherent constraints, i.e. constraints that are imposed by the model itself, such as refinement constraints, related to the ISA construct, or aggregation constraints, related to tuples of typed properties. In this paper, the TQL refinement constraint will be analyzed only for what concerns the structural component of the language. Refinement is briefly introduced below, followed by a description of implicit and explicit integrity constraints. The next section will focus on refinement in a more detailed way.
3.1 The ISA Hierarchy

In TQL, the ISA construct is used to define a type in terms of its supertypes. This construct can be followed by a tuple of typed properties. This represents the structural definition of the type (the dynamic definition, represented by methods, will not be tackled in this paper). A type defined by means of the ISA construct inherits the typed properties of its supertypes (it inherits explicit constraints as well, but here we concentrate on the structural aspects only). For each typed property, inheritance can be absolute, composed or refined. Inheritance is absolute if the property belongs to only one supertype and is not re-defined in the subtype. Inheritance is composed if the property belongs to at least two supertypes. Finally, inheritance is refined if the property belongs to supertypes and to the definition of the subtype. If inheritance is absolute, the typed property is inherited without any modification. In the second case, a composition of typing for properties having the same name is required. As we will show in Section 4, such a composition is obtained by considering a type (if one exists) that is subsumed [18] by the given ones. Finally, in case of refined inheritance, an overriding is required. In particular, the typed properties of the supertypes will be overridden by the one specified in the definition, if the latter is a refinement of the former. In case of properties belonging to more than one supertype and also to the definition of the type, composition is applied before overriding. The above conditions determine the legality of ISA hierarchies. This issue will be formally tackled in Section 4. In this subsection, some examples are provided. The following ISA hierarchy is legal:

person := [name:string, age:integer]
parent := ISA person [age:(10..150), child:{person}1,5]

Note that TQL allows the typing of properties by explicitly defining a range of values in parentheses.
Furthermore, in the example, cardinality constraints are associated to the multi-valued property child. In applying the refined inheritance to the property age, the condition of refinement between the type integer and the set of values specified in the tuple is satisfied. Refinement is violated in the following type-definitions:
vehicle := [maker:string, color:{(red,blue,green)}1,3]
car := ISA vehicle [maxspeed:integer, color:{(yellow,red)}1,2]

because the types of the property color are not in subsumption relationship. In the following, we sketch the TQL refinement rules, which have been developed in a first version in [18], along the lines of [12]. Two types are in refinement (subsumption) relationship if the subtype is obtained from the supertype by: (i) adding one or more properties to the properties of the supertype, or (ii) refining the cardinality constraints of one or more multi-valued properties of the supertype, or (iii) refining one or more properties of the supertype by defining subsets of enumerated types, or (iv) refining one or more properties of the supertype by typing them with labels of refined types. A type-definition which is not defined by means of the ISA construct is said to be in normal form. In the next section, the problem of the legality of the ISA hierarchy is analyzed in detail, and the related problem of normal form reduction of a type-definition will be addressed.
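Rule (iii) for enumerated types amounts to a subset test, which is why the vehicle/car hierarchy above is illegal. A one-line sketch (the helper name is ours):

```python
# Enumerated-type refinement (rule iii): the subtype's value set must be
# a subset of the supertype's value set.
def refines_enum(sub_values, super_values):
    return set(sub_values) <= set(super_values)

# car redefines color as {yellow, red}: yellow is not admitted by vehicle
print(refines_enum({"yellow", "red"}, {"red", "blue", "green"}))  # False
# a redefinition such as {red, blue} would have been a legal refinement
print(refines_enum({"red", "blue"}, {"red", "blue", "green"}))    # True
```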
3.2 Implicit Constraints

Being expressed in the tuple, implicit constraints are introduced through the flexible typing mechanism for properties. Implicit constraints can be classified into the following four cases.

- Typing constraints. In the preceding section, we have seen that properties can be typed using type labels. For example, in the type-definition:

person := [name:string, age:integer, vehicle:{car}]

car is a type label which needs to be defined in terms of a TQL sentence. It imposes a typing constraint on the property vehicle, requiring that the vehicles of a person, whenever present, be cars.
- Domain constraints. TQL properties can also be typed using an enumeration or an interval specification. Domain constraints have the form of a list, or of an interval (in case of well-ordered domains), defined using constant symbols. In the following example:

car := [maker:(Fiat,Ford,BMW), m_speed:(120..200), owner:person]

the instances of the type car will only be objects produced by one of the three listed firms. Furthermore, the maximal speed will be any integer falling in the specified interval.

- Cardinality constraints. As already mentioned, multi-valued properties allow the association of a set of objects to another object. In TQL, it is possible to express constraints on the (minimal and maximal) cardinality of the sets denoted by multi-valued properties. For example:
person := [name:{string}1,3, age:integer, vehicle:{car}+]

If m,M represent the interval extremes, in TQL sentences the absence of curly brackets is a short-hand for m = M = 1, while curly brackets without cardinality constraints, or followed by the + symbol, are short forms for m = 0, M = ∞ and m = 1, M = ∞, respectively.

- Referential constraints. A referential constraint is obtained by combining typing and cardinality constraints. For example, in the following type-definition:
person := [name:string, age:integer, vehicle:car]

each person is required to have one (and only one) vehicle, and this vehicle must be a car.
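The cardinality short-hands above can be summarized in a small checker; this is our illustration of the (m,M) semantics, not TQL machinery:

```python
import math

def check_card(values, m, M=math.inf):
    """True iff the multi-valued property has between m and M members."""
    return m <= len(values) <= M

# person := [name:{string}1,3, vehicle:{car}+]
person = {"name": ["ann", "a."], "vehicle": []}
print(check_card(person["name"], 1, 3))  # True: 1 <= 2 <= 3
print(check_card(person["vehicle"], 1))  # False: '+' requires at least one
```

Plain `p:t` corresponds to check_card(..., 1, 1), `{t}` to (0, ∞), and `{t}+` to (1, ∞).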
3.3 Explicit Constraints
TQL explicit constraints are comparisons over properties, expressed using the dot-notation formalism. They are specified in the schema, being part of the type-definition. They are referred to as "θ-constraints", where "θ" stands for an arithmetic comparator, such as "=", "≠", "<", ">". Comparisons are declared between values and, more generally, oids. For example:

person := [name:string, age:integer, ssn:integer, child:{person}1,5],
    ic1: this.ssn ≠ person.ssn
teacher := ISA person [student:{person}+],
    ic2: this.student ≠ this.child
employee := ISA person [salary:integer, boss:employee],
    ic3: this.salary < this.boss.salary

In the above schema, three explicit constraints are given. Note that the first has the right hand side defined using a type label, while the other ones use the keyword "this". In particular, the constraint ic1 states that the social security number (ssn) of a person must be different from that of all other persons. The second expresses that teachers cannot teach their own children. The third states that an employee has to earn less than his boss. From this example it follows that, in explicit integrity constraints, a type label denotes an entire class, while the keyword "this" refers to a single object. Note that, when the right hand side of the comparison starts with the type label of the type being defined, as for the integrity constraint ic1, the type label denotes the corresponding class, less the object denoted by "this". In Section 5, we will see that a "θ-constraint" is assumed to be universally quantified on the denoted sets (universal quantification on paths). For instance, in the above example, ic2 requires that all the students of a teacher be different from all his/her children. Furthermore, in TQL, given a type-definition, explicit integrity constraints must be satisfied by every object in the associated class (universal quantification on classes).
In this sense, all the teachers have to satisfy the constraint ic2. With this assumption, the keyword "this" stands for a pseudo-variable ranging over the entire class.
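As an illustration of this double universal quantification, the following is a minimal sketch of a θ-constraint checker over a toy object store; the store layout, helper names, and sample objects are assumptions for illustration, not part of TQL:

```python
import operator

def path_values(store, oid, path):
    """Follow a dot-notation path from oid, collecting all reachable values."""
    current = {oid}
    for prop in path:
        nxt = set()
        for x in current:
            v = store.get(x, {}).get(prop)
            if v is None:
                continue
            nxt |= set(v) if isinstance(v, (set, list, tuple)) else {v}
        current = nxt
    return current

def holds(store, cls, lhs_path, op, rhs_path, rhs_class=None):
    """θ-constraint check: universally quantified on the class (every object
    plays 'this') and on the sets denoted by both paths."""
    for this in cls:
        lhs = path_values(store, this, lhs_path)
        if rhs_class is None:                 # this.<path> on the right too
            rhs = path_values(store, this, rhs_path)
        else:                                 # type-label right-hand side:
            rhs = set()                       # the class minus 'this'
            for z in rhs_class - {this}:
                rhs |= path_values(store, z, rhs_path)
        if not all(op(y, z) for y in lhs for z in rhs):
            return False
    return True

store = {"p1": {"ssn": 111, "child": {"p2"}},
         "p2": {"ssn": 222, "child": set()}}
persons = {"p1", "p2"}
# ic1: this.ssn ≠ person.ssn
print(holds(store, persons, ["ssn"], operator.ne, ["ssn"], rhs_class=persons))  # → True
```

Note how the type-label case excludes the object bound to "this", matching the reading of ic1 above.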
4 Formal Issues on Database Definition

In this section, the syntax of the language TQL is formally presented. The language is a direct derivation of OOL, developed to analyze the relationships between Object-Oriented systems and the Frame-based systems proposed in Artificial Intelligence [32], [18]. Previous proposals of TQL have been presented in [16], [17]. In the second subsection, the formal definition of a TQL schema is given. Finally, the legality of the ISA hierarchy is formally introduced and, successively, the normal form reduction of a type-definition that is not in normal form is illustrated. These notions are the basis of the static verification performed to check the correctness of TQL schemas.
4.1 TQL Syntax

In TQL, we have terms and sentences. Terms can be t_terms, p_terms, and c_terms. A t_term is a type label. A p_term is a property name. A c_term is a constant, or a sequence of p_terms (path) preceded by a t_term or the keyword "this". Sentences can be atomic sentences, type-sentences, constraint expressions, and type-definitions. An atomic sentence is simply a name, e.g. car, person, color, integer, or an enumerated set, e.g. (4,8,9,22) or (red,green,yellow). A type-sentence is defined using two basic constructors (not mutually exclusive): ISA and tuple. A constraint expression is a binary comparison predicate taking two c_terms as arguments. A type-definition associates a t_term to a type-sentence or to an atomic sentence and, eventually, to one or more constraint expressions. In the following boxes, the formal syntax of TQL is presented: non-terminal symbols are in small plain characters, while terminal symbols are in bold. Symbols in italics are user-defined strings.
Definition 4.1 Syntax of TQL.

<type-definition> ::= t_term := <type> [, <c_expr>, ..., <c_expr>]
<type>            ::= <type-sentence> | <atomic>
<type-sentence>   ::= ISA t_term ... t_term <tuple> | <tuple>
<atomic>          ::= t_term | <basic> | (value_set)
<tuple>           ::= [<tp>, ..., <tp>]
<basic>           ::= integer | real | boolean | string | TOP
<tp>              ::= p_term:{<type>}m,M     with m, M ∈ ℕ ∪ {∞}, m ≤ M
The following box shows, in particular, how the (value_set) construct can be expressed. It can be enumerated, or an interval specification. The interval can be open or closed.
Definition 4.1 continued: the (value_set) construct.

• Enumerated. Examples: (red, green, yellow), (1,4,7,12)
• Interval specification:
  - (n..N): n and N are included
  - (n<..N): n is not included and N is included
  - (n..<N): n is included and N is not included
  - (..N): the interval is (−∞, N)
  - (n..): the interval is (n, +∞)
where n, N ∈ ℜ and n ≤ N.
TQL explicit integrity constraints are now formally introduced. They are binary comparison predicates taking c_terms as arguments. This simple form can be easily generalized to boolean expressions of binary comparisons [17]. A c_term is a constant, or it is defined by using a path. A path is a sequence of p_terms composed using the dot-notation formalism. Dot-notation is a means to traverse data connections in a database. It selects the oids on which the constraint must hold. The syntax of integrity constraints is given in the box below. The terminal symbol "this" stands for a pseudo-variable ranging over the class denoted by the type being defined.
Definition 4.1 continued: integrity constraints.

<c_expr>     ::= label : <comparison>
<comparison> ::= <lhs> <θ> <rhs>
<lhs>        ::= this.<path>
<rhs>        ::= this.<path> | t_term.<path> | k
<path>       ::= p_term. ... .p_term
<θ>          ::= < | > | ≤ | ≥ | = | ≠
4.2 TQL Schemas
In this subsection, we define the properties that a set of type-definitions must satisfy in order to be a TQL schema. A non-empty set of type-definitions S is fully defined (defined for short) if and only if all the t_terms used to form type-sentences and constraint expressions in S are left-hand sides of type-definitions in S. Then, we have the following definition of TQL schema.

Definition 4.2 TQL schema. A TQL schema is a defined set of type-definitions. □

Consider the set of TQL type-definitions:

person := [name:{string}1,2, age:integer, child:{person}]
student := ISA adult [age:(18..40), college:university]

This is not a schema because the t_terms adult and university do not appear in the left-hand side of any type-definition, nor are they basic types. On the contrary, the set:

person := [name:{string}1,2, age:integer, child:{person}]
student := ISA person [age:(18..40), college:string]

is a TQL schema.
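The definedness check above is mechanical; the following is a minimal sketch, where the dict-based encoding of a set of type-definitions is an illustrative assumption:

```python
# Basic types of TQL, per Definition 4.1.
BASIC_TYPES = {"integer", "real", "boolean", "string", "TOP"}

def undefined_tterms(schema, used_tterms):
    """Return the referenced t_terms that are neither left-hand sides of a
    type-definition nor basic types; a non-empty result means the set of
    type-definitions is not a TQL schema."""
    return {t for t in used_tterms if t not in schema and t not in BASIC_TYPES}

defs = {"person": "...", "student": "..."}
# student := ISA adult [..., college:university] references adult, university
print(undefined_tterms(defs, {"adult", "university", "person"}))
```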
4.3 Legality of the ISA Hierarchy

In Section 3, we have seen that a type defined by means of the ISA construct inherits the definition of its supertypes. Typed properties of supertypes are inherited as they are (absolute inheritance), by composition (composed inheritance), or by refinement (refined inheritance). We have also anticipated that the legality of the ISA hierarchy is related to the refinement of the types of common properties. In this subsection, we give the formal definition of a legal ISA hierarchy. Then, some related examples follow.
Definition 4.3 Legal ISA hierarchy. Given a schema, the ISA hierarchy of such a schema is legal iff ISA is not cyclic and, for each type-definition with ISA, the following conditions are verified.
1) Composed inheritance: for each property p_termh belonging to at least two supertypes (eventually belonging also to the definition of the type), as in the following:

t_term := ISA t_term1 ... t_termr [...]
t_term1 := [..., p_termh:{body1}m1,M1, ...]
...
t_termn := [..., p_termh:{bodyn}mn,Mn, ...]

where 2 ≤ n ≤ r, there exists bodyh such that:
- if bodyi, i = 1..n, are basic types or sets of values, then bodyh = ∩i bodyi ≠ ∅;
- if bodyi, i = 1..n, are type labels or tuples, then bodyh is a type label of the schema that is the greatest lower bound (glb) of bodyi, i = 1..n, according to the partial order induced by the subsumption relationship over types.

Furthermore, there exists an interval of cardinality [mh, Mh] such that: [mh, Mh] = ∩i [mi, Mi] ≠ ∅, i = 1..n. Then, the inherited typed property will be the following one: p_termh:{bodyh}mh,Mh.
2) Refined inheritance: for each property p_termh belonging to supertypes and to the definition of the type, as in the following:

t_term := ISA t_term1 ... t_termr [..., p_termh:{bodyh}mh,Mh, ...]
t_termi := [..., p_termh:{bodyi}mi,Mi, ...]

where 1 ≤ i ≤ r, it results that:
- bodyh is subsumed by bodyi;
- mi ≤ mh and Mh ≤ Mi.

The inherited typed property will be the following one: p_termh:{bodyh}mh,Mh.
□

Note that, as already mentioned in Section 3, when a typed property must be inherited both by composition and by refinement, composed inheritance will be performed before the refined one. Therefore, conditions related to refined inheritance will be verified on the typed properties inherited by composition. For example, the ISA hierarchy of the following schema is legal:

vehicle := [maker:string, color:(red,green,blue)]
car := ISA vehicle [color:(red)]

In fact, the property color has to be inherited by refinement, and the conditions related to types and cardinality intervals of such a property are satisfied. Also the following ISA hierarchy is legal:

young := [name:string, age:(0..30)]
mother := [name:string, sex:(female), age:(10..150)]
youngmother := ISA young mother [age:(10..25)]

because the property age is inherited first by composition, with typing (10..30), and then by refinement, with typing (10..25) specified in the tuple. On the contrary, the following ISA hierarchy:

person := [name:string, age:(0..150), vehicle:{car}2,3]
student := [name:string, age:(18..40), vehicle:{car}0,1]
gr_student := ISA person student [age:(23..40)]
car := [maker:string, color:string]

is not legal, because the conditions for composed inheritance are not satisfied. In fact, even if the property age is correctly inherited first by composition and then by refinement, as far as the property vehicle is concerned, the intervals of cardinality have empty intersection. Also the following ISA hierarchy:
student := [name:string, age:(18..40)]
mother := ISA student [age:(20..150)]

is not legal. In fact, the type of the property age is not a refinement of the corresponding type in the supertype student. However, these are patterns that are easy to correct: in order to obtain a legal ISA hierarchy, it is sufficient to consider the intersection (or a subset of it) of the types of the property age. For this example, the system will suggest that the designer modify the type of the property age of a mother to the interval (20..40). Similarly, consider the example:

person := [name:string, age:(0..150)]
teenager := ISA person [age:(13..19)]
young := ISA teenager [age:(18..25)]

Also in this case, the system notifies the illegality and suggests, for example, the following correction: young := ISA teenager [age:(18..19)].
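The composed-inheritance conditions on value-set bodies and cardinality intervals can be sketched as follows; the (m, M) pair encoding of intervals and the helper names are illustrative assumptions:

```python
def intersect_intervals(intervals):
    """Intersect closed cardinality intervals [m, M]; None signals illegality."""
    m = max(lo for lo, _ in intervals)
    M = min(hi for _, hi in intervals)
    return (m, M) if m <= M else None

def compose_property(bodies, intervals):
    """Composed inheritance for a shared property with value-set bodies:
    both the body intersection and the interval intersection must be
    non-empty, otherwise the ISA hierarchy is not legal."""
    body = set.intersection(*bodies)
    card = intersect_intervals(intervals)
    if not body or card is None:
        return None
    return body, card

# age in young (0..30) and mother (10..150): composed typing (10..30)
print(compose_property([set(range(0, 31)), set(range(10, 151))],
                       [(1, 1), (1, 1)]) is not None)        # → True
# vehicle with {car}2,3 and {car}0,1: empty cardinality intersection
print(intersect_intervals([(2, 3), (0, 1)]))                 # → None
```

This reproduces the gr_student example above: the age intervals compose, while the vehicle cardinalities do not.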
4.3.1 Normal Form Reduction
In this subsection, we deal with the problem of reducing to normal form a type-definition expressed by means of the ISA construct. To this end, inheritance is thoroughly applied (we consider schemas having legal ISA hierarchies) and type-definitions are re-written by using only typed properties. In the following, it is shown how this problem is related to the possibility of verifying subtyping between types on the basis of their typed properties only. We show that this happens if the type-definitions with ISA are normal form reducible. Let us introduce the following definition.

Definition 4.4 Normal form reducible type-definition.
A type-definition with the ISA construct is normal form reducible iff, applying inheritance, it can be transformed into a semantically equivalent type-definition that is in normal form. □

Note that two type-definitions are semantically equivalent iff they denote the same set of objects for all the interpretations. The above definition states that, if a type-definition is normal form reducible, then there exists another type-definition, in normal form, semantically equivalent to it. Normalized type-definitions are then used to verify subtyping. On the contrary, if the type-definition is not normal form reducible, subtyping is axiomatic, and cannot be proven. In such cases, the ISA construct represents more than inheritance. It represents an axiomatic set containment constraint that cannot be modeled by any structural component. For example, consider the following schema:
person := [name:string, age:(0..150)]
student := ISA person [age:(18..40), friend:student]

In this case, the second type-definition is normal form reducible because it can be transformed into the following type-definition, semantically equivalent to it:

student := [name:string, age:(18..40), friend:student]

Therefore, in this case, it is possible to verify that person is a supertype of student also using the last type-definition, i.e. only on the basis of their respective typed properties. Consider now the schema:

person := [name:string, friend:person]
student := ISA person [friend:student]

The ISA hierarchy of this schema is legal since student is a subtype of person. However, if we apply inheritance, and we override the property friend, we obtain the following type-definition:

student := [name:string, friend:student]

that is not semantically equivalent to the original one, since student is no longer a refinement of person. In this case, the ISA construct is axiomatic and, therefore, it is not possible to establish that a student is a subtype of person on the ground of their structure, i.e. their typed properties. In this sense, we say that, if a type-definition is not normal form reducible, then the refinement relationship between types and supertypes is not effectively computable. Finally, we introduce the following definition.

Definition 4.5 ISA certifiable schema. A schema containing at least one type-definition with ISA is ISA certifiable iff all its type-definitions with ISA are normal form reducible. □

Therefore, in an ISA certifiable schema, the system is able to verify the hierarchical relationships between type-definitions on the ground of their structure, and no axiomatic set containment relationships are given.
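The rewriting step itself, applying inheritance exhaustively with own properties overriding inherited ones, can be sketched as follows; the dict representation is an illustrative assumption, and whether the result is semantically equivalent (i.e. the definition is normal form reducible) still requires the refinement checks of Section 4.3:

```python
def reduce_to_normal_form(schema, name):
    """schema maps a t_term to (supertypes, own_properties); the result is a
    type-definition using only typed properties."""
    supers, own = schema[name]
    props = {}
    for s in supers:
        props.update(reduce_to_normal_form(schema, s))   # inherit
    props.update(own)                                    # override / refine
    return props

schema = {
    "person":  ([], {"name": "string", "age": "(0..150)"}),
    "student": (["person"], {"age": "(18..40)", "friend": "student"}),
}
print(reduce_to_normal_form(schema, "student"))
# → {'name': 'string', 'age': '(18..40)', 'friend': 'student'}
```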
5 Formal Semantics of TQL
The semantics of TQL given here follows a pure denotational approach [10]. With this approach, a type is a formula and a class is simply one of its possible extensions. Given an interpretation, there is a tight and immutable association between types and classes: given a type, there is only one class that corresponds to it. In this perspective, a class update causes a transition from one interpretation to another. An interpretation is represented by a database state. Preliminary work in this direction has been carried out by comparing OODB systems and knowledge representation systems [32], [18].
Definition 5.1 The semantics of TQL types and integrity constraints.

Let Σ be a finite set of oids representing a given state of the Application Domain, T the set of TQL sentences, and P (⊂ T) the set of p_terms. Consider a function ξ from T to the powerset ℘(Σ):

ξ : T → ℘(Σ)

and a function Π from P to the powerset ℘(Σ × Σ):

Π : P → ℘(Σ × Σ).

Then, ξ is an extension function over Σ with respect to the type-definition:

t_terma := type [, c_expr, ..., c_expr]

iff the values of ξ on the type and integrity constraints are constructed starting from the values of their components as follows. Given a path = p_term1. ... .p_termn, we define the set S_path,x as follows:

- S_path,x = {x}, if n = 0;
- S_path,x = { y ∈ Σ | (x,y) ∈ Π⟦p_term1⟧ }, if n = 1;
- S_path,x = { y ∈ Σ | ∃ (y1, ..., yn−1): (x,y1) ∈ Π⟦p_term1⟧, (y1,y2) ∈ Π⟦p_term2⟧, ..., (yn−1,y) ∈ Π⟦p_termn⟧ }, if n ≥ 2.
The extensions of types and integrity constraints are computed following the structure of the syntax.

Extension of types:

a) ξ⟦ISA t_term1 ... t_termn body⟧ = (∩i ξ⟦t_termi⟧) ∩ ξ⟦body⟧
b) ξ⟦t_term⟧ ⊆ Σ
c) ξ⟦(value_set)⟧ = {value_set} ∩ Σ
d) ξ⟦[tp, ..., tp]⟧ = ∩j ξ⟦tpj⟧
e) ξ⟦integer⟧ = Z ∩ Σ
f) ξ⟦real⟧ = R ∩ Σ
g) ξ⟦boolean⟧ = {true, false} ∩ Σ
h) ξ⟦string⟧ = S ∩ Σ
i) ξ⟦TOP⟧ = Σ
j) ξ⟦tp⟧ = ξ⟦p_term:{body}m,M⟧ = E1 ∩ E2, where:
   E1 = { x ∈ Σ | ∀ y ∈ S_p_term,x : y ∈ ξ⟦body⟧ }
   E2 = { x ∈ Σ | m ≤ |S_p_term,x| ≤ M }
and |S_p_term,x| represents the cardinality of the set S_p_term,x.

Extension of integrity constraints:

ξ⟦c_expr⟧ = ξ⟦label: c_term1 θ c_term2⟧ = { x ∈ Σ | ∀ y ∈ ξ⟦c_term1⟧, ∀ z ∈ ξ⟦c_term2⟧ : y θ z }

where:
a) ξ⟦k⟧ = {k}
b) ξ⟦this.path⟧ = S_path,x
c) ξ⟦t_termb.path⟧ = ∪ { S_path,z | z ∈ ξ⟦t_termb⟧, z ≠ x if t_terma = t_termb }

where t_termb is any type label and t_terma is the label of the type-definition we are analyzing. □
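The path sets S_path,x of Definition 5.1 can be computed as iterated relational composition over the property relations Π(p_term) ⊆ Σ × Σ; in this sketch, representing Π as a dict of pair-sets is an illustrative assumption:

```python
def S(Pi, path, x):
    """S_path,x: start from {x} (the n = 0 case) and compose with one
    property relation per p_term in the path."""
    current = {x}
    for p in path:
        current = {y for (z, y) in Pi[p] if z in current}
    return current

Pi = {"child": {(1, 2), (1, 3), (2, 4)},
      "ssn":   {(1, 111), (2, 222), (3, 333)}}
print(S(Pi, [], 1))                      # → {1}
print(S(Pi, ["child"], 1))               # → {2, 3}
print(S(Pi, ["child", "ssn"], 1))
```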
Definition 5.2 Interpretation of a TQL schema. An interpretation of a TQL schema is a 3-tuple ℑ = <Σ, ξ, Π> where Σ represents the Application Domain, Π is a function as defined above, and ξ is an extension function over Σ with respect to each type-definition of the schema. □

Definition 5.3 Model of a TQL schema. A model of a TQL schema is an interpretation ℑ = <Σ, ξ, Π> such that, for each type-definition in the schema:

t_term := type [, c_expr, ..., c_expr]

it results: ξ⟦t_term⟧ = ξ⟦type⟧ ∩ (∩j ξ⟦c_exprj⟧). □

Given an interpretation, the mapping between type-definitions and subsets of the Application Domain is at the basis of query processing in TQL [16]. Given a query, expressed as a TQL type-definition, the solution is simply its extension, determined under the given interpretation (i.e. the current state of the database).
6 Conclusion

In this work, an Object-Oriented database model, enriched by integrity constraints, has been presented. The model has been described using the language TQL. This is a formalism conceived as a data definition language and as a constraint specification language. ISA hierarchies have been investigated with the aim of introducing the notions of legal ISA hierarchy and normal form reduction of a type-definition. The latter is a mechanism based on inheritance that allows the rewriting of a type-definition with ISA by using the inherited set of typed properties. Furthermore, the expressive power of TQL has been analysed, using a declarative semantics. Currently, in order to have systems able to completely certify the correctness of schemas, we are analysing the problem of the satisfiability of a set of TQL integrity constraints and, in particular, the finite satisfiability of a TQL schema.
TQL is the basis of the prototype MOSAICO, an environment that supports the design and rapid prototyping of Object-Oriented database applications. MOSAICO is currently under development at IASI-CNR. The prototype has been implemented in BIM-Prolog on a Sun workstation.
References
1. S.Abiteboul, C.Beeri; "On the Power of Languages for Manipulating Complex Objects"; International Workshop on Theory and Applications of Nested Relations and Complex Objects; Darmstadt, 1987.
2. R.Agrawal, N.H.Gehani; "ODE (Object Database and Environment): The Language and the Data Model"; Proc. of ACM SIGMOD 89 Conference; 1989.
3. P.Asirelli, P.Inverardi, A.Mustaro; "Improving Integrity Constraint Checking in Deductive Databases"; Lecture Notes in Computer Science 326, 72-86, ICDT'88; 1988.
4. S.Abiteboul, P.C.Kanellakis; "Object Identity as a Query Language Primitive"; SIGMOD '89; 1989.
5. P.Atzeni, L.Tanca; "The LOGIDATA+ model and language"; Next Generation Information Systems Technology, LNCS 504, Springer Verlag, 1991.
6. M.Atkinson, F.Bancilhon, D.DeWitt, K.Dittrich, D.Maier, S.Zdonik; "The Object-Oriented Database System Manifesto"; Technical Report, Altair 30-89, 1989.
7. J.Banerjee, H.Chou, J.F.Garza, W.Kim, D.Woelk, N.Ballou; "Data Model Issues for Object-Oriented Applications"; Readings in Database Systems, M.Stonebraker (Ed.), Morgan Kaufmann Pub., 1988.
8. F.Bancilhon; "Object-Oriented Database Systems"; 7th ACM SIGACT-SIGMOD-SIGART Symp. on Principles of Database Systems; 1988.
9. C.Beeri; "A formal approach to object-oriented databases"; Data & Knowledge Engineering 5; 353-382; North-Holland, 1990.
10. R.J.Brachman, H.J.Levesque; "The Tractability of Subsumption in Frame-Based Description Languages"; Proc. of National Conference on Artificial Intelligence - AAAI 84, 34-37; Austin, 1984.
11. E.Bertino, D.Musto; "Correctness of Semantic Integrity Checking in Database Management Systems"; Acta Informatica 26, 25-57; 1988.
12. L.Cardelli; "A Semantics of Multiple Inheritance"; Lecture Notes in Computer Science, No.173, Springer Verlag; 1984.
13. J.Cohen; "Constraint Logic Programming"; Communications of the ACM; Vol.33, No.7; July 1990.
14. S.Ceri, J.Widom; "Deriving Production Rules for Constraint Maintenance"; Proc. of the 16th VLDB Conference; Brisbane, Australia, 1990.
15. D.Fishman et al.; "Iris: an object-oriented database management system"; ACM TOIS 5(1), 46-69, 1987.
16. A.Formica, M.Missikoff; "Materialization of Recursive Objects in Object-Oriented Databases"; Proc. of the Ninth International Symposium Applied Informatics; Innsbruck, 1991.
17. A.Formica, M.Missikoff; "Adding Integrity Constraints to Object-Oriented Databases"; ISMM First International Conference on Information and Knowledge Management (CIKM-92), Baltimore, November 1992.
18. A.Formica, M.Missikoff, S.Vazzana; "An Object-Oriented Data Model for Artificial Intelligence Applications"; Next Generation Information Systems Technology, LNCS 504, Springer Verlag, 1991.
19. H.Gallaire et al.; "Logic and Databases: A Deductive Approach"; Computing Surveys; vol.16, n.2; June 1984.
20. R.Gernert, N.Greif; "Modelling of Complex Objects and Semantic Integrity Constraints in Product Databases"; Informatik Informationen - Report No.2/1990; Berlin, 1990.
21. N.Gehani, H.V.Jagadish; "Ode as an Active Database: Constraints and Triggers"; Proc. of the 17th VLDB Conference, Barcelona, Sept. 1991.
22. R.Kowalski, F.Sadri, P.Soper; "Integrity Checking in Deductive Databases"; Proc. of the 13th VLDB Conference; 61-69, Brighton; 1987.
23. G.M.Kuper, M.Y.Vardi; "A New Approach to Database Logic"; Proc. of ACM Symposium on Principles of Database Systems, 1984.
24. C.Lassez; "Constraint Logic Programming"; BYTE, 171-176, August 1987.
25. H.Lam, M.Missikoff; "Mosaico: A Specification and Rapid Prototyping Environment for Object-Oriented Database Applications"; Technical Note, December 1992.
26. C.Lecluse, P.Richard; "The O2 database programming language"; Proc. of VLDB '89 Conference; Amsterdam, 1989.
27. C.Lecluse, P.Richard; "Modeling Complex Structures in Object-Oriented Databases"; Proc. of ACM PODS Conference; 1989.
28. C.Lecluse, P.Richard, F.Velez; "O2: an Object-Oriented Data Model"; Proc. of ACM SIGMOD Conference; Chicago, 1988.
29. G.Moerkotte, S.Karl; "Efficient Consistency Control in Deductive Databases"; Lecture Notes in Computer Science 326, 118-128, ICDT'88; 1988.
30. D.Maier, A.Otis, A.Purdy; "Development of an object-oriented dbms"; Quart. Bull. IEEE Database Engineering 8, 1985.
31. A.Motro; "Integrity = Validity + Completeness"; ACM Transactions on Database Systems, Vol.14, No.4, 480-502; December 1989.
32. M.Missikoff, S.Vazzana; "OOL: an Object Oriented Language for Knowledge Representation"; Proc. of IV International Symposium on Knowledge Engineering, Barcelona, May 1990.
33. T.Sheard, D.Stemple; "Automatic Verification of Database Transaction Safety"; ACM TODS, Vol.14, No.3; September 1989.
34. S.D.Urban, L.M.L.Delcambre; "Constraint Analysis: a Design Process for Specifying Operations"; Transactions on Knowledge and Data Engineering; March 1991.
35. J.D.Ullman; "Principles of Database and Knowledge-base Systems"; vol.I; Computer Science Press; 1988.
36. S.D.Urban; "ALICE: An Assertion Language for Integrity Constraint Expression"; COMPSAC Proceedings; Orlando, September 1989.
Evaluation of Negative Logic Programs *
Sergio Greco, Massimo Romeo and Domenico Saccà
DEIS-UNICAL, 87030 Rende, Italy
Abstract. Negative programs are logic programs where negation may arise also in the head of rules; they have been recently introduced to handle exceptions to general rules. In this paper we present efficient techniques for computing the intended model of stratified negative programs.
1 Introduction
Several recent papers have addressed the problem of extending logic programming to allow negation not only in the body of rules but also in the head, in order to express exceptions to rules. For instance, suppose that the papers that are eligible for a prize are those in the April issue of the journal "JACM" plus the ones that are referenced in eligible papers. This can be stated by the following program:
eligible(P) ← appears_april_jacm(P)
eligible(P) ← eligible(Q), refers_to(Q, P)

Suppose now that someone objects and says: those papers that are corrected in the May issue cannot be considered eligible for the prize; furthermore, those papers with more than three co-authors are not eligible either. These refinements in the definition of eligible can be listed as exceptions in the following way:

¬eligible(P) ← appears_may_jacm(Q), is_a_correction(Q, P)
¬eligible(P) ← number_of_authors(P, N), N > 3

Exceptions are added without changing the structure of the original program. Two main extensions have been proposed in the literature. The first one extends standard logic programming with a new form of negation, called classical negation [6, 10]. This approach permits two different negations: negation as default (denoted by not) and classical negation (denoted by ¬). Programs consist of rules where negation by default can appear only in the body, while classical negation can appear only in the head. The second approach considers only one form of negation and, therefore, allows for a uniform treatment of negation. Moreover, following this approach, it is possible to recognize a large class of programs, stratified negative programs, for which the definition of the intended model is a natural extension of the notion

* Work supported by the CNR project "Sistemi Informatici e Calcolo Parallelo", subproject "Logidata+".
of stratified model as given for classical programs. In this paper we present an algorithm for the efficient computation of the stratified model of a negative program and a technique for extending the magic-set method to restrict this computation to those elements that are relevant in answering a query. The rest of the paper is organized as follows. In Section 2 we review some basic definitions and results concerning negative programs. In Section 3 we present the class of stratified negative programs and we describe an algorithm for computing their stratified models. In Section 4, we present an extension of the magic-set optimization technique for negative logic programs. Finally, in Section 5, we present our conclusions.
2 Negative Programs
In this section we present our language, that is, logic programming without function symbols where negation may arise not only in the body but also in the head of rules. This language is an extension of DATALOG¬ [15, 4, 11] and will be here called ¬DATALOG. An atom is a formula of the language of the form p(t1, ..., tm), where p is a predicate symbol of a finite arity m ≥ 0 and t1, ..., tm are variables or constants (arguments of the atom). A literal is either an atom (positive literal) or its negation (negative literal). An atom A and its negation (i.e., the literal ¬A) are said to be the complement of each other. Moreover, if B is a literal, then ¬B denotes the complement of B. A negative rule (or, simply, a rule) is of the form:

A0 ← A1, ..., An.

where A0 is a literal (head of the rule) and A1, ..., An is a conjunction of literals (body of the rule). If A0 is positive then the rule is a seminegative rule; moreover, if also A1, ..., An are all positive then the rule is a positive rule (or Horn clause). Given a rule r, H(r) and B(r) denote respectively the head and the body of r. With a little abuse of notation, and whenever no confusion arises, we shall also see B(r) as a set of literals, i.e., B(r) = {A1, ..., An}. A rule is a fact if it has an empty body; a term, atom, literal or rule is ground if it is variable free. A ¬DATALOG program is a set of rules. If all rules are seminegative (resp., positive) then the program is called seminegative, or a DATALOG¬ program (resp., positive, or a DATALOG program). Let P be a ¬DATALOG program. P+ (resp., P−) denotes the set of all seminegative (resp., non-seminegative) rules in P. The Herbrand Universe of P (denoted by H_P) is the set of all constants occurring in P. The Herbrand Base of P (denoted by B_P) is the set of all possible ground atoms whose predicate symbols occur in P and whose arguments are elements of H_P. As there are no function symbols, both H_P and B_P are finite.
A ground instance of a rule r in P is a rule obtained from r by replacing every variable X in r by φ(X), where φ is a mapping from the set of all variables occurring in P to H_P. The set of all ground instances of all rules in P is denoted by ground(P). Let X be a set of ground literals whose atoms are in B_P; then ¬X denotes the set {¬A | A ∈ X}, and X+ (resp., X−) denotes the set of all positive (resp., negative) literals in X. Moreover, X̄ denotes all elements of the Herbrand Base which do not occur in X, i.e., X̄ = {A | A ∈ B_P and neither A nor ¬A ∈ X}. A set of literals X is said to be consistent if there does not exist a literal A ∈ X such that ¬A ∈ X. A (partial) interpretation for P is any consistent subset of B_P ∪ ¬B_P. An interpretation I is total if Ī is empty. Let I be an interpretation. Given a ground literal A, we say that A is true in I (resp., false in I) if A ∈ I (resp., ¬A ∈ I). A conjunction A1, ..., An of ground literals is true in I if each Ai is true in I, and is false in I if there exists some Ai that is false in I. If a literal (or a conjunction) is neither true nor false then it is undefined in I. An interpretation I for P is a (partial) model for P if:

1. for each A in I+ and for each r in ground(P) with H(r) = ¬A, B(r) is false in I, and
2. for each ¬A in I−, either there exists r in ground(P) with H(r) = ¬A such that B(r) is true in I, or for each rule r in ground(P) with H(r) = A, B(r) is false in I.

Moreover, if Ī is empty then the model I is total. The existence of a total model is not guaranteed. In other words, given a model M, a ground literal may occur in M only if there is no rule that could contradict it (possibly by assigning suitable values to undefined literals in M̄). However, a negative literal can be contradicted if it is reconfirmed by a non-seminegative rule.
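The two model conditions can be checked mechanically on a ground program; the following is a minimal sketch, where the (sign, atom) encoding of literals, the (head, body) encoding of rules, and the sample atoms are illustrative assumptions:

```python
def false_in(lit, I):
    sign, atom = lit
    return (not sign, atom) in I       # a literal is false iff its complement holds

def body_true(body, I):
    return all(l in I for l in body)

def body_false(body, I):
    return any(false_in(l, I) for l in body)

def is_model(rules, I):
    for (sign, atom) in I:
        if sign:
            # condition 1: every rule with head ¬atom must have a false body
            if any(h == (False, atom) and not body_false(b, I) for h, b in rules):
                return False
        else:
            # condition 2: ¬atom is confirmed by a rule with head ¬atom and a
            # true body, or every rule with head atom has a false body
            derived = any(h == (False, atom) and body_true(b, I) for h, b in rules)
            blocked = all(body_false(b, I) for h, b in rules if h == (True, atom))
            if not (derived or blocked):
                return False
    return True

# ground version of: rich(john). ¬rich(bob) ← benefactor(bob).
rules = [((True, "rich_john"), []),
         ((False, "rich_bob"), [(True, "benef_bob")])]
I = {(True, "rich_john"), (True, "benef_bob"), (False, "rich_bob")}
print(is_model(rules, I))              # → True
```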
This corresponds to saying that non-seminegative rules represent a refinement of the knowledge supplied by seminegative rules; thus they are a kind of exception to a theory. It turns out that a rule r in P− is meaningful only if there exists at least one rule in P+ whose head predicate symbol is equal to that of r. From now on, we assume that P only contains meaningful rules. The problem of selecting the "intended" model for a ¬DATALOG program, i.e., the model which captures the semantics of the program, has been discussed in [7], where suitable extensions of founded [14], well-founded [18] and stable model semantics [5] have been proposed. In this paper we shall consider the intended model only for the class of stratified ¬DATALOG programs, which are defined in the next section. To this end, we need to extend the classical immediate consequence transformation T_P [9] to negative programs. For each interpretation I, we define
7"p(I) = { A [ r E ground(P) A U(r) = A A B(r) is true in I } { A I r E ground(P)- A U(r) = -~A A B(r) is not false in I }
Recall that, according to our notation, ground(P)− contains all non-seminegative rules in ground(P). If P is a DATALOG¬ program then T̃_P coincides with T_P. We have that T̃_P is monotonic in the finite lower semi-lattice <I, ⊆>, where I is the family of all interpretations of P. Therefore the least fixpoint of T̃_P, denoted by T̃_P^∞(∅), exists, and there exists a natural i, i ≥ 0, such that T̃_P^i(∅) = T̃_P^∞(∅). As proven in [7], T̃_P^∞(∅) is a model for P but not a total model; it is indeed a rather poor model, as it contains only those negative elements that can be derived from the rules, and such elements are not in general sufficient to derive all possible positive elements because of negative literals in the rule bodies. Nevertheless, as shown in the next section, a particular use of T̃_P^∞(∅) will eventually allow us to determine the intended model of stratified ¬DATALOG programs.
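The fixpoint computation just described can be sketched as follows; the (sign, atom) encoding is an illustrative assumption, and the iteration accumulates previously derived literals so that it is inflationary and terminates on the finite Herbrand base (an implementation convenience, not part of the formal definition):

```python
def T(rules, I):
    """One application of the operator T̃_P, keeping what was already derived."""
    out = set(I)
    for (sign, atom), body in rules:
        if sign and all(l in I for l in body):
            out.add((True, atom))                 # B(r) is true in I
        if not sign and not any((not s, a) in I for (s, a) in body):
            out.add((False, atom))                # B(r) is not false in I
    return out

def lfp(rules):
    """Iterate from the empty interpretation until a fixpoint is reached."""
    I = set()
    while True:
        J = T(rules, I)
        if J == I:
            return I
        I = J

rules = [((True, "benef_bob"), []),
         ((False, "rich_bob"), [(True, "benef_bob")])]
print(lfp(rules))
```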
3 Stratified Negative Programs
Let us first extend the concept of dependency graph as given for DATALOG¬ programs [15]. Given a ¬DATALOG program P, the dependency graph of P, denoted by G_P, is a directed graph <N, A> such that N is the set of all predicate symbols occurring in P and A is defined as follows. Given two predicate symbols p and q, there exists an arc from q to p in A if there exists a rule r in P such that (i) the head predicate symbol of r is p, and (ii) one of the literals in the body of r, say Q, has q as predicate symbol. Moreover, the arc is marked if either Q is positive and H(r) is negative, or Q is negative and H(r) is positive. Given two predicate symbols p and q, we say that q ≤ p if there is a path from q to p in G_P; moreover, q < p if there is a path in G_P from q to p passing through at least one marked arc. P is stratified if there exists no cycle in G_P containing a marked arc.
Example 1. Consider the program P1 and its associated dependency graph G_P1, shown in Figure 1. As G_P1 does not contain cycles with marked arcs, the program P1 is stratified.

rich(X) ← inherits(X,Y), rich(Y).
¬rich(X) ← benefactor(X).
rich(john).
rich(mary).

[Figure 1. Stratified Program P1: unmarked arcs from inherits and rich to rich, and a marked arc from benefactor to rich.]
Consider now the program P2 and its dependency graph G_P2, shown in Figure 2. As G_P2 contains a cycle with a marked arc, the program P2 is not stratified.

rich(john).
rich(X) ← inherits(X,Y), ¬poor(Y).
¬rich(X) ← benefactor(X).
poor(X) ← inherits(X,Y), ¬rich(Y).
poor(mary).

-- Figure 2. Non-Stratified Program P2 (dependency graph over rich, poor, benefactor, inherits) --
A stratified dependency graph of P, denoted by Ĝ_P, is a directed graph <N̂, Â> such that N̂ is a partition of the set of nodes of G_P, Â = {(N_i, N_j) | N_i, N_j ∈ N̂ ∧ ∃ k ∈ N_i and l ∈ N_j s.t. (k, l) is an arc of G_P}, and the following two conditions hold: (1) for each N_i ∈ N̂, there exist no p, q in N_i such that p < q, and (2) for each N_i, N_j ∈ N̂, if there exist p ∈ N_i and q ∈ N_j for which p < q, then for each p̄ ∈ N_i and q̄ ∈ N_j, q̄ ≤ p̄ does not hold. Obviously any stratified dependency graph Ĝ_P is acyclic; therefore, there exists a topological sort of the nodes of Ĝ_P, say <N_1, N_2, ..., N_m>, such that for each N_i, 1 ≤ i ≤ m, whenever there are p, q for which q < p and p ∈ N_i, then q is in some N_j with j < i. A topological sort <N_1, N_2, ..., N_m> represents a stratification of the predicate symbols of P. Note that in general a program has several stratified dependency graphs; moreover, given a stratified dependency graph, there are several stratifications. In the case of seminegative programs, the above definitions coincide with the definitions of stratified program and stratification as given in [1].
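The finest stratification can be read off the strong components of the dependency graph. The sketch below uses Kosaraju's algorithm on our own encoding of P3's dependency graph and emits the components in a topological order, lower strata first:

```python
from collections import defaultdict

# Dependency graph of program P3: an arc q -> p means p depends on q.
arcs = defaultdict(set)
for q, p in [("r", "p"), ("q", "p"), ("s", "q"), ("p", "q")]:
    arcs[q].add(p)
nodes = {"p", "q", "r", "s"}

def strata(nodes, arcs):
    """Strong components in topological order (Kosaraju's algorithm):
    each component appears after everything it depends on."""
    order, seen = [], set()
    def dfs(graph, start, out):
        stack = [(start, iter(graph[start]))]
        seen.add(start)
        while stack:
            v, it = stack[-1]
            w = next((u for u in it if u not in seen), None)
            if w is None:
                stack.pop(); out.append(v)
            else:
                seen.add(w); stack.append((w, iter(graph[w])))
    for n in sorted(nodes):
        if n not in seen:
            dfs(arcs, n, order)
    rev = defaultdict(set)                 # transpose graph
    for q in list(arcs):
        for p in arcs[q]:
            rev[p].add(q)
    seen.clear()
    comps = []
    for n in reversed(order):              # decreasing finish time
        if n not in seen:
            comp = []
            dfs(rev, n, comp)
            comps.append(frozenset(comp))
    return comps

print(strata(nodes, arcs))  # e.g. [{'s'}, {'r'}, {'p', 'q'}], a valid stratification
```

Any topological order of these components is a stratification; the component order produced here corresponds to one of the finest stratifications (S1 or S2 of Example 2).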
Example 2. Consider the stratified program P3, shown in Figure 3 together with its dependency graph.

p(X) ← r(X,Y), q(Y).
¬p(X) ← r(X,Y), ¬q(Y).
q(X) ← s(X,Y,V), p(Y).

-- Figure 3. Stratified Program P3 (dependency graph over p, q, r, s) --
There are three stratified dependency graphs for P3; they are shown in Figure 4. The first graph gives the stratifications S1 = <{r}, {s}, {p,q}> and S2 = <{s}, {r}, {p,q}>. The second and third graphs give the stratifications S3 = <{r}, {s,p,q}> and S4 = <{r,s}, {p,q}>, respectively. Note that each node of the first stratified dependency graph corresponds to the set of nodes of a strong component of the dependency graph; moreover, the stratifications induced by this graph are the finest stratifications.
-- Figure 4. Stratified Dependency Graphs ({r}, {s}, {p,q}; {r}, {p,q,s}; {r,s}, {p,q}) --
Let <N_1, ..., N_m> be a stratification of P and, for each i, 1 ≤ i ≤ m, let P_i = {r | r ∈ P and the head predicate symbol of r is in N_i} and B_P^i = {A | A ∈ B_P and the predicate symbol of A is in N_i}. Then we define:
M_1 = T̂_{P_1}^∞(∅)
M̄_1 = M_1 ∪ ¬(B_P^1 − M_1⁺)
M_2 = M̄_1 ∪ T̂_{P_2}^∞(M̄_1)
M̄_2 = M_2 ∪ ¬(B_P^2 − M_2⁺)
...
M_m = M̄_{m−1} ∪ T̂_{P_m}^∞(M̄_{m−1})
M̄_m = M_m ∪ ¬(B_P^m − M_m⁺)
M̄_m is the stratified model of P. Note that the stratified model is total and unique; thus it is independent of the stratified dependency graph and the stratification that have been chosen. In Figure 5 we present an algorithm for computing the stratified model of P. This algorithm first reads the program and then calls the procedure construct_dependency_graph, which returns the dependency graph. Using the definition of dependency graph, this procedure can easily be implemented in such a way that its time complexity is linear in the size of the program. The algorithm then calls the procedure strong_components, which computes the strong components of the dependency graph. As the strong components define a partition of nodes that satisfies the conditions of a stratified dependency graph, the procedure also determines a topological sort for this partition (if any) and then returns an ordered list of sets of predicate symbols, which represents a stratification of the program. If there exists no topological sort, the procedure notifies that the program is not stratified through the boolean variable stratified. Using well-known graph algorithms, this procedure can easily be implemented in such a way that its time complexity is linear in the number of arcs and, therefore, in the size of the program. Once a stratification <C_1, ..., C_m> is available, the computation of the positive literals in the stratified model is started, according to the order of the stratification; this order coincides with the order of the list C. First of all, the function sub_program returns the subprogram P̂ obtained from P by taking all rules whose head predicate symbols are in Ĉ. Then the positive literals of the current stratum are computed as the set N⁺, by applying the naive method to a transformation T̃ that is much simpler than T̂. In fact, T̃ is evaluated in the following way:
T̃_P̂(X⁺) = {A | A = H(r), r ∈ ground(P̂), B(r) is true in X⁺}
Algorithm Stratified_Model;
var P, P̂: Program; G: Graph; stratified: boolean;
    Q, M, N: set of Literal;
    Ĉ: set of Predicate_Symbol; C: list of set of Predicate_Symbol;
begin
  input(P);
  construct_dependency_graph(P, G);
  strong_components(G, C, stratified);
  if stratified then
    M⁺ := ∅;
    for each Ĉ ∈ C do
      N⁺ := ∅;
      P̂ := sub_program(P, Ĉ);
      repeat
        Q⁺ := N⁺;
        N := T̃(P̂, M⁺ ∪ Q⁺);
        N⁺ := N⁺ − ¬N⁻
      until N⁺ = Q⁺;
      M⁺ := M⁺ ∪ N⁺
    endfor;
    output("The stratified model is", M⁺)
  else
    output("The program is not stratified")
  endif
end;
-- Figure 5. Algorithm Stratified_Model --
where a positive literal B is true in X⁺ if B ∈ X⁺, and a negative literal ¬B is true in X⁺ if B ∉ X⁺. Thus T̃, like the classical transformation T, is applied independently to each rule, without having to check possible contradictions as is instead required by T̂. This means that T̃ can be computed in time linear in the size of the ground program. Possible negative literals that are derived by T̃ are used to remove complementary positive literals. This subtraction can be done in O(h × log h), where h is the size of the Herbrand base. The number of iterations is obviously bounded by the size of the Herbrand base, since at least one new positive literal must be derived at every step. It turns out that the overall time complexity of the algorithm is O(h × log h × l), where l is the size of the ground program. The algorithm can be improved by using hash indices for the subtraction and by replacing the naive method with the semi-naive one in the computation of the fixpoint of T̃ [16]; however, the application of the semi-naive method needs some attention in order to take care of the subtraction and, for the sake of brevity, it is not reported here. For the sake of the presentation, the actual implementation of the various procedures used in the algorithm is not described either. Note that the algorithm makes use of suitable data structures such as Literal (i.e., a predicate symbol followed by a list of bound arguments), Program (i.e., a set of lists of literals, where each list represents a rule), and Graph (i.e., the nodes are predicate symbols and the arcs are represented by adjacency lists).
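The per-stratum naive fixpoint with subtraction can be sketched as follows; the ground program, the facts, and the stratification below are our own toy instance of program P1, with ground atoms encoded as plain strings:

```python
# Ground rules: (head, body) with literals (sign, atom); atoms are strings.
def pred(atom):
    return atom.split("(")[0]

def t_tilde(rules, x_pos):
    """T~ over X+: a positive body literal is true iff its atom is in X+,
    a negative one iff its atom is not; heads may be negative literals."""
    return {(hs, h) for (hs, h), body in rules
            if all((a in x_pos) == s for s, a in body)}

def stratified_model(rules, stratification):
    m_pos = set()
    for stratum in stratification:
        sub = [r for r in rules if pred(r[0][1]) in stratum]
        n_pos = set()
        while True:
            q_pos = n_pos
            n = t_tilde(sub, m_pos | q_pos)
            # derived negative literals remove complementary positives
            n_pos = {a for s, a in n if s} - {a for s, a in n if not s}
            if n_pos == q_pos:
                break
        m_pos |= n_pos
    return m_pos

rules = [
    ((True, "inherits(mary,john)"), []),
    ((True, "benefactor(john)"), []),
    ((True, "rich(john)"), []),
    ((True, "rich(mary)"), []),
    ((False, "rich(john)"), [(True, "benefactor(john)")]),
    ((False, "rich(mary)"), [(True, "benefactor(mary)")]),
    ((True, "rich(mary)"), [(True, "inherits(mary,john)"), (True, "rich(john)")]),
    ((True, "rich(john)"), [(True, "inherits(john,mary)"), (True, "rich(mary)")]),
]
m = stratified_model(rules, [{"inherits", "benefactor"}, {"rich"}])
print(sorted(m))  # the benefactor exception overrides the fact rich(john)
```

Note how the exception rule ¬rich(X) ← benefactor(X) removes rich(john) from the model even though rich(john) is stated as a fact, which is exactly the intended reading of negative rule heads.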
4 Query Answers
The algorithm of Figure 5 computes the whole set of positive literals in the stratified model. This computation is wasteful in the case of answering queries, where only a subset of the stratified model is required. In the case of seminegative programs, this drawback is removed by using program rewriting techniques such as magic sets [2, 3], counting [2, 12, 3], magic counting [13] and others [16]. We claim that rewriting techniques can also be applied to stratified ¬DATALOG programs, using simple adjustments for passing the binding on rules with negative heads. We next show how this can be done for the case of the magic set method, which is assumed to be familiar to the reader. Let P be a stratified ¬DATALOG program, M⁺ be the set of positive literals in the stratified model of P, and Q be a positive literal (the query). The answer to the query Q, denoted by A_Q, is the set of all elements in M⁺ which unify with Q. We compute A_Q as follows:

1. Find the strong components of the dependency graph of P;
2. Construct the program P̂ by taking each seminegative rule in P without any modification and by replacing each other rule r by r̂ as follows: H(r̂) = ¬H(r) and, for each B in B(r), ¬B is in B(r̂) if the predicate symbols of B and H(r) belong to the same strong component, and B is in B(r̂) otherwise;
3. Rewrite the program P̂ for the query Q by applying the magic set method; let Magic(P̂_Q) and Modified(P̂_Q) be the sets of magic rules and of modified rules in the rewritten program, respectively;
4. Construct the program Modified(P_Q) by taking the rules in Modified(P̂_Q) derived from seminegative rules of P and by modifying each other rule r as follows: the head H(r) is replaced by ¬H(r) and each body atom B which is mutually recursive with H(r) by ¬B (this corresponds to undoing the substitution of step 2);
5.
If the program P_Q = Magic(P̂_Q) ∪ Modified(P_Q) is stratified, then use the algorithm of Figure 5 to compute its stratified model M_Q⁺; A_Q consists of all literals in M_Q⁺ which unify with Q.
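Step 2 (and, reversed, step 4) is a purely syntactic sign flip. The sketch below illustrates it on the rules of Example 3 below, with argument lists dropped since only predicate polarities matter; the encoding is our own:

```python
from collections import defaultdict

# A rule is (head, body); head is (sign, pred), body a list of (sign, pred).
prog = [
    ((True,  "p"), [(True, "a")]),
    ((True,  "p"), [(True, "a"), (True, "p")]),
    ((False, "p"), [(True, "q")]),          # the non-seminegative rule
    ((True,  "q"), [(True, "b")]),
    ((True,  "q"), [(True, "b"), (True, "q")]),
]

def mutually_recursive(rules):
    """Predicate pairs in the same strong component of the dependency graph."""
    arcs = defaultdict(set)
    for (_, h), body in rules:
        for _, b in body:
            arcs[b].add(h)
    def reach(src):
        seen, stack = set(), [src]
        while stack:
            for m in arcs[stack.pop()]:
                if m not in seen:
                    seen.add(m); stack.append(m)
        return seen
    r = {p: reach(p) for p in set(arcs) | {h for (_, h), _ in rules}}
    return lambda x, y: y in r[x] and x in r[y]

def flip(rules):
    """Step 2: negate the head of each non-seminegative rule and the body
    literals mutually recursive with it; keep seminegative rules as is."""
    mutual = mutually_recursive(rules)
    out = []
    for (hs, h), body in rules:
        if hs:
            out.append(((hs, h), body))
        else:
            out.append(((True, h),
                        [(not s if mutual(b, h) else s, b) for s, b in body]))
    return out

print(((True, "p"), [(True, "q")]) in flip(prog))  # True
```

In Example 3, p and q are not mutually recursive, so the rule ¬p ← q simply becomes p ← q, matching the rewriting performed there.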
Example 3. Consider the following program P with the query Q = p(1,Y):

p(X,Y) ← a(X,Y).
p(X,Y) ← a(X,Z), p(Z,Y).
¬p(X,Y) ← q(X,Y).
q(X,Y) ← b(X,Y).
q(X,Y) ← b(X,Z), q(Z,Y).
The dependency graph G_P does not contain cycles with marked arcs; hence P is stratified. In order to generate the rewritten program, we first replace the non-seminegative rule ¬p(X,Y) ← q(X,Y) with the seminegative rule p(X,Y) ← q(X,Y) (step 2) and then apply the standard magic-set method to the program (step 3). The magic-set method generates, among others, the modified
rule p(X,Y) ← magic_p(X), q(X,Y), which is derived from the non-seminegative rule. We then replace this rule with the modified rule ¬p(X,Y) ← magic_p(X), q(X,Y) (step 4). The rewritten program P_Q is still stratified and is as follows:

magic_p(1).
magic_p(Z) ← magic_p(X), a(X,Z).
magic_q(X) ← magic_p(X).
magic_q(Z) ← magic_q(X), b(X,Z).
p(X,Y) ← magic_p(X), a(X,Y).
p(X,Y) ← magic_p(X), a(X,Z), p(Z,Y).
¬p(X,Y) ← magic_p(X), q(X,Y).
q(X,Y) ← magic_q(X), b(X,Y).
q(X,Y) ← magic_q(X), b(X,Z), q(Z,Y).
As shown in the next example, it may happen that the program P_Q is not stratified.

Example 4. Consider the following program P with the query p(1,Y):

p(X,Y) ← a(X,Y).
p(X,Y) ← p(X,Z), p(Z,Y).
¬p(X,Y) ← q(X,Y).
q(X,Y) ← b(X,Y).
q(X,Y) ← q(X,Z), q(Z,Y).
The rewritten program P_Q and its dependency graph are given below. The dependency graph contains a cycle with a marked arc; hence the rewritten program P_Q is not stratified.
magic_p(1).
magic_p(Z) ← magic_p(X), p(X,Z).
magic_q(X) ← magic_p(X).
magic_q(Z) ← magic_q(X), q(X,Z).
p(X,Y) ← magic_p(X), a(X,Y).
p(X,Y) ← magic_p(X), p(X,Z), p(Z,Y).
¬p(X,Y) ← magic_p(X), q(X,Y).
q(X,Y) ← magic_q(X), b(X,Y).
q(X,Y) ← magic_q(X), q(X,Z), q(Z,Y).
If the program P_Q happens not to be stratified, the only possibility to answer the query is to compute the whole stratified model, by applying the algorithm of Figure 5 to P rather than to P_Q. However, the binding propagation of P_Q can be exploited also in the case where P_Q is not stratified, by using the technique described in [8].
5 Conclusion
¬DATALOG programs are function-free logic programs where negation may arise also in the heads of rules; they allow one to single out exceptions to a theory without having to overload general rules with refinements for special cases. In this paper we have presented efficient techniques for computing the intended model of a stratified ¬DATALOG program, or the part of it that is relevant to answer a query. These techniques do not require transforming a ¬DATALOG program into an equivalent logic program without negated rule heads (its seminegative version), as they are applied directly to the original program. This is very important, since the seminegative version of a stratified ¬DATALOG program may lose stratification, so that finding the intended model becomes much more complex.
References

1. K. Apt, H. Blair, A. Walker, Towards a Theory of Declarative Knowledge. In Foundations of Deductive Databases and Logic Programming, J. Minker (ed.), Morgan Kaufmann, Los Altos, pages 88-148, 1988.
2. F. Bancilhon, D. Maier, Y. Sagiv, and J.D. Ullman. Magic sets and other strange ways to implement logic programs. In Proceedings of the Fifth ACM Symposium on Principles of Database Systems, pages 1-15, 1986.
3. C. Beeri and R. Ramakrishnan. On the power of magic. Journal of Logic Programming, 10 (3 & 4), pages 255-299, 1991.
4. S. Ceri, G. Gottlob, L. Tanca, Logic Programming and Databases, Springer-Verlag, 1990.
5. M. Gelfond and V. Lifschitz. The stable model semantics of logic programming. In Proceedings of the Fifth Int. Conference on Logic Programming, pages 1070-1080, 1988.
6. M. Gelfond, V. Lifschitz, Logic Programs with Classical Negation. In Proc. of the Seventh Int. Conf. on Logic Programming, pages 579-597, 1990.
7. S. Greco, D. Saccà, Negative Logic Programs. In Proc. of the North American Conf. on Logic Programming, pages 480-497, 1990.
8. S. Greco, D. Saccà, Magic Set Transformation for Negative Logic Programs. Technical Report, 1992.
9. J.W. Lloyd, Foundations of Logic Programming, Springer-Verlag, Berlin, 1987.
10. R.A. Kowalski, F. Sadri, Logic Programming with Exceptions. In Proc. of the Seventh Int. Conf. on Logic Programming, pages 598-616, 1990.
11. N. Leone, M. Romeo, P. Rullo, D. Saccà, Effective Implementation of Negation in Database Logic Query Languages. In this volume.
12. D. Saccà and C. Zaniolo, The generalized counting method of recursive logic queries for databases. Theoretical Computer Science, No. 62, pages 187-220, 1989.
13. D. Saccà and C. Zaniolo, Magic counting methods. In Proceedings of the 1987 ACM SIGMOD Int. Conf. on Management of Data, pages 149-159, 1987.
14. D. Saccà, C. Zaniolo, Stable Models and Non-determinism for Logic Programs with Negation. In Proc. ACM SIGMOD-SIGACT Symp. on Principles of Database Systems, pages 205-217, 1990.
15. J.D. Ullman, Principles of Database and Knowledge-Base Systems, Vol. 1, Computer Science Press, Rockville, MD, 1988.
16. J.D. Ullman, Principles of Database and Knowledge-Base Systems, Vol. 2, Computer Science Press, Rockville, MD, 1989.
17. A. Van Gelder, Negation as Failure Using Tight Derivations for Logic Programs. In Foundations of Deductive Databases and Logic Programming, J. Minker (ed.), Morgan Kaufmann, Los Altos, pages 149-176, 1988.
18. A. Van Gelder, K.A. Ross, and J.S. Schlipf. The well-founded semantics for general logic programs. Journal of the ACM, Vol. 38, No. 3, pages 620-650, 1991.
Effective Implementation of Negation in Database Logic Query Languages *

Nicola Leone¹, Massimo Romeo², Pasquale Rullo³ and Domenico Saccà¹
¹ DEIS-UNICAL, 87030 Rende, Italy
² ISI-CNR, 87030 Rende, Italy
³ Dip. di Matematica-UNICAL, 87030 Rende, Italy

Abstract. Total stable models provide a powerful semantics for DATALOG¬ programs which increases the expressive power of current database query languages by means of non-determinism. An efficient algorithm for determining one of the stable models of a DATALOG¬ program, if any, is presented, so that stable models may also have a practical interest.
1 Introduction
DATALOG¬ is a logic programming language with negation in the rule bodies but without function symbols. It is used as a database query language that, because of recursion, has an expressive power greater than relational algebra. The semantics of a DATALOG¬ program without negation is rather simple: the meaning of the program, i.e., the intended model, is given by the minimum model, that is, the intersection of all total models. As soon as there are negative goals in some rule, the selection of the "intended" model becomes more complex and is a key issue of current research. Total stable models have recently been proposed as the semantics for logic programs with negation and, then, also for DATALOG¬ programs. However, the existence of multiple stable models for the same program (or, on the other side, the lack of a total stable model) has caused some conceptual difficulties to researchers, as the canonical meaning of a logic program is traditionally based on a unique model. Our claim is that this anomaly of total stable models is not a drawback but provides the ground for adding non-determinism to logic programming. Total stable models are also criticized because of the lack of an efficient procedure for their computation. In fact, computing a stable model for DATALOG¬ programs is NP-hard. Also for this aspect, our claim is that this is not a drawback, as long as DATALOG¬ programs are able to formulate NP-hard problems; indeed, it must be required that finding a stable model be polynomial only if the program formulates a polynomial problem. In this paper we present an effective algorithm for computing a stable model of a DATALOG¬ program. This algorithm is based on a backtracking search strategy and on some new interesting characterizations of stable models.
* Work partially supported by the CNR project "Sistemi Informatici e Calcolo Parallelo', subproject "LOGIDATA+".
The paper is organized as follows. In Section 2 we give the preliminary definitions and basic notation; the theoretical results characterizing stable models are presented in Section 3. Finally, we describe the algorithm in Section 4.
2 DATALOG with Negation
Let us first review the basic concepts and notation of the database logic query language DATALOG with negation (or simply DATALOG¬) [16], that is, Horn clauses plus negated goals in rules but without function symbols [7]. An atom is a formula of the language of the form p(t1, ..., tm), where p is a predicate symbol of a finite arity m ≥ 0 and t1, ..., tm are variables or constants (the arguments of the atom). A literal is either an atom (positive literal) or its negation (negative literal). An atom A and its negation (i.e., the literal ¬A) are said to be the complement of each other. Moreover, if B is a literal, then ¬B denotes the complement of B. A rule is a formula of the language of the form A ← A1, ..., An, where A is an atom (the head of the rule) and A1, ..., An is a (possibly empty) conjunction of literals (the body of the rule). Given a rule r, we shall denote the head of the rule by H(r) and the body by B(r), i.e., H(r) = A and B(r) = A1, ..., An. With a little abuse of notation, and whenever no confusion arises, we shall also see B(r) as a set of literals, i.e., B(r) = {A1, ..., An}. A rule with an empty body is called a fact. A rule without negative literals in the body is positive, i.e., it is a Horn clause. A term, atom, literal or rule is ground if it is variable-free. A DATALOG¬ program is a finite set of rules. A DATALOG¬ program is positive if all its rules are positive; in this case it is also called a DATALOG program. Let a DATALOG¬ program P be given. The Herbrand universe for P, denoted H_P, is the set of all constants occurring in P. If there is no constant in the program, then one is added arbitrarily. The Herbrand base of P, denoted B_P, is the set of all possible ground atoms that can be constructed using the constants in H_P and the predicate symbols occurring in P. Note that both H_P and B_P are finite.
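To make the Herbrand universe and the ground instances of the next paragraph concrete, here is a minimal Python sketch; the tuple-based rule encoding and the uppercase-variable convention are our own assumptions:

```python
from itertools import product

# A literal is (sign, pred, args); a rule is (head, body). Variables are
# the argument strings starting with an uppercase letter.
def is_var(t):
    return isinstance(t, str) and t[:1].isupper()

def ground(rules):
    """All ground instances of the rules over the Herbrand universe,
    i.e., the constants occurring in the program."""
    lits = [l for head, body in rules for l in [head] + body]
    consts = {t for _, _, args in lits for t in args if not is_var(t)}
    consts = consts or {"c0"}          # add one constant if none occurs
    out = []
    for head, body in rules:
        vs = sorted({t for _, _, args in [head] + body
                     for t in args if is_var(t)})
        for vals in product(sorted(consts), repeat=len(vs)):
            sub = dict(zip(vs, vals))
            inst = lambda l: (l[0], l[1], tuple(sub.get(t, t) for t in l[2]))
            out.append((inst(head), [inst(b) for b in body]))
    return out

# rich(X) <- inherits(X,Y), rich(Y).  plus the fact inherits(mary,john).
rules = [((True, "rich", ("X",)),
          [(True, "inherits", ("X", "Y")), (True, "rich", ("Y",))]),
         ((True, "inherits", ("mary", "john")), [])]
print(len(ground(rules)))  # 2 constants, 2 variables: 4 instances + 1 fact = 5
```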
A ground instance of a rule r in P is a rule obtained from r by replacing every variable X in r by φ(X), where φ is a mapping from all variables occurring in r to H_P. The set of all ground instances of all rules in P is denoted by ground(P). Let X be a set of ground literals whose atoms are in B_P; then ¬X denotes the set {¬A | A ∈ X}, and X⁺ (resp., X⁻) denotes the set of all positive (resp., negative) literals in X. Moreover, X̄ denotes all elements of the Herbrand base which do not occur in X, i.e., X̄ = {A | A ∈ B_P and neither A nor ¬A ∈ X}. Given a ground literal A, we say that A is true in X (resp., false in X) if A ∈ X (resp., ¬A ∈ X). A conjunction A1, ..., An of ground literals is true in X if each Ai is true in X, and is false in X if there exists some Ai that is false in X. Note that
a literal (or a conjunction) can be both true and false in X or, even, neither true nor false; in the latter case, it is undefined in X. Given X ⊆ B_P ∪ ¬B_P and Y ⊆ B_P, Y is an unfounded set w.r.t. X if, for each rule r in ground(P) with H(r) ∈ Y, B(r) is false in X or B(r) ∩ Y ≠ ∅ [18, 19]. The union of all unfounded sets w.r.t. X is also an unfounded set w.r.t. X; it is called the Greatest Unfounded Set and is denoted by GUS_P(X). As pointed out in [18, 19], GUS_P is a monotonic transformation from 2^(B_P ∪ ¬B_P) to 2^(B_P). Let us now introduce another important transformation from 2^(B_P ∪ ¬B_P) to 2^(B_P), called the immediate consequence transformation and denoted by T_P [7]. This transformation is defined as follows: for each X ⊆ B_P ∪ ¬B_P, T_P(X) = {A | A = H(r), r ∈ ground(P) and B(r) is true in X}. Observe that T_P is monotone in a finite complete lattice; hence the least fixpoint of T_P, denoted T_P^∞(∅), exists and coincides with T_P^k(∅) for some natural k, where T_P^0(∅) = ∅ and, for each i > 0, T_P^i(∅) = T_P(T_P^(i−1)(∅)) [15]. Let us now introduce the important notions of interpretation and model in the domain of partial interpretations. Given I ⊆ B_P ∪ ¬B_P, I is a (partial) interpretation of P if it is consistent, i.e., I⁺ ∩ ¬(I⁻) = ∅, so that a ground literal cannot be, at the same time, true and false in I. Moreover, if Ī = ∅, the interpretation I is called total and, in this case, any ground literal will be either true or false in I. As far as the notion of model is concerned, we first recall the classical definition of model for total interpretations: a total interpretation M of P is a total model of P if, for each r in ground(P), (1) H(r) is true in M or (2) B(r) is false in M. As shown in [14], such models have the following characterization: a total interpretation M is a total model if and only if ¬M⁻ is an unfounded set w.r.t. M.
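The greatest unfounded set can be computed directly by shrinking from the full base. A sketch follows; we read the condition B(r) ∩ Y ≠ ∅ on the positive body atoms, and encode literals as (sign, atom) pairs:

```python
def gus(rules, x):
    """Greatest unfounded set w.r.t. a set x of (sign, atom) literals:
    drop an atom from the candidate set y as soon as some rule for it is
    neither false in x nor blocked by a positive body atom still in y."""
    atoms = {h for h, _ in rules} | {a for _, b in rules for _, a in b}
    y = set(atoms)
    changed = True
    while changed:
        changed = False
        for h, body in rules:
            if h in y \
               and not any((not s, a) in x for s, a in body) \
               and not any(s and a in y for s, a in body):
                y.discard(h)
                changed = True
    return y

# The program used later in Example 3: a <- b.  c <- not d.
rules = [("a", [(True, "b")]), ("c", [(False, "d")])]
print(sorted(gus(rules, set())))  # ['a', 'b', 'd']
```

Here b and d have no rules at all, so they are unfounded; a then follows because its only rule depends on b, while c escapes since its body ¬d could still become true.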
We point out that there are several alternative views on how this definition of model is to be extended to the domain of partial interpretations [4, 11, 13, 14, 18, 19]. Following [13, 14], we shall say that an interpretation M of P is a (partial) model of P if, for each r in ground(P), (1) H(r) is not false in M (thus it is true or undefined), or (2) B(r) is false in M. Note that, in case M is a total interpretation, the above condition (1) reduces to H(r) being true in M. Thus, this definition of model is a rather natural extension of the definition of total model. This is confirmed by the fact that models preserve the same characterization in terms of unfounded sets as total models: an interpretation M is a model if and only if ¬M⁻ is an unfounded set w.r.t. M. A DATALOG¬ program P has in general several models, but only one of them, the so-called "intended" model, represents the natural meaning of the program. It is well known that if P is positive then the intended model is the total model M_T whose set of positive literals M_T⁺ coincides with T_P^∞(∅); thus M_T⁺ contains all literals that can be inferred from the rules of the program. Note that M_T is also the minimum model of P, i.e., for each total model M of P, M_T⁺ ⊆ M⁺. On the other hand, in case P is not positive, M_T is no longer guaranteed to be a model, as ¬M_T⁻ = B_P − T_P^∞(∅) is not necessarily an unfounded set. Therefore, the issue of which model to base the semantics of a DATALOG¬ program on becomes much more complex, and alternative definitions of the intended model have been
made in the literature, e.g., negation by failure [3], the stratified model [1, 2, 9, 17], the locally stratified model [10], the well-founded model [18, 19], total stable models [6], and various extensions of stable models in the domain of partial interpretations [12, 13, 14]. As discussed in the next section, we shall take the total stable model as the intended model of a DATALOG¬ program.

3 Total Stable Models
A first common requirement of the various proposals of semantics for non-positive programs is justifiability [20], i.e., the positive literals of the intended model must be inferred from the rules of the program through the immediate consequence transformation, as for positive programs, but possibly using negative literals as additional axioms for the inferences. This requirement is found under several names in the work of several authors (e.g., the notion of justifiability used in [20] is similar to that of 'stability' [6] or that of 'founded models' [13]), and can be formalized by the following condition:

Definition 1. Given a model M of P, the stability condition holds for M if there exists X ⊆ M⁻ such that S_{P,X}^∞(∅) = M⁺, where the transformation S_{P,X}: 2^(B_P ∪ ¬B_P) → 2^(B_P) is defined as follows: for each Y ⊆ B_P ∪ ¬B_P, S_{P,X}(Y) = T_P(X ∪ Y).

Thus the negative literals in X are used to infer the positive literals in M⁺. Obviously, if such an X exists then we can also use M⁻ instead of X for such inferences; so we could have used M⁻ directly in Definition 1. The reason we have stressed that the whole M⁻ is not necessary in the definition will become clear later in the paper, when we shall use a suitable subset of M⁻ to compute a justified model. A second requirement is minimal undefinedness, i.e., the intended model M may not retain literals in M̄ that could otherwise be included in M⁺ or M⁻. This requirement can be implemented in different ways, each of them resulting in an alternative definition of intended model. The strictest implementation requires that M̄ be empty, so that M is a total model; this requirement, together with the one of Definition 1, provides the following definition of total stable model, which is equivalent to the original definition given in [6]:

Definition 2. A model M of P is a (total) stable model of P if both M is total and the stability condition holds for M.

In this paper we shall assume that the intended model of a DATALOG¬ program is a total stable model which, from now on, we shall call stable model for short. We point out that a DATALOG¬ program could have several alternative stable models or even none. Therefore, as uniqueness and existence have been basic requirements for any classical definition of intended model, it would seem that our choice of intended models has a serious drawback. Instead, our claim is that the lack of the two above properties is an actual advantage, as it increases the
expressive power of DATALOG¬. Indeed, the existence of several intended models allows for the implementation of a powerful non-deterministic choice mechanism, as demonstrated by the following two examples taken from [14]. On the other hand, the lack of stable models means that the problem has no solution (see Example 2).

Example 1. The following program states that glasses must be colored with one of the available colors but two glasses cannot share the same color. Colors and glasses are defined by a number of facts.

colored(G, C) ← color(C), glass(G), ¬diffChoice(C, G).
diffChoice(C, G) ← colored(G1, C), G1 ≠ G.
diffChoice(C, G) ← colored(G, C1), C1 ≠ C.

Note that non-determinism is here used to perform a "don't care" choice. This program admits at least one stable model; thus the problem has a solution regardless of the base of facts for colors and glasses. In particular, if the number of available colors is less than the number of glasses, then some glasses will be assigned no color.

Example 2. Consider the following program, which computes a kernel of a directed graph whose nodes and edges are defined by a number of facts with predicate symbols n and e, respectively. A kernel is a subset of the nodes such that no two nodes in the kernel are joined by an edge, whereas every node that is not in the kernel is joined to some node of the kernel. Finding a kernel of a graph is an NP-complete problem [5].
n̂(X) ← n(X), ¬n̄(X).
n̄(X) ← n(X), ¬n̂(X).
joined_to_N(X) ← e(Y, X), n̂(Y).
disconnected_N̄_from_N ← n̄(X), ¬joined_to_N(X).
connected_N ← n̂(X), joined_to_N(X).
kernel ← ¬disconnected_N̄_from_N, ¬connected_N.
kernel ← ¬kernel.
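The kernel conditions that the program encodes can be checked directly on a small graph. In this sketch (our own encoding), a node X is joined to the kernel when some kernel node Y has an edge e(Y, X):

```python
from itertools import combinations

def joined_to(kernel, edges, x):
    return any((y, x) in edges for y in kernel)

def is_kernel(nodes, edges, kernel):
    """No kernel node is joined to the kernel (independence) and every
    node outside it is (domination)."""
    return (not any(joined_to(kernel, edges, x) for x in kernel)
            and all(joined_to(kernel, edges, x) for x in nodes - kernel))

def kernels(nodes, edges):
    return [set(k) for r in range(len(nodes) + 1)
            for k in combinations(sorted(nodes), r)
            if is_kernel(nodes, edges, set(k))]

# Path 1 -> 2 -> 3: the only kernel is {1, 3}, so the kernel program
# would have exactly one stable model on these facts.
print(kernels({1, 2, 3}, {(1, 2), (2, 3)}))  # [{1, 3}]
```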
The first two rules realize a partition of the nodes into two classes, n̂ and n̄. In a stable model of the program, the former class n̂ will represent the kernel, while the latter will collect all nodes outside the kernel. If M is a stable model of the program, then kernel is in M⁺; in fact, because of the last rule, ¬kernel cannot be in M, as the falsity of kernel would falsify the rule kernel ← ¬kernel (the body would be true while the head would be false). Hence kernel is in M⁺; but, since stable models are justified, kernel has been derived from the previous rule, so both ¬disconnected_N̄_from_N and ¬connected_N are in M⁻. Hence, the set of nodes determined by n̂ in M is a kernel. On the other side, if the graph has no kernel, then kernel remains undefined and the program has no stable models. Thus the number of stable models coincides with the number of distinct kernels in the graph. The program describes the kernel problem using
a direct non-deterministic formulation: the last rule forces the selection, among all possible partitions, of one which satisfies the kernel conditions.

The goal of this paper is to present an efficient algorithm for computing one of the stable models of a given DATALOG¬ program, or reporting that the program admits no stable model. To this end, we use the following characterization of stable models [19]:

Fact 1. A total interpretation M of a given DATALOG¬ program P is a stable model of P if and only if M is a fixpoint of the transformation W_P: 2^(B_P ∪ ¬B_P) → 2^(B_P ∪ ¬B_P) defined as follows: for each X ⊆ B_P ∪ ¬B_P, W_P(X) = T_P(X) ∪ ¬GUS_P(X).
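Definition 2 can be checked brute-force on a tiny ground program; in this sketch (atoms as strings, M given by its positive part) the stability condition is tested with X = M⁻:

```python
def derivable(rules, neg):
    """Positive literals inferable when the atoms in `neg` are taken as
    false axioms (the stability condition with X = M-)."""
    pos = set()
    while True:
        new = {h for h, body in rules
               if all((a in pos) if s else (a in neg) for s, a in body)}
        if new <= pos:
            return pos
        pos |= new

def is_stable(rules, m_pos, atoms):
    return derivable(rules, atoms - m_pos) == m_pos

# p <- not q.  q <- not p.  Two stable models: {p} and {q}.
rules = [("p", [(False, "q")]), ("q", [(False, "p")])]
atoms = {"p", "q"}
print([is_stable(rules, m, atoms) for m in ({"p"}, {"q"}, {"p", "q"})])
# [True, True, False]
```

The two-rule program exhibits exactly the multiplicity of stable models discussed above.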
Note that, as W_P is a monotonic transformation in a complete finite lattice, there exists a natural k ≥ 0 such that W_P^∞(∅) = W_P^k(∅). Thus W_P^∞(∅) is the least fixpoint of W_P and is called the well-founded model of P [18, 19]. The following property holds.

Fact 2. The well-founded model of a DATALOG¬ program P is a model of P and is contained in any stable model of P.

We point out that, as a stable model is total, it is sufficient to return just its positive literals. Therefore, negative literals need not be computed unless they are necessary to perform inferences of positive literals. This means that, in order to efficiently compute a stable model, the transformation W_P should be modified by replacing the GUS with a subset of it containing only those literals in the GUS whose falsity is useful to draw positive conclusions. This subset is called the Greatest Useful Unfounded Set (GUUS).
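Iterating W_P from the empty interpretation yields the well-founded model. A compact sketch, reusing the shrinking GUS computation, with literals encoded as (sign, atom) pairs:

```python
def t_p(rules, x):
    """Immediate consequences: heads of rules whose body is true in x."""
    return {(True, h) for h, body in rules
            if all(l in x for l in body)}

def gus_p(rules, x, atoms):
    y = set(atoms)
    changed = True
    while changed:
        changed = False
        for h, body in rules:
            if h in y \
               and not any((not s, a) in x for s, a in body) \
               and not any(s and a in y for s, a in body):
                y.discard(h)
                changed = True
    return y

def well_founded(rules):
    """Least fixpoint of W_P(X) = T_P(X) ∪ ¬GUS_P(X)."""
    atoms = {h for h, _ in rules} | {a for _, b in rules for _, a in b}
    x = set()
    while True:
        nxt = t_p(rules, x) | {(False, a) for a in gus_p(rules, x, atoms)}
        if nxt == x:
            return x
        x = nxt

rules = [("a", [(True, "b")]), ("c", [(False, "d")])]
print(sorted(well_founded(rules)))
# [(False, 'a'), (False, 'b'), (False, 'd'), (True, 'c')]
```

On this program the well-founded model happens to be total, so by Fact 1 it is also the (unique) stable model.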
Example 3. Consider the program

a ← b.
c ← ¬d.

and let M = ∅. It is easy to see that GUS_P(M) = {a, b, d}. However, only the element d is useful for positive derivations; so GUUS_P(M) = {d}. We next give a formal definition of the GUUS. To this end, we first need to introduce the important notion of possibly unfounded conjunction.

Definition 3. Let P be a DATALOG¬ program and X ⊆ B_P ∪ ¬B_P. A non-empty subset Y of X̄ is a possibly unfounded conjunction w.r.t. X if there exists a rule r in ground(P) such that:
1. H(r) ∉ X⁺;
2. ¬Y ⊆ B(r);
3. all positive literals in B(r) are true in X;
4. no negative literal in B(r) is false in X.
The set of all possibly unfounded conjuctions w.r.t. X is denoted by PUCST~(X). Moreover, given Z C_ B~,, the restriction of PUCS~(X) to Z, denoted by I:UCS~,(X/Z), is PUCS~(X/Z) = {CIC ~ PgCS~(X) ^ C C Z}.
0 Example 4. Consider the program a~--b. c 4--- -~d. e *-- -~e,-~b. and let M = O. It is easy to see that t ~ C S p , M ( M ) -" {{d},{b,e}}. Note that, given Y = {b, d}, PUCS~,.M(M/Y) = {{d}} since the possibly unfounded conjunction {b, e} is not contained in Y. We are now ready to give the definition of GUUS. D e f i n i t i o n 4 . Let a D A T A L O G ~ program 79 and X C Bp U -~BT~ be given. The Greatest Useful Unfounded Set of 79 w.r.t. X is
GUUS_P(X) = ⋃ { C : C ∈ PUCS_P(X / GUS_P(X)) }.
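Definition 4 can be executed directly once the GUS is available. The sketch below is ours (same encoding as before: ground rules as (head, positive body, negated atoms) triples): it computes the GUS by the usual removal iteration, where an atom escapes the unfounded set as soon as some rule for it has a body not false in X and no positive body atom left in the set, and then takes the union of the possibly unfounded conjunctions contained in it.

```python
def gus(program, xpos, xneg, atoms):
    """Greatest unfounded set of P w.r.t. X = xpos with xneg assumed false."""
    s = set(atoms) - set(xpos)
    changed = True
    while changed:
        changed = False
        for head, pos, neg in program:
            if head not in s:
                continue
            body_false = any(a in xneg for a in pos) or any(a in xpos for a in neg)
            if not body_false and not any(a in s for a in pos):
                s.discard(head)         # this rule may still fire: head is not unfounded
                changed = True
    return s

def guus(program, xpos, xneg, atoms):
    """Definition 4: union of the possibly unfounded conjunctions inside the GUS."""
    g = gus(program, xpos, xneg, atoms)
    out = set()
    for head, pos, neg in program:      # inline PUCS_P(X), as in Definition 3
        if head in xpos or not all(a in xpos for a in pos) \
                or any(a in xpos for a in neg):
            continue
        y = frozenset(a for a in neg if a not in xneg)
        if y and y <= g:
            out |= y
    return out

# Example 4's program:  a <- b.  c <- not d.  e <- not e, not b.
P = [("a", ["b"], []), ("c", [], ["d"]), ("e", [], ["e", "b"])]
print(gus(P, set(), set(), "abcde"))    # the set {a, b, d} (cf. Example 5)
print(guus(P, set(), set(), "abcde"))   # the set {d}
```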
Example 5. Consider the program of Example 4. For M = ∅, we have that GUS_P(M) = {a, b, d}. Since the only possibly unfounded conjunction that is contained in {a, b, d} is {d}, GUUS_P(M) = {d}.

We next show that the well-founded model can be determined by replacing the GUS with the GUUS in the transformation W_P.

Definition 5. Given a DATALOG¬ program P, the transformation V_P is defined
as follows: for each X ⊆ B_P ∪ ¬B_P,

V_P(X) = T_P^∞(X) ∪ ¬GUUS_P(T_P^∞(X) ∪ X⁻) ∪ X⁻
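The iteration of V_P can be sketched as follows, in the same propositional encoding used above (our sketch, not the paper's relational implementation; here T∞ treats a negative literal as true only when its atom is already among the assumed-false atoms, and the GUUS is obtained from the GUS as in Definition 4). On the program of Example 6 below, the fixpoint delivers exactly the positive literals {a, c}.

```python
def t_inf(program, xpos, xneg):
    """T_P applied up to its least fixpoint."""
    derived, changed = set(xpos), True
    while changed:
        changed = False
        for head, pos, neg in program:
            if head not in derived and all(a in derived for a in pos) \
                    and all(a in xneg for a in neg):
                derived.add(head)
                changed = True
    return derived

def gus(program, xpos, xneg, atoms):
    s, changed = set(atoms) - set(xpos), True
    while changed:
        changed = False
        for head, pos, neg in program:
            body_false = any(a in xneg for a in pos) or any(a in xpos for a in neg)
            if head in s and not body_false and not any(a in s for a in pos):
                s.discard(head); changed = True
    return s

def guus(program, xpos, xneg, atoms):
    g, out = gus(program, xpos, xneg, atoms), set()
    for head, pos, neg in program:
        if head not in xpos and all(a in xpos for a in pos) \
                and not any(a in xpos for a in neg):
            y = frozenset(a for a in neg if a not in xneg)
            if y and y <= g:
                out |= y
    return out

def v_fixpoint(program, atoms):
    """Iterate M^ := T_inf(M) with M-, then M := M^ plus the negated GUUS."""
    xpos, xneg = set(), set()
    while True:
        tpos = t_inf(program, xpos, xneg)
        nneg = xneg | guus(program, tpos, xneg, atoms)
        if (tpos, nneg) == (xpos, xneg):
            return xpos, xneg
        xpos, xneg = tpos, nneg

# Example 6:  c <- not e.  a <- c.  b <- not a.  d <- b.  e <- f.
P = [("c", [], ["e"]), ("a", ["c"], []), ("b", [], ["a"]),
     ("d", ["b"], []), ("e", ["f"], [])]
pos, neg = v_fixpoint(P, "abcdef")
print(sorted(pos), sorted(neg))   # ['a', 'c'] ['e']
```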
The transformation V_P is not monotone. Nevertheless, the sequence V_P^i(∅), i ≥ 0, with V_P^i(∅) = V_P(V_P^{i-1}(∅)), is a monotonic sequence of models. Hence, the fixpoint V_P^∞(∅) exists and, as the Herbrand base is finite, there is a natural i such that V_P^i(∅) = V_P^∞(∅). Now we are in a position to state a fundamental result that has been proven in [8].

Fact 3. Let P be a DATALOG¬ program and M_wf be the well-founded model of P. Then M = V_P^∞(∅) is a model of P, M⁺ = M_wf⁺ and M ∪ ¬GUS_P(M) = M_wf. □
From the above fact, it follows that the positive literals of the well-founded model can be found using the transformation V_P. Note that if the well-founded model is total then it is also stable; therefore, in this case, no further computation is needed. Recognizing whether the well-founded model is total or not can be easily done using the following result of [8].

Fact 4. Let P be a DATALOG¬ program and M = V_P^∞(∅). Then the well-founded model of P is total (i.e., it is a stable model) if and only if PUCS_P(M) = ∅. □

Example 6. Let us compute the stable model for the following program:

c ← ¬e.
a ← c.
b ← ¬a.
d ← b.
e ← f.
As V_P^0(∅) = ∅, we have T_P^∞(V_P^0(∅)) = ∅, PUCS_P(∅) = {{a}, {e}}, GUS_P(∅) = {e, f} and, then, GUUS_P(∅) = {e}; so V_P^1(∅) = V_P(V_P^0(∅)) = {¬e}. Now T_P^∞(V_P^1(∅)) = {a, c} and GUUS_P({a, c, ¬e}) = ∅ as PUCS_P({a, c, ¬e}) = ∅; so V_P^2(∅) = V_P(V_P^1(∅)) = {a, c, ¬e}. As V_P^3(∅) = V_P(V_P^2(∅)) = {a, c, ¬e} = V_P^2(∅), V_P^2(∅) is the fixpoint of V_P. Therefore, {a, c} is the set of positive literals in the well-founded model M_wf of P. Since PUCS_P(V_P^∞(∅)) = ∅, the well-founded model is total; thus M_wf⁻ = {¬b, ¬d, ¬e, ¬f}.

If the test of Fact 4 fails, we have to perform further computation to determine a stable model. To this end, we have to move beyond the fixpoint V_P^∞(∅). Let us first show how this can be done using an example.
Example 7. Consider the following program:

a ← ¬b, ¬c.
c ← ¬a.
b ← ¬a.
a ← c.

We can easily see that ∅ is the well-founded model M_wf, which is obviously partial and coincides with V_P^∞(∅). We have that PUCS_P(M_wf) = {{a}, {b, c}}. Now, in order to move beyond the well-founded model M_wf, we assume the falsity of all literals in one of the elements of PUCS_P(M_wf) to draw new inferences. For instance, if we add both ¬b and ¬c, we get that a is true, i.e., we obtain M = {a, ¬b, ¬c}, which is a stable model. (Note that selecting b or c alone is not sufficient to draw new inferences.) On the contrary, adding ¬a leads to a contradiction, as a will then be derived. □

We now formalize the above intuition by introducing an extension of V_P.
Definition 6. Let P be a DATALOG¬ program and C be a choice function, i.e., a function which selects an arbitrary element from a set. Then the transformation V̂_P,C : 2^(B_P ∪ ¬B_P) → 2^(B_P ∪ ¬B_P) is defined as follows: for each X ⊆ B_P ∪ ¬B_P,

V̂_P,C(X) = V_P(X)                    if V_P(X) ≠ X
V̂_P,C(X) = X ∪ ¬C(PUCS_P(X))         otherwise
□

There are as many transformations V̂_P,C as the number of different choice functions C. Furthermore, for any choice function C, the sequence V̂_P,C^i(∅), i ≥ 0, is a monotonically increasing sequence of subsets of B_P ∪ ¬B_P and the least fixpoint of it, denoted by V̂_P,C^∞(∅), coincides with some element of the sequence. The following result holds.

Fact 5. Let P be a DATALOG¬ program. Then
1. for each choice function C, V_P^∞(∅) ⊆ V̂_P,C^∞(∅);
2. if for some choice function C, M = V̂_P,C^∞(∅) is an interpretation of P and PUCS_P(M) = ∅, then M is a model of P, and M ∪ ¬M̄ (where M̄ is the set of atoms undefined in M) is a stable model of P;
3. if N is a stable model of P then there exists a choice function C such that N = M ∪ ¬M̄, where M = V̂_P,C^∞(∅). □

Facts 3, 4 and 5 provide the ground for constructing an efficient algorithm for computing a total stable model. This algorithm is described in the next section.
4 Computing the Total Stable Model
Algorithm Stable_Model;
var M : Literals; P : Program; stable : Boolean;
begin
  input(P);
  Compute_V∞(P, M);
  Compute_V̂∞(P, stable, M);
  if stable then output("A stable model of P", M⁺)
  else output("no stable model for P")
  endif
end Stable_Model.

Fig. 1. Algorithm Stable_Model
In this section we present an algorithm for computing a stable model that is based on the results of the previous section. This algorithm, described in Figure 1, takes a DATALOG¬ program P in input and first fires the function Compute_V∞, which computes the fixpoint V_P^∞(∅). Then it calls the procedure Compute_V̂∞, which searches for a choice function C satisfying the conditions of Part (2) of Fact 5. In case of a successful search, the procedure Compute_V̂∞ notifies that a choice function C has been found by assigning the value true to the boolean variable stable and returns M = V̂_P,C^∞(∅); the algorithm then outputs M⁺, which coincides with the set of positive literals of a total stable model. (Because of totality, there is no need to output negative literals.) On the other hand, a possible search failure is indicated by the value false for the variable stable; in this case, the algorithm reports that the program admits no total stable models. Finally, observe that Program and Literals are data types for storing sets of rules and of ground literals, respectively.
Procedure Compute_V∞(P : Program; var M : Literals);
var M̂ : Literals;
begin
  M := ∅;
  repeat
    M̂ := T∞(P, M) ∪ M⁻;
    M := M̂ ∪ ¬GUUS(P, M̂)
  until M = M̂
end Compute_V∞;

Fig. 2. Procedure Compute_V∞
The procedure Compute_V∞ is shown in Figure 2 and performs the computation of V_P^∞(∅) using a simple iteration scheme. Note that the procedure makes use of two functions: T∞ and GUUS. The function T∞ simply computes the fixpoint T_P^∞(M); for instance, it can make use of relational algebra and of an additional fixpoint operator [16]. For the sake of presentation, we assume that T∞ is a built-in function. The function GUUS computes the GUUS and will be described later in this section. Note that, by Fact 3, the procedure Compute_V∞ computes the set of all positive literals of the well-founded model.

After computing M = V_P^∞(∅), the procedure Compute_V̂∞ of Figure 3 is called. This procedure tests whether PUCS_P(M) = ∅. If this test succeeds, then M⁺ is the set of positive literals of a total stable model by Fact 4 and, therefore, the procedure returns the value true for the variable stable, as the well-founded model is also stable. Otherwise, by Part (2) of Fact 5, the procedure has to find a choice function C such that M = V̂_P,C^∞(∅) is an interpretation of P and PUCS_P(M) = ∅. The procedure is essentially based on a backtracking strategy (whence the name Backtracking Fixpoint [13]). To this end, the function make_list stores the elements of PUCS_P(M) into a list L of possibly unfounded
Procedure Compute_V̂∞(P : Program; var stable : Boolean; var M : Literals);
var N, M̂ : Literals; L : List of Literals; Y : Literals;
begin
  if PUCS(P, M) = ∅ then stable := true
  else
    stable := false;
    L := make_list(P, PUCS(P, M));
    N := M;
    while not stable and (L ≠ ∅) do
      Y := pop(L);  (* pop the first possibly unfounded conjunction from L *)
      M := N ∪ ¬Y;
      repeat  (* compute the next fixpoint of V_P *)
        M̂ := T∞(P, M) ∪ M⁻;
        M := M̂ ∪ ¬GUUS(P, M̂)
      until (M = M̂) or (M ∩ ¬M ≠ ∅);
      if M ∩ ¬M = ∅ then  (* M is an interpretation *)
        Compute_V̂∞(P, stable, M)
      endif
    endwhile
  endif
end Compute_V̂∞;

Fig. 3. Procedure Compute_V̂∞
conjunctions, ordered according to some predefined ordering of the Herbrand base. We construct the choice function by taking the first element of L; M is stored into N in order to possibly backtrack to the next element. Then we move to a new fixpoint M of V_P. Of course, as soon as we realize that M becomes inconsistent (i.e., it is not an interpretation), we try the next possibly unfounded conjunction of the list. If there are no other elements left in the list, the procedure stops, returning the value false for the variable stable. If no contradiction arises, the procedure Compute_V̂∞ is recursively called to test whether PUCS_P(M) is empty for the fixpoint M of V_P. If the test succeeds, then M⁺ is the set of positive literals of a stable model by Part (2) of Fact 5 and, therefore, the procedure returns the value true for the variable stable, as a total stable model has been found. Otherwise, we move to a next fixpoint of V_P. In sum, if a choice function C satisfying the condition of Part (2) of Fact 5 exists, it will eventually be found by the backtracking search strategy. On the other side, if such a function does not exist, the program has no stable model by Part (3) of Fact 5; so the procedure correctly reports this situation by returning the value false for the variable stable. Note that the procedure makes use of the functions PUCS and GUUS, which will be defined later in this section, and of the simple functions make_list and pop, which can be thought of as built-in functions.
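The interplay of Figures 1-3 can be condensed into a small executable sketch. The code below is an illustration of the backtracking fixpoint idea in the propositional encoding used earlier (rules as (head, positive body, negated atoms) triples), not the paper's relational implementation: stable_model first reaches the fixpoint of V_P and then recursively assumes possibly unfounded conjunctions false, undoing a choice whenever a complementary pair is derived.

```python
def t_inf(program, xpos, xneg):
    derived, changed = set(xpos), True
    while changed:
        changed = False
        for head, pos, neg in program:
            if head not in derived and all(a in derived for a in pos) \
                    and all(a in xneg for a in neg):
                derived.add(head); changed = True
    return derived

def pucs(program, xpos, xneg):
    out = set()
    for head, pos, neg in program:
        if head not in xpos and all(a in xpos for a in pos) \
                and not any(a in xpos for a in neg):
            y = frozenset(a for a in neg if a not in xneg)
            if y:
                out.add(y)
    return out

def guus(program, xpos, xneg, atoms):
    s, changed = set(atoms) - set(xpos), True      # the GUS, by removal
    while changed:
        changed = False
        for head, pos, neg in program:
            bf = any(a in xneg for a in pos) or any(a in xpos for a in neg)
            if head in s and not bf and not any(a in s for a in pos):
                s.discard(head); changed = True
    return set().union(*([c for c in pucs(program, xpos, xneg) if c <= s] or [set()]))

def v_fix(program, atoms, xpos, xneg):
    """Fixpoint of V_P from (xpos, xneg); None signals a contradiction."""
    while True:
        tpos = t_inf(program, xpos, xneg)
        nneg = xneg | guus(program, tpos, xneg, atoms)
        if tpos & nneg:
            return None                            # M is not an interpretation
        if (tpos, nneg) == (xpos, xneg):
            return xpos, xneg
        xpos, xneg = tpos, nneg

def stable_model(program, atoms):
    """Positive part of a total stable model, or None (cf. Figures 1-3)."""
    m = v_fix(program, atoms, set(), set())
    return None if m is None else search(program, atoms, *m)

def search(program, atoms, xpos, xneg):
    conjs = sorted(pucs(program, xpos, xneg), key=sorted)   # alphabetic make_list
    if not conjs:
        return xpos
    for y in conjs:                                # backtracking over the choices
        m = v_fix(program, atoms, set(xpos), xneg | set(y))
        if m is not None:
            res = search(program, atoms, *m)
            if res is not None:
                return res
    return None

# Example 8's program (given below); its stable model has positive part {c, d}.
P8 = [("a", ["b"], []), ("c", [], ["a"]), ("d", ["c"], ["e", "f"]),
      ("e", [], ["d"]), ("f", [], ["d"]), ("g", ["e"], ["g"])]
print(sorted(stable_model(P8, "abcdefg")))   # ['c', 'd']
```

On Example 7's program the same search first derives a contradiction from assuming ¬a and then succeeds with {a}.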
Example 8. Consider the following program:

a ← b.
c ← ¬a.
d ← c, ¬e, ¬f.
e ← ¬d.
f ← ¬d.
g ← e, ¬g.

The procedure of Figure 1 works as follows. First, the procedure Compute_V∞ returns M = {c, ¬a}; then the procedure Compute_V̂∞ is invoked for the first time. Since PUCS_P(M) ≠ ∅, stable is set to false and, by selecting the alphabetic order, L¹ is set to <{d}, {e, f}>; moreover, N¹ = M = {c, ¬a}. (We use a superscript to identify local variables of different instances of the procedure Compute_V̂∞.) Then {d} is taken from L¹ so that L¹ becomes <{e, f}> and M = {c, e, f, ¬a, ¬d} is computed. Since M is an interpretation, the procedure Compute_V̂∞ is invoked for the second time. Since again PUCS_P(M) ≠ ∅, the list L² = <{g}> is created; so, by taking the first element, M becomes {c, e, f, g, ¬a, ¬d, ¬g} and, then, is not consistent. So the next possibly unfounded conjunction of L² is to be considered. But L² is empty; so the procedure stops and the first instance of the procedure Compute_V̂∞ is resumed. Since stable is false, the possibly unfounded conjunction {e, f} is taken from L¹ and M = {c, d, ¬a, ¬e, ¬f} is obtained. Since M is an interpretation, the procedure Compute_V̂∞ is invoked again. This time PUCS_P(M) = ∅; so stable is set to true and the procedure stops. The first instance of the procedure Compute_V̂∞ resumes and, as stable is true, it stops so that the main program can take over and output M⁺ = {c, d}. A stable model is indeed {c, d, ¬a, ¬b, ¬e, ¬f, ¬g}.

Let us finally show how the functions for computing the PUCS and the GUUS can be implemented. Definition 3 can be immediately used to devise a simple algorithm for computing the PUCS. We present this algorithm as a function (see Figure 4) that makes use of the special predicate symbol not, which is evaluated as follows: given X ⊆ B_P ∪ ¬B_P and a ground literal A, not(A) is true in X if A is not in X, otherwise it is false.
The function constructs a program P̂ by taking all the rules of P with negative literals in the body and restructuring them so that these negative literals are moved to the rule heads. The head of a rule is actually a conjunction; if the body is true in M (i.e., all positive literals are true, no negative literal is false and the head is undefined) then the whole conjunction is derived by applying the immediate consequence transformation. Possible non-undefined literals in the conjunction are removed in the next iteration.

Example 9. Consider a program having {a, b, c, d} as Herbrand universe and containing the following rule:

p(X) ← q(X), ¬s(X), ¬u(X).
Function PUCS(P : Program; M : Literals) : Set of Literals;
var PUCS1, PUCS2 : Set of Literals; C : Literals; P̂ : Program;
begin
  P̂ := ∅;
  for each r ∈ P s.t. B(r)⁻ ≠ ∅ do
    (* let B(r)⁻ = ¬A1, ..., ¬An *)
    P̂ := P̂ ∪ {¬B(r)⁻ ← B(r)⁺, not(¬B(r)⁻), not(H(r)), not(¬H(r)).}
  endfor;
  PUCS1 := T(P̂, M);
  PUCS2 := ∅;
  for each C ∈ PUCS1 do
    C := C − ¬M⁻;
    if C ≠ ∅ then PUCS2 := PUCS2 ∪ {C} endif
  endfor;
  return PUCS2
end PUCS;
Fig. 4. Function PUCS
This rule is transformed by the function PUCS of Figure 4 into the following rule:

{s(X), u(X)} ← q(X), not(s(X)), not(u(X)), not(p(X)), not(¬p(X)).
Let M = {q(a), q(b), q(c), ¬s(b), ¬u(a), ¬u(b), p(c), ¬p(d)}. By applying the transformation T to the above rule, we obtain PUCS1 = {{s(a), u(a)}, {s(b), u(b)}}. Let us now construct PUCS2. In the first conjunction {s(a), u(a)}, u(a) is in ¬M⁻; so only {s(a)} is added to PUCS2. As both s(b) and u(b) are in ¬M⁻, the next conjunction is removed. Therefore, PUCS2 = {{s(a)}}.

Definition 4 is not very effective as it is based on the definition of the GUS, whose computation we want to avoid. Therefore we shall next give a different definition of the GUUS. To this end, we need to expand the PUCS to include all undefined atoms that can be useful to prove the unfoundedness of elements in the PUCS.
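The Figure 4 computation on Example 9 can be replayed with a small Python sketch of ours. As a simplification, the grounding over the Herbrand universe and the filtering C − ¬M⁻ are folded into one pass over the transformed rule; predicate instances are encoded as (predicate, constant) pairs.

```python
UNIVERSE = ["a", "b", "c", "d"]

# ground instances of the rule  p(X) <- q(X), not s(X), not u(X)
RULES = [(("p", x), [("q", x)], [("s", x), ("u", x)]) for x in UNIVERSE]

def pucs_fig4(rules, mpos, mneg):
    """Fire  {s(X),u(X)} <- q(X), not(s(X)), not(u(X)), not(p(X)), not(-p(X))
    and drop from each derived conjunction the atoms already false in M."""
    out = set()
    for head, pos, neg in rules:
        if all(a in mpos for a in pos) and all(a not in mpos for a in neg) \
                and head not in mpos and head not in mneg:
            c = frozenset(a for a in neg if a not in mneg)   # C := C - not(M-)
            if c:
                out.add(c)
    return out

M_POS = {("q", "a"), ("q", "b"), ("q", "c"), ("p", "c")}
M_NEG = {("s", "b"), ("u", "a"), ("u", "b"), ("p", "d")}
print(pucs_fig4(RULES, M_POS, M_NEG))   # only the conjunction {s(a)} survives
```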
Example 10. Consider the program

r1 : f ← a, ¬b.
r2 : g ← c, ¬d.
r3 : b ← e.
r4 : e ← b.
and let M = {a}. Now, observing rule r1, it is easy to see that proving that b is unfounded is useful for the derivation of f (i.e., {b} is a possibly unfounded conjunction). The same reasoning does not apply to d, as c is not true in M (see rule r2). On the other hand, as b depends positively on e (see rule r3), the unfoundedness of e is needed to prove the unfoundedness of b (and vice versa). We now formalize the intuition of the preceding discussion by giving the definition of the closure of the PUCS.

Definition 7. Given a DATALOG¬ program P and X ⊆ B_P ∪ ¬B_P, the closure of PUCS_P(X), denoted by PUCS*_P(X), is defined as follows:

1. A is in PUCS*_P(X) if it is in some element of PUCS_P(X), and
2. A ∈ X̄ is in PUCS*_P(X) if there exists r ∈ ground(P) with head in PUCS*_P(X) such that B(r) is not false in X and A ∈ B(r)⁺.
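A sketch of Definition 7 in the same propositional encoding (ours; here the atoms of the possibly unfounded conjunctions are given directly, and for Example 10 with M = {a} the only possibly unfounded conjunction is {b}): starting from those atoms, the closure repeatedly adds undefined positive body atoms of rules whose head is already in it and whose body is not false in X.

```python
def pucs_closure(program, pucs_atoms, xpos, xneg):
    """PUCS*_P(X): close pucs_atoms under clause (2) of Definition 7."""
    closure, changed = set(pucs_atoms), True
    while changed:
        changed = False
        for head, pos, neg in program:
            body_false = any(a in xneg for a in pos) or any(a in xpos for a in neg)
            if head not in closure or body_false:
                continue
            for a in pos:
                if a not in xpos and a not in xneg and a not in closure:
                    closure.add(a)            # a is undefined and feeds head
                    changed = True
    return closure

# Example 10:  r1: f <- a, not b.  r2: g <- c, not d.  r3: b <- e.  r4: e <- b.
P = [("f", ["a"], ["b"]), ("g", ["c"], ["d"]), ("b", ["e"], []), ("e", ["b"], [])]
print(pucs_closure(P, {"b"}, {"a"}, set()))   # the set {b, e} (cf. Example 11)
```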
Example 11. Given the program of Example 10 and the model M = {a}, it is immediately recognized that PUCS*_P(M) = {b, e}.

Following [11], given X ⊆ B_P ∪ ¬B_P, we define the transformation F_{P,X} as follows: for each Y ⊆ B_P,

F_{P,X}(Y) = {A ∈ Y | ∀r ∈ ground(P), H(r) ≠ A ∨ B(r) is false in X ∪ ¬Y}

The following important property is proven in [8].

Fact 6. Let P be a DATALOG¬ program and X ⊆ B_P ∪ ¬B_P. Then,

GUUS_P(X) = PUCS_P(X / F_{P,X}(PUCS*_P(X))). □
Thus Fact 6 is a constructive definition of the GUUS that is suitable for a simple and efficient implementation, as shown in the function GUUS of Figure 5. Note that in this function we construct rules whose body is of the form <Q1> <Q2>, where Q1 and Q2 are two conjunctions of literals. Moreover, we introduce a function T̂ which is applied to a program and a pair of sets of literals, say X and Y, so that for every rule of the program, Q1 is evaluated w.r.t. X whereas Q2 is evaluated w.r.t. Y. Finally, F_{P,M}(Y) is computed as Y − F̂_{P,M}(Y), where

F̂_{P,M}(Y) = {A ∈ Y | ∃r ∈ ground(P), H(r) = A and B(r) is not false in M ∪ ¬Y}.
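Fact 6 can be turned into code directly; the following is our compact rendering in the propositional encoding used above, while Figure 5 gives the paper's relational version. It computes the PUCS, closes it (Definition 7), shrinks the closure by repeatedly discarding atoms that still have a usable rule (the Y − F̂_{P,M}(Y) iteration, where a positive body atom counts as false when it is still in Y), and finally keeps the conjunctions contained in the result.

```python
def pucs(program, xpos, xneg):
    out = set()
    for head, pos, neg in program:
        if head not in xpos and all(a in xpos for a in pos) \
                and not any(a in xpos for a in neg):
            y = frozenset(a for a in neg if a not in xneg)
            if y:
                out.add(y)
    return out

def closure(program, atoms0, xpos, xneg):
    cl, changed = set(atoms0), True
    while changed:
        changed = False
        for head, pos, neg in program:
            bf = any(a in xneg for a in pos) or any(a in xpos for a in neg)
            if head in cl and not bf:
                for a in pos:
                    if a not in xpos and a not in xneg and a not in cl:
                        cl.add(a); changed = True
    return cl

def f_shrink(program, y, xpos, xneg):
    """Y := Y - F^_{P,M}(Y) until fixpoint: drop atoms with a rule whose body
    is not false in M with all of Y assumed false."""
    y, changed = set(y), True
    while changed:
        changed = False
        for head, pos, neg in program:
            if head not in y:
                continue
            false_body = any(a in xneg or a in y for a in pos) \
                         or any(a in xpos for a in neg)
            if not false_body:
                y.discard(head); changed = True
    return y

def guus_fact6(program, xpos, xneg):
    cs = pucs(program, xpos, xneg)
    base = set().union(*cs) if cs else set()
    keep = f_shrink(program, closure(program, base, xpos, xneg), xpos, xneg)
    return set().union(*([c for c in cs if c <= keep] or [set()]))

# Example 12's program (given below); for M = {} the GUUS is {d}.
P = [("a", [], ["b", "c"]), ("a", [], ["d"]), ("c", [], ["d", "c"]),
     ("d", ["e", "f"], []), ("e", ["d"], [])]
print(guus_fact6(P, set(), set()))   # {'d'}
```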
Function GUUS(P : Program; M : Literals) : Literals;
var P̂, P̃ : Program; PUCS_P : Set of Literals; PUCS0, PUCS*, Y, GUUS0 : Literals;
begin
  PUCS_P := PUCS(P, M);
  PUCS0 := ∅;
  (* Compute first all elements of the possibly unfounded conjunctions *)
  for each C ∈ PUCS_P do PUCS0 := PUCS0 ∪ C endfor;
  P̂ := ∅;
  (* Construct P̂ for computing the PUCS* *)
  for each r ∈ P do
    for each A ∈ B(r)⁺ do
      P̂ := P̂ ∪ {A ← < H(r) > < not(¬B(r)), not(A) >.}
    endfor
  endfor;
  PUCS* := PUCS0;
  (* Compute the PUCS* *)
  repeat
    Y := PUCS*;
    PUCS* := T̂(P̂, Y, M)
  until Y = PUCS*;
  P̃ := ∅;
  (* Construct P̃ for computing F_{P,M}(PUCS*) *)
  for each r ∈ P do
    (* let B(r)⁺ = A1, ..., An and B(r)⁻ = ¬B1, ..., ¬Bm *)
    P̃ := P̃ ∪ {H(r) ← < H(r) > < not(¬B(r)) >.}
  endfor;
  (* Compute F_{P,M}(PUCS*) *)
  repeat
    Y := PUCS*;
    PUCS* := PUCS* − T̂(P̃, PUCS*, ¬PUCS* ∪ M)
  until Y = PUCS*;
  GUUS0 := ∅;
  (* Compute the GUUS *)
  for each C ∈ PUCS_P do
    if C ⊆ PUCS* then GUUS0 := GUUS0 ∪ C endif
  endfor;
  return GUUS0
end GUUS;

Fig. 5. Function GUUS
Example 12. Consider the following program:

a ← ¬b, ¬c.
a ← ¬d.
c ← ¬d, ¬c.
d ← e, f.
e ← d.

Let M = ∅. The function GUUS starts computing PUCS_P = {{b, c}, {d}, {d, c}}. Then it sets PUCS* = {b, c, d} and constructs the program P̂ consisting of the following rules:

e ← < d > < not(¬e), not(¬f), not(e) >.
f ← < d > < not(¬e), not(¬f), not(f) >.
d ← < e > < not(¬d), not(d) >.

At this point PUCS* is saturated through a fixpoint computation. At the first step, T̂(P̂, {b, c, d}, ∅) gives the set {b, c, d, e, f}; this set is recomputed in the next step, so it is the fixpoint. Thus PUCS* = {b, c, d, e, f}. Next the function constructs the program P̃ consisting of the following rules:

a ← < a > < not(b), not(c) >.
a ← < a > < not(d) >.
c ← < c > < not(d), not(c) >.
d ← < d > < not(¬e), not(¬f) >.
e ← < e > < not(¬d) >.

Another fixpoint computation is started. At the first step

T̂(P̃, {b, c, d, e, f}, {¬b, ¬c, ¬d, ¬e, ¬f}) = {c};

thus c is to be removed from PUCS*. As in the next step

T̂(P̃, {b, d, e, f}, {¬b, ¬d, ¬e, ¬f}) = ∅,

{b, d, e, f} is the fixpoint. Finally, the function computes the GUUS by restricting PUCS_P to {b, d, e, f}. The only possibly unfounded conjunction that is contained in {b, d, e, f} is {d}; so the GUUS is {d}. □

References
1. Apt, K., Blair, H., and Walker, A., "Towards a Theory of Declarative Knowledge", in Foundations of Deductive Databases and Logic Programming, J. Minker (ed.), Morgan Kaufmann, pp. 89-148, 1988.
2. Chandra, A., and Harel, D., "Horn Clauses and Generalization", Journal of Logic Programming 2, 1, pp. 320-340, 1985.
3. Clark, K.L., "Negation as Failure", in Logic and Data Bases, H. Gallaire and J. Minker (eds.), Plenum Press, New York, pp. 293-322, 1978.
4. Fitting, M., and Ben-Jacob, M., "Stratified and Three-valued Logic Programming Semantics", Proc. 5th Int. Conf. and Symp. on Logic Programming, MIT Press, Cambridge, MA, pp. 1054-1068, 1988.
5. Garey, M.R., and Johnson, D.S., Computers and Intractability, W.H. Freeman and Company, 1979.
6. Gelfond, M., and Lifschitz, V., "The Stable Model Semantics for Logic Programming", Proc. 5th Int. Conf. and Symp. on Logic Programming, MIT Press, Cambridge, MA, pp. 1070-1080, 1988.
7. Lloyd, J.W., Foundations of Logic Programming, Springer-Verlag, 1987.
8. Leone, N., and Rullo, P., "Safe Computation of the Well-Founded Semantics of DATALOG Queries", Information Systems 17, 1, 1992.
9. Naqvi, S.A., "A Logic for Negation in Database Systems", in Foundations of Deductive Databases and Logic Programming, J. Minker (ed.), Morgan Kaufmann, Los Altos, 1987.
10. Przymusinski, T.C., "On the Declarative Semantics of Stratified Deductive Databases and Logic Programs", in Foundations of Deductive Databases and Logic Programming, J. Minker (ed.), Morgan Kaufmann, Los Altos, pp. 193-216, 1987.
11. Przymusinski, T.C., "Every Logic Program Has a Natural Stratification and an Iterated Fixed Point Model", Proc. ACM Symp. on Principles of Database Systems, pp. 11-21, 1989.
12. Przymusinski, T.C., "Extended Stable Semantics for Normal and Disjunctive Programs", Proc. 7th Int. Conf. on Logic Programming, MIT Press, Cambridge, pp. 459-477, 1990.
13. Saccà, D., and Zaniolo, C., "Partial Models, Stable Models and Non-Determinism in Logic Programs with Negation", Proc. ACM Symp. on Principles of Database Systems, pp. 205-218, 1990.
14. Saccà, D., and Zaniolo, C., "Determinism and Non-Determinism in Logic Programs with Negation", unpublished manuscript, 1992.
15. Tarski, A., "A Lattice Theoretical Fixpoint Theorem and its Application", Pacific Journal of Mathematics 5, pp. 285-309, 1955.
16. Ullman, J.D., Principles of Database and Knowledge-Base Systems, Academic Press, 1988.
17. Van Gelder, A., "Negation as Failure Using Tight Derivations for Logic Programs", Proc. 3rd IEEE Symp. on Logic Programming, Springer-Verlag, pp. 127-138, 1986.
18. Van Gelder, A., Ross, K., and Schlipf, J.S., "Unfounded Sets and Well-Founded Semantics for General Logic Programs", Proc. ACM SIGMOD-SIGACT Symp. on Principles of Database Systems, pp. 221-230, 1988.
19. Van Gelder, A., Ross, K., and Schlipf, J.S., "The Well-Founded Semantics for General Logic Programs", Journal of the ACM 38, 3, pp. 620-650, 1991.
20. You, J-H., and Yuan, L.Y., "Three-Valued Formalization in Logic Programs: Is It Really Needed?", Proc. ACM SIGMOD-SIGACT Symp. on Principles of Database Systems, pp. 172-181, 1990.
Modules in Logic Programming: a Framework for Knowledge Management*

Annalina Fabrizio, Maurizio Capaccioli, Sandra Valeri
Systems & Management, via Vittorio Alfieri 19, 10121 Torino, Italy
Abstract. This paper describes a proposal for knowledge modularization in a logic language. Our aim is to define structuring mechanisms for the integration and composition of modules in order to provide software reusability, clean problem definition and non-standard programming techniques. The proposed mechanisms include the ability to define modules and connections between them. A program module can be combined with other modules (by using the link mechanism) to offer a programming paradigm based on the separation of the problem into independent sub-problems. A suitable set of operators on logic modules offers a kernel for a rational reconstruction of knowledge representation and programming-in-the-large techniques. Modules can be represented by using different technologies: a module can be a database or a C language program.
1 Introduction
The purpose of this paper is to describe a structuring mechanism for logic programming languages. Prolog is a rule-based logic language; its syntactic simplicity, on the one hand, allows highly declarative descriptions and, on the other hand, raises the problem of code understanding. The lack of structuring results in poorly readable programs that are difficult to maintain and to reuse. A structuring mechanism is required to organise a complex program into smaller, interrelated units. By describing the problem through separate and cooperating program modules, it is possible to specify problems of a high degree of complexity. Because each module is designed to solve a particular part of the problem, it is easier to write and to reuse. Interesting features of this approach are simplicity of description, flexibility of use and readability. There is another important issue that deserves more attention: the integration of the logic language with other programming

* Work partially supported by CNR, "PF Sistemi Informatici e Calcolo Parallelo, Sottoprogetto 5, LRC LOGIDATA+". Work completed by the authors, now employed in other companies. You can contact: Annalina Fabrizio, via U. Viale 1/A, 56124 Pisa, Italy.
languages, programming paradigms and representation technologies, in particular with a relational DBMS. The proposal has been realised through the metaprogramming approach [2], which provides a powerful and conceptually neat technique for the definition and implementation of new formalisms. Metaprograms treat other programs as data. The writing of metaprograms is particularly easy in Prolog [3,14] due to the equivalence of programs and data: both are Prolog terms. In the paper we present our modularization mechanisms. Three different points of view are considered: the structuring of a logic program into related typed modules, the composition of logic modules and the integration with modules that use different representation technologies. Some of the ideas on modularization, structuring and integration among different modules of knowledge are based on the Epsilon system [8,9,15,16] and are presented in [10]. In order to define knowledge composition we based our work on the studies for the definition of operators defining an algebra on theories and of hierarchical operators among theories [4,12,13]. A first version of this proposal can be found in [11]. In the following we sketch the paper contents. In the second section we present the basic concepts of our knowledge structuring, namely links and modules; then the shape of a structured program and four different types of link are presented, and their practical use is shown by an example. In the third section we show the algebraic operators on which the operators we propose are based; then we describe our operators and how to construct a new module with them. The fourth section deals with the integration of logic modules with C programs and a relational database; the C module and the database module are shown. Conclusions and references can be found in the fifth and sixth sections respectively.
2 Knowledge Structuring
The proposed structuring mechanisms include the ability to define modules and connections between them. A program module can be combined with other modules (by using the link mechanism) to offer a programming paradigm based on the separation of the problem into independent sub-problems. To describe our proposal we consider Prolog rules.
2.1 Logic Modules and Links
A logic module is a named set of Prolog rules. To start up the resolution of a goal it is necessary to specify in which module the goal has to be evaluated. The metapredicate
query(M, G) represents the evaluation of goal G inside the module M. For example, if M is defined as follows:
M: p(X) :- q(X).
   q(a).

the request query(M, p(a)) will be successful. A module can be related to other modules, meaning that the knowledge contained in a module is used "together" with the knowledge of another one. To this purpose we define a mechanism called link. A link is a semantic relation between modules whose effect is to define an "extended" knowledge, obtained from the cooperation of the knowledge belonging to the involved modules. Suppose we have modules M, N related by a link from M to N. Let M be defined as above and N as follows:
N: q(b).
   p(c).

A link from M to N implies that to solve a goal inside module M it is possible also to refer to the knowledge in N: the knowledge contained in module M is extended with the knowledge contained in module N. We provide a set of links (see section 2.3) in order to support several ways of extending the knowledge of a module. It is worthwhile to note that a link has a "direction" from M to N, representing an order in the consultation of M and N: module M is consulted first and then N. So a link is graphically represented by an arrow connecting the source module and the destination module of the link. The direction of a link is even more important in a "chain" of modules. In the example shown in figure 1, the consultation of the knowledge for the solution of a query passes through the chain of modules until the answer is found or all modules have been consulted.
M1 --link--> M2 --link--> ... --link--> Mn
Fig. 1. Chain of modules

Several links having the same source module and different destination modules can be defined. In order to find a solution of a query, in the example shown in figure 2, the knowledge in M1 is extended with the knowledge in M2 or in M3.
M1 --link--> M2
M1 --link--> M3
Fig. 2. M1 knowledge is extended with the knowledge in M2 or in M3

Moreover, several source modules can exploit the same destination module, as shown in figure 3. The destination module represents the knowledge shared among the source modules.
M1 --link--> M3
M2 --link--> M3

Fig. 3. M1 and M2 share the knowledge in M3
2.2 Programs

A program is defined by a pair <M, L>, where M represents a set of modules and L represents a set of links. The knowledge base dictionary describes the modules and the links among them through a set of metapredicates of the form module(A) and link(A, B, Type), where A and B are module names and Type is a predefined link type. It is possible to create/delete a link from a module M1 to a module M2 by adding/removing the metapredicate link(M1, M2, Type) to/from the dictionary. To allow the dynamic creation and deletion of links at run time, the metapredicates makelink(M1, M2, Type) and dellink(M1, M2, Type) can be used. These metapredicates can be used as any other predicate inside a clause. For example, rules contained in a module can be accessed only under a condition. Let P = <{M1, M2}, {}> be a program where M1, M2 are defined as follows:
M1: q(X) :- p(X), makelink(M1, M2, Type), r(X), dellink(M1, M2, Type).
    q(b).
    p(a).

M2: p(b).
    r(a).
    r(b).

Consider the following query to be solved: query(M1, q(X)). The link from M1 to M2 is created only if the goal p(X) is successful, hence making M2 accessible for the resolution of the r(X) predicate. Note that, during the evaluation of r(X), the program is P' = <{M1, M2}, {link(M1, M2, Type)}>; the metapredicate dellink(M1, M2, Type) establishes that the link's existence is restricted to the resolution of the goal r(X). Finally, it is worthwhile to note that modules can exchange knowledge by using the query metapredicate explicitly. The query metapredicate can appear inside a clause in a module, so it is possible to solve a goal in a given module independently of the structure of the program induced by the links.
2.3 Link Types
Different ways can be identified to relate two or more modules [10]. By means of different types of links it is possible to combine the knowledge of distinct modules, using each time the link type appropriate for a specific problem interaction. The cooperation of rules contained in different modules arises from two kinds of orthogonal modalities: open/close and use/consultance. They state in which manner we can refer to the knowledge of a related module to evaluate a goal in a source module.

Open/Close Modality. The open/close cooperation modality between modules deals with visibility: it states which part of the knowledge in the destination module has to be consulted.
open modality: all the predicates defined in the destination module are visible from the source module; in this way the knowledge contained in a module is extended with the whole knowledge contained in the related module.

close modality: only the predicates of the destination module that are not defined in the source module are visible; in this way the knowledge contained in a module is extended with the knowledge contained in the related module only for predicates that are not already defined.
Use/Consultance Modality. The use/consultance cooperation modality between modules deals with how a goal is evaluated inside a related module, namely how to consult the knowledge of the destination module.

consultance modality: the goal is evaluated in the destination module, possibly returning a value. The destination module is a "black box": it takes a goal and returns failure or success, together with the bindings for variables. The evaluation process in the destination module is hidden.

use modality: the goal evaluation in the destination module starts up the search for the goal definition in that module; the first definition found is evaluated in the source module, observing the same evaluation rule. In this case the source and destination modules have to be written in the same language.
The combination of the above cooperation modalities originates four types of knowledge structuring represented by four link types between modules: openuse, closeuse, openconsultance, closeconsultance.
For example, let P = < {M1, M2}, {link(M1, M2, Type)} >, where Type represents a generic link type and M1 and M2 are defined as follows:

M1: p(a). q(X):-r(X).
M2: q(b). r(X):-p(X).

Consider the query: query(M1, q(X)). The following table summarises the answers that can be obtained with the combinations of the two modalities:

modality   use              consultance
open       { q(a), q(b) }   { q(b) }
close      { q(a) }         { }

table 1

If the link from M1 to M2 is openuse, the solutions are { q(a), q(b) }. The answers obtained by using an openuse link are the same achievable from a "sorted"
union of the rules contained in M1 and M2, but with an openuse link we obtain better efficiency and modularity: we can use the two modules both alone and together and, in addition, they can be dynamically disconnected and connected with others. Moreover, if a goal is proved using only M1 rules, the search is done on a smaller number of rules. If the link from M1 to M2 is closeuse, the solution is { q(a) }. In this case the clause q(b) in M2 is not used in the resolution, because a clause defining q already exists in M1. In any case, with the use modality, any clause is interpreted starting from the source module, even though its definition resides in a different module. If the link from M1 to M2 is openconsultance, the solution is { q(b) }; in this case q(a) is not a solution because the evaluation of r(X) considers only the rules in M2 and therefore fails. If the link is closeconsultance, the solution is { }. In this case q(a) is not a solution, as already seen for the openconsultance link, and q(b) is not a solution since the close modality hides the additional definition of q in M2. In any case, with the consultance modality the evaluation takes place inside the destination module.

2.4
An Example
The proposed knowledge structuring mechanisms make it possible to define the knowledge concerning an entity in a module and to extend it, through links, with additional knowledge contained in separate and distinct modules. In this way it is possible to deal with specialisation and generalisation of subjects and to support exception handling. It is also possible to solve a given query directly in a specified module by using an explicit call, thus ignoring the structure of the program given by the links. Suppose the program P is defined as follows:

P = < {vialli, football_player, person, gullit, matches, teams},
      { link(vialli, football_player, openuse).
        link(vialli, person, openuse).
        link(gullit, football_player, closeuse).
        link(gullit, person, openuse).
        link(football_player, matches, closeconsultance). } >

Figure 4 shows a graphical representation of P.
[Figure: the modules vialli, gullit, football_player, person, matches and teams drawn as nodes, connected by arcs labelled with the link types listed above.]

Fig. 4. Graphical representation of P
Suppose the modules are so defined:

football_player:
nationality(italian).
address(X) :- team(T), query(teams, address(T,X)).
gain(X,Y) :- match(Y), percentage(Y,Z), takings(Y,W), X is W*Z.

teams:
address(juventus, "via... Torino").
colours(milan, rossonero).
address(milan, "via... Milano").
colours(juventus, bianconero).

person:
address(X) :- age(Y), Y < 18, father(F), query(F, address(X)).
address(X) :- wife(Y), query(Y, address(X)).
age(X) :- bornyear(Y), currentyear(Z), X is Z-Y.

vialli:
team(juventus).
match(juventus-milan).
percentage(juventus-milan, 0.001).
percentage(X, 0.005) :- awaymatch(X).

gullit:
bornyear(...).
team(milan).
match(juventus-milan).
nationality(dutch).
percentage(...).

matches:
takings(X,Y)...
gain(X,Y)...
football_player describes players of Italian football teams; person contains private data of people. We define the particular characteristics of two football players, Vialli and Gullit, in the vialli and gullit modules respectively. The openuse link between vialli and football_player expresses that anything valid for Italian players is also valid for Vialli, for example being an Italian person. The closeuse link from gullit to football_player can be used to describe that Gullit has the general characteristics of an Italian football player, but some of his properties differ from the general ones, e.g. he is not Italian. With the use modality
it is possible to describe the knowledge for generalisation/specialisation, while the close modality can be used to manage exceptions. To know how much Vialli earns, we can require the evaluation of the goal
query(vialli, gain(X, Y))

gain is not defined in the vialli module, so the evaluation refers to the knowledge of football_player, where gain(X, Y) is defined; gain(X, Y) represents the profits X of a football player on the basis of his percentage of the takings of the match Y. The resolution in module vialli of the clause defining gain(X, Y) will bind the variable Y to "juventus-milan" and the variable Z to 0.001. At this point, since no information on takings is present in vialli, the resolution of takings("juventus-milan", W) considers the football_player module knowledge. That module does not contain the information either, so the matches module is consulted, yielding the goal solution. It is worthwhile to note that in this example a consultance link is used to reach the matches module: we are interested only in the answer provided by the module, and takings is solved only inside the matches module rather than in the chain of modules starting from the vialli module. Moreover, the link to matches is close, so it is possible to redefine gain, also defined in football_player, with a different meaning, because the definition in matches can only be used locally. We can execute query(gullit, address(X)) to know the address of Gullit. This query starts the search for address in gullit and then in football_player. The evaluation of the body of address will bind the variable T to "milan" and therefore the metapredicate query(teams, address(milan, X)) is called: address is explicitly activated in teams, giving back the team address as a result. Note that if another solution for address is requested, we could obtain the address of Gullit's wife. The explicit use of the knowledge of a module through the query metapredicate can also be obtained by using the makelink and dellink predicates.
For example, in football_player the clause defining address could be substituted as follows:

address(X) :- team(T),
              makelink(football_player, teams, closeconsultance),
              address(T,X),
              dellink(football_player, teams, closeconsultance).

In this way the search for a definition of Gullit's address starts from the gullit module, while by using the query metapredicate the evaluation is immediately activated in teams.
3
Module Composition Operators
A suitable set of operators on logic modules offers a kernel for a rational reconstruction of knowledge representation and programming-in-the-large techniques. It is possible to compose the knowledge of two logic modules into a single module. This can be obtained by introducing in the language operators on modules whose
effect is to produce a new module. The user can explicitly query the new module, referring to a specific operator, through the query metapredicate:

query(M1 operator M2, Goal)

The module resulting from the evaluation of M1 operator M2 will be created and the query Goal will be evaluated in this module. To define our operators we refer to [12,13], where a set of operators defining an algebra on logic theories is provided, and to [4], where hierarchical operators between theories are defined on top of these algebraic operators. We propose operators based on the algebraic operators shown in the following. The intersection operator produces a new module that intuitively contains the shared knowledge of the argument modules. If M1 and M2 contain a common predicate p:

M1: ...
p(t1,...,tn) :- Body1.
...

M2: ...
p(s1,...,sn) :- Body2.
...

then the module M obtained by the operation intersection(M1, M2) contains:

p((t1,...,tn)μ) :- (Body1, Body2)μ

where μ is the most general unifier between (t1,...,tn) and (s1,...,sn). The union operator produces a new module whose clauses are given by the union of the clauses of the two arguments. The negation operator produces a new module containing the definition of new predicates that are the negation of the predicates of the argument. An example follows. Suppose program P = < {uefa, champions, cups}, {} >, where:

uefa:
participate(X) :- won(X, uefa, 89).
participate(X) :- classified(2, championship, 89).
participate(X) :- ...
regular_team(X) :- players(X,Y), Y>=11, Y<=22.
cups:
participate(X) :- won(X, cups, 89), competitor(X).
participate(X) :- won(X, italycup, 89).
regular_team(X) :- foreign_players(X,Y), Y<=3.

champions:
participate(X) :- won(X, champions, 89).
participate(X) :- won(X, championship, 89).
regular_team(X) :- reserves(X,Y), Y<=5.
participate(X) describes the rules under which a team X can participate in the relative tournament; regular_team(X) describes the characteristics of a candidate team for the tournament. An intersection operator makes sense between modules that share common knowledge. An intersection between uefa and cups underlines the characteristics common to both cups: it defines another module from which we can obtain the teams that can participate in both competitions. The following query:

query((uefa intersection cups), participate(X)).

produces the following temporary module:

uefa intersection cups:
participate(X) :- won(X, uefa, 89), won(X, cups, 89), competitor(X).
participate(X) :- won(X, uefa, 89), won(X, italycup, 89).
participate(X) :- classified(2, championship, 89), won(X, cups, 89), competitor(X).
participate(X) :- classified(2, championship, 89), won(X, italycup, 89).
participate(X) :- ...
regular_team(X) :- players(X,Y), Y>=11, Y<=22, foreign_players(X,Z), Z<=3.
The union operator between the uefa and champions modules creates a new module underlining the properties of a candidate team (regular_team) and the participation requirements of a team (participate) for the Uefa cup or the Champions cup. The goal:

query(uefa union champions, participate(X)).

will produce the module:
uefa union champions:
participate(X) :- won(X, uefa, 89).
participate(X) :- classified(2, championship, 89).
participate(X) :- ...
participate(X) :- won(X, champions, 89).
participate(X) :- won(X, championship, 89).
regular_team(X) :- players(X,Y), Y>=11, Y<=22.
regular_team(X) :- reserves(X,Y), Y<=5.

The negation operator on the cups module creates a new module that underlines the characteristics of non-participation of teams in the cups tournament, by means of the predicate -participate, the negation of the predicate participate. The goal:

query(negation(cups), -participate(X)).

will produce the module:

negation cups:
-participate(X) :- -won(X, cups, 89), -won(X, italycup, 89).
-participate(X) :- -competitor(X), -won(X, italycup, 89).
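The core of the intersection operator is the pairing of clauses with unifiable heads. The following sketch is my own reconstruction of that idea in Python, not the paper's implementation; clauses are encoded as (head, body) pairs with heads written as tuples (predicate, arg, ...), variables are capitalized strings, and the clauses of the two modules are assumed to be already renamed apart. No occurs check is performed, as in plain Prolog.

```python
def is_var(t):
    return isinstance(t, str) and t[:1].isupper()

def walk(t, s):
    # follow variable bindings in substitution s
    while is_var(t) and t in s:
        t = s[t]
    return t

def unify(a, b, s):
    # most general unifier of terms a and b, extending substitution s
    a, b = walk(a, s), walk(b, s)
    if a == b:
        return s
    if is_var(a):
        return {**s, a: b}
    if is_var(b):
        return {**s, b: a}
    if isinstance(a, tuple) and isinstance(b, tuple) and len(a) == len(b):
        for x, y in zip(a, b):
            s = unify(x, y, s)
            if s is None:
                return None
        return s
    return None

def subst(t, s):
    # apply substitution s to term t
    t = walk(t, s)
    return tuple(subst(x, s) for x in t) if isinstance(t, tuple) else t

def intersection(m1, m2):
    # for every pair of clauses with unifiable heads, emit the merged
    # clause p(t...)mgu :- (Body1, Body2)mgu, as in the definition above
    out = []
    for h1, b1 in m1:
        for h2, b2 in m2:
            mgu = unify(h1, h2, {})
            if mgu is not None:
                out.append((subst(h1, mgu), [subst(g, mgu) for g in b1 + b2]))
    return out
```

For example, intersecting p(a,X) :- q(X) with p(Y,b) :- r(Y) yields p(a,b) :- q(b), r(a), with μ = {Y/a, X/b}.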
3.1
Composition Operators
Now we present the set of operators we propose, based on the ones previously presented. The operators can be used only between logic modules. The cat operator produces a new module whose clauses are obtained by appending the clauses of the first module to those of the second one (it is the same as the union operator). For example, suppose program P = < {M1, M2}, {} >, where M1 and M2 are defined as follows:

M1:
p(X) :- q(X).
r(a).
p(b).

M2:
q(a).
q(X) :- r(X).

then module M = M1 cat M2 will be:
M:
p(X) :- q(X).
r(a).
p(b).
q(a).
q(X) :- r(X).

The knowledge represented in module M is the same that can be obtained by the program:

P = < {M1, M2}, { link(M1, M2, openuse) } >

We can choose one of these solutions according to the problem we want to solve: the cat operator gives better efficiency, while the openuse link gives greater modularity. Given two modules M1 and M2, the union without repetition operator produces a new module M:

M = M1 unionwr M2

that will contain:
1) All the predicates of M1 that are not defined in M2: if the clause C: p(t1,t2,...,tn) :- Body1 is in M1 and no clause with head p exists in M2, then C is in M.

2) All the predicates of M2 that are not defined in M1: if the clause C: p(t1,t2,...,tn) :- Body2 is in M2 and no clause with head p exists in M1, then C is in M.

3) All the predicates defined in both modules are treated as follows: if p(t1,t2,...,tn) :- Body1 is in M1 and p(s1,s2,...,sn) :- Body2 is in M2 and there exists a most general unifier μ of (s1,s2,...,sn) and (t1,t2,...,tn), then p((t1,t2,...,tn)μ) :- (Body1, Body2)μ is in M.
Intuitively, the resulting module M includes the knowledge common to M1 and M2, and the knowledge needed to evaluate it. This operation corresponds to defining a module M as the intersection of M1 and M2 and creating two closeuse links, the first from M to M1 and the second from M to M2. The close modality is used to hide definitions common to M1 and M2, and the use modality is chosen to access definitions in rule bodies. For example, suppose:
P = < {M1, M2}, {} >, where M1 and M2 are defined as follows:

M1:
p(X) :- q(X).
q(a).
q(f(X)) :- r(X).
r(f(c)).

M2:
s(b).
q(X) :- t(f(X)).
t(f(X)) :- t(X).
t(0).

then M = M1 unionwr M2 will be:

M:
p(X) :- q(X).
r(f(c)).
s(b).
q(a) :- t(f(a)).
q(f(X)) :- r(X), t(f(X)).
t(f(X)) :- t(X).
t(0).

Generally, we can describe general knowledge in the first module and a more specific part of the same field of knowledge in the second one, so the module resulting from the unionwr operator will contain the application of the general knowledge to the specific description. Finally, the user can explicitly query a module obtained by using the above operators through the query metapredicate:

query(M1 operator M2, Goal)

The module resulting from the evaluation of M1 operator M2 will be constructed and stored, so it will be available for further queries as long as the program remains active; the constructed module is deleted when the user leaves the program.
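The three rules of unionwr can be sketched directly. The Python below is my own reconstruction (with my own small test modules, simpler than the example above); it assumes the clause encoding used earlier in these notes, with heads as (predicate, arg, ...) tuples and capitalized variables, and it renames the second module's variables apart before unification.

```python
def is_var(t):
    return isinstance(t, str) and t[:1].isupper()

def walk(t, s):
    while is_var(t) and t in s:
        t = s[t]
    return t

def unify(a, b, s):
    a, b = walk(a, s), walk(b, s)
    if a == b:
        return s
    if is_var(a):
        return {**s, a: b}
    if is_var(b):
        return {**s, b: a}
    if isinstance(a, tuple) and isinstance(b, tuple) and len(a) == len(b):
        for x, y in zip(a, b):
            s = unify(x, y, s)
            if s is None:
                return None
        return s
    return None

def subst(t, s):
    t = walk(t, s)
    return tuple(subst(x, s) for x in t) if isinstance(t, tuple) else t

def rename(clause, tag):
    # standardize a clause apart by tagging its variables
    def r(t):
        if is_var(t):
            return t + tag
        if isinstance(t, tuple):
            return tuple(r(x) for x in t)
        return t
    h, b = clause
    return r(h), [r(g) for g in b]

def unionwr(m1, m2):
    d1 = {h[0] for h, _ in m1}
    d2 = {h[0] for h, _ in m2}
    out = [c for c in m1 if c[0][0] not in d2]   # rule 1
    out += [c for c in m2 if c[0][0] not in d1]  # rule 2
    for h1, b1 in m1:                            # rule 3: shared predicates
        if h1[0] not in d2:
            continue
        for c2 in m2:
            h2, b2 = rename(c2, "2")
            if h2[0] != h1[0]:
                continue
            mgu = unify(h1, h2, {})
            if mgu is not None:
                out.append((subst(h1, mgu), [subst(g, mgu) for g in b1 + b2]))
    return out
```

With M1 = { p(a,Y) :- q(Y).  r(c). } and M2 = { p(X,b) :- s(X).  t(d). }, the result contains r(c) and t(d) unchanged, plus the merged clause p(a,b) :- q(b), s(a).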
4
Integrating Different Kinds of Knowledge
It can be interesting to use, in a structured program, modules written in other programming languages or with other programming technologies. Modules that are the destination of a consultance link give back only bindings for variables, so a goal in a destination module can be solved by any interpreter: the representation technology of the destination module of a consultance link can differ from that used for the source module. For example, the source module can be a logic program and the destination module a program retrieving facts from a database through a database management interface. In order to define different types of modules we have to extend the description of a module in the knowledge base dictionary. It becomes:
module(M, Type)

where Type can be:
simple: logic modules without structure, namely modules interacting without exploiting links;
structured: logic modules structured with links;
database: database modules;
C: modules written in C language.
The reason for having simple modules besides structured ones is that structured modules are metainterpreted by a suitable metainterpreter, while simple modules can be interpreted directly by the underlying Prolog system: their execution is therefore more efficient. A goal in a structured logic module can be evaluated in a C module, in a database module or in a logic module (if a link to these modules exists), and this is transparent to the original logic module.
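A dictionary of this kind naturally drives a dispatch on the module type. The sketch below is my own illustration of that idea, not the prototype's code; the handler bodies are placeholder strings standing in for the Prolog engine, the metainterpreter and the DBMS interface.

```python
# knowledge base dictionary: module(M, Type) entries as a Python mapping
DICTIONARY = {"simplemod": "simple",
              "structmod": "structured",
              "company":   "database"}

def eval_simple(module, goal):
    # would be handed directly to the underlying Prolog system
    return f"prolog({module}, {goal})"

def eval_structured(module, goal):
    # would go through the link-aware metainterpreter
    return f"meta({module}, {goal})"

def eval_database(module, goal):
    # would be translated into a query for the DBMS
    return f"sql({module}, {goal})"

HANDLERS = {"simple": eval_simple,
            "structured": eval_structured,
            "database": eval_database}

def query(module, goal):
    # dispatch on the module's declared type
    return HANDLERS[DICTIONARY[module]](module, goal)
```

The caller writes query(Module, Goal) uniformly; which evaluator runs is decided by the dictionary entry, which is the transparency property described above.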
4.1 C Modules

A C module is composed of a set of functions written in the C language. It is represented by a description module containing the list of object programs that constitute the module and the list of functions associated to logic predicates. A logic module can use the functions contained in a C module with the metapredicate query(Module, P(X)), where Module is the name of the module and P is a predicate associated to a function defined in the C module. A C module can be used only as the destination module of a consultance link. The functions described in the description module are implicitly accessible only to structured modules that are the source of an openconsultance or closeconsultance link towards the C module.
4.2
Database Modules
A database module is composed of a set of tuples of a relational database. It is represented by a description module containing the list of relations and the name and type of each field; it represents the interface towards a relational database. A logic module can use the knowledge contained in a database module with the metapredicate query(Module, Q), where Module is the name of the module and Q is the query to the database. In logic programming, the database is defined to be the set of facts. However, there is nothing in the language that prohibits it from working directly on large, permanently stored relational databases: because of the natural correspondence between logic programming and the relational data model, from a conceptual point of view the integration of a relational DBMS in Prolog-based languages turns out to be particularly interesting and easy to pursue [6]. Databases are included in the proposal as modules of a particular type, called database. A database module represents the knowledge contained in a relational database: data stored in the relational database can be used from a logic program through a query in the database module. Each database module is the Prolog data dictionary of the relational database it represents. It is possible both to work with an already existing database and to create a new one. Let us see an example of a database module named company that contains three relations: personnel, offices and projects:

company:
offices( off_num :: smallint,
         location :: char(15),
         manager :: integer,
         address :: char(25),
         city :: char(25),
         state :: char(2),
         country :: char(25),
         zipcode :: char(5)).
projects( proj_num :: smallint,
          start_date :: date,
          end_date :: date,
          manager :: integer,
          description :: char(120)).
personnel( pers_num :: serial,
           fname :: char(15),
           lname :: char(20),
           soc_sec_no :: char(11),
           hire_date :: date,
           title :: char(25),
           salary :: money,
           last_review :: date,
           last_raise :: decimal,
           address :: char(25),
           city :: char(25),
           state :: char(2),
           zipcode :: char(5),
           off_num :: smallint,
           post_date :: date,
           proj_num :: smallint).

The data dictionary shows, for each relation, the name and type of each field with the syntax FieldName::FieldType, where FieldType is one of the types supported by the underlying DBMS. The system provides commands for updating both the data and the database schema. Commands which operate on the whole database are carried out as instances of operations on modules; these are the operations for creating, selecting, quitting and deleting a database. Other statements enable the user to create, modify and drop database tables. It is possible to use existing databases where views and indexes are defined. In particular, views are treated as standard relations; moreover, it is possible to define logical views through Prolog programs. The most interesting feature of the integration with a DBMS is the ability to use from a logic program data stored in a relational database. In particular, the language metapredicate query(Module, Query) allows a database module to be queried from another one (of course, database modules can only be queried from logic modules and cannot query other modules). This general mechanism allows data stored in a database to be used from logic modules. In the following, we show a logic program contained in a simple module named simplemod that performs some deductions by relating different tables and elaborating data contained in the company relational database, represented through the above company module. The program works on logic views of the database relations, built in simplemod by using the query metapredicate.
simplemod:
offices_view(Num, Manag, City) :-
    query(company, offices(Num, _, Manag, _, City, _, _, _)).
projects_view(Num, Manag) :-
    query(company, projects(Num, _, _, Manag, _)).
personnel_view(Num, Name, Salary, Off_num, Proj_num) :-
    query(company, personnel(Num, _, Name, _, _, _, Salary, _, _, _, _, _, _, Off_num, _, Proj_num)).
manager_of(Pers, Manag) :-
    projects_view(Proj, Manag), personnel_view(Pers, _, _, _, Proj).
check_project_balance(Proj, Res) :-
    projects_view(Proj, Manag), find_balance(Manag, Res).
find_balance(Manag, balanced) :- \+(check_salary(Manag)), !.
find_balance(Manag, unbalanced).
check_salary(Manag) :-
    manager_of(Pers, Manag),
    personnel_view(Pers, _, Pers_sal, _, _),
    personnel_view(Manag, _, Man_sal, _, _),
    Man_sal < Pers_sal, !.

If we write the previous example in a structured module named structmod, linked to company with a closeconsultance link, the logical views can be defined as follows:

structmod:
offices_view(Num, Manag, City) :- offices(Num, _, Manag, _, City, _, _, _).
projects_view(Num, Manag) :- projects(Num, _, _, Manag, _).
personnel_view(Num, Name, Salary, Off_num, Proj_num) :-
    personnel(Num, _, Name, _, _, _, Salary, _, _, _, _, _, _, Off_num, _, Proj_num).
The rest of the program is the same as before, and the knowledge base dictionary of this knowledge base looks as follows:

module(company, database).
module(structmod, structured).
link(structmod, company, closeconsultance).
As you can see, it is no longer necessary to explicitly call the database module company from the logic program, since data contained in the inherited module are automatically retrieved. The logic program is independent of the name of the inherited module and is not affected by the fact that the inherited module is a database module: it uses in the same way both logic predicates and predicates which correspond to database relations. In this sense the integration between Prolog and the DBMS becomes completely transparent. Another type of link which can be used by structured modules to access database modules is the openconsultance link. In this case, a predicate can be defined partly in a structured module and partly in a database module, e.g. there may exist facts in the structured module with the same functor and arity as a database relation, and they will be treated by the structured module in the same way.
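A view clause of the kind shown above maps argument positions onto relation fields: named variables select columns, anonymous variables are ignored, and constants become selection conditions. The sketch below is my own illustration of one plausible goal-to-SQL mapping, not the prototype's actual translation scheme.

```python
def is_var(a):
    # Prolog convention: "_" is anonymous, capitalized names are variables
    return isinstance(a, str) and (a == "_" or a[:1].isupper())

def goal_to_sql(table, fields, args):
    """Translate a goal like offices(Num,_,Manag,_,City,_,_,_) into SQL,
    given the relation's ordered field names from the data dictionary."""
    assert len(fields) == len(args)
    cols, conds = [], []
    for f, a in zip(fields, args):
        if a == "_":
            continue                       # anonymous: not projected
        if is_var(a):
            cols.append(f)                 # named variable: projected column
        else:
            conds.append(f"{f} = '{a}'")   # constant: selection condition
    sql = f"SELECT {', '.join(cols) or '*'} FROM {table}"
    if conds:
        sql += " WHERE " + " AND ".join(conds)
    return sql
```

For instance, the offices_view goal above would map to SELECT off_num, manager, city FROM offices; a constant in an argument position would add a WHERE condition instead.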
5
Conclusions
Rule programming languages have characteristics of declarativity and simplicity of learning and use; however, they have problems concerning readability and flexibility in code management. These disadvantages are caused mainly by the flat structure of logic programs, and they become more noticeable when complex problems are considered. The proposed solution is to organise a logic program into separately defined modules and to relate them. Using this approach, it is also possible to integrate a logic language with other paradigms in a simple manner; finally, non-standard programming techniques are made available through module composition operators. A prototype exists [6] that implements the proposed mechanisms. The prototype is written in Prolog using metaprogramming techniques, and it provides a programming environment allowing the user to define the structure of a program, with facilities to create and destroy links, modules and programs.
References

[1] P. Atzeni, F. Cacace, S. Ceri, L. Tanca. The LOGIDATA+ model. Technical Report Logidata+ 5120, July 1990.
[2] K. A. Bowen and R. A. Kowalski. Amalgamating language and metalanguage in logic programming. In K. L. Clark and S. A. Tarnlund (eds.), Logic Programming, Academic Press, 1982, pages 153-172.
[3] K. A. Bowen and T. Weinberg. A meta-level extension of Prolog. In 1985 IEEE Symposium on Logic Programming, IEEE Computer Society Press, 1985, pages 48-53.
[4] A. Brogi, P. Mancarella, D. Pedreschi, F. Turini. Hierarchies through Basic Meta-Level Operators. In Proc. Workshop on Metaprogramming in Logic, META 90, Leuven, Belgium, May 1990.
[5] S. Ceri, G. Gottlob, L. Tanca. Logic Programming and Databases. Springer-Verlag, 1990.
[6] P. Coscia, A. Fabrizio. Moduli in Logidata+: il prototipo. Technical Report Logidata+ 5/78, August 1991.
LOA: the LOGIDATA+ Object Algebra

Umberto Nanni(1), Silvio Salza(2,3), Mario Terranova(3)

(1) Università di L'Aquila, Dipartimento di Matematica Pura ed Applicata, Coppito, L'Aquila, Italy
(2) Università di Roma "La Sapienza", Dipartimento di Informatica e Sistemistica, via Salaria 113, I-00198 Roma, Italy
(3) Consiglio Nazionale delle Ricerche, Istituto di Analisi dei Sistemi ed Informatica, viale Manzoni 30, I-00185 Roma, Italy
Abstract. In this paper we present the Logidata Object Algebra (LOA), an algebra for complex objects which has been developed within LOGIDATA+, a national project funded by the Italian National Research Council, as an internal language in a prototype system for the management of extended relational databases with complex object types. LOA is a set-oriented manipulation language which was conceived as an internal language for a prototype system. More precisely, it is supposed that user programs, expressed in the external high level language, have to be compiled into LOA programs, and then processed by an underlying Object Manager. In other words, LOA is supposed to play in our prototype the same role as the relational algebra in a relational system with a very high level user environment: that is, providing a suitable formal framework to develop efficient access to mass storage and powerful query optimization strategies. The algebra refers to a data model that includes structured data types and object identity, thus allowing both classes of objects and value-based relations. LOA must support a rule based language with possibly recursive programs and limited forms of negation. LOA programs explicitly include a FIXPOINT operator over a set of equations. Another original feature of the algebra is the ability to cope with object identity and, at the same time, to preserve the capability of handling value based complex structures without identity. This is obtained by extending the semantics of the classical operators of the relational algebra to deal with the LOGIDATA data structures, and by introducing additional operators, such as type conversion and oid invention. The paper also briefly discusses some implementation issues of the FIXPOINT operator and the other algebraic primitives.
1
Introduction
In the last decade the relational model proposed by Codd [14] has been widely adopted for standard applications, because of its simple and uniform structure and of the large degree of independence between the logical and physical level.
However, the relational approach has proved unsatisfactory for several non-conventional applications, such as CAD, CASE, office automation, multimedia databases and knowledge bases. In all these cases the flat structure of the model makes the representation of complex and structured data clumsy. Several proposals have been made to extend the relational model, both by overcoming the restrictions imposed by First Normal Form [18, 15, 27, 26] and by introducing the concept of object identity [20, 21, 19, 4, 28]. For such extended models several authors have presented query and manipulation languages based on calculus [8], on algebra [2, 15, 1] and on logic [3]. Dealing with the LOGIDATA language and model requires supporting a rich and expressive environment, that is, a rule based language in conjunction with an object oriented data model. In this context the conceptual distance between the user (external) language and the implementation level, both in terms of data structures and language constructs, can be tackled by properly choosing some intermediate level in query processing (the internal language), into which user transactions are compiled before proceeding with the execution on physical data. Of course such an internal language must have at least the same expressive power as the external language. Our choice is to adopt an intermediate set-oriented manipulation language that will play in our prototype a role equivalent to that of the relational algebra in a relational system, that is, providing a suitable framework to develop efficient access methods for data handling and storage, and powerful query processing strategies. This approach makes sense especially when the object oriented system is aimed at traditional database applications, where the typical transaction involves large sets of objects.
Indeed, for this class of applications relational systems already perform quite well, and the motivation for using the object oriented model lies mostly in making both the user language and the design phase more natural and, in general, in having a more direct representation of the real world in the schema. Therefore in such a context the relational model can still be used as an internal level of representation, by mapping the object schema into a relational schema. According to these ideas a prototype system has been developed within LOGIDATA+, a national project funded by the Italian National Research Council, that has the goal of integrating logic programming and extended relational databases with complex data types. In general, an approach to database query and management based on an intermediate language may have the disadvantage of making the optimization process less effective, since it must be broken into two distinct phases, but it has a number of advantages. A good level of modularity can be achieved in the design of the global architecture: this has the obvious consequence of separating different kinds of problems (source code interpretation and data processing) into different modules and, more interestingly, it allows very fast and cheap experimentation with different solutions. This seems to be a basic requirement in
an experimental context in which both the user language and the data model can be subject to possible adjustments, and in which, furthermore, different hardware/software environments can be tested for prototyping. Moreover, the existence of such an intermediate language (in turn subject to possible extensions) might point out the consequences, in terms of practical performance, of different choices at the user language level and in the implementation of lower level primitives. Let us consider two possible choices for an intermediate (internal) language, characterizing the underlying data management support. In both cases they consist of a traditional environment enriched by several features in order to deal with the expressive power of the external language:
- a rule based language on a flat (relational) data model, that is a sort of datalog, supporting tuple identity and limited forms of negation;
- a procedural algebraic language on an object oriented data model, in which additional features must be embodied to handle recursive queries.

We remark that in the case of a flat relational data model it has been shown that these two approaches - generalized versions of datalog and relational algebra - may lead to the same expressive power, provided that suitable enhancements and restrictions are defined, such as, for example, a limited form of negation (or complementation) and the absence of functions producing new data values not included in the active domain of the database. Moreover, this can be achieved with reasonable (polynomial) computational complexity (see, for example, [12]). Unfortunately, in the case of a more complex data model this is not true, and the problem can reach an arbitrary complexity [22, 17] and even become undecidable [4]. In this paper we focus on the algebraic approach, consisting in a suitable extension of the relational algebra to support the features of the LOGIDATA data model and external language. The Logidata Object Algebra (LOA) extends the relational algebra in several aspects, incorporating and extending ideas and features from heterogeneous sources [6, 18, 15, 27, 2, 26, 28]:
- the semantics of the classical operators of the relational algebra are suitably redefined in order to deal with the richer object oriented data model;
- more operators are included to build and navigate complex data structures;
- procedural features are included, such as the FIXPOINT operator, in order to deal with recursive programs.

In this paper the problem of extracting data from an existing database is considered, and more precisely the problem of expressing recursive queries in an object oriented environment by an algebraic language. This extends the approach that the authors presented in [24] to deal with recursion. The rest of this paper is organized as follows. In the next section the basic elements of the formal definition of the LOGIDATA+ model are summarized. In section 3 an overview of the features of LOA is given, together with its basic semantics.
In Section 4 we introduce the language used to express complex conditions inside the operators of the extended relational algebra. These operators are discussed in Section 5, and in the following one we introduce the structure and conversion operators that explicitly handle transformations between objects and values.
2 The Data Model
LOA provides set-oriented operators to manipulate collections of complex objects and structured values, which are defined according to the LOGIDATA+ model [7], whose main features are the following:
- it allows the definition of types, functions, and two kinds of data collections: the class (based on object identity) and the relation (value based);
- data are basically structured with the tuple, set, and sequence constructors;
- an isa hierarchy with multiple inheritance can be defined on the set of classes.
Given a finite set of domains D_1, ..., D_D with domain names 𝒟_1, ..., 𝒟_D, a countable domain of object identifiers Ω, and a countable set of attribute names A_1, A_2, ..., we refer to a database schema composed of the following elements:
- a finite set of type names, or shortly types, θ_1, ..., θ_O, and the corresponding type definitions;
- a finite set of classes C_1, ..., C_C with names 𝒞_1, ..., 𝒞_C;
- a finite set of relations R_1, ..., R_R with names ℛ_1, ..., ℛ_R;
- an ISA hierarchy between the classes.
The domain Ω contains the object identifiers associated to the objects, each of which has an identifier that is unique in the whole database. Classes are collections of objects of the same type, and relations are collections of structured values. Type definitions make it possible to build structured types from the basic types associated to the domains and, for object types, to add the identity. A value set, i.e. the set of all possible values, is associated to each type. More formally:
- A type θ is either a value type τ or an object type ω.
- A domain name 𝒟_i is a value type and the corresponding value set is D_i.
- If θ_1, ..., θ_n are types with value sets V_1, ..., V_n, then τ = (A_1 : θ_1, ..., A_n : θ_n) defines a tuple type τ, with value set V_τ = V_1 × ... × V_n. Round brackets denote the tuple constructor.
- If θ is a type and V is the corresponding value set, then τ = {θ} defines a set type τ with value set V_τ = PART(V), i.e. the powerset of V. Curly brackets denote the set constructor.
- If θ is a type and V is the corresponding value set, then τ = ⟨θ⟩ defines a sequence type τ with value set V_τ = SEQ(V), i.e. the set of all the sequences over V. Angle brackets denote the sequence constructor.
- If τ is a tuple value type with value set V, and C is a class name, then ω = [C, τ] is an object type, and the corresponding value set is V_ω = Ω × V.
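The type constructors above can be sketched in Python. This is a hypothetical encoding of our own: the class names (TupleType, SetType, etc.) are illustrative and do not appear in the paper.

```python
from dataclasses import dataclass
from typing import Tuple

@dataclass(frozen=True)
class DomainType:            # a basic value type, e.g. String or Integer
    name: str

@dataclass(frozen=True)
class TupleType:             # (A1: t1, ..., An: tn) -- round brackets
    attrs: Tuple[Tuple[str, object], ...]

@dataclass(frozen=True)
class SetType:               # {t} -- curly brackets, value set is the powerset
    elem: object

@dataclass(frozen=True)
class SeqType:               # <t> -- angle brackets, all finite sequences
    elem: object

@dataclass(frozen=True)
class ObjectType:            # [C, t] -- a class name paired with a tuple type
    class_name: str
    tup: TupleType

# The person type of the sample schema, built bottom-up:
String, Integer, Sex = DomainType("String"), DomainType("Integer"), DomainType("Sex")
t_name = TupleType((("givennames", SeqType(String)), ("lastname", String)))
t_pers = TupleType((("name", t_name), ("sex", Sex), ("age", Integer)))
w_pers = ObjectType("Person", t_pers)
```

Frozen dataclasses make the type values hashable, which matches the paper's treatment of types as abstract identities.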
Fig. 1. A sample schema. [Diagram omitted; it shows the type structure of the sample schema: FAMILY(father, mother, children, address), a person tuple (name, sex, age), EMPLOYEE(position) with (department, salary), Curriculum(#stud, exams) with exam tuples (course, grade), Name(givennames, lastname), and the basic types String and Integer.]
Each relation is defined on a value type and each class is defined on an object type. More precisely, as we shall see below, there is a one-to-one correspondence between object types and classes. We may now introduce the notion of refinement as a partial order relationship on the set of types, according to the following definition. A type θ (either an object type or a value type) is a refinement of a type θ′ (in symbols θ ⪯ θ′) if and only if one of the following conditions holds:
- θ = θ′;
- θ = ω = [C, τ], θ′ = ω′ = [C′, τ′], with τ ⪯ τ′;
- τ = (A_1 : θ_1, ..., A_{k+p} : θ_{k+p}), τ′ = (A_1 : θ′_1, ..., A_k : θ′_k), with θ_i ⪯ θ′_i, for 1 ≤ i ≤ k.
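The refinement relation above can be sketched as a recursive check. This Python sketch restates minimal type representations of our own (all names illustrative); the tuple clause lets the refining type add attributes while refining the shared ones pairwise.

```python
from dataclasses import dataclass
from typing import Tuple

@dataclass(frozen=True)
class TupleType:
    attrs: Tuple[Tuple[str, object], ...]   # ordered (name, type) pairs

@dataclass(frozen=True)
class ObjectType:
    class_name: str
    tup: TupleType

def refines(t, u):
    """True if type t is a refinement of type u (t <= u)."""
    if t == u:
        return True
    if isinstance(t, ObjectType) and isinstance(u, ObjectType):
        return refines(t.tup, u.tup)        # [C,tau] <= [C',tau'] iff tau <= tau'
    if isinstance(t, TupleType) and isinstance(u, TupleType):
        a = dict(t.attrs)
        # t may add attributes; every attribute of u must be refined in t
        return all(k in a and refines(a[k], v) for k, v in u.attrs)
    return False

t_pers = TupleType((("name", "String"), ("age", "Integer")))
t_stud = TupleType((("name", "String"), ("age", "Integer"), ("curriculum", "Curr")))
# refines(t_stud, t_pers): t_stud adds an attribute and keeps the rest
```

Basic types are modeled as plain strings here, so their refinement reduces to equality, as in the first clause of the definition.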
3 Basic Features of the Logidata Object Algebra
User programs are written in the LOGIDATA+ language, a rule-based language on an object-oriented data model, supporting identity. A stratified form of negation is allowed in the user external language. If a logic program is stratified, it is possible to rewrite it as a set of formulas which are monotonic, i.e. a repeated application of the formulas increases the collection of data stored in the variables. We define a stratified semantics for our FIXPOINT operator; in fact, deciding whether a set of equations is stratifiable is computationally easy. It would also be possible to adopt an inflationary semantics: this approach, used in [4, 9], is strictly more expressive than the stratified semantics [12] and has been proved to coincide with the least fixpoint [16] in the case of finite structures. Nevertheless, in the case of a more complex data model (including oid invention and functions with infinite domain), stratification is not a sufficient condition for a fixpoint of a set of equations to exist [4], and the computation may not terminate. The study of satisfactory solutions for this problem deserves further research effort. In the seminal paper [6] several ways of introducing a fixpoint operator (or equivalent constructs) within a database language are considered. This has been
Domain names: String; Integer; Sex.
Value types:
τ_name = (givennames : ⟨String⟩, lastname : String);
τ_pers = (name : τ_name, sex : Sex, age : Integer);
τ_stud = (name : τ_name, sex : Sex, age : Integer, curriculum : τ_curr);
τ_curr = (#stud : Integer, exams : {(course : String, grade : Integer)});
τ_empl = (name : τ_name, sex : Sex, age : Integer, position : τ_pos);
τ_pos = (department : String, salary : Integer);
τ_fam = (father : ω_pers, mother : ω_pers, children : ⟨ω_pers⟩, addr : String).
Object types: ω_pers = [Person, τ_pers]; ω_stud = [Student, τ_stud]; ω_empl = [Employee, τ_empl].
Relations: Family : τ_fam.
Classes: Person; Student; Employee.
ISA hierarchy: Student ISA Person; Employee ISA Person.

Fig. 2. Type definitions for the sample schema
done with quite different approaches, such as the alpha operator [5] or the pointwise recursion [17]. In [12] an overview of the topic is given, comparing the expressive power and complexity of the various constructs. Examples of an approach integrating a rule-based language and an object-oriented data model are IQL [4] and LOGRES [9]. An algebraic approach integrating recursion and a complex data model can be found in [13].

In the following we provide a very high level description of the overall processing of user programs, clarifying the role of LOA in our approach. At the user (external) level, a program is expressed in a rule-based declarative language; a compiler then determines its structure and stratification, using standard techniques (see for example [29, 11]), and eventually rewrites it as an equivalent LOA program. A LOA program is a sequence of blocks. Each block can be a single LOA algebraic equation or a fixpoint block. Each fixpoint block consists of a FIXPOINT operator applied to a set of LOA algebraic equations. These blocks are sequentially evaluated by the LOA processor, which maintains the mapping between the object schema and the relational schema, and translates the object algebra expressions into relational algebra with conditioned loop constructs to compute the stratified fixpoint of the block, as shown, for example, in [4, 9].

A more detailed description of a LOA program is the following. A LOA program P is a sequence of fixpoint blocks: P = (B_1, B_2, ..., B_m), where the generic block B_k is either a single equation R_k or is built up by a FIXPOINT operator applied to a set of equations: (FIXPOINT{R_{k,1}, R_{k,2}, ..., R_{k,n_k}}). Each equation R_{k,i} has the form V_{k,i} = ℰ_{k,i}, where the left hand side is a typed variable and the right hand side ℰ_{k,i} is a LOA expression on typed variables and constants.
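The program structure just described (blocks, equations, typed variables) can be represented with a few container types. The following Python sketch is an illustrative encoding of our own, with variables modeled as sets of tuples:

```python
from dataclasses import dataclass
from typing import Callable, Dict, FrozenSet, List

Env = Dict[str, FrozenSet]                  # variable name -> current value

@dataclass
class Equation:                             # R_{k,i}: V_{k,i} = E_{k,i}
    target: str                             # the typed variable on the left
    expr: Callable[[Env], FrozenSet]        # the LOA expression on the right

@dataclass
class Block:                                # a single equation or a FIXPOINT
    equations: List[Equation]
    fixpoint: bool = False

@dataclass
class Program:                              # P = (B_1, ..., B_m)
    blocks: List[Block]

# A two-block program: a plain equation, then a recursive FIXPOINT block
# computing the transitive closure of the relation defined by the first.
prog = Program([
    Block([Equation("Edges", lambda env: frozenset({(1, 2), (2, 3)}))]),
    Block([Equation("TC", lambda env: env["Edges"] | frozenset(
        (a, d) for (a, b) in env["TC"] for (c, d) in env["Edges"] if b == c))],
        fixpoint=True),
])
```

The stratification constraint of the paper is a restriction on which variables each `expr` may mention; it is not enforced by this sketch.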
The type of each variable can be a class type or a relation type, and a constant is a class or relation in the knowledge base. A variable V_{h,j} can occur in the right hand side of a rule R_{k,i} : V_{k,i} = ℰ_{k,i} if it occurs in the left hand side of some other equation, either in the same block or in a previous block, that is if h ≤ k (thus including the variable V_{k,i} itself). In the case where a block B_k consists of a single equation R_k : V_k = ℰ_k, then ℰ_k can only contain references to variables in the previous blocks. Moreover, since the program is stratified, if a variable V_{h,j} appears as the second term of a difference operator (hence coming from a negated predicate) it cannot be defined in the same block, but only in the previous ones. The structure of a LOA program can be summarized as follows:
( FIXPOINT{ V_{1,1} = ℰ_{1,1}(V_{1,1}, ..., V_{1,n_1})
            V_{1,2} = ℰ_{1,2}(V_{1,1}, ..., V_{1,n_1})
            ...
            V_{1,n_1} = ℰ_{1,n_1}(V_{1,1}, ..., V_{1,n_1}) }
  FIXPOINT{ V_{2,1} = ℰ_{2,1}(V_{1,1}, ..., V_{1,n_1}, V_{2,1}, ..., V_{2,n_2})
            ...
            V_{2,n_2} = ℰ_{2,n_2}(V_{1,1}, ..., V_{1,n_1}, V_{2,1}, ..., V_{2,n_2}) }
  ...
  V_k = ℰ_k(V_{1,1}, ..., V_{k-1,n_{k-1}})
  ...
  FIXPOINT{ V_{m,1} = ℰ_{m,1}(V_{1,1}, ..., V_{1,n_1}, ..., V_k, ..., V_{m,1}, ..., V_{m,n_m})
            ...
            V_{m,n_m} = ℰ_{m,n_m}(V_{1,1}, ..., V_{1,n_1}, ..., V_k, ..., V_{m,1}, ..., V_{m,n_m}) } )
In the execution phase each block is evaluated by the LOA processor in sequence, according to the stratified fixpoint. A simple description of the inflationary fixpoint and a comparison with other fixpoint computations in the case of the flat relational model is given in [12], and a more detailed presentation can be found in [11]. Not much is known about fixpoint computations in the case of a more complex data model (see, for example, [4, 9]). In our case the fixpoint of a LOA program is computed by interpreting each block as a cycle containing a sequence of assignments on the variables in the left hand side of each equation. We remark that the presence of several fixpoint blocks (and several equations within each block) should not increase the expressive power of the language (see [16] for the case of a flat model), but it makes the description of a LOA program more flexible and natural; furthermore, it simplifies the compilation process, that is the translation of user queries from the external language to LOA.

In the following sections we specify the syntax and semantics of the operators used in LOA expressions. As far as the algebraic operators are concerned, we generalize previous proposals. These basically refer to the Nested Relation model [15, 27, 2], having only the relation and tuple constructors, thus producing a recursive schema with a tree structure. Abiteboul and Beeri [1] consider a set constructor, adding to the algebra a powerset operator to reach the expressive power of the domain independent calculus. In our case, besides considering different kinds of constructors, the main original contributions are related to the need to deal with object identity and, at the same time, to preserve the capability of handling value based complex structures without identity. These features are not considered in most of the previous proposals, which referred to strictly value based models.
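The cycle-of-assignments interpretation of a fixpoint block can be sketched as a naive iteration in Python. The representation of variables as sets of tuples is our simplification; all names are illustrative.

```python
def eval_fixpoint_block(equations, env):
    """equations: list of (var, fn) pairs, fn(env) -> frozenset.
    Repeats the whole sequence of assignments until no variable changes."""
    for var, _ in equations:
        env.setdefault(var, frozenset())    # empty initialization
    changed = True
    while changed:
        changed = False
        for var, fn in equations:
            new = fn(env)
            if new != env[var]:
                env[var], changed = new, True
    return env

# Transitive closure of a fixed edge relation as a one-equation block:
edges = frozenset({(1, 2), (2, 3), (3, 4)})
env = eval_fixpoint_block(
    [("TC", lambda e: edges | frozenset(
        (a, d) for (a, b) in e["TC"] for (c, d) in edges if b == c))],
    {})
# env["TC"] now also contains the derived pairs (1, 3), (2, 4) and (1, 4)
```

Because the expression is monotone over a finite domain, the loop is guaranteed to terminate; as the paper notes, with oid invention or infinite function domains this guarantee is lost.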
Object identifiers are treated as special atomic values that cannot be directly accessed, and have to be preserved in the relational operations. This had to be taken into account in the definition of the semantics of the operators, and demanded additional operators to convert objects into structured values and vice versa. The extension of the operators of the relational algebra is attained by introducing in the conditions a set of navigational operators to move through complex data structures, e.g. to take components from structures, to extract elements from sequences, and to transform data types. Nest and Unnest operators are defined as in [18, 26]. These allow, respectively, the grouping of several tuples by introducing a set constructor over an attribute or group of attributes, and the distribution of the elements of a set attribute over a set of tuples. Similar operators are also provided for the tuple constructor. We may point out that, due to the introduction of the navigational operators in the conditions, the role of the Nest and Unnest is mainly restricted to the
restructuring of data. Further primitives are defined to transform objects or their components into tuples and vice versa. The latter operation generates new objects and requires object invention.
4 Conditions on Structured Types
To extend the operators of relational algebra we need to extend the definition of the conditions to deal with structured data types. In the relational algebra, selection and θ-join operators require as an additional argument a clause or condition, that is a predicate which must be satisfied by the tuples that contribute to build the resulting relational table. In the most general case a predicate is a boolean formula over simple predicates, and each of them has an operator (a binary predicate) and two atomic values as operands, which are either two attributes of the tuple(s) or one attribute and a constant (only in the case of a selection clause). In the more complex LOGIDATA model, a clause is used in selection and join as well, and it has the same basic structure, consisting of a boolean formula over simple predicates. In this case the operands of the simple predicates are not necessarily atomic values but can have a structure of arbitrary complexity.

We introduce the notion of derived component, which is an expression that can be used in the conditions to denote an operand extracted from an object or a tuple: in general it is a structured value that is a part of it or can be built up from it. Formally, given a relation ℛ or a class C of type θ:
- If A is an attribute of type θ then A denotes a derived component from A of type θ.
- If A is an attribute of type θ, with corresponding value type τ = (A_1 : θ_1, ..., A_k : θ_k), and E_i is a derived component from A_i of type θ_E, then A.E_i is a derived component from attribute A of type θ_E.
- If A is an attribute of type {θ_B}, with corresponding value type τ_B = (A_1 : θ_1, ..., A_k : θ_k), and E_i is a derived component from A_i of type θ_E, then A.E_i is a derived component from attribute A of type {θ_E}.
- If A is an attribute of type ⟨θ_B⟩, with corresponding value type τ_B = (A_1 : θ_1, ..., A_k : θ_k), and E_i is a derived component from A_i of type θ_E, then A.E_i is a derived component from attribute A of type ⟨θ_E⟩.
- If E denotes a derived component of type θ_E, with an associated value type τ_E, then:
  - if τ_E = {{θ_B}} then FLAT(E) denotes a derived component of type {θ_B};
  - if τ_E = ⟨⟨θ_B⟩⟩ then FLAT(E) denotes a derived component of type ⟨θ_B⟩;
  - if τ_E = ⟨θ_B⟩, then POS(E, n) denotes a derived component of type θ_B (the n-th element in the sequence), and SET(E) denotes a component of type {θ_B}.
- If ℛ is a relation of type τ = (A_1 : θ_1, ..., A_k : θ_k), and E_i is a derived component from A_i of type θ_E, then ℛ.E_i is a derived component from the relation ℛ of type θ_E, and ℛ denotes a derived component of type τ, i.e. a tuple in the relation.
- If C is a class of type θ with associated value type τ = (A_1 : θ_1, ..., A_k : θ_k), and E_i is a derived component from A_i of type θ_E, then C.E_i is a derived component from the class C of type θ_E, and C denotes a derived component of type θ, i.e. an object in the class.
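The way a derived component navigates nested data can be sketched in Python, with tuple types as dicts, set types as frozensets, and sequence types as Python tuples. This encoding is our own, not the paper's.

```python
def component(value, path):
    """Follow a dotted attribute path; navigation maps over the set and
    sequence constructors, mirroring the derived-component rules above."""
    for attr in path.split("."):
        if isinstance(value, frozenset):    # {..}: the result is again a set
            value = frozenset(v[attr] for v in value)
        elif isinstance(value, tuple):      # <..>: the result is again a sequence
            value = tuple(v[attr] for v in value)
        else:                               # plain tuple type: projection
            value = value[attr]
    return value

family = {"father": {"name": {"lastname": "Verdi"}},
          "children": ({"name": {"lastname": "Verdi"}},
                       {"name": {"lastname": "Rossi"}})}
# component(family, "father.name.lastname") == "Verdi"
# component(family, "children.name.lastname") == ("Verdi", "Rossi")
```

Note how navigating through the sequence-valued `children` attribute produces a sequence of results, exactly as the fourth rule prescribes; FLAT, SET and POS are omitted for brevity.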
With reference to the schema in Figures 1 and 2, the following are examples of components:
- Student: denotes an object of type ω_stud in the class Student.
- Student.curriculum.exams: denotes the set of couples (course, grade) of an object of the class Student.
- Family.children.name.givennames: denotes the sequence of sequences of givennames of the children of a given family. To get the set of names one should write: Family.FLAT(SET(children.name.SET(givennames))).
Components are used to build conditions, i.e. boolean predicates based on comparisons between components and constants of a compatible type. Formally:
- If E_1 and E_2 are components of value type τ_1 and τ_2, with value sets V_1 = V_2 = V, then E_1 op E_2 and E_1 op v are conditions, where op is a comparison operator and v ∈ V.
- If E_1 and E_2 are components of object type ω_1 and ω_2, then E_1 = E_2 and E_1 ≠ E_2 are conditions.
- If E_1 is a component of type θ and E_2 is a component of type {θ}, then E_1 ∈ E_2 and E_1 ∉ E_2 are conditions.
- Every boolean expression whose terms are conditions is a condition.
Note that only equality and inequality comparisons are allowed between components of object type, and that comparison may take place between objects of different types. In all these cases the comparison is based on the object identity.
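Conditions of this kind can be sketched as predicate combinators over components. The helper names below (`cmp`, `AND`) are hypothetical illustrations, not part of LOA.

```python
import operator

def cmp(get1, op, get2):
    """A simple condition comparing two derived components (or constants)."""
    return lambda x: op(get1(x), get2(x))

def AND(p, q): return lambda x: p(x) and q(x)
def OR(p, q):  return lambda x: p(x) or q(x)

stud = {"age": 23, "exams": frozenset({"fine arts", "algebra"})}

# The condition "age = 23 AND 'fine arts' in exams" as a boolean formula
# over two simple predicates:
cond = AND(cmp(lambda s: s["age"], operator.eq, lambda s: 23),
           cmp(lambda s: s["exams"], operator.contains, lambda s: "fine arts"))
```

`cond` can then be handed to a selection operator, which applies it to every element of a relation or class.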
5 Extending the Operators of Relational Algebra
The traditional operators of the relational algebra, SELECT, PROJECT and JOIN, are extended in two ways. First, more powerful conditions are allowed, using the framework introduced in the previous section. The other extension concerns the definition of the type of the result according to the type of the operands and the structure of the conditions. The SELECT extracts from a relation or a class the subset of elements satisfying a given condition. Formally:
- If ℛ is a relation of type τ and f a condition on the type τ, then ℛ′ ← SELECT(ℛ; f) is a relation of type τ.
- If C is a class of type ω = [C, τ] and f a condition on the objects of C, then C′ ← SELECT(C; f) is a class of type ω′ = [C′, τ].
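The SELECT semantics just defined can be sketched in Python, with a relation as a frozenset of tuples and a class as an oid-to-value dict. This encoding is our own simplification.

```python
def select(collection, cond):
    """SELECT keeps the elements satisfying cond; a class keeps its oids,
    so the result is a subclass with the same tuple type."""
    if isinstance(collection, dict):                    # class: oid -> value
        return {oid: t for oid, t in collection.items() if cond(t)}
    return frozenset(t for t in collection if cond(t))  # relation

person = {"oid1": ("Ann", 34), "oid2": ("Bob", 12), "oid3": ("Eve", 51)}
adults = select(person, lambda t: t[1] >= 18)
# adults == {"oid1": ("Ann", 34), "oid3": ("Eve", 51)}
```

Preserving the oid in the class case is the key difference from plain relational selection: the result is still a collection of identified objects.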
The PROJECT extends the relational operation in the sense that, instead of a subset of attributes, one has to specify the supertype of the operand type that contains the required information. Note that all the nested levels of the data structure are kept in the result. Formally:
- If ℛ is a relation of type τ and τ ⪯ τ′, then ℛ′ ← PROJECT(ℛ; τ′) is a relation of type τ′.
- If C is a class of type ω = [C, τ] and τ ⪯ τ′, then C′ ← PROJECT(C; τ′) is a class of type ω′ = [C′, τ′].
The projection includes the duplicate elimination. This can be effective only when the operand is a relation.
The JOIN is defined in such a way that, regardless of the type of the operands, which can be classes or relations in any combination, the result is a binary relation, i.e. one further level is added to the structure of the type. Formally, if X_1 and X_2 are classes or relations of types θ_1 and θ_2, and f is a condition involving components from types θ_1 and θ_2, then ℛ′ ← JOIN(X_1, X_2; f) is a relation of type τ′ = (A_1 : θ_1, A_2 : θ_2).
With reference to the schema in Fig. 1 the following are sample queries:

Homonyms ← SELECT(Family; Family.father.name.SET(givennames) ∩ Family.FLAT(SET(children.name.SET(givennames))) ≠ ∅)

Homonyms contains all the families in which at least one child shares a name with the father.

F_names ← PROJECT(Family; (father : (name : (givennames))))

F_names contains the givennames of the fathers in Family. Note that, as the type of the result is a supertype of the type of the operand, it maintains the complete nested structure around the projected attributes. To flatten the structure, one should use the structure operators introduced in the next section.

Artists ← JOIN(Family, Student; (Family.SET(children) ∋ Student) ∧ (Student.curriculum.exams.course ∋ 'fine arts'))
Artists is a binary relation of type (Family : τ_fam, Student : ω_stud) and contains all the families with a child who passed the exam of fine arts.
Additional operators are provided for set operations. These are straightforward extensions of the corresponding relational operators UNION, INTERSECT, DIFFERENCE.

6 Structure and Conversion Operators
The need for these operators arises from the richer structure of the types in the object oriented data model. They are required in the algebra to attain the full capability of data restructuring. This includes the transformation of the
components in a structure, from values to objects, and vice versa. Some of these operators have been introduced in algebras for the nested relation model [26, 10]. The structure operators apply only to relations, and produce a relation as a result. They allow modifying the structure by means of actions such as gathering several attributes in a tuple and multiple values of an attribute in a set. More specifically, NEST is used to collect in a set the group of distinct values of an attribute of a tuple that share the same value of the rest of the tuple. Formally, if ℛ is a relation of type τ = (A_1 : θ_1, ..., A_k : θ_k) then ℛ′ ← NEST(ℛ; A_k) is a relation of type τ′ = (A_1 : θ_1, ..., A_k : {θ_k}). UNNEST is the reverse operation, and generates a separate tuple in the result relation for every distinct value in an attribute of type set. Formally, if ℛ is a relation of type τ = (A_1 : θ_1, ..., A_k : {θ_k}) then ℛ′ ← UNNEST(ℛ; A_k) is a relation of type τ′ = (A_1 : θ_1, ..., A_k : θ_k). With the same definition the UNNEST can be applied also to sequences. For example, the following steps compute a binary relation that associates to each father the set of his wives:

Couples ← PROJECT(Family; (father, mother))
Polygamy ← NEST(Couples; mother)

On the other hand, the following steps compute from Family the relation Parent that contains the couples (parent, child):

U_Family ← UNNEST(Family; children)
Parent ← UNION(PROJECT(U_Family; (father : Person, children : Person)),
               PROJECT(U_Family; (mother : Person, children : Person)))
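NEST and UNNEST can be sketched as follows, with tuples encoded as sorted (attribute, value) pairs so that relations stay hashable. This is a simplified model of our own, not the paper's formalism.

```python
def nest(rel, attr):
    """Collect in a set the values of `attr` for tuples agreeing elsewhere."""
    groups = {}
    for t in rel:
        key = tuple(p for p in t if p[0] != attr)      # rest of the tuple
        groups.setdefault(key, set()).add(dict(t)[attr])
    return frozenset(tuple(sorted(key + ((attr, frozenset(vals)),)))
                     for key, vals in groups.items())

def unnest(rel, attr):
    """One output tuple per element of the set-valued attribute `attr`."""
    out = set()
    for t in rel:
        d = dict(t)
        for v in d[attr]:
            out.add(tuple(sorted({**d, attr: v}.items())))
    return frozenset(out)

parent = frozenset({(("child", "a"), ("father", "f")),
                    (("child", "b"), ("father", "f"))})
nested = nest(parent, "child")
# nested has one tuple: (("child", frozenset({"a", "b"})), ("father", "f"))
```

On this example the two operators are inverse to each other, which is the sense in which UNNEST is "the reverse operation".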
In addition to NEST and UNNEST two more operators are provided: CLUSTER, which groups a subset of attributes in a tuple to generate a subtuple in a single attribute, and MELT, which does the reverse operation, i.e. flattens a tuple structure with a nested tuple attribute.
Conversion operators basically allow converting values into objects and vice versa. Depending on whether the whole structure or part of it is to be converted, different operators are required:
- CLASS: transforms the elements of a relation into objects, thus defining a new class and the corresponding object type.
- DECLASS: transforms the objects in a class into structured values, by depriving them of the object identity, and thus eliminating duplicate values from the result.
- OBJECT: converts a value component inside a tuple structure into a component of object type, thus defining a new class.
- VALUE: converts an object component inside a tuple structure into the corresponding value.
Note that the execution of the conversion operators may require the invention of object identifiers. For further details and examples one should refer to [23].
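The CLASS and DECLASS pair can be sketched as follows; `itertools.count` stands in for the oid-invention mechanism, whose concrete form the paper leaves abstract.

```python
import itertools

_oids = itertools.count(1)

def make_class(relation):
    """CLASS: every tuple of the relation becomes an object with a fresh oid."""
    return {f"oid{next(_oids)}": t for t in relation}

def declass(cls):
    """DECLASS: strip the identity; duplicates collapse in the value set."""
    return frozenset(cls.values())

students = make_class(frozenset({("Ann", 23), ("Bob", 25)}))

# Two distinct objects may carry equal values; DECLASS merges them:
twins = {"oidA": ("Eve", 30), "oidB": ("Eve", 30)}
# declass(twins) == frozenset({("Eve", 30)})
```

The asymmetry is deliberate: CLASS invents identity (and is therefore not a pure function of its input), while DECLASS forgets it and may shrink the collection.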
7 Conclusions
In this paper we present LOA, an algebra for the manipulation of complex objects. The algebra refers to a data model (LOGIDATA+) dealing with structured data types and two different kinds of data collections: classes, with object identity enforcement, and value-based relations. Moreover, due to the inclusion of a FIXPOINT operator, LOA makes it possible to express recursive programs. With respect to the basic relational primitives, more powerful conditions are provided, based on a set of navigational operators that allow moving through complex data structures; these are to be used in conjunction with the classical operators of projection, selection and join. Additional new operators are defined to manipulate the structure of data, and to attain the full capability of data restructuring. The algebra has been developed within the project LOGIDATA+ as an internal language in the prototype object-oriented system. The algebraic set-oriented approach proves to be effective, especially when moving traditional database applications to the object-oriented model, where transactions typically operate on large sets of objects. In the prototype system, user programs are compiled into LOA programs which are interpreted by an underlying object manager. Details of the implementation of the object manager are described in [25]. Although the algebra was originally conceived as an internal language for the prototype, it also proves quite effective in expressing complex queries. It may then form the basis for the definition of a high level user oriented query and manipulation language.
References

1. S. Abiteboul and C. Beeri. On the Power of Languages for the Manipulation of Complex Objects. Technical Report 846, INRIA, 1988.
2. S. Abiteboul and N. Bidoit. Non first normal form relations: an algebra allowing data restructuring. Journal of Comp. and System Sc., 33 (1), 361-393, 1986.
3. S. Abiteboul and S. Grumbach. COL: a logic-based language for complex objects. Proceedings International Conference on Extending Database Technology, Venezia, Lecture Notes in Computer Science, 303, 271-293, Springer-Verlag, 1988.
4. S. Abiteboul and P. Kanellakis. Object identity as a query language primitive. Proceedings ACM SIGMOD International Conf. on Management of Data, 159-173, 1989.
5. R. Agrawal. Alpha: An Extension of Relational Algebra to Express a Class of Recursive Queries. IEEE Transactions on Software Engineering, 14 (7), July 1988.
6. A. V. Aho and J. D. Ullman. Universality of Data Retrieval Languages. Proceedings ACM Conference on Programming Languages, 110-120, 1979.
7. P. Atzeni and L. Tanca. The LOGIDATA+ Rule Language. Proceedings Workshop "Information Systems '90", Kiev, 1990.
8. F. Bancilhon and S. Khoshafian. A calculus for complex objects. Journal of Comp. and System Sc., 38 (3), 326-340, 1989.
9. F. Cacace, S. Ceri, S. Crespi-Reghizzi, L. Tanca, and R. Zicari. Integrating Object-Oriented Data Modeling with a Rule-Based Programming Paradigm. Proceedings ACM SIGMOD International Conf. on Management of Data, 225-236, 1990.
10. S. Ceri, S. Crespi-Reghizzi, G. Lamperti, L. A. Lavazza, and R. Zicari. ALGRES: an advanced database system for complex applications. IEEE Software, 7 (4), July 1990.
11. S. Ceri, G. Gottlob, and L. Tanca. Logic Programming and Databases. Springer-Verlag, 1990.
12. A. K. Chandra. Theory of Database Queries. Proceedings ACM Symposium on Principles of Database Systems, 1-9, 1988.
13. L. S. Colby. A Recursive Algebra and Query Optimization for Nested Relations. Proceedings ACM SIGMOD International Conf. on Management of Data, 273-283, 1989.
14. E. F. Codd. A relational model for large shared data banks. Communications of the ACM, 13 (6), 377-387, 1970.
15. P. C. Fischer and S. J. Thomas. Operators for non-first-normal-form relations. Proceedings IEEE Computer Software Applications, 464-475, 1983.
16. Y. Gurevich and S. Shelah. Fixed-point extensions of first-order logic. Annals of Pure and Applied Logic, 32, North-Holland, 1986, 265-280. Also in Proceedings Int. Conf. on Foundations of Computer Science, 346-353, 1985.
17. R. Hull and J. Su. On Accessing Object-Oriented Databases: Expressive Power, Complexity, and Restrictions. Proceedings ACM SIGACT-SIGMOD Symposium on Principles of Database Systems, 147-158, 1989.
18. G. Jaeschke and H.-J. Schek. Remarks on the algebra for non first normal form relations. Proceedings ACM SIGACT-SIGMOD Symposium on Principles of Database Systems, 124-138, 1982.
19. S. Khoshafian and G. Copeland. Object identity. Proceedings ACM Symposium on Object Oriented Programming Systems, Languages and Applications, 1986.
20. G. M. Kuper. The Logical Data Model: A New Approach to Database Logic. PhD thesis, Stanford University, 1985.
21. G. M. Kuper and M. Y. Vardi. A New Approach to Database Logic. Proceedings Third ACM SIGACT-SIGMOD Symposium on Principles of Database Systems, 1984.
22. G. M. Kuper and M. Y. Vardi. On the complexity of queries in the logical data model. Proceedings International Conference on Data Base Theory, 267-280, 1988.
23. U. Nanni, S. Salza, and M. Terranova. LOA: the LOGIDATA+ Object Algebra. Technical Report 5/23, Progetto Finalizzato Sistemi Informatici e Calcolo Parallelo, 1990.
24. U. Nanni, S. Salza, and M. Terranova. An Algebraic Approach to the Manipulation of Complex Objects. Proceedings Hawaii International Conference on System Sciences, Kaloa, Hawaii, January 7-10, 1992.
25. U. Nanni, S. Salza, and M. Terranova. The LOGIDATA+ Prototype System. In this volume.
26. M. A. Roth, H. F. Korth, and A. Silberschatz. Extended Algebra and Calculus for Nested Relational Databases. ACM Trans. on Database Syst., 13 (4), 389-417, December 1988.
27. H.-J. Schek and M. H. Scholl. The relational model with relation-valued attributes. Information Systems, 1986.
28. M. H. Scholl and H.-J. Schek. A relational object model. Proceedings International Conference on Data Base Theory, Paris, Lecture Notes in Computer Science, 470, 89-105, 1990.
29. J. D. Ullman. Database and Knowledge-Base Systems (vol. I). Computer Science Press, 1988.
The LOGIDATA+ Prototype System

Umberto Nanni 1, Silvio Salza 2,3, Mario Terranova 3

1 Università de L'Aquila, Dipartimento di Matematica Pura ed Applicata, Coppito, L'Aquila, Italy
2 Università di Roma "La Sapienza", Dipartimento di Informatica e Sistemistica, via Salaria 113, I-00198 Roma, Italy
3 Consiglio Nazionale delle Ricerche, Istituto di Analisi dei Sistemi ed Informatica, viale Manzoni 30, I-00185 Roma, Italy

Abstract. In this paper we present a prototype system based on the LOGIDATA+ model, hence supporting a rule-based language on a data model with structured data types, object identity and sharing. The system has an interactive user interface, whose unit of interaction is a LOGIDATA+ program, which can extract information from the knowledge base and/or modify the schema. A program consists of a set of rules written in the LOGIDATA+ language, and of additional directives to handle the data output and/or the updates to the schema. The intermediate results and the updates to the schema affect a temporary working environment connected to the working session, but can also be saved in a (user or global) permanent environment. The system uses LOA (LOGIDATA+ Object Algebra) as an intermediate internal language. User programs are compiled, with a set of transformations that includes rewriting and stratification, and then translated into LOA programs, i.e. sequences of fixpoint systems of algebraic equations. The object-oriented schema is mapped into a relational schema, and the database is actually managed by a relational DBMS, which provides the basic support for the permanent storage of data as well as for concurrency control and recovery. The object algebra expressions can then be mapped into relational algebra expressions, thus relying on the efficiency of the RDBMS for the access to mass storage structures and the efficient execution of set-oriented operations.
Moreover a main memory database has been included in the architecture, to improve the performance in the evaluation of the fixpoint systems, by keeping in main memory the intermediate results.
1 Introduction

This paper presents the architecture of the LOGIDATA+ prototype system, an experimental object-oriented database management system developed within the LOGIDATA+ project. The primary goal of the prototype was to implement the main ideas in the data model [3] and the language, but a good deal of effort has been devoted to studying several important problems connected to the efficient implementation of object-oriented systems, and to analyzing the various solutions in terms of cost and performance. This has led to a modular architecture, in the sense that each module is connected to a different topic that had been the object of research activity during
the project. The first issue is the persistent storage of complex objects, which has been studied in terms of access cost and processing cost of transactions. Other interesting problems are connected to the language interface, like the rewriting of rule-based programs in a context with structured data types, and the dynamic management of temporary environments. Finally, a crucial issue is how to efficiently perform the computation of the least fixed point on large mass memory sets of data.

A main implementation choice has been to utilize an existing commercial relational DBMS to implement the persistent storage of the objects, instead of building the prototype on a file system with variable length records, as has been done for example in O2 [5]. Our choice has several motivations. First, of course, feasibility and ease of implementation. But, besides this, it can be claimed that such a choice is rather reasonable also in terms of performance, at least for a large class of end user applications. These are all those applications that do not require as a basic operation the instantiation of complex objects: for instance, most business applications that already perform reasonably well on relational systems, but that could take great advantage in being designed and maintained in an object-oriented framework.

The architecture of the prototype system is presented in Figure 1. The diagram shows several layers in the architecture, which correspond to the various phases of the transformation and of the evaluation of LOGIDATA+ programs. The first layer corresponds to the language interface. The input language is the LOGIDATA+ language [4], and the output language is LOA (LOGIDATA+ Object Algebra), an algebra for complex objects developed within the project [17]. The programs are first transformed by using rewriting techniques, then stratified and translated into a sequence of blocks, each formed by a system of LOA equations.
Evaluating a program requires computing the least fixed point of these blocks. The second layer is composed of the schema manager and the relational mapper. The first module generates and manages the mapping between the object oriented schema and the relational schema, and the relational mapper translates the LOA equations into relational algebra according to that mapping. This requires a first level of optimization. The last layer corresponds to the fixpoint evaluation of the systems of relational algebraic equations. This is performed by the procedural code generator, which produces a program written in a special internal code, and by the fixed point evaluator, which finally executes the procedural program. The procedural code has primitives to perform relational operations both on relations stored in the Mass Storage DataBase (MSDB) managed by the RDBMS, and on temporary structures stored in the Main Memory DataBase (MMDB). This makes it possible to carry on the fixed point computation efficiently by maintaining the intermediate results in main memory, and requires a further level of optimization, to find the tradeoff between fast computation in main memory and moving data back and forth between the MSDB and the MMDB.
[Figure 1 here: a layered diagram with the language interface at the top, the schema manager and relational mapper in the middle, and the RDBMS with the Main Memory Database at the bottom.]
Fig. 1. The LOGIDATA+ Prototype Architecture
The paper is organized as follows. In the next section we present the language interface, and discuss the process of rewriting the rule-based programs and translating them into the object algebra. In Section 3 we show how to map the object schema into a relational schema. Sections 4 and 5 present the lower part of the architecture, and discuss how to optimize the least fixed point computation. Finally, Section 6 deals with the translation and optimization of an important class of object oriented queries.
2 The Language Interface
This section is devoted to the description of the language interface of our prototype, which provides an interactive environment where the user can execute a sequence of LOGIDATA+ programs. Each program may simply consist of a query to be answered, or may have side effects on the schema or the data, or, in general, it may consist of any combination of these basic functions. Side effects affect the behavior of the system either in the current working session or permanently, as required by the user, who is given explicit control over the evolution of the working environment, that is, the collection of intensional and extensional data which are considered in the evaluation of the programs in the current working session. A remarkable example of explicit control over the evolution of data and/or metadata is shown in [7]. In the current state of the implementation, a program may include negation in the body, with the additional constraint of being stratified. When intensional predicates are to be instantiated, a stratified semantics is used [10]. This might later be generalized to an inflationary semantics [11], which can be applied to a wider class of programs and is strictly more expressive [2, 9]. The choice of an inflationary semantics received further legitimation in [13]. Nevertheless, when dealing with a complex data model (based on object identity), this approach does not guarantee that the evaluation of the program terminates, as shown in [1]. The unit of interaction with the object base is a LOGIDATA program, consisting of a set of rules plus a set of (possibly implicit) directives to direct the output and to handle the evolution of the environment. These include the specification of which of the new classes or relationships defined within the program have to be instantiated, or which intensional or extensional data are to be made permanent, thus affecting what is called the permanent environment.
More precisely, in the middle of a working session, both the LOGIDATA schema and database can be thought of as partitioned into a permanent environment and a working (temporary) one. The latter in general contains references to the permanent environment and is affected by the side effects of the programs. The user can make permanent a consistent set of data and/or metadata, with the constraint that no permanent data/metadata can refer to what is contained in the working environment. What is initially a set of rules in a program is transformed, through the evaluation process, into a set of LOA equations that eventually will be interpreted
as a sequence of assignment statements. In any case, any equation (or rule) has a left-hand side (head), which is the name of a predicate to be defined, and a right-hand side (body), which will be successively transformed until it becomes a LOA expression. Of course, several rules with the same head will produce a single assignment statement, as shown later. A user program may refer to extensional predicates, corresponding to classes or relationships having associated collections of data in the extensional database, and to intensional predicates, which are defined by means of rules. The definition of an intensional predicate may appear either within the program itself, or in the schema (in either the working or the permanent environment). The latter are called views, by analogy with standard database terminology. An additional constraint, analogous to the one common in the relational model, is the safety of the rules. A variable x which appears as an argument of a built-in predicate (such as an arithmetic comparison) must have a bounded domain. If x appears in the same rule as an argument of a non-negated intensional or extensional predicate, it has a bounded domain. Otherwise, the rule must contain a chain of equality predicates imposing that x is equal to another variable y having a bounded domain.
The main modules in the language interface are the following:
- the LOGIDATA compiler takes user programs as input, applies possible rewriting techniques, and generates a set of rectified rules;
- the separator identifies the parts of the program concerning the addressing of output and the management of the object schema, excluding them from being processed together with the declarative part of the program;
- the view handler identifies the predicates used in the program corresponding to views in the schema, and includes the rules defining them inside the program itself: the resulting program refers only to predicates defined therein or to extensional predicates;
- the stratifier/sequencer finds the stratification of the program according to standard techniques and identifies the inherent sequential structure of the program (if any), providing a fragmentation of the program into a sequence of
blocks. The external processing of user programs in our prototype is now described in more detail. The compiler, which is not extensively described in this paper, performs a first scanning of the program and applies rewriting techniques to the source LOGIDATA+ program. Furthermore, in this first phase the rules are rectified [20], in the sense that a renaming of variables, with possible substitution of constants, is performed within each rule, so that all rules with the same head have exactly the same sequence of variables as arguments. The separator module can be considered part of the compiler, and simply separates the part of the program consisting of primitives to direct output and/or handle the environment. The view handler is in charge of including in the program the rules defining the predicates corresponding to the view-predicates. One of the functions of the
schema handler is in fact to store such intensional definitions in "compiled" form. Of course this mechanism of describing views by means of rules (with possible negation) is strictly more powerful than a purely algebraic description (i.e., one not using operators to handle recursion). The stratifier/sequencer, as remarked before, operates in a standard fashion (as described, for example, in [20, 8]). In particular, it builds the dependency graph which will be used to find the stratification of the negation, thus giving a partial ordering among the rules according to the strata. The sequencer actually identifies the strongly connected components of the dependency graph, finding the inherent sequential structure of the program. In general, an overall sequential structure arises when there are portions of the program which are not mutually recursive. As an example, if a program P uses a view-predicate V, in the dependency graph the connected component containing the node V (included by the view handler) has no edge coming from nodes corresponding to intensional predicates defined within the program (note that the program cannot redefine a predicate which the view V is defined on). This means that the predicates corresponding to the nodes in that connected component might be instantiated before the rest of the program. A simple syntactic transformation of the rules converts them into algebraic form, and furthermore all the rules with the same head are grouped to give rise to a single LOA equation. Any LOA equation has the form V = E, where V is the name of the variable/predicate being defined, and E is a LOA expression (whose evaluation will be described in the following sections). Moreover, V corresponds to a node in the dependency graph, namely to the node with the same name. The other nodes in the dependency graph, by virtue of the view handler, can only be nodes corresponding to extensional predicates.
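The sequencing step described above can be sketched as follows; `sequence_blocks` and the rule format are illustrative names of ours, not the prototype's actual interface. The sketch builds the dependency graph from (head, body) pairs and computes its strongly connected components with Tarjan's algorithm, whose emission order already places every component after the components it depends on:

```python
from collections import defaultdict

def sequence_blocks(rules):
    """Group mutually recursive predicates into blocks (the strongly
    connected components of the dependency graph) and return them in an
    order where no block refers to predicates defined in later blocks."""
    deps = defaultdict(set)            # head -> predicates its bodies use
    heads = set()
    for head, body in rules:
        heads.add(head)
        deps[head].update(body)

    index, low, on_stack, stack = {}, {}, set(), []
    blocks, counter = [], [0]

    def visit(v):                      # Tarjan's SCC algorithm
        index[v] = low[v] = counter[0]
        counter[0] += 1
        stack.append(v)
        on_stack.add(v)
        for w in deps[v]:
            if w not in heads:         # extensional predicate: not a node
                continue
            if w not in index:
                visit(w)
                low[v] = min(low[v], low[w])
            elif w in on_stack:
                low[v] = min(low[v], index[w])
        if low[v] == index[v]:         # v is the root of a component
            comp = set()
            while True:
                w = stack.pop()
                on_stack.discard(w)
                comp.add(w)
                if w == v:
                    break
            blocks.append(comp)        # dependencies are emitted first

    for h in sorted(heads):
        if h not in index:
            visit(h)
    return blocks

# "path" is recursive (a fixpoint block); "ans" depends on it.
rules = [("path", ["edge"]),
         ("path", ["path", "edge"]),
         ("ans",  ["path"])]
print(sequence_blocks(rules))          # [{'path'}, {'ans'}]
```

Each returned component becomes one block; components that are singletons without self-dependency correspond to single LOA equations, the others to fixpoint blocks.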
The stratifier/sequencer is in charge of fragmenting the program into a sequence of blocks, each of them corresponding to a strongly connected component of the dependency graph. Each block consists of either a single LOA equation or a fixpoint block. Any fixpoint block will be interpreted as a FIXPOINT operator applied to the set of equations it includes. The total ordering of the blocks in the resulting sequence must fulfill the constraint that no equation is allowed to refer to variables/predicates which are defined in the following blocks. Note that if the program is stratified, a negated arc cannot occur within a strongly connected component of the dependency graph. This means that if a fixpoint block includes rules with negation in the body, the negated predicates either are defined in a previous block, or are extensional. The resulting sequence of blocks is what we call a LOA program. The language interface also manages the working and permanent environments as described above, allowing a reasonable context for working interactively. In more detail, the schema manager is able to support a hierarchy of environments, according to the following behavior, which resembles the behavior of interactive systems in very different contexts. The LOGIDATA schema (and database) has to be considered partitioned into environments. An environment A
is based on the environment B if the (meta)data contained in A are allowed to refer to data in B. The constraint is that this relationship defines a partial order among the environments. As an example, suppose that G is a single global environment which is permanent. Then several environments U1, U2, ..., UN based on G can be defined. Each Ui may correspond to a user, or a group of users, and contains the customized permanent data, such as views. Any time an interactive user logs into the system, a new empty working environment is built. In this manner the side effects of user programs affect only the working environment, and are reflected in the subsequent programs only within the current working session. At any time the user may ask to make permanent a consistent portion of the working environment. This is allowed by the schema manager only if the portion of the environment to be transferred does not contain references to the working environment. Given the definition of a LOGIDATA schema, it is not difficult to implement primitives which, for example, allow the user to require that a view V defined during the current working session become permanent, together with anything else in the working environment which V is based on. Of course this management is effective also in the case that many users are simultaneously using the system. The support of a DBMS with built-in locking features on data will be useful in the implementation of these features in the prototype. The hierarchy of environments is only visible to the schema manager, which provides the mapping from any variable/predicate referred to in a particular environment to a unique collection of data (class or relation), or to a view within the schema.
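The make-permanent consistency check described above can be sketched as follows; `Environment` and `make_permanent` are hypothetical names, not the schema manager's API. A definition can be moved out of the working environment only if nothing it references stays behind:

```python
class Environment:
    """An environment holds named definitions; each definition may reference
    names resolved in this environment or in the one it is based on."""
    def __init__(self, name, based_on=None):
        self.name, self.based_on = name, based_on
        self.defs = {}                      # name -> set of referenced names

    def define(self, name, references):
        self.defs[name] = set(references)

    def resolves(self, name):
        """Walk the based-on chain to find the environment defining `name`."""
        env = self
        while env is not None:
            if name in env.defs:
                return env
            env = env.based_on
        return None

def make_permanent(working, target, names):
    """Move `names` from the working environment into `target`, but only if,
    after the move, none of them still refers to something left behind in
    the working environment (the schema manager's consistency constraint)."""
    moved = set(names)
    for n in moved:
        for ref in working.defs[n]:
            owner = working.resolves(ref)
            if owner is working and ref not in moved:
                raise ValueError(f"{n} refers to {ref}, still in working env")
    for n in moved:
        target.defs[n] = working.defs.pop(n)

g = Environment("G")                    # permanent global environment
g.define("Employee", [])
w = Environment("W", based_on=g)        # working environment of one session
w.define("Managers", ["Employee"])      # view over permanent data: movable
w.define("Temp", ["Managers"])          # refers to working data: not movable alone
make_permanent(w, g, ["Managers"])
```

Trying to move "Temp" before "Managers" would raise an error; after "Managers" has been made permanent, "Temp" resolves its reference in G and becomes movable too.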
3 Mapping Objects into Relations
For the sake of simplicity, in this and the following sections we shall refer to a data model which is a simplified version of the LOGIDATA+ model, with tuple and set constructors and object identity. The model actually contains the relevant aspects of most object oriented models, therefore the results we present in the paper can be easily extended to a more general framework. More formally, given a finite set of domains D1, ..., Dd with domain names D1, ..., Dd, a countable domain of object identifiers Ω, and a countable set of attribute names A1, A2, ..., we refer to an object schema O composed of the following elements:
- A finite set of type names, or shortly types, θ1, ..., θo, and the corresponding type definitions.
- A finite set of classes C1, ..., Cc with names C1, ..., Cc.
The domain Ω contains the object identifiers associated to the objects, which are unique in the whole database. The classes are collections of objects of the same type. Type definitions make it possible to build structured types from the basic types associated to the domains and, for object types, to add the identity. A value set, i.e., the set of all possible values, is associated to each type:
- A type θ is either a value type τ or an object type ω.
- Each domain name Di is a value type (called base type), and the corresponding value set is Di.
- If θ1, ..., θn are types with value sets V1, ..., Vn, then τ = (A1 : θ1, ..., An : θn) defines a tuple type τ, with value set Vτ = V1 × ... × Vn. Round brackets denote the tuple constructor.
- If θ is a tuple type and V is the corresponding value set, then τ = {θ} defines a set type τ with value set Vτ = PART(V), i.e., the powerset of V. Curly brackets denote the set constructor.
- If τ is a tuple value type with value set V, and C is a class name, then ω = [C, τ] is an object type, and the corresponding value set is Vω = Ω × V.
Note that we require a tuple constructor inside each set constructor. This, as we shall see later, gives a more direct representation of the schema in the relational model. For similar reasons we assume that tuple constructors cannot be directly nested. Neither assumption produces any loss of generality. According to our definitions, each class corresponds to an object type. Furthermore, if ω1 and ω2 are object types, and ω2 appears in the definition of ω1, we say that ω2 is part-of ω1. The structure of the objects in the schema can be effectively represented by the object graph. The graph contains a node for every type in the schema that has a tuple constructor at the outermost level. The nodes corresponding to object types are called class nodes. The nodes are connected by two different types of arcs: the set links and the oid links. The former represent a set constructor, and the latter a part-of relationship. The definition of a sample object schema and the corresponding object graph are presented in Figures 2 and 3. In the figure, solid lines represent set links and dashed lines represent oid links. Note that the graph only represents the nested structure of the objects and the part-of relations between the classes; attributes of base type are not explicitly represented. The graph may be cyclic if recursive type definitions are allowed. We now address the problem of mapping the object oriented schema into a relational schema. This is obtained through a transformation ΘC, which generates a normalized relational schema SC that we call the canonical schema, defined as follows:
- There is a relation Ri in SC for each node ni in the object graph. If (a^i_1, a^i_2, ..., a^i_k) is the tuple constructor corresponding to the node, then the relation Ri has schema (b^i_0, b^i_1, ..., b^i_k). Attributes in the canonical schema are either of base type, corresponding to base types in the object schema, or of oid type, or of link type.
- If a^i_j is an attribute of base type, the corresponding attribute b^i_j in Ri has the same type.
- If ni is a class node, then Ri is called a c-relation, and b^i_0 has oid type (oid attribute) and is a key of the relation.
Domains (base types): Char; String; Integer.

Value types:
τstud = (name : String, code : Integer, curriculum : {τcurr}, attends : {τatt});
τteacher = (name : String, teaches : {τteaches}, history : {τhis});
τclasses = (course : String, room : String, time : Integer);
τcurr = (course : String, test : {τtest});
τtest = (year : Integer, grade : Char);
τatt = (schedule : ωclasses);
τteaches = (schedule : ωclasses);
τhis = (course : String, year : Integer).

Object types: ωstud = [Student, τstud]; ωteacher = [Teacher, τteacher]; ωclasses = [Classes, τclasses].

Classes: Student; Teacher; Classes.

Fig. 2. A sample object schema
[Figure 3 here: the object graph for the schema of Figure 2; solid lines represent set links, dashed lines represent oid links.]
Fig. 3. The object graph
- If ni is a non-class node, reached through a set link from attribute a^j of node nj, then the corresponding relation Ri is called an s-relation, and both the corresponding attribute b^j in Rj and b^i_0 are of link type (link attributes).
- If a^i_j is an attribute of object type, then b^i_j has oid type.
The link attributes are introduced to represent the unnesting of set constructors. Similarly, oid attributes are used to implement the object identity and to represent the part-of relationship. According to these definitions we may map the Object Oriented DataBase (OODB) into a pure relational canonical database, in which every c-relation contains a tuple for every object in the corresponding class, and every s-relation contains a tuple for every element of every set. The canonical schema for the OODB of Figure 2 is reported in Figure 4.
STUDENT(#student: Oid, name: String, code: Integer, @curriculum: Link, @attends: Link);
TEACHER(#teacher: Oid, name: String, @teaches: Link, @history: Link);
CLASSES(#classes: Oid, course: String, room: String, time: Integer);
CURRICULUM(@curriculum: Link, code: Integer, @test: Link);
TEST(@test: Link, year: Integer, grade: Char);
ATTENDS(@attends: Link, schedule: Oid-CLASSES);
TEACHES(@teaches: Link, schedule: Oid-CLASSES);
HISTORY(@history: Link, course: String, year: Integer)
Fig. 4. The canonical schema
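The canonical mapping can be illustrated with a small sketch; the function names are ours, and part-of subobjects are simplified to pre-existing oid values. Each set-valued attribute is unnested into an s-relation through a shared link value, while base-typed attributes stay in the c-relation tuple:

```python
import itertools

_counter = itertools.count(1)

def new_id(prefix):
    """Generate a fresh surrogate for an oid (#) or a link (@) value."""
    return f"{prefix}{next(_counter)}"

def store_object(cls, value, db):
    """Insert one object of class `cls` into the canonical database `db`
    (a dict mapping relation names to lists of tuples). Set-valued
    attributes are unnested into s-relations through a shared link."""
    oid = new_id("#")
    row = {f"#{cls.lower()}": oid}     # the oid attribute, key of the c-relation
    for attr, v in value.items():
        if isinstance(v, list):        # set constructor -> s-relation
            link = new_id("@")
            row[f"@{attr}"] = link
            for elem in v:
                db.setdefault(attr.upper(), []).append({f"@{attr}": link, **elem})
        else:
            row[attr] = v              # base-typed or oid-typed attribute
    db.setdefault(cls.upper(), []).append(row)
    return oid

db = {}
store_object("Student",
             {"name": "Ada", "code": 42,
              "attends": [{"schedule": "#classes1"}]},   # oid of a subobject
             db)
```

After the call, the STUDENT c-relation and the ATTENDS s-relation share the same link value in their @attends attributes, mirroring the schema of Figure 4.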
Note that all the information contained in the OODB is preserved in the canonical schema, and that a reverse transformation ΘC^-1 can be defined from SC to O. Once we have defined the mapping ΘC between the object schema and the canonical schema, we may consider the problem of evaluating a query consisting of an expression Σ of the object algebra. More precisely, computing Σ transforms the object schema O into a new schema O'. We then say that a set of relational expressions (E1, E2, ..., Ek) is equivalent to Σ according to the mapping ΘC if the canonical schema SC is transformed into a new schema S'C such that S'C corresponds to O' in the mapping ΘC. It can easily be shown that E1, E2, ..., Ek can be computed for any Σ, and are expressions of the relational algebra, with the only addition of oid-invention if restructuring primitives are included in the object algebra, as proposed in [16]. The result of the original query can be obtained by applying the inverse transformation ΘC^-1. Therefore the set of fixpoint algebraic equations generated by the language interface can be computed on the canonical database, through the repeated
evaluation of relational expressions. We address the problem in more detail in the following section.
4 The Procedural Code and the Fixed Point Evaluator
As we have shown in the previous sections, LOGIDATA+ programs are first translated into fixpoint blocks of object algebra equations, and then transformed, according to the canonical mapping, into fixpoint systems of relational algebraic equations. To perform the evaluation, the Code Generator generates a procedural program, written in an internal code, that is later executed by the Fixed Point Evaluator. Non-recursive equations are directly translated into relational queries, and the computation is performed by the mass memory RDBMS; we will discuss how to optimize this step in Section 6. In this section we address the problem of the efficient evaluation of sets of mutually recursive equations. The least fixed point is computed with the semi-naive algorithm [6]. This algorithm iteratively solves a system of algebraic equations. For every unknown relation in the system, two relations are maintained during the computation. The first one, called the differential relation, contains the tuples produced during the current iteration, and the second one, the integral relation, accumulates all the tuples produced by the previous iterations. Only the new tuples are used in the next iteration. The computation terminates when no new tuples have been produced during the last iteration. More specifically, the system of equations generated by the Relational Mapper has the form:
Ri = E^0_i(B1, ..., Bk),    i = 1, ..., m    (1)

Ri = Ei(R1, ..., Rm, B1, ..., Bk),    i = 1, ..., m    (2)
where the Ri are the unknown relations to be computed, B1, ..., Bk are the base relations of the canonical database involved in the computation, and the first set of equations defines the base values for the Ri. The Ei and E^0_i are expressions of the relational algebra. Referring to this system, the scheme of the semi-naive evaluation is given in Figure 5. The first two groups of steps (a1, ..., am and b1, ..., bm) are executed only once, to initialize the integral and differential relations; the remaining steps are iterated until the integral relations stabilize. More specifically, steps (c1, ..., cm) incrementally compute the new tuples, using the expressions ∂Ei, that are the derivatives of the Ei. The last two groups of steps (d1, ..., dm and e1, ..., em) perform a peculiar union-difference operation between the integral relations and the corresponding differential relations: the new tuples are added to the integral relations and the old tuples are deleted from the differential relations.
a1 : R1 ← ∅
...
am : Rm ← ∅
b1 : ∂R1 ← E^0_1(B1, ..., Bk)
...
bm : ∂Rm ← E^0_m(B1, ..., Bk)
c1 : ∂R1 ← ∂E1(R1, ..., Rm, ∂R1, ..., ∂Rm, B1, ..., Bk)
...
cm : ∂Rm ← ∂Em(R1, ..., Rm, ∂R1, ..., ∂Rm, B1, ..., Bk)
d1 : ∂R1 ← ∂R1 − (R1 ∩ ∂R1)
...
dm : ∂Rm ← ∂Rm − (Rm ∩ ∂Rm)
e1 : R1 ← R1 ∪ ∂R1
...
em : Rm ← Rm ∪ ∂Rm
Fig. 5. Semi-naive evaluation of a fixpoint block
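As a concrete illustration, the steps of Figure 5 specialize, for the single-equation example discussed later in this section, to the following sketch; we use the standard semi-naive formulation in which the base tuples also seed the integral relation before the first derivative step:

```python
def seminaive(P, const):
    """Semi-naive evaluation of the example block: collect the tuples of P
    reachable by repeatedly matching P.$2 against the first column of the
    newly derived tuples, starting from those with $2 = const."""
    dA = {t for t in P if t[1] == const}   # b1: differential <- base values
    A = set(dA)                            # integral seeded with the base
    while dA:                              # iterate until stabilization
        frontier = {x for (x, _) in dA}    # project the new $1 values
        derived = {t for t in P if t[1] in frontier}   # c1: semijoin with P
        dA = derived - A                   # d1: drop already-known tuples
        A |= dA                            # e1: accumulate into the integral
    return A

P = {("b", "a"), ("c", "b"), ("d", "c"), ("e", "x")}
print(sorted(seminaive(P, "a")))   # [('b', 'a'), ('c', 'b'), ('d', 'c')]
```

The `d1`/`e1` pair is exactly the union-difference performed in one step by the SWAP primitive described below.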
Hence the semi-naive evaluation requires maintaining several rapidly evolving intermediate relations, and the repeated execution of relational operations, some efficiently supported by the RDBMS (like Join and Select), and some not, like the union-difference performed at the end of each iteration. For these reasons the architecture of the prototype system includes the Main Memory Database (MMDB), and specific primitives have been defined in the instruction set of the procedural code. The MMDB provides for the storage of the intermediate relations and their access structures, and for the efficient execution in main memory of the relational operations. Different physical structures are provided for the integral and differential relations. Both are sorted, but the latter are managed as a simple list in main memory, while the former are organized in a paged structure that allows, if needed, a partial overflow to mass memory. As for the index structures in main memory, the problem has been thoroughly investigated in [14] and [15]. According to this analysis, and taking into account that in our application the relations are highly dynamic, we adopted the B*-tree, which makes a reasonable tradeoff between access cost and memory cost. The size of the blocks was chosen in order to allow, if needed, an easy overflow of the index to mass memory. When this happens, the blocks are
gradually paged out and managed with an LRU policy. The procedural code is generated in such a way that indices are always maintained on the join attributes. If there is an index on only one attribute (which is usually the case), the relations are sorted and the join is performed through a merge in linear time; otherwise a nested loop is used. More specifically, the procedural code provides three classes of primitives:
- The MMDB Primitives allow creating the temporary relations and the corresponding access structures, and executing some relational operations in main memory:
  - CREATE: creates a new relation with a given schema in the MMDB or in the Mass Storage Database (MSDB), and sets up indexes on the specified attributes (or groups of attributes). It has to be specified whether the relation is an integral or a differential one, since, as we will see later, different physical representations are used.
  - COPY: performs a copy of the specified relation. This is used to initialize the temporary relations.
  - PROJECT: performs a projection on a temporary relation. The result is stored in the MMDB too.
  - JOIN: performs a join. Both the operands and the result are in the MMDB.
  - SWAP: performs the union-difference operation needed in the semi-naive evaluation. The operands are an integral relation and the corresponding differential relation. The SWAP adds the new tuples to the integral relation, and deletes the old tuples from the differential relation.
- The Communication Primitives allow moving the relations between the MMDB and the MSDB, printing the result of a query, and requesting the RDBMS to execute a subquery:
  - LOAD: moves an extensional relation to the MMDB.
  - STORE: moves a temporary relation from the MMDB to the MSDB.
  - EXTRACT: requests the RDBMS to compute a SQL query and to move the result to the MMDB.
  - EXECUTE: requests the RDBMS to compute a SQL query, and to keep the result in mass memory for further computation.
- The Control Primitives allow expressing the iterative computation that, as we have seen earlier, is required by the semi-naive evaluation:
  - FIXPOINT: begins a block of procedural code that has to be iterated until some specified integral relations stabilize.
  - ENDFIX: marks the end of a fixpoint loop.
  - HALT: stops the computation.
As an example, let us consider the very simple system that defines the transitive closure of a relation P:

A($1, $2) = σ_{$2="a"} P($1, $2) .    (3)

A($1, $2) = A($1, $2) ∪ A($1, $2) ⋈_{$1=$2} P($1, $2) .    (4)
which corresponds to the semi-naive computation:

a1 : A($1, $2) ← ∅
b1 : ∂A($1, $2) ← σ_{$2="a"} P($1, $2)
c1 : ∂A($1, $2) ← ∂A($1, $2) ⋈_{$1=$2} P($1, $2)
d1 : ∂A($1, $2) ← ∂A($1, $2) − (A($1, $2) ∩ ∂A($1, $2))
e1 : A($1, $2) ← A($1, $2) ∪ ∂A($1, $2)

The corresponding procedural code is given in Figure 6. Note that the computation is partially performed in main memory. The interaction with the RDBMS is limited to lines 8 and 9. At every iteration the new tuples of the relation DELTA-A are stored in the MSDB (line 8), and then the RDBMS is requested to compute a semijoin and to move the result to the MMDB (line 9). At the end of the iteration the integral relation is updated, and only the new tuples are left in the differential relation (line 10).
01: CREATEMM A($1:CHAR(12),$2:CHAR(12)) INDEX ($1,$2)
02: CREATEMM DELTA-A($1:CHAR(12),$2:CHAR(12))
03: CREATEMM X1($1:CHAR(12))
04: CREATEMS X2($1:CHAR(12))
05: EXTRACT DELTA-A($1,$2) SQL(SELECT P.$1, P.$2 FROM P
                               WHERE P.$2 = "a")
06: FIXPOINT DELTA-A
07: PROJECT X1($1) FROM DELTA-A($1,$2)
08: STORE X1($1) INTO X2($1)
09: EXTRACT DELTA-A($1,$2) SQL(SELECT P.$1, P.$2 FROM P, X2
                               WHERE P.$2 = X2.$1)
10: SWAP DELTA-A, A
11: ENDFIX
Fig. 6. Procedural code for the semi-naive evaluation of Figure 5
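The linear-time sorted-merge join used by the MMDB when the join attributes are indexed (as described above) can be sketched as follows; `merge_join` is an illustrative name of ours, not a primitive of the procedural code:

```python
def merge_join(R, S, i, j):
    """Join R and S on R[i] == S[j] with a single merge pass; R must be
    sorted on column i and S on column j, as the MMDB keeps them."""
    out, a, b = [], 0, 0
    while a < len(R) and b < len(S):
        if R[a][i] < S[b][j]:
            a += 1
        elif R[a][i] > S[b][j]:
            b += 1
        else:
            key = R[a][i]
            # delimit the groups of tuples carrying the same key on each side
            a_end = a
            while a_end < len(R) and R[a_end][i] == key:
                a_end += 1
            b_end = b
            while b_end < len(S) and S[b_end][j] == key:
                b_end += 1
            for r in R[a:a_end]:           # emit the cross product of the
                for s in S[b:b_end]:       # two matching groups
                    out.append(r + s)
            a, b = a_end, b_end
    return out

R = [("a", 1), ("b", 2), ("b", 3)]   # sorted on column 0
S = [("b", "x"), ("c", "y")]         # sorted on column 0
print(merge_join(R, S, 0, 0))        # [('b', 2, 'b', 'x'), ('b', 3, 'b', 'x')]
```

When only one of the two relations is sorted on the join attribute, a nested loop is the fallback, as noted above.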
5 Optimizing the Fixed Point Computation
An optimization phase is provided to generate procedural code that exploits the different options available in the architecture to perform the computation either directly in main memory, or in mass storage through the RDBMS. As we
already pointed out, this is especially interesting in the semi-naive evaluation of recursive sets of equations. Basically there are three different options in performing the semi-naive evaluation:
- Mass Storage Evaluation. All the relational operations are performed on the MSDB through the EXECUTE command, and all the intermediate results are stored in the MSDB too.
- Tight Coupling. Most of the computation takes place in the MMDB. Repeated interaction with the RDBMS takes place at each iteration, when, using the STORE and EXTRACT commands, the differential relations are moved to the MSDB to perform a semijoin, and the result is brought back to the MMDB for the next iteration.
- Loose Coupling. All the extensional relations involved in the query are initially moved to the MMDB, where all the computation takes place.
The first option proves to be valuable only in extreme situations, when both the base relations and the intermediate results have a very large size; the RDBMS is especially designed to handle such cases efficiently. In other situations this choice is not reasonable, because of the overhead in the RDBMS and of the inefficiency of the union and difference operations, which instead are integrated and efficiently performed by the SWAP command in the MMDB. Normally the optimizer must choose between the remaining two options, i.e., tight and loose coupling, and decide which extensional relations are to be entirely loaded. This is done by estimating and comparing, on one side, the I/O cost of all the accesses to the mass storage relations needed to perform the semijoins when working in tight coupling mode, and on the other side, the cost of loading the whole relations. In the latter case all the blocks of the relation are accessed, but sequentially. The problem then becomes that of estimating the number of iterations in the semi-naive evaluation, and the number of tuples of a given extensional relation that are accessed at each iteration. To see how this can be done, let us consider a binary relation with both attributes on the same domain, and represent it as a graph where the nodes are the values in the domain and the arcs represent the tuples of the relation (Figure 7). Using an approach similar to the one introduced in [6], we make the assumption that the graph is layered, with h layers, and define θp and θc as the average number of arcs respectively entering and leaving a node (in our example, the average number of parents and children of every individual). These parameters can be estimated from the following values, which in turn can easily be extracted from the MSDB catalog:
- N: number of tuples in the relation.
- Np: number of distinct values assumed by the first attribute.
- Nc: number of distinct values assumed by the second attribute.
- Npc: number of values common to both attributes.
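Under this reading (arcs drawn from parent values to child values), the catalog parameters can be computed directly from the relation, as in the following sketch; the function name is ours:

```python
def graph_parameters(P):
    """Estimate the layered-graph parameters of a binary relation P:
    N, Np, Nc, Npc, plus theta_p (average number of parents per node)
    and theta_c (average number of children per node)."""
    N = len(P)
    parents = {x for (x, _) in P}        # values of the first attribute
    children = {y for (_, y) in P}       # values of the second attribute
    Np, Nc = len(parents), len(children)
    Npc = len(parents & children)
    theta_p = N / Nc                     # arcs entering a node, on average
    theta_c = N / Np                     # arcs leaving a node, on average
    return N, Np, Nc, Npc, theta_p, theta_c

P = {("p1", "c1"), ("p1", "c2"), ("p2", "c1")}
print(graph_parameters(P))   # (3, 2, 2, 0, 1.5, 1.5)
```

In a real system these counts would come from the MSDB catalog rather than from a scan of the relation; the scan here is only for illustration.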
[Figure 7 here: a layered relation graph, with layers of b0 (bottom) through bh−1 (top) nodes, and arcs drawn between adjacent layers.]
Fig. 7. The relation graph
In fact, according to the definition of θp and θc:

θp = N / Nc ,    θc = N / Np .    (5)
Assuming a regular structure in the graph, the shape of the graph can be deduced from the values of θp and θc. We have two cases:
- θp = θc. In this case we assume for the graph a rectangular shape. The number of layers h and the number b of nodes per layer can then be expressed as:

b = Np − Npc .    (6)

h = 1 + ⌈Np / b⌉ .    (7)
- Θp ≠ Θc. In this case we assume for the graph a trapezoidal shape, and express accordingly h and the number bi of nodes in layer i. For instance, if Θp < Θc, and referring to Figure 7:

b0 = Nc − Npc,   bh−1 = Np − Npc   (8)

bi = b0 (Θc/Θp)^i   (9)

h = 1 + log_{Θc/Θp}(bh−1 / b0)   (10)
To give an example of how this can lead to practical results, let us refer to the procedural code in Figure 6, and show how to compute an upper bound on the number of accesses performed in the MSDB to the relation A at each iteration of the EXTRACT at line 09. To get the upper bound we take the node corresponding to the constant in the query (i.e. a) in layer 0 of the graph. This is, of course, the worst case, and requires h − 1 iterations. During the computation, iteration i passes from a subset N(i−1) of the nodes of layer i − 1 to the subset of the nodes of layer i that are connected to the nodes in N(i−1). Now, if we assume that the nodes are randomly connected, and if we call ni the cardinality of N(i), it can be proved that:
n0 = 1

ni = bi (1 − (1 − 1/bi)^(Θp · n(i−1)))   i = 1, ..., h − 1   (11)
Then the number of tuples ti accessed in the base relation in iteration i is given by:

ti = ⌈Θp · n(i−1)⌉   i = 1, ..., h − 1   (12)
The actual number of I/O accesses needed at each iteration can then be computed by using the well-known formula of Yao [21].

6 Object Selection Queries
In this section we discuss the issues connected to the translation of non-recursive object-oriented queries into relational queries. The results of course also extend to the translation of the algebraic expressions which are part of fixpoint blocks. To discuss the problem we restrict ourselves to a specific class of queries, which consist in extracting from a class the set of objects that satisfy a given condition. We shall call these object selection queries, and we will see shortly that they are far more powerful than a relational selection. In order to define conditions on structured objects, we first introduce, referring to an instance of the OODB, the notion of value tree of an object.
- The nodes of the tree are of four different types: tuple, set, oid and value; these represent respectively the constructors in the type of the object, the oids of the object and subobjects, and the base values.
- The root of the tree is a node of type oid, and has only one child of type tuple, which represents the outermost constructor in the object type.
- Each node of type tuple has one child for every attribute in the corresponding constructor; if the attribute is of base type, the node is a leaf and represents the value of the attribute; if the attribute is of object type the node represents the oid of the subobject; if the attribute is of set type the node is of set type.
- Each node of set type has one child for every element in the corresponding set.
- Each node of oid type is the root of the value tree of the corresponding object.
Note that even in the case of recursive type definitions the value of an object is still represented by a tree. In such cases the tree may become infinite, but this is not a problem since, as we shall see shortly, we only need to consider a finite portion of it. Referring to the object schema in Figure 2, a sample value tree of an object of the class STUDENT is represented in Figure 8. The purpose of the value tree is to expand the set constructors, explicitly representing all the combinations of values in the object. Let us now consider the atomic values in the tree, i.e. the base values in the leaves, and the oids in the oid nodes. Each of these atomic values is characterized by a value path, defined as the sequence of attributes traversed in the tree by the path from the root to the node. We may then define an atomic condition by associating it to a value path, and say that the condition is satisfied by any atomic value in the value tree that has that path. For instance, referring to the value tree of Figure 8, the atomic condition:

STUDENT.curriculum.test.year = 1990
is satisfied by two different atomic values. To set up more complex conditions, i.e. boolean expressions built on atomic conditions, we then need to allow distinguishing between atomic values belonging to different elements of the same set. To do so, we introduce the notion of instance tree, defined as a subtree of the value tree in which for every set node only one child, if any, is retained. We then say that an object satisfies a condition if and only if there exists for that object an instance tree that satisfies the condition, and that a selection query Q(C; B) on the class C selects all the objects in C that satisfy the condition B. For instance, considering the object selection query:
Q1: (STUDENT; STUDENT.curriculum.course = 'MATH1' ∧ STUDENT.curriculum.test.grade = 'A' ∧ STUDENT.curriculum.test.year = 1990)

we see that the object of Figure 8 is not selected by the query, although each atomic condition is satisfied by an atomic value in the value tree. Instead the query:

Q2: (STUDENT; STUDENT.curriculum.course = 'MATH1' ∧ STUDENT.curriculum.test.grade = 'A' ∧ STUDENT.attends.course = 'MATH2')
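To make the value-path machinery concrete, here is a small sketch; the Node layout and function name are our own assumptions (set nodes are modelled as unlabeled children), not the system's representation:

```cpp
#include <cassert>
#include <string>
#include <vector>

// Illustrative value-tree node: `attr` labels the incoming edge (empty for
// the root and for set elements); `value` holds a base value or an oid.
struct Node {
    std::string attr;
    std::string value;
    std::vector<Node> children;
};

// Count the atomic values reached through value path `path` (from position i
// on) that are equal to v; set elements are traversed transparently.
int countMatches(const Node& n, const std::vector<std::string>& path,
                 std::size_t i, const std::string& v) {
    if (i == path.size()) return n.value == v ? 1 : 0;
    int c = 0;
    for (const Node& ch : n.children) {
        if (ch.attr == path[i]) c += countMatches(ch, path, i + 1, v);
        else if (ch.attr.empty()) c += countMatches(ch, path, i, v);  // set element
    }
    return c;
}
```

With two curriculum elements whose tests both carry year 1990, the path curriculum.test.year matches twice, mirroring the atomic condition discussed above; satisfaction of a conjunctive condition would additionally require the matches to lie in one instance tree.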
Fig. 8. The value tree
selects the object, since the condition is satisfied by the instance tree of Figure 9. According to what we have seen in Section 3, an object selection query can be readily translated into a relational query on the canonical schema. The resulting relational expression is in general quite complex, and its optimization is a crucial problem. This is because evaluating conditions on objects with complex nested structure may require performing a large number of joins, mostly on the link attributes introduced by the mapping of the schema. To discuss the optimization strategy we focus on a subclass of the selection queries, the sd-queries, in which the condition B is decomposable, i.e. has conjunctive form B = B1 ∧ B2 ∧ ... ∧ Bk, and each Bi contains only atomic conditions on the attributes of a single tuple constructor. For these queries the optimization problem has a very clean formulation, since they can be translated into relational SPJ queries, consisting in a series of selections on the base relations of the canonical schema, followed by a chain of joins on link attributes. As an example, Figure 10 shows the relational query tree for query Q2. In most RDBMSs, such multijoin queries are usually computed with a nested-loop algorithm, provided that indices are maintained on all the join attributes. This avoids the materialization of the intermediate results, and exploits the
Fig. 9. The instance tree
selectivity of the select and the join conditions. In our case, at least for the sd-queries, this strategy fits very well, since all the joins are performed on the link and oid attributes, and it is very reasonable to keep an index on each link and oid attribute. The optimization of the nested-loop computation of multijoin queries has been given a good deal of attention in the literature. The basic problem is the ordering of the joins to maximize the selectivity along the query tree, and hence to minimize the number of page fetches. The problem is NP-complete, and therefore intractable for a large number of joins. Nevertheless some reasonably efficient heuristic algorithms have been proposed [12, 19]. In our case the cost model can be easily defined, since the joins are on link attributes. Hence estimates of their extensional and statistical parameters, such as originality and multiplicity, can be computed from a user-supplied quantitative characterization of the object schema, e.g. average cardinality of the set attributes, etc. This kind of approach is particularly meaningful in the case of precompiled queries. Moreover, in a context of predefined transactions we may consider another level of optimization, based on the idea of improving the execution cost by
Fig. 10. The relational query tree
means of transformations on the canonical schema. These consist in precomputing some of the joins on the link attributes. This leads to considering the class of admissible schemata, where the optimal schema for a given workload is the one with the best tradeoff between the increase in access cost due to larger relations and the saving connected to prejoins. These issues are discussed in [18], where a detailed cost model is presented and the applicability of some heuristics is analyzed.
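The nested-loop strategy with an index on each link attribute can be illustrated with a toy sketch; the data layout and names are our own, not the system's code. The selection drives the outer loop, and each join step becomes an index lookup, so no intermediate relation is materialized:

```cpp
#include <cassert>
#include <string>
#include <unordered_map>
#include <vector>

struct CurriculumRow { long studentOid; std::string course; };  // link attribute + value
struct StudentRow   { long oid; std::string name; };

// Nested-loop join of the selected curriculum tuples with STUDENT, using a
// hash index on the student oid instead of scanning the inner relation.
std::vector<std::string> studentsTakingCourse(
        const std::vector<CurriculumRow>& curriculum,
        const std::unordered_map<long, StudentRow>& studentByOid,  // index on oid
        const std::string& course) {
    std::vector<std::string> names;
    for (const CurriculumRow& c : curriculum)           // outer loop: selection first
        if (c.course == course) {
            auto it = studentByOid.find(c.studentOid);  // index lookup = join step
            if (it != studentByOid.end()) names.push_back(it->second.name);
        }
    return names;
}
```

The cost of each iteration is one index probe, which is why keeping an index on every link and oid attribute makes the join ordering the only remaining optimization problem.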
References

1. S. Abiteboul and P. Kanellakis. Object identity as a query language primitive. In ACM SIGMOD International Conf. on Management of Data, pages 159-173, 1989.
2. S. Abiteboul and V. Vianu. Procedural and declarative database update languages. In Seventh ACM SIGACT-SIGMOD-SIGART Symp. on Principles of Database Systems, 1988.
3. P. Atzeni, F. Cacace, S. Ceri, and L. Tanca. The LOGIDATA+ model. Rapporto LOGIDATA+ 5/20, Progetto Finalizzato Sistemi Informatici e Calcolo Parallelo, 1990.
4. P. Atzeni and L. Tanca. The LOGIDATA+ Rule Language. Rapporto LOGIDATA+, Politecnico di Milano e IASI-CNR, Roma, 1990. Presented at the Workshop "Information Systems '90", Kiev.
5. F. Bancilhon et al. The design and implementation of O2, an object-oriented database system. In Advances in Object-Oriented Database Systems, Proc. Second Int. Workshop on Object-Oriented Database Syst., K. Dittrich, Ed., Bad Munster, FRG, 1988.
6. F. Bancilhon and R. Ramakrishnan. An amateur's introduction to recursive query processing strategies. In ACM SIGMOD International Conf. on Management of Data, pages 16-52, 1986.
7. F. Cacace, S. Ceri, S. Crespi-Reghizzi, L. Tanca, and R. Zicari. Integrating object oriented data modelling with a rule-based programming paradigm. In ACM SIGMOD International Conf. on Management of Data, pages 225-236, 1990.
8. S. Ceri, G. Gottlob, and L. Tanca. Logic Programming and Databases. Springer-Verlag, Berlin, 1990.
9. A.K. Chandra. Theory of database queries. In Seventh ACM SIGACT-SIGMOD-SIGART Symp. on Principles of Database Systems, pages 1-9, 1988.
10. A.K. Chandra and D. Harel. Horn clauses and generalization. Journal of Logic Programming, 2(1):320-340, 1985.
11. Y. Gurevich and S. Shelah. Fixed-point extensions of first-order logic. Annals of Pure and Applied Logic, 32:265-280, 1986. Also in Proceedings Int. Conf. on Foundations of Computer Science, pp. 346-353, 1985.
12. T. Ibaraki and T. Kameda. On the optimal nesting order for computing n-relational joins. ACM Trans. on Database Syst., 9(3):483-502, 1984.
13. P.G. Kolaitis and C. Papadimitriou. Why not negation by fixpoint. In Seventh ACM SIGACT-SIGMOD-SIGART Symp. on Principles of Database Systems, pages 231-239, 1988.
14. T. J. Lehman and M. J. Carey. Query processing in main memory database management systems. In ACM SIGMOD International Conf. on Management of Data, 1986.
15. T. J. Lehman and M. J. Carey. A study of index structures for main memory database management systems. In Twelfth International Conference on Very Large Data Bases, Kyoto, pages 294-303, 1986.
16. U. Nanni, S. Salza, and M. Terranova. An algebraic approach to the manipulation of complex objects. In 25th Hawaii International Conference on System Sciences, IEEE Press, January 1992.
17. U. Nanni, S. Salza, and M. Terranova. LOA: the LOGIDATA+ Object Algebra. Technical Report 5/23, Progetto Finalizzato Sistemi Informatici e Calcolo Parallelo, 1990.
18. S. Salza and M. Terranova. Efficient Implementation of Object-oriented Databases on Relational Systems. Technical Report, IASI-CNR, 1991.
19. E.J. Shekita and M.J. Carey. A performance evaluation of pointer-based joins. In ACM SIGMOD International Conf. on Management of Data, pages 300-311, 1990.
20. J.D. Ullman. Principles of Database and Knowledge Base Systems. Volume 1, Computer Science Press, Potomac, Maryland, second edition, 1982.
21. S. B. Yao. Approximating block accesses in database organizations. Communications of the ACM, 20(4):260-261, 1977.
MOOD* An Architecture for Object Oriented Access to a Relational DataBase**

Marco Lugli, Luca Nini (1) and Stefano Ceri (2)
(1) CICAIA, Università degli Studi di Modena
(2) Politecnico di Milano
Abstract. In this paper we describe MOOD, a system which may be used to easily and efficiently build C++ programs accessing a relational database. MOOD has its own data model, which is Object-Oriented, and supports object identity, generalization hierarchies, and object sharing; by processing schema definitions, the MOOD system builds a relational schema and a C++ Class Library that enable the interaction between a C++ programming environment and a conventional relational database. We associate a C++ class to each class in the MOOD schema. Methods of this class may be used to access, create or modify MOOD objects. In particular, MOOD primitives provide tools for expressing complex, set-oriented queries to extract objects by traversing the MOOD schema along semantic links. These primitives generate SQL queries in order to extract relevant tuples and assign them to C++ objects; the interface thus developed solves the impedance mismatch between C++ (record-oriented) and SQL (set-oriented). MOOD can be used as a low-level programming environment for building applications; in particular, it is currently considered as an environment for implementing the Logidata+ language.
1 Introduction
In recent years, the language C++ has emerged as a de-facto standard for the development of object-oriented applications. Object-orientation is in fashion as a programming style, and indeed has been able to improve programmers' productivity and the quality and reusability of the software being produced. On the other hand, C++ is a programming language; as such, it lacks several concepts for data management which exist in conventional databases, such as reliability, integrity, security, and support for shared access. Therefore, database management is one of the difficult features in the development of object-oriented applications.

* MOdena Object Database
** This work has been partially supported by CINECA, CICAIA - Università degli Studi di Modena, and by Progetto Finalizzato Informatica e Calcolo Parallelo, Subproject LOGIDATA+
This difficulty is currently addressed by a number of competing approaches. The revolutionary approach aims at the development of new object-oriented languages with persistency; examples include systems like Orion, Gemstone, O2. The evolutionary approach aims at extending SQL, the de-facto standard for relational databases, incorporating a number of object-oriented features into it. Neither of these approaches will probably bring to C++ programmers the environment that they would probably prefer, namely, a transparently persistent C++; and, on the other hand, such an extension would probably be too powerful to be also effective. In this paper, we propose a third approach, where C++ programmers need to rely on current, conventional relational technology for database storage; however, the system that we propose, MOOD, takes the responsibility of generating both the database design and the methods for database access that would otherwise be programmed within the application. As such, MOOD can be considered as a low-level programming tool for interfacing object-oriented programming with relational technology. In this interface, MOOD eliminates a significant source of conceptual difficulties and errors which was recently denoted as the "impedance mismatch" between record-oriented languages (such as C++) and set-oriented languages (such as SQL). This mismatch arises whenever a query to a relational system is used within a record-based language, and is commonly solved by depositing some of the resulting tuples into a few buffer records and then controlling, from the programming language, a producer-consumer mapping, where new tuples are retrieved and deposited as soon as the old ones have been used; these mechanisms are currently implemented by SQL cursors.
One additional problem, which exists in the mapping between programming languages and relational queries, concerns the differences in data types between the language data structures and flat relations; to reduce type coercion, the language data structures are usually also flattened, but this reduces the effectiveness of programs. In this paper, we show how MOOD solves both the impedance and type mismatch problems. MOOD has its own data model, which is Object-Oriented, and supports object identity, generalization hierarchies, and object sharing; by processing schema definitions, the MOOD system builds a relational schema and a C++ Class Library that enables the interaction between a C++ programming environment and a conventional relational database. Since both relational data definitions and C++ class descriptions are generated by MOOD, there is no type mismatch problem. We associate a C++ class to each class in the MOOD schema. Methods of this class may be used to access, create or modify MOOD objects. In particular, MOOD primitives provide tools for expressing complex, set-oriented queries to extract objects by traversing the MOOD schema along semantic links. These primitives generate SQL queries in order to extract relevant tuples and assign them to C++ objects; the interface thus developed eases the impedance mismatch between C++ and SQL. In the first place, the correspondence between methods and SQL queries is ensured by MOOD, which generates the correct SQL queries from high-level descriptions of methods. In the second place, the methods which are generated by MOOD include primitives for the selective iteration through results, implementing a high-level cursor. Though this paper is focused on programs which interact heavily with relational databases, MOOD is indeed best suited to writing classical object-oriented applications, where the language C++ has proved to be most successful. MOOD, however, bridges these applications with some persistent stored data. The overall guideline of the project has been generality and portability; the use of commercially available products and languages ensures that this objective is met.

2 Data Model
In this section, we describe the MOOD data model.

2.1 Rationale
Various research groups all over the world have concentrated their work on the study of new data models called object-oriented models. In these models, objects are grouped into classes; each class has an associated set of methods, which are the only computations applicable to the class. In general, objects of the same class have the same type. A common feature of these models is support for concepts which are lacking from the simpler relational model, such as:
- Object identity: each object has an intrinsic identity, which does not depend on the value of the object. Identity is usually provided by associating to each object a system-wide unique object identifier.
- Generalization hierarchies between classes: though a few alternative interpretations exist for them, the usual interpretation of a partial order between classes, say C1 < C2, is that the set of objects of class C1 (or: the extent of C1) is a subset of the set of objects of class C2, and that the type of C1 is a refinement of the type of C2.
- Object sharing: objects can relate to each other with part-subpart relationships, and in particular each object can be shared by multiple other objects.
In designing the MOOD data model, our goal was to choose the most commonly used and representative concepts of object-oriented data models, so that MOOD could be considered a general basis for the development of object-oriented applications.

2.2 Description
A MOOD database consists of a set of classes. Classes are sets of objects having a unique object identifier (oid), which is not hidden from the users. Each class has an associated type. Class (type) names must be unique in the schema.
Classes and their types are defined through class definitions. Some class names useful for building other class definitions may have a forward declaration. A class type is built from other types with a tuple constructor. Each of the types involved in a class definition must be one of the following:
- A basic type provided by MOOD. Currently these are integer and string (fixed-length strings).
- A previously declared class type.
- A sequence of the above types.
Sequences, however, are not used arbitrarily, as orthogonal type constructors. Thus, sequences allow representing multivalued mappings (e.g., a person is mapped to multiple children and to multiple addresses), but not general complex, structured values (e.g., the address above is a simple string, and not the triple street, city, state). Each type in a class definition denotes a property of objects of the class. Properties have names. For example, name and age are properties of the class Persons. Properties corresponding to basic types are given values compatible with those types. Properties corresponding to classes assume as values either the oids of existing objects in the referred class, or the nil value. Class-valued properties model part_of associations. We distinguish between direct and inverse sides. Inverses are always explicit: whenever class A refers to class B through attribute x, then class B must have an attribute y denoting the inverse relation. This is declared with a specific syntactic construct. The user should be aware of the fact that inverse associations are traversed less effectively; he is offered the option of modelling a single part_of association through two independent associations, with an opposite choice of direct and inverse link, but then he must also preserve the correctness of these redundant associations when they are manipulated by programs. Class hierarchies are supported in MOOD. Given two classes A and B such that B is a subclass of A (B isa A), then:
- Objects belonging to class B have the same properties as those in A, optionally extended by others.
- Each object of B is also an object of A, i.e.
it may be used wherever an object of class A is used.
2.3 Syntax
Figure 1 contains the context-free grammar of the MOOD DDL. Items enclosed in curly brackets may be repeated zero or more times. Multiple rules for the same non-terminal correspond to alternative productions. Figure 2 describes the schema declaration in MOOD for a classical Person-Student-Department database. Note that, in view of the definition in Fig. 2, each person may direct multiple departments while each department has a single director; students are specific persons with codes and sequences of grades.
MoodSchema → ClassDescription { ClassDescription }
ClassDescription → class class_name : forward ;
ClassDescription → class class_name ( property_name : Type { , property_name : Type } ) ;
ClassDescription → class class_name isa superclass_name ( property_name : Type { , property_name : Type } ) ;
Type → BaseType
Type → sequence BaseType
BaseType → int
BaseType → char(n)
BaseType → class_name
BaseType → class_name inverse of class_name

Fig. 1. Grammar of MOOD Schema Definition Language

class Person forward;

class Department (
    DepName: char(20),
    Director: Person
);

class Person (
    Name: char(20),
    Age: int,
    Directs: sequence Department inverse of Director
);

class Student isa Person (
    Code: int,
    Grades: sequence int
);

Fig. 2. Example of schema definition in MOOD.
3 Architecture
This section describes the use of the MOOD system in the context of the development of C++ applications. The general architecture of MOOD is illustrated in Fig. 3. The MOOD system is composed of three components:
- A compiler of the MOOD schema definitions. The compiler checks definitions for consistency (in particular, it verifies that all inverse relations are correctly specified) and then produces two outputs:
  - Source code for the C++ Class Library which includes C++ definitions for classes and methods derived by analyzing a MOOD schema. These create the MOOD Application Library (MAL).
  - Source code containing SQL schema definitions required to build a relational database incorporating suitable relations to model the objects in the MOOD schema.
- The MOOD Generic Library (MGL). It contains general MOOD primitives, whose use will be described in the next section.
- The Access Methods (AM) for accessing the DBMS; AMs are specialized for the specific software products being used for database access, and contain all non-portable software, which is thus localized, though stored within the MGL. Currently, two AMs are based on the ORACLE C interface and Informix; other AMs could be easily added.
The C++ schema generated by the MOOD schema compiler is produced according to the following rules:
1. Each MOOD class Ca is represented with a C++ class with the same name. It is either a subclass of other C++ classes corresponding to MOOD superclasses of Ca, or a subclass of the predefined class Tuple. Each class Ca is also associated to an object of the predefined type Handle, called Ca-DB. The Handle is used for accessing the database through low-level access methods.
2. Each property of type int, char(n), or sequence is represented with a method of the same name and type int, char[], or Sequence, respectively; Sequence is a predefined type with associated methods.
3. Each property of Class type A is represented with a method returning a reference (pointer) to a C++ object of class A.
Obviously we cannot represent an object-oriented MOOD schema in a relational schema with a one-to-one correspondence. The relational model lacks such features as inheritance, composite attributes, object identity, tuple-valued attributes, etc. The SQL schema produced by the MOOD schema compiler generates a normalized relational representation, built as follows:
1. Each class of the MOOD schema is represented with a SQL relation having the same name. All SQL relations obtained this way have an attribute Oid of type integer.
2. Each property of type int or char(n) is represented with an attribute of SQL type INTEGER or CHAR(n), respectively.
[Diagram: the MOOD Schema is processed by the MOOD Compiler, which produces (a) the MAL, compiled and linked with the user C++ program and the MGL/AM into an executable program, and (b) SQL schema definitions, compiled and executed to build the relational DB.]

Fig. 3. Software Architecture for using MOOD.
3. Each direct property of type Class is represented with the Oid of the corresponding object, physically stored as an attribute of type INTEGER. It will contain the value 0 if the associated Oid is nil.
4. Each property of type sequence is represented using an additional relation to bring the schema into first normal form. Each such relation has:
(a) The Oid attribute of the relation it refers to.
(b) An integer attribute Pos specifying the ordinal position of the value in the sequence.
(c) An attribute for the value. Its type is either integer or string.
The name of this relation is the concatenation of the class name and the attribute name.
5. An index is built over the Oid attribute of each relation; relations corresponding to sequences have a composite index on the Oid and Pos attributes.
The relational schema corresponding to the example of Fig. 2 is shown in Fig. 4. Note that no table implements the inverse of the Director association. When a property is declared as inverse, the corresponding method invokes a query returning the required elements by inspecting the direct property, extensionally stored in the database.
person(Oid, Name, Age)
department(Oid, DepName, Director)
student(Oid, Code)
studentMarks(Oid, Seq, Marks)

Fig. 4. Relational schema for the MOOD database of Fig. 2
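The mapping rules above can be sketched as a small generator; the function and parameter names are our own illustrative choices, and the real compiler's output format may differ:

```cpp
#include <cassert>
#include <string>
#include <utility>
#include <vector>

// Emit the DDL for one MOOD class: a main relation with an Oid column
// (rules 1-3) plus one auxiliary relation per sequence property, named by
// concatenating the class and attribute names (rule 4).
std::string ddlForClass(
        const std::string& cls,
        const std::vector<std::pair<std::string, std::string>>& attrs,  // name -> SQL type
        const std::vector<std::pair<std::string, std::string>>& seqs) { // sequence props
    std::string ddl = "CREATE TABLE " + cls + " (Oid INTEGER";
    for (const auto& a : attrs) ddl += ", " + a.first + " " + a.second;
    ddl += ");\n";
    for (const auto& s : seqs)
        ddl += "CREATE TABLE " + cls + s.first +
               " (Oid INTEGER, Pos INTEGER, " + s.first + " " + s.second + ");\n";
    return ddl;
}
```

For the Student class of Fig. 2, such a generator would produce one main relation plus a studentGrades-style auxiliary relation for the sequence property, matching the normalization described above.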
At run time, before using MOOD libraries, the programmer must indicate which database interface should be used, and open it. This is an explicit and somewhat undesired link, but in this way the Access Methods (AM) in the MGL are absolutely general; they refer to variable names that are initialized through this activity. The Database Definition is specific for the particular DBMS: when using Oracle one needs to specify a username and a password, whereas with Informix the database name is sufficient. An example declaration could be (3):

Database *SchoolDatabase = new InformixDatabase("School");
The database must be opened, i.e. the program must call the LogOn method before operating on the database, and LogOff at the end of the execution.
4 MOOD Application Programs
MOOD application programs are written using standard C++; however, the MGL library contains a number of generalized data types and methods that may be used in applications; they provide the basic functionalities for database access and for navigation upon the retrieved data.

(3) This is a C++ definition for a variable SchoolDatabase of type "pointer to Database", initialized with an instance of the class InformixDatabase, specialized for the database whose external name is School.
4.1 Initialization
The MOOD primitives are methods of an object type called NavigationNodes. This object may be considered as a meta variable that can hold subsets of database objects (i.e. objects of the MOOD schema) belonging to a specified class. Navigation nodes are declared by application programmers in correspondence to the sets of objects that they want to manipulate in their programs; each navigation node is associated to a unique C++ class, but several navigation nodes may correspond to the same class. Further, each navigation node is associated to an object that serves as a handler into the database. The handler of a class is a special object which is responsible for accessing the relational database for retrieving an object from it; it also maintains a cache of retrieved objects so that, when the requested object is already in main memory, no external access to the database is done. The following lines present the declarations of two NAVIGATION_NODE (the C++ type for a Navigation Node). STUDENT_DB and PERSON_DB are the handlers for the classes Student and Person.
NAVIGATION_NODE My_Student_Navigation_Node(SchoolDatabase, STUDENT_DB);
NAVIGATION_NODE AllPersons(SchoolDatabase, PERSON_DB);
4.2 Use of Navigation Nodes
Each NAVIGATION_NODE has a 'current object' which may be changed by iteration primitives (Foreach). One can access this object with the method Current(). Thus, a normal C++ variable can be given a value which is obtained by using the Current method on a NAVIGATION_NODE, as illustrated by the following code fragment:

Student s;
...
s = AS_STUDENT(My_Student_Navigation_Node.Current());
...
printf("...", s->Name());
...
Having an object of some database type, one can access its attributes via C++ methods (as in s->Name()). One interesting feature of this model of operations (thanks to a standard feature of the C++ language) is that, in order to follow a path through part_of relations, one just has to use C++ pointer-following as if exploring an in-memory structure. For instance, in the following code fragment we indicate how the name of the director of a department may be printed:
242
Department d; .
.
.
d = .... ; printf (. . . . ~
.
4.3
~
9
9
d->Director()->Name()
...
);
~
MOOD Primitives
The following primitives are part of the standard MOOD library. We describe each of them, including some fragments of C++/MOOD code to exemplify their use.

Foreach(Variable); Follow(Attribute, Variable); Clear(Variable);
Sql(Variable, "where_clause"); And(Vres, V1, V2);
Or(Vres, V1, V2); Not(Vres, V); Join(Join_Condition, V1, V2);
BuildRel(Result, V1, ..., Vn);
Foreach. This primitive implements iteration over the set of tuples contained in Variable (a NAVIGATION_NODE); for instance, in the following program fragment it visits all persons currently in the navigation node AllPersons; their names and ages are subsequently printed.

NAVIGATION_NODE AllPersons(PERSON_DB);
...
while (AllPersons.Foreach()) {
    printf("...", AllPersons.Current()->Name(),
                  AllPersons.Current()->Age() );
}

At each iteration through the loop, the 'current object' of AllPersons is a different object contained in AllPersons (the order of iteration is not specified).

Follow. The Follow primitive may be used to navigate the database through part_of relations. Follow is normally used in conjunction with Foreach, as exemplified by the following code fragment, which connects each director to the departments he directs.

NAVIGATION_NODE AllDepartments(DEPARTMENT_DB);
NAVIGATION_NODE AllDirectors(PERSON_DB);
Person p;
while (AllDirectors.Foreach())
    AllDirectors.Follow("Directs", AllDepartments);
The Follow primitive has source and target navigation node variables; as an effect of executing the primitive, the target variable contains all the objects that are connected to objects in the source variable through the part_of link. Note that one could also navigate along part_of links through the corresponding methods, but without instantiating the target navigation node; further, the Follow primitive is more efficient than the use of methods for part_of links of type sequence.

Clear. It simply empties a NAVIGATION_NODE variable. It may be used to prepare a NAVIGATION_NODE for the next computation. For instance:

NAVIGATION_NODE Persons_Older_Than_20(PERSON_DB);
Persons_Older_Than_20.Clear();
Persons_Older_Than_20.Sql( "age > 20" );
Persons_Older_Than_20.Sql( "age < 10" );
// Persons_Older_Than_20 now contains
// the union of the results of both
// SQL operations.
And, Or and Not. These functions implement the obvious boolean operations on NAVIGATION_NODEs; And and Or are applied to pairs of navigation nodes and generate their intersection or union; Not generates the complement of an instantiated navigation node with respect to the values contained in the database for that node.

NAVIGATION_NODE Persons_Older_20(PERSON_DB);
NAVIGATION_NODE Toy_Directors(PERSON_DB);
NAVIGATION_NODE Toy_Directors_Older_20(PERSON_DB);
Persons_Older_20.Sql( "age > 20" );
Toy_Directors.Sql( "DepName = Toy*" );
And(Toy_Directors_Older_20, Persons_Older_20, Toy_Directors);
Sql. Sql executes a query on the table associated with the navigation node Variable, and loads the node with the resulting tuples, after type coercion. For example, we retrieve all persons more than 20 years old and print their names, as follows:

NAVIGATION_NODE Persons_Older_Than_20(PERSON_DB);
...
Persons_Older_Than_20.Sql( "age > 20" );
while (Persons_Older_Than_20.Foreach())
    printf("...", Persons_Older_Than_20.Current()->Name() );
The query expression can contain complex predicates, e.g., joins with other relations; this primitive, however, selects the result on the target table. The predicate may involve conditions on attributes of subobjects, but in doing so the programmer must know the underlying relational schema. Full joins can be obtained with the primitive Join, described later in this section.

Join. In order to execute a relational full join operation efficiently, MOOD provides the primitive Join. Given a join condition (which, as in the Sql primitive, may be any legal SQL predicate) and two NAVIGATION_NODEs, MOOD executes a query on the relational database joining the relations associated with the NAVIGATION_NODEs. Consider for example the following program fragment:

NAVIGATION_NODE Directors(PERSON_DB);
NAVIGATION_NODE Departments(DEPARTMENT_DB);
...
Join("Person.Oid = Department.Director", Directors, Departments);
...

To execute the join, MOOD will execute the following SQL query:

select * from person, department
where person.oid = department.director
After the execution of a Join primitive, MOOD loads the selected objects into the corresponding NAVIGATION_NODEs Directors and Departments, and sets up additional information between objects related by the join (as after the execution of a Follow primitive) to support a possible, subsequent execution of the BuildRel primitive.

BuildRel. This function must be preceded by a series of navigations with Follow and Foreach, or with Join. Its effect is to build a relation that describes all selected navigations. More precisely, the relation contains one tuple for each distinct path from all selected objects in V1 to all selected objects in Vn (n is limited to a given maximum value, dependent on the MOOD implementation). The only constraint is that all navigation nodes referenced by the call must have been previously instantiated through the Foreach, Follow, and Join primitives.
// Link each director with the department he directs
NAVIGATION_NODE AllDepartments(DEPARTMENT_DB);
NAVIGATION_NODE AllDirectors(PERSON_DB);
Person p;
...
while (AllDirectors.Foreach())
    AllDirectors.Follow("Directs", AllDepartments);

// Generate a relation linking departments
// to their directors, called
// "persons_departments"
...
BuildRel( "persons_departments", AllPersons, AllDepartments );
4.4
Implementing the Isa Hierarchy
The isa hierarchy C1 < C2, defined in a MOOD schema, is translated as follows:
1. Relational Schema. Each class corresponds to a different relation; therefore, R1 and R2 are generated. However, their Oid attributes take values from the same domain D. If objects O1 ∈ C1 and O2 ∈ C2 are hierarchically related (O1 < O2), then the corresponding tuples t1 ∈ R1 and t2 ∈ R2 have the same Oid value.

2. C++ Data Structures. We use the default C++ inheritance; thus, subclasses automatically inherit all methods of the superclass. Since we want to be able to use an object at each level of the isa hierarchy, we must store the representations of the attributes of all the subclasses in the top object. By doing so, all the objects in the hierarchy have the same length, and we may have a single pointer to an object, independently of the class to which it belongs; we may use a simple C++ type cast to change the class of the object (4). On the contrary, if we stored the attributes in the object to which they belong, then objects at different levels in the hierarchy would have different lengths, and we could not have the same pointer to the object regardless of the class.

(4) Though a normal cast would suffice, the macro AS_class should always be used, since it checks whether the object really belongs to the class we are casting it to. Objects maintain a record of their identity, and trying to use an object with the wrong identity (e.g., asking for the marks of a Person which is not a Student) causes a runtime error.
4.5
Building a MOOD Application
We now detail the steps needed to run MOOD applications. First of all, the user must compile the MOOD schema. Let us assume that it is contained in a file named school.mo; then the following commands are executed (mc is the MOOD compiler):

% mc -s school.mo
% mc -c school.mo

The first command produces a file school.sql that should be run on the relational database for table creation. The second command produces, for each MOOD class, one file with the .cc extension, containing the C++ method source code, and one with the .h extension, containing declarations to be imported in each user program. It also produces the files school.cc and school.h, which contain general definitions about the schema. In our example, the produced files are: person.cc, department.cc, student.cc, person.h, department.h, student.h, school.cc, school.h. They are next compiled into a library, using the conventional C++ compiler and utilities. Finally, the programmer can write his application. This is a normal C++ program, which imports the MOOD definitions, defines appropriate navigation nodes, and uses the MOOD primitives. When the program is complete, it must be compiled and linked with the library just created. It can then be executed normally.
5
Example
A sample (real) program which queries the database for various operations is the following:

 1  #include <stream.h>
 2  #include <School.h>
 3  #include <Student.h>
 4  #include
 5  #include
 6  #include
 7  #include
 8  Database *SchoolDatabase = new InformixDatabase("School");
 9  NAVIGATION_NODE
10      Persons_Older_Than_25(SchoolDatabase, PERSON_DB),
11      Directors(SchoolDatabase, PERSON_DB),
12      Departments(SchoolDatabase, DEPARTMENT_DB),
13      Student_100(SchoolDatabase, STUDENT_DB);
14
15  main(int argc, char **argv)
16  {
17      SchoolDatabase->LogOn();
18      Persons_Older_Than_25.Sql("age > 25");
19      // Sql - pers
20      cout << "Names of employees older than 25\n";
21      // while - 1
22      while (Persons_Older_Than_25.Foreach()) {
23          Student s = AS_STUDENT(Persons_Older_Than_25.Current());
24          Person  p = AS_PERSON(Persons_Older_Than_25.Current());
25
26          cout << p->Name() << "\t";
27          cout << p->Job() << "\n";
28
29          if (s)
30              cout << "\tCode: " << s->Code() << "\n\n";
31          else
32              cout << "\t(This is not a Student !!)\n\n";
33      }
34      cout << "\n";
35      // Sql - dep
36      Departments.Sql( ... );
37      // while - 2
38      while (Departments.Foreach()) {
39          Departments.Follow("Director", Directors);
40      }
41      // while - 3
42      cout << "Department data\n";
43      while (Departments.Foreach()) {
44          Department d = AS_DEPARTMENT(Departments.Current());
45          cout << d->DepName() << "\t";
46          cout << d->Director()->Name() << "\n";
47      }
48      cout << "\n";
49      cout << "Directors data\n";
50      while (Directors.Foreach()) {
51          Person p = AS_PERSON(Directors.Current());
52          cout << p->Name() << "\n";
53      }
54      cout << "\n";
55      cout << "BuildRel: Departments - Directors\n";
56      BuildRel("ne.rel", &Departments, &Directors, NULL);
57      cout << "\n";
58      while (Persons_Older_Than_25.Foreach()) {
59          Student s = AS_STUDENT(Persons_Older_Than_25.Current());
60          if (s == NULL) continue;
61          Pix i;
62          cout << s->Name() << "\t";
63          Sequence v = s->Votes();
64
65          cout << "Votes: ";
66          if (!SEmpty(v)) {
67              for (i = SStart(v); i != 0; SNext(v, i))
68                  cout << " " << *SAt(v, i, int *);
69          }
70
71          cout << "\n";
72      }
73      SchoolDatabase->LogOff();
74  }
We now describe in detail each part of the program.
- Lines 1-7 include (i.e., import) the external declarations needed by the program.
- Lines 8-14 define the database interface and the NAVIGATION_NODEs used. Each NAVIGATION_NODE is connected to the database and to a particular class.
- The program starts executing at line 17, doing a LogOn to the database by using the method LogOn().
- At line 18 the program makes its first query to the database. It asks the MAL to retrieve some Persons from the relational database and to create the corresponding NAVIGATION_NODE Persons_Older_Than_25, so that the program can operate on them, as it does in lines 22-33. It iterates over all the MOOD objects just retrieved, displaying some data about them. It also checks which Persons are also Students by trying a coercion to the desired class (the macro AS_STUDENT) and checking whether the resulting object pointer is NULL (i.e., the nil object).
- Line 36 executes a query on the Departments class, retrieving all the objects.
- At lines 38-40 we see how to execute a Follow between Departments and Directors. The effect of this loop is to retrieve, and put in the NAVIGATION_NODE Directors, pointers to all the persons which are directors of some department. MOOD also stores the information needed to remember which department caused the inclusion of which person (for this particular pair of NAVIGATION_NODEs). This information is used by the BuildRel at line 56.
- Lines 56-72 iterate over all the persons (retrieved in line 18) which are also students, to display their votes. Lines 67-69 iterate over the votes of a single student. The expression *SAt(v, i, int *) extracts the i-th vote of the student. The macros SStart and SNext control the iteration.
- Finally, at line 73, the program logs off from the database.

6
Previous Related Work
MOOD presents an interface between an object-oriented language and a relational database; related work concerns various methods for interfacing databases with programming languages.

6.1
Logic Systems
Logic offers an ideal setting for these interfaces because of the "natural" correspondence between predicates in logic and relations in databases. Several systems and prototypes proposed in the literature couple Prolog to a relational database, including CGW, PRIMO, BERMUDA, ESTEAM, and QUINTUS-PROLOG [3]. The coupling is almost always loose, i.e., the interaction with the database takes place independently of the inference process. Some of the predicates used in the Prolog program are classified as database predicates. When the program refers to one of them, the underlying coupling system transforms the logic goal into an SQL query; the query is executed by the relational system; and the results are transferred into the main-memory environment of the Prolog program.

6.2
Object-oriented Databases
A significant trend in database systems is toward the design of object-oriented data models in which it is possible to describe complex objects with identity (see [9], [10], [11]). In this field there should, in principle, be no need for interfacing with programming languages, since these systems propose new-generation languages which are computationally complete and persistent. However, in almost all of them the need is felt for interfacing the most popular programming languages (e.g., CO2 between C and O2 [9]), because conventional programming languages remain the most common vehicles for writing applications. This consideration has influenced our work.
6.3
Host Language Interfaces to SQL
Since SQL is not computationally complete, it has always been hosted by conventional programming languages, like COBOL, C or PASCAL, with statements like:

EXEC SQL SELECT COUNT(*)
         INTO :COUNT-EMPLOYEES
         FROM EMPLOYEES
END EXEC
The above instructions store the count of employees into a variable COUNT-EMPLOYEES, which must be known to both SQL and the host language. This is a first mechanism for communication, eased by the fact that the query result is a singleton relation. When queries instead select sets or multisets, communication is achieved through cursors. First a cursor must be associated with a query, and then the host program can use the FETCH instruction to retrieve successive tuples from the query result into the variables of the programming language. This approach forces the programmer to use two languages in the same program, and to alternate tuple-based with set-oriented processing. Compared to cursors, the approach defined in this paper is at a higher level: the programmer can use methods and primitives for all interaction with the database. Primitives such as Foreach and Follow allow navigation across semantic links, while the primitive BuildRel assembles the results of computations in powerful ways. However, the full power of SQL remains available, through the Sql and Join primitives, if one wants to lower the level of the interaction.

7
Conclusions
The design and implementation of MOOD has now reached a stable state. In particular, all functions described in this paper have been implemented. Given the loose-coupling approach, the prototype is not a high-performance system. This meets our expectations: we were aiming at verifying the feasibility of the approach, not its performance. The current state of MOOD is not definitive; in particular, we expect it to evolve along various directions:
1. Methodology & Tools. In order to turn MOOD into a workable programming system, it must include some tools to enable the programmer to design and tailor his application: schema analyzers, automatic code generators for "standard" tasks, and so on.
2. Access to existing databases. MOOD builds its own database to store classes. It is conceivable to do the opposite, i.e., to start from an existing relational database and then build the C++ classes to access it. We could then use all the primitives developed for MOOD in this new context.
3. Dynamic MOOD. MOOD is a static system: it is not possible to extend the schema at runtime. This is because C++ is a statically typed language. Nevertheless, if one wants to build interactive applications, it would be nice to have the capability to define new classes at runtime. While C++ prevents us from defining new types, it has features that may give some run-time flexibility.
4. Development of other systems with MOOD. MOOD can be used as a tool for building other systems. We are considering implementing LOGIDATA+ in C++ using MOOD; other applications could be in the context of graphical database interfaces, to generate navigational queries through pointing devices on the schema.
References

1. P. Atzeni, F. Cacace, S. Ceri, L. Tanca: "The Logidata+ Model"; Rapporto Interno, Obiettivo Logidata+, 1990.
2. F. Cacace, S. Ceri, S. Crespi Reghizzi, L. Tanca, R. Zicari: "The LOGRES Project: Integrating Object Oriented Data Modeling with a Rule Based Programming Paradigm"; Rapporto Interno n. 89039, Politecnico di Milano, Dipartimento di Elettronica, 1989.
3. S. Ceri, G. Gottlob, L. Tanca: "Logic Programming and Databases"; Springer-Verlag, 1990.
4. B. Stroustrup: "The C++ Programming Language"; Addison-Wesley, 1986.
5. S. Ceri, G. Gottlob, G. Wiederhold: "Interfacing relational databases and Prolog efficiently"; Proc. 1st Int. Conf. on Expert Database Systems, Charleston, 1986.
6. E. Ioannidis et al.: "BERMUDA - an architectural perspective on interfacing Prolog to a database machine"; Computer Science Technical Report 723, University of Wisconsin, Madison, Wis., Oct. 1987.
7. F. Gozzi: "Design and implementation of efficient interfaces between logic programming environments and relational databases - complex predicates (in Italian)"; Diploma Dissertation, Computer Science School, University of Modena, Modena, Italy, Dec. 1987.
8. M. Lugli: "Design and implementation of efficient interfaces between logic programming environments and relational databases - metainterpreter and simple predicates (in Italian)"; Diploma Dissertation, Computer Science School, University of Modena, Modena, Italy, Dec. 1987.
9. O. Deux et al.: "The O2 System"; Communications of the ACM, Vol. 34, No. 10, October 1991.
10. C. Lamb, G. Landis, J. Orenstein, D. Weinreb: "The ObjectStore Database System"; Communications of the ACM, Vol. 34, No. 10, October 1991.
11. P. Butterworth, A. Otis, J. Stein: "The GemStone Object Database Management System"; Communications of the ACM, Vol. 34, No. 10, October 1991.
Prototypes in the LOGIDATA+ Project
A. Artale, J.P. Ballerini, S. Bergamaschi, F. Cacace, S. Ceri, F. Cesarini, A. Formica, H. Lam, S. Greco, G. Marrella, M. Missikoff, L. Palopoli, L. Pichetti, D. Saccà, S. Salza, C. Sartori, G. Soda, L. Tanca and M. Toiati
Authors' affiliations:
A. Artale, F. Cesarini and G. Soda: Dipartimento di Sistemi ed Informatica, Università di Firenze, Italy. J.P. Ballerini, S. Bergamaschi and C. Sartori: CIOC-CNR, Bologna, Italy. F. Cacace, S. Ceri and L. Tanca: Dipartimento di Elettronica, Politecnico di Milano, Milano, Italy. A. Formica, M. Missikoff, L. Pichetti, S. Salza and M. Toiati: IASI-CNR, Roma, Italy. S. Greco, G. Marrella, L. Palopoli and D. Saccà: Dipartimento di Elettronica, Informatica e Sistemistica, Università della Calabria, Rende, Italy. H. Lam: Database Systems R&D Center, University of Florida, USA.
1
Presentation
(L. Palopoli)
The purpose of the LOGIDATA+ project has been to design and implement an advanced database environment supporting a number of advanced features: a complex data model integrating value-based, object-based and function-based data entities; a logic-based query language with negation, including module specifications and exception handling; a complex-object basic query machine efficiently implementing query answering on LOGIDATA+ databases; a logic-based language for updating LOGIDATA+ databases; and, finally, a taxonomic classifier to perform type-classification-based querying on the schema, also to be used as the kernel component of a tool for computer-aided database design. The LOGIDATA+ project was designed as a 3-plus-2-year project, where the first three years were intended for research on a number of open problems in the field covered by the project, and the last two years
were intended for developing a complete prototype of the LOGIDATA+ system. This paper covers the prototype development activities carried out in the first three years of the project. Several of the research topics covered in the first three years of the LOGIDATA+ project have been followed by a prototyping activity: taxonomic reasoning, complex forms of negation, modularization, and implementation models and techniques are among them. In the remainder of this section, the prototypes are briefly overviewed. Three prototypes correspond to the research developed on the problem of adding taxonomic reasoning facilities to a complex-object database framework. The first prototype (Section 2) is called LOGITAX and has been developed at the Dipartimento di Sistemi ed Informatica, Università di Firenze, by A. Artale, F. Cesarini and G. Soda. LOGITAX is a preprocessor which allows one to build a taxonomy of classes defined according to an extension of the LOGIDATA+ data model, called LOGIDATA*. The second prototype (Section 3) is called ODL-DESIGNER and has been developed at CIOC-CNR, Bologna, by J.P. Ballerini, S. Bergamaschi and C. Sartori. ODL-DESIGNER is a tool supporting the acquisition of complex-object schemata, which are checked for consistency and minimality. The third prototype (Section 4), called MOSAICO, has been designed and implemented at IASI-CNR, Roma, by A. Formica, H. Lam, M. Missikoff and M. Toiati. MOSAICO is a tool designed to help the rapid prototyping of database applications based on a complex-object data model. One prototype has been developed which regards the problem of augmenting classical logic programming with very powerful kinds of negation. This prototype, called PATRIOTS (Section 5), has been developed at the Dipartimento di Elettronica, Informatica e Sistemistica, Università della Calabria, by S. Greco, G. Marrella and D. Saccà.
Its purpose is to implement several techniques defined within the project to compute non-deterministic stable semantics and semantics with exceptions in logic languages with negation. The LOGRES prototype (Section 6) can be thought of as being associated with a many-fold research topic: studying the modularization issue in logic languages for databases, studying the problem of update and query specification on complex-object databases through a logic-based language and, finally, investigating the suitability of non-first-normal-form relational databases as a kernel on which to implement complex-object databases. This prototype has been developed at the Dipartimento di Elettronica, Politecnico di Milano, by F. Cacace, S. Ceri and L. Tanca. The LOGRES prototype translates complex-object declarations into the extended relational model of the Algres language. A corresponding translation takes place for rules too, which are then executed on the Algres interpreter. The last prototype presented in this overview relates to the problem of efficiently bridging a logic query language environment to an existing (commercial) relational database. The focus in this case is on the efficiency of the coupling between the two environments. The prototype, called LOGIBASE+ (Section 7),
has been developed at IASI-CNR, Roma, by S. Salza and L. Pichetti. The system has been defined to extend the capabilities of a commercial database management system by providing a rule-based interface and a number of modules to optimize and evaluate logic queries. The paper closes with a reference section and a final section in which the table and the figures referred to throughout are reported.

2
LOGITAX: A Classifier for LOGIDATA+ (A. Artale, F. Cesarini, G. Soda)

LOGITAX (LOGIdata TAXonomy) is a tool developed in the LOGIDATA+ project for organizing class descriptions in a taxonomic graph. The taxonomy refers to the concept of subsumption: a class C subsumes a class C' if and only if C' is a subclass of C. Therefore the taxonomy is built by determining all the subsumers and subsumees of each class. Subsumption can be determined by examining the class descriptions, according to a suitable denotational semantics. This kind of approach strongly relies on the concept of defined class, i.e., a class whose type definition gives necessary and sufficient conditions for an object to belong to it. The taxonomy makes it possible to perform some inferences (taxonomic reasoning) that are useful in several applications, such as conceptual schema design, instance recognition and query answering. Since taxonomic reasoning is not usually present in object-oriented data models, its introduction into the LOGIDATA+ model has required a suitable formal framework. The classifier implemented refers to the LOGIDATA* data model, which allows us to distinguish between defined and primitive classes. A class is called primitive if its definition gives necessary conditions for an object to belong to it; it is called defined if the conditions are necessary and sufficient. Subsumption between primitive classes must be stated by user ISA declarations, whereas, in the case of defined classes, it can be computed by examining the structure of the class definitions. Therefore, the classifier can be used as a preprocessor both for checking user definitions and for discovering ISA links not explicitly stated by the user. Since LOGIDATA* supports multiple inheritance and acyclic definitions, LOGITAX manages a Directed Acyclic Graph (DAG) whose nodes represent the classes. Furthermore, the graph is maintained in a minimal form; that is, there is a link between nodes x and y if and only if y subsumes x and there is no z such that y subsumes z and z subsumes x.
2.1
LOGITAX Structure
The classification process consists of two main steps: Parsing and Insertion.
Parsing. In this step, the description of the new class is acquired and translated into a canonical form. The parser recognizes whether or not an input string is a LOGIDATA* description, making a distinction between a type_declaration and a class_declaration. During this phase LOGITAX verifies the absence of recursive definitions and the correctness of descriptions. An automatic translation mechanism for converting a user class description into a canonical form is embodied in the parser. The canonization process generates the class structure that is used in the next phase of classification; multiple inheritance is managed by expanding the class names appearing in the ISA clause.
Insertion. The aim of this step is to arrange the class, expanded in the previous phase, within the graph. Insertion is performed by: 1) finding all the direct parent classes (the Most Specific Subsumers, MSSs); 2) finding all the immediate descendants (the Most General Subsumees, MGSs). Furthermore, all the operations necessary for maintaining the graph in a minimal form are performed.

Search for the Most Specific Subsumers. In this step, a class is put under its most specific subsumers in the graph hierarchy on the basis of its properties. Since the data model supports multiple inheritance, it is possible for a class to be subsumed by two or more incomparable classes. In determining the most specific subsumers we take into account the graph's structure and the ISA clause appearing in the user class description, operating in both a top-down and a bottom-up way. Let x be the class we want to introduce; the search process is carried out by means of the following steps: 1. A list of classes that subsume x is built by considering only the user-specified ISAs and the transitivity of the links present in the graph. This process is performed by means of a bottom-up navigation of the graph. 2. For each class of the list, all its direct subclasses are examined, looking for a subsumption relationship. If one of these subclasses subsumes x, then LOGITAX proceeds top-down, examining all its direct subclasses; otherwise, it skips to the next subclass. In this way, the classifier finds the Most Specific Subsumers of x, i.e., the classes that subsume x and do not have subclasses that in turn subsume x; a link is then asserted between x and each of its Most Specific Subsumers.

Search for the Most General Subsumees. In this step LOGITAX looks for the classes y subsumed directly by x (i.e., there is no z for which x subsumes z and z subsumes y). Due to the transitivity of subsumption, any MGS of x is subsumed by all the MSSs of x. Therefore, in order to find all the MGSs of x, it is sufficient to choose an MSS of x and to determine whether or not its descendants are subsumed by x. At the end of this phase, all redundant links are eliminated; thus the graph's minimality is maintained. LOGITAX is implemented in C-Prolog under MS-DOS.
3

The ODL-DESIGNER Prototype (J.P. Ballerini, S. Bergamaschi, C. Sartori)

ODL-DESIGNER is an active tool which supports the automatic building of class and type taxonomies, preserving the consistency and minimality of a database schema. It is based on the inference technique of taxonomic reasoning, developed in the AI environment. Taxonomic reasoning is founded on the computation of subsumption relationships between two classes on the basis of their descriptions, and allows the discovery of implicit ISA relationships between derived classes (also named defined or virtual), which are analogous to database views and whose descriptions constitute necessary and sufficient conditions. ODL-DESIGNER is based on a tractable and complete subsumption algorithm which is able to compare very general class and type descriptions, as addressed in the papers "Taxonomic reasoning in LOGIDATA+" and "Taxonomic reasoning with cycles in LOGIDATA+" in this volume. In ODL-DESIGNER a class is expressed as a genus, i.e., a conjunction of ancestor (not necessarily parent) class names, and differentiae, i.e., class descriptions obtained by freely nesting the tuple, sequence and set operators. Further, set cardinalities can be expressed and, besides primitive classes (whose descriptions constitute necessary conditions), derived classes are also supported. The main feature, with respect to LOGITAX and to MOSAICO (see Sections 2 and 4), is the support of cyclic definitions. The tool is able to guarantee a passive consistency check for a taxonomy of primitive classes: given a new class, by subsumption it is possible to compare its description with a given class taxonomy, to detect whether it is incoherent (subsumed by the empty class) and thus to refuse it. A more active role can be played if the taxonomy also includes derived classes: for a coherent class description, a minimal description (i.e., a description rewritten on the basis of its most specific generalizations) is computed, and thus the class is placed in the right position in the taxonomy.
In this way, semantic equivalence of classes can be recognized (i.e., different names and/or syntactic descriptions which correspond to the same minimal description) and redundancies with respect to a taxonomy are removed. Figure 1 shows the architecture of ODL-DESIGNER. The program is divided into two main functional groups:
F1: allows the creation and the transformation of an ODL schema from a syntactic and semantic point of view, checks whether the ISA relation graph and the relations between types are cycle-free and, if the schema is correct, generates the canonical form of the ODL schema;
F2: finds out the minimal ODL schema starting from the canonical ODL schema.
Figure 2 depicts the structure of CREATE, the schema creation procedure: the user interface allows schema creation by directly inserting a description or by giving the name of a text file containing the descriptions. The input schema is then analyzed from a syntactic and semantic point of view and, in case of success, the absence of ISA cycles and type cycles is checked. If the schema is well formed (i.e. the previous checks are positive), the canonical schema is generated, in order to recognize possibly incoherent classes or types. In this phase an interaction with the user is necessary when errors are detected, in order to ease the design process. The canonical schema becomes the input to the taxonomic reasoner, which generates the minimal schema. The ADD procedure has a structure very similar to that of CREATE. The main difference is that it allows a simplified reasoning algorithm, since it compares new descriptions, which are potentially incorrect or conflicting, with a corpus of well-formed definitions. Finally, the MODIFY procedure is similar to ADD, as it starts from an existing schema, but in this case the knowledge has a non-monotonic growth and the coherence computation for the modified schema has to be done from scratch.
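The subsumption-based placement of a new class can be sketched in a few lines; this is a drastic simplification of our own, not the ODL-DESIGNER algorithm: a class description is reduced to a flat set of (attribute, type) constraints, and C1 is subsumed by C2 exactly when C2's constraints are a subset of C1's. All names and data below are invented for the example.

```python
# Hypothetical sketch of subsumption-based classification (not the
# actual ODL-DESIGNER algorithm): a description is a frozenset of
# (attribute, type) constraints; C1 <= C2 iff C2's constraints hold in C1.

def subsumes(c2, c1):
    """c2 subsumes c1 (c1 <= c2) iff every constraint of c2 holds in c1."""
    return c2 <= c1  # frozenset subset test

def classify(new_desc, taxonomy):
    """Place a new description under its most specific subsumers."""
    subsumers = {name for name, desc in taxonomy.items()
                 if subsumes(desc, new_desc)}
    # keep only the most specific ones (drop any subsumer of a subsumer)
    return {n for n in subsumers
            if not any(m != n and subsumes(taxonomy[n], taxonomy[m])
                       for m in subsumers)}

taxonomy = {
    "Employee": frozenset({("name", "str"), ("salary", "int")}),
    "Manager":  frozenset({("name", "str"), ("salary", "int"),
                           ("leads", "Project")}),
    "Project":  frozenset({("title", "str")}),
}
new_class = frozenset({("name", "str"), ("salary", "int"),
                       ("leads", "Project"), ("budget", "int")})
print(classify(new_class, taxonomy))  # -> {'Manager'}
```

The new class is placed directly under Manager, and the implicit ISA link to Employee follows by transitivity, which mirrors the computed ISA relations table of Figure 1.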
4 MOSAICO: a System for Specification and Rapid Prototyping of Object-Oriented Database Applications (A. Formica, H. Lam, M. Missikoff, M. Toiati)
The system Mosaico has been conceived to support the design and rapid prototyping of data-intensive applications based on Object-Oriented Databases (OODBs). The system is described by illustrating first its architecture, then its data model, and finally some implementation issues.
4.1 Architecture of the system Mosaico
The architecture of the system is depicted in Figure 3. The functions supplied by the system can be grouped into four main subsystems.
- OODB Application Specification - This subsystem supports the designer in the process of constructing the specification of an application, using the specification language TQL++. An application is specified by defining a set of types and a set of associated actions. Types can be defined from scratch or imported (and possibly adjusted) from a type library. New types are inserted in the type library for future reuse. Type and action definitions form the Object-Oriented application specification, referred to as schema++ (the term schema without pluses is used for the structural specification alone, i.e. data definition without actions).
- OODB Application Verification - The formal nature of the language TQL++ allows the semantic verification of the specification of an application. Mosaico processes the specification aiming at finding incorrect descriptions, such as incomplete, inconsistent, or unsafe assertions. To this end the specification is translated to an internal (logical) format and is supplied to a theorem prover.
- Rapid prototyping - This subsystem is devoted to the compilation of the schema++ and the production of executable code. The code produced implements a prototype of the application. To actually run the prototype, it
is generally required to load a few objects. To this end Mosaico supplies a language, Lobster, for object definition and the initial load of the database.
- Query processing tool - Having defined the schema and having populated the database, it is possible to perform query functions. One key feature of the system is the possibility of querying the database using the same language conceived for the Data Definition: TQL. In essence the enquiry is performed by defining new query-types; the system then retrieves all the objects that satisfy the query-type and constructs the answer class.
4.2 The TQL++ data model
The data model of the TQL system is based on the traditional elements that can be found in other proposals of the field. In particular, objects have identifiers (oids), which are immutable throughout their life, and unique, i.e. each object has a different identifier. Objects have a state, given in terms of the values of their properties. Properties, when evaluated, can be associated to basic values, corresponding to built-in types (such as integer or string), or to complex objects. In the latter case, referenced objects are identified by means of oids. Mosaico allows the designer to specify a database application by defining a schema++, i.e. a set of complex types and their associated actions. Types include the specification of the structure of the objects, semantic integrity constraints, and the protocol (i.e. the set of action declarations). Actions define objects' behavior in a declarative way. The data definition component of a type is composed of a tuple of typed properties (due to lack of space, we skip the behavioral part). A property can be typed with: (i) basic types, (ii) an explicit set of values (enumerated type), (iii) an interval, (iv) a user-defined type, (v) a tuple of typed properties. A property can be single-valued or multivalued. In the latter case it is possible to impose cardinality constraints. Types are organized in a generalization hierarchy. Example 1. The types person and student, and two objects, defined according to the syntax of TQL and Lobster, respectively:
    person := [name:str, addr:[street:str, city:str],
               tel:{int}0,3,
               age:(0..180), sex:(M,F), vehicle:car]
    student := ISA person [college:str]

    PERSON = {(#p3: [name:john, addr:[street:broadway, city:S.F.],
                     tel:3456667, age:34, sex:M, vehicle:#c12]),
              (#p7: [name:amy, addr:[street:tinyWay, city:L.A.],
                     tel:{7666345, 7666543}, age:22, sex:F, vehicle:#c12])}

In the example we assume that the type car has been defined. The oids are indicated by a code that starts with the character # (pound).
4.3 Implementation issues
A first prototype of Mosaico has been developed on a SUN workstation, using BIM-Prolog. The current version generates prototypes in the form of Prolog code. A code generator producing C++ code is under development.
5 A System Prototype for the Computation of DATALOG Programs with Negation (S. Greco, G. Marrella, D. Saccà)
DATALOG is a Horn clause language without function symbols which is mainly used as a query language for relational databases [4, 15]. Because of recursion, its expressive power is greater than that of classical database languages based on predicate calculus; nevertheless, DATALOG is unable to fulfill all requirements of new demanding knowledge-based applications. Therefore, many recent research projects have been adding new features to DATALOG [5, 11, 10, 16]. In [9] and [7] two particular usages of negation have been proposed to add, respectively, non-determinism and exception mechanisms to DATALOG, together with a number of algorithms for implementing their semantics. In this section we describe a prototype that has been used both to experiment with the expressive power of the two mechanisms and to verify the feasibility of their implementation.
5.1 Architecture of the Prototype
The prototype handles queries on DATALOG programs with negation, whose base of facts is stored in mass memory (files or relational databases). The architecture of the prototype is pictured in Figure 4. There are six main modules in the prototype: the User Interface, the Schema Manager, the Program Manager, the Query Compiler, the Interpreter and the Database Interface. The User Interface manages the dialogue with the user and provides access to the three main services supported by the prototype: (i) design of the fact base schemata, (ii) definition of DATALOG programs on the fact base of one of the available schemata, and (iii) submission of queries. The Schema Manager handles the data dictionary of the schemata used by the DATALOG programs; in particular, for each schema, it stores the name of each base predicate symbol, the name of the corresponding relation, and the database or the file where the tuples of each relation are stored. The Program Manager receives DATALOG programs from the user interface and stores them into a repository. Each program is partitioned into a partially ordered set of subprograms (components), each of them consisting of the rules which define mutually recursive predicates. The Query Compiler receives from the user interface a query on a program and compiles the program in such a way that all answers of the query are computed, possibly using constants for restricting this computation. The result of a query
compilation is a new program (rewritten program) whose execution generates all query answers. The steps performed by the compiler are the following:
1. selecting the components of the program that are useful to compute the query (and possible desirable properties such as program stratification are checked to devise a better execution strategy);
2. reordering the goals appearing in the body of rules in order to guarantee safety and improve the performance of execution;
3. generating the rewritten program;
4. sending the rewritten program to the interpreter;
5. forwarding the answers received from the interpreter to the user interface and, if required, storing the rewritten program into the repository for further executions.
The Interpreter consists of a program that computes a DATALOG program using classical fixpoint computations as well as the algorithms of [9, 7]. It is written using meta-programming in PROLOG, so that the powerful mechanisms of this language represent the basic components of the DATALOG machine; e.g., unification is used to implement the operations of relational algebra. The Database Interface provides access to external database relations or files. It realizes a 'tight' coupling, that is, the loading of external data as facts in main memory is optimized using knowledge of the history of previously executed retrievals for caching [4]. This module too is realized through meta-programming in PROLOG.
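The partitioning of a program into components of mutually recursive predicates, mentioned above for the Program Manager, can be sketched as follows. This is an illustrative sketch of our own, not the prototype's code: mutual recursion is detected through a naive transitive closure of the predicate dependency graph.

```python
# Sketch (our own, not the prototype's): partition a DATALOG program
# into components of mutually recursive predicates.

from itertools import product

def depends(rules):
    """rules: dict head -> list of body predicates. Returns the edge set."""
    return {(h, b) for h, body in rules.items() for b in body}

def components(rules):
    preds = set(rules) | {b for body in rules.values() for b in body}
    reach = depends(rules)
    # naive transitive closure of the dependency graph
    changed = True
    while changed:
        changed = False
        for (a, b), (c, d) in product(list(reach), list(reach)):
            if b == c and (a, d) not in reach:
                reach.add((a, d)); changed = True
    # p and q belong to the same component iff each reaches the other
    comps = []
    for p in preds:
        group = frozenset(q for q in preds
                          if p == q or ((p, q) in reach and (q, p) in reach))
        if group not in comps:
            comps.append(group)
    return comps

# p and q are mutually recursive; base is an extensional predicate
rules = {"p": ["q", "base"], "q": ["p"]}
print(components(rules))
```

On this input the mutually recursive pair {p, q} forms one component and the base predicate another; ordering the components so that each one depends only on earlier ones gives the partial order the Interpreter follows.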
5.2 An Example of Implementation of the Prototype
As shown in Figure 4, the module Interpreter is implemented as a PROLOG meta-program. In order to give an idea of how this module is realized, next we sketch the implementation of the part of the Interpreter which computes the fixpoint of a DATALOG program. This computation is a key operation in the evaluation of DATALOG programs and is the basis for the actual implementation of the algorithms of [9, 7]. As already mentioned, a program P is partitioned into a number of components {P1, ..., Pn} that are partially ordered in such a way that each predicate appearing in a subprogram Pi depends only on predicates appearing in subprograms Pj for which j <= i:

    query(X) :- relevant_program(X, Components),
                compute(Components),
                eval(X).

The predicate relevant_program computes the list [Pk1, Pk2, ..., Pkn] of components that are relevant for the execution of the query. The components are listed in the above order, and the Interpreter computes them following this order.
    compute([Pj|OtherComponents]) :- seminaive(Pj, 0),
                                     compute(OtherComponents).
    compute([]).

The predicate seminaive computes each component Pj using the seminaive strategy [15]. The iterative process of the seminaive computation is enforced using the construct fail inside the body of the rules. The second argument of seminaive stores the index of the iteration.

    seminaive(Pj, 0) :- exit_rule(Pj, Rule), eval_rule(Rule, 0), fail.
    seminaive(Pj, I) :- I > 0, recursive_rule(Pj, Rule), eval_rule(Rule, I), fail.
    seminaive(Pj, I) :- generated_tuples(Pj), I1 is I + 1, seminaive(Pj, I1).
    seminaive(_, _).

The computation of the above rules can be summarized as follows:
1. at step 0 the exit rules of the component Pj (first rule) are computed;
2. at step I > 0 the recursive rules of Pj (second rule) are computed;
3. the computation of the second rule is repeated until no new facts are generated; at this point, if at least one fact has been generated in the current step, the third rule starts the computation for a new step, otherwise the computation halts, since the fixpoint has been reached.
For the evaluation of a rule, we first evaluate the body and then, if the body is true, we store the head. Notice that, because the rules are safe, the head of the rule is bound after the evaluation of the body, that is, it does not contain free arguments.

    eval_rule((Head :- Body), I) :- eval_body(Body, I), store(Head, I), fail.

The body of a rule is true if all its goals are true. The predicate eval_body is then:

    eval_body((B1, B2), I) :- !, eval_predicate(B1, I), eval_body(B2, I).
    eval_body(B1, I) :- eval_predicate(B1, I).
The predicate eval_predicate first checks if there exists in main memory a tuple that matches the specified predicate. If no such tuple exists and the argument is a base predicate, it calls the Database Interface to load more tuples from the database. As already pointed out, this prototype has been implemented to experiment with the expressive power of the stable model semantics [9] and of negation in rule heads [7]. For programs with negative rule heads (the so-called negative programs), the prototype requires stratification, whereas no restriction is imposed on non-negative programs. While imposing stratification simplifies the computation, it also reduces the expressive power, as argued in [9]; therefore, future work is devoted to dealing with unstratified negative programs. The present name of the prototype, PATRIOTSMS (Possibly Another Tool for Rapid Implementation Of Total Stable Model Semantics), is not stable; we are waiting for new developments of the prototype to select a more impressive name.
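The seminaive loop sketched above can also be paraphrased outside of PROLOG; the following is our own Python rendering of the strategy, not the prototype's meta-program, using transitive closure as the running example.

```python
# Our own paraphrase of the seminaive strategy (not the prototype's
# Prolog code): exit rules run at step 0; recursive rules are then
# re-applied only to the facts derived at the previous step.

def seminaive(exit_rule, recursive_rule, base):
    """exit_rule() -> initial facts; recursive_rule(delta, total) -> new facts."""
    total = set(base)
    delta = exit_rule() - total          # step 0: exit rules
    total |= delta
    while delta:                          # step I > 0: recursive rules
        delta = recursive_rule(delta, total) - total
        total |= delta                    # halt when no new facts appear
    return total

# transitive closure of a small edge relation as the classic example
edges = {(1, 2), (2, 3), (3, 4)}
closure = seminaive(
    exit_rule=lambda: set(edges),
    recursive_rule=lambda delta, total:
        {(a, d) for (a, b) in delta for (c, d) in edges if b == c},
    base=set(),
)
print(sorted(closure))  # includes (1, 4) after two iterations
```

The subtraction of total from each round of derived facts is what distinguishes the seminaive strategy from the naive one: already known tuples are never re-derived.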
6 The LOGRES prototype: a logic language for object-oriented databases (F. Cacace, S. Ceri, L. Tanca)
The Logres system is an advanced database system based on the integration of object-oriented data modeling and a rule-based language for the specification of queries and updates. The data model includes generalization hierarchies, object identity and object sharing. The rule language can be seen as an extension of Datalog, designed to handle complex objects. A Logres program consists of two parts:
1. a data structure declaration, which may include the definition of classes (sets of objects with identity) and relations (sets of ordinary tuples), and which is specified by defining the type associated to each class or relation;
2. a rule program, which consists of a set of logical rules partitioned into modules (also called "Logres applications").
The Logres system prototype translates class definitions into the extended relational model of the Algres language [3]. The rules in the program are translated into extended relational algebra expressions, which are evaluated by the Algres interpreter when a query is requested. Algres is an advanced system for the development of applications which need the capability of defining and manipulating complex data structures. The Algres abstract machine executes relational algebra operations on non-first-normal-form relations which are stored in a memory workspace. The Algres environment consists of the Algres abstract machine and several interface modules. The rationale underlying the use of Algres as a prototyping tool for the Logres system is two-fold:
- evaluating the representational capabilities of Algres against the highly complex data model of Logres, which is significantly outside the realm of the ordinary relational model;
- verifying the feasibility of building generic object-based DBMSs using ordinary relational systems, taking advantage of their technological maturity.
In order to obtain the Algres representation of the Logres data model, four main problems are to be solved:
- the tuples in an Algres relation have a fixed schema; conversely, objects belonging to a generalization hierarchy in Logres must be regarded as variant records, so their structure does not obey a fixed schema; this structural mismatch has to be taken into account;
- Algres is value-oriented like any other relational system and therefore does not support object identity, which is a basic component of the Logres model;
- the Logres model allows object sharing, which is not supported in Algres;
- cyclic schema definitions are allowed for Logres classes, whereas they cannot be used within the Algres model.
In the implementation of the Logres system, many design choices were motivated by efficiency reasons. Before presenting the most important of those design choices, it is therefore worth illustrating the most frequent operations ordinarily carried out on Logres data structures. The working hypothesis on frequencies gives the following as the most frequent operations:
- to request all the objects belonging to a given class;
- to evaluate attribute values of a given object;
- to evaluate attribute values of an object which is part of another object (navigation of a part-of relationship).
Thus, the translation of the Logres data model into the Algres one has been defined along the following design choices:
1. an Algres relation is created for each class or relation defined in the declaration part of a Logres program; then, Algres relations corresponding to Logres classes belonging to a certain generalization hierarchy are related using several auxiliary distinct Algres relations;
2. the schema of each Algres relation corresponding to a Logres class includes a supplementary attribute named Self which is used to simulate object identifiers;
3. each Algres relation corresponding to a Logres class has type sequence and is ordered with respect to the Self attribute; in this way, the navigation along hierarchical relationships can be realized through a merge-join operation between Algres relations, obtaining a linear complexity in the cardinalities of the involved classes;
4. whenever a part-of relationship exists in Logres between two classes, a supplementary Algres relation is created which includes pairs of (values simulating Logres) object identifiers expressing the part-of relationships; this relation is ordered along one of its two attributes and is used for indexed accesses to the involved relations.
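Design choice 3 can be illustrated with a small sketch; the relation names and contents below are invented for the example, not taken from the Logres implementation. Two relations ordered on a Self-like key are joined with a single merge pass, giving a cost linear in the relation cardinalities.

```python
# Illustration of design choice 3 with invented data: a merge-join over
# two relations ordered on a Self-like key, one linear pass over each.

def merge_join(left, right):
    """left, right: lists of (key, value) sorted by key. Returns joined rows."""
    out, i, j = [], 0, 0
    while i < len(left) and j < len(right):
        lk, rk = left[i][0], right[j][0]
        if lk < rk:
            i += 1
        elif lk > rk:
            j += 1
        else:
            # collect all right rows sharing the key (1:N part-of links);
            # left keys are assumed unique, as object identifiers are
            k = j
            while k < len(right) and right[k][0] == lk:
                out.append((lk, left[i][1], right[k][1]))
                k += 1
            i += 1
    return out

# a class extension ordered by Self, and a part-of relation ordered likewise
employees = [("#e1", "ann"), ("#e2", "bob"), ("#e3", "cal")]
works_on  = [("#e1", "#p9"), ("#e2", "#p7"), ("#e2", "#p9")]
print(merge_join(employees, works_on))
```

Since both inputs stay sorted on the simulated object identifier, no hashing or sorting is needed at query time, which is the point of keeping the Algres relations ordered on Self.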
As already mentioned, a Logres program consists of a declaration part, in which all the data entities used in the application are defined, and of a set of rules written according to a syntax similar to that of Datalog, which express the manipulations to be carried out over the defined data structures. It is known from the theory of deductive databases that the minimal model of a Datalog program can be computed bottom-up as the fixed point of a transformation associated to the translation of Datalog rules into relational algebra expressions. This technique can also be adopted in the case of Logres programs. Thus the evaluation of a generic Logres program consists of the following steps:
1. the dependency graph associated to the given program is first constructed; then, it is partitioned into a collection of connected components, each of which can be evaluated in successive applications of the fixed point procedure;
2. the relational algebra expression associated to each rule of the given program is generated;
3. the fixed point procedure is applied to each of the connected components of the dependency graph, following the order computed at step 1.
Steps 1 and 2 are statically executed by the Logres compiler, which translates the source program into an object program coded in ALICE. The integrated utilization of Algres and the C language seems to be very well suited for the efficient implementation of Logres rules and of the instructions needed for the execution control, respectively.
The object program coded in ALICE has the following structure:
- the structure of the Algres entities corresponding to Logres objects is generated during the declaration compilation phase; Algres relations corresponding to extensional predicates in the Logres logic program are loaded in the workspace by several calls to the POI module, which manages persistent entities;
- a loop controlling a number of Algres expressions for rule evaluation is included for each connected component of the dependency graph; conditions which check for the termination of the fixed point evaluation process are associated with each loop;
- depending on the specific instructions included in the given Logres program, the Algres relations associated to intentional predicates are either stored in mass memory by invoking the POI module or used for query answering.
7 The LOGIBASE+ prototype (S. Salza, L. Pichetti)
LOGIBASE+ is a prototype system for the management of large deductive databases developed at IASI-CNR [14]. The aim of the project has been to overcome the practical limitations of classic logic programming systems (e.g., Prolog), which can reasonably perform only on applications of limited size. The system extends the capabilities of a commercial relational database management
system (Oracle), by providing a rule-based interface and additional processing modules to optimize and efficiently evaluate logic queries. The system has been especially conceived to provide good performance on the evaluation of recursive queries. This has been attained on one hand by relying on the efficient execution of set-oriented operations by the relational DBMS, and on the other hand by introducing additional capabilities for the optimization of queries, the management of temporary relations and the execution of the relational primitives in main storage. More precisely, a Main Memory Database (MMDB) has been implemented to store temporary relations with suitable access structures, and new relational primitives have been defined and implemented to efficiently carry on the fixed point evaluation. This has required a large implementation effort, but finally allowed a dramatic improvement in performance. Great attention has been devoted to query optimization. This is indeed a very important feature in a relational DBMS, and becomes a critical issue when recursive queries are taken into account. The problem has been largely addressed in the literature, although no final solution to it has been given [1, 12, 13]. Traditional optimization consists of logical rewriting methods. In our system we have added another level, the physical optimization, to take into account the possibility of performing the relational operations either in main memory or through the RDBMS. The crucial issue in the design has been the efficient coupling with the RDBMS. The architecture of the prototype provides indeed a series of options, that range from having all the computation performed by the RDBMS, to loose coupling, i.e., loading the relations entirely in main memory, and to tight coupling, i.e., working with frequent and more selective interactions with the RDBMS.
The first option allows the system to retain the RDBMS efficiency on non-recursive queries, while the others give room to the optimization procedure for recursive queries, which selects the kind of coupling based on an estimate of the execution cost. The architecture of the LOGIBASE+ prototype is sketched in Figure 5. The diagram shows three layers in the architecture, which correspond to the various phases of the transformation and of the evaluation of deductive queries. The first layer corresponds to the logic level, where the transaction is originally expressed as a logic program in a deductive rule-based language. More specifically, the Rule-based Interface provides the user access to the schema of the deductive database and allows the user to consult the database catalog, to define new intensional predicates, to add new rules for them and to submit queries. The latter are in fact logic programs, and are passed, for a first level of optimization, to the Logical Optimizer. This module utilizes rewriting techniques to produce a new logic program, equivalent to the original one but better suited for an efficient bottom-up evaluation. The second layer translates the declarative program into a procedural program, written in an internal code to be evaluated against the extensional database. The procedural code includes the primitives for communication with the RDBMS, and the control structures needed to perform the fixed point evaluation of recursive queries. Additional primitives are provided to efficiently perform the relational operations directly in main memory. The procedural code generation is strictly connected to a second level of optimization, which we shall call physical optimization, in the sense that it strongly depends on the physical allocation of the database, i.e., on the extensional parameters and the access structures in the mass storage database (MSDB), managed by Oracle, and on the current status of the Main Memory Database. The third layer corresponds to the evaluation phase, which is performed by the Fixed-point Evaluator. This module interacts on one side with the RDBMS, to which it may directly request the evaluation of expressions of relational algebra, and on the other side with the MMDB. The latter is used to store the intermediate results as temporary relations, and to efficiently perform in main memory the operations that are required during the fixed point evaluation. The prototype has been implemented on a Sun SPARC 1 workstation, running Oracle version 6 under Unix. The performance of the prototype has been studied on a sample workload that has been randomly generated for the purpose. The database is the typical family database, with a relation Parent of 50000 tuples, and a relation Person of 30000 tuples. More precisely, the relation Parent is structured as a layered graph with 6 layers of 5000 nodes each, and two arcs entering and two arcs leaving each node. The sample workload included several queries. For the non-recursive queries the response times were compared to those of the equivalent queries directly submitted to Oracle through the SQLPLUS interactive interface. The times were indeed pretty much the same. This is because, in such cases, the system adopts the strategy of directly translating the query into an SQL expression which is entirely executed by Oracle. As a sample recursive query we used the same-generation query, which has indeed been heavily used in the literature.
The query was run for several values of the constant. In fact, referring to the layered structure of the relation graph, the execution cost depends on the layer to which the node corresponding to the constant in the query belongs. More precisely (assuming a linear cost for the joins) the cost grows exponentially with the number of layers that are traversed. Some experimental results are reported in Table 1 for several values of the node layer. The table gives the total elapsed time and the time spent by Oracle. Considering the size of the relations and the number of layers, these data seem to be quite satisfactory, although a direct comparison is not easy, since most of the systems presented in the literature work in main memory and with considerably smaller relations.
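As an illustration of the benchmark query (on a miniature graph of our own, not the 50000-tuple Parent relation), the same-generation query can be evaluated bottom-up as follows.

```python
# Miniature illustration of the same-generation benchmark query
# (our own toy data, not the layered 50000-tuple Parent relation):
# sg(X, X).
# sg(X, Y) :- parent(Xp, X), sg(Xp, Yp), parent(Yp, Y).

def same_generation(parent, const):
    """Bottom-up evaluation of sg, restricted at the end to the constant."""
    nodes = {n for e in parent for n in e}
    sg = {(x, x) for x in nodes}           # exit rule: everyone is own peer
    while True:                             # naive fixed point iteration
        new = {(x, y)
               for (xp, x) in parent
               for (yp, y) in parent
               if (xp, yp) in sg} - sg
        if not new:
            break
        sg |= new
    return {y for (x, y) in sg if x == const}

# two layers under a common root: r is parent of a and b; a of c; b of d
parent = {("r", "a"), ("r", "b"), ("a", "c"), ("b", "d")}
print(sorted(same_generation(parent, "c")))  # c is same-generation as d
```

Each fixed point iteration climbs one layer of the graph, which is why the cost of the benchmark in Table 1 depends on the layer of the node bound to the constant.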
References
1. F. Bancilhon, R. Ramakrishnan, An amateur's introduction to recursive query processing strategies, Proc. ACM SIGMOD Conf. on Management of Data, 16-52, 1986.
2. F. Cacace, S. Ceri, S. Crespi Reghizzi, L. Tanca, R. Zicari, Integrating object-oriented data modeling with a rule-based programming paradigm, Proc. ACM SIGMOD Conf. on the Management of Data, Atlantic City, 1990.
3. S. Ceri, S. Crespi Reghizzi, G. Lamperti, L. Lavazza, R. Zicari, Algres: an advanced database system for complex applications, IEEE Software, July 1990.
4. S. Ceri, G. Gottlob, G. Wiederhold, Efficient Database Access Through Prolog, IEEE Transactions on Software Engineering, Feb. 1989.
5. D. Chimenti, R. Gamboa, R. Krishnamurthy, S. Naqvi, S. Tsur, C. Zaniolo, The LDL System Prototype, IEEE Transactions on Knowledge and Data Engineering, Vol. 2, No. 1, March 1990.
6. A. Formica, M. Missikoff, Adding Integrity Constraints to Object-Oriented Databases, Proc. of the ISMM First International Conference on Information and Knowledge Management (CIKM 92), Baltimore, Maryland, 1992.
7. S. Greco, M. Romeo, D. Saccà, Evaluation of Negative Logic Programs, in this volume.
8. H. Lam, M. Missikoff, Mosaico: A Specification and Rapid Prototyping Environment for Object-Oriented Database Applications, Tech. Report, IASI, Jan. 1993.
9. N. Leone, M. Romeo, P. Rullo, D. Saccà, Effective Implementation of Negation in Database Logic Query Languages, in this volume.
10. G. Phipps, M. A. Derr, K. A. Ross, Glue-NAIL!: A deductive database system, Proc. ACM SIGMOD Conference, Denver, USA, June 1991.
11. R. Ramakrishnan, D. Srivastava, S. Sudarshan, CORAL - Control, Relations and Logic, Proc. 18th VLDB Conference, Vancouver, Canada, August 1992.
12. D. Saccà, C. Zaniolo, The generalized counting method for recursive logic queries, Proc. ICDT, Rome, LNCS 243, 1986.
13. D. Saccà, C. Zaniolo, Magic counting methods, Proc. ACM SIGMOD Conf. on Management of Data, 49-59, 1987.
14. S. Salza, L. Pichetti, Logibase+: an efficient implementation of a rule based interface to a commercial database management system, V-th Int. Conf. on Systems of Data and Knowledge Bases, Lvov, USSR, Sept. 1991.
15. J. D. Ullman, Principles of Database and Knowledge-Base Systems, Vol. 1 and 2, Computer Science Press, Rockville, Md., 1989.
16. J. Vaghani, K. Ramamohanarao, D. Kemp, Z. Somogyi, P. Stuckey, The Aditi deductive database system, Proc. of the NACLP'90 Workshop on Deductive Database Systems, Austin, USA, October 1990.
8 Figures and Tables
Fig. 1. ODL-Designer's Functional Architecture (F1: creation, modification and addition, canonical form generation of an ODL schema; F2: well-formedness check and subsumption computation on the canonical ODL schema, producing the minimal ODL schema, the computed ISA relations table and the incoherent classes and types).

Fig. 2. ODL-Designer: CREATE Function (User Interface, Syntactical-Semantical Analyzer and Cycles Recogniser, Canonical Form Generator, Taxonomic Reasoner; outputs: canonical schema, incoherent classes and types, computed ISA relations table).
Fig. 3. Functional Architecture of Mosaico (Schema Editor and Browser, Semantic Verification Module, Prototype Code Generator, Query Tool, Object DB; inputs: specification of the DB application domain, results of DB application analysis, object DB test cases; users: design team member, application user).
Fig. 4. Architecture of the PATRIOT prototype (queries, answers, programs/schema).
Fig. 5. The LOGIBASE+ prototype architecture (interactive user, rule-based interface, logical optimizer, rewritten rules, code generator and physical optimizer, procedural code, fixed-point evaluator, EDB catalog, Oracle RDBMS, main memory database (MMDB)).
layer | result cardinality | elapsed time | Oracle time
------+--------------------+--------------+------------
  0   |        683         |  109.4 sec   |  31.5 sec
  1   |        167         |   23.7 sec   |   8.5 sec
  2   |         42         |    6.3 sec   |   2.2 sec
  3   |         10         |    2.2 sec   |   0.6 sec
  4   |          3         |    1.6 sec   |   0.4 sec

Table 1. Performance on the same-generation query
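Table 1 reports LOGIBASE+ timings for the classic same-generation recursive query. For reference, a minimal bottom-up (naive fixpoint) evaluation of that query; the rules, relation names, and toy data are illustrative, not taken from the prototype:

```python
# Same-generation query, evaluated bottom-up to a fixpoint.
# Datalog rules (standard textbook formulation, assumed here):
#   sg(X, X).
#   sg(X, Y) :- par(X, XP), par(Y, YP), sg(XP, YP).

def same_generation(par):
    """par: set of (child, parent) pairs. Returns the sg relation."""
    people = {p for pair in par for p in pair}
    sg = {(x, x) for x in people}          # base case: everyone is in
    changed = True                         # the same generation as self
    while changed:                         # naive fixpoint iteration
        changed = False
        for (x, xp) in par:
            for (y, yp) in par:
                if (xp, yp) in sg and (x, y) not in sg:
                    sg.add((x, y))
                    changed = True
    return sg

# Toy family tree: a and b share parent r; c and d are their children.
par = {("a", "r"), ("b", "r"), ("c", "a"), ("d", "b")}
print(sorted(same_generation(par)))
```

A layered evaluation, as measured in Table 1, restricts the derivation to a bounded recursion depth, which is why result cardinality and elapsed time both shrink as the layer index grows.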