Innovative Applications of Information Technology for the Developing World: Proceedings of the 3rd Asian Applied Computing Conference (Advances in Computer ... and Engineering: Reports and Monographs)

INNOVATIVE APPLICATIONS OF INFORMATION TECHNOLOGY FOR THE DEVELOPING WORLD Advances in Computer Science and Engineer...

14 downloads 2117 Views 24MB Size Report

This content was uploaded by our users and we assume good faith they have the permission to share this book. If you own the copyright to this book and it is wrongfully on our website, we offer a simple DMCA procedure to remove your content from our site. Start by pressing the button below!

Report copyright / DMCA form

DOWNLOAD PDF

INNOVATIVE APPLICATIONS OF INFORMATION TECHNOLOGY FOR THE

DEVELOPING WORLD

Advances in Computer Science and Engineering: Reports and Monographs Editor-i n-Chief: Erol Gelenbe (Imperial College) Advisory Editors: Manfred Broy (Technische Universitaet Muenchen) Gerard Huet (INRIA) Published

Vol. 1 New Trends in Computer Networks edited by T. Tugcu, M. @glayan, F. Alagoz (Bogazici Univ., Turkey) and E. Gelenbe (Imperial College, U K ) Vol. 2 Innovative Applications of Information Technology for the Developing World edited by L. M . Patnaik (Indian Inst. of Sci., India), A . K. Talukder (Indian Inst. of Info. Tech. India), D. Bhattarai (Nepal Engg. College, Nepal), H. M . Pradhan (Kathmandu Engg. College, Nepal), S. Jha (Nepal Engg. College, Nepal) and S. Iyengar (Louisiana State Univ., USA)

INNOVATIVE APPLICATION s OF INFORMATION TECHNOLOGY FOR THE DEVELOPING WORLD Proceedings of the 3rd Asian Applied Computing Conference Kathmandu, Nepal

10

~

12 December 2005

Editor-in-Charge

Lalit Mohan Patnaik Indian Institute of Science, India

Editors

Asoke K. Talukder Indian Institute of Information Technology, India

Deepak Bhattarai Nepal Engineering College, Nepal

Hirendra Man Pradhan Kathmandu Engineering College, Nepal

Sudan Jha Nepal Engineering College, Nepal

Sitharama lyengar Louisiana State Universiiy,USA

Imperial College Press

Published by

Imperial College Press 57 Shelton Street Covent Garden London WC2H 9HE Distributed by

World Scientific Publishing Co. Re. Ltd. 5 Toh Tuck Link, Singapore 596224 USA ofice: 27 Warren Street, Suite 401-402, Hackensack, NJ 07601

UKofSice: 57 Shelton Street, Covent Garden, London WC2H 9HE

British Library Cataloguing-in-PublicationData A catalogue record for this book is available from the British Library.

INNOVATIVE APPLICATIONS OF INFORMATION TECHNOLOGY FOR THE DEVELOPING WORLD Proceedings of the 3rd Asian Applied Computing Conference (AACC 2005) Copyright 0 2007 by Imperial College Press All rights reserved. This book, or parts tliereoJ may not be reproduced in any,forni or by any means, electronic or mechanical, including photocopying, recording or any information storage and retrieval system now known or to be invented, without written permission from the Publisher.

For photocopying of material in this volume, please pay a copying fee through the Copyright Clearance Center, Inc., 222 Rosewood Drive, Danvers, MA 01923, USA. In this case permission to photocopy is not required frpm the publisher.

ISBN- 13 978- 1-86094-827-5 ISBN-10 1-86094-827-8

Printed in Singapore by World Scientific Printers (S) Pte Ltd

MESSAGE FROM THE GENERAL CHAIR It is gratifying to note that we have been able to organize for the third time in sucesssion an International Conference related to Information Technology, in this lovely city of Kathmandu. This Conference “Innovative Applications of Information Technology for Developing World” (Asian Applied Computing Conference (AACC-2005) is aimed at discussing Innovative Information Technology solutions, particularly of revelance to the developing world. The first in the series, ITPC 2003 was help in Kathmandu and the next one AACC 2004 was help in the same city. We had a very enthusiastic response for the previous two Conferences. This year too we received over 233 submissions from several countries across the world and about 133 papers have been accepted for presentation. An excellent Conference of this type has derived continuous support from the local organizing committee members; programme Chairs, reviewers, invited speakers, tutorial speakers, and above all a large number of volunteers. My sincere thanks and warm appreciation to each one of them, particularly to Mr. Sudan Jha who has done a wonderful job of coordinating among several of us involved in the various aspects of the Conference and to Prof. Asoke Talukder who has worked hard to put together an interesting technical programme for the Conference. I hope the delegates will enjoy the technical presentations at the Conference and will carry with them pleasant memories of the Conference and the wonderful city of Kathmandu. L M Patnaik

V

This page intentionally left blank

L. M. Patnaik Indian Institute of Science Bangalore, India 1alitcii)micro.iisc.ernet.in

Erol Gelenbe S. S. Iyengar Imperial College Louisiana State University London, UK Los Angeles, USA [email protected] [email protected] Asoke K. Talukder Indian Institute of Information Technology Bangalore, India akt@,iiitb.ac.in, [email protected] Kentaro Toyama Microsoft Research India Bangalore, India [email protected] (Innovative Applications) Tomio Hirata Nagoya University Japan [email protected] (Soft Computing, Systems & Architectures)

Takao Nishizeki Tohoku University Sendai, Japan [email protected] (Algorithm and Computation) U. B. Desai Indian Institute of Information Technology, Bombay, India [email protected] (Community Informatics, Mobile and Ubiquitous Computing)

Jay Bagga Ball State University USA [email protected] (Formal Methods) ~~

Hirendra Man Pradhan Chairman Kathmandu Engineering College [email protected]

Deepak Bhattarai Principal Nepal Engineering College [email protected]

vii

viii

Chand S. Rana Principal Kathmandu Engineering College [email protected] Sudan Jha Assistant Professor Nepal Engineering College [email protected],j [email protected] Ramesh K. Shrestha Assitant Professor Kathmandu Engineering College [email protected] Suprio Dasgupta Honeywell International Inc. Bangalore, India [email protected] Shin-Ya Kobayashi (Asia) Ehime University Japan [email protected] Sumanth Yenduri (USA & Europe) University of Southern Mississippi USA sumanth(ii,bit.csc.lsu.edu

Andrew James Simmonds (Pacific) University of Technology Sydney Australia [email protected]

Jan Kratochvil Charles University Prague, Czech [email protected]

M. H. Kori Lucent Technologies Bangalore India j [email protected]

ix

Andrzej Proskurowski University of Oregon, USA [email protected] Antonio M. Peinado University of Granada Spain [email protected] Takao Asano Chuo University Tokyo, Japan, [email protected] Hans-Gerhard Gross Fraunhofer Institute, Germany [email protected] B.M. Mohan Indian Institute of Technology Kharagpur, India [email protected] Jayasree Dattagupta Indian Statistical Institute India [email protected] J.C. Misra IIT Kharagpur, India j [email protected] Nik Kasabov Auckland University of Technology, Newzealand [email protected] Mike Reed United Nations University PR China [email protected]

Debabrata Das Indian Institute of Information Technology, Bangalore, India [email protected] Priyadarshi Nanda UTS, Australia [email protected] Yiuming Cheung Hong Kong Baptist University Hong Kong [email protected] Jinwen Ma Peking University China j [email protected] Louise Perkins University of Southern Mississippi USA [email protected] Rajgopal Kannan Louisiana State University USA [email protected] Xiao Zhou Tohoku University Sendai, Japan [email protected] Puneet Gupta Infosys Technologies Limited Bangalore, India [email protected] Visakan Kadirkamanathan University of Sheffield UK [email protected]

X

Sanjay Madria University of Missouri-Rolla USA [email protected] Michael Best Media lab, MIT, USA [email protected] Shekhar Borgaonkar HP Labs India, Bangalore [email protected] Mahesan Niranjan University of Sheffield, UK [email protected] Rajesh Veeraraghavan Microsoft Research Lab India [email protected] JosC L. PCrez-Cordoba University of Granada Spain [email protected] Partha Sarathi Roop University of Auckland, UK [email protected] Indrani Medhi Microsoft Research Lab Bangalore, India [email protected]

Peter Thomas Carey Thomas Pty Ltd Australia [email protected] Sushil Meher AIIMS, Delhi, India [email protected] Aynal Haque BUET, Bangladesh [email protected] Paddy Nixon University College Dublin Ireland [email protected] Victoria E. Shnchez University of Granada Spain [email protected] Nimmi Rangaswamy Microsoft Research Lab India [email protected] Hujun Yin UMIST , UK [email protected] Udai Singh Pawar Microsoft Research Lab Bangalore, India [email protected]

CONTENTS Algorithm and Computation Combinatorial Generation of Matroid Representations: Theory and Practice P . Hlineny Detection of Certain False Data Races from Runtime Traces K. Sinha and R. Gupta Representing Series-Parallel Graphs as Intersection Graphs of Line Segments in Three Directions M. Bodirsky, C. Dangelmqr and J. Kara Finding the Convex Hull of a Dense Set P. Valtr Probabilistic Load Balancing with Optimal Resources Set of Nodes in Clusters N. P. Gopalan and K. Nagarajan A New Heuristic Algorithm for Multi-Dimension Multiple-Choice Knapsack Problem in Distributed System Md. Waselul Haque Sadid, M. Nazrul Alam, Md. A1 Mamun, A . H. M. Sarwar Sattar, Mir Md. Jahangir Kabir and Md. Rabiul Islam

32

37

45

50

The Complexity of the Pk Partition Problem and Related Problems in Bipartite Graphs J. Monnot and S. Toulouse

53

Accelerating Boolean SAT Engines Using Hyper-Threading Technology T. Schubert, M. Lewis and Bernd Becker

58

xi

xii

A New Strategy for Solving Multiple-Choice Multiple-Dimension Knapsack Problem in Pram Model Md. Waselul Haque Sadid, Md. Rabid Islam, S. M. Kamrul Hasan and Md. Mostafa Akbar A Constant-Time Selection Algorithm on An Larpbs M. Arock and R. Ponalagusamy Reducing Crossing Number of Multi-Color Rectilinear Steiner Trees Using Monochromatic Partitioning S. Majumber, B. B. Bhattachatya and S. M. A. Jafri Minimizing Capacitated Tree Covers of Graphs Y. Karuno and H. Nagamochi Evaluation of Shortest Path Routing Algorithms in Multi Connected Distributed Loop Networks R. Vasappanavara, M Seetarumnath Real Time Services Over the Internet A . J. Simmonds and G. Mercunkosk

63

68

73

78

83

86

Experimental Studies on Representation Complexity and Error Rates of Iteratively Composed Features K. Haruguchi, H. Nagamochi and T Ibaruki

92

An Application of Spanning Arborescence Algorithm to Scalar Multiplications Adachi and T. Hirata

97

Computer Aided Diagnosis in Digital Mammograms: Hybrid Meta-Heuristic Algorithms for Detection of Microcalcifications K. Thangavel and M. Karnan

104

Efficient Branch-and-Bound Algorithms for Weighted Max-2-SAT Y. Koga, M Yagiuru, M. Nonobe, T. Imamichi, H. Nagamochi and T. Ibaraki

120

Tamper Resisance via Poisson Distribution Seetha Mahalaxmi and P. R. K. Murti

125

xiii

Case: Cellular Automata Based Symmetric Encryption S. Tripathy and S. Nandi

127

Community Informatics

133

THINK! : Towards Handling Intuitive and Nurtured Knowledge V. Ananthukrishnan and R. Tripathi

135

LOKVANI -An e-ffort to empower Citizen A. Kumar, A. Singh, S. Puneet and A. Shukla

144

Content Based Image Retrieval Using Simple Features - A Survey V. Sudhumani, P. Koulgi and C. R. Venugopal

154

Design and Development of a Data Mining System for Superstore Business S. M Shamimul Hasan and I. Haque

155

Trusted Domain Web Servies for Delivering e-Government Services S. Swamynathan, A . Kannan, T. V. Geetha

163

Knowledge Discovery from Web Usage Data: Survey G. T. Ruju, P. S. Sathyanarayana and L. M. Patnaik

179

Innovative Applications for the Developing World

183

Credit Rating Model for Rural Borrowers S. S. Satchidananda and R. Panikar

185

Virtual Digital Libraries: An Advanced Methods and Technologies R. P. Kumar and S. K. Meher

197

Role of Internet-Based GIS in Effective Natural Disaster Management A . S. Bhalchandra, S. S. Rautmare and S. V. Waghmare, S. R. Bhalchandra

199

A Text-Free User Interface for Employment Search Medhi, B. Pitti and K. Toyama

203

xiv

VERIFI - An Integrated System with Voice Chat, E-learning, Remote Control, Internet Access and Fraud Detection for an Intranet Mala, N. S. Dhaneel Kumar and P. Kumar

209

Enterprise Relationship Management for Higher Education “An Approach to Integrated System Management” S. S. Raza Abidi and M. Nadeem

213

Locating Cell Phone Towers in a Rural Enviroment H.A. Eiselt and V. Marianov

218

E-Governance of Land Records G. Kondekar

22 1

Mobile and Ubiquitous Computing

223

MAC Layer Fairness Issues in Multi-Hop Wireless Ad Hoc Networks V. S. Deeshitulu, S. Nandi and A. R. Chowdhury

225

A Framework for Routing in Mobile Ad Hoc Networkds A. Goel and A. K. Sharma

230

Efficient Channel Allocation Based on Priority of the Video in VoD Systems N. Sujatha, K. V. Rajesh, K. Girish, K. R. Venugopal and L. M. Patnaik

233

Real Time Video and Audio Transmition Across the Network M. Manjurul Islam, Mir Md. Jahangir Kabir, Md. Robiur Rahman and B. Ahmed

238

Mobile Payments: Partner or Perish? E. Lawrence, A. Zmijewska and S. Pradhan

240

Combadge: A Voice Messaging Device for the Masses J, L. Frankel and D. Bromberg

248

Mobile Agents Aided Routing in Mobile Ad Hoc Networks H. M. P. Shekhar and K. S. Rumanatha

257

xv

Service Discovery in A Grid or Ubiquitous Computing Environment K. Sinha Genetic Algorithm Based Routing for Multicast Communication in Mobile Networks Mala, R. Shriram, S. Agarwal and S. Selvakumar

263

268

Reception-Aware Power Control in Ad Hoc Mobile Networks N. Shrestha and B. Mans

273

Uniform Mobility Model for Ad Hoc Networks S. K. Bodhe, Amrita A. Agashe and Anil A. Agashe

278

Secure Cluster Head Election Algorithm for Wireless Sensor Networks Chennakesavulu, B. Dey and S. Nandi

287

Maximizing Life-Time of Wireless Sensor Networks B. K. Meena and V. Vallieswaran

292

An Efficient Mobile Controlled and Network Assisted Handoff Proposal for IEEE 802.11 Networks Thirumala R e d 4 and S. Nandi

297

Effect of Network Policies on Internet Traffic Engineering R. Nanda and A . J. Simmonds

300

Modified Cluster Based Broadcasting in Ad Hoc Wireless Network A. Puranik and S. K. Pandey

308

Natural Language Processing

319

An Implementation Level Formal Model for Javabeans B. P. Upadhycya and B. Keshari

32 1

Handling Honorification in Dobhase: Online English-To-Nepali Machine Translation System B. Keshari, J. Bhatta and S. K. Bista Speech Synthesis Through Waveform Concatenation for Sanskrit E. Waghmare and A. S. Bhalchandra

330

340

xvi

Soft Computing Performance Measurement of Real Time Object Detection and Recognition Mir Md. Jahangir Kabir, S. Halder, Md. Robiur Rahman, Md. W. H. Sadid, M M. Manjurul Islam and Md. Nazrul Islam Mondal

347

349

A Symmetric Encryption Technique through Recursive Modulo-2 Operation of Paired Bits of Streams (RMOPB) P. K. Jha and J. K. Mandal

353

Mathematical Model and Analysis of the Simplest Fuzzy Two-Term (PIED) Controller B. M. Mohan and A. Sinha

3 60

A Framework for Topic Categorization of XML Documents Using Support Vector Machines K. G. Srinivaza, S. Sharath, K. R. Venugopal and L. M. Patnaik

367

Analysis of Medical Image Compression Using Statistical Coding Methods R. Ponalagusamy and C. Saravanan

372

Fuzzy Based Selection of Paths in a Decision Tree with Imprecision and Partial Specification R. Vasappanavara, D. Ravi, V. Gustam and V. Anand

377

A Graph Algorithm for Finding the Minimum Test Set of a BDD-Based Circuit G. Paul, B. B. Bhattacharya, A. Pal and A. Das

3 82

Particle Swarm Optimization Based PI Controller Tuning for Fermentation Processing K. Valarmathi, D. Devaraj and T. K. Radhakrishnan

3 87

Software Reliability Growth Modeling for Exponentiated Weibull Function with Actual Software Failures Data U. Bokhari and N. Ahmad

390

xvii

Speech Recognition Recognition of Facial Pattern by Modified Kohonen's Self Organizing Map (MKSOM) and Analyze the Performance S. M. Kamrul Hasan, Md. Nazrul Islam Mondal, N. Tarannum, Md. Tanvir Ahmed Chowdhury

397

399

Others

407

Intrusion Detection System (IDS) Using Network Processor P. G. Shete and R. A. Patil

409

Development of Remotely Sensed Spatial Data Analysis and Management Software System (RSSDAMSS) N. Badal, G. K. Sharma, K. Kant Detection and Correction of Vertical Skew in Characters T. Vasudev, G. Hemanthkumar and P. Nagabhushan

412

423

Evolutionary Metaheuristic Approaches for the Vehicle Routing Problem with Time Windows S. Tripathi and B. Minocha

43 5

Transparent Remote Execution of Processes in Cluster of Heterogeneous UNIX Systems Aryal and A. Khadka

446

Distributed Dynamic Scheduling - An Experiment with PVM S. Ramachandram and J. Aurangabadkar

458

This page intentionally left blank

Algorithm and Computation

This page intentionally left blank

COMBINATORIAL GENERATION OF MATROID REPRESENTATIONS: THEORY AND PRACTICE

PETR HLINEN+* Faculty of Informatics, Masaryk University in Brno, Botanickd 68a, 60200 Brno, Czech Rep. E-mail: [email protected]. cz

Matroids (also called combinatorial geometries) present a strong combinatorial generalization of graphs and matrices. Unlike isomorph-free generation of graphs, which has been extensively studied both from theoretical and practical points of view, not much research has been done so far about matroid generation. Perhaps the main problem with matroid generation lies in a very complex internal structure of a matroid. That is why we focus on generation of suitable matroid representations, and we outline a way how to exhaustively generate matroid representations over finite fields in reasonable computing time. In particular, we extend here some enumeration results on binary (over the binary field) combinatorial geometries by Kingan et al. We use the matroid generation algorithm of [P. HlinBnjr, EquivalenceFree Exhaustive Generation of Matroid Representations] and its implementation in MACEK;see http: //w.cs vsb. cz/hlineny/MACEK.

.

1. Introduction Matroids, introduced by Whitney in 1935, present a common generalization of the concept of independency in graphs and matrices. We follow Oxley lo in our matroid terminology, and we refer the reader to the full version of this paper for explanations. In graph theory, one often uses pictures to visualize particular graphs. On contrary, such visualization is very difficult in matroid theory - it is almost impossible to give a “nice drawing” of a general matroid in rank higher than 3. That fact makes matroid research more difficult, and brings a strong need for an automated matroid generation routine. (It is often such that proving a theorem in structural matroid theory requires one to check many small cases by hand. As matroid researchers know themselves, ‘Supported by the Institute of Theoretical Computer Science ITI, project no. 1M0545.

4

checking the “small cases” can be quite long and painful, and prone to errors.) However, great complexity and enormous numbers of (even small) matroids makes the task much harder than generation of, say, graphs. A promising tractable approach is to generate matroid representations as matrices. Matroids represented by matrices over finite fields play an important role in structural matroid theory, similar to the role that graphs embedded on a surface play in structural graph theory. Hence exhaustive generation routines for matroid representations over finite fields could have important applications in matroid research. However, the history of matroid enumeration is rather short. We refer to g for a nice overview and bibliography. In particular, we mention here the old work of Acketa on enumeration and construction of small binary matroids, which is closely related to further mentioned work and to our contribution in Section 3. Perhaps the main problem with matroid generation lies in a very complex internal structure of a matroid - a single matroid on n elements carries an amount of information exponential in n. That is why we are looking for generation of suitable matroid representations which would be easier to handle. Incidently, such are matrix representations of matroids over finite fields, which happen to be interesting and useful in matroid structure theory. We refer the works of Dharmatilake and Kingan as two examples of tasks of matroid-representations generation for the purpose of obtaining theoretical structural results. Dharmatilake used computer generation of binary matroid representations in order to find the binary excluded minors for matroids of branchwidth three. He, in fact, found all 10 (3 new) of them, but his search was not finished. We have completed the search exhaustively using our (faster) generation routine; and moreover, we have found all the ternary excluded minors for branch-width three and generated many more such excluded minors over larger fields. Among some recent works we cite Kingan et al. 8 , and (unpublished) Pendavingh l l . Kingan et al. present a generation algorithm for matroid representations, which we compare with our generation routine in Section 2, and we significantly extend their enumeration results in Section 3. On the other hand, Pendavingh systematically generates matroids in general (as set systems), which is computationally very hard. The point of our interest in his work is that some nontrivial outcomes of his search (e.g. the small excluded minors for representability over GF(5) and G F ( 7 ) ) agree with results we can obtain with our tools.

5

2. Matroid Extension Generation Algorithm

A simple approach to combinatorial generation of certain objects is the following: Exhaustively construct all possible “presentations” of the objects of given size (called also labeled objects or labeled representations), and then select one representative of each isomorphism class by means of an isomorphism tester. Since there are typically many presentations for each one resulting object, and the procedure requires to isomorph-test each pair of the constructed presentations, that approach quickly becomes infeasible. A better solution is provided by the technique of a “canonical construction path” which has been explicitly formulated by McKay ’, and used (explicitly or implicitly) in many combinatorial searches. The idea of canonical construction path is briefly described here: Select a small base object. Then, out of all ways how to construct our big object by single-element steps from the base object (construction paths), define the lexicographically smallest one (the canonical construction path). Adapt the construction process so that all non-canonical extensions at each step are thrown away immediately (assuming that the canonical lexicographic order is hereditary). In this way every object is generated only once, and no explicit isomorphism tests are necessary. (Though the isomorphism problem may be implicitly contained in the definition of the canonical order.) It appears that the canonical construction path technique has not been fully implemented before in matroid generation. The works mentioned in Section 1 usually use variations of the above simple generation method. Although Kingan et al. claim to use the canonical construction path method, they implement it only partially - there is no definition of a canonical order, but an isomorphism tester is used to select the representatives of one-element matroid extensions. The main theoretical contribution of our work is in a complete adaptation of the canonical construction path technique for exhaustive generation of matroid representations over finite fields. (We can, moreover, generate matroid representations over so called l 2 “partial fields” .) The adaptation is not at all obvious due to specific difficulties with matroid representations. Our base object is a common minor of all generated matroid representations, and one-element extensions and coextensions play the role of single-element steps. Altogether, the generation steps form what we call an elimination sequence of a generated matroid representation.

6

3. Practical Computations We have used an implementation of our algorithm in MACEK for carrying out several successful matroid generation projects, e.g. 4,5. (MACEKimplements also numerous other matroid structural computations, like minor, connectivity, or isomorphism tests, generation of representations, etc.) Our results concerning enumeration of binary combinatorial geometries (i.e. of simple binary matroids) are summarized in Table 1. We have independently verified all the enumeration results of Kingan et al. s on up to 11 elements, and moreover we have added significant new results for matroids on 12 and 13 elements. For instance, the number of 8-element rank-5 simple binary matroids can be obtained by running MACEKwith ‘GF(2)’ ‘Qext-simple;!extendsize 5 3’ ‘1;l’. MACEK 1.2.09 (26/08/05) starting.. .

......

-652- In total 15 (co-)extensions of 1 matrix-sequence generated over GF(2).

(One needs the latest version 2 1.2.09 of MACEKto run this.) Table 1 An enumeration of small simple binary matroids (* new entries).

For interested readers, we add a short summary of real running times of our enumerations (after some tuning-up). All the results on at most 11 elements are quite easy, they take just seconds or at most minutes to finish. For 12 elements, the computation took approximately 3 hours (rank 6), 13 hours (rank 7), and 16 hours (rank 9); for 13 elements it took 11 hours (rank 6), 110 hours (rank 7), 260 hours (rank 8), 300 hours (rank 9), and

7

140 hours (rank 10). (Computing times are normalized to lGHz CPU.) 4. Conclusions We have presented a matroid extension generation scheme which fully implements the idea of a generation via a canonical construction path by McKay. Compared with other matroid generation programs, the implementation of our algorithm appears to be remarkably faster and more powerful in generation of matroids representable over finite fields. We have already used it to prove some results of theoretical interest, and significantly extended known enumeration results of small matroids. In current work we are extending the enumeration in Table 1to ternary and other representable matroids. (For instance, we have so far verified the ternary results of 8 . )

Acknowledgments The large-scale computations summarized in Table 1 have been run on the minos computing cluster at West Bohemia University (IT1 center).

References 1. D.M. Acketa, A construction of non-simple matroids o n at most 8 elements, J. Combin. Inform. System Sci. 9, 121-132. 2. J.S. Dharmatilake, Binary Matroids of Branch- W i d t h 3, PhD. dissertation, Ohio State University, 1994. 3. P. HlinBnf, T h e MACEKProgram, http://www. cs .vsb.cz/hlineny/MACEK, 2002-2005, version 1.2.09. (With an online trial interface.) 4. P. HlinBnf, O n the Excluded Minors for Matroids of Branch- W i d t h Three, Electronic Journal of Combinatorics 9 (2002), #R32. 5. P. HlinBni, Using a Computer in Matroid Theory Research, Acta Math. Univ. M. Belii 11 (2004), 27-44. 6. P. HlinBnf, Equivalence-Fk-ee Exhaustive Generation of Matroid Representations, submitted. 7. S.R. Kingan, Matroid Structure, Ph.D. dissertation, 1994, Louisiana State University. 8. R.J. Kingan, S.R. Kingan, W. Myrvold, O n matroid generation, Proceedings of the 34th Southeastern Int. Conf. on Combinatorics, Graph Theory, and Computing, Congressus Numerantium 164 (2003), 95-109. 9. B.D. McKay, Isomorph-free exhaustive generation, J. Algorithms 26 (1998), 306-324. 10. J.G. Oxley, Matroid Theory, Oxford University Press 1992. 11. R. Pendavingh, personal communication, 2004. 12. C. Semple, G.P. Whittle, Partial Fields and Matroid Representation, Advances in Appl. Math. 17 (1996), 184-208.

DETECTION OF CERTAIN FALSE DATA RACES FROM RUNTIME TRACES* KOUSHIK SINHA

Honeywell Technology Solutions Lab Bangalore, India 560076 Email: [email protected] RAJIV GUPTA

Department of Computer Science University of Arizona, Tucson, AZ 85719 Email: gupta @cs.arizona.edu

We present a family of algorithms for the on-the-fly detection of certain false data race conditions from runtime generated trace in a multithreaded environment. Our algorithms do not require instrumentation of the source code for catching data race conditions and can be implemented as low cost hardware solution. By filtering out false and benign race conditions from the reported races, the user is presented with a smaller set of potentially harmful data race conditions, hence speeding up the debugging process, especially for large multithreaded programs. The proposed algorithms are characterized by low overhead in terms of hardware support and effect on program execution time.

1. Introduction A data race occurs when two or more threads concurrently access a shared location without synchronization, with at least one. of the accesses being a write access. Their occurrences are often considered as manifestation of erroneous programming since their presence might lead to unpredictable results and incorrect program execution. Two fundamentally different types of races can occur, that can capture the essence of different types of bugs in different classes of parallel programs. General races cause nondeterministic execution and are failures in programs intended to be deterministic. Data races cause non-atomic execution of critical sections and are failures in nondeterministic programs that access and modify shared data in critical section. [l]attempts to study and characterize the *Address for Correspondence: Koushik Sinha, Honeywell Technology Solutions Lab, India 560076. Email: [email protected]

8

9

differences between these two types of races using a formal model. Bernstein’s conditions states that atomic execution is guaranteed if shared variables that are read and modified by the critical section are not modified by any other concurrently executing section of the code. A violation of these conditions has typically been called a data race or access-anomaly or simply a race [ 11. Data race detection involves one of the two main approaches - either static or dynamic detection. Some methods also use a combination of the two approaches. The static methods perform a compile-time analysis of the source code.The advantage is that they can perform a global check on the program, being able to detect any data race that might potentially occur in the program [7,3]. Their major drawback is their conservativeness as they can neither know nor understand program semantics, resulting in large number of false alarms. The dynamic race detection methods, which are further subdivided into postmortem and on-the-Jlytechniques use some tracing mechanism to detect whether a particular execution of a program exhibits data races. The postmortem techniques [5,3] collect information about the order of synchronization and computation events during the execution and create a trace of the run. On termination of execution, this trace is analyzed to detect potential data races. The on-the-fly methods buffer partial trace information in memory, analyze it, and detect data races as they occur [4,2,10,8]. Dynamic methods have the advantage over static methods in that they detect races that only occur during the actual executions. Their main drawback lies in the fact that they check programs locally by considering only specific program execution paths each time. If the program takes a different execution path, different data races may pop up. Hence in order to detect all data races, all possible execution paths need to be checked, which in general is not practical. Secondly, existing dynamic techniques in the literature, especially on-the-fly techniques impose high slowdowns on the program as they typically have high runtime overhead. Finally, the issue of detection granularity (i.e., the size of the memory block or unit on which detection is actually performed) is also of importance. A small detection granularity incurs higher overheads and may result in potentially missing some of the data races, while a large one can lead to false race detections. Data races can be distinguished into two types : race conditions that are used to make a program intentionally nondeterministic and races that are caused by programming errors. The former class is generally referred to as synchronization races that allow for competition between threads to enter a critical section. The distinction between the two is actually a matter of abstraction. At the implementation level of the synchronization operations, a synchronization race is caused by a genuine data race on a synchronization variable (e.g., spin locks, polling etc.). Hence, it makes it difficult to distinguish between synchronization and data

10

races at the machine instruction level, particularly in the case of dynamic detection methods. RecPlay [3]attempts to identify synchronization operations by using the synchronization primitives code located in dynamically loadable synchronization libraries. In their method, by statically linking the implementation of the newly defined synchronization operations, and by dynamically linking the new code of the low-level synchronization operations, the former will be traced, while the latter are not. Races on variables that are used for the purpose of implicit or explicit synchronization and races caused by silent stores [9] on variables unprotected by locks do not change the program behavior in any harmful way. Such races are hence benign races and can be ignored during debugging. Our work focusses on two problems : detection of false data races [l]from runtime traces and secondly, distinction between data races that may lead to anomalies in program behavior and those which do not result in incorrect program execution. For the latter, we have taken some specific instances of data races and then shown why and how they can be ignored. For example, if two threads attempting to write the same value to a shared memory location (better known as a silent store) without any synchronization, then such a situation may safely be ignored as a data race, without introducing any error in the execution of the program. The elimination of the above two categories from data race conditions will result in significantly fewer races being reported, thus aiding the developer to concentrate on the truly harmful race conditions. We have developed algorithms with low overhead on run time as well as on memory space, for on-the-fly detection of such false data races in concurrent program execution by multiple threads using only runtime generated trace as the input. 2. System Model

The algorithms proposed in this paper assume a multithreaded environment, where, in addition to access to local memory, threads can also read and write from shared locations of the process within which they reside. A shared location is assumed to any memory allocated from the heap as well as any global and static objects. This model similar to that used in [2]. In an sequentially consistent (SC) memory model, all instructions from all threads appear to form a total order which is consistent with the program order in each thread [5].Thisimplies that a read or load on any shared memory location will always return the most recently written value to that address. The drawback with sequential consistency is that it is very restrictive. In our work, we have assumed a weakly ordered memory model where the only requirement is that the program should appear sequentially consistent in the absence of any data races.

11

In order to facilitate the detection of some false data races under a load-store architecture, we generate a trace record whenever any one of the following two events occur : 0 0

A LOAD of a shared memory location A STORE at a shared memory location

The event trace record corresponding to each of the above events consists of the following fields : 0

0

For a LOAD instruction, a record of the form ( proc-id, instr, s, current-Val) is generated. For a STORE instruction, we generate a record of the store event containing the following fields : ( proc-id, instr, s, current-wal, newwal). There are however, two special cases :

1) If the instruction was an SC and it failed, then we set instr to SCF in the event record. 2 ) Otherwise if the SC instruction succeeded, then we set instr to SCS in the event record. The purpose of individual fields of the event records are as follows, 1) proc-id : The id of the thread currently trying to access the shared memory location. 2) instr : The instruction currently being executed, i.e. either a LOAD or a STORE instruction. 3) s : The address of the shared location being accessed. 4) current-Val : The value currently stored at location s. 5 ) newval : The value that the current STORE instr is trying to write to location s. A. Trace File Implementation Details

Our trace consists of event records generated only for load or store instructions that access shared memory locations. Since we do not have the luxury of instrumenting the source code to identify local memory addresses from shared ones, we use the following simple method for determining if an accessed memory location is shared or not : For every processor, two bound registers are maintained, that keeps track of the currently executing thread’s stack address bounds. Since the local variables to a thread would reside on the thread’s stack, whenever there is a LOAD/STORE instruction, we check the accessed memory address against that of the bound registers. If it is out of the bounds, then we assume that the memory location is not

12

part of the thread’s stack and hence is a shared memory location. Consequently, we generate an event record for the corresponding instruction. The use of bound registers impose only minimal additional hardware overhead, while ensuring that any allocation from the heap or any memory access outside the local stack is treated as a shared variable. A second implementation detail concerning the trace is whether we maintain a separate trace file for each processor in the processor’s local cache memory or whether we keep one unified trace file. For the purpose of our algorithm we have assumed that there exists a single trace file, accessible to all the processors for recording trace events. We assume that if two or more processors try to write to the trace file simultaneously, then the order in which they get the write accesses would be determined by the Bus Arbiter. The size of the trace file is assumed to be finite and when the maximum number of records are written to it, we resume from the beginning of the file, overwriting the old trace events.

Example 1. Consider the following C code and its equivalent MIPS assembly code. I* shared variables : count, total, release *I I* Memory address locations of shared variables : *I I* count = 1000 *I I* release = 1004 *I I* total = 1012 *I I* Assume the value of total := 4 and release := 0 initially *I

if (count == total) count = 0; release = 1; I* reset release *I I* more to come - so wait *I else spin until (release == 1); I* busy waiting *I endif 1 2 3 4 5

6 I 8 9 spin : 10

LD LD BNE SD LD DADDUI SD

r2, O(r3) r5, O(r6) r2, 15, spin

; load count ;load total

fl, r2, O(r4) r2,rO, #I r2,0(r4)

; count 0 ; load release ;release t 1 ; store updated release

LD BEQZ

r2,0(r4) r2, spin

; load release ; spin as long as release = 0

-

;test (count !=total)

Suppose the above code is executed simultaneously by two threads TI and T2. Figure 1 shows a possible sequence and ordering of instructions executed by threads TI and Tz when both execute the above code simultaneously.

13 Thread 1

Thread 2 LD 12. O(13)

LD r2, o(r3) LD r5, o(r6) BNE 12.15. spin

SD 10,0(13)

LD 15,0(I6) BNE 12, r5, spin

LD 12. O(14) BEQZ 12, spin

LD 12, o(r4) LD 12, o(r4) BEQZ r2, spin DADDUI 12.10.#1 SD r2,0(14)

LD T2,0(14) BEQZ 12, spin

Fig. 1. A possible sequence and ordering of instructions executed by two threads

For the above sequence and ordering of executed instructions in Figure 1, a sample generated trace file may contain the following event records in the order shown below :

1. (Tz,L D , 1000,3) 2. ( T I L , D , 1000,4) 3 . (Ti,L D , 1012,4) 4. (Tz,L D , 1012,4) 5. (T2,L D , 1004,O) 6. (Ti,S D , 1000,4,0) 7. (Ti,L D , 1004,O) 8. (T2,L D , 1004,O) 9. (Ti,S D , 1004,0,1) 10. (T2,L D , 1004,l) Whenever a thread i executing on some processorp writes a trace event record, the processor p starts checking for data race conditions using the trace file available to that point of time and the algorithms given in this paper. 3. Preliminaries

The most widely accepted definition of data races is based on the happens-before relation first suggested by Lamport 111 for message passing systems. This definition also allows recognition of false data races and can be found in [2]. hb

Definition 1. The happens-before partial order, denoted by +, is defined for all computational events (reads, writes) and synchronization events (releases, ac-

14 hb

quires) that happen in a specific execution.We say that a ----f p if : (1) a and p are any two events performed by the same thread, with a preceding p in the program order, or ( 2 ) a is a release and p is its corresponding acquire, both operating on hb hb the same synchronization object S, or (3) a + y and y 4p.

Definition 2. We say that two events a and p are synchronized if a % p or hb p+ a. We say that a and p are concurrent, if they are not synchronized. Definition 3. An update to a shared memory location s is called an idempotent update if, 1) Either all threads write a single common value to that location, or 2 ) If the last access, elast on s was a read operation and if the value being attempted to be written by the current thread is identical to the value stored in location s by the last write operation preceding elast.

The fundamental definition of a data race condition is based on the violation of Bernstein’s conditions, i.e., a race condition is defined as two unsynchronized accesses to some common shared memory location, where at least one of the accesses modifies it [3]. However, this definition in our opinion fails to recognize some simple cases which even though violate Bernstein’s conditions when looked at locally, do not introduce any ambiguity in the outcome of a program if viewed in a bigger perspective of the program. In order to clarify the above statement, we present below three categories of simultaneous shared memory accesses by multiple threads, which though violate Bernstein’s conditions, are not really data races. 1) Idempotent Updates : If a set of the concurrent accesses constitute an idempotent update, then the order in which those accesses are made is irrelevant since the final outcome remains unchanged. 2 ) Synchronization Primitives : Most architectures provide some primitive instructions to ensure atomic update of memory locations. For example, a successful execution of an LL instruction followed by an SC instruction on a shared memory location for MIPS machines, can be treated as an atomic update primitive. It is quite straightforwardto detect such instruction(s) and data races on such shared locations and those occurring within the set of instructions protected by them. Such races can be disregarded as false races. Moreover, certain shared memory accesses, while not being idempotent updates, are introduced deliberately as a method of synchronizing accesses between threads - an example being the barrier synchronization primitive. Such synchronization primitives may contain data races internal to them. However,

15 from the programmer’s perspective, such races are used to make a program intentionally nondeterministic and hence should not be reported as data races. Later on in this paper we show how barriers can be identified from runtime trace and hence data races that may exist within a barrier operation can be eliminated from the set of feasible races reported. 3) Implicit Synchronization Operations : It is possible to create a synchronized access to a shared memory location by contending threads without the use of explicit synchronization primitives such as barriers. In such cases, the program semantics impose an order on the execution of the threads. We present in this paper an algorithm to detect such implicit synchronizations.

Lemma 1. The category of synchronization primitives is really a special case of idempotent updates. Proof : If multiple threads are competing to enter a critical section through the use of LL-SC instruction pair, then we observe that each of the threads update the shared location used for locking with the value 0, to relinquish its hold on the lock. Similarly, the barrier primitive too can be considered as special case of idempotent updates since when all the threads arrive at the barrier, the barrier (i.e. the shared variables constituting it) is reinitialized. The problem with data race detection from a runtime trace, generated in a multiprocessor environment, is that we usually do not have sufficient trace history information, being constrained by system memory, to determine whether a seemingly data race condition is a true data race or whether it falls under one of the above three categories and hence may safely be ignored. By increasing the trace information stored, we can filter out more benign data races. Theoretically, it would be possible to catch all true data races and reject the benign ones if we had a trace file of infinite size. However, in practice, the trace file size would generally be small, and hence it is not possible to filter out all of the benign data races arising from the three cases stated above. An example would further elucidate the point. Example 2. Consider a simple implementation of the barrier synchronization primitive. The C code and its equivalent MIPS assembly language codes are shown below. /* Barrier Code Implementation Version 1 */ lock(counter1ock); /* entering CS */ if (count == 0 ) release = 0; /* reset release first */ count = count 1; /* count arrivals at barrier */ unlock(counter1ock); /* release lock */ if (count == total) /* all have arrived */

+

16 count = 0; release = 1;

I* release processes */ I* more to come - so wait *I

else

spin until (release == 1); endif;

Suppose the above code is executed simultaneously by two threads executing on two different processors and we get the following trace file from a runtime instance as shown in Figure 2. For the purpose of clarity, we have only shown the load and store instructions in Figure 2.

LL r2. o(r1)

LL r2, o(r1)

S C r2, o(r1) LD r2, o(r3)

LL r2, o(r1)

SD r0, o(r4)

LL r2, o(r1)

SD r2, o(r3) SD rO, o(r1) LL r2, o(r1)

SC r2, o(r1) LD r2, o(r3)

SD r2, o(r3) SD rO, o(r1)

Fig. 2. An instance of simultaneous execution of barrier code by two threads

1. 2. 3. 4. 5. 6. 7. 8. 9. 10. 11. 12. 13. 14. 15.

(Ti, L L , lOO0,O) (Tz,L L , 1000,0) (Tz,SCF, 1000,0,1) (Ti,scs,1000,0,1) (Ti, L D , 1004,O) (Tz,L L , 1000,l) (T2,SCF, l O O O , O , 1) (T2,L L , 1000,l) (Tz,SCF, 1000,0,1) (Ti, S D , 1008,1,0) (Ti,S D , 1004,0,1) (Ti,S D , 1000, 1 , O ) (Tz,L L , 1000,O) (T2,SCS, lOOO,O, 1) (T2,L D , 1004,l)

17

16. (T2,S D , 1004,1,2) 17. (T2,S D , l O O O , l , 0) If our trace file had a size of 4 trace events, then storing of the updated value of count by thread 7'1 in line 8 and the load by T2 in line 12 would trigger the detection of a data race. However, if our trace file could store say 20 trace events or more, then we could infer from the trace file that the update of the shared variable count occurs only within a critical section, from the presence of LL and SC instruction pairs for both the threads and hence there are no data races involved in the simultaneous execution of the code. 1 lockit : 2 3 4

5 6 7 ; enter critical section 8 9 10

11 12 13L1 : 14 15 16 ; unlock 17 18 19 ; remainder of barrier synchronization code 20 21 22 23 24 25 26 27 28 29 spin :

30

LL BNEZ DADDUI SC BEQZ

r2,O(rl) r2, lockit r2,rO. #I r2,O(rl) r2, lockit

LD BNEZ SD

r2,0(r3) r2,LI rO,O(r4)

load counterlock spin if lock held by another processor locked value store conditional locked value if store failed. not atomic

-

; load count

;release

0

+1

DADDUI r2, r2, #I SD r2, O(r3)

; count + count ; store updated count

SD

rO,O(rl)

; counterlock + 0

LD LD BNE SD LD DADDUI SD

r2,O(r3) rS,O(r6) r2, r5, spin rO,O(r3) r2,0(r4) r2.10, #I r2, O(r4)

; load count ; load total

;test (count !=total) ;count t 0 ; load release ; release t 1 ; store updated release

LD BEQZ

r2, O(r4) 12, spin

; load release ; spin as long as release = 0

The classical definition of race is in a sense too conservative and fails to address the situation where race detection needs to be done from only the dynamically generated runtime trace. We first present a classical definition of race conditions [2] and then show how it needs to be modified to work in a situation where all race condition detections need to be done only from runtime traces. Definition 4. We say that there exists a data race between two accesses to the same shared location s, Q and /3, if,

1) a and p are executed by distinct threads 2) Q and ,Ll are not synchronized, i e . , they are concurrent events 3) At least one of a or p is a write access.

18

Before presenting our modified definition of a race condition, we describe some useful notations :

7 = Trace file to which events corresponding to every read or write access to some shared location is recorded. .(a) = record number of trace event a in trace file 7,l5 T ( U ) 5 A. ~ ( a<)~ ( p=)record number of event p is less than record number of event a in 7. A = maximum number of records that trace file 7 can store. /.(a) - T ( ~ ) I = number of records between two events a and /3 in trace file 7, both inclusive. S = set of the following three possible relations between two concurrent events a and ,8 accessing the same shared location, where at least one of them is write operation : 1) Events a and ,B constitute an idempotent update. We call such relations as

2. 2) Events a and p represent attempt to acquire a lock on a synchronization primitive to enter a critical section (e.g. a barrier).We denote such relations by E. 3) Events a and p are totally ordered in time through some implicit synchronization operation such as message passing. Such relations will be denoted by M .

@ ( ap) , = relation between two concurrent events a and p accessing the same share location, with at least one of them for writing. Observation 1. If a and ,b’ are two concurrent accesses to the same shared location, then either .(a) < ~ ( por) ~ ( p<) .(a). We now present a modified form of the classical definition of race (definition 4) for on-the-fly race detection of potentially harmful data races in a multithreaded environment.

Definition 5. We say that a data race may exist between two accesses to the same shared location, a and ,8, if, 1) 2) 3) 4)

a and

are executed by distinct threads they are not synchronized, i.e., they are concurrent at least one of a or p is a write access .(a), ~ ( p >) 0 and .(a), and \.(a) - @)I I A,

).(.

<

m.

T(P) F A, where

19

Lemma 2. For on-the-jly detection of race conditions using only the trace file I of size A, a race condition can be detected ifi I T ((. ) - ~ ( p ) 5l A, even if there exits a true data race between events a and p, where a and p are two concurrent accesses to the same shared location with at least one of them being a write operation. Lemma 3. Let R’= set of races detected from the simultaneous execution of some program P by several threads on a multiprocessor system, using a trace31e I of size A. Let R = set of all data races contained in P that may occur if P is executed simultaneously by multiple threads. Then for any given size A of the trace$file 7 ,R’ R.

c

Corollary 1. As A

-+

Corollary 2. I f A

= 0 or

00,

R’4 R. A = 1, R’= 0.

Lemma 4. In order to detect data racesfrom a runtime trace recorded in a trace file I of size A, we must have A 2 2. Definition 6. We say that a true data race exists between two accesses to the same shared location, a and /3, if, 1) 2) 3) 4)

a and /3 are executed by distinct threads they are not synchronized, i.e., they are concurrent at least one of a or /3 is a write access and I T ((. ) - T(/~)I 5 A, T ( Q ) , T ( / ~ )> 0 and T(a)

< m.

5) @ ( a0) , k

5 A, where

T(cY),T(/~)

s.

4. Elimination of Some Benign Data Races Based on our above modified definition of a race condition, we present in this paper several techniques to detect benign data races from a dynamic trace and hence reduce the number of reported races. Whenever a shared memory location is accessed for load or store by some thread i executing on processor p , a trace event record for the corresponding instruction is written to the trace file and the algorithms given in this paper are executed one after another by processor p until either 1) we detect that the event does not generate a data race, or 2) all the algorithms fail to eliminate the race condition and hence we conclude that the given event may cause a data race. Our first example demonstrates how some of the supposed races in a program that involve idempotent updates and synchronization primitives can be detected as not being harmful data races.

20 Timad 1

Thread 2

Thread 1

Thd2

Thread 1

Fig.3. All read-write combinations on count are idempotent updates Thread I

Thread 2

Thread 1

Fig.4. Possible read-write dependencies on release

If the code for barrier in example 2 were to be executed simultaneously by two or more threads, then we would have observed the following read-write dependencies from the execution trace : A data race on the update of the variable counterlock in lines 1 to 5. The variable count may be updated by more than one thread in line 14. Possible reset of release to zero by more than one thread in line 11. Two or more threads may simultaneously read and/or update count on lines 21 and 24. 5) Possible simultaneous read-write access to release from lines 25 to 29.

1) 2) 3) 4)

An analysis of the execution traces however reveals that it is possible to eliminate some of the above data races. For example, the race on the variable countelock is in reality a synchronization race. In general, it is easy to distinguish synchronization races from actual data races as synchronization races generally involve the use of some well-defined synchronization primitives. In this case for example, we can determine that there is a synchronization race on countelock from the presence of LL and SC instruction pair. The LL and SC instruction pair is a standard way to enforce an atomic update of a memory location in MIPS. In fact, the MIPS code of example 2 represent a spin lock. Now considering the increment of count in line 14 and that of release in line 11, we note that their updates must be preceded by a successful acquisition of a lock on the variable counterlock. In this case, we can infer whether a lock has

21

been acquired or not, if and only if the access to count or release is preceded by the block of instructions from lines 1 to 5 in any execution trace. Hence, even though there may be a read-write dependency on the access to these two variables, we can safely conclude that there can be no data races on them because their updates are protected within a critical section. However, considering the access to count and release in between lines 21 to 29, there are no synchronized accesses to these memory locations and these accesses can be regarded as true data races. Because their accesses are not synchronized, three possible scenarios may arise where read-write dependencies may arise on these variables, as depicted in figures 3 and 4. Considering Figure 3(a), we note that if any thread reads count before a thread executes a write operation on it, it may get any value in the range 1 to total, and there is a dependency arc from the read operation to the write. Consequently at this stage, without more knowledge about the semantics of the code, such a dependency will be reported as a data race. But after one or more threads succeed in executing a write on count, regardless of the order of the writes, all write operations update count with the value zero. Similarly in the case of release, multiple execution of updates on release, will always result in writing the same value to the memory location, in this case which is a 1. Thus, for such idempotent updates, even though there are no explicit synchronization between the accesses to the memory locations, we can say that there does not exist any data race on such variables as the order of writes clearly does not introduce any ambiguity on the value stored in such memory locations. Hence from the above discussion we can conclude that even though an execution trace of the barrier code in example 2 might contain five races, an analysis of the traces reveal that some of those data races may be removed by simply considering locking instructions like LL-SC pairs and idempotent updates. We now present two algorithms to detect data races in idempotent updates and synchronization primitives.

A. Races involving Idempotent Updates Our first algorithm detects idempotent updates and removes such concurrent accesses from the list of reported data races. From the definition of idempotent updates in section 2, we can arrive at the following lemma : For each shared memory location s, we require the following two information (Figure 5) derived from the runtime trace file, besides the current trace event :

0

lustop : The nature of the last access - whether it was a write or a read operation. lastvul : The value stored in memory location s prior to access of s by the instruction s.lastop.

22

Fig. 5 . Additional attributes required for detecting idempotent updates

O u r algorithm to detect idempotent updates is presented below. The instruction types LOAD and STORE refer to any generic instruction that accesses a memory location for read or write operation respectively.

Input : Trace file 7 Algorithm detectidempotent-updates : the last event record (Y written to the trace file; 7101 S : the shared memory location being currently accessed; instr : current instruction to be executed; current-Val : data value currently stored in memory location s; newval : data value that the current instruction is trying to write at s; begin I* Let i denote the id of the thread trying to access s *I if (memory accessed == shared) then write to trace file I either event record (i,instr, s, current-Val) or (i,instr, s, current-wal, newval) at location I [ a ] depending on whether the current instr type = LOAD or STORE; if (I[a].instr == LOAD) or (I[a].instr == STORE) then Search 7 backwards from current event a for recent most access to location I [ a ] . s . Call that trace event p. let lastop = I[P].instr; let lastVal= I[a].current-val; if (lastop = I[a].instr == LOAD) then; endif;

I* nothing to do */

I* The next two tests define an idempotent update *I if (lastop = 7[a].instr == STORE) and (I[a].current-val == I[cu].newval)then I* first condition of idempotent update *I I* not a race condition *I

endif; if (lastop == LOAD) and (I[a].instr == STORE) and (lastVal== I[a].newval) then I* second condition of idempotent update *I

23 I* not a race condition *I

endif if (I[a].instr == STORE) and (I[a].current-val # I[a].newval) then /* Not an idempotent update */ endif; endif; end.

B. Races Associated with Entering Critical Sections Our next algorithm detects atomic update of any shared memory location s (such as an LL-SC instruction pair in MIPS). To detect a critical section (CS), we use a state machine as shown in Figure 6. other instructions

wctions

w

any othcr instmction

Fig. 6 . State machine to detect Critical Sections in MIPS

The MIPS architecture ensures that any update to the memory location specified by the LL instruction before a store conditional (SC) to the same address occurs, then SC fails. Moreover any context switch between the two instructions also results in a failure. A simple check of the status of the link register would give us the information of whether a store conditional instruction succeeded. A transition to state A2 in Figure 6 indicates entering a critical section protected by a lock acquired on the shared location s. Only if the thread that successfully entered the critical section executes an update on s, do we assume that the program has left the critical section. Hence all read-write dependencies within the critical section are not considered as races. The following algorithm is executed by each thread a to detect whether it is executing inside a critical section.

24 Input :Trace file 7; Algorithm detect-atomic-updates 7 [ a ] : last event record in 7 ; state : current state in the range [A0 to Az]; : shared memory location accessed by current LL instruction; s_ll begin if (memory accessed == shared) then let 7 [ a ]be the corresponding event. /* initial state = A0 */ if (I[a].instr == LL) and memory accessed, s == shared) then state = A1 ; /* hoping for an atomic update */ s l l = 7 [ a ] . s ; /* remember the location LL accessed */ endif; if (state == A1) and (I[a].instr # SC) then remain in srare A1; if (state == A1) and (I[a].instr == SCS) then state = Az; /* state = A2 implies successful atomic update of lock variable - entering Critical Section */ if (state == A2) and (7[a].s == d l ) and (instr == STORE) then /* Let newVal= value that the current STORE instruction of */ /* thread i is attempting to write at shared location s */ if (‘T[a].current-val # 7[a].newval) then state = Ao; /* exit Critical Section */ else remain in state A z ; endif end.

C. Barrier Code Detection

Example 1 demonstrates an implementation of a barrier. We saw in the previous two sections how some of the data races may be removed by simply considering idempotent updates and identifying races on shared locations to gain access to critical sections. In this section we outline an algorithm to identify a barrier from the runtime trace. A barrier forces all threads to wait until all threads reach the barrier and then releases all of the threads. It is a common synchronization primitive in programs with parallel loops. Our algorithm is based on the state machine depicted in Figure 7. The state machine is an augmented version of the one in Figure 6 . A typical implementation of a barrier is done with two spin locks - one to protect a counter that keeps track of the number of threads waiting at the barrier and another to hold the threads until the last thread arrives at the barrier. An observation arising from looking at different algorithms for the barrier synchronization operation reveals that any correct implementation of the barrier needs contain the following sequence of operations in order :

1) Entering a critical section for exclusive update to shared locations.

25

2) An increment or a decrement operation on some shared location X . 3) A comparison operation on X . 4) Updating location X either through a complementing instruction or a store instruction and also updating some other shared location Y.

complement X

or update X

location complement X or update X with 0 or 1

on a shared location X other instructions

Fig. 7. State Machine to detect barriers in MIPS

Of course we can have an implementation of a barrier where operations 3 and 4 could be outside a critical section [6] as given example 2. But such a implementation suffers from the bug that it is possible for the waiting threads to go into an infinite wait if one thread is trapped at the previous instance of the barrier, and the number of threads can never reach the value of total. A typical correct solution as shown below in the C-code would be to count the number of threads as they exit the barrier and not allow any thread to enter or initialize the barrier until all threads have exited the last instance. This would imply placing all updates on the two shared locations used as spin locks, to be made inside the critical section. I* A Correct Barrier Implementation *I local-sense = !local-sense; I* toggle local-sense *I lock(counter1ock); I* enter CS *I count = count + 1; I* keeps track of arriving threads - shared variable *I if (count == total)

{ count = 0; I* reset counter *I release = local-sense; /* release processes - shared variable *I

1

unlock(counter1ock); I* exit CS *I spin(re1ease == localsense); I* wait until all threads arrive *I

26

Our algorithm assumes that this is the case and hence we are able to detect a subset of all barrier implementations - ones which conform to the above discussion. The rules for transitions from state Ao to A3 are the same as in the algorithm detect-atomic-updates and hence are omitted in order to illustrate only those that deal with the detection of a barrier from the runtime trace. Input : Trace file 7 ; Algorithm detect-barrier-code 7[a] : last event record in trace file 7; state : current state in the range [A0 to A T ] ; s-ll : shared memory location accessed by the LL instruction; begin /* look for a CS definition */ if (state = A2) then /* look for an incremenVdecrement instruction *I if (increment or decrement on a shared location X ) then state = A3; /* Now look for a compare instruction on X */ if (state == A s ) and ( X is compared with some value) then state = Ad; if (the next instruction on X is a complementing instruction) or (I[a].newval== 0) or (I[a].newval == 1) then state = A s ; if (state == A5) and (7[a].instr == STORE) and ( 7 [ a ] . s == Y )then state = A7; endif; if (state = A4) and ( I [ a] .in str== STORE) and (T[a].s == Y )then state = A s ; if (state == A s ) and (((instr == a complementing instruction) and ( T[ a] .s == X ) ) or (I[a].newval== 0) or (I[a].newval == 1)) then state = A7; endif; if(state == A7) and (I[a].instr == STORE) and ( I [ a ] .== s s-ll) then /* Let newval = value that the current STORE instruction of */ /* the thread is attempting to write at shared location s */ if (I[a].current~val# 7[a].newval) then /* Barrier detected from runtime trace */ out(”Barrier Primitive detected”); state = Ao; /* exit Critical Section */

endif; endif endif; endif; end.

The use of the above three algorithms : detectjdempotent-updates, detect-atomic-updates and detect-barrier-code on a runtime trace generated from example 2 would identify all the data race conditions as benign data races.

27

D. Races with Implicit Synchronization Operations In certain situations, even if there be no explicit synchronization operations, the program semantics may impose certain restrictions on the order of execution of instructions by different threads. Such an example is illustrated in Figure 8, where a naive race detection algorithm would detect four races, two on the variable z and two on flag. A typical implementation of the while spin loop in MIPS as depicted in figure 8 would look something like this, loop :

LD BEQZ

r2,O(rl) r2, loop

; assume [rl] + flag, a shared variable ; loop while flag = 0

However, after taking the semantics of the trace fragments into account, there are only three races that can actually occur. The data race on 5 between the instructions z = y and a = z can not occur, provided that flag was set to FALSE at initialization. A good race detector must not detect such benign races and report only the feasible ones. The subsequent discussion aims at developing an algorithm to detect such implicit synchronization operations that make certain races impossible.

Lemma 5. If the program semantics is such that a total ordering is imposed on the execution of the threads by some implicit synchronization, then no data race is generated for the shared locations involved, even though there may not be any explicit synchronization primitives to control access to those shared location(s).

r=y.

Fig. 8. A trace with implicit synchronization operation

Our algorithm makes use of a state machine to keep track of the status of a shared variable being used in such implicit synchronizations. Depending on the status of the state machine and some additional state variables, we can determine dynamically from a runtime trace whether there are any such implicit synchronization operations that control access to certain shared variables. A description

28

Q

flag = TRUE

Fig. 9. A I-bit state machine

of the state machine and other state variables used in our algorithms is presented below. 0 0

A 1-bit state machine which functions as shown in Figure 9 : shared-var-bit : if ((instr == LOAD) and (memory accessed = shared)) then shared-var-bit = 1; else shared-var-bit = 0;

0

syncflag-bit : The variable is set when the following conditions hold : if ((instr == Cond. Jump) and (shared-varbit == 1)) then syncflag-bit = 1;

0

0

0

sync-varaddr : Contains the location of the shared variable being accessed during the conditional jump instruction. It is set simultaneously with the sync-jug hit variable. state-change-time : Records the timestamp of when the state machine changed its state from S1 to Sz. In addition to the above state variables our algorithm makes use of two other attributes from the dynamic trace, associated with each shared variable, as demonstrated in Figure 10:

- Proc-id : The id of the processor that last accessed the location. - TS : The timestamp of the last access. The system clock may be used for the timestamping purpose. We now present the algorithms to maintain the state machine and for detection of race conditions caused by implicit synchronization primitives.

29

shzed l m m

Fig. 10. Each shared variable is augmented with 2 attributes

Input : Trace file 7; Algorithm statemachine : current value stored at location sync-var-addr; current-val : value that a thread is attempting to write at sync-var-addr; newval : current state of the state machine; state begin /* initialization code */ if ((instr == LOAD) and (memory accessed == shared)) then /* loaded a shared variable *I shared-var-bit = 1; else shared-var-bit = 0; state = S1; endif if ((instr == cond. jump) and (shared-var-bit == 1)) then /* the shared variable is being used for synchronization */ sync-flagbit = 1; sync-var-addr = address of shared memory location used for the jump; /* Let flag denote the variable corresponding to sync-var-addr */ I* state machine transitions *I I* Let i = id of currently executing thread */ if (instr == STORE) and (flag.proc-id # i) and (newval # current-Val) then /* update of synchronization variable *I I* if here, some other thread other than the currently executing */ /* thread changed the value of the synchronization variable *I state-change-time = current system time; /* record first update to flag */ state = S2; I* value read = TRUE *I call(false-race-detector);

endif; endif end.

Algorithm false-racedetector begin if ((shared-var-bit) and (sync-flag-bit) and (state = Sz)) then I* Let z = the shared vxiable being accessed */ I* Let f l a g =variable corresponding to sync-varaddr */ I* Let us also assume that i is the id of the thread *I I* accessing z. Then we have : *I

30 current-time = time of current access of x ;

if ( z . p r o c i d # i) then I* the same thread not accessing the *I I* shared variable again, otherwise trivial case of not a data race *I if ((x.TS < current-time) and (x.TS < statexhange-time) and (fZag.TS I* Both the previous access of I and the write *I I* of flag occurred before the c m n t access of x *I I* Hence the 2 accesses to z are ordered in time *I I* by the spinning on variable f l a g . *I print(”This access to ’2’is nor u race condition”); endif; endif: endif end.

5 currentrime)) then

Referring to trace in Figure 8, the statement (flag.TS 5 current-time) in the algorithm false-racedetector makes it possible to handle the simultaneous access to flag and x. Also the inclusion of the condition clause (x.TS < state-change-time) makes it possible to detect the feasible data races between statements such as, 1) x = 2; and x = y; x = 2; and a = x;

5. Conclusion We have presented a new definition of race condition in terms of runtime generated trace and developed several algorithms for the on-the-fly detection certain benign data race conditions that may be discarded during debugging, without introducing any potentially harmful effect on the program behavior. Our algorithms do not require instrumentation of the source code for detecting race conditions and are characterized by low overhead in terms of hardware support and effect on program execution time. Future work may include exploring other such false race conditions and testing the algorithms on actual runtime traces in order to compare the number of false races eliminated from a trace and the overhead imposed on the system in terms of hardware and performance slowdown.

References [l] R. H. B. Netzer and B. P. Miller, ”What are race conditions ? Some issues and formalizations,” ACM Letters on Programming Languages and Systems, vol. 1, no. 1, March 1992, pp. 74-88. [2] E. Pozniansky and A. Schuster, ”Efficient on-the-fly data race detection in multithreaded C++ programs,” ACM SIGPLAN Symp. on Principles and Practice of Parallel Programming (PPoPP),June 1113,2003, San Diego, USA, pp. 179-190.

31 [3] M. Ronsse and K. De Bosschere, ”Recplay : a fully integrated practical recordreplay system,” ACM Trans. on Computer Systems, vol 17, no. 2, May 1999, pp. 133-152. [4] M. Prvulovic and J. Torrellas, ”Reenact : using thread-level speculation mechanisms to debug data races in multithreaded codes,” Proc. of 30th Annual International Symp. on Comp. Arch. (ISCA-30), June 2003, pp. 110-121. [S] M. Xu, R. Bodik and M. D. Hill, ”A flight data recorder for enabling full-system multiprocessor deterministic replay,” Proc. of 30th Annual International Symp. on Comp. Arch. (ISCA-30), June 2003, pp. 122-135. [6] J. L. Hennessey and D. A. Patterson, ”Computer architecure, a quantitative approach,” Morgan Kaufmann Publishers, 3Td Ed., 2003. [7] J. Mellor-Crummey, ”Compile-time support for efficient data race detection in shared-memory parallel programs,” Proc. I993 ACWONR workshop on Parallel and distributed debugging, San Diego, USA, pp. 129-139. [8] S. Savage et. al., ”Eraser: a dynamic data race detector for multithreaded programs,” ACM Trans. on Computer Systems, vol. 15, No. 4,November 1997, pp. 39141 1. [9] M. M. K. Martin et. al., ”Correctly implementing value prediction in microprocessors that support multithreading or multiprocessing,” Proc. 34th Annual A C M E E E International Symp. on Microarchitecture ,2001, pp. 328-337. [lo] R. O’Callahan and J-D Choi, ”Hybrid dynamic data race detection,” ACM SIGPLAN Symp. on Principles and Practice of ParaJlel Programming (PPoPP), June 1113,2003, San Diego, USA, pp. 167-178. [ 111 A. Goel, A. RoyChowdhury and T. Mitra, ”Compactly representing parallel program executions,” ACM SIGPLAN Symp. on Principles and Practice of Parallel Programming (PPoPP), June 1113,2003, San Diego, USA, pp. 191-202.

REPRESENTING SERIES-PARALLEL GRAPHS AS INTERSECTION GRAPHS OF LINE SEGMENTS IN THREE DIRECTIONS

MANUEL BODIRSKY Department of Algorithms and Complexity, Institute for Computer Science, Humboldt University, Berlin, Germany [email protected]

CORNELIA DANGELMAYR Institute for Mathematics 11, Fkee Uniuersity, Berlin, Germany [email protected]

JAN KAM* Department of Applied Mathematics, Faculty of Mathematics and Physics, Charles University, Prague, Czech Republic [email protected]

In this paper we show that series-parallel graphs (i.e., K4-minor free graphs) can be represented as contact intersection graphs of straight-line segments in three directions. Moreover, in our representations no two segments of the same direction intersect.

1. Introduction

Given a set of segments S in the plane, its intersection graph Gs has a vertex for every segment, and two vertices are adjacent if the corresponding segments intersect. We also say that S is the representation of Gs. An intersection graph of segments is called a contact graph if no two segments cross (i.e., if two segments have a common point, then this point is an endpoint of at least one of them). Intersection graphs of segments have been ~~

~

‘Supported by project lM0021620808 of the Ministry of Education of the Czech Republic. Part of the work was done while the author has been visiting Humboldt University, with the support by a Marie Curie Fellowship of the European Graduate Program “Combinatorics, Geometry, and Computation’’ under contract number HPMT-CT-2001-00282.

32

33

intensively studied in the past. The following question of Scheinerman6 has motivated a lot of research: Is every planar graph the intersection graph of a set of segments in the plane? H. de Fraysseix, P. Ossona de Mendez and J. Pach5 and independently I. Ben-Arroyo Hartman, I. Newman and R. Ziv3 have shown that every planar bipartite graph is the contact graph of a set of segments in two directions. N. de Castro et aL2 have extended the result and showed that planar triangle-free graphs are contact graphs of segments in three directions. Probably the best result in this direction is the paper by H. de Fraysseix and P. Ossona de Mendez4 showing that every 4-connected 3-colorable plane graph is the contact graph of segments and that any 4-colored planar graph without an induced C4 using 4 colors is an intersection graph of segments. In this paper we show that every series-parallel graph has a contact representation by segments in three directions. The paper of Fkaysseix et aL4 covers all series-parallel graphs, but cannot be adapted to give contact representations using only three directions. Three directions are best possible, because a series-parallel graph might contain a triangle. For the same reason, our work is not covered by the work of N .de Castro et aL2. Furthermore, our construction appears to be simpler. 2. Series-parallel Graphs

A series-parallel network ( S P network) is a graph with two distinguished vertices s and t , called a source and a sink, inductively defined as follows: let G I and G2 be two SP networks where 51 is the source of G I , tl is the sink of G I ,and similarly s2, t2 are the source and the sink of G2. Then the following graphs are also SP networks: The graph G , that is created from G I and G2 by identifying tl with s2. The source of G , is s1 and the sink of G , is t2. This operation is called a serial composition. The graph G , that is created from G I and G2 by identifying s1 with s2 and tl with t2. The source of G , is the vertex created by identification of the sources and the sink of G , is the vertex created by identification of the sinks. This operation is called a parallel composition.

A series-parallel graph ( S P graph) is a graph G such that the 2-connected components of G are SP networks. It is a well-know fact' that SP graphs

34

can be equivalently characterized as &-minor free graphs. An SP network G is called maximal if each serial composition during its creation was followed by a parallel composition with an edge. 3. Intersection Graphs of Line Segments

We study intersection graphs of segments in three directions in which no two segments with the same direction can intersect, no three segments share a common point, and a common point of two segments is an endpoint of exactly one of them. We denote this class of graphs by PURE-3-SEG-CONTACT. The three directions we consider in our paper are horizontal, vertical and diagonal (defined by the equations x = 0, y = 0, and x = y, respectively). In the rest of the paper we consider only segments and lines in one of these three directions. For a vertex v let V denote the segment representing the vertex v. A cone is an area between two different halflines ZI, 12 ending in a single common point. We always consider only cones with an angle smaller than or equal . say that a cone is bounded by two segments 01, 0 2 if they both to ~ 1 2 We contain the point 11 n 1 2 , one of the segments is contained in one of the halflines (say 01 c 1 1 ) and the other segment shares more than one point with the other half line (say 0 2 n 12 contains more than one point). See Figure 1 for an example of a cone. Let C be a cone bounded by two segments 01 and 0 2 . We say that C is admissible if there exists a segment r c C such that r is in the (unique) direction different from the directions of 0 1 and 0 2 and r shares a common point with 01 and a common point with 0 2 . Moreover, these common points are not endpoints of 01 or 0 2 . We say that r is assuring admissibility of C. We begin with an easy lemma about touching representations.

Lemma 3.1. Let G = (V,E) be a graph and IG its representation in PURE-3-SEG-CONTACT.T h e n f o r each subgraph GI = (Vl,E’) of G there as a representation IGI of the graph G’ in PURE-3-SEG-CONTACT. f i r thermore, IGI is obtained f r o m IG by shortening some of the segments and deleting some of the segments. Proof. We first remove the segments from IG that correspond to vertices not present in GI. Thus we obtain a representation of G[V’].Now let F = E(G[V’])\El. If F = 0, we are done. Otherwise consider e = {u,v} E F . Segments U and Tj meet in a common point in IG[v’] and wlog. U ends there. We can shorten 21 so that it does not intersect V,but all other intersection

35

points are preserved. Obtained representation will clearly represent graph G[V’] - e. If we repeat the above procedure for each edge e E F, we clearly obtain a representation I& of G’. 0 4. Representations of SP Graphs Now we present a lemma about representations of SP networks.

Lemma 4.1. For every maximal SP network G with source s and sank t , every admissible cone C bounded by two segments a and a’, and for every segment r assuring admissibility of C there exists a representation IG of G in PURE-3-SEG-CONTACT such that a = S , (TI = j, and IG is inside the triangle defined by 0 , ( T I , and r.

Proof. We prove the lemma by induction on the size of G. If G contains just s and t connected by an edge we are immediately done (a and (T’ already represent the graph). If G is a serial composition of graphs GI and G2, let u denote the common node of the two composed graphs. We represent u as a segment ii = r given in the statement of the lemma. It is now easy to check (see Figure 1) that there are admissible cones bounded by a and r and bounded by (T’ and r. Hence GI can be represented in the admissible cone bounded by a and r limited by a segment as shown in Figure 1 by induction. Similarly we can represent G2 between a‘ and r.

Figure 1. In the left: a cone bounded by segments u1 and u2. The cone is admissible and 7 assures its admissibility. In the right: an example of a representation of a serial composition of G1 and G2.

Otherwise G is a parallel composition of GI and G2. We can first by induction represent GI in C limited by r and then choose a segment r1in C parallel to 7 and close enough to the intersection of (T and a’ so that it

36

does not intersect any segment of the representation of GI.Then we can represent Gz in C limited by r’ by induction, and we are done. 0 Now we introduce a lemma allowing us to make a graph 2-connected. We omit its proof due to space limitations.

Lemma 4.2. Let G be a graph without K4 as a minor. T h e n there is a graph H such that G is a subgraph of H , H is &connected, and H does n o t contain K4 as a minor. Theorem 4.1. Let G = (V,E ) be a n SP graph. T h e n G has a representat i o n IG E PURE-3-SEG-CONTACT. Proof. First we use Lemma 4.2 to obtain a graph H that is 2-connected, does not contain a K4 minor, and contains G as a subgraph. Then we add edges to H and obtain a maximal SP network H’. The application of Lemma 4.1 to H’ yields a representation that lies between two arbitrary touching segments o and a’ in different directions. Finally we apply Lemma 3.1 and obtain a representation of G. 0 Acknowledgement The authors would like to thank anonymous referees for useful comments and simplification of the proof.

References 1. Andreas Brandstadt, V. Bang Le, and Jeremy P. Spinrad. Graph Classes: A Survey, SIAM Monographs on Discrete Mathematics and Applications, 2000.

2. N. de Castro, F. J. Cobos, J. C. Dana and A. MArquez. Triangle-Free Planar Graphs as Segment Intersection Graphs, Journal of Graph Algorithms and Applications, Volume 6, pp. 7-26, 2002. 3. I. Ben-Arroyo Hartman, I. Newman and R. Ziv. On grid intersection graphs, Discrete Math. 87, pp. 41-52, 1991. 4. Hubert de Fraysseix and Patrice Ossona de Mendez. Contact and Intersection Representations, GD 2004, LNCS, Volume 3383, pp. 217-227, 2005. 5. Hubert de Fraysseix, Patrice Ossona de Mendez and Janos Pach. Representation of planar graphs by segments. In Colloquia Mathematica Societatis Jdnos Bolyai, Intuitive Geometry, Szeged (Hungary), 1991. 6. E. R. Scheinerman. Intersection classes and multiple intersection parameters of graphs. PhD thesis, Princeton University 1984.

Finding the convex hull of a dense set Pave1 Valtr Department of Applied Mathematics and Institute for Computer Science, Charles University, Malostransk6 n8m. 25, 118 00 Praha 1, Czech Republic

> 0, a set A of n points in the plane is called adense, if the ratio between the maximum and the minimum distance in A is at most a f i . We show that the convex hull of an a-dense set of size n can be found in O(nlog(min{n,a})) steps, which is worst-case asymptotically optimal in terms of n and a in the computation model of algebraic decision trees of a fixed order. We also show an asymptotically worst-case optimal output-sensitive bound O ( nlog(min{n, a, h } ) ) ,where h is the number of vertices of the convex hull. Abstract. For a

1

Introduction

The planar convex hull problem is the problem of finding the sequence of vertices of the convex hull of a given finite planar point set, where the vertices appear in the order in which they appear around the convex hull. A less demanding version of the convex hull problem is the planar extremal points problem which asks for the set of vertices of the convex hull of a given finite planar point set, irrespective of their order around the hull. The computational complexity of the convex hull problem was one of the first problems investigated in computational geometry. The first efficient algorithm solving the convex hull problem in time O(nlogn), where n is the size of the input point set, was proposed by Graham [Gr72] more than 30 years ago. Several further algorithms solving the convex hull problem in time O ( nlog n ) were published in the second half of the seventies [BS78,Pr79,PH77,Sh78]. It was also shown that the bound O ( nlog n) is worst-case optimal for the convex hull problem and for the extremal points problem on different computational models [Av82,EB80,PH77,Ya81]. Bentley and Shamos [BS78] investigated the convex hull problem for random sets and showed algorithms with expected linear time for random sets defined in several different ways. Kirkpatrick and Seidel [KS86] have found an algorithm solving the convex hull problem in time O(nlogh), where h is the size of the output, i.e., the number of vertices of the convex hull of the input set of n points. Kirkpatrick and Seidel [KS86] also proved that the bound O ( nlog h ) is optimal in terms of n and h in the worst case. Further convex-hull algorithms with the running time O(n1ogh) were given in [Ch96,CSY97]. A survey on the convex hull problem can be found in [Se97]. Here we investigate the convex hull problem for so-called a-dense sets. For two points x and y in the plane, let Ixy( denote the distance between x and y. 37

38 Let A be a set of n points in the plane. We define the spread of A , q ( A ) ,by

q ( A ) := For a

> 0,

max{labl : a, b E A } min{ lab1 : a , b E A, a # b} '

we say that the set A is a-dense, if q ( A )

5 a&. It is known

d w

M that one can find an arbitrary large a-dense set if and only if a 2 1.05. Combinatorial and other properties of dense sets have been studied in [AlSO,AKP89,EVW94,Er05,Va92,Va93,Va94,Va96,Ve97]. The paper [Er05] motivated us to prepare our paper. The research described in this paper originated during PhD studies of the author a t the Free University Berlin and was partly described in the author's PhD thesis [Va94]. In Section 2 we show an algorithm for the convex hull problem which, in particular, runs in linear time if the input set is a-dense for any fixed constant

a. Theorem 1 T h e convex hull of a set A of n points in the plane can be found in t i m e O ( n log a ) , where the ratio between the largest and the smallest distance determined by pairs of points of A is a f i . On the other hand, we show that the bound O(nlog(min{n,a})) for the complexity of the convex hull problem (which we get by combining the general bound with the bound in Theorem 1) is asymptotically worst-case optimal in terms of n and a in the model of algebraic decision trees of a fixed order. In fact, we show the lower bound n(nlog(min{na})) even for a weaker version of the extremal points problem.

Theorem 2 Let d be fixed. T h e n n(nlog(min{n,a})) steps are necessary, in the worst case, t o decade whether a given subset A' of a given a-dense set A of size n is or is not the set of vertices of the convex hull of A , with any d t h order algebraic decision tree algorithm. Thus, already if a 1 0 ( n E ) ,E > 0, we cannot gain anything on the complexity of the convex hull problem from the a-density of the input set, and any O(n1og n)-time algorithm is asymptotically worst-case optimal in this case vertices in this case). (although the convex hull can have at most O(TZ'/~+") For comparison, note that one can decide whether a given set A of size n is a-dense in time O(n1ogn) since both the largest and the smallest distance in A can be found in time O(n1ogn) (e.g., see [PS85]). However, the bound O(n1ogn) is asymptotically worst-case optimal. Using the same technique, we can also establish an asymptotically tight output-sensitive bound:

Theorem 3 T h e convex hull of a n a-dense set A of n points in the plane can be found in t i m e O(nlog(min{n,a,h})), where h is the number of vertices of

39

the convex hull of A . This bound is asymptotically worst-case optimal (provided h 5 afi'). Our convex hull algorithm is quite simple and is based on the well-known QuickHull algorithm [Ed77,By88]. We describe it in Section 2. The lower bound is shown in Section 3. Theorem 3 is proved in Section 4.

2

The algorithm

Let A be a set of n points in the plane which is a-dense. We shall first assume that the minimum distance in A is known to be 1. We also assume that a 5 n1I4. (If Q > n1/4then we can use general convex hull algorithms with running time O(n1ogn) 5 O(n1oga)). We use a variant of a so-called QuickHull algorithm [Ed77,By88]. In the algorithm we shall use a recursive procedure HULL (Aabc,M ) which finds, for a triangle Aabc with L(acb) 2 7r/2 and for a set M of points in the triangle Aabc with a, b $ M , the convex hull of the set MU { a , b} and writes its vertices different from a and b in the order in which they appear in the hull from a to b to a sequence S. Here is the procedure:

C

b

a

Fig. 1. Divide-and-conquer for the convex hull problem.

Procedure HULL (Aabc,M ) 1. If the distance of the point c to the line ab is smaller than 1,then find the convex hull of MU{a, b } by any general method which requires time O(lMI log(lMI)), write its vertices in the required order to the sequence S , and EXIT. 2. Among the points of M with the largest distance to the line ab, find the point m E M which is nearest to the point a. Let 1 be the line containing m and The technical condition h 5 (t.6 eliminates pathological cases with h M A . q(A ) = A . (t.6 where , the relation between h and q (A) strongly determines the structure of the set A .

40

parallel to the side ab, and let ma and mb be the points of intersection of 1 with the sides ac and bc, respectively (see Fig. 1). 3. Find Ma= ( M \ { m } )n A a m m , and Mb = ( M \ { m } )n Ambmb. 4. Run HULL ( A a m m , , M a ) . 5 . Write m to the sequence S . 6. Run HULL (Ambmb,Mb). The correctness of the procedure is left to the reader. Here is our convex hull algorithm:

Algorithm CONVEX-HULL ( A ) 1. Find the smallest axis-parallel rectangle stuv containing all the points of A. 2. Among the points of A lying on the line st, find the rightmost point p t . Among the points of A lying on the line tu, find the topmost point p,. Among the points of A lying on the line uv,find the leftmost point p,. Among the points of A lying on the line us, find the bottommost point p,. 3. Find A1 = A n ( A p t t p , \ { p t , p u } ) , Az = A n (Ap,up, \ {p,,pu}), A3 = A n ( A P ~ V\ P{ ,P ~ , P ~A4 H ,= A n (Ap,spt \ { p s , p t H . 4. Run HULL ( A p t p u t , A I ) , 5 . Write p , to the sequence S , 6. Run HULL (Ap,p,u, A z ) , 7. If p , # p , then write p , to the sequence S , 8. Run HULL (Ap,p,v, A3), 9. If p , # p , then write p , to the sequence S , 10. Run HULL (Ap,pts,A4), 11. If pt # ps then write pt to the sequence S . It is shown in [Va94] that Algorithm CONVEX-HULL ( A ) requires time at most O ( nlog a ) for an a-dense set A of n points. Because of space limitation, we do not give the proof here. If the minimum distance in A is not known then it is possible to proceed similarly as above with the main exception that we change the order of calls of procedure HULL so that each call of HULL ( A a b c , M ) precedes all calls of HULL (Aabc,M ) with substantially smaller area of Aabc. Instead of a direct call of procedure HULL (Aabc,M ) we always write the pair (Aabc,M ) to LIST, where LIST is a set collecting pairs of parameters of those calls of procedure HULL which are to be done in future. This is explained in detail in [Va94].

3

The lower bound

For a positive integer d, a d-th order algebraic decision-tree algorithm T for testing membership in a set W c R" is represented by a rooted tree whose internal nodes are labeled by multivariate polynomials of degree a t most d and whose leaves are labeled either YES or NO. Each internal node has out-degree three; the out-going edges are labeled <, >, and = reflecting possible outcomes

41

on comparison of polynomial with 0. Every input z E R" defines a unique path from the root to a leaf in the obvious way. We say that T decides the membership in W if, for every z E R", the vector z defines a path to a YES leaf if and only ifzEW. The proof of Theorem 2 uses some ideas of Ben-Or [B083] and Kirkpatrick and Seidel [KS86]. We follow Kirkpatrick and Seidel in applying the following result.

Theorem 4 (Ben-Or [BO83]) Let W R" be a n y set in R" and let T be any d t h order algebraic decision tree that decides the membership an W. If W has N disjoint connected components then T m u s t have height R(1og N - n ) . We shall define a modified version of the extremal points problem which clearly does not have a larger complexity than the extremal points problem. The MEP-problem (modified extremal points problem) asks to decide, for a given sequence A = ( a l ,a 2 , . . . ,a,) of n points in the plane, the existence of an integer k > 2 and a circle C such that the following three conditions hold: (Cl) The points a l , a2,. . . ,a k lie on the circle C and form a regular k-gon a 1 a2 . . .a k , (c2) all the points a 2 k + l , a 2 k + 2 , . . . ,a, have distance at least rc 1 and a t most 2 r c to the center of C , where rC is the radius of C , (C3) the vertices of the convex hull of A are exactly the points

+

a11 a 2 , . . ., a 2 k .

Theorem 5 For 10 5 a 5 4,the MEP-problem for a-dense sets A requires R(n1oga) steps in the worst case, with any d t h order decision algorithm. Theorem 5 will imply Theorem 2 for 10 5 a 5 fi,since the existence of k and C satisfying conditions (Cl) and (C2) can be established in time O ( n ) and then, if such (unique) k and C exist, the validity of condition (C3) can be established in time necessary for deciding whether a given subset is or is not the set of vertices of the set A.

Proof of Theorem 5. Let n E W and 10 5 a 5 fi be fixed. Set t := According to Theorem 4, it suffices to prove that the set

[&I.

Ma = { ( a l , . . . ,an) E R2" : the MEP - problem is answered in the affirmative for A = ( a l ,a2,. . . ,a,) and k = t } has at least tn-2t disjoint connected components containing at least one a-dense set. Consider all sequences A = ( a l ,a z 1 .. . ,a,) satisfying conditions ( C l ) ,(C2), (C3) with k = t and r = such that the vertices of the convex hull of A appear in the following order: a l l U k + l , a2, a k + 2 , . . . , a k , a 2 k . Each of the points @ k + l , a 2 k f 2 , . . . , a , must lie in one of the k triangles a l a k + l U 2 , a 2 a k + 2 a 3 , . . . , a k a z k a l (see Fig. 2). If we place points U k + l , a k + 2 , . . . , a 2 k suitably, then it is possible, for every i = 1 , 2 , . . . , k l to place, for instance, all the

9,

42

points a 2 k + l , a 2 k + Z 1 . . . ,a, in the triangle aiak+iai+l so that they obey condition (C2) and that each mutual distance in the set A = {al, a2,. . . , a,} is at least one (which implies that A is a-dense). It follows that the n - 2k points a2k+l, U 2 k + 2 , . . . ,a, can be placed into the k triangles a l U k + l I & , a 2 a k + 2 U 3 , . . . , aka2kal in any of the kn-2k ways so that, moreover, conditions (Cl), (C2),(C3) hold and the set A is a-dense. Thus, there are at least kn-2k different a-dense sets of size n which satisfy conditions (Cl), (C2), (C3) and obviously lie in different components of the set Ma.

el

Fig. 2. A set from Ma with k = t = 4.

Now we can show our lower bound.

Proof of Theorem 2. Let n E N and a be fixed. If a < 1 then there are no large a-dense sets. If 1 5 a 5 10 then the bound O(nlog(min{n,aZpha})) = O ( n ) is obvious - we have to visit all the n points at least once in the worst case. If 10 5 a 5 fi then the bound O(nlog(min{n,aZpha})) = O(n1oga) follows from Theorem 5 . Finally, if a 2 fi then the bound O(nlog(min{n,aZpha})) = el O(n1ogn) follows from Theorem 5 with a = fi.

43

4

Output-sensitive bounds

Here we prove Theorem 3. To obtain the upper bound in it, we simultaneously run any three algorithms with the respective running times O ( nlog n ) , O ( nlog a ) , O ( nlog h ) , and stop a t the moment when the first of the three algorithms gives an output. To prove the lower bound, we proceed similarly as in the proof of Theorem 5 . If h 5 10 or fi 5 10 (say), then the bound in the theorem is linear and it follows from the fact that we have to visit each point a t least once. If 10 < h 5 fithen we set t := [ h / 2 ] and proceed as in the proof of Theorem 5. If 10 < fi < h 5 a& then we set t := [fi] as in the proof of Theorem 5. We change the three conditions in the definition of the MEP-problem to the following three conditions: (Cl) The points a1 ,a2,. . . ,ah lie on the circle C and form a regular k-gon a1u2 . . .ak, (C2') all the points ah+l, ah+2,. . . ,a, have distance at least r c 1 and Accorda t most 2rc to the center of C, (C3') the vertices of the convex hull of A are exactly the points a1 , a2, . . . , a h . ing to Theorem 4 , it suffices to prove that the set

+

M , = { (a1 ,. . . ,a,) E RZn: the MEP - problem is answered in the affirmative for A = ( a l ,a z , . . . , a,) and k = h } has a t least ( t / 3 ) " - h disjoint connected components containing a t least one adense set. Consider all sequences A = ( a l la 2 , . . . , a , ) satisfying conditions ( C l ) ,(CZ'), (C3') with k = t and r = such that the vertices a l , a 2 , . . . ,a 4 k j 3 appear in the same places as in the proof of Theorem 5. The points a4k/3+1,a4k/3+2,.. . ,a h appear on the convex hull of A in this order proportionally in the 2 k / 3 gaps in the chain of the points ak/3+1,ak/3+2,. . . , a h , a l . Each of the points a h + l , a h + 2 , . . . , a , must lie in one of the k / 3 triangles alak+la2, a~ak+2a3,. . . ,ak/3a4k/3ak/3+1.If we place the points ak+l , Uk+2,.. . ,a h suitably, then it is possible, for every i = 1 , 2 , . . . , k / 3 , to place, for instance, all the points ah+l,ah+z,.. . , a , in the triangle aiak+iai+l so that they obey condition (C2') and that each mutual distance in the set A = { a l ,a z , . .. , a , } is a t least one (which implies that A is a-dense). It follows that the n - h points ah+l, ah+2,. . . , a, can be placed into the k triangles alak+la2, a2uk+2a3,.. . ,aka2kal in an arbitrary of the ( l ~ / 3 ) " - ~ ways so that, moreover, conditions ( C l ) , (CZ'), (C3') hold and the set A is adense. Thus, there are a t least ( l ~ / 3 ) " different -~ a-dense sets of size n which satisfy conditions (Gl), (C2'), (C3') and obviously lie in different components of the set M,.

9,

a

References [A1901

R. Alexander, Geometric methods in the study of irregularities of distribution, Gombinatorica 10 (1990), 115-136.

44 [AKP89] N. Alon, M. Katchalski, and R. Pulleyblank, The maximum size of a convex polygon in a restricted set of points in the plane, Discrete Comput. Geom. 4 (1989), 245-251. [Av82] D. Avis, On the complexity of finding the convex hull of a set of points, Discrete Appl. Math. 4 (1982), 81-86. [B083] M. Ben-Or, Lower bounds for algebraic computation trees, Proc. 15th ACM Symposium on Theory of Computing, Boston (1983), 80-86. [BS78] J.L. Bentley and M.I. Shamos, Divide and conquer for linear expected time, Inform. Proc. Lett. 7 (1978), 87-91. [By881 A. Bykat, Convex hull of a finite set of points in two dimensions, Inform. Proc. Lett. 7 (1978), 296-298. [Ch96] T.M. Chan, Output-sensitive results on convex hulls, extreme points, and related problems, Discrete Comput. Geom. 16 (1996), 369-387. [CSY97] T.M. Chan, J . Snoeyink, C. Yap, Primal dividing and dual pruning: outputsensitive construction of four-dimensional polytopes and three-dimensional Voronoi diagrams, Discrete Comput. Geom. 18 (1997), 433-454. [Ed771 W. Eddy, A new convex hull algorithm for planar sets, ACM Trans. Math. Software 3 (1977), 398-403. [EB80] P. van Emde Boas, On the O ( nlog n) lower bound for convex hull and maximal vector determination, Inform. Proc. Lett. 10 (1980), 132-136. [EVW94] H. Edelsbrunner, P. Valtr, and E. Welzl, Cutting dense point sets in half, Discrete Comput. Geom. 17 (1997), 243-255. [Er05] J. Erickson, Dense point sets have sparse Delaunay triangulations, Discrete Comput. Geom. 33 (2005), 83-115. [Gr72] R.L. Graham, An efficient algorithm for determining the convex hull of a planar point set, Inform. Process. Lett. 1 (1972), 132-133. [KS86] D.G. Kirkpatrick and R. Seidel, The ultimate planar convex hull algorithm?, SIAM J. Comput. 15 (1986), 287-299. [Pr79] F.P. Preparata, An optimal real time algorithm for planar convex hulls, Comm. ACM 22 (1979), 402-405. [PH77] F.P. Preparata and S.J. Hong, Convex hulls of finite sets of points in two and three dimensions, Comm. ACM 20 (1977), 87-93. [PS85] F.P. Preparata and M.I. Shamos, Computational geometry, Springer-Verlag, New York (1985). [Se97] R. Seidel, Convex hull computations, in: J.E. Goodman (ed.) et al., Handbook of discrete and computational geometry, 361-375, Boca Raton, CRC Press Series on Discrete Mathematics and its Applications, CRC Press 1997. [Sh78] M.I. Shamos, Computational geometry, Ph.D. thesis, Yale Univ., New Haven (1978). [Va921 P. Valtr, Convex independent sets and 7-holes in restricted planar point sets, Discrete Comput. Geom. 7 (1992), 135-152. [Va93] P. Valtr, On mutually avoiding sets, in: R.L. Graham and J . NeSetfil (eds.), The Mathematics of Paul ErdGs 11, Springer, Algorithms and Combinatorics 14 (1997), 324-332. [Va94] P. Valtr, Planar point sets with bounded ratios of distances, PhD. thesis, Free Univ. Berlin (1994). [Va96] P. Valtr, Lines, line-point incidences and crossing families in dense sets, Combinatorica 16 (1996), 269-294. [Ve97] K. Verbarg, Approximate center points in dense point sets, Inform. Process. Lett. 61 (1997) 271-278. [Ya81] A.C. Yao, A lower bound to finding convex hulls, J. ACM 28 (1981), 780-789.

PROBABILISTIC LOAD BALANCING WITH OPTIMAL RESOURCES SET OF NODES IN CLUSTERS N.P.GOPALAN’, K.NAGARAJAN~ Department of Computer Science and Engineering, National Institute of Technology, Tiruchirappalli- 620 015, Tamilnadu, India [email protected], [email protected] A probability model for task allocation to nodes in a cluster for better CPU utilization and

reduces VO and communication overhead is presented in this paper. For thc NP-hard problem of task assignment, load balancing is done with a local and global schedulcr (LS and GS). A newly arriving task need not wait at the home nodc (HN) for the want of resources as GS takes care of it. Using thrce different queucs for the proccsses, GS assigns thc best set of optimal resources at a foreign node (FN) with a minimal transfer cost to the proccsscs. The experimentally simulated results show that the model compares well with the existing models and well suited for time and space sharing systems with minimal I/O overhead and excellent resource utilization

1

Introduction

Clusters are becoming the primary platform for executing, demanding, scientific engineering and commercial applications [ 11. They provide a cost-effective, highperformance, time-sharing and multi-user environment for executing parallel and sequential tasks. When lengthy processes operate in a dynamic environment with continuously changing conditions, the load balancing (LB) becomes a critical issue in the clusters, where loads on nodes may not be uniform. Most of the clusters are used for dedicated applications, which use parallel programming languages and static task assignment (RR and Random allocation) policies completely ignoring the current state of the resource utilization. On the other hand, the space sharing global resource management (Gang Scheduling and back filling with Gang scheduling) [4] partitions the clusters into disjoint sets of machines for executing parallel tasks to increase the resource utilization. This is mostly found in batch processing systems. To improve the utilization of the resources and throughput, it is essential that loads are to be balanced among the nodes. Similarly, the processor’s response time and average task waiting time are to be managed efficiently. The LB may be implemented using either by Process Migration (PM) and /or by Remote Execution (RE) [ 6 ] . During a PM, the running process is stopped and checkpointed and data are transferred to a FN.The PM is completed by “crashing” a process at the HNand restarting it at the FN from the transferred checkpoint data, thus maintaining the global consistency and safe data communication. In RE, when a process is transferred to another node and started, no memory is allocated for it, This causes fewer overheads and is significant while smaller processes are considered. This

45

46

paper addresses the user-level dynamic LB based on the appropriate choices of PM and RE, which can be achieved with interdependent LS and GS. The LS contains a Ready Queue (LRQ) with a “transferring and task assignment mechanism”. It runs at each node of the cluster. 2

The Proposed Model

The Cluster has a finite set of (n+l) nodes, N = {No,N1... N,} in which No denotes the HN and N,(15 i I n) are FNSwith the known probabilities of PNO,PN1.. .PNnfor the accommodation of a newly arriving task at Ni. The computation of each PNi is based on the availability of the (i) CPU and Memory during the process run and (ii) Number of processes waiting in queue and their properties [6]. For each node ‘i’, there is a compatibility matix Mi = [ d i 3, where 0 5 d i I 1, that decides the PM to the kth FN from the jthqueue. The value of d i can be computed using PACE [lo], before the process execution starts. When d i =1, then FNkis incompatible for ith process’s migration. The choice of the best optimal resource of a node is obtained as OptR (Nk) = Vk Min ( d i ) . Initially, one of the queues from GS is chosen and the resources from possible FNs are allocated to all the processes in that queue as for as possible before switching to another queue, using the function OPtR(Nk). The set of chosen Nks are referred as compatible sub-set {xc}. The set obtained by excluding {x,} from FNs is referred as incompatible sub-set {yic}.For the next queue in GS, the sought resources are chosen from {yic}instead of from all FNs. The above process is repeated until either {yic}becomes null or an optimal compatible subset couldn’t be found from it. The entire procedures are then repeated all over again from the start (from the set of all FNs) until all processes (in all the queues of GS) are assigned with optimal resources from FNS.The optimized transfer cost is computed as follows: Let the set S be possible compatible FNs. Find a set Ncmpc S with a minimal transfer cost. When the process pi’(ithprocess in j* queue) has a compatible sub-set x, in S, call it a compatible process. The is, conditional probability for the compatibility of a process

CJ

When the process

-

FNS} {Nc,},

C’is incompatible, then the sub-set yicis obtained as NIncmp={all

the conditional probability of incompatibility of the process

P ( y , ) = 1 - P ( n , ) and

P ( a N - F N s )=

c

‘Nk

N , ~ o l l_ F l i s

From (1) and (2), the optimal transfer cost is ,

c’ is, (2)

47

A process

f‘,! is selects a F N only if it can maximize the information per unit

cost of the process. i.e.,

mi,)}

q)

= {flxc)log f i x c )-fiic)log When the processes in Where, IPU(Nk, RAQ require the remote resources frequently, in order to reduce the I/O and communication overhead, the processes may be transferred to those FNs where the resources are available to a maximum extent and for the most of the time. Let T be the total time to run a process and a be the probability that follows an exponential distribution for it to require a remote node for time duration Top[ with in T. Let Tstore,TReeivebe the times required to store and retrieve the checkpoint image from the stable storage respectively. It can be seen that TStore = TDelay + Tsys+ TRecord. Where, TDelayis the delay incurred while transferring a checkpoint to a stable storage, TSysis the delay in sending a system message during checkpointing and TRecord is the delay involved while saving a checkpoint on a stable storage. Similar notations can be defined for TRetrive, The time required to restart at F N is, E = (1 - a To,, )( T S , _ + To,, 1+ a To,, * (Ts,,,+ To,, + T,, + 1 / 2 * To,, ) (5) The equation (5) may be simplified as, E = TS,‘,+ Top, + a [T,, To,, + 1 1 2T,, 1 (6) Then, the total cost of PM in RAQ is, OPtT (Nk) 4-E. (7) The processes are always in consistent state [3, 71 in GRQ and RAQ because either they await for starting or it may be waiting for an event (for I/O or remote resource) to occur. Hence, the synchronization becomes simple. On the other hand, the PSQ contains a huge number of independent processes whose executions are in progress. The processes may also migrate for reasons mentioned in [ 12, 131. Hence, scheduling becomes a complicated issue as the nodes are in communication on the fly with the system in an inconsistent state. To schedule efficiently and elegantly synchronize, the processes in PSQ are partitioned into smaller groups based on their causal dependency [3]. If N, is the number of casually related processes and ‘Csyn’ is the cost required for the process synchronization then, The total cost of PM in PSQ is, OptT(Nk)+ E + Csyn*N,. (8) ,,,ve

,r,e

3

Experimental Results

The experiments were performed on a cluster of PCs under Linux 2.4.18. The cluster consists of 16 dual computing nodes connected by a 100 MB/s Ethernet. The computing nodes are equipped with Pentium I11 processor running at 1 GHz, 128 h4B of main memory and 20 GB of stable storage. All program implementations use

48

the LAMMPI version 1.2.5. Test programs were compiled using the GNU GCC version 2.96. In Addition to the system utilization and I/O overhead the average response timelslowdown time in time sharing systems and waiting time in space sharing system are computed and compared during PM and RE. 1

Probabilidic Model -ProbabilidicModsl

+Probabilistic +BFGS

i

Model

2 5000 i= 4000 r

2

3000 n .42000 a,

p1000 a

z o

0.4 0.5 0.6 0.7 0.8 0.9

0.325 0.35 0.375

0.3

0.4

0.425

0.45

VO Overhead

Utilization

Figure I. Average task waiting time Figure 2. Mean Slowdown versus Vs Utilization in space sharing system I/O overhead in Space sharing system. The system utilization q is given by, 77 = nit,/tm x N , where, N is the

=:,

total number of nodes, ni is the number of nodes used by task i, ti is the execution time for the task i and ,t is = ;, * , for all m tasks. The result of the current model

c

is compared with Gang-Scheduling (GgS) and backfilling combined with Gang Scheduling (BFGS) in the space-sharing environments. Discrete event simulation is used in space-sharing system. Each task requires a specified number of processors, m s for a certain time, and also has a user estimate for its runtime. The tasks are classified as: very short (< 30 sec), short (< 5 min), medium (< 1 hr), long (< 10 hr), and very long (> 10 hr). A single data file supplied by the user specifies its details. The batch size is set to 5,000 tasks, as recommended by MacDougall for open systems under high load [lo]. Depending on the length of the workload, 3 to 10 batches were executed. All experiments use 32 processors with the batch size of 5,000 tasks and the results are presented in figure 1 and 2. From the results, the ability to fill the holes actually improved when the load is very heavy while using the probability models. Further, Figure 2 presents the mean slowdown versus I/O overhead, where each measurement is averaged over 10 different batches. Observe that, the probabilistic model outer perform with 72% improvements for the VO overhead (0.425 or higher) of GgS and give approximately 38 % better results for BFGS.

49

4

Conclusions

The experiments are conducted in space sharing environments. The LS assesses for the resource availability at the HN periodically during the process run. The GS reduces the task waiting time and the average process response time by employing three different queues. The choice of best optimal resource node is determined for avoiding the frequent process migration. Using the conditions either (PN~ < PN, and Opt (Nj)) or (PNj= l), (where Ni is HN and Nj is a FN) the optimal resource (Opt(N,)) is chosen from either Opt@,) or OptT(N,). The LS attempts to equally distribute the resources in the cluster during the process run. Thus, the proposed model utilizes the unused system resources in the cluster network.

References 1. T.E. Anderson, D.E. Culler, and D.A. Patterson, and the NOW team, “A Case for NOW,” IEEE Micro, vol. 15, no. 1, pp. 54-64, Feb. 1995. 2. Board, J. A., Et.al. “Modeling bimolecular: Larger scales, longer durations”, IEEE Computational Science and Engineering, 1(4), 1994. 3. Chiang, S-H., Arpaci Dusseau, and Vernon, M.K., “The Impact of More Accurate Requested Runtimes on Production Job Scheduling Performance,” Job Scheduling Strategies f o r Parallel Processing, pp. 103-127, SpringerVerlag, 2002. 4. DEJAN S. Et.al. “Process Migration”, ACM Computing Suweys, Vol. 32, No. 3, pp. 241-299, September 2000. 5. Gopalan, N.P., and Nagarajan, K., “Self-Refined Fault Tolerance in HPC using Dynamic Dependent Process Groups”, Lecturer Notes on Computer Science (LNCS), OSpringer-Verlag, LNCS 3741, pp. 153 - 158, Dec 2005. 6. Gopalan, N.P., and Nagarajan, K., “Probabilistic Load Balancing with Optimal Resources Set of Nodes in Clusters with Minimal Transfer Cost using MPI”, ACCST Research Journal. (Accepted) 7. Gong, L., Et.al., “Performance Modeling and Prediction of Non-Dedicated Network Computing,” IEEE Trans. Computers, vol. 51, no. 9, pp. 1041-1050, Sept. 2003. 8. Harchol-Balter M., and A. Downey, “Exploiting Process Lifetime Distribution for Dynamic Load Balancing,” ACM Trans. Computer Systems, vol. 15, 1997. 9. MacDougall, M.H., Simulating Computer Systems: Techniques and Tools, MIT Press, 1987. 10. Nudd, G.R., Kerbyson, D.J., Papaefstatathiou, E., Eta1 “PACE: a toolset for the performance prediction of parallel and distributed systems”, Int. J. High Perform. Computer Applications, 14, (3), PP 228 -251,2000.

A NEW HEURISTIC ALGORITHM FOR MULTI-DIMENSION MULTIPLE-CHOICE KNAPSACK PROBLEM IN DISTRIBUTED SYSTEM MD. WASELUL HAQUE S A D D , MIRZA NAZRUL ALAM,MD. AL MAMUN, A. H. M. SARWAR SATTAR, MIR MD. JAHANGIR KABIR AND MD. RABIUL ISLAM Department of Computer Science & Engineering Rajshahi University of Engineering & Technology, Bangladesh E-mail: { whsadid, mirza58bd, cse-marnun)@yahoo.com The Multi-Dimension Multiple-Choice Knapsack Problem (MMKP), a variant of the classical 0-1 Knapsack Problem (KP), is an NP-Hard problem and its exact solution is not feasible for real time problems. Therefore heuristic based approximation algorithms are developed for MMKP. The MMKF' has a knapsack with a multidimensional capacity constraint and groups of items where each item having a utility value and a multidimensional weight constraint. The problem is to maximize the total value of the items in the knapsack but with the constraint of not exceeding the knapsack capacity. The existing heuristics do not perform better for large number of MMKP data sets. In this paper we proposed a distributed algorithm D-MHEU that achieves approximately 95.5% of the optimal value with much reduced time complexity.

1

Introduction

In this paper we have developed a distributed heuristic algorithm named D-MHEU for Multi-Dimension Multiple-Choice Knapsack Problem (MMKP). The algorithm is implemented and we get the total value achieved by this algorithm. For the same data, we have also executed the M-HEU [l, 21 algorithm, which is a sequential algorithm and achieves 96% of optimal value. Our solution is then benchmarked with the result of M-HEU and we find that it is on an average approximately 99% to the value of M-HEU. But the time requirement of our algorithm is reduced significantly with increasing the number of clients. 2

Background and Preliminaries

The MMKP is a variant of the KP [l, 2,4]. Let there be n groups of items. Group i has 1, items. Each item of the group has a particular value and it requires m resources. The objective of the MMKP is to pick exactly one item from each group for maximum total value of the collected items, subject to m resource constraints of the knapsack. Toyoda [5] applied the greedy method to solve Multidimensional Knapsack Problem (MDKP). Khan [2] has applied this concept to provide a heuristic, HEU to 50

51

solve the MMKP. Later, M-HEU is developed by Akbar et al.[ I], modifies HEU and consists of the following three steps [ 11: h Finding a feasible solution by upgrading the selected items of the groups. h Improving the solution value by selecting a feasible higher-valued item from the groups subject to resource constraints, i.e., by feasible upgrades. P Improving the solution value by one infeasible upgrade followed by one or more downgrades.

3

Distributed Algorithm D-MHEU

In D-MHEU, the solution is upgraded by choosing an item with the highest relative change of aggregate resource consumption, AU; . Aarv= {v,b }is a vector where v = Aav, if Aav is positive or AvvI Aav, if Aav is negative and b =1, if Aav is positive or 0, if Aav is negative Here the change of aggregate resource consumption, Auv

the change of revenue,

AVU =

= C(rjP[;]k- r g k ) x

- vg and ck = Crjpp[;1k, where

ck and

hi]is the currently

selected item in group i. We have assumed that the initial solution is feasible. The items in each group are sorted according to their value and then the sorted data are distributed among different client by the server. In our algorithm the total number of group are distributed by assigning [n/pl groups to each of the (p-1) number of clients and the rest of the groups to the last client. The clients choose an item with the highest AU; and send the index of the group number and item number with highest value ofAui;., then server upgrades the original solution based on this received data. The selection for solution is made as much as possible subject to the resource constraints with the single calculation of AU; . This is done until the solution is feasible. The overall complexity of D-MHEU is -(. P

+ C) , where c

is the communication delay, proportional to the number of processors. 4

Performance Analysis

In order to study the performance of our algorithm, we have performed experiments on an extensive set of problem sets with both D-MHEU and M-HEU. Figure 1 shows the performance of D-MHEU, Figure 2 (a) shows the time comparison of DMHEU with M-HEU and Figure 2 (b) shows the time requirement of D-MHEU for different number for processors by varying the number of groups.

52

3t

r,-...~...~...~.~~=.~.

loo]

c

--D-MHEU(Correla1ed) I--.

z

99

50

25

0

100

75

Number of Groups

Figure 1. Performance of D-MHEU normalized with M-HEU for the MMKP data sets with 1=5 and m=5.

f

--CM-HEUIUncorrelatcd)

;

.I

i

3

I 0

r;

I

3 20 G

$

i

F

o

200

400

600

Number of Groups

2 (a)

800

1000

140 120

--Cn=500

100

+n=1000 +"=I500 --en=2000 +n=2500 +n =3000 +n=3500 -n=4000 -n=4500 -R-n=5000

:: 40 20

o 2

3 N u m b e r o f Processors

2

0)

4

Figure 2. Time required by D-MHEU and M-HEU for the MMKP data sets with m=10 and [=I0 by varying (a) Number of groups, @) Number of processors.

5

Conclusion

It is our main goal to reduce the time requirements for solving large MMKP problems. It is observed that D-MHEU achieves approximately 99% of the value of M-HEU but the time requirement is much less than that of M-HEU. If all the steps of M-HEU can be completely implemented in distributed system then the performance of D-MHEU will be increased. We may consider to cope this in our algorithm. Some other heuristics as C-HEU, I-HEU will be considered also in future for distributed algorithm. References 1. Akbar M. M., Manning E. G., Shoja G. C. and Khan S., Heuristic Solutions for the Multiple-Choice Multi-Dimension Knapsack Problem, International Conference on Computational Science, San Fracsisco, California, (2001) 659-668. 2. Khan S., Li K. F., Manning E. G. and Akbar M. M., Solving the Knapsack Problem for Adaptive Multimedia Systems, Studia Informatica Universalis,Vol.2( 1),(2002)16 1 - 181 . 3. Martello S . and Toth P., Algorithms for Knapsack Problems, Annuls of Discrete Mathematics, Volume 3 1 , (1 987) 70-79. 4. Newton A. H., Sadid W. H. and Akbar M. M., A Parallel Heuristic Algorithm for Multiple-Choice Multi-Dimensional Knapsack Problem, 6'hInternational Conference on Computer and Information Technology, Bangladesh, (2003). 5 . Toyoda Y., A Simplified Algorithm for Obtaining Approximate Solution to Zero-one Programming Problems, Management Science, Vol. 21, (1975)1417-1427.

THE C O M P L E X I T Y OF THE PK P A R T I T I O N P R O B L E M A N D R E L A T E D P R O B L E M S IN B I P A R T I T E G R A P H S

J. MONNOT Uniuersite' Paris Dauphine, LAMSADE, CNRS UMR 7024, Place du Mare'chal d e Lattre de Tassigny, 75775 Paris cedex 16, Bunce E-mail: [email protected] S. TOULOUSE

Uniuersite' Paris 13, LIPN, CNRS UMR 7030, 99 au. J.-B. Cle'ment, 93430 Villetaneuse, fiance E-mail: [email protected]

'

In this paper, we continue the investigation made in about PI, packing problems, focusing here on their complexity, according t o both the constant k and the maxk P is NP-complete ~ ~ ~ in imum degree of the input graph: we prove that P bipartite graphs for any k 2 3 in graphs with maximum degree 3 (and this even if the graph is planar), whereas it becomes polynomial-time computable in graphs with maximum degree 2. T h e obtained results actually extend t o the minimum k-path partition problem and t o the maximum PI, packing problem. Moreover, we provide new approximation results for those two problems, when k is worth 3.

1. Introduction

consists, given a simple graph The PI,partitioning problem (P~PARTITION) G = ( V , E )on k x n vertices, in deciding whether there exists or not a partition of V into vertex-disjoint PI,,where a PI, is a path on k vertices. This problem has been proven to be NP-complete for any k 2 3, polynomial otherwise. PI,PARTITION has been widely studied in the literature, mainly because its relationship with the minimum k-path partition problem (MIN~-PATHPARTITION) and the maximum PI, packing problem (M A X P ~ P A C K I NofGwhich ), applications are in particular broadcasting in telecommunication networks and the minimum vehicle routing problem l1l2, respectively. This former consists in computing a minimum-size partition of V into vertex-disjoint paths of length at most k - 1, whereas this latter consists in finding a collection of vertex-disjoint PI, that is of 1039,11

53

~

~

54

Figure 1. The gadget H ( c i ) when ci is a 3-uplet.

maximum-size (of maximum weight when considering the weighted case, M A X W P ~ P A C K I N The G ) . pieces of evidence that are not presented here may be found in '. 2. Complexity results

The minimum k-path partition problem is intractable, even on some special graph classes (comparability graphs lo, cographs 9 , bipartite chordal graphs l o , when k is part of the input); nevertheless, it turns to be polynomialtime solvable in trees 1 1 , in cographs when k is fixed and in bipartite permutation graphs l o . Most of the proofs of NP-completeness actually also establish the NP-completeness of P~PARTITION.

'

Theorem 2.1. P k PARTITION, M A x PPACKING, ~ MINk-PATHPARTITION both are NP-complete in bipartite graphs with m a x i m u m degree 3, f o r any k 1 3; this facts even holds i f the graph is planar, f o r k = 3. Proof. We only prove the even case. The proof puts to the fore a reduction from the k-dimensional matching problem (kDM in short), which is known to be NP-complete 5 , to P~PARTITION. An instance of kDM consists of a subset = ( ~ 1 , .. . ,c,} C x1 x . . . x x k of k-tuples, where X I , . . . , X I , are k pairwise disjoint sets of size n. A matching is then a subset M C C such that no elements in M agree in any coordinate, and the purpose of kDM is to answer the question: does there exist a perfect matching M on C, that is, a matching of size n? Starting from an instance I = (C, X1 x . . . x X k ) of kDM, we build an instance G = ( V , E )of P~PARTITION, where G is a bipartite graph of maximum degree 3, as follows: 0 To each k-tuple ci E C, we associate a gadget H ( c i ) that consists of a collection {Pi>', . . ., of k vertex-disjoint P k i,q {Pi>Q = { a ; q ! . . . ,ak } } q = l,...,k , which are connected by the means of the edges [ U ~ ~ , U ; ~ for + ~ q] = 1 to k - 1 in order to form a (k 1)-th PI, i,k { a ) 1 , . . . ,al } (see Figure 1 for an illustration when k = 3). 0 To each element ej E X I U . . . U X k , we associate a gadget H ( e j ) that consists of a cycle {wi,. . . ,v i j + l , wi} on N j 1 vertices, where dj counts

c

+

+

55

the number of k-tuples ci E C that contain ej and N j = k(2dj - 1). The vertices of index 2 k ( p - 1) 1 for p = 1 to d j will be denoted by 1;. 0 Finally, for any couple ( e j ,ci) such that ci qth coordinate is worth ej, the. two gadgets H(ci) and H ( e j ) are connected by the means of an edge . [a;q,li(i,ql].Those edges must be chosen in such a way that each vertex 1; from any H ( e j ) gadget will be connected to exactly one gadget H(ci). This construction obviously leads to a bipartite graph G of maximum degree 3 within polynomial time in the input size; furthermore, we claim that there exists a perfect matching M C C iff there exists a partition P* of V ( G )into Pk. The argument is based on the following property:

+

Prop 2.1. ( i ) In any Pk-partition P of V ( G ) ,and for any i E [ l , m ] , P contains either Pior Qi, where Piand Q are defined as (see Figure 2 for an illustration from 3DM): Qi E [ l , m ] , Q q €[ l , k ] , P i l q = { ~ ~ ~ , . . . , ~ ~ ~Qilq ~ l = ~ ~{ qU }~ ,~ , Qi E [ l , m ] , Pi= Ut=lPiiq U { U ~ ~ , U. .>.,af"}, ~ , Qi = Ut=lQiiq

. . . , U ~ ~ , U ~

denotes the vertex from some H ( e j ) t o which a;q is connected. (ii)In any Pk-partition P of V ( G ) ,and for any j E [l,kn], P contains one of the collections {Pg}p=l,,,,,dj,where Pi is defined as (see Figure 3 for an illustration):

where

li1q

Q j E [ l , k n ] , V pE [1,dj], Pi =

Pi U QL

Pi

where denotes the path PiiQfrom some H ( c i ) to which 1; belongs and Q; is the unique possible Pk-partition of V ( H ( e j ) )\ {l;}. Hence, from a partition P* t o a perfect matching M : set M = I PiC P*}; from a perfect matching M to a partition P*: put Pi in P* if ci E M , put Qi otherwise; then complete the partition adding the appropriate collections Qjp. Finally, concerning the planar case, consider that the proposed construction transforms an instance of PLANAR 3DM-3, which still is NP-complete, into a planar graph. 0 {ci

If we decrease the maximum degree of the graph down to 2, the problems M A X P ~ P A R T I T I O MNA, X P ~ P A C K Iand N G MIN~-PATHPARTITION obviously turn to be polynomial-time computable. The same fact holds for M A X W P ~ P A C K I Nalthough G, its establishment is a little bit complicated.

Proposition 2.1. MAXWPkPACKING is polynomial in graphs with maximum degree 2, for any k > 3.

56

li,l

0

Pi

Figure 2.

Figure 3.

li.2

0

p,3

a

Qi

Two possible vertex partitions of H ( c i ) into 2-length paths.

One of the d j possible vertex partitions Pi of H ( e j ) into 2-length paths.

Proof. We reduce the problem of computing an optimum solution of MAXWP~PACKING in graphs with maximum degree 2 t o the problem of computing a maximum weight independent set (MAxWIS)in a chordal graph, which is known to be polynomial '. Let I = (G,w) be such an instance of MAXWP~PACKING and assume that G consists of a single (necessarily elementary) chain. We thus build the instance ( H ,w) of MAxWIS where the vertex set of H represents the PI, of G and two vertices v{ # v: are linked in H iff the corresponding paths Pi and Pj of the initial graph share a t least one common vertex. Hence, the set of independent sets in H corresponds to the set of Pk-packings in G. Now, one has to observe that H is chordal, or, equivalently, recursively simplicia1 in order to conclude.0 3. Approximation results

Whereas there is (to our knowledge) no specific approximation results for MAXWP~PACKING, one can deduce from the maximum weighted k-packing problem that the problem is - &)-approximable (and thus, in particular, MAXWP~PACKING is (; - &)-aproximable). Note that the special case where the graph is complete has been most widely studied (see, for instance, 6)7). The same lack of knowledge holds for MIN~-PATHPARTITION nevertheless, let us note that MINPATHPARTITION (where there is no longer a constraint on the path length) may not be constant approximable since, otherwise, we could obtain a PTAS for the traveling salesman problem with weights 1 and 2 when opt(I) = n,which is not possible, unless P = N P . The approximation results that are stated here are mainly obtained by the means of matching heuristics. For lack of space, we only provide the algo-

(A

57

rithm for MIN3-PATHPARTITION.

Theorem 3.1. M A X W P ~ P A C K I NisG1/2-approximable in bipartite graphs with maximum degree 3 and this ratio is tight. This results holds in the unweighted case without any constraint on the degree of the graph. Theorem 3.2. MIN3-PATHPARTITION is 3/2-approximable in general graphs and this ratio is tight.

Minimum 3 - P a t h p a r t i t i o n 1 Compute a maximum matching M I on G ; 2 Build a bipartite graph G2 = ( L ,R;E2) defined by:

L Mi, R V \ V(M1) E2 : [l[z,g~, ru]E E2 iff [x,4

E E A [Y,74 E E ; 3 Compute a maximum matching M2 on G2; 4 Output P the 3-paths partition deduced from MI, M Z and V V ( M 1 lJ M2).

\

References 1. E. Arkin, R. Hassin. On local search for weighted packing problems. Mathematics of Operations Research, 23: 640-648, 1998. 2. C. Bazgan, R. Hassin, and J. Monnot. Approximation algorithms for some routing problems. Discrete Applied Mathematics, 146: 3-26, 2005. 3. M. Dyer, A. Frieze. Planar 3DM is NP-complete. J . Algorithms, 7:174-184, 1986. 4. A. Frank. Some Polynomial Time Algorithms for Certain Graphs and Hypergraphs. Proceedings of the 5th British Combinatorial Conference, Congressus Numerantium X V, Utilitas Mathematicae, Winnipeg, 211-226, 1976. 5. M. R. Garey, D. S. Johnson. Computers and intractability. A guide to the theory of NP-completeness. C A, Beeman, 1979. 6. R. Hassin, S. Rubinstein. An Approximation Algorithm for Maximum Packing of 3-Edge Paths. I n . Process. Lett., 63: 63-67, 1997. 7. J. Monnot, S. Toulouse. Approximation results for the weighted P4 partition problem. F.C.T.’2005,LNCS 3623, 377-385, 2005. 8. J. Monnot, S. Toulouse. The complexity of the Pk partition problem and related problems in bipartite graphs. Rapport LIPN 2005-04 (research report). 9. G. Steiner. On the k-Path partition problem in cographs. Cong. Numer., 14789-96, 2000. 10. G. Steiner. k-Path partitions in trees. T. C. S., 290:2147-2155, 2003. 11. J-H Yan, G. J. Chang, S. M. Hedetniemi, S.T. Hedetniemi. On the k-path partition of graphs. Discrete Applied Mathematics, 78:227-233, 1997.

ACCELERATING BOOLEAN SAT ENGINES USING HYPER-THREADING TECHNOLOGY TOBIAS SCHUBERT

MATTHEW LEWIS

BERND BECKER

Institute for Computer Science Albert-Ludwigs-University of Freiburg, Germany E-mail: {schubert,lewis, becker}@informatik.uni-freiburg.d e In this paper we analyze the practical advantage of using Intel’s Hyper-Threading Technology (HTT) in the field of SAT solving. Nowadays most Intel Pentium 4 processors support HTT and can act as either one physical or as two logical CPUs. To test the performance of both scenarios we parallelized a sequential state-ofthe-art solver. Experimental results are provided, showing that the distributed approach using the two logical processors outperforms the comparable sequential algorithm - executed on the physical processor - by about 20% on average.

1. I n t r o d u c t i o n The problem of proving that a Boolean formula is satisfiable (SAT) is one of the fundamental NP-complete problems in computer science. It is not only of theoretical interest but also of practical relevance, for instance in the field of Computer-Aided Design. Many powerful SAT algorithms were developed in recent years: on one hand sequential solvers such as Siege [2] or zChaff [4], and on the other, distributed approaches like PSato [7] or PaSAT [6]. In [5] we presented PaMira, a powerful parallel SAT checking procedure for a cluster of workstations. The core algorithm is based on the state-ofthe-art solver Mira and incorporates all essential features of a modern SAT engine. The parallel programming model has been carried out according to a master/slave concept, using the MPICH implementation [l]of the Message Passing Interface standard. We worked out the main points of our approach and demonstrated the feasibility by a series of experiments. In this paper we focus on a very special multiprocessor system: Intel Pentium 4 processors with Hyper-Threading Technology (HTT). From the user’s perspective this kind of processors either appear as one physical CPU (HTT disabled) or as two logical ones (HTT enabled). The drawback of this technology is that the logical processors are sharing nearly all physical hardware resources. So, it is not clear, if there is an increase in performance when running a parallel SAT solver on the two logical CPUs instead of executing a sequential algorithm in the single-processor mode. To answer this question many experiments are provided in this paper, analyzing both scenarios in detail. For objectively testing we present a reimplementation of PaMira optimized for the 2-processor mode (+ PaMiraHT). In contrast to [5] the master process is entirely eliminated and the two processes are communicating directly with each other. The results show that a distributed SAT algorithm is an interesting alternative when dealing with Intel processors providing Hyper-Threading Technology. To keep the paper self-contained we proceed as follows: first the sequential Mira algorithm is presented. Section 3 focuses on the realization of PaMiraHT, before the experimental results are discussed in Section 4. Finally, a conclusion is given in Section 5. The paper assumes the reader has a good understanding 58

59

of the inner workings of a modern SAT engine and it therefore will not present an overview of SAT checking techniques.

2. Mira In this section we give a brief overview of the most important parts of our sequential Mira algorithm. Like most SAT procedures nowadays Mira is also an extension of the classical Davis-Putnam method. For selecting a free variable and assigning a truth value to it, Mira provides three different selection strategies: zChaff’s Variable State Independent Decaying Sum (VSIDS), Siege’s Variable Move to Front (VMTF), and a combination of both of them switching the heuristic after a predefined period of time (VMTF/VSIDS). The Boolean Constraint Propagation (BCP) procedure - necessary t o detect conflicts and implications is based on zChaff’s Watched Literals, minimizing the total number of clauses that have to be evaluated. In case of a conflicting assignment the reasons for the conflict are analyzed and are stored as a conflict clause. This clause is added to the clause set and contains all information to prevent the algorithm from repeating the error in the future. Again, our approach is similar to zChaff, determining the reasons for a conflicting assignment according to the 1 UIP Learning Scheme. In addition to this, Mira incorporates several other features like implication queue sorting, restarts, and clause deletion. Due to the page limit we skip a detailed summary here and refer the reader to [3]. ~

3. PaMiraHT

We now present the parallelization of Mira. Due to the hardware domain under consideration, PaMiraHT consists of two processes to be executed on the two logical CPUs of an Intel Pentium 4 processor with HTT. Thereby, each process is running a modified sequential Mira procedure as described in Sections 3.1 and 3.2. As the underlying communication protocol we have chosen the MPICH implementation [1] of the Message Passing Interface standard (version 1.2.6 for Linux). It provides a standardized core of C/C++ library routines to easily develop distributed algorithms and to establish communication between the processes by transferring so-called messages. 3.1. Dynamic Search Space Partitioning For parallel SAT solving the overall search space of a given benchmark problem has to be divided into disjoint portions to be treated in parallel. We adopt a dynamic search space splitting technique proposed by Zhang et al. in 1997 [7], which is based on the concept of guiding paths. A guiding path (GP) is defined as a path in the search tree from the root to the current node, with additional information attached to the edges: (1) The variable 21 and its truth value selected at level I . (2) A special flag storing the type of the variable z1: open corresponds to a decision variable, while closed stands for an implication. Due to the definition it is quite clear, that all open variables in a G P are candidates for a search space division, because the opposite truth value has not been checked so far. Based on GPs, the dynamic work stealing method

60

implemented in PaMiraHT is carried out as follows: If one of the two processes is idle during runtime it sends a request to the second one. In principle this process is now free to pick any open variable from its own GP to provide a new unchecked subproblem. In PaMiraHT, the one located closest to the root of the GP will always be selected. Then the encoding of the new portion of the search space is rather simple by taking all entries in the GP that precede the open variable and by flipping the value assigned to the open variable. After sending back the new GP the open variable is marked closed on both processes to guarantee that they are working on disjoint parts of the overall search space. Finally, the two processes continue their search. During the startup phase only one of the two processes is started, solving the complete benchmark problem. The second process remains idle and as a consequence the search space will be divided after the first decision. 3 . 2 . Knowledge Sharing Usually, in a parallel SAT solver the processes independently generate conflict clauses for their own use (in [6] also referred to as lemmas). Every conflict clause generated by a conflicting assignment of the variables is a piece of information that the corresponding process has learnt about the problem and that might be helpful t o cut off parts of the search space. Knowledge that directly forces a conflict on the second process is especially helpful in increasing the performance of the overall system. In these cases the receiving process could immediately stop its analysis of the current part of the search tree, because it has been already proven unsatisfiable. We have put this idea of Knowledge Sharing into practice using two phases: selecting/broadcasting suitable clauses, and the integration of them into the database of the receiving process. 3.2.1. Selecting and Broadcasting Clauses

A priori, it does not seem t o be clear, which criteria should be used t o determine the clauses that are shared. In [5] we introduced four different Clause Selection Strategies and empirically evaluated their effectiveness. It clearly turned out that “selecting all conflict clauses not exceeding a fixed maximum clause length and containing at least one literal of the initial GP” is the most powerful heuristic. This is also the strategy we focus on in this paper. So, only clauses that are satisfying this criteria will be selected by the processes. To reduce the communication overhead, each process is using its own clause buffer to transfer several conflict clauses as one single MPI message, containing the complete buffer. All suitable conflict clauses will be stored in this buffer and forwarded to the other process only when it is completely filled. In Section 4 the maximum clause length used by the heuristic is specified by the parameter Maximum Clause Length, while the number of clauses that could be stored in the clause buffer is denoted by Buffer Size. 3.2.2. Integrating Clauses

In regards to the Buffer Size the receiving process typically gets more than one lemma per communication task. First, all conflict clauses are added to the database. Afterwards, the clauses are checked one by one and the necessary

61

operations depending on the clause status - will be performed immediately. For instance, if a received clause directly forces a conflict, the conflict analysis procedure is called and backtracking will undo any wrong decisions. A complete overview of all special cases can be found in [ 5 ] . ~

4. Experimental Results

To measure the performance of our approach we conducted a large number of experiments comparing PaMiraHT with our sequential Mira solver. All comparisons were executed on a Pentium 4 processor with HTT running at 3.2 GHz and equipped with 1 GB RAM / 1024 kB L2 cache. As mentioned in Section 2, Mira provides three decision heuristics that can be selected independently for the two processes. We evaluated two setups: In the first one - Standard - the two processes are running the same decision heuristic switching between VMTF and VSIDS after each restart. In the second one - Advanced - the processes are using different strategies: VSIDS on process 1 and VMTF on process 2. Additionally, we combined these two setups with varying parameters with respect to the clause buffers. In this paper we present the two fastest runs including knowledge sharing and compare them to the similar configurations without exchanging conflict clauses between the processes. Table 1. Parameter settings of the performed configurations of PaMiraHT. Setup

# Processes

s1

2

s2

2

s4

2 2

s3

Decision Heuristic Standard Advanced Advanced Standard

Knowledge Sharing No No Yes Yes

Buffer Size

Maximum Clause Length

-

-

50 25

15 10

A total of 31 medium-size benchmark instances from the SAT2002 Competition were selected. All of them are unsatisfiable industrial problems. The particular class of benchmarks was chosen in such a way that it is complex enough to objectively evaluate the performance of PaMiraHT and small enough to allow running multiple tests within a reasonable time frame. The CPU times Mira and PaMiraHT needed to solve all 31 benchmarks are given in Table 2. In the first two setups knowledge sharing was disabled. The results outline, that PaMiraHT is faster, when the two processes are selecting the variables according to different decision strategies. Of all runs using the default decision heuristic of Mira (Standard), configuration S4 was performing best and leads to a 5% speedup compared to Mira. Among all tested configurations of PaMiraHT, setup S3 was the most powerful one outperforming Mira by about 20% on average! Comparing configurations S2 and S3 shows that the cooperation between the two processes results in an additional speedup of 8%. 5 . Conclusion

In this paper we presented PaMiraHT, a distributed SAT checking procedure. The underlying sequential algorithm includes many of the optimization

62 Table 2. Experimental results. All run times are in seconds and are always t h e sum of the CPU times needed to solve all instances of the corresponding benchmark subset.

Benchmark dp09u08 f2clk-30 fifO8-200 wO8-10 w10-70

rand-net40 ezfact48-6/7/8/9/10 5cnf-3900-3900-060

pyhala-braun-30-4-1/2/3/4 Urquhart-SS-b1/7/10 avg-checker-5-34 grid-10-20 ro~e-0150/0300 xi-32 ’ Total

SDeeduD

#Inst. 1 1 1 1 1 7 5 1 4 4 1 1

2 1 31

Mira 142.11 136.60 88.50 75.32 47.46 1300.46 584.45 491.50 442.83 800.35 145.49 43.47 206.56 92.21 4597.31

-

s1 98.45 197.78 144.46 85.30 78.48 1280.57 398.42 278.48 201.90 1139.68 159.10 50.22 285.90 139.15 4537.89 1.01

s2 47.61 78.96 138.26 59.08 55.63 955.50 344.71 170.20 165.37 1505.02 123.50 82.15 369.82 71.93 4167.74 1.10

s3 51.67 76.42 109.48 53.45 33.16 898.93 358.08 199.68 162.09 1273.01 118.15 157.85 269.92 83.46 3845.35 1.20

54 109.20 148.73 136.82 90.95 68.78 941.23 400.18 287.92 221.54 1255.05 126.66 156.64 300.28 116.81 4360.79 1.05 ~~~

techniques introduced in recent years. For the parallel execution a dynamic work stealing method and a mechanism t o exchange relevant information between the processes was integrated into PaMiraHT. Since the goal of this work was to analyze t h e effects of Intel’s Hyper-Threading Technology in the area of SAT solving, both features were optimized for a 2-processor scenario. The experimental results clearly demonstrate t h a t PaMiraHT - using the two logical CPUs of a Pentium 4 processor - is on average 20% faster than the comparable sequential algorithm, performed on the physical CPU only. So, distributed SAT algorithms are an interesting alternative when dealing with hyper-threading enabled processors. Lastly, with current trends leading towards multicore CPUs, the work presented here should allow for a good starting point when developing SAT algorithms for future CPUs.

References 1. W. Gropp, E. Lusk, N. Doss, and A. Skjellum. A high-performance, portable 2.

3. 4.

5. 6. 7.

implementation of the MPI message passing interface standard. Parallel C o m puting, 22 (6):789-828, 1996. R. Lawrence. Efficient Algorithms for Clause-Learning SAT Solvers. Master’s thesis, Simon Fraser University, 2004. M. Lewis, T . Schubert, and B. Becker. Speedup Techniques utilized in Modern SAT Solvers - An Analysis in the MIRA Environment. In 8th International Conference o n Theory and Applications of Satisfiability Testing, 2005. M.W. Moskewicz, C.F. Madigan, Y. Zhao, L. Zhang, and S. Malik. Chaff Engineering an Efficient SAT Solver. In Design A u t o m a t i o n Conference, 2001. T. Schubert, M. Lewis, and B. Becker. PaMira - a Parallel SAT Solver with Knowledge Sharing. In 6th International Workshop o n Microprocessor Test and Verification, 2005. C. Sinz, W. Blochinger, and W. Kiichlin. PaSAT - Parallel SAT-Checking with Lemma Exchange: Implementation and Applications. In LZCS 2001 Workshop o n Theory and Applications of Satisfiability Testing, 2001. H. Zhang, M. Bonacina, and J . Hsiang. PSATO: a Distributed Propositional Prover and its Application to Quasigroup Problems. Journal of Symbolic C o m putation, 21(4) :543-560, 1996.

A NEW STRATEGY FOR SOLVING MULTIPLE-CHOICE MULTIPLEDIMENSION KNAPSACK PROBLEM IN PRAM MODEL MD. WASELUL HAQUE SADID, MD. RABIUL ISLAM AND S. M. KAMRUL HASAN Department of Computer Science & Engineering, Rajshahi University of Engineering & Technolom, Bangladesh E-mail: [email protected], rabiul-cseayahoo. corn, khrajeeb@yahoo. ca DR. MD. MOSTOFA AKBAR Department of Computer Science and Engineering, Bangladesh University of Engineering and Technologv, Bangladesh E-mail: [email protected]. bd This paper presents a new heuristic algorithm for the Multiple-Choice Multi-Dimension Knapsack Problem (MMKP) in PRAM model. MMKF' is a variant of the classical 0-1 knapsack problem, has a knapsack with multidimensional capacity constraints, groups of items, each item having a utility value and multidimensional resource constraints. The problem is to maximize the total value of the items in the knapsack with the constraint of not exceeding the knapsack capacity. Since MMKP is an NP-Hard problem, its exact solution is not suitable for real time problems, so heuristic based approximation algorithms are developed. We present a parallel heuristic algorithm here that mns in O(lognl(log n + log rn + log log l ) ) time in CRCW PRAM machine; taking O(n log n(log n + ml))operations. Experimental results show that it achieves 95% optimal solution on average.

1

Introduction

Multiple-choice Multi-Dimension Knapsack Problem ( M W ) [ 1, 3 , 4 ] is a variant of the classical 0-1 KP, where there are different groups of items and exactly one item can be picked up from each group; and the constraints are multi-dimensional. Let there be n groups of items, Group i containing li items, Item j of Group i has a utility value vy and an m dimensional resource cost Ti/= (Ti/,,rV2,..., r V m ).The

resource constraint

fi = ( R , , R, ,..., R,

is to maximize the utility value

is also m dimensional. The M M K P problem

)

v = c?1=1

z y = l z ) = l x g r g k I Rk where1 I k I m

x . . v . . subject to the resource constraint f=l g g

, xij

E

(0,l) a n d ~ ) = l ~ = i i1 . Figure 1

illustrates an MMKP with 3 groups and 2 resource requirements. Values and resource requirements of each item are shown inside the boxes representing items. Objective is to pick up exactly one item from each group to maximize the total 63

64

value of picked items maintaining items I 1

5

.

m

I

v=10 rl=5, r2=7

Crlof picked

V=15 r1=6, r1=3

items I 17 and

r1=7, rl=7

Cr2of picked

Maximum allowable

Resource rl=4, r1=7

r1=8,r1=2

rl=5, r2=5

r1=6, r2=4

rl=5, r1=3

Type rl: 17 Type r2: 15

1-1 Figure 1. Example of an MMKP

2

Related Works

There are different algorithms for solving variants of the classical 0-1 KP. Toyoda [6] proposed a new measurement called aggregate resource consumption by applying the greedy method to the Multi Dimensional Knapsack problem (MDKP). Khan [4] has applied the concept of aggregate resource consumption to pick a new item in a group to solve the MMKP. His heuristic HEU selects the lowest-valued items by utility or revenue of each group as initial solution. It then upgrades the solution in the subsequent iterations. Later, M-HEU, a heuristic developed by Akbar et al.[l], modifies HEU by adding a pre-processing step to find a feasible solution if the initial solution is not feasible. It also uses a post-processing step to improve the total value of the solution found by HEU with one upgrade followed by one or more downgrade. Knapsack problems also have been subject to much work on the development of efficient parallel algorithms [2]. For exact methods Branch-and-Bound and Dynamic programming (DP) are the most useful approaches. A DP algorithm for 01 KP may run in O(nc lo&)/ p + n log p ) for p processors, where c is the capacity of the knapsack [2]. A pipeline-architecture containing a linear array of p processors has a time complexityO(nc/p + M ) [2]. A divide and conquer approximation algorithm with a time complexityo(ios2 (n)iog(c)) on (,I&) processors is presented in [2]. Another approximation algorithm [2] realized on the processors. hyperbolic architecture with a time complexityo(iog2(n)ios(c)) on

&*)

3

Parallel Heuristic Algorithm for MMKP

In this section we present a PRAM heuristic algorithm for solving MMKP. We start with an initial feasible solution and try to reach a near-optimal solution by upgrading the solution iteratively. Items of each group in the MMKP are sorted in

65

non-decreasing order according to the value associated with them. First the relative change in aggregate consumed resource, A d v of each item is calculated in parallel for each group. A d , = {v,b }is a vector, where

'I/

A U 'I ~= ( A V . .

AU ij'

0) I' f

< 0, or ( A U ~ , I )if

AU i j -

AU#

>o

And Aarv>Aark~, if b(AdG)>b(Auru)or (b(Aa',)=b(Adkl) and v ( A a r o ) > v ( A d ~ ) ) Where, the change of aggregate resource consumption, - rVk ). ck and the change of revenue, A V .I/. = v zp[i] . -vij. A ~. =. 1 ' I k

Ck

= Crip[i1k

,p[i] is the currently selected item in group i.

Then we find the items with the highest A d v from each group in parallel. The items found in the previous step, are sorted in non-decreasing order with respect to the value of their relative change in aggregate consumed resource using pipelined merge sort algorithm. We want to upgrade our current solution by replacing nN2h items, termed as 'scheduled upgrade' in Iteration h with larger valued items without violating the constraints. If it is greater than n, we cannot take them all and hence we have to consider only n items in such cases, which is 'tentative upgrade' and the difference between tentative upgrade and scheduled upgrade is defined as pending upgrade. In every iteration we have tentative upgrade = min (pending upgrade + scheduled upgrade, n). Then the prefix sum is computed of the resource consumptions of the tentative upgrade items. Finally, we find out the maximum number of items for which the solution will be feasible and these items will be upgraded. The PRAM algorithm for MMKP is given below: Step 1: Construct the initial solution with the lowest valued item in each group. Step 2: In Iteration h (0 I h I log n o , execute the following sub-steps. Step 2.1: For all group, relative aggregate resource consumption, A d , of each item having a higher utility value than the item selected from the same group are computed in parallel. Step 2.2: Find the item with highest A d , in each group using parallel maximum finding algorithm. Step 2.3: Sort the items found in Step 2.2 in descending order of A d , using parallel sorting algorithm. Step 2.4: Compute prefix sums of consumed resources for the first min (pending upgrade + n) items found in Step 2.3 Step 2.5: Using parallel maximum finding algorithm, find the maximum number of items that can be upgraded at a time in this iteration without violating the constraint and then perform the upgrade. Step 3: Deliver the solution if there is no dummy item in the final solution. 3.1

Complexity Analysis

Step 1 and Step 3 can be performed easily in O(1) parallel time, O(n) operations on EREW PRAM.

66

Step 2.1 runs in O(log m + log n) time using O(lnm + nm) = O(lnm) operations on CREW PRAM. Step 2.2 directly employs the parallel maximum finding algorithm on I items in each group and can be done in O(loglog r) time with O(nr) operations on CRCW PRAM. In Step 2.3 we apply the parallel sorting algorithm on n elements which can be done in O(log n) time, O(nlog n) operations on CREW PRAM. Step 2.4 can be performed in O(log n ) time using O(mn) operations in CRCW PRAM. In Step 2.5, we can employ the parallel maximum finding algorithm directly and it will take O(loglog n) time with O(n) operations on CRCW PRAM. Since the Step 2 iterates for lognl time, the overall running time of Step 2 should be O(log nl(log n + log m + log log I ) ) . The number of operations needed is O(nlognl(logn+lm)). The PRAM model required is CRCW.

3.2

Experiment Results

It is not possible to provide a PRAM machine by using normal multiprocessors, so that the above algorithm is simulated as if it runs on a CRCW PRAM. For each set of parameters n, I and m, we used 10 uncorrelated as well as 10 correlated MMKP instances. For the same data we have also executed the upper bound and the MHEU algorithm. Our solution is then benchmarked with the result of upper bound as well as M-HEU and we find that it is on an average 98.5% of the value of M-HEU, and about 95% of the upper bound. h

-Correlated

M -HEW

- - - X - - -U n c o r r e l a t e d

0.92

2

2

-Correlated HEU

0.9 0.88

0

2oo

4oo

,ooo

8oo

6oo

M -HEW

Parallel

--~--Uncorrelated H EU

Parallel

NumberofCroups

Figure 2. Performance of Parallel HEU and M-HEU normalized with the upper bound for the MMKP data sets with 1=10 and m=10

-Correlated

M -HEW

- - - x - - -U n c o r r e l a t e d M - H E U --orre

lated Parallel H EU

2o

3o

4o

5o

6o

--*--Uncorrelated HEU

Parallel

Number ofResource Constraints

Figure 3. Performance of Parallel HEU and M-HEU normalized with the upper bound for the MMKP data sets with n=500 and !=I0

67

-Correlated --H--Uncorrelated

M-HEU

M -HEU

--tC o r r e l a t e d

0

20

40

6o

-. * - -

Parallel HEU U n c o rre l a t e d P a r a l l e I H EU

N u m b e r o f Items

Figure 4. Performance of Parallel HEU and M-HEU normalized with the upper bound for the MMKP data sets with n=500 and m=10

4

Conclusion

W e have presented a parallel heuristic that runs on CRCW PRAM in qnlogi(log?+rnJ) operations O(1og nl(1ogn + logm + loglogl)) time using exploiting o(n1ogn + nlm)/(logn + logm + loglog[) processors. In this algorithm, we have used the same candidate item evaluation criteria as used in HEU by Khan or M-HEU by Akbar. In our algorithm we have tried to replace items from different groups in parallel with the items in their respective groups. Since we only considered upgrading of the solution as in hill-climbing approach, there might be some cases where w e may get stuck in a local maximum. To recover, we have to downgrade our solution and then upgrade later on as in simulated annealing technique. For fkture improvement we may think of adopting this. There is no parallel algorithm for exact solution of MMKP. We may work on that further.

References 1. 2. 3.

4. 5. 6.

Akbar M. M., Manning E. G., Gholamali C. Shoja and Khan S., Heuristic Solutions for the Multiple-Choice Multi-Dimension Knapsack Problem, International Conference on Computational Science, San Fracsisco, California, (2001) pp. 659-668. Alexandrov V. N. and Megson G. M., Parallel Algorithms for Knapsack type problems, World Scienti@cPublishing Co. Pte. Ltd., (1999). Ahmed B., Muqim A. and Sadid W. H., A Heuristic Solution of The Multiple-Choice Multiple-Dimension Knapsack Problem by Constructing Convex Hull in Parallel Method, ZCCSA-2004, San Diego, California, ( 2004) pp. 105-109. Khan S., Li K. F., Manning E. G. and Akbar M. M., Solving the Knapsack Problem for Adaptive Multimedia Systems, Studia Informatica Universalis, Vol. 2( I), (2002) pp. 161-181. Martello S . and Toth P., Algorithms for Knapsack Problems, Annals of Discrete Mathematics, Vol. 3 1, (1987) pp. 70-79. Toyoda Y ., A Simplified Algorithm for Obtaining Approximate Solution to Zero-one Programming Problems, Management Science, Vol. 21, (1975) pp. 1417-1427.

A CONSTANT-TIMESELECTIONALGORITHM ON AN LARPBS MICHAEL AROCK Department of Computer Applications, National Institute of Technology, Trichy 620 015, Tamilnadu, INDIA E-mail: [email protected] RPONALAGUSAMY Department of Mathematics, National Institute of Technology, Trichy 620 015, Tamilnadu, INDIA E-mail: [email protected] (Corresponding author) The selection problem is to determine the q* smallest element in a set S of n elements, for any given positive integer q. This paper proposes a parallel algorithm for the problem on a Linear Array with a Reconfigurable Pipelined Bus System (LARPBS). The algorithm requires constant time only on an LARPBS of O(n) processors, where IS1 = n. We claim that the algorithm is cost optimal, as it matches with the sequential lower bound algorithm employs simple prune-and-search strategy.

1

a(n).The

Introduction

The selection problem has drawn attention of many computer scientists since the advent of sequential computers. It is still continuing even today in the age of High-Performance Computing. In our focus of multiprocessors with optical interconnection networks, many have contributed to the solution of the problem with their algorithms. Li et al. [l] had developed an algorithm with O(1og n) parallel running time for an LARPBS. Cole and Yap [2] gave a solution to the problem with O((1og log n)2) time on Parallel Comparison Tree (PCT) model. Then, Cole [3] individually solved the problem on EREW PRAM model with O(1og n log*n)-time algorithm. Later, Ajtai et al. [4] presented an improved algorithm with O(log log n) time on PCT model, again. Back to PRAM models, Chaudhuri et al. [5] presented an algorithm of O(log n / log log n) time on CRCW variant. All the above-mentioned algorithms employed n processors. Pan [6] developed a scalable parallel algorithm on LARPBS with O(n log n / p ) time employing p processors. Rajesekarn and Sahni [7] found a constant time algorithm with O(n2) processors for an Array with Reconfigurable Optical Bus (AROB) model. At this juncture, we propose a parallel algorithm with 0(1) time on an LARPBS of n processors using prune-and-search approach only. The rest of the paper is organized as follows: Section 2 describes and develops the pseudo-code for a parallel selection algorithm and also mentions some salient features of the model, LARPBS, while section 3 concludes with a discussion. 68

69

2

The Parallel Selection on an LARPBS

2.1 Description

Our algorithm is a recursive parallel algorithm based on a modified version of the linear-time sequential algorithm for selection [S]. Let S be the sequence of elements drawn from a linearly ordered set and q be an integer (21) with which it is to be determined the element whose rank is q. Let IS( = n, n being the input size. We also employ N (= n ) processors on the LARPBS to accommodate the sequence S . Also, let Q be a small integer constant to be found later when analyzing the cost of the algorithm. We employ a model called Linear Array with Reconfigurable Pipelined Bus System (LARPBS). It has an array of N processors PI, P2...PNconnected by an optically pipelined bus. To maintain the propagation delays same, it assumes that the bus has the same length of fiber between consecutive processors. A bus cycle is an end-to-end propagation delay on the bus. The time complexity of an algorithm written for this model is found in terms of time steps, when a single time step consists of one bus cycle and one local computation. Optical switches are incorporated in each segment between every pair of consecutive processors, to bring in reconfiguration facility on the bus. In general, it is possible to configure the bus to accommodate many such independent bus systems depending on computation and communication needs [9]. One of its salient features is given as lemma 1 below. For details, refer to Pan and Li [9].

Lemma 1 [9]:

One-to-one communication, broadcasting, multicasting, multiple multicasting, binary prefix sum computation, and ordered compression can be carried out in O(1) bus cycles on the LARPBS model.

First, we divide S into nlQ subsequences of Q elements each, by reconfiguring the system into n/Q sub-LARPBSs. Since each subsequence has Q (being fixed constant) elements, we can find the median of the subsequence in constant time. Now, recursively repeat the process of subdividing the smaller subsequence of n/Q medians to find the median of the nlQ medians found previously, until n < Q. Once n goes below Q, we can sort and find the qthelement directly. Let M be the median of the medians, residing at P,. PI broadcasts M to all the other processors, using which the processors subdivide the original S into three subsequences SI, S2, S3 of elements of S smaller than, equal to and larger than M, respectively. If IS11 2 q, qth element must be in S1, or if IS11 + I Sd 2 q. the median of the medians, M, is the qth element; otherwise [q- IS11 + I S$ element in S3 is the required element. The complete pseudo-code along with time analysis is presented in the next subsection.

70

2.2 The Pseudo-code and Time Analysis 2.2.1 The Parallel Algorithm

Algorithm: L-PARASEL (S, q ) Input: The sequence S and, an integer q. Let L be an LARPBS with number of processors equal to ISI. Output: The q" smallest element of S. Method: Step 1: if IS1 < Q then sort S and return the q" element directly else i) Reconfigure the LARPBS L into sub- LARPBS, Li, i =1 to (SI / Q . Each L,has Q elements. ii) for i = 1 to n/Q do in parallel a) Each Li finds the median m, of its Q elements in constant time (as Q is a fixed constant). b) Sets a flag (MF) to be 1 for each processor that possesses an mi. Reconfigure all Li's back to the original system L. Step 2: Using ordered compression operation, bring all constant (mi's) of the processors whose MF flags are set to 1. Reconfigure the system L into M' and L' where M' possesses all mi's. The sequence of mi's is also termed as M'. Step 3: Call recursively L- PARASEL(M', IM'I / 2) using the sub-LARPBS M and store the median of the medians, M in the processor PI of the system M . Reconfigure M' and L' back to the system L. Step 4: Broadcast M to all the other processors of L and set the flags LE to 1 for the processors whose elements are less than M. Rest are set to 0. Step 5: By ordered compression, bring all contents (Mi' < M) of the processors whose LE flags are turned 1. Reconfigure M into L1, and L" such that Ll has all Mi' < M. Apply binary-prefix-sum operation on LI (where LE bit-flags are set to 1) to find the count. (Corresponding to described in sub-section 2.1). if lLll 2 q then Call recursively L-PARASEL(L1, q). Step 6: i) Employ one-to-one communication to send lLll to the first processor of L". ii) Sub-divide L" into two sub-LARPBSs, L2 and L3, by setting flags EQ and GT to 1 with respect to condition Mi = M and Mi > M respectively. Apply binary-prefix-summation operation on L2 (where all EQ bit-flags are set to 1) to find the count I L2 I (corresponding to IS21 described in subsection 2.1) at the first processor of L2. It also possesses I L11 sent from L1 as per step 6(i). if I L1l + I L21 2 q then return M

71

else

. Call L-PARASEL( L3, q-( lLll + lbl))

0

2.2.2 Time Complexity Analysis We present the time-complexity requirement analysis of the algorithm in the form of a theorem below along with a lemma. Lemma 2: The LARPBS of Q ( afixed constant) elements can find the median of Q elements in constant time. Proof: As it is directly from lemma 1, we leave it to the reader. Now, we claim 0(1) bus cycles for finding the qthsmallest element in a given sequence S on an LARPBS in the following theorem: Theorem: The parallel algorithm L-PARASEUS, q) runs in O(1)time. Proof For stepl, we can either sort (as at most 4 elements are there) or apply the actual steps presented in the proof of Lemma 2 above. It takes 0(1) time, Q being a fixed constant. Excepting recursive calls, steps 2, 4, 5 and 6(ii) employ any of broadcasting, ordered compression or binary-prefix-sum operations of an LARPBS only, each of which requires 0(1) bus cycles as per Lemma 1. If t(n) denotes the parallel running time for input size n, then the time taken for recursive call made in step 3 with n/Q elements (I M' I = n/Q) can be represented as t(nlQ).From [16], one obtain that either I L1 I or I L3 I is bounded above by 3n/4. i.e. I L1 I I 3 d 4 and I L3 I 5 3n/4. Hence, the final time complexity is obtained by solving the following recurrence relation: ifn
3

Discussion and Conclusion

In this paper, we have proposed a parallel algorithm for selection problem with 0(1) time and n processors. Its cost (the product of time and number processor

72

employed) matches with the sequential lower bound Q(n). Hence, we claim our work is cost optimal too. As the result of a comparative study made among the parallel solutions for the problem in question, we present below a table, Table 1: Table 1. Comparative study of parallel algorithms for selection problem.

References 1. K.Li, Y.Pan, and S.Q. Zheng, (eds.): Parallel Computing Using Optical Interconnections. Kluwer Academic Publishers, Boston Massachusetts (1998). 2. R. Cole and C. K. Yap: A Parallel Median Algorithm. Information Processing Letters. 20 (1985) pp. 137-139. 3. R.J. Cole: An Optimally Efficient Selection Algorithm. Information Processing Letters. 26 (198711988) pp. 295-299. 4. Miklos Ajtai, Janos Komlos, W. L. Steiger, and Endre Szemeredi: Optimal Parallel Selection Has Complexity O(log log n). Journal of Computer and System Sciences. 38 (1989) pp. 125-133. 5. S. Chaudhuri, T. Hagerup, and R. Raman: Computer Science. Springer-Verlag. (1993) pp. 352-361. 6. Y. Pan: Order Statistics on Optically Interconnected Multiprocessor Systems. Opti. Laser Tech. 26 (1994) pp. 281-287. 7. S. Rajasekaran and S. Sahni: Sorting, Selection, and Routing on the Array with Reconfigurable Optical Buses. IEEE Trans. on Parallel and Distributed Systems. 8 (1997) pp. 1123-1132. 8. M. Blum, R.W. Floyd, V. Pratt, R.L. Rivest, and R.E. Tarjan: Time bounds for selection. J. Comput. Syst. Sci. 7 (1972) pp. 448-461. 9. Y. Pan and K. Li: Linear Array with a Reconfigurable Pipelined Bus System Concepts and Applications. Journal of Information Sciences. 106 (1998) pp. 237-258.

REDUCING CROSSING NUMBER OF MULTI-COLOR RECTILINEAR STEINER TREES USING MONOCHROMATIC PARTITIONING

S. MAJUMDER', B. B. BHATTACHARYA2, S. M. A. JAFR13

IIIT, Kolkata 700 091, India, E-mail: s-majQvsnl.com ISI, Kolkata 700 108, India, E-mail: bhargabaisical.ac.in IIIT, Kolkata 700 091, India, E-mail: smajafrzOgmail.com

For wire routing in VLSI design, often the terminals belonging to the same net need to be connected electrically using conducting paths. To minimize wire-length, for each net, the connecting paths may together comprise of either a geometric minimum spanning tree or a rectilinear Steiner tree in case of manhattan routing. If we mark the trees for distinct nets with different colors, then a common goal in global routing is to minimize the number of crossings between the edges of different colors. This reduces the number of vias required and thus helps in better utilization of the metal layers. In this paper, we have proposed a fast and novel technique to do the same. The experimental results obtained are quite encouraging.

1. Introduction

In layout synthesis of a VLSI chip, rectangular circuit blocks are isothetically placed within a rectangular floorplan. To each block, a set of terminals is attached. All the terminals of the same net are to be connected electrically in two phases - global routing and detailed routing. The global phase first generates a loose route for each net and then the actual layout gets fixed in the detailed phase. For most chips, minimizing the total wire-length is the main criterion for routing l . However, minimizing the number of vias is also a common objective for a number of routers, as they are expensive. Vias are used when a wire needs to go from one routing layer to another. Increase in number of vias may decrease the yield of the chip and also introduce parasitic capacitance. This paper is organized as follows. The next section introduces the background of this work. Section 3 presents the solution methodology. The experimental results comes in Section 4 and then the h a 1 section concludes this novel discussion. 73

74

2. Background of the Problem In routing of multi-terminal nets, for each distinct net n, the primary goal is to interconnect all the terminals of n using minimum amount of wire. This is similar to the problem of minimization of total cost of edges in a Steiner tree l . The terminals of the net n are the demand points. All the other grid-points in the underlying grid graph that are not demand points, then become candidates for Steiner points. A Steiner tree with rectilinear edges is called a Rectilinear Steiner Tree (RST). A Rectilinear Steiner Minimum Tree (RSMT) is an RST with minimum cost among all RSTs. The optimal routing in the manhattan model turns out to be finding the RSMT, which is NP-hard. Hwang proved the following theorem 2 .

Theorem 2.1. Let COStMST and CostRsMT be the costs of a minimum cost rectilinear spanning tree and rectilinear Steiner minimum tree, respecCost < 3. tively. Then - 2 So many heuristic algorithms use Minimum Spanning Tree (MST) as a starting point and apply local modifications to obtain an RST, which can have at most 1.5 times the optimal weight 3 . Actually, the edges should be selected to have the maximum overlap amongst themselves. We also follow a similar approach. Given the demand points, we first draw the geometric MST and then convert it to the rectilinear version. We further perform a few local improvements to generate some RST. In recent literature also, Steiner tree generation has received considerable attention 4 . Minimizing total wire-length is however not the only goal of routing. Let k multi-terminal nets on the plane be marked with k distinct colors. We present a strategy for routing these k nets so that the edge-crossings between the trees of different colors in minimized. Each such crossing leads to an use of a via. The intention is to reduce the number of expensive vias that will in turn lead to better utilization of the metal layers.

3. Solution Methodology Separating a set of multi-colored points into monochromatic partitions (cells) using minimum number of axis-parallel straight lines is a well-known problem of computational geometry. It is NP-complete and we use a fast heuristic H f for this problem to solve the problem of this paper. Let us consider k nets { n l , n 2 . ..n k } , and their corresponding sets of terminals {TI, T2 . . . Tk}.We mark the point-positions of the terminals of Ti using color ci. Let P, be the set of all the colored points mentioned above.

75

We want to connect all the points of the same color using rectilinear lines of that color keeping two objectives in mind. The total length of line segments should be minimized and the total number of crossings between segments of different colors should also be minimized. This problem is computationally hard, and it remains so even with only the first objective. Next, we present a heuristic that tries to satisfy both the objectives in a reasonable way. ::.

: :.

:.:

..............................................

. ... .

. . .: j . .

* :

:

............................................... ... .,. .: : .. .

...

11;

.............

.....

. .

~

...............................................

~

1I

............

J; j.

, .....

I I

. .

., ..

Figure 1. A typical example showing the steps of the heuristic

3.1. Steps Followed by the Heuristic H a

The different steps followed H , are shown in Fig. 1. We first run the heuristic Hfon the set of colored-points P, so that the points get segregated into monochromatic cells by a small set of axis-parallel lines L (Fig. l(b)). From the cells that contain more than one point, we choose a representative point to get a reduced set of points P, on the plane (Fig. l(c)). For each

76

distinct color ci, we then use H , to generate an RST ti (Fig. l(d), (e)). Let the set of such RSTs be RG (global RSTs). However, while generating these RSTs we put an additional restriction as described below. Suppose an edge e E ti will connect two points p and q of color ci. Starting from p it either takes a vertical or a horizontal route. Once it hits the boundary of its own cell, it starts using only those isothetic line segments that are subsets of the lines belonging to L. It may use as many line segments till it hits the boundary of the monochromatic cell in which q is placed. It then uses a last horizontal or vertical segment to get connected to q. Finally, in each of the monochromatic cells we use H , locally to connect all the points present in that cell (Fig. l(f)). Let the set of such RSTs be RL (local RSTs). We call two RSTs X and Y to have intersected, if any edge of X geometrically crosses any edge of Y . From the method of construction, it follows that any two local RSTs do not intersect among each other. Also, a local RST will not intersect with a global RST of different color. Two global RSTs however may have intersections among themselves (Fig. l(g)). If we had directly used H , to connect the points of same color forming an RST instead of doing the monochromatic partitioning, the total wire-length would have been less. However by our method the number of crossings amongst multi-color edges will be less compared to the direct approach.

4. Experimental Results

We have tested the performance of H , on a set of randomly generated problem instances. Table 1 presents the comparative study of our strategy against the direct approach in terms of total wire-length and number of crossings. The results have two broad divisions. One of them is called RSTbased that is exactly same as discussed in the previous section. The other one is MST-based, where the local RSTs are being replaced by MSTs in Eucledian metric. In case of RST(MST)-based experiment, we compare our results with RSTs (Eucledian MSTs) being drawn directly with the original set of points. We have named our two-step approach as hybrid approach. Note that, heuristic H , has been significantly successful in reducing the number of crossings among the edges of different colors in both types of experiments. The total wire-length increases from the direct approach as predicted earlier. However, the percentage of increase is much less for RSTbased experiment, which is more realistic as routings in VLSI domain are mostly manhattan. The CPU time required in a pentium-based machine has been negligible, and has not been reported for brevity.

77 Table 1. Performance of the heuristic H , Type of Exp.

MST based

RST based

# of

#of

nets (colors)

term. (points)

3 3 3 4 5 5 6 7 7 8 3 3 3 4 5 5 6 7 7 8

45 35 90 80 50 60 51 86 69 84 45 35 90 80 50 60 51 86 69 84

Total wire length Direct Hybrid % more in aPP. app. hybrid 100 % 7271 4913 10007 10910 10000 9783 10176 14653 13940 14169 9018 5907 12176 12992 11948 11560 11889 17946 16185 17111

9785 6596 12783 13963 12852 12167 12699 19590 16991 17700 10013 6947 13114 14095 12936 12318 12776 19632 17020 17774

34 34 27 27 25 24 24 33 21 24 11 17 7 8 8 6 7 9 5 3

Number of Crossings Direct app.

19 7 31 33 38 33 37 92 77 79 20 11 33 37 44 38 39 91 75 85

Hybrid app. 100 %

% more in direct

12 4 26 27 26 24 32 71 63 63 12 4 26 27 26 24 32 71 63 63

58 75 19 22 46 37 15 29 22 25 66 175 26 37 69 58 21 28 19 34

5. Conclusion

In this paper, we have marked the trees for distinct nets with different colors, and then proposed a geometry-based heuristic that reduces the number of crossings between the edges of different colors. This may help to reduce the number of expensive vias required in a VLSI chip. The experimental results reported show the effectiveness of our approach. Acknowledgment: We sincerely thank Mr. Vikram Chowdhary of IIIT, Kolkata for helping us with the experiments.

References 1. Sherwani, N. A., Algorithms for VLSI Physical Design Automation, 2nd Edition. Kluwer Academic Publishers (1995). 2. Hwang, F. K., On steiner minimal trees with rectilinear distance, SIAM Journal of Applied Mathematics, 30(1) (1976) 104-114.

3. Ho, J. M., Vijayan, G., Wong, C. K., A new approach to the rectilinear steiner tree problem, IEEE T C A D , 9(2) (1985) 185-193. 4. Warme, D. M., A New Exact Algorithm for Rectilinear Steiner Trees, International Symposium on Mathematical Programming, (1997). 5. Majumder, S., Nandy, S. C., Bhattacharya, B. B., Separating MultiColor Points on a Plane with Fewest Axis-Parallel Lines, Tech. Report ISI/ACMU/VLSI - 1, Kolkata, (2005).

MINIMIZING CAPACITATED TREE COVERS OF GRAPHS*

YOSHIYUKI KARUNO Kyoto Institute of Technology, Sakyo, Kyoto 606-8585, Japan E-mail: [email protected]

HIROSHI NAGAMOCHI Kyoto University, Sakyo, Kyoto 606-8501, Japan E-mail: [email protected] Let G = (V, E ) be a simple connected graph such that each vertex 21 E V and each edge e E E are weighted by non-negative reals h(v) and w(e), respectively. The . . , X,} of capacitated subtree cover problem asks to find a partition X = {Xi,. V and a set 7 = {Ti,. . . , T,} of subtrees such that each Ti contains Xi so that the cost of each Ti does not exceed a specified capacity d (> 0 ) , where the cost of each Ti is defined by the sum of weights of edges in Ti and weights of vertices in Xi. The objective is to minimize the number p (= 1x1 = 171)of subtrees. In this paper, we propose a constant factor approximation algorithm to the problem.

1. Introduction Let G = (V, E ) be a simple connected graph with a vertex set V and an edge set E . Define n = IVI and m = IEl. Let ( G , w ) denote a graph G = (V, E ) such that each edge e E E is weighted by a non-negative real w(e). The edge-weight w(e) for an edge e = (u, w) with end vertices u,w E V may be denoted by w(u, w). The graph may be denoted by (G, w , ~ if) a vertex r E V is specified as the root. We mean by G‘ C G that G’ is a subgraph of G. For a subgraph G’ C G , we denote by V(G’) and E(G’) the vertex set and edge set of G’, respectively, and by w(G‘) the sum of edge-weights in G‘. Let dzstG(u,w) denote the length of a shortest path between vertices u and w in a given graph (G,w). For a rooted graph (G, w, T ), define l~ = maxvEy~ Z S ~ G ( T , w). Let (S, h ) be a specified subset S of V such that each vertex v E S is weighted by a non-negative real h(v) (i.e., the vertex-weight). The sum of vertex-weights in a subset S’ S is denoted by h(S’). A collection X of disjoint subsets X I , . . . , X, of S (C V ) ‘This research was partially supported by a Scientific Grant in Aid from the Ministry of Education, Culture, Sports, Science and Technology of Japan.

78

79

is called a partition of S if their union is S , where some X i may be empty, and is called a p-partition of S if 1x1 = p . For a given tree T = ( V , E ) and a subset X C V of vertices, let T ( X ) denote the minimal subtree of T that contains X . We say that T ( X ) is induced by X . The minmax and capacitated subtree cover problems are formulated as follows.

Minmax Subtree Cover Problem (MSCP): 0 Input: An instance I = (G = (V,E),w,V,h,p)which consists of an edge-weighted graph (G, w), a set (V,h) of weighted vertices and an integer p E [2, n]. 0 Feasible solution: A ppartition X = { X I , .. . , X,} of V and a set 7 = {TI,. . . , T,} of p subtrees such that X i V(Ti)(1 5 i 5 p ) . 0 Goal: Minimize c o s t ~ ( X , I:= ) rnaxlli5,{w(Ti) h(Xi)}. Minmax Rooted-Subtree Cover Problem (MRSCP): 0 Input: An instance I = (G = (V,E),w,r,V,h,p) which consists of a rooted edge-weighted graph (G, w, T ) , a set (V,h ) of weighted vertices and an integer p E [2, n]. 0 Feasible solution: A ppartition X = { X I ,. . . , X,} of V and a set 7 = {TI,.. . , T,} of p subtrees such that X i U { T } C V(Ti)(1 5 i 5 p ) . 0 Goal: Minimize c o s t ~ ( XI ,) := maxllis,{w(Ti) h(Xi)}. Capacitated Subtree Cover Problem (CSCP): 0 Input: An instance I = (G = (V,E),w,V, h , d ) which consists of an edge-weighted graph (G, w), a set (V,h ) of weighted vertices and a positive real d. 0 Feasible solution: A partition X = { X I , .. . , X,} of V and a set 7 = {TI,.. .,T,} of subtrees such that X i C V(Ti) (1 5 i 5 p ) and c o s t ~ ( X , T:= ) rnaxllil,{w(Ti) h ( X i ) }5 d. 0 Goal: Minimize the number p (= 1 x1 = 171)of subtrees. Capacitated Rooted-Subtree Cover Problem (CRSCP): 0 Input: An instance I = (G = (V,E ) ,w, T , V ,h, d ) which consists of a rooted edge-weighted graph (G, w, T ) , a set (V,h ) of weighted vertices and a positive real d. 0 Feasible solution: A partition X = { X I , .. . , X,} of V and a set 7 = { T I , . . ,T,} of subtrees such that X i U { T } C V(Ti) (1 5 i 5 p ) and c o s t ~ ( X , I:= ) maxlli5p{w(Ti) h ( X i ) } I d. 0 Goal: Minimize the number p (= = 171)of subtrees.

+

+

+

+ 1x1 In this paper, we propose an O ( m + nlogn) time 7-approximation algorithm to the CSCP, and an O ( m + n 2 ) time papproximation to the

80

+

CRSCP, where p is an integer such that & ( p 3 ) / ( p - 3) 5 d. A (4 + E ) approximation algorithm to the MSCP and a (3-2/(p+l))-approximation t o the MRSCP have been known, where E is a specified constant. A brief review of the subtree cover problems in trees was provided by the authors '. 2. Preliminaries

We say that a solution ( X ,7) induces edge-disjoint (resp., vertex-disjoint) subtrees if for any two X i , X j E X , subtrees Ti and Tj are edge-disjoint (resp., vertex-disjoint).

Lemma 2.1. For an instance I = ( T , w , S ,h , p ) of the MSCP in a tree T = (V,E ) , there exists a pair of a p-partition X of S (2 V ) and a set 7 of p subtrees with c o s t I ( X , I ) 5 max

h ( S ) , max h ( v ) } VES

(1)

that induces edge-disjoint subtrees. Such a solution ( X , 7 ) can be obtained in O ( n ) time. Note that Lemma 2.1 holds not only for V , but also for any subset S V ) . For an instance of the MSCP in a tree, each subtree Ti E 7 in a solution ( X , 7 ) is induced by each X i E X , i.e., Ti = T ( X i ) . From Lemma 2.1 with p = 7 , we derive the following property.

(c

Lemma 2.2. For an instance I = ( T , w , S ,h , d ) of the CSCP in a tree T = ( V , E ) with max{h(w) I 'u E S } 5 d and w ( T ) h ( S ) 5 4d, there exists a pair of a 7-partition X of S (C V ) and a set 7 of 7 subtrees with c o s t I ( X , ' T ) 5 d. Such a solution ( X , 7 ) can be obtained in O ( n ) time.

+

3. Approximation Bounds

The proposed algorithm to the CSCP is described as follows.

ALGORITHM 1 Input: An instance I = (G, w, V,h, d ) of the CSCP. Output: A solution ( X , 7 ) to 1 such that c o s t I ( X , I ) 5 d and 1x1 = 1 7 1 5 7p* , where p* denotes the minimum number of subtrees t o the I . STEP 1. If max{h(v) I 'u E V } > d , then return fail (since there is no feasible solution to the given instance I ) . STEP 2. Remove from G all edges of weight greater than d. Let G I , . . . ,G, be the connected components resulting from G.

81 STEP 3. Perform the following substeps (i)N(iv) for each connected component G k (k = 1 , . . . ,y): (i) Compute a minimum spanning tree f k of graph (Gk,w>. (ii) Estimate the number of subtrees to be assigned into Gk by P k :=

[ { w ( F k )4-h ( v ( G k ) ) ) / ( 2 d )4-1/21 .

(2)

(iii) For the instance Ilk] = ( f k , w,V ( G k ) ,h , p k ) of the MSCP, obtain a pair of a pk-partition Adk] = { X p ’ , . . . ,X k l ) of v ( G k ) and a set 7[k] = {Tf“’,. . . ,Tpk [kl } of pk edge-disjoint subtrees by using the

O(n) time algorithm in Lemma 2.1. By ( 2 ) and Lemma 2.1, the C m t ~ [ k(X[’], ] of this solution is bounded by

cost

(iv) Generate pk instances of the MSCP, I / k 1= (Tjk’, w ,X l k l, h, 7 ) , 1 I z I pk. For each instance I/’], find a pair of a 7-partition =

{Xj? I 1 I j I 7) of X j k l and a set = {Tj,;]I 1 5 j 5 7) of 7 edge-disjoint subtrees by the O ( n ) time algorithm in Lemma 2.1. !YI”I’P4. Let q := x z = l p k and jj := 7q. Return @-partition X := {XI: I 1 5 k 5 y, 1 5 i 5 p k , 1 I j I 7) of V and 7 := {T&’ I 1 5 k 5 y, 1 5 z 5 p k , 1 5 j 5 7) of jj edge-disjoint subtrees.

Lemma 3.1. Let q = x l = l p k be the s u m of estimated numbers pk of subtrees obtained in STEP 3(ii) of ALGORITHM 1. T h e n q I p * .

Proof. Let p ; be the minimum number of subtrees to cover Gk in the context of the CSCP. Then p* = C & l p ; holds. It suffices to show that pk 5 p; holds for each k = 1 , . . . , y. Let TT, . . . , TP*;,denote the subtrees that cover G k , where the cost of each Ti* is at most d. By adding at most p; - 1 edges, we can connect these p; subtrees to obtain a spanning tree of Gk. By STEP 2, the weight of each edge e E E(Gk) is also at most d. Since f k is a minimum spanning tree of (Gk,w), we have w(fk)

+ h(V(Gk))I

w(T:) + h(V(Gk))+ (&

- 1) . d

llilp;

< p; . d + (p; - 1 ) . d = 2dp; - d .

From this and the integrality of p;, pk

Ip;

holds for pk in ( 2 ) .

0

82

Theorem 3.1. For a n instance I = (G,w ,V ,h, d ) of the CSCP, let p* be the minimum number of subtrees. T h e n there exists a pair of a partition X = {XI,...,X,} of V and a set 7 = {TI,..., TG} of subtrees with 5 7p*. Such a solution ( X ,7 )can be obtained in O ( m n l o g n ) time.

+

Proof. As observed in (3), each solution ( X [ k l7Ik]) , obtained in STEP 3(iii) has the cost at most 4d. Hence the resulting solution ( X , I is ) a 7approximate solution by Lemmas 2.2 and 3.1. Now we consider the time complexity of ALGORITHM 1. STEP 1 and STEP 2 clearly require O ( n ) time and O(m+n)time, respectively. For each connected component Gk in STEP 3, let n k = IV(Gk)l and m k = IE(Gk)l,respectively. STEP 3(i) takes ~ ( m k + n k10gnk) time to compute a minimum spanning tree Tk. STEP 3(ii) runs in O ( n k ) time. STEP 3(iii) takes O(nk)time by calling the algorithm in Lemma 2.1 once. By Lemma 2.1, T,[”,z = 1, . . . , p k are edge-disjoint. Then STEP 3(iv) takes O ( c l < i l p k IV(T,kl)I) = O ( c l l i l p k IE(T,!kl)l) = O ( n k ) time. The computation-of STEP 3 is repeated y times, and thus it requires C;=,O(mk n k log n k ) C;=,O(nk) = O ( m n log n ) time (since Cz=,n k = n and C;=,mk 5 m). Since trees in 7 are edge-disjoint, STEP 4 takes O ( n )time. Therefore, ALGORITHM 1runs in O(m+nlog n) time. 0

+

+

+

We can also show the following result for the CRSCP, but the detail is omitted due t o the space limitation.

Theorem 3.2. For an instance I = (G,w,r, V,h, d ) of the CRSCP with max{h(v) I v E V } 5 d and an integerp such that l ~ ( p + 3 ) / ( p - 3 ) 5 d , let p* be the m i n i m u m number of subtrees. Then there exists a solution ( X ,7 ) such that 1x1 = 1 7 1 5 p . p*. Such a solution ( X ,7 )can be obtained in O ( m n log n p * n ) (= O ( m n’)) time.

+

+

+

References 1. G. Even, N. Garg, J. Konemann, R. Ravi and A. Sinha, Min-max tree covers of graphs, Oper. Res. Lett. 32 (2004) 309-315. 2. Y . Karuno and H. Nagamochi, Scheduling vehicles on trees, Paczfic J. Optim. l(2005) 527-543. 3. H. Nagamochi and K. Okada, A faster 2-approximation algorithm for the minmax ptraveling salesmen problem on a tree, Discrete Appl. Math. 140 (2004) 103-114. 4. H. Nagamochi, Approximating the minmax rooted-subtree cover problem, Inst. Electron. Inform. Comm. Eng. Runs. E88-A (2005) 1335-1338.

EVALUATION OF SHORTEST PATH ROUTING ALGORITHMS IN MULTI CONNECTED DISTRIBUTED LOOP NETWORKS RAMESH VASAPPANAVARA Dept of Computer Science and Engineering, GVP College of Engineering, Visakhapatnam530042.India, E-mail: [email protected] MN SEETARAMNATH Dept of Computer Science and Engineering, A U College of Engineering, Visakhapatnam. India E-mail: [email protected] Loop networks with multiple hops offer smaller diameters, path lengths and better faulttolerance. These networks, also known as multi connected distributed loop (MCDL) networks in the literature, have extensive uses in LAN, parallel processing, and multi processing environments. Results exist for finding shortest path between a single pair in O(6) time where 6 is diameter of distributed loop [4], and in O(h/g + log h), and g is GCD(N, h), where N is number of nodes h is hop size, without any knowledge of diameter [l]. In this paper we compare these algorithms based on their performance at various network loads.

1

Introduction

Multi connected distributed loop (MCDL) networks, also known as undirected double loop networks, have become popular networks in the design of local area networks, supercomputing architectures, distributed memory multiprocessor systems. In [3] authors have presented details regarding comparative studies of shortest path algorithms, faulttolerance for MCDL. In [2] Sinha and Mukhopadyaya obtained shortest paths in O(6) time where 6 is the diameter. Algorithms at { 1,4] do not require knowledge of diameter . In [4], Chalamaiah and Ramamurthy proposed an O(h/g + log(h))algorithm to compute the shortest path where h is the hop size and N is no. of nodes and g = GCD(N, h). In [4], authors proposed an algorithm that fills an array of size N with an integer which is shortest distance to node 0, with complexity O(6) . In this paper, we present a comparison based on performance against increasing number of stations and varying network load conditions .

2

Evaluation of Algorithms

This algorithm’s[13 time complexity is independent of the diameter.. Hence using isomorphic property , we need to calculate a total of 2*(n-1) paths as stated above. Therefore, total time to fill the array, is of the order 0 (2*(n-l)*(ldg + log (h)). To calculate the shortest path between one source destination pair, we need to access only one single array element i.e. c. If ‘p’ paths are calculated it is p*c. therefore, total time is 0 (2*(n-l)*(h/g + log (h) + p*c). For the algorithm at [4], once the array is filled, the kthelement gives the shortest path

83

84 from node 0 to node the k. . The number of array elements that need to be accessed is variable and can be a maximum of d elements, where d is the diameter. If ‘p’ paths are to be calculated, total time required is d + d*p*c.

3.0 Test Results These algorithms[ 1,4] were implemented on IBM iv Pentium Machines and the time actually consumed by these algorithms computed by varying number nodes up to 6000. . The results are at Fig 1. It is observed that algorithm [l] performs better and displayed a flat response , indicating the independence of diameter for computing the shortest path for large n. The algorithm at [4] time consumed raises exponentially confirming O(6) nature of the algorithm. Its performance is better only for networks with less number of nodes. Charts 1 shows the times we calculated for paths p = 50 where p is number of shortest paths computed.. The network considered is 200 nodes and hop sizes were h= 4, 5, 6, 8, 10. The diameters obtained were, 16 and 14 respectively. ‘D’ indicates algorithm at [4] and ‘h/g+log h’ implies algorithm at [ 13. Charts 2 and 3 show the time performance of the two algorithms for a network having 200 processors, with hop sizes 4 and 10 respectively. Here we have considered number of path lengths as parameter . nodes.

I

Variation

o f l i m e

V s

N

( N o . o f S t a t i o n s )

1 4 0

p

1 2 0

E

1 0 0

fi

8 0

BfE

..

6 0

40 2 0

n 2000

0

4 0 0 0

6 0 0 0

N ( N o . o f S t a t i o n s )

Fig 1: Time units consumed by algorithms [ll ,4] p = 5 0 ~

4

5

6 H o p size

Chart 1: Time units (delay) vs. hop size, n=50

8

10

-

.. .

85

Chart 2:Delay vs. no of links traversed n=200, h=4

3

Conclusions

This paper considered MCDL networks , of the type G(n,+h,+l)G(n,+h,-l)G(n,-h,+l) G(n,h,-I). We have critically analyzed the shortest path algorithms at [4 ,111 1, that do not demand prior knowledge of diameter, for their suitability for adoption for practical implementations on MCDL networks. Algorithm at [ 11 takes O[h/g+log(h)] where g = GCD(n,h) which is independent of diameter and computes shortest path in terms of type and number of links. Algorithm at [4] takes ‘d’ (diameter) time steps. Since diameter is dependent on the number of processors and hop size, the time complexity increases with increase in number of processors. Algorithm at [ 11 owing to diameter independence and flat and constant response times with increase in no of processors and path lengths is apt for computing shortest paths in MCDL networks than [4].

References N. Chalamaiah and B. Ramamurthy, “Finding shortest paths in distributed loop networks”, Information processing letters, 67, pp. 157-161, 1998. Krishnendu Mukhopadhyaya and Bhabani P. Sinha, “Fault-Tolerant Routing in Distributed Loop Networks”, IEEE Trans. on Computers, Vol. 44, No. 12, 1995. Caley M. J. and Schluter D., The relationship between local and regional diversity. Ecology 78 (1997) pp. 70-80. G.Praveen Babu, Ramesh Vasappanavara and Nimmagadda Chalamaiah “Comparative Study of shortest path routing algorithms in MCDL Networks” Proceedings of Asia Pacific International Conference ObComAPC-2004: Parallel and Distributed Computing Technologies, (December, 13-15, 2004:Vellore Institute of Technology, Vellore, T.N., India Raj Katti and V.V. Bapeswara Rao, “An array based technique for routing messages in distributed loop networks”, presented at : ISCAS 2002 IEEE International Symposium on Circuits and Systems, 26 -29 May, 2002, pages V 9-12,.

REAL TIME SERVICES OVER THE INTERNET A. J. SIMMONDS Faculty of IT, Universiiy of Technology Sydney, PO Box 123, Broadway, NSW2007, Australia, [email protected] G .MERCANKOSK WesternAustralian TelecommunicationsResearch Institute, University of WesternAustralia, 35 Stirling Highwq, Crawley, WA 6009, Australia, [email protected] Real time Internet traffic such as Voice over Ip (VoIP) is difficult to estimate and simulate. Since the process is non-ergodic we must use ensemble averages rather than tine averages, and for combinations of sources the simulation time can be excessive. We give some analytical results for Constant Bit Rate (CBR) traffic for the n*D/D/l queue, and for a CBR queue with varying source rates we present an expression for the Interval of Significance 00s).This is the minimum number of time slots necessary to simulate the traffic, important because of the considerable computation time often involved in simulating this queue. Our results help in dimensioning networks for CBR services. Keywords: CBR, VoP, queuing teletraffic, traffic, ensemble, dimensioning

1

Introduction

Delay sensitive applications such as Voice over IP (VoIP), video and audio streaming, etc. are becoming increasingly important to support both multimedia and the push towards all IP networks. We term such services ‘real time services’ as they must deliver their information in a timely manner. They can be supported by Constant Bit Rate (CBR) or real time Variable Bit Rate (VBR) transmission mechanisms, which either implicitly (CBR) or explicitly (VBR) provide timing information. Here we only consider CBR traffic streams. We assume a constant packet length and source packet generation rate for CBR traffic. A common technique to minimize the effect of jitter [l] on real time services is to delay the first packet at the receiver by d?. Assuming that the overall queuing delay of the call is statistically bound by a known value d (the expected maximum queuing jitter), the timing structure of the call can be fully recovered if d = d?. Hence we choose d? = d. N.B. for VoIP traffic the ITU-T Recommendation G .114 specifies a maximum end-to-end delay (including d?)of 150 ms. As an example of a typical VoIP scenario, for a G.711 codec with two 10 ms samples per packet, the expected packet arrival rate is 50 packetds and the packet length is 200 Bytes &] (hence the total IP date rate is 80 kbps to carry a 64 kbps digitized telephone call). 86

87

For real time services there is no time for an automatic repeat request (ARQ) process. Hence such services muct cope with some loss of packets, which implies that the signal must contain redundancy (e.g. a human speech signal can have many imperfections and still be intelligible). However, there is a limit to the number of packets lost before the signal breaks up. One cause of loss in the Internet is when packets are dropped because of congestion. It is anti-social, and even ultimately self-defeating, to seek to guard against such loss by sending extra packets. Instead we need to ensure a Quality of Service (QoS) such that delay and loss are both constrained within acceptable limits for real time services. Techniques such as Diffserv PI, IntServ PI, or MPLS [5] provide some guarantees of QoS over the Internet for selected flows. But a network must be properly dimensioned to provide an appropriate QoS. Our aim here is to provide some theory and background for dimensioning networks for CBR services. In section 2 we review a key result by Roberts and Virtamo [6]: the derivation of the tail function for the G/D/l queue (using Kendall's notation arrival processldeparture processhumber of servers, where G is a general and D is a deterministic distribution). We extend this result to multiple CBR flows with the same packet length and source rate (the n*D/DIl queue) in section 3. In section 4 we consider a queue with multiple CBR flows of the same packet length but varying source rates (the nl*D,/DI1, n2*D2/D/1, ... n,*DJDIl queue) and derive an analytic result for the minimum simulation run length (the Interval of Significance E). 2

G/D/1 queue tail distribution

We expect the call arrival pattern to be stable over many time slots (possibly 1000s of time slots) since we are dealing with packets of sampled and digitized real time information, e.g. for a single VoIP call as perceived by the user, many packets are generated and each packet needs to be treated as a call by the packet switching system. The process is non-ergodic and we must use ensemble averaging techniques rather than time averages (the ensemble average gives the fraction of arrival patterns which result in a delay greater than x, whilst the time average gives the fraction of packets that experience a delay greater than x [7]). We follow Roberts and Virtamo [6] in deriving the result for a G/D/l queue. Beginning by defining terms: 1. The unit of time is the call service time, thus in time s we clear or service a maximum of s calls (packets). 2. 'The time interval (t - s, t) is from time (t - s) to t and is divided into s time slots of unit time, starting with time slot t - s and ending with time slot t - 1.

88

3. Lf is the number of calls in the queue at the beginning of timeslot t . 4. For a stable queue N(t - s,t) < s over a sufficiently long time interval, where N(u,t) is the number of call arrivals in time interval @,t). We define the excess function @(s) = N ( t - s , t ) - s , withf(0) = 0 the limiting case when the time interval is 0. 5. Any number of calls N(t, t + 1) can arrive in a timeslot and the first call in the queue will be serviced when the server is free. 6. The condition for a stable queue (4 above) means that for some timeslot t the queue length L, = 0 and the server will become free during that time slot. This will occur an infinite number of times, but let the first timeslot before t when the server is again free be t - so, i.e. Lf,o = 0. 7. Since the number of calls in the system (queue and server) > 0 for t-so < q < t there will be so - 1 service completions in the period ( t - so, t). 8. We now relax our stipulation that L, = 0. Given from 6 above that Lf-d = 0, what is the value of L,? Noting that the number of arrivals since t - so is N(t - so, t ) : L, =L,-,,+N(t-s0,t)-(so -1) Lt = N ( t - s o , t ) - s o + I L, = @(so)+ 1 9. Let n = N(t - s - 1, t - s), i.e. the number of arrivals in the time slot t - s - 1, then N ( t - s -1,t) = n + N ( t - s , t ) @ (s 1) = @(s) + n -1 If L, > r thenf(s) takes every possible value 0 I $ ( s ) 5 r as s increases beyond so. Hence if Lf > r for any 0 = q = Y , there exists anx > 0 such thatf(x) = q .

+

Proposition. The excess function@(s)= N ( t - s , t ) - s , for s = 0, 1, 2, ..., has the following properties: (i) there exists a positive integer u such that $(s) < 0 for s > u. This follows directly from 4 above to ensure a stable queue. (ii) Lf-,a = 0 (from 6 above). (iii) if& = Y f 1 there exists a positive integer w such that @(w)= r (from 8 above). (iv) ifLf > Y there exists a positive integer x such that 0 5 @ ( x )I r (from 9 above). corollary, simultaneously satisfying properties (iii) and (iv): (v) if Lf > Y there exists a positive integer s* such that @(s*) = Y (andf(s) < Y for s > s*, or equivalently L,,o = 0 from 6 above). Let S,(r) = P{Lf = r } = Pcf(s) = r } , and let Q,(r) = P{L, > r } , the tail (or complementary) distribution for a buffer size Y . By inspection:

89 m

s , ( r ) = x P { N ( t - s , t ) - s = r , L 1 _ , =o> s=l

But for L, > r thenf(s*)

=r

(corollary (v) above). Thus: m

Q , ( r ) = P{$(s*)= r } = x P { N ( f - s * , t ) - s * = r , L , - , = 0 } S+l

x ca

Q, (r)=

P { N ( t - s, t ) = r + S,L,...~ = 0}

s=l

This is the tail distribution for the G/D/I queue. This expression can also be derived in a more general way using the BeneS method [8].

3

Constant packet length and source rate, n"DIDI1 queue

If we now consider n periodic sources, each transmitting one cell (i.e. a fixed packet size) in a period D (the n*D/D/l queue), then it is shown in [7] that for a stable system (n < D ) it is sufficient to sumover the period D. Each call arrival is assumed to be independent and uniformly distributed aver the period D and the call arrival rate ? = n/D. From the G/D/l result: D

Q , ( r ) = x P { N ( f - s , t ) = r + s , L , - S = 0)

(1)

s=l

However, since we are considering the ensemble average of the possible arrival patterns which will cause a buffer overflow, with n sources and a buffer size of r we only need to sum over n - r timeslots rather than the full period D . The first term inside the sum of Eq. 1 is the probability of exactly r + s arrivals in an interval of length s, which is given by the binomial distribution as in the first three terms inside the sum in Eq. 2. This leaves n - (r + s) arrivals in sn interval D - s to consider, hence the last term in Eq. 2 (i.e. the probability that L,+ = 0) is given by 1 - ?', where the arrival rate ?' is (n - Y - s )/(D- s). This is a direct and shorter derivation for Eq. 2 than given by Roberts and Virtamo in [ 6 ] ,noting that the prababilities in Eq. 1 are independent. Q,(r) = 0 for r = n, whilst

Eq. 2 is the probability of a packet from an additional source (n' = n + 1) being lost assuming no packets from the n sources are lost, for buffer size r packets and period D timeslots. Note that Q(0)= n/D, and for n - r = 1 then Qt(r) = 1/D" as expected. Hence the overall probability of one or more packets being lost for n' sources is given by:

90 n'-1

R, ( r ) = 1- n ( l - Q, ( Y ) ) for n' > I

(3)

n=l

4

Constant packet length and varying source rates

A queue with multiple CBR flows of the same packet length, but varying source rates, can be characterized as the nl*DI/DII, n2*D*IDIl, ... n,*DJDIl for rn different types of sources and n = nl + n2 + ... + n, sources. In this case it would appear that we need to consider the sum Q,(r) over the period which is the lowest common multiple (LCM) of the different rates, that being the period over which the arrival pattern will repeat. The simulation time required for this can be considerable. However, [7] shows that we only need to consider a shorter interval E, termed the Interval of Significance (IoS), which results in shorter simulation times than using the LCM (an iterative method to calculate E is presented in [7]). As considered above in section 3 and more fully in [7], for an n*DIDIl queue it is necessary and sufficient to run the simulation for n timeslots, rather than a full period of D time slots. For rn different types of sources we note that a simulation must therefore run for at least nmar timeslots, where nmuxis the maximum of ni (1 = i = rn). We can account for the other types of sources by adding a contribution from each type, weighted by the ratio of the minimum simulation run length (E) to the source period

(oil: E=n,,+

niE i = l , ; e m Di

Rearranging gives us an analytic expression for the Interval of Significance:

From the condition for stability of the queue: p, + p2 + ...p,

m

+ = En,<1 , then i=l

Di

E = nmuxas required. Also, for an n*DIDIl queue E = n as required. Considering the case simulated in [7] with 3 types of flows: consisting of 1 flow of period 6.67 time slots, 10 flows of period 33.33 and 450 flows of period 1000, where it was found by the iterative method that E = 824 time slots. By inspection the LCM is 1000, and it is shown [7] that the results obtained in running the simulation over 824 time slots were identical to those obtained using 3000 time slots. Applying fi. 4 allows us to calculate E = 818.2 = 819, which is in excellent agreement with the value derived by the iterative method in [q.

91

5

Conclusion

We have considered Constant Bit Rate (CBR) flows over the Internet. We followed the ensemble average derivation of the tail distribution for the G/D/I queue (Eq. 1) by Roberts and Virtamo [6]. We directly derive an analytical result for the tail distribution of the n*D/D/l queue (Eq. 2) from the G/D/l queue, and then in Eq. 3 give the overall probability of one or more packets being lost for n‘ sources. However, the nl*DI/D/l, nZ*D2/D/l, ... n,*DJD/l queue (multiple CBR flows of different rates but the same packet size) needs to be simulated. The simulation time for this queue can be considerable, but we derive an analytical expression for the Interval of Significance (Eq. 4) which enables simulation times to be minimized. Future work would consider multiple CBR flows with constant but differing packet sizes, and VBR flows. References

1.

Mercankosk G, The ‘Virtual Wire’ Per Domain Behaviour - Analysis and Extensions, July 2000, Differentiated Services WG, Internet Draft. Available at http://www.ee.uwa.edu.au/-guven/vwpdb.pd€

2.

Cisco Systems Technology White Paper, Traffic Analysis for Voice over IP, available from http://www.cisco.com/en/US/tech/tk652/tk7O~/technolo~ies~w~ i te-paper 0 918 6a0 0 8 0Od6b7 4 shtml [accessed 20/6/05].

.

Blake S. et al, in An Architecture f o r Differentiated Services, Dec. 1998, RFC 2475, IETF Network WG. 4. Braden R., Clark D., and Shenker S., Integrated services in the internet architecture: an overview, June 1994, RFC 1633, IETF Network WG. 5. Eric C. et al, in Multiprotocol Label Switching Architecture, RFC 303 1, Jan. 2001, IETF Network WG. 6. Roberts J. and Virtamo J., The Superposition of Periodic Cell Arrival Streams in an ATMMultiplexer, IEEE Trans on Communications, V. 39, No. 2, pp. 298 -303, Feb. 1991. 7. Sim M. J. and Mercankosk G, Superposition of Periodic Streams -Interval of Significance, 1lth IEEE International Conference on Networks 2003, Perth, pp. 87 - 92,2003. 8. Barcel6 Ordinas J. M., The Benes approach applied to ATM networks, Chapter 4 of PhD thesis, Analytical Models f o r the Multiplexing of Worst Case Traffic Sources and their application to ATM Traffic Control, available from 3.

http://recerca.ac.upc.edu/~AR~ES/CompNet/papers/thesis-~ose MBarcelo pdf [accessed 17/6/05].

.

EXPERIMENTAL STUDIES ON REPRESENTATION COMPLEXITY AND ERROR RATES OF ITERATIVELY COMPOSED FEATURES

K. HARAGUCHI AND H. NAGAMOCHI Department of Applied Mathematics and Physics, Graduate School of Informatics, Kyoto University, Kyoto, Japan E-mail: { kazuyah,nag} @amp.i.kyoto-u.ac..jp T. IBARAKI

Department of Informatics, School of Science and Technology, Kwansei Gakuin University, Sanda, Japan E-mail: [email protected]

We consider constructing a classifier c : ( 0 , l ) " H ( 0 , l ) from a training set of n-dimensional binary vectors. A classifier is usually represented by a representation model such as decision tree (DT) and iteratively composed feature (ICF). In this paper, we discuss the following issues through computational experiments: 1) relation between the representation complexity y, the error on the training set e and the expected error E of an ICF classifier and 2) comparison between ICF and DT. For l ) ,we observe that a simpler ICF is more likely to attain a small E , among the ICFs having the same e. For 2), we see that DTs generated by C4.5 and ICFs generated by our algorithm are competitive in terms of E .

1. Introduction

We consider the classification problem to learn a hidden Boolean function y : {0,1}" H (0, l},called an oracle, from a training set of n-dimensional binary vectors. Let X denote the training set. Each vector x E X , called an example, is labeled as y(x) 6 {0,1} by the oracle. We decompose X into X = X' U X o , where X1 = {x E X I y(x) = 1) and X o = {x E X I y(x) = O}. The problem is to construct a classifier c : ( 0 , l}n++(0, l}, a Boolean function such that c = y (or c N y) from a training set X . This problem has a broad application in such fields as data mining, machine learning, pattern recognition, and so on.' The empirical error e ( c ) of a classifier c is the error rate on the training 92

93

set

X;

The generalization emor E ( c ) of a classifier c is the expected error rate for future examples which are drawn from the same domain as the training set. A classifier is usually implemented by a representation model such as decision tree (DT)6 and iteratively composed feature (ICF) where ICF was proposed in our previous work. Let r denote a classifier implemented by some representation model, and we write the representation complexity needed for the implementation of r by y ( r ) . In this paper, we discuss the following issues through computational experiments: 1) relation between y, e and E of an ICF classifier and 2) comparison between ICF and DT. For l), we observe that a simpler ICF is more likely to attain a small E , among the ICFs having the same e; we confirm that ICF agrees with the theories of minimum description length (MDL) principle' and VC d i m e n ~ i o nas , ~ other representation models do. For 2), we see that DTs generated by C4.56 and ICFs generated by our algorithm are competitive in terms of E ; it is difficult to develop an algorithm to construct ICFs whose generalization errors are better (i.e., smaller) than DTs by C4.5, although it was shown4 that, if there exists a DT t with e ( t ) 5 E , then there also exists an ICF f with e(f) 5 E and y ( f ) = O ( y ( t ) ) (i.e., if there exists a simple DT with small empirical error, which is assumed to attain small generalization error as is known in the theories of MDL principle and VC dimension, then there also exists a simple ICF with small empirical error). ,374

2. Iteratively Composed Features

We review the representation model ICF. Let A = ( a l , . . . ,a,} denote a set of n attributes of data, where aj ( x ) = xj for all j = 1,.. . , n and x E (0, I}". Let us consider each attribute a j as a special type of features. A feature is in general composed of a nonempty subset S of other features, and is defined as a Boolean function fs : (0,l)' H {0,1}. If S is a singleton S = {g}, then we define fs = g. (Thus, f{aj} = aj for each aj E A.) Once constructed, fs can be used as a variable of a new feature. Figure1 illustrates an iteratively composed feature f = fs with S = {f1,f2,ag}, where f 1 and f 2 are features composed of some sets of attributes. In the figure, nodes represent features whose input variables are depicted by the incoming arcs.

94

Figure 1. A feature f = f { f ~ , f z , a s )

Note that we may assume 22's' functions for a feature fs. In the computational experiments, we define fs by the majority operation; we write the projection of a vector x E ( 0 , l}n on S as e.g., for f = fs in Fig. 1, z)s = {fl(x), f2(x),xg}. For each z E ( 0 , l}s, let = {x E X1 I 21s = z } and X z , z = {x E X o I 51s = z } . Then, we define fs(z) = 1 (resp., 0 ) if IXg,,l > IX;,,( (resp., IX;,,( 5 \X'&l). It can be easily shown that,3 for any set S of features, if fs is composed by the majority operation, then e(fs) 5 e(g) holds for any g E S. We define the representation complexity ~ ( f as ) the bit length needed to implement the structure of an ICF f . It is proportional to the number of nodes and edges in the graph representation of f , but the details are omitted. In the computational experiments, we use the following algorithm to generate ICFs.

XIS;

Algorithm G E N l C F Input: A training set X with n attributes A = { a l , . . . , an}, maximum number of fan-ins k,, (2 5 k,, 5 n ) and parameters TO and 71. Output: A set G of features. Step 1: Let G1 := {fs 1 S C A, 2 5 JSI5 k,,,}, G := A U G1 and t := 2. Step 2: Let Gt := 0. Repeat the following TO times. Step 2-1: Repeat selecting S C G with IS1 E [2, k,,,] and SnGt-l # 8 randomly at most 7 1 times, until fs satisfies e(fs) < e ( g ) for all g E S. Step 2-2: If such an fs is found, then let Gt := Gt U {fs}. Step 3: Let G := G U Gt and t := t + 1. If t 5 n, then return to Step 2. Step 4: Output G and halt. We use the parameters

kmax = 3

to 6 and ro = TI = 50.

95

3. Computational Experiments Benchmark Data Sets. For benchmark instances, we take 10 real data sets from UCI Machine Learning R e p ~ s i t o r ye.g., ; ~ breast-cancer-wisconsin. The data sets contain from 250 to 1600 examples, and have from 3 to 34 numerical and/or categorical attributes. We modify the data sets so as to use them in our framework (e.g., binarization). The resulting binary data sets contain from 6 to 16 attributes. For each data set, we randomly choose 75% of the whole examples as the training set, and the rest 25% as the test set. We estimate the generalization error by the error rate on the test set, which is defined analogously with Eq. (1). Relation between Representation Complexity and Error. For each k,, = k, we partition the generated features Gk into 5 sets G f , . . . , GF by y so that y(f) < y(f’) holds for any f E G t and f’ E Gt, with s < s’ (1 5 s < s’ 5 5 ) . Let us define G ~ ( E= ) {f E Gt I e ( f ) = E} and E:(E) be the average of the estimated generalization error E ( f ) of f E G ~ ( E ) . We investigate how often E:(e) 5 E $ ( E )holds for 1 5 s < s’ 5 5. Let p(s, s’) denote the rates that E:(E) 5 E:,(E) holds among all E such that F$(e),F: (E) # 8, for all Ic and all data sets. Table 1shows the rates p(s, s’). For s = 1 , 2 , p(s, s’) 2 0.5 hold in almost all cases. It indicates that, if we have some ICFs with similar empirical error, a simple one is more likely to attain a small generalization error, as suggested by the theories of MDL principle and VC dimension. Table 1. Rates p ( s , s’) s=1 2 3 4

s’=2 0.600

3 0.584 0.528

4 0.554 0.485 0.416

5 0.633 0.560 0.430 0.433

Comparison of ICF and DT. We next compare ICF with DT. In this experiment, we use C4.56 to generate DTs. C4.5 has several adjustable parameters used to control the final decision tree; e.g., the parameter to adjust the target size of the final decision tree (confidence level). We generate lo4 decision trees by C4.5 from the given training set by determining

96 the parameters at random. Let T denote the set of generated DTs. For each data set, let E m i n denote the smallest E attained by all the classifiers generated by G E N l C F and C4.5. For each R E { T ,G3, . . . ,G 6 } , let R* = { T E R I E ( T ) 5 E m i n 0.025}, and let P(R) = lR*l/lRl, the rate of classifiers in R whose generalization errors are reasonably small; i.e., larger P(R) is preferred. We take the rank in P(R) for each R and the average of the ranks among all data sets; for R = T ,G3,. . . ,G 6 ,the average is 2.9, 2.1, 2.6, 2.9, 3.2, respectively. From this result, we cannot draw any conclusion which representation model is definitely superior, though ICF is assumed t o be superior t o DT as seen in Sec. 1.

+

4. Conclusion In this paper, we investigated a relation between representation complexity and error for ICF, and compared two sets of classifiers based on DT and ICF. For the former, we observe that among those features having similar empirical error, a simple one is more likely t o attain a small generalization error. For the latter, we see that classifiers generated by G E N l C F and C4.5 are competitive in terms of generalization error. It is an important future work t o develop a method for generating ICFs that attain better generalization error than the DTs by C4.5.

References 1. U. Fayyad, G. Piatetsky-Shapiro and S. Padhraic, From Data Mining to

Knowledge Discovery in Databases. A1 Magazine 17,No.3, 37-54 (1996). 2. P. Griinwald, Model Selection Based on Minimum Description Length. J. Math. Psychology 44, 133-152 (2000). 3. K. Haraguchi, T. Ibaraki and E. Boros, Classifiers Based on Iterative Compositions of Features. Proc. 1st Intl. Conf. on Knowledge Engineering and Decision Support (ICKEDS 2004), 143-150 (Porto, Portugal, Aug. 2004). 4. K. Haraguchi, H. Nagamochi and T. Ibaraki, Compactness of Classifiers b y Iterative Compositions of Features. Proc. 4th Japanese-Hungarian Symposium on Discrete Mathematics and Its Applications, 92-98 (Budapest, Hungary, Jun. 2005). 5. S. Hettich, C. L. Blake and C. J. Merz, UCI Repository of Machine Learni n g Databases [http://www.ics.uci.edu/-mlearn/MLRepository.html].Irvine, CA: University of California, Dept. Information and Computer Science (1998). 6. J. R. Quinlan, ed., 174.5: Programs for Machine Learning. Morgan Kaufmann (1993). 7. B. D. Ripley, ed., Pattern Recognition and Neural Networks. Cambridge University Press (1996).

An Application of Spanning Arborescence Algorithm to Scalar Multiplications Daisuke Adachil and Tomio Hirata2 Hitachi-Cable Ltd. 3550 Kidamari-cho, Tsuchiura-shi, Ibaraki, 300-0026 Japan adachi.daisuke0hitachi-cable.co.jp

Graduate School of Information Science, Nagoya University Furou-cho, Chikusa-ku, 464-8603 Nagoya, Japan hirataQis.nagoya-u.ac.jp

Abstract. The elliptic curve cryptographic scheme is a promising public key scheme since keys can be shorter comparing with other schemes. The window method is an efficient method for computing scalar multiplication which is a significant operation in the elliptic curve cryptographic scheme. This paper proposes an application of a graph algorithm for efficient computation of the scalar multiplication.

Keywords: spanning arborescence, public key cryptosystem, elliptic curve, window method

1

Introduction

This paper proposes an application of the graph algorithm for finding “the minimum weighted spanning arborescence” [3] in a digraph. An arborescence is the minimal spanning subgraph of a digraph such that each vertex has at most one incoming edge and is reachable from a root. Thus it can be considered as the directed counterpart of a spanning tree. However, the minimum weighted arborescence problem is more difficult than the minimum spanning tree problem, since a simple greedy strategy does not work. The minimum weighted spanning arborescence in a digraph G = (V, E ) can be found in O(lEllVl) [2] time. Using some data structures, it can be computed in O( IEl log, IVl) time. Fulkerson also gave an algorithm [3] of time complexity O(lE1logz IVl). The algorithm we use here is this version. We apply an algorithm for the minimum weighted spanning arborescence problem t o the scalar multiplication on an elliptic curve. Scalar multiplication is a dominant operation in various procedures in the elliptic curve cryptographic scheme such as key generation, encryption, decryption and others. Thus an efficient computation of the scalar multiplication leads t o the reduction of execution time of the elliptic curve cryptographic scheme. 97

98

2 2.1

Preliminaries Addition of Points on the Elliptic Curve

“Addition” of points is known as a basic operation of the elliptic curve cryptographic scheme. Let P and Q be points on an elliptic curve of Weierstrass form y 2 = x3 ax b. The result of “addition” of P and Q, denoted as P Q, is a point on the curve. P Q can be obtained in the following way. First, compute the non-trivial third intersection between the elliptic curve and the line which passes through P and Q. And then, P Q is a symmetric point of the intersection to the x-axis. If P and Q are identical, this addition is called “doubling” and denoted as 2P. In the computation of 2P, the tangent line at P is used instead of the line passing through P and Q . Note that, in actual applications of elliptic curve cryptosystems, the elliptic curve is defined over a finite field. That is, the elliptic curve is not represented as a curve in the plane but a set of discrete points. This set forms a finite group. We consider an elliptic curve E p : y 2 = x3 ax b (mod p ) (a,b E G F ( p ) )in this paper.

+ +

+

+

+

+ +

2.2

Scalar multiplication

A point nP is defined as the sum of n number of Ps, n 7

nP= P + P + . . . + P ,

that is, nP is the result of n - 1 additions of P. Scalar multiplication is the computation of nP from given a scalar n and a point P on the elliptic curve. Usually, the scalar n is given as a huge number, say, more than 2l6’. Thus, in order to compute nP efficiently, we do not compute n-1 additions. The “window method” is an efficient method for the scalar multiplication. The window method is a kind of efficient exponentiation method which divides the binary representation of n into consecutive w bits and treated them as w-bits numbers (see [5] for example). The method consists of two stages, the precomputation stage and the evaluation stage. The precomputation stage first divides the binary representation of n into segments of length w.Each segment is called a “window” and w is called the window size. Then, the stage computes points V P for each window value v. For example, if n = (100110111100)~ and we choose w = 3, the binary representation of n is divided in the following way.

n = (100110 111 10012 Since 4,6 and 7 appear as a window value v, points 4P, 6P and 7P are computed in this stage.

99

The evaluation stage computes n P by repeating, for every w bits, w doublings and an addition. The results of the precomputation stage are used in this addition. In the previous example, n P is computed in the following way. nP = 23(23(23.4P

+ 6P) + 7 P ) + 4P

In actual applications, the window method is used with various improving techniques. For example, we slide windows so that every window has ‘1’ as its LSB. Then the number of window values can be reduced. Such window method is called “the sliding window method”. We consider the sliding window method in this paper. 2.3

Field arithmetic in addition and doubling

The addition and doubling on the elliptic curve defined over a finite field are implemented by several kinds of arithmetic operations over the field. These field arithmetic operations include multiplication, squaring, inversion, addition and subtraction over the field. In these arithmetic operations, multiplication, squaring and inversion are much expensive than addition and subtraction. Thus we concentrate on the reduction of the number of execution of the former arithmetic operations. For simplicity, we identify a field squaring with a field multiplication in this paper. The coordinate system for representing the elliptic curve determine the number of field multiplications and field inversions performed in the computation of an addition P + Q and a doubling 2P. For example, an addition takes 16 field multiplications and a doubling takes 8 field multiplications on the Jacobian coordinates if a = -3 [6]. But, on the affine coordinates [ 6 ] ,an addition takes 2 field multiplications and 1 field inversion, and a doubling takes 3 field multiplication and 1 field inversion. We choose the Jacobian coordinates as the coordinate system for representing the elliptic curve in this paper.

3

Application of a graph algorithm

This section describes how a graph algorithm is applied for an efficient computation of the scalar multiplication. 3.1

Precomputation stage of the window method

When we compute n P with the window method, there are two approaches for determining the window size w. The one is to define w a small value, say, w << log, log, n. The other is to define w as a rather large value. In the first approach, all odd integers between 1 to 2” - 1 are likely to appear as a window value w. So, all U PSare computed systematically in the precomputation stage as follows. P + P = 2P, P + 2 P = 3P,

. . . , (2” - 3 ) P + 2 P = (2” - l ) P

100

The second approach can reduce the number of window value u. That is, the window values are some odd integers between 1 and 2” - 1. Thus, the systematic calculation indicated above is not efficient for this case. Usually the second approach uses some heuristics for efficient computation of UPS.The most well-known heuristics is the Bos-Coster method [l].Our proposal is based on the second approach, and we use a graph algorithm.

3.2

Precomputation using a graph algorithm

Let u1, . . . , u1 be the window values appeared in the precomputation stage. As noted in the last section, the systematic method mentioned there produces unnecessary scalar multiplications. A naive method is to compute each viP independently. However, we notice that some ujP could be efficiently computed from other uiP . For example, suppose we have already computed uiP and u j = 2ui +1, then we need not compute ujP from scratch. We just do one doubling and one addition, since uj P = 2uiP+P. Thus, if it is possible to find an appropriate order of computation of uiPs, we can reduce the execution time of the precomputation stage. The proposed method computes all uiPs by the following steps. 1. For every pair (ui,u j ) , we estimate the computation cost c(ui, vj) from uiP to UjP. 2. Let V = (u1, . . . , ul} U {l,2} and G = (V,E ) be a “complete” weighted digraph, that is, for each vertex pair there are two edges directing reverse each other and the edge (ui,u j ) has weight c ( q , u j ) . 3. We find an optimal order of computation of all uiPs by applying the minimum arborescence algorithm. We describe the detail of every step from now. 3.3

Estimation of the cost of the computation from viP to vjP

We define the cost of computation from uiP t o ujP as follows. Let n A ( n D ) be the number of addition( doublings) performed in the computation from uiP to ujP . Then the cost is defined to be d ’ n A no, where d is the ratio of “the number of field multiplications executed in an addition” to “the number of field multiplications executed in a doubling”. When uj < ui,we first compute y P = (ui- u j ) P by the binary method. The binary method is the window method when w = 1. That the binary method computes y P using v(y) - 1 additions and X(y) doublings, where v ( z ) i s the number of non-zero bits in the binary representation of z and X(z) = [log, z]. Then, we obtain the inverse of y P ( - y P ) and compute ujP = uiP ( - y P ) . Finally, we store ujP. We also store y P for a possible use in the future. In this case, the cost is defined as d . ~(y) X(y). Note that an inversion of a point on the elliptic curve does not require field multiplication. When u j > ui,we consider the following two computations of ujP. Let y be the unique value such that 2Yui 5 u j <: 2Y+‘vi. Let r l , 1-2 be r1 = uj - 2Yvi, and

+

+

+

101

Fig. 1. (a) A digraph D , (b) A spanning arborescence of D rooted at vT ~g = 2Y+’vi - vj. The first computation is as follows. If T I P has already been computed, we compute vjP = 2YviP r i p . Otherwise, we first compute qlP by the binary method, where q1 = Lr1/2Y].Then, we set Y = viP qlP and ) P every i = y, y - 1, . . . , 0. This repeat the computation Y t 2Y ( ~ l [ i ] for computation has the cost c1 = d . u ( ~ 1 ) (X(q1) y). If T I Por qlP has been already stored, this cost should be reduced. The second computation is as follows. If rzP has been already stored, we compute vjP = 2Y+biP (-rzP). Otherwise, we first compute qgP by the ] . we set Y = viP (-qzP) and binary method, where qz = L T ~ / ~ Y + ~Then, P every i = y 1, y, . . . , 0. Note repeat the computation Y +- 2Y ( - r ~ [ i ] )for This computation has the cost cg = d . v ( T ~ ) ( X ( q 2 ) y 1). If rgP or qgP been already stored, this cost should be reduced. Finally, we set min(c1, cg} as the cost of the computation of vj from vi. Note that for the following cases, we respectively consider more efficient computation.

+

+

+

+

+

+

+

3.4

+

vj = IC . vi is satisfied, we compute vjP by the binary method. The cost is d ( v ( k ) - 1) X(k). If vj = w i f 2 is satisfied, we compute v j P by adding ( f 2 P ) to viP. We can assume that 2P has been already stored because 2P = P P must be computed at first. Thus, the cost is d.

- If -

+

+ + +

+

+

Finding an optimal computation order

Our computation starts from P and calculates each vi successively. This computation corresponds to a traversal of every vertex of G. We start from a vertex ‘l’, which corresponds to P (we denote this vertex as vr). Then we visit each vertex from already visited one. We do not visit a vertex already visited. It is easy to see that this traversal corresponds to a “spanning arborescence” of G rooted at v,. The rooted spanning arborescence of a digraph D is a subgraph of D such that each vertex has at most one incoming edge and is reachable from the root (see Fig. 1).A spanning arborescence rooted at T gives a semi-order for computing all viPs and the sum of the edge weights in a spanning arborescence represents the cost of the computation. Thus, we want to find a “minimum weight spanning arborescence” in the graph G. To find an optimal order of the computation of all viPs, we perform the following steps.

102 1. Find a minimum weight spanning arborescence of G using the Fulkerson’s algorithm (31 . This algorithm finds a minimum weight spanning arborescence in (C3(IEIlog, IVI))-time . 2. From the optimal semi-order obtained above, we sort the vertices topologically. This gives an optimal order of computation of all wiPs.The topological sort runs in polynomial time (O(lV1 IEI)) .

+

In the rest of the paper, “the proposed method” means the window method which incorporate the above procedure in the precomputation stage.

Comparison with other methods

4

This section compares the performance of the proposed method with the sliding window method by experiments. 4.1

Experimental envi r onm ent

We have implemented the following two variant of the window method on a PC: the sliding window method and the proposed method. We have counted the number of additions and doublings executed in the computation of nP. Experiments are done for 10,000 times for each window size and we calculate their averages. The experimental environment is as follows: - elliptic curve : NIST P-192 [7] . This curve has the coefficient a = -3, so d = 1618 = 2.0. - coordinate system : the Jacobian coordinates. - a 192-bit scalar n and a point P on the curve are given randomly. - using GNU MP library for implementing the field arithmetic operations on a 192-bit prime field. 4.2

Experimental results

We evaluate the overall performance of the proposed method against the sliding window method. Figure 2 shows the average cost of the computation of nP. The left (right) figure shows the experimental results when the binary representation (the canonical signed binary representation (CSBR) 3, is used. From these results, the followings are observed. when w 5 4, the cost of the proposed method is the same as that of the sliding window method. - when w 2 5, the cost of the proposed method is less than that of the sliding window method. -

“Canonical” means that no two non-zero digits are adjacent in the representation. This representation is also called (width-2) Non-Adjacent Form (NAF) [8] .

103

2701 2

”

3

4

”

5

”

6 7 Windaw length

8

” I0

9

2

3

4

5

6 7 Window length

8

9

I0

Fig. 2. Cost of the computation of nP

5

Conclusion

We have proposed an application of the arborescence algorithm t o the scalar multiplication on an elliptic curve. Experiments show that the proposed method is superior t o the sliding window method for the window size 20 2 4. Although, it is found that the proposed method is inferior t o the Bos-Coster method [l]for this range of the window size. Improvement of the proposed method is remained as a future problem.

References 1. J. Bos, and M. Coster, “Addition chain heuristics,” Proceedings of CRYPTO’89 (1989), 400-407. 2. J. Edmonds, “Optimum branchings,” Journal of Research of the National Bureau of Standards, 71B (1967), no.4, 233-240. 3. D. R. Fulkerson, “Packing rooted directed cuts in a weighted directed graph,” Math.Program., 6 (1974), 1-13. 4. GNU MP LIBRARY, version ver 4.1.3, 2004. (http://www.swox.com/gmp/) 5. D.M. Gordon, “A survey of fast exponentiation methods”, J. Algorithms, 27 (1998), 129-146. 6. A. Miyaji, T. Ono, H. Cohen, “Efficient elliptic curve exponentiation,” Advances in Cryptology-Proceedings of ICICS ’9’7, LNCS 1334 (1997), Springer-Verlag, 282290. 7. NIST, “Recommended elliptic curves for federal government use”, 1999.

(http://csrc.nist .gov/encryption/dss/ecdsa/NISTReCur.pdf) 8. J.A. Solinas, “Efficient arithmetic on Koblitz curves”, Designs, Codes and Cryptography, 19 (2000), 195-249.

COMPUTER AIDED DIAGNOSIS IN DIGITAL MAMMOGRAMS: HYBRID META-HEURISTIC ALGORITHMS FOR DETECTION OF MICROCALCIFICATIONS K. THANGAVEL Department of Mathematics, GRI-Deemed University, Gandhigram -624302, Tamil Nadu M.KARNAN Dep t. of Computer Science, Gandhigram Rural Institute-Deemed University, Gandhigram 624302, Tamil Nadu, India. Fax: 91-4551-227229, karnanme(iivahoo. com R.SIVAKUMAR, A. KAJA MOHIDEEN Department of CSE,R. V.S.Collegeof Engg. & Tech., Dindigul, Tamil Nadu, India Mammography is currently the best method for early detection of breast cancer. In this paper, Simulated Annealing (SA), Tabu Search (TS), Genetic Algorithm (GA), and Ant Colony Optimization (ACO) and three novel hybrid combinations of TS and ACO such as Sequential TS-ACO, a Hybrid ACORS, and a Sequential ACO-TS are proposed for mammogram segmentation. Initially the mammogram images are enhanced and Markov Random Field (MRF) is applied to label the image pixels. The meta-heuristic algorithms are applied to found out the optimum label for segmentation. The Free-response Receiver Operating Characteristic (FROC) analysis was performed to compare the segmentation performance of the proposed algorithms. Statistical comparative analysis conclude that all of the three proposed novel techniques are significantly better than each of their non-hybrid competitors, and furthermore the Sequential ACO-TS provides the superior solution of all.

1

Introduction

Breast cancer is the most common malignancy in women in the United States and is the second leading cause of cancer-related mortality. Each year nearly 200,000 American women develop invasive breast cancer and more than 45,000 will die from the disease. The survival rate from breast cancer is significantly improved in women in whom breast cancers are clinically occult and are detected by mammography alone [ 2 ] . An increasing number of countries have started mass screening programs that have resulted in a large increase in the number of mammograms requiring interpretation. In the interpretation process, radiologists carefully search each image for any visual sign of abnormality. However, abnormalities are often embedded in and camouflaged by varying densities of breast tissue structures. Indeed, estimates indicate that between 10% and 30% of breast radiologists miss cancers during routine screening. With the aim of improving the accuracy and efficiency of screening programs for the detection of early signs of breast cancer, a number of research projects are focusing on developing methods for

104

105 Computer Aided Diagnosis (CAD) to assist radiologists in diagnosing breast cancer 1231. Thangavel et ul., [28] presented a good review on various methods for detection of microcalcifications. A key requirement in reducing the mortality rate due to breast cancer is to identify and remove malignant tumors at an early stage before they metastasize and spread to neighboring regions. Evidence of a breast tumor is usually indicated by the presence of a dense mass and a change in the texture or distortion in the mammogram. Consequently, the focus during diagnosis is on identifylng such abnormal regions, as well as on classifying the type of tumor that caused the abnormality. Two different techniques are used in the interpretation of mammograms. The first technique consists of a systematic search of each mammogram for visual patterns symptomatic of tumors. Such as, a bright, approximately circular blob with hazy boundary might indicate the presence of a circumscribed mass. The second technique, the asymmetry approach, consists of a systematic comparison of corresponding regions in the left and right breast. Significant structural asymmetries between the two regions can indicate the possible presence of a tumor. Once the suspicious regions are extracted from the background tissue, the textural features can be extracted to classify the microcalcifications into benign and malign. The feature selection algorithms can also be used to detect the optimum features, to eliminate the redundant and irrelevant features from the feature set. Thangavel et ul., proposed the feature selection algorithms based on meta-heuristic algorithms and rough set based reduction algorithms, which also improve the accuracy of the classifiers [29], [32]. Thangavel and Karnan [27] proposed the asymmetry principle using GA and ACO. In that work, bilateral subtraction is achieved in two steps. First, the mammogram images are enhanced using median filter, pectoral muscle region is removed and the border and nipple position of the mammogram is detected for both left and right images. The enhancement technique is evaluated by signal to noise ratios. Further GA is applied to enhance the detected border. The figure of merit is calculated to identify whether the detected border is exact or not. And the nipple position is identified for both left and right images using GA and ACO, and their performance is studied. The proposed algorithm for detection of breast border and nipple position are compared with the gradient-based algorithms for detection of breast border and nipple identification proposed by Mendez et al. [21] Second, using the border points and nipple position as the reference the mammogram images are aligned and subtracted to extract the suspicious region. Results obtained with a set of mammograms indicate that this method can improve the sensitivity and reliability of the systems for automated detection of breast tumors i.e. microcalcification. One of the demerits of the method is, which requires mammogram pair of a single patient to detect the microcalcification. This paper focused on processing a single mammogram image to find abnormalities.

106

In this paper, initially the mammogram image is smoothened using median filter and the pectoral muscle region is removed from the breast region. Next, Markov Random Filed (h4FW) is applied to label the mammogram image pixels. To optimize this MRF based image segmentation the meta-heuristic algorithms such as Simulated Annealing (SA), Tabu Search (TS), Genetic Algorithm (GA), and Ant Colony Optimization (ACO) and three novel hybrid algorithms consist of a Sequential TS-ACO, a Hybrid ACO/TS, and a Sequential ACO-TS are proposed. The rest of the paper is organized as follows: the following section describes the preprocessing and enhancement. Section 2 describes the segmentation of microcalcifications using meta-heuristic algorithms. The performance of the proposed algorithms is compared in section 3. And Section 4 concludes the work. 1.I

Preprocessing and Enhancement

In order to obtain digital mammograms with a completely uniform background, the input images are preprocessed. This reduces the effects of any film emulsion irregularities or noise, which may have been present in the original data. This step can also enhance the image contrast. The basic requirement is that no significant data should be lost. If a noise removal method is used, care should be taken to ensure that all information about microcalcifications in the image is left undamaged. In this work, initially the mammogram images are enhanced by median filter to remove the high frequency components (i.e. noise) from the image [21], [24]. Then the pectoral muscle region is removed using histogram based thresholding and morphological operations [27]. The enhanced mammogram image is further used for segmentation. 2

Segmentation of Mammogram

Segmentation is the initial step for any image analysis. There are two different tasks for segmentation of mammogram images. One is to obtain the locations of suspicious areas to assist radiologists for diagnose. The other is to classify the abnormalities of the breast into benign or malignant. Mammogram image segmentation has been approached from a wide variety of perspectives. Regionbased approach, morphological operation, multiscale analysis, h z z y approaches and stochastic approaches have been used for mammogram image segmentation but with some limitations. Markov random field model was used to deal with the spatial relations between the labels obtained in an iterative segmentation process [16], [ 171, [32]. The merit of MRF image modeling is that it provides a strong exploitation of the pixel correlations.

107

2.1

Markov Random Filed (W)

The mammogram image is stored in a two-dimensional matrix. A kernel is extracted for each pixel, [Kernel is a 3x3 window of neighborhood pixels] and unique label is assigned to all the kernels whose intensity values are equal. [18], [20], [26] For each kernel in the image, the posterior energy h c t i o n value U(x) is calculated as given bY U(x)={C [(y-p)2/ (2*0*) 3 + Clog (0)+ CV(X)> (1) where, y is the intensity value of pixels in the kernel, p is the mean value of the kernel, o is the standard deviation of the kernel, V is the potential function of the kernel, and x is the label of the pixel. If xlis equal to x2 in a kernel, then V (x) = p, otherwise 0, where p is visibility relative parameter (p >= 0). The segmentation results can be further enhanced via thc application of Maximum a Posteriori (MAP) segmentation estimation scheme. The MAP estimate can be written as: P(x1y) = exp (-U(x)), the challenge of finding the MAP estimate of the segmentation is search for the optimum label which minimizes the posterior energy function U(x). The meta-heuristic algorithms can be used to find out the optimum label. The following sections present the implementation of the metaheuristic algorithms for mammogram image segmentation. 2.2

Simulated Annealing (SA)

Simulated Annealing [ 191 is a statistical cooling optimization technique that generates a biased random search and employs Monte Carlo simulations under a variable probability schedule. In simulated annealing, successive state transitions are carried out from a current state to a new neighboring state. New states are generated by randomly performing moves in the current design. Moves are performed on any structural process design feature that is considered a degree of freedom for optimization. In the present case, moves are performed on the number of stages, on the flow rates of the streams and on the location of the streams. The types of moves that can be performed are expansion, contraction and alteration. Kirckpatrick et al. [19] and Cerny [5] proposed the use of SA for combinatorial problems [I], [6], [ 101, [31]. In this paper, initially a label i is selected randomly. In order to determine the value of the initial temperature, Tbeginis computed by solving the expression: Pa = e ACITbegin and hence, Tbegin= ( -AC ) / In Pa (2) Here AC represents the average deterioration value, which is computed as the cumulative value of the values of all worsening moves possible from the initial solution divided by the number of moves, which have caused a deterioration of the MAP function value. Parameter Pa represents the acceptance fraction, i.e. the ratio of the accepted to the total number of generated moves. Then a neighbor pixel 0 ) is selected and U(x) is calculated. The difference (d) between the U(x) value of both pixel is calculated. If d is less than 0 then the label of the jthpixel is assigned to i"

108 pixel. Then the cooling function is used for the reduction of the temperature in a simple geometric function. The temperature at iteration t, Tt, is obtained from the temperature of the previous iteration as follows: Tt = RTt-1 (3) where R represents the cooling rate. This procedure is repeated till the temperature value is near zero. The label from the final iteration is selected as optimal one to segment the mammogram. [Hint: In this work, the values assigned for P, = 0.5 and R = 0.991. Figure 1 shows the original mammogram and the segmented mammogram image by SA. Algorithm for segmentation using SA Select an initial label i Select an initial temperature T t (-AC) / In P, Set temperature change counter t = 0; Repeat Set repetition Counter n = 0; Repeat Generate next label j, a neighbor of i, - U(xj) d t U(X~) if d < 0 Then i = j else if random (0,l) < exp(d/T) Then i = j n=n+l Until n = N(t) t=t+l Tt = RTt.1 Until (T<0.001)

~

Figure 1. (a) Original Mammogram, @) Segmented Mammogram by SA

109

2.3

Tabu Search (TS)

Among the metaheuristics with memory presented in this section, TS is the only one that has been explicitly developed with a memory, or more exactly, with a set of memories. The term TS has been proposed for the first time by Glover [12]. The basic idea of this method is to locally and repeatedly modify a solution while memorizing these modifications to avoid visiting the same solutions to avoid visiting the same solutions twice or in a cyclic manner. To this end, modifications or characteristics of modifications are stored in lists that for bid their use for a certain number of iterations, thus leading to the names of taboo list and TS. A new candidate move in fact brings the solution to its best neighbor, but if the move is present in the tabu list, it is accepted only if it decreases the objective function value below the minimal level so far achieved. This process is repeated until a stopping criterion is reached. Glover and Laguna provided a good overview of TS and its applications [ 131, [25]. For mammogram segmentation, in each iteration a label is selected randomly and MAP estimate is calculated and added to the Tabu list. Then the U(x) value is compared with the values generated in the earlier iterations, the optimum one is selected from the list and considered as the best solution. The stopping criterion of this algorithm is reaching to the limited number of iterations between current iteration and iteration that best solution is reached. The following figure shows the segmented image from TS.

Figure 2. Segmented Mammogram by TS

Algorithm for segmentation using TS i t select a label s t calculate the U(xJ nbiter = 0; Current iteration bestiter = 0; iteration when the best solution has been found bestsol = s; best solution nbmax t 30 T t O While ( (s >S* ) and (nbiter - bestiter < nbmax)) do [S' t solutions calculated in the earlier iterations]

110 nbiter = nbiter + 1 Add the solution s to s* Choose a solution from S* which minimizing U(x) Update the the tabu list T If s < bestsol then bestsol = s* bestiter = nbiter * s=s End while

2.4

Genetic Algorithm (GA)

Genetic Algorithm was conceived by Holland [15]. Genetic algorithms start with a set of randomly selected chromosomes as the initial population that encodes a set of possible solutions. In GA, variables of a problem are represented as genes in a chromosome, and the chromosomes are evaluated according to their fitness values using some measures of profit or utility that we want to optimize. Recombination typically involves two genetic operators: crossover and mutation. The genetic operators alter the composition of genes to create new chromosomes called offspring. In crossover operator, it generates offspring that inherits genes from both parents with a crossover probability P,. In Mutation operator, it inverts randomly chosen genes in a chromosome with a mutation probability P,. The selection operator is an artificial version of natural selection, a Darwinian survival of the fittest among populations, to create populations from generation to generation, and chromosomes with better fitness values have higher probabilities of being selected at the next generation. ARer several generations, genetic algorithms can converge to the best solution. In our GA implementation, the labels of the image pixels are considered as population strings and the U(x) values are considered as fitness values. The cumulative probability of the fitness values is calculated for reproduction. In reproduction, a random number is generated; the label, which has the cumulative fitness value greater than this random number, has the increment by 1 with its selection count. This procedure is repeated the number of times equal to the population size. Then the population is reproduced with the selected labels such as each string is copied corresponding to the number of times it has been selected. In crossover, random pairs of strings are selected and a random number is generated to represent the number of bits to be exchanged between the parents. In mutation, a bit position is selected randomly and its value is complimented to generated new string. At the end of applying genetic operators, the new population is generated. The next iteration is continued with the new population of strings. This procedure is repeated for number of times. In each iteration the minimum of the population is calculated

111

and which is considered as the optimum label for mammogram segmentation. Figure 3 shows the segmented mammogram image by GA.

Figure 3. Segmented Mammogram by GA

Algorithm for segmentation using GA 1. 2. 3.

4.

5.

6.

7.

P t initial population of Labels; U(x) t considered as fitness values Calculate the probability of each fitness value Calculate the cumulative probability, CP Reproduction a. r t random() b. if (CPi > r) count=count + 1 for CPi c. Repeat the steps (a) and (b) for all the population strings. d. If count=O, then delete that Pi e. Reproduce the population by copying the selected strings with the corresponding number of times it has been selected. Crossover a. r t random() b. Randomly select the pair of strings for matting c. if (r >= Pc), k t random bit position interchange the bits after kthposition in parent1 & parent2. repeat crossover for all the pairs. Mutation a. r t random() b. if (r >= P,,,), k t random bit position complement the value of the kthbit repeat this steps for all the strings. P,,, t new population; P t P,,, ;

112

2.5

Ant Colony Optimization (ACO)

Dorigo and colleagues [7] first proposed ant algorithms as a multi-agent approach to difficult combinatorial optimization problems such as the Traveling Salesman Problem (TSP) and the Quadratic Assignment Problem (QAP). There is currently much ongoing activity in the scientific community to extend and apply ant-based algorithms to many different discrete optimization problems. Ant colony optimization (ACO) is a constructive population-based search technique to solve optimization problems by using principle of pheromone information [8], [22]. It is an evolutionary approach where several generations of artificial agents in a cooperative way search for good solutions [3], [14]. Agents are initially randomly generated on nodes, and stochastically move from a start node to feasible neighbor nodes. During the process of finding feasible solutions, agents collect and store information in pheromone trails. Agents can online release pheromone while building solutions. In addition, the pheromone will be evaporated in the search process to avoid local convergence and to explore more search areas. Thereafter, additional pheromone is deposited to update pheromone trail offline so as to bias the search process in favor of the currently optimal path [4], [9], [l 11. In the first step the pheromone value of each ant are initialized with the initial pheromone (To). And for each ant select a random pixel from the image which has not been selected previously. To find out the pixels is been selected or not, a flag value is assigned for each pixel. Initially the flag value is assigned as 0, once the pixel is selected the flag is changed to 1. This procedure is followed for all the ants. For each ant a separate column for pheromone and flag values are allocated in the solution matrix. The pheromone values for all the randomly selected pixels are updated using the following equation: (4) Tnew = (1 - P) * Told + P * TO where Toldand T,,, are the old and new pheromone values, and p is rate of pheromone evaporation parameter in local update, ranges from [0,1] i.e., 0 < p < 1. Calculate the posterior energy function value for all the selected pixels by the ants from the solution matrix. Then compare the posterior energy function value for all the randomly selected pixels from each ant, to select the minimum value from the set, which is known as ‘Local Minimum’ (Lmin) or ‘Iterations best’ solution. This value is again compared with the ‘Global Minimum’ (Gmin). If the local minimum is less than global minimum, then the global minimum is assigned with the current local minimum. Then the ant, which generates this local minimum value, is selected and whose pheromone is updated using the following equation: (5) Tnew = (1 - a) * Toid + a * AToid, where Toldand T,,, are the old and new pheromone values, and a is rate of pheromone evaporation parameter in global update called as track’s relative importance, ranges from [0,1] i.e., 0 < a < 1, and A is equal to ( 1 / Gmin). For the remaining ants their pheromone is updated as: 9

113

(6) Tnew= (1 - a) * Told, here, the A is assumed as 0. Thus the pheromones are updated globally. This procedure is repeated for all the image pixels. At the final iteration, the Gmin has the optimum label of the image. To further enhance the value, this entire procedure can be repeated for more number of times. Figure 4 shows the segmented mammogram by ACO.

Figure 4. Segmented Mammogram by ACO

Algorithm for segmentation using ACO 1. Li t Label Matrix for fitness values 2. Initialize all the ants with To, the initial pheromone value 3. Repeat for M times a. For each ant k = 1..K and For each pixel i. q t random() ii. if (q So); Li t max(L) iii. else Li t random() End b. For each ant and For each pixel If (Li =max(L)) then Tnew=(l -P) * Told+f * TO End c. For each ant locally improve the fitness value d. Lin t mir1(F2~); e. If (Lin < G,i, ) then Gmin= &in f. Update the pheromone of the ants globally; Tnew= (1 -a)* Told + a * ATold,where 0 < a < 1, and A t (l/Gmin) for the ant which generates the G,i,, for the remaining ants A t O ; 4. End

114

3

Hybrid Methods

By studying the behavior of each of the above pure methods, a properly chosen combination may be expected to yield improved performance. Here, three hybrid methods that combine the best metaheuristics of above discussed methods, ACO and TS, are implemented in various ways, Sequential TS-ACO (STA): In this method, first use TS algorithm and, based on the best solution reached by TS, justify amount of initial pheromones in ACO path. This causes increasing initial pheromones in the path of better solutions and so guides the ants to fine-tune this path faster. To achieve above strategy, TS is first applied and then pheromone is updated globally before applying ACO and use final solution of TS as a good initial solution in this part. Then ACO is used in the main framework but with modified amount of initial pheromone. In fact the ant in the beginning of their efforts is guided in finding the best path. Hence the amount of initial pheromone affects the quality of initial as well as final solutions can be investigated. The following figure illustrates the procedure for sequential TS-ACO algorithm. Figure 5 illustrates the procedure for STA algorithm. And figure 6 shows the

Local Pheromone Update

Consider bs as the path of first ant

. ACS

Final Solution

Figure 5. The block diagram of Sequential TS-ACS

Figure 6. Segmented Mammogram by STA

HybridACO/TS (HAT): In this method ACO is the main algorithm and TS is used in local search part of it. TS is terminated before it converges and the process continues for the remaining steps of ACO. In other words, this hybrid method has

115 another strong metaheuristic searching engine in the local search, which is a strategic and important part. The following figure shows the flow diagram for hybrid ACO/TS algorithm. Initialization

Start ACO with construction of k ants

Gmin +- best solution of TS

Local Search using TS

Final Solution Figure 7. The block diagram of Hybrid ACSRS

Figure 8. Segmented Mammogram by HAT

Sequential ACO-TS (SAr): Since ACO can produce better initial solutions, and at the same time, TS in comparison to ACO has highest cost reduction, in this hybrid method, at first, use ACO algorithm and then use the solution of this as a good initial solution in the TS algorithm. The following figure shows the block diagram for sequential ACO-TS algorithm. Initialization

ACO

TS Figure 9. The block diagram of Sequential ACS-TS

Figure 10. Segmented Mammogram by SAT

bs t best solution of

Initial solution of TS +- bs

116 4

Performance Analysis

The mammograms used in this experiment were c-tained from Mammographic Image Analysis Society (MIAS) database. To validate the accuracy of the segmentation method, compared with the ground truth, the Free-response Receiver Operating Characteristic (FROC) analysis was performed. A region extracted in the segmented image, which overlaps 75% of the region with a true abnormality as provided in the MIAS database is called True Positive (TP) detection. All other regions extracted by the algorithm are labeled as False Positives (FP). 1 0.8

0.6

e a

0.4 0.2 0

0

0.2

0.4 0.6 Fake Posit&

0.8

1

Figure 11. FROC curves for Segmentation using Meta-Heuristic Algorithms

The segmented image is thresholded using 10 operating points and the detection rates are calculated. A plot is made between the percentage of TP and FP at each operating points to generate the FROC curve. Figure 11 shows the FROC curves generated on the full test set from the proposed algorithms. The area under the FROC curve (A, value) is used for evaluating diagnostic performance. The higher value of A, indicates the perfect performance. The A, value for the proposed algorithm is 0.96. The Table 1 shows the comparison of detection rate between the proposed algorithms. The exact true positive rates are given in Table 1. Algorithms Detection Rate

SA

GA

TS

ACO

STA

HAT

SAT

65.4%

71.6%

72.8%

82.9%

85.5%

88.2%

96.2%

Discussion: In the Sequential TS-ACO, the amount of pheromones are modified by

applying TS before ACO. In this way, ants select the paths, which have more pheromone on them. In the other word the labels with more initial pheromones have more chances to be selected and others have less. So biased search in the solution space and the algorithm may converge faster in local minima. In the Hybrid ACO/TS, using TS is time-consuming for each ant and therefore one iteration of ACO takes a substantially longer duration, and so for a complete run of it. Therefore only a few iterations of the main algorithm can be applied in the indicated points.

117

The Sequential ACO-TS method at first produces a good solution by using ACO but in final iterations of ACO, solutions are converged, because of a lots of pheromone in good regions and a poor amount of it in bad ones. In that time ACO causes poor improvement in the final solution and it is useless for us to continue it. Therefore ACO is interrupted before its convergence and then use TS to search the neighborhood region. TS search the neighborhood of solution and can find the minimum point in the close points. The reason for choosing TS is, considering results of previous section; it worked better than other metaheuristic methods. The characteristics of Sequential ACO-TS method allows it to work better than others. Also the final results, which have been shown in, Table 1 shows that the Sequential ACO-TS works better than Sequential TS-ACO and Hybrid ACO/TS. 5

Conclusion

In this paper, four conventional metaheuristic methods (Simulated Annealing, Tabu Search, Genetic Algorithm and Ant Colony Optimization) and proposed three new hybrid methods for segmentation of microcalcifications in digital mammograms. When comparing the results of the conventional methods, ACO showed better ability to find best solutions while TS algorithm worked better terms of improving its performance over time. In comparison, all of the three hybrid methods demonstrate better results than all metaheuristic methods. Among the hybrids, the Sequential ACO-TS is better than others. Acknowledgement

The first author acknowledges the University of Grant Commission (UGC), New Delhi, for extending financial support under UGC Special Assistance Programme. References

Aarts E. and Korst J., Simulated Annealing and Boltzmann Machines (Wiley, Chichester, 1989). Alberta Cancer Board, Alberta, Canada, www.cancerboard.ab.ca/screentest. Screen Test: (2004). Beckers, R. Deneubourg J. L. and Goss S., Trails and U-turns in the selection of the shortest path by the ant Lasius. niger: J. Theor. Bio. 159 (1992) pp. 397415. Bell J. E. and McMullen P. R., Ant colony optimization techniques for the vehicle routing problem. Adv. Engineering Znformatics 18 (2004) pp. 41-48.

118

5. Cerny V., A themo-dynamical approach to the traveling salesman problem: an efficient simulation algorithm.,Journal of Optimization Theoly Application 45 (1985) pp. 41-51. 6. Collins N., Eglese R. and Golden, B., Simulated annealing an annotated bibliography. American Journal of Mathematical and Management Sciences, 8 (1998) pp. 209-307. 7. Dorigo M., Bonabeau E. and Theraulaz G., Ant algorithms and stigmergy. Future Generation Computer Systems 16 (2000) pp. 851-871. 8. Dorigo M. and Gambardella L. M., Ant colonies for the traveling salesman problem. BioSystems 43 (1997) pp. 73-8 1. 9. Dorigo M. and Stutzle T., Ant Colony Optimization. (PHI, New Delhi, 2005). 10. Eglese R.W., Simulated Annealing: A Tool for Operational Research. European Journal of Operational Research, 46 (1990) pp. 271-281. 11. Ge Y., Meng Q. C., Yan C. J. and Xu, J., A Hybrid Ant Colony Algorithm for Global Optimization of Continuous Multi-Extreme Functions. Proceedings of the Third International Conference on Machine Learning and Cybernetics, Shanghai, (2004) pp. 2427-2432. 12. Glover F., The General Employee-Scheduling Problem: An Integration of Management Science and Artificial Intelligence. Computers and Operations Research 15 (1986) pp. 563-593. 13. Glover, F. and Laguna, M.: Tabu Search, Kluwer Academic Publishers (1997). 14. Goss S., Aron S., Deneubourg J. L. and Pasteels J. M., Self-organized shortcuts in the Argentine ant. Naturwissenschaften 76 (1989) pp. 579-581. 15. Haupt R.L. and Haupt S.E., Practical Genetic Algorithms. (Wiley-Interscience Publication, John Wiley, New York 1997). 16. Karssemeijer N., A stochastic model for automated detection of calcifications in digital mammograms. 12Ih International Conference I P M , Wye, UK (1992) pp. 227-238. 17. Karssemeijer N.: Adaptive noise equalization and recognition of microcalcification clusters in mammograms. Int. J. Pattern Recognition Art$ Zntell. 7(6) (1993) pp. 1357-1376. 18. Kervrann C. and Heitz F., A Markov Random Field Model-Based Approach to Unsupervised Texture Segmentation Using Local and Global Statistics. IEEE Transactions on Image Processing 4(6) (1995) pp. 856 - 862. 19. Kirckpatric S., Gellat C. and Vecchi M., Optimization by Simulated Annealing. Science 220 (1983) pp. 671-680. 20. Li H.D., Kallergi M., Clarke L.P., Jain V.K. and Clark R.A., Markov Random Field for Tumor Detection in Digital Mammography. IEEE Transactions on Medical Imaging 14(3) (1995) pp. 565 - 576. 21. Mendez A. J., Tahocesb G., Lado J., Souto M., Correa J. L. and Vidal J. J., Automatic Detection of Breast Border and Nipple in Digital Mammograms. Computer Metho& and Programs in Biomedicine 49 (1996) pp. 253-262.

119

22. Ouadfel S. and Batouche M., MRF-based image segmentation using Ant Colony System. Electronic Letters on Computer Vision and Image Analysis 2(2) (2003) pp. 12-24. 23. Peitgen H.O., editor. Proceedings of the 6th International Workshop on Digital Mammography, Brernen, Germany, Springer- Verlag (2002). 24. Qian W., Clarke L. P., Kallergi M. and Clark R. A., Tree-Structured Nonlinear Filters in Digital Mammography. IEEE Trans. on Medical Imaging 13(1) (1994) pp. 25-36. 25. Rosing K.E., ReVelle C.S., Rolland E., Schilling D.A. and Current, J.R., Heuristic concentration and Tabu Search: a head to head comparison, European Journal of Operational Research 104 (1998) pp. 93-99. 26. Speis A. and Healey G., An Analytical and Experimental Study of the Performance of Markov Random Fields Applied to Textured Images Using Small Samples. IEEE Transactions on Image Processing 5(3) (1996) pp. 447 458. 27. Thangavel K. and Karnan M., Computer Aided Diagnosis in Digital Mammograms: Detection of Microcalcifications by Meta Heuristic Algorithms. International Journal on Graphics Vision and Image Processing 7 (2005) pp. 41-55. 28. Thangavel K., Kaman M., Siva Kumar R. and Kaja Mohideen A., Automatic Detection of Microcalcification in Mammograms-A Review. International Journal on Graphics Vision and Image Processing, 5( 5 ) (2005) pp. 3 1-61. 29. Thangavel K., Karnan M., Siva Kumar R. and Kaja Mohideen A.: MetaHeuristic Algorithms for Feature Selection and Classification of Microcalcifications In Digital Mammograms. International Journal on Graphics Vision and Image Processing (2OOS)(Accepted). 30. Thangavel K., Karnan M., Siva Kumar R. and Kaja Mohideen A., Performance Analysis of Rough Reduct Algorithms in Mammogarms, International Journal on Graphics Vision and Image Processing (2005) (Accepted). 31. Van Laarhoven P. and Aarts E., Simulated Annealing: Theory and Practice. (Kluwer Academic Publishers, Dordrecht, The Netherlands, 1987). 32. Veldkamp W.J.H. and Karssemeijer N.: An improved method for detection of microcalcification clusters in digital mammograms. The SPIE Conference on Image Processing 3661 (1999) pp. 512-522.

EFFICIENT BRANCH-AND-BOUND ALGORITHMS FOR WEIGHTED MAX-2-SAT

Y . K O G A ~ M. , Y A G I U R A ~K., NONOBE§ T. I M A M I C H I ~H, . N A G A M O C H I ~AND T. I B A R A K I ~ t K y o t o University, Yoshidahonmachi, Sakyo-ku, Kyoto, 606-8501 Japan E-mail: {koga, irna, nag} @amp.i.kyoto-u.ac.jp %NagoyaUniversity, Furo-cho, Chikusa-ku, Nagoya, 464-8603 Japan E-mail: [email protected] §Hosei University, 3-7-2 Kajino-cho, Koganei-shi, Tokyo, 184-8584 Japan E-mail: nonobeok. hosei.ac.jp nKwansei Gakuin University, 2-1 Galcuen, Sanda-shi, Hyogo, 669-1337 Japan E-mail: [email protected] Given a set of m clauses on n propositional variables, where each clause contains at most two literals and is weighted by a positive real, MAX-2-SAT asks to find a truth assignment that maximizes the total weight of satisfied clauses. In this paper, we propose two branch-and-bound algorithms for solving MAX-2-SAT exactly. Our algorithms feature two new transformation rules and two efficient algorithms for computing lower bounds. Computational comparisons on benchmark instances disclose that these algorithms are highly effective in reducing the number of search tree nodes as well as computation time of branch-and-bound algorithms.

1. Introduction

In this paper, we deal with the weighted MAX-2-SAT problem, which is one of the representative combinatorial optimization problems and is known to be NP-hard. Variables x~jE {0,1} ( j E N = {1,2,. . . ,n } ) and their negations %j = 1 - xj are called literals and a disjunction of literals (e.g.,Ci = 2 1 V Z,) is called a clause. A clause Ci is satisfied if the value of at least one literal is 1, and is unsatisfied otherwise. Given m clauses C = {Cl,Cz, . . . , Cm}each having at most two literals and positive weights w = (w1, wz,. . . ,wm), the MAX-2-SAT asks for a truth assignment that maximizes the sum of the weights of satisfied clauses. Since mid 199O’s, several branch-and-bound algorithms have been proposed for MAX-2-SAT. In the subsequent discussion, for consistency of notations, we consider MAX-2-SAT as a minimization problem of the total weight of unsatisfied clauses. One of the important common ideas is the 120

121

use of logical rules for fixing variables or simplifying clauses without losing the optimality of given i n s t a n ~ e s l > ~Such > ~ . rules are called transformation rules. In addition, algorithms that compute lower bounds on the total weight of unsatisfied clauses have been intensively studied7. Lower bounds are important for the following test in branch-and-bound algorithms. Denoting the incumbent value by UB (i.e., the value of the best solution found by then), and by LB a lower bound on the total weight of unsatisfied clauses for a given partial problem, we can terminate solving this partial problem with UB 5 LB. This is called the lower bound test. In this paper, we propose two branch-and-bound algorithms for MAX2-SAT, featuring two new transformation rules and two efficient algorithms for computing lower bounds. 2. Lower bound based on conflict cycles

Denoting implication by +, a clause a V b is equivalent to ti + b and b + a. Given an instance (C, w)of MAX-2-SAT, we define an edge-weighted digraph G(c,,,) that represents all implications from the given clauses. The vertices are 2n literals x1,22,.. . ,x,, 31, 3 2 , . . . , Z,, and for each clause Ci = a V b E C with a # b, the graph has two edges (a,b) and (6, a ) each having weight wi. where each of these two edges is called the dual edge of the other. Edges (a,b) and (6, a ) correspond to 7i + b and 6 =+ a , respectively. For a unit clause Ci = c, the graph has an edge (Z, c ) with weight wi (note that c is equivalent to c V c). Figure 1 shows an example of G(c,,,).

Figure 1. (21

v 221

An example of G(c,,,) for C = b2

v 23, 2 3 v ? 4 ? I1

and w = (3,2,5,1,4)

247 b l )

Figure 2 .

An example of a conflict cycle

For a variable x, if there exists a pair of paths, one from x to 3 and the other from 3 to x (see Fig. 2), then no truth assignment can simultaneously satisfy all the clauses that correspond to those edges in the paths. Hence at least one clause among them is unsatisfied. We call the cycle formed by such a pair of paths a confiict cycle. (Note that it is allowed to use the same edge more than once in the cycle.) For convenience, we say that a cycle contains a clause if an edge in

122

the cycle corresponds to the clause, and call two cycles are disjoint if they contain no common clauses. Given a set of mutually disjoint conflict cycles, at least one clause is unsatisfied in each conflict cycle, and for any conflict cycle, the weight of the unsatisfied clause is a t least the minimum weight in the cycle. Therefore, if a set of disjoint conflict cycles are found, we can get a lower bound by taking the sum of minimum weights in the disjoint cycles. This argument can be extended to the case in which some conflict cycles have common edges, because any two conflict cycles can be considered as disjoint by duplicating each common edge and distributing its weight to the two edges so that the sum of the weights of the two edges is equal to the weight of the original edge. We call the algorithm to compute a lower bound based on this idea CONFLICT-CYCLES, but we omit the details due to space limitations. 3. Lower bound from the set covering formulation

MAX-2-SAT is equivalent to the problem of finding a set of edges (and their dual edges) with the minimum total weight to be deleted from graph G ( C , ~ ) so that the resulting graph has no conflict cycle. This is formulated as the set covering problem as follows. Let H = { 1 , 2 , . . . , q } be the index set of all conflict cycles (q can be exponentially large) and Si C H be the set of indices of conflict cycles containing a clause Ci (i.e., h E Si iff the conflict cycle whose index is h contains Ci). We define a q x m 0,l-matrix (ahi) by ahi = 1 h E Si. Let ( H , ( a h i ) ) be a set covering problem that minimizes the objective function f ( 2 ) = CiEM W i Z i subject to ahizi 2 1 for V h E H and zi E {0,1} for Vi E M , where M = { 1 , 2 , . . . , m}. The following lemma is useful for getting lower bounds.

*

xiEM

Lemma 3.1. Let SCPLP(H') be the linear programming (LP) problem defined f o r any subset H' c H of conflict cycles. Then the optimal value of SCPLP(H') gives a lower bound on the total weight of unsatisfied clauses. We therefore choose a subset H' c H of a manageable size to solve its LP relaxation. We start with a small H' and iteratively add conflict cycles to increase the lower bound; an idea similar to the column generation technique originally used by Gilmore and Gomory4. We call the algorithm to compute a lower bound based on this idea SCPLP. 4. Transformation rules

Transformation rules simplify a given instance by fixing some variables or merging some clauses together without losing its optimality. Such rules fix

123

a portion of weights to be satisfied (resp., unsatisfied), which we denote by ws";l"t(resp., wufiisat>.Let OPTunsat(C1 w) be the optimal value (total weight of unsatisfied clauses) of an instance (C,w) and s u m ( w ) be the sum of the weights of all clauses. A transformation rule replaces (C,w) with a simplified instance (C', 20') so that OPTUnsat(C, w) = OPTUnsat(C',w') w:&,~ and s u m ( w ) = sum(w') wsfiaxt wufiisat axe satisfied. We propose two new transformation rules, which are inspired by the lower bounds LB3 and LB4 by Shen and Zhang?. For simplicity, we explain our rules for the unweighted case (wi= 1,VZ E M), because generalizing them t o the weighted case is easy.

+

+

+

Reversible clause Replace three clauses a V b, a, with ii V 6,and set w,",", = 1 and wufinxsat= 1.

6 E C,

if they exist,

Reversible clause I1 Replace four clauses a V c, b V F, 6 , 6 E C, if they exist, with two clauses ii V c and 6 V c, and set ws",x,= 1 and w:,xSat = 1. The merit of these rules is that they can increase wW,ficsat, which results in raising the lower bound. We also incorporate in our algorithm the following six existing rules: pure literal', dominating unit clause6, unit clause bounding almost common clauses2, complementary unit clause6 and max p o w fixing5.

',

5. Experimental Results In this section, we report experimental results. We abbreviate the branchand-bound algorithm using CONFLICT-CYCLES as BB-C and the one using S C P L P as BB-S. The algorithms were coded in C language and run on a P C (Xeon 3.06GHz). The problem instances in these results are unweighted random instances generated by Borchers and F'urman3. For comparison purposes, we also show the results of three solvers: (1) SZ the branch-and-bound algorithm for MAX-2-SAT by Shen and Zhang7, (2) MaxSolver - the branch-and-bound algorithm for MAXSAT by Xing and Zhangg, and (3) CPLEX - a general-purpose integer programming solver CPLEX9.0. SZ and MaxSolver were run on a P C (Pentium 4 2.4GHz) and CPLEX was run on a P C (Xeon 3.06GHz). Table 1 shows the experimental results. The numbers of nodes for SZ and MaxSolver were not reported in their papers. As for B B X , the number of nodes is much less than CPLEX, and the computation time is the smallest among all solvers for large n. As for BB-S, the number of nodes is the smallest among all solvers. However, the computation time of S C P L P is

124

Table 1. Computational results for unweighted MAX-2-SAT instances SZ MaxSolver CPLEX BB-C BBS n m unsat time time node time node time node time 50 300 32 0.07 0.07 2139 1.22 141 0.20 1 0.55 50 350 41 0.11 0.12 4858 2.64 101 0.21 5 1.88 50 400 45 0.06 0.09 3352 55 0.17 2.61 1 0.74 63 50 450 0.21 0.65 46405 28.72 493 0.63 13 8.37 50 500 66 0.15 0.42 11792 9.58 171 0.33 9 6.69 100 300 15 0.40 0.04 148 0.18 67 0.20 1 0.26 100 400 29 0.98 0.32 4179 2.90 167 0.43 5 2.02 100 500 44 62.00 11.82 57689 47.34 1947 4.50 35 31.81 100 600 65 130.00 106.22 1029295 1055.11 7053 16.86 119 168.23 150 300 4 0.08 0.06 6 0.13 11 0.19 1 0.20 150 450 22 7.11 1.93 2834 2.68 243 0.70 1 0.67 150 600 38 54.00 10.41 20012 24.42 951 2.92 3 4.52 n: #variable m: #clause unsat: optimal value node: #search tree node time: computation time (sec) Note. CPU times are on different computers. expensive, and hence BB-S is slower than other solvers for small instances, while it becomes competitive as the size of instances becomes larger. This results disclosed t h a t BB-C and BB-S are highly effective in reducing the number of search tree nodes as well as computation time.

References 1. T. Alsinet, F. Manya, and J. Planes. Improved branch and bound algorithms for max-sat. In Sixth International Conference on Theory and Applications of Satisfiability Testing, pages 408-415, 2003. 2. N. Bansal and V. Raman. Upper bounds for maxsat: Further improved. In

3.

4. 5. 6. 7. 8.

9.

I S A A C '99: Proceedings of the 10th International Symposium on Algorithms and Computation, pages 247-258, London, UK, 1999. Springer-Verlag. B. Borchers and J. Furman. A two-phase exact algorithm for MAX-SAT and weighted MAX-SAT problems. Jounal of Combinatorial Optimization, 2:299306, 1999. P. C. Gilmore and R. E. Gomory. A linear programming approach to the cutting stock problem. Operations Research, 9:849-859, 1961. E. A. Hirsch. A new algorithm for MAX-2-SAT. In STACS 8000: 17th Annual Symposium on Theoretical Aspects of Computer Science, pages 65-73, 2000. R. Niedermeier and P. Rossmanith. New upper bounds for maximum satisfiability. Journal of Algorithms, 36( 1):63-88, 2000. H. Shen and H. Zhang. Improving exact algorithms for max-2-sat. In Eighth International Symposium on Artificial Intelligence and Mathematics, 2004. R. J. Wallace. Enhancing maximum satisfiability algorithms with pure literal strategies. In McCalla, editor, Advances in Artificial Intelligence, 1 l t h Biennial Conference of the Canadian Society for Computational Studies of Intelligence (AI96), volume 1081, 1996. Z. Xing and W. Zhang. Maxsolver: An efficient exact algorithm for (weighted) maximum satisfiability. Artificial Intelligence, 164(1-2):47-80, 2005.

TAMPER RESISTANCE VIA POISSON DISTRIBUTION D SEETHA MAHALAXMI J N T U, Hyderabad,AP,India [email protected]

DR.P R K MURTI Universiw of Hyderabad,AP,India prkmcs@uohyd. ernet.in

1 Watermarking

Watermarking is a technique where we are going to embed secret message in a cover message. There are two types of watermarking. They are Static watermarking and Dynamic watermarking. Static watermarks are stored in the application executable itself. Dynamic Watermarks are stored in a program execution state, rather code itself. This makes them easier to tamperproof against obfuscating transformation. Technique such as watermarking and fingerprinting have been developed to discourage piracy [l], however, if no protective measures are taken, an attacker may be able to remove and/or destroy watermarks and fingerprints with relative ease once they have been identifed. For this reason, methods such as source code obfuscation [3],code encryption [6] and self-verying code have been developed to help achieve some measure of Tamper-Resistance. One application of obfuscating address computations is to hide from an attacker the true target of some control transfer, In Software Tamper Resistance: Obstructing static analysis of programs [2] they have focused their efforts at the source code level. At the assembly level, however, even such modifications are translated into unambiguous control transfers such as direct and conditional jumps. A method of indirection [5,6] in which direct control transfers such as call and jump instructions are replaced with calls to specialized functions that are responsible for directing control to the intended targets in some stealthy manner. The intent is that the successors of each transformed basic block will be difficult to discover. The specialized functions, which they have termed as 125

126

branch functions are used in order to achieve this. The Dynamic path based watermarking [4] was another technique proposed where it makes use of the forward branching. Path-Based Watermarking is that which embeds the watermark in the runtime branching structure of the program. In our work we have used Poission distribution to increase the strength of the Watermark against the reverse engineering attacks. Poisson distribution is a distribution related to the probability of events which are extremely rare but which have large number of independent opportunities. The Poisson distribution is used to model the number of events occurring within a given time interval. There are various types of attacks that are possible on the watermarked program. Some of them are additive attack, subtractive attack, distortive attack and reverse engineering. In our paper we have mainly focused on protecting the watermarked program from reverse engineering. For the implementation of the same we have made use of the Poisson Distribution formula. These makes the watermarked program free fi-om the reverse engineering, as the value of h is fixed. References [I] Brigit Pfitzmann and Matthias Schunter, Asymmetric fingerprinting, Lecture Notes in Computer Science, 1996. [2] Chenxi Wang, Jonathan Hill, John Knight, and Jack Davidson, Software Tamper Resistance: Obstructing static analysis of programs, Technical Report CS-2000-12, 12 2000. [3] Cho W, Lee I, and Park S, Against Intelligent Tampering: Software Tamper Resistance by extended control flow obfuscation, in Proc. World Multiconference on Systems, Cybernetics, and Informatics, International Institute of Informatics and Systematics, 2001. [4] Collberg C, Carter E, Debray S, Huntwork A, Linn C, Stepp M, Dynamic Path Based Software Watermarking, Department of Computer Science University of Arizona Tucson, AZ 85721, USA. [5] Cullen Linn, Saumya Debray, John Kececiog, Enhancing Software TamperResistance via Stealthy Address Computations Department of Computer Science, University of Arizona, Tucson, AZ 8572 1. [6] Tomas Sander and Christian F. Tshudin, On software protection via function hiding, Lecture Notes in Computer Science, 1525: 111-123, 1998.

CASE: CELLULAR AUTOMATA BASED SYMMETRIC ENCRYPTION

S. TRIPATHY AND S. NAND1 Department of Computer Science and Engineering, Indian Institute of Technology Guwahati, India. Email: {som,sukumar} @iitg.ernet.in

This paper proposes CASE, a new symmetric encryption algorithm using onedimensional Reversible Cellular Automata (RCA) and Non-autonomous Cellular Automata (NCA). CASE supports 128-bit block size and 128/ 192/ 256-bit keys to meet the AES specification7. Design principle of CASE follows the most desirable properties like confusion and diffusion. The scheme is more attractive for its simple implementation.

1. Introduction The necessity for faster cryptosystems calls for efficient hardware realization and parallel implementation of the same. However, it is difficult to find an encryption technique with high parallelism, unless it is especially constructed. Several CA based encryption schemes have been proposed and analyzed in 1,2,3,4 for this reason. Unfortunately most of these schemes are insecure or resource inefficient. Sen et al. proposed a Cellular Automata based Cryptosystem (CAC) in where non-linearity is achieved by mixing the affine cellular automata with non-affine transformations. Bao in pointed out that this scheme is vulnerable to chosen plaintext attacks. Recently, Seredynski et.al., proposed a reversible CA (RCA) based encryption algorithm in This scheme satisfies the strict avalanche criteria, but it needs extra communication overhead (cipher size). This paper, proposes CASE, a CA based Symmetric Encryption scheme for fast implementation. It uses the inherent parallelism of Cellular Automata (CA), and its modular architecture simplifies the hardware realizations. The proposed scheme is presented after introducing Cellular Automata in next section.

'

'.

127

128

2. Cellular automata (CA)

Cellular automata (CA) are discrete dynamical systems consisting of a grid of identical finite state machines whose states are updated synchronously at discrete time steps according to a local update rule f . For a 2-state, 3-neighborhood CA, the evolution of the ith cell can be represented as a function of the present state of ( i - l ) t h ,zth, and ( i + l ) t hcells as z i ( t + l ) = f(zi-l(t),zi(t),zi+l(t)). So there are 223 = 256 possible different next state functions, and each of them can be specified by a decimal integer called the rule number. A CA is said to be periodic boundary if the extreme cells are adjacent to each other (more details about CA please refer to Non-autonomous Cellular Automata (NCA): By incorporating external inputs to the CA, one obtains many possible successor configurations for a particular configuration depending on the external inputs. Such a CA is named as Non-autonomous CA (NCA). A local state q ( t ) for ith cell on time t of an NCA may be expressed as: zi(t 1) = f ( z i - l ( t ) ,z i ( t ) , z i + l ( t ) ) q ( t ) where, is the external input for zth state. Reversible Cellular Automata (RCA): A cellular automata is said to be reversible, if the previous configuration can be restored from the current configuration. In this proposal we use a class of reversible CA (RCA) where the configuration of the ith state on clock cycle t 1 is determined by the states of the r-neighborhood configuration at clock cycle t and the configuration of ith state at (t - l)thclock cycle. For example a 3-neighborhood RCA can be expressed as zi(t 1) = f ( z i - l ( t ) , z i ( t ) , ~ i + l ( t ) , z-i (1)). t ").

+

+

+

+

3. CA based Symmetric Encryption Scheme (CASE) The proposed scheme can be scaled to any size of 4wbits with key size of 4w/6w/8w where, w is the word size. In this paper we discuss CASE using 128-bit (w=32-bit) block size. Two processing (Encryption/ Decryption and Key schedule) phases of CASE discussed subsequently use 3-neighborhood periodic boundary CA. 3.1. Encryption/ Decryption phase The encryption phase has 16 similar rounds and an output transformation (half round) at the last step to make both the encryption and decryption structure equal. Each round of this phase comprises of two stages. To achieve the desired level of non-linearity CASE selects RCA class based on

129

CA rule 30 in the first stage and NCA based on rule 218 in the second stage. The last round operates with one-stage (the first-stage) only. When using RCA described earlier, initial state configuration is set up with corresponding key sub-block k: and the previous configuration by the plaintext/ (intermediate) ciphertext sub-block X . The 128-bit plaintext block P is partitioned into four 32-bit sub-blocks ( P I , . . . , p 4 ) . These four plain text sub-blocks are transformed into the four cipher text sub-blocks (q, . . . ,c4) under the control of 84 key sub-blocks for (16 and half rounds) complete operation. The pseudocode for encryption procedure is depicted below. The decryption process is essentially same as the encryption process, but uses the different key sub-blocks. Pseudocode for CASE-Encryption: //uppercase alphabetset represent 128-bit block and //corresponding lowercase alphabetsets specify their 32-bit sub-blocks. Input: P (pl1p2,p3,p4):plain text block, K: user selected key Output: C (cl,c2,c3,c4): cipher block IC (icl,ic2,ic3,ic4) :intermediate cipher block, al,..,a4,tl..t5 are 32-bit array Ek[r][i]:ithkey sub-block for rth round. begin Keyschedule(K); //Calling subroutine for key sub-block generation IC := P; For r := 1 to 16 do al:=RCA(Ek[r][l],icl);a2:=RCA(Ek[r] [2],ic2);// Stage1 a3:=RCA(Ek[r] [3],ic3);a4:=RCA(Ek[r] [4],ic4); t l := a1 €3 a3; t 2 := a2 @ a4; t 3 := NCA(Ek[r][5],t l ) ;t4 := NCA(t3, t2); t5 := NCA(t2,t3); //Stage2 icl := a1 @ t4; ic2 := a3 @ t4; ic3 := a2 @ t5; ic4 := a4 €3 t5; cl:= RCA(Ek[r][l],icl);c2:= RCA(Ek[r][2],ic3); //Last-half round c3:= RCA (Ek [r ] [31,ic2); c4:=RCA (Ek [r] [41 ,ic4); end.

3.2. K e y Schedule phase

User-selected key K of size (kw = 4/6/8wbit) 128-/ 192-/ 256-bits, is partitioned into kw sub-blocks which are directly used as the key sub-blocks ( k k l , . . . ,kkk,). Those key sub-blocks are used to derive the next sequences of key sub-blocks using a pseudo random sequence generator (PRG). This PRG is constructed by a one-dimensional periodic boundary, 128-cell CA (rule-45) to thwart against related key attacks. The decryption key sub-

130

blocks (Dk) are computed from encrypted key sub-blocks (Ek). The pseudocode for the proposed key schedule is presented below. Pseudocode for key-schedule phase: Input : K: key //size of key is kw (32-bit) words represented as k[l]..k[kw] Output: Ek[r][i],Dk[r][i] // ith key sub-blocks for rth round kk[i]: ith 32-bit subblock. begin for i := 1 t o kw do kk[i] := k[i]; j : = l ; i := kw 1; t:=l; while (i < 84) do //Total number of key sub-block = 84 (kk[i],..,kk[i+3]):= PRG(kkB],..,kk[j+3])//Call PRG using CA j:=j+4; i:=i+4; for r:= 1 to 16 do for i := 1 t o 5 do Ek[r][i]:= kk[t++]; for i := 1 to 4 do Ek[r][i]:= kk[t++]; //Last round key for r:= 2 to 16 do //Determination of Decryption Key Dk Dk [r][ 11 := Ek [ 18-r][ 11; Dk [r][4]):=Ek [ 18-r][4]; Dk[r][2] := Ek[l8-r][3]; Dk[r][3] := Ek[l8-r][2];Dk[r][5]=Ek[l7-r][5]; Dk [ 11[ 11:= Ek [ 171[ 11; Dk [ 11[2]:=Ek [ 171[2]; Dk [ 11[3] :=Ek [ 171[3]; Dk [ 11[4]:= Ek [ 171[4]; Dk [ 11[5]:= Ek [ 161[51; Dk [ 171[11 :=Ek [ 11[11; Dk[17][2] := Ek[l][2];Dk[17][3] := Ek[l][3]; Dk[17][4]):=Ek[l][4]; end.

+

4. Discussion

Diffusion:The purpose of diffusion on a cipher is to spread out redundancy of plain text all over the cipher i.e each bit of the plain text t o contribute as many cipher bits. CASE satisfies the diffusion requirement by the clever arrangement of 3 NCA at the second stage called mixing stage. I t is clear that after completion of second round each output (IC) bit depends on 104-plaintext bits. Confusion: Confusion makes the relationship between key and statistics of the cipher bits complex and non-linear. The RCA class based on rule 30 and 225, and NCA based on rule 218 for their non-linearity. Moreover, during a single round of CASE each output bit depends on 66 key-bits of the corresponding round key-sub-block which, makes us to achieve the desired level of confusion. Simulation Results: Webster and Tavares defined a most desirable property for an encryption scheme called strict avalanche criterion (SAC) in

131 According to this property, each ciphertext bit should change with a probability of 50% whenever a single bit of plaintext or key changes. The operations of CASE are verified using a customized C based simulator and the simulation result shows that changing of a random bit in the key or the plain text translates into a 63-bit or 64-bit change in the respective cipher text output. Implementation Analysis: Structure of the proposed scheme is well suited for both hardware and software implementations. Both the encryption/ decryption as well as key-schedule phase use only 3-neighborhood CA based components. Therefore, easily implementable in software. The advantage of RCA and NCA in CASE is their inherent parallelism that makes high-speed hardware implementation possible. Moreover unlike AES CASE does not use any S-Boxes that needs extra storage. Thus all constructions are very efficient to implement, especially on low-end devices, which have very small memory and computational power. The detail analysis have been omitted because of space constraints. 5,

5. Conclusions This paper presented a new CA based symmetric encryption scheme using one-dimensional 3-neighborhood RCA and NCA. This uses 128 bit blocks as input message with 1281 1921 256-bit key t o generate a cipher of size 128-bits to meet the AES specification. This scheme satisfies SAC (Strict Avalanche Criterion). The modularity, parallelism and local communications of RCA and NCA induce the low cost and high-speed implementation.

References 1. Bao, F.: Cryptanalysys of a partially known cellular automata cryptosystem. IEEE trans. On Comp. Vol. 53 N o 11 (2004) 1493-1497 2. Nandi,S., Kar,B.K., Pal chaudhuri, P.: Theory and Application of Cellular Automata in Cryptography. IEEE Trans. On Comp.Vo1 43. No 12 (1994)

1346-1357 3. Sen, S. et. al.: Cellular automata based cryptosystem (CAC) ACRI-2002 LNCS 2513 (2002) 303-314 4. Seredynski, M., Pienkosz, K., Bouvry, P.: Reversible Cellular Automata Based Encryption. Proc. of NPC-2004. LNCS-3222 (2004) 411-418 5. Webster, A.F., Travers, S.E.: On the design of S-Boxes. Advances in Cryptology Crypto 85. LNCS 218 (1985) 523-534 6. Wolfram, S.: A New kind of Science, Wolfram media Inc. (2002) 7. http://www.nist.gov/aes

This page intentionally left blank

Community Informatics

This page intentionally left blank

THINK! :TOWARDS HANDLING INTUITIVE AND NURTURED KNOWLEDGE DR. MURIYANKULANGARA V ANANTHAKRISHNAN and RAJENDRA TRIPATHI KReSIT, Indian Institute of Technology Bombay Mumbai 400 076, India manantha@&iitb.ac. in, rajt@it. iitb.ac.in ABSTRACT The immensely rich culture and heritage of a country needs to be stored for posterity. But very little has been done, especially for countries in Asia and the Far East, all of whom have significant artefacts even to this day. Unfortunately, modern society and development has had a telling effect on these ”priceless” assets of yore, so much so that they could become extinct even in our lifetimes. A system that could capture intuitive knowledge of yesterday, disseminate it today and nurture the developments of tomorrow is the need of the day. The authors conceive of an ICT-based system that could facilitate this stupendous effort and ensure perpetual value addition and enrichment.

Nature of Knowledge Knowledge Management (KM) is a much-hyped and talked about topic since the late 1990’s and every organisation around the world has been viewing this as a possible panacea to its battle against brain-drain! Further, with the proliferation of IT and IT-related disciplines across board, KM is looked at yet another logical application of ICT. It is here that organisations have flawed and that too beyond redemption. Hildreth and Kimble (1) argue that “the approach is flawed and some knowledge simply cannot be captured. A method is needed which recognises that knowledge resides in people, not in machines and documents”. They go on to say that “KM approaches have tended to concentrate on attempts to capture and control what is sometimes termed ‘structured knowledge”’. “What many software vendors tout as Knowledge Management systems are only existing information retrieval engines, groupware systems or document management systems with a new marketing tagline” (2). Conklin (3) uses the term formal and informal knowledge: formal is one that is found in books, 135

136

manuals and documents and that can be shared in seminars whereas informal knowledge is described as the knowledge that is applied in the process of creating formal knowledge. Yet another viewpoint is put up by Rulke et a1 (4), this time how an organisation views it viz., Transitive knowledge (organisation self-knowledge - knowing what you know) and resource knowledge (knowing who knows what). Seely Brown et a1 (5) use terms like ‘know-how’ and ‘know-what’: ‘know what’ is the explicit knowledge that may be shared by several people while ‘know-how’ is about competency, the particular ability to put know-how into practice. Leonard et al (6) describe knowledge as a continuum, ‘tacit’ at one end and ‘explicit’ or ‘codified’ at another. Unfortunately, most knowledge lies between these extremes. Tacit elements are subjective, experiential and created in the ‘here and now’ while explicit elements are objective, rational and in the ‘then and there’. Classification of Knowledge

According to Polanyi (7), “...all knowledge is either tacit or rooted in tacit knowledge. It is the kind of knowledge that cannot be articulated because it has become internalised in the unconscious mind’. Nonaka’s spiral model of knowledge (8) states that ‘knowledge creation and sharing become part of the culture of an organisation’. According to Nonaka, the ‘tacit’ to ‘explicit’ transition and development occurs regularly through repetitive cycles of Socialisation, Externalisation, Combination and Externalisation (Figure 1). EXPLlC IT

TACI T

V

m

m

internalisation

I

Learning by Dung Fig.I : Nonaka S Spiral OfKnowledge

137

Knowledge is socially constructed and any attempt to follow a cognitiverepresentational transition simply tends to make the soft knowledge hard. A possible solution to the dilemma lies in “Communities of Practice (Cop’s)” which can assist the creation and propagation of such knowledge. Davenport et a1 (9) have put up a possible solution by stating: “The more rich and tacit the knowledge is the more technology should be used to enable to share that knowledge directly”. Indigenous Knowledge (IK) and the global scenario According to the World Bank: (a) IK is local knowledge which is unique to a particular culture or society (b) IK is the basis for local-level decision-making in agriculture, healthcare, food preparation, education and natural resources management (c) IK is held by communities and provides problem-solving strategies for communities (d) IK is tacit knowledge and is difficult to codify and is embedded in community practices, institutions, relationships and rituals IK has its own unique advantages in that it provides problem-solving strategies for local communities (especially the poor) and represents an important contribution to global development knowledge. However, it is underutilised and runs the risk of becoming extinct. The next relevant question is: Who possess indigenous knowledge? According to the ILO (lo), indigenous communities consist of those people having a historical continuity with pre-invasion and pre-colonial societies that developed on their territories or part of them. Going a step further, Principle 22 of the Rio Declaration states that “Indigenous people and their communities and other local communities have a vital role in environmental management and development because of their knowledge and traditional practices. States should recognise and duly support their identity, culture and interests and enable their effective participation in sustainable development”. Last but not the least, indigenous knowledge is not confined to tribal groups or the inhabitants or the rural folk, but every community possesses it whether, rural or urban, settled or nomadic, original settlers or migrants. Chambers (1 1) settles for the term “rural people’s knowledge” as a more inclusive term. Rural communities in Africa have a detailed knowledge of their physical environment like soil characteristics, plant species and water requirements. Proper communication and documentation can result in an

138

excellent taxonomy of useful flora and fauna that could provide food, shelter, medicines and raw material for implements, animal diseases and their treatment and protection of endangered species. Conservation of water and marine life is yet another area that could be cared for using indigenous knowledge. Added to this is the “usurping” of land for building multipurpose dams all over Asia. This has resulted in farmers losing jobs. A proper integration of social and ecological concerns would result in the formulation of rehabilitation efforts with rural societies to restore damaged aquatic environments from hydroelectric projects (12). The development of aquaculture as an alternate employment for displaced rice farmers in Indonesia makes interesting reading. According to Dahlman and Westpahl (13), they assign the success to ‘technological mastery’ of the system i.e., ‘the autonomous ability to identify, select, and generate technological improvements and changes’. The forthcoming International conference on IK Systems (IKS) in Africa and their Relevance to Sustainable Development in November 2005 in Brussels (Belgium) will be the platform for experts worldwide to put their heads together to come out with feasible approaches to (a) Increasing the understanding of the role of IKS in Sustainable Development (b) Raising adequate awareness of the importance of IKS (c) Analysing the major constraints in the adaptation of IKS in promoting Sustainable Development Intuitive and Nurtured Knowledge Intuitive knowledge is one that is acquired only after using knowledge in “perceptually rich, dynamic situations” whereby implicit learning processes are elicited. Intuitive knowledge is difficult to verbalise and is based on “complex selection and inference processes, which are to a large extent, believed to be unconscious to the individual (14). Nurtured Knowledge, according to the authors, is the knowledge that is developed on one’s foundation of intuitive knowledge. This is very essential in order to ensure that the new knowledge is properly integrated with the existing knowledge of any individual, be it a child or an adult. This, in practice, means: (a) Assessing the learner’s social, cultural and family background (b) Analysing the learner’s cognitive, affective and psychomotor skills (Bloom’s Taxonomy) (c) Studying the learner’s habitat and its environment

139

(d) Understanding the means of livelihood (e) Identifying the Communities of Practice Our current educational systems and curricula focus on macro-level planning with very little flexibility for individual variations, other than the mother tongue. Thereby, the current system defeats the very meaning of indigenous knowledge (IK).and its appropriate transfer to modem society to result in gains all around. The current paper looks at “intuitive” and “nurtured” knowledge as the respective equivalents of “tacit” and “explicit”, “unstructured” and ‘‘structured’’ etc. THINK!: Towards Handling Intuitive and Nurtured Knowledge

Needles to say, intuitivehacit knowledge is available in abundance amongst the village/rural elders, ethnic groups, tribals and nomads. This knowledge is subsequently passed on from generation to generation through word of mouth. However, with the rapid urbanisation and conversion of rural areas to semiurban areas, there is a flight of the younger generation of the rural people to “greener” pastures. Further, the elders prefer to stay back in their villages to look after their lands and feel secure amidst their folks. This “generation” divide, however, could ultimately result in the loss of indigenous knowledge (IK). But, can it be prevented? Yes, it can, provided it is harnessed, organised and made available on public sites to (a) People residing in urbanhemi-urban areas (b) Academia to enrich their knowledgeiexperience (c) Libraries to retain them for posterity A mechanism is needed at this stage to make the above a reality. Although one can think in terms of a universal model to capture, organise and disseminate indigenous knowledge, it is worthwhile thinking in terms of a specific area/product/service. Starting with this, one could develop the model, starting with the first step and ending with the last. The current development will focus on agriculture as the area and rice/wheat as the specific crop. Harnessing IK

The steps involved in the “data/information” collection are the following: (i) (ii) (iii)

Identify crop Identify an area where the crop is grown Identify a specific village in the area

140

(iv) (v) ( 4 (vii) (viii) (W (XI

(xi) (xii) (xiii) (xiv) (xv> (xvi) (xvii)

Meet the Panchayat of the village Identify “champion” farmers Short-list oneltwo of the ”champion” farmers from Step (v) Dialogue with the farmers to collect vital data on factors that influence quality and quantity of crop Sift knowledge to separate those involving indigenous knowledge Look for the farmers’ reaction to new technology Find out the farmer’s adaptatiodtendency to integrate new technology Find out the farmer’s reasons for rejecting the new technology Discover the advantages of new technology adoption Identify problems in adopting new technology and causes for its rejection Issues in the transfer of IK to the younger generation Causes for the younger generation being averse to new technology Methods for increased retention of IK for posterity Understand current hazards in working and steps to minimiseleliminate them

It would not be out of place here to illustrate some of the stages mentioned above through specific instanceslexampleslscenarios. However, the selection for each stage is not specific to a crop but is more to define the type of information that could be collecteddiscussed (Table 1). Table 1: Stepwise harnessing of IK (Illustrative examples) Step

No. (ii)

I Identify an area where the crop

(iii) (vi) (vii)

I Illustrative example for gathering IK Kanpur District (UP)

is grown Identify a specific village in the area Short-list one/two of the ”champion” farmers

Bithoor

I 1 Dialogue with the farmers to 1

Mr. Rajman Yadav

collect vital data on factors that influence quality and quantity of crop

(viii)

1

0

Sift knowledge to separate those involving indigenous knowledge

0

I

Availability of water Water levels in the fields Soil conditions Tilling & re-sowing Reaping Anticipating weather conditions Making silos Making cots Baskets Roues Huts/tiles/roofs

141

Look for the farmers’ reaction to new technology

Find out the farmer’s adaptation/ tendency to integrate new technology

Find out the farmer’s reasons for rejecting the new technology

Discover the advantages of new technology adoption

Issues in the transfer of IK to the younder generatiion Causes for the younger generation being averse to IK

Methods for increased retention of IK for posterity

Understand current hazards in working and steps to minimise/eliminate them

Water prospecting & digging wells Herbal insecticides Detergents, soaps, cosmetics etc tend to pollute the water that are released to the fields or percolate through the subsoil into the wells Utility products made of non-biodegradable materials tend to “poison” the environment Threshers Pumps Tractors Mechanised methods of making plates/glasses from leaves and clay Post sales support Availability of resources (power/fuel/repairs) Polluting nature of equipment Redundant manpower (a social issue!) Peoole tend to become lazv! Efficiency Safety Higher Productivity Reduced health hazards Aesthetic Mobilitv Transfer through observations, demonstrations, do’s and don’ts, preparing materials, managing animals They prefer to move to cities Want to get rich with minimal effort (city vis-a-vis village) Carried away by the fast life and glamour Easygoing Addressing village panchayats, attending fairs “Honoured farmers to address across villages Arranging village level seminars/contact programmes Agricultural Institutes to organise visits to villages and facilitate interaction with farmers Helping farmers publish their work in relevant magazines, broadcast through radio and TV Injuries while at work and corresponding treatment (traditional/modern) Climatic influences on health and work Technology related issues (new equipment) and their correction/maintenance

142

Going about harnessing IK: The ToolsAWethodologies This can be best described by looking at each of the ICT media available today and assessing its possibilities in helping harness IK from the various indigenous “pandits” across the country and putting it in a central repository. Table 2: IK Harnessing methodologies vs. ICT Technologies Methodology Onsite interviews with IK owners IK owners soeakine from radio stationsibroadcast centres

t Telephone (Fixed) Mobile Telephony

Television

I

Videoconferencing Speech-to-text

pomprehensive Portal

Telecons with IK owners can be organised based on soecified agreed schedules Interviews Online transmission 0 Direct relay from the field to classrooms, expert panels Canned programmes for future use Newsgathering Live discussions, field reports, live demos Conversion to text (in more than one language) of the IK owners’ knowledge The kiosk owner becomes an agent to gather 0 The IK owner’s expertise Disseminate the knowledge on demand 0 Promote the sharing of knowledge

I

Provide end-to-end features for Transfer & storage of IK Access to IK Updates on IK Debates on IK

Acknowledgements The authors wish to thank Prof. Kirthi Ramamritham, Head, KReSIT, Indian Institute of Technology, Mumbai (India), for the opportunities provided to experiment with and come out with the adaptation of ICT technologies to various activities in the rural environment like education, culture, agriculture and healthcare. References Hildreth P J & Kimble C

T h e duality of knowledge, Information Research, 8(1), Paper No. 1 4 2 , 2 0 0 0

143 Offsey S

Knowledge Management: Linking people to knowledge for bottom-line results, Journal of Knowledge Manage-ment, l(20, 113-112, 1997

Cocklin E J

Designing organisational memory: Preserving intellectual assets in a knowledge economy, (httn:/icopnexus.orrr/doin.~df),1996

Rulke D, Zaheer S

Transactive Knowledge & performance in the retail food industry, New York University Stem School of Business, New York, 1998

Seely Brown J & Duguid P

Organising Knowledge, California Review, 40(3), 90-1 1 1 , 1998

Leonard D & Sensiper S

The role of tacit knowledge in group innovation, California Management Review, 40(3), 112-132, 1998

Polanyi M

The tacit dimension, Routledge & Kegan Paul, London, 1967

Nonaka I

The knowledge creating company, Harvard Business Review, 69(Nov-Dec), 96-104, 1991

Davenport T & Prusak L

Working knowledge. How organisations manage what they know, Harvard Business School Press, Cambridge, 1998

ILO

Indigenous and Tribal Peoples Convention, 1989

Chambers R

Rural Development: Putting the last first, Longmans, London, 1983

Costa Pierce B A

From farmers to fishes: Developing reservoirs aquaculture for people displaced by dams, Technical Paper No. 369, World Bank, Washington, 1997

Dahlman C J & Westpahl L E

The meaning of technological mastery in relation to the transfer of technology, Annals of American Academy of Political Science, 458, 12-25, 1981

Swaak J, Van Joolingen W R

Supporting simulation-based learning; The effects of model progression and assignments on definitional and intuitive knowledge, Learning & Instruction, 8(3), 235-252, 1998

Management

LOKVANI- An e-ffort to empower Citizen Amod Kumar', Amarpal Singh2, Shweta Puneet3 , Amit Shukla3 District Magistrate, Collectorate, Sitapur-261001,India Amodkumar7 [email protected] http://www.sitapur.nic.in District Informatics Oficer, Collectorate, Sitapur-261001, India [email protected] Amity Business School, Amity Campus, Sector-44, Noida-201303, India shwetau~abs.arnitv.cd~i,anii ts3COabs.ainitv.cdu

Abstract. Information and Communication Technology has made paradigm

shift in administration of developing nations. India's National e-Governance Plan has given administrative and monetary support for development of eGovernance portal. This has made flow possible seamless flow of information between governments, citizens and other organizations. One successful paradigm -the public-private partnership project Lokvani had its inception on November 9, 2004 in the district of Sitapur in Uttar Pradesh, India. It is a single window, self sustainable e-governance solution providing transparent, accountable and responsive administration for grievance handling; land record maintenance as well as an eclectic mixture of essential services. Lokvani caters to the needs of three major players resulting in a symbiotic and mutually beneficial relationship among its benefactors. The e-Governance portal has been appreciated by various newspapedagencies and has been showcased as one of the most successful, popular and influential e-governance solution implemented in India.

1 Introduction E-Governance is defined as the use of information technology for the purpose of enabling and improving the efficiency with which government services and information are provided to the citizens, employees, businesses, and other government agencies (McClure 2001). E-Governance supports good governance, which in turn promotes progress in developing countries. Another key benefit is that the democratic, business and governmental aspects of governance are simplified and improved. This further leads to a reduction in costs with provisions of better services to citizens and businesses. Hence numerous governments are in the process of modeling innovative and efficient models of electronic governance. This also leads to a reduction of multiple and redundant points of contact between government and citizens (Luck & rubin 1989; Harman 2001; Kulkarni 2001; Thomas 2001). Lokvani is a public-private partnership E-Governance program which has been initiated with the combined efforts of both the district administration as well as the Na-

144

145

tional Informatics Centre as in district of Sitapur which has an 88% rural population with 38.86% literacy rate (Times of India, 2005; Pioneer 2005). It is an outstanding example of a highly cost-efficient, economically self-reliant and user financed community network. This solution in targeted at 3.6 million citizens residing within the district, located in the province of Uttar Pradesh, which is the world’s fifth largest political entity in terms of population. Lokvuni has been projected as a commitment to the people in providing them with transparent, credible and accountable systems of governance. This system is grounded in the rule of law, encompassing civil, political as well as economic and social rights underpinned by accountable and efficient public administration for multiphase development of rural people. The primary objective of the IT solution is to bridge the digital divide and “connect” the common man to the strategy makers in a seamless fashion. This paper provides information about the implementation of the Lokvuni project, its services offered and the impact resulting from the implementation of the solution. The paper begins with a discussion on the related work in the field of E-Governance. Section 3 discusses the problems with the traditional system prior to the implementation of the project. This is followed by section 4 which describes the operational model of the Lokvuni system. Section 5 deals with the description of numerous services provided by the Lokvani system. Section 6 and 7 discusses the benefits and impact of the Lokvani system. Finally the paper concludes with the advantages of such a system.

2 Related work There are presently several initiatives in order to bridge the digital divide (Nanthikesan, 2000) in developing countries. A study (Marwah, 2004) has focused on analyzing the likely impact of ICT application and has assessed the information and communication needs of poor households in Thailand and Nepal and has made a comparison with the findings of a similar study conducted in India in 2000 (Marwah 2000). For example, in the north-eastern states of India, the federal government has set up satellite-based Internet connectivity in the form of Community Information Centers (CICs) (Gogio 2004)for low economic development areas. Each CIC is equipped with a satellite, interface unit, hub, server, modem, UPS, generators, and several computers. The primary purpose of this project is to provide Internet services namely email, remotely administered lectures, online courses, e-medicine, weather information and employment notifications. The users pay only a nominal fee which helps support the operating expenses and sustain the project. The governments of Andhra Pradesh and Kerala (provinces of India) have also offered their citizens with convenience in all transactions relating to the government via one-stop services termed “eSeva” in Andhra Pradesh and “FRIENDS” (fast, reliable, instant, efficient network for disbursement of service) in Kerala (Bedi & Srivastva 2002; Aruna 2003; Rahdakumari 2004).

146

Gyandoot (Rajora 2002) is a project initiated by the state government of Madhya Pradesh, India in order to set up an intranet system which connects rural cyber kiosks throughout the Dhar district of India. In the Gyandoot network, each kiosk possesses a computer system, installed at a total cost of $1500 per kiosk. This intranet provides a variety of local data, including information about the district, current market prices for crops and livestock, weather reports, and a village newspaper. E-government services include access to a host of local government services: driving license applications, registration of births and deaths, application for income, caste and domicile certificates, and public complaints. These services’ prices range from INR 5 to INR 10. Other offerings include bus and railway timetables, a telephone directory, public safety information, and job vacancies. However, Gyandoot cannot be yet considered as a success due to the fact that in spite of two years of existence, the usage of the system has remained far below acceptable levels. Moreover, it does not have a clear model of financial sustainability. The kiosks are susceptible to declining use resulting in a declining income which finally undermines their sustainability. On the other hand, in Lokvani system, kiosk centers are set up in the existing cyber cafes and computer training institutes. This has ensured the financial viability and long-term sustainability of the kiosks due to an alternate source of stable income.

3 Issues with conventional mechanism of Administration The conventional administrative mechanisms are constrained by their inability to reach out to most of the citizens. The only way out of this issue is to use technology effectively to make governance more approachable for citizens. Government offices have traditionally been the only dependable source of information regarding various schemes. As a result, the process of obtaining the birth, death, domicile, caste, income and other certificates, has been marred with red tapism and bureaucratic hassles. Citizens have had no other option other than personally visiting the District headquarter/ Tehsil for government related work and/or getting their grievances addressed. In Sitapur, the citizens had been allotted a specified time span of two hours within which they could interact with the District Manager. This caused a great strain on the system because of which very few grievances could be addressed satisfactorily. Generally district administration officials in India have a very large population base under their purview. Hence, without the deployment of Information and Communication Technology, the task of managing the services becomes insurmountable. But prior to Lokvani system the infrastructure was abysmally inadequate due to a very limited number of computer systems and non existential computer networks. Another factor contributing to the perceived inefficiencies in governance has been the inability of the administration to effectively monitor the government officers. Again, ICT can be a tool to address this drawback.

147

4 Operational Model of Lokvani "Lokvani" in hindi means the voice of the people. The Lokvani model has been formulated keeping in mind the three key stakeholders - (a) the citizens, (b) the government and (c) the IT entrepreneursKiosk operators. Essentially, since the IT literacy (and also any form of literacy) is very low in Sitapur, hence the kiosks form an interface between the IT enabled government and the IT illiterate citizens (ref Error! Reference source not found.). 4.1 Key Players

The key players in the model are as follows:

Government. A society by the name of Lokvuni was constituted at the district level to implement the project autonomously to reduce some of the bureaucratic pressures. All the financial decisions have been under the purview of the society itself. The rationale for such a framework is that the budgets of small districts have a limited scope for extra expenditure and the process of getting finance is a long drawn out one and complicated one. The Lokvuni society meets its recurring expenses from the money received from the registration of kiosks, short term and lifetime membership fees. The initial costs for setting up the society were also negligible as the hosting services was provided free of cost by the National Informatics Centre. IT entrepreneurs / Kiosk owners. The key achievement of the solution is its selfsustainability and long-term financial viability. The conversion of existing cyber cafeskomputer training institutes into Lokvuni Centers was a key factor driving the financial success. This step ensured that extraneous capital was not a vital requirement for the solution. The society signed contracts with existing kiosk owners for the purpose of registering them as Lokvuni franchisees with only a nominal annual fee of INR 1,000. The kiosks are run by IT entrepreneurs. A typical kiosk has an internet enabled PC, a printer and a webcam. It also has a CD ROM drive. Some kiosks also have a power backup (typically, power is available 5 hours a day). Kiosks earn profits through the income generated from various services of Lokvuni provided to the citizens. In addition, the kiosks can also generate some extra revenue by providing disparate facilities like computer education, computer typing, digital photography, internet access resulting in cross sales.

148

Citizens. The citizens are the reason as well the force behind the solution. Although, the kiosk owners and the government seem to be the driving force, it is the citizen who finally is the target audience. The citizens form the customer base for which the model has been designed. The citizens save tremendous amount of opportunity cost and effort in getting government services. In an economy riddled with poverty, it is an enormous burden on the citizens to forego daily wages to obtain government services.

4.2 Geographical spread There are 42 uniformly distributed kiosk centers at the block and tehsil level. A black dot in Figure Figure 2 represents the location of a kiosk centre in Sitapur. More than one kiosk can be situated at the same place.

SITAPUR

Figure

N

: Layout of Lokvani Kiosks with Black Dots representing the Kiosk location

5 Services offered by Lokvani Lokvani system has empowered the citizens by generating awareness towards their rights through a seamless flow of information. It is an outstanding manifestation of the “right to information”. The services offered by Lokvani encompass a wide range of government departments (Department of Public Grievances, District Administration, Development Department and Department of Land and revenue). The services offered by Lokvani are (a) Grievance and Petitions, (b) land records, (c) tender service, (d)

149 employment services, (e) information related to government schemes and ( f ) information about government services.

5.1 Lokvani Grievance and Petitions Services This is the most popularly used service of the Lokvani system as of now. This service allows citizens to register and then track the status of their petition via a nearby Kiosk centre. The complaint is then transferred to designated officials, who can read but cannot modify it. It has many unique features including one which enables the citizens to follow up on their complaint on the move with the help of a mobile phone.

5.2 Land Record System Information about the type of land, list of villages and details regarding the allotment of land in villages is available online in the local language. Individuals can view the land records through just a nominal payment. In the case that the information regarding a particular land record is not available online at the kiosk centre, the applicant gets to receive it within a stipulated period of 5 days by speed post.

5.3 Tender Services Notices regarding the tenders and their terms and conditions are published under the Lokvani Tender Service. The forms are also available for download. Interested contractors can send the filled tender forms through speed post to the concerned offices. Results and comparative charts of all bids are displayed on Internet within 24 hours of allotment.

5.4 Employment Services The Lokvani system provides information on all vacancies in the district as well as downloadable application forms for job seekers. Detailed information regarding the financial help provided by the government under various self-employment schemes is also available.

5.5 Information relating to Government Schemes The data of various schemes hnded by the Central and State Governments through various Developments and Social Welfare Departments is accessible via Lokvani. Application forms for social schemes like Old Age Pension Scheme, National Family Benefit Scheme, Professional and Vocational Education, Loan for the Physically Handicapped, Loan for the Development of Small Scale/Handicraft/Cottage industry

150 are made available for download. Citizens can down load these forms and submit them through traditional methods.

5.6 Information about Development Works Lokvuni also provides a list of developmental programs which are running under various departments like Educational Department, Jal Nigam, Electricity Department, Food and Civil Supply Department, Social Welfare Department, Public Works Department, Revenue Department and other Development Departments. It also provides information about the people who have received employment under the National Food for Work Scheme and allotees of homes under the Indira Awas Yojana. Information about the development work done under various schemes like National Food for Work Scheme, Member of Parliament Development Scheme, and Member of Legislative Assembly Development Scheme is also available online. Detailed information about the food allocated to Kotedars and other agencies is also freely available.

5.7 Single Window System Lokvani Single window system deals with the filing of application for Birth, Death, Income, and Domicile Certificates at the Kiosk centers. These certificates are received after the due completion of the verification process. The system has been introduced on a trial run basis.

6 Benefit of the Lokvani System Some of the key benefactors of the Lokvuni system are.

6.1 Citizens

Citizens can easily obtain pertinent information fiom the kiosks that are conveniently located in every block and in a few villages as well. Unlike the traditional method, people are not required to visit the district/ tehsil headquarter and as a result save on precious time, money and effort. Filing of grievances and their follow up has also been streamlined. Citizens can receive disposal reports and can track the status of their filed complaint without personally visiting any government officer. Computerization of land records has precluded the dependence on the Lekhpal for furnishing the official documents. Citizens can access information about various government schemes and their preconditions through kiosks. They can also obtain the list of persons who are benefiting under various schemes. A complaint can be filed against the concerned officer in the case of any discrepancy. For example, if a person benefit-

151

ing from the Food for Work, Indra Awas Yojana, Mid Day Meal or Prime Minister Gram Sadak Yojana does not fully fulfill the prescribed criteria specified by the government. Moreover teachers can view their GPF account detail at any kiosk. Previously, they had to wait weeks end for their account details. Online tender services have significantly reduced the preexisting monopoly of some influential contractors. Results and comparative charts of all bids are displayed on Lokvani within 24 hours of the allotment. This has drastically abated the likelihood of illegal negotiations after the allocation of tender.

6.2 Government Before the implementation of Lokvani, there existed no easy method of checking the time taken by an officer for solving his assigned complaints. This encourages a lackadaisical approach and dereliction in solving problems. However in the new system, the officer is assigned a fixed period of 15 days within which he has to redress the issue. This strict schedule has dramatically increased the efficiency and accountability of officers. Moreover the District Magistrate and the citizens can access the progress report of the work by any officer. Transparency brought about by the easy availability of information on land records has reduced the possibility of land scams.

6.3 Kiosks Kiosk operators are earning extra money besides their regular income, without any extra investment. This has caused the number of registrations to climb up drastically. Apart from this their earnings from their staple business also went up. Hence each and every one of the level benefits from Lokvani model. This is a highly desirable feature which although quite hard to achieve, can be single handedly attributed to the success of any e-governance solution. Until now, we have always had a very liberal policy regarding copyright. However, we have now had to introduce a copyright form, which we ask contributing authors to complete and sign. (It is sufficient if one author from each contribution signs the form on behalf of all the other authors.) The copyright form is located on our Web page at http://www.springer.de/comp/lncs/copyrigh.html. The printed form should be completed and signed and sent on to the volume editors either by normal mail or by fax, who then send it on to us, together with the printed manuscript.

7 Conclusion Lokvani system is a self-sustainable solution that efficiently and effectively caters to the needs of its three key stakeholders that are (a) the citizens, (b) the administration and (c) the kiosks operators/ the IT entrepreneurs.

152 The self-sustainability of Lokvani is derived from the innovative operational model wherein existing cyber cafes are being leveraged to act as franchisees of the system. Such a strategy avoids upfront investments from the government to set up such kiosks as well as avoids any operational support from the government. This is possible as the system generates its own funds from the citizens and contributes to the earnings of the kiosk operators. Another dimension of sustainability is provided by the transparency created by the system which has allowed the press to take up issues related to administrative efficacy. This has created tremendous citizen pressure which will ensure that the system continues to exist even when the champions of the system move out. The citizens are obviously the key beneficiaries from the solution as is evident from the number of petitions filed per day and the efficiency with which they are now being resolved. The system has also made government more approachable for the ordinary citizen who had to earlier forego wages to stand in a queue to merely file in a petition. The citizen would then have to follow it up with countless visits to various government offices to find out the status and outcome of the petition that was filed. Clearly, with the Lokvani system, the citizen is able to put his or her time to a more productive use which definitely adds up to the local GDP. The Lokvani project has successfully showcased a paradigm that can be incorporated by other administrative organizations. It has also proved that a low literacy rate and financial constraints is not a barrier for implementing a successful e-Governance project. Sitapur has a negligible computer literacy rate. However this did not prove to be a deterrent to the project as the existing kiosks were used as an interface between the systems and the citizens. A key learning from this solution is that word of mouth is the most effective and efficient mechanism for generating awareness among rural and semi-urban citizens that have low literacy rates. Change management was identified as the toughest task for this project. Attitudes of officers towards learning new technology have not been up to the desired levels. Therefore, it is necessary to have a programmatic approach to change management in order to ensure an effective implementation of the system. In order to make the solution robust and scalable, the solution is now being migrated to Oracle 10G. Lokvani has brought about, a well appreciated transparency to the workings of the administration. People were unaware of their rights as well the possible support provided by the government under various schemes. Widespread awareness as well as a stronger formulated public opinion can go a long way in the fight against corruption. The success of any project is gauged by the extent of which it achieves its predefined goals. Going by this metrics, the Lokwani system has proved to be a,major success in the field of e-governance. It has not only met the expectations but surpassed them in every imaginable way.

153

References I.

2. 3.

4.

5. 6. 7. 8.

9. 10. 11. 12. 13. 14. 15. 16.

17. 18.

McClure, David: Electronic Government: Challenges Must Be Addressed with Effective Leadership and Management(200 1) Trinkle, S: Moving Citizens from in Line to Online: How the Internet is Changing How Government Serves its Citizens. September 30(2001) Rajora, Rajesh: Bridging the Digital Divide: Gyandoot - the Model for Community Networks. New Delhi, India: Tata McGraw-Hi11(2002) Harman, C: Knowledge, E-government and the citizen, Knowledge Management Review, Vol. 4, No. 3: pp 3-lS(2001) Kulkarni, V: Emerging E-Karnataka. Siliconindia, EBSCO Publication, Vol. 5, No. 6: pp 80-82(2001) Luck,D. J. and Rubin, S: Marketing Research. New Delhi. Prentice Hall of India, pp. 780-786(1989) Thomas, B.R: Electronic Governance and Electronic Democracy. Living and Working in the Connected World. Australia, Riley Information Services Inc, pp. 3844(2001) Aruna, S: Kerala’s ambitious mission-Survey of Indian Industries”, The Hindu Publication, Vol. No. 1 pp 255-257(2002) Bedi, K. and Srivatsava, S: Govemment@net - New governance, Opportunities for India”, Sage Publications, pp. 14-25(2002) Radhakumari, Ch. (2004): Information Technology & its role in the society-A case study of two southern states of India. Proceedings of the International Conference on E-Governance. Colombo, Sri Lanka(2004) Marwah, R: Impact of Information Technology on the Poor in India. Background paper in Pigato, M, World Bank; ICT, Poverty and Development in Sub-Saharan Akica and South Asia(2000) Manvah, Reena: Information and Communication Technology, Poverty and Development in Thailand and Nepal (Comparison with India). Proceedings of Asia Fellows Annual conference, Bankok, August(2004) Nanthikesan, S.: Trends in Digital Divide. Harvard Center for Population and Development Studies, November 2000. Cecchini, S.: Back to Office Report: Information and Communications Technology for Poverty Reduction in Rural India, Mimeo, World Bank, Washington, DC(2001) Best, M.L. and. Maclay, C.M: Community Internet Access in Rural Areas: Solving the Economic Sustainability Puzzle in The Global Information Technology. Report 2001-2002: Readiness for the Networked World, Oxford University Press, 2002. Times of India : e-Governance: Sitapur shows the way. Page no. 1, February, (2005) PCQUEST: Strategic Framework for effective e-Governance”, Page No. 68, February (2005) Subir Hari Singh: Ways and Means of Bridging the Gap between Developed and Developing Countries. High-Level Panel on Information Technology and Public Administration a t United Nations, N e w York (2000)

CONTENT BASED IMAGE RETRIEVAL USING SIMPLE FEATURES -A SURVEY M.V. SUDHAMANI , DR. PRASHANTH KOULGI Dept of ISE, Dept ofE&C, Siddaganga Institute of Technology, Tumkur. DR.C.R.VENUGOPAL Professor, Dept.ofE&C, S.J. College of Engg. Mysore. Visveswaraiah Technological Universiv. Belgaum, Karnataka, India ,

Email: mvsitdha raiCa7,hofmail.com The emergence of multimedia technology and the rapidly expanding image and video collections on the Internet have attracted significant research efforts in providing tools for effective retrieval and management of Visual data. Image retrieval is based on the availability of a representation scheme of Image content. Image content descriptors may be visual features such as color, texture, shape, and spatial relationships, or semantic primitives. Content- based image retrieval (CBIR) is an automatic or semi-automatic retrieval of image data from an imagery database based on semantic similarity between the imagery content. In order to ensure the semantic retrieval, certain level of knowledge must be incorporated into CBIR system. This paper describes the contentBased Image Retrieval (CBIR) methodology that retrieves images on the basis of automatically derived features present in the images. The Objective is to build useful descriptive information retrieval model using content based information retrieval techniques that are both efficient and understandable for retrieving images from the image databases. Fundamental techniques for content-based image retrieval have been discussed including visual content description, similarity or distance measures, indexing scheme, user interaction and system performance evaluation. This paper proposes the research work that can be carried out by selecting best-suited techniques or modified techniques of CBIR. General visual features most widely used in content-based image retrieval are color, texture, shape, and spatial information. Color is usually represented by the color histogram, color correlogram, color coherence vector, and color moment under a certain color space. Texture can be represented by Tamura feature, Wold decomposition, SAR model, Gabor and Wavelet transformation. Shape can be represented by moment invariants, turning angles, Fourier descriptors, circularity, eccentricity, and major axis orientation and radon transform. A 2D string usually represents the spatial relationship between regions or objects. Several systems have been proposed in recent years in the framework of content-based retrieval of still images. Although some characteristics are common we have to make this work different from all those based on a number of different approaches mainly differing in terms of number and type of extracted features, Degree of automation and domain independence, Feature extraction algorithms, Indexing methods, Processing complexity in database population, and Query.

154

Design and Development of a Data Mining System for Superstore Business S.M.Shamimu1 Hasan', Dr.Indrani Haque2 Lecturer, Department of Computer Science, Stamford University, Bangladesh. [email protected]

* Associate Professor, School of Engineering and Computer Science, Independent University Bangladesh. [email protected]

Abstract: Data mining allows extracting diamonds of knowledge from historical data and predicting outcomes of future situations. In developed countries people are relying more and more on data mining to find business solutions. It is also possible for us in developing countries to be benefited from the power of data mining technology. The goal of this paper is to apply data mining theory and principles in a typical business environment. We describe a data mining system specially designed for superstore business. This system allows the user to effectively build multidimensional data model named data cube and browsing it, discover different types of interesting association among products and also searching in natural language technology. It has been shown, how data mining technology could be useful specifically in superstore business.

1 Introduction The process of analyzing data from different perspectives and summarizing it into useful information is commonly known as data mining. Although data mining is a relatively new term, the technology is not. Data mining grew as a direct consequence of availability of a large reservoir of data. Computerized data collection had already started in the 1960s allowing for retrospective data analysis. Advent of relational database technology and structured query languages in the 1980s are allowed for ondemand dynamic data analysis. However, the revolutionary advancement in computer storage technology and the massive growth of stored data in the 1990s gave rise to the concepts of data warehousing and data mining. This paper describes a data mining system specially designed for superstore business. This system allows the user to effectively build multidimensional data model named data cube and browsing it, discover different types of interesting association among products and also searching in natural language technology. 155

156

2 Literature Review Data mining, the extraction of hidden predictive information from large database is a powerful new technology with great potential to help companies on the most important information in their data warehouses [ 11. A data warehouse refers to a database that is maintained separately from an organization’s operational databases. It consolidates large amount of data, perhaps collected from multiple sources such as online transaction processing (OLTP) system, flat file etc, over a long periods of time [2]. On-Line Analytical Processing (OLAP) is a part of the data warehouse technology that enables users to examine warehouse data interactively [3]. Data warehouse and OLAP tools are based on multidimensional data model. This model views in the form of a data cube [4]. 2.1 DataCube

To facilitate analyses and visualization, data in a data warehouse are typically organized by a multidimensional model which is often referred to as an OLAP cube or data cube [5]. It is defined by dimensions and facts. Extremely fast retrieval of data is possible in data cube. We usually visualize a cube as a 3-D geometric structure but in data warehousing the data cube is n-dimensional. Dimensions are the entities with respect to which an organization wants to keep records. Each dimension may have a table associated with it, called a dimension table. A multidimensional data model is typically organized around a central theme this theme is represented by a fact table [4] Fact table is a central table in a data warehouse schema that contains numerical measures and keys relating facts to dimension tables. Figure 1 shows three dimensional cube “Sales”. It contains the sales data of a superstore collected from different locations over the past years. ‘Location’, ‘Product’ and ‘Year’ are the three dimensions. The fact or measure displayed is total-sales. The measurement in the cell marked by (“Uttara”, “Foods”, “2000”) indicates that total sales value of foods in uttara in 2000 is 45,000 Taka. Dhanmondl I I I All From data cube it is possible to Clothes ... . Foods p111 retrieve a value for each measure at each possible Product hierarchy level for each Figure 1: A 3-D Sales Data Cube. dimension.

I

I

I

I

I

2.2 OLAP Operations Typical OLAP operations include: - Roll-up: to increase the aggregate level to view data at a higher level of summarization. Data aggregation is the process of taking numerous records and collapsing them into a single summary record. Drill-down: to decrease the aggregate level to view data in more detail. Slice: to select and project

157

on one dimension of a data cube. Dice: to select and project on two or more dimensions of a data cube. Pivot: to re-orient the multidimensional view of data [ 3 ] .

2.3 Association Analysis Association analysis finds interesting association or correlation among a large set of data items. It is generally used in ‘market basket analysis’. For example, which items are commonly purchased at the same time? Help retailers organize customer incentive schemes and store layouts more efficiently. Following is a bread and butter example: 500,000 transactions 20,000 transactions contain bread (4%) 30,000 transactions contain butter (6%) 10,000transactions contain both bread and butter (2%) Support: Support measures how often items occur together, as a percentage of the total transactions. Support can be expressed as [4] #-of -tuples - containing -both - A - and - B support ( A s B ) = (1)

total -# - of

- tuples

where A and B are sets of items and ‘#’ means number. In the bread and butter example, butter and bread occur together 2% of the time (10,000/500,000). Confidence: Confidence measures how much a particular item is dependent on another. Confidence can be expressed as [4] #-of -tuples -containing -both - A - and - B confidence (A-B) = (2) #-of -tuples - containing - A Because 20,000 transactions contain bread and 10,000 also contain butter, So it can be said thatWhen people buy bread they also buy butter 50% of the time. The inverse one, which would be stated as: When people buy butter they also buy bread 1/3 of the time has a confidence of 33.33% (computed as 10,000/30,000).

3 Data Mining in Context of Superstore Business in Developing World Presently data mining technology is widely used in the developed countries. A number of standardized data miming software are now available free on the Internet. It is possible for us in Developing countries to be benefited from the power of data mining technology. A number of organizations in these countries are now using different types of database software for there daily transaction processing and has over the time accumulated a huge amount of raw data. But unfortunately the existing data analysis methods are quite unsatisfactory. So it can be said that we are data rich but information poor. Data mining technology is suitable to apply in these databases to get out the hidden patterns. Superstore Business is one of the appropriate areas to apply data mining technology. Data mining can perform the task like market basket analysis, analysis of the effectiveness of sales campaigns, customer segmentation for superstore business.

158

4 Implementation For implementation purpose we used Microsoft SQL Server 2000 Analysis Services as a data mining engine. It is specifically designed to create and maintain multidimensional data structures and provide multidimensional data in response to client queries [ 6 ] .The analysis manager application communicates with the analysis server by using a layer named decision support objects (DSO). DSO is a set of programming instructions that allows an application to control analysis services. The front-end of our system is developed in Microsoft Visual Basic 6.0 which is a powerful and complete windows application development system. In this system we used English query that helps users to find cube information using plain English instead of a formal query language. For OLAP analysis it is required to create a cube first. In our system cube is created programmatically by using DSO.

4.1 Building Data Warehouse The cube created in our system is based on Microsoft Access warehouse database which contain a superstore’s inventory information. This database has 1,53527 records .The figure 2 shows the structure of the warehouse database, which consists of a single fact table: FactTableFinal. The FactTableFinal has four dimension keysEmployeeID, ProductID, CustomerID, TimeID and two measures Amount and Quantity. The EmployeeID, CustomerID, TimeID key is joined to a single dimension table but the ProductID key is joined to a chain of dimension tables.

Figure 2: Snapshot of Data Warehouse Structure.

4.2 Create Cube Option “Creation Cube” form of our system is specially designed to create a cube in the analysis server. This option uses warehouse data for that purpose. The name of this cube is “Sales Cube”. For modeling this cube we used snow-flake schema [4]. After creation of this cube our system processes it.

159

4.3 Browse Cube Data In our system the cube data is browsed by using Microsoft Excel’s PivotTable Report. PivotTable report is a Microsoft office component. It is capable to work with data which are stored outside the Excel and quickly combines and compares large amounts of data. The form in figure 3 is prepared by embedding Microsoft Excel’s PivotTable Report and Microsoft Excel Pivotchart Report to a Microsoft Visual Basic 6.0 form. This PivotTable Report and PivotTable Chart are connected to the “Sales cube” in Analysis Server. In PivotTable report it is possible to browse the cube data and user can also take a printout of it. 4.4 Drill Down to Member Children in a PivotTable Report Much of the benefit of working with an OLAP cube is the ability to drill down to view member’s details and zoom back to see the big picture. The Excel PivotTable report allows navigating members, either one at a time or all together. If any one doubleclicks the “Year” label then a sublevel that are children of “Year” appear. Double click again to hide the children. The root of the “Year” is time dimension. The dropdown list for a dimension on the row or column axis or page axis contains a hierarchy of members and each member has a check box. It is possible to select multiple members from a dimension. Click the drop-down arrow next to the “Year” (Figure:4) and expand show all, 2000, Q4, December, and check Friday and also expand 2001, Q1, January and check Friday. Similarly expand the employee drop down menu and check Counter-1 and click Ok then a corresponding PivotTable Report will be generated.

Figure 3: Snapshot of Drill down a Member

Figure 4: Dropdown List in PivotTable Report. Which shows the total amount and quantity sold of all products, to all customers in year 2000’s December’s Friday and year 2001’s January’s Friday in Counter-1. PivotTable Report also allows dragging a dimension to or from page axis to column or row axis.

160

4.5 Pivotchart Report A Pivotchart report provides a graphical representation of the data. Users can change the layout and data displayed in a Pivotchart report just as in a PivotTable report. Bar chart in figure 5 shows a comparison of total amount of food and non food products sold by all the three counters of a superstore in year 2000 and 2001. If a user just want to know the position in 2000 then from the year drop down list he just check the 2000 and the report is automatically modified with year 2000's data.

Figure 5 : Snapshot of Bar Chart.

4.6 Association Rule There is Association rule builder in our system which allows the user to build association rule based on different types of products and also present it in the "Result" section (Figure: 6).

4.7 Searching in Natural Language System Searching in Natural language system (English Query) is a very recent technology. In English query system, it allows end-users to ask questions in English, rather than in a

161 computer language such as SQL [7]. Using SQL Analysis Server and Microsoft SQL Server, a detailed analysis and design of this system was done [7].

4.7.1 Phrasing and sentence structure of English query Phrasings are a way of expressing relationships among entities [7]. The types of phrasings and their sentence structures in English Query are used, as shown in table-1. Figure 7 shows the relationship between entity and Adjective phrasing and the product information retrieve using this phrasing and with the help of question "What are the three most popular products?'

Table 1: Phrasings and Sentence Structures in English Query. ~~~

Phrasing Name/ID Phrasing

Sentence Structure Entity that is Name/ID+ are the name of +Entity being Named

Trait Phrasing

Subject+ have+ Object

Preposition Phrasing

Subject+ are +Preposition+ Object

Adjective Phrasing

Subject+ Adjective type+ Entity that contains Adjective

Verb Phrasing

Subject +Verb, Subject +Verb+ Object Subject +Verb+ Object +Object Object + are +Verb, Object + are +Verb+ Object

~

Example PoructName are the name of Products. Ex: List/show/Display all the Products names. Product-ids have Product-name Ex: List the Product ids and their Product names Products are in Product-category-name Ex: W c h products names are in food category ?

Popular-Poduct are adjective describing Product. Ex: what are the name of three most popular Droducts? Employees-sold-products Ex: Who sold most "BREAD (BIG)"?

162

W

t

Adjective Phrasing

Figure 7: Snapshot of Popular Products Name and their Quantity Sold Using Adjective Phrasing.

5 Conclusion T h e size of superstore database is huge but unfortunately the existing data analysis methods in developing countries are quite unsatisfactory. However this big database has a lot of hidden information. In order to get this important information data mining techniques could b e applied. It will help to optimize business decisions, increase the value of each customer and communication, and improve satisfaction of customer with services. Like business sector any sectors can b e benefited by applying data mining technology.

References 1. V.Saravanan and K.Vivekanandan, “Design and Development of A Unified Data Mining System Using Intelligent Agents”, Proceedings of Asia Pacific Conference on Parallel and Distributed Computing Technologies ObComAPC-2004, Vellore Institute of Technology, Vellore, Tamil Nadu, India, (2004). 2. W. H. Inmon., “Building the Data warehouse”, Q E D Technical Publishing Group, Wellesley, Massachusetts, (1992). 3. Qing Chen, “Mining exceptions and quantitative association rules in OLAP data cube”, (1999). Web source, http://www.collectionscanada.c~obj/s4/f2/dskl/tape7/PQDD~0024/MQ5 1312.pdf 4. J. Han and M.Kamber, “Data Mining: Concepts and Techniques”, Morgan Kaufmann Publishers, (2000). 5. SChaudhuri and U.Dayal, “An Overview of data warehousing and OLAP technology”, ACM SIGMOD Record, 26:65-74, (1997). . . 6. Microsoft Analysis server Online Manual. 7. Md. Nobir Uddin, Md. Geaur Rahman, A.K.M. Akhtar Hossain, Subrata Pramanik, Dr. Ramesh Chandra Debnath, “A New Approach of Developing a Hospital Management System Using Natural Language Technology (English Query)”, Proceedings of 7th International Conferences on Computer & Information Technology (ICCIT), BRAC University, Dhaka ,(2004).

Trusted domain Web Services for Delivering eGovernment Services Swamynathan S, Kannan A, Geetha T V Department of Computer Science and Engineering College of Engineering, Anna University, Chennai 600 025, INDIA Telefax:9 1-044-22301337. swainviis(c~aununiv.edu,kannan~~annauniv.cdu, rctami I(&mnauniv.edu

Abstract. Providing web service to various clients is one of the important aspects of service providers. Generally service providers maintain a server relating to their business domain and provide services for their clients. Rule server is an emerging concept through which various business servers establish B2B interface and thereby enable the rule server to provide reactive services to various rule users through active XML rule technology. In these types of web services there will not be any interaction between web service providers. But in general, when a client requests for a service, the service provider may have to consult other service providers and based on their results, the service provider may in turn have to provide the web service to the client. Various service providers can group to form a trusted domain. This enables the service providers to effectively serve its clients by co-coordinating with different domain servers offering different services within the trusted domain. Particularly, in an e-government scenario, providing a service generally requires more than one server’s interaction. Hence in this paper, a system architecture has been proposed, in which, agent based trusted domain servers will interact with each other in order to provide co-operative web services for a single user. This requires the conversion of composite web service into multiple sublevel web services, optimizing the order of execution of sublevel web services and keeping track of the status of client’s request. In order to provide intelligent egovernment, reactive behavior has been introduced into the system, which will be used by the trusted domain administrators, so as to improve the performance of the trusted domain.

Keywords Reactive web services, active ECA rule, XML, agent, e-government, SOAP 1.0 Introduction

Web service is both a process and a set o f protocols for finding and connecting to software exposed as services over the web. It provides a RPC style mechanism for accessing resources on web. Reactive web service mechanism uses a push technology 163

164

through which information is pushed to the client based on the matching of new event occurrences with predefined user interests [ 11. In a reactive web service system, the user may not be aware of all the service providers and the time of occurrence of the event associated with the service. Hence by sending the request to the rule server the user just waits for the service to be provided. The rule server is responsible for composing the rule, identifying the server where the necessary service will be provided and directing the server to send the result directly to the user. In case the required service is not currently available in the server, then the rule engine associated with the application server will be updated with the event of interest. Whenever the event of interest takes place, then if the necessary condition gets satisfied an appropriate action will take place thereby the results will be sent to the client who initiated the rule [8][9]. Generally, each domain server offers its own reactive web services and does not participate in co-operative reactive environment. Massimo Mecella et a1 [2] proposed a repository of workflow components for cooperative application, which describes the need for web services in a cooperative environment. However, since reactive web services were not incorporated, which may be required in most real time cases, push technology was not exploited. In reality, each web service may have associated business rules and these business rules may force the client to provide necessary information belonging to other web servers. This literally expects the client to understand the business logic of each web service and forces, procurement of necessary information from various service providers. This problem may frequently arise in an e-government scenario, where many web services belonging to various domain servers may have to interact with each other for providing the web services. A solution to this problem is providing cooperative web services, which enable the client to put the requested service to the appropriate web service provider, which in turn will take care of communicating with other domain web servers. Due to security and authentication problems, these domain servers need to be in a trusted environment. This paper discusses a system architecture, which allows trusted domain servers to interact with each other in order to provide co-operative and reactive web services. This requires the conversion of composite web service into multiple sub level web services, optimizing the order of execution of sub level web services, keeping track of the status of the client’s request and monitoring of events for intelligent egovernment.

2.0 Related work XML is a technology for representing data, which is in a platform-neutral serializatiodmarshalling format. The unfolding of the web service paradigm which was described by Akhil sahai et a1 [3] explains the genesis of web services, the state of the art and the promises that the web services paradigm holds. The Simple Object Access Protocol (SOAP) [5] defines an XML messaging protocol that allows applications to communicate. It defines what goes in as XML message and how to

165 process it, rules for encoding instances of application-defined data. Push Technology is the ability of sending relevant information to clients in reaction to new events, and is a fundamental aspect of modem information systems. A client needs full information about a web service in order to consume it effectively. WSDL and UDDI are the technologies that provide the necessary information about the web services, to the client. Web Service Description Language (WSDL) defines an XML grammar used to describe all aspects of how a web service is to be accessed, including endpoint address information, the transport protocol to be used, and the wire format of the messages to be exchanged. Universal Description Discovery and Integration (UDDI)] is a SOAP based protocol used to access repositories of web service metadata. In addition, WSDL, UDDI and SOAP support the implementation of push technology in order to provide cooperative reactive web services. Angela Bonifati et a1 [ l ] describes the Pushing reactive services to XML repositories using active rules in which, the reactive service was considered in XML repositories belonging to a single server. If there were many servers related to the event of interest, then the client will be provided with multiple results as and when the event occurs in the respective servers. The concept of considering a client’s request as a complex web service and dividing the same into multiple small web services and executing it in a distributed way is an important aspect in e-government scenario. Hence it becomes necessary to visualize pushing reactive services in a distributed environment. The need for web services in a co-operative environment was described by Massimo Mecella et a1 [2] in which, the interaction between web services was the core issue. But the reactive behavior becomes important among web services in a cooperative environment. This becomes inevitable in an e-government application where a web service may have to handshake with other services. Hence in this paper, we propose a system architecture for Multi-Server Reactive Web Services for intelligent e-government. This system relieves the client from bothering about the business rules associated with each web service and the intermediate web services that need to be serviced in order to provide the final solution. The issues involved in implementing the system includes the identification of number of intermediate web services, optimizing the order of execution of each web services and keeping track of the status of each client’s rule request. Once the client gives a request, the cooperative group of service providers (called trusted domain) will serve the request optimally, by dynamically allocating sublevel tasks to the various mzmbers of the trusted domain. Whenever, a trusted domain administrator frames a new rule, a reactive behavior is initiated into the trusted domain, by monitoring a set of events related to the rule.

3.0 System Architecture The block diagram shown in Figure 1 provides the necessary framework for agent based Trusted Domain Reactive Web Services for intelligent e-government. This system includes Trusted Domain Web Servers, Trusted Domain Service Scheduler, Rule Broker, Rule Tracker, Web Service Registry, Dependency Registry, Trusted Domain Administrator and Web Service Mapper.

166 The main component of the architecture is the group of web servers that form a trusted domain. Associated with the trusted domain is a regulating authority, which manages complex web services. Connected to the trusted domain are the Trusted Domain Service Scheduler and the Rule Tracker, both of which help in executing complex web services. Such web servers within the trusted domain consist of the related services

Web Saruia

Direct Client

Information

Dspsndsncy

Information

I

Swuics

Rsqussl

I

WsbSsruics Information

! ~ i g ~ 1.r eArchitecture of Trusted Domain Reactive web service system

that it offers, along with the rule template repository. The Web service registry is similar to the UDDI except that the web services registered with the Web service registry, are made available to the Trusted Domain Service Scheduler, which frames the rule. Whenever a web server is enhanced with a new complex web service, the n e c e s s ~web services offered by other servers, which are required to complete the task, are identified and sent to the Trusted Domain Service Scheduler, which in turn creates the rule schema. The initiating web server also provides the dependency information associated with the already existing web services in the dependency registry. The dependency information can be categorized into two namely Explicit Dependencies and implicit Dependencies. Implicit Dependencies exist between those services where the output of one service happens to be the input of another service. Explicit Dependencies exist between those services where there is a preferred order in which the services are to occur. The rule ternplate framed is assigned a unique id and

167

is appended to the rule repository of the Trusted Domain Service Scheduler and the initiating web server. At a later point of time if a new request for framing a Service Rule arrives, and if that involves an existing Rule or a set of Web Services that has dependency information, then the Trusted Domain Service Scheduler optimizes the Rule Plan by incorporating the predefined preferences. The Web service registry incorporates the web service details and helps the trusted domain web servers to avail the services. The Domain Web Servers in the trusted domain, while adding a web service to their service list, provides necessary details to the Web service registry and register their services and also provide any dependency information to the Dependency registry. The Dependency Registry is used by the Trusted Domain Service Scheduler for deciding on the order in which, the subtasks of the complex service are to be executed. The service requested by the Trusted Domain Administrator, will be mapped using web service mapper and provides a description about the requested service to the rule broker. The Rule Broker introduces the monitoring behavior into the respective trusted domain server and thereby initiates the reactive behavior. When the client gives a request to the Concerned web server, the web server matches the service request with the set of web services it offers. If the web server itself can directly handle the service, then it does so and sends back the result to the client. In case the service request needs to be executed in a distributed manner, making use of multiple web services offered by different web services, then the service is said to be complex. In this case, the request is received by the concerned web server (Initiating web server), and a rule instance is created based on the matching rule template associated with it, and is forwarded to the rule tracker. The Rule Tracker parses the embedded rule and figures out the first web service to be invoked and invokes it. If the execution type happens to be serial, the Rule Tracker waits for the response or else if the execution type happens to be parallel, then it invokes other web services within the same level. The Rule Tracker then sends the service response directly to the client.

4.0 Domain Specification The e-government scenario involves multiple domains and a request arising out of a domain has a very high probability that it involves a service from another domain. If the tasks of each domain were to be automated with out a centralized point of view, it would lead to islands of automation. The architecture proposed here involves a trusted domain monitored by a Regulating Authority, who is held responsible for the inter domain request that are framed as rules. In this scenario, the web servers which are described in Table 1 are considered and the table shows the inter dependencies among them for any client’s request. All the domain servers honor and serve the request, which are formulated by the Trusted Domain Service Scheduler. Some of the client’s request, which involves the co-operation of other servers, is listed below.

168

Requests originating from Direct Client 1. Request for land registration requires Encumbrance Certificate procurement, obtaining address proof from Civil Supplies and should have paid Property tax and Income tax for the last six months. 2. Request for starting a new company requires the possession of active license from Industrial Development Corporation, clearance from Pollution Control Board 3. Request for new passport requires Police clearance and address proof from civil supplies and should have paid the Income Tax and Property Tax for the past six months These requests involve optimization but do not incorporate reactive behavior.

CI

I

-

Citizen Information

IT - Income Tax

IDC Corporation

I

Industrial

Development

PCB - Pollution Control Board

PT - P r o p e r t y Tax

CSR - Civil Supplies Records

RG - R e g i s t r a t i o n l G e n l

POL - Police

LR - L a n d R e g i s t r a t i o n

RPO

I

P - Parallel Activity

I

-

I

Regional Passport Office

S - Serial Activity

I

Table 1. Inter dependency between the web servers

Requests originating from the Trusted Domain Administrators 1. If a company has not paid the electricity bill for 3 months and has not paid the telephone bill for 6 months and has not paid the corporation tax for 12 months deactivate its license. 2. Deactivate driving license, if a person is a part of 5 accidents or convicted in two accidents.

169

3. If a person has not paid income tax or property tax for more than 6 months deactivate passport and civil supplies. These requests just involve reactive behavior and do not incorporate optimization.

4.1 Mapping of User request When a user makes a complex request to a web server, the request has to be mapped with the services offered. Using a GUI, each request is obtained and then logged into a log file with an appropriate description about the service and the input parameters are stored. The log file is monitored for a set of expected events using DOM model and this request is matched with the existing rule schema within the initiating web server’s rule repository and an instance of the rule schema is created and forwarded to the rule tracker. The rule tracker assigns a unique rule instance id and the rule instance is logged into its rule instance repository.

5.0 Generic Rule and Web services The implementation of the multi server reactive web service requires a generic rule specification and web service specification in XML format. Section 5.1 summarizes the generic rule specification and section 5.2 summarizes the generic web service specification.

5.1 Rule Schema Specification The generic rule schema specification is shown in Figure 2. The RULE-INSTANCE-ID is an unique id provided to each instance of the rule logged into the Rule repository of the Rule Tracker and hence the Rule Tracker differentiates between same request from different citizens. The RULE-ID specifies the rule template id that is assigned to the rule when it was formulated. The RULE-TOTAL-LEVELS specifies the total levels in the Current rule. The RULE-COMPLETED-LEVELS specifies the total levels that have been completed at any given point of time. The LEVELS is a collection that stores all the available LEVELS within it. The LEVEL contains the elements like LEVEL-ID which identifies the level, EXECUTION-TYPE which specifies whether the sub levels are to be executed in serial or parallel, LEVEL-OPERATOR which specifies the operator within that LEVEL, SUB-LEVEL which represents a service that is to be executed, SUB-LEVEL-ID which identifies the SUB-LEVEL, SERVICE-ID which uniquely identifies a web service or a rule, SERVICE-BASE-URL which specifies the base url of the web service, the SERVICE-NAME specifies the name of the

170

service, the SERVICE-PROVIDER specifies the server name which offers the service and METHOD-NAME specifies the name of the web method, PARAMETERS-LIST which is a collection of the parameters for the service, IN which specifies the collection of input parameter set, IN-PARAMETER which specifies a single input parameter set, PARAMETERNAME which specifies the name of the input parameter, PARAMETER-TYPE which specifies the data type of input parameter, OUT-PARAMETER which specifies a single Output parameter set, OUTPUT which represents the output. The SERVICE-TYPE which specifies the SERVICE-TYPE ( SYNCHRONOUS [ASYNCHRONOUS), SUBLEVEL-STATUS, which specifies the SUB-LEVEL-STATUS (COMPLETED 1 NOT-COMPLETED), LEVEL-STATUS which specifies the LEVEL-STATUS (COMPLETED I NOT-COMPLETED). The SERVICETYPE specifies whether the service request is to be made as a SYNCHRONOUS one or as an ASYNCHRONOUS one. In case of SYNCHRONOUS service request, the Web Service is mostly a query or a processing on available data and does not require human intervention or a time interval and the results are available immediately. Hence in such cases, on the completion of the web service call, the output is obtained and the rule instance is updated with the output and the sub level status is set to completed. But on the contrary, an ASYNCHRONOUS service request may require human intervention or a time interval for the event to occur. The time interval may be a matter of seconds or days or any time quantum. Hence in an ASYNCHRONOUS service request, the Rule Tracker registers the web service request in the appropriate web server and that server monitors for the expected event and later, on occurrence of the event and completion of the request, another web service calls back and notifies the result to the Rule Tracker. The reactiveness of web services and the push technology comes to picture here. The OUTPUT contains the resultant set, [i.e. result of the web service] and may be used by any successor web services or be forwarded to the Client. It can either be a TRUE or FALSE, or a REPORT based on the analysis of data or a collection representing matching elements or a value. For instance, a web service that validates Citizen-ID provides TRUE or FALSE, as a result, whereas a web service that provides a report from Industrial Advisory committee has the output form of XML structure. In either case it is serialized and stored.

171 CRULEIID-ULE:ID>

CLEVEL-ID&EVEL-ID> <EXECUTION-TYPEWXECUTION-TYPE> -=LEVEL-OPERATOR&EVEL-OPERATOR> <SUB-LEVELS> <SUB-LEVEL> CSUB-LEVEL-IDWSUB_LEVEL-ID> <SERVICE_IDWSERVICE_ID> CSERVICE_NAMEWSERVICE_NAME, cSERVICE-PROVIDERX/SERVICE- PROVIDER> <SERVICE_BASE_URMSERVICE-BASE-U~

<SERVICE-NAMESPACEWSERVICE-NAMESPACE> CMETHOD-IDmETHOD-ID> <METHOD-NAMMETHOD-NAME>

CPARAMETER_NAME>

cmARAMETER-VALUE>

<SERVICE-TWEWSERVICE-'I?TE> COUTPUTWOUTPUT, <SUB-LEVEL-STATUSWSUB-LEVEL-STATUS> CLEVEL-STATUS&EVEL-STAlVS> -=LEVELS>

CACTION-SUB-LEVEL>

<SERVICE-NAME> <SERVICE-BASE-URMSERVICE-BASE-UW

<SERVICE-NAMESPACEWSERVICE-NAMESPACE> CMETHOD-IDWMETHOD-ID> <METHOD-NAM-ETHOD-NAME>

cmARAMETER-TWE> cPARAMETER_VALUE> -=AN-PARAMETER> -=AN> COUT-PARAMETER> cmARMETER-T~E>

CPARAMETER-VALUED4PARAMETER-VALUE> qOUT-PARAMETER>

qouT,

4RULE INSTANCE>

Figure. 2. Generic Rule Schema Specification

172

The LEVEL-OPERATOR specifies the Operator within that LEVEL. It can be either [AND I OR I NOT I NO-OP] where AND specifies all the SUB-LEVELS within that LEVEL (must return TRUE). The operator OR specifies that at least one of the SUB-LEVELS within that LEVEL should return TRUE. NO-OP specifies that if all the SUB-LEVELS within that LEVEL are completed, then the system may proceed to the next LEVEL, if any. NOT specifies that on negation of the specified web services, the control proceed to the next LEVEL. The Event and the Condition part are being clubbed together. The events are being monitored and on occurrence of the event, the condition values, passed as input parameters, are being evaluated and if the output turns to be the expected value, then the appropriate Action module is executed. The action module consists of a set of action sub modules each having a unique value. The RULE-DESTINATION-EMAIL-ID specifies where the results are to be forwarded. The RULE-OUTPUT-DESCRIPTION specifies the output message that accompanies the output of the rule.

5.2 Web Service Catalog Specification The web service provided by each web server, need to provide the meta data about its features to other trusted domain web servers. Hence it becomes necessary to arrive at a web service catalog description. The web service catalog for the proposed system is described in Figure 3. <WEB-SERVICE-CATALOG> <WEB-SERVICE>

<WEB-SERVICE-IDx/WEB-SERVICE-ID> <WEB-SERVICE-NAME><EB-SERVICE-NAME>

<WEB-SERVICE-DESCRIPTIONx/WEB-SERVICE-DESCRIPTION> <WEB-SERWCE-PROVIDER>~EB-SERVICE- PROVIDER> <WEB-SERVICE-BASE-URL~B-SERWCE-BASE-URL>

<WEB-SERVICE-NAMESPACE>4WEB-SERVICE-NAMESPACE> <WEB-METHODS> <WEB-METHOD>

<WEB-METHOD-ID>UWEB-METHOD-ID> <WEB-METHOD-NAMEx/WEB-METHOD-NAME> <WEB-METHOD-DESCRIPTION>4WEB-METHOD_DES

-4N-PARAMETER2

173 40UT-PARAMETER>
<WEB-METHOD-TYPE> WB-METHOD> WB-SERVICE-CATALOG>

Figure 3. Generic web service catalog specification

Each domain web server may offer various web services, Since a trusted domain has been proposed, the web services are to be consumed by the rules formulated within the domain. Hence, these web services may not be registered with an external registry such as UDDI. Hence, a separate registry setup called Web Service Catalog, which stores all the necessary information that is required to access the service has been used. Each web server, while offering a service, has to register with the Web Service Registry by providing appropriate information. The Web Server Catalog assigns a unique id “WEB-SERVICE-ID” to each web service registered with it. The WEB-SERVICEPROVIDER specifies the name of the domain web server that offers the service and the WEB-SERVICE-DESCRIPTION describes in a short text note the service offered by the web service. The WEB-SERVICEBASE-URL specifies the access path to the service and the WEB-SERVICENAME specifies the name of the service class that is offered. The WEB-METHODS is a collection that embeds the various methods offered with in the service. The WEB-METHOD is a structure that stores information associated with each method. The WEBMETHOD-ID specifies the unique id that is being assigned to a web method within the service and the WEB-METHOD-NAME specifies the name of the web method. The WEB-METHOD-DESCRIPTION provides a short text note about the method offered. The PARAMETER-LIST stores the input and output parameter info, namely their name, type and value. The WEBMETHOD-TYPE specifies whether the service is to be accessed synchronously or asynchronously.

6.0 Implementation Once the user request is mapped to a rule and the rule instance is created and logged into the Rule Instance Repository, the Rule Tracker takes control of execution of the rule and waits for response from the initially invoked web service. On getting back a response from a web service, the Rule Tracker sets the SUB-LEVEL-STATUS of that SUB-LEVEL to “COMPLETED’. The Rule Tracker also queries to find whether all web services within that LEVEL have been completed. This is accomplished by verifying the SUB-LEVEL-STATUS of all the SUB-LEVELS in case the execution type is parallel or the SUBLEVEL-STATUS of the last SUB-LEVEL in case the execution type is serial. If the execution type is

174

parallel and the LEVEL-OPERATOR happens to be “ O R and any one of the completed SUB-LEVEL’S OUTPUT happens to be TRUE then the Rule Tracker sets the LEVEL-STATUS to “COMPLETED”. Then it invokes the next level if any. On setting each level’s status to “COMPLETED, the Rule Tracker increments the value of RULECOMPLETED-LEVELS by one. Then it also verifies whether RULE-TOTAL-LEVELS equals RULE-COMPLETEDLEVELS. If so, executes the action sub module whose action value matches with the rule output. Also it provides the response to the service requester identified by the RULE-DESTINATION -EMAIL-ID through an email notification, if it exists. For the rules framed by the trusted domain administrators, where the reactive behavior is exhibited, the rule templates are logged into the appropriate web server(s) that could initiate the rule. In each web server, there are two XML repositories, namely rule template repository and rule instance repository. Initially the rule template, which gets initiated in this web server, is added to the rule template repository and added to the list of listeners for the expected event. If an event occurs then it is notified to the event listener. For each rule template in the list of listeners, rule instances in the rule instance repository are verified and those with a matching rule id are updated into the rule instance repository of rule tracker. If there is no matching rule instance a new rule instance is created and logged into the rule tracker’s rule instance repository. Certain complex rules may be served for partial execution. In those cases, there will not be a matching rule template. So when a rule instance is forwarded to queue up in the rule instance repository, it is also added to the set of listeners of the particular event. So when a notification for that event occurs, and if the condition is satisfied, then the rule instance in the rule instance repository of the rule tracker is updated. In this system, rules framed are bound to change dynamically and hence the rule structure is to be flexible enough to adapt those changes. One easier way to invoke a web service is by constructing a URL, which uses HTTP GET or POST protocol to consume the web service, through a browser application. The drawback is that these protocols are not intended to transfer complex data types. Hence, the SOAP protocol, which is a lightweight protocol, is used which can handle complex data types. Apart from this, using SOAP protocol aids in providing a built-in security. This architecture requires, the web services to be dynamically configured and consumed. This requires consuming the service contract in WSDL format at runtime, generating a proxy class, compiling it as an assembly and loading the generated assembly. C#’s CodeDom package provides classes and interfaces to handle the above requirements. Once the assembly is loaded, the type has to be identified and the web method offered by the type is to be invoked. It is required that the input parameter information and the output format needs to be known to consume the service. This is accomplished by, the classes in C#’s Reflection package. The event handling is taken care through the DOM event model and the delegates in C#.

175

7.0 Results The direct client's request and the possible ways through which the sub level tasks can be optimized have already been discussed. In this section, a sample complex web service request provided by the trusted domain administrator has been considered. Its associated sublevel web service rule specifications for various web servers are shown in Figure 4, 5, 6 and 7. This shows, how the reactive behavior for monitoring various events has been handled. If a company has not paid the electricity bill for 3 months and has not paid the telephone bill for 6 months and has not paid the corporation tax for 12 months deactivate its license

RI29<1RULE-INSTANCE-IDD R345-=dRULE_ID>

<EXECUTION-TYPE>SERIALL

<SUB-LEVEL.+ <SUB-LEVEL>

<SUB-LEVEL-ID>I4SUB-LEVEL-ID> <SERVICE_ID>WS237 <SERVICE-NAME>ELECTRCITY_DEFAULTERS_INFO4SER~CE-NAMED <SERVICE~BASE~URLDhtp://Electricity_TN~IN.co~Se~ce~4SERVICE~BASE~U~ <METHOD-ID>44METHOD-ID> <METHOD-N AME>IS_DEFAULTERWETHOD-N AME>

REGISTRATION-NO-=JPARAMETER-NAME>

ROOOOl GN-PARAMETER>

INTEGER-=JPARAME~R-~E> GIN-PARAMETED GN>

BOOLEAN-=JPARAMETER-TYPE> 4OUT-PARAMETERD 40UTD
<SERVICE-TYPE>SYNCHRONOUS4SERWCE-TYPE> <SUB-LEVEL-STATUS>NOT-COMPLETED4SUB-LEVEL-STATUS> 4 S U B LEVEL>

Figure 4. Sample rule instance specification for first web server

176 <SUB-LEVEL> <SUB_LEVEL_ID>t4SUB~LEVEL_ID> <Srrs_LEVEL_ID>t4S~-LEVEL-ID> <SERVICE ID>WS457<SERVICE ID>

Figure 5. Sample sublevel rule instance for second web server

RECISTRATION_N(WPARAMETE STRING<mARAMETER_TYPE> ROOOOl@ARAMETER VALUE> GN_PARAMETE~> GN> clN_PARAMETER> DURATION INTECER~ARAMETER-TYPE> lzcMETER-V~UE> GN_PARAMETER> COUP

BOOLEAN

40UP <mARAMETERS_LISP
<sERvlCE_TYPE>SYNCHRONOUS4SERVICE-TYPE> <SUB_LEVEL_STATUS>NOT_cOMPLETED q/SUB_LEVEL> 4SUB-LEVELW ~LEVEL~sTATUS~NOT~COMPLETEDcnEVEL_STATUS~ dLEVED

Figure 6. Sample sublevel rule instance for third web server

177 CACTIONS

TRUEBOOLEAN LICENSE DEAC"WATEo ~RULE~DES~NATION~EMAn~~D~~TECH~YAHOO.COM~ULE~DES~NATION~E~L~~D> CACTION-SLE-LEVED

CACTION_SUB_LEVEL~~>l4ACTION~SUB_LEVEL~lD~ 6ERVICE-lD>WS343YSERVICE_ID>

6ERVICE-NAME>LICENSE-INFO4SERVICE-NAMD 6 E R V I C E _ B A S E _ U R ~ h n p : / / R e g i s t r P t l o n _ T N >

<METHOD-lD>sclMETHOD_ID> cMETHOD-NAME>DEACTIVATI~LICENSE

S T ~ G ~ A ~ ~ R - ~ E > CPARAMETER_VALuE>ROOOOl~~E~R~VALuE> 4ARAMETERS_LIST, 4ACTION-SUB-LEVED

Figure 7. Sample action part for the rule instance

8.0 Conclusions A system for distributed reactive web services has been implemented. A trusted domain server, which includes multiple web servers have been realized, and the web services related to each web server have been implemented. In order to handle the distributed reactive web service system, a generic rule schema specification has been designed. The common web service schema for all the web services, provided by various web servers, with in the trusted domain server have been specified. In order to handle complex data types and messaging between various web servers, SOAP protocol has been used. The dynamic configuration and consumption of web services have been taken care by using C# CodeDom package and identifying the objects at run time was handled using the C#'s reflection package. The event handling to support reactive mechanism was dealt using DOM events and delegates. As a future work, the facilities of the trusted domain could be extended to other web servers with more security features and provisions.

References 1. 2.

3.

Bonifati, A., Ceri, S., Paraboschi, S.: Pushing Reactive Services to XML Repositories using Active Rules, Computer Networks 39(5) (2002) 645-660 Mecella, M., Pernici, B., Rossi, M., Testi, A.: A Repository of Workflow Components for Cooperative e-Applications, Proceedings of the 1st IFIP TC8 Working Conference on E-CommerceB-Business (2001) 73-92 Sahai, A., Graupner, S., Kim W.: The Unfolding of the Web Services Paradigm, HPL2002-130. Internet Encyclopedia.

178 4. 5. 6. 7. 8.

9. 10.

11.

12. 13. 14.

[4]XML Path Language (XPath). httu://www.w3.orn/TRIXPATI-I. Simple Object Access Protocol (SOAP) httn://~~~.w3.or~/TR/2000/SOAP. James Bailey, Alexandra Poulovassilis and Peter Wood.: An Event - Condition Action Databases: Model & Implementation, 1 WWW Conference (2002) Diego Lopez de Ipifia.: An ECA Rule-Matching Service for Simpler Development of Reactive Applications, Proc. of IFIP/ACM Middleware (200 1) Bonifati, A., Ceri, S., Paraboschi, S.: Active Rules for XML: A new paradigm for eservices. VLDB Journal, lO(1) (2001) 39-47 Ceri.S., Widom, J.: Active Database Systems: Triggers and Rules for Advanced Database Processing Morgan Kaufmann (1996) Norman W.Paton.: Active Rules in Database Systems. Springer-Verlag New York Inc ( 1999) Abiteboul, S., Amann, B., Cluet, S., Eyal, A,, Mignet, L., Milo, T.: Active Views for Electronic Commerce. Proc. of VLDB (1999) Abiteboul, S.,Benjelloun, O., Milo, T. : A Data-Centric Perspective on Web Services (Preliminary Report). Technical Report 212, INRIA, (2001) Abiteboul, S., Benjelloun, O., Milo, T., Manolescu, L., Weber, R.: Active XML: Peerto-Peer Data and Web Services Integration (demo). Proc. of VLDB (2002) Abiteboul, S., Cobena, G., Nguyen B., Preda, M.: Monitoring XML data on the web. Proc. of ACM SIGMOD (2001)

KNOWLEDGE DISCOVERY FROM WEB USAGE D A T A SURVEY

*

G T RAN', P.S.SATHYANARAYANA', L.M.PATNAIK 'Department of Electronics & Communication B.MS. College of Engineering Bangalore Department of Computer Science & Automation, Indian institute of science Bangalore

1

ABSTRACT Knowledge discovery from the web usage data (secondary data generated by the user interactions with the web) has become very critical for effective and efficient managing of the activities related to sbusiness, eservices, education, personalization, web site management and so on. Mining web access logs that contain substantial data about user access patterns on one or more web localities is an emerging research area. We are aiming to design &. implement a tool Knowledge Discoverer from Web Usage Data WWLJLl) that extracts interesting knowledge fiom web usage data. Here, we present a survey of the recent developments in this area.

2

Introduction

Databases with millions of records and thousands of fields are now common in Business, Medicine, Engineering and the Sciences. Online data continues to grow at an explosive pace, due to the Internet and widespread use of database technology. This has created an immense opportunity and need for methodologies of knowledge discovery in databases (KDD). KDD focuses upon building automated techniques for extracting useful knowledge from the databases using several techniques from computer science and Artificial Intelligence to search for relationships and global patterns in large databases leading to non trivial extraction of implicit, unknown, and potentially useful information from databases. The WWW is an immense source of data that can come from w e b content or from the w e b usage. With the transformation of the Web into the primary tool for electronic commerce, it is imperative for organizations and companies, who have invested millions in Internet and Intranet technologies, to track and analyze user access patterns. Most web sites are setup with little knowledge on the navigational behavior of the users accessing then. Knowledge about navigation patterns occurring in or dominating the usage of a website can greatly help the site's owner or administrator in improving its quality. Web mining is that area of data mining which deals with the extraction of interesting know ledge from the WWW[ 13. It has three categories: Web content mining, Web structure mining, and Web Usage mining. Web usage mining mines the server log files. Typical applications are those based on user modeling techniques such as web personalization and adaptive web 179

180 sites. KDWUD works on user profiles, user access patterns and navigation paths that are being heavily used by scommerce companies, for tracking customer behavior on their sites. 3

Related work

The recent years have seen the flourishing of research in the area of web mining and specifically of web usage mining. The first workshop on this topic WebKDD was held in 1999. As many believe, it i s Oren Etzioni, first proposed the term of web mining [l]. Ref [ 2 ] defines the web mining and also provides the categorization in web content mining, web structure mining, and web usage mining; then it provides a survey mainly focused on the results in the area of web content mining. Recently, RefI71 has presented an overview of the soft computing techniquesweural Networks, Fuzzy logic, Genetic Algorithms) used in web miniig applications with a specific focus on web content mining. This survey is based on more than 100 papers published since 2000 to till date on the topic of web usage mining. None of the existing commercial tools have the prediction and pre-fetching capabilities. Our research work includes the design of automated tools for automatic discovery of user access patterns from web log data using neural network concepts and intelligent agents with Prediction & Pre-fetching capabilities. We have proposed an agent based architecture for knowledge discovery from www transactions [Raju and Suresh Khandige, 20031 and identified the neural network approaches for implementing the various agents. We ire aiming to build automated tool KDWUD that is both accurate and understandable for extracting useful, implicit, and novel knowledge from the Web usage data using Advanced Machine learning techniques, Intelligent Agents and Neural Networks. 4

Data Sources and KDWUD Process

KDWUD applications are based on the data collected from three main sources: Web servms, Proxy servers, a d Web Clients. An Example of a Web Server Logis as shown below: - <method> <protocol>

   203.30.5.14.5 www.acr-news.org - [01/Jun/l999:03:0921 -06001 “GET /Calls/O WOM html HTTP/I.0“ KDWUD process has three main tasks: Preprocessing, Pattern Discovery, and Pattern Analysis. Preprocessing has a fundamental role in KDWUD applications. it comprises of four different tasks: i) Data cleaning; ii) Identification & reconstruction of user’s sessions; iii) The retrieving of information about page content and structure; and iv) Data formatting. Pattern Dicovery Techniques includes Association rules, sequential patterns, and Clustering & Classification.



 181



Association rules are used to find associations among web pages that frequently appear together in user’s sessions. Typical result has the form A,html, B.html + Chtmlwhich states that if a user has visited page A . htmland B.html, it is very likely that, in the same session, the same user has also visited page Chtml. Sequential Patterns are used to discover frequent subsequences among large amount of sequential data. Typical sequential pattern has the form [27]: The 70% of the users who first visited A.html and then visited B.html ajlerwards have also accessed page Chtml in the same session. Clustering & Classification techniques allow us to develop a profile for clients who access particular server files based on demographic information available on those clients or based on their access patterns.



5



Conclusion and Future Trends



Results of KDWUD can be utilized for: Personalization, System Improvement, Site modification, Business Intelligence, Usage characterization. scommerce, 6 business, and e-education applications. E-commerce companies may use for tracking customer behavior on their web sites as well as to understand and better serve the needs of web based applications. The proposed framework KDWUD seems to work very well for the problems considered. Future research issues in this concern are: Development of Clustering algorithms for grouping log entries into transactions using neural networks and Development of knowledge based intelligent agents to interpret the discovered rules using neural networks.



6



References



As it is not possible to cite all the papers here, Reference includes only the important papers. 1. 0. Etzioni, The world-wide Web: quagmire or gold mine? Communications of the ACM 39 (1 1) (1996) 6 5 4 8 . 2. R. Kosala, H. Blockeel, Web mining research: a survey, SIGKDD: SIGKDD explorations: newsletter of the special interest group (SIG) on knowledge discovery & data mining, ACM 2 (1) (2000) 1-1 5. 3. http://www.w3.org/Daemon/User/Con.g/ (1 995). 4. J. Srivastava, R. Cooley, M. Deshpande, P.-N. Tan, Web usage mining: discovery and applications of usage patterns from web data, SIGKDD Explorations 1 (2) (2000) 12-23. 5. S. Pal, V. Talwar, P. Mitra, Web Mining in soft computing framework: relevance, state of the art and future directions, IEEE Transactions on Neural Networks 13 (5) (2002) 1 163- 1 177. 6. B Mobashen, H Desai, T Luo. “ Discovery of aggregate usage profiles for web personalization”, in proceedings of KDD-2000 workshop on web mining for e commerce, Boston, Aug 2000.



 This page intentionally left blank



 Innovative Applications for the Developing World



 This page intentionally left blank



 CREDIT RATING MODEL FOR RURAL BORROWERS SSSATCHIDANANDA,FUMESH B. PANIKAR,PRADEEP [email protected]. in, [email protected] ABSTRACT With the renewed focus of the Reserve Bank of India and Finance Ministry to the provision of Micro-credit in the Rural Sector and growing competition in terms of reach and financial spread, Banks and Financial Institutions have necessarily had to foray into providing Rural Credit. An effective Credit Rating System for the Rural Sector is the order of the day to ensure successful implementation in a risk free manner. This document offers a technology-based approach for such a Rating Model as well as implementation methodology.



Introduction



Improving the quality of rural lending is essential to promote capital formation and growth in the rural areas as well as to ensure the financial viability of bank lending. The risk in rural lending can be mitigated through a robust system of rating of rural individuals. Application of information and communication technology plays a useful role in building such a system. This paper conceptualizes a technology-based approach for rural credit rating. It is claimed that adopting this approach will result in credit quality improvement and benefit both the banks and its rural clientele. The advantage for the banks is that they are able to reduce risk, defaults and consequential losses, while the people can get faster, better quality service, low interest rates through this credit rating system. An institutional benefit is the evolution of a strong decentralized viable credit rating system in rural areas. A viable credit rating system also helps reduce increase of non-performing assets (NPA) for the financial institutions and in the process, ensure increased profitability and better utilization of funds. The credit rating system can be extended to help in customer acquisition and retention activities of the institution as well.. 185



 186



Section 1: Need for Rural Credit Rating for Lending Effective credit risk management requires that the risk of each obligor be rated in relation to over all population. The individual ratings are a necessary tool for differentiating the risk of various proposals. The need for credit rating has assumed urgency and importance in recent times. Currently, banks are required to quantify the risks of their credit portfolio, and provide adequate capital to cover such risks in terms of ‘the latest Basel I1 capital requirements. Of course, the compliance with this requirement will be as per the specifications issued by the National Bank supervisors like Reserve Bank of India with respect to India. Increasing competition in financial services forces banks to differentiate between various customers with a view to avoiding adverse selection and moral hazard. If banks grant loans to their customers without quantifying the risks associated with these customers, there is likelihood of taking excess risk or making a bad choice. This situation can be alleviated through utilization of credit rating for informed decision-making and judgment with regard to credit. Since banks are under constant competition, pressure to reduce the spread and charges to keep the business, and therefore it is critical for them to make the right evaluation of the risks, which they are undertaking. What is needed for this purposes is a fairly sophisticated system of assessing the risks, capable of fine shades of differentiation and discrimination. The banking sector in India is becoming increasingly corporatised. The pressure to ensure profitability and adequate returns to the shareholders has to service their capital has become important for survival and growth. With cutthroat competition and reduced margins in the urban sector, banks are looking at diversifying their portfolios and broad-basing them to cover the rural markets. (Refer [I]) This is to ensure better returns as well as allow for more sources for cash flows. Feasibility in implementing a credit rating system allows banks to adopt such a strategy. The use of technology for the rating system will enable greater transparency and objectivity. It will also facilitate better data management and analysis. Further the compliance to the Basel I1 requirements paves way for a rigorous credit rating process both internally and externally. Section 2: Challenges in Rating Rural Customers Professional credit rating agencies both National and International have not entered the rating of the rural clientele in a variegated and developing economy like India. Listed below are some of the factors that have been limiting adoption of credit rating in the rural markets:



 187



- Non-availability of data / information in a standard form at regular intervals - Opaqueness of the information about the rural people and economy.



-



The geographical distance and access problems.



- The difficulties for authenticating the information. - Perceived non -viability of such operations, and above all. - The daunting magnitude of such mammoth operations involving 400 to 500 million individuals in a country like India. (Refer [ 11) The standard approach to credit rating therefore is neither relevant nor flexible in the context of developing economies. The challenge, therefore, is to come up with a methodology that is both cost effective and easy to implement, even if such an approach does not achieve the accuracy and consistency of the kind that is possible to achieve in an urban areas, yet it would represent an improvement in the situation with beneficial results. (Refer [2]) Section 3: Proposed Approach



We propose a technology-based approach for the rural credit rating. The essential elements of such an approach are the following: a. Encouraging the evolution of the self-financing, economic -incentive-based credit rating system at the village level. b. Extensive use of technology for creating a common rural information infrastructure. c. Economical, technology- intensive methodology for collection and updating of the data continuously. d. Use of computerized model for deriving a credit rating. The objectives of the above approach are the following: 1. To enable access to self- validated data / information with documentary proof in the digital form to the service- providers at low costs. 2. To provide techniques and tools for evaluation of the credit on an individual basis rather than in a standardized manner. 3. To apply the information and communication technologies appropriately for reducing costs, delays, increasing accuracy, objectivity and removing governance problems. 4. To enable more efficient capital allocation and improve quality of lending. The central idea in our approach is that the basic information I data required for catering to needs of the people, needs to be available in a digital form and it should be always kept current through regular updates, and validated with



 188



reference to independent sources and through internal statistical processes. Further, automated credit rating models are to be used to generate credit rating of the village people as and when required by the financing agencies. The innovation of this model is to provide for facilitators in each village who will be acting as an agent of multiple banks for the purpose of providing information in respect of the people in their village. He/She has to be the entrepreneur in the village and his/her compensation is based on his performance in the form of fees / commission. Such a performance-based remuneration will provide economic incentive for efficiency and reliability as compared to the existing formal systems of collection of data that have been time-consuming, costly and not delivering good results. We have identified and used the following parameters as being relevant for Rural Credit Rating:



- personal assets and income - bank payment history



-



credit history



- overdue amounts - outstanding loans



-



credit type (whether it is a fresh credit or new credits taken).



We have further broken down each one of these parameters into a number of factors. For example, the personal background includes data on education, total annual income, allocation history, property history, asset ownership, and bank accounts and external influence. The payment history covers number of installments paid, number of defaults in the past and the period of default and other negative public records. The credit history covers two factors, time periods since the account was open and the time since last activity. In the credit type, we capture total number of accounts and types of accounts of that individual. In the new credit, we want to cover the operations of recently opened accounts as per the recent credit enquiry and whether there has been positive credit after problems in those accounts, if any. The rating system used in this case is the expert- judgment based rating. The purpose of this system is to arrive at a credit score for an individual whom the banks can use evaluate creditworthiness and the interest rate to be charged. The details are given in Annexure A for Sample Scale and Factors. Establishment of Credit rating Agency



The possible approaches for credit rating in rural areas are the following:



 189



a. The rural bank branches collect information and prepare the ratings of the borrowers. b. Professional credit rating agencies carry out credit rating studies. c. Establishing a centralized credit rating agency by an act of legislature. d. Decentralized, incentive-based, entrepreneurial technology based approach for credit rating operations. While the banks, in theory, can come up with the credit rating queries and have got the required experience, in actual practice, based on the experience of last five decades in rural lending, they have systemic problems in carrying out this activity, such as the reluctance of the qualified people to work in rural areas and the lack of incentive for performing their work. For instance, despite the regulatory requirement in India that the banks should carry out a survey of all the villages in the area of operation, such village surveys are not yet available in most cases, let alone the question of being current. In many cases, it has not even been possible to collect all information about villages. The question of collecting authentic information about the villages and updating them appears presently an infeasible task for the bank branches. As already described in section 1, the professional agencies may not have the appropriate methodology for doing the credit rating of this magnitude and are unlikely to find it an economically viable proposition even if they come up with an appropriate model, given in the magnitude of the work and the peculiarities of the rural information. A typical response to situations of this kind in the past has been to make a law establishing a new agency to do the task. Unless the operational problems are addressed, any new agency brought into existence will fail to deliver the goods. Typically the Government established agency / Government owned agencies are likely to suffer from systemic problems of inefficiency and operational difficulties. Therefore, we favor an entrepreneur based, decentralized approach. Section 4: Role of Technology



The credit rating system for rural markets will use technology at different levels. Information and communication technology may be used at both operational and computational levels to ensure scientific credit rating. The typical operational cycle consists of the following activities:



-



Collection and collation of information Validation of information with documents Feeding information to the credit rating system



 190



-



Providing periodical updates on individual as well as the village Access to credit rating for decision-making as well as market research for new products.



The above processes can be addressed through effective utilization of question~aires and storage of the resultant i n f o ~ a t i o n in a d o c ~ e n t management system. S u ~ p o ~ i ndocuments g can be scanned and the images classified, linked, indexed and stored. A system of generation of infor~ation feed from di~erentsystems to the central repository will ensure sharing of credit history and other key events relating to individuals as well as the village, among financial insti~tions. The computational part of the system will take into account personal and socio-economic factors to establish a credit rating system for a village(s) or rural market. Periodical flow of accurate information and analysis of patterns across villages and over time will be part of the ath he ma tical model. This will be used to ~ne-tunethe credit rating model generation to create a learning system that improves with time. A data-w~ehousingand mining approach ensures that a corrective and self-improving system is in place. The critical success factors of a technology-based system in this case are the following:



 191



1. Availability of server and public network on a large distributed scale The system should be up and running throughout the year and accessible from the remotest comers of the country. Access should be possible over Internet domain as well as private networks, to ensure appropriate reach from different geographies. The system should be scalable in terms of the information that it can store and process. A network of redundant servers across geography should be able to ensure availability as well as sharing of processing load. 2. Ability to extend and customize The system should be extensible and able to accommodate newer kinds of inputs as well as additional factors and models for rating. The changing economic scenario and automation of different departments and facilities will ensure availability of newer data for factoring into the credit rating system. 3. Ability to Integrate The rating system should be integrated into numerous other systems to collect input data as well as deliver credit summaries. The system should also be able to integrate to the numerous banking software systems in the market. This can be achieved by providing multiple interfaces using web services, messaging, export-import, digitally signed credit papers, and other innovative ways of accepting requests and publishing credit information. 4. Security Credit rating information needs to be kept secure. While this information is private to the creditor, the agencies and the banks, it is information that can be potentially dangerous in the wrong hands. As the system is open to the Internet domain and as information may be exchanged in online and oMine formats, the data needs to be secured using different techniques. This could include encrypted data, digitally signed and encrypted data that can be accessed only by authorized certificate holders. Typically, the system and processes should be certified for compliance to IS and Security norms. (Refer [3]). 5. Affordability Finally the system should be affordable at the grass roots level. The equipment that is used for data collection as well as for communication should be commonly available technology. This ensures that price as well as maintenance cost of such technology is affordable. Given the progress in technology and falling prices of the hardware and bandwidth, the above solution appears viable and affordable.



 192



Section 5: Conclusion Credit rating of rural individuals in developing countries is a desirable proposition. However, it does not appear to have been attempted in most of the countries, like India, on account of intractability, the magnitude of the task and the operational difficulties involved such as cost and sustainability. We have developed the technology-based methodology and approach for addressing this problem. However, the key element in our approach is the encouraging evolution of an internal participation of the villager to source the information about the developments on an on-going basis actuated by the economic incentives. The hture work lies in testing this model in the field and to perfect it and provide robustness. Further research also can be done for its adoption of its various scenarios. References [ 11 The Fortune at the Bottom of the Pyramid - CK Prahlad



[2] Rural Credit Rating Guidelines issued by People’s Bank of China (htt~://www.uhc.aov.cn/cnalish/) [3] COBIT(3rd Edition): Audit Guidelines (2000), COBIT Steering Committee [4] FICO Rating Standards from myFICO. (htt~://www.mvfco.com) [ 5 ] Rating information and models from FairIsaac. (http://www.fairisaac.com ) [6] Bank for International Settlements (httn://www.bis.orp, )



Annexure A- Sample Credit Rating Model Desipn Methodoloev of the Rating Svstem for Rural Borrowers To know how a potential lender views a person on whether to grant him loan or not, it is important to know one’s credit score. Each financial institution has its own type of rating system (Refer [4], [ 5 ] and [ 6 ] )and hence the way by which the credit score is calculated will vary from one place to another. In our rating system, the rating scale has been chosen to be from 0 to 1000 for ease of computation and understanding. The rating scale can be seen as given below: Rating Scale: Excellent VeryGood Acceptable Uncertain Risky



-



-



Greater than 900 850To900 775 To 850 700-775 Less than 700



 193



e Variou§



actor§



Given below are the various Categories, their respective weight and their co~espondingscores for assessing the rural borrowers :



Now we shall see the various factors within each category and the implementation part will consist of assigning a suitable rating to each of the factors. l~~entation:



a.



b. iii. > 5 - 10 C. Total Number of Past Dues ( 0.2 ) i. < 5 ii. 5 - 7 iii. > 7 d. Time Period of Past D i. < 1 week ii. < 1 month iii. < 6 months iv. > 6 months - 1 Does he / she have any negative public records ? (0.4 ) e. i. Yes - 35



 194



2. PERSONAL, INCOME & ASSETS ( 0 - 200 ) a. Personal i. Identification Confirmation ( 0.25 ) 1. Yes - SO 2. NO - 20 ii. Education ( 0.15 ) 1. Graduate - 30 2. Literate - 15 3. Illiterate - 10 iii. Legal History ( 0.15 ) 1. None - 30 2. Case - I0 3. Convicted - 5 iv. Joint Application with female family members ? ( 0.05 ) 1. Yes - 10 2. NO - 5 v. External influence ? ( 0.05 ) 1. Yes - 5 2. NO - 1 0 b. Income i. Total Annual Income ( 0.15 ) 1. <12000 - 10 2. 12000-50000 - 15 3. 50001 -200000 - 20 4. 200001 -500000 - 25 5 . >500000 - 30 c. Assets i. Property Ownership ( 0.1 ) 1. <20000 - 5 2. 20000-50000 - 7 3. 50000-100000 - 10 4. 100000-500000 - 15 5. >500000 - 20 ii. Consumer Goods Ownership ( 0.05 ) 1. <10000 - 5 2. 10000-25000 - 7 3. >25000 - 10



 195



iii.



Vehicle Ownership ( 0.05 ) 1. <5000 - 2 2. 5000 - 20000 - 3 3. 20001-100000 - 7 4. > 100000 - 10



3. CREDIT HISTORY ( 0 - 150 ) a. Length of time since account was opened ( 0.6 ) i. < 1 year - 20 ii. 1 -5years - 3.5 ... 6 - 10 years 111. - 70 iv. > 10 years - 90 b. Time since last activity ( 0.4 ) i. < 1 week - 60 ii. 1 - 4 weeks - 45 ... 111. 1 - 2 months - 25 iv. > 2 months - 15 4. DUE AMOUNT ( 0 - 200 ) a. Total Amount he/she owes to the bank ( factor weight = 1 ) i. < 10000 - 200 ii. <50000 - 175 iii. < 100000 - 150 iv. < 500000 - 100 V. >500000 - 60 5. CREDIT TYPE ( 0 - 100 ) a. Total number of accounts in his/ her name ( 0.6 ) i. 1 - 60 ii. 2 - 3 - 40 iii. > 3 - 20 b. Type of accounts ( 0.4 ) i. Mortgage - 40 ii. Installment - 20 ... 111. Revolving - 15



6. NEW CREDIT ( 0 - 100 ) a. Number of Recently Opened Accounts ( 0.3 ) i. Nil - 30 ii. 1 - 20 iii. > 1 - 1 0



 196



b. Number of Recent Credit Inquiry ( 0.2 ) i. Nil - 30 ii. 1 - 2 - 1 0 iii. 2 - 5 c. Time since Recent Credit Inquiry ( 0.2 ) i. > 1 year - 30 ii. < 1 year - 15 iii. < 1 month - LO iv. < 1 week - 5 d. Has he / she established positive credit after encountering payment problems ? ( 0.3 ) i. Yes - 30 ii. No - 1 0 Now the total credit score in each category is the sum of the credit scores of the options selected for each factor in that category. And in a similar way, the sum of credit scores of each category gives the total credit score of the borrower.



 VIRTUAL DIGITAL LIBRARIES: AN ADVANCED METHODS AND TECHNOLOGIES DR. R.P. KUMAR B.B. Dikshit Librar All India Institute of Medical Sciences Ansari Nagar, New Delhi I 10029, India drrpkumar@hotrnail. corn



S.K. MEYER, M. Tech, MCA, PGDBM Department of Computer Facility All India Institute of Medical Sciences Ansari Nagar, New Delhi I 10029, India sushilmeher@yahoo. corn ABSTRACT Background: The development Virtual Digital Libraries of an open source system that



functions as a repository for the digital research and educational material produced by members of a research university or organization. Running such an institutionally based, multidisciplinary repository is increasingly seen as a natural role for the libraries and archives of research and teaching organizations. As their constituents produce increasing amounts of original material in digital formats - much of which is never published by traditional means - the repository becomes vital to protect the significant assets of the institution and its faculty. In this paper it has been discussed the functionality and design, and its approach to various problems in digital library and archives design. The second part discusses the implementation of DSpace at MIT, plans for federating the system, and issues of sustainability. Problem: It is an attempt to address a problem that faculty/researcher have been expressing to the Libraries for the past few years. As faculty and other researchers develop research materials and scholarly publications in increasingly complex digital formats, there is a need to collect, preserve, index and distribute them: a time-consuming and expensive chore for individual faculty and their departments, labs, and centers to manage themselves. Method: It provides a way to manage these research materials and publications in a professionally maintained repository to give them greater visibility and accessibility over time. Names are a vital building block for the digital library. Names are needed to identify digital objects, to register intellectual property in digital objects, and to record changes of ownership. They are required for citations, for information retrieval, and are used for links between objects.



197



 198



These names must be unique. This requires an administrative system to decide who can assign them and change the objects that they identify. They must last for very long time periods, which exclude the use of an identifier tied to a specific location, such as the name of a computer. Names must persist even if the organization that named an object no longer exists when the object is used. There need to be computer systems to resolve the name rapidly, by providing the location where an object with a given name is stored. A unique string used to identify digital objects. The handle is independent of the location where the digital object is stored and can remain valid over very long periods of time. A global handle server provides a definitive resource for legal and archival purposes, with a caching server for fast resolution. The computer system checks that new names are indeed unique, and supports standard user interfaces, such as Mosaic. A local handle server is being added for increased local control. Conclusion Virtual Digital Libraries will play an important role in the future of academic libraries and archives Organization should look forward to production collaboration with other institutions in this area.



 ROLE OF INTERNET-BASED GIS IN EFFECTJYE NATURAL DISASTER MANAGEMENT DR. MRS. A.S. BHALCHANDRA Govt. College of Engineering,Aurangabad 431 005, MS MR. S. S . RAUTMARE Govt. College of Engineering, Aurangabad 431 005 MRS. S. V. WAGHMARE MIT, Aurangabad DR. S. R. BHALCHANDRA Govt. Medical College, Latur. Abstract: This paper presents importance of GIS as a major tool for effective planning, communication, and training in the various stages of the disaster management cycle. The prime concern during any disaster is the availability of the spatial information, and the dissemination of this information to all concerned. Internet-based CIS can play a key role in this aspect by providing cost-effective information at various stages of the disaster life cycle, with a much wider reach. The following aspects have been covered: how Internetbased CIS can be used as a very effective tool for disaster management, in the various stages of the disaster management life cycle, some examples, the Indian perspective, and the SWOT analysis (strength, limitations, opportunities and the risks) of using the Internet-based GIS for disaster management.



Introduction Natural disasters have made their presence felt to every one hand struck in hurry. Every time the world has been caught unaware for the volume of destruction. Every one new that it is coming, but could not avert its aftermath. Tsunami, cycloness, Caterina, earthquakes, flood, landslide, wildfire are the forms and shapes, these disasters take to strike heavily on the mankind. Everytime they have struck, they have caused voluminous damages in terms of men and material. Governments, police, NGOs, organizations for civil services are left with huge tasks to carry. Rehabilitation, diseases, epidemics, hunger, shelter need to be given priorities as life continue to exist through these means. Life can come back to normal, if these disasters are managed appropriately as 199



 200



they cannot be avoided. The management tools need to be as speedy and formidable as possible. Medicines, and help from Doctores is the most vital part of management. Lot of lives can be saved through the Medical help. The help coming from the various resources and the minimum guidelines for the sufferers need to be channelised effectively so as to reach the maximum. Multiple paths need to be undertaken. Announcements on loudspeakers, announcements through radio networks, communication through Internet based GIs, communication through mobile and satellite networks, dialogues through television networks are some of the effective means to reach the people. The paper focuses on role of internet based GIS in effective natural disaster management. CIS as a Powerful Toolfor Disaster Management



Geographical information system (GIs), will definitely help managing the disaster. Access to information is crucial for the effective management of disasters. GIS provides online timely and accurate information. Money as well as time is spent on finding the relevant information. The information is stored redundantly in several places and in several formats. Maps and spatial information are important components of the overall information. Hence mapping and spatial information acquisition becomes vital for any disaster management effort.



The Emerging Role of World Wide Web in Disaster Management The WWW is an effective tool for communication. It provides a means to exchange ideas, knowledge and technology. As an effective tool for communication, it can be invaluable for disaster management. This globalization will have far-reaching impacts, and hopefully, the catastrophic events will become less disastrous with the increasing use of WWW and networks. Effective information management help managing disaster. But, GIS on Internet, which could have powerful implications for disaster management, is yet to be fully explored. Integration of GIS and the WWW will lead to increase of the usage and accessibility of spatial data. GIS is normally restricted to a community of trained experts.



The potential of Internet GIsfor effective Disaster Management Thousands of web sites provide images and maps of the earth, but this information remains underutilized for disaster management. Having been isolated earlier in desktop applications and back-office servers, geospatial



 20 1



technologies are now undergoing a transformation to become better suited for the Web. Geo-enabled web services can be integrated in space and time for better decision-making, learning, and research.



Stages in the Disaster Management Lifecycle There are five important phases of disaster management: Disaster prevention, Disaster mitigation, Disaster preparedness, Emergency management and Disaster recovery. Disaster prevention, disaster mitigation and disaster preparedness constitute the pre-disaster planning phase. Pre-disaster planning is the process of preparing in advance. Disaster prevention is to eliminate or avoid harmful natural phenomena and their effects. Disaster mitigation deals with reducing human suffering and property loss. Disaster preparedness encompasses to limit the impact of natural phenomena by structuring responsed and establishing a mechanism for effectivng a quick and orderly reastion. Emergency management covers responding to disaster by various organizations. Disaster recovery provides relief after the disaster has struck. It deals with providing food and shelter to the disaster victims, restoring normal conditions by providing financial and technical asssistance. Table 1: SWOT Analysis for the use of Internet GIS Technology Strength



Weakness



1) Wider Reach



1) The dependency of Internet



2) Simple to use



2) Time required for analysis and modeling



3) Facilitates Cooperative effort



3) Higher bandwidth requirement



4) Faster Dissemination of information



1) Mobile internet still not popular



Opportunities



rhreats (Risks)



1) Use of communication and networking



1) A comprehensive Internet based GIS application



2) The reach can be extended to handheld devices



3) Real time analysis possibility 4) Reduction in the cost of Internet GIS



could be difficult to develop, and slower than a similar desktop GIS application (Solution: Only put minimal functionality on the web, and use it only as a supplementary tool) 2) The high initial cost of developing such application (Solutions: Shared funding by various players involved in the disaster management cycle)



 202



Conclusion The opportunities created by Internet based spatial applications are immense, and are being universally accepted. However, apart from bandwidth constraints, the technology involved in web applications offers some unique challenges for application developers. Despite all its constraints Internet GIs, can be immensely helpful in managing disasters.



References [ 11 Indian Express Newspapers (Bombay) Ltd.



[2] www.oDengis.org [3] www,govtech.net [4] G.D. Kunders, Hospitals, planning and management [ 5 ] Hindustan Times



 A TEXT-FREE USER INTERFACE FOR EMPLOYMENT SEARCH INDRANI MEDHI’



BHARATHI PIITI~



KENTARO TOYAMA’



Microsofr Research, Bangalore, India’ Birla Institute of Technology and Science, Pilani, Indid Kelesa Konnection is a text-free application consisting of user-interface components which, together, can lower the barriers to computer usage for those who are functionally unable to read, and who, for the most part are also computer illiterate [l]. It allows two user groups, domestic helpers and employers - with different literacy levels and computer skills - to communicate on a common platform. It gives both user groups a wider range of choices to meet their specific needs, unlike in the present situation where the process is informal, inefficient, and mediated by self-appointed agents. In this paper, we discuss the design process and the resulting application design for the site that would be exposed to potential employees. We used an ethnographic design process in which we sought user feedback from the very beginning to understand the responses and requirements of our subjects. The resulting design is based on key lessons that we gained through this process [4] [ 5 ] . The UI eliminates the need for text, uses unabstracted cartoons versus simplified graphics, provides voice feedback for all functional units, and allows actions on mouseover where possible. Through repeated interaction and many informal usability studies, we have found that our design is strongly preferred over standard computer interfaces, by the communities which we address.



Introduction Most computer applications because of heavy use of text pose an accessibility barrier to those who are unable to read fluently [ 11. If the UI has to be understood even by illiterate or semi-literate people, then much of the interface may need to be relying primarily on icons [1][4][5][7], photographs [4][5][6], and audio playback [1][7]. Our project proposes an experiment similar to the “Hole in the Wall” project, a wellknown experiment in India [3] in which children learned basic computer skills with no explicit teaching. However ours is aimed at an adult audience and bears the additional intent to provide information that this user group might be interested in [2]. Since adults tend to be more inhibited and less curios about technology we anticipated that significant modification to the standard windows-based GUI would be required to entice adults to interact with a computer. For this purpose we perform in-depth user studies of a target community and through an iterative prototyping process are designing an application adapted to the local requirements [5]. Although we are working with a particular application addressed at a specific community, we believe that many of these UI elements are applicable to other similar communities [ 11. Design Process The Target Community: Most of our subjects in three slum communities in urban Bangalore, India, were illiterate or semi-literate women (highest education attained being schooling up to the sixth grade). They worked as domestic helpers at nearby households while the husbands were daily wage laborers. Everyone spoke Kannada or Tamil and a very few spoke Hindi. The average household income was INR 800- INR 3000 per month. Almost all the households had television sets; a few had music systems and LPG burners.



Ethnographic Design: Ethnographic design is a process which focuses on understanding users in specific contexts, analyzing and synthesizing this information, incorporating 203



 204



insights into a design, checking the design with the user group at each step, accommodate changes in the next iteration [4][5] and make the most of user experience 181. We held subject interviews and returned to them to evaluate our designs and inco~oratedthe changes in the next prototype we designed [4]. There were specific challenges we faced: being accepted by the community, making the users feel comfortable to talk and extracting relevant information from them [4], helping them overcome fear while using technology, designing comprehensible information for an illiterate user group [ 1][2], representing religious, social and cultural affiliations in the design. Right from the initial site visit we had started taking a Tablet PC along with us. Many of the women were shy and reluctant to hold the stylus and draw on the screen. We therefore asked then to draw something they were very familiar with- the rangoli on Paint (as in Figure 2), and thus were successful in removing some bit of the fear and reluctance.



igure 1. Testing prototype



Figure 2. Rangoli drawn on tablet PC



F~gure3. Site visit



To further establish rapport we visited them more regularly and made detailed observations of their activities, living environments and objects, their economics, interactions and coinmunications. 3.Design Iterations Choosing the Application::After extensive discussions with the slum women we decided to provide content on prospective job information for adult women who are currently working as domestic helpers. At present, our subjects got job information through word of mouth or through informal agents within the slum. They often continued working at the same place for low wages because they were not aware of better opportunities elsewhere. Determining Needs: Our focus-group-style discussions of 40-50 women showed that the that the women needed the following information to decide on the job: Specific tasks requested by the employer, Timings (hours to be worked), Break-up of wages by task, Address of the employer, Number of rooms in the residence; Number of people residing and location within the city. Based on this, we decided that we should design at least three kinds of pages: a page allowing users to select jobs by location, a job overview page, and a job description page. : We knew that the information had to be in Prototyping the Job Description Page: graphical form, since our target users were not generally literate [l] [4] [5]. We made few sketches of the same activity (e.g., sweeping), ranging in degree of abstraction. In general, our subjects appeared to prefer semi-abstract cartoon representations. We found that they were better able to identifl activities as actions when the cartoon included action cueswater ~ ~ i inna faucet, g steam puffing out from a kettle, without which they identified



 205 the drawings with the location where objects were kept (e.g., the kitchen), rather than the associated action (e.g., cooking). We mocked up a simple prototype in which an icon plays an audio file on mouseover [ 11 [7]. Most women were thrilled to hear a computer speak in their native language. We decided that all elements of the user interface would be accompanied by voice feedback. Based on these, we created four different information designs, as shown in Figure 4. These differ in their layout, level of complexity, and amount of i n f o ~ a t i o ndelivered.



Option I



Option 4



Ophon 3



opholl 2



Figure 4. Initial designs for the job decription page.



We received three types of feedback after testing these out: u o ~ or s ~ i s i n t e r ~ r e t eicons: Some icons were not interpreted the way we expected [I] [5]. For example, our initial design for a residence is shown in Figure 5 (left). We thought this was a universal symbol for a house or residence, but our subjects perceived it as a village hut or their own residence and were confked. They felt that prospective employers would live in a tall apartment complex; with their feedback, we redesigned this logo as shown in Figure 5 (right).



Initial



Final



Figure 5. Designs for the “residence” icon



on: We found that in some cases, Hindu subjects and ~ u s l i msubjects interpreted the placement of icons dif€erently. Probably because Urdu is written from right to left, Muslim culture tends to think of time as Rowing from right to left, as opposed to left to right ( in Figure 6). An arrow placed between the time indicators fixed this particular problem.



Initial



Final



Figure 6. Ambiguity in iconic representation due to cultural biases



s: We had used a matrix of checks and crosses to show what activities needed to take place in which rooms. They were unable to relate the tick marks and crosses to the individual rooms. So, we replaced them with explicit associations between room and task (see Figure 7) [4].



 206



Figure 7. Indicators of which tasks required in which rooms. The matrix structure(1efi)was not understood.



:The first interface of the job location map showed how many houses offered jobs on a street map. But after testing, we found that the women never used any maps, Instead they would find out the nearest landmark to that place if they were to go on their own or go along with someone who knew the location.



Initial



Final



esign of location pages



Generally, our subjects were not strong on sense of absolute direction. They understood the approximate direction relative to their local landmarks. So, we made our subjects identify areas they knew and divided the whole of Bangalore into those. After that we made individual boxes depicting these places and added photos of Iandmarks f a ~ i l i a rto them (See Figure Il(right)) [5]. On mousing, there is voice feedback that calls out the placename [l] [7].



4. Principles of text-Free UI In this way, we extracted several design principles that we believe apply not only to women in the pa~icularslums we worked in, but also for potential users who are both illiterate and computer illiterate. The general features of the design are as follows:



a) Minimal use of text (except for numbers): The application had to have interfaces that did not rely on text [1] except numbers. Therefore job information presented in the form of icons and visuals is accompanied by numbers in the UI [4] IS], which



represent the wage break-ups. b) Use of unabstracted cartoons versus simplified graphics: ics : The initial sketches forms with action Which were tested showed that users could identify sketch realistic sketch c)



cues where required, much better than abstract graphic forins [l]. Action on mouseover where possible: o§§ible:Many users had not used writing devices (pen, pencil, etc.),leave alone aouse, mouse for a long time. They were unable to press the stylus with required intensity when asked to click on the screen [Z]. So, wherever possible, action takes place on mouseover, instead of clicks. aek for all functional units: In the application, each ~nctionalunit on the visual display is supported by voice feedback in the local language (Kannada, in this case) in order to enable immediate understanding of what the icon represents. Voice feedback is available on mouseover [l] [7].



 207 e)



Consistent "help" icon on all screens: There is a consistent "help" icon with voice instructions on mouse-over [1] [7], on all screens to help the user recognize instructions while navigating through this application.



5. The Final Prototype: The above principles which evolved through user studies are generally applicable to all the pages of the application.



ml £



Introduction page



Job listing page



Job information page



Figure 9, Designs of the four pages of the final prototype of the application.



Introduction page: We intentionally left this screen simple to avoid overloading firsttime users with a lot of information, Location page: Already discussed above. Job listing page: In this section, the jobs available are listed along with the basic information about each job. Job description page: This page compiles all relevant details about a particular job. 6. Work Several people designing for illiterate and semi-illiterate user groups, highlight these things- minimal or no use of text [1][2], easy navigability [1][2][4][5][7], audio input [1] and audio output [1][7], iconic elements [1][4][5][7], numeric formats [4][5][7] relevant content [2]; animation into images to tell a better story [1]; controlling visual attention by using properties of motion, shape, color, etc [6]. In our current work, the aspects of the above work that we have used are- providing relevant information, minimal use of text, easy navigation through very less number of clicks, voice feedback, and visual cues and we have also leveraged numeric literacy. At the same time we have experimented with the use of unabstracted cartoons versus simplified graphics. We have included standard visual elements for indicating motion. We also emphasize on action on mouseover where possible. 7. Conclusions and Future Work In this paper, we have presented a text-free UI applied to the particular application of providing information about employment opportunities for domestic labourers [2]. We discovered several design elements which we believe are applicable to any user group that is illiterate and new to computer use [1]. In future work, we expect to add a feature in which a short movie playing continuously would describe how to use the application. We also hope to begin work on the employerside of the site. We have found that most employers are themselves literate, but generally computer illiterate, suggesting that yet another UI would be appropriate for them. Finally, we expect that a database system will mediate between employers and employees,



 208



allowing them to effective communicate, even though they see different designs for similar information.



References [11 M. Huenerfauth, Developing design recommendations for computer interfaces accessible to illiterate users. Master’s thesis, University College Dublin, (2002). [2] A. Chand, Designing for the Indian rural population: Interaction design challenges. Development by Design Conference, (2002). [3] S. Mitra. Self organizing systems for mass computer literacy: Findings from the hole in the wall experiments. International Journal of Development Issues, Vol. 4, No. 1 (2005) 71 - 8 1 [4] T. Parikh, K. Ghosh and A. Chavan, Design Considerations for a Financial Management System for Rural, Semi-literate Users. ACM Conference on ComputerHuman Interaction, (2003) [5] T. Parikh, K. Ghosh and A. Chavan, Design Studies for a Financial Management System for Micro-credit Groups in Rural India. ACM Conference on Universal Usability, (2003) [6] Mark A. Folk, and R. Davis, Query by attention: Visually searchable information maps. International Conference on Information Visualization IV (200 1). IEEE Computer Society, (2001), 85



[7] T. Parikh, HISAAB: An Experiment in Numerical Interfaces, Media Lab Asia Panel Discussion, Baramati Initiative on ICT and Development, (2002) [8] Master of Design Degree. Institute of Design. htttd/www.id.iit.edu/Prad/mdes.html.



 VERIFI - An integrated system with Voice chat, Elearning, Remote control, Internet access, and Fraud detection for an Intranet C. Mala', Dhaneel kumar. N. S . z, Pankaj Kumar3 Department of Computer science and Engineering, National Institute of Technology, Tiruchirappalli,Tamil Nadu 'mala@ nitt.edu, *[email protected] , [email protected] Abstract. With computers becoming part of everyone's life, interconnection of computers into an Intranet or an Internet enables peer groups in institutes, industries, hospitals, government offices, etc. to share resources and information. But the networks today have become very susceptible to malpractices by some unscrupulous minds who wish to disrupt its proper functioning or steal some vital information from it. Though it is not possible to prevent anyone from doing such activities, the network can be made immune to such fraud or intrusions. The methods currently available for intrusion detection require extra hardware and software and they do not support any other applications that can be provided by the Intranet. So, an integrated Voice Chat, E-learning, Remote computer control, Internet access, and Fraud detection system for Intranet (VERIFI) is proposed in this paper.



1. Introduction Intranet, the interconnection of computers in a localized area can also be used as an elearning tool in institutes and for consultation in hospitals, for audio conferencing in industries, if it can broadcast or multicast the information. Moreover, as the cost of providing internet access to every user of an Intranet is more, there is a requirement to have one internet connection and to enable all users in the intranet to share the same connection. To provide normal functioning of the network, to prevent hackers from entering our network and disrupting the routine usage of the network, it is required to detect intrusion or any fraud done by any user to the Intranet. This motivated us to develop an integrated Voice Chat, E-learning, Remote computer control, Internet access, and Fraud detection system for Intranet (VERIFI).



2. Proposed VERIFI system The Intrusion detection techniques proposed in [ l , 21 and the E-learning sessions conducted in schools or colleges as in [3,4], and the telecommunication system proposed in [5] not only require a major change in hardware and software of the com209



 210



puter system and the server of the intranet but is a costly affair. The proposed VERIFI system consists of two modules namely an Administrator Module (AM) and a User Module (UM). The functions performed by the AM as shown in Fig 1 are Voice chat (VC), E-learning Tool (ET), Remote computer control (RCC), Remote Registry Change (RRC), Remote Windows Explorer (RWE), Internet Access from intranet (IA), and Fraud Detection (FD). The functions performed by the UM are FD,ET, IA, and VC. 2.1 Proposed algorithms The algorithm executed by the Server and the Clients of VERIFI system are as follows: 2.1.1. Server side algorithm: 1) 2) 3) 4) 5) 6)



Create sockets for connection and bind sockets to required port Wait for connection from client side in block mode If connection request is true then accept the request and insert in queue While connection is active repeat steps 5 and 6 Wait for next connection from client side in block mode While connection request is true for FD /EL/RCC/RRC/RWC/IA Accept the connection; create a thread for respective connection and sevice the request.



2.1.2. Client side algorithm: 1) Open a port and connect to the listening port of the server side 2) If connection is successful then create a thread for that connection 3) While connection is active, service the request 4) Stop this thread processing and go to the parent thread 2.2 Components of the AM: The individual components of the AM module are implemented as follows: 2.2.1. Fraud detection: The different types of attacks [7] including port scanning will be immediately detected by this module and an alert signal is given to the administrator 2.2.2. E- learning Tool: The users in an Institute or an Industry are able to securely participate in E- learning sessions using Whiteboard, which can be used by the administrator to broadcast/ multicast details on the board to all or a group of users. 2.2.3. Remote computer control This tool is used to get the system configuration, history of computer, to open a desired site on remote PC, to set date and time of remote PC, to access and modify files in remote PC, to shutdown, logoff, and restart or to lock the computer.



 211



2.2.4. Remote registry change The registry is the main software control of the system. Using this module, it is possible to remove the startup entries to avoid any virus attack to the system and to the intranet. 2.2.5. Remote Windows explorer This enables the administrator to view the windows explore of the remote computer so that he can share, copy, paste, edit, and delete the files and folders of the remote client from his desktop 2.2.6. Internet Access from Intranet To reduce the cost involved in accessing the internet, IA module enables secured internet access can be provided to all the users of the intranet by using proxy server. 2.2.7. Voice chat Instead of sending messages from the administrator to the user, VC enables the administrator to participate in audio conferencing in an intranet environment without any extra hardware. 2.3 Components of the User Module The User Module consists of the tools for Fraud Detection, E-learning, Internet access, and Voice chat and their functions are similar to that of the AM. FRAUD DETECTION



INTERNET ACCESS



E-LEARNING TOOL



Fig. 1. ADMINISTRATOR MODULE OF VERIFI SYSTEM 2.4. Implementation and testing The VERIFI system is implemented and tested in our Institute’s intranet, which supports UNIX and Windows platform with 1200 nodes. It has been found that with



 212



this VERIFI system, staff members can closely monitor the usage of computers by students from their desktop; they can conduct e-learning session, participate in audio conferencing, and remotely access the student’s desktop.



3. Conclusion As the usage of Intranet is becoming popular in institutes, industries, hospitals, and government offices it is required to protect them from F r a u d intrusion from malicious users. The VERIFI system proposed in this paper provides a hacker free intranet, which can also be used as an e- learning tool with additional utilities such as remote computer control, remote registry change, remote windows explorer, voice chat and with internet access from any node in the intranet. This VERIFI system can also be used in any internet environment with no extra cost in hardware.



References [l]. Mutz, D, Vigna, G, Kemmerer, R, “An experience developing an IDS stimulator for the black-box testing of network intrusion detection systems”, Proceedings of 19th Annual Computer Security Applications Conference, 2003. [2]. Mukherjee, B, Heberlein, L.T, Levitt, K.N, “Network intrusion detection ”, IEEE Network, Volume 8, Issue 3, May-June 1994, pp 26-41. [3]. Forman, D, “Cultural change for the e-world”, Proceedings of International Conference on Computers in Education (ICCE ’02), IEEE 2002. [4]. Lee Chye Seng; Tan Tiong Hok , “Humanizing e-learning ”, Proceedings of the International Conference on Cyberworlds (CW’03)JEEE 2003. [5]. Akio Koyama, Tadahiko Abiko, Noriyuki Kamibayashi, Norio Narital, “Care communication system for improving inpatients Quality of life”, Proceedings of 15‘hInternational workshop on Database and Expert systems, 2004, pp313-317. [6]. John McHugh, Alan Christie, and Julia Allen, “Defending yourself the role of Intrusion detection systems”, IEEE software, 2000, pp 42-51. [7]. www .infoswwriters.com



 ENTERPRISE RELATIONSHIP MANAGEMENT FOR HIGHER EDUCATION “AN APPROACH TO INTEGRATED SYSTEM MANAGEMENT” SYED SAJJAD RAZA ABIDI AND MUHAMMAD NADEEM Shaheed Zurfkar Ali Bhutto Institute of Science and Technology, Karachi, Pakistan Emails: {sajjad nadeem)@zabist.edu.pk Information technology in general and Internet in specific have revolutionized the way of doing business, integration and interaction between different business components for With the accelerating pace of changes in facilitating the stakeholders in the domain. technological, regulatory, and economic environments as well as in stakeholder values, it is important to adapt quickly in order to survive and sustain business operations. This research has focused on addressing the specific requirements of a Enterprise Relationship Management (ERM). Customer has a pivotal role in any business, it is important for a business to satisfy its clients in order to survive in the market, a customer with a positive attitude towards a business, engages in positive word-of-mouth communication which affects the bottom line of the business. During this research, evaluation & investigation of available CRM systems for education sector was done, and CRM system for education sector was proposed. In second phase based on proposed CRM framework, a through investigation was done for developing business architecture, and then performing requirement analysis along with proposing software architecture; performing risk analysis for the proposed solution. Finally, based on the results of second activity, the best solution was suggested for an enterprise for managing their relationship.



1



Enterprise Relationship Management



Enterprise Relationship Management ( E M ) relates to solutions allowing an enterprise to share customer, product, competitor and market information to accomplish goals of meeting long-term customer satisfaction and increased revenues [ 11. This research addressed both the theoretical and practicable implementation aspects of various fundamentals of Enterprise Relationship Management (ERM). This research centered on “Model-Based Methodology” for Enterprise based Software Engineering, the basic idea is that when groups within an enterprise want to engineer their collaboration, this methodology help in modeling their processes or activities. The methodology covers the enterprise and the virtual enterprise. 2



ERM Architecture



Like other business institutions, a higher education institution also needs to make their relationship stronger by satisfying their clients in order to service in business. 213



 214



ana aging Critical Customer Relationships in higher education is a complex and cumbersome process. There would be little coordination among, and even within, service areas across the institute. This duplication of services is not only inefficient and costly to the institute, but the absence of corporate standards means that multiple messages are being c o ~ u n i c a t e dto clients. The institute’s s i ~ i ~ c a n t i n v e s ~ e n tin its corporate image and reputation are thus being diluted as the consequence of fragmented and inconsistent services that convey a bad impression to the public. This means that educational institutes cannot stand behind any service promise it would like to make to its clients, as there is no mechanism by which to deliver, let alone monitor, such an undertaking. RM-EDU can help an educational institution in continuously service & monitoring their clients, hence making their relationship stronger.



for Education Architecture



2. I



(Source: Author’s ownj



~ D U Tactics ~ r o p o s ~ dR ~ - Implementation



Organization too often embarks on their ERM initiative without having clear vision of where they want to go to how they intend to get there. It is necessary for com~aniesto take their time to properly set ERM vision and use a stepw~se ~ e t h o d o l to o ~create and realize their E M implementation tactics. Industry trend [2,3,4,5] shows that, most of the organization follows a 10-step approach to formulate ERM strategy



 215 ReletSonshipManapment Peaple lrwea



Figure 2 RM Tactics Formulation Process



3



Proposed ERM Design



The design of front-end application (e2RM) uses the requirements that were gathered during analysis to create a blueprint for the hture system. The successful design was build on what was learned in earlier phases and leads to smooth implementation by creating a clear accurate plan of what needs to be done. According to research plan and the proposed solution that come up with the solution which includes a front-end application called e2Rh4 and its integration with back-end application i.e., ZabDesk.



 216



3.I



Front-End Application - e2RM



e2RM is the database engineering and communication system for the entire office. Just like the internet, it provides employees with access information at one central location where all your organization information is kept. e2RM is integrated with back office ERP system to control the data redundancy and updated data across organization. By having an integrated environment the organization staff will be better informed, highly motivated, in result they will work more effectively. eZRM contains two major applications i.e., relationship management and campaign management. All components are integrated to provide complete and updated information across organization. 3.2



Back-End ERP Application - ZabDesk



ZabDesk as an ERP system has been designed for automation of business process at any academic institute. Automating the academic, administrative and financial processes. The software is a modular, clientlserver based system that covers all academic management tasks. The solution provides both web and desktop interface for academic, faculty, students, finance, management, records & registration, and library activities. 3.3



Component for Integration of e2RMwith ZabDesk



Component for Integration of eZRMwith ZabDesk - e’RMZDInt, is a component which integrates e2RM with ZabDesk. Its primary service is to provide ongoing synchronization of data among these two systems. With this component, students, faculty, staff, management, items (products i.e., programs, and other saleable items) and price information from ZabDesk is transferred regularly to eZRM,giving access to up-to-date information from e2RM or ZabDesk. The design of front-end application (eZRM) uses the requirements that were gathered during analysis to create a blueprint for the future system.



4



Proposed Architecture Assessment



Software expert Tom Gilb in his book: Principle of Software Engineering Management says that “If you do not actively attack risks, they will actively attack you” [6]. The suggested framework in this study having following risk analysis steps: Step 1: Brainstorming of customer value processes Step 2: Compile a list of features Step 3: Determine the customer likelihood



 217



Step 4: Identify the impact of failure for CRMfeaturedattributes Step 5: Assign numerical values Step 6: Compute the risk priority 5



Acknowledgements



The authors are extremely thankhl to Almighty ALLAH the most beneficent, merciful and compassionate for giving the ability to perceive knowledge and wisdom to carry out the research. Thanks to Dr. Javed R. Laghari, TI. Vice President and Project director, Shaheed Zulfikar Ali Bhutto, Institute of Science and Technology, SZABIST for providing the state-of-the-art facilities and research environment, which made this work possible. References 1. ERh4 definition at Webopedia, Web Encyclopedia www.webopedia.com 2. CRh4 Toolbox, Industry Articles on CRM Strategy & Implementation, http://crm.ittoolbox.com/nav/t.asp?t=511&p=5 11&hl=5 11 3. Thomas Hannigan & Christina Palendrano, "Preparing for CRM Success CRM Initiatives: The Path from Strategy to Implementation" for ITtoolbox CRh4 at http://crm.ittoolbox.com/documents/document.asp?i=2594 4. Kristina Markstrom, "Secrets of CRh4 Success: Strategies for achieving the vision through end user adoption" for ITtoolbox CRM at http://crm.ittoolbox.com/documents/document.asp?i=259 1 5. Bill Donlan, "Anatomy of a Successhl CRh4 Implementation", for ASP News at http://www.aspnews.com/strategies/article.php/3519991 6. Technical report CMU/SEI-92-TR-30 SIE, Software Development Risk: Opportunity, Not Problem by Roger L. Van Scoy http://www.sei.cmu.edu/publications/documents/92.reports/92. tr.030.html



 Locating Cell Phone Towers in a Rural Environment H.A. Eiseltl. Vladimir Marianov2 Faculty of Business Administration, University of New Brunswick, Fredericton, NB Canada [email protected]



Department of Electrical Engineering, Pontifica Universidad Catblica de Chile, Santiago Chile [email protected]



Abstact Logistics is one of the main features missing in developing countries. Typically, roads are few, they tend to be two-lane in mediocre shape, communication lines such as telephones are not readily available, electricity and other utilities are scarce, particularly in rural areas. Given such a situation in which the separating effect of distances is exaggerated, emergencies cannot be dealt with efficiently: first, as appropriate facilities such as ambulances, emergency helicopters, fire brigades, police or army units may not exist in sufficient number and with appropriate equipment, and secondly that even if they were to exist, they would not be able to reach the location of the emergency rapidly enough. In many of these counties, steps are undertaken to improve the situation, but more often than not, demands posed by the fast-growing population outnumbers and outclasses the improvements that were made by the authorities with their limited resources. One of the ways to improve the situation, especially given difficult and undeveloped terrain, is to establish efficient lines of communication. Cell phones are a natural way to do that as they do not require literacy as internet connections do, and their hardware requirements are likewise modest. While they will not be able to overcome the lack of personnel, equipment, or roads to get them to the site where they are needed, they do enable decision makers and emergency personnel to deploy resources available nearby in order to minimize the damage.



As far as the technology is concerned, cell phones essentially are connected in wireless fashion to a transmission tower (a Radio Base Station or RBS), which, in turn, is connected though a communication satellite to the switching facilities of the mobile company and to the public telephone network. In the same way, the satellite connects the switching facilities to some other tower, which is connected wireless to the cell phone of the other party that is being contacted. As such, the situation is similar to one of a two-level hub: the fixed hub on the higher level is the switch, and the hubs with the variable locations are the transmission towers, while the cell phones themselves represent the customers. 218



 219



Additionally, power can be a limited resource in rural environments. This is particularly true when the reliability of regular power supplies is low (due to brownouts, storms, mudslides, etc), and the cell towers have to make frequent use of backup power systems, like solar cells, wind generators and others. In these cases, it is desired to reduce the power use to its minimum. Given this situation, we can identify two types of variables. First, there are the locations of the transmitters, and secondly, there will be the power of the transmitter that does determine its reach. This problem is similar to that of determining the transmitters for radio and television stations and it was dubbed Communications Design Problem by Lee in 1991. As in each location model, we are to choose the appropriate space, be it continuous, discrete, or on a network. While continuous space appears to be the most obvious choice, we have rejected this option due to its intractability. Consider this. Even in a perfectly flat plane, the reach of each transmitter would be a circle, so that the task would be to locate a given number of circles, so as to capture the largest possible number of customers. This problem is known to be NP-hard, and it does not even consider the possibility of different power of transmitters, or potential secondary objectives. Our model of choice works in a discrete space. In other words, the customers are assumed to be located in a finite number of known locations, and there is also a finite number of potential locations for the transmitters. As far as modeling is concerned, it should be mentioned that it may be useful to include not only villages or individual houses as customer locations, but also points along major traveling routes (as temporary customer locations at which accidents can occur), airports, and police stations (where attacks by insurgents may happen), as well as power stations, dams, or similarly sensitive logistics installations and other strategic targets. Our basic model can then be described as follows. Given a finite number of feasible facility locations, locate a fixed number p (that is determined by the budget) locations for the transmitters as well as their respective powers (that may possibly be chosen from a finite number of choices), so as to capture the largest possible number of customers. Initially, in the context of the developing world, these customers will include mostly officials that deal with various emergency locations, such as health care workers, firemen, police, and army personnel. Later, as prosperity of the general population increases, the very same network can be used by civilians. It should also be mentioned that in areas with a large number of tourists, they will be using the same network, being charged similar (or even higher) rates they pay in their home countries, thus partially or even largely financing the system. The model will also include a secondary objective that is designed to measure the potential loss in case of an interruption of service. Such an interruption may be due to a malfunction of a facility, or to insurgent attacks on a facility, similar to those witnessed in the past on micro power generating stations in Nepal. Losses of this nature are similar to those studied in network reliability theory. One possibility to model such losses is to assume that at any one time, a single node will be out of commission. A pessimistic (or cautious) decision maker will then attempt to minimize the



 220



worst such loss, resulting in a typical minimax objective. Note that while the two objectives appear to be diametrically opposed, they are not. The reason is that while the capture objective includes only sites that are populated, the minimax loss objective also, or even predominantly in the case of security, includes sites with important installations. That allows for compromise solutions, which is what this research is all about. The problem is first formulated as a mixed-integer linear programming problem, and exact solutions are determined. The special structure of the constraints allows the solution of small to medium-sized problems. For larger problems, heuristic algorithms are developed. One such heuristic works sequentially in the “locate-first, power-second” fashion, similar to the popular “locate-first, route-second‘’ employed in many location - routing methods. Another approach uses marginal analyses, i.e., decides at any stage of a construction heuristic whether to step up the power of any of the transmitters, or locate an additional tower. In all solution methods, tradeoff curves between the capture and the loss objective are determined.



 E-GOVERNANCE OF LAND RECORDS VIJAY GOPINATH KONDEKAR Shri Sant Gajanan Maharaj College of Engineering, Shegaon, INDIA



E-mail: [email protected] Drawing of the land records and their e-Governance is discussed using a computer program which has been developed for plotting its, to the scale. It can be an effective way of drafting all the land maps and particularly, the rural agricultural lands. E-Governance will lead to easy maintenance of the land maps or plots, durability and enhancement of the life of the map records, correctness in the calculations of the map data.



Plotting of Land Records by a computer Program “SURVEYPLOT”



2



“SURVEYPLOT”- program uses given dimensions in chains and annas for chainages and offsets and fiirther links to AutoCAD, using AutoLISP which draws the required plot of the land record. This also gives the properties of the plot as its area, boundaries etc. The -generalized mathematical equations are derived from geometry and can draw any irregular plots given in the using AutoCAD & “SLJRVEYF’LOT” particular format. 3 Flow Chart for e-Governance of Land Records An example of e-Governance of land records at a villageltown level is given belo below. 1 2



3



Existing Land Records in the Survey Number, as available in the concerned office (Fig. 1)__ ’ig.2) Create the plot of the Land Record with the use of computer program “SURVEYPLOT’ (Fig.2) Create Data Bank of all existing Land Records for the Survey Number ~



I



I



‘3



References 1. Brian W. Kemighan, Dennis M. Ritchie, “The C Programming”,Prentice Hall of India Pvt. Ltd 2. William M. Oliver, “IllustratedAUTOLISP”,Wordware Publishing, Inc. 221



 This page intentionally left blank



 Mobile and Ubiquitous Computing



 This page intentionally left blank



 MAC LAYER FAIRNESS ISSUES IN MULTI-HOP WIRELESS AD HOC NETWORKS



M V S DEEKSHITULU AND SUKUMAR NAND1 Indian Institute of Technology, Guwahati Guwahati 780139, Assam, India E-mail: { vsdm, sukurnar) @iitg.ernet.in ATANU ROY CHOWDHURY Infosys Technologies Ltd 44 Electronic City, Bangalore 5601 00, Karnataka, India E-mail: [email protected]



Wireless ad hoc networks consist of mobile/static nodes interconnected by wireless multi-hop communication links. Unlike conventional wireless networks, ad hoc networks have no fixed network infrastructure or administrative support. Moreover supporting appropriate quality of service and fairness for wireless ad hoc networks is a complex and difficult task because of the imprecise network state information. In this paper, we evaluate IEEE 802.11 in terms of its fairness in multi-hop scenarios. Moreover we identify the reasons for the existing unfairness and then propose a purely distributed algorithm to provide fair access to all flows. Our simulation results show that our proposed protocol approximates a round robin scheduler, thereby providing considerable improvement in terms of fairness and throughput in multi-hop flow scenarios.



1. Introduction



Wireless ad-hoc networks are attracting considerable research interest because of their ease of deployment and maintenance. Moreover in multihop wireless ad-hoc networks, fair allocation of bandwidth among different nodes or flows is one of the critical problems that affects the serviceability of the entire network. The critical observation from our survey is that prevalent techniques for multi-hop fairness, based on promiscuous overhearing, starts t o fail when the hop count exceeds three. In this paper, we present a round-robin flowlevel scheduling schema to achieve global fairness for the entire network. 225



 226



2. Unfairness in IEEE 802.11 and its impacts



The unfairness exhibited by IEEE 802.11 is clearly explained in [ 5 ] . For each flow, Constant Bit Rate (CBR) traffic is adopted and the traffic rate is large enough for a single flow to occupy the entire channel capacity. Long-term Unfairness in Asymmetrical Information Scenarios: In this case a sender who is in the range of the receiver of some other flow is able to get more information about the contention in the media than other senders. Concealed Information Problem: The main problem leading to unfairness in the CSMA/CA protocols is that the senders cannot obtain precise information about the contention in the medium. While the impact of long-term unfairness is obvious 2 l the short-term unfairness at MAC layer also leads to a large delay and jitter in both real time streaming traffic as well as adaptive traffic like TCP. Because of the single flow domination the behavior of on-demand routing protocols is also affected. This results in misleading information about link states. Moreover the short term unfairness causes starvation of some sub-flows leading to bottlenecks in the flow. The main reason for this unfairness is because a sender is unaware of when exactly the contention period starts and ends. Moreover there exists location dependent contention as well as issues like lack of receiver participation in contention resolution.



3. Proposed protocol



Our main aim is to provide fair (equal) access to the medium for all flows without degrading the effective throughput. To achieve this fairness we propose a distributed algorithm that approximates a round-robin schedular . Sub-flow: Two nodes of a flow those fall under each others transmission range and one of them identified as sender and the other as receiver. A sub-flow is identified by its sender and receiver pair. As part of its bookkeeping each node is required to maintain 3 lists LEzternal 1 LSender and LReceiver as explained below. L ~ ~ t IDS ~ of~ active ~ ~ sub-flows l : of which the node is not a participant. This is the information regarding external contention. LSender: IDS of active sub-flows for which the node is the sender. L R ~IDS~of active ~ ~sub-flows ~ ~ for ~which : the node is the receiver.



 227



3.1. Estimation of number of Active sub-flows ( n )



Whenever a node hears or overhears a frame, it inserts the corresponding sub-flow ID into the corresponding list along with a timestamp.Duplicate entries are refreshed with the latest timestamps, whereas stale entries are deleted after a timeout.This timeout period is long enough to transmit a certain number (say W e )of packets. Clearly, the value of We affects the precision of n. However ne, the current estimate of n can be over or underestimated and this leads to bandwidth wastage. In order to cope with the over-estimation problem, a mechanism called inactive notification is proposed. Whenever a flow sends out its last packet in its queue the sender should inform the other nodes that the flow is becoming inactive. Moreover the receiver also piggybacks this information in the responding frames CTS/ACK, as some nodes may not overhear the RTS/Data. All the nodes that hear the notification should delete this flow ID from their corresponding list. Thus we can use large value of We to cope with the under estimation problem. 3.2. Incorporating Receiver Cooperation



We note that the active sub-flows that are exclusively in the transmission range of the receiver also affect the transmission of the sub-flow. Therefore we propose a mechanism called receiver feedback to cope with this problem. The sender of a prospective sub-flow is being informed about the contention at the receiver through the acknowledgement packet. The contention at the receiver is expressed in Eq 1



N = LEzternal



+ LSender + LReceiver - L S R



(1) where L S R represents the sub-flows in which the corresponding node that requires cooperation is acting as sender. 3.3. Scheduling Algorithm



Using the estimation algorithm proposed in previous subsection, the sender as well as the receiver of a sub-flow obtains the number of active sub-flows currently conflicting with it. We define a set of rules that a sender should follow while contending for an idle medium. Rule 1: If the node is the intended recipient of the ACK frame, it should compute its mode of contention. The mode of contention is computed by checking whether it has transmitted a packet for each of the active sub-flows for which it is acting as



 228



sender. If not, then the node remains in active contention mode. Rule 2: If the node has transmitted the required number of packets, then it should be an restrictive mode for a calculated duration. The current estimate of contention at the potential sender is ne,



+ LReceiver



(2) Also let the estimate of contention at the receiver, informed through the ACK frame, be n,. The actual contention of the sub-flow is given by the maximum of ne and n,. This allows us to use the concept of spatial reuse, where the sub-flows involving either the sender or the receiver exclusively can access the medium simultaneously. The waiting period Twaitis ne = LEzternal



Twait =



m a x ( n e , n,) x TSUCCESS



(3)



Where T s u c c ~ s sis the time needed for successful transmission of a packet when the number of total active sub-flows in the system is one. Rule 3: The other nodes, overhearing the ACK packet, should start contending for the medium if they are not in restrictive mode. Rule 4: Whenever a node receives an ACK packet, the mode of the flow is recomputed. If the node is to be in restricted mode, then the duration must necessarily be recomputed. Thus every sub-flow attempts to restricts itself from contention after availing its chance of accessing the channel. This self-preempting feature allows the nodes to schedule themselves in a round robin fashion and prevents any sub-flow or flow from dominating for a prolonged duration. Moreover, at any instant of time, we are successful in reducing the number of nodes contending for the medium. 4. Simulation Results and Performance Evaluation



The simulations, performed in ns-2.27, are measured over fairness and throughput. The well known Jains index (eq. 4)is used to measure fairness N



N



i=l i= 1 Where N is the total number of flows that share the medium and yi is the bandwidth utilized by the flow i over a certain amount of time. We notice that the performance of IEEE 802.11 is inconsistent. It shows a good degree of fairness in low contention topologies, which degrades in high contention topologies. Our proposed protocol performs better in this count. We are also able to improve on system throughput considerably.



 229



(a) Scenario 1



Throughput



(b) Scenario 2



1



PP



I 802.11



(c) Fairness 1



Throughput



(d) Fairness 2



I



PP



1802.11



I



(f) Throughput 2 Figure 1. Fairness Indices and Throughput for Scenarios 1 & 2



5. Conclusion I n this paper we identify reasons for unfairness in multi hop ad-hoc networks and propose a novel solution achieving global fairness. We have also tried t o give channel access t o each node in a round robin fashion, using a purely distributed algorithm.



References 1. V. Kanodia, et a1 “Ordered Packet Scheduling in Wireless Ad Hoc Networks: Mechanisms and Performance Analysis”, In Proc. of MOBIHOC, June 2002. 2. H. Lou, e t a1 “A Self-coordinating Approach to Distributed Fair Queueing in Ad Hoc Wireless Networks”, In IEEE Infocom, 2001. 3. Y . Wang, B. Bensaou, “Achieving Fairness in IEEE 802.11 DFWMAC with variable packet lengths”, In Proc. of IEEE Globecom 2001. 4. T. Nandagopal, T. Kim, X, Gao and V. Bhargavan, “Achieving MAC layer fairness in wireless packet nerworks”, In Proc. ACM MOBICOM, Aug 2000. 5 . Li Zhifei, “Performance Evaluation and Improvement of IEEE 802.11 MAC Protocol in Wireless Ad-hoc Networks”, PhD Thesis, NTU, Singapore, 2003. 6. Z. Fang, B. Bensaou, and Y. Wang, L‘PerformanceEvaluation of a Fair Backoff Algorithm for IEEE 802.11 DFWMAC”, In ACM MOBIHOC, 2002.



 A FRAMEWORK FOR ROUTING IN MOBILE AD HOC NETWORKS AMIT GOEL AND A.K. SHARMA YMCA Institute of Engineering, Faridabad, India E-mail: goelmit1 (cihvhoo.coiii, rtrhokkalc?(ii!r~diff~iiuil. coin MANET’s have dynamic, random, multi-hop topologies consisting of relatively low bandwidth and limited battery power. The existing unicast routing protocols suffer from many drawbacks such as network wide flooding, processing overheads etc. [I]. A framework called “Location Aware Routing Protocol (LARP)” has been designed that overcomes the drawbacks. This paper presents the algorithmic details of the proposed protocol



1



Introduction



MANET is multi hop wireless network that result from the cooperative engagement of a collection of mobile hosts without any centralized access point [l]. Since the mobile nodes in the MANET dynamically establish routes among themselves to form their own network, such networks are also known as infrastructure less networks. MANET is an excellent tool to handle the situations like disaster recovery, crowd control, etc. where no infrastructure exists at the time of occurrence of such events. The routing in MANET becomes extremely challenging especially because of its inherent dynamic nature coupled with constraints like limited bandwidth etc. In this paper, a new routing protocol for MANET is being proposed. It utilizes the technology of GPS in order to discover the probable location of the users which in turn could help in increasing the efficacy of the route discovery process. The algorithmic details of proposed protocol called “Location Aware Routing Protocol (LARP) ” are provided in the sections given below.



2



LARP- The Proposed Protocol



Before proceeding to discuss the algorithm of LAW, a brief discussion about various packets, tables required by the protocol is given in following subsections. 2.1



Data Packet (DP) Header



This packet is used for exchange of data between the mobile nodes [ 13. The format of data packet header is given in Fig 1. Bytes 1 2 4 Figure 1. Format of Data-Packet Header.



4



230



4-64



(4 bits)



 23 1



2.2



Location Knowledge Table (LKT)



In this framework, the technology of GPS has been employed. Each node maintains location of every other node in the ad hoc network into a table called Location Knowledge Table as shown in Fig. 2. In order to maintain up-to-date location of mobile nodes, LKT is updated after regular intervals of time. The last three locations of mobile nodes are maintained for predicting the mobility pattern of the nodes.



Figure 2.Format of LKT.



2.3



Neighbor Awareness Table (NAT)



Each node in a cell maintains information about its neighboring nodes by broadcasting a Beacon-Request packet (see Fig 3). The format of Beacon-Request packet is given in Fig 4.Every recipient node adds the information of sender into a table called Neighbor Awareness Table (NAT) [1,2]. The addresses of neighboring nodes in NAT are stored in the decreasing order of their stable power signals as shown in Fig 5The neighbor nodes, which receive Beacon-Request packet in turn acknowledge by sending a Beacon- Reply packet. The Format of BeaconReply packet is given in Fig. 6. 2.4



Best Neighbor Node Selection Algorithm (BNNSA)



In order to provide the least number of nodes that a data packet (DP) may visit during its travel from source to destination, an algorithm called BNNSA has been designed [I]. This algorithm attempts to discover the best neighbor node to which DP should be forwarded. A search for destination I intermediate node is carried out. If the destination node is with in the same cell, then DP is forwarded to the node through a function called Forward ( ). In a situation when destination node is not in the vicinity of source node then BNNSA chooses the farthest node which is located in similar direction as that of destination node and has not been a visited node in NAT through a function called Get-Farthest-Node () as next intermediate destination for DP. The reason for selecting farthest node is that such a node has a higher probability of being in proximity of the destination node as compared to a nearer node that is likely to have almost same NAT entries as the node in question.



2.5



The Proposed Protocol for Routing of Data Packet: LARP



A Location Aware Routing Protocol (LARP) is designed that routes the DP from source to destination. In fact, it is the core algorithm, which sends DP efficiently by



 232



suitably using BNNSA, LKT and NAT. Firstly, LARP checks the value of TimeTo-Live (TTL). If the value of DP.TTL is found to be greater than 16, then DP has to be retransmitted from the source of DP. In case TTL is lesser than 16, the destination address of DP is compared with address of the visited node to establish whether the DP is meant for itself or not. If DP is meant for the visited node then the DP is read else DP. TTL value is incremented by one and BNNSA ( ) is called to forward DP to its destination. As soon as a DP arrives at a node, the node uses the L A W algorithm to route DP to its destination.



202.31.12.10 202.31.12.12 202.31.12.15 Figure 3. Beacon initiation by Node



I



Figure 6. Format of Beacon-Reply packet



3



Conclusion



LARP is a table driven routing protocol that drastically reduces the flooding of data packets by using BNNSA’s Global Positioning System (GPS) and NAT. The channel overhead has also been significantly curtailed leading to reduction in packet processing and thus saving of valuable battery back up. References



1. A.K. Sharma and A. Goel, “Moment to Moment Node Transition Awareness Protocol (MOMENTAP) ”, International Journal of Computer Applications (IJCA) Special Issue, IASTED, Vol. 2711, Jan 05, pp. 1-9.



 EFFICIENT CHANNEL ALLOCATION BASED ON PRIORITY OF THE VIJlEO IN VOD SYSTEMS D. N. SUJATHA~,RAJESH K. vl, GIRISH K ~VENUGOPAL , K. R’ AND L. M. PATNAIK~ ’ Department of Computer Science and Engineering, Bangalore Universiv. ‘Microprocessor Applications Laboratory, Indian Institute of Science, Bangalore. [email protected], [email protected]. in Abstract. Today’s technology offers easy access to multimedia services through high-speed communication networks. A Video-on-Demand (VoD) service allows customers to connect to an on-line video server and asynchronously watch videos. A Video-on-Demand system is typically implemented by Client-Server architecture. This paper presents a scheme of channel allocation based on priority of the requested video, to increase effkient channel utilization and lower the download time. This allocation scheme shows better performance when compared to conventional way of allocating channels.



1



Introduction



With the advent of networking technologies, Video-on-Demand has become an important application, A Video-on-Demand system can be designed with three major network configurations viz., Centralized, Networked and Distributed. When a client requests for a particular video to be played, the video server is responsible for streaming the request reserves sufficient processing capability and network bandwidth for the video stream to guarantee continuous playback of the video. Sridhar et al. [ 11proposed multicast patch with cache to provide truly adaptive VoD service. The idea of Mcache is to use batching, patching and prefix caching technique to complement one another. Mounir et al. [2] proposed an incremental channel design at the server side with a specific broadcast schedule so that the users can choose among a range of bandwidths to download the video at the cost of their access latency. In Bandwidth Efficient Video-on-Demand Algorithm (BEVA) [3], batching and patching have been proposed to reduce the bandwidth requirements of a VoD system that is not dependent on the user interactions. The determination of efficient transmission scheme for Video-on-Demand services selects an appropriate delivery scheme for the videos according to both priority of videos and the customer reneging behavior [4]. An algorithm to allocate channel effectively based on the priority of the video is discussed in [ 5 ] . Jack et a1.[6] have proposed channel folding technique as an aggressive client-side caching which is used to merge clients from one multicast channel to the other. The channel folding algorithm is applied to Unified Video-on-Demand (UVOD) architecture to demonstrate and to quantify its impact on the performance and trade off in multicast video distribution architecture. 233



 234



2 2.I



System Environment DeJinitions



1. Priority of video is a dynamic entity, it depends on the number of users requesting for a video at any instant. 2. Download time is the time required for a video to be streamed to the local server from the multimedia server. 3. Blocking factor is the ratio of the number of rejected requests to the total number of requests. 2.2



Architecture



Server Management Unit: The steps in the server management are as follows a) The user requests the local server for a video. b) The video is first searched on the local server and then on the multimedia server. If available then the request is accepted. c) The video stream is transferred from the multimedia server to the local server and finally to the users. d) The virtual circuit is set up between client and the local server. 2. Batching: All the users must wait for a minimum of S(t) seconds, called as batching time. The users requesting for the same video during batching time are serviced using multicasting so that bandwidth requirement is optimized. 3. Scheduler: If the request is not present in the local server, in the worst case, it may take up to x seconds to wait before the required video is made available to the local server. The network model aims to solve this problem by allocating channels between multimedia server and the local server based on priority. 1.



3



Problem



Given a VoD Network M(L,C,U), the multimedia server M consists of a finite set of local servers L=(lr, h, . . . , In), a finite set of optical channels C = { ( ~ i ,CjVCi, ci € L Ci # Cj}. Each local server li, where i=1,2,. . . n is connected to a set of users U = (u,, u2,. . . , U") (where Ui # w). The objectives of this work are to allocate channels effectively, and to reduce the download time at the local server.



Algorithm: Eficient Channel Allocation based on Priority of the Video in Video-on-DemandSystems (ECPV) The following notations are used in this algorithm, ri : Request for video i in local server; ni : Total number of requests for video i in local server; v[ 1 : An array containing all videos available in the multimedia server; s[ : An array which keeps track of the videos which are being served; freech : Free channels in the local server at any given time; chi,,: Channels allocated at any instant of time t for a video i in the local server; totch : Total number of channels in the local server.



 235



Request-Service routine in the local server. freech = totch if ( v[ does not contain video i ) then reject request ri else q =q + I sfd = video i i = s[d if fieech = totch) then chi,, = totch freech = nuN else for( p = 0 ;p < q ;p++) sum = sum+ rSLpI end for for(p = I ; p < q ; p + + ) new channels = (total channels * requestsfor video p)/sum j = SI~I



d = chj,,- new channels if ((chj,,- d ) >= 1) then de-allocate d channels Chj.1 = chj,, - d freech =freech + d end if end for chi,,=fi.eech fieech = null end if end if



De-allocation routine in the local server do while(true) stream all videos if (Video i finished streaming) then de-allocate chi,, freech =freech + chi,, for(p = 0 ; p < q ; p + + ) sum = sum+ rSLpI end for for ( p = I ; p <= q ; p + + ) new channels = gree channels * requestsfor video p)/sum j = sfp] ch,,, = ch,, + new channels end for end if end do



 236



When a request arrives, the video will be streamed to the user if it is available at the local server. Otherwise, it is checked at the multimedia server and if the video is not available then the request is rejected. On arrival of the first request, the entire bandwidth is allocated to the user. Otherwise the channels are reallocated in proportion to the number of requests for a video. It is ensured that every streaming video is allocated a channel of minimum bandwidth. Let the number of requests for a video vi of length Zi be ri.where i = { I , 2 . . . m}.The bandwidth for each channel is b and the number of channels is assumed to be N, therefore the total bandwidth available is bN.The channels allocated at any instant t for vi can be represented as chi,,= (ri*N)/(Zi=, ,om (RJ). The bandwidth allocated to the corresponding video vi is b * The time required to download the i" video = n; when t= Z,=lII (b * cki,J. The number of videos that are downloaded MD can be represented as Ci=llo k (vJ whenC,=I,o,,(5 *chi , J =hi. 4



Performance Analysis



The performance parameters of interest are channel allocation, channel utilization of the local servers, blocking rate and the download time. Figure 1 shows that the number of channels allocated for the most popular videos at any given time is more compared to the channels allocated using the BEVA approach. 40



AEVA-



- -ECW-



,



1



D



o



.



.



flNA---



ECPV-



70



Fig. 1 No. of Requests vs. Channels Allocated



Fig. 2 Video Number vs. Download Time



Figure 2 is plotted by simulating both the schemes for the same amval time and video length. It is observed that the downloading time is comparatively lower than that required for the earlier approach. This is because of the allocation of channels based on priority. Figure 3 shows the number of videos downloaded at any instant of time. We can say that the time taken by ECPV approach is lower than that of earlier technique. Figure 4 shows the blocking rate of the requested videos. The simulated results show that lower numbers of requests are rejected in ECPV approach than the other algorithm. Hence the ECPV approach is much better than BEVA approach in the operating region.



 237 BEVA



-



ECPV -*-



i



Fig. 3 Time vs. Downloaded Movies



Fig. 4 Number of Requests vs. Block Rate



Conclusions



5



This paper proposes channel allocation for Video-on-Demand systems. The proposed ECPV algorithm utilizes the channels effectively, efficiently and fairly. The time required to download the videos is lower when compared to the other algorithm. It is observed that the number of videos downloaded is much higher and the blocking rate is lower in our approach. Further work can be carried out to estimate the cost characteristics of Video-on-Demand systems.



References 1. Sridhar Ramesh, Injong Rhee, Katherin Guo, “Multicast with cache (Mcache):



2. 3. 4. 5. 6.



An Adaptive Zero-Delay Video-on demand Services”, IEEE Trans. on Circuits and Systems for Video Technology, 2001. Mounir A Tantaoui, Kein A. Hua, Tai T. Do, “Broadcatch: A Periodic Broadcast Technique for Heterogeneous VoD’, IEEE Trans. on Broadcasting, vol. 50, no. 3, pp. 289-301, Sep. 2004. Santosh Kulkarni, “Bandwidth Efficient video on Demand Algorithm”, Tenth Intl. Conf. on Telecommunications, vol. 2, pp. 1335-1342,2003, W. F. Poon, K. T. Lo, and J. Feng, “Determination of Efficient Transmission Scheme for VoD Services”, IEEE Trans. on Circuits and Systems for Video Technology, vol. 13, no. 2,pp. 188-192, Feb. 2003. D. N. Sujatha, Venugopal K. R and L. M. Patnaik, “Design Issues in Channel Allocation for VoD’, Technical Report, University Visveswaraya College of Engineering, Bangalore University, Bangalore, Dec. 2003. Jack Y.B. Lee, “Channel Folding - An algorithm to improve Efficiency of Multicast VoD Systems”, IEEE Trans. on Multimedia, vol. 7, no. 2, pp. 366378, April 2005.



 REAL TIME VIDEO AND AUDIO TRANSMITION ACROSS THE NETWORK M.M. MANJURUL ISLAM, MIR MD. JAHANGIR KABIR, MD. ROBIUR RAHMAN AND BOSHIR AHMED Computer Science and Engineering, Rajshahi University ofEngineering and Technology,Rajshah i-6204, Bangladesh. E-mail:[email protected]. sugar-31 [email protected]



Receiving and transmitting audio and video streams across the network .RTP (Real Time Protocol) is used for data transmitting over the network. This paper attempts to transmit each of the tracks within the given input media. For which RTP session is created for each media track. For that purpose, the RTP Session API instead of the DataSink API is used for flexibility. RTP format is set for each track. For video, special attention is taken to ensure that the input sizes are usable for RTP transmission. Real-time scaling is applied when necessary. Audio and video synchronizations are maintained as enough can be possible. Such that video and audio transfers in real time; there is no file retrieval latency. The video and audio result in compelling another computer in network. Real time video and audio data can be effectively served over the network with the proper transmission protocol.



1



Introduction



The aim of this experiment is to test the feasibility and usability of the recently available telecommunication technology for Real Time Videio Conference,even when they are remote fiom the site of the transmission.The systems design for entire clients and servers has concentrated on RTP retrieval and the structuring of document-based information . This is reflected in the use of the file transfer mode of RTP retrieval and the use of stream-based protocols Transmitting the media beneath phases are applied 1. Creation a Processor with a DataSource that represents the data I want to transmit. 2. Configure the Processor to output RTP-encoded data. 3. Get the output from the Processor as a Datasource



238



 239



2



Experimental Results:



We carried out several experiments over the network. Our test data set consisted of two MPEG movie files, two MP3 files and three different Capture source. Table I tabulates the test values are given below



Table 1. Test data of different types of video



Video Session



Name .



~



_



_



_



Bit Rate(avg) _



-



.



SmalLmpg



2500 kbps



Large.mpg



2200 kbps



22 f p s



Small1.mpg



2600 kbps



28 f p s



420 kbps 510 kbps ~



Large1.mpg



2240 kbps



25 f p s



450 kbps



Small.mp3



510 kbps



Large.mp3



480 kbps



Smalll.mp3



3



Conclusion



We have described the experiments of video conference on network. By using Local area network(LAN) transmitting media streams with RTP technique, it is more reliable than full file transfer paradigm using TCP technique.Our experiments conclude that it is possible to transmit good quality video over the present day Network with RTP. Out work indicates that a video-enhanced Network is indeed possible. The audio video transmission can perform also as:i) Voice chatt: Two client can chatting by transmitting video or audio from web camera or microohone each other and receiving. ii) Netmeeting: By the same process some client can meet together across network.



 Mobile Payments: Partner or Perish? Elaine Lawrence Agnieszka Zmijewska Sojen Pradhan Building 10, Jones Street, Broadway NSW 2007, Australia [email protected]



Abstract. Mobile payment hype posits that Mobile Commerce will deliver ecommerce services directly into the consumers’ hands - anytime, anywhere - using wireless technologies. However our research has indicated that in the short term at least these services will like& be low value and various mobile payment initiatives have failed or will fail. Applying mobile access to computing creates both tremendous commercial opportunities and complexity, yet will make computing globally pervasive and ubiquitous. Providing such services is fraught with problems such as legal and security issues as well as the need to track payments whilst the consumer is on the move and the mobile device may drop out. Customers prefer a choice in the channels through which they do business and the same applies to mobile payments as well. The research questions are: Is it necessary to partner to achieve success in the mobile payment area? Is it a case of partner or perish? This paper reports on three Australian Case studies to explore the current state of mobile financial transactions and illustrates the difficulties faced by both mobile payment enablers and I 0090consumers.



1. Introduction Recent research in Australia has revealed that the reality of mobile commerce is fraught with difficulties. Consumers are concerned by privacy and security issues as well as the legal implications of doing business in a mobile environment. Furthermore it is not just consumers who are worried - merchants who would hope to add an extra channel for payment of services are also troubled and recent history has not given them much cause for optimism. However wireless access is a convenient and easy way for customers to extend the availability of e-commerce services to fblfill their needs any time, anywhere. Mobile commerce makes it possible for them to order and pay for goods or services regardless of where they happen to be [3]. However mobile payments are still in an early stage of development. Although they have only been around for several years, new systems are appearing and disappearing every month. As stated by San Murugesan [7], the success of innovative mobile applications depends not just on the soundness of the technology but also on an organization’s ability to bring that technology to market and get it adopted by customers. 240



 `



This paper reports on how three mobile payment players have taken the technology and used their organization’s skills to bring their mobile payment products to market. Some have succeeded, some are struggling but all are hoping to win in the mobile space. Section 2 of this paper gives an overview of the background to mobile payment systems. Section 3 presents the methodology used in this research while Section 4 sets out the results of the case studies. Finally, section 5 provides a conclusion and an overview of -future research based on the results of this study.



2. Background to Mobile Payment Solutions A study conducted by MORI (Market and Opinion Research Int~ational), Britain’s largest independent market research agency, revealed that mobile phone users view mobile commerce as complementary to alternative Internet commerce [11. The study was done globally in six different countries, by interviewing 11,000 people across those markets. The study also showed that 90 % of them were interested in using m-commerce services and they also showed their willingness to pay extra for the convenience of making purchases with m-commerce. This study shows that convenience and control will play a pivotal role in the acceptance of m-commerce in the future. The Global head of Market Research, Nokia Networks said, It is clearporn this study that the market sees in-commerce as a natural extension of e-commerce [ 11. One of the reasons for the potential phenomenal growth of the m - c o ~ e r c is e that the mobile user offen finds wireless data to be more convenient and cost effective to use than wireless voice 1121. These could involve different applications such as mobile banking, m-broker services, mobile money transfer and mobile payments as shown in the fig& 1 below:



Traflsa~iofls(Source:1131Vashney and Ve~er,



3. Methodology Y This paper follows on fiom previous studies on mobile payment systems and their impact in the mobile enterprise undertaken by the researchers from 2003 - 2005 [S,6 , 8, 9, 10, 14, 15, 161. The paper reports on an exploratory study of three Australian



 242



companies involved in the supplying of systems capable of providing mobile payments. Sekaran [ 111 is reported as saying that exploratory studies are useful when researchers do not know much about the situation at hand. Exploratory research is conducted into an issue when there are very few earlier studies that can provide information about this issue [2]. In such research, the focus is on gaining insights and familiarity with the research area for more rigorous investigation at a later stage. Case studies are a typical technique used in exploratory research. Interviews are common in exploratory studies, as they provide the ability to identify relevant issues when little is known about the topic being investigated [2]. This research was approached in several phases. First, a literature review was undertaken to understand and draw out the critical success factors associated with mobile payments. These include the consumers’ perception and the merchants’ acceptance of the mobile payment instrument, especially as the latter bears the higher cost and risks, especially if there are difficulties in attracting partners (see Case Studies). Finally the technology is a vital enabler to achieve or improve success [4]. The literature review and subsequent analysis of the issues and synergies drawn from the literature led to the development of the following research question: Is it necessary to partner to achieve success in the mobile payment area? Is it a case ofpartner or perish? Our research has involved interviewing of staff of three mobile payment providers here in Australia. These interviews are then reported as case studies. Denzin and Lincoln [ 2 ] acknowledge that unstructured interviewing provides a greater breadth than the other (interview) types, given its qualitative nature. The research was not only seeking to understand the approach taken by developers and suppliers of mobile financial transaction devices, but also to understand some of the risks and issues associated with mobile financial transactions. They also point out that “...the goal of unstructured interviewing is understanding; it becomes paramount for the researcher to establish rapport ” [2]. A rapport engendering semi-structured interview approach was adopted to both elicit issues and risks, and constrain the interview content to the research focus. The semi- structured interviews were based and on the following subjects: b an overview of the company, b the mobile payment products and specific issues such as usability and security, their perspective on enabling technologies, b the current market situation and b experiences with potential third party partnership to market the product



4. Case Studies 4.1 CardAccess is an innovative Sydney company that offers a range of exciting mobile payment options. It recently was awarded $lmillion dollars from the Federal Government of Australia to develop Bluetooth credit card and smart card reader CASPad. This hand-held secured bank-accepted device will be used to secure the



 243



bank purchase or payment values with encryption to provide EFTPOS security to the home PC, mobile or television set-top box. Another of their offerings is Mobepay which is a mobile payment solution that enables Merchants to manually process credit card transactions via their mobile phone or PDA. The Merchant enters the cardholder's details into a secure online application. This is transmitted via GPRS or Ix in real time to the CAS payment gateway where it processes the payment, then returns the response (accepted or declined) to the Merchant's mobile device. See Figure 2 in the right hand side:



Fig. 2. Mobepay (www.cardaccess.com.au)



In early trials of the CASPad system, this company used Infrared, but discontinued as users were inconvenienced by the need to have line of sight for operational purposes. Two devices must be held accurately for the beams to pass through. The interviewee, one of the key stakeholders, emphasized that they decided to use Bluetooth instead as it was inbuilt on many mobile devices. Each card reader CASPad has a unique key and triple DBS encryption. There is end-to-end triple DBS between the CASPad and the Bank Gateway to ensure secure transport layer communication through encryption. A Bank Gateway retains Merchant product and service details of the transactions. This information is available for the Merchant and Cardholder for one year and is password protected using a unique audit number which is passed during each transaction. CASPad is a small, light battery powered, rechargeable card reader. It can be recharged from its charger or from a PC USB port or car charger. When customer cards are inserted into the CASPad, it displays on the Mobile phone into which merchants can enter information. Also it can be used in PCs or other card readers (ECR) at shops. The retail cost is AUD $300. It has the capability to read bank smart cards with EMV compliance, drivers' licenses, medicard, local RSL club membership cards and others with magnetic strips Track 1 IATA (International Air Transport Association) standard and Track 2 ABA (American Banking Association) Standard. It can also read any card systems embedded with RFID (ISO 01443 A & B), for instance, the new NSW Ministry of transport Mifare"T" card and its non-Phillips competitive standard card. The speed of transaction is the same as regular card readers. The Operating system is Simbian OS which is available on most Mobile telephones with Bluetooth for example new Nokias, Sony Ericssons and any Microsoft OS (all PDAs and mobile phones XDAs, HP). CardAccess has been developing personalized versions for Coca Cola vending machines, parking meters, and others. The cost of Bluetooth has been reduced dramatically which should help people to use this



 244



technology in near future. It also provides some reconciling functions which could be extremely usefd for chain-stores or Aggregators with huge volume of transactions per day. CardAccess Partnership with Third Parties Interviews with key stakeholders and site visits at the company revealed that their system has had excellent mpayment results. One key inhibiting factor has been establishing partnerships with the banks as CardAccess initially had a difficult time convincing banks to trust their security system. In Australia the BIG four banks, Westpac, ANZ, Commonwealth and National Australia Bank have enormous clout in the marketplace. CardAccess now works with WestPac and is close to cooperation with the Commonwealth Bank and fifth biggest bank St George. One interviewee believes that Australian mpayment providers are inhibited by the banks who sees them as potential rivals to their own financial viability. Another spokesperson said, “Australia and the United States are well behind Europe and Asia because of the lack of cooperation between mpayment providers and financial institutions”. He felt that the telecommunication providers were much easier to partner with as they did not see the mpayment providers as competitors but as partners. The consensus was that mpayment providers must partner or perish and in this case they have been able to partner with one bank and will have other banks onside shortly.



4.2 SmarQ Case Study The Sydney Company Mosaique has developed the SmarQ system for mobile payments. This system requires a GPRS enabled mobile phone, a wireless reader and wireless receipt printer. The mpayment system provided by Mosaique is very similar to the CardAccess system. Partnership between these two companies is illustrated by the fact that Smarq uses CardAccess’s backend system even though it is marketed as a totally separate brand. This system uses the following technologies: 0 0



GPRS Bluetooth



0 0



MIDP 2.0 Java EMVland2



SmarQ has simple verification process that eliminates the risk of card skimming fraud. After swiping the credit card, the last 4 digits are keyed in on mobile phone, the number must match with that of the real account or else transaction is not processed. If supported by bank CVC2lCW2 (3 digit security code printed on the back of the credit card) can also be checked for extra security. Its major advantage is its security as all customer details are triple DES encrypted and no customer details are kept on the phone when the application is closed. The actual application resides on the phone and presents a consistent user interface to the user and allows the user to follow through simple steps. Customers swipe their credit card, then enter the last 4 digits of the Credit card number, which is mandatory, and finally enter the 3 or 4 digit CID (Credit Card Identifier) number (optional). The



 245



device proceeds with the transaction if card and magnetic stripe details are a match the transaction is refused if there is no match. The receipt is printed and signed. An electronic receipt in the form of an SMS also can be sent to customer’s mobile.



SmarQ Partnership with Third Parties As reported in the previous case study, one of the key stakeholders of the company stated that banks are both opportunity and road block for them. Banks have large number of customers and, if customers or merchants ask for this mobile solution ‘SmarQ’, then the banks will utilize this system. The key stakeholder also expressed “...its greatest opportunity is in building up partnership with banks”. In an attempt to achieve this, SmarQ is running a pilot study with one of the major banks in Australia. A first attempt at partnering has already been established and it is now a matter of consumers being made aware of the products to assist companies like this to survive. Telecommunication companies are in a way competing with banks to provide this sort of service. In terms of collecting money from customers, the issue remains: should banks or telcos be the collector? Thus there is more competition between banks and telcos to provide this service and get some portion of service fee.



4.3 Case Study 3 :mHits The Canberra based company Thiri Pty Ltd developed the mHits platform to enable users to top-up pre-paid mobile recharge voucher at any time from their mobile handsets. For financial transactions, users do not need access to a credit card, but they must use a 3rd party system. &its established Paypal accounts through which the transactions take place. This platform was developed as an attempt to eliminate the telcos’ monopoly of billing processes and distribution chain for handset initiated transactions. mHits later expanded its vision to allow users to buy other products such as ringtones and games. Simple SMS messages are used. A non premium rate number for the SMS is used so that tariff (charge for ordering or sending a SMS) can be set independently of a Telco. However, during registration, users have to open a Paypal account, if they do not have one already set up. All registration information is secure and fully compliant with Australian Privacy regulations. During the registration processes an access code is sent to mobile as a SMS Usability Issues. It uses SMS as the primary interface medium. Ordering and delivering pre-paid re-charge vouchers is the logical next step in the evolution of prepaid mobile. This can be done anytime, anywhere. It is a free service. There is no additional charge for order or delivery of vouchers. Users only pay the standard retail price for the re-charge voucher ordered. Users send a SMS text message with their network ID [for example OTS (Optus), VDF (Vodafone), TLS (Telstra) and so on] to a dedicated mobile number 0428 696 448 [0428 my mhit]. Users receive a SMS text message containing the PIN for their re-charge including re-charge instructions. Users call their network re-charge number (free call) and enter their PIN to apply credit to their pre-paid account. Alternatively, recharge can be done online www.mhits.com.au. The following diagram shows how it works



 246



mHits Partnership with Third Parties mHits is the only network independent re-charge provider to service all major Australian pre-paid networks. Since &its is servicing all major network, any pre-paid mobile users can use this service. mHits partnered with a third party billing company called PAYbySNAP which works like a simple bank account. However one of the key stakeholders mentioned that it is almost impossible for small companies like this to get a bank license as a deposit taking institution. Fig. 3. mHits (www.mhits.com.au) Consequently the company had to suspend SnapAccount. They believed that banks were ready to partner with this system but as none did, in April 2005, the service was suspended despite a really intensive effort by the company to interest the banks. PAYbySnap hopes that someone in future will come to partner or buy this system. The failure of PAYbySNAP to get a partner has led to the closure of mHits.



5. Conclusion The success of mobile commerce depends on the delivery of attractive service packages that are personalised for the individual with easy to use terminals at the right price. Despite the optimistic forecasts for mpayment solutions, there remain only few examples of successful programs. Since mobile phone subscriptions continue to grow globally, it still remains a clear opportunity for micropayment market. The collapse of &its in Australia has surprised many in this industry. One main reason for the closing down of these services has been the inability to successfully partner, demonstrating that third party involvement is crucial for any mpayment system. The Australian CardAccess and SmarQ systems also demonstrate that partnering with third parties, such as banks and telcos, is vital for the success of these systems. Without such partners, mpayment providers are more likely to perish.



 247



References [ 1] Cellular. Co.Za, “Mass market ready and waiting for mobile commerce”, Consumer s t u 4 for Nokia [Online] Available:http://www.cellular.co.za/news~2001/05252001 [2] Denzin, Norman K. & Lincoln, Yvonna, Handbook of Qualitative Research, Sage Publications, 1994 [3] Eskadenia.com, “M-Commerce”, eskadenia.com, [Online] Available: http://www .eskadenia.corn/misc/M-commerce.asp [4] Hort, C. Gross, S, Fleisch, E. “Critical Success factors of Mobile payment” M- Lab - The mobile and ubiquitous computer lab http://www.m-lab.ch/pubs/WP13-abstract.pdf2005 University of St Gallen [5] Lawrence, E. Culjak, G and Injam, S., “M-Enterprise Technology: Diffusion of Innovation Awareness, Adoption and Uptake”, The Second International Conference on Mobile Business, Vienna, Austria, 23/24 June 2003, pp 15 - 26. [6] Lawrence, E. Culjak, G and Injam, S., “Bluetooth in the M-Enterprise: true blue believers?’ International Conference on Information Technology: Coding and Computing, Institute of Electrical and Electronics Engineers, Inc., Las Vegas, 2003, pp 281 - 286. [7 ] Murugesan, San, 2005, “Open Statement”, Cutter IT Journal, June 2005. [8] Pradhan, S.,Lawrence, E., Newton, S. & Das, J. “Bluetooth Potential in the M-Enterprise: A Feasibility Study”. Proceedings of the International Conference on Information Technology, ITCC 2004, Las Vegas 5 - 7 April 2004. [9] Pradhan, S.,“Mobile Commerce in Financial Services”, Proceedings of the International Business Information Management ( I B I M ) Conference 2003, Cairo, 16 - 19 Dec 2003. [lo] Pradhan, S.,“Mobile Commerce for Online Investors”, Proceedings of The International Conference on Information Technology, Prospects and Challenges in the 2 1“ century, (ITPC), Katmandu, Nepal, 23 - 26 May 2003. [I 1] Sekaran, U., Research methodsfor Business: A Skill-Building Approach, 2”dEdition, John Wiley & Sons, USA, 1992. [12] Strachey, Jane, “M-Commerce: The status quo”, Mobile Commerce World, Gemplus, 7 April, 2003. [131 Varshney, Upakar and Vetter, Ron, “Mobile Commerce: Framework, Applications and Networking Support”, Mobile Networks and Applications, Kluwer Academic Publishers, Netherlands, 2002. [I41 Zmijewska, A., Lawrence, E., Steele, R. “Classifying m-Payments - a User-centric Model”. Proceedings of the Infernational Conference on Mobile Business, New York, US, 2004. [151 Zmijewska, A., Lawrence, E., Steele, R., “Towards Understanding of Factors Influencing User Acceptance of Mobile Payment Systems”. Proceedings of the IADIS International Conference WWW/Internet, Madrid, Spain, October 2004. [16] Zmijewska, A.,Lawrence, E., Steele, R., “Towards a Successful Global Payment System in Mobile Commerce”. IADIS International Conference E-Commerce December 2004, Lisbon, Portugal, 2004 [17lhyperlinks for case studies a. http://www.cardaccess.com.au b. http://www.smarq.com.au c. http://www.mhits.com.au



 COMBADGE: A VOICE MESSAGING DEVICE FOR THE MASSES JAMES L. FRANKEL AND DANIEL BROMBERG it sub is hi Electric Research Laboratories, 201 Broadway, Cambridge, 1LL4 02139, USA E-mail: frankclkimcrl .corn, cianiclhiidaluni .iiiit. cdu While the computer and communication revolutions have had a profound impact on the ways in which we all lead our lives, this impact has not reached much of the world’s population. The Combadge’ project has created a portable device and affordable service that bring personto-person communication technology to the developing world. This system design allows even illiterate people to exchange messages utilizing easy-to-use devices that communicate over inexpensive heterogeneousnetworks.



1.Overview/Concept ilithium (see figs. 1 and 2) is a small, handheld computing device intended to be a test bed for applications that require neither a keyboard, pointing device, nor display. Combadge is a voice messaging application akin to AOL IM’, but embedded in an appliance-like device in which audio messages are recorded, transmitted, and played back. In this paper, we will use the term Combadge henceforth to refer to the Combadge application as embedded in the Dilithium device.



Figure 2. Front side of motherboard.



Figure 1. Fully-assembled.



To demonstrate the use of these devices, imagine a sample interaction using Combadge by one user, Kirk, who wants to send a voice message to another user, Spock. Kirk would push the push-to-talk button on his Combadge and utter ““New message for Spock,” then release the button. The Combadge would say, “Record your message for Spock.” Kirk would then push the button again and utter ““We should meet on the bridge in five minutes,” then release the button. After each button press, the Combadge will acknowledge with a low-pitched beep and after each button release, the Combadge will acknowledge with a higher-pitched beep. After a brief delay, Spock’s Combadge will say, “You have one new message,” the five blue LEDs located under each of the translucent left and right side push-to-talk buttons will alternately flash, and the front left green LED will repeatedly flash 248



 249 once separated by a longer pause to indicate one new unheard message. When Spock is able to listen to the message, he presses the button and says, “Play new messages,” then releases the button. Spock’s Combadge replies by playing back Kirk’s recorded message exactly as recorded by Kirk’s Combadge. This is followed by a final low oscillating tone to indicate the end of message playback. The scenario above is exemplary of the ease of use, simplicity of interface, and accessibility to a broad range of uses. A concise set of commands allows status inquiry and message navigation, as well as the ability to customize the application’s behavior. However, none of these more elaborate commands need be used by the nayve user. This interface makes the Combadge accessible to the illiterate as well as appealing to young children, the elderly, and to those who don’t like the intrusiveness of cellular phones because of the expectation of real-time interaction at indeterminate times (see section 3 below). One of our goals in the development of this system is to make current communication technology accessible and available to the less privileged and less educated people of the world3. To do so, we have created a system in which literacy and wealth are not necessary - we have reduced the cost of the device and the cost of the service over which the device communicates. To demonstrate the effectiveness of such a device, we worked with the TIER Project4 at the University of California at Berkeley to deploy Combadges in Tamil Nadu, India in collaboration with the MS Swaminathan Research Foundation (MSSFF)’ during the summer of 2005. In contrast to voice mail on cellular phones, the system we have designed is simple, optimized for voice messaging, doesn’t require any tree-like menu navigation, and uses voice input and output exclusively. Of course, our software and concepts could be implemented on a cellular phone at the additional cost of that platform. In some sense, the contribution of this project is the aggregation of many concepts into a cleanly designed and easily usable appliance. While somewhat similar to walkie-talkies, Combadge takes the paradigm of rapid exchanges between two remote parties using push-to-talk communication and extends that paradigm by transmitting digitized stored messages to named devices over long distances. 1.1



Earlier Work



Background work includes computer-telephony integration (voice mail6, Phoneshell’), speech recognition (Chatter’, SpeechActs?, standalone handheld speech recognition (VoiceNotes”), and one-way voice messaging over pager networks’ ’. The wearable computing arena (Nomadic Radio’*) has touched upon voice-enabled access to services, but not specifically for voice messaging applications. More recent work includes handheld devices requiring constant contact to a high-speed network and server-based speech recognition (SirnPh~ny’~, Vocera Communication~’~”’),PAN-based devices (G-TEK Electronics PWG-300’6), and numerous VOIP handsets.



 250



This project has extended earlier work by distributing more computation to the handheld device - voice recognition, compression, phonebook maintenance, and message caching are now performed in the device itself - allowing the Combadge to function whether connected or disconnected from the network and over networks of varying bandwidth and latency. 2



Accessibility of Communications Technology



The primary audience for Combadge is the multitudes of people who have not benefited from the digital revolution, thus resulting in the Digital Divide”. For much of the world’s population, cellular phones and the Internet are still not available or are too expensive. In many cases, a small number of shared phones are used by a large population. In our target rural village communities, many people want to communicate with relatives and friends who have moved to other villages and to cities but illiteracy makes it difficult or impossible. In constructing an interface that is usable by illiterate people, we focused on voice rather than text as the message content and command dialog. All of the prompts are spoken by the device and all commands are spoken by the user using a simple push-to-talk interface. The device requires little or no training: essentially it has a single push-to-talk button (one on the left for right-handed users and vice versa) and a simple command structure. The voice recognition software is almost completely speaker independent (see sections 3 and 6) and requires no training. In order to make communications technology accessible to the less affluent, both the device and service costs must be reduced. In most handheld devices, the three most expensive components are the radio transceiver, the processor chip, and the display. In order to reduce the cost of the Dilithium hardware platform, we have designed an interface that neither requires nor includes a display. In addition, we do not use either a keyboard or a pointing device. It is worth noting that the low prices charged for cellular phones do not reflect the cost to produce the device. Cellular phone device costs are recouped over the life of the service contract. When comparing costs, we are comparing the cost of a Dilithium device manufactured in production quantities to the cost of a cellular phone in similar quantities. Our goal of reaching the third world has required a less expensive service model. We have reduced the operating costs through several means: (1) we utilize a data packet protocol rather than a reserved channel as used by cellular voice communication, (2) we compress all messages prior to transmission, (3) we can use either 802.1 l b or GSM/GPRS for transmission. Using a data packet protocol does not guarantee real-time transmission of our messages, but this is not an issue because the application is based on queued voice messages. The data packet protocol does guarantee perfect transmission ( i e . , data integrity), a feature absent from cellular phones. The compression we perform need not be accomplished in real-time that is, we can employ better compression at the cost of delaying the transmission.



 251 This compressed message utilizes less transmission time (Le., less radio spectrum and less power), leaving more spectrum for other users and prolonging battery life. This allows cells to either be larger in size to encompass more devices or allows a higher density of devices per cell. When a local area 802.11b network is available, the Combadge will utilize that network, which is usually free or available at lower cost. Compared to the cellular network, the local area network has higher bandwidth, allows unmetered communication, has lower latency, requires lower transmit power and, therefore, requires less battery power for communication thereby resulting in longer battery life. One important case in which the network would be free is when an 802.1 1b network is installed for Combadge communication. 802.11b connectivity may also be provided by long distance networks (see section 4.1).



3



Design Issues and Usability



Interaction with the Combadge is structured around the principal method that humans use for communication - speech. This design point results in a device that is approachable and not intimidating. A multi-threaded implementation allows a natural way for the user to control the interaction. By pushing the push-to-talk button and speaking the next input expected by the Combadge, a user can barge in to jump past any phrase currently being spoken. If the push-to-talk button is pressed just for a moment, the current transaction will be canceled. A user might barge in to truncate a familiar long prompt or to cancel sending a message. Each Combadge supports a phonebook of associations between a familiar nickname and the identity of a Combadge (or, potentially, a grouped list of Combadges). The nickname is called a nametag and is spoken by the user. In addition to running a speaker-independent speech recognition system, each Combadge also runs a speaker-dependent speech recognition system that is used to detect nametags. As is the case with voice mail and e-mail, the Combadge presents an interface that encourages messages to be answered when appropriate for the recipient rather than when demanded by the sender. This non-interrupting model is often the most suitable mode of interaction. Much care was invested in the user interface to ensure that the Combadge would act consistently in the presence or absence of a network connection, except for having the capability to transmit or receive messages. All speech recognition is performed within the Combadge and, therefore, dead spots in a network are invisible to a user. Whenever there are new messages and the recipient Combadge is connected to a network, those messages are automatically pushed to the Combadge. Even messages that have already been heard are kept cached within the Combadge unless the memory they use is needed for other incoming or outgoing messages. In addition, new outgoing messages are stored in the Combadge until the Combadge is able to send them to the server. Connectivity status is visible when the user requests to hear a message that is not currently in the Combadge’s message cache. In



 252



this case, the user is told, “that message is unavailable.” Utility commands also indicate the connectivity state by giving the count of unsent messages (displayed on an LED, too) and the success percentage of pinging the server. If the quality of the connection to a network is poor, the degraded communication channel will increase the time it takes to send or receive messages, but eventually the exact original message will be transmitted. In addition to not requiring a continuous connection to our local network, our system does not require a continuous connection to the Internet either. An Internet connection (or at least a connection to a bridgehouter) is only needed when the destination of a message is on a different local area network. This behavior is ideal for a DakNetI8-like store-and-forward transmission system in the third world in which motorcycles or buses sporadically connect to an 802.11b network and drop off and pick up messages. In addition, several en ti tie^'^*'^^'^ are investigating long-range 802.1 1 spin-off technologies that would also be suitable networks to use with Combadge. In principle, our design can lead to increased battery life. As noted above, the Combadge transmits for a shorter time because we can perform better (non-realtime) compression. Also, by sequentially performing audio input, then compression, then transmission, the Combadge reduces its peak power demands. The lack of a display and its backlight also lead to lower power consumption. Unlike the multitude of voice messaging systems accessible through the PSTN (Public Switched Telephone Network), the Combadge presents a single consistent way to send and receive voice messages to and from any system. Interfaces to voice mail, e-mail, PSTN and other systems (fax, pagers, etc.) are all possible. 4



Description of the Dilithium Device and Operating System



The Dilithium platform is based on the Intel StrongARM processor. It contains 64 Mbytes each of SDRAM and Flash memory. There is an integrated GSM/GPRS modem and on-board SIM for wide-area networking. An optional daughterboard provides one or two Compact Flash (CF) slots for 802.1 l b local-area networking, additional Flash memory, hardwired Ethernet, or other uses. Two on-board silicon MEMS microphones are used for audio input and can provide active noise cancellation. A relatively wide dynamic-range speaker is used for audio output. The audio CODEC supports many standard sampling rates. The many LEDs on the device offer both visual feedback and visual appeal. The front central LED may be used for input in addition to its usual fhction as an output”. Dilithium includes a two-axis accelerometer that can be used to determine the device’s orientation and movement. This can be used to detect and utilize gestures to control an application. A vibrator is on the motherboard to allow silent operation as necessary. For connection with the outside world, a JTAG connector is provided to initialize the device, a USB connector is provided to allow communication with other USB devices and to recharge Combadge’s battery, extra serial ports are pro-



 253



vided to function as a console terminal port and to communicate with other serial devices, two 2.5mm stereo micro phone jacks are available for stereo audio input and output to supplant the built-in microphones and speaker. Finally, there are two push-to-talk momentary pushbuttons on the left and right sides, a power-on button on the top edge, and a reset button recessed on the bottom edge. The Dilithium hardware required a new boot loader to initialize and test the CPU, Flash and RAM memory, and the peripherals and then to appropriately download and pass parameters to the operating system. Because the Dilithium device is part of a research project with diverse, changing, and expanding goals and is being used by other projects, Linux was chosen as the base operating system. This decision leverages all existing Linux software (including networking code) and all pre-compiled binaries for our CPU architecture. The starting point for the port was Familiar Linux 2.4.1923. Once the port was completed, much of Linux was immediately available. Code development used an ARM cross-compiler and remote connection tools such as “ssh.” Linux has provided a multi-language environment in which the Combadge system has been written. C was used for Linux and the Linux device drivers. C++ was used for the speech library. Python was used for much of the application layer. Shell scripts were used for 0s interfacing and utility operations. 5



Low-level Combadge Addressing



Each Combadge has a unique address associated with it. The address is the same no matter how the Combadge happens to be connected to the Internet or to other Combadges. Because there are several possible networks through which a Combadge may be connected (802.11b (WiFi), GSWGPRS, Ethernet, Ethernet over USB, etc.) and because of a desire to enable a Combadge to initially determine its own address without requiring an administrator, a Combadge address is determined based on which hardware is present, Once an address has been determined, the Combadge stores that address in an “identity” file in the Flash memory file system.



6



The Combadge Audio Interface



All Combadge commands are non-modal and expressed in a single utterance ( i e . , they don’t require navigation through tree-like menus or successive prompting) except for phonebook management. The current audio interface implementation accepts and emits either English or Tamil (selectable via an audio command). The subset of commands that a nafve user might utilize are “New message for ,” “Play new messages,” “Play next,” “Play previous,” and “Reply.” A much richer set of commands exists for the advanced user, grouped by voice message sending and receiving, phonebook management, and utility functions.



 254 Speech recognition is performed via a uniform interface named RISTZ4for Reduced Instruction Speech Toolkit. This interface has been implemented to support many underlying speech recognition engines. We are currently using Embedded Speechworks SDK” for English recognition and SPHINX-IIZ6(with substantial modifications to remove floating-point operations) for Tamil. Any command that may modify the phonebook or the device configuration requires confirmation. The information being confirmed is always repeated back to the user as part of the confirmation request. Command cancellation is always one option given to the user. 6.I



Multi-threaded approach



The Combadge application supports a number of independent activities. In order to eliminate timing dependencies and to simplify the code for these activities, a multithreaded approach was taken. The user interaction thread implements the command set and blocks only on input from the user. Separate threads deal with long-lived or blocking actions such as speech recognition, message transport, and time-out processing. This design guarantees responsiveness to the user by always being ready to accept a user command. Multi-threading allowed the implementation of a “bargein” capability. During the execution of a command, a user may press one of the push-to-talk buttons and invoke a new command or skip directly to the next step of a multi-step command (as would be the case for “New message for ” or “Create contact”), aborting any partially uttered speech output from the previous command. In the degenerate case in which one of the push-to-talk buttons is pressed, but no new command is uttered by the user, any partially uttered speech output from the previous command is aborted and no new command is run. In this degenerate case, if the user is within a multi-step command, the command is cancelled. Similarly, if a user is at an input stage within a multi-step command, the expiration of a timer in a timer thread will cause cancellation of the command.



7



Message Storage and Transport



Each voice message is eventually packaged inside an e-mail message and sent using standard SMTP. Stable storage of all messages ever sent to a Combadge is accomplished using a standard IMAP e-mail system. Each Combadge uses its RAM as a cache for messages. The Combadge can store three classes of messages (listed from most to least important): (1) new outgoing messages dictated by the user but not yet sent to the server, (2) new incoming messages that have not yet been heard, and (3) incoming messages that have been heard but are stored in the Combadge’s message cache. Caching messages permits more full-functioned operation when a Combadge is not connected to a network. If the cache becomes full, lower priority class messages are discarded in favor of higher priority class messages.



 255



The current Combadge uses our own connection-based protocol to send messages from a Combadge to a server and to receive messages from a server. A specified server runs a Combadge Daemon program, “cbd.” The “cbd” is responsible for communicating with the Combadge. Figure 3 outlines the network infrastructure. PSTN



Combadge over WiFi



Wireless



cbd



PSTN



SMTP



Voicernail



A



GPRS



Cornbadge over GPRS



Figure 3. Network infrastructure.



Communication is involved in at least four distinct stages: (1) when a Combadge is powered-on or reconnects after losing contact with the “cbd” server in order to authenticate itself to the server and to participate in an initializing dialog, (2) when a nametag is added to the phonebook in order to validate the Combadge address of the added device, (3) when a voice message is uploaded to the server, and (4) when a voice message is downloaded to the Combadge. 8



Summary



In this paper, we have presented the rationale and design for a voice-messaging device. The device has been fully operational for over a year with a prototype software implementation operational for over two years. The implementation process involved creating new hardware, writing a new boot loader, porting Linux to the device, creating protocols and services, and developing a new application. We have proven the ability to create a useful device that communicates solely through speech. Combadge has been designed to provide handheld communication technology for the developing world and for users who have not currently accepted such technology by providing an implementation adapted to their needs. The result is a simple interface built on a device with low manufacture and service costs. Our future case studies will test the potential deployments. 9



Acknowledgements



The authors would like to thank David Anderson, Ryan Bardsley, Paul Beardsley, Bret Harsham, Mark Haslett and co-workers at Americad, Joe Marks, Barry



 256



Perlman, Divya Ramachandran, Bent Schmidt-Nielsen, Bruce Stetson and the M.A.D.S. staff, and many other Combadge beta testers for their feedback. References www.merl.com/proiects/ComBadae www.aim.com “Tech’s Future,” Business Week, September 27,2004 htto://tier.cs.berkelev.edu www.mssrf.orcl The 1984 Olympic Message System: A Test of Behavioral Principles of System Design, John D. Gould, et. al., Communications of the ACM, Volume 30, Number 9, Sept. 1987 Multimedia Nomadic Services on Today‘s Hardware, Chris Schmandt, IEEE Network, September/October 1994 Chatter: A Conversational Learning Speech Interface, Eric Ly and Chris Schmandt, AAAI ‘94 Spring Symposium on Multi-Media Multi-Modal Systems, March 1994 Talking vs. Taking: Speech Access to Remote Computers, Nicole Yankelovich, CHI ‘94 Conference Companion, ACM Conf. on Human Factors in Computing Systems, April 1994 l o VoiceNotes: A Speech Interface for a Hand-Held Voice Notetaker, Lisa J. Stifelman, Barry Arons, Chris Schmandt, Eric A. Hulteen, Proceedings of INTERCHI, April 1993 ’ I Voice-Over-FLEX: Value Creation for Paging Camers and Subscribers, H. B. Choi and Henry Law, IIC Taipei Conference Proceedings, June 1999 l2 Speaking and Listening on the Run: Design for Wearable Audio Computing, Nitin Sawhney and Chris Schmandt, Proceedings of ISWC’98, International Symposium on Wearable Computing, October 1998 l 3 SimPhony: a voice communication tool for distributed workgroups, Vidya Lakshmipathy and Chris Schmandt, Video Procs. and Procs. Supplement of UbiComp 2004, Sept. 2004 l 4 “Science fiction? Not any more,” The Economist Technology Quarterly, Sept. 18,2004 www.vocera.com l6 www.gtek.com.tw l 7 Technology and development: The real digital divide, The Economist, March 12,2005 “DakNet: Rethinking Connectivity in Developing Nations,” IEEE Computer Outlook, Jan. 2004 (www.firstmilesolutions.com/DakNet IEEE Computer.pdf) l9 5G Wireless Communications -( 2o Efficient MAC Protocol for long distance 802. I 1 links, Sergiu Nedevschi and Rabin Patra (http://tier.cs.berkelev.edu/docs/proiects/wirelessmac.htm1) 2 1 First Mile Solutions, LLC (www.firstinilcsolutioiis.com) 22 Very Low-Cost Sensing and Communication Using Bidirectional LEDs, P. H. Dietz, W. S. Yerazunis, D. L. Leigh, Intl. Conference on Ubiquitous Computing (UbiComp), Oct. 2003 23 fainiliar.handheIds~org 24 RIST is a simulified sDeech recornition interface implemented at Mitsubishi Electric Research Laboratohes (MERL), Cambridge, MA, USA 25 Speechworks now part of ScanSoft, Inc. (www.scansoft.com/specchworks) 26 The SPHINX-I1 Speech Recognition System: An Overview, Xuedong Huang, et. al., Computer Speech and Language, 7(2), pp. 137-148, 1992 ~



 MOBILE AGENTS AIDED ROUTING IN MOBILE AD HOC NETWORKS SHEKHAR H.M.P’. AND K.S. RAMANATHA Wireless Networking Laboratory, Department of Computer Science & Engineering, M S Ramaiah Institute of Technology, MSRIT Post, MSRnagar,Bangalore-560054,INDIA E-mail: {shekharhmp, ksramantha}@msrit. edu, Unicast routing protocols proposed for Mobile Ad hoc Networks have certain limitations. These protocols try to discover the route on-demand by flooding route request messages, which results in increased end-to-end latency and also consumes considerable amount of network bandwidth. These limitations make protocols not suitable for real-time multimedia communication. This paper proposes a Mobile Agents aided Routing Protocol (MAR€’),for routing in Mobile Ad Hoc Networks. Mobile agents collect network connectivity information and disperse this information across the network. This network connectivity information forms the “ready routes” that can be used during route discovery and route maintenance phases. Thus, MARP avoids the expensive route discovery/maintenance process. MARP uses Ad Hoc Ondemand Distance Vector routing (AODV) as the underlying protocol. MARP and AODV protocols are compared through extensive simulations. Network throughput, end-to-end delay and routing overhead are the performance parameters considered. Results show that the ready routes provided by MARP, when compared to traditional on-demand routing protocols such as AODV, results in substantial reduction in end-to-end delay and improvement in network throughput.



1



Introduction



The route discovery process of on-demand routing protocols of Mobile ad Hoc Networks involves flooding of control packets throughout the network. Since the bandwidth is limited in Mobile Ad Hoc Networks this process is expensive and results in higher end-to-end delay. Also, they use the re-routing strategy for route maintenance. Route maintenance is done when a link along a route breaks due to mobility, congestion or other reasons. The re-routing strategy involves doing route discovery for a new route by an intermediate node or by the source. In either case route discovery floods the network with control packets that adds to the congestion in the network. Further the dynamic nature of Mobile Ad Hoc Networks causes the links to break too ofien. This means the expensive route maintenance process is called repeatedly resulting in increased end-to-end delay, higher routing overhead and degradation in throughput. These limitations make the existing protocols prone to congestion and not suitable for multimedia communication [ 11. Intelligent mobile agent technology has been used as a solution for several networking problems such as routing, security etc. [2,3,4]. Mobile agents can be described as packets that move ‘Shekhar H M P is a PhD research scholar at Birla Institute of Technology and Science, Pilani, Rajasthan - 333031, INDIA.



257



 258



around the network assistingldoing useful work to enhancelmonitor the performance of the network by making intelligent decisions. The advantages of using mobile agents are that they are highly distributed, cooperative, and flexible enough to adapt to continuous and unpredictable changes. In this paper, we propose a Mobile Agent aided Routing Protocol to overcome the limitations of the current on-demand routing protocols proposed for Mobile Ad Hoc Networks [6]. 2



Mobile Agents Aided Routing Protocol



Agent Creation and Termination At the start of the network no connectivity information is available at the nodes. So each node creates INIT-AGENT-CNT number of agents. This is expected to provide the connectivity information at each node at the beginning of the network. The agents do not have an explicit time to live. Instead they are expected to serve the network as long as the network exists. But the agents will inevitably get dropped because of link breakages and wireless losses. As a result MARP needs to regulate the number of agents in the network. A fiequency-based approach is adopted. At each node the number of agents visiting it in the past =-INTERVAL is measured. If the number of agents exceeds the maximum threshold of MAX-NUMBERAGENTS then the next agent to visit this node is processed and deleted. However if the number of agents to visit this node within the past =-INTERVAL drops below the minimum threshold of MIN-NUMBER-AGENTS then a new agent is created at this node. Thus agent population is regulated. Mobile Agent Migration Agent migrates fiom one node in the network to another on a random basis. A “no return” principle is used to avoid loops and thereby gather useful connectivity information. Mobile Agent Packet Format The packet format for the mobile agent is given below:



I



Type



I



history list



I



type :typeof packet. history list : contains the node address. Each node forwarding this packet will append its address to the list. Figure 1. Mobile Agent Packet Structure.



The type field identifies the packet to be an agent. The history list is the list of nodes visited by this agent recently. When the agent is created the history list is NULL i.e. it has no entries. The history list can hold up to MAX-HISTORY number of node addresses. When the (MAX-HISTORY + 1) th entry is to be made into the history list, the first entry is discarded.



 259



Alternate Route Table To understand the updates made by agents at each node a description of the alternate route table is needed. At each node in the network an Alternate Route Table is maintained. The alternate routes brought by the agents are updated into this table as described in the next section, mobile agent execution. The routing protocol searches the alternate route table when it requires new routes to destinations or an alternate route to reroute the packets during link breakages. The alternate route table format is given in Figure 2. |



DestID



|



PrevHop



|



Expire



|



Status



|



DestID : destination address. PrevHop : previous hop information to DestID Expire : value which specifies when this route expires. Status : route has expired (Status=0) or route is valid (Status = 1) Figure 2. Alternate Route Table.



To retrieve the route to a destination, say X, the alternate route table is looked up for X in the DestID field. The previous hop of X is copied onto a list. With the previous hop of X as the destination, a lookup for its PrevHop is done. This process is repeated till the PrevHop is equal to the current node's address. The reverse of the entries in the list of PrevHops gives the desired route to the destination X. Mobile Agent Execution The updates done by the agent when it arrives at a node is described in this section. The agent updates the alternate route table with the ready routes (connectivity information) available in its history list. For each entry in the history list a lookup of the alternate route table is done. If the entry doesn't exist a new entry is made or else the entry is updated. While adding a new entry or updating an old entry the "expire field" is also updated to (CURRENT TIME + RTIMEOUT), where RTIMEOUT represents the time for which the entry remains valid. The "status field" is also made VALID. 3



Simulation Results



3.1



Simulation Model



The MARP and AODV protocols have been simulated in NS-2 (ns-2.28) network simulator. The considerations made for the simulation environment and protocol settings are shown in Table I and II. Table 1. Simulation Environment. Network Space: Simulation Time: Number of Nodes:



1500X300 meters 900 seconds



50



 260



Table 2. Protocol Settings INIT AGENT CNT: X INTERVAL MAX NUMBER AGENTS MIN NUMBER AGENTS MAX HISTORY



3.2



2 5 seconds 20 5 10



Results



The performance analysis of MARP in comparison to AODV was carried out using the following metrics: Average End-to-End Delay, Average Network Throughput and Routing Overhead. MARP and AODV protocols are compared by varying parameters such as mobility and network load. Average End-to-End Delay The average end-to-end delay is defined as the average amount of time taken by a packet to move fiom the source to the intended destination. It includes delay due to route discovery, buffering at the interface queue, retransmissions and propagation. MARP show a considerable decrease in end-to-end delay when compared with AODV protocol. This is due to the fact that in MARP ‘ready routes’ would be made available by the mobile agents moving around the network. This is substantiated in Fig. 3 and 6 for CBR traffic. As mobility is increased we observe an increase in delay as seen in Fig 3. But, MARP shows reduced delay relative to AODV. Figure 6 show that MARP has lesser delay than AODV as the load on the network is increased. Thus, MARP is more scalable than AODV. Average Network Throughput Average network throughput is defined as the average of the actual amount of data that is moved fiom the sources to the destinations per second. This does not include the control overhead of the protocol. In Fig. 4, we observe the throughput decrease as the network tends towards dynamic state i.e. as mobility is increased. Here, MARP gives better throughput that AODV. As the load on the network is increased we observe the throughput reach a peak and fall as seen in Fig. 7 and MARP performs better than AODV.



 261



"i ,/'



.... ,.I...



........................



I



,." *l'



.'. . . . . . . .



&...



........



_L_J



~



*; *;.



. . . . . . .



_I



. . . . .



M*Y



Figure 3 , 4 & 5. End-to-End Delay Vs Mobility, Throughput Vs Mobility, and Routing Overhead Vs Mobility.



....



.........................



\\



/:



*



ir



"Yu-s(I-



.m



~i



I=



:



.i1



Figure 6 , 7 & 8. End-to-End Delay Vs Rate, Throughput Vs Rate, and Routing Overhead Vs Rate.



Routing Overhead The Routing Overhead is the ratio of the total number of control packets generated to the total number of packets generated (control packets + data packets). Routing overhead is found to be much lesser in case of MARP because the expensive route discovery mechanism is bypassed. The addition of Mobile Agents is an overhead but the reduction in AODV's control packets overcompensates for this overhead and net result is a reduction in the routing overhead in most cases. In Fig 5 and 8, it can be clearly seen that MAFU' follows AODV but is more efficient that AODV in most cases. As mobility is increased, as in Fig 5, the network becomes more dynamic with more link breakages and hence more control packets are needed to maintain the network topology. MARP demonstrates much better performance than AODV because AODV repeatedly floods its route request packets to maintain network connectivity while MARP uses its ready routes. 4



Conclusions



The current routing protocols like AODV, DSR etc. use flooding mechanism for route discovery and maintenance which is expensive and these results in higher end-to-end delay and lower throughput. We have proposed a mobile agent based approach, MARP, to overcome these problems and make the routing protocol more



 262



efficient. MARP demonstrates reduced end-to-end delay and higher throughput which makes MARP suitable for multimedia communication. As a future work we intend to make MARP congestion aware.



5



Acknowledgements



This work was supported by a grant fiom AICTE, New Delhi under R&D grants Ref: 8020/R&D/R&d-48/2001-2002.



References 1. Elizabeth M Royer and Chai-Keong Toh, “A Review of Current Routing Protocols for Ad Hoc Mobile Wireless Networks”, IEEE Personal Communications, Apr. 2003, pp. 46- 55. 2. H. Matsuo and K. Mori, “Accelerated Ants Routing in Dynamic Networks”, Proc. of Intl. Conf. on Software Engineering, Artificial Intelligence, Networking and ParalleUDistributed Computing, Aug. 2001, pp 333-339. 3. N. Minar, K. H. Kramer and P. Maes, “Cooperating Mobile Agents for Dynamic Network Routing”, Software Agents for Future Communication Systems, Chapter 12, Springer Verlag, 1999. 4. R. R. Choudhary, S. Bhandhopadhyay and K. Paul, “A Distributed Mechanism for topology discovery in Ad hoc Wireless Networks Using Mobile Agents,” Proc. of Mobicom, Aug. 2000, pp. 145-146. 5. C. E. Perkins, “Ad Hoc On Demand Distance Vector (AODV) Routing,” IETF Internet Draft, RFC 3561 available at: http://ietf org/html.charters/manet-charter. html. 6. Shekhar H M P, and K S Ramanatha, “Agent Based Routing in Mobile Ad Hoc Networks”, Technical Report, Wireless Networking Laboratory, Department of Computer Science & Engineering, M S Ramaiah Institute of Technology, India, 2005.



 SERVICE DISCOVERY IN A GRID OR UBIQUITOUS COMPUTING ENVIRONMENT



KOUSHIK SINHA Honeywell Technology Solutions Lab Bangalore, India 560076 Email: s i n h a h u @yahoo.com We present two efficient, deterministic, distributed and scalable service discovery algorithms that are suitable for both grid and ubiquitous computing environments. Our first service discovery algorithm assumes the presence of collision detection capabilities at each of the nodes or devices and runs in O(n log n) time in the worst case, n being the number of nodes. The second algorithm does not assume collision detection capabilities of the devices. It has a worst case run time of O ( n D )time, D being the diameter of the network.



1. Introduction Grid computing has recently emerged as an important field, distinguished from conventional distributed computing by its focus on large-scale resource sharing and orientation towards high-performance computing [4].Resource sharing in a grid is highly controlled, with resource providers and clients defining clearly what is shared in a networked environment. As the grid is a service-oriented architecture, a service discovery protocol for the grid needs to be efficient in locating services within the grid [ 1,571. We propose in this paper two efficient deterministic, distributed service discovery protocols, suitable for both grid and ubicomp environment. The first algorithm assumes collision detection capability of the nodes and runs in O ( nlog n ) time, n being the number of nodes. The second algorithm does not assume such collision detection ability of the nodes and requires O (n D)time in the worst case, where D is the diameter of the network graph. We have also attempted to address the issues of scalability, robustness, simplicity and to some extent locality- awareness in both of the service discovery algorithms.



2. Proposed DSD Algorithm We present the first of our proposed algorithms, the deterministic service discovery (DSD) algorithm. The system model for the algorithm is as follows : 263



 264



2.1. System Model We assume a network of n nodes. Communication is through either broadcast over a single radio channel or through wired links. All communication through wirelesdwirelined links are synchronized in time and starts at fixed time slot boundaries. Each node is assigned a unique identifier from the set { 1 , 2 , . . . , n}. If the exact value of n is not known, then we can apply the technique described in [3] to find an upper bound on the node id within a factor of 2. For the sake of simplicity of understanding, we assume the number of nodes n is known in our discussion. Initially the nodes do not possess any information about the network topology. The service discovery initiator is distinguished as the source node or the root and broadcasts a query message M . In every node and the message M , there are two important fields pertaining to the queried service : 1) snume : A generic name of the requested service, e.g., printer, scanner etc., and 2) sdesc : A description of the requested service, in the form of some metadata. A match on either or both the fields generates a service found response. The query message M contains another important field - hopdistance, which can be set by the source node to search for a service only within a certain locality of its current location. By default, the hopdistance is set to some negative number, indicating a desire to search the entire network. If a non-zero positive value is specified, then with each retransmission of the message, the hopdistance is reduced by 1 until it reaches 0. When it reaches zero, the query message is no longer retransmitted.



1



source-node



1



messageid



1



s-name



I



s-desc



1



hop-distance



I



Figure 1 . Structure of service discovery query message



A typical message A4 would look like as shown in Figure 1. It is assumed for the algorithm DSD that the mobile terminals possess collision detection capabilities. Throughout our discussions, we will use the terms node id and address interchangeably to identify a particular node.



2.2. Protocol Description We first give some definitions along with the relevant functions and control signals used in our later discussions. Definition 2.1. A node is said to be eligible if it is yet to transmit M . Definition 2.2. For a given node i, we say an adjacent node j is privileged if it is an eligible neighbor of i, and it has the largest id among all neighbors of node i.



Definition 2.3. We say node i is a predecessor of node j if j is the privileged neighbor of node i.



 265 The algorithm uses the functions $ndprivilegednode, isneighborpresent and transfer-control, which are the same as those described in [2]. Due to lack of space we omit the description of these functions. send-queryresponse(j, a, query-f lag, servicing-node) : If j receives M from a node i with the request for some specific service, and if j offers that particular service, then j transmits a send-query-response with the query- f Zag set to 1 and servicing-node set to j. Also, j retransmits M if it receives M from a privileged neighbor (say u)with query-flag set to 1. If j does not offer the requested service and it has no eligible neighbor who offers it, j sends a send-query-response signal to node i with the query-f lag field set to 0. The proposed deterministic service discovery algorithm is presented below : Algorithm DSD /* Initiation of transmission of M by root */ var root : integer; I* initiator of service query */ begin searchservice(0, root, root, M ) ; I* The first parameter in the above call to searchservice protocol refers to */ /* the predecessor of the third parameter 0 indicates that root has no predecessor */ end. Each node i, 1 5 i 2 n, executes the following procedure. Procedure searchservice(pred, root, i, M ) var neighbor-present : boolean; var j, query- f lag,servicing-f lag : integer; begin query-flag = 0; if (received message M ) then if (M.s-name = i.s-name) or (Ms-desc = is-desc) then /* Service found at node i */ set query- f lag = 1; set servicing-node = i; transmit send-queryxesponse(i,pred, 1, i);exit; /* end - found service */ I* otherwise continue searching network */ if (M.hop-distance = 0) then transmit send-queryxesponse(i,pred, 0, i);exit; I* nothing found within locality *I M.hopdistance = M.hopdistance - 1;transmit message M; neighborqresent = false; do if (ismeighbor-present(i)= true) then j +- find-privilegedmode(i); Zj > 0 then neighbor-present = true; transfer-control(i, j); wait until j signals that it has no more eligible neighbors or wait for a send-queryxesponse message or wait until a time-out occurs if (send-queryxesponse msg from j) then if (received query- flag = 1)then servicing-node = received servicing-node;



 266



send-query_response(i,pred, 1, servicing-node); else neighbor-present = false; while (neighbor-present = true); if (i # root) then send-query_response(i,pred,query- f lag, servicing-node); else end broadcast of M ; I* all nodes have been queried *I end.



Lemma 2.1. The construction of a DFS tree by the algorithm DSD requires 2n [log nl - 2n -t2 transmission slots. Proof omitted for the sake of brevity.



0



Theorem 2.1. Algorithm DSD requires O(n1og n) time to query a network of n nodes for a service. Proof: Follows directly from lemma 2.1 and the additional slots required for the query message and the control signals. 0 3. Algorithm clique-basedservice-discovery This proposed service discovery algorithm is especially suitable for scenarios where a backbone network (wired or wireless) exists, while other devices communicate wirelessly with the interconnected access points of this backbone network. The grid network can then be perceived as a set of cliques that are connected to each other by some intermediate nodes as depicted in Figure 2.



Figure 2. A typical grid or ubicomp network



In Figure 2, we call the nodes 5 , 7 , 9 and 11 the bridging nodes. The bridging nodes are those which connect at least two nodes that are not directly connected to each other by an edge. The subgraph formed of the bridging nodes is called the backbone network. The nodes 1,3 and 5 form a clique. 3.1. Algorithm clique-based-service-discover3 The algorithm consists of the following steps to be executed by each node i of the network. We assume that time is slotted and all communications are initiated at slot boundaries. The ithtime slot in a round of n slots is reserved for the ithnode. Step 1 : Broadcast the own id and also its resource information in the form of a < s-name, s-desc > tuple to all neighbors in the designated time slot i .



 267



Step 2 : If message received from neighbor j in j t h time slot, then update neighbor-list of node i with the information. from the neighbor j. Step 3 : Find the maximal clique C to which the node i belongs. Step 4 : Find the neighbor set Bi which are not members of C. Step 5: If Bi # 0, then set a local variable bridge-nodei to 1, and send the value of this variable to all neighbors of i. bridge-nodei is zero otherwise. Step 6 : If i is a bridging node then exchange information with every other neighboring bridging node about available resources at the nodes of their respective cliques. Lemma 3.1. Algorithm clique-based-service-discovery requires O ( n D ) time slots to complete service discovery in a grid network, where D is the diameter of the network. 0 Proof omitted for the sake of brevity. 4. Conclusion We have proposed two new distributed, scalable service discovery algorithms for ubiquitous and grid computing environments. The first service discovery algorithm runs in O(n1og n ) time and is highly scalable, robust and resilient to mobility of the users or devices. The second service discovery runs in O ( n D )time, and does not assume any collision detection ability of the nodes. It uses local directories to answer service request queries, and hence may be able to answer service queries faster on an average than algorithm D S D . Future work may include comparing the performances of the proposed algorithms in a grid or ubiquitous computing environment with existing protocols like UPnP, SLP, HaVi, Jini etc.



References 1. C. Bettstetter and C. Renner, “A comparison of service discovery protocols and implementation of service location protocol,” Proc. of EUNICE 2000 (2000). 2. K. Sinha and P. K. Srimani, “Broadcast and gossiping algorithms for mobile ad hoc networks based on breadth-first traversal,” Roc. 6th htl. Workshop on Distributed Computing (IWDC),LNCS 3326, Springer, pp. 459-470 (2004). 3. M. Chrobak, L. Gasieniec and W. Rytter, “Fast broadcasting and gossiping in radio networks,” Proc. 41st IEEE FOCS’2000), pp. 575-581 (2000). 4. I. Foster, “What is the grid ? A three point checklist,” Tech. rep., Argonne National Laboratory and University of Chicago (2002). 5. A. Friday, N. Davies and E. Catterall, “Supporting service discovery, querying and interaction in ubiquitous computing environments,”MobiDE 2001, pp. 7-13 (2001). 6. E. Guttman, “Service location protocol : automatic discovery of IP network services,” IEEE Internet Computing, 3(4), pp. 71-80 (1999). 7. H. Lican, W. Zhaohui and P. Yunhe, “A scalable and effective architecture for grid services discovery,” Proc. of SemPGRID, Hungary (2003).



 GENETIC ALGORITHM BASED ROUTING FOR MULTICAST COMMUNICATION IN MOBILE NETWORKS



’,



’,



’,



C. MALA R. SHRIRAM SHASHANK AGARWAL S. SELVAKUMAR Department of Computer Science and Engineering, National Institute of Technology, Tiruchirappalli - 620 015, Tamil Nadu, India E-mail: I [email protected], [email protected], [email protected],[email protected] The advent of Mobile cellular networks has enabled people in any part of the world to communicate with each other any time. In recent years, there is a growing need for the mobile users to participate in real time applications, viz., video conferencing, multiparty games, online auctions, access to distributed databases, etc. All these applications require a Multicast Tree (MT) to be constructed among the group members with the source of multicast being the root of the MT. Traditional methods used in a wired network to construct a MT take into account only the distance or delay between the nodes. These methods fail because of the inherent dynamism in a mobile network. To overcome this problem, and to improve the QoS, a novel genetic algorithm (GA) based approach to construct an Optimal Multicast Tree (OMT) with the constraints, viz., probability of delay over a path, queuing delay at a node, and residual bandwidth of a link is proposed in this paper. Keywords: Mobile network, Genetic algorithm, Optimal Multicast Tree, constraints.



1



Introduction



Due to the nomadic lifestyle of the mobile users in recent years, there is a demand to make use of the mobile networks not only for communication but also for participation in real time applications, viz., multi group conferencing, multi party games, online auctions, access to distributed databases, etc. [ l , 61. All these applications need the construction of a MT rooted at the source of multicast. As the traditional methods [2, 4, 111 of constructing a MT are for a fixed wired network, these methods fail for a highly dynamic Mobile network [ l , 3, 81. So, in this paper, a GA based approach [12] to construct an OMT with three different constraints viz., probability of delay over a path, queuing delay at a node, and residual bandwidth of a link to improve the QoS of Multicast [2, 3, 4, 71 in a Mobile environment is proposed. 2



Proposed approach



Of the three basic techniques used for construction of MT[lO], MT with Rendezvous Points (RP) eliminates flooding present in Source based routing, enables to conduct multi group conferencing comprising of multiple senders and 268



 269



Figurel. A mobile user network with multicast groups A and B ••Multicast tree links of Group A Wireless link between router & user receivers without the computational overhead in Steiner trees [3] and is more suitable for the dynamic nature of group members. 2.1



Proposed Network



To implement Multicast communication in a Cellular network [13], a graph G = (N, E) representing a Cellular network of N Base stations and E links is considered. The RP located at each base station functions as a Rendezvous Server (RS) if the source of the MT is in its coverage or as a router providing service to the mobile users. The roles of RS and a router are mentioned below. 2.1.1



Roles of a Rendezvous Server



The RS maintains two data structures namely Group Table (GT) and a Group Membership table (GMT). The GT contains the names of all groups maintained by the RS. A Group Membership table is maintained for every group in the RS as follows: Table 1. GMT in Router 1 based on Figure 1. User name



Router name



MAI MA4



Rl R5



RS updates entries in GMT as per users movements. 2.1.2



Roles of a Router



Each router maintains a table called Foreign Member Table (FMT) as follows:



 270



Table 2. FMT in Router 6 based on Figure 1. Group name



User name



MA5 MB 2



A B



RS name



Rl R7



This table contains the list of users in its coverage, their group names and the address of the RS of that group. When the router receives a data packet from the group user, it forwards it to the group's RS as shown in Fig. 1. As there are frequent changes in the position of the mobile users [8, 9] and as it is required to construct a MT satisfying the QoS requirements [4], this motivated us to propose a GA based OMT computation in this paper. 2.2



Proposed algorithm to construct an OMT



Step 1: Y i e N, each i transfers information regarding constraints, viz., probability of delay over a path, queuing delay at a node, and residual bandwidth of a link to RS. Step 2: This information is used as an input to the Genetic algorithm to compute the optimal path to each of the routers from the RS. Step 3: From the optimal paths computed from Step 2, an OMT is computed using Genetic algorithm, with RS as the root of the tree. 2.2.1



Formulation of Fitness Function



The first constraint in computing the OMT is the probability of delay in a path P of length m. From the definition of Erlang-K distribution, with arrival rate J, the probability of delay over a path P of length m less than t is given by the expression: Fl = (yT-'e1")/ (m-1)! for 1< m > N-l (1) The second constraint is the queuing delay at the individual nodes. If every node is assumed to have N buffers, the queuing delay at a node is given by the expression: F2 = min (5, t) for 1< i > N (2) where t is the time required to load or free a buffer and 8i is the number of free buffers at node i. The third constraint, the residual bandwidth of a link in the network after allocating bandwidth bm for a link m is given by (cm- bm), where c m is the capacity of a link m e P. The fraction of total bandwidth available as residual bandwidth for a path P is given by the following expression: F3 = min (cm - bj for 1< m > N-l (3) From these three constraints, the formulated fitness function, which is a Maximization function, is given by the following expression:



F=F1+ 1/F2 +F3



(4)



 271



2.3



Simulation and Performance Analysis



The simulation of a Cellular network with mobile users is done using JAVA and the results are tabulated in Table 3 and plotted in the graphs in Fig. 2, 3, and 4. From the graphs in Fig.2, 3, and 4, it is inferred that the GA based approach



30



-Without



GA



0 0



4



8



%-



12 16 20 24 28



9



13



14



15



17



Number of Nodes



Call Arrival Rate Figure 2. Call Arrival Rate Vs Call Service Rate



12



Figure 3. No. of nodes Vs Computational Overhead



nz



-. i m



a3 0 0



25



50



75



100 I 2 5 150 175



C a It Arrival R ate



Figure 4 . Call Arrival Rate Vs Call Blocking Rate



converges faster to the optimized solution than the traditional algorithms. A comparison of the performance of a Cellular network [ 131 with OMT constructed



 272



with RPs using GA with that of MT constructed using Steiner trees [ 111 and that using SBR [ 111 is shown in Fig. 4. From the graph in Fig. 4,it is inferred that the Call blocking rate [ 131 in GA approach is drastically reduced compared to Steiner trees and Source based routing.



3



Conclusion



With the increasing demand for group communication in mobile networks, there is a growing need to construct an OMT faster, satisfying the QoS requirements. In this paper, a GA based OMT construction algorithm using RPs for a Cellular network is proposed. The simulation results show that this algorithm performs better than the traditional algorithms for Multicast in terms of speed of computation. Also it is seen that there is considerable reduction in Call dropping rate. As such, it can be very well used by any multi group communication in a Mobile environment.



References 1. Hrishikesh Gossain, Carlos de Morals Cordeiro, and Dharma P. Agrawal,



“ Multicasting in Wireless Environment ”, E E E Communications Magazine, VOl. 40, Jun 2002, pp 116-123 2. Jinquan Dai, Touhai Angchuan, Hung Keng Pung, ‘‘ QROUTE : an QoS- guranteed multicast routing ”, Computer Communications, 2003,pp 90-88. 3. Nilanjan Baneqe and Sajal K. Das, “MODeRN: Multicast On- Demand QoS-based Routing in Wireless Networks ”, IEEE Proceedings of Vehicular Technology Conference, 2001, pp 2167-171 4. Xin Yuan, “ Heuristic algorithms for multi constrained QoS routing”, IEEE/ACM Transactions on Networking, vol. 10, no. 2, Apr 2002 pp. 244-256 5. D..Zapala, ‘‘ Alternate path routing for Multicast”, IEEWACM Transactions on Networking, vol. 12, no. 1, Feb 2004 pp. 30-43 6. D.Zapala, “ Multicast routing support for real-time applications ”, Ph.D. Dissertation, University of Southern California, Los Angels,1997 7 . S.Deering, D.Estrin, “An overview of quality of service routing for next- generation high- speed networks: problems and solutions”, IEEE Network, vol. 12, Dec 1998, pp. 64-79. 8. Charles E. Perkins, “Mobile networking in the Internet”, Mobile Networks and Applications, volume 3 , 1999, pp 319 - 334. 9. R.H. Katz, “Adaptation and mobility in wireless information systems”, IEEE Personal Communication Magazine, vol 1, 1994, pp 6-17. 10. Joshua Auerdach et. a1 “ Multicast Group Membership Management”, IEEWACM Transactions on Networking, vol 11,2003, pp 166-175. 11. Ralph Wittmann, Martina Zitterbart, “ Multicast omunication -Protocols and Applications ”, Morgaun Kaufmann Publishers, 2001 12. David E. Goldberg “Genetic Algorithms in Search, Optimization, and Machine Learning”, 2“d Edition. 2004 13. Jochen Schiller, “ Mobile Communications ”, 2“dEdition, Pearson Education, 2003.



 RECEPTION-AWARE POWER CONTROL IN AD HOC MOBILE NETWORKS



N. SHRESTHA AND B. MANS Department of Computing, Macquarie University, North Ryde 21 09, Australia E-mail: { nirisha, bmans} @ics.mq.edu. au Energy is a scarce resource in ad hoc mobile networks, making power control a crucial technique. For sake of simplicity, most existing power control protocols only consider the energy cost of transmissions. In this paper, we consider a more realistic model of power control in which the energy cost of the receiving nodes is also taken into account. This paper presents protocols that can not only achieve energy savings but can actually increase the effectiveness by bringing significant energy savings per successful packet for denser networks.



1. Introduction



In Mobile Ad Hoc Network (MANET), the nodes are usually powered by limited energy resources. By default, they use maximum power to transmit packets. Reduced transmission power results in increased spatial reuse, less interference and has been shown to increase the throughput and energy consumption [l-31, so effective power control techniques in the nodes is a critical issue. Most of the existing power control methodologies only consider the cost of transmission. They calculate the minimum neighborhood which ensures connectivity of the network while reducing transmission power costs. Direct communication links are removed if found t o cost more than communicating through an intermediate node. Protocols in [2, 31 use topological properties of the network like node degree, coverage over an angle, etc., t o recalculate neighborhood. Unfortunately, reception energy costs may be comparable to the transmission costs and should be included in realistic models. (Optimal routing may not be tractable, as it was proved in [4] that finding a unicast path that guarantees enough remaining energy locally at each node is an NP-complete problem even when all the nodes transmit at the same power.) In [ 5 ] , an analytical model for optimal transmission range for energy minimization while considering reception costs is 273



 274



presented. The protocol for minimum power network in [6] also considers the reception energy (the paper however does not evaluate the effect of the protocol on the throughput). Finally, little analysis has been done on the efficiency of power-aware protocols towards the energy vs. throughput costs (i.e., the number of Joules required per successfully delivered packet). Energy savings can be inadequate as they could be arbitrarily achieved by substantially increasing the length or number of back-offs and thus reducing the number of packets successfully received. This paper aims to include reception power in power control, attempting t o minimize the number of receptions. We investigate its impact on energy and throughput of the network. In section 2, we present our protocol for including reception costs in power control, then we analyze its performance through simulation results in section 3 and further discussions in section 4. 2. Reception-Aware Power Control



This power control scheme tries to maintain a minimum power neighborhood, while considering the transmission cost of the transmitter(s), reception cost of the receiver(s) and reception costs at the overhearing nodes. Each node u sends a t maximum transmission power, periodic hello packets containing the power of transmission of this packet, a list of the issuing node's immediate neighbors and their current transmission powers. For each hello packet from neighbor 21, u measures the power of reception PR, and calculates the path gain G using the transmission power PT(v,) indicated in the packet. Using this information, Signal t o Interference Ratio (SIR) measured during packet reception SIR,, the SIR threshold parameter SIRth, (default lOdB in IEEE 802.11 networks), and the minimum reception threshold PR~,, , the minimum transmission power required to reach t o this neighbor p ~ ~ is ~ calculated , ( ~ ~as)in [7] and shown in equation 1. The calculated power is stored in the neighborhood table while the powers of the two-hop nodes from the hello packets are stored in the two-hop neighborhood table.



For each neighbor w already in its neighbor list, it then calculates if going through this new neighbor t o reach the existing neighbor will cost less energy as in [6] as follows: The energy consumed E~(zy) while transmitting from node 2 to node



 275



where, PT(ZY) is the transmission power required to transmit from any node 1: t o y , PR is the power required to receive a packet, I is the number of nodes in the interference region of this transmission, including the receiving node and t is the transmission time of the packet. If the energy to transmit directly from u to v is greater than to transmit through w , ie., ET(UW) > ET(UW) ET(ww), then neighbor w is marked and added to 2-hop neighborhood, with node w as its 1-hop precedent. The marked nodes are now considered as 2-hop neighbors only.



+



3. Performance Evaluation 3.1. Simulation Setup



(a) Network with 50 nodes



(b) Network with 100 nodes ( c ) Network with 120 nodes



Figure 1. Average Energy Consumption of Nodes.



The reception-aware power control scheme was implemented in NS2 simulator and Optimised Link State Routing (OLSR) protocol was used to evaluate the efficiency of the protocol. 32 to 120 nodes were randomly placed in a square field of 1000 x lOOOm with default transmission radius of 250m. For each network size, 10 simulations were run; for each run, 4 Constant Bit Rate (CBR) flows were generated for 100 seconds of the simulation time. The performance of reception-aware power control (Rec-Aware PwrCtrl) was compared t o IEEE 802.11 type of networks, one without any power control (No PwrCtrl) and one with transmission power control only (PwrCtrl), where minimum power t o reach the existing neighbors was used for transmission (i.e., there is no change in the neighborhood). The power consumption was separated into electronic consumption (constant)



 276



and power amplifier consumption (variable part) as in [6]. Thus, equation 2 becomes:



ET(ZY) = (PT~(zY) + P T e l e c + PRelec



x 1) x



t



(3) Here, P T ( P~R e l e~c respectively) ~ ~ represents the difference between total transmission (reception respectively1 cost and the idle energy consumption. Both were set to 250mW, while the idle energy consumption was set to 0. Each node had six discrete transmission power levels ( P T ~similar ) t o the six power levels available in the Cisco Aironet 350 wireless cards.



3.2. Simulation Results



,



. . ......



0.05



lim 1-1



(a) Network with 50 nodes



Figure 2.



lime (SEC)



O l " ' . ' ' " . 0 10 20 30 4 50 EU 70 8U 80 I W Tim 1-1



(b) Network with 100 nodes (c) Network with 120 nodes



Average Energy Consumption per Successful Packet Transfer (Utility).



For all the settings of the simulations, the average energy consumption for the 10 runs are shown in Figure 1. Both the power control schemes were found to consume less energy than the network without power control, saving up to almost 82% energy for some nodes. Rec-Aware PwrCtrl was found t o perform comparatively better, with an energy saving of upto almost 14% on average compared to about 5% average savings for PwrCtrl scheme. The overall gain was upto almost 29% in Rec-Aware PwrCtrl, compared t o almost 12% in the PwrCtrl scheme. Figure 2 shows the average energy consumption per successful packet sent (utility). The average number of packets that were sent were similar for sparser networks, but the Rec-Aware PwrCtrl could send upto about 13% more CBR packets in denser networks. The number of these packets that were successfully received were also comparable to No PwrCtrl scheme, with loss of up to about 10% in average. The interesting observation was that the average energy spent for each packet that were successfully sent (utility) showed a general decrease for the power control schemes. Rec- Aware PwrCtrl took



 277



14.4% less energy in average t o send a successful packet compared to No PwrCtrl. T h e PwrCtrl scheme showed similar utility as the No PwrCtrl scheme, with small loss of about 1% in average. The paired t-test was used t o compare the energy gain of the RecAware PwrCtrl scheme and PwrCtrl Scheme. T h e difference was found to be statistically significant, with the two-tailed probability of 0.036. 4. Discussions and Conclusion Thus, in this paper, it was seen t h a t the average energy consumption decreased with power control techniques, and Rec-Aware PwrCtrl performed better at retaining this energy conservation in larger and denser networks. It was also observed t h a t though the average packets sent and successfully received were comparable for different protocols for all network sizes, the utility remained better for Rec-Aware PwrCtrl. It indicates t h a t receptionaware protocol is likely t o choose path with less number of receivers, increasing the number of successful packets, while spending less energy.



References 1. S. Narayanswamy, et al., The COMPOW protocol for power control in ad hoc networks: Theory, architecture, algorithm, implementation, and experimentation. European Wireless Conference. (2002) 2. R. Wattenhofer, L. Li, V. Bahl, and Y. Wang, Distributed topology control for power eficient operation in multihop wireless ad hoc networks. IEEE INFOCOM. (2001) 3. V. Rodoplu and T.H. Meng, Minimum energy mobile wireless network. IEEE Journal on Selected Areas in Communications 17 (1999) 4. G. Allard, and B. Mans, Reducing the energy drain in multihop ad hoc networks. IEEE proceedings, International Workshop on Localized Communication and Topology Protocols for Ad hoc Networks (LOCAN’05) Washington, DC, USA (2005) 5 . Y. Chen, E.G. Sirer, and S. B. Wicker, On selection of optimal transmission power for ad hoc networks. Proceedings of the 36th Annual Hawaii International Conference on System Sciences (HICSS’03) - Tkack 9, Washington, DC, USA (2003) 6. B. H. Liu, Y. Gao, C.T. Chou, and S. Jha, An energy eficient select optimal neighbor protocol for wireless ad hoc networks. Proceedings of the 29th Annual IEEE International Conference on Local Computer Networks (LCN’04), Washington, DC, USA, IEEE Computer Society, 626-633 (2004) 7. X. Lin, Y. Kwok, and K. Lau, A new power control approach for IEEE 802.21 ad hoc networks. Proceedings of the 14th IEEE International Symposium on Personal, Indoor, and Mobile Radio Communications (PIMRC’2003), Beijing, China, 2 , 1761-1765 (2003)



 UNIFORM MOBILITY MODEL FOR AD HOC NETWORKS



Prof. Dr. S.K. BODHE. Professor, Dept of E &TC ,Rajarshi Shahu College of Engg Tathawade, Pune ;Maharashtra,; INDIA E-rnai1:s-k-bodhe @ indiatimes. corn Mrs. AMRITA ANIL AGASHE Asst. Prof: Etrx., Dr.JJMCOE.Jaysingpur. , Maharashtra, INDIA Mr. ANIL A.AGASHE Walchand College of Engg. Sangli. Maharashtra, INDIA E -mail: agashe-anill@ redrffmail.com



The tremendous demand from social market are pushing the development of mobile communications faster than ever before leading to plenty of new techniques emerging. Mobile Ad-Hoc Networks are ideal technology to deploy an instant communication network for civilian and military applications. Mobility management is the fundamental technology used to automatically support mobile terminals enjoying their services while simultaneously roaming freely without the disruption of communications. For the evaluation of mobility management schemes for an ad hoc networks, tests should be canied out under realistic conditions including a sensible transmission range, data traffic models and mobility models. This paper gives the survey of mobility models that are used in the simulations of ad hoc networks and the new mobility model which is a good combination of Random walk mobility model, Random waypoint mobility model and Random Direction mobility model with solution of initialization problem dealing with Random waypoint mobility model.



Introduction The performance of mobility management schemes strongly depends upon the mobility model that accurately represents the mobile nodes that will eventually utilize the given protocol. Currently there are two types of mobility models used in simulation of networks: Traces and Synthetic models. Traces are those mobility patterns that are observed in real life systems. Traces provides accurate information especially when they involve a large number of participants and appropriately long observation period. For ad hoc networks it is necessary to use synthetic models. Synthetic models attempts to realistically represent the behavior of mobile nodes without the use of traces. In this paper we represent several synthetic mobility models that have been proposed for the performance evaluation of ad hoc networks. 278



 279 In this paper we take review of following different synthetic entity mobility models for ad hoc networks, and discuss about Uniform Mobility Model. 1) 2)



3)



Random walk Mobility Model: A mobility model based on random directions and speeds. Random Way point Mobility Model: A model that includes pause times between changes in destination and speed. Random Direction Mobility Model: A model that forces mobile nodes to travel to the edge of the simulation area before changing direction and speed. 1) Random Walk Mobility Model: It is widely used mobility model, which is sometimes referred to as Brownian motion. Random walk mobility model was first described mathematically by Einstein in 1926. [ 11 Since many entities in nature move in extremely unpredictably ways, the Random walk mobility model was developed to mimic this erratic movement. In this mobility model the mobile node moves from it’s current location to new location by randomly choosing a direction and speed in which to travel. The new speed and direction both are chosen from predefined ranges (speedmin, speedmax) and (0,2n) respectively. Each movement in the Random walk mobility model occurs in either a constant time interval ‘t’ or a constant distance traveled ‘d’, at the end of which a new direction and speed are calculated. If Mobile node reaches the boundary, it “bounces” off the simulation border with an angle determined by the incoming direction. The mobile node then continues along this new path. Many derivatives of the Random walk mobility model have been developed including 1-D, 2-D, 3-D and 4-D walk. The 2-D Random walk mobility model is of special interest, since the Earth’s surface is modeled using a 2-D representation. Fig . (1) shows example of movement observed from this 2-D model. At each point mobile node randomly chooses a direction between 0 and 2.n and speed between (Speedmin., Speedmax.) In the Random walk Mobility model, the mobile node may change direction after travelling specified distance instead of specified time. The Random walk Mobility model is a memory less mobility pattern because it retains no knowledge concerning it’s past locations and speed values. The current speed and location of mobile node is independent of it’s past speed and direction.



 280



800 600



-



400 200 --



0



50



100



150



200



250



300



Fig. 1: Random movements observed for 2-D Random walk mobility model.



2) Random Way point Mobility Model: The Random waypoint model includes pause times between changes in direction and/or speed [2]. The mobile node begins by staying in one location for certain period of time i.e. a pause time. Once this time expires, the mobile node chooses a random destination in the simulation area and a speed that is uniformly distributed between (speedmin, speedmax). The mobile node then travels towards the newly chosen destination at the selected speed. Upon arrival, the mobile node pauses for a specified time period before starting the process again. Fig. 2 shows an example travelling pattern of the mobile node using Random Waypoint Mobility Model. The movement pattern of the mobile node using Random waypoint mobility model is similar to the Random walk mobility model if pause time is zero. In most of the performance evaluations which uses Random waypoint model, the mobile nodes are initially distributed randomly. This initial distribution of mobile nodes is not representative of the manner in which nodes distribute themselves when moving. The average mobile node neighbor percentage is the cumulative percentage of total mobile nodes that are a given



 281 mobile node’s neighbor. e.g. If there are 50 mobile nodes in network and node has 10 neighbors, then the nodes current neighbor percentage is 20%. A neighbor of the mobile node is a node within the mobile model’s transmission range. There is high variability in average mobile node neighbor percentage. This produces high variability in performance results. But by discarding some initial simulation time this problem can be overcome. There is also complex relationship between node speed and pause time. A scenario with test mobile nodes and long pause time produces a more stable network than a scenario with slower mobile nodes and shorter pause times.



I



1000 800 600 400 200 0 50



100



150



200



250



300



Fig. 2: Movements of mobile node observed for 2-D Random waypoint mobility model



3) Random Direction Mobility Model: A Random Direction Mobility Model was created to overcome density waves in the average number of neighbors produced by Random waypoint mobility model. A density wave is the clustering of nodes in one part of the simulation area. In the case of Random waypoint mobility model, the clustering occurs near the center of simulation area. In the Random waypoint mobility model, the probability of the mobile node choosing a new destination



I



 282 that is located in the center of the simulation the center of the simulation area, or destination which requires travel through the middle of simulation area is high. Thus the mobile nodes appear to converge, disperse and converge again. In order to alleviate this type of behavior and promote semi constant number of neighbors throughout the simulation, the Random Direction Mobility Model was developed [3]. In this Model, mobile nodes choose a random direction in which to travel similar to Random walk mobility model. The Mobile node then travels to the border of the simulation area in that direction once the simulation boundary is reached, the mobile mode pauses for a specified time, chooses another angular direction (between 0 and 180 degrees) and continues the process, since mobile nodes travel to, and usually pause at the border of the simulation area, the average hop count for data packets using the Random Direction Mobility Model will be much higher than the average hop count of most other mobility models.



4)Uniform Mobility Model: Uniform mobility model collects good features of Random walk mobility model, Random waypoint mobility model and Random Direction mobility model. For each node the initial position is chosen in two simple steps: Choose an initial path and then choose a point on the path. The probability density of any chosen path is proportional to the length of path. In other words , the initial path is chosen by choosing end points (XI , yI) and



(x2 ,y 2 ) i n such a way that joint probability density of these two points is proportional to the distance between them. A convenient way to do this is by rejection sampling [4]. For this following steps are adopted. 4.1) Rejection Sampling Method: 1) Generate two points (xi , y1) and (x2 , y2 ) uniformly on the unit square.



2)



Compute



r=



J(%- x1 l2 + (Y2 - Y l ) ”



Generate a random value



3) 4) If



5)



./z u, uniformly on (0,1)



u,< r ,then accept (xi , y1) and (x2 ,y2 ) , otherwise go to step 1. Generate a random value u, uniformly on (0,1)



 283



6) The initial location of the node is



bz



072x1 + (1- u z u2 Y 1 + (1 - u2 )Y 2 ) 4.2) Speed of node: At any time t , the node has speed . For any to be the expected positive number M define the function F, 7



(s)



(0, M Define F ( S ) = 1irnM+-F, ( S )



proportion of time in the interval



s



).



F ( S ) is cumulative distribution function of stationary distribution of s . If node is traveling at speed s along a path of length 1 , the time spent on the path is )/s . Therefore the conditional density f(j0 of speed,



Then Define



given path length is proportional to path,



x . Since speed is chosen independent of



f(s) is proportional to mean path length, it follows that f(s)is



x.



proportional to 4.3) Pause Time: Here it is assumed that at each destination , a pause time P is chosen according to probability density function h(P).In practice ,



h(P) is a uniform distribution but in principle this need not be so. Further we assume that P is independent of speed s and locations . Begin by computing the proportion of time that a node is paused. We refer to the travel between two consecutive destinations as an excursion. Let T be the time spent traveling on excursion excluding. Let E(P) denote the expected length of pause and let two pauses. P,,,,



E ( T ) is time elapsed in traveling between



is the long run proportion of time spent paused.



4A)Direction of node: Mobile nodes continue to choose random directions but they are no longer forced to travel to the simulation boundary before stopping to change direction. Instead ,an mobile node chooses a random direction and selects a destination anywhere along that direction to travel. Simulation: For simulation area chosen is 1000 X 1000; which is a square. The simulation for 2-D Uniform Mobility Model is carried out for 25 nodes to find new x and y location by considering above described speed, pause time



 284



and direction of motion of node. This simulation has been performed by using Matlab .



Fig. 3:cdf plot for new x location



 285



fig. 4:cdf plot for new y-location



Conclusion A brief review of mobility models for ad hoc networks is taken. In order to construct more efficient mobility model for simulation we have developed a mobility model called as ‘Uniform Mobility Model’. In this model the stationary distributions for speed and pause time, and random directions are considered, for a node moving in rectangular area. For initial location six steps mentioned in section 4.1 are used those eliminates the initialization discrepancy which is present in Random waypoint mobility model.



 286



References [ 11 M.Sanchez , P.Manzoni ;Java based simulator for ad hoc networks [2] D.Johnson ,D.Maltz.; Dynamic source routing in ad hoc wireless networks. [3] E. Royer ,P.M. Melliar -Smith , L .Moser; An analysis of the optimum node density for ad hoc mobile networks.;IEEE 2001 [4] William Navidi , Tracy Camp ;Stationary Distributions for the Random Waypoint Mobility Model .



 SECURE CLUSTER HEAD ELECTION ALGORITHM FOR WIRELESS SENSOR NETWORKS



V. CHENNAKESAVULU B. DEY AND S. NAND1 Indian Institute of Technology Guwahati ,India, {nchenna, bdey,sukumar} @ iitg.ernet.in



Security is the one of the main issue presently got attention by most of sensor network researchers. Many sensor network routing protocols have been proposed without giving proper attention to security aspects. The presence of a single compromised node can invert the purpose of the protocol. We have proposed a ‘trust based cluster head election’ scheme that allows only trusted nodes to acquire greater responsibilities in the network like aggregation and cryptography information maintenance. Present scheme introduces the concept of ‘Super Nodes’ which are having high voting power in these elections. We present statistical results for the number of nodes the adversary has to compromise to get these responsibilities. Present scheme also considers energy resources of nodes during the cluster head election process. A secure routing protocol can be built on this approach.



1. Introduction Wireless Sensor networks consists of hundreds or thousands of nodes randomly distributed over the terrain, all nodes are low-power and low-cost and are possibly mobile. Considering security, security can be provided in different layers of protocol stack. Considering security to achieve a secure system, security must be integrated into every component. Security can be provided in different layers of protocol stack. . Providing security in routing layer is a challenging task as routing layer is most vulnerable to attacks and also very important in the aspect of network operations. Existing routing protocols are heavily vulnerable to compromised node attacks.This paper deals with providing security in routing layer. Several routing protocols exist for sensor networks. Out of them cluster based routing protocols are widely applicable to wireless sensor networks. LEACH is a simple cluster based routing protocol that divides the whole network into clusters and elects the respective cluster heads. The Energy aware clustering protocol deals with energy issues in cluster formation. The Density estimation clustering protocol concentrates on cluster density. In Cluster based routing 11213



287



 288



protocols Cluster Head is the most important node. It is the only aggregation point in the cluster and also maintains information regarding the cluster; if a compromised node becomes a cluster head it can get the control of whole cluster. The present ‘Secure Cluster Head Election’ scheme promises that only trusted nodes can become cluster head. 2. The Secure Cluster Head Election Scheme



The ‘secure cluster head election scheme’ runs through three phases namely, Initial Phase, Key setup phase and Cluster Head Election Phase. In the initial phase of the Cluster Head Election Scheme we use the protocols LEACH Energy Aware Clustering Protocol and Density Estimation Clustering Protocol to find clusters and cluster heads. In the Key Setup Phase all cryptographic information required for the safe run of the protocol is established. Sink assigns different identities to different clusters and also it establishes secret keys with the clusters. Within a cluster, cluster head assigns local ids to all sensor nodes in the cluster and establishes secret keys with the sensor nodes. Every node in the cluster identifies its neighbors and establishes secret keys with those neighbors. We assume that during these phases no adversary is present and all nodes are alive. All messages exchanged in the network are signed with the source of that message with the secure keys available between neighbors and between sensor nodes and cluster head. Every message includes the ‘id’of it’s source.



’,



2.1. Cluster head election Phase The present cluster head initiates election process and elects a new cluster head. In this scheme it has a set of nodes (P) which are referred as “Super Nodes” having high voting capability over normal sensor nodes in these elections. These “Super Nodes” are taken as the last ‘IPI’ round cluster heads. This scheme follows an innovative strategy of ‘Threshold based Trust Value’ calculation for cluster head elections, which decides a particular node is trusted or compromised. During the elections every node gives its votes towards its Active Neighbors (Message Forwarding neighbors) to the present cluster head. A node can give either a positive or a negative vote for any of its Active Neighbors depending on how much it trusts that particular neighbor. In this algorithm, every node has equal chance of electing as a cluster head. A node which is elected as a cluster head cannot contest for next



 289



‘r’(= lPl) rounds of elections. This is to prevent the node from becoming cluster head again and again which drains out its energy. 2.1.1. Trust value calculation For every node its neighbors maintain its “Trust Values”. Initially a node keeps its ‘Trust Value = 0.5’ for all of its neighbors. If all the messages forwarded through a neighbor reaches the cluster head then it makes its “Trust Value” entry for that forwarding neighbor as ‘1’else it makes its “Trust Value” as ‘O’.If a node is not using some of its neighbors for the communication then it won’t change the ‘Trust Value’ of those neighbors. A Sensor node sends a vote for its neighbor as positive if its ‘Trust Value = 1’ for that neighbor, negative if its ‘Trust Value =0’ and no vote if its ‘Trust Value = 0.5’. Trust Values are preserved from round to round. 2.1.2. Salt value calculation Whenever a sensor node wants to send some message to the cluster head it includes a Nonce (sequence number) in the message and sends the message. Sensor node maintains a table for its forwarding neighbors called ‘Salt Table’ and in that table it maintains a value called ‘Salt Value’ for each of its forwarding neighbors. Cluster head also maintains a ‘Salt Table’ for the sensor nodes present in the cluster and the Salt Table has an entry called ‘Salt Sum’ for each sensor node in the cluster. If a sensor node wants to send a message to the cluster head it selects one of its forwarding neighbor and forwards the message through that neighbor. After sending the message it updates the ‘Salt Value’ of the corresponding forwarding neighbor by adding the sequence number of the message it has just forwarded through that neighbor to the current ‘Salt Value’ field. In the same way the cluster head, if it receives a message from any of the sensor nodes it updates the ‘Salt Sum’ field of corresponding sensor node in its ‘Salt Table’ by adding the sequence number of the message it has received from that sensor node. After a set of messages have been received the cluster head gives a cumulative ‘Reply’ message to the sensor node and in that message it gives the ‘Salt Sum’ value of that corresponding sensor node. The sensor node after receiving the ‘Reply’ message tallies the ‘Salt Sum’ value with the sum of the ‘Salt Value’s present in its ‘Salt Table’. It finds out the sequence numbers of the missing messages and identifies the neighbors through which these messages are transferred and makes its ‘Trust Value’ as false for those neighbors.



 290



2.1.3. Voting and Winner Selection During the election process every node sends its votes to the cluster head. Present cluster head ignores the votes from the nodes that are declared as compromised in the previous rounds. For each node in the cluster, the cluster head process only the negative votes. If the number of negative votes for a node crosses threshold (T) percentage of its total votes then it is declared as a compromised node. The votes from the nodes which are found t o be compromised either in this round or in the previous rounds are ignored and those nodes won’t be considered for the cluster head elections. Now the positive votes of remaining nodes are considered for further processing. The present cluster head process the positive votes of these nodes and calculates the positive vote weight for each node unless the node is a ‘Super Node’ i.e. not in set P. The top 5 highest vote scored nodes are considered for further processing as candidates for next cluster head. Credit Weight will be calculated for each candidate node t o select the winner. The Credit Weight includes Positive Vote Weight 6 , Location Weight of the node Loci, and Weight of Available Energy Resources Ei. These weights are taken as the normalized values individually The Credit Weight is calculated as follows W c r e d i t = Penergy



Ei + Pwote & + 80,Loci



(1)



Here Penergyr Pwote,Piocare the amount of preference given to energy ,vote and location criteria respectively and Penergy Pwote Pioc = 1 .



+



+



3. Analysis Let ‘n’ be the no. of nodes distributed per cluster, out of which [PInodes which are previous round cluster heads considered as “Super Nodes” and ‘r’ nodes ( IPI ) will be out of competition as previous ‘r’ round cluster heads cannot participate in the election process. So we have ‘n-r’ nodes competing for cluster head elections with uniform probability of ‘l/(n-r)’ each. N is the ‘highest number of votes’ polled for a sensor node in the cluster in this round, this is nothing but the Max. Active Neighbor Hood Value in the network. The no. of nodes t o be compromised for an adversary t o make his compromised node as cluster head in a ‘k’hop cluster with Threshold percentage T as 50% follows this equation. SO= N/2; ( r 1)terrns



+



s, = croNT+1/2T+1 - Cr1NT/2T + CrzNT-1/2‘-1



- ..(-)W/2,



if?- > 0; (2)



 291



If we allow max 4 hop nodes in the cluster the no. of nodes that the adversary has to compromise will be around S3 = N4/16 - 3



* N 3 / 8 + 3 * N2/4 = ( N / 2 ) 2 [ ( N / 2 )-2 (3 * N / 2 ) + 31



No.of MaxNeighnodes bourhood



Max no. of



Threshold (T) per-



Nodes to be compro-



(3)



’% of nodes to be com-



10 Above table analyzes that an adversary has t o compromise more than half of the nodes t o get itself as a cluster head in most of the cases.



4. Conclusion In this paper we have proposed a ‘Trust Based Secure Cluster Head Election Algorithm’ which stops a compromised node from becoming a cluster head in turn makes a way for the design of secure routing in sensor networks. We have shown in the analysis how better our approach prevents a compromised node getting important responsibilities of the network.



References 1. Heinzelman, A. Chandrakasan, and H. Balakrishnan, “Energy-efficient communication protocol for wireless sensor networks,” in the Proceeding of the Hawaii International Conference System Sciences, Hawaii, January 2000. 2. M.Younis, M. Youssef and K . Arisha, “Energy-Aware Routing in ClusterBased Sensor Networks”, in the Proceedings of the 10th IEEE/ACM International Symposium on Modeling, Analysis and Simulation of Computer and Telecommunication Systems (MASCOTSZOOZ),Fort Worth, TX, October 2002 3. Robert D.Nowak, “Distributed EM Algorithms for Density Estimation and Clustering in Sensor Networks”JEEE Transactions On Signal Processing, Vo1.51, No. 8, August 2003.



 MAXIMIZING LIFE-TIME OF WIRELESS SENSOR NETWORKS BHUPENDRA KUMAR MEENA Reliability Engineering, Indian Institute of Technology Bombay, Powai Mumbai-400076 INDIA E-mail: [email protected] VALLIESWARAN V. K.R.School of Information Technology, Indian Institute of Technology Bombay, Powai Mumbai-400076 INDIA E-mail: [email protected] This paper focuses on Optimal routing mes to maximize the duration over which the sensing task can be performed, but requires future Knowledge. Two different scenarios with blue noise based selection of nodes for routing are simulated. One with all nodes having equal functionality and other with a node acting of coordinator summarises data and replies queries. The algorithm for selection of coordinator and query replying is presented. The number of messages for queries (in networks) and network life time of different sizes is monitored using a java simulation. This paper concludes that sensors networks life time depends on data sent per query and number of nodes in network.



1



Introduction



Wireless sensor nodes can be deployed on a battlefield and organize themselves in a large-scale ad-hoc network. Traditional routing protocols do not take into account that a node contains only a limited energy supply. Optimal routing tries to maximize the duration over which the sensing task can be performed, but requires future Knowledge. This paper considers that applications of wireless sensor networks where a spatially band-limited physical phenomenon (e.g., temperature, pressure, low-frequency vibrations) can be sufficiently monitored by a subset of the nodes (randomly) deployed in the environment. In other words, the total number of sensors deployed is such that if the sensors were placed uniformly within the area, the Nyquist criteria [l] would be met in terms of spatial frequency. While such a random distribution does not provide ideal uniform coverage, it does guarantee within a reasonable likelihood that the area is covered densely enough to meet, if not exceed, the Nyquist criteria requirements within most sub regions. This paper proposes a method that determines which sensors within the more densely covered sub regions should be selected to acquire data from the environment and which nodes should remain inactive in order to conserve energy. The proposed method is especially suitable for applications where it is desirable to trade spatial resolution for sensor network longevity.



292



 293



Our proposed method chooses sensor subsets such that the sensor positions can be mapped into the blue-noise binary patterns that are used in many image processing applications [l]. This method guarantees that the subsets would be chosen such that each subset provides a near optimal signal-to-noise ratio for the given sensor distribution and desired number of active sensors [2]. Meanwhile, the method also guarantees that the sensor nodes with the minimum residual energy are the primary candidates for deselect ion (i.e., they are the first to be turned off). And then routing should be performed through high energy nodes. This work investigates how a blue noise sampling techniques [4 ][5][6] being used to trace out highly residual energy nodes.



2. RELATED WORK:Let’s suppose following is a wireless sensor network having three regions as shown:



n



Figure 1 . A wireless sensor network with three regions as A, B, C and each region contains their high residual energy nodes(marked ‘N’) and low residual energy nodes(marked ‘n’). [8][lo]. *below is the image grid representation of the above Wireless sensor network. [7]



Figure 2. Image grid representation of the above WSN Firstly, the high residual energy nodes (as marked ‘1’ by image grid) of each region separately have to be selected.



 294



3. PROPOSED ENERGY EFFICIENT ALGORITHM: Now, after getting the high residual energy nodes[8] (as marked 1 by image grid) of each region, is proposed in [1][3]. This algorithm considers only selection of nodes according to residual energy of the node. This does not consider routing energy cost. The selection of these nodes have to be communicated, and so this also consumes a large amount of energy. Every message send from one node to other reduces the energy stored in the entire system. Message power = transmitter power * Time (1) The Power spent on transmitting one message is proportional to product of transmitter power and time for transmitting the message. If the number of messages in the system increases the life of the sensor network decreases. So, to reduce the number of message and selection complexity the following improved algorithm is used. Step I . Initially each selected nodes of regions will transmit an empty packet to all the other nodes of the same region. Step 2. Then every node in the region replies with number of number of empty packets received. The node with maximum number is the coordinator of the region Step 3: Then after getting reply from others, each node will have to maintain a routing table to the node with coordinator of the region & information about its nearest neighbour. Step 4: Step 2 and 3 must be followed for all the regions in similar fashion. Step 5: Now, all the selected nodes of each region got an idea about their nearest neighbour and leader. Step 6: Then, all the transmission in any region will be proceeds in an energy efficient way with covering shortest paths and through higher residual energy nodes to coordinator of the region. Step 7: The region coordinator does the processing of the query and send reply to the other region with result in a single message instead of all sensors messages. Operations are like maximud minimud average sensed values of the region this can be done by simple Digital circuit added to the sensor.



4. SIMULATION RESULTS From the paper [3] results, increase in longevity of network by choosing high energy nodes for routing is figured out. Sensors are simulated as objects in java and blue nose based sensor selector as global object where it stores the energy level of every node and dictates selection based on power formula given in paper [3] for 100x100 grid. The following assumption are made Introduced one query to get data from the sensors in a region in 100x100 grids. One sensor will last for 1000 hrs without any packet transfer. Power for lmin will be consumed by 100 packet transfer from the node. Number of sensors data will be used to get accurate result for a query is used.



 295



The number of messages per query on average and the life time of sensor network is calculated with uniform distribution of sensors in the grid (as in the previous algorithms). But the number of messages drastically reduced due to elimination of broadcasting of data.



I



life time graph In improved algorithm 1000 800



c



sensors per query(in



.f 600 -4-



400



t



200 networks life for 5 sensors per query(in hrs 1



0



I



networks life for 1( sensors per query(in



no of Sensors



Figure 3. Life time of sensors In figure 3, x-axis represents the number nodes, y-axis represents the network life time in number of hours and the three different lines shows different number of nodes used for data in each query. Improvement in life time with respect to other algorithms (due to reduction in broadcast of data) is high. But the factor (how much message transfer affect the life of network) has two components one is data sent per query and other is number of nodes used up in processing of query. Comparing both the graphs it shows mrparisonoftwoalgorlthms



cuuerrise dpithn



Figure 4. Number of packets in network per query on an average decrease in network life time proportional to data sent per query. In the improved algorithm, the decrease in network life time is comparatively constant, but it affects initial value. In both the cases with respect to increase in number of nodes there is decrease in life time. The decrease is in continuously increasing fashion (but not linearly). This gives us an idea that, with increase in number of nodes to the decrease in life time, may be in proportional to root function of the number of nodes



 296



5. CONCLUSIONS 1.



Our proposed algorithm gives the better approach of using the blue noise sampling technique. 2. By using our improved algorithm the life time of Wireless sensor networks gets increased up to 32 percent in compared with the old blue noise selection method ( as shown in figure 4) , 3. As the number of selected nodes (per query) decreases then the reliability and operability of the WSN goes up. 4. The life time is inversely proportional to number of sensors selected for each query and some root function of number of sensors in network.



6. FUTURE WORK Our plan is to extends this blue noise sampling techniques[3] to include various other parameters of wireless sensor networks such as pressure, temperature and other environmental and WSN factors so that the selection of the nodes and their power Consumption calculations can be made more accurate. Currently this paper considers only packets transfer but, in future, we would like to include interference, collision and retransmission occurring in message transmissions.



REFERENCES [ 13 Mark Perillo, Zeljko Ignjatovic and Wendi Heinzelman, An energy conservation method for wireless sensor networks employing a blue noise sampling technique. IPSN, 2004 [2] S. Hiller, 0. Deussen, and A. Keller, .Tiled Blue Noise Samples,.in Proceedings of Vision, Modeling, and Visualization, 2001, p.265.271. [3] Zeljko Ignjatovic, Mark Perillo, Wendi Heinzelman, Mark Bocko, An energy conservation method for wireless sensor network using blue noise spatial sampling technique.ACM, 2004 [4] T. Mitsa and K. J. Parker, .Digital Halftoning Technique Using a Blue-Noise Mask,.in Journal of the Optical Society of America,vol. A:9, Nov. 1992, pp. 1920.1929. [5]R. A. Ulichney, .Dithering with Blue Noise,. in Proceedings of the IEEE, Jan. 1988. [6] J. chou and D. Pterovic, A Distributed and adaptive signal processing approach to reducing energy consumption in Sensor networks, IEEE (INFOCOM), 2003. [7] H.G. Feichtinger and K.H. Grochenig, Theory and practice of irregular sampling, 1994. [8] P. Ishwar, A. Kumar and K. Ramachandran, Distributed sampling for Dense Sensor Networks,(IPSN), 2003. [9] S . S. Pradhan, J. Kusuma, and K. Ramachandran, Distributed Compression in a Dense Microsensor Network. IEEE Signal Processing Magazine, March 2002. [ 101. R. Stasinski and J. Konrad. Improved POCS-based image reconstruction from irregularly-spaced samples. In Proceedings of the XI European Signal Processing Conference, 2002.



 AN EFFICIENT MOBILE CONTROLLED AND NETWORK ASSISTED



HANDOFF PROPOSAL FOR IEEE 802.11 NETWORKS THIRUMALA REDDY M AND SUKUMAR NAND1 Department of Computer Science & Engineering Indian Institute of Technology Guwahati, Assam. India E-mail: {m.thim, sukumar}@iitg.ernet.in The small coverage area of WLAN Access Points creates frequent hand-offs for mobile users. If the latency of these hand-offs is high, then the users of synchronous multimedia applications such as VoIP will experience excessive jitter. The dominating factor in W A N hand-offs has been shown to be the discovery of the candidate set of next access points. In this paper, we describe a mobile controlled and network assisted handoff procedure. The proposed procedure reduces the probing time to almost negligible. It is well suited for QoS assistance, easily coexists with FMIP, has very less maintenance cost and doesn’t need any changes to the existing hardware.



1



Introduction



The small cell size in IEEE 802.11 WLAN creates frequent handoffs potentially causing delays or disruption of communications if the latency of the hand-off is high. A major component of the hand-off process is identifying the new best available access point (AP). The methods suggested in [ 1,2,3] are low-latency handoff mechanisms but have some implementation overheads like additional information maintenance, extra hardware requirements. In this paper we describe a low latency handoff mechanism, which falls under mobile controlled and network assisted handoffs category. The proposed method works with minimum maintenance overhead and with out any extra hardware requirements. The paper is organized as follows. In Section I1 we review how 802.1 1 handoff mechanisms function and previous work in improving their overheads. In Section 111 we describe our handoff method followed by conclusion in Section IV. 2



Background and Related Work



The 802.11 handoff process can be divided into three distinct logical steps: (i) Discovery (ii) Authentication and (iii) Reassociation. In the discovery phase the mobile node scans all the channels (probing) and get the information about the neighboring APs (takes around 350ms) and then select one among them for the handoff. In the authentication and reassociation phases the MN authenticate and associate with the selected AP (takes around 40ms for these two phases). As the 297



 298



probe delay had the major contribution in overall handoff latency, in [ 1,2,3] the authors proposed some methods to reduce the probe delay. In [ l ] Shin et all proposed a method using neighbor graph (NG) and nonoverlap graph concepts. The experimental results of [ 11 gives that the probing delay is reduced to 70 ms with NG and 59ms with NG-pruning. Even though this method works properly it involves neighbor graph and non-overlap graphs maintenance overhead. In [2] Ishwar Ramani and Stefan Savage proposed SyncScan algorithm. In this algorithm the mobile node performs the channel switching at specified times, and catches the beacon packets (it eliminates the lengthy probing phase), for which all the APs and mobile nodes clocks should be synchronized, which is very difficult in the reality. In [3] Sang-Hee Park et all proposed a mobile controlled and network assisted hand off method by using A P s with dual radio frequency (RF) modules. This method reduces the handoff latency to zero, and provides QoS assisted handoff algorithm. But, to implement this algorithm we need APs with dual RF modules. It’s very difficult to update the existing 802.11 network infrastructure with these APs.



3



The Proposed Handoff Scheme



The coverage region of AP is divided into 3 zones based on signal strength levels. The zone 1 is said to be the safe zone and the probability of handoff while the mobile node is in this zone is zero. All the necessary actions required for the handoff process take place in zone2. The actual handoff takes place in zone3. Consider a scenario with 3 APs APl, AP2 and AP3 which are listening in channels 1, 6 and 11 respectively. Let’s assume that initially the mobile node is associated with APl and there in zone 1. As the mobile node is moves to zone 2 the sequence of operations that takes are as fallows 1. The Mobile node sends the Move Messages with self identity and the attached AP Identity in each channel (except the current listening channel) with sufficient time gap between each channel switching. The neighbor APs (AP2 and AP3) receive the Move Messages sent in their listening channel. 2. AP2 and AP3 send the Move Detection messages with self identity, listening channel, mobile node identity and signal strength level to APl . 3. AP1 sends a Move Confirm message per channel to the mobile node. On receiving a Move Confirm message for a particular channel from the attached AP, the mobile node stops sending the Move Messages on that particular channel. In case Move Confirm message is not received for a channel, the mobile node will send the Move Message for some predefined number of times



 299



and then stops by assuming there is no AP listening on that channel in near vicinity. 4. After entering into the zone3 the mobile node sends the Move Messages in each channel just once and waits for predefined amount of time. Then it sends the Handoff initiate message to the Attached AP (APl). 5. AP1 selects the candidate AP for handoff (Lets take AP3 in our scenario) and sends the context and identity of the mobile node to AP3. The context includes the necessary information to reduce the authentication and association delays. 6. AP1 sends the handoff response to the mobile node with the candidate AP (AP3) identity and listening channel number. 7. Mobile node performs disassociation with AP1. 8. Mobile node performs the authentication and reassociation process with AP3 and continues the operation with AP3. In the explained procedure we completely eliminated the lengthy probing phase with couple of additional messages and we reduced the authentication and reassociation delay by transferring the mobile node context information prior to the hand off. With this the probing delay will become zero and the time required for authentication and reassociation will come to around 30ms. So, by following our approach the mobile node can finish the layer-2 handoff in around 30ms. This method does not have any of implementation overheads like graphs maintenance, clock synchronization and works with out any extra hardware requirements. This is best suitable for QoS assisted handoffs and can easily coexist with FMIP. 4



Conclusion



In this paper we proposed an efficient mobile controlled and network assisted handoff algorithm which eliminates the need of scanning phase and reduce the time required for authentication and reassociation. References 1. M. Shin, A. Mishra, and W. Arbaugh, “Improving the latency of 802.11 handoffs using neighbor graphs”, in Proceedings of the ACM MobiSys Conference, Boston, MA, June 2004. 2. Ishwar Ramani and Stefen Savage “Syncscan: Practical Fast Handoff for 802.11 Infrastructure Networks” IEEE InfoCom 2005. 3. Sang-Hee Park, Hye-Soo Kim, Chun-Su Park, Kyunghun Jang and Sung-Jea KO “QoS Provision Using Dual RF Modules in Wireless Lan”, European Symposium on Ambient Intelligence (EUSAI), Eindhovenn, the Netherlands, NOV8-10,2004, LNCS 3295, pp. 37-48,2004.



 EFFECT OF NETWORK POLICIES ON INTERNET TRAFFIC ENGINEERING PRIYADARSI NANDA, ANDREW JAMES SIMMONDS Faculty of IT University of Technology Sydney, PO Box 123, Broadway, NSW2007, Australia, e-mail: nanda/[email protected] An important objective of Internet traffic Engineering is to facilitate reliable network



operations by providing proper QoS to different services through mechanisms which will enhance network integrity and achieve network survivability. Current Internet architecture is very much distributed in nature interconnected by Internet Service Providers (ISPs), where the central goal of each service provider is to enhance emergent properties of their network by providing better service qualities with strong emphasis on economic considerations. Hence, service providers always aim at setting up their objective function based upon economic considerations and governed by their network wide policies to get the best result. In this paper we present a scheme in which Autonomous System (AS) relationships are central to any policy decision imposed by individual ISPs. Based on these policy relationships, we propose a framework which is expected to match the need for better QoS, uniform Internet wide service management and contribute efficiently towards traffic engineering. This paper presents an integrated approach comprising traffic engineering, routing and policy mechanisms for better management of QoS over the Internet. Keywords: Policy Based Network, Traffic Engineering, BGP



1



Introduction



With massive deployment of applications over the Internet in recent years and the need to manage them efficiently with a desired QoS in mind, the current Internet must shift from a Best Effort (BE) model to a service oriented model. Services requiring higher QoS currently have to satisfy their applications with the BE model, as there is no practical way of providing flow based guarantees. This is due to the fact that current Internet is very much decentralized in managing traffic flows between various Autonomous System (AS) domains, and involves functions such as: inter-domain routing, load balancing, traffic engineering, etc. Also due to heterogeneous and independent policy mechanisms used by individual ASS (designed and implemented by individual ISPs for their own benefits), end-to-end QoS path set up for various traffic is a complex task. Each ISP then uses their traffic engineering measures to balance traffic flows, both for incoming and out going traffic. Current Internet Traffic Engineering heavily depends on proper routing parameter configuration for both Intra and Inter Domain routing protocols such as Border Gateway Protocol (BGP) (Interior BGP within the domain and Exterior BGP between domains) in order to configure the routers across various domains. Due to rich policy features supported by BGP, it is also considered as the most important routing protocol especially for inter-domain traffic between various domains, hence it is the industry standard. In spite of several draw backs such as: complex decision 300



 30 1



process for path determination, inability to collect global policy on routing from other domains, unable to cope with route flapping, and higher convergence delay, BGP is still considered the industry de-facto standard for Internet connectivity. These drawbacks of BGP are due to its distributed nature of operation. Also, because BGP can be used to enforce network level policies involving routing decisions by different ASS, a minor error in configuring these parameters may result in some disastrous consequences. In this paper our aim is to extract AS level policies based on business relationships and Service Level Agreement (SLAs), and then apply BGP based traffic engineering techniques for efficient management of traffic over the Internet. Section two presents AS level policies and their relationship, while Section three describes BGP based traffic engineering approach by identifying the parameters which have a direct impact on the AS relationship given in section two. We present OUT proposed framework in section four having traffic engineering as the central component. Section five concludes the paper with hture directions. 2



AS relationships and their effect on Network wide Policy



Current Internet with more than 14,000 Autonomous Systems (ASS) reflects tremendous growth both in its size and complexity since its commercialization. These ASS may be classified in different categories based on number of users they serve such as Internet Service Providers (ISPs), organizations such as Corporations and Universities with their own administrative domains, etc. Based on the property of each AS and their inter connection strategies in the Internet, it is also important to develop different routing policies which may influence routing decisions for certain traffic to achieve desired end-to-end QoS for various applications. In [5], the author classifies the types of routes that may appear in BGP routing tables based upon the relationships between the ASS, and presents a heuristic algorithm giving the degree of connectivity by inferring AS relationships from BGP routing tables. Based on various network properties, type of connectivity between various domains and traffic transportation principles, these ASS may be classified [5] under the following categories: Stub, Multi-homed and Transit ASS. A stub AS is usually an end user’s internal network. One of the most important properties of stub network is that hosts in a stub network never carry traffics for other networks (i.e. no transit service). Multi-homed ASS have connections to more than one ISP for protecting against failure in one of its ISP connections as well offering load balancing between multiple ISPs. Transit ASS are described as multihomed due to multiple connections with other service providers and carry both local as well as transit traffic. Such networks are generally the ISPs located within the Internet hierarchy (tierl, tier2, tier3). Transit networks play an important role in terms of supporting QoS, load balancing using traffic engineering, and fault management across a whole range of Internet services. Also these transit networks



 302



must work co-operatively with a standard policy set to support QoS across the Internet. ASS can also be classified based upon various business relationships as well as contractual agreements. These relationships and agreements between the ASS play an important role to support end-to-end QoS and are fundamental to our research in this paper. They are classified as following: a. Customer-Provider relationship: In a customer-provider relationship scenario, customer buy services (network connectivity and service support) from its provider which is typically an ISP. Similarly, a tier3 ISP would buy some services their upstream tier2 service provider, etc. Network architecture supporting customer-provider relationship must address issues in relation with Service Level Agreements (SLA) enforced between the customer and its providers. b. Peer-to-Peer relationship: Two ASS offering connectivity between their respective customers without exchange of any payments are said to have a peering relationship. Hence these two ASS agree to exchange traffic between their respective customers fiee of charge. Such relationship and agreement between two ASS would mutually benefit both, perhaps because roughly equal amounts of traffic flow between them. Peer-to-Peer relationship may turn into customer-provider if one party depends more heavily on the interconnection arrangement than the other one [7]. Hence, ASS must monitor on a regular basis the amount of traffic flowing into their regions from their peers. c. Sibling relationship: Sibling relationship may be established between two or more ASS if they are closely placed to each other. In this situation, the relationship allows the administrative domains to provide connectivity to the rest of the Internet for each other. Also sometimes called mutual transit relationship, and may be used to provide backup connectivity to the Internet for each other. Sibling relationship between ASS may also be used for load balancing among various services. In order to design a new Internet architecture based on above AS properties and their relationship, it is important that the architecture must first of all support them and then derive related policies to enforce them across the Internet. Such an objective will then be able to answer the following issues when deployed across the Internet. 1. Resource management with multiple service support 2. End-to-End QoS for individual service 3. Load Balancing 4. Fault management 5. Facilitate Overlay routing 6. Security related to information sharing between ASS Using BGP, an ISP may select its own administrative policy to carry traffic for its own customer as well other ISPs customers (as a peer or transit provider). Though such an approach works reasonably well for most of the ISPs individually to satisfy their personal objectives and maximize their profits, it does not address the impact of such an approach on a global scale. Achieving end-to-end QoS for any



 303



application should have uniform and constant resource allocation throughout the path traveled between various domains. Hence our proposed framework is designed to answer those unsolved questions presently faced in the Internet traffic management. Before presenting the architecture, the following section presents a few of the BGP approaches for policy enforcement between ASS. We have tried to present those policy issues and how BGP is currently configured based on AS relationships as mentioned above. 3



BGP and its support for Internet Traffic Engineering



Border Gateway Protocol (BGP-4) [6] was a simple path vector protocol when first developed and the main purpose of BGP was to communicate path level information between ASS so as to control route selection process between them. Using such path level announcements by neighbors, an AS decides which path to use in order to reach specific prefixes. One of the main reasons ASS use BGP for Inter-domain routing is to enable their own policies to be communicated to their neighbors and subsequently across the whole Internet. A few traffic engineering strategies based on policy issues are outlined as following: 3.1 Use of local Preference to influence Path announcement: In a customer-provider relationship, providers prefer routes learned from their customers over routes learned from peers and providers when routes to the same prefix are available. In order to implement such a policy (prefer customer route advertisements) using the Local preference attribute, ISPs in general must assign higher local preference value to the path for a given prefix learned from their customer. In [8], the authors describe the use of assigning a non-overlapping range of Local preference values to each type of peering relationships between ASS and mention that local preference be varied within each range to perform traffic engineering between them. Hence Local preference attribute in BGP influences traffic engineering measures by controlling outgoing traffic between the ASS. 3.2 Use of AS path prepending and Multi Exit Discriminator (MED) attributes to influence transit and peering relationships: ISPs may influence the amount of incoming traffic to load balance on different links connected to their neighbors. Such a scheme can be implemented by selectively exporting the routes to their neighbors. BGP makes use of AS path prepending technique where an ISP can apply its own AS number multiple times and announce the path to its neighbor. Because BGP decision process [3] selects lowest AS path length, such a technique will then force the neighbor to choose another path if available instead of a prepended AS path announced by the neighbor. But such a scheme is often selected manually on a trial and error basis. In our model we propose to use AS path prepending only when the relationship between peers is based upon strict exchange of traffic without monetary involvement. Also, ISPs



 304



having multiple links to other ISPs can use the MED attribute in order to influence the link that should be used by the other ISP to send its traffic toward a specific destination. However, use of MED attribute must be negotiated before hand between two peering ISPs. In our proposed model, MED attribute is only used between transit ISPs having multiple links to other ISPs. 3.3 Use of community attribute for route export to neighbors: ISPs have been using the BGP community attributes for traffic engineering and providing their customers a finer control on the redistribution of their routes [9]. Internet Assigned Numbers Authority (IANA) typically assigns a block of 65536 community values to each AS, though few of them are only used to perform community based traffic engineering. Using these attributes, by tagging them into the AS path announcement an ISP may ask its neighbor or customer to perform a set of actions on the AS path when distributing that path from its neighbor [ l l , 141. By doing so, ISPs are able to control their incoming traffic in a better way. However, because community based attributes are yet to be standardized, there is a need for uniform structure in these attributes in order to apply them in the Internet. Also because each community attribute value requires defining a filter for each supported community in the BGP router, such process will add more complexity to an already fragile BGP and increase the processing time of the BGP message [3]. We defined a new community based attribute the “policy attribute” which can be used by neighbors to get further information on quality of service support by their announced path. Such a community attribute will be used in our future research for policy co-ordination between neighboring domains and is not discussed in this paper. In summary, AS relationships play an important role in the Internet connectivity and contribute significantly towards designing a new architecture for the Internet. BGP based traffic engineering can be made more scalable by carefully selecting and configuring the attributes based on business relationships between various ASS. We present a framework using such policy issues in Inter-domain routing and traffic engineering in the next section. 4



Policy based traffic engineering framework



Our proposed framework is based upon the fact that ISPs must communicate with their neighbors to get a fair picture about which relationships they must hold with them in order to apply specific traffic engineering policies at their respective domains. At the same time ISPs must also protect themselves against any route instabilities and routing table growth which may otherwise occur due to misconfigurations or problems in other ISPs. The overall components of the architecture are presented below, where our aim is to achieve a scalable solution if such framework is deployed across the Internet.



 305 ISP Business Relationship and Higher level Traffic control



Intra domain relationship and Device Configuration



Proposed Framework integrating AS policy with traffic engineering The entire architecture is represented as a 3 layer model and in this paper we mainly consider our frame work at layer-2, the middle layer which holds necessary components describing how network policies can be included in traffic engineering. Also, our prior discussions on ASS relationship in section 2 revealed that such relationships are important to consider for supporting proper QoS in the Internet. But obtaining data on ISP relationship is a difficult task, as no ISP reveals such data to their competitors. Hence we propose to use measurement based approach where an ISP collects and ranks the ASS based on the frequency of its presence in the routing table. A heavily used AS in the path list is the one where some kind of traffic engineering should be applied if selected for next hop forwarding. For example the decision of selecting Local preference is very much local to an ISP in order to balance its outgoing traffic (selecting the path to forward to the next ISP). On the other hand, an AS which is used less frequently has better chance to support QoS resources and is less congested [ 151. Traffic Engineering Mapper (TEM) is a repository that holds AS relationships and the hierarchy for interconnectivity between various ASS. TEM is responsible for directing those relationships to the Attribute Selector as well maintaining a list of those attributes once selected. Because the routing table holds necessary information regarding import and export policy filters, as well the attributes associated with them, TEM also investigates their validity in the ISP routing base. One of the export rules which the TEM must perform is directing the provider to send all the routes that the provider knows from its neighbors. Similarly TEM must ensure other peer or providers routes are not sent while sending routes to a provider or peer. TEM is an essential component of our architecture. Finally the decision on traffic engineering is taken by the Load Balancing module which receives necessary inputs regarding which attributes to be applied and to which paths they should be applied. The policy database holds policy information regarding how the AS may change routing for its own domain. Also included in the policy database is information on list of remote ASS which are customers of this AS, and pricing structures imposed by the SLAs of its providers.



 306



Such information is given to the load balancing module which then takes a final decision on traffic engineering. The process is the same for both importing and exporting a route between neighboring ISPs. Several efforts on finding right solution to BGP based traffic engineering and Autonomous system relationships have been explored in the past [ 1,3,5,8-12,141. While most of the authors describe the drawbacks of BGP in the first instance and then propose their schemes on better management of BGP for traffic engineering, our approach is somewhat the reverse. In our proposed framework we have considered the relationship issue between ISPs as the central module in defining necessary traffic engineering policies for the Internet. Based on our framework, we have performed few preliminary experiments to investigate the effect of AS relationship policies on Traffic engineering. The experiments are conducted using OPNET Modeler 11.O. In our simulation experiments two different ASS are connected to each other through their respective ISPs using simple, multi-home, peering and sibling relationships. Both network convergence and throughput with and without policy are investigated and it is observed that, though with policy network convergence becomes slower, but the overall throughput is improved for different applications. Due to space limitation detailed results are not given in this paper.



5



Conclusion and Future work



In this paper we have investigated various relationship issues between ASS and how to use them for resource control over the Internet. Current Internet connectivity between ASS is quite complex because of heterogeneous policy structure which sometimes forces Internet traffic to travel through a back door, affecting the AS’S own traffic and downgrading business relationships with its customers. Such approaches may look good for those ASS trying to adopt a backdoor strategy for forwarding their traffic, but results in disaster consequences for others. In this paper our effort has been to propose a framework which can address load balancing based on AS relationship, resource management through selecting proper traffic engineering attributes, and provide support for end-to-end QoS over the Internet. Our future work in this area of research will be to explore validity of our proposed model along with other components of our architecture. We also aim to investigate the significance of our architecture addressing issues such as fault management, policy based overlay routing, and security related to information sharing between ASS.



 307



References 1. Feamster N., Winick J., and Rexford J., A model of BGP routing for Network Engineering, SIGMETRICSIPerformance’04,June 12 - 16,2004, New York, USA 2. Awduche D. 0. et al, in Afi-ame work for Internet Traflc Engineering, Drafi2, IETF 2000 3. Quoitin B. et al, in A Pegormance evaluation of BGP based traflc Engineering 4. Yavatkar R., Pendarakis D., and Guerin R., Afvame work forpolicy based admission control, RFC 2753, January 2000 5. Gao L., On Inferring autonomous system relationships in the Internet, IEEE/ACM Transactions on Networking, Vo1.9, No. 6, December 2001 6. Rekhter Y. and Li T., A border gateway protocol 4 (BGP-4), Internet Draft-ietf-idr-bgp4-17.txt, January 2002 7. Huston G., Peering and Settlements- Part-1, The Internet Protocol Journal, CISCO Systems 8. Caesar M. and Rexford J., BGP routing Policies in ISP networks, UC Berkeley Technical report, UCB/CSD-05-1377, March 2005 9. Quoitin B. et al, in Internet Traffic Engineering with BGP, IEEE Communications Magazine, May 2003 10. Agarwal S., Chuah C. N., and Katz R., OPCA, Robust Interdomain Policy Routing and Traflc Control, OPENARCH 2003 11. Uhlig S., Bonaventure O., and Quoitin B., Interdomain traffic engineering with minimal BGP configurations, 18th International Teletraffic Congress, 3 1 August - 5th September 2003, Berlin, Germany. 12. Di Battista G., Patrignani M. and Pizzonia M., Computing types of relationships between autonomous systems, IEEE INFOCOM 2003 13. Quoitin B. and Bonaventure O., A co-operative approach to Interdomain traflc engineering, NGI 2005 14. Quoitin B. et al, in Interdomain traflc engineering with BGP, Computer Communications (Elsevier) 15. Yahaya A. D. and Suda T., iREX: Inter-domain QoS Automation using Economics, IEEE Consumer Communications and Networking Conference, CCNC-2006, January-2006, USA



 MODIFIED CLUSTER BASED BROADCASTING IN AD HOC WIRELESS NETWORK ASHOK PURANIK Directorate of Technical Education, Kanpur, U.P., India E-mail: [email protected]



SHRI KRISHNA PANDEY Amity University, Gautam Budh Nager, Uttar Pradesh, India E-mail: pandey-shrikrishna@redQjjail. corn In an ad-hoc network, a packet is transmitted by a node is received by its neighbors. Redundant transmissions cause congestion, contention and collisions. The total numbers of transmissions (Forward Nodes) are used as cost criterion for broadcasting. The basic objective is to reduce broadcast redundancy by minimizing number of forward nodes without introducing excessive overhead and save scare resources such as energy and bandwidth. The problem of finding the minimum number of forward nodes is NP-complete. In this paper, cluster architecture of network is used. It chooses forward node set (consists of clusterheads and gateways) to relay the broadcast packet. Each clusterhead computes its forward node set that connects its adjacent clusterheads. A non-clusterhead node just relays the broadcast packet if it is selected as a forward node or else it does nothing. So broadcast operation is restricted only to clusterheads and nodes in locally selected forward node set. The information of clusterheads is piggybacked with broadcast packet to further reduce each forward node set. In simulation, effect of node transmitter range is checked on the performance of algorithms in the terms of average number of Forward Nodes, Broadcast Redundancy, and Broadcast Delivery Rate.



1



Introduction



An ad hoc wireless network [l] is a collection of wireless mobile hosts forming a temporary network without the aid of any centralized administration or standard support services. Each host commits to work as host as well as router .The inherent limitations of the ad-hoc wireless network is bandwidth, energy and dynamic topologies, requires routing protocol should be simple, efficient and robust .In general, an ad hoc wireless network can be represented as a unit-disk graph G=(V,E), where V represents a set of wireless mobile hosts and E represents a set of 308



 309



edges . An edge (u,v) indicates that both hosts u and v are within their transmission range and connections are based on geographic distances of hosts . In Broadcasting, a packet is delivered to all nodes in a network is a basic operation and has extensive application in ad-hoc network such as route query process in several routing protocols such as Dynamic Source Routing (DSR) [2], Ad-hoc on-demand Distance Vector Routing (AODV) [3], Zone Routing Protocol (ZRP) [4] etc, sending an error message to erase invalid routes or an efficient mechanism for reliable multicast in fast moving ad-hoc wireless network. In wireless network [l], a packet transmitted by a node can reach all its neighbors. The total number of transmissions (forward nodes) is used as cost criterion for broadcasting. The problem of finding minimum number of forward nodes is NP-complete .Source and forward nodes form a flood tree [5] such that any other node in network is adjacent to a node in a tree. Efficient Broadcast algorithm provides every node receive the broadcast packet and requires only a few nodes (forward nodes) to forward the broadcast packet and reduce broadcast storm problem .Main aim is to reduce forward nodes for reducing redundancy and save scarce resources such as bandwidth and energy .



2



Related Work



A straightforward approach for broadcasting is blind flooding, each node will be obligated to rebroadcast the packet whenever it receives packet first time and generates many redundant transmissions redundant transmissions may cause a more serious broadcast storm problem [6] causes contention and collisions.



Figure 1. Redundant Transmissions by blind Flooding



Many broadcast algorithms besides blind flooding are proposed [5],[6],[7],[8],[9],[ lo]. These algorithms utilize neighborhood to reduce redundant packets In [5], Lim and Kim prove that building a minimum flood tree is an NPcomplete problem. A subset of nodes is called a dominating set if every node in network is either in the set or a neighbor of a node in the set. Two approximation algorithms: - Self Pruning and Dominant Pruning. The Dominant Pruning (DP) algorithm [5] is one of the promising approaches that utilize 2-hop neighbourhood information to reduce redundant transmissions and



 310



consider as an approximation to minimum flood tree problem .Further extended by two algorithms Total Dominant Pruning [TDP] and Partial Dominant Pruning [PDP] [ 11 utilize 2-hop neighbourhood information effectively. The forward node list is selected to cover all the nodes with in two hops. TDP will generate a smaller number of forward node set than DP but introduces some overhead when broadcast packet piggybacks 2-hop neighbourhood information while PDP without using piggyback techniques, directly extracts the 1-hop neighbours of common neighbours of both sender and receiver from receiver’s 2-hop neighbour set. PDP avoids the extra cost as in TDP and achieves the same performance [7]. Peng and Lu [8] propose a scalable broadcast algorithm; node does not rebroadcast the broadcast packet if all of its neighbors have received the packet from previous transmissions. In [9] , Stojmenovic et a1 study a CDS based broadcast algorithm that uses only internal nodes to forward the broadcast packet .Internal nodes are dominating nodes derived by Wu and Li’s marking process [ 101 i.e. nodes that are not internal nodes only receive the broadcast packet without forwarding it and reduces the redundant transmissions. . In conclusion, CDS based routing [ 101 is good solution as it centralized the whole network into small sub network and there is no need to recalculate routing tables. So for an efficient routing to design distributed algorithm for calculation of CDS in adhoc wireless network where connections of nodes are determined by their geographical distances. 3



Proposed Work



The clustered architecture [11],[12],[13] of an ad-hoc network is a 2-level hierarchical network converts a dense network into sparse network and has node information locally which is suitable for scalability, so an efficient broadcasting protocol based on the cluster architecture is proposed and choose a subset of nodes , called forward nodes, to relay broadcast packet .Each clusterhead computes its forward node set to cover other clusterheads within its vicinity. A non-clusterhead node just relays the broadcast packet if it is in forward node set otherwise does nothing. The forward node set is computed such that all the clusterheads in network can be connected and broadcast packet is delivered to entire network. The information about the forward node set is also piggybacked with the broadcast packet to fiuther reduce its forward node set Therefore , the broadcast operation is restricted only to clusterheads and nodes in locally selected forward node sets. 3.1



CLUSTER FORMATION AND MAINTAINANCE



In a clustered network, network is partitioned into clusters with one clusterhead per cluster and neighbours directly joining to clusterhead in cluster as non-clusterhead



 311



members. Clusterheads are elected using election process based on lowest-id, highest-degree and LCC (least cluster change) [3],[12],[14]. A gateway is a nonclusterhead node in a cluster that has a neighbour in another cluster (i.e. gateways are needed to connect clusterheads to-gather). The clusterheads and gateways together form a CDS of network. The cluster -based broadcast algorithm only requires nodes in this CDS forward the broadcast packet while other nodes do not participate in packet forwarding process. In proposed work, lowest-id algorithm [ 121 is used for clusterhead election. Once a cluster is formed, a non-clusterhead node never challenges the current clusterhead. If a clusterhead moves into an existing cluster, clusterhead of higher ID gives up its role of clusterhead. If a clusterhead moves out of a cluster, the remaining non - clusterhead nodes in this cluster determines their new clusters. A node that has clusterhead neighbours will take adjacent clusterhead has lowest ID as its new clusterhead and joins in that cluster. Otherwise cluster formation process applied to form new clusters.



3.2



Adjacent Cluster head Information Gathering



There are two ways to define a clusterhead v's adjacent clusterhead set C (v): 1. 3-hop coverage: C (v) includes all clusterheads in N3 (v). 2. 2.5-hop coverage: C (v) includes all clusterheads in N2 (v) and clusterheads that have non -clusterhead members in N2 (v). N'(v)-> represents the i-th hop neighbour set of nodes whose distances from v are within i hops . A node is called a 1-hop (2-hop) gateway if it is used to connect an adjacent clusterhead that is 2-hops (3-hops) away. A clusterhead v compute its adjacent clusterhead set C(v) from the neighbour set information periodically sent by 1-hop and 2-hop gateways .The clusterheads in C(v) are grouped into two classes :1. The clusterheads that are 2-hops away from v represented by C2(v). 2. The clusterheads that are 3-hops away from v represented by C3(v) i.e. C(v) =C2(v) u C3(v) For neighbors cluster information, each node exchanges information with its neighbor.



 312



Figure 2. A network showing 2.5 Hop coverage area of Node v 3.3



Forward Node Set Selection and Reduction



Clusterhead v’s forward node set is a subset of gateways by which v connect to the clusterheads in C(v) , v connects to a clusterhead in C2(v) via a 1-hop gateway and it connects to a clusterhead in C3(v) via two 2-hop gateways. At each clusterhead, greedy algorithm is used to determine forward node set. When a clusterhead v receives a packet from another clusterhead u via u’s forward node set F(u), v selects a minimum number of gateways to form it’s forward node set F(v) , by which v connect to all the clusterheads in C(v). The forward node set is organized as{fi ,Rp} where fi is a 1-hop gateway used to connect to clusterheads in C2(v) and Ri is a set of 2-hop gateways that are neighbours of fi and are used to connect to clusterheads in C3(v). Ri may be empty if none in C3(v) is considered. In beginning, all the clusterheads in C(v) are uncovered . When a forward node  is selected ,some clusterheads in Cz(v) andor C3(v) are covered . f is selected if it can cover the maximum number of uncovered clusterheads in C(v) directlyhndirectly via its neighbours , After f is determined, R can also be determined in a similar way. Assume F(v) ={, . .. . ..} is v’s forward node set, U(v) is subset of C(v) which is uncovered . At the ith iteration, the forward node fi is selected from Nl(v). S(fi) is the subset of clusterheads in U(v) that is covered by fi and consists of two sets :1. Clusterheads in C2(v) that is directly covered by fi . 2. Clusterheads in C3(v) that fi covers indirectly via nodes in Ri. The fi with the maximum size of S(fi) will be firstly selected .Set Ri ={ rj I rj E N’(fi) n Nz(v)} where rj is a 2-hop gateway that can cover the clusterheads in C3(v). The selection of rj is same as used for 6.



 313



Figure 3. A Forward Node  of cluster head v



Consider two adjacent clusterheads u and v, clusterhead u broadcast packet uses its forward node set to deliver a packet to all clusterheads in C(u), it piggybacks C(u) and u with the broadcast packet reaches clusterhead v where it is used by the receiver to prune more clusterheads from the coverage . Since all the clusterheads in C(u) u {u} are covered by u’s forward node set , they don’t need to be covered again when v computes its forward node set . In 2.5 -hop coverage method, if v is 2-hop away from u and u uses a path (u,f,r,v) to deliver the packet to v. Clusterheads in N’(r) that are not in c(u) also receive the broadcast packet and excluded from U(v) .Therefore U(v)=C(v)-C(u)-{u} -N’ ( r ) . 3.4



Broadcast Process and Termination



When a source node initiates a broadcast process, it follows steps:1. If the source is not clusterhead, it just sends packet to it’s clusterhead. 2. When a clusterhead receives the broadcast packet for first time, it chooses its forward node set to forward packet to all its adjacent clusterheads. The adjacent clusterheads of forwarding clusterhead are piggybacked with broadcast packet for forwarding purpose. If the received packet is a duplicate one, the clusterhead does nothing. 3. When a non-clusterhead node receives the broadcast packet for first time and if it belongs to forward node set, it relays packet. By this approach, all clusterheads in network will receive the packet .All clusterheads re-broadcast packet in their clusters, so all nodes in entire network will receive the packet.



 314



2



7



4 J



0 -"nmclurlnad head



Figure 4. A Simple Clustered Network



4



Performance Simulation



In simulation, average number of forward nodes generated, broadcast redundancy and broadcast delivery rate at different node transmitter range for different algorithms.



4.I



Average Number of Forward Nodes



Average numbers of forward nodes are generated till broadcast process terminate.



Nmrnb., d N d . . b N . M



N"mbrol,-~".wml



Figure 5. The average number of forward nodes with transmitter range 25 to 70



 315 4.2



Broadcast Redundancy



The broadcast redundancy is defined as the average number of broadcast packets a node receives. This is directly related to number of forward nodes.



---



.



a



,



*



-



-



t



Figure 6. The average number of packets a node receives with transmitter range 25 to 70



Above figs shows that number of forward nodes decreases results into decrease in average number of packets received by a node decreases . So , broadcast redundancy is reduced.



 316



4.3



Broadcast Delivery Rate



The broadcast delivery rate is defined as the number of hosts that receive the packet over the total number of hosts in network .



Figure 7. The Broadcast delivery rate at transmitter range 25 to 40



Above figs shows that host movement causes collisions and links failures. So as speed of host increases broadcast delivery rate decreases as transmitter range increases. Results The number of clusters is relatively stable while the size of network increases. The way of selection for cluster construction affects the size of forward node set. The proposed algorithm has good performance when the node’s transmission range is small and number of nodes is large. The proposed algorithm is relatively insensitive to size of network. Conclusion



In this paper, broadcast algorithm in a clustered network to relay the broadcast packet by using the forward node set is described. This is based on 2.5-hop coverage method and suitable for scalability with keeping node information locally. The broadcast operation is restricted only to clusterheads and the nodes locally selected forward node sets and generate small forward node set. The broadcast algorithm is more efficient and the forward node set is relatively stable for different



 317



sizes of the networks. This algorithm reduces the broadcast redundancy and save scarce resources like bandwidth and energy without introducing excessive overhead and time-delay measured by sequential rounds of information exchanges.



References 1. W.Lou and Jie Wu, “On Reducing Broadcast Redundancy in Ad- hoc Wireless Networks”, IEEE Transaction on Mobile Computing vol-1 no2 2002. 2. D.B.Johnson and D.A.Maltz ,Mobile Computing , Dynamic Source Routing in Ad-hoc wireless network, pages 153-18 1,Kluwer Academic Publishers,l996 . 3. C. Perkins and E.M. Royer, “Ad-hoc on-demand Distance Vector Routing” Proc .2ndIEEE WMCSA ,pages 90-100, Feb 1999. 4. M.R. Pearlman and Z.J.Haas, “ Determining the Optimal Configuration of the Zone Routing Protocol ,” IEEE Journal Selected Areas in Communication vol -17,no8 pp 1395-1414 Feb 1999. 5. H.Lim and C. Kim, “Flooding In Wireless Ad-hoc Network” Comp. Comm. Journal vol24 no 3-4 pp353-363,2001 . 6. S.Ni ,Y.Tseng ,Y.Chen and J.Sheu, “The Broadcast Storm Problem In Mobile Ad-hoc Network ” Proc.MOBICOM 99 ppl5 1-162 Aug 99. 7. A.Qayyum, L.Viennot and A.Laouiti “Multipoint Relaying For Flooding Broadcast Message In Mobile Wireless Network,” Proc.Hawaii Intel con’f systems sciences 35 Jan 2002. 8. W.peng and X.C.Lu, “On The Reduction Of Broadcast Redundancy In Moile Ad-hoc Network ,”MOBICOMM pp 129-130 Aug 2000 . 9. I.Stojmenovic, S.Seddigh and J.Zunic,“Dominating Sets and Neighbour Elimination Based Broadcasting Algorithms in wireless Networks,”IEEE Personal Comm. Vo16 no 2 pp 46-55 1999. 10. J.Wu and H.Li , “ On Calculating Connected Dominating Sets For Efficient Routing In Ad-hoc Networks “ proc of ACM DIALM 99 pages7-14, Aug 1999. 11. C.C.Chiang and M.Gerla,“Routing And Multicast In Multihop Mobile Wireless Networks,” Proc. Of IEEE Intl. Conf. on universal personal comm.,1997. 12. M.Gerla and J.Tsai, “Multicluster, Mobile, Multimedia Radio Network,” Wireless Networks pp 255-265 1995. 13. P.Krishna,N.H.Vaidya,M.Chatterjee, D.K. Pradhan,“A Cluster Based Approach For Routing In Ad-hoc Networks,” Proceedings of the Second USENIX Symposium on Mobile And Location-Independent Computing pages 1-1 0,1995. 14. A.Ephremides, J.E.Wieselthier and D.J.Baker “A Design Concept For Reliable Mobile Radio Networks With Frequency Hopping Signaling.” Proc. of the IEEE 75(1): 56-73 ,1987.



 This page intentionally left blank



 Natural Language Processing



 This page intentionally left blank



 AN IMPLEMENTATION LEVEL FORMAL MODEL FOR JAVABEANS BHIM PRASAD UPADHYAYA



Department of Computer Science Maharishi University of Management 1000 North 4th Street, Fairfield, I A 52557, U S A E-mail: [email protected] BIRENDRA KESHARI



Department of Comptuer Science and Engineering Kathmandu University P. 0. Box 6250, Kathmandu, Nepal E-mail: birendraQku. edu. n p A formal model containing design level formalisms provides the concepts, notations and properties that can be used while designing and implementing the software of any size and complexity with JavaBeans. This abstraction level does not provide much information in debugging the incorrect programs. Hence, it is necessary to introduce a level of formalisms that provides a way of executing the bean programs step by step on a computer. This paper presents formal aspects of specifying and executing bean programs by refining a formal model with the aid of an observational semantics.



1. Introduction



The formal model containing design level formalisms presented inUL04a~UL04b1UL04c provides the concepts, notations and properties that can be used while designing and implementing the software of any size and complexity with JavaBeansTM. This abstraction level does not provide much information in debugging the incorrect programs. Hence, it is necessary to introduce a level of formalisms that provides a way of executing the bean programs step by step on a computer. This paper presents the formal aspects of specifying and executing bean programs with the aid of an observational erna antic^^^^^^ by refining the formal model presented in,UL04b 321



 322



A formal model of a language can be constructed by defining set of algebraic l a w ~ . ~ Programs ~ ~ ~ can ~ y be developed ~ ~ ~ +through ~ ~ stepwise refinement following set of r ~ l e In the~ implementation . ~ level, the operational detail of JavaBeansTM are provided by presenting the formal analysis of Java interfaces, characterizing events, properties, introspection through naming conventions (well-formedness conditions) and signature patterns. This paper also presents the extension of MCode presented in.UL04b 2. Formalisms for the J a v a Interface Specification The design level interface includes a set of Java interface methods as one of its members. The methods declared outside the set J I M D e c are usually implemented directly by a JawaBean class. Those methods which are actually needed to model the seams of a system are gathered in the set J I M D e c so as to be declared in Java interfaces which are implemented by the corresponding JawaBean class. The primitive Java interface can be specified as



I : (CDec,JIMDec) There are no denotational changes as compared toULOIcand to conform with the Java programming language following condition is imposed. Two Java interfaces are composable if no constant is declared with different type in CDecl and CDec2. Mathematically,



VT1 x1 E I1.CDeclb'T2 x2 E I2.CDec(xl



= 2 2 + T I = T2)



Definition 1. (Java Interface Inheritance) Let JIl and JI2 be two Java interfaces such that no field in JI1 is redeclared in JI2. Then the interface JIl extends JI2 is defined as CDec =df CDecl U CDec2



JIMDec



=df



JIMDecl U JIMDec2



3. Supporting JavaBeansTM Features with Java Classes The JawaBeans and Java classes are in the same level of granularity. Granularity here means the finer details that are taken into account. In absence of separate keywords and syntactical rules all the JawaBeans features have to be implemented with the help of available Java language features. This section extends and applies the FOOSLLJLLo3in order to specify JawaBean classes.



~



 323



3.1. A JavaBean Class



A bean declaration cdecls is of the form: cdecls



::= cdecllcdecls where



cdecl



is a restricted Java class declaration for a bean. A bean’s properties are declared as private fields. Hence, the class declaration with partial definition is of the form:



class cll e x t e n d s c12 { private U1 u1 = a l l Uz uz = a2,. . . Urn u, public ( U l ) g e t u l ~ ) { c g p l } public (void) setul(new U1 u l ) { c s p l } public (U2) getuz~){c,p2} public (void) setu2(new U2 u2){cspz}



= a,;



public (urn)getum(){cgpm} public (void) setu,(new Um U m ) { C s p m } public methodl(T1lzl’T12 1 / 1 i T 1 3 z l ) { c l } public method2 (2121 2: I T22 2 2 1 T23 & ) { c 2 }



Well-formedness For a read-only property, the property should be accompanied by a getter method that follows the signature pattern. The return type should be same to that of the type of the property. A read as well as write property has to be accompanied by a getter and a setter method. The well-formedness condition for getter method of the read and write property is same to that of the read only property. The setter method should also be formed following the signature pattern. This method does not return data. Further, the type of input parameter should be same to that of the type of the corresponding property. The order of method declaration does not matter. Both getter and setter are required to be public methods. 3.2. Supporting Events An event support in a JavaBean class is reflected by incorporating a pair of registry methods. Also, the presence of an event requires the listener bean to implement respective event handling method. It can be implemented in



 324



two different ways-directly in the body of listener bean class and indirectly in a dummy listener (adaptor class).



class cll extends cZz{ public (synchronized) (void) addEL,(EL, eLis){c,l,_} public (synchronized) (void) removeEL, ( E L , eLis){ c,ld, }



1 where n



20



Well-formedness The name of the registration method should be formed by combining the string add t o the event listener type name. The type of the parameter to be passed should be of type event listener. Similarly, the event listner type name is prefixed by the string remove while forming the de-registration method name. The type of formal parameter should be the same to that of the formal parameter in the registration method. Both the registration and de-registration methods are required t o be synchronous methods. This implies that the event propagation is synchronous. Both of these methods should not return any value.



3.3. Incorporating Bound Properties



class ell extends cia{ public (synchronized)(void) a d d P L ( P L p L i s ){cplr} public (synchronized)(void) r e m o v e P L ( P L pLis){cpld}



1 Well-formedness The presence of bound property should be reflected by declaring corresponding registration and de-registration methods which the listeners can invoke directly or indirectly. However, this is not one-to-one relationship. While forming the registration and de-registration methods names it is necessary to suffix corresponding property change listener type



 325



name. The same string should appear in the type name of the formal parameters. The other well-formedness conditions are same to that of simple properties.



3.4. Incorporating Constrained Properties The presence of constrained properties is reflected by including a pair of vetoable change listener registry methods. Also, it has to be essentially reflected in the code of corresponding setter methods. In actual Java program the corresponding setter methods also throw the exception Property VetoException. Generally, constrained properties require more services than the bound properties. All the constrained properties are bound properties but not vice-versa.



class cll extends cl2{ public (synchronized)(void) addPL(PL pLis){cPzr} public (synchronized)(void) removePL(PL pLis){cpldj public (synchronized)(void) addVL(VL wLis){c,zr} public (synchronized)(void) removeVL(VL vLis){c,ld j



1 Well-formednessThe well-formedness conditions for constrained properties are same to that of bound properties except the listener type. Here, it should be vetoable type instead of simply property change listener type. 3.5. Indexed and Boolean Properties Support



There are two different ways that the indexed properties are retrieved and set-first, retrieving or setting single element within the list and second, retrieving or setting all the elements within the list. Following JuwaBean class captures these essentials.



class cll extends cl2



 326



public (U,) getu,(int idx){cgipsm} public (U, [I) getu, (1{Cgipam } public (void) setu,(int idx,new Urn urn) {Csipsm}



public (void) setu,(new



U, u,[]){csipam}



} where U[] refers t o the array of element of type U . Now, the interpretation for well-formedness conditions should be straight forward. Boolean properties are the properties with binary value. Hence, following captures the essentials. class cll extends c12



{ public (bool) isu, () { cgblpm} public (void) setu,(new bool u,){cS~lpm}



1 where bool is a type that has only two truth values-true or false. Interpretation for well-formedness condition is straight forward as before. 4. Predicative Analysis of Introspection



This section shows how predicative logic can be used.



4.1. Extracting Information f r o m Well-formedness Conditions The well-formedness conditions presented in Section 3 can be used t o extract information(the complete list is available inULo4‘).For example, Number of read only properties can be calculated as: count(read0nlyProperty) = card(I.GetMDec) - card(I.Set M Dec) where count(x) gives the occurrences of x and card(X) gives the cardinality of the set X . The algorithm for ‘card’ is simple string matching and counting algorithm. 4.2. Determining Signature Patterns



Along with the naming conventions, signature patterns are also to be checked in order to infer about a bean as parameters and hence the sig-



 327



nature are important for the operation of a bean. These are done by low level reflection, so string matching algorithms can be used for introspection. However, we can capture essential logic through predicative logic in order t o infer about a bean. For example (complete list can be seen inuLo4'), de-registration signature pattern can be hunted as below: isDeRegistrationTo(op,event) = is(op,removeEventListener) A returns(op,type(event)) A h a s P a m m e t e r 0f Type(op,type(event)) 5. Implementing



In order to establish a link between the abstract model and implementationoriented formalisms sets containing program texts are introduced. For example (complete list available inULo4') pText is used to denote the declaration program text of a property p . Hence, for n number of properties the set PDecText contains respective declaration program texts. Symbolically, PDecText = { p l T e x t , p z T e x t , .. . ,p,Text} With the aid of above definitions and notations the following implementation refinement laws have been identified for the correct implementation. (1) (v;P ) Limp ull (2) (a) card(I.PDec) = card( I .GetM Dec) = card( J B . G e t M T e x t ) (b) Number of read only properties = card(I.PDec) - card( J B S e t M T e x t ) (c) Number of read and write properties = card( J B . S e t M T e x t ) = card(I.SetMDec) (3) ( M S p e 4 o p ) ;P ) L i m p ( M T e x t ) The following domain restriction relation should always hold when contracts are implemented by means of properties. (JB.PDecText U JB.ADecText U J B . B D e c T e x t ) a Ctr.MSpec 6. Related Work



We adopted refinement to formalize the replaceability of services offered by a JuvaBeun and a JavaBean itself. Developing programs through step-wise



 328



refinement is a commonly used technique.Wi'71~BSR03 In,wir71a task (performed by an application) is divided into subtasks and data (specification) are decomposed into data structures. More importantly, a refinement step distinguishes itself by making some design decisions. InlBSRo3every refinement step adds some features in order to create a complex program from a simple program. 7. Conclusions In this paper, we presented an implementation level formal model for JuvaBeans which refines the design level formal model presented in our earlier work. The model supports unambiguous lower level specification and implementation of JavaBeans. Contract based replacement allows implementation of the design level model with future releases of Java technologies too. Future work includes formal modelling of COM/DCOM and CORBA that leads to a general component calculus.



References [BSCOS] Paul0 Borba, August0 Sampaio, and Marcio CornBlio. A refinement algebra for object-oriented programming. In Luca Cardelli, editor, LNCS, number 2743, pages 457-482 [BSROS] Don Batory, Jacob Neal Sarvela, and Axel Rauschmayer. Scaling stepwise refinement. In Proc. of 25th International Conference on Software Engineering, Oregon, May 2003. IEEE computer society. [HHJ+87] C. A. R. Hoare, I. J. Hayes, He Jifeng, C. C. Morgan, A. W. Roscoe, J. W. Sanders, I. H. Sorensen, J. M. Spivey, and B. A. Sufrin. Laws of programming. Communications of the ACM, 30(8):672-686, August 1987. [Hoa93] Charles Antony Richard Hoare. Algebra and models. In Proceedings of the 1st ACM SIGSOFT symposium on Foundations of software engineering, volume 18, pages 1-8. [Hoa99] Charles Antony Richard Hoare. Theories of programming: top-down and bottom-up and meeting in the middle. In Proceedings of 26th ACM SIGPLAN-SIGACT Symposium on Principles of Programming Languages, Texas, 1999. [LJLLOS] Zhiming Liu, He Jifeng, Xiaoshan Li, and Jing Liu. Unifying views of UML. Report 288, UNU/IIST, October 2003. [ULO4a] Bhim Prasad Upadhyaya and Zhiming Liu. A design level formal model for JavaBeansTM. In Nagib Callaos, William Lesso, Shusaku Nomura, and Jianna Zhang, editors, Proc. of the sth SCIZOO4, pages 191-196, Orlando, July 2004. [UL04b] Bhirn Prasad Upadhyaya and Zhiming Liu. Formal support for development of JavaBeansTM component systems. In Proc. of the 28th COMPSAC 2004, pages 23-28. IEEE Computer Society.



 329 [ULO~C]Bhim Prasad Upadhyaya and Zhiming Liu. A Formal Model for JavaBeansTM. Report 305, UNU/IIST, September 2004. Also available online at www.iist.unu.edu/newrh/ III/l/docs/techreports/report305.pdfas at Nov 2005. [Wir7l] Niklaus Wirth. Program development by stepwise refinement. Communications of the ACM, 14(4):221-227, April 1971.



 HANDLING HONORIFICATION IN DOBHASE: ONLINE ENGLISH-TO-NEPALI MACHINE TRANSLATION SYSTEM



BIRENDRA KESHARI, JAGANNATH BHATTA AND SANAT KUMAR BISTA Information and Language Processing Research Lab Kathmandu University Dhulikhel, Kavre, Nepal E-mail: { birendra,jbhatta,sanat} @ku.edu.np



In machine translation systems, the problem of computing social status of individuals from the text in the source language that doesn’t have elaborate honorification system, and using this information to generate text in target language having elaborate honorification system is non-trivial. In this paper, we present techniques to compute social status of the individuals in English source text and encode it in the parse tree. The social status is divided into different levels according to Nepali language honorification system so that proper Nepali honorific morphemes can be directly generated from the parse tree. The technique can be extended to handle honorification in machine translation from English to other languages with honorification system similar to Nepali.



1. Introduction



Dobhase” is an first order logic encoded, rules based online English-toNepali machine translation system which relies on transfer based machine translation approach. The system analyzes English text and applies syntactic and lexical transfer rules to obtain Nepali sentences. Similar to other languages of the world that have honorification system like Korean, Japanese, Hindi etc., Nepali has elaborate system of honorification. Knowledge of the social status of the addressee is required while speaking Nepali sentences correctly. In Nepali, the linguistic realization of honorification is manifested by honorific morphemes and words. In English, there are no direct evidences like honorific morphemese to ease the computation of social status of individuals. Due to the divergence between the aThis ongoing project is supported by PAN Asia ICT R&D grant and is available online at http://nlp.ku.edu.np.



330



 331



language pair, it is necessary to compute social status of the individuals in the source English text by some means in such a way that this information could be used to generate Nepali text with proper honorification for quality translation. To the best of our knowledge, presently, there is no machine translation system that outputs Nepali text with proper honorification. This is the compelling factor for this research work. Honor neutral output makes the text sound unnatural and sometimes ambiguous. Most of the MT systems that include honorific target language do not heavily deal to generate proper honorification. One of the main reasons is that they don't go beyond sentence level. However, systems that are supposed to translate multiple sentences at a time, must generate text with appropriate honorification. We discuss the approaches that can be used to generate Nepali text with honorification. In our approach, social status of individuals are computed mainly by the help of bilingual lexicon and pronominal anaphora resolution. For this, sufficient information about social status of nouns are put in the bilingual lexicon. We also need to mark other categories that take part in Nepali honorification with appropriate honor-level information. The information computed by these methods is used to generate proper humble words and honorofic morphemes at later stages during Nepali morphology generation. We discuss about the related works in section 2. Section 3 discusses about the honorification system in Nepali language. In section 4, we describe approaches used to compute social status/honor-levels in English sentences and techniques used to generate proper honorific morphemes in Nepali sentences. Conclusions have been finally presented in section 5.



2. Related Work



Some of the related works done for other languages that are sources of inspiration are discussed below. Ref. 1 describes signed-based approach to compute relative social status of the individuals involved in Korean dialogue. It uses information about social status and the information about sentence-external individuals such as speaker and addressee to explain why a sentence is felicitous in a restricted context and whether a dialogue is coherent or not. The approach makes a good use of Korean honorific morphemes to compute social status. However, our system demands computation of social status of individuals in English text. Since, honorific morphemes are absent, different approach



 332



to determine the social status is required and this is a part of our work presented in this paper. Moreover, the social status has to be set to one of the different levels found in Nepali language. For determining the social status of pronouns referring to human, pronominal anaphora resolution is required. The first procedure for pronoun resolution was developed by Winogard2. He chooses possible antecedents based on syntactic positions in which subject is favoured over object and both are favoured over the object of preposition. In addition, focus element are given more preference and it is determined from answers to wh-questions and from indefinite noun phrases in yes-no questions. Ref. 3 and Ref. 4 present one of the first and best methods for pronominal anaphora resolution which is mostly based on syntactic information. The algorithm uses syntactic constraints on pronominalization which are used to search the tree. The algorithm starts at the pronoun on the parse tree and ends at the possible antecedent. Ref. 5 presents a simple but rapid syntax based approach. The algorithm utilizes f-structure that results from a full parse and applies a set of well-known heuristics (constraints and preferences). It applies heuristics in sequential manner until one candidate remains. This approach has been implemented and integrated into the KANTOO English-to-Spanish MT system. We use similar approach in our system. A brief survey of the major works in anaphora resolution has been presented in Ref. 6 . 3. Nepali Honorification System



In Nepali society, people are ranked according to their social status. The social rank of addressee with respect to the speakers are important factors to be considered during conversation. Another important factor is the relative age difference between the speaker and the addressee. Social relationship of the speaker with addressee and referent, his/her social status and age govern the Nepali honorification system. The following subsections discuss about the honorification system found in different categories and at different levels in Nepali language.



3.1. Pronoun Honorification At lexical level, Nepali personal pronouns are divided into different honorlevels. Table 1, Table 2 and Table 3 list 1st person, 2nd person and 3rd person pronouns at different levels respectively.



 333 Table 1. Nepali personal pronouns( 1P). Non-Hon Mid-Hon Table 2.



we haami



-



Nepali personal pronouns(2P).



Non-Hon Mid-Hon High-Hon Very High-Hon Roval-Hon Table 3.



i ma haami



ta timi tapai hajur sarkar



timiharu timiharu tapaiharu hajurharu sarkarharu.



Nepali personal pronouns(3P). he/she



Non-Hon Mid-Hon High-Hon Very High-Hon Royal-Hon



U



they uniharu



uni waha mausuf



wahaharu hajurharu. mausujharu.



The honor-levels in increasing orders are non-honorific(plain), midhonorific, high-honorific, very-high-honorific and royal-honorific. In Nepali, personal pronouns are made plural by adding suffix ham. 3. 2. Noun Honorification



Nepali proper nouns (referring to human) and common nouns(referring to post/occupation) can be divided into three levels; non-honorific, midhonorific and royal-honorific. In practice, all proper nouns can be at nonhonorific and mid-honorific levels but only some common nouns (honorable) can be at honorific(mid and royal) levels. Non-honorific nouns are bare form nouns and they can be transformed into mid-honorific form in three ways; 1) by adding words jyu and ji just after them, 2) by adding word shree just before them and, 3) by applying both rules (1) and (2). Other suffixes with noun like case markers (le,lai,ko etc.), plural marker ( h u m ) etc. are added after jyu and ji. Examples of royal-honorific nouns are ’king’(ruujaa),’queen’(ruani),’prince’(ruajkumaaru) etc. Table 4 shows examples of some proper and common nouns at different honor-levels. In Nepali, some common nouns referring to human are divided into two levels; non-honorific and honorific. Non-honorific nouns end at ’o’.The corresponding honorific form can be formed by deleting ’0’and adding ’aa’. Table 5 lists some examples of such nouns.



 334 Table 4. Nepali proper and common nouns at different honor-levels. David John Ram doctor owner



Non-Hon David John Ram dactar saahu



Mid-Hon shree David j y u shree J o h n j y u Ram jyu dactar j y u saahu ii



Table 5 . Some Nepali common nouns at different honor-levels.



son nephew



Non-Honorific choro bhanjo



Honorific chora bhanja



3.3. Verb Honorification



Linguistically, honorification in verb is manifested by honorific suffixes and this depends upon the honor-level of its subject. In fact, subject and it’s verb should agree in honor-level. When the subject is pronoun the possible honor-level of verb equals five. With noun, referring to human, three levels are possible. Table 6 lists the suffixes that are added to Nepali verb roots to reflect a particular honor-level in a particular tense. Table 6 . Nepali verb morphemes for honorification. Non-Hon Mid-Hon High-Hon Very High-Hon Roval-Hon



Present as au hun baksi baksi



Past is au bha baksi baksi



Future as au hun baksi baksi



Non-honorific and mid-honorific affixes are added at the end i.e. after addition of other inflections. Whereas, high-honorific and very highhonorific affixes are added just after the aspect inflections. With royalhonorific subjects’ entirely different set of verbs are used. For instance, ’go’ with subjects having honor-level other than royal is mapped to Nepali root jaa. But when the subject has royal honor-level’ ’go’ is mapped to sawaari. Some other examples are ’eat’(jyunaar) ’wash hand’( mukhaari) ’see’(najar) etc.



3.4. Adjective Honorification Some Nepali adjectives can be classified into non-honorific and honorific. Non-honorific adjectives ends with ’0’and their corresponding honorific



 335



form can be formed by replacement of '0'with 'aa'. This is very similar to rule for Nepali common nouns described in section 3.2. Some examples have been listed in Table 7. Honorification of adjective is determined by Table 7. cation. small fat



Nepali adjective honorifiNon-Honorific



Honorific



sano mot0



Sanaa motaa



the social status(honorific/non-honorific) of noun to which it modifies. For example, sumo choro(smal1 son, non-honorific) and saanaa choraa(smal1 son, honorific) are correct but saano choraa is not correct because s u m o is non-honorific, whereas choraa is honorific. 4. Handling Honorification



The entire problem of handling honorification in a machine translation system can be divided into two sub-tasks; determining social status/honorlevel of the individuals(noun and pronoun) in source language and reflecting it in the target language through proper Nepali honorific morphemes generation. Besides, adjectives and verbs should also be marked with proper honor-level because they show honorification in Nepali language. The following subsections discusses about techniques used for determination of honor-level and generation Nepali honorific morphemes in more detail.



4.1. Computing Honor-Level For the sake of simplicity in analysis, the categories discussed in section 3 can be divided into two categories; dependent and independent types. Dependent types are those whose honor-level depends on honor-level of independent types. It is required to compute the honor-level of independent types and then derive the honor-level of dependent types. Nepali noun falls under independent category while rest of the categories discussed in section 3 fall under dependent type. The computation of social status of individuals is achieved by the computation of honor-levels of nouns and pronouns in the current sentence. 4.1.1. Determining Noun Honor-Level. Determination of the honor-level of proper nouns referring to human is a difficult task. Pragmatic analysis is required to know about the status, age



 336



etc. of the person and we do not go for this. However, the English titles such as Mr, Mrs, Miss etc. that appear before proper nouns in English are mapped t o equivalent titles like shreemaan, shreemati, sushree etc. in Nepali by the help of bilingual lexicon. Common nouns referring to posts and occupations are tagged with appropriate honor-level in the bilingual-lexicon. For this feature hon is set to ’non’,’mid’or ’royal’ depending on whether the honor-level is non-honorific, mid-honorific or royal-honorific respectively. Figure 2 shows portion of the bilingual-lexicon to illustrates the idea.



. . .) lex(teacher,guru,cat:n..hon:mid . . .1. lex(king,rAjA,cat:n..hon:royal . . .) .



lex(servant,nokAra,cat:n..hon:non



Figure 1. Common nouns tagged with honor-level



Such common nouns get their honor-level by lexicon lookup just before parsing. Common nouns(shown in Table 5) that can have honorific form are very less in numbers. Moreover, the decision of making a choice between these two forms (non-honorific and honorific) requires knowledge of relationship between speaker and addressee (different from the common noun under consideration) and the relative age information which requires heavy computation. Avoiding them has very little impact on the overall performance of the system and therefore, we translate them to honorific form by default which is still understandable.



4.1.2. Pronoun Honor-Level Computation. Honor-level of pronoun can be derived from the honor-level of its antecedent (noun/noun phrase). Thus, pronominal anaphora resolution is required t o compute the honor-level of pronouns in English text. Ref. 9 and Ref. 5 discusses about anaphora resolution that is required because of gender discrepancies between the language pair. We are highly inspired by the approach discussed in Ref. 5 and we use similar approach for anaphora resolution.



4.1.3. Verb and Adjective Honor-Level Computation. The system uses feature based unification for agreement among the phrases and to set constraints in the grammatical rules. Adjective get its honor-



 337



level from the noun to which it modifies. Similarly, honor-level of verb is set to honor-level of its subject. These are encoded as agreement rules in the grammar. 4.2. Generating Honorific Nepali Morphemes



The computed honor information of each category is encoded in the parse tree. Transfer rules are applied to the parse tree to generate appropriate Nepali syntax. Morphology generation rules are applied to each terminal to generate proper inflections, thereby giving a naturalistic finish to the sentence. The honor information encoded into the parse tree is used for honorific morpheme generation during morphology generation. 5. Implementation Issues



Frozen Expression Expander



+



Compound Word Handler



J



Camp,



Word



Lexical



1-



Morphology Generation



(3)



1 Nepali sentence



Figure 2.



Dobhase system architecture



Figure 3 shows different stages during the entire machine translation process. Determination of honor-level takes place at phases (1) and (2)



 338



shown in the figure. Similarly, Generation of honor takes place during phase (3). The honor-level of Nepali nouns are determined during morphological analysis. During syntactic analysis, the system uses passive chart parsing technique to shallow parse the English sentence and outputs its constituents/phrases. The output of shallow parser is given to pronominal anaphora resolution module which, after resolving the anaphora, gives its output to parse tree generation module. Dependent types of lexical items get their honor-level during parse tree generation. After computation of honor-levels, each terminal(noun, pronoun, adjective and verb) in the parse tree has a hon feature set to appropriate value which indicates it’s honor-level. Proper inflections to reflect appropriate honor in the Nepali sentence are generated during the morphology generation phase. Such morphology generation rules are designed using the honorific rules that can be extracted from section 3 and are implemented as prolog facts. 6. Conclusion



In this paper, we presented the honorification system in Nepali language at different levels. We discussed about the different approaches that can be used to determine honor-level of different categories by analyzing the English text and techniques to reflect proper honor in the output Nepali sentences. Future work includes developing metrics to test and evaluate the system’s ability to handle honorification and apply them to evaluate the system. The techniques discussed in this paper can be used to handle honorification in other natural language processing applications like dialog systems, natural language generation systems etc. that include Nepali or any other language that has honorification system very similar to Nepali.



7. Acknowledgement We are very thankful to Mr. Deepesh Man Shrestha and Mr. Bhim Regmi for their valuable suggestions and comments.



References 1. Done-Young Lee: Computation of Relative Social Status on the Basis of Honorification in Korean. In Proceedings of the 16th International Conference on Computational Linguistics COLING 1996, Center for Sprogteknologi, Copenhagen, Denmark. (1996)



 339 2. T . Winogard: Understanding natural langauge. New York: Academic Press. (1972) 3. Jerry Hobbs: Pronoun Resolution. Research Report 76-1. New York:Department of Computer Science, City University of New York. (1976) 4. Jerry Hobbs: Resolving pronoun references. Lingua. 44 (1978) 339-352 5 . Teruko Mitamura, Eric Nyberg, Enrique Torrejon, David Svoboda and Kathryn Baker: Pronominal Anaphora Resolution in the KANTOO Englishto-Spanish Machine 'Ranslation System. In Proceedings of M T Summit VIII. (2001) 6. Ruslan Mitkov: Anaphora Resolution: the state of the art. Technical Report, University of Wolver-hampton. (1999) 7. Ruslan Mitkov, Hiongun Kim, Kyan-Hyuk Lee and Key-Sun Choi: The lexical transfer of pronominal anaphors in Machine Trans1ation:the English-toKorean case. In Proceedings of the SEPLN'94 conference, Cordoba, Spain. (1994) 8. Saggion Horacio and Ariadne Carvalho: Anaphora resolution in a machine translation system. In Proceedings of the International Conference "Machine translation, 10 years on", 4.1-4.14. Cranfield, UK. (1994) 9. Ruslan Mitkov and Paul Schmidt: On the complexity of anaphora resolution in Machine Tkanslation. In Carlos Martin- Vide Editor, Mathematical and computational analysis of natural language. Amsterdam: John Benjamins. (1998).



 SPEECH SYNTHESIS THROUGH WAVEFORM CONCATENATION FOR SANSKRIT MR. V.E. WAGHMARE Lecturer, Dept. of E&TC Govt. College of Engineering Aurangabad - 431 005, MS, India Ernail: [email protected] DR. MRS. A. S. BHALCHANDRA Asst. ProJ, Dept. of E&TC Govt. College of Engineering Aurangabad- 431 005, MS, India Ernail: [email protected] Abstract We report the basic methodology for the development of speech synthesis through waveform concatenation approach in Sanskrit language. Storing waveform in look-uptable, rules for concatenation and speech generation are basic components of this method. Text to speech synthesis has been developed. The text input is correlated to the look-uptable, which is stored in memory, and accordingly stored waveforms are concatenated to form the Sanskrit



1. Introduction



During last few decades the communication aids have been developed from talking calculators to modem three-dimensional audiovisual applications. The application field for speech synthesis is becoming wider. A text-to-speech system has been developed steadily over the last decades and it has been incorporated into several new applications. For most practical applications such as access to databases through the telephone network or aid to handicapped people, the intelligibility and comprehensibility of synthetic speech have reached the acceptable level. The fimction of a text-to-speech (TTS) synthesis system is to synthesize speech from a generic text in a given language. The most commonly used techniques in present systems are based on formant and concatenative synthesis. The latter one is becoming more and more 340



 341



popular since the methods to minimize the problems with the discontinuity effects in concatenation points are more effective and it also provides more natural and individual sounding speech. The majority of successful commercial systems employ the concatenation of polyphones[ 13. The polyphones comprise groups of two (diphones), three (triphones) or more phones and may be extracted from nonsense words, by segmenting the desired grouping of phones at stable spectral regions. For instance, the word life, whose phonetic representation may be written as [IaIfl, can be implemented by the concatenation of four polyphones: #I + la + aIf + f#, where the symbol # represents the silence phone. In a concatenation-based synthesis, the conservation of the transition between two adjacent phones is crucial to assure the quality of the synthesized speech. With the choice of polyphones as the basic subunits, the transition between two adjacent phones is preserved in the recorded subunits, and the concatenation is carried out between similar phones (1 + 1, a + a, f + f). The generation of the speech waveform by means of a computer model controlled by a small number of parameters has long been an objective of speech research [ 2 ] . To construct such a model, one can exploit knowledge about the physiology of the speech-production apparatus [3] and about the properties displayed by the speech signal. While major studies have been made towards a high-quality, controllable model of speech, this goal has not yet been attained. This is particularly well illustrated by the architecture of state-of-the-art speechsynthesis systems. In such systems the linguistic input parameters do not directly control a waveform synthesis model, but rather these linguistic parameters are used to find and modify natural-speech segments that are stored in a database. The modified segments are then concatenated to obtain a speech signal. Synthesis of a meaningful speech signal by a computer program requires a description, in some form, of the vocal tract resonance corresponding to that speech signal. Phonology of a language for the speech synthesis is most important for text to speech synthesis. Phonology is a part of the linguistic system a speaker knows, and is concerned with lexical identity. The phonology of English language varies in particular contexts. 2. Sanskrit Language



Sanskrit is the perfect language in the same sense as mathematics for pronunciation, making it most suitable for the speech synthesis. In Sanskrit language, there is zero discrepancy between what is written and what is



 342



pronounced. Arthur A. Macdonel [4] has written about the greatness of Sanskrit language in comparison with European languages: “We Europeans, 2500 years later, and in a scientific age, still employ an alphabet which is not only inadequate to represent all the sounds of our language, but even preserve the random order which vowels and consonants are jumbled up as they were in the Greek adaptation of the primitive Semitic arrangement of 3000 years ago”. Rick Briggs, NASA researcher has suggested that, the “structures constructed by Panini (A Sanskrit grammarian) could be useful in the development of efficient, high level computing language, avoiding the use of alphanumeric operator symbols. Quotes showered on the Sanskrit language by Prof. James Santucci California State University: 1 2 3 4 5 6



Emphasized the study of sounds Developed morphological explanations Stressed in phonetics the place and manner of articulation Described language in a formal manner and not as a logical system Developed a Meta language Approached the sense of the phoneme



Sanskrit alphabets are also called as a brahmavarnamala. According to Shri Pandit Hebbar [2] Sanskrit have 50 units which are divided into 8 groups, which are discussed below.



3. Building Database for Speech Signal: 3.1. Format For Input Text The input text could be available in any of the formats and can be correlated to the look-up-table, which is stored in memory. However; the issue of various formats could be conveniently separated from the synthesis engine. The text processing front end [5] can provide appropriate conversion of various formats Table 1. Block diagram for Speech synthesis



Look-uptable



Speaker



 343



of input text into this transliteration scheme. Sanskrit language have 50 units which are divided into 8 groups are shown into the Table 1. Lookup table with standard code is prepared. Building Speech Database: All the basic phones of Sanskrit language are recorded from a Sanskrit Pandit having Ph.D. in Sanskrit language. Recording took place using Shure SM 58 microphone, whose frequency response is 50 - 15,000 Hz and SNR is 30 dB under lab conditions. Spoken units were recorded at a sampling rate of 44.1 KHz. Table shows sample waveform of few units from each group. Total analysis is performed by using MATLAB. Methodology of Generation of Units Preprocessing of phones [3] is done before applying the methodology for the formation of the word$. Preprocessing involves removing vowel part from all the units other than 1 group units. A standard lookup table is prepared and waveforms are stored in the lookup table. A synthesis engine takes user input, selects appropriate waveform from lookup table and sends it to the speaker. Sanskrit has well defined mathematical rules for the concatenation of units and their morphological transformation during concatenation. Figure 3 shows methodology for generation of Sanskrit Units. Group 1 Unit are complete unit and called as vowels. Group 2-8 units are called as half vowels. Complete unit is the addition of any unit from Group 2 - 8 with the unit in the group 1 after preprocessing and morphological operations. By using this methodology 34 * 16 = 244 basic units can be derived. Conclusions This methodology can be used for the developing a perfect Sanskrit speech synthesis either through the technique of waveform concatenation or through the other signal processing methods.



0



The concatenate method provides more natural and individual sounding speech Minimizes the problems with the discontinuity effects in concatenation points and so it becomes more effective. Quality with some consonants may vary considerably and the controlling of pitch and duration may be in some cases difficult, especially with longer units



 344 Table 2. Units and their time domain wave representation



 345 Table 2. (Cont’d.) Units and their time domain wave representation.



2-8 unit



1 Unit



Figure 3. Flowchart for Unit generation



In principle, speech synthesis may be used in all kind of human-machine interactions. For example, in warning and alarm systems synthesized speech may be used to give more accurate information of the current situation. Using speech instead of warning lights or buzzers gives an opportunity to reach the warning signal for example from a different room. Speech synthesizer may also



 346



be used to receive some desktop messages from a computer, such as printer activity or received e-mail. In the future, if speech recognition techniques reach adequate level, synthesized speech may also be used in language interpreters or several other communication systems, such as videophones, videoconferencing, or talking mobile phones. If it is possible to recognize speech, transcribe it into ASCII string, and then re synthesize it back to speech, a large amount of transmission capacity may be saved. With talking mobile phones it is possible to increase the usability considerably for example with visually impaired users or in situations where it is difficult or even dangerous to try to reach the visual information. It is obvious that it is less dangerous to listen than to read the output from mobile phone for example when driving a car. Acknowledgement I would like to thank Dr. W.Z Gandhare, Prof. G.K. Andurkar, Prof. N.G. Pantawane, Renuk Pawade, Nilesh Kandalgaonkar (Machine Intelligence Group Members) for their continuous assistance in the development process. References [I] P. Mermelstein, “Articulatory model for the study of speech production,” .I. Acoust. SOC.Amer:, vol. 53, no. 4, pp. 1070-1082, 1973. [2] Pandit Ganpatishastri Hebbar, “Bhartiyu Lipinche Muulik Ekrup”, Maharashtra State center for culture and literature” [3] K. Ishizaka and J. L. Flanagan, “Synthesis of voiced sounds from a two-mass model ofthe vocal cords,” Bell Syst. Tech. J., vol. 51, no. 6, pp. 1233-1268, 1972. [4] Arthur A Macdonell, “History of Sanskrit Literature”, Motilal Banarasidas Pub. ISBN: 8120800354, p.717. [5] Geoff Bristow, “Electronic Speech Synthesis - Techniques, technology and applications”, McGraw-Hill Book Company, 1985 pp 120-14.



 Soft Computing



 This page intentionally left blank



 PERFORMANCE MEASUREMENT OF REAL TIME OBJECT DETECTION AND RECOGNITION MIR MD. JAHANGIR KABIR, SAMIR HALDER, MD. ROBIUR RAHMAN, MD. W.H. SADID, M.M. MANJURUL ISLAM AND MD. NAZRUL ISLAM MONDAL Computer Science and Engineering, Rajshahi University of Engineering and Technology,Rajshahi-6204, Bangladesh. E-mail: sagar-31 S@yahoo. com,[email protected],[email protected] One of the most difficult problems in image processing is detecting objects in an image. The method presented in this paper is designed to detect artificial object in natural environments. It is based on the fact that artificial objects may have any directional viewings. It uses neural network architecture for the image to detect the image of different view. This method consists of taking every possible image for every object and from many directions. This method allows detecting unknown objects in various natural environments. It works with different image resolutions and it has a low processing time, making it suitable for real-time operations.



1



Introduction



Several arbitrary objects are added in the drawing window. And our implementation is to detect them. One of the problems is that what happens when the objects are in rotated form. In static object detection and justification it is of no great problem since we have to work with the spatial coordinates. And here rotating the objects changes the bounding volume in a different orientation. So the boundary volume determines the object and the accuracy depends on the grid-cell size as will be described in details later. While in dynamic object detection, it is much complex. Because we will read a portion of image inside the viewing window. So we have to have some initial knowledge about the rotated objects structure. Now, when we have the rotated version of the image, still we will be reading an image of a particular position - the viewing window. So according to the previous knowledge there will be a decision of the object type. 2



Static Object Detection and Recognition



GRID is a technique that subdivides the whole coordinate system into smaller segments to track the objects in it. GRID is a collision detection scheme for mobile objects in real time systems. The grid may be 2D or 3D, divides the entire scene into distinct cells (uniform spatial subdivision). The grid requires no explicit storage of cells or content. Only a 349



 350



single variable is required: CELL-SIZE. CELL-SIZE should be significantly larger than most objects in the scene to reduce chances of objects spanning multiple cells. While a single object spanning multiple cells is acceptable (for example, objects on a cell boundary). Thus CELL-SIZE will affect performance. In a 2D grid, each cell is uniquely represented by two integers generated by the hash function. A 3D grid requires three integers. Neither grid cells, nor their contents, are explicitly stored. Each object in the scene has a parameter denoting the grid current cells it occupies. The algorithm for determining cells-occupied is as below: 1. Clear the cells occupied list of the object. 2. Set a = the maximum grid cell occupied by the object. 3. Insert a into the cells occupied list. 4. Set b = the minimum grid cell occupied by the object. 5. If a = b go to step-16. 6. Insert b into the cells occupied list. 7. Set point, p = maximum grid cell. 8. Set p.x = min.x. 9. Set grid cell, c = hashed grid cell of point p. 10. If c = b go to step-16. 11. Insert c into the cells occupied list. 12. Get a grid cell, d. 13. Set d.x = a.x 14. Set d.z = a.z 15. Insert d into the cells occupied list. 16. End



3



Dynamic Object Detection and Recognition



From the drawing window we will get the image data that is available inside the viewing window. Then this image data or pattern is passed into the following algorithm. By using this algorithm, we will get an output. Now if this output is one of the valid ones, an information is provided. But if there is no valid output is found then there is no confirmation that will be given by the algorithm. In the meantime, if a translation or rotation is provided then the viewing window is translated or rotated accordingly. The Dynamic Object detection concept is shown belown: Learning Object: eural Network [Learning lgorithm with input, hidden and butput layer nodes]



Stable Network



I



 351



Output of the Find a match from the current P P u t Object +stable neural network --+sensed Object



4



Recognition Accuracy



m



After completion of the learning the updated weights and thresholds are saved in a file which are used in the testing phase of an unknown object. To verify the object, new sample of an object image is taken and features are extracted to form a feature matrix and applied it to the net for testing. If it generates the error within the rate of tolerance the identification is successful otherwise not or failed 100



90 110 70 60 50



40



30 20



4



I



lo 0 0



5



10



20



30



35



Rate of distortion



Figure 1. Accuracy Chart



5



Discussion



This detection method based on object shape provides very good results for detecting artificial objects in natural environments. It allows an easy detection of unknown artificial objects in various environments. Furthermore, this method can be used with relatively low resolution images, and has a low processing time.



 352



Possible improvements of this method include a method of image segmentation more refined than the simple division in fixed-size sub-frames. It may also be possible to eliminate the natural uniform surfaces which can make the detection fail. However, because of its relatively low processing time, it could be used for detecting objects on video sequences. And this kind of method has other potential applications, like face detection or content-based image retrieval. 6



Acknowledgements



The authors are gratefid to the department of Computer Science and Engineering of Rajshahi University of Engineering and Technology, for using the lab, to the teachers who helped the authors for carrying out this paper and by giving innovative ideas. References



1. A.J. Lipton, H. Fujiyoshi, and R. S. Patil, “Moving target classification and tracking from real-time video,” in Proceedings IEEE Image Understandingworkshop, 1998, pp. 129-1 36. 2. C. Stauffer and W.E.L. Grimson, “Learning Patterns of Activity using RealTime Tracking,” IEEE Transactions on Pattern Analysis and Machine Intelligence, vol. 22, no. 8, pp. 747 757, August 2000. 3. C. Wren, A. Azarbayejani, T. Darrell, and A. Pentland, “Pfinder: Real-Time Tracking of the Human Body,” IEEE Transactions on Pattern Analysis and Machine Intelligence, vol. 19, no. 7, pp. 780-785, July 1997. 4. Donald Ham and M. Pauline Baker., Computer Graphics (2”d Edition). Prentice Hall of India Private Limited, October,1999. ISBN 81-2030944-8. 5. J. Ong and S. Gong, “A Dynamic Human Model using Hybrid 2D-3D Representations in Hierarchical PCA Space,” in Proceedings of the British Machine Vision Conference, Nottingham, September 1999, pp. 33-42, BMVA Press. 6. G. A. Jones J. Orwell, P. Remagnino, “From Connected Components to Object Sequences,” in First IEEE International Workshop on Performance Evaluation of Tracking and Surveillance, Grenoble, March 31st 2000, pp. 129-136.



 A SYMMETRIC ENCRYPTION TECHNIQUE THROUGH RECURSIVE MODULO-2 OPERATION OF PAIRED BITS OF STREAMS (RMOPB)



' School



P.K.JHA', J. K. MANDAL of Engineering and Technology Purbanchal Universiw, Biratnagar, Nepal



E-mail: [email protected] Professor, Department of Computer Application, Kalyani Government Engineering College, Kalyani, Nadia, West Bengal, India - 741235, India, E-mail: jkmandal@rediffmail. com



Abstract:The technique considers a message as binary string on which a Recursive Modulo-2 Operation of Paired Bits of Streams (RMOPB) is applied. A block of n bits is taken as an input stream, where n varies from 4 to 256, fiom a continuous stream of bits and the technique operates on it to generate the intermediate encrypted stream. The same operation is performed repeatedly for different block sizes as per the specification of a session key of a session to generate the final encrypted stream. A comparison of the proposed technique with existing and industrially accepted RSA has also been done in terms of frequency distribution and homogeneity of source and encrypted files. Key words: Recursive Modulo-2 Operation of Paired Bits of Streams RMOPB), Cipher text, Block cipher, Session Key. 1. Introduction The encryption process [6, 8, 9, 10, 11,121 makes the document into cipher text, which will not be legible to them. Many algorithms are available each, of which has merits and demerits [l, 2, 3, 4, 5, 71. No single algorithm is sufficient for this application. As a result researchers are working in the field of cryptography to enhance the security hrther. In this paper a new technique is proposed where the source message is considered as a stream of binary bits and the technique transforms the document into unintelligent form from where the original message can also be recovered into the original form using the same technique. The technique has been implemented using C language. Section 2 of the paper deals with the principle of Recursive Modulo-2 Operation of Paired Bits of Streams (RMOPB). A proposal for key generation is described in section 3. Results are given in section 4. Analysis about the technique is made in section 5 Conclusions are drawn in section 6 and references are drawn in section 7. 2. Principle of Recursive Modulo-2 operation of Paired Bits of Streams (RMOPB) The technique, considers the plaintext as a stream of finite number of bits N, and is divided into a finite number of blocks, each also containing a finite number of bits n, where, 1<= n <= N. Let P = soo sol s02 s03 so4 ... so,., is a block of size n in the plaintext. Then the first intermediate block 1, = sl0 sI1 sI2 sI3 sI4 ... s',.~can be generated from P in the following way: 1 1 - 0 0 sos I - s o s I B S 02 S03 353



 354 1



-



0



s'2s3-sos



0



0



0



1CDsqsg



s'.1 s1.J+I= so.. so,, @ so.l+J+2 s 0i+j+3, 0 <= i < (n-1), 0 <= j < (n-1); CD 1-J I-J+1 stands for the exclusive-OR operation. In the same way, the second intermediate block 12 = S'O sz1s22 sZ3sZ4 ... s2,.1 of the same size (n) can be generated by: s2 2 1 0s 1 - s 0 s'I CD s 1 2 ~ ' 3 s2 2 - 1 I I 2 s 3 - S o s 1CD S ' ~ S5 2 2 - 1 1 S i s j+l- s i-j s i-j+l CD ~ ' i + j + Zs 1i+j+3, o <= i < (n-11, 1 <=j < (n-l);a stands for the exclusive-OR operation. If this process continues for a finite number of iterations, the source block P is regenerated forming a cycle, which depends on the value of block size n. Any intermediate block in the recursive process may term as intermediate encrypted block for that source block. The operation is repeated for the whole stream in the source. The same operation is performed for whole stream number of time with a varying block sizes. k such iteration is done and the final intermediate stream after k iterations generates the encrypted stream. All of the different block sizes and k constitute the key for the session. This key may be considered as session key for that particular session. 3. Generation of Session Key To ensure the successful encryption of the proposed technique with varying size of blocks, a 133 bit key format consisting of 14 different segment has been proposed here [ 1,11,6,8,9]. For the segment of rank the R, there can exist a maximum of N=216-R blocks, each of unique size of S=216-R, R starting from 0 and moving till 13. Segment with R=O formed with the first maximum 65536 blocks, each of size 65536 bits 0 Segment with R=l formed with the next maximum 32768 blocks, each of size 32768 bits Segment with R=2 formed with the next maximum 16384 blocks, each of size 16384 bits 0 Segment with R=3 formed with the next maximum 8192 blocks, each of size 8 192 bits 0 Segment with R=4 formed with the next maximum 4096 blocks, each of size 4096 bits Segment with R=5 formed with the next maximum 2048 blocks, each of size 2048 bits Segment with R=6 formed with the next maximum 1024 blocks, each of size 1024 bits 0 Segment with R=7 formed with the next maximum 512 blocks, each of size 512 bits ,



 355



Segment with R=8 formed with the next maximum 256 blocks, each of size 256 bits Segment with R=9 formed with the next maximum 128 blocks, each of size 128 bits Segment with R=10 formed with the next maximum 64 blocks, each of size 64 bits Segment with R=l 1 formed with the next maximum 32 blocks, each of size 32 bits Segment with R=12 formed with the next maximum 16 blocks, each of size 16 bits Segment with R=13 formed with the next maximum 8 blocks, each of size 8 bits. With such a structure, the key space becomes of 133 bits long and a file of the maximum size of around 699.05 MB can be encrypted using the proposed technique. 3.1 Vulnerabili@ We can consider the time required to use a brute force approach, which simply involves trying every possible key until an intelligent translation of the cipher text into plain text is obtained. On average, half of all possible key must be tried to achieve success. Table 3.1 shows how much time is involved for various key spaces. Results are shown for four binary sizes. The 56-bit key sizes used with the DES (Data Encryption Standard) algorithm, and the 168-bit key size is used for triple DES. The minimum key size specified for AES (Advanced Encryption Standard) is 128-bits. Results are also shown for what are called substitution codes that use a 26 characters key, in which all possible permutations of the 26 characters serves as keys. For each key size, the results are shown assuming that it takes 1 ps perform a single decryption, which is a reasonable order of magnitude for today's machine. Within the use of massively parallel organizations of microprocessors, it may be possible to achieve processing rates may orders of magnitude greater. The final column of the table 3.1 considers the result for a system that can process a1 millions keys per microsecond. As one can see at this performance level, DES can no longer be considered computationally secure. In the proposed technique key size is 133 bits though this key may be changed to other size as per the requirements [13, 141.



 356



:xhaustiveKey Search Key Size (bits) 32



, 4.



No. Of Alternate Keys 232=4.3*10’



Time Required at 1 encryption / ps 2”= 35.8 minutes



zs5ps=l142 years



56 128



256=7.2*1016 2Iz8=3,4*1038



2



”’ps=5.4*1OZ4years



168



2168=3 ,7*1OsO



2



16’



26 Characters



26! 4*1OZ6ps



2*1OZ6ps=6.4*10i2years



133@roposedRMOPB)



2I3’=l .08*1040



2132ps = I .7*1oZ6years



ps=5.9*10~~ years



Results In this section the results of implementations are presented. The implementation is made through high-level language. Table 4.1 represents the encryption time, decryption time, size before and after encoding and decoding. The encryption time varies from 0.054945 to 0.164835.The decryption time varies from 0.000000 to 0.164835 for the present implementation. Figure 4.1 gives a relationship between encryption times against decryption time. The frequency distribution graphs for source and encrypted files for both RMOPB and RSA techniques is given in figure 4.2. Chi-square test has also been done and presented in table 4.2 for source files and encrypted files. Results of the Chi square tests are compared with RSA technique for source files and encrypted files.



 357



Encryption Tjrn* (In Mconds)



Figure 4.1 :Decryption time against Encryption time for RMOPB Technique



f Encrypted File



Figure 4.2: Frequency distribution between viewprev.cpp and its encrypted file (cascaded) for RMOPB Technique



Graph in figure 4.2 shows the frequency of characters in a message and that frequency of characters in the encrypted message for RMOPB technique. The close observation reveals that the character frequencies are more evenly distributed for the RMOPB technique. In connection with the Brute-force attack of decrypting the message by the hackers, it may be difficult to decrypt the message if the message is encrypted using the proposed key system or like manner. 4.1 Tests for Homogeneity The Chi-square test has also been performed using source file and encrypted files for Recursive Modulo-2 Operation of Paired Bits of Streams and existing RSA Technique Table 4.2:Comparative results between RMOPB technique and RSA technique their Chi - Square values and corresponding degree of freedom Source Value of ChiValue of Chi-Square File Square(RMOPB) (RSA) viewprev.cpp 90345 1015121 olecli2.cpp



866561



750711



for .cpp file for Degree of Freedom 88 87



 358



Table 4.2 shows the values of Chi-square for different file size, which shows that the value of Chi-square is increasing as file size is increasing. The Chisquare is highly significant at 1 % level of significant. So we may conclude the source files and encrypted files are non homogeneous in both RMOPB and RSA techniques. 5. Analysis Analyzing all the results presented in section 4, following are the points obtained on the proposed technique: The encryption time and the decryption time vary linearly with the size of the source file and both are comparable Out of the different categories of files considered here, Chi Square values for .CPP files are the highest. The frequency distribution test applied on the source file and the encrypted file shows that the characters are all well distributed. Chi Square values for this proposed technique and those for the RSA system highly compatible. Except some cases the Chi square values in proposed technique shows better result in comparison to RSA.



6 . Conclusion The technique presented here is a simple, takes little time to encode and decode, though the block length is quite high. The encoded string will not generate any overhead bits. It can be easily implemented in any high level language for practical application purpose to provide security in message transmission. Acknowledgement: The authors express a deep sense of gratitude to the Department of Computer Science and Application, University of North Bengal, Dist. Darjeeling and Kalyani Government Engineering College, Kalyani, Nadia, West Bengal, India for providing necessary infrastructure support for the work. The work is carried out under the sanction of an Indo-Nepal collaborative research work where the scholarship to one of the author is provided by the UGC, Nepal.



 359



References 1.



2.



3.



4.



5. 6. 7.



8. 9.



10. 11. 12. 13. 14.



Jha P. K., Mandal J. K., “A Bit Level Symmetric Encryption Technique Through Recursive Arithmetic Operation (RAO) to Enhance the Security of Transmission” Proceedings of ICT Conference 2005, An International Conference of Computer Association of Nepal to be held during 26-27‘hJanuary, 2005 at BICC, Kathmandu, Nepal. Dutta S et al. “Ensuring e-Security using a Private-key Cryptographic System Following Recursive Positional Modulo-2 Substitutions” AACC 2004, LNCS 3285, Springer-Verlag Berlin Heidelberg, pp. 324-332,2004, Mandal J. K. and Dutta S., “A Space-Efficient Universal Encoder for Secured Transmission”, International Conference on Modeling and Simulation (MS’ 2000Egypt), Cairo, April 11-14, , pp-193-201,2000. Mandal J.K. and Dutta S., “A Universal Bit-Level Encryption Technique”, Proceedings of the 7th State Science and Technology Congress, Jadavpur University, West Bengal, India, February 28 - March 1, pp-INF02, 2000. D. Welsh ,“Codes and Cryptography”, Oxford: Claredon Press, 1988 J. Seberry and J. Pieprzyk , “An Introduction to Computer Security”, Australia: Prentice Hall of Australia, 1989. D. Boneh, “Twenty Years of Attacks on RSA Cryptosystem” in notices the American Mathematical Society (AMS), vol46, no 2, pp 203-213,1998. B. Schneier, “Applied Cryptography”, Second Edition, John Wiley & Sons Inc, 1996. C. Coupe, P. Nguyen and J. Stern, “ The Effectiveness of Lattice Attacks against Low-Exponent RSA”, Proceedings of Second International Workshop on Practice and Theory in Public Key Cryptography, PKC’99, ~011560,of lecture notes in Computer Science, Springer-Verlag, ,pp 204-218,1999. RSA Vulnerabilities”, Journal of Cryptology, vol 10, pp 233-260,1997. M. Wiener, “Cryptanalysis of Short RSA Secret Exponents”, IEEE transactions on Information Theory, vol36, pp 553-558,1990. V P Gulati, A Saxena and D Nalla, “On Determination of Efficient Key Pair”, Journal of the Institution of Engineers (India), vol 84,pp 1-3, May 2003. “ Cryptography and Network security ” by William Stallings published by Pearson education (Singapore) Pte. Ltd. 2004. Recent Advances in Computing and Communication “proceedings of the 12’h international Conference on Advanceed computing and communications, Dec 1518,2004, Ahmedabad, India.



 MATHEMATICAL MODEL AND ANALYSIS OF THE SIMPLEST FUZZY TWO - TERM (PI/PD) CONTROLLER



B. M.MOHAN* AND ARPITA SINHA Department of Electrical Engineering, I n d i a n Institute of Technology, Kharagpur-721902, India, E-mail : mohanOee. iitkgp. ernet. in, arpitasinhaoee.iitkgp. ernet. in



This paper reveals a mathematical model for the simplest fuzzy two-term (PI/PD) controller employing r-function type and L-function type input fuzzy sets, Afunction type output fuzzy sets, algebraic product t- norm, bounded sum t- conorm, Mamdani minimum inference and center of sums (COS) defuzzification. T h e p r o p erties of this model are studied. Using the well-known small gain theorem we establish sufficient conditions for bounded-input bounded-output (BIBO) stability of feedback systems containing the above controller as a subsystem.



1. Introduction From the ongoing research in the field of Fuzzy Control Systems, it has been observed that fuzzy PI and fuzzy PD controllers outperform their traditional counterparts - linear PI and PD controllers. This aspect was first theoretically proved by Ying and the relationship between the conventional PI controllers and the simplest fuzzy PI controllers that employ drastic product inference had been established. Subsequently, Ying revealed analytical structures of the simplest fuzzy PI controllers using Mamdani minimum, and Larsen product inference methods. On similar grounds, Malki et. al. made an attempt to develop the simplest fuzzy PD controller, and compared its performance with that of the conventional PD controller. Analytical structures for the simplest fuzzy PD controllers were revealed using different combinations of t-norms and t-conorms, and inference methods. Input-output structures of fuzzy controllers with nonlinear input fuzzy sets, singleton output fuzzy sets, product AND operator, Mamdani type of fuzzy rules, and centroid defuzzifier were studied l . Following the approach given in and 5 , mathematical model for fuzzy two-term (PI/PD) controllers is obtained in this paper by employing *Corresponding author,Fax:+91-3222-282262, 255303, 282700



360



 361



algebraic product t-norm, bounded sum t-conorm, r-function type and Lfunction type fuzzy sets for inputs, A-function type fuzzy sets for output, linear control rules, Mamdani minimum inference, and COS defuzzification. Properties of this controller are investigated. Also an attempt is made to establish BIB0 stability conditions for the feedback system containing this mathematical model as a subsystem. 2. A fuzzy PI/PD controller



The incremental control signal (velocity algorithm) generated by discretetime PI controller is given by



Au(kT) = u ( k T )- ~ [ ( -k 1)T] = K$w(kT)



+ K,dd(kT)



(1)



while the control effort produced by discrete-time PD controller is given by



u(kT)= K$d(kT)



+ K&J(kT)



(2)



where K $ , Kf,and K$ are respectively the proportional, integral and derivative constants of discrete-time PI and PD controllers, T is the sampling period, d ( k T ) = e ( k T ) ,the error signal and the velocity signal



w(kT) = { d ( k T ) - d [ ( k - l)T]}/T



(3)



Based on Eqs. (1)-(3), the principal structures of fuzzy PI controller



Figure 1.



Block diagram of typical fuzzy PI/PD control system



and fuzzy PD controller are shown in Figure 1, in which N d , N,, NLA and N l l represent scaling factors, and d~ (kT), V N (kT) , AUN(kT)and u~(kT) represent normalized versions of d( kT),w( kT),Au(kT)and u(kT), respectively. The error signal e ( k T ) (displacement d ( k T ) ) , and the first-order time derivative of e ( k T ) (velocity w(kT))are fuzzified by a combination of L-type and F-type membership functions (illustrated in Figure 2) whose mathe-



 362



matical description is respectively given by pn.2 =



i



where x is dN or



1



0



VN.



-L<X<-l -1 <x 1 l<x


<



pp.z=



{



-L<x<-~ -I < x 1 1 l<x


0



<



(4)



Note that pn.2 + p p . x = 1. The membership functions



Figure 2.



Input and output fuzzy sets



for the normalized output ( A u N ( ~ for T ) P I control and U N ( ~ for T ) PD control) are shown in Figure 2. The following linear control rules are considered for fuzzy P I controllers: R1) If dN is p.d AND V N is n.u then AUNis 00;Rz) If dN is n.d AND V N is n.v then AUNis 0 - ; R3) If dN is n.d AND V N is p.v then AUNis 0,; R4) If dN is p.d AND V N is p.v then A u N is O+. The above control rules also hold good for fuzzy PD controller if A u N is replaced by U N . The AND operation considered in the rule base is algebraic product t-norm which is defined as f i ( d N ,V N ) = p i ( d N ) . p j ( v ~ ) , where i and j are the ith and j t h fuzzy sets on dN and V N respectively. The degree of match, found by using algebraic product t-norm, is used to determine the inferred output fuzzy set by using Mamdani minimum inference which is defined as min(fi, p(output)), where fi is the outcome of algebraic product t-norm operation. The reference output fuzzy set (triangular), and the inferred output fuzzy set (shown with hatching) corresponding to Mamdani minimum inference are shown in Figure 3(a). The area of inferred fuzzy set is given by f i ( 2 - fi)H All possible input combinations (I&) of normalized inputs d N ( k T ) and v ~ ( l c Tare ) shown in Figure 3(b). The outcomes of fuzzy control rules for all IC regions with algebraic product t-norm are given in Table 1. It can be seen from the control rule base that the output fuzzy set 00is fired two times. The outcomes of rule 1 ( f i ~ and ) rule 3 (/is) corresponding t o output fuzzy set 00may be different. Bounded sum t-conorm is used to take care of the OR operation between rule 1 and rule 3, which is defined



 363



‘*dN



0-L



0)



(a>



Figure 3. (a)Illustration of Mamdani minimum inference (b)Regions of fuzzy PI/PD controller input combinations



= min(1, f i 1 + f i 3 } = P I + f i g According to the well-known COS defuzzification method, the crisp value of control output is given by



as



fi1/3



A(fi2X-H) + A ( f i 1 / 3 ) ( 0+ ) A(fi4)(H) (5) A(b2.)+ A(fi1/3)+ A(fi4) where A(fiz),A(fi4)and A(fi1/3)are the areas of membership grades f i 2 , f i 4 and f i ~ respectively. p A u ~ ( kis)replaced by u ~ ( kfor ) fuzzy PD controller. Au,(k) =



3. Mathematical model and its analysis Using I?-function type and L-function type input fuzzy sets, three Rfunction type output fuzzy sets, linear control rules, algebraic product tnorm, bounded sum t-conorm, Mamdani minimum inference, and COS defuzzification, the expressions for the incremental control effort Au(kT)for fuzzy PI controller is as shown in Table 1. Since d ~ ( k Tand ) w ~ ( k Tare ) always the inputs to the two-term (PI/PD) controller, see Eqs.(l) and (2), and Au(kT) is the output of the PI controller while u ( k T ) is the output of the PD controller, the control laws in Table 1 are also valid for fuzzy PD controllers if Au(kT) is replaced by u ( k T ) and u [ ( k- 1)T] is made equal to zero. Therefore if the models of fuzzy P I controller are known, it is not necessary t o derive the mathematical models for fuzzy P D controller separately. The expression for incremental control effort in regions IC1 t o IC4 can be written as



+



Au(kT) = P x [ d , v ( k T ) v ~ ( k T ) ] (6) H x 1 x [312- d ~ ( k T ) ~ p , 7 ( k T ) ] where P = NAU(714 - [ d L ( k T ) W$(kT)]P - dL(kT)v$(kT)}(7)



+



The plot of Au(kT) is shown with 1 = H = NaU = 1 in Figure 4(a). On analyzing the mathematical model, the following properties are found:



 364 Table 1. Outcomes of rules and control output in IC regions ICS



(R1)



(RZ)



(R3



1



Au(kT)



(R4)



Pl



P2



P3



P4



P p . d P n .v



Pn.dPn.v



Pn.dPp.u



Pp.dPp.v



Pn.v



0



0



Pp.d



Pn.d



0



0.5XH NAu



PP.V



0.5XH



0



0



Pn.u



PP.V



0



0



0



Pn.d



Pp.d



1



0



0



0



0



1



0



0



0



0



1



0



0



0



0



1



NAU 0.5XH NAU 0.5XH $!$%+lJ NAU



0



(a) The surface generated by incremental control effort Au is continuous at any point in the input space. This can be observed from the Au plot shown in Figure 4(a). (b) It is obvious from Eq.(6) that incremental control effort Au, being a product of p and ( d N VN), becomes zero at the point ( d N , V N ) = (0, 0 ) irrespective of the value of p. (c) It can be observed



+



. . .. . .. . . . .......



................ .......



........



*UN



........



........



................ ........ 0'



0.2



0.4 (a)



0.6



0.8 1 vN=dNline



Figure 4. Plots of (a) Au(k) with 1 = H = NA,, = 1 and (b) AUN with 1 = H = 1



from Figure 4(b) that along the d N = V N line, the normalized incremental control effort AUN monotonically increases from its zero value (occurring at (0, 0)) as d N and VN increase. (d) The incremental control effort Au(k) attains a minimum value of at the point ( - I , -1) and the maximum at the point ( 2 , l ) in the input space. All these desirable value of properties are applicable to fuzzy PD controllers also.



 365 4. BIBO stability analysis



We make use of the well known small gain theorem in to derive sufficient conditions for BIBO stability of feedback systems involving the fuzzy PI controller in Sec. 3 as a subsystem. Consider the feedback system shown in Figure 5(a). According to the small gain theorem, if ~ ( G I )the , gain of GI, and 72(Gz), the gain of G2, have a product smaller than unity, then any bounded-input pair (211, u2) produces a bounded-output pair ( y l , y2). We consider the general case wherein the process G2 under control is nonlinear, denoted by N. By defining r(lcT) = ul(lcT),y(lcT) = yz(lcT) = Ne2(lcT), e(lcT) = el(lcT), Au(lcT)= yl(lcT) = Glel(lcT), u[(k- 1)T]= u2(kT) and u(lcT)= e2(kT) in Figure 5(a), we obtain an equivalent closed-loop system as shown in T )sI~ p ~ > ~ l e l ( k TNd ) I ;x h f d = Figure 5(b). Let Md = ~ u p ~ > ~ l d ( k= +lel(lcT) - el[(h - I)T]II + ~ d ; N, x 1; M~ = SUP^,^ I ~ ( / ~=T ) I Mw= 1. FromEqs. (6) and (7) we get



Figure 5.



(a) A feedback system (b) A typical fuzzy PI control system



 366



From Table 1, we have IIu(ICT)II = H/NaU in regions IC 10 and IC 12. This implies that 71 = 0. Since y1 = 0 and 7 2 = IlNll, t o ensure 7 1 7 2 I 1, the condition < 00 follows. So the sufficient condition for the nonlinear fuzzy P I control system to be BIBO stable can be stated as follows. Theorem: A sufficient condition for the system shown in Figure 5 t o be BIBO stable is: (i) the given nonlinear process N has a bounded norm i.e., llNll < 00, and (ii) the parameters 1, H , Nd, N,, NaU and T of fuzzy P I controller should satisfy the inequality



The above stability theorem is equally applicable in the case of fuzzy P D control systems also if NaU is replaced by NU.



5. Conclusion



A mathematical model for fuzzy PI/PD controllers has been developed via r-function type and L-function type input fuzzy sets, A-function type output fuzzy sets, algebraic product t-norm, bounded sum t-conorm, Mamdani minimum inference and COS defuzzification. It has been shown that if mathematical model of fuzzy P I controller is known, mathematical model for fuzzy P D controller need not be derived separately. Upon carefully investigating the properties of the fuzzy controller, it has been found that the fuzzy PI/PD controller model has all desirable control properties. A BIBO stability condition for the corresponding fuzzy PI/PD control system has been presented. Acknowledgement The second author thanks the CSIR, India for providing fellowship for the above work.



References 1. A. Haj-Ali and H. Ying, Int. J. Fuzzy Systems, 3,60 (2003). 2. C. A. Desoer and M. Vidyasagar, Feedback Systems: Input-Output Properties, Academic Press, NY, 1975. 3. H. A. Malki, H. Li and G. Chen, IEEE Trans. on Fuzzy Systems, 2,245 (1994). 4. B. M. Mohan and A. V. Patel, IEEE Trans. on Systems, Man, and Cybernetics-Part B, 32 , 239 (2002). 5. H. Ying, Fuzzy Control & Modeling: Analytical Foundations and Applications, IEEE press, 2000.



 A FRAMEWORK FOR TOPIC CATEGORIZATION OF XML DOCUMENTS USING SUPPORT VECTOR MACHINES SRlNlVASA K G’, SHARATH S’,VENUGOPAL K R’, L M PATNAIK’ ‘ Department of CSE,Bangalore University, Bangalore. kw.FrinivLi.sCam.srir. edu, .shurathsriniva.s~,~muil. com, vkraiiik(k!v.snl.corn ’Microprocessor Applications Laboratory, IISc, Bangalore. lalit(i7hicro.iisc.ernet.in ABSTRACT. Extensible Markup Language (XML) has emerged as a medium for interoperabilityover the Internet. As the number of documents published in the form of XML is increasing, there is a need for categorization of XML documents into specific user interest categories. However, manually performing the categorization task is not feasible due to the sheer amount of XML documents available on the Internet. In this paper, we present a machine learning approach to topic categorization which makes use of a multi class Support Vector Machine (SVM) for exploiting the semantic content of XML documents. The SVM is supplemented by a feature selection technique which is used to extract the useful features. Experimental evaluations performed over a wide range of XML documents indicate that the proposed approach significantly improves the performance of the topic categorization task, with respect to accuracy and efficiency.



1



Introduction



In this paper, we have explored the possibility of topic categorization of XML documents using Support Vector Machines (SVM). The semantic information in the self describing tags is used for the purpose of categorization. A feature extraction strategy which can improve the efficiency of the SVM is also employed. Experimentation evaluations which compare the proposed technique with other classification techniques are also provided. The related work with respect to topic categorization in XML and the usage of SVMs are discussed in [ 1 , 2 , 3 , 4 ] . 2



Feature Set Construction



Consider a training set with n XML documents. Let TI = {tl,t2,...,t,} represent the initial tag pool, constructed using each of the distinct tags in the training set XML documents and I represents the total number of distinct tags. The ti represent the ith tag in the tag pool. The purpose of feature selection is to obtain an optimal tag set To= {tl,t2,...,t,,,}, such that m < 1. Feature selection has two advantages: First, the dimension of the feature vector is reduced. This can help in faster computations. Second, the features, or in the present context the tags, which are spread over all the categories are eliminated and importance is given to those tags that are representatives for a particular category. This can help in greater accuracy of the categorization task. In order to perform feature selection each tag in the tag pool is 367



 368



weighed using a standard weighing scheme. We analyze two weighing schemes: tjidf [ l ] and t-statistics [2]. In the tjidf scheme, tJ; the number of times a tag ti appears in an XML document and idf; the inverse of the document frequency in the collection that contains ti are calculated. The weight of a tag ti in T,is given by,



where wi is the weight of the ith tag in TI, tf; is the frequency of the ith tag in the document, N is the total number of documents in the collection and dJ; is the number of documents in which the ith tag occurs. This scheme assigns weights to tags proportional to the number of times it occurs in document. However, the tags which are common among all the classes are assigned lower weights. But, in the case of XML documents only the information about the existence or the non-existence of a tag is sufficient to classify the documents and there is no necessity to calculate the tag frequency. Hence another weighing scheme based on t-statistics [2] is considered. Here, the initial expression vector for the training samples is given by F" = (A", f:,



...,A"),where1 I x I rn, rn is the number of training samples and



I is the number of features. Here, F" represents the training sample x, and f,", f:,



...,A" represent



all the features of this sample. Each sample is labeled



with D E {-l,+l} . All samples belonging to a particular class are considered as positives for the class and the rest of the samples are considered as negatives. The following values are computed for each tag: n+ and n- ,the number of positive and negative training samples. u ,:



and p,:, the mean values of the ith tag over the positive and the



negative samples respectively. 6: and 6,; , the standard deviations of the ithfeature respectively over the positive and the negative samples. The weight of a tag is given by,



Equation (2) measures the normalized feature value difference between two groups. Higher weights are assigned to the tags that are discriminators for a class. Also, tags which are spread among all the categories are assigned low weights. In both the tjidf and the t-statistics weighing schemes only the top k % of the tags with highest tag weights are chosen as dimensions for the optimal tag set To .



 369



After the feature selection process, a feature vector for an input XML document must be constructed. The XML document is parsed and all the tags present in the document are extracted. Only binary values are assigned as dimensions of the feature vector. The feature vector of XML document consists of a “1” if the tag is present in the document and “0” otherwise. The feature vector thus constructed is the input to the multi class SVM [ 6 ] .



3



SVM for Topic Categorization



For a binary SVM, the training sample is {(xl,yl), (xay2),...(x,y,)}, where xi represents the feature vector for the ifh sample and yi = {-1,+1} i.e., a a class label “+1” or “-1” is associated with each of the training samples. If we assume the profiles to be linearly separable then the equation of the hyperplane that does the separation is given by, wTx+b=O where x is an input vector, w is an adjustable weight vector and b is the bias. Thus, wTxi+b1: 0, ifyi = + I wTxi+b< 0 , ifyi = -1 or equivalently, yi(wTxt+b)1 1 (3) The training samples (xi,yi)for which Equation 3 is satisfied with a equality sign are called Support Vectors. Support Vectors represents the classification samples that are most difficult to classify. Hence, maximizing the margins between the Support Vectors results in a model which has good generalization properties. In order to take care of the non-separable data points a non-negative scalar variable is introduced in (3). The variable is known as the slack variable. The resulting equation for a soft margin S V M is given by, Yi(WTXi+b)2 1- 6 A soft margin SVM solves the following optimization problem



<



c



c



subject to yi(wTxi+b)2 1- 6 , > 0 where C is a variable selected by the user. A quadratic programming algorithm is usually used to solve the constrained optimization problem of (4). In order to classify samples that are not linearly separable the feature space can be mapped into a high dimensional linearly separable space using kernel tricks. However, in the



 370



present context where tags represent the features, the linear kernel is efficient compared to polynomial kernels. The binary SVM can be extended to support multi class classification using the “All-vs-One” approach. Here P binary SVMs are used where, P is the number of predefined topics. Each of the P SVMs are trained using the training set documents. When a new XML document has to be categorized the feature vectors for the document are constructed and are individually provided as inputs to the P SVMs. Each SVM decides whether the given document is in its own topic or not. In “Allvs-One approach” for categorization of XML documents, each S V M calculates the distance to the optimal hyperplane. A document belongs to a particular category if the corresponding SVM representing the category yields maximum positive distance value. 4



Performance Analysis



The variations in the accuracy of the categorization with the values of k are shown in Fig 1. It can be observed that the accuracy of the t-statistics feature selection is greater than the gidffeature selection scheme for all values of k. This is because the t-statistics scheme is able to select an optimal number of features which are effective discriminators for the various classes. From Fig 1, it can also be seen that the accuracy is highest when k is in the range of 60-70%. When the value of k is low, the number of features selected will be less. This affects the generalization performance of the SVM and it performs poorly over the test set. When the value of k is high most of the features get selected. This results in poor classification accuracy of the SVM. A value of k in the range of 60-70 YOresults in good classification accuracy as it chooses an optimal number of tags as features.



loo



20



40 60 v a l w of k (%)



80



im



Fig 1: Variation of accuracy with value k



0’



’



20



40



60



80



Number of Training Samples



100



Fig 2: F1 measure using the two schemes



In addition to the accuracy of the system, precision, recall and FI-measure can be used to evaluate the classification system. Precision is the percentage of predicted documents for the given topic that are correctly classified. Recall is the



 371 percentage of total documents for the given topic that are correctly classified. Also, high precision values leads to low recall and vice versa. A classification framework which can exhibit optimum precision and recall values is required. Hence, the FImeasure, which is a weighted combination of precision and recall is used for evaluation. The F1-measure is given by, 2.precision.recall F, = precision + recall The FI-measure for the two feature selection schemes is shown in Fig. 2. The T-statistics feature selection scheme results in better FI-measure values. In order to verify the accuracy of the proposed technique, we compare it with two traditional classification techniques: k-Nearest Neighbor and Decision Tree classifier and it is observed that the SVM classifier with the t-statistics feature selection strategy outperforms the KNN classifier and the Decision Tree techniques for the XML topic categorization problem. 5



Conclusions



We have proposed a framework for topic categorization of XML documents that makes use of Support Vector Machines for the purpose of classification. Feature selection schemes that can improve the accuracy of the SVM are presented. The All-Vs-One Multiclass S V M is used to assign a topic category to the input XML document from among the various pre-specified categories. References 1. G. Salton, Automatic Text Processing, Addison-Wesley, pp. 279-28 1 , 1989. 2. Liu, H., Li, J. & Wong, L, “A comparative study on feature selection and classification methods using gene expression profiles and proteomic patterns”, Genome Informatics Tokyo, 2002. 3 . F. Sebastiani, “Machine learning in automated text categorization” ACM Computing Surveys, vol. 34, no. 1, pp. 1-47, March 2002. 4. Thorsten Joachims, “Text Categorization with Support Vector Machines: Learning with Many Relevant Features”, Proceedings of ECML-98, 10th European Conference on Machine Learning, pp. 137-142, 1998. 5. A. L. Diaz and Lovell, XML Generator, http://www.alpha works.ibm.com/tech/xmlgenerator,1999. 6. Srinivasa K G, Venugopal K R and L M Patnaik, “A Soft Computing Approach for XML Data Mining”, Technical Report, Department of Computer Science and Engineering, University Visvesvaraya College of Engineering, Bangalore University, June 2005.



 ANALYSIS OF MEDICAL IMAGE COMPRESSION USING STATISTICAL CODING METHODS R. PONALAGUSAMY Assistant Professor, Department of Mathematics, National Institute of Technology, Tiruchirappalli,Tamilnadu, India. Pin - 620015.



E-mail: [email protected](Corresponding Author) C. SARAVANAN Computer Programmer, Department of Computer Applications, National Institute of Technology, Tiruchirappalli, Tamilnadu, India. Pin - 620015. E-mail: [email protected] In this paper medical image compression using statistical coding techniques is discussed and a comparative study has been performed. Statistical coding algorithms choosen are Huffman Coding and Arithmetic Coding. To illustrate the effectiveness of the two statistical methods, several medical images have been considered. The medical images Magnetic Resonance Image (MRI), Computer Tomography (CT), Ultrasound, and X-ray are compressed and decompressed by adopting the above two techniques. It is generally noticed that the Arithmetic coding technique is better in speed. Huffman coding is better in compression rate when the image size is lesser than 1MB. Arithmetic coding is better both in speed and compression rate when the image size is greater than or equal to IMB.



1



Introduction



Medical images have been playing a pivotal diagnostic tool for predicting an accurate stage of a particular disease. Different types of medical images are used for diagnosing, namely MRI, CT, Ultrasound, X-ray etc. Further, these medical images are stored in a computer for future analysis. If required, these medical images are sent to other medical experts for opinion using email or internet. Compression techniques have been used to reduce the medical image size resulting disk space and network bandwidth utilization will be warranted. It is of interest to mention that compression is achieved by removing data redundancy [2]. Compression techniques are categorized into two types; Lossy and Lossless compression. Lossy compression technique produces an image, which looks like the source image but where some pixels data differ from the source image. Lossless compression technique produces an image with no difference from the source image. Lossy compression achieves more data compression than the lossless technique [5]. Lossy compression techniques are used in the entertainment field. In the medical images the pixels information are most important. So, lossless compression techniques are used in medical images.



372



 373



2



Statistical Coding Techniques



Huffman coding and Arithmetic coding are well known statistical data compression techniques. Both the techniques create statistical table from the source data. The statistical table will have a list of unique symbols and probability value for the symbols. Using the statistical table, the coding techniques compress the medical images.



2.1 Huffman Coding Huffman coding creates a binary tree based on the statistical table and assigns the shortest possible variable length code for the symbols. The higher frequency symbols are assigned minimum length code and the lower frequency symbols are assigned far maximum length code. Source data symbols are replaced with the corresponding Huffman code to get compressed data. Table 1. Source reduction



From the data given in table- 1, the Huffman code generated and the corresponding Binary code are listed in the following table-2.



Table 2. Shortest variable length Huffman code



The formula for computing average length of the Huffman code is written as [ 11.



 374



i=l



where, m is the number of symbols, pi is the probability value of the i” symbol and li is the length of the ithsymbol. Using equation (l), the average length of the Huffman code becomes



Lavg= 0.3 (2) +0.2 (2) + 0.2 (2) + 0.1 (3) + 0.1 (4) + 0.1 (4) = 2.5 bits/symbol. The Huffman code requires 2.5 bitdsymbol which is less as compared to binary code which requires 3 bits/symbol.



2.2 Arithmetic Coding Arithmetic coding generates open and close interval values from 0.0 to 1.0 based on the symbols and their related probability values [6].The open and close interval values for the above example are tabulated in the following table-3. Table 3. Open and close interval values



U



I



!



0.2 0.1 0.1 0.1



0.0 - 0.3 0.3 - 0.5 0.5 - 0.7 0.7 - 0.8 0.8 - 0.9 0.9 - 1.0



It starts encoding the source symbols by reading the symbols one by one and its interval values. Finally, we will get the encoded value of the symbols, which is a floating point number between 0.0 and 1.0 [4]. Let us consider the word “EAII!” for encoding. For each source code, encoding procedure is explained below. Initially After encoding source code



[0,1) E[0.3,0.5) A[0.3,0.36) 1[0.33,0.342) 1[0.336,0.3384) ![0.33816, 0.3384)



 375



Any value between [0.38816, 0.33841 is the encoded value for the word “EAII!”. In the reverse, the encoded value will produce all the source codes “EAII!” by simply applying the decoding procedure. The following Fig. 1 gives the pictorial view of the arithmetic encoding process.



1



a



C



i



I



0.342



I



-



\



I



_ I _



I



t.



0.33



-



m



Figure 1. Different stages of encoding the source code “eaii!”



3



Results



Different sizes Magnetic Resonance Image (MRI), Computer Tomography (CT), Ultrasound, and X-ray images are considered. The images are of size 64 x 64 pixels (5,176 bytes), 128 x 128 pixels (17,464 bytes), 256 x 256 pixels (66,616 bytes) and 1024 x 1024 pixels (IMB). Table-4. lists out the source image name, original image size, number of unique symbols (gray levels) of the source image, compressed image size, time taken for compression and decompression using Huffman coding and Arithmetic coding. Table 4. Huffman Coding versus Arithmetic Coding



 376



4



Discussion



It is pertinent to pinpoint out that the compression ratio of Arithmetic coding is higher than that of Huffman coding when the size of image is greater than or equal to 1 MB. Medical images are having more than or equal to 1024 x 1024 pixels and the corresponding file size will be more than or equal to 1 MB. Hence, it is concluded that the Arithmetic coding plays a key role in analyzing the medical images. References 1. Hankerson D, Harris GA, Johnson PD, Introduction to Information Theory and



Data Compression, CRC Press, (1998) 2. Gonzalez RC, Woods RE, Digital Image Processing, Pearson Education, India, (2002) 3. Huffman DA, A method for the construction of minimum-redundancy codes, Proc. Inst. Radio Eng. 40(9), September, (1952), pp. 1098-1101 4. Moffat A, Neal RM, Witten IH, Arithmetic Coding Revisted, ACM Transactions on Information Systems, 16(3): (1998) pp. 256-294 5. Salomon, Data Compression, 2"dEdition, Springer, (2001) 6. Witten IH, Neal RM, Cleary JG, Arithmetic Coding for Data Compression, Communications of the ACM, 30(6), (1987)



 FUZZY BASED SELECTION OF PATHS IN A DECISION TREE WITH IMPRECISION AND PARTIAL SPECIFICATION RAMESH VASAPPANAVARA



D RAVI



Dept. of Computer Science & Engineering, GVP College of Engineering, Visakhapatnam INDIA ~Iusuut~utiuvu~u~~vah~~. corn E-mail: riime~~h



GAUTAMV



ANAND V I L K , Bangalore, INDIA E-mail: iitununu’(ivuhoo.c.om



BITS Pilani.



Estimation of costs, effort, and duration and other parameters for a large, complex, multi-layered, project management is a difficult process, requiring effective forecasting tools under imprecision and partial truths. This task becomes all the more important when a company is embarking on a new project with no history data base backing or it is a start up company. This paper presents a fuzzy based approach wherein selection of next node in a decision tree is not restricted to predefined node in a DSS tree as in a conventional Decision tree System, but affords selection of virtual decision node lying in between two nodes in a DSS tree. 1 Introduction



One of the most significant problems faced by the project managers of a complex project is accurate forecasts of efforts, durations and costs other project parameters. Unfortunately accurate forecasts and tighter project control stems from competent and mature organizations, Others owing to their lack of history databases and competencies, do not have accurate projections and run the risks of time and cost overruns[3]. The details on building and using expert system is available at [l]. In [2] Henrik Legind Larsen presents an approach to flexible information access systems using Soft Computing in that query criteria is softened by applying a common model for both numeric and non numeric attributes. In [5], Zhiping Fan, Jian Ma has worked on Multiple Attribute Decision Making (MADM) which refers to the problem of selecting among alternatives associated with multiple, usually conflicting, attributes. We refer the reader to [4] for detailed approaches to software project estimations. In this paper we present an Expert System for estimations that combines conventional methods and also soft computing techniques for handling imprecision and uncertainty. 2 Decision Tree and Traversal Mechanisms



The decision nodes in a decision tree can be labeled using attributes of their children. The label refers to the complete list of attributes of the child nodes that helps us in making a decision regarding which node to be included in the decision 377



 378



path in the next step. In Figure 1 we have shown each node having two attributes ,00 & 11 ,which can be used to order the child nodes of a decision node from left to right. The decision tree contains 16 goal nodes and 15 decision nodes . We have also shown decision path comprising of nodes vo, V I , v4, v9 and VZO. The corresponding path identifierbid) is : 00110011. The path is shown as solid line. A matured organization will have access to history data base that would indicate governing function based on LOC and FP metrics for Effort Estimations [4] . Given X , we access the look up table (Table 2 & Fig 1) to ascertain corresponding a & b. For example pid of 001 10011 produces a=.9589 and b= 1.36, the constants required for determining effort for a particular company , for projects with no uncertainty. Sometimes it may not be possible for the user to select all the attributes that determine a child node , thus giving rise to virtual nodes(shown with dotted line)



 1 1 Node



Bit String



00000000 00000011 00001 100



1 1 I A(n)



1 1 .482 .456 ,645



379



B(n)



n



Ji .98



p, I



I



"



Rules



Fig 2: Model for Fuzzy System Logic-Mapping from 2-input variable to 1-out-of-9 output states.



16



I



11111100 11111111



I 2.4157 I



3.091



I 2.5784 I 3.541



Table 1 : look up table for constants a(n) & b(n) 3.0 Fuzzy Threshold Mechanism and Methodology In our model , input member fhctions I;,o, I;,, are fed into Fuzzy System Logic controller(FSLC) (Fig 2). Set of fuzzy rules , specified at Table 2 to any one of nine outcomes indicated, Response is



00



1 THEN



I



Traverseto



Response is



I IF



I



I



IF



I Responseis I Responseis



p 10



Response is



Table 2: Set of fuzzy rules



THEN



Traverse to



I THEN



I Traverse to



-- I THEN 1 Traverse to



 380



1 : Affirmative 0 : Negative Uncertain, No Response (or) : donotknow



-



Table 3:Responses at each stage



3.1 Encoding Decision Paths as Binary Strings The user may supply only part of attribute information, allowing more nodes to be included at each step. In particular, the user may either (a) select a specific attribute of a child, or (b) leave the attribute unspecified, or (c) omit the attribute for inclusion. For allowing such specifications, we must refine our representation of the child nodes using binary representation to include virtual nodes , those that are not in the initial decision tree , to be specified. The virtual nodes are those that share some attributes with one child node and the rest with other child nodes in the initial decision tree. (We exclude the case in which all the attributes are omitted or left unspecified.) In fact, for a decision node with a list of k decision attributes, we can allow as many as the 2k child nodes (including the virtual nodes and those in the initial tree). Thus, at each step, we take the nodes in the next step to be the union of all the strings, whose attributes match the specification by the user. However, we now have to construct decision attributes for each virtual node for subsequent steps. In Figure 1 we have shown the virtual nodes that are traversed by dotted line. . Figure 1 and Table 3 illustrate traversal based on responses and boundaries, and information stored at each stage/level. For example to reach the virtual node V' , the table indicates , the response required from the user is 01 and V' has lower bound of V1 and an upper bound of Vz.



3.2 Determination of variables in a Effort Estimation Function - An Example. The virtual nodes generated in the decision paths result in nodes at leafnode level that do not coincide with the goal nodes in the initial decision tree. For each such node at the leaf-node level, we create an additional Table 3: upper and lower bounds goal node, The lexicographic ordering on the extended binary string encoded paths render the goal nodes with a natural Nod



Respons



LB



UB



 381



ordering that is compatible with the ordering they have as child nodes of their parents. It now remains to find a way to choose suitable values of the parameters for the additional goal nodes that have been introduced. For this purpose, we suggest that the parameters be interpolated such that between any two consecutive goal nodes in the initial decision tree, the interpolation curve is the monotonic. That is, for two consecutive real goal nodes, nl and n2, if a(nJ and a(n2) are the values of the parameter a then, for all virtual nodes n and n ’ with nl = n = n ’ =, n2, a(n,) = a(n) = a(n ’) = a(n2). Similarly, for the other parameter, b. This method allows us to define the values of the parameters for all goal nodes including those generated newly by allowing virtual nodes. Let the decision path under uncertainty and imprecision be represented(Figure 1) by dotted path vo, v’, v4, vz and v3, The corresponding path identifierbid) is : 01111001. From Table 3, it can be detected that virtual node (V3) boundaries are Vz0and Vzlwhich are goal nodes with numbers 6 and 7. The values of a(v3) and b(v3) can be interpolated using suitable mapping function. 4.0 Conclusions



Complex projects need firm and well defined plans and to prevent cost and time over runs need effective monitoring and control. This aspect calls for a reliable and accurate forecasts for project estimations like Effort , duration , Complexity , and costs etc. Thus we need an Expert system that provides accurate forecasts under imprecision and partial truths. The prototype presented is a fuzzy based expert system for forecasting Effort Estimation under imprecision . References 1. httu://www.cee.hw.ac.uk/-alison/ai3notes/Expert System 2. Henrik Legind Larsen - An Approach to Flexible Information Access Systems using Soft Computing - 32”d Hawaii International Conference of System Sciences, 1999. 3. Magnr Jorgesen ,Molokken Ostvold “Reasons for Software Effort Estimation Error: Impact of Respondent Role, Information Collection Approach, and Data Analysis Method “ ,IEEE Transactions on Software EngineeringVol. 30, No. 12; December2004, pp. 993-1007 4. Pankaj Jalote ,” An Integrated Approach to Soft ware Engineering” 2 nd Edition , Narosa Publishing House,2004,Chapter-4 (Planning a Software Project),Pg no. 166-170. ISBN - 81-7319-271-5. 5 . Zhiping Fan, Jian Ma - An Approach to Multiple Attribute Decision Making Based on Incomplete Information on Alternatives - 32”dHawaii International Conference of System Sciences, 1999.



 A GRAPH ALGORITHM FOR FINDING THE MINIMUM TEST SET OF A BDD-BASED CIRCUIT GOPAL PAUL’, BHARGAB B. BHATTACHARYA’, AJIT PAL’, AND ANNAPURNA DAS’



’ Dept. of Electronics, CEG, Anna University, Chennai - 600 025



’Dept. of Comp. Sc. & Engg., Indian Institute of Technology, Kharagpur - 721 302 gpau/ca/@yahoo.wrn,bhargab@isica/.ac.in,apa/@cse.iitkgp.emetin,anu279@hotrnai/.corn



The Binary Decision Diagram (BDD) is a powerful vehicle for large-scale functional specification and circuit design. In this paper, we consider the open problem of generating in polynomial time, the exact minimum set (T) of test vectors for detecting all single stuck-at faults in such a BDD-based circuit. We show that, for a single-output circuit, 1 TI = 2k, where k is the minimum number of paths that cover all the edges of the BDD graph. The value of k, and consequently the test set T, can be readily determined by running the max-flow algorithm on a network derived from the BDD, followed by a simple graph traversal. This procedure not only generates the optimal test set in polynomial time, but also obviates the need of employing an ATPG (Automatic Test Pattern Generator) and a fault simulator.



1 Introduction Binary Decision Diagrams (BDDs) are widely used in electrical and computer engineering, namely for VLSI circuit specification, logic synthesis, design verification, testing [l,2, 51. Boolean functions described as BDDs are suitable for direct technology mapping to multiplexor (h4UX)-based circuits and FPGAs [3].It is shown that such a MUX-based circuit can be made fully testable for all single stuck-at faults and path-delay faults by using only one additional input and an inverter [4].Further, the test generation algorithm, which is NP-hard [14] for a general circuit, becomes very simple for this special class of circuits. A simple graph algorithm can be employed to generate the test vectors for detecting all stuckat faults in polynomial time [4].However, the problem of generating in polynomial time, a minimum-size test set for a BDD-based circuit, remains still open. In this paper, we consider the problem of generating the exact minimum set (T) of test vectors for testing all single stuck-at faults in a BDD-based circuit. We show that, for a single-output circuit, TI = 2k, where k is the minimum number of paths that cover all the edges of the BDD graph. The value of k, and consequently the test set T, can be readily determined by running the max-flow min-cut algorithm [6,101 on a flow network derived from the BDD, followed by a simple graph traversal. This procedure not only generates the optimal test set in polynomial time, but also obviates the need of employing an ATPG and a fault simulator. Experiments on benchmark circuits demonstrate that the proposed technique provides significant reduction of the number of test patterns compared to those generated by an ATPG tool. For multi-output circuits, the procedure requires slight enhancement.



I



382



 383



2 Preliminaries A Boolean function F(xl,x2, ..., x,,) can be represented by a BDD, which is a directed acyclic graph (DAG) with one root node and two leaf nodes labeled as 0 and 1 [I, 2, 51. Except the two leaf nodes, every other node denotes a variable xi, and has exactly two outgoing (decision) edges labeled 0 and 1, which represent the corresponding Shannon decomposition at that node for x, = 0 and 1 respectively. Each directed path from the root to the leaf node 1 (0) (called a complete path), represents a product term of F( where the polarity of a variable is determined by the label of its decision edge along the path. An example of a BDD with the variable ordering (A, B, C, D, E } is shown in Fig. 1, where the solid (dashed) arcs represent the I (0) edges. The nodes are labeled as 1, 2, ..., 13; the root node is labeled 1, and the terminal nodes 12 and 13 denote F = 1 and 0 respectively. A BDD is called ordered (OBDD) if each variable appears at most once on each complete path, and if the variables appear in the same order in all other complete paths [2]. An OBDD is called reduced (ROBDD) if it is devoid of any isomorphic subgraph or any redundant node. The BDDs used here refer to ROBDDs only.



0,



3 BDD-based Circuit Implementation A BDD can be directly mapped to a combinational circuit implemented with 2-input MUX blocks. Each non-terminal node is replaced by a MUX block, and its control input is connected to the variable associated with that node. The 0 (1) arc emanating from this node is connected to the 0 (1) input of the MUX. The MUX-inputs corresponding to the arcs hitting the terminal node 0 (1) are connected to constant logic value 0 (1). The output of each Mux (representing a node v of the BDD) is connected to the inputs of the MUX-blocks, whose representative nodes are the predecessors of the node v in the BDD. Finally, the output of the MUX that replaces the root node, implements the desired function F. Example 1: A MUX implementation of the BDD of Fig. 1 is shown in Fig. 2. By adding one additional input t and an inverter, a =-based circuit mapped from a ROBDD can be made fully testable for all single stuck-at faults [4].The input line to be fed by constant logic 1 (0) should be tied to the line t ( 4 Determining the Minimum-Size Test Set In a BDD-based circuit, polynomial-time operations may be employed for test generation [2, 41, although they may not guarantee minimality of the test set. Here we show that the minimum test set can be directly found fiist by finding a maximum set of independent edges known as “strong-cut” in the BDD, followed by certain graph traversal operations.



t>.



4.1 Fault model



We consider the single stuck-at fault model on all lines in the circuit, i.e., the inputs/output/control line of each MUX, and on the line t and all its fanout branches (Fig. 2). To consider faults in the interior of the MUX-blocks (Fig. 3a), we assume each of them be implemented with AND-OR-NOT gates, where the possible fault



 384



sites are shown (Fig. 3b). The stuck-at faults on line 5, 6, 7 can be collapsed to those on 1, 2, 3, 4.The fault line 3 stuck-at-1 is tested by the only test { D x y } = 101, which also tests the stem fault line 8 stuck-at-0. Similarly, the test of line 1 stuck-at1, will test line 8 stuck-at-1. By applying fault equivalence and dominance [9], the fault set can be collapsed to a smaller set, the sites of which are shown in Fig. 3c.



9



6



F



Fig. 1: An example of BDD



Fig. 2: A fully testable realization using a control input t



The complete set of all single stuck-at faults in the entire circuit after collapsing, is now classified into two groups: Type-I: stuck-at-0/1 faults on all inputs to the MUX-blocks, and on the line t; Type 2: stuck-at-1 faults on all the fanout branches of the MUX-control lines (like lines 1, 3 as in Fig. 3c). 4.2 Complete paths and tests The test of a type-1 fault is a vector that sensitizes a path from t to the function output F through that fault site. For example, in Fig. 2, to test the fault on line u stuck-at-0, one may apply the vector { t E D C B A } = 1 - 0 1 1 1. For the fault u stuck-at 1, a test can be found just by flipping the value oft, i.e., 0 - 0 1 1 1. Thus, traversing a complete path through the BDD covering the edge u, and noting the required values of the control lines will suffice; the two tests (for u stuck-at-0 or 1) can then be determined by fixing t = 1 or 0. To test all type-1 faults, therefore, we need to consider faults on all the edges of the BDD. Just two vectors determined by a complete path as above can test further, all type-1 faults appearing on this path. A complete path P in a BDD is said to cover an edge e if P passes through e. Thus, we have the following theorem:



 385



Theorem 1. The minimum number of tests that detect all type-1 faults in a singleoutput BDD-based circuit is 2k, where k is the minimum number of complete paths that cover all the edges of the BDD. Y



X



X



V



x v



D Z



o stuck-at-0



QD represents stuck-at-0 or stuck-at-1 fault



(a)



(b)



0



Z



stuck-at-1



(c)



I Z



Fig. 3: Stuck-at faults in a BDD-based circuit In order to detect a type-2 fault on the control line of a MUX, a complete path P should be chosen through this path with a special property. To demonstrate this, consider a fault on the control line D in the MUX labeled 7 in Fig. 2. We choose a complete path 1-2-4-7-10, which passes through the MUX node 7. To detect the type-1 fault line q stuck-at-1 requires the test { t E D C B A ) = 1 0 1 1 1 1 or 0 1 1 1 1 1. Therefore, the test set for type-1 fault must include any one from the above pair. However, only the first vector can test the type-2 fault (like line 3 stuck-at-1 as in Fig. 3c) on the 0-branch of the control line D,not the second one. In other words, the variable E must be set to 0, so that the fault-free complete path and the faulty path (in the presence of the above type-2 fault) arrive at the two complementary leaf nodes of the BDD. This vector will also detect the stuck-at-0 fault on the control line D. If the BDD is reduced, such paths, and in turn test vectors, will always exist. We here conjecture that the k paths for detecting type-1 faults can be appropriately chosen to satisfy the above complementary property for each of the MUX-blocks. 4.3 Determining k by jlow algorithm We now consider the problem of determining k, the minimum number of complete paths that cover all the arcs of the BDD. This is equivalent to finding the maximum independent set of nodes in a comparability graph, a solution of which can be found in polynomial time by running a flow-based algorithm [lo]. The max-flow problem can be solved by several well-known algorithm [6]. The algorithm by Goldberg-Tarjan [ 111 computes the max-flow in time O(nm logn2/m), where n (m)is the number of nodes (arcs) in the flow graph. Since in a BDD, the outdegree of each node is at most 2, the complexity will be reduced to O(n2 logn), where n is the total number of nodes in the BDD. 4.4 Test generation by graph traversal Once the strong-cut is known, a simple linear traversal algorithm can be employed to determine k complete paths, which cover all the arcs in the BDD. In such a cover,



 386



each arc of the cut will appear exactly once on only one path. For each of these paths, 2 tests can be found directly as discussed in Section 4.2. These 2k tests will detect all type-1 faults. In order to cover all type-2 faults by the same set of 2k tests, each selected path should also satisfy the complementary property. To select such paths, O(n2) additional search time may be required. Hence, the overall complexity of generating the minimum test set including flow analysis becomes O(n2logn). 5 Experimental Results We have tested our algorithm on several LGSynth’91 benchmark circuits [ 121, the ROBDDs of which are generated using the CUDD package [8].To solve the maxflow problem we have used the RELAX4 routine developed by Bertsekas and Tseng [7]. Our traversal algorithm then generates the 2k test vectors for each of these benchmark examples. Results are compared with those generated by the SIS1.2 ATPG tool of Berkeley [13]. For larger circuits the result shows around 45% reduction in the number of test patterns, while overall average reduction is above 30%. For multi-output functions, the procedure requires slight modification. 6 Conclusion A new algorithm has been described for generating the minimum-size test set for detecting all single stuck-at faults in a BDD-based circuit. The method does not need an ATPG. Further work will be reported later for handling path-delay faults. References [ l ] Akers, S. B., Binary decision diagrams, IEEE Trans. Comput. C-27 (1978) pp. 509-516. [2] Bryant, R. E., Graph-based algorithms for Boolean function manipulation, IEEE Trans. Comput. C-35 (1986) pp. 677-691. [3] Gunther, W. and Drechsler, R., ACTion: Combining logic synthesis and technology mapping for MUX-based FPGAs, J. Systems Architecture 46( 14) (2000) pp. 1321-1334. [4] Drechsler, R., Shi, J. and Fey, G., Synthesis of fully testable circuits from BDDs, IEEE Trans. CAD 23(3) (2004) pp. 440-441. [5] Ebendt, R., Fey, G . and Drechsler, R., Advanced BDD Optimization (2005) Springer. [6] Hochbaum, D., Graph Algorithms and Nehvork Flows (2003) IEOR 266 Notes UCB. [7] Bertsekas, D. P. and Tseng, P., RELAX-IV: A Faster Version ofthe RELAX Codefor Solving Minimum Cost Flow Problems, http://web.mit.edu/dimitribJwwwlRELAX4.tt. [8] Somenzi, F., CUDD: CU Decision Diagram Package, http://bessie.colorado.edu/-fabio/. [9] Abramovici, M., Breuer, M. A. and Friedman, A. D., Digital System Testing and Testable Design (1990) Computer Science Press, New York. [ 101 Kagaris, D. and Tragoudas, S., Maximum independent sets on transitive graphs and their applications in testing and CAD, Proc. ICCAD (1997) pp. 736-740. [ 111 Goldberg, A. V. and Tarjan, R. E., A new approach to the maximum-flow problem, Journal of the ACM 35(4) (1988) pp. 921-940. [12] S. Yang, Logic Synthesis and Optimization Benchmarks User Guide (1991) Tech. Rep., Microelectron. Center of North Carolina. (131 Sentovich, E., et al., SZS: A System for Sequential Circuit Synthesis (1992) Tech. Rep., Univ. Calif., Berkeley, CA. [ 141 Garey, M. R. and Johnson, D. S., Computers and Intractability: A Guide to the Theory of NP-completeness (1979) W. H. Freeman and Co., USA.



 PARTICLE SWARM OPTIMIZATION BASED PI CONTROLLER TUNING FOR FERMENTATION PROCESS K.VALARMATH1 AND D.DEVARAJ A. K. College of Engineering, Anand Nagar, Kri‘shnankoil, Tamilnadu, India [email protected] T.K.RADHAKRISHNAN National Institute of Technology, Trichirappalli, Tamilnadu, India Abstract. The activities related to Fermentation process are uncertain and nonlinear in nature. It poses many challenging characteristics like multivariable interactions, unmeasured state variables and time varying parameters. Proportional Integral Derivative control is a control strategy that has been successfully used over many years for the fermentation process. It has been noticed that PID controllers are often poorly tuned. The Ziegler Nichols tuning method is designed to give g o d response but not satisfactory for many systems, because the systems are very sensitive to parameter variations. Additionally, ZN methods are not easy to apply in their original form on working plants. To overcome this problem Particle Swarm Optimization Technique has been proposed to tune the optimal values of PI constants. The performance of these controllers is compared on the settling time and Mean Square Error for various set point and trajectoty response of the fermentation process with traditional tuning method.



1 Introduction Recently Evolutionary computation techniques have been proposed to tune the PID controller by taking into account all non-linearities and additional process characteristics. Oliveira et a1 [ 13 used a standard GA to determine initial estimates for a variety of classes of linear time-invariant (LTI) system Porter and Jones [2] proposed a GA-based technique as a digital PID controller. Wang and Kwok [3] tailored a GA using inversion and pre selection of ‘micro-operators’ to PID controller tuning. Despite the simple concepts involved, GA make leads premature convergence and also it takes long time to reach the solution. At this point appears the idea of using PSO [4]to obtain a PID tuning that meets all the requirements of minimization of performance index by the designer. PSO can be easily implemented and it is computationally inexpensive, since its memory and CPU speed requirements are low [ 5 ] . It uses only primitive mathematical operators. In this paper, we propose adaptive tuning of PI controller using PSO for the control of continuous fermenter. It needs regulation of either the cell, substrate or product concentration. Here the feed substrate is employed as a manipulated input and the product concentration as a controlled output. The Integral square error is used as the cost function. 387



 388



2. Mathematical Model and PSO Implementation for PI Tuning The mathematical model of fermentation process consists of different models like structured and unstructured models. The differential equations governing the operation of the fermentation process [6] is given by X =-DX+pX



(1)



1



S=D(Sf -S)--pX Y XIS



(2)



P = -DP+ (ap +p)X



(3)



S



a



(4)



Kmtst-



Ki



The process control perspective the cell mass yield Yws and maximum specific growth rate p,,, is unmeasured disturbances because they may exhibit time varying behavior. Here feed substrate is employed as manipulated input and productivity of the fermenter is regulated by the product concentration. In this paper, the problem of parameter tuning is formulated as an optimization problem and PSO is applied to identify the optimal parameters. The objective of the optimization problem is to minimize the oscillations and achieve a quick settling. While applying the PSO for any problem, the issues to be addressed are a) Solution representation and b) Formation of evaluation function



3. Simulation Results and Discussion The closed loop PI controller cascaded with the process is tuned for the optimal values of kp and ki using PSO algorithm. Experiments were conducted with +lo% variations in growth rate and cell mass yield. The design parameter is specified in Table.l.The optimal PSO settings are Number of generations -10, Population size -10, w - 0.4 to 0.9 & cI, c2. 2 Corresponding settling time, integral square error, controller parameters and computing time are computed for PI controller using IMC, GA and PSO. From the response of Fig. 1 and Fig. 2 for -10% disturbances in max growth rate it is observed that conventional controller has not settled and exponentially deviates from the settling time, which is minimum in PSO tuned method. Fig. 3 shows the response of set point changes in product concentration.



 389 Table 1. Design parameters for fermentation process Process variables Variables Nominal value Process Input D-Dilution rate (manipulated variable) 0.15h-’ variables +Feed substrate concentration 20 g/I X-Cell concentration 7.038 g/l 2.404 g/l Process S-Substrate concentration State variables P-Product concentration 24.81 gil p - Specific growth rate 0.4g/g YX/S.Cell-mass yield Process state parameters p, .max. specific growth rate 0.48 h-’ P,. Product Saturation Constant 50 gn K, - Substrate Saturation Constant 1.2 g/1 K, - Substrate Inhibition Constant 22 f l a - Kinetic Parameters 2.2 g / g 0.2h-’ p - Kinetic Parameters



Setpoint Changes in product concentration



-pso-pi



-@-pi



-imp



21.4 24.2



21



23 8 8235 s j 2 233 2I



: -



-



222;



22.6 $224



e



22.2 22



21.6 21.6 21.4 21 2



I



1 11 21 31 41 51 61 71 81 01



line [see]



I



Fig. 1 & 2. Comparison of product concentration and Feed substrate conc. for -10% disturbances in Max.growh rate



21



Fig. 3. Set point changes in Product concentration



References [ 11. Oliveira, P., Sequeira, J., and Sentieiro, J., “Selection of controller parameters using genetic algorithms”, Engineering Systems with Intelligence. Concepts, Tools, and Applications, Kluwer Academic Publishers, Dordrecht, Netherlands, 1991, pp.43 1-438. [ 2 ] . Porter, B., and Jones, A.H., “Genetic tuning of digital PID controllers”, Electronics Letters, Vol. 28(9), 1992, pp. 843-844. [3]. Wang, P., & Kwok, D. P. (1992),“Auto tuning of classical PID controllers using an advanced genetic algorithm”. IEEE Proceedings, 1224. [4].Kennedy, J., and Eberhart, R. C. (1995). “Particle Swarm Optimization” . Proc. IEEE International conference on Neural Networks (Perth, Australia), IEEE Service center, Piscataway, NJ, pp. IV: 1942 - 1948. [5]. Parsopoulos K.E.,Vrahatis M.N., “Recent approaches to global optimization problems through Particle Swarm Optimization”, Natural Computing 1: 235-306, 2002. [6]. Atsushi Aoyama, Venkat venkatasubramanian , “Internal model control frame work using neural networks for the modeling and control of a bioreactor”, Engng Application of Artificial Intelligence, vo1.8.,N0.6.pp.689-701,1995



 SOFTWARE RELIABILITY GROWTH MODELING FOR EXPONENTIATED WEIBULL FUNCTION WITH ACTUAL SOFTWARE FAILURES DATA M. U. BOKHAFU' AND N. AHMAD' 'Department of Computer Science, Aligarh Muslim University,Aligarh, India Email: muhokhuri-'01)4(ilivahoo.c.o. in, [email protected], 'University Department of Statistics and Computer ApplicationsiT. M Bhagalpur University,Bhagalpur-812007, India, Email: nesur hjg)(hjVuhoo.co.in A number of testing-effort functions for modeling software reliability growth based on non-homogeneous Poisson process (NHPP) have been proposed in the past decade. Although these models are quite helpful for software developers and have been widely applied in the industries or research centers, we still need to put more testing-effort function into software reliability growth model for accuracy on estimate of the parameters. In this paper, we will consider the case where the time dependent behaviors of testing-effort expenditures are described by Exponentiated-Weibull ( E V curves. Software Reliability Growth Models (SRGM) based on the NHPP are developed which incorporates the EW testing-effort expenditure during the software- testing phase. It is assume that the error detection rate to the amount of testing-effort spent during the testing phase is proportional to the current error content. For the models, the software reliability measures such as the number of remaining faults, fault detection rate and the estimation methods of the model parameters by Least Square and Maximum Likelihood are investigated through three numerical experiments on real data from various software projects. The evaluation results are analyzed and compared with other existing models to show that the proposed SRGM with EW testing-effort has a fairly better faults prediction capability and it depicts the real-life situation more faithfully. This model can be applied to a wide range of software system. In addition, the optimal release policy for this model, based on reliability criterion is discussed.



1.



Introduction



In recent years, software has permeated industrial equipments and consumer products. Generally, a lot of development resources are consumed by software development projects. During the software-testing phase, software reliability is highly related to the amount of development resources spent on detecting and correcting latent software errors, i.e. the amount of test effort expenditures. Therefore, software reliability is the probability that the given software functions correctly under a given environment, during the specified period of time. It is a key factor in software quality. Yamada et al.[1,2] proposed software reliability growth models incorporating the effect of test-effort expenditures on the software reliability growth in order to develop a realistic model for software reliability evaluation. They described the time-dependent behaviour of test-effort expenditures by using Rayleigh, Exponential, and Weibull curves respectively. 390



 391



In this paper, we describe the time-dependent behavior of test-effort expenditures by the EW model [3], since its curve is a flexible with a wide variety of possible expenditure patterns in real software projects. Currently, there are few studies for the use of the EW failure model in reliability and survival analysis [4], so this paper is to promote its use in software reliability analyses. A software reliability growth model based on NHPP is developed [ 5 ] and its parameters are estimated by Least Square Estimation and Maximum Likelihood Estimation methods. Experiments have also been performed based on three real software data and the results show that the proposed SRGM with EW testingeffort function is wide and effective models for software and is more realistic. Comparisons of predictive capability between various models are presented and the results show that the SRGM with EW testing-effort function can estimate the number of initial faults better than the other models. 2.



EW Testing Effort Function



We offer an EW curve as the Test-Effort Function (TEF) due to its flexibility in describing the test-effort expenditure patterns. The TEF, representing cumulative test resource expenditure at time [0, t] given by an EW curve is



where a, j3, m and 8 are constant parameters, a is the total amount of test-effort expenditures, j3 is the scale parameter, and m and 8 are shape parameters. The current testing effort expenditure at testing time t is



In particular, when 8 = 1 we have Weibull testing -effort function, for 8 = m = 1 it become Exponential testing effort function, for 8 = 1, m = 2, it represents Rayleigh testing effort function. 3. Software Reliability Growth Model Following the usual assumption in the area of SRGM [ 1,2], we assume that the no. of detected error to the current test effort is proportional to the current error content . let m(t) represent the expected mean number of errors detected by testing calendar time t which is assumed to be a bounded non-decreasing function o f t with m (0) = 0. Then, using the EW test-effort function in (l), we have :



where a is the initial error content in the system, and r is the error detection rate per error. Solving the differential Eq. (3), we have



 392



The conditional Software reliability function (Software Release-Time Based on Reliability Criteria) after the last failure occurs at time t is R = ~ ( ~= e-[m(r+x)-m(Ol 1 ~ )



4.



and In R = -[m(t



+ x) - m(t)]



(5)



Fitting The Model to Real Software Data and Analysis



First Data Set: The first set of real data is from the study by Ohba[6]. In order to estimate the parameters a, p, m and 8 of the EW distributed function, we fit the actual testing-effort data into Eqs. (1) and (2) and solve it by using the method of least squares. These estimated parameters are: n



A



6 =112.9117, p=0.000031, %



=2.8114,and 8 =0.3968 Fig.( 1) shows the fitting of the estimated testing-effort. Using the estimated parameters a, p, m and 0 the other parameters a, r in (4) can be solved by a = 565.643, r = .01965 MLE method for these failure data: For these estimates, the optimality was checked numerically. From Fig.( 1) and the comparison criteria in Table 1 shows that our SRGM is better fit than the other models for PL/l application program. In addition, substituting the llm



estimated parameters p , m and 8 in equation of t,, the testing effort function reaches the maximum at time t = 14.6596 weeks which corresponds to w(t) = 2.679643 CPU hours and W(t) = 36.26993 CPU hours. Besides, the expected number of errors removed up to this time t,, is 565.1887 Suppose this software system is desired that this testing would be continued till the operational reliability is equal to 0.85 (at At = 0.1). From Eqs. (5) and (4) we get t = 46.0162 weeks. If the desired reliability is 0.95, then t = 57.4856 weeks. If the desired reliability is 0.99, then t= 69.2183 weeks.



Second Data Set: The second set of real data is the pattern of discovery of errors by Tohma et al.[7]. Following the same procedure as the first data set, the estimated values of parameters are: n



A



2 =95.3087, p=1.53E-07, % =5.41, 8 = 0.2974 a = 94.7817 and r = 0.0253 From Fig.(2) and Table 2, we conclude that our proposed model gets reasonable prediction in estimating the number of software faults and fits this data set better other than. The testing effort fimction reaches the maximum at time t = 16.59069771 debug days which corresponds to w(t) = 5.41064598 CPU hours and W(t) = 70.78228488 CPU hours. Besides, the number of errors



 393



removed up to this time t,, is 78.969. We get testing time t = 20.698 days, if reliability is equal to 0.95. If the desired reliability is 0.99, then t=23.5131 days



Third Data Set: The third set of real data in this paper is the System T1 data of the Rome Air Development Center (RADC) projects and cited from Musa et al.[8] . Following the same procedure as the first data set, the estimated values of parameters of the EW testing-effort function are: A



A



& =26.83, P=6.165E-09, 2 =6.5508, 6 =0.95, a=133.8737 and r = 0.15465 From Fig. (3), and Table 3 shows that our SRGM is better fit than the other models for debugging data. and the testing effort function reaches the maximum at time t = 17.8456 debug days which corresponds to w(t) = 3.52347 CPU hours and W(t) = 17.1039 CPU hours. Besides, the number of errors removed up to this time t,, is 124.3613. If the desired reliability is 0.92 (0.98), then t = 20.4325 (22.0548) weeks.



Conclusion In this paper, we have discussed a SRGM based on NHPP, which incorporates EW testing-effect expenditure. We conclude that the EW testing-effort function can be used to represent a software reliability growth process for a wide range of testing effort curve and give a reasonable predictive capability for the real failure data. We also observe that the EW testing-effort curve gives better estimates than Exponential, Rayleigh, and Weibull type consumption curves.



References 1. 2.



3. 4. 5. 6. 7.



8.



Yamada S.,Ohtera H., and Norihisa H., Software Reliability Growth Model With Testing -Effort, IEEE Trans. on Reliability, R-35( 1986), 19-23. Yamada S., Hishitani J. and Osaki S., Software Reliability Growth Model With Weibull Testing-Effort: A Model and ApplicationJEEE Tramon Reliability,R42( 1993),100-105. Mudholkar G.S., and Srivastava D..K., Exponentiated Weibull Family for Analyzing Bathtub Failure-Real Data, IEEE Trans. on Reliability, 42 (1993), 299-302. Nassar M.M., and Eissa F.H., Bayesian Estimation for The Exponentiated Weibull Model, Communication in Statistics-Theory and Methods, 33(2004), 2343-2362. Ascher H., and Feingold H., Repairable Systems Reliability: Modeling, Inference, Misconceptions and Their Causes, (Marcel Dekkes, New York. 1984). Ohba M., Software Reliability Analysis Model IBM J. Res. Develop., 28(4)(1984) , 428-443. Tohma Y., Jacoby R., Murata Y., and Yamamoto M., Hyper-Geometric Distribution Model to Estimate the Number of Residual Software Fault, Proceeding of COMPSAC-89, (Orlando, 1989), 610-617. Musa J.D., Iannino A. and Okumoto K., Software Reliability: Measurement, Prediction, Application, (McGraw-Hill.(1987)).



 Table 1 - Comparison results of the 1st data



Observedestimated current TEF vs Time Figure 1



Table 2 - Comparison results of the 2nd data



Observedestimated current TEF vs Time Figure 2



 395



Model



a



R



MSE



Eq.(4) ofEW Model G-0Model Exponential Model Rayleigh Function Delayed sshaped Model



133.87



0.154



78.551



142.32 137.2



0.124 0.156



2438.3 3019.7



866.94



0.00962



237.196



0.0963446



89.240 9 245.24 6



2



4



6



8



10



12



14



16



I8



Time (wek)



Observdestimated current TEF vs Time Figure 3



 This page intentionally left blank



 Speech Recognition



 This page intentionally left blank



 RECOGNITION OF FACIAL PATTERN BY MODIFIED KOHONEN’S SELF ORGANIZING MAP (MKSOM) AND ANALYZE THE PERFORMANCE S. M. KAMRUL HASAN Department of Computer Science & Engineering, Rajshahi University of Engineering & Technology. Rajshahi-6204, Bangladesh E-mail: khraieehG&ahoo.ca [‘I MD. NAZRUL ISLAM MONDAL[’], NAFISA TARANNUVI[~],MD. TANVIR AHMED CHOWDHURY 13] [‘*21 Department



of Computer Science C? Engineering, Rajshahi University of Engineering & Technology, Rajshahi-6204, Bangladesh Department of Computer Science, Premiere University, Chittagong, Bangladesh E-mail: ninihd(dvahoo.com, mitunhitr~,vahoo.com,tunv ir@,vahoo.com



The candidate system highlights the modification of ordinary face recognition strategy. Real valued face pattern remove the undesirable changes of input due to the shifting and intensity variation. The two stage system developed by Modified KSOM technique allows identifying the face patterns at various poses. Modification developed in the determination of neighborhood size and consideration of existing patterns. Modified technique allows a high performance learning strategy and optimum stability of the network. Input of the system is a function of gray level. Former stage is concerned with the conversion of visual pattern into a raw format to process with the MKSOM network. The processed pattern is the input vector for neural network. The system also deals with performance measurement of the MKSOM with some other existing pattern recognition techniques. Modification made with the pattern of input which avoids the traditional binary input. This approach extremely minimize the learning time comparing with the existing pattern recognition system. By avoiding propagation, it is possible to minimize the computation and weight adoption. MKSOM maintains the individuality of patterns with its weight set.



Major Area: Mainly the paper focuses on the practical security system. But the research is based on the practical implementation of an intelligent system which is developed to identify and recognize a human face i.e. a person from some learned data. Concept of the work comes from the structure of brain which is called Neural Network which is the major area of the research. Keywords: MKSOM, Winner Node, Masking, Adopted Weight, Backpropagation. 399



 400



1



INTRODUCTION



The human ability to recognize faces is quite remarkable and to infer intelligence or character from facial appearance is suspect. The face is the focus of first attention to recognize a person. We can recognize thousands of faces learned throughout our lifetime and can recognize familiar faces at a glance even after years of separation. But this skill is not useful despite if there is a change in the faces due to viewing conditions, expression, aging, and distractions such as glasses, beards or changes in hairstyle. Face recognition has become an important issue in many applications such as security systems, credit card verification and criminal identification. Although it is clear that People are good at face recognition, it is not at all obvious how faces are encoded or decoded by the human brain. Researchers are working for a long time to develop a computer system, which can act as a human being. The system will perform visual sensing like identifying objects such as home, office, motor car, faces of people, Handwritten or printed words and aural sensing like voices recognition. But the computers can perform repetitive actions like calculating the sum of few hundred digit numbers and poorer at the task of processing the vast quantities of different types of data a vision system requires. Instead of these difficulties the researchers are still working for systems that are artificially intelligent in any general sense that would recognize. Human face recognition by computer system has been studied for more than twenty years. Unfortunately developing a computational model of face recognition is quite difficult, because faces are complex, multi-dimensional visual stimuli. Therefore, face recognition is a very high-level computer vision task, in which many early vision techniques can be involved. So, we develop an automatic system to find neutral faces in images for face recognition. A neutral face is a relaxed face without contraction of facial muscles and facial movements. It is the states of a person’s face most of the time, i.e. it is the facial appearance without any expression. 2



SYSTEM OUTLINE



In face recognition system, it needs to learn the machine about the facial image of the human being, which the machine can recognize further. Any facial image is learnt in some pre-defined ways and stored into a knowledge base. While identifying the face, previous stored values are used. Thus the system contains two phases 1.



2.



Learning Phase Recognizing Phase



 401



2.1 Learning Process The first task is to learn the machine about the feature vectors of the input pattern. We followed six basic steps to learn the machine about any pattern: 1. 2. 3. 4. 5. 6.



Acquiring an image Pre - Processing Filtering the image Feature extraction Training Save the pattern to knowledge base



The input images, which have been taken, may be 24-bit colored or 8-bit grayscale. Actually, maximum types of pattern recognition system need not to consider the color information of the image. Moreover, it is complex to work with a 3 dimensional matrix. So, for the face recognition system, it is better to use a grayscale image than color image. The system considers both color and grayscale image, but convert the color image into a grayscale. Input images may contain lots of noises due improper acquisition i.e. scanning or getting picture through digital camera. So dusts and scratches may appear inside the acquired image. These types of dusts alter the structure of the pattern and make disturbance throughout the process, especially during face detection. So it is essential to remove the uncertain noises by image filtering. It is better to use Lowpass Median Filter for noise removing which is used in the system. Original images do not contains only the human face; it may contain a background and blank portions by the four sides. So it is essential to extract the exact face from the image and where there is a face detection; there needs to detect the edge of the image. By the edge detection the background is dark and only the edge of the facial image exists. Edge detection is very usehl for detecting the actual face properly from the input image. Several methods may be used for edge detection purpose. As for example - prewitt, sobel, roberts, laplacian, canny, zero crossed etc. Among these filtering techniques, Prewitt filter is used for our research because it is more appropriate than other filtering techniques. After detecting the edge of the face of the image it is very easy to extract the actual face of the image. 2.2 Feature Detection Algorithm



Steps for determining the boundary of the face are as follows SET BackGr := Img[O][O]; SET Thresh := (MIN(Img[i]Lj])+MAX(Img[i]Lj]))/2; where i=0,1,2, ...... ,M and j=0,1,2 ,.....,N 3. SET Counter K = O , L:=O, X:=M/2; 4. REPEAT Step 4 to 5 until Threshl(BackGr-Img[K][N/2])and KIM 5. SET K:=K+l; 1. 2.



 402



6. 7. 8. 9. 10. 11. 12. 13. 14.



SET TopBound=K; REPEAT Step 7 to 8 until Threshl(BackGr-Img[M/2][L])and LIN SET L:=L+l; SET LeftBound=L; REPEAT Step 10 to 11 until Thresh<(BackGr-Img[M2][X]) and L a SET X:=X-1; SET RightBound=X; SET BotttomBound:=TopBound+ROLJND((RightBound-LeftBound)* 1.40); SET Height:=BottomBound-TopBoundand Width:=RightBound-LeftBound;



3



GRAPHICAL VIEW OF FACE DETECTION



1



Figure 1.1



1



1



Figure 1.3



Figure 1.2



I



I



I



,loo%



I Figure 1.4



Figure 1.5



Figure 1.6



Figure 1 , This is a caption for figure one This is a caption for figure one This is a caption for figure one and one.



 403



Learning has been started from taken the input from a file and fit it to the Modified Kohonen’s Self Organizing Network. Pattern size has been fixed to 5 6 x 4 0 so the number of input node is 2240 which is sufficient. The set of input is processed throughout the network and produced a set of weights and the selected node is called winner node. The grid size is set to l o x 10 that means total 100 output nodes. Each of the input nodes is connected to each of the output nodes. So there are 224000 connection exists between the input node and output node. Also total 2240 loo random weights are needed in this network. The adopted weights and winner node index which are the identities of the facial pattern is stored in a knowledge base. 4



MODIFICATIONS



The basic operation of the Kohonen’s network is to classify the input patterns with a y1 weight matrix where is the number of nodes in input layer and is set of the grid size. Existing learning system considers the previously learned patterns while adopting the weight matrix for the current input pattern that is avoided by the proposed subsystem. We highlights the following modification in existing recognition system -



1. Adoption of Weights: Existing recognition systems deals with the previously stored patterns which are already been learnt. It increases the learning time exhaustively. Learning time for each pattern is a factor of the number of previously learned patterns. But the modified system only tries to operate on the recently given pattern sample. It avoids the previously learned patterns for the swiftness of learning process. 2. Regular recognition system using KSOM offers the modification of weights for all the connections among the two layers. It indicates the static size of neighbors. Due to the rapid change of neighborhood size, number of weight adoption easily decreased with the time. The modified MKSOM system proposes a function for changing the neighborhood size along with the change of the distance of winner node -



PO + 1) = P(t>5



m(4(9



-



4(t - 1))



(1)



EXPERIMENTAL RESULTS



Performance was measured with the existing system that was built on Kohonen’s SOM technique. Performance of the system is nearly equal for the few numbers of patterns. But with the rapid change of input size, performance of the system also changes rapidly. The system was tested with up to 25 sample patterns learn at a



 404



time. Firstly, the learning time of the modified system is challenging. On the other hand, the system was tested with some other patterns which were not previously learned. But some other similar patterns i.e. same pattern with a few noise or on a different pose were learnt by the system. During the accuracy test, patterns were learned 10 at a time. Accuracy of the system was measured by the rate of identification where the patterns were not previously learnt. KSOM +MKSOM



-A-



I



5



2



10



15



20



25



Number of Patterns



Figure 2. This is a caption for figure one This is a caption for figure one This is a caption for figure one and one.



For Maxirrum 100 Patterns



I-A-



KSOM --C MKSOM



Figure 3. Accuracy Measurement Compared with KSOM.



I



 405



6



CONCLUSION



Till now, no intelligent system has been developing which gives hundred percent correct outputs. Always there occur some errors due to the variation of input patterns. Face recognition is a very complicated system because human faces change depending on their age, expressions etc. There are a lot of expressions for a human being. So it is not possible to learn all types of expressions into the network. As a result there is a cause for the unrecognizing. Moreover due to the performance variations of the input device face can not be detected correctly and pattern may change extremely. In this paper we have developed and illustrated a recognition system for human faces using Kohonen self-organizing map (SOM). The main objective of our face recognition system was to obtain a model that is easy to learn i.e. minimization of learning time, react well with different facial expressions with noisy input and optimize the recognition as possible. In our system we used Kohonen self organizing map instead of Backpropagation technique for learning because Backpropagation takes several hours to learn while Kohonen self organizing map takes only a few seconds to learn. So, in this respect our system is very efficient and easy to learn. References



1. Carey S., and Diamond R.: From Piecemeal to Configurational Representation ofFaces, Science 195, pp. 312 313 (1977). 2. Bledsoe, W. W.: The Model Method in Facial Recognition, Panoramic Research Inc. Palo Alto, CA, Rep. PRI: 15 (August 1966). 3. Bledsoe, W. W .” Man-Machine Facial Recognition, Panoramic Research Inc. Palo Alto, CA, Rep. PRI: 22 (August 1966). 4. Fischler, M. A., and Elschlager, R. A., The Representation and Matching of Pictorial Structures, IEEE Trans. on Computers, c-22.1 (1973). 5. Yuille, A. L., Cohen, D. S., and Hallinan, P. W.: Feature Extraction from Faces Using Deformable Templates, Proc. of CVPR (1989). 6. Kohonen, T.: Self-organization and Associative Memory, Berlin: SpringerVerlag (1989). 7. Kohonen, T., and Lehtio, P., Storage and Processing of Information in Distributed Associative Memory Systems (1981). 8. Fleming, M., and Cottrell, G.: Categorization of Faces Using Unsupervised Feature Extraction, Proc. of IJCNN, Vol. 90(2) (1990). 9. Kanade, T.: Picture Processing System by Computer Complex and Recognition of Human Faces, Dept. of Information Science, Kyoto University (1973). 10. Burt, P.: Smart Sensing within a Pyramid Vision Machine, Proc. of IEEE, Vol. 76(8) (1988) 139-153. 11. Commercial Systems and Applications Wed Nov 27 18:01:36 EST 1996 12. 



 406



13. Rafael C. Gonzalez, Richard E. Woods: Digital Image Processing, First Edition (1992). 14. 13. Artificial Neural Networks Technology 15. WRL: http ://www.dacs.dtic.mil/ neural-nets .htmb 16. Simon Haykin: Neural Networks, A Comprehensive Foundation (1999), Pearson education, Inc. Singapore. Chapter 1.9: Historical Notes. 17. J G Taylor: Hand Book for Neural Computation, IOP publishing Ltd. Oxford University Press (1997) 1-7. 18. COGANN-92: International Workshop on Combination of GA and NN, Baltimore, Maryland, USA (1992). 19. Industrial Applications of Neural Networks (research reports Esprit, I.F. Croall, J. P. Mason) 20. Muratori, S. and Rinaldi, S., Limit cycles and Hopf bifurcations in a Kolmogorov type system. Modeling, Identification and Control 10 (1988) pp. 91-99. 21. Sousa Reis C., Monteiro Marques V., Calvario J., Marques J. C., Melo R. and Santos R., ContribuiGZo para o estudo dos povoamentos bent6nicos (substrato m6vel) da costa ocidental portuguesa, Oecologia aquatica 6 (1982) pp. 91-105. 22. Wallach D. and Goffinet B., Mean squared error of prediction as a criterion for evaluating and comparing system models, Ecol. Modelling 44 (1989) pp. 299-306.



 Others



 This page intentionally left blank



 INTRUSION DETECTION SYSTEM (IDS)USING NETWORK PROCESSOR MR. SHETE P.G M. Tech student I1 nd Year, Pune Institute Engineering &Technology, Pune (India) E-mail: [email protected] MR.PATIL R.A Lecturer in E&TC dept. Pune Institute Engineering &Technology, Pune (India) E-mail: ray @ etc.oiet.co.in In the field of Network Security as Intrusion detection system the attack can be identified using a second generation Intel network processor M P 2400,it has high throughput and low latency ,The hardware architecture of the network processor consists of xscale core processor with eight micro-engines and also it has SRAM,DRAM,MAC,LM. The method we can use is correlating attacks targeted to single host. Users can apply microcode assembly or high-level Micro-C to develop program that send packet-processing commands to the eight MEv2 chips. Each micro engine contains four types of 32-bit data path registers, Thus the high severity can be recognized from which attacker has attached.



1.



Introduction



There is no disputing the facts that the number of hacking and intrusion incidents is increasing year on year as technology rolls out. Unfortunately in today's interconnected e-commerce world there is no hiding place: Whether the motivation is financial gain, intellectual challenge, espionage, political, or simply trouble-making, you may be exposed to a variety of intruder threats. Obviously it is just common sense to guard against this, but business imperative. Intrusion Detection Systems are like a burglar alarm for your computer network... they detect unauthorized access attempts. They are the first line of defense for your computer systems.



Intrusion Detection System (IDS) IDS is a program that will detect and inform you of network misuse, this could range from someone attacking your network to your employees running a Quake server or looking at pornography. NP advantages Relatively low cost, Straightforward hardware interface, Facilities to access Memory, Network interface devices Programmable, Ability to scale to higher Data rates, Packet rates 409



 410



etwork processors are a form of multiprocessor-on-a-c~pembedded devices tuned to the task of moving packets in network switches, network routers and network ineerfaces.



3. ARCHITECTURE



thatcan connecttoaco pro~essortoaaawatepaiketpmmcaoi.



FIG. 1



FIG. 2



IXP 2400: ~ r i f XP2400 t ~ ~ network ~ ~ processor features and benefits



P2400 is a highly integrated and power efficient next-generation ne~wor rocessor. It offers wire-speed OC-48 n e t w o r ~ n gdata plane processing as well as ntrol plane capability on a single chip. As shown in Figure 1, each I ~ P 2 4 0 0 c o n t ~ n eight s ~nlti-threadedpacket-processing micro engines, a low-power ~ e n e r a l - p ~ p o sIntel@ e XScaleW micro architecture core, network media and switch fabric interfaces, memo^ and PCI controllers, and interfaces to flash P and pe~pheraldevices. The media and switch fabric interfaces readily connect to i n d ~ ~ y - s t a n dframers, ~d MAC devices, or switch fabric. These inte~acesget networking data into and out of the IXP2400 efficiently.



4. IMPLEMENTATION There are three types of IDS implementations



I . Co~elaeingattacks targeted to single host



 411



2. Correlating attacks spanning multiple destinations 3. Correlating attacks based on Time.



One of the method and its algorithm



Correlating attacks targeted to single host This method will try to identify events that are targeted for a specific host rather than multiple hosts Algorithm For each high seventy event SO. Let FO be a destination that has SO. Select events by splitting the whole data into multiples of “w” time window. For each of this time window Let F1 is list of all such events that has occurred to the same destination FO but before SO. For each event in F1 find the weight. Where weight=no. of occurrences of the event across different time window in “w”. Let T1 be the set of top N weights and C1 be the confidence (formula (weightho of time intervals with F1) * 100) Show that SO would happen if T1 happens with the confidence level C1. 5. CONLUSION Using IXP 2400 Network Processor we can achieve high through put and low packet latency also the Band Width of the packet processing can be increased and thus the network can be protected from high seventy attacks.



References



1. 2. 3. 4.



www.intel.com www.securitvDocs.com www.snort.org ieee potentials



 DEVELOPMENT OF REMOTELY SENSED SPATIAL DATA ANALYSIS AND MANAGEMENT SOFTWARE SYSTEM (RSSDAMSS) N.BADAL, G . K. SHARMA, KRISHNA KANT Abstract: Advances in geographical information systems (GIS) and supporting data collection technology have resulted in the rapid collection of a huge amount of spatial data. However, existing data mining techniques are not able to extract knowledge in a better manner from high dimensional remotely sensed data in large spatial databases, while data analysis in typical knowledge discovery software is limited to non-spatial data. Therefore, the aim of the remotely sensed spatial data analysis and management software system (RSSDAMSS) presented in this article, which is an AM/FM/GIS based multiuser software package and that is able to provide flexible machine learning tools for supporting an interactive knowledge discovery process in large centralized or distributed spatial databases. RSSDAMSS offers an integrated tool for rapid software development for data analysis professionals, researchers as well as systematic experimentation by spatial domain experts without prior training in machine learning or statistics. When the data are physically dispersed over multiple geographic locations, the RSSDAMSS system allows data analysis to be conducted at distributed sites by exchanging control and knowledge rather than raw data through slow network connections. Index Terms - AM/FM/GIS, RSSDAMSS, MLC



I. Introduction



Automated Mapping / Facility Management / Geographic Information System (AM/FM/GIS) has been widely used in various utilities these years[ 11. Because of the large valued demand for engineering design and operational analysis software.



N.Badal is with Computer Sc. & Engineering Department, Kamla Nehru Institute of Technology, KNIT, Sultanpur, UP, INDIA. Email: [email protected] G. K. Sharma. is with the Department of Computer Sc. & Engineering, Indian Institute of Information Technology and ManagemenJIITM, Gwalior, M.P, INDIA. Email: gksharma58 @yahoo.co.in Krishna Kant is with the Department of Computer Sc. & Engineering, Motilal Nehru national Institute of Technology, MNNIT, Allahabad, UP, India. Email: [email protected]



412



 413



A PC-Based multiuser software package developed in 1994 that was useful in many different utilities. On the basis of it, further research on integration engineering applications was conducted. But these were not sufficient for huge data sets gathered from different sources i.e. remote sensing etc. In recent years, the contemporary data mining community has developed a plethora of algorithms and methods used for different tasks in knowledge discovery within huge databases. Some of them are publicly available, and a researcher, who wishes to compare a new algorithm with existing algorithms, or analyze real data, finds the task daunting. Furthermore, as algorithms become more complex, and as hybrid algorithms combining several approaches are suggested, the task of implementing such algorithms from scratch becomes increasingly time consuming. It is well known that there is no universally best data-mining algorithm across all application domains including remotely sensed data. To increase the robustness of data mining systems, one can use an integrated data mining architecture to apply different kinds of algorithms and/or hybrid methods to a given remotely sensed data set. The most common are toolbox architectures where several algorithms are collected into a package, from which the most suitable algorithm for the target problem is somehow chosen. An example is the Machine Learning Library (MLC++) [2], which is designed to provide researchers with a wide variety of tools that can accelerate algorithm development. MLC increase the software reliability and also provide comparisons among different algorithms. This is achieved through a library of C++ classes and functions implementing the most common algorithms based on object-oriented methodology. In addition, the array of tools provided by MLC++ gives the user a good starting basis for implementing new algorithms. Clementine [3] is an another example which is an integrated tool that implements data mining algorithms of two knowledge discovery paradigms, namely rule induction and neural networks. Rule induction includes the wellknown algorithm ID3 [4] over C is for building decision tree. With respect to neural networks, Clementine includes the back propagation algorithm [5], extended with a “pruning” method mainly for classification tasks and a Kohonen network [6] for clustering tasks. Advances in spatial databases have allowed for the collection of huge amounts of data in various GIS applications ranging from remote sensing and satellite telemetry systems, to computer cartography and environmental planning. A sub field of data mining that deals with the extraction of implicit knowledge and spatial relationships not explicitly stored in spatial databases is called spatial data mining. However, it appears that no GIS system with significant spatial data mining functionality is currently available. There



 414



has been some spatial data mining software development, but most systems are primarily based on minor modifications of the previous non-spatial data mining systems. The GeoMiner system [7] is a spatial extension of the relational data mining system DBMiner [S], which has been developed for interactive mining of multiple-level knowledge in large relational databases and data warehouses. The DBMiner system implements a wide spectrum of data mining functions, including characterization, comparison, association, classification, prediction and clustering. By incorporating several interesting data mining techniques, including OLAP (On-line Analytical Processing) and attribute-oriented induction, the system provides a userfriendly, interactive data mining environment with good performance. GeoMiner uses the SAND (Spatial and Nonspatial Data) architecture for the modeling of spatial databases and includes the spatial data cube construction module, spatial on-line analytical processing (OLAP) module, and spatial data mining modules. In addition, different data mining algorithms for spatial data may be implemented in different programming environments. To allow end-users to benefit from multiple spatial data mining approaches, there is a need for the development of a software system, which will integrate all implemented methods in a single environment and thus reduce the user’s efforts in planning their management actions. Precision agriculture is one of the applications, which may prosper from novel spatial data mining techniques. Agricultural producers are collecting large amounts of spatial data using global positioning systems to geo reference sensor readings and sampling locations. It is hoped that these data will result in improved within-field management and lead to greater economic returns and environmental stewardship. However, as it is known, standard data mining methods are insufficient for precision agriculture, because of the spatial dimension of data. Therefore, for precision agriculture and other applications mentioned above, spatial data mining techniques are necessary in order to successfully perform data analysis and modeling. Furthermore, precision agriculture data are inherently distributed at multiple farms and cannot be localized on any one machine for a variety of practical reasons including physically dispersed data sets over many different geographic locations, security services and competitive reasons. With the growth of networks this is often seen in other domains. In such situations, it is advantageous to have a distributed data mining system that can learn from large databases located at multiple data sites. The JAM system [9], intended for learning from such databases, is a distributed, scalable and Portable agent-based data mining software package that employs a general approach to scaling data mining applications. JAM provides a set of



 415



learning programs that compute models from data stored locally at a site, and a set of methods for combining multiple models learned at different sites. However, the JAM software system doesn’t provide any tools for spatial data analysis. Therefore, our software system attempts to support flexible spatial data mining in centralized or distributed scenarios. In addition to providing an integrated tool for more systematic experimentation to data mining professionals, our project aims to offer an easy-to-use data mining software system for non-technical people, usually experts in their fields but with little knowledge of data analysis and intelligent data mining techniques. Our main goal was to construct a test environment for both standard and spatial data mining algorithms, that could quickly generate performance statistics ( e g prediction accuracy), compare various algorithms on multiple data sets, implement hybrid algorithms (e.g. boosting, non-linear regression trees) and graphically display intermediate results, learned structure and final prediction results. To efficiently achieve this goal, we introduced a RSSDAM software system that executes programs developed in different environments (C, C++, MATLAB) through a unified control and a simple Graphical User Interface (GUI). A detailed description of software organization, architecture and functionalities are described in the next Sections. 11. Software Organization and Architecture



Software Organization The organization of the SDAM software system, shown in Figure 1, represents an integration of data mining algorithms in different programming environments under a unique GUI.



Figure 1. An organization of RSSDAM software system under unified GUI



 416



The interface allows a user to easily select data mining algorithms and methods. This interface is implemented using the Visual Development Studio environment. The majority of the algorithms have been developed in the MATLAB programming environment [lo], but some of them have been implemented using C and C++. Many of The implemented algorithms represent our research activities in the past few years. Since MATLAB is an interpreter and cannot generate executable code, we used Visual MATCOM software [l I] and ActiveX controls to incorporate algorithms developed in MATLAB into our system. This allowed us to fully utilize h c t i o n s that could be more quickly developed in MATLAB than in a language such as C/C++. Visual MATCOM software compiles MATLAB functions into corresponding C++ or DLL (Dynamic Link Library) procedures. These C++ procedures as well as original C and C++ implementations of data mining algorithms were then compiled into executable programs or DLL procedures (see Figure 1) which can be easily invoked fiom our software system. Thus, the software system avoids running the slow MATLAB interpreter and uses fast DLL procedures. Using different optimization and configuration options, computation speed can be 1.5 to 18 times faster than using MATLAB [ 1 13. Not all MATLAB procedures can be processed through Visual MATCOM. For these procedures, we established an inter-program connection between SDAM and MATLAB using ActiveX controls. This requires that the MATLAB package is available on a machine running our software. Each ActiveX control object supports one or more named interfaces. An interface is a logically related collection of methods, properties and events. Methods are similar to function calls in that they are a request for the object to perform some action. Properties are state variables maintained by the ActiveX object, and the events are notifications that the control objects forwards back to the main application. Therefore, the software system simply creates a MATLAB object, sends a request to it to perform some function and waits for a notification relating to the performed function. The RSSDAM software system can be run fiom a local machine, or remotely through connection software (e.g. VNC [12] in our case) from any machine in a Local Area Network (LAN) or from any Internet connected machine (Figure 1). At the present stage of development only one user at a time can remotely control the RSSDAM software System on a remote machine. This user can use data from the remote machine and from the user’s local machine (local data). Since the user’s data may be confidential, security services are implemented through the password,



 417



which the user must enter at the beginning of a session. A challenge-response password scheme is used to make the initial connection: the server sends a random series of bytes, which are encrypted using the typed in password, and then returned to the server, which checks them against the ‘right‘ answer. After that the data is decrypted and could, in theory, be watched by other malicious users, though it is a bit harder to snoop this kind of session than other standard protocols. The user can remotely use learning algorithms to build prediction models for each remote data set. These models can be combined later into a new “synthesized” model in order to use the knowledge from all available data sets with the aim of achieving higher prediction accuracy than from using local prediction models built on individual sites. A more advanced distributed SDAM software system includes model and data management over multiple distributed sites under central GUI (Figure 2).



Figure 2. The organization of the distributed RSSDAM software system



In this scenario, each distributed site has its own local data, the SDAM package, file transfer and remote connection software. The user is able to perform spatial data mining operations at distributed data sites from the central GUI machine to build local models without moving raw data through slow network connections. The central GUI allows transferring learned models among sites in order to apply them at other locations or to integrate them to achieve better global prediction accuracy. In general, there are several ways for combining predictions by exchanging models. In first approach, each user i, i = 1,. ..N (Figure 2) uses some learning algorithm on one or more local spatial databases DBi, to produce a local classifier Ci. Now, all local classifiers can be sent to a central repository, where these classifiers can be combined into a new global classifier GC using majority or weighted majority principle. This



 418



classifier GC is now sent to all the individual users to use it as a possible method for improving local classifiers. Another Possibility is that each user i sends to every other user the local classifier Ci. Now, these classifiers Ci can be combined at local sites using the same methods as before. More complex methods for combining classifiers include boosting over very large distributed spatial data sets. One approach is for each boosting iteration to select a small random sample of data at each distributed site. When it is possible we transfer these data sets to a central repository site to build a global classifier. However, this is not possible or desirable for some applications and is inefficient in general. For real distributed learning over many sites, more sophisticated methods for exchanging boosting parameters among dispersed sites is required [ 131, and this is one of our current research focuses. Software Architecture



Due to the complex nature of spatial data analysis and modeling, the implemented algorithms are subdivided to six process steps: data generation and manipulation, data inspection, data preprocessing, data partitioning, modeling and model integration (Figure 3). Since not all spatial data analysis steps are necessary in the spatial data mining process, the data flow arrows in Figure 3 show which preprocessing steps can be skipped. The Figure also outlines how the modules are connected among themselves, how they use the original data, and how they manipulate the data with intermediate results and constructed models. An important issue in our software design is an internal file organization to document the results of SDAM processes. Two types of process are documented: pre-modeling and model construction. To save pre-modeling information, the following files are saved: 1) An operation history file, which contains the information about the data file from which the resulting file is constructed, the performed operation, the name with its parameters, and also the eventual resulting parameters of the performed operation. 2) The resulting file containing the data generated by the performed operation to document the model construction process, two files are saved for every model: I) A model parameters file with sufficient information for saving or transforming the model to a different site. 2) A model information file contains all information necessary to describe this model to the user.



 419



Figure 3. Internal architecture of RSSDAMSS



111. Software Functionalities



The SDAM software system is designed to support the whole knowledge discovery process. Although SDAM includes numerous functions useful for non-spatial data, the system is intended primarily for spatial data mining, and so in this section we focus on spatial aspects of data analysis and modeling. Description of the RSSDAMSS Functions The functions of RSSDAMSS are illustrated in figure 4. Graphic maps acts as a basic GUI of RSSDAMSS, through which user can perform most of manipulations including all the engineering functions [ 141. The distribution database system (DDBMS) not only maintains the distribution database but also serves the requirements fiom the GUI. At this stage it also include the function of the following sub tasks Data generation and manipulation Data inspection Data preprocessing Data Normalization Feature selection and extraction Data partitioning



 420



User authority management provides multiuser support. Users can customized and extend RSSDAMSS by data dictionary management. It also includes the error handler and diagnosis subsystem for robustness.



Figure 4. Functions of RSSDAM Software System



IV. Conclusion and Future Work



This paper introduces a distributed software system for remotely sensed spatial data analysis and management in an attempt at providing an integrated software package to researchers and users both in data mining and spatial domain applications. From the perspective of data analysis professionals, numerous spatial data mining algorithms and extensions of non-spatial algorithms are supported under a unified control and a flexible user interface. We have



 42 1



mentioned several problems researchers in data mining currently face when analyzing huge spatial data, and we believe that the RSSDAM software system can help address these. On the other side, RSSDAMSS methods for spatial data analysis and management are available to domain experts for real-life spatial data analysis needed to understand the impact and importance of driving attributes and to predict appropriate management actions. The most important advantage of the RSSDAM software system is that it preserves the benefits of an easy to design and use Windows-based Graphical User Interface (GUI), quick programming in MATLAB and fast execution of C and C++ compiled code as appropriate for data mining purposes. Support for the remote control of a centralized RSSDAM software system through LAN and World Wide Web is useful when data are located at a distant location (e.g. a farm in precision agriculture), while a distributed RSSDAMSS allows knowledge integration from data located at multiple sites. The RSSDAM software system provides an open interface that is easily extendible to include additional data mining algorithms. Hence, more data mining functionalities will be incrementally added into the system according to our research and development plan. Furthermore, more advanced distributed aspects of the RSSDAM software system will be fbrther developed. Namely, simultaneous multi-user connections and real time knowledge exchange among learning models in a distributed system are some of our important coming tasks. Our current experimental and development work on the RSSDAMSS addresses the enhancement of the power and efficiency of the data mining algorithms on very large databases, the discovery of more sophisticated algorithms €or remotely sensed spatial data management, and the development of effective and efficient learning algorithms for distributed environments. Nevertheless, we believe that the preliminary RSSDAM software system described in this article could already be of use to users ranging from data mining professionals and spatial data experts to students in both fields. References 1. Wayne Carr, P.E., Milsoft Integrated solutions, Inc. ‘‘ Interfacing Engineering Analysis With Automated Mapping”, Paper No. 94 A4, 1994 Conference IEEE. 2. Kohavi, R., Sommerfield D., Dougherty J., “Data Mining using MLC++, a Machine Learning Library in C++”, International Journal of Artificial Intelligence Tools, Vol. 6 , NO. 4, pp. 537-566, 1997. 3. Khabaza, T., Shearer, C., ”Data Mining with Clementine”, IE Colloquium on Knowledge Discovery in Databases”, Digest No. 1995/021(B), pp. 1/1-1/5. London, IEE, 1995.



 422 4. Quinlan, J. R., “Induction of decision trees,” MachineLearning, Vol. I., pp. 81-106, 1986. 5. Werbos, P., Beyond Regression: New Tools for Predicting and Analysis in the Behavioral Sciences, Harvard University, Ph.D. Thesis, 1974. Reprinted by Wiley and Sons, 1995. 6. Kohonen, T, “The self-organizing map,” Proceedings of the IEEE, 78, pp. 14641480, 1990. 7. Han, J., Koperski, K., Stefanovic, N., “GeoMiner: A System Prototype for Spatial Data Mining”, Proc 1997 ACM-SIGMOD Int’l Con$ on Management of Data(SIGMOD’97), Tucson, Arizona, May 1997. 8. Han, J., Chiang, J., Chee, S., Chen, J., Chen, Q., Cheng, S., Gong, W., Kamber, M., Koperski, K., Liu, G., Lu, Y., Stefanovic, N., Winstone, L., Xia, B., Zaiane, 0. R., Zhang, S., Zhu, H., “DBMiner: A System for Data Mining in Relational Databases and Data Warehouses”, Proc. CASCON‘97: Meeting of Minds, Toronto, Canada, November 1997. 9. Stolfo, S.J., Prodromidis, A.L., Tselepis, S., Lee, W., Fan, D., Chan, P.K., “JAM: Java Agents for Metalearning over Distributed Databases,” Proc. KDD-97and AAAI97 Work. on AI Methods in Fraud and Risk Management, 1997. 10. MATLAB, The Language of Technical Computing, The MathWorks Inc., January 1998. 11. MIDEVA, MATCOM & Visual Matcom, User’s Guide, MathTools Ltd., October 1998. 12. Richardson, T., Stafford-Fraser, Q., Wood, K. R., Hopper, A., “Virtual Network Computing“, IEEE Internet Computing, V01.2 No.1, pp33-38, JanFeb1998. 13. Fan W., Stolfo S. and Zhang J. “The Application of AdaBoost for Distributed, Scalable and On-line Learning”, Proceedings of the Fifth ACM SIGKDD International Conference on Knowledge Discoveiy and Data Mining, San Diego, California, August 1999. 14. Whei-Min Lin, Ming-Tong Tsay and Su-Wei-Wu, “ Application of Geographic Information System to Distribution Information Support”, IEEE Transaction on Power system, Vol. 11, No. 1, Feb 1993.



 DETECTION AND CORRECTION OF VERTICAL SKEW IN CHARACTERS T.VASUDEV G.HEMANTHKUMAR P.NAGAE%HUSHAN DoS in Computer Science, University of Mysore Manasagangotri, Mysore, Karnataka, India banglivasu@yahoo. com ABSTRACT In this paper, a method for detecting the vertical skew and correcting the skew for characters, specifically when subjected for recognition is presented. Character skew correction is an essential preprocessing event required during document understanding. If the skew in characters are not corrected then the readability of OCRs gets reduced. The method consists of three stages. In the first stage, the direction of skew is identified in character at a macro level with a knowledgebase. In the second stage the amount or angle of skew to its base line is detected with the support of line drawing algorithm. In the final stage, the detected skew in the character is corrected by transformation process. The method has been tested with various printed and handwritten samples and readability analysis is performed with respect to an OCR. Key words: character recognition, skew detection and correction, knowledgebase, line drawing algorithm, transformation process.



1. Introduction



Character recognition is still an open problem for research in the area of image processing [2,7,9,10]. Character recognition is an essential stage in document image analysis (DIA) for the purpose of automatic data reading or extraction from document image [8,9,10,11]. Though hundreds of algorithms are reported in literature for printed and handwritten character recognition, majority of the robust methods claim maximum recognition rate up to 97% [2,3,7]. The remaining percentage of failure is due to the variability found in handwritten characters. Hence, there is much scope to reduce the level of variability found in handwritten characters in order to increase the recognition rate. The level of variability is reduced by some extent through preprocessing techniques [ 1,7]. Preprocessing involves tasks like removal of noise, thinning, filling disconnection, skew correction, separating connected characters etc [ 1,2,3,4]. It is 423



 424



experimented that in many of the methods, appropriate preprocessing on character yields better recognition rates. The character which is segmented from a document image, before it is recognized undergoes a series of different preprocessing viz., noise removal, skew correction, joining disconnections, normalization and thinning [ 1,2,3]. Skew is the orientation introduced into the character while writing. It is an inherent characteristic of many authors to write the characters with some orientation (skew), either to the left or right to the base line. Such skewed characters develop bottlenecks to the investigation of generic methods for recognition. Literature reveals that most of the recognition algorithms are developed assuming the characters are skew corrected. When skewed characters are subjected for the recognition in such algorithms, the recognition rates are found relatively low i,e., larger the skew, lower the recognition rate[2,9,10]. Hence it is required to detect the skew in characters and has to be corrected before it is subjected for recognition in order to have better efficiency. Considerable amount of work is reported in literature on skew detection and correction of document image [ 13,14,15]. Document skew detection algorithms cannot be extended to detect the skew in handwritten characters since the features considered for skew detection at document level are different from the features considered for skew detection at character level. Generally, global features of the document like finding line orientations using Hough transformations[ 131, slope between nearest-neighbor chain(NNC) obtained in the documents[ 141, horizontal projection profile of the document and its Wigner-Ville distribution for documents[ 151 etc, are considered in case of document skew detection. Where as local features are to be considered for detecting a skew in case of characters. Very few attempts have been towards detecting skew at character level, which is evident from literature. This motivated us to make an attempt to detect skew and correct the characters before recognition.



Figure(1): Samples of skewed characters



Skew in character is found in most of the languages both in printed and handwritten documents. Few samples of skewed characters are shown in figure( 1). Such skewed characters definitely reduce the efficiency of recognition



 425



algorithms[2,9,10] and essentially the skew is to be corrected before recognition in order to have higher efficiency in recognition algorithm. In this paper, we have made an attempt to develop a three stages approach for skew detection and correction of printed and handwritten characters subjected for recognition which is segmented from the document image and free from noise. It is required to know the orientation of characters viz., left-oriented or rightoriented to take further necessary action. To make a macro level decision on the direction of skew, a knowledge base is to be developed which assists to find the direction of skew and same is explained in section 2. Once the direction of skew is detected, next the angle of skew is required to be found. In order to find the angle of skew in the character a line drawing algorithm is adopted, which is explained in section 3. Skew is corrected by a transformation process, in which the character is repositioned, is explained in section 4. Algorithm of the implementation is given in section 5. Experimental results and failure analysis is explained in section 6. Finally the work is concluded in section 7. 2. Knowledgebase to Determine the Direction of Skew



Knowledgebase is a repository of derived information[6]. An inference engine is used for deriving such knowledge on direction of skew. In the proposed method, small knowledgebase is built about the comer areas of the rectangular boundary enclosing the character. The comer areas are heuristically fixed by the triangle formed with sides equal to 1/5'h of width and l h t hof height respectively at each comers of the rectangle as shown in figure(2). The knowledgebase is developed using the heuristics on non skewed structure of the character. The developed knowledgebase directs to make a macro level classification whether the character is left skewed or right skewed or not skewed from the base line. 1/5$ of v(dth \



Figure(2): Corner areas heuristically fixed in the boundary enclosing character



 426



Important inferences that can be derived from the handwritten characters in a document: i.



Absence of a part of character in all the four comer areas indicate the character is not skewed ii. Presence of a part of character in all the four comer areas indicate the character is not skewed iii. Presence of a part of character in any of the three comer areas and absence of a part of character in one comer area indicate the character is not skewed iv. Presence of a part of character in left top and right bottom comer areas and absence of a part of character in left bottom and right top comer areas indicate the character is skewed to left from base line v. Presence of a part of character in left bottom and right top comer areas and absence of a part of character in left top and right bottom comer areas indicate the character is skewed to right from base line With the assistance of derived inferences from the knowledgebase, a macro level classification of skew direction is made and then the character is subjected to detection of angle of skew. The angle of skew in character is determined by drawing a suitable line that discriminates the character on one side of the line and remaining area on the other side of the line.



3. Detection of Angle of Skew A digital differential analyzer (DDA) line drawing algorithm[ 121 draws a line between any two specified points. An imaginary line between two specified points with in the boundary enclosing the character helps in further processing. A series of lines are drawn based on the direction of skew as given by the algorithm in section 5. Two of such lines drawn on either side of the character act as discriminators to classify the rectangular area enclosing the character into two sub areas, one area consisting the character and the other area with out any part of character. Figure(3) shows the two lines on either side of the character drawn that act as discriminators. These two lines indicate the angle of skew in character on either side. Majority of the skewed characters show different angles on both sides. Skew correction should be made towards one of the lines that give better result. One of the two discriminating lines is decided as best fit line depending on the angle formed by the line with the boundary and that gives better correction. The character is repositioned based on the best fit line selected through transformation process.



 427



Figure(3): Discriminating Lines for characters



4. Transformation Process



A best fit line is one of the two discriminating lines which results better skew correction. It also indicates the direction and amount of shift to be made to the character in order to reposition the character. Mathematically [4], transformation process can be represented as, d(XY) = Tk(XJY)I



where g is the original character image in spatial coordinates g’ is the skew corrected image in the same coordinate system and T is the point processing transformation function for skew correction and is defined as



x ’= (x-d) for skew correction towards left x ’= (x+ d) for skew correction towards right



‘d’ is the distance between a point on discriminating line and a point on boundary line towards the discriminating line with same ‘y’ ordinate. This transformation process will result in skew corrected character. The result of transformation process towards left direction and right direction is shown in figure(4).



Original Image



Transformation to left



Transformation to right



Figure(4): Result of transformation process



5. Methodology



The method used in skew detection and correction for characters is presented here as an algorithm.



 428



1. Identify the direction of skew with the help of knowledgebase 2. If the character is not skewed then exit 3. If the character is left skewed then a. identify the nearest black pixel (x17yl)to the left boundary line b. identify the nearest black pixel (x2,y2) on the bottom line to the left boundary line C. draw a line L1 between the above points (xl,yl) and (x2,y2) d. repeat until the area below the line L1 is with out any black pixel (i) x 2 = x 2 -1 (ii) draw line LI between points (xl,yl) and (x2,y2) e. identify the nearest black pixel (x3,y3)to the right boundary line f. identify the nearest black pixel (xq,y4) on the top line to the right boundary line g. draw a line L2 between the points (x3,y3) and (x4,y4) h. repeat until the area above line L2 is with out any black pixels (i) x 4 = x 4 + 1 (ii) draw line L2 between (x3,y3)and (x4,y4) 1. find the angle TI formed between line LI and bottom line j. find the angle T2 formed by the line L2 and top line k. If TI > T2then select line LI and perform left-shift and exit 1. If T2> TI then select line L2 and perform right-shift and exit 4. If the character is right skewed then a. identify the nearest black pixel(xl,yl)to the left boundary line b. identify the nearest black pixel (x2,y2)on the top line to the left boundary line C. draw a line L1 between the above points (xl,yl) and (x2,y2) d. repeat until the area above the line LI is with out any black pixel (i) x2=x2- 1 (ii) draw line L1 between points (xl,yl) and (x2,y2) e. identify the nearest black pixel (x3,y3)to the right boundary line f. identify the nearest black pixel (x4,y4) on the bottom line to the right boundary line g. draw a line L2 between the points (x3,y3)and (x4,y4) h. repeat until the area below line L2 is with out any black pixels (i) xq=x4+ 1 (ii) draw line L2 between (x3,y3)and (xq,y4) 1. find the angle T1 formed between line L1 and top line j. find the angle T2 formed by the line L2 and bottom line



 429



k. If TI > T2 then select line LI and perform Left-shift and exit 1. If T2 > T I then select line L2 and perform Right-shift and exit Left-shift: 1. Repeat for each point of the best fit line a. shift-dist = distance of point from the left boundary line b. move all the black pixels to the right of the point on the same row to left by an amount shift-dist Right-shift: 1. Repeat for each point of the best fit line a. shift-dist = distance of point from the right boundary line b. move all the black pixels to the left of the point on the same row to right by an amount shift-dist Result of the implemented algorithm on various input cases and analysis of result are discussed in the next section. 6. Experimental Results The developed method is tested on more than 300 printed and handwritten characters with different shapes thicknesses and sizes. Some interesting samples of the input and result of the experiment is tabulated in table(1). The method produces satisfactory results for all the input characters having skew between 82 - 50 degrees to base line i.e., 8 - 40 degrees to vertical line. The characters with smaller skew (1 - 8 degrees) are not detected in the method because of the limitations in the developed knowledgebase and characters with such small skew do not affect much on recognition. Method fails in case of characters with skew beyond 40 degrees in correction process as the height of the character is considerably reduced. It is noticed that usually characters with skew more than 40 degrees are not printed or written. Hence the method can detect and correct the usual situation of skewed characters. Experimental results reveal that the skew correction towards the direction that has larger angle gives better results in most of the cases. This can be noticed from the table(1). It is also observed that some amount of distortion is introduced in the skew corrected characters because of pixel movements depends on the points used for line drawing. This is due to the inherent limitation of any line drawing algorithm. These distortions introduced into characters do not have much impact on recognition algorithms. We have focused only on vertical skew in the characters. Internal skew in some characters like ‘T’, ‘J’, ‘7’, ‘L’ etc., are not detected due to the presence of horizontal line in the



 430 Table(1): Sample results of the experiment



5 3 '.b



~



I.....



B ! ...



5



3 R ? c



Y



71.07



66.04



let



69.01



7158



,i=hr



64.06



61.07



tighr



75%



7429



let



70.01



67.01



SlZbr



55.78



5- .5-



I&



I



9 8 B B 5



5



3



3



A



A



e



f?



-



........



z 3



6a25



2



6-



Idt



60.94



69.62



let



34.86



w.6l



,i..br



7 3.93



61.61



7



7



3



3



T)



1



6



6



Table(2): Sample results of failure cases Oiignal Characrer



F h g box& Lines



[A . .:



:



L J



:



1i.



..................



B



Reason



not daecred



faillwein Lmmr-Led~bare



not daerred



presence ofholimnmlstroke



nor dEmred



presence of hoiimnral smoke



not damred



s k m less rhan 8 d q = e s



. .:



..........



L .



Diiwtion of s k demred



8



j



 431



character which restricts the method. The result for such skewed character is shown in table(2). The method is tested for characters of varied thicknesses. The method is tested for characters of different languages like Kannada and Hindi. Some of the failure cases are shown in table(2). The failures are mainly due to smaller skew and presence of horizontal lines in characters. Majority of the OCRs require the characters to be fkee from skew. The efficiency of the OCRs is reduced as the skew in characters are increased. However the readability will be increased for the skew corrected text. As an experimental study, the comparison of readability before skew correction and after skew correction using our method is shown in table(3) with respect to an English OCR "Readiris Pro 9".Input for the experiment is lines of English text with skew and skew corrected text. The skew correction of text involves segmentation of each character fiom the input and each character is subjected to skew correction on left side using our approach. Skew corrected characters are then concatenated and then the concatenated text is subjected to recognition by OCR.



Text wlth skew



Skew comcted text



Recognition by OCR



Recognition by OCR



I



Readabihy



Readability



YANDLEWITHCRRE



IP..NDLRIVITHCt-.RE.



XI%



SOURCEOFRlCHNESS



-0U RtEOFRlCHNESS



11Yh



INTERIORS



IMERIORS



188%



r/ENLONSYSTEMS



VENLONSYSTEMS



wm



SPORTSCLlrB



SpoRcTCLU9



iR?4



I



 432 Table(4): Analysis of readability before and after skew correction by OCR Input Text Before skew correction After skew correction



Readability range 20%-67% 70%- 100%



I I



I Average readability 55% 85%



To perform the analysis of the result on skew correction of our approach, about 100 lines of printed text with different character size, skew, style are considered. Few instances from worst case to best case are shown in the table(3). Analysis of variation in readability before and after skew correction is summarized and shown in table(4). It is evident from the table(4) that the average readability of considered OCR is around 55% when the input text is with skewed characters. Where as when the same text is skew corrected and then subjected to recognition by OCR, shows an average readability around 85%. The implemented approach enhances the readability of OCR by another 30%. The main reason for the approach to show only an average of 85% readability is due to the limitations in the skew detection and correction process. Limitations in the process are (i) small skews in the characters are ignored, (ii) knowledgebase is insufficient for classification of all possible cases, (iii) horizontal skew in character is not detected, (iv) internal skew with in character is ignored due to horizontal lines. These limitations restrict the method in resulting lower rate of skew detection and correction and consequently the readability of the OCR is enhanced proportionately. 7. Conclusion The method is developed without involving any complex mathematical model and adopts simple point processing technique in spatial domain and the computational complexity is only O(n2). For any image processing application time complexity of O(n2) is relatively considered to be less expensive. However there is some computational expense is involved in repeated line drawing process. The approach detects and corrects the skew in characters to enhance the average readability of the OCR from 55% to 85%. The developed method detects and corrects skew in printed and handwritten characters of various size and thickness which are in the 8 - 40 degree range from vertical line. The skew beyond this range is not responded properly by the method due to limitations in the knowledgebase. Transformation process introduces some distortions due to the limitations in the line drawing algorithm. In addition, the transformation process also reduces the actual height of the character to the vertical height of the boundary enclosing the character. The



 433



developed algorithm is ideal for detection and correction of skew in characters that are subjected for recognition. The developed method is very much sensitive to noise. The approach considerably increases the readability of an OCR. Readability analysis is required on characters of other languages to study the behavior of the method on the characters of the languages. There is more scope to improve the knowledgebase to detect skew beyond 8 - 40 degree, reduce distortions in skew corrected characters, detect and correct horizontal skew in characters, develop method for rotation based correction instead of pixel shift correction. The method can be further enhanced to detect variable skew with in a character and then correct the same. An investigation is under process to derive a transformation function to retain the height of the character after skew correction. References [l] Gorman L, Kasturi R, “Document Image analysis”, IEEE Computer Society Publication, 1998. [2] B.N.Chatterji, “Feature Extraction Methods for Character Recognition”, in proc. National Pre-Conference Workshop on Document Processing, Mandya, India, pp 7-20, July 200 1. [3] Nagabhushan P, “Document Image Processing”, Proc. National Pre-Conf. Workshop on Document Processing, Mandya, India, 2001, pp 114. [4] Rafael C Gonzales & Richard E Woods, “Digital Image Processing”, Addison Wesley Publication. [5] Mura1i.S & Nagabhushan.P, “Recognition of Hand written English Characters Using Tangent Feature Values: novel appraoch ”, National Conf. On Systems Engineering,Thiruvannamalai,Tamilnadu,2001, pp 1 14. [6] Rich & Knight, “Artificial Intelligence ”, TMH, 1991. [7] Rejean Plamondon, Srihari N S, “On-line and Off-line handwriting recognition: a comprehensive survey”, Jrnl. PAMI, Vol. 22, No. 1 , pp. 63-84,2000 181 Stephan Bauman et al., “Message extraction from printed documents”. Proc. ICDAR, 1997. [9] Suen C Y, Koerich A L, Sabourine R, “Large vocabulary off-line handwriting recognition: A survey”, Jrnl. Pattern Analysis Application, Vol. 6, pp. 97-121, 2003. [ 101 Thomas Bayer et al., “Information extraction from paper documents”, Proc. Hand book of character recognition and image analysis, pp 653-677. [I 11 Yaun Y T & Liu .I,“Information acquisition and storage of forms in document processing”, Proc. ICDAR, 1997. [I 21 Donald Hearn, M.P.Baker, “Computer Graphics”, Pearson Education, Znd Edition, 2003 1131 A. Amin and S. Fischer “A Document Skew Detection Method Using the Hough Transform”, Journal of Pattern Analysis and Applications, vol 3, pp 243-253, 2000.



 434 [14] Yue Lu and Chew Lim Tan, “A nearest-neighbor chain based approach to skew estimation in document images”, pattern Recognition letters, vol 24, pp 23 15-2323, 2003. [15] E. Kavallieratou, N Fakotakis and G Kokkinakis, “Skew angle estimation for printed and handwritten documents using the Wigner-Ville distribution”, Image and Vision Computing, vol20, pp 813-824,2002.



 EVOLUTIONARY METAHEURISTIC APPROACHES FOR THE VEHICLE ROUTING PROBLEM WITH TIME WINDOWS SASWATI TRIPATHI, BHAWNA MINOCHA Amity School of Computer Sciences, Noida, India stripathi@ascs. amity.edu,[email protected] The Vehicle Routing Problem with Time Windows (VRPTW) is an important problem in logistics. This paper reviews the research on metaheuristic algorithms comprising Tabu Search and Evolutionary Algorithms for VRPTW. The main types of evolutionary algorithms for the VRPTW are Genetic Algorithms and Evolutionary Strategies which may also be described as Evolutionary metaheuristics to distinguish them from other metaheuristics. Along with these evolutionary metaheuristics, this paper reviews heuristic search methods that hybridize ideas of evolutionary computation with some other search technique, such as tabu search, guided local search, hierarchal tournament selection. In addition to the basic features of each method, experimental results for the 56 benchmark problem with 100 customers of Solomon (1987) are presented and analyzed.



1. Introduction Vehicle Routing Problem with Time Windows (VRPTW) is one of the most renowned problems in contemporary operations research. VRPTW is the generalization of the VRP where the service at each customer must start within an associated time windows and the vehicle must remain at the customer location during service. Soft time windows can be violated at a cost, while hard time windows do not allow for a vehicle to arrive at a customer after the latest time to begin service. If it arrives before the customer is ready to begin service, it waits. It can be described as follows. Let G= (V, E) be a connected directed graph with node set V = V N U { Vo } and arc set E, where VN= { vie V I i= 1 , 2 , . ...,n} stands for customers, each of which can be serviced only within a specified time interval and Vo stands for the central depot, where all routes start and end. Each node vi e V has an associated demand qi that can be a delivery form or a pickup for the depot and service time si with service window [ei,li]. The set E of arcs with non negative weights represents the travel distances di, between every two distinct nodes vi and v, and the corresponding travel time ti,. If the vehicle reaches the customer vi before the ei, a waiting time occurs. The routes schedule 435



 436



time is the sum of the travel time, waiting time and the service time. The primary objective of VRPTW is to find the minimum numbers of tours, without violating vehicle’s capacity constraints Q and the customer’s time windows. The tours correspond to feasible routes starting and ending at the depot. A secondary objective is often to minimize the total distance traveled or to minimize total schedule time. VRPTW is NP-hard. Even finding a feasible solution to the VRPTW with a fixed vehicle size is itself an NP-complete problem. Thus techniques like exact optimization and heuristics approaches are used for solving VRPTW. Main focus in early surveys of solution techniques for the VRPTW, Fisher [ 161, Toth et al. [39], are on exact optimization techniques. Because of the complex nature of VRPTW and its wide applicability to real life situations, solution techniques like heuristic which are capable of producing high quality solutions in limited time are of prime importance. But, over the last few years Metaheursitics are the core of recent work on approximation methods for the VRPTW. Metaheuristics such as tabu search (Potvin et al. [31]), ant algorithms (Gambardella [ 17]), simulated annealing (Thangiah et al. [37]) and evolutionary algorithms (Homberger et al. [20]) are used as solution techniques for VRPTW. In this paper we surveyed and compared two metaheursitics, tabu search and evolutionary algorithms, for VRPTW. Tabu search algorithms for the VRPTW are discussed in section 2. Section 3 divides evolutionary algorithms developed for VRPTW into genetic algorithms and evolutionary strategies. Section 4, discusses hybridize ideas of evolutionary computation with some other search technique, such as tabu search. In section 5, we described computational results for the some of the described metaheuristics and Section 6 concludes the paper. 2. Tabu Search Algorithms



Tabu search uses short and long term memory to avoid cycling and to orient the search towards unexplored regions of the solution space. The initial solution is usually created with some cheapest heuristics. After creating an initial solution, an attempt is made to improve it using local search with one or more neighborhood structures and best-accept strategy. This neighborhood is created through an exchange procedure, called CROSS exchange (Taillard et al. [35]) that swaps sequences of consecutive customers between two routes. This operator generalizes both the 2-opt*(Potvin et al. [29]) and Or-opt exchanges (Or [27]), but it is a special case of the h-interchanges (Osman [28]), since it restricts the subsets of customers chosen in each route to be consecutive. Over last few years, several efficient tabu search approaches have been proposed.



 437



Garcia et al. [ 181 proposed the first application of tabu search for VRPTW. Their method uses Soloman’s I1 insertion heuristics to create an initial solution and 2-0pt* and Or-opt exchanges for improvement. Taillard et al. [35] described a metahuristic based on tabu search for the VRP with soft time windows. Lim et al. [24] propose an improved Greedy Randomized Adaptive Search Procedure (GRASP) framework by techniques including multiple initialization and solution reuse. A new technique of smoothed dynamic Tabu search is embedded into the GRASP to improve the performance. Cordeau et al. [14] introduce a refinement to a previously proposed Tabu search algorithm for vehicle routing problems with time windows. This refinement yields new best known solutions on a set of benchmark instances of the multi-depot, the periodic and the site-dependent vehicle routing problems with time windows. The main features of these tabu search heuristics are summarized in Table 1. Table 1. Features of Tabu Search for VRPTW Authors



Initial solution



Neighborhood operators



Garcia etal. [ 19941



Solomon’s I1 heuristic



2-opt*, Or-opt



Taillard et al. ~9951



Solomon’s I1 heuristic



CROSS



Potvin et al. ri996i



Solomon’s I1 heuristic



2-opt*, Or-opt



Chiang et al. [1997]



Modification of Russell



1-interchanges



Brand50 et al. [ 19991



Insertion heuristic



Relocate, exchange, GENI



I



I



Route min.



Yes



I



I



INo I



I



Yes



I



INo I No



Features



Neighborhood restricted to arcs close in distance Soft time window, adaptive memory Neighborhood restricted to arcs close in distance Reactive tabu search Neighborhood restricted to arcs close in distance



3. Evolutionary Algorithms (EA) EA is characterized by maintaining asset of solution candidates that undergoes a selection process and is manipulated by genetic operators. It has definitely been among the most suitable approaches for tackling the VRPTW. They are divided into three main subclasses: genetic algorithms, evolution strategies and evolutionary programming.



 438



3.1. Genetic Algorithms A genetic algorithm is a randomized search technique operating on a population of individuals (solutions). The search is guided by the fitness value of each individual. The creation of new generation primarily consists of four phases: Representation, Selection, Recombination and Mutation. Thangiah et al. [38] were the first to apply GA to VRPTW and uses bit string representation in vehicle routing context. He describes a method called GIDEON that assigns customers to vehicles by partitioning the customers into sectors by genetic algorithms and customers within each formed sector are routed using cheapest insertion method. Potvin et al. [30] propose a genetic algorithm GENEROUS that directly applies genetic operators to solutions, thus avoiding the coding issues. The main features of the genetic algorithms proposed for the VRPTW are summarized in Table 2. Table 2. Features of Genetic Algorithms for VRPTW Fitness



-119951Potvin et al.



Insertion heuristics I



I



Recombination/



Mutation



Random change of bit values I



I



No. of vehicles Reinsertion of a route + Total route or route segment into time another parent



Combinations of relocate operator to eliminate routes and Or-opt



No. of vehicles Reinsertion with + Total route modified Solomon’s time heuristics



Relocate to reduce the number of routes and nearest neighbor for within - route ordering



Braysy [1999]



Random Clustering + No. of vehicles Reinsertion with modified Solomon’s Solomon’s insertion + Total route time + waiting heuristics time



Relocate to reduce the number of routes



Tan et al. [2001]



Solomon’s insertion, Not defined h-interchanges, random



Jung et al. [2002]



+ random



Solomon’s insertion



[ 19961



Berger et al. [I9981



I Nearest neighbor



Solomon’s insertion



Total route time



PMX



Random swap of nodes



Selecting arcs based on Random splitting of routes. Or-opt, 2D image of a relocation, 2-opt* solution and nearest neighbor rule selected solution



 439



3.2. Evolutionary Strategies (ES) ES manipulate population of individuals, which represent solutions of an optimization problem. Due to an integrated selection mechanism the iterative calculation of a sequence of population favors the generation of better solutions. Differences to GA exist with regard to the representation of problem and the search operators. ES dispense with the encoding of individuals and instead simulate the evolution process directly on the level of problem solutions. In contrast to GA, mutation operators are given a superior role in comparison to the recombination operators. Homberger et ul. [20] propose two evolutionary strategies for the VRPTW. The individuals of the starting population are generated by means of a stochastic approach that is based on the savings algorithm of Clark et aZ.[12]. Selection of parents is done randomly and only one offspring is created through the recombination of pair of parents. Thus, a number of 2 > p offspring is created, where p is the population size. At the end, fitness values are used to select p offspring for the next generation. The first out of the two proposed metaheursitics evolutionary strategy ES 1 skips the recombination phase. The second strategy, ES2, uses the uniform order-based crossover to modify the initially randomly created mutation codes. Mester [25] proposes that in the beginning, all customers are served by separate routes. Then a set of six initial solutions is created using cheapest reinsertions of single customers with varying insertion criteria. The best initial solution obtained is used as a starting point for the ES. The main features of the evolution strategies proposed for the VRPTW are summarized in Table 3. Table 3. Features of Evolutionary Strategies for VRPTW Author



Initial Population



Fitness



Recombination/ Crossover



Mutation



Homberger



Stochastic saving



No. of vehicles + Total route time + elimination of shortest route



Uniform orderbased to create sequence for controlling Or-opt



Or-opt, 2-opt* and h -interchanges, Or-opt for route elimination



Not defined



Not defined



Or-opt, 2-opt* and h -interchanges, GENIUS, modified



et al. [ 19991 heuristic



Mester



Cheapest insertion with varying criteria



 440



4. Hybrids



In this section we review heuristic search methods that hybridize ideas of evolutionary computation with some other search technique, such as tabu search, guided local search and hierarchical tournament selection. Wee Kit et al. [40] describe a hybrid genetic algorithm, where a simple tabu search based on cross, exchange, relocate and 2-opt neighborhoods, is applied on individual solutions in the later generations to intensify the search. The GA is based on random selection of parent solutions and two crossover operators. The first operator tries to modify the order of the customers in the first parent by trying to create consecutive pairs of customers according to the second parent. The second crossover operator tries to copy common characteristics of parent solutions to offspring by modifying the seed selection procedure and cost function of an insertion heuristic Alvarenga et al. [ 11 proposes a three phase approach. Initially, a hierarchical tournament selection genetic algorithm is applied. After then, the two phase approach, the genetic and set partitioning, is applied to minimize the travel distance. The stochastic PFIH ( Push Forward Insertion Heuristic - Solomon [34]) is used to generate the initial population. Nine fitness criteria’s are defined to permit the identification and to eliminate one more route or a customer in the shortest route.. The main features of the hybrid algorithms proposed for the VRPTW are summarized in Table 4. Table 4. Features of Hybrids for VRPTW Author



Phase - I



Braysy et al. Random Insertion heuristic. Modification of LNS, reinsertion with [2000] modified Solomon’s heuristic.



Wee Kit et al. [2001]



Mester et al. [2003]



’



Phase - I1



Modified LNS, Modified CROSS exchanges, Or-opt, insertion heuristic, relocate.



Random selection, Reordering by first Use tabu search based on CROSS exchange, relocate and 2-opt* crossover operator, Modification of neighborhoods. Solomon’s insertion heuristic by second crossover operator. Cheapest insertion heuristics, Guided ’ Modified Evolution Strategies local Local Search, relocate, I-interchange, search algorithm of Mester [25]. Filling procedure of Bent et al. [2 ] is used. 2-opt* neighborhood



i



5. Analysis of Results



In this section we compared and analyzed the above described metaheuristics algorithms, using the results obtained for Solomon’s [34] well known 56



 441 Table 5. Comparison of Metaheuristic Algorithms



Gambardellaetal. [ 19991



12.00 2.73 1217.73 967.57



10.00 828.38



3.00 589.86



11.63 1382.42



Taillard et al. [I9971 Brand80 et al. [ 19991



12.17 1209.35 12.58 1205 12.08 1210.14



10.00 828.38 10.00 829



3.00 589.86 3.00 591 3.00 589.86



11.50 3.38 1389.22 1 1 17.44 12.13 3.50 1371 1250 11.50 3.25 1389.78 1134.52



Cordeau [2001]



Genetic Algorithms 112.75 Thangiah 1300.25 [I9951 Potvin et al. 12.58 1296.83 [I9961 Berger et al. 12.58 [I9981 1261.58 Brayysy 12.58 [ 19991 1272.34 Tan et al. 13.17 [200 I] 1227 Jung et al. 13.25 [2002] 1179.95 Evolutionary Strategies Homberger etal. 11.92



I



[2001] [2003] [2005]



2.82 1016.58 3.18 995 2.73 969.57



I1224



410 57523 425 58562 407 57556



13.18 1124.28 3.00 1117.64 3.09 1030.01 3.09 1053.65 5.00 980 5.36 878.41



IlO.00 13.00 112.50 892.11 749.13 1474.13 12.13 10.00 3.00 838.11 590.00 1446.25 10.00 3.00 12.13 834.61 594.25 1441.35 10.00 3.00 12.13 857.64 624.31 1417.05 10.11 3.25 13.50 861 619 1427 10.00 3.00 13.00 828.38 589.86 1343.64



13.38 I429 1411.13 65074 3.38 422 1368.13 62634 3.50 424 1284.25 60539 3.38 423 1256.80 60962 5.00 478 1123 58605 6.25 486 1004.21 54779



12.73



I 10.00



13.25



I 1203.32 1951.17 1221.06



10.00 828.38



3.25 407 1129.19 57525



13.00



1833.32 1593.00



I 11.63



I406



I 1382.06



1132.79 57265



957.43



828.48



589.93



1389.89



1159.37 57952



1012



828.4



590.9



1417



1195



58912



 442



benchmark problems. These problems have hundred customers, a central depot, capacity constraints, time windows on the time of delivery, and a total route time constraint. The C1 and C2 classes have customers located in clusters and in the R1 and R2 classes the customers are at random positions. The RCl and RC2 classes contain a mix of both random and clustered customers. Each class contains between 8 to 12 individual problem instances and all problems in any case have the same customer locations and the same vehicle capacities, only time window differs. In the term of time window density, the problems have 25%, 50%, 75% and 100% time windows. C1, R1, RCl problems have a short scheduling horizon, and require 9 to 19 vehicles. Short horizon problems have vehicles that have small capacities and short route times, and cannot service many customers at one time. Classes C2, R2 and RC2 are more representative of “long-haul” delivery with longer scheduling horizons and fewer 2-4 vehicles. Both travel time and distance are given by the Euclidean distance between points. The CNV/CTD indicates the cumulative number of vehicles (CNV) and cumulative total distance (CTD). In Table 5, we list computational results for the methods for which computing times have been reported in a systematic fashion. 6. Conclusion The VRP and VRPTW, belonging to the class of the NP-hard combinatorial optimization problems, require heuristic solution strategies for most real life instances. In this paper we have surveyed the various metaheuristic VRPTW methodologies. Evolutionary Strategies of Homberger et al. [ 19991 and Mester [2002] , hybridization of evolution strategies with other search techniques by Gehring et al. [2001] , Homberger et al. [2001], Berger et al. [2003] and Alvarenga [2005] seem to achieve the best performance.. Homberger et al. [1999] performs best for R1 and RC1, while Mester [2002] performs best in R2 and RC2. For C1 and C2, almost all papers report optimal solution. The quality of the solution obtained with these techniques is usually much better compared to traditional construction heuristics and local search algorithms. To summarize, it seems that it is important to include special methods for route reduction, combine several different search methods, employ some improvement heuristics, and create a high quality initial population to achieve the best robustness. References [l] Alvarenga, G.B., Silva, R.M.A., Sampaio, R.M.,”A hybrid algorithm for the Vehicle Routing Problem with Time Window”, accepted paper, 2005.



 443 [2] Bent, R., Hentenryck, P., “A Two-stage Hybrid local search for the Vehicle Routing Problem with Time Windows”, to appear in Transportation Sciences, 2002 [3] Berger, J., Salois, M., Begin, R., “A Hybrid Genetic Algorithm for the Vehicle Routing Problem with Time Windows”, In Proceedings of the 12‘h Bienneal Conference of the Canadian Society for Computational Studies of Intelligence, 114127, Springer-Verlag, Berlin 1998 [4] Berger, J., Barkaoui, M., Braysy, O., ”A Route-directed Hybrid Genetic Approach for the Vehicle Routing Problem with Time Windows”, Information Systems and Operation Research 41, 179-194,2003. [5] Brandgo, J.,” Meta Heuristic for the Vehicle Routing Problem with Time Windows”. In Meta Heuristics: Advances and Trends in Local Search Paradigms for Optimization, 19-36, Kluwer, Boston, MA, 1999. [6] Br’dysy, O., “A Hybrid Genetic Algorithm for the Vehicle Routing Problem with Time Windows”, Unpublished, Vaasa University, Vaasa, Finland, 1999. [7] Braysy, O., “A New Algorithm for the Vehicle Routing Problem with Time Windows based on Hybridization of a Genetic Algorithm and Route Construction Heuristics”, Proceedings of the University of Vaasa, Research Papers 227, Vaasa, Finland, 1999. [S] Braysy, O., “Genetic Algorithms for the Vehicle Routing Problem with Time Windows”, Arpakannus1/2001~ Special issue on Bioinformatics and Genetic Algorithms, :33-38, 200 1 [9] Braysy, O., Berger, J., Barkaoui, M., “A New Hybrid Evolutionary Algorithm for the Vehicle Routing Problem with Time Windows”, Presented at the Route 2000workshop, Skodsborg, Denmark, August 2000. [lo] Carlton, W.B., “A Tabu Search approach to the General Vehicle Routing Problem”, Ph.D thesis, University of Texas at Austin, TX, 1995. [Ill Chiang, W. C., Russell, R.A., “A Reactive Tabu Search Metaheuristic for the Vehicle Routing Problem with Time Windows”, INFORMS Journal on Computing, 9:41 7-430, 1997. [I21 Clarke, G., Wright, J. V., ”Scheduling of Vehicles from the Central Depot to a numberof delivery points”, Operation Research, 12568-581, 1964. [I31 Cordeau, J.F., Laporte, G., Mercier, A., “A unified Tabu Search for Vehicle Routing Problem with Time Windows”. Technical Report CRT-00-03, Centre for Research on Transportation, Montreal, Canada, 2000. [14] Cordeau, J.F., Laporte, G., Mercier, A,, “Improved Tabu Search Algorithm for the handling of route duration constraints in Vehicle Routing Problems with Time Windows”, Journal of the Operational Research Society, 55(5):542-546,2004 [I51 Duhamel, C., Potvin, J.Y., Rousseau, J.M., ”A Tabu Search Heuristic for the Vehicle Routing Problem with Backhauls and Time Windows”, Transportation Science 3 1 : 49-59, 1999. [16] Fisher, M. L.,” Optimal Solution of Vehicle Routing Problems Using Minimum Ktrees“, Operations Research 42:626-642, 1994. [17] Gambardella, L. M., Taillard, E., Agazzi, G.,” MACS-VRPTW: A Multiple Ant Colony System for Vehicle Routing Problems with Time Windows”, New Ideas in Optimization, McGraw-Hill, 1999.



 444 [I81 Garcia, B.L, Potvin, J.Y.,Rousseau, J.M, ‘‘ A Parallel Implementation of the Tabu Search Heuristic for Vehicle routing Problem with Time Window constraints”, Computers & Operations Research 2 1, 1025-1033, 1994. [I91 Gehring, H., Homberger, J., “Parallelization of a Two-Phase Metaheuritic for Routing Problem with Time Windows”, Asia-Pacific Journal of Operation Research 18,35-47, 2001. [20] Homberger, J., Gehring, H., “Two Evolutionary Meta Heuristics for the Vehicle Routing Problem with Time Windows”. INFORMS Journal on Computing, 37(3):297-3 18, 1999. [21] Homberger, J., Gehring, H., “A Two-Phase Hybrid Metaheuristic for the Vehicle Routing Problem with Time Windows”, Working Paper, Department of Economics, University of Hagen, Germany, 2001. [22] Jung, S., Moon, B. R., “ A Hybrid Genetic Algorithm for the Vehicle Routing Problem with Time Windows”, In Proceddings of Genetic and Evolutionary Computation Conference, San Franciso, Morgan Kaufmann Publishers : 1309-13 16, 2002 [23] Kohl, N., Desrosiers, J., Madsen, O.B., Solomon, M.M., Soumis, F.,” 2-Path Cuts for the Vehicle Routing Problem with Time Windows,” Transportation Science, 33: 101 -1 16,1999. [24] Lim, A,, Wang, F.,” A Smoothed Dynamic Tabu Search Embedded GRASP for mVRPTW”, ICTAI, 00: 704-708,2004, [25] Mester, D. “An Evolutionary Strategies Algorithm for Large Scale Vehicle Routing Problem with Capacity and Time Window Restrictions”, working paper, Institute of Evolution, University of Haifa, Israel, 2002. [26] Mester, D., Braysy, O.,“ Active Guided Evolution Strategies for Large Scale Vehicle Routing Problem with Time Window”, to appear in Computers & Operation Research, 2003. [27] Or, I., “Traveling Salesman-type Combinatorial Optimization Problems and their relation to the logistics of regional blood banking”, Ph.D. dissertation, Department of Industrial Engineering and Management Sciences, Northwestern University, Evanston, IL, 1976. [28] Osman, I. H., “Metastrategy Simulated Annealing and Tabu Search Algorithms for the Vehicle Routing Problem”, Annals of Operation Research, 4 1:421-45 1 , 1993 [29] Potvin, J., Rousseau, J.M.,”An Exchange Heuristic for Routing Problems with Time Window”, Journal of Operational Research Society, 46: 1433- 1446, 1995. [30] Potvin, J., Bengio. S. “The Vehicle Routing Problem with Time Windows part 11: Genetic Search. INFORMS Journal on Computing, 8(2):165-172, 1996. [3 I] Potvin, J., Kervahut, T., Garcia, B., Rousseau, J.M., “The Vehicle Routing Problem with Time Windows - Part I: Tabu Search”, INFORMS Journal on Computing, 8:158-164, 1996. [32] Russell, R.A., “Hybrid Heuristics for the Vehicle Routing Problem with Time Windows”. Transportation Science, 29: 156-166, 1995. [33] Shaw, P., “Using constraint Programming and Local Search Methods to solve Vehicle Routing Problems”, Principles and Practice of Constraint Programming, Springer-Verlag :417-431 , 1998.



 445 [34] Solomon, M.M.,” Algorithms for Vehicle Routing and Scheduling Problems with Time Window [35] Constraints”, Operations Research 35,254-265, 1987. [36] Taillard, E.D., Badeau, P., Gendreau, M., Guertin, F., Potvin, J.,“ A Tabu Search Heuristics for the Vehicle Routing Problem with Soft Time Windows”. Transportation Science, 3 1 :170-186, 1997. [37] Tan, K. C., Lee, L. H., Ou, K., “Hybrid Genetic Algorithms in Solving Vehicle Routing Problems with Time Window Constraint”, Asia-Pacific Journal of Opeartion Research, 18:121-130,2001. [38] Thangiah, S., Osman, I. H.,Sun, T.,“ Hybrid Genetic Algorithm, Simulated Annealing and Tabu Search Methods for Vehicle Routing Problems with Time Windows”, Technical Report UKC/OR94/4, Institute of Mathematics and Statistics, University of Kent, Canterbury, UK, 1994. [39] Thangiah, S.,“ Vehicle Routing with Time Windows using Genetic Algorithms”, In Application Handbook of Genetic Algorithms: New Frontiers, volume 11, pp: 253277, CRC Press, Boca Raton, 1995. [40] Toth, P., Vigo, D.,“ The Vehicle Routing Problem, Monographs of Discrete Mathematics and Applications”, S.I.A.M,Philadelphia, PA, 2002. [41] Wee Kit, H., Chin, J., Lim, A,, “A hybrid Search Algorithm for the Vehicle Routing Problem with Time Windows”, International Journal on Artificial Intelligence Tools 10,43 I-449,200 I .



 TRANSPARENT REMOTE EXECUTION OF PROCESSES IN CLUSTER OF HETEROGENEOUS UNIX SYSTEMS* MANISH ARYAL Department of Electronics and Computer Kathmandu Engineering College aryalmanish@gmail. corn ADESH KHADKA Department Electronics and Computer Kathmandu Engineering College adesh.khadka@gmail. corn ABSTRACT In this paper, we propose a protocol for load sharing among a cluster of heterogeneous Unix workstations. Our protocol, called the Distributed Process Management Protocol (DPMP), not only enables load sharing using nonpreemptive process migration but also seamlessly integrates the processes running on a network of machines. Remote processes can be accessed (for signalling, for example), in the same way as local processes making process migration highly transparent to the users and the applications. DPMP also has builtin mechanisms to detect and recover from node and network failures. DPMP can be implemented at either the kernel or the user level. We also describe an implementation of DPMP within the Linux kernel. Preliminary performance studies show that the performance gains obtained by using DPMP are substantial.



1. Introduction



A typical network with several personal workstations has enormous aggregate computing potential. However most of this potential is not realized since a majority of the workstations are idle at any given point in time [3]. On the other hand, some users find their own workstations inadequate for their computing requirements. Using the CPU cycles of idle or lightly loaded workstations is thus an attractive method to increase the CPU utilization as well as to satisfy the



'The authors are students at the Department of Electronics and Computer Engineering, Institute of Engineering, Pulchowk Campus, Nepal.



446



 447



needs of heavy users. Process migration has been widely proposed as a mechanism to achieve load sharing among workstations. Process migration can be preemptive or non-preemptive. In nonpreemptive process migration, a job, when initiated, can be migrated to some other machine on the network where it executes to completion. Thus, in a Unix system, a process is considered for migration only when it calls the exec system call. Preemptive process migration, on the other hand, can cause the execution site of a job to change even during its execution. The main advantage of non-preemptive migration over preemptive migration is that it is simpler to implement and can accommodate heterogeneity of architectures and operating systems among the machines comprising the distributed system, while retaining most of the performance benefits of preemptive process migration. The ability to share processing load among heterogeneous machines is important since most networks today comprise machines with several different architectures and operating systems. 1.1. Related Work



The existing systems that allow process migration can be classified into the following categories: (a) systems for preemptive migration among a cluster of machines with the same architecture and running the same operating system, (b) systems for preemptive migration among a cluster of heterogeneous machines, and (c) systems for nonpreemptive process migration. A number of preemptive process migration systems for a cluster of homogeneous machines have been implemented. Some of them are based on experimental operating systems such as Charlotte [l], V [15], Sprite [3, 41, and Locus [ 111. Some systems such as Mosix [2] implement preemptive process migration by modifying existing operating systems. Some user level process migration systems have also been developed, such as Condor [7], GLUnix [6] and the systems described in References [5, 8, 101. Although user level implementations are quite portable, they cannot offer full transparency because of lack of access to kernel data structures related to a process. For example, they typically require applications to be relinked with a special library that intercepts system calls and attempts to duplicate the process state usually available only in the kernel, at the user level. The relinking is however not necessary in systems that support dynamically linked libraries. Complete Unix semantics are usually not preserved in user level implementations. For example, in Condor, a migrated process cannot create child processes, or handle signals. Also user level implementations are likely to be slower than kernel implementations. A common drawback of all these preemptive process migration systems is that



 448



they do not support heterogeneity, that is, a process cannot be migrated from a machine to a machine with a different architecture. Tui [14] is a system that allows preemptive migration of a process in a heterogeneous environment. Tui uses a special compiler that inserts type information in the binary. This information is used to extract a machine independent snapshot of the process state at the time of migration. This implies that the application important since most networks today comprise machines with several different architectures and operating systems. cannot use any typeunsafe language features and thus most existing applications cannot be migrated using Tui. Systems that implement non-preemptive process migration include Solaris MC [13], OSF/l AD TNC [16] and Utopia [17]. OSF/1 AD TNC extends the OSF/l operating system for a message passing based multi-computer and is more suitable for message passing based multiprocessor systems than for a network of relatively autonomous workstations. Solaris MC adds nonpreemptive process migration and distributed process management facilities to Solaris by adding a layer of distributed objects on top of the existing Solaris code. The main drawback of Solaris MC and OSF/l AD TNC are that they assume the same operating system to be running on all machines in the cluster. Utopia is a user level implementation of remote execution. The process migration facility in Utopia is not fully transparent. For example, some applications (such as the shell) need to be rewritten to take advantage of load sharing. There are some breaches in the Unix semantics also; distributed process groups are not supported for example. Failure detection and recovery is a neglected area in most systems for process migration. For example, in many process migration systems, such as Condor, Sprite, and Mosix, a process has residual dependencies on its originating host even after it has been migrated to another machine. This is because some of the system calls made by the migrated process are forwarded to its originating machine for execution. This means that if the originating machine goes down, a process may not be able to continue to run even if it had earlier migrated to another machine. Most systems do not consider node failures and recovery from these failures at all (Locus, Solaris MC, and GLUnix are some of the exceptions). 1.2. Design Goals and Approach



Our goal was to build a non-preemptive process migration system for a network of workstations with heterogeneous architectures and running possibly different flavors of Unix, with the following features.



 449



Transparency The system should be able to non-preemptively migrate processes with a high degree of transparency i.e., the process should get the same environment as it would have on its originating machine. Existing applications should be able to take advantage of migration without any changes and without recompilation. Heterogeneity The system should be able to accommodate heterogeneity in both architectures and operating systems of the workstations that share their loads. Separation of Mechanisms and Policies The mechanisms for process migration should be independent of the policies for process migration. Performance The protocol overhead should be as low as possible. Scalability The system should be able to scale up to a large number of nodes. The design should therefore avoid centralized components, broadcast messages etc. Fault Tolerance Partial failures (node and network failures) should not affect the working of the rest of the system. In particular, failure of the originating node of a job should not affect its execution if it is executing on another node. After the failed components are restored to working order, they should be able to re-integrate with the rest of the system. Our basic approach to allowing operating system heterogeneity among the nodes that share their load is to define a protocol for load sharing. We call our protocol the Distributed Process Management Protocol (DPMP). The protocol not only enables non-preemptive process migration but also allows management of processes in a distributed manner (for example, to send signals to remote processes, notifying death of a process to its parent process etc.). DPMP defines a set of messages that are exchanged by the machines in the cluster, and a set of conventions that all machines in the cluster must follow so that migrated processes can run in an environment very similar to the one that they would have found on their originating machines. DPMP also has mechanisms that aid in failure detection and recovery and allow correct operation while a part of the system is down. Since the semantics of non-Unix like operating systems such as Windows NT differ widely from those of Unix like operating systems, the protocol focuses only on Unix variants. However it does not assume any particular flavor of Unix and thus permits process migration across machines that may differ in either or both of architecture and operating system. DPMP does not address any policy issues regarding process migration. Thus it has no



 450



messages that can be used to exchange load information among machines. This decision cleanly separates migration mechanisms from migration policies. Any load sharing policy and any protocol to exchange load information can thus be used in conjunction with DPMP. DPMP may be implemented either inside the kernel, or at the user level by using a library that intercepts system calls. A kernel implementation is likely to be more efficient than a user level implementation. A user level implementation would be more portable but may also entail some loss in transparency. For example, statically linked programs will need to be relinked with the system call interception library. Note that it is also possible that some machines in the cluster implement DPMP in their kernels while others implement it at a user level. Currently we have only implemented the protocol inside the Linux kernel. The chief characteristics of our approach that distinguish it from other systems for process migration are its high degree of transparency, support for architectural and operating system heterogeneity, support for detection of and recovery from failures and correct operation under partial failures. Another feature of our approach is the focus on the protocol rather than on the implementation. DPMP can be efficiently implemented at the kernel level, and can also be implemented at the user level with some loss in performance. The rest of the paper is organized as follows. In the following two sections, we describe the DPMP conventions and the DPMP messages respectively. Section 4 describes the DPMP mechanisms for failure detection and recovery. In Section 5, we describe the implementation of DPMP within the Linux kernel. Section 6 describes the performance evaluation of the implementation. Section 7 concludes the paper. 2. DPMP Conventions



In order that a migrated process sees approximately the same environment on the machine where it finally executes as the one it would have got on its originating host, DPMP requires all hosts in a load sharing cluster to adhere to certain conventions. In this section, we describe these conventions. 2.1. Uniform File System View Clearly a uniform file system view is important for process migration to work transparently. We use the standard NFS distributed file system [ 121 to achieve this view. However we do not insist that the file system view be absolutely identical on all machines. We believe that for most applications that would potentially benefit from process migration (for example, long running CPU



 451



and/or memory intensive applications, and parallel applications), it is enough that the user areas be accessible via the same paths on all the machines in the cluster. This has the added advantage that the heavily used files (such as system binaries, libraries, temporary areas etc.) can be local on each machine. An issue related to maintaining a uniform file system view is to also ensure that all the users and user groups have the same user and group identifiers all the machines in the cluster. The Network Information Service (NIS) [9], available on most Unix platforms, can be used to achieve this. 2.2. Process Identifiers In order to provide transparency to users, a process should retain its process identifier (pid) when it migrates. Thus processes should have pids that are unique within the cluster. We use a scheme similar to Locus and Sprite to achieve this. Each machine in the cluster is required to assign pids to processes in accordance with the following convention. The pid of a process has three parts - a cluster wide unique machine identifier (each machine in the cluster is given a unique machine identifier), a process identifier unique within the machine, and an epoch number that is incremented every time that the machine is rebooted. The epoch number is used to implement recovery after a node failure and its role is described later in Section 4. 3. DPMPMessages



DPMP is a UDP based request-reply protocol. Timeouts and retransmissions are used to handle lost messages. Authentication is currently done just by verifying that the sender node id included in the message is known to have the same IP address as the one from which this message has been sent and that the source port number is a privileged port. Support for more secure, encryption based, authentication mechanisms will be considered in future versions of the protocol. DPMP defines several messages for managing remote processes. In Section 3.1, we summarize the important DPMP messages. Proper handling of open files during process migration is important and involves tricky issues. We discuss these issues and our solutions separately in Section 3.2. 3.1. Message Types



The TRANSFER message is used to migrate a process that has just called exec to another machine. The state information for the process, such as information about open files, signal disposition, priority etc., are included in the message. DPMP allows a process to migrate at most once during its lifetime even if calls



 452



the exec system calls multiple times. This assumption considerably simplifies the protocol; for example, it is easy to locate the process given its pid since the id of the originating node can be extracted from the pid and the originating node always correctly knows the current execution site of the process. This decision does not significantly reduce the opportunities for load sharing since most Unix processes call exec at most once. The message LOCATE can be sent to the originating node of a process to determine its current execution site. The current execution site of a process can be queried for information about the process by sending it the PROC INFO message. A SIG PROC request is used to send a signal to a remote process while the SIG CHILD message is used to notify a remote parent process of the death of a child process. The CHANGE PARENT message is used to make a process change its parent. This is typically used when the parent process of a process exits and the process is inherited by the init process on its current execution site. Some other messages relate to obtaining and changing process group and session related information. The information about a process group is distributed among the originating site of the group leader process of the process group (this site is called the storage site for the group) and other hosts that have some processes in the group. The storage site of a group maintains the list of local processes belonging to the group and the list of other hosts that have some processes in the group. Other hosts just store the list of local processes belonging to the group. The GROUP INFO message can be sent to the storage site of a group to obtain information about its group members. The CHANGE PROCESS GROUP message is used to change the group of a remote process or to update the group information stored in the originating site of the group leader. The SIG PROC message can also be used to send a signal to all processes belonging to a specified group on a host. A special INIT message is used for failure detection and recovery. The INIT message and the issues related to failures are discussed in Section 4. To overcome the barrier of differences among various flavors of Unix, DPMP makes certain standardizations. For example, the same signal may have different numbers on different variants of Unix (as an example the ‘Stop’ signal is number 19 in Linux and number 17 in BSD4.4). Similarly, the range of process priorities is typically different in the various Unix flavors. To accommodate these differences, DPMP defines a standard set of signals and a standard priority range. While sending a signal, using the SIG PROC message, the sending host is required to



 453



convert the local signal number for the specified signal to the corresponding one defmed by DPMP. The receiving host converts this DPMP defined number to its own corresponding number for the same signal. Note that this does not require the native numbering of signals to change on any platform. Indeed doing that will break any existing applications that send or handle signals. Similarly, when a process is migrated, its base priority value is scaled before being sent to the destination host which scales it back to its own native priority range.



3.2. Transferring Open File Information When a process calls exec, it may already have some devices, files, sockets etc., open. These remain open after the process starts executing the new program. Thus if the process is to be migrated when it calls exec, information about its open file descriptors must also be sent to the destination host where the files should be reopened. Currently we can only migrate those processes that have only regular files open at the time of exec. To allow the process to reopen the files on its destination host, the information about open file descriptors, the names of files that they correspond to, the modes of opening and the offsets, is included as part of the TRANSFER message. The path of the current directory is also sent. For efficiency, before actually attempting to migrate a process a host may check that the process does not have any files open that will not be accessible on the destination machine (for example, files belonging to a local file system that is not exported through NFS). It is possible, that a file that is open in a process may be deleted by the same or some other process. In such a scenario, the process can continue to access the file. However, the migration of such a process will fail since the required file cannot be opened on the destination machine. The open file information may also, optionally, include the NFS file handle of the file, the root file handle of the file system to which the file belongs, and the IP address of the file server for each open file. This information can be used by the destination site to ensure that the reopened file is indeed the one that was opened by the process before migration, and not one, for example that was later created with the same name after the original was deleted. Another issue related to migration of open files is the possible sharing of the file offset by a parent and its child process. If the two processes are executing on different machines, this sharing is not possible to implement without significantly changing the NFS protocol. Fortunately, hardly any applications actually seem to use this feature of Unix file semantics and thus we ignore this issue altogether.



 454



4. Failure Detection and Recovery



Correct execution in the case of partial failures and recovery from failures was an important design goal of DPMP. In this section, we discuss the DPMP mechanisms that help in detecting and recovering from failures. These mechanisms ensure that (a) network failures are by and large transparent to users, and (b) in the case of node failures, the processes that were running on a failed node appear to the rest of the system as if they were killed. Since DPMP is a UDP based protocol, timeouts and retransmissions are used to take care of lost messages and transient network failures. Since retransmissions can cause the same request to be executed twice on the server and this is unacceptable for non-idempotent requests (for example, for-the TRANSFER request), the server is required to store the previously sent replies and can send the same reply without actually re-executing the request on receiving a duplicate request packet. This can be done efficiently since the recovery protocol employed on a host reboot (described below) ensures that the reply store does not need to survive host crashes, and thus need not be stored on permanent storage. Sequence numbers included in all request messages are used to detect duplicate requests. When a host fails, the processes that were running on it obviously fail. Also, the host loses its references to processes that originated on it but had migrated to other hosts before the crash. Similarly, it loses the membership information for the process groups and sessions for which it was the storage site. These groups and sessions might very well have included remote processes. Thus while the host is down, these processes, groups and sessions are inaccessible to the rest of the cluster i.e., the current execution sites of such processes and the list of members for such groups and sessions cannot be determined from their ids. After the host recovers from failure (i.e., when it reboots), it should attempt to recover this information. The INIT message is used to aid this recovery. DPMP requires that after booting, a host must first send an INIT message to another node, before sending or accepting any other DPMP request to/from that node. The receipt of an INIT message indicates that the sending host had failed and has now come up again. In response to the INIT message, a host sends information information about the global system state that is relevant to the recovering host. Let us assume that a host B receives an INIT message from host A. The reply of this message includes the list of those processes running on B that had originated on A. Host A uses this information to rebuild references for these processes. The reply also includes the list of groups that had originated on A and have some group



 455



members executing on B. Host A uses this information to rebuild its group tables. Apart from sending this reply, host B also assumes that those processes that had originated on B but had migrated to A have been killed and removes process references for such processes and notifies their parent processes. If a recovering host is not able to send the INIT message to all the other hosts in the cluster (due to temporary network failures) immediately after rebooting, it may resume its normal operation with an incomplete picture of the global system state. In particular, it may not be aware of processes that had earlier originated on it, had migrated to other hosts and are still running. It may thus allocate the pid of such a process to a new process, causing two processes with the same pids to exist in the system. To avoid this problem, epoch numbers are used. Each host maintains its epoch number which is simply a counter that is incremented on each reboot. The epoch number is included in all newly generated pids. This avoids the assignment of the pid of a process that is already running somewhere in the cluster to a new process. 5. A Kernel Implementation of DPMP



In this section, we describe our implementation of DPMP. We have currently implemented DPMP inside the Linux 2.2.9 kernel. The implementation architecture attempts to minimize changes to the existing kernel and adds modules on top of the existing kernel code. These modules are: the Globalization layer, the DPMP client, the DPMP server, and the Policy module. Each of the modules exports a well defined interface to the other modules. The modules use the existing Linux kernel, to which some well defined modifications are made, for accessing and manipulating local processes. We next describe these modules briefly. The Globalization Layer provides a DPMP aware implementation of the process related system calls. For such system calls, the corresponding hnction in the globalization layer determines whether a remote or a local process is involved. In the former case, it invokes a function in the DPMP client module to perform the operation on the remote process, otherwise it invokes a function of the old kernel code (typically a system call entry point in the old kernel) to perform the operation on a local process. The DPMP server receives DPMP requests from the other hosts in the cluster and performs the requested operations. As mentioned earlier, it also maintains a store of previously sent replies so that duplicate requests can be replied to without re-executing the requested operation.



 456



The DPMP client module exports functions to perform various operations on remote processes. The functions in this module thus hide the protocol level details of accessing remote processes from the globalization layer. Finally the policy module is intended to make policy decisions regarding whether and where a process should be migrated, and whether an incoming process should be accepted or not. In the current implementation, the migration policy is to simply execute an eligible process to a randomly chosen host. 6. Conclusion In this paper, we have proposed a protocol based approach for load sharing among heterogeneous UNIX machines using non-preemptive process migration. The proposed Distributed Process Management Protocol allows non-preemptive migration of processes to machines with possibly different architecture and running different flavors of UNIX. The migration is transparent to the application programs and the migrated processes can be accessed just like local processes. DPMP also incorporates mechanisms to detect and recover from network and node failures. DPMP can be implemented either at kernel or at the user level. We have also described the current implementation of DPMP inside the Linux kernel. Performance measurements show that reasonable speedups can be achieved even with the current unoptimized implementation. The DPMP project is still in a relatively early stage. Currently we are working on the design of a protocol for exchanging load information and using it to implement more realistic load sharing policies. Ways to improve the efficiency of DPMP and its implementation are also being studied. We are also exploring extending NFS to allow transparent access to remote devices so that processes with open devices can also be migrated. DPMP extensions to allow migration of processes with open pipes are also being considered. In the near future, we also plan to develop a portable, user-level implementation of DPMP. References [I] Y. Astsy and R. Finkel. "Designing a process migration facility: the Charlotte experience". IEEE Computer, 22(9):47-56, September 1989. [2] A. Barak and 0. La'adan. "The MOSIX multicomputer operating system for high performance cluster computing". Journal of Future generation Computer Systems, 13(4-5):361-372, March 1998. [3] Douglis. Transparent Process Migration in the Sprite Operating System. Ph.D. dissertation, Computer Science Division, Dept. of Electrical Engg. and Computer Sciences, University of California, Berkeley, Sept. 1990.



 457 [4] F. Douglis and J. Ousterhout. “Transparent process migration: design alternatives and the Sprite implementation”. Software: Practice & Experience, 2 1(8):757-785, August 1991. [5] D. Freedman. “Experience building a process migration subsystem for UNIX’. In Usenix Technical Conference Proceedings, pages 349-356, Dallas, TX, January 1991. [6] D. P. Ghormley, D. Petrou, S. H. Rodrigues, A. M. Vahdat, and T. E. Anderson. Glunix: A global layer unix for a network of workstations. Technical report, Computer Science Division, University of California, Berkeley, August 1997. [7] M. Litzkow, M. Livny, and M. Mutka. “Condor - a hunter f idle workstations”. In Proc. 8th International Conference on Distributed Computer Systems, 1988. [8] K. I. Mandelberg and V. S. Sunderam. “Process migration in UNIX networks”. In Usenix Technical Conference Proceedings, pages 357-363, Dallas, TX, February 1988. [9] Sun microsystems : Network configuration manual. [lo] S. Petri and H. Langend’orfer. “Load balancing and fault tolerane in workstation clusters migrating groups of communicating processes”. Operating Systems Review, 29(4):25-36, October 1995. [ l 11 G. J. Popek and B. J. Walker. The LOCUS Distributed System Architecture. Computer Systems Series, The MIT Press,1985. [12] R. Sandberg, D. Goldberg, S. Klliman, D. Walsh, and B. Lyon. “Design and implementation of the Sun network filesystem”. In Usenix Technical Conference Proceedings, pages1 19-131, June 1985. [ 131 K. Shirriff. “Building distributed process management on anobject-oriented framework”. In Usenix Technical Conference Proceedings, pages 119-13 1, Anaheim, CA, January 1997. [14] P. Smith and N. C. Hutchinson. “Heterogeneous process migration: The Tui system”. Sofhvare: Practice & Experience, 28(6):611439, May 1998. [15] M. Theimer, K. Lantz, and D. Cheriton. “Preemptive remote execution facilities for the V system”. In Proc. 1 Oth ACM Symp. on Operating System Principles, pages 212, December 1985. [16] R. Zajcew, P. Roy, D. Black, C. Peak, P. Guedes, B. Kemp, J. LoVerso, M. Lebensperger, M. Barnett, F. Rabii, and D. Netterwala. “An OSFil Unix for Massively Parallel Multicomputer”. In Usenix Technical Conference Proceedings, pages 449468, San Diego, CA, 1993. [17] S. Zhou, X. Zheng, J. Wang, and P. Delisle. “Utopia: a load sharing facility for large, heterogenous distributed computer systems”. Software: Practice & Experience, 23( 12):1305-1336, December 1993. in UNIX networks”. In Usenix Technical Conference Proceedings, pages 357-363, Dallas, TX, February 1



 DISTRIBUTED DYNAMIC SCHEDULING -AN EXPERIMENT WITH PVM Dr S Ramachandram [email protected] Jayesh Aurangabadkar [email protected] Dept of CSE, University College of Engg,Osmania University,Hyderabad-7 Abstract The communication technology has made possible the cluster and grid computing. PVM (Parallel virtual Machine) a layer on linux operating system, provides cluster computing environment in a Local Area network. The scheduling used by PVM does not make use of any state information of individual participating nodes and as a result its performance is sub-optimal. The work envisages devising a dynamic scheduling strategy that improves the performance in terms of turnaround time and speedup. The influence of various static and dynamic parameters is studied through experiments and a goodness value is derived to measure the load at the participating nodes. The goodness value computed is used to schedule the tasks on the nodes. The performance measured in terms of turnaround time and speedup is compared with the existing PVM scheduler. 1. Introduction Parallel Programming uses multiple computers, or computers with multiple internal processors, to solve a problem at a greater computational speed than using a single computer. It also offers the opportunity to tackle larger problems; that is, problems with high computation or large memory requirements.. The computers must be interconnected by a network and a software environment must be present for inter computer message passing 2. PVM Computing model PVM is a software system that enables a collection of heterogeneous computers to be used as a coherent and flexible concurrent computational resource. The overall objective of the PVM system is to enable collection of computers to be used cooperatively for concurrent or parallel computation. The individual computers may be shared or local-memory multiprocessors, vector supercomputers, specialized graphics engines, or scalar workstations that may be interconnected by a variety of networks, such as Ethernet, FDDI. The PVM computing model is based on the notion that an application consists of several tasks. Each task is responsible for part of the application's computational workload. 3 Limitations of Parallel Virtual Machine The message passing mechanism of PVM is not optimized for any particular architecture/ operating system and use of sub-optimal mechanisms leads to considerable degradation of system performance. PVM daemons are user processes from the viewpoint of the underlying operating system, there is a possibility that the PVM daemon might crash due to the lack of memory. Even when two communicating process are executing on the same machine process communication is through the uses of sockets, which can be replaced with more efficient mechanism.. There is no support for scheduling of users process in the PVM systems. This lack of a scheduling policy makes it impossible to achieve time 458



 459



parallelism among PVM tasks, thereby decreasing the performance of a parallel application running in the Virtual machine.. 4. Dynamic Load Distribution Load distribution seeks to improve the performance of distributed system usually in terms of response time or resource availability, by allocating workload among a set of co-operating hosts. The division of system load can take place statically or dynamically. Load distribution algorithms can be broadly characterized as: Static, Dynamic, and Adaptive Dynamic load distribution algorithms use system state information at least in part to make load distribution decisions, while static algorithms make no use of such information. Adaptive load distribution algorithms are a special case of dynamic load distribution algorithms, in that they adapt their activities by dynamically changing the parameters of the algorithm to suit the system changing state.. Load distribution schemes are divided into policies and mechanisms. Policy is the set of choices that are made to dstribute the load. Mechanism carries out the physical distribution of load and provides any information required by policies. The components of a dynamic load distribution algorithm are: Participation policy, Location Selection policy, and Candidate Selection policy. The mechanisms that are required to implement a dynamic load distribution algorithm are: Load Metric mechanism, Load Information mechanism, and Transfer mechanism 5. Enhanced PVM scheduler We intend to implement a scheduler which considers the static and dynamic factors which could effect the scheduling decision. The key components of the scheduler as described in section 4 have been implemented as follows. Participation Po1icy:All nodes which satisfy the task requirements as indicated by the flags parameter are selected. Apart from this the nodes whose load exceed a threshold value are excluded from the scheduling algorithm. Location Po1icy:This is ignored as the existing PVM system has well defined mechanisms to handle heterogeneity, and explicit location selection. Candidate Po1icy:This policy is also present in the exiting system. All task-spawn requests which do not have explicit choices as of the host on which they have to be spawned, become “candidates” for distribution in the load distribution algorithm. Load Metric mechanism:The load metric mechanism will be discussed in detail in the following section. Load Communication Mechanism:Load communication is achieved in the following manner. Load information collected at a node as determined by the “load metric policy” is to be distributed to every other node in the system. On receiving load from a remote node, a node may choose to update its estimate of the load on the remote machine in one of the three following ways. i) Use a history factor to update the average load info about the remote node. ii)Accept the load information as it is, ( same as (i) with history factor 0 )



 460



iii) Reject load information as receiver has better estimate. ( e.g. immediately after spawning a task to a remote node, the load info received from that node could be inconsistent. ) The value to be assigned to the “history factor” of a node would depend on the kind of activities the node is utilized for. Transfer Mechanism:The existing mechanism is used with slight modifications to include additional task parameters. The Load Metric Mechanism:The aim of the load metric mechanism is to find an workload descriptor which is representative of the processing power of a given node. It should be noted that the objective of the scheduling algorithm is to minimize the turnaround time of an application using the PVM system and not to achieve load balancing across the system. The assumptions made are: No priori knowledge about the resource requirement of incoming tasks. 0 Turnaround time of the task as the performance criteria. A goodness value is assigned to each node which reflects the number of tasks that can be scheduled on the node to minimize the turnaround time of the application. This goodness value is in fact a combination of the static processing power of the load as well as the dynamic workload on it. The effects of the following factors that could influence the ‘goodness’ value on a load was analyzed. Static factors: CPU clock speed, Size of Random Accesses Memory (RAM), MIPS Rating of the node ( Performance of FPU), Number of CPUs per node. Dynamic Factors: Number of running processes., Priorities of running processes, Amount of free RAM However, effects of some important factors which were ignored are:Speed of various IIO channels,Cache size and processor cache coupling. CPU clock speed :The results obtained below as shown in fig 1, indicate that CPU clock speed could be used as a measure of system performance provided all other factors are similar. mu .I



~



r



n



.



I



.



I



r



n



.



[Fig 1. CPU Speed vs. turn-around time] [Fig 2. No of running processes vs Execution time] Number of running processes: The effect of number of processes in run state (run queue length) having same priority as the benchmarking program is found is shown in fig 2. Our results show that run-queue length linearly effects the turn around time of an application on the node. Processor Utilization (Level of Idleness): The effect of the idleness on job execution time is shown below in Figure 3. Our results show that the turn around



 461



time of an application on a given node varies almost linearly with processor utilization (or inversely with the idle ~ s ugfawu-nul~;.-mcmim~ l process execution). m



v=



Fig 3. [Level of CPU Idleness Vs Execution time1



f-



!= Em



N u ~ b e rof CPUs per node: We have assumed that the numbers of CPUs result in a linear speedup, because we expect CPU bound processes with very limited UO to form the bulk of the job mix input to the scheduler. As a result of our study we boil down to CPU ME'S rating, number of processor on the node, run queue length, priorities of the processes in the run queue as factors, which determine the goodness value of the node. Each of the factors would be assigned weights in order to predict the turn around time of the node at given load cond :When the free FL4M of the system falls below a certain Size experiences high sapping activity which effects the turn thres around time of the benchmark. Hence we consider the effect of RAM only when the he system fall below a certain threshold value. free 6. T ler Algorithm The algorithm is incorporated into existing PVM by adding dynamic scheduling component. The scheduler component is structured as client server, the master PVM acting as client and the participating nodes as the servers. infor~ationmechanism -Data $tructures used LoadInfo // Free RAMon the node Integer FreeRam Integer MPSrating // Mips rating determined by whetstone test // 1 min load maintained by Linux kernel Integer AvgLoad Integer noCPUs // No. of CPUs on the node Integer RunQueueLength // No Of running processes on the node // Priorities of running processes Integer Priorities(] Load information mechanism OperaEions 0 ~eE~oadInfo Called by SERVER in the inJinite loop when CLIENT connects. INPUT: None OUPUT: Nodevalue,RunQueLength , noCPUs , MPSrating BEGIN Constants used Integer RamThresh //threshold value of EreeRam lnteger LoadThresh //threshold value for excess load //Avg loadper host itz system Float AvgLoad Float RamFactor //NodeVal inhibitor when Ram threshold Valueis reached //Integer no CPUs this can be obtainedfrom /proc/cpuinfo$le



H-



"4



Mdubep



-



 462



~ E ~ I N Call S y ~ n s o ~ Call GetNoRunning~ Call ~ e t C P ~ I d l e ~ Float Nodevalue // Initial~zation Nodevalue = noCPUs ~ ~ P S r a t i n g I . 0 // Init~alizationends rocess in Run~ueueLength ity o s ~ n n ~process ng



r +weight(prio, temp) DONE odev value



= Nodevalue



/factor ; m ~ r e s h//Free ) ram less than threshold devalue /RamFactor //decrement n ode value byfactor



ENDIF >Avgload ~ L o a d ~ h r e s h ) / / ~>o savg t sys load by threshold 0 //exclude h o s t ~ o m partic~~tion ENDI~ END The above firnction implements the load metric and participation policies of scheduler. The function weig i o r i ~ x) , makes use of the constants to return the factor by which the No of the node has to be decremented for each ority x. e: Indicates how the maxirnum number of processes that are allowed to run on a node more than the average system load. Each node has its o reshold value. This value has no significance for the slower nodes of the system. t for the most powerful nodes should have values which indicate their relative average load in the system.^ ~ d superiori~( in ~ a r o time) in the system.A high threshold low value of threshold -1 .O ensure g power exceeds the average value has to be used for nodes processing power of the system by a large margin.The enhanced distributed scheduler was tested using a parallel matrix multiplication program developed for *~ l ~ p l i the ~ ~ purpose. " Fig4



Fig5



i 0



IW



2M



3w



4w



sm sw 7w am am



Order of the matrix



 463



Performance of enhanced scheduler: To measure the performance of the enhanced scheduler the following method was used. Background processes of varying priorities were made to run on the nodes of the cluster, following which the matrix multiplication program was run with 4 sub tasks to be spawned. Turnaround Time: The turnaround time of the application for one to three nodes in the cluster is shown in fig 4. Speedup: Speedup=T(M)/T(WS), where T(WS) is the turn around time obtained using the weighted scheduling and T(RR) is the turnaround time obtained using the round robin scheduler. The fig 5 , clearly indicates that the speed up is evident when the order of the matrix is grater than 200. For smaller sizes the greater communication overhead in sending and receiving load information by the daemons, inhibits any performance benefits achieved due to the load distribution. 7. Conclusions The Enhanced PVM scheduler using a weighted round-robin mechanism have shown improvement over the static scheduling mechanisms under high loads for the case of non interactive computational -intensive (CPU bound) parallel applications involving very limited YO. At lower loads, how ever the overhead of communicating load information across the cluster outweighs the performance gain.The scheduling mechanism could be extended to take into account I/O bound applications, as large number of current distributed applications involves database accesses at least to a certain extent. System parameters such as disk and device latencies, VO bandwidth, etc., will have to be studied to incorporate this facility. References [l] Geist , Beguelin, Dongarra, Jiang, Manchek, Sunderam, “A Users guide to PVM: Parallel Virtual Machine”. 1994. MIT Press. [2] David Browning, “EmbarrassinglyParallel Benchmark under PVM”, Computer Science Corp., NASA Ames Research Labs. [3] J.Dongara et a1 “ PVM- experiences, current status and Future Directions”, ORNL- Tenessee. [4] David HM Spector, “BuildingLinux Clusters” - 0-Reilly [5] Adam Beguelin et al, “PVMexperiences, current status and Future Direction ”, ORNL. [6] Dror G. Feiltson, “Scheduling Parallel Jobs On Clusters”, Institute of Computer Science, The Hebrew University of Jerusalem [7] Russel W Clarke, Benjamin TB Lee,” Cluster Operating System”, Monash University. [ 8 ] Thomas Kunz, “ The influence of different workload descriptions on a heuristic load balancing scheme”, Technical University Damstadh. [9] Singhal M., Shivaratri N.G, “Advanced concepts in Operating Systems”, McGraw-Hill,1994.


	        	    
    		
    		    
    		    
    		
		    			
    
		
	    
		
		Proceedings of the Conference on Applied Mathematics and Scientific Computing
	    
	
	
	    Read more
	
    
		    			
    
		
	    
		
		Proceedings of the Conference on Applied Mathematics and Scientific Computing
	    
	
	
	    Read more
	
    
		    			
    
		
	    
		
		Proceedings of the Conference on Applied Mathematics and Scientific Computing
	    
	
	
	    Read more
	
    
		    			
    
		
	    
		
		Advances in Computer Science and Information Technology. Computer Science and Information Technology: Second International Conference, CCSIT 2012, ... and Telecommunications Engineering)
	    
	
	
	    Read more
	
    
		    			
    
		
	    
		
		Advances in Computer, Information, and Systems Sciences, and Engineering: Proceedings of Ieta05, Tene05 and Eiae05
	    
	
	
	    Read more
	
    
		    			
    
		
	    
		
		Fuzzy Information and Engineering: Proceedings of the Second International Conference of Fuzzy Information and Engineering (ICFIE) (Advances in Intelligent and Soft Computing
	    
	
	
	    Read more
	
    
		    			
    
		
	    
		
		Proceedings of the European Computing Conference 2
	    
	
	
	    Read more
	
    
		    			
    
		
	    
		
		Advances in Engineering and Technology
	    
	
	
	    Read more
	
    
		    			
    
		
	    
		
		Agent Technologies and Web Engineering: Applications and Systems (Advances in Information Technology and Web Engineering Book)
	    
	
	
	    Read more
	
    
		    			
    
		
	    
		
		Advances in Computer and Information Sciences and Engineering
	    
	
	
	    Read more
	
    
		    			
    
		
	    
		
		Electrical Engineering and Applied Computing
	    
	
	
	    Read more
	
    
		    			
    
		
	    
		
		Applied Computing: Second Asian Applied Computing Conference, AACC 2004, Kathmandu, Nepal, October 29-31, 2004. Proceedings
	    
	
	
	    Read more
	
    
		    			
    
		
	    
		
		Proceedings of the 3rd European Conference on Computer Network Defense (Lecture Notes in Electrical Engineering, 30)
	    
	
	
	    Read more
	
    
		    			
    
		
	    
		
		Advances in Computing and Information Technology - ACITY 2011
	    
	
	
	    Read more
	
    
		    			
    
		
	    
		
		Electronic Engineering and Computing Technology
	    
	
	
	    Read more
	
    
		    			
    
		
	    
		
		History of Computing and Education 2 (HCE2): IFIP 19th World Computer Congress, WG 9.7, TC 9: History of Computing, Proceedings of the Second Conference ... Federation for Information Processing)
	    
	
	
	    Read more
	
    
		    			
    
		
	    
		
		O for Cluster Computing (Innovative Technology Series)
	    
	
	
	    Read more
	
    
		    			
    
		
	    
		
		Advances in Computer Science and Information Technology, Part I
	    
	
	
	    Read more
	
    
		    			
    
		
	    
		
		Web Engineering Advancements and Trends: Building New Dimensions of Information Technology (Advances in Information Technology and Web Engineering (Aitwe))
	    
	
	
	    Read more
	
    
		    			
    
		
	    
		
		Information Technology in Health Care 2007 - Proceedings of the 3rd International Conference on Information Technology in Health Care: Socio-technical ... Studies in Health Technology and Informatics
	    
	
	
	    Read more
	
    
		    			
    
		
	    
		
		Technology for Modelling: Electrical Analogies, Engineering Practice, and the Development of Analogue Computing (History of Computing)
	    
	
	
	    Read more
	
    
		    			
    
		
	    
		
		Trends in Computer Science, Engineering and Information Technology - CCSEIT 2011
	    
	
	
	    Read more
	
    
		    			
    
		
	    
		
		Applied computing, computer science, and advanced communication proceedings
	    
	
	
	    Read more
	
    
		    			
    
		
	    
		
		Engineering Applications of Neural Networks Part I (IFIP Advances in Information and Communication Technology)
	    
	
	
	    Read more
	
    
		    			
    
		
	    
		
		Communicating With Smart Objects: Developing Technology for Usable Persuasive Computing Systems (Innovative Technology Series. Information Systems and Networks)
	    
	
	
	    Read more
	
    
		    			
    
		
	    
		
		Computer and Computing Technologies in Agriculture II, Volume 3: The Second IFIP International Conference on Computer and Computing Technologies in Agriculture ... in Information and Communication Technology)
	    
	
	
	    Read more
	
    
		    			
    
		
	    
		
		Computer and Computing Technologies in Agriculture II: The Second IFIP International Conference on Computer and Computing Technologies in Agriculture ... in Information and Communication Technology)
	    
	
	
	    Read more
	
    
		    			
    
		
	    
		
		Soft Computing: Methodologies and Applications (Advances in Soft Computing) (Advances in Intelligent and Soft Computing)
	    
	
	
	    Read more
	
    
		    			
    
		
	    
		
		Practical applications of soft computing in engineering
	    
	
	
	    Read more
	
    
		    			
    
		
	    
		
		Practical Applications of Soft Computing in Engineering
	    
	
	
	    Read more


	
            
                
                    Recommend Documents
                
		
		    						    
    
	
	    
	
    
    
	
	    
		Proceedings of the Conference on Applied Mathematics and Scientific Computing	    
	
	
	    PROCEEDINGS OF THE CONFERENCE ON APPLIED MATHEMATICS AND SCIENTIFIC COMPUTING

 Proceedings of the Conference on Applied...
	
    
    
    
						    
    
	
	    
	
    
    
	
	    
		Proceedings of the Conference on Applied Mathematics and Scientific Computing	    
	
	
	    PROCEEDINGS OF THE CONFERENCE ON APPLIED MATHEMATICS AND SCIENTIFIC COMPUTING

 Proceedings of the Conference on Applie...
	
    
    
    
						    
    
	
	    
	
    
    
	
	    
		Proceedings of the Conference on Applied Mathematics and Scientific Computing	    
	
	
	    PROCEEDINGS OF THE CONFERENCE ON APPLIED MATHEMATICS AND SCIENTIFIC COMPUTING

 Proceedings of the Conference on Applied...
	
    
    
    
						    
    
	
	    
	
    
    
	
	    
		Advances in Computer Science and Information Technology. Computer Science and Information Technology: Second International Conference, CCSIT 2012, ... and Telecommunications Engineering)	    
	
	
	    Lecture Notes of the Institute for Computer Sciences, Social Informatics and Telecommunications Engineering Editorial Bo...
	
    
    
    
						    
    
	
	    
	
    
    
	
	    
		Advances in Computer, Information, and Systems Sciences, and Engineering: Proceedings of Ieta05, Tene05 and Eiae05	    
	
	
	    Advances in Computer, Information, and Systems Sciences, and Engineering

 Advances in Computer, Information, and Syste...
	
    
    
    
						    
    
	
	    
	
    
    
	
	    
		Fuzzy Information and Engineering: Proceedings of the Second International Conference of Fuzzy Information and Engineering (ICFIE) (Advances in Intelligent and Soft Computing	    
	
	
	    Advances in Soft Computing Editor-in-Chief: J. Kacprzyk

40

 Advances in Soft Computing Editor-in-Chief Prof. Janusz Ka...
	
    
    
    
						    
    
	
	    
	
    
    
	
	    
		Proceedings of the European Computing Conference 2	    
	
	
	    Table of Contents

Part 1: Neural networks and applications ............................................. 1 Chapter 1: H...
	
    
    
    
						    
    
	
	    
	
    
    
	
	    
		Advances in Engineering and Technology	    
	
	
	    ADVANCES IN E N G I N E E R I N G AND T E C H N O L O G Y

 This page intentionally left blank

 ADVANCES IN ENGINEERIN...
	
    
    
    
						    
    
	
	    
	
    
    
	
	    
		Agent Technologies and Web Engineering: Applications and Systems (Advances in Information Technology and Web Engineering Book)	    
	
	
	    Agent Technologies and Web Engineering: Applications and Systems Ghazi Alkhatib Applied Science University, Jordan Davi...
	
    
    
    
						    
    
	
	    
	
    
    
	
	    
		Advances in Computer and Information Sciences and Engineering	    
	
	
	    Advances in C omp uter and Inf ormation Sciences and Engineering

 A dvances in C omp uter and Inf ormation S ciences a...